UnicodeDecodeError with classifier.baseline()

This is a similar but different issue an another posted here.

```
$ python jnlp-test-sentencePolarityScore.py
Traceback (most recent call last):
  File "jnlp-test-sentencePolarityScore.py", line 9, in <module>
    print classifier.baseline(text)
  File "build/bdist.macosx-10.13-intel/egg/jNlp/jSentiments.py", line 56, in baseline
  File "build/bdist.macosx-10.13-intel/egg/jNlp/jSentiments.py", line 49, in polarScores_text
  File "build/bdist.macosx-10.13-intel/egg/jNlp/jTokenize.py", line 30, in jTokenize
  File "build/bdist.macosx-10.13-intel/egg/jNlp/jCabocha.py", line 27, in cabocha
UnicodeDecodeError: 'utf8' codec can't decode byte 0xb5 in position 105: invalid start byte
```
At first, I also had this same error with `classifier.train()`, but once I ran `- ./configure --with-charset=utf8` for the mecab dictionary and for cabocha, the error disappeared.

However, with `classifier.baseline()` the error remains.
Is there another part of the toolchain that I need to configure for utf-8?
Am I missing something really basic?

Thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

UnicodeDecodeError with classifier.baseline() #14

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

UnicodeDecodeError with classifier.baseline() #14

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions