Skip to content

Korean POS tagging poor performance #20

@jdoerfler

Description

@jdoerfler

SpaCy is having trouble with POS tagging in Korean. When inserting a break point in the code and using the sample sentences from SpaCy documentation, the parser does perfectly, so this may be a bit more insidious of a bug.

The below example should be getting a number of different tokens but instead gets one big one.

실제적인효용은크않을지몰라도
Attributes for Token: 실제적인효용은크않을지몰라도
Tag: ncn+jp+ecs
POS: VERB

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions