Notes on tests #117

Open
LourensVeen wants to merge 1 commit into NLeSC:master from LourensVeen:tests-notes
Conversation

@LourensVeen

Member

See the inline comments

                  annotators=['tokenize', 'ssplit', 'pos', 'lemma'])
    saf = stanford_to_saf(lines)
    assert_equal({t['lemma'] for t in saf['tokens']},
                 {'Cesar', 'hit', 'Hovik'})

Member Author

Why is the order reversed here relative to the input?

Collaborator

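The {} on both sides of the assertion builds a set (a set comprehension on the left, a set literal on the right), so the order of the elements is irrelevant to the comparison. Judging by the stanford_to_saf output, the order shouldn't matter anyway, because each token carries an offset.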
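
To make the point concrete, here is a minimal sketch of both kinds of check. The token dictionaries are made up for illustration: the 'lemma' key comes from the test, but the 'offset' key name, the offset values and the input order are assumptions, not taken from the actual stanford_to_saf output.

```python
# Hypothetical SAF-like result; key name 'offset' and all values are assumed.
saf = {
    'tokens': [
        {'lemma': 'hit', 'offset': 6},
        {'lemma': 'Hovik', 'offset': 0},
        {'lemma': 'Cesar', 'offset': 10},
    ]
}

# Set comparison: equality ignores element order (and duplicates),
# so the order in which the expected lemmas are written does not matter.
lemmas = {t['lemma'] for t in saf['tokens']}
assert lemmas == {'Cesar', 'hit', 'Hovik'}
assert lemmas == {'Hovik', 'hit', 'Cesar'}

# If the test should also pin down token order, sort by offset
# and compare lists instead of sets.
ordered = [t['lemma'] for t in sorted(saf['tokens'], key=lambda t: t['offset'])]
assert ordered == ['Hovik', 'hit', 'Cesar']
```

The set form is the looser check: it also passes if a lemma appears twice. Comparing offset-sorted lists would catch that, at the cost of tying the test to the token order.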
