Skip to content

Advice to reproduce for other languages / newer version of UD #3

Description

@OrianeN

Hello, I'm working on Occitan and it would be interesting to explore the evaluation of LLMs on an Occitan version of MultiBLIMP.

Unfortunately the Occitan treebank was only added officially to UD in v2.16, while your dataset was built from v2.15 😢.

Unless you are willing to build it in the next few days/weeks, I can try to use your scripts and the pipeline description from your paper. Any advice (small or large) you might have is welcome (order and usage of scripts, format of files, where to do manual spot checks...) !

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions