Hello, I'm working on Occitan and it would be interesting to explore the evaluation of LLMs on an Occitan version of MultiBLIMP.
Unfortunately the Occitan treebank was only added officially to UD in v2.16, while your dataset was built from v2.15 😢.
Unless you are willing to build it in the next few days/weeks, I can try to use your scripts and the pipeline description from your paper. Any advice (small or large) you might have is welcome (order and usage of scripts, format of files, where to do manual spot checks...) !
Hello, I'm working on Occitan and it would be interesting to explore the evaluation of LLMs on an Occitan version of MultiBLIMP.
Unfortunately the Occitan treebank was only added officially to UD in v2.16, while your dataset was built from v2.15 😢.
Unless you are willing to build it in the next few days/weeks, I can try to use your scripts and the pipeline description from your paper. Any advice (small or large) you might have is welcome (order and usage of scripts, format of files, where to do manual spot checks...) !