Creating semantic concept-feature norms using STRUDEL
Below is the process you need to run in order to contribute a language to the collection.
- Open up `process_strudels.Rmd` in the root (~/) folder. It is the same folder that contains this `README.md`.
- Select a language not currently completed. See the `Completed` column of `~/data/udpipe_languages.csv`.
- Select a number of sub processes. Each sub process takes one core and ~2GB of RAM.
- Run, NOT KNIT, everything. This will:
  - Download a new language file into the `~/data` folder.
  - Download a new language udpipe control file into the `~/` folder.
  - Split the language file into smaller files for parallel processing. This makes more files in the `~/data` folder.
  - Run each smaller file in its own process. This generates files in the `~/concept-feature` folder.
  - Combine the files into a single file.
- Upload the combined file to the releases in GitHub.
- Update the releases and `~/data/udpipe_languages.csv`, noting the progress.
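
The sizing rule above (one core and ~2GB of RAM per sub process) and the split step can be sketched as follows. This is an illustrative Python sketch, not code from `process_strudels.Rmd`, which may size and split its work differently. The function names `suggest_subprocesses` and `split_lines` and the example `ram_gb` value are hypothetical.

```python
import math
import os


def suggest_subprocesses(cores, ram_gb, gb_per_process=2.0):
    """Largest sub-process count that fits both the core and RAM budgets.

    Assumes one core and roughly `gb_per_process` GB of RAM per sub process,
    as stated in the steps above. Always returns at least 1.
    """
    by_cores = cores
    by_ram = math.floor(ram_gb / gb_per_process)
    return max(1, min(by_cores, by_ram))


def split_lines(lines, n_parts):
    """Split `lines` into at most `n_parts` contiguous, near-equal chunks.

    Hypothetical stand-in for splitting the language file into smaller files.
    """
    n_parts = max(1, min(n_parts, len(lines) or 1))
    size, extra = divmod(len(lines), n_parts)
    chunks, start = [], 0
    for i in range(n_parts):
        # The first `extra` chunks take one additional line each.
        end = start + size + (1 if i < extra else 0)
        chunks.append(lines[start:end])
        start = end
    return chunks


if __name__ == "__main__":
    cores = os.cpu_count() or 1  # os.cpu_count() can return None
    ram_gb = 16                  # assumption: replace with your machine's free RAM in GB
    n = suggest_subprocesses(cores, ram_gb)
    print(f"Suggested sub processes: {n}")
```

On a shared or interactive machine you may want to pass a smaller `cores` value so that the operating system and other work keep some headroom.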