Using Fastai library to classify Twitter jokes in Spanish
Code associated with the 3rd place finish in F1 score.
- Create the conda environment with `conda env create -f environment.yml`
- Place the Twitter data in `./data/all_file.txt`
- Activate the environment with `source activate fastaiv1_dev`
- Start `jupyter notebook` in the home directory, go to the `LM Train in Notebook` notebook and run it
- Put `haha_2019_train.csv` and `haha_2019_test.csv` in the `./data/` directory
- Run the `Finetune LM` notebook
- `cd ./prod/`
- Run `./mult_seed_run_fwd_finetune.sh | tee --append out_fwd_1.txt`
- Run `./mult_seed_regr_finetune.sh | tee --append out_reg_1.txt`

Generate Submission entry:
- Run the `Ensemble 20 Seeds select best F1 0610.ipynb` notebook for the classification
- Run the `Ensemble 20 modesl select best MSE 0610.ipynb` notebook for the regression outputs on the test set
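
For readers who want a feel for the classification ensemble before opening the notebook, here is a minimal sketch of one common approach: average the positive-class probabilities from several seeds, then pick the decision threshold that maximizes F1 on held-out labels. This is an assumption about the general technique, not a copy of the notebook's logic. The function names and the toy numbers are hypothetical.

```python
from statistics import mean


def average_probabilities(per_seed_probs):
    """Average per-example positive-class probabilities across seeds."""
    return [mean(example) for example in zip(*per_seed_probs)]


def f1_score(y_true, y_pred):
    """Binary F1 for 0/1 labels; returns 0.0 when there are no true positives."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def best_threshold(y_true, probs, candidates):
    """Return (best F1, threshold) over the candidate thresholds."""
    scored = [
        (f1_score(y_true, [int(p >= t) for p in probs]), t)
        for t in candidates
    ]
    return max(scored)


# Hypothetical validation data: two seeds, four tweets.
val_labels = [1, 0, 1, 0]
seed_probs = [
    [0.9, 0.4, 0.7, 0.1],  # seed 1
    [0.7, 0.2, 0.5, 0.3],  # seed 2
]

avg_probs = average_probabilities(seed_probs)
best_f1, threshold = best_threshold(val_labels, avg_probs, [0.25, 0.5, 0.75])
print(f"best F1 = {best_f1:.3f} at threshold {threshold}")
```

The same averaging step would apply to the regression outputs, with mean squared error in place of F1 as the selection criterion.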
Note:
- The data lives in the `./data/` directory of this repo (but is not checked in.)
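
Since the data is not checked in, it can help to confirm the files are in place before starting the notebooks. The sketch below checks for the three file names mentioned in the steps above. The helper name `missing_data_files` is hypothetical and not part of this repo.

```python
from pathlib import Path

# File names taken from the setup steps in this README.
EXPECTED_FILES = ["all_file.txt", "haha_2019_train.csv", "haha_2019_test.csv"]


def missing_data_files(data_dir="./data"):
    """Return the expected file names that are not present in data_dir."""
    root = Path(data_dir)
    return [name for name in EXPECTED_FILES if not (root / name).is_file()]


missing = missing_data_files()
if missing:
    print("Missing from ./data/:", ", ".join(missing))
else:
    print("All expected data files found.")
```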