GitHub - wxy-nlp/MultiTaskNAT

Implementation of the EMNLP 2022 paper "Helping the Weak Makes You Strong: Simple Multi-Task Learning Improves Non-Autoregressive Translators".

Overview

We propose a simple multi-task learning framework that adds auxiliary weak autoregressive (AR) decoders to make non-autoregressive (NAR) models stronger. The AR decoders are kept as weak as possible, so they cannot model the target sequence on their own unless the knowledge provided by the NAR decoder is sufficiently useful. The NAR model therefore has to become stronger to support the AR decoders.
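As a rough illustration of the idea above, the training objective can be thought of as the NAR loss plus a weighted contribution from the auxiliary AR decoders. The sketch below is only an assumption about how such a combination might look: the function name `multitask_loss`, the `ar_weight` parameter, and the averaging over AR decoders are hypothetical and are not taken from this repository.

```python
from typing import Sequence


def multitask_loss(nar_loss, ar_losses: Sequence, ar_weight: float = 1.0):
    """Combine a NAR loss with the losses of auxiliary weak AR decoders.

    Hypothetical helper: the weighting scheme and the averaging over
    AR decoders are illustrative assumptions, not the repository's code.
    Inputs can be plain floats or scalar tensors.
    """
    if len(ar_losses) == 0:
        # Without auxiliary decoders this reduces to plain NAR training.
        return nar_loss
    # Average the auxiliary losses so the scale does not grow with
    # the number of AR decoders attached to the NAR model.
    mean_ar_loss = sum(ar_losses) / len(ar_losses)
    return nar_loss + ar_weight * mean_ar_loss
```

For example, `multitask_loss(2.0, [1.0, 3.0], ar_weight=0.5)` adds half of the mean AR loss to the NAR loss. Because the AR terms are part of the total, gradients from the weak AR decoders flow back into whatever NAR representation they are conditioned on.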

Our framework is plug-and-play and model-agnostic, so you can easily add a new NAR model to it. The "multitasknat" folder contains examples for a CTC-based NAR model and a vanilla NAR model.

Requirements & Installations

Python >= 3.7
PyTorch >= 1.11.0

git clone https://github.com/wxy-nlp/MultiTaskNAT.git && cd MultiTaskNAT && pip install -e .

(Optional) We use the ctcdecode tool from parlance to support CTC beam search. You can install it with the following command.

cd ctcdecode && pip install .

P.S. For the installation to succeed, you need g++ >= 7.5.0.
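Before installing, it can help to confirm that the environment meets the minimum versions listed above. The following is a minimal sketch using only the standard library; the helper names `parse_version`, `meets_minimum`, and `check_environment` are hypothetical and not part of this repository.

```python
import re
import sys
from typing import Dict, Optional, Tuple


def parse_version(text: str) -> Tuple[int, ...]:
    """Extract the leading numeric part, e.g. '1.11.0+cu113' -> (1, 11, 0)."""
    match = re.match(r"\d+(?:\.\d+)*", text.strip())
    if match is None:
        raise ValueError(f"no version number found in {text!r}")
    return tuple(int(part) for part in match.group(0).split("."))


def meets_minimum(installed: str, minimum: str) -> bool:
    """Return True if `installed` is at least `minimum`."""
    a, b = parse_version(installed), parse_version(minimum)
    # Pad with zeros so that '3.7' and '3.7.0' compare as equal.
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b


def check_environment() -> Dict[str, Optional[bool]]:
    """Check Python and PyTorch against the minimums from this README."""
    python_version = ".".join(str(n) for n in sys.version_info[:3])
    result: Dict[str, Optional[bool]] = {
        "python": meets_minimum(python_version, "3.7"),
        "pytorch": None,  # stays None if PyTorch is not installed
    }
    try:
        import torch
        result["pytorch"] = meets_minimum(torch.__version__, "1.11.0")
    except ImportError:
        pass
    return result
```

Calling `check_environment()` returns a small dictionary of pass/fail flags. The same `meets_minimum` helper can be applied to a bare g++ version string to check it against 7.5.0.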

Training and Evaluating

The training and evaluation pipeline is integrated into a shell script in the "run" folder. Check the configuration and modify it to fit your environment if necessary. You can then run a command such as "sh run_wmt14.sh" to start training and evaluation.

