MLoRQ: Bridging Low-Rank and Quantization for Effective Transformers Compression

PyTorch implementation of MLoRQ. MLoRQ is a compression method for transformers-based networks. It utilizes joint low-rank and quantization optimization for effective compression.

Models

We provide a large set of pre-trained models to compress. The models are based on their implementation in the timm.

The names of the available models can be found under models_config.json. It includes the following models:

Model	Model usage name
ViT-Small	vit_s
ViT-Base	vit_b
DeiT-Tiny	deit_t
Deit-Small	deit_s
DeiT-Base	deit_b
Swin-Small	swin_s
Swin-Base	swin_b

Setup

pip install -r requirements.txt

Usage

MLoRQ

python main.py --model_name deit_s --weight_n_bits 3 --activation_n_bits 4 --train_data_path <path_to_training_dataset> --val_data_path <path_to_validation_dataset>

This example would execute MLoRQ to compress DieT-S with 3 bits for weights and 4 bits for activations.

MLoRQ with Activation Mixed precision quantization

We also enable activation mixed precision quantization, in which different activations tensors can be quantized with other bit-width.

python main.py --model_name deit_s --weight_n_bits 4 --activation_n_bits 4 --activation_mp --train_data_path <path_to_training_dataset> --val_data_path <path_to_validation_dataset>

This example would execute MLoRQ to compress DieT-S with 4 bits for weights and 4 bits for activations mixed precision.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
compression		compression
helpers		helpers
images		images
model_managers		model_managers
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
argument_handler.py		argument_handler.py
constants.py		constants.py
main.py		main.py
models_config.yaml		models_config.yaml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MLoRQ: Bridging Low-Rank and Quantization for Effective Transformers Compression

Models

Setup

Usage

MLoRQ

MLoRQ with Activation Mixed precision quantization

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

MLoRQ: Bridging Low-Rank and Quantization for Effective Transformers Compression

Models

Setup

Usage

MLoRQ

MLoRQ with Activation Mixed precision quantization

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages