MetaLint

Code for Non-Idiomatic Python & Java Code Detection with LLMs (work done with Oracle Labs East)

Training & Evaluation Workflow

SFT training

First train SFT (alignment-handbook):

cd alignment-handbook

ACCELERATE_LOG_LEVEL=info accelerate launch --config_file recipes/accelerate_configs/deepspeed_zero3.yaml scripts/run_sft.py /path/to/sft/config.yaml

SFT Evaluation

Then evaluate SFT checkpoints at various steps to find the best one:

cd ..

python src/model/eval_llm_meta_linter_vllm_futures.py --model_name /path/to/checkpoint --write_path /path/to/where/you/want/to/write/results --test_file data/ruff_meta_linting/test_v4_new_format_with_lineno.json

To compute metrics (transfer setting):

python src/metrics/meta_linting/idiom_detection_and_localization_v3.py <STEP_NO>

python src/metrics/meta_linting/idiom_detection_and_localization_all_idioms.py <STEP_NO>

To launch vLLM server (needed for both evaluation and DPO data generation):

bash src/model/launch_vllm_server.sh /path/to/model/checkpoint

DPO data generation

Then based on the best checkpoint path generate samples for DPO training:

python src/model/eval_llm_meta_linter_vllm_multi_sample.py --model_name /path/to/best/checkpoint --write_path data/dpo_samples/<FILENAME>

convert saved DPO samples into training pairs in Huggingface format:

python src/dpo/convert_dpo_samples_to_pairs.py

After this upload the data to Huggingface!

DPO training

cd alignment-handbook

ACCELERATE_LOG_LEVEL=info accelerate launch --config_file recipes/accelerate_configs/deepspeed_zero3.yaml scripts/run_dpo.py /path/to/dpo/config.yaml

Citation

If you find our work useful, please cite us as follows:

@article{naik2025metalint,
  title={MetaLint: Generalizable Idiomatic Code Quality Analysis through Instruction-Following and Easy-to-Hard Generalization},
  author={Naik, Atharva and Baghel, Lawanya and Govindarajan, Dhakshin and Agrawal, Darsh and Fried, Daniel and Rose, Carolyn},
  journal={arXiv preprint arXiv:2507.11687},
  year={2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 182 Commits
alignment-handbook		alignment-handbook
data		data
plots		plots
scripts		scripts
semantic_peps		semantic_peps
src		src
statistical_testing		statistical_testing
.gitattributes		.gitattributes
.gitignore		.gitignore
CoT_detection_failure_qwen3_4b_think_dpo.json		CoT_detection_failure_qwen3_4b_think_dpo.json
GPU_DETAILS.txt		GPU_DETAILS.txt
README.md		README.md
experiments		experiments
filter_codereviewer_data.py		filter_codereviewer_data.py
handbook.yml		handbook.yml
peft_requirements.txt		peft_requirements.txt
py3.13.yml		py3.13.yml
ruff.toml		ruff.toml
vllm_env.yaml		vllm_env.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MetaLint

Training & Evaluation Workflow

SFT training

SFT Evaluation

DPO data generation

DPO training

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

MetaLint

Training & Evaluation Workflow

SFT training

SFT Evaluation

DPO data generation

DPO training

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages