Software Engineering student at Queen's University Belfast, focused on ML robustness, LLM evaluation, and building systems from first principles.
- Investigating positional bias in multiple-choice LLM evaluation using two-stage prompting and mitigation baselines.
- Running open-source model generalization experiments on Kelvin2 HPC with Qwen 2.5 models.
- Contributing to a survey on LLM/MLLM robustness.
- Two-Stage Prompting for MCQ Evaluation: Research project evaluating whether decomposing MCQ answering into free-text reasoning and option matching reduces positional bias.
- MCQ Bias Generalization Experiments: Open-source model experiments testing whether MCQ bias-mitigation methods generalize across model scale, dataset, and method family.
- Multilayer Perceptron from Scratch: NumPy-only neural network trained on Fashion-MNIST, with reproducible training artifacts and evaluation.
- Logistic Regression from Scratch: From-scratch implementation focused on optimization, decision boundaries, and reproducible experiment outputs.
Python · NumPy · PyTorch · Hugging Face · LaTeX · Git · Linux · Bash · Docker