Levi levizwang

Hi, I'm Levi Wang 👋 — AI Evaluation & LLM Infrastructure Engineer

I specialize in AI evaluation, LLM infrastructure, and quality assurance for production-grade systems. I'm currently focused on end-to-end RAG development, automated evaluation pipelines, and multi-modal model testing to ensure reliable, high-precision AI outcomes.

🧰 Tech Stack

Languages

AI / LLM

Frameworks / Platforms

Tools

🧪 Core Expertise & Projects

RAGAS Evaluation Framework Integration: integrated RAGAS-based automated evaluation into end-to-end RAG systems.
Multi-modal AI Testing: built visual evaluation datasets and benchmarking workflows for VLMs.
Automated Evaluation Pipelines: designed a modular “evals” framework for one-click benchmarking and insights.
Speech & Voice AI QA: fine-tuned Whisper for ASR optimization in production-grade voice agents.

📊 Coding Stats

From: 29 April 2026 - To: 29 May 2026

Total Time: 185 hrs 28 mins

Python        67 hrs 39 mins        ████████▓░░░░░░░░░░░░░░░░   35.10 %
JSON          41 hrs 4 mins         █████▒░░░░░░░░░░░░░░░░░░░   21.31 %
Markdown      30 hrs 9 mins         ████░░░░░░░░░░░░░░░░░░░░░   15.65 %
Text          15 hrs 22 mins        ██░░░░░░░░░░░░░░░░░░░░░░░   07.97 %
Bash          10 hrs 41 mins        █▒░░░░░░░░░░░░░░░░░░░░░░░   05.55 %
Other         7 hrs 17 mins         █░░░░░░░░░░░░░░░░░░░░░░░░   03.78 %
CSV           4 hrs 37 mins         ▓░░░░░░░░░░░░░░░░░░░░░░░░   02.40 %
Git Config    3 hrs 5 mins          ▒░░░░░░░░░░░░░░░░░░░░░░░░   01.60 %
TOML          3 hrs 4 mins          ▒░░░░░░░░░░░░░░░░░░░░░░░░   01.60 %
Jinja2        2 hrs 49 mins         ▒░░░░░░░░░░░░░░░░░░░░░░░░   01.47 %
