Skip to content
View Pranshu0204's full-sized avatar

Highlights

  • Pro

Block or report Pranshu0204

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Pranshu0204/README.md

banner

typing intro

GitHub   Twitter/X   profile views


// whoami

🎓MSc Computer Science · TU Darmstadt · LLMs, CV, PGMs, Quantum Computing, Scalable Data Management
📍Darmstadt, DE 🇩🇪 — by way of Chennai, IN
🌙Architecture decisions happen on late-night walks. The whiteboard comes later.
🎮Call of Duty Mobile between training runs
📖Reads documentation the way other people read fiction. Unironically.
🤝Open to: healthcare-AI research collabs, applied ML roles, LLM systems engineering
🧭Currently circling: evaluation frameworks, responsible AI design, multi-agent orchestration, fine-tuning pipelines
"If it has no eval loop, it's just a demo."


// featured projects

🔬 StyleShift NEW

Intrinsic multi-author style change detection — no reference texts, no author profiles, pure internal comparison, framed as pairwise binary classification.

Dual-stream ensemble: 163-dim stylometric difference vectors → SVM, plus a Siamese Transformer over frozen all-mpnet-base-v2, fused by an LR meta-learner trained strictly on the validation split — leakage kept out by construction. Three difficulty tiers (Easy/Medium/Hard by topic diversity) double as a built-in ablation axis; the Hard tier shows exactly where pure stylometry hits its ceiling once topic is controlled — consistent with published PAN SOTA.

Ensemble Macro F1 = 0.606 · AUC-PR = 0.404 · PAN @ CLEF 2025

Python PyTorch scikit-learn sentence-transformers

→ repo

🧭 RecruitSense

Agentic resume intelligence — bias detection runs in a parallel LangGraph branch, decoupled from scoring by design, not as an afterthought.

RAG Fusion with Reciprocal Rank Fusion over Qdrant, parallel DAG orchestration, and QLoRA fine-tuning with automatic device detection (CUDA/MPS/CPU). Retrieval and judgment are separate concerns, wired that way on purpose.

RRF fusion · parallel DAG · QLoRA

LangGraph Qdrant FastAPI PEFT/QLoRA

🩺 AEGIS — Clinical AI Assistant

Non-diagnostic by design — structured JSON triage with graceful Markdown fallback when parsing fails, because clinical systems can't crash silently.

Clinician-supervised multimodal assistant: image + text intake, severity triage, and localization across 9 languages, streamed over WebSockets.

9 languages · graceful-degradation parser

Gemini 2.0 React TypeScript WebSocket

⚖️ Dual-Engine Sentiment Analysis

Built as a mini eval framework — same input, two engines, one /benchmark endpoint reporting per-method accuracy and per-sample agreement.

LLM-based and lexicon-based sentiment side by side, so disagreement is a first-class signal instead of noise you average away.

/benchmark endpoint · per-sample agreement

Flask Gemini TextBlob NumPy

🛣️ Road Lane Detection System

Treats CV as a streaming data system — per-frame JSON records with DETECTED / PARTIAL / NO_LANES tier classification and summary metrics on exit.

Classical pipeline (Canny + Hough + ROI masking) instrumented like a production service, not a notebook demo.

per-frame JSON telemetry

Python OpenCV NumPy

🧪 Research & Experiments

Beyond the Siren — emergency-response analysis work; the depth indicator, not the headline.

Maxwell's Rule in AR — physics visualization in augmented reality, because some intuitions need to be walked around, not read about.

research · AR / physics

Research AR
More projects
  • MLJAK2-Biotech — ML pipeline for JAK2 mutation analysis in biotech workflows.
  • Company Culture Analysis — NLP over employee-review corpora to surface culture signals beyond star ratings.

// tech stack

⚡ Current Focus

StyleShift QLoRA Eval frameworks

LLM & Agents

LangGraph LangChain Gemini PEFT/QLoRA Hugging Face

RAG & Data

Qdrant BGE-large sentence-transformers Pandas

ML · CV · NLP

PyTorch scikit-learn OpenCV spaCy NumPy

Backend & Web

FastAPI Flask React TypeScript WebSocket

Languages & Infra

Python SQL Git Docker Vite


// github stats

GitHub stats streak stats

top languages

activity graph

trophies


// connect

If you're building something where evaluation rigor matters — healthcare AI, LLM systems, agentic pipelines — my inbox is open. I'd rather see your failure cases than your demo video.

Currently benchmarking on PAN @ CLEF 2025 — if you're working on authorship analysis, forensic NLP, or style-based evaluation, I want to hear from you.

footer

Pinned Loading

  1. RecruitSense RecruitSense Public

    LLM-powered resume screener with RAG Fusion, multi-agent orchestration (LangGraph), bias detection, and QLoRA fine-tuning.

    Python

  2. Aegis Aegis Public

    Clinician-supervised, non-diagnostic AI agent for overloaded clinics

    TypeScript

  3. Beyond-the-Siren-Evaluation-of-Equitable-Emergency-Care-in-Low-and-Middle-Income-Countries Beyond-the-Siren-Evaluation-of-Equitable-Emergency-Care-in-Low-and-Middle-Income-Countries Public

    Jupyter Notebook

  4. Culture-Analysis-for-CulturOS Culture-Analysis-for-CulturOS Public

    Jupyter Notebook

  5. Dual-Engine-Twitter-Sentiment-Analysis Dual-Engine-Twitter-Sentiment-Analysis Public

    Python

  6. MLJAK2-Biotech- MLJAK2-Biotech- Public

    Jupyter Notebook