Kunal Bisht aiKunalBisht

Kunal Bisht

AI Engineer from Pithoragarh, India — building LLM systems that actually work.

LinkedIn · kunalbisht909@gmail.com · JLPT N5 🇯🇵

What I've built

TranscriptAI — Meeting intelligence for EN · HI · JA

"LLMs are great at extraction. They're terrible at cultural nuance."

I built a meeting analysis system and hit a wall at 22% accuracy. The problem wasn't the LLM — it was that no amount of prompting teaches a model what nemawashi means in a real boardroom.

So I went and learned it myself. Two months of Japanese business linguistics, 16 soft-rejection patterns, MeCab morphological analysis for keigo detection. Then wired it as a post-processing NLP layer on top of the LLM output.

93% accuracy. 5 iterations. Every drop in accuracy had a specific root cause and a specific fix.

What's actually interesting here:

Hallucination guard is rule-based token overlap — the LLM never grades its own output
PII masking runs locally before the transcript reaches any API. Zero data leaves the machine unmasked.
Same person written as 田中, Tanaka, and "the Director" resolves to one speaker identity
3-tier fallback: Groq → Ollama → Mock. Explicit UX signal at each tier, not silent degradation

github.com/aiKunalBisht/Transcript-ai · Live on HuggingFace

FastAPI Groq MeCab FAISS Streamlit GitHub Actions 21 tests passing

RAG-Wikipedia — QA grounded in 241k articles

Built to answer a specific question: how bad does RAG actually get at scale?

241k Wikipedia articles → ~39k semantic chunks with FAISS
Sub-50ms vector search on CPU
100% faithfulness on DeepEval test set
FastAPI endpoint

github.com/aiKunalBisht/rag-wikipedia

Production backend — 5,000+ users

Backend engineering on a live college review platform.

The interesting parts: brought response time from ~300ms → ~85ms through query profiling and indexing, not rewrites. JWT RBAC across user roles. Monolith → microservices migration that actually reduced deploy complexity instead of adding it.

Node.js MongoDB TypeScript Next.js

TransitIntel — ML pipeline from CSV to deployed API

End-to-end: raw data → feature engineering → XGBoost → prediction API → 4 Plotly dashboards → Gunicorn + Nginx. First time I owned the full MLOps cycle.

github.com/aiKunalBisht/TransitIntel

Stack

AI/ML — Python, scikit-learn, PyTorch, Pandas, NumPy, LangChain, FAISS, Groq, Whisper, MeCab, Sentence-Transformers, RAG, LLM eval (DeepEval, RAGAS), Prompt Engineering

Backend — FastAPI, Node.js, Express, Flask, PostgreSQL, MongoDB, Redis, Docker, JWT

Frontend — React, Next.js, Tailwind CSS, TypeScript

Background

Master's in Data Science & Business Analytics — Simplilearn x IBM

B.Tech Computer Science & Engineering

Statistics · Probability · Model training and evaluation · Data analytics

Open to AI/ML Engineer and GenAI Developer roles — India and Japanese MNCs.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly