Skip to content
View aiKunalBisht's full-sized avatar
🎯
Focusing
🎯
Focusing
  • AI Engineer
  • Remote
  • 16:06 (UTC +05:30)
  • LinkedIn in/kunalhere

Block or report aiKunalBisht

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
aiKunalBisht/README.md

Kunal Bisht

AI Engineer from Pithoragarh, India — building LLM systems that actually work.

LinkedIn · kunalbisht909@gmail.com · JLPT N5 🇯🇵


What I've built

TranscriptAI — Meeting intelligence for EN · HI · JA

"LLMs are great at extraction. They're terrible at cultural nuance."

I built a meeting analysis system and hit a wall at 22% accuracy. The problem wasn't the LLM — it was that no amount of prompting teaches a model what nemawashi means in a real boardroom.

So I went and learned it myself. Two months of Japanese business linguistics, 16 soft-rejection patterns, MeCab morphological analysis for keigo detection. Then wired it as a post-processing NLP layer on top of the LLM output.

93% accuracy. 5 iterations. Every drop in accuracy had a specific root cause and a specific fix.

What's actually interesting here:

  • Hallucination guard is rule-based token overlap — the LLM never grades its own output
  • PII masking runs locally before the transcript reaches any API. Zero data leaves the machine unmasked.
  • Same person written as 田中, Tanaka, and "the Director" resolves to one speaker identity
  • 3-tier fallback: Groq → Ollama → Mock. Explicit UX signal at each tier, not silent degradation

github.com/aiKunalBisht/Transcript-ai · Live on HuggingFace

FastAPI Groq MeCab FAISS Streamlit GitHub Actions 21 tests passing


RAG-Wikipedia — QA grounded in 241k articles

Built to answer a specific question: how bad does RAG actually get at scale?

  • 241k Wikipedia articles → ~39k semantic chunks with FAISS
  • Sub-50ms vector search on CPU
  • 100% faithfulness on DeepEval test set
  • FastAPI endpoint

github.com/aiKunalBisht/rag-wikipedia


Production backend — 5,000+ users

Backend engineering on a live college review platform.

The interesting parts: brought response time from ~300ms → ~85ms through query profiling and indexing, not rewrites. JWT RBAC across user roles. Monolith → microservices migration that actually reduced deploy complexity instead of adding it.

Node.js MongoDB TypeScript Next.js


TransitIntel — ML pipeline from CSV to deployed API

End-to-end: raw data → feature engineering → XGBoost → prediction API → 4 Plotly dashboards → Gunicorn + Nginx. First time I owned the full MLOps cycle.

github.com/aiKunalBisht/TransitIntel


Stack

AI/ML — Python, scikit-learn, PyTorch, Pandas, NumPy, LangChain, FAISS, Groq, Whisper, MeCab, Sentence-Transformers, RAG, LLM eval (DeepEval, RAGAS), Prompt Engineering

Backend — FastAPI, Node.js, Express, Flask, PostgreSQL, MongoDB, Redis, Docker, JWT

Frontend — React, Next.js, Tailwind CSS, TypeScript


Background

Master's in Data Science & Business Analytics — Simplilearn x IBM

B.Tech Computer Science & Engineering

Statistics · Probability · Model training and evaluation · Data analytics


GitHub Stats Top Languages


Open to AI/ML Engineer and GenAI Developer roles — India and Japanese MNCs.

Popular repositories Loading

  1. data-science-portfolio data-science-portfolio Public

    HTML 1

  2. srchr srchr Public

    HTML 1

  3. rag-wikipedia rag-wikipedia Public

    My first RAG Project

    Python 1

  4. Transcript-ai Transcript-ai Public

    In modern international business, conversations often shift between languages. Transcript AI bridges this gap by utilizing Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) to e…

    Python 1

  5. streamify-video-calls-master streamify-video-calls-master Public archive

    JavaScript

  6. ai-resume-builder-main ai-resume-builder-main Public

    JavaScript