Skip to content
View Bhavd33p's full-sized avatar

Block or report Bhavd33p

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Bhavd33p/README.md

Bhavdeep Singh

Software Engineer

AI Engineering · Full-Stack Development · LLM Systems

LinkedIn Email GitHub


Bhavd33p's Contribution Graph


Skills

Languages

Python TypeScript JavaScript C++ Java SQL

AI / ML

PyTorch HuggingFace LangChain TensorFlow scikit-learn

Backend / APIs

FastAPI Node.js Express.js

Frontend

React Next.js

Databases

PostgreSQL MongoDB Redis MySQL

DevOps / Tools

Docker Git Linux Vercel


About

I build AI systems that ship — from fine-tuning LLMs on custom hardware to full-stack apps with real auth and real users. I care about making things fast, reliable, and cost-efficient. Currently looking to join an early-stage team as a Software / AI Engineer where I can own features end-to-end.


Projects

🧠 Multi-Tenant LLM Serving Engine

Production inference server — multiple LoRA adapters, one quantized base model

FastAPI server that hot-swaps between task-specific LoRA adapters (code generation, finance extraction) on a 4-bit quantized Qwen2.5 model. Trained adapters from scratch on Apple Silicon using custom data pipelines.

  • 4-bit NF4 quantization — ~75% memory reduction, fits on consumer GPU/MPS
  • O(1) adapter switching — pointer swap at inference time, no reload overhead
  • Streaming SSE responses — token-by-token via TextIteratorStreamer + async FastAPI
  • Full training pipeline — data prep → LoRA fine-tuning → adapter serving in one repo

FastAPI PyTorch HuggingFace LoRA/PEFT BitsAndBytes Apple Silicon MPS

→ View Repo


🤖 AI TestCase Generator

RAG pipeline that turns requirement docs into test cases

Ingests SWAD/ICD/BRD documents and generates structured test cases using adaptive retrieval and feedback loops. Benchmarked on RTX 4090.

  • Adaptive RAG with vector search over requirement chunks
  • Human-in-the-loop feedback for iterative refinement
  • Cuts manual test case writing by ~70–80%

Python LangChain RAG Vector DBs FastAPI

→ View Repo


💬 PDF Converse

Multi-user collaborative chat over any PDF

Upload a PDF, ask questions, get grounded answers. Multiple users, multiple threads, all indexed and retrieved with semantic search.

  • Chunking + embeddings + conversational retrieval chain
  • Multi-user session management with isolated threads
  • Answers grounded strictly in uploaded document context

Python LangChain Vector Embeddings FastAPI React

→ View Repo


✅ Mini Task Manager

Full-stack project management with role-based auth

End-to-end task and project management system. Clean API, JWT auth, role separation, due dates.

  • .NET backend + React + TypeScript frontend
  • JWT authentication with role-based access control
  • User-scoped projects and tasks with due date tracking

.NET React TypeScript JWT PostgreSQL

→ View Repo


🖊️ Web Annotator

Browser extension for in-place web annotation

Chrome extension to highlight, underline, and annotate text on any webpage — saved persistently per-URL.

JavaScript Chrome Extensions API DOM Manipulation

→ View Repo


What I'm Looking For

Open to Software Engineer or AI Engineer roles at early-stage companies where I can:

  • Own features from problem definition through design to production
  • Build reliable, cost-efficient LLM systems and internal tooling
  • Work on teams that move fast and care about quality

📬 singhbhavdeep364@gmail.com · LinkedIn

Popular repositories Loading

  1. Potato-Disease-Classifier Potato-Disease-Classifier Public

    The Project Made Under INTRA BHAWAN TECHTONIC is a Potato Disease Classifier Made using CNN

    Jupyter Notebook

  2. PDF-Converse PDF-Converse Public

    The PDF Converse tool can be used to upload PDFs and effectively answer all queries. With this tool, users can collaborate and engage in discussions within the context of the document itself.

    Python

  3. Web-Annotator Web-Annotator Public

    Web Annotator is a browser extension that allows you to highlight, underline, and annotate any text on web pages.

    JavaScript

  4. CP-codes CP-codes Public

    This repo contains all non classical algo's and codes.

    C++

  5. TestCase_Generator TestCase_Generator Public

    Forked from Bhavd33p-Singh/TestCase_Generator

    Python

  6. Mini-Task-Manager Mini-Task-Manager Public

    A full-stack Mini Project Management System built with .NET and React + TypeScript, featuring secure JWT-based authentication, user-specific projects, and task tracking with due dates and completio…

    C#