Skip to content
View Umair-IITD's full-sized avatar

Highlights

  • Pro

Block or report Umair-IITD

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Umair-IITD/README.md

Md Umair Alam

B.Tech Student at IIT Delhi · AI/ML & GenAI · Full-Stack Engineer · Building real-world products

I build things that work in production — from AI-powered web platforms to enterprise-grade RAG pipelines. My work sits at the intersection of applied machine learning and full-stack engineering, with a focus on shipping systems that solve real problems.

LinkedIn GitHub


About Me

I'm a student at IIT Delhi with a deep interest in applied AI, GenAI systems, and product engineering. I don't just study these fields — I build with them. My projects range from deployed full-stack applications with real users to rigorous ML pipelines evaluated against enterprise benchmarks.

I'm drawn to problems where good engineering and thoughtful product design intersect: making AI accessible, building systems that scale securely, and extracting actionable insight from data. I'm actively looking for software engineering and AI/ML internship opportunities where I can contribute meaningfully from day one.


Featured Projects

PortfolioLite — Privacy-First Cross-Platform Finance App

Live at portfoliolite.tech

A premium React Native mobile application for personal net worth tracking — engineered local-first with zero cloud dependency, biometric security, and a hardened payment & license system.

  • What it does: Tracks assets across categories with interactive growth timelines, allocation insights, and biometric-protected access. All data stays exclusively on the user's device.
  • What makes it technically deep: Built a device-tethered license engine using Supabase Edge Functions (Deno) with HMAC SHA-256 webhook verification to prevent code sharing; engineered a brute-force-resistant parameter parser for inconsistent Razorpay redirect payloads; implemented IP-based rate limiting (5 failed attempts → block) on the edge; automated transactional email delivery via Resend API; DNS-mapped custom domain to GitHub Pages with 100/100 performance score.
  • Stack: React Native, Expo SDK 52, TypeScript, Supabase (PostgreSQL + Edge Functions), Razorpay, Resend API, GitHub Pages

Hapiimood — AI Mental Wellness Platform

Live at hapiimood.me

A production-deployed full-stack web application built to support student mental health through anonymous, AI-driven conversations and mood analytics.

  • What it does: Real-time empathetic chat powered by Groq's Llama-3.1, background sentiment analysis, and a personalized analytics dashboard (mood trends, sentiment distribution, weekly engagement).
  • What makes it technically deep: Solved a non-trivial Clerk × Supabase RLS identity bridge using Next.js Server Actions; implemented serverless-safe background logging via Vercel's waitUntil; built in-memory IP rate limiting (60 req/min) to protect LLM API costs; configured custom DNS CNAME routing and dynamic CSP headers for production hardening.
  • Stack: Next.js 15, TypeScript, Supabase (PostgreSQL + RLS), Clerk Auth, Groq API, Tailwind CSS v4, Framer Motion, Recharts, Vercel

Enterprise RAG Pipeline — WixQA — Applied NLP Research

An end-to-end Retrieval-Augmented Generation system benchmarked on the WixQA dataset (6,221 enterprise help articles from Wix AI Research).

  • What it does: Full pipeline from knowledge base ingestion → hyperparameter search → retrieval evaluation → generation evaluation → system improvements (reranking, query rewriting, semantic chunking).
  • Results: Achieved 100% Context Recall at optimal configuration; reranking improved F1 by +16% on expert queries and +32% on simulated user queries.
  • Stack: Python, FAISS, SentenceTransformers (BAAI/bge-base-en-v1.5), Groq (Llama-3.1-8B), LangChain, ROUGE evaluation, LLM-as-judge

Tabular Prediction Using LLMs — Prompt Engineering Research

A rigorous empirical study comparing 5 prompt engineering strategies across 2 LLMs on the Titanic survival dataset — 10 experimental conditions, 7,000+ real API calls.

  • What it does: Evaluates Zero-shot, Few-shot, Chain-of-Thought, Self-Consistency, and Tree-of-Thought prompting against Accuracy, Precision, Recall, and F1 metrics; includes systematic hard-case failure analysis identifying 3 structural LLM reasoning blind spots.
  • Key finding: Zero-shot with the 70B model achieved 82.2% accuracy — and complex prompting strategies couldn't overcome strong statistical priors, revealing a fundamental ceiling for prompting-only approaches on tabular tasks.
  • Stack: Python, Groq API (Llama-3.1-8B & 3.3-70B), Pandas, scikit-learn, Matplotlib, Seaborn, Jupyter

Study Buddy — Production AI Academic Assistant

Live at Study-Buddy

A production-grade AI study companion for students — built around the insight that good tutoring means guiding thinking, not just supplying answers.

  • What it does: Ultra-low-latency AI tutoring via Groq inference, personalized study planning, multi-session persistence, and full markdown rendering for code and formulas.
  • Engineering highlights: Secure serverless API routing (API keys never exposed client-side); migrated from Gemini to Groq to reduce response latency for a real-time study feel; full light/dark mode with persistent local session history.
  • Stack: Next.js 15, React 19, Groq SDK (Llama-3.1-8B), Tailwind CSS, Vercel

Modern Heritage Growth Analysis — Retail Business Analytics

A comprehensive data analysis project for a retail business, focused on translating raw sales and inventory data into actionable business decisions.

  • What it does: Sales performance analysis, inventory health assessment, logistics optimization, and category-level growth insights — structured as a consulting-style analytical report.
  • Stack: Python, Jupyter Notebook, Pandas, data visualization libraries

Weather App — Real-Time Weather Dashboard

A responsive weather application with real-time data fetching via third-party weather API integration, location-based lookup, and a clean, intuitive UI.

  • What it does: Live weather conditions, temperature, humidity, wind speed, and multi-day forecast for any searched city or detected location — with dynamic UI that responds to weather state.
  • Engineering highlights: Demonstrates clean API integration patterns, asynchronous data fetching, error handling for invalid queries, and responsive layout design.
  • Stack: HTML, CSS, JavaScript, OpenWeatherMap API

Tech Stack

Languages Python · TypeScript · JavaScript · SQL

Frontend & Frameworks Next.js · React · React Native · Expo · Tailwind CSS · Framer Motion

Backend & Infrastructure Supabase · PostgreSQL · Vercel · Clerk Auth · Next.js API Routes

AI / ML Groq API · LLaMA 3.1 · FAISS · SentenceTransformers · LangChain · RAG · Prompt Engineering

Data & Analysis Pandas · Jupyter Notebook · ROUGE · LLM-as-judge evaluation


What I'm Working On

  • Deepening expertise in LLM systems — retrieval, evaluation, and agentic workflows
  • Exploring Bioinformatics applications of AI at IIT Delhi
  • Expanding Hapiimood's feature set and user base
  • Applying advanced prompt engineering strategies (CoT, Self-Consistency, ToT) to tabular and structured data tasks

GitHub Stats

GitHub Streak


Profile Views Repos Focus


Connect

I'm open to internship opportunities, research collaborations, and conversations about AI systems and product engineering.

LinkedIn Email


IIT Delhi · Open to SWE & AI/ML Internships

Pinned Loading

  1. enterprise-rag-wixqa enterprise-rag-wixqa Public

    End-to-end Retrieval-Augmented Generation (RAG) system using WixQA dataset with FAISS, LLM evaluation, and system improvements (reranking, query rewriting).

    Jupyter Notebook

  2. Tabular_Prediction_Using_LLM Tabular_Prediction_Using_LLM Public

    LLM-based tabular prediction system using advanced prompt engineering (Zero-shot, Few-shot, CoT, Self-Consistency, ToT) on the Titanic dataset. Includes 10 experimental setups across multiple LLMs …

    Jupyter Notebook

  3. Study-Buddy Study-Buddy Public

    A production-ready AI chatbot for students that assists with assignments, academic queries, and study planning using modern LLM techniques, intuitive UI, and scalable architecture.

    JavaScript

  4. Hapiimood Hapiimood Public

    Hapiimood is an AI-powered mental wellness platform that helps users track emotions, gain insights, and improve their mental health through intelligent conversations and analytics. Built as a full-…

    TypeScript

  5. portfoliolite portfoliolite Public

    PortfolioLite – A premium, local-first net worth and asset tracker. Built for privacy, it stores all your financial data exclusively on your device. Features include interactive growth timelines, d…

    TypeScript

  6. Modern_Heritage_Growth_Analysis Modern_Heritage_Growth_Analysis Public

    A comprehensive retail data analysis project for Modern Heritage, focusing on sales performance, inventory health, and logistics optimization. The analysis uncovers actionable insights on category …

    Jupyter Notebook