⚡ SkillBridge — AI-Adaptive Onboarding Engine

An AI-driven system that analyzes a candidate's resume against a job description, identifies precise skill gaps, and generates a personalized, prioritized learning roadmap.

🎯 Problem Statement

Corporate onboarding suffers from a one-size-fits-all problem:

Experienced hires waste time on concepts they already know
Beginners get overwhelmed by advanced modules
No one measures the actual gap between hire capabilities and role requirements

SkillBridge solves this by generating a personalized learning pathway in under 100 seconds using AI.

🚀 Live Demo

Upload a resume (PDF/DOCX) + job description (PDF/TXT) → get a fully personalized roadmap with reasoning traces.

✨ Features

Feature	Description
🤖 Intelligent Parsing	Hybrid LLM + alias extraction from resume & JD
📊 Skill Gap Analysis	3-layer comparison: exact match → skill family → cosine similarity
🗺️ Adaptive Pathway	Original WGT (Weighted Graph Traversal) algorithm
🔍 Reasoning Traces	Every recommendation explained with evidence from your resume
🎨 Interactive Roadmap	react-flow DAG with color-coded nodes and filter controls
🚫 Zero Hallucinations	All course recommendations from curated closed catalog only
⚡ Real-time Progress	Live 6-stage processing pipeline with WebSocket-style polling

🏗️ Architecture

┌─────────────────────────────────────────────────────────┐
│              FRONTEND (React 18 + Vite)                 │
│   Upload → Processing → Dashboard → Roadmap (DAG)       │
└──────────────────────────┬──────────────────────────────┘
                           │ REST API
┌──────────────────────────▼──────────────────────────────┐
│               BACKEND (FastAPI + Python 3.11)           │
│  /analyze  /status  /results  /trace  /health           │
└──────┬──────────────────────────────────────────────────┘
       │
┌──────▼──────────────────────────────────────────────────┐
│                    AI / ML LAYER                        │
│  Parser (LLM + aliases)  →  Gap Analyzer               │
│  WGT Algorithm (NetworkX)  →  Trace Generator (LLM)    │
└─────────────────────────────────────────────────────────┘

🧠 Skill Gap Analysis Logic

3-Layer Matching Pipeline

For each JD skill requirement:

Layer 1 — Exact ID Match
  Resume has exact skill? → compute level coverage score
  coverage ≥ 0.85 → fully covered (skip)
  coverage < 0.85 → weak gap

Layer 2 — Skill Family Match  
  e.g. Resume has PostgreSQL + MySQL → covers SQL requirement (0.84 coverage)
  e.g. Resume has React → partial credit for Next.js
  Prevents false gaps from skill variant naming

Layer 3 — Semantic Similarity (cosine)
  Embed all skill names with local 384-dim hashing vectors
  Cosine similarity search
  score ≥ 0.62 → weak gap
  score < 0.62 → missing gap

WGT Priority Score Formula (Original Algorithm)

P(skill) = 0.40 × gap_severity
         + 0.30 × requirement_weight  
         + 0.20 × dependency_urgency
         + 0.10 × experience_penalty

Where:
  gap_severity       = 1.0 - coverage_score
  requirement_weight = 1.5 (required) or 1.0 (preferred), normalized
  dependency_urgency = blocked_gap_skills / total_gap_skills
  experience_penalty = reduced for trivial skills on senior hires

Skills are then ordered via Kahn's topological sort with P-score tie-breaking, ensuring prerequisites always appear before dependent skills.

🛠️ Tech Stack

Frontend

Library	Version	Purpose
React	18.3	UI framework
Vite	5.x	Build tool
@xyflow/react	11.x	DAG roadmap visualization
recharts	2.x	Skill gap charts
react-dropzone	14.x	File uploads
zustand	4.x	State management

Backend

Library	Version	Purpose
FastAPI	0.111	REST API framework
SQLAlchemy	2.0	ORM + SQLite storage
pdfplumber	0.11	PDF text extraction
python-docx	1.1	DOCX text extraction

AI / ML

Model / Library	Purpose
Mistral via local Ollama or Llama 3.1 via Groq	Skill extraction enhancement
Local hashing vectors	384-dim skill-name similarity
NetworkX	Skill dependency DAG

Datasets & Ontologies

Dataset	Source	Rows Processed	Usage
Kaggle Resume Dataset	snehaanbhawal	2,484 resumes	Enriched 11 skill aliases in taxonomy
Kaggle JD Dataset	kshitizregmi	2,277 JDs	Computed demand weights for WGT P-Score
O*NET Database 28.1	onetcenter.org	Full database	Canonical skill IDs + prerequisite graph

⚙️ Setup Instructions

Option A — Docker (Recommended for Judges)

# 1. Clone the repository
git clone https://github.com/your-team/skillbridge.git
cd skillbridge

# 2. Create your Docker environment file
cp backend/.env.docker.example backend/.env.docker

# 3. Choose an LLM provider in backend/.env.docker
# Option A: local Ollama (free)
#   LLM_PROVIDER=ollama
#   OLLAMA_BASE_URL=http://host.docker.internal:11434
#   OLLAMA_MODEL=mistral
#
# Option B: Groq API (hosted)
#   LLM_PROVIDER=groq
#   GROQ_API_KEY=gsk_your_key_here
#   GROQ_MODEL=llama-3.1-8b-instant

# 4. Start everything
docker compose up --build

# 5. Open the app
# Frontend: http://localhost:3000
# API Docs: http://localhost:8000/docs

For Ollama with Docker Desktop on Windows/macOS, keep OLLAMA_BASE_URL=http://host.docker.internal:11434 so the backend container can reach Ollama running on your host machine. For native Linux Docker, use http://172.17.0.1:11434 or expose Ollama on an address reachable from Docker.

For Groq, create an API key at https://console.groq.com/keys and paste it into backend/.env.docker. Do not commit the real .env.docker file.

Option B — Local Development

Prerequisites

Python 3.11+
Node.js 18+
Ollama for local LLM OR a Groq API key

Backend Setup

cd skillbridge/backend

# Create virtual environment
python -m venv venv
source venv/bin/activate        # Windows: venv\Scripts\activate

# Install dependencies
pip install -r requirements.txt

# No additional NLP model download is required

# Configure environment
# Create backend/.env and set the same LLM variables shown in .env.docker.example.
# For local development, use DATABASE_URL=sqlite:///./skillbridge.db.
# If using local Ollama without Docker, use OLLAMA_BASE_URL=http://localhost:11434.

# Start server
uvicorn app.main:app --reload --port 8000

Frontend Setup

cd skillbridge/frontend

# Install dependencies
npm install

# Configure environment
# Edit .env — set VITE_API_URL=http://localhost:8000

# Start dev server
npm run dev
# Opens at http://localhost:5173

LLM Setup

Local Ollama:

# Install Ollama from https://ollama.com/download
ollama pull mistral

# If running the backend directly on your machine, use:
# LLM_PROVIDER=ollama
# OLLAMA_BASE_URL=http://localhost:11434
# OLLAMA_MODEL=mistral

# If running with Docker, use:
# OLLAMA_BASE_URL=http://host.docker.internal:11434

Groq API:

# Create a key at https://console.groq.com/keys
# Update backend/.env or backend/.env.docker:
# LLM_PROVIDER=groq
# GROQ_API_KEY=gsk_your_key_here
# GROQ_MODEL=llama-3.1-8b-instant

📁 Project Structure

skillbridge/
├── backend/
│   ├── app/
│   │   ├── api/
│   │   │   └── routes.py          # All API endpoints
│   │   ├── core/
│   │   │   ├── config.py          # Settings management
│   │   │   ├── database.py        # SQLite/PostgreSQL setup
│   │   │   └── llm.py             # Unified LLM caller
│   │   ├── data/
│   │   │   ├── skill_taxonomy.json    # 71 skills + prerequisites
│   │   │   └── course_catalog.json   # 58 curated modules
│   │   ├── models/
│   │   │   ├── db_models.py       # SQLAlchemy ORM models
│   │   │   └── schemas.py         # Pydantic schemas
│   │   ├── services/
│   │   │   ├── parser.py          # Resume + JD extraction
│   │   │   ├── embeddings.py      # Sentence transformer service
│   │   │   ├── gap_analyzer.py    # 3-layer gap analysis
│   │   │   ├── wgt_engine.py      # Adaptive pathing algorithm
│   │   │   ├── trace_generator.py # LLM reasoning traces
│   │   │   └── data_loader.py     # Taxonomy + catalog loader
│   │   └── main.py                # FastAPI app entrypoint
│   ├── requirements.txt
│   ├── .env.docker.example
│   └── Dockerfile
├── frontend/
│   ├── src/
│   │   ├── components/
│   │   │   ├── UploadScreen.jsx
│   │   │   ├── ProcessingScreen.jsx
│   │   │   ├── ResultsDashboard.jsx
│   │   │   └── RoadmapView.jsx
│   │   ├── store/
│   │   │   └── useAppStore.js     # Zustand + localStorage persist
├── frontend/
│   ├── src/
│   │   ├── components/
│   │   │   ├── UploadScreen.jsx
│   │   │   ├── ProcessingScreen.jsx
│   │   │   ├── ResultsDashboard.jsx
│   │   │   └── RoadmapView.jsx
│   │   ├── store/
│   │   │   └── useAppStore.js     # Zustand + localStorage persist
│   │   └── api/
│   │       └── client.js          # Axios API client
│   └── Dockerfile
├── docker-compose.yml
└── README.md

🔌 API Reference

Endpoint	Method	Description
`POST /api/v1/analyze`	POST	Upload resume + JD, returns job_id
`GET /api/v1/status/{job_id}`	GET	Poll job progress (0-100%)
`GET /api/v1/results/{job_id}`	GET	Get full pathway + gap report
`GET /api/v1/trace/{job_id}/{skill_id}`	GET	Get reasoning trace for a skill
`GET /api/v1/stats`	GET	System stats
`GET /health`	GET	Health check (detailed)
`GET /api/health`	GET	Keep-alive health check (external monitors)
`GET /docs`	GET	Interactive Swagger UI

📊 Evaluation Metrics

Metric	Target	Approach
Skill Extraction F1	> 0.82	Hybrid LLM + alias matching
Gap Detection Recall	> 85%	3-layer matching pipeline
Pathway Validity	100%	Topological sort guarantee
Hallucination Rate	< 1%	Closed course catalog
E2E Latency (p95)	< 30s	Async background tasks

🔄 Keeping Render Awake

The backend is hosted on Render's free tier, which spins down after 15 minutes of inactivity. To prevent cold starts, this project uses an external uptime monitor that periodically pings:

GET /api/health

The endpoint returns an instant HTTP 200 with no database access or auth — keeping the dyno warm without any cost.

Quick setup (2 minutes)

Deploy the backend to Render
Create a free account at cron-job.org
Create a new cron job:
- URL: https://<your-service>.onrender.com/api/health
- Schedule: */10 * * * * (every 10 minutes)
- Method: GET
Save — done.

For full configuration details, troubleshooting, and UptimeRobot as an alternative, see KEEP_ALIVE_SETUP.md.

📄 License

MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
.github/workflows		.github/workflows
backend		backend
frontend		frontend
scripts		scripts
.dockerignore		.dockerignore
.gitignore		.gitignore
.python-version		.python-version
KEEP_ALIVE_SETUP.md		KEEP_ALIVE_SETUP.md
README.md		README.md
docker-compose.yml		docker-compose.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

⚡ SkillBridge — AI-Adaptive Onboarding Engine

🎯 Problem Statement

🚀 Live Demo

✨ Features

🏗️ Architecture

🧠 Skill Gap Analysis Logic

3-Layer Matching Pipeline

WGT Priority Score Formula (Original Algorithm)

🛠️ Tech Stack

Frontend

Backend

AI / ML

Datasets & Ontologies

⚙️ Setup Instructions

Option A — Docker (Recommended for Judges)

Option B — Local Development

Prerequisites

Backend Setup

Frontend Setup

LLM Setup

📁 Project Structure

🔌 API Reference

📊 Evaluation Metrics

🔄 Keeping Render Awake

Quick setup (2 minutes)

📄 License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

⚡ SkillBridge — AI-Adaptive Onboarding Engine

🎯 Problem Statement

🚀 Live Demo

✨ Features

🏗️ Architecture

🧠 Skill Gap Analysis Logic

3-Layer Matching Pipeline

WGT Priority Score Formula (Original Algorithm)

🛠️ Tech Stack

Frontend

Backend

AI / ML

Datasets & Ontologies

⚙️ Setup Instructions

Option A — Docker (Recommended for Judges)

Option B — Local Development

Prerequisites

Backend Setup

Frontend Setup

LLM Setup

📁 Project Structure

🔌 API Reference

📊 Evaluation Metrics

🔄 Keeping Render Awake

Quick setup (2 minutes)

📄 License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages