RepoRAG

A GitHub repository RAG assistant with an agentic code maintenance workbench — source-grounded answers, hybrid retrieval, LangGraph agent orchestration, human-in-the-loop approvals, sandboxed execution, MCP tools, and trajectory-based evaluation.

30-second verification

Run the no-database unit checks:

git clone https://github.com/ZedingZhang/reporag.git
cd reporag
python -m pip install -e ".[dev,mcp]"
python -m pytest tests/test_chunkers.py tests/test_citations.py tests/test_command_guard.py tests/test_patch.py -q

Core Features

RAG Engine (Phase 1-4)

Code-aware chunking — Markdown by headings, Python by AST functions/classes, JS/TS by regex fallback; preserves line numbers and commit SHAs
Hybrid retrieval — vector (pgvector) + keyword (PostgreSQL full-text search) with Reciprocal Rank Fusion
Citation validation — every answer includes GitHub permalinks with file paths and line ranges; fabricated citations detected and stripped
LangGraph RAG pipeline — question classification → query rewrite → hybrid retrieval → rerank → evidence check → generate → validate

Agentic Extension (Phase 1-7)

LangGraph agent workflow — classify task → retrieve context → build plan → propose patch → request approval → apply/run tests → summarize
Human-in-the-loop approvals — high-risk actions (apply_patch, run_command, create_pr) require explicit approval
Security guards — CommandGuard (allowlist: pytest/ruff/python; blocks rm/sudo/curl/ssh + shell metacharacters), PathGuard (blocks .env/.git/credentials + parent traversal)
Safe execution — subprocess.run without shell=True, 60s timeout, stdout/stderr capture, ToolExecution audit log
Patch proposal — LLM generates unified diffs, auto-validated for diff format correctness
MCP server — 4 tools (search_code, create_agent_run, get_agent_run, resolve_approval) for Claude Code integration
Agent evaluation — plan_success, context_hit_rate, approval_accuracy, patch_validity, latency

Screenshots

Q&A with Citations

Agent Run with Approval Gate

Architecture

flowchart TB
    UI["Streamlit UI<br/>Q&A Tab | Agent Tab"]
    ChatAPI["/api/chat"]
    AgentAPI["/api/agent/runs"]
    Human["Human approval"]
    DB[("PostgreSQL + pgvector<br/>repositories | documents | chunks<br/>agent_runs | agent_steps | approval_requests | tool_executions")]
    MCP["RepoRAG MCP Server<br/>Claude Code tools"]

    UI --> ChatAPI
    UI --> AgentAPI

    subgraph RAG["RAG Graph"]
        R1["classify"] --> R2["rewrite"] --> R3["retrieve"] --> R4["rerank"] --> R5["evidence"] --> R6["generate"] --> R7["validate"]
    end

    subgraph Agent["Agent Graph (LangGraph)"]
        A1["classify_task"] --> A2["retrieve_context"] --> A3["build_plan"] --> A4["propose_patch"] --> A5["request_approval"]
        A5 --> A6["wait_for_approval"]
        A6 --> A7["apply_patch"] --> A8["run_tests (executor)"] --> A9["summarize"]
        A6 --> A9
        A5 -.-> Human
        Human -.->|approve/reject| A6
    end

    ChatAPI --> R1
    AgentAPI --> A1
    R3 --> DB
    R7 --> DB
    A2 --> DB
    A5 --> DB
    A9 --> DB
    MCP --> ChatAPI
    MCP --> AgentAPI

Agent Workflow (LangGraph)

classify_task ──► retrieve_context ──► build_plan
                                          │
                   plan_only ──► summarize ◄── approved/rejected
                   propose_patch / execute ──► propose_patch
                                                  │
                                            request_approval
                                                  │
                              not_required ──► summarize
                              pending ──► wait_for_approval ──► END
                              approved ──► apply_patch ──► run_tests ──► summarize

Safety and Approval Model

Risk Level	Examples	Requires Approval
Low	read-only retrieval, list repos	No
Medium	pytest, ruff check	Yes (in execute mode)
High	apply patch, rm, curl, git push, create PR	Yes

CommandGuard blocks: rm, sudo, curl, wget, ssh, nc, chmod, chown, git push, git reset --hard, shell metacharacters (|, ;, &&, $(), `, >, <)

PathGuard blocks: .env, .git/, credentials files, parent traversal (..)

Tech Stack

Layer	Technology
Backend	Python 3.11+, FastAPI
RAG/Agent	LangChain (langchain-openai, langchain-core), LangGraph
Database	PostgreSQL + pgvector (+ full-text search)
Frontend	Streamlit (Q&A + Agent tabs)
LLM	DeepSeek V4 (OpenAI-compatible, swappable)
Embeddings	OpenAI-compatible provider (swappable)
MCP	FastMCP (Python >=3.10)
Dev tools	pytest (159 tests), ruff, Alembic

Quick Start

git clone https://github.com/ZedingZhang/reporag.git
cd reporag
cp .env.example .env           # edit with your API keys
docker compose up --build       # auto-runs migrations

FastAPI docs: http://localhost:8000/docs
Streamlit UI: http://localhost:8501

Environment Variables

See .env.example. Key variables:

Variable	Description
`DATABASE_URL`	PostgreSQL connection string
`GITHUB_TOKEN`	GitHub token (raises rate limit from 60→5000 req/h)
`DEEPSEEK_API_KEY`	Chat model API key
`DEEPSEEK_MODEL`	Model name (e.g., `deepseek-v4-pro`)
`EMBEDDING_API_KEY`	Embedding API key
`EMBEDDING_DIMENSIONS`	Vector dimensions (must match model)

API

RAG Endpoints

Method	Path	Description
POST	`/api/repos/index`	Index a GitHub repo (async background)
POST	`/api/chat`	Ask a question, get cited answer
GET	`/api/repos`	List indexed repos
GET	`/api/repos/{id}/status`	Get repo indexing status

Agent Endpoints

Method	Path	Description
POST	`/api/agent/runs`	Create an agent run
GET	`/api/agent/runs/{id}`	Get run (plan, patch, approvals, steps)
GET	`/api/agent/runs/{id}/steps`	List run steps
POST	`/api/agent/runs/{id}/continue`	Resume run after approval
POST	`/api/agent/runs/{id}/cancel`	Cancel a run
POST	`/api/agent/approvals/{aid}/resolve`	Approve or reject

MCP Integration

# Install MCP SDK locally (requires Python >=3.10)
python3 -m pip install -e ".[mcp]"

# Start the MCP server
python3 -m app.mcp.server

# Or configure Claude Code to launch it automatically:
cp .mcp.example.json ~/.claude/mcp.json  # edit with your API keys

Claude Code can then use these tools: search_code, create_agent_run, get_agent_run, resolve_approval.

Evaluation

RAG Evaluation

python scripts/evaluate.py --dataset examples/eval_dataset.jsonl

Metrics: Recall@5, MRR, Citation Coverage, Latency.

Agent Evaluation

python scripts/evaluate_agent.py --dataset examples/agent_tasks.jsonl

Metrics: plan_success, context_hit_rate, approval_required_accuracy, patch_validity, avg_latency.

Project Structure

app/
  core/          Config, logging, provider adapters (ChatOpenAI, OpenAIEmbeddings)
  db/            SQLAlchemy models (7 tables), Alembic migrations
  github/        GitHub REST API client
  ingestion/     Chunkers (markdown, Python AST, JS/TS), embedding client
  retrieval/     Vector search, keyword search, hybrid fusion, reranker
  rag/           LangGraph RAG pipeline, prompts, citation validation
  agent/         LangGraph agent workflow, state, prompts, service
  tools/         repo_context, patch, executor, github_tools
  security/      CommandGuard, PathGuard, ApprovalPolicy, ApprovalManager
  mcp/           FastMCP server for Claude Code
  api/           FastAPI routes (RAG + Agent)
streamlit_app/   Q&A tab + Agent tab
scripts/         ingest_repo, evaluate, evaluate_agent
tests/           159 pytest tests

Resume Highlights

Built RepoRAG, a GitHub repository RAG assistant with code-aware chunking, hybrid retrieval, citation validation, and LangGraph-based answer generation.

Extended RepoRAG into an agentic code maintenance workbench using LangGraph, MCP tools, human-in-the-loop approvals, sandboxed command execution, trace logging, and trajectory-based agent evaluation.

Designed secure agent tooling for repository maintenance with schema-validated tools, path and command guards, approval gates for write operations, and eval metrics covering context hit rate, patch validity, unsafe-command blocking, latency, and tool-call count.

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
.github/workflows		.github/workflows
app		app
docs/screenshots		docs/screenshots
examples		examples
scripts		scripts
streamlit_app		streamlit_app
tests		tests
.env.example		.env.example
.gitignore		.gitignore
.mcp.example.json		.mcp.example.json
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
README.zh.md		README.zh.md
alembic.ini		alembic.ini
docker-compose.yml		docker-compose.yml
docker-entrypoint.sh		docker-entrypoint.sh
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RepoRAG

30-second verification

Core Features

RAG Engine (Phase 1-4)

Agentic Extension (Phase 1-7)

Screenshots

Q&A with Citations

Agent Run with Approval Gate

Architecture

Agent Workflow (LangGraph)

Safety and Approval Model

Tech Stack

Quick Start

Environment Variables

API

RAG Endpoints

Agent Endpoints

MCP Integration

Evaluation

RAG Evaluation

Agent Evaluation

Project Structure

Resume Highlights

Roadmap

About

Uh oh!

Releases

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

RepoRAG

30-second verification

Core Features

RAG Engine (Phase 1-4)

Agentic Extension (Phase 1-7)

Screenshots

Q&A with Citations

Agent Run with Approval Gate

Architecture

Agent Workflow (LangGraph)

Safety and Approval Model

Tech Stack

Quick Start

Environment Variables

API

RAG Endpoints

Agent Endpoints

MCP Integration

Evaluation

RAG Evaluation

Agent Evaluation

Project Structure

Resume Highlights

Roadmap

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Contributors

Uh oh!

Languages