Skip to content
View Hritikd's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Hritikd

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Hritikd/README.md

Hritik Datta

Product @ Pre6 AI  ·  I build production-grade AI agent systems.

Product by title, builder by craft — I design AI products and ship the engineering behind them: multi-agent orchestration, agent evaluation, and AI safety infrastructure.


What I work on

I care about the unglamorous half of AI products — the part that decides whether they survive contact with real users. Most demos route a single LLM call. Production systems need orchestration, evaluation, safety gates, and observability. That gap is what I build into.

  • Multi-agent orchestration — supervisor/specialist architectures with typed state, tool binding, and streaming traces.
  • Agent reliability — measurable, auditable evaluation of agent runs across reliability, safety, latency, and cost.
  • LLM safety — scanning retrieval context for prompt injection, secret leakage, PII, and exfiltration before it reaches a model.
  • Developer tooling — sharp CLIs that turn fuzzy engineering signals into decisions teams can act on.

Featured work

Project What it is Stack Links
gemma4-multi-agent Production-ready multi-agent system — a Supervisor routes work across 4 specialist agents with live reasoning traces and sandboxed tool execution. Python · LangGraph · Gemini · Streamlit Code
rag-safety-gateway AI security gateway that scans RAG context for prompt injection, secrets, PII, and exfiltration risk, producing deterministic allow/redact/quarantine decisions. TypeScript · React · CI Live Demo · Code
agent-evals-lab Evaluation workbench for agent reliability — typed scoring engine, policy rules, regression detection, and a trace-inspection dashboard. TypeScript · React · CI Live Demo · Code
repo-pulse CLI that turns any Git repo into an engineering-health report — churn × complexity hotspot scoring you can paste into a review. Python Code
contract-watch CLI that diffs two OpenAPI contracts and flags breaking API changes before they reach clients. CI-friendly. TypeScript Code
ai-code-reviewer Structured AI code review from the terminal — severity-rated, line-specific feedback in pretty / JSON / Markdown. Python Code

Every project ships with tests, CI, and documentation — and the AI tooling runs without API keys so anyone can review it in under a minute.

repo-pulse generating a real engineering-health report
repo-pulse in action — a real, unedited run, no keys or config.


How I build

Typed contracts first   →  domain models before logic, so behavior is auditable
Deterministic by default →  scoring and decisions reproducible without a live model
Measurable, then pretty  →  evals and telemetry before dashboards
Reviewable in 60 seconds →  clone, run, understand — no API keys to start

Stack

Python · TypeScript · LangGraph · LangChain · React · Streamlit · Google Gemini · OpenAI · pytest · Vitest · GitHub Actions · uv


Open to conversations on AI agent engineering, evals, and LLM safety.

Pinned Loading

  1. ai-code-reviewer ai-code-reviewer Public

    AI-powered code review CLI tool — get structured feedback on any code file using GPT-4o

    Python

  2. agent-evals-lab agent-evals-lab Public

    AI agent evaluation workbench for reliability, safety, tool-use, latency, and cost

    TypeScript

  3. contract-watch contract-watch Public

    CLI that diffs two OpenAPI contracts and flags breaking API changes before they reach clients — CI-friendly

    TypeScript

  4. gemma4-multi-agent gemma4-multi-agent Public

    Production-ready multi-agent AI system — Supervisor + 4 specialist agents — Google Gemini, LangGraph & Streamlit

    Python

  5. rag-safety-gateway rag-safety-gateway Public

    AI security gateway for scanning RAG context for prompt injection, secrets, PII, and exfiltration risk

    TypeScript

  6. repo-pulse repo-pulse Public

    CLI that turns any Git repo into an engineering-health report — churn × complexity hotspot scoring

    Python