Bottega is a multi-agent orchestration tool for engineering teams who want an "AI sandwich" approach: Human at the start & the end, AI in the middle.
Bottega ships as a specification plus a working reference implementation.
It runs as a web UI, is team-first (multi-user), and remote-first (you can run it locally, but it's designed to run on a server).
The tool aims to replicate a typical dev workflow: plan, implement, run the QA scenarios that define the goal, loop until that goal is met (the Ralph Wiggum loop), respond to PR comments on GitHub, etc.
It supports Claude Code, Codex, and OpenCode so you can mix and match models on the same task. For example, use Opus for planning, Sonnet for implementation, Codex for code review, and an open-source model to manage the PR (monitor the CI, fix conflicts), etc.
See the demo:
demo.mp4
Want to see how we use it in practice? We share our experience after shipping our 1000th user story with this tool on a production codebase (a large Rails monolith) launch post
Bottega replicates a normal dev workflow as a loop of specialised agents, with a human at both ends.
flowchart TD
H1["Human — define the task"]:::human
H1 --> PLAN["Planning agent<br/>fills the plan template"]
PLAN --> APPROVE["Human — review & approve the plan"]:::human
APPROVE --> IMPL["Implementation agent"]
IMPL --> REVIEW["Adversarial code-review agent — reviews the code, runs the test scenarios, and verifies the goal"]
REVIEW -->|"incomplete, goal not met, or errors"| IMPL
REVIEW -->|"matches the plan, goal met"| PR["Open the PR, keep CI green"]
PR --> H2["Human — review the PR on GitHub"]:::human
H2 -->|"leaves comments"| HOOK["GitHub webhook"]
HOOK --> FEEDBACK["PR-feedback agent<br/>addresses comments, keeps CI green"]
H2 -->|"approves"| MERGE["Merge"]:::done
classDef human fill:#fff1d6,stroke:#e0962f,stroke-width:1px,color:#5c3d0a;
classDef done fill:#e7f6ec,stroke:#3f9d63,stroke-width:1px,color:#143524;
The specification is the product. This project ships specs so you can build your own version of the tool.
A complete, working implementation is provided in reference/.
Every team will probably want a different flavor: point your preferred coding agent at this repository, explain your requirements, and build your own version.
SPEC.mdis the entry point: a short master index.core/specifies the universal orchestration engine — theplanning → (implementation ⇄ review loop) → pull request.extra/specifies opinionated, optional features (the Kanban board, different harness integrations, multi-user auth, …). Pick the ones you want; skip the rest.reference/is one complete, working implementation. It's the implementation we use daily internally, with our own set of preferences. It's a citation for the spec, not the canonical version.
Read the spec for what to build and why; read reference/ for one
way it was solved. Where the two ever disagree, the spec wins.
Most of the value we get out of this tool is easy to implement. The core is a tight orchestration loop and nothing more. Different teams will want different things: support for Gemini, adding a compliance agent in the loop, tasks sourced from Jira or Notion instead of a kanban board. Fork it and build your own.
Contributions are most welcome in the form of additional specs — see "How to Contribute".
- Clone or fork this repo.
- Point a coding agent at
SPEC.md: "Read SPEC.md and build this." - Implement all of
core/for a minimal working tool; add whicheverextra/features fit your team. - Swap the opinionated parts for your own — replace the board with a Jira sync, add a harness the reference doesn't ship, change the agent prompts.
You don't have to use the reference's stack (TypeScript / React / Express / SQLite). The spec describes behavior; build it however you like.
The reference app lives in reference/, and all commands below run
from there.
- React 18 + Vite + Tailwind (frontend)
- Node.js + Express + WebSocket (backend)
- SQLite (better-sqlite3) for metadata and message storage
- Node.js 18+ (tested on Node 20 and 22)
- pnpm 11 — install with
npm install -g pnpm@11 - At least one agent runtime — install whichever provider(s) you plan to
use:
- Claude Code
- Codex
- OpenCode
Notes
- Each provider's SDK is a thin wrapper around its CLI, which Bottega spawns as a subprocess — so the runtime must be installed on the host.
- You don't need to pre-authenticate Claude Code or Codex; Bottega's built-in OAuth flow handles login on first use.
git clone https://github.com/vdaubry/bottega.git
cd bottega/reference # the reference implementation lives here
npm install -g pnpm@11 # skip if pnpm 11 is already installed
pnpm install
cp .env.example .envOpen .env and set JWT_SECRET to a random secret:
openssl rand -hex 64 # paste the output as JWT_SECRET in .envThen run the interactive setup and start the server:
pnpm onboarding # creates your admin account and seeds a sample project
pnpm dev- Frontend: http://localhost:5173
- Backend: http://localhost:3001
Before your first chat or agent run, Bottega shows a blocking Connect a provider modal — you must connect at least one of Claude Code, Codex, or OpenCode. Credentials are stored per user, so each teammate connects their own Claude/Codex/OpenCode account. Each provider authenticates differently:
- Claude Code / Codex — click Authenticate, open the generated URL in your browser, authorize, then paste the returned code back into the app. No API key needed; it uses your subscription via OAuth.
- OpenCode — paste a Zen API key (from opencode.ai/zen) into the panel.
pnpm onboarding runs in two steps and is idempotent — re-running it is safe and
skips completed steps:
- Create an admin account. Prompts for username and password, stored as a bcrypt hash. The first user is automatically granted admin.
- Seed a sample project. Copies
examples/landing-page/to~/bottega-examples/landing-page/, initializes a git repo there, and adds it to your dashboard with one pending task. This gives you something concrete to point your agent at on your first conversation.
If you'd rather start from your own repo, skip pnpm onboarding, run pnpm dev,
register an admin via the web UI, and add a project pointing at any git repo on
your machine.
Bottega is built to be forked. The most valuable thing you can do is build your own version, and we'd love to hear about what you make — come say hello in Discussions.
Contributions back to this repository are welcome, with one guideline that keeps the project small:
- Spec changes — encouraged. New or improved spec files are the
contributions we most want: clarifications, new
extra/features, better explanations, or fixes to thecore/docs. Open a PR that touchesSPEC.md,core/, orextra/. - Reference implementation — bug fixes only. The reference is the tool we chose to use; you're welcome to use it if you share our preferences. It's not intended to accumulate features and become a bloated, one-size-fits-all tool.
We'll review changes that correct a defect or bring the reference back in line with the spec.
New features can be proposed as a new
extra/spec here rather than as reference code.
In short: grow the spec, fix the reference, fork for everything else.
Questions, ideas, feedback, or want to share what you built? GitHub Discussions is the place — you don't need a bug or a technical problem to start a thread, just say hi.
Please keep Issues for bug reports and spec PRs (see How to Contribute); everything else belongs in Discussions.
As we were working on this, a bunch of orchestration tools emerged. Variants of the same workflow we were converging on.
- Conductor, by Melty Labs (YC S24).
- GasTown, by Steve Yegge.
- gstack, by Garry Tan.
- SpecKit, by GitHub.
- Symphony, by OpenAI.
There is a lot of overlap with what we built. For us, this is a huge confirmation that we were on the right path.
Where Bottega differs:
Remote-first and multi-player. While you can run it on your laptop, Bottega is remote-first by design — we run it on a shared dev box. It supports multiple concurrent users out of the box. Side benefit 1: sandboxing autonomous agents on a remote server was easier for us than sandboxing them on each laptop. Side benefit 2: a lot of non-technical people use it internally. Side benefit 3: because it's always-on, reacting to GitHub is trivial: a PR review posted on GitHub fires a webhook at the box and kicks off a fresh agent run to address the comments. No laptop needs to be awake.
Multi-harness. Bottega drives Claude Code, Codex, and OpenCode behind one interface, so you can assign a different model to each role on the same task.
Minimalist UX. The core ideas are super simple: we are just recreating the typical web developer workflow. And we wanted the tool to reflect that simplicity. Side benefit: easy to onboard the whole product team.
MIT — see LICENSE.