Foreman

Or: How to Get Eight Artificial Minds to Build Something Without Arguing About It Forever

It is a well-established fact that a single AI coding agent, left to its own devices, will produce code that works. It will also produce code that is structured in a way that makes perfect sense to it and absolutely no sense to anyone who has to maintain it six months later, including, somewhat ironically, itself.

It is a less well-established but equally true fact that if you give two AI coding agents the ability to talk to each other, they will immediately begin disagreeing about architecture and never stop.

Foreman solves this by doing something remarkably similar to what humans have done on construction sites for thousands of years: putting one person in charge and giving everyone else a job title that sounds important but mostly just tells them to stay in their lane.

What Is This

Foreman is a skill for Claude Code that turns multiple Claude Code sessions into a collaborative coding team. It is built on top of Claude Relay, which handles the part where the agents actually talk to each other. Foreman handles the considerably more difficult part where they talk to each other productively.

You give the Orchestrator a goal. It hands the goal to the Architect, who reads the codebase and writes a concrete plan. The Dissenter stress-tests the plan. Workers build in isolated git worktrees. An Inspector audits the result before anything is committed. A Cleaner tidies up throughout. A Circuit Breaker watches for the inevitable moment when two agents start going in circles, and politely but firmly tells them to stop.

And then there is the Muse, who runs on an entirely different model, does not appear to do any actual work, and yet somehow makes everyone else better at theirs. Every job site has one.

The Crew

Role	Model	What They Do	What They Emphatically Do Not Do
Orchestrator	Claude Opus 4.6	Approves plans, delegates, tracks, reports	Write code, ever, under any circumstances
Architect	Qwen3.5 (Ollama)	Reads codebase, writes `CURRENT_PLAN.md`	Touch the repo during the build
Dissenter	Gemini 3.1 Pro	Challenges plans (First Principles first) and results	Touch the filesystem or look at actual code
Inspector	gpt-5.3-codex (Codex CLI, high reasoning)	Full audit: correctness, security, plan conformance	Rubber-stamp anything
Worker	Claude Sonnet	Builds in isolated git worktrees	Argue about architecture (that ship has sailed)
Cleaner	Claude Haiku	Linting, formatting, dead code removal	Modify application logic
Circuit Breaker	Claude Haiku	Detects and resolves conversational loops	Take sides until forced to
Muse	Gemma 4 (Ollama)	Reframes problems sideways	Anything resembling real work

Prerequisites

You will need:

Claude Code (2.1.80 or later, though frankly the version number is changing so fast that by the time you read this sentence it may already be wrong)
Claude Relay installed as a plugin
A GEMINI_API_KEY environment variable set (for the Dissenter — get one at aistudio.google.com)
Ollama with qwen3.5 and gemma4 pulled (for the Architect and Muse — optional but recommended)
Codex desktop app with an OpenAI API key (for the Inspector)

Quick Start

The quickest way to understand Foreman is to watch it work.

Step 1. Install Claude Relay if you haven't:

# From any Claude Code session
/plugin marketplace add innestic/claude-relay
/plugin install relay@claude-relay

Step 2. Install the Foreman skill. (Place the foreman/ directory in your Claude Code skills path.)

Step 3. Open Claude Code in your project directory with the Relay channel flag:

claude --dangerously-load-development-channels plugin:relay@claude-relay

Step 4. Say something like:

"Spin up Foreman. Build me a REST API for user authentication with JWT tokens, bcrypt password hashing, and refresh token rotation."

Step 5. Watch in mild astonishment as terminal windows begin appearing, agents begin talking to each other, and code begins materializing in your project directory as if by magic, except that it is not magic, it is just several language models being very organized about it.

How It Works

The workflow is, in principle, simple. In practice it is also simple, which is what makes it work.

You give the Orchestrator a goal. This is the only agent you talk to. Chain of command exists for a reason.
The Architect writes the plan. It reads your codebase (read-only), then writes a concrete, phased CURRENT_PLAN.md. No plan comes from thin air.
The Dissenter reviews the plan. Before a single line of code is written, the Dissenter stress-tests the reasoning — First Principles first, then approach. Not the code. The reasoning. This is an important distinction that most review processes get wrong.
Workers build. Each Worker gets an isolated git worktree. They make their own implementation decisions without checking in on every variable name. They are, after all, competent.
The Cleaner cleans. Continuously. Like the tide, but for dead code.
The Architect checks conformance. When Workers complete, the Architect verifies the implementation matches CURRENT_PLAN.md.
The Inspector audits. The Inspector (gpt-5.3-codex, high reasoning) reads everything — the plan, all changed files, affected existing code. A BLOCK finding halts the commit. Nothing bypasses the Inspector without an explicit override recorded in DECISIONS.md.
The Cleaner does a final sweep. After Inspector clearance: lint, dead code, imports, formatting.
The Dissenter reviews the results. A second pass after the work is done, before anything is committed.
The Orchestrator approves. You get your code.

The Circuit Breaker watches all of this passively and intervenes only when two agents have gone back and forth three times on the same point without progress. At four round-trips, it forces a decision. Unless the Orchestrator is one of the looping parties, in which case it escalates to you, because even on a construction site, sometimes the foreman needs the owner to make a call.

The Muse, if present, sits off to the side and offers a completely different perspective when asked. It runs Gemma 4, not Claude, which means it literally thinks differently. This is not a metaphor. The weights are different. The latent space is different. It will say things none of the Claude agents would think of, and occasionally those things will be exactly what was needed.

The Bootstrap Script

The Orchestrator spawns crew members using scripts/foreman-bootstrap.sh. Each invocation opens a new terminal session with the correct model, role instructions, and Relay connection.

# The Orchestrator handles this automatically, but if you are curious:
./scripts/foreman-bootstrap.sh orchestrator
./scripts/foreman-bootstrap.sh architect
./scripts/foreman-bootstrap.sh dissenter
./scripts/foreman-bootstrap.sh inspector
./scripts/foreman-bootstrap.sh worker 1
./scripts/foreman-bootstrap.sh worker 2
./scripts/foreman-bootstrap.sh cleaner
./scripts/foreman-bootstrap.sh circuit-breaker
./scripts/foreman-bootstrap.sh muse

Worker sessions create isolated git worktrees under /tmp. After a session completes, run git worktree prune to clean up any leftover branches.

File Structure

foreman/
├── SKILL.md                          # Main skill trigger and protocol
├── scripts/
│   ├── foreman-bootstrap.sh          # Spawns crew sessions
│   ├── foreman-architect-bridge.py   # Architect bridge (Qwen3.5 via Ollama)
│   ├── foreman-dissenter-bridge.py   # Dissenter bridge (Gemini)
│   └── foreman-muse-bridge.py        # Muse bridge (Gemma 4 via Ollama)
└── references/
    ├── protocol.md                   # Shared communication norms (all agents)
    ├── relay-setup.md                # Relay installation guide
    └── roles/
        ├── orchestrator.md           # The foreman
        ├── architect.md              # The planner
        ├── dissenter.md              # The professional skeptic
        ├── inspector.md              # The auditor
        ├── worker.md                 # The builders
        ├── cleaner.md                # The invisible hand of lint
        ├── circuit-breaker.md        # The conversation referee
        └── muse.md                   # The one making coffee

Philosophy

The central insight of Foreman is not that AI agents can talk to each other. Claude Relay already proved that. The insight is that talking is not the same as collaborating, and collaboration requires structure: clear roles, a chain of command, defined communication norms, and someone whose job it is to say "actually, have you considered that you might be building the wrong thing?"

Most multi-agent coding setups are either a pipeline (agent A generates, agent B reviews, repeat until heat death) or a free-for-all (everyone talks to everyone and nothing gets decided). Foreman is neither. It is a job site. There is a foreman. There are workers. There is a plan. There is someone whose literal job is to disagree with the plan before anyone picks up a hammer.

And there is someone making coffee.

This may seem like a small thing, but Douglas Adams once noted that the problem with the future is that it keeps turning into the present. The same is true of software architecture. The Muse exists because sometimes the most valuable contribution is not a better algorithm but the observation that you are solving the wrong problem.

v1 Limitations

In the spirit of honesty, which is a trait undervalued in README files:

Single repo only. All agents work in the same project directory. Cross-repo coordination is a v2 problem.
No persistence. When sessions close, the crew is gone. Each job is a fresh start.
Same host only. Relay uses Unix sockets. Your agents all live on one machine.
The bootstrap script may need tweaking. CLI flags for Claude Code and Ollama evolve quickly. If a session fails to spawn, check the launch command first.

Credits

Foreman is built on Claude Relay by Innestic. Without Relay, these agents would be very organized and completely unable to speak to each other, which, come to think of it, describes most software teams already.

License

MIT. Do with it what you will. If you build something wonderful with it, that is its own reward. If you build something terrible, we would prefer not to know, but we acknowledge your right to do so.

"The major difference between a thing that might go wrong and a thing that cannot possibly go wrong is that when a thing that cannot possibly go wrong goes wrong it usually turns out to be impossible to get at or repair."

Keep your agents talking. Keep your Dissenter dissenting. Keep your Muse caffeinated.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
references		references
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
SKILL.md		SKILL.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Foreman

Or: How to Get Eight Artificial Minds to Build Something Without Arguing About It Forever

What Is This

The Crew

Prerequisites

Quick Start

How It Works

The Bootstrap Script

File Structure

Philosophy

v1 Limitations

Credits

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Foreman

Or: How to Get Eight Artificial Minds to Build Something Without Arguing About It Forever

What Is This

The Crew

Prerequisites

Quick Start

How It Works

The Bootstrap Script

File Structure

Philosophy

v1 Limitations

Credits

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages