Harnessable

Harness Engineering is the practice of designing the operating environment for an AI agent, including context, tools, permissions, enforcement, verification, and observability.

This repository is the operational governance layer for autonomous agents doing high-stakes production work: four roles, a structured artifact chain, a state machine, a deployable enforcement layer, context-continuity hooks, and a recursive self-improvement loop that governs the framework by governing itself. Runtime-agnostic — reference implementation for Claude Code, Codex adapter included.

All framework concepts — roles, artifacts, enumerations, and their relationships — are defined in KNOWLEDGE_GRAPH.yaml.

Problem Statement

Prompt instructions are advisory unless backed by enforcement. Models are probabilistic, and long workflows compound errors.

A 95%-accurate model running a 20-step workflow succeeds 36% of the time (0.95²⁰ = 0.358). That figure assumes each step fails independently — which is optimistic. In practice, a wrong decision in step 3 corrupts the state that step 4 operates on, making downstream failures more likely, not equally likely. The real number is lower.

Reliability requires controls around the model, not only model selection or prompt design.

Agent = Model + Context + Tools + Enforcement + Verification + Observability

A model without operational controls is difficult to validate, audit, and recover. A constrained, verified, observable agent can be operated as part of any high-stakes workflow.

What This Is Not

Not a framework for one-shot prompts or chat assistants
Not a general AI application toolkit — it is specifically for teams operating autonomous agents in environments where actions have real consequences
Not model-specific — the reference implementation uses Claude Code hooks, but the protocols and enforcement architecture apply to any autonomous agent runtime that supports lifecycle hooks or pre-execution guards

The Harness Engineering Manifesto

In production environments where agent actions have real-world consequences, we treat AI agents as regulated systems rather than probabilistic experiments. We define this discipline as Harness Engineering.

The Core Principle

We operate on the fundamental equation:

Agent = Model + Harness

The Model is the black-box engine providing raw reasoning capability.
The Harness is the operational infrastructure — the environment that constrains, validates, and governs the model. The Problem Statement expands the Harness into its component layers: Context, Tools, Enforcement, Verification, and Observability.

From "Vibe Coding" to Systems Engineering

Traditional agent development often falls into the trap of "vibe coding" — repeatedly refining prompts in hopes of achieving deterministic results. Harness Engineering rejects this in favour of structural control:

Guides (Feed-Forward): Strict architectural constraints — capability allow-lists, rigid file-system boundaries, and context-injection protocols — define the agent's lane before it acts.
Sensors (Feedback): Post-action validation — automated testing, static analysis, and loop detection — enforces correctness after it acts. If an agent fails, we do not simply tweak the prompt; we harden the infrastructure.
Observability: Every action taken by the model must be audit-ready. The harness provides the logs and state persistence necessary to debug failures as structural gaps rather than random noise.

Why `harnessable`?

harnessable is the realisation of this manifesto. It provides the toolset to build, test, and enforce these structural boundaries, moving AI projects out of the experimental phase and into professional, production-ready software architecture.

The Framework

Four Roles

Roles are functional, not personal. One human or one agent session may perform multiple roles, but the active role must be explicit and role boundaries must be preserved.

Role	Responsibility	Produces
Architect	Define intent. Own the mandate. Review outcomes.	Design Mandate Task (DMT)
Engineer	Translate intent into an implementable plan.	Design Implementation Plan (DIP)
Coder	Execute the plan exactly as designed.	Task Implementation Report (TIR)
QA	Verify independently. Treat implementation claims as unverified until checked.	QA Verdict

No role approves its own work. The Coder cannot be the QA. The Engineer must not write code.

The Artifact Chain

Architect creates DMT
    │
    │  Problem statement, constraints, and acceptance criteria.
    ▼
Engineer authors DIP
    │
    │  Recon findings, architecture decisions, ordered steps,
    │  verification checklists, and containment plan.
    ▼
Coder implements + streams TIR
    │
    │  Completed work with evidence: verification output,
    │  deviations filed, and gates checked.
    ▼
QA verifies + issues verdict
    │
    │  Independently executed checks and verdict:
    │  PASS / CONDITIONAL_PASS / FAIL.
    ▼
Architect accepts → DONE

Artifacts are append-only after their stage closes. A closed mandate's DIP is immutable except for ## Post-Close Notes.

The State Machine

BACKLOG → MANDATED → IN_RECON → PLANNED → IN_PROGRESS → IN_REVIEW → VERIFIED → DONE
                                                              ↕
                                                           BLOCKED
                                                              ↕
                                                        NEEDS_REVISION

Every transition has a defined owner, a trigger condition, and invariants that must hold. Illegal jumps (e.g. PLANNED → IN_REVIEW with no implementation) are protocol violations that any agent must refuse.

Full transition table and invariants: framework/vendor/harnessable/references/state-machine.md

Harness Layers

User Request
      │
      ▼
Context Layer      ← AGENTS.md: project rules, allowed/blocked actions, completion gate
      │
      ▼
Tool Layer         ← controlled tools with schemas, validation, audit logging
      │
      ▼
Enforcement Layer  ← PreToolUse: five guards block before execution
      │                 bouncer.py          AGENTS.md ## Blocked policy
      │                 secrets_guard.py    credential reads and exfiltration
      │                 database_guard.py   DROP / TRUNCATE / WHERE-less DELETE
      │                 git_guard.py        force push / hard reset / branch destruction
      │                 communication_guard.py  email / Slack / SMS without approval
      │               PostToolUse: audit_logger.py records every tool call
      ▼
Execution
      │
      ▼
Verification Layer ← Stop: completion_gate.py (completion gate before turn completes)
      │               Independent QA, automated checks, evidence required
      ▼
Approved Output

Prompts live in the Context Layer. They are useful but not enforceable. Enforcement lives in hooks and gates that run regardless of what the model decides. The hooks/ directory makes the Enforcement Layer operational — copy it into your project and wire it via .claude/settings.json.

Guards in Action

The pre-built guards ship ready to use. Each shows the difference between advisory and enforced:

database_guard.py — intent detection, not keyword blocking

Agent proposes:  psql -c "DELETE FROM users"
Guard blocks:    DELETE without a WHERE clause would delete every row.
                 Add WHERE or have a human run it directly.

Agent proposes:  psql -c "DELETE FROM users WHERE last_login < '2020-01-01'"
Guard allows:    ✓

git_guard.py — history protection with a safe alternative

Agent proposes:  git push origin main --force
Guard blocks:    Rewrites shared remote history; destroys others' commits.
                 Use --force-with-lease or have a human approve.

Agent proposes:  git push origin main --force-with-lease
Guard allows:    ✓

communication_guard.py — the chief-of-staff case

Agent proposes:  curl -X POST https://api.sendgrid.com/v3/mail/send -d '{...}'
Guard blocks:    All outbound email must be reviewed and approved by a human
                 before sending. Draft the message and present it for review.

Agent proposes:  [presents draft to human for approval]
Human approves:  human runs the send command directly  ✓

In every case the agent receives a specific, actionable reason — not a generic failure — so it can propose a corrected approach immediately.

Repository Structure

harnessable/
│
├── framework/                     One-command install: cp -r framework/ docs/harness/
│   │
│   ├── agents/                    Tier 1 (copy and own) — role-specific agent protocols
│   │   ├── engineer.md            Recon passes, DIP authoring standards, sub-agent delegation
│   │   ├── coder.md               Build discipline, pre-completion hook runner, exit gate
│   │   └── qa.md                  Adversarial verification protocol, verdict criteria
│   │
│   ├── hooks/                     Tier 1 (copy and own) — Enforcement Layer
│   │   ├── run.py                 Universal dispatcher: discovers and runs *.py scripts per event
│   │   ├── pre_tool_use/          Scripts run on PreToolUse (add files here to extend)
│   │   │   ├── bouncer.py         Blocks commands matching AGENTS.md ## Blocked (policy-driven)
│   │   │   ├── secrets_guard.py   Hardcoded floor: blocks credential reads and exfiltration
│   │   │   ├── database_guard.py  Blocks DROP, TRUNCATE, and WHERE-less DELETE/UPDATE
│   │   │   ├── git_guard.py       Blocks force push, hard reset, branch and history destruction
│   │   │   └── communication_guard.py  Blocks unauthorized email, Slack, SMS, and calendar writes
│   │   ├── post_tool_use/         Scripts run on PostToolUse (add files here to extend)
│   │   │   └── audit_logger.py    Appends every tool call to .harnessable/logs/audit.YYYY-MM-DD.jsonl
│   │   ├── stop/                  Scripts run on Stop (add files here to extend)
│   │   │   └── completion_gate.py Runs AGENTS.md ## Completion Gate commands; blocks if any fail
│   │   └── claude_code_settings_template.json  Drop-in .claude/settings.json — all events wired through run.py
│   │
│   ├── templates/                 Tier 1 (copy and own)
│   │   └── dip.md                 Design Implementation Plan template (all required sections)
│   │
│   └── vendor/                    Tier 2 (pin and never modify)
│       └── harnessable/
│           ├── KNOWLEDGE_GRAPH.yaml    Framework concept graph — roles, artifacts, enumerations, relationships
│           ├── HARNESSABLE_VERSION     One line: the release tag or commit SHA pinned here
│           └── references/            Reference documents — do not modify
│               ├── roles.md           Full role definitions, permissions, prohibitions
│               ├── state-machine.md   Board status transitions and invariants
│               ├── error-modes.md     Classified failure patterns and expected responses
│               ├── continuous-improvement.md  Failure → RCA → harness improvement loop
│               ├── hooks.md           Hook lifecycle events, installation, and extension guide
│               └── knowledge-graph.md Knowledge graph model, vendoring instructions, and project extension guide
│
├── CHEAT_SHEET.md                 Condensed harness engineering reference
└── docs/                          Mandate history and implementation plans (not part of the install)

Key Concepts

Knowledge Graph

KNOWLEDGE_GRAPH.yaml is the authoritative semantic layer for the framework: it declares every concept in the harnessable namespace — roles, artifacts, enumerations, and their relationships — as a machine-readable graph that agents and guards reason against, not merely read. The framework graph is vendored unchanged under docs/harness/vendor/harnessable/; project-specific concepts extend it in a separate docs/knowledge-graph.yaml. When two platforms use the same label for different concepts, an alignment entry marks safe_assumption: false — an active instruction to any agent working across those platforms to treat the concepts as distinct regardless of shared labels.

The knowledge graph is also the pipeline's second output. Every mandate produces working software and an enriched graph — both are required for DONE. Every role is a scout: the Engineer amends the graph during recon, the Coder and QA file ONTOLOGY_GAP when they encounter undeclared concepts, and the Architect grounds mandate intent in the graph before the DMT is finalised and confirms enrichment before closure. A concept discovered during any stage that is not in the graph halts work until declared. ONTOLOGY_GAP resolutions and graph enrichment are also prerequisites for framework improvement — when the same gap class recurs across three mandates, it triggers a MetaMandate.

Full model and extension guide: framework/vendor/harnessable/references/knowledge-graph.md

Context Continuity

The PreCompact hook pair fires before every compaction event, preserving operational state before the context window is truncated. transcript_archive.py compresses the full session transcript and indexes it in .harness/transcripts/ — nothing is auto-purged; operators set the retention policy. mandate_snapshot.py writes .harness/compaction-handover.md, a structured document capturing board status, open discoveries, active role, and git state across all configured codebases. The next session loads this handover document and resumes with full operational context rather than starting from the compaction summary. Codebase paths are configured in .harness/config.json.

Recursive Self-Improvement

Every role performs a Framework Observation at the end of every session — unconditionally, not only when something fails. Observations are filed as HARNESS_IMPROVEMENT discoveries with structured fields: gap, stage, and proposal common to all roles, plus upstream_opportunity for QA and propagation for the Architect. PropagationDistance — the number of pipeline stages a gap traveled before detection — determines improvement priority. When the same gap_class appears in three or more ImprovementSignal entries, the Architect creates a MetaMandate: a mandate whose codebase is the framework itself, running through the full four-role pipeline. The framework improves itself by governing itself.

External Fact Verification

An agent's training data ends at a cutoff. Any claim about a third-party system's current state — package compatibility, API surface, deprecation status, SDK constraints — reflects the world as it was at training time, not as it is now. Harnessable treats this as an engineering constraint: Engineer Pass 4 requires all external dependencies to be verified against live-fetched sources at the installed version, with the URL and fetch date cited in Recon Findings. A DIP that asserts compatibility based on training knowledge alone is incomplete. tools/web_verify.py provides a single entry point for version resolution, URL fetch, and web search during recon.

Field Discoveries

When any acting agent finds something not anticipated in the mandate, they must stop and file a discovery before proceeding. Discoveries are classified:

Class	Meaning
`INFO`	Noted; no design change needed
`DEVIATION`	Design must be updated before proceeding
`BLOCKER`	Work cannot continue; Architect must review
`HARNESS_IMPROVEMENT`	A missing control was identified

Silent deviations, where implementation differs from the plan without being logged, are a protocol violation.

Containment Checklist

Every non-trivial implementation step in a DIP must answer four questions before the Coder touches it:

Detect — how will a failure surface?
Contain — what prevents it from cascading?
Recover — what is the rollback path?
Prevent recurrence — what check or policy would catch this class of failure earlier?

If a step has no answer for any of these, the DIP has a design gap.

Continuous Improvement

Each failure should be reviewed for missing or ineffective controls. The framework treats its own protocol files as a codebase: any agent may file a HARNESS_IMPROVEMENT discovery, which creates a child task and eventually flows through the same four-role pipeline as any other mandate.

Incident review should focus on the control gap, not only the model output.

Core Principles

System reliability is an engineering responsibility. Model access does not provide workflow reliability by itself.
Account for model error. Design for detection, containment, and recovery rather than assuming perfect behaviour.
Pair capability with controls. Model capability must be supported by validation, permissions, verification, and observability.
Require verification. Claims are not evidence. "It should work" is not acceptable. "I verified it works because [output]" is.
Treat failures as control gaps. Review incidents by asking what control was missing or ineffective.
External facts expire. An agent's training data is a snapshot fixed at its cutoff date. Any claim about a third-party system — package compatibility, API surface, deprecation status, rate limits, authentication schemes — must be verified against a live-fetched source at the installed version. Training knowledge tells you where to look; it does not tell you what is true now.

Getting Started

1. Set up your project tracker

Create a board or workflow in your project tracker of choice (GitHub Projects, Jira, Linear, Asana, or any tool that supports custom status columns) with these statuses:

BACKLOG · MANDATED · IN_RECON · PLANNED · IN_PROGRESS · IN_REVIEW · BLOCKED · NEEDS_REVISION · VERIFIED · DONE

If using GitHub Projects: all ten columns can be created in one gh CLI command rather than through the UI. Column names must exactly match the list above — a typo causes status transitions to fail silently. First fetch the project ID and the Status field ID:

gh api graphql -f query='
query($org: String!, $number: Int!) {
  organization(login: $org) {
    projectV2(number: $number) {
      id
      fields(first: 20) {
        nodes {
          ... on ProjectV2SingleSelectField { id name }
        }
      }
    }
  }
}' -F org=YOUR_ORG -F number=YOUR_PROJECT_NUMBER

Then set all ten options in one mutation (replace PROJECT_ID and FIELD_ID with the values returned above):

gh api graphql -f query='
mutation($projectId: ID!, $fieldId: ID!) {
  updateProjectV2Field(input: {
    projectId: $projectId
    fieldId: $fieldId
    singleSelectOptions: [
      {name: "BACKLOG",        color: GRAY},
      {name: "MANDATED",       color: BLUE},
      {name: "IN_RECON",       color: BLUE},
      {name: "PLANNED",        color: BLUE},
      {name: "IN_PROGRESS",    color: YELLOW},
      {name: "IN_REVIEW",      color: ORANGE},
      {name: "BLOCKED",        color: RED},
      {name: "NEEDS_REVISION", color: RED},
      {name: "VERIFIED",       color: GREEN},
      {name: "DONE",           color: GREEN}
    ]
  }) {
    projectV2Field {
      ... on ProjectV2SingleSelectField {
        options { id name color }
      }
    }
  }
}' -F projectId=PROJECT_ID -F fieldId=FIELD_ID

If the board already has a Status field with existing options, this mutation replaces all options; export existing item statuses first if any items are already assigned a value.

Declare the tool and integration method in your project's AGENTS.md under ## Project Tracker so every agent session knows how to read and update board state.

2. Copy the framework directory

All installable files are pre-structured under framework/. Copy the directory into your project:

cp -r framework/. path/to/your-project/docs/harness/

The directory is already organized. Tier 1 files (agents/, hooks/, templates/) are ready to customize. Tier 2 files (vendor/harnessable/) define the framework semantics — do not modify them.

The PreCompact hooks in framework/hooks/pre_compact/ require wiring in .claude/settings.json to activate — add them alongside the PreToolUse and Stop hooks. The settings template at framework/hooks/claude_code_settings_template.json already includes the PreCompact block. Configure .harness/config.json with your project's codebase paths if the agent works across multiple repositories.

Update docs/harness/vendor/harnessable/HARNESSABLE_VERSION with the release tag or commit SHA you copied from.

See framework/vendor/harnessable/references/knowledge-graph.md for the full extension model.

2a. Wire the enforcement hooks

Copy framework/hooks/claude_code_settings_template.json to .claude/settings.json at the root of your project (or merge it into an existing settings file). Update the base path if you placed framework/ somewhere other than docs/harness/.

This registers hooks/run.py as the dispatcher for three lifecycle events:

Event	Subdirectory	What runs
PreToolUse	`hooks/pre_tool_use/`	`bouncer.py`, `secrets_guard.py`, `database_guard.py`, `git_guard.py`, `communication_guard.py`
PostToolUse	`hooks/post_tool_use/`	`audit_logger.py`
Stop	`hooks/stop/`	`completion_gate.py`

Adding a new check later requires only dropping a .py file into the relevant subdirectory — no further changes to settings.json.

To verify the enforcement layer is live after wiring, run these three checks. Pipe each payload as a standalone command — do not embed them in a compound shell script. Guards inspect the outer command string: a compound script containing git push --force will be blocked by the bouncer on the test invocation itself, before the JSON payload is evaluated.

# 1. Safe command — must exit 0, no output
printf '{"tool_name":"Bash","tool_input":{"command":"echo ok"}}' \
  | python3 docs/harness/hooks/run.py pre_tool_use
echo "Exit: $?"

# 2. Force push guard — must exit 2 with a GitGuard message
printf '{"tool_name":"Bash","tool_input":{"command":"git push origin main --force"}}' \
  | python3 docs/harness/hooks/run.py pre_tool_use 2>&1
echo "Exit: $?"

# 3. WHERE-less DELETE guard — must exit 2 with a DatabaseGuard message
# Use printf '%s' so the argument is printed as-is; bare printf interprets \" as " and produces invalid JSON.
printf '%s' '{"tool_name":"Bash","tool_input":{"command":"psql -c \"DELETE FROM users\""}}' \
  | python3 docs/harness/hooks/run.py pre_tool_use 2>&1
echo "Exit: $?"

2b. Add `.harnessable/` to `.gitignore`

audit_logger.py begins writing to .harnessable/logs/audit.YYYY-MM-DD.jsonl on the first tool call after hooks are wired, rotated daily and compressed to .gz. Add the directory to .gitignore before your first git add:

# .gitignore
.harnessable/

Ignoring the directory (not just the log file) protects all runtime output the framework may write there. If a specific artifact later needs to be versioned, add a negation entry (!.harnessable/filename).

3. Load agent context at session start

At the start of each agent session, tell the agent which role it is playing and point it to the relevant files:

You are operating as the [Engineer | Coder | QA].

Role definition and permissions: docs/harness/vendor/harnessable/references/roles.md
State machine: docs/harness/vendor/harnessable/references/state-machine.md
Your protocol: docs/harness/agents/[engineer|coder|qa].md

4. Create your first DMT

The Architect creates a task in the project tracker with:

A clear problem statement
Measurable acceptance criteria
Explicit constraints and out-of-scope declarations

Set status to MANDATED. The Engineer may begin.

5. Run the workflow

Each role reads its protocol file before starting any work. No role begins without the preceding artifact existing and the board in the correct state. The agents/ files are the operating instructions; the references/ files are the rulebook.

Codex compatibility

Harnessable works with Codex through:

AGENTS.md at the repo root for persistent repository instructions, discovered automatically by Codex at session start
.agents/skills/harnessable/SKILL.md for progressive, task-specific protocol loading — invoke with "Use the harnessable skill." in any prompt
codex/ for install scripts and role prompt examples
Harnessable guards adapted as shell or Python checks where Codex lifecycle hooks are available

Install into your project:

bash codex/install.sh /path/to/your-project

Then invoke any role:

codex "Use the harnessable skill. Act as Engineer and produce a DIP for issue #12."

See codex/examples/ for complete role prompt templates.

Anti-Patterns

Anti-pattern	Replace with
Unlimited shell access	Controlled tools with schemas and permission checks
Prompt-only safety (`"never delete data"`)	Enforced hooks via `hooks/run.py` that block regardless of model intent
Self-verification	Independent QA that re-executes checks themselves
Huge agent contexts	Sub-agents with scoped tasks, summarised findings passed to parent
No audit trail	Structured TIR with real output evidence
Silent deviations	Filed field discoveries with original vs. actual

Engineering Model

This framework borrows practices from regulated engineering disciplines where failure review, independent verification, and change control are required. Comparable practices in civil and structural engineering include:

Work does not proceed without stamped drawings (DMT → DIP)
Field changes require documented RFIs (DEVIATION field discoveries)
Third-party inspection is independent of the implementing contractor (QA ≠ Coder)
Every failure produces a root cause analysis and a control improvement

Software teams running AI agents on production work need comparable controls for authorization, verification, deviation handling, and incident review.

There is a constraint specific to AI agents that has no direct analogue in traditional engineering: training data ends at a fixed point in time. A human engineer reaching for the documentation of a third-party library retrieves the current version without thinking about it. An AI agent retrieving the same documentation may silently fetch a prior version — the one it knew at training time — and reason from a compatibility table that is months or years out of date. This is not a flaw in the model; it is an epistemic property of how models are built. The engineering response is the same one applied to any known constraint: design for it explicitly. External facts must be verified live, at the installed version, with the source cited. The framework encodes this in Engineer Pass 4 and provides tooling to make live verification the path of least resistance rather than an extra step.

Influences & Acknowledgements

This framework did not emerge from a single source. It developed through practice building real systems with LLMs, accumulated reading across several fields, and iterative refinement over many sessions.

The intellectual traditions it draws on include:

Regulated engineering disciplines — civil and structural engineering practices around stamped drawings, field RFIs, third-party inspection, and mandatory root cause analysis after failure. These supplied the core analogy and much of the vocabulary.
Site Reliability Engineering and lean manufacturing — particularly the focus on error budgets, failure modes, containment over perfection, and the idea that reliability is a systemic property rather than a property of individual components.
AI safety and alignment research — especially work on corrigibility, human oversight, and the importance of maintaining meaningful human control over systems that can act autonomously.
Software engineering practice — decades of accumulated thinking on separation of concerns, audit trails, and the value of independent review.

Parts of this framework were developed in collaboration with Claude (Anthropic) through extended brainstorming and stress-testing sessions. The ideas were challenged, refined, and sometimes reversed through that process.

If you recognise a specific source that clearly influenced something here, contributions to this section are welcome — open an issue or a pull request.

Licence

AGPL-3.0

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
.agents/skills/harnessable		.agents/skills/harnessable
codex		codex
framework		framework
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CHEAT_SHEET.md		CHEAT_SHEET.md
LICENSE		LICENSE
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

Harnessable

Contents

Problem Statement

What This Is Not

The Harness Engineering Manifesto

The Core Principle

From "Vibe Coding" to Systems Engineering

Why harnessable?

The Framework

Four Roles

The Artifact Chain

The State Machine

Harness Layers

Guards in Action

Repository Structure

Key Concepts

Knowledge Graph

Context Continuity

Recursive Self-Improvement

External Fact Verification

Field Discoveries

Containment Checklist

Continuous Improvement

Core Principles

Getting Started

1. Set up your project tracker

2. Copy the framework directory

2a. Wire the enforcement hooks

2b. Add .harnessable/ to .gitignore

3. Load agent context at session start

4. Create your first DMT

5. Run the workflow

Codex compatibility

Anti-Patterns

Engineering Model

Influences & Acknowledgements

Licence

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Why `harnessable`?

2b. Add `.harnessable/` to `.gitignore`

Packages