advisor-strategy

A live benchmark comparing three model configurations on the same research query — streaming in parallel, with full cost, latency, and quality metrics.

Three agents running simultaneously. The center column shows an advisor call badge mid-stream — Sonnet escalating to Opus for a complex decision before continuing its research loop.

What it shows

Configuration	Cost	Quality	Latency	Notes
Sonnet solo	$0.17	7.8/10	88.8s	Baseline — full agentic loop, no advisor
Sonnet + Opus advisor	$0.83	8.5/10	187.5s	Sweet spot — Opus consulted 2× on hard decisions
Opus solo	$1.19	8.5/10	98.8s	Gold standard — full frontier cost

Sonnet + Advisor matched Opus solo quality (8.5/10) at about 70% of its cost ($0.83 vs. $1.19), though at roughly twice the latency.
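The cost comparison can be reproduced from the table figures. This is a minimal sketch; the `costs` object and `costRatio` helper are illustrative names and are not part of the repository.

```typescript
// Per-run costs in USD, copied from the results table above.
const costs = { sonnetSolo: 0.17, sonnetWithAdvisor: 0.83, opusSolo: 1.19 };

// Cost of one configuration expressed as a fraction of a reference configuration.
function costRatio(cost: number, reference: number): number {
  return cost / reference;
}

const vsOpus = costRatio(costs.sonnetWithAdvisor, costs.opusSolo);
const vsSonnet = costRatio(costs.sonnetWithAdvisor, costs.sonnetSolo);

console.log(`${(vsOpus * 100).toFixed(0)}% of the Opus solo cost`);
console.log(`${vsSonnet.toFixed(1)}x the Sonnet solo cost`);
```

The second ratio is worth keeping in view: the advisor configuration is cheaper than Opus solo, but it still costs several times more than the Sonnet baseline.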

The Advisor Strategy

The Advisor Strategy is a multi-model orchestration pattern where a capable executor model (Sonnet) drives the full agentic loop, but escalates to a more powerful model (Opus) via a dedicated tool call — only for decisions that actually warrant it.

{
  "type": "advisor_20260301",
  "name": "advisor",
  "model": "claude-opus-4-6",
  "max_uses": 5
}

Required beta header: anthropic-beta: advisor-tool-2026-03-01
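As a rough sketch of how the tool definition and beta header fit into a raw `fetch` call, the helper below only builds the request and does not send it. The endpoint, `x-api-key`, and `anthropic-version` are the standard Messages API values. The helper name `buildAdvisorRequest`, the `max_tokens` value, `stream: true`, and placing the advisor entry in the `tools` array are assumptions, and the executor model ID is left to the caller because this README does not state one.

```typescript
// Advisor tool entry, copied from the JSON above.
interface AdvisorTool {
  type: "advisor_20260301";
  name: "advisor";
  model: string;
  max_uses: number;
}

interface AdvisorRequest {
  url: string;
  init: { method: "POST"; headers: Record<string, string>; body: string };
}

// Builds the arguments for fetch() without performing any network I/O.
// executorModel is supplied by the caller (hypothetical parameter).
function buildAdvisorRequest(
  apiKey: string,
  executorModel: string,
  prompt: string
): AdvisorRequest {
  const advisor: AdvisorTool = {
    type: "advisor_20260301",
    name: "advisor",
    model: "claude-opus-4-6",
    max_uses: 5,
  };
  return {
    url: "https://api.anthropic.com/v1/messages",
    init: {
      method: "POST",
      headers: {
        "content-type": "application/json",
        "x-api-key": apiKey,
        "anthropic-version": "2023-06-01",
        "anthropic-beta": "advisor-tool-2026-03-01", // required beta header
      },
      body: JSON.stringify({
        model: executorModel,
        max_tokens: 4096, // assumed value
        stream: true, // assumed: the demo streams agent output
        tools: [advisor], // assumed placement alongside other tools
        messages: [{ role: "user", content: prompt }],
      }),
    },
  };
}
```

A caller would then pass the result to `fetch(request.url, request.init)` and read the streamed response body.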

The executor stays in control, and the advisor provides targeted judgment only where it changes the result. Advisor calls surface in the stream as server_tool_use blocks. Their token cost is reported in message_delta.usage.iterations[], in the entries where type === "advisor_message", split into input and output tokens so they can be billed accurately at Opus rates ($15 per million input tokens, $75 per million output tokens).
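The billing rule described above could be sketched as follows. The `"advisor_message"` discriminator and the $15/$75 rates come from the text; the `input_tokens` and `output_tokens` field names and the `advisorCostUsd` helper are assumptions and may not match the actual payload or the code in `lib/metrics.ts`.

```typescript
// One entry of message_delta.usage.iterations[].
// Token field names are assumed, not confirmed by the source.
interface UsageIteration {
  type: string;
  input_tokens: number;
  output_tokens: number;
}

// Opus rates quoted above, in USD per million tokens.
const OPUS_INPUT_USD_PER_MTOK = 15;
const OPUS_OUTPUT_USD_PER_MTOK = 75;

// Sums only the advisor iterations and prices them at Opus rates.
function advisorCostUsd(iterations: UsageIteration[]): number {
  let inputTokens = 0;
  let outputTokens = 0;
  for (const iteration of iterations) {
    if (iteration.type !== "advisor_message") continue; // executor tokens are billed separately
    inputTokens += iteration.input_tokens;
    outputTokens += iteration.output_tokens;
  }
  return (
    (inputTokens * OPUS_INPUT_USD_PER_MTOK +
      outputTokens * OPUS_OUTPUT_USD_PER_MTOK) / 1e6
  );
}
```

Keeping input and output tokens separate matters because, at these rates, an output token costs five times as much as an input token.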

Full results

The bottom panel shows per-dimension quality scores (source depth, reasoning, completeness, accuracy) judged by a separate Opus call after all three runs complete. The summary bar compares cost side-by-side.
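One possible way to represent the judge output in TypeScript is shown below. The four dimension names come from the paragraph above. Treating the overall score as an unweighted mean rounded to one decimal is purely an assumption for illustration; the README does not say how the headline score is derived.

```typescript
// The four dimensions the judge scores, each on a 0-10 scale.
type Dimension = "sourceDepth" | "reasoning" | "completeness" | "accuracy";
type DimensionScores = Record<Dimension, number>;

const DIMENSIONS: Dimension[] = [
  "sourceDepth",
  "reasoning",
  "completeness",
  "accuracy",
];

// Assumed aggregation: unweighted mean, rounded to one decimal place.
function overallScore(scores: DimensionScores): number {
  const total = DIMENSIONS.reduce((sum, dim) => sum + scores[dim], 0);
  return Math.round((total / DIMENSIONS.length) * 10) / 10;
}
```

If the judge weights some dimensions more heavily, the reducer is the only part that would need to change.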

Stack

Next.js 15 — App Router, Server Components, route handlers
TypeScript — end to end
Tailwind CSS v4 — custom design tokens via @theme
Anthropic SDK — baseline and Opus agents via @anthropic-ai/sdk
Raw fetch — advisor agent (SDK doesn't yet expose the advisor tool natively)
Brave Search API — web search tool execution
SSE — three parallel streaming agent runs to the client
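For readers unfamiliar with SSE framing, here is a minimal sketch of how a route handler might encode events and how the client might decode them from a chunked stream. The `AgentEvent` shape and both function names are hypothetical; the repository's actual event format is not shown in this README. The decoder only handles the single-line `data: ` frames that `encodeSse` produces, not the full SSE grammar.

```typescript
// Hypothetical event payload sent by each agent route.
interface AgentEvent {
  agent: "baseline" | "advisor" | "opus";
  kind: string;
  text: string;
}

// Server side: one SSE frame is a "data: " line followed by a blank line.
function encodeSse(event: AgentEvent): string {
  return `data: ${JSON.stringify(event)}\n\n`;
}

// Client side: split the buffer into complete frames and keep any
// trailing partial frame so it can be prepended to the next chunk.
function decodeSse(buffer: string): { events: AgentEvent[]; rest: string } {
  const frames = buffer.split("\n\n");
  const rest = frames.pop() ?? "";
  const events: AgentEvent[] = [];
  for (const frame of frames) {
    for (const line of frame.split("\n")) {
      if (line.startsWith("data: ")) {
        events.push(JSON.parse(line.slice("data: ".length)) as AgentEvent);
      }
    }
  }
  return { events, rest };
}
```

Because network chunks can end mid-frame, the caller should carry `rest` forward and concatenate it with the next chunk before decoding again.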

Project structure

app/
  page.tsx                   # Main UI — query input, three-column grid, quality chart
  api/
    research/
      baseline/route.ts      # Sonnet solo agent — SSE stream
      advisor/route.ts       # Sonnet + Opus advisor agent — SSE stream
      opus/route.ts          # Opus solo agent — SSE stream
    judge/route.ts           # Quality scoring — Opus as judge
components/
  ComparisonGrid.tsx          # Three-column layout
  AgentColumn.tsx             # Per-agent streaming output + metrics
  MetricsCard.tsx             # Cost / tokens / latency display
  QualityChart.tsx            # Dimension breakdown bar chart
lib/
  agents/
    baseline-agent.ts         # Sonnet agentic loop (SDK streaming)
    advisor-agent.ts          # Sonnet + advisor loop (raw fetch, beta header)
    opus-agent.ts             # Opus agentic loop (SDK streaming)
    shared.ts                 # System prompts, tool definitions, web search/fetch
  metrics.ts                  # Pricing constants, cost calculation, formatters
  types.ts                    # Shared TypeScript types

Setup

git clone https://github.com/popand/advisor-strategy
cd advisor-strategy
npm install

Create .env.local:

ANTHROPIC_API_KEY=sk-ant-...
BRAVE_API_KEY=...           # optional — falls back to placeholder results

npm run dev

Open http://localhost:3000, enter a research query, and click Run Comparison.

Notes

The advisor feature requires beta access: anthropic-beta: advisor-tool-2026-03-01
All three agents run in parallel, so the full comparison takes as long as the slowest agent. Expect a few minutes depending on query complexity; in the run reported above, the advisor configuration took 187.5s
Web search falls back to a placeholder if BRAVE_API_KEY is not set; agents still run using training knowledge
Quality scores are judged by a separate Opus call after all three runs complete — expect some variance across runs
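The web search fallback mentioned in the notes above might look roughly like this. The Brave endpoint and the `X-Subscription-Token` header are the public Brave Search API ones; the `searchWeb` name, the `SearchResult` shape, and the placeholder wording are assumptions and may differ from `lib/agents/shared.ts`. The key is passed in as an argument so the sketch does not depend on how the environment is read.

```typescript
interface SearchResult {
  title: string;
  url: string;
  description: string;
}

// Returns live Brave results when a key is provided, otherwise a
// single placeholder entry so the agent loop can still proceed.
async function searchWeb(
  query: string,
  braveApiKey?: string
): Promise<SearchResult[]> {
  if (!braveApiKey) {
    return [
      {
        title: "Placeholder result",
        url: "",
        description: `Web search is disabled (BRAVE_API_KEY not set); no live results for "${query}".`,
      },
    ];
  }
  const url =
    "https://api.search.brave.com/res/v1/web/search?q=" +
    encodeURIComponent(query);
  const res = await fetch(url, {
    headers: {
      Accept: "application/json",
      "X-Subscription-Token": braveApiKey,
    },
  });
  if (!res.ok) {
    throw new Error(`Brave Search request failed with status ${res.status}`);
  }
  // Only the fields used here are typed; the real response carries more.
  const data = (await res.json()) as { web?: { results?: SearchResult[] } };
  return (data.web?.results ?? []).map((r) => ({
    title: r.title,
    url: r.url,
    description: r.description,
  }));
}
```

Returning a clearly labeled placeholder, instead of an empty list or an error, lets the model see that search was unavailable and fall back to its training knowledge.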

References

Built by Andrei Pop · Principal Engineer, Alethia

Alethia Prism is the intelligence layer that identifies what is forming across systems, context, and time — so organizations can act before outcomes harden.
