Skip to content

docs(benchmark): token-economy receipt + Phase 1 update#12

Merged
postmunnet merged 1 commit into
mainfrom
docs/token-economy-benchmark
May 19, 2026
Merged

docs(benchmark): token-economy receipt + Phase 1 update#12
postmunnet merged 1 commit into
mainfrom
docs/token-economy-benchmark

Conversation

@postmunnet

Copy link
Copy Markdown
Owner

Adds the canonical token-economy benchmark comparing three paths for "open + read a GitHub README":

curl raw README 1 round-trip, 0 schema tax
browser-cli 1 compound, 0 schema tax
MCP chrome 3 round-trips, +schema tax (deferred mode)

v1 (2026-05-18) baseline:
curl 1,821 tokens · browser-cli 1,828 · MCP 2,560 (+41%)

Phase 1 update (2026-05-19) — same-day re-measurement after a whitespace normalizer landed in the browser-cli sibling repo (lib/commands/read.js):
curl 2,189 (README content grew between days)
browser-cli 1,896 (now wins curl by 13.4%, junk-line ratio
50% → 19%)

Framed as an architectural moat — "Why CLI Agents Have a 50-Year Head Start" — not a token-shaving optimization. The moat is the load-once command vocabulary of CLI tools vs MCP's per-tool schema charge per turn.

Companion doc docs/contracts/browser-cli/USE_CASE_ROUTING.md formalizes when each tool fits (content-addressable → curl; auth/JS/provenance/policy → browser-cli) so readers don't reduce the comparison to "just use curl."

Raw I/O captured byte-for-byte under raw/ for reproducibility. Token proxy: chars / 3.8 (documented in count.py).

Adds the canonical token-economy benchmark comparing three paths for
"open + read a GitHub README":

  curl raw README   1 round-trip,  0 schema tax
  browser-cli       1 compound,    0 schema tax
  MCP chrome        3 round-trips, +schema tax (deferred mode)

v1 (2026-05-18) baseline:
  curl 1,821 tokens · browser-cli 1,828 · MCP 2,560 (+41%)

Phase 1 update (2026-05-19) — same-day re-measurement after a
whitespace normalizer landed in the browser-cli sibling repo
(lib/commands/read.js):
  curl 2,189 (README content grew between days)
  browser-cli 1,896 (now wins curl by 13.4%, junk-line ratio
                     50% → 19%)

Framed as an architectural moat — "Why CLI Agents Have a 50-Year
Head Start" — not a token-shaving optimization. The moat is the
load-once command vocabulary of CLI tools vs MCP's per-tool schema
charge per turn.

Companion doc docs/contracts/browser-cli/USE_CASE_ROUTING.md
formalizes when each tool fits (content-addressable → curl;
auth/JS/provenance/policy → browser-cli) so readers don't
reduce the comparison to "just use curl."

Raw I/O captured byte-for-byte under raw/ for reproducibility.
Token proxy: chars / 3.8 (documented in count.py).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
@postmunnet postmunnet merged commit b3d98e8 into main May 19, 2026
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant