landing: demote Phalanx from marketing, lead with precision (backed by spiritwriter-core PR #58)#4
Merged
Conversation
Six Phalanx mentions removed (nav, manifesto body, primitive vi
header + tag, folio iv label, Bear-section body) per the cleanup
decisions in spiritwriter-core/docs/cleanup/cmc-phalanx-canonicalize.md.
"Phalanx" is the deep-dive system name (lives in
docs/entity-resolution.md); marketing surfaces use capability-shaped
language ("entity resolution", "the resolver").
Sidebar stat at line 1153 replaced. The >=85% recall framing was the
cmc-spec's full-pipeline target (with LLM clustering); CMC-Lite is the
deterministic subset and doesn't claim that number. New stat uses the
precision-first claim now backed by spiritwriter-core PR #58's
multi-corpus benchmark campaign: 100% auto-merge precision across 5
corpora, 0 false merges, 12/12 hand-curated collision pairs correctly
distinguished. Measured under hostile falsification battery, not
aspirational.
Added new homepage entry ix (Shingled extraction). The
batch-with-overlap pattern was previously bundled under Phalanx by
accident; it's a distinct primitive (text -> atoms vs. shards -> store
vs. records -> canonical) and giving it its own card on the page closes
the loop the other primitives describe.
Footer Specs section retains the "cmc-lite" link label (appropriate
audience for the spec name). No CSS / visual changes.
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
5 tasks
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Marketing-copy rewrite of
templates/landing.htmlto demote insider jargon ("Phalanx") from the homepage and replace the wishy-washy≥85% recallsidebar stat with the precision-first claim now backed by spiritwriter-core PR #58's measured campaign numbers.Driven by the decisions captured in spiritwriter-core's
docs/cleanup/cmc-phalanx-canonicalize.md(decisions A/B/C: Phalanx is a deep-dive system name, not marketing). Backed by the multi-corpus benchmark campaign indocs/benchmarks/runs-log.md.Changes
All 6 Phalanx mentions demoted to capability-shaped language (per cleanup doc §3 table):
PhalanxResolution(anchor unchanged)*Phalanx* — entity resolutionEntity *resolution*(matches header form of other primitives)cmc-lite · ess digest · tieredsqlite-backed · domain-agnostic · zero-infrastructurefolio iv — phalanxfolio iv — resolutionPrimitive vi body also extended with the competitive frame from the cleanup doc rewrite: "No graph database to operate, no embedding service to call — define your identifying fields, hand in records, get canonical IDs back."
Sidebar stat (line 1153) replaced with the precision-first claim now backed by measured numbers:
The
≥85%was the cmc-spec's target for the full CMC pipeline (with LLM clustering). CMC-Lite is the deterministic subset and doesn't claim that number. PR #58's falsification battery confirmed the precision invariant holds across 5 corpora and hostile collision pairs — that's the defensible headline.New homepage entry ix — Shingled extraction added per cleanup doc §2:
Rationale: shingled extraction was previously bundled under Phalanx by accident. It's a distinct primitive (text → atoms vs. shards → store vs. records → canonical) and it closes the loop the other primitives describe (now the reader knows how atoms get made).
What's NOT touched
cmc-litelink to the spec file — appropriate audience for that name (deep-dive readers), per cleanup doc decision A.Test plan
grep -n Phalanxreturns nothing)cmc-liteonly appears in the footer Specs link target (deep-dive territory, by design)≥85%removedgrep "<div class=\"num\">ix"finds the new primitive card)Cross-repo references
🤖 Generated with Claude Code