decode-concept-page#210
Conversation
…ncept discovery surface]
…ing stages, and sampling]
…ode concept-page slice]
|
Completed story decode-concept-page-004. Added a focused canonical-page contract test for /docs/concepts/decode that asserts the route, the concept.decode registry record, and the default English colocated page bundle resolve together via localDocsRoute(...) plus validateColocatedPageBundle(...). Verified locally with make typecheck, make lint, bun run pretest, and the focused decode and serving-path discovery test sweep. PR CI is green. |
|
BLOCKING review summary for PR #210 ( Quality checks
Project acceptance criteria
Behavioral assertion check
Standards review
Review rules application
Required fix
|
…concept discovery surface]
|
Addressed the blocking architecture-surface feedback on head Changes made:
Validation:
This addresses the blocking review requirement that |
…concept discovery surface]
|
Addressed the CI mergeability failure on head What changed:
Why this was needed:
Validation:
This is mergeability-only follow-up on the current PR head; no PRD story scope changed. |
{
"project": "Model Atlas — Decode Canonical Concept Page",
"branchName": "decode-concept-page",
"description": "Publish one canonical English
decodeconcept page, backed by the existingconcept.decoderegistry record and localized messages, so readers can understand the repeated next-token stage, distinguish it from prefill and sampling, and navigate cleanly through adjacent serving and generation docs.","context": {
"customerAsk": "Customer ask alignment: keep the refill batch customer-visible and page-local while strengthening the inference-serving and GPT reader journey with a concept page that already has a registry record but no canonical docs page on current
main. Add the canonical English concept page fordecodeusing the existing publishedconcept.decoderegistry record. Scope: create the canonical concept page plus colocatedmessages/en.jsonand any neededassets.json; classify it correctly with the project docs; wire aliases, tags, and related links so readers can move betweenprefill,prefill-decode-split,kv-cache,autoregressive-generation,sampling-overview,batching,continuous-batching,memory, andspeculative-decoding; and add only the focused validation needed for the touched page and discovery surfaces. The page should explain what the decode stage is in plain language, why it repeats token by token, why its performance profile differs from prefill, and how readers should separate decode mechanics from higher-level sampling choices. Keep the slice English-only and avoid reopening locale infrastructure, serving runtime code, or unrelated graph-family churn. Acceptance criteria: thedecodeconcept page exists on currentmainas a canonical docs page, resolves against the existing registry record, supports serving-path discovery, and focused touched checks pass.","problem": "The site already has the published
concept.decoderegistry record and nearby pages coveringprefill,prefill-decode-split,kv-cache,autoregressive-generation,sampling-overview, and several serving-system topics, but it still lacks the canonical broad explainer that teaches decode itself as the repeated next-token stage of generation. Readers therefore have to infer decode from narrower cache, batching, or sampling pages instead of landing on one plain-language concept surface that explains why decode repeats, why inter-token latency feels different from prefill latency, and why decode mechanics are not the same thing as token-selection policy.","solution": "Publish a canonical English
/docs/concepts/decodepage using the standard concept-page contract, bind it to the existingconcept.decoderegistry record, make the concept route the clear canonical broad explainer instead of leaving glossary-only discovery as the main surface, connect the required aliases, tags, and related-doc paths to nearby serving and generation pages, and add only the focused validation needed to prove route, message, registry, and discovery integrity for this narrow slice."},
"acceptanceCriteria": [
"A published canonical docs page exists for
decodeunder the concepts docs tree, with matching English messages and any required local assets.","Discovery surfaces treat the concept route as the canonical broad explainer for
decodeand do not leave readers with competing glossary-only canonical targets for the same concept.","The page resolves cleanly to the existing
concept.decodediscovery record and preserves accurate registry-backed discovery through aliases, tags, and related-doc relationships.","The page opens with one folded
openingSummary, explains decode in layperson terms, clearly distinguishes the repeated next-token decode loop from prefill, and explains why decode performance often shows up as inter-token latency and cache-growth pressure.","The page clearly separates decode mechanics from higher-level sampling choices and guides readers toward
sampling-overviewwhen they want to learn how the next token is chosen.","Readers can navigate between
decode,prefill,prefill-decode-split,kv-cache,autoregressive-generation,sampling-overview,batching,continuous-batching,memory, andspeculative-decodingthrough shipped nearby discovery surfaces with no broken or misleading links.","Focused validation covers the touched page contract plus the registry-backed related-doc, route, or search expectations needed for this slice, without broad unrelated test expansion.",
"Quality gate:
make typecheck,make lint, and focused tests pass."],
"userStories": [
{
"id": "decode-concept-page-001",
"title": "Promote
decodeto the canonical concept discovery surface","description": "As a reader searching for decode, I want the existing
concept.decoderecord and route behavior to lead me to one canonical broad explainer so I do not have to infer the topic only from narrower cache, latency, or batching pages.","acceptanceCriteria": [
"The existing
concept.decoderecord remains the canonical backing record and is updated only as needed with aliases, related ids, tags, citations, or other controlled metadata that match a broad concept page.","Discovery surfaces expose
/docs/concepts/decodeas the canonical broad explainer fordecode, and any retained glossary surface does not compete as the main canonical destination for the same concept.","Registry relationships connect the concept page to shipped nearby docs for
prefill,prefill-decode-split,kv-cache,autoregressive-generation,sampling-overview,batching,continuous-batching,memory, andspeculative-decoding.","Discovery metadata distinguishes the broad
decodeconcept from neighboring serving-system pages and sampling pages without introducing duplicate canonical targets or broad taxonomy churn.","Typecheck passes",
"Tests pass"
],
"priority": 1,
"passes": true,
"notes": ""
},
{
"id": "decode-concept-page-002",
"title": "Publish the canonical decode concept page",
"description": "As a technical layperson learning about LLM generation and serving, I want a dedicated
decodeconcept page so I can understand the repeated next-token stage before I dive into batching, memory pressure, or token-selection techniques.","acceptanceCriteria": [
"A canonical
decodedocs page exists with matching frontmatter,messages/en.json, and any required localassets.json.","The page opens with one folded
openingSummaryand explains decode in plain language before narrowing into inference-serving context.","The page clearly explains that decode begins after prefill, reuses the current prefix and saved key-value state, computes one next-token step, appends that token, and repeats until generation stops.",
"The page clearly explains why decode repeats token by token instead of rereading the whole prompt as one initial pass.",
"The page clearly explains why decode performance often depends on inter-token latency, cache growth, and memory movement in ways that differ from prefill's prompt-side cost profile.",
"Typecheck passes",
"Tests pass",
"Verify in browser using the Browser plugin"
],
"priority": 2,
"passes": true,
"notes": ""
},
{
"id": "decode-concept-page-003",
"title": "Route readers between decode, serving stages, and sampling",
"description": "As a reader exploring generation tradeoffs, I want related docs, tags, and search surfaces to guide me into the broad
decodeexplainer and then onward to the right neighboring serving and sampling pages.","acceptanceCriteria": [
"Representative queries such as
decode,decoding,token-by-token generation,next-token step, andinter-token latencyreturn the canonical concept page as a direct relevant result.","The concept page renders registry-backed related docs and tag surfaces that connect it to
prefill,prefill-decode-split,kv-cache,autoregressive-generation,sampling-overview,batching,continuous-batching,memory, andspeculative-decoding.","The rendered content explicitly distinguishes decode mechanics from token-selection methods and points readers to
sampling-overviewwhen they want to learn how the next token is chosen.","At least one neighboring shipped page or discovery surface presents the new concept page as a navigable destination so readers can reach it without typing the exact slug.",
"Browser-visible rendering shows the title, folded summary, tags, related-doc links, and references without broken links or missing-content placeholders.",
"Typecheck passes",
"Tests pass",
"Verify in browser using the Browser plugin"
],
"priority": 3,
"passes": true,
"notes": ""
},
{
"id": "decode-concept-page-004",
"title": "Add focused validation for the decode concept-page slice",
"description": "As a maintainer, I want targeted automated proof for the
decodeconcept-page slice so route, message, registry, and discovery regressions are caught without unrelated test expansion.","acceptanceCriteria": [
"Validation or tests confirm the
/docs/concepts/decoderoute,concept.decoderecord, and default English messages resolve together.","Coverage asserts at least one page-specific discovery outcome and at least one page-specific related-link expectation for the new concept page.",
"Focused checks stay limited to touched content and discovery integrity rather than inventory snapshots, locale churn, runtime serving code, or unrelated suite expansion.",
"Typecheck passes",
"Tests pass"
],
"priority": 4,
"passes": true,
"notes": ""
}
]
}