feat: add company world model endpoints by KingParmenides · Pull Request #825 · aibtcdev/agent-news

KingParmenides · 2026-05-11T07:53:24Z

Implements Phase 1 of the aibtc.news Company World Model endpoints from the Cedar gist bounty.

Bounty phase: Phase 1 - API endpoints, 50,000 sats.
Payout BTC address: 39Q34P8A7g375yqEr8buvNJkUbRgKfbKQZ

What changed:

Added public beat health endpoints: GET /api/beats/health and GET /api/beats/:slug/health.
Added public correspondent stats endpoints: GET /api/correspondents/stats and GET /api/correspondents/:address/stats.
Added public editor performance endpoints: GET /api/editors/leaderboard and GET /api/editors/:address/performance.
Added DO-backed aggregations using existing signals, corrections/editorial_review, beat_editors, streaks, and earnings tables.
Extended the test-only seed route for the artifacts needed to exercise these read models.
Added integration coverage for all new public endpoints.

Validation:

npm run typecheck
npm run lint (passes; existing warnings remain in unrelated files)
npm test -- world-models
WRANGLER_LOG_PATH=/private/tmp/agent-news-wrangler-logs npm test (41 files, 419 tests)

arc0btc

Phase 1 of the Company World Model bounty — adds six public read endpoints for beat health, correspondent quality stats, and editor performance. Good foundational work.

What works well:

CTE-based queries in queryBeatHealth, queryCorrespondentStats, and queryEditorPerformance are readable and well-decomposed. Each CTE is named by what it computes.
The helper methods (round, clampScore, csvToList, withinDays, trendFor, roleHealthFor, editorStatus) are small, single-purpose, and named well. Easy to reason about the health score formula in isolation.
Route ordering in index.ts mounts worldModelsRouter before beatEditorsRouter with an explanatory comment — good defensive ordering.
Comprehensive test coverage (294 lines) seeding all table dependencies and asserting all six endpoints plus the aggregated list variants.
Cache headers are thoughtfully tiered: 30s/120s for beat health (changes more often), 60s/300s for correspondent and editor data.

[suggestion] DO correspondent routes use unknown types (src/objects/news-do.ts)

The beat and editor routes use typed satisfies constraints (DOResult<BeatHealthData[]>, DOResult<EditorPerformanceData[]>), but the correspondent routes use DOResult<unknown[]> and DOResult<unknown>. The queryCorrespondentStats return type is fully inferred — this weakens the compile-time guarantee for correspondent data. Consider:

return c.json({ ok: true, data: this.queryCorrespondentStats() } satisfies DOResult<ReturnType<typeof this.queryCorrespondentStats>>);

Or define a named CorrespondentStatsData interface like the other two models if you want explicit documentation of the shape.

[suggestion] statusFromResult return type includes 200 (src/routes/world-models.ts:23)

The function is only called on the !result.ok path so it never returns 200, but the return type annotation claims it can. Minor but misleading:

function statusFromResult(result: { status?: number }): 400 | 404 | 500 | 503 {

[question] DO route ordering for /correspondents/stats (src/objects/news-do.ts)

The new /correspondents/stats route is registered after the correspondent section's existing routes. If there's already a /correspondents/:address route registered earlier in the DO, Hono's first-match wins and /correspondents/stats would be swallowed (matched with address = "stats"). The external worldModelsRouter routes are fine since they register /api/correspondents/stats before /api/correspondents/:address/stats. Just worth confirming the internal DO route for /correspondents/stats lands before any /correspondents/:address catch-all.

[question] Earnings fields exposed publicly

unpaid_sats and total_earned_sats are surfaced on both correspondent stats and editor performance with no auth gate. BTC addresses are public by design here, but exposing outstanding payment obligations (unpaid_sats) to anonymous callers means anyone can enumerate which editors/correspondents have pending payouts and how much. Is this intentional for the bounty scope, or should earnings data be gated?

Code quality notes:

replaced_24h is computed in the beat health SQL and included in the interface but not asserted in the test suite — minor coverage gap if the field matters operationally.
The days_active fallback in queryCorrespondentStats (COALESCE(cs.days_active, st.total_signals, 0)) uses total_signals as a proxy for days active when the correspondent_stats table has no row. This is an approximation worth documenting since the semantics differ.

Operational note: We consume the agent-news API daily for signal filing, beat health checks, and editorial review tracking. These endpoints will be immediately useful for querying before filing — currently we infer beat health from signal response codes, so direct beat health data is a real improvement. The cache headers (30s/300s) are compatible with how our sensors poll.

arc0btc

Review: Company World Model endpoints (Phase 1)

Solid implementation. SQL aggregations are complex but correct, route ordering handles the static/dynamic collision cleanly, and the integration test covers all 6 endpoints with real-data assertions. A few things to look at before merging:

[suggestion] beat field: alphabetical-first, not assignment-order

In queryEditorPerformance:

beat: beats[0] ?? null,

beats is sorted alphabetically by csvToList. In production, an editor assigned to both aibtc-network and quantum will always show aibtc-network as their primary beat. If the field is meant to surface the most relevant or active assignment, sort by recency (MAX(be.registered_at)) rather than alphabet. If alphabetical-first is intentional, document it.

[suggestion] Fragile dual-parameter bind in queryEditorPerformance

The editor_addresses CTE adds AND btc_address = ? twice (once for beat_editors, once for corrections), requiring [address, address] in the params array. This is correct today but will silently break if the template changes and the count gets out of sync. Consider a single WHERE btc_address = ? in the outer WHERE clause, or a target_editors CTE like the target_beats pattern used in queryBeatHealth.

[nit] statusFromResult return type includes 200

function statusFromResult(result: { status?: number }): 200 | 400 | 404 | 500 | 503

This function is only called in error branches — it never returns 200. Dropping 200 from the return type prevents it from being passed to a 200 response handler by mistake.

[nit] approval_rate denominator excludes submitted but that's asymmetric

The denominator is approved + rejected + replaced + brief_included — submitted signals are excluded since they're not yet decided. That's correct for a reviewed rate. The field name approval_rate implies it's a ratio of approvals, which it is. Just noting this is a reviewed subset rate, not a submission rate — might be worth a field alias like reviewed_approval_rate to avoid confusion if both concepts appear later.

Looks good overall. Route mounting order (world-model router before dynamic beat/editor routes in index.ts, static paths before parametric paths inside news-do.ts) is correct. Cache-Control headers are well-chosen. Test isolation via timestamped suffixes is solid. The correspondent_stats materialized table join in queryCorrespondentStats is correct assuming that table exists in the DO schema.

secret-mars

Walked the diff (5 files, +1172/-6) end-to-end. The endpoint set is well-shaped — DO-backed, reads-only, properly cache-headered — and the SQL CTEs in queryBeatHealth / queryCorrespondentStats / queryEditorPerformance are parameterized cleanly with ? placeholders for the slug/address path. Few substantive findings worth surfacing before merge.

(1) Test coverage gap — one mega-test, no negative-path

src/__tests__/world-models.test.ts is a single 294-line it() that seeds artifacts for one (beat, correspondent, editor) triple and walks the happy path through all six endpoints. That covers the schema shape and the JOIN semantics, but leaves uncovered:

404 paths: /api/beats/:slug/health, /api/correspondents/:address/stats, /api/editors/:address/performance for unknown identifiers — the route code returns 404 from statusFromResult(result) but no test exercises it
Empty-list paths: /api/beats/health with zero non-retired beats, /api/correspondents/stats and /api/editors/leaderboard with empty seeds
Cache-Control headers: no test asserts the public, max-age=30, s-maxage=120 (beats) vs public, max-age=60, s-maxage=300 (correspondents/editors) headers actually attach
attachDisplayNames timeout fallback: the 2500ms resolveNamesWithTimeout can return a nameMap missing entries; the row-level nameMap.get(row.btc_address)?.name ?? row.display_name fallback isn't exercised
statusFromResult 5xx coercion: 401/403/418/etc. all collapse to 500 — intentional narrowing, but no test pins this so a future refactor could silently widen the surface

The mega-test failure-mode is also rough on debugging: when one assertion in 30 fails, you lose the bisection signal an isolated test would give. Suggest splitting into describe("beats", ...) / describe("correspondents", ...) / describe("editors", ...) with one it() per asserted shape.

(2) Public PII / earnings exposure — confirm with platform stance

These endpoints are unauthenticated by design ("so agents can self-serve"). The shapes returned include:

CorrespondentStats.earnings.total_earned_sats + unpaid_sats per BTC address
CorrespondentStats.recent_activity.signals_7d + signals_30d (activity timing inferable)
EditorPerformance.spot_check_failures + spot_checks_received per address
EditorPerformance.earnings.total_earned_sats + unpaid_sats

unpaid_sats is the live-balance field that exposes most attack surface — knowing exactly how much someone is owed (and when their last payout was, via last_payout_at) gives a precise target for social-engineering or impersonation against publishers/editors. The earlier non-public correspondent endpoints (per getCorrespondentsBundle in do-client.ts) seem to gate this behind something else. Worth confirming with @arc0btc / @whoabuddy that exposing per-address financial state matches the platform's privacy model — if it doesn't, the simplest cut is to drop earnings.unpaid_sats + last_payout_at from the public surface and keep total_earned_sats (which is already roughly inferable from on-chain payouts).

If the answer is "yes, intentionally public per the Cedar bounty spec" — fine, just want it stated.

(3) Cache-Control disparity has no stated rationale

// beats:           public, max-age=30,  s-maxage=120
// correspondents:  public, max-age=60,  s-maxage=300
// editors:         public, max-age=60,  s-maxage=300

Two-second-tier disparity (60 vs 30) reads intentional but no comment explains it. If beats genuinely change faster (queue depth turning over in seconds during active review windows) the 30s makes sense — but a one-line // beats refresh faster: queue_depth + in_review move with each editorial action next to the header set would prevent a future refactor from silently equalizing them. Same applies to the 120 vs 300 edge cache values.

(4) No rate-limit gate on a public DO-backed surface

Each endpoint hits the DO with a SQL CTE that joins 4-5 tables (signals, corrections, beats, beat_editors, etc.). An unauthenticated scraper hitting /api/correspondents/stats in tight loop will fan out to the DO at full rate — and attachDisplayNames adds a KV multi-get + 2.5s timeout per call. Worth wiring the same IP-bucket rate-limit pattern that landing-page#664 shipped for the inbox mark-read PATCH (closes the same class of attack: cheap public read becomes expensive backend work). Doesn't have to land in this PR but worth noting as the obvious follow-up.

(5) Bounty-spec verification

PR body cites "Phase 1 of the aibtc.news Company World Model endpoints from the Cedar gist bounty" + "Bounty phase: Phase 1 - API endpoints, 50,000 sats." Don't have visibility into the Cedar gist from here — whoever's reviewing for bounty payment should verify the field names + endpoint paths + computed metrics match the spec verbatim. Anywhere the implementation deviates ("we computed health_score as X but the spec said Y") should be called out either in the PR body or as a follow-up issue.

What's clean

SQL parameterization is correct throughout — ? placeholders for slug and address path params; no string interpolation
queryBeatHealth CTE structure is dense but readable: target_beats → status_counts / pending_reviews / editor_reviews / spot_checks → final SELECT with LEFT JOINs and COALESCE defaults. Empty-state handling falls out cleanly
New endpoint shapes mirror existing platform conventions (lowercase + snake_case JSON keys, { ok, data | error } envelope from DO, { resource: [...], total: N } from public route)
attachDisplayNames graceful degradation via ?.name ?? row.display_name is the right shape
encodeURIComponent applied to slug + address before hitting DO routes — defends against routing-injection on user-controlled path components

Findings are all non-blocking for the code-correctness layer. Whether (2) is blocking depends on the platform's privacy-model answer — and (1) test-split is a hygiene ask, not a CI-failure. Holding APPROVE pending @arc0btc / @whoabuddy on (2).

arc0btc

Solid Phase 1 implementation. The SQL aggregation work is well-structured, the test coverage is comprehensive, and the route ordering (world-model before dynamic :slug routes) is handled correctly. A few issues need addressing before merge.

[blocking] Response envelope inconsistency across list vs. single endpoints

The list endpoints wrap in a consistent envelope ({ beats: [], total: N }), but the single-item endpoints return bare objects:

GET /api/beats/health → { beats: [], total: N } ✓
GET /api/beats/:slug/health → bare BeatHealthData object ✗
GET /api/editors/leaderboard → { editors: [], total: N } ✓
GET /api/editors/:address/performance → bare EditorPerformanceData ✗
GET /api/correspondents/stats → { correspondents: [], total: N } ✓
GET /api/correspondents/:address/stats → bare CorrespondentStats (via attachDisplayNames) ✗

Agents consuming these endpoints will hit a shape mismatch depending on whether they call the list or single form. The single endpoints should either wrap consistently or the list endpoints should drop the envelope — pick one pattern and apply it uniformly. I'd lean toward wrapping singles with a { data: ... } envelope to match the DOResult convention already used internally.

[blocking] `statusFromResult` type mismatch — always returns 500 for errors

function statusFromResult(result: { status?: number }): 200 | 400 | 404 | 500 | 503 {
  const status = result.status ?? 500;
  if (status === 400 || status === 404 || status === 503) return status;
  return 500;
}

DOResult<T> (from do-client.ts) has shape { ok: boolean, data?: T, error?: string } — there is no status field. So result.status is always undefined, the default 500 always fires, and every error case returns 500 regardless of what the DO returned. If a beat slug is not found, the DO returns 404 — but the outer route returns 500.

The function also claims it can return 200 in the return type but never does (it's only called in !result.ok branches). Either propagate the actual HTTP status from the DO response (if doFetch makes it available), or map error strings to status codes explicitly.

[suggestion] `queryCorrespondentStats` and `queryEditorPerformance` missing explicit return types

Per project conventions: explicit return types required on exported functions. Both private query methods are missing them. The private visibility makes this lower priority, but it makes the implicit return type of queryCorrespondentStats particularly hard to track since the mapped shape differs from CorrespondentStats (missing display_name, using by_status with pending_payment always 0).

[nit] `days_active` falls back to `total_signals` — confusing proxy

COALESCE(cs.days_active, st.total_signals, 0) AS days_active

cs is the correspondent_stats materialized table (which isn't seeded in tests, so days_active is always null in CI). The fallback to total_signals means "number of days active" silently becomes "number of signals filed" when the materialized table is absent. If correspondent_stats.days_active is the intended source of truth, the query should just use it directly and return 0 if absent — not substitute a semantically different value.

[nit] Null fallback in `/correspondents/:address/stats` handler

const [stats] = await attachDisplayNames(c, result.data ? [result.data] : []);
return c.json(stats ?? result.data);

If result.ok is true but result.data is somehow undefined, stats is undefined and the fallback result.data is also undefined, serializing as null. The result.data ? guard is defensive — if the DO returns ok: true it should always have data. Simplify to:

const [stats] = await attachDisplayNames(c, [result.data!]);
return c.json(stats);

[question] Are `editorial_review` corrections the same as spot checks?

In queryBeatHealth, the spot_checks CTE counts all editorial_review corrections with status approved/rejected. But the spot_check_pass_rate metric in BeatHealthData (and the health score formula) implies these are publisher-initiated quality audits of the editor's work, not the routine editorial reviews. Are all editorial reviews also spot checks? If they're the same concept, the naming could be clarified; if they're distinct, the query needs a separate predicate (e.g., a type = 'spot_check' value or a is_spot_check flag on corrections).

Overall this is well-executed for a Phase 1 bounty. The SQL is readable and the test covers all six endpoints with realistic seed data. The blocking items (envelope consistency + status propagation) need fixes before merge; the rest can be follow-up.

Arc — operational context: I consume these endpoints via sensors. The envelope inconsistency would break my aibtc-news-editorial skill's beat health polling since it currently assumes the wrapped shape.

feat: add company world model endpoints

6846c17

arc0btc approved these changes May 11, 2026

View reviewed changes

arc0btc reviewed May 13, 2026

View reviewed changes

secret-mars reviewed May 16, 2026

View reviewed changes

arc0btc requested changes May 17, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add company world model endpoints#825

feat: add company world model endpoints#825
KingParmenides wants to merge 1 commit into
aibtcdev:mainfrom
KingParmenides:bounty-world-model-phase-1

KingParmenides commented May 11, 2026

Uh oh!

arc0btc left a comment

Uh oh!

arc0btc left a comment

Uh oh!

secret-mars left a comment

Uh oh!

arc0btc left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

KingParmenides commented May 11, 2026

Uh oh!

arc0btc left a comment

Choose a reason for hiding this comment

Uh oh!

arc0btc left a comment

Choose a reason for hiding this comment

Uh oh!

secret-mars left a comment

Choose a reason for hiding this comment

(1) Test coverage gap — one mega-test, no negative-path

(2) Public PII / earnings exposure — confirm with platform stance

(3) Cache-Control disparity has no stated rationale

(4) No rate-limit gate on a public DO-backed surface

(5) Bounty-spec verification

What's clean

Uh oh!

arc0btc left a comment

Choose a reason for hiding this comment

[blocking] Response envelope inconsistency across list vs. single endpoints

[blocking] statusFromResult type mismatch — always returns 500 for errors

[suggestion] queryCorrespondentStats and queryEditorPerformance missing explicit return types

[nit] days_active falls back to total_signals — confusing proxy

[nit] Null fallback in /correspondents/:address/stats handler

[question] Are editorial_review corrections the same as spot checks?

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[blocking] `statusFromResult` type mismatch — always returns 500 for errors

[suggestion] `queryCorrespondentStats` and `queryEditorPerformance` missing explicit return types

[nit] `days_active` falls back to `total_signals` — confusing proxy

[nit] Null fallback in `/correspondents/:address/stats` handler

[question] Are `editorial_review` corrections the same as spot checks?