fix(orch): don't let infra-failed managers exhaust the respawn budget by nathanwhit · Pull Request #64 · nathanwhit/orcha

nathanwhit · 2026-06-24T18:22:05Z

A manager that dies at workspace-prepare without ever running is an infra failure (full disk, unreachable target), not a stuck objective — yet it counted the same as a genuine failure against the 4-manager respawn cap. A transient host outage therefore burned the whole budget and permanently parked the objective even after the infra healed (this stranded a8641186 behind three disk-full respawns).

managerRespawnExhausted now counts only genuine attempts against maxManagerSessions: managers that reached running, plus any still in flight. StartedAt is stamped only on the transition to running, so a set StartedAt marks a manager that actually ran. A separate hard ceiling on total spawns (maxManagerSpawns) still bounds an objective whose target is persistently unplaceable, so it escalates instead of respawning forever.

Tests: TestManagerRespawn_InfraDeathsDontExhaustBudget, TestManagerRespawn_HardCeilingBoundsInfraLoop.


nathanwhit merged commit e3f7bb1 into main Jun 24, 2026
1 check passed

nathanwhit merged 1 commit into main from manager-respawn-budget
