Skip to content

Show why ruleset crash checks failed#23

Merged
relh merged 1 commit into
mainfrom
relh/ruleset-crash-check-failure-evidence
Jun 16, 2026
Merged

Show why ruleset crash checks failed#23
relh merged 1 commit into
mainfrom
relh/ruleset-crash-check-failure-evidence

Conversation

@relh

@relh relh commented Jun 9, 2026

Copy link
Copy Markdown
Contributor

Summary

Ruleset round completion could say that crash checks failed without carrying enough evidence about what actually failed. That made the transition hard to debug from the round result alone.

This PR carries failed episode messages from the websocket into ruleset round completion. It also adds scheduled and failed episode counts, failed request ids, and sampled errors to the transition evidence. The docker smoke assertion is relaxed so bundled configs with multi-episode stages still pass the smoke invariant.

Verification

  • uv run pytest tests/test_commissioner_strategies.py tests/test_commissioner_websocket.py -q
  • uv run pytest -q
  • uv run ruff check commissioners/common/models.py commissioners/common/adapters.py commissioners/common/server.py commissioners/common/ruleset_strategy/commissioner.py commissioners/common/ruleset_strategy/round_start.py commissioners/common/ruleset_strategy/membership_events.py tests/test_commissioner_strategies.py tests/test_commissioner_websocket.py tests/test_docker_smoke.py
  • git diff --check

@relh relh changed the title Expose ruleset crash-check failure evidence Show why ruleset crash checks failed Jun 9, 2026
Co-authored-by: Codex GPT-5 <codex@openai.com>
@relh relh force-pushed the relh/ruleset-crash-check-failure-evidence branch from 9ec4a74 to 149d91e Compare June 16, 2026 20:47
@relh relh merged commit c93f964 into main Jun 16, 2026
8 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant