
Tracking: CI improvements #18036

@radical


Goal

Track in-flight and planned CI improvements for the repo: pipeline performance, test-pipeline unification, build/publish validation, workflow security, and a sub-track of agentic automation that reduces manual CI monitoring.

Pipeline performance

Test pipeline unification

Build & publish validation

Workflow security

PR testing / infra

  • Improve pr-testing to handle and test infra changes (branch ankj/pr-testing-infra-skill, not yet pushed)

Agentic CI automation

Reduce hand-monitoring of CI: agents spot and analyze failures, file issues, open PRs for humans to review and approve, then validate their own work. This builds on the repo's existing gh-aw setup and CI plumbing.

Failure visibility / monitoring

Quarantine automation

  • gh-aw quarantine tracker + auto-(un)quarantine — track quarantined-test runs, update per-test issues with per-OS pass/fail history, open human-approved quarantine/unquarantine PRs. Implements the "separate process" described in docs/unquarantine-policy.md (relates to [tests] Quarantined tests dashboard #8813)
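
To make the per-OS pass/fail history concrete, here is a minimal Python sketch of the bookkeeping such a tracker might do. The class name, the method names, and the `min_runs` / `min_rate` thresholds are all hypothetical; the actual criteria would come from docs/unquarantine-policy.md, which this sketch does not reproduce.

```python
from dataclasses import dataclass, field


@dataclass
class QuarantineHistory:
    """Per-OS pass/fail record for one quarantined test (hypothetical shape)."""

    # Maps an OS name to its outcomes, oldest first; True means the run passed.
    runs: dict = field(default_factory=dict)

    def record(self, os_name: str, passed: bool) -> None:
        """Append one quarantined-run outcome for the given OS."""
        self.runs.setdefault(os_name, []).append(passed)

    def pass_rate(self, os_name: str) -> float:
        """Fraction of recorded runs that passed on this OS (0.0 if none)."""
        outcomes = self.runs.get(os_name, [])
        return sum(outcomes) / len(outcomes) if outcomes else 0.0

    def unquarantine_candidate(self, min_runs: int, min_rate: float) -> bool:
        """Assumed rule: every tracked OS needs enough runs and a high pass rate."""
        if not self.runs:
            return False
        return all(
            len(outcomes) >= min_runs and sum(outcomes) / len(outcomes) >= min_rate
            for outcomes in self.runs.values()
        )
```

A `True` result from `unquarantine_candidate` would only make the test eligible for an unquarantine PR; the PR itself still needs human approval.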

CI health

  • gh-aw CI-health report — scheduled report (pinned-issue, repo-pulse pattern)
  • gh-aw CI-health remediation — act on CI-health signals (split slow test projects; add recurring flaky/network signatures to auto-rerun patterns)
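
As a rough illustration of the auto-rerun patterns mentioned above, the following sketch matches log text against a list of flaky/network signatures. The regular expressions and the `should_auto_rerun` helper are assumptions made for the example; the repo's real patterns and their storage format are not shown here.

```python
import re

# Hypothetical signatures; real entries would be derived from recurring CI failures.
RERUN_PATTERNS = [
    re.compile(r"connection reset by peer", re.IGNORECASE),
    re.compile(r"temporary failure in name resolution", re.IGNORECASE),
    re.compile(
        r"\b(502|503|504)\b.*(bad gateway|service unavailable|gateway timeout)",
        re.IGNORECASE,
    ),
]


def should_auto_rerun(log_text: str) -> bool:
    """Return True if any known transient-failure signature appears in the log."""
    return any(pattern.search(log_text) for pattern in RERUN_PATTERNS)
```

Under this scheme, adding a newly observed signature means adding one entry to the list, which is a small change that can go through a reviewed PR.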

Guardrails for the agentic items: confidence thresholds before acting; keep rolling pass-rate even when a rerun recovers; quarantine budget + enforced de-quarantine; cost/recursion caps; global /agent-stop; every code or quarantine mutation goes through a human-approved PR.
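
A few of these guardrails can be sketched as a simple gate that an agent checks before proposing a quarantine PR. Everything named here (`Guardrails`, `may_open_quarantine_pr`, `rolling_pass_rate`, and the field names) is hypothetical, and the confidence score is assumed to lie in [0, 1]; this is an illustration of the idea, not the planned implementation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Guardrails:
    min_confidence: float       # assumed score in [0, 1] required before acting
    quarantine_budget: int      # assumed cap on tests quarantined at once
    stop_requested: bool = False  # models the global /agent-stop switch


def may_open_quarantine_pr(
    guardrails: Guardrails, confidence: float, currently_quarantined: int
) -> bool:
    """Decide whether the agent may propose a quarantine PR (humans still approve it)."""
    if guardrails.stop_requested:
        return False
    if confidence < guardrails.min_confidence:
        return False
    return currently_quarantined < guardrails.quarantine_budget


def rolling_pass_rate(first_attempt_passed: list[bool], window: int) -> float:
    """Pass rate over the last `window` runs, counting first attempts only.

    Using first-attempt outcomes means a failure still counts against the
    test even when an automatic rerun later recovers the job.
    """
    if window <= 0 or not first_attempt_passed:
        raise ValueError("need a positive window and at least one run")
    recent = first_attempt_passed[-window:]
    return sum(recent) / len(recent)
```

The gate only decides whether a PR may be opened; it does not merge or mutate anything on its own, which is consistent with routing every mutation through a human-approved PR.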

Labels: area-engineering-systems
