-
Notifications
You must be signed in to change notification settings - Fork 1
Issues
is:issue state:open
is:issue state:open
Issue creation is restricted in this repository
Search results
- Status: Open.#20 In mark-allwyn/BenchPress;
v3 Slice 6: Pareto frontier + over-time tracking + saturation + classical item stats
ready-for-agentReady for autonomous agent pickupReady for autonomous agent pickupStatus: Open.#19 In mark-allwyn/BenchPress;v3 Slice 5: Provider breadth + per-vendor native-config policy + parallel runner
ready-for-agentReady for autonomous agent pickupReady for autonomous agent pickupStatus: Open.#18 In mark-allwyn/BenchPress;v3 Slice 4: Full causal domain - all 20 bundles x 5 variants + remaining scorers
ready-for-agentReady for autonomous agent pickupReady for autonomous agent pickupStatus: Open.#17 In mark-allwyn/BenchPress;v3 Slice 3: Stats core - bootstrapped CIs, marginals, single-source metrics, export
ready-for-agentReady for autonomous agent pickupReady for autonomous agent pickupStatus: Open.#16 In mark-allwyn/BenchPress;v3 Slice 2: Tracer bullet - one causal item end-to-end (generate, run, score, leaderboard)
ready-for-agentReady for autonomous agent pickupReady for autonomous agent pickupStatus: Open.#15 In mark-allwyn/BenchPress;v3 Slice 1: Scaffold + spine - package, registries, tagged-text extractor, status taxonomy
ready-for-agentReady for autonomous agent pickupReady for autonomous agent pickupStatus: Open.#14 In mark-allwyn/BenchPress;PRD: Benchpress v3 - deterministic causal-reasoning frontier benchmark
ready-for-agentReady for autonomous agent pickupReady for autonomous agent pickupStatus: Open.#13 In mark-allwyn/BenchPress;