Enforce hard evaluation budgets and checkpoint-safe runs by isty2e · Pull Request #46 · isty2e/variopt

isty2e · 2026-07-02T13:21:20Z

Summary

This PR makes evaluation budgeting strict by default. Study.run(...) and Study.optimize(...) now charge max_evaluations against reported logical evaluation cost, including local-search inner evaluations, rather than only counting returned records. Callers that want the old outer-record behavior can still pass count_evaluation_cost=False.
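To make the difference between the two accounting modes concrete, here is a minimal sketch. It is not variopt's implementation: the Record class, its evaluation_cost field, and the budget_used helper are hypothetical names used only for illustration. Only the count_evaluation_cost flag name comes from this PR.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Record:
    """Hypothetical stand-in for one returned evaluation record."""

    value: float
    # Logical evaluations spent to produce this record, inner ones included.
    evaluation_cost: int = 1


def budget_used(records: list[Record], count_evaluation_cost: bool = True) -> int:
    """Return how much of the budget the given records consume."""
    if count_evaluation_cost:
        # Strict mode: charge every logical evaluation that was reported.
        return sum(r.evaluation_cost for r in records)
    # Legacy mode: one unit per returned record, whatever it cost internally.
    return len(records)


# One plain evaluation plus two records refined by local search.
records = [
    Record(1.2),
    Record(0.7, evaluation_cost=15),
    Record(0.9, evaluation_cost=8),
]
strict_used = budget_used(records)
legacy_used = budget_used(records, count_evaluation_cost=False)
```

With these three records, strict accounting charges 24 evaluations while the legacy outer-record count charges 3, which is why a max_evaluations value tuned for the old default may run out much earlier after this change.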

It also adds stop_at_checkpoint_boundary=True so CSA runs can return the latest checkpoint-safe state when the budget ends inside an unsafe generation segment. Structured and SciPy local-search kernels now reserve enough budget for later proposals in the same batch before spending extra local-search evaluations.
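The batch reservation idea can be sketched as follows. This is an illustration under stated assumptions, not the code in the structured or SciPy kernels: local_search_allowance and plan_batch are hypothetical helpers, and the sketch assumes each proposal needs exactly one base evaluation before any optional local-search refinement.

```python
def local_search_allowance(remaining_budget: int, proposals_left_in_batch: int) -> int:
    """Extra local-search evaluations the current proposal may spend.

    remaining_budget: evaluations still unspent before this proposal starts.
    proposals_left_in_batch: later proposals in the same batch, each of
        which still needs one base evaluation.
    """
    base_cost = 1  # the current proposal's own initial evaluation
    reserved = proposals_left_in_batch  # one evaluation held back per later proposal
    return max(0, remaining_budget - base_cost - reserved)


def plan_batch(budget: int, batch_size: int, wanted_extra: int) -> list[int]:
    """Return the evaluation cost granted to each proposal in one batch."""
    costs: list[int] = []
    remaining = budget
    for index in range(batch_size):
        if remaining < 1:
            # Not even a base evaluation is affordable; stop proposing.
            break
        later = batch_size - index - 1
        extra = min(wanted_extra, local_search_allowance(remaining, later))
        cost = 1 + extra
        costs.append(cost)
        remaining -= cost
    return costs


# Budget of 10, four proposals, each would like up to 5 extra evaluations.
costs = plan_batch(budget=10, batch_size=4, wanted_extra=5)
```

Here the first proposal gets its full refinement, the second is trimmed, and the last two run with their base evaluation only, so the batch total never exceeds the budget.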

User impact

This is a behavior change for code that relied on the old default budget accounting. Over-budget reported evaluation cost now raises EvaluationBudgetExhausted instead of silently assimilating an over-budget step. The changelog and local-optimization/checkpointing docs call out the migration path.
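For callers adapting to the new default, the pattern is roughly "catch the exception and keep what was accepted so far". The sketch below is a toy model only: the EvaluationBudgetExhausted class is defined locally and shares nothing but its name with the library's exception, and BudgetLedger is a hypothetical helper. Whether the real library raises before or after the over-budget step is evaluated is not stated here; the sketch simply refuses to record it.

```python
class EvaluationBudgetExhausted(RuntimeError):
    """Local stand-in; only the name is taken from the PR description."""


class BudgetLedger:
    """Hypothetical ledger that refuses any charge exceeding the budget."""

    def __init__(self, max_evaluations: int) -> None:
        self.max_evaluations = max_evaluations
        self.spent = 0

    def charge(self, cost: int) -> None:
        if self.spent + cost > self.max_evaluations:
            # Hard budget: reject the step instead of silently accepting it.
            raise EvaluationBudgetExhausted(
                f"step costs {cost}, only {self.max_evaluations - self.spent} left"
            )
        self.spent += cost


ledger = BudgetLedger(max_evaluations=10)
accepted: list[int] = []
try:
    for step_cost in [4, 4, 4]:
        ledger.charge(step_cost)
        accepted.append(step_cost)
except EvaluationBudgetExhausted:
    # Migration pattern: stop cleanly and keep the steps accepted so far.
    stopped_early = True
else:
    stopped_early = False
```

In this run the third step would bring the total to 12, so it is rejected and only the first two steps (8 evaluations) are kept.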

Validation

uv run --python 3.11 --extra test ruff check pyproject.toml src tests
uv run --python 3.11 --extra test basedpyright src tests
uv run --python 3.11 --extra test pytest tests -q
mypy src/variopt still reports the existing scipy/joblib missing-stub errors only; the branch-introduced kernel error is gone.

isty2e added 6 commits July 2, 2026 22:05

feat(study): enforce hard evaluation budgets by default

ba9cb10

fix(local-search): reserve batch budget during refinement

88f0985

docs: align budget and checkpoint guidance

6db4a3b

fix(local-search): avoid scipy outcome type narrowing conflict

6e7bf0b

docs: document hard budget migration impact

3079b64

refactor(study): inline budget batch sizing

fa02a3d

isty2e merged commit 3054e20 into main Jul 2, 2026
3 checks passed

isty2e deleted the fix/hard-evaluation-budget-checkpoints branch July 2, 2026 13:22

This was referenced Jul 2, 2026

[High] Documented checkpointing example fails verbatim against the public Study API #23

Closed

[High] Stale-async execution overshoots the evaluation budget by up to batch_size - 1 #26

Closed
