REF Unify data shapes in forecasting solvers by felixdivo · Pull Request #28 · benchopt/benchmark_tsfm

felixdivo · 2026-05-29T12:09:00Z

#22 was merged a bit early. This PR now changes:

Deduplicate type signatures and docs on types
Add some consistency checks in data classes
Consolidate docs
Unify shapes for forecasting and document them
Confirmed Chronos 1 and 2 to be working with the new API
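
For illustration, the kind of consistency check meant here could look like the sketch below. It is not the code from benchmark_utils/outputs.py: the class name `ForecastOutputSketch`, its fields, and the median-based `point` property are assumptions, and H, C, Q are read as horizon, channels, and quantiles.

```python
from dataclasses import dataclass

import numpy as np


@dataclass(frozen=True)
class ForecastOutputSketch:
    """Hypothetical container, not the class from benchmark_utils/outputs.py."""

    # Quantile forecasts laid out as (n_cutoffs, H, C, Q), quantiles last.
    quantiles: np.ndarray
    # One quantile level per entry along the last axis.
    quantile_levels: tuple

    def __post_init__(self):
        # Reject anything that is not a 4-D array.
        if self.quantiles.ndim != 4:
            raise ValueError(
                f"expected (n_cutoffs, H, C, Q), got shape {self.quantiles.shape}"
            )
        # The last axis must match the number of quantile levels.
        if self.quantiles.shape[-1] != len(self.quantile_levels):
            raise ValueError("last axis does not match the number of quantile levels")
        # Levels must lie strictly inside (0, 1) and be sorted.
        if any(not 0.0 < q < 1.0 for q in self.quantile_levels):
            raise ValueError("quantile levels must lie strictly between 0 and 1")
        if list(self.quantile_levels) != sorted(self.quantile_levels):
            raise ValueError("quantile levels must be sorted in increasing order")

    @property
    def point(self) -> np.ndarray:
        """Point forecast (n_cutoffs, H, C): the quantile level closest to 0.5."""
        idx = int(np.argmin(np.abs(np.asarray(self.quantile_levels) - 0.5)))
        return self.quantiles[..., idx]


out = ForecastOutputSketch(np.zeros((2, 5, 3, 3)), (0.1, 0.5, 0.9))
```

Failing early in `__post_init__` means a solver that still emits another axis order is caught at construction time, provided the axis sizes differ, instead of silently producing wrong metrics.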

felixdivo · 2026-05-29T12:09:22Z

I'm currently still testing it locally to make sure it runs nicely.

felixdivo · 2026-05-29T15:14:18Z

TODOs:

Test thoroughly by hand (e.g., Toto 2 and metrics are likely broken now)

felixdivo · 2026-05-29T15:14:40Z

cc @kalebphipps :)

rtavenar · 2026-06-01T07:18:45Z

I'm OK with this PR, but there are a few conflicts to be resolved before merging.

felixdivo · 2026-06-01T14:11:07Z

Thanks for checking! I can take care of them later this evening :)

Resolve conflicts and converge the forecasting data layout to the branch convention (n_cutoffs, H, C, Q), with Q last, everywhere:

- outputs.py: keep Q-last point/__post_init__, add main's flatten() adapted to (M, H, C, Q).
- chronos.py: take Chronos-2 docstring/References; AD adapter uses the build_adapter() return style without the invalid prediction_length kwarg. Rewrite _ChronosForecaster to consume Chronos-2's native quantile output (list[(C, Q, H)]) instead of v1 sample draws; this fixes "AttributeError: 'list' object has no attribute 'float'".
- chronos2.py: fix the Chronos2Pipeline import path and make the module self-contained (no unresolvable cross-solver import under benchopt); _Chronos2Forecaster produces Q-last output.
- toto2.py, moment.py, metrics.py, leakage tests: align to Q-last.
- objective.py: expose a 'value' key for benchopt's stopping criterion on the non-leaky forecasting path.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
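
As a rough sketch of the layout change described in this commit, the snippet below stacks a list of per-series arrays shaped (C, Q, H) into a single Q-last array shaped (M, H, C, Q). The helper name `assemble_q_last` is hypothetical, plain NumPy arrays stand in for the model output, and every entry is assumed to share the same C, Q, and H.

```python
import numpy as np


def assemble_q_last(per_series):
    """Stack per-series arrays of shape (C, Q, H) into one (M, H, C, Q) array."""
    # Stacking adds the leading series/cutoff axis: (M, C, Q, H).
    stacked = np.stack([np.asarray(a) for a in per_series], axis=0)
    # Reorder axes so the horizon comes second and quantiles come last.
    return stacked.transpose(0, 3, 1, 2)


# Toy input: M=4 entries, each with C=2 channels, Q=3 quantiles, H=5 steps.
toy = [
    np.arange(2 * 3 * 5, dtype=float).reshape(2, 3, 5) + 100 * m
    for m in range(4)
]
out = assemble_q_last(toy)
```

After the call, `out[m, h, c, q]` holds the same value as `toy[m][c, q, h]`, which is the property the Q-last convention relies on.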

chronos.py had been switched to Chronos 2 (Chronos2Pipeline) during the BaseTSFMSolver iteration, leaving both solver files on the same model. Restore chronos.py to the Chronos v1 ChronosPipeline (amazon/chronos-t5-*) on plain BaseSolver, mirroring upstream/main and chronos2.py's structure.

Three adaptations beyond upstream's v1 forecaster were required by this branch's evolved contracts:

- _assemble_output emits the branch's (n_cutoffs, H, C, Q) quantile layout.
- Seed before Monte-Carlo sampling so the behavioural leakage probe (benchmark_utils.leakage), which compares two predict() calls on identical history, does not read sampling noise as a leak (would force value=inf).
- Predict in bounded chunks so the full-batch T5 forward (O(L^2) attention) does not exhaust memory on datasets with many long series (e.g. m4_weekly).

Verified end-to-end on CPU:

    benchopt run -s chronos -d "Monash[dataset_name=m4_weekly_dataset]"
    benchopt run -s chronos2 -d "Monash[dataset_name=m4_weekly_dataset]"

Both produce finite metrics with leakage=0 (chronos v1 mase=0.43).

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
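
The seeding and chunking adaptations can be sketched roughly as follows. Everything here is illustrative: `fake_sample_forecast` is a stand-in for a sampling forecaster rather than the Chronos pipeline, `predict_in_chunks` and its parameters are hypothetical names, and the real solver presumably seeds its deep-learning framework's generator instead of a NumPy one.

```python
import numpy as np


def fake_sample_forecast(batch, horizon, num_samples, rng):
    """Stand-in for a sampling model: returns (len(batch), num_samples, horizon)."""
    # Use the last observed value of each series as the forecast centre.
    last = np.array([series[-1] for series in batch], dtype=float)
    noise = rng.normal(size=(len(batch), num_samples, horizon))
    return last[:, None, None] + noise


def predict_in_chunks(histories, horizon, num_samples=20, chunk_size=2, seed=0):
    """Seed once per call, then run the model on bounded slices of the input."""
    # A fresh generator with a fixed seed makes repeated calls reproducible.
    rng = np.random.default_rng(seed)
    chunks = []
    for start in range(0, len(histories), chunk_size):
        # Only chunk_size series are processed at a time, bounding peak memory.
        batch = histories[start:start + chunk_size]
        chunks.append(fake_sample_forecast(batch, horizon, num_samples, rng))
    return np.concatenate(chunks, axis=0)


histories = [np.arange(n, dtype=float) for n in (10, 12, 8, 15, 9)]
first = predict_in_chunks(histories, horizon=4)
second = predict_in_chunks(histories, horizon=4)
```

Because the generator is re-created with the same seed on every call, `first` and `second` are identical, so a probe that compares two predictions on the same history sees no difference caused by sampling noise.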

Resolve conflicts from main's "FIX benchopt tests (benchopt#40)".

All conflicts were the quantile-axis convention: the branch uses quantiles-last (n_cutoffs, H, C, Q) while main only reformatted the old quantiles-second (n_cutoffs, Q, H, C) layout; kept the branch's convention everywhere (naive, seasonal_naive, tfc_api, metrics, outputs, base_solver, leakage tests).

For chronos2.py both sides independently rebased onto BaseTSFMAdapter; kept the branch's typed predict + quantiles-last _assemble_output and switched to the centralized POOLERS from benchmark_utils.adapters (dropping the duplicate local dict / imports).

Wrapped two over-length lines in base_solver.py to satisfy ruff E501.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
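
To make the two conventions concrete, here is a small sketch that converts an array from the old quantiles-second layout (n_cutoffs, Q, H, C) to the quantiles-last layout (n_cutoffs, H, C, Q). The helper `to_q_last` is a hypothetical name for illustration and is not part of the benchmark code.

```python
import numpy as np


def to_q_last(q_second):
    """Convert (n_cutoffs, Q, H, C) to (n_cutoffs, H, C, Q)."""
    if q_second.ndim != 4:
        raise ValueError(f"expected a 4-D array, got shape {q_second.shape}")
    # Move the quantile axis (position 1) to the end; other axes keep their order.
    return np.moveaxis(q_second, 1, -1)


# Toy array with n_cutoffs=2, Q=3, H=5, C=4.
old = np.arange(2 * 3 * 5 * 4).reshape(2, 3, 5, 4)
new = to_q_last(old)
```

`np.moveaxis` returns a view, so the conversion itself copies no data; `new[n, h, c, q]` reads the same element as `old[n, q, h, c]`.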

felixdivo · 2026-06-08T19:29:04Z

I merged the main branch into this to resolve any conflicts + have the CI pipeline run checks before merging.

@tomMoral could you allow the CI pipeline to run?

felixdivo · 2026-06-09T12:15:53Z

Ah yeah, I realized removing sampling_strategy = "run_once" was a mistake while improving the CI pipeline on another branch I'm currently working on. ~~I'll undo that specific change in a minute (was in multiple locations).~~ Your change looks good to me.

felixdivo · 2026-06-09T14:13:10Z

@rtavenar I think this PR is ready to be merged. Do you have the time to review it?

tomMoral

LGTM! a few nitpicks but good to go otherwise :)

REF Improve benchopt#22

24b29bc

Merge branch 'main' into base-tsfm-solver-iteration

fc1b07b

rtavenar reviewed May 29, 2026

Comment thread benchmark_utils/base_solver.py

Fix implementation of chronos, add consistency checks in Forecasting …

4b8b842

…dataclasses

felixdivo requested a review from rtavenar May 29, 2026 15:07

felixdivo mentioned this pull request May 29, 2026

Design choices on the solver side #31

Open

Remove unused sampling_strategy definition from solver classes

2ab5984

felixdivo changed the title ~~REF Improve over #22~~ REF Unify data shapes in forecasting solvers Jun 6, 2026

felixdivo commented Jun 7, 2026

Comment thread benchmark_utils/base_solver.py

felixdivo commented Jun 7, 2026

Comment thread benchmark_utils/inputs.py

Cleanups after merge

3dc2e32

felixdivo mentioned this pull request Jun 7, 2026

FEAT: Forecasting: Use multiple batches for large datasets #43

Open

felixdivo and others added 2 commits June 7, 2026 06:13

tomMoral reviewed Jun 9, 2026

Comment thread objective.py

Apply suggestion from @tomMoral

8d4b736

felixdivo self-assigned this Jun 9, 2026

tomMoral approved these changes Jun 9, 2026

Comment thread benchmark_utils/base_solver.py Outdated

Comment thread benchmark_utils/base_solver.py Outdated

Apply suggestions from code review

e14ccdd

Co-authored-by: Thomas Moreau <thomas.moreau.2010@gmail.com>

felixdivo wants to merge 10 commits into benchopt:main from felixdivo:base-tsfm-solver-iteration


3 participants
