Extended standalone driver test by msimberg · Pull Request #1303 · C2SM/icon4py

msimberg · 2026-06-10T13:05:36Z

Nothing to see here. Still depends on #1295 (for parameterizable standalone driver tests) and #1294 (for easier CI parameterization), but opening to experiment with compilation options for bitwise reproducible results between single- and multi-rank runs.

Adds a new testing level (extended) and one test that uses it: a seven-day standalone driver test. We can bikeshed about how "extended" this should be.

Make cscs ci test jobs correspond more closely to nox sessions.
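
As a rough illustration of why bitwise reproducibility between single- and multi-rank runs needs care: floating-point addition is not associative, so splitting a sum across ranks can change the result. The sketch below is not icon4py code. `naive_sum`, `rank_sum`, and the input values are hypothetical and only mimic a reduction where each rank sums a contiguous chunk and the partial sums are then combined in rank order.

```python
import math
import struct


def bits(x: float) -> str:
    """Return the IEEE 754 binary64 bit pattern of x as a hex string."""
    return struct.pack(">d", x).hex()


def naive_sum(values) -> float:
    """Left-to-right accumulation with plain floating-point addition."""
    total = 0.0
    for value in values:
        total += value
    return total


def rank_sum(values, n_ranks: int) -> float:
    """Hypothetical reduction: each rank sums a contiguous chunk,
    then the partial sums are added in rank order."""
    chunk = math.ceil(len(values) / n_ranks)
    partials = [naive_sum(values[i:i + chunk]) for i in range(0, len(values), chunk)]
    return naive_sum(partials)


values = [0.5, 1e16, -1e16, 0.5]

single = rank_sum(values, 1)  # one rank sums everything in order
double = rank_sum(values, 2)  # two ranks sum two values each

print(single, bits(single))
print(double, bits(double))

# math.fsum returns the correctly rounded exact sum, so it does not
# depend on the order in which the values are visited.
print(math.fsum(values), math.fsum(reversed(values)))
```

Here the single-rank and two-rank results differ, and neither matches the exact sum. An order-independent summation such as `math.fsum` is one way to get identical results; whether that, compiler options, or another approach is appropriate for the driver test is left open.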

msimberg · 2026-06-25T11:11:03Z

cscs-ci run default;SESSIONS=model_mpi;MODEL_MPI_SUBPACKAGES=standalone_driver;common;BACKENDS=gtfn_cpu:dace_cpu:gtfn_gpu;LEVELS=unit:integration:validation;MODEL_MPI_SUBSETS=datatest

msimberg · 2026-06-25T11:20:24Z

cscs-ci run default;SESSIONS=model_mpi;MODEL_MPI_SUBPACKAGES=standalone_driver:common;BACKENDS=gtfn_cpu:dace_cpu:gtfn_gpu;LEVELS=unit:integration:validation;MODEL_MPI_SUBSETS=datatest

msimberg · 2026-06-25T13:07:14Z

cscs-ci run default;SESSIONS=model_mpi;MODEL_MPI_SUBPACKAGES=standalone_driver;BACKENDS=gtfn_cpu:dace_cpu:gtfn_gpu;LEVELS=integration:validation;MODEL_MPI_SUBSETS=datatest

msimberg · 2026-06-25T15:11:19Z

cscs-ci run default;SESSIONS=model_mpi;MODEL_MPI_SUBPACKAGES=standalone_driver;BACKENDS=gtfn_cpu:dace_cpu:gtfn_gpu;LEVELS=integration:validation;MODEL_MPI_SUBSETS=datatest

This PR unifies the default, extra, distributed, dace pipelines into one pipeline that can be parameterized to run different subsets of tests. The test jobs are generated by a new script `generate_ci_pipeline.py` because we can't use variables to dynamically filter gitlab matrix jobs. The test jobs are triggered as a gitlab child pipeline (https://docs.gitlab.com/ci/pipelines/downstream_pipelines/#parent-child-pipelines).

There are now three main CI pipelines (in addition to the benchmark pipelines which are mostly unchanged, except to adapt to the changes in `base.yml`):

- `default` still exists, and should be the "default" pipeline to run when you want to check if the PR is roughly in shape. This runs serial and MPI tests with one GPU backend. Bikeshedding on what belongs here welcome, but in my opinion this should be relatively small. `default` can and should be customized to run a subset of tests that are relevant for your changes (see below). A future extension would be to automatically detect which tests actually need to run based on the changes in the PR (see e.g. #1291). This will not be implemented in this PR.
- `merge` is new, and will be made gating for PR merges. `default` will not be required for merges since it's a subset of `merge`. `merge` is roughly like the current `default` pipeline in that it runs most tests with all backends. This can be triggered when a PR has been approved and auto-merge has been enabled. A future extension is to make this pipeline run using merge queues (https://docs.github.com/en/repositories/configuring-branches-and-merges-in-your-repository/configuring-pull-request-merges/managing-a-merge-queue).
- `all` is also new, and runs _all_ tests, including future "extended" or "validation" tests (#1303). This is meant to be run e.g. once a week on `main`.

I've currently set up a run that runs daily at 2 AM on this branch (first run here: https://gitlab.com/cscs-ci/ci-testing/webhook-ci/mirrors/5125340235196978/2255149825504677/-/pipelines/2624464232). I'll update this to `main` and weekly when this is merged.

As mentioned above, the `default` pipeline is intended to be customized with CI variables, e.g.

> cscs-ci run default;SESSIONS=model;MODEL_SUBPACKAGES=dycore

All variables can be overwritten. Otherwise they take the default values from the `default` pipeline definition.

The test reminder has been updated with some hints and reminders about which pipelines to run and how to customize them.

The `dace`, `extra`, and `distributed` pipelines have been removed since they're covered by the other pipelines now.

Slightly out of scope, I've changed some of the scripts in `scripts/` to use `--only-group scripts` to not pull in unnecessary dependencies. The new `generate_ci_pipeline.py` script is run directly with `python scripts/python/generate_ci_pipeline.py` instead of through `scripts/run` because the latter pulls in many unnecessary dependencies. By running it directly with `--only-group scripts`, `generate_ci_pipeline.py` can be run in a very slim container which is much faster to pull.

Since the `all` pipeline now ran tests that were previously not run in CI, it exposed some numpy/cupy mismatches in the RBF tests. I've fixed those at the same time.

---------

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>
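
To make the idea of generating test jobs from colon-separated variables concrete, here is a minimal sketch. It is not the actual `generate_ci_pipeline.py`. The `DEFAULTS` values, the job naming scheme, and the placeholder `script` entry are assumptions made for illustration; only the variable names `SESSIONS`, `BACKENDS`, and `LEVELS` come from the text above. The output is JSON, which is also valid YAML, so it could in principle be consumed as a child pipeline definition.

```python
import itertools
import json

# Hypothetical defaults; the real ones live in the pipeline definitions.
DEFAULTS = {
    "SESSIONS": "model:model_mpi",
    "BACKENDS": "gtfn_cpu:gtfn_gpu",
    "LEVELS": "unit:integration",
}


def split(value: str) -> list[str]:
    """Split a colon-separated CI variable into its non-empty entries."""
    return [item for item in value.split(":") if item]


def generate_jobs(overrides: dict[str, str]) -> dict[str, dict]:
    """Build one job per (session, backend, level) combination.

    Overrides replace the defaults, which is how a smaller subset of
    the full matrix is selected.
    """
    settings = {**DEFAULTS, **overrides}
    axes = [split(settings[key]) for key in ("SESSIONS", "BACKENDS", "LEVELS")]
    jobs = {}
    for session, backend, level in itertools.product(*axes):
        jobs[f"test_{session}_{backend}_{level}"] = {
            "stage": "test",
            "variables": {"SESSION": session, "BACKEND": backend, "LEVEL": level},
            # Placeholder: the real job would invoke the matching nox session.
            "script": ["echo placeholder"],
        }
    return jobs


jobs = generate_jobs({"SESSIONS": "model_mpi", "LEVELS": "integration"})
print(json.dumps(jobs, indent=2))
```

Because the filtering happens in Python before the child pipeline exists, no dynamic filtering of matrix jobs is needed on the GitLab side.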

msimberg · 2026-06-26T09:53:49Z

  stage: test
  timeout: 8h
  variables:
+    ICON4PY_DALLCLOSE_PRINT_INSTEAD_OF_FAIL: true


Remove this.

msimberg · 2026-06-26T10:15:17Z

cscs-ci run default;SESSIONS=model_mpi;MODEL_MPI_SUBPACKAGES=standalone_driver;BACKENDS=gtfn_cpu:dace_cpu:gtfn_gpu;LEVELS=integration:validation;MODEL_MPI_SUBSETS=datatest

msimberg · 2026-06-26T11:02:45Z

cscs-ci run default;SESSIONS=model_mpi;MODEL_MPI_SUBPACKAGES=standalone_driver;BACKENDS=gtfn_cpu:dace_cpu:gtfn_gpu;LEVELS=integration:validation;MODEL_MPI_SUBSETS=datatest

github-actions · 2026-06-26T11:02:51Z

Mandatory Tests

Before merging, run the merge pipeline with cscs-ci run merge. Merging is blocked unless this pipeline passes.

When developing, you can test your changes on CSCS CI before merge with the default pipeline: cscs-ci run default. This will run a default subset of tests.

You can pass options to override pipeline variables, for example:

cscs-ci run default;BACKENDS=gtfn_cpu;LEVELS=unit
cscs-ci run default;MODEL_SUBPACKAGES=common:driver;SESSIONS=model

Avoid running the pipeline for all tests when you are developing.

Available options are:

SESSIONS: model, model_mpi, or tools (correspond to nox sessions)
MODEL_SUBSETS: datatest, basic, or stencils (correspond to nox session selections)
MODEL_SUBPACKAGES: subpackages for non-MPI tests (last component, e.g. diffusion, standalone_driver)
MODEL_MPI_SUBPACKAGES: subpackages for MPI tests (as above)
BACKENDS: backends
GRIDS: grids for stencil tests (simple, icon_regional, or icon_global)
LEVELS: testing level for non-stencil tests (any, unit, or integration)

See scripts/python/generate_ci_pipeline.py and noxfile.py for available values for each option.
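
The trigger syntax above uses `;` between `KEY=VALUE` assignments and `:` between multiple values of one key. The following sketch shows one way to read such a comment. It is an assumption for illustration, not how cscs-ci actually parses comments, and `parse_trigger` is a hypothetical helper.

```python
def parse_trigger(comment: str) -> tuple[str, dict[str, list[str]]]:
    """Split a 'cscs-ci run <pipeline>;KEY=V1:V2;...' comment into
    the pipeline name and a mapping from variable to list of values."""
    prefix = "cscs-ci run "
    if not comment.startswith(prefix):
        raise ValueError("not a cscs-ci trigger comment")
    pipeline, *assignments = comment[len(prefix):].strip().split(";")
    overrides: dict[str, list[str]] = {}
    for assignment in assignments:
        key, sep, value = assignment.partition("=")
        if not sep:
            # A ';' typed where ':' was meant ends up here.
            raise ValueError(f"expected KEY=VALUE, got {assignment!r}")
        overrides[key] = value.split(":")
    return pipeline, overrides


pipeline, overrides = parse_trigger(
    "cscs-ci run default;MODEL_SUBPACKAGES=common:driver;SESSIONS=model"
)
print(pipeline, overrides)
```

With this reading, `MODEL_MPI_SUBPACKAGES=standalone_driver;common` is rejected because `common` is not an assignment, whereas `standalone_driver:common` yields two subpackages.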

The all pipeline can be run with cscs-ci run all. This will run all icon4py tests in CSCS CI which can be expensive. This pipeline runs on a schedule on main, and can be run when extensive validation is needed (e.g. before releases).

Optional Tests

To run benchmarks you can use:

cscs-ci run benchmark-bencher

For more detailed information please look at CI in the EXCLAIM universe.

msimberg · 2026-06-26T11:11:02Z

cscs-ci run default;SESSIONS=model_mpi;MODEL_MPI_SUBPACKAGES=standalone_driver;BACKENDS=gtfn_cpu:dace_cpu:gtfn_gpu;LEVELS=integration:validation;MODEL_MPI_SUBSETS=datatest

msimberg added 30 commits May 27, 2026 16:21

Use common base image for default and distributed ci pipelines

17ef83e

POC first unified default pipeline

e627a5c

Remove mpi PYVERSION_PREFIX

4a62ab7

Add test selection filter to ci

df1cce7

simplify pipeline definitions

fcae5c6

Use : as separator for ci pipelines

fb4906b

fixes to regexes and names

6578a60

Debugging rules

9975f03

Try to expand regex patterns

095674e

try more abstraction

d458256

Use child pipelines for dynamically filtered CI pipelines

0afde56

Add another test stage

ce1f03f

Declare correct dependency

5a2ded5

Fix where test.yaml is output

43913aa

Forward pipeline variables

8c46f61

Use lightweight ci runner to generate test jobs

99d9e4a

Add helper for computing analytical mean geometry quantities

c593a81

Use different base image

53fb7c5

More memory

af70f10

Small fixes

37d59f8

Change scripts to use only scripts group

3c2a912

Use --group scripts for those that need icon4py deps

95bbbe5

Remove UV_PROJECT_ENVIRONMENT

ac9a052

Cleanup

b6eb010

More unification

4147e47

Make cscs ci test jobs correspond more closely to nox sessions.

Merge branch 'main' into unified-ci-pipelines

d3bf876

Add nox, clang-format etc. to path via global .venv

3740caa

Cleanup

0a5ad48

Revert some unnecessary changes

8646ef2

Cleanup

a835a86

msimberg added 6 commits June 25, 2026 10:00

Add dace.yml shorthand for dace-specific testing

9efc9d2

Update scripts/python/generate_ci_pipeline.py

32fd766

Update .github/workflows/mandatory_and_optional_test_reminder.yml

1605396

Remove dace.yml again

2aff76f

Add comment about all pipeline to test reminder

2914011

Expect zero diff on more tests

a5dc683

msimberg mentioned this pull request Jun 25, 2026

Set nonzero tolerance for RHO_REF_ME with dace_gpu in test_parallel_grid_manager.py #1343

Merged

msimberg added 4 commits June 25, 2026 14:01

Revert zero diff on more tests, keep only standalone driver for now

8e9350a

Merge branch 'unified-ci-pipelines' into extended-ci-tests

37cec05

Revert some unnecessary changes

f33b122

Remove longer dycore time limit again

c1ae6ff

msimberg added 3 commits June 25, 2026 15:13

Merge branch 'unified-ci-pipelines' into extended-ci-tests

57f37a7

Clean up analytical means

98d6efd

Merge branch 'deterministic-means' into extended-ci-tests

0ea0738

Merge branch 'main' into extended-ci-tests

a33a3d7

msimberg commented Jun 26, 2026


msimberg added 4 commits June 26, 2026 12:01

Merge remote-tracking branch 'origin/main' into deterministic-means

05844d8

Update references for mean quantities

4d7ad4f

Merge branch 'deterministic-means' into extended-ci-tests

35210ff

Do seven days again for validation test

765fa26

Remove dallclose print instead of fail

6b3ae3c

Extended standalone driver test #1303: msimberg wants to merge 181 commits into C2SM:main from msimberg:extended-ci-tests
