Update plot_events script by AnnaKwa · Pull Request #1274 · ai2cm/ace

AnnaKwa · 2026-06-12T22:35:14Z

Add option to cache the downloaded dataset instead of saving to temp directory. Currently it defaults to caching so that it's less tedious to call the script, but if desired I can change the default to not cache.
Fix bug where the coarse data would select the 00 hour timestep if the event filename only had YYYYMMDD in the name. Now the script checks the config in the beaker dataset to get the exact event timestamp.
Add TMP2m to the coarse variables to read
Remove coarse PRESsfc relabeling to PRMSL since we now use PRMSL for both inputs and outputs.

The coarse panel was selected using a timestamp parsed from the event filename, defaulting to 12Z when the filename had no hour suffix. Most events are not at 12Z (e.g. the heat wave events are at 00Z), so the coarse field showed the wrong time of day. Read each event's date from the config.yaml saved with the beaker dataset instead, falling back to filename parsing (now defaulting to 00Z, the evaluator convention) only when the event is missing from the config. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>

find_event_files keyed files by the event name with the date stripped, so when a dataset contained several dates for the same event (e.g. three Phl_tc_landfall files) only the last one alphabetically was processed. Key by the full filename stem instead; this also makes the keys match the event names in the evaluator config.yaml. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>

AnnaKwa · 2026-06-15T16:33:45Z

-    """Find netCDF files matching the event naming pattern, keyed by event name."""
+    """Find netCDF files matching the event naming pattern, keyed by filename
+    stem (event name including date, so multiple dates of the same event are
+    kept)."""


Before, if there were files with the same name and different dates, only one was plotted.

frodre

I think there's one leftover error from the adjusted PRMSL conditional, and I think I would prefer an opt in for the caching instead of defaulting to storage (I don't feel strongly). My reasoning being that the datasets are usually small and quick to download. At the very least if not making opt-in, I would suggest the default cache be tied to something that gets wiped with the machine reset like /tmp/beaker, but would still be persistent across script usage.

frodre · 2026-06-27T19:16:50Z

-        ds_["PRMSL_coarse"].values[:] = np.nan
-        # For colorbar range, use only target and predicted (coarse is hidden)
-        arr = ds_[["PRMSL_target", "PRMSL_predicted"]].to_array()
    else:


Does this else block need to be adjusted? It looks like it pairs with the if len(samples)... from above

frodre · 2026-06-27T20:22:54Z

-        ds_["PRMSL_coarse"].values[:] = np.nan
-        # For colorbar range, use only target and predicted (coarse is hidden)
-        arr = ds_[["PRMSL_target", "PRMSL_predicted"]].to_array()
    else:


Does this else block need to be adjusted? It looks like it pairs with the if len(samples)... from above

frodre · 2026-06-27T20:49:31Z

+def fetch_beaker_dataset(
+    dataset_id: str,
+    target_dir: str,
+    prefix: str | None = None,


Prefix seems like a straightforward addition, but I didn't see it actually used anywhere.

frodre · 2026-06-27T20:54:29Z

+    dataset_id: str,
+    target_dir: str,
+    prefix: str | None = None,
+    cache_dir: str | None = "~/Downloads/beaker_cache",


I think persistent cache would make more sense as opt-in not opt-out.

frodre · 2026-06-27T20:59:35Z

+    """
+    if cache_dir is not None:
+        cached = Path(cache_dir).expanduser() / dataset_id
+        if cached.is_dir() and any(cached.iterdir()):


One thing for persistent caches flagged by Claude: a partial beaker fetch that failed would still pass this check. You could add a sentinel file after the subprocess completes successfully.

AnnaKwa and others added 6 commits June 11, 2026 11:51

add tmp2m and remove pressfc handling

3940898

cache downloaded data

bb8bfa6

show cache path in message

e4d3903

fix typo

3809fe7

AnnaKwa commented Jun 15, 2026

View reviewed changes

frodre requested changes Jun 27, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update plot_events script#1274

Update plot_events script#1274
AnnaKwa wants to merge 6 commits into
mainfrom
scripts/update-plot-events

AnnaKwa commented Jun 12, 2026

Uh oh!

AnnaKwa Jun 15, 2026

Uh oh!

frodre left a comment •

edited

Loading

Uh oh!

frodre Jun 27, 2026

Uh oh!

frodre Jun 27, 2026

Uh oh!

frodre Jun 27, 2026

Uh oh!

frodre Jun 27, 2026

Uh oh!

frodre Jun 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

AnnaKwa commented Jun 12, 2026

Uh oh!

AnnaKwa Jun 15, 2026

Choose a reason for hiding this comment

Uh oh!

frodre left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

frodre Jun 27, 2026

Choose a reason for hiding this comment

Uh oh!

frodre Jun 27, 2026

Choose a reason for hiding this comment

Uh oh!

frodre Jun 27, 2026

Choose a reason for hiding this comment

Uh oh!

frodre Jun 27, 2026

Choose a reason for hiding this comment

Uh oh!

frodre Jun 27, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

frodre left a comment •

edited

Loading