Skip to content

RA Toolkit end-to-end pipeline enhancements#1355

Closed
allisonmcampbell wants to merge 3 commits into
blue-marble:developfrom
allisonmcampbell:ra-toolkit-e2e
Closed

RA Toolkit end-to-end pipeline enhancements#1355
allisonmcampbell wants to merge 3 commits into
blue-marble:developfrom
allisonmcampbell:ra-toolkit-e2e

Conversation

@allisonmcampbell
Copy link
Copy Markdown
Contributor

Summary

  • Add hydro_balancing_type parameter to data toolkit scripts for controlling hydro balancing granularity (day/week/month)
  • Aggregate all projects to BA level for RA studies (portfolio, opchar, capacity, availability, fixed cost, load zone, fuel assignments)
  • Fix pandas 2.x LossySetitemError in PUDL extraction (int64 casts for datetime columns)
  • Update settings CSV format to 5-column layout (script,setting,value,script_true_false_arg,reverse_default_behavior)
  • Add complete e2e scenario configuration files (raw_data_ra_toolkit_e2e/) including temporal definitions for 28-subproblem synchronized run (14 weather years × 2 hydro years), Monte Carlo iteration configs, and all user-defined mapping tables
  • Add ra_toolkit_e2e_settings_sample.csv for running the full e2e pipeline
  • Add documentation: user guide (docs/ra_toolkit_e2e_guide.md) and detailed changelog (docs/ra_toolkit_e2e_changes.md)

Test plan

  • Run gridpath_run_data_toolkit --settings_csv data_toolkit/ra_toolkit_e2e_settings_sample.csv with PUDL and RA Toolkit raw data in place
  • Verify 691 CSV files generated in db/csvs_ra_toolkit_e2e/
  • Build database and load scenarios: gridpath_create_database, gridpath_load_csvs, gridpath_load_scenarios
  • Run gridpath_run_e2e --scenario ra_toolkit_e2e_sync --solver cbc and confirm 28 subproblems solve
  • Verify existing open_data_toolkit_settings_sample.csv still works with its pipeline

🤖 Generated with Claude Code

@allisonmcampbell allisonmcampbell marked this pull request as draft April 22, 2026 15:51
@allisonmcampbell allisonmcampbell marked this pull request as ready for review April 22, 2026 15:51
allisonmcampbell and others added 3 commits April 24, 2026 09:34
…nario configs

Add hydro_balancing_type parameter to control hydro balancing granularity,
aggregate all projects to BA level for RA studies, fix pandas 2.x dtype
compatibility, and provide complete e2e scenario configuration files
(temporal definitions, iterations, user-defined configs) so the full
pipeline from PUDL download through reliability metrics is reproducible
via gridpath_run_data_toolkit.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
…t names

AGG_PROJECT_NAME_STR now falls back to gridpath_technology when
agg_project is NULL, fixing a DuckDB struct type mismatch in the
open_data test (test_data_toolkit_open_data). Technologies without
agg_project (e.g. BA, CT) now get proper aggregated names like
Batteries_Zone1 instead of NULL.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Adds battery_duration (4h) and pumped_storage_duration (12h) defaults
so storage projects without EIA-860 energy capacity data get filled in.
Includes empty copy files CSV required by the manual_adjustments step.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@allisonmcampbell
Copy link
Copy Markdown
Contributor Author

Closing this PR in favor of three smaller, focused PRs:

  1. Pandas 2.x int64 fix and hydro_balancing_type parameter #1359 — Pandas 2.x int64 fix + --hydro_balancing_type parameter (bug fixes + small feature)
  2. Optional BA-level project aggregation via --aggregate_projects #1360 — Optional BA-level project aggregation via --aggregate_projects flag (new option, backward-compatible)
  3. E2E pipeline template: PUDL-to-GridPath config and docs #1361 — E2E pipeline template: config files, settings sample, and docs (all new files)

This split makes each easier to review independently. The original bundled approach made it harder to evaluate the design changes separately from the bug fixes and configuration files.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant