Consolidate experiment design-matrix attributes into xr.Dataset by drbenvincent · Pull Request #849 · pymc-labs/CausalPy

drbenvincent · 2026-04-15T20:13:37Z

Summary

Bundles loose xr.DataArray attributes on 9 experiment classes into xr.Dataset objects, reducing attribute sprawl while keeping related design matrices together.
- Pre/post classes (ITS, SC, SDiD): 4 DataArrays consolidated into pre_design / post_design Datasets.
- Formula-based classes (DiD, RD, RK, PrePostNEGD, PiecewiseITS, PanelRegression): 2 DataArrays consolidated into a single design Dataset.
All old attribute names preserved via a centralized deprecation mechanism (BaseExperiment.__getattr__ driven by a declarative _deprecated_design_aliases mapping) that emits DeprecationWarning for backward compatibility.
Updates reporting.py, maketables_adapters.py, checks/convex_hull.py, and tests to use the new Dataset access patterns internally, so CausalPy never warns against itself.
InversePropensityWeighting, InstrumentalVariable, and StaggeredDiD are intentionally not migrated: their design matrices have non-standard layouts (IPW pairs covariates with a treatment vector t, IV has an instrument matrix Z plus an endogenous treatment, StaggeredDiD maintains a train/full split) that don't fit the shared X/y Dataset shape, so they keep their existing attributes with no deprecation shims.
UML diagrams (docs/source/_static/classes.png / packages.png) regenerated via make uml to reflect the consolidated attributes.

Closes #199. Follow-up notebook migration tracked in #848. Removal of the deprecation shims is tracked in draft PR #958 (do not merge that until a release has shipped with the warnings in place).

Test plan

All 1102 existing tests pass (0 failures), including a parametrized backward-compatibility suite (test_deprecated_design_aliases.py) covering every alias on every migrated class
prek run --all-files passes (lint, format, mypy, codespell, notebook validation)
Deprecated properties still work (notebooks and external code unaffected)

Made with Cursor

Add _predictor_data_name and _target_data_name class attributes to PyMCModel so subclasses using non-default data node names can customize without re-implementing _data_setter. Validate at predict time and raise a clear ValueError if expected nodes are missing. Also fix pre-existing mypy type: ignore codes in panel_regression.py (attr-defined -> union-attr). Made-with: Cursor

Remove _predictor_data_name / _target_data_name class attributes (added complexity for a case no existing subclass needs). Keep the validation that raises a clear ValueError when expected data nodes are missing. Revert undeclared y_dtype behavioral change. Improve test coverage with separate X-missing and y-missing error paths. Made-with: Cursor

Bundle loose xr.DataArray attributes on experiment classes into xr.Dataset objects to reduce attribute sprawl. Pre/post classes (ITS, SC) use pre_design/post_design; formula-based classes use a single design Dataset. Deprecated @Property accessors preserve backward compatibility. Closes #199. Made-with: Cursor

codecov · 2026-04-15T20:20:47Z

Codecov Report

❌ Patch coverage is 98.50187% with 4 lines in your changes missing coverage. Please review.
✅ Project coverage is 95.16%. Comparing base (0197f17) to head (db9545c).

Files with missing lines	Patch %	Lines
causalpy/tests/test_pymc_models.py	93.18%	2 Missing and 1 partial ⚠️
causalpy/experiments/base.py	93.75%	0 Missing and 1 partial ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #849      +/-   ##
==========================================
+ Coverage   95.14%   95.16%   +0.02%     
==========================================
  Files          92       93       +1     
  Lines       14860    15030     +170     
  Branches      890      896       +6     
==========================================
+ Hits        14138    14304     +166     
- Misses        505      507       +2     
- Partials      217      219       +2

☔ View full report in Codecov by Harness.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

drbenvincent

Adversarial review from gpt-5.4-xhigh.

causalpy/checks/convex_hull.py still calls deprecated aliases. ConvexHullCheck.run() reads sc.datapre_control and sc.datapre_treated, which on this branch now route through the new deprecation shims. I reproduced this locally and the check emits:
- datapre_control is deprecated, use pre_design['control']
- datapre_treated is deprecated, use pre_design['treated']
That means a normal synthetic-control workflow now makes CausalPy warn against itself. Immediate fix direction: do a repo-wide sweep for internal uses of the deprecated names, switch those callers to design[...] / pre_design[...] / post_design[...], and add a warning-focused test in causalpy/tests/test_cross_cutting_checks.py asserting that ConvexHullCheck.run() stays warning-free.
The refactor still has a lot of duplicate migration boilerplate, and that is making the change less elegant than it could be. Right now the same migration pattern is hand-written across the experiment classes:
- build one or two xr.Datasets with near-identical X/y or control/treated wrapping logic,
- preserve temporary raw arrays just long enough to build those datasets,
- add near-identical deprecated properties that warn and then forward to the new dataset-backed location.
You can see this pattern repeated in diff_in_diff.py, panel_regression.py, piecewise_its.py, prepostnegd.py, regression_discontinuity.py, regression_kink.py, plus the split pre/post variants in interrupted_time_series.py and synthetic_control.py. The practical problem is not just aesthetics: every repeated shim and dataset wrapper is another place to miss an internal migration, drift in warning text, or leave coverage behind. The ConvexHullCheck miss feels like a direct symptom of that copy-paste surface area.

Suggested fix direction: centralize more of this in BaseExperiment (or a small mixin/helper module) so subclasses only describe what is experiment-specific. For example:
- one helper for the common single-design case that builds the standard {"X", "y"} dataset given raw arrays, observation index, labels, and treated-unit labels;
- one helper for split designs so ITS-style classes can reuse the same builder for pre/post periods;
- one shared deprecation-forwarding helper so the compatibility properties become one-liners instead of each class repeating its own warnings.warn(...); return self.design[...] block.
I would prefer that over adding more bespoke per-class migration code. If you want to avoid magic, you do not need a dynamic __getattr__; even explicit properties backed by a shared helper would already remove most of the duplication.
The backward-compatibility layer is still under-tested, and the remote results reflect that. Every non-Codecov check is green, but codecov/patch is failing at 84.71% with 37 missing lines, concentrated in the new deprecated properties across the experiment classes. Since preserving old attribute access is one of the main claims of the PR, I’d expect targeted tests that exercise those legacy aliases and assert both the warning and the data equivalence to the new dataset-backed API.

Suggested fix direction: add a small parametrized compatibility test suite rather than lots of bespoke tests. I would cover:
- representative single-design aliases (X, y) and split-design aliases (pre_X, pre_y, post_X, post_y);
- the synthetic-control aliases (datapre_control, datapre_treated, datapost_control, datapost_treated);
- equality of the returned object/data with the new dataset-backed access path;
- correct deprecation behavior via pytest.deprecated_call() or explicit warning capture.
If you centralize the alias metadata, you can drive these tests from the same mapping and shrink both implementation duplication and test duplication at the same time.

Net: I like the direction of moving related arrays into xr.Datasets, but I don’t think this is ready yet. The current version is functionally plausible, but not yet simple or especially elegant because the migration logic is still spread across too many classes and one real regression has already slipped through. I’d first eliminate internal uses of deprecated aliases, then centralize the migration helpers, then land this with focused compatibility coverage.

Move _build_design_dataset helper and __getattr__ deprecation forwarding into BaseExperiment, replacing per-class @Property blocks with a declarative _deprecated_design_aliases dict. Fix convex_hull.py and maketables_adapters.py to use the new API directly. Add parametrized backward-compatibility tests covering all deprecated aliases. Made-with: Cursor

read-the-docs-community · 2026-04-15T21:05:40Z

Documentation build overview

📚 causalpy | 🛠️ Build #33076655 | 📁 Comparing db9545c against latest (1cf3387)

🔍 Preview build

393 files changed · + 65 added · ± 327 modified · - 1 deleted

+ Added

± Modified

- Deleted

api/generated/causalpy.experiments.base.BaseExperiment.plot.html

drbenvincent

Follow-up adversarial review from gpt-5.4-xhigh.

No new blocking findings from me on the latest revision.

What satisfies my earlier review:

The duplicate migration boilerplate is now materially reduced. Moving deprecated alias forwarding into BaseExperiment.__getattr__ with a declarative _deprecated_design_aliases mapping is the kind of centralization I was asking for, and _build_design_dataset() usefully consolidates the common design["X"] / design["y"] wrapping path.
The self-warning regression is fixed. causalpy/checks/convex_hull.py now reads pre_design[...] directly, and causalpy/maketables_adapters.py also no longer relies on deprecated design aliases internally.
The backward-compatibility contract is now tested in the right way. causalpy/tests/test_deprecated_design_aliases.py checks both deprecation behavior and data identity, and it adds a warning-free test for the convex-hull path.
The remote results now support the implementation: codecov/patch is green, both test jobs are green, notebooks are green, docs are green, and prek is green.

I also ran the two most relevant local test targets from this follow-up:

pytest causalpy/tests/test_deprecated_design_aliases.py -q
pytest causalpy/tests/test_maketables_plugin.py -q

Both passed locally. The standalone invocations tripped the repo-wide coverage floor, which is expected for narrow pytest runs here and not a concern for the PR itself.

Residual note, not a blocker: the shared dataset helper currently covers the standard X/y layout but not the SyntheticControl control/treated layout, so there is still some experiment-specific wrapping code there. That no longer looks like problematic duplication to me; the important compatibility and warning-forwarding boilerplate is now centralized.

This satisfies my earlier objections.

…y-datasets

cetagostini · 2026-04-27T13:21:59Z


    _default_model_class: type[PyMCModel] | None = None

+    _deprecated_design_aliases: dict[str, tuple[str, str]] = {}


Quick one — InversePropensityWeighting, InstrumentalVariable, and StaggeredDiD didn't get migrated to design (still using numpy self.X / self.y, see e.g. causalpy/experiments/inverse_propensity_weighting.py:111, instrumental_variable.py:155, staggered_did.py:332-338) and accordingly don't show up in _deprecated_design_aliases here. Assuming intentional given their non-standard layouts (IPW has t + outcome, IV has Z + endogenous treatment, staggered has the train/full split)? If so, maybe worth a sentence in the PR body so it's not mistaken for an oversight.

Yes, intentional — IPW, IV, and StaggeredDiD have non-standard design layouts (IPW pairs covariates with a treatment vector, IV has an instrument matrix plus endogenous treatment, StaggeredDiD keeps a train/full split) that don't fit the shared X/y Dataset shape. I've added a sentence to the PR body making this explicit. Note that SyntheticDifferenceInDifferences (added on main in #823 after this branched) did fit the SC-style four-quadrant pattern, so it has now been migrated to pre_design/post_design with the same deprecated aliases.

cetagostini · 2026-04-27T13:24:02Z

+                self.pre_design["treated"].isel(treated_units=i).values,
+                self.pre_design["control"].values,


Heads-up on the input convention: this call site converts to numpy via .values, but the new ConvexHullCheck.run in causalpy/checks/convex_hull.py:60-62 hands the same data to check_convex_hull_violation as xarray.DataArray directly (no .values). Both work — I checked end-to-end with violation cases and got identical results — but it's worth normalizing on one style across the two call sites and adding a one-liner in the helper's docstring saying it accepts numpy or xarray. Otherwise this becomes a quiet footgun the first time someone changes the helper to assume one shape.

Good catch — normalized both call sites on xarray: SyntheticControl._check_convex_hull now passes the DataArrays directly (no .values), matching ConvexHullCheck.run, and check_convex_hull_violation's docstring and type hints now state it accepts np.ndarray or xr.DataArray.

cetagostini · 2026-04-27T13:24:02Z

+        datapre_control = sc.pre_design["control"]  # type: ignore[attr-defined]
+        datapre_treated = sc.pre_design["treated"]  # type: ignore[attr-defined]


Tiny nit: since applicable_methods = {SyntheticControl} and validate() already enforces the type, you could drop both # type: ignore[attr-defined] markers by adding assert isinstance(experiment, SyntheticControl) (or just calling self.validate(experiment)) at the top of run() — that narrows sc to SyntheticControl and pre_design becomes a known attribute for mypy. Same effect, no escape hatches.

Done — run() now calls self.validate(experiment) followed by an isinstance assert to narrow the type, and both type: ignore markers are gone. mypy passes via prek.

cetagostini · 2026-04-27T13:24:03Z

+        for name in ("X", "y"):
+            if name not in self.named_vars:
+                raise ValueError(
+                    f"Data node '{name}' not found in model. "


Optional polish: the class docstring above already points users at BayesianBasisExpansionTimeSeries as a concrete override example. Worth echoing that in the runtime message itself so users hitting this in a notebook don't have to dig — e.g. f"... override _data_setter() (see BayesianBasisExpansionTimeSeries for an example).". Pure DX nicety, not blocking.

Done — the runtime message now ends with "override _data_setter() (see BayesianBasisExpansionTimeSeries for an example)."

…y-datasets Co-authored-by: Cursor <cursoragent@cursor.com> # Conflicts: # causalpy/experiments/base.py # causalpy/experiments/diff_in_diff.py # causalpy/experiments/panel_regression.py # causalpy/experiments/piecewise_its.py # causalpy/pymc_models.py

- Migrate SyntheticDifferenceInDifferences (added on main in #823) to the pre_design/post_design Dataset pattern with deprecated aliases, matching SyntheticControl, and extend the alias test suite to cover it. - Normalize convex-hull call sites on xarray inputs and document that check_convex_hull_violation accepts numpy or xarray. - Replace type: ignore escape hatches in ConvexHullCheck.run with an isinstance assertion that narrows the type for mypy. - Point users at BayesianBasisExpansionTimeSeries in the _data_setter error message. Co-authored-by: Cursor <cursoragent@cursor.com>

Co-authored-by: Cursor <cursoragent@cursor.com>

drbenvincent added 3 commits April 15, 2026 20:32

github-actions Bot added the refactor Refactor, clean up, or improvement with no visible changes to the user label Apr 15, 2026

drbenvincent mentioned this pull request Apr 15, 2026

Update notebooks to use new xr.Dataset API #848

Open

6 tasks

drbenvincent commented Apr 15, 2026

View reviewed changes

drbenvincent requested review from NathanielF and juanitorduz April 15, 2026 21:08

drbenvincent marked this pull request as ready for review April 15, 2026 21:08

Merge remote-tracking branch 'origin/main' into 199-consolidate-xarra…

ba0faf5

…y-datasets

cetagostini reviewed Apr 27, 2026

View reviewed changes

drbenvincent mentioned this pull request Jun 10, 2026

Exploratory experiment class refactor, focussing on InterruptedTimeSeries #524

Draft

drbenvincent and others added 2 commits June 10, 2026 14:44

drbenvincent mentioned this pull request Jun 10, 2026

Remove deprecated design-matrix alias shims (follow-up to #849) #958

Draft

6 tasks

Update UML diagrams after design-dataset consolidation

db9545c

Co-authored-by: Cursor <cursoragent@cursor.com>

drbenvincent merged commit 984163e into main Jun 10, 2026
19 checks passed

drbenvincent deleted the 199-consolidate-xarray-datasets branch June 10, 2026 14:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consolidate experiment design-matrix attributes into xr.Dataset#849

Consolidate experiment design-matrix attributes into xr.Dataset#849
drbenvincent merged 8 commits into
mainfrom
199-consolidate-xarray-datasets

drbenvincent commented Apr 15, 2026 •

edited

Loading

Uh oh!

codecov Bot commented Apr 15, 2026 •

edited

Loading

Uh oh!

drbenvincent left a comment •

edited

Loading

Uh oh!

read-the-docs-community Bot commented Apr 15, 2026 •

edited

Loading

Uh oh!

drbenvincent left a comment

Uh oh!

cetagostini Apr 27, 2026

Uh oh!

drbenvincent Jun 10, 2026

Uh oh!

cetagostini Apr 27, 2026

Uh oh!

drbenvincent Jun 10, 2026

Uh oh!

cetagostini Apr 27, 2026

Uh oh!

drbenvincent Jun 10, 2026

Uh oh!

cetagostini Apr 27, 2026

Uh oh!

drbenvincent Jun 10, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants


		_default_model_class: type[PyMCModel] \| None = None

		_deprecated_design_aliases: dict[str, tuple[str, str]] = {}

		self.pre_design["treated"].isel(treated_units=i).values,
		self.pre_design["control"].values,

		datapre_control = sc.pre_design["control"] # type: ignore[attr-defined]
		datapre_treated = sc.pre_design["treated"] # type: ignore[attr-defined]

Conversation

drbenvincent commented Apr 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Uh oh!

codecov Bot commented Apr 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

drbenvincent left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

read-the-docs-community Bot commented Apr 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Documentation build overview

Uh oh!

drbenvincent left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

drbenvincent commented Apr 15, 2026 •

edited

Loading

codecov Bot commented Apr 15, 2026 •

edited

Loading

drbenvincent left a comment •

edited

Loading

read-the-docs-community Bot commented Apr 15, 2026 •

edited

Loading