Add cross_cov_matrix to transition data by xjules · Pull Request #13416 · equinor/ert

xjules · 2026-04-24T13:49:38Z

Issue
Resolves #13296
Relates to #13378

Approach
This introduces:

AnalysisMatrixEvent - for sending the matrix to update_run_model.
AnalysisStorageEvent - for storing the event
ensemble endpoint updated to account for blobs

Update: I might need to re-think this a bit due to fact when loading the data back how the endpoint should look like.

(Screenshot of new behavior in GUI if applicable)

PR title captures the intent of the changes, and is fitting for release notes.
Added appropriate release note label
Commit history is consistent and clean, in line with the contribution guidelines.
Make sure unit tests pass locally after every commit (git rebase -i main --exec 'just rapid-tests')

When applicable

When there are user facing changes: Updated documentation
New behavior or changes to existing untested code: Ensured that unit tests are added (See Ground Rules).
Large PR: Prepare changes in small commits for more convenient review
Bug fix: Add regression test for the bug
Bug fix: Add backport label to latest release (format: 'backport release-branch-name')

codecov-commenter · 2026-04-24T14:02:04Z

Codecov Report

❌ Patch coverage is 97.46835% with 2 lines in your changes missing coverage. Please review.
✅ Project coverage is 89.54%. Comparing base (9def804) to head (1e551ee).
⚠️ Report is 8 commits behind head on main.
✅ All tests successful. No failed tests found.

Files with missing lines	Patch %	Lines
src/ert/dark_storage/endpoints/ensembles.py	84.61%	2 Missing ⚠️

Additional details and impacted files

@@           Coverage Diff           @@
##             main   #13416   +/-   ##
=======================================
  Coverage   89.54%   89.54%           
=======================================
  Files         464      464           
  Lines       32776    32845   +69     
=======================================
+ Hits        29349    29411   +62     
- Misses       3427     3434    +7

Flag	Coverage Δ
cli-tests	`35.83% <68.35%> (+0.07%)`	⬆️
fuzz	`43.93% <45.56%> (+0.11%)`	⬆️
gui-tests	`59.81% <64.55%> (-0.01%)`	⬇️
performance-and-unit-tests	`78.10% <97.46%> (+0.02%)`	⬆️
test	`45.38% <39.24%> (-0.02%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

Files with missing lines	Coverage Δ
src/ert/analysis/__init__.py	`100.00% <ø> (ø)`
src/ert/analysis/_enif_update.py	`96.19% <ø> (ø)`
src/ert/analysis/_es_update.py	`94.35% <ø> (ø)`
src/ert/analysis/_update_strategies/_adaptive.py	`98.24% <100.00%> (+0.32%)`	⬆️
src/ert/analysis/event.py	`98.27% <100.00%> (+0.09%)`	⬆️
src/ert/dark_storage/json_schema/__init__.py	`100.00% <100.00%> (ø)`
src/ert/dark_storage/json_schema/ensemble.py	`100.00% <100.00%> (ø)`
src/ert/run_models/ensemble_information_filter.py	`100.00% <ø> (ø)`
src/ert/run_models/event.py	`100.00% <100.00%> (ø)`
src/ert/run_models/manual_update_enif.py	`93.75% <ø> (ø)`
... and 5 more

... and 4 files with indirect coverage changes

codspeed-hq · 2026-04-24T14:38:10Z

Merging this PR will not alter performance

✅ 36 untouched benchmarks

_{Comparing xjules:store_cross_cov_mtx (1e551ee) with main (c81c829)}

xjules · 2026-05-05T06:41:17Z

Locally the test passes.
I've created an issue for the flaky test: #13475

This adds a new update event AnalysisMatrixEvent, which sends the correlation matrix in the callback as a part of the event. Is is saved to posterior / transition storage section together with serialized event. Add AnalysisStorageEvent and sparse flag Rename posterior_id to ensemble_id Add artifacts endpoint This returns all the AnalysisStorageEvents as a list Add update endpoint Save matrix after threshold being applied Replace transition with blob Rebase with main Fixup for corr matrix to bytes conv Make progress_callback a partial function to provide ensemble automatyically Fixups for posterior ensemble Fixup test

jonathan-eq · 2026-05-22T11:58:32Z

+            buf = io.BytesIO()
+            np.save(buf, corr_XY_matrix)


Do we have to do this manually, or can we use numpy.array.tobytes() directly?

The difference is the tobytes just stores the data itself while np.save saves also the header, which might be a thing we want.

xjules · 2026-05-27T13:24:01Z

+            sp.sparse.save_npz(blob_path, sparse_blob)
+        else:
+            blob_path = blob_dir / f"{stem}.npy"
+            np.save(blob_path, blob)


this should be bytes.

xjules · 2026-05-27T13:38:12Z

    userdata: Mapping[str, Any] = {}


+class BlobOut(BaseModel):


Replace with the actual BlobStorageData | BlobStorageMatrix and init with validate_python

xjules · 2026-05-27T13:49:03Z

    uri: str
    file_size: int
    ensemble_id: str
+


update_algorithm: ...

xjules · 2026-05-27T13:49:39Z

+
+
+class MatrixStorageData(BlobStorageData):
+    sparse: bool = False


dtype: float64, ...

xjules · 2026-05-27T13:51:16Z

@@ -15,3 +16,8 @@ class BlobStorageData(BaseModel):
    uri: str


{uuid}.blob <- bytes

xjules · 2026-05-27T13:52:44Z

@@ -15,3 +16,8 @@ class BlobStorageData(BaseModel):
    uri: str
    file_size: int
    ensemble_id: str


file_type: "parquet", "numpy"

dafeda · 2026-05-28T13:01:56Z

+    data_type = str(matrix.dtype)
+
+    sparsity = 1.0 - (np.count_nonzero(matrix) / matrix.size)
+    sparse = bool(sparsity > 0.5)


Is there a good reason for 0.5?

xjules self-assigned this Apr 24, 2026

xjules force-pushed the store_cross_cov_mtx branch 3 times, most recently from 903391e to 71010a6 Compare April 29, 2026 10:57

xjules marked this pull request as ready for review April 29, 2026 10:57

xjules force-pushed the store_cross_cov_mtx branch from 7fcb860 to 77e71a6 Compare May 4, 2026 13:38

xjules mentioned this pull request May 5, 2026

Transition data: Add local storage API for rho matrices #12996

Open

xjules force-pushed the store_cross_cov_mtx branch 2 times, most recently from db79d7f to 291f99c Compare May 6, 2026 07:04

xjules force-pushed the store_cross_cov_mtx branch 2 times, most recently from c050e3d to 41e1e07 Compare May 21, 2026 12:54

xjules force-pushed the store_cross_cov_mtx branch from 427c302 to db39c86 Compare May 21, 2026 21:00

xjules added 4 commits May 22, 2026 09:36

Simplify test

437ffe7

More updates

e0444b2

Add test for save blob as matrix

c2607c4

Fixup

1bc26c7