Skip to content

fix resuming experiments#126

Merged
clemsgrs merged 13 commits into
mainfrom
atomic-process-list-writes
May 4, 2026
Merged

fix resuming experiments#126
clemsgrs merged 13 commits into
mainfrom
atomic-process-list-writes

Conversation

@clemsgrs
Copy link
Copy Markdown
Owner

@clemsgrs clemsgrs commented May 1, 2026

Summary

  • a crashed mid-write to process_list.csv previously left a corrupted CSV that resume could not parse, forcing slides to be re-processed even though their artifacts were on disk
  • adds slide2vec.utils.tiling_io.atomic_write_dataframe_csv (temp file + atomic rename) and routes the two non-atomic writers through it: update_process_list_after_embedding (called per slide during embedding) and record_slide_metadata_in_process_list
  • pairs with the matching incremental-write fix in hs2p so resume is robust to crashes at any pipeline stage.

A crashed mid-write previously left a corrupted CSV that resume could
not parse. Both per-slide embedding updates and the post-tiling slide
metadata recorder now write to a temp file and rename onto the target,
so resume can always trust process_list.csv.
@clemsgrs clemsgrs changed the title Make process_list.csv writes atomic make process_list.csv writes atomic May 1, 2026
@clemsgrs clemsgrs merged commit 2ea013b into main May 4, 2026
3 checks passed
@clemsgrs clemsgrs deleted the atomic-process-list-writes branch May 4, 2026 14:17
@clemsgrs clemsgrs changed the title make process_list.csv writes atomic fix resuming experiments May 4, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant