fix resuming experiments by clemsgrs · Pull Request #126 · clemsgrs/slide2vec

clemsgrs · 2026-05-01T19:36:47Z

Summary

a crashed mid-write to process_list.csv previously left a corrupted CSV that resume could not parse, forcing slides to be re-processed even though their artifacts were on disk
adds slide2vec.utils.tiling_io.atomic_write_dataframe_csv (temp file + atomic rename) and routes the two non-atomic writers through it: update_process_list_after_embedding (called per slide during embedding) and record_slide_metadata_in_process_list
pairs with the matching incremental-write fix in hs2p so resume is robust to crashes at any pipeline stage.

A crashed mid-write previously left a corrupted CSV that resume could not parse. Both per-slide embedding updates and the post-tiling slide metadata recorder now write to a temp file and rename onto the target, so resume can always trust process_list.csv.

clemsgrs changed the title ~~Make process_list.csv writes atomic~~ make process_list.csv writes atomic May 1, 2026

clemsgrs added 12 commits May 1, 2026 20:44

bump hs2p version

589d5fb

fallback for CIFS process_list writes

8b0000d

Simplify Docker Python installation

0b94d54

Keep local task notes out of repo

ad35952

bump python version to 3.11

f02ee3b

bump hs2p version

c9bd91a

fix docker missing dep

c364852

Update process list during distributed embedding

abebd77

Install GPG tooling before adding Deadsnakes PPA

c79ef0e

fix test

fa00a9a

Fix resume embedding skips

59d8d35

bump hs2p version

39a73a9

clemsgrs merged commit 2ea013b into main May 4, 2026
3 checks passed

clemsgrs deleted the atomic-process-list-writes branch May 4, 2026 14:17

clemsgrs changed the title ~~make process_list.csv writes atomic~~ fix resuming experiments May 4, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix resuming experiments#126

fix resuming experiments#126
clemsgrs merged 13 commits into
mainfrom
atomic-process-list-writes

clemsgrs commented May 1, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

clemsgrs commented May 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

clemsgrs commented May 1, 2026 •

edited

Loading