fix: tighten oneshot memory ordering by tisonkun · Pull Request #111 · fast/mea

tisonkun · 2026-05-13T01:07:06Z

Summary

tighten oneshot sender/receiver state transitions to synchronize final channel ownership handoff
avoid racing with the sender when dropping a pending async receiver by first moving RECEIVING back to EMPTY
add concurrent send/drop with try_recv/poll regression tests inspired by upstream oneshot Miri coverage

Verification

cargo x test
cargo +nightly x lint

Copilot

Pull request overview

Tightens the oneshot channel’s state-transition memory orderings to better synchronize message/waker visibility and final channel deallocation ownership, and adds regression tests covering concurrent send/drop vs try_recv/poll.

Changes:

Adjusted atomic orderings and added dedicated acquire fences around MESSAGE-handling paths to explicitly synchronize message visibility and final deallocation handoff.
Updated drop paths (Sender/Receiver/Recv) to better coordinate which side frees the channel under races.
Added concurrent send/drop completion tests for both try_recv and Future::poll paths.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
`mea/src/oneshot/mod.rs`	Updates atomic orderings/fences and drop-path state transitions to strengthen synchronization and ownership handoff.
`mea/src/oneshot/tests.rs`	Adds multi-threaded regression tests for concurrent send/drop vs `try_recv`/`poll`, plus a small thread helper.

Comments suppressed due to low confidence (3)

mea/src/oneshot/tests.rs:363

This test’s receiver thread loops with spin_loop() until Disconnected is observed; without any timeout/iteration cap, failures can turn into an infinite hang during join(). Consider adding a timeout/budget and failing fast if the expected state isn’t reached.

    let receiver_thread = spawn_named("receiver", move || {
        loop {
            match receiver.try_recv() {
                Ok(value) => panic!("unexpected value: {value}"),
                Err(TryRecvError::Empty) => spin_loop(),
                Err(TryRecvError::Disconnected) => break,
            }
        }

mea/src/oneshot/tests.rs:389

This test polls in a tight unbounded spin_loop() until completion. If the wake/state transition breaks, the test will hang indefinitely at join(). Add a bounded timeout/iteration budget (and panic on timeout) to keep CI reliable.

        loop {
            match Pin::new(&mut receiver).poll(&mut context) {
                Poll::Ready(Ok(999)) => break,
                Poll::Ready(result) => panic!("unexpected result: {result:?}"),
                Poll::Pending => spin_loop(),
            }
        }

mea/src/oneshot/tests.rs:415

Similar to the other concurrent tests, this thread spins in an unbounded loop until it observes the disconnect. A regression could cause an infinite hang. Add a timeout/iteration cap and fail fast rather than spinning forever.

        loop {
            match Pin::new(&mut receiver).poll(&mut context) {
                Poll::Ready(Err(oneshot::RecvError::Disconnected)) => break,
                Poll::Ready(result) => panic!("unexpected result: {result:?}"),
                Poll::Pending => spin_loop(),
            }
        }

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

+        // If this receiver was previously polled, but was not polled to completion, then the
+        // channel is in the RECEIVING state and has a waker written. We must move away from the
+        // RECEIVING state before dropping the waker, otherwise we could race with the sender
+        // taking the same waker and deallocating the channel.
+        if channel.state.load(Ordering::Relaxed) == RECEIVING
+            && channel
+                .state
+                .compare_exchange(RECEIVING, EMPTY, Ordering::Relaxed, Ordering::Relaxed)
+                .is_ok()
+        {
+            // SAFETY: The RECEIVING state guarantees we have written a waker.
+            unsafe { channel.drop_waker() };
+        }


+    let receiver_thread = spawn_named("receiver", move || {
+        loop {
+            match receiver.try_recv() {
+                Ok(999) => break,
+                Ok(value) => panic!("unexpected value: {value}"),
+                Err(TryRecvError::Empty) => spin_loop(),
+                Err(TryRecvError::Disconnected) => panic!("unexpected disconnect"),
+            }
+        }


orthur2 · 2026-05-13T18:20:57Z

Memory ordering here is tough to think through and guarantee correctness, so my review may not be very comprehensive.

First and foremost, following Copilot's review, Recv::drop needs acquire synchronization before dropping the stored waker. Channel::write_waker publishes the waker with a Release CAS, but the new RECEIVING -> EMPTY CAS in Recv::drop is Relaxed. Since Recv<T> is Send, the future may be polled on one thread and dropped on another, so Relaxed is too weak here. And the Recv::poll waker-replacement path also has the same issue.

Then, I found an issue that existed before this PR. I am concerned that the AWAKING branch in Recv::drop can still deallocate the channel too early. This path writes DISCONNECTED before it handles AWAKING. So if the sender has just moved the state to AWAKING and is still taking the waker, the drop path can read back its own DISCONNECTED write and free the channel too early.

As Copilot noted, the new spin-loop tests should also be bounded with a timeout or iteration budget.

orthur2 · 2026-05-13T18:25:03Z

+        // ORDERING: Release is required so that in the states where the sender becomes responsible
+        // for deallocating the channel, they can synchronize with this final state write from us.
+        // Acquire is required by the branches below to synchronize with writes from the sender.
+        match channel.state.swap(DISCONNECTED, Ordering::AcqRel) {


I think this can free the channel too early. When this swap() sees AWAKING, it also writes DISCONNECTED, so the loop below may just read our own write back and stop waiting while the sender is still taking the waker or publishing the final state.

fix: tighten oneshot memory ordering

f1166a8

tisonkun requested review from Copilot and orthur2 May 13, 2026 03:14

Copilot started reviewing on behalf of tisonkun May 13, 2026 03:15 View session

Copilot AI reviewed May 13, 2026

View reviewed changes

tisonkun marked this pull request as draft May 13, 2026 03:20

orthur2 reviewed May 13, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: tighten oneshot memory ordering#111

fix: tighten oneshot memory ordering#111
tisonkun wants to merge 1 commit into
mainfrom
codex/oneshot-ordering-fixes

tisonkun commented May 13, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

orthur2 commented May 13, 2026

Uh oh!

orthur2 May 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

tisonkun commented May 13, 2026

Summary

Verification

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

orthur2 commented May 13, 2026

Uh oh!

orthur2 May 13, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants