ring: compute toSubmit from the kernel-consumed SQ head (liburing-style) — EBUSY can strand published SQEs

**Severity: low** — latent edge case, likely rare-to-unreachable on modern kernels, but the fix is ~3 lines and matches the reference implementation.

## The bookkeeping assumption

The guarantee chain for every op is *staged → counted into an enter → consumed by the kernel → CQE*. Step two rests on an assumption: every published SQE is consumed by the enter that first counted it.

`Ring.SubmitAndWait` computes what to submit from **its own last-published tail** ([Ring.cs#L127-L142](https://github.com/MDA2AV/ioxide/blob/cf2fce344eea6d4dc023745c4d4d9687a720a448/ioxide/io_uring/Ring.cs#L127-L142)):

```csharp
uint published = *_sqTail;            // our last-published tail
uint toSubmit  = _sqeTail - published;
if (toSubmit > 0) Volatile.Write(ref *_sqTail, _sqeTail);   // publish BEFORE enter
return io_uring_enter(_fd, toSubmit, waitFor, flags);
```

liburing instead derives pending work from the **kernel-consumed head**: `sq_ready = tail − acquire-load(*khead)`, so entries that were published but not consumed by a previous enter are re-counted on the next call.

## Failure scenario

1. Tail is published, `io_uring_enter` returns `-EBUSY`/`-EAGAIN` (e.g. CQ-overflow backpressure) having consumed **zero** entries. The loop tolerates the errno and continues ([Reactor.Loop.SharedRing.cs#L57-L62](https://github.com/MDA2AV/ioxide/blob/cf2fce344eea6d4dc023745c4d4d9687a720a448/ioxide/Reactor/Loop/Reactor.Loop.SharedRing.cs#L57-L62)).
2. Next iteration: `published == _sqeTail`, so `toSubmit = 0` — the stranded entries are in the ring but never counted into any subsequent `to_submit`.
3. They drain only as *later* staged SQEs bump the count (the kernel consumes FIFO from its head): the 250 ms timer re-arm trickles them out roughly one per tick, each new SQE releasing one stranded op and stranding itself.

Consequence: operations delayed by seconds under exactly the conditions that produce `-EBUSY` (overload) — the worst possible timing. Not a leak, not a deadlock (the timer guarantees eventual drain), but a latency anomaly that would be near-impossible to diagnose without knowing this mechanism.
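The scenario above can be reproduced in a toy model. This is a minimal sketch, not the real `Ring`: `ToySq`, `StrandingDemo`, and all of their members are hypothetical names, and `Enter` only mimics the relevant part of `io_uring_enter` (consume up to `toSubmit` entries from the head, or fail with `-EBUSY` having consumed none).

```csharp
using System;

// Toy model of the SQ indices; hypothetical, not the real Ring type.
public sealed class ToySq
{
    public uint KernelHead;     // stands in for the kernel-written consumed head
    public uint PublishedTail;  // stands in for *_sqTail
    public uint LocalTail;      // stands in for _sqeTail
    public bool NextEnterBusy;  // force one -EBUSY that consumes nothing

    private const int EBUSY = 16;

    // Entries that are published but not yet consumed by the kernel.
    public uint Stranded => PublishedTail - KernelHead;

    public void Stage(uint n) => LocalTail += n;

    // Mimics io_uring_enter: consumes FIFO from the head, at most toSubmit entries.
    private int Enter(uint toSubmit)
    {
        if (NextEnterBusy) { NextEnterBusy = false; return -EBUSY; }
        uint consumed = Math.Min(toSubmit, PublishedTail - KernelHead);
        KernelHead += consumed;
        return (int)consumed;
    }

    // Current accounting: count only what this call newly publishes.
    public int SubmitTailBased()
    {
        uint toSubmit = LocalTail - PublishedTail;
        PublishedTail = LocalTail;   // publish before enter
        return Enter(toSubmit);
    }
}

public static class StrandingDemo
{
    // Returns the stranded count after the -EBUSY and after each of three timer ticks.
    public static uint[] Run()
    {
        var sq = new ToySq();
        sq.Stage(4);
        sq.NextEnterBusy = true;
        sq.SubmitTailBased();            // -EBUSY: 4 published, 0 consumed

        var history = new uint[4];
        history[0] = sq.Stranded;
        for (int tick = 1; tick <= 3; tick++)
        {
            sq.Stage(1);                 // e.g. the timer re-arm
            sq.SubmitTailBased();        // toSubmit == 1: oldest entry released, newest stranded
            history[tick] = sq.Stranded;
        }
        return history;
    }
}
```

In this model the backlog never shrinks: every tick releases the oldest stranded entry and strands the one just staged, so each op waits as many ticks as there were entries ahead of it.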

## Fix

Compute the submit count from the kernel-visible head, liburing-style:

```csharp
uint khead    = Volatile.Read(ref *_sqHead);   // kernel-written consumed head
uint toSubmit = _sqeTail - khead;              // everything staged and not yet consumed
```

Publish the tail as today; `to_submit` then re-covers any stranded entries on the next enter. Additionally, a successful `io_uring_enter` returns the number of SQEs consumed: debug-assert that it equals `toSubmit`, and count shortfalls in the per-reactor stats (#101).
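A minimal sketch of the head-based accounting with the return-value check, again against a toy model rather than the real `Ring`. `HeadBasedSq`, `Shortfalls`, and `FixDemo` are hypothetical names, and counting an error return as a shortfall is an assumption made here, not something the stats in #101 prescribe.

```csharp
using System;
using System.Diagnostics;

// Toy model of the SQ indices; hypothetical, not the real Ring type.
public sealed class HeadBasedSq
{
    public uint KernelHead;     // stands in for the kernel-written consumed head
    public uint PublishedTail;  // stands in for *_sqTail
    public uint LocalTail;      // stands in for _sqeTail
    public bool NextEnterBusy;  // force one -EBUSY that consumes nothing
    public long Shortfalls;     // hypothetical stat: enters that consumed less than requested

    private const int EBUSY = 16;

    public uint Stranded => PublishedTail - KernelHead;

    public void Stage(uint n) => LocalTail += n;

    // Mimics io_uring_enter: consumes FIFO from the head, at most toSubmit entries.
    private int Enter(uint toSubmit)
    {
        if (NextEnterBusy) { NextEnterBusy = false; return -EBUSY; }
        uint consumed = Math.Min(toSubmit, PublishedTail - KernelHead);
        KernelHead += consumed;
        return (int)consumed;
    }

    // Proposed accounting: everything staged and not yet consumed by the kernel.
    public int Submit()
    {
        uint toSubmit = LocalTail - KernelHead;
        PublishedTail = LocalTail;   // publish before enter, as today
        int ret = Enter(toSubmit);

        if (ret < 0 || (uint)ret < toSubmit)
            Shortfalls++;            // assumption: errors count as shortfalls too
        else
            Debug.Assert((uint)ret == toSubmit);
        return ret;
    }
}

public static class FixDemo
{
    // Returns (first enter result, second enter result, stranded count, shortfalls).
    public static (int First, int Second, uint Stranded, long Shortfalls) Run()
    {
        var sq = new HeadBasedSq();
        sq.Stage(4);
        sq.NextEnterBusy = true;
        int first = sq.Submit();     // -EBUSY: 4 published, 0 consumed
        int second = sq.Submit();    // nothing new staged, yet toSubmit == 4
        return (first, second, sq.Stranded, sq.Shortfalls);
    }
}
```

Here the second call re-counts the four entries left behind by the failed enter even though nothing new was staged, so the backlog clears on the next loop iteration instead of draining one entry per timer tick.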

## Notes

- #95 (`IORING_SETUP_CQSIZE`) shrinks the main `-EBUSY` trigger, and `DEFER_TASKRUN` on ≥6.1 may make the zero-consumed case unreachable in practice — the fix is still free insurance and aligns the ring with the reference accounting.
- Related handoff context in #104 ("one caveat at the accounting layer").

