feat(executor): stream command stdout via chunk callback by jisung-02 · Pull Request #320 · alpacax/alpamon

Jisung Chae (jisung-02) · 2026-05-28T08:35:46Z

Summary

Stream system shell command stdout/stderr to alpacon-server in real time via a new /api/events/commands/{id}/chunk/ endpoint, replacing the previous fin-only post-completion model.
A chunkWriter emits chunks on newline boundaries or when the in-memory buffer crosses 4 KB, whichever comes first; partial lines are carried over between writes and flushed at completion.
The CommandRunner owns the monotonic seq counter so multiple chunkWriter instances spawned across shell operators (&&, ||, ;) in executeWithOperators share one series per command_id, preventing the unique (command, seq) collision that the initial implementation had.
The fin POST priority value is raised from 10 to 11, which ranks it below the chunks' priority of 10, so pending chunk POSTs drain ahead of fin. The server marks the command handled on fin and drops late chunks, so ordering is enforced on the alpamon side.
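The hybrid line + size emission rule in the summary can be sketched as below. This is a minimal illustration, not the PR's chunkWriter: the `lineChunker` type, its `emit` field, and the choice to emit one chunk per complete line are assumptions made for the example.

```go
package main

import (
	"bytes"
	"fmt"
	"sync"
)

// chunkThreshold mirrors the 4 KB threshold named in the summary.
const chunkThreshold = 4 * 1024

// lineChunker is a hypothetical stand-in for chunkWriter.
type lineChunker struct {
	mu   sync.Mutex
	buf  []byte
	emit func(content string)
}

// Write emits every complete line and carries the trailing partial line
// over to the next call. A partial line that reaches the threshold is
// emitted without waiting for its newline.
func (w *lineChunker) Write(p []byte) (int, error) {
	w.mu.Lock()
	defer w.mu.Unlock()
	w.buf = append(w.buf, p...)
	for {
		i := bytes.IndexByte(w.buf, '\n')
		if i < 0 {
			break
		}
		w.emit(string(w.buf[:i+1])) // string() copies, so buf can be reused
		w.buf = w.buf[i+1:]
	}
	if len(w.buf) >= chunkThreshold {
		w.emit(string(w.buf))
		w.buf = w.buf[:0]
	}
	return len(p), nil
}

// Flush emits whatever partial line is left when the command completes.
func (w *lineChunker) Flush() {
	w.mu.Lock()
	defer w.mu.Unlock()
	if len(w.buf) > 0 {
		w.emit(string(w.buf))
		w.buf = nil
	}
}

func main() {
	var chunks []string
	w := &lineChunker{emit: func(c string) { chunks = append(chunks, c) }}
	w.Write([]byte("one\ntwo\npar"))
	w.Write([]byte("tial\ntail"))
	w.Flush()
	fmt.Printf("%q\n", chunks) // ["one\n" "two\n" "partial\n" "tail"]
}
```

Write always reports len(p) so the child process's pipe copy never sees a short write, which matches the "Write length return" test listed in the test plan.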

Changes

pkg/executor/executor.go: new chunkWriter (sync io.Writer), ChunkCallback on CommandOptions, ExecWithStreamingHook entry point. runCommand routes through chunkWriter when streaming is requested and falls back to the existing CombinedOutput / Start+Wait paths otherwise.
pkg/executor/handlers/common: CommandExecutor.ExecWithStreamingHook interface, ChunkCallback field on CommandArgs, mock implementation.
pkg/executor/handlers/shell/shell.go: forwards ChunkCallback through executeCommand and executeWithOperators, choosing the streaming or non-streaming executor entry point per call.
pkg/runner/client.go: new eventCommandChunkURL constant.
pkg/runner/command.go: builds the per-command chunk callback closure (owns seq), posts each chunk to the chunk URL, and lowers fin priority by raising its value from 10 to 11.

Test plan

go build ./pkg/executor/... ./pkg/runner/...
go vet ./pkg/executor/... ./pkg/runner/...
go test ./pkg/executor/... -count=1 -p 1 (all packages pass)
New unit tests: chunkWriter newline emission, multi-line single write, 4 KB threshold trigger, partial line carry-over, Flush remainder, Bytes() full output, Write length return
New integration tests: real /bin/sh -c invocation through ExecWithStreamingHook verifies chunk ordering and concatenation equals the returned combined output
Shell streaming regression tests: callback forwarded for single command, callback invoked once per sub-command across && / || / ; with a caller-owned monotonic counter (locks in the seq-collision fix), nil callback falls back to legacy path
go run ./cmd/alpamon boots cleanly against the existing config (connection refused only because no local alpacon-server is running, which is unrelated to this change)

Notes

This depends on the server-side /api/events/commands/{id}/chunk/ endpoint being deployed. Under streaming, command stdout/stderr is delivered solely via chunk POSTs—the fin payload no longer carries the full combined output, only short diagnostics (e.g. privilege-demotion errors and the timeout banner). This keeps the executor from retaining unbounded command output in memory.
Because output is not teed into fin, the chunk endpoint is required for output delivery: if it is unavailable, successful command output is not recovered via fin. The console degrades to diagnostics-only rather than the full pre-streaming output.

Wire a chunk callback through the executor pipeline so system shell commands stream stdout/stderr to alpacon-server in real time.

- Add ChunkCallback on CommandArgs and CommandOptions
- chunkWriter emits chunks on newline boundaries or when the buffer exceeds 4 KB (hybrid line + size strategy)
- ExecWithStreamingHook added to CommandExecutor and the mock
- ShellHandler forwards ChunkCallback through executeCommand and executeWithOperators
- CommandRunner posts each chunk via scheduler.Rqueue to the new /api/events/commands/{id}/chunk/ endpoint

When ChunkCallback is nil the existing CombinedOutput / Start+Wait paths are unchanged.

chunkWriter no longer manages sequence numbers. Multiple chunkWriter instances spawned across shell operators in executeWithOperators previously each restarted seq from 0, which violated the unique (command, seq) constraint on the server. The runner callback now owns a closure-scoped counter so each command_id receives one monotonic seq series across all operator branches.

- ChunkCallback signature simplified to func(content string)
- chunkWriter: drop seq field
- ExecWithStreamingHook and mock signatures updated to match
- CommandRunner: closure captures `var seq int` and increments per chunk
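A closure-scoped counter of the kind this commit describes might look like the sketch below. The `chunk` struct and `newChunkCallback` are hypothetical names for illustration; only the `func(content string)` callback shape and the `var seq int` capture come from the commit message.

```go
package main

import "fmt"

// chunk is a hypothetical payload shape, not the PR's wire type.
type chunk struct {
	Seq     int
	Content string
}

// newChunkCallback returns a func(content string) whose counter lives in the
// closure, so every writer that shares the callback shares one seq series.
// It is not safe for concurrent calls; a mutex or atomic counter would be
// needed if sub-commands could emit in parallel.
func newChunkCallback(post func(chunk)) func(content string) {
	var seq int
	return func(content string) {
		n := seq
		seq++ // advance before posting so a failing post cannot reuse n
		post(chunk{Seq: n, Content: content})
	}
}

func main() {
	var sent []chunk
	cb := newChunkCallback(func(c chunk) { sent = append(sent, c) })
	// Two sub-commands of "a && b" would each get their own writer,
	// but both writers call the same cb.
	for _, out := range []string{"from a\n", "from b\n"} {
		cb(out)
	}
	for _, c := range sent {
		fmt.Printf("%d %q\n", c.Seq, c.Content)
	}
}
```

Because the counter belongs to the callback rather than to any writer, creating a fresh writer per sub-command cannot restart the series.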

- chunkWriter: newline emission, multi-line single write, buffer threshold, Flush, concurrent Writes
- Executor.ExecWithStreamingHook: real /bin/sh -c invocation verifies chunks arrive in order and concatenate to the returned output
- ShellHandler streaming: ChunkCallback forwarded through executeCommand and executeWithOperators for both single and operator-separated runs

Multiple Reporter goroutines share one priority queue; when fin and chunk both used priority 10 their pop order was undefined, letting fin race ahead of trailing chunk POSTs and produce empty CLI output on short commands. Raise the fin priority value so the queue drains chunks first; the server-side fix accepts late chunks as well, but enforcing pop order keeps the common path monotonic.

Copilot

Pull request overview

This PR adds real-time streaming of system command stdout/stderr from Alpamon to Alpacon by introducing a per-command chunk callback that posts output increments to a new /api/events/commands/{id}/chunk/ endpoint, while still returning/sending the full combined output on fin.

Changes:

Added streaming execution support in the executor via a chunkWriter and a new ExecWithStreamingHook API that wires stdout/stderr to a chunk callback.
Plumbed ChunkCallback through runner → handler args → shell handler → executor, ensuring a runner-owned monotonic seq across operator-split subcommands.
Added unit/integration tests for chunk emission behavior and shell/operator forwarding behavior.

Reviewed changes

Copilot reviewed 10 out of 10 changed files in this pull request and generated 2 comments.

| File | Description |
| --- | --- |
| pkg/runner/command.go | Builds per-command chunk callback (seq ownership) and adjusts fin queue priority for chunk-first draining. |
| pkg/runner/client.go | Adds chunk endpoint URL constant. |
| pkg/executor/handlers/shell/shell.go | Forwards chunk callback and selects streaming vs legacy execution paths. |
| pkg/executor/handlers/shell/shell_streaming_test.go | Adds regression tests to ensure callback forwarding and operator-split execution behavior. |
| pkg/executor/handlers/common/args.go | Adds `ChunkCallback` to handler command args. |
| pkg/executor/handlers/common/interfaces.go | Extends `CommandExecutor` with `ExecWithStreamingHook`. |
| pkg/executor/handlers/common/testing.go | Updates mock executor to implement streaming hook and emit chunks. |
| pkg/executor/executor.go | Implements `chunkWriter`, adds `ChunkCallback` to options, and adds streaming execution path + `ExecWithStreamingHook`. |
| pkg/executor/executor_test.go | Adds integration-style test validating streamed chunks concatenate to final output. |
| pkg/executor/chunk_writer_test.go | Adds focused unit tests for newline/threshold/flush behavior and full-output collection. |

- Wrap PIDHook invocations with deferred recover so a bad hook can't crash the agent, matching the CommandOptions.PIDHook contract.
- Reword fin priority comment: priority 11 is lower than chunks' 10, so trailing chunks drain before fin.
- Tighten runCommand and PIDHook docstrings.

Copilot

Pull request overview

Copilot reviewed 10 out of 10 changed files in this pull request and generated 2 comments.

Split partial lines into chunkSizeThreshold-sized pieces in chunkWriter so a single very long no-newline line cannot produce an arbitrarily large payload. Sub-threshold tail stays buffered. Wrap every ChunkCallback invocation (Write + Flush) in a nil-guarded helper that recovers panics, matching the PIDHook contract so a faulty callback cannot crash the agent mid-stream or during teardown.
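The two ideas in this commit, capping the size of a no-newline run and guarding the callback against panics, can be sketched as follows. `splitFull` and `safeEmit` are hypothetical helper names; the PR's own helpers are not shown on this page.

```go
package main

import "fmt"

// splitFull cuts buf into size-byte pieces and returns the sub-threshold
// remainder as tail, which the caller keeps buffered. The pieces alias buf,
// so convert them to strings (or copy) before buf is reused.
func splitFull(buf []byte, size int) (pieces [][]byte, tail []byte) {
	for len(buf) >= size {
		pieces = append(pieces, buf[:size])
		buf = buf[size:]
	}
	return pieces, buf
}

// safeEmit calls cb if it is non-nil and turns a panic in cb into an error,
// so a faulty callback cannot take the agent down.
func safeEmit(cb func(content string), content string) (err error) {
	if cb == nil {
		return nil
	}
	defer func() {
		if r := recover(); r != nil {
			err = fmt.Errorf("chunk callback panicked: %v", r)
		}
	}()
	cb(content)
	return nil
}

func main() {
	pieces, tail := splitFull([]byte("abcdefgh"), 3)
	fmt.Printf("%q %q\n", pieces, tail)

	err := safeEmit(func(string) { panic("boom") }, "x")
	fmt.Println(err)
}
```

The named return value is what lets the deferred function report the recovered panic to the caller.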

Copilot

Pull request overview

Copilot reviewed 10 out of 10 changed files in this pull request and generated 2 comments.

Move to a chunks-only streaming contract: chunkWriter no longer retains the full body, so fin no longer double-ships output and multi-GB streams cannot OOM the agent. The newline path now also splits payloads >4 KB (buffered tail + next Write could exceed the cap). Surface cmd.Start and demote failures so the fin/stream carry diagnostics instead of an empty result.

…cate helpers Funnel the shell handler through a single ExecWithStreamingHook call, merge runCommand's streaming and pidHook branches, and reuse chunkWriter for the demote-failure path so the third nil-guard helper goes away. Move the chunk payload to a typed protocol.CommandChunk to cut per-chunk allocations and give the wire shape one definition.

…output # Conflicts: # pkg/executor/executor.go # pkg/executor/executor_test.go

Copilot

Pull request overview

Copilot reviewed 13 out of 13 changed files in this pull request and generated 2 comments.

The streaming privilege-demotion failure path returned an empty result, so the fin payload omitted the error text and it was only delivered as a chunk. Return the diagnostic in result (keeping the in-band emit) so fin still carries it when chunk delivery fails, consistent with the streaming timeout path. Addresses Copilot review on PR #320.

Copilot

Pull request overview

Copilot reviewed 13 out of 13 changed files in this pull request and generated 1 comment.

Copilot

Pull request overview

Copilot reviewed 13 out of 13 changed files in this pull request and generated 2 comments.

…tion

Copilot

Pull request overview

Copilot reviewed 13 out of 13 changed files in this pull request and generated no new comments.

Trim multi-line rationale comments while preserving intent; no logic changes.

Eunyoung Jeong (eunyoung14)

Review — output delivery reliability & resource usage

The chunkWriter mechanics are solid: memory stays bounded (the strings.Clone avoids pinning a large backing array, and TestChunkWriter_LargeStreamDoesNotRetainBody locks that in), the caller-owned seq counter keeps a single monotonic series across shell operators (&&/||/;), and the callback panic recovery is in place. A few things to consider before merge, all scoped to alpamon itself.

🟠 One POST per line — unbatched, on a shared bounded queue

chunkWriter emits a chunk on every newline; the 4 KB threshold only caps a chunk, it does not batch small lines. So a high-volume command (yes, seq 1 10000000, cat large.log) produces roughly one HTTP POST per output line.

Those POSTs go through the shared scheduler.Rqueue (MaxQueueSize = 36000, drained by 4 reporters). When the queue is full, request() logs-and-drops the entry with no retry. Consequences:

A single chatty command can saturate the queue that is shared with all other agent telemetry (acks, fins, command results, events) → unrelated reports get dropped agent-wide.
chunkWriter.Write always returns immediately and Post never blocks, so the child process gets no backpressure — alpamon converts the firehose into unbounded queue churn.

Suggestion: add a batching window to chunkWriter (coalesce up to N lines / 4 KB / ~50–100 ms into one chunk), and optionally apply real backpressure when the queue is near capacity so the child is throttled rather than silently dropped.

🟠 No retained copy when chunk delivery fails

Under streaming, a successful command's fin payload carries an empty result (runCommand returns nil bytes on the cw != nil path) and the output is delivered solely via best-effort chunk POSTs. Nothing retains the output, so a dropped or retry-exhausted chunk POST (see the queue concern above) means that output is permanently unrecoverable — there is no fallback.

Suggestion: tee a bounded buffer (cap, e.g. 1 MiB, truncate-middle) into the fin payload so a complete-up-to-cap copy is delivered reliably in a single request, independent of per-chunk delivery. This keeps the memory bound this PR introduced while removing the "all-or-nothing per chunk" fragility.

Notes (non-blocking)

command.go:56-57 honestly documents that the priority bump (10→11) is a best-effort hint, not an ordering guarantee — with 4 concurrent reporters a chunk can still land after fin. That's fine as long as the receiver tolerates post-fin, out-of-order arrival keyed by (command_id, seq).
stdout and stderr are merged into one stream (parity with the old CombinedOutput), so the receiver can't distinguish them — worth noting as a known characteristic.
Test coverage for the streaming path is good; a test exercising queue-pressure / drop-on-full behavior would be a nice addition given the concern above.

Coalesce stdout/stderr into chunks emitted on a 4KB threshold or a periodic flush tick instead of one POST per output line, and throttle the producing command (dropping past ctx/maxWait) when the shared queue nears capacity so a chatty command can't starve other telemetry.
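A threshold-or-tick batcher along these lines is sketched below. It is an illustration under assumptions: the `batcher` type, its fields, and the `run` loop are hypothetical, and the throttling half of this commit is not covered here.

```go
package main

import (
	"context"
	"fmt"
	"sync"
	"time"
)

// batcher coalesces writes and emits on a size threshold or a flush call.
type batcher struct {
	mu        sync.Mutex
	buf       []byte
	threshold int
	emit      func(content string)
}

// Write buffers p and emits threshold-sized chunks as soon as they fill up.
func (b *batcher) Write(p []byte) (int, error) {
	b.mu.Lock()
	defer b.mu.Unlock()
	b.buf = append(b.buf, p...)
	for len(b.buf) >= b.threshold {
		b.emit(string(b.buf[:b.threshold]))
		b.buf = b.buf[b.threshold:]
	}
	return len(p), nil
}

// flush emits the remainder, which is always below the threshold here.
func (b *batcher) flush() {
	b.mu.Lock()
	defer b.mu.Unlock()
	if len(b.buf) > 0 {
		b.emit(string(b.buf))
		b.buf = b.buf[:0]
	}
}

// run flushes on every tick until ctx is done, then flushes once more.
func (b *batcher) run(ctx context.Context, every time.Duration) {
	t := time.NewTicker(every)
	defer t.Stop()
	for {
		select {
		case <-t.C:
			b.flush()
		case <-ctx.Done():
			b.flush()
			return
		}
	}
}

func main() {
	var chunks []string
	b := &batcher{threshold: 4096, emit: func(c string) { chunks = append(chunks, c) }}
	ctx, cancel := context.WithCancel(context.Background())
	done := make(chan struct{})
	go func() { b.run(ctx, 100*time.Millisecond); close(done) }()

	for i := 0; i < 1000; i++ {
		fmt.Fprintf(b, "line %d\n", i)
	}
	cancel()
	<-done
	fmt.Println(len(chunks), "chunks for 1000 lines")
}
```

The tick bounds how long a quiet command's output can sit in the buffer, while the threshold bounds chunk size for a chatty one.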

Under streaming the fin payload carried an empty result, so output lived only in best-effort chunk POSTs and was permanently lost if a chunk dropped. Tee a bounded (1 MiB, truncate-middle) copy into the returned output so fin reliably carries the command's start and end for audit, while memory stays bounded.

…ance streaming result accumulation

Jisung Chae (jisung-02) · 2026-06-03T01:57:01Z

Addressed review feedback

Thanks for the review — summary of the changes made since then (all on this branch).

🟠 One POST per line / no backpressure — fixed (`29d0648`)

chunkWriter now coalesces output and emits on the 4 KB threshold or a ~100 ms flush tick, whichever comes first, so a chatty command no longer produces roughly one POST per output line.
Added RequestQueue.PostChunk: while the shared queue is at/above a high-water mark (80% of MaxQueueSize), chunk POSTs apply backpressure — throttling the producing command through its blocked stdout pipe — instead of flooding the queue and starving other telemetry. It falls back to dropping past a max wait or on context cancellation so a command can't stall indefinitely. Only chunk traffic uses this path; acks/fins/events keep their non-blocking semantics.

🟠 No retained copy when chunk delivery fails — fixed (`8d3a5e5`, `ff2c009`)

Reintroduced a fin audit copy, but bounded: a 1 MiB truncate-middle capBuffer is teed alongside the chunk stream, keeping the command's start and end and dropping the middle. Retained memory stays bounded regardless of stream size (locked by TestChunkWriter_StreamsAllWithBoundedCapture + capBuffer tests), so this does not reintroduce the earlier unbounded-retention concern.
runCommand's streaming path returns this capped copy; the timeout path carries it too (banner appended only when there is output).
Extended the same guarantee to the allow_sh=false operator path (&&/||/;), which was still dropping per-segment output under streaming. Segments are now accumulated and the total re-capped via a shared utils.TruncateMiddle / utils.AuditOutputCap, so both execution paths deliver an identical ≤1 MiB audit copy.

Net effect: chunks are the live, best-effort channel; the fin payload is the reliable, bounded audit copy.
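A truncate-middle buffer of the kind described can be sketched as below. The field names, the marker text, and the even head/tail split are assumptions; the PR's capBuffer and utils.TruncateMiddle may differ.

```go
package main

import "fmt"

// capBuffer keeps the first and last `half` bytes written to it and counts
// the bytes dropped in between, so retained memory stays near 2*half.
type capBuffer struct {
	half    int
	head    []byte
	tail    []byte
	dropped int
}

func (b *capBuffer) Write(p []byte) (int, error) {
	n := len(p)
	// Fill the head first; it never changes once full.
	if room := b.half - len(b.head); room > 0 {
		k := room
		if k > len(p) {
			k = len(p)
		}
		b.head = append(b.head, p[:k]...)
		p = p[k:]
	}
	// Everything else goes to the tail, which keeps only its last half bytes.
	b.tail = append(b.tail, p...)
	if over := len(b.tail) - b.half; over > 0 {
		b.dropped += over
		b.tail = append(b.tail[:0], b.tail[over:]...)
	}
	return n, nil
}

// Bytes returns head + marker + tail. The marker is added on top of the cap.
func (b *capBuffer) Bytes() []byte {
	out := append([]byte(nil), b.head...)
	if b.dropped > 0 {
		out = append(out, fmt.Sprintf("\n... %d bytes truncated ...\n", b.dropped)...)
	}
	return append(out, b.tail...)
}

func main() {
	b := &capBuffer{half: 3} // a 1 MiB cap would use half = 512 * 1024
	b.Write([]byte("abcdefghij"))
	fmt.Printf("%q\n", b.Bytes())
}
```

Teeing this buffer alongside the chunk stream, for example with io.MultiWriter, gives fin a bounded copy of the start and end of the output regardless of how much was streamed.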

Notes from the review

Queue-pressure / drop-on-full coverage: added (PostChunk blocks-until-space, drops-after-max-wait, drops-on-cancel), plus a regression test that the operator path's fin result carries the accumulated output.
Priority bump is best-effort and stdout/stderr are merged into one stream: left as-is — both are intentional, documented characteristics; the receiver reassembles by (command_id, seq).

Cleanup

4f35b3f removes a now-unreachable chunk-splitting loop in flush — batching moved threshold splitting into Write, so flush always sees a sub-threshold remainder.

All affected packages build and tests pass, including -race on the streaming and queue paths.

Jisung Chae (jisung-02) added 4 commits May 28, 2026 16:12

Jisung Chae (jisung-02) self-assigned this May 28, 2026

chore(executor): tighten chunk streaming docstrings

60237c3

Jisung Chae (jisung-02) marked this pull request as ready for review May 29, 2026 08:53

Copilot AI review requested due to automatic review settings May 29, 2026 08:53

Copilot started reviewing on behalf of Jisung Chae (jisung-02) May 29, 2026 08:53

Copilot AI reviewed May 29, 2026

Comment thread pkg/runner/command.go Outdated

Comment thread pkg/executor/executor.go Outdated

Jisung Chae (jisung-02) requested a review from Copilot May 29, 2026 09:10

Copilot started reviewing on behalf of Jisung Chae (jisung-02) May 29, 2026 09:10

Copilot AI reviewed May 29, 2026

Comment thread pkg/executor/executor.go

Comment thread pkg/executor/executor.go

Jisung Chae (jisung-02) requested a review from Copilot May 29, 2026 09:22

Copilot started reviewing on behalf of Jisung Chae (jisung-02) May 29, 2026 09:22

Copilot AI reviewed May 29, 2026

Comment thread pkg/executor/executor.go Outdated

Comment thread pkg/executor/chunk_writer_test.go

Jisung Chae (jisung-02) added 3 commits May 29, 2026 19:05

Merge remote-tracking branch 'origin/main' into feat/executor-stream-…

3ddfc3d

…output # Conflicts: # pkg/executor/executor.go # pkg/executor/executor_test.go

Jisung Chae (jisung-02) requested a review from Copilot June 2, 2026 00:46

Copilot started reviewing on behalf of Jisung Chae (jisung-02) June 2, 2026 00:47

Copilot AI reviewed Jun 2, 2026

Comment thread pkg/executor/executor.go Outdated

Comment thread pkg/executor/executor.go

Jisung Chae (jisung-02) requested a review from Copilot June 2, 2026 01:04

Copilot started reviewing on behalf of Jisung Chae (jisung-02) June 2, 2026 01:04

Copilot AI reviewed Jun 2, 2026

Comment thread pkg/runner/command.go

fix(runner): advance chunk seq before Post to keep it monotonic on panic

8cb8cd6

Jisung Chae (jisung-02) requested a review from Copilot June 2, 2026 01:20

Copilot started reviewing on behalf of Jisung Chae (jisung-02) June 2, 2026 01:20

Copilot AI reviewed Jun 2, 2026

Comment thread pkg/executor/executor.go Outdated

Comment thread pkg/executor/executor.go Outdated

fix(executor): clone chunk before emit to avoid pinning source alloca…

2b63900

…tion

Jisung Chae (jisung-02) requested a review from Copilot June 2, 2026 01:32

Copilot started reviewing on behalf of Jisung Chae (jisung-02) June 2, 2026 01:32

Copilot AI reviewed Jun 2, 2026

docs(executor): tidy streaming comments for concision

0118620

Trim multi-line rationale comments while preserving intent; no logic changes.

Jisung Chae (jisung-02) requested a review from Eunyoung Jeong (eunyoung14) June 2, 2026 06:08

Eunyoung Jeong (eunyoung14) reviewed Jun 2, 2026

Eunyoung Jeong (eunyoung14) mentioned this pull request Jun 2, 2026

feat(exec): stream command output via WebSocket with REST fallback alpacax/alpacon-cli#198

Open

7 tasks

Jisung Chae (jisung-02) added 5 commits June 3, 2026 10:09

refactor(executor): rename Flush method to flush for consistency

b7bf533

feat(executor): implement output truncation for audit payload and enh…

ff2c009

…ance streaming result accumulation

refactor(executor): simplify chunk emission logic in flush method

4f35b3f

Jisung Chae (jisung-02) requested a review from Eunyoung Jeong (eunyoung14) June 4, 2026 01:00
