feat(models): run_tool_loop shared turn-loop primitive by pradeepvrd · Pull Request #26 · pradeepvrd/devops-bench

pradeepvrd · 2026-06-20T21:05:33Z

The agent turn-loop used to be inline in pkg/agents/runner/api/api.py (the run_api_agent loop, coupled to MCP/skill tools); this extracts it into a shared, provider-agnostic devops_bench/models/loop.py — run_tool_loop over a neutral tool-dispatch seam — reused by the API agent and the chaos agent. No new deps.

Behavior changes

An explicit max_turns cap is enforced (and logged when hit) instead of looping until the model stops on its own.
Tool-dispatch errors propagate to the caller through the dispatcher seam instead of being swallowed and appended as error strings inside the loop.
Returns a typed LoopResult (response, contents, final_text, latency, tools_used) instead of a dict; latency uses time.monotonic().

Bugs fixed

The previous loop had no turn cap and could run unbounded against a misbehaving model.
The model's final summary was dropped when a tool call landed on the last turn; the text is now captured every turn.

The agent turn-loop used to be inline in `pkg/agents/runner/api/api.py` (the `run_api_agent` loop, coupled to MCP/skill tools); this extracts it into a shared, provider-agnostic `devops_bench/models/loop.py` — `run_tool_loop` over a neutral tool-dispatch seam — reused by the API agent and the chaos agent. No new deps. **Behavior changes** - An explicit `max_turns` cap is enforced (and logged when hit) instead of looping until the model stops on its own. - Tool-dispatch errors propagate to the caller through the dispatcher seam instead of being swallowed and appended as error strings inside the loop. - Returns a typed `LoopResult` (response, contents, final_text, latency, tools_used) instead of a dict; latency uses `time.monotonic()`. **Bugs fixed** - The previous loop had no turn cap and could run unbounded against a misbehaving model. - The model's final summary was dropped when a tool call landed on the last turn; the text is now captured every turn.

pradeepvrd mentioned this pull request Jun 20, 2026

Cross-cutting harness refactor: layered devops_bench (Stage 1.5–3, reconciled) #23

Closed

pradeepvrd force-pushed the submit/1-models-loop branch 2 times, most recently from 88c9939 to 2827366 Compare June 23, 2026 06:37

pradeepvrd force-pushed the submit/1-models-loop branch from 2827366 to 49f0968 Compare June 23, 2026 17:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(models): run_tool_loop shared turn-loop primitive#26

feat(models): run_tool_loop shared turn-loop primitive#26
pradeepvrd wants to merge 1 commit into
integration/devops-bench-stage1from
submit/1-models-loop

pradeepvrd commented Jun 20, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

pradeepvrd commented Jun 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

pradeepvrd commented Jun 20, 2026 •

edited

Loading