[rollout, vllm] feat: reject server-side tool parser for TITO agent rollout by Jiang020609 · Pull Request #6844 · verl-project/verl

Jiang020609 · 2026-06-25T06:54:01Z

What does this PR do?

Follow-up to #6560. That PR proposed exposing server-side vLLM tool-calling
config (enable_auto_tool_choice, tool_call_parser, tool_parser_plugin) in
the rollout config. Per review feedback there, RL rollout uses
token-in-token-out (TITO) generation, so tool parsing happens client-side in
the AgentLoop and these server-side parser args should not be passed to vLLM
at all.
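
To make "client-side tool parsing" concrete, here is a minimal sketch of pulling tool calls out of text decoded from generated token IDs. It is not verl's AgentLoop parser: `extract_tool_calls` is a hypothetical helper, and it assumes hermes-style output where each call is a JSON object wrapped in `<tool_call>...</tool_call>` tags.

```python
import json
import re

# Assumption: hermes-style tool calls wrap one JSON object per <tool_call> tag pair.
_TOOL_CALL_RE = re.compile(r"<tool_call>(.*?)</tool_call>", re.DOTALL)


def extract_tool_calls(decoded_text: str) -> list[dict]:
    """Return the tool calls found in decoded model output.

    Hypothetical helper for illustration only; payloads that are not valid
    JSON objects with a "name" key are skipped rather than raising.
    """
    calls = []
    for match in _TOOL_CALL_RE.finditer(decoded_text):
        try:
            payload = json.loads(match.group(1).strip())
        except json.JSONDecodeError:
            continue
        if isinstance(payload, dict) and "name" in payload:
            calls.append(
                {"name": payload["name"], "arguments": payload.get("arguments", {})}
            )
    return calls
```

Because the client already does this kind of extraction on the token stream it receives, asking the vLLM server to parse the same output a second time adds nothing to a TITO rollout.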

This PR implements that guidance as a fail-fast guardrail. When a vLLM
rollout enables the AgentLoop tool path (multi-turn enabled, a non-default
default_agent_loop, tool_config_path set, or function_tool_path set),
configuring any of the server-side vLLM tool parser args in
rollout.engine_kwargs.vllm now raises a clear ValueError. Previously the args
were silently passed through to the engine, where they are redundant with, or
conflict with, client-side parsing.

Plain chat-completion rollouts (multi-turn disabled) are unaffected and may
still set these args.

Checklist Before Starting

Search for similar PRs:
- https://github.com/verl-project/verl/pulls?q=tool_call_parser → only [rollout,vllm,cfg] feat: expose vLLM tool-calling config #6560 (closed, opposite direction)
- Related but non-duplicate: [rollout] feat: add reasoning parser to strip think blocks before tool extraction (fixes #6424) #6434 (client-side reasoning parser), [agent_loop, tool] fix: support hermes-format tool calls on gpt-oss tokenizer models #6481 (hermes tool calls on gpt-oss)
PR title formatted as [{modules}] {type}: {description}

Test

Config validation is CPU-testable; no training experiment is needed (no change
to training dynamics — this only rejects an invalid config combination).

python -m pytest tests/workers/config/test_rollout_config_on_cpu.py -q
# 6 passed

The test file is named *_on_cpu.py, so it is auto-collected by
.github/workflows/cpu_unit_tests.yml (which runs tests/**/test_*_on_cpu.py
on CPU). Coverage includes:

- vLLM multi-turn rejects server-side parser args (dataclass and dict config)
- vLLM tool_agent rejects server-side parser args
- vLLM function_tool_path rejects server-side parser args
- chat-completion (multi-turn disabled) still allows the args
- unrelated engine_kwargs.vllm (e.g. gpu_memory_utilization) is untouched
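
On the file-naming convention mentioned above, the sketch below checks whether a path would fall under the `tests/**/test_*_on_cpu.py` pattern. How the workflow actually expands that glob is not shown here, so `is_cpu_test` is a hypothetical approximation: it requires the path to sit under `tests/` and matches only the basename.

```python
from fnmatch import fnmatchcase
from pathlib import PurePosixPath

# Basename part of the tests/**/test_*_on_cpu.py pattern quoted in the PR.
CPU_TEST_PATTERN = "test_*_on_cpu.py"


def is_cpu_test(path: str) -> bool:
    """True if the path is under tests/ and its basename matches the CPU pattern."""
    p = PurePosixPath(path)
    return p.parts[:1] == ("tests",) and fnmatchcase(p.name, CPU_TEST_PATTERN)
```

A test file that drops the `_on_cpu` suffix would not match, which is why the suffix matters for CI collection.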

API and Usage Example

No new config fields. Behavior change only: an invalid combination now fails early.

from verl.workers.config import RolloutConfig, MultiTurnConfig

# Raises ValueError: server-side tool parser args are not allowed for TITO AgentLoop rollout
RolloutConfig(
    name="vllm",
    multi_turn=MultiTurnConfig(enable=True),
    engine_kwargs={"vllm": {"enable_auto_tool_choice": True, "tool_call_parser": "hermes"}},
)

# Still allowed without multi-turn (plain chat-completion rollout)
RolloutConfig(
    name="vllm",
    multi_turn=MultiTurnConfig(enable=False),
    engine_kwargs={"vllm": {"enable_auto_tool_choice": True, "tool_call_parser": "hermes"}},
)

Design & Code Changes

- verl/workers/config/rollout.py: in RolloutConfig.__post_init__, after the
  existing rollout validations, detect AgentLoop tool usage and, for the vllm
  backend, raise ValueError if any of {enable_auto_tool_choice,
  tool_call_parser, tool_parser_plugin} is set to a non-default value.
- tests/workers/config/test_rollout_config_on_cpu.py: new CPU unit tests
  covering the rejection paths and the allowed paths.
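
The check described above can be sketched with stand-in dataclasses. This is not the code in verl/workers/config/rollout.py: `RolloutSketch` and `MultiTurnSketch` are hypothetical, where each field lives is a guess, the default agent loop name `single_turn_agent` is an assumption, and any falsy value (False, None, "") is treated as "default".

```python
from dataclasses import dataclass, field
from typing import Any, Optional

# The three server-side vLLM tool parser args named in this PR.
SERVER_SIDE_TOOL_ARGS = ("enable_auto_tool_choice", "tool_call_parser", "tool_parser_plugin")
# Assumption: name of the default agent loop; the real default may differ.
DEFAULT_AGENT_LOOP = "single_turn_agent"


@dataclass
class MultiTurnSketch:
    """Stand-in for MultiTurnConfig with only the fields this check reads."""

    enable: bool = False
    tool_config_path: Optional[str] = None
    function_tool_path: Optional[str] = None


@dataclass
class RolloutSketch:
    """Stand-in for RolloutConfig; field placement is an assumption."""

    name: str = "vllm"
    multi_turn: MultiTurnSketch = field(default_factory=MultiTurnSketch)
    default_agent_loop: str = DEFAULT_AGENT_LOOP
    engine_kwargs: dict[str, Any] = field(default_factory=dict)

    def __post_init__(self) -> None:
        if self.name != "vllm":
            return  # the guardrail only applies to the vllm backend
        mt = self.multi_turn
        uses_agent_tools = (
            mt.enable
            or self.default_agent_loop != DEFAULT_AGENT_LOOP
            or bool(mt.tool_config_path)
            or bool(mt.function_tool_path)
        )
        if not uses_agent_tools:
            return  # plain chat-completion rollout: args stay allowed
        vllm_kwargs = self.engine_kwargs.get("vllm") or {}
        # Assumption: a falsy value counts as the default for each arg.
        offending = [k for k in SERVER_SIDE_TOOL_ARGS if vllm_kwargs.get(k)]
        if offending:
            raise ValueError(
                "server-side tool parser args are not allowed for TITO AgentLoop "
                f"rollout: {offending}"
            )
```

Running the check in `__post_init__` means a bad combination fails when the config object is built, before any engine is launched, and unrelated keys such as gpu_memory_utilization are never inspected.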

Checklist Before Submitting

Read the Contribute Guide.
Apply pre-commit checks (ruff / ruff-format / mypy pass; files unchanged by formatters).
Add / Update the documentation. — N/A: no new config surface; this only rejects an already-invalid combination.
Add unit test(s) to the CI workflow — auto-collected by cpu_unit_tests.yml via the *_on_cpu.py suffix.
Not related to the recipe submodule.

…ollout Follow-up to verl-project#6560. Per review feedback that RL rollout uses token-in-token-out (TITO) generation with client-side AgentLoop tool parsing, reject server-side vLLM tool parser args (enable_auto_tool_choice, tool_call_parser, tool_parser_plugin) when multi-turn / tool-agent rollout is enabled, instead of exposing them. Add CPU coverage for multi_turn, tool_agent, and function_tool_path configurations. Co-authored-by: OpenAI Codex <codex@openai.com>

gemini-code-assist

Code Review

This pull request introduces validation logic in RolloutConfig to prevent the configuration of server-side vLLM tool parser arguments when client-side AgentLoop tool parsing is used (such as in multi-turn RL rollouts). It also adds corresponding unit tests to verify that the appropriate ValueError is raised under these conditions. There are no review comments, so I have no feedback to provide.


gemini-code-assist Bot reviewed Jun 25, 2026

Jiang020609 wants to merge 1 commit into verl-project:main from Jiang020609:fix/vllm-tito-tool-parser-config