Skip to content

docs: fill CLI reference gaps for data prep and rollout collection#1498

Closed
lbliii wants to merge 7 commits into
mainfrom
docs/cli-reference-cleanup
Closed

docs: fill CLI reference gaps for data prep and rollout collection#1498
lbliii wants to merge 7 commits into
mainfrom
docs/cli-reference-cleanup

Conversation

@lbliii

@lbliii lbliii commented Jun 2, 2026

Copy link
Copy Markdown
Contributor

Summary

Fills documented gaps in the CLI reference and installation pages. All CLI additions were verified against nemo_gym/prompt.py and nemo_gym/rollout_collection.py.

Review feedback addressed

  • Copilot review: clarified that responses_create_params overrides use a shallow merge (dict | overrides in _preprocess_rows_from_config), so nested Hydra overrides replace the entire nested dict rather than deep-merging fields.

Verification

  • npm run check (fern check) passes with 0 errors.

Add missing CLI reference coverage in cli-commands.mdx:

- ng_materialize_prompts: full parameter table, example, and a note
  comparing it to ng_prepare_data (closes #1347).
- resume_from_cache: params-table entries for ng_collect_rollouts and
  ng_e2e_collect_rollouts plus a "Resume interrupted runs" section
  covering materialized inputs, incremental flush, matching, and the
  stale-cache footgun (closes #1239).
- Generation parameters: dedicated section documenting how temperature,
  top_p, and max_output_tokens are passed via
  ++responses_create_params.<field> rather than as standalone flags
  (closes #637).

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
Signed-off-by: Lawrence Lane <llane@nvidia.com>
- "See" -> "Refer to" in the new cross-references (accessibility wording).
- Title-case the two new subheadings (Generation Parameters, Resume
  Interrupted Runs); anchors are unchanged so links still resolve.
- Drop the academic adverb "gracefully".

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
Signed-off-by: Lawrence Lane <llane@nvidia.com>

Copilot AI left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Updates the NeMo Gym Fern documentation CLI reference (fern/versions/latest/pages/reference/cli-commands.mdx) to close gaps around rollout collection resume behavior, generation parameter overrides, and the ng_materialize_prompts data-prep command.

Changes:

  • Adds resume_from_cache to the ng_collect_rollouts and ng_e2e_collect_rollouts parameter tables and documents the resume workflow/artifacts.
  • Adds a Generation Parameters section clarifying how to override sampling params via ++responses_create_params.<field>.
  • Adds a full CLI reference entry for ng_materialize_prompts, including parameters, example usage, and guidance vs. ng_prepare_data.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.


#### Generation Parameters

Sampling parameters such as `temperature`, `max_output_tokens`, and `top_p` are not standalone CLI flags — they are passed as overrides inside `responses_create_params` using Hydra's nested dot syntax. Each override is merged into every input row's existing `responses_create_params`:

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Addressed in 91526fc — the Generation Parameters section now states that overrides use a shallow merge (top-level keys only), matching dict | overrides in _preprocess_rows_from_config.

++responses_create_params.max_output_tokens=4096
```

The same syntax works for `ng_e2e_collect_rollouts`. Any field accepted by the Responses API create params can be set this way (for example, `++responses_create_params.reasoning.effort=low`).

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Addressed in 91526fc — narrowed the nested-object guidance: top-level fields are straightforward, but nested Hydra overrides replace the entire nested dict (other fields under that key are not preserved).

Add PyPI-first installation tabs (#1191), mirror CLI reference additions
into v0.3.0 stable docs, and clarify that responses_create_params
overrides use a shallow merge per review feedback on #1498.
@copy-pr-bot

copy-pr-bot Bot commented Jun 23, 2026

Copy link
Copy Markdown

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

lbliii added 3 commits June 23, 2026 11:15
No-op commit to satisfy DCO on the pip install and shallow-merge doc
updates pushed in 91526fc.

Signed-off-by: Lawrence Lane <llane@nvidia.com>
…0 mirror"

This reverts commit 91526fc.

Signed-off-by: Lawrence Lane <llane@nvidia.com>
Add PyPI-first installation tabs (#1191), mirror CLI reference additions
into v0.3.0 stable docs, and clarify that responses_create_params
overrides use a shallow merge per review feedback on #1498.

Signed-off-by: Lawrence Lane <llane@nvidia.com>
@github-actions

Copy link
Copy Markdown

@lbliii

lbliii commented Jun 23, 2026

Copy link
Copy Markdown
Contributor Author

Closing in favor of #1675 — DCO was failing because commit 91526fce lacked Signed-off-by, and branch protection on docs/cli-reference-cleanup blocks force-push/delete so the history couldn't be cleaned up. #1675 has the identical diff as a single signed commit on lbliii/docs-cli-dco-fix (DCO passing).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

2 participants