Skip to content

Pull requests: NVIDIA-NeMo/RL

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat: add online DPO training community-request Documentation Improvements or additions to documentation
#2456 opened May 10, 2026 by taivu1998 Draft
feat(grpo): add SAPO actor loss community-request Documentation Improvements or additions to documentation
#2455 opened May 10, 2026 by taivu1998 Draft
feat(grpo): support async multiple dataloaders community-request Documentation Improvements or additions to documentation
#2454 opened May 10, 2026 by taivu1998 Draft
fix(infra): dev pod RBAC, macOS install scripts, helm fixes
#2450 opened May 10, 2026 by terrykong Collaborator Loading…
5 tasks
fix(nrl-k8s): rewrite --config in entrypoint to honor CLI RECIPE arg
#2449 opened May 9, 2026 by hemildesai Contributor Loading…
2 of 3 tasks
refactor: refactor async utils CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2448 opened May 9, 2026 by yuki-97 Contributor Draft
fix(megatron): delegate packed CP slicing to MCore
#2445 opened May 8, 2026 by zyzhou5 Loading…
4 tasks
feat(vllm): add delta-compressed collective refit
#2444 opened May 8, 2026 by HollowMan6 Member Loading…
4 tasks done
fix: fix skip_reference_policy_logprobs_calculation and skip_prev_logprobs CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2443 opened May 8, 2026 by jinglinglingling Loading…
Mxin/moe mamba sft Documentation Improvements or additions to documentation
#2442 opened May 8, 2026 by mxinO Contributor Draft
4 tasks
feat: data plane transfer queue integration CI:L1 Run doctests, unit tests, and functional tests
#2439 opened May 7, 2026 by ZhiyuLi-Nvidia Contributor Loading…
4 tasks done
Dynamo Nemo-RL K8s integration
#2429 opened May 6, 2026 by jthomson04 Contributor Draft
4 tasks
[WIP] don't review Documentation Improvements or additions to documentation
#2420 opened May 6, 2026 by shuyixiong Contributor Draft
4 tasks
feat: Auto research skill community-request waiting-on-maintainers Waiting on maintainers to respond
#2419 opened May 6, 2026 by vinhngx Contributor Loading…
fix: handle non-contiguous tensors in IPC weight refit CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version) community-request waiting-on-customer Waiting on the original author to respond
#2418 opened May 5, 2026 by jlcanta Loading…
3 of 4 tasks
feat: Support Megatron + SGLang community-request waiting-on-maintainers Waiting on maintainers to respond
#2416 opened May 5, 2026 by pengdurice Contributor Loading…
2 of 4 tasks
[WIP] New refit integration branch
#2413 opened May 5, 2026 by youngeunkwon0405 Contributor Draft
4 tasks
ProTip! no:milestone will show everything without a milestone.