-
Notifications
You must be signed in to change notification settings - Fork 4.1k
Pull requests: NVIDIA/Megatron-LM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix(clip_grads): handle empty grads_for_norm in inf-norm and p-norm paths
community-request
#5530
opened Jun 28, 2026 by
Mattral
Loading…
4 tasks done
Remove deprecated and unsafe type=bool usage on boolean CLI flags
complexity: low
Run CICD
Run functional tests
Run tests
#5528
opened Jun 28, 2026 by
ahmadki
Member
Loading…
3 of 6 tasks
[CI] Fix
gpt_dynamic_inference_tp2_pp2_ep2_gptoss_20b_swa tests
complexity: low
#5527
opened Jun 28, 2026 by
asolergi-nv
Contributor
Loading…
1 of 6 tasks
Preserve DSA output across fused inverse RoPE
complexity: low
#5526
opened Jun 28, 2026 by
kunlunl
Contributor
Loading…
6 tasks
[codex] Fix tensor-parallel label smoothing
complexity: low
Final Review
PR is in the "final review" stage
#5522
opened Jun 27, 2026 by
ilml
Contributor
Loading…
[training migration] Delete model builders
complexity: medium
Run functional tests
#5521
opened Jun 27, 2026 by
maanug-nv
Contributor
Loading…
6 tasks
build: bump transformer-engine to release_v2.16.post
complexity: low
Run functional tests
#5517
opened Jun 26, 2026 by
ko3n1g
Contributor
Loading…
[training migration] Finish ModelBuilder integration
complexity: medium
Run functional tests
#5516
opened Jun 26, 2026 by
maanug-nv
Contributor
Loading…
1 of 6 tasks
Add forward all-gather overlap to experimental FSDP
complexity: medium
#5513
opened Jun 26, 2026 by
wujingyue
Contributor
Loading…
Add CI duties to oncall
docs-only
documentation only (docs or docstrings)
#5510
opened Jun 26, 2026 by
Phlip79
Member
Loading…
1 task done
Add hetero MIMO (Nemotron6-MoE VLM) training entrypoint on the stock pretrain loop
#5504
opened Jun 25, 2026 by
yashaswikarnati
Contributor
•
Draft
chore: nightly sync main into dev (25_06_2026)
complexity: high
Run functional tests
Run MBridge tests
Attach this for testing this PR against MBridge main
#5503
opened Jun 25, 2026 by
svcnvidia-nemo-ci
Loading…
Hybrid prefix caching fixes
complexity: medium
#5502
opened Jun 25, 2026 by
santhnm2
Contributor
Loading…
1 of 6 tasks
add safe version of numpy.load
complexity: low
Expert Review
[deprecated] Apply this label to indicate that your PR is ready for expert review.
Final Review
PR is in the "final review" stage
#5500
opened Jun 25, 2026 by
dimapihtar
Contributor
Loading…
1 of 6 tasks
Add cos/sin width guard to fused MLA RoPE kernels
community-request
Final Review
PR is in the "final review" stage
waiting-on-maintainers
Waiting on maintainers to respond
#5497
opened Jun 25, 2026 by
ShauryaaSharma
Loading…
4 of 6 tasks
Document stacked dependent PR handling in split PR skill
docs-only
documentation only (docs or docstrings)
#5496
opened Jun 25, 2026 by
wujingyue
Contributor
Loading…
Inference: Use FA4 for prefill and FA2 for decode
#5494
opened Jun 25, 2026 by
sidsingh-nvidia
Contributor
•
Draft
1 of 6 tasks
Add statistics logging for params and activations
#5492
opened Jun 25, 2026 by
philipcmonk
•
Draft
4 of 6 tasks
Refactor RL rollout pipeline
complexity: medium
#5491
opened Jun 25, 2026 by
lauradang
Contributor
Loading…
6 tasks
[Main] Numerical fix for moe single grouped weight with fp8 fp4 primary weight and grad norm spikes
complexity: high
#5487
opened Jun 24, 2026 by
zhongbozhu
Contributor
Loading…
1 of 6 tasks
Previous Next
ProTip!
Adding no:label will show everything without a label.