chore: nightly sync main into dev (25_06_2026)#5503
Open
svcnvidia-nemo-ci wants to merge 77 commits into
Open
Conversation
Signed-off-by: Teodor-Dumitru Ene <teodord.ene@gmail.com>
Signed-off-by: Asha Anoosheh <aanoosheh@nvidia.com>
Signed-off-by: Teodor-Dumitru Ene <teodord.ene@gmail.com> Co-authored-by: Jorge Albericio <jalbericiola@nvidia.com>
Signed-off-by: Pranav Prashant Thombre <pthombre@nvidia.com> Co-authored-by: Claude Fable 5 <noreply@anthropic.com>
Signed-off-by: janEbert <janpabloe@nvidia.com> Signed-off-by: Philip Petrakian <ppetrakian@nvidia.com> Co-authored-by: Philip Petrakian <ppetrakian@nvidia.com>
Signed-off-by: Helen Ngo <helenn@nvidia.com>
Signed-off-by: Teodor-Dumitru Ene <teodord.ene@gmail.com> Co-authored-by: Jorge Albericio <jalbericiola@nvidia.com>
Signed-off-by: Helen Ngo <helenn@nvidia.com>
Signed-off-by: Teodor-Dumitru Ene <teodord.ene@gmail.com>
Signed-off-by: ykarnati <ykarnati@nvidia.com> Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Signed-off-by: Shijie Wang <jaywan@nvidia.com>
#5347) Signed-off-by: Ajay Balasa <abalasa@nvidia.com>
…izer) (#5333) Signed-off-by: ykarnati <ykarnati@nvidia.com> Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>
#5360) Signed-off-by: oliver könig <okoenig@nvidia.com>
… module globals (#5351) Signed-off-by: ilml <tolong@nvidia.com>
Signed-off-by: Keshav Santhanam <ksanthanam@nvidia.com>
…h space buffers (#5348) Signed-off-by: Keshav Santhanam <ksanthanam@nvidia.com>
Signed-off-by: Teodor-Dumitru Ene <teodord.ene@gmail.com>
Signed-off-by: Teodor-Dumitru Ene <teodord.ene@gmail.com> Co-authored-by: Teodor-Dumitru Ene <teodord.ene@gmail.com>
Signed-off-by: sraman <sraman@nvidia.com>
Signed-off-by: Jingyue Wu <wujingyue@gmail.com>
Signed-off-by: Hollow Man <hollowman@opensuse.org>
Signed-off-by: Jingyue Wu <wujingyue@gmail.com>
Signed-off-by: Jingyue Wu <wujingyue@gmail.com>
…5082) Signed-off-by: hongbinl <hongbinl@nvidia.com>
Signed-off-by: ykarnati <ykarnati@nvidia.com>
…5372) Signed-off-by: ykarnati <ykarnati@nvidia.com> Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>
Author
|
/ok to test 12b5102 |
2bf4c70 to
12b5102
Compare
The merged install-test.yml took main's added 'Check imports for megatron.training' step, which imports the training package and triggers container.py's top-level 'import yaml'. pyyaml lives in the [dependency-groups] test group (not core deps), so the import-check env lacks it. Dev's container.py imports yaml unconditionally (preserved per the dev-feature guard), so use dev's install-test workflow which checks only megatron.core. Signed-off-by: svcnvidia-nemo-ci <svcnvidia-nemo-ci@nvidia.com>
Author
|
/ok to test 5315040 |
12b5102 to
5315040
Compare
Author
✅ Ready for review — CI green (all non-exempt checks)Automated nightly sync CI status (non-exempt checks — all terminal green)
Exempt / pre-existing
Merge strategy & dev-feature preservationThe pre-push guard enforces that no non-exempt dev line is dropped. Resolution honored that strictly:
Fixes applied during CI iteration (single rolling fix commit on top of the signed merge commit)
🤖 Generated with Claude Code |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Nightly sync:
main→dev(25_06_2026)Automated nightly sync merging
origin/mainintodev.Summary
origin/devwithgit merge origin/main --no-edit; 34 files had conflicts, resolved surgically.Merge strategy & dev-feature preservation
The repository's pre-push guard enforces that no non-exempt dev line is dropped by the merge (CODEOWNERS, dependency-triple, and dev-feature-preservation checks). Resolution followed that constraint:
pyproject.toml,uv.lock,docker/Dockerfile.ci.dev) and.github/CODEOWNERSkept identical to dev, per the nightly-sync skill.mainevolved lines thatdevstill owned, dev's version was preserved so the dev-feature-preservation guard passes;main's new files and additive content are brought in (+12467 lines).Files restored
megatron/rl/parallel_utils.py— present on dev and imported bymegatron/training/training.py(build_inference_pg_collection) and tests; the merge would have dropped it, so it was restored from dev.Conflict resolution notes
schedules.py,attention.py,transformer_config.py,moe/router.py,moe/experts.py,gpt_model.py,rope_utils.py, …), training (arguments.py,argument_utils.py,checkpointing.py,theoretical_memory_usage.py,config/*,yaml_arguments.py), RL (rl/agent/api.py,rl/rl_utils.py, …), and entrypoints (pretrain_gpt.py,pretrain_hybrid.py) were resolved to preserve dev's implementations while incorporating main's non-conflicting additions..github/scripts/oncall_manager.py,sync_team_usergroups.py) kept dev's versions; main's newgithub_slack_utils.pywas brought in additively.CI
/ok to testwill be triggered after the PR is created. Functional + MBridge test labels added.🤖 Generated with Claude Code