forked from NVIDIA/Megatron-LM
Pull requests: ISEEKYAN/Megatron-LM
Pull requests list
Fix MLite microbatch loss and forward-only output contracts
#68
opened Jun 28, 2026 by
ISEEKYAN
Owner
[lite] GLM5.2 (DeepSeek-V3.2) IndexShare DSA support
#66
opened Jun 27, 2026 by
ISEEKYAN
Owner
[codex] DSv4 DSA: opt-in torch fallback paths for smoke/CI (default-off)
#64
opened Jun 23, 2026 by
Meirtz
[codex] Fix DSv4 DCP checkpoint placements for DTensor-like params
#61
opened Jun 21, 2026 by
Meirtz
ds4 (deepseek_v4): DSA indexer aux-loss scale hook + lift forced CP=1 (MTP/dense-CSA support CP>1)
#59
opened Jun 18, 2026 by
ISEEKYAN
Owner