feat: add load_weights_from_async_generator to Bridge#117

Open
sl-1314 wants to merge 1 commit into ISEEKYAN:main from sl-1314:wsl_dev/load_weights_from_per_tensor

sl-1314 commented Apr 14, 2026

Sometimes we need to receive Hugging Face (HF) weights and load them into Megatron-Core models. This PR adds Bridge.load_weights_from_async_generator, which accepts an AsyncGenerator[tuple[str, torch.Tensor], None] and loads the weights into Megatron-Core models on the fly.
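For readers unfamiliar with the pattern, here is a minimal sketch of what streaming (name, tensor) pairs through an async generator looks like. It does not show the actual implementation in this PR: `stream_hf_weights`, `consume`, the parameter names, and the `Tensor` stand-in (a plain list used so the snippet runs without PyTorch) are all assumptions made for illustration.

```python
import asyncio
from collections.abc import AsyncGenerator

# Stand-in for torch.Tensor so the sketch runs without PyTorch (assumption).
Tensor = list[float]


async def stream_hf_weights() -> AsyncGenerator[tuple[str, Tensor], None]:
    """Hypothetical producer: yields (HF parameter name, tensor) pairs one at a time."""
    fake_checkpoint = {
        "model.embed_tokens.weight": [0.1, 0.2],  # illustrative names only
        "model.norm.weight": [1.0, 1.0],
    }
    for name, tensor in fake_checkpoint.items():
        await asyncio.sleep(0)  # simulate waiting on a network or IPC receive
        yield name, tensor


async def consume(
    weights: AsyncGenerator[tuple[str, Tensor], None],
) -> dict[str, Tensor]:
    """Hypothetical consumer: handles each tensor as it arrives, without
    waiting for the full checkpoint to be available first."""
    loaded: dict[str, Tensor] = {}
    async for name, tensor in weights:
        # A real bridge would map the HF name to a Megatron-Core parameter
        # and copy the data into the model here; this sketch only records it.
        loaded[name] = tensor
    return loaded


loaded = asyncio.run(consume(stream_hf_weights()))
```

In this shape the consumer never needs the whole state dict at once, which is presumably the appeal of accepting an async generator rather than a complete mapping.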
