GitHub - LunNova/ScionLight-reimpl: An implementation of the ScionLight optimizer from "Training Deep Learning Models with Norm-Constrained LMOs" (https://arxiv.org/abs/2502.07529)

A from-scratch implementation of the ScionLight optimizer from "Training Deep Learning Models with Norm-Constrained LMOs". I wrote it because the reference implementation was not yet available and I wanted to try the optimizer on a ≈1.6B-parameter training run on some local hardware.

ScionLight can be thought of as an alternative formulation of Muon with better hyperparameter scaling rules and a neat trick for reducing memory use during gradient accumulation.

Make sure not to zero the gradients between steps! This optimizer accumulates momentum in the gradient buffers, so zeroing them discards the momentum.
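To make the warning concrete, here is a minimal pure-Python sketch of the general idea of keeping momentum in the gradient buffer. It is not the API of ./scionlight.py: the names `ToyBufferMomentum`, `sign_lmo`, `backward`, and `run_demo` are hypothetical, the sign-based LMO is just one simple choice for illustration, and the quadratic loss is a stand-in for a real model.

```python
def sign_lmo(buf):
    """Toy LMO: over the l-infinity unit ball, <buf, x> is minimized by -sign(buf)."""
    return [-1.0 if g > 0 else (1.0 if g < 0 else 0.0) for g in buf]


class ToyBufferMomentum:
    """Hypothetical toy optimizer that keeps momentum inside the gradient buffer."""

    def __init__(self, params, grads, lr=0.1, momentum=0.1):
        self.params = params      # list of floats, updated in place
        self.grads = grads        # gradient buffer, doubles as the momentum buffer
        self.lr = lr
        self.momentum = momentum

    def step(self):
        direction = sign_lmo(self.grads)
        for i, d in enumerate(direction):
            self.params[i] += self.lr * d
        # Decay the buffer instead of zeroing it; the next backward pass adds onto it.
        for i in range(len(self.grads)):
            self.grads[i] *= 1.0 - self.momentum


def backward(params, grads):
    """Emulate gradient accumulation for loss = 0.5 * sum(p**2), whose gradient is p."""
    for i, p in enumerate(params):
        grads[i] += p


def run_demo(steps=2):
    params = [1.0, -2.0]
    grads = [0.0, 0.0]
    opt = ToyBufferMomentum(params, grads)
    for _ in range(steps):
        backward(params, grads)   # adds the fresh gradient onto the decayed buffer
        opt.step()                # note: the loop never zeroes the gradients
    return params, grads
```

Because the buffer is only decayed, it still holds a running mix of past gradients when the next backward pass adds to it. Zeroing it between steps in this sketch would reduce the update to plain sign descent on the latest gradient.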

See the self-contained ./scionlight.py file for API.

The official reference implementation is now available at github:LIONS-EPFL/scion.

Repository files (5 commits):
.editorconfig
.gitattributes
.gitignore
.ruff.toml
LICENSE
README.md
scionlight.py
