Skip to content

LunNova/ScionLight-reimpl

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

From scratch implementation of the ScionLight optimizer from Training Deep Learning Models with Norm-Constrained LMOs because the reference implementation was not yet available and I wanted to try it out for a ≈1.6B param training run on some local hardware.

ScionLight can be thought of as an alternative formulation of Muon with better hyperparam scaling rules and a neat trick for grad accumulation memory use.

Make sure not to zero grads between steps! This optimizer accumulates momentum in grads.

See the self-contained ./scionlight.py file for API.

The official reference implementation is now available at github:LIONS-EPFL/scion.

About

An implementation of the ScionLight optimizer from https://arxiv.org/abs/2502.07529 Training Deep Learning Models with Norm-Constrained LMOs

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages