Norm Balancing Optimizers

This repository contains the code accompanying the blog post Norm Balancing Optimizers. It implements BAM (Balanced Axis Momentum), a stripped-down Muon variant that replaces Newton–Schulz orthogonalization with SinkNorm.

The nanoGPT training scripts (with different optimizers) live in nanogpt. The CIFAR-10 MLP and ResNet-18 experiments can be run via run.py using the configs in config. We'll be updating this repository with sbatch scripts that we used to run our sweeps soon!

Note: this code was ported over from an experimental, private repo. If there are issues or broken scripts, let us know!

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
configs		configs
src		src
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Norm Balancing Optimizers

About

Uh oh!

Releases

Packages

Languages

knightron0/bam

Folders and files

Latest commit

History

Repository files navigation

Norm Balancing Optimizers

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages