Accelerating LMO-Based Optimization via Implicit Gradient Transport

📰 ArXiv cs.AI

arXiv:2605.05577v1 Announce Type: cross Abstract: Recent optimizers such as Lion and Muon have demonstrated strong empirical performance by normalizing gradient momentum via linear minimization oracles (LMOs). While variance reduction has been explored to accelerate LMO-based methods, it typically incurs substantial computational overhead due to additional gradient evaluations. At the same time, the theoretical understanding of LMO-based methods remains fragmented across unconstrained and constr

Published 9 May 2026

Read full paper → ← Back to Reads