Accelerating LMO-Based Optimization via Implicit Gradient Transport
📰 ArXiv cs.AI
arXiv:2605.05577v1 Announce Type: cross Abstract: Recent optimizers such as Lion and Muon have demonstrated strong empirical performance by normalizing gradient momentum via linear minimization oracles (LMOs). While variance reduction has been explored to accelerate LMO-based methods, it typically incurs substantial computational overhead due to additional gradient evaluations. At the same time, the theoretical understanding of LMO-based methods remains fragmented across unconstrained and constr
DeepCamp AI