The Role of Symmetry in Optimizing Overparameterized Networks

📰 ArXiv cs.AI

arXiv:2604.25150v2 Announce Type: cross Abstract: Overparameterization is central to the success of deep learning, yet the mechanisms by which it improves optimization remain incompletely understood. We analyze weight-space symmetries in neural networks and show that overparameterization introduces additional symmetries that benefit optimization in two distinct ways. First, we prove that these symmetries act as a form of diagonal preconditioning on the Hessian, enabling the existence of better-c

Published 29 Apr 2026
Read full paper → ← Back to Reads