The Role of Symmetry in Optimizing Overparameterized Networks
📰 ArXiv cs.AI
arXiv:2604.25150v2 Announce Type: cross Abstract: Overparameterization is central to the success of deep learning, yet the mechanisms by which it improves optimization remain incompletely understood. We analyze weight-space symmetries in neural networks and show that overparameterization introduces additional symmetries that benefit optimization in two distinct ways. First, we prove that these symmetries act as a form of diagonal preconditioning on the Hessian, enabling the existence of better-c
DeepCamp AI