VOLTA: The Surprising Ineffectiveness of Auxiliary Losses for Calibrated Deep Learning
📰 ArXiv cs.AI
arXiv:2604.08639v1 Announce Type: cross Abstract: Uncertainty quantification (UQ) is essential for deploying deep learning models in safety critical applications, yet no consensus exists on which UQ method performs best across different data modalities and distribution shifts. This paper presents a comprehensive benchmark of ten widely used UQ baselines including MC Dropout, SWAG, ensemble methods, temperature scaling, energy based OOD, Mahalanobis, hyperbolic classifiers, ENN, Taylor Sensus, an
DeepCamp AI