Measuring multi-calibration
📰 ArXiv cs.AI
arXiv:2506.11251v2 Announce Type: replace-cross Abstract: A suitable scalar metric can help measure multi-calibration, defined as follows. When the expected values of observed responses are equal to corresponding predicted probabilities, the probabilistic predictions are known as "perfectly calibrated." When the predicted probabilities are perfectly calibrated simultaneously across several subpopulations, the probabilistic predictions are known as "perfectly multi-calibrated." In practice, predi
DeepCamp AI