Skip to content
arXiv stat.ML · Papers

Evaluation Metrics as Averaged Outcomes of Fair Gambles

arXiv:2401.14483v4 Announce Type: replace-cross Abstract: In the current practices of machine learning, the evaluation of forecasts has become a cornerstone of scientific progress. A multitude of evaluation metrics have been suggested and used to qualify "good" forecasts. What do those metrics share? How are they relat