Evaluating Calibration Techniques for Reliable Predictions
2025 (English)In: Machine Learning and Soft Computing: 9th International Conference, ICMLSC 2025, Tokyo, Japan, January 24–26, 2025, Revised Selected Papers, Part II / [ed] Letian Huang, Springer, 2025, p. 159-175Chapter in book (Refereed)
Abstract [en]
In data-driven decision support, having access to reliable confidence measures for individual predictions is crucial. Machine learning algorithms can provide probabilistic predictions, but these are often poorly calibrated, resulting in misleading decision support. This study empirically evaluates a set of readily available state-of-the-art calibration techniques, including both scaling and binning approaches. Using four different underlying models, and in total 40 publicly available datasets, the results analyzed using rigorous statistical testing show that calibration is generally successful. Specifically, applying a post-hoc calibration will reduce both log losses and calibration errors, without significantly lowering the predictive accuracy. However, the choice of calibration technique should depend on both the underlying model and the size of the dataset, resulting in the following guidelines for calibration: (i) We recommend Venn-Abers for decision trees and naïve Bayes (ii) Beta calibration for Extreme Gradient Boosting (XGB), (iii) Platt scaling for small datasets and Venn-Abers for larger ones when using random forests.
Place, publisher, year, edition, pages
Springer, 2025. p. 159-175
Series
Communications in Computer and Information Science, ISSN 1865-0929, E-ISSN 1865-0937 ; 2488
Keywords [en]
Calibration, Decision support, Machine learning, Probabilistic prediction, Reliability, Calibration techniques, Confidence Measure, Data driven decision, Decision supports, Individual prediction, Machine learning algorithms, Machine-learning, Scalings, State of the art
National Category
Other Computer and Information Science
Research subject
Interaction Lab (ILAB)
Identifiers
URN: urn:nbn:se:his:diva-25196DOI: 10.1007/978-981-96-6403-0_14Scopus ID: 2-s2.0-105007227591ISBN: 978-981-96-6402-3 (print)ISBN: 978-981-96-6403-0 (electronic)OAI: oai:DiVA.org:his-25196DiVA, id: diva2:1968035
Conference
Machine Learning and Soft Computing 9th International Conference, ICMLSC 2025, Tokyo, Japan, January 24–26, 2025, Revised Selected Papers, Part II
Note
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025
Correspondence Address: A. Maalej; Department of Computing, Jönköping University, Jönköping, Sweden; email: aicha.maalej@ju.se
2025-06-122025-06-122025-09-29