Högskolan i Skövde

his.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • apa-cv
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Evaluating Calibration Techniques for Reliable Predictions
University of Skövde, School of Informatics. University of Skövde, Informatics Research Environment. Department of Computing, Jönköping University, Sweden. (Interaction Lab (iLab))ORCID iD: 0000-0003-4224-3740
Department of Computing, Jönköping University, Sweden.
Department of Computing, Jönköping University, Sweden.
2025 (English)In: Machine Learning and Soft Computing: 9th International Conference, ICMLSC 2025, Tokyo, Japan, January 24–26, 2025, Revised Selected Papers, Part II / [ed] Letian Huang, Springer, 2025, p. 159-175Chapter in book (Refereed)
Abstract [en]

In data-driven decision support, having access to reliable confidence measures for individual predictions is crucial. Machine learning algorithms can provide probabilistic predictions, but these are often poorly calibrated, resulting in misleading decision support. This study empirically evaluates a set of readily available state-of-the-art calibration techniques, including both scaling and binning approaches. Using four different underlying models, and in total 40 publicly available datasets, the results analyzed using rigorous statistical testing show that calibration is generally successful. Specifically, applying a post-hoc calibration will reduce both log losses and calibration errors, without significantly lowering the predictive accuracy. However, the choice of calibration technique should depend on both the underlying model and the size of the dataset, resulting in the following guidelines for calibration: (i) We recommend Venn-Abers for decision trees and naïve Bayes (ii) Beta calibration for Extreme Gradient Boosting (XGB), (iii) Platt scaling for small datasets and Venn-Abers for larger ones when using random forests.

Place, publisher, year, edition, pages
Springer, 2025. p. 159-175
Series
Communications in Computer and Information Science, ISSN 1865-0929, E-ISSN 1865-0937 ; 2488
Keywords [en]
Calibration, Decision support, Machine learning, Probabilistic prediction, Reliability, Calibration techniques, Confidence Measure, Data driven decision, Decision supports, Individual prediction, Machine learning algorithms, Machine-learning, Scalings, State of the art
National Category
Other Computer and Information Science
Research subject
Interaction Lab (ILAB)
Identifiers
URN: urn:nbn:se:his:diva-25196DOI: 10.1007/978-981-96-6403-0_14Scopus ID: 2-s2.0-105007227591ISBN: 978-981-96-6402-3 (print)ISBN: 978-981-96-6403-0 (electronic)OAI: oai:DiVA.org:his-25196DiVA, id: diva2:1968035
Conference
Machine Learning and Soft Computing 9th International Conference, ICMLSC 2025, Tokyo, Japan, January 24–26, 2025, Revised Selected Papers, Part II
Note

© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025

Correspondence Address: A. Maalej; Department of Computing, Jönköping University, Jönköping, Sweden; email: aicha.maalej@ju.se

Available from: 2025-06-12 Created: 2025-06-12 Last updated: 2025-09-29

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Maalej, Aicha

Search in DiVA

By author/editor
Maalej, Aicha
By organisation
School of InformaticsInformatics Research Environment
Other Computer and Information Science

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 132 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • apa-cv
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf