Högskolan i Skövde

his.sePublikasjoner
Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • apa-cv
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Predicting Customer Churn in Retailing
Högskolan i Skövde, Institutionen för informationsteknologi. Högskolan i Skövde, Forskningsmiljön Informationsteknologi.ORCID-id: 0000-0001-5378-0862
Dept. of Computing, Jönköping University, Sweden.
Dept. of Information Technology, University of Borås, Sweden.
Högskolan i Skövde, Institutionen för informationsteknologi. Högskolan i Skövde, Forskningsmiljön Informationsteknologi. (Interaction Lab (iLab))ORCID-id: 0000-0002-7554-2301
2022 (engelsk)Inngår i: Proceedings 21st IEEE International Conference on Machine Learning and Applications ICMLA 2022: 12–14 December 2022 Nassau, The Bahamas / [ed] M. Arif Wani; Mehmed Kantardzic; Vasile Palade; Daniel Neagu; Longzhi Yang; Kit-Yan Chan, IEEE, 2022, s. 635-640Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

Customer churn is one of the most challenging problems for digital retailers. With significantly higher costs for acquiring new customers than retaining existing ones, knowledge about which customers are likely to churn becomes essential. This paper reports a case study where a data-driven approach to churn prediction is used for predicting churners and gaining insights about the problem domain. The real-world data set used contains approximately 200 000 customers, describing each customer using more than 50 features. In the pre-processing, exploration, modeling and analysis, attributes related to recency, frequency, and monetary concepts are identified and utilized. In addition, correlations and feature importance are used to discover and understand churn indicators. One important finding is that the churn rate highly depends on the number of previous purchases. In the segment consisting of customers with only one previous purchase, more than 75% will churn, i.e., not make another purchase in the coming year. For customers with at least four previous purchases, the corresponding churn rate is around 25%. Further analysis shows that churning customers in general, and as expected, make smaller purchases and visit the online store less often. In the experimentation, three modeling techniques are evaluated, and the results show that, in particular, Gradient Boosting models can predict churners with relatively high accuracy while obtaining a good balance between precision and recall. 

sted, utgiver, år, opplag, sider
IEEE, 2022. s. 635-640
Emneord [en]
Sales, Case-studies, Churn rates, Correlation, Customer churn prediction, Customer churns, Digital retailing, Feature importance, High costs, RFM analysis, Top probability, Forecasting, correlations, top probabilities
HSV kategori
Forskningsprogram
Interaction Lab (ILAB)
Identifikatorer
URN: urn:nbn:se:his:diva-22430DOI: 10.1109/ICMLA55696.2022.00105ISI: 000980994900094Scopus ID: 2-s2.0-85152214345ISBN: 978-1-6654-6283-9 (digital)ISBN: 978-1-6654-6284-6 (tryckt)OAI: oai:DiVA.org:his-22430DiVA, id: diva2:1751909
Konferanse
2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA), 12-14 December 2022, Nassau, Bahamas
Forskningsfinansiär
Knowledge Foundation, 20160035, 20170215
Merknad

© 2022 IEEE.

The current research is a part of the Industrial Graduate School in Digital Retailing (INSiDR) at the University of Borås, funded by the Swedish Knowledge Foundation, grants nr. 20160035, 20170215.

Tilgjengelig fra: 2023-04-20 Laget: 2023-04-20 Sist oppdatert: 2025-09-29bibliografisk kontrollert
Inngår i avhandling
1. Data-driven decision support in digital retailing
Åpne denne publikasjonen i ny fane eller vindu >>Data-driven decision support in digital retailing
2023 (engelsk)Licentiatavhandling, med artikler (Annet vitenskapelig)
Abstract [en]

In the digital era and advent of artificial intelligence, digital retailing has emerged as a notable shift in commerce. It empowers e-tailers with data-driven insights and predictive models to navigate a variety of challenges, driving informed decision-making and strategic formulation. While predictive models are fundamental for making data-driven decisions, this thesis spotlights binary classifiers as a central focus. These classifiers reveal the complexities of two real-world problems, marked by their particular properties. Specifically, binary decisions are made based on predictions, relying solely on predicted class labels is insufficient because of the variations in classification accuracy. Furthermore, prediction outcomes have different costs associated with making different mistakes, which impacts the utility.

To confront these challenges, probabilistic predictions, often unexplored or uncalibrated, is a promising alternative to class labels. Therefore, machine learning modelling and calibration techniques are explored, employing benchmark data sets alongside empirical studies grounded in industrial contexts. These studies analyse predictions and their associated probabilities across diverse data segments and settings. The thesis found, as a proof of concept, that specific algorithms inherently possess calibration while others, with calibrated probabilities, demonstrate reliability. In both cases, the thesis concludes that utilising top predictions with the highest probabilities increases the precision level and minimises the false positives. In addition, adopting well-calibrated probabilities is a powerful alternative to mere class labels. Consequently, by transforming probabilities into reliable confidence values through classification with a rejection option, a pathway emerges wherein confident and reliable predictions take centre stage in decision-making. This enables e-tailers to form distinct strategies based on these predictions and optimise their utility.

This thesis highlights the value of calibrated models and probabilistic prediction and emphasises their significance in enhancing decision-making. The findings have practical implications for e-tailers leveraging data-driven decision support. Future research should focus on producing an automated system that prioritises high and well-calibrated probability predictions while discarding others and optimising utilities based on the costs and gains associated with the different prediction outcomes to enhance decision support for e-tailers.

sted, utgiver, år, opplag, sider
Skövde: University of Skövde, 2023. s. xiii, 108
Serie
Dissertation Series ; 53
Emneord
Digital Retailing, Decision Support, Probabilistic Prediction, Calibration, Product Returns, Customer Churn, Binary Classification, Scikit-Learn
HSV kategori
Identifikatorer
urn:nbn:se:his:diva-23279 (URN)978-91-987906-7-2 (ISBN)
Presentation
2023-10-31, G111, Högskolan i Skövde, Skövde, 13:15 (engelsk)
Opponent
Veileder
Forskningsfinansiär
Knowledge Foundation
Merknad

The current thesis is a part of the industrial graduate school in digital retailing (INSiDR) at the University of Borås and funded by the Swedish Knowledge Foundation.

Tilgjengelig fra: 2023-10-03 Laget: 2023-10-03 Sist oppdatert: 2025-09-29bibliografisk kontrollert

Open Access i DiVA

Fulltekst mangler i DiVA

Andre lenker

Forlagets fulltekstScopus

Person

Sweidan, DirarAlenljung, Beatrice

Søk i DiVA

Av forfatter/redaktør
Sweidan, DirarAlenljung, Beatrice
Av organisasjonen

Søk utenfor DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric

doi
isbn
urn-nbn
Totalt: 640 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • apa-cv
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf