Högskolan i Skövde

his.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • apa-cv
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Product verification using OCR classification and Mondrian conformal prediction
Department of Computing, Jönköping AI Lab, Jönköping University, Sweden.
University of Skövde, School of Engineering Science. University of Skövde, Virtual Engineering Research Environment. ITAB Shop Products AB, Sweden ; Department of Computing, Jönköping AI Lab, Jönköping University, Sweden. (Produktion och automatiseringsteknik, Production and Automation Engineering)ORCID iD: 0000-0001-8880-7965
Department of Computing, Jönköping AI Lab, Jönköping University, Sweden.
2022 (English)In: Expert systems with applications, ISSN 0957-4174, E-ISSN 1873-6793, Vol. 188, article id 115942Article in journal (Refereed) Published
Abstract [en]

The retail sector is undergoing an apparent digital transformation that completely revolutionises shopping operations. To stay competitive, retailer stakeholders are forced to rethink and improve their business models to provide an attractive personalised experience to consumers. The self-service checkout process is at the heart of this transformation and should be designed to identify the products accurately and detect any possible anomalous behaviour. In this paper, we introduce a product verification system based on OCR classification and Mondrian conformal prediction. The proposed system includes three components: OCR reading, text classification and product verification. By using image data from existing grocery stores, the system can detect anomalies with high performance, even when there is partial text information on the products. This makes the system applicable for reducing shrinkage loss (caused, for example, by employee theft or shoplifting) in grocery stores by identifying fraudulent behaviours such as barcode switching and miss-scan. Additionally, OCR reading with NLP classification shows that it is in itself a powerful classifier of products. 

Place, publisher, year, edition, pages
Elsevier, 2022. Vol. 188, article id 115942
Keywords [en]
Mondrian conformal prediction, OCR classification, Retail product verification, Smart self-checkout system, Classification (of information), Forecasting, Text processing, Business models, Conformal predictions, Digital transformation, Grocery stores, Mondrian, Product verification, Sales
National Category
Computer Sciences
Research subject
Production and Automation Engineering
Identifiers
URN: urn:nbn:se:his:diva-20675DOI: 10.1016/j.eswa.2021.115942ISI: 000768193500002Scopus ID: 2-s2.0-85117127725OAI: oai:DiVA.org:his-20675DiVA, id: diva2:1606938
Note

CC BY-NC-ND 4.0

© 2021 The Authors

Corresponding author: rachid.oucheikh@ju.se (R. Oucheikh)

Corresponding author at ITAB Shop Products AB, Sweden: tobias.pettersson@itab.com (T. Pettersson)

tuwe.lofstrom@ju.se (T. Löfström)

URL: https://ju.se/jail/datakind (R. Oucheikh)

Available from: 2021-10-29 Created: 2021-10-29 Last updated: 2025-12-15Bibliographically approved
In thesis
1. Product Recognition with OCR Text: Advancing Grocery Product Recognition through Robust Approaches, Fine-Grained Recognition, and Domain Adaptation for Real-Time Performance
Open this publication in new window or tab >>Product Recognition with OCR Text: Advancing Grocery Product Recognition through Robust Approaches, Fine-Grained Recognition, and Domain Adaptation for Real-Time Performance
2025 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

The physical retail sector faces challenges in improving operational efficiency, reducing costs, and enhancing customer experience. Over the past decade, companies have introduced product recognition technology solutions to improve checkout efficiency, inventory management, and fraud detection. However, most initiatives have struggled to scale or achieve sufficient accuracy due to the complex nature of physical retail, which includes a large number of products that continuously change, as well as varied environmental conditions. In parallel, academic research has tackled many of these challenges by providing datasets and new methods to improve recognition performance, but considerable challenges persist.

This thesis addresses three main challenges in grocery product recognition: robust recognition, recognition of visually similar products, and domain adaptation between different retail systems. To address these challenges, the work centers on the use of Optical Character Recognition (OCR) to extract textual information found on product packaging for product recognition. With extensive experiments and the creation of a dataset, the results show that OCR-based methods for product recognition can improve recognition robustness, enable more accurate differentiation between similar products, and also work across different retail systems.

Therefore, the main contribution of this thesis is the development and validation of these OCR text-based methods and approaches, specifically designed to address the requirements in physical retail.

Abstract [sv]

Den fysiska detaljhandeln står inför utmaningar med att förbättra den operativa effektiviteten, sänka kostnader och stärka kundupplevelsen. Under det senaste decenniet har företag introducerat lösningar baserade på produktigenkänningsteknik för att effektivisera kassaprocesser, förbättra lagerhantering och upptäcka bedrägerier. De flesta initiativ har dock haft svårt att skala upp eller uppnå tillräcklig noggrannhet på grund av detaljhandelns dynamiska natur, som kännetecknas av ett stort och ständigt föränderligt produktsortiment samt skiftande butiksmiljöer. Parallellt har akademisk forskning adresserat många av dessa utmaningar genom att tillhandahålla datamängder och nya metoder för att förbättra igenkänningsprestandan, men betydande utmaningar kvarstår.

Denna avhandling adresserar tre huvudsakliga utmaningar inom produktigenkänning i dagligvaruhandeln: robust igenkänning, finmaskig igenkänning av visuellt liknande produkter samt domänanpassning mellan olika retailsystem. För att hantera dessa utmaningar fokuserar arbetet på att använda optisk teckenläsning (OCR) för att extrahera textinformation från produktförpackningar för produktigenkänning. Genom omfattande experiment och skapandet av en datamängd visar resultaten att OCR-baserade metoder kan förbättra robustheten i igenkänningen, möjliggöra mer noggrann differentiering mellan produkter samt fungera över olika retailmiljöer.

Avhandlingens huvudsakliga bidrag är utvecklingen och valideringen av metoder och tillvägagångssätt för produktigenkänning med text från OCR som möter de unika kraven inom den fysiska detaljhandeln.

Place, publisher, year, edition, pages
Skövde: University of Skövde, 2025. p. xv, 140
Series
Dissertation Series ; 67
National Category
Computer graphics and computer vision Natural Language Processing Artificial Intelligence
Research subject
Virtual Production Development (VPD)
Identifiers
urn:nbn:se:his:diva-26062 (URN)978-91-989080-7-7 (ISBN)978-91-989080-8-4 (ISBN)
Public defence
2026-01-23, ASSAR Industrial Innovation Arena, Kavelbrovägen 2B, 541 36, Skövde, 09:15 (English)
Opponent
Supervisors
Note

PUBLICATIONS WITH LOW RELEVANCE [se även rubriken Delarbeten/List of papers]

7. Puneet Mishra, Aneesh Chauhan, and Tobias Pettersson. “Seeing through plastics: A novel combination of NIR hyperspectral imaging and spectral orthogonalization for detecting fresh fruit inside plastic packaging to support automated barcode less checkouts in supermarkets”. In: Food Control 150 (2023), p. 109762.

8. Faeze Zakaryapour Sayyad, Tobias Pettersson, Seyed Jalaleddin Mousavirad, Irida Shallari, and Mattias O’Nils. “AdAPT: Advertisement detector adaptation under newspaper domain shift with null-based pseudo-labeling”. In: Machine Learning with Applications (2025), p. 100806.DOI: https://doi.org/10.1016/j.mlwa.2025.100806.

Available from: 2025-12-15 Created: 2025-12-12 Last updated: 2026-03-09Bibliographically approved

Open Access in DiVA

fulltext(1249 kB)488 downloads
File information
File name FULLTEXT01.pdfFile size 1249 kBChecksum SHA-512
d7f06756e181b17b030042b66b94b32246abb8c66769560b448c0ca47a0a05a5a71edce81854930771e39aa26540eebb93893cd9d60f42c17c456ca91c37c9c6
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Pettersson, Tobias

Search in DiVA

By author/editor
Pettersson, Tobias
By organisation
School of Engineering ScienceVirtual Engineering Research Environment
In the same journal
Expert systems with applications
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 489 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 817 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • apa-cv
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf