Högskolan i Skövde

his.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • apa-cv
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Real-Time Automatic Checkout via Prompt-Based Product Extraction and Cross-Domain Learning
University of Skövde, School of Engineering Science. University of Skövde, Virtual Engineering Research Environment. Jönköping University, Sweden ; Itab Shop Products AB, Jönköping, Sweden. (Virtual Production Development (VPD))ORCID iD: 0000-0001-8880-7965
Dept. of Computer Science and Informatics, Jönköping University, Sweden.ORCID iD: 0000-0003-2900-9335
Dept. of Computing, Jönköping University, Sweden.
2024 (English)In: Proceedings 2024 International Conference on Machine Learning and Applications ICMLA 2024: Miami, Florida 18-20 December 2024 / [ed] M. Arif Wani; Plamen Angelov; Feng Luo; Mitsunori Ogihara Xintao Wu; Radu-Emil Precup; Ramin Ramezani; Xiaowei Gu, IEEE, 2024, p. 1396-1403Conference paper, Published paper (Refereed)
Abstract [en]

Automatic checkout systems are designed to predict a complete shopping receipt using an image from the checkout area. These systems require high classification accuracy across numerous classes and must operate in real-time, despite domain differences between training data and real-world conditions. Building on recent advancements, we propose a method that outperforms current solutions and can be applied in real-time in automatic checkout systems. Our method leverages the Segment Anything Model to extract high-quality masks from lab product images, which are then transformed into synthetic checkout images and adapted to the real domain using contrastive unpaired translation. We train a product recognition model with data augmentation, named SCA+Y8, and further improve it through fine-tuning with pseudo-labels from unlabeled checkout images, resulting in an improved model called SCAFT+Y8. SCAFT+Y8 achieves a great increase in state-of-the-art performance, with an average receipt classification accuracy of 97.58%, and shows strong performance in smaller models, indicating the potential for deployment on low-cost edge devices. 

Place, publisher, year, edition, pages
IEEE, 2024. p. 1396-1403
Series
International Conference on Machine Learning and Applications (ICMLA), ISSN 1946-0740, E-ISSN 1946-0759
Keywords [en]
Automatic Checkout, Domain Adaptation, Object Detection, YOLOv8, Contrastive Learning, Image enhancement, Image segmentation, Object recognition, Classification accuracy, Cross-domain learning, Domain differences, Objects detection, Real- time, Real-world, Training data
National Category
Computer Sciences Computer graphics and computer vision
Research subject
Virtual Production Development (VPD)
Identifiers
URN: urn:nbn:se:his:diva-24982DOI: 10.1109/ICMLA61862.2024.00217ISI: 001468515500208Scopus ID: 2-s2.0-105000879245ISBN: 979-8-3503-7489-6 (print)ISBN: 979-8-3503-7488-9 (electronic)OAI: oai:DiVA.org:his-24982DiVA, id: diva2:1949605
Conference
2024 International Conference on Machine Learning and Applications ICMLA 2024, Miami, Florida, 18-20 December 2024
Funder
Knowledge Foundation, 2020-0044Swedish Research Council, 2022-06725
Note

© 2024 IEEE

The authors would like to thank ITAB Shop Products AB and Smart Industry Sweden (KKS-2020-0044) for their support. The machine learning training was enabled by resources provided by the National Academic Infrastructure for Supercomputing in Sweden (NAISS), partially funded by the Swedish Research Council through grant agreement no. 2022-06725.

Available from: 2025-04-03 Created: 2025-04-03 Last updated: 2025-12-15Bibliographically approved
In thesis
1.
The record could not be found. The reason may be that the record is no longer available or you may have typed in a wrong id in the address field.

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Pettersson, TobiasRiveiro, Maria

Search in DiVA

By author/editor
Pettersson, TobiasRiveiro, Maria
By organisation
School of Engineering ScienceVirtual Engineering Research Environment
Computer SciencesComputer graphics and computer vision

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 200 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • apa-cv
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf