Högskolan i Skövde

DIGITNET: A Deep Handwritten Digit Detection and Recognition Method Using a New Historical Handwritten Digit Dataset
University of Skövde, School of Informatics. University of Skövde, Informatics Research Environment. Department of Computer Science, Blekinge Institute of Technology, Karlskrona. (Skövde Artificial Intelligence Lab (SAIL)) ORCID iD: 0000-0001-5762-6678
Department of Mechatronics Engineering, KTO Karatay University, Konya, Turkey.
Arkiv Digital, Växjö, Sweden.
Department of Computer Science, School of Engineering, Jönköping University, Sweden.
2021 (English)
In: Big Data Research, ISSN 2214-5796, E-ISSN 2214-580X, Vol. 23, article id 100182
Article in journal (Refereed), Published
Abstract [en]

This paper introduces a novel deep learning architecture, named DIGITNET, and a large-scale digit dataset, named DIDA, to detect and recognize handwritten digits in historical document images written in the nineteenth century. To generate the DIDA dataset, digit images were collected from 100,000 Swedish handwritten historical document images, which were written by different priests with different handwriting styles. The dataset contains three sub-datasets: single digits, large-scale bounding-box-annotated multi-digit images, and digit strings, with 250,000, 25,000, and 200,000 samples, respectively, in the Red-Green-Blue (RGB) color space. DIDA is used to train the DIGITNET network, which consists of two deep learning architectures, DIGITNET-dect and DIGITNET-rec, that respectively isolate digits and recognize digit strings in historical handwritten documents. In the DIGITNET-dect architecture, features are extracted from digits by three residual units, each of which has three convolutional neural network structures, and a detection strategy based on the You Only Look Once (YOLO) algorithm is then employed to detect handwritten digits at two different scales. In DIGITNET-rec, the detected isolated digits are passed through three differently designed Convolutional Neural Network (CNN) architectures, and the classification results of the three CNNs are combined using a voting scheme to recognize digit strings. The proposed model is also trained with various existing handwritten digit datasets and then validated on historical handwritten digit strings. The experimental results show that the proposed architecture trained with DIDA (publicly available from: https://didadataset.github.io/DIDA/) outperforms the state-of-the-art methods.
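
The abstract states that DIGITNET-rec combines the outputs of three CNNs with a voting scheme but does not say which one. As a minimal sketch only, assuming a plain majority vote per digit with ties resolved in favour of the first classifier, the combination step might look like the following. The function names `majority_vote` and `recognize_digit_string` and the tie-breaking rule are illustrative assumptions, not taken from the paper.

```python
from collections import Counter
from typing import Sequence


def majority_vote(predictions: Sequence[int]) -> int:
    """Return the most frequent label among the classifiers' predictions.

    Assumption (not from the paper): when several labels are tied,
    the label predicted by the earliest classifier in the list wins.
    """
    if not predictions:
        raise ValueError("predictions must be non-empty")
    counts = Counter(predictions)
    best = max(counts.values())
    # Scan in classifier order so that ties go to the earliest classifier.
    for label in predictions:
        if counts[label] == best:
            return label
    raise AssertionError("unreachable: at least one label has the best count")


def recognize_digit_string(per_digit_predictions: Sequence[Sequence[int]]) -> str:
    """Vote on each detected digit (in reading order) and join the results."""
    return "".join(str(majority_vote(p)) for p in per_digit_predictions)


# Hypothetical outputs of three CNNs for four detected digits.
example = [[1, 1, 7], [8, 8, 8], [4, 9, 4], [2, 3, 5]]
print(recognize_digit_string(example))
```

Each inner list holds the labels that the three classifiers assign to one detected digit. In the last position all three disagree, so under the assumed tie rule the first classifier's label is kept.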

Place, publisher, year, edition, pages
Elsevier, 2021. Vol. 23, article id 100182
Keywords [en]
DIDA handwritten digit dataset, Digit string recognition, Ensemble deep learning, Handwritten digit detection, Historical handwritten documents
National Category
Computer graphics and computer vision; Computer Sciences
Research subject
Skövde Artificial Intelligence Lab (SAIL)
Identifiers
URN: urn:nbn:se:his:diva-19396
DOI: 10.1016/j.bdr.2020.100182
ISI: 000609166100006
Scopus ID: 2-s2.0-85098972737
OAI: oai:DiVA.org:his-19396
DiVA, id: diva2:1517779
Note

CC BY 4.0

Available from: 2021-01-14 Created: 2021-01-14 Last updated: 2025-09-29 Bibliographically approved

Open Access in DiVA

fulltext (3887 kB), 362 downloads
File information
File name: FULLTEXT01.pdf
File size: 3887 kB
Checksum (SHA-512): 271c2817473952231b550ba86aa1a407ecf2c146e30dc62d6b2be98724edb95e5a08c06c0145068595e8b35c93eac06674976b4281046b948e34c373fa2de9d2
Type: fulltext
Mimetype: application/pdf

Authority records

Kusetogullari, Huseyin
