Högskolan i Skövde

his.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • apa-cv
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Classification of aluminium microscopic images: Leveraging domain knowledge for feature extraction
University of Skövde, School of Informatics.
2024 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

This study seeks to gain a thorough understanding of the unique features exhibited by aluminum inclusions in microscopic images. The ultimate goal is to leverage this knowledge to create a reliable machine learning-based classification system for accurately classifying these inclusions. Through a detailed methodology encompassing domain knowledge acquisition, preprocessing, feature selection, and modelling, the research investigates the performance of various classifiers, such as Support Vector Machine (SVM), Random Forest, and k-Nearest Neighbors (KNN), as well as ensemble techniques, specifically stacking. Initial results indicate suboptimal performance of traditional classifiers without domain-driven feature extraction, with SVM achieving an accuracy of 43%, Random Forest 63%, and KNN 29%. However, significant improvement in accuracy, precision, recall, and F1-score metrics was observed after employing preprocessing steps and leveraging domain knowledge. Specifically, SVM achieved 80% accuracy, KNN 75%, and Random Forest 78% accuracy after preprocessing. Further refinement through hyperparameter tuning and cross-validation led to SVM achieving 90% accuracy, KNN 82%, and Random Forest 90%. Finally, the stacking classifier, combining predictions from diverse base models with a logistic regression meta-learner, demonstrated superior performance with an accuracy of 93.4%. These findings underscore the critical role of domain knowledge in enhancing machine learning-based classification systems for microscopic image analysis, offering valuable insights applicable beyond aluminum manufacturing.

Place, publisher, year, edition, pages
2024. , p. 40
National Category
Information Systems, Social aspects
Identifiers
URN: urn:nbn:se:his:diva-24172OAI: oai:DiVA.org:his-24172DiVA, id: diva2:1881859
External cooperation
Hydro
Subject / course
Informationsteknologi
Educational program
Data Science - Master’s Programme
Supervisors
Examiners
Available from: 2024-07-04 Created: 2024-07-04 Last updated: 2024-07-04Bibliographically approved

Open Access in DiVA

fulltext(821 kB)174 downloads
File information
File name FULLTEXT01.pdfFile size 821 kBChecksum SHA-512
bae186df708f496e2546e78a87c154f1f3dd146250c893a664df574efdb8be52a78b55a482004f1e2aa5f074866ca73b9885ad722ad6e6ac6c0e6fe270c979c8
Type fulltextMimetype application/pdf

By organisation
School of Informatics
Information Systems, Social aspects

Search outside of DiVA

GoogleGoogle Scholar
Total: 181 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 785 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • apa-cv
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf