Högskolan i Skövde

his.sePublikationer
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • apa-cv
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Analysis of deep learning techniques for cancer genomics using federated learning framework
Högskolan i Skövde, Institutionen för informationsteknologi.
2021 (Engelska)Självständigt arbete på avancerad nivå (magisterexamen), 10 poäng / 15 hpStudentuppsats (Examensarbete)
Abstract [en]

Genomics data is complex, high dimensional and confidential. Analysis of such type of data can lead to better understanding of diseases such as cancer. Single cell RNA sequencing is a recent advancement in technology for analyzing cancer at the single cell level. Deep learning techniques have been effectively used by researchers to analyze genomics data. These techniques include feed forward neural networks (FFNN), convolutional neural network (CNN) and long short term memory (LSTM) networks. Federated learning is a framework that involves sharing and aggregation of the machine learning model instead of the data. The project investigates the performance of deep learning techniques for cancer genomics data when implemented using federated learning framework. The two cancer datasets from the Gene Expression Omnibus (GEO) database are identified for the project that contains cell type information based on the gene expressions data. The three deep learning techniques are applied to solve a classification problem. The performance of the models is measured in terms of the f1-score due to class imbalance as these datasets are from tumor sites; therefore the majority class is that of the tumor cell type. Each of the three deep learning models is implemented using the centralised learning framework as well as the federated learning framework. The results demonstrate that the performance of the deep learning models using federated learning for gene expression data is slightly better as compared to that using the centralised learning framework. This supports the fact for further investigation into building deep learning models for heterogeneous genomics data using federated learning to better understand complex diseases such as cancer.

Ort, förlag, år, upplaga, sidor
2021. , s. vi, 42
Nyckelord [en]
Cancer genomics, single cell gene expression, deep learning, federated learning, classification
Nationell ämneskategori
Datavetenskap (datalogi)
Identifikatorer
URN: urn:nbn:se:his:diva-20655OAI: oai:DiVA.org:his-20655DiVA, id: diva2:1603679
Ämne / kurs
Informationsteknologi
Utbildningsprogram
Data Science - magisterprogram
Handledare
Examinatorer
Tillgänglig från: 2021-10-17 Skapad: 2021-10-17 Senast uppdaterad: 2021-10-17Bibliografiskt granskad

Open Access i DiVA

Fulltext saknas i DiVA

Av organisationen
Institutionen för informationsteknologi
Datavetenskap (datalogi)

Sök vidare utanför DiVA

GoogleGoogle Scholar

urn-nbn

Altmetricpoäng

urn-nbn
Totalt: 772 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • apa-cv
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf