Högskolan i Skövde

his.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • apa-cv
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Deep neural network prediction of genome-wide transcriptome signatures – beyond the Black-box
University of Skövde, School of Bioscience. University of Skövde, Systems Biology Research Environment. Bioinformatics, Department of Physics, Chemistry and Biology, Linköping University, Sweden. (Translational Bioinformatics)ORCID iD: 0000-0001-9395-6025
Biological and Environmental Sciences and Engineering Division, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia ; Unit of Computational Medicine, Department of Medicine, Solna, Center for Molecular Medicine, Karolinska Institutet, Stockholm, Sweden ; Science for Life Laboratory, Solna, Sweden.
Bioinformatics, Department of Physics, Chemistry and Biology, Linköping University, Sweden.
2022 (English)In: npj Systems Biology and Applications, E-ISSN 2056-7189, Vol. 8, no 1, article id 9Article in journal (Refereed) Published
Abstract [en]

Prediction algorithms for protein or gene structures, including transcription factor binding from sequence information, have been transformative in understanding gene regulation. Here we ask whether human transcriptomic profiles can be predicted solely from the expression of transcription factors (TFs). We find that the expression of 1600 TFs can explain >95% of the variance in 25,000 genes. Using the light-up technique to inspect the trained NN, we find an over-representation of known TF-gene regulations. Furthermore, the learned prediction network has a hierarchical organization. A smaller set of around 125 core TFs could explain close to 80% of the variance. Interestingly, reducing the number of TFs below 500 induces a rapid decline in prediction performance. Next, we evaluated the prediction model using transcriptional data from 22 human diseases. The TFs were sufficient to predict the dysregulation of the target genes (rho = 0.61, P < 10−216). By inspecting the model, key causative TFs could be extracted for subsequent validation using disease-associated genetic variants. We demonstrate a methodology for constructing an interpretable neural network predictor, where analyses of the predictors identified key TFs that were inducing transcriptional changes during disease.

Place, publisher, year, edition, pages
Springer Nature, 2022. Vol. 8, no 1, article id 9
National Category
Bioinformatics and Systems Biology Genetics Biochemistry and Molecular Biology
Research subject
Bioinformatics
Identifiers
URN: urn:nbn:se:his:diva-20941DOI: 10.1038/s41540-022-00218-9ISI: 000760233400001PubMedID: 35197482Scopus ID: 2-s2.0-85125212745OAI: oai:DiVA.org:his-20941DiVA, id: diva2:1640666
Note

CC BY 4.0

Correspondence and requests for materials should be addressed to Rasmus Magnusson email: rasmus.magnusson@his.se

Open access funding provided by University of Skövde.

Available from: 2022-02-25 Created: 2022-02-25 Last updated: 2024-08-30Bibliographically approved

Open Access in DiVA

fulltext(1507 kB)160 downloads
File information
File name FULLTEXT01.pdfFile size 1507 kBChecksum SHA-512
6a42fcde12f596c06b4d3d484b971281b4056c6b1a9cbf2a708b795d517659ac262bc8f6a5052e8368800d7c5d28960f204f454ea0fffb5ffa6ce7dba589f6b9
Type fulltextMimetype application/pdf

Other links

Publisher's full textPubMedScopus

Authority records

Magnusson, Rasmus

Search in DiVA

By author/editor
Magnusson, Rasmus
By organisation
School of BioscienceSystems Biology Research Environment
In the same journal
npj Systems Biology and Applications
Bioinformatics and Systems BiologyGeneticsBiochemistry and Molecular Biology

Search outside of DiVA

GoogleGoogle Scholar
Total: 160 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 230 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • apa-cv
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf