Högskolan i Skövde

Nanometa Live: A real-time metagenomic analysis pipeline and interface for species classification and pathogen characterization
University of Skövde, School of Bioscience.
2023 (English). Independent thesis, Basic level (degree of Bachelor), 20 credits / 30 HE credits. Student thesis.
Abstract [en]

Metagenomics studies the totality of genomes of all species in a microbial community. It is a young, growing field with medical, industrial, and ecological applications. Abundant metagenomic data is being produced today, but tools for interpreting and visualizing it are lacking. The aim of this project was to create Nanometa Live: a user-friendly, real-time data processing pipeline and graphical user interface that visualizes the general species content of a sample and detects a set of predetermined pathogens. The pipeline was built with Snakemake, with classification by Kraken 2 and sequence validation by BLAST; its input is fastq batch files from an Oxford Nanopore sequencer. The interface was written in Python using the Dash framework and uses the data produced by the pipeline to visualize results. A Sankey plot and a list of the most abundant taxa display the general species content, while a separate table and a gauge, colored to show the pathogenicity of the sample, display the user-determined pathogens that the program looks for. A sunburst plot and an icicle chart enable further exploration of the species composition. Nanometa Live is a fully functioning prototype and can be considered on par with existing tools in terms of analysis speed, computer requirements, and general user-friendliness. Its strengths are ease of interpretation and flexibility in visualizations; its weaknesses are missing functionality, such as antibiotic resistance detection, and imperfections in code, structure, and packaging.
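
The step from classification output to the species list and pathogen table described in the abstract can be illustrated with a minimal sketch. It is not taken from the thesis or from the Nanometa Live code base. It assumes the standard six-column, tab-separated Kraken 2 report (percentage, clade reads, direct reads, rank code, NCBI taxonomy ID, indented name). The function names, the sample report, and the read counts are all hypothetical and serve only as an example.

```python
from typing import NamedTuple


class Taxon(NamedTuple):
    percent: float    # share of reads in this clade
    clade_reads: int  # reads assigned to this taxon or its descendants
    rank: str         # Kraken 2 rank code, e.g. "S" for species
    taxid: int        # NCBI taxonomy ID
    name: str         # scientific name, indentation stripped


def parse_kraken2_report(text: str) -> list[Taxon]:
    """Parse a standard six-column, tab-separated Kraken 2 report."""
    taxa = []
    for line in text.splitlines():
        if not line.strip():
            continue
        pct, clade, _direct, rank, taxid, name = line.split("\t")
        taxa.append(Taxon(float(pct), int(clade), rank, int(taxid), name.strip()))
    return taxa


def top_species(taxa: list[Taxon], n: int = 10) -> list[Taxon]:
    """Return the n species-level taxa with the most clade reads."""
    species = [t for t in taxa if t.rank == "S"]
    return sorted(species, key=lambda t: t.clade_reads, reverse=True)[:n]


def pathogen_hits(taxa: list[Taxon], watchlist: set[int]) -> dict[int, int]:
    """Map each watch-listed taxonomy ID found in the report to its clade reads."""
    return {t.taxid: t.clade_reads for t in taxa if t.taxid in watchlist}


# Hypothetical report with made-up read counts, for illustration only.
SAMPLE_REPORT = "\n".join([
    " 10.00\t100\t100\tU\t0\tunclassified",
    " 90.00\t900\t5\tR\t1\troot",
    " 60.00\t600\t600\tS\t562\t        Escherichia coli",
    " 25.00\t250\t250\tS\t1392\t        Bacillus anthracis",
])

if __name__ == "__main__":
    taxa = parse_kraken2_report(SAMPLE_REPORT)
    for t in top_species(taxa, n=5):
        print(f"{t.name}: {t.clade_reads} reads")
    # The watch list of taxonomy IDs is an assumption of this sketch.
    print(pathogen_hits(taxa, watchlist={1392, 263}))
```

In a real-time setting, a sketch like this would be re-run on each new report as batches arrive. How Nanometa Live itself schedules updates and validates hits with BLAST is not specified in this record.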

Place, publisher, year, edition, pages
2023. p. 44
National Category
Bioinformatics and Systems Biology
Identifiers
URN: urn:nbn:se:his:diva-22746
OAI: oai:DiVA.org:his-22746
DiVA, id: diva2:1770687
External cooperation
Totalförsvarets forskningsinstitut (FOI)
Subject / course
Bioinformatics
Educational program
Molekylär bioinformatik
Supervisors
Examiners
Available from: 2023-06-19. Created: 2023-06-19. Last updated: 2023-06-19. Bibliographically approved.

Open Access in DiVA

fulltext (2169 kB), 397 downloads
File information
File name: FULLTEXT01.pdf
File size: 2169 kB
Checksum (SHA-512): e1450b18138818dbd237c1076102c6fc47b4ba65b3db46518547264b6829020f206f62dbfeb77d1edf95a77c453cd8cdb5734acf6da3bf39d9b43566dbae609b
Type: fulltext
Mimetype: application/pdf
