Comparative Analysis of Over-Representation Analysis and Gene Set Enrichment Analysis Methods in KEGG Pathway Enrichment: Insights from Inflammatory Bowel Disease Studies
2024 (English)Independent thesis Advanced level (degree of Master (One Year)), 20 credits / 30 HE credits
Student thesis
Abstract [en]
Inflammatory Bowel Disease (IBD) is a complicated and multifaceted illness marked by chronic inflammation of the gastrointestinal system. Understanding the underlying molecular causes of IBD is critical for creating tailored therapeutics and better patient outcomes. This thesis compares Over-Representation Analysis (ORA) and Gene Set Enrichment Analysis (GSEA) methods on two datasets, GSE66407 and GSE186582. Three ORA tools—ClusterProfiler, WebGestalt, and Enrichr—were tested to determine their sensitivity, specificity, and robustness in discovering enriched pathways. The results showed that the techniques were quite consistent, especially in the GSE186582 dataset, where there was perfect overlap in pathway discovery. However, small differences were seen in the GSE66407 dataset, with ClusterProfiler and Enrichr detecting distinct pathways. Furthermore, GSEA was conducted using clusterProfiler, WebGestalt, and fgsea, which provided a dynamic analysis of the whole ranked list of genes. GSEA results from both datasets consistently revealed immune-related pathways involved in IBD pathogenesis, including IL-17 signaling, NF-kappaB signaling, and TNF signaling. Particularly, ClusterProfiler, WebGestalt, and fgsea both identified many distinct pathways, demonstrating differences in pathway recognition depending on the approach utilized. While ORA approaches are strong and accurate for quickly identifying pathways, GSEA may give a more detailed knowledge of the biological processes by capturing minor gene expression variations.
Place, publisher, year, edition, pages
2024. , p. 33
Keywords [en]
IBD, pathway analysis, ORA, GSEA, ClusterProfiler, WebGestalt, fgsea, Enrichr
National Category
Bioinformatics and Computational Biology
Identifiers
URN: urn:nbn:se:his:diva-24715OAI: oai:DiVA.org:his-24715DiVA, id: diva2:1914650
Subject / course
Bioinformatics
Educational program
Bioinformatics - Master’s Programme
Supervisors
Examiners
2024-11-202024-11-202025-09-29Bibliographically approved