Gene set enrichment analysis (GSEA) subsequent to differential expression analysis is a standard step in transcriptomics and proteomics data analysis. Although many tools for this step are available, the results are often difficult to reproduce because set annotations can change in the databases, that is, new features can be added or existing features can be removed. Finally, such changes in set compositions can have an impact on biological interpretation. We present bootGSEA, a novel computational pipeline, to study the robustness of GSEA. By repeating GSEA based on bootstrap samples, the variability and robustness of results can be studied. In our pipeline, not all genes or proteins are involved in the different bootstrap replicates of the analyses. Finally, we aggregate the ranks from the bootstrap replicates to obtain a score per gene set that shows whether it gains or loses evidence compared to the ranking of the standard GSEA. Rank aggregation is also used to combine GSEA results from different omics levels or from multiple independent studies at the same omics level. By applying our approach to six independent cancer transcriptomics datasets, we showed that bootstrap GSEA can aid in the selection of more robust enriched gene sets. Additionally, we applied our approach to paired transcriptomics and proteomics data obtained from a mouse model of spinal muscular atrophy (SMA), a neurodegenerative and neurodevelopmental disease associated with multi-system involvement. After obtaining a robust ranking at both omics levels, both ranking lists were combined to aggregate the findings from the transcriptomics and proteomics results. Furthermore, we constructed the new R-package "bootGSEA," which implements the proposed methods and provides graphical views of the findings. Bootstrap-based GSEA was able in the example datasets to identify gene or protein sets that were less robust when the set composition changed during bootstrap analysis. The rank aggregation step was useful for combining bootstrap results and making them comparable to the original findings on the single-omics level or for combining findings from multiple different omics levels.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11021641 | PMC |
http://dx.doi.org/10.3389/fbinf.2024.1380928 | DOI Listing |
Alzheimers Dement
December 2024
Theme Inflammation and Aging, Karolinska University Hospital, Stockholm, Sweden.
Background: Alzheimer disease (AD) is a progressive neurodegenerative disease that is accountable for the leading case of dementia in elder people. Before, only symptomatic treatments are available for AD. Since 2021, two anti-amyloid antibodies aducanumab and lecanemab have been approved by the US Food and Drug Administration.
View Article and Find Full Text PDFJ Cancer
January 2025
Department of Medical Oncology, The Second Affiliated Hospital, School of Medicine, South China University of Technology, Guangzhou, Guangdong 510180, China.
The pathogenesis of metabolic dysfunction-associated steatotic liver disease-associated hepatocellular carcinoma (MASLD-HCC) is complex and exhibits sex-specific differences. Effective methods for monitoring MASLD progression to HCC are lacking. Transcriptomic data from liver tissue samples sourced from multiple public databases were integrated.
View Article and Find Full Text PDFFood Res Int
January 2025
Faculty of Bioscience Engineering, Department of Food Technology, Safety and Health, Ghent University, Ghent, Belgium.
While reducing the consumption of animal-source foods is recommended for planetary and human health, potential emerging food safety risks associated with the transition to dietary patterns featuring plant-based meat (PBMA) and dairy alternatives (PBDA) remain unexplored. We assessed the exposure to mycotoxins and ranked the associated health risks related to the consumption of PBMA and PBDA. We simulated diets by replacing animal-source proteins with their plant-based alternatives.
View Article and Find Full Text PDFInt J Mol Sci
December 2024
Laboratory for Pathology Dynamics, Department of Biomedical Engineering, Georgia Institute of Technology & Emory University School of Medicine, Atlanta, GA 30322, USA.
The overlapping molecular pathophysiology of Alzheimer's Disease (AD), Amyotrophic Lateral Sclerosis (ALS), and Frontotemporal Dementia (FTD) was analyzed using relationships from a knowledge graph of 33+ million biomedical journal articles. The unsupervised learning rank aggregation algorithm from SemNet 2.0 compared the most important amino acid, peptide, and protein (AAPP) nodes connected to AD, ALS, or FTD.
View Article and Find Full Text PDFComput Biol Med
January 2025
Department of Mathematics, COMSATS University Islamabad, Vehari Campus, 61100, Vehari, Pakistan. Electronic address:
Topological indices, derived from molecular graphs, provide valuable numerical descriptors for the comprehensive analysis of pharmaceuticals. These indices are pivotal in the physicochemical characterization and predictive assessment of various drugs. In this study, we calculate several degree-based topological indices for a range of migraine treatment medications, including aspirin, caffeine, eletriptan, ergotamine, sumatriptan, rizatriptan, verapamil, diclofenac, frovatriptan, and droperidol.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!