Background: Gene-set analysis (GSA) has been commonly used to identify significantly altered pathways or functions from omics data. However, GSA often yields a long list of gene-sets, necessitating efficient post-processing for improved interpretation. Existing methods cluster the gene-sets based on the extent of their overlap to summarize the GSA results without considering interactions between gene-sets.

Results: Here, we presented a novel network-weighted gene-set clustering that incorporates both the gene-set overlap and protein-protein interaction (PPI) networks. Three examples were demonstrated for microarray gene expression, GWAS summary, and RNA-sequencing data to which different GSA methods were applied. These examples as well as a global analysis show that the proposed method increases PPI densities and functional relevance of the resulting clusters. Additionally, distinct properties of gene-set distance measures were compared. The methods are implemented as an R/Shiny package GScluster that provides gene-set clustering and diverse functions for visualization of gene-sets and PPI networks.

Conclusions: Network-weighted gene-set clustering provides functionally more relevant gene-set clusters and related network analysis.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6507172PMC
http://dx.doi.org/10.1186/s12864-019-5738-6DOI Listing

Publication Analysis

Top Keywords

gene-set clustering
16
network-weighted gene-set
12
gene-set
8
data gsa
8
gscluster network-weighted
4
clustering
4
analysis
4
clustering analysis
4
analysis background
4
background gene-set
4

Similar Publications

Identification of Bacterial Lipopolysaccharide-Associated Genes and Molecular Subtypes in Autism Spectrum Disorder.

Pharmgenomics Pers Med

January 2025

Department of Clinical Medicine, North Sichuan Medical College, Nanchong, Sichuan, 637000, People's Republic of China.

Background: Autism spectrum disorder (ASD) is a complex neurodevelopmental condition marked by diverse symptoms affecting social interaction, communication, and behavior. This research aims to explore bacterial lipopolysaccharide (LPS)- and immune-related (BLI) molecular subgroups in ASD to enhance understanding of the disorder.

Methods: We analyzed 89 control samples and 157 ASD samples from the GEO database, identifying BLI signatures using least absolute shrinkage and selection operator regression (LASSO) and logistic regression machine learning algorithms.

View Article and Find Full Text PDF

Background: Pancreatic ductal adenocarcinoma (PDAC) has a heterogeneous make-up of myeloid cells that influences the therapeutic response and prognosis. However, understanding the myeloid cell at both a genetic and cellular level remains a significant challenge.

Methods: Single-cell RNA sequencing (scRNA-seq) data were downloaded from t the Tumor Immune Single-cell Hub and gene expression data were retrieved from The Cancer Genome Atlas (TCGA) database and the Gene Expression Omnibus (GEO) database.

View Article and Find Full Text PDF

Identification of pain-related long non-coding RNAs for pulpitis prediction.

Clin Oral Investig

January 2025

Department of Endodontics, Guangdong Engineering Research Center of Oral Restoration and Reconstruction, Guangzhou Key Laboratory of Basic and Applied Research of Oral Regenerative Medicine, Affiliated Stomatology Hospital of Guangzhou Medical University, Guangzhou, China.

Objectives: We investigated the recently generated RNA-sequencing dataset of pulpitis to identify the potential pain-related lncRNAs for pulpitis prediction.

Materials And Methods: Differential analysis was performed on the gene expression profile between normal and pulpitis samples to obtain pulpitis-related genes. The co-expressed gene modules were identified by weighted gene coexpression network analysis (WGCNA).

View Article and Find Full Text PDF

Background: Sarcopenia, an aseptic chronic inflammatory disease, is a complex and debilitating disease characterized by the progressive degeneration of skeletal muscle. PANoptosis, a novel proinflammatory programmed cell death pathway, has been linked to various diseases. However, the precise role of PANoptosis-related features in sarcopenia remains uncertain.

View Article and Find Full Text PDF

Background: Metabolic Syndrome (MS) is a cluster of conditions that significantly increase the risk of infertility in women. Granulosa cells are crucial for ovarian folliculogenesis and fertility. Understanding molecular alterations in these cells can provide insights into MS-associated infertility.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!