Reassessing the modularity of gene co-expression networks using the Stochastic Block Model.

PLoS Comput Biol

Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America.

Published: July 2024

AI Article Synopsis

  • Gene co-expression networks are often analyzed through community detection algorithms that assume genes form distinct modules based on their associations.
  • This study explores a new approach using the weighted degree corrected stochastic block model (SBM), which does not require assumptions about modular organization, to identify gene communities.
  • The findings reveal that SBM can discover significantly more gene groups than traditional methods, with many identified communities being non-modular yet functionally enriched, suggesting a more complex structure in gene co-expression than previously recognized.

Article Abstract

Finding communities in gene co-expression networks is a common first step toward extracting biological insight from these complex datasets. Most community detection algorithms expect genes to be organized into assortative modules, that is, groups of genes that are more associated with each other than with genes in other groups. While it is reasonable to expect that these modules exist, using methods that assume they exist a priori is risky, as it guarantees that alternative organizations of gene interactions will be ignored. Here, we ask: can we find meaningful communities without imposing a modular organization on gene co-expression networks, and how modular are these communities? For this, we use a recently developed community detection method, the weighted degree corrected stochastic block model (SBM), that does not assume that assortative modules exist. Instead, the SBM attempts to efficiently use all information contained in the co-expression network to separate the genes into hierarchically organized blocks of genes. Using RNAseq gene expression data measured in two tissues derived from an outbred population of Drosophila melanogaster, we show that (a) the SBM is able to find ten times as many groups as competing methods, that (b) several of those gene groups are not modular, and that (c) the functional enrichment for non-modular groups is as strong as for modular communities. These results show that the transcriptome is structured in more complex ways than traditionally thought and that we should revisit the long-standing assumption that modularity is the main driver of the structuring of gene co-expression networks.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11309492PMC
http://dx.doi.org/10.1371/journal.pcbi.1012300DOI Listing

Publication Analysis

Top Keywords

gene co-expression
16
co-expression networks
16
stochastic block
8
block model
8
community detection
8
assortative modules
8
modules exist
8
gene
7
co-expression
5
genes
5

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!