Summary: The increasing number of publicly available bacterial gene expression data sets provides an unprecedented resource for the study of gene regulation in diverse conditions, but emphasizes the need for self-supervised methods for the automated generation of new hypotheses. One approach for inferring coordinated regulation from bacterial expression data is through neural networks known as denoising autoencoders (DAEs) which encode large datasets in a reduced bottleneck layer. We have generalized this application of DAEs to include deep networks and explore the effects of network architecture on gene set inference using deep learning. We developed a DAE-based pipeline to extract gene sets from transcriptomic data in , validate our method by comparing inferred gene sets with known pathways, and have used this pipeline to explore how the choice of network architecture impacts gene set recovery. We find that increasing network depth leads the DAEs to explain gene expression in terms of fewer, more concisely defined gene sets, and that adjusting the width results in a tradeoff between generalizability and biological inference. Finally, leveraging our understanding of the impact of DAE architecture, we apply our pipeline to an independent uropathogenic dataset to identify genes uniquely induced during human colonization.

Availability And Implementation: https://github.com/BarquistLab/DAE_architecture_exploration.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11256956PMC
http://dx.doi.org/10.1093/bioadv/vbae066DOI Listing

Publication Analysis

Top Keywords

gene sets
16
gene
9
network depth
8
denoising autoencoders
8
gene expression
8
expression data
8
network architecture
8
gene set
8
sets
5
network
4

Similar Publications

Flax ( L.) is known as a dual-purpose crop, producing both fiber and oil, which have a wide range of uses. Successful flax breeding requires knowledge on the genetic determinants of flax traits.

View Article and Find Full Text PDF

Reproductive success and ultimately species survival at a population level is contingent on a plethora of neuroendocrine signals working in concert to regulate gonadal function and reproductive behavior. Among these, the neuropeptide kisspeptin (encoded by the KISS1/Kiss1 gene) has emerged as the master regulator of the hypothalamic-pituitary-gonadal axis. Besides the hypothalamus, both kisspeptin and its cognate receptor are extensively expressed throughout cortico-limbic brain structures in rodents and humans, which are regions traditionally implicated in behavioral and emotional responses.

View Article and Find Full Text PDF

Multi-omics analysis reveals distinct gene regulatory mechanisms between primary and organoid-derived human hepatocytes.

Dis Model Mech

January 2025

Department of Molecular Biology, Faculty of Science, Radboud Institute for Molecular Life Science, Radboud University, Nijmegen 6525GA, The Netherlands.

Hepatic organoid cultures are a powerful model to study liver development and diseases in vitro. However, hepatocyte-like cells differentiated from these organoids remain immature compared to primary human hepatocytes (PHHs), which are the benchmark in the field. Here, we applied integrative single-cell transcriptome and chromatin accessibility analysis to reveal gene regulatory mechanisms underlying these differences.

View Article and Find Full Text PDF

Average nucleotide identity (ANI) is a widely used metric to estimate genetic relatedness, especially in microbial species delineation. While ANI calculation has been well optimized for bacteria and closely related viral genomes, accurate estimation of ANI below 80%, particularly in large reference data sets, has been challenging due to a lack of accurate and scalable methods. To bridge this gap, we introduce MANIAC, an efficient computational pipeline optimized for estimating ANI and alignment fraction (AF) in viral genomes with divergence around ANI of 70%.

View Article and Find Full Text PDF

Transcriptome-Wide Association Study Identified Novel Blood Tissue Gene Biomarkers for Prostate Cancer Risk.

Prostate

January 2025

Cancer Epidemiology Division, Population Sciences in the Pacific Program, University of Hawaii Cancer Center, University of Hawaii at Manoa, Honolulu, Hawaii, USA.

Objective: A number of susceptibility genes in prostate tissue have been identified to be associated with prostate cancer (PCa) risk. However, the reported genes based on assessing prostate tissue could not fully explain PCa genetic susceptibility. It is believed that genes functioning in the immune system may fill in the gap of some missing heritability.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!