Learning regulatory programs that accurately predict differential expression with MEDUSA.

Ann N Y Acad Sci

Department of Computer Science, Center for Computational Learning Systems, Columbia University, New York, NY 10065, USA.

Published: December 2007

Inferring gene regulatory networks from high-throughput genomic data is one of the central problems in computational biology. In this paper, we describe a predictive modeling approach for studying regulatory networks, based on a machine learning algorithm called MEDUSA. MEDUSA integrates promoter sequence, mRNA expression, and transcription factor occupancy data to learn gene regulatory programs that predict the differential expression of target genes. Instead of using clustering or correlation of expression profiles to infer regulatory relationships, MEDUSA determines condition-specific regulators and discovers regulatory motifs that mediate the regulation of target genes. In this way, MEDUSA meaningfully models biological mechanisms of transcriptional regulation. MEDUSA solves the problem of predicting the differential (up/down) expression of target genes by using boosting, a technique from statistical learning, which helps to avoid overfitting as the algorithm searches through the high-dimensional space of potential regulators and sequence motifs. Experimental results demonstrate that MEDUSA achieves high prediction accuracy on held-out experiments (test data), that is, data not seen in training. We also present context-specific analysis of MEDUSA regulatory programs for DNA damage and hypoxia, demonstrating that MEDUSA identifies key regulators and motifs in these processes. A central challenge in the field is the difficulty of validating reverse-engineered networks in the absence of a gold standard. Our approach of learning regulatory programs provides at least a partial solution for the problem: MEDUSA's prediction accuracy on held-out data gives a concrete and statistically sound way to validate how well the algorithm performs. With MEDUSA, statistical validation becomes a prerequisite for hypothesis generation and network building rather than a secondary consideration.
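The core idea in the abstract, boosting over rules that pair a promoter motif with a regulator's expression state and then scoring accuracy on held-out experiments, can be illustrated with a toy sketch. This is not the authors' implementation: it swaps in plain confidence-rated boosting with abstaining single-feature rules, and the data are synthetic. The names `make_examples`, `is_active`, `train`, and `predict`, the three-motif/three-regulator layout, and the ground-truth rule (motif 0 plus regulator 0 drives expression) are all assumptions made for illustration.

```python
import math
from itertools import product

EPS = 1e-6  # smoothing so the log-ratio stays finite when one side has zero weight


def make_examples(genes, experiments):
    """Toy ground truth (an assumption): a gene carrying motif 0 follows regulator 0.
    Pairs with no differential expression are dropped; only up (+1) / down (-1) remain."""
    examples = []
    for motifs in genes:
        for regs in experiments:
            if motifs[0] == 1 and regs[0] != 0:
                examples.append((motifs, regs, regs[0]))
    return examples


def is_active(feature, motifs, regs):
    """A rule fires when motif m is in the promoter AND regulator r is in state s."""
    m, r, s = feature
    return motifs[m] == 1 and regs[r] == s


def train(examples, features, rounds=4):
    """Boosting with abstaining rules: each round adds the rule that minimizes
    Z = W_abstain + 2*sqrt(W_up * W_down), with vote alpha = 0.5*ln(W_up / W_down)."""
    n = len(examples)
    weights = [1.0 / n] * n
    model = []
    for _ in range(rounds):
        best = None
        for f in features:
            w_up = w_down = 0.0
            for w, (motifs, regs, y) in zip(weights, examples):
                if is_active(f, motifs, regs):
                    if y == 1:
                        w_up += w
                    else:
                        w_down += w
            z = (1.0 - w_up - w_down) + 2.0 * math.sqrt(w_up * w_down)
            if best is None or z < best[0]:
                alpha = 0.5 * math.log((w_up + EPS) / (w_down + EPS))
                best = (z, f, alpha)
        _, f, alpha = best
        model.append((f, alpha))
        # Down-weight examples the new rule gets right, up-weight those it gets wrong.
        weights = [
            w * math.exp(-y * alpha) if is_active(f, motifs, regs) else w
            for w, (motifs, regs, y) in zip(weights, examples)
        ]
        total = sum(weights)
        weights = [w / total for w in weights]
    return model


def predict(model, motifs, regs):
    """Sum the votes of all rules that fire; the sign gives up (+1) or down (-1)."""
    score = sum(alpha for f, alpha in model if is_active(f, motifs, regs))
    return 1 if score >= 0 else -1


# Synthetic data: every motif combination (genes) and every regulator-state combination.
genes = list(product((0, 1), repeat=3))
experiments = list(product((-1, 0, 1), repeat=3))

# Hold out whole experiments, so test conditions are never seen in training.
train_exps = [e for i, e in enumerate(experiments) if i % 4 != 0]
test_exps = [e for i, e in enumerate(experiments) if i % 4 == 0]

features = [(m, r, s) for m in range(3) for r in range(3) for s in (-1, 1)]
model = train(make_examples(genes, train_exps), features)

test_examples = make_examples(genes, test_exps)
correct = sum(predict(model, m, r) == y for m, r, y in test_examples)
accuracy = correct / len(test_examples)
print("learned rules (motif, regulator, state) -> vote:", model)
print("held-out accuracy:", accuracy)
```

Splitting by experiment rather than by individual gene-experiment pair mirrors the abstract's notion of held-out experiments: the model is scored only on conditions it never saw, which is what makes the accuracy figure usable as validation.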

Source
http://dx.doi.org/10.1196/annals.1407.020

Publication Analysis

Top Keywords

regulatory programs: 16
target genes: 12
medusa: 10
learning regulatory: 8
predict differential: 8
differential expression: 8
gene regulatory: 8
regulatory networks: 8
expression target: 8
prediction accuracy: 8

Similar Publications

Biophysical constraints limit the specificity with which transcription factors (TFs) can target regulatory DNA. While individual nontarget binding events may be low affinity, the sheer number of such interactions could present a challenge for gene regulation by degrading its precision or possibly leading to an erroneous induction state. Chromatin can prevent nontarget binding by rendering DNA physically inaccessible to TFs, at the cost of energy-consuming remodeling orchestrated by pioneer factors (PFs).

The homo-dodecameric ring-shaped RNA binding attenuation protein (TRAP) from binds up to twelve tryptophan ligands (Trp) and becomes activated to bind a specific sequence in the 5' leader region of the operon mRNA, thereby downregulating biosynthesis of Trp. Thermodynamic measurements of Trp binding have revealed a range of cooperative behavior for different TRAP variants, even if the averaged apparent affinities for Trp have been found to be similar. Proximity between the ligand binding sites, and the ligand-coupled disorder-to-order transition has implicated nearest-neighbor interactions in cooperativity.

VCP controls KCC2 degradation through FAF1 recruitment and accelerates emergence from anesthesia.

Proc Natl Acad Sci U S A

January 2025

Department of Medical Neuroscience, SUSTech Center for Pain Medicine, School of Medicine, Southern University of Science and Technology, Shenzhen 518055, China.

Ubiquitin-proteasomal degradation of K/Cl cotransporter 2 (KCC2) in the ventral posteromedial nucleus (VPM) has been demonstrated to serve as a common mechanism by which the brain emerges from anesthesia and regains consciousness. Ubiquitin-proteasomal degradation of KCC2 during anesthesia is driven by E3 ligase Fbxl4. However, the mechanism by which ubiquitinated KCC2 is targeted to the proteasome has not been elucidated.

Cell-Type Specific miRNA Regulatory Network Responses to ABA Stress Revealed by Time Series Transcriptional Atlases in Arabidopsis.

Adv Sci (Weinh)

January 2025

School of Advanced Agriculture Sciences and School of Life Sciences, State Key Laboratory of Protein and Plant Gene Research, Peking University, Beijing, 100871, China.

In plants, microRNAs (miRNAs) participate in complex gene regulatory networks together with transcription factors (TFs) in response to biotic and abiotic stresses. To date, analyses of miRNA-induced transcriptome remodeling have been carried out at the whole-plant or tissue level. Here, single-cell RNA-seq (scRNA-seq) of ABA-treated Arabidopsis is performed at three time points: early, middle, and late.

Bisphenol A (BPA) is an "environmental obesogen." This study investigates the intergenerational impacts of BPA-induced metabolic syndrome (MetS), focusing on the underlying mechanisms. Exposure to BPA induces metabolic disorders in paternal mice, which are then transmitted to offspring, leading to late-onset MetS. Mechanistically, BPA upregulates Srebf1, which in turn promotes the Pparg-dependent transcription of Dicer1 in spermatocytes, increasing the levels of multiple sperm microRNAs (miRNAs).
