Inference of gene pathways using mixture Bayesian networks.

BMC Syst Biol

Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana-Champaign, IL 61801, USA.

Published: May 2009

Background: Inference of gene networks typically relies on measurements across a wide range of conditions or treatments. Although one network structure is predicted, the relationship between genes could vary across conditions. A comprehensive approach to infer general and condition-dependent gene networks was evaluated. This approach integrated Bayesian network and Gaussian mixture models to describe continuous microarray gene expression measurements, and three gene networks were predicted.

Results: The first reconstructions of a circadian rhythm pathway in honey bees and an adherens junction pathway in mouse embryos were obtained. In addition, general and condition-specific gene relationships, some unexpected, were detected in these two pathways and in a yeast cell-cycle pathway. The mixture Bayesian network approach identified all (honey bee circadian rhythm and mouse adherens junction pathways) or the vast majority (yeast cell-cycle pathway) of the gene relationships reported in empirical studies. Findings across the three pathways and data sets indicate that the mixture Bayesian network approach is well-suited to infer gene pathways based on microarray data. Furthermore, the interpretation of model estimates provided a broader understanding of the relationships between genes. The mixture models offered a comprehensive description of the relationships among genes in complex biological processes or across a wide range of conditions. The mixture parameter estimates and corresponding odds that the gene network inferred for a sample pertained to each mixture component allowed the uncovering of both general and condition-dependent gene relationships and patterns of expression.

Conclusion: This study demonstrated the two main benefits of learning gene pathways using mixture Bayesian networks. First, the identification of the optimal number of mixture components supported by the data offered a robust approach to infer gene relationships and estimate gene expression profiles. Second, the classification of conditions and observations into groups that support particular mixture components helped to uncover both gene relationships that are unique or common across conditions. Results from the application of mixture Bayesian networks substantially augmented the understanding of gene networks and demonstrated the added-value of this methodology to infer gene networks.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2701418PMC
http://dx.doi.org/10.1186/1752-0509-3-54DOI Listing

Publication Analysis

Top Keywords

mixture bayesian
20
gene networks
20
gene relationships
20
gene
15
gene pathways
12
bayesian networks
12
bayesian network
12
infer gene
12
mixture
11
inference gene
8

Similar Publications

Much research in the behavioral sciences aims to characterize the "typical" person. A statistically significant group-averaged effect size is often interpreted as evidence that the typical person shows an effect, but that is only true under certain distributional assumptions for which explicit evidence is rarely presented. Mean effect size varies with both within-participant effect size and population prevalence (proportion of population showing effect).

View Article and Find Full Text PDF

Background: Phthalate exposure has been hypothesized to influence cholesterol metabolism and gallstone pathogenesis, but previous studies are limited. We aimed to examine the associations between urinary phthalate metabolites and prevalence of gallstone disease in a nationally representative sample.

Methods: We analyzed data on 1,696 adults aged ≥ 30 years from the National Health and Nutrition Examination Survey (NHANES) 2017-2018.

View Article and Find Full Text PDF

Background: Cancer is a complex set of diseases, and many have decades-long lag times between possible exposure and diagnosis. Environmental exposures, such as per- and poly-fluoroalkyl substances (PFAS) and area-level risk factors (e.g.

View Article and Find Full Text PDF

Background: U.S. Immigration and Customs Enforcement (ICE) facilities had high rates of COVID-19 infections and mortality during the global pandemic.

View Article and Find Full Text PDF

Biomarkers are measured repeatedly in clinical studies until a pre-defined endpoint, such as death from certain causes, is reached. Such repeated measurements may present a dynamic process for understanding when to expect the study's endpoint. Joint modelling is often employed to handle such a model.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!