Deriving transcriptional programs and functional processes from gene expression databases.

Bioinformatics

Department of Integrative Biology and Pharmacology, The University of Texas Health Science Center in Houston, Houston, TX 77030, USA.

Published: April 2012

Motivation: A system-wide approach to revealing the underlying molecular state of a cell is a long-standing biological challenge. Developed over the last decade, gene expression profiles possess the characteristics of such an assay. They have the capacity to reveal both underlying molecular events as well as broader phenotypes such as clinical outcomes. To interpret these profiles, many gene sets have been developed that characterize biological processes. However, the full potential of these gene sets has not yet been achieved. Since the advent of gene expression databases, many have posited that they can reveal properties of activities that are not evident from individual datasets, analogous to how the expression of a single gene generally cannot reveal the activation of a biological process.

Results: To address this issue, we have developed a high-throughput method to mine gene expression databases for the regulation of gene sets. Given a set of genes, we scored it against each gene expression dataset by looking for enrichment of co-regulated genes relative to an empirical null distribution. After validating the method, we applied it to address two biological problems. First, we deciphered the E2F transcriptional network. We confirmed that true transcriptional targets exhibit a distinct regulatory profile across a database. Second, we leveraged the patterns of regulation across a database of gene sets to produce an automatically generated catalog of biological processes. These demonstrations revealed the power of a global analysis of the data contained within gene expression databases, and the potential for using them to address biological questions.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3324522PMC
http://dx.doi.org/10.1093/bioinformatics/bts112DOI Listing

Publication Analysis

Top Keywords

gene expression
24
expression databases
16
gene sets
16
gene
11
underlying molecular
8
biological processes
8
address biological
8
expression
7
biological
6
deriving transcriptional
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!