Global protein function annotation through mining genome-scale data in yeast Saccharomyces cerevisiae.

Nucleic Acids Res

UT-ORNL Graduate School of Genome Science and Technology, Oak Ridge, TN, USA.

Published: December 2004

As we are moving into the post genome-sequencing era, various high-throughput experimental techniques have been developed to characterize biological systems on the genomic scale. Discovering new biological knowledge from the high-throughput biological data is a major challenge to bioinformatics today. To address this challenge, we developed a Bayesian statistical method together with Boltzmann machine and simulated annealing for protein functional annotation in the yeast Saccharomyces cerevisiae through integrating various high-throughput biological data, including yeast two-hybrid data, protein complexes and microarray gene expression profiles. In our approach, we quantified the relationship between functional similarity and high-throughput data, and coded the relationship into 'functional linkage graph', where each node represents one protein and the weight of each edge is characterized by the Bayesian probability of function similarity between two proteins. We also integrated the evolution information and protein subcellular localization information into the prediction. Based on our method, 1802 out of 2280 unannotated proteins in yeast were assigned functions systematically.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC535686	PMC
http://dx.doi.org/10.1093/nar/gkh978	DOI Listing

Publication Analysis

Top Keywords

yeast saccharomyces

saccharomyces cerevisiae

high-throughput biological

biological data

data

global protein

protein function

function annotation

annotation mining

mining genome-scale

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!