Background: With the huge amount of uncharacterized protein sequences generated in the post-genomic age, it is highly desirable to develop effective computational methods for quickly and accurately predicting their functions. The information thus obtained would be very useful for both basic research and drug development in a timely manner.

Methodology/principal Findings: Although many efforts have been made in this regard, most of them were based on either sequence similarity or protein-protein interaction (PPI) information. However, the former often fails to work if a query protein has no or very little sequence similarity to any function-known proteins, while the latter had similar problem if the relevant PPI information is not available. In view of this, a new approach is proposed by hybridizing the PPI information and the biochemical/physicochemical features of protein sequences. The overall first-order success rates by the new predictor for the functions of mouse proteins on training set and test set were 69.1% and 70.2%, respectively, and the success rate covered by the results of the top-4 order from a total of 24 orders was 65.2%.

Conclusions/significance: The results indicate that the new approach is quite promising that may open a new avenue or direction for addressing the difficult and complicated problem.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3023709PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0014556PLOS

Publication Analysis

Top Keywords

predicting functions
8
protein-protein interaction
8
protein sequences
8
sequence similarity
8
functions proteins
4
proteins mouse
4
mouse based
4
based weighted
4
weighted protein-protein
4
interaction network
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!