Using pseudo-amino acid composition and support vector machine to predict protein structural class.

Chao Chen Yuan-Xin Tian Xiao-Yong Zou Pei-Xiang Cai Jin-Yuan Mo

J Theor Biol

School of Chemistry and Chemical Engineering, Sun Yat-Sen University, Guangzhou 510275, People's Republic of China.

Published: December 2006

The gap between known protein sequences and structural classes is increasing, highlighting the need for effective computational methods to predict protein structures.
A new predictor utilizing support vector machine learning and a novel pseudo-amino acid composition (PseAA) has been developed to improve prediction accuracy by considering sequence-order effects.
Results from testing on 204 proteins show promising outcomes, suggesting that this new method could significantly enhance current prediction systems.

As a result of genome and other sequencing projects, the gap between the number of known protein sequences and the number of known protein structural classes is widening rapidly. In order to narrow this gap, it is vitally important to develop a computational prediction method for fast and accurately determining the protein structural class. In this paper, a novel predictor is developed for predicting protein structural class. It is featured by employing a support vector machine learning system and using a different pseudo-amino acid composition (PseAA), which was introduced to, to some extent, take into account the sequence-order effects to represent protein samples. As a demonstration, the jackknife cross-validation test was performed on a working dataset that contains 204 non-homologous proteins. The predicted results are very encouraging, indicating that the current predictor featured with the PseAA may play an important complementary role to the elegant covariant discriminant predictor and other existing algorithms.

Download full-text PDF	Source
http://dx.doi.org/10.1016/j.jtbi.2006.06.025	DOI Listing

Publication Analysis

Top Keywords

protein structural

structural class

pseudo-amino acid

acid composition

support vector

vector machine

number protein

protein

composition support

machine predict

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!