Prediction of protein structure classes by incorporating different protein descriptors into general Chou's pseudo amino acid composition.

Loris Nanni Sheryl Brahnam Alessandra Lumini

J Theor Biol

DISI, Università di Bologna, Via Venezia 52, 47521 Cesena, Italy. Electronic address:

Published: November 2014

Successful protein structure identification enables researchers to estimate the biological functions of proteins, yet it remains a challenging problem. The most common method for determining an unknown protein's structural class is to perform expensive and time-consuming manual experiments. Because of the availability of amino acid sequences generated in the post-genomic age, it is possible to predict an unknown protein's structural class using machine learning methods given a protein's amino-acid sequence and/or its secondary structural elements. Following recent research in this area, we propose a new machine learning system that is based on combining several protein descriptors extracted from different protein representations, such as position specific scoring matrix (PSSM), the amino-acid sequence, and secondary structural sequences. The prediction engine of our system is operated by an ensemble of support vector machines (SVMs), where each SVM is trained on a different descriptor. The results of each SVM are combined by sum rule. Our final ensemble produces a success rate that is substantially better than previously reported results on three well-established datasets. The MATLAB code and datasets used in our experiments are freely available for future comparison at http://www.dei.unipd.it/node/2357.

Download full-text PDF	Source
http://dx.doi.org/10.1016/j.jtbi.2014.07.003	DOI Listing

Publication Analysis

Top Keywords

protein structure

protein descriptors

amino acid

unknown protein's

protein's structural

structural class

machine learning

amino-acid sequence

secondary structural

prediction protein

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!