Prediction of protein folds: extraction of new features, dimensionality reduction, and fusion of heterogeneous classifiers.

IEEE Trans Nanobioscience

Praxis Softek Solutions Pvt. Ltd., Kolkata 700091, India.

Published: March 2009

Here, we consider a two-level (four classes in level 1 and 27 folds in level 2) protein fold determination problem. We propose several new features and use some existing features including frequencies of adjacent residues, frequencies of residues separated by one residue, and triplets (trio) of amino acid compositions (AACs). The dimensionality of the trio AAC features is drastically reduced using a neural network based novel online feature selection scheme. We also propose new sets of features called trio potential computed using the hydrophobicity values considering only the selected trio AACs. We demonstrate that the proposed features including the selected trio AACs and trio potential have good discriminating power for protein fold determination. As machine learning tools, we use multilayer perceptron network, radial basis function network, and support vector machine. To improve the recognition accuracies further, we use fusion of different classifiers using the same set of features as well as different sets of features. The effectiveness of our schemes is demonstrated with a benchmark structural classification of proteins (SCOP) dataset. Our system achieves 84.9% test accuracy for the SCOP structural class (four classes) determination and 68.6% test accuracy for the fold recognition with 27 folds. In order to demonstrate the consistency of feature sets and fusion schemes, we also perform the fivefold cross-validation experiments.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TNB.2009.2016488DOI Listing

Publication Analysis

Top Keywords

features
8
protein fold
8
fold determination
8
features including
8
sets features
8
trio potential
8
selected trio
8
trio aacs
8
test accuracy
8
trio
6

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!