A novel feature extraction scheme for prediction of protein-protein interaction sites.

Mol Biosyst

Key Laboratory of Intelligent Computing & Signal Processing, Ministry of Education, Anhui University, Anhui, China.

Published: February 2015

Identifying protein-protein interaction (PPI) sites is an important and challenging problem in biology. Although many methods have been proposed, the problem is still far from solved. Here, a feature selection approach called DX-RF is proposed, which combines a sliding window of size 11 with the random forest algorithm. The method achieves an accuracy of 88.79%, a recall of 82.09%, and a precision of 85.76% with the 34 top-ranked features on the Hetero test dataset, and an accuracy of 91.6%, a precision of 89.2%, and a recall of 83.54% with the 25 top-ranked features on the Homo test dataset. Compared with other methods, these results indicate that DX-RF is effective at selecting relevant features and thereby achieves higher performance. In addition, feature analysis is performed to further understand protein interactions.
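
To illustrate the sliding-window idea mentioned in the abstract, here is a minimal Python sketch of how per-position feature vectors might be gathered into one row per window of size 11. It is not the authors' implementation. The function name `window_features`, the two toy features, the reading of window positions as residues, and the zero padding at the chain ends are all assumptions made for illustration, since the abstract does not describe these details.

```python
from typing import List, Sequence

WINDOW = 11  # window size stated in the abstract


def window_features(per_residue: Sequence[Sequence[float]],
                    window: int = WINDOW) -> List[List[float]]:
    """Build one row per residue from the features of its window.

    Each row concatenates the feature vectors of the central residue and
    its (window - 1) / 2 neighbours on each side. Positions that fall
    outside the chain are zero-padded (an assumption; the abstract does
    not say how chain ends are handled).
    """
    if window % 2 == 0:
        raise ValueError("window must be odd so it can be centred on a residue")
    if not per_residue:
        return []
    half = window // 2
    dim = len(per_residue[0])
    pad = [0.0] * dim
    rows: List[List[float]] = []
    for i in range(len(per_residue)):
        row: List[float] = []
        for j in range(i - half, i + half + 1):
            if 0 <= j < len(per_residue):
                row.extend(per_residue[j])
            else:
                row.extend(pad)  # position lies beyond a chain end
        rows.append(row)
    return rows


# Toy chain: 5 residues with 2 hypothetical features each.
toy = [[float(i + 1), float(i + 1) * 10.0] for i in range(5)]
X = window_features(toy)
```

Each row of `X` has 11 × 2 = 22 values, with the central residue's features in the middle of the row. Rows like these could then be passed to a random forest classifier whose feature importances are used to rank and keep the top features, which is the general pattern the abstract describes.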

Source
http://dx.doi.org/10.1039/c4mb00625a
