Application of data mining tools for classification of protein structural class from residue based averaged NMR chemical shifts.

Biochim Biophys Acta

Department of Chemistry, California State University, Fresno, CA 93740, United States; Department of Pathology and Laboratory Medicine, School of Medicine, University of California, Davis, CA 95616, United States. Electronic address:

Published: October 2015

The number of protein sequences deriving from genome sequencing projects is outpacing our knowledge about the function of these proteins. With the gap between experimentally characterized and uncharacterized proteins continuing to widen, it is necessary to develop new computational methods and tools for protein structural information that is directly related to function. Nuclear magnetic resonance (NMR) provides powerful means to determine three-dimensional structures of proteins in the solution state. However, translation of the NMR spectral parameters to even low-resolution structural information such as protein class requires multiple time consuming steps. In this paper, we present an unorthodox method to predict the protein structural class directly by using the residue's averaged chemical shifts (ACS) based on machine learning algorithms. Experimental chemical shift information from 1491 proteins obtained from Biological Magnetic Resonance Bank (BMRB) and their respective protein structural classes derived from structural classification of proteins (SCOP) were used to construct a data set with 119 attributes and 5 different classes. Twenty four different classification schemes were evaluated using several performance measures. Overall the residue based ACS values can predict the protein structural classes with 80% accuracy measured by Matthew correlation coefficient. Specifically protein classes defined by mixed αβ or small proteins are classified with >90% correlation. Our results indicate that this NMR-based method can be utilized as a low-resolution tool for protein structural class identification without any prior chemical shift assignments.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4547871PMC
http://dx.doi.org/10.1016/j.bbapap.2015.02.016DOI Listing

Publication Analysis

Top Keywords

protein structural
24
structural class
12
protein
9
structural
8
residue based
8
chemical shifts
8
magnetic resonance
8
predict protein
8
chemical shift
8
structural classes
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!