Accurate prediction of chemical shifts for aqueous protein structure on "Real World" data.

Jie Li Kochise C Bennett Yuchen Liu Michael V Martin Teresa Head-Gordon

Chem Sci

Pitzer Center for Theoretical Chemistry, University of California Berkeley CA 94720 USA

Published: March 2020

Here we report a new machine learning algorithm for protein chemical shift prediction that outperforms existing chemical shift calculators on realistic data that is not heavily curated, nor eliminates test predictions . Our UCBShift predictor implements two modules: a transfer prediction module that employs both sequence and structural alignment to select reference candidates for experimental chemical shift replication, and a redesigned machine learning module based on random forest regression which utilizes more, and more carefully curated, feature extracted data. When combined together, this new predictor achieves state-of-the-art accuracy for predicting chemical shifts on a randomly selected dataset without careful curation, with root-mean-square errors of 0.31 ppm for amide hydrogens, 0.19 ppm for Hα, 0.84 ppm for C', 0.81 ppm for Cα, 1.00 ppm for Cβ, and 1.81 ppm for N. When similar sequences or structurally related proteins are available, UCBShift shows superior native state selection from misfolded decoy sets compared to SPARTA+ and SHIFTX2, and even without homology we exceed current prediction accuracy of all other popular chemical shift predictors.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8152569	PMC
http://dx.doi.org/10.1039/c9sc06561j	DOI Listing

Publication Analysis

Top Keywords

chemical shift

chemical shifts

machine learning

chemical

ppm

accurate prediction

prediction chemical

shifts aqueous

aqueous protein

protein structure

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!