Blind assessment of monomeric AlphaFold2 protein structure models with experimental NMR data.

J Magn Reson

Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY 12180, USA. Electronic address:

Published: July 2023

AI Article Synopsis

  • Advances in molecular modeling, particularly through AlphaFold-2 (AF2) from DeepMind, are revolutionizing structural biology by providing highly accurate protein structure predictions using AI.
  • The study specifically tested AF2's ability to model small, monomeric proteins that were not part of its training data, using nine open-source NMR datasets.
  • Results showed that AF2's predictions often matched or exceeded the fit of existing NMR structure models, highlighting its potential as a valuable tool for protein structure analysis and hypothesis generation in research.

Article Abstract

Recent advances in molecular modeling of protein structures are changing the field of structural biology. AlphaFold-2 (AF2), an AI system developed by DeepMind, Inc., utilizes attention-based deep learning to predict models of protein structures with high accuracy relative to structures determined by X-ray crystallography and cryo-electron microscopy (cryoEM). Comparing AF2 models to structures determined using solution NMR data, both high similarities and distinct differences have been observed. Since AF2 was trained on X-ray crystal and cryoEM structures, we assessed how accurately AF2 can model small, monomeric, solution protein NMR structures which (i) were not used in the AF2 training data set, and (ii) did not have homologous structures in the Protein Data Bank at the time of AF2 training. We identified nine open-source protein NMR data sets for such "blind" targets, including chemical shift, raw NMR FID data, NOESY peak lists, and (for 1 case) N-H residual dipolar coupling data. For these nine small (70-108 residues) monomeric proteins, we generated AF2 prediction models and assessed how well these models fit to these experimental NMR data, using several well-established NMR structure validation tools. In most of these cases, the AF2 models fit the NMR data nearly as well, or sometimes better than, the corresponding NMR structure models previously deposited in the Protein Data Bank. These results provide benchmark NMR data for assessing new NMR data analysis and protein structure prediction methods. They also document the potential for using AF2 as a guiding tool in protein NMR data analysis, and more generally for hypothesis generation in structural biology research.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10659763PMC
http://dx.doi.org/10.1016/j.jmr.2023.107481DOI Listing

Publication Analysis

Top Keywords

nmr data
32
data
13
nmr
12
protein nmr
12
protein
9
af2
9
protein structure
8
structure models
8
experimental nmr
8
protein structures
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!