Automatic Inference of Sequence from Low-Resolution Crystallographic Data.

Structure

Department of Biological Chemistry, The Hebrew University of Jerusalem, Jerusalem 91904, Israel. Electronic address:

Published: November 2018

AI Article Synopsis

  • At resolutions worse than 3.5 Å, it becomes challenging to accurately place protein sequences along a backbone due to weak or absent electron density for side chains.
  • A new fully automated computational approach is introduced, which leverages reciprocal-space indicators to identify erroneous sequence assignments at low resolution.
  • The method successfully assigns amino acid types to a significant portion of backbone positions, achieving 15%, 13%, and 9% accuracy for different resolution datasets, while also maintaining high threading accuracy for sequences compared to deposited PDB structures.

Article Abstract

At resolutions worse than 3.5 Å, the electron density is weak or nonexistent at the locations of the side chains. Consequently, the assignment of the protein sequences to their correct positions along the backbone is a difficult problem. In this work, we propose a fully automated computational approach to assign sequence at low resolution. It is based on our surprising observation that standard reciprocal-space indicators, such as the initial unrefined R value, are sensitive enough to detect an erroneous sequence assignment of even a single backbone position. Our approach correctly determines the amino acid type for 15%, 13%, and 9% of the backbone positions in crystallographic datasets with resolutions of 4.0 Å, 4.5 Å, and 5.0 Å, respectively. We implement these findings in an application for threading a sequence onto a backbone structure. For the three resolution ranges, the application threads 83%, 81%, and 64% of the sequences exactly as in the deposited PDB structures.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6221995PMC
http://dx.doi.org/10.1016/j.str.2018.08.011DOI Listing

Publication Analysis

Top Keywords

automatic inference
4
sequence
4
inference sequence
4
sequence low-resolution
4
low-resolution crystallographic
4
crystallographic data
4
data resolutions
4
resolutions worse
4
worse 35 Å
4
35 Å electron
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!