Detecting distant-homology protein structures by aligning deep neural-network based contact maps.

PLoS Comput Biol

Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, United States of America.

Published: October 2019

Accurate prediction of atomic-level protein structure is important for annotating the biological functions of protein molecules and for designing new compounds to regulate those functions. Template-based modeling (TBM), which aims to construct structural models by copying and refining the structural frameworks of other known proteins, remains the most accurate method for protein structure prediction. Due to the difficulty in recognizing distant-homology templates, however, the accuracy of TBM decreases rapidly when the evolutionary relationship between the query and template vanishes. In this study, we propose a new method, CEthreader, which first predicts residue-residue contacts by coupling evolutionary precision matrices with deep residual convolutional neural networks. The predicted contact maps are then integrated with sequence profile alignments to recognize structural templates from the PDB. The method was tested on two independent benchmark sets consisting collectively of 1,153 non-homologous protein targets, where CEthreader detected 176% or 36% more correct templates with a TM-score >0.5 than the best state-of-the-art profile- or contact-based threading methods, respectively, for the Hard targets that lacked homologous templates. Moreover, CEthreader was able to identify 114% or 20% more correct templates with the same Fold as the query, after excluding structures from the same SCOPe Superfamily, than the best profile- or contact-based threading methods, respectively. Detailed analyses show that the major advantage of CEthreader lies in the efficient coupling of contact maps with profile alignments, which helps recognize the global fold of protein structures when the homologous relationship between the query and template is weak. These results demonstrate an efficient new strategy to combine ab initio contact map prediction with profile alignments to significantly improve the accuracy of template-based structure prediction, especially for distant-homology proteins.
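
The abstract counts a template as correct when its TM-score exceeds 0.5. As a rough illustration of what that threshold measures, the sketch below evaluates the standard TM-score formula for a set of aligned residue-pair distances. It is not CEthreader code: the function names are hypothetical, the distances are assumed to come from an already-computed superposition (the full TM-score additionally maximizes over superpositions), and the 0.5 Å floor on d0 for very short chains is an assumption borrowed from common implementations.

```python
def tm_d0(target_length):
    """Length-dependent distance scale d0 (angstroms) in the TM-score formula.

    The 0.5 A floor for very short chains is an assumption taken from
    common implementations, not from the abstract.
    """
    if target_length <= 15:
        return 0.5
    return max(0.5, 1.24 * (target_length - 15) ** (1.0 / 3.0) - 1.8)


def tm_score_from_distances(distances, target_length):
    """TM-score for one fixed superposition.

    distances: distances (angstroms) between aligned residue pairs.
    target_length: length of the query (target) protein used for normalization.
    """
    d0 = tm_d0(target_length)
    total = sum(1.0 / (1.0 + (d / d0) ** 2) for d in distances)
    return total / target_length


def is_correct_template(score, threshold=0.5):
    """Apply the TM-score > 0.5 cutoff quoted in the abstract."""
    return score > threshold


# Hypothetical example: a 100-residue query with 80 aligned pairs, each 2.0 A apart.
example_score = tm_score_from_distances([2.0] * 80, 100)
print(round(example_score, 3), is_correct_template(example_score))
```

Because the sum is divided by the query length rather than the number of aligned residues, a template that covers only part of the query is penalized even when the aligned region superimposes well.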

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6818797
DOI: http://dx.doi.org/10.1371/journal.pcbi.1007411

Publication Analysis

Top Keywords

contact maps: 12
profile alignments: 12
protein structures: 8
protein structure: 8
structure prediction: 8
relationship query: 8
query template: 8
correct templates: 8
profile- contact-based: 8
contact-based threading: 8

Similar Publications

Recipes and ingredients for deep learning models of 3D genome folding.

Curr Opin Genet Dev

January 2025

Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA.

Three-dimensional genome folding plays roles in gene regulation and disease. In this review, we compare and contrast recent deep learning models for predicting genome contact maps. We survey preprocessing, architecture, training, evaluation, and interpretation methods, highlighting the capabilities and limitations of different models.

Background: Cadaverine and hydrocinnamic acid are frequent metabolites in inflamed periodontal areas. Their role as metabolites in plant growth inhibition has been established, but their relevance in humans has yet to be determined. Moreover, vascular endothelial growth factor (VEGF) is a consistent growth factor in neo-angiogenesis in periodontal regeneration.

During the blood coagulation cascade, coagulation factor VIII (FVIII) is activated by thrombin to form activated factor VIII (FVIIIa). FVIIIa associates with platelet surfaces at the site of vascular damage to form an intrinsic tenase complex with activated factor IX. A working model for FVIII membrane binding involves the association of positively charged FVIII residues with negatively charged lipid headgroups and the burial of hydrophobic residues into the membrane interior.

The terahertz (THz) security scanner offers advantages such as non-contact inspection and the ability to detect various types of dangerous goods, playing an important role in preventing terrorist attacks. We aim to accurately and quickly detect concealed objects in THz security images. However, current object detection algorithms face many challenges when applied to THz images.

Serious issues with cryo-EM structures of human prothrombinase.

Open Biol

January 2025

Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, The Keith Peters Building, Hills Road , Cambridge CB2 0XY, UK.

Thrombin is generated from prothrombin through sequential cleavage at two sites by the enzyme complex prothrombinase, composed of a serine protease, factor (f) Xa and a cofactor, fVa, on phospholipid membranes. In a recent paper published in , Ruben . (Ruben .