Background: Accurate prediction of the inter-residue contacts of a protein is important for calculating its tertiary structure. Analysis of co-evolutionary events among residues has proven effective in inferring inter-residue contacts. The Markov random field (MRF) technique, although widely used for contact prediction, suffers from the following dilemma: the actual likelihood function of an MRF is accurate but time-consuming to calculate, whereas approximations to the actual likelihood, such as pseudo-likelihood, are efficient to calculate but inaccurate. Thus, achieving both accuracy and efficiency simultaneously remains a challenge.

Results: In this study, we present such an approach (called clmDCA) for contact prediction. Unlike plmDCA, which uses pseudo-likelihood, i.e., the product of the conditional probabilities of individual residues, our approach uses composite likelihood, i.e., the product of the conditional probabilities of all residue pairs. Composite likelihood has been theoretically proven to be a better approximation to the actual likelihood function than pseudo-likelihood. Meanwhile, composite likelihood is still efficient to maximize, thus ensuring the efficiency of clmDCA. We present comprehensive experiments on popular benchmark datasets, including the PSICOV and CASP-11 datasets, to show that: i) clmDCA alone outperforms the existing MRF-based approaches in prediction accuracy; ii) when equipped with a deep learning technique for refinement, the prediction accuracy of clmDCA is further improved significantly, suggesting the suitability of clmDCA for subsequent refinement procedures. We further present a successful application of the predicted contacts to accurately build tertiary structures for proteins in the PSICOV dataset.
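To make the distinction between the three objectives concrete, here is a minimal sketch on a toy Potts-style MRF. It is an illustration only, not the clmDCA implementation: the parameter names `h` (per-site fields) and `J` (pairwise couplings), the function names, and the brute-force enumeration are all assumptions chosen for clarity, and enumeration is feasible only for very small models.

```python
import itertools
import numpy as np


def log_potential(x, h, J):
    """Unnormalized log-probability: site fields plus couplings over pairs i < j."""
    L = len(x)
    score = sum(h[i, x[i]] for i in range(L))
    score += sum(J[i, j, x[i], x[j]] for i in range(L) for j in range(i + 1, L))
    return score


def exact_log_likelihood(x, h, J):
    """Actual log-likelihood; the partition function sums over all q**L sequences."""
    L, q = h.shape
    scores = [log_potential(y, h, J) for y in itertools.product(range(q), repeat=L)]
    return log_potential(x, h, J) - np.logaddexp.reduce(scores)


def conditional_log_prob(x, sites, h, J):
    """log P(x[sites] | all other positions), normalized over the chosen sites only."""
    q = h.shape[1]
    scores = []
    for values in itertools.product(range(q), repeat=len(sites)):
        y = list(x)
        for site, value in zip(sites, values):
            y[site] = value
        scores.append(log_potential(y, h, J))
    return log_potential(x, h, J) - np.logaddexp.reduce(scores)


def pseudo_log_likelihood(x, h, J):
    """Sum of single-residue conditionals (the pseudo-likelihood objective)."""
    return sum(conditional_log_prob(x, (i,), h, J) for i in range(len(x)))


def composite_log_likelihood(x, h, J):
    """Sum of residue-pair conditionals (the pairwise composite-likelihood objective)."""
    L = len(x)
    return sum(
        conditional_log_prob(x, (i, j), h, J)
        for i in range(L)
        for j in range(i + 1, L)
    )


# Toy example with hypothetical random parameters: 4 positions, 3 states each.
rng = np.random.default_rng(0)
h = rng.normal(size=(4, 3))
J = rng.normal(size=(4, 4, 3, 3))  # only entries with i < j are used
x = (0, 2, 1, 1)
print("exact    :", exact_log_likelihood(x, h, J))
print("pseudo   :", pseudo_log_likelihood(x, h, J))
print("composite:", composite_log_likelihood(x, h, J))
```

In this sketch, each single-residue conditional normalizes over q states and each pair conditional over q² states, while the exact likelihood normalizes over q^L sequences, which is why the two approximations stay tractable as the sequence length grows.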

Conclusions: The composite likelihood maximization algorithm can efficiently estimate the parameters of Markov random fields and can improve the prediction accuracy of protein inter-residue contacts.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6821021
DOI: http://dx.doi.org/10.1186/s12859-019-3051-7

Publication Analysis

Top Keywords

inter-residue contacts: 16
composite likelihood: 16
actual likelihood: 12
prediction accuracy: 12
protein inter-residue: 8
likelihood maximization: 8
deep learning: 8
markov random: 8
contact prediction: 8
likelihood function: 8

Similar Publications

Motivation: Characterizing the structure of flexible proteins, particularly within the realm of intrinsic disorder, presents a formidable challenge due to their high conformational variability. Currently, their structural representation relies on (possibly large) conformational ensembles derived from a combination of experimental and computational methods. The detailed structural analysis of these ensembles is a difficult task, for which existing tools have limited effectiveness.

We have recently shown how physically realizable protein-folding pathways can be generated using directed walks in the space of inter-residue contact-maps; combined with a back-transformation to move from protein contact-maps to Cartesian coordinates, we have demonstrated how this approach can generate protein-folding trajectory ensembles without recourse to molecular dynamics. In this article, we demonstrate that this framework can be used to study a challenging protein-folding problem that is known to exhibit two different folding paths which were previously identified through molecular dynamics simulation at several different temperatures. From the viewpoint of protein-folding mechanism prediction, this particular problem is extremely challenging to address, specifically involving folding to an identical nontrivial compact native structure along distinct pathways defined by heterogeneous secondary structural elements.

Thermal Energy Transport through Nonbonded Native Contacts in Protein.

J Phys Chem B

September 2024

Graduate School of Science, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8602, Japan.

Within the protein interior, where we observe various types of interactions, nonuniform flow of thermal energy occurs along the polypeptide chain and through nonbonded native contacts, leading to inhomogeneous transport efficiencies from one site to another. The folded native protein serves not merely as a thermal transfer medium but, more importantly, as a sophisticated molecular nanomachine in cells. Therefore, we are particularly interested in what sort of "communication" is mediated through native contacts in the folded proteins and how such features are quantitatively depicted in terms of local transport coefficients of heat currents.

Accurate estimation of the normalized mutual information of multidimensional data.

J Chem Phys

August 2024

Biomolecular Dynamics, Institute of Physics, University of Freiburg, 79104 Freiburg, Germany.

While the linear Pearson correlation coefficient represents a well-established normalized measure to quantify the inter-relation of two stochastic variables X and Y, it fails for multidimensional variables, such as Cartesian coordinates. Avoiding any assumption about the underlying data, the mutual information I(X, Y) does account for multidimensional correlations. However, unlike the normalized Pearson correlation, it has no upper bound (I ∈ [0, ∞)), i.

Decoding allostery at the atomic level is essential for understanding the relationship between a protein's sequence, structure, and dynamics. Recently, we have shown that decomposing temperature responses of inter-residue contacts can reveal allosteric couplings and provide useful insight into the functional dynamics of proteins. The details of this Chemically Accurate Contact Response Analysis (ChACRA) are presented here along with its application to two well-known allosteric proteins.
