Prediction of binding sites from sequence can significantly help toward determining the function of uncharacterized proteins on a genomic scale. The task is highly challenging due to the enormous amount of alternative candidate configurations. Previous research has only considered this prediction problem starting from 3D information. When starting from sequence alone, only methods that predict the bonding state of selected residues are available. The sole exception consists of pattern-based approaches, which rely on very specific motifs and cannot be applied to discover truly novel sites. We develop new algorithmic ideas based on structured-output learning for determining transition-metal-binding sites coordinated by cysteines and histidines. The inference step (retrieving the best scoring output) is intractable for general output types (i.e., general graphs). However, under the assumption that no residue can coordinate more than one metal ion, we prove that metal binding has the algebraic structure of a matroid, allowing us to employ a very efficient greedy algorithm. We test our predictor in a highly stringent setting where the training set consists of protein chains belonging to SCOP folds different from the ones used for accuracy estimation. In this setting, our predictor achieves 56 percent precision and 60 percent recall in the identification of ligand-ion bonds.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TCBB.2011.94DOI Listing

Publication Analysis

Top Keywords

predicting metal-binding
4
sites
4
metal-binding sites
4
sites protein
4
protein sequence
4
sequence prediction
4
prediction binding
4
binding sites
4
sites sequence
4
sequence help
4

Similar Publications

Precise binding free-energy predictions for ligands targeting metalloproteins, especially zinc-containing histone deacetylase (HDAC) enzymes, require specialized computational approaches due to the unique interactions at metal-binding sites. This study evaluates a docking algorithm optimized for zinc coordination to determine whether it could accurately differentiate between protonated and deprotonated states of hydroxamic acid ligands, a key functional group in HDAC inhibitors (HDACi). By systematically analyzing both protonation states, we sought to identify which state produces docking poses and binding energy estimates most closely aligned with experimental values.

View Article and Find Full Text PDF

Metal-Independent Correlations for Site-Specific Binding Energies of Relevant Catalytic Intermediates.

JACS Au

December 2024

SUNCAT Center for Interface Science and Catalysis, SLAC National Accelerator Laboratory, 2575 Sand Hill Road, Menlo Park, California 94025, United States.

Establishing energy correlations among different metals can accelerate the discovery of efficient and cost-effective catalysts for complex reactions. Using a recently introduced coordination-based model, we can predict site-specific metal binding energies (Δ ) that can be used as a descriptor for chemical reactions. In this study, we have examined a range of metals including Ag, Au, Co, Cu, Ir, Ni, Os, Pd, Pt, Rh, and Ru and found linear correlations between predicted Δ and adsorption energies of CH and O (Δ and Δ ) at various coordination environments for all the considered metals.

View Article and Find Full Text PDF

MetalNet2: an enhanced server for predicting metal-binding sites in proteomes.

Natl Sci Rev

December 2024

Beijing National Laboratory for Molecular Sciences, College of Chemistry and Molecular Engineering, Peking University, China.

View Article and Find Full Text PDF

The current "consensus" order in which amino acids were added to the genetic code is based on potentially biased criteria, such as the absence of sulfur-containing amino acids from the Urey-Miller experiment which lacked sulfur. More broadly, abiotic abundance might not reflect biotic abundance in the organisms in which the genetic code evolved. Here, we instead identify which protein domains date to the last universal common ancestor (LUCA) and then infer the order of recruitment from deviations of their ancestrally reconstructed amino acid frequencies from the still-ancient post-LUCA controls.

View Article and Find Full Text PDF

Post-transcriptional modifications at the anticodon stem-loop of tRNAs are key to the translation function. Metabolic pathways to these modifications often incorporate complex enzymology. A notable example is the hypermodified nucleoside, queuosine, found at the wobble position of Asn, Asp, His, and Tyr encoding tRNAs.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!