The HADDOCK team participated in CAPRI rounds 47-55 as server, manual predictor, and scorers. Throughout these CAPRI rounds, we used a plethora of computational strategies to predict the structure of protein complexes. Of the 10 targets comprising 24 interfaces, we achieved acceptable or better models for 3 targets in the human category and 1 in the server category.
View Article and Find Full Text PDFSummary: The Local Disordered Region Sampling (LDRS, pronounced loaders) tool is a new module developed for IDPConformerGenerator, a previously validated approach to model intrinsically disordered proteins (IDPs). The IDPConformerGenerator LDRS module provides a method for generating all-atom conformations of intrinsically disordered protein regions at N- and C-termini of and in loops or linkers between folded regions of an existing protein structure. These disordered elements often lead to missing coordinates in experimental structures or low confidence in predicted structures.
View Article and Find Full Text PDFWe present the results for CAPRI Round 54, the 5th joint CASP-CAPRI protein assembly prediction challenge. The Round offered 37 targets, including 14 homodimers, 3 homo-trimers, 13 heterodimers including 3 antibody-antigen complexes, and 7 large assemblies. On average ~70 CASP and CAPRI predictor groups, including more than 20 automatics servers, submitted models for each target.
View Article and Find Full Text PDFThe Local Disordered Region Sampling (LDRS, pronounced ) tool, developed for the IDPConformerGenerator platform (Teixeira 2022), provides a method for generating all-atom conformations of intrinsically disordered regions (IDRs) at N- and C-termini of and in loops or linkers between folded regions of an existing protein structure. These disordered elements often lead to missing coordinates in experimental structures or low confidence in predicted structures. Requiring only a pre-existing PDB structure of the protein with missing coordinates or with predicted confidence scores and its full-length primary sequence, LDRS will automatically generate physically meaningful conformational ensembles of the missing flexible regions to complete the full-length protein.
View Article and Find Full Text PDFThe structural characterization of proteins with a disorder requires a computational approach backed by experiments to model their diverse and dynamic structural ensembles. The selection of conformational ensembles consistent with solution experiments of disordered proteins highly depends on the initial pool of conformers, with currently available tools limited by conformational sampling. We have developed a Generative Recurrent Neural Network (GRNN) that uses supervised learning to bias the probability distributions of torsions to take advantage of experimental data types such as nuclear magnetic resonance J-couplings, nuclear Overhauser effects, and paramagnetic resonance enhancements.
View Article and Find Full Text PDFProteins form complex interactions in the cellular environment to carry out their functions. They exhibit a wide range of binding modes depending on the cellular conditions, which result in a variety of ordered or disordered assemblies. To help rationalise the binding behavior of proteins, the FuzPred server predicts their sequence-based binding modes without specifying their binding partners.
View Article and Find Full Text PDFDuring muscle cell differentiation, the alternatively spliced, acidic β-domain potentiates transcription of Myocyte-specific Enhancer Factor 2 (Mef2D). Sequence analysis by the FuzDrop method indicates that the β-domain can serve as an interaction element for Mef2D higher-order assembly. In accord, we observed Mef2D mobile nuclear condensates in C2C12 cells, similar to those formed through liquid-liquid phase separation.
View Article and Find Full Text PDFWe consider a generic representation problem of internal coordinates (bond lengths, valence angles, and dihedral angles) and their transformation to 3-dimensional Cartesian coordinates of a biomolecule. We show that the internal-to-Cartesian process relies on correctly predicting chemically subtle correlations among the internal coordinates themselves, and learning these correlations increases the fidelity of the Cartesian representation. We developed a machine learning algorithm, Int2Cart, to predict bond lengths and bond angles from backbone torsion angles and residue types of a protein, which allows reconstruction of protein structures better than using fixed bond lengths and bond angles or a static library method that relies on backbone torsion angles and residue types in a local environment.
View Article and Find Full Text PDFHow do proteins interact in the cellular environment? Which interactions stabilize liquid-liquid phase separated condensates? Are the concepts, which have been developed for specific protein complexes also applicable to higher-order assemblies? Recent discoveries prompt for a universal framework for protein interactions, which can be applied across the scales of protein communities. Here, we discuss how our views on protein interactions have evolved from rigid structures to conformational ensembles of proteins and discuss the open problems, in particular related to biomolecular condensates. Protein interactions have evolved to follow changes in the cellular environment, which manifests in multiple modes of interactions between the same partners.
View Article and Find Full Text PDFThe power of structural information for informing biological mechanisms is clear for stable folded macromolecules, but similar structure-function insight is more difficult to obtain for highly dynamic systems such as intrinsically disordered proteins (IDPs) which must be described as structural ensembles. Here, we present IDPConformerGenerator, a flexible, modular open-source software platform for generating large and diverse ensembles of disordered protein states that builds conformers that obey geometric, steric, and other physical restraints on the input sequence. IDPConformerGenerator samples backbone phi (φ), psi (ψ), and omega (ω) torsion angles of relevant sequence fragments from loops and secondary structure elements extracted from folded protein structures in the RCSB Protein Data Bank and builds side chains from robust Monte Carlo algorithms using expanded rotamer libraries.
View Article and Find Full Text PDFIntrinsically disordered proteins and unfolded proteins have fluctuating conformational ensembles that are fundamental to their biological function and impact protein folding, stability, and misfolding. Despite the importance of protein dynamics and conformational sampling, time-dependent data types are not fully exploited when defining and refining disordered protein ensembles. Here we introduce a computational framework using an elastic network model and normal-mode displacements to generate a dynamic disordered ensemble consistent with NMR-derived dynamics parameters, including transverse relaxation rates and Lipari-Szabo order parameters ( values).
View Article and Find Full Text PDFCoupling of side chain dynamics over long distances is an important component of allostery. Methionine side chains show the largest intrinsic flexibility among methyl-containing residues but the actual degree of conformational averaging depends on the proximity and mobility of neighboring residues. The C NMR chemical shifts of the methyl groups of methionine residues located at long distances in the same protein show a similar scaling with respect to the values predicted from the static X-ray structure by quantum methods.
View Article and Find Full Text PDFThe Protein Data Bank (PDB) file format remains a popular format used and supported by many software to represent coordinates of macromolecular structures. It however suffers from drawbacks such as error-prone manual editing. Because of that, various software toolkits have been developed to facilitate its editing and manipulation, but, to date, there is no online tool available for this purpose.
View Article and Find Full Text PDFEmergence of coronaviruses poses a threat to global health and economy. The current outbreak of SARS-CoV-2 has infected more than 28,000,000 people and killed more than 915,000. To date, there is no treatment for coronavirus infections, making the development of therapies to prevent future epidemics of paramount importance.
View Article and Find Full Text PDFProteins with intrinsic or unfolded state disorder comprise a new frontier in structural biology, requiring the characterization of diverse and dynamic structural ensembles. We introduce a comprehensive Bayesian framework, the Extended Experimental Inferential Structure Determination (X-EISD) method, that calculates the maximum log-likelihood of a disordered protein ensemble. X-EISD accounts for the uncertainties of a range of experimental data and back-calculation models from structures, including NMR chemical shifts, J-couplings, Nuclear Overhauser Effects (NOEs), paramagnetic relaxation enhancements (PREs), residual dipolar couplings (RDCs), hydrodynamic radii ( ), single molecule fluorescence Förster resonance energy transfer (smFRET) and small angle X-ray scattering (SAXS).
View Article and Find Full Text PDFThe pdb-tools are a collection of Python scripts for working with molecular structure data in the Protein Data Bank (PDB) format. They allow users to edit, convert, and validate PDB files, from the command-line, in a simple but efficient manner. The pdb-tools are implemented in Python, without any external dependencies, and are freely available under the open-source Apache License at https://github.
View Article and Find Full Text PDFThe c-Src oncogene is anchored to the cytoplasmic membrane through its N-terminal myristoylated SH4 domain. This domain is part of an intramolecular fuzzy complex with the SH3 and Unique domains. Here we show that the N-terminal myristoyl group binds to the SH3 domain in the proximity of the RT loop, when Src is not anchored to a lipid membrane.
View Article and Find Full Text PDFCalcineurin is an essential calcium-activated serine/threonine phosphatase. The six NMR-observable methionine methyl groups in the catalytic domain of human calcineurin Aα (CNA) were assigned and used as reporters of the presence of potential cis-trans isomers in solution. Proline 84 is found in the cis conformation in most calcineurin X-ray structures, and proline 309, which is part of a highly conserved motif in phosphoprotein phosphatases, was modeled with a cis peptide bond in one of the two molecules present in the asymmetric unit of CNA.
View Article and Find Full Text PDFThe function of the intrinsically disordered Unique domain of the Src family of tyrosine kinases (SFK), where the largest differences between family members are concentrated, remains poorly understood. Recent studies in c-Src have demonstrated that the Unique region forms transient interactions, described as an intramolecular fuzzy complex, with the SH3 domain and suggested that similar complexes could be formed by other SFKs. Src and Lyn are members of a distinct subfamily of SFKs.
View Article and Find Full Text PDFStructural disorder is an essential ingredient for function in many proteins and protein complexes. Fuzzy complexes describe the many instances where disorder is maintained as a critical element of protein interactions. In this minireview we discuss how intramolecular fuzzy interactions function in signaling complexes.
View Article and Find Full Text PDFWe present Farseer-NMR ( https://git.io/vAueU ), a software package to treat, evaluate and combine NMR spectroscopic data from sets of protein-derived peaklists covering a range of experimental conditions. The combined advances in NMR and molecular biology enable the study of complex biomolecular systems such as flexible proteins or large multibody complexes, which display a strong and functionally relevant response to their environmental conditions, e.
View Article and Find Full Text PDFThe N-terminal regulatory region of c-Src including the SH4, Unique, and SH3 domains adopts a compact, yet highly dynamic, structure that can be described as an intramolecular fuzzy complex. Most of the long-range interactions within the Unique domain are also observed in constructs lacking the structured SH3, indicating a considerable degree of preorganization of the disordered Unique domain. Here we report that members of the Src family of kinases (SFK) share well-conserved sequence features involving aromatic residues in their Unique domains.
View Article and Find Full Text PDFThe Hha and TomB proteins from Escherichia coli form an oxygen-dependent toxin-antitoxin (TA) system. Here we show that YmoB, the Yersinia orthologue of TomB, and its single cysteine variant [C117S]YmoB can replace TomB as antitoxins in E. coli.
View Article and Find Full Text PDF