Protein structure prediction is fundamental to molecular biology and has numerous applications in areas such as drug discovery and protein engineering. Machine learning techniques have greatly advanced protein 3D modeling in recent years, particularly with the development of AlphaFold2 (AF2), which can analyze sequences of amino acids and predict 3D structures with near experimental accuracy. Since the release of AF2, numerous studies have been conducted, either using AF2 directly for large-scale modeling or building upon the software for other use cases. Many reviews have been published discussing the impact of AF2 in the field of protein bioinformatics, particularly in relation to neural networks, which have highlighted what AF2 can and cannot do. It is evident that AF2 and similar approaches are open to further development and several new approaches have emerged, in addition to older refinement approaches, for improving the quality of predictions. Here we provide a brief overview, aimed at the general biologist, of how machine learning techniques have been used for improvement of 3D models of proteins following AF2, and we highlight the impacts of these approaches. In the most recent experiment on the Critical Assessment of Techniques for Protein Structure Prediction (CASP15), the most successful groups all developed their own tools for protein structure modeling that were based at least in some part on AF2. This improvement involved employing techniques such as generative modeling, changing parameters such as dropout to generate more AF2 structures, and data-driven approaches including using alternative templates and MSAs.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1007/978-1-0716-4196-5_7 | DOI Listing |
Proc Natl Acad Sci U S A
January 2025
Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139.
Protein language models (PLMs) have demonstrated impressive success in modeling proteins. However, general-purpose "foundational" PLMs have limited performance in modeling antibodies due to the latter's hypervariable regions, which do not conform to the evolutionary conservation principles that such models rely on. In this study, we propose a transfer learning framework called Antibody Mutagenesis-Augmented Processing (AbMAP), which fine-tunes foundational models for antibody-sequence inputs by supervising on antibody structure and binding specificity examples.
View Article and Find Full Text PDFProc Natl Acad Sci U S A
January 2025
Chinese Academy of Sciences Key Laboratory of Tropical Marine Bio Resources and Ecology, Guangdong Key Laboratory of Marine Materia Medica, Innovation Academy of South China Sea Ecology and Environmental Engineering, Guangdong Provincial Observation and Research Station for Coastal Upwelling Ecosystem, South China Sea Institute of Oceanology, Chinese Academy of Sciences, Guangzhou 511458, China.
Rotation of the bacterial flagellum, the first identified biological rotary machine, is driven by its stator units. Knowledge gained about the function of stator units has increasingly led to studies of rotary complexes in different cellular pathways. Here, we report that a tetrameric PilZ family protein, FlgX, is a structural component underneath the stator units in the flagellar motor of .
View Article and Find Full Text PDFProc Natl Acad Sci U S A
January 2025
Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA 02138.
C-Terminal cyclic imides are posttranslational modifications that can arise from spontaneous intramolecular cleavage of asparagine or glutamine residues resulting in a form of irreversible protein damage. These protein damage events are recognized and removed by the E3 ligase substrate adapter cereblon (CRBN), indicating that these aging-related modifications may require cellular quality control mechanisms to prevent deleterious effects. However, the factors that determine protein or peptide susceptibility to C-terminal cyclic imide formation or their effect on protein stability have not been explored in detail.
View Article and Find Full Text PDFProc Natl Acad Sci U S A
January 2025
Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX 75390.
Neurotransmitter release is triggered in microseconds by Ca-binding to the Synaptotagmin-1 C-domains and by SNARE complexes that form four-helix bundles between synaptic vesicles and plasma membranes, but the coupling mechanism between Ca-sensing and membrane fusion is unknown. Release requires extension of SNARE helices into juxtamembrane linkers that precede transmembrane regions (linker zippering) and binding of the Synaptotagmin-1 CB domain to SNARE complexes through a "primary interface" comprising two regions (I and II). The Synaptotagmin-1 Ca-binding loops were believed to accelerate membrane fusion by inducing membrane curvature, perturbing lipid bilayers, or helping bridge the membranes, but SNARE complex binding through the primary interface orients the Ca-binding loops away from the fusion site, hindering these putative activities.
View Article and Find Full Text PDFProc Natl Acad Sci U S A
January 2025
Department of Chemistry and Biochemistry, The Ohio State University, Columbus, OH 43210.
The homo-dodecameric ring-shaped RNA binding attenuation protein (TRAP) from binds up to twelve tryptophan ligands (Trp) and becomes activated to bind a specific sequence in the 5' leader region of the operon mRNA, thereby downregulating biosynthesis of Trp. Thermodynamic measurements of Trp binding have revealed a range of cooperative behavior for different TRAP variants, even if the averaged apparent affinities for Trp have been found to be similar. Proximity between the ligand binding sites, and the ligand-coupled disorder-to-order transition has implicated nearest-neighbor interactions in cooperativity.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!