AlphaFold2 revolutionized structural biology with the ability to predict protein structures with exceptionally high accuracy. Its implementation, however, lacks the code and data required to train new models. These are necessary to (1) tackle new tasks, like protein-ligand complex structure prediction, (2) investigate the process by which the model learns and (3) assess the model's capacity to generalize to unseen regions of fold space. Here we report OpenFold, a fast, memory efficient and trainable implementation of AlphaFold2. We train OpenFold from scratch, matching the accuracy of AlphaFold2. Having established parity, we find that OpenFold is remarkably robust at generalizing even when the size and diversity of its training set is deliberately limited, including near-complete elisions of classes of secondary structure elements. By analyzing intermediate structures produced during training, we also gain insights into the hierarchical manner in which OpenFold learns to fold. In sum, our studies demonstrate the power and utility of OpenFold, which we believe will prove to be a crucial resource for the protein modeling community.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11645889 | PMC |
http://dx.doi.org/10.1038/s41592-024-02272-z | DOI Listing |
GM Crops Food
December 2024
Regulatory Science, Bayer Crop Science, Chesterfield, MO, USA.
A transgenic protein is frequently expressed as different homologous variants in genetically modified crops due to differential processing of targeting peptides or optimization of activity and specificity. The aim of this study was to develop a science-based approach for risk assessment of homologous protein variants using dicamba mono-oxygenase (DMO) as a case study. In this study, DMO expressed in the next-generation dicamba-tolerant maize, sugar beet and soybean crops exhibited up to 27 amino acid sequence differences in the N-terminus.
View Article and Find Full Text PDFPLoS One
September 2024
Regulatory Science, Bayer Crop Science, Chesterfield, Missouri, United States of America.
Transgenic soybean, cotton, and maize tolerant to protoporphyrinogen IX oxidase (PPO)-inhibiting herbicides have been developed by introduction of a bacterial-derived PPO targeted into the chloroplast. PPO is a membrane-associated protein with an intrinsic tendency for aggregation, making expression, purification, and formulation at high concentrations difficult. In this study, transgenic PPO expressed in three crops was demonstrated to exhibit up to a 13 amino acid sequence difference in the N-terminus due to differential processing of the chloroplast transit peptide (CTP).
View Article and Find Full Text PDFJAMA Ophthalmol
November 2024
Institute for Vision Research, University of Iowa, Iowa City.
Importance: This research confirms and further establishes that pathogenic variants in a fourth gene, METTL23, are associated with autosomal dominant normal-tension glaucoma (NTG).
Objective: To determine the frequency of glaucoma-causing pathogenic variants in the METTL23 gene in a cohort of patients with NTG from Iowa.
Design, Setting, And Participants: This case-control study took place at a single tertiary care center in Iowa from January 1997 to January 2024, with analysis occurring between January 2023 and January 2024.
Nat Biotechnol
June 2024
Nature Biotechnology, .
Sci Data
June 2024
Innophore, San Francisco, CA, USA.
Human proteins are crucial players in both health and disease. Understanding their molecular landscape is a central topic in biological research. Here, we present an extensive dataset of predicted protein structures for 42,042 distinct human proteins, including splicing variants, derived from the UniProt reference proteome UP000005640.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!