Large language models trained on sequence information alone are capable of learning high-level principles of protein design. However, beyond sequence, the three-dimensional structures of proteins determine their specific function, activity, and evolvability. Here we show that a general protein language model augmented with protein structure backbone coordinates and trained on the inverse folding problem can guide evolution for diverse proteins without needing to explicitly model individual functional tasks. We demonstrate inverse folding to be an effective unsupervised, structure-based sequence optimization strategy that also generalizes to multimeric complexes by implicitly learning features of binding and amino acid epistasis. Using this approach, we screened ~30 variants of two therapeutic clinical antibodies used to treat SARS-CoV-2 infection and achieved up to 26-fold improvement in neutralization and 37-fold improvement in affinity against antibody-escaped viral variants-of-concern BQ.1.1 and XBB.1.5, respectively. In addition to substantial overall improvements in protein function, we find that inverse folding achieves experimental success rates among the highest reported for machine learning-guided directed evolution methods, without requiring any task-specific training data.
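
As an illustration only, the idea of using an inverse folding model to propose variants can be sketched as ranking single amino acid substitutions by how much more likely the model finds them than the wild-type residue, given the backbone structure. The abstract does not specify the scoring procedure, so everything below is an assumption: the function name `rank_substitutions`, the log-likelihood-ratio score, and the `log_probs` input (a stand-in for per-position log-probabilities that a structure-conditioned model would produce) are hypothetical and are not taken from the paper.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def rank_substitutions(wild_type, log_probs, top_k=5):
    """Rank single substitutions by log-likelihood ratio against wild type.

    wild_type: amino acid sequence as a string.
    log_probs: list with one dict per position mapping amino acid -> log
        probability. ASSUMPTION: these values stand in for the output of an
        inverse folding model conditioned on backbone coordinates.
    Returns up to top_k (mutation, score) pairs, highest score first.
    """
    scored = []
    for i, wt in enumerate(wild_type):
        for aa in AMINO_ACIDS:
            if aa == wt:
                continue
            # Positive score: the model prefers the substitution to wild type.
            score = log_probs[i][aa] - log_probs[i][wt]
            scored.append((score, f"{wt}{i + 1}{aa}"))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [(name, score) for score, name in scored[:top_k]]


# Toy input (not real model output): uniform preferences, except that the
# hypothetical model strongly favours W at position 2.
toy = [{aa: math.log(0.05) for aa in AMINO_ACIDS} for _ in "ACD"]
toy[1]["W"] = math.log(0.5)
print(rank_substitutions("ACD", toy, top_k=3))
```

In a workflow of this kind, the top-ranked substitutions would be the small set of variants carried forward to experimental screening; the scoring step itself requires no task-specific training data.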


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10769282
DOI: http://dx.doi.org/10.1101/2023.12.19.572475

