Generative design of proteins based on secondary structure constraints using an attention-based diffusion model.

Chem

Laboratory for Atomistic and Molecular Mechanics (LAMM), Massachusetts Institute of Technology, 77 Massachusetts Ave., Cambridge, MA 02139, USA.

Published: July 2023

We report two generative deep learning models that predict amino acid sequences and 3D protein structures based on secondary structure design objectives via either overall content or per-residue structure. Both models are robust regarding imperfect inputs and offer design capacity as they can discover new protein sequences not yet discovered from natural mechanisms or systems. The residue-level secondary structure design model generally yields higher accuracy and more diverse sequences. These findings suggest unexplored opportunities for protein designs and functional outcomes within the vast amino acid sequences beyond known proteins. Our models, based on an attention-based diffusion model and trained on a dataset extracted from experimentally known 3D protein structures, offer numerous downstream applications in conditional generative design of various biological or engineering systems. Future work may include additional conditioning, and an exploration of other functional properties of the generated proteins for various properties beyond structural objectives.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10443900PMC
http://dx.doi.org/10.1016/j.chempr.2023.03.020DOI Listing

Publication Analysis

Top Keywords

secondary structure
12
generative design
8
based secondary
8
attention-based diffusion
8
diffusion model
8
amino acid
8
acid sequences
8
protein structures
8
structure design
8
design proteins
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!