Self-supervised pretraining on protein sequences has led to state-of-the-art performance on protein function and fitness prediction. However, sequence-only methods ignore the rich information contained in experimental and predicted protein structures. Meanwhile, inverse folding methods reconstruct a protein's amino-acid sequence given its structure, but do not take advantage of sequences that do not have known structures. In this study, we train a masked inverse folding protein masked language model parameterized as a structured graph neural network. During pretraining, this model learns to reconstruct corrupted sequences conditioned on the backbone structure. We then show that using the outputs from a pretrained sequence-only protein masked language model as input to the inverse folding model further improves pretraining perplexity. We evaluate both of these models on downstream protein engineering tasks and analyze the effect of using information from experimental or predicted structures on performance.
Source (DOI Listing): http://dx.doi.org/10.1093/protein/gzad015
Brief Bioinform
November 2024
AI Lab, Research Center for Industries of the Future, Westlake University, Zhejiang 310058, China.
The rational design of Ribonucleic acid (RNA) molecules is crucial for advancing therapeutic applications, synthetic biology, and understanding the fundamental principles of life. Traditional RNA design methods have predominantly focused on secondary structure-based sequence design, often neglecting the intricate and essential tertiary interactions. We introduce R3Design, a tertiary structure-based RNA sequence design method that shifts the paradigm to prioritize tertiary structure in the RNA sequence design.
The task of RNA design given a target structure aims to find a sequence that can fold into that structure. It is a computationally hard problem where some version(s) have been proven to be NP-hard. As a result, heuristic methods such as local search have been popular for this task, but by only exploring a fixed number of candidates.
Cureus
November 2024
Rheumatology, Kiran C. Patel College of Osteopathic Medicine, Nova Southeastern University, Davie, USA.
Psoriasis (PsO) is a chronic, systemic, and autoimmune dermatologic condition characterized by dry, scaly, and erythematous plaques on the skin. PsO can present in various forms, including guttate (small, round lesions commonly over the upper trunk and extremities that can be raised and scaly), inverse (smooth plaques of inflamed skin within skin folds of the groin, buttock, and breasts), pustular (white painful pustules within red inflamed blotches widespread over the body), and erythrodermic (red rash present over most of the body). Individuals with PsO can present differently, with unique symptoms and patterns on the skin.
Nanophotonics
April 2024
National Laboratory of Solid-State Microstructures, College of Engineering and Applied Sciences and Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing 210093, China.
High-Q resonances in metasurfaces, stemming from symmetry-protected bound states in the continuum (BICs), have proven to be effective for achieving high-performance optical devices. However, the properties associated with symmetry-protected BICs are inherently limited, as even a slight variation in the asymmetry parameter leads to a noticeable shift in the resonance location. Herein, we introduce the concept of relative shift-induced quasi-BICs (QBICs) within dimerized silicon (Si) meta-lattices (DSMs), which can be excited when a nonzero relative shift occurs, a result of in-plane inversion symmetry breaking and Brillouin zone folding within the structure.
Heliyon
November 2024
Department of General Surgery, The Central Theater Hospital of the Chinese People's Liberation Army, Wuhan, 430070, China.
Background: Pancreatic cancer (PC) is a devastating human malignancy with a poor survival outcome (5-year survival less than 10 %). In recent years, the regulatory roles of long non-coding RNAs (lncRNAs) in various types of cancers have been widely reported. Based on bioinformatics analysis, LINC01857 is shown to be highly expressed in PC tissue.