ProteinMAE: masked autoencoder for protein surface self-supervised learning.

Bioinformatics

Digital Medical Research Center, School of Basic Medical Sciences, Fudan University, Shanghai 200032, China.

Published: December 2023

Summary: The biological functions of proteins are determined by the chemical and geometric properties of their surfaces. Recently, with the booming progress of deep learning, a series of learning-based surface descriptors have been proposed and achieved inspirational performance in many tasks such as protein design, protein-protein interaction prediction, etc. However, they are still limited by the problem of label scarcity, since the labels are typically obtained through wet experiments. Inspired by the great success of self-supervised learning in natural language processing and computer vision, we introduce ProteinMAE, a self-supervised framework specifically designed for protein surface representation to mitigate label scarcity. Specifically, we propose an efficient network and utilize a large number of accessible unlabeled protein data to pretrain it by self-supervised learning. Then we use the pretrained weights as initialization and fine-tune the network on downstream tasks. To demonstrate the effectiveness of our method, we conduct experiments on three different downstream tasks including binding site identification in protein surface, ligand-binding protein pocket classification, and protein-protein interaction prediction. The extensive experiments show that our method not only successfully improves the network's performance on all downstream tasks, but also achieves competitive performance with state-of-the-art methods. Moreover, our proposed network also exhibits significant advantages in terms of computational cost, which only requires less than a tenth of memory cost of previous methods.

Availability And Implementation: https://github.com/phdymz/ProteinMAE.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10713117PMC
http://dx.doi.org/10.1093/bioinformatics/btad724DOI Listing

Publication Analysis

Top Keywords

protein surface
12
self-supervised learning
12
downstream tasks
12
protein-protein interaction
8
interaction prediction
8
label scarcity
8
protein
6
proteinmae masked
4
masked autoencoder
4
autoencoder protein
4

Similar Publications

Background: The dried root of Inula helenium L., known as Inulae Radix in Mongolian medicine, is a widely used heat-clearing plant drug within the Asteraceae family. Alantolactone (ATL), a compound derived from Inulae Radix, is a sesquiterpene lactone with a range of biological activities.

View Article and Find Full Text PDF

P-cadherin (pCAD) and LI-cadherin (CDH17) are cell-surface proteins belonging to the cadherin superfamily that are both highly expressed in colorectal cancer. This co-expression profile presents a novel and attractive opportunity for a dual targeting approach using an antibody-drug conjugate (ADC). In this study, we used a unique avidity-driven screening approach to generate pCAD x CDH17 bispecific antibodies that selectively target cells expressing both antigens over cells expressing only pCAD or only CDH17.

View Article and Find Full Text PDF

Objective: This study explores whether hyaluronic acid (HA) of different molecular weights and collagen, given their role in tendon extracellular matrix maintenance, have a synergistic effect on human tendon-derived cells, with the aim to improve the treatment of tendinopathy.

Material: Human monocytes (CRL-9855™) and primary Achilles tendon-derived cells.

Treatment: The collagen/HA ratio was based on the formulation of the commercial food supplement TendoGenIAL™.

View Article and Find Full Text PDF

Mediating role of blood metabolites in the relationship between immune cell traits and sepsis: a Mendelian randomization and mediation analysis.

Inflamm Res

January 2025

Department of Emergency Medicine, Institute of Disaster Medicine and Institute of Emergency Medicine, West China Hospital, West China School of Medicine, Sichuan University, Chengdu, 610041, People's Republic of China.

Background: A significant association between immune cells and sepsis has been suggested by observational studies. However, the precise biological mechanisms underlying this association remain unclear. Therefore, we employed a Mendelian randomization (MR) approach to investigate the causal relationship between immune cells and genetic susceptibility to sepsis, and to explore the potential mediating role of blood metabolites.

View Article and Find Full Text PDF

Connexin 43 contributes to perioperative neurocognitive disorder by attenuating perineuronal net of hippocampus in aged mice.

Cell Mol Life Sci

January 2025

Shanghai Key Laboratory of Anesthesiology and Brain Functional Modulation, Clinical Research Center for Anesthesiology and Perioperative Medicine, Translational Research Institute of Brain and Brain-Like Intelligence, Department of Anesthesiology and Perioperative MedicineSchool of Medicine, Shanghai Fourth People's Hospital, School of Medicine, Tongji University, 1239 Sanmen Road, Hongkou District, Shanghai, 200434, China.

Background: Perioperative neurocognitive disorder (PND) is a prevalent form of cognitive impairment in elderly patients following anesthesia and surgery. The underlying mechanisms of PND are closely related to perineuronal nets (PNNs). PNNs, which are complexes of extracellular matrix primarily surrounding neurons in the hippocampus, play a critical role in neurocognitive function.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!