A protein sequence-based deep transfer learning framework for identifying human proteome-wide deubiquitinase-substrate interactions.

Nat Commun

State Key Laboratory of Medical Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, 102206, China.

Published: May 2024

Protein ubiquitination regulates a wide range of cellular processes. The degree of protein ubiquitination is determined by the delicate balance between ubiquitin ligase (E3)-mediated ubiquitination and deubiquitinase (DUB)-mediated deubiquitination. In comparison to the E3-substrate interactions, the DUB-substrate interactions (DSIs) remain insufficiently investigated. To address this challenge, we introduce a protein sequence-based ab initio method, TransDSI, which transfers proteome-scale evolutionary information to predict unknown DSIs despite inadequate training datasets. An explainable module is integrated to suggest the critical protein regions for DSIs while predicting DSIs. TransDSI outperforms multiple machine learning strategies against both cross-validation and independent test. Two predicted DUBs (USP11 and USP20) for FOXP3 are validated by "wet lab" experiments, along with two predicted substrates (AR and p53) for USP22. TransDSI provides new functional perspective on proteins by identifying regulatory DSIs, and offers clues for potential tumor drug target discovery and precision drug application.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11133436PMC
http://dx.doi.org/10.1038/s41467-024-48446-3DOI Listing

Publication Analysis

Top Keywords

protein sequence-based
8
protein ubiquitination
8
protein
5
dsis
5
sequence-based deep
4
deep transfer
4
transfer learning
4
learning framework
4
framework identifying
4
identifying human
4

Similar Publications

We aimed to build a robust classifier for the MGMT methylation status of glioblastoma in multiparametric MRI. We focused on multi-habitat deep image descriptors as our basic focus. A subset of the BRATS 2021 MGMT methylation dataset containing both MGMT class labels and segmentation masks was used.

View Article and Find Full Text PDF

Protein-Protein Interaction (PPI) prediction plays a pivotal role in understanding cellular processes and uncovering molecular mechanisms underlying health and disease. Structure-based PPI prediction has emerged as a robust alternative to sequence-based methods, offering greater biological accuracy by integrating three-dimensional spatial and biochemical features. This work summarizes the recent advances in computational approaches leveraging protein structure information for PPI prediction, focusing on machine learning (ML) and deep learning (DL) techniques.

View Article and Find Full Text PDF

Comprehensive Analysis of the Immune Response to SARS-CoV-2 Epitopes: Unveiling Potential Targets for Vaccine Development.

Biology (Basel)

January 2025

Chaoshan Branch of State Key Laboratory for Esophageal Cancer Prevention and Treatment, Shantou University Medical College, Shantou 515041, China.

SARS-CoV-2 continues to be a major global health threat. In this study, we performed a comprehensive meta-analysis on the epitopes of SARS-CoV-2, revealing its immunological landscape. Furthermore, using Shannon entropy for sequence conservation analysis and structural network-based methods identified candidate epitopes that are highly conserved and evolutionarily constrained in SARS-CoV-2 and other zoonotic coronaviruses.

View Article and Find Full Text PDF

AFFIPred: AlphaFold2 structure-based Functional Impact Prediction of missense variations.

Protein Sci

February 2025

Department of Biostatistics and Bioinformatics, Institute of Health Sciences, Acibadem University, Atasehir, Istanbul, Turkey.

Protein structure holds immense potential for pathogenicity prediction, albeit structure-based predictors are limited compared to the sequence-based counterparts due to the "structure knowledge gap" between large number of available protein sequences and relatively limited number of structures. Leveraging the highly accurate protein structures predicted by AlphaFold2 (AF2), we introduce AFFIPred, an ensemble machine learning classifier that combines sequence and AF2-based structural characteristics to predict missense variant pathogenicity. Based on the assessments on unseen datasets, AFFIPred reached a comparable level of performance with the state-of-the-art predictors such as AlphaMissense.

View Article and Find Full Text PDF

Predicting the location of coordinated metal ion-ligand binding sites using geometry-aware graph neural networks.

Comput Struct Biotechnol J

December 2024

Department of Electrical Engineering and Computer Science, Bond Life Sciences Center, University of Missouri, Columbia, MO, USA.

More than 50 % of proteins bind to metal ions. Interactions between metal ions and proteins, especially coordinated interactions, are essential for biological functions, such as maintaining protein structure and signal transport. Physiological metal-ion binding prediction is pivotal for both elucidating the biological functions of proteins and for the design of new drugs.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!