UbNiRF: A Hybrid Framework Based on Null Importances and Random Forest that Combines Multiple Features to Predict Ubiquitination Sites in and .

Front Biosci (Landmark Ed)

Hunan Engineering & Technology Research Centre for Agricultural Big Data Analysis & Decision-Making, Hunan Agricultural University, 410128 Changsha, Hunan, China.

Published: May 2024

Background: Ubiquitination is a crucial post-translational modification of proteins that regulates diverse cellular functions. Accurate identification of ubiquitination sites in proteins is vital for understanding fundamental biological mechanisms, such as cell cycle and DNA repair. Conventional experimental approaches are resource-intensive, whereas machine learning offers a cost-effective means of accurately identifying ubiquitination sites. The prediction of ubiquitination sites is species-specific, with many existing models being tailored for () and (). However, these models have shortcomings in sequence window selection and feature extraction, leading to suboptimal performance.

Methods: This study initially employed the chi-square test to determine the optimal sequence window. Subsequently, a combination of six features was assessed: Binary Encoding (BE), Composition of K-Spaced Amino Acid Pair (CKSAAP), Enhanced Amino Acid Composition (EAAC), Position Weight Matrix (PWM), 531 Properties of Amino Acids (AA531), and Position-Specific Scoring Matrix (PSSM). Comparative evaluation involved three feature selection methods: Minimum Redundancy-Maximum Relevance (mRMR), Elastic net, and Null importances. Alongside these were four classifiers: Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF), and Extreme Gradient Boosting (XGBoost). The Null importances combined with the RF model exhibited superior predictive performance, and was denoted as UbNiRF (: ArUbNiRF; : HoUbNiRF).

Results: A comprehensive assessment indicated that UbNiRF is superior to existing prediction tools across five performance metrics. It notably excelled in the Matthews Correlation Coefficient (MCC), with values of 0.827 for the dataset and 0.781 for the dataset. Feature analysis underscores the significance of integrating six features and demonstrates their critical role in enhancing model performance.

Conclusions: UbNiRF is a valuable predictive tool for identifying ubiquitination sites in both and . Its robust performance and species-specific discovery capabilities make it extremely useful for elucidating biological processes and disease mechanisms associated with ubiquitination.

Download full-text PDF

Source
http://dx.doi.org/10.31083/j.fbl2905197DOI Listing

Publication Analysis

Top Keywords

ubiquitination sites
20
null importances
12
random forest
8
identifying ubiquitination
8
sequence window
8
amino acid
8
ubiquitination
7
sites
5
ubnirf
4
ubnirf hybrid
4

Similar Publications

Exploring the druggability of the UEV domain of human TSG101 in search for broad-spectrum antivirals.

Protein Sci

January 2025

Department of Physical Chemistry, Institute of Biotechnology, and Unit of Excellence in Chemistry Applied to Biomedicine and Environment, School of Sciences, University of Granada, Granada, Spain.

The ubiquitin E2 variant domain of TSG101 (TSG101-UEV) plays a pivotal role in protein sorting and virus budding by recognizing PTAP motifs within ubiquitinated proteins. Disruption of TSG101-UEV/PTAP interactions has emerged as a promising strategy for the development of host-oriented broad-spectrum antivirals with low susceptibility to resistance. TSG101 is a challenging target characterized by an extended and flat binding interface, low affinity for PTAP ligands, and complex binding energetics.

View Article and Find Full Text PDF

Ring finger protein 5 mediates STING degradation through ubiquitinating K135 and K155 in a teleost fish.

Front Immunol

December 2024

School of Marine Sciences, State Key Laboratory for Biocontrol/Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Guangdong Provincial Key Laboratory of Marine Resources and Coastal Engineering & Guangdong Provincial Observation and Research Station for Marine Ranching of the Lingdingyang Bay, Sun Yat-sen University, Guangzhou, China.

Stimulator of interferon genes (STING) is a key connector protein in interferon (IFN) signaling, crucial for IFN induction during the activation of antiviral innate immunity. In mammals, ring finger protein 5 (RNF5) functions as an E3 ubiquitin ligase, mediating STING regulation through K150 ubiquitylation to prevent excessive IFN production. However, the mechanisms underlying RNF5's regulation of STING in teleost fish remain unknown.

View Article and Find Full Text PDF

E3 ubiquitin ligases have been linked to developmental diseases including autism, Angelman syndrome (UBE3A), and Johanson-Blizzard syndrome (JBS) (UBR1). Here, we report variants in the E3 ligase UBR5 in 29 individuals presenting with a neurodevelopmental syndrome that includes developmental delay, autism, intellectual disability, epilepsy, movement disorders, and/or genital anomalies. Their phenotype is distinct from JBS due to the absence of exocrine pancreatic insufficiency and the presence of autism, epilepsy, and, in some probands, a movement disorder.

View Article and Find Full Text PDF

Autophagy is an essential cellular recycling process that maintains protein and organelle homeostasis. ATG9A vesicle recruitment is a critical early step in autophagy to initiate autophagosome biogenesis. The mechanisms of ATG9A vesicle recruitment are best understood in the context of starvation-induced non-selective autophagy, whereas less is known about the signals driving ATG9A vesicle recruitment to autophagy initiation sites in the absence of nutrient stress.

View Article and Find Full Text PDF

Ubiquitination is a dynamic post-translational modification governing protein abundance, function, and localization in eukaryotes. The Ubiquitin protein is conjugated to lysine residues of target proteins, but can also repeatedly be ubiquitinated itself, giving rise to a complex code of ubiquitin chains with different linkage types. To enable studying the cellular dynamics of linkage-specific ubiquitination, light-activatable polyubiquitin chain formation is reported here.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!