Background: Ubiquitination is a crucial post-translational modification of proteins that regulates diverse cellular functions. Accurate identification of ubiquitination sites in proteins is vital for understanding fundamental biological mechanisms, such as cell cycle and DNA repair. Conventional experimental approaches are resource-intensive, whereas machine learning offers a cost-effective means of accurately identifying ubiquitination sites. The prediction of ubiquitination sites is species-specific, with many existing models being tailored for () and (). However, these models have shortcomings in sequence window selection and feature extraction, leading to suboptimal performance.
Methods: This study initially employed the chi-square test to determine the optimal sequence window. Subsequently, a combination of six features was assessed: Binary Encoding (BE), Composition of K-Spaced Amino Acid Pair (CKSAAP), Enhanced Amino Acid Composition (EAAC), Position Weight Matrix (PWM), 531 Properties of Amino Acids (AA531), and Position-Specific Scoring Matrix (PSSM). Comparative evaluation involved three feature selection methods: Minimum Redundancy-Maximum Relevance (mRMR), Elastic net, and Null importances. Alongside these were four classifiers: Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF), and Extreme Gradient Boosting (XGBoost). The Null importances combined with the RF model exhibited superior predictive performance, and was denoted as UbNiRF (: ArUbNiRF; : HoUbNiRF).
Results: A comprehensive assessment indicated that UbNiRF is superior to existing prediction tools across five performance metrics. It notably excelled in the Matthews Correlation Coefficient (MCC), with values of 0.827 for the dataset and 0.781 for the dataset. Feature analysis underscores the significance of integrating six features and demonstrates their critical role in enhancing model performance.
Conclusions: UbNiRF is a valuable predictive tool for identifying ubiquitination sites in both and . Its robust performance and species-specific discovery capabilities make it extremely useful for elucidating biological processes and disease mechanisms associated with ubiquitination.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.31083/j.fbl2905197 | DOI Listing |
Protein Sci
January 2025
Department of Physical Chemistry, Institute of Biotechnology, and Unit of Excellence in Chemistry Applied to Biomedicine and Environment, School of Sciences, University of Granada, Granada, Spain.
The ubiquitin E2 variant domain of TSG101 (TSG101-UEV) plays a pivotal role in protein sorting and virus budding by recognizing PTAP motifs within ubiquitinated proteins. Disruption of TSG101-UEV/PTAP interactions has emerged as a promising strategy for the development of host-oriented broad-spectrum antivirals with low susceptibility to resistance. TSG101 is a challenging target characterized by an extended and flat binding interface, low affinity for PTAP ligands, and complex binding energetics.
View Article and Find Full Text PDFFront Immunol
December 2024
School of Marine Sciences, State Key Laboratory for Biocontrol/Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Guangdong Provincial Key Laboratory of Marine Resources and Coastal Engineering & Guangdong Provincial Observation and Research Station for Marine Ranching of the Lingdingyang Bay, Sun Yat-sen University, Guangzhou, China.
Stimulator of interferon genes (STING) is a key connector protein in interferon (IFN) signaling, crucial for IFN induction during the activation of antiviral innate immunity. In mammals, ring finger protein 5 (RNF5) functions as an E3 ubiquitin ligase, mediating STING regulation through K150 ubiquitylation to prevent excessive IFN production. However, the mechanisms underlying RNF5's regulation of STING in teleost fish remain unknown.
View Article and Find Full Text PDFAm J Hum Genet
December 2024
Department of Genetics, CHU Sainte-Justine, Montréal, QC, Canada. Electronic address:
E3 ubiquitin ligases have been linked to developmental diseases including autism, Angelman syndrome (UBE3A), and Johanson-Blizzard syndrome (JBS) (UBR1). Here, we report variants in the E3 ligase UBR5 in 29 individuals presenting with a neurodevelopmental syndrome that includes developmental delay, autism, intellectual disability, epilepsy, movement disorders, and/or genital anomalies. Their phenotype is distinct from JBS due to the absence of exocrine pancreatic insufficiency and the presence of autism, epilepsy, and, in some probands, a movement disorder.
View Article and Find Full Text PDFMol Biol Cell
December 2024
Department of Chemistry and Biochemistry, Brigham Young University, Provo, UT, USA.
Autophagy is an essential cellular recycling process that maintains protein and organelle homeostasis. ATG9A vesicle recruitment is a critical early step in autophagy to initiate autophagosome biogenesis. The mechanisms of ATG9A vesicle recruitment are best understood in the context of starvation-induced non-selective autophagy, whereas less is known about the signals driving ATG9A vesicle recruitment to autophagy initiation sites in the absence of nutrient stress.
View Article and Find Full Text PDFAdv Sci (Weinh)
December 2024
Department of Chemistry and Chemical Biology, TU Dortmund University, Otto-Hahn Str. 4a, 44227, Dortmund, Germany.
Ubiquitination is a dynamic post-translational modification governing protein abundance, function, and localization in eukaryotes. The Ubiquitin protein is conjugated to lysine residues of target proteins, but can also repeatedly be ubiquitinated itself, giving rise to a complex code of ubiquitin chains with different linkage types. To enable studying the cellular dynamics of linkage-specific ubiquitination, light-activatable polyubiquitin chain formation is reported here.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!