Glutarylation, which is a newly identified posttranslational modification that occurs on lysine residues, has recently emerged as an important regulator of several metabolic and mitochondrial processes. However, the specific sites of modification on individual proteins, as well as the extent of glutarylation throughout the proteome, remain largely uncharacterized. Though informative, proteomic approaches based on mass spectrometry can be expensive, technically challenging and time-consuming. Therefore, the ability to predict glutarylation sites from protein primary sequences can complement proteomics analyses and help researchers study the characteristics and functional consequences of glutarylation. To this end, we used Random Forest (RF) machine learning strategies to identify the physiochemical and sequence-based features that correlated most substantially with glutarylation. We then used these features to develop a novel method to predict glutarylation sites from primary amino acid sequences using RF. Based on 10-fold cross-validation, the resulting algorithm, termed 'RF-GlutarySite', achieved efficiency scores of 75%, 81%, 68% and 0.50 with respect to accuracy (ACC), sensitivity (SN), specificity (SP) and Matthew's correlation coefficient (MCC), respectively. Likewise, using an independent test set, RF-GlutarySite exhibited ACC, SN, SP and MCC scores of 72%, 73%, 70% and 0.43, respectively. Results using both 10-fold cross validation and an independent test set were on par with or better than those achieved by existing glutarylation site predictors. Notably, RF-GlutarySite achieved the highest SN score among available glutarylation site prediction tools. Consequently, our method has the potential to uncover new glutarylation sites and to facilitate the discovery of relationships between glutarylation and well-known lysine modifications, such as acetylation, methylation and SUMOylation, as well as a number of recently identified lysine modifications, such as malonylation and succinylation.

Download full-text PDF

Source
http://dx.doi.org/10.1039/c9mo00028cDOI Listing

Publication Analysis

Top Keywords

glutarylation sites
16
glutarylation
11
random forest
8
predict glutarylation
8
independent test
8
test set
8
glutarylation site
8
lysine modifications
8
sites
5
rf-glutarysite random
4

Similar Publications

Article Synopsis
  • Protein acetylation is a widely studied post-translational modification, and recent research has identified three new forms: lysine malonylation, succinylation, and glutarylation, which mainly affect energy metabolism in diseases caused by Mycobacterium pathogens.
  • Methods involved using high-affinity antibody enrichment and LC-MS/MS analysis to characterize these new lysine modifications and assess their functional impacts in certain proteins.
  • Results showed significant global substrate characterization for these acylations, revealing connections to ribosomal function and various metabolic pathways, highlighting their importance in cellular processes.
View Article and Find Full Text PDF

As an important post-translational modification, glutarylation plays a crucial role in a variety of cellular functions. Recently, diverse computational methods for glutarylation site identification have been proposed. However, the class imbalance problem due to data noise and uncertainty of non-glutarylation sites remains a great challenge.

View Article and Find Full Text PDF

GBDT_KgluSite: An improved computational prediction model for lysine glutarylation sites based on feature fusion and GBDT classifier.

BMC Genomics

December 2023

Jiangsu Center for the Collaboration and Innovation of Cancer Biotherapy, Xuzhou Medical University, Xuzhou, Jiangsu, 221004, China.

Background: Lysine glutarylation (Kglu) is one of the most important Post-translational modifications (PTMs), which plays significant roles in various cellular functions, including metabolism, mitochondrial processes, and translation. Therefore, accurate identification of the Kglu site is important for elucidating protein molecular function. Due to the time-consuming and expensive limitations of traditional biological experiments, computational-based Kglu site prediction research is gaining more and more attention.

View Article and Find Full Text PDF

Lysine Nɛ-acylations, such as acetylation or succinylation, are post-translational modifications that regulate protein function. In mitochondria, lysine acylation is predominantly non-enzymatic, and only a specific subset of the proteome is acylated. Coenzyme A (CoA) can act as an acyl group carrier via a thioester bond, but what controls the acylation of mitochondrial lysines remains poorly understood.

View Article and Find Full Text PDF

As a key issue in orchestrating various biological processes and functions, protein post-translational modification (PTM) occurs widely in the mechanism of protein's function of animals and plants. Glutarylation is a type of protein-translational modification that occurs at active ε-amino groups of specific lysine residues in proteins, which is associated with various human diseases, including diabetes, cancer, and glutaric aciduria type I. Therefore, the issue of prediction for glutarylation sites is particularly important.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!