Computational Method for Identifying Malonylation Sites by Using Random Forest Algorithm.

Comb Chem High Throughput Screen

School of Life Sciences, Shanghai University, Shanghai 200444, China.

Published: April 2021

Background: As a newly uncovered post-translational modification on the ε-amino group of lysine residue, protein malonylation was found to be involved in metabolic pathways and certain diseases. Apart from experimental approaches, several computational methods based on machine learning algorithms were recently proposed to predict malonylation sites. However, previous methods failed to address imbalanced data sizes between positive and negative samples.

Objective: In this study, we identified the significant features of malonylation sites in a novel computational method which applied machine learning algorithms and balanced data sizes by applying synthetic minority over-sampling technique.

Method: Four types of features, namely, amino acid (AA) composition, position-specific scoring matrix (PSSM), AA factor, and disorder were used to encode residues in protein segments. Then, a two-step feature selection procedure including maximum relevance minimum redundancy and incremental feature selection, together with random forest algorithm, was performed on the constructed hybrid feature vector.

Results: An optimal classifier was built from the optimal feature subset, which featured an F1-measure of 0.356. Feature analysis was performed on several selected important features.

Conclusion: Results showed that certain types of PSSM and disorder features may be closely associated with malonylation of lysine residues. Our study contributes to the development of computational approaches for predicting malonyllysine and provides insights into molecular mechanism of malonylation.

Download full-text PDF

Source
http://dx.doi.org/10.2174/1386207322666181227144318DOI Listing

Publication Analysis

Top Keywords

malonylation sites
12
computational method
8
random forest
8
forest algorithm
8
machine learning
8
learning algorithms
8
data sizes
8
feature selection
8
malonylation
6
feature
5

Similar Publications

Comprehensive analysis of protein post-translational modifications reveals PTPN2-STAT1-AOX axis-mediated tumor progression in hepatocellular carcinomas.

Transl Oncol

January 2025

Zhejiang Provincial Key Laboratory of Pancreatic Disease, the First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China; Department of Hepatobiliary and Pancreatic Surgery, the First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China; Clinical Research Center of Hepatobiliary and Pancreatic Diseases, Zhejiang Province, China; Zhejiang University Cancer Center, Hangzhou, China. Electronic address:

Hepatocellular carcinoma (HCC) is a common malignant tumor. Although the proteomics of HCC is well studied, the landscape of post-translational modifications (PTMs) in HCC is poorly understood. The PTMs themselves and their crosstalk might be deeply involved in HCC development and progression.

View Article and Find Full Text PDF

In recent years, it was found that lysine malonylation modification can affect biological metabolism and play an important role in plant life activities. Platycodon grandiflorus, an economic crop and medicinal plant, had no reports on malonylation in the related literature. This study qualitatively introduces lysine malonylation in P.

View Article and Find Full Text PDF

Background: Posttranslational modifications (PTMs) play critical roles in hepatocellular carcinoma (HCC). However, the locations of PTM-modified sites across protein secondary structures and regulatory patterns in HCC remain largely uncharacterized.

Methods: Total proteome and nine PTMs (phosphorylation, acetylation, crotonylation, ubiquitination, lactylation, N-glycosylation, succinylation, malonylation, and β-hydroxybutyrylation) in tumor sections and paired normal adjacent tissues derived from 18 HCC patients were systematically profiled by 4D-Label free proteomics analysis combined with PTM-based peptide enrichment.

View Article and Find Full Text PDF
Article Synopsis
  • Protein acetylation is a widely studied post-translational modification, and recent research has identified three new forms: lysine malonylation, succinylation, and glutarylation, which mainly affect energy metabolism in diseases caused by Mycobacterium pathogens.
  • Methods involved using high-affinity antibody enrichment and LC-MS/MS analysis to characterize these new lysine modifications and assess their functional impacts in certain proteins.
  • Results showed significant global substrate characterization for these acylations, revealing connections to ribosomal function and various metabolic pathways, highlighting their importance in cellular processes.
View Article and Find Full Text PDF

Global Insights into the Lysine Acetylome Reveal the Role of Lysine Acetylation in the Adaptation of to Salt Stress.

J Proteome Res

January 2025

College of Life Sciences and Shandong Engineering Research Center of Plant-Microbial Restoration for Saline-Alkali Land, Shandong Agricultural University, Tai'an 271018, China.

Article Synopsis
  • A certain beneficial microorganism in the rhizosphere enhances plant growth and tolerance to salt stress, but the specific mechanisms at the post-translational modification level were not well understood.
  • Researchers identified various lysine modifications, specifically noting that acetylation levels increased significantly in response to salt stress, affecting a large number of proteins involved in vital processes related to salt tolerance.
  • The study also highlighted how specific acetylation sites on a protective enzyme, thiol peroxidase, play a crucial role in managing reactive oxygen species, thus influencing bacterial growth under salt conditions.
View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!