Protein sumoylation is one of the most important post-translational modifications. Accurate prediction of sumoylation sites is very useful for the analysis of proteome. Though the putative motif Psi K XE can be used, optimization of prediction models still remains a challenge. In this study, we developed a prediction system based on feature selection strategy. A total of 1,272 peptides with 14 residues from SUMOsp (Xue et al. [8] Nucleic Acids Res 34:W254-W257, 2006) were investigated in this study, including 212 substrates and 1,060 non-substrates. Among the substrates, only 162 substrates comply to the motif Psi K XE. First, 1,272 substrates were divided into training set and test set. All the substrates were encoded into feature vectors by hundreds of amino acid properties collected by Amino Acid Index Database (AAIndex, http://www.genome.jp/aaindex ). Then, mRMR (minimum redundancy-maximum relevance) method was applied to extract the most informative features. Finally, Nearest Neighbor Algorithm (NNA) was used to produce the prediction models. Tested by Leave-one-out (LOO) cross-validation, the optimal prediction model reaches the accuracy of 84.4% for the training set and 76.4% for the test set. Especially, 180 substrates were correctly predicted, which was 18 more than using the motif Psi K XE. The final selected features indicate that amino acid residues with two-residue downstream and one-residue upstream of the sumoylation sites play the most important role in determining the occurrence of sumoylation. Based on the feature selection strategy, our prediction system can not only be used for high throughput prediction of sumoylation sites but also as a tool to investigate the mechanism of sumoylation.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1007/s11030-009-9149-5 | DOI Listing |
Cell Mol Life Sci
December 2024
Institute of Biochemistry, Justus-Liebig-University Giessen, Friedrichstrasse 24, 35392, Giessen, Germany.
Post-translational modifications (PTMs) are implicated in many biological processes including receptor activation, signal transduction, transcriptional regulation and protein turnover. Lysine's side chain is particularly notable, as it can undergo methylation, acetylation, SUMOylation and ubiquitination. Methylation affects not only lysine but also arginine residues, both of which are implicated in epigenetic regulation.
View Article and Find Full Text PDFInt J Mol Sci
November 2024
Division of Neurobiology, Department of Psychiatry, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA.
Epigenomes
November 2024
KNU G-LAMP Project Group, KNU Institute of Basic Sciences, Kyungpook National University, Daegu 41566, Republic of Korea.
Cryptic transcription refers to the unintended expression of non-canonical sites within the genome, producing aberrant RNA and proteins that may disrupt cellular functions. In this opinion piece, I will explore the role of histone modifications in modulating cryptic transcription and its implications for gene expression and cellular integrity, particularly with a focus on H3K36 and H3K4 methylation marks. H3K36 tri-methylation plays a crucial role in maintaining chromatin integrity by facilitating the recruitment of the Rpd3S histone deacetylase (HDAC) complex, which helps restore closed chromatin states following transcription and prevents cryptic initiation within gene bodies.
View Article and Find Full Text PDFJ Am Soc Mass Spectrom
December 2024
Jiangsu Provincial Key Laboratory of Drug Metabolism and Pharmacokinetics, State Key Laboratory of Natural Medicines, China Pharmaceutical University, Tongjiaxiang No. 24, Nanjing 210009, Jiangsu, China.
Protein lactylation is a novel post-translational modification (PTM) involved in many important physiological processes such as macrophage polarization, immune regulation, and tumor cell growth. However, traditional methodologies for studying lactylation have predominantly relied on peptide enrichment from whole-cell lysates, which tend to favor the detection of high-abundance peptides, thus limiting the identification of low-abundance lactylated peptides. To address this limitation, here, we employed subcellular fractionation to separate proteins and map lactylated peptides from each isolated subcellular fraction using a model cell line.
View Article and Find Full Text PDFJ Biol Chem
November 2024
Department of Biochemistry and Molecular Biology, University of Maryland School of Medicine, Baltimore, Maryland, USA; Molecular and Structural Biology Program, University of Maryland Marlene and Stewart Greenebaum Comprehensive Cancer Center, Baltimore, Maryland, USA. Electronic address:
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!