Motifs tree: a new method for predicting post-translational modifications.

Bioinformatics

Department of Computer Science, University of Geneva, 1227 Carouge and Swiss Institute of Bioinformatics, Centre Médical Universitaire, Geneva 4, SwitzerlandDepartment of Computer Science, University of Geneva, 1227 Carouge and Swiss Institute of Bioinformatics, Centre Médical Universitaire, Geneva 4, Switzerland.

Published: July 2014

Motivation: Post-translational modifications (PTMs) are important steps in the maturation of proteins. Several models exist to predict specific PTMs, from manually detected patterns to machine learning methods. On one hand, the manual detection of patterns does not provide the most efficient classifiers and requires an important workload, and on the other hand, models built by machine learning methods are hard to interpret and do not increase biological knowledge. Therefore, we developed a novel method based on patterns discovery and decision trees to predict PTMs. The proposed algorithm builds a decision tree, by coupling the C4.5 algorithm with genetic algorithms, producing high-performance white box classifiers. Our method was tested on the initiator methionine cleavage (IMC) and N(α)-terminal acetylation (N-Ac), two of the most common PTMs.

Results: The resulting classifiers perform well when compared with existing models. On a set of eukaryotic proteins, they display a cross-validated Matthews correlation coefficient of 0.83 (IMC) and 0.65 (N-Ac). When used to predict potential substrates of N-terminal acetyltransferaseB and N-terminal acetyltransferaseC, our classifiers display better performance than the state of the art. Moreover, we present an analysis of the model predicting IMC for Homo sapiens proteins and demonstrate that we are able to extract experimentally known facts without prior knowledge. Those results validate the fact that our method produces white box models.

Availability And Implementation: Predictors for IMC and N-Ac and all datasets are freely available at http://terminus.unige.ch/.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/btu165DOI Listing

Publication Analysis

Top Keywords

post-translational modifications
8
machine learning
8
learning methods
8
white box
8
motifs tree
4
method
4
tree method
4
method predicting
4
predicting post-translational
4
modifications motivation
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!