The number of protein 3D structures without function annotation in Protein Data Bank (PDB) has been steadily increased. This fact has led in turn to an increment of demand for theoretical models to give a quick characterization of these proteins. In this work, we present a new and fast Markov chain model (MCM) to predict the enzyme classification (EC) number. We used both linear discriminant analysis (LDA) and/or artificial neural networks (ANN) in order to compare linear vs. non-linear classifiers. The LDA model found is very simple (three variables) and at the same time is able to predict the first EC number with an overall accuracy of 79% for a data set of 4755 proteins (859 enzymes and 3896 non-enzymes) divided into both training and external validation series. In addition, the best non-linear ANN model is notably more complex but has an overall accuracy of 98.85%. It is important to emphasize that this method may help us to predict not only new enzyme proteins but also to select peptide candidates found on the peptide mass fingerprints (PMFs) of new proteins that may improve enzyme activity. In order to illustrate the use of the model in this regard, we first report the 2D electrophoresis (2DE) and MADLI-TOF mass spectra characterization of the PMF of a new possible malate dehydrogenase sequence from Leishmania infantum. Next, we used the models to predict the contribution to a specific enzyme action of 30 peptides found in the PMF of the new protein. We implemented the present model in a server at portal Bio-AIMS (http://miaja.tic.udc.es/Bio-AIMS/EnzClassPred.php). This free on-line tool is based on PHP/HTML/Python and MARCH-INSIDE routines. This combined strategy may be used to identify and predict peptides of prokaryote and eukaryote parasites and their hosts as well as other superior organisms, which may be of interest in drug development or target identification.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.bbapap.2009.08.020DOI Listing

Publication Analysis

Top Keywords

predict enzyme
8
enzyme
5
model
5
predict
5
entropy moments
4
moments prediction
4
prediction enzyme
4
enzyme classes
4
classes experimental-theoretic
4
experimental-theoretic study
4

Similar Publications

Modeling predicts facile release of nitrite but not nitric oxide from the thionitrate CHSNO with relevance to nitroglycerin bioactivation.

Sci Rep

December 2024

Department of Chemistry and Biochemistry, Centre for Research in Molecular Modeling (CERMM), Concordia University, 7141 Sherbrooke Street West, Montréal, QC, H4B 1R6, Canada.

Nitroglycerin is a potent vasodilator in clinical use since the late 1800s. It functions as a prodrug that is bioactivated by formation of an enzyme-based thionitrate, E-Cys-NO. This intermediate reportedly decomposes to release NO and NO but their relative yields remain controversial.

View Article and Find Full Text PDF

Telomere attrition is a hallmark of biological aging, contributing to cellular replicative senescence. However, few studies have examined the determinants of telomere attrition in vivo in humans. Mitochondrial Health Index (MHI), a composite marker integrating mitochondrial energy-transformation capacity and content, may be one important mediator of telomere attrition, as it could impact telomerase activity, a direct regulator of telomere maintenance.

View Article and Find Full Text PDF

Hepatitis C virus (HCV) presents a significant global health issue due to its widespread prevalence and the absence of a reliable vaccine for prevention. While significant progress has been achieved in therapeutic interventions since the disease was first identified, its resurgence underscores the need for innovative strategies to combat it. The nonstructural protein NS5A is crucial in the life cycle of the HCV, serving as a significant factor in both viral replication and assembly processes.

View Article and Find Full Text PDF

Titin is the third contractile filament in the sarcomere, and it plays a critical role in sarcomere integrity and both passive and active tension. Unlike the thick and thin filaments, which are polymers of myosin and actin, respectively, titin is a single protein that spans from Z-disk to M-line. The N2A region within titin has been identified as a signaling hub for the muscle and is shown to be involved in multiple interactions.

View Article and Find Full Text PDF

Temperate forests cover 25% of the world's forest area and most of them are managed for timber production. To increase yields, native deciduous trees have been commonly replaced by fast-growing conifers, especially in Western and Central Europe. Despite the importance of forest soils for a variety of ecosystem functions, the effects of forest management intensity on soil biological processes have not yet been sufficiently understood.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!