Predicting the protein sequence information of enzymes and non-enzymes is an important but a very challenging task. Existing methods use protein geometric structures only or protein sequences alone to predict enzymatic functions. Thus, their prediction results are unsatisfactory. In this paper, we propose a novel approach for predicting the amino acid sequences of enzymes and non-enzymes via Convolutional Neural Network (CNN). In CNN, the roles of enzymes are predicted from multiple sides of biological information, including information on sequences and structures. We propose the use of two-dimensional data via 2DCNN to predict the proteins of enzymes and non-enzymes by using the same fivefold cross-validation function. We also use an independent dataset to test the performance of our model, and the results demonstrate that we are able to solve the overfitting problem. We used the CNN model proposed herein to demonstrate the superiority of our model for classifying an entire set of filters, such as 32, 64, and 128 parameters, with the fivefold validation test set as the independent classification. Via the Dipeptide Deviation from Expected Mean (DDE) matrix, mutation information is extracted from amino acid sequences and structural information with the distance and angle of amino acids is conveyed. The derived feature maps are then encoded in DDE exploitation The independent datasets are then compared with other two methods, namely, GRU and XGBOOST. All analyses were conducted using 32, 64 and 128 filters on our proposed CNN method. The cross-validation datasets achieved an accuracy score of 0.8762%, whereas the accuracy of independent datasets was 0.7621%. Additional variables were derived on the basis of ROC AUC with fivefold cross-validation was achieved score is 0.95%. The performance of our model and that of other models in terms of sensitivity (0.9028%) and specificity (0.8497%) was compared. The overall accuracy of our model was 0.9133% compared with 0.8310% for the other model.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8670239PMC
http://dx.doi.org/10.3389/fgene.2021.759384DOI Listing

Publication Analysis

Top Keywords

enzymes non-enzymes
12
convolutional neural
8
neural network
8
amino acid
8
acid sequences
8
fivefold cross-validation
8
performance model
8
independent datasets
8
model
6
identification enzymes-specific
4

Similar Publications

Red blood cells (RBC), are the most unique and abundant cell types. The diameter of RBCs is 7-8 μm. They have an essential role in transporting circulatory oxygen.

View Article and Find Full Text PDF

: Dapsone (DP) is employed in the management of various skin conditions, including autoimmune bullous diseases to non-enzymes (n-eAIBDs). This study aimed to assess the advantages and safety profile of DP treatment in n-eAIBDs patients. The evaluation focused on clinical remission, reduction in glucocorticosteroid (GCS) usage, and adverse incidents during a 12-month observation in a dermatology department at a Central European university.

View Article and Find Full Text PDF

Based on their chemical structure and catalytic features, carbon dots (CDs) demonstrate great advantages for agricultural systems. The improvements in growth, photosynthesis, nutrient assimilation and resistance are provided by CDs treatments under control or adverse conditions. However, there is no data on how CDs can enhance the tolerance against chromium toxicity on gas exchange, photosynthetic machinery and ROS-based membrane functionality.

View Article and Find Full Text PDF

Seed priming is an effective and novel technique and the use of eco-friendly biological agents improves the physiological functioning in the vegetative stage of plants. This procedure ensures productivity and acquired stress resilience in plants against adverse conditions without contaminating the environment. Though the mechanisms of bio-priming-triggered alterations have been widely explained under induvial stress conditions, the interaction of combined stress conditions on the defense system and the functionality of photosynthetic apparatus in the vegetative stage after the inoculation to seeds has not been fully elucidated.

View Article and Find Full Text PDF

During the fermentation of soy sauce, the metabolism of microorganisms and the Maillard reaction produce a wide variety of metabolites that contribute to the unique and rich flavor characteristics of soy sauce, such as amino acids, organic acids and peptides. Amino acid derivatives, a relatively new taste compounds, formed by the reaction of enzymes or non-enzymes from sugars, amino acids, and organic acids released through metabolism by microorganisms during soy sauce fermentation, have begun to gain more and more attention in recent years. This review focused on our existing knowledge of the sources, taste characteristics and synthesis methods of the 6 categories of amino acid derivatives, including Amadori compounds, γ-glutamyl peptides, pyroglutamyl amino acids, N-lactoyl amino acids, N-acetyl amino acids and N-succinyl amino acids.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!