PreTKcat: A pre-trained representation learning and machine learning framework for predicting enzyme turnover number.

Comput Biol Chem

College of Artificial Intelligence, Tianjin University of Science and Technology, No. 9, 13th Street, Tianjin Economic-Technological Development Area, Tianjin, 300457, China. Electronic address:

Published: January 2025

The enzyme turnover number (k) is crucial for understanding enzyme kinetics and optimizing biotechnological processes. However, experimentally measured k values are limited due to the high cost and labor intensity of wet-lab measurements, necessitating robust computational methods. To address this issue, we propose PreTKcat, a framework that integrates pre-trained representation learning and machine learning to predict k values. PreTKcat utilizes the ProtT5 protein language model to encode enzyme sequences and the MolGNet molecular representation learning model to encode substrate molecular graphs. By integrating these representations, the ExtraTrees model is employed to predict k values. Additionally, PreTKcat accounts for the impact of temperature on k prediction. In addition, PreTKcat can also be used to predict enzyme-substrate affinity, i.e. km values. Comparative assessments with various state-of-the-art models highlight the superior performance of PreTKcat. PreTKcat serves as an effective tool for investigating enzyme kinetics, offering new perspectives for enzyme engineering and its industrial uses.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.compbiolchem.2024.108327DOI Listing

Publication Analysis

Top Keywords

representation learning
12
pre-trained representation
8
learning machine
8
machine learning
8
enzyme turnover
8
turnover number
8
enzyme kinetics
8
predict values
8
model encode
8
pretkcat
7

Similar Publications

FP-YOLOv8: Surface Defect Detection Algorithm for Brake Pipe Ends Based on Improved YOLOv8n.

Sensors (Basel)

December 2024

School of Mechanical and Power Engineering, Zhengzhou University, Zhengzhou 450000, China.

To address the limitations of existing deep learning-based algorithms in detecting surface defects on brake pipe ends, a novel lightweight detection algorithm, FP-YOLOv8, is proposed. This algorithm is developed based on the YOLOv8n framework with the aim of improving accuracy and model lightweight design. First, the C2f_GhostV2 module has been designed to replace the original C2f module.

View Article and Find Full Text PDF

Arrhythmia Detection by Data Fusion of ECG Scalograms and Phasograms.

Sensors (Basel)

December 2024

Department of Information Engineering, Electronics and Telecommunications (DIET), Sapienza University of Rome, Via Eudossiana 18, 00184 Rome, Italy.

The automatic detection of arrhythmia is of primary importance due to the huge number of victims caused worldwide by cardiovascular diseases. To this aim, several deep learning approaches have been recently proposed to automatically classify heartbeats in a small number of classes. Most of these approaches use convolutional neural networks (CNNs), exploiting some bi-dimensional representation of the ECG signal, such as spectrograms, scalograms, or similar.

View Article and Find Full Text PDF

In this study, we propose a novel framework for time-series representation learning that integrates a learnable masking-augmentation strategy into a contrastive learning framework. Time-series data pose challenges due to their temporal dependencies and feature-extraction complexities. To address these challenges, we introduce a masking-based reconstruction approach within a contrastive learning context, aiming to enhance the model's ability to learn discriminative temporal features.

View Article and Find Full Text PDF

In order to achieve infrared aircraft detection under interference conditions, this paper proposes an infrared aircraft detection algorithm based on high-resolution feature-enhanced semantic segmentation network. Firstly, the designed location attention mechanism is utilized to enhance the current-level feature map by obtaining correlation weights between pixels at different positions. Then, it is fused with the high-level feature map rich in semantic features to construct a location attention feature fusion network, thereby enhancing the representation capability of target features.

View Article and Find Full Text PDF

NeCA: 3D Coronary Artery Tree Reconstruction from Two 2D Projections via Neural Implicit Representation.

Bioengineering (Basel)

December 2024

Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford OX3 7DQ, UK.

Cardiovascular diseases (CVDs) are the most common health threats worldwide. 2D X-ray invasive coronary angiography (ICA) remains the most widely adopted imaging modality for CVD assessment during real-time cardiac interventions. However, it is often difficult for the cardiologists to interpret the 3D geometry of coronary vessels based on 2D planes.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!