Efficient prediction of anticancer peptides through deep learning.

Abdu Salam Faizan Ullah Farhan Amin Izaz Ahmad Khan Eduardo Garcia Villena Angel Kuc Castilla Isabel de la Torre

PeerJ Comput Sci

University of Valladolid, Valladolid, Spain.

Published: July 2024

Background: Cancer remains one of the leading causes of mortality globally, with conventional chemotherapy often resulting in severe side effects and limited effectiveness. Recent advancements in bioinformatics and machine learning, particularly deep learning, offer promising new avenues for cancer treatment through the prediction and identification of anticancer peptides.

Objective: This study aimed to develop and evaluate a deep learning model utilizing a two-dimensional convolutional neural network (2D CNN) to enhance the prediction accuracy of anticancer peptides, addressing the complexities and limitations of current prediction methods.

Methods: A diverse dataset of peptide sequences with annotated anticancer activity labels was compiled from various public databases and experimental studies. The sequences were preprocessed and encoded using one-hot encoding and additional physicochemical properties. The 2D CNN model was trained and optimized using this dataset, with performance evaluated through metrics such as accuracy, precision, recall, F1-score, and area under the receiver operating characteristic curve (AUC-ROC).

Results: The proposed 2D CNN model achieved superior performance compared to existing methods, with an accuracy of 0.87, precision of 0.85, recall of 0.89, F1-score of 0.87, and an AUC-ROC value of 0.91. These results indicate the model's effectiveness in accurately predicting anticancer peptides and capturing intricate spatial patterns within peptide sequences.

Conclusion: The findings demonstrate the potential of deep learning, specifically 2D CNNs, in advancing the prediction of anticancer peptides. The proposed model significantly improves prediction accuracy, offering a valuable tool for identifying effective peptide candidates for cancer treatment.

Future Work: Further research should focus on expanding the dataset, exploring alternative deep learning architectures, and validating the model's predictions through experimental studies. Efforts should also aim at optimizing computational efficiency and translating these predictions into clinical applications.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11323142	PMC
http://dx.doi.org/10.7717/peerj-cs.2171	DOI Listing

Publication Analysis

Top Keywords

deep learning

anticancer peptides

prediction anticancer

prediction accuracy

experimental studies

cnn model

anticancer

learning

deep

prediction

Similar Publications

MMFuncPhos: A Multi-Modal Learning Framework for Identifying Functional Phosphorylation Sites and Their Regulatory Types.

Adv Sci (Weinh)

January 2025

Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, 100871, China.

Juan Xie Ruihan Dong Jintao Zhu Haoyu Lin Shiwei Wang

Protein phosphorylation plays a crucial role in regulating a wide range of biological processes, and its dysregulation is strongly linked to various diseases. While many phosphorylation sites have been identified so far, their functionality and regulatory effects are largely unknown. Here, a deep learning model MMFuncPhos, based on a multi-modal deep learning framework, is developed to predict functional phosphorylation sites.

View Article and Find Full Text PDF

Similar Publications

Unveiling patterns in spatial transcriptomics data: a novel approach utilizing graph attention autoencoder and multiscale deep subspace clustering network.

Gigascience

January 2025

School of Computer Science, Hunan University of Technology, Zhuzhou 412007, Hunan, China.

Liqian Zhou Xinhuai Peng Min Chen Xianzhi He Geng Tian

Background: The accurate deciphering of spatial domains, along with the identification of differentially expressed genes and the inference of cellular trajectory based on spatial transcriptomic (ST) data, holds significant potential for enhancing our understanding of tissue organization and biological functions. However, most of spatial clustering methods can neither decipher complex structures in ST data nor entirely employ features embedded in different layers.

Results: This article introduces STMSGAL, a novel framework for analyzing ST data by incorporating graph attention autoencoder and multiscale deep subspace clustering.

View Article and Find Full Text PDF

Similar Publications

Clinical Decision Support Using Speech Signal Analysis: Systematic Scoping Review of Neurological Disorders.

J Med Internet Res

January 2025

Knight Foundation of Computing & Information Sciences, Florida International University, Miami, FL, United States.

Upeka De Silva Samaneh Madanian Sharon Olsen John Michael Templeton Christian Poellabauer

Background: Digital biomarkers are increasingly used in clinical decision support for various health conditions. Speech features as digital biomarkers can offer insights into underlying physiological processes due to the complexity of speech production. This process involves respiration, phonation, articulation, and resonance, all of which rely on specific motor systems for the preparation and execution of speech.

View Article and Find Full Text PDF

Similar Publications

PHIStruct: Improving phage-host interaction prediction at low sequence similarity settings using structure-aware protein embeddings.

Bioinformatics

January 2025

Bioinformatics Lab, Advanced Research Institute for Informatics, Computing and Networking, De La Salle University, Manila, 1004, Philippines.

Mark Edward M Gonzales Jennifer C Ureta Anish M S Shrestha

Motivation: Recent computational approaches for predicting phage-host interaction have explored the use of sequence-only protein language models to produce embeddings of phage proteins without manual feature engineering. However, these embeddings do not directly capture protein structure information and structure-informed signals related to host specificity.

Results: We present PHIStruct, a multilayer perceptron that takes in structure-aware embeddings of receptor-binding proteins, generated via the structure-aware protein language model SaProt, and then predicts the host from among the ESKAPEE genera.

View Article and Find Full Text PDF

Similar Publications

EnrichRBP: an automated and interpretable computational platform for predicting and analyzing RNA-binding protein events.

Bioinformatics

January 2025

School of Artificial Intelligence, Jilin University, Jilin, China.

Yubo Wang Haoran Zhu Yansong Wang Yuning Yang Yujian Huang

Motivation: Predicting RNA-binding proteins (RBPs) is central to understanding post-transcriptional regulatory mechanisms. Here, we introduce EnrichRBP, an automated and interpretable computational platform specifically designed for the comprehensive analysis of RBP interactions with RNA.

Results: EnrichRBP is a web service that enables researchers to develop original deep learning and machine learning architectures to explore the complex dynamics of RNA-binding proteins.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!