Owing to its superior performance, the Transformer model, based on the 'Encoder-Decoder' paradigm, has become the mainstream model in natural language processing. Bioinformatics has likewise embraced machine learning, which has led to remarkable progress in drug design and protein property prediction. Cell-penetrating peptides (CPPs) are a type of permeable protein that serves as a convenient 'postman' in drug penetration tasks. CPPs have led to a new approach that enables the uptake of only macromolecules into cells (i.e., without other potentially harmful materials found in the drug). However, only a few CPPs have been discovered, which limits their practical application in drug permeability. Most previous studies have used simple machine learning techniques and hand-crafted features to construct a basic classifier. CPPFormer was constructed by implementing the attention structure of the Transformer, rebuilding the network to suit the short length of CPPs, and combining an automatic feature extractor with a few manually engineered features to co-direct the predicted results. Compared with all previous methods and other classic text classification models, the empirical results show that our proposed deep model-based method achieves the best performance, with an accuracy of 92.16% on the CPP924 dataset, and passes various index tests.
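The idea of pairing attention-derived features with a few manual ones can be illustrated with a minimal, dependency-free sketch. It is not the CPPFormer architecture: the function names, the toy sinusoidal embedding, the unparameterized attention (Q = K = V, no learned weights), and the two manual features (length and Arg/Lys fraction) are all assumptions chosen for illustration.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues

def embed(peptide, dim=4):
    """Toy deterministic embedding per residue (hypothetical, not learned)."""
    vectors = []
    for pos, aa in enumerate(peptide):
        idx = AMINO_ACIDS.index(aa)  # raises ValueError on unknown residues
        vectors.append([math.sin((idx + 1) * (k + 1) + pos) for k in range(dim)])
    return vectors

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(x):
    """Scaled dot-product self-attention with Q = K = V = x (no projections)."""
    d = len(x[0])
    out = []
    for q in x:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in x]
        weights = softmax(scores)
        out.append([sum(w * v[j] for w, v in zip(weights, x)) for j in range(d)])
    return out

def handcrafted_features(peptide):
    """Two illustrative manual features: length and fraction of Arg/Lys."""
    cationic = sum(peptide.count(aa) for aa in "RK")
    return [float(len(peptide)), cationic / len(peptide)]

def peptide_features(peptide):
    """Mean-pool the attended vectors, then append the manual features."""
    attended = self_attention(embed(peptide))
    d = len(attended[0])
    pooled = [sum(row[j] for row in attended) / len(attended) for j in range(d)]
    return pooled + handcrafted_features(peptide)

if __name__ == "__main__":
    # An arginine-rich example sequence, used only as sample input.
    print(peptide_features("GRKKRRQRRRPPQ"))
```

In a trained model the embedding and attention projections would be learned, and the combined feature vector would be passed to a classifier; here it only shows how the two feature sources can be concatenated for a short, non-empty peptide.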

Source
http://dx.doi.org/10.2174/0929867328666210920103140

Similar Publications

Background: Early detection and diagnosis of cancer are vital to improving outcomes for patients. Artificial intelligence (AI) models have shown promise in the early detection and diagnosis of cancer, but there is limited evidence on methods that fully exploit the longitudinal data stored within electronic health records (EHRs). This review aims to summarise methods currently utilised for prediction of cancer from longitudinal data and provides recommendations on how such models should be developed.

Dilated SE-DenseNet for brain tumor MRI classification.

Sci Rep

January 2025

Department of Applied Mathematics, University of Waterloo, Waterloo, ON, N2L 3G1, Canada.

In the field of medical imaging, particularly MRI-based brain tumor classification, we propose an advanced convolutional neural network (CNN) leveraging the DenseNet-121 architecture, enhanced with dilated convolutional layers and Squeeze-and-Excitation (SE) networks' attention mechanisms. This novel approach aims to improve upon state-of-the-art methods of tumor identification. Our model, trained and evaluated on a comprehensive Kaggle brain tumor dataset, demonstrated superior performance over established convolution-based and transformer-based models: ResNet-101, VGG-19, original DenseNet-121, MobileNet-V2, ViT-L/16, and Swin-B across key metrics: F1-score, accuracy, precision, and recall.

In agriculture, promptly and accurately identifying leaf diseases is crucial for sustainable crop production. To address this requirement, this research introduces a hybrid deep learning model that combines features from the Visual Geometry Group 19 (VGG19) architecture with transformer encoder blocks. This fusion enables accurate and precise real-time classification of leaf diseases affecting grape, bell pepper, and tomato plants.

Background And Objective: Serum protein electrophoresis (SPEP) plays a critical role in diagnosing diseases associated with M-proteins. However, its clinical application is limited by a heavy reliance on experienced experts.

Methods: A dataset comprising 85,026 SPEP outcomes was utilized to develop artificial intelligence diagnostic models for the classification and localization of M-proteins.

Multiple token rearrangement Transformer network with explicit superpixel constraint for segmentation of echocardiography.

Med Image Anal

January 2025

General Hospital of the Southern Theatre Command, PLA, Guangzhou, China; The First School of Clinical Medicine, Southern Medical University, Guangzhou, China. Electronic address:

Diagnostic cardiologists have considerable clinical demand for precise segmentation of echocardiography to diagnose cardiovascular disease. The paradox is that manual segmentation of echocardiography is a time-consuming and operator-dependent task. Computer-aided segmentation can reduce the workflow greatly.
