Many proteins reside in more than one subcellular location, and this multi-localization is closely related to their biological function. However, most existing methods can only handle single-location proteins. An automatic and reliable classifier for protein subcellular multi-localization is therefore needed. We propose a new ensemble classifier that combines the KNN (K-nearest neighbour) and SVM (support vector machine) algorithms to predict the subcellular localization of eukaryotic, Gram-negative bacterial and viral proteins. It is based on the general form of Chou's pseudo amino acid composition, i.e., GO (gene ontology) annotations, dipeptide composition and AmPseAAC (amphiphilic pseudo amino acid composition). The ensemble classifier was developed by fusing many basic individual classifiers through a voting system. The overall prediction accuracies obtained by the KNN-SVM ensemble classifier are 95.22%, 93.47% and 80.72% for the eukaryotic, Gram-negative bacterial and viral proteins, respectively. These accuracies are significantly higher than those of previous methods and indicate that our strategy better predicts the subcellular locations of multi-location proteins.
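
As an illustration of two ingredients named in the abstract, the sketch below computes a dipeptide composition vector and fuses multi-label predictions from several base classifiers by voting. It is a minimal sketch, not the authors' implementation: the abstract does not state the voting rule, so the majority threshold (`min_share`), the fallback to the top-voted labels, and the function names `dipeptide_composition` and `vote` are all assumptions made for illustration. The base classifiers (KNN, SVM) are represented only by their predicted label sets.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids and the 400 ordered dipeptides built from them.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
DIPEPTIDES = ["".join(pair) for pair in product(AMINO_ACIDS, repeat=2)]
DIPEPTIDE_SET = set(DIPEPTIDES)


def dipeptide_composition(sequence):
    """Return the 400-dimensional vector of dipeptide frequencies.

    Each component is the count of one ordered pair of adjacent residues
    divided by the number of valid adjacent pairs in the sequence.
    Pairs containing non-standard residues are skipped.
    """
    seq = sequence.upper()
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    counts = Counter(p for p in pairs if p in DIPEPTIDE_SET)
    total = sum(counts.values())
    if total == 0:
        return [0.0] * len(DIPEPTIDES)
    return [counts[d] / total for d in DIPEPTIDES]


def vote(predictions, min_share=0.5):
    """Fuse multi-label predictions from several base classifiers.

    predictions: list of sets of location labels, one set per classifier.
    A location is kept when more than `min_share` of the classifiers
    predicted it (assumed rule). If no location passes the threshold,
    the locations with the highest vote count are returned instead.
    """
    tally = Counter(label for labels in predictions for label in labels)
    if not tally:
        return set()
    threshold = min_share * len(predictions)
    chosen = {label for label, n in tally.items() if n > threshold}
    if not chosen:
        top = max(tally.values())
        chosen = {label for label, n in tally.items() if n == top}
    return chosen


if __name__ == "__main__":
    # Hypothetical outputs of three base classifiers for one protein.
    base_outputs = [{"nucleus", "cytoplasm"}, {"nucleus"}, {"nucleus", "cytoplasm"}]
    print(sorted(vote(base_outputs)))
    print(len(dipeptide_composition("MKTAYIAKQR")))
```

Returning a set of labels rather than a single label is what lets the fused prediction cover multi-location proteins; a stricter or looser `min_share` trades precision against recall on the extra locations.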

Source
http://dx.doi.org/10.2174/092986612799789369

Similar Publications

Background: Lewy body (LB) diseases can present with overlapping prodromal, cognitive, motor, autonomic or neuropsychiatric symptoms. Intuitively, greater symptom severity should correlate with greater pathological burden, but this has not been consistently shown. LB pathology does not translate to clinical expression in Incidental LB disease.

Evaluation of machine learning algorithms and computational structural validation of CYP2D6 in predicting the therapeutic response to tamoxifen in breast cancer.

Eur Rev Med Pharmacol Sci

December 2024

Department of Pharmacology & Therapeutics, College of Medicine and Health Sciences, Arabian Gulf University, Manama, Kingdom of Bahrain.

Objective: CYP2D6 plays a critical role in metabolizing tamoxifen into its active metabolite, endoxifen, which is crucial for its therapeutic effect in estrogen receptor-positive breast cancer. Single nucleotide polymorphisms (SNPs) in the CYP2D6 gene can affect enzyme activity and thus impact tamoxifen efficacy. This study aimed to use machine learning algorithms (MLAs) to identify significant predictors of Breast Cancer-Free Interval (BCFI) and to apply bioinformatics tools to investigate the structural and functional implications of CYP2D6 SNPs.

Enhancing the performance of SSVEP-based BCIs by combining task-related component analysis and deep neural network.

Sci Rep

January 2025

Department of Biomedical Engineering, School of Biomedical Engineering, Tsinghua University, Beijing, 100084, China.

Steady-State Visually Evoked Potential (SSVEP) signals can be decoded by either a traditional machine learning algorithm or a deep learning network. Combining the two methods is expected to enhance the performance of an SSVEP-based brain-computer interface (BCI) by exploiting their advantages. However, an efficient strategy for integrating the two methods has not yet been established.

This study proposes a dual-stage model for classifying Parkinson's disease severity through a detailed analysis of gait signals using force sensors and machine learning approaches. Parkinson's disease is the primary neurodegenerative disorder that results in a gradual reduction in motor function. Early detection and monitoring of disease progression are highly challenging due to the gradual progression of symptoms and the inadequacy of conventional methods in identifying subtle changes in mobility.

Magnetic resonance (MR) images are commonly used to diagnose prolapsed lumbar intervertebral disc (PLID). However, for a computer-aided diagnostic (CAD) system, distinguishing between pathological abnormalities of PLID in MR images is a challenging and intricate task. Here, we propose a comprehensive model for the automatic detection and cropping of regions of interest (ROI) from sagittal MR images using the YOLOv8 framework to solve this challenge.
