Near perfect protein multi-label classification with deep neural networks.

Methods

PIT Bioinformatics Group, Eötvös University, H-1117 Budapest, Hungary; Uratim Ltd., H-1118 Budapest, Hungary. Electronic address:

Published: January 2018

Biological sequences can be considered as data items of high-, non-fixed dimensions, corresponding to the length of those sequences. The comparison and the classification of biological sequences in their relations to large databases are important areas of research today. Artificial neural networks (ANNs) have gained a well-deserved popularity among machine learning tools upon their recent successful applications in image- and sound processing and classification problems. ANNs have also been applied for predicting the family or function of a protein, knowing its residue sequence. Here we present two new ANNs with multi-label classification ability, showing impressive accuracy when classifying protein sequences into 698 UniProt families (AUC=99.99%) and 983 Gene Ontology classes (AUC=99.45%).

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.ymeth.2017.06.034DOI Listing

Publication Analysis

Top Keywords

multi-label classification
8
neural networks
8
biological sequences
8
perfect protein
4
protein multi-label
4
classification
4
classification deep
4
deep neural
4
networks biological
4
sequences
4

Similar Publications

Multi-label zero-shot learning (ML-ZSL) strives to recognize all objects in an image, regardless of whether they are present in the training data. Recent methods incorporate an attention mechanism to locate labels in the image and generate class-specific semantic information. However, the attention mechanism built on visual features treats label embeddings equally in the prediction score, leading to severe semantic ambiguity.

View Article and Find Full Text PDF

Robust fault detection and classification in power transmission lines via ensemble machine learning models.

Sci Rep

January 2025

Department of Theoretical Electrical Engineering and Diagnostics of Electrical Equipment, Institute of Electrodynamics, National Academy of Sciences of Ukraine, Beresteyskiy, 56, Kyiv-57, Kyiv, 03680, Ukraine.

Transmission lines are vital for delivering electricity over long distances, yet they face reliability challenges due to faults that can disrupt power supply and pose safety risks. This research introduces a novel approach for fault detection and classification by analyzing voltage and current patterns across transmission line phases. Leveraging a comprehensive dataset of diverse fault scenarios, various machine learning algorithms-including Random Forest (RF), K-Nearest Neighbors (KNN), and Long Short-Term Memory (LSTM) networks-are evaluated.

View Article and Find Full Text PDF

Predicting distant metastatic sites of cancer using perturbed correlations of miRNAs with competing endogenous RNAs.

Comput Biol Chem

January 2025

Department of Computer Engineering, Inha University, 100 Inha-ro, Michuhol-gu, Incheon, 22212, Republic of Korea. Electronic address:

Cancer metastasis is the dissemination of tumor cells from the primary tumor site to other parts of the body via the lymph system or bloodstream. Metastasis is the leading cause of cancer associated death. Despite the significant advances in cancer research and treatment over the past decades, metastasis is not fully understood and difficult to predict in advance.

View Article and Find Full Text PDF

Abductive multi-instance multi-label learning for periodontal disease classification with prior domain knowledge.

Med Image Anal

January 2025

National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, 210023, China; School of Artificial Intelligence, Nanjing University, Nanjing, 210023, China.

Machine learning is widely used in dentistry nowadays, offering efficient solutions for diagnosing dental diseases, such as periodontitis and gingivitis. Most existing methods for diagnosing periodontal diseases follow a two-stage process. Initially, they detect and classify potential Regions of Interest (ROIs) and subsequently determine the labels of the whole images.

View Article and Find Full Text PDF

A Multi-Label Deep Learning Model for Detailed Classification of Alzheimer's Disease.

Actas Esp Psiquiatr

January 2025

Department of Clinical Laboratory, Ningbo No.2 Hospital, 315099 Ningbo, Zhejiang, China.

Background: Accurate diagnosis and classification of Alzheimer's disease (AD) are crucial for effective treatment and management. Traditional diagnostic models, largely based on binary classification systems, fail to adequately capture the complexities and variations across different stages and subtypes of AD, limiting their clinical utility.

Methods: We developed a deep learning model integrating a dot-product attention mechanism and an innovative labeling system to enhance the diagnosis and classification of AD subtypes and severity levels.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!