Comparative Study of Classification Algorithms for Various DNA Microarray Data.

Genes (Basel)

School of Software, Kwangwoon University, 20 Kwangwoon-ro, Nowon-gu, Seoul 01897, Korea.

Published: March 2022

AI Article Synopsis

  • Microarrays are advanced tools that integrate electrical engineering with biology to measure gene expression and assess diseases simultaneously.
  • The study compares five machine learning algorithms—MultiLayer Perceptron (MLP), Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF), and k-Nearest Neighbors (KNN)—to evaluate their effectiveness in classifying microarray datasets.
  • Results showed that DT and RF often performed worse compared to MLP, SVM, and KNN, highlighting the importance of choosing the right algorithm based on specific data characteristics for optimal classification results.

Article Abstract

Microarrays are applications of electrical engineering and technology in biology that allow simultaneous measurement of expression of numerous genes, and they can be used to analyze specific diseases. This study undertakes classification analyses of various microarrays to compare the performances of classification algorithms over different data traits. The datasets were classified into test and control groups based on five utilized machine learning methods, including MultiLayer Perceptron (MLP), Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF), and -Nearest Neighbors (KNN), and the resulting accuracies were compared. -fold cross-validation was used in evaluating the performance and the result was analyzed by comparing the performances of the five machine learning methods. Through the experiments, it was observed that the two tree-based methods, DT and RF, showed similar trends in results and the remaining three methods, MLP, SVM, and DT, showed similar trends. DT and RF generally showed worse performance than other methods except for one dataset. This suggests that, for the effective classification of microarray data, selecting a classification algorithm that is suitable for data traits is crucial to ensure optimum performance.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8951024PMC
http://dx.doi.org/10.3390/genes13030494DOI Listing

Publication Analysis

Top Keywords

classification algorithms
8
microarray data
8
data traits
8
machine learning
8
learning methods
8
classification
5
methods
5
comparative study
4
study classification
4
algorithms dna
4

Similar Publications

Diabetic retinopathy, a retinal disorder resulting from diabetes mellitus, is a prominent cause of visual degradation and loss among the global population. Therefore, the identification and classification of diabetic retinopathy are of utmost importance in the clinical diagnosis and therapy. Currently, these duties are extensively carried out by manual examination utilizing the human visual system.

View Article and Find Full Text PDF

Sharing cooking recipes is a great way to exchange culinary ideas and provide instructions for food preparation. However, categorizing raw recipes found online into appropriate food genres can be challenging due to a lack of adequate labeled data. In this study, we present a dataset named the "Assorted, Archetypal, and Annotated Two Million Extended (3A2M+) Cooking Recipe Dataset" that contains two million culinary recipes labeled in respective categories with extended named entities extracted from recipe descriptions.

View Article and Find Full Text PDF

Lymphadenopathy is associated with lymph node abnormal size or consistency due to many causes. We employed the deep convolutional neural network ResNet-34 to detect and classify CT images from patients with abdominal lymphadenopathy and healthy controls. We created a single database containing 1400 source CT images for patients with abdominal lymphadenopathy (n = 700) and healthy controls (n = 700).

View Article and Find Full Text PDF

Phenotypic Classification of Multisystem Inflammatory Syndrome in Children Using Latent Class Analysis.

JAMA Netw Open

January 2025

Coronavirus and Other Respiratory Viruses Division, National Center for Immunization and Respiratory Diseases, Centers for Disease Control and Prevention, Atlanta, Georgia.

Importance: Multisystem inflammatory syndrome in children (MIS-C) is an uncommon but severe hyperinflammatory illness that occurs 2 to 6 weeks after SARS-CoV-2 infection. Presentation overlaps with other conditions, and risk factors for severity differ by patient. Characterizing patterns of MIS-C presentation can guide efforts to reduce misclassification, categorize phenotypes, and identify patients at risk for severe outcomes.

View Article and Find Full Text PDF

Purpose: This brief report aims to summarize and discuss the methodologies of eXplainable Artificial Intelligence (XAI) and their potential applications in surgery.

Methods: We briefly introduce explainability methods, including global and individual explanatory features, methods for imaging data and time series, as well as similarity classification, and unraveled rules and laws.

Results: Given the increasing interest in artificial intelligence within the surgical field, we emphasize the critical importance of transparency and interpretability in the outputs of applied models.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!