AI Article Synopsis

Article Abstract

We are moving into the age of 'Big Data' in biomedical research and bioinformatics. This trend could be encapsulated in this simple formula: D = S * F, where the volume of data generated (D) increases in both dimensions: the number of samples (S) and the number of sample features (F). Frequently, a typical omics classification includes redundant and irrelevant features (e.g. genes or proteins) that can result in long computation times; decrease of the model performance and the selection of suboptimal features (genes and proteins) after the classification/regression step. Multiple algorithms and reviews has been published to describe all the existing methods for feature selection, their strengths and weakness. However, the selection of the correct FS algorithm and strategy constitutes an enormous challenge. Despite the number and diversity of algorithms available, the proper choice of an approach for facing a specific problem often falls in a 'grey zone'. In this study, we select a subset of FS methods to develop an efficient workflow and an R package for bioinformatics machine learning problems. We cover relevant issues concerning FS, ranging from domain's problems to algorithm solutions and computational tools. Finally, we use seven different proteomics and gene expression datasets to evaluate the workflow and guide the FS process.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5738110PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0189875PLOS

Publication Analysis

Top Keywords

feature selection
8
features genes
8
genes proteins
8
accurate fast
4
fast feature
4
selection
4
selection workflow
4
workflow high-dimensional
4
high-dimensional omics
4
omics data
4

Similar Publications

Background: Rex rabbit is famous for its silky and soft fur coat, a characteristic predominantly attributed to its hair follicles. Numerous studies have confirmed the crucial roles of mRNAs and non-coding RNAs (ncRNAs) in regulating key cellular processes such as cell proliferation, differentiation, apoptosis and immunity. However, their involvement in the regulation of the hair cycle in Rex rabbits remains unknown.

View Article and Find Full Text PDF

Optical techniques, such as functional near-infrared spectroscopy (fNIRS), contain high potential for the development of non-invasive wearable systems for evaluating cerebral vascular condition in aging, due to their portability and ability to monitor real-time changes in cerebral hemodynamics. In this study, thirty-six healthy adults were measured by single channel fNIRS to explore differences between two age groups using machine learning (ML). The subjects, measured during functional magnetic resonance imaging (fMRI) at Oulu University Hospital, were divided into young (age ≤ 32) and elderly (age ≥ 57) groups.

View Article and Find Full Text PDF

Mechanical ventilation is the process through which breathing support is provided to patients who face inconvenience during respiration. During the pandemic, many people were suffering from lung disorders, which elevated the demand for mechanical ventilators. The handling of mechanical ventilators is to be done under the assistance of trained professionals and demands the selection of ideal parameters.

View Article and Find Full Text PDF

Diabetes is a growing health concern in developing countries, causing considerable mortality rates. While machine learning (ML) approaches have been widely used to improve early detection and treatment, several studies have shown low classification accuracies due to overfitting, underfitting, and data noise. This research employs parallel and sequential ensemble ML approaches paired with feature selection techniques to boost classification accuracy.

View Article and Find Full Text PDF

To establish a multivariate linear regression model for predicting the difficulty of high-intensity focused ultrasound (HIFU) ablation of uterine fibroids based on multi-sequence magnetic resonance imaging radiomics features. A retrospective analysis was conducted on 218 patients with uterine fibroids who underwent HIFU treatment, including 178 cases from Yongchuan Hospital of Chongqing Medical University and 40 cases from the Second Affiliated Hospital of Chongqing Medical University (external validation set). Radiomics features were extracted and selected from magnetic resonance images, and potentially related imaging features were collected.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!