Biomarker discovery in inflammatory bowel diseases using network-based feature selection.

PLoS One

College of Information Sciences and Technology, Pennsylvania State University, University Park, PA, United States of America.

Published: March 2020

Reliable identification of inflammatory bowel disease (IBD) biomarkers from metagenomics data is a promising direction for developing non-invasive, cost-effective, and rapid clinical tests for early diagnosis of IBD. We present Network-Based Biomarker Discovery (NBBD), an integrative approach that combines network analysis methods for prioritizing potential biomarkers with machine learning techniques for assessing the discriminative power of the prioritized biomarkers. Using a large dataset of metagenomics biopsy samples from new-onset pediatric IBD patients, we compare the performance of Random Forest (RF) classifiers trained on features selected by three approaches: a representative set of traditional feature selection methods; the NBBD framework, configured with five different tools for inferring networks from metagenomics data and nine different methods for prioritizing biomarkers; and a hybrid approach that combines the best traditional and NBBD-based feature selection methods. We also examine how the performance of the predictive models for IBD diagnosis varies as a function of the size of the data used for biomarker identification. Our results show that (i) NBBD is competitive with some of the state-of-the-art feature selection methods, including Random Forest Feature Importance (RFFI) scores; and (ii) NBBD is especially effective in reliably identifying IBD biomarkers when the number of data samples available for biomarker discovery is small.
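The general idea of network-based feature selection can be illustrated with a minimal sketch. The abstract does not name the five network inference tools or the nine prioritization methods, so this example substitutes two simple stand-ins that are assumptions, not the authors' method: a thresholded Pearson correlation network and degree centrality as the node score. All function names, the threshold value, and the toy abundance table are hypothetical. The downstream classifier (a Random Forest in the paper) is not shown.

```python
from itertools import combinations
from statistics import mean


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    mx, my = mean(x), mean(y)
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    if sx == 0 or sy == 0:
        return 0.0
    return sum((a - mx) * (b - my) for a, b in zip(x, y)) / (sx * sy)


def correlation_network(samples, threshold):
    """Build an undirected feature network (assumed stand-in for a network inference tool).

    samples: list of rows, one per sample, each a list of feature abundances.
    Two features are linked when |Pearson r| >= threshold.
    Returns an adjacency dict: feature index -> set of neighbour indices.
    """
    n_features = len(samples[0])
    columns = [[row[j] for row in samples] for j in range(n_features)]
    adjacency = {j: set() for j in range(n_features)}
    for i, j in combinations(range(n_features), 2):
        if abs(pearson(columns[i], columns[j])) >= threshold:
            adjacency[i].add(j)
            adjacency[j].add(i)
    return adjacency


def degree_centrality(adjacency):
    """Fraction of the other nodes each node is connected to."""
    denom = max(len(adjacency) - 1, 1)
    return {node: len(nbrs) / denom for node, nbrs in adjacency.items()}


def select_top_k(adjacency, k):
    """Rank features by degree centrality (ties broken by index) and keep the top k."""
    scores = degree_centrality(adjacency)
    ranked = sorted(scores, key=lambda node: (-scores[node], node))
    return ranked[:k]


# Hypothetical toy abundance table: 4 samples x 4 features.
# Features 0, 1, and 2 rise together; feature 3 does not follow them.
toy = [
    [1.0, 2.0, 3.0, 5.0],
    [2.0, 4.1, 6.2, 1.0],
    [3.0, 6.0, 8.9, 4.0],
    [4.0, 8.2, 12.1, 2.0],
]
network = correlation_network(toy, threshold=0.9)
selected = select_top_k(network, k=2)
```

In a full pipeline, the selected feature indices would be used to subset the abundance table before training and evaluating a classifier. The network is built from the feature values alone, without reference to class labels, which is one plausible reason such a ranking might remain stable when few labeled samples are available.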

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6874333
PLOS: http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0225382
