Reliable Biomarker discovery from Metagenomic data via RegLRSD algorithm.

BMC Bioinformatics

College of Veterinary Medicine and Biomedical Sciences, Gastrointestinal Laboratory, Texas A&M University, College Station, 77843-3128, TX, USA.

Published: July 2017

Background: Biomarker detection presents itself as a major means of translating biological data into clinical applications. Due to the recent advances in high throughput sequencing technologies, an increased number of metagenomics studies have suggested the dysbiosis in microbial communities as potential biomarker for certain diseases. The reproducibility of the results drawn from metagenomic data is crucial for clinical applications and to prevent incorrect biological conclusions. The variability in the sample size and the subjects participating in the experiments induce diversity, which may drastically change the outcome of biomarker detection algorithms. Therefore, a robust biomarker detection algorithm that ensures the consistency of the results irrespective of the natural diversity present in the samples is needed.

Results: Toward this end, this paper proposes a novel Regularized Low Rank-Sparse Decomposition (RegLRSD) algorithm. RegLRSD models the bacterial abundance data as a superposition between a sparse matrix and a low-rank matrix, which account for the differentially and non-differentially abundant microbes, respectively. Hence, the biomarker detection problem is cast as a matrix decomposition problem. In order to yield more consistent and solid biological conclusions, RegLRSD incorporates the prior knowledge that the irrelevant microbes do not exhibit significant variation between samples belonging to different phenotypes. Moreover, an efficient algorithm to extract the sparse matrix is proposed. Comprehensive comparisons of RegLRSD with the state-of-the-art algorithms on three realistic datasets are presented. The obtained results demonstrate that RegLRSD consistently outperforms the other algorithms in terms of reproducibility performance and provides a marker list with high classification accuracy.

Conclusions: The proposed RegLRSD algorithm for biomarker detection provides high reproducibility and classification accuracy performance regardless of the dataset complexity and the number of selected biomarkers. This renders RegLRSD as a reliable and powerful tool for identifying potential metagenomic biomarkers.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5504766PMC
http://dx.doi.org/10.1186/s12859-017-1738-1DOI Listing

Publication Analysis

Top Keywords

biomarker detection
20
reglrsd algorithm
12
metagenomic data
8
reglrsd
8
clinical applications
8
biological conclusions
8
sparse matrix
8
biomarker
6
algorithm
5
detection
5

Similar Publications

Alzheimer's disease (AD) is the leading cause of dementia worldwide, and the development of early screening methods can address its significant health and social consequences. In this paper, we present a rotary-valve assisted paper-based immunoassay device (RAPID) for early screening of AD, featuring a highly integrated on-chip rotary micro-valve that enables fully automated and efficient detection of the AD biomarker (amyloid beta 42, Aβ42) in artificial plasma. The microfluidic paper-based analytical device (μPAD) of the RAPID pre-stores the required assay reagents on a μPAD and automatically controls the liquid flow through a single valve.

View Article and Find Full Text PDF

Objective: To analyze the clinical characteristics and molecular biomarkers of adult T-cell lymphoblastic lymphoma (T-LBL) to identify prognostic factors, and to evaluate the efficacy of different chemotherapy regimens, providing a basis for optimizing treatment strategies for T-LBL.

Methods: A total of 89 Patients aged 18-72 years with T-LBL, confirmed via histopathological examination of lymph nodes, extranodal tissues, or bone marrow, were retrospectively included. Clinical data, treatment details, and mutational profiles were collected.

View Article and Find Full Text PDF

Rare Cell Population Analysis in Early-Stage Breast Cancer Patients.

Breast Cancer (Auckl)

January 2025

Department of Surgery, Faculty of Medicine Ramathibodi Hospital, Mahidol University, Bangkok, Thailand.

Background: Circulating rare cells participate in breast cancer evolution as systemic components of the disease and thus, are a source of theranostic information. Exploration of cancer-associated rare cells is in its infancy.

Objectives: We aimed to investigate and classify abnormalities in the circulating rare cell population among early-stage breast cancer patients using fluorescence marker identification and cytomorphology.

View Article and Find Full Text PDF

Background And Aims: Cadherins are adhesion proteins, and their dysregulation may result in the development of atherosclerosis, plaque rupture, or lesions of the vascular wall. The aim of the present study was to detect the associations of cadherins-P, -E, and -H, with atherosclerosis and pathological cardiovascular conditions.

Methods And Results: The present study with 3-year follow up evaluated atherosclerosis and fasting levels of P-, E-, and H-cadherins in the serum samples of 214 patients in a hospital setting.

View Article and Find Full Text PDF

Breath biopsy is emerging as a rapid and non-invasive diagnostic tool that links exhaled chemical signatures with specific medical conditions. Despite its potential, clinical translation remains limited by the challenge of reliably detecting endogenous, disease-specific biomarkers in breath. Synthetic biomarkers represent an emerging paradigm for precision diagnostics such that they amplify activity-based biochemical signals associated with disease fingerprints.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!