Biofluid-based metabolomics has the potential to provide highly accurate, minimally invasive diagnostics. Metabolomics studies using mass spectrometry typically reduce the high-dimensional data to only a small number of statistically significant features, that are often chemically identified-where each feature corresponds to a mass-to-charge ratio, retention time, and intensity. This practice may remove a substantial amount of predictive signal. To test the utility of the complete feature set, we train machine learning models for health state-prediction in 35 human metabolomics studies, representing 148 individual data sets. Models trained with all features outperform those using only significant features and frequently provide high predictive performance across nine health state categories, despite disparate experimental and disease contexts. Using only non-significant features it is still often possible to train models and achieve high predictive performance, suggesting useful predictive signal. This work highlights the potential for health state diagnostics using all metabolomics features with data-driven analysis.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7572502PMC
http://dx.doi.org/10.1038/s41598-020-74823-1DOI Listing

Publication Analysis

Top Keywords

biofluid-based metabolomics
8
machine learning
8
diagnostics metabolomics
8
metabolomics studies
8
predictive signal
8
high predictive
8
predictive performance
8
health state
8
metabolomics
5
features
5

Similar Publications

Metabolic biomarkers in lung cancer screening and early diagnosis (Review).

Oncol Lett

June 2023

Department of Thoracic Surgery, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100021, P.R. China.

Late diagnosis is one of the major contributing factors to the high mortality rate of lung cancer, which is now the leading cause of cancer-associated mortality worldwide. At present, low-dose CT (LDCT) screening in the high-risk population, in which lung cancer incidence is higher than that of the low-risk population is the predominant diagnostic strategy. Although this has efficiently reduced lung cancer mortality in large randomized trials, LDCT screening has high false-positive rates, resulting in excessive subsequent follow-up procedures and radiation exposure.

View Article and Find Full Text PDF

Background: Metabolomics is a potential means for biofluid-based lung cancer detection. We conducted a non-targeted, data-driven assessment of plasma from early-stage non-small cell lung cancer (ES-NSCLC) cases versus cancer-free controls (CFC) to explore and identify the classes of metabolites for further targeted metabolomics biomarker development.

Methods: Plasma from 250 ES-NSCLC cases and 250 CFCs underwent ultra-high-performance liquid chromatography/quadrupole time-of-flight mass spectrometry (UHPLC-QTOF-MS) in positive and negative electrospray ionization (ESI) modes.

View Article and Find Full Text PDF

Objective: Development of biofluid-based biomarkers is attractive for the diagnosis of chronic obstructive pulmonary disease (COPD) but still lacking. Thus, here we aimed to identify serum metabolic biomarkers for the diagnosis of COPD.

Methods: In this study, we investigated serum metabolic features between COPD patients (n = 54) and normal individuals (n = 74) using a H NMR-based metabolomics approach and developed an integrated method of least-squares support vector machine (LS-SVM) and serum metabolic biomarkers to assist COPD diagnosis.

View Article and Find Full Text PDF

Biofluid-based metabolomics has the potential to provide highly accurate, minimally invasive diagnostics. Metabolomics studies using mass spectrometry typically reduce the high-dimensional data to only a small number of statistically significant features, that are often chemically identified-where each feature corresponds to a mass-to-charge ratio, retention time, and intensity. This practice may remove a substantial amount of predictive signal.

View Article and Find Full Text PDF

Prostate cancer (PCa) is one of the most frequently diagnosed cancers and a leading cause of death among men worldwide. Despite extensive efforts in biomarker discovery during the last years, currently used clinical biomarkers are still lacking enough specificity and sensitivity for PCa early detection, patient prognosis, and monitoring. Therefore, more precise biomarkers are required to improve the clinical management of PCa patients.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!