The development of machine learning approaches in two-dimensional NMR data interpretation for metabolomics applications.

Anal Biochem

Department of Chemistry and Chemical Engineering, Florida Institute of Technology, 150 West University Boulevard, Melbourne, FL, 32901-6975, USA. Electronic address:

Published: December 2024

AI Article Synopsis

  • Metabolomics is used to study metabolite changes in human diseases and the environment, but one-dimensional NMR (1D NMR) faces issues with overlapping peaks that hinder accurate analysis.
  • This research developed an automated method using machine learning to process two-dimensional NMR (2D NMR) data, which overcomes the peak overlap problem present in 1D NMR.
  • The study found that machine learning techniques like artificial neural networks achieved about 90% accuracy in peak selection, suggesting their strong potential for quantitative applications in 2D NMR-based metabolomics.

Article Abstract

Metabolomics has been widely applied in human diseases and environmental science to study the systematic changes of metabolites over diverse types of stimuli. NMR-based metabolomics has been widely used, but the peak overlap problems in the one-dimensional (1D) NMR spectrum could limit the accuracy of quantitative analysis for metabolomics applications. Two-dimensional (2D) NMR has been applied to solve the 1D NMR overlap problem, but the data processing is still challenging. In this study, we built an automatic approach to process the 2D NMR data for quantitative applications using machine learning approaches. Partial least square discriminant analysis (PLS-DA), artificial neural network classification (ANN-DA), gradient boosted trees classification (XGBoost-DA), and artificial deep learning neural network classification (ANNDL-DA) were applied in combination with an automatic peak selection approach. Standard mixtures, sea anemone extracts, and mouse fecal samples were tested to demonstrate the approach. Our results showed that ANN-DA and ANNDL-DA have high accuracy in selecting 2D NMR peaks (around 90 %), which have a high potential application in 2D NMR-based metabolomics quantitively study, while PLS-DA and XGBoost-DA showed limitations in either data variation or overfitting. Our study built an automatic approach to applying 2D NMR data to routine quantitative analysis in metabolomics.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.ab.2024.115654DOI Listing

Publication Analysis

Top Keywords

nmr data
12
machine learning
8
learning approaches
8
two-dimensional nmr
8
metabolomics applications
8
nmr-based metabolomics
8
quantitative analysis
8
analysis metabolomics
8
study built
8
built automatic
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!