AI Article Synopsis

  • In untargeted metabolomics, identifying small molecules is difficult due to the large number of potential candidates, but techniques like liquid chromatography-mass spectrometry (LC-MS) and retention time (RT) prediction help narrow down options.
  • Recent advancements in artificial intelligence (AI) have improved RT prediction, especially with the availability of a large RT dataset, which has supported the application of deep learning models for better metabolite annotation.
  • This review outlines the progress in AI applications for RT prediction over the last five years, discussing available datasets, molecular representation challenges, and the effectiveness and remaining issues of AI in assisting small molecule structure identification.

Article Abstract

In untargeted metabolomics, structures of small molecules are annotated using liquid chromatography-mass spectrometry by leveraging information from the molecular retention time (RT) in the chromatogram and m/z (formerly called ''mass-to-charge ratio'') in the mass spectrum. However, correct identification of metabolites is challenging due to the vast array of small molecules. Therefore, various in silico tools for mass spectrometry peak alignment and compound prediction have been developed; however, the list of candidate compounds remains extensive. Accurate RT prediction is important to exclude false candidates and facilitate metabolite annotation. Recent advancements in artificial intelligence (AI) have led to significant breakthroughs in the use of deep learning models in various fields. Release of a large RT dataset has mitigated the bottlenecks limiting the application of deep learning models, thereby improving their application in RT prediction tasks. This review lists the databases that can be used to expand training datasets and concerns the issue about molecular representation inconsistencies in datasets. It also discusses the application of AI technology for RT prediction, particularly in the 5 years following the release of the METLIN small molecule RT dataset. This review provides a comprehensive overview of the AI applications used for RT prediction, highlighting the progress and remaining challenges. SCIENTIFIC CONTRIBUTION: This article focuses on the advancements in small molecule retention time prediction in computational metabolomics over the past five years, with a particular emphasis on the application of AI technologies in this field. It reviews the publicly available datasets for small molecule retention time, the molecular representation methods, the AI algorithms applied in recent studies. Furthermore, it discusses the effectiveness of these models in assisting with the annotation of small molecule structures and the challenges that must be addressed to achieve practical applications.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11460055PMC
http://dx.doi.org/10.1186/s13321-024-00905-1DOI Listing

Publication Analysis

Top Keywords

small molecule
20
molecule retention
12
deep learning
12
retention time
12
small molecules
8
learning models
8
molecular representation
8
small
7
prediction
6
molecule
5

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!