: Paired Untargeted LC-HRMS Metabolomics Feature Matching and Concatenation of Disparately Acquired Data Sets.

Anal Chem

Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, 100 Washtenaw Avenue, Arbor, Michigan 48109, United States.

Published: March 2021

AI Article Synopsis

  • LC-HRMS experiments often identify a limited number of compounds from thousands detected, challenging data processing due to varying chromatographic conditions.
  • A new computational pipeline was developed to align and match features from untargeted LC-MS data sets, improving collaborative compound identification and enabling meta-analyses.
  • The method showed high accuracy in feature matching and retention time predictions across different sample types and is available as an R package for broader use in metabolomics research.

Article Abstract

LC-HRMS experiments detect thousands of compounds, with only a small fraction of them identified in most studies. Traditional data processing pipelines contain an alignment step to assemble the measurements of overlapping features across samples into a unified table. However, data sets acquired under nonidentical conditions are not amenable to this process, mostly due to significant alterations in chromatographic retention times. Alignment of features between disparately acquired LC-MS metabolomics data could aid collaborative compound identification efforts and enable meta-analyses of expanded data sets. Here, we describe , a new computational pipeline for matching known and unknown features in a pair of untargeted LC-MS data sets and concatenating their abundances into a combined table of intersecting feature measurements. groups features by mass-to-charge (/) values to generate a search space of possible feature pair alignments, fits a spline through a set of selected retention time ordered pairs, and ranks alignments by /, mapped retention time, and relative abundance similarity. We evaluated this workflow on a pair of plasma metabolomics data sets acquired with different gradient elution methods, achieving a mean absolute retention time prediction error of roughly 0.06 min and a weighted per-compound matching accuracy of approximately 90%. We further demonstrate the utility of this method by comprehensively mapping features in urine and muscle metabolomics data sets acquired from different laboratories. has the potential to bridge the gap between otherwise incompatible metabolomics data sets and is available as an R package at https://github.com/hhabra/metabCombiner and .

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9906987PMC
http://dx.doi.org/10.1021/acs.analchem.0c03693DOI Listing

Publication Analysis

Top Keywords

data sets
28
metabolomics data
16
sets acquired
12
retention time
12
data
9
disparately acquired
8
sets
7
metabolomics
5
acquired
5
features
5

Similar Publications

CryoCrane: an open-source GUI for analyzing cryo-EM screening data sets.

Acta Crystallogr F Struct Biol Commun

February 2025

Institute for Biochemistry and Biology, University of Potsdam, Am Neuen Palais 10, 14469 Potsdam, Germany.

Screening of cryo-EM samples is essential for the generation of high-resolution cryo-EM structures. Often, it is cumbersome to correlate the appearance of specific grid squares and micrograph quality. Here, CryoCrane (Correlate atlas and exposures), a visualization tool for cryo-EM screening data, is presented.

View Article and Find Full Text PDF

Screening of obstructive sleep apnea and diabetes mellitus -related biomarkers based on integrated bioinformatics analysis and machine learning.

Sleep Breath

January 2025

Department of Respiratory and Critical Care Medicine, Medical School of Nantong University, Nantong Key Laboratory of Respiratory Medicine, Affiliated Hospital of Nantong University, Nantong, 226001, China.

Background: The pathophysiology of obstructive sleep apnea (OSA) and diabetes mellitus (DM) is still unknown, despite clinical reports linking the two conditions. After investigating potential roles for DM-related genes in the pathophysiology of OSA, our goal is to investigate the molecular significance of the condition. Machine learning is a useful approach to understanding complex gene expression data to find biomarkers for the diagnosis of OSA.

View Article and Find Full Text PDF

Self-esteem and depressive symptoms are important predictors of a range of societally relevant outcomes and are theorized to influence each other reciprocally over time. However, existing research offers only a limited understanding of how their dynamics unfold across different timescales. Using three data sets with different temporal resolutions, we aimed to advance our understanding of the temporal unfolding of the reciprocal dynamics between self-esteem and depressive symptoms.

View Article and Find Full Text PDF

4D-DIA Proteomics Uncovers New Insights into Host Salivary Response Following SARS-CoV-2 Omicron Infection.

J Proteome Res

January 2025

PPGEMN, School of Engineering, Mackenzie Presbyterian University & MackGraphe - Mackenzie Institute for Research in Graphene and Nanotechnologies, Mackenzie Presbyterian Institute, São Paulo, São Paulo 01302-907, Brazil.

Since late 2021, Omicron variants have dominated the epidemiological scenario as the most successful severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) sublineages, driving new and breakthrough infections globally over the past two years. In this study, we investigated for the first time the host salivary response of COVID-19 patients infected with Omicron variants (BA.1, BA.

View Article and Find Full Text PDF

Background: Biliary tract cancers (BTCs) represent distinct biological and genomic entities. Anatomic and geographic heterogeneity in genomic profiling of BTC subtypes, genomic co-alterations, and their impact on long-term outcomes are not well defined.

Methods: Genomic data to characterize alterations among patients with BTCs were derived from the AACR GENIE registry (v15.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!