IDSL.CSA: Composite Spectra Analysis for Chemical Annotation of Untargeted Metabolomics Datasets.

Anal Chem

Department of Environmental Medicine and Public Health, Icahn School of Medicine at Mount Sinai, New York, New York 10029, United States.

Published: June 2023

Poor chemical annotation of high-resolution mass spectrometry data limits applications of untargeted metabolomics datasets. Our new software, the Integrated Data Science Laboratory for Metabolomics and Exposomics-Composite Spectra Analysis (IDSL.CSA) R package, generates composite mass spectra libraries from MS1-only data, enabling the chemical annotation of peaks from liquid chromatography coupled to high-resolution mass spectrometry regardless of the availability of MS2 fragmentation spectra. In validation tests, we demonstrate comparable annotation rates for commonly detected endogenous metabolites in human blood samples using IDSL.CSA libraries versus MS/MS libraries. IDSL.CSA can create and search composite spectra libraries from any untargeted metabolomics dataset generated using high-resolution mass spectrometry coupled to liquid or gas chromatography instruments. The cross-applicability of these libraries across independent studies may provide access to new biological insights that would otherwise be missed due to the lack of MS2 fragmentation data. The IDSL.CSA package is available in the R-CRAN repository at https://cran.r-project.org/package=IDSL.CSA. Detailed documentation and tutorials are provided at https://github.com/idslme/IDSL.CSA.
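The general idea of building a composite spectrum from MS1-only data and searching it against a library can be illustrated with a minimal sketch. This is not the IDSL.CSA implementation or its API (the package itself is written in R). The function names `group_coeluting` and `cosine_score`, the retention-time and m/z tolerances, and the greedy peak matching are all assumptions chosen for illustration. The sketch groups MS1 peaks that co-elute within a retention-time window into one composite spectrum, then scores two spectra by cosine similarity.

```python
from math import sqrt


def group_coeluting(peaks, rt_tol=0.02):
    """Group MS1 peaks (mz, rt, intensity) into composite spectra.

    Hypothetical illustration: the most intense ungrouped peak is the seed,
    and every remaining peak within rt_tol of its retention time joins it.
    """
    remaining = sorted(peaks, key=lambda p: p[2], reverse=True)
    groups = []
    while remaining:
        seed_rt = remaining[0][1]
        members = [p for p in remaining if abs(p[1] - seed_rt) <= rt_tol]
        remaining = [p for p in remaining if abs(p[1] - seed_rt) > rt_tol]
        # A composite spectrum is kept as (mz, intensity) pairs sorted by m/z.
        groups.append(sorted((mz, inten) for mz, _, inten in members))
    return groups


def cosine_score(spec_a, spec_b, mz_tol=0.01):
    """Cosine similarity between two spectra given as (mz, intensity) lists.

    Peaks are matched greedily: each peak in spec_a takes the closest
    unused peak in spec_b within mz_tol (an assumed tolerance).
    """
    used = set()
    dot = 0.0
    for mz_a, int_a in spec_a:
        best = None
        for j, (mz_b, _) in enumerate(spec_b):
            if j in used or abs(mz_a - mz_b) > mz_tol:
                continue
            if best is None or abs(mz_a - mz_b) < abs(mz_a - spec_b[best][0]):
                best = j
        if best is not None:
            used.add(best)
            dot += int_a * spec_b[best][1]
    norm_a = sqrt(sum(i * i for _, i in spec_a))
    norm_b = sqrt(sum(i * i for _, i in spec_b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Toy data (invented values): two peaks co-elute near 5.0 min, one elutes at 7.5 min.
peaks = [(100.0, 5.00, 1000.0), (122.0, 5.01, 300.0), (200.0, 7.50, 800.0)]
composites = group_coeluting(peaks)
```

In this toy case, `composites` holds two composite spectra, and `cosine_score(composites[0], library_spectrum)` would be the score used to rank candidate library entries; a real workflow would also need peak-shape correlation, isotope and adduct handling, and score thresholds, none of which are shown here.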

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11080491
DOI: http://dx.doi.org/10.1021/acs.analchem.3c00376

Similar Publications

Transglutaminases (TGases) are enzymes highly conserved among prokaryotic and eukaryotic organisms, where their role is to catalyze protein cross-linking. One of the putative TGases of has previously been shown to be localized to the cell wall. Based on sequence similarity we were able to identify six more genes annotated as putative TGases and show that these seven genes group together in phylogenetic analysis.

Myocardial infarction (MI) is a major cause of death worldwide. Exercise rehabilitation (ER) is a powerful tool to improve life quality and prognosis of MI patients. Herein, we developed an untargeted metabolomics combined with lipidomics method to qualitatively and quantitatively detect metabolites in plasma.

Pseudolabel guided pixels contrast for domain adaptive semantic segmentation.

Sci Rep

December 2024

The Key Laboratory of Smart Manufacturing in Energy Chemical Process, Ministry of Education, East China University of Science and Technology, Shanghai, 200237, China.

Semantic segmentation is essential for comprehending images, but the process necessitates a substantial amount of detailed annotations at the pixel level. Acquiring such annotations can be costly in the real-world. Unsupervised domain adaptation (UDA) for semantic segmentation is a technique that uses virtual data with labels to train a model and adapts it to real data without labels.

The iPhylo suite: an interactive platform for building and annotating biological and chemical taxonomic trees.

Brief Bioinform

November 2024

MOE Key Laboratory of Biosystems Homeostasis & Protection, and Zhejiang Provincial Key Laboratory of Cancer Molecular Cell Biology, Life Sciences Institute, Zhejiang University, 866 Yuhangtang Road, Xihu District, Hangzhou, Zhejiang 310030, China.

Accurate and rapid taxonomic classifications are essential for systematically exploring organisms and metabolites in diverse environments. Many tools have been developed for biological taxonomic trees, but limitations apply, and a streamlined method for constructing chemical taxonomic trees is notably absent. We present the iPhylo suite (https://www.

Estimating baselines of Raman spectra based on transformer and manually annotated data.

Spectrochim Acta A Mol Biomol Spectrosc

December 2024

Department of Agricultural Technology, Center for Precision Agriculture, Norwegian Institute of Bioeconomy Research (NIBIO), Nylinna 226 2849, Kapp, Norway.

Raman spectroscopy is a powerful and non-invasive analytical method for determining the chemical composition and molecular structure of a wide range of materials, including complex biological tissues. However, the captured signals typically suffer from interferences manifested as noise and baseline, which need to be removed for successful data analysis. Effective baseline correction is critical in quantitative analysis, as it may impact peak signature derivation.
