Collision Cross Section Prediction with Molecular Fingerprint Using Machine Learning.

Molecules

Van 't Hoff Institute for Molecular Sciences (HIMS), University of Amsterdam, Science Park 904, 1098 XH Amsterdam, The Netherlands.

Published: September 2022

High-resolution mass spectrometry is a promising technique in non-target screening (NTS) to monitor contaminants of emerging concern in complex samples. Current chemical identification strategies in NTS experiments typically depend on spectral libraries, chemical databases, and in silico fragmentation tools. However, small molecule identification remains challenging due to the lack of orthogonal sources of information (e.g., unique fragments). Collision cross section (CCS) values measured by ion mobility spectrometry (IMS) offer an additional identification dimension to increase the confidence level. Thanks to the advances in analytical instrumentation, an increasing application of IMS hybrid with high-resolution mass spectrometry (HRMS) in NTS has been reported in the recent decades. Several CCS prediction tools have been developed. However, limited CCS prediction methods were based on a large scale of chemical classes and cross-platform CCS measurements. We successfully developed two prediction models using a random forest machine learning algorithm. One of the approaches was based on chemicals' super classes; the other model was direct CCS prediction using molecular fingerprint. Over 13,324 CCS values from six different laboratories and PubChem using a variety of ion-mobility separation techniques were used for training and testing the models. The test accuracy for all the prediction models was over 0.85, and the median of relative residual was around 2.2%. The models can be applied to different IMS platforms to eliminate false positives in small molecule identification.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9572128PMC
http://dx.doi.org/10.3390/molecules27196424DOI Listing

Publication Analysis

Top Keywords

ccs prediction
12
collision cross
8
prediction molecular
8
molecular fingerprint
8
machine learning
8
high-resolution mass
8
mass spectrometry
8
small molecule
8
molecule identification
8
ccs values
8

Similar Publications

Modeling Hydrogen Markets: Energy System Model Development Status and Decarbonization Scenario Results.

Energy Clim Chang

December 2024

South China University of Technology, School of Future Technology, 777 Xingye Ave East, Panyu District, Guangzhou, Guangdong, 511442, China.

Hydrogen can be used as an energy carrier and chemical feedstock to reduce greenhouse gas emissions, especially in difficult-to-decarbonize markets such as medium- and heavy-duty vehicles, aviation and maritime, iron and steel, and the production of fuels and chemicals. Significant literature has been accumulated on engineering-based assessments of various hydrogen technologies, and real-world projects are validating technology performance at larger scales and for low-carbon supply chains. While energy system models continue to be updated to track this progress, many are currently limited in their representation of hydrogen, and as a group they tend to generate highly variable results under decarbonization constraints.

View Article and Find Full Text PDF

Objectives: Femtosecond laser image guided high precision trabeculotomy (FLigHT) is a novel open-angle glaucoma treatment. The procedure non-invasively creates aqueous humor (AH) drainage channel from the anterior chamber (AC) to Schlemm's canal (SC) through the trabecular meshwork (TM) to decrease intraocular pressure (IOP). The purpose of this study was to develop a 3D finite element model (FEM) of the FLigHT procedure and to simulate clinical results for different drainage channel cross-sectional areas.

View Article and Find Full Text PDF

Small RNA sequencing analysis in two chickpea genotypes, JG 62 (Fusarium wilt-susceptible) and WR 315 (Fusarium wilt-resistant), under Fusarium wilt stress led to identification of 544 miRNAs which included 406 known and 138 novel miRNAs. A total of 115 miRNAs showed differential expression in both the genotypes across different combinations. A miRNA, Car-miR398 targeted copper chaperone for superoxide dismutase (CCS) that, in turn, regulated superoxide dismutase (SOD) activity during chickpea-Foc interaction.

View Article and Find Full Text PDF

Interpretation of machine learning-based prediction models and functional metagenomic approach to identify critical genes in HBCD degradation.

J Hazard Mater

December 2024

Institute of Information Science, Academia Sinica, No. 128, Section 2, Academia Road, Nankang, Taipei 11529, Taiwan; Institute of Fisheries Science, National Taiwan University, No. 1, Section 4, Roosevelt Rd., Taipei 10617, Taiwan. Electronic address:

Hexabromocyclododecane (HBCD) poses significant environmental risks, and identifying HBCD-degrading microbes and their enzymatic mechanisms is challenging due to the complexity of microbial interactions and metabolic pathways. This study aimed to identify critical genes involved in HBCD biodegradation through two approaches: functional annotation of metagenomes and the interpretation of machine learning-based prediction models. Our functional analysis revealed a rich metabolic potential in Chiang Chun soil (CCS) metagenomes, particularly in carbohydrate metabolism.

View Article and Find Full Text PDF

Traumatic brain injury (TBI) is a global issue and a major cause of patient mortality, and cerebral contusions (CCs) is a common primary TBI. The haemorrhagic progression of a contusion (HPC) poses a significant risk to patients' lives, and effectively predicting changes in haematoma volume is an urgent clinical challenge that needs to be addressed. As a branch of artificial intelligence, machine learning (ML) can proficiently handle a wide range of complex data and identify connections between data for tasks such as prediction and decision making.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!