Deep learning under mass-to-charge ratio pre-retrieval to realize electron ionization mass spectrometry library retrieval.

Rapid Commun Mass Spectrom

Zhejiang Engineering Research Center of Advcanced Mass Spectrometry and Clinical Application, Ningbo University, Ningbo, P. R. China.

Published: December 2022

AI Article Synopsis

  • GC-MS is an analytical method that helps identify compounds by comparing their mass spectra against a large database, but current algorithms struggle with accuracy and can be time-consuming.
  • This study introduces a deep learning ranking model that combines mass-to-charge ratio pre-retrieval with deep learning to enhance search accuracy and reduce processing time.
  • The new approach achieves about a 4% improvement in retrieval accuracy compared to traditional methods, and it also cuts computational time by more than 2 hours, offering a more effective solution for compound identification.

Article Abstract

Rationale: Gas chromatography-mass spectrometry (GC-MS) is an analytical technique widely used in materials science, biomedicine, and other fields. The target compound in the experiment is identified by searching for its mass spectrum in a large mass spectrum database using some algorithms. This work introduces the use of deep learning ranking for the identification of small molecules using low-resolution electron ionization MS. Because different spectra are often very similar, the algorithm produces wrong search results, and the search accuracy needs improvement. Due to the library's large amount of data, the algorithm sometimes requires a large amount of calculation and is very time consuming.

Methods: Given these two problems, this work aims to develop a model for ranking based on mass-to-charge ratio (m/z) pre-retrieval method combined with deep learning to improve search accuracy and reduce the algorithm's computational time. The master spectral library maintained by the National Institute of Standards and Technology is used as the reference library for all the experiments, and the replicate library is used as the query library to evaluate the method's performance.

Results: Compared with non-machine learning algorithms, the combination of m/z matching pre-retrieval and deep learning significantly improves library retrieval accuracy by about 4%. Moreover, compared with the deep learning sorting algorithm that does not use the pre-retrieval process, it improves the accuracy of spectral library retrieval by about 0.1% and reduces the computational time of the algorithm by more than 2 h.

Conclusions: This method identifies compounds more efficiently and accurately than non-machine learning and deep learning algorithms without a pre-retrieval process.

Download full-text PDF

Source
http://dx.doi.org/10.1002/rcm.9398DOI Listing

Publication Analysis

Top Keywords

deep learning
24
library retrieval
12
mass-to-charge ratio
8
electron ionization
8
mass spectrum
8
search accuracy
8
large amount
8
computational time
8
spectral library
8
non-machine learning
8

Similar Publications

Background: Primary intracranial germ cell tumors (iGCTs) are highly malignant brain tumors that predominantly occur in children and adolescents, with an incidence rate ranking third among primary brain tumors in East Asia (8%-15%). Due to their insidious onset and impact on critical functional areas of the brain, these tumors often result in irreversible abnormalities in growth and development, as well as cognitive and motor impairments in affected children. Therefore, early diagnosis through advanced screening techniques is vital for improving patient outcomes and quality of life.

View Article and Find Full Text PDF

A comparative analysis of CNNs and LSTMs for ECG-based diagnosis of arrythmia and congestive heart failure.

Comput Methods Biomech Biomed Engin

January 2025

Centre for Research Impact & Outcome, Chitkara University Institute of Engineering and Technology, Chitkara University, Rajpura, Punjab, India.

Cardiac arrhythmias are major global health concern and their early detection is critical for diagnosis. This study comprehensively evaluates the effectiveness of CNNs and LSTMs for the classification of cardiac arrhythmias, considering three PhysioNet datasets. ECG records are segmented to accommodate around ∼10s of ECG data.

View Article and Find Full Text PDF

ISLRWR: A network diffusion algorithm for drug-target interactions prediction.

PLoS One

January 2025

Shanghai Xinhao Information Technology Co., Ltd., Shanghai, China.

Machine learning techniques and computer-aided methods are now widely used in the pre-discovery tasks of drug discovery, effectively improving the efficiency of drug development and reducing the workload and cost. In this study, we used multi-source heterogeneous network information to build a network model, learn the network topology through multiple network diffusion algorithms, and obtain compressed low-dimensional feature vectors for predicting drug-target interactions (DTIs). We applied the metropolis-hasting random walk (MHRW) algorithm to improve the performance of the random walk with restart (RWR) algorithm, forming the basis by which the self-loop probability of the current node is removed.

View Article and Find Full Text PDF

Can Focusing on One Deep Learning Architecture Improve Fault Diagnosis Performance?

J Chem Inf Model

January 2025

Department of Chemical and Materials Engineering, Pontifical Catholic University of Rio de Janeiro, 225, Marquês de São Vicente Street, Gávea, Rio de Janeiro, RJ 22451-900, Brazil.

Machine learning approaches often involve evaluating a wide range of models due to various available architectures. This standard strategy can lead to a lack of depth in exploring established methods. In this study, we concentrated our efforts on a single deep learning architecture type to assess whether a focused approach could enhance performance in fault diagnosis.

View Article and Find Full Text PDF

Background: With the global population aging and advancements in the medical system, long-term care in healthcare institutions and home settings has become essential for older adults with disabilities. However, the diverse and scattered care requirements of these individuals make developing effective long-term care plans heavily reliant on professional nursing staff, and even experienced caregivers may make mistakes or face confusion during the care plan development process. Consequently, there is a rigid demand for intelligent systems that can recommend comprehensive long-term care plans for older adults with disabilities who have stable clinical conditions.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!