Motivation: A detailed analysis of multidimensional NMR spectra of macromolecules requires the identification of individual resonances (peaks). This task can be tedious and time-consuming and often requires support by experienced users. Automated peak picking algorithms were introduced more than 25 years ago, but there are still major deficiencies/flaws that often prevent complete and error free peak picking of biological macromolecule spectra. The major challenges of automated peak picking algorithms is both the distinction of artifacts from real peaks particularly from those with irregular shapes and also picking peaks in spectral regions with overlapping resonances which are very hard to resolve by existing computer algorithms. In both of these cases a visual inspection approach could be more effective than a 'blind' algorithm.
Results: We present a novel approach using computer vision (CV) methodology which could be better adapted to the problem of peak recognition. After suitable 'training' we successfully applied the CV algorithm to spectra of medium-sized soluble proteins up to molecular weights of 26 kDa and to a 130 kDa complex of a tetrameric membrane protein in detergent micelles. Our CV approach outperforms commonly used programs. With suitable training datasets the application of the presented method can be extended to automated peak picking in multidimensional spectra of nucleic acids or carbohydrates and adapted to solid-state NMR spectra.
Availability And Implementation: CV-Peak Picker is available upon request from the authors.
Contact: gsw@mol.biol.ethz.ch; michal.walczak@mol.biol.ethz.ch; adam.gonczarek@pwr.edu.pl
Supplementary Information: Supplementary data are available at Bioinformatics online.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1093/bioinformatics/btv318 | DOI Listing |
Methods Mol Biol
January 2025
Research and Innovation Centre, Fondazione E. Mach, Trento, Italy.
Liquid Chromatography-Mass Spectrometry (LC-MS) untargeted experiments require complex bioinformatic strategies to extract information from the experimental data. Here we discuss the "data preprocessing," the set of procedures performed on the raw data to produce a data matrix which will be the starting point for the subsequent statistical analysis. Data preprocessing is a crucial step on the path to knowledge extraction, which should be carefully controlled and optimized in order to maximize the output of any untargeted metabolomics investigation.
View Article and Find Full Text PDFNat Commun
January 2025
Physical and Theoretical Chemistry, University of Oxford, Oxford, UK.
To extract information from NMR experiments, users need to identify the number of resonances in the spectrum, together with characteristic features such as chemical shifts and intensities. In many applications, particularly those involving biomolecules, this procedure is typically a manual and laborious process. While many algorithms are available to tackle this problem, their performance tends to be inferior to that of an experienced user.
View Article and Find Full Text PDFMass Spectrom (Tokyo)
December 2024
Bruker Japan K.K., 3-9 Moriya-cho, Kanagawa-ku, Yokohama, Kanagawa 221-0022, Japan.
In the interpretation step of mass spectra obtained from synthetic polymers, isotope peaks are typically intense and cannot be ignored, especially in the higher mass range. To reduce the complexity of the spectra, deisotope processing is used. In this study, a deisotope processing method that is effective even for mixtures of different types of polymers is investigated.
View Article and Find Full Text PDFJ Proteome Res
January 2025
Computational Biology Laboratory, Centre de recherche du CHU de Québec, Université Laval, Québec City, Québec G1V 4G2, Canada.
In targeted proteomics utilizing Selected Reaction Monitoring (SRM), the precise detection of specific peptides within complex mixtures remains a significant challenge, particularly due to noise and interference in chromatograms. Existing methodologies, such as isotopic labeling and scoring algorithms, offer partial solutions but are constrained by high run times and elevated false discovery rates. To address these limitations, we have developed ProPickML a machine learning-based tool designed to accurately identify peptide peaks across diverse data sets, independent of the assumed presence of the peptide.
View Article and Find Full Text PDFPLoS One
October 2024
Department of Environmental and Occupational Health, Chair in Toxicological Risk Assessment and Management, and Public Health Research Center (CReSP), University of Montreal, Montreal, Quebec, Canada.
A toxicokinetic model of the pyrethroid insecticide lambda-cyhalothrin (LCT) was developed to relate absorbed doses to urinary cis-3-(2-chloro-3,3,3-trifluoroprop-1-en-1-yl)-2,2-dimethylcyclopropanecarboxylic acid (CFMP) metabolite levels used as a biomarker of exposure. The model then served to reconstruct absorbed doses in agricultural workers and their probability of exceeding the EFSA Acceptable occupational Exposure Level (AOEL). The toxicokinetic model was able to reproduce the temporal profiles of CFMP in the urine of operators spraying pesticides using the optimized model parameters (adjusted to human volunteer data).
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!