Compound identification by database searching that matches experimental with library mass spectra is commonly used in mass spectrometric (MS) data analysis. Vendor software often outputs scores that represent the quality of each spectral match for the identified compounds. However, software-generated identification results can differ drastically depending on the initial search parameters. Machine learning is applied here to provide a statistical evaluation of software-generated compound identification results from experimental tandem MS data. This task was accomplished using the logistic regression algorithm to assign an identification probability value to each identified compound. Logistic regression is usually used for classification, but here it is used to generate identification probabilities without setting a threshold for classification. Liquid chromatography coupled with quadrupole-time-of-flight tandem MS was used to analyze the organic monomers leached from resin-based dental composites in a simulated oral environment. The collected tandem MS data were processed with vendor software, followed by statistical evaluation of these results using logistic regression. The assigned identification probability to each compound provides more confidence in identification beyond solely by database matching. A total of 21 distinct monomers were identified among all samples, including five intact monomers and chemical degradation products of bisphenol A glycidyl methacrylate (BisGMA), oligomers of bisphenol-A ethoxylate methacrylate (BisEMA), triethylene glycol dimethacrylate (TEGDMA), and urethane dimethacrylate (UDMA). The logistic regression model can be used to evaluate any database-matched liquid chromatography-tandem MS result by training a new model using analytical standards of compounds present in a chosen database and then generating identification probabilities for candidates from unknown data using the new model.

Download full-text PDF

Source
http://dx.doi.org/10.1021/acs.analchem.2c04362DOI Listing

Publication Analysis

Top Keywords

logistic regression
20
dental composites
8
identification
8
compound identification
8
vendor software
8
statistical evaluation
8
tandem data
8
identification probability
8
identification probabilities
8
logistic
5

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!