Predicting Peptide Ionization Efficiencies for Electrospray Ionization Mass Spectrometry Using Machine Learning.

Justin A Kaskow Eric T Hahnert Thomas K Porter Yali Lu Valentin Stanev Chendi Niu Wei Xu Methal Albarghouthi Chunlei Wang

J Am Soc Mass Spectrom

Analytical Sciences, BioPharmaceuticals R&D, AstraZeneca, Gaithersburg, Maryland 20878, United States.

Published: October 2024

Mass spectrometry (MS) is inherently an information-rich technique. In this era of big data, label-free MS quantification for nontargeted studies has gained increasing popularity, especially for complex systems. One of the cornerstones of successful label-free quantification is the predictive modeling of ionization efficiency (IE) based on solutes' physicochemical properties. While many have studied IE modeling for small molecules, there are limited reports on peptide IEs. In this study, we leverage the stoichiometric relationship in trypsin digests of well-characterized monoclonal antibodies (mAbs) to compile a data set of relative ionization efficiencies (RIEs) for 241 peptides. From each peptide's sequence, we computed a set of physiochemical descriptors, which were then used to train machine learning regression models to predict RIEs. Peptides shorter than 20 amino acids had RIEs that were highly correlated to their molecular weight. A random forest (RF) model was able to best predict the RIEs of a test data set with a mean relative error of 23.9%. For larger peptides, a multilayer perceptron (MLP) model improved RIE prediction compared to current best practices, reducing mean relative error from 60.5% to 32.0%. Finally, we also show the application of the RF model in label-free relative protein quantification and improving the quantification of peptide post-translational modifications (PTMs). This approach to predicting peptide IEs from their sequences enables the development of accurate label-free quantification workflows for peptide and protein analysis.

Download full-text PDF	Source
http://dx.doi.org/10.1021/jasms.4c00137	DOI Listing

Publication Analysis

Top Keywords

label-free quantification

predicting peptide

ionization efficiencies

mass spectrometry

machine learning

peptide ies

data set

set relative

predict ries

relative error

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!