Peptide-spectrum matching is one of the most time-consuming portion of the database search method for assignment of tandem mass spectra to peptides. In this study, we develop a parallel algorithm for peptide-spectrum matching using Single-Instruction Multiple Data (SIMD) instructions. Unlike other parallel algorithms in peptide-spectrum matching, our algorithm parallelizes the computation of matches between a single spectrum and a given peptide sequence from the database. It also significantly reduces the number of comparison operations. Extra improvements are obtained by using SIMD instructions to avoid conditional branches and unnecessary memory access within the algorithm. The implementation of the developed algorithm is based on the Streaming SIMD Extensions technology that is embedded in most Intel microprocessors. Similar technology also exists in other modern microprocessors. A simulation shows that the developed algorithm achieves an 18-fold speedup over the previous version of Real-Time Peptide-Spectrum Matching algorithm [F. X. Wu et al., Rapid Commun. Mass Sepctrom. 2006, 20, 1199-1208]. Therefore, the developed algorithm can be employed to develop real-time control methods for MS/MS.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1002/pmic.201100182 | DOI Listing |
J Vis Exp
January 2025
Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota;
Clinical metaproteomics reveals host-microbiome interactions underlying diseases. However, challenges to this approach exist. In particular, the characterization of microbial proteins present in low abundance relative to host proteins is difficult.
View Article and Find Full Text PDFJ Proteome Res
January 2025
Institute of Pharmacy and Molecular Biotechnology, Heidelberg University, 69120 Heidelberg, Germany.
The first step in bottom-up proteomics is the assignment of measured fragmentation mass spectra to peptide sequences, also known as peptide spectrum matches. In recent years novel algorithms have pushed the assignment to new heights; unfortunately, different algorithms come with different strengths and weaknesses and choosing the appropriate algorithm poses a challenge for the user. Here we introduce PeptideForest, a semisupervised machine learning approach that integrates the assignments of multiple algorithms to train a random forest classifier to alleviate that issue.
View Article and Find Full Text PDFJ Proteome Res
December 2024
Department of Artificial Intelligence, Hanyang University, Seoul 04763, Republic of Korea.
peptide sequencing is a valuable technique in mass-spectrometry-based proteomics, as it deduces peptide sequences directly from tandem mass spectra without relying on sequence databases. This database-independent method, however, relies solely on imperfect scoring functions that often lead to erroneous peptide identifications. To boost correct identification, we present NovoRank, a postprocessing tool that employs spectral clustering and machine learning to assign more plausible peptide sequences to spectra.
View Article and Find Full Text PDFJ Proteome Res
December 2024
Department of Neurology, University of Tennessee Health Science Center, Memphis, Tennessee 38103, United States.
The identification of peptides is a cornerstone of mass spectrometry-based proteomics. Spectral library-based algorithms are well-established methods to enhance the identification efficiency of peptides during database searches in proteomics. However, these algorithms are not specifically tailored for tandem mass tag (TMT)-based proteomics due to the lack of high-quality TMT spectral libraries.
View Article and Find Full Text PDFThe goal of proteomics is to identify and quantify peptides and proteins within a biological sample. Almost all algorithms for the identification of peptides in LC-MS/MS data employ two steps: peptide/spectrum matching and peptide-identity-propagation (PIP), also known as match-between-runs. PIP was originally envisioned as a backup method to overcome measurement stochasticity.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!