Peptide-spectrum matching is one of the most time-consuming portion of the database search method for assignment of tandem mass spectra to peptides. In this study, we develop a parallel algorithm for peptide-spectrum matching using Single-Instruction Multiple Data (SIMD) instructions. Unlike other parallel algorithms in peptide-spectrum matching, our algorithm parallelizes the computation of matches between a single spectrum and a given peptide sequence from the database. It also significantly reduces the number of comparison operations. Extra improvements are obtained by using SIMD instructions to avoid conditional branches and unnecessary memory access within the algorithm. The implementation of the developed algorithm is based on the Streaming SIMD Extensions technology that is embedded in most Intel microprocessors. Similar technology also exists in other modern microprocessors. A simulation shows that the developed algorithm achieves an 18-fold speedup over the previous version of Real-Time Peptide-Spectrum Matching algorithm [F. X. Wu et al., Rapid Commun. Mass Sepctrom. 2006, 20, 1199-1208]. Therefore, the developed algorithm can be employed to develop real-time control methods for MS/MS.

Download full-text PDF

Source
http://dx.doi.org/10.1002/pmic.201100182DOI Listing

Publication Analysis

Top Keywords

peptide-spectrum matching
20
developed algorithm
12
matching single-instruction
8
simd instructions
8
matching algorithm
8
algorithm
7
peptide-spectrum
5
matching
5
speed improvements
4
improvements peptide-spectrum
4

Similar Publications

Clinical metaproteomics reveals host-microbiome interactions underlying diseases. However, challenges to this approach exist. In particular, the characterization of microbial proteins present in low abundance relative to host proteins is difficult.

View Article and Find Full Text PDF

The first step in bottom-up proteomics is the assignment of measured fragmentation mass spectra to peptide sequences, also known as peptide spectrum matches. In recent years novel algorithms have pushed the assignment to new heights; unfortunately, different algorithms come with different strengths and weaknesses and choosing the appropriate algorithm poses a challenge for the user. Here we introduce PeptideForest, a semisupervised machine learning approach that integrates the assignments of multiple algorithms to train a random forest classifier to alleviate that issue.

View Article and Find Full Text PDF

NovoRank: Refinement for Peptide Sequencing Based on Spectral Clustering and Deep Learning.

J Proteome Res

December 2024

Department of Artificial Intelligence, Hanyang University, Seoul 04763, Republic of Korea.

peptide sequencing is a valuable technique in mass-spectrometry-based proteomics, as it deduces peptide sequences directly from tandem mass spectra without relying on sequence databases. This database-independent method, however, relies solely on imperfect scoring functions that often lead to erroneous peptide identifications. To boost correct identification, we present NovoRank, a postprocessing tool that employs spectral clustering and machine learning to assign more plausible peptide sequences to spectra.

View Article and Find Full Text PDF

The identification of peptides is a cornerstone of mass spectrometry-based proteomics. Spectral library-based algorithms are well-established methods to enhance the identification efficiency of peptides during database searches in proteomics. However, these algorithms are not specifically tailored for tandem mass tag (TMT)-based proteomics due to the lack of high-quality TMT spectral libraries.

View Article and Find Full Text PDF

The goal of proteomics is to identify and quantify peptides and proteins within a biological sample. Almost all algorithms for the identification of peptides in LC-MS/MS data employ two steps: peptide/spectrum matching and peptide-identity-propagation (PIP), also known as match-between-runs. PIP was originally envisioned as a backup method to overcome measurement stochasticity.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!