Speeding up tandem mass spectrometry-based database searching by longest common prefix.

BMC Bioinformatics

Key Lab of Intelligent Information Processing, Chinese Academy of Sciences, Beijing 100190, China.

Published: November 2010

Background: Tandem mass spectrometry-based database searching has become an important technology for peptide and protein identification. One of the key challenges in database searching is the remarkable increase in computational demand, brought about by the expansion of protein databases, semi- or non-specific enzymatic digestion, post-translational modifications and other factors. Some software tools use peptide indexing to accelerate processing. However, constructing a peptide index requires a large amount of time and space, especially for non-specific digestion, and the resulting index is inflexible to use.

Results: We developed an algorithm based on the longest common prefix (ABLCP) to efficiently organize a protein sequence database. The longest common prefix is a data structure that is always coupled to the suffix array. The algorithm exploits the properties of the longest common prefix to eliminate redundant candidate peptides in databases and to reduce the corresponding number of peptide-spectrum matches, thereby decreasing the identification time. Although enzymatic digestion poses a challenge to this property, the algorithm can be adjusted to ensure that no candidate peptides are omitted. Compared with peptide indexing, ABLCP requires much less time and space for construction and is subject to fewer restrictions.
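
As a rough illustration of how a longest common prefix (LCP) array can remove redundant candidates, the following Python sketch enumerates each distinct peptide of a sequence exactly once under non-specific digestion. It is not the authors' ABLCP implementation: the function names (`suffix_array`, `lcp_array`, `distinct_peptides`), the single-sequence input, the naive suffix-array construction and the length bounds are all assumptions made for this example, and the adjustments for enzymatic digestion mentioned above are not modelled.

```python
def suffix_array(s):
    # Naive construction by sorting suffixes; adequate for a small example only.
    return sorted(range(len(s)), key=lambda i: s[i:])


def lcp_array(s, sa):
    # Kasai's algorithm: lcp[r] is the length of the common prefix shared by
    # the suffixes at sorted positions r - 1 and r (lcp[0] is 0).
    n = len(s)
    rank = [0] * n
    for r, i in enumerate(sa):
        rank[i] = r
    lcp = [0] * n
    h = 0
    for i in range(n):
        if rank[i] > 0:
            j = sa[rank[i] - 1]
            while i + h < n and j + h < n and s[i + h] == s[j + h]:
                h += 1
            lcp[rank[i]] = h
            if h > 0:
                h -= 1
        else:
            h = 0
    return lcp


def distinct_peptides(s, min_len, max_len):
    # Hypothetical helper: list every distinct substring of s whose length
    # lies in [min_len, max_len], visiting suffixes in sorted order.
    sa = suffix_array(s)
    lcp = lcp_array(s, sa)
    peptides = []
    for r, start in enumerate(sa):
        # Prefixes of length <= lcp[r] also begin the previous suffix and were
        # already produced, so only longer prefixes are new candidates.
        first_new = max(min_len, lcp[r] + 1)
        longest = min(max_len, len(s) - start)
        for length in range(first_new, longest + 1):
            peptides.append(s[start:start + length])
    return peptides


if __name__ == "__main__":
    for peptide in distinct_peptides("MKAMKA", 2, 3):
        print(peptide)
```

For the toy sequence "MKAMKA" with lengths 2 to 3, a sliding window visits 9 substrings, while the LCP-based enumeration yields the 6 distinct ones, so each candidate would need to be scored against a spectrum only once.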

Conclusions: The ABLCP algorithm can help to improve data analysis efficiency. A software tool implementing this algorithm is available at http://pfind.ict.ac.cn/pfind2dot5/index.htm.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3000425
DOI: http://dx.doi.org/10.1186/1471-2105-11-577

