The problem of identifying proteins from a shotgun proteomics experiment has not been definitively solved. Identifying the proteins in a sample requires ranking them, ideally with interpretable scores. In particular, "degenerate" peptides, which map to multiple proteins, have made such a ranking difficult to compute. The problem of computing posterior probabilities for the proteins, which can be interpreted as confidence in a protein's presence, has been especially daunting. Previous approaches have either ignored the peptide degeneracy problem completely, addressed it by computing a heuristic set of proteins or heuristic posterior probabilities, or estimated the posterior probabilities with sampling methods. We present a probabilistic model for protein identification in tandem mass spectrometry that recognizes peptide degeneracy. We then introduce graph-transforming algorithms that facilitate efficient computation of protein probabilities, even for large data sets. We evaluate our identification procedure on five different well-characterized data sets and demonstrate our ability to efficiently compute high-quality protein posteriors.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2948606PMC
http://dx.doi.org/10.1021/pr100594kDOI Listing

Publication Analysis

Top Keywords

posterior probabilities
16
mass spectrometry
8
identifying proteins
8
peptide degeneracy
8
data sets
8
probabilities
5
proteins
5
efficient marginalization
4
marginalization compute
4
protein
4

Similar Publications

Background: Patients with vertebral compression fractures may experience unpredictable residual pain following vertebral augmentation. Clinical prediction models have shown potential for early prevention and intervention of such residual pain. However, studies focusing on the quality and accuracy of these prediction models are lacking.

View Article and Find Full Text PDF

The use of mixed-effect models to understand the evolution of the human immunodeficiency virus (HIV) and the progression of acquired immune deficiency syndrome (AIDS) has been the cornerstone of longitudinal data analysis in recent years. However, data from HIV/AIDS clinical trials have several complexities. Some of the most common recurrences are related to the situation where the HIV viral load can be undetectable, and the measures of the patient can be registered irregularly due to some problems in the data collection.

View Article and Find Full Text PDF

Bayesian method for comparing F1 scores in the absence of a gold standard.

J Biopharm Stat

January 2025

Department of Biostatistics, School of Medicine, Yokohama City University, Yokohama, Japan.

In the field of medicine, evaluating the diagnostic performance of new diagnostic methods can be challenging, especially in the absence of a gold standard. This study proposes a methodology for assessing the performance of diagnostic tests by estimating the posterior distribution of the score using latent class analysis, without relying on a gold standard. The proposed method utilizes Markov Chain Monte Carlo sampling to estimate the posterior distribution of the score, enabling a comprehensive evaluation of diagnostic test methods.

View Article and Find Full Text PDF

To develop and validate a predictive model for assessing the risk of early postoperative high intraocular pressure (HIOP) following posterior chamber intraocular lens implantation. The clinical data of patients who underwent posterior chamber intraocular lens implantation at the Second Affiliated Hospital of Zhejiang University School of Medicine between May 2023 and April 2024 were retrospectively reviewed. Patients were divided into a modeling group and a validation group with a 7∶3 ratio using computerized random allocation.

View Article and Find Full Text PDF

Background: Atrial fibrillation (AF) is the most prevalent arrhythmia encountered in clinical practice. Triglyceride glucose index (Tyg), a convenient evaluation variable for insulin resistance, has shown associations with adverse cardiovascular outcomes. However, studies on the Tyg index's predictive value for adverse prognosis in patients with AF without diabetes are lacking.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!