Targeted Feature Detection for Data-Dependent Shotgun Proteomics.

J Proteome Res

Proteomic Mass Spectrometry, Wellcome Trust Sanger Institute, Cambridge CB10 1SA, United Kingdom.

Published: August 2017

Label-free quantification of shotgun LC-MS/MS data is the prevailing approach in quantitative proteomics but remains computationally nontrivial. The central data analysis step is the detection of peptide-specific signal patterns, called features. Peptide quantification is facilitated by associating signal intensities in features with peptide sequences derived from MS2 spectra; however, missing values due to imperfect feature detection are a common problem. A feature detection approach that directly targets identified peptides (minimizing missing values) but also offers robustness against false-positive features (by assigning meaningful confidence scores) would thus be highly desirable. We developed a new feature detection algorithm within the OpenMS software framework, leveraging ideas and algorithms from the OpenSWATH toolset for DIA/SRM data analysis. Our software, FeatureFinderIdentification ("FFId"), implements a targeted approach to feature detection based on information from identified peptides. This information is encoded in an MS1 assay library, based on which ion chromatogram extraction and detection of feature candidates are carried out. Significantly, when analyzing data from experiments comprising multiple samples, our approach distinguishes between "internal" and "external" (inferred) peptide identifications (IDs) for each sample. On the basis of internal IDs, two sets of positive (true) and negative (decoy) feature candidates are defined. A support vector machine (SVM) classifier is then trained to discriminate between the sets and is subsequently applied to the "uncertain" feature candidates from external IDs, facilitating selection and confidence scoring of the best feature candidate for each peptide. This approach also enables our algorithm to estimate the false discovery rate (FDR) of the feature selection step. We validated FFId based on a public benchmark data set, comprising a yeast cell lysate spiked with protein standards that provide a known ground-truth. The algorithm reached almost complete (>99%) quantification coverage for the full set of peptides identified at 1% FDR (PSM level). Compared with other software solutions for label-free quantification, this is an outstanding result, which was achieved at competitive quantification accuracy and reproducibility across replicates. The FDR for the feature selection was estimated at a low 1.5% on average per sample (3% for features inferred from external peptide IDs). The FFId software is open-source and freely available as part of OpenMS ( www.openms.org ).

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5547443PMC
http://dx.doi.org/10.1021/acs.jproteome.7b00248DOI Listing

Publication Analysis

Top Keywords

feature detection
20
feature candidates
12
feature
10
label-free quantification
8
data analysis
8
features peptide
8
missing values
8
identified peptides
8
fdr feature
8
feature selection
8

Similar Publications

Improved deep canonical correlation fusion approach for detection of early mild cognitive impairment.

Med Biol Eng Comput

January 2025

Non-Invasive Imaging and Diagnostic Laboratory, Department of Applied Mechanics and Biomedical Engineering, Indian Institute of Technology Madras, Chennai, India.

Detection of early mild cognitive impairment (EMCI) is clinically challenging as it involves subtle alterations in multiple brain sub-anatomic regions. Among different brain regions, the corpus callosum and lateral ventricles are primarily affected due to EMCI. In this study, an improved deep canonical correlation analysis (CCA) based framework is proposed to fuse magnetic resonance (MR) image features from lateral ventricular and corpus callosal structures for the detection of EMCI condition.

View Article and Find Full Text PDF

A novel electrochemical aptasensor based on bimetallic zirconium and copper oxides embedded within mesoporous carbon (denoted as ZrOCuO@mC) was constructed to detect miRNA. The porous ZrOCuO@mC was created through the pyrolysis of bimetallic zirconium/copper-based metal-organic framework (ZrCu-MOF). The substantial surface area and high porosity of ZrOCuO@mC nanocomposite along with its robust affinity toward aptamer strands, facilitated the effective anchoring of aptamer strands on the ZrOCuO@mC-modified electrode surface.

View Article and Find Full Text PDF

VirDetect-AI: a residual and convolutional neural network-based metagenomic tool for eukaryotic viral protein identification.

Brief Bioinform

November 2024

Departamento de Genética del Desarrollo y Fisiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos 62210, México.

This study addresses the challenging task of identifying viruses within metagenomic data, which encompasses a broad array of biological samples, including animal reservoirs, environmental sources, and the human body. Traditional methods for virus identification often face limitations due to the diversity and rapid evolution of viral genomes. In response, recent efforts have focused on leveraging artificial intelligence (AI) techniques to enhance accuracy and efficiency in virus detection.

View Article and Find Full Text PDF

Unlabelled: This study investigated the impact of the coronavirus disease 2019 (COVID-19) pandemic and its associated restrictive measures on infections in children with acute respiratory tract infection. The study aimed to elucidate the epidemiological characteristics of infections before and during the pandemic and following the easing of restrictive measures. Pharyngeal secretions were collected from 1,0174 pediatric patients with acute respiratory infection (ARI) who were admitted to Shaoxing Maternity and Child Health Care Hospital (Shaoxing, China) between May 2018 and December 2023.

View Article and Find Full Text PDF

is a bacterium associated with colorectal cancer (CRC) tumorigenesis, progression, and metastasis. Fap2 is a fusobacteria-specific outer membrane galactose-binding lectin that mediates adherence to and invasion of CRC tumors. Advances in omics analyses provide an opportunity to profile and identify microbial genomic features that correlate with the cancer-associated bacterial virulence factor Fap2.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!