An integrated mass-spectrometry pipeline identifies novel protein coding-regions in the human genome.

PLoS One

Applied Computational Biology and Bioinformatics Group, Cancer Research UK, Paterson Institute for Cancer Research, The University of Manchester, Manchester, United Kingdom.

Published: January 2010

Background: Most protein mass spectrometry (MS) experiments rely on searches against a database of known or predicted proteins, limiting their ability as a gene discovery tool.

Results: Using a search against an in silico translation of the entire human genome, combined with a series of annotation filters, we identified 346 putative novel peptides [False Discovery Rate (FDR)<5%] in a MS dataset derived from two human breast epithelial cell lines. A subset of these were then successfully validated by a different MS technique. Two of these correspond to novel isoforms of Heterogeneous Ribonuclear Proteins, while the rest correspond to novel loci.

Conclusions: MS technology can be used for ab initio gene discovery in human data, which, since it is based on different underlying assumptions, identifies protein-coding genes not found by other techniques. As MS technology continues to evolve, such approaches will become increasingly powerful.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2812506PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0008949PLOS

Publication Analysis

Top Keywords

human genome
8
integrated mass-spectrometry
4
mass-spectrometry pipeline
4
pipeline identifies
4
identifies novel
4
novel protein
4
protein coding-regions
4
coding-regions human
4
genome background
4
background protein
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!