A statistical framework for combining and interpreting proteomic datasets.

Michael A Gilchrist Laura A Salter Andreas Wagner

Bioinformatics

Department of Biology, University of New Mexico, Albuquerque 87106, USA.

Published: March 2004

Motivation: To identify accurately protein function on a proteome-wide scale requires integrating data within and between high-throughput experiments. High-throughput proteomic datasets often have high rates of errors and thus yield incomplete and contradictory information. In this study, we develop a simple statistical framework using Bayes' law to interpret such data and combine information from different high-throughput experiments. In order to illustrate our approach we apply it to two protein complex purification datasets.

Results: Our approach shows how to use high-throughput data to calculate accurately the probability that two proteins are part of the same complex. Importantly, our approach does not need a reference set of verified protein interactions to determine false positive and false negative error rates of protein association. We also demonstrate how to combine information from two separate protein purification datasets into a combined dataset that has greater coverage and accuracy than either dataset alone. In addition, we also provide a technique for estimating the total number of proteins which can be detected using a particular experimental technique.

Availability: A suite of simple programs to accomplish some of the above tasks is available at www.unm.edu/~compbio/software/DatasetAssess

Download full-text PDF	Source
http://dx.doi.org/10.1093/bioinformatics/btg469	DOI Listing

Publication Analysis

Top Keywords

statistical framework

proteomic datasets

high-throughput experiments

protein

framework combining

combining interpreting

interpreting proteomic

datasets motivation

motivation identify

identify accurately

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!