Publications by Erik Pitzer

Publications by authors named "Erik Pitzer"

Page 1 of 1

PositionMatcher: A Fast Custom-Annotation Tool for Short DNA Sequences.

Erik Pitzer Jihoon Kim Kiltesh Patel Pedro A Galante Lucila Ohno-Machado

Summit Transl Bioinform

March 2010

Microarray probes and reads from massively parallel sequencing technologies are two most widely used genomic tags for a transcriptome study. Names and underlying technologies might differ, but expression technologies share a common objective-to obtain mRNA abundance values at the gene level, with high sensitivity and specificity. However, the initial tag annotation becomes obsolete as more insight is gained into biological references (genome, transcriptome, SNP, etc.

View Article and Find Full Text PDF

DSGeo: software tools for cross-platform analysis of gene expression data in GEO.

Ronilda Lacson Erik Pitzer Jihoon Kim Pedro Galante Christian Hinske

J Biomed Inform

October 2010

The Gene Expression Omnibus (GEO) is the largest resource of public gene expression data. While GEO enables data browsing, query and retrieval, additional tools can help realize its potential for aggregating and comparing data across multiple studies and platforms. This paper describes DSGeo-a collection of valuable tools that were developed for annotating, aggregating, integrating, and analyzing data deposited in GEO.

View Article and Find Full Text PDF

Towards large-scale sample annotation in gene expression repositories.

Erik Pitzer Ronilda Lacson Christian Hinske Jihoon Kim Pedro Af Galante

BMC Bioinformatics

September 2009

Background: Large repositories of biomedical research data are most useful to translational researchers if their data can be aggregated for efficient queries and analyses. However, inconsistent or non-existent annotations describing important sample details such as name of tissue or cell line, histopathological type, and subject characteristics like demographics, treatment, and survival are seldom present in data repositories, making it difficult to aggregate data.

Results: We created a flexible software tool that allows efficient annotation of samples using a controlled vocabulary, and report on its use for the annotation of over 12,500 samples.

View Article and Find Full Text PDF

Evaluation of a large-scale biomedical data annotation initiative.

Ronilda Lacson Erik Pitzer Christian Hinske Pedro Galante Lucila Ohno-Machado

BMC Bioinformatics

September 2009

Background: This study describes a large-scale manual re-annotation of data samples in the Gene Expression Omnibus (GEO), using variables and values derived from the National Cancer Institute thesaurus. A framework is described for creating an annotation scheme for various diseases that is flexible, comprehensive, and scalable. The annotation structure is evaluated by measuring coverage and agreement between annotators.

View Article and Find Full Text PDF

Assessing peptide de novo sequencing algorithms performance on large and diverse data sets.

Erik Pitzer Alexandre Masselot Jacques Colinge

Proteomics

September 2007

De novo peptide sequencing algorithms are often tested on relatively small data sets made of excellent spectra. Since there are always more and more tandem mass spectra available, we have assembled six large, reliable, and diverse (three mass spectrometer types) data sets intended for such tests and we make them accessible via a web server. To exemplify their use we investigate the performance of Lutefisk, PepNovo, and PepNovoTag, three well-established peptide de novo sequencing programs.

View Article and Find Full Text PDF