Robust adjustment of sequence tag abundance.

Bioinformatics

Department of Statistics, Purdue University, West Lafayette, IN 47907, USA.

Published: March 2014

Motivation: The majority of next-generation sequencing technologies effectively sample small amounts of DNA or RNA that are amplified (i.e. copied) before sequencing. The amplification process is not perfect, leading to extreme bias in sequenced read counts. We present a novel procedure to account for amplification bias and demonstrate its effectiveness in mitigating gene length dependence when estimating true gene expression.

Results: We tested the proposed method on simulated and real data. Simulations indicated that our method captures true gene expression more effectively than classic censoring-based approaches and leads to power gains in differential expression testing, particularly for shorter genes with high transcription rates. We applied our method to an unreplicated Arabidopsis RNA-seq dataset resulting in disparate gene ontologies arising from gene set enrichment analyses.

Availability And Implementation: R code to perform the RASTA procedures is freely available on the web at www.stat.purdue.edu/∼doerge/.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/btt575DOI Listing

Publication Analysis

Top Keywords

true gene
8
gene
5
robust adjustment
4
adjustment sequence
4
sequence tag
4
tag abundance
4
abundance motivation
4
motivation majority
4
majority next-generation
4
next-generation sequencing
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!