Estimation of data-specific constitutive exons with RNA-Seq data.

BMC Bioinformatics

School of Mathematics and Statistics, University of Sydney, Sydney NSW 2006, Australia.

Published: January 2013

Background: RNA-Seq has the potential to answer many diverse and interesting questions about the inner workings of cells. Estimating changes in the overall transcription of a gene is not straightforward. Changes in overall gene transcription can easily be confounded with changes in exon usage which alter the lengths of transcripts produced by a gene. Measuring the expression of constitutive exons (exons which are consistently conserved after splicing) offers an unbiased estimation of the overall transcription of a gene.
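The confounding described in the Background can be illustrated with a small toy calculation. The sketch below is not taken from the paper: the exon names, lengths, isoform structures and the sequencing rate of one read per 100 bases per transcript copy are all hypothetical. It shows two conditions with the same number of transcripts but different exon usage, where whole-gene read counts differ while counts restricted to the exons kept in every isoform do not.

```python
# Toy illustration with hypothetical numbers: one gene, three exons.
EXON_LENGTHS = {"e1": 200, "e2": 1000, "e3": 300}  # lengths in bases (assumed)


def expected_reads(transcripts):
    """Expected reads per exon for a list of (exons, copies) isoform entries.

    Assumes, for illustration only, one read per 100 bases per transcript copy.
    """
    reads = {exon: 0 for exon in EXON_LENGTHS}
    for exons, copies in transcripts:
        for exon in exons:
            reads[exon] += copies * EXON_LENGTHS[exon] // 100
    return reads


# Both conditions produce 100 transcripts in total (same overall transcription).
cond_a = [(("e1", "e2", "e3"), 100)]                       # e2 always included
cond_b = [(("e1", "e2", "e3"), 20), (("e1", "e3"), 80)]    # e2 mostly skipped

reads_a = expected_reads(cond_a)
reads_b = expected_reads(cond_b)

# Whole-gene counts change even though transcription did not.
whole_gene_a = sum(reads_a.values())
whole_gene_b = sum(reads_b.values())

# Counts over the exons present in every isoform stay the same.
constitutive = ("e1", "e3")
const_a = sum(reads_a[e] for e in constitutive)
const_b = sum(reads_b[e] for e in constitutive)

print(whole_gene_a, whole_gene_b)  # whole-gene totals differ
print(const_a, const_b)            # constitutive totals agree
```

In this toy case, a naive whole-gene comparison would suggest a drop in expression that is entirely due to skipping the long exon e2.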

Results: We propose a clustering-based method, exClust, for estimating the exons that are consistently conserved after splicing in a given data set. These are considered as the exons which are "constitutive" in this data. The method utilises information from both annotation and the dataset of interest. The method is implemented in an openly available R function package, sydSeq.
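The abstract does not describe how exClust clusters exons, so the following is only a generic sketch of the idea of selecting "constitutive" exons from data, not the authors' algorithm and not the sydSeq API. It assumes a simple one-dimensional two-means clustering on each exon's lowest log read density across samples and, unlike exClust, ignores annotation. All exon names, lengths and counts are hypothetical.

```python
import math


def two_means_1d(values, iters=50):
    """Split values into a low and a high cluster; True marks the high cluster."""
    lo, hi = min(values), max(values)
    labels = [True] * len(values)
    for _ in range(iters):
        # Assign each value to the nearer centre; ties go to the high cluster.
        labels = [abs(v - hi) <= abs(v - lo) for v in values]
        high = [v for v, is_high in zip(values, labels) if is_high]
        low = [v for v, is_high in zip(values, labels) if not is_high]
        new_lo = sum(low) / len(low) if low else lo
        new_hi = sum(high) / len(high) if high else hi
        if (new_lo, new_hi) == (lo, hi):
            break  # centres stopped moving
        lo, hi = new_lo, new_hi
    return labels


# Hypothetical data: exon -> (length in bases, read counts in two samples).
exons = {
    "e1": (200, [210, 190]),
    "e2": (1000, [1000, 200]),  # well covered in sample 1, mostly skipped in sample 2
    "e3": (300, [300, 310]),
    "e4": (500, [100, 90]),     # rarely included in either sample
}

# Summarise each exon by its lowest log density (a pseudocount of 1 avoids log(0)).
names = list(exons)
scores = [
    min(math.log((count + 1) / length) for count in counts)
    for length, counts in (exons[name] for name in names)
]

labels = two_means_1d(scores)
data_constitutive = sorted(name for name, keep in zip(names, labels) if keep)
print(data_constitutive)
```

Using the minimum density across samples is a design choice of this sketch: an exon is kept only if it is well covered in every sample, which matches the notion of being consistently conserved in the data set at hand.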

Conclusion: When used on two real datasets exClust includes more than three times as many reads as the standard UI method, and improves concordance with qRT-PCR data. When compared to other methods, our method is shown to produce robust estimates of overall gene transcription.
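For comparison, the UI baseline can be sketched in a few lines, assuming that "UI" here refers to the union-intersection gene model, which keeps only the exonic regions shared by every annotated isoform of a gene. The abstract does not spell this out, so treat the definition as an assumption. The sketch works at the level of whole exons rather than bases, and the transcript and exon names are hypothetical.

```python
def ui_exons(isoforms):
    """Return the exons present in every annotated isoform of a gene."""
    exon_sets = [set(exon_list) for exon_list in isoforms.values()]
    return set.intersection(*exon_sets) if exon_sets else set()


# Hypothetical annotation: transcript -> exons it contains.
annotation = {
    "tx1": ["e1", "e2", "e3", "e4"],
    "tx2": ["e1", "e3", "e4"],
    "tx3": ["e1", "e3"],
}

shared = ui_exons(annotation)
print(sorted(shared))
```

Because a single rarely expressed isoform such as tx3 removes e4 from the shared set, an annotation-only rule of this kind can discard many reads, which is consistent with the abstract's report that a data-specific choice of exons includes more reads.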


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3656776
DOI: http://dx.doi.org/10.1186/1471-2105-14-31

