RNA-Seq optimization with eQTL gold standards.

BMC Genomics

McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, USA.

Published: December 2013

Background: RNA-Sequencing (RNA-Seq) experiments have been optimized for library preparation, mapping, and gene expression estimation. These methods, however, have revealed weaknesses in the next stages of analysis of differential expression, with results sensitive to systematic sample stratification or, in more extreme cases, to outliers. Further, a method to assess normalization and adjustment measures imposed on the data is lacking.

Results: To address these issues, we utilize previously published eQTLs as a novel gold standard at the center of a framework that integrates DNA genotypes and RNA-Seq data to optimize analysis and aid in the understanding of genetic variation and gene expression. After detecting sample contamination and sequencing outliers in RNA-Seq data, a set of previously published brain eQTLs was used to determine if sample outlier removal was appropriate. Improved replication of known eQTLs supported removal of these samples in downstream analyses. eQTL replication was further employed to assess normalization methods, covariate inclusion, and gene annotation. This method was validated in an independent RNA-Seq blood data set from the GTEx project and a tissue-appropriate set of eQTLs. eQTL replication in both data sets highlights the necessity of accounting for unknown covariates in RNA-Seq data analysis.

Conclusion: As each RNA-Seq experiment is unique with its own experiment-specific limitations, we offer an easily-implementable method that uses the replication of known eQTLs to guide each step in one's data analysis pipeline. In the two data sets presented herein, we highlight not only the necessity of careful outlier detection but also the need to account for unknown covariates in RNA-Seq experiments.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3890578PMC
http://dx.doi.org/10.1186/1471-2164-14-892DOI Listing

Publication Analysis

Top Keywords

rna-seq data
12
rna-seq
8
rna-seq experiments
8
gene expression
8
assess normalization
8
data
8
data set
8
replication eqtls
8
eqtl replication
8
data sets
8

Similar Publications

Background: Ovarian cancers (OC) and cervical cancers (CC) have poor survival rates. Tumor-infiltrating lymphocytes (TILs) play a pivotal role in prognosis, but shared immune mechanisms remain elusive.

Methods: We integrated single-cell RNA sequencing (scRNA-seq) and spatial transcriptomics (ST) to explore immune regulation in OC and CC, focusing on the PI3K/AKT pathway and FLT3 as key modulators.

View Article and Find Full Text PDF

Genetic variation in IL-4 activated tissue resident macrophages determines strain-specific synergistic responses to LPS epigenetically.

Nat Commun

January 2025

Type 2 Immunity Section, Laboratory of Parasitic Diseases, National Institute of Allergy and Infectious Diseases (NIAID), National Institutes of Health (NIH), Bethesda, MD, USA.

How macrophages in the tissue environment integrate multiple stimuli depends on the genetic background of the host, but this is still poorly understood. We investigate IL-4 activation of male C57BL/6 and BALB/c strain specific in vivo tissue-resident macrophages (TRMs) from the peritoneal cavity. C57BL/6 TRMs are more transcriptionally responsive to IL-4 stimulation, with induced genes associated with more super enhancers, induced enhancers, and topologically associating domains (TAD) boundaries.

View Article and Find Full Text PDF

Background: B7 homolog 3 (B7-H3), an overexpressed antigen across multiple solid cancers, represents a promising target for CAR T cell therapy. This study investigated the expression of B7-H3 across various solid tumors and developed novel monoclonal antibodies (mAbs) targeting B7-H3 for CAR T cell therapy.

Methods: Expression of B7-H3 across various solid tumors was evaluated using RNA-seq data from TCGA, TARGET, and GTEx datasets and by flow cytometry staining.

View Article and Find Full Text PDF

MMP2 regulates proliferation and differentiation in chicken primary myoblasts, and RNA-seq screens for key genes.

Gene

January 2025

Jiangxi Provincial Key Laboratory of Poultry Genetic Improvement, Nanchang 330032 China. Electronic address:

The growth and development of chicken skeletal muscle directly affects chicken meat production, which is very important for broiler industry. Matrix metallopeptidase 2 (MMP2) exists in skeletal muscle. However, the underlying regulating of MMP2 remain unknown.

View Article and Find Full Text PDF

HemaScope: A Tool for Analyzing Single-cell and Spatial Transcriptomics Data of Hematopoietic Cells.

Genomics Proteomics Bioinformatics

January 2025

Shanghai Institute of Hematology, State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Research Unit of Hematologic Malignancies Genomics and Translational Research of Chinese Academy of Medical Sciences, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China.

Single-cell RNA sequencing (scRNA-seq) and spatial transcriptomics (ST) techniques hold great value in evaluating the heterogeneity and spatial characteristics of hematopoietic cells within tissues. These two techniques are highly complementary, with scRNA-seq offering single-cell resolution and ST retaining spatial information. However, there is an urgent demand for well-organized and user-friendly toolkits capable of handling single-cell and spatial information.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!