Guidelines for using sigQC for systematic evaluation of gene signatures.

Nat Protoc

Computational Biology and Integrative Genomics Lab, MRC/CRUK Oxford Institute and Department of Oncology, University of Oxford, Oxford, UK.

Published: May 2019

With the increased use of next-generation sequencing generating large amounts of genomic data, gene expression signatures are becoming critically important tools for the interpretation of these data, and are poised to have a substantial effect on diagnosis, management, and prognosis for a number of diseases. It is becoming crucial to establish whether the expression patterns and statistical properties of sets of genes, or gene signatures, are conserved across independent datasets. Conversely, it is necessary to compare established signatures on the same dataset to better understand how they capture different clinical or biological characteristics. Here we describe how to use sigQC, a tool that enables a streamlined, systematic approach for the evaluation of previously obtained gene signatures across multiple gene expression datasets. We implemented sigQC in an R package, making it accessible to users who have knowledge of file input/output and matrix manipulation in R and a moderate grasp of core statistical principles. SigQC has been adopted in basic biology and translational studies, including, but not limited to, the evaluation of multiple gene signatures for potential clinical use as cancer biomarkers. This protocol uses a previously obtained signature for breast cancer metastasis as an example to illustrate the critical quality control steps involved in evaluating its expression, variability, and structure in breast tumor RNA-sequencing data, a different dataset from that in which the signature was originally derived. We demonstrate how the outputs created from sigQC can be used for the evaluation of gene signatures on large-scale gene expression datasets.

Download full-text PDF

Source
http://dx.doi.org/10.1038/s41596-019-0136-8DOI Listing

Publication Analysis

Top Keywords

gene signatures
20
evaluation gene
12
gene expression
12
gene
8
multiple gene
8
expression datasets
8
signatures
7
expression
5
guidelines sigqc
4
sigqc systematic
4

Similar Publications

Stroke is the second-leading global cause of death. The damage attributed to the immune storm triggered by ischemia-reperfusion injury (IRI) post-stroke is substantial. However, data on the transcriptomic dynamics of pyroptosis in IRI are limited.

View Article and Find Full Text PDF

Unraveling the potential mechanism and prognostic value of pentose phosphate pathway in hepatocellular carcinoma: a comprehensive analysis integrating bulk transcriptomics and single-cell sequencing data.

Funct Integr Genomics

January 2025

Institute of Infectious Diseases, Guangdong Province, Guangzhou Eighth People's Hospital, Guangzhou Medical University, 8 Huaying Road, Baiyun District, Guangzhou, 510440, China.

Hepatocellular carcinoma (HCC) remains a malignant and life-threatening tumor with an extremely poor prognosis, posing a significant global health challenge. Despite the continuous emergence of novel therapeutic agents, patients exhibit substantial heterogeneity in their responses to anti-tumor drugs and overall prognosis. The pentose phosphate pathway (PPP) is highly activated in various tumor cells and plays a pivotal role in tumor metabolic reprogramming.

View Article and Find Full Text PDF

The expansion of single-cell analytical techniques has empowered the exploration of diverse biological questions at the individual cells. Droplet-based single-cell RNA sequencing (scRNA-seq) methods have been particularly widely used due to their high-throughput capabilities and small reaction volumes. While commercial systems have contributed to the widespread adoption of droplet-based scRNA-seq, their relatively high cost limits the ability to profile large numbers of cells and samples.

View Article and Find Full Text PDF

Background: Non-small cell lung cancer (NSCLC) is a fatal disease, and radioresistance is an important factor leading to treatment failure and disease progression. The objective of this research was to detect radioresistance-related genes (RRRGs) with prognostic value in NSCLC.

Methods: The weighted gene coexpression network analysis (WGCNA) and differentially expressed genes (DEGs) analysis were performed to identify RRRGs using expression profiles from TCGA and GEO databases.

View Article and Find Full Text PDF

Prioritizing Context-Dependent Cancer Gene Signatures in Networks.

Cancers (Basel)

January 2025

Avantyx Pharmaceuticals, Miami, FL 33136, USA.

There are numerous ways of portraying cancer complexity based on combining multiple types of data. A common approach involves developing signatures from gene expression profiles to highlight a few key reproducible features that provide insight into cancer risk, progression, or recurrence. Normally, a selection of such features is made through relevance or significance, given a reference context.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!