With the increased use of next-generation sequencing generating large amounts of genomic data, gene expression signatures are becoming critically important tools for the interpretation of these data, and are poised to have a substantial effect on diagnosis, management, and prognosis for a number of diseases. It is becoming crucial to establish whether the expression patterns and statistical properties of sets of genes, or gene signatures, are conserved across independent datasets. Conversely, it is necessary to compare established signatures on the same dataset to better understand how they capture different clinical or biological characteristics. Here we describe how to use sigQC, a tool that enables a streamlined, systematic approach for the evaluation of previously obtained gene signatures across multiple gene expression datasets. We implemented sigQC in an R package, making it accessible to users who have knowledge of file input/output and matrix manipulation in R and a moderate grasp of core statistical principles. SigQC has been adopted in basic biology and translational studies, including, but not limited to, the evaluation of multiple gene signatures for potential clinical use as cancer biomarkers. This protocol uses a previously obtained signature for breast cancer metastasis as an example to illustrate the critical quality control steps involved in evaluating its expression, variability, and structure in breast tumor RNA-sequencing data, a different dataset from that in which the signature was originally derived. We demonstrate how the outputs created from sigQC can be used for the evaluation of gene signatures on large-scale gene expression datasets.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1038/s41596-019-0136-8 | DOI Listing |
Mol Neurobiol
January 2025
Department of Anesthesiology, Yijishan Hospital, First Affiliated Hospital of Wannan Medical College, Wuhu, 241004, China.
Stroke is the second-leading global cause of death. The damage attributed to the immune storm triggered by ischemia-reperfusion injury (IRI) post-stroke is substantial. However, data on the transcriptomic dynamics of pyroptosis in IRI are limited.
View Article and Find Full Text PDFFunct Integr Genomics
January 2025
Institute of Infectious Diseases, Guangdong Province, Guangzhou Eighth People's Hospital, Guangzhou Medical University, 8 Huaying Road, Baiyun District, Guangzhou, 510440, China.
Hepatocellular carcinoma (HCC) remains a malignant and life-threatening tumor with an extremely poor prognosis, posing a significant global health challenge. Despite the continuous emergence of novel therapeutic agents, patients exhibit substantial heterogeneity in their responses to anti-tumor drugs and overall prognosis. The pentose phosphate pathway (PPP) is highly activated in various tumor cells and plays a pivotal role in tumor metabolic reprogramming.
View Article and Find Full Text PDFNucleic Acids Res
January 2025
Institute of Biotechnology, Life Sciences Center, Vilnius University, Vilnius, 10257, Lithuania.
The expansion of single-cell analytical techniques has empowered the exploration of diverse biological questions at the individual cells. Droplet-based single-cell RNA sequencing (scRNA-seq) methods have been particularly widely used due to their high-throughput capabilities and small reaction volumes. While commercial systems have contributed to the widespread adoption of droplet-based scRNA-seq, their relatively high cost limits the ability to profile large numbers of cells and samples.
View Article and Find Full Text PDFAnn Med
December 2025
Department of Radio-Chemotherapy, Affiliated Hospital of Yangzhou University, Yangzhou University, Yangzhou, Jiangsu, China.
Background: Non-small cell lung cancer (NSCLC) is a fatal disease, and radioresistance is an important factor leading to treatment failure and disease progression. The objective of this research was to detect radioresistance-related genes (RRRGs) with prognostic value in NSCLC.
Methods: The weighted gene coexpression network analysis (WGCNA) and differentially expressed genes (DEGs) analysis were performed to identify RRRGs using expression profiles from TCGA and GEO databases.
Cancers (Basel)
January 2025
Avantyx Pharmaceuticals, Miami, FL 33136, USA.
There are numerous ways of portraying cancer complexity based on combining multiple types of data. A common approach involves developing signatures from gene expression profiles to highlight a few key reproducible features that provide insight into cancer risk, progression, or recurrence. Normally, a selection of such features is made through relevance or significance, given a reference context.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!