Exploratory analysis of single-cell RNA sequencing (scRNA-seq) typically relies on hard clustering over two-dimensional projections like uniform manifold approximation and projection (UMAP). However, such methods can severely distort the data and have many arbitrary parameter choices. Methods that can model scRNA-seq data as non-discrete "gene expression programs" (GEPs) can better preserve the data's structure, but currently, they are often not scalable, not consistent across repeated runs, and lack an established method for choosing key parameters.
View Article and Find Full Text PDFRetinoic acid (RA) is a standard-of-care neuroblastoma drug thought to be effective by inducing differentiation. Curiously, RA has little effect on primary human tumors during upfront treatment but can eliminate neuroblastoma cells from the bone marrow during post-chemo consolidation therapy-a discrepancy that has never been explained. To investigate this, we treated a large cohort of neuroblastoma cell lines with RA and observed that the most RA-sensitive cells predominantly undergo apoptosis or senescence, rather than differentiation.
View Article and Find Full Text PDFNeuroblastoma is a common pediatric cancer, where preclinical studies suggest that a mesenchymal-like gene expression program contributes to chemotherapy resistance. However, clinical outcomes remain poor, implying we need a better understanding of the relationship between patient tumor heterogeneity and preclinical models. Here, we generated single-cell RNA-seq maps of neuroblastoma cell lines, patient-derived xenograft models (PDX), and a genetically engineered mouse model (GEMM).
View Article and Find Full Text PDFCombination chemotherapy is crucial for successfully treating cancer. However, the enormous number of possible drug combinations means discovering safe and effective combinations remains a significant challenge. To improve this process, we conduct large-scale targeted CRISPR knockout screens in drug-treated cells, creating a genetic map of druggable genes that sensitize cells to commonly used chemotherapeutics.
View Article and Find Full Text PDFGene set analysis (GSA) remains a common step in genome-scale studies because it can reveal insights that are not apparent from results obtained for individual genes. Many different computational tools are applied for GSA, which may be sensitive to different types of signals; however, most methods implicitly test whether there are differences in the distribution of the effect of some experimental condition between genes in gene sets of interest. We have developed a unifying framework for GSA that first fits effect size distributions, and then tests for differences in these distributions between gene sets.
View Article and Find Full Text PDFSpatial transcriptomics technologies have recently emerged as a powerful tool for measuring spatially resolved gene expression directly in tissues sections, revealing cell types and their dysfunction in unprecedented detail. However, spatial transcriptomics technologies are limited in their ability to separate transcriptionally similar cell types and can suffer further difficulties identifying cell types in slide regions where transcript capture is low. Here, we describe a conceptually novel methodology that can computationally integrate spatial transcriptomics data with cell-type-informative paired tissue images, obtained from, for example, the reverse side of the same tissue section, to improve inferences of tissue cell type composition in spatial transcriptomics data.
View Article and Find Full Text PDFSurvival in high-risk pediatric neuroblastoma has remained around 50% for the last 20 years, with immunotherapies and targeted therapies having had minimal impact. Here, we identify the small molecule CX-5461 as selectively cytotoxic to high-risk neuroblastoma and synergistic with low picomolar concentrations of topoisomerase I inhibitors in improving survival in vivo in orthotopic patient-derived xenograft neuroblastoma mouse models. CX-5461 recently progressed through phase I clinical trial as a first-in-human inhibitor of RNA-POL I.
View Article and Find Full Text PDF(1) Background: Drug imputation methods often aim to translate in vitro drug response to in vivo drug efficacy predictions. While commonly used in retrospective analyses, our aim is to investigate the use of drug prediction methods for the generation of novel drug discovery hypotheses. Triple-negative breast cancer (TNBC) is a severe clinical challenge in need of new therapies.
View Article and Find Full Text PDFProc Natl Acad Sci U S A
October 2019
Large-scale cancer cell line screens have identified thousands of protein-coding genes (PCGs) as biomarkers of anticancer drug response. However, systematic evaluation of long noncoding RNAs (lncRNAs) as pharmacogenomic biomarkers has so far proven challenging. Here, we study the contribution of lncRNAs as drug response predictors beyond spurious associations driven by correlations with proximal PCGs, tissue lineage, or established biomarkers.
View Article and Find Full Text PDFLong non-coding RNAs (lncRNAs) play an important role in gene regulation and are increasingly being recognized as crucial mediators of disease pathogenesis. However, the vast majority of published transcriptome datasets lack high-quality lncRNA profiles compared to protein-coding genes (PCGs). Here we propose a framework to harnesses the correlative expression patterns between lncRNA and PCGs to impute unknown lncRNA profiles.
View Article and Find Full Text PDFExpression quantitative trait loci (eQTLs) identified using tumor gene expression data could affect gene expression in cancer cells, tumor-associated normal cells, or both. Here, we have demonstrated a method to identify eQTLs affecting expression in cancer cells by modeling the statistical interaction between genotype and tumor purity. Only one third of breast cancer risk variants, identified as eQTLs from a conventional analysis, could be confidently attributed to cancer cells.
View Article and Find Full Text PDFObtaining accurate drug response data in large cohorts of cancer patients is very challenging; thus, most cancer pharmacogenomics discovery is conducted in preclinical studies, typically using cell lines and mouse models. However, these platforms suffer from serious limitations, including small sample sizes. Here, we have developed a novel computational method that allows us to impute drug response in very large clinical cancer genomics data sets, such as The Cancer Genome Atlas (TCGA).
View Article and Find Full Text PDFCarter and colleagues propose a systematic analysis of the germline and somatic genome in cancer. They identify interactions that occur between germline and somatic variants. This elucidates the function of the germline genome in the context of cancer risk and development.
View Article and Find Full Text PDFThe Huang Lab was established in 2009 at the University of Chicago and has since been active in conducting pharmacogenomic research. Our laboratory's main research focus is translational pharmacogenomics with a particular interest in the pharmacogenomics of anticancer agents. By systematically evaluating the human genome and its relationships to drug response and toxicity, our goal is to develop clinically useful models that predict risk for adverse drug reactions and nonresponse prior to administration of chemotherapy.
View Article and Find Full Text PDFWe show that variability in general levels of drug sensitivity in pre-clinical cancer models confounds biomarker discovery. However, using a very large panel of cell lines, each treated with many drugs, we could estimate a general level of sensitivity to all drugs in each cell line. By conditioning on this variable, biomarkers were identified that were more likely to be effective in clinical trials than those identified using a conventional uncorrected approach.
View Article and Find Full Text PDFBMC Bioinformatics
September 2015
Background: Quantifying gene expression by RNA-Seq has several advantages over microarrays, including greater dynamic range and gene expression estimates on an absolute, rather than a relative scale. Nevertheless, microarrays remain in widespread use, demonstrated by the ever-growing numbers of samples deposited in public repositories.
Results: We propose a novel approach to microarray analysis that attains many of the advantages of RNA-Seq.
J Natl Cancer Inst
November 2015
Background: Many disparate biomarkers have been proposed as predictors of response to histone deacetylase inhibitors (HDI); however, all have failed when applied clinically. Rather than this being entirely an issue of reproducibility, response to the HDI vorinostat may be determined by the additive effect of multiple molecular factors, many of which have previously been demonstrated.
Methods: We conducted a large-scale gene expression analysis using the Cancer Genome Project for discovery and generated another large independent cancer cell line dataset across different cancers for validation.
We recently described a methodology that reliably predicted chemotherapeutic response in multiple independent clinical trials. The method worked by building statistical models from gene expression and drug sensitivity data in a very large panel of cancer cell lines, then applying these models to gene expression data from primary tumor biopsies. Here, to facilitate the development and adoption of this methodology we have created an R package called pRRophetic.
View Article and Find Full Text PDFAm J Respir Crit Care Med
September 2014
Rationale: Most genomic studies of lung function have used phenotypic data derived from a single time-point (e.g., presence/absence of disease) without considering the dynamic progression of a chronic disease.
View Article and Find Full Text PDFObjectives: Poly (ADP-ribose) polymerase inhibitors (PARPi) have shown single agent activity against tumors with deficiencies in the DNA repair mechanism homologous recombination including, but not limited to those harboring BRCA mutations. We hypothesized that, in the context of homologous recombination deficiency (HRD), PARPi could have an effect in head and neck cancer (HNC).
Materials And Methods: We evaluated TCGA data for evidence of HRD using a copy number data signature established for breast cancer.
Background: Using genome-wide genetic, gene expression, and microRNA expression (miRNA) data, we developed an integrative approach to investigate the genetic and epigenetic basis of chemotherapeutic sensitivity.
Results: Through a sequential multi-stage framework, we identified genes and miRNAs whose expression correlated with platinum sensitivity, mapped these to genomic loci as quantitative trait loci (QTLs), and evaluated the associations between these QTLs and platinum sensitivity. A permutation analysis showed that top findings from our approach have a much lower false discovery rate compared to those from a traditional GWAS of drug sensitivity.