Spatial molecular data has transformed the study of disease microenvironments; however, larger datasets pose an analytical challenge, prompting the direct adoption of single-cell RNA-sequencing (scRNA-seq) tools, including normalization methods. Here, we demonstrate that library size is associated with tissue structure and that normalizing these effects out using commonly applied scRNA-seq normalization methods will negatively affect spatial domain identification. Spatial data should not be specifically corrected for library size prior to analysis, and algorithms designed for scRNA-seq data should be adopted with caution.
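The mechanism described above can be illustrated with a minimal sketch. It is not the authors' analysis: the function name `library_size_normalize`, the choice of the median as the target total, and the toy count matrix are all assumptions made for illustration. The toy data are built so that two "dense" spots and two "sparse" spots differ only in total counts, which is the situation in which library size itself carries the tissue-structure signal.

```python
import numpy as np

def library_size_normalize(counts, scale=None):
    """Rescale each spot (row) so that its counts sum to a common total.

    Hypothetical helper for illustration; assumes every spot has a
    non-zero library size.
    """
    counts = np.asarray(counts, dtype=float)
    lib_size = counts.sum(axis=1)          # total counts per spot
    if scale is None:
        scale = np.median(lib_size)        # assumed target total
    normalized = counts / lib_size[:, None] * scale
    return normalized, lib_size

# Toy matrix (4 spots x 3 genes): the first two spots mimic a dense region,
# the last two a sparse region. Gene proportions are identical everywhere.
counts = np.array([
    [40, 40, 20],
    [80, 80, 40],
    [4, 4, 2],
    [8, 8, 4],
])

normalized, lib_size = library_size_normalize(counts)

print("library sizes:", lib_size)    # differ strongly between regions
print("after normalization:")
print(normalized)                    # all rows become identical
```

In this constructed example the only difference between the two regions is library size, so scaling every spot to the same total makes the rows indistinguishable. Real tissue is less extreme, but the sketch shows why removing library size can also remove structure that a spatial domain method would otherwise use.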
Single-cell resolved spatial transcriptomics, used to delineate immune responses during SARS-CoV-2 infection, identified elevated expression of IFI27 in M1 macrophages in areas of infection.
Functional analysis of high throughput experiments using pathway analysis is now ubiquitous. Though powerful, these methods often produce thousands of redundant results owing to knowledgebase redundancies upstream. This scale of results hinders extensive exploration by biologists and can lead to investigator biases due to previous knowledge and expectations.
To gain a better understanding of the complexity of gene expression in normal and diseased tissues, it is important to account for the spatial context and identity of cells in situ. State-of-the-art spatial profiling technologies, such as the NanoString GeoMx Digital Spatial Profiler (DSP), now allow quantitative spatially resolved measurement of the transcriptome in tissues. However, the bioinformatics pipelines currently used to analyse GeoMx data often fail to account for the technical variability within the data and the complexity of experimental designs, thus limiting the accuracy and reliability of the subsequent analysis.
KAT6A, and its paralog KAT6B, are histone lysine acetyltransferases (HAT) that acetylate histone H3K23 and exert an oncogenic role in several tumor types including breast cancer where KAT6A is frequently amplified/overexpressed. However, pharmacologic targeting of KAT6A to achieve therapeutic benefit has been a challenge. Here we describe identification of a highly potent, selective, and orally bioavailable KAT6A/KAT6B inhibitor CTx-648 (PF-9363), derived from a benzisoxazole series, which demonstrates anti-tumor activity in correlation with H3K23Ac inhibition in KAT6A over-expressing breast cancer.
Gene-set analysis (GSA) dominates the functional interpretation of omics data and downstream hypothesis generation. Despite its ability to summarise thousands of measurements into semantically interpretable components, GSA often results in hundreds of significantly enriched gene-sets. However, summarisation and effective visualisation of GSA results to facilitate hypothesis generation is still lacking.
Background: Medulloblastoma (MB) is a malignant tumour of the cerebellum which can be classified into four major subgroups based on gene expression and genomic features. Single-cell transcriptome studies have defined the cellular states underlying each MB subgroup; however, the spatial organisation of these diverse cell states and how this impacts response to therapy remain to be determined.
Methods: Here, we used spatially resolved transcriptomics to define the cellular diversity within a sonic hedgehog (SHH) patient-derived model of MB and show that cells specific to a transcriptional state or spatial location are pivotal for the response to treatment with the CDK4/6 inhibitor Palbociclib.
The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is known to present with pulmonary and extra-pulmonary organ complications. In comparison with the 2009 pandemic (pH1N1), SARS-CoV-2 infection is likely to lead to more severe disease, with multi-organ effects, including cardiovascular disease. SARS-CoV-2 has been associated with acute and long-term cardiovascular disease, but the molecular changes that govern this remain unknown.
The development of therapies that target specific disease subtypes has dramatically improved outcomes for patients with breast cancer. However, survival gains have not been uniform across patients, even within a given molecular subtype. Large collections of publicly available drug screening data matched with transcriptomic measurements have facilitated the development of computational models that predict response to therapy.
Background: The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), which emerged in late 2019, has spread globally, causing a pandemic of respiratory illness designated coronavirus disease 2019 (COVID-19). A better definition of the pulmonary host response to SARS-CoV-2 infection is required to understand viral pathogenesis and to validate putative COVID-19 biomarkers that have been proposed in clinical studies.
Methods: Here, we use targeted transcriptomics of formalin-fixed paraffin-embedded tissue using the NanoString GeoMx platform to generate an in-depth picture of the pulmonary transcriptional landscape of COVID-19, pandemic H1N1 influenza and uninfected control patients.
Background: Medulloblastoma (MB) is the most common malignant paediatric brain tumour and a leading cause of cancer-related mortality and morbidity. Existing treatment protocols are aggressive in nature, resulting in significant neurological, intellectual and physical disabilities for the children undergoing treatment. Thus, there is an urgent need for improved, targeted therapies that minimize these harmful side effects.
Gene expression signatures have been critical in defining the molecular phenotypes of cells, tissues, and patient samples. Their most notable and widespread clinical application is stratification of breast cancer patients into molecular (PAM50) subtypes. The cost and relatively large amounts of fresh starting material required for whole-transcriptome sequencing have limited clinical application of thousands of existing gene signatures captured in repositories such as the Molecular Signature Database.
Advances in RNA sequencing (RNA-seq) technologies that measure the transcriptome of biological samples have revolutionised our ability to understand transcriptional regulatory programs that underpin diseases such as cancer. We recently published singscore, a single-sample, rank-based gene set scoring method which quantifies how concordant the transcriptional profile of an individual sample is with specific gene sets of interest. Here we demonstrate the application of singscore to investigate transcriptional profiles associated with specific mutations or genetic lesions in acute myeloid leukemia.
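The general idea of single-sample, rank-based scoring can be sketched as follows. This is a simplified illustration and not the singscore implementation, whose normalisation and handling of directional gene sets may differ; the function name `rank_score`, the gene labels and the expression values are hypothetical.

```python
import numpy as np

def rank_score(expression, gene_names, gene_set):
    """Mean within-sample rank of the gene set, scaled to (0, 1].

    Hypothetical helper: uses only the one sample passed in, so the
    score does not depend on any other sample in a cohort.
    """
    expression = np.asarray(expression, dtype=float)
    gene_names = np.asarray(gene_names)
    # Ranks 1..N in ascending order of expression (ties broken by position).
    ranks = expression.argsort().argsort() + 1
    in_set = np.isin(gene_names, list(gene_set))
    if not in_set.any():
        raise ValueError("no gene-set members found in this sample")
    return ranks[in_set].mean() / expression.size

# Toy sample with four genes (labels and values are made up).
genes = ["A", "B", "C", "D"]
sample = [5.0, 1.0, 9.0, 3.0]

score = rank_score(sample, genes, {"A", "C"})
print(score)  # 0.875: A and C hold ranks 3 and 4 out of 4

# Ranks are unchanged by monotone transforms such as log1p.
log_score = rank_score(np.log1p(sample), genes, {"A", "C"})
```

Because the score is computed from ranks within one sample, it does not change when other samples are added or removed, and it is unaffected by any monotone rescaling of that sample's expression values.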
Background: Elucidation of regulatory networks, including identification of regulatory mechanisms specific to a given biological context, is a key aim in systems biology. This has motivated the move from co-expression to differential co-expression analysis and numerous methods have been developed subsequently to address this task; however, evaluation of methods and interpretation of the resulting networks has been hindered by the lack of known context-specific regulatory interactions.
Results: In this study, we develop a simulator based on dynamical systems modelling capable of simulating differential co-expression patterns.
Background: Gene set scoring provides a useful approach for quantifying concordance between sample transcriptomes and selected molecular signatures. Most methods use information from all samples to score an individual sample, leading to unstable scores in small data sets and introducing biases from sample composition (e.g.