While context-type-specific regulation of genes is largely determined by cis-regulatory regions, attempts to identify cell type-specific eQTLs are complicated by the nested nature of cell types. We present hierarchical eQTL (H-eQTL), a network-based model for hierarchical annotation of bulk-derived eQTLs to levels of a cell type tree using single-cell chromatin accessibility data and no clustering of cells into discrete cell types. Using our model, we annotate bulk-derived eQTLs from the developing brain with high specificity to levels of a cell type hierarchy, which allows sensitive detection of genes with multiple distinct non-coding elements regulating their expression in different cell types.
CellWalker2 is a graph diffusion-based method for single-cell genomics data integration. It extends the CellWalker model by incorporating hierarchical relationships between cell types, providing estimates of statistical significance, and adding data structures for analyzing multi-omics data so that gene expression and open chromatin can be jointly modeled. Our open-source software enables users to annotate cells using existing ontologies and to probabilistically match cell types between two or more contexts, including across species.
Nucleotide changes in gene regulatory elements are important determinants of neuronal development and diseases. Using massively parallel reporter assays in primary human cells from mid-gestation cortex and cerebral organoids, we interrogated the cis-regulatory activity of 102,767 open chromatin regions, including thousands of sequences with cell type-specific accessibility and variants associated with brain gene regulation. In primary cells, we identified 46,802 active enhancer sequences and 164 variants that alter enhancer activity.
Human accelerated regions (HARs) are conserved genomic loci that evolved at an accelerated rate in the human lineage and may underlie human-specific traits. We generated HARs and chimpanzee accelerated regions with an automated pipeline and an alignment of 241 mammalian genomes. Combining deep learning with chromatin capture experiments in human and chimpanzee neural progenitor cells, we discovered a significant enrichment of HARs in topologically associating domains containing human-specific genomic variants that change three-dimensional (3D) genome organization.
Genomic regulatory elements active in the developing human brain are notably enriched in genetic risk for neuropsychiatric disorders, including autism spectrum disorder (ASD), schizophrenia, and bipolar disorder. However, prioritizing the specific risk genes and candidate molecular mechanisms underlying these genetic enrichments has been hindered by the lack of a single unified large-scale gene regulatory atlas of human brain development. Here, we uniformly process and systematically characterize gene, isoform, and splicing quantitative trait loci (xQTLs) in 672 fetal brain samples from unique subjects across multiple ancestral populations.
Nucleotide changes in gene regulatory elements are important determinants of neuronal development and disease. Using massively parallel reporter assays in primary human cells from mid-gestation cortex and cerebral organoids, we interrogated the cis-regulatory activity of 102,767 sequences, including differentially accessible cell-type specific regions in the developing cortex and single-nucleotide variants associated with psychiatric disorders. In primary cells, we identified 46,802 active enhancer sequences and 164 disorder-associated variants that significantly alter enhancer activity.
Chronic inflammation and tissue fibrosis are common stress responses that worsen organ function, yet the molecular mechanisms governing their crosstalk are poorly understood. In diseased organs, stress-induced changes in gene expression fuel maladaptive cell state transitions and pathological interaction between diverse cellular compartments. Although chronic fibroblast activation worsens dysfunction of lung, liver, kidney, and heart, and exacerbates many cancers, the stress-sensing mechanisms initiating the transcriptional activation of fibroblasts are not well understood.
Motivation: Unsupervised clustering of single-cell transcriptomics is a powerful method for identifying cell populations. Static visualization techniques for single-cell clustering only display results for a single resolution parameter. Analysts will often evaluate more than one resolution parameter but then only report one.
Summary: CellWalkR is an R package that integrates single-cell open chromatin data with cell type labels and bulk epigenetic data to identify cell type-specific regulatory regions. A Graphics Processing Unit (GPU) implementation and downsampling strategies enable thousands of cells to be processed in seconds. CellWalkR's user-friendly interface provides interactive analysis and visualization of cell labels and regulatory region mappings.
During mammalian development, differences in chromatin state coincide with cellular differentiation and reflect changes in the gene regulatory landscape. In the developing brain, cell fate specification and topographic identity are important for defining cell identity and confer selective vulnerabilities to neurodevelopmental disorders. Here, to identify cell-type-specific chromatin accessibility patterns in the developing human brain, we used a single-cell assay for transposase accessibility by sequencing (scATAC-seq) in primary tissue samples from the human forebrain.
In diseased organs, stress-activated signalling cascades alter chromatin, thereby triggering maladaptive cell state transitions. Fibroblast activation is a common stress response in tissues that worsens lung, liver, kidney and heart disease, yet its mechanistic basis remains unclear. Pharmacological inhibition of bromodomain and extra-terminal domain (BET) proteins alleviates cardiac dysfunction, providing a tool to interrogate and modulate cardiac cell states as a potential therapeutic approach.
Single-cell and bulk genomics assays have complementary strengths and weaknesses, and alone neither strategy can fully capture regulatory elements across the diversity of cells in complex tissues. We present CellWalker, a method that integrates single-cell open chromatin (scATAC-seq) data with gene expression (RNA-seq) and other data types using a network model that simultaneously improves cell labeling in noisy scATAC-seq and annotates cell type-specific regulatory elements in bulk data. We demonstrate CellWalker's robustness to sparse annotations and noise using simulations and combined RNA-seq and ATAC-seq in individual cells.
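As a rough illustration of how a network model can propagate labels to cells that lack direct annotation, the sketch below runs a generic random walk with restart on a toy graph of label nodes and cell nodes. This is not CellWalker's implementation: the function name `random_walk_with_restart`, the toy edge weights, and the restart probability are all assumptions made for the example.

```python
def random_walk_with_restart(adj, seed, restart=0.5, iters=200):
    """Approximate steady-state visit probabilities of a walk that
    jumps back to `seed` with probability `restart` at every step."""
    n = len(adj)
    # Row-normalise edge weights into transition probabilities.
    trans = []
    for row in adj:
        total = sum(row)
        trans.append([w / total if total else 0.0 for w in row])
    p = [1.0 if i == seed else 0.0 for i in range(n)]
    for _ in range(iters):
        nxt = [0.0] * n
        for i in range(n):
            for j in range(n):
                nxt[j] += (1.0 - restart) * p[i] * trans[i][j]
        nxt[seed] += restart
        p = nxt
    return p


# Hypothetical toy graph. Nodes 0 and 1 are labels (type A, type B);
# nodes 2, 3 and 4 are cells. Cell 3 has no direct label edge, only
# cell-to-cell similarity edges (strong to cell 2, weak to cell 4).
adj = [
    # A    B    c2   c3   c4
    [0.0, 0.0, 1.0, 0.0, 0.0],  # label A -- cell 2
    [0.0, 0.0, 0.0, 0.0, 1.0],  # label B -- cell 4
    [1.0, 0.0, 0.0, 0.9, 0.0],  # cell 2
    [0.0, 0.0, 0.9, 0.0, 0.1],  # cell 3 (unlabelled)
    [0.0, 1.0, 0.0, 0.1, 0.0],  # cell 4
]

scores = random_walk_with_restart(adj, seed=3)
# Assign the unlabelled cell to whichever label node the walk visits more.
best_label = "A" if scores[0] > scores[1] else "B"
```

In this toy graph the walk started at cell 3 reaches label A more often than label B, because cell 3 is much more similar to the A-annotated cell 2 than to the B-annotated cell 4.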
To discover regulatory elements driving the specificity of gene expression in different cell types and regions of the developing human brain, we generated an atlas of open chromatin from nine dissected regions of the mid-gestation human telencephalon, as well as microdissected upper and deep layers of the prefrontal cortex. We identified a subset of open chromatin regions (OCRs), termed predicted regulatory elements (pREs), that are likely to function as developmental brain enhancers. pREs showed temporal, regional, and laminar differences in chromatin accessibility and were correlated with gene expression differences across regions and gestational ages.
Identifying cancer-relevant mutations in noncoding regions is challenging due to the large numbers of such mutations, their low levels of recurrence, and difficulties in interpreting their functional impact. To uncover genes that are dysregulated due to somatic mutations in cis, we build upon the concept of differential allele-specific expression (ASE) and introduce methods to identify genes within an individual's cancer whose ASE differs from what is found in matched normal tissue. When applied to breast cancer tumor samples, our methods detect the known allele-specific effects of copy number variation and nonsense-mediated decay.
Due to an error introduced during copyediting of this article [1], reference [8] incorrectly reads.
A major aim of cancer genomics is to pinpoint which somatically mutated genes are involved in tumor initiation and progression. We introduce a new framework for uncovering cancer genes, differential mutation analysis, which compares the mutational profiles of genes across cancer genomes with their natural germline variation across healthy individuals. We present DiffMut, a fast and simple approach for differential mutational analysis, and demonstrate that it is more effective in discovering cancer genes than considerably more sophisticated approaches.
Summary: SimBoolNet is an open source Cytoscape plugin that simulates the dynamics of signaling transduction using Boolean networks. Given a user-specified level of stimulation to signal receptors, SimBoolNet simulates the response of downstream molecules and visualizes with animation and records the dynamic changes of the network. It can be used to generate hypotheses and facilitate experimental studies about causal relations and crosstalk among cellular signaling pathways.
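To make the idea of simulating a Boolean network concrete, here is a minimal sketch of a plain deterministic, synchronous update scheme on a hypothetical four-node pathway. It does not reproduce SimBoolNet's own update rule or its handling of stimulation levels; the node names, the rules, and the `simulate` helper are all invented for illustration.

```python
def simulate(rules, state, steps):
    """Synchronously update every node from the previous state and
    return the full list of visited states (initial state included)."""
    trajectory = [dict(state)]
    for _ in range(steps):
        # All rules read the old state, so updates happen in lockstep.
        state = {node: rule(state) for node, rule in rules.items()}
        trajectory.append(state)
    return trajectory


# Hypothetical pathway: receptor -> kinase -> target, with an inhibitor
# that blocks the kinase. Receptor and inhibitor are held at their input values.
rules = {
    "receptor": lambda s: s["receptor"],
    "inhibitor": lambda s: s["inhibitor"],
    "kinase": lambda s: s["receptor"] and not s["inhibitor"],
    "target": lambda s: s["kinase"],
}

start = {"receptor": True, "inhibitor": False, "kinase": False, "target": False}
trajectory = simulate(rules, start, steps=3)
```

With the receptor switched on and the inhibitor off, the kinase turns on after one step and the target after two, which shows how a stimulus propagates downstream one layer per update.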
Background: Cellular response to external stimuli requires propagation of corresponding signals through molecular signaling pathways. However, signaling pathways are not isolated information highways, but rather interact in a number of ways forming sophisticated signaling networks. Since defects in signaling pathways are associated with many serious diseases, understanding of the crosstalk between them is fundamental for designing molecularly targeted therapy.