Publications by authors named "Dobin A"

Objective: The optimal therapeutic response in cancer patients is highly dependent upon the differentiation state of their tumours. Pancreatic ductal adenocarcinoma (PDA) is a lethal cancer that harbours distinct phenotypic subtypes with preferential sensitivities to standard therapies. This study aimed to investigate intratumour heterogeneity and plasticity of cancer cell states in PDA in order to reveal cell state-specific regulators.

Glioblastoma multiforme (GBM) is an aggressive, heterogeneous brain tumor in which glioblastoma stem cells (GSCs) are known culprits of therapy resistance. Long non-coding RNAs (lncRNAs) have been shown to play a critical role in both cancer and normal biology. A few studies have suggested that aberrant expression of lncRNAs is associated with GSCs.

The cognitive abilities of humans are distinctive among primates, but their molecular and cellular substrates are poorly understood. We used comparative single-nucleus transcriptomics to analyze samples of the middle temporal gyrus (MTG) from adult humans, chimpanzees, gorillas, rhesus macaques, and common marmosets to understand human-specific features of the neocortex. Human, chimpanzee, and gorilla MTG showed highly similar cell-type composition and laminar organization as well as a large shift in proportions of deep-layer intratelencephalic-projecting neurons compared with macaque and marmoset MTG.

Article Synopsis
  • The study investigates how enhanced cognitive functions in humans may relate to increased brain cell diversity and cortical expansion, focusing on single-cell expression data from five primate species, including humans and non-human primates.
  • Researchers identified 57 homologous cell types and found significant gene expression differences, with 24% of genes showing variation between humans and non-human primates, which are linked to various brain disorders.
  • The analysis reveals that certain genes exhibit unique human-specific expression patterns and co-expression relationships, suggesting these genes may have evolved under relaxed constraints, potentially influencing the rapid evolution of brain function in humans.

Recurrent chromosomal rearrangements found in rhabdomyosarcoma (RMS) produce the PAX3-FOXO1 fusion protein, which is an oncogenic driver and a dependency in this disease. One important function of PAX3-FOXO1 is to arrest myogenic differentiation, which is linked to the ability of RMS cells to gain an unlimited proliferation potential. Here, we developed a phenotypic screening strategy for identifying factors that collaborate with PAX3-FOXO1 to block myo-differentiation in RMS.

The Encyclopedia of DNA Elements (ENCODE) project is a collaborative effort to create a comprehensive catalog of functional elements in the human genome. The current database comprises more than 19000 functional genomics experiments across more than 1000 cell lines and tissues using a wide array of experimental techniques to study the chromatin structure, regulatory and transcriptional landscape of the human and mouse genomes. All experimental data, metadata, and associated computational analyses created by the ENCODE consortium are submitted to the Data Coordination Center (DCC) for validation, tracking, storage, and distribution to community resources and the scientific community.

Here, we present FusionInspector for characterization and interpretation of candidate fusion transcripts from RNA sequencing (RNA-seq) and exploration of their sequence and expression characteristics. We applied FusionInspector to thousands of tumor and normal transcriptomes and identified statistical and experimental features enriched among biologically impactful fusions. Through clustering and machine learning, we identified large collections of fusions potentially relevant to tumor and normal biological processes.

The Encyclopedia of DNA Elements (ENCODE) project is a collaborative effort to create a comprehensive catalog of functional elements in the human genome. The current database comprises more than 19000 functional genomics experiments across more than 1000 cell lines and tissues using a wide array of experimental techniques to study the chromatin structure, regulatory and transcriptional landscape of the human and mouse genomes. All experimental data, metadata, and associated computational analyses created by the ENCODE consortium are submitted to the Data Coordination Center (DCC) for validation, tracking, storage, and distribution to community resources and the scientific community.

Article Synopsis
  • A deep-learning model can predict allele-specific activity using only local nucleotide sequences, emphasizing key transcription-factor-binding motifs affected by genetic variants.
  • Combining EN-TEx with previous genome annotations shows significant connections between allele-specific loci and GWAS loci, and aids in transferring known eQTLs to challenging tissue types, improving personal functional genomics research.

Glioblastoma multiforme (GBM) is an aggressive, heterogeneous grade IV brain tumor. Glioblastoma stem cells (GSCs) initiate the tumor and are known culprits of therapy resistance. Mounting evidence has demonstrated a regulatory role of long non-coding RNAs (lncRNAs) in various biological processes, including pluripotency, differentiation, and tumorigenesis.

The Human Reference Genome serves as the foundation for modern genomic analyses. However, in its present form, it does not adequately represent the vast genetic diversity of the human population. In this study, we explored the consensus genome as a potential successor of the current reference genome and assessed its effect on the accuracy of RNA-seq read alignment.
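
As a rough illustration of the consensus-genome idea, the sketch below substitutes the population major allele wherever it differs from the reference base. The abstract does not specify how the consensus was built, so the construction rule (alternate allele frequency above 0.5), the function name `build_consensus`, and the toy sequence and variants are all assumptions made for illustration. Only single-nucleotide variants are handled.

```python
from typing import List, Tuple

# (0-based position, reference allele, alternate allele, alternate allele frequency)
Variant = Tuple[int, str, str, float]


def build_consensus(reference: str, variants: List[Variant]) -> str:
    """Return a copy of `reference` with major alleles substituted.

    Assumption: a position is replaced only when the alternate allele is the
    population major allele, i.e. its frequency exceeds 0.5.
    """
    seq = list(reference)
    for pos, ref, alt, alt_freq in variants:
        # Guard against variants that do not match the supplied reference.
        if seq[pos] != ref:
            raise ValueError(f"reference mismatch at position {pos}")
        if alt_freq > 0.5:
            seq[pos] = alt
    return "".join(seq)


# Toy data, invented for this example.
reference = "ACGTACGT"
variants = [
    (1, "C", "T", 0.80),  # alternate is the major allele: substituted
    (4, "A", "G", 0.30),  # reference is the major allele: kept
    (7, "T", "C", 0.55),  # alternate is the major allele: substituted
]
consensus = build_consensus(reference, variants)
print(consensus)
```

Because only single-base substitutions are applied, the consensus keeps the reference coordinate system; handling insertions and deletions would require coordinate conversion, which this sketch leaves out.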

The primary motor cortex (M1) is essential for voluntary fine-motor control and is functionally conserved across mammals. Here, using high-throughput transcriptomic and epigenomic profiling of more than 450,000 single nuclei in humans, marmoset monkeys and mice, we demonstrate a broadly conserved cellular makeup of this region, with similarities that mirror evolutionary distance and are consistent between the transcriptome and epigenome. The core conserved molecular identities of neuronal and non-neuronal cell types allow us to generate a cross-species consensus classification of cell types, and to infer conserved properties of cell types across species.

We have produced RNA sequencing data for 53 primary cells from different locations in the human body. The clustering of these primary cells reveals that most cells in the human body share a few broad transcriptional programs, which define five major cell types: epithelial, endothelial, mesenchymal, neural, and blood cells. These act as basic components of many tissues and organs.

The human and mouse genomes contain instructions that specify RNAs and proteins and govern the timing, magnitude, and cellular context of their production. To better delineate these elements, phase III of the Encyclopedia of DNA Elements (ENCODE) Project has expanded analysis of the cell and tissue repertoires of RNA transcription, chromatin structure and modification, DNA methylation, chromatin looping, and occupancy by transcription factors and RNA-binding proteins. Here we summarize these efforts, which have produced 5,992 new experimental datasets, including systematic determinations across mouse fetal development.

MaizeCODE is a project aimed at identifying and analyzing functional elements in the maize genome. In its initial phase, MaizeCODE assayed up to five tissues from four maize strains (B73, NC350, W22, TIL11) by RNA-seq, ChIP-seq, RAMPAGE, and small RNA sequencing. To facilitate reproducible science and provide both human and machine access to the MaizeCODE data, we enhanced SciApps, a cloud-based portal, for analysis and distribution of both raw data and analysis results.

MicroRNAs (miRNAs) play a critical role as posttranscriptional regulators of gene expression. The ENCODE Project profiled the expression of miRNAs in an extensive set of organs during a time-course of mouse embryonic development and captured the expression dynamics of 785 miRNAs. We found distinct organ-specific and developmental stage-specific miRNA expression clusters, with an overall pattern of increasing organ-specific expression as embryonic development proceeds.

Background: Accurate fusion transcript detection is essential for comprehensive characterization of cancer transcriptomes. Over the last decade, multiple bioinformatic tools have been developed to predict fusions from RNA-seq, based on either read mapping or de novo fusion transcript assembly.

Results: We benchmark 23 different methods including applications we develop, STAR-Fusion and TrinityFusion, leveraging both simulated and real RNA-seq.
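
One common way to score a fusion caller against a truth set, such as the known fusions in simulated reads, is precision, recall, and F1 over gene pairs. The sketch below is a generic illustration and not the scoring procedure used in the study. The function names, the case-insensitive matching on ordered gene pairs, and the example calls (including the invented pair `FOO1`/`BAR2`) are assumptions.

```python
from typing import Dict, Iterable, Tuple

GenePair = Tuple[str, str]  # (5' partner, 3' partner)


def normalize(pair: GenePair) -> GenePair:
    """Compare gene symbols case-insensitively while keeping partner order."""
    left, right = pair
    return (left.upper(), right.upper())


def score_predictions(predicted: Iterable[GenePair],
                      truth: Iterable[GenePair]) -> Dict[str, float]:
    """Compute precision, recall, and F1 for predicted fusions versus a truth set."""
    pred = {normalize(p) for p in predicted}
    true = {normalize(t) for t in truth}
    tp = len(pred & true)   # predicted and in the truth set
    fp = len(pred - true)   # predicted but not in the truth set
    fn = len(true - pred)   # in the truth set but missed
    precision = tp / (tp + fp) if pred else 0.0
    recall = tp / (tp + fn) if true else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"tp": tp, "fp": fp, "fn": fn,
            "precision": precision, "recall": recall, "f1": f1}


# Toy example: two of three true fusions recovered, plus one false call.
truth = [("BCR", "ABL1"), ("EML4", "ALK"), ("TMPRSS2", "ERG")]
predicted = [("BCR", "ABL1"), ("eml4", "alk"), ("FOO1", "BAR2")]
scores = score_predictions(predicted, truth)
print(scores)
```

A real benchmark would also need rules for paralogs, overlapping genes, and reciprocal fusions, which this exact-match comparison ignores.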

The use of the human reference genome has shaped methods and data across modern genomics. This has offered many benefits while creating a few constraints. In the following opinion, we outline the history, properties, and pitfalls of the current human reference genome.

Long noncoding RNAs (lncRNAs) can regulate target gene expression by acting in cis (locally) or in trans (non-locally). Here, we performed genome-wide expression analysis of Toll-like receptor (TLR)-stimulated human macrophages to identify pairs of cis-acting lncRNAs and protein-coding genes involved in innate immunity. A total of 229 gene pairs were identified, many of which were commonly regulated by signaling through multiple TLRs and were involved in the cytokine responses to infection by group B Streptococcus. We focused on elucidating the function of one lncRNA, named ROCKI (Regulator of Cytokines and Inflammation), which was induced by multiple TLR stimuli and acted as a master regulator of inflammatory responses.

Many tools are available for RNA-seq alignment and expression quantification, with comparative value being hard to establish. Benchmarking assessments often highlight methods' good performance, but are focused on either model data or fail to explain variation in performance. This leaves us to ask, what is the most meaningful way to assess different alignment choices? And importantly, where is there room for progress? In this work, we explore the answers to these two questions by performing an exhaustive assessment of the STAR aligner.

Background: A comparison of transcriptional profiles derived from different tissues in a given species or among different species assumes that commonalities reflect evolutionarily conserved programs and that differences reflect species or tissue responses to environmental conditions or developmental program staging. Apparently conflicting results have been published regarding whether organ-specific transcriptional patterns dominate over species-specific patterns, or vice versa, making it unclear to what extent the biology of a given organism can be extrapolated to another. These studies have in common that they treat the transcriptomes monolithically, implicitly ignoring that each gene is likely to have a specific pattern of transcriptional variation across organs and species.
