Graph convolutional networks for epigenetic state prediction using both sequence and 3D genome data.

Bioinformatics

Department of Computer Science, University of Virginia, Charlottesville, VA 22903, USA.

Published: December 2020

Motivation: Predictive models of DNA chromatin profile (i.e. epigenetic state), such as transcription factor binding, are essential for understanding regulatory processes and developing gene therapies. It is known that the 3D genome, or spatial structure of DNA, is highly influential in the chromatin profile. Deep neural networks have achieved state of the art performance on chromatin profile prediction by using short windows of DNA sequences independently. These methods, however, ignore the long-range dependencies when predicting the chromatin profiles because modeling the 3D genome is challenging.

Results: In this work, we introduce ChromeGCN, a graph convolutional network for chromatin profile prediction by fusing both local sequence and long-range 3D genome information. By incorporating the 3D genome, we relax the independent and identically distributed assumption of local windows for a better representation of DNA. ChromeGCN explicitly incorporates known long-range interactions into the modeling, allowing us to identify and interpret those important long-range dependencies in influencing chromatin profiles. We show experimentally that by fusing sequential and 3D genome data using ChromeGCN, we get a significant improvement over the state-of-the-art deep learning methods as indicated by three metrics. Importantly, we show that ChromeGCN is particularly useful for identifying epigenetic effects in those DNA windows that have a high degree of interactions with other DNA windows.

Availability And Implementation: https://github.com/QData/ChromeGCN.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/btaa793DOI Listing

Publication Analysis

Top Keywords

chromatin profile
16
graph convolutional
8
epigenetic state
8
genome data
8
profile prediction
8
long-range dependencies
8
chromatin profiles
8
genome
6
dna
6
chromatin
6

Similar Publications

The immune landscape of fetal chorionic villous tissue in term placenta.

Front Immunol

January 2025

Department of Microbiology, Immunology and Molecular Genetics, University of Kentucky, Lexington, KY, United States.

Introduction: The immune compartment within fetal chorionic villi is comprised of fetal Hofbauer cells (HBC) and invading placenta-associated maternal monocytes and macrophages (PAMM). Recent studies have characterized the transcriptional profile of the first trimester (T1) placenta; however, the phenotypic and functional diversity of chorionic villous immune cells at term (T3) remain poorly understood.

Methods: To address this knowledge gap, immune cells from human chorionic villous tissues obtained from full-term, uncomplicated pregnancies were deeply phenotyped using a combination of flow cytometry, single-cell RNA sequencing (scRNA-seq, CITE-seq) and chromatin accessibility profiling (snATAC-seq).

View Article and Find Full Text PDF

Multiplexed spatial mapping of chromatin features, transcriptome and proteins in tissues.

Nat Methods

January 2025

Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.

The phenotypic and functional states of cells are modulated by a complex interactive molecular hierarchy of multiple omics layers, involving the genome, epigenome, transcriptome, proteome and metabolome. Spatial omics approaches have enabled the study of these layers in tissue context but are often limited to one or two modalities, offering an incomplete view of cellular identity. Here we present spatial-Mux-seq, a multimodal spatial technology that allows simultaneous profiling of five different modalities: two histone modifications, chromatin accessibility, whole transcriptome and a panel of proteins at tissue scale and cellular level in a spatially resolved manner.

View Article and Find Full Text PDF

Many transcription factors (TFs) have been shown to bind to super-enhancers, forming transcriptional condensates to activate transcription in various cellular systems. However, the genomic and epigenomic determinants of phase-separated transcriptional condensate formation remain poorly understood. Questions regarding which TFs tend to associate with transcriptional condensates and what factors influence their association are largely unanswered.

View Article and Find Full Text PDF

Unlabelled: Once considered rare in eukaryotes, polycistronic mRNA expression has been identified in kinetoplastids and, more recently, green algae, red algae, and certain fungi. This study provides comprehensive evidence supporting the existence of polycistronic mRNA expression in the apicomplexan parasite . Leveraging long-read RNA-seq data from different parasite strains and using multiple long-read technologies, we demonstrate the existence of defined polycistronic transcripts containing 2-4 protein encoding genes, several validated with RT-PCR.

View Article and Find Full Text PDF

Transcriptional silencers are -regulatory elements that downregulate the expression of target genes. Although thousands of silencers have been identified experimentally, a predictive chromatin signature of silencers has not been found. H4K20me1 previously was reported to be highly enriched among human silencers, but our reanalysis of those data using an appropriate background revealed that the enrichment is only marginal.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!