Background: Chromatin 3D conformation plays important roles in regulating gene or protein functions. High-throughout chromosome conformation capture (3C)-based technologies, such as Hi-C, have been exploited to acquire the contact frequencies among genomic loci at genome-scale. Various computational tools have been proposed to recover the underlying chromatin 3D structures from in situ Hi-C contact map data. As connected residuals in a polymer, neighboring genomic loci have intrinsic mutual dependencies in building a 3D conformation. However, current methods seldom take this feature into account.

Results: We present a method called ShNeigh, which combines the classical MDS technique with local dependence of neighboring loci modeled by a Gaussian formula, to infer the best 3D structure from noisy and incomplete contact frequency matrices. We validated ShNeigh by comparing it to two typical distance-based algorithms, ShRec3D and ChromSDE. The comparison results on simulated Hi-C dataset showed that, while keeping the high-speed nature of classical MDS, ShNeigh can recover the true structure better than ShRec3D and ChromSDE. Meanwhile, ShNeigh is more robust to data noise. On the publicly available human GM06990 Hi-C data, we demonstrated that the structures reconstructed by ShNeigh are more reproducible between different restriction enzymes than by ShRec3D and ChromSDE, especially at high resolutions manifested by sparse contact maps, which means ShNeigh is more robust to signal coverage.

Conclusions: Our method can recover stable structures in high noise and sparse signal settings. It can also reconstruct similar structures from Hi-C data obtained using different restriction enzymes. Therefore, our method provides a new direction for enhancing the reconstruction quality of chromatin 3D structures.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7329537PMC
http://dx.doi.org/10.1186/s12859-020-03612-4DOI Listing

Publication Analysis

Top Keywords

genomic loci
12
shrec3d chromsde
12
chromatin structures
8
classical mds
8
shneigh robust
8
hi-c data
8
restriction enzymes
8
shneigh
6
hi-c
5
structures
5

Similar Publications

Soybean () is a leguminous plant with a broad range of applications, particularly in agriculture and food production, where its seed composition-especially oil and protein content-is highly valued. Improving these traits is a primary focus of soybean breeding programs. In this study, we conducted a genome-wide association study (GWAS) to identify genetic loci linked to oil and protein content in seeds, using imputed genotype data for 180 Eurasian soybean varieties and the novel "genotypic twins" approach.

View Article and Find Full Text PDF

Faba bean ( L.) is a valuable ingredient in plant-based foods such as meat and dairy analogues. However, its typical taste and aroma are considered off-flavours in these food applications, representing a bottleneck during processing.

View Article and Find Full Text PDF

Extrachromosomal circular DNAs (eccDNAs) has been found to be widespread and functional in various organisms. However, comparative analyses of pre- and post-infection of virus are rarely known. Herein, we investigated the changes in expression patterns of eccDNA following infection with cytoplasmic polyhedrosis virus (BmCPV) and explore the role of eccDNA in viral infection.

View Article and Find Full Text PDF

A Haplotype GWAS in Syndromic Familial Colorectal Cancer.

Int J Mol Sci

January 2025

Department of Molecular Medicine and Surgery, Karolinska Institutet, 17176 Stockholm, Sweden.

A previous genome-wide association study (GWAS) in colorectal cancer (CRC) patients with gastric and/or prostate cancer in their families suggested genetic loci with a shared risk for these three cancers. A second haplotype GWAS was undertaken in the same colorectal cancer patients and different controls with the aim of confirming the result and finding novel loci. The haplotype GWAS analysis involved 685 patients with colorectal cancer cases and 1642 healthy controls from Sweden.

View Article and Find Full Text PDF

is a member of the cruciferous family with rich glucosinolate (GSL) content, particularly glucobrassicin (3-indolylmethyl glucosinolate, I3M), that can be metabolized into indole-3-carbinol (I3C), a compound with promising anticancer properties. To unravel the genetic mechanism influencing I3C content in rapeseed seedlings, a comprehensive study was undertaken with a doubled haploid (DH) population. By quantitative trait loci (QTL) mapping, seven QTL that were located on A01, A07, and C04 were identified, with the most significant contribution to phenotypic variation observed on chromosome A07 (11.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!