The spatial organization of genomes is becoming increasingly understood. In mammals, where it is most investigated, this organization ties in with transcription, so an important research objective is to understand whether gene activity is a cause or a consequence of genome folding in space. In this regard, the phenomena of X-chromosome inactivation and reactivation open a unique window of investigation because of the singularities of the inactive X chromosome.
View Article and Find Full Text PDFSeveral viruses hide in the genome of their host. To complete their replication cycle, they need to integrate in the form of a provirus and express their genes. In vertebrates, integrated viruses can be silenced by chromatin, implying that some specific mechanisms exist to detect non-self genes.
View Article and Find Full Text PDFBackground: Biases of DNA repair can shape the nucleotide landscape of genomes at evolutionary timescales. The molecular mechanisms of those biases are still poorly understood because it is difficult to isolate the contributions of DNA repair from those of DNA damage.
Results: Here, we develop a genome-wide assay whereby the same DNA lesion is repaired in different genomic contexts.
A hallmark of chromosome organization is the partition into transcriptionally active A and repressed B compartments, and into topologically associating domains (TADs). Both structures were regarded to be absent from the inactive mouse X chromosome, but to be re-established with transcriptional reactivation and chromatin opening during X-reactivation. Here, we combine a tailor-made mouse iPSC reprogramming system and high-resolution Hi-C to produce a time course combining gene reactivation, chromatin opening and chromosome topology during X-reactivation.
View Article and Find Full Text PDFThe increasing throughput of DNA sequencing technologies creates a need for faster algorithms. The fate of most reads is to be mapped to a reference sequence, typically a genome. Modern mappers rely on heuristics to gain speed at a reasonable cost for accuracy.
View Article and Find Full Text PDFHIV-1 persists lifelong in memory cells of the immune system as latent provirus that rebounds upon treatment interruption. Therefore, the latent reservoir is the main target for an HIV cure. Here, we studied the direct link between integration site and transcription using LEDGINs and Barcoded HIV-ensembles (B-HIVE).
View Article and Find Full Text PDFHIV-1 recurrently targets active genes and integrates in the proximity of the nuclear pore compartment in CD4 T cells. However, the genomic features of these genes and the relevance of their transcriptional activity for HIV-1 integration have so far remained unclear. Here we show that recurrently targeted genes are proximal to super-enhancer genomic elements and that they cluster in specific spatial compartments of the T cell nucleus.
View Article and Find Full Text PDFHow cells adopt different identities has long fascinated biologists. Signal transduction in response to environmental cues results in the activation of transcription factors that determine the gene-expression program characteristic of each cell type. Technological advances in the study of 3D chromatin folding are bringing the role of genome conformation in transcriptional regulation to the fore.
View Article and Find Full Text PDFCurr Protoc Mol Biol
April 2018
The latent HIV reservoir is the main barrier to curing AIDS, because infected cells escape the immune system and antiretroviral therapies. Developing new treatment strategies requires technologies to trace latent proviruses. Here, we describe a genome-wide technique called Barcoded HIV Ensembles (B-HIVE) to measure HIV expression at the single provirus level.
View Article and Find Full Text PDFAll organisms regulate transcription of their genes. To understand this process, a complete understanding of how transcription factors find their targets in cellular nuclei is essential. The DNA sequence and other variables are known to influence this binding, but the distribution of transcription factor binding patterns remains mostly unexplained in metazoan genomes.
View Article and Find Full Text PDFThe three-dimensional conformation of genomes is an essential component of their biological activity. The advent of the Hi-C technology enabled an unprecedented progress in our understanding of genome structures. However, Hi-C is subject to systematic biases that can compromise downstream analyses.
View Article and Find Full Text PDFUpon HIV-1 infection, a reservoir of latently infected resting T cells prevents the eradication of the virus from patients. To achieve complete depletion, the existing virus-suppressing antiretroviral therapy must be combined with drugs that reactivate the dormant viruses. We previously described a novel chemical scaffold compound, MMQO (8-methoxy-6-methylquinolin-4-ol), that is able to reactivate viral transcription in several models of HIV latency, including J-Lat cells, through an unknown mechanism.
View Article and Find Full Text PDFChromosomal architecture is known to influence gene expression, yet its role in controlling cell fate remains poorly understood. Reprogramming of somatic cells into pluripotent stem cells (PSCs) by the transcription factors (TFs) OCT4, SOX2, KLF4 and MYC offers an opportunity to address this question but is severely limited by the low proportion of responding cells. We have recently developed a highly efficient reprogramming protocol that synchronously converts somatic into pluripotent stem cells.
View Article and Find Full Text PDFT47D_rep2 and b1913e6c1_51720e9cf were 2 Hi-C samples. They were born and processed at the same time, yet their fates were very different. The life of b1913e6c1_51720e9cf was simple and fruitful, while that of T47D_rep2 was full of accidents and sorrow.
View Article and Find Full Text PDFDNA methylation is an essential epigenetic mark in mammals that has to be re-established after each round of DNA replication. The protein UHRF1 is essential for this process; it has been proposed that the protein targets newly replicated DNA by cooperatively binding hemi-methylated DNA and H3K9me2/3, but this model leaves a number of questions unanswered. Here, we present evidence for a direct recruitment of UHRF1 by the replication machinery via DNA ligase 1 (LIG1).
View Article and Find Full Text PDFThe sequence of a genome is insufficient to understand all genomic processes carried out in the cell nucleus. To achieve this, the knowledge of its three-dimensional architecture is necessary. Advances in genomic technologies and the development of new analytical methods, such as Chromosome Conformation Capture (3C) and its derivatives, provide unprecedented insights in the spatial organization of genomes.
View Article and Find Full Text PDFHousekeeping genes of animal genomes cluster in the same chromosomal regions. It has long been suggested that this organization contributes to their steady expression across all the tissues of the organism. Here, we show that the activity of housekeeping gene promoters depends on the expression of their neighbors.
View Article and Find Full Text PDFThe main obstacle to curing HIV is the presence of latent proviruses in the bodies of infected patients. The partial success of reactivation therapies suggests that the genomic context of integrated proviruses can interfere with treatment. Here we developed a method called Barcoded HIV ensembles (B-HIVE) to map the chromosomal locations of thousands of individual proviruses while tracking their transcriptional activities in an infected cell population.
View Article and Find Full Text PDFBioinformatics
October 2016
Motivation: Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) is the standard method to investigate chromatin protein composition. As the number of community-available ChIP-seq profiles increases, it becomes more common to use data from different sources, which makes joint analysis challenging. Issues such as lack of reproducibility, heterogeneous quality and conflicts between replicates become evident when comparing datasets, especially when they are produced by different laboratories.
View Article and Find Full Text PDFThe prediction of protein folding rates is a necessary step towards understanding the principles of protein folding. Due to the increasing amount of experimental data, numerous protein folding models and predictors of protein folding rates have been developed in the last decade. The problem has also attracted the attention of scientists from computational fields, which led to the publication of several machine learning-based models to predict the rate of protein folding.
View Article and Find Full Text PDFThe two-sample Kolmogorov-Smirnov (KS) test is often used to decide whether two random samples have the same statistical distribution. A popular modification of the KS test is to use a signed version of the KS statistic to infer whether the values of one sample are statistically larger than the values of the other. The underlying hypotheses of the KS test are intrinsically incompatible with this approach and the test can produce false positives supported by extremely low p-values.
View Article and Find Full Text PDF