Functional assessment of disease-associated sequence variation at non-coding regulatory elements is complicated by their high degree of context sensitivity to both the local chromatin and nuclear environments. Allelic profiling of DNA accessibility across individuals has shown that only a select minority of sequence variation affects transcription factor (TF) occupancy, yet low sequence diversity in human populations means that no experimental assessment is available for the majority of disease-associated variants. Here we describe high-resolution in vivo maps of allelic DNA accessibility in liver, kidney, lung and B cells from 5 increasingly diverged strains of F1 hybrid mice. The high density of heterozygous sites in these hybrids enables precise quantification of effect size and cell-type specificity for hundreds of thousands of variants throughout the mouse genome. We show that chromatin-altering variants delineate characteristic sensitivity profiles for hundreds of TF motifs. We develop a compendium of TF-specific sensitivity profiles accounting for genomic context effects. Finally, we link maps of allelic accessibility to allelic transcript levels in the same samples. This work provides a foundation for quantitative prediction of cell-type specific effects of non-coding variation on TF activity, which will facilitate both fine-mapping and systems-level analyses of common disease-associated variation in human genomes.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8121920PMC
http://dx.doi.org/10.1038/s41467-021-23139-3DOI Listing

Publication Analysis

Top Keywords

sequence variation
8
dna accessibility
8
maps allelic
8
sensitivity profiles
8
variation
5
tissue context
4
context determines
4
determines penetrance
4
penetrance regulatory
4
regulatory dna
4

Similar Publications

Large-scale gene-environment interaction (GxE) discovery efforts often involve analytical compromises for the sake of data harmonization and statistical power. Refinement of exposures, covariates, outcomes, and population subsets may be helpful to establish often-elusive replication and evaluate potential clinical utility. Here, we used additional datasets, an expanded set of statistical models, and interrogation of lipoprotein metabolism via nuclear magnetic resonance (NMR)-based lipoprotein subfractions to refine a previously discovered GxE modifying the relationship between physical activity (PA) and HDL-cholesterol (HDL-C).

View Article and Find Full Text PDF

Interpreting Variants of Uncertain Significance in PCD: Abnormal Splicing Caused by a Missense Variant of DNAAF3.

Mol Genet Genomic Med

January 2025

The State Key Laboratory for Complex Severe and Rare Diseases, the State Key Sci-Tech Infrastructure for Translational Medicine, Peking Union Medical College Hospital, Beijing, China.

Background: Primary ciliary dyskinesia (PCD) is a rare autosomal recessive disorder characterized by dysfunction of motile cilia. While approximately 50 genes have been identified, around 25% of PCD patients remain genetically unexplained; elucidating the pathogenicity of specific variants remains a challenge.

Methods: Whole exome sequencing (WES) and Sanger sequencing were conducted to identify potential pathogenic variants of PCD.

View Article and Find Full Text PDF

The study found a significant causal relationship between coffee intake and obsessive-compulsive disorder, showing a negative correlation. There was no causal relationship between coffee intake and other mental disorders. The sensitivity analysis test found no pleiotropy affecting the results, and no single nucleotide polymorphism had a major impact on the robustness of the results, indicating that the results are stable and reliable.

View Article and Find Full Text PDF

Antidepressants exhibit a considerable variation in efficacy, and increasing evidence suggests that individual genetics contribute to antidepressant treatment response. Here, we combined data on antidepressant non-response measured using rating scales for depressive symptoms, questionnaires of treatment effect, and data from electronic health records, to increase statistical power to detect genomic loci associated with non-response to antidepressants in a total sample of 135,471 individuals prescribed antidepressants (25,255 non-responders and 110,216 responders). We performed genome-wide association meta-analyses, genetic correlation analyses, leave-one-out polygenic prediction, and bioinformatics analyses for genetically informed drug prioritization.

View Article and Find Full Text PDF

Case-control genome-wide association studies (GWAS) are often used to find associations between genetic variants and diseases. When case-control GWAS are conducted, researchers must make decisions regarding how many cases and how many controls to include in the study. Depending on differing availability and cost of controls and cases, varying case fractions are used in case-control GWAS.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!