Non-coding variants located within regulatory elements may alter gene expression by modifying transcription factor (TF) binding sites, thereby leading to functional consequences. Different TF models are being used to assess the effect of DNA sequence variants, such as single nucleotide variants (SNVs). Often existing methods are slow and do not assess statistical significance of results. We investigated the distribution of absolute maximal differential TF binding scores for general computational models that affect TF binding. We find that a modified Laplace distribution can adequately approximate the empirical distributions. A benchmark on and datasets showed that our approach improves upon an existing method in terms of performance and speed. Applications on eQTLs and on a genome-wide association study illustrate the usefulness of our statistics by highlighting cell type-specific regulators and target genes. An implementation of our approach is freely available on GitHub and as bioconda package.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11088338PMC
http://dx.doi.org/10.1016/j.isci.2024.109765DOI Listing

Publication Analysis

Top Keywords

single nucleotide
8
nucleotide variants
8
transcription factor
8
factor binding
8
statistical approach
4
approach identifying
4
identifying single
4
variants
4
variants affect
4
affect transcription
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!