Variable Number Tandem Repeats (VNTRs) are tandem repeat (TR) loci that vary in copy number across a population. Using our program, VNTRseek, we analyzed human whole genome sequencing datasets from 2770 individuals in order to detect minisatellite VNTRs, i.e., those with pattern sizes ≥7 bp. We detected 35 638 VNTR loci and classified 5676 as commonly polymorphic (i.e., with non-reference alleles occurring in >5% of the population). Commonly polymorphic VNTR loci were found to be enriched in genomic regions with regulatory function, i.e., transcription start sites and enhancers. Investigation of the commonly polymorphic VNTRs in the context of population ancestry revealed that 1096 loci contained population-specific alleles and that those could be used to classify individuals into super-populations with near-perfect accuracy. A search for expression quantitative trait loci (eQTLs) among the VNTRs proximal to genes indicated that, for 187 genes, expression differences correlated with VNTR genotype. We validated our predictions in several ways, including experimentally, through the identification of predicted alleles in long reads, and by comparisons showing consistency between sequencing platforms. This study is the most comprehensive analysis of minisatellite VNTRs in the human population to date.
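
The ">5% of the population" criterion can be illustrated with a minimal sketch. This is not VNTRseek's implementation: the input layout, the function name `commonly_polymorphic`, and the reading of the criterion as "more than 5% of individuals carry at least one non-reference allele" are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical input layout: for each locus, one genotype per individual.
# A genotype is a tuple of copy-number offsets relative to the reference
# (0 = reference allele, +1 = one extra copy, -2 = two fewer copies, ...).
Genotypes = Dict[str, List[Tuple[int, ...]]]


def commonly_polymorphic(genotypes: Genotypes, threshold: float = 0.05) -> List[str]:
    """Return loci where more than `threshold` of individuals carry
    at least one non-reference allele (assumed reading of the criterion)."""
    selected = []
    for locus, calls in genotypes.items():
        if not calls:
            continue  # no genotyped individuals at this locus
        carriers = sum(1 for alleles in calls if any(a != 0 for a in alleles))
        if carriers / len(calls) > threshold:
            selected.append(locus)
    return selected


# Toy data with 20 individuals per locus (invented, not from the study).
example = {
    "locus_A": [(0, 0)] * 18 + [(0, 1), (-1, 0)],  # 2 of 20 carriers (10%)
    "locus_B": [(0, 0)] * 19 + [(0, 2)],           # 1 of 20 carriers (exactly 5%)
}
print(commonly_polymorphic(example))
```

In this toy data only `locus_A` is selected, because the comparison is strict: a locus at exactly 5% does not count as commonly polymorphic under this reading.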


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8096271
DOI: http://dx.doi.org/10.1093/nar/gkab224


Similar Publications

Genome-wide investigation of VNTR motif polymorphisms in 8,222 genomes: Implications for biological regulation and human traits.

Cell Genom

December 2024

Key Laboratory of Epigenetic Regulation and Intervention, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China.

Article Synopsis
  • Variable number tandem repeats (VNTRs) are genetic features that differ in length and sequence, yet their functional effects are not fully understood.
  • The study presents a comprehensive VNTR polymorphism map with over 2.5 million VNTR length and 11 million VNTR motif polymorphisms found in 8,222 genomes, revealing a significant number of rare mutations.
  • It identifies specific VNTRs linked to gene expression changes and explores the potential influence of these polymorphisms on phenotypes and disease susceptibility, aiming to enhance the understanding of their biological roles.

Short tandem repeats (STRs) and variable-number tandem repeats (VNTRs) are repetitive genomic sequences seen widely throughout the genome. These repeat expansions are currently known to cause ∼60 diseases, with expansions in new loci linked to rare diseases continuing to be discovered. Genome sequencing is an important tool for detecting disease-causing variants and several computational tools have been developed to analyze tandem repeats from genomic data, enabling the genotyping and the identification of expanded alleles.

Article Synopsis
  • Using two methods (long-read analysis with MIRUReader and standard amplification), results showed a high agreement between the two, with only 11 discrepancies out of 3,024 loci analyzed.
  • The research suggests that long-read sequencing can improve the integration of historical TB data with genomic analysis, potentially enhancing tracking of TB transmission patterns.

Structural and genetic diversity in the secreted mucins MUC5AC and MUC5B.

Am J Hum Genet

August 2024

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA; Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA.

Article Synopsis
  • MUC5AC and MUC5B are secreted mucins that help protect the body by trapping germs and aiding mucus clearance.
  • Researchers compared these proteins using DNA from humans and other primates and found that MUC5B is mostly the same across humans, while MUC5AC has many variations.
  • The study also showed that people from East Asia have unique versions of the MUC5AC protein that might have aided survival, while another version is more common in Europeans.

Background: Variable number tandem repeats (VNTRs) are highly polymorphic DNA regions harboring many potentially disease-causing variants. However, VNTRs often appear unresolved ("dark") in variation databases due to their repetitive nature. One particularly complex and medically relevant VNTR is the KIV-2 VNTR located in the cardiovascular disease gene LPA which encompasses up to 70% of the coding sequence.

