Genome-wide maps of highly-similar intrachromosomal repeats that can mediate ectopic recombination in three human genome assemblies.

HGG Adv

International Laboratory for Human Genome Research, Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de México, Juriquilla, Querétaro, México.

Published: December 2024

Repeated sequences spread throughout the genome play important roles in shaping the structure of chromosomes and in generating new genomic variation through structural rearrangements. Several mechanisms of structural variant formation use shared nucleotide similarity between repeated sequences as a substrate for ectopic recombination. We performed genome-wide analyses of direct and inverted intrachromosomal repeated sequence pairs with >200 bp and >80% sequence identity in three human genome assemblies: GRCh37, GRCh38, and the T2T-CHM13 alternate assembly. Overall, the composition and distribution of the direct and inverted repeats identified were similar among the three assemblies, involving 13-15% of the haploid genome, with a higher, albeit not significant, number of repeated sequences in T2T-CHM13. Interestingly, the majority of repeated sequences are below 1 kb in length, with a median identity of 84.2%, highlighting the potential relevance of smaller, less identical repeats, such as Alu-Alu pairs, for ectopic recombination. We cross-referenced the identified repeated sequences with protein-coding genes to identify those at risk of being involved in genomic disorders. Olfactory receptor and immune response genes were enriched among those impacted. We have produced a catalog of highly identical, directly and inversely oriented intrachromosomal repeated sequences across the three currently most widely used human genome assemblies. Bioinformatic analyses of these sequences and their contribution to genome architecture can reveal regions that are susceptible to genomic instability. Understanding how architectural features of these repeats, such as identity, length, and distance, lead to genomic rearrangements can provide further insights into the molecular mechanisms underlying genomic disorders and genome evolution.
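
The selection criteria stated in the abstract (intrachromosomal pairs, >200 bp, >80% identity, direct or inverted orientation, overlap with protein-coding genes) can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the `RepeatPair` record, its field names, the 0-based half-open coordinates, and the choice to measure length on the shorter copy are all assumptions made for illustration only.

```python
from dataclasses import dataclass

# Thresholds taken from the abstract; everything else here is hypothetical.
MIN_LENGTH_BP = 200       # pairs must be longer than 200 bp
MIN_IDENTITY_PCT = 80.0   # pairs must exceed 80% sequence identity


@dataclass(frozen=True)
class RepeatPair:
    """Hypothetical record for one alignment between two repeat copies.

    Coordinates are assumed to be 0-based and half-open.
    """
    chrom_a: str
    start_a: int
    end_a: int
    strand_a: str  # "+" or "-"
    chrom_b: str
    start_b: int
    end_b: int
    strand_b: str
    identity_pct: float


def passes_thresholds(pair: RepeatPair) -> bool:
    """Keep intrachromosomal pairs above the length and identity cutoffs.

    Assumption: length is measured on the shorter of the two copies.
    """
    length = min(pair.end_a - pair.start_a, pair.end_b - pair.start_b)
    return (
        pair.chrom_a == pair.chrom_b
        and length > MIN_LENGTH_BP
        and pair.identity_pct > MIN_IDENTITY_PCT
    )


def orientation(pair: RepeatPair) -> str:
    """Same strand means a direct repeat; opposite strands, an inverted one."""
    return "direct" if pair.strand_a == pair.strand_b else "inverted"


def overlaps_gene(pair: RepeatPair, gene_chrom: str,
                  gene_start: int, gene_end: int) -> bool:
    """True if either repeat copy overlaps the gene interval."""
    if pair.chrom_a != gene_chrom:
        return False
    copies = [(pair.start_a, pair.end_a), (pair.start_b, pair.end_b)]
    return any(s < gene_end and gene_start < e for s, e in copies)


# Toy input; the coordinates are invented and carry no biological meaning.
pairs = [
    RepeatPair("chr1", 1000, 1300, "+", "chr1", 50000, 50310, "+", 84.2),
    RepeatPair("chr1", 2000, 2500, "+", "chr1", 90000, 90500, "-", 95.0),
    RepeatPair("chr1", 3000, 3150, "+", "chr1", 70000, 70150, "+", 99.0),  # too short
    RepeatPair("chr1", 4000, 4400, "+", "chr2", 4000, 4400, "+", 90.0),    # different chromosomes
    RepeatPair("chr1", 5000, 5400, "+", "chr1", 8000, 8400, "-", 80.0),    # identity not above 80%
]
kept = [p for p in pairs if passes_thresholds(p)]
labels = [orientation(p) for p in kept]
```

In practice, the pairs would come from a sequence aligner and the gene intervals from an annotation file, with an interval index replacing the per-gene overlap check shown here.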

Source
http://dx.doi.org/10.1016/j.xhgg.2024.100396
