About 5% of the human genome consists of large-scale duplicated segments of almost identical sequences. Segmental duplications (SDs) have been proposed to be involved in non-allelic homologous recombination leading to recurrent genomic variation and disease. It has also been suggested that these SDs are associated with syntenic rearrangements that have shaped the human genome. We have analyzed 14 members of a single family of closely related SDs in the human genome, some of which are associated with common inversion polymorphisms at chromosomes 8p23 and 4p16. Comparative analysis with the mouse genome revealed syntenic inversions for these two human polymorphic loci. In addition, 12 of the 14 SDs, while absent in the mouse genome, occur at the breaks of synteny, suggesting a non-random involvement of these sequences in genome evolution. Furthermore, we observed a syntenic familial relationship between 8 and 12 breakpoint-loci, where broken synteny that ends at one family member resumes at another, even across different chromosomes. Subsequent genome-wide assessment revealed that this relationship, which we named continuation-of-synteny, is not limited to the 8p23 family and occurs 46 times in the human genome, with high frequency at specific chromosomes. Our analysis supports a non-random breakage model of genomic evolution with an active involvement of segmental duplications for specific regions of the human genome.
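
The continuation-of-synteny relationship described above can be illustrated with a minimal sketch. This is not the authors' method: the data layout, the `Block` and `SD` types, the `continuation_pairs` function, the 500 kb window, and all coordinates are hypothetical and chosen only for illustration. The sketch assumes synteny blocks are given as paired human and mouse intervals, ignores block orientation, and reports cases where two blocks that are consecutive in the mouse genome end and resume near two different SD family members in the human genome.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Block:
    """One human-mouse synteny block (hypothetical layout)."""
    human_chrom: str
    human_start: int
    human_end: int
    mouse_chrom: str
    mouse_start: int
    mouse_end: int

@dataclass(frozen=True)
class SD:
    """One member of a segmental duplication family (human coordinates)."""
    name: str
    chrom: str
    pos: int

def nearest_sd(chrom, pos, sds, window):
    """Return the SD closest to pos on chrom within window bp, or None."""
    hits = [sd for sd in sds if sd.chrom == chrom and abs(sd.pos - pos) <= window]
    return min(hits, key=lambda sd: abs(sd.pos - pos)) if hits else None

def continuation_pairs(blocks, sds, window=500_000):
    """Find SD pairs where synteny ends at one member and resumes at another.

    Blocks are walked in mouse order; orientation is ignored (an assumption),
    so the human end of one block is compared with the human start of the next.
    """
    ordered = sorted(blocks, key=lambda b: (b.mouse_chrom, b.mouse_start))
    pairs = []
    for a, b in zip(ordered, ordered[1:]):
        if a.mouse_chrom != b.mouse_chrom:
            continue  # not neighbours in the mouse genome
        sd_a = nearest_sd(a.human_chrom, a.human_end, sds, window)
        sd_b = nearest_sd(b.human_chrom, b.human_start, sds, window)
        if sd_a and sd_b and sd_a != sd_b:
            pairs.append((sd_a.name, sd_b.name))
    return pairs

# Made-up toy data: not real genome coordinates.
toy_blocks = [
    Block("chr8", 2_000_000, 7_900_000, "chrA", 10_000_000, 15_000_000),
    Block("chr4", 9_100_000, 12_000_000, "chrA", 15_100_000, 18_000_000),
    Block("chr1", 1_000_000, 5_000_000, "chrB", 1_000_000, 5_000_000),
]
toy_sds = [SD("SD_8p", "chr8", 8_000_000), SD("SD_4p", "chr4", 9_000_000)]

print(continuation_pairs(toy_blocks, toy_sds))
```

In the toy data, the chr8 block ends near one SD and the next block in mouse order starts near a different SD on chr4, so that pair is reported. A real analysis would also need to handle inverted blocks and choose the window from the data.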


Source
http://dx.doi.org/10.1007/s00439-006-0277-z
