The genome-sequencing gold rush has facilitated the use of comparative genomics to uncover patterns of genome evolution, although their causal mechanisms remain elusive. One such trend, ubiquitous to prokarya and eukarya, is the association of insertion/deletion mutations (indels) with increases in the nucleotide substitution rate extending over hundreds of base pairs. The prevailing hypothesis is that indels are themselves mutagenic agents. Here, we employ population genomics data from Escherichia coli, Saccharomyces paradoxus, and Drosophila to provide evidence suggesting that it is not the indels per se but the sequence in which indels occur that causes the accumulation of nucleotide substitutions. We found that about two-thirds of indels are closely associated with repeat sequences and that repeat sequence abundance could be used to identify regions of elevated sequence diversity, independently of indels. Moreover, the mutational signature of indel-proximal nucleotide substitutions matches that of error-prone DNA polymerases. We propose that repeat sequences promote an increased probability of replication fork arrest, causing the persistent recruitment of error-prone DNA polymerases to specific sequence regions over evolutionary time scales. Experimental measures of the mutation rates of engineered DNA sequences and analyses of experimentally obtained collections of spontaneous mutations provide molecular evidence supporting our hypothesis. This study uncovers a new role for repeat sequences in genome evolution and provides an explanation of how fine-scale sequence contextual effects influence mutation rates and thereby evolution.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3114760PMC
http://dx.doi.org/10.1371/journal.pbio.1000622DOI Listing

Publication Analysis

Top Keywords

repeat sequences
16
nucleotide substitutions
12
insertion/deletion mutations
8
associated repeat
8
genome evolution
8
error-prone dna
8
dna polymerases
8
mutation rates
8
indels
6
repeat
5

Similar Publications

Direct repeats found in the vicinity of intron splice sites.

Naturwissenschaften

January 2025

Department of Biology, University of Washington, Seattle, WA, 98195, USA.

Four main classes of introns (group I, group II, spliceosomal, and archaeal) have been reported for all major types of RNA from nuclei and organelles of a wide range of taxa. When and how introns inserted within the genic regions of genomes, however, is often unclear. Introns were examined from Archaea, Bacteria, and Eukarya.

View Article and Find Full Text PDF

Repetitive elements are the main components of many plant genomes and play a crucial role in the variation of genome size and structure, ultimately impacting species diversification and adaptation. Alstroemeriaceae exhibits species with large genomes, not attributed to polyploidy. In this study, we analysed the repetitive fraction of the genome of Bomarea edulis through low-coverage sequencing and in silico characterization, and compared it to the repeats of Alstroemeria longistaminea, a species from a sister genus that has been previously characterized.

View Article and Find Full Text PDF

fungal species are considered major plant pathogens, infecting various crops and resulting in significant agricultural losses. Additionally, these species can contaminate grain with multiple mycotoxins that are harmful to humans and animals. Efficient pest management relies on timely detection and identification of phytopathogens in plant and grain samples, facilitating prompt selection of a crop protection strategy.

View Article and Find Full Text PDF

Ditylenchus destructor, commonly known as the potato rot nematode, is a significant plant-parasitic pathogen affecting over 120 plant species globally. Effective control measures for D. destructor are limited, underscoring the need a high-quality reference genome to understand its pathogenic mechanisms.

View Article and Find Full Text PDF

Calligonum polygonoides, an endangered species of desert due to poor regeneration and overexploitation, which requires immediate conservation attention. Genetic diversity analysis is crucial for effective conservation and management initiatives, for elite genotypes. Therefore, in the present study, SCoT (start codon target) and ISSR (inter simple sequence repeat) markers were used to investigate the genetic variability in 120 individuals of Calligonum polygonoides.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!