Unique Features of Tandem Repeats in Bacteria.

J Bacteriol

Department of Computer Science, Universitat Politècnica de Catalunya, Barcelona, Spain.

Published: October 2020

DNA tandem repeats, or satellites, are well described in eukaryotic species, but little is known about their prevalence across prokaryotes. Here, we performed the most complete characterization to date of satellites in bacteria. We identified 121,638 satellites from 12,233 fully sequenced and assembled bacterial genomes with a very uneven distribution. We also determined the families of satellites which have a related sequence. There are 85 genomes that are particularly satellite rich and contain several families of satellites of yet unknown function. Interestingly, we only found two main types of noncoding satellites, depending on their repeat sizes, 22/44 or 52 nucleotides (nt). An intriguing feature is the constant size of the repeats in the genomes of different species, whereas their sequences show no conservation. Individual species also have several families of satellites with the same repeat length and different sequences. This result is in marked contrast with previous findings in eukaryotes, where noncoding satellites of many sizes are found in any species investigated. We describe in greater detail these noncoding satellites in the spirochete and in several bacilli. These satellites undoubtedly play a specific role in the species which have acquired them. We discuss the possibility that they represent binding sites for transcription factors not previously described or that they are involved in the stabilization of the nucleoid through interaction with proteins. We found an enigmatic group of noncoding satellites in 85 bacterial genomes with a constant repeat size but variable sequence. This pattern of DNA organization is unique and had not been previously described in bacteria. These findings strongly suggest that satellite size in some bacteria is under strong selective constraints and thus that satellites are very likely to play a fundamental role. We also provide a list and properties of all satellites in 12,233 genomes, which may be used for further genomic analysis.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7549362PMC
http://dx.doi.org/10.1128/JB.00229-20DOI Listing

Publication Analysis

Top Keywords

noncoding satellites
16
satellites
13
families satellites
12
tandem repeats
8
satellites 12233
8
bacterial genomes
8
species
5
genomes
5
unique features
4
features tandem
4

Similar Publications

Human satellitess(HSats) are pericentric, tandemly repeating satellite DNA sequences in the human genome. While silent in normal cells, a subset of HSat2 noncoding RNA is expressed and accumulates in the nucleus of cancer cells. We developed a FISH-based approach for identification of the distribution of three subfamilies of HSat2 (A1, A2, B) sequences on individual human chromosomes.

View Article and Find Full Text PDF

Comparative analysis of predicted DNA secondary structures infers complex human centromere topology.

Am J Hum Genet

December 2024

Laboratory of Genome Evolution, Department of Biology and Biotechnology Charles Darwin, Sapienza University of Rome, 00185 Rome, Italy. Electronic address:

Article Synopsis
  • - The text discusses how secondary structures, which are unique arrangements of nucleic acids caused by internal interactions, can occur in both RNA and single-stranded DNA, impacting key processes like DNA replication and transcription, thus affecting genome stability.
  • - The study focuses on the comparison of secondary structures in linear single-stranded DNA from different specialized human loci, such as centromeres and coding regions, revealing that centromeres have the highest complexity and instability in their secondary structures.
  • - Findings indicate that the intricate self-hybridizing properties of centromeric repeats may lead to chromosome missegregation when chromatin is disrupted, highlighting the functional importance of these structures in various DNA processes like transcription and recombination.
View Article and Find Full Text PDF

Cerebral palsy (CP) is a pediatric onset disorder with poorly understood molecular causes and progression, making early diagnosis difficult. Circular RNAs are regulatory RNAs that show promise as biomarkers in various diseases but the role of circular RNAs in CP is beginning to be understood. This study identified the role of circNFIX in regulating the expression of myocyte-specific enhancer factor 2C (MEF2C), an important transcription factor for sarcomere development.

View Article and Find Full Text PDF

Chromosomal heteromorphisms (CHs) are morphological variations predominantly found in constitutive heterochromatic regions of the genome, primarily composed of tandemly repetitive sequences of satellite DNA. Although not completely devoid of genes, these regions are typically not transcribed into proteins and lack obvious phenotypic impact. Nonetheless, their clinical importance is increasingly under scrutiny, with several studies aiming to assess their influence on human diseases and susceptibilities, especially as they are seemingly part of the long noncoding RNAs in certain tissues.

View Article and Find Full Text PDF
Article Synopsis
  • * The study analyzed lncRNA expression in sheep at different developmental stages (embryos, lambs, and adults) and identified 4,738 lncRNAs, with 997 showing differential expression relevant to muscle development.
  • * A specific lncRNA, GTL2, was found to decrease during muscle development and was crucial for regulating satellite cell activities via the PKA-CREB signaling pathway, providing new insights into muscle growth mechanisms.
View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!