A variety of simple sequence repeats (SSRs) have been identified in the genome of Shigella flexneri serotype 2a (strain Sf301), an enteric pathogen that causes bacillary dysentery in man. The distribution of SSRs, with unit length ranging from 1 to 9 nucleotides, was biased in different regions of the genome. The tri-, tetra- and hexanucleotide SSRs prevailed in the coding regions while the mono- and dinucleotide SSRs were more common in the noncoding regions. Many intergenic SSRs are less than 30 bp away from the downstream open reading frames (ORFs), suggesting a potential role in transcriptional regulation. To study polymorphism of SSRs, we compared 17 coding-region SSRs from strain Sf301 with the corresponding sequences from 23 other strains of four Shigella species. Five chromosomal loci were found to be polymorphic, of which those from S. flexneri strains were most variable. Particularly interesting is the C5-1 locus in the coding sequence of the hcaD gene encoding a subunit of ferredoxin reductase. Depending on the insertion of variable numbers of the unit sequence (CGCAG), the Shigella hcaD genes can encode truncated products due to premature stop codons or frame shifts, or products with extended core alpha helices that leads to radical alterations in the predicted tertiary structure. Hence, SSRs may serve as genotyping markers for epidemiological investigations, and may offer insights into evolutionary adaptation of the pathogens.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1016/j.gene.2003.09.017 | DOI Listing |
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!