Simple sequence repeats in prokaryotic genomes.
Ontology highlight
ABSTRACT: Simple sequence repeats (SSRs) in DNA sequences are composed of tandem iterations of short oligonucleotides and may have functional and/or structural properties that distinguish them from general DNA sequences. They are variable in length because of slip-strand mutations and may also affect local structure of the DNA molecule or the encoded proteins. Long SSRs (LSSRs) are common in eukaryotes but rare in most prokaryotes. In pathogens, SSRs can enhance antigenic variance of the pathogen population in a strategy that counteracts the host immune response. We analyze representations of SSRs in >300 prokaryotic genomes and report significant differences among different prokaryotes as well as among different types of SSRs. LSSRs composed of short oligonucleotides (1-4 bp length, designated LSSR(1-4)) are often found in host-adapted pathogens with reduced genomes that are not known to readily survive in a natural environment outside the host. In contrast, LSSRs composed of longer oligonucleotides (5-11 bp length, designated LSSR(5-11)) are found mostly in nonpathogens and opportunistic pathogens with large genomes. Comparisons among SSRs of different lengths suggest that LSSR(1-4) are likely maintained by selection. This is consistent with the established role of some LSSR(1-4) in enhancing antigenic variance. By contrast, abundance of LSSR(5-11) in some genomes may reflect the SSRs' general tendency to expand rather than their specific role in the organisms' physiology. Differences among genomes in terms of SSR representations and their possible interpretations are discussed.
SUBMITTER: Mrazek J
PROVIDER: S-EPMC1895974 | biostudies-literature |
REPOSITORIES: biostudies-literature
ACCESS DATA