Unknown

Dataset Information

0

Patterns of microsatellite distribution across eukaryotic genomes.


ABSTRACT:

Background

Microsatellites, or Simple Sequence Repeats (SSRs), are short tandem repeats of 1-6?nt motifs present in all genomes. Emerging evidence points to their role in cellular processes and gene regulation. Despite the huge resource of genomic information currently available, SSRs have been studied in a limited context and compared across relatively few species.

Results

We have identified ~?685 million eukaryotic microsatellites and analyzed their genomic trends across 15 taxonomic subgroups from protists to mammals. The distribution of SSRs reveals taxon-specific variations in their exonic, intronic and intergenic densities. Our analysis reveals the differences among non-related species and novel patterns uniquely demarcating closely related species. We document several repeats common across subgroups as well as rare SSRs that are excluded almost throughout evolution. We further identify species-specific signatures in pathogens like Leishmania as well as in cereal crops, Drosophila, birds and primates. We also find that distinct SSRs preferentially exist as long repeating units in different subgroups; most unicellular organisms show no length preference for any SSR class, while many SSR motifs accumulate as long repeats in complex organisms, especially in mammals.

Conclusions

We present a comprehensive analysis of SSRs across taxa at an unprecedented scale. Our analysis indicates that the SSR composition of organisms with heterogeneous cell types is highly constrained, while simpler organisms such as protists, green algae and fungi show greater diversity in motif abundance, density and GC content. The microsatellite dataset generated in this work provides a large number of candidates for functional analysis and for studying their roles across the evolutionary landscape.

SUBMITTER: Srivastava S 

PROVIDER: S-EPMC6387519 | biostudies-literature | 2019 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Patterns of microsatellite distribution across eukaryotic genomes.

Srivastava Surabhi S   Avvaru Akshay Kumar AK   Sowpati Divya Tej DT   Mishra Rakesh K RK  

BMC genomics 20190222 1


<h4>Background</h4>Microsatellites, or Simple Sequence Repeats (SSRs), are short tandem repeats of 1-6 nt motifs present in all genomes. Emerging evidence points to their role in cellular processes and gene regulation. Despite the huge resource of genomic information currently available, SSRs have been studied in a limited context and compared across relatively few species.<h4>Results</h4>We have identified ~ 685 million eukaryotic microsatellites and analyzed their genomic trends across 15 taxo  ...[more]

Similar Datasets

| S-EPMC7953163 | biostudies-literature
| S-EPMC9951716 | biostudies-literature
| S-EPMC2636830 | biostudies-other
| S-EPMC1448744 | biostudies-literature
| S-EPMC9925362 | biostudies-literature
| S-EPMC3530887 | biostudies-other
| S-EPMC1431726 | biostudies-literature
| S-EPMC7058166 | biostudies-literature
| S-EPMC3321234 | biostudies-literature
| S-EPMC3091302 | biostudies-literature