Unknown

Dataset Information

0

Analysis on frequency and density of microsatellites in coding sequences of several eukaryotic genomes.


ABSTRACT: Microsatellites or simple sequence repeats (SSRs) have been found in most organisms during the last decade. Since large-scale sequences are being generated, especially those that can be used to search for microsatellites, the development of these markers is getting more convenient. Keeping SSRs in viewing the importance of the application, available CDS (coding sequences) or ESTs (expressed sequence tags) of some eukaryotic species were used to study the frequency and density of various types of microsatellites. On the basis of surveying CDS or EST sequences amounting to 66.6 Mb in silkworm, 37.2 Mb in fly, 20.8 Mb in mosquito, 60.0 Mb in mouse, 34.9 Mb in zebrafish and 33.5 Mb in Caenorhabditis elegans, the frequency of SSRs was 1/1.00 Kb in silkworm, 1/0.77 Kb in fly, 1/1.03 Kb in mosquito, 1/1.21 Kb in mouse, 1/1.25 Kb in zebrafish and 1/1.38 Kb in C. elegans. The overall average SSR frequency of these species is 1/1.07 Kb. Hexanucleotide repeats (64.5%-76.6%) are the most abundant class of SSR in the investigated species, followed by trimeric, dimeric, tetrameric, monomeric and pentameric repeats. Furthermore, the A-rich repeats are predominant in each type of SSRs, whereas G-rich repeats are rare in the coding regions.

SUBMITTER: Li B 

PROVIDER: S-EPMC5172436 | biostudies-literature | 2004 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Analysis on frequency and density of microsatellites in coding sequences of several eukaryotic genomes.

Li Bin B   Xia Qingyou Q   Lu Cheng C   Zhou Zeyang Z   Xiang Zhonghuai Z  

Genomics, proteomics & bioinformatics 20040201 1


Microsatellites or simple sequence repeats (SSRs) have been found in most organisms during the last decade. Since large-scale sequences are being generated, especially those that can be used to search for microsatellites, the development of these markers is getting more convenient. Keeping SSRs in viewing the importance of the application, available CDS (coding sequences) or ESTs (expressed sequence tags) of some eukaryotic species were used to study the frequency and density of various types of  ...[more]

Similar Datasets

| S-EPMC1395342 | biostudies-literature
| S-EPMC1347423 | biostudies-literature
| S-EPMC9765019 | biostudies-literature
| S-EPMC1182411 | biostudies-literature
| S-EPMC1635734 | biostudies-literature
| S-EPMC6146118 | biostudies-literature
| S-EPMC7779043 | biostudies-literature
| S-EPMC9576210 | biostudies-literature
| PRJNA587984 | ENA
| S-EPMC3280489 | biostudies-literature