Dataset Information

Two common profiles exist for genomic oligonucleotide frequencies.


ABSTRACT:

Background

A majority profile for trinucleotide frequencies among genomes was reported previously. Further study revealed that two common profiles, rather than one majority profile, exist for genomic trinucleotide frequencies. However, the origins of the common/majority profiles remain elusive. Moreover, it is not clear whether the features of the common profiles extend to oligonucleotides other than trinucleotides.

Findings

We analyzed 571 prokaryotic genomes (chromosomes), selected eukaryotic nuclear genomes, and other genetic systems to study their compositional features. We found that there are also two common profiles for genomic oligonucleotide frequencies: one from low-GC content genomes and the other from high-GC content genomes. Furthermore, each common profile is highly correlated with the average profile of random sequences that have the corresponding GC content and are generated according to first-order symmetry.
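
As a rough illustration of the comparison described above, the following Python sketch computes a k-mer frequency profile for a sequence and correlates it with the expected profile of random sequences at a given GC content. It assumes that first-order symmetry means equal A and T frequencies and equal C and G frequencies, and it replaces the averaging over generated random sequences with the analytic expectation under independent bases. The function names and the use of the Pearson coefficient are illustrative choices, not taken from the paper.

```python
from itertools import product
from math import sqrt

BASES = "ACGT"


def kmer_profile(seq, k):
    """Relative frequencies of all 4**k k-mers, in lexicographic order."""
    kmers = ["".join(p) for p in product(BASES, repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    seq = seq.upper()
    for i in range(len(seq) - k + 1):
        word = seq[i:i + k]
        if word in counts:  # skip windows containing N or other symbols
            counts[word] += 1
    total = sum(counts.values())
    return [counts[w] / total if total else 0.0 for w in kmers]


def expected_profile(gc, k):
    """Expected k-mer frequencies for random sequences with GC content `gc`.

    Assumption: bases are independent, with P(G) = P(C) = gc / 2 and
    P(A) = P(T) = (1 - gc) / 2 (our reading of first-order symmetry).
    """
    p = {"G": gc / 2, "C": gc / 2, "A": (1 - gc) / 2, "T": (1 - gc) / 2}
    profile = []
    for word in product(BASES, repeat=k):
        prob = 1.0
        for base in word:
            prob *= p[base]
        profile.append(prob)
    return profile


def pearson(x, y):
    """Pearson correlation of two equal-length lists (nan if one is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return float("nan")
    return cov / sqrt(vx * vy)
```

For a genome sequence held in a string `genome` (hypothetical), `pearson(kmer_profile(genome, 3), expected_profile(0.35, 3))` would give the correlation between its trinucleotide profile and the expected profile at 35% GC. Note that at exactly 50% GC the expected profile is flat, so the correlation is undefined and the sketch returns nan.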

Conclusions

The two common profiles appear to arise mainly from GC content variation and the strand symmetry of genomic sequences. Both GC content and strand symmetry would therefore play important roles in genome evolution.

SUBMITTER: Zhang SH 

PROVIDER: S-EPMC3532236 | biostudies-literature | 2012 Nov

REPOSITORIES: biostudies-literature

Publications

Two common profiles exist for genomic oligonucleotide frequencies.

Zhang Shang-Hong, Wang Lei

BMC Research Notes, 2012-11-17


Similar Datasets

| S-EPMC2924895 | biostudies-literature
| S-EPMC2883551 | biostudies-literature
| S-EPMC2911379 | biostudies-literature
| S-EPMC535426 | biostudies-literature
| S-EPMC7178395 | biostudies-literature
| S-EPMC2289816 | biostudies-literature
| S-EPMC2666768 | biostudies-literature
| S-EPMC2753849 | biostudies-literature
| S-EPMC5885355 | biostudies-literature
| S-EPMC296061 | biostudies-other