Dataset Information

Gaussian-Distributed Codon Frequencies of Genomes.


ABSTRACT: DNA encodes protein primary structure using 64 different codons to specify 20 different amino acids and a stop signal. Frequencies of codon occurrence when ordered in descending sequence provide a global characterization of a genome's preference (bias) for using the different codons of the redundant genetic code. Whereas frequency/rank relations have been described by empirical expressions, here we propose a statistical model in which two different forms of codon usage co-exist in a genome. We investigate whether such a model can account for the range of codon usages observed in a large set of genomes from different taxa. The differences in frequency/rank relations across these genomes can be expressed in a single parameter, the proportion of the two codon compartments. One compartment uses different codons with weak bias according to a Gaussian distribution of frequency, the other uses different codons with strong bias. In prokaryotic genomes both compartments appear to be present in a wide range of proportions, whereas in eukaryotic genomes the compartment with Gaussian distribution tends to dominate. Codon frequencies that are Gaussian-distributed suggest that many evolutionary conditions are involved in shaping weakly-biased codon usage, whereas strong bias in codon usage suggests dominance of few evolutionary conditions.
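The frequency/rank relation mentioned in the abstract can be illustrated with a minimal sketch. It assumes the input is a single in-frame coding sequence given as a DNA string; the function name `codon_rank_frequencies` and the toy sequence are hypothetical, and this is not the authors' pipeline or their two-compartment model, only the counting and ordering step.

```python
from collections import Counter
from itertools import product

# All 64 codons of the standard DNA alphabet.
CODONS = ["".join(p) for p in product("ACGT", repeat=3)]


def codon_rank_frequencies(cds):
    """Return the 64 codon frequencies of an in-frame sequence, sorted descending.

    Assumption: `cds` starts in frame. Trailing bases that do not fill a codon
    and triplets containing non-ACGT symbols are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    counts = Counter(cds[i:i + 3] for i in range(0, usable, 3))
    total = sum(counts[c] for c in CODONS)
    if total == 0:
        raise ValueError("sequence contains no valid codons")
    # Codons that never occur get frequency 0, so the result always has 64 entries.
    freqs = [counts[c] / total for c in CODONS]
    return sorted(freqs, reverse=True)


# Toy example (hypothetical sequence): ATG GCT GCT GCT TAA
ranked = codon_rank_frequencies("ATGGCTGCTGCTTAA")
```

Plotting `ranked` against rank 1 to 64 for a whole genome's coding sequences gives the kind of frequency/rank curve the abstract describes; fitting the two compartments would require the model details from the paper itself.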

SUBMITTER: Khomtchouk BB 

PROVIDER: S-EPMC6505138 | biostudies-other | 2019 May

REPOSITORIES: biostudies-other

Publications

Gaussian-Distributed Codon Frequencies of Genomes.

Khomtchouk BB, Nonner W

G3 (Bethesda, Md.), 2019-05-07, issue 5


Similar Datasets

| S-EPMC7300273 | biostudies-literature
| S-EPMC5710610 | biostudies-literature
| S-EPMC6089687 | biostudies-literature
| S-EPMC231558 | biostudies-other
| S-EPMC1936986 | biostudies-literature
| S-EPMC1919372 | biostudies-literature
| S-EPMC6129299 | biostudies-literature
| S-EPMC3422295 | biostudies-literature
| S-EPMC2717381 | biostudies-literature
| S-EPMC6513156 | biostudies-literature