Unknown

Dataset Information

0

Elucidation of Codon Usage Signatures across the Domains of Life.


ABSTRACT: Because of the degeneracy of the genetic code, multiple codons are translated into the same amino acid. Despite being "synonymous," these codons are not equally used. Selective pressures are thought to drive the choice among synonymous codons within a genome, while GC content, which is typically attributed to mutational drift, is the major determinant of variation across species. Here, we find that in addition to GC content, interspecies codon usage signatures can also be detected. More specifically, we show that a single amino acid, arginine, is the major contributor to codon usage bias differences across domains of life. We then exploit this finding and show that domain-specific codon bias signatures can be used to classify a given sequence into its corresponding domain of life with high accuracy. We then wondered whether the inclusion of codon usage codon autocorrelation patterns, which reflects the nonrandom distribution of codon occurrences throughout a transcript, might improve the classification performance of our algorithm. However, we find that autocorrelation patterns are not domain-specific, and surprisingly, are unrelated to tRNA reusage, in contrast to previous reports. Instead, our results suggest that codon autocorrelation patterns are a by-product of codon optimality throughout a sequence, where highly expressed genes display autocorrelated "optimal" codons, whereas lowly expressed genes display autocorrelated "nonoptimal" codons.

SUBMITTER: Novoa EM 

PROVIDER: S-EPMC6759073 | biostudies-literature | 2019 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Elucidation of Codon Usage Signatures across the Domains of Life.

Novoa Eva Maria EM   Jungreis Irwin I   Jaillon Olivier O   Kellis Manolis M  

Molecular biology and evolution 20191001 10


Because of the degeneracy of the genetic code, multiple codons are translated into the same amino acid. Despite being "synonymous," these codons are not equally used. Selective pressures are thought to drive the choice among synonymous codons within a genome, while GC content, which is typically attributed to mutational drift, is the major determinant of variation across species. Here, we find that in addition to GC content, interspecies codon usage signatures can also be detected. More specific  ...[more]

Similar Datasets

| S-EPMC1447655 | biostudies-literature
| S-EPMC8317675 | biostudies-literature
| S-EPMC2839124 | biostudies-literature
| S-EPMC8613526 | biostudies-literature
| S-EPMC6934141 | biostudies-literature
| S-EPMC2585594 | biostudies-literature
| S-EPMC3283889 | biostudies-literature
| S-EPMC6669465 | biostudies-literature
| S-EPMC7468442 | biostudies-literature
| S-EPMC3400985 | biostudies-literature