Strand asymmetry and codon usage bias in the chloroplast genome of Euglena gracilis.
Ontology highlight
ABSTRACT: It is shown that the two strands of the chloroplast genome from Euglena gracilis are asymmetric with regards to nucleotide composition. This asymmetry switches at both the origin of replication and a location that is halfway around the circular genome from the origin. In both halves of the genome the leading strand is G+T-rich, having a bias toward G over C and T over A, and the lagging strand is A+C-rich. This asymmetry is probably the result of a difference in mutation dynamics between the leading and lagging strands. In addition to composition asymmetry, the two strands differ with regards to coding content. In both halves of the genome the vast majority of genes are coded by the leading strand. These two aspects of strand asymmetry are then applied to a statistical test for selection on codon usage. The results indicate that selection on codon usage is limited to genes on the leading strand; no gene on the A+C-rich lagging strand shows evidence for selection, suggesting that highly expressed genes are coded predominantly on the strand of DNA that is the leading strand during replication. On the basis of these observations it is proposed that the coding strand bias is generated by selection to code highly expressed genes on the leading strand to coordinate the direction of replication and transcription, thereby increasing the potential rate of both reactions.
SUBMITTER: Morton BR
PROVIDER: S-EPMC21827 | biostudies-literature | 1999 Apr
REPOSITORIES: biostudies-literature
ACCESS DATA