Unknown

Dataset Information

0

Basic principles of the genetic code extension.


ABSTRACT: Compounds including non-canonical amino acids (ncAAs) or other artificially designed molecules can find a lot of applications in medicine, industry and biotechnology. They can be produced thanks to the modification or extension of the standard genetic code (SGC). Such peptides or proteins including the ncAAs can be constantly delivered in a stable way by organisms with the customized genetic code. Among several methods of engineering the code, using non-canonical base pairs is especially promising, because it enables generating many new codons, which can be used to encode any new amino acid. Since even one pair of new bases can extend the SGC up to 216 codons generated by a six-letter nucleotide alphabet, the extension of the SGC can be achieved in many ways. Here, we proposed a stepwise procedure of the SGC extension with one pair of non-canonical bases to minimize the consequences of point mutations. We reported relationships between codons in the framework of graph theory. All 216 codons were represented as nodes of the graph, whereas its edges were induced by all possible single nucleotide mutations occurring between codons. Therefore, every set of canonical and newly added codons induces a specific subgraph. We characterized the properties of the induced subgraphs generated by selected sets of codons. Thanks to that, we were able to describe a procedure for incremental addition of the set of meaningful codons up to the full coding system consisting of three pairs of bases. The procedure of gradual extension of the SGC makes the whole system robust to changing genetic information due to mutations and is compatible with the views assuming that codons and amino acids were added successively to the primordial SGC, which evolved minimizing harmful consequences of mutations or mistranslations of encoded proteins.

SUBMITTER: Blazej P 

PROVIDER: S-EPMC7062095 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC5133446 | biostudies-literature
| S-EPMC6550552 | biostudies-literature
| S-EPMC3744885 | biostudies-literature
| S-EPMC3667586 | biostudies-literature
| S-EPMC10090134 | biostudies-literature
| S-EPMC7082724 | biostudies-literature
| S-EPMC4000838 | biostudies-literature
| S-EPMC4943584 | biostudies-literature
| S-EPMC7070569 | biostudies-literature
| S-EPMC4671265 | biostudies-literature