Dataset Information

Assessing optimal: inequalities in codon optimization algorithms.


ABSTRACT:

Background

Custom genes have become a common resource in recombinant biology over the last 20 years due to the plummeting cost of DNA synthesis. These genes are often "optimized" to non-native sequences for overexpression in a non-native host by substituting synonymous codons within the coding DNA sequence (CDS). A handful of studies have compared native and optimized CDSs, reporting different levels of soluble product due to the accumulation of misfolded aggregates, variable activity of enzymes, and, in at least one report, a change in substrate specificity. No study, to the best of our knowledge, has performed a practical comparison of CDSs generated from different codon optimization algorithms or reported the corresponding protein yields.

Results

In our efforts to understand what factors constitute an optimized CDS, we found little consensus among codon-optimization algorithms; a roughly equal chance that an algorithm-optimized CDS will increase or diminish recombinant yields compared with the native DNA; near-ubiquitous use of a codon database that was last updated in 2007; and high variability in the CDSs output by some algorithms. We present a case study, using KRas4B, to demonstrate that median codon frequency may be a better predictor of soluble yields than the more commonly used codon adaptation index (CAI).
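To make the two metrics concrete, the following is a minimal sketch of how a CAI and a median codon frequency could be computed for a CDS. It is an illustration under stated assumptions, not the authors' pipeline: the usage table `USAGE`, the synonym map `SYNONYMS`, and all function names are hypothetical, the frequencies are invented toy values, and "median codon frequency" is read here as the median per-thousand usage frequency of the codons in the CDS. CAI follows the usual definition as the geometric mean of each codon's relative adaptiveness (its frequency divided by that of the most frequent synonymous codon).

```python
from math import exp, log
from statistics import median

# Hypothetical toy codon-usage table (occurrences per thousand codons).
# These values are invented for illustration and come from no real database.
USAGE = {
    "AAA": 30.0, "AAG": 10.0,  # Lys
    "GAA": 40.0, "GAG": 20.0,  # Glu
    "GAT": 30.0, "GAC": 20.0,  # Asp
}

# Synonymous codon families covered by the toy table.
SYNONYMS = {
    "K": ["AAA", "AAG"],
    "E": ["GAA", "GAG"],
    "D": ["GAT", "GAC"],
}


def split_codons(cds):
    """Split a CDS into consecutive three-letter codons."""
    cds = cds.upper()
    if len(cds) == 0 or len(cds) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    return [cds[i:i + 3] for i in range(0, len(cds), 3)]


def relative_adaptiveness(usage, synonyms):
    """Weight each codon by its frequency relative to the most used synonym."""
    weights = {}
    for codons in synonyms.values():
        best = max(usage[c] for c in codons)
        for c in codons:
            weights[c] = usage[c] / best
    return weights


def cai(cds, usage=USAGE, synonyms=SYNONYMS):
    """Geometric mean of relative adaptiveness over all codons in the CDS."""
    weights = relative_adaptiveness(usage, synonyms)
    logs = [log(weights[c]) for c in split_codons(cds)]
    return exp(sum(logs) / len(logs))


def median_codon_frequency(cds, usage=USAGE):
    """Median per-thousand usage frequency of the codons in the CDS."""
    return median(usage[c] for c in split_codons(cds))
```

For example, `cai("AAAGAGGAT")` and `median_codon_frequency("AAAGAGGAT")` score the same three-codon sequence in two ways: the first penalizes the single less-preferred Glu codon multiplicatively, while the second reports only the middle value of the raw frequencies. A real analysis would need a full 64-codon table for the expression host and a policy for start, stop, and single-codon amino acids, none of which this sketch handles.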

Conclusions

We present a method for visualizing, analyzing, and comparing algorithm-optimized DNA sequences for recombinant protein expression. We encourage researchers to consider whether DNA optimization is right for their experiments, and to work towards improving the reproducibility of published recombinant work by publishing non-native CDSs.

SUBMITTER: Ranaghan MJ 

PROVIDER: S-EPMC7893858 | biostudies-literature | 2021 Feb

REPOSITORIES: biostudies-literature

Publications

Assessing optimal: inequalities in codon optimization algorithms.

Ranaghan MJ, Li JJ, Laprise DM, Garvie CW

BMC Biology, 2021-02-19, issue 1


Similar Datasets

2019-05-17 | GSE123611 | GEO
2015-06-04 | E-GEOD-67387 | biostudies-arrayexpress
| S-EPMC2700274 | biostudies-literature
| S-EPMC8555812 | biostudies-literature
| S-EPMC2839124 | biostudies-literature
| S-EPMC5315462 | biostudies-literature
| S-EPMC3278643 | biostudies-literature
2019-05-17 | GSE123608 | GEO
2019-05-17 | GSE112085 | GEO
| S-EPMC4436220 | biostudies-literature