Unknown

Dataset Information

0

Predicting synonymous codon usage and optimizing the heterologous gene for expression in E. coli.


ABSTRACT: Of the 20 common amino acids, 18 are encoded by multiple synonymous codons. These synonymous codons are not redundant; in fact, all of codons contribute substantially to protein expression, structure and function. In this study, the codon usage pattern of genes in the E. coli was learned from the sequenced genomes of E. coli. A machine learning based method, Presyncodon was proposed to predict synonymous codon selection in E. coli based on the learned codon usage patterns of the residue in the context of the specific fragment. The predicting results indicate that Presycoden could be used to predict synonymous codon selection of the gene in the E. coli with the high accuracy. Two reporter genes (egfp and mApple) were designed with a combination of low- and high-frequency-usage codons by the method. The fluorescence intensity of eGFP and mApple expressed by the (egfp and mApple) designed by this method was about 2.3- or 1.7- folds greater than that from the genes with only high-frequency-usage codons in E. coli. Therefore, both low- and high-frequency-usage codons make positive contributions to the functional expression of the heterologous proteins. This method could be used to design synthetic genes for heterologous gene expression in biotechnology.

SUBMITTER: Tian J 

PROVIDER: S-EPMC5577221 | biostudies-literature | 2017 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Predicting synonymous codon usage and optimizing the heterologous gene for expression in E. coli.

Tian Jian J   Yan Yaru Y   Yue Qingxia Q   Liu Xiaoqing X   Chu Xiaoyu X   Wu Ningfeng N   Fan Yunliu Y  

Scientific reports 20170830 1


Of the 20 common amino acids, 18 are encoded by multiple synonymous codons. These synonymous codons are not redundant; in fact, all of codons contribute substantially to protein expression, structure and function. In this study, the codon usage pattern of genes in the E. coli was learned from the sequenced genomes of E. coli. A machine learning based method, Presyncodon was proposed to predict synonymous codon selection in E. coli based on the learned codon usage patterns of the residue in the c  ...[more]

Similar Datasets

2022-05-16 | GSE202900 | GEO
| S-EPMC2646356 | biostudies-other
| S-EPMC7750956 | biostudies-literature
| S-EPMC10790568 | biostudies-literature
| S-EPMC6045634 | biostudies-literature
| S-EPMC3400985 | biostudies-literature
| S-EPMC4355305 | biostudies-literature
| S-EPMC4188181 | biostudies-literature
| S-EPMC4257736 | biostudies-literature
| S-EPMC10861588 | biostudies-literature