Unknown

Dataset Information

0

Structure-specific DNA recombination sites: Design, validation, and machine learning-based refinement.


ABSTRACT: Recombination systems are widely used as bioengineering tools, but their sites have to be highly similar to a consensus sequence or to each other. To develop a recombination system free of these constraints, we turned toward attC sites from the bacterial integron system: single-stranded DNA hairpins specifically recombined by the integrase. Here, we present an algorithm that generates synthetic attC sites with conserved structural features and minimal sequence-level constraints. We demonstrate that all generated sites are functional, their recombination efficiency can reach 60%, and they can be embedded into protein coding sequences. To improve recombination of less efficient sites, we applied large-scale mutagenesis and library enrichment coupled to next-generation sequencing and machine learning. Our results validated the efficiency of this approach and allowed us to refine synthetic attC design principles. They can be embedded into virtually any sequence and constitute a unique example of a structure-specific DNA recombination system.

SUBMITTER: Nivina A 

PROVIDER: S-EPMC7439510 | biostudies-literature | 2020 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Structure-specific DNA recombination sites: Design, validation, and machine learning-based refinement.

Nivina Aleksandra A   Grieb Maj Svea MS   Loot Céline C   Bikard David D   Cury Jean J   Shehata Laila L   Bernardes Juliana J   Mazel Didier D  

Science advances 20200724 30


Recombination systems are widely used as bioengineering tools, but their sites have to be highly similar to a consensus sequence or to each other. To develop a recombination system free of these constraints, we turned toward <i>attC</i> sites from the bacterial integron system: single-stranded DNA hairpins specifically recombined by the integrase. Here, we present an algorithm that generates synthetic <i>attC</i> sites with conserved structural features and minimal sequence-level constraints. We  ...[more]

Similar Datasets

2021-06-01 | GSE171549 | GEO
| S-EPMC8062585 | biostudies-literature
2021-07-26 | GSE175955 | GEO
| S-EPMC8201040 | biostudies-literature
| S-EPMC2708028 | biostudies-literature
| S-EPMC4802303 | biostudies-literature
| PRJNA720048 | ENA
| S-EPMC4304695 | biostudies-literature
2023-06-01 | GSE193400 | GEO
| S-EPMC6180079 | biostudies-other