Unknown

Dataset Information

0

Predicting crossover generation in DNA shuffling.


ABSTRACT: We introduce a quantitative framework for assessing the generation of crossovers in DNA shuffling experiments. The approach uses free energy calculations and complete sequence information to model the annealing process. Statistics obtained for the annealing events then are combined with a reassembly algorithm to infer crossover allocation in the reassembled sequences. The fraction of reassembled sequences containing zero, one, two, or more crossovers and the probability that a given nucleotide position in a reassembled sequence is the site of a crossover event are estimated. Comparisons of the predictions against experimental data for five example systems demonstrate good agreement despite the fact that no adjustable parameters are used. An in silico case study of a set of 12 subtilases examines the effect of fragmentation length, annealing temperature, sequence identity and number of shuffled sequences on the number, type, and distribution of crossovers. A computational verification of crossover aggregation in regions of near-perfect sequence identity and the presence of synergistic reassembly in family DNA shuffling is obtained.

SUBMITTER: Moore GL 

PROVIDER: S-EPMC30635 | biostudies-literature | 2001 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Predicting crossover generation in DNA shuffling.

Moore G L GL   Maranas C D CD   Lutz S S   Benkovic S J SJ  

Proceedings of the National Academy of Sciences of the United States of America 20010301 6


We introduce a quantitative framework for assessing the generation of crossovers in DNA shuffling experiments. The approach uses free energy calculations and complete sequence information to model the annealing process. Statistics obtained for the annealing events then are combined with a reassembly algorithm to infer crossover allocation in the reassembled sequences. The fraction of reassembled sequences containing zero, one, two, or more crossovers and the probability that a given nucleotide p  ...[more]

Similar Datasets

| S-EPMC4128973 | biostudies-literature
| S-EPMC1222993 | biostudies-other
| S-EPMC152248 | biostudies-literature
| S-EPMC2677662 | biostudies-literature
| S-EPMC1149501 | biostudies-literature
| S-EPMC1072806 | biostudies-literature
| S-EPMC6550543 | biostudies-literature
| S-EPMC6284010 | biostudies-literature
| S-EPMC3656877 | biostudies-literature
| S-EPMC6358705 | biostudies-literature