Ontology highlight
ABSTRACT:
SUBMITTER: Shen MW
PROVIDER: S-EPMC8551035 | biostudies-literature | 2021 Nov
REPOSITORIES: biostudies-literature
Shen Max W MW Zhao Kevin T KT Liu David R DR
Nature chemical biology 20211011 11
Directed evolution can generate proteins with tailor-made activities. However, full-length genotypes, their frequencies and fitnesses are difficult to measure for evolving gene-length biomolecules using most high-throughput DNA sequencing methods, as short read lengths can lose mutation linkages in haplotypes. Here we present Evoracle, a machine learning method that accurately reconstructs full-length genotypes (R<sup>2</sup> = 0.94) and fitness using short-read data from directed evolution expe ...[more]