Unknown

Dataset Information

0

UMI-linked consensus sequencing enables phylogenetic analysis of directed evolution.


ABSTRACT: The success of protein evolution campaigns is strongly dependent on the sequence context in which mutations are introduced, stemming from pervasive non-additive interactions between a protein's amino acids ('intra-gene epistasis'). Our limited understanding of such epistasis hinders the correct prediction of the functional contributions and adaptive potential of mutations. Here we present a straightforward unique molecular identifier (UMI)-linked consensus sequencing workflow (UMIC-seq) that simplifies mapping of evolutionary trajectories based on full-length sequences. Attaching UMIs to gene variants allows accurate consensus generation for closely related genes with nanopore sequencing. We exemplify the utility of this approach by reconstructing the artificial phylogeny emerging in three rounds of directed evolution of an amine dehydrogenase biocatalyst via ultrahigh throughput droplet screening. Uniquely, we are able to identify lineages and their founding variant, as well as non-additive interactions between mutations within a full gene showing sign epistasis. Access to deep and accurate long reads will facilitate prediction of key beneficial mutations and adaptive potential based on in silico analysis of large sequence datasets.

SUBMITTER: Zurek PJ 

PROVIDER: S-EPMC7691348 | biostudies-literature | 2020 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

UMI-linked consensus sequencing enables phylogenetic analysis of directed evolution.

Zurek Paul Jannis PJ   Knyphausen Philipp P   Neufeld Katharina K   Pushpanath Ahir A   Hollfelder Florian F  

Nature communications 20201126 1


The success of protein evolution campaigns is strongly dependent on the sequence context in which mutations are introduced, stemming from pervasive non-additive interactions between a protein's amino acids ('intra-gene epistasis'). Our limited understanding of such epistasis hinders the correct prediction of the functional contributions and adaptive potential of mutations. Here we present a straightforward unique molecular identifier (UMI)-linked consensus sequencing workflow (UMIC-seq) that sim  ...[more]

Similar Datasets

| S-EPMC10473586 | biostudies-literature
| S-EPMC11376274 | biostudies-literature
| S-EPMC5984373 | biostudies-literature
| S-EPMC9701214 | biostudies-literature
| S-EPMC7484502 | biostudies-literature
| S-EPMC3845167 | biostudies-literature
2024-07-07 | GSE235343 | GEO
| S-EPMC5023497 | biostudies-literature
| S-EPMC9010518 | biostudies-literature
| S-EPMC6423013 | biostudies-literature