Unknown

Dataset Information

0

Automated identification of sequence-tailored Cas9 proteins using massive metagenomic data.


ABSTRACT: The identification of the protospacer adjacent motif (PAM) sequences of Cas9 nucleases is crucial for their exploitation in genome editing. Here we develop a computational pipeline that was used to interrogate a massively expanded dataset of metagenome and virome assemblies for accurate and comprehensive PAM predictions. This procedure allows the identification and isolation of sequence-tailored Cas9 nucleases by using the target sequence as bait. As proof of concept, starting from the disease-causing mutation P23H in the RHO gene, we find, isolate and experimentally validate a Cas9 which uses the mutated sequence as PAM. Our PAM prediction pipeline will be instrumental to generate a Cas9 nuclease repertoire responding to any PAM requirement.

SUBMITTER: Ciciani M 

PROVIDER: S-EPMC9617884 | biostudies-literature | 2022 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications


The identification of the protospacer adjacent motif (PAM) sequences of Cas9 nucleases is crucial for their exploitation in genome editing. Here we develop a computational pipeline that was used to interrogate a massively expanded dataset of metagenome and virome assemblies for accurate and comprehensive PAM predictions. This procedure allows the identification and isolation of sequence-tailored Cas9 nucleases by using the target sequence as bait. As proof of concept, starting from the disease-c  ...[more]

Similar Datasets

| S-EPMC6676585 | biostudies-literature
| S-EPMC7755236 | biostudies-literature
| S-EPMC2896179 | biostudies-literature
| S-EPMC9294416 | biostudies-literature
| S-EPMC8169707 | biostudies-literature
| S-EPMC10516513 | biostudies-literature
| S-EPMC3001099 | biostudies-literature
| S-EPMC7645119 | biostudies-literature
| S-EPMC4053813 | biostudies-other
| S-EPMC11020464 | biostudies-literature