Unknown

Dataset Information

0

Expanding the catalog of cas genes with metagenomes.


ABSTRACT: The CRISPR (clusters of regularly interspaced short palindromic repeats)-Cas adaptive immune system is an important defense system in bacteria, providing targeted defense against invasions of foreign nucleic acids. CRISPR-Cas systems consist of CRISPR loci and cas (CRISPR-associated) genes: sequence segments of invaders are incorporated into host genomes at CRISPR loci to generate specificity, while adjacent cas genes encode proteins that mediate the defense process. We pursued an integrated approach to identifying putative cas genes from genomes and metagenomes, combining similarity searches with genomic neighborhood analysis. Application of our approach to bacterial genomes and human microbiome datasets allowed us to significantly expand the collection of cas genes: the sequence space of the Cas9 family, the key player in the recently engineered RNA-guided platforms for genome editing in eukaryotes, is expanded by at least two-fold with metagenomic datasets. We found genes in cas loci encoding other functions, for example, toxins and antitoxins, confirming the recently discovered potential of coupling between adaptive immunity and the dormancy/suicide systems. We further identified 24 novel Cas families; one novel family contains 20 proteins, all identified from the human microbiome datasets, illustrating the importance of metagenomics projects in expanding the diversity of cas genes.

SUBMITTER: Zhang Q 

PROVIDER: S-EPMC3936711 | biostudies-literature | 2014 Feb

REPOSITORIES: biostudies-literature

Similar Datasets

2014-04-16 | E-GEOD-37151 | biostudies-arrayexpress
2014-04-16 | GSE37151 | GEO
| S-EPMC8394144 | biostudies-literature
| S-EPMC4322841 | biostudies-other
| S-EPMC9264700 | biostudies-literature
| S-EPMC9160773 | biostudies-literature
| S-EPMC8201803 | biostudies-literature
| S-EPMC8763822 | biostudies-literature
| S-EPMC4562114 | biostudies-literature
| S-EPMC5573215 | biostudies-literature