Dataset Information

Not all predicted CRISPR-Cas systems are equal: isolated cas genes and classes of CRISPR like elements.


ABSTRACT: The CRISPR-Cas systems in prokaryotes are RNA-guided immune systems that target and deactivate foreign nucleic acids. A typical CRISPR-Cas system consists of a CRISPR array of repeat and spacer units, and a locus of cas genes. The CRISPR and the cas locus are often located next to each other in the genome. However, there is no quantitative estimate of this co-location. In addition, ad hoc studies have shown that some non-CRISPR genomic elements contain repeat-spacer-like structures and are mistaken for CRISPRs. Using available genome sequences, we observed that a significant number of genomes have isolated cas loci and/or CRISPRs. We found that 11%, 22% and 28% of the type I, II and III cas loci, respectively, are isolated (either the genome has no CRISPRs at all, or its CRISPRs are distant from the cas locus). We identified a large number of genomic elements that superficially resemble CRISPRs but do not contain diverse spacers and have no companion cas genes. We called these elements false-CRISPRs and further classified them into groups, including tandem repeats and Staphylococcus aureus repeat (STAR)-like elements. This is the first systematic study to collect and characterize false-CRISPR elements. We demonstrated that false-CRISPRs can be used to reduce the false annotation of CRISPRs, and are therefore useful for improving the annotation of CRISPR-Cas systems.
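
The two criteria named in the abstract (a cas locus is "isolated" when no CRISPR lies nearby in the same genome; a false-CRISPR lacks diverse spacers and has no companion cas genes) can be illustrated with a minimal Python sketch. This is not the authors' pipeline. The `Locus` type, the function names, the 10 kb distance cutoff, and the 0.5 spacer-diversity cutoff are all hypothetical, since the abstract gives no thresholds.

```python
from dataclasses import dataclass
from typing import List

# Hypothetical cutoffs: the abstract does not state the values actually used.
MAX_DISTANCE_BP = 10_000   # how close a CRISPR must be to count as co-located
MIN_DIVERSITY = 0.5        # minimum fraction of distinct spacers in a real array


@dataclass
class Locus:
    genome: str  # genome (or replicon) identifier
    start: int   # start coordinate in bp
    end: int     # end coordinate in bp


def gap(a: Locus, b: Locus) -> int:
    """Distance in bp between two loci; 0 if they overlap.

    Treats coordinates as linear, so it ignores wrap-around on
    circular genomes.
    """
    return max(0, max(a.start, b.start) - min(a.end, b.end))


def is_isolated(cas: Locus, crisprs: List[Locus],
                max_distance: int = MAX_DISTANCE_BP) -> bool:
    """True if no CRISPR in the same genome lies within max_distance."""
    same_genome = [c for c in crisprs if c.genome == cas.genome]
    return not any(gap(cas, c) <= max_distance for c in same_genome)


def spacer_diversity(spacers: List[str]) -> float:
    """Fraction of spacers that are distinct (0.0 for an empty array)."""
    return len(set(spacers)) / len(spacers) if spacers else 0.0


def looks_like_false_crispr(spacers: List[str], has_companion_cas: bool,
                            min_diversity: float = MIN_DIVERSITY) -> bool:
    """Flag arrays with low spacer diversity and no companion cas genes."""
    return (not has_companion_cas) and spacer_diversity(spacers) < min_diversity
```

Under these assumptions, a cas locus counts as isolated both when its genome has no CRISPR at all and when every CRISPR in that genome is farther away than the cutoff, which matches the two cases in the abstract's parenthetical.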

SUBMITTER: Zhang Q 

PROVIDER: S-EPMC5294841 | biostudies-literature | 2017 Feb

REPOSITORIES: biostudies-literature

Publications

Not all predicted CRISPR-Cas systems are equal: isolated cas genes and classes of CRISPR like elements.

Zhang Q, Ye Y

BMC Bioinformatics, 2017-02-06, issue 1


Similar Datasets

| S-EPMC6360697 | biostudies-literature
| S-EPMC6709367 | biostudies-literature
| S-EPMC4053933 | biostudies-literature
| S-EPMC6546389 | biostudies-literature
| S-EPMC8658065 | biostudies-literature
| S-EPMC3795411 | biostudies-literature
| S-EPMC5901762 | biostudies-literature
| S-EPMC5300952 | biostudies-literature
| S-EPMC6024849 | biostudies-literature
| S-EPMC4417044 | biostudies-literature