Dataset Information

Mining new crystal protein genes from Bacillus thuringiensis on the basis of mixed plasmid-enriched genome sequencing and a computational pipeline.


ABSTRACT: We have designed a high-throughput system for the identification of novel crystal protein genes (cry) from Bacillus thuringiensis strains. The system was developed with two goals: (i) to acquire the mixed plasmid-enriched genomic sequence of B. thuringiensis using next-generation sequencing biotechnology, and (ii) to identify cry genes with a computational pipeline (using BtToxin_scanner). In our pipeline method, we employed three different kinds of well-developed prediction methods, BLAST, hidden Markov model (HMM), and support vector machine (SVM), to predict the presence of Cry toxin genes. The pipeline proved to be fast (average speed, 1.02 Mb/min for proteins and open reading frames [ORFs] and 1.80 Mb/min for nucleotide sequences), sensitive (it detected 40% more protein toxin genes than a keyword extraction method using genomic sequences downloaded from GenBank), and highly specific. Twenty-one strains from our laboratory's collection were selected based on their plasmid pattern and/or crystal morphology. The plasmid-enriched genomic DNA was extracted from these strains and mixed for Illumina sequencing. The sequencing data were de novo assembled, and a total of 113 candidate cry sequences were identified using the computational pipeline. Twenty-seven candidate sequences were selected on the basis of their low level of sequence identity to known cry genes, and eight full-length genes were obtained with PCR. Finally, three new cry-type genes (primary ranks) and five cry holotypes, which were designated cry8Ac1, cry7Ha1, cry21Ca1, cry32Fa1, and cry21Da1 by the B. thuringiensis Toxin Nomenclature Committee, were identified. The system described here is both efficient and cost-effective and can greatly accelerate the discovery of novel cry genes.
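The selection logic described in the abstract (flag sequences with BLAST, HMM, and SVM predictors, then keep candidates with low identity to known cry genes) can be illustrated with a minimal sketch. This is not BtToxin_scanner's code or interface. The `Candidate` record, the rule that any single predictor is enough to flag a sequence, the 50% identity cutoff, and the demo sequence IDs are all assumptions made for illustration; the abstract does not state how the three predictions are combined or what identity threshold was used.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class Candidate:
    """Hypothetical record for one predicted ORF (not a BtToxin_scanner type)."""
    seq_id: str
    blast_hit: bool      # flagged by a BLAST search against known toxins
    hmm_hit: bool        # flagged by a hidden Markov model
    svm_hit: bool        # flagged by a support vector machine
    max_identity: float  # percent identity to the closest known cry gene


def flagged(c: Candidate) -> bool:
    # Assumption: a sequence counts as a candidate if ANY predictor flags it.
    return c.blast_hit or c.hmm_hit or c.svm_hit


def select_novel(candidates: Iterable[Candidate],
                 identity_cutoff: float = 50.0) -> List[Candidate]:
    # Keep flagged candidates below the (illustrative) identity cutoff,
    # ordered from least to most similar to known genes.
    picked = [c for c in candidates
              if flagged(c) and c.max_identity < identity_cutoff]
    return sorted(picked, key=lambda c: c.max_identity)


# Made-up demo data; IDs and values are not from the study.
demo = [
    Candidate("contig_1_orf3", True, True, True, 98.5),
    Candidate("contig_7_orf1", False, True, True, 41.2),
    Candidate("contig_9_orf2", False, False, False, 12.0),
    Candidate("contig_4_orf5", True, False, False, 33.0),
]
novel_ids = [c.seq_id for c in select_novel(demo)]
print(novel_ids)
```

A union rule favors sensitivity, which matches the abstract's emphasis on detecting more toxin genes than keyword extraction; a stricter rule (for example, requiring agreement between two predictors) would trade sensitivity for specificity.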

SUBMITTER: Ye W 

PROVIDER: S-EPMC3416374 | biostudies-literature | 2012 Jul

REPOSITORIES: biostudies-literature

Publications

Mining new crystal protein genes from Bacillus thuringiensis on the basis of mixed plasmid-enriched genome sequencing and a computational pipeline.

Weixing Ye, Lei Zhu, Yingying Liu, Neil Crickmore, Donghai Peng, Lifang Ruan, Ming Sun

Applied and Environmental Microbiology, 2012-04-27 (14)


Similar Datasets

| S-EPMC2223206 | biostudies-literature
| S-EPMC151414 | biostudies-literature
| S-EPMC1148518 | biostudies-other
| S-EPMC1899880 | biostudies-literature
| S-EPMC1196294 | biostudies-literature
| S-EPMC98934 | biostudies-other
| S-EPMC3847046 | biostudies-literature
| S-EPMC4780641 | biostudies-literature
| S-EPMC7385851 | biostudies-literature
| S-EPMC1393223 | biostudies-literature