Unknown

Dataset Information

0

BAGEL2: mining for bacteriocins in genomic data.


ABSTRACT: Mining bacterial genomes for bacteriocins is a challenging task due to the substantial structure and sequence diversity, and generally small sizes, of these antimicrobial peptides. Major progress in the research of antimicrobial peptides and the ever-increasing quantities of genomic data, varying from (un)finished genomes to meta-genomic data, led us to develop the significantly improved genome mining software BAGEL2, as a follow-up of our previous BAGEL software. BAGEL2 identifies putative bacteriocins on the basis of conserved domains, physical properties and the presence of biosynthesis, transport and immunity genes in their genomic context. The software supports parameter-free, class-specific mining and has high-throughput capabilities. Besides building an expert validated bacteriocin database, we describe the development of novel Hidden Markov Models (HMMs) and the interpretation of combinations of HMMs via simple decision rules for prediction of bacteriocin (sub-)classes. Furthermore, the genetic context is automatically annotated based on (combinations of) PFAM domains and databases of known context genes. The scoring system was fine-tuned using expert knowledge on data derived from screening all bacterial genomes currently available at the NCBI. BAGEL2 is freely accessible at http://bagel2.molgenrug.nl.

SUBMITTER: de Jong A 

PROVIDER: S-EPMC2896169 | biostudies-literature | 2010 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

BAGEL2: mining for bacteriocins in genomic data.

de Jong Anne A   van Heel Auke J AJ   Kok Jan J   Kuipers Oscar P OP  

Nucleic acids research 20100512 Web Server issue


Mining bacterial genomes for bacteriocins is a challenging task due to the substantial structure and sequence diversity, and generally small sizes, of these antimicrobial peptides. Major progress in the research of antimicrobial peptides and the ever-increasing quantities of genomic data, varying from (un)finished genomes to meta-genomic data, led us to develop the significantly improved genome mining software BAGEL2, as a follow-up of our previous BAGEL software. BAGEL2 identifies putative bact  ...[more]

Similar Datasets

| S-EPMC3173205 | biostudies-literature
| S-EPMC4451026 | biostudies-literature
| S-EPMC8301259 | biostudies-literature
| S-EPMC6694479 | biostudies-literature
| S-EPMC8659200 | biostudies-literature
| S-EPMC6298052 | biostudies-literature
| S-EPMC7987574 | biostudies-literature
| S-EPMC3355044 | biostudies-literature
| S-EPMC3140520 | biostudies-literature
| S-EPMC5024470 | biostudies-literature