Dataset Information

NBC update: The addition of viral and fungal databases to the Naïve Bayes classification tool.


ABSTRACT: BACKGROUND: Classifying the fungal and viral content of a sample is an important component of analyzing microbial communities in environmental media. A method that can classify any DNA fragment from these organisms is therefore needed. RESULTS: We update the naïve Bayes classification (NBC) tool to classify reads originating from viral and fungal organisms. NBC classifies a fungal dataset similarly to the Basic Local Alignment Search Tool (BLAST) and the Ribosomal Database Project (RDP) classifier. We also show NBC's similarities to and differences from RDP on a fungal large subunit (LSU) ribosomal DNA dataset. For reads from viruses in the training database, strain-level classification accuracy is 98%; for reads originating from sequences not in the database, order-level accuracy is 78%, where order denotes the taxonomic rank in the tree of life. CONCLUSIONS: In addition to being competitive with other available classifiers, NBC has the potential to handle reads originating from any location in the genome. We recommend using the Bacteria/Archaea, Fungal, and Virus databases separately because the algorithm is biased towards long genomes. The tool is publicly available at: http://nbc.ece.drexel.edu.
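
The abstract does not describe the classifier's internals, so the following is only a minimal sketch of the general idea behind naïve Bayes read classification over fixed-length words (k-mers), not the NBC tool's actual implementation. The function names (`kmers`, `train`, `log_score`, `classify`), the add-one smoothing, the uniform prior over genomes, and the toy sequences are all assumptions made for illustration.

```python
import math
from collections import Counter


def kmers(seq, k):
    """Return all overlapping k-mers of a sequence, uppercased."""
    seq = seq.upper()
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def train(genomes, k):
    """Count k-mers per reference genome (hypothetical training step).

    genomes maps a label to its sequence string. Each model stores the
    k-mer counts, the total number of k-mers, and the vocabulary size.
    """
    vocab_size = 4 ** k  # assumes a plain A/C/G/T alphabet
    models = {}
    for name, seq in genomes.items():
        counts = Counter(kmers(seq, k))
        models[name] = (counts, sum(counts.values()), vocab_size)
    return models


def log_score(read, model, k):
    """Log-likelihood of a read under one genome model.

    Treats the read's k-mers as independent (the "naive" assumption)
    and uses add-one smoothing so unseen k-mers do not give log(0).
    """
    counts, total, vocab_size = model
    return sum(
        math.log((counts[km] + 1) / (total + vocab_size))
        for km in kmers(read, k)
    )


def classify(read, models, k):
    """Return the label whose model gives the read the highest score.

    A uniform prior over genomes is assumed, so the prior is omitted.
    """
    return max(models, key=lambda name: log_score(read, models[name], k))


# Toy usage with made-up sequences (not real genomes).
toy_models = train({"toy_a": "ACGT" * 10, "toy_b": "GGCC" * 10}, k=3)
print(classify("ACGTACG", toy_models, k=3))
```

Scores are summed in log space to avoid numerical underflow when a read contains many k-mers.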

SUBMITTER: Rosen GL 

PROVIDER: S-EPMC3284397 | biostudies-literature | 2012

REPOSITORIES: biostudies-literature

Publications

NBC update: The addition of viral and fungal databases to the Naïve Bayes classification tool.

Gail L Rosen, Tze Yee Lim

BMC Research Notes, 2012-01-31


Similar Datasets

| S-EPMC8249850 | biostudies-literature
| S-EPMC3439675 | biostudies-literature
| S-EPMC6480413 | biostudies-literature
| S-EPMC4662880 | biostudies-literature
| S-EPMC1698579 | biostudies-literature
| S-EPMC4219333 | biostudies-other
| S-EPMC7277995 | biostudies-literature
| S-EPMC4117951 | biostudies-literature
| S-EPMC1635426 | biostudies-literature
| S-EPMC5860388 | biostudies-literature