Unknown

Dataset Information

0

NBC update: The addition of viral and fungal databases to the Naive Bayes classification tool.


ABSTRACT:

Background

Classifying the fungal and viral content of a sample is an important component of analyzing microbial communities in environmental media. Therefore, a method to classify any fragment from these organisms' DNA should be implemented.

Results

We update the näive Bayes classification (NBC) tool to classify reads originating from viral and fungal organisms. NBC classifies a fungal dataset similarly to Basic Local Alignment Search Tool (BLAST) and the Ribosomal Database Project (RDP) classifier. We also show NBC's similarities and differences to RDP on a fungal large subunit (LSU) ribosomal DNA dataset. For viruses in the training database, strain classification accuracy is 98%, while for those reads originating from sequences not in the database, the order-level accuracy is 78%, where order indicates the taxonomic level in the tree of life.

Conclusions

In addition to being competitive to other classifiers available, NBC has the potential to handle reads originating from any location in the genome. We recommend using the Bacteria/Archaea, Fungal, and Virus databases separately due to algorithmic biases towards long genomes. The tool is publicly available at: http://nbc.ece.drexel.edu.

SUBMITTER: Rosen GL 

PROVIDER: S-EPMC3284397 | biostudies-literature | 2012 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

NBC update: The addition of viral and fungal databases to the Naïve Bayes classification tool.

Rosen Gail L GL   Lim Tze Yee TY  

BMC research notes 20120131


<h4>Background</h4>Classifying the fungal and viral content of a sample is an important component of analyzing microbial communities in environmental media. Therefore, a method to classify any fragment from these organisms' DNA should be implemented.<h4>Results</h4>We update the näive Bayes classification (NBC) tool to classify reads originating from viral and fungal organisms. NBC classifies a fungal dataset similarly to Basic Local Alignment Search Tool (BLAST) and the Ribosomal Database Proje  ...[more]

Similar Datasets

| S-EPMC8249850 | biostudies-literature
| S-EPMC11522871 | biostudies-literature
| S-EPMC3439675 | biostudies-literature
| S-EPMC6480413 | biostudies-literature
| S-EPMC4662880 | biostudies-literature
| S-EPMC1698579 | biostudies-literature
| S-EPMC7277995 | biostudies-literature
| S-EPMC4117951 | biostudies-literature
| S-EPMC4219333 | biostudies-literature
| S-EPMC1635426 | biostudies-literature