Unknown

Dataset Information

0

BioCluster: tool for identification and clustering of Enterobacteriaceae based on biochemical data.


ABSTRACT: Presumptive identification of different Enterobacteriaceae species is routinely achieved based on biochemical properties. Traditional practice includes manual comparison of each biochemical property of the unknown sample with known reference samples and inference of its identity based on the maximum similarity pattern with the known samples. This process is labor-intensive, time-consuming, error-prone, and subjective. Therefore, automation of sorting and similarity in calculation would be advantageous. Here we present a MATLAB-based graphical user interface (GUI) tool named BioCluster. This tool was designed for automated clustering and identification of Enterobacteriaceae based on biochemical test results. In this tool, we used two types of algorithms, i.e., traditional hierarchical clustering (HC) and the Improved Hierarchical Clustering (IHC), a modified algorithm that was developed specifically for the clustering and identification of Enterobacteriaceae species. IHC takes into account the variability in result of 1-47 biochemical tests within this Enterobacteriaceae family. This tool also provides different options to optimize the clustering in a user-friendly way. Using computer-generated synthetic data and some real data, we have demonstrated that BioCluster has high accuracy in clustering and identifying enterobacterial species based on biochemical test data. This tool can be freely downloaded at http://microbialgen.du.ac.bd/biocluster/.

SUBMITTER: Abdullah A 

PROVIDER: S-EPMC4563349 | biostudies-literature | 2015 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

BioCluster: tool for identification and clustering of Enterobacteriaceae based on biochemical data.

Abdullah Ahmed A   Sabbir Alam S M SM   Sultana Munawar M   Hossain M Anwar MA  

Genomics, proteomics & bioinformatics 20150601 3


Presumptive identification of different Enterobacteriaceae species is routinely achieved based on biochemical properties. Traditional practice includes manual comparison of each biochemical property of the unknown sample with known reference samples and inference of its identity based on the maximum similarity pattern with the known samples. This process is labor-intensive, time-consuming, error-prone, and subjective. Therefore, automation of sorting and similarity in calculation would be advant  ...[more]

Similar Datasets

| S-EPMC6902487 | biostudies-literature
| S-EPMC7056916 | biostudies-literature
| S-EPMC4489218 | biostudies-literature
| S-EPMC6022753 | biostudies-literature
| S-EPMC2823712 | biostudies-literature
| S-EPMC7446192 | biostudies-literature
| S-EPMC2697633 | biostudies-literature
| S-EPMC6030426 | biostudies-literature
2020-10-09 | GSE158683 | GEO
| S-EPMC5828440 | biostudies-literature