Unknown

Dataset Information

0

GSTaxClassifier: a genomic signature based taxonomic classifier for metagenomic data analysis.


ABSTRACT: GSTaxClassifier (Genomic Signature based Taxonomic Classifier) is a program for metagenomics analysis of shotgun DNA sequences. The program includes a simple but effective algorithm, a modification of the Bayesian method, to predict the most probable genomic origins of sequences at different taxonomical ranks, on the basis of genome databases;a function to generate genomic profiles of reference sequences with tri-, tetra-, penta-, and hexa-nucleotide motifs for setting a user-defined database; two different formats (tabular- and tree-based summaries) to display taxonomic predictions with improved analytical methods; and effective ways to retrieve, search, and summarize results by integrating the predictions into the NCBI tree-based taxonomic information.GSTaxClassifier takes input nucleotide sequences and using a modified Bayesian model evaluates the genomic signatures between metagenomic query sequences and reference genome databases. The simulation studies of a numerical data sets showed that GSTaxClassifier could serve as a useful program for metagenomics studies, which is freely available at http://helix2.biotech.ufl.edu:26878/metagenomics/.

SUBMITTER: Yu F 

PROVIDER: S-EPMC2770370 | biostudies-literature | 2009 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

GSTaxClassifier: a genomic signature based taxonomic classifier for metagenomic data analysis.

Yu Fahong F   Sun Yijun Y   Liu Li L   Farmerie William W  

Bioinformation 20090820 1


GSTaxClassifier (Genomic Signature based Taxonomic Classifier) is a program for metagenomics analysis of shotgun DNA sequences. The program includes a simple but effective algorithm, a modification of the Bayesian method, to predict the most probable genomic origins of sequences at different taxonomical ranks, on the basis of genome databases;a function to generate genomic profiles of reference sequences with tri-, tetra-, penta-, and hexa-nucleotide motifs for setting a user-defined database; t  ...[more]

Similar Datasets

| S-EPMC4005636 | biostudies-literature
| S-EPMC8266618 | biostudies-literature
| S-EPMC8921650 | biostudies-literature
| S-EPMC9651046 | biostudies-literature
| S-EPMC10059269 | biostudies-literature
| S-EPMC4993507 | biostudies-literature
| S-EPMC6069770 | biostudies-literature
| S-EPMC3413139 | biostudies-literature
| S-EPMC4315456 | biostudies-literature
| S-EPMC4309676 | biostudies-literature