
Dataset Information

CLARK: fast and accurate classification of metagenomic and genomic sequences using discriminative k-mers.


ABSTRACT:

Background

The problem of supervised DNA sequence classification arises in several fields of computational molecular biology. Although this problem has been extensively studied, it remains computationally challenging due to the size of the datasets that modern sequencing technologies can produce.

Results

We introduce CLARK, a novel approach to classify metagenomic reads at the species or genus level with high accuracy and high speed. Extensive experimental results on various metagenomic samples show that the classification accuracy of CLARK is better than or comparable to that of the best state-of-the-art tools, and that it is significantly faster than any of its competitors. In its fastest single-threaded mode, CLARK classifies, with high accuracy, about 32 million metagenomic short reads per minute. CLARK can also classify BAC clones or transcripts to chromosome arms and centromeric regions.

Conclusions

CLARK is a versatile, fast and accurate sequence classification method, especially useful for metagenomics and genomics applications. It is freely available at http://clark.cs.ucr.edu/.
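
The title refers to classification with discriminative k-mers. As a rough illustration of that general idea, and not the CLARK implementation itself, the Python sketch below keeps only the k-mers that occur in exactly one target and assigns a read to the target that collects the most hits. The function names, the toy sequences, and the choice k=4 are all assumptions made for this example. The sketch also ignores reverse complements, confidence scores, and memory-efficient storage.

```python
from collections import Counter, defaultdict


def kmers(seq, k):
    """Yield every length-k substring of seq, uppercased."""
    seq = seq.upper()
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


def build_discriminative_index(targets, k):
    """Map each k-mer seen in exactly one target to that target's label.

    `targets` is a dict: label -> list of reference sequences (hypothetical layout).
    k-mers shared by two or more targets are dropped as non-discriminative.
    """
    owners = defaultdict(set)
    for label, sequences in targets.items():
        for seq in sequences:
            for kmer in kmers(seq, k):
                owners[kmer].add(label)
    return {
        kmer: next(iter(labels))
        for kmer, labels in owners.items()
        if len(labels) == 1
    }


def classify(read, index, k):
    """Return the label with the most discriminative k-mer hits, or None.

    Ties are broken by whichever label was hit first (a simplification).
    """
    hits = Counter(index[kmer] for kmer in kmers(read, k) if kmer in index)
    if not hits:
        return None
    return hits.most_common(1)[0][0]


# Toy references (made up for illustration); both end in the shared k-mer TGGA.
targets = {
    "species_A": ["ACGTACGTGGA"],
    "species_B": ["TTGACCATGGA"],
}
index = build_discriminative_index(targets, k=4)

read_a = classify("CGTACGTG", index, k=4)
read_b = classify("GACCATG", index, k=4)
read_unknown = classify("GGGGGG", index, k=4)
print(read_a, read_b, read_unknown)
```

In this toy setup the shared k-mer TGGA is excluded from the index, so it contributes no evidence to either target; a read made up only of such shared or unseen k-mers is left unclassified.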

SUBMITTER: Ounit R 

PROVIDER: S-EPMC4428112 | biostudies-literature | 2015 Mar

REPOSITORIES: biostudies-literature

Publications

CLARK: fast and accurate classification of metagenomic and genomic sequences using discriminative k-mers.

Ounit R, Wanamaker S, Close TJ, Lonardi S

BMC Genomics, 2015-03-25


Similar Datasets

| S-EPMC3319535 | biostudies-literature
| S-EPMC7576720 | biostudies-literature
| S-EPMC2957682 | biostudies-literature
| S-EPMC8826084 | biostudies-literature
| S-EPMC3157928 | biostudies-literature
| S-EPMC4828714 | biostudies-literature
| S-EPMC3333187 | biostudies-literature
| S-EPMC6104016 | biostudies-literature
| S-EPMC6395045 | biostudies-literature
| S-EPMC5389551 | biostudies-literature