Unknown

Dataset Information

0

Bambus 2: scaffolding metagenomes.


ABSTRACT:

Motivation

Sequencing projects increasingly target samples from non-clonal sources. In particular, metagenomics has enabled scientists to begin to characterize the structure of microbial communities. The software tools developed for assembling and analyzing sequencing data for clonal organisms are, however, unable to adequately process data derived from non-clonal sources.

Results

We present a new scaffolder, Bambus 2, to address some of the challenges encountered when analyzing metagenomes. Our approach relies on a combination of a novel method for detecting genomic repeats and algorithms that analyze assembly graphs to identify biologically meaningful genomic variants. We compare our software to current assemblers using simulated and real data. We demonstrate that the repeat detection algorithms have higher sensitivity than current approaches without sacrificing specificity. In metagenomic datasets, the scaffolder avoids false joins between distantly related organisms while obtaining long-range contiguity. Bambus 2 represents a first step toward automated metagenomic assembly.

Availability

Bambus 2 is open source and available from http://amos.sf.net.

Contact

mpop@umiacs.umd.edu.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Koren S 

PROVIDER: S-EPMC3198580 | biostudies-literature | 2011 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Bambus 2: scaffolding metagenomes.

Koren Sergey S   Treangen Todd J TJ   Pop Mihai M  

Bioinformatics (Oxford, England) 20110916 21


<h4>Motivation</h4>Sequencing projects increasingly target samples from non-clonal sources. In particular, metagenomics has enabled scientists to begin to characterize the structure of microbial communities. The software tools developed for assembling and analyzing sequencing data for clonal organisms are, however, unable to adequately process data derived from non-clonal sources.<h4>Results</h4>We present a new scaffolder, Bambus 2, to address some of the challenges encountered when analyzing m  ...[more]

Similar Datasets

| S-EPMC314292 | biostudies-literature
| S-EPMC10789090 | biostudies-literature
| PRJEB43177 | ENA
| S-EPMC3150381 | biostudies-literature
| S-EPMC4603742 | biostudies-literature
| S-EPMC10530170 | biostudies-literature
| S-EPMC3166145 | biostudies-literature
| S-EPMC9040807 | biostudies-literature
| S-EPMC3391805 | biostudies-literature
| S-EPMC10708022 | biostudies-literature