Unknown

Dataset Information

0

Increasing the coverage of a metapopulation consensus genome by iterative read mapping and assembly.


ABSTRACT:

Motivation

Most microbial species can not be cultured in the laboratory. Metagenomic sequencing may still yield a complete genome if the sequenced community is enriched and the sequencing coverage is high. However, the complexity in a natural population may cause the enrichment culture to contain multiple related strains. This diversity can confound existing strict assembly programs and lead to a fragmented assembly, which is unnecessary if we have a related reference genome available that can function as a scaffold.

Results

Here, we map short metagenomic sequencing reads from a population of strains to a related reference genome, and compose a genome that captures the consensus of the population's sequences. We show that by iteration of the mapping and assembly procedure, the coverage increases while the similarity with the reference genome decreases. This indicates that the assembly becomes less dependent on the reference genome and approaches the consensus genome of the multi-strain population.

Contact

dutilh@cmbi.ru.nl

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Dutilh BE 

PROVIDER: S-EPMC2781756 | biostudies-literature | 2009 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Increasing the coverage of a metapopulation consensus genome by iterative read mapping and assembly.

Dutilh Bas E BE   Huynen Martijn A MA   Strous Marc M  

Bioinformatics (Oxford, England) 20090619 21


<h4>Motivation</h4>Most microbial species can not be cultured in the laboratory. Metagenomic sequencing may still yield a complete genome if the sequenced community is enriched and the sequencing coverage is high. However, the complexity in a natural population may cause the enrichment culture to contain multiple related strains. This diversity can confound existing strict assembly programs and lead to a fragmented assembly, which is unnecessary if we have a related reference genome available th  ...[more]

Similar Datasets

| S-EPMC4609002 | biostudies-literature
| S-EPMC6776680 | biostudies-literature
| S-EPMC6169397 | biostudies-literature
| S-EPMC8699914 | biostudies-literature
| S-EPMC4920363 | biostudies-literature
| S-EPMC5100563 | biostudies-literature
| S-EPMC4622496 | biostudies-literature
| S-EPMC2884544 | biostudies-literature
| S-EPMC5751067 | biostudies-literature
| S-EPMC3309356 | biostudies-literature