Unknown

Dataset Information

0

Individual genome assembly from complex community short-read metagenomic datasets.


ABSTRACT: Assembling individual genomes from complex community metagenomic data remains a challenging issue for environmental studies. We evaluated the quality of genome assemblies from community short read data (Illumina 100 bp pair-ended sequences) using datasets recovered from freshwater and soil microbial communities as well as in silico simulations. Our analyses revealed that the genome of a single genotype (or species) can be accurately assembled from a complex metagenome when it shows at least about 20 × coverage. At lower coverage, however, the derived assemblies contained a substantial fraction of non-target sequences (chimeras), which explains, at least in part, the higher number of hypothetical genes recovered in metagenomic relative to genomic projects. We also provide examples of how to detect intrapopulation structure in metagenomic datasets and estimate the type and frequency of errors in assembled genes and contigs from datasets of varied species complexity.

SUBMITTER: Luo C 

PROVIDER: S-EPMC3309356 | biostudies-literature | 2012 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Individual genome assembly from complex community short-read metagenomic datasets.

Luo Chengwei C   Tsementzi Despina D   Kyrpides Nikos C NC   Konstantinidis Konstantinos T KT  

The ISME journal 20111027 4


Assembling individual genomes from complex community metagenomic data remains a challenging issue for environmental studies. We evaluated the quality of genome assemblies from community short read data (Illumina 100 bp pair-ended sequences) using datasets recovered from freshwater and soil microbial communities as well as in silico simulations. Our analyses revealed that the genome of a single genotype (or species) can be accurately assembled from a complex metagenome when it shows at least abou  ...[more]

Similar Datasets

| S-EPMC3100316 | biostudies-literature
| S-EPMC4622496 | biostudies-literature
| S-EPMC10079220 | biostudies-literature
| S-EPMC10782538 | biostudies-literature
| S-EPMC6027122 | biostudies-literature
| S-EPMC8440550 | biostudies-literature
| S-EPMC5701471 | biostudies-literature
| S-EPMC9749362 | biostudies-literature
| S-EPMC8812927 | biostudies-literature
| S-EPMC7469296 | biostudies-literature