Dataset Information

Improved assemblies using a source-agnostic pipeline for MetaGenomic Assembly by Merging (MeGAMerge) of contigs.


ABSTRACT: Assembly of metagenomic samples is a very complex process, with algorithms designed to address sequencing platform-specific issues (read length, data volume, and/or community complexity) while also faced with genomes that differ greatly in nucleotide compositional bias and in abundance. To address these issues, we have developed a post-assembly process, MetaGenomic Assembly by Merging (MeGAMerge). We compare this process with the performance of several assemblers, using both real and in silico-generated samples of differing community composition and complexity. MeGAMerge consistently outperforms individual assembly methods, producing larger contigs with an increased number of predicted genes, without replication of data. MeGAMerge contigs are supported by read mapping and contig alignment data for both synthetically derived and real metagenomic data, as well as by gene prediction analyses and similarity searches. MeGAMerge is a flexible method that generates improved metagenome assemblies and can accommodate upcoming sequencing platforms as well as present and future assembly algorithms.

SUBMITTER: Scholz M 

PROVIDER: S-EPMC4180827 | biostudies-other | 2014

REPOSITORIES: biostudies-other

Publications

Improved assemblies using a source-agnostic pipeline for MetaGenomic Assembly by Merging (MeGAMerge) of contigs.

Scholz Matthew, Lo Chien-Chi, Chain Patrick S G

Scientific Reports, 2014-10-01


Similar Datasets

| S-EPMC4053804 | biostudies-literature
2014-02-18 | E-GEOD-55053 | biostudies-arrayexpress
2014-02-18 | GSE55053 | GEO
| S-EPMC7005248 | biostudies-literature
| S-EPMC4545859 | biostudies-literature
| S-EPMC4032850 | biostudies-literature
| S-EPMC5084376 | biostudies-literature
| S-EPMC4749706 | biostudies-literature
| S-EPMC5406902 | biostudies-literature
| S-EPMC6114274 | biostudies-literature