Unknown

Dataset Information

0

Accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes.


ABSTRACT: Metagenomics, the application of shotgun sequencing, facilitates the reconstruction of the genomes of individual species from natural environments. A major challenge in the genome recovery domain is to agglomerate or 'bin' sequences assembled from metagenomic reads into individual groups. Metagenomic binning without consideration of reference sequences enables the comprehensive discovery of new microbial organisms and aids in the microbial genome reconstruction process. Here we present MyCC, an automated binning tool that combines genomic signatures, marker genes and optional contig coverages within one or multiple samples, in order to visualize the metagenomes and to identify the reconstructed genomic fragments. We demonstrate the superior performance of MyCC compared to other binning tools including CONCOCT, GroopM, MaxBin and MetaBAT on both synthetic and real human gut communities with a small sample size (one to 11 samples), as well as on a large metagenome dataset (over 250 samples). Moreover, we demonstrate the visualization of metagenomes in MyCC to aid in the reconstruction of genomes from distinct bins. MyCC is freely available at http://sourceforge.net/projects/sb2nhri/files/MyCC/.

SUBMITTER: Lin HH 

PROVIDER: S-EPMC4828714 | biostudies-literature | 2016 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes.

Lin Hsin-Hung HH   Liao Yu-Chieh YC  

Scientific reports 20160412


Metagenomics, the application of shotgun sequencing, facilitates the reconstruction of the genomes of individual species from natural environments. A major challenge in the genome recovery domain is to agglomerate or 'bin' sequences assembled from metagenomic reads into individual groups. Metagenomic binning without consideration of reference sequences enables the comprehensive discovery of new microbial organisms and aids in the microbial genome reconstruction process. Here we present MyCC, an  ...[more]

Similar Datasets

| S-EPMC3514610 | biostudies-literature
| S-EPMC3123841 | biostudies-other
| S-EPMC6873667 | biostudies-literature
| S-EPMC8175635 | biostudies-literature
| PRJEB19201 | ENA
| S-EPMC6829986 | biostudies-literature
| S-EPMC3213679 | biostudies-literature
| S-EPMC6330020 | biostudies-literature
| S-EPMC3319535 | biostudies-literature