Unknown

Dataset Information

0

Species-level deconvolution of metagenome assemblies with Hi-C-based contact probability maps.


ABSTRACT: Microbial communities consist of mixed populations of organisms, including unknown species in unknown abundances. These communities are often studied through metagenomic shotgun sequencing, but standard library construction methods remove long-range contiguity information; thus, shotgun sequencing and de novo assembly of a metagenome typically yield a collection of contigs that cannot readily be grouped by species. Methods for generating chromatin-level contact probability maps, e.g., as generated by the Hi-C method, provide a signal of contiguity that is completely intracellular and contains both intrachromosomal and interchromosomal information. Here, we demonstrate how this signal can be exploited to reconstruct the individual genomes of microbial species present within a mixed sample. We apply this approach to two synthetic metagenome samples, successfully clustering the genome content of fungal, bacterial, and archaeal species with more than 99% agreement with published reference genomes. We also show that the Hi-C signal can secondarily be used to create scaffolded genome assemblies of individual eukaryotic species present within the microbial community, with higher levels of contiguity than some of the species' published reference genomes.

SUBMITTER: Burton JN 

PROVIDER: S-EPMC4455782 | biostudies-literature | 2014 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Species-level deconvolution of metagenome assemblies with Hi-C-based contact probability maps.

Burton Joshua N JN   Liachko Ivan I   Dunham Maitreya J MJ   Shendure Jay J  

G3 (Bethesda, Md.) 20140522 7


Microbial communities consist of mixed populations of organisms, including unknown species in unknown abundances. These communities are often studied through metagenomic shotgun sequencing, but standard library construction methods remove long-range contiguity information; thus, shotgun sequencing and de novo assembly of a metagenome typically yield a collection of contigs that cannot readily be grouped by species. Methods for generating chromatin-level contact probability maps, e.g., as generat  ...[more]

Similar Datasets

| S-EPMC8883645 | biostudies-literature
| S-EPMC8382278 | biostudies-literature
| S-EPMC5596920 | biostudies-literature
| S-EPMC7203009 | biostudies-literature
| S-EPMC4045339 | biostudies-literature
| S-EPMC5870694 | biostudies-literature
| S-EPMC6866797 | biostudies-literature
| S-EPMC8364283 | biostudies-literature
| S-EPMC10659349 | biostudies-literature
| S-EPMC6391755 | biostudies-literature