Unknown

Dataset Information

0

MetaGen: reference-free learning with multiple metagenomic samples.


ABSTRACT: A major goal of metagenomics is to identify and study the entire collection of microbial species in a set of targeted samples. We describe a statistical metagenomic algorithm that simultaneously identifies microbial species and estimates their abundances without using reference genomes. As a trade-off, we require multiple metagenomic samples, usually ≥10 samples, to get highly accurate binning results. Compared to reference-free methods based primarily on k-mer distributions or coverage information, the proposed approach achieves a higher species binning accuracy and is particularly powerful when sequencing coverage is low. We demonstrated the performance of this new method through both simulation and real metagenomic studies. The MetaGen software is available at https://github.com/BioAlgs/MetaGen .

SUBMITTER: Xing X 

PROVIDER: S-EPMC5627425 | biostudies-other | 2017 Oct

REPOSITORIES: biostudies-other

altmetric image

Publications

MetaGen: reference-free learning with multiple metagenomic samples.

Xing Xin X   Liu Jun S JS   Zhong Wenxuan W  

Genome biology 20171003 1


A major goal of metagenomics is to identify and study the entire collection of microbial species in a set of targeted samples. We describe a statistical metagenomic algorithm that simultaneously identifies microbial species and estimates their abundances without using reference genomes. As a trade-off, we require multiple metagenomic samples, usually ≥10 samples, to get highly accurate binning results. Compared to reference-free methods based primarily on k-mer distributions or coverage informat  ...[more]

Similar Datasets

| S-EPMC3280959 | biostudies-literature
| S-EPMC7545149 | biostudies-literature
| S-EPMC6289653 | biostudies-literature
| S-EPMC7773188 | biostudies-literature
| S-EPMC4237721 | biostudies-literature
| S-EPMC6554296 | biostudies-literature
| S-EPMC6883653 | biostudies-literature
| S-EPMC6357350 | biostudies-literature
| S-EPMC3205936 | biostudies-literature