Unknown

Dataset Information

0

MBMC: An Effective Markov Chain Approach for Binning Metagenomic Reads from Environmental Shotgun Sequencing Projects.


ABSTRACT: Metagenomics is a next-generation omics field currently impacting postgenomic life sciences and medicine. Binning metagenomic reads is essential for the understanding of microbial function, compositions, and interactions in given environments. Despite the existence of dozens of computational methods for metagenomic read binning, it is still very challenging to bin reads. This is especially true for reads from unknown species, from species with similar abundance, and/or from low-abundance species in environmental samples. In this study, we developed a novel taxonomy-dependent and alignment-free approach called MBMC (Metagenomic Binning by Markov Chains). Different from all existing methods, MBMC bins reads by measuring the similarity of reads to the trained Markov chains for different taxa instead of directly comparing reads with known genomic sequences. By testing on more than 24 simulated and experimental datasets with species of similar abundance, species of low abundance, and/or unknown species, we report here that MBMC reliably grouped reads from different species into separate bins. Compared with four existing approaches, we demonstrated that the performance of MBMC was comparable with existing approaches when binning reads from sequenced species, and superior to existing approaches when binning reads from unknown species. MBMC is a pivotal tool for binning metagenomic reads in the current era of Big Data and postgenomic integrative biology. The MBMC software can be freely downloaded at http://hulab.ucf.edu/research/projects/metagenomics/MBMC.html .

SUBMITTER: Wang Y 

PROVIDER: S-EPMC4982950 | biostudies-literature | 2016 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

MBMC: An Effective Markov Chain Approach for Binning Metagenomic Reads from Environmental Shotgun Sequencing Projects.

Wang Ying Y   Hu Haiyan H   Li Xiaoman X  

Omics : a journal of integrative biology 20160722 8


Metagenomics is a next-generation omics field currently impacting postgenomic life sciences and medicine. Binning metagenomic reads is essential for the understanding of microbial function, compositions, and interactions in given environments. Despite the existence of dozens of computational methods for metagenomic read binning, it is still very challenging to bin reads. This is especially true for reads from unknown species, from species with similar abundance, and/or from low-abundance species  ...[more]

Similar Datasets

2021-07-26 | E-MTAB-9189 | biostudies-arrayexpress
2021-07-26 | E-MTAB-9191 | biostudies-arrayexpress
| S-EPMC7506068 | biostudies-literature
| S-EPMC9726812 | biostudies-literature
| S-EPMC8175635 | biostudies-literature
| S-EPMC8277151 | biostudies-literature
| S-EPMC4736482 | biostudies-literature
| S-EPMC3232206 | biostudies-literature
| S-EPMC3424124 | biostudies-literature
| S-EPMC9154269 | biostudies-literature