Unknown

Dataset Information

0

Reconstructing 16S rRNA genes in metagenomic data.


ABSTRACT: Metagenomic data, which contains sequenced DNA reads of uncultured microbial species from environmental samples, provide a unique opportunity to thoroughly analyze microbial species that have never been identified before. Reconstructing 16S ribosomal RNA, a phylogenetic marker gene, is usually required to analyze the composition of the metagenomic data. However, massive volume of dataset, high sequence similarity between related species, skewed microbial abundance and lack of reference genes make 16S rRNA reconstruction difficult. Generic de novo assembly tools are not optimized for assembling 16S rRNA genes. In this work, we introduce a targeted rRNA assembly tool, REAGO (REconstruct 16S ribosomal RNA Genes from metagenOmic data). It addresses the above challenges by combining secondary structure-aware homology search, zproperties of rRNA genes and de novo assembly. Our experimental results show that our tool can correctly recover more rRNA genes than several popular generic metagenomic assembly tools and specially designed rRNA construction tools.The source code of REAGO is freely available at https://github.com/chengyuan/reago.

SUBMITTER: Yuan C 

PROVIDER: S-EPMC4765874 | biostudies-literature | 2015 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Reconstructing 16S rRNA genes in metagenomic data.

Yuan Cheng C   Lei Jikai J   Cole James J   Sun Yanni Y  

Bioinformatics (Oxford, England) 20150601 12


<h4>Unlabelled</h4>Metagenomic data, which contains sequenced DNA reads of uncultured microbial species from environmental samples, provide a unique opportunity to thoroughly analyze microbial species that have never been identified before. Reconstructing 16S ribosomal RNA, a phylogenetic marker gene, is usually required to analyze the composition of the metagenomic data. However, massive volume of dataset, high sequence similarity between related species, skewed microbial abundance and lack of  ...[more]

Similar Datasets

| S-EPMC4547618 | biostudies-literature
| S-EPMC2754607 | biostudies-literature
| S-EPMC7381495 | biostudies-literature
| S-EPMC5079562 | biostudies-literature
| S-EPMC5997914 | biostudies-literature
| S-EPMC9091251 | biostudies-literature
| S-EPMC5856467 | biostudies-literature
| S-EPMC6727555 | biostudies-literature
2020-09-30 | GSE113701 | GEO
| S-EPMC9318310 | biostudies-literature