Ontology highlight
ABSTRACT: Motivation
16S rDNA hypervariable tag sequencing has become the de facto method for accessing microbial diversity. Illumina paired-end sequencing, which produces two separate reads for each DNA fragment, has become the platform of choice for this application. However, when the two reads do not overlap, existing computational pipelines analyze data from read separately and underutilize the information contained in the paired-end reads.Results
We created a workflow known as Illinois Mayo Taxon Organization from RNA Dataset Operations (IM-TORNADO) for processing non-overlapping reads while retaining maximal information content. Using synthetic mock datasets, we show that the use of both reads produced answers with greater correlation to those from full length 16S rDNA when looking at taxonomy, phylogeny, and beta-diversity.Availability and implementation
IM-TORNADO is freely available at http://sourceforge.net/projects/imtornado and produces BIOM format output for cross compatibility with other pipelines such as QIIME, mothur, and phyloseq.
SUBMITTER: Jeraldo P
PROVIDER: S-EPMC4266640 | biostudies-literature | 2014
REPOSITORIES: biostudies-literature
Jeraldo Patricio P Kalari Krishna K Chen Xianfeng X Bhavsar Jaysheel J Mangalam Ashutosh A White Bryan B Nelson Heidi H Kocher Jean-Pierre JP Chia Nicholas N
PloS one 20141215 12
<h4>Motivation</h4>16S rDNA hypervariable tag sequencing has become the de facto method for accessing microbial diversity. Illumina paired-end sequencing, which produces two separate reads for each DNA fragment, has become the platform of choice for this application. However, when the two reads do not overlap, existing computational pipelines analyze data from read separately and underutilize the information contained in the paired-end reads.<h4>Results</h4>We created a workflow known as Illinoi ...[more]