Unknown

Dataset Information

0

TriageTools: tools for partitioning and prioritizing analysis of high-throughput sequencing data.


ABSTRACT: High-throughput sequencing is becoming a popular research tool but carries with it considerable costs in terms of computation time, data storage and bandwidth. Meanwhile, some research applications focusing on individual genes or pathways do not necessitate processing of a full sequencing dataset. Thus, it is desirable to partition a large dataset into smaller, manageable, but relevant pieces. We present a toolkit for partitioning raw sequencing data that includes a method for extracting reads that are likely to map onto pre-defined regions of interest. We show the method can be used to extract information about genes of interest from DNA or RNA sequencing samples in a fraction of the time and disk space required to process and store a full dataset. We report speedup factors between 2.6 and 96, depending on settings and samples used. The software is available at http://www.sourceforge.net/projects/triagetools/.

SUBMITTER: Fimereli D 

PROVIDER: S-EPMC3627586 | biostudies-literature | 2013 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

TriageTools: tools for partitioning and prioritizing analysis of high-throughput sequencing data.

Fimereli Danai D   Detours Vincent V   Konopka Tomasz T  

Nucleic acids research 20130213 7


High-throughput sequencing is becoming a popular research tool but carries with it considerable costs in terms of computation time, data storage and bandwidth. Meanwhile, some research applications focusing on individual genes or pathways do not necessitate processing of a full sequencing dataset. Thus, it is desirable to partition a large dataset into smaller, manageable, but relevant pieces. We present a toolkit for partitioning raw sequencing data that includes a method for extracting reads t  ...[more]

Similar Datasets

| S-EPMC6935493 | biostudies-literature
| S-EPMC6070604 | biostudies-literature
| S-EPMC3965039 | biostudies-literature
| S-EPMC6793853 | biostudies-literature
| S-EPMC4957989 | biostudies-literature
| S-EPMC2825224 | biostudies-literature
| S-EPMC4048240 | biostudies-literature
| S-EPMC3832420 | biostudies-literature
| S-EPMC3991327 | biostudies-literature
| S-EPMC3416827 | biostudies-literature