Unknown

Dataset Information

0

SlopMap: a software application tool for quick and flexible identification of similar sequences using exact k-mer matching.


ABSTRACT: With the advent of Next-Generation (NG) sequencing, it has become possible to sequence a entire genomes quickly and inexpensively. However, in some experiments one only needs to extract and assembly a portion of the sequence reads, for example when performing transcriptome studies, sequencing mitochondrial genomes, or characterizing exomes. With the raw DNA-library of a complete genome it would appear to be a trivial problem to identify reads of interest. But it is not always easy to incorporate well-known tools such as BLAST, BLAT, Bowtie, and SOAP directly into a bioinformatics pipelines before the assembly stage, either due to incompatibility with the assembler's file inputs, or because it is desirable to incorporate information that must be extracted separately. For example, in order to incorporate flowgrams from a Roche 454 sequencer into the Newbler assembler it is necessary to first extract them from the original SFF files. We present SlopMap, a bioinformatics software utility that allows quickly identification similar to the provided reference reads from either Roche 454 or Illumnia DNA library. With simple and intuitive command-line interface along with file output formats compatible to assembly programs, SlopMap can be directly embedded to biological data processing pipeline without any additional programming work. In addition, SlopMap preserves flowgram information needed for Roche 454 assembler.

SUBMITTER: Zhbannikov IY 

PROVIDER: S-EPMC3882202 | biostudies-literature | 2013 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

SlopMap: a software application tool for quick and flexible identification of similar sequences using exact k-mer matching.

Zhbannikov Ilya Y IY   Hunter Samuel S SS   Settles Matthew L ML   Foster James A JA  

Journal of data mining in genomics & proteomics 20130801 3


With the advent of Next-Generation (NG) sequencing, it has become possible to sequence a entire genomes quickly and inexpensively. However, in some experiments one only needs to extract and assembly a portion of the sequence reads, for example when performing transcriptome studies, sequencing mitochondrial genomes, or characterizing exomes. With the raw DNA-library of a complete genome it would appear to be a trivial problem to identify reads of interest. But it is not always easy to incorporate  ...[more]

Similar Datasets

| S-EPMC168949 | biostudies-literature
| S-EPMC4096558 | biostudies-literature
| S-EPMC5501583 | biostudies-literature
| S-EPMC137074 | biostudies-literature
| S-EPMC7824336 | biostudies-literature
| S-EPMC1994674 | biostudies-literature
| S-EPMC1523217 | biostudies-literature
| S-EPMC5013917 | biostudies-literature
| S-EPMC8774748 | biostudies-literature