Unknown

Dataset Information

0

Fuzzysplit: demultiplexing and trimming sequenced DNA with a declarative language.


ABSTRACT: Next-generation sequencing technologies create large, multiplexed DNA sequences that require preprocessing before any further analysis. Part of this preprocessing includes demultiplexing and trimming sequences. Although there are many existing tools that can handle these preprocessing steps, they cannot be easily extended to new sequence schematics when new pipelines are developed. We present Fuzzysplit, a tool that relies on a simple declarative language to describe the schematics of sequences, which makes it incredibly adaptable to different use cases. In this paper, we explain the matching algorithms behind Fuzzysplit and we provide a preliminary comparison of its performance with other well-established tools. Overall, we find that its matching accuracy is comparable to previous tools.

SUBMITTER: Liu D 

PROVIDER: S-EPMC6589082 | biostudies-literature | 2019

REPOSITORIES: biostudies-literature

altmetric image

Publications

Fuzzysplit: demultiplexing and trimming sequenced DNA with a declarative language.

Liu Daniel D  

PeerJ 20190619


Next-generation sequencing technologies create large, multiplexed DNA sequences that require preprocessing before any further analysis. Part of this preprocessing includes demultiplexing and trimming sequences. Although there are many existing tools that can handle these preprocessing steps, they cannot be easily extended to new sequence schematics when new pipelines are developed. We present Fuzzysplit, a tool that relies on a simple declarative language to describe the schematics of sequences,  ...[more]

Similar Datasets

| S-EPMC5207735 | biostudies-literature
| S-EPMC8231307 | biostudies-literature
| S-EPMC8632504 | biostudies-literature
| S-EPMC7180228 | biostudies-literature
| S-EPMC6535646 | biostudies-literature
| S-EPMC3986888 | biostudies-literature
| S-EPMC4342083 | biostudies-literature
| S-EPMC2820518 | biostudies-literature
| S-EPMC9942447 | biostudies-literature
| S-EPMC407849 | biostudies-literature