Ontology highlight
ABSTRACT: Unlabelled
We present a tool suited for searching for many short nucleotide sequences in large databases, allowing for a predefined number of gaps and mismatches. The commandline-driven program implements a non-deterministic automata matching algorithm on a keyword tree of the search strings. Both queries with and without ambiguity codes can be searched. Search time is short for perfect matches, and retrieval time rises exponentially with the number of edits allowed.Availability
The C++ source code for PatMaN is distributed under the GNU General Public License and has been tested on the GNU/Linux operating system. It is available from http://bioinf.eva.mpg.de/patman.Supplementary information
Supplementary data are available at Bioinformatics online.
SUBMITTER: Prufer K
PROVIDER: S-EPMC2718670 | biostudies-literature | 2008 Jul
REPOSITORIES: biostudies-literature
Prüfer Kay K Stenzel Udo U Dannemann Michael M Green Richard E RE Lachmann Michael M Kelso Janet J
Bioinformatics (Oxford, England) 20080508 13
<h4>Unlabelled</h4>We present a tool suited for searching for many short nucleotide sequences in large databases, allowing for a predefined number of gaps and mismatches. The commandline-driven program implements a non-deterministic automata matching algorithm on a keyword tree of the search strings. Both queries with and without ambiguity codes can be searched. Search time is short for perfect matches, and retrieval time rises exponentially with the number of edits allowed.<h4>Availability</h4> ...[more]