ABSTRACT:
SUBMITTER: Iseli C
PROVIDER: S-EPMC1894650 | biostudies-literature | 2007 Jun
REPOSITORIES: biostudies-literature
Christian Iseli, Giovanna Ambrosini, Philipp Bucher, C. Victor Jongeneel
PLoS ONE, 2007 Jun 27, issue 6
Searching for matches between large collections of short (14-30 nucleotides) words and sequence databases comprising full genomes or transcriptomes is a common task in biological sequence analysis. We investigated the performance of simple indexing strategies for handling such tasks and developed two programs, fetchGWI and tagger, that index either the database or the query set. Either strategy outperforms megablast for searches with more than 10,000 probes. FetchGWI is shown to be a versatile t ...