Unknown

Dataset Information

0

A bioinformatician's guide to the forefront of suffix array construction algorithms.


ABSTRACT: The suffix array and its variants are text-indexing data structures that have become indispensable in the field of bioinformatics. With the uninitiated in mind, we provide an accessible exposition of the SA-IS algorithm, which is the state of the art in suffix array construction. We also describe DisLex, a technique that allows standard suffix array construction algorithms to create modified suffix arrays designed to enable a simple form of inexact matching needed to support 'spaced seeds' and 'subset seeds' used in many biological applications.

SUBMITTER: Shrestha AM 

PROVIDER: S-EPMC3956071 | biostudies-literature | 2014 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

A bioinformatician's guide to the forefront of suffix array construction algorithms.

Shrestha Anish Man Singh AM   Frith Martin C MC   Horton Paul P  

Briefings in bioinformatics 20140110 2


The suffix array and its variants are text-indexing data structures that have become indispensable in the field of bioinformatics. With the uninitiated in mind, we provide an accessible exposition of the SA-IS algorithm, which is the state of the art in suffix array construction. We also describe DisLex, a technique that allows standard suffix array construction algorithms to create modified suffix arrays designed to enable a simple form of inexact matching needed to support 'spaced seeds' and '  ...[more]

Similar Datasets

| S-EPMC4123905 | biostudies-literature
| S-EPMC7197101 | biostudies-literature
| S-EPMC9206251 | biostudies-literature
| S-EPMC3320572 | biostudies-literature
| S-EPMC6069885 | biostudies-other
| S-EPMC3031031 | biostudies-literature
| S-EPMC4816028 | biostudies-literature
| S-EPMC5416843 | biostudies-literature
| S-EPMC3829466 | biostudies-literature
| S-EPMC2983088 | biostudies-literature