Unknown

Dataset Information

0

Efficient Boolean implementation of universal sequence maps (bUSM).


ABSTRACT: BACKGROUND:Recently, Almeida and Vinga offered a new approach for the representation of arbitrary discrete sequences, referred to as Universal Sequence Maps (USM), and discussed its applicability to genomic sequence analysis. Their work generalizes and extends Chaos Game Representation (CGR) of DNA for arbitrary discrete sequences. RESULTS:We have considered issues associated with the practical implementation of USMs and offer a variation on the algorithm that: 1) eliminates the overestimation of similar segment lengths, 2) permits the identification of arbitrarily long similar segments in the context of finite word length coordinate representations, 3) uses more computationally efficient operations, and 4) provides a simple conversion for recovering the USM coordinates. Computational performance comparisons and examples are provided. CONCLUSIONS:We have shown that the desirable properties of the USM encoding of nucleotide sequences can be retained in a practical implementation of the algorithm. In addition, the proposed implementation enables determination of local sequence identity at increased speed.

SUBMITTER: Schwacke J 

PROVIDER: S-EPMC137598 | biostudies-literature | 2002 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Efficient Boolean implementation of universal sequence maps (bUSM).

Schwacke John J   Almeida Jonas S JS  

BMC bioinformatics 20021021


<h4>Background</h4>Recently, Almeida and Vinga offered a new approach for the representation of arbitrary discrete sequences, referred to as Universal Sequence Maps (USM), and discussed its applicability to genomic sequence analysis. Their work generalizes and extends Chaos Game Representation (CGR) of DNA for arbitrary discrete sequences.<h4>Results</h4>We have considered issues associated with the practical implementation of USMs and offer a variation on the algorithm that: 1) eliminates the o  ...[more]

Similar Datasets

| S-EPMC7575051 | biostudies-literature
2016-12-21 | GSE92674 | GEO
| S-EPMC7665037 | biostudies-literature
| S-EPMC4614354 | biostudies-literature
| S-EPMC3926364 | biostudies-literature
| S-EPMC3532238 | biostudies-literature
| S-EPMC4810783 | biostudies-literature
| S-EPMC8805959 | biostudies-literature
| S-EPMC1376893 | biostudies-literature
| S-EPMC7687814 | biostudies-literature