Unknown

Dataset Information

0

Probabilistic approaches to alignment with tandem repeats.


ABSTRACT:

Background

Short tandem repeats are ubiquitous in genomic sequences and due to their complex evolutionary history pose a challenge for sequence alignment tools.

Results

To better account for the presence of tandem repeats in pairwise sequence alignments, we propose a simple tractable pair hidden Markov model that explicitly models their presence. Using the framework of gain functions, we design several optimization criteria for decoding this model and describe resulting decoding algorithms, ranging from the traditional Viterbi and posterior decoding to block-based decoding algorithms tailored to our model. We compare the accuracy of individual decoding algorithms on simulated and real data and find that our approach is superior to the classical three-state pair HMM.

Conclusions

Our study illustrates versatility of pair hidden Markov models coupled with appropriate decoding criteria as a modeling tool for capturing complex sequence features.

SUBMITTER: Nanasi M 

PROVIDER: S-EPMC3975930 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC3783189 | biostudies-literature
| S-EPMC1449883 | biostudies-literature
| S-EPMC4151033 | biostudies-literature
| S-EPMC3648361 | biostudies-literature
| S-EPMC3159474 | biostudies-literature
| S-EPMC4307672 | biostudies-literature
| S-EPMC3766767 | biostudies-literature
| S-EPMC6419916 | biostudies-literature
| S-EPMC7549362 | biostudies-literature
2018-03-01 | E-MTAB-6411 | biostudies-arrayexpress