Unknown

Dataset Information

0

Finding functional sequence elements by multiple local alignment.


ABSTRACT: Algorithms that detect and align locally similar regions of biological sequences have the potential to discover a wide variety of functional motifs. Two theoretical contributions to this classic but unsolved problem are presented here: a method to determine the width of the aligned motif automatically; and a technique for calculating the statistical significance of alignments, i.e. an assessment of whether the alignments are stronger than those that would be expected to occur by chance among random, unrelated sequences. Upon exploring variants of the standard Gibbs sampling technique to optimize the alignment, we discovered that simulated annealing approaches perform more efficiently. Finally, we conduct failure tests by applying the algorithm to increasingly difficult test cases, and analyze the manner of and reasons for eventual failure. Detection of transcription factor-binding motifs is limited by the motifs' intrinsic subtlety rather than by inadequacy of the alignment optimization procedure.

SUBMITTER: Frith MC 

PROVIDER: S-EPMC373279 | biostudies-literature | 2004

REPOSITORIES: biostudies-literature

altmetric image

Publications

Finding functional sequence elements by multiple local alignment.

Frith Martin C MC   Hansen Ulla U   Spouge John L JL   Weng Zhiping Z  

Nucleic acids research 20040102 1


Algorithms that detect and align locally similar regions of biological sequences have the potential to discover a wide variety of functional motifs. Two theoretical contributions to this classic but unsolved problem are presented here: a method to determine the width of the aligned motif automatically; and a technique for calculating the statistical significance of alignments, i.e. an assessment of whether the alignments are stronger than those that would be expected to occur by chance among ran  ...[more]

Similar Datasets

| S-EPMC4595117 | biostudies-literature
| S-EPMC2228335 | biostudies-literature
| S-EPMC1636350 | biostudies-literature
| S-EPMC2677745 | biostudies-literature
| S-EPMC101229 | biostudies-literature
| S-EPMC3799466 | biostudies-literature
| S-EPMC2367447 | biostudies-literature