Unknown

Dataset Information

0

Efficient pairwise RNA structure prediction and alignment using sequence alignment constraints.


ABSTRACT: We are interested in the problem of predicting secondary structure for small sets of homologous RNAs, by incorporating limited comparative sequence information into an RNA folding model. The Sankoff algorithm for simultaneous RNA folding and alignment is a basis for approaches to this problem. There are two open problems in applying a Sankoff algorithm: development of a good unified scoring system for alignment and folding and development of practical heuristics for dealing with the computational complexity of the algorithm.We use probabilistic models (pair stochastic context-free grammars, pairSCFGs) as a unifying framework for scoring pairwise alignment and folding. A constrained version of the pairSCFG structural alignment algorithm was developed which assumes knowledge of a few confidently aligned positions (pins). These pins are selected based on the posterior probabilities of a probabilistic pairwise sequence alignment.Pairwise RNA structural alignment improves on structure prediction accuracy relative to single sequence folding. Constraining on alignment is a straightforward method of reducing the runtime and memory requirements of the algorithm. Five practical implementations of the pairwise Sankoff algorithm - this work (Consan), David Mathews' Dynalign, Ian Holmes' Stemloc, Ivo Hofacker's PMcomp, and Jan Gorodkin's FOLDALIGN - have comparable overall performance with different strengths and weaknesses.

SUBMITTER: Dowell RD 

PROVIDER: S-EPMC1579236 | biostudies-literature | 2006 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Efficient pairwise RNA structure prediction and alignment using sequence alignment constraints.

Dowell Robin D RD   Eddy Sean R SR  

BMC bioinformatics 20060904


<h4>Background</h4>We are interested in the problem of predicting secondary structure for small sets of homologous RNAs, by incorporating limited comparative sequence information into an RNA folding model. The Sankoff algorithm for simultaneous RNA folding and alignment is a basis for approaches to this problem. There are two open problems in applying a Sankoff algorithm: development of a good unified scoring system for alignment and folding and development of practical heuristics for dealing wi  ...[more]

Similar Datasets

| S-EPMC1868766 | biostudies-literature
| S-EPMC6980424 | biostudies-literature
| S-EPMC2668612 | biostudies-literature
| S-EPMC2850363 | biostudies-literature
| S-EPMC1904245 | biostudies-literature
| S-EPMC1955456 | biostudies-literature
| S-EPMC3465099 | biostudies-literature
| S-EPMC2677745 | biostudies-literature
| S-EPMC2630964 | biostudies-literature
| S-EPMC8430217 | biostudies-literature