Unknown

Dataset Information

0

Fast filtering for RNA homology search.


ABSTRACT: Homology search for RNAs can use secondary structure information to increase power by modeling base pairs, as in covariance models, but the resulting computational costs are high. Typical acceleration strategies rely on at least one filtering stage using sequence-only search.Here we present the multi-segment CYK (MSCYK) filter, which implements a heuristic of ungapped structural alignment for RNA homology search. Compared to gapped alignment, this approximation has lower computation time requirements (O(N?) reduced to O(N³), and space requirements (O(N³) reduced to O(N²). A vector-parallel implementation of this method gives up to 100-fold speed-up; vector-parallel implementations of standard gapped alignment at two levels of precision give 3- and 6-fold speed-ups. These approaches are combined to create a filtering pipeline that scores RNA secondary structure at all stages, with results that are synergistic with existing methods.

SUBMITTER: Kolbe DL 

PROVIDER: S-EPMC3208395 | biostudies-literature | 2011 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Fast filtering for RNA homology search.

Kolbe Diana L DL   Eddy Sean R SR  

Bioinformatics (Oxford, England) 20110928 22


<h4>Motivation</h4>Homology search for RNAs can use secondary structure information to increase power by modeling base pairs, as in covariance models, but the resulting computational costs are high. Typical acceleration strategies rely on at least one filtering stage using sequence-only search.<h4>Results</h4>Here we present the multi-segment CYK (MSCYK) filter, which implements a heuristic of ungapped structural alignment for RNA homology search. Compared to gapped alignment, this approximation  ...[more]

Similar Datasets

2022-03-04 | GSE189259 | GEO
2021-07-21 | GSE179646 | GEO
| S-EPMC3154205 | biostudies-literature
| S-EPMC3716875 | biostudies-literature
| S-EPMC3476332 | biostudies-literature
2024-07-29 | GSE272969 | GEO
| S-EPMC2562014 | biostudies-literature
| S-EPMC2703968 | biostudies-literature
2021-07-21 | GSE179641 | GEO
2021-07-21 | GSE179638 | GEO