Unknown

Dataset Information

0

Efficient known ncRNA search including pseudoknots.


ABSTRACT: BACKGROUND: Searching for members of characterized ncRNA families containing pseudoknots is an important component of genome-scale ncRNA annotation. However, the state-of-the-art known ncRNA search is based on context-free grammar (CFG), which cannot effectively model pseudoknots. Thus, existing CFG-based ncRNA identification tools usually ignore pseudoknots during search. As a result, dozens of sequences that do not contain the native pseudoknots are reported by these tools. When pseudoknot structures are vital to the functions of the ncRNAs, these sequences may not be true members. RESULTS: In this work, we design a pseudoknot search tool using multiple simple sub-structures, which are derived from knot-free and bifurcation-free structural motifs in the underlying family. We test our tool on a contiguous 22-Mb region of the Maize Genome. The experimental results show that our work competes favorably with other pseudoknot search methods. CONCLUSIONS: Our sub-structure based tool can conduct genome-scale pseudoknot-containing ncRNA search effectively and efficiently. It provides a complementary pseudoknot search tool to Infernal. The source codes are available at http://www.cse.msu.edu/~chengy/knotsearch.

SUBMITTER: Yuan C 

PROVIDER: S-EPMC3549841 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

altmetric image

Publications

Efficient known ncRNA search including pseudoknots.

Yuan Cheng C   Sun Yanni Y  

BMC bioinformatics 20130121


<h4>Background</h4>Searching for members of characterized ncRNA families containing pseudoknots is an important component of genome-scale ncRNA annotation. However, the state-of-the-art known ncRNA search is based on context-free grammar (CFG), which cannot effectively model pseudoknots. Thus, existing CFG-based ncRNA identification tools usually ignore pseudoknots during search. As a result, dozens of sequences that do not contain the native pseudoknots are reported by these tools. When pseudok  ...[more]

Similar Datasets

| S-EPMC2896150 | biostudies-literature
| S-EPMC3619282 | biostudies-literature
| S-EPMC3263976 | biostudies-literature
| S-EPMC3307117 | biostudies-literature
| S-EPMC8769711 | biostudies-literature
| S-EPMC3549817 | biostudies-literature
| S-EPMC3311100 | biostudies-literature
| S-EPMC4168714 | biostudies-literature
| S-EPMC1941756 | biostudies-literature
| S-EPMC1160208 | biostudies-literature