Dataset Information

Computational generation and screening of RNA motifs in large nucleotide sequence pools.


ABSTRACT: Although identification of active motifs in large random sequence pools is central to RNA in vitro selection, no systematic computational equivalent of this process has yet been developed. We develop a computational approach that combines target pool generation, motif scanning and motif screening using secondary structure analysis for applications to 10^12-10^14-sequence pools; large pool sizes are made possible using program redesign and supercomputing resources. We use the new protocol to search for aptamer and ribozyme motifs in pools up to experimental pool size (10^14 sequences). We show that motif scanning, structure matching and flanking sequence analysis, respectively, reduce the initial sequence pool by 6-8, 1-2 and 1 orders of magnitude, consistent with the rare occurrence of active motifs in random pools. The final yields match the theoretical yields from probability theory for simple motifs and overestimate experimental yields, which constitute lower bounds, for aptamers because screening analyses beyond secondary structure information are not considered systematically. We also show that designed pools using our nucleotide transition probability matrices can produce higher yields for RNA ligase motifs than random pools. Our methods for generating, analyzing and designing large pools can help improve RNA design via simulation of aspects of in vitro selection.
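
The pool generation, motif scanning and probability-theory comparison described in the abstract can be illustrated with a toy sketch. This is not the authors' program: the function names, the uniform start nucleotide, and the first-order Markov reading of a "nucleotide transition probability matrix" are all assumptions made for illustration, and the matrix values are placeholders rather than the published matrices. Secondary structure screening is omitted entirely.

```python
import random

ALPHABET = "ACGU"

# Hypothetical first-order transition matrix: key = current nucleotide,
# value = probabilities of the next nucleotide in ALPHABET order.
# Uniform rows reproduce an unbiased random pool (placeholder values).
UNIFORM = {n: [0.25, 0.25, 0.25, 0.25] for n in ALPHABET}


def generate_sequence(length, matrix, rng):
    """Draw one sequence (length >= 1) from a first-order Markov chain."""
    seq = [rng.choice(ALPHABET)]  # uniform start nucleotide (assumption)
    while len(seq) < length:
        seq.append(rng.choices(ALPHABET, weights=matrix[seq[-1]])[0])
    return "".join(seq)


def count_occurrences(seq, motif):
    """Count occurrences of a fixed motif, overlapping ones included."""
    k = len(motif)
    return sum(1 for i in range(len(seq) - k + 1) if seq[i:i + k] == motif)


def expected_occurrences(pool_size, length, motif_len):
    """Expected count of one fixed motif in a uniform random pool.

    Each of the (length - motif_len + 1) windows matches with
    probability (1/4) ** motif_len.
    """
    return pool_size * (length - motif_len + 1) * 0.25 ** motif_len


def scan_pool(pool_size, length, motif, matrix, seed=0):
    """Generate a pool on the fly and total the motif hits."""
    rng = random.Random(seed)
    return sum(
        count_occurrences(generate_sequence(length, matrix, rng), motif)
        for _ in range(pool_size)
    )


if __name__ == "__main__":
    motif = "GGAUC"  # arbitrary example motif, not taken from the paper
    observed = scan_pool(5000, 40, motif, UNIFORM)
    expected = expected_occurrences(5000, 40, len(motif))
    print(f"observed={observed} expected={expected:.1f}")
```

For a uniform matrix the observed count should fluctuate around the expected value; replacing `UNIFORM` with a biased matrix is the toy analogue of a designed pool. Pools of 10^12 sequences or more are far beyond what this single-process loop can handle, which is why the abstract mentions program redesign and supercomputing resources.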

SUBMITTER: Kim N 

PROVIDER: S-EPMC2910066 | biostudies-literature | 2010 Jul

REPOSITORIES: biostudies-literature

Publications

Computational generation and screening of RNA motifs in large nucleotide sequence pools.

Namhee Kim, Joseph A. Izzo, Shereef Elmetwaly, Hin Hark Gan, Tamar Schlick

Nucleic Acids Research, 2010 May 6, issue 13


Similar Datasets

S-EPMC8258673 | biostudies-literature
S-EPMC4551918 | biostudies-literature
S-EPMC6547422 | biostudies-literature
S-EPMC1087784 | biostudies-literature
S-EPMC4569568 | biostudies-literature
S-EPMC2567462 | biostudies-literature
S-EPMC6279437 | biostudies-literature
S-EPMC1800515 | biostudies-literature
S-EPMC2893486 | biostudies-literature
S-EPMC4921307 | biostudies-literature