Unknown

Dataset Information

0

A weighted sampling algorithm for the design of RNA sequences with targeted secondary structure and nucleotide distribution.


ABSTRACT: MOTIVATIONS: The design of RNA sequences folding into predefined secondary structures is a milestone for many synthetic biology and gene therapy studies. Most of the current software uses similar local search strategies (i.e. a random seed is progressively adapted to acquire the desired folding properties) and more importantly do not allow the user to control explicitly the nucleotide distribution such as the GC-content in their sequences. However, the latter is an important criterion for large-scale applications as it could presumably be used to design sequences with better transcription rates and/or structural plasticity. RESULTS: In this article, we introduce IncaRNAtion, a novel algorithm to design RNA sequences folding into target secondary structures with a predefined nucleotide distribution. IncaRNAtion uses a global sampling approach and weighted sampling techniques. We show that our approach is fast (i.e. running time comparable or better than local search methods), seedless (we remove the bias of the seed in local search heuristics) and successfully generates high-quality sequences (i.e. thermodynamically stable) for any GC-content. To complete this study, we develop a hybrid method combining our global sampling approach with local search strategies. Remarkably, our glocal methodology overcomes both local and global approaches for sampling sequences with a specific GC-content and target structure. AVAILABILITY: IncaRNAtion is available at csb.cs.mcgill.ca/incarnation/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

SUBMITTER: Reinharz V 

PROVIDER: S-EPMC3694657 | biostudies-other | 2013 Jul

REPOSITORIES: biostudies-other

altmetric image

Publications

A weighted sampling algorithm for the design of RNA sequences with targeted secondary structure and nucleotide distribution.

Reinharz Vladimir V   Ponty Yann Y   Waldispühl Jérôme J  

Bioinformatics (Oxford, England) 20130701 13


<h4>Motivations</h4>The design of RNA sequences folding into predefined secondary structures is a milestone for many synthetic biology and gene therapy studies. Most of the current software uses similar local search strategies (i.e. a random seed is progressively adapted to acquire the desired folding properties) and more importantly do not allow the user to control explicitly the nucleotide distribution such as the GC-content in their sequences. However, the latter is an important criterion for  ...[more]

Similar Datasets

| S-EPMC4956659 | biostudies-literature
| S-EPMC297010 | biostudies-literature
| S-EPMC3042186 | biostudies-literature
| S-EPMC5641303 | biostudies-literature
| S-EPMC434454 | biostudies-literature
| S-EPMC6191713 | biostudies-literature
| S-EPMC2804298 | biostudies-literature
| S-EPMC3117358 | biostudies-literature
| S-EPMC5368421 | biostudies-literature
| S-EPMC4168127 | biostudies-literature