Unknown

Dataset Information

0

Using structural and evolutionary information to detect and correct pyrosequencing errors in noncoding RNAs.


ABSTRACT: The analysis of the sequence-structure relationship in RNA molecules is not only essential for evolutionary studies but also for concrete applications such as error-correction in next generation sequencing (NGS) technologies. The prohibitive sizes of the mutational and conformational landscapes, combined with the volume of data to process, require efficient algorithms to compute sequence-structure properties. In this article, we address the correction of NGS errors by calculating which mutations most increase the likelihood of a sequence to a given structure and RNA family. We introduce RNApyro, an efficient, linear time and space inside-outside algorithm that computes exact mutational probabilities under secondary structure and evolutionary constraints given as a multiple sequence alignment with a consensus structure. We develop a scoring scheme combining classical stacking base-pair energies to novel isostericity scores and apply our techniques to correct pointwise errors in 5s and 16s rRNA sequences. Our results suggest that RNApyro is a promising algorithm to complement existing tools in the NGS error-correction pipeline.

SUBMITTER: Reinharz V 

PROVIDER: S-EPMC3822367 | biostudies-literature | 2013 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Using structural and evolutionary information to detect and correct pyrosequencing errors in noncoding RNAs.

Reinharz Vladimir V   Ponty Yann Y   Waldispühl Jérôme J  

Journal of computational biology : a journal of computational molecular cell biology 20131017 11


The analysis of the sequence-structure relationship in RNA molecules is not only essential for evolutionary studies but also for concrete applications such as error-correction in next generation sequencing (NGS) technologies. The prohibitive sizes of the mutational and conformational landscapes, combined with the volume of data to process, require efficient algorithms to compute sequence-structure properties. In this article, we address the correction of NGS errors by calculating which mutations  ...[more]

Similar Datasets

| S-EPMC5799025 | biostudies-literature
| S-EPMC2828116 | biostudies-literature
| S-EPMC11234323 | biostudies-literature
| S-EPMC3626854 | biostudies-literature
| S-EPMC10480150 | biostudies-literature
| S-EPMC2650297 | biostudies-other
| S-EPMC7092718 | biostudies-literature
| S-EPMC5566112 | biostudies-other
| S-EPMC1567933 | biostudies-literature
| S-EPMC3975061 | biostudies-literature