
Dataset Information


Characterization and visualization of RNA secondary structure Boltzmann ensemble via information theory.


ABSTRACT: BACKGROUND: The nearest neighbor model and associated dynamic programming algorithms allow for the efficient estimation of the RNA secondary structure Boltzmann ensemble. However, because a given RNA secondary structure contains only a fraction of the possible helices that could form from a given sequence, the Boltzmann ensemble is multimodal. Several methods exist for clustering structures and finding those modes. However, less attention has been given to exploring the underlying reason for this multimodality: the presence of conflicting basepairs. Information theory, or more specifically mutual information, provides a method to identify those basepairs that are key to the secondary structure. RESULTS: To this end, we find the most informative basepairs and visualize their effect on the secondary structure. Knowing whether a most informative basepair is present tells us not only the status of that particular pair but also provides a large amount of information about which other pairs are present or absent. We find that a few basepairs account for a large amount of the structural uncertainty. The identification of these pairs indicates small changes to sequence or stability that will have a large effect on structure. CONCLUSION: We provide a novel algorithm that uses mutual information to identify the key basepairs that lead to a multimodal Boltzmann distribution. We then visualize the effect of these pairs on the overall Boltzmann ensemble.
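
As a rough illustration of the idea in the abstract, the sketch below scores each basepair by how much its presence tells us about the other basepairs in a sample of structures. It is not the authors' algorithm: it assumes the ensemble is given as a list of dot-bracket strings (for example, structures sampled from the Boltzmann distribution by some external tool), and it uses the sum of pairwise mutual information with every other basepair as a simplified stand-in for informativeness. The function names `pairs_from_dot_bracket`, `mutual_information`, and `rank_basepairs` are hypothetical.

```python
from collections import Counter
from math import log2


def pairs_from_dot_bracket(structure):
    """Return the set of (i, j) basepairs encoded by a dot-bracket string."""
    stack, pairs = [], set()
    for pos, ch in enumerate(structure):
        if ch == "(":
            stack.append(pos)
        elif ch == ")":
            pairs.add((stack.pop(), pos))
    return frozenset(pairs)


def mutual_information(xs, ys):
    """Empirical mutual information, in bits, between two boolean sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    count_x, count_y = Counter(xs), Counter(ys)
    mi = 0.0
    for (x, y), count in joint.items():
        p_xy = count / n
        # p_xy / (p_x * p_y), written in terms of raw counts
        mi += p_xy * log2(p_xy * n * n / (count_x[x] * count_y[y]))
    return mi


def rank_basepairs(structures):
    """Rank basepairs by summed mutual information with all other basepairs.

    Assumption: this pairwise sum is only a proxy for how informative a
    basepair is about the rest of the structure.
    """
    samples = [pairs_from_dot_bracket(s) for s in structures]
    all_pairs = sorted(set().union(*samples))
    presence = {p: [p in s for s in samples] for p in all_pairs}
    scores = {
        p: sum(
            mutual_information(presence[p], presence[q])
            for q in all_pairs
            if q != p
        )
        for p in all_pairs
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Toy bimodal sample: two mutually exclusive helices, equally represented.
toy_sample = ["((....))", "((....))", "..(())..", "..(()).."]
ranked = rank_basepairs(toy_sample)
```

In the toy sample the two helices never co-occur, so the status of any single basepair determines the status of all the others. This is the kind of conflict between basepairs that the abstract identifies as the source of multimodality.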

SUBMITTER: Lin L 

PROVIDER: S-EPMC5836418 | biostudies-literature | 2018 Mar

REPOSITORIES: biostudies-literature


Publications

Characterization and visualization of RNA secondary structure Boltzmann ensemble via information theory.

Lin L, McKerrow WH, Richards B, Phonsom C, Lawrence CE

BMC Bioinformatics, 2018-03-05, issue 1



Similar Datasets

S-EPMC1370799 | biostudies-literature
S-EPMC4181469 | biostudies-literature
S-EPMC4267672 | biostudies-literature
S-EPMC3549843 | biostudies-literature
S-EPMC5688744 | biostudies-literature
S-EPMC3750279 | biostudies-literature
S-EPMC2874162 | biostudies-literature
S-EPMC5529173 | biostudies-literature
S-EPMC3026369 | biostudies-literature
S-EPMC6881452 | biostudies-literature