Unknown

Dataset Information

0

Fast selection of miRNA candidates based on large-scale pre-computed MFE sets of randomized sequences.


ABSTRACT: BACKGROUND: Small RNAs are important regulators of genome function, yet their prediction in genomes is still a major computational challenge. Statistical analyses of pre-miRNA sequences indicated that their 2D structure tends to have a minimal free energy (MFE) significantly lower than MFE values of equivalently randomized sequences with the same nucleotide composition, in contrast to other classes of non-coding RNA. The computation of many MFEs is, however, too intensive to allow for genome-wide screenings. RESULTS: Using a local grid infrastructure, MFE distributions of random sequences were pre-calculated on a large scale. These distributions follow a normal distribution and can be used to determine the MFE distribution for any given sequence composition by interpolation. It allows on-the-fly calculation of the normal distribution for any candidate sequence composition. CONCLUSION: The speedup achieved makes genome-wide screening with this characteristic of a pre-miRNA sequence practical. Although this particular property alone will not be able to distinguish miRNAs from other sequences sufficiently discriminative, the MFE-based P-value should be added to the parameters of choice to be included in the selection of potential miRNA candidates for experimental verification.

SUBMITTER: Warris S 

PROVIDER: S-EPMC3895842 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

altmetric image

Publications

Fast selection of miRNA candidates based on large-scale pre-computed MFE sets of randomized sequences.

Warris Sven S   Boymans Sander S   Muiser Iwe I   Noback Michiel M   Krijnen Wim W   Nap Jan-Peter JP  

BMC research notes 20140113


<h4>Background</h4>Small RNAs are important regulators of genome function, yet their prediction in genomes is still a major computational challenge. Statistical analyses of pre-miRNA sequences indicated that their 2D structure tends to have a minimal free energy (MFE) significantly lower than MFE values of equivalently randomized sequences with the same nucleotide composition, in contrast to other classes of non-coding RNA. The computation of many MFEs is, however, too intensive to allow for gen  ...[more]

Similar Datasets

| S-EPMC5029481 | biostudies-literature
| S-EPMC3135579 | biostudies-literature
| S-EPMC3516144 | biostudies-literature
| S-EPMC3535721 | biostudies-literature
| S-EPMC3022630 | biostudies-literature
| S-EPMC4641693 | biostudies-literature
| S-EPMC10251624 | biostudies-literature
| S-EPMC4437040 | biostudies-literature
| S-EPMC2142204 | biostudies-other
| S-EPMC3992564 | biostudies-literature