Unknown

Dataset Information

0

BpRNA: large-scale automated annotation and analysis of RNA secondary structure.


ABSTRACT: While RNA secondary structure prediction from sequence data has made remarkable progress, there is a need for improved strategies for annotating the features of RNA secondary structures. Here, we present bpRNA, a novel annotation tool capable of parsing RNA structures, including complex pseudoknot-containing RNAs, to yield an objective, precise, compact, unambiguous, easily-interpretable description of all loops, stems, and pseudoknots, along with the positions, sequence, and flanking base pairs of each such structural feature. We also introduce several new informative representations of RNA structure types to improve structure visualization and interpretation. We have further used bpRNA to generate a web-accessible meta-database, 'bpRNA-1m', of over 100 000 single-molecule, known secondary structures; this is both more fully and accurately annotated and over 20-times larger than existing databases. We use a subset of the database with highly similar (?90% identical) sequences filtered out to report on statistical trends in sequence, flanking base pairs, and length. Both the bpRNA method and the bpRNA-1m database will be valuable resources both for specific analysis of individual RNA molecules and large-scale analyses such as are useful for updating RNA energy parameters for computational thermodynamic predictions, improving machine learning models for structure prediction, and for benchmarking structure-prediction algorithms.

SUBMITTER: Danaee P 

PROVIDER: S-EPMC6009582 | biostudies-literature | 2018 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

bpRNA: large-scale automated annotation and analysis of RNA secondary structure.

Danaee Padideh P   Rouches Mason M   Wiley Michelle M   Deng Dezhong D   Huang Liang L   Hendrix David D  

Nucleic acids research 20180601 11


While RNA secondary structure prediction from sequence data has made remarkable progress, there is a need for improved strategies for annotating the features of RNA secondary structures. Here, we present bpRNA, a novel annotation tool capable of parsing RNA structures, including complex pseudoknot-containing RNAs, to yield an objective, precise, compact, unambiguous, easily-interpretable description of all loops, stems, and pseudoknots, along with the positions, sequence, and flanking base pairs  ...[more]

Similar Datasets

| S-EPMC2519158 | biostudies-literature
| S-EPMC3288334 | biostudies-literature
| S-EPMC3448840 | biostudies-literature
| S-EPMC7470976 | biostudies-literature
2022-06-15 | GSE148422 | GEO
| S-EPMC5037380 | biostudies-literature
| S-EPMC6317475 | biostudies-literature
| S-EPMC8126727 | biostudies-literature
| S-EPMC6687061 | biostudies-literature
| S-EPMC4560050 | biostudies-literature