Unknown

Dataset Information

0

RNA profiling 2.0: Enhanced cluster analysis of structural ensembles.


ABSTRACT: Understanding the base pairing of an RNA sequence provides insight into its molecular structure. By mining suboptimal sampling data, RNAprofiling 1.0 identifies the dominant helices in low-energy secondary structures as features, organizes them into profiles which partition the Boltzmann sample, and highlights key similarities/differences among the most informative, i.e. selected, profiles in a graphical format. Version 2.0 enhances every step of this approach. First, the featured substructures are expanded from helices to stems. Second, profile selection includes low-frequency pairings similar to featured ones. In conjunction, these updates extend the utility of the method to sequences up to length 600, as evaluated over a sizable dataset. Third, relationships are visualized in a decision tree which highlights the most important structural differences. Finally, this cluster analysis is made accessible to experimental researchers in a portable format as an interactive webpage, permitting a much greater understanding of trade-offs among different possible base pairing combinations.

SUBMITTER: Hurley F 

PROVIDER: S-EPMC10081340 | biostudies-literature | 2023 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

RNA profiling 2.0: Enhanced cluster analysis of structural ensembles.

Hurley Forrest F   Heitsch Christine C  

ArXiv 20230327


Understanding the base pairing of an RNA sequence provides insight into its molecular structure. By mining suboptimal sampling data, RNAprofiling 1.0 identifies the dominant helices in low-energy secondary structures as features, organizes them into profiles which partition the Boltzmann sample, and highlights key similarities/differences among the most informative, i.e. selected, profiles in a graphical format. Version 2.0 enhances every step of this approach. First, the featured substructures  ...[more]

Similar Datasets

| S-EPMC6929272 | biostudies-literature
| S-EPMC308814 | biostudies-literature
| S-EPMC5570192 | biostudies-literature
2023-05-12 | GSE225383 | GEO
| S-EPMC8015854 | biostudies-literature
| S-EPMC9661767 | biostudies-literature
| S-EPMC9004634 | biostudies-literature
| PRJNA935379 | ENA
| S-EPMC8891300 | biostudies-literature
| S-EPMC4755865 | biostudies-literature