Dataset Information

A nonredundant structure dataset for benchmarking protein-RNA computational docking.


ABSTRACT: Protein-RNA interactions play an important role in many biological processes. The ability to predict the molecular structures of protein-RNA complexes from docking would be valuable for understanding the underlying chemical mechanisms. We have developed a novel nonredundant benchmark dataset for protein-RNA docking and scoring. The diverse dataset of 72 targets consists of 52 unbound-unbound test complexes, and 20 unbound-bound test complexes. Here, unbound-unbound complexes refer to cases in which both binding partners of the cocrystallized complex are either in apo form or in a conformation taken from a different protein-RNA complex, whereas unbound-bound complexes are cases in which only one of the two binding partners has another experimentally determined conformation. The dataset is classified into three categories according to the interface root mean square deviation and the percentage of native contacts in the unbound structures: 49 easy, 16 medium, and 7 difficult targets. The bound and unbound cases of the benchmark dataset are expected to benefit the development and improvement of docking and scoring algorithms for the docking community. All the easy-to-view structures are freely available to the public at http://zoulab.dalton.missouri.edu/RNAbenchmark/.
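The composition described in the abstract can be sketched in a few lines of Python. This is an illustrative sketch only, not the authors' code: the counts are taken from the abstract, but the `Cutoffs` class, the `classify` function, and the rule for combining the two criteria are assumptions, and the abstract does not state the actual cutoff values, so they are left for the caller to supply.

```python
from dataclasses import dataclass

# Counts as stated in the abstract.
TOTAL_TARGETS = 72
BY_DOCKING_TYPE = {"unbound-unbound": 52, "unbound-bound": 20}
BY_DIFFICULTY = {"easy": 49, "medium": 16, "difficult": 7}


@dataclass(frozen=True)
class Cutoffs:
    """Hypothetical cutoffs; the abstract does not give the real values."""
    easy_max_irmsd: float    # max interface RMSD (angstroms) for "easy"
    easy_min_fnat: float     # min fraction of native contacts for "easy"
    medium_max_irmsd: float  # max interface RMSD (angstroms) for "medium"
    medium_min_fnat: float   # min fraction of native contacts for "medium"


def classify(irmsd: float, fnat: float, cutoffs: Cutoffs) -> str:
    """Assign a difficulty label from interface RMSD and native-contact fraction.

    Assumption: both criteria must hold for a category to apply. The
    abstract names the two measures but not how they are combined.
    """
    if irmsd <= cutoffs.easy_max_irmsd and fnat >= cutoffs.easy_min_fnat:
        return "easy"
    if irmsd <= cutoffs.medium_max_irmsd and fnat >= cutoffs.medium_min_fnat:
        return "medium"
    return "difficult"


# Both partitions of the dataset should account for every target.
assert sum(BY_DOCKING_TYPE.values()) == TOTAL_TARGETS
assert sum(BY_DIFFICULTY.values()) == TOTAL_TARGETS
```

To use it, construct a `Cutoffs` instance with values taken from the publication itself and call `classify` once per target with that target's interface RMSD and native-contact fraction.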

SUBMITTER: Huang SY 

PROVIDER: S-EPMC3546201 | biostudies-literature | 2013 Feb

REPOSITORIES: biostudies-literature

Publications

A nonredundant structure dataset for benchmarking protein-RNA computational docking.

Sheng-You Huang, Xiaoqin Zou

Journal of Computational Chemistry, 2012 Oct 10, issue 4


Similar Datasets

S-EPMC7447090 | biostudies-literature
S-EPMC10883643 | biostudies-literature
S-EPMC10862857 | biostudies-literature
S-EPMC4489248 | biostudies-literature
S-EPMC7394329 | biostudies-literature
S-EPMC10132383 | biostudies-literature
S-EPMC5994945 | biostudies-literature
S-EPMC3149062 | biostudies-literature
S-EPMC7897250 | biostudies-literature
S-EPMC6584985 | biostudies-other