Dataset Information

HybridRNAbind: prediction of RNA interacting residues across structure-annotated and disorder-annotated proteins.


ABSTRACT: The sequence-based predictors of RNA-binding residues (RBRs) are trained on either structure-annotated or disorder-annotated binding regions. A recent study of predictors of protein-binding residues shows that they are plagued by high levels of cross-predictions (protein-binding residues are predicted as nucleic acid-binding) and that structure-trained predictors perform poorly for the disorder-annotated regions and vice versa. Consequently, we analyze a representative set of the structure-trained and disorder-trained predictors of RBRs to comprehensively assess the quality of their predictions. Our empirical analysis, which relies on a new low-similarity benchmark dataset, reveals that the structure-trained predictors of RBRs perform well for the structure-annotated proteins while the disorder-trained predictors provide accurate results for the disorder-annotated proteins. However, these methods work only modestly well on the opposite types of annotations, motivating the need for new solutions. Using an empirical approach, we design the HybridRNAbind meta-model, which generates accurate predictions and low amounts of cross-predictions when tested on data that combines structure-annotated and disorder-annotated RBRs. We release this meta-model as a convenient webserver, available at https://www.csuligroup.com/hybridRNAbind/.

SUBMITTER: Zhang F 

PROVIDER: S-EPMC10018345 | biostudies-literature | 2023 Mar

REPOSITORIES: biostudies-literature

Publications

HybridRNAbind: prediction of RNA interacting residues across structure-annotated and disorder-annotated proteins.

Fuhao Zhang, Min Li, Jian Zhang, Lukasz Kurgan

Nucleic Acids Research, 2023-03-01, issue 5



Similar Datasets

| S-EPMC4329725 | biostudies-literature
| S-EPMC5796408 | biostudies-literature
| S-EPMC7246089 | biostudies-literature
| S-EPMC2853471 | biostudies-literature
| S-EPMC6389706 | biostudies-literature
| S-EPMC3930195 | biostudies-literature
| S-EPMC3107212 | biostudies-literature
| S-EPMC10570024 | biostudies-literature
| S-EPMC6635445 | biostudies-literature
| S-EPMC4529986 | biostudies-literature