Unknown

Dataset Information

0

BeRBP: binding estimation for human RNA-binding proteins.


ABSTRACT: Identifying binding targets of RNA-binding proteins (RBPs) can greatly facilitate our understanding of their functional mechanisms. Most computational methods employ machine learning to train classifiers on either RBP-specific targets or pooled RBP-RNA interactions. The former strategy is more powerful, but it only applies to a few RBPs with a large number of known targets; conversely, the latter strategy sacrifices prediction accuracy for a wider application, since specific interaction features are inevitably obscured through pooling heterogeneous datasets. Here, we present beRBP, a dual approach to predict human RBP-RNA interaction given PWM of a RBP and one RNA sequence. Based on Random Forests, beRBP not only builds a specific model for each RBP with a decent number of known targets, but also develops a general model for RBPs with limited or null known targets. The specific and general models both compared well with existing methods on three benchmark datasets. Notably, the general model achieved a better performance than existing methods on most novel RBPs. Overall, as a composite solution overarching the RBP-specific and RBP-General strategies, beRBP is a promising tool for human RBP binding estimation with good prediction accuracy and a broad application scope.

SUBMITTER: Yu H 

PROVIDER: S-EPMC6411931 | biostudies-literature | 2019 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

beRBP: binding estimation for human RNA-binding proteins.

Yu Hui H   Wang Jing J   Sheng Quanhu Q   Liu Qi Q   Shyr Yu Y  

Nucleic acids research 20190301 5


Identifying binding targets of RNA-binding proteins (RBPs) can greatly facilitate our understanding of their functional mechanisms. Most computational methods employ machine learning to train classifiers on either RBP-specific targets or pooled RBP-RNA interactions. The former strategy is more powerful, but it only applies to a few RBPs with a large number of known targets; conversely, the latter strategy sacrifices prediction accuracy for a wider application, since specific interaction features  ...[more]

Similar Datasets

| S-EPMC7397871 | biostudies-literature
| S-EPMC6331435 | biostudies-literature
| S-EPMC5786023 | biostudies-literature
| S-EPMC3064767 | biostudies-literature
| S-EPMC7410833 | biostudies-literature
| S-EPMC10511089 | biostudies-literature
| S-EPMC8284322 | biostudies-literature
| S-EPMC6062212 | biostudies-literature
| S-EPMC7443297 | biostudies-literature
| S-EPMC7898822 | biostudies-literature