Unknown

Dataset Information

0

PredPRBA: Prediction of Protein-RNA Binding Affinity Using Gradient Boosted Regression Trees.


ABSTRACT: Protein-RNA interactions play essential roles in many biological aspects. Quantifying the binding affinity of protein-RNA complexes is helpful to the understanding of protein-RNA recognition mechanisms and identification of strong binding partners. Due to experimentally measured protein-RNA binding affinity data available is still limited to date, there is a pressing demand for accurate and reliable computational approaches. In this paper, we propose a computational approach, PredPRBA, which can effectively predict protein-RNA binding affinity using gradient boosted regression trees. We build a dataset of protein-RNA binding affinity that includes 103 protein-RNA complex structures manually collected from related literature. Then, we generate 37 kinds of sequence and structural features and explore the relationship between the features and protein-RNA binding affinity. We find that the binding affinity mainly depends on the structure of RNA molecules. According to the type of RNA associated with proteins composed of the protein-RNA complex, we split the 103 protein-RNA complexes into six categories. For each category, we build a gradient boosted regression tree (GBRT) model based on the generated features. We perform a comprehensive evaluation for the proposed method on the binding affinity dataset using leave-one-out cross-validation. We show that PredPRBA achieves correlations ranging from 0.723 to 0.897 among six categories, which is significantly better than other typical regression methods and the pioneer protein-RNA binding affinity predictor SPOT-Seq-RNA. In addition, a user-friendly web server has been developed to predict the binding affinity of protein-RNA complexes. The PredPRBA webserver is freely available at http://PredPRBA.denglab.org/.

SUBMITTER: Deng L 

PROVIDER: S-EPMC6688581 | biostudies-literature | 2019

REPOSITORIES: biostudies-literature

altmetric image

Publications

PredPRBA: Prediction of Protein-RNA Binding Affinity Using Gradient Boosted Regression Trees.

Deng Lei L   Yang Wenyi W   Liu Hui H  

Frontiers in genetics 20190802


Protein-RNA interactions play essential roles in many biological aspects. Quantifying the binding affinity of protein-RNA complexes is helpful to the understanding of protein-RNA recognition mechanisms and identification of strong binding partners. Due to experimentally measured protein-RNA binding affinity data available is still limited to date, there is a pressing demand for accurate and reliable computational approaches. In this paper, we propose a computational approach, PredPRBA, which can  ...[more]

Similar Datasets

| S-EPMC7944598 | biostudies-literature
| S-EPMC8674730 | biostudies-literature
| S-EPMC9352200 | biostudies-literature
| S-EPMC6486753 | biostudies-literature
| S-EPMC9302655 | biostudies-literature
| S-EPMC7319248 | biostudies-literature
| S-EPMC6811674 | biostudies-literature
| S-EPMC5786232 | biostudies-literature
| S-EPMC6859855 | biostudies-literature
| S-EPMC3937850 | biostudies-literature