Dataset Information

A structure-based model for the prediction of protein-RNA binding affinity.

ABSTRACT: Protein-RNA recognition is highly affinity-driven and regulates a wide array of cellular functions. In this study, we have curated a binding affinity data set of 40 protein-RNA complexes, for which at least one unbound partner is available in the docking benchmark. The data set covers a wide affinity range of eight orders of magnitude as well as four different structural classes. On average, we find the complexes with single-stranded RNA have the highest affinity, whereas the complexes with the duplex RNA have the lowest. Nevertheless, free energy gain upon binding is the highest for the complexes with ribosomal proteins and the lowest for the complexes with tRNA with an average of -5.7 cal/mol/Å² in the entire data set. We train regression models to predict the binding affinity from the structural and physicochemical parameters of protein-RNA interfaces. The best fit model with the lowest maximum error is provided with three interface parameters: relative hydrophobicity, conformational change upon binding and relative hydration pattern. This model has been used for predicting the binding affinity on a test data set, generated using mutated structures of yeast aspartyl-tRNA synthetase, for which experimentally determined ΔG values of 40 mutations are available. The predicted ΔG_empirical values highly correlate with the experimental observations. The data set provided in this study should be useful for further development of the binding affinity prediction methods. Moreover, the model developed in this study enhances our understanding on the structural basis of protein-RNA binding affinity and provides a platform to engineer protein-RNA interfaces with desired affinity.

SUBMITTER: Nithin C

PROVIDER: S-EPMC6859855 | biostudies-literature | 2019 Dec

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

A structure-based model for the prediction of protein-RNA binding affinity.

Nithin Chandran C Mukherjee Sunandan S Bahadur Ranjit Prasad RP

RNA (New York, N.Y.) 20190808 12

Protein-RNA recognition is highly affinity-driven and regulates a wide array of cellular functions. In this study, we have curated a binding affinity data set of 40 protein-RNA complexes, for which at least one unbound partner is available in the docking benchmark. The data set covers a wide affinity range of eight orders of magnitude as well as four different structural classes. On average, we find the complexes with single-stranded RNA have the highest affinity, whereas the complexes with the ...[more]

PMID: 31395671

Dataset Information

A structure-based model for the prediction of protein-RNA binding affinity.

Publications

A structure-based model for the prediction of protein-RNA binding affinity.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Structure-based protein-ligand interaction fingerprints for binding affinity prediction.
| S-EPMC8637032 | biostudies-literature

Structure-based, deep-learning models for protein-ligand binding affinity prediction.
| S-EPMC10765576 | biostudies-literature

SPOT-Seq-RNA: predicting protein-RNA complex structure and RNA-binding function by fold recognition and binding affinity prediction.
| S-EPMC3937850 | biostudies-literature

Blind tests of RNA-protein binding affinity prediction.
| S-EPMC6486753 | biostudies-literature

Contacts-based prediction of binding affinity in protein-protein complexes.
| S-EPMC4523921 | biostudies-literature

RNA Binding Protein-Based Model for Prognostic Prediction of Colorectal Cancer.
| S-EPMC8182183 | biostudies-literature

A structure-based benchmark for protein-protein binding affinity.
| S-EPMC3064828 | biostudies-literature

3pHLA-score improves structure-based peptide-HLA binding affinity prediction.
| S-EPMC9232595 | biostudies-literature

Structure-aware deep model for MHC-II peptide binding affinity prediction.
| S-EPMC10826266 | biostudies-literature

A linear model for transcription factor binding affinity prediction in protein binding microarrays.
| S-EPMC3102690 | biostudies-literature