Dataset Information

Incorporating structural similarity into a scoring function to enhance the prediction of binding affinities.

ABSTRACT: In this study, we developed a novel algorithm to improve the screening performance of an arbitrary docking scoring function by recalibrating the docking score of a query compound based on its structure similarity with a set of training compounds, while the extra computational cost is neglectable. Two popular docking methods, Glide and AutoDock Vina were adopted as the original scoring functions to be processed with our new algorithm and similar improvement performance was achieved. Predicted binding affinities were compared against experimental data from ChEMBL and DUD-E databases. 11 representative drug receptors from diverse drug target categories were applied to evaluate the hybrid scoring function. The effects of four different fingerprints (FP2, FP3, FP4, and MACCS) and the four different compound similarity effect (CSE) functions were explored. Encouragingly, the screening performance was significantly improved for all 11 drug targets especially when CSE = S⁴ (S is the Tanimoto structural similarity) and FP2 fingerprint were applied. The average predictive index (PI) values increased from 0.34 to 0.66 and 0.39 to 0.71 for the Glide and AutoDock vina scoring functions, respectively. To evaluate the performance of the calibration algorithm in drug lead identification, we also imposed an upper limit on the structural similarity to mimic the real scenario of screening diverse libraries for which query ligands are general-purpose screening compounds and they are not necessarily structurally similar to reference ligands. Encouragingly, we found our hybrid scoring function still outperformed the original docking scoring function. The hybrid scoring function was further evaluated using external datasets for two systems and we found the PI values increased from 0.24 to 0.46 and 0.14 to 0.42 for A2AR and CFX systems, respectively. In a conclusion, our calibration algorithm can significantly improve the virtual screening performance in both drug lead optimization and identification phases with neglectable computational cost.

SUBMITTER: Ji B

PROVIDER: S-EPMC7884591 | biostudies-literature | 2021 Feb

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Incorporating structural similarity into a scoring function to enhance the prediction of binding affinities.

Ji Beihong B He Xibing X Zhang Yuzhao Y Zhai Jingchen J Man Viet Hoang VH Liu Shuhan S Wang Junmei J

Journal of cheminformatics 20210215 1

In this study, we developed a novel algorithm to improve the screening performance of an arbitrary docking scoring function by recalibrating the docking score of a query compound based on its structure similarity with a set of training compounds, while the extra computational cost is neglectable. Two popular docking methods, Glide and AutoDock Vina were adopted as the original scoring functions to be processed with our new algorithm and similar improvement performance was achieved. Predicted bin ...[more]

PMID: 33588902

Dataset Information

Incorporating structural similarity into a scoring function to enhance the prediction of binding affinities.

Publications

Incorporating structural similarity into a scoring function to enhance the prediction of binding affinities.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Predicting peptide binding affinities to MHC molecules using a modified semi-empirical scoring function.
| S-EPMC3178607 | biostudies-literature

Protein binding site prediction using an empirical scoring function.
| S-EPMC1540721 | biostudies-literature

CSM-carbohydrate: protein-carbohydrate binding affinity prediction and docking scoring function.
| S-EPMC8769910 | biostudies-literature

Spec2Vec: Improved mass spectral similarity scoring through learning of structural relationships.
| S-EPMC7909622 | biostudies-literature

The Impact of Protein Structure and Sequence Similarity on the Accuracy of Machine-Learning Scoring Functions for Binding Affinity Prediction.
| S-EPMC5871981 | biostudies-literature

Blind prediction of charged ligand binding affinities in a model binding site.
| S-EPMC3962782 | biostudies-literature

Structure-based protocol for identifying mutations that enhance protein-protein binding affinities.
| S-EPMC2682327 | biostudies-literature

Improved Prediction of Ligand-Protein Binding Affinities by Meta-modeling.
| S-EPMC11632770 | biostudies-literature

UPF201 archaeal specific family members reveal structural similarity to RNA-binding proteins but low likelihood for RNA-binding function.
| S-EPMC2596488 | biostudies-literature

Wei2GO: weighted sequence similarity-based protein function prediction.
| S-EPMC8855713 | biostudies-literature