Unknown

Dataset Information

0

Comparison of Data Fusion Methods as Consensus Scores for Ensemble Docking.


ABSTRACT: Ensemble docking is a widely applied concept in structure-based virtual screening-to at least partly account for protein flexibility-usually granting a significant performance gain at a modest cost of speed. From the individual, single-structure docking scores, a consensus score needs to be produced by data fusion: this is usually done by taking the best docking score from the available pool (in most cases- and in this study as well-this is the minimum score). Nonetheless, there are a number of other fusion rules that can be applied. We report here the results of a detailed statistical comparison of seven fusion rules for ensemble docking, on five case studies of current drug targets, based on four performance metrics. Sevenfold cross-validation and variance analysis (ANOVA) allowed us to highlight the best fusion rules. The results are presented in bubble plots, to unite the four performance metrics into a single, comprehensive image. Notably, we suggest the use of the geometric and harmonic means as better alternatives to the generally applied minimum fusion rule.

SUBMITTER: Bajusz D 

PROVIDER: S-EPMC6695709 | biostudies-literature | 2019 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Comparison of Data Fusion Methods as Consensus Scores for Ensemble Docking.

Bajusz Dávid D   Rácz Anita A   Héberger Károly K  

Molecules (Basel, Switzerland) 20190724 15


Ensemble docking is a widely applied concept in structure-based virtual screening-to at least partly account for protein flexibility-usually granting a significant performance gain at a modest cost of speed. From the individual, single-structure docking scores, a consensus score needs to be produced by data fusion: this is usually done by taking the best docking score from the available pool (in most cases- and in this study as well-this is the minimum score). Nonetheless, there are a number of  ...[more]

Similar Datasets

| S-EPMC6435795 | biostudies-literature
| S-EPMC8169699 | biostudies-literature
| S-EPMC3337782 | biostudies-literature
| S-EPMC4482624 | biostudies-literature
| S-EPMC3031034 | biostudies-other
| S-EPMC8748946 | biostudies-literature
| S-EPMC1618413 | biostudies-other
| S-EPMC2776026 | biostudies-literature
| S-EPMC3600029 | biostudies-literature
| S-EPMC4003394 | biostudies-other