Dataset Information

Efficiency of Stratification for Ensemble Docking Using Reduced Ensembles.


ABSTRACT: Molecular docking can account for receptor flexibility by combining the docking score over multiple rigid receptor conformations, such as snapshots from a molecular dynamics simulation. Here, we evaluate a number of common snapshot selection strategies using a quality metric from stratified sampling, the efficiency of stratification, which compares the variance of a selection strategy to simple random sampling. We also extend the metric to estimators of exponential averages (which involve an exponential transformation, averaging, and inverse transformation) and minima. For docking sets of over 500 ligands to four different proteins of varying flexibility, we observe that, for estimating ensemble averages and exponential averages, many clustering algorithms have similar performance trends: for a few snapshots (less than 25), medoids are the most efficient, while, for a larger number, optimal (the allocation that minimizes the variance) and proportional (to the size of each cluster) allocation become more efficient. Proportional allocation appears to be the most consistently efficient for estimating minima.
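
The comparison described in the abstract can be illustrated with a minimal sketch using textbook stratified-sampling formulas. It assumes the efficiency is the ratio of the simple-random-sampling variance to the stratified variance of the mean estimator, ignores finite-population corrections, and allows fractional per-cluster sample sizes. The paper's exact definition may differ. The function name `stratification_efficiency` and the toy scores are hypothetical and not taken from the dataset.

```python
from statistics import pvariance, pstdev


def stratification_efficiency(strata, allocation="proportional"):
    """Return Var_SRS / Var_stratified for estimating the mean score.

    strata: list of clusters, each a list of per-snapshot docking scores.
    Assumptions (not from the source): no finite-population correction,
    fractional per-cluster sample sizes allowed. Under these assumptions
    both variances scale as 1/n, so the ratio does not depend on n.
    """
    values = [x for cluster in strata for x in cluster]
    weights = [len(cluster) / len(values) for cluster in strata]

    # Simple random sampling: n * Var = overall population variance.
    srs = pvariance(values)

    if allocation == "proportional":
        # n_h proportional to cluster size: n * Var = sum_h W_h * S_h^2
        strat = sum(w * pvariance(c) for w, c in zip(weights, strata))
    elif allocation == "optimal":
        # n_h proportional to W_h * S_h: n * Var = (sum_h W_h * S_h)^2
        strat = sum(w * pstdev(c) for w, c in zip(weights, strata)) ** 2
    else:
        raise ValueError(f"unknown allocation: {allocation!r}")

    return srs / strat


# Toy example: two clusters of made-up scores.
toy = [[1.0, 3.0], [10.0, 14.0]]
eff_prop = stratification_efficiency(toy, "proportional")
eff_opt = stratification_efficiency(toy, "optimal")
print(eff_prop, eff_opt)
```

A ratio above 1 means the stratified selection has lower variance than simple random sampling at the same number of snapshots. In this sketch, optimal allocation is never less efficient than proportional allocation, which follows from the two formulas in the comments.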

SUBMITTER: Xie B 

PROVIDER: S-EPMC6338335 | biostudies-literature | 2018 Sep

REPOSITORIES: biostudies-literature

Publications

Efficiency of Stratification for Ensemble Docking Using Reduced Ensembles.

Xie Bing B, Clark John D JD, Minh David D L DDL

Journal of chemical information and modeling 20180829 9


Similar Datasets

| S-EPMC2143983 | biostudies-other
| S-EPMC3021968 | biostudies-literature
| S-EPMC7815257 | biostudies-literature
| S-EPMC8748946 | biostudies-literature
| S-EPMC2649978 | biostudies-literature
| S-EPMC2762758 | biostudies-literature
| S-EPMC2881208 | biostudies-literature
| S-EPMC2891173 | biostudies-literature
| S-EPMC2573042 | biostudies-literature
| S-EPMC6435795 | biostudies-literature