Unknown

Dataset Information

0

Quantitative surface field analysis: learning causal models to predict ligand binding affinity and pose.


ABSTRACT: We introduce the QuanSA method for inducing physically meaningful field-based models of ligand binding pockets based on structure-activity data alone. The method is closely related to the QMOD approach, substituting a learned scoring field for a pocket constructed of molecular fragments. The problem of mutual ligand alignment is addressed in a general way, and optimal model parameters and ligand poses are identified through multiple-instance machine learning. We provide algorithmic details along with performance results on sixteen structure-activity data sets covering many pharmaceutically relevant targets. In particular, we show how models initially induced from small data sets can extrapolatively identify potent new ligands with novel underlying scaffolds with very high specificity. Further, we show that combining predictions from QuanSA models with those from physics-based simulation approaches is synergistic. QuanSA predictions yield binding affinities, explicit estimates of ligand strain, associated ligand pose families, and estimates of structural novelty and confidence. The method is applicable for fine-grained lead optimization as well as potent new lead identification.

SUBMITTER: Cleves AE 

PROVIDER: S-EPMC6096883 | biostudies-literature | 2018 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Quantitative surface field analysis: learning causal models to predict ligand binding affinity and pose.

Cleves Ann E AE   Jain Ajay N AN  

Journal of computer-aided molecular design 20180622 7


We introduce the QuanSA method for inducing physically meaningful field-based models of ligand binding pockets based on structure-activity data alone. The method is closely related to the QMOD approach, substituting a learned scoring field for a pocket constructed of molecular fragments. The problem of mutual ligand alignment is addressed in a general way, and optimal model parameters and ligand poses are identified through multiple-instance machine learning. We provide algorithmic details along  ...[more]

Similar Datasets

| S-EPMC8274096 | biostudies-literature
| S-EPMC10765576 | biostudies-literature
| S-EPMC6373529 | biostudies-literature
| S-EPMC5562487 | biostudies-literature
| S-EPMC9454149 | biostudies-literature
| S-EPMC8985993 | biostudies-literature
| S-EPMC7328444 | biostudies-literature
| S-EPMC6856045 | biostudies-literature
| S-EPMC9421647 | biostudies-literature
| S-EPMC3725268 | biostudies-literature