Unknown

Dataset Information

0

Thompson Sampling─An Efficient Method for Searching Ultralarge Synthesis on Demand Databases.


ABSTRACT: Over the last five years, virtual screening of ultralarge synthesis on-demand libraries has emerged as a powerful tool for hit identification in drug discovery programs. As these libraries have grown to tens of billions of molecules, we have reached a point where it is no longer cost-effective to screen every molecule virtually. To address these challenges, several groups have developed heuristic search methods to rapidly identify the best molecules on a virtual screen. This article describes the application of Thompson sampling (TS), an active learning approach that streamlines the virtual screening of large combinatorial libraries by performing a probabilistic search in the reagent space, thereby never requiring the full enumeration of the library. TS is a general technique that can be applied to various virtual screening modalities, including 2D and 3D similarity search, docking, and application of machine-learning models. In an illustrative example, we show that TS can identify more than half of the top 100 molecules from a docking-based virtual screen of 335 million molecules by evaluating 1% of the data set.

SUBMITTER: Klarich K 

PROVIDER: S-EPMC10900287 | biostudies-literature | 2024 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Thompson Sampling─An Efficient Method for Searching Ultralarge Synthesis on Demand Databases.

Klarich Kathryn K   Goldman Brian B   Kramer Trevor T   Riley Patrick P   Walters W Patrick WP  

Journal of chemical information and modeling 20240205 4


Over the last five years, virtual screening of ultralarge synthesis on-demand libraries has emerged as a powerful tool for hit identification in drug discovery programs. As these libraries have grown to tens of billions of molecules, we have reached a point where it is no longer cost-effective to screen every molecule virtually. To address these challenges, several groups have developed heuristic search methods to rapidly identify the best molecules on a virtual screen. This article describes th  ...[more]

Similar Datasets

| S-EPMC8494236 | biostudies-literature
| S-EPMC10446805 | biostudies-literature
| S-EPMC3543413 | biostudies-literature
| S-EPMC4613384 | biostudies-literature
| S-EPMC11647911 | biostudies-literature
| S-EPMC8588424 | biostudies-literature
| S-EPMC6148622 | biostudies-literature
| S-EPMC3196336 | biostudies-literature
| S-EPMC1949409 | biostudies-literature
| S-EPMC146152 | biostudies-other