Dataset Information

Maximizing gain in high-throughput screening using conformal prediction.

ABSTRACT: Iterative screening has emerged as a promising approach to increase the efficiency of screening campaigns compared to traditional high throughput approaches. By learning from a subset of the compound library, inferences on what compounds to screen next can be made by predictive models, resulting in more efficient screening. One way to evaluate screening is to consider the cost of screening compared to the gain associated with finding an active compound. In this work, we introduce a conformal predictor coupled with a gain-cost function with the aim to maximise gain in iterative screening. Using this setup we were able to show that by evaluating the predictions on the training data, very accurate predictions on what settings will produce the highest gain on the test data can be made. We evaluate the approach on 12 bioactivity datasets from PubChem training the models using 20% of the data. Depending on the settings of the gain-cost function, the settings generating the maximum gain were accurately identified in 8-10 out of the 12 datasets. Broadly, our approach can predict what strategy generates the highest gain based on the results of the cost-gain evaluation: to screen the compounds predicted to be active, to screen all the remaining data, or not to screen any additional compounds. When the algorithm indicates that the predicted active compounds should be screened, our approach also indicates what confidence level to apply in order to maximize gain. Hence, our approach facilitates decision-making and allocation of the resources where they deliver the most value by indicating in advance the likely outcome of a screening campaign.

SUBMITTER: Svensson F

PROVIDER: S-EPMC5821614 | biostudies-literature | 2018 Feb

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Maximizing gain in high-throughput screening using conformal prediction.

Svensson Fredrik F Afzal Avid M AM Norinder Ulf U Bender Andreas A

Journal of cheminformatics 20180221 1

Iterative screening has emerged as a promising approach to increase the efficiency of screening campaigns compared to traditional high throughput approaches. By learning from a subset of the compound library, inferences on what compounds to screen next can be made by predictive models, resulting in more efficient screening. One way to evaluate screening is to consider the cost of screening compared to the gain associated with finding an active compound. In this work, we introduce a conformal pre ...[more]

PMID: 29468427

Dataset Information

Maximizing gain in high-throughput screening using conformal prediction.

Publications

Maximizing gain in high-throughput screening using conformal prediction.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

A simple high-throughput technology enables gain-of-function screening of human microRNAs.
| S-EPMC3671589 | biostudies-other

Estimating Potency in High-Throughput Screening Experiments by Maximizing the Rate of Change in Weighted Shannon Entropy.
| S-EPMC4908415 | biostudies-literature

High dielectric ternary oxides from crystal structure prediction and high-throughput screening.
| S-EPMC7060264 | biostudies-literature

Identification of Kinase-substrate Pairs Using High Throughput Screening.
| S-EPMC4692564 | biostudies-other

High-throughput carrier screening using TaqMan allelic discrimination.
| S-EPMC3608587 | biostudies-literature

High-Throughput parallel blind Virtual Screening using BINDSURF.
| S-EPMC3504923 | biostudies-literature

Towards Prebiotic Catalytic Amyloids Using High Throughput Screening.
| S-EPMC4674085 | biostudies-literature

High-throughput isolation of giant viruses using high-content screening.
| S-EPMC6584669 | biostudies-literature

High throughput reaction screening using desorption electrospray ionization mass spectrometry.
| S-EPMC5887808 | biostudies-other

Quantitative High-Throughput Screening Using a Coincidence Reporter Biocircuit.
| S-EPMC5510169 | biostudies-literature