
Dataset Information


A Machine Learning Strategy for Drug Discovery Identifies Anti-Schistosomal Small Molecules.


ABSTRACT: Schistosomiasis is a chronic and painful disease of poverty caused by the flatworm parasite Schistosoma. Drug discovery for antischistosomal compounds predominantly employs in vitro whole-organism (phenotypic) screens against two developmental stages of Schistosoma mansoni: post-infective larvae (somules) and adults. We generated two rule books and associated scoring systems to normalize 3898 phenotypic data points to enable machine learning. The data were used to generate eight Bayesian machine learning models with the Assay Central software according to the parasite's developmental stage and experimental time point (≤24, 48, 72, and >72 h). The models helped predict 56 active and nonactive compounds from commercial compound libraries for testing. When these were screened against S. mansoni in vitro, the prediction accuracy for actives and inactives was 61% and 56% for somules and adults, respectively; hit rates were 48% and 34%, respectively, far exceeding the typical 1-2% hit rate for traditional high-throughput screens.

SUBMITTER: Zorn KM 

PROVIDER: S-EPMC7887754 | biostudies-literature | 2021 Feb

REPOSITORIES: biostudies-literature


Publications

A Machine Learning Strategy for Drug Discovery Identifies Anti-Schistosomal Small Molecules.

Kimberley M Zorn, Shengxi Sun, Cecelia L McConnon, Kelley Ma, Eric K Chen, Daniel H Foil, Thomas R Lane, Lawrence J Liu, Nelly El-Sakkary, Danielle E Skinner, Sean Ekins, Conor R Caffrey

ACS Infectious Diseases, 2021-01-12, issue 2



Similar Datasets

| S-EPMC7807207 | biostudies-literature
2021-01-14 | GSE164788 | GEO
| S-EPMC8356896 | biostudies-literature
| S-EPMC6428806 | biostudies-literature
| S-EPMC8574649 | biostudies-literature
2020-01-22 | GSE129144 | GEO
2022-10-01 | GSE200096 | GEO
| S-EPMC7815257 | biostudies-literature
| S-EPMC7884393 | biostudies-literature
| S-EPMC4827534 | biostudies-literature