Dataset Information

Analysis of Biological Screening Compounds with Single- or Multi-Target Activity via Diagnostic Machine Learning.


ABSTRACT: Predicting compounds with single- and multi-target activity and exploring the origins of compound specificity and promiscuity are of high interest for chemical biology and drug discovery. We present a large-scale analysis of compound promiscuity comprising two major components. First, high-confidence datasets of compounds with multi-target and corresponding single-target activity were extracted from biological screening data. Positive and negative assay results were taken into account, and data completeness was ensured. Second, these datasets were investigated using diagnostic machine learning to systematically distinguish between compounds with multi- and single-target activity. Models built on the basis of chemical structure consistently produced meaningful predictions. These findings provided evidence for the presence of structural features differentiating promiscuous and non-promiscuous compounds. Machine learning under varying conditions using modified datasets revealed a strong influence of nearest neighbor relationships on the predictions. Many multi-target compounds were found to be more similar to other multi-target compounds than to single-target compounds, and vice versa, which resulted in consistently accurate predictions. The results of our study confirm the presence of structural relationships that differentiate promiscuous and non-promiscuous compounds.

SUBMITTER: Feldmann C 

PROVIDER: S-EPMC7761051 | biostudies-literature | 2020 Nov

REPOSITORIES: biostudies-literature

Publications

Analysis of Biological Screening Compounds with Single- or Multi-Target Activity via Diagnostic Machine Learning.

Feldmann Christian, Yonchev Dimitar, Bajorath Jürgen

Biomolecules, 2020-11-27, issue 12



Similar Datasets

| S-EPMC8147869 | biostudies-literature
| S-EPMC11293328 | biostudies-literature
| S-EPMC5374972 | biostudies-other
2023-01-25 | GSE223385 | GEO
| S-EPMC4958551 | biostudies-literature
| S-EPMC6538545 | biostudies-literature
| S-EPMC9682350 | biostudies-literature
| S-EPMC8566526 | biostudies-literature
| S-EPMC7670649 | biostudies-literature
| S-EPMC8479195 | biostudies-literature