Dataset Information

Exploring non-linear distance metrics in the structure-activity space: QSAR models for human estrogen receptor.


ABSTRACT:

Background

Quantitative structure-activity relationship (QSAR) models are important tools for discovering new drug candidates and identifying potentially harmful environmental chemicals. These models often face two fundamental challenges: the limited amount of available biological activity data, and noise or uncertainty in the activity data themselves. To address these challenges, we introduce and explore a QSAR model based on custom distance metrics in the structure-activity space.

Methods

The model is built on top of the k-nearest neighbor model, incorporating non-linearity not only in the chemical structure space, but also in the biological activity space. The model is tuned and evaluated using activity data for human estrogen receptor from the US EPA ToxCast and Tox21 databases.
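
As a rough illustration of the idea described above, the following sketch shows a k-nearest neighbor predictor whose neighbor weights are non-linear in both the structure distance and the neighbor activity. This record does not give the actual metric used in the paper, so everything specific here is an assumption made for illustration: the Tanimoto distance on fingerprint bit sets, the exponential decay parameter `x`, the activity weighting parameter `y`, and the function names `tanimoto_distance` and `knn_predict`.

```python
import math


def tanimoto_distance(a, b):
    """Return 1 - Tanimoto similarity for two sets of 'on' fingerprint bits."""
    union = len(a | b)
    if union == 0:
        return 0.0  # two empty fingerprints are treated as identical
    return 1.0 - len(a & b) / union


def knn_predict(query, training, k=3, x=3.0, y=1.0):
    """Predict activity for `query` from its k nearest training chemicals.

    training: list of (fingerprint_bit_set, activity) pairs, activity in [0, 1].
    x, y: hypothetical tuning parameters (not taken from the paper).
    """
    if not training:
        raise ValueError("training set must not be empty")

    # Rank training chemicals by structural distance and keep the k closest.
    neighbors = sorted(
        ((tanimoto_distance(query, fp), act) for fp, act in training),
        key=lambda pair: pair[0],
    )[:k]

    # Assumed weighting: exponential decay in structure distance, and a
    # linear boost for more active neighbors. Because the boost depends on
    # the activity itself, the prediction is non-linear in the activities.
    weights = [math.exp(-x * d) * (1.0 + y * act) for d, act in neighbors]

    weighted_sum = sum(w * act for w, (_, act) in zip(weights, neighbors))
    return weighted_sum / sum(weights)
```

With `y = 0` and `x = 0` the function reduces to a plain unweighted kNN average, which makes it easy to compare the effect of each non-linear term during tuning.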

Results

The model closely trails the CERAPP consensus model (built on top of 48 individual human estrogen receptor activity models) in agonist activity predictions and consistently outperforms the CERAPP consensus model in antagonist activity predictions.

Discussion

We suggest that incorporating non-linear distance metrics may significantly improve QSAR model performance when the available biological activity data are limited.

SUBMITTER: Balabin IA 

PROVIDER: S-EPMC6755572 | biostudies-literature | 2018 Sep

REPOSITORIES: biostudies-literature

Publications

Exploring non-linear distance metrics in the structure-activity space: QSAR models for human estrogen receptor.

Balabin Ilya A, Judson Richard S

Journal of Cheminformatics, 2018 Sep 18, issue 1


Similar Datasets

| S-EPMC5345030 | biostudies-literature
| S-EPMC5850101 | biostudies-literature
| S-EPMC2887612 | biostudies-literature
| S-EPMC2323961 | biostudies-literature
| S-EPMC4516874 | biostudies-literature
| S-EPMC3222957 | biostudies-literature
| S-EPMC5708782 | biostudies-literature
| S-EPMC8029670 | biostudies-literature
| S-EPMC6420154 | biostudies-literature
| S-EPMC2774254 | biostudies-literature