
Dataset Information


How Far Could We Go with Open Data - A Case Study for TRPV1 Antagonists.


ABSTRACT: Publicly open databases of small compounds have become an indispensable tool for chemoinformaticians for the collection and preparation of datasets suitable for drug discovery questions. Since these databases comprise compounds from structure-activity relationship (SAR) studies performed by different research groups, they are very diverse with respect to the biological assays used. In the present study we analyzed the applicability of a thoroughly curated dataset gathered from open sources for ligand-based studies, using the transient receptor potential vanilloid type 1 (TRPV1) as a use case. Thorough curation of compounds according to biological assay type and conditions led to a dataset of comparable bioactive chemicals. Subsequent exhaustive analysis of the obtained dataset using classification algorithms demonstrated that, in most cases, the resulting models are of reliable quality. Analysis of consistently misclassified compounds showed that they belong to local SAR series, where small changes in structure lead to different class labels. These small structural differences could not be captured by the classification algorithms. However, applying the 3D alignment-independent QSAR technique GRIND to local, structurally related series overcomes this problem.

SUBMITTER: Tsareva DA 

PROVIDER: S-EPMC3743172 | biostudies-literature | 2013 Jun

REPOSITORIES: biostudies-literature


Publications

How Far Could We Go with Open Data - A Case Study for TRPV1 Antagonists.

Tsareva Daria A, Ecker Gerhard F

Molecular Informatics, 2013-06-18, issue 5-6



Similar Datasets

S-EPMC9122639 | biostudies-literature
S-EPMC3763634 | biostudies-literature
S-EPMC3065692 | biostudies-literature
S-EPMC1432203 | biostudies-literature
S-EPMC9522383 | biostudies-literature
S-EPMC10497977 | biostudies-literature
S-EPMC8004144 | biostudies-literature
S-EPMC5935603 | biostudies-other
S-EPMC7548397 | biostudies-literature
S-EPMC8262459 | biostudies-literature