Unknown

Dataset Information

0

QSAR workbench: automating QSAR modeling to drive compound design.


ABSTRACT: We describe the QSAR Workbench, a system for the building and analysis of QSAR models. The system is built around the Pipeline Pilot workflow tool and provides access to a variety of model building algorithms for both continuous and categorical data. Traditionally models are built on a one by one basis and fully exploring the model space of algorithms and descriptor subsets is a time consuming basis. The QSAR Workbench provides a framework to allow for multiple models to be built over a number of modeling algorithms, descriptor combinations and data splits (training and test sets). Methods to analyze and compare models are provided, enabling the user to select the most appropriate model. The Workbench provides a consistent set of routines for data preparation and chemistry normalization that are also applied for predictions. The Workbench provides a large degree of automation with the ability to publish preconfigured model building workflows for a variety of problem domains, whilst providing experienced users full access to the underlying parameterization if required. Methods are provided to allow for publication of selected models as web services, thus providing integration with the chemistry desktop. We describe the design and implementation of the QSAR Workbench and demonstrate its utility through application to two public domain datasets.

SUBMITTER: Cox R 

PROVIDER: S-EPMC3657086 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC3097501 | biostudies-literature
| S-EPMC4481845 | biostudies-other
| S-EPMC7097998 | biostudies-literature
| S-EPMC6558509 | biostudies-literature
| S-EPMC6314802 | biostudies-literature
| S-EPMC1302793 | biostudies-literature
| S-EPMC7520035 | biostudies-literature
| S-EPMC5498782 | biostudies-other
| S-EPMC6263259 | biostudies-literature
| S-EPMC3697037 | biostudies-literature