Unknown

Dataset Information

0

Enhanced QSAR model performance by integrating structural and gene expression information.


ABSTRACT: Despite decades of intensive research and a number of demonstrable successes, quantitative structure-activity relationship (QSAR) models still fail to yield predictions with reasonable accuracy in some circumstances, especially when the QSAR paradox occurs. In this study, to avoid the QSAR paradox, we proposed a novel integrated approach to improve the model performance through using both structural and biological information from compounds. As a proof-of-concept, the integrated models were built on a toxicological dataset to predict non-genotoxic carcinogenicity of compounds, using not only the conventional molecular descriptors but also expression profiles of significant genes selected from microarray data. For test set data, our results demonstrated that the prediction accuracy of QSAR model was dramatically increased from 0.57 to 0.67 with incorporation of expression data of just one selected signature gene. Our successful integration of biological information into classic QSAR model provided a new insight and methodology for building predictive models especially when QSAR paradox occurred.

SUBMITTER: Chen Q 

PROVIDER: S-EPMC6270197 | biostudies-literature | 2013 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Enhanced QSAR model performance by integrating structural and gene expression information.

Chen Qian Q   Wu Leihong L   Liu Wei W   Xing Li L   Fan Xiaohui X  

Molecules (Basel, Switzerland) 20130904 9


Despite decades of intensive research and a number of demonstrable successes, quantitative structure-activity relationship (QSAR) models still fail to yield predictions with reasonable accuracy in some circumstances, especially when the QSAR paradox occurs. In this study, to avoid the QSAR paradox, we proposed a novel integrated approach to improve the model performance through using both structural and biological information from compounds. As a proof-of-concept, the integrated models were buil  ...[more]

Similar Datasets

| S-EPMC6476151 | biostudies-literature
| S-EPMC8095114 | biostudies-literature
| S-EPMC4047268 | biostudies-literature
| S-EPMC4529363 | biostudies-other
| S-EPMC6121016 | biostudies-literature
| S-EPMC6936209 | biostudies-literature
| S-EPMC3639359 | biostudies-literature
| S-EPMC4895261 | biostudies-literature
| S-EPMC5758906 | biostudies-literature
| S-EPMC4150992 | biostudies-literature