Unknown

Dataset Information

0

Significantly improved HIV inhibitor efficacy prediction employing proteochemometric models generated from antivirogram data.


ABSTRACT: Infection with HIV cannot currently be cured; however it can be controlled by combination treatment with multiple anti-retroviral drugs. Given different viral genotypes for virtually each individual patient, the question now arises which drug combination to use to achieve effective treatment. With the availability of viral genotypic data and clinical phenotypic data, it has become possible to create computational models able to predict an optimal treatment regimen for an individual patient. Current models are based only on sequence data derived from viral genotyping; chemical similarity of drugs is not considered. To explore the added value of chemical similarity inclusion we applied proteochemometric models, combining chemical and protein target properties in a single bioactivity model. Our dataset was a large scale clinical database of genotypic and phenotypic information (in total ca. 300,000 drug-mutant bioactivity data points, 4 (NNRTI), 8 (NRTI) or 9 (PI) drugs, and 10,700 (NNRTI) 10,500 (NRTI) or 27,000 (PI) mutants). Our models achieved a prediction error below 0.5 Log Fold Change. Moreover, when directly compared with previously published sequence data, derived models PCM performed better in resistance classification and prediction of Log Fold Change (0.76 log units versus 0.91). Furthermore, we were able to successfully confirm both known and identify previously unpublished, resistance-conferring mutations of HIV Reverse Transcriptase (e.g. K102Y, T216M) and HIV Protease (e.g. Q18N, N88G) from our dataset. Finally, we applied our models prospectively to the public HIV resistance database from Stanford University obtaining a correct resistance prediction rate of 84% on the full set (compared to 80% in previous work on a high quality subset). We conclude that proteochemometric models are able to accurately predict the phenotypic resistance based on genotypic data even for novel mutants and mixtures. Furthermore, we add an applicability domain to the prediction, informing the user about the reliability of predictions.

SUBMITTER: van Westen GJ 

PROVIDER: S-EPMC3578754 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

altmetric image

Publications

Significantly improved HIV inhibitor efficacy prediction employing proteochemometric models generated from antivirogram data.

van Westen Gerard J P GJ   Hendriks Alwin A   Wegner Jörg K JK   Ijzerman Adriaan P AP   van Vlijmen Herman W T HW   Bender Andreas A  

PLoS computational biology 20130221 2


Infection with HIV cannot currently be cured; however it can be controlled by combination treatment with multiple anti-retroviral drugs. Given different viral genotypes for virtually each individual patient, the question now arises which drug combination to use to achieve effective treatment. With the availability of viral genotypic data and clinical phenotypic data, it has become possible to create computational models able to predict an optimal treatment regimen for an individual patient. Curr  ...[more]

Similar Datasets

| S-EPMC3085061 | biostudies-literature
2022-02-16 | GSE157494 | GEO
| S-EPMC7328444 | biostudies-literature
| S-EPMC154089 | biostudies-literature
| S-EPMC10463891 | biostudies-literature
| S-EPMC6523630 | biostudies-literature
| S-EPMC2777180 | biostudies-literature
| S-EPMC3561943 | biostudies-literature
| S-EPMC4957684 | biostudies-literature
2020-12-31 | GSE158699 | GEO