Unknown

Dataset Information

0

A Bayesian Machine Learning Approach for Optimizing Dynamic Treatment Regimes.


ABSTRACT: Medical therapy often consists of multiple stages, with a treatment chosen by the physician at each stage based on the patient's history of treatments and clinical outcomes. These decisions can be formalized as a dynamic treatment regime. This paper describes a new approach for optimizing dynamic treatment regimes that bridges the gap between Bayesian inference and existing approaches, like Q-learning. The proposed approach fits a series of Bayesian regression models, one for each stage, in reverse sequential order. Each model uses as a response variable the remaining payoff assuming optimal actions are taken at subsequent stages, and as covariates the current history and relevant actions at that stage. The key difficulty is that the optimal decision rules at subsequent stages are unknown, and even if these decision rules were known the relevant response variables may be counterfactual. However, posterior distributions can be derived from the previously fitted regression models for the optimal decision rules and the counterfactual response variables under a particular set of rules. The proposed approach averages over these posterior distributions when fitting each regression model. An efficient sampling algorithm for estimation is presented, along with simulation studies that compare the proposed approach with Q-learning.

SUBMITTER: Murray TA 

PROVIDER: S-EPMC6366650 | biostudies-other | 2018

REPOSITORIES: biostudies-other

altmetric image

Publications

A Bayesian Machine Learning Approach for Optimizing Dynamic Treatment Regimes.

Murray Thomas A TA   Yuan Ying Y   Thall Peter F PF  

Journal of the American Statistical Association 20181008 523


Medical therapy often consists of multiple stages, with a treatment chosen by the physician at each stage based on the patient's history of treatments and clinical outcomes. These decisions can be formalized as a dynamic treatment regime. This paper describes a new approach for optimizing dynamic treatment regimes that bridges the gap between Bayesian inference and existing approaches, like Q-learning. The proposed approach fits a series of Bayesian regression models, one for each stage, in reve  ...[more]

Similar Datasets

| S-EPMC6750237 | biostudies-literature
| S-EPMC5175473 | biostudies-literature
| S-EPMC5966293 | biostudies-literature
| S-EPMC6457899 | biostudies-literature
| S-EPMC4517946 | biostudies-literature
| S-EPMC4231831 | biostudies-literature
| S-EPMC7113333 | biostudies-literature
| S-EPMC4300556 | biostudies-literature
| S-EPMC6373443 | biostudies-literature
| S-EPMC5015434 | biostudies-literature