Dataset Information

On using electronic health records to improve optimal treatment rules in randomized trials.

ABSTRACT: Individualized treatment rules (ITRs) tailor medical treatments according to patient-specific characteristics in order to optimize patient outcomes. Data from randomized controlled trials (RCTs) are used to infer valid ITRs using statistical and machine learning methods. However, RCTs are usually conducted under specific inclusion/exclusion criteria, thus limiting their generalizability to a broader patient population in real-world practice settings. Because electronic health records (EHRs) document treatment prescriptions in the real world, transferring information in EHRs to RCTs, if done appropriately, could potentially improve the performance of ITRs, in terms of precision and generalizability. In this work, we propose a new domain adaptation method to learn ITRs by incorporating information from EHRs. Unless we assume that there is no unmeasured confounding in EHRs, we cannot directly learn the optimal ITR from the combined EHR and RCT data. Instead, we first pretrain "super" features from EHRs that summarize physician treatment decisions and patient observed benefits in the real world, as these are likely to be informative of the optimal ITRs. We then augment the feature space of the RCT and learn the optimal ITRs by stratifying by super features using subjects enrolled in RCT. We adopt Q-learning and a modified matched-learning algorithm for estimation. We present heuristic justification of our method and conduct simulation studies to demonstrate the performance of super features. Finally, we apply our method to transfer information learned from EHRs of patients with type 2 diabetes to learn individualized insulin therapies from RCT data.

SUBMITTER: Wu P

PROVIDER: S-EPMC7786287 | biostudies-literature | 2020 Dec

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

On using electronic health records to improve optimal treatment rules in randomized trials.

Wu Peng P Zeng Donglin D Fu Haoda H Wang Yuanjia Y

Biometrics 20200514 4

Individualized treatment rules (ITRs) tailor medical treatments according to patient-specific characteristics in order to optimize patient outcomes. Data from randomized controlled trials (RCTs) are used to infer valid ITRs using statistical and machine learning methods. However, RCTs are usually conducted under specific inclusion/exclusion criteria, thus limiting their generalizability to a broader patient population in real-world practice settings. Because electronic health records (EHRs) docu ...[more]

PMID: 32365232

Dataset Information

On using electronic health records to improve optimal treatment rules in randomized trials.

Publications

On using electronic health records to improve optimal treatment rules in randomized trials.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Estimating individualized treatment rules for multicategory type 2 diabetes treatments using electronic health records.
| S-EPMC10857856 | biostudies-literature

Matched Learning for Optimizing Individualized Treatment Strategies Using Electronic Health Records.
| S-EPMC7539620 | biostudies-literature

Using Electronic Health Records to Derive Control Arms for Early Phase Single-Arm Lung Cancer Trials: Proof-of-Concept in Randomized Controlled Trials.
| S-EPMC7006884 | biostudies-literature

Incorporating natural language processing to improve classification of axial spondyloarthritis using electronic health records.
| S-EPMC7850056 | biostudies-literature

Publication bias in clinical trials of electronic health records.
| S-EPMC3662474 | biostudies-literature

Weight Change and the Onset of Cardiovascular Diseases: Emulating Trials Using Electronic Health Records.
| S-EPMC8318567 | biostudies-literature

Using Electronic Health Records to Support Clinical Trials: A Report on Stakeholder Engagement for EHR4CR.
| S-EPMC4619877 | biostudies-literature

High-throughput genetic analyses of drug intolerances using electronic health records
| PRJNA683675 | ENA

Different approaches to improve cohort identification using electronic health records: X-linked hypophosphatemia as an example.
| S-EPMC7882088 | biostudies-literature

Detecting Associations between Major Depressive Disorder Treatment and Essential Hypertension using Electronic Health Records.
| S-EPMC4419773 | biostudies-literature