Ontology highlight
ABSTRACT: Objective
The electronic health record is a rising resource for quantifying medical practice, discovering the adverse effects of drugs, and studying comparative effectiveness. One of the challenges of applying these methods to health care data is the high dimensionality of the health record. Methods to discover the effects of drugs in health data must account for tens of thousands of potentially relevant confounders. Our goal in this work is to reduce the dimensionality of the health data with the aim of accelerating the application of retrospective cohort studies to this data.Materials and methods
Here, we develop indication embeddings, a way to reduce the dimensionality of health data while capturing information relevant to treatment decisions. We evaluate these embeddings using external data on drug indications. Then, we use the embeddings as a substitute for medical history to match patients and develop evaluation metrics for these matches.Results
We demonstrate that these embeddings recover the therapeutic uses of drugs. We use embeddings as an informative representation of relationships between drugs, between health history events and drug prescriptions, and between patients at a particular time in their health history. We show that using embeddings to match cohorts improves the balance of the cohorts, even in terms of poorly measured risk factors like smoking.Discussion and conclusion
Unlike other embeddings inspired by word2vec, indication embeddings are specifically designed to capture the medical history leading to the prescription of a new drug. For retrospective cohort studies, our low-dimensional representation helps in finding comparator drugs and constructing comparator cohorts.
SUBMITTER: Melamed RD
PROVIDER: S-EPMC7751136 | biostudies-literature |
REPOSITORIES: biostudies-literature