Ontology highlight
ABSTRACT:
SUBMITTER: Hejblum BP
PROVIDER: S-EPMC6326114 | biostudies-literature | 2019 Jan
REPOSITORIES: biostudies-literature
Hejblum Boris P BP Weber Griffin M GM Liao Katherine P KP Palmer Nathan P NP Churchill Susanne S Shadick Nancy A NA Szolovits Peter P Murphy Shawn N SN Kohane Isaac S IS Cai Tianxi T
Scientific data 20190108
We develop an algorithm for probabilistic linkage of de-identified research datasets at the patient level, when only diagnosis codes with discrepancies and no personal health identifiers such as name or date of birth are available. It relies on Bayesian modelling of binarized diagnosis codes, and provides a posterior probability of matching for each patient pair, while considering all the data at once. Both in our simulation study (using an administrative claims dataset for data generation) and ...[more]