Unknown

Dataset Information

0

A blinded evaluation of privacy preserving record linkage with Bloom filters.


ABSTRACT:

Background

Privacy preserving record linkage (PPRL) methods using Bloom filters have shown promise for use in operational linkage settings. However real-world evaluations are required to confirm their suitability in practice.

Methods

An extract of records from the Western Australian (WA) Hospital Morbidity Data Collection 2011-2015 and WA Death Registrations 2011-2015 were encoded to Bloom filters, and then linked using privacy-preserving methods. Results were compared to a traditional, un-encoded linkage of the same datasets using the same blocking criteria to enable direct investigation of the comparison step. The encoded linkage was carried out in a blinded setting, where there was no access to un-encoded data or a 'truth set'.

Results

The PPRL method using Bloom filters provided similar linkage quality to the traditional un-encoded linkage, with 99.3% of 'groupings' identical between privacy preserving and clear-text linkage.

Conclusion

The Bloom filter method appears suitable for use in situations where clear-text identifiers cannot be provided for linkage.

SUBMITTER: Randall S 

PROVIDER: S-EPMC8761329 | biostudies-literature | 2022 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

A blinded evaluation of privacy preserving record linkage with Bloom filters.

Randall Sean S   Wichmann Helen H   Brown Adrian A   Boyd James J   Eitelhuber Tom T   Merchant Alexandra A   Ferrante Anna A  

BMC medical research methodology 20220116 1


<h4>Background</h4>Privacy preserving record linkage (PPRL) methods using Bloom filters have shown promise for use in operational linkage settings. However real-world evaluations are required to confirm their suitability in practice.<h4>Methods</h4>An extract of records from the Western Australian (WA) Hospital Morbidity Data Collection 2011-2015 and WA Death Registrations 2011-2015 were encoded to Bloom filters, and then linked using privacy-preserving methods. Results were compared to a tradit  ...[more]

Similar Datasets

| S-EPMC3932473 | biostudies-literature
| S-EPMC6180364 | biostudies-other
| S-EPMC9968283 | biostudies-literature
| S-EPMC11491627 | biostudies-literature
| S-EPMC8896632 | biostudies-literature
| S-EPMC9499274 | biostudies-literature
| S-EPMC7482522 | biostudies-literature
| S-EPMC6994009 | biostudies-literature
| S-EPMC7482515 | biostudies-literature
| S-EPMC10161965 | biostudies-literature