Unknown

Dataset Information

0

Enabling genomic-phenomic association discovery without sacrificing anonymity.


ABSTRACT: Health information technologies facilitate the collection of massive quantities of patient-level data. A growing body of research demonstrates that such information can support novel, large-scale biomedical investigations at a fraction of the cost of traditional prospective studies. While healthcare organizations are being encouraged to share these data in a de-identified form, there is hesitation over concerns that it will allow corresponding patients to be re-identified. Currently proposed technologies to anonymize clinical data may make unrealistic assumptions with respect to the capabilities of a recipient to ascertain a patients identity. We show that more pragmatic assumptions enable the design of anonymization algorithms that permit the dissemination of detailed clinical profiles with provable guarantees of protection. We demonstrate this strategy with a dataset of over one million medical records and show that 192 genotype-phenotype associations can be discovered with fidelity equivalent to non-anonymized clinical data.

SUBMITTER: Heatherly RD 

PROVIDER: S-EPMC3566194 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

altmetric image

Publications

Enabling genomic-phenomic association discovery without sacrificing anonymity.

Heatherly Raymond D RD   Loukides Grigorios G   Denny Joshua C JC   Haines Jonathan L JL   Roden Dan M DM   Malin Bradley A BA  

PloS one 20130206 2


Health information technologies facilitate the collection of massive quantities of patient-level data. A growing body of research demonstrates that such information can support novel, large-scale biomedical investigations at a fraction of the cost of traditional prospective studies. While healthcare organizations are being encouraged to share these data in a de-identified form, there is hesitation over concerns that it will allow corresponding patients to be re-identified. Currently proposed tec  ...[more]

Similar Datasets

| S-EPMC7317793 | biostudies-literature
| S-EPMC7820566 | biostudies-literature
| S-EPMC10600307 | biostudies-literature
| S-EPMC6309501 | biostudies-literature
| S-EPMC9257259 | biostudies-literature
| S-EPMC4331717 | biostudies-literature
| S-EPMC4556653 | biostudies-literature
| S-EPMC2807471 | biostudies-literature
| S-EPMC8328207 | biostudies-literature
| S-EPMC4059762 | biostudies-other