Dataset Information

Validity of Privacy-Protecting Analytical Methods That Use Only Aggregate-Level Information to Conduct Multivariable-Adjusted Analysis in Distributed Data Networks.

ABSTRACT: Distributed data networks enable large-scale epidemiologic studies, but protecting privacy while adequately adjusting for a large number of covariates continues to pose methodological challenges. Using 2 empirical examples within a 3-site distributed data network, we tested combinations of 3 aggregate-level data-sharing approaches (risk-set, summary-table, and effect-estimate), 4 confounding adjustment methods (matching, stratification, inverse probability weighting, and matching weighting), and 2 summary scores (propensity score and disease risk score) for binary and time-to-event outcomes. We assessed the performance of combinations of these data-sharing and adjustment methods by comparing their results with results from the corresponding pooled individual-level data analysis (reference analysis). For both types of outcomes, the method combinations examined yielded results identical or comparable to the reference results in most scenarios. Within each data-sharing approach, comparability between aggregate- and individual-level data analysis depended on adjustment method; for example, risk-set data-sharing with matched or stratified analysis of summary scores produced identical results, while weighted analysis showed some discrepancies. Across the adjustment methods examined, risk-set data-sharing generally performed better, while summary-table and effect-estimate data-sharing more often produced discrepancies in settings with rare outcomes and small sample sizes. Valid multivariable-adjusted analysis can be performed in distributed data networks without sharing of individual-level data.
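As a rough illustration of two of the aggregate-level approaches named in the abstract, the sketch below pools site-level results without touching individual-level records. It assumes that effect-estimate data-sharing can be represented by fixed-effect inverse-variance pooling of site-specific log hazard ratios, and that summary-table data-sharing can be represented by a Mantel-Haenszel odds ratio computed from stratified 2x2 count tables. Both choices are assumptions made for illustration and are not taken from the paper; the function names and all numbers are hypothetical.

```python
import math


def pool_inverse_variance(estimates):
    """Fixed-effect inverse-variance pooling of site-level log effect estimates.

    estimates: list of (log_effect, standard_error) pairs, one per site.
    Returns (pooled_log_effect, pooled_standard_error).
    """
    weights = [1.0 / se ** 2 for _, se in estimates]
    total = sum(weights)
    pooled = sum(w * b for w, (b, _) in zip(weights, estimates)) / total
    return pooled, math.sqrt(1.0 / total)


def mantel_haenszel_or(tables):
    """Mantel-Haenszel odds ratio from stratified 2x2 count tables.

    tables: list of (a, b, c, d) tuples, one per site-by-stratum cell, where
    a = exposed with outcome, b = exposed without outcome,
    c = unexposed with outcome, d = unexposed without outcome.
    """
    num = sum(a * d / (a + b + c + d) for a, b, c, d in tables)
    den = sum(b * c / (a + b + c + d) for a, b, c, d in tables)
    return num / den


# Hypothetical inputs from 3 sites (illustrative numbers only, not study data).
site_estimates = [
    (math.log(0.80), 0.20),
    (math.log(0.90), 0.10),
    (math.log(0.70), 0.40),
]
log_hr, se = pool_inverse_variance(site_estimates)
hazard_ratio = math.exp(log_hr)
ci = (math.exp(log_hr - 1.96 * se), math.exp(log_hr + 1.96 * se))

site_tables = [(10, 90, 20, 80), (5, 95, 10, 90), (8, 92, 12, 88)]
odds_ratio = mantel_haenszel_or(site_tables)

print(f"pooled HR = {hazard_ratio:.3f}, 95% CI = ({ci[0]:.3f}, {ci[1]:.3f})")
print(f"Mantel-Haenszel OR = {odds_ratio:.3f}")
```

In both functions each site contributes only a handful of numbers (an estimate with its standard error, or cell counts per stratum), which is the property that makes these approaches privacy-protecting. With few events per site, the standard errors and cell counts become unstable, which is consistent with the abstract's note that these two approaches showed more discrepancies for rare outcomes and small samples.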

SUBMITTER: Li X

PROVIDER: S-EPMC6438804 | biostudies-literature | 2019 Apr

REPOSITORIES: biostudies-literature

Publications

Validity of Privacy-Protecting Analytical Methods That Use Only Aggregate-Level Information to Conduct Multivariable-Adjusted Analysis in Distributed Data Networks.

Li X, Fireman BH, Curtis JR, Arterburn DE, Fisher DP, Moyneur É, Gallagher M, Raebel MA, Nowell WB, Lagreid L, Toh S

American Journal of Epidemiology, 2019 Apr 1, issue 4

PMID: 30535131

Similar Datasets

Privacy-protecting estimation of adjusted risk ratios using modified Poisson regression in multi-center studies.
| S-EPMC6894462 | biostudies-literature

Conducting Privacy-Preserving Multivariable Propensity Score Analysis When Patient Covariate Information Is Stored in Separate Locations.
| S-EPMC5391702 | biostudies-literature

Protecting patient privacy in survival analyses.
| S-EPMC7025359 | biostudies-literature

Routes for breaching and protecting genetic privacy.
| S-EPMC4151119 | biostudies-literature

Protecting contacts against privacy leaks in smartphones.
| S-EPMC6040689 | biostudies-literature

SecureMA: protecting participant privacy in genetic association meta-analysis.
| S-EPMC4296153 | biostudies-literature

Protection of Location Privacy Based on Distributed Collaborative Recommendations.
| S-EPMC5029899 | biostudies-literature

Privacy-Protecting, Reliable Response Data Discovery Using COVID-19 Patient Observations.
| S-EPMC7523159 | biostudies-literature

Privacy-protecting, reliable response data discovery using COVID-19 patient observations.
| S-EPMC8194878 | biostudies-literature

A Privacy-Preserving Distributed Analytics Platform for Health Care Data.
| S-EPMC9246511 | biostudies-literature