Dataset Information

Privacy-Preserving Anonymity for Periodical Releases of Spontaneous Adverse Drug Event Reporting Data: Algorithm Development and Validation.

ABSTRACT:

Background

Spontaneous reporting systems (SRSs) are increasingly being established to collect adverse drug events and to support research on adverse drug reaction (ADR) detection and analysis. Because SRS data contain personal information, they must be anonymized before publication to protect individuals' privacy. We previously proposed a privacy model called MS(k, θ*)-bounding and the associated MS-Anonymization algorithm for anonymizing SRS data. In practice, SRS data are usually released periodically (eg, the FDA Adverse Event Reporting System [FAERS]) to accommodate newly collected adverse drug events. An attacker with access to several anonymized releases may be able to defeat MS(k, θ*)-bounding, which was designed for a single release.

Objective

We investigate the privacy threat caused by periodical releases of SRS data and propose anonymization methods to prevent the disclosure of personal privacy information while maintaining the utility of published data.

Methods

We identify potential attacks on periodical releases of SRS data, namely, BFL-attacks, mainly caused by follow-up cases. We present a new privacy model called PPMS(k, θ*)-bounding, and propose the associated PPMS-Anonymization algorithm and 2 improvements: PPMS+-Anonymization and PPMS++-Anonymization. Empirical evaluations were performed using 32 selected FAERS quarterly data sets from 2004Q1 to 2011Q4. The proposed versions of PPMS-Anonymization were compared with MS-Anonymization on three aspects: data distortion, measured by normalized information loss; privacy risk of the anonymized data, measured by dangerous identity ratio and dangerous sensitivity ratio; and data utility, measured by the bias of signal counting and signal strength (proportional reporting ratio).
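The proportional reporting ratio (PRR) named above as the signal-strength measure can be sketched as follows. This is a minimal illustration using the standard 2x2 contingency-table definition of PRR, not the authors' code. The function names, the example counts, and the `relative_bias` helper (one plausible way to compare a PRR before and after anonymization) are assumptions introduced here; the abstract does not specify how bias was computed.

```python
def proportional_reporting_ratio(a, b, c, d):
    """Standard PRR from a 2x2 table of report counts.

    a: reports mentioning the drug and the event
    b: reports mentioning the drug but not the event
    c: reports mentioning the event but not the drug
    d: reports mentioning neither
    """
    if a + b == 0 or c + d == 0 or c == 0:
        raise ValueError("PRR is undefined for these counts")
    return (a / (a + b)) / (c / (c + d))


def relative_bias(original, anonymized):
    """Hypothetical helper: relative change of a signal value after anonymization."""
    if original == 0:
        raise ValueError("original value must be nonzero")
    return abs(anonymized - original) / original


# Made-up counts, for illustration only.
prr_original = proportional_reporting_ratio(20, 80, 10, 890)
prr_anonymized = proportional_reporting_ratio(15, 85, 10, 890)
bias = relative_bias(prr_original, prr_anonymized)
```

A bias close to zero would indicate that anonymization left the signal strength for that drug-event pair nearly unchanged.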

Results

The best version of PPMS-Anonymization, PPMS++-Anonymization, achieves nearly the same quality as MS-Anonymization in both privacy protection and data utility. Overall, PPMS++-Anonymization ensures zero privacy risk on record and attribute linkage, improves information loss by 51%-78% over PPMS+-Anonymization and by 59%-82% over PPMS-Anonymization, and significantly reduces the bias of ADR signals.

Conclusions

The proposed PPMS(k, θ*)-bounding model and PPMS-Anonymization algorithm are effective in anonymizing SRS data sets in the periodical data publishing scenario: they prevent BFL-attacks on the series of releases from disclosing sensitive personal information while maintaining the data utility for ADR signal detection.

SUBMITTER: Wang JT

PROVIDER: S-EPMC8587328 | biostudies-literature | 2021 Oct

REPOSITORIES: biostudies-literature

Publications

Privacy-Preserving Anonymity for Periodical Releases of Spontaneous Adverse Drug Event Reporting Data: Algorithm Development and Validation.

Wang Jie-Teng, Lin Wen-Yang

JMIR Medical Informatics, 2021 Oct 28, issue 10

PMID: 34709197

Similar Datasets

Adverse Events Related to Off-Label Drugs Using Spontaneous Adverse Event Reporting Systems. | S-EPMC8387311 | biostudies-literature

Statistical methods for exploring spontaneous adverse event reporting databases for drug-host factor interactions. | S-EPMC10041785 | biostudies-literature

Pattern of adverse events induced by aflibercept and ranibizumab: A nationwide spontaneous adverse event reporting database, 2007-2016. | S-EPMC6831246 | biostudies-literature

Spontaneous Reporting on Adverse Events by Consumers in the United States: An Analysis of the Food and Drug Administration Adverse Event Reporting System Database. | S-EPMC5984610 | biostudies-literature

Anonymity-preserving Reputation Management System for health sector. | S-EPMC5896921 | biostudies-literature

Cardiac Events Potentially Associated to Remdesivir: An Analysis from the European Spontaneous Adverse Event Reporting System. | S-EPMC8308754 | biostudies-literature

Privacy-preserving analysis of time-to-event data under nested case-control sampling. | S-EPMC10863373 | biostudies-literature

Privacy-preserving parallel kNN classification algorithm using index-based filtering in cloud computing. | S-EPMC9070920 | biostudies-literature

Adverse event reporting of mirtazapine: A disproportionality analysis of FDA adverse event reporting system (FAERS) database from 2004-2024. | S-EPMC12716765 | biostudies-literature

Adverse events of antibody-drug conjugates: comparative analysis of agents with a common payload using the adverse event spontaneous reporting database. | S-EPMC12517340 | biostudies-literature