
Dataset Information


Enabling Privacy-Preserving GWASs in Heterogeneous Human Populations.


ABSTRACT: The proliferation of large genomic databases offers the potential to perform increasingly larger-scale genome-wide association studies (GWASs). Due to privacy concerns, however, access to these data is limited, greatly reducing their usefulness for research. Here, we introduce a computational framework for performing GWASs that adapts principles of differential privacy, a cryptographic theory that facilitates secure analysis of sensitive data, to both protect private phenotype information (e.g., disease status) and correct for population stratification. This framework enables us to produce privacy-preserving GWAS results based on EIGENSTRAT and linear mixed model (LMM)-based statistics, both of which correct for population stratification. We test our differentially private statistics, PrivSTRAT and PrivLMM, on simulated and real GWAS datasets and find they are able to protect privacy while returning meaningful results. Our framework can be used to securely query private genomic datasets to discover which specific genomic alterations may be associated with a disease, thus increasing the availability of these valuable datasets.
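
As a rough illustration of the differential privacy principle the abstract refers to, the sketch below applies the standard Laplace mechanism to a simple count: the number of cases carrying a minor allele at one SNP. This is not PrivSTRAT or PrivLMM, and it does not correct for population stratification. The function names, the 0/1 phenotype coding, and the choice of statistic are assumptions made only for this example.

```python
import random


def laplace_noise(scale, rng):
    """Draw one sample from Laplace(0, scale).

    The difference of two independent exponential variables with
    rate 1/scale follows a Laplace distribution with that scale.
    """
    rate = 1.0 / scale
    return rng.expovariate(rate) - rng.expovariate(rate)


def noisy_case_carrier_count(genotypes, phenotypes, epsilon, rng):
    """Release a noisy count of cases who carry the minor allele.

    Hypothetical helper for illustration only.
    genotypes:  minor-allele counts per individual (0, 1, or 2)
    phenotypes: disease status per individual (1 = case, 0 = control)
    epsilon:    privacy budget; smaller means more noise
    """
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    if len(genotypes) != len(phenotypes):
        raise ValueError("genotypes and phenotypes must have equal length")

    true_count = sum(
        1 for g, p in zip(genotypes, phenotypes) if g > 0 and p == 1
    )
    # Changing one individual's phenotype changes the count by at most 1,
    # so the sensitivity is 1 and the Laplace scale is 1 / epsilon.
    sensitivity = 1.0
    return true_count + laplace_noise(sensitivity / epsilon, rng)


# Example call with made-up data; the result varies with the seed.
example_rng = random.Random(2016)
example_release = noisy_case_carrier_count(
    [0, 1, 2, 0, 1], [1, 1, 0, 0, 1], epsilon=1.0, rng=example_rng
)
```

Here only the phenotype vector is treated as private, which matches the abstract's focus on protecting phenotype information; a smaller `epsilon` gives stronger protection at the cost of a noisier released value.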

SUBMITTER: Simmons S 

PROVIDER: S-EPMC4994706 | biostudies-literature | 2016 Jul

REPOSITORIES: biostudies-literature


Publications

Enabling Privacy-Preserving GWASs in Heterogeneous Human Populations.

Sean Simmons, Cenk Sahinalp, Bonnie Berger

Cell Systems, 2016-07-21, issue 1



Similar Datasets

| S-EPMC7084661 | biostudies-literature
| S-EPMC3932473 | biostudies-literature
| S-EPMC7482515 | biostudies-literature
| S-EPMC4704467 | biostudies-literature
| S-EPMC8487550 | biostudies-literature
| S-EPMC4848404 | biostudies-literature
| S-EPMC8276015 | biostudies-literature
| S-EPMC4908319 | biostudies-literature
| S-EPMC6585383 | biostudies-literature
| S-EPMC8857019 | biostudies-literature