Ontology highlight
ABSTRACT: Motivation
Developing targeted therapeutics and identifying biomarkers relies on large amounts of research participant data. Beyond human DNA, scientists now investigate the DNA of micro-organisms inhabiting the human body. Recent work shows that an individual's collection of microbial DNA consistently identifies that person and could be used to link a real-world identity to a sensitive attribute in a research dataset. Unfortunately, the current suite of DNA-specific privacy-preserving analysis tools does not meet the requirements for microbiome sequencing studies.Results
To address privacy concerns around microbiome sequencing, we implement metagenomic analyses using secure computation. Our implementation allows comparative analysis over combined data without revealing the feature counts for any individual sample. We focus on three analyses and perform an evaluation on datasets currently used by the microbiome research community. We use our implementation to simulate sharing data between four policy-domains. Additionally, we describe an application of our implementation for patients to combine data that allows drug developers to query against and compensate patients for the analysis.Availability and implementation
The software is freely available for download at: http://cbcb.umd.edu/?hcorrada/projects/secureseq.htmlSupplementary information
Supplementary data are available at Bioinformatics online.Contact
hcorrada@umiacs.umd.edu.
SUBMITTER: Wagner J
PROVIDER: S-EPMC4908319 | biostudies-literature | 2016 Jun
REPOSITORIES: biostudies-literature
Wagner Justin J Paulson Joseph N JN Wang Xiao X Bhattacharjee Bobby B Corrada Bravo Héctor H
Bioinformatics (Oxford, England) 20160211 12
<h4>Motivation</h4>Developing targeted therapeutics and identifying biomarkers relies on large amounts of research participant data. Beyond human DNA, scientists now investigate the DNA of micro-organisms inhabiting the human body. Recent work shows that an individual's collection of microbial DNA consistently identifies that person and could be used to link a real-world identity to a sensitive attribute in a research dataset. Unfortunately, the current suite of DNA-specific privacy-preserving a ...[more]