Unknown

Dataset Information

0

Methylation array data can simultaneously identify individuals and convey protected health information: an unrecognized ethical concern.


ABSTRACT:

Background

Genome-wide methylation arrays are increasingly used tools in studies of complex medical disorders. Because of their expense and potential utility to the scientific community, current federal policy dictates that data from these arrays, like those from genome-wide genotyping arrays, be deposited in publicly available databases. Unlike the genotyping information, access to the expression data is not restricted. An underlying supposition in the current nonrestricted access to methylation data is the belief that protected health and personal identifying information cannot be simultaneously extracted from these arrays.

Results

In this communication, we analyze methylation data from the Illumina HumanMethylation450 array and show that genotype at 1,069 highly informative loci, and both alcohol and smoking consumption information, can be derived from the array data.

Conclusions

We conclude that both potentially personally identifying information and substance-use histories can be simultaneously derived from methylation array data. Because access to genetic information about a database subject or one of their relatives is critical to the de-identification process, this risk of de-identification is limited at the current time. We propose that access to genome-wide methylation data be restricted to institutionally approved investigators who accede to data use agreements prohibiting re-identification.

SUBMITTER: Philibert RA 

PROVIDER: S-EPMC4391334 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

altmetric image

Publications

Methylation array data can simultaneously identify individuals and convey protected health information: an unrecognized ethical concern.

Philibert Robert A RA   Terry Nicolas N   Erwin Cheryl C   Philibert Winter J WJ   Beach Steven Rh SR   Brody Gene H GH  

Clinical epigenetics 20141119 1


<h4>Background</h4>Genome-wide methylation arrays are increasingly used tools in studies of complex medical disorders. Because of their expense and potential utility to the scientific community, current federal policy dictates that data from these arrays, like those from genome-wide genotyping arrays, be deposited in publicly available databases. Unlike the genotyping information, access to the expression data is not restricted. An underlying supposition in the current nonrestricted access to me  ...[more]

Similar Datasets

| S-EPMC1314899 | biostudies-literature
| S-EPMC3939862 | biostudies-other
| S-EPMC2864335 | biostudies-literature
| S-EPMC2196135 | biostudies-literature
| S-EPMC5959723 | biostudies-literature
| S-EPMC3838392 | biostudies-literature
| S-EPMC6476769 | biostudies-literature
| S-EPMC8582278 | biostudies-literature
| S-EPMC5480596 | biostudies-literature
| S-EPMC5795053 | biostudies-literature