Unknown

Dataset Information

0

Phenotype validation in electronic health records based genetic association studies.


ABSTRACT: The linkage between electronic health records (EHRs) and genotype data makes it plausible to study the genetic susceptibility of a wide range of disease phenotypes. Despite that EHR-derived phenotype data are subjected to misclassification, it has been shown useful for discovering susceptible genes, particularly in the setting of phenome-wide association studies (PheWAS). It is essential to characterize discovered associations using gold standard phenotype data by chart review. In this work, we propose a genotype stratified case-control sampling strategy to select subjects for phenotype validation. We develop a closed-form maximum-likelihood estimator for the odds ratio parameters and a score statistic for testing genetic association using the combined validated and error-prone EHR-derived phenotype data, and assess the extent of power improvement provided by this approach. Compared with case-control sampling based only on EHR-derived phenotype data, our genotype stratified strategy maintains nominal type I error rates, and result in higher power for detecting associations. It also corrects the bias in the odds ratio parameter estimates, and reduces the corresponding variance especially when the minor allele frequency is small.

SUBMITTER: Wang L 

PROVIDER: S-EPMC5891135 | biostudies-literature | 2017 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Phenotype validation in electronic health records based genetic association studies.

Wang Lu L   Damrauer Scott M SM   Zhang Hong H   Zhang Alan X AX   Xiao Rui R   Moore Jason H JH   Chen Jinbo J  

Genetic epidemiology 20171011 8


The linkage between electronic health records (EHRs) and genotype data makes it plausible to study the genetic susceptibility of a wide range of disease phenotypes. Despite that EHR-derived phenotype data are subjected to misclassification, it has been shown useful for discovering susceptible genes, particularly in the setting of phenome-wide association studies (PheWAS). It is essential to characterize discovered associations using gold standard phenotype data by chart review. In this work, we  ...[more]

Similar Datasets

| S-EPMC5904248 | biostudies-literature
| S-EPMC4465698 | biostudies-literature
| S-EPMC4185241 | biostudies-literature
| S-EPMC4377559 | biostudies-literature
| S-EPMC9080132 | biostudies-literature
| S-EPMC9620826 | biostudies-literature
| PRJNA683675 | ENA
| S-EPMC10782284 | biostudies-literature
| S-EPMC10371114 | biostudies-literature
| S-EPMC7365658 | biostudies-literature