Unknown

Dataset Information

0

Simple and efficient analysis of disease association with missing genotype data.


ABSTRACT: Missing genotype data arise in association studies when the single-nucleotide polymorphisms (SNPs) on the genotyping platform are not assayed successfully, when the SNPs of interest are not on the platform, or when total sequence variation is determined only on a small fraction of individuals. We present a simple and flexible likelihood framework to study SNP-disease associations with such missing genotype data. Our likelihood makes full use of all available data in case-control studies and reference panels (e.g., the HapMap), and it properly accounts for the biased nature of the case-control sampling as well as the uncertainty in inferring unknown variants. The corresponding maximum-likelihood estimators for genetic effects and gene-environment interactions are unbiased and statistically efficient. We developed fast and stable numerical algorithms to calculate the maximum-likelihood estimators and their variances, and we implemented these algorithms in a freely available computer program. Simulation studies demonstrated that the new approach is more powerful than existing methods while providing accurate control of the type I error. An application to a case-control study on rheumatoid arthritis revealed several loci that deserve further investigations.

SUBMITTER: Lin DY 

PROVIDER: S-EPMC2427170 | biostudies-literature | 2008 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Simple and efficient analysis of disease association with missing genotype data.

Lin D Y DY   Hu Y Y   Huang B E BE  

American journal of human genetics 20080201 2


Missing genotype data arise in association studies when the single-nucleotide polymorphisms (SNPs) on the genotyping platform are not assayed successfully, when the SNPs of interest are not on the platform, or when total sequence variation is determined only on a small fraction of individuals. We present a simple and flexible likelihood framework to study SNP-disease associations with such missing genotype data. Our likelihood makes full use of all available data in case-control studies and refe  ...[more]

Similar Datasets

| S-EPMC7077088 | biostudies-literature
| S-EPMC2553438 | biostudies-literature
| S-EPMC5809924 | biostudies-literature
| S-EPMC4697868 | biostudies-literature
| S-EPMC3554627 | biostudies-literature
| S-EPMC2758715 | biostudies-literature
| S-EPMC2515855 | biostudies-other
| S-EPMC194902 | biostudies-literature
| S-EPMC2874738 | biostudies-literature
2012-01-06 | E-GEOD-26486 | biostudies-arrayexpress