Unknown

Dataset Information

0

A composite likelihood approach in testing for Hardy Weinberg Equilibrium using family-based genetic survey data.


ABSTRACT: In population-based household surveys, for example, the National Health and Nutrition Examination Survey, households are often sampled by stratified multistage cluster sampling, and multiple individuals related by blood are often sampled within households. Therefore, genetic data collected from these population-based household surveys, called National Genetic Household Surveys, can be correlated because of two levels of correlation. One level of correlation is caused by the multistage geographical cluster sampling and the other is caused by biological inheritance among participants within the same sampled family. In this paper, we develop an efficient Hardy Weinberg Equilibrium (HWE) test utilizing pairwise composite likelihood methods that incorporate the sample weighting effect induced by the differential selection probabilities in complex sample designs, as well as the two-level clustering (correlation) effects described above. Monte Carlo simulation studies show that the proposed HWE test maintains the nominal levels, and is more powerful than existing methods (Li et al. 2011) under various (non)informative sample designs that depend on genotypes (explicitly or implicitly), family relationships or both, especially when within-household sampling depends on the genotypes. The developed tests are further evaluated using simulated genetic data based on the Hispanic Health and Nutrition Survey. Copyright © 2016 John Wiley & Sons, Ltd.

SUBMITTER: Wang L 

PROVIDER: S-EPMC7210008 | biostudies-literature | 2016 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

A composite likelihood approach in testing for Hardy Weinberg Equilibrium using family-based genetic survey data.

Wang Lingxiao L   Graubard Barry I BI   Li Yan Y  

Statistics in medicine 20160801 27


In population-based household surveys, for example, the National Health and Nutrition Examination Survey, households are often sampled by stratified multistage cluster sampling, and multiple individuals related by blood are often sampled within households. Therefore, genetic data collected from these population-based household surveys, called National Genetic Household Surveys, can be correlated because of two levels of correlation. One level of correlation is caused by the multistage geographic  ...[more]

Similar Datasets

| S-EPMC2640163 | biostudies-literature
| S-EPMC2657816 | biostudies-other
| S-EPMC3972031 | biostudies-literature
| S-EPMC4868269 | biostudies-literature
| S-EPMC2798818 | biostudies-literature
| S-EPMC2986695 | biostudies-literature
| S-EPMC9250657 | biostudies-literature
| S-EPMC4725899 | biostudies-literature
| S-EPMC5671567 | biostudies-literature
| S-EPMC5429372 | biostudies-literature