Unknown

Dataset Information

0

Genetic association analysis under complex survey sampling: the Hispanic Community Health Study/Study of Latinos.


ABSTRACT: The cohort design allows investigators to explore the genetic basis of a variety of diseases and traits in a single study while avoiding major weaknesses of the case-control design. Most cohort studies employ multistage cluster sampling with unequal probabilities to conveniently select participants with desired characteristics, and participants from different clusters might be genetically related. Analysis that ignores the complex sampling design can yield biased estimation of the genetic association and inflation of the type I error. Herein, we develop weighted estimators that reflect unequal selection probabilities and differential nonresponse rates, and we derive variance estimators that properly account for the sampling design and the potential relatedness of participants in different sampling units. We compare, both analytically and numerically, the performance of the proposed weighted estimators with unweighted estimators that disregard the sampling design. We demonstrate the usefulness of the proposed methods through analysis of MetaboChip data in the Hispanic Community Health Study/Study of Latinos, which is the largest health study of the Hispanic/Latino population in the United States aimed at identifying risk factors for various diseases and determining the role of genes and environment in the occurrence of diseases. We provide guidelines on the use of weighted and unweighted estimators, as well as the relevant software.

SUBMITTER: Lin DY 

PROVIDER: S-EPMC4259979 | biostudies-literature | 2014 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Genetic association analysis under complex survey sampling: the Hispanic Community Health Study/Study of Latinos.

Lin Dan-Yu DY   Tao Ran R   Kalsbeek William D WD   Zeng Donglin D   Gonzalez Franklyn F   Fernández-Rhodes Lindsay L   Graff Mariaelisa M   Koch Gary G GG   North Kari E KE   Heiss Gerardo G  

American journal of human genetics 20141201 6


The cohort design allows investigators to explore the genetic basis of a variety of diseases and traits in a single study while avoiding major weaknesses of the case-control design. Most cohort studies employ multistage cluster sampling with unequal probabilities to conveniently select participants with desired characteristics, and participants from different clusters might be genetically related. Analysis that ignores the complex sampling design can yield biased estimation of the genetic associ  ...[more]

Similar Datasets

| S-EPMC4716704 | biostudies-other
| S-EPMC8245895 | biostudies-literature
| S-EPMC7603898 | biostudies-literature
| S-EPMC5639746 | biostudies-literature
| S-EPMC10061308 | biostudies-literature
| S-EPMC5428979 | biostudies-literature
| S-EPMC5347427 | biostudies-literature
| S-EPMC5583292 | biostudies-literature
| S-EPMC7376098 | biostudies-literature
| S-EPMC4351363 | biostudies-literature