Unknown

Dataset Information

0

A general framework for studying genetic effects and gene-environment interactions with missing data.


ABSTRACT: Missing data arise in genetic association studies when genotypes are unknown or when haplotypes are of direct interest. We provide a general likelihood-based framework for making inference on genetic effects and gene-environment interactions with such missing data. We allow genetic and environmental variables to be correlated while leaving the distribution of environmental variables completely unspecified. We consider 3 major study designs-cross-sectional, case-control, and cohort designs-and construct appropriate likelihood functions for all common phenotypes (e.g. case-control status, quantitative traits, and potentially censored ages at onset of disease). The likelihood functions involve both finite- and infinite-dimensional parameters. The maximum likelihood estimators are shown to be consistent, asymptotically normal, and asymptotically efficient. Expectation-Maximization (EM) algorithms are developed to implement the corresponding inference procedures. Extensive simulation studies demonstrate that the proposed inferential and numerical methods perform well in practical settings. Illustration with a genome-wide association study of lung cancer is provided.

SUBMITTER: Hu YJ 

PROVIDER: S-EPMC3294269 | biostudies-literature | 2010 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

A general framework for studying genetic effects and gene-environment interactions with missing data.

Hu Y J YJ   Lin D Y DY   Zeng D D  

Biostatistics (Oxford, England) 20100326 4


Missing data arise in genetic association studies when genotypes are unknown or when haplotypes are of direct interest. We provide a general likelihood-based framework for making inference on genetic effects and gene-environment interactions with such missing data. We allow genetic and environmental variables to be correlated while leaving the distribution of environmental variables completely unspecified. We consider 3 major study designs-cross-sectional, case-control, and cohort designs-and co  ...[more]

Similar Datasets

| S-EPMC4479853 | biostudies-literature
| S-EPMC9781692 | biostudies-literature
| S-EPMC7615108 | biostudies-literature
| S-EPMC10858057 | biostudies-literature
| S-EPMC10671243 | biostudies-literature
| S-EPMC4046647 | biostudies-literature
| S-EPMC2924567 | biostudies-literature
| S-EPMC8664495 | biostudies-literature
| S-EPMC4350262 | biostudies-literature
| S-EPMC10989912 | biostudies-literature