Unknown

Dataset Information

0

Extreme sampling design in genetic association mapping of quantitative trait loci using balanced and unbalanced case-control samples.


ABSTRACT: It is extremely expensive to conduct large sample size array- or sequencing based genome scale association studies. For a quantitative trait, an extreme case-control study design may improve the power and reduce the cost of variant calling. We investigated the performance of extreme study design when various proportions of samples are selected from the tails of phenotype distribution. Using simulations, we show that when risk genotypes become rare in the population and effect size is relatively small, it is beneficial to carry out an extreme sampling study. In particular, the number of selected cases and controls can even be unbalanced such that power is further increased, compared with a balanced selection. Our application to two data sets: methadone dose data and yearling weight data, demonstrated that similar results for full data analysis can be obtained using extreme sampling with only a fraction of the data. Using power analysis with simulated data and an experimental data application, we conclude that when full data is unavailable due to restricted budget, it is rewarding to employ an extreme sampling design in the sense that there can be immense cost reductions and qualitatively similar power as in the full data analysis.

SUBMITTER: Li Y 

PROVIDER: S-EPMC6820758 | biostudies-literature | 2019 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Extreme sampling design in genetic association mapping of quantitative trait loci using balanced and unbalanced case-control samples.

Li Yi Y   Levran Orna O   Kim JongJoo J   Zhang Tiejun T   Chen Xingdong X   Suo Chen C  

Scientific reports 20191029 1


It is extremely expensive to conduct large sample size array- or sequencing based genome scale association studies. For a quantitative trait, an extreme case-control study design may improve the power and reduce the cost of variant calling. We investigated the performance of extreme study design when various proportions of samples are selected from the tails of phenotype distribution. Using simulations, we show that when risk genotypes become rare in the population and effect size is relatively  ...[more]

Similar Datasets

| S-EPMC7324978 | biostudies-literature
| S-EPMC1470895 | biostudies-other
| S-EPMC3716265 | biostudies-literature
| S-EPMC1893048 | biostudies-literature
| S-EPMC3430541 | biostudies-literature
| S-EPMC1821103 | biostudies-other
| S-EPMC3852377 | biostudies-literature
| S-EPMC2034641 | biostudies-literature
| S-EPMC5500132 | biostudies-literature
| S-EPMC3832267 | biostudies-literature