Dataset Information

Combining controls can improve power in two-stage association studies.


ABSTRACT:

Background

High-dimensional case-control studies are ubiquitous in the biological sciences, particularly genomics. To maximise power while constraining cost and to minimise type-1 error rates, researchers typically seek to replicate findings in a second experiment on independent cohorts before proceeding with further analyses. This can be an expensive procedure, particularly when control samples are difficult to recruit or ascertain, for example in inter-disease comparisons or in studies of degenerative diseases.

Results

This paper presents a method in which control (or case) samples from the discovery cohort are re-used in a replication study. The theoretical implications of this method are discussed, and simulated genome-wide association study (GWAS) tests are used to compare its performance against the standard approach in a range of circumstances. Using similar methods, a procedure is proposed for 'partial replication' using a new independent cohort consisting of only controls. This method can be used to provide some validation of findings when a full replication procedure is not possible. The new method differs from the standard procedure in its sensitivity to confounding in study cohorts, which must be considered in its application. Type-1 error rates in these scenarios are derived analytically and empirically, and an online tool for comparing power and error rates is provided.
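The reason type-1 error rates need separate treatment when controls are re-used can be illustrated with a toy simulation. The sketch below is not the paper's procedure: it assumes a single variant under the null hypothesis, equal group sizes, a simple two-sample z-test on allele frequency, and a replication test that pools discovery and replication controls. All function names, sample sizes, and the allele frequency are hypothetical choices made for illustration. It estimates how strongly the discovery and replication statistics are correlated under each design.

```python
import math
import random


def allele_freq(n_people, p, rng):
    """Simulated allele frequency in a group of n_people diploid individuals."""
    n_alleles = 2 * n_people
    return sum(rng.random() < p for _ in range(n_alleles)) / n_alleles


def z_stat(f_case, n_case, f_ctrl, n_ctrl):
    """Two-sample z statistic for a difference in allele frequency."""
    a, b = 2 * n_case, 2 * n_ctrl  # allele counts per group
    pooled = (f_case * a + f_ctrl * b) / (a + b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / a + 1 / b))
    return (f_case - f_ctrl) / se if se > 0 else 0.0


def correlation(xs, ys):
    """Pearson correlation of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / math.sqrt(vx * vy)


def simulate_null(n_sims=2000, n=100, p=0.3, seed=1):
    """Correlation between discovery and replication z statistics under the null.

    Returns (r_standard, r_shared): the correlation when replication uses
    only new controls, and when it pools discovery and replication controls.
    """
    rng = random.Random(seed)
    z_disc, z_std, z_shared = [], [], []
    for _ in range(n_sims):
        case1 = allele_freq(n, p, rng)
        ctrl1 = allele_freq(n, p, rng)
        case2 = allele_freq(n, p, rng)
        ctrl2 = allele_freq(n, p, rng)
        z_disc.append(z_stat(case1, n, ctrl1, n))
        z_std.append(z_stat(case2, n, ctrl2, n))
        # Equal group sizes, so the pooled control frequency is the mean.
        pooled_ctrl = (ctrl1 + ctrl2) / 2
        z_shared.append(z_stat(case2, n, pooled_ctrl, 2 * n))
    return correlation(z_disc, z_std), correlation(z_disc, z_shared)


if __name__ == "__main__":
    r_std, r_shared = simulate_null()
    print(f"independent controls: r = {r_std:.3f}")
    print(f"pooled controls:      r = {r_shared:.3f}")
```

Under these assumptions the two stages are approximately uncorrelated with independent controls and positively correlated with pooled controls, so significance thresholds calibrated for independent stages would not directly carry over to a shared-control design.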

Conclusions

In several common study designs, a shared-control method allows a substantial improvement in power while retaining type-1 error rate control. Although all necessary assumptions must be considered carefully, this method can enable more efficient use of data in GWAS and other applications.

SUBMITTER: Liley J 

PROVIDER: S-EPMC6171163 | biostudies-literature | 2018 Oct

REPOSITORIES: biostudies-literature

Publications

Combining controls can improve power in two-stage association studies.

Liley James J  

BMC genetics 20181003 1


Similar Datasets

| S-EPMC4350584 | biostudies-literature
| S-EPMC2935880 | biostudies-literature
| S-EPMC6915826 | biostudies-literature
| S-EPMC6789773 | biostudies-literature
| S-EPMC3435377 | biostudies-literature
| S-EPMC8181458 | biostudies-literature
| S-EPMC5125008 | biostudies-literature
| S-EPMC6169386 | biostudies-other
| S-EPMC2868915 | biostudies-literature
| S-EPMC2795923 | biostudies-literature