Unknown

Dataset Information

0

A flexible and nearly optimal sequential testing approach to randomized testing: QUICK-STOP.


ABSTRACT: In the analysis of current life science datasets, we often encounter scenarios in which the application of asymptotic theory to hypothesis testing can be problematic. Besides improved asymptotic results, permutation/simulation-based tests are a general approach to address this issue. However, these randomized tests can impose a massive computational burden, for example, in scenarios in which large numbers of statistical tests are computed, and the specified significance level is very small. Stopping rules aim to assess significance with the smallest possible number of draws while controlling the probabilities of errors due to statistical uncertainty. In this communication, we derive a general stopping rule, QUICK-STOP, based on the sequential testing theory that is easy to implement, controls the error probabilities rigorously, and is nearly optimal in terms of expected draws. In a simulation study, we show that our approach outperforms current stopping approaches for general randomized tests by factor 10 and does not impose an additional computational burden. We illustrate our approach by applying our stopping rule to a single-variant analysis of a whole-genome sequencing study for lung function.

SUBMITTER: Hecker J 

PROVIDER: S-EPMC7028451 | biostudies-literature | 2020 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

A flexible and nearly optimal sequential testing approach to randomized testing: QUICK-STOP.

Hecker Julian J   Ruczinski Ingo I   Cho Michael H MH   Silverman Edwin K EK   Coull Brent B   Lange Christoph C  

Genetic epidemiology 20191111 2


In the analysis of current life science datasets, we often encounter scenarios in which the application of asymptotic theory to hypothesis testing can be problematic. Besides improved asymptotic results, permutation/simulation-based tests are a general approach to address this issue. However, these randomized tests can impose a massive computational burden, for example, in scenarios in which large numbers of statistical tests are computed, and the specified significance level is very small. Stop  ...[more]

Similar Datasets

| S-EPMC8649474 | biostudies-literature
| S-EPMC7519231 | biostudies-literature
| S-EPMC5988456 | biostudies-literature
| S-EPMC1436025 | biostudies-literature
| S-EPMC3540257 | biostudies-literature
| S-EPMC2846951 | biostudies-literature
| S-EPMC4725820 | biostudies-literature
| S-EPMC3223366 | biostudies-literature
| S-EPMC3085015 | biostudies-literature