Unknown

Dataset Information

0

The Joint Null Criterion for Multiple Hypothesis Tests


ABSTRACT: Simultaneously performing many hypothesis tests is a problem commonly encountered in high-dimensional biology. In this setting, a large set of p-values is calculated from many related features measured simultaneously. Classical statistics provides a criterion for defining what a “correct” p-value is when performing a single hypothesis test. We show here that even when each p-value is marginally correct under this single hypothesis criterion, it may be the case that the joint behavior of the entire set of p-values is problematic. On the other hand, there are cases where each p-value is marginally incorrect, yet the joint distribution of the set of p-values is satisfactory. Here, we propose a criterion defining a well behaved set of simultaneously calculated p-values that provides precise control of common error rates and we introduce diagnostic procedures for assessing whether the criterion is satisfied with simulations. Multiple testing p-values that satisfy our new criterion avoid potentially large study specific errors, but also satisfy the usual assumptions for strong control of false discovery rates and family-wise error rates. We utilize the new criterion and proposed diagnostics to investigate two common issues in high-dimensional multiple testing for genomics: dependent multiple hypothesis tests and pooled versus test-specific null distributions.

SUBMITTER: Leek J 

PROVIDER: S-EPMC3135422 | biostudies-literature | 2011 Jan

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC3289673 | biostudies-literature
| S-EPMC2912702 | biostudies-literature
| S-EPMC6676337 | biostudies-literature
| S-EPMC8122026 | biostudies-literature
| S-EPMC9035066 | biostudies-literature
| S-EPMC7539472 | biostudies-literature
| S-EPMC9561355 | biostudies-literature
| S-EPMC3621801 | biostudies-literature
| S-EPMC5991793 | biostudies-literature
| S-EPMC5540883 | biostudies-literature