Dataset Information

Independent filtering increases detection power for high-throughput experiments.


ABSTRACT: With high-dimensional data, variable-by-variable statistical testing is often used to select variables whose behavior differs across conditions. Such an approach requires adjustment for multiple testing, which can result in low statistical power. A two-stage approach that first filters variables by a criterion independent of the test statistic, and then tests only the variables that pass the filter, can provide higher power. We show that use of some filter/test statistic pairs presented in the literature may, however, lead to loss of type I error control. We describe other pairs which avoid this problem. In an application to microarray data, we found that gene-by-gene filtering by overall variance followed by a t-test increased the number of discoveries by 50%. We also show that this particular statistic pair induces a lower bound on fold-change among the set of discoveries. Independent filtering (using filter/test pairs that are independent under the null hypothesis but correlated under the alternative) is a general approach that can substantially increase the efficiency of experiments.
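The two-stage approach described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function names `benjamini_hochberg` and `filter_then_test`, the parameter `theta` (fraction of variables removed by the filter), and all numbers are assumptions made for this example. The abstract does not name a specific multiple testing adjustment, so the Benjamini-Hochberg procedure is assumed here. The p-values are taken as given, for instance from a per-gene t-test, and the filter statistics stand in for per-gene overall variance.

```python
def benjamini_hochberg(pvalues, alpha=0.05):
    """Return sorted indices rejected by the Benjamini-Hochberg step-up procedure."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    k_max = 0
    # Find the largest rank k whose ordered p-value is <= k * alpha / m.
    for rank, i in enumerate(order, start=1):
        if pvalues[i] <= rank * alpha / m:
            k_max = rank
    return sorted(order[:k_max])


def filter_then_test(filter_stats, pvalues, theta=0.5, alpha=0.05):
    """Stage 1: drop the fraction `theta` of variables with the lowest filter
    statistic. Stage 2: apply the adjustment only to the variables kept.

    Assumption: the filter statistic is independent of the test statistic
    under the null hypothesis; otherwise error control may be lost.
    """
    m = len(pvalues)
    n_drop = int(theta * m)
    by_filter = sorted(range(m), key=lambda i: filter_stats[i])
    kept = sorted(by_filter[n_drop:])
    rejected_local = benjamini_hochberg([pvalues[i] for i in kept], alpha)
    # Map positions within the kept subset back to original indices.
    return [kept[j] for j in rejected_local]


# Made-up illustrative inputs: ten variables, the first five with high variance.
variances = [4.0, 3.5, 3.0, 2.5, 2.0, 0.5, 0.4, 0.3, 0.2, 0.1]
pvals = [0.001, 0.012, 0.02, 0.03, 0.5, 0.6, 0.7, 0.8, 0.9, 0.95]

unfiltered = benjamini_hochberg(pvals, alpha=0.05)
filtered = filter_then_test(variances, pvals, theta=0.5, alpha=0.05)
print(len(unfiltered), len(filtered))
```

In this toy input, filtering halves the number of hypotheses entering the adjustment, so the per-rank thresholds are twice as large and more of the small p-values pass. The gain depends entirely on the filter removing mostly true nulls while remaining independent of the test statistic under the null.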

SUBMITTER: Bourgon R 

PROVIDER: S-EPMC2906865 | biostudies-other | 2010 May

REPOSITORIES: biostudies-other

Publications

Independent filtering increases detection power for high-throughput experiments.

Richard Bourgon, Robert Gentleman, Wolfgang Huber

Proceedings of the National Academy of Sciences of the United States of America, 2010-05-11, issue 21


Similar Datasets

S-EPMC3740625 | biostudies-literature
S-EPMC3476334 | biostudies-other
S-EPMC4357712 | biostudies-literature
S-EPMC3125730 | biostudies-literature
S-EPMC2949886 | biostudies-other
S-EPMC2813853 | biostudies-literature
S-EPMC8240032 | biostudies-literature
S-EPMC1127019 | biostudies-literature
S-EPMC4770208 | biostudies-literature
S-EPMC5553769 | biostudies-other