Unknown

Dataset Information

0

The projack: a resampling approach to correct for ranking bias in high-throughput studies.


ABSTRACT: The problem of ranked inference arises in a number of settings, for which the investigator wishes to perform parameter inference after ordering a set of [Formula: see text] statistics. In contrast to inference for a single hypothesis, the ranking procedure introduces considerable bias, a problem known as the "winner's curse" in genetic association. We introduce the projack (for Prediction by Re- Ordered Jackknife and Cross-Validation, [Formula: see text]-fold). The projack is a resampling-based procedure that provides low-bias estimates of the expected ranked effect size parameter for a set of possibly correlated [Formula: see text] statistics. The approach is flexible, and has wide applicability to high-dimensional datasets, including those arising from genomics platforms. Initially, motivated for the setting where original data are available for resampling, the projack can be extended to the situation where only the vector of [Formula: see text] values is available. We illustrate the projack for correction of the winner's curse in genetic association, although it can be used much more generally.

SUBMITTER: Zhou YH 

PROVIDER: S-EPMC4679068 | biostudies-literature | 2016 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

The projack: a resampling approach to correct for ranking bias in high-throughput studies.

Zhou Yi-Hui YH   Wright Fred A FA  

Biostatistics (Oxford, England) 20150603 1


The problem of ranked inference arises in a number of settings, for which the investigator wishes to perform parameter inference after ordering a set of [Formula: see text] statistics. In contrast to inference for a single hypothesis, the ranking procedure introduces considerable bias, a problem known as the "winner's curse" in genetic association. We introduce the projack (for Prediction by Re- Ordered Jackknife and Cross-Validation, [Formula: see text]-fold). The projack is a resampling-based  ...[more]

Similar Datasets

| S-EPMC8088017 | biostudies-literature
2022-10-14 | PXD033711 | Pride
| S-EPMC2670840 | biostudies-literature
| S-EPMC6443505 | biostudies-literature
| S-EPMC4770208 | biostudies-literature
| S-EPMC3896549 | biostudies-literature
| S-EPMC3166835 | biostudies-literature
| S-EPMC3378858 | biostudies-other
| S-EPMC5242475 | biostudies-literature
| S-EPMC2863005 | biostudies-literature