Dataset Information

Limitations of empirical calibration of p-values using observational data.

ABSTRACT: Controversy over non-reproducible published research reporting a statistically significant result has produced substantial discussion in the literature. p-value calibration is a recently proposed procedure for adjusting p-values to account for both random and systematic errors that address one aspect of this problem. The method's validity rests on the key assumption that bias in an effect estimate is drawn from a normal distribution whose mean and variance can be correctly estimated. We investigated the method's control of type I and type II error rates using simulated and real-world data. Under mild violations of underlying assumptions, control of the type I error rate can be conservative, while under more extreme departures, it can be anti-conservative. The extent to which the assumption is violated in real-world data analyses is unknown. Barriers to testing the plausibility of the assumption using historical data are discussed. Our studies of the type II error rate using simulated and real-world electronic health care data demonstrated that calibrating p-values can substantially increase the type II error rate. The use of calibrated p-values may reduce the number of false-positive results, but there will be a commensurate drop in the ability to detect a true safety or efficacy signal. While p-value calibration can sometimes offer advantages in controlling the type I error rate, its adoption for routine use in studies of real-world health care datasets is premature. Separate characterizations of random and systematic errors provide a richer context for evaluating uncertainty surrounding effect estimates. Copyright © 2016 John Wiley & Sons, Ltd.

SUBMITTER: Gruber S

PROVIDER: S-EPMC5012943 | biostudies-literature | 2016 Sep

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Limitations of empirical calibration of p-values using observational data.

Gruber Susan S Tchetgen Tchetgen Eric E

Statistics in medicine 20160310 22

Controversy over non-reproducible published research reporting a statistically significant result has produced substantial discussion in the literature. p-value calibration is a recently proposed procedure for adjusting p-values to account for both random and systematic errors that address one aspect of this problem. The method's validity rests on the key assumption that bias in an effect estimate is drawn from a normal distribution whose mean and variance can be correctly estimated. We investig ...[more]

PMID: 26970249

Dataset Information

Limitations of empirical calibration of p-values using observational data.

Publications

Limitations of empirical calibration of p-values using observational data.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Improving reproducibility by using high-throughput observational studies with empirical calibration.
| S-EPMC6107542 | biostudies-literature

A calibration approach to transportability and data-fusion with observational data.
| S-EPMC10201931 | biostudies-literature

Development of Fluoride Protective Values for Aquatic Life Using Empirical Bioavailability Models.
| S-EPMC9303462 | biostudies-literature

Automated selection of changepoints using empirical <i>P</i>-values and trimming.
| S-EPMC9617685 | biostudies-literature

Species Delimitation Using Genomic Data: Options and Limitations.
| S-EPMC11974488 | biostudies-literature

Empirical assessment of alternative methods for identifying seasonality in observational healthcare data.
| S-EPMC9250712 | biostudies-literature

Assessing the effectiveness of empirical calibration under different bias scenarios.
| S-EPMC9327283 | biostudies-literature

Empirical calibration of a simulation model of opioid use disorder.
| S-EPMC11949371 | biostudies-literature

Using observational study data as an external control group for a clinical trial: an empirical comparison of methods to account for longitudinal missing data.
| S-EPMC9148529 | biostudies-literature

Review of the limitations and potential empirical improvements of the parametric group method of data handling for rainfall modelling.
| S-EPMC10533576 | biostudies-literature