Unknown

Dataset Information

0

Probability-scale residuals for continuous, discrete, and censored data.


ABSTRACT: We describe a new residual for general regression models, defined as pr(Y* < y) - pr(Y* > y), where y is the observed outcome and Y* is a random variable from the fitted distribution. This probability-scale residual can be written as E {sign(y, Y*)} whereas the popular observed-minus-expected residual can be thought of as E(y - Y*). Therefore, the probability-scale residual is useful in settings where differences are not meaningful or where the expectation of the fitted distribution cannot be calculated. We present several desirable properties of the probability-scale residual that make it useful for diagnostics and measuring residual correlation, especially across different outcome types. We demonstrate its utility for continuous, ordered discrete, and censored outcomes, including current status data, and with various models including Cox regression, quantile regression, and ordinal cumulative probability models, for which fully specified distributions are not desirable or needed, and in some cases suitable residuals are not available. The residual is illustrated with simulated data and real datasets from HIV-infected patients on therapy in the southeastern United States and Latin America.

SUBMITTER: Shepherd BE 

PROVIDER: S-EPMC5364820 | biostudies-literature | 2016 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Probability-scale residuals for continuous, discrete, and censored data.

Shepherd Bryan E BE   Li Chun C   Liu Qi Q  

The Canadian journal of statistics = Revue canadienne de statistique 20160824 4


We describe a new residual for general regression models, defined as <i>pr</i>(<i>Y</i><sup>*</sup> < <i>y</i>) - <i>pr</i>(<i>Y</i><sup>*</sup> > <i>y</i>), where <i>y</i> is the observed outcome and <i>Y</i><sup>*</sup> is a random variable from the fitted distribution. This probability-scale residual can be written as <i>E</i> {sign(<i>y</i>, <i>Y</i><sup>*</sup>)} whereas the popular observed-minus-expected residual can be thought of as <i>E</i>(<i>y</i> - <i>Y</i><sup>*</sup>). Therefore, t  ...[more]

Similar Datasets

| S-EPMC5949238 | biostudies-literature
| S-EPMC8018589 | biostudies-literature
| S-EPMC3929353 | biostudies-literature
| S-EPMC5654714 | biostudies-literature
| S-EPMC4845150 | biostudies-literature
| S-EPMC5655034 | biostudies-literature
| S-EPMC6402504 | biostudies-literature
| S-EPMC2565512 | biostudies-literature
| S-EPMC3385950 | biostudies-literature
| S-EPMC4055564 | biostudies-literature