Unknown

Dataset Information

0

An evaluation of common methods for dichotomization of continuous variables to discriminate disease status.


ABSTRACT: Dichotomization of continuous variables to discriminate a dichotomous outcome is often useful in statistical applications. If a true threshold for a continuous variable exists, the challenge is identifying it. This paper examines common methods for dichotomization to identify which ones recover a true threshold. We provide mathematical and numeric proofs demonstrating that maximizing the odds ratio, Youden's statistic, Gini Index, chi-square statistic, relative risk and kappa statistic all theoretically recover a true threshold. A simulation study evaluating the ability of these statistics to recover a threshold when sampling from a population indicates that maximizing the chi-square statistic and Gini Index have the smallest bias and variability when the probability of being larger than the threshold is small while maximizing Kappa or Youden's statistics is best when this probability is larger. Maximizing odds ratio is the most variable and biased of the methods.

SUBMITTER: Prince Nelson SL 

PROVIDER: S-EPMC6020169 | biostudies-literature | 2017

REPOSITORIES: biostudies-literature

altmetric image

Publications

An evaluation of common methods for dichotomization of continuous variables to discriminate disease status.

Prince Nelson Sybil L SL   Ramakrishnan Viswanathan V   Nietert Paul J PJ   Kamen Diane L DL   Ramos Paula S PS   Wolf Bethany J BJ  

Communications in statistics: theory and methods 20170802 21


Dichotomization of continuous variables to discriminate a dichotomous outcome is often useful in statistical applications. If a true threshold for a continuous variable exists, the challenge is identifying it. This paper examines common methods for dichotomization to identify which ones recover a true threshold. We provide mathematical and numeric proofs demonstrating that maximizing the odds ratio, Youden's statistic, Gini Index, chi-square statistic, relative risk and kappa statistic all theor  ...[more]

Similar Datasets

| S-EPMC10198136 | biostudies-literature
| S-EPMC8516097 | biostudies-literature
2018-04-05 | PXD007683 | Pride
2023-07-26 | GSE231693 | GEO
2019-02-15 | MSV000083453 | MassIVE
| S-EPMC5984592 | biostudies-literature
| S-EPMC5675816 | biostudies-literature
| S-EPMC2367531 | biostudies-literature
| S-EPMC3360667 | biostudies-literature
| S-EPMC5508357 | biostudies-literature