Unknown

Dataset Information

0

Probabilistic classifiers with high-dimensional data.


ABSTRACT: For medical classification problems, it is often desirable to have a probability associated with each class. Probabilistic classifiers have received relatively little attention for small n large p classification problems despite of their importance in medical decision making. In this paper, we introduce 2 criteria for assessment of probabilistic classifiers: well-calibratedness and refinement and develop corresponding evaluation measures. We evaluated several published high-dimensional probabilistic classifiers and developed 2 extensions of the Bayesian compound covariate classifier. Based on simulation studies and analysis of gene expression microarray data, we found that proper probabilistic classification is more difficult than deterministic classification. It is important to ensure that a probabilistic classifier is well calibrated or at least not "anticonservative" using the methods developed here. We provide this evaluation for several probabilistic classifiers and also evaluate their refinement as a function of sample size under weak and strong signal conditions. We also present a cross-validation method for evaluating the calibration and refinement of any probabilistic classifier on any data set.

SUBMITTER: Kim KI 

PROVIDER: S-EPMC3138069 | biostudies-literature | 2011 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Probabilistic classifiers with high-dimensional data.

Kim Kyung In KI   Simon Richard R  

Biostatistics (Oxford, England) 20101117 3


For medical classification problems, it is often desirable to have a probability associated with each class. Probabilistic classifiers have received relatively little attention for small n large p classification problems despite of their importance in medical decision making. In this paper, we introduce 2 criteria for assessment of probabilistic classifiers: well-calibratedness and refinement and develop corresponding evaluation measures. We evaluated several published high-dimensional probabili  ...[more]

Similar Datasets

| S-EPMC3687811 | biostudies-other
| S-EPMC7923594 | biostudies-literature
| S-EPMC3090739 | biostudies-literature
| S-EPMC7313548 | biostudies-literature
| S-EPMC5593641 | biostudies-literature
| S-EPMC8403970 | biostudies-literature
| S-EPMC9636303 | biostudies-literature
| S-EPMC10117939 | biostudies-literature
| S-EPMC4834947 | biostudies-literature
| S-EPMC2682540 | biostudies-literature