Dataset Information

Tournament leave-pair-out cross-validation for receiver operating characteristic analysis.


ABSTRACT: Receiver operating characteristic (ROC) analysis is widely used for evaluating diagnostic systems. Recent studies have shown that estimating the area under the ROC curve (AUC) with standard cross-validation methods suffers from a large bias. Leave-pair-out (LPO) cross-validation has been shown to correct this bias. However, while LPO produces an almost unbiased estimate of AUC, it does not provide the ranking of the data that is needed for plotting and analyzing the ROC curve. In this study, we propose a new method called tournament leave-pair-out (TLPO) cross-validation. This method extends LPO by creating a tournament from pair comparisons to produce a ranking of the data. TLPO preserves the advantage of LPO for estimating AUC, while also allowing ROC analyses to be performed. Using both synthetic and real-world data, we show that TLPO is as reliable as LPO for AUC estimation, and we confirm the bias of leave-one-out cross-validation on low-dimensional data. As a case study on ROC analysis, we also evaluate how reliably sensitivity and specificity can be estimated from TLPO ROC curves.
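
The tournament idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the toy mean-difference learner, and the choice to count a tied comparison as half a win for each side are all assumptions made for illustration. Every pair of data points is held out in turn, a model is trained on the remaining data, and the point with the higher predicted score wins the comparison. The number of wins per point then serves as a ranking from which a ROC curve or an area under it can be computed.

```python
from itertools import combinations


def fit_mean_difference(X, y):
    """Toy learner (an assumption, not from the paper).

    Scores a point by its dot product with (mean of positives - mean of negatives).
    """
    pos = [x for x, label in zip(X, y) if label == 1]
    neg = [x for x, label in zip(X, y) if label == 0]
    dim = len(X[0])
    w = [
        sum(x[d] for x in pos) / len(pos) - sum(x[d] for x in neg) / len(neg)
        for d in range(dim)
    ]
    return lambda x: sum(wd * xd for wd, xd in zip(w, x))


def tournament_scores(X, y, fit=fit_mean_difference):
    """Hold out every pair (i, j), train on the rest, and count wins per point.

    Needs at least three points per class so that each training set
    still contains both classes after a pair is removed.
    """
    n = len(X)
    wins = [0.0] * n
    for i, j in combinations(range(n), 2):
        train_X = [X[k] for k in range(n) if k not in (i, j)]
        train_y = [y[k] for k in range(n) if k not in (i, j)]
        predict = fit(train_X, train_y)
        score_i, score_j = predict(X[i]), predict(X[j])
        if score_i > score_j:
            wins[i] += 1
        elif score_j > score_i:
            wins[j] += 1
        else:  # tie: half a win each (assumed convention)
            wins[i] += 0.5
            wins[j] += 0.5
    return wins


def auc_from_scores(scores, y):
    """Fraction of positive-negative pairs ranked correctly (ties count 0.5)."""
    pos = [s for s, label in zip(scores, y) if label == 1]
    neg = [s for s, label in zip(scores, y) if label == 0]
    total = sum(
        1.0 if p > q else 0.5 if p == q else 0.0 for p in pos for q in neg
    )
    return total / (len(pos) * len(neg))


# Tiny, well-separated example data (made up for illustration).
X = [[0.0], [0.2], [0.4], [2.0], [2.2], [2.4]]
y = [0, 0, 0, 1, 1, 1]
wins = tournament_scores(X, y)
print(wins, auc_from_scores(wins, y))
```

Because the sketch retrains one model per pair, its cost grows quadratically with the number of data points, so it is only practical for small data sets or learners that are cheap to fit.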

SUBMITTER: Montoya Perez I 

PROVIDER: S-EPMC6745617 | biostudies-literature | 2019 Oct-Nov

REPOSITORIES: biostudies-literature

Publications

Tournament leave-pair-out cross-validation for receiver operating characteristic analysis.

Ileana Montoya Perez, Antti Airola, Peter J. Boström, Ivan Jambor, Tapio Pahikkala

Statistical Methods in Medical Research, 2018-08-20, issue 10-11


Similar Datasets

| S-EPMC2795956 | biostudies-literature
| S-EPMC3743052 | biostudies-literature
| S-EPMC8671363 | biostudies-literature
| S-EPMC5577377 | biostudies-literature
| S-EPMC6263661 | biostudies-literature
| S-EPMC3747897 | biostudies-literature
| S-EPMC6768691 | biostudies-literature
| S-EPMC2936710 | biostudies-literature
| S-EPMC6880940 | biostudies-literature
| S-EPMC3684818 | biostudies-literature