Unknown

Dataset Information

0

Biased binomial assessment of cross-validated estimation of classification accuracies illustrated in diagnosis predictions.


ABSTRACT: Multivariate classification is used in neuroimaging studies to infer brain activation or in medical applications to infer diagnosis. Their results are often assessed through either a binomial or a permutation test. Here, we simulated classification results of generated random data to assess the influence of the cross-validation scheme on the significance of results. Distributions built from classification of random data with cross-validation did not follow the binomial distribution. The binomial test is therefore not adapted. On the contrary, the permutation test was unaffected by the cross-validation scheme. The influence of the cross-validation was further illustrated on real-data from a brain-computer interface experiment in patients with disorders of consciousness and from an fMRI study on patients with Parkinson disease. Three out of 16 patients with disorders of consciousness had significant accuracy on binomial testing, but only one showed significant accuracy using permutation testing. In the fMRI experiment, the mental imagery of gait could discriminate significantly between idiopathic Parkinson's disease patients and healthy subjects according to the permutation test but not according to the binomial test. Hence, binomial testing could lead to biased estimation of significance and false positive or negative results. In our view, permutation testing is thus recommended for clinical application of classification with cross-validation.

SUBMITTER: Noirhomme Q 

PROVIDER: S-EPMC4053638 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

altmetric image

Publications

Biased binomial assessment of cross-validated estimation of classification accuracies illustrated in diagnosis predictions.

Noirhomme Quentin Q   Lesenfants Damien D   Gomez Francisco F   Soddu Andrea A   Schrouff Jessica J   Garraux Gaëtan G   Luxen André A   Phillips Christophe C   Laureys Steven S  

NeuroImage. Clinical 20140413


Multivariate classification is used in neuroimaging studies to infer brain activation or in medical applications to infer diagnosis. Their results are often assessed through either a binomial or a permutation test. Here, we simulated classification results of generated random data to assess the influence of the cross-validation scheme on the significance of results. Distributions built from classification of random data with cross-validation did not follow the binomial distribution. The binomial  ...[more]

Similar Datasets

| S-EPMC9314673 | biostudies-literature
| S-EPMC7850487 | biostudies-literature
| S-EPMC7185352 | biostudies-literature
| S-EPMC6136749 | biostudies-literature
| S-EPMC7401749 | biostudies-literature
| S-EPMC7758077 | biostudies-literature
| S-EPMC8058773 | biostudies-literature
| S-EPMC10218746 | biostudies-literature
| S-EPMC4889063 | biostudies-literature
| S-EPMC8727988 | biostudies-literature