Unknown

Dataset Information

0

A novel test for independence derived from an exact distribution of ith nearest neighbours.


ABSTRACT: Dependence measures and tests for independence have recently attracted a lot of attention, because they are the cornerstone of algorithms for network inference in probabilistic graphical models. Pearson's product moment correlation coefficient is still by far the most widely used statistic yet it is largely constrained to detecting linear relationships. In this work we provide an exact formula for the [Formula: see text]th nearest neighbor distance distribution of rank-transformed data. Based on that, we propose two novel tests for independence. An implementation of these tests, together with a general benchmark framework for independence testing, are freely available as a CRAN software package (http://cran.r-project.org/web/packages/knnIndep). In this paper we have benchmarked Pearson's correlation, Hoeffding's D, dcor, Kraskov's estimator for mutual information, maximal information criterion and our two tests. We conclude that no particular method is generally superior to all other methods. However, dcor and Hoeffding's D are the most powerful tests for many different types of dependence.

SUBMITTER: Dumcke S 

PROVIDER: S-EPMC4183502 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

altmetric image

Publications

A novel test for independence derived from an exact distribution of ith nearest neighbours.

Dümcke Sebastian S   Mansmann Ulrich U   Tresch Achim A  

PloS one 20141002 9


Dependence measures and tests for independence have recently attracted a lot of attention, because they are the cornerstone of algorithms for network inference in probabilistic graphical models. Pearson's product moment correlation coefficient is still by far the most widely used statistic yet it is largely constrained to detecting linear relationships. In this work we provide an exact formula for the [Formula: see text]th nearest neighbor distance distribution of rank-transformed data. Based on  ...[more]

Similar Datasets

| S-EPMC7610670 | biostudies-literature
| S-EPMC6028586 | biostudies-literature
| S-EPMC5630623 | biostudies-literature
| S-EPMC3582538 | biostudies-literature
| S-EPMC4904494 | biostudies-literature
| S-EPMC4506508 | biostudies-other
| S-EPMC6754223 | biostudies-literature
| S-EPMC9648765 | biostudies-literature
| S-EPMC3240574 | biostudies-literature
| S-EPMC5793489 | biostudies-literature