Dataset Information

A novel scale-space approach for multinormality testing and the k-sample problem in the high dimension low sample size scenario.


ABSTRACT: Two classical multivariate statistical problems, testing of multivariate normality and the k-sample problem, are explored by a novel analysis on several resolutions simultaneously. The presented methods do not invert any estimated covariance matrix. Thereby, the methods work in the High Dimension Low Sample Size situation, i.e. when n ≤ p. The output, a significance map, is produced by doing a one-dimensional test for all possible resolution/position pairs. The significance map shows for which resolution/position pairs the null hypothesis is rejected. For the testing of multinormality, the Anderson-Darling test is utilized to detect potential departures from multinormality at different combinations of resolutions and positions. In the k-sample case, it is tested whether k data sets can be said to originate from the same unspecified discrete or continuous multivariate distribution. This is done by testing the k vectors corresponding to the same resolution/position pair of the k different data sets through the k-sample Anderson-Darling test. Successful demonstrations of the new methodology on artificial and real data sets are presented, and a feature selection scheme is demonstrated.
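
The idea of running a one-dimensional test at every resolution/position pair can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that a "resolution" is the width of a simple moving-average window and a "position" is the window's starting coordinate; the abstract does not specify the smoothing used. The function names `anderson_darling_normal` and `significance_map` are hypothetical. The threshold 0.752 is the commonly tabulated 5% critical value for the adjusted Anderson-Darling statistic when mean and variance are estimated, and no multiple-testing correction is applied here.

```python
import math
import random
from statistics import NormalDist, mean, stdev


def anderson_darling_normal(xs):
    """Adjusted Anderson-Darling statistic for normality of a 1-D sample,
    with mean and variance estimated from the sample itself."""
    n = len(xs)
    m, s = mean(xs), stdev(xs)
    std = NormalDist()
    # Clamp CDF values away from 0 and 1 so the logarithms stay finite.
    u = sorted(min(max(std.cdf((x - m) / s), 1e-12), 1 - 1e-12) for x in xs)
    total = sum(
        (2 * i + 1) * (math.log(u[i]) + math.log(1 - u[n - 1 - i]))
        for i in range(n)
    )
    a2 = -n - total / n
    # Small-sample adjustment for the estimated-parameters case.
    return a2 * (1 + 0.75 / n + 2.25 / n ** 2)


def significance_map(data, widths, crit=0.752):
    """data: n observations, each a list of p coordinates.
    Returns {(width, start): True if normality is rejected for that cell}.
    Assumption: each cell is the mean of a contiguous window of coordinates,
    so no covariance matrix is estimated or inverted."""
    p = len(data[0])
    cells = {}
    for w in widths:
        for start in range(p - w + 1):
            # One scalar per observation: the window average.
            proj = [sum(row[start:start + w]) / w for row in data]
            cells[(w, start)] = anderson_darling_normal(proj) > crit
    return cells


# Toy data with n = 20 observations and p = 50 coordinates (n <= p).
random.seed(0)
data = [[random.gauss(0, 1) for _ in range(50)] for _ in range(20)]
for row in data:
    for j in range(10, 15):
        row[j] = random.expovariate(1.0)  # skewed, non-normal coordinates

cells = significance_map(data, widths=[1, 2, 4, 8])
print(sum(cells.values()), "of", len(cells), "cells reject normality")
```

Because each cell reduces the n observations to n scalars before testing, the sketch runs unchanged when p exceeds n. A k-sample variant would apply a k-sample Anderson-Darling test to the k projected vectors of each cell instead.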

SUBMITTER: Hindberg K 

PROVIDER: S-EPMC6342313 | biostudies-literature | 2019

REPOSITORIES: biostudies-literature

Publications

A novel scale-space approach for multinormality testing and the k-sample problem in the high dimension low sample size scenario.

Kristian Hindberg, Jan Hannig, Fred Godtliebsen

PLoS One, 2019-01-22, issue 1


Similar Datasets

| S-EPMC5173295 | biostudies-literature
| S-EPMC3140372 | biostudies-literature
2021-03-26 | PXD022280 | Pride
| S-EPMC4442753 | biostudies-literature
| S-EPMC9295049 | biostudies-literature
| S-EPMC8214147 | biostudies-literature
| S-EPMC5378610 | biostudies-literature
| S-EPMC4749572 | biostudies-literature
| S-EPMC7514953 | biostudies-literature
| S-EPMC4468148 | biostudies-literature