Unknown

Dataset Information

0

Exploring High-D Spaces with Multiform Matrices and Small Multiples.


ABSTRACT: We introduce an approach to visual analysis of multivariate data that integrates several methods from information visualization, exploratory data analysis (EDA), and geovisualization. The approach leverages the component-based architecture implemented in GeoVISTA Studio to construct a flexible, multiview, tightly (but generically) coordinated, EDA toolkit. This toolkit builds upon traditional ideas behind both small multiples and scatterplot matrices in three fundamental ways. First, we develop a general, MultiForm, Bivariate Matrix and a complementary MultiForm, Bivariate Small Multiple plot in which different bivariate representation forms can be used in combination. We demonstrate the flexibility of this approach with matrices and small multiples that depict multivariate data through combinations of: scatterplots, bivariate maps, and space-filling displays. Second, we apply a measure of conditional entropy to (a) identify variables from a high-dimensional data set that are likely to display interesting relationships and (b) generate a default order of these variables in the matrix or small multiple display. Third, we add conditioning, a kind of dynamic query/filtering in which supplementary (undisplayed) variables are used to constrain the view onto variables that are displayed. Conditioning allows the effects of one or more well understood variables to be removed from the analysis, making relationships among remaining variables easier to explore. We illustrate the individual and combined functionality enabled by this approach through application to analysis of cancer diagnosis and mortality data and their associated covariates and risk factors.

SUBMITTER: Maceachren A 

PROVIDER: S-EPMC3176663 | biostudies-literature | 2003

REPOSITORIES: biostudies-literature

altmetric image

Publications

Exploring High-D Spaces with Multiform Matrices and Small Multiples.

Maceachren Alan A   Dai Xiping X   Hardisty Frank F   Guo Diansheng D   Guo Diansheng D   Lengerich Gene G  

IEEE Conference on Information Visualization : an International Conference on Computer Visualization & Graphics, proceedings ... IEEE Conference on Information Visualization 20030101


We introduce an approach to visual analysis of multivariate data that integrates several methods from information visualization, exploratory data analysis (EDA), and geovisualization. The approach leverages the component-based architecture implemented in GeoVISTA Studio to construct a flexible, multiview, tightly (but generically) coordinated, EDA toolkit. This toolkit builds upon traditional ideas behind both small multiples and scatterplot matrices in three fundamental ways. First, we develop  ...[more]

Similar Datasets

| S-EPMC6038708 | biostudies-literature
| S-EPMC4909101 | biostudies-literature
| S-EPMC10029706 | biostudies-literature
| S-EPMC9199579 | biostudies-literature
| S-EPMC10055021 | biostudies-literature
| S-EPMC4463944 | biostudies-literature
| S-EPMC3243793 | biostudies-literature
| S-EPMC6946529 | biostudies-literature
| S-EPMC3712168 | biostudies-literature
| S-EPMC7805880 | biostudies-literature