Unknown

Dataset Information

0

Projections as visual aids for classification system design.


ABSTRACT: Dimensionality reduction is a compelling alternative for high-dimensional data visualization. This method provides insight into high-dimensional feature spaces by mapping relationships between observations (high-dimensional vectors) to low (two or three) dimensional spaces. These low-dimensional representations support tasks such as outlier and group detection based on direct visualization. Supervised learning, a subfield of machine learning, is also concerned with observations. A key task in supervised learning consists of assigning class labels to observations based on generalization from previous experience. Effective development of such classification systems depends on many choices, including features descriptors, learning algorithms, and hyperparameters. These choices are not trivial, and there is no simple recipe to improve classification systems that perform poorly. In this context, we first propose the use of visual representations based on dimensionality reduction (projections) for predictive feedback on classification efficacy. Second, we propose a projection-based visual analytics methodology, and supportive tooling, that can be used to improve classification systems through feature selection. We evaluate our proposal through experiments involving four datasets and three representative learning algorithms.

SUBMITTER: Rauber PE 

PROVIDER: S-EPMC6131729 | biostudies-other | 2018 Oct

REPOSITORIES: biostudies-other

altmetric image

Publications

Projections as visual aids for classification system design.

Rauber Paulo E PE   Falcão Alexandre X AX   Telea Alexandru C AC  

Information visualization 20170627 4


Dimensionality reduction is a compelling alternative for high-dimensional data visualization. This method provides insight into high-dimensional feature spaces by mapping relationships between observations (high-dimensional vectors) to low (two or three) dimensional spaces. These low-dimensional representations support tasks such as outlier and group detection based on direct visualization. Supervised learning, a subfield of machine learning, is also concerned with observations. A key task in su  ...[more]

Similar Datasets

| S-EPMC5560353 | biostudies-other
2010-03-01 | GSE17189 | GEO
2010-03-07 | E-GEOD-17189 | biostudies-arrayexpress
| S-EPMC5921864 | biostudies-literature
2012-07-18 | GSE17372 | GEO
2012-07-17 | E-GEOD-17372 | biostudies-arrayexpress
| S-EPMC8476535 | biostudies-literature
| S-EPMC8169797 | biostudies-literature
| S-EPMC6585423 | biostudies-literature
| S-EPMC7606250 | biostudies-literature