Dataset Information

Classification and visualization based on derived image features: application to genetic syndromes.


ABSTRACT: Data transformations prior to analysis may be beneficial in classification tasks. In this article we investigate a set of such transformations on 2D graph-data derived from facial images and their effect on classification accuracy in a high-dimensional setting. These transformations are low-variance in the sense that each involves only a fixed small number of input features. We show that classification accuracy can be improved when penalized regression techniques are employed, as compared to a principal component analysis (PCA) pre-processing step. In our data example classification accuracy improves from 47% to 62% when switching from PCA to penalized regression. A second goal is to visualize the resulting classifiers. We develop importance plots highlighting the influence of coordinates in the original 2D space. Features used for classification are mapped to coordinates in the original images and combined into an importance measure for each pixel. These plots assist in assessing plausibility of classifiers, interpretation of classifiers, and determination of the relative importance of different features.
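The importance plots mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the landmark coordinates, the use of pairwise distances as the derived features, the coefficient values, and the Gaussian spreading of importance onto pixels are all assumptions made for illustration. The sketch only shows the general idea of mapping feature weights back to 2D coordinates and combining them into a per-pixel score.

```python
import math

# Hypothetical 2D landmark coordinates (x, y); not taken from the paper.
landmarks = {"a": (1.0, 1.0), "b": (4.0, 1.0), "c": (2.5, 4.0)}


def pairwise_distance_features(points):
    """Derive one feature per landmark pair (assumed here: Euclidean distance).

    Each feature depends on a fixed, small number of inputs (two landmarks).
    """
    names = sorted(points)
    feats = {}
    for i, p in enumerate(names):
        for q in names[i + 1:]:
            feats[(p, q)] = math.dist(points[p], points[q])
    return feats


# Hypothetical coefficients, standing in for what a sparse penalized
# classifier might assign to each derived feature (zero = not selected).
coefs = {("a", "b"): 0.8, ("a", "c"): 0.0, ("b", "c"): -0.3}


def landmark_importance(coefficients):
    """Map feature weights back to landmarks by summing absolute coefficients."""
    importance = {}
    for pair, weight in coefficients.items():
        for name in pair:
            importance[name] = importance.get(name, 0.0) + abs(weight)
    return importance


def pixel_importance(points, importance, width, height, sigma=1.0):
    """Spread each landmark's importance over the pixel grid (assumed Gaussian)."""
    grid = [[0.0] * width for _ in range(height)]
    for name, weight in importance.items():
        px, py = points[name]
        for y in range(height):
            for x in range(width):
                d2 = (x - px) ** 2 + (y - py) ** 2
                grid[y][x] += weight * math.exp(-d2 / (2 * sigma ** 2))
    return grid


features = pairwise_distance_features(landmarks)
node_imp = landmark_importance(coefs)
heat = pixel_importance(landmarks, node_imp, width=6, height=6)
```

In this toy setup, `heat[y][x]` is largest near the landmarks that take part in features with large absolute coefficients, which is the kind of pattern an importance plot is meant to reveal.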

SUBMITTER: Balliu B 

PROVIDER: S-EPMC4236018 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

Publications

Classification and visualization based on derived image features: application to genetic syndromes.

Brunilda Balliu, Rolf P. Würtz, Bernhard Horsthemke, Dagmar Wieczorek, Stefan Böhringer

PLoS ONE, 2014-11-18, issue 11


Similar Datasets

S-EPMC4032673 | biostudies-other
S-EPMC3623732 | biostudies-literature
S-EPMC4823793 | biostudies-literature
S-EPMC8014705 | biostudies-literature
S-EPMC4795769 | biostudies-literature
S-EPMC4204487 | biostudies-literature
S-EPMC10020761 | biostudies-literature
S-EPMC9240637 | biostudies-literature
S-EPMC8921609 | biostudies-literature
S-EPMC3107973 | biostudies-literature