Dataset Information

Comparing supervised and unsupervised approaches to multimodal emotion recognition.

ABSTRACT: We investigated emotion classification from brief video recordings from the GEMEP database wherein actors portrayed 18 emotions. Vocal features consisted of acoustic parameters related to frequency, intensity, spectral distribution, and durations. Facial features consisted of facial action units. We first performed a series of person-independent supervised classification experiments. Best performance (AUC = 0.88) was obtained by merging the output from the best unimodal vocal (Elastic Net, AUC = 0.82) and facial (Random Forest, AUC = 0.80) classifiers using a late fusion approach and the product rule method. All 18 emotions were recognized with above-chance recall, although recognition rates varied widely across emotions (e.g., high for amusement, anger, and disgust; and low for shame). Multimodal feature patterns for each emotion are described in terms of the vocal and facial features that contributed most to classifier performance. Next, a series of exploratory unsupervised classification experiments were performed to gain more insight into how emotion expressions are organized. Solutions from traditional clustering techniques were interpreted using decision trees in order to explore which features underlie clustering. Another approach utilized various dimensionality reduction techniques paired with inspection of data visualizations. Unsupervised methods did not cluster stimuli in terms of emotion categories, but several explanatory patterns were observed. Some could be interpreted in terms of valence and arousal, but actor and gender specific aspects also contributed to clustering. Identifying explanatory patterns holds great potential as a meta-heuristic when unsupervised methods are used in complex classification tasks.

SUBMITTER: Fernandez Carbonell M

PROVIDER: S-EPMC8725659 | biostudies-literature | 2021

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Comparing supervised and unsupervised approaches to multimodal emotion recognition.

Fernández Carbonell Marcos M Boman Magnus M Laukka Petri P

PeerJ. Computer science 20211224

We investigated emotion classification from brief video recordings from the GEMEP database wherein actors portrayed 18 emotions. Vocal features consisted of acoustic parameters related to frequency, intensity, spectral distribution, and durations. Facial features consisted of facial action units. We first performed a series of person-independent supervised classification experiments. Best performance (AUC = 0.88) was obtained by merging the output from the best unimodal vocal (Elastic Net, AUC = ...[more]

PMID: 35036530

Dataset Information

Comparing supervised and unsupervised approaches to multimodal emotion recognition.

Publications

Comparing supervised and unsupervised approaches to multimodal emotion recognition.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Comparing supervised and unsupervised approaches to emotion categorization in the human brain, body, and subjective experience.
| S-EPMC7679385 | biostudies-literature

DOES - A multimodal dataset for supervised and unsupervised analysis of steel scrap.
| S-EPMC10632433 | biostudies-literature

Automatic emotion recognition in healthcare data using supervised machine learning.
| S-EPMC8725656 | biostudies-literature

Training Emotion Recognition Accuracy: Results for Multimodal Expressions and Facial Micro Expressions.
| S-EPMC8406528 | biostudies-literature

Effects of aging on emotion recognition from dynamic multimodal expressions and vocalizations.
| S-EPMC7846600 | biostudies-literature

Comparison of Supervised and Unsupervised Approaches for the Generation of Synthetic CT from Cone-Beam CT.
| S-EPMC8395013 | biostudies-literature

Supervised, semi-supervised and unsupervised inference of gene regulatory networks.
| S-EPMC3956069 | biostudies-literature

K-EmoCon, a multimodal sensor dataset for continuous emotion recognition in naturalistic conversations.
| S-EPMC7479607 | biostudies-literature

Multimodal and Spectral Degradation Effects on Speech and Emotion Recognition in Adult Listeners.
| S-EPMC6236866 | biostudies-literature

Masking important information to assess the robustness of a multimodal classifier for emotion recognition
| S-EPMC10075078 | biostudies-literature