Unknown

Dataset Information

0

Learning Facial Action Units from Web Images with Scalable Weakly Supervised Clustering.


ABSTRACT: We present a scalable weakly supervised clustering approach to learn facial action units (AUs) from large, freely available web images. Unlike most existing methods (e.g., CNNs) that rely on fully annotated data, our method exploits web images with inaccurate annotations. Specifically, we derive a weakly-supervised spectral algorithm that learns an embedding space to couple image appearance and semantics. The algorithm has efficient gradient update, and scales up to large quantities of images with a stochastic extension. With the learned embedding space, we adopt rank-order clustering to identify groups of visually and semantically similar images, and re-annotate these groups for training AU classifiers. Evaluation on the 1 millon EmotioNet dataset demonstrates the effectiveness of our approach: (1) our learned annotations reach on average 91.3% agreement with human annotations on 7 common AUs, (2) classifiers trained with re-annotated images perform comparably to, sometimes even better than, its supervised CNN-based counterpart, and (3) our method offers intuitive outlier/noise pruning instead of forcing one annotation to every image. Code is available.

SUBMITTER: Zhao K 

PROVIDER: S-EPMC6594709 | biostudies-literature | 2018 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Learning Facial Action Units from Web Images with Scalable Weakly Supervised Clustering.

Zhao Kaili K   Chu Wen-Sheng WS   Martinez Aleix M AM  

Proceedings. IEEE Computer Society Conference on Computer Vision and Pattern Recognition 20180601


We present a scalable weakly supervised clustering approach to learn facial action units (AUs) from large, freely available web images. Unlike most existing methods (e.g., CNNs) that rely on fully annotated data, our method exploits web images with inaccurate annotations. Specifically, we derive a weakly-supervised spectral algorithm that learns an embedding space to couple image appearance and semantics. The algorithm has efficient gradient update, and scales up to large quantities of images wi  ...[more]

Similar Datasets

| S-EPMC7418463 | biostudies-literature
| S-EPMC8711640 | biostudies-literature
2023-11-01 | GSE244807 | GEO
| S-EPMC9307817 | biostudies-literature
| S-EPMC7248915 | biostudies-literature
| S-EPMC8506936 | biostudies-literature
| S-EPMC8692405 | biostudies-literature
| S-EPMC8336446 | biostudies-literature
| S-EPMC7415644 | biostudies-literature