Unknown

Dataset Information

0

Semantic Pooling for Complex Event Analysis in Untrimmed Videos.


ABSTRACT: Pooling plays an important role in generating a discriminative video representation. In this paper, we propose a new semantic pooling approach for challenging event analysis tasks (e.g., event detection, recognition, and recounting) in long untrimmed Internet videos, especially when only a few shots/segments are relevant to the event of interest while many other shots are irrelevant or even misleading. The commonly adopted pooling strategies aggregate the shots indifferently in one way or another, resulting in a great loss of information. Instead, in this work we first define a novel notion of semantic saliency that assesses the relevance of each shot with the event of interest. We then prioritize the shots according to their saliency scores since shots that are semantically more salient are expected to contribute more to the final event analysis. Next, we propose a new isotonic regularizer that is able to exploit the constructed semantic ordering information. The resulting nearly-isotonic support vector machine classifier exhibits higher discriminative power in event analysis tasks. Computationally, we develop an efficient implementation using the proximal gradient algorithm, and we prove new and closed-form proximal steps. We conduct extensive experiments on three real-world video datasets and achieve promising improvements.

SUBMITTER: Xiaojun Chang 

PROVIDER: S-EPMC5570670 | biostudies-literature | 2017 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Semantic Pooling for Complex Event Analysis in Untrimmed Videos.

Xiaojun Chang   Yao-Liang Yu   Yi Yang   Xing Eric P EP  

IEEE transactions on pattern analysis and machine intelligence 20160913 8


Pooling plays an important role in generating a discriminative video representation. In this paper, we propose a new semantic pooling approach for challenging event analysis tasks (e.g., event detection, recognition, and recounting) in long untrimmed Internet videos, especially when only a few shots/segments are relevant to the event of interest while many other shots are irrelevant or even misleading. The commonly adopted pooling strategies aggregate the shots indifferently in one way or anothe  ...[more]

Similar Datasets

| S-EPMC7042074 | biostudies-literature
| S-EPMC2748775 | biostudies-other
| S-EPMC8588067 | biostudies-literature
| S-EPMC4762689 | biostudies-literature
| S-EPMC3698992 | biostudies-literature
| S-EPMC2665161 | biostudies-other
| S-EPMC7865181 | biostudies-literature
| S-EPMC4546414 | biostudies-literature
| S-EPMC5670351 | biostudies-literature
| S-EPMC3277650 | biostudies-literature