Unknown

Dataset Information

0

Visual Tracking Using Sparse Coding and Earth Mover's Distance.


ABSTRACT: An efficient iterative Earth Mover's Distance (iEMD) algorithm for visual tracking is proposed in this paper. The Earth Mover's Distance (EMD) is used as the similarity measure to search for the optimal template candidates in feature-spatial space in a video sequence. The local sparse representation is used as the appearance model for the iEMD tracker. The maximum-alignment-pooling method is used for constructing a sparse coding histogram which reduces the computational complexity of the EMD optimization. The template update algorithm based on the EMD is also presented. When the camera is mounted on a moving robot, e.g., a flying quadcopter, the camera could experience a sudden and rapid motion leading to large inter-frame movements. To ensure that the tracking algorithm converges, a gyro-aided extension of the iEMD tracker is presented, where synchronized gyroscope information is utilized to compensate for the rotation of the camera. The iEMD algorithm's performance is evaluated using eight publicly available videos from the CVPR 2013 dataset. The performance of the iEMD algorithm is compared with eight state-of-the-art tracking algorithms based on relative percentage overlap. Experimental results show that the iEMD algorithm performs robustly in the presence of illumination variation and deformation. The robustness of this algorithm for large inter-frame displacements is also illustrated.

SUBMITTER: Yao G 

PROVIDER: S-EPMC7805675 | biostudies-literature | 2018

REPOSITORIES: biostudies-literature

altmetric image

Publications

Visual Tracking Using Sparse Coding and Earth Mover's Distance.

Yao Gang G   Dani Ashwin A  

Frontiers in robotics and AI 20180822


An efficient iterative Earth Mover's Distance (iEMD) algorithm for visual tracking is proposed in this paper. The Earth Mover's Distance (EMD) is used as the similarity measure to search for the optimal template candidates in feature-spatial space in a video sequence. The local sparse representation is used as the appearance model for the iEMD tracker. The maximum-alignment-pooling method is used for constructing a sparse coding histogram which reduces the computational complexity of the EMD opt  ...[more]

Similar Datasets

| S-EPMC2286557 | biostudies-literature
| S-EPMC3649976 | biostudies-literature
| S-EPMC3757072 | biostudies-literature
| S-EPMC7864399 | biostudies-literature
| S-EPMC10292743 | biostudies-literature
| S-EPMC9504661 | biostudies-literature
| S-EPMC4413851 | biostudies-other
| S-EPMC7683856 | biostudies-literature
| S-EPMC5345030 | biostudies-literature
| S-EPMC7099633 | biostudies-literature