Methods based on feature descriptors extracted around local interest points are now widely used in action recognition. Interest points are detected using a variety of measures, such as saliency, periodicity, and motion activity. Each of these measures is usually intensity-based and trades off density against informativeness. In this paper, we address the problem of action recognition by representing image sequences as sparse collections of patch-level space-time events that are salient in both the spatial and temporal domains. Our method uses a multi-scale volumetric representation of video and adaptively selects the space-time scale at which the saliency of a patch is most significant. The input image sequences are first partitioned into non-overlapping patches. Each patch is then represented by a vector of coefficients that linearly reconstruct it from a learned dictionary of basis patches. The space-time saliency of a patch is measured by its Shannon self-information, so that a patch's saliency is determined by the information variation in the contents of its spatiotemporal neighborhood. Experimental results on three benchmark datasets demonstrate the effectiveness of the proposed method.
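The pipeline sketched above can be illustrated with a minimal numpy example. The abstract does not specify the sparse-coding solver, the dictionary-learning procedure, or the density estimator, so the following stands in orthogonal matching pursuit (OMP) for the coding step, a random normalized dictionary for the learned one, and an unnormalized Gaussian kernel density over neighboring patches' codes for the probability in the self-information measure; all of these are illustrative assumptions, not the paper's method.

```python
import numpy as np

def omp(D, x, n_nonzero=5):
    """Sparse-code signal x over dictionary D (d x K) via orthogonal
    matching pursuit: greedily pick atoms, re-fit by least squares."""
    residual = x.copy()
    idx = []
    coef = np.zeros(D.shape[1])
    for _ in range(n_nonzero):
        j = int(np.argmax(np.abs(D.T @ residual)))
        if j not in idx:
            idx.append(j)
        sol, *_ = np.linalg.lstsq(D[:, idx], x, rcond=None)
        residual = x - D[:, idx] @ sol
    coef[idx] = sol
    return coef

def self_information_saliency(code, neighbor_codes, bandwidth=1.0):
    """Saliency as Shannon self-information -log p(code), where p is
    estimated from the codes of the patch's spatiotemporal neighbors
    using a Gaussian kernel (a placeholder estimator)."""
    diffs = neighbor_codes - code                    # (N, K)
    sq = np.sum(diffs ** 2, axis=1) / (2 * bandwidth ** 2)
    p = np.mean(np.exp(-sq)) + 1e-12                 # avoid log(0)
    return -np.log(p)

rng = np.random.default_rng(0)
d, K = 64, 128                  # flattened patch dim, dictionary size
D = rng.standard_normal((d, K))
D /= np.linalg.norm(D, axis=0)  # random stand-in for a learned dictionary

patch = rng.standard_normal(d)          # one flattened space-time patch
code = omp(D, patch)
neighbors = np.stack([omp(D, rng.standard_normal(d)) for _ in range(20)])
saliency = self_information_saliency(code, neighbors)
```

A patch whose code is far from those of its spatiotemporal neighbors has low estimated probability and hence high self-information, which is the intuition behind treating information variation in the neighborhood as saliency.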