International Journal of Computer Vision

, Volume 100, Issue 1, pp 1–15

Sparse Modeling of Human Actions from Motion Imagery


DOI: 10.1007/s11263-012-0534-7

Cite this article as:
Castrodad, A. & Sapiro, G. Int J Comput Vis (2012) 100: 1. doi:10.1007/s11263-012-0534-7


An efficient sparse modeling pipeline for the classification of human actions from video is here developed. Spatio-temporal features that characterize local changes in the image are first extracted. This is followed by the learning of a class-structured dictionary encoding the individual actions of interest. Classification is then based on reconstruction, where the label assigned to each video comes from the optimal sparse linear combination of the learned basis vectors (action primitives) representing the actions. A low computational cost deep-layer model learning the inter-class correlations of the data is added for increasing discriminative power. In spite of its simplicity and low computational cost, the method outperforms previously reported results for virtually all standard datasets.


Action classificationSparse modelingDictionary learningSupervised learning

Copyright information

© Springer Science+Business Media, LLC (outside the USA) 2012

Authors and Affiliations

  1. 1.Department of Electrical and Computer EngineeringUniversity of MinnesotaMinneapolisUSA