International Journal of Computer Vision

, Volume 100, Issue 1, pp 1-15

Sparse Modeling of Human Actions from Motion Imagery

  • Alexey CastrodadAffiliated withDepartment of Electrical and Computer Engineering, University of Minnesota Email author 
  • , Guillermo SapiroAffiliated withDepartment of Electrical and Computer Engineering, University of Minnesota

Rent the article at a discount

Rent now

* Final gross prices may vary according to local VAT.

Get Access


An efficient sparse modeling pipeline for the classification of human actions from video is here developed. Spatio-temporal features that characterize local changes in the image are first extracted. This is followed by the learning of a class-structured dictionary encoding the individual actions of interest. Classification is then based on reconstruction, where the label assigned to each video comes from the optimal sparse linear combination of the learned basis vectors (action primitives) representing the actions. A low computational cost deep-layer model learning the inter-class correlations of the data is added for increasing discriminative power. In spite of its simplicity and low computational cost, the method outperforms previously reported results for virtually all standard datasets.


Action classification Sparse modeling Dictionary learning Supervised learning