Similarity Constrained Latent Support Vector Machine: An Application to Weakly Supervised Action Classification

Shapovalova, Nataliya; Vahdat, Arash; Cannons, Kevin; Lan, Tian; Mori, Greg

doi:10.1007/978-3-642-33786-4_5

Nataliya Shapovalova²¹,
Arash Vahdat²¹,
Kevin Cannons²¹,
Tian Lan²¹ &
…
Greg Mori²¹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7578))

Included in the following conference series:

European Conference on Computer Vision

6842 Accesses
24 Citations

Abstract

We present a novel algorithm for weakly supervised action classification in videos. We assume we are given training videos annotated only with action class labels. We learn a model that can classify unseen test videos, as well as localize a region of interest in the video that captures the discriminative essence of the action class. A novel Similarity Constrained Latent Support Vector Machine model is developed to operationalize this goal. This model specifies that videos should be classified correctly, and that the latent regions of interest chosen should be coherent over videos of an action class. The resulting learning problem is challenging, and we show how dual decomposition can be employed to render it tractable. Experimental results demonstrate the efficacy of the method.

Download to read the full chapter text

Chapter PDF

Multi-label Discriminative Weakly-Supervised Human Activity Recognition and Localization

Discriminative Joint Non-negative Matrix Factorization for Human Action Classification

Max-margin adaptive model for complex video pattern recognition

Article 10 May 2014

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Felzenszwalb, P., Girshick, R., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part based models. IEEE Trans. PAMI 32, 1627–1645 (2010)
Article Google Scholar
Lan, T., Wang, Y., Mori, G.: Discriminative figure-centric models for joint action localization and recognition. In: ICCV (2011)
Google Scholar
Yao, B., Fei-Fei, L.: Modeling mutual context of object and human pose in human-object interaction activities. In: CVPR (2010)
Google Scholar
Bilen, H., Namboodiri, V., Gool, L.V.: Object and action classification with latent variables. In: Proceedings of the British Machine Vision Conference (2011)
Google Scholar
Aggarwal, J., Ryoo, M.: Human activity analysis: A review. ACM Comput. Surv. 43, 16:1–16:43 (2011)
Google Scholar
Alexe, B., Deselaers, T., Ferrari, V.: What is an object? In: CVPR (2010)
Google Scholar
Vezhnevets, A., Ferrari, V., Buhmann, J.M.: Weakly supervised semantic segmentation with a multi-image model. In: ICCV (2011)
Google Scholar
Joachims, T.: Transductive inference for text classification using support vector machines. In: International Conference on Machine Learning, ICML (1999)
Google Scholar
Sontag, D., Globerson, A., Jaakkola, T.: Introduction to dual decomposition for inference. In: Sra, S., Nowozin, S., Wright, S.J. (eds.) Optimization for Machine Learning. MIT Press (2011)
Google Scholar
Komodakis, N.: Efficient training for pairwise or higher order crfs via dual decomposition. In: CVPR, pp. 1841–1848 (2011)
Google Scholar
Fergus, R., Perona, P., Zisserman, A.: Weakly supervised scale-invariant learning of models for visual recognition. International Journal of Computer Vision 71, 273–303 (2007)
Google Scholar
Viola, P.A., Platt, J.C., Zhang, C.: Multiple instance boosting for object detection. In: NIPS (2005)
Google Scholar
Yang, W., Wang, Y., Mori, G.: Recognizing human actions from still images with latent poses. In: CVPR (2010)
Google Scholar
Laptev, I., Marszalek, M., Schmid, C., Rozenfeld, B.: Learning realistic human actions from movies. In: CVPR (2008)
Google Scholar
Sorokin, A., Forsyth, D.: Utility data annotation with amazon mechanical turk. In: First IEEE Workshop on Internet Vision (at CVPR). (2008)
Google Scholar
Wang, Y., Mori, G.: Hidden part models for human action recognition: Probabilistic vs. max-margin. IEEE Trans. PAMI 33, 1310–1323 (2011)
Google Scholar
Yu, C.N.J., Joachims, T.: Learning structural svms with latent variables. In: ICML (2009)
Google Scholar
Xu, L., Neufeld, J., Larson, B., Schuurmans, D.: Maximum margin clustering. In: NIPS (2004)
Google Scholar
Zien, A., Brefeld, U., Scheffer, T.: Transductive support vector machines for structured variables. In: International Conference on Machine Learning, ICML (2007)
Google Scholar
Klaser, A., Marszalek, M., Schmid, C.: A spatio-temporal descriptor based on 3d-gradients. In: BMVC (2008)
Google Scholar
Do, T.M.T., Artieres, T.: Large margin training for hidden markov models with partially observed states. In: ICML (2009)
Google Scholar
Rodriguez, M.D., Ahmed, J., Shah, M.: Action MACH a spatio-temporal maximum average correlation height filter for action recognition. In: CVPR (2008)
Google Scholar
Wang, H., Ullah, M.M., Klaser, A., Laptev, I., Schmid, C.: Evaluation of local spatio-temporal features for action recognition. In: BMVC (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computing Science, Simon Fraser University, Canada
Nataliya Shapovalova, Arash Vahdat, Kevin Cannons, Tian Lan & Greg Mori

Authors

Nataliya Shapovalova
View author publications
You can also search for this author in PubMed Google Scholar
Arash Vahdat
View author publications
You can also search for this author in PubMed Google Scholar
Kevin Cannons
View author publications
You can also search for this author in PubMed Google Scholar
Tian Lan
View author publications
You can also search for this author in PubMed Google Scholar
Greg Mori
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Microsoft Research Ltd., CB3 0FB, Cambridge, UK
Andrew Fitzgibbon
Dept. of Computer Science, University of North Carolina, 27599, Chapel Hill, NC, USA
Svetlana Lazebnik
California Institute of Technology, 91125, Pasadena, CA, USA
Pietro Perona
Institute of Industrial Science, The University of Tokyo, 153-8505, Tokyo, Japan
Yoichi Sato
INRIA, 38330, Montbonnot, France
Cordelia Schmid

Additional information

This work was supported by a Google Research Award, MITACS-Elevate and the Intelligence Advanced Research Projects Activity (IARPA) via Department of Interior National Business Center contract number D11PC20069. The U.S. Government is authorized to reproduce and distribute reprints for Governmental purposes notwithstanding any copyright annotation thereon. Disclaimer: The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of IARPA, DOI/NBC, or the U.S. Government.

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Shapovalova, N., Vahdat, A., Cannons, K., Lan, T., Mori, G. (2012). Similarity Constrained Latent Support Vector Machine: An Application to Weakly Supervised Action Classification. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds) Computer Vision – ECCV 2012. ECCV 2012. Lecture Notes in Computer Science, vol 7578. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33786-4_5

Download citation

DOI: https://doi.org/10.1007/978-3-642-33786-4_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33785-7
Online ISBN: 978-3-642-33786-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Similarity Constrained Latent Support Vector Machine: An Application to Weakly Supervised Action Classification

Abstract

Chapter PDF

Similar content being viewed by others

Multi-label Discriminative Weakly-Supervised Human Activity Recognition and Localization

Discriminative Joint Non-negative Matrix Factorization for Human Action Classification

Max-margin adaptive model for complex video pattern recognition

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Additional information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Similarity Constrained Latent Support Vector Machine: An Application to Weakly Supervised Action Classification

Abstract

Chapter PDF

Similar content being viewed by others

Multi-label Discriminative Weakly-Supervised Human Activity Recognition and Localization

Discriminative Joint Non-negative Matrix Factorization for Human Action Classification

Max-margin adaptive model for complex video pattern recognition

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Additional information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation