Abstract
Activity understanding plays an essential role in video content analysis and remains a challenging open problem. Most of previous research is limited due to the use of excessively localized features without sufficiently encapsulating the interaction context or focus on simply discriminative models but totally ignoring the interaction patterns. In this paper, a new approach is proposed to recognize human group activities. Firstly, we design a new quaternion descriptor to describe the interactive insight of activities regarding the appearance, dynamic, causality and feedback, respectively. The designed descriptor is capable of delineating the individual and pairwise interactions in the activities. Secondly, considering both activity category and interaction variety, we propose an extended pLSA (probabilistic Latent Semantic Analysis) model with two hidden variables. This extended probabilistic graphic paradigm constructed on the quaternion descriptors facilitates the effective inference of activity categories as well as the exploration of activity interaction patterns. The experiments on the realistic movie and human activity databases validate that the proposed approach outperforms the state-of-the-art results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Turaga, P., Chellappa, R., Subrahmanian, V.S., Udrea, O.: Machine recognition of human activitites: a survey. T-CSVT (2008)
Bobick, A., Davis, J.: The recognition of human movement using temporal templates. T-PAMI (2001)
Shi, J., Malik, J.: Normalized cuts and image segmentation. T-PAMI (2000)
Wang, Y., Mori, G.: Human action recognition by semilatent topic models. T-PAMI (2009)
Niebles, J.C., Wang, H., Li, F.F.: Unsupervised learning of human action categories using spatial-temoral words. IJCV (2008)
Lowe, D.: Distincitive image features from scale-invariant keypoints. IJCV (2004)
Isard, M., Blake, A.: CONDENSATION - Conditional density propagation for visual tracking. IJCV (1998)
Granger, C.W.J.: Investigating causal relations by econometric models and cross-spectral methods. Econometrica (1969)
Liu, J., Luo, J., Shah, M.: Recognizing realistic actions from videos in the wild. In: CVPR (2009)
Laptev, I., Lindeberg, T.: Space-time interest points. In: ICCV (2003)
Torralba, A., Murphy, K., Freeman, W., Rubin, M.: Context-based vision system for place and object recognition. In: ICCV (2003)
Schuldt, C., Laptev, I., Caputo, B.: Recognizing human actions: a local svm approach. In: ICPR (2004)
Liu, Z., Sarkar, S.: Simplest representation yet for gait recognition: averaged silhouette. In: ICPR (2004)
Andrade, E., Blunsden, S., Fisher, R.: Modelling crowd scenes for event detection. In: ICPR (2006)
Ryoo, M.S., Aggarwal, J.K.: Hierarchical recognition of human activities interacting with objects. In: CVPR (2007)
Turaga, P., Veraraghavan, A., Chellappa, R.: From videos to verbs: mining videos for activites using a cascade of dynamical system. In: CVPR (2007)
Zhou, Y., Yan, S., Huang, T.: Pair-activity classification by bi-trajectory analysis. In: CVPR (2008)
Ni, B., Yan, S., Kassim, A.: Recognizing human group activities with localized causalities. In: CVPR (2009)
Marszalek, M., Laptev, I., Schmid, C.: Actions in Context. In: CVPR 2009 (2009)
Sun, J., Wu, X., Yan, S., Cheong, L.F., Chua, T.S., Li, J.: Hierarchical spatio-temporal context modeling for action recognition. In: CVPR (2009)
Mortensen, E., Deng, H., Shapiro, L.: A SIFT descriptor with global context. In: CVPR (2005)
Hofmann, T.: Probabilistic latent semanic indexing. In: ACM SIGIR (1999)
Breiman, L.: Probability. Society for Industrial Mathematics (1992)
Jury, E.I.: Sampled-data control systems. John Wiley & Sons, Chichester (1958)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhu, G., Yan, S., Han, T.X., Xu, C. (2011). Generative Group Activity Analysis with Quaternion Descriptor. In: Lee, KT., Tsai, WH., Liao, HY.M., Chen, T., Hsieh, JW., Tseng, CC. (eds) Advances in Multimedia Modeling. MMM 2011. Lecture Notes in Computer Science, vol 6524. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17829-0_1
Download citation
DOI: https://doi.org/10.1007/978-3-642-17829-0_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17828-3
Online ISBN: 978-3-642-17829-0
eBook Packages: Computer ScienceComputer Science (R0)