Abstract
Human action recognition is an important area of research in computer vision. Its applications include surveillance systems, patient monitoring, human-computer interaction, just to name a few. Numerous techniques have been developed to solve this problem in 2D and 3D spaces. However 3D imaging gained a lot of interest nowadays. In this paper we propose a novel view-independent action recognition algorithm based on fusion between a global feature and a graph based feature. We used the motion history of skeleton volumes; we compute a skeleton for each volume and a motion history for each action. Then, alignment is performed using cylindrical coordinates-based Fourier transform to form a feature vector. A dimension reduction step is subsequently applied using PCA and action classification is carried out by using Mahalonobis distance, and Linear Discernment analysis. The second feature is the temporal changes in bounding volume, volumes are aligned using PCA and each divided into sub volumes then temporal change in volume is calculated and classified using Logistic Model Trees. The fusion is done using majority vote. The proposed technique is evaluated on the benchmark IXMAS and i3DPost datasets where results of the fusion are compared against using each feature individually. Obtained results demonstrate that fusion improve the recognition accuracy over individual features and can be used to recognize human actions independent of view point and scale.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Aggarwal, J.K., Ryoo, M.S.: Human activity analysis: A review. ACM Comput. Surv. 43(3), 1–43 (2011)
Polana, R., Nelson, R.: Recognizing activities. In: Proceedings of the 12th IAPR International Conference on Pattern Recognition, Conference A: Computer Vision & Image Processing, vol. 1 (1994)
Bobick, A.F., Davis, J.W.: The recognition of human movement using temporal templates. IEEE Transactions on Pattern Analysis and Machine Intelligence 23(3), 257–267 (2001)
Davis, J.W.: Hierarchical motion history images for recognizing human motion. In: Proceedings of the IEEE Workshop on Detection and Recognition of Events in Video 2001 (2001)
Weinland, D., Ronfard, R., Boyer, E.: Free viewpoint action recognition using motion history volumes. Comput. Vis. Image Underst. 104(2), 249–257 (2006)
Weinland, D., Ronfard, R., Boyer, E.: Motion history volumes for free viewpoint action recognition. In: IEEE International Workshop on Modeling People and Human Interaction, vol. 104(2) (2005)
Karali, A., ElHelw, M.: Motion History of Skeletal Volumes for Human Action Recognition. In: Bebis, G., Boyle, R., Parvin, B., Koracin, D., Fowlkes, C., Wang, S., Choi, M.-H., Mantler, S., Schulze, J., Acevedo, D., Mueller, K., Papka, M. (eds.) ISVC 2012, Part II. LNCS, vol. 7432, pp. 135–144. Springer, Heidelberg (2012)
Laurentini, A.: The Visual Hull Concept for Silhouette-Based Image Understanding. IEEE Trans. Pattern Anal. Mach. Intell. 16(2), 150–162 (1994)
Bouix, S., Siddiqi, K.: Divergence-Based Medial Surfaces. In: Vernon, D. (ed.) ECCV 2000 Part I. LNCS, vol. 1842, pp. 603–618. Springer, Heidelberg (2000)
Swets, D.L., Weng, J.J.: Using discriminant eigenfeatures for image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence 18(8), 831–836 (1996)
Landwehr, N., Hall, M., Frank, E.: Logistic Model Trees. Mach. Learn. 59(1-2), 161–205 (2005)
Quinlan, J.R.: C4.5: programs for machine learning. Morgan Kaufmann Publishers Inc. (1993)
Cleary, J.G., Trigg, L.E.: K*: An Instance-based Learner Using an Entropic Distance Measure. In: Proceedings of the 12th International Conference on Machine Learning, pp. 108-114 Key (1995)
4D Repository, INRIA Xmas Motion Acquisition Sequences (IXMAS). INRIA Xmas Motion Acquisition Sequences (IXMAS) (cited 2012), http://4drepository.inrialpes.fr/public/viewgroup/6
Gkalelis, N., et al.: The i3DPost Multi-View and 3D Human Action/Interaction Database. In: Proceedings of the 2009 Conference for Visual Media Production, pp. 159–168. IEEE Computer Society (2009)
Turaga, P., Veeraraghavan, A., Chellappa, R.: Statistical analysis on Stiefel and Grassmann manifolds with applications in computer vision. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2008 (2008)
Weinland, D., Boyer, E., Ronfard, R.: Action Recognition from Arbitrary Views using 3D Exemplars. In: IEEE 11th International Conference on Computer Vision, ICCV 2007 (2007)
Fengjun, L., Nevatia, R.: Single View Human Action Recognition using Key Pose Matching and Viterbi Path Searching. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2007 (2007)
Pingkun, Y., Khan, S.M., Shah, M.: Learning 4D action feature models for arbitrary view action recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR (2008)
Liu, J., et al.: Cross-view action recognition via view knowledge transfer. In: Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition, pp. 3209–3216. IEEE Computer Society (2011)
Gkalelis, N., Nikolaidis, N., Pitas, I.: View indepedent human movement recognition from multi-view video exploiting a circular invariant posture representation. In: Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, pp. 394–397. IEEE Press, New York (2009)
Iosifidis, A., Nikolaidis, N., Pitas, I.: Movement recognition exploiting multi-view information. In: 2010 IEEE International Workshop on Multimedia Signal Processing, MMSP (2010)
Holte, M.B., et al.: 3D Human Action Recognition for Multi-view Camera Systems. In: Proceedings of the 2011 International Conference on 3D Imaging, Modeling, Processing, Visualization and Transmission, pp. 342–349. IEEE Computer Society (2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Karali, A., ElHelw, M. (2013). Motion History of Skeletal Volumes and Temporal Change in Bounding Volume Fusion for Human Action Recognition. In: Schwenker, F., Scherer, S., Morency, LP. (eds) Multimodal Pattern Recognition of Social Signals in Human-Computer-Interaction. MPRSS 2012. Lecture Notes in Computer Science(), vol 7742. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37081-6_10
Download citation
DOI: https://doi.org/10.1007/978-3-642-37081-6_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37080-9
Online ISBN: 978-3-642-37081-6
eBook Packages: Computer ScienceComputer Science (R0)