Abstract
Human actions are spatio-temporal visual events and recognizing human actions in different conditions is still a challenging computer vision problem. In this paper, we introduce a robust feature based space-time correlation filter, called Action-02MCF (0’zero-aliasing’ 2M’ Maximum Margin’) for recognizing human actions in video sequences. This filter combines (i) the sparsity of spatio-temporal feature space, (ii) generalization of maximum margin criteria, (iii) enhanced aliasing free localization performance of correlation filtering using (iv) rich context of maximally stable space-time interest points into a single classifier. Its rich multi-objective function provides robustness, generalization and recognition as a single package. Action-02MCF can simultaneously localize and classify actions of interest even in clutter and adverse imaging conditions. We evaluate the performance of our proposed filter for challenging human action datasets. Experimental results verify the performance potential of our action-filter compared to other correlation filtering based action recognition approaches.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Ali, S., Lucey, S.: Are correlation filters useful for human actionrecognition? In: 2010 20th InternationalConference on Pattern Recognition (ICPR), pp. 2608–2611, August 2010
Boddeti, V.N., Kanade, T., Kumar, B.: Correlation filters for object alignment. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition, pp. 2291–2298 (2013)
Boddeti, V.N., Kumar, B.: Maximum margin vector correlation filter. arXiv preprint (2014). arXiv:1404.6031
Davis, J.W., Sharma, V.: Background-subtraction using contour-based fusion of thermal and visible imagery. Comput. Vis. Image Underst. 106(2), 162–182 (2007)
Donoser, M., Bischof, H.: 3d segmentation by maximally stable volumes (msvs). In: International Conference on Pattern Recognition, vol. 1, pp. 63–66. IEEE (2006)
Everts, I., Gemert, J., Gevers, T.: Evaluation of color stips for human action recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2850–2857 (2013)
Fernandez, J.A., Boddeti, V.N., Rodriguez, A., Kumar, B.V.K.V.: Zero-aliasing correlation filters for object recognition. IEEE Trans. Pattern Anal. Mach. Intell. 37(8), 1702–1715 (2015)
Fernandez, J.A., Kumar, B.V.: Space-time correlation filters for human action detection. In: SPIE Electronic Imaging, pp. 866304–866304 (2013)
Galoogahi, H., Sim, T., Lucey, S.: Multi-channel correlation filters. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 3072–3079 (2013)
Gong, S., Xiang, T.: Recognition of group activities using dynamic probabilistic networks. In: IEEE International Conference on Computer Vision, pp. 742–749 (2003)
Anwaar, G.I.H., Manzur, M.: Action recognition using spatio-temporal distance classifier correlation filter. In: International Conference on Digital Image Computing Techniques and Applications (DICTA), pp. 474–479 (2011)
Hester, C.F., Casasent, D.: Multivariant technique for multiclass pattern recognition. Appl. Optics 19(11), 1758–1761 (1980)
Junejo, I.N., Dexter, E., Laptev, I., Perez, P.: View-independent action recognition from temporal self-similarities. IEEE Trans. Pattern Anal. Mach. Intell. 33(1), 172–185 (2011)
Kiani, H., Sim, T., Lucey, S.: Multi-channel correlation filters for human action recognition. In: International Conference on Image Processing (ICIP), pp. 1485–1489. IEEE (2014)
Klaser, A., Marszałek, M., Schmid, C.: A spatio-temporal descriptor based on 3d-gradients. In: British Machine Vision Conference, pp. 275–281 (2008)
Kumar, B.V.K.V., Mahalanobis, A., Juday, R.D.: Correlation Pattern Recognition. Cambridge University Press, New York (2005)
Laptev, I., Marszałek, M., Schmid, C., Rozenfeld, B.: Learning realistic human actions from movies. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8 (2008)
Lewis, J., Nikolov, S., Loza, A., Canga, E.F., Cvejic, N., Li, J., Cardinali, A., Canagarajah, C., Bull, D., Riley, T., et al.: The eden project multi-sensor data set. The Online Resource for Research in Image Fusion ImageFusion.org (2006)
Liu, J., Luo, J., Shah, M.: Recognizing realistic actions from videos in the wild. In: IEEE Conference on Computer Vision and Pattern Recognition, pp.1996–2003 (2009)
Rodriguez, A., Boddeti, V.N., Kumar, B., Mahalanobis, A.: Maximum margin correlation filter: a new approach for localization and classification. IEEE Trans. Image Process. 22(2), 631–643 (2013)
Rodriguez, M.D., Ahmed, J., Shah, M.: Action mach a spatio-temporal maximum average correlation height filter for action recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2008, pp. 1–8, June 2008
Siva, P., Xiang, T.: Action detection in crowd. In: British Machine Vision Conference, pp. 1–11 (2010)
Toet, A., de Jong, M.J., Hogervorst, M.A., Hooge, I.T.: Perceptual evaluation of colorized nighttime imagery. In: IS&T/SPIE Electronic Imaging, pp. 901412–901412 (2014)
Ulhaq, A., Iqbal, G., Murshed, M.: Contextual action recognition in multi-sensor nighttime video sequences. In: Digital Image Computing Techniques and Applications (DICTA), pp. 256–261 (2011)
Weinland, D., Özuysal, M., Fua, P.: Making action recognition robust to occlusions and viewpoint changes. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6313, pp. 635–648. Springer, Heidelberg (2010). doi:10.1007/978-3-642-15558-1_46
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Ulhaq, A., Yin, X., Zhang, Y., Gondal, I. (2016). Action-02MCF: A Robust Space-Time Correlation Filter for Action Recognition in Clutter and Adverse Lighting Conditions. In: Blanc-Talon, J., Distante, C., Philips, W., Popescu, D., Scheunders, P. (eds) Advanced Concepts for Intelligent Vision Systems. ACIVS 2016. Lecture Notes in Computer Science(), vol 10016. Springer, Cham. https://doi.org/10.1007/978-3-319-48680-2_41
Download citation
DOI: https://doi.org/10.1007/978-3-319-48680-2_41
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48679-6
Online ISBN: 978-3-319-48680-2
eBook Packages: Computer ScienceComputer Science (R0)