Abstract
In this paper, we deal the problem of classification of activities in videos by learning in a different way of modifying the usual input features to enrich the efficiency by reducing the space and time consumption. For efficient action recognition, representing important information is important for reliability. The motivation here is to extract binary silhouette (shape) information from real or raw videos. Most of the researchers are using sequence matching scheme consists of a set of successive silhouette frame that consumes much time and space for learning the system. Here, we adopted recently proposed subspace learning method: spectral regression discriminant analysis (SRDA) for dimensionality reduction. Median Hausdorff distance was used for similarity measures to match the embedded action trajectories. Then, action classification is achieved in a nearest neighbor framework. For efficient learning using SRDA, we use (i) raw silhouette, (ii) images obtained after doing distance transform (DT) in the raw silhouette, (iii) images obtained after extracting edges from the raw silhouettes—edge representation (iv) silhouette history image (SHI), and (v) silhouette energy image (SEI) that gives shape and motion information as an input feature. Using these different input methods, we achieved 100 % correct recognition rate (CRR) for each cases. From the result, it is evident that SHI and SEI is found effective input method than other methods in terms of time and space consumption.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
M. Turk, A. Pentland, Eigenfaces for recognition. J. Cogn. Neurosci. 3(1), 71–86 (1991)
R.O. Duda, P.E. Hart, D.G. Stork, Pattern Classification (2000)
X. He, S. Yan, Y. Hu, P. Niyogi, H. Zhang, Face recognition using laplacian faces. IEEE Trans. Pattern Anal. Mach. Intell. 27(3), 328–340 (2005)
J.B. Tenenbaum, V. de Silva, J.C. Langford, A global geometric framework for non linear dimensionality reduction. Science 290, 2319–2323 (2000)
S.T. Roweis, L.K. Saul, Nonlinear dimensionality reduction by locally linear embedding. Science 290, 2323–2326 (2000)
M. Belkin, P. Niyogi, Laplacian eigenmaps for dimensionality reduction and data representation. Neural Comput. 15, 1373–1396 (2003)
R.R. Coifman, S. Lafon, A.B. Lee, M. Maggioni, B. Nadler, F. Warner, S. Zucker, Geometric diffusions as a tool for harmonics analysis and structure definition of data. Proc. Natl. Acad. Sci. 102(21), 7426–7431 (2005)
B. Schlkopf, A.J. Smola, Learning with Kernels (2002)
L. Wang, D. Suter, Recognizing human activities from silhouettes: motion subspace and factorial discriminative graphical model, in Computer Vision and Pattern Recognition (2007), pp. 1–8
S. Mika, G. Ratsch, J. Weston, B. Scholkopf, K.-R. Muller, Fisher discriminant analysis with kernels, in Proceedings of IEEE Neural Networks for Signal Processing Workshop (NNSP) (1999)
G. Baudat, F. Anouar, Generalized discriminant analysis using a kernel approach. Neural Comput. 12, 2385–2404 (2000)
D. Cai, X. He, J. Han, Regularized locality preserving indexing via spectral regression, in ACM International Conference on Information and Knowledge Management (2007)
D. Cai, X. He, J. Han, Spectral regression for efficient regularized subspace learning, in IEEE International Conference on Computer Vision (ICCV) (2007)
D. Cai, X. He, J. Han, SRDA: an efficient algorithm for large scale discriminant analysis. IEEE Trans. Knowl. Data Eng. 20(1), 1–12 (2008)
H. Scharr, Optimal second order derivative filter families for transparent motion estimation, in 15th European Signal Processing Conference (EUSIPCO 2007) (2007)
M. Ahmad, I. Parvin, S.-W. Lee, Silhouette history and energy image information for human movement recognition. J. Multimedia 5(1), 12–21 (2010)
L. Wang, D. Suter, Learning and matching of dynamic shape manifolds for human action recognition. IEEE Trans. Image Process. 16(6), 1646–1661 (2007)
M. Blank, L. Gorelick, E. Shechtman, M. Irani, R. Basri, Actions as space-time shapes. IEEE Int. Conf. Comput. Vision 2, 1395–1402 (2005)
M. Blank, L. Gorelick, E. Shechtman, M. Irani, R. Basri, Actions as space-time shapes. IEEE Int. Conf. Comput. Vision 2, 1395–1402 (2005)
V. Kellokumpu, G. Zhao, M. Pietikainen, Texture based description of movements for activity analysis, in Proceedings of 3rd International Conference on Computer Vision Theory and Applications (2008)
K. Jia, D. Yeung, Human action recognition using Local Spatio-Temporal Discriminant Embedding, in IEEE Conference on Computer Vision and Pattern Recognition (2008)
F. Zheng, L. Shao, Z. Song, Eigen-space Learning Using Semi-supervised Diffusion Maps for Human Action Recognition, in ACM International Conference on Image and Video Retrieval (CIVR) (2010)
L. Shao, X. Chen, Histogram of body poses and spectral regression discriminant analysis for human action categorization, in Proceedings of the British Machine Vision Conference (2010)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer India
About this paper
Cite this paper
Glorindal Selvam, G., Gnanadurai, D. (2015). Efficient Silhouette-based Input Methods for Reliable Human Action Recognition from Videos. In: Suresh, L., Dash, S., Panigrahi, B. (eds) Artificial Intelligence and Evolutionary Algorithms in Engineering Systems. Advances in Intelligent Systems and Computing, vol 325. Springer, New Delhi. https://doi.org/10.1007/978-81-322-2135-7_54
Download citation
DOI: https://doi.org/10.1007/978-81-322-2135-7_54
Published:
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-2134-0
Online ISBN: 978-81-322-2135-7
eBook Packages: EngineeringEngineering (R0)