Abstract
A new method for complex activity recognition in videos by key frames was presented. The progressive bisection strategy (PBS) was employed to divide the complex activity into a series of simple activities and the key frames representing the simple activities were extracted by the self-splitting competitive learning (SSCL) algorithm. A new similarity criterion of complex activities was defined. Besides the regular visual factor, the order factor and the interference factor measuring the timing matching relationship of the simple activities and the discontinuous matching relationship of the simple activities respectively were considered. On these bases, the complex human activity recognition could be achieved by calculating their similarities. The recognition error was reduced compared with other methods when ignoring the recognition of simple activities. The proposed method was tested and evaluated on the self-built broadcast gymnastic database and the dancing database. The experimental results prove the superior efficiency.
Similar content being viewed by others
References
AGGARWAL J K, RYOO M S. Human activity analysis: A review [J]. ACM Computing Surveys (CSUR), 2011, 43(3): 16–62.
OLIVER N, HORVITZ E, GARG A. Layered representations for human activity recognition [C]// Fourth IEEE International Conference on Multimodal Interfaces. Pittsburgh, PA, USA: IEEE Computer Science, 2002: 3–8.
KHOSHHAL K, ALIAKBARPOUR H, MEKHNACHA K, JULIEN R S, JOAO Q, JORGE D. Lma-based human behaviour analysis using HMM [C]// 2nd IFIP WG 5.5/SOCOLNET Doctoral Conference on Computing, Electrical and Industrial Systems. Costa de Caparica, Portugal: Springer-Verlag, 2011: 189–196.
MOZAFARI K, CHARKARI N M, BOROUJENI H S, BEHROUZIFAR M. A novel fuzzy HMM approach for human action recognition in video [C]// 3rd Conference of Knowledge Technology Week. Kajang, Malaysia: Springer-Verlag, 2012: 184–193.
DAI Peng, DI Hui-jun, DONG Li-geng, TAO Lin-mi. Group interaction analysis in dynamic context [J]. IEEE Transactions On Systems, Man and Cybernetics—Part B, 2009, 39(1): 34–42.
DAMEN D, HOGG D. Recognizing linked events: Searching the space of feasible explanations [C]// IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Miami, Florida, USA: IEEE Computer Science, 2009: 927–934.
ARSIC D, SCHULLER B. Real time person tracking and behavior interpretation in multi-camera scenarios applying homography and coupled HMMs [C]// 2012 International Conference on Analysis of Verbal and Nonverbal Communication and Enactment. Budapest, Hungary: Springer-Verlag, 2011: 1–18.
JOO S W, CHELLAPPA R. Attribute grammar-based event recognition and anomaly detection [C]// IEEE Conference on Computer Vision and Pattern Recognition Workshop (CVPRW). New York: IEEE Computer Science, 2006: 107–107.
FERNANDEZ-CABALLERO A, CASTILLO J C, RODRIGUEZSANCHEZ J M. Human activity monitoring by local and global finite state machines [J]. Expert Systems with Applications, 2012, 39(8): 6982–6993.
SANROMA G, BURGHOUTS G, SCHUTTE K. Recognition of long-term behaviors by parsing sequences of short-term actions with a stochastic regular grammar [C]// Proceedings of the 2012 Joint IAPR International Conference on Structural, Syntactic, and Statistical Pattern Recognition (SSPR’12/SPR’12). Berlin Heidelberg: Springer-Verlag, 2012: 225–233.
SANROMA G, PATINO L, BURGHOUTS G, SCHUTTE K, FERRYMAN J. A unified approach to the recognition of complex actions from sequences of zone-crossings [J]. Image and Vision Computing, 2014, 32(5): 363–378.
NIEBLES J C, WANG Hong-cheng, LI Fei-fei. Unsupervised learning of human action categories using spatial-temporal words [J]. International Journal of Computer Vision, 2008, 79(3): 299–318.
HAMID R, MADDI S, JOHNSON A, BOBICK A, ESSA I, ISBELL C. A novel sequence representation for unsupervised analysis of human activities [J]. Artificial Intelligence, 2009, 173(14): 1221–1244.
KOOIJ J F P, ENGLEBIENNE G, GAVRILA D M. A non-parametric hierarchical model to discover behavior dynamics from tracks [C]// 12th European Conference on Computer Vision (ECCV). Florence, Italy: Springer-Verlag, 2012: 270–283.
OUANANE A, SERIR A, KEROUH F. New geometric descriptor for the recognition of aggressive human behavior [C]// Fifth IEEE International Congress on Image and Signal Processing (CISP). Morocco: IEEE Computer Science, 2012: 148–153.
ZIVKOVIC, Z. H., FERDINAND V D. Efficient adaptive density estimation per image pixel for the task of background subtraction [J]. Pattern Recognition Letters, 2006, 27(7): 773–780.
HUO Y, ZHANG P, WANG Y. Adaptive threshold video shot boundary detection algorithm based on progressive bisection strategy [J]. Journal of Information & Computational Science, 2014, 11(2): 391–403.
XIA Li-min, DENG Ke-jie. The key frame extraction based on self-splitting competitive learning algorithm [J]. Computer Engineering and Applications, 2011, 47(2): 146–148. (in Chinese)
LI Wei-wei, LIU Chun-ping, WANG Zhao-hui, ZHANG Shu-kui. Fuzzy C-means image classification method based on SSCL [J]. Journal of Image and Graphics, 2011, 16(2): 215–220. (in Chinese)
XIA Li-min, HUANG Jin-xia, TAN Lun-zheng. Human action recognition based on chaotic invariants [J]. Journal of Central South University, 2013, 20: 3171–3179.
Author information
Authors and Affiliations
Corresponding author
Additional information
Foundation item: Project(50808025) supported by the National Natural Science Foundation of China; Project(20090162110057) supported by the Doctoral Fund of Ministry of Education, China
Rights and permissions
About this article
Cite this article
Xia, Lm., Shi, Xt. & Tu, Hb. An approach for complex activity recognition by key frames. J. Cent. South Univ. 22, 3450–3457 (2015). https://doi.org/10.1007/s11771-015-2885-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11771-015-2885-z