
An approach for complex activity recognition by key frames

Journal of Central South University

Abstract

A new method for recognizing complex activities in videos by means of key frames was presented. The progressive bisection strategy (PBS) was employed to divide a complex activity into a series of simple activities, and the key frames representing each simple activity were extracted by the self-splitting competitive learning (SSCL) algorithm. A new similarity criterion for complex activities was defined. Besides the regular visual factor, it considers an order factor, which measures the temporal matching relationship of the simple activities, and an interference factor, which measures their discontinuous matching relationship. On this basis, complex human activities were recognized by calculating the similarity between them. Because the recognition of the individual simple activities is skipped, the recognition error was reduced compared with other methods. The proposed method was tested and evaluated on a self-built broadcast gymnastics database and a dancing database. The experimental results indicate that the proposed method is more efficient than the compared methods.
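The abstract names three factors in the similarity criterion but does not give their formulas. The following Python sketch is therefore only an illustration of how such a criterion could be assembled, not the authors' definition. Everything in it is an assumption: key frames are taken to be plain feature vectors, the visual factor is an averaged cosine similarity, the order factor is the share of matched key frames that appear in the same temporal order (via a longest increasing subsequence), a continuity score stands in for the interference factor, and the three are combined by multiplication. All function names and the threshold value are hypothetical.

```python
import math
from bisect import bisect_left


def cosine(a, b):
    """Cosine similarity of two feature vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def match_key_frames(query, ref, threshold=0.9):
    """Pair each query key frame with its most similar reference key frame.

    Returns (query_index, ref_index, similarity) for pairs whose similarity
    reaches the (assumed) threshold; other query frames stay unmatched.
    """
    matches = []
    for i, q in enumerate(query):
        sims = [cosine(q, r) for r in ref]
        j = max(range(len(ref)), key=sims.__getitem__)
        if sims[j] >= threshold:
            matches.append((i, j, sims[j]))
    return matches


def lis_length(seq):
    """Length of the longest strictly increasing subsequence of seq."""
    tails = []
    for v in seq:
        k = bisect_left(tails, v)
        if k == len(tails):
            tails.append(v)
        else:
            tails[k] = v
    return len(tails)


def complex_activity_similarity(query, ref, threshold=0.9):
    """Hypothetical similarity of two key-frame sequences, in [0, 1]."""
    matches = match_key_frames(query, ref, threshold)
    if not matches:
        return 0.0
    # Visual factor: matched similarity averaged over all query key frames,
    # so unmatched query frames lower the score.
    visual = sum(s for _, _, s in matches) / len(query)
    # Order factor: share of matches that keep the reference's temporal order.
    ref_idx = [j for _, j, _ in matches]
    order = lis_length(ref_idx) / len(ref_idx)
    # Continuity (stand-in for the interference factor): share of consecutive
    # matches that are adjacent in both sequences, i.e. not interrupted.
    if len(matches) > 1:
        adjacent = sum(
            1
            for (i1, j1, _), (i2, j2, _) in zip(matches, matches[1:])
            if i2 - i1 == 1 and j2 - j1 == 1
        )
        continuity = adjacent / (len(matches) - 1)
    else:
        continuity = 1.0
    # Assumed combination rule: a simple product of the three factors.
    return visual * order * continuity
```

Under these assumptions, a query whose key frames match the reference in the same order and without gaps scores 1.0, while a reversed query or one interrupted by an unrelated key frame scores lower. A weighted sum would be an equally plausible combination rule; the product is chosen here only because it lets any single poor factor pull the score down.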



Author information

Corresponding author

Correspondence to Li-min Xia  (夏利民).

Additional information

Foundation item: Project (50808025) supported by the National Natural Science Foundation of China; Project (20090162110057) supported by the Doctoral Fund of the Ministry of Education, China.

Cite this article

Xia, Lm., Shi, Xt. & Tu, Hb. An approach for complex activity recognition by key frames. J. Cent. South Univ. 22, 3450–3457 (2015). https://doi.org/10.1007/s11771-015-2885-z

  • DOI: https://doi.org/10.1007/s11771-015-2885-z
