Feature design scheme for Kinect-based DTW human gesture recognition

Multimedia Tools and Applications

Abstract

Feature selection is a crucial factor in Kinect-based pattern recognition, including human gesture recognition. Conventionally, the feature extracted for Kinect-based gesture recognition consists of the (x, y, z) coordinates of the primary joints of the human body. However, a feature containing only joint positions is insufficient to describe the characteristics of human activity patterns clearly. This paper proposes a feature design scheme that hybridizes joint positions and joint angles for human gesture recognition with the Kinect camera. The method combines the 20 main human joint positions captured by the Kinect camera with the joint angle information of 12 critical joints, together with the significant angle variations that occur when a gesture is made. The method is applied to dynamic time warping (DTW) gesture recognition: for each gesture category in the DTW reference template database, it derives an appropriately sized feature vector according to the activity characteristics of that category. Experiments on Kinect-based DTW gesture recognition involving 14 common categories of human gestures show that the feature determined by the proposed approach outperforms the conventional feature, which considers only joint position information.
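
To make the idea in the abstract concrete, the following is a minimal Python sketch of a hybrid position-plus-angle frame feature fed into a textbook DTW nearest-template classifier. It is an illustration under stated assumptions, not the authors' implementation: the function names (`joint_angle`, `frame_feature`, `dtw_distance`, `classify`), the Euclidean local cost, the unconstrained warping path, and the way angles are appended to positions are all hypothetical choices. In particular, the sketch uses one fixed feature layout and does not reproduce the per-category feature sizing or the angle-variation selection described in the paper, and it ignores the scaling between coordinates and angles.

```python
import math


def joint_angle(a, b, c):
    """Angle in degrees at joint b, between the segments b->a and b->c."""
    u = [a[i] - b[i] for i in range(3)]
    v = [c[i] - b[i] for i in range(3)]
    dot = sum(ui * vi for ui, vi in zip(u, v))
    norm = math.sqrt(sum(ui * ui for ui in u)) * math.sqrt(sum(vi * vi for vi in v))
    if norm == 0.0:
        return 0.0  # degenerate case: two joints coincide
    cos_theta = max(-1.0, min(1.0, dot / norm))  # guard against rounding
    return math.degrees(math.acos(cos_theta))


def frame_feature(positions, angle_triples):
    """Concatenate flattened (x, y, z) joint positions with selected joint angles.

    positions: list of (x, y, z) tuples, one per tracked joint.
    angle_triples: list of (i, j, k) joint indices; the angle is measured at j.
    (Which joints to use is an assumption left to the caller.)
    """
    feature = [coord for p in positions for coord in p]
    for i, j, k in angle_triples:
        feature.append(joint_angle(positions[i], positions[j], positions[k]))
    return feature


def dtw_distance(seq_a, seq_b):
    """Classic DTW between two sequences of feature vectors.

    Uses a Euclidean local cost and no window constraint (assumed here).
    """
    n, m = len(seq_a), len(seq_b)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(seq_a[i - 1], seq_b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def classify(query, templates):
    """Return the label of the reference template closest to the query under DTW."""
    return min(templates, key=lambda label: dtw_distance(query, templates[label]))
```

In use, each captured frame would be turned into a vector with `frame_feature`, a gesture would be the sequence of those vectors, and `classify` would compare it against one stored template sequence per gesture category.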

Acknowledgments

This research is partially supported by the Ministry of Science and Technology (MOST) in Taiwan under Grant MOST 103-2221-E-150-046.

Author information

Corresponding author

Correspondence to Ing-Jr Ding.

About this article

Cite this article

Ding, IJ., Chang, CW. Feature design scheme for Kinect-based DTW human gesture recognition. Multimed Tools Appl 75, 9669–9684 (2016). https://doi.org/10.1007/s11042-015-2782-3

  • DOI: https://doi.org/10.1007/s11042-015-2782-3
