Abstract
Feature selection is a crucial factor in Kinect-based pattern recognition, including common human gesture recognition. For Kinect-based human gesture recognition, the information contained in the feature extracted for gesture recognition is conventionally the (x,y,z) coordinates of the primary joints in the human body. However, such traditionally used feature information containing only joint positions is apparently insufficient for clearly describing the characteristics of human activity patterns. This paper proposes a feature design scheme involving hybridizations of joint positions and joint angles for human gesture recognition with the Kinect camera. The presented feature design method effectively hybridizes the 20 main human joint positions captured by the Kinect camera and the joint angle information of 12 critical joints, along with significant angle variations when a gesture is made. The method is employed in dynamic time warping (DTW) gesture recognition. When the proposed feature design method is used for Kinect-based DTW human gesture recognition, it derives an appropriately sized feature vector for each of the gesture categories in the DTW-referenced template database according to the activity characteristics of a certain category of gestures. Experiments on Kinect-based DTW gesture recognition involving 14 common categories of human gestures show that the feature determined using the proposed approach is superior to that obtained using the conventional approach, which considers only the joint position information.
Similar content being viewed by others
References
Bautista MÁ, Hernández-Vela A, Ponce V, Perez-Sala X, Baró X, Pujol O, Angulo C, Escalera S (2013) Probability-based dynamic time warping for gesture recognition on RGB-D data. Lect Notes Comput Sci 7854:126–135
Carmona JM, Climent J (2012) A performance evaluation of HMM and DTW for gesture recognition. Lect Notes Comput Sci 7441:236–243
Ding IJ (2013) Speech recognition using variable-length frame overlaps by intelligent fuzzy control. J Intell Fuzzy Syst 25(1):49–56
Ding IJ, Chang CW An eigenspace-based method with a user adaptation scheme for human gesture recognition by using Kinect 3D data. Appl Math Model. doi:10.1016/j.apm.2014.12.054
Ding IJ, Chang CW An adaptive hidden Markov model-based gesture recognition approach using Kinect to simplify large-scale video data processing for humanoid robot imitation. Multimed Tools Appl. doi:10.1007/s11042-015-2505-9
Ding IJ, Yen CT (2013) Enhancing GMM speaker identification by incorporating SVM speaker verification for intelligent web-based speech applications. Multimed Tools Appl. doi:10.1007/s11042-013-1587-5
Han J, Shao L, Xu D, Shotton J (2013) Enhanced computer vision with microsoft Kinect sensor: a review. IEEE Trans Cybern 43(5):2168–2267
Nguyen-Duc-Thanh N, Lee S, Kim D (2012) Two-stage hidden Markov model in gesture recognition for human robot interaction. Int J Adv Robot Syst 9:1–10
Qian K, Niu J, Yang H (2013) Developing a gesture based remote human-robot interaction system using Kinect. Int J Smart Home 7(4):203–208
Sakoe H, Chiba S (1978) Dynamic programming algorithm optimization for spoken word recognition. IEEE Trans Acoust Speech Signal Process 26(1):43–49
Su CJ (2013) Personal rehabilitation exercise assistant with Kinect and dynamic time warping. Int J Inform Educ Technol 3(4):448–454
Su CJ, Huang JY, Huang SF (2012) Ensuring home-based rehabilitation exercise by using Kinect and fuzzified dynamic time warping algorithm. Proc. the Asia Pacific Industrial Engineering & Management Systems Conference, pp. 884–895
Tashev I (2013) Kinect development kit: a toolkit for gesture- and speech based human-machine interaction. IEEE Signal Process Mag 30(5):129–131
Wu J, Konrad J, Ishwar P (2013) Dynamic time warping for gesture-based user identification and authentication with Kinect. Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2371–2375
Xu W, Lee EJ (2012) Continuous gesture recognition system using improved HMM algorithm based on 2D and 3D space. Int J Multimed Ubiquitous Eng 7(2):335–340
Zhang L, Hsieh JC, Wang J (2012) A Kinect-based golf swing classification system using HMM and neuro-fuzzy. Proc. IEEE International Conference on Computer Science and Information Processing (CSIP), pp. 1163–1166
Zhang X, Ye Z, Jin L, Feng Z, Xu S (2013) A new writing experience: finger writing in the air using a Kinect sensor. IEEE MultiMed 20(4):85–93
Acknowledgments
This research is partially supported by the Ministry of Science and Technology (MOST) in Taiwan under Grant MOST 103-2221-E-150-046.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Ding, IJ., Chang, CW. Feature design scheme for Kinect-based DTW human gesture recognition. Multimed Tools Appl 75, 9669–9684 (2016). https://doi.org/10.1007/s11042-015-2782-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-015-2782-3