IWCIA 2011: Combinatorial Image Analysis, pp. 483–493
A Shared Parameter Model for Gesture and Sub-gesture Analysis
Abstract
Gesture sequences typically share a common set of distinct internal sub-structures. In this paper, we propose a method that uses a generative model to learn these common actions, which we refer to as sub-gestures, and in turn to perform recognition. Our proposed model learns sub-gestures by sharing parameters between gesture models. We evaluated our method on the Palm Graffiti digits-gesture dataset and showed that the model with shared parameters outperformed the same model without shared parameters. We also labeled different observation sequences, thereby showing intuitively how sub-gestures relate to complete gestures.