Unsupervised Analysis of Human Gestures

Wang, Tian-Shu; Shum, Heung-Yeung; Xu, Ying-Qing; Zheng, Nan-Ning

doi:10.1007/3-540-45453-5_23

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2195)

Included in the following conference series:

Pacific-Rim Conference on Multimedia

Abstract

Recognizing human gestures is important for analyzing and indexing video. Gesture recognition in video generally requires collecting a large number of training examples for each individual gesture, a labor-intensive and error-prone process that is feasible only for a limited set of gestures. In this paper, we present an approach for automatically segmenting sequences of natural activities into atomic sections and clustering them. Our work is inspired by natural language processing, where words are extracted from long sentences; analogously, we extract primitive gestures from sequences of human motion. Our approach consists of two steps. First, the sequences of human motion are segmented into atomic components and clustered using a Hidden Markov Model, so that the original sequences can be represented by discrete symbols. Second, we extract a lexicon from these discrete sequences using an algorithm named COMPRESSIVE. Experimental results on music conducting gestures demonstrate the effectiveness of our approach.
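
The second step, extracting a lexicon from a sequence of discrete symbols, can be illustrated with a minimal sketch. The abstract does not describe how COMPRESSIVE works, so the code below is not that algorithm: it substitutes a simple greedy repeated-pair merge (similar in spirit to byte-pair encoding) as a stand-in. The function name `extract_lexicon`, the `min_count` parameter, and the toy input are assumptions made purely for illustration.

```python
from collections import Counter


def extract_lexicon(symbols, min_count=2):
    """Greedy repeated-pair merging over a discrete symbol sequence.

    Illustrative stand-in only; this is NOT the COMPRESSIVE algorithm.
    Returns (lexicon, tokens): the merged units that were discovered and
    the input sequence re-expressed as tuples of atomic symbols.
    """
    # Each token is a tuple of atomic symbols, initially of length one.
    seq = [(s,) for s in symbols]
    lexicon = []
    while True:
        # Count adjacent token pairs (overlapping occurrences included).
        pairs = Counter(zip(seq, seq[1:]))
        if not pairs:
            break
        (left, right), count = pairs.most_common(1)[0]
        if count < min_count:
            break
        merged = left + right
        lexicon.append(merged)
        # Rewrite the sequence left to right, replacing each occurrence
        # of the chosen pair with the merged token.
        out, i = [], 0
        while i < len(seq):
            if i + 1 < len(seq) and seq[i] == left and seq[i + 1] == right:
                out.append(merged)
                i += 2
            else:
                out.append(seq[i])
                i += 1
        seq = out
    return lexicon, seq


# Hypothetical symbol stream, e.g. cluster labels from the first step.
lexicon, tokens = extract_lexicon(list("abcabcabc"))
```

In this toy run the repeated unit `('a', 'b', 'c')` ends up in the lexicon, which loosely mirrors the idea of recovering "words" (primitive gestures) from an unsegmented symbol stream. A compression-based method would presumably choose merges by description-length gain rather than by raw pair frequency.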

Author information

Authors and Affiliations

Artificial Intelligence and Robotics Lab, Xi’an Jiaotong University, Xi’an 710049, P.R. China
Tian-Shu Wang & Nan-Ning Zheng
Microsoft Research China, No. 49 Zhichun Road, Haidian District, Beijing 100086, P.R. China
Heung-Yeung Shum & Ying-Qing Xu

Editor information

Editors and Affiliations

Microsoft Research China, 5/F Beijing Sigma Center, 49 Zhichun Road, Haidian District, Beijing 100080, China
Heung-Yeung Shum
Institute of Information Science, Academia Sinica, Taiwan
Mark Liao
Department of Electrical Engineering, Columbia University, New York, NY, 10027, USA
Shih-Fu Chang

About this paper

Cite this paper

Wang, TS., Shum, HY., Xu, YQ., Zheng, NN. (2001). Unsupervised Analysis of Human Gestures. In: Shum, HY., Liao, M., Chang, SF. (eds) Advances in Multimedia Information Processing — PCM 2001. PCM 2001. Lecture Notes in Computer Science, vol 2195. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45453-5_23

DOI: https://doi.org/10.1007/3-540-45453-5_23
Published: 20 November 2001
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42680-6
Online ISBN: 978-3-540-45453-3
eBook Packages: Springer Book Archive
