Boosted Multiple Deformable Trees for Parsing Human Poses

Wang, Yang; Mori, Greg

doi:10.1007/978-3-540-75703-0_2

Yang Wang¹ &
Greg Mori¹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 4814))

Included in the following conference series:

Workshop on Human Motion

1609 Accesses
2 Citations

Abstract

Tree-structured models have been widely used for human pose estimation, in either 2D or 3D. While such models allow efficient learning and inference, they fail to capture additional dependencies between body parts, other than kinematic constraints. In this paper, we consider the use of multiple tree models, rather than a single tree model for human pose estimation. Our model can alleviate the limitations of a single tree-structured model by combining information provided across different tree models. The parameters of each individual tree model are trained via standard learning algorithms in a single tree-structured model. Different tree models are combined in a discriminative fashion by a boosting procedure. We present experimental results showing the improvement of our model over previous approaches on a very challenging dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Crandell, D., Felzenszwalb, P.F., Huttenlocher, D.P.: Spatial priors for part-based recognition using statistical models. In: IEEE Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 10–17 (2005)
Google Scholar
Felzenszwalb, P.F., Huttenlocher, D.P.: Pictorial structures for object recognition. International Journal of Computer Vision 61(1), 55–79 (2003)
Article Google Scholar
Forsyth, D.A., Arikan, O., Ikemoto, L., O’Brien, J., Ramanan, D.: Computational studies of human motion: Part 1, tracking and motion synthesis. Foundations and Trends in Computer Graphics and Vision 1(2/3), 77–254 (2006)
Article Google Scholar
Hogg, D.: Model-based vision: a program to see a walking person. Image and Vision Computing 1(1), 5–20 (1983)
Article Google Scholar
Hua, G., Yang, M.H., Wu, Y.: Learning to estimate human pose with data driven belief propagation. In: IEEE Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 747–754 (2005)
Google Scholar
Ioffe, S., Forsyth, D.: Finding people by sampling. In: IEEE International Conference on Computer Vision, vol. 2, pp. 1092–1097 (1999)
Google Scholar
Ju, S.X., Black, M.J., Yaccob, Y.: Cardboard people: A parameterized model of articulated image motion. In: International Conference on Automatic Face and Gesture Recognition, pp. 38–44 (1996)
Google Scholar
Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: ICML. International Conference on Machine Learning, pp. 282–289 (2001)
Google Scholar
Lan, X., Huttenlocher, D.P.: Beyond trees: Common-factor models for 2d human pose recovery. In: IEEE International Conference on Computer Vision, vol. 1, pp. 470–477 (2005)
Google Scholar
Lee, M.W., Cohen, I.: Proposal maps driven mcmc for estimating human body pose in static images. In: IEEE Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 334–341 (2004)
Google Scholar
Meila, M., Jordan, M.I.: Learing with mixtures of trees. Journal of Machine Learning Research 1, 1–48 (2000)
Article MathSciNet Google Scholar
Mori, G.: Guiding model search using segmentation. In: IEEE International Conference on Computer Vision, vol. 2, pp. 1417–1423 (2005)
Google Scholar
Mori, G., Malik, J.: Estimating human body configurations using shape context matching. In: European Conference on Computer Vision, vol. 3, pp. 666–680 (2002)
Google Scholar
Mori, G., Ren, X., Efros, A., Malik, J.: Recovering human body configuration: Combining segmentation and recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 326–333 (2004)
Google Scholar
Ramanan, D.: Learning to parse images of articulated bodies. In: Advances in Neural Information Processing Systems, vol. 19, pp. 1129–1136 (2007)
Google Scholar
Ramanan, D., Forsyth, D.A., Zisserman, A.: Strike a pose: Tracking people by finding stylized poses. In: IEEE Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 271–278 (2005)
Google Scholar
Ramanan, D., Sminchisescu, C.: Training deformable models for localization. In: IEEE Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 206–213 (2006)
Google Scholar
Ren, X., Berg, A., Malik, J.: Recovering human body configurations using pairwise constraints between parts. In: IEEE International Conference on Computer Vision, vol. 1, pp. 824–831 (2005)
Google Scholar
Shakhnarovich, G., Viola, P., Darrell, T.: Fast pose estimation with parameter sensitive hashing. In: IEEE International Conference on Computer Vision, vol. 2, pp. 750–757 (2003)
Google Scholar
Sigal, L., Black, M.J.: Measure locally, reason globally: Occlusion-sensitive articulated pose estimation. In: IEEE Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 2041–2048 (2006)
Google Scholar
Song, Y., Goncalves, L., Perona, P.: Unsupervised learning of human motion. IEEE Transaction on Pattern Analysis and Machine Intelligence 25(7), 814–827 (2003)
Article Google Scholar
Srinivasan, P., Shi, J.: Bottom-up recognition and parsing of the human body. In: IEEE Conference on Computer Vision and Pattern Recognition, IEEE Computer Society Press, Los Alamitos (2007)
Google Scholar
Sudderth, E.B., Mandel, M.I., Freeman, W.T., Willsky, A.S.: Distributed occlusion reasoning for tracking with nonparametric belief propagation. In: Advances in Neural Information Processing Systems, pp. 1369–1376. MIT Press, Cambridge (2004)
Google Scholar
Sullivan, J., Carlsson, S.: Recognizing and tracking human action. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2352, pp. 629–644. Springer, Heidelberg (2002)
Google Scholar
Torralba, A., Murphy, K.P., Freeman, W.T.: Contextual models for object detection using boosted random fields. In: Advances in Neural Information Processing Systems, vol. 17, pp. 1401–1408. MIT Press, Cambridge (2005)
Google Scholar
Toyama, K., Blake, A.: Probabilistic exemplar-based tracking in a metric space. In: IEEE International Conference on Computer Vision, vol. 2, pp. 50–57 (2001)
Google Scholar
Truyen, T.T., Phung, D.Q., Bui, H.H., Venkatesh, S.: AdaBoost.MRF: Boosted markov random forests and application to multilevel activity recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 1686–1693 (2006)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computing Science, Simon Fraser University, Burnaby, BC, Canada
Yang Wang & Greg Mori

Authors

Yang Wang
View author publications
You can also search for this author in PubMed Google Scholar
Greg Mori
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Ahmed Elgammal Bodo Rosenhahn Reinhard Klette

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wang, Y., Mori, G. (2007). Boosted Multiple Deformable Trees for Parsing Human Poses. In: Elgammal, A., Rosenhahn, B., Klette, R. (eds) Human Motion – Understanding, Modeling, Capture and Animation. HuMo 2007. Lecture Notes in Computer Science, vol 4814. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75703-0_2

Download citation

DOI: https://doi.org/10.1007/978-3-540-75703-0_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-75702-3
Online ISBN: 978-3-540-75703-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics