Improved Human Parsing with a Full Relational Model

Tran, Duan; Forsyth, David

doi:10.1007/978-3-642-15561-1_17

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 6314)

Included in the following conference series:

European Conference on Computer Vision

Abstract

We show quantitative evidence that a full relational model of the body performs better at upper-body parsing than the standard tree model, despite the need to adopt approximate inference and learning procedures. Our method uses an approximate search for inference and an approximate structure learning method for learning. We compare our method to state-of-the-art methods on our dataset (which depicts a wide range of poses), on the standard Buffy dataset, and on the recently published reduced PASCAL dataset. Our results suggest that the Buffy dataset overemphasizes poses where the arms hang down, which leads to generalization problems.
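
To make the contrast between a tree model and a full relational model concrete, the following is a minimal illustrative sketch and not the authors' implementation. It assumes each body part chooses one of a few candidate locations, that a configuration is scored by per-part terms plus pairwise terms over a set of edges, and that approximate inference is done by simple coordinate ascent (re-optimising one part at a time). The function names, the star-shaped tree, the toy positions, and the distance-based pairwise term are all hypothetical choices made for illustration.

```python
from itertools import combinations


def tree_edges(n):
    """Star-shaped tree (assumption): part 0 is linked to every other part."""
    return [(0, q) for q in range(1, n)]


def full_edges(n):
    """Full relational model: every pair of parts is linked."""
    return list(combinations(range(n), 2))


def score(config, unary, pairwise, edges):
    """Sum of per-part scores plus pairwise scores over the given edges."""
    total = sum(unary[p][config[p]] for p in range(len(config)))
    total += sum(pairwise(p, config[p], q, config[q]) for p, q in edges)
    return total


def coordinate_ascent(unary, pairwise, edges, init):
    """Approximate inference: change one part at a time while the score improves.

    Each accepted change strictly increases the score, so the loop ends at a
    configuration that no single-part change can improve (a local optimum).
    """
    config = list(init)
    changed = True
    while changed:
        changed = False
        for p in range(len(config)):
            current = score(config, unary, pairwise, edges)
            for c in range(len(unary[p])):
                trial = config[:p] + [c] + config[p + 1:]
                trial_score = score(trial, unary, pairwise, edges)
                if trial_score > current:
                    config, current, changed = trial, trial_score, True
    return config


# Toy data (hypothetical): 4 parts, 2 candidate 1-D positions per part.
positions = [[0, 5], [1, 6], [0, 7], [4, 5]]
unary = [[1.0, 0.5], [0.2, 0.9], [0.8, 0.1], [0.3, 0.4]]


def pairwise(p, cp, q, cq):
    """Toy relational term (assumption): linked parts prefer to lie close together."""
    return -0.1 * abs(positions[p][cp] - positions[q][cq])


start = [0, 0, 0, 0]
tree_result = coordinate_ascent(unary, pairwise, tree_edges(4), start)
full_result = coordinate_ascent(unary, pairwise, full_edges(4), start)
print("tree model:", tree_result)
print("full model:", full_result)
```

With a tree, the best configuration can be found exactly by dynamic programming; the coordinate ascent above is used for both edge sets only to keep the sketch short. With all pairs linked, exact inference is generally intractable, which is why an approximate search of some kind is needed.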

Author information

Authors and Affiliations

University of Illinois at Urbana-Champaign, USA
Duan Tran & David Forsyth

Editor information

Editors and Affiliations

GRASP Laboratory, University of Pennsylvania, 3330 Walnut Street, 19104, Philadelphia, PA, USA
Kostas Daniilidis
School of Electrical and Computer Engineering, National Technical University of Athens, 15773, Athens, Greece
Petros Maragos
Department of Applied Mathematics, Ecole Centrale de Paris, Grande Voie des Vignes, 92295, Chatenay-Malabry, France
Nikos Paragios

About this paper

Cite this paper

Tran, D., Forsyth, D. (2010). Improved Human Parsing with a Full Relational Model. In: Daniilidis, K., Maragos, P., Paragios, N. (eds) Computer Vision – ECCV 2010. ECCV 2010. Lecture Notes in Computer Science, vol 6314. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15561-1_17

DOI: https://doi.org/10.1007/978-3-642-15561-1_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15560-4
Online ISBN: 978-3-642-15561-1
eBook Packages: Computer Science, Computer Science (R0)
