Abstract
Human pose estimation from monocular video streams is a challenging problem. Much of the work on this problem has focused on developing inference algorithms and probabilistic prior models based on learned measurements. Such algorithms face challenges in generalisation beyond the learned dataset. We propose an interactive model-based generative approach for estimating the human pose from uncalibrated monocular video in unconstrained sports TV footage. Belief propagation over a spatio-temporal graph of candidate body part hypotheses is used to estimate a temporally consistent pose between user-defined keyframe constraints. Experimental results show that the proposed generative pose estimation framework is capable of estimating pose even in very challenging unconstrained scenarios.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Agarwal A, Triggs B (2006) Recovering 3d human pose from monocular images. IEEE Trans Pattern Anal Mach Intell Proc 28(1):44–58
Agarwal P, Kumar S, Ryde J, Corso J, Krovi V (2012) Estimating human dynamics on-the-fly using monocular video for pose estimation. In: Proceedings of robotics: science and systems, Sydney, Australia, July 2012
Andriluka M, Roth S, Schiele B (2009) Pictorial structures revisited: people detection and articulated pose estimation. In: IEEE conference on computer vision and pattern recognition
Andriluka M, Roth S, Schiele B (2010) Monocular 3d pose estimation and tracking by detection. In: IEEE conference on computer vision and pattern recognition (CVPR), San Francisco, USA, 06/2010
Bissacco A, Yang M-H, Soatto S (2007) Fast human pose estimation using appearance and motion via multi-dimensional boosting regression. In: IEEE conference on computer vision and pattern recognition, June, pp 1–8
Felzenszwalb PF, Huttenlocher DP (2005) Pictorial structures for object recognition. Int J Comput Vis 61(1):55–79
Ferrari V, Marin-Jimenez M, Zisserman A (2008) Progressive search space reduction for human pose estimation. In: IEEE conference on computer vision and pattern recognition, pp 1–8
Ferrari V, Marn-jimnez M, Zisserman A (2009) 2d human pose estimation in TV shows. In: In dagstuhl post-proceedings
Jiang H (2009) Human pose estimation using consistent max-covering. In: International conference on computer vision
Lan X, Huttenlocher D (2004) A unified spatio-temporal articulated model for tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, June-2 July, vol 1, pp I-722–I-729
Moeslund TB, Hilton A, Krüger V, Sigal L (eds) (2011) Visual analysis of humans—looking at people. Springer, New York
Mooij JM (2010) libDAI: a free and open source C++ library for discrete approximate inference in graphical models. J Mach Learn Res 11:2169–2173
Mori G, Ren X, Efros AA, Malik J (2004) Recovering human body configurations: combining segmentation and recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 326–333
Pishchulin L, Jain A, Andriluka M, Thormaehlen T, Schiele B (2012) Articulated people detection and pose estimation: reshaping the future. In: 2012 IEEE conference on computer vision and pattern recognition (CVPR), Providence, United States, June 2012. IEEE, pp 1–8
Ren X, Berg AC, Malik J (2005) Recovering human body configurations using pairwise constraints between parts, vol 1, pp 824–831
Roberts TJ, McKenna SJ, Ricketts IW (2007) Human pose estimation using partial configurations and probabilistic regions. Int J Comput Vis Proc 73(3):285–306
Sapp B, Weiss D, Taskar B (2011) Parsing human motion with stretchable models. In: CVPR
Srinivasan P, Shi J (2007) Bottom-up recognition and parsing of the human body. In: Proceedings of the IEEE computer vision and pattern recognition, June 2007, pp 1–8
Yedidia JS, Freeman WT, Weiss Y (2000) Generalized belief propagation. In: IN NIPS 13. MIT Press, Cambridge, pp 689–695
Zhao T, Nevatia R (2003) Bayesian human segmentation in crowded situations. In: Proceedings of the IEEE conference on computer vision and pattern recognition, vol 2, June 2003, pp II-459-66
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Fastovets, M., Guillemaut, JY., Hilton, A. (2014). Estimating Athlete Pose from Monocular TV Sports Footage. In: Moeslund, T., Thomas, G., Hilton, A. (eds) Computer Vision in Sports. Advances in Computer Vision and Pattern Recognition. Springer, Cham. https://doi.org/10.1007/978-3-319-09396-3_8
Download citation
DOI: https://doi.org/10.1007/978-3-319-09396-3_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09395-6
Online ISBN: 978-3-319-09396-3
eBook Packages: Computer ScienceComputer Science (R0)