Abstract
This paper presents a novel solution to the difficult task of both detecting and estimating the 3D pose of humans in monoscopic images. The approach consists of two parts. Firstly the location of a human is identified by a probabalistic assembly of detected body parts. Detectors for the face, torso and hands are learnt using adaBoost. A pose likliehood is then obtained using an a priori mixture model on body configuration and possible configurations assembled from available evidence using RANSAC. Once a human has been detected, the location is used to initialise a matching algorithm which matches the silhouette and edge map of a subject with a 3D model. This is done efficiently using chamfer matching, integral images and pose estimation from the initial detection stage. We demonstrate the application of the approach to large, cluttered natural images and at near framerate operation (16fps) on lower resolution video streams.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Barrow, H., Tenenbaum, J., Bolles, R., Wolf, H.: Parametric correspondence and chamfer matching: Two new techniques for image matching. In: Proc. of Joint Conf. Artificial Intelligence, pp. 659–663 (1977)
Felzenszwalb, P., Hurrenlocher, D.: Distance transforms of sampled functions. Technical Report TR2004-1963, Cornell Computing and Information Science (2004)
Felzenszwalb, P.F., Huttenlocher, D.P.: Efficient matching of pictorial structures. In: Proc. of CVPR, vol. 2, pp. 66–73 (2000)
Fischler, M.A., Bolles, R.C.: Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography. Comm. of the ACM 24, 381–395 (1981)
Howe, N., Leventon, M., Freeman, W.: Bayesian reconstruction of 3d human motion from single camera video. Advances in Neural Information Processing Systems 12, 820–826 (2000)
Ioffe, S., Forsyth, D.: Probabilistic methods for finding people. International Journal of Computer Vision 43(1), 45–68 (2001)
Micilotta, A.S., Bowden, R.: View-based location and tracking of body parts for visual interaction. In: Proc. of British Machine Vision Conference, September 2004, vol. 2, pp. 849–858 (2004)
Mikolajczyk, K., Schmid, C., Zisserman, A.: Human detection based on a probabilistic assembly of robust body part detectors. In: Pajdla, T., Matas, J(G.) (eds.) ECCV 2004. LNCS, vol. 3021, pp. 69–82. Springer, Heidelberg (2004)
Mohan, A., Papageorgiou, C., Poggio, T.: Example-based object detection in images by components. IEEE Transactions on PAMI 23(4), 349–361 (2001)
Triggs, B., Ronfard, R., Schmid, C.: Learning to parse pictures of people. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2353, pp. 700–707. Springer, Heidelberg (2002)
Roberts, T., McKenna, S., Ricketts, I.: Human pose estimation using learnt probabilistic region similarities and partial configurations. In: Pajdla, T., Matas, J(G.) (eds.) ECCV 2004. LNCS, vol. 3024, pp. 291–303. Springer, Heidelberg (2004)
Rowley, H.A., Baluja, S., Kanade, T.: Neural network-based face detection. IEEE Transactions on PAMI 20(1), 23–38 (1998)
Sigal, L., Isard, M., Sigelman, B., Black, M.: Attractive people: Assembling looselimbed models using non-parametric belief propagation. Proc. of Advances in Neural Information Processing Systems 16, 1539–1546 (2003)
Stenger, B., Thayananthan, A., Torr, P., Cipolla, R.: Hand pose estimation using hierarchical detection. In: Workshop on Human Computer Interaction, pp. 105–116 (2004)
Viola, P., Jones, M.: Robust real-time object detection. In: Proc. of IEEE Workshop on Statistical and Computational Theories of Vision (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Micilotta, A.S., Ong, EJ., Bowden, R. (2006). Real-Time Upper Body Detection and 3D Pose Estimation in Monoscopic Images. In: Leonardis, A., Bischof, H., Pinz, A. (eds) Computer Vision – ECCV 2006. ECCV 2006. Lecture Notes in Computer Science, vol 3953. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11744078_11
Download citation
DOI: https://doi.org/10.1007/11744078_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33836-9
Online ISBN: 978-3-540-33837-6
eBook Packages: Computer ScienceComputer Science (R0)