Skip to main content
Log in

Three-Dimensional Human Body Model Acquisition from Multiple Views

  • Published:
International Journal of Computer Vision Aims and scope Submit manuscript

Abstract

We present a novel approach to the three-dimensional human body model acquisition from three mutually orthogonal views. Our technique is based on the spatiotemporal analysis of the deforming apparent contour of a human moving according to a protocol of movements. For generality and robustness our technique does not use a prior model of the human body and a prior body part segmentation is not assumed. Therefore, our technique applies to humans of any anthropometric dimension. To parameterize and segment over time a deforming apparent contour, we introduce a new shape representation technique based on primitive composition. The composed deformable model allows us to represent large local deformations and their evolution in a compact and intuitive way. In addition, this representation allows us to hypothesize an underlying part structure and test this hypothesis against the relative motion (due to forces exerted from the image data) of the defining primitives of the composed model. Furthermore, we develop a Human Body Part Decomposition Algorithm (HBPDA) that recovers all the body parts of a subject by monitoring the changes over time to the shape of the deforming silhouette. In addition, we modularize the process of simultaneous two-dimensional part determination and shape estimation by employing the Supervisory Control Theory of Discrete Event Systems. Finally, we present a novel algorithm which selectively integrates the (segmented by the HBPDA) apparent contours from three mutually orthogonal viewpoints to obtain a three-dimensional model of the subject's body parts. The effectiveness of the approach is demonstrated through a series of experiments where a subject performs a set of movements according to a protocol that reveals the structure of the human body.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Akita, K. 1984. Image sequence analysis of realworld human motion. Pattern Recognition, 17:73-83.

    Article  Google Scholar 

  • Azuola, F., Badler, N.I., Ho, P, Kakadiaris, I.A., Metaxas, D., and Ting, B. 1994. Building anthropometry-based virtual human models. In Proceedings of the IMAGE VII Society Conference, Tucson, AZ.

  • Barr, A.H. 1981. Superquadrics and angle-preserving transformations. IEEE Computer Graphics and Applications, 1(1):11-23.

    Google Scholar 

  • Barr, A.H. 1984. Global and local deformations of solid primitives. Computer Graphics, 18(3):21-30.

    Google Scholar 

  • Besl, P. and McKay, N.D. 1992. A method for registration of 3-D shapes. IEEE Transactions on Pattern Analysis and Machine Intelligence, 14(2).

  • Canny, J. 1986. A computational approach to edge detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 8(6):679-698.

    Google Scholar 

  • Gavrila, D.M. and Davis, L.S. 1996. 3-D model-based tracking of humans in action: A multi-view approach. In Proceedings of the 1996 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, pp. 73-80, IEEE Computer Society Press: New York, NY.

    Google Scholar 

  • Goncalves, L., Di Bernardom, E., Ursella, E., and Perona, P. 1995. Monocular tracking of the human arm in 3D. In Proceedings of the Fifth International Conference on Computer Vision, Boston, MA, pp. 764-770.

  • Hogg, D. 1983. Model-based vision: A program to see a walking person. Image and Vision Computing, 1(1):5-20.

    Article  Google Scholar 

  • Kakadiaris, I.A. 1997. Motion-based part segmentation, shape and motion estimation of multi-part objects. Ph.D. Dissertation, Department of Computer and Information Science, University of Pennsylvania, Philadelphia, PA.

    Google Scholar 

  • Kakadiaris, I.A. and Metaxas, D. 1995. 3D Human body model acquisition from multiple views. In Proceedings of the Fifth International Conference on Computer Vision, Boston, MA, pp. 618-623.

  • Kakadiaris, I.A. and Metaxas, D. 1996. Model-based estimation of 3D human motion with occlusion based on active multi-viewpoint selection. In Proceedings of the 1996 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, pp. 81-87.

  • Kakadiaris, I.A., Metaxas, D., and Bajcsy, R. 1994. Active part-decomposition, shape and motion estimation of articulated objects: A physics-based approach. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Seattle, WA. pp. 980-984.

  • Kakadiaris, I.A., Metaxas, D., and Bajcsy, R. 1997. Inferring 2D object structure from the deformation of apparent contours. Computer Vision and Image Understanding, 65(2):129-147.

    Article  Google Scholar 

  • Leung, M.K. and Yang, Y.H. 1987a. Human body motion segmentation in a complex scene. Pattern Recognition, 20(1):55-64.

    Article  Google Scholar 

  • Leung, M.K. and Yang, Y.H. 1987b. A region based approach for human body motion analysis. Pattern Recognition, 20(3):321- 339.

    Article  Google Scholar 

  • Leung, M.K. and Yang, Y.H. 1995. First sight: A human body outline labeling system. IEEE Transactions on Pattern Analysis and Machine Intelligence, 17(4):369-377.

    Article  Google Scholar 

  • Mann, R., Jepson, A., and Siskind, J.M. 1996. Richard Mann, Allan Jepson, and Jeffrey Mark Siskind. Computational perception of scene dynamics. In Proc. of the Fourth European Conference on Computer Vision, Cambridge, UK, Bernard Buxton and Robert Cipola, (Eds.), Lecture Notes in Computer Science, Springer, pp. II:528-539.

  • Metaxas, D. 1992. Physics-based modeling of nonrigid objects for vision and graphics. Ph.D. Dissertation, Department of Computer Science, University of Toronto.

  • Metaxas, D. and Terzopoulos, D. 1993. Shape and nonrigid motion estimation through physics-based synthesis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 15(6):580-591.

    Article  Google Scholar 

  • NASA. 1978. Anthropometric source book. volume II: A handbook of anthropometric data. Technical Report NASA Reference Publication 1024, NASA Scientific and Technical Information Office, Johnson Space Center, Houston, TX.

    Google Scholar 

  • O’Rourke, J. and Badler, N.I. 1980. Model-based image analysis of human motion using constraint propagation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2(6):522-536.

    Google Scholar 

  • Pito, R.A. 1996. Mesh integration based on comeasurement. In IEEE International Conference on Image Processing, Vienna, Austria, pp. II:397-400.

    Google Scholar 

  • Prasad, M. 1991. Intersection of line segments. In Graphics Gems II, James Arvo (Ed.), Academic Press.

  • Press, W.H., Teukolsky, S.A., Vetterling, W.T., and Flannery, B.P. 1992. Numerical Recipes in C: The Art of Scientific Computing. Cambridge University Press.

  • Ramadge, P.J. and Wonham, W.M. 1989. The control of discrete event systems. Proceedings of the IEEE, 77(1):81-97.

    Article  Google Scholar 

  • Rehg, J.M. and Kanade, T. 1994. Visual tracking of high DOF articulated structures: An application to human hand tracking. In Proceedings of the Third European Conference on Computer Vision, Jan-Olof Eklundh (Ed.), Stockholm, Sweden, pp. 35-46.

  • Rehg, J.M. and Kanade, T. 1995. Model-based tracking of selfoccluding articulated objects. In Proceedings of the Fifth International Conference on Computer Vision, Boston, MA, pp. 612-617.

  • Rohr, K. 1994. Towards model-based recognition of human movements in image sequences. Computer Vision, Graphics, and Image Processing: Image Understanding, 59(1):94-115.

    Google Scholar 

  • Russell, K., Starner, T., and Pentland, A. 1995. Unencumbered virtual environments. In IJCAI-95 Workshop on Entertainment and AI/Alife, IEEE Computer Society Press.

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kakadiaris, I.A., Metaxas, D. Three-Dimensional Human Body Model Acquisition from Multiple Views. International Journal of Computer Vision 30, 191–218 (1998). https://doi.org/10.1023/A:1008071332753

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1008071332753

Navigation