Model-Based Human Motion Capture from Monocular Video Sequences
Generating and capturing the motion of an articulated body for computer animation is an expensive and time-consuming task. Conventionally, animators generate the intermediate frames between key frames by hand, which is very labor-intensive. This paper presents a model-based approach, singularity-free and automatically initialized, to capturing human motion from widely available monocular video sequences with a static background. A 3D human body model is built and projected onto a 2D projection plane to find the best fit with the foreground image silhouette. We convert the human motion capture problem into two types of parameter optimization problems: static optimization and dynamic optimization. First, we determine the model body configuration for every input image using static optimization. Then, to obtain a better description of the motion, the results of all static optimizations are fed into a dynamic optimization process in which the entire motion sequence is considered with respect to the user-specified motion, that is, the final form of the motion that the user wants. The cost function for static optimization estimates the degree of overlap between the foreground silhouette of the input image and the silhouette of the projected 3D model body. The overlap is computed using computational geometry, by converting a set of pixels in the image domain to a polygon in the real-valued projection plane domain. The cost function for dynamic optimization expresses the user-specified motion in terms of the static optimization results as well as image fitting. Our method is used to capture various human motions: walking, pushing, kicking, and handshaking.
Keywords: Input Image; Dynamic Optimization; Model Body; Computer Animation; Parameter Optimization Problem
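The polygon-based overlap measure described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact form of the static cost function, so everything here is an assumption made for illustration: the function names (`polygon_area`, `clip_convex`, `overlap_cost`) are hypothetical, the cost is taken to be the area of the non-overlapping regions, and the projected model silhouette is simplified to a single convex polygon with counter-clockwise vertices so that Sutherland-Hodgman clipping applies.

```python
def polygon_area(poly):
    """Unsigned polygon area via the shoelace formula."""
    total = 0.0
    n = len(poly)
    for i in range(n):
        x1, y1 = poly[i]
        x2, y2 = poly[(i + 1) % n]
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0


def clip_convex(subject, clip):
    """Sutherland-Hodgman clipping of `subject` against `clip`.

    Assumption: `clip` is convex with counter-clockwise vertices.
    """
    def inside(p, a, b):
        # p lies on or to the left of the directed edge a -> b
        return (b[0] - a[0]) * (p[1] - a[1]) - (b[1] - a[1]) * (p[0] - a[0]) >= 0

    def intersect(p, q, a, b):
        # Intersection of segment p-q with the infinite line through a-b
        dx1, dy1 = q[0] - p[0], q[1] - p[1]
        dx2, dy2 = b[0] - a[0], b[1] - a[1]
        denom = dx1 * dy2 - dy1 * dx2
        t = ((a[0] - p[0]) * dy2 - (a[1] - p[1]) * dx2) / denom
        return (p[0] + t * dx1, p[1] + t * dy1)

    output = list(subject)
    n = len(clip)
    for i in range(n):
        a, b = clip[i], clip[(i + 1) % n]
        input_list, output = output, []
        if not input_list:
            break  # nothing left to clip: the polygons do not overlap
        prev = input_list[-1]
        for cur in input_list:
            if inside(cur, a, b):
                if not inside(prev, a, b):
                    output.append(intersect(prev, cur, a, b))
                output.append(cur)
            elif inside(prev, a, b):
                output.append(intersect(prev, cur, a, b))
            prev = cur
    return output


def overlap_cost(image_silhouette, model_silhouette):
    """Hypothetical static cost: area covered by exactly one silhouette.

    It is zero when the two polygons coincide and grows as they drift apart.
    """
    shared = polygon_area(clip_convex(image_silhouette, model_silhouette))
    return (polygon_area(image_silhouette)
            + polygon_area(model_silhouette)
            - 2.0 * shared)


# Two 2x2 squares offset by (1, 1); they share a 1x1 region.
model = [(0, 0), (2, 0), (2, 2), (0, 2)]
image = [(1, 1), (3, 1), (3, 3), (1, 3)]
cost = overlap_cost(image, model)
```

In a static optimization of this kind, the model's pose parameters would be adjusted to drive such a cost toward zero for each frame. A real body silhouette is not convex, so a general polygon-clipping routine would be needed in place of the convex clipper used here.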