International Journal of Computer Vision, Volume 88, Issue 2, pp 214-237

Learning Articulated Structure and Motion

  • David A. Ross, University of Toronto (email author)
  • Daniel Tarlow, University of Toronto
  • Richard S. Zemel, University of Toronto


Humans demonstrate a remarkable ability to parse complicated motion sequences into their constituent structures and motions. We investigate this problem, attempting to learn the structure of one or more articulated objects, given a time series of two-dimensional feature positions. We model the observed sequence in terms of “stick figure” objects, under the assumption that the relative joint angles between sticks can change over time, but their lengths and connectivities are fixed. The problem is formulated as a single probabilistic model that includes multiple sub-components: associating the features with particular sticks, determining the proper number of sticks, and finding which sticks are physically joined. We test the algorithm on challenging datasets of 2D projections of optical human motion capture and feature trajectories from real videos.
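
As a rough illustration of the input and of the stick-figure assumption described above, the following sketch generates a time series of two-dimensional feature positions for a planar figure made of two sticks joined end to end. It is not the probabilistic model from the article: the function names, the two-stick layout, and the motion parameters are all hypothetical, and the figure moves entirely within the image plane, whereas real data are 2D projections of 3D motion. Under that simplifying assumption, features on the same stick keep a constant distance while features on different sticks do not, which hints at why feature-to-stick association is recoverable from trajectories alone.

```python
import math


def point_on(origin, theta, dist):
    """Point at distance `dist` from `origin` along direction `theta`."""
    return (origin[0] + dist * math.cos(theta),
            origin[1] + dist * math.sin(theta))


def simulate_two_stick_figure(num_frames=5, len_a=2.0, len_b=1.5):
    """Return a list of frames, each a list of four (x, y) feature positions.

    Hypothetical setup: features 0 and 1 lie on stick A, features 2 and 3
    lie on stick B, and stick B is joined to the far end of stick A.
    Stick lengths and connectivity are fixed; only the angles change.
    """
    frames = []
    for t in range(num_frames):
        theta_a = 0.3 * t            # orientation of stick A in this frame
        theta_b = theta_a + 0.5 * t  # stick B: A's orientation plus the joint angle
        root = (0.1 * t, 0.0)        # the whole figure also translates
        joint = point_on(root, theta_a, len_a)
        frames.append([
            point_on(root, theta_a, 0.25 * len_a),
            point_on(root, theta_a, 0.75 * len_a),
            point_on(joint, theta_b, 0.25 * len_b),
            point_on(joint, theta_b, 0.75 * len_b),
        ])
    return frames


def distance_spread(frames, i, j):
    """Largest minus smallest distance between features i and j over time."""
    dists = [math.dist(frame[i], frame[j]) for frame in frames]
    return max(dists) - min(dists)


frames = simulate_two_stick_figure()
same_stick = distance_spread(frames, 0, 1)   # both features on stick A
cross_stick = distance_spread(frames, 1, 2)  # features on different sticks
print(f"same-stick spread:  {same_stick:.6f}")
print(f"cross-stick spread: {cross_stick:.6f}")
```

In this toy setting the same-stick spread is zero up to floating-point error, while the cross-stick spread is clearly nonzero. With projected 3D data the same-stick distances are no longer constant in the image, which is one reason a simple distance test like this would not suffice on the datasets the abstract mentions.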


Keywords: Structure from motion; Graphical models; Non-rigid motion