International Journal of Computer Vision

, Volume 75, Issue 2, pp 267–282

POP: Patchwork of Parts Models for Object Recognition


    • Department of Statistics and the Department of Computer ScienceUniversity of Chicago
  • Alain Trouvé
    • CMLA at the Ecole Normale Superieur, Cachan

DOI: 10.1007/s11263-006-0033-9

Cite this article as:
Amit, Y. & Trouvé, A. Int J Comput Vis (2007) 75: 267. doi:10.1007/s11263-006-0033-9


We formulate a deformable template model for objects with an efficient mechanism for computation and parameter estimation. The data consists of binary oriented edge features, robust to photometric variation and small local deformations. The template is defined in terms of probability arrays for each edge type. A primary contribution of this paper is the definition of the instantiation of an object in terms of shifts of a moderate number local submodels—parts—which are subsequently recombined using a patchwork operation, to define a coherent statistical model of the data. Object classes are modeled as mixtures of patchwork of parts POP models that are discovered sequentially as more class data is observed. We define the notion of the support associated to an instantiation, and use this to formulate statistical models for multi-object configurations including possible occlusions. All decisions on the labeling of the objects in the image are based on comparing likelihoods. The combination of a deformable model with an efficient estimation procedure yields competitive results in a variety of applications with very small training sets, without need to train decision boundaries—only data from the class being trained is used. Experiments are presented on the MNIST database, reading zipcodes, and face detection.


deformable modelsmodel estimationmulti-object configurationsobject detection
Download to read the full article text

Copyright information

© Springer Science+Business Media, LLC 2007