Skip to main content

Data Decomposition and Spatial Mixture Modeling for Part Based Model

  • Conference paper
Computer Vision – ACCV 2012 (ACCV 2012)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7724))

Included in the following conference series:

Abstract

This paper presents a system of data decomposition and spatial mixture modeling for part based models. Recently, many enhanced part based models (with e.g., multiple features, more components or parts) have been proposed. Nevertheless, those enhanced models bring high computation cost together with the risk of over-fitting. To tackle this problem, we propose a data decomposition method for part based models which not only accelerates training and testing process but also improves the performance on average. Besides, the original part based model uses a strict rigid structural model to describe the distribution of each part location. It is not “deformable” enough, especially for those instances with different viewpoints or poses in the same aspect ratio. To address this problem, we present a novel spatial mixture modeling method. The spatial mixture embedded model is then integrated into the proposed data decomposition framework. We evaluate our system on the challenging PASCAL VOC2007 and PASCAL VOC2010 datasets, demonstrating the state-of-the-art performance compared with other related methods in terms of accuracy and efficiency.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Felzenszwalb, P., Girshick, R., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. TPAMI 32, 1627–1645 (2010)

    Article  Google Scholar 

  2. Schnitzspan, P., Roth, S., Schiele, B.: Automatic discovery of meaningful object parts with latent crfs. In: CVPR, pp. 121–128 (2010)

    Google Scholar 

  3. Zhang, J., Yu, Y., Huang, K., Tan, T.: Boosted Local Structured HOG-LBP for Object Localization. In: CVPR, pp. 1393–1400 (2011)

    Google Scholar 

  4. Zhu, L., Chen, Y., Yuille, A.L., Freeman, W.T.: Latent hierarchical structural learning for object detection. In: CVPR, pp. 1062–1069 (2010)

    Google Scholar 

  5. Yang, Y., Ramanan, D.: Articulated pose estimation with flexible mixtures-of-parts. In: CVPR, pp. 1385–1392

    Google Scholar 

  6. Schnitzspan, P., Fritz, M., Roth, S., Schiele, B.: Discriminative structure learning of hierarchical representations for object detection. In: CVPR, pp. 2238–2245 (2009)

    Google Scholar 

  7. Fischler, M., Elschlager, R.: The representation and matching of pictorial structures. IEEE Transactions on Computers C-22, 67–92 (1973)

    Article  Google Scholar 

  8. Marr, D., Nishihara, H.K.: Representation and recognition of the spatial organization of three-dimensional shapes. In: Proceedings of the Royal Society of London. Series B, Biological Sciences, pp. 269–294 (1978)

    Google Scholar 

  9. Felzenszwalb, P.F., Huttenlocher, D.P.: Pictorial structures for object recognition. Int. J. Comput. Vision 61, 55–79 (2005)

    Article  Google Scholar 

  10. Girshick, R., Felzenszwalb, P., McAllester, D.: Object Detection with Grammar Models. In: NIPS (2011)

    Google Scholar 

  11. Fergus, R., Perona, P., Zisserman, A.: Object class recognition by unsupervised scale-invariant learning. In: CVPR, pp. 264–271 (2003)

    Google Scholar 

  12. Wang, Y., Mori, G.: Hidden part models for human action recognition: Probabilistic versus max margin. TPAMI 33, 1310–1323 (2011)

    Article  Google Scholar 

  13. Pandey, M., Lazebnik, S.: Scene recognition and weakly supervised object localization with deformable part-based models. In: ICCV, pp. 1307–1314 (2011)

    Google Scholar 

  14. Mark, E., Gool, L., Williams, C.K., Winn, J., Zisserman, A.: The Pascal Visual Object Classes (VOC) Challenge. IJCV, 303–338

    Google Scholar 

  15. Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: CVPR, pp. 886–893 (2005)

    Google Scholar 

  16. Ott, P., Everingham, M.: Shared parts for deformable part-based models. In: CVPR, pp. 1513–1520 (2011)

    Google Scholar 

  17. Hussain, S.U., Triggs, B.: Feature sets and dimensionality reduction for visual object detection, pp. 112.1–112.10. BMVA Press (2010)

    Google Scholar 

  18. Pedersoli, M., Vedaldi, A., Gonzalez, J.: A coarse-to-fine approach for fast deformable object detection. In: CVPR, pp. 1353–1360 (2011)

    Google Scholar 

  19. van de Sande, K.E.A., Uijlings, J.R.R., Gevers, T., Smeulders, A.W.M.: Segmentation as selective search for object recognition. In: ICCV, pp. 1879–1886 (2011)

    Google Scholar 

  20. Felzenszwalb, P.F., Girshick, R.B., Mcallester, D.: Cascade object detection with deformable part models. In: CVPR, pp. 2241–2248 (2010)

    Google Scholar 

  21. Zhang, J., Yu, Y., Zheng, S., Huang, K.: An empirical study of visual features for part based model. In: ACPR, pp. 219–223 (2011)

    Google Scholar 

  22. Felzenszwalb, P.F., Girshick, R.B., McAllester, D.: Discriminatively Trained Deformable Part Models, Release 4 (2010)

    Google Scholar 

  23. Desai, C., Ramanan, D., Fowlkes, C.: Discriminative models for multi-class object layout. In: ICCV, pp. 229–236 (2009)

    Google Scholar 

  24. Vedaldi, A., Gulshan, V., Varma, M., Zisserman, A.: Multiple kernels for object detection. In: ICCV, pp. 606–613 (2009)

    Google Scholar 

  25. Razavi, N., Gall, J., van Gool, L.: Scalable multi-class object detection. In: CVPR, pp. 1505–1512 (2011)

    Google Scholar 

  26. Divvala, S.K., Zitnick, C., Kapoor, A., Baker, S.: Detecting objects using unsupervised parts-based attributes. Technical Report CMU-RI-TR-11-10, Robotics Institute, Pittsburgh, PA (2010)

    Google Scholar 

  27. Schnitzspan, P., Fritz, M., Roth, S., Schiele, B.: Discriminative structure learning of hierarchical representations for object detection. In: CVPR, pp. 2238–2245 (2009)

    Google Scholar 

  28. Malisiewicz, T., Gupta, A., Efros, A.A.: Ensemble of exemplar-svms for object detection and beyond. In: ICCV, pp. 89–96 (2011)

    Google Scholar 

  29. ul Hussain, S.: Machine Learning Methods for Visual Object Detection. PhD thesis, University of Caen (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhang, J., Huang, Y., Huang, K., Wu, Z., Tan, T. (2013). Data Decomposition and Spatial Mixture Modeling for Part Based Model. In: Lee, K.M., Matsushita, Y., Rehg, J.M., Hu, Z. (eds) Computer Vision – ACCV 2012. ACCV 2012. Lecture Notes in Computer Science, vol 7724. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37331-2_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-37331-2_10

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-37330-5

  • Online ISBN: 978-3-642-37331-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics