Data Decomposition and Spatial Mixture Modeling for Part Based Model

Zhang, Junge; Huang, Yongzhen; Huang, Kaiqi; Wu, Zifeng; Tan, Tieniu

doi:10.1007/978-3-642-37331-2_10

Junge Zhang²⁰,
Yongzhen Huang²⁰,
Kaiqi Huang²⁰,
Zifeng Wu²⁰ &
…
Tieniu Tan²⁰

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7724))

Included in the following conference series:

Asian Conference on Computer Vision

8434 Accesses
3 Citations

Abstract

This paper presents a system of data decomposition and spatial mixture modeling for part based models. Recently, many enhanced part based models (with e.g., multiple features, more components or parts) have been proposed. Nevertheless, those enhanced models bring high computation cost together with the risk of over-fitting. To tackle this problem, we propose a data decomposition method for part based models which not only accelerates training and testing process but also improves the performance on average. Besides, the original part based model uses a strict rigid structural model to describe the distribution of each part location. It is not “deformable” enough, especially for those instances with different viewpoints or poses in the same aspect ratio. To address this problem, we present a novel spatial mixture modeling method. The spatial mixture embedded model is then integrated into the proposed data decomposition framework. We evaluate our system on the challenging PASCAL VOC2007 and PASCAL VOC2010 datasets, demonstrating the state-of-the-art performance compared with other related methods in terms of accuracy and efficiency.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Felzenszwalb, P., Girshick, R., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. TPAMI 32, 1627–1645 (2010)
Article Google Scholar
Schnitzspan, P., Roth, S., Schiele, B.: Automatic discovery of meaningful object parts with latent crfs. In: CVPR, pp. 121–128 (2010)
Google Scholar
Zhang, J., Yu, Y., Huang, K., Tan, T.: Boosted Local Structured HOG-LBP for Object Localization. In: CVPR, pp. 1393–1400 (2011)
Google Scholar
Zhu, L., Chen, Y., Yuille, A.L., Freeman, W.T.: Latent hierarchical structural learning for object detection. In: CVPR, pp. 1062–1069 (2010)
Google Scholar
Yang, Y., Ramanan, D.: Articulated pose estimation with flexible mixtures-of-parts. In: CVPR, pp. 1385–1392
Google Scholar
Schnitzspan, P., Fritz, M., Roth, S., Schiele, B.: Discriminative structure learning of hierarchical representations for object detection. In: CVPR, pp. 2238–2245 (2009)
Google Scholar
Fischler, M., Elschlager, R.: The representation and matching of pictorial structures. IEEE Transactions on Computers C-22, 67–92 (1973)
Article Google Scholar
Marr, D., Nishihara, H.K.: Representation and recognition of the spatial organization of three-dimensional shapes. In: Proceedings of the Royal Society of London. Series B, Biological Sciences, pp. 269–294 (1978)
Google Scholar
Felzenszwalb, P.F., Huttenlocher, D.P.: Pictorial structures for object recognition. Int. J. Comput. Vision 61, 55–79 (2005)
Article Google Scholar
Girshick, R., Felzenszwalb, P., McAllester, D.: Object Detection with Grammar Models. In: NIPS (2011)
Google Scholar
Fergus, R., Perona, P., Zisserman, A.: Object class recognition by unsupervised scale-invariant learning. In: CVPR, pp. 264–271 (2003)
Google Scholar
Wang, Y., Mori, G.: Hidden part models for human action recognition: Probabilistic versus max margin. TPAMI 33, 1310–1323 (2011)
Article Google Scholar
Pandey, M., Lazebnik, S.: Scene recognition and weakly supervised object localization with deformable part-based models. In: ICCV, pp. 1307–1314 (2011)
Google Scholar
Mark, E., Gool, L., Williams, C.K., Winn, J., Zisserman, A.: The Pascal Visual Object Classes (VOC) Challenge. IJCV, 303–338
Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: CVPR, pp. 886–893 (2005)
Google Scholar
Ott, P., Everingham, M.: Shared parts for deformable part-based models. In: CVPR, pp. 1513–1520 (2011)
Google Scholar
Hussain, S.U., Triggs, B.: Feature sets and dimensionality reduction for visual object detection, pp. 112.1–112.10. BMVA Press (2010)
Google Scholar
Pedersoli, M., Vedaldi, A., Gonzalez, J.: A coarse-to-fine approach for fast deformable object detection. In: CVPR, pp. 1353–1360 (2011)
Google Scholar
van de Sande, K.E.A., Uijlings, J.R.R., Gevers, T., Smeulders, A.W.M.: Segmentation as selective search for object recognition. In: ICCV, pp. 1879–1886 (2011)
Google Scholar
Felzenszwalb, P.F., Girshick, R.B., Mcallester, D.: Cascade object detection with deformable part models. In: CVPR, pp. 2241–2248 (2010)
Google Scholar
Zhang, J., Yu, Y., Zheng, S., Huang, K.: An empirical study of visual features for part based model. In: ACPR, pp. 219–223 (2011)
Google Scholar
Felzenszwalb, P.F., Girshick, R.B., McAllester, D.: Discriminatively Trained Deformable Part Models, Release 4 (2010)
Google Scholar
Desai, C., Ramanan, D., Fowlkes, C.: Discriminative models for multi-class object layout. In: ICCV, pp. 229–236 (2009)
Google Scholar
Vedaldi, A., Gulshan, V., Varma, M., Zisserman, A.: Multiple kernels for object detection. In: ICCV, pp. 606–613 (2009)
Google Scholar
Razavi, N., Gall, J., van Gool, L.: Scalable multi-class object detection. In: CVPR, pp. 1505–1512 (2011)
Google Scholar
Divvala, S.K., Zitnick, C., Kapoor, A., Baker, S.: Detecting objects using unsupervised parts-based attributes. Technical Report CMU-RI-TR-11-10, Robotics Institute, Pittsburgh, PA (2010)
Google Scholar
Schnitzspan, P., Fritz, M., Roth, S., Schiele, B.: Discriminative structure learning of hierarchical representations for object detection. In: CVPR, pp. 2238–2245 (2009)
Google Scholar
Malisiewicz, T., Gupta, A., Efros, A.A.: Ensemble of exemplar-svms for object detection and beyond. In: ICCV, pp. 89–96 (2011)
Google Scholar
ul Hussain, S.: Machine Learning Methods for Visual Object Detection. PhD thesis, University of Caen (2011)
Google Scholar

Download references

Author information

Authors and Affiliations

National Lab of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, China
Junge Zhang, Yongzhen Huang, Kaiqi Huang, Zifeng Wu & Tieniu Tan

Authors

Junge Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yongzhen Huang
View author publications
You can also search for this author in PubMed Google Scholar
Kaiqi Huang
View author publications
You can also search for this author in PubMed Google Scholar
Zifeng Wu
View author publications
You can also search for this author in PubMed Google Scholar
Tieniu Tan
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Electrical and Computer Engineering, Seoul National University, 1 Gwanak-ro, 151-744, Gwanak-gu, Seoul, Korea
Kyoung Mu Lee
Microsoft Research Asia, No. 5, Danling st., Haidian district, 100080, Beijing, P.R. China
Yasuyuki Matsushita
School of Interactive Computing, Georgia Institute of Technology, 801 Atlantic Drive, CCB 315, 30332, Atlanta, GA, USA
James M. Rehg
Institute of Automation, National Laboratory of Pattern Recognition, Chinese Academy of Sciences, Zhong Quan Cun East Road 95, Haidian District, 100 190, Beijing, P.R. China
Zhanyi Hu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, J., Huang, Y., Huang, K., Wu, Z., Tan, T. (2013). Data Decomposition and Spatial Mixture Modeling for Part Based Model. In: Lee, K.M., Matsushita, Y., Rehg, J.M., Hu, Z. (eds) Computer Vision – ACCV 2012. ACCV 2012. Lecture Notes in Computer Science, vol 7724. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37331-2_10

Download citation

DOI: https://doi.org/10.1007/978-3-642-37331-2_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37330-5
Online ISBN: 978-3-642-37331-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics