Inference Scene Labeling by Incorporating Object Detection with Explicit Shape Model

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 6494)

Abstract

In this paper, we incorporate shape detection into contextual scene labeling and make use of shape, texture, and context information in a single graphical representation. We propose a candidacy graph whose vertices are recognition candidates of two types, one for superpixels and one for window patches. In the bottom-up steps, the superpixel candidates are generated by a discriminative classifier with textural features, and the window proposals by a learned deformable template model. The interactions between graph vertices take the form of probabilistic connecting edges: contextual interactions are defined by two types of contextual metrics, and competitive interactions by the overlap of the candidates' image domains. With this representation, we propose a composite clustering sampling algorithm that uses Markov Chain Monte Carlo (MCMC) to search quickly for the global optimum. We apply our approach to the Lotus Hill Institute (LHI) and MSRC public datasets, where it achieves state-of-the-art results.
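
The candidacy graph described in the abstract can be pictured with a small sketch. The code below is an illustration under stated assumptions, not the authors' implementation: the names `Candidate`, `CandidacyGraph`, and `metropolis`, the log-odds unary term, the edge weights, and the toy labels are all hypothetical. It also swaps in a plain single-site Metropolis sampler in place of the composite clustering sampling algorithm, which the abstract does not specify in enough detail to reproduce.

```python
import math
import random
from dataclasses import dataclass, field


@dataclass
class Candidate:
    """One recognition candidate (hypothetical fields, for illustration only)."""
    kind: str     # "superpixel" or "window"
    label: str    # proposed class label
    score: float  # bottom-up confidence, strictly between 0 and 1


@dataclass
class CandidacyGraph:
    candidates: list
    # Edge weights keyed by (i, j); both dictionaries are assumptions.
    contextual: dict = field(default_factory=dict)   # reward when both are on
    competitive: dict = field(default_factory=dict)  # penalty when both are on

    def energy(self, on):
        """Return the energy of a selection; lower is better."""
        e = 0.0
        for i, c in enumerate(self.candidates):
            if on[i]:
                # Assumed unary term: negative log-odds of the bottom-up score.
                e -= math.log(c.score / (1.0 - c.score))
        for (i, j), w in self.contextual.items():
            if on[i] and on[j]:
                e -= w
        for (i, j), w in self.competitive.items():
            if on[i] and on[j]:
                e += w
        return e


def metropolis(graph, steps=2000, temperature=1.0, seed=0):
    """Single-site Metropolis search; NOT the paper's cluster sampler."""
    rng = random.Random(seed)
    on = [False] * len(graph.candidates)
    cur = graph.energy(on)
    best, best_e = list(on), cur
    for _ in range(steps):
        i = rng.randrange(len(on))
        on[i] = not on[i]  # propose flipping one candidate
        new = graph.energy(on)
        if new <= cur or rng.random() < math.exp((cur - new) / temperature):
            cur = new
            if cur < best_e:
                best, best_e = list(on), cur
        else:
            on[i] = not on[i]  # reject: undo the flip
    return best, best_e


# Toy usage: a grass superpixel supports a cow window, and an overlapping
# horse window competes with the cow window for the same image region.
graph = CandidacyGraph(
    candidates=[
        Candidate("superpixel", "grass", 0.8),
        Candidate("window", "cow", 0.7),
        Candidate("window", "horse", 0.6),
    ],
    contextual={(0, 1): 1.0},
    competitive={(1, 2): 5.0},
)
best, best_e = metropolis(graph)
print([c.label for c, keep in zip(graph.candidates, best) if keep])
```

The sketch keeps the two candidate types in one vertex list and separates the two edge roles into two dictionaries, so that cooperative and competitive interactions enter the energy with opposite signs. A cluster sampler would instead flip groups of connected candidates in one move, which is what allows faster mixing than the single flips used here.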

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhou, Q., Liu, W. (2011). Inference Scene Labeling by Incorporating Object Detection with Explicit Shape Model. In: Kimmel, R., Klette, R., Sugimoto, A. (eds) Computer Vision – ACCV 2010. ACCV 2010. Lecture Notes in Computer Science, vol 6494. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19318-7_30

  • DOI: https://doi.org/10.1007/978-3-642-19318-7_30

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-19317-0

  • Online ISBN: 978-3-642-19318-7

  • eBook Packages: Computer Science, Computer Science (R0)