Skip to main content

Weighted Pooling of Image Code with Saliency Map for Object Recognition

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 240))

Abstract

Recently, codebook-based object recognition methods have achieved the state-of-the-art performances for many public object databases. Based on the codebook-based object recognition method, we propose a novel method which uses the saliency information in the stage of pooling code vectors. By controlling each code response using the saliency value that represents the visual importance of each local area in an image, the proposed method can effectively reduce the adverse influence of low visual saliency regions, such as the background. On the basis of experiments on the public Flower102 database and Caltech object database, we confirm that the proposed method can improve the conventional codebook-based methods.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   329.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Sivic J, Zisserman A (2003) Video google: a text retrieval approach to object matching in videos. Proceedings of ICCV’03, vol. 2. Los Alamitos, USA, p 1470

    Google Scholar 

  2. Yang J, Yu K, Gong Y, Huang T (2009) Linear spatial pyramid matching using sparse coding for image classification. Proceedings of CVPR’09, Miami, USA, pp 1794–1801

    Google Scholar 

  3. Wang J, Yang J, Yu K, Lv F, Huang T, Gong Y (2010) Locality-constrained linear coding for image classification. Proceedings of CVPR’10, San Francisco, USA, pp 3360–3367

    Google Scholar 

  4. Boureau Y-L, Roux N, Bach F, Ponce J, LeCun Y (2011) Ask the locals: multi-way local pooling for image recognition. ICCV’11, Barcelona, Spain

    Google Scholar 

  5. Amari S (2007) Integration of stochastic models by minimizing α-divergence. Neural Comput 19(10):2780–2796

    Article  MATH  MathSciNet  Google Scholar 

  6. http://www.vision.caltech.edu/Image_Datasets. Accessed 20 July 2012

  7. Nilsback M-E, Zisserman A (2008) Automated flower classification over a large number of classes. Proceedings of ICVGIP’08, Bhubaneswar, India, pp 722–729

    Google Scholar 

  8. Harel J, Koch C, Perona P (2006) Graph-based visual saliency. Proceedings of NIPS’06, Vancouver, Canada

    Google Scholar 

  9. Cheng M-M, Zhang G-X, Mitra NJ, Huang X, Hu S-M (2011) Global contrast based salient region detection. Proceedings of CVPR’11, Colorado Springs, USA, pp 409–416

    Google Scholar 

  10. McCann S, Lowe DG (2012) Local naïve Bayes nearest neighbor for image classification. Proceedings of CVPR’12, Providence, USA, pp 3650–3656

    Google Scholar 

Download references

Acknowledgments

This research was partially supported by the MKE(The Ministry of Knowledge Economy), Korea, under the ITRC(Information Technology Research Center) support program (NIPA-2012- H0301-12-2004) supervised by the NIPA(National IT Industry Promotion Agency); and by the Converging Research Center Program funded by the Ministry of Education, Science and Technology (2012K001342).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hyeyoung Park .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer Science+Business Media Dordrecht(Outside the USA)

About this paper

Cite this paper

Kim, DH., Lee, K., Park, H. (2013). Weighted Pooling of Image Code with Saliency Map for Object Recognition. In: Park, J., Ng, JY., Jeong, HY., Waluyo, B. (eds) Multimedia and Ubiquitous Engineering. Lecture Notes in Electrical Engineering, vol 240. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-6738-6_20

Download citation

  • DOI: https://doi.org/10.1007/978-94-007-6738-6_20

  • Published:

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-94-007-6737-9

  • Online ISBN: 978-94-007-6738-6

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics