Skip to main content

A Universal Visual Dictionary Learned from Natural Scenes for Recognition

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8228))

Abstract

Inspired by the efficient coding hypothesis and simple-to-complex cell hierarchy of the visual system, we study a universal visual dictionary learned from natural scenes using sparse coding for recognition. The vocabularies are similar to V1 simple cells receptive fields. Max pooling is done in a local region (”block”) so that the features are translation invariant, which is the function of complex cells. Macro-features of a grid of overlapping spatial blocks are built and fed to a linear SVM classifier for recognition. We have tested the learned universal visual dictionary on different recognition tasks and demonstrated the effectiveness of the model.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Serre, T., Wolf, L., Poggio, T.: Object recognition with features inspired by visual cortex. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol. 2, pp. 994–1000 (2005)

    Google Scholar 

  2. Bengio, Y.: Learning deep architectures for ai. Foundations and Trends® in Machine Learning 2, 1–127 (2009)

    Article  MATH  Google Scholar 

  3. Olshausen, B.A., et al.: Emergence of simple-cell receptive field properties by learning a sparse code for natural images. Nature 381, 607–609 (1996)

    Article  Google Scholar 

  4. Labusch, K., Barth, E., Martinetz, T.: Simple method for high-performance digit recognition based on sparse coding. IEEE Transactions on Neural Networks 19, 1985–1989 (2008)

    Article  Google Scholar 

  5. Raina, R., Battle, A., Lee, H., Packer, B., Ng, A.Y.: Self-taught learning: transfer learning from unlabeled data. In: Proceedings of the 24th International Conference on Machine Learning, pp. 759–766 (2007)

    Google Scholar 

  6. Mairal, J., Bach, F., Ponce, J., Sapiro, G., Zisserman, A.: Supervised dictionary learning. arXiv preprint arXiv:0809.3083 (2008)

    Google Scholar 

  7. Huang, K., Aviyente, S.: Sparse representation for signal classification. In: Advances in Neural Information Processing Systems (NIPS), pp. 609–616 (2006)

    Google Scholar 

  8. Bradley, D.M., Bagnell, J.A.: Differential sparse coding. In: Advances in Neural Information Processing Systems (NIPS) (2008)

    Google Scholar 

  9. Yang, J., Yu, K., Huang, T.: Supervised translation-invariant sparse coding. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3517–3524 (2010)

    Google Scholar 

  10. Mairal, J., Bach, F., Ponce, J.: Task-driven dictionary learning. IEEE Transactions on Pattern Analysis and Machine Intelligence 34, 791–804 (2012)

    Article  Google Scholar 

  11. Shan, H., Cottrell, G.W.: Looking around the backyard helps to recognize faces and digits. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1–8 (2008)

    Google Scholar 

  12. Ciresan, D., Meier, U., Schmidhuber, J.: Multi-column deep neural networks for image classification. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3642–3649 (2012)

    Google Scholar 

  13. Sim, T., Baker, S., Bsat, M.: The cmu pose, illumination, and expression (pie) database. In: IEEE Conference on Automatic Face and Gesture Recognition, pp. 46–51 (2002)

    Google Scholar 

  14. Cai, D., He, X., Han, J.: Semi-supervised discriminant analysis. In: International Conference on Computer Vision (ICCV), pp. 1–7 (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ding, L., Xu, J. (2013). A Universal Visual Dictionary Learned from Natural Scenes for Recognition. In: Lee, M., Hirose, A., Hou, ZG., Kil, R.M. (eds) Neural Information Processing. ICONIP 2013. Lecture Notes in Computer Science, vol 8228. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-42051-1_21

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-42051-1_21

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-42050-4

  • Online ISBN: 978-3-642-42051-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics