A Universal Visual Dictionary Learned from Natural Scenes for Recognition

Ding, Li; Xu, Jinhua

doi:10.1007/978-3-642-42051-1_21


Li Ding & Jinhua Xu

Conference paper


Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 8228)

Abstract

Inspired by the efficient coding hypothesis and the simple-to-complex cell hierarchy of the visual system, we study a universal visual dictionary learned from natural scenes using sparse coding for recognition. The learned vocabularies resemble the receptive fields of V1 simple cells. Max pooling is performed over a local region (a "block") so that the features are translation invariant, which mirrors the function of complex cells. Macro-features are built from a grid of overlapping spatial blocks and fed to a linear SVM classifier for recognition. We tested the learned universal visual dictionary on different recognition tasks and demonstrated the effectiveness of the model.
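
The pooling step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the block size, the stride, the choice to pool absolute coefficient values, and the helper names (`max_pool_blocks`, `macro_feature`, `single_activation`) are all assumptions made for illustration. The sparse coding and linear SVM stages are left out; the input is assumed to be a grid of already computed sparse codes.

```python
from typing import List

# codes[row][col][atom]: sparse-code coefficients at each patch location
# (assumed to be precomputed; the layout is hypothetical).
Codes = List[List[List[float]]]


def max_pool_blocks(codes: Codes, block: int, stride: int) -> Codes:
    """Max-pool absolute coefficients over square blocks.

    Blocks overlap whenever stride < block. Pooling absolute values
    is an assumption, not something the abstract specifies.
    """
    rows, cols, atoms = len(codes), len(codes[0]), len(codes[0][0])
    pooled = []
    for top in range(0, rows - block + 1, stride):
        pooled_row = []
        for left in range(0, cols - block + 1, stride):
            pooled_row.append([
                max(abs(codes[r][c][k])
                    for r in range(top, top + block)
                    for c in range(left, left + block))
                for k in range(atoms)
            ])
        pooled.append(pooled_row)
    return pooled


def macro_feature(pooled: Codes) -> List[float]:
    """Concatenate the pooled block vectors into one feature vector."""
    return [v for row in pooled for cell in row for v in cell]


def single_activation(rows: int, cols: int, atoms: int,
                      r: int, c: int, k: int) -> Codes:
    """Toy code grid with exactly one active atom at location (r, c)."""
    codes = [[[0.0] * atoms for _ in range(cols)] for _ in range(rows)]
    codes[r][c][k] = 1.0
    return codes


# The same atom fires at (1, 1) in one grid and at (2, 2) in the other.
a = macro_feature(max_pool_blocks(single_activation(4, 4, 2, 1, 1, 1), 3, 1))
b = macro_feature(max_pool_blocks(single_activation(4, 4, 2, 2, 2, 1), 3, 1))
print(a == b)
```

In this toy case both locations fall inside every 3x3 block of the 4x4 grid, so the one-position shift leaves the macro-feature unchanged. A shift that moves the activation across a block boundary would change the feature, so the invariance is only local.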

Author information

Authors and Affiliations

Department of Computer Science and Technology, East China Normal University, Shanghai, China
Li Ding & Jinhua Xu

Editor information

Editors and Affiliations

Kyungpook National University, 1370 Sankyuk-Dong, Puk-Gu, 702-701, Taegu, Korea
Minho Lee
The University of Tokyo, 7-3-1 Hongo, 113-8656, Bunkyo-ku, Tokyo, Japan
Akira Hirose
Institute of Automation, Key Laboratory of Complex Systems and Intelligence Science, Chinese Academy of Sciences, 100190, Beijing, China
Zeng-Guang Hou
Sungkyunkwan University, 2066, Seobu-ro, Jangan-gu, 440-746, Suwon, Korea
Rhee Man Kil

About this paper

Cite this paper

Ding, L., Xu, J. (2013). A Universal Visual Dictionary Learned from Natural Scenes for Recognition. In: Lee, M., Hirose, A., Hou, ZG., Kil, R.M. (eds) Neural Information Processing. ICONIP 2013. Lecture Notes in Computer Science, vol 8228. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-42051-1_21

DOI: https://doi.org/10.1007/978-3-642-42051-1_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-42050-4
Online ISBN: 978-3-642-42051-1
eBook Packages: Computer Science, Computer Science (R0)
