Multi-layer Dictionary Learning for Image Classification

  • Stefen Chan Wai Tim
  • Michele Rombaut
  • Denis Pellerin
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10016)

Abstract

This paper presents a multi-layer dictionary learning method for classification tasks. The goal of the proposed multi-layer framework is to use the supervised dictionary learning approach locally on raw images in order to learn local features. This method starts by building a sparse representation at the patch-level and relies on a hierarchy of learned dictionaries to output a global sparse representation for the whole image. It relies on a succession of sparse coding and pooling steps in order to find an efficient representation of the data for classification. This method has been tested on a classification task with good results.

Keywords

Input Image Sparse Representation Sparse Code Convolutional Neural Network Dictionary Learning 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Notes

Acknowledgement

This work has been partially supported by the LabEx PERSYVAL-Lab (ANR-11-LABX-0025-01).

References

  1. 1.
    Aharon, M., Elad, M., Bruckstein, A.: K-SVD: an algorithm for designing overcomplete dictionaries for sparse representation. IEEE Trans. Signal Process. 54, 4311–4322 (2006)CrossRefGoogle Scholar
  2. 2.
    Bradley, D.M., Bagnell, J.A.: Differential sparse coding. In: NIPS (2008)Google Scholar
  3. 3.
    Tim, C.W.S., Rombaut, M., Pellerin, D.: Rejection-based classification for action recognition using a spatio-temporal dictionary. In: EUSIPCO (2015)Google Scholar
  4. 4.
    Coates, A., Lee, H., Ng, A.: An analysis of single-layer networks in unsupervised feature learning. In: AISTATS (2011)Google Scholar
  5. 5.
    Coates, A., Ng, A.: The importance of encoding versus training with sparse coding and vector quantization. In: ICML (2011)Google Scholar
  6. 6.
    Elad, M., Aharon, M.: Image denoising via sparse and redundant representations over learned dictionaries. IEEE Trans. Image Process. 15, 3736–3745 (2006)MathSciNetCrossRefGoogle Scholar
  7. 7.
    Fawzi, A., Davies, M., Frossard, P.: Dictionary learning for fast classification based on soft-thresholding. Int. J. Comput. Vis. 114, 306–321 (2015)MathSciNetCrossRefGoogle Scholar
  8. 8.
    Hinton, G.E., Srivastava, N., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.R.: Improving neural networks by preventing co-adaptation of feature detector. https://arxiv.org/pdf/1207.0580.pdf (2012)
  9. 9.
    Krizhevsky, A., Hinton, G.: Learning multiple layers of features from tiny images. Technical report, University of Toronto (2009)Google Scholar
  10. 10.
    LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86, 2278–2324 (1998)CrossRefGoogle Scholar
  11. 11.
    Mairal, J., Bach, F., Ponce, J.: Task-driven dictionary learning. IEEE Trans. Pattern Anal. Mach. Intell. 34, 791–804 (2012)CrossRefGoogle Scholar
  12. 12.
    Mairal, J., Elad, M., Sapiro, G.: Sparse representation for color image restauration. IEEE Trans. Image Process. 17, 53–69 (2008)MathSciNetCrossRefMATHGoogle Scholar
  13. 13.
    Olshausen, B.A., Field, D.J.: Emergence of simple-cell receptive field properties by learning a sparse code for natural images. Nature 381, 607–609 (1996)CrossRefGoogle Scholar
  14. 14.
    Qiu, Q., Jiang, Z., Chellappa, R.: Sparse dictionary-based representation and recognition of action attributes. In: ICCV (2011)Google Scholar
  15. 15.
    Raina, R., Battle, A., Lee, H., packer, B., Ng, A.Y.: Self-taught learning: transfer learning from unlabeled data. In: ICML (2008)Google Scholar
  16. 16.
    Springenberg, J.T., Dosovitskiy, A., Brox, T., Riedmiller, M.: Striving for simplicity: the all convolutional net. In: ICLR (2015)Google Scholar
  17. 17.
    Tibshirani, R.: Regression shrinkage and selection via the lasso. J. R. Stat. Soc. 58, 267–288 (1996)MathSciNetMATHGoogle Scholar
  18. 18.
    Wang, H., Ullah, M.M., Klaser, A., Laptev, I., Schmid, C.: Evaluation of local spatio-temporal features for action recognition. British Machine Vision Conference (2009)Google Scholar
  19. 19.
    Wright, J., Yang, A.Y., Ganesh, A., Sastry, S.S., Ma, Y.: Robust face recognition via sparse representation. IEEE Trans. Pattern Anal. Mach. Intell. 31, 210–227 (2009)CrossRefGoogle Scholar
  20. 20.
    Yang, J., Yu, K., Huang, T.: Supervised translation-invariant sparse coding. In: CVPR (2010)Google Scholar

Copyright information

© Springer International Publishing AG 2016

Authors and Affiliations

  • Stefen Chan Wai Tim
    • 1
  • Michele Rombaut
    • 1
  • Denis Pellerin
    • 1
  1. 1.GIPSA-LabUniversity of Grenoble AlpesGrenobleFrance

Personalised recommendations