
Content-based image retrieval by combining convolutional neural networks and sparse representation

Multimedia Tools and Applications

Abstract

As the volume of data and images stored on disk grows, image retrieval has become a necessary task in image processing. Although much research has been reported on this task, the semantic gap between the low-level features of images and human concepts remains an important challenge in content-based image retrieval. To address it, a robust method is proposed that combines a convolutional neural network (CNN) with sparse representation: deep features are extracted with the CNN, and sparse representation is applied to increase retrieval speed and accuracy. The proposed method has been tested on three databases commonly used in image retrieval: Corel, ALOI and MPEG7. Measured by metrics such as P(0.5), P(1) and ANMRR, the experimental results show that the proposed method achieves higher accuracy and better speed than state-of-the-art methods.
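The abstract names its evaluation metrics without defining them. The following is a minimal sketch of how such metrics are commonly computed, not the authors' implementation. It assumes that P(r) denotes precision at recall level r, and that ANMRR follows the MPEG-7 convention in which a ground-truth image ranked beyond the cut-off K is penalised with rank 1.25K. The function names and input formats are hypothetical, and the paper's exact definitions may differ.

```python
import math


def precision_at_recall(ranked_relevance, recall_level):
    """Precision at the shortest ranking prefix that reaches recall_level.

    ranked_relevance: list of bools in ranked order, True where the retrieved
    image is relevant. Assumed to contain every relevant image.
    """
    total_relevant = sum(ranked_relevance)
    if total_relevant == 0:
        raise ValueError("ranking contains no relevant items")
    needed = max(1, math.ceil(recall_level * total_relevant))
    hits = 0
    for position, is_relevant in enumerate(ranked_relevance, start=1):
        hits += is_relevant
        if hits >= needed:
            return hits / position


def nmrr(ranks, max_ground_truth):
    """Normalised modified retrieval rank for one query (0 is best, 1 is worst).

    ranks: 1-based rank of each ground-truth image, or None if not retrieved.
    max_ground_truth: largest ground-truth set size over all queries.
    """
    ng = len(ranks)
    k = min(4 * ng, 2 * max_ground_truth)  # cut-off rank K
    # Assumed penalty: ranks beyond K (or missing) count as 1.25 * K.
    penalised = [r if (r is not None and r <= k) else 1.25 * k for r in ranks]
    average_rank = sum(penalised) / ng
    return (average_rank - 0.5 * (1 + ng)) / (1.25 * k - 0.5 * (1 + ng))


def anmrr(all_ranks):
    """Average NMRR over all queries; all_ranks holds one rank list per query."""
    max_ground_truth = max(len(ranks) for ranks in all_ranks)
    return sum(nmrr(ranks, max_ground_truth) for ranks in all_ranks) / len(all_ranks)
```

Under these assumptions, higher P(0.5) and P(1) values and a lower ANMRR value both indicate better retrieval, which is the sense in which the abstract reports higher accuracy.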

Author information

Corresponding author

Correspondence to Hassan Farsi.

About this article

Cite this article

Sezavar, A., Farsi, H. & Mohamadzadeh, S. Content-based image retrieval by combining convolutional neural networks and sparse representation. Multimed Tools Appl 78, 20895–20912 (2019). https://doi.org/10.1007/s11042-019-7321-1

  • DOI: https://doi.org/10.1007/s11042-019-7321-1
