Abstract
As stored data and images on memory disks increase, image retrieval has a necessary task on image processing. Although lots of researches have been reported for this task so far, semantic gap between low level features of images and human concept is still an important challenge on content-based image retrieval. For this task, a robust method is proposed by a combination of convolutional neural network and sparse representation, in which deep features are extracted by using CNN and sparse representation to increase retrieval speed and accuracy. The proposed method has been tested on three common databases on image retrieval, named Corel, ALOI and MPEG7. By computing metrics such as P(0.5), P(1) and ANMRR, experimental results show that the proposed method has achieved higher accuracy and better speed compared to state-of-the-art methods.
Similar content being viewed by others
References
Arel I, Rose DC, Karnowski TP (2010) Deep machine learning - a new frontier in artificial intelligence research [research frontier]. IEEE Comput Intell Mag 5:13–18
Chun YD, Kim NC, Jang IH (2008) Content-based image retrieval using multi-resolution color and texture features. IEEE Trans Multimedia 10(6):1073–1084
Coral dataset, last referred on June 2009, Available at http://wang.ist.psu.edu/docs/related/
Farsi H, Mohamadzadeh S (2013) Colour and texture feature-based image retrieval by using Hadamard matrix in discrete wavelet transform. IET Image Process 7(3):212–218
Farsi H, Mohamadzadeh S (2013) Combining Hadamard matrix, discrete wavelet transform and DCT features based on PCA and KNN for image retrieval. Majlesi Journal of Electrical Engineering 7(1):9–15
Geusebroek JM, Burghouts GJ, Smeulders AWM (2005) The Amsterdam library of object images. Int J Comput Vis 61:103–112
Gou Y, Tao D, Yu j XH, Li Y, Tao D (2016) Deep neural networks with relativity learning for facial expression recognition. In: IEEE international conference on Multimedia & Expo Workshops (ICMEW)
Guo Y, Liu Y, Oerlemans A, Lao S, Wu S, Lew MS (2016) Deep learning for visual understanding: a review. Neurocomputing 187:27–48
Hiremath PS, Shivashankar S, Pujari J (2006) Wavelet based features for color texture classification with application to CBIR. IJCSNS International Journal of Computer Science and Network Security 6(9):124–133
International organization for standardization, MPEG-7 overview 2004. Available at http://mpeg.chiariglione.org/standards/mpeg-7/mpeg-7.htm. accessed 15 Nov 2011
Ka-Man W, Lai-Man P, Kwok-Wai C (2007) Dominant color structure descriptor for image retrieval. In: IEEE international conference on image processing (ICIP)
A. Krizhevsky and G. Hinton, "Learning multiple layers of features from tiny images". 2009
Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. In: Advances in neural information processing systems 25 (NIPS 2012)
Lecun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86:2278–2324
Lee H, Largman Y, Pham P, Ng A (2009) Unsupervised feature learning for audio classification using convolutional deep belief networks. In: Advances in neural information processing systems 22 (NIPS’09)
Li F, Dai Q, Xu W, Er G (2008) Multi-label neighborhood propagation for region-based image retrieval. IEEE Trans Multimed 10(8):1592–1604
Liapis S, Tziritas G (2004) Color and texture image retrieval using chromaticity histograms and wavelet frames. IEEE Trans Multimedia 6(5):676–686
Liu H, Li B, Lv X, Huang Y (2017) Image retrieval using fused deep convolutional features. Procedia Comput Sci 107:749–754
Manjunath BS, Ohm JR, Vasudvan VV, Andyamada A (2001) Color and texture descriptors. IEEE Trans Circuits Syst Video Technol 11(6):703–715
Minh ND, Vetterli M (2002) Wavelet-based texture retrieval using generalized Gaussian density and kullback–leibler distance. IEEE Trans Image Process 11(2):146–158
Mohamadzadeh S, Farsi H (2014) Image retrieval using color-texture features extracted from Gabor-Walsh wavelet pyramid. Journal of Information Systems and Telecommunication 2(1):31–40
Mohamadzadeh S, Farsi H (2016) Content-based image retrieval system via sparse representation. IET Comput Vis 10:95–102
Montagna R, Finlayson GD (2012) Padua point interpolation and Lp-norm minimization in color-based image indexing and retrieval. IET Image Process 6(2):139–147
Peng T q, Li F (2017) Image retrieval based on deep convolutional neural networks and binary hashing learning. In: IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 1742–1746
Philbin J, Chum O, Isard M, Sivic J, Zisserman A (2007) Object retrieval with large vocabularies and fast spatial matching. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–8
Qayyum A, Anwar SM, Awais M, Majid M (2017) Medical image retrieval using deep convolutional neural network. Neurocomputing
Silva Júnior JA, Marçal RE, Batista MA (2014) Image retrieval importance and applications. In: Workshop de Visao Computacional - WVC 2014
Karen Simonyan, and Andrew Zisserman, "Very deep convolutional networks for large-scale image recognition". http://arxiv.org/abs/1409.1556, 2014
K. Simonyan, and Zisserman, A. "Very deep convolutional networks for large-scale image recognition, " Published as a conference paper at ICLR 2015
Singha M, Hemachandran K (2012) Content based image retrieval using color and texture. Signal & Image Processing: An International Journal (SIPIJ) 3(1):39–57
Szegedy C, Wei L, Yangqing J, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: 2015 IEEE conference on computer vision and pattern recognition (CVPR), pp 1–9
Tao D, Guo Y, Song M, Li Y, Yu Z, Yan Tang Y (2016) Person re-identification by dual-regularized KISS metric learning. IEEE Trans Image Process 25(6):2726–2738
Tao D, Guo Y, Li Y, Gao X (2018) Tensor rank preserving discriminant analysis for facial recognition. IEEE Trans Image Process 27(1):325–334
Torres RDS, Falcao AX (2006) Content-based image retrieval theory and applications. RITA 8
Troncy R, Huet B, Schenk S (2011) Feature extraction for multimedia analysis: multimedia semantics, desktop edition (XML): metadata, analysis and interaction, 1st edn. Wiley, New York
Varga D, Szirányi T (2016) Fast content-based image retrieval using convolutional neural network and hash function. In: IEEE international conference on systems, Man, and cybernetics (SMC), pp 2636–2640
Veganzones MA, Graña M (2012) A spectral /spatial CBIR system for hyper spectral images. IEEE J-STARS 5:488–500
Yang AY, Zhou Z, Ganesh A et al (2013) Fast l1-minimization algorithms for robust face recognition. IEEE Trans Image Process 22(8):3234–3246
Zhang Z, Xu Y, Yang J et al (2015) A survey of sparse representation: algorithms and applications. IEEE Access 3:490–530
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Sezavar, A., Farsi, H. & Mohamadzadeh, S. Content-based image retrieval by combining convolutional neural networks and sparse representation. Multimed Tools Appl 78, 20895–20912 (2019). https://doi.org/10.1007/s11042-019-7321-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-019-7321-1