Abstract
Image representation using feature descriptors is crucial. A number of histogram-based descriptors are widely used for this purpose. However, histogram-based descriptors have certain limitations and kernel descriptors (KDES) are proven to overcome them. Moreover, the combination of more than one KDES performs better than an individual KDES. Conventionally, KDES fusion is performed by concatenating them after the gradient, colour and shape descriptors have been extracted. This approach has limitations in regard to the efficiency as well as the effectiveness. In this paper, we propose a novel approach to fuse different image features before the descriptor extraction, resulting in a compact descriptor which is efficient and effective. In addition, we have investigated the effect on the proposed descriptor when texture-based features are fused along with the conventionally used features. Our proposed descriptor is examined on two publicly available image databases and shown to provide outstanding performances.
Similar content being viewed by others
References
Bakar SA, Hitam MS, Yussof WNJHW (2013) Content-based image retrieval using sift for binary and greyscale images. In: Signal and image processing applications (ICSIPA), 2013 IEEE international conference on, pp. 83–88. IEEE
Bo L, Lai K, Ren X, Fox D (2011) Object recognition with hierarchical kernel descriptors. In: Computer vision and pattern recognition (CVPR), 2011 IEEE conference on, pp. 1729–1736. IEEE
Bo L, Ren X, Fox D (2010) Kernel descriptors for visual recognition. In: Advances in neural information processing systems, pp. 244–252
Bo L, Ren X, Fox D (2011) Depth kernel descriptors for object recognition. In: Intelligent robots and systems (IROS), 2011 IEEE/RSJ international conference on, pp. 821–826. IEEE
Chang CC, Lin CJ (2011) Libsvm: a library for support vector machines. ACM Transactions on Intelligent Systems and Technology (TIST) 2(3):27
Chatzichristofis SA, Boutalis YS (2008) Fcth: Fuzzy color and texture histogram-a low level feature for accurate image retrieval. In: 2008 Ninth international workshop on image analysis for multimedia interactive services, pp. 191–196. IEEE
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Computer vision and pattern recognition, 2005. CVPR 2005. IEEE computer society conference on, vol. 1, pp. 886–893. IEEE
Deselaers T, Keysers D, Ney H (2008) Features for image retrieval: An experimental comparison. Inf Retr 11(2):77–107
Dov D, Talmon R, Cohen I (2016) Kernel-based sensor fusion with application to audio-visual voice activity detection. IEEE Trans Signal Process 64 (24):6406–6416
Fei-Fei L, Fergus R, Perona P (2007) Learning generative visual models from few training examples: an incremental bayesian approach tested on 101 object categories. Comput Vis Image Underst 106(1):59–70
Garcia-Garcia A, Orts-Escolano S, Oprea S, Villena-Martinez V, Garcia-Rodriguez J. (2017) A review on deep learning techniques applied to semantic segmentation. arXiv:1704.06857
Günter S, Schraudolph NN, Vishwanathan S (2007) Fast iterative kernel principal component analysis. J Mach Learn Res 8(Aug):1893–1918
Hu D, Bo L, Ren X (2011) Toward robust material recognition for everyday objects. In: BMVC, vol. 2, pp. 1–6
Han J, Kai-kuang MA (2002) Fuzzy color histogram and its use in color image retrieval. IEEE Trans Image Process 11(8):944–952
Karmakar P, Teng SW, Lu G, Zhang D (2018) A kernel-based approach for content-based image retrieval. In: 2018 International conference on image and vision computing New Zealand (IVCNZ), pp. 1–6. IEEE
Karmakar P, Teng SW, Zhang D, Liu Y, Lu G (2017) Improved kernel descriptors for effective and efficient image classification. In: Digital image computing: techniques and applications (DICTA), 2017 international conference on, pp. 1–8. IEEE
Karmakar P, Teng SW, Zhang D, Liu Y, Lu G (2017) Improved tamura features for image classification using kernel based descriptors. In: Digital image computing: techniques and applications (DICTA), 2017 international conference on, pp. 1–7. IEEE
Konstantinidis K, Gasteratos A, Andreadis I (2005) Image retrieval based on fuzzy color histogram processing. Opt Commun 248(4-6):375–386
Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: CVPR, vol. 2, pp. 2169–2178. IEEE
Liu X, Wang L, Zhang J, Yin J (2014) Sample-adaptive multiple kernel learning. In: Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, AAAI’14, pp. 1975–1981. AAAI Press. http://dl.acm.org/citation.cfm?id=2892753.2892827
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110
Luan S, Chen C, Zhang B, Han J, Liu J (2018) Gabor convolutional networks. IEEE Trans Image Process
Makantasis K, Doulamis A, Doulamis N, Ioannides M (2016) In the wild image retrieval and clustering for 3d cultural heritage landmarks reconstruction. Multimed Tools Appl 75(7):3593–3629
Manning CD, Raghavan P, Schütze H. (2008) Introduction to information retrieval. Cambridge University Press, New York
Ojala T, Pietikäinen M., Harwood D (1996) A comparative study of texture measures with classification based on featured distributions. Pattern Recogn 29(1):51–59
Pan H, Olsen SI, Zhu Y (2015) Feature extraction and learning using context cue and rényi entropy based mutual information. In: International conference on pattern recognition applications and methods, pp. 69–88. Springer
Pilario KE, Shafiee M, Cao Y, Lao L, Yang SH (2020) A review of kernel methods for feature extraction in nonlinear process monitoring. Processes 8(1):24
Ren X, Bo L, Fox D (2012) Rgb-(d) scene labeling: Features and algorithms. In: Computer vision and pattern recognition (CVPR), 2012 IEEE conference on, pp. 2759–2766. IEEE
Rubner Y, Tomasi C, Guibas LJ (2000) The earth mover’s distance as a metric for image retrieval. Int J Comput Vision 40(2):99–121
Sajjad M, Ullah A, Ahmad J, Abbas N, Rho S, Baik SW (2018) Integrating salient colors with rotational invariant texture features for image representation in retrieval systems. Multimed Tools Appl 77(4):4769–4789
Serra G, Grana C, Manfredi M, Cucchiara R (2014) Covariance of covariance features for image classification. In: Proceedings of International Conference on Multimedia Retrieval, p. 411. ACM
Shawe-Taylor J, Cristianini N (2004) Kernel methods for pattern analysis. Cambridge University Press, Cambridge
Tamura H, Mori S, Yamawaki T (1978) Textural features corresponding to visual perception. IEEE Trans Syst Man Cybern 8(6):460–473
Tieu K, Viola P (2004) Boosting image retrieval. Int J Comput Vis 56(1-2):17–36
Tran TH, Nguyen VT (2015) How good is kernel descriptor on depth motion map for action recognition. In: International conference on computer vision systems, pp. 137–146. Springer
Tuzel O, Porikli F, Meer P (2006) Region covariance: A fast descriptor for detection and classification. Comput Vision–ECCV 2006:589–600
Tuzel O, Porikli F, Meer P (2008) Pedestrian detection via classification on riemannian manifolds. IEEE Trans Pattern Anal Mach Intell 30 (10):1713–1727
Varma M, Ray D (2007) Learning the discriminative power-invariance trade-off. In: Computer vision, 2007. ICCV 2007. IEEE 11th international conference on, pp. 1–8. IEEE
Vedaldi A, Gulshan V, Varma M, Zisserman A (2009) Multiple kernels for object detection. In: Computer vision, 2009 IEEE 12th international conference on, pp. 606–613. IEEE
Voulodimos A, Doulamis N, Doulamis A, Protopapadakis E (2018) Deep learning for computer vision: A brief review. Comput Intell Neurosci 2018:7068349–7068349
Wang P, Wang J, Zeng G, Xu W, Zha H, Li S (2013) Supervised kernel descriptors for visual recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2858–2865
Xie B, Liu Y, Zhang H, Yu J (2013) Efficient kernel descriptor for image categorization via pivots selection. In: Image processing (ICIP), 2013 20th IEEE international conference on, pp. 3479–3483. IEEE
Xie B, Liu Y, Zhang H, Yu J (2016) A novel supervised approach to learning efficient kernel descriptors for high accuracy object recognition. Neurocomputing 182:94–101
Yang S, Bo L, Wang J, Shapiro LG (2012) Unsupervised template learning for fine-grained object recognition. In: Advances in neural information processing systems, pp. 3122–3130
Zhang D, Islam MM, Lu G (2012) A review on automatic image annotation techniques. Pattern Recogn 45(1):346–362
Zhou Y, Ye Q, Qiu Q, Jiao J (2017) Oriented response networks. In: Computer vision and pattern recognition (CVPR), 2017 IEEE conference on, pp. 4961–4970. IEEE
Acknowledgements
This research was partially supported by Australian Research Council Discovery Projects scheme: DP130100024.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Karmakar, P., Teng, S.W., Lu, G. et al. A novel fusion approach in the extraction of kernel descriptor with improved effectiveness and efficiency. Multimed Tools Appl 80, 14545–14564 (2021). https://doi.org/10.1007/s11042-020-10300-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-020-10300-1