Abstract
In various applications across different platforms, image similarity features such as image searching and similar image recommendations are widely used. However, the challenges of semantic gap and querying speed continue to pose significant challenges in image similarity searching. In this study, we propose a novel solution to address these issues using contrastive learning within the TensorFlow Similarity library. Specifically, we trained and tested our proposed method using the Caltech-256 dataset and further evaluated it on the Corel1K dataset. Our work distinguishes itself from previous studies that primarily focus on evaluating accuracy while neglecting the importance of speed evaluation. As such, we propose evaluating both the mean average precision score and query time spending. Our experimental results reveal that our method based on EfficientNet (B7) yields the best average precision scores of 0.93 on the Caltech-256 test dataset and 0.94 on the Corel1K dataset. However, other methods achieve faster query times, although their average precision scores are significantly lower.
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs41870-023-01437-x/MediaObjects/41870_2023_1437_Fig1_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs41870-023-01437-x/MediaObjects/41870_2023_1437_Fig2_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs41870-023-01437-x/MediaObjects/41870_2023_1437_Fig3_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs41870-023-01437-x/MediaObjects/41870_2023_1437_Fig4_HTML.png)
Similar content being viewed by others
Data availability
The Caltech-256 dataset [12] is available in https://authors.library.caltech.edu/7694/ and is derived from https://paperswithcode.com/dataset/caltech-256. The Corel1K dataset [6, 16, 23,24,25,26] is derived from https://sites.google.com/site/dctresearch/Home/content-based-image-retrieval.
References
Agrawal S, Chowdhary A, Agarwala S, Mayya V, Kamath S (2022) Content-based medical image retrieval system for lung diseases using deep CNNs. Int J Inf Technol 14(7):3619–3627
Ahmed KT, Irtaza A, Iqbal MA (2017) Fusion of local and global features for effective image extraction. Appl Intell 47(2):526–543
Ahmad K, Sahu M, Shrivastava M, Rizvi MA, Jain V (2020) An efficient image retrieval tool: query based image management system. Int J Inf Technol 12(1):103–111
Appalaraju S, Chaoji V (2017) Image similarity using deep CNN and curriculum learning.arXiv:1709.08761
Baliga BS, Medepalli R, Muralikrishna SN (2021) Securing textual and image data on cloud using searchable encryption. Int J Inf Technol 13(3):1111–1117
Bian W, Tao D (2010) Biased Discriminant Euclidean Embedding for Content based Image Retrieval. IEEE Trans Image Process 19(2):545–554
Bursztein E, Long J, Lin S, Vallis O, Chollet F (2021) TensorFlow Similarity: A Usable, High-Performance Metric Learning Library. Fixme
Chechik G, Sharma V, Shalit U, Bengio S (2010) Large Scale Online Learning of Image Similarity Through Ranking. J Mach Learn Res 11(3):1109–1135
Chen Y, Gong S, Bazzani L (2020) Image search with text feedback by visiolinguistic attention learning. CVPR:2998–3008
Durmaz O, Bilge HS (2019) Fast image similarity search by distributed locality sensitive hashing. Pattern Recognit Lett 128:361–369
Gao Y, Wang M, Luan H, Shen J, Yan S, Tao D (2011) Tag-based social image search with visual-text joint hypergraph learning. ACM MM:1517–1520
Griffin G, Holub A, Perona P (2022) Caltech 256. CaltechDATA, https://doi.org/10.22002/D1.20087
Harini DND, Bhaskari DL (2012) Image retrieval system based on feature extraction and relevance feedback. CUBE:69–73
He K, Zhang X, Ren S, Sun J (2016) Deep Residual Learning for Image Recognition. CVPR:770–778
Kang L, Hsu C, Chen H, Lu C, Lin C,Pei S (2011) Feature-based sparse representation for image similarity assessment. IEEE Trans Multimed. 13(5):1019–1030
Li J, Allinsion N, Tao D, Li X (2006) Multitraining Support Vector Machine for Image Retrieval. IEEE Trans Image Process 15(11):3597–3601
Park G, Im W (2016) Image-text multi-modal representation learning by adversarial backpropagation. arXiv:1612.08354
Portaz M, Randrianarivo H, Nivaggioli A, Maudet E, Servan C, Peyronnet S (2019) Image search using multilingual texts: a cross-modal learning approach between image and text. arXiv:1903.11299
Roy K, Mukherjee J (2013) Image similarity measure using color histogram, color coherence vector, and sobel method. IJSR 2(1):538–543
Sachar S, Kumar A (2022) Deep ensemble learning for automatic medicinal leaf identification. Int J Inf Technol 14(6):3089–3097
Sun Y, Cheng C, Zhang Y, Zhang C, Zheng L, Wang Z, Wei Y (2020) Circle loss: a unified perspective of pair similarity optimization. CVPR:6397–6406
Tan M, Le QV (2019) EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. ICML:6105–6114
Tao D, Tang X, Li X, Rui Y (2006) Direct kernel biased discriminant analysis: a new content-based image retrieval relevance feedback algorithm. IEEE Trans Multimed 8(4):716–727
Tao D, Tang X, Li X, Wu X (2006) Asymmetric Bagging and Random Subspace for Support Vector Machines-based Relevance Feedback in Image Retrieval. IEEE Trans Pattern Anal Mach Intell 28(7):1088–1099
Tao D, Li X, Maybank SJ (2007) Negative Samples Analysis in Relevance Feedback. IEEE Trans Knowl Data Eng 19(4):568–580
Wang JZ, Li J, Wiederhold G (2001) SIMPLIcity: Semantics-Sensitive Integrated Matching for Picture Libraries. IEEE Trans Pattern Anal Mach Intell 23(9):947–963
Wang J, Song Y, Leung T, Rosenberg C, Wang J, Philbin J, Chen B, Wu Y (2014) Learning fine-grained image similarity with deep ranking. CVPR:1386–1393
Yuan X, Liu Q, Long J, Hu L, Wang Y (2019) Deep image similarity measurement based on the improved triplet network with spatial pyramid pooling. Information 10(4):1–17
Author information
Authors and Affiliations
Contributions
All authors contributed equally.
Corresponding author
Ethics declarations
Conflict of interest
No conflicts of interest to declare.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Jintanachaiwat, W., Siriborvornratanakul, T. Vision-based image similarity measurement for image search similarity. Int. j. inf. tecnol. 15, 4125–4130 (2023). https://doi.org/10.1007/s41870-023-01437-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41870-023-01437-x