Vision-based image similarity measurement for image search similarity

Jintanachaiwat, Werapat; Siriborvornratanakul, Thitirat

doi:10.1007/s41870-023-01437-x

Vision-based image similarity measurement for image search similarity

Original Research
Published: 19 September 2023

Volume 15, pages 4125–4130, (2023)
Cite this article

International Journal of Information Technology Aims and scope Submit manuscript

122 Accesses
5 Citations
Explore all metrics

Abstract

In various applications across different platforms, image similarity features such as image searching and similar image recommendations are widely used. However, the challenges of semantic gap and querying speed continue to pose significant challenges in image similarity searching. In this study, we propose a novel solution to address these issues using contrastive learning within the TensorFlow Similarity library. Specifically, we trained and tested our proposed method using the Caltech-256 dataset and further evaluated it on the Corel1K dataset. Our work distinguishes itself from previous studies that primarily focus on evaluating accuracy while neglecting the importance of speed evaluation. As such, we propose evaluating both the mean average precision score and query time spending. Our experimental results reveal that our method based on EfficientNet (B7) yields the best average precision scores of 0.93 on the Caltech-256 test dataset and 0.94 on the Corel1K dataset. However, other methods achieve faster query times, although their average precision scores are significantly lower.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 4

Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations

Article Open access 06 February 2017

Learning to Prompt for Vision-Language Models

Article 31 July 2022

CLIP-Adapter: Better Vision-Language Models with Feature Adapters

Article 15 September 2023

Data availability

The Caltech-256 dataset [12] is available in https://authors.library.caltech.edu/7694/ and is derived from https://paperswithcode.com/dataset/caltech-256. The Corel1K dataset [6, 16, 23,24,25,26] is derived from https://sites.google.com/site/dctresearch/Home/content-based-image-retrieval.

References

Agrawal S, Chowdhary A, Agarwala S, Mayya V, Kamath S (2022) Content-based medical image retrieval system for lung diseases using deep CNNs. Int J Inf Technol 14(7):3619–3627
Ahmed KT, Irtaza A, Iqbal MA (2017) Fusion of local and global features for effective image extraction. Appl Intell 47(2):526–543
Ahmad K, Sahu M, Shrivastava M, Rizvi MA, Jain V (2020) An efficient image retrieval tool: query based image management system. Int J Inf Technol 12(1):103–111
Appalaraju S, Chaoji V (2017) Image similarity using deep CNN and curriculum learning.arXiv:1709.08761
Baliga BS, Medepalli R, Muralikrishna SN (2021) Securing textual and image data on cloud using searchable encryption. Int J Inf Technol 13(3):1111–1117
Bian W, Tao D (2010) Biased Discriminant Euclidean Embedding for Content based Image Retrieval. IEEE Trans Image Process 19(2):545–554
Bursztein E, Long J, Lin S, Vallis O, Chollet F (2021) TensorFlow Similarity: A Usable, High-Performance Metric Learning Library. Fixme
Chechik G, Sharma V, Shalit U, Bengio S (2010) Large Scale Online Learning of Image Similarity Through Ranking. J Mach Learn Res 11(3):1109–1135
Chen Y, Gong S, Bazzani L (2020) Image search with text feedback by visiolinguistic attention learning. CVPR:2998–3008
Durmaz O, Bilge HS (2019) Fast image similarity search by distributed locality sensitive hashing. Pattern Recognit Lett 128:361–369
Gao Y, Wang M, Luan H, Shen J, Yan S, Tao D (2011) Tag-based social image search with visual-text joint hypergraph learning. ACM MM:1517–1520
Griffin G, Holub A, Perona P (2022) Caltech 256. CaltechDATA, https://doi.org/10.22002/D1.20087
Harini DND, Bhaskari DL (2012) Image retrieval system based on feature extraction and relevance feedback. CUBE:69–73
He K, Zhang X, Ren S, Sun J (2016) Deep Residual Learning for Image Recognition. CVPR:770–778
Kang L, Hsu C, Chen H, Lu C, Lin C,Pei S (2011) Feature-based sparse representation for image similarity assessment. IEEE Trans Multimed. 13(5):1019–1030
Li J, Allinsion N, Tao D, Li X (2006) Multitraining Support Vector Machine for Image Retrieval. IEEE Trans Image Process 15(11):3597–3601
Park G, Im W (2016) Image-text multi-modal representation learning by adversarial backpropagation. arXiv:1612.08354
Portaz M, Randrianarivo H, Nivaggioli A, Maudet E, Servan C, Peyronnet S (2019) Image search using multilingual texts: a cross-modal learning approach between image and text. arXiv:1903.11299
Roy K, Mukherjee J (2013) Image similarity measure using color histogram, color coherence vector, and sobel method. IJSR 2(1):538–543
Sachar S, Kumar A (2022) Deep ensemble learning for automatic medicinal leaf identification. Int J Inf Technol 14(6):3089–3097
Sun Y, Cheng C, Zhang Y, Zhang C, Zheng L, Wang Z, Wei Y (2020) Circle loss: a unified perspective of pair similarity optimization. CVPR:6397–6406
Tan M, Le QV (2019) EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. ICML:6105–6114
Tao D, Tang X, Li X, Rui Y (2006) Direct kernel biased discriminant analysis: a new content-based image retrieval relevance feedback algorithm. IEEE Trans Multimed 8(4):716–727
Tao D, Tang X, Li X, Wu X (2006) Asymmetric Bagging and Random Subspace for Support Vector Machines-based Relevance Feedback in Image Retrieval. IEEE Trans Pattern Anal Mach Intell 28(7):1088–1099
Tao D, Li X, Maybank SJ (2007) Negative Samples Analysis in Relevance Feedback. IEEE Trans Knowl Data Eng 19(4):568–580
Wang JZ, Li J, Wiederhold G (2001) SIMPLIcity: Semantics-Sensitive Integrated Matching for Picture Libraries. IEEE Trans Pattern Anal Mach Intell 23(9):947–963
Wang J, Song Y, Leung T, Rosenberg C, Wang J, Philbin J, Chen B, Wu Y (2014) Learning fine-grained image similarity with deep ranking. CVPR:1386–1393
Yuan X, Liu Q, Long J, Hu L, Wang Y (2019) Deep image similarity measurement based on the improved triplet network with spatial pyramid pooling. Information 10(4):1–17

Download references

Author information

Authors and Affiliations

Graduate School of Applied Statistics, National Institute of Development Administration (NIDA), Bangkok, Thailand
Werapat Jintanachaiwat & Thitirat Siriborvornratanakul

Authors

Werapat Jintanachaiwat
View author publications
You can also search for this author in PubMed Google Scholar
Thitirat Siriborvornratanakul
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors contributed equally.

Corresponding author

Correspondence to Thitirat Siriborvornratanakul.

Ethics declarations

Conflict of interest

No conflicts of interest to declare.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Jintanachaiwat, W., Siriborvornratanakul, T. Vision-based image similarity measurement for image search similarity. Int. j. inf. tecnol. 15, 4125–4130 (2023). https://doi.org/10.1007/s41870-023-01437-x

Download citation

Received: 14 February 2023
Accepted: 31 July 2023
Published: 19 September 2023
Issue Date: December 2023
DOI: https://doi.org/10.1007/s41870-023-01437-x

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Vision-based image similarity measurement for image search similarity

Abstract

Access this article

Similar content being viewed by others

Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations

Learning to Prompt for Vision-Language Models

CLIP-Adapter: Better Vision-Language Models with Feature Adapters

Data availability

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Vision-based image similarity measurement for image search similarity

Abstract

Access this article

Similar content being viewed by others

Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations

Learning to Prompt for Vision-Language Models

CLIP-Adapter: Better Vision-Language Models with Feature Adapters

Data availability

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation