Abstract
Image retrieval is a way to search similar images from a given query image. The method successfully applied to different fashion image retrieval over different garments and footwear. However, research over complex fashion products such as ornaments have not got much momentum due to the complex nature of similarity and unavailability of suitable datasets. In this paper we have proposed a Dual Shallow Siamese Network (DSSN) for the task. The model has two identical Siamese networks and takes the same images as input but in a different representation. One subnetwork takes input as RGB color images and the other as identical segmented images. First, the networks are trained using positive and negative image pairs. Next, the trained model is used to find the difference between gallery and query images. The similarity score of the Siamese networks are then fused using weighted averaging. The method is applied on two public datasets, namely, RingFIR (earring dataset) and UT-Zap50K (footwear dataset). We have compared the retrieval accuracy of our method with other state-of-the-art image retrieval methods. The result shows that our method outperforms state-of-the-art methods. The source code and dataset is available publicly (https://github.com/skarifahmed/DSSN).
Similar content being viewed by others
Data Availability
The source code and dataset is available publicly (https://github.com/skarifahmed/DSSN).
The datasets generated during and/or analysed during the current study are available in the github repository, (https://github.com/skarifahmed/DSSN)
References
Ak K E, Kassim A A, Lim J H, Tham J Y (2018) Learning attribute representations with localization for flexible fashion search. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7708–7717
Ak K E, Lim J H, Tham J Y, Kassim A A (2018) Which shirt for my first date? Towards a flexible attribute-based fashion query system. Pattern Recogn Lett 112:212–218
Ak K E, Lim J H, Tham J Y, Kassim A (2019) Semantically consistent hierarchical text to fashion image synthesis with an enhanced-attentional generative adversarial network. In: 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW). IEEE, pp 3121–3124
Alsmadi M K (2020) Content-based image retrieval using color, shape and texture descriptors and features. Arab J Sci Eng 45(4):3317–3330
Beaupre D-A, Bilodeau G-A (2019) Siamese cnns for rgb-lwir disparity estimation. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops, pp 0–0
Bibi R, Mehmood Z, Yousaf R M, Saba T, Sardaraz M, Rehman A (2020) Query-by-visual-search: multimodal framework for content-based image retrieval. J Ambient Intell Humaniz Comput 11(11):5629–5648
Caltagirone L, Bellone M, Svensson L, Wahde M (2019) Lidar–camera fusion for road detection using fully convolutional neural networks. Robot Auton Syst 111:125–131
Chaudhuri U, Banerjee B, Bhattacharya A (2019) Siamese graph convolutional network for content based remote sensing image retrieval. Comput Vis Image Underst 184:22–30
Cheng Z-Q, Wu X, Liu Y, Hua X-S (2017) Video2shop: exact matching clothes in videos to online shopping images. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4048–4056
Chung D, Tahboub K, Delp E J (2017) A two stream siamese convolutional neural network for person re-identification. In: Proceedings of the IEEE international conference on computer vision, pp 1983–1991
Chung Y-A, Weng W-H (2017) Learning deep representations of medical images using siamese cnns with application to content-based image retrieval. arXiv:1711.084901711.08490
Corbiere C, Ben-Younes H, Ramé A, Ollion C (2017) Leveraging weakly annotated data for fashion image retrieval and label prediction. In: Proceedings of the IEEE international conference on computer vision workshops, pp 2268–2274
Dong H, Liang X, Zhang Y, Zhang X, Shen X, Xie Z, Wu B, Yin J (2020) Fashion editing with adversarial parsing learning. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 8120–8128
Erkut U, Bostancıoğlu F, Erten M, Özbayoğlu A M, Solak E (2019) Hsv color histogram based image retrieval with background elimination. In: 2019 1st International Informatics and Software Engineering Conference (UBMYK). IEEE, pp 1–5
Fei-Fei L, Fergus R, Perona P (2004) Learning generative visual models from few training examples: an incremental bayesian approach tested on 101 object categories. Computer Vision and Pattern Recognition Workshop
Gajic B, Baldrich R (2018) Cross-domain fashion image retrieval. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 1869–1871
Ge Y, Zhang R, Wang X, Tang X, Luo P (2019) Deepfashion2: a versatile benchmark for detection, pose estimation, segmentation and re-identification of clothing images. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 5337–5345
Ghojogh B, Sikaroudi M, Shafiei S, Tizhoosh H R, Karray F, Crowley M (2020) Fisher discriminant triplet and contrastive losses for training siamese networks. In: 2020 International Joint Conference on Neural Networks (IJCNN). IEEE, pp 1–7
Gu X, Gao F, Tan M, Peng P (2020) Fashion analysis and understanding with artificial intelligence. Inform Process Manag 57(5):102276
Guo Y, Cheung N-M (2018) Efficient and deep person re-identification using multi-level similarity. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2335–2344
Ha I, Kim H, Park S, Kim H (2018) Image retrieval using bim and features from pretrained vgg network for indoor localization. Build Environ 140:23–31
Hadi Kiapour M, Han X, Lazebnik S, Berg A C, Berg T L (2015) Where to buy it: matching street clothing photos in online shops. In: Proceedings of the IEEE international conference on computer vision, pp 3343–3351
Hamreras S, Boucheham B, Molina-Cabello M A, Benitez-Rochel R, Lopez-Rubio E (2020) Content based image retrieval by ensembles of deep learning object classifiers. Integr Comput-Aided Eng 27(3):317–331
Hidayati S C, Hsu C-C, Chang Y-T, Hua K-L, Fu J, Cheng W-H (2018) What dress fits me best? fashion recommendation on the clothing style for personal body shape. In: Proceedings of the 26th ACM international conference on Multimedia, pp 438–446
Hsiao S-C, Kao D-Y, Liu Z-Y, Tso R (2019) Malware image classification using one-shot learning with siamese networks. Procedia Comput Sci 159:1863–1871
Huang C-L, Huang D-H (1998) A content-based image retrieval system. Image Vis Comput 16(3):149–163
Huang J, Feris R S, Chen Q, Yan S (2015) Cross-domain image retrieval with a dual attribute-aware ranking network. In: Proceedings of the IEEE international conference on computer vision, pp 1062–1070
Huang L, Chen Y (2020) Dual-path siamese cnn for hyperspectral image classification with limited training samples. IEEE Geosci Remote Sens Lett
Ilhan H O, Sigirci I O, Serbes G, Aydin N (2020) A fully automated hybrid human sperm detection and classification system based on mobile-net and the performance comparison with conventional methods. Medical & Biological Engineering & Computing, 1–22
Islam S M, Joardar S, Dogra D P, Sekh A A (2021) Ornament image retrieval using multimodal fusion. SN Comput Sci 2(4):1–9
Islam S M, Joardar S, Sekh A A (2021) Ringfir: a large volume earring dataset for fashion image retrieval. Springer, Singapore, pp 100–111
Jaradat S (2017) Deep cross-domain fashion recommendation. In: Proceedings of the Eleventh ACM conference on recommender systems, pp 407–410
Jian M, Qi Q, Dong J, Yin Y, Lam K-M (2018) Integrating qdwd with pattern distinctness and local contrast for underwater saliency detection. J Vis Commun Image Represent 53:31–41
Jian M, Wang J, Yu H, Wang G, Meng X, Yang L, Dong J, Yin Y (2021) Visual saliency detection by integrating spatial position prior of object with background cues. Expert Syst Appl 168:114219
Jian M, Yin Y, Dong J, Lam K-M (2018) Content-based image retrieval via a hierarchical-local-feature extraction scheme. Multimed Tools Applic 77(21):29099–29117
Jian M, Zhang W, Yu H, Cui C, Nie X, Zhang H, Yin Y (2018) Saliency detection based on directional patches extraction and principal local color contrast. J Vis Commun Image Represent 57:1–11
Kalaiarasi G, Thyagharajan KK (2019) Clustering of near duplicate images using bundled features. Clust Comput 22(5):11997–12007
Kang W-C, Fang C, Wang Z, McAuley J (2017) Visually-aware fashion recommendation and design with generative image models. In: 2017 IEEE International Conference on Data Mining (ICDM). IEEE, pp 207–216
Karczmarek P, Kiersztyn A, Pedrycz W (2018) Generalized choquet integral for face recognition. Int J Fuzzy Syst 20(3):1047–1055
Khurana T, Mahajan K, Arora C, Rai A (2018) Exploiting texture cues for clothing parsing in fashion images. In: 2018 25th IEEE International Conference on Image Processing (ICIP). IEEE, pp 2102–2106
Kuang Z, Gao Y, Li G, Luo P, Chen Y, Lin L, Zhang W (2019) Fashion retrieval via graph reasoning networks on a similarity pyramid. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 3066–3075
Lang Y, He Y, Yang F, Dong J, Xue H (2020) Which is plagiarism: fashion image retrieval based on regional representation for design protection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 2595–2604
Lehmann T M, Güld M O, Thies C, Fischer B, Spitzer K, Keysers D, Ney H, Kohnen M, Schubert H, Wein B B (2004) Content-based image retrieval in medical applications. Methods Inform Med 43(04):354–361
Li J, Chi Z, Chen G (2004) Image retrieval based on sugeno fuzzy integral. In: Third International Conference on Image and Graphics (ICIG’04). IEEE, pp 160–163
Li Z, Li Y, Tian W, Pang Y, Liu Y (2016) Cross-scenario clothing retrieval and fine-grained style recognition. In: 2016 23rd International Conference on Pattern Recognition (ICPR). IEEE, pp 2912–2917
Liang X, Lin L, Yang W, Luo P, Huang J, Yan S (2016) Clothes co-parsing via joint image segmentation and labeling with application to clothing retrieval. IEEE Trans Multimed 18(6):1175–1186
Liao Q (2016) Comparison of several color histogram based retrieval algorithms. In: 2016 IEEE Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC). IEEE, pp 1670–1673
Lin C-H, Chen R-T, Chan Y-K (2009) A smart content-based image retrieval system based on color and texture feature. Image Vis Comput 27 (6):658–665
Lin K, Yang H-F, Liu K-H, Hsiao J-H, Chen C-S (2015) Rapid clothing retrieval via deep learning of binary codes and hierarchical search. In: Proceedings of the 5th ACM on international conference on multimedia retrieval, pp 499–502
Liu G-H, Yang J-Y (2021) Deep-seated features histogram: a novel image retrieval method. Pattern Recogn 116:107926
Liu K-H, Chen T-Y, Chen C-S (2016) Mvc: a dataset for view-invariant clothing retrieval and attribute prediction. In: Proceedings of the 2016 ACM on international conference on multimedia retrieval, pp 313–316
Liu Y, Chen X, Peng H, Wang Z (2017) Multi-focus image fusion with a deep convolutional neural network. Inform Fus 36:191–207
Liu Z, Luo P, Qiu S, Wang X, Tang X (2016) Deepfashion: powering robust clothes recognition and retrieval with rich annotations. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1096–1104
Liu Z, Yan S, Luo P, Wang X, Tang X (2016) Fashion landmark detection in the wild. In: European conference on computer vision. Springer, pp 229–245
Long F, Zhang H, Feng D D (2003) Fundamentals of content-based image retrieval. In: Multimedia information retrieval and management. Springer, pp 1–26
Melekhov I, Kannala J, Rahtu E (2016) Siamese network features for image matching. In: 2016 23rd International Conference on Pattern Recognition (ICPR). IEEE, pp 378–383
Miao Y, Li G, Bao C, Zhang J, Wang J (2020) Clothingnet: cross-domain clothing retrieval with feature fusion and quadruplet loss. IEEE Access 8:142669–142679
Nanni L, Minchio G, Brahnam S, Maguolo G, Lumini A (2021) Experiments of image classification using dissimilarity spaces built with siamese networks. Sensors 21(5):1573
Pandey N, Savakis A (2020) Poly-gan: multi-conditioned gan for fashion synthesis. Neurocomputing 414:356–364
Pelka O, Nensa F, Friedrich C M (2018) Annotation of enhanced radiographs for medical image retrieval with deep convolutional neural networks. PloS one 13(11):e0206229
Peng L, Zhang J, Liu M, Hu A (2019) Deep learning based rf fingerprint identification using differential constellation trace figure. IEEE Trans Veh Technol 69(1):1091–1095
Qi Y, Song Y-Z, Zhang H, Liu J (2016) Sketch-based image retrieval via siamese convolutional neural network. In: 2016 IEEE International Conference on Image Processing (ICIP). IEEE, pp 2460–2464
Saxen F, Werner P, Handrich S, Othman E, Dinges L, Al-Hamadi A (2019) Face attribute detection with mobilenetv2 and nasnet-mobile. In: 2019 11th International Symposium on Image and Signal Processing and Analysis (ISPA). IEEE, pp 176–180
Sharma P, Reilly R B (2003) A colour face image database for benchmarking of automatic face detection algorithms. In: Proceedings EC-VIP-MC 2003. 4th EURASIP conference focused on video/image processing and multimedia communications (IEEE Cat. No. 03EX667), vol 1. IEEE, pp 423–428
Shi M, Lewis V D (2020) Using artificial intelligence to analyze fashion trends. arXiv:2005.00986
Su H, Wang P, Liu L, Li H, Li Z, Zhang Y (2020) Where to look and how to describe: fashion image retrieval with an attentional heterogeneous bilinear network. IEEE Trans Circuits Syst Video Technol
Sun G-L, Wu X, Chen H-H, Peng Q (2015) Clothing style recognition using fashion attribute detection. In: Proceedings of the 8th international conference on mobile multimedia communications, pp 145–148
Tan M, Le Q V (2021) Efficientnetv2: smaller models and faster training. arXiv:2104.00298
Ter Braak CJF, Looman CWN (1986) Weighted averaging, logistic regression and the gaussian response model. Vegetatio 65(1):3–11
Thyagharajan K K, Kalaiarasi G (2018) Pulse coupled neural network based near-duplicate detection of images (pcnn–ndd). Adv Electr Comput Eng 18(3):87–97
Thyagharajan KK, Kalaiarasi G (2021) A review on near-duplicate detection of images using computer vision techniques. Arch Comput Methods Eng 28 (3):897–916
Tu Q, Dong L (2010) An intelligent personalized fashion recommendation system. In: 2010 International Conference on Communications, Circuits and Systems (ICCCAS). IEEE, pp 479–485
Verma S, Anand S, Arora C, Rai A (2018) Diversity in fashion recommendation using semantic parsing. In: 2018 25th IEEE International Conference on Image Processing (ICIP). IEEE, pp 500–504
Wang W, Xu Y, Shen J, Zhu S-C (2018) Attentive fashion grammar network for fashion landmark detection and clothing category classification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4271–4280
Wang X, Sun Z, Zhang W, Zhou Y, Jiang Y-G (2016) Matching user photos to online products with robust deep features. In: Proceedings of the 2016 ACM on international conference on multimedia retrieval, pp 7–14
Welinder P, Branson S, Mita T, Wah C, Schroff F, Belongie S, Perona P (2010) Caltech-UCSD Birds 200. Technical Report CNS-TR-2010-001, Cali711 fornia Institute of Technology
Wiggers K L, Britto A S, Heutte L, Koerich A L, Oliveira L S (2019) Image retrieval and pattern spotting using siamese neural network. In: 2019 International Joint Conference on Neural Networks (IJCNN). IEEE, pp 1–8
Wu L, Wang Y, Gao J, Li X (2018) Where-and-when to look: deep siamese attention networks for video-based person re-identification. IEEE Trans Multimed 21(6):1412–1424
Wu Q (2020) Image retrieval method based on deep learning semantic feature extraction and regularization softmax. Multimed Tools Applic 79 (13):9419–9433
Xiao H, Rasul K, Vollgraf R (2017) Fashion-mnist: a novel image dataset for benchmarking machine learning algorithms. ArXiv:1708.07747
Xu N, Liu A-A, Wong Y, Zhang Y, Nie W, Su Y, Kankanhalli M (2018) Dual-stream recurrent neural network for video captioning. IEEE Trans Circuits Syst Video Technol 29(8):2482–2493
Yager R R (1988) On ordered weighted averaging aggregation operators in multicriteria decisionmaking. IEEE Trans Syst Man Cybern 18(1):183–190
Yamaguchi K, Kiapour M H, Ortiz L E, Berg T L (2012) Parsing clothing in fashion photographs. In: 2012 IEEE Conference on computer vision and pattern recognition. IEEE, pp 3570–3577
Yasmin M, Mohsin S, Sharif M (2014) Intelligent image retrieval techniques: a survey. J Appl Res Technol 12(1):87–103
Yin R, Li K, Lu J, Zhang G (2019) Enhancing fashion recommendation with visual compatibility relationship. In: The world wide web conference, pp 3434–3440
Yu A, Grauman K (2014) Fine-grained visual comparisons with local learning. In: Computer Vision and Pattern Recognition (CVPR)
Yu A, Grauman K (2017) Semantic jitter: dense supervision for visual comparisons via synthetic images. In: International Conference on Computer Vision (ICCV)
Yu L, Liu N, Zhou W, Dong S, Fan Y, Abbas K (2021) Weber’s law based multi-level convolution correlation features for image retrieval. Multimed Tools Applic 80(13):19157–19177
Zhang J, Lu C, Li X, Kim H-J, Wang J (2019) A full convolutional network based on densenet for remote sensing scene classification. Math Biosci Eng 16(5):3345–3367
Zhang J, Hu J (2008) Image segmentation based on 2d otsu method with histogram analysis. In: 2008 international conference on computer science and software engineering, vol 6. IEEE, pp 105–108
Zhao L, Min C (2019) The rise of fashion informatics: a case of data-mining-based social network analysis in fashion. Cloth Text Res J 37(2):87–102
Zheng S, Yang F, Kiapour M H, Piramuthu R (2018) Modanet: a large-scale street fashion dataset with polygon annotations. In: ACM Multimedia conference on multimedia conference. ACM, pp 1670–1678
Zhou W, Mok PY, Zhou Y, Zhou Y, Shen J, Qu Q, Chau KP (2019) Fashion recommendations through cross-media information retrieval. J Vis Commun Image Represent 61:112–120
Zhu S, Urtasun R, Fidler S, Lin D, Change Loy C (2017) Be your own prada: fashion synthesis with structural coherence. In: Proceedings of the IEEE international conference on computer vision, pp 1680–1688
Zou X, Kong X, Wong W, Wang C, Liu Y, Cao Y (2019) Fashionai: a hierarchical dataset for fashion understanding. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops, pp 0–0
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Ethics approval and consent to participate
This article does not contain any studies with human participants or animals performed by any of the authors. Informed consent was obtained from all individual participants included in the study.
Conflict of Interests
The authors declare that there is no conflict of interest regarding the publication of this article.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Islam, S.M., Joardar, S. & Sekh, A.A. DSSN: dual shallow Siamese network for fashion image retrieval. Multimed Tools Appl 82, 16501–16517 (2023). https://doi.org/10.1007/s11042-022-14204-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-022-14204-0