Style Scanner—Personalized Visual Search and Recommendations

Kushwaha, Abhishek; Chakravorty, Saurav; Das, Paulami

doi:10.1007/978-981-33-6656-5_5

Abhishek Kushwaha²,
Saurav Chakravorty² &
Paulami Das²

Part of the book series: Springer Proceedings in Business and Economics ((SPBE))

706 Accesses

Abstract

In this paper, we present a visual search and recommendation system that supports typical shopping behaviour. We present a unified convolutional neural network architecture, to learn embeddings, which is a way to capture notion of similarity. We will introduce the concept of embeddings with respect to similarity and show how we try to achieve required embeddings with various loss functions. We demonstrate various model architectures based on availability of data. We also demonstrate a semiautomatic way of creating labelled dataset for training. We will talk about the concept of accuracy with respect to similarity, which is complicated as similarity is subjective. Finally, we present an end-to-end system for deployment.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 149.00; Price excludes VAT (USA)

Softcover Book: USD 199.99; Price excludes VAT (USA)

Hardcover Book: USD 199.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Boureau, Y.-L., Bach, F., LeCun, Y., & Ponce, J. (2010). Learning mid-level features for recognition. In 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 2559–2566). IEEE.
Google Scholar
Chechik, G., Varun S., Uri, S., & Samy, Bengio. (2010). Large scale online learning of image similarity through ranking. Journal of Machine Learning Research, 11(3).
Google Scholar
Chopra, S., Raia, H., & Yann, L. (2005). Learning a similarity metric discriminatively, with application to face verification. In 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05) (Vol. 1, pp. 539–546). IEEE.
Google Scholar
Dalal, N., & Triggs, B. (2005). Histograms of oriented gradients for human detection. In Computer Vision and Pattern Recognition. CVPR 2005 (pp. 886–893). IEEE Computer Society Conference on 1.
Google Scholar
He, K., Xiangyu, Z., Shaoqing, R., & Jian, S. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 770–778).
Google Scholar
Krizhevsky, A., Ilya, S., & Hinton, G. E. (2012). Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems (pp. 1097–1105).
Google Scholar
Lai, H., Yan, P., Ye, L., & Shuicheng, Y. (2015). Simultaneous feature learning and hash coding with deep neural networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 3270–3278).
Google Scholar
Lowe, D. G. (1999). Object recognition from local scale-invariant features. Paper presented at the meeting of the Proceedings of the International Conference on Computer Vision ICCV, Corfu.
Google Scholar
Rother, Carsten, Kolmogorov, Vladimir, & Blake, Andrew. (2004). " GrabCut" interactive foreground extraction using iterated graph cuts. ACM Transactions on Graphics (TOG), 23(3), 309–314.
Article Google Scholar
Simonyan, K., & Andrew, Z. (2014). Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556.
Szegedy, C., Wei, L., Yangqing, J., Pierre, S., Scott, R., Anguelov, D., Erhan, D., et al. (2015). Going deeper with convolutions, 1–9. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, New Jersey.
Google Scholar
Taylor, G. W., Spiro, I., Christoph, B., & Rob, F. (2011). Learning invariance through imitation. In CVPR 2011 (pp. 2729–2736). IEEE.
Google Scholar
Wang, X., Sun, Z., Zhang, W., Zhou, Y., & Jiang, Y.-G. (2016). Matching user photos to online products with robust deep features. In Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval (pp. 7–14).
Google Scholar
Wang, J., Yang, S., Thomas, L., Chuck, R., Jingbin, W., James, P., et al. (2014). Learning fine-grained image similarity with deep ranking. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 1386–1393).
Google Scholar

Download references

Author information

Authors and Affiliations

Brillio Technologies, Bengaluru, India
Abhishek Kushwaha, Saurav Chakravorty & Paulami Das

Authors

Abhishek Kushwaha
View author publications
You can also search for this author in PubMed Google Scholar
Saurav Chakravorty
View author publications
You can also search for this author in PubMed Google Scholar
Paulami Das
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Abhishek Kushwaha .

Editor information

Editors and Affiliations

Production and Quantitative Methods, Indian Institute of Management Ahmedabad, Ahmedabad, Gujarat, India
Arnab Kumar Laha

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kushwaha, A., Chakravorty, S., Das, P. (2021). Style Scanner—Personalized Visual Search and Recommendations. In: Laha, A.K. (eds) Applied Advanced Analytics. Springer Proceedings in Business and Economics. Springer, Singapore. https://doi.org/10.1007/978-981-33-6656-5_5

Download citation

DOI: https://doi.org/10.1007/978-981-33-6656-5_5
Published: 09 June 2021
Publisher Name: Springer, Singapore
Print ISBN: 978-981-33-6655-8
Online ISBN: 978-981-33-6656-5
eBook Packages: Business and ManagementBusiness and Management (R0)

Publish with us

Policies and ethics