Skip to main content

Style Scanner—Personalized Visual Search and Recommendations

  • Conference paper
  • First Online:
Applied Advanced Analytics

Abstract

In this paper, we present a visual search and recommendation system that supports typical shopping behaviour. We present a unified convolutional neural network architecture, to learn embeddings, which is a way to capture notion of similarity. We will introduce the concept of embeddings with respect to similarity and show how we try to achieve required embeddings with various loss functions. We demonstrate various model architectures based on availability of data. We also demonstrate a semiautomatic way of creating labelled dataset for training. We will talk about the concept of accuracy with respect to similarity, which is complicated as similarity is subjective. Finally, we present an end-to-end system for deployment.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Boureau, Y.-L., Bach, F., LeCun, Y., & Ponce, J. (2010). Learning mid-level features for recognition. In 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 2559–2566). IEEE.

    Google Scholar 

  • Chechik, G., Varun S., Uri, S., & Samy, Bengio. (2010). Large scale online learning of image similarity through ranking. Journal of Machine Learning Research, 11(3).

    Google Scholar 

  • Chopra, S., Raia, H., & Yann, L. (2005). Learning a similarity metric discriminatively, with application to face verification. In 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05) (Vol. 1, pp. 539–546). IEEE.

    Google Scholar 

  • Dalal, N., & Triggs, B. (2005). Histograms of oriented gradients for human detection. In Computer Vision and Pattern Recognition. CVPR 2005 (pp. 886–893). IEEE Computer Society Conference on 1.

    Google Scholar 

  • He, K., Xiangyu, Z., Shaoqing, R., & Jian, S. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 770–778).

    Google Scholar 

  • Krizhevsky, A., Ilya, S., & Hinton, G. E. (2012). Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems (pp. 1097–1105).

    Google Scholar 

  • Lai, H., Yan, P., Ye, L., & Shuicheng, Y. (2015). Simultaneous feature learning and hash coding with deep neural networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 3270–3278).

    Google Scholar 

  • Lowe, D. G. (1999). Object recognition from local scale-invariant features. Paper presented at the meeting of the Proceedings of the International Conference on Computer Vision ICCV, Corfu.

    Google Scholar 

  • Rother, Carsten, Kolmogorov, Vladimir, & Blake, Andrew. (2004). " GrabCut" interactive foreground extraction using iterated graph cuts. ACM Transactions on Graphics (TOG), 23(3), 309–314.

    Article  Google Scholar 

  • Simonyan, K., & Andrew, Z. (2014). Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556.

  • Szegedy, C., Wei, L., Yangqing, J., Pierre, S., Scott, R., Anguelov, D., Erhan, D., et al. (2015). Going deeper with convolutions, 1–9. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, New Jersey.

    Google Scholar 

  • Taylor, G. W., Spiro, I., Christoph, B., & Rob, F. (2011). Learning invariance through imitation. In CVPR 2011 (pp. 2729–2736). IEEE.

    Google Scholar 

  • Wang, X., Sun, Z., Zhang, W., Zhou, Y., & Jiang, Y.-G. (2016). Matching user photos to online products with robust deep features. In Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval (pp. 7–14).

    Google Scholar 

  • Wang, J., Yang, S., Thomas, L., Chuck, R., Jingbin, W., James, P., et al. (2014). Learning fine-grained image similarity with deep ranking. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 1386–1393).

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Abhishek Kushwaha .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Kushwaha, A., Chakravorty, S., Das, P. (2021). Style Scanner—Personalized Visual Search and Recommendations. In: Laha, A.K. (eds) Applied Advanced Analytics. Springer Proceedings in Business and Economics. Springer, Singapore. https://doi.org/10.1007/978-981-33-6656-5_5

Download citation

Publish with us

Policies and ethics