Abstract
Over the past years, fashion-related challenges have gained a lot of attention in the research community. Outfit generation and recommendation, i.e., the composition of a set of items of different types (e.g., tops, bottom, shoes, accessories) that go well together, are among the most challenging ones. That is because items have to be both compatible amongst each other and also personalized to match the taste of the customer. Recently there has been a plethora of work targeted at tackling these problems by adopting various techniques and algorithms from the machine learning literature. However, to date, there is no extensive comparison of the performance of the different algorithms for outfit generation and recommendation. In this paper, we close this gap by providing a broad evaluation and comparison of various algorithms, including both personalized and non-personalized approaches, using online, real-world user data from one of Europe’s largest fashion stores. We present the adaptations we made to some of those models to make them suitable for personalized outfit generation. Moreover, we provide insights for models that have not yet been evaluated on this task, specifically, GPT, BERT and Seq-to-Seq LSTM.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
The exact number of forward passes for sampling a single outfit from BERT depends on the implementation. We found it to be ideally at least an order of magnitude higher than the outfit length.
- 3.
- 4.
- 5.
A random algorithm would result in close to 100% personalization rate.
References
Bromley J, Guyon I, LeCun Y, Säckinger E, Shah R (1993) Signature verification using a siamese time delay neural network. In: Cowan JD, Tesauro G, Alspector J (eds) Advances in neural information processing systems 6, [7th NIPS Conference, Denver, Colorado, USA, 1993], pp. 737–744. Morgan Kaufmann (1993). http://papers.nips.cc/paper/769-signature-verification-using-a-siamese-time-delay-neural-network
Chen W, Huang P, Xu J, Guo X, Guo C, Sun F, Li C, Pfadler A, Zhao H, Zhao B (2019) POG: personalized outfit generation for fashion recommendation at alibaba ifashion. In: Teredesai A, V Kumar, Y Li, R Rosales, R Terzi, Karypis G (eds) Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining, KDD 2019, Anchorage, AK, USA, August 4–8, 2019, pp 2662–2670. ACM (2019). https://doi.org/10.1145/3292500.3330652
Cheng W, Song S, Chen C, Hidayati SC, Liu J (2020) Fashion meets computer vision: a survey. CoRR abs/2003.13988 (2020). https://arxiv.org/abs/2003.13988
Devlin J, Chang M, Lee K, Toutanova K (2019) BERT: pre-training of deep bidirectional transformers for language understanding. In: Burstein J, Doran C, Solorio T (eds) Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, NAACL-HLT 2019, Minneapolis, MN, USA, June 2–7, 2019, Volume 1 (Long and Short Papers), pp. 4171–4186. Association for Computational Linguistics (2019). https://doi.org/10.18653/v1/n19-1423
Han X, Wu Z, Jiang Y, Davis LS (2017) Learning fashion compatibility with bidirectional lstms. In: Liu Q, Lienhart R, Wang H, Chen SH, Boll S, Chen YP, Friedland G, Li J, Yan S (eds) Proceedings of the 2017 ACM on multimedia conference, MM 2017, mountain view, CA, USA, October 23–27, 2017, pp 1078–1086. ACM (2017). https://doi.org/10.1145/3123266.3123394
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735
Hu Y, Yi X, Davis LS (2015) Collaborative fashion recommendation: a functional tensor factorization approach. In: Zhou X, Smeaton AF, Tian Q, Bulterman DCA, Shen HT, Mayer-Patel K, Yan S (eds) Proceedings of the 23rd annual ACM conference on multimedia conference, MM ’15, Brisbane, Australia, October 26–30, 2015, pp 129–138. ACM (2015). https://doi.org/10.1145/2733373.2806239
Koch G, Zemel R, Salakhutdinov R (2015) Siamese neural networks for one-shot image recognition. In: ICML deep learning workshop, vol 2. Lille (2015)
Kuhn T, Bourke S, Brinkmann L, Buchwald T, Digan C, Hache H, Jaeger S, Lehmann P, Maier O, Matting S, Okulovsky Y (2019) Supporting stylists by recommending fashion style. CoRR abs/1908.09493 (2019). http://arxiv.org/abs/1908.09493
Laenen K, Moens M (2019) Attention-based fusion for outfit recommendation. CoRR abs/1908.10585 (2019). http://arxiv.org/abs/1908.10585
Lee H, Seol J, Lee S (2017) Style2vec: Representation learning for fashion items from style sets. CoRR abs/1708.04014 (2017). http://arxiv.org/abs/1708.04014
Lipton ZC (2015) A critical review of recurrent neural networks for sequence learning. CoRR abs/1506.00019 (2015). http://arxiv.org/abs/1506.00019
McAuley JJ, Targett C, Shi Q, van den Hengel A (2015) Image-based recommendations on styles and substitutes. In: Baeza-Yates R, Lalmas M, Moffat A, Ribeiro-Neto BA (eds) Proceedings of the 38th international ACM SIGIR conference on research and development in information retrieval, Santiago, Chile, August 9–13, 2015, pp 43–52. ACM (2015). https://doi.org/10.1145/2766462.2767755
Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Burges CJC, Bottou L, Ghahramani Z, Weinberger KQ (eds) Advances in neural information processing systems 26: 27th annual conference on neural information processing systems 2013. Proceedings of a meeting held December 5-8, 2013, Lake Tahoe, Nevada, United States, pp 3111–3119 (2013). http://papers.nips.cc/paper/5021-distributed-representations-of-words-and-phrases-and-their-compositionality
Polanía LF, Gupte S (2019) Learning fashion compatibility across apparel categories for outfit recommendation. In: 2019 IEEE international conference on image processing, ICIP 2019, Taipei, Taiwan, September 22–25, 2019, pp 4489–4493. IEEE (2019). https://doi.org/10.1109/ICIP.2019.8803587
Radford A, Narasimhan K, Salimans T, Sutskever I (2018) Improving language understanding by generative pre-training (2018)
Sutskever I, Vinyals O, Le QV (2014) Sequence to sequence learning with neural networks. In: Ghahramani Z, Welling M, Cortes C, Lawrence ND, Weinberger KQ (eds) Advances in neural information processing systems 27: annual conference on neural information processing systems 2014, December 8–13, 2014, Montreal, Quebec, Canada, pp 3104–3112 (2014). http://papers.nips.cc/paper/5346-sequence-to-sequence-learning-with-neural-networks
Vasileva MI, Plummer BA, Dusad K, Rajpal S, Kumar R, Forsyth DA (2018) Learning type-aware embeddings for fashion compatibility. In: Ferrari V, Hebert M, Sminchisescu C, Weiss Y (eds) Computer Vision - ECCV 2018 - 15th European conference, Munich, Germany, September 8–14, 2018, Proceedings, Part XVI, Lecture Notes in Computer Science, vol 11220, pp 405–421. Springer (2018). https://doi.org/10.1007/978-3-030-01270-0_24
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I (2017) Attention is all you need. In: Guyon I, von Luxburg U, Bengio S, Wallach HM, Fergus R, Vishwanathan SVN, Garnett R (eds) Advances in neural information processing systems 30: annual conference on neural information processing systems 2017, 4–9 December 2017, Long Beach, CA, USA, pp 5998–6008 (2017). http://papers.nips.cc/paper/7181-attention-is-all-you-need
Veit A, Belongie SJ, Karaletsos T (2016) Disentangling nonlinear perceptual embeddings with multi-query triplet networks. CoRR abs/1603.07810 (2016). http://arxiv.org/abs/1603.07810
Veit A, Kovacs B, Bell S, McAuley JJ, Bala K, Belongie SJ (2015) Learning visual clothing style with heterogeneous dyadic co-occurrences. In: 2015 IEEE international conference on computer vision, ICCV 2015, Santiago, Chile, December 7–13, 2015, pp 4642–4650. IEEE Computer Society (2015). https://doi.org/10.1109/ICCV.2015.527
Wang A, Cho K (2019) BERT has a mouth, and it must speak: BERT as a markov random field language model. CoRR abs/1902.04094 (2019). http://arxiv.org/abs/1902.04094
Wu X, Lv S, Zang L, Han J, Hu S (2019) Conditional BERT contextual augmentation. In: Rodrigues JMF, Cardoso PJS, Monteiro JM, Lam R, Krzhizhanovskaya VV, Lees HM, Dongarra JJ, Sloot PMA (eds) Computational Science - ICCS 2019 - 19th international conference, Faro, Portugal, June 12–14, 2019, Proceedings, Part IV, Lecture Notes in Computer Science, vol 11539, pp 84–95. Springer (2019). https://doi.org/10.1007/978-3-030-22747-0_7
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Celikik, M. et al. (2021). Outfit Generation and Recommendation—An Experimental Study. In: Dokoohaki, N., Jaradat, S., Corona Pampín, H.J., Shirvany, R. (eds) Recommender Systems in Fashion and Retail. Lecture Notes in Electrical Engineering, vol 734. Springer, Cham. https://doi.org/10.1007/978-3-030-66103-8_7
Download citation
DOI: https://doi.org/10.1007/978-3-030-66103-8_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-66102-1
Online ISBN: 978-3-030-66103-8
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)