Abstract
Recommender systems are designed to address information overload and to support customers' decision making by providing more relevant, personalized information. Most recommender systems consider only the feedback provided by customers or the content of items; they do not consider the different modes of available information. Smartphones, smart devices, Web 2.0, and similar technologies enable users to generate multimedia content of many kinds, which can help in learning users' preferences. Analyzing this multimodal information, with the different modes considered simultaneously, makes it possible to learn users' preference dynamics and to generate more accurate personalized recommendations. In recent years, multimodal recommender systems have been developed that use multimodal information about users and items. This paper provides a comprehensive analysis of multimodal recommender systems, focusing on modality, applications, and techniques. The positive and negative aspects of using multimodality in recommender systems are also discussed.
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Singh, V.K., Sabharwal, S., Gabrani, G. (2021). Comprehensive Analysis of Multimodal Recommender Systems. In: Jeena Jacob, I., Kolandapalayam Shanmugam, S., Piramuthu, S., Falkowski-Gilski, P. (eds) Data Intelligence and Cognitive Informatics. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-15-8530-2_70
DOI: https://doi.org/10.1007/978-981-15-8530-2_70
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-8529-6
Online ISBN: 978-981-15-8530-2
eBook Packages: Intelligent Technologies and Robotics (R0)