Abstract
Recommender Systems (RS) are information retrieval systems that can be used for serving personalized content to online users. Most industrial recommendation systems utilize a large amount of online data to generate personalized recommendations for users. The quality of the data plays an important role in the performance of the RS. The majority of the RS data is generated from event data that are stored in data lakes through multiple data pipelines. Event-based data pipelines have emerged as a popular approach to handle the massive amount of data generated by modern applications. In this paper, we explore the impact of event-based data pipelines on recommendation systems. We discuss how these pipelines enable efficient data ingestion, real-time processing, and low-latency recommendations.
D. Reddy, U. Sinha, and R. S. Rajput—Contributed equally to this work.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Adomavicius, G., Tuzhilin, A.: Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions. IEEE Trans. Knowl. Data Eng. 17(6), 734–749 (2005). https://doi.org/10.1109/TKDE.2005.99
Afchar, A., Epure, E.V., Mille, A., Moussallam, M.: Explainability in music recommender systems. arXiv preprint arXiv:2201.10528 (2022)
Ahmad, S., Lavin, A., Purdy, S., Agha, Z.: Unsupervised real-time anomaly detection for streaming data. Neurocomputing 262, 134–147 (2017)
Bobadilla, J., Ortega, F., Hernando, A., Gutiérrez, A.: Recommender systems survey. Knowl.-Based Syst. 46, 109–132 (2013)
Bouneffouf, D., Rish, I., Aggarwal, C.: Survey on applications of multi-armed and contextual bandits. IEEE Congress on Evolutionary Computation (CEC), pp. 1–8 (2020). https://doi.org/10.1109/CEC48606.2020.9185782
Carbone, P., Katsifodimos, A., Ewen, S., Markl, V., Haridi, S., Tzoumas, K.: Apache flink: Stream and batch processing in a single engine. Bull. Tech. Committee Data Eng. 38(4), 13–26 (2015)
Chen, J., Gao, Z.: A comprehensive pipeline for hotel recommendation system. arXiv preprint arXiv:2009.01860 (2020)
Choi, S.M., Lee, D., Jang, K., Park, C., Lee, S.: Improving data sparsity in recommender systems using matrix regeneration with item features. Mathematics 11(2), 292 (2023). https://doi.org/10.3390/math11020292
Covington, P., Adams, J., Sargin, E.: Deep neural networks for youtube recommendations. In: Proceedings of the 10th ACM Conference on Recommender Systems, pp. 191–198 (2016)
Deng, Y.: Recommender systems based on graph embedding techniques: a comprehensive review. arXiv preprint arXiv:2109.09587 (2021)
Gautham, S., Bakirtzis, G., Will, A., Jayakumar, A.V., Elks, C.R.: Stpadriven multilevel runtime monitoring for in-time hazard detection. arXiv preprint arXiv:2204.08999 (2022)
Gomes-Ju´nior, A.R., Santana, T., Winter, O.C., Sfair, R.: The main perturbing objects on the orbits of (616) prometheus and (617) pandora. arXiv preprint arXiv:2202.01617 (2022)
Kohar, M., Rana, C.: Survey paper on recommendation system. Int. J. Comput. Sci. Inf. Technol. 3(2), 3460–3462 (2012)
Kumar, B., Sharma, N.: Approaches, issues and challenges in recommender systems: a systematic review. Indian J. Sci. Tech. 9(47), 94892 (2016). https://doi.org/10.17485/ijst/2016/v9i47/94892
Kunaver, M., Požrl, T.: Diversity in recommender systems a survey. Knowl.-Based Syst. 123, 154–162 (2017). https://doi.org/10.1016/j.knosys.2017.02.009, https://www.sciencedirect.com/science/article/pii/S0950705117300680
Lika, B., Kolomvatsos, K., Hadjiefthymiades, S.: Facing the cold start problem in recommender systems. Expert Syst. Appl.: Int. J. 41, 2065–2073 (2014). https://doi.org/10.1016/j.eswa.2013.09.005
Mazaheri, M., Kiar, G., Glatard, T.: A recommender system for scientific datasets and analysis pipelines. arXiv preprint arXiv:2108.09275 (2021)
Mishra, N., et al.: Research problems in recommender systems. J. Phys.: Conf. Ser. 1717(1), 012002 (2021). https://doi.org/10.1088/1742-6596/1717/1/012002
Mohamed, M.H., Khafagy, M.H., Ibrahim, M.H.: Recommender systems challenges and solutions. In: 2019 International Conference on Innovative Trends in Computer Engineering (ITCE’2019), pp. 1–6 (2019)
Ngoc, T.V., Thi, H.T.: Techniques, benefits, and challenges of recommendation system in e-commerce: a literature review. In: Proceedings of the International Conference on Industrial Engineering and Operations Management, pp. 107–114 (2021)
Ponnuswami, G., Kailasam, S., Dinesh, D.A.: Event-driven data pipeline for network management systems. In: 2020 11th International Conference on Computing, Communication and Networking Technologies (ICCCNT), pp. 1–6 (2020). https://doi.org/10.1109/ICCCNT49239.2020.9225344
Roa, J., Farnocchia, D., Chesley, S.R.: A novel approach to asteroid impact monitoring and hazard assessment. arXiv preprint arXiv:2108.03201 (2021)
Roy, D., Dutta, M.: A systematic review and research perspective on recommender systems. J. Big Data 9(1), 59 (2022). https://doi.org/10.1186/s40537-022-00414-2
Sharma, L., Gera, A.: A survey of recommendation system: research challenges. Int. J. Eng. Trends Technol. 4(5), 1989–1995 (2013)
Tagliabue, J.: You do not need a bigger boat recommendations at reasonable scale in a (mostly) serverless and openstack. In: Woodstock ‘18: ACM Symposium on Neural Gaze Detection, p. 6. ACM (2018)
Tiwalola, A.B., Asafe, Y.N.: A comprehensive study of recommender systems: prospects and challenges. Int. J. Adv. Res. Comput. Sci. Softw. Eng. 9(5), 224–230 (2019)
Vrijenhoek, S.: Do you mind? Reflections on the mind dataset for research on diversity in new recommendations. arXiv preprint arXiv:2304.08253 (2023)
Wang, H.: Dotmat: solving cold-start problem and alleviating sparsity problem for recommender systems. arXiv preprint arXiv:2303.14419 (2023)
Wang, H.: Evolution of the online rating platform data structures and its implications for recommender systems. arXiv preprint arXiv:2303.14419 (2023)
Wang, Y., et al.: Experimental comparison of various techniques for spot size measurement of high-energy x-ray source. arXiv preprint arXiv:1511.07668 (2015)
Wu, F., et al.: Mind: a large-scale dataset for news recommendation. In: Proceedings of the Association for Computational Linguistics (ACL) (2020)
Zhang, M., Jansen, B.J.: The effect of missing click-through data on click through rate estimation in sponsored search. Inf. Process. Manage. 47(4), 671–688 (2011)
Acknowledgements
While the original ideas, study, findings, and interpretations expressed in this paper are our own, the clarity of the presentation in specific sub-sections was achieved with the assistance of ChatGPT which helped us in enhancing the readability of the paper.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
About this paper
Cite this paper
Reddy, D., Sinha, U., Rajput, R.S. (2024). Event-Based Data Pipelines in Recommender Systems: The Data Engineering Perspective. In: Miraz, M.H., Southall, G., Ali, M., Ware, A. (eds) Emerging Technologies in Computing. iCETiC 2023. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 538. Springer, Cham. https://doi.org/10.1007/978-3-031-50215-6_3
Download citation
DOI: https://doi.org/10.1007/978-3-031-50215-6_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-50214-9
Online ISBN: 978-3-031-50215-6
eBook Packages: Computer ScienceComputer Science (R0)