Synthetic Sequence Generator for Recommender Systems – Memory Biased Random Walk on a Sequence Multilayer Network

Antulov-Fantulin, Nino; Bošnjak, Matko; Zlatić, Vinko; Grčar, Miha; Šmuc, Tomislav

doi:10.1007/978-3-319-11812-3_3

Nino Antulov-Fantulin²¹,
Matko Bošnjak²²,
Vinko Zlatić²³,
Miha Grčar²⁴ &
…
Tomislav Šmuc²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8777))

Included in the following conference series:

International Conference on Discovery Science

1952 Accesses
2 Citations
1 Altmetric

Abstract

Personalized recommender systems rely on each user’s personal usage data in the system, in order to assist in decision making. However, privacy policies protecting users’ rights prevent these highly personal data from being publicly available to a wider researcher audience. In this work, we propose a memory biased random walk model on a multilayer sequence network, as a generator of synthetic sequential data for recommender systems. We demonstrate the applicability of the generated synthetic data in training recommender system models in cases when privacy policies restrict clickstream publishing.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Adomavicius, G., Tuzhilin, A.: Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions. IEEE Transactions on Knowledge and Data Engineering 17(6), 734–749 (2005)
Article Google Scholar
Rendle, S., Tso-Sutter, K., Huijsen, W., Freudenthaler, C., Gantner, Z., Wartena, C., Brussee, R., Wibbels, M.: Report on state of the art recommender algorithms (update). Technical report, MyMedia public deliverable D4.1.2 (2011)
Google Scholar
Burke, R.: Hybrid recommender systems: Survey and experiments. User Modeling and User-Adapted Interaction 12(4), 331–370 (2002)
Article MATH Google Scholar
Resnick, P., Iacovou, N., Suchak, M., Bergstrom, P., Riedl, J.: Grouplens: an open architecture for collaborative filtering of netnews. In: Proceedings of the 1994 ACM Conference on Computer Supported Cooperative Work, CSCW 1994, pp. 175–186 (1994)
Google Scholar
Narayanan, A., Shmatikov, V.: Robust de-anonymization of large sparse datasets. In: Proceedings of the 2008 IEEE Symposium on Security and Privacy, SP 2008, pp. 111–125 (2008)
Google Scholar
Feller, W.: An introduction to probability theory and its applications, vol. 2. John Wiley & Sons (2008)
Google Scholar
Kao, E.: An introduction to stochastic processes. Business Statistics Series. Duxbury Press (1997)
Google Scholar
Bogers, T.: Movie recommendation using random walks over the contextual graph. In: Proceedings of the 2nd Intl. Workshop on Context-Aware Recommender Systems (2010)
Google Scholar
Fouss, F., Faulkner, S., Kolp, M., Pirotte, A., Saerens, M.: Web recommendation system based on a markov-chain model. In: International Conference on Enterprise Information Systems, ICEIS 2005 (2005)
Google Scholar
Gori, M., Pucci, A.: Research paper recommender systems: A random-walk based approach. In: Web Intelligence, pp. 778–781 (2006)
Google Scholar
Antulov-Fantulin, N., Bošnjak, M., Žnidaršič, M., Grčar, M., Morzy, M., Šmuc, T.: ECML/PKDD 2011 Discovery Challenge overview. In: Proceedings of the ECML-PKDD 2011 Workshop on Discovery Challenge, pp. 7–20 (2011)
Google Scholar
Dror, G., Koenigstein, N., Koren, Y., Weimer, M.: The Yahoo! music dataset and kdd-cup’11. In: Proceedings of KDD Cup 2011 (2011)
Google Scholar
Zlatić, V., Gabrielli, A., Caldarelli, G.: Topologically biased random walk and community finding in networks. Phys. Rev. E 82, 066,109 (2010)
Google Scholar
Newman, M.: Networks: An Introduction. Oxford University Press, Inc. (2010)
Google Scholar
Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval Cambridge University Press (2008)
Google Scholar
Deshpande, M., Karypis, G.: Item-based top-n recommendation algorithms. ACM Transactions on Information Systems 22(1), 143–177 (2004)
Article Google Scholar
Rendle, S., Freudenthaler, C., Gantner, Z., Schmidt-Thieme, L.: Bpr: Bayesian personalized ranking from implicit feedback. In: Proceedings of the 25th Conference on Uncertainty in Artificial Intelligence, UAI 2009, pp. 452–461 (2009)
Google Scholar
Gantner, Z., Rendle, S., Freudenthaler, C., Schmidt-Thieme, L.: Mymedialite: A free recommender system library. In: Proceedings of the Fifth ACM Conference on Recommender Systems, pp. 305–308 (2011)
Google Scholar
Mihelčić, M., Antulov-Fantulin, N., Bošnjak, M., Šmuc, T.: Extending rapidminer with recommender systems algorithms. In: Proceedings of the RapidMiner Community Meeting and Conference, pp. 63–75 (2012)
Google Scholar
Bošnjak, M., Antulov-Fantulin, N., Šmuc, T., Gamberger, D.: Constructing recommender systems workflow templates in RapidMiner. In: Proc. of the 2nd RapidMiner Community Meeting and Conference, pp. 101–112 (2011)
Google Scholar
Chen, B.C., Kifer, D., LeFevre, K., Machanavajjhala, A.: Privacy-preserving data publishing. Foundations and Trends in Databases 2(1-2), 1–167 (2009)
Article Google Scholar
Fung, B.C., Wang, K., Fu, A.W.C., Yu, P.S.: Introduction to Privacy-Preserving Data Publishing: Concepts and Techniques, 1st edn. Chapman & Hall/CRC (2010)
Google Scholar
Aggarwal, C.C., Yu, P.S. (eds.): Privacy-Preserving Data Mining. Models and Algorithms. Springer (2008)
Google Scholar
Berendt, B.: More than modelling and hiding: towards a comprehensive view of web mining and privacy. Data Mining and Knowledge Discovery 24(3), 697–737 (2012)
Article Google Scholar
Kenig, B., Tassa, T.: A practical approximation algorithm for optimal k-anonymity. Data Mining and Knowledge Discovery 25(1), 134–168 (2012)
Article MathSciNet MATH Google Scholar
Samarati, P.: Protecting respondents’ identities in microdata release. IEEE Transactions on Knowledge and Data Engineering 13(6), 1010–1027 (2001)
Article Google Scholar
Machanavajjhala, A., Kifer, D., Gehrke, J., Venkitasubramaniam, M.: L-diversity: Privacy beyond k-anonymity. ACM Transactions on Knowledge Discovery from Data 1(1) (2007)
Google Scholar
Dwork, C.: Differential privacy. In: Bugliesi, M., Preneel, B., Sassone, V., Wegener, I. (eds.) ICALP 2006. LNCS, vol. 4052, pp. 1–12. Springer, Heidelberg (2006)
Chapter Google Scholar
Wolf, P.P.D., Amsterdam, H.V., Design, C., Order, W.T.: An empirical evaluation of PRAM statistics. Netherlands Voorburg/Heerlen (2004)
Google Scholar
Wolf, P.P.D., Gouweleeuw, J.M., Kooiman, P., Willenborg, L.: Reflections on PRAM. Statistical Data Protection, Luxembourg, pp. 337–349 (1999)
Google Scholar
Aggarwal, C.C., Yu, P.S.: A framework for condensation-based anonymization of string data. Data Mining and Knowledge Discovery 16(3), 251–275 (2008)
Article MathSciNet Google Scholar
Raghunathan, T., Reiter, J., Rubin, D.: Multiple imputation for statistical disclosure limitation. Journal of Official Statistics 19(1), 1–16 (2003)
Google Scholar
Fienberg, S.: A radical proposal for the provision of micro-data samples and the preservation of confidentiality. Technical report, Department of Statistics, Carnegie-Mellon University (1994)
Google Scholar
Dandekar, R.A., Cohen, M., Kirkendall, N.: Sensitive micro data protection using latin hypercube sampling technique. In: Domingo-Ferrer, J. (ed.) Inference Control in Statistical Databases. LNCS, vol. 2316, pp. 117–125. Springer, Heidelberg (2002)
Chapter Google Scholar
Dandekar, R.A., Domingo-Ferrer, J., Sebé, F.: LHS-based hybrid microdata vs rank swapping and microaggregation for numeric microdata protection. In: Domingo-Ferrer, J. (ed.) Inference Control in Statistical Databases. LNCS, vol. 2316, pp. 153–162. Springer, Heidelberg (2002)
Chapter Google Scholar
Reiter, J.: Inference for partially synthetic, public use microdata sets. Survey Methodology 29(2), 181–188 (2003)
Google Scholar
Brookshear, J., Glenn, H.: Theory of Computation: Formal Languages, Automata, and Complexity. Benjamin/Cummings Publish Company, Redwood City (1989)
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Laboratory for Information Systems, Division of Electronics, Rudjer Bošković Institute, Zagreb, Croatia
Nino Antulov-Fantulin & Tomislav Šmuc
Department of Computer Science, University College London, London, UK
Matko Bošnjak
Theoretical Physics Division, Rudjer Bošković Institute, Zagreb, Croatia
Vinko Zlatić
Department of Knowledge Technologies - E8, Jožef Stefan Institute, Ljubljana, Slovenia
Miha Grčar

Authors

Nino Antulov-Fantulin
View author publications
You can also search for this author in PubMed Google Scholar
Matko Bošnjak
View author publications
You can also search for this author in PubMed Google Scholar
Vinko Zlatić
View author publications
You can also search for this author in PubMed Google Scholar
Miha Grčar
View author publications
You can also search for this author in PubMed Google Scholar
Tomislav Šmuc
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Knowledge Technologies, Jožef Stefan Institute, Jamova cesta 39, 1000, Ljubljana, Slovenia
Sašo Džeroski , Panče Panov & Dragi Kocev , &
Faculty of Administration, University of Ljubljana, Gosarjeva 5, 1000, Ljubljana, Slovenia
Ljupčo Todorovski

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Antulov-Fantulin, N., Bošnjak, M., Zlatić, V., Grčar, M., Šmuc, T. (2014). Synthetic Sequence Generator for Recommender Systems – Memory Biased Random Walk on a Sequence Multilayer Network. In: Džeroski, S., Panov, P., Kocev, D., Todorovski, L. (eds) Discovery Science. DS 2014. Lecture Notes in Computer Science(), vol 8777. Springer, Cham. https://doi.org/10.1007/978-3-319-11812-3_3

Download citation

DOI: https://doi.org/10.1007/978-3-319-11812-3_3
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11811-6
Online ISBN: 978-3-319-11812-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics