Dataset Morphing to Analyze the Performance of Collaborative Filtering

  • André CorreiaEmail author
  • Carlos Soares
  • Alípio Jorge
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11828)


Machine Learning algorithms are often too complex to be studied from a purely analytical point of view. Alternatively, with a reasonably large number of datasets one can empirically observe the behavior of a given algorithm in different conditions and hypothesize some general characteristics. This knowledge about algorithms can be used to choose the most appropriate one given a new dataset. This very hard problem can be approached using metalearning. Unfortunately, the number of datasets available may not be sufficient to obtain reliable meta-knowledge. Additionally, datasets may change with time, by growing, shrinking and editing, due to natural actions like people buying in a e-commerce site. In this paper we propose dataset morphing as the basis of a novel methodology that can help overcome these drawbacks and can be used to better understand ML algorithms. It consists of manipulating real datasets through the iterative application of gradual transformations (morphing) and by observing the changes in the behavior of learning algorithms while relating these changes with changes in the meta features of the morphed datasets. Although dataset morphing can be envisaged in a much wider framework, we focus on one very specific instance: the study of collaborative filtering algorithms on binary data. Results show that the proposed approach is feasible and that it can be used to identify useful metafeatures to predict the best collaborative filtering algorithm for a given dataset.


Recommender Systems Metalearning 



This work is funded by ERDF through the Operational Programme of Competitiveness and Internationalization—COMPETE 2020—of Portugal 2020 within project PushNews—POCI-01-0247-FEDER-0024257.


  1. 1.
    Brazdil, P., Carrier, C.G., Soares, C., Vilalta, R.: Metalearning: Applications to Data Mining. COGTECH. Springer, Heidelberg (2008). Scholar
  2. 2.
    Cunha, T., Soares, C., de Carvalho, A.C.P.L.F.: Selecting collaborative filtering algorithms using metalearning. In: Frasconi, P., Landwehr, N., Manco, G., Vreeken, J. (eds.) ECML PKDD 2016. LNCS (LNAI), vol. 9852, pp. 393–409. Springer, Cham (2016). Scholar
  3. 3.
    Cunha, T., Soares, C., de Carvalho, A.C.: Metalearning and recommender systems: a literature review and empirical study on the algorithm selection problem for collaborative filtering. Inf. Sci. 423, 128–144 (2018)MathSciNetCrossRefGoogle Scholar
  4. 4.
    Domingues, M.A., et al.: Combining usage and content in an online recommendation system for music in the long tail. Int. J. Multimed. Inf. Retrieval 2, 3–13 (2013)CrossRefGoogle Scholar
  5. 5.
    Maxwell, F., Konstan, J.A.: The MovieLens datasets: history and context. ACM Trans. Intell. Syst. Technol. (TIST) 5, 1–19 (2015)Google Scholar
  6. 6.
    García-Saiz, D., Zorrilla, M.: A meta-learning based framework for building algorithm recommenders: an application for educational arena. J. Intell. Fuzzy Syst. 32, 1–11 (2017)Google Scholar
  7. 7.
    Hahsler, M.: recommenderlab: a framework for developing and testing recommendation algorithms, November 2011Google Scholar
  8. 8.
    Massa, P., Avesani, P.: In: Proceedings of the 2007 ACM conference on Recommender systems - RecSys 2007 (2007)Google Scholar
  9. 9.
    Matuszyk, P., Vinagre, J., Spiliopoulou, M., Jorge, A.M., Gama, J.: Forgetting methods for incremental matrix factorization in recommender systems. In: Proceedings of the 30th Annual ACM Symposium on Applied Computing - SAC ’15 (2015)Google Scholar
  10. 10.
    McAuley, J., Leskovec, J.: Hidden factors and hidden topics: understanding rating dimensions with review text. Proceedings of the 7th ACM conference on Recommender systems - RecSys 2013 (2015)Google Scholar
  11. 11.
    Prudêncio, R.B.C., Soares, C., Ludermir, T.B.: Combining meta-learning and active selection of datasetoids for algorithm selection. In: Corchado, E., Kurzyński, M., Woźniak, M. (eds.) HAIS 2011. LNCS (LNAI), vol. 6678, pp. 164–171. Springer, Heidelberg (2011). Scholar
  12. 12.
    Ricci, F., Rokach, L., Shapira, B.: Recommender systems: introduction and challenges. In: Ricci, F., Rokach, L., Shapira, B. (eds.) Recommender Systems Handbook, pp. 1–34. Springer, Boston, MA (2015). Scholar
  13. 13.
    Rivolli, A., Garcia, L.P.F., Soares, C., Vanschoren, J., de Leon Ferreira de Carvalho, A.C.P.: Towards reproducible empirical research in meta-learning. CoRR abs/1808.1 (2018)Google Scholar
  14. 14.
    Wolberg, G.: Image morphing: a survey. Vis. Comput. 14, 360–372 (1998)CrossRefGoogle Scholar

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  • André Correia
    • 1
    Email author
  • Carlos Soares
    • 1
    • 2
    • 3
  • Alípio Jorge
    • 3
    • 4
  1. 1.Faculdade de Engenharia da Universidade do PortoPortoPortugal
  2. 2.LIACCPortoPortugal
  3. 3.LIAAD-INESC TECPortoPortugal
  4. 4.Faculdade de Ciências da Universidade do PortoPortoPortugal

Personalised recommendations