A hybrid approach of topic model and matrix factorization based on two-step recommendation framework

Journal of Intelligent Information Systems

Abstract

Recommender systems have become increasingly significant in addressing the information explosion problem. Typical techniques treat recommendation as either a rating prediction problem or a ranking prediction problem. In contrast, we propose a two-step framework that treats recommendation as a simulation of the behavior by which users generate ratings. The first step predicts the probability that a user rates an item, and the second step predicts the rating value. The predictions from both steps are then combined to compute the expectations of users’ ratings on items, which are used to generate recommendations. Based on this framework, we propose a hybrid approach that uses a topic model in the first step and matrix factorization in the second. Experiments on the MovieLens and EachMovie datasets demonstrate the effectiveness of the proposed framework and the recommendation approach.
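
The two-step combination described in the abstract can be illustrated with a minimal sketch. It rests on assumptions the abstract does not confirm: the first step is taken to be an LDA-style mixture (user-topic weights `theta` times topic-item weights `phi`), the second step a plain dot-product matrix factorization, and the expectation their elementwise product. The function names `expected_ratings` and `top_n` and all parameter names are hypothetical.

```python
import numpy as np


def expected_ratings(theta, phi, user_factors, item_factors):
    """Score every (user, item) pair by an assumed expected rating."""
    # Step 1 (assumed form): probability that user u rates item i,
    # summed over topics: p[u, i] = sum_k theta[u, k] * phi[k, i].
    rate_prob = theta @ phi
    # Step 2 (assumed form): plain matrix-factorization rating estimate.
    rating_pred = user_factors @ item_factors.T
    # Combination (assumed): expectation = probability * predicted value.
    return rate_prob * rating_pred


def top_n(scores, rated_mask, n):
    """Return the indices of the n highest-scoring unrated items per user."""
    # Items the user has already rated are pushed to the end of the ranking.
    masked = np.where(rated_mask, -np.inf, scores)
    return np.argsort(-masked, axis=1)[:, :n]


if __name__ == "__main__":
    theta = np.array([[1.0, 0.0], [0.5, 0.5]])            # 2 users x 2 topics
    phi = np.array([[0.5, 0.5, 0.0], [0.0, 0.5, 0.5]])    # 2 topics x 3 items
    user_factors = np.array([[1.0], [1.0]])               # 2 users x 1 factor
    item_factors = np.array([[4.0], [3.0], [1.0]])        # 3 items x 1 factor
    rated = np.array([[True, False, False], [False, False, False]])

    scores = expected_ratings(theta, phi, user_factors, item_factors)
    print(scores)
    print(top_n(scores, rated, n=2))
```

Under these assumptions, an item with a high predicted rating but a low probability of being rated ranks below an item that is both likely to be rated and moderately liked, which is the behavior the framework aims for.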

Notes

  1. 0 is a typical value outside the range of the rating scale, so it can be used to distinguish the rating value from the rating behavior.

  2. The effectiveness is demonstrated in the experiment section, in which HTMMF, which uses rating behaviors in the first step, achieves better results than the variant (L&S) that uses the same model but rating values in the first step.

  3. Types of implicit feedback include rating behaviors, purchase history, browsing history, and search patterns.

  4. http://www.grouplens.org/node/73

  5. http://www.research.digital.com/src/eachmovie

  6. The original ratings from EM1 are on a 0-to-1 scale. We convert them to a 0-to-5 scale and then exclude the ratings with value 0 in order to fit the proposed recommendation framework.

  7. Because of the restricted column width, we report only some typical results for different values of K_T.
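
The preprocessing in Note 6 can be sketched as follows. This is only an illustration under stated assumptions: the note does not say how the rescaling is done, so a plain multiplication by 5 with rounding to the nearest integer is assumed, and the `(user, item, value)` triple layout and the function names `convert_ratings` and `to_behaviors` are hypothetical. The second function reflects Notes 1 and 3 by reducing each remaining rating to a binary rating behavior.

```python
def convert_ratings(triples, factor=5):
    """Rescale 0-to-1 ratings to 0-to-5 and drop ratings that become 0."""
    converted = []
    for user, item, value in triples:
        # Assumed mapping: multiply by 5 and round to the nearest integer.
        scaled = round(value * factor)
        if scaled == 0:
            # 0 lies outside the rating scale used by the framework,
            # so such ratings are excluded.
            continue
        converted.append((user, item, scaled))
    return converted


def to_behaviors(triples):
    """Keep only the fact that a user rated an item (implicit feedback)."""
    return [(user, item, 1) for user, item, _ in triples]


if __name__ == "__main__":
    raw = [("u1", "m1", 0.0), ("u1", "m2", 0.6), ("u2", "m1", 1.0)]
    ratings = convert_ratings(raw)
    print(ratings)
    print(to_behaviors(ratings))
```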

References

  • Adomavicius, G., & Kwon, Y. (2012). Improving aggregate recommendation diversity using ranking-based techniques. IEEE Transactions on Knowledge and Data Engineering, 24(5), 896–911.

  • Aytekin, T., & Karakaya, M.Ö. (2014). Clustering-based diversity improvement in top-n recommendation. Journal of Intelligent Information Systems, 42(1), 1–18.

  • Blei, D.M., Ng, A.Y., Jordan, M.I. (2003). Latent Dirichlet allocation. The Journal of Machine Learning Research, 3, 993–1022.

  • Breese, J.S., Heckerman, D., Kadie, C. (1998). Empirical analysis of predictive algorithms for collaborative filtering. In Proceedings of the fourteenth conference on uncertainty in artificial intelligence (pp. 43–52). Morgan Kaufmann.

  • Chen, W., Niu, Z., Zhao, X., Li, Y. (2014). A hybrid recommendation algorithm adapted in e-learning environments. World Wide Web, 17(2), 271–284.

  • Cremonesi, P., Koren, Y., Turrin, R. (2010). Performance of recommender algorithms on top-n recommendation tasks. In Proceedings of the fourth ACM conference on recommender systems (pp. 39–46). ACM.

  • Delgado, J., & Ishii, N. (1999). Memory-based weighted majority prediction. In ACM SIGIR’99 workshop on recommender systems. Citeseer.

  • Griffiths, T.L., & Steyvers, M. (2004). Finding scientific topics. Proceedings of the National Academy of Sciences of the United States of America, 101(Suppl 1), 5228–5235.

  • Hofmann, T. (1999). Probabilistic latent semantic analysis. In Proceedings of the fifteenth conference on uncertainty in artificial intelligence (pp. 289–296). Morgan Kaufmann.

  • Hofmann, T. (2004). Latent semantic models for collaborative filtering. ACM Transactions on Information Systems (TOIS), 22(1), 89–115.

  • Järvelin, K., & Kekäläinen, J. (2002). Cumulated gain-based evaluation of IR techniques. ACM Transactions on Information Systems (TOIS), 20(4), 422–446.

  • Kantor, P.B., Rokach, L., Ricci, F., Shapira, B. (2011). Recommender systems handbook. Springer.

  • Koren, Y. (2008). Factorization meets the neighborhood: A multifaceted collaborative filtering model. In Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 426–434). ACM.

  • Koren, Y., & Sill, J. (2011). Ordrec: An ordinal model for predicting personalized item rating distributions. In Proceedings of the fifth ACM conference on recommender systems (pp. 117–124). ACM.

  • Li, Y., Hu, J., Zhai, C., Chen, Y. (2010). Improving one-class collaborative filtering by incorporating rich user information. In Proceedings of the 19th ACM international conference on information and knowledge management (pp. 959–968). ACM.

  • Liu, N.N., Xiang, E.W., Zhao, M., Yang, Q. (2010). Unifying explicit and implicit feedback for collaborative filtering. In Proceedings of the 19th ACM international conference on Information and knowledge management (pp. 1445–1448). ACM.

  • Liu, N.N., Zhao, M., Yang, Q. (2009). Probabilistic latent preference analysis for collaborative filtering. In Proceedings of the 18th ACM conference on information and knowledge management (pp. 759–766). ACM.

  • Liu, Q., Chen, E., Xiong, H., Ding, C.H., Chen, J. (2012). Enhancing collaborative filtering by user interest expansion via personalized ranking. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 42(1), 218–233.

  • Park, Y.J., & Tuzhilin, A. (2008). The long tail of recommender systems and how to leverage it. In Proceedings of the 2008 ACM conference on recommender systems (pp. 11–18). ACM.

  • Perugini, S., Gonçalves, M.A., Fox, E.A. (2004). Recommender systems research: A connection-centric survey. Journal of Intelligent Information Systems, 23(2), 107–143.

  • Sarwar, B., Karypis, G., Konstan, J., Riedl, J. (2001). Item-based collaborative filtering recommendation algorithms. In Proceedings of the 10th international conference on World Wide Web (pp. 285–295). ACM.

  • Shang, M.S., Lü, L., Zeng, W., Zhang, Y.C., Zhou, T. (2009). Relevance is more significant than correlation: Information filtering on sparse data. EPL (Europhysics Letters), 88(6), 68008.

  • Wei, X., & Croft, W.B. (2006). LDA-based document models for ad-hoc retrieval. In Proceedings of the 29th annual international ACM SIGIR conference on research and development in information retrieval (pp. 178–185). ACM.

  • Weimer, M., Karatzoglou, A., Le, Q.V., Smola, A.J. (2007). COFI RANK - Maximum margin matrix factorization for collaborative ranking. In Advances in neural information processing systems (pp. 1593–1600).

  • Zhao, X., Niu, Z., Chen, W. (2013). Interest before liking: Two-step recommendation approaches. Knowledge-Based Systems, 48, 46–56.

  • Zhou, Y., Wilkinson, D., Schreiber, R., Pan, R. (2008). Large-scale parallel collaborative filtering for the Netflix Prize. In Algorithmic aspects in information and management (pp. 337–348). Springer.

Acknowledgements

This work is supported by the National Basic Research Program of China (No. 2012CB7207002), the National Natural Science Foundation of China (Project Nos. 61370137, 61250010, and 61272361), and the 111 Project of Beijing Institute of Technology. The EachMovie dataset is by courtesy of Digital Equipment Corporation and was generously provided by Paul McJones.

Corresponding author

Correspondence to Zhendong Niu.

Cite this article

Zhao, X., Niu, Z., Chen, W. et al. A hybrid approach of topic model and matrix factorization based on two-step recommendation framework. J Intell Inf Syst 44, 335–353 (2015). https://doi.org/10.1007/s10844-014-0334-3

  • DOI: https://doi.org/10.1007/s10844-014-0334-3
