Exploiting interactions of review text, hidden user communities and item groups, and time for collaborative filtering

Abstract

Rich side information concerning users and items are valuable for collaborative filtering (CF) algorithms for recommendation. For example, rating score is often associated with a piece of review text, which is capable of providing valuable information to reveal the reasons why a user gives a certain rating. Moreover, the underlying community and group relationship buried in users and items are potentially useful for CF. In this paper, we develop a new model to tackle the CF problem which predicts user’s ratings on previously unrated items by effectively exploiting interactions among review texts as well as the hidden user community and item group information. We call this model CMR (co-clustering collaborative filtering model with review text). Specifically, we employ the co-clustering technique to model the user community and item group, and each community–group pair corresponds to a co-cluster, which is characterized by a rating distribution in exponential family and a topic distribution. We have conducted extensive experiments on 22 real-world datasets, and our proposed model CMR outperforms the state-of-the-art latent factor models. Furthermore, both the user’s preference and item profile are drifting over time. Dynamic modeling the temporal changes in user’s preference and item profiles are desirable for improving a recommendation system. We extend CMR and propose an enhanced model called TCMR to consider time information and exploit the temporal interactions among review texts and co-clusters of user communities and item groups. In this TCMR model, each community–group co-cluster is characterized by an additional beta distribution for time modeling. To evaluate our TCMR model, we have conducted another set of experiments on 22 larger datasets with wider time span. Our proposed model TCMR performs better than CMR and the standard time-aware recommendation model on the rating score prediction tasks. We also investigate the temporal effect on the user–item co-clusters.

This is a preview of subscription content, log in to check access.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Notes

  1. 1.

    http://snap.stanford.edu/data/web-Amazon.html.

  2. 2.

    www.amazon.com.

References

  1. 1.

    Agarwal D, Chen B-C (2010) flda: matrix factorization through latent dirichlet allocation. In: Proceedings of the 3rd ACM international conference on web search and data mining, pp 91–100

  2. 2.

    Agarwal D, Merugu S (2007) Predictive discrete latent factor models for large scale dyadic data. In: Proceedings of the 13th ACM SIGKDD international conference on knowledge discovery and data mining, pp 26–35

  3. 3.

    Aizenberg N, Koren Y, Somekh O (2012) Build your own music recommender by modeling internet radio streams. In: Proceedings of the 21st international conference on world wide web, pp 1–10

  4. 4.

    Almahairi A, Kastner K, Cho K, Courville A (2015) Learning distributed representations from reviews for collaborative filtering. In: Proceedings of the 9th ACM conference on recommender systems, pp 147–154

  5. 5.

    Baltrunas L, Amatriain X (2009) Towards time-dependant recommendation based on implicit feedback. In: Workshop on context-aware recommender systems

  6. 6.

    Bao Y, Fang H, Zhang J (2014) Topicmf: Simultaneously exploiting ratings and reviews for recommendation. In: Proceedings of the 28th AAAI conference on artificial intelligence, pp 2–8

  7. 7.

    Bell R, Koren Y, Volinsky, C (2007) Modeling relationships at multiple scales to improve accuracy of large recommender systems. In: Proceedings of the 13rd ACM SIGKDD international conference on knowledge discovery and data mining, pp 95–104

  8. 8.

    Beutel A, Murray K, Faloutsos C, Smola A (2014) Cobafi: collaborative bayesian filtering. In: Proceedings of the 23rd international conference on world wide web, pp 97–108

  9. 9.

    Bishop CM (2006) Pattern recognition and machine learning. Information science and statistics. Springer, New York

    Google Scholar 

  10. 10.

    Blei D, Ng A, Jordan M (2003) Latent dirichlet allocation. J Mach Learn Res 3:993–1022

    MATH  Google Scholar 

  11. 11.

    Bobadilla J, Ortega F, Hernando A, Gutiérrez A (2013) Recommender systems survey. Knowl Based Syst 46:109–132

    Article  Google Scholar 

  12. 12.

    Cai Y, Leung H-F, Li Q, Min H, Tang J, Li J (2014) Typicality-based collaborative filtering recommendation. IEEE Trans Knowl Data Eng 26(3):766–779

    Article  Google Scholar 

  13. 13.

    Chen C, Li D, Zhao Y, Lv Q, Shang L (2015) Wemarec: accurate and scalable recommendation through weighted and ensemble matrix approximation. In: Proceedings of the 38th international ACM SIGIR conference on research and development in information retrieval, pp 303–312

  14. 14.

    Chen T, Han W, Wang H, Zhou Y, Xu B, Zang B (2007) Content recommendation system based on private dynamic user profile. In: IEEE international conference on machine learning and cybernetics, pp 2112–2118

  15. 15.

    Cheng Y, Church G (2000) Biclustering of expression data. In: Proceedings of the 8th international conference on intelligent systems for molecular biology, vol 8, pp 93–103

  16. 16.

    Chu W, Park S (2009) Personalized recommendation on dynamic content using predictive bilinear models. In: Proceedings of the 18th international conference on world wide web, pp 691–700

  17. 17.

    DeCoste D (2006) Collaborative prediction using ensembles of maximum margin matrix factorizations. In: Proceedings of the 23rd international conference on machine learning, pp 249–256

  18. 18.

    Dhillon I, Mallela S, Modha D (2003) Information-theoretic co-clustering. In: Proceedings of the 9th ACM SIGKDD international conference on knowledge discovery and data mining, pp 89–98

  19. 19.

    Diao Q, Qiu M, Wu C-Y, Smola AJ, Jiang J, Wang C (2014) Jointly modeling aspects, ratings and sentiments for movie recommendation (jmars). In: Proceedings of the 20th ACM SIGKDD international conference on knowledge discovery and data mining, pp 193–202

  20. 20.

    Ding Y, Li X (2005) Time weight collaborative filtering. In: Proceedings of the 14th ACM international conference on information and knowledge management, pp 485–492

  21. 21.

    Gaillard J, Renders J-M (2015) Time-sensitive collaborative filtering through adaptive matrix completion. In: Proceedings of the 37th European conference on information retrieval, pp 327–332

  22. 22.

    Ganu G, Elhadad Y, Marian A (2009) Beyond the stars: improving rating predictions using review text content. In: Proceedings of the 12th international workshop on the web and databases

  23. 23.

    Geman S, Geman D (1984) Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. IEEE Trans Pattern Anal Mach Intell 6:721–741

    Article  MATH  Google Scholar 

  24. 24.

    George T, Merugu S (2005) A scalable collaborative filtering framework based on co-clustering. In: Proceedings of the 5th IEEE international conference on data mining, pp 625–628

  25. 25.

    Geuens S (2015) Factorization machines for hybrid recommendation systems based on behavioral, product, and customer data. In: Proceedings of the 9th ACM conference on recommender systems, pp 379–382

  26. 26.

    Griffiths T, Steyvers M (2004) Finding scientific topics. In: Proceedings of the National academy of Sciences of the United States of America, pp 5228–5235

  27. 27.

    Guan L, Alam MH, Ryu W, Lee S (2016) A phrase-based model to discover hidden factors and hidden topics in recommender systems. In: IEEE international conference on big data and smart computing (BigComp), pp 337–340

  28. 28.

    Hartigan J (1972) Direct clustering of a data matrix. J Am Stat Assoc 67(337):123–129

    Article  Google Scholar 

  29. 29.

    He X, Chen T, Kan M, Chen X (2015) Trirank: review-aware explainable recommendation by modeling aspects. In: Proceedings of the 24th ACM international conference on information and knowledge management, pp 1661–1670

  30. 30.

    Heckel R, Vlachos M (2016) Interpretable recommendations via overlapping co-clusters. arXiv preprint arXiv:1604.02071

  31. 31.

    Hoffman M, Bach F, Blei D (2010) Online learning for latent dirichlet allocation. In: Proceedings of advances neural information processing systems, pp 856–864

  32. 32.

    Hong L, Doumith A, Davison B (2013) Co-factorization machines: modeling user interests and predicting individual decisions in twitter. In: Proceedings of the sixth ACM international conference on web search and data mining, pp 557–566

  33. 33.

    Koren Y (2008) Factorization meets the neighborhood: a multifaceted collaborative filtering model. In: Proceedings of the 14th ACM SIGKDD international conference on knowledge discovery and data mining, pp 426–434

  34. 34.

    Koren Y (2010) Collaborative filtering with temporal dynamics. Commun ACM 53(4):89–97

    Article  Google Scholar 

  35. 35.

    Koren Y, Bell R, Volinsky C (2009) Matrix factorization techniques for recommender systems. Computer 42(8):30–37

    Article  Google Scholar 

  36. 36.

    Lacoste-Julien S, Sha F, Jordan M (2008) Disclda: discriminative learning for dimensionality reduction and classification. In: Proceedings of advances neural information processing systems, vol 83, p 85

  37. 37.

    Lathia N, Hailes S, Capra L (2009) Temporal collaborative filtering with adaptive neighbourhoods. In: Proceedings of the 32nd international ACM SIGIR conference on research and development in information retrieval, pp 796–797

  38. 38.

    Loni B, Shi Y, Larson M, Hanjalic A (2014) Cross-domain collaborative filtering with factorization machines. In: 36th European conference on information retrieval, pp 656–661

  39. 39.

    McAuley J, Leskovec J (2013) From amateurs to connoisseurs: modeling the evolution of user expertise through online reviews. In: Proceedings of the 22nd international conference on world wide web, pp 897–908

  40. 40.

    McAuley J, Leskovec J (2013) Hidden factors and hidden topics: understanding rating dimensions with review text. In: Proceedings of the 7th ACM conference on recommender systems, pp 165–172

  41. 41.

    McAuley J, Leskovec J, Jurafsky, D (2012) Learning attitudes and attributes from multi-aspect reviews. In: Proceedings of the 12th IEEE international conference on data mining, pp 1020–1025

  42. 42.

    Paterek A (2007) Improving regularized singular value decomposition for collaborative filtering. In: Proceedings of KDD cup and workshop, PP 5–8

  43. 43.

    Qiang R, Liang F, Yang J (2013) Exploiting ranking factorization machines for microblog retrieval. In: Proceedings of the 22nd ACM international conference on information and knowledge management, pp 1783–1788

  44. 44.

    Rendle S (2012) Factorization machines with libfm. ACM Trans Intell Syst Technol (TIST) 3(3):57

    Google Scholar 

  45. 45.

    Rennie J, Srebro N (2005) Fast maximum margin matrix factorization for collaborative prediction. In: Proceedings of the 22nd international conference on machine learning, pp 713–719

  46. 46.

    Salakhutdinov R, Andriy M (2007) Probabilistic matrix factorization. In: Proceedings of advances neural information processing systems, vol 1, pp 1–2

  47. 47.

    Salakhutdinov R, Andriy M (2008) Bayesian probabilistic matrix factorization using markov chain monte carlo. In: Proceedings of the 33rd international conference on machine learning, pp 880–887

  48. 48.

    Sarwar B, Karypis G, Konstan J, Riedl J (2002) Incremental singular value decomposition algorithms for highly scalable recommender systems. In: Proceedings of 5th international conference on computer and information science, pp 27–28

  49. 49.

    Shan H, Banerjee A (2008) Bayesian co-clustering. In: Proceedings of the 8th IEEE international conference on data mining, pp 530–539

  50. 50.

    Shi Y, Larson M, Hanjalic A (2014) Collaborative filtering beyond the user-item matrix: a survey of the state of the art and future challenges. ACM Comput Surv (CSUR) 47(1):3

    Article  Google Scholar 

  51. 51.

    Srebro N, Alon N, Jaakkola T (2004) Generalization error bounds for collaborative prediction with low-rank matrices. In: Proceedings of advances neural information processing systems

  52. 52.

    Srebro N, Jaakkola T (2003) Weighted low-rank approximations. In: Proceedings of the 20th international conference on machine learning, vol 3, pp 720–727

  53. 53.

    Srebro N, Rennie J, Jaakkola T (2004) Maximum-margin matrix factorization. In: Proceedings of advances neural information processing systems, vol 17, pp 1329–1336

  54. 54.

    Tan C, Chi E, Huffaker D, Kossinets G, Alexander S (2013) Instant foodie: predicting expert ratings from grassroots. In: Proceedings of the 22nd ACM international conference on information and knowledge management, pp 1127–1136

  55. 55.

    Titov I, McDonald R (2008) A joint model of text and aspect ratings for sentiment summarization. In: Proceedings of the 46th annual meeting of the Association for Computational Linguistics, pp 308–316

  56. 56.

    Titov I, McDonald R (2008) Modeling online reviews with multi-grain topic models. In: Proceedings of the 17th international conference on world wide web, pp 111–120

  57. 57.

    Wang C, Blei D (2011) Collaborative topic modeling for recommending scientific articles. In: Proceedings of the 17th ACM SIGKDD international conference on knowledge discovery and data mining, pp 448–456

  58. 58.

    Wang X, McCallum A (2006) Topics over time: a non-markov continuous-time model of topical trends. In: Proceedings of the 12th ACM SIGKDD international conference on knowledge discovery and data mining, pp 424–433

  59. 59.

    Weimer M, Karatzoglou A, Le Q, Smola A (2007) Maximum margin matrix factorization for collaborative ranking. In: Proceedings of advances neural information processing systems, pp 1593–1600

  60. 60.

    Weimer M, Karatzoglou A, Smola A (2008) Improving maximum margin matrix factorization. Mach Learn 72(3):263–276

    Article  Google Scholar 

  61. 61.

    Weston J, Bengio S, Usunier N (2011) Wsabie: Scaling up to large vocabulary image annotation. In: Proceedings of the 22nd international joint conference on artificial intelligence, pp 2764–2770

  62. 62.

    Xiang L, Yuan Q, Zhao S, Chen L, Zhang X, Yang Q, Sun J (2010) Temporal recommendation on graphs via long-and short-term preference fusion. In: Proceedings of the 16th ACM SIGKDD international conference on knowledge discovery and data mining, pp 723–732

  63. 63.

    Xin X, Liu Z, Lin C, Huang H, Wei X, Guo P (2015) Cross-domain collaborative filtering with review text. In: Proceedings of the 24th international joint conference on artificial intelligence, pp 1827–1833

  64. 64.

    Xu Y, Lam W, Lin T (2014) Collaborative filtering incorporating review text and co-clusters of hidden user communities and item groups. In: Proceedings of the 23rd ACM international conference on information and knowledge management, pp 251–260

  65. 65.

    Yang S, Long B, Alexander S, Zha H, Zheng Z (2011) Collaborative competitive filtering: learning recommender using context of user choice. In: Proceedings of the 34th international ACM SIGIR conference on research and development in information retrieval, pp 295–304

  66. 66.

    Yin H, Cui B, Chen L, Hu Z, Zhou X (2015) Dynamic user modeling in social media systems. ACM Trans Inf Syst (TOIS) 33(3):10

    Article  Google Scholar 

  67. 67.

    Yu J, Shen Y, Yang Z (2014) Topic-stg: Extending the session-based temporal graph approach for personalized tweet recommendation. In: Proceedings of the companion publication of the 23rd international conference on world wide web companion, pp 413–414

  68. 68.

    Yu K, Zhu S, Lafferty J, Gong Y (2009) Fast nonparametric matrix factorization for large-scale collaborative filtering. In: Proceedings of the 32nd international ACM SIGIR conference on research and development in information retrieval, pp 211–218

  69. 69.

    Zhang T, Iyengar VS (2002) Recommender systems using linear classifiers. J Mach Learn Res 2:313–334

    MATH  Google Scholar 

  70. 70.

    Zhang Y, Lai G, Zhang M, Zhang Y, Liu Y, Ma S (2014) Explicit factor models for explainable recommendation based on phrase-level sentiment analysis. In: Proceedings of the 37th international ACM SIGIR conference on research and development in information retrieval. ACM, pp 83–92

  71. 71.

    Zhang Y, Zhang M, Zhang Y, Lai G, Liu Y, Zhang H, Ma S (2015) Daily-aware personalized recommendation based on feature-level time series analysis. In: Proceedings of the 24th international conference on world wide web, pp 1373–1383

  72. 72.

    Zimdars A, Chickering DM, Meek C (2001) Using temporal data for making recommendations. In: Proceedings of the 7th conference on uncertainty in artificial intelligence, pp 580–588

Download references

Acknowledgements

The work described in this paper is substantially supported by grants from the Research Grant Council of the Hong Kong Special Administrative Region, China (Project Code: 14203414) and the Direct Grant of the Faculty of Engineering, CUHK (Project Code: 4055034). This work is also affiliated with the CUHK MoE-Microsoft Key Laboratory of Human-centric Computing and Interface Technologies.

Author information

Affiliations

Authors

Corresponding author

Correspondence to Qian Yu.

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Xu, Y., Yu, Q., Lam, W. et al. Exploiting interactions of review text, hidden user communities and item groups, and time for collaborative filtering. Knowl Inf Syst 52, 221–254 (2017). https://doi.org/10.1007/s10115-016-1005-1

Download citation

Keywords

  • Collaborative filtering
  • Latent factor modeling
  • Co-clustering