Advertisement

Knowledge and Information Systems

, Volume 52, Issue 1, pp 221–254 | Cite as

Exploiting interactions of review text, hidden user communities and item groups, and time for collaborative filtering

  • Yinqing Xu
  • Qian YuEmail author
  • Wai Lam
  • Tianyi Lin
Regular Paper

Abstract

Rich side information concerning users and items are valuable for collaborative filtering (CF) algorithms for recommendation. For example, rating score is often associated with a piece of review text, which is capable of providing valuable information to reveal the reasons why a user gives a certain rating. Moreover, the underlying community and group relationship buried in users and items are potentially useful for CF. In this paper, we develop a new model to tackle the CF problem which predicts user’s ratings on previously unrated items by effectively exploiting interactions among review texts as well as the hidden user community and item group information. We call this model CMR (co-clustering collaborative filtering model with review text). Specifically, we employ the co-clustering technique to model the user community and item group, and each community–group pair corresponds to a co-cluster, which is characterized by a rating distribution in exponential family and a topic distribution. We have conducted extensive experiments on 22 real-world datasets, and our proposed model CMR outperforms the state-of-the-art latent factor models. Furthermore, both the user’s preference and item profile are drifting over time. Dynamic modeling the temporal changes in user’s preference and item profiles are desirable for improving a recommendation system. We extend CMR and propose an enhanced model called TCMR to consider time information and exploit the temporal interactions among review texts and co-clusters of user communities and item groups. In this TCMR model, each community–group co-cluster is characterized by an additional beta distribution for time modeling. To evaluate our TCMR model, we have conducted another set of experiments on 22 larger datasets with wider time span. Our proposed model TCMR performs better than CMR and the standard time-aware recommendation model on the rating score prediction tasks. We also investigate the temporal effect on the user–item co-clusters.

Keywords

Collaborative filtering Latent factor modeling Co-clustering 

Notes

Acknowledgements

The work described in this paper is substantially supported by grants from the Research Grant Council of the Hong Kong Special Administrative Region, China (Project Code: 14203414) and the Direct Grant of the Faculty of Engineering, CUHK (Project Code: 4055034). This work is also affiliated with the CUHK MoE-Microsoft Key Laboratory of Human-centric Computing and Interface Technologies.

References

  1. 1.
    Agarwal D, Chen B-C (2010) flda: matrix factorization through latent dirichlet allocation. In: Proceedings of the 3rd ACM international conference on web search and data mining, pp 91–100Google Scholar
  2. 2.
    Agarwal D, Merugu S (2007) Predictive discrete latent factor models for large scale dyadic data. In: Proceedings of the 13th ACM SIGKDD international conference on knowledge discovery and data mining, pp 26–35Google Scholar
  3. 3.
    Aizenberg N, Koren Y, Somekh O (2012) Build your own music recommender by modeling internet radio streams. In: Proceedings of the 21st international conference on world wide web, pp 1–10Google Scholar
  4. 4.
    Almahairi A, Kastner K, Cho K, Courville A (2015) Learning distributed representations from reviews for collaborative filtering. In: Proceedings of the 9th ACM conference on recommender systems, pp 147–154Google Scholar
  5. 5.
    Baltrunas L, Amatriain X (2009) Towards time-dependant recommendation based on implicit feedback. In: Workshop on context-aware recommender systemsGoogle Scholar
  6. 6.
    Bao Y, Fang H, Zhang J (2014) Topicmf: Simultaneously exploiting ratings and reviews for recommendation. In: Proceedings of the 28th AAAI conference on artificial intelligence, pp 2–8Google Scholar
  7. 7.
    Bell R, Koren Y, Volinsky, C (2007) Modeling relationships at multiple scales to improve accuracy of large recommender systems. In: Proceedings of the 13rd ACM SIGKDD international conference on knowledge discovery and data mining, pp 95–104Google Scholar
  8. 8.
    Beutel A, Murray K, Faloutsos C, Smola A (2014) Cobafi: collaborative bayesian filtering. In: Proceedings of the 23rd international conference on world wide web, pp 97–108Google Scholar
  9. 9.
    Bishop CM (2006) Pattern recognition and machine learning. Information science and statistics. Springer, New YorkzbMATHGoogle Scholar
  10. 10.
    Blei D, Ng A, Jordan M (2003) Latent dirichlet allocation. J Mach Learn Res 3:993–1022zbMATHGoogle Scholar
  11. 11.
    Bobadilla J, Ortega F, Hernando A, Gutiérrez A (2013) Recommender systems survey. Knowl Based Syst 46:109–132CrossRefGoogle Scholar
  12. 12.
    Cai Y, Leung H-F, Li Q, Min H, Tang J, Li J (2014) Typicality-based collaborative filtering recommendation. IEEE Trans Knowl Data Eng 26(3):766–779CrossRefGoogle Scholar
  13. 13.
    Chen C, Li D, Zhao Y, Lv Q, Shang L (2015) Wemarec: accurate and scalable recommendation through weighted and ensemble matrix approximation. In: Proceedings of the 38th international ACM SIGIR conference on research and development in information retrieval, pp 303–312Google Scholar
  14. 14.
    Chen T, Han W, Wang H, Zhou Y, Xu B, Zang B (2007) Content recommendation system based on private dynamic user profile. In: IEEE international conference on machine learning and cybernetics, pp 2112–2118Google Scholar
  15. 15.
    Cheng Y, Church G (2000) Biclustering of expression data. In: Proceedings of the 8th international conference on intelligent systems for molecular biology, vol 8, pp 93–103Google Scholar
  16. 16.
    Chu W, Park S (2009) Personalized recommendation on dynamic content using predictive bilinear models. In: Proceedings of the 18th international conference on world wide web, pp 691–700Google Scholar
  17. 17.
    DeCoste D (2006) Collaborative prediction using ensembles of maximum margin matrix factorizations. In: Proceedings of the 23rd international conference on machine learning, pp 249–256Google Scholar
  18. 18.
    Dhillon I, Mallela S, Modha D (2003) Information-theoretic co-clustering. In: Proceedings of the 9th ACM SIGKDD international conference on knowledge discovery and data mining, pp 89–98Google Scholar
  19. 19.
    Diao Q, Qiu M, Wu C-Y, Smola AJ, Jiang J, Wang C (2014) Jointly modeling aspects, ratings and sentiments for movie recommendation (jmars). In: Proceedings of the 20th ACM SIGKDD international conference on knowledge discovery and data mining, pp 193–202Google Scholar
  20. 20.
    Ding Y, Li X (2005) Time weight collaborative filtering. In: Proceedings of the 14th ACM international conference on information and knowledge management, pp 485–492Google Scholar
  21. 21.
    Gaillard J, Renders J-M (2015) Time-sensitive collaborative filtering through adaptive matrix completion. In: Proceedings of the 37th European conference on information retrieval, pp 327–332Google Scholar
  22. 22.
    Ganu G, Elhadad Y, Marian A (2009) Beyond the stars: improving rating predictions using review text content. In: Proceedings of the 12th international workshop on the web and databasesGoogle Scholar
  23. 23.
    Geman S, Geman D (1984) Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. IEEE Trans Pattern Anal Mach Intell 6:721–741CrossRefzbMATHGoogle Scholar
  24. 24.
    George T, Merugu S (2005) A scalable collaborative filtering framework based on co-clustering. In: Proceedings of the 5th IEEE international conference on data mining, pp 625–628Google Scholar
  25. 25.
    Geuens S (2015) Factorization machines for hybrid recommendation systems based on behavioral, product, and customer data. In: Proceedings of the 9th ACM conference on recommender systems, pp 379–382Google Scholar
  26. 26.
    Griffiths T, Steyvers M (2004) Finding scientific topics. In: Proceedings of the National academy of Sciences of the United States of America, pp 5228–5235Google Scholar
  27. 27.
    Guan L, Alam MH, Ryu W, Lee S (2016) A phrase-based model to discover hidden factors and hidden topics in recommender systems. In: IEEE international conference on big data and smart computing (BigComp), pp 337–340Google Scholar
  28. 28.
    Hartigan J (1972) Direct clustering of a data matrix. J Am Stat Assoc 67(337):123–129CrossRefGoogle Scholar
  29. 29.
    He X, Chen T, Kan M, Chen X (2015) Trirank: review-aware explainable recommendation by modeling aspects. In: Proceedings of the 24th ACM international conference on information and knowledge management, pp 1661–1670Google Scholar
  30. 30.
    Heckel R, Vlachos M (2016) Interpretable recommendations via overlapping co-clusters. arXiv preprint arXiv:1604.02071
  31. 31.
    Hoffman M, Bach F, Blei D (2010) Online learning for latent dirichlet allocation. In: Proceedings of advances neural information processing systems, pp 856–864Google Scholar
  32. 32.
    Hong L, Doumith A, Davison B (2013) Co-factorization machines: modeling user interests and predicting individual decisions in twitter. In: Proceedings of the sixth ACM international conference on web search and data mining, pp 557–566Google Scholar
  33. 33.
    Koren Y (2008) Factorization meets the neighborhood: a multifaceted collaborative filtering model. In: Proceedings of the 14th ACM SIGKDD international conference on knowledge discovery and data mining, pp 426–434Google Scholar
  34. 34.
    Koren Y (2010) Collaborative filtering with temporal dynamics. Commun ACM 53(4):89–97CrossRefGoogle Scholar
  35. 35.
    Koren Y, Bell R, Volinsky C (2009) Matrix factorization techniques for recommender systems. Computer 42(8):30–37CrossRefGoogle Scholar
  36. 36.
    Lacoste-Julien S, Sha F, Jordan M (2008) Disclda: discriminative learning for dimensionality reduction and classification. In: Proceedings of advances neural information processing systems, vol 83, p 85Google Scholar
  37. 37.
    Lathia N, Hailes S, Capra L (2009) Temporal collaborative filtering with adaptive neighbourhoods. In: Proceedings of the 32nd international ACM SIGIR conference on research and development in information retrieval, pp 796–797Google Scholar
  38. 38.
    Loni B, Shi Y, Larson M, Hanjalic A (2014) Cross-domain collaborative filtering with factorization machines. In: 36th European conference on information retrieval, pp 656–661Google Scholar
  39. 39.
    McAuley J, Leskovec J (2013) From amateurs to connoisseurs: modeling the evolution of user expertise through online reviews. In: Proceedings of the 22nd international conference on world wide web, pp 897–908Google Scholar
  40. 40.
    McAuley J, Leskovec J (2013) Hidden factors and hidden topics: understanding rating dimensions with review text. In: Proceedings of the 7th ACM conference on recommender systems, pp 165–172Google Scholar
  41. 41.
    McAuley J, Leskovec J, Jurafsky, D (2012) Learning attitudes and attributes from multi-aspect reviews. In: Proceedings of the 12th IEEE international conference on data mining, pp 1020–1025Google Scholar
  42. 42.
    Paterek A (2007) Improving regularized singular value decomposition for collaborative filtering. In: Proceedings of KDD cup and workshop, PP 5–8Google Scholar
  43. 43.
    Qiang R, Liang F, Yang J (2013) Exploiting ranking factorization machines for microblog retrieval. In: Proceedings of the 22nd ACM international conference on information and knowledge management, pp 1783–1788Google Scholar
  44. 44.
    Rendle S (2012) Factorization machines with libfm. ACM Trans Intell Syst Technol (TIST) 3(3):57Google Scholar
  45. 45.
    Rennie J, Srebro N (2005) Fast maximum margin matrix factorization for collaborative prediction. In: Proceedings of the 22nd international conference on machine learning, pp 713–719Google Scholar
  46. 46.
    Salakhutdinov R, Andriy M (2007) Probabilistic matrix factorization. In: Proceedings of advances neural information processing systems, vol 1, pp 1–2Google Scholar
  47. 47.
    Salakhutdinov R, Andriy M (2008) Bayesian probabilistic matrix factorization using markov chain monte carlo. In: Proceedings of the 33rd international conference on machine learning, pp 880–887Google Scholar
  48. 48.
    Sarwar B, Karypis G, Konstan J, Riedl J (2002) Incremental singular value decomposition algorithms for highly scalable recommender systems. In: Proceedings of 5th international conference on computer and information science, pp 27–28Google Scholar
  49. 49.
    Shan H, Banerjee A (2008) Bayesian co-clustering. In: Proceedings of the 8th IEEE international conference on data mining, pp 530–539Google Scholar
  50. 50.
    Shi Y, Larson M, Hanjalic A (2014) Collaborative filtering beyond the user-item matrix: a survey of the state of the art and future challenges. ACM Comput Surv (CSUR) 47(1):3CrossRefGoogle Scholar
  51. 51.
    Srebro N, Alon N, Jaakkola T (2004) Generalization error bounds for collaborative prediction with low-rank matrices. In: Proceedings of advances neural information processing systemsGoogle Scholar
  52. 52.
    Srebro N, Jaakkola T (2003) Weighted low-rank approximations. In: Proceedings of the 20th international conference on machine learning, vol 3, pp 720–727Google Scholar
  53. 53.
    Srebro N, Rennie J, Jaakkola T (2004) Maximum-margin matrix factorization. In: Proceedings of advances neural information processing systems, vol 17, pp 1329–1336Google Scholar
  54. 54.
    Tan C, Chi E, Huffaker D, Kossinets G, Alexander S (2013) Instant foodie: predicting expert ratings from grassroots. In: Proceedings of the 22nd ACM international conference on information and knowledge management, pp 1127–1136Google Scholar
  55. 55.
    Titov I, McDonald R (2008) A joint model of text and aspect ratings for sentiment summarization. In: Proceedings of the 46th annual meeting of the Association for Computational Linguistics, pp 308–316Google Scholar
  56. 56.
    Titov I, McDonald R (2008) Modeling online reviews with multi-grain topic models. In: Proceedings of the 17th international conference on world wide web, pp 111–120Google Scholar
  57. 57.
    Wang C, Blei D (2011) Collaborative topic modeling for recommending scientific articles. In: Proceedings of the 17th ACM SIGKDD international conference on knowledge discovery and data mining, pp 448–456Google Scholar
  58. 58.
    Wang X, McCallum A (2006) Topics over time: a non-markov continuous-time model of topical trends. In: Proceedings of the 12th ACM SIGKDD international conference on knowledge discovery and data mining, pp 424–433Google Scholar
  59. 59.
    Weimer M, Karatzoglou A, Le Q, Smola A (2007) Maximum margin matrix factorization for collaborative ranking. In: Proceedings of advances neural information processing systems, pp 1593–1600Google Scholar
  60. 60.
    Weimer M, Karatzoglou A, Smola A (2008) Improving maximum margin matrix factorization. Mach Learn 72(3):263–276CrossRefGoogle Scholar
  61. 61.
    Weston J, Bengio S, Usunier N (2011) Wsabie: Scaling up to large vocabulary image annotation. In: Proceedings of the 22nd international joint conference on artificial intelligence, pp 2764–2770Google Scholar
  62. 62.
    Xiang L, Yuan Q, Zhao S, Chen L, Zhang X, Yang Q, Sun J (2010) Temporal recommendation on graphs via long-and short-term preference fusion. In: Proceedings of the 16th ACM SIGKDD international conference on knowledge discovery and data mining, pp 723–732Google Scholar
  63. 63.
    Xin X, Liu Z, Lin C, Huang H, Wei X, Guo P (2015) Cross-domain collaborative filtering with review text. In: Proceedings of the 24th international joint conference on artificial intelligence, pp 1827–1833Google Scholar
  64. 64.
    Xu Y, Lam W, Lin T (2014) Collaborative filtering incorporating review text and co-clusters of hidden user communities and item groups. In: Proceedings of the 23rd ACM international conference on information and knowledge management, pp 251–260Google Scholar
  65. 65.
    Yang S, Long B, Alexander S, Zha H, Zheng Z (2011) Collaborative competitive filtering: learning recommender using context of user choice. In: Proceedings of the 34th international ACM SIGIR conference on research and development in information retrieval, pp 295–304Google Scholar
  66. 66.
    Yin H, Cui B, Chen L, Hu Z, Zhou X (2015) Dynamic user modeling in social media systems. ACM Trans Inf Syst (TOIS) 33(3):10CrossRefGoogle Scholar
  67. 67.
    Yu J, Shen Y, Yang Z (2014) Topic-stg: Extending the session-based temporal graph approach for personalized tweet recommendation. In: Proceedings of the companion publication of the 23rd international conference on world wide web companion, pp 413–414Google Scholar
  68. 68.
    Yu K, Zhu S, Lafferty J, Gong Y (2009) Fast nonparametric matrix factorization for large-scale collaborative filtering. In: Proceedings of the 32nd international ACM SIGIR conference on research and development in information retrieval, pp 211–218Google Scholar
  69. 69.
    Zhang T, Iyengar VS (2002) Recommender systems using linear classifiers. J Mach Learn Res 2:313–334zbMATHGoogle Scholar
  70. 70.
    Zhang Y, Lai G, Zhang M, Zhang Y, Liu Y, Ma S (2014) Explicit factor models for explainable recommendation based on phrase-level sentiment analysis. In: Proceedings of the 37th international ACM SIGIR conference on research and development in information retrieval. ACM, pp 83–92Google Scholar
  71. 71.
    Zhang Y, Zhang M, Zhang Y, Lai G, Liu Y, Zhang H, Ma S (2015) Daily-aware personalized recommendation based on feature-level time series analysis. In: Proceedings of the 24th international conference on world wide web, pp 1373–1383Google Scholar
  72. 72.
    Zimdars A, Chickering DM, Meek C (2001) Using temporal data for making recommendations. In: Proceedings of the 7th conference on uncertainty in artificial intelligence, pp 580–588Google Scholar

Copyright information

© Springer-Verlag London 2016

Authors and Affiliations

  1. 1.Department of Systems Engineering and Engineering ManagementChinese University of Hong KongHong KongChina

Personalised recommendations