Advertisement

World Wide Web

, Volume 20, Issue 6, pp 1293–1312 | Cite as

Towards better news article recommendation

With the help of user comments
  • Youssef Meguebli
  • Mouna KacimiEmail author
  • Bich-Liên Doan
  • Fabrice Popineau
Article

Abstract

News media platforms publish articles about daily events letting their users comment on them, and forming interesting discussions in almost real-time. To keep users always active and interested, media platforms need an effective recommender system to bring up new articles that match user interests. In this article, we show that we can improve the quality of recommendation by exploiting valuable information provided by user comments. This information reveals aspects not directly tackled by the news article on which they have been posted. We call such aspects latent aspects. We demonstrate how these latent aspects can make a crucial difference in the accuracy of future recommendation. The challenge in detecting them is due to the noisy nature of user comments. To support our claim, we propose a novel news recommendation system that (1) enriches the description of news articles by latent aspects extracted from user comments, (2) deals with noisy comments by proposing a model for user comments ranking, and (3) proposes a diversification model to remove redundancies and provide a wide coverage of aspects. We have tested our approach using large collections of real user activities in four news Web sites, namely The INDEPENDENT, The Telegraph, CNN and Al-Jazeera. The results show that our approach outperforms baseline approaches achieving a significantly higher accuracy.

Keywords

News recommendation User comments ranking Diversification 

References

  1. 1.
    Abbar, S., Amer-Yahia, S., Indyk, P., Mahabadi, S.: Real-time recommendation of diverse related articles. In: Proceedings of the 22nd international conference on world wide Web, pp 1–12. International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva (2013)Google Scholar
  2. 2.
    Abel, F., Gao, Q., Houben, G. -J., Tao, K.: Analyzing user modeling on twitter for personalized news recommendations. In: Proceedings of the 19th international conference on user modeling, adaption, and personalization, pp 1–12. Springer, Berlin (2011)Google Scholar
  3. 3.
    Aker, A., Kurtic, E., Balamurali, A., Paramita, M., Barker, E., Hepple, M., Gaizauskas, R: A graph-based approach to topic clustering for online comments to news. In: Advances in information retrieval, pp 15–29. Springer (2016)Google Scholar
  4. 4.
    Bansal, T., Das, M., Bhattacharyya, C: Content driven user profiling for commentworthy recommendations of news and blog articles. In: Proceedings of the 9th acm conference on recommender systems, pp 195–202. ACM, New York (2015)Google Scholar
  5. 5.
    Chen, J., Nairn, R., Nelson, L., Bernstein, M., Chi, E.: Short and tweet: Experiments on recommending content from information streams. In: Proceedings of the sigchi conference on human factors in computing systems, pp 1185–1194. ACM, New York (2010)Google Scholar
  6. 6.
    Danescu-Niculescu-Mizil, C., Kossinets, G., Kleinberg, J., Lee, L.: How opinions are received by online communities: a case study on amazon.com helpfulness votes. In: Proceedings of the 18th international conference on world wide Web, pp 141–150. ACM, New York (2009)Google Scholar
  7. 7.
    Ganesan, K., Zhai, C.: Opinion-based entity ranking. Inf. Retr. 15(2), 116–150 (2012)CrossRefGoogle Scholar
  8. 8.
    Gollapudi, S., Sharma, A.: An axiomatic approach for result diversification. In: Proceedings of the 18th international conference on world wide Web, pp 381–390. ACM, New York (2009a)Google Scholar
  9. 9.
    Gollapudi, S., Sharma, A.: An axiomatic approach for result diversification. In: Proceedings of the 18th international conference on world wide Web, pp 381–390. ACM, New York (2009b)Google Scholar
  10. 10.
    Hassin, R., Rubinstein, S., Tamir, A.: Approximation algorithms for maximum dispersion. Oper. Res. Lett. 21, 133–137 (1997)MathSciNetCrossRefzbMATHGoogle Scholar
  11. 11.
    Hu, M., Sun, A., Lim, E. -P.: Comments-oriented document summarization: Understanding documents with readers’ feedback. In: Proceedings of the 31st annual international acm sigir conference on research and development in information retrieval, pp 291–298. ACM, New York (2008)Google Scholar
  12. 12.
    Kant, R., Sengamedu, S.H., Kumar, K.S.: Comment spam detection by sequence mining. In: Proceedings of the fifth acm international conference on Web search and data mining, pp 183–192. ACM, New York (2012)Google Scholar
  13. 13.
    Kim, S., Pantel, P., Chklovski, T., Pennacchiotti, M.: Automatically assessing review helpfulness. In: Proceedings of the 2006 conference on empirical methods in natural language processing, pp 423–430 (2006)Google Scholar
  14. 14.
    Korte, B., Hausmann, D.: An analysis of the greedy heuristic for independence systems. Ann. Discret. Math. 2, 65–74 (1978)MathSciNetCrossRefzbMATHGoogle Scholar
  15. 15.
    Li, Q., Wang, J., Chen, Y.P., Lin, Z.: User comments for news recommendation in forum-based social media. Inform. Sci. 180(24), 4929–4939 (2010a)Google Scholar
  16. 16.
    Li, Q., Wang, J., Chen, Y.P., Lin, Z: User comments for news recommendation in forum-based social media. Inf. Sci 180(24), 4929–4939 (2010b)Google Scholar
  17. 17.
    Lin, C., He, Y.: Joint sentiment/topic model for sentiment analysis. In: Proceedings of the 18th acm conference on information and knowledge management, pp 375–384. ACM, New York (2009)Google Scholar
  18. 18.
    Litvak, M., Matz, L.: Smartnews: Bringing order into comments chaos. In: Proceedings of the international conference on knowledge discovery and information retrieval, kdir, vol. 13 (2013)Google Scholar
  19. 19.
    Liu, C. -Y., Chen, M. -S., Tseng, C. -Y.: Incrests: Towards real-time incremental short text summarization on comment streams from social network services. IEEE Trans. Knowl. Data Eng. 27, 2986–3000 (2015)CrossRefGoogle Scholar
  20. 20.
    Liu, Y., Huang, X., An, A., Yu, X.: Modeling and predicting the helpfulness of online reviews. In: Eighth ieee international conference on data mining, 2008. icdm’08, pp 443–452 (2008)Google Scholar
  21. 21.
    Llewellyn, C., Grover, C., Oberlander, J: Summarizing newspaper comments. In: Icwsm (2014)Google Scholar
  22. 22.
    Marcheggiani, D, Täckström, O., Esuli, A., Sebastiani, F: Hierarchical multilabel conditional random fields for aspect-oriented opinion mining. In: Advances in information retrieval, pp 273–285. Springer (2014)Google Scholar
  23. 23.
    Meguebli, Y., Kacimi, M., Doan, B.-L., Popineau, F: Unsupervised approach for identifying users? political orientations. In: Advances in information retrieval, pp 507–512. Springer (2014)Google Scholar
  24. 24.
    Persing, I., Ng, V: Vote prediction on comments in social polls. In: Emnlp, pp 1127–1138 (2014)Google Scholar
  25. 25.
    Phelan, O., McCarthy, K., Smyth, B.: Using twitter to recommend real-time topical news. In: Proceedings of the third acm conference on recommender systems, pp 385–388. ACM, New York (2009)Google Scholar
  26. 26.
    Popescu, A. -M., Etzioni, O.: Extracting product features and opinions from reviews. In: Proceedings of the conference on human language technology and empirical methods in natural language processing, pp 339–346. Association for Computational Linguistics, Stroudsburg (2005)Google Scholar
  27. 27.
    Rayana, S., Akoglu, L.: Collective opinion spam detection: Bridging review networks and metadata. In: Proceedings of the 21th acm sigkdd international conference on knowledge discovery and data mining, pp 985–994. ACM, New York (2015)Google Scholar
  28. 28.
    Rendle, S., Freudenthaler, C., Gantner, Z., Schmidt-Thieme, L.: Bpr: Bayesian personalized ranking from implicit feedback. In: Proceedings of the twenty-fifth conference on uncertainty in artificial intelligence, pp 452–461. AUAI Press, Arlington (2009)Google Scholar
  29. 29.
    San Pedro, J., Yeh, T., Oliver, N.: Leveraging user comments for aesthetic aware image search reranking. In: Proceedings of the 21st international conference on world wide Web, pp 439–448. ACM, New York (2012)Google Scholar
  30. 30.
    Shmueli, E., Kagian, A., Koren, Y., Lempel, R.: Care to comment?: Recommendations for commenting on news stories. In: Proceedings of the 21st international conference on world wide Web, pp 429–438. ACM, New York (2012)Google Scholar
  31. 31.
    Terra, E., Clarke, C.L.A.: Frequency estimates for statistical word similarity measures. In: Proceedings of the 2003 conference of the north american chapter of the association for computational linguistics on human language technology, vol. 1, pp 165–172. Association for Computational Linguistics, Stroudsburg (2003)Google Scholar
  32. 32.
    Tsagkias, M., Weerkamp, W., de Rijke, M.: Predicting the volume of comments on online news stories. In: Proceedings of the 18th acm conference on information and knowledge management, pp 1765–1768. ACM, New York (2009)Google Scholar
  33. 33.
    Tsagkias, M., Weerkamp, W., de Rijke, M.: News comments: Exploring, modeling, and online prediction. In: Proceedings of the 32nd european conference on advances in information retrieval, pp 191–203. Springer, Berlin (2010)Google Scholar
  34. 34.
    Tsur, O., Rappoport, A: Revrank: a fully unsupervised algorithm for selecting the most helpful book reviews. In: International aaai conference on weblogs and social media (2009)Google Scholar
  35. 35.
    Wang, H., Lu, Y., Zhai, C.: Latent aspect rating analysis on review text data: a rating regression approach. In: Proceedings of the 16th acm sigkdd international conference on knowledge discovery and data mining, pp 783–792. ACM, New York (2010)Google Scholar
  36. 36.
    Ye, J., Kumar, S., Akoglu, L.: Temporal opinion spam detection by multivariate indicative signals (2016). arXiv:1603.01929
  37. 37.
    Yee, W.G., Yates, A., Liu, S., Frieder, O: Are Web user comments useful for search. In: Proceedings of LSDS-IR, pp 63–70 (2009)Google Scholar
  38. 38.
    Zhao, L., Huang, M., Sun, J., Luo, H., Yang, X., Zhu, X.: Sentiment extraction by leveraging aspect-opinion association structure. In: Proceedings of the 24th acm international on conference on information and knowledge management, pp 343–352. ACM, New York (2015)Google Scholar
  39. 39.
    Zhuang, L., Jing, F., Zhu, X. -Y.: Movie review mining and summarization. In: Proceedings of the 15th acm international conference on information and knowledge management, pp 43–50. ACM, New York (2006)Google Scholar

Copyright information

© Springer Science+Business Media New York 2017

Authors and Affiliations

  1. 1.Orange Labs – France TelecomRennesFrance
  2. 2.Free University of Bozen-BolzanBozen-BolzanoItaly
  3. 3.LRI – CentraleSupélecSaclayFrance

Personalised recommendations