Diversifying User Comments on News Articles

  • Giorgos Giannopoulos
  • Ingmar Weber
  • Alejandro Jaimes
  • Timos Sellis
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7651)

Abstract

In this paper we present an approach for diversifying user comments on news articles. In our proposed framework, we analyse user comments w.r.t. four different criteria in order to extract the respective diversification dimensions in the form of feature vectors. These criteria involve content similarity, sentiment expressed within comments, article’s named entities also found within comments and commenting behavior of the respective users. Then, we apply diversification on comments, utilizing the extracted features vectors. The outcome of this process is a subset of the initial comments that contains heterogeneous comments, representing different aspects of the news article, different sentiments expressed, as well as different user categories, w.r.t. their commenting behavior. We perform a preliminary qualitative analysis showing that the diversity criteria we introduce result in distinctively diverse subsets of comments, as opposed to a baseline of diversifying comments only w.r.t. to their content (textual similarity). We also present a prototype system that implements our diversification framework on news articles comments.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Tsagkias, M., Weerkamp, W., de Rijke, M.: News Comments:Exploring, Modeling, and Online Prediction. In: Gurrin, C., He, Y., Kazai, G., Kruschwitz, U., Little, S., Roelleke, T., Rüger, S., van Rijsbergen, K. (eds.) ECIR 2010. LNCS, vol. 5993, pp. 191–203. Springer, Heidelberg (2010)CrossRefGoogle Scholar
  2. 2.
    Tsagkias, E., Weerkamp, W., de Rijke, M.: Predicting the volume of comments on online news stories. In: Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM 2009), pp. 1765–1768 (2009)Google Scholar
  3. 3.
    Diakopoulos, N., Naaman, M.: Towards quality discourse in online news comments. In: Proceedings of the ACM 2011 Conference on Computer Supported Cooperative Work (CSCW 2011), pp. 133–142 (2011)Google Scholar
  4. 4.
    Park, S., Ko, M., Kim, J., Liu, Y., Song, J.: The Politics of Comments: Predicting Political Orientation of News Stories with Commenters Sentiment Patterns. In: Proceedings of the ACM 2011 Conference on Computer Supported Cooperative Work (CSCW 2011), pp. 113–122 (2011)Google Scholar
  5. 5.
    Herring, S.C., Kouper, I., Paolillo, J.C., Scheidt, L.A., Tyworth, M., Welsch, P., Wright, E., Ning, Y.: Conversations in the Blogosphere: An Analysis “From the Bottom Up”. In: Proceedings of the 38th Annual Hawaii International Conference on System Sciences (HICSS 2005), p. 107b (2005)Google Scholar
  6. 6.
    Potthast, M.: Measuring the descriptiveness of web comments. In: Proceedings of the 32nd International ACM SIGIR Conference on Research and Development (SIGIR 2009), pp. 724–725 (2009)Google Scholar
  7. 7.
    Li, Q., Wang, J., Chen, Y.P., Lin, Z.: User comments for news recommendation in forum-based social media. Information Sciences: an International Journal 180(24), 4929–4939 (2010)CrossRefGoogle Scholar
  8. 8.
    Shmueli, E., Kagian, A., Koren, Y., Lempel, R.: Care to Comment? Recommendations for Commenting on News Stories. In: Proceedings of the 18th International Conference on World Wide Web, WWW 2012 (to appear, 2012)Google Scholar
  9. 9.
    Hu, M., Sun, A., Lim, E.: Comments-oriented document summarization: understanding documents with readers’ feedback. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2008), pp. 291–298 (2008)Google Scholar
  10. 10.
    Mishne, G.A., Glance, N.: Leave a Reply: An Analysis of Weblog Comments. In: Proceedings of the WWW 2006 Workshop on Weblogging Ecosystem: Aggregation, Analysis and Dynamics, at WWW: The 15th International Conference on World Wide Web (2006)Google Scholar
  11. 11.
    Wong, D., Faridani, S., Bitton, E., Hartmann, B., Goldberg, K.: The diversity donut: enabling participant control over the diversity of recommended responses. In: Proceedings of the 2011 Annual Conference Extended Abstracts on Human Factors in Computing Systems (CHI EA 2011), pp. 1471–1476 (2011)Google Scholar
  12. 12.
    Munson, S.A., Resnick, P.: Presenting diverse political opinions: how and how much. In: Proceedings of the 28th International Conference on Human Factors in Computing Systems (CHI 2010), pp. 1457–1466 (2010)Google Scholar
  13. 13.
    Carbonell, J., Goldstein, J.: The use of MMR, diversity-based reranking for reordering documents and producing summaries. In: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 1998), pp. 335–336 (1998)Google Scholar
  14. 14.
    Chen, H., Karger, D.R.: Less is more: probabilistic models for retrieving fewer relevant documents. In: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2006), pp. 429–436 (2006)Google Scholar
  15. 15.
    Clarke, C.L.A., Kolla, M., Cormack, G.V., Vechtomova, O., Ashkan, A., Büttcher, S., MacKinnon, I.: Novelty and diversity in information retrieval evaluation. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2008), pp. 659–666 (2008)Google Scholar
  16. 16.
    Gollapudi, S., Sharma, A.: An axiomatic approach for result diversification. In: Proceedings of the 18th International Conference on World Wide Web (WWW 2009), pp. 381–390 (2009)Google Scholar
  17. 17.
    Agrawal, R., Gollapudi, S., Halverson, A., Ieong, S.: Diversifying search results. In: Proceedings of the Second International Conference on Web Search and Web Data Mining (WSDM 2009), pp. 5–14 (2009)Google Scholar
  18. 18.
    Vee, E., Srivastava, U., Shanmugasundaram, J., Bhat, P., Yahia, S.A.: Efficient Computation of Diverse Query Results. In: Proceedings of the 2008 IEEE 24th International Conference on Data Engineering (ICDE 2008), pp. 228–236 (2008)Google Scholar
  19. 19.
    Drosou, M., Pitoura, E.: Search result diversification. ACM SIGMOD Record 39(1), 41–47Google Scholar
  20. 20.
    Hassin, R., Rubinstein, S., Tamir, A.: Approximation algorithms for maximum dispersion. Operations Research Letters 21(3), 133–137 (1997)MathSciNetMATHCrossRefGoogle Scholar
  21. 21.
    Ravi, S., Rosenkrantzt, D.J., Tayi, G.K.: Approximation Algorithms for Facility Dispersion. In: Gonzalez, T.F. (ed.) Handbook of Approximation Algorithms and Metaheuristics. Chapman & Hall/CRC (2007)Google Scholar
  22. 22.
    Finkel, J.R., Grenager, T., Manning, C.: Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling. In: Proceedings of the 43nd Annual Meeting of the Association for Computational Linguistics (ACL 2005), pp. 363–370 (2005)Google Scholar
  23. 23.
    Thelwall, M., Buckley, K., Paltoglou, G., Cai, D., Kappas, A.: Sentiment strength detection in short informal text. Journal of the American Society for Information Science and Technology 61(12), 2544–2558 (2010)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Giorgos Giannopoulos
    • 1
    • 2
  • Ingmar Weber
    • 3
  • Alejandro Jaimes
    • 3
  • Timos Sellis
    • 1
    • 2
  1. 1.School of ECENTU AthensGreece
  2. 2.IMIS Institute, “Athena” Research CenterGreece
  3. 3.Yahoo! ResearchBarcelonaSpain

Personalised recommendations