Abstract
In this paper we present an approach for diversifying user comments on news articles. In our proposed framework, we analyse user comments w.r.t. four different criteria in order to extract the respective diversification dimensions in the form of feature vectors. These criteria involve content similarity, sentiment expressed within comments, article’s named entities also found within comments and commenting behavior of the respective users. Then, we apply diversification on comments, utilizing the extracted features vectors. The outcome of this process is a subset of the initial comments that contains heterogeneous comments, representing different aspects of the news article, different sentiments expressed, as well as different user categories, w.r.t. their commenting behavior. We perform a preliminary qualitative analysis showing that the diversity criteria we introduce result in distinctively diverse subsets of comments, as opposed to a baseline of diversifying comments only w.r.t. to their content (textual similarity). We also present a prototype system that implements our diversification framework on news articles comments.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Tsagkias, M., Weerkamp, W., de Rijke, M.: News Comments:Exploring, Modeling, and Online Prediction. In: Gurrin, C., He, Y., Kazai, G., Kruschwitz, U., Little, S., Roelleke, T., Rüger, S., van Rijsbergen, K. (eds.) ECIR 2010. LNCS, vol. 5993, pp. 191–203. Springer, Heidelberg (2010)
Tsagkias, E., Weerkamp, W., de Rijke, M.: Predicting the volume of comments on online news stories. In: Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM 2009), pp. 1765–1768 (2009)
Diakopoulos, N., Naaman, M.: Towards quality discourse in online news comments. In: Proceedings of the ACM 2011 Conference on Computer Supported Cooperative Work (CSCW 2011), pp. 133–142 (2011)
Park, S., Ko, M., Kim, J., Liu, Y., Song, J.: The Politics of Comments: Predicting Political Orientation of News Stories with Commenters Sentiment Patterns. In: Proceedings of the ACM 2011 Conference on Computer Supported Cooperative Work (CSCW 2011), pp. 113–122 (2011)
Herring, S.C., Kouper, I., Paolillo, J.C., Scheidt, L.A., Tyworth, M., Welsch, P., Wright, E., Ning, Y.: Conversations in the Blogosphere: An Analysis “From the Bottom Up”. In: Proceedings of the 38th Annual Hawaii International Conference on System Sciences (HICSS 2005), p. 107b (2005)
Potthast, M.: Measuring the descriptiveness of web comments. In: Proceedings of the 32nd International ACM SIGIR Conference on Research and Development (SIGIR 2009), pp. 724–725 (2009)
Li, Q., Wang, J., Chen, Y.P., Lin, Z.: User comments for news recommendation in forum-based social media. Information Sciences: an International Journal 180(24), 4929–4939 (2010)
Shmueli, E., Kagian, A., Koren, Y., Lempel, R.: Care to Comment? Recommendations for Commenting on News Stories. In: Proceedings of the 18th International Conference on World Wide Web, WWW 2012 (to appear, 2012)
Hu, M., Sun, A., Lim, E.: Comments-oriented document summarization: understanding documents with readers’ feedback. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2008), pp. 291–298 (2008)
Mishne, G.A., Glance, N.: Leave a Reply: An Analysis of Weblog Comments. In: Proceedings of the WWW 2006 Workshop on Weblogging Ecosystem: Aggregation, Analysis and Dynamics, at WWW: The 15th International Conference on World Wide Web (2006)
Wong, D., Faridani, S., Bitton, E., Hartmann, B., Goldberg, K.: The diversity donut: enabling participant control over the diversity of recommended responses. In: Proceedings of the 2011 Annual Conference Extended Abstracts on Human Factors in Computing Systems (CHI EA 2011), pp. 1471–1476 (2011)
Munson, S.A., Resnick, P.: Presenting diverse political opinions: how and how much. In: Proceedings of the 28th International Conference on Human Factors in Computing Systems (CHI 2010), pp. 1457–1466 (2010)
Carbonell, J., Goldstein, J.: The use of MMR, diversity-based reranking for reordering documents and producing summaries. In: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 1998), pp. 335–336 (1998)
Chen, H., Karger, D.R.: Less is more: probabilistic models for retrieving fewer relevant documents. In: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2006), pp. 429–436 (2006)
Clarke, C.L.A., Kolla, M., Cormack, G.V., Vechtomova, O., Ashkan, A., Büttcher, S., MacKinnon, I.: Novelty and diversity in information retrieval evaluation. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2008), pp. 659–666 (2008)
Gollapudi, S., Sharma, A.: An axiomatic approach for result diversification. In: Proceedings of the 18th International Conference on World Wide Web (WWW 2009), pp. 381–390 (2009)
Agrawal, R., Gollapudi, S., Halverson, A., Ieong, S.: Diversifying search results. In: Proceedings of the Second International Conference on Web Search and Web Data Mining (WSDM 2009), pp. 5–14 (2009)
Vee, E., Srivastava, U., Shanmugasundaram, J., Bhat, P., Yahia, S.A.: Efficient Computation of Diverse Query Results. In: Proceedings of the 2008 IEEE 24th International Conference on Data Engineering (ICDE 2008), pp. 228–236 (2008)
Drosou, M., Pitoura, E.: Search result diversification. ACM SIGMOD Record 39(1), 41–47
Hassin, R., Rubinstein, S., Tamir, A.: Approximation algorithms for maximum dispersion. Operations Research Letters 21(3), 133–137 (1997)
Ravi, S., Rosenkrantzt, D.J., Tayi, G.K.: Approximation Algorithms for Facility Dispersion. In: Gonzalez, T.F. (ed.) Handbook of Approximation Algorithms and Metaheuristics. Chapman & Hall/CRC (2007)
Finkel, J.R., Grenager, T., Manning, C.: Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling. In: Proceedings of the 43nd Annual Meeting of the Association for Computational Linguistics (ACL 2005), pp. 363–370 (2005)
Thelwall, M., Buckley, K., Paltoglou, G., Cai, D., Kappas, A.: Sentiment strength detection in short informal text. Journal of the American Society for Information Science and Technology 61(12), 2544–2558 (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Giannopoulos, G., Weber, I., Jaimes, A., Sellis, T. (2012). Diversifying User Comments on News Articles. In: Wang, X.S., Cruz, I., Delis, A., Huang, G. (eds) Web Information Systems Engineering - WISE 2012. WISE 2012. Lecture Notes in Computer Science, vol 7651. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35063-4_8
Download citation
DOI: https://doi.org/10.1007/978-3-642-35063-4_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35062-7
Online ISBN: 978-3-642-35063-4
eBook Packages: Computer ScienceComputer Science (R0)