Abstract
With the development of information technology, there is explosive growth in the number of online comment concerning news, blogs and so on. Good comments can improve the experience of reading, but the massive comments are overloaded, and the qualities of them vary greatly. Therefore, it is necessary to predict popular comments from all the comments. In this work, we introduce a novel task: popular comment prediction (PCP), which aims to find out which comments will be popular automatically. First, we construct a news comment corpus: Toutiao Comment Dataset, which consists of news, comments, and the corresponding label. Second, we analyze the dataset and find the popularity of comments can be measured in three aspects: informativeness, consistency, and novelty. Finally, we propose a novel multi-target text matching model, which can measure these three aspects by referring to the news and surrounding comments. Experimental results show that our method can outperform various baselines by a large margin on the new dataset.
N. Chen and S. Ma—Equally Contributed.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Bian, W., Li, S., Yang, Z., Chen, G., Lin, Z.: A compare-aggregate model with dynamic-clip attention for answer selection. In: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management 1987–1990 (2017)
Bromley, J., et al.: Signature ’ verification using A “siamese” time delay neural network. IJPRAI 7(4), 669–688 (1993)
Graves, A., Mohamed, A., Hinton, G.E.: Speech recognition with deep recurrent neural networks. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013, Vancouver, BC, Canada, 26–31 May 2013, pp. 6645–6649 (2013)
Hu, B., Lu, Z., Li, H., Chen, Q.: Convolutional neural network architectures for matching natural language sentences. In: Annual Conference on Neural Information Processing Systems 2014, pp. 2042–2050 (2014)
Kingma, D.P., Ba, J.: Adam: A method for stochastic optimization. CoRR abs/1412.6980 (2014)
Kolhatkar, V., Taboada, M.: Using New York times picks to identify constructive comments. In: Proceedings of the 2017 Workshop: Natural Language Processing meets Journalism, NLPmJ@EMNLP, Copenhagen, Denmark, 7 September 2017, pp. 100–105 (2017)
Kolhatkar, V., Wu, H., Cavasso, L., Francis, E., Shukla, K., Taboada, M.: The SFU opinion and comments corpus: A corpus for the analysis of online news comments (2018)
Ma, T., Wan, X.: Opinion target extraction in Chinese news comments. In: COLING 2010, 23rd International Conference on Computational Linguistics, Posters Volume, pp. 782–790 (2010)
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: 27th Annual Conference on Advances in Neural Information Processing Systems, vol. 26, pp. 3111–3119 (2013)
Napoles, C., Tetreault, J.R., Pappu, A., Rosato, E., Provenzale, B.: Finding good conversations online: the yahoo news annotated comments corpus. In: Proceedings of the 11th Linguistic Annotation Workshop, pp. 13–23 (2017)
Pang, L., Lan, Y., Guo, J., Xu, J., Wan, S., Cheng, X.: Text matching as image recognition. In: Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 12–17 February 2016, Phoenix, Arizona, USA, pp. 2793–2799 (2016)
Srivastava, N., Hinton, G.E., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15(1), 1929–1958 (2014)
Wan, S., Lan, Y., Guo, J., Xu, J., Pang, L., Cheng, X.: A deep architecture for semantic matching with multiple positional sentence representations. In: Thirtieth AAAI Conference on Artificial Intelligence, March 2016
Wang, Z., Hamza, W., Florian, R.: Bilateral multiperspective matching for natural language sentences. IJCAI 2017, 4144–4150 (2017)
Yang, L., Ai, Q., Guo, J., Croft, W.B.: aNMM: ranking short answer texts with attention-based neural matching model. In: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, pp. 287–296. ACM (2016)
Zayats, V., Ostendorf, M.: Conversation modeling on Reddit using a graph-structured LSTM. Trans. Assoc. Comput. Linguist. 6, 121–132 (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Chen, D., Ma, S., Yang, P., Su, Q. (2019). Predicting Popular News Comments Based on Multi-Target Text Matching Model. In: Tang, J., Kan, MY., Zhao, D., Li, S., Zan, H. (eds) Natural Language Processing and Chinese Computing. NLPCC 2019. Lecture Notes in Computer Science(), vol 11838. Springer, Cham. https://doi.org/10.1007/978-3-030-32233-5_48
Download citation
DOI: https://doi.org/10.1007/978-3-030-32233-5_48
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-32232-8
Online ISBN: 978-3-030-32233-5
eBook Packages: Computer ScienceComputer Science (R0)