Predicting Popular News Comments Based on Multi-Target Text Matching Model

Chen, Deli; Ma, Shuming; Yang, Pengcheng; Su, Qi

doi:10.1007/978-3-030-32233-5_48

Conference paper
First Online: 30 September 2019

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 11838)

Abstract

With the development of information technology, the number of online comments on news, blogs, and similar content has grown explosively. Good comments can improve the reading experience, but readers are overloaded by the sheer volume of comments, whose quality varies greatly. It is therefore necessary to identify, among all the comments, those that will become popular. In this work, we introduce a novel task, popular comment prediction (PCP), which aims to automatically determine which comments will become popular. First, we construct a news comment corpus, the Toutiao Comment Dataset, which consists of news, comments, and the corresponding labels. Second, we analyze the dataset and find that the popularity of comments can be measured in three aspects: informativeness, consistency, and novelty. Finally, we propose a novel multi-target text matching model, which measures these three aspects by referring to the news and the surrounding comments. Experimental results show that our method outperforms various baselines by a large margin on the new dataset.

D. Chen and S. Ma contributed equally.

Notes

1. https://github.com/faneshion/MatchZoo

Author information

Authors and Affiliations

MOE Key Lab of Computational Linguistics, School of EECS, Peking University, Beijing, China
Deli Chen, Shuming Ma & Pengcheng Yang
Center for Data Science, Beijing Institute of Big Data Research, Peking University, Beijing, China
Pengcheng Yang
School of Foreign Languages, Peking University, Beijing, China
Qi Su

Corresponding author

Correspondence to Qi Su.

Editor information

Editors and Affiliations

Tsinghua University, Beijing, China
Jie Tang
National University of Singapore, Singapore, Singapore
Min-Yen Kan
Peking University, Beijing, China
Dongyan Zhao
Peking University, Beijing, China
Sujian Li
Zhengzhou University, Zhengzhou, China
Hongying Zan

About this paper

Cite this paper

Chen, D., Ma, S., Yang, P., Su, Q. (2019). Predicting Popular News Comments Based on Multi-Target Text Matching Model. In: Tang, J., Kan, MY., Zhao, D., Li, S., Zan, H. (eds) Natural Language Processing and Chinese Computing. NLPCC 2019. Lecture Notes in Computer Science, vol 11838. Springer, Cham. https://doi.org/10.1007/978-3-030-32233-5_48

DOI: https://doi.org/10.1007/978-3-030-32233-5_48
Published: 30 September 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-32232-8
Online ISBN: 978-3-030-32233-5
eBook Packages: Computer Science, Computer Science (R0)
