Abstract
The boom of social media platforms like Twitter brings the large scale short, noisy and redundant messages, making it difficult for people to obtain essential information. We study extractive topic-oriented social summarization to help people grasp the core information on social media quickly. Previous methods mainly extract salient content based on textual information and shallow social signals. They ignore that user generated messages propagate along the social network and affect users on their dissemination path, leading to user-level redundancy. Besides, hashtags on social media are a special kind of social signals, which can be regarded as keywords of a post and contain abundant semantics. In this paper, we propose to leverage social theories and social signals (i.e. multi-order social relations and hashtags) to address the redundancy problem and extract diverse summaries. Specifically, we propose a novel unsupervised social summarization framework which considers Social Contagion and Hashtag Consistency (SCHC) theories. To model relations among tweets, two relation graphs are constructed based on user-level and hashtag-level interaction among tweets. These social relations are further integrated into a sparse reconstruction framework to alleviate the user-level and hashtag-level redundancy respectively. Experimental results on the CTS dataset prove that our approach is effective.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bi, B., Tian, Y., Sismanis, Y., Balmin, A., Cho, J.: Scalable topic-specific influence analysis on microblogs. In: Proceedings of the 7th ACM International Conference on Web Search and Data Mining, pp. 513–522 (2014)
Chang, Y., Tang, J., Yin, D., Yamada, M., Liu, Y.: Timeline summarization from social media with life cycle models. In: Proceedings of the 25th International Joint Conference on Artificial Intelligence, pp. 3698–3704 (2016)
Duan, Y., Chen, Z., Wei, F., Zhou, M., Shum, H.Y.: Twitter topic summarization by ranking tweets using social influence and content quality. In: Proceedings of the 24th International Conference on Computational Linguistics, pp. 763–780 (2012)
Erkan, G., Radev, D.R.: Lexrank: graph-based lexical centrality as salience in text summarization. J. Artif. Intell. Res. 22(1), 457–479 (2004)
Gong, Y., Liu, X.: Generic text summarization using relevance measure and latent semantic analysis. In: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 19–25 (2001)
Gu, Q., Han, J.: Towards feature selection in network. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management, pp. 1175–1184 (2011)
He, R., Duan, X.: Twitter summarization based on social network and sparse reconstruction. In: Proceedings of the 32th AAAI Conference on Artificial Intelligence, pp. 5787–5794 (2018)
He, R., Liu, Y., Yu, G., Tang, J., Hu, Q., Dang, J.: Twitter summarization with social-temporal context. World Wide Web 20(2), 267–290 (2016). https://doi.org/10.1007/s11280-016-0386-0
He, Z., et al.: Document summarization based on data reconstruction. In: Proceedings of the 26th AAAI Conference on Artificial Intelligence, pp. 620–626 (2012)
Hopcroft, J.E., Lou, T., Tang, J.: Who will follow you back? reciprocal relationship prediction. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management, pp. 1137–1146 (2011)
Hu, X., Tang, L., Tang, J., Liu, H.: Exploiting social relations for sentiment analysis in microblogging. In: Proceedings of the 6th ACM International Conference on Web Search and Data Mining, pp. 537–546 (2013)
Iacopini, I., Petri, G., Barrat, A., Latora, V.: Simplicial models of social contagion. Nature Commun. 10(1), 2485 (2019)
Inouye, D., Kalita, J.K.: Comparing twitter summarization algorithms for multiple post summaries. In: 2011 IEEE Third international conference on privacy, Security, Risk and Trust and 2011 IEEE Third International Conference on Social Computing, pp. 298–306 (2011)
Landis, Koch, G.G.: The measurement of observer agreement for categorical data. Biometrics 33(1), 159–174 (1977)
Lin, C.Y.: ROUGE: A package for automatic evaluation of summaries. In: Workshop on Text Summarization Branches Out, Post-Conference Workshop of ACL, vol. 2004, pp. 74–81 (2004)
Liu, H., Yu, H., Deng, Z.H.: Multi-document summarization based on two-level sparse representation model. In: Proceedings of the 29th AAAI Conference on Artificial Intelligence, pp. 196–202 (2015)
Liu, X., Li, Y., Wei, F., Zhou, M.: Graph-based multi-tweet summarization using social signals. In: Proceedings of the 24th International Conference on Computational Linguistics, pp. 1699–1714 (2012)
Nguyen, M., Tran, D., Nguyen, L., Phan, X.: Exploiting user posts for web document summarization. ACM Trans. Knowl. Discovery Data 12(4), 49 (2018)
Nguyen, M., Tran, V.C., Nguyen, X.H., Nguyen, L.: Web document summarization by exploiting social context with matrix co-factorization. Inf. Process. Manage. 56(3), 495–515 (2019)
Nichols, J., Mahmud, J., Drews, C.: Summarizing sporting events using twitter. In: Proceedings of the 2012 ACM International Conference on Intelligent User Interfaces, pp. 189–198 (2012)
Abelson, R.P.: Whatever Became of Consistency Theory?. Pers. Soc. Psychol. Bull. 9(1), 37–54 (1983)
Romero, D.M., Meeder, B., Kleinberg, J.: Differences in the mechanics of information diffusion across topics: idioms, political hashtags, and complex contagion on twitter. In: Proceedings of the 20th International Conference on World Wide Web, pp. 695–704 (2011)
Rui, Y., Li, C.T., Hsieh, H.P., Hu, P., Hu, X., He, T.: Socialized language model smoothing via bi-directional influence propagation on social networks. In: Proceedings of the 25th International Conference on World Wide Web, pp. 1395–1406 (2016)
Sedhai, S., Sun, A.: Hspam14: A collection of 14 million tweets for hashtag-oriented spam research. In: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 223–232 (2015)
Shalizi, C.R., Thomas, A.C.: Homophily and contagion are generically confounded in observational social network. Sociol. Methods Res. 40(2), 211–239 (2011)
Shi, B., Ifrim, G., Hurley, N.: Learning-to-rank for real-time high-precision hashtag recommendation for streaming news. In: Proceedings of the 25th International Conference on World Wide Web, pp. 1191–1202 (2016)
Tsur, O., Rappoport, A.: What’s in a hashtag? content based prediction of the spread of ideas in microblogging communities. In: Proceedings of the Fifth ACM International Conference on Web Search and Data Mining, pp. 643–652 (2012)
Vosecky, J., Leung, K.W.T., Ng, W.: Collaborative personalized twitter search with topic-language models. In: Proceedings of the 37th International ACM SIGIR Conference on Research & Development in Information Retrieval, pp. 53–62 (2014)
Wang, X., Wei, F., Liu, X., Zhou, M., Zhang, M.: Topic sentiment analysis in twitter: a graph-based hashtag sentiment classification approach. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management, pp. 1031–1040 (2011)
Wang, X., Wang, Y., Zuo, W., Cai, G.: Exploring social context for topic identification in short and noisy texts. In: Proceedings of the 29th AAAI Conference on Artificial Intelligence, pp. 1868–1874 (2015)
Wang, Y., Li, J., King, I., Lyu, M.R., Shi, S.: Microblog hashtag generation via encoding conversation contexts. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 1624–1633 (2019)
Wu, Y., Zhang, H., Xu, B., Hao, H., Liu, C.: TR-LDA: a cascaded key-bigram extractor for microblog summarization. Int. J. Mach. Learn. Comput. 5(3), 172–178 (2015)
Zeng, Z., Yin, Y., Song, Y., Zhang, M.: Socialized word embeddings. In: Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, pp. 3915–3921 (2017)
Acknowledgement
We thank the anonymous reviewers for their valuable feedback. Our work is supported by the National Natural Science Foundation of China (61976154), the National Key R&D Program of China (2019YFC1521200), the Tianjin Natural Science Foundation (18JCYBJC15500), and the State Key Laboratory of Communication Content Cognition, People’s Daily Online (No.A32003).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
He, R., Liu, H., Zhao, L. (2021). SCHC: Incorporating Social Contagion and Hashtag Consistency for Topic-Oriented Social Summarization. In: Jensen, C.S., et al. Database Systems for Advanced Applications. DASFAA 2021. Lecture Notes in Computer Science(), vol 12682. Springer, Cham. https://doi.org/10.1007/978-3-030-73197-7_44
Download citation
DOI: https://doi.org/10.1007/978-3-030-73197-7_44
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-73196-0
Online ISBN: 978-3-030-73197-7
eBook Packages: Computer ScienceComputer Science (R0)