Abstract
Twitter is one of the most popular social media platforms for online users to create and share information. Tweets are short, informal, and large-scale, which makes it difficult for online users to find reliable and useful information, arising the problem of Twitter summarization. On the one hand, tweets are short and highly unstructured, which makes traditional document summarization methods difficult to handle Twitter data. On the other hand, Twitter provides rich social-temporal context beyond texts, bringing about new opportunities. In this paper, we investigate how to exploit social-temporal context for Twitter summarization. In particular, we provide a methodology to model temporal context globally and locally, and propose a novel unsupervised summarization framework with social-temporal context for Twitter data. To assess the proposed framework, we manually label a real-world Twitter dataset. Experimental results from the dataset demonstrate the importance of social-temporal context in Twitter summarization.
Similar content being viewed by others
References
Aker, A., Plaza, L., Lloret, E., Gaizauskas, R.: Multi-Document Summarization Techniques for Generating Image Descriptions: a Comparative Analysis. In: Multi-Source, Multilingual Information Extraction and Summarization (2013)
Carbonell, J., Goldstein, J.: The Use of MMR, diversity-based reranking for reordering documents and producing summaries. In: Proceedings of ACM SIGIR (1998)
Chang, Y., Wang, X., Mei, Q., Liu, Y.: Towards Twitter Context Summarization with User Influence Models Proceedings of WSDM (2013)
Daubechies, I.: The wavelet transform, time-frequency localization and signal analysis IEEE Transactions on Information Theory (1990)
Duan, Y., Chen, Z., Wei, F., Zhou, M., Shum, H.Y.: Twitter Topic Summarization by Ranking Tweets Using Social Influence and Content Quality. In: Pooceedings of COLING (2012)
Erkan, G., Radev, D.R.: Lexrank: Graph-based lexical centrality as salience in text summarization JAIR (2004)
Gong, Y., Liu, X.: Generic Text Summarization Using Relevance Measure and Latent Semantic Analysis. In: SIGIR (2001)
Inouye, D., Kalita, J.K.: Comparing Twitter Summarization Algorithms for Multiple Post Summaries. In: Socialcom (2011)
Johnstone, I.M., Silverman, B.W.: Wavelet threshold estimators for data with correlated noise. Journal of the Royal Statistical Society: Series B (Statistical Methodology) (1997)
Jones, K.S.: Automatic summarising: The state of the art Information Processing & Management (2007)
Kessler, R., Tannier, X., Hagege, C., Moriceau, V., Bittar, A.: Finding Salient Dates for Building Thematic Timelines. In: ACL (2012)
Lin, C.Y.: Rouge: A package for automatic evaluation of summaries. In: Text Summarization Branches Out: Proceedings of the ACL-04 Workshop
Mallat, S.G.: A theory for multiresolution signal decomposition: the wavelet representation TPAMI (1989)
Mihalcea, R., Tarau, P.: Textrank: Bringing Order into Texts. In: Proceedings of EMNLP (2004)
Misiti, M., Misiti, Y., Oppenheim, G., Poggi, J.: Wavelet toolbox. The MathWorks Inc., Natick, MA (1996)
Nenkova, A., McKeown, K.: A Survey of Text Summarization Techniques. In: Mining Text Data. Springer (2012)
Nenkova, A., Vanderwende, L.: The impact of frequency on summarization. Microsoft Research. Technical Report MSR-TR-2005-101, Redmond, Washington (2005)
Nichols, J., Mahmud, J., Drews, C.: Summarizing Sporting Events Using Twitter. In: IUI (2012)
Shamma, D.A., Kennedy, L., Churchill, E.F.: Peaks and Persistence: Modeling the Shape of Microblog Conversations. In: Proceedings of the ACM Conference on Computer Supported Cooperative Work (2011)
Sharifi, B., Hutton, M.A., Kalita, J.: Summarizing Microblogs Automatically. In: Proceedings of HLT-NAACL (2010)
Shen, C., Liu, F., Weng, F., Li, T.: A Participant-Based Approach for Event Summarization Using Twitter Streams. In: NAACL-HLT (2013)
Shi, Z., Melli, G., Wang, Y., Liu, Y., Gu, B., Kashani, M.M., Sarkar, A., Popowich, F.: Question Answering Summarization of Multiple Biomedical Documents. In: Advances in Artificial Intelligence (2007)
Vanderwende, L., Suzuki, H., Brockett, C., Nenkova, A.: Beyond SumBasic: Task-focused summarization with sentence simplification and lexical expansion. Information Process- ing & Management (2007)
Wang, D., Li, T., Zhu, S., Ding, C.: Multi-Document Summarization via Sentence-Level Semantic Analysis and Symmetric Matrix Factorization. In: SIGIR (2008)
Weng, J., Lee, B.S.: Event Detection in Twitter. In: Proceedings of ICWSM (2011)
Zubiaga, A., Spina, D., Amigó, E., Gonzalo, J.: Towards real-time summarization of scheduled events from twitter streams. In: Proceedings of the 23nd ACM conference on Hypertext and Social Media (HT’12)
Acknowledgments
This work was supported in part by National Key Basic Research and Development Program of China (973 Program) under Grant 2013CB329304,2013CB329301, National Natural Science Foundation of China (Grant No:61100123,61472277), Ministry of Education Fund of China for the Doctoral (Grant No:20110032120040) and Tianjin Younger Natural Science Foundation (Grant No:14JCQNJC00400).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
He, R., Liu, Y., Yu, G. et al. Twitter summarization with social-temporal context. World Wide Web 20, 267–290 (2017). https://doi.org/10.1007/s11280-016-0386-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11280-016-0386-0