Abstract
Due to the huge content of data generated by various social networking sites like Twitter, it has become very difficult to extract the informative content from the huge lumps of data. Hundreds of millions of peoples are using Twitter, generating many posts on a daily basis; therefore, it is very challenging to extract and summarize the user-generated content. Moreover, the Twitter API also provides only latest posts in a sequential order. This motivates the dire need for a new automatic event summarization system to create event summaries supporting intelligence. In this paper, we intend to summarize the Twitter posts corresponding to Twitter hashtags to find a representative Twitter post with high quality, strong relevance, clear usefulness, and ideal acceptability. We used two approaches, namely temporal TF-IDF and temporal TF-IDF with keyword importance for finding the summary of the events. Then, we evaluate and compare these approaches using a self-generated dataset of Twitter posts and show that the system will automatically generate the summary of the selected hashtag.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Alsaedi, N., Burnap, P., Rana, O.: Temporal TF-IDF: a high-performance approach for event summarization in Twitter. In: IEEE/WIC/ACM International Conference on Web Intelligence, pp. 515–521 (2016)
Zhao, Y.: Twitter Data Analysis with R-Text Mining and Social Network Analysis. RDM, Canbera (2016)
Radev, D.R., Blair-Goldensohn, S., Zhang, Z.: Experiments in single and multi-document summarization using mead. In: First Document Understanding Conference (2001)
Becker, H., Naaman, M., Gravano, L.: Selecting quality Twitter content for events. In: Proceedings of the 5th International AAAI Conference on Weblogs and Social Media (ICWSM’11) (2011)
Sharifi, B., Hutton, M.-A., Kalita, J.: Summarizing microblogs automatically. In: Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, ser. HLT ’10, pp. 685–688 (2010)
Erkan, G., Radev, D.R.: Lexrank: graph-based lexical centrality as salience in text summarization. J. Artif. Intell. Res. 22(1), 457–479 (2004)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Maryam, A., Ali, R. (2019). Temporal TF-IDF-Based Twitter Event Summarization Incorporating Keyword Importance. In: Satapathy, S., Joshi, A. (eds) Information and Communication Technology for Intelligent Systems . Smart Innovation, Systems and Technologies, vol 107. Springer, Singapore. https://doi.org/10.1007/978-981-13-1747-7_54
Download citation
DOI: https://doi.org/10.1007/978-981-13-1747-7_54
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1746-0
Online ISBN: 978-981-13-1747-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)