Abstract
How to detect bursty events in data streams on social media is a hot research topic in natural language processing. However, current methods for extracting bursty events suffer from poor accuracy and low efficiency. Fortunately, sentiment analysis has been applied to event detection, which has improved the performance greatly. Inspired by this, this paper proposes a new model which utilizes sentiment analysis for Chinese bursty event detection. First, we build a sentiment co-occurrence graph offline and apply it to analyze microblog sentiment. Plutchik’s emotion wheel is the base for the sentiment classification of the graph. Second, sentiment is used as features to detect bursts in microblog streams online. At last, we exploit regular expressions to extract hashtags in bursty periods and segment hashtags into keywords. By using mutual information and frequent patterns, we fetch words relevant to hashtags as keywords to form events. This approach can detect bursty events online while analyzing the sentiment of microblogs. The experimental results on a large real dataset show that our method can detect bursty events with higher accuracy in a shorter time than traditional methods.
Similar content being viewed by others
References
Adedoyin-Olowe M, Gaber MM, Dancausa CM, Stahl F, Gomes JB (2016) A rule dynamics approach to event detection in twitter with its application to sports and politics. Expert Syst Appl 55:351–360
Anjaria M, Guddeti RMR (2014) A novel sentiment analysis of social networks using supervised learning. Soc Netw Anal 4(1):1–15
Atefeh F, Khreich W (2013) A survey of techniques for event detection in twitter. Comput Intell 31(1):132–164
Baccianella S, Esuli A, Sebastiani F (2010) Sentiwordnet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In: LREC, vol 10, pp 2200–2204
Bisht S, Toshniwal D (2014) EventStory: event detection using Twitter stream based on locality. In: Intelligent data engineering and automated learning - ideal 2014. Springer Berlin, Berlin, pp 394–403
Bollen J, Mao H, Zeng X (2010) Twitter mood predicts the stock market. Eprint Arxiv 2(1):1–8
Bouras C, Tsogkas V (2010) Assigning web news to clusters. In: 2010 Fifth international conference on internet and web applications and services (ICIW), pp 1–6
Cambria E, Olsher D, Rajagopal D (2014) Senticnet 3: a common and common-sense knowledge base for cognition-driven sentiment analysis. In: Proceedings of the twenty-eighth AAAI conference on artificial intelligence. AAAI Press, pp 1515–1521
Caschera MC, Ferri F, Grifoni P (2016) Sentiment analysis from textual to multimodal features in digital environments. In: International conference on management of digital ecosystems, pp 137–144
Cui A, Zhang M, Liu Y, Ma S (2011) Emotion tokens: Bridging the gap among multilingual twitter sentiment analysis. In: Information retrieval technology - 7th Asia information retrieval societies conference, AIRS 2011, Dubai, United Arab Emirates, December 18-20, 2011. Proceedings, pp 238–249
Cui A, Zhang M, Liu Y, Ma S, Zhang K (2012) Discover breaking events with popular hashtags in twitter. In: ACM international conference on information and knowledge management, pp 1794–1798
Dai XY, Chen QC, Wang XL, Xu J (2010) Online topic detection and tracking of financial news based on hierarchical clustering. In: International conference on machine learning and cybernetics, pp 3341–3346
Fan J, Liang R-Z (2016) Stochastic learning of multi-instance dictionary for earth mover’s distance-based histogram comparison. Neural Comput & Applic 1–11
Fung GPC, Yu JX, Yu PS, Lu H (2005) Parameter free bursty events detection in text streams. In: International conference on very large data bases, pp 181–192
Fung GPC, Yu JX, Liu H, Yu PS (2007) Time-dependent event hierarchy construction. In: ACM SIGKDD international conference on knowledge discovery and data mining, San Jose, California, USA August, pp 300–309
Goorha S, Ungar L (2010) Discovery of significant emerging trends. In: ACM SIGKDD international conference on knowledge discovery and data mining, Washington, DC, USA, July, pp 57–64
Guàrdia-Sebaoun É, Rafrafi A, Guigue V, Gallinari P (2013) Cross-media sentiment classification and application to box-office forecasting. In: Proceedings of the 10th conference on open research areas in information retrieval. LE CENTRE DE HAUTES ETUDES INTERNATIONALES D’INFORMATIQUE DOCUMENTAIRE, pp 201–208
Han J, Pei J, Yin Y (2000) Mining frequent patterns without candidate generation. In: ACM Sigmod record. Vol 29. ACM, pp 1–12
He Q, Chang K, Lim EP (2007) Analyzing feature trajectories for event detection. In: International ACM SIGIR conference on research and development in information retrieval, pp 207–214
Kim HG, Kim C (2014) Interval clustering algorithm for fast event detection in stream monitoring applications. Pattern Recogn Lett 36(36):171–176
Kleinberg J (2003) Bursty and hierarchical structure in streams. In: Eighth ACM SIGKDD international conference on knowledge discovery and data mining, pp 373–397
Li H, Sun Z, Yang W (2015) Topic-based chinese message sentiment analysis: A multilayered analysis system. In: Eighth Sighan workshop on Chinese language processing, pp 144–148
Li Q, Zhou X, Gu A, Li Z, Liang R-Z (2016) Nuclear norm regularized convolutional max pos@top machine. Neural Comput & Applic, 1–10. https://doi.org/10.1007/s00521-016-2680-2
Liang R-Z, Shi L, Wang H, Meng J, Wang JJ-Y, Sun Q, Gu Y (2016) Optimizing top precision performance measure of content-based image retrieval by learning similarity function. arXiv:1604.06620
Liang R-Z, Xie W, Li W, Wang H, Wang JJ-Y, Taylor L (2016) A novel transfer learning method based on common space mapping and weighted domain matching. arXiv:1608.04581
Liu Q, Li S (2002) Word similarity computing based on how-net. Comput Linguist & Chin Lang Process 7(2):59–76
Lu TJ (2015) Semi-supervised microblog sentiment analysis using social relation and text similarity. In: International conference on big data and smart computing, pp 194–201
Nguyen T, Phung D, Adams B, Venkatesh S (2013) Event extraction using behaviors of sentiment signals and burst structure in social media. Knowl Inf Syst 37(2):279–304
Ortigosa-Hernández J, Rodríguez JD, Alzate L, Lucania M, Inza I, Lozano JA (2012) Approaching sentiment analysis by using semi-supervised learning of multi-dimensional classifiers. Neurocomputing 92:98–115
Paltoglou G (2015) Sentiment-based event detection in twitter. Journal of the Association for Information Science and Technology
Pang B, Lee L, Vaithyanathan S (2002) Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL-02 conference on Empirical methods in natural language processing-Volume 10. Association for Computational Linguistics, pp 79–86
Plutchik R (2001) The nature of emotions. Philos Stud 89(4):393–409
Popescu A-M, Pennacchiotti M (2010) Detecting controversial events from twitter. In: Proceedings of the 19th ACM international conference on information and knowledge management. ACM, pp 1873–1876
Preotiuc-Pietro D, Srijith P, Hepple M, Cohn T (2016) Studying the temporal dynamics of word co-occurrences. An application to event detection
Read J (2005) Using emoticons to reduce dependency in machine learning techniques for sentiment classification. In: Proceedings of the ACL student research workshop. Association for Computational Linguistics, pp 43–48
Shen G, Yang W, Wang W, Yu M (2015) Burst topic detection oriented large-scale microblogs streams. J Comp Res Dev 52(2):512–521
Snowsill T, Nicart F, Stefani M, De Bie T (2010) Finding surprising patterns in textual data streams. In: International workshop on cognitive information processing, pp 405–410
Stilo G, Velardi P (2015) Efficient temporal mining of micro-blog texts and its application to event discovery. Data Min Knowl Disc 30(2):372–402
Sun J, Wang G, Cheng X, Fu Y (2014) Mining affective text to improve social media item recommendation. Inf Process Manag 51(4):444–457
Thelwall M, Buckley K, Paltoglou G (2011) Sentiment in twitter events. J Am Soc Inf Sci Technol 62(2):406–418
Walther M, Kaisser M (2013) Geo-spatial event detection in the twitter stream. In: European conference on advances in information retrieval, pp 356–367
Wang X, Wei F, Liu X, Zhou M, Zhang M (2011) Topic sentiment analysis in twitter: a graph-based hashtag sentiment classification approach. In: Proceedings of the 20th ACM international conference on information and knowledge management. ACM, pp 1031–1040
Wang Z, Joo V, Tong C, Xin X, Chin HC (2014) Anomaly detection through enhanced sentiment analysis on social media data. In: IEEE international conference on cloud computing technology and science, pp 917–922
Weiler A, Grossniklaus M, Scholl MH (2015) Run-time and task-based performance of event detection techniques for twitter. In: International conference, CAISE, pp 35–49
Wu Q, Ma S, Liu Y (2015) Sub-event discovery and retrieval during natural hazards on social media data. World Wide Web-internet & Web Inf Syst, 1–21
Xie W, Zhu F, Jiang J, Lim EP, Wang K (2013) Topicsketch: Real-time bursty topic detection from twitter, 837–846
Yao J, Cui B, Huang Y, Zhou Y (2012) Bursty event detection from collaborative tags. World Wide Web-internet & Web Inf Syst 15(15):171–195
Zhang L-M, Jia Y, Zhou B, Zhao J-H, Hong F (2013) Online bursty events detection based on emoticons. Chin J Comput 36(8):1659–1667
Acknowledgements
This paper is supported by (1) the National Natural Science Foundation of China under Grant nos. 61672179, 61370083 and 61402126, (2) Research Fund for the Doctoral Program of Higher Education of China under Grant nos. 20122304110012, (3) the Youth Science Foundation of Heilongjiang Province of China under Grant no. QC2016083, (4) Heilongjiang postdoctoral Fund no. LBH-Z14071. This paper is also supported by China Scholarship Council.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Xiaomei, Z., Jing, Y. & Jianpei, Z. Sentiment-based and hashtag-based Chinese online bursty event detection. Multimed Tools Appl 77, 21725–21750 (2018). https://doi.org/10.1007/s11042-017-5531-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-017-5531-y