Skip to main content
Log in

Sentiment-based and hashtag-based Chinese online bursty event detection

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

How to detect bursty events in data streams on social media is a hot research topic in natural language processing. However, current methods for extracting bursty events suffer from poor accuracy and low efficiency. Fortunately, sentiment analysis has been applied to event detection, which has improved the performance greatly. Inspired by this, this paper proposes a new model which utilizes sentiment analysis for Chinese bursty event detection. First, we build a sentiment co-occurrence graph offline and apply it to analyze microblog sentiment. Plutchik’s emotion wheel is the base for the sentiment classification of the graph. Second, sentiment is used as features to detect bursts in microblog streams online. At last, we exploit regular expressions to extract hashtags in bursty periods and segment hashtags into keywords. By using mutual information and frequent patterns, we fetch words relevant to hashtags as keywords to form events. This approach can detect bursty events online while analyzing the sentiment of microblogs. The experimental results on a large real dataset show that our method can detect bursty events with higher accuracy in a shorter time than traditional methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18

Similar content being viewed by others

Notes

  1. http://www.techweb.com.cn/data/2016-05-12/2331298.shtml

  2. http://www.keenage.com/

  3. http://www.nlpir.org/?action-viewnews-itemid-263

References

  1. Adedoyin-Olowe M, Gaber MM, Dancausa CM, Stahl F, Gomes JB (2016) A rule dynamics approach to event detection in twitter with its application to sports and politics. Expert Syst Appl 55:351–360

    Article  Google Scholar 

  2. Anjaria M, Guddeti RMR (2014) A novel sentiment analysis of social networks using supervised learning. Soc Netw Anal 4(1):1–15

    Google Scholar 

  3. Atefeh F, Khreich W (2013) A survey of techniques for event detection in twitter. Comput Intell 31(1):132–164

    Article  MathSciNet  Google Scholar 

  4. Baccianella S, Esuli A, Sebastiani F (2010) Sentiwordnet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In: LREC, vol 10, pp 2200–2204

  5. Bisht S, Toshniwal D (2014) EventStory: event detection using Twitter stream based on locality. In: Intelligent data engineering and automated learning - ideal 2014. Springer Berlin, Berlin, pp 394–403

  6. Bollen J, Mao H, Zeng X (2010) Twitter mood predicts the stock market. Eprint Arxiv 2(1):1–8

    Google Scholar 

  7. Bouras C, Tsogkas V (2010) Assigning web news to clusters. In: 2010 Fifth international conference on internet and web applications and services (ICIW), pp 1–6

  8. Cambria E, Olsher D, Rajagopal D (2014) Senticnet 3: a common and common-sense knowledge base for cognition-driven sentiment analysis. In: Proceedings of the twenty-eighth AAAI conference on artificial intelligence. AAAI Press, pp 1515–1521

  9. Caschera MC, Ferri F, Grifoni P (2016) Sentiment analysis from textual to multimodal features in digital environments. In: International conference on management of digital ecosystems, pp 137–144

  10. Cui A, Zhang M, Liu Y, Ma S (2011) Emotion tokens: Bridging the gap among multilingual twitter sentiment analysis. In: Information retrieval technology - 7th Asia information retrieval societies conference, AIRS 2011, Dubai, United Arab Emirates, December 18-20, 2011. Proceedings, pp 238–249

  11. Cui A, Zhang M, Liu Y, Ma S, Zhang K (2012) Discover breaking events with popular hashtags in twitter. In: ACM international conference on information and knowledge management, pp 1794–1798

  12. Dai XY, Chen QC, Wang XL, Xu J (2010) Online topic detection and tracking of financial news based on hierarchical clustering. In: International conference on machine learning and cybernetics, pp 3341–3346

  13. Fan J, Liang R-Z (2016) Stochastic learning of multi-instance dictionary for earth mover’s distance-based histogram comparison. Neural Comput & Applic 1–11

  14. Fung GPC, Yu JX, Yu PS, Lu H (2005) Parameter free bursty events detection in text streams. In: International conference on very large data bases, pp 181–192

  15. Fung GPC, Yu JX, Liu H, Yu PS (2007) Time-dependent event hierarchy construction. In: ACM SIGKDD international conference on knowledge discovery and data mining, San Jose, California, USA August, pp 300–309

  16. Goorha S, Ungar L (2010) Discovery of significant emerging trends. In: ACM SIGKDD international conference on knowledge discovery and data mining, Washington, DC, USA, July, pp 57–64

  17. Guàrdia-Sebaoun É, Rafrafi A, Guigue V, Gallinari P (2013) Cross-media sentiment classification and application to box-office forecasting. In: Proceedings of the 10th conference on open research areas in information retrieval. LE CENTRE DE HAUTES ETUDES INTERNATIONALES D’INFORMATIQUE DOCUMENTAIRE, pp 201–208

  18. Han J, Pei J, Yin Y (2000) Mining frequent patterns without candidate generation. In: ACM Sigmod record. Vol 29. ACM, pp 1–12

  19. He Q, Chang K, Lim EP (2007) Analyzing feature trajectories for event detection. In: International ACM SIGIR conference on research and development in information retrieval, pp 207–214

  20. Kim HG, Kim C (2014) Interval clustering algorithm for fast event detection in stream monitoring applications. Pattern Recogn Lett 36(36):171–176

    Article  Google Scholar 

  21. Kleinberg J (2003) Bursty and hierarchical structure in streams. In: Eighth ACM SIGKDD international conference on knowledge discovery and data mining, pp 373–397

  22. Li H, Sun Z, Yang W (2015) Topic-based chinese message sentiment analysis: A multilayered analysis system. In: Eighth Sighan workshop on Chinese language processing, pp 144–148

  23. Li Q, Zhou X, Gu A, Li Z, Liang R-Z (2016) Nuclear norm regularized convolutional max pos@top machine. Neural Comput & Applic, 1–10. https://doi.org/10.1007/s00521-016-2680-2

  24. Liang R-Z, Shi L, Wang H, Meng J, Wang JJ-Y, Sun Q, Gu Y (2016) Optimizing top precision performance measure of content-based image retrieval by learning similarity function. arXiv:1604.06620

  25. Liang R-Z, Xie W, Li W, Wang H, Wang JJ-Y, Taylor L (2016) A novel transfer learning method based on common space mapping and weighted domain matching. arXiv:1608.04581

  26. Liu Q, Li S (2002) Word similarity computing based on how-net. Comput Linguist & Chin Lang Process 7(2):59–76

    Google Scholar 

  27. Lu TJ (2015) Semi-supervised microblog sentiment analysis using social relation and text similarity. In: International conference on big data and smart computing, pp 194–201

  28. Nguyen T, Phung D, Adams B, Venkatesh S (2013) Event extraction using behaviors of sentiment signals and burst structure in social media. Knowl Inf Syst 37(2):279–304

    Article  Google Scholar 

  29. Ortigosa-Hernández J, Rodríguez JD, Alzate L, Lucania M, Inza I, Lozano JA (2012) Approaching sentiment analysis by using semi-supervised learning of multi-dimensional classifiers. Neurocomputing 92:98–115

    Article  Google Scholar 

  30. Paltoglou G (2015) Sentiment-based event detection in twitter. Journal of the Association for Information Science and Technology

  31. Pang B, Lee L, Vaithyanathan S (2002) Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL-02 conference on Empirical methods in natural language processing-Volume 10. Association for Computational Linguistics, pp 79–86

  32. Plutchik R (2001) The nature of emotions. Philos Stud 89(4):393–409

    Google Scholar 

  33. Popescu A-M, Pennacchiotti M (2010) Detecting controversial events from twitter. In: Proceedings of the 19th ACM international conference on information and knowledge management. ACM, pp 1873–1876

  34. Preotiuc-Pietro D, Srijith P, Hepple M, Cohn T (2016) Studying the temporal dynamics of word co-occurrences. An application to event detection

  35. Read J (2005) Using emoticons to reduce dependency in machine learning techniques for sentiment classification. In: Proceedings of the ACL student research workshop. Association for Computational Linguistics, pp 43–48

  36. Shen G, Yang W, Wang W, Yu M (2015) Burst topic detection oriented large-scale microblogs streams. J Comp Res Dev 52(2):512–521

    Google Scholar 

  37. Snowsill T, Nicart F, Stefani M, De Bie T (2010) Finding surprising patterns in textual data streams. In: International workshop on cognitive information processing, pp 405–410

  38. Stilo G, Velardi P (2015) Efficient temporal mining of micro-blog texts and its application to event discovery. Data Min Knowl Disc 30(2):372–402

    Article  MathSciNet  Google Scholar 

  39. Sun J, Wang G, Cheng X, Fu Y (2014) Mining affective text to improve social media item recommendation. Inf Process Manag 51(4):444–457

    Article  Google Scholar 

  40. Thelwall M, Buckley K, Paltoglou G (2011) Sentiment in twitter events. J Am Soc Inf Sci Technol 62(2):406–418

    Article  Google Scholar 

  41. Walther M, Kaisser M (2013) Geo-spatial event detection in the twitter stream. In: European conference on advances in information retrieval, pp 356–367

  42. Wang X, Wei F, Liu X, Zhou M, Zhang M (2011) Topic sentiment analysis in twitter: a graph-based hashtag sentiment classification approach. In: Proceedings of the 20th ACM international conference on information and knowledge management. ACM, pp 1031–1040

  43. Wang Z, Joo V, Tong C, Xin X, Chin HC (2014) Anomaly detection through enhanced sentiment analysis on social media data. In: IEEE international conference on cloud computing technology and science, pp 917–922

  44. Weiler A, Grossniklaus M, Scholl MH (2015) Run-time and task-based performance of event detection techniques for twitter. In: International conference, CAISE, pp 35–49

  45. Wu Q, Ma S, Liu Y (2015) Sub-event discovery and retrieval during natural hazards on social media data. World Wide Web-internet & Web Inf Syst, 1–21

  46. Xie W, Zhu F, Jiang J, Lim EP, Wang K (2013) Topicsketch: Real-time bursty topic detection from twitter, 837–846

  47. Yao J, Cui B, Huang Y, Zhou Y (2012) Bursty event detection from collaborative tags. World Wide Web-internet & Web Inf Syst 15(15):171–195

    Article  Google Scholar 

  48. Zhang L-M, Jia Y, Zhou B, Zhao J-H, Hong F (2013) Online bursty events detection based on emoticons. Chin J Comput 36(8):1659–1667

    Article  Google Scholar 

Download references

Acknowledgements

This paper is supported by (1) the National Natural Science Foundation of China under Grant nos. 61672179, 61370083 and 61402126, (2) Research Fund for the Doctoral Program of Higher Education of China under Grant nos. 20122304110012, (3) the Youth Science Foundation of Heilongjiang Province of China under Grant no. QC2016083, (4) Heilongjiang postdoctoral Fund no. LBH-Z14071. This paper is also supported by China Scholarship Council.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yang Jing.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xiaomei, Z., Jing, Y. & Jianpei, Z. Sentiment-based and hashtag-based Chinese online bursty event detection. Multimed Tools Appl 77, 21725–21750 (2018). https://doi.org/10.1007/s11042-017-5531-y

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-017-5531-y

Keywords

Navigation