Skip to main content

Topical Event Detection on Twitter

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9877))

Abstract

Event detection on Twitter has attracted active research. Although existing work considers the semantic topic structure of documents for event detection, the topic dynamics and the semantic consistency are under-investigated. In this paper, we study the problem of topical event detection in tweet streams. We define topical events as the bursty occurrences of semantically consistent topics. We decompose the problem of topical event detection into two components: (1) We address the issue of the semantic incoherence of the evolution of topics. We propose to improve topic modelling to filter out semantically inconsistent dynamic topics. (2) We propose to perform burst detection on the time series of dynamic topics to detect bursty occurrences. We apply our proposed techniques to the real world application by detecting topical events in public transport tweets. Experiments demonstrate that our approach can detect the newsworthy events with high success rate.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Allan, J.: Introduction to topic detection and tracking. In: Allan, J. (ed.) Topic Detection and Tracking, pp. 1–16. Springer, New York (2002)

    Chapter  Google Scholar 

  2. Allan, J., Lavrenko, V., Jin, H.: First story detection in TDT is hard. In: Proceeding 19th International Conference on Information and Knowledge Management, pp. 374–381. ACM (2000)

    Google Scholar 

  3. Araujo, L., Cuesta, J.A., Merelo, J.J.: Genetic algorithm for burst detection and activity tracking in event streams. In: Runarsson, T.P., Beyer, H.-G., Burke, E.K., Merelo-Guervós, J.J., Whitley, L.D., Yao, X. (eds.) PPSN 2006. LNCS, vol. 4193, pp. 302–311. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  4. Becker, H., Naaman, M., Gravano, L.: Beyond trending topics: real-world event identification on Twitter. ICWSM 11, 438–441 (2011)

    Google Scholar 

  5. Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. Mach. Learn. Res. 3, 993–1022 (2003)

    MATH  Google Scholar 

  6. Buntine, W.L., Mishra, S.: Experiments with non-parametric topic models. In: Proceeding of 20th ACM SIGKDD, pp. 881–890. ACM (2014)

    Google Scholar 

  7. Cordeiro, M.: Twitter event detection: combining wavelet analysis and topic inference summarization. In: Proceeding Doctoral Symposium on Informatics Engineering, DSIE, p. 123–138 (2012)

    Google Scholar 

  8. Fung, G.P.C., Yu, J.X., Yu, P.S., Lu, H.: Parameter free bursty events detection in text streams. In: Proceeding of 31st International Conference on Very Large Data Bases, pp. 181–192. VLDB Endowment (2005)

    Google Scholar 

  9. He, Q., Chang, K., Lim, E.P.: Analyzing feature trajectories for event detection. In: Proceeding of 30th ACM SIGIR, pp. 207–214. ACM (2007)

    Google Scholar 

  10. Kleinberg, J.: Bursty and hierarchical structure in streams. Data Min. Knowl. Disc. 7(4), 373–397 (2003)

    Article  MathSciNet  Google Scholar 

  11. Kullback, S., Leibler, R.A.: On information and sufficiency. Ann. Math. Stat. 22(1), 79–86 (1951)

    Article  MathSciNet  MATH  Google Scholar 

  12. Li, J., Buntine, W.: Experiments with dynamic topic models. In: Proceeding of NewsKDD workshop on Data Science for News Publishing. ACM (2014)

    Google Scholar 

  13. Metzler, D., Cai, C., Hovy, E.: Structured event retrieval over microblog archives. In: Proceeding of the North American Chapter of the Association for Computational Linguistics, pp. 646–655. Association for Computational Linguistics (2012)

    Google Scholar 

  14. Mimno, D., Wallach, H.M., Talley, E., Leenders, M., McCallum, A.: Optimizing semantic coherence in topic models. In: Proc. Empirical Methods in Natural Language Processing. pp. 262–272. Association for Computational Linguistics (2011)

    Google Scholar 

  15. Newman, D., Lau, J.H., Grieser, K., Baldwin, T.: Automatic evaluation of topic coherence. In: Proceeding of the North American Chapter of the Association for Computational Linguistics, pp. 100–108. Association for Computational Linguistics (2010)

    Google Scholar 

  16. Pan, C.C., Mitra, P.: Event detection with spatial latent dirichlet allocation. In: Proceeding of 11th ACM/IEEE International joint Conference on Digital libraries, pp. 349–358. ACM (2011)

    Google Scholar 

  17. Petrović, S., Osborne, M., Lavrenko, V.: Streaming first story detection with application to Twitter. In: Proceeding of the North American Chapter of the Association for Computational Linguistics, pp. 181–189. Association for Computational Linguistics (2010)

    Google Scholar 

  18. Sakaki, T., Okazaki, M., Matsuo, Y.: Earthquake shakes Twitter users: real-time event detection by social sensors. In: Proceeding of 19th International Conference WWW, pp. 851–860. ACM (2010)

    Google Scholar 

  19. Sankaranarayanan, J., Samet, H., Teitler, B.E., Lieberman, M.D., Sperling, J.: Twitterstand: news in tweets. In: Proceeding 17th ACM SIGSPATIAL International Conference on advances in geographic information systems, pp. 42–51. ACM (2009)

    Google Scholar 

  20. Wang, X., Grimson, E.: Spatial latent Dirichlet allocation. In: Proceeding Advances in Neural Information Processing Systems, pp. 1577–1584 (2008)

    Google Scholar 

  21. Weng, J., Lee, B.S.: Event detection in Twitter. ICWSM 11, 401–408 (2011)

    Google Scholar 

  22. Yao, L., Mimno, D., McCallum, A.: Efficient methods for topic model inference on streaming document collections. In: Proceeding of 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 937–946. ACM (2009)

    Google Scholar 

  23. Zhou, X., Chen, L.: Event detection over Twitter social media streams. Int. J. VLDB 23(3), 381–400 (2014)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lishan Cui .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Cui, L., Zhang, X., Zhou, X., Salim, F. (2016). Topical Event Detection on Twitter. In: Cheema, M., Zhang, W., Chang, L. (eds) Databases Theory and Applications. ADC 2016. Lecture Notes in Computer Science(), vol 9877. Springer, Cham. https://doi.org/10.1007/978-3-319-46922-5_20

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-46922-5_20

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-46921-8

  • Online ISBN: 978-3-319-46922-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics