Skip to main content

Event Detection in Twitter Big Data by Virtual Backbone Deep Learning

  • Conference paper
  • First Online:
High-Performance Computing and Big Data Analysis (TopHPC 2019)

Abstract

In addition to knowledge enhancement, recreation and providing chat, development of Social Network Sites leads to big data, such data can be of great value, as it shows the tendency of the members based on geographical zone, language and culture. The data can also be useful for content oriented planning. In addition, special events of society can be discovered and classified using these data. In some cases, the events have previously existed in society and are considered to be repetitive, like flood in Indian or Typhoon in Florida state, and sometimes the events are unprecedented in which cases, the new event is classified under a new class. The high cost for computations associated with event detection in real time is considered as a major challenge encountered in this context. In the present paper, a model is presented based on deep learning. In the first phase, the first class is trained based on labeled data, then unlabeled data are introduced to the model in a flow manner, and are classified into current classes based on the model through which they have been trained. The data which are higher than a specified threshold are classified into a new class, and if they are lower than the threshold, they are classified as temporary event. At the end, the effectiveness of the model will be evaluated through an available corpus as a benchmark data set. A significant improvement is shown in recall and precision over five state-of-the-art baselines.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Dou, W., Wang, X., Ribarsky, W., Zhou, M.: Event detection in social media data. In: IEEE VisWeek Workshop on Interactive Visual Text Analytics-Task Driven Analytics of Social Media Content, pp. 971–980 (2012)

    Google Scholar 

  2. Osborne, M., Dredze, M.: Facebook, Twitter and Google Plus for breaking news: is there a winner?. In: Eighth International AAAI Conference on Weblogs and Social Media (ICWSM), Michigan, USA (2014)

    Google Scholar 

  3. Atefeh, F., Khreich, W.: A survey of techniques for event detection in Twitter. J. Comput. Intell. 1(31), 132–164 (2015)

    Article  MathSciNet  Google Scholar 

  4. Gaglio, S., Re, G.L., Morana, M.: A framework for real-time Twitter data analysis. J. Comput. Commun. 7, 236–242 (2016)

    Article  Google Scholar 

  5. Petkos, G., Papadopoulos, S., Aiello, L., Skraba, R., Kompatsiaris, Y.: A soft frequent pattern mining approach for textual topic detection. In: Proceedings of the 4th International Conference on Web Intelligence Mining and Semantics (WIMS 2014), Greece, pp. 25–40 (2014)

    Google Scholar 

  6. Mathioudakis, M., Koudas, N.: Twittermonitor: trend detection over the twitter stream. In: Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data, USA, pp. 1155–1158 (2010)

    Google Scholar 

  7. Boom, C.D., Canneyt, S.V., Dhoedt, B.: Semantics-driven event clustering in Twitter feeds. Making Sense Microposts 1395, 2–9 (2015)

    Google Scholar 

  8. Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)

    MATH  Google Scholar 

  9. Aiello, L.M., et al.: Sensing trending topics in Twitter. J. IEEE Trans. Multimed. 15(6), 1268–1282 (2013)

    Article  Google Scholar 

  10. Stilo, G., Velardi, P.: A survey on real-time event detection from the Twitter data stream. J. Data Min. Knowl. Discov. 30(2), 372–402 (2016)

    Article  Google Scholar 

  11. Hasan, M., Orgun, M.A., Schwitter, R.: Efficient temporal mining of micro-blog texts and its application to event discovery. J. Inf. Sci. 44(4), 443–463 (2018)

    Article  Google Scholar 

  12. Hasan, M., Orgun, M.A., Schwitter, R.: Real-time event detection from the Twitter data stream using the TwitterNews+ Framework. J. Inf. Process. Manag. 56(3), 1146–1165 (2018)

    Article  Google Scholar 

  13. Arkaitz, Z.: A longitudinal assessment of the persistence of Twitter datasets. J. Assoc. Inf. Sci. Technol. 69(8), 974–984 (2018)

    Article  Google Scholar 

  14. Yang, Z., Yang, D., Dyer, C., He, X., Smola, A., Hovy, E.: Hierarchical attention networks for document classification. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technology, pp. 1480–1489 (2016)

    Google Scholar 

  15. Lee, D.D., Seung, H.S.: Algorithms for non-negative matrix factorization. In: Advances in Neural Information Processing Systems, pp. 556–562 (2001)

    Google Scholar 

  16. Wold, S., Esbensen, K., Geladi, P.: Principal component analysis. J. Chemometr. Intell. Lab. Syst. 2, 37–52 (1987)

    Article  Google Scholar 

  17. Griggs, J.R., Grinstead, C.M., Guichard, D.R.: The number of maximal independent sets in a connected graph. J. Discret. Math. 68, 211–220 (1988)

    Article  MathSciNet  Google Scholar 

  18. Johnson, D.S., Yannakakis, M., Papadimitriou, C.H.: On generating all maximal independent sets. J. Inf. Process. Lett. 27(3), 119–123 (1988)

    Article  MathSciNet  Google Scholar 

  19. Cheng, X., Huang, X., Li, D., Wu, W., Du, D.Z.: A polynomial-time approximation scheme for the minimum-connected dominating set in ad hoc wireless networks. J. Netw.: Int. J. 42(4), 202–208 (2003)

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hossein Ebrahimpour Komleh .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Rezaei, Z., Komleh, H.E., Eslami, B. (2019). Event Detection in Twitter Big Data by Virtual Backbone Deep Learning. In: Grandinetti, L., Mirtaheri, S., Shahbazian, R. (eds) High-Performance Computing and Big Data Analysis. TopHPC 2019. Communications in Computer and Information Science, vol 891. Springer, Cham. https://doi.org/10.1007/978-3-030-33495-6_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-33495-6_2

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-33494-9

  • Online ISBN: 978-3-030-33495-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics