Abstract
In addition to knowledge enhancement, recreation and providing chat, development of Social Network Sites leads to big data, such data can be of great value, as it shows the tendency of the members based on geographical zone, language and culture. The data can also be useful for content oriented planning. In addition, special events of society can be discovered and classified using these data. In some cases, the events have previously existed in society and are considered to be repetitive, like flood in Indian or Typhoon in Florida state, and sometimes the events are unprecedented in which cases, the new event is classified under a new class. The high cost for computations associated with event detection in real time is considered as a major challenge encountered in this context. In the present paper, a model is presented based on deep learning. In the first phase, the first class is trained based on labeled data, then unlabeled data are introduced to the model in a flow manner, and are classified into current classes based on the model through which they have been trained. The data which are higher than a specified threshold are classified into a new class, and if they are lower than the threshold, they are classified as temporary event. At the end, the effectiveness of the model will be evaluated through an available corpus as a benchmark data set. A significant improvement is shown in recall and precision over five state-of-the-art baselines.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Dou, W., Wang, X., Ribarsky, W., Zhou, M.: Event detection in social media data. In: IEEE VisWeek Workshop on Interactive Visual Text Analytics-Task Driven Analytics of Social Media Content, pp. 971–980 (2012)
Osborne, M., Dredze, M.: Facebook, Twitter and Google Plus for breaking news: is there a winner?. In: Eighth International AAAI Conference on Weblogs and Social Media (ICWSM), Michigan, USA (2014)
Atefeh, F., Khreich, W.: A survey of techniques for event detection in Twitter. J. Comput. Intell. 1(31), 132–164 (2015)
Gaglio, S., Re, G.L., Morana, M.: A framework for real-time Twitter data analysis. J. Comput. Commun. 7, 236–242 (2016)
Petkos, G., Papadopoulos, S., Aiello, L., Skraba, R., Kompatsiaris, Y.: A soft frequent pattern mining approach for textual topic detection. In: Proceedings of the 4th International Conference on Web Intelligence Mining and Semantics (WIMS 2014), Greece, pp. 25–40 (2014)
Mathioudakis, M., Koudas, N.: Twittermonitor: trend detection over the twitter stream. In: Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data, USA, pp. 1155–1158 (2010)
Boom, C.D., Canneyt, S.V., Dhoedt, B.: Semantics-driven event clustering in Twitter feeds. Making Sense Microposts 1395, 2–9 (2015)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
Aiello, L.M., et al.: Sensing trending topics in Twitter. J. IEEE Trans. Multimed. 15(6), 1268–1282 (2013)
Stilo, G., Velardi, P.: A survey on real-time event detection from the Twitter data stream. J. Data Min. Knowl. Discov. 30(2), 372–402 (2016)
Hasan, M., Orgun, M.A., Schwitter, R.: Efficient temporal mining of micro-blog texts and its application to event discovery. J. Inf. Sci. 44(4), 443–463 (2018)
Hasan, M., Orgun, M.A., Schwitter, R.: Real-time event detection from the Twitter data stream using the TwitterNews+ Framework. J. Inf. Process. Manag. 56(3), 1146–1165 (2018)
Arkaitz, Z.: A longitudinal assessment of the persistence of Twitter datasets. J. Assoc. Inf. Sci. Technol. 69(8), 974–984 (2018)
Yang, Z., Yang, D., Dyer, C., He, X., Smola, A., Hovy, E.: Hierarchical attention networks for document classification. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technology, pp. 1480–1489 (2016)
Lee, D.D., Seung, H.S.: Algorithms for non-negative matrix factorization. In: Advances in Neural Information Processing Systems, pp. 556–562 (2001)
Wold, S., Esbensen, K., Geladi, P.: Principal component analysis. J. Chemometr. Intell. Lab. Syst. 2, 37–52 (1987)
Griggs, J.R., Grinstead, C.M., Guichard, D.R.: The number of maximal independent sets in a connected graph. J. Discret. Math. 68, 211–220 (1988)
Johnson, D.S., Yannakakis, M., Papadimitriou, C.H.: On generating all maximal independent sets. J. Inf. Process. Lett. 27(3), 119–123 (1988)
Cheng, X., Huang, X., Li, D., Wu, W., Du, D.Z.: A polynomial-time approximation scheme for the minimum-connected dominating set in ad hoc wireless networks. J. Netw.: Int. J. 42(4), 202–208 (2003)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Rezaei, Z., Komleh, H.E., Eslami, B. (2019). Event Detection in Twitter Big Data by Virtual Backbone Deep Learning. In: Grandinetti, L., Mirtaheri, S., Shahbazian, R. (eds) High-Performance Computing and Big Data Analysis. TopHPC 2019. Communications in Computer and Information Science, vol 891. Springer, Cham. https://doi.org/10.1007/978-3-030-33495-6_2
Download citation
DOI: https://doi.org/10.1007/978-3-030-33495-6_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-33494-9
Online ISBN: 978-3-030-33495-6
eBook Packages: Computer ScienceComputer Science (R0)