Skip to main content
Log in

The empirical study of tweet classification system for disaster response using shallow and deep learning models

  • Original Research
  • Published:
Journal of Ambient Intelligence and Humanized Computing Aims and scope Submit manuscript

Abstract

Disaster-based tweets during an emergency consist of a variety of information on people who have been hurt or killed, people who are lost or discovered, infrastructure and utilities destroyed; this information can assist governmental and humanitarian organizations in prioritizing their aid and rescue efforts. It is crucial to build a model that can categorize these tweets into distinct types due to their massive volume so as to better organize rescue and relief effort and save lives. In this study, Twitter data of 2013 Queensland flood and 2015 Nepal earthquake has been classified as disaster or non-disaster by employing three classes of models. The first model is performed using the lexical feature based on Term Frequency-Inverse Document Frequency (TF-IDF). The classification was performed using five classification algorithms such as DT, LR, SVM, RF, while Ensemble Voting was used to produce the outcome of the models. The second model uses shallow classifiers in conjunction with several features, including lexical (TF-IDF), hashtag, POS, and GloVe embedding. The third set of the model utilized deep learning algorithms including LSTM, LSTM, and GRU, using BERT (Bidirectional Encoder Representations from Transformers) for constructing semantic word embedding to learn the context. The key performance evaluation metrics such as accuracy, F1 score, recall, and precision were employed to measure and compare the three sets of models for disaster response classification on two publicly available Twitter datasets. By performing a comprehensive empirical evaluation of the tweet classification technique across different disaster kinds, the predictive performance shows that the best accuracy was achieved with DT algorithm which attained the highest performance accuracy followed by Bi-LSTM models for disaster response classification by attaining the best accuracy of 96.46% and 96.40% on the Queensland flood dataset; DT algorithm also attained 78.3% accuracy on the Nepal earthquake dataset based on the majority-voting ensemble respectively. Thus, this research contributes by investigating the integration of deep and shallow learning models effectively in a tweet classification system designed for disaster response. Examining the ways that these two methods work seamlessly offers insights into how to best utilize their complimentary advantages to increase the robustness and accuracy of locating suitable data in disaster crisis.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Data availability

The data that support the findings of this study are available upon request.

References

  • Alam F, Joty S, Imran M (2018) Domain adaptation with adversarial training and graph embeddings. arXiv preprint arXiv:1805.05151

  • Alqaraleh S (2020) Classification of turkish text using machine learning: a case study using disasters tweets. International Journal of Scientific & Technology Research 9(3):4953–4956

    Google Scholar 

  • Ashish V (2017) Attention is all you need. Adv Neural Inf Process Syst 30:1

    Google Scholar 

  • Burel G, Alani H (2018) Crisis event extraction service (crees)-automatic detection and classification of crisis-related content on social media. In: Proceedings of the 15th ISCRAM Conference – Rochester, NY, USA May 2018

  • Chen Y, Xu L, Liu K, et al (2015) Event extraction via dynamic multi-pooling convolutional neural networks. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp 167–176

  • Cho SE, Jung K, Park HW (2013) Social media use during japan’s 2011 earthquake: how twitter transforms the locus of crisis communication. Media International Australia 149(1):28–40

    Article  Google Scholar 

  • Eke C, Norman AA, Shuib L, et al (2022) Random forest-based classifier for automatic sarcasm classification on twitter data using multiple features. Journal of Information Systems and Digital Technologies 4(2)

  • Eke CI, Norman AA, Shuib L et al (2019) A survey of user profiling: State-of-the-art, challenges, and solutions. IEEE Access 7:144907–144924

    Article  Google Scholar 

  • Eke CI, Norman AA, Shuib L (2021) Context-based feature technique for sarcasm identification in benchmark datasets using deep learning and bert model. IEEE Access 9:48501–48518

    Article  Google Scholar 

  • Eke CI, Norman AA, Shuib L (2021) Multi-feature fusion framework for sarcasm identification on twitter data: A machine learning based approach. PLoS ONE 16(6):e0252918

    Article  Google Scholar 

  • Gao H, Barbier G, Goolsby R (2011) Harnessing the crowdsourcing power of social media for disaster relief. IEEE Intell Syst 26(3):10–14

    Article  Google Scholar 

  • George A, Barathi Ganesh H, Anand Kumar M, et al (2019) Significance of global vectors representation in protein sequences analysis. In: Computer Aided Intervention and Diagnostics in Clinical and Medical Images, Springer, pp 261–269

  • Graves A, Schmidhuber J (2005) Framewise phoneme classification with bidirectional lstm and other neural network architectures. Neural Netw 18(5–6):602–610

    Article  Google Scholar 

  • Graves A, Jaitly N, Mohamed Ar (2013) Hybrid speech recognition with deep bidirectional lstm. In: 2013 IEEE workshop on automatic speech recognition and understanding, IEEE, pp 273–278

  • Imran M, Castillo C, Diaz F et al (2015) Processing social media messages in mass emergency: A survey. ACM Computing Surveys (CSUR) 47(4):1–38

    Article  Google Scholar 

  • Kumar A, Singh JP (2019) Location reference identification from tweets during emergencies: A deep learning approach. International journal of disaster risk reduction 33:365–375

    Article  Google Scholar 

  • Kumar A, Singh JP, Dwivedi YK, et al (2022) A deep multi-modal neural network for informative twitter content classification during emergencies. Annals of Operations Research pp 1–32

  • Lamsal R, Kumar TV (2023) Twitter-based disaster response using recurrent nets. In: Research Anthology on Managing Crisis and Risk Communications. IGI Global, p 613–632

  • Nguyen D, Al Mannai KA, Joty S, et al (2017) Robust classification of crisis-related data on social networks using convolutional neural networks. In: Proceedings of the international AAAI conference on web and social media, pp 632–635

  • Nguyen TH, Grishman R (2015) Event detection and domain adaptation with convolutional neural networks. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), pp 365–371

  • Pennington J, Socher R, Manning CD (2014) Glove: Global vectors for word representation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pp 1532–1543

  • Powers DM (2020) Evaluation: from precision, recall and f-measure to roc, informedness, markedness and correlation. arXiv preprint arXiv:2010.16061

  • Prasetyo PK, Gao M, Lim EP, et al (2013) Social sensing for urban crisis management: The case of singapore haze. In: Social Informatics: 5th International Conference, SocInfo 2013, Kyoto, Japan, November 25-27, 2013, Proceedings 5, Springer, pp 478–491

  • Qu Y, Huang C, Zhang P, et al (2011) Microblogging after a major disaster in china: a case study of the 2010 yushu earthquake. In: Proceedings of the ACM 2011 conference on Computer supported cooperative work, pp 25–34

  • Ragini JR, Anand PR (2016) An empirical analysis and classification of crisis related tweets. In: 2016 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC), IEEE, pp 1–4

  • Rashid NM, Shuib L, Eke CI, et al (2021) Detecting covid-19 in chest x-ray images with convolutional neural network. In: 2021 International Conference on Computer Science and Engineering (IC2SE), IEEE, pp 1–10

  • Salehinejad H, Valaee S, Dowdell T, et al (2018) Generalization of deep neural networks for chest pathology classification in x-rays using generative adversarial networks. In: 2018 IEEE international conference on acoustics, speech and signal processing (ICASSP), IEEE, pp 990–994

  • Starbird K, Palen L, Hughes AL, et al (2010) Chatter on the red: what hazards threat reveals about the social life of microblogged information. In: Proceedings of the 2010 ACM conference on Computer supported cooperative work, pp 241–250

  • Tapia AH, Moore K (2014) Good enough is good enough: Overcoming disaster response organizations’ slow social media data adoption. Computer supported cooperative work (CSCW) 23:483–512

    Article  Google Scholar 

  • Tay Y, Tuan LA, Hui SC, et al (2018) Reasoning with sarcasm by reading in-between. arXiv preprint arXiv:1805.02856

  • Thomson R, Ito N, Suda H, et al (2012) Trusting tweets: The fukushima disaster and information source credibility on twitter. In: Iscram

  • Vieweg S (2012) Twitter communications in mass emergency: contributions to situational awareness. In: Proceedings of the ACM 2012 conference on computer supported cooperative work companion, pp 227–230

  • Vieweg S, Hughes AL, Starbird K, et al (2010) Microblogging during two natural hazards events: what twitter may contribute to situational awareness. In: Proceedings of the SIGCHI conference on human factors in computing systems, pp 1079–1088

  • Yang Z, Nguyen LH, Stuve J, et al (2017) Harvey flooding rescue in social media. In: 2017 IEEE International Conference on Big Data (Big Data), IEEE, pp 2177–2185

  • Yang Z, Nguyen L, Zhu J, et al (2020) Coordinating disaster emergency response with heuristic reinforcement learning. In: 2020 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), IEEE, pp 565–572

  • Young T, Hazarika D, Poria S et al (2018) Recent trends in deep learning based natural language processing. IEEE Comput Intell Mag 13(3):55–75

    Article  Google Scholar 

  • Yu S, Su J, Luo D (2019) Improving bert-based text classification with auxiliary sentence and domain knowledge. IEEE Access 7:176600–176612

    Article  Google Scholar 

  • Zen H, Agiomyrgiannakis Y, Egberts N, et al (2016) Fast, compact, and high quality lstm-rnn based statistical parametric speech synthesizers for mobile devices. arXiv preprint arXiv:1606.06061

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Christopher Ifeanyi Eke.

Ethics declarations

Conflict of interest

Authors declare no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix A

Appendix A

See Tables 6, 7, 8, 9, 10, 11.

Table 6 Different LSTM accuracy performance on the 2015 Nepal Earthquake dataset with varying parameters
Table 7 Different LSTM accuracy performance on the 2013 Queensland flood dataset with varying parameters
Table 8 Different Bi-LSTM accuracy performance on the 2015 Nepal Earthquake dataset with varying parameters
Table 9 Different Bi-LSTM accuracy performance on the 2013 Queensland flood data with varying parameters
Table 10 Different GRU accuracy performance on the 2015 Nepal Earthquake with varying parameters
Table 11 Different GRU accuracy performance on the 2013 Queensland flood with varying parameters

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Maswadi, K., Alhazmi, A., Alshanketi, F. et al. The empirical study of tweet classification system for disaster response using shallow and deep learning models. J Ambient Intell Human Comput (2024). https://doi.org/10.1007/s12652-024-04807-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s12652-024-04807-w

Keywords

Navigation