Skip to main content

Prediction of Influenza-like Illness from Twitter Data and Its Comparison with Integrated Disease Surveillance Program Data

  • Conference paper
  • First Online:
Computer Networks, Big Data and IoT

Part of the book series: Lecture Notes on Data Engineering and Communications Technologies ((LNDECT,volume 66))

  • 1634 Accesses

Abstract

The social networking sites are currently assisting in delivering faster communication and they are also very useful to know about the different people’s opinions, views, and their sentiments. Twitter is one of the social networking sites, which can help to predict many health-related problems. In this work, sentiment analysis has been performed on tweets to predict the possible number of cases with H1N1 disease. The data will be collected country wise, where the tweets lie between four ranges on which the further analysis will be done. The results show the position of India based on the frequency of occurrence in the tweets as compared to the other countries. This type of disease prediction can help to take a quick decision in order to overcome the damage. The results predicted by sentiment analysis of Twitter data will then compared with the data obtained from the ‘Ministry of Health and Family Welfare-Government of India’ site. The data present at this site gives the actual number of cases occurred and collected by Indian Governments “Integrated Disease Survellience Program”. Comparison with this data will help in calculating the accuracy of the sentiment analysis approach proposed in this work.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 229.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 299.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 299.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Poecze F, Ebster C, Strauss C (2018) Social media metrics and sentiment analysis to evaluate the effectiveness of social media posts. Proc Comput Sci 130:660–666

    Article  Google Scholar 

  2. McCalman J, Bainbridge R, Brown C, Tsey K, Clarke A (2018) The aboriginal australian family wellbeing program: a historical analysis of the conditions that enabled its spread. Front Public Heal 6:26

    Article  Google Scholar 

  3. Amato PR (2010) Research on divorce: continuing trends and new developments. J Marriage Fam 72(3):650–666. https://doi.org/10.1111/j.1741-3737.2010.00723.x

    Article  Google Scholar 

  4. Sakaki T, Okazaki M, Matsuo Y (2010) Earthquake shakes Twitter users: Real-time event detection by social sensors. In: Proceedings of the 19th international conference on world wide web, WWW’10, pp 851–860. https://doi.org/10.1145/1772690.1772777

  5. Prier KW, Smith MS, Giraud-Carrier C, Hanson CL (2011) Identifying health-related topics on Twitter. In: International conference on social computing, behavioral-cultural modeling, and prediction, pp 18–25. https://doi.org/10.1007/978-3-642-19656-0_4

  6. Neiger BL, Thackeray R, Burton SH, Thackeray CR, Reese JH (2013) Use of twitter among local health departments: an analysis of information sharing, engagement, and action. J Med Internet Res 15(8):e177. https://doi.org/10.2196/jmir.2775

    Article  Google Scholar 

  7. Bechmann A, Lomborg S (2013) Dissemination of health information through social networks: Twitter and antibiotics. New Media Soc 15(5):765–781. https://doi.org/10.1016/j.ajic.2009.11.004

    Article  Google Scholar 

  8. Malik M, Habib S, Agarwal P (2018) A novel approach to web-based review analysis using opinion mining. Proc Comput Sci 132:1202–1209

    Article  Google Scholar 

  9. Freberg K, Palenchar MJ, Veil SR (2013) Managing and sharing H1N1 crisis information using social media bookmarking services. Public Relat Rev 39(3):178–184. https://doi.org/10.1016/j.pubrev.2013.02.007

    Article  Google Scholar 

  10. Jania VK, Kuma S (2015) An effective approach to track levels of influenza-A (H1N1) pandemic in India. Proc Comput Sci 70:801–807

    Article  Google Scholar 

  11. Yaqub U, Chun SA, Atluri V, Vaidya J (2017) Analysis of political discourse on twitter in the context of the 2016 US presidential elections. Gov Inf Q 34(4):613–626. https://doi.org/10.1016/j.giq.2017.11.001

    Article  Google Scholar 

  12. Leitch D, Sherif M (2017) Twitter mood, CEO succession announcements and stock returns. J Comput Sci 21:1–10

    Article  Google Scholar 

  13. Wang W, Chen L, Thirunarayan K, Sheth AP (2012) Harnessing twitter ‘big data’ for automatic emotion identification. In: Proceedings - 2012 ASE/IEEE international conference on privacy, security, risk and trust and 2012 ASE/IEEE international conference on social computing, SocialCom/PASSAT 2012, pp 587–592. https://doi.org/10.1109/SocialCom-PASSAT.2012.119

  14. Malik M, Naaz S, Ansari IR (2018) Sentiment analysis of Twitter data using big data tools and Hadoop ecosystem. In: International conference on ISMAC in computational vision and bio-engineering, pp 857–863

    Google Scholar 

  15. Chaudhary S, Naaz S (2017) Use of big data in computational epidemiology for public health surveillance. In: 2017 international conference on computing and communication technologies for smart nation, IC3TSN 2017, 2018, Oct 2017. https://doi.org/10.1109/IC3TSN.2017.8284467

  16. Bifet A, Frank E (2010) Sentiment knowledge discovery in Twitter streaming data. In: Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics), vol 6332 LNAI, pp 1–15. https://doi.org/10.1007/978-3-642-16184-1_1

  17. Phelan O, McCarthy K, Smyth B (2009) Using twitter to recommend real-time topical news. In: RecSys’09—proceedings of the 3rd ACM conference on recommender systems, pp 385–388. https://doi.org/10.1145/1639714.1639794

  18. Heppermann C (2013) Twitter: the company and its founders. ABDO

    Google Scholar 

  19. Chunara R, Andrews JR, Brownstein JS (2012) Social and news media enable estimation of epidemiological patterns early in the 2010 Haitian cholera outbreak. Am J Trop Med Hyg 86(1):39–45

    Article  Google Scholar 

  20. Lampos CNV (2010) Tracking the flu pandemic by monitoring the social web. In: 2010 2nd international workshop on cognitive information processing (CIP). IEEE Computer Society, pp 411–416

    Google Scholar 

  21. Basha PS Document based clustering for detecting events in microblogging websites

    Google Scholar 

  22. Siston AM et al (2010) Pandemic 2009 influenza A(H1N1) virus illness among pregnant women in the United States. JAMA J Am Med Assoc 303(15):1517–1525. https://doi.org/10.1001/jama.2010.479

    Article  Google Scholar 

  23. Aramaki E, Maskawa S, Morita M (2011) Twitter catches the flu: detecting influenza epidemics using Twitter. In: Proceedings of the conference on empirical methods in natural language processing, pp 1568–1576

    Google Scholar 

  24. Bosley JC et al (2013) Decoding twitter: Surveillance and trends for cardiac arrest and resuscitation communication. Resuscitation 84(2):206–212. https://doi.org/10.1016/j.resuscitation.2012.10.017

    Article  Google Scholar 

  25. Zhang L, Hall M, Bastola D (2018) Utilizing Twitter data for analysis of chemotherapy. Int J Med Inform 120:92–100. https://doi.org/10.1016/j.ijmedinf.2018.10.002

    Article  Google Scholar 

  26. Reece AG, Reagan AJ, Lix KLM, Dodds PS, Danforth CM, Langer EJ (2017) Forecasting the onset and course of mental illness with Twitter data. Sci Rep 7(1):1–11. https://doi.org/10.1038/s41598-017-12961-9

    Article  Google Scholar 

  27. Jain VK, Kumar S (2018) Effective surveillance and predictive mapping of mosquito-borne diseases using social media. J Comput Sci 25:406–415. https://doi.org/10.1016/j.jocs.2017.07.003

    Article  Google Scholar 

  28. Gohil S, Vuik S, Darzi A (2018) Sentiment analysis of health care tweets: review of the methods used. J Med Internet Res 20(4):e43. https://doi.org/10.2196/publichealth.5789

    Article  Google Scholar 

  29. Arora M, Kansal V (2019) Character level embedding with deep convolutional neural network for text normalization of unstructured data for Twitter sentiment analysis. Soc Netw Anal Min 9(1):12

    Article  Google Scholar 

  30. Hamzah FAB et al (2020) CoronaTracker: worldwide COVID-19 outbreak data analysis and prediction. Bull World Heal Org 1:32

    Google Scholar 

  31. Wang X, Gerber MS, Brown DE (2012) Automatic crime prediction using events extracted from twitter posts. In: International conference on social computing, behavioral-cultural modeling, and prediction, pp 231–238

    Google Scholar 

  32. Signorini A, Segre AM, Polgreen PM (2011) The use of Twitter to track levels of disease activity and public concern in the US during the influenza A H1N1 pandemic. PLoS ONE 6(5):e19467

    Article  Google Scholar 

  33. Zhang L, Ghosh R, Dekhil M, Hsu M, Liu B (2011) Combining lexicon-based and learning-based methods for Twitter sentiment analysis. HP Lab Tech Rep HPL-2011 89

    Google Scholar 

  34. Pennebaker JW, Booth RJ, Francis ME (2007) Linguistic inquiry and word count: LIWC [computer software], vol 135. Austin, TX liwc.net

    Google Scholar 

  35. Nielsen FÅ (2011) A new ANEW: evaluation of a word list for sentiment analysis in microblogs. arXiv:1103.2903

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sameena Naaz .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Malik, M., Naaz, S. (2021). Prediction of Influenza-like Illness from Twitter Data and Its Comparison with Integrated Disease Surveillance Program Data. In: Pandian, A., Fernando, X., Islam, S.M.S. (eds) Computer Networks, Big Data and IoT. Lecture Notes on Data Engineering and Communications Technologies, vol 66. Springer, Singapore. https://doi.org/10.1007/978-981-16-0965-7_31

Download citation

  • DOI: https://doi.org/10.1007/978-981-16-0965-7_31

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-16-0964-0

  • Online ISBN: 978-981-16-0965-7

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics