Hybrid Model Based Influenza Detection with Sentiment Analysis from Social Networks

  • Conference paper
Social Media Processing (SMP 2015)

Part of the book series: Communications in Computer and Information Science (CCIS, volume 568)

Abstract

Sina microblog is a popular microblogging service in China. Its real-time nature and its large number of active users, who post continually about their daily lives, make it a valuable data source for flu detection. In this paper, we investigate the real-time flu detection problem and propose a flu detection model that combines emotion factors (sentiment analysis) with semantic information (the Em-Flu model). First, we automatically extract flu-related microblog posts in real time using a trained SVM filter. To classify posts, we also adopt association rule mining to extract strongly associated features as additional features of each post, which helps overcome the 140-character limit; these include sentiment features that help classify posts lacking explicit flu-related terms. Then a revised Conditional Random Field model is applied to detect the transition time of flu, so that we can identify where and when an influenza outbreak is more likely within a city or province in China. Experimental results on detecting flu activity in selected locations and time periods show the robustness and effectiveness of the proposed model.
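
The feature-expansion step can be illustrated with a minimal sketch. It assumes posts are already tokenized and mines simple one-to-one rules by support and confidence. The thresholds, the toy English posts, and the helper names `mine_rules` and `expand` are hypothetical and are not taken from the paper, whose actual mining procedure is not specified in the abstract.

```python
from collections import Counter
from itertools import combinations


def mine_rules(posts, min_support=0.4, min_confidence=0.8):
    """Mine one-to-one association rules (a -> b) between tokens.

    posts: list of token lists (assumed already segmented).
    Returns a dict mapping each antecedent token to its consequents.
    """
    n = len(posts)
    item_counts = Counter()
    pair_counts = Counter()
    for tokens in posts:
        unique = sorted(set(tokens))  # count each token once per post
        item_counts.update(unique)
        pair_counts.update(combinations(unique, 2))

    rules = {}
    for (a, b), count in pair_counts.items():
        if count / n < min_support:  # pair is too rare to trust
            continue
        # Test the rule in both directions: a -> b and b -> a.
        for x, y in ((a, b), (b, a)):
            if count / item_counts[x] >= min_confidence:
                rules.setdefault(x, set()).add(y)
    return rules


def expand(tokens, rules):
    """Add strongly associated tokens to a short post's feature set."""
    expanded = set(tokens)
    for token in tokens:
        expanded |= rules.get(token, set())
    return expanded


# Toy corpus (hypothetical); real input would be segmented microblog posts.
posts = [
    ["fever", "cough", "flu"],
    ["fever", "headache", "flu"],
    ["fever", "flu", "miserable"],
    ["cough", "movie"],
    ["movie", "happy"],
]
rules = mine_rules(posts)
features = expand(["fever", "tired"], rules)
```

Here a post mentioning only "fever" gains "flu" as an extra feature, which is the kind of signal a downstream classifier could use when a short post carries no explicit flu-related term.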

Notes

  1. http://www.cdc.gov/flu/about/disease/spread.htm

Acknowledgment

This work is supported by the National Natural Science Funds for Distinguished Young Scholar (No. 61203315), partially supported by JSPS KAKENHI Grant Number 15H01712, and supported by the Open Project Program of the National Laboratory of Pattern Recognition (NLPR).

Author information

Corresponding author

Correspondence to Xiao Sun.

Copyright information

© 2015 Springer Science+Business Media Singapore

About this paper

Cite this paper

Sun, X., Ye, J., Ren, F. (2015). Hybrid Model Based Influenza Detection with Sentiment Analysis from Social Networks. In: Zhang, X., Sun, M., Wang, Z., Huang, X. (eds) Social Media Processing. SMP 2015. Communications in Computer and Information Science, vol 568. Springer, Singapore. https://doi.org/10.1007/978-981-10-0080-5_5

  • DOI: https://doi.org/10.1007/978-981-10-0080-5_5

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-0079-9

  • Online ISBN: 978-981-10-0080-5

  • eBook Packages: Computer Science, Computer Science (R0)
