Abstract
Sina microblog is a popular microblogging service in China, which could provide perfect reference sources for flu detection due to its’ real-time characteristic and large number of active users posting about their daily life continually. In this paper, we investigate the real-time flu detection problem and propose a flu detection model with emotion factors(sentiment analysis) and sematic information (Em-Flu model). First, we extract flu-related microblog posts automatically in real-time using a trained SVM filter. For posts classification, we also adopt association rule mining to extract strongly associated features as additional features of posts to overcome the limitation of 140 words, including sentiment analysis information which can help to classify the posts without explicit flu-related features. Then Conditional Random Field model is revised and applied to detect the transition time of flu that we can find out which place is more likely for influenza outbreak and when is more likely for influenza outbreak in one city or a province in China. Experimental results on detecting flu situation during certain time in some locations show the robustness and effectiveness of the proposed model.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Sakaki, T., Okazaki, M., Matsuo, Y.: Earthquake shakes Twitter users: real-time event detection by social sensors. In: Proceedings of the 19th international conference on World wide web, pp. 851–860. ACM (2010)
Collier, N.: Uncovering text mining: a survey of current work on web-based epidemic intelligence. Global Public Health 7(7), 731–749 (2012)
Paul, M.J., Dredze, M.: You are what you Tweet: Analyzing Twitter for public health. In: ICWSM (2011)
Achrekar, H., Gandhe, A., Lazarus, R., et al.: Predicting flu trends using twitter data. In: 2011 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), pp. 702–707. IEEE (2011)
Culotta, A.: Towards detecting influenza epidemics by analyzing Twitter messages. In: Proceedings of the first workshop on social media analytics, pp. 115–122. ACM (2010)
Aramaki, E., Maskawa, S., Morita, M.: Twitter catches the flu: detecting influenza epidemics using Twitter. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 1568–1576. Association for Computational Linguistics (2011)
Lamb, A., Paul, M.J., Dredze, M.: Separating fact from fear: tracking flu infections on twitter. In: Proceedings of NAACL-HLT, pp. 789–795 (2013)
Achrekar, H.: Online Social Network Flu Tracker a Novel Sensory Approach to Predict Flu Trends. University of Massachusetts, Lowell (2012)
Aschwanden, C.: Spatial simulation model for infectious viral diseases with focus on SARS and the common Flu. In: HICSS (2004)
http://www.google.org/flutrends/, May 2015
Lazer, D., Kennedy, R., King, G., Vespignani, A.: The parable of google flu: traps in big data analysis. Science 343(6176), 1203–1205 (2014)
https://code.google.com/p/word2vec/, May 2015
Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: Proceeding of 18th International Conference on Machine Learning, pp. 282–289. Morgan Kaufmann (2001)
Cook, S., et al.: Assessing Google flu trends performance in the United States during the 2009 influenza virus A (H1N1) pandemic. PloS One 6(8), e23610 (2011)
Sun, X., Ye, J., Tang, C., et al.: The method and application of Sina microblogging big data grabbing based on Mulit-strategy. J. Hefei Univ. Technol. (Natural Science) 17(10), 1210–1215 (2014)
Martinez-Beneito, M.A., et al.: Bayesian Markov switching models for the early detection of influenza epidemics. Stat. Med. 27(22), 4455–4468 (2008)
Acknowledgment
The work is supported by National Natural Science Funds for Distinguished Young Scholar(No.61203315), This work was partially supported by JSPS KAKENHI Grant Number 15H01712. This work was supported by the Open Project Program of the National Laboratory of Pattern Recognition (NLPR).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer Science+Business Media Singapore
About this paper
Cite this paper
Sun, X., Ye, J., Ren, F. (2015). Hybrid Model Based Influenza Detection with Sentiment Analysis from Social Networks. In: Zhang, X., Sun, M., Wang, Z., Huang, X. (eds) Social Media Processing. SMP 2015. Communications in Computer and Information Science, vol 568. Springer, Singapore. https://doi.org/10.1007/978-981-10-0080-5_5
Download citation
DOI: https://doi.org/10.1007/978-981-10-0080-5_5
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-0079-9
Online ISBN: 978-981-10-0080-5
eBook Packages: Computer ScienceComputer Science (R0)