Multimedia Tools and Applications

, Volume 75, Issue 15, pp 8843–8860

Social sentiment sensor: a visualization system for topic detection and topic sentiment analysis on microblog

Article

Abstract

As a new form of social media, microblogging provides platform sharing, wherein users can share their feelings and ideas on certain topics. Bursty topics from microblogs are the results of the emerging issues that instantly attract more followers and more attention online, which provide a unique opportunity to gauge the relation between expressed public sentiment and hot topics. This paper presents a Social Sentiment Sensor (SSS) system on Sina Weibo to detect daily hot topics and analyze the sentiment distributions toward these topics. SSS includes two main techniques, namely, hot topic detection and topic-oriented sentiment analysis. Hot topic detection aims to detect the most popular topics online based on the following steps, topic detection, topic clustering, and topic popularity ranking. We extracted topics from the hashtags using a hashtag filtering model because they can cover almost all the topics. Then, we cluster the topics that describe the same issue, and rank the topic clusters via their popularity to exploit the final hot topics. Topic-oriented sentiment analysis aims to analyze public opinions toward the hot topics. After retrieving the topic-related messages, we recognize sentiment for each message using a state-of-the-art SVM (Support Vector Machine) sentiment classifier. Then, we summarize the sentiments for the hot topic to achieve topic sentiment distribution. Based on the above framework and algorithms, SSS produces a real-time visualization system to monitor social sentiments, which is offering the public a new and timely perspective on the dynamics of the social topics.

Keywords

Sentiment analysis Social media Topic detection Microblogging Opinion mining 

References

  1. Agarwal A, Xie B, Vovsha I, Rambow O, Passonneau R (2011) Sentiment analysis of twitter data. In: Proceedings of the workshop on languages in social media, LSM ’11. Association for Computational Linguistics, Stroudsburg, pp 30–38. http://dl.acm.org/citation.cfm?id=2021109.2021114
  2. Allan J (ed) (2002) Topic detection and tracking: event-based information organization. Kluwer Academic Publishers, NorwellGoogle Scholar
  3. Allan J, Carbonell J, Doddington G, Yamron J, Yang Y (1998) Topic detection and tracking pilot study: final report. In: Proceedings of the DARPA broadcast news transcription and understanding workshop. Lansdowne, pp 194–218Google Scholar
  4. Asur S, Huberman B A (2010) Predicting the future with social media. In: Proceedings of the 2010 IEEE/WIC/ACM international conference on web intelligence and intelligent agent technology - Volume 01, WI-IAT ’10. IEEE Computer Society, Washington, pp 492–499. doi:10.1109/WI-IAT.2010.63
  5. Barbosa L, Feng J (2010) Robust sentiment detection on twitter from biased and noisy data. In: Proceedings of the 23rd international conference on computational linguistics: Posters, COLING’10. Association for Computational Linguistics, Stroudsburg, pp 36–44. http://dl.acm.org/citation.cfm?id=1944566.1944571
  6. Cataldi M, Di Caro L, Schifanella C (2010) Emerging topic detection on twitter based on temporal and social terms evaluation. In: Proceedings of the 10th international workshop on multimedia data mining, MDMKDD ’10. ACM, New York, pp 4:1–4:10. doi:10.1145/1814245.1814249
  7. Ciot M, Sonderegger M, Ruths D (2013) Gender inference of twitter users in non-English contexts. In: Proceedings of the 2013 conference on empirical methods in natural language processing. Association for Computational Linguistics, Seattle, pp 1136–1145. http://www.aclweb.org/anthology/D13-1114
  8. Diao Q, Jiang J (2013) A unified model for topics, events and users on Twitter. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing. http://www.aclweb.org/anthology/D13-1192. Association for Computational Linguistics, Seattle, Washington, pp 1869–1879
  9. Diao Q, Jiang J, Zhu F, Lim E P (2012) Finding bursty topics from microblogs. In: Proceedings of the 50th annual meeting of the association for computational linguistics (Vol 1: Long Papers). Association for Computational Linguistics, Jeju Island, Korea, pp 536–544 . http://www.aclweb.org/anthology/P12-1056
  10. Gao Y, Wang F, Luan H, Chua T S (2014) Brand data gathering from live social media streams. In: Proceedings of International Conference on Multimedia Retrieval, ICMR ’14. ACM, New York, pp 169–176. doi:10.1145/2578726.2578748
  11. Gao Y, Wang M, Zha Z j, Jialie S, Li X, Wu X (2013) Learning for tag-based social image search. IEEE Trans Image Process 22 (1): 363–376MathSciNetCrossRefGoogle Scholar
  12. Johansson R, Moschitti A (2011) Extracting opinion expressions and their polarities - exploration of pipelines and joint models. In: ACL (Short Papers). The Association for Computer Linguistics, pp 101–106Google Scholar
  13. Joshi A, R, B A, Bhattacharyya P, Mohanty R K (2011) C-feel-it: a sentiment analyzer for micro-blogs. In: ACL (system demonstrations). The Association for Computer Linguistics, pp 127–132. http://dblp.uni-trier.de/db/conf/acl/acl2011d.html#JoshiABM11
  14. Kleinberg J (2002) Bursty and hierarchical structure in streams. In: Proceedings of the 8th ACM SIGKDD international conference on knowledge discovery and data mining, KDD ’02. ACM, New York, pp 91–101. doi:10.1145/775047.775061
  15. Liu B (2012) Sentiment analysis and opinion mining. Synthesis lectures on human language technologies. Morgan & ClaypoolGoogle Scholar
  16. Mohammad S, Kiritchenko S, Zhu X (2013) Nrc-canada: building the state-of-the-art in sentiment analysis of tweets. In: Proceedings of the 7th international workshop on semantic evaluation exercises (SemEval-2013). Atlanta, GeorgiaGoogle Scholar
  17. Pang B, Lee L (2008) Opinion mining and sentiment analysis. Found Trends Inf Retr 2 (1-2): 1–135. doi:10.1561/1500000011 CrossRefGoogle Scholar
  18. Pang B, Lee L, Vaithyanathan S (2002) Thumbs up?: Sentiment classification using machine learning techniques. In: Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing - EMNLP ’02, vol 10. Association for Computational Linguistics, Stroudsburg, pp 79–86. doi:10.3115/1118693.1118704
  19. Petrović S, Osborne M, Lavrenko V (2010) Streaming first story detection with application to twitter. In: Human language technologies: The 2010 annual conference of the north american chapter of the association for computational linguistics, HLT ’10. Association for Computational Linguistics, Stroudsburg, pp 181–189. http://dl.acm.org/citation.cfm?id=1857999.1858020
  20. Salakhutdinov R, Hinton G (2009) Semantic hashing. Int J Approx Reasoning 50 (7): 969–978. doi:10.1016/j.ijar.2008.11.006 CrossRefGoogle Scholar
  21. Si J, Mukherjee A, Liu B, Li Q, Li H, Deng X (2013) Exploiting topic based twitter sentiment for stock prediction. In: Proceedings of the 51st annual meeting of the association for computational linguistics (Volume 2: Short Papers). Association for Computational Linguistics, Sofia, Bulgaria, pp 24–29. http://www.aclweb.org/anthology/P13-2005
  22. Tang D, Qin B, Liu T, Li Z (2013) Learning sentence representation for emotion classification on microblogs. In: Proceedings of natural language processing and chinese computing, pp 212–223Google Scholar
  23. Weng J, Lee B S (2011) Event detection in twitter. http://www.aaai.org/ocs/index.php/ICWSM/ICWSM11/paper/view/2767
  24. Wiebe J, Wilson T, Cardie C (2005) Annotating expressions of opinions and emotions in language. Lang Resour Eval 1 (2): 0. http://www.cs.pitt.edu/~wiebe/pubs/papers/lre05withappendix.pdf Google Scholar
  25. Williams J, Katz G (2012) Extracting and modeling durations for habits and events from twitter. In: Proceedings of the 50th annual meeting of the association for computational linguistics: short papers - ACL ’12, vol 2. Association for Computational Linguistics, Stroudsburg, pp 223–227. http://dl.acm.org/citation.cfm?id=2390665.2390720
  26. Yang Y, Pierce T, Carbonell J (1998) A study of retrospective and on-line event detection. In: Proceedings of the 21st annual international ACM SIGIR conference on research and development in information retrieval, SIGIR ’98. ACM, New York, pp 28–36. doi:10.1145/290941.290953
  27. Zhao J, Dong L, Wu J, Xu K (2012) Moodlens: An emoticon-based sentiment analysis system for chinese tweets. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining, KDD ’12. ACM, New York, pp 1528–1531. doi:10.1145/2339530.2339772

Copyright information

© Springer Science+Business Media New York 2014

Authors and Affiliations

  1. 1.Department of Media Technology and ArtHarbin Institute of TechnologyHarbinChina
  2. 2.Department of Computer Science and TechnologyHarbin Institute of TechnologyHarbinChina

Personalised recommendations