Advertisement

Knowledge and Information Systems

, Volume 39, Issue 3, pp 667–702 | Cite as

Mood sensing from social media texts and its applications

  • Thin NguyenEmail author
  • Dinh Phung
  • Brett Adams
  • Svetha Venkatesh
Regular Paper

Abstract

We present a large-scale mood analysis in social media texts. We organise the paper in three parts: (1) addressing the problem of feature selection and classification of mood in blogosphere, (2) we extract global mood patterns at different level of aggregation from a large-scale data set of approximately 18 millions documents (3) and finally, we extract mood trajectory for an egocentric user and study how it can be used to detect subtle emotion signals in a user-centric manner, supporting discovery of hyper-groups of communities based on sentiment information. For mood classification, two feature sets proposed in psychology are used, showing that these features are efficient, do not require a training phase and yield classification results comparable to state of the art, supervised feature selection schemes; on mood patterns, empirical results for mood organisation in the blogosphere are provided, analogous to the structure of human emotion proposed independently in the psychology literature; and on community structure discovery, sentiment-based approach can yield useful insights into community formation.

Keywords

Mood sensing Mood classification Mood pattern  Hyper-community 

References

  1. 1.
    Adams B, Phung D, Venkatesh S (2010) Discovery of latent subcommunities in a blog’s readership. ACM Trans Web 4(3):1–30CrossRefGoogle Scholar
  2. 2.
    Backstrom L, Huttenlocher D, Kleinberg J, Lan X (2006) Group formation in large social networks: membership, growth, and evolution. In: Proceedings of the ACM international conference on knowledge discovery and data mining (SIGKDD), pp 44–54Google Scholar
  3. 3.
    Berendt B, Hanser C (2007) Tags are not metadata, but ‘just more content’-to some people. In: Proceedings of the international AAAI conference on weblogs and social media (ICWSM)Google Scholar
  4. 4.
    Blei DM, Ng AY, Jordan MI (2003) Latent dirichlet allocation. J Mach Learn Res 3:993–1022zbMATHGoogle Scholar
  5. 5.
    Bolón-Canedo V, Sánchez-Maroño N, Alonso-Betanzos A (2013) A review of feature selection methods on synthetic data. Knowl Inf Syst 34: 483–519Google Scholar
  6. 6.
    Bradley MM, Lang PJ (1999) Affective norms for English words (ANEW): instruction manual and affective ratings. University of Florida, GainesvilleGoogle Scholar
  7. 7.
    Cambria E, Hussain A, Havasi C, Eckl C, Munro J (2010) Towards crowd validation of the UK national health service. In: Proceedings of the web science conference (WebSci)Google Scholar
  8. 8.
    Fan TK, Chang CH (2010) Sentiment-oriented contextual advertising. Knowl Inf Syst 23:321–344CrossRefGoogle Scholar
  9. 9.
    Farahat AK, Ghodsi A, Kamel MS (2012) Efficient greedy feature selection for unsupervised learning. Knowl Inf Syst 1–26. doi: 10.1007/s10115-012-0538-1
  10. 10.
    Feng S, Wang D, Yu G, Gao W, Wong KF (2011) Extracting common emotions from blogs based on fine-grained sentiment clustering. Knowl Inf Syst 27:281–302CrossRefzbMATHGoogle Scholar
  11. 11.
    Frey BJ, Dueck D (2007) Clustering by passing messages between data points. Science 315:972–976CrossRefzbMATHMathSciNetGoogle Scholar
  12. 12.
    Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009) The weka data mining software: an update. ACM SIGKDD Explor Newsl 11(1):10–18CrossRefGoogle Scholar
  13. 13.
    Hayes C, Avesani P (2007) Using tags and clustering to identify topic-relevant blogs. In: Proceedings of the international AAAI conference on weblogs and social media (ICWSM)Google Scholar
  14. 14.
    Hu X, Downie JS (2007) Exploring mood metadata: relationships with genre, artist and usage metadata. In: Proceedings of the international conference on music, information retrievalGoogle Scholar
  15. 15.
    Kumar R, Novak J, Tomkins A (2006) Structure and evolution of online social networks. In: Proceedings of the ACM international conference on knowledge discovery and data mining (SIGKDD), p 617Google Scholar
  16. 16.
    Leshed G, Kaye JJ (2006) Understanding how bloggers feel: recognizing affect in blog posts. In: Proceedings of the ACM conference on human factors in computing systems (SIGCHI), p 1024Google Scholar
  17. 17.
    Long C, Zhang J, Huang M, Zhu X, Li M, Ma B (2012) Estimating feature ratings through an effective review selection approach. Knowl Inf Syst (accepted)Google Scholar
  18. 18.
    McCallum A, Wang X, Corrada-Emmanuel A (2007) Topic and role discovery in social networks with experiments on enron and academic email. J Artif Intell Res 30:249–272Google Scholar
  19. 19.
    McCallum A, Wang X, Mohanty N (2007) Joint group and topic discovery from relations and text. Lect Notes Comput Sci 4503:28CrossRefGoogle Scholar
  20. 20.
    Mishne G (2005) Experiments with mood classification in blog posts. In: Proceedings of ACM workshop on stylistic analysis of text for information accessGoogle Scholar
  21. 21.
    Mishne G, Glance N (2006) Predicting movie sales from blogger sentiment. In: Proceedings of the AAAI spring symposium on computational approaches to analysing weblogsGoogle Scholar
  22. 22.
    Mohtasseb H, Ahmed A (2012) Two-layered blogger identification model integrating profile and instance-based methods. Knowl Inf Syst 31(1):1–21CrossRefGoogle Scholar
  23. 23.
    Nallapati R, Cohen W (2008) Link-PLSA-LDA: a new unsupervised model for topics and influence of blogs. In: Proceedings of the international AAAI conference on weblogs and social media (ICWSM)Google Scholar
  24. 24.
    Negoescu RA, Adams B, Phung D, Venkatesh S, Gatica-Perez D (2009) Flickr hypergroups. In: Proceedings of the ACM international conference on multimedia, pp 813–816Google Scholar
  25. 25.
    Nguyen T, Phung D, Adams B, Tran T, Venkatesh S (2010) Classification and pattern discovery of mood in weblogs. Adv Knowl Discov Data Min 6119:283–290Google Scholar
  26. 26.
    Nguyen T, Phung D, Adams B, Tran T, Venkatesh S (2010) Hyper-community detection in the blogosphere. In: Proceeding of ACM workshop on social media, in conjunction with ACM Int Conf on Multime’d (ACM-MM). ACM, Firenze, ItalyGoogle Scholar
  27. 27.
    Nigam K, Hurst M (2004) Towards a robust metric of opinion. In: AAAI spring symposium on exploring attitude and affect in text, pp 598–603Google Scholar
  28. 28.
    Pang B, Lee L (2008) Opinion mining and sentiment analysis. Found Trends Inf Retr 2(1–2):1–135CrossRefGoogle Scholar
  29. 29.
    Pang B, Lee L, Vaithyanathan S (2002) Thumbs up? Sentiment classification using machine learning techniques. In: Proceedings of the ACL conference on empirical methods in natural language processing, pp 79–86Google Scholar
  30. 30.
    Pennebaker JW, Chung CK, Ireland M, Gonzales A, Booth RJ (2007) The development and psychometric properties of LIWC2007. LIWC, AustinGoogle Scholar
  31. 31.
    Pennebaker JW, Francis ME, Booth RJ (2007) Linguistic inquiry and word count (LIWC) [computer software]. LIWC, AustinGoogle Scholar
  32. 32.
    Russell JA (1980) A circumplex model of affect. J Pers Soc Psychol 39(6):1161–1178CrossRefGoogle Scholar
  33. 33.
    Russell JA (2003) Core affect and the psychological construction of emotion. Psychol Rev 110(1):145Google Scholar
  34. 34.
    Sebastiani F (2002) Machine learning in automated text categorization. ACM Comput Surv 34(1):1–47CrossRefGoogle Scholar
  35. 35.
    Song X, Lin CY, Tseng BL, Sun MT (2005) Modeling and predicting personal information dissemination behavior. In: Proceedings of the ACM international conference on knowledge discovery and data mining (SIGKDD), pp 479–488Google Scholar
  36. 36.
    Sood SO, Vasserman L (2009) ESSE: exploring mood on the web. In: Proceedings of the international AAAI conference on weblogs and social media (ICWSM)Google Scholar
  37. 37.
    Tausczik YR, Pennebaker JW (2010) The psychological meaning of words: LIWC and computerized text analysis methods. J Lang Soc Psychol 29(1):24CrossRefGoogle Scholar
  38. 38.
    Teh YW, Jordan MI (2010) Hierarchical bayesian nonparametric models with applications. In: Hjort N, Holmes C, Müller P, Walker S (eds) Bayesian nonparametrics: principles and practice. Cambridge University Press, CambridgeGoogle Scholar
  39. 39.
    Tsuruoka Y, Tsujii J (2005) Bidirectional inference with the easiest-first strategy for tagging sequence data. In: Proceedings of the conference on human language technology and empirical methods in natural language processing, pp 467–474Google Scholar
  40. 40.
    Yang Y, Pedersen JO (1997) A comparative study on feature selection in text categorization. In: Proceedings of the international conference on machine learning (ICML), pp 412–420Google Scholar

Copyright information

© Springer-Verlag London 2013

Authors and Affiliations

  • Thin Nguyen
    • 1
    Email author
  • Dinh Phung
    • 1
  • Brett Adams
    • 2
  • Svetha Venkatesh
    • 1
  1. 1.School of Information TechnologyDeakin UniversityGeelong, VIC 3220Australia
  2. 2.Department of ComputingCurtin UniversityPerthAustralia

Personalised recommendations