Sentiment Analysis in Social Media

  • Georgios PaltoglouEmail author
Part of the Lecture Notes in Social Networks book series (LNSN)


Sentiment Analysis deals with the detection and analysis of affective content in written text. It utilizes methodologies, theories, and techniques from a diverse set of scientific domains, ranging from psychology and sociology to natural language processing and machine learning. In this chapter, we discuss the contributions of the field in social media analysis with a particular focus in online collective actions; as these actions are typically motivated and driven by intense emotional states (e.g., anger), sentiment analysis can provide unique insights into the inner workings of such phenomena throughout their life cycle. We also present the state of the art in the field and describe some of its contributions into understanding online collective behavior. Lastly, we discuss significant real-world datasets that have been successfully utilized in research and are available for scientific purposes and also present a diverse set of available tools for conducting sentiment analysis.


Online Community Opinion Mining Sentiment Analysis Affective Content Forum Post 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. Agarwal A, Xie B, Vovsha I, Rambow O, Passonneau R (2011) Sentiment analysis of Twitter data. In: Proceedings of LSM’11. Association for Computational Linguistics, Stroudsburg, PA, pp 30–38Google Scholar
  2. Alias-i (2008) Lingpipe 4.1.0.
  3. Baccianella S, Esuli A, Sebastiani F (2010) Sentiwordnet 3.0. In: Proceedings of LREC’10, Valletta, MaltaGoogle Scholar
  4. Barnett E (2009) Facebook fuelling divorce, research claims (21 Dec, 2009, The Telegraph). Accessed 08 Sept 2011
  5. Barrett LF, Russell JA (1999) The structure of current affect: controversies and emerging consensus. Curr Dir Psychol Sci 8(1)Google Scholar
  6. Bautin M, Vijayarenu L, Skiena S (2008) International sentiment analysis for news and blogs. In: Proceedings of ICWSM’08, Seattle, Washington, DCGoogle Scholar
  7. Bishop CM (2006) Pattern recognition and machine learning (information science and statistics). Springer, New YorkGoogle Scholar
  8. Bollen J, Mao H, Zeng XJ (2010) Twitter mood predicts the stock market. CoRR.
  9. Bollen J, Mao H, Pepe A (2011) Modeling public mood and emotion: Twitter sentiment and socio-economic phenomena. In ICWSM, Barcelona, SpainGoogle Scholar
  10. Bradley MM, Lang PJ (1999) Affective norms for English words (ANEW): instruction manual and affective ratings. The Center for Research in Psychophysiology, Gainesville, FLGoogle Scholar
  11. Brew A, Greene D, Cunningham P (2010) Using crowdsourcing and active learning to track sentiment in online media. In: Proceedings of ECAI’10, Lisbon, Portugal, pp 145–150Google Scholar
  12. Burton KNK, Soboroff I (2011) The ICWSM 2011 spinn3r dataset. ICWSM, BarcelonaGoogle Scholar
  13. Burton K, Java A, Soboroff I (2009) The ICWSM 2009 spinn3r dataset. ICWSM, San Jose, CAGoogle Scholar
  14. Cha M, Pérez JAN, Haddadi H (2009) Flash floods and ripples: the spread of media content through the blogosphere. In: Proceedings of ICWSM’11, Barcelona, SpainGoogle Scholar
  15. Chang CC, Lin CJ (2011) Libsvm: a library for support vector machines. ACM Trans Intell Syst Technol 2:27.1–27.27Google Scholar
  16. Chang R, Pimentel S, Svistunov A (2011) Sentiment analysis of occupy wall street tweets.
  17. Chen H, Zimbra D (2010) Ai and opinion mining. IEEE Intell Syst 25:74–80Google Scholar
  18. Chmiel A, Sienkiewicz J, Thelwall M, Paltoglou G, Buckley K, Kappas A (2011a) Collective emotions online and their influence on community life. PLoS ONE 6(7):e22207Google Scholar
  19. Chmiel A, Sobkowicz P, Sienkiewicz J, Paltoglou G, Buckley K, Thelwall M (2011b) Negative emotions boost user activity at BBC forum. Phys A 390(16):2936–2944CrossRefGoogle Scholar
  20. Dalgleish T, Power M (1999) Handbook of cognition and emotion. Wiley, New YorkCrossRefGoogle Scholar
  21. Diakopoulos NA, Shamma DA (2010) Characterizing debate performance via aggregated Twitter sentiment. In: Proceedings of CHI’10, Atlanta, GA, pp 1195–1198Google Scholar
  22. Ding X, Liu B, Yu PS (2008) A holistic lexicon-based approach to opinion mining. In: Proceedings of WSDM’08, Palo Alto, CA, pp 231–240Google Scholar
  23. Dodds P, Danforth C (2009) Measuring the happiness of large-scale written expression: songs, blogs, and presidents. J Happiness Stud. doi: 10.1007/s10902-009-9150-9 Google Scholar
  24. Fan RE, Chang KW, Hsieh CJ, Wang XR, Lin CJ (2008) LIBLINEAR: a library for large linear classification. J Mach Learn Res 9:1871–1874zbMATHGoogle Scholar
  25. Fouche G (2011) Nobel peace prize may recognise Arab spring. (28 Sept, 2011, Reuters). Accessed 27 Dec 2011
  26. Gazzar SE, Vitorovich L, Bender R (2011) Egypt communications cut ahead of further protests (28 Jan 2011, The Wall Street Journal). Accessed 27 Dec 2011
  27. Godbole N, Srinivasaiah M, Skiena S (2007) Large-scale sentiment analysis for news and blogs. In: Proceedings of ICWSM’07, Boulder, COGoogle Scholar
  28. Gonzalez-Bailon S, Paltoglou G (2012) The positive effects of negative emotions in online communities (under review)Google Scholar
  29. Gonzalez-Bailon S, Banchs RE, Kaltenbrunner A (2010) Emotional reactions and the pulse of public opinion: Measuring the impact of political events on the sentiment of online discussions. CoRR.
  30. Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009) The weka data mining software: an update. SIGKDD Explor Newsl 11:10–18CrossRefGoogle Scholar
  31. Halliday J, Garside J (2011) Rioting leads to cameron call for social media clampdown (11 Aug, 2011, The Guardian). Accessed 27 Dec 2011
  32. Harvey M (2010) Facebook ousts Google in us popularity (17 Mar 2010, The Sunday Times). Accessed 05 July 2010
  33. Howard P (2011) The Arab spring’s cascading effects (23 Feb 2011, Miller-McCune). Accessed 27 Dec 2011
  34. Jijkoun V, de Rijke M, Weerkamp W (2010) Generating focused topic-specific sentiment lexicons. In: Proceedings of ACL ’08, Columbus, OH, pp 585–594Google Scholar
  35. Joachims T (1999) Making large-scale SVM learning practical. In: Advances in kernel methods - support vector learning, vol 11. MIT Press, Cambridge, MAGoogle Scholar
  36. John GH, Langley P (1995) Estimating continuous distributions in Bayesian classifiers. Stanford University, Stanford, CA, pp 338–345Google Scholar
  37. Kramer AD (2010) An unobtrusive behavioral model of “gross national happiness”. In: Proceedings of CHI’10, Atlanta, GA, pp 287–290Google Scholar
  38. Le Cessie S, Van Houwelingen JC (1992) Ridge estimators in logistic regression. Appl Stat 41(1):191–201CrossRefzbMATHGoogle Scholar
  39. Manning CD, Schütze H (1999) Foundations of statistical natural language processing. MIT Press, Cambridge, MAzbMATHGoogle Scholar
  40. Mauss IB, Robinson MD (2009) Measures of emotion: a review. Cogn Emotion 23(2):209–237CrossRefGoogle Scholar
  41. McCallum AK (2002) Mallet: a machine learning for language toolkit.
  42. Miller GA (1995) Wordnet: a lexical database for English. Commun ACM 38(11):39–41CrossRefGoogle Scholar
  43. Mishne G (2005) Experiments with mood classification in blog posts. In: 1st Workshop on stylistic analysis of text for information access, Salvador, BrazilGoogle Scholar
  44. Mishne G, de Rijke M (2006) Capturing global mood levels using blog posts. In: Proceedings of AAAI-CAAW, Stanford University, Stanford, CA, pp 145–152Google Scholar
  45. Mitchell TM (1997) Machine learning, 1st edn. McGraw-Hill, New YorkGoogle Scholar
  46. Mitrović M, Paltoglou G, Tadić B (2011) Quantitative analysis of bloggers’ collective behavior powered by emotions. J Stat Mech Theory Exp 2011(02):P02005Google Scholar
  47. O’Connor B, Balasubramanyan R, Routledge BR, Smith NA (2010) From tweets to polls: linking text sentiment to public opinion time series. In: Proceedings of ICWSM’10, Washington, DCGoogle Scholar
  48. Owsley S, Sood S, Hammond KJ (2006) Domain specific affective classification of documents. In: Proceedings of AAAICAAW’06, Stanford University, Stanford, CA, pp 181–183Google Scholar
  49. Pak A, Paroubek P (2010) Twitter as a corpus for sentiment analysis and opinion mining. In: Proceedings of LREC’10, Valletta, MaltaGoogle Scholar
  50. Paltoglou G, Thelwall M (2012) Twitter, myspace, digg: unsupervised sentiment analysis in social media. ACM TIST 3(4):66.1–66.19Google Scholar
  51. Paltoglou G, Thelwall M, Buckely K (2010) Online textual communication annotated with grades of emotion strength. In: Proceedings of EMOTION, Imperial College, London, pp 25–31Google Scholar
  52. Paltoglou G, Theunis M, Kappas A, Thelwall M (2013) Predicting emotional responses to long informal text. J IEEE Trans Affect Comput 99(PrePrints):1Google Scholar
  53. Pang B, Lee L (2008) Opinion mining and sentiment analysis. Found Trends Inform Retrieval 2(1–2):1–135CrossRefGoogle Scholar
  54. Pang B, Lee L, Vaithyanathan S (2002) Thumbs up? Sentiment classification using machine learning techniques. In: Proceedings of EMNLP’02. Association for Computational Linguistics, Philadelphia, pp 79–86Google Scholar
  55. Pennebaker JW (2008) Debate 3: Mccain and Obama word usage (15 Oct 2008, WordWathcers). Accessed 01 Feb 2012
  56. Pennebaker JW, Francis ME (1999) Linguistic inquiry and word count, 1st edn. Lawrence Erlbaum, Mahwah, NJGoogle Scholar
  57. Pennebaker JW, Persaud R (2010) The 2010 UK election: the second debate (23 Apr 2010, WordWathcers). Accessed 01 Feb 2012
  58. Platt, J. C. (1999). Fast training of support vector machines using sequential minimal optimization. John C. Platt Microsoft Research 1 Microsoft Way, Redmond, WA, pp 185–208Google Scholar
  59. Quirk R, Greenbaum S, Leech G, Svartvik J (1985) A comprehensive grammar of the English language. Longman, LondonGoogle Scholar
  60. Russell JA (1980) A circumplex model of affect. J Person Soc Psychol 39(6):1161–1178CrossRefGoogle Scholar
  61. Sebastiani F (2002) Machine learning in automated text categorization. ACM Comput Surv 34(1):1–47CrossRefGoogle Scholar
  62. Strapparava C, Mihalcea R (2007) Semeval-2007 task 14: affective text. In: Proceedings of SemEval’07, Prague, Czech Republic, pp 70–74Google Scholar
  63. Strapparava C, Mihalcea R (2008) Learning to identify emotions in text. In: Proceedings of SAC’08, Fortaleza, Ceara, Brazil, pp 1556–1560Google Scholar
  64. Strapparava C, Valitutti A (2004) WordNet-Affect: an affective extension of WordNet. In: Proceedings of LREC’04 (Vol 4), Lisbon, Portugal, pp 1083–1086Google Scholar
  65. Taboada M, Brooke J, Tofiloski M, Voll K, Stede M (2001) Lexicon-based methods for sentiment analysis. Comput Linguistics 37:267–307CrossRefGoogle Scholar
  66. Thelwall M (2009) Myspace comments. Online Inform Rev 33(1):58–76CrossRefGoogle Scholar
  67. Thelwall M, Buckley K, Paltoglou G, Di C, Kappas A (2010) Sentiment strength detection in short informal text. JASIST 61(12):2544–2558CrossRefGoogle Scholar
  68. Thelwall M, Buckley K, Paltoglou G (2011) Sentiment in Twitter events. J Am Soc Inf Sci Technol 62:406–418CrossRefGoogle Scholar
  69. Thomas M, Pang B, Lee L (2006) Get out the vote: determining support or opposition from congressional floor-debate transcripts. In: Proceedings of EMNLP’06. Association for Computational Linguistics, Morristown, pp 327–335Google Scholar
  70. Tonkin E, Pfeiffer HD, Tourte G (2012) Twitter, information sharing and the London riots. Bull Am Soc Inf Sci Tech 38(2):49–57CrossRefGoogle Scholar
  71. Vapnik VN (1999) The nature of statistical learning theory (information science and statistics). Springer, HeidelbergGoogle Scholar
  72. Velikovich L, Blair-Goldensohn S, Hannan K, McDonald R (2010) The viability of web-derived polarity lexicons. In: Proceedings of HLT’10, Stroudsburg, PA, USA, pp 777–785Google Scholar
  73. Whitelaw C, Garg N, Argamon S (2005) Using appraisal groups for sentiment analysis. In: Proceedings of CIKM’05, Bremen, Germany, pp 625–631Google Scholar
  74. Wiebe J, Bruce RF, O’Hara TP (1999) Development and use of a gold-standard data set for subjectivity classifications. In: Proceedings of 37th annual meeting of ACL, College Park, MD, USA, pp 246–253Google Scholar
  75. Wilson T, Hoffmann P, Somasundaran S, Kessler J, Wiebe J, Choi Y (2005a) Opinionfinder: a system for subjectivity analysis. In: Proceedings of HLT/EMNLP-demo, Vancouver, BC, Canada, pp 34–35Google Scholar
  76. Wilson T, Wiebe J, Hoffmann P (2005b) Recognizing contextual polarity in phrase-level sentiment analysis. In: Proceedings of EMNLP’05, Vancouver, BC, Canada, pp 347–354Google Scholar
  77. Zaidan O, Eisner J, Piatko CD (2007) Using “annotator rationales” to improve machine learning for text categorization. In: Proceedings of HLT-NAACL. Association of Computational Linguistics, Rochester, NY, pp 260–267Google Scholar

Copyright information

© Springer-Verlag Wien 2014

Authors and Affiliations

  1. 1.School of TechnologyUniversity of WolverhamptonWolverhamptonUK

Personalised recommendations