Skip to main content

Opinion Mining on the Web 2.0 – Characteristics of User Generated Content and Their Impacts

  • Conference paper
Human-Computer Interaction and Knowledge Discovery in Complex, Unstructured, Big Data (HCI-KDD 2013)

Abstract

The field of opinion mining provides a multitude of methods and techniques to be utilized to find, extract and analyze subjective information, such as the one found on social media channels. Because of the differences between these channels as well as their unique characteristics, not all approaches are suitable for each source; there is no “one-size-fits-all” approach. This paper aims at identifying and determining these differences and characteristics by performing an empirical analysis as a basis for a discussion which opinion mining approach seems to be applicable to which social media channel.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Liu, B.: Sentiment analysis and opinion mining. Morgan & Claypool, San Rafael (2012)

    Google Scholar 

  2. Pang, B., Lee, L.: Opinion Mining and Sentiment Analysis. Foundations and Trends in Information Retrieval 2(1-2), 1–135 (2008)

    Article  Google Scholar 

  3. Abbasi, A., Chen, H., Salem, A.: Sentiment analysis in multiple languages: Feature selection for opinion classification in Web forums. ACM Transactions on Information Systems (TOIS) 26(3), 12–34 (2008)

    Article  Google Scholar 

  4. Dey, L., Haque, S.M.: Opinion mining from noisy text data. International Journal on Document Analysis and Recognition (IJDAR) 12(3), 205–226 (2009)

    Article  Google Scholar 

  5. Maynard, D., Bontcheva, K., Rout, D.: Challenges in developing opinion mining tools for social media. In: Proceedings of @NLP can u tag #user_generated_content?! Workshop at LREC 2012 (2012)

    Google Scholar 

  6. Petz, G., Karpowicz, M., Fürschuß, H., Auinger, A., Winkler, S.M., Schaller, S., Holzinger, A.: On Text Preprocessing for Opinion Mining Outside of Laboratory Environments. In: Huang, R., Ghorbani, A.A., Pasi, G., Yamaguchi, T., Yen, N.Y., Jin, B. (eds.) AMT 2012. LNCS, vol. 7669, pp. 618–629. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  7. Yi, L., Liu, B.: Web page cleaning for web mining through feature weighting. In: Proceedings of the 18th International Joint Conference on Artificial Intelligence, pp. 43–48. Morgan Kaufmann Publishers Inc., San Francisco (2003)

    Google Scholar 

  8. Kaiser, C.: Opinion Mining im Web 2.0 – Konzept und Fallbeispiel. HMD - Praxis der Wirtschaftsinformatik 46(268), 90–99 (2009)

    Google Scholar 

  9. Hu, M., Liu, B.: Mining Opinion Features in Customer Reviews. In: Proceedings of AAAI, pp. 755–760 (2004)

    Google Scholar 

  10. Liu, B., Hu, M., Cheng, J.: Opinion observer: analyzing and comparing opinions on the Web. In: Proceedings of the 14th International Conference on World Wide Web, New York, NY, USA, pp. 342–351 (2005)

    Google Scholar 

  11. Jindal, N., Liu, B.: Identifying Comparative Sentences in Text Documents. In: Dumas, S. (ed.) Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 244–251. Association for Computing Machinery, New York (2006)

    Google Scholar 

  12. Zhang, T.: Fundamental Statistical Techniques. In: Indurkhya, N., Damerau, F.J. (eds.) Handbook of Natural Language Processing, 2nd edn., pp. 189–204. Chapman & Hall/CRC, Boca Raton (2010)

    Google Scholar 

  13. Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up? Sentiment Classification Using Machine Learning Techniques. In: Proceedings of the ACL-2002 Conference on Empirical Methods in Natural Language Processing, pp. 79–86 (2002)

    Google Scholar 

  14. Liu, B.: Web data mining. Exploring hyperlinks, contents, and usage data, Corr. 2. print. Data-centric systems and applications. Springer, Berlin (2008)

    Google Scholar 

  15. Dasgupta, S., Ng, V.: Mine the Easy, Classify the Hard: A Semi-Supervised Approach to Automatic Sentiment Classification. In: Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, vol. 2, pp. 701–709 (2009)

    Google Scholar 

  16. Wong, T.-L., Bing, L., Lam, W.: Normalizing Web Product Attributes and Discovering Domain Ontology with Minimal Effort. In: Proceedings of the Fourth ACM International Conference on Web Search and Data Mining, pp. 805–814 (2011)

    Google Scholar 

  17. Choi, Y., Cardie, C.: Hierarchical Sequential Learning for Extracting Opinions and their Attributes. In: Proceedings of the ACL 2010 Conference Short Papers, pp. 269–274 (2010)

    Google Scholar 

  18. Guo, H., Zhu, H., Guo, Z., et al.: Domain Customization for Aspect-oriented Opinion Analysis with Multi-level Latent Sentiment Clues. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management, pp. 2493–2496 (2011)

    Google Scholar 

  19. Holzinger, A., Simonic, K.-M., Yildirim, P.: Disease-disease relationships for rheumatic diseases. Web-based biomedical textmining and knowledge discovery to assist medical decision making. In: IEEE 36th International Conference on Computer Software and Applications, pp. 573–580 (2012)

    Google Scholar 

  20. Cui, H., Mittal, V., Datar, M.: Comparative Experiments on Sentiment Classification for Online Product Reviews. In: Proceedings of AAAI-2006, pp. 1265–1270 (2006)

    Google Scholar 

  21. Chaovalit, P., Zhou, L.: Movie Review Mining: A Comparison Between Supervised and Unsupervised Classification Approaches. In: Proceedings of the 38th Annual Hawaii International Conference on System Sciences, pp. 112–121 (2005)

    Google Scholar 

  22. Moghaddam, S., Ester, M.: On the Design of LDA Models for Aspect-based Opinion Mining. In: Proceedings of the 21st ACM International Conference on Information and Knowledge Management, pp. 803–812 (2012)

    Google Scholar 

  23. Mishne, G., Glance, N.S.: Predicting Movie Sales from Blogger Sentiment. In: Proceedings of the 21st National Conference on Artificial Intelligence, pp. 11–14. AAAI Press, Boston (2006)

    Google Scholar 

  24. Sik Kim, Y., Lee, K., Ryu, J.-H.: Algorithm for Extrapolating Blogger’s Interests through Library Classification Systems. In: Proceedings of the IEEE International Conference on Web Services, Beijing, China, September 23-26, pp. 481–488. IEEE (2008)

    Google Scholar 

  25. Liu, Y., Huang, X., An, A., et al.: ARSA: A Sentiment-Aware Model for Predicting Sales Performance Using Blogs. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 607–614. ACM Press, New York (2007)

    Chapter  Google Scholar 

  26. Sadikov, E., Parameswaran, A., Venetis, P.: Blogs as Predictors of Movie Success. In: Proceedings of the Third International Conference on Weblogs and Social Media, pp. 304–307 (2009)

    Google Scholar 

  27. Liu, F., Wang, D., Li, B., et al.: Improving Blog Polarity Classification via Topic Analysis and Adaptive Methods. In: Proceedings of Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the ACL, pp. 309–312 (2010)

    Google Scholar 

  28. Liu, F., Li, B., Liu, Y.: Finding Opinionated Blogs Using Statistical Classifiers and Lexical Features. In: Proceedings of the Third International ICWSM Conference, pp. 254–257 (2009)

    Google Scholar 

  29. Chmiel, A., Sobkowicz, P., Sienkiewicz, J., et al.: Negative emotions boost user activity at BBC forum. Physica A 390(16), 2936–2944 (2011)

    Article  Google Scholar 

  30. Softic, S., Hausenblas, M.: Towards opinion mining through tracing discussions on the web. In: Social Data on the Web Workshop at the 7th International Semantic Web Conference (2008)

    Google Scholar 

  31. Go, A., Bhayani, R., Huang, L.: Twitter Sentiment Classification using Distant Supervision. CS224N Project Report, Stanford (2009)

    Google Scholar 

  32. Pak, A., Paroubek, P.: Twitter as a Corpus for Sentiment Analysis and Opinion Mining. In: Proceegins of the Seventh Conference on International Language Resources and Evaluation (LREC), Valletta, Malta, pp. 1320–1326 (2010)

    Google Scholar 

  33. Barbosa, L., Feng, J.: Robust Sentiment Detection on Twitter from Biased and Noisy Data. In: Proceedings of the 23rd International Conference on Computational Linguistics. Posters, pp. 36–44 (2010)

    Google Scholar 

  34. Bollen, J., Mao, H., Zeng, X.: Twitter mood predicts the stockmarket. Journal of Computational Science 2(1), 1–8 (2011)

    Article  Google Scholar 

  35. Derczynski, L., Maynard, D., Aswani, N., et al.: Microblog-Genre Noise and Impact on Semantic Annotation Accuracy. In: 24th ACM Conference on Hypertext and Social Media (2013)

    Google Scholar 

  36. Thelwall, M., Wilkinson, D., Uppal, S.: Data Mining Emotion in Social Network Communication: Gender differences in MySpace. Journal of the American Society for Information Science and Technology 61(1), 190–199 (2010)

    Article  Google Scholar 

  37. Bermingham, A., Conway, M., McInerney, L., et al.: Combining Social Network Analysis and Sentiment Analysis to Explore the Potential for Online Radicalisation. In: International Conference on Advances in Social Network Analysis and Mining, pp. 231–236 (2009)

    Google Scholar 

  38. Titov, I., McDonald, R.: A Joint Model of Text and Aspect Ratings for Sentiment Summarization. In: Proceedings of ACL-2008: HLT, pp. 308–316 (2008)

    Google Scholar 

  39. Titov, I., McDonald, R.: Modeling Online Reviews with Multi-grain Topic Models. In: Proceedings of the 17th International Conference on World Wide Web, pp. 111–120 (2008)

    Google Scholar 

  40. Zhao, W.X., Jiang, J., Yan, H., et al.: Jointly Modeling Aspects and Opinions with a MaxEnt-LDA Hybrid. In: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp. 56–65 (2010)

    Google Scholar 

  41. Brody, S., Elhadad, N.: An Unsupervised Aspect-Sentiment Model for Online Reviews. In: Proceedings of HLT 2010 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp. 804–812 (2010)

    Google Scholar 

  42. Jo, Y., Oh, A.: Aspect and Sentiment Unification Model for Online Review Analysis. In: Proceedings of the fourth ACM International Conference on Web Search and Data Mining, pp. 815–824 (2011)

    Google Scholar 

  43. Backhaus, K., Erichson, B., Plinke, W., et al.: Multivariate Analysemethoden, 12th edn. Eine anwendungsorientierte Einführung. Springer, Berlin (2008)

    MATH  Google Scholar 

  44. Moilanen, K., Pulman, S.: Sentiment Composition. In: Proceedings of the Recent Advances in Natural Language Processing International Conference, pp. 378–382 (2007)

    Google Scholar 

  45. Sayeed, A.B.: A Distributional and Syntactic Approach to Fine-Grained Opinion Mining. Dissertation, University of Maryland (2011)

    Google Scholar 

  46. Davidov, D., Tsur, O., Rappoport, A.: Enhanced sentiment learning using Twitter hashtags and smileys. In: Proceedings of the 23rd International Conference on Computational Linguistics. Posters, pp. 241–249 (2010)

    Google Scholar 

  47. Zhang, L., Ghosh, R., Dekhil, M., et al.: Combining Lexicon-based and Learning-based Methods for Twitter Sentiment Analysis. Technical Report HPL-2011-89 (2011)

    Google Scholar 

  48. Kouloumpis, E., Wilson, T., Moore, J.: Twitter Sentiment Analysis: The Good the Bad and the OMG? In: Proceedings of the Fifth International AAAI Conference on Weblogs and Social Media, pp. 538–541 (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Petz, G., Karpowicz, M., Fürschuß, H., Auinger, A., Stříteský, V., Holzinger, A. (2013). Opinion Mining on the Web 2.0 – Characteristics of User Generated Content and Their Impacts. In: Holzinger, A., Pasi, G. (eds) Human-Computer Interaction and Knowledge Discovery in Complex, Unstructured, Big Data. HCI-KDD 2013. Lecture Notes in Computer Science, vol 7947. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39146-0_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-39146-0_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-39145-3

  • Online ISBN: 978-3-642-39146-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics