Polarity Lexicon for the Polish Language: Design and Extension with Random Walk Algorithm

  • Conference paper

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 240)

Abstract

Sentiment analysis aims to automatically assign to a portion of text a value expressing an emotional attitude towards its content. Of the numerous efficient methods for investigating sentiment, the authors opted for the lexicon-based approach. A necessary prerequisite for adopting it is the availability of specific lexical resources for the investigated language. While substantial, readily accessible polarity resources exist for English, those for Polish are meagre and, to the best of our knowledge, none of them can fully support sentiment analysis. Accordingly, the main objective of the presented work is to fill this gap in academic research by creating, in an automated manner, a polarity lexical resource for the Polish language. In this paper, we present the motivation for the study and the key mechanisms underlying the development of the polarity lexicon, elucidate the linguistic phenomena to be reckoned with in the process, and discuss the random walk algorithm used to extend the obtained polarity resources. Finally, the results of the conducted experiments and the newly compiled polarity lexicon are presented.
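The abstract does not spell out the random walk variant used, so the following is only a minimal sketch of the general idea of extending a seed polarity lexicon over a word-relation graph. It assumes a small undirected synonym graph and scores each unlabelled word by the probability that a random walk started from it reaches a positive seed before a negative one. The function name `extend_polarity`, the toy graph, and the seed words are all hypothetical and are not taken from the paper.

```python
def extend_polarity(graph, positive_seeds, negative_seeds, iterations=200):
    """Score words in [-1, 1] from seed polarities (illustrative sketch only).

    graph: dict mapping each word to a list of neighbouring words
           (assumed undirected; every neighbour is also a key).
    The score is 2 * p - 1, where p is the probability that a random walk
    starting at the word hits a positive seed before a negative seed.
    """
    seeds = set(positive_seeds) | set(negative_seeds)
    # Start unlabelled words at 0.5 (no evidence either way).
    p = {word: 0.5 for word in graph}
    for word in positive_seeds:
        p[word] = 1.0
    for word in negative_seeds:
        p[word] = 0.0

    # Absorption probabilities satisfy p[w] = mean of p over w's neighbours;
    # repeated averaging converges to that fixed point.
    for _ in range(iterations):
        for word, neighbours in graph.items():
            if word in seeds or not neighbours:
                continue
            p[word] = sum(p[n] for n in neighbours) / len(neighbours)

    return {word: 2.0 * p[word] - 1.0 for word in graph}


# Hypothetical toy graph of Polish adjectives (assumed relations).
toy_graph = {
    "dobry": ["świetny", "niezły"],
    "świetny": ["dobry"],
    "niezły": ["dobry", "przeciętny"],
    "przeciętny": ["niezły", "kiepski"],
    "kiepski": ["przeciętny", "zły"],
    "zły": ["kiepski", "okropny"],
    "okropny": ["zły"],
}

scores = extend_polarity(toy_graph, ["dobry"], ["zły"])
for word, score in sorted(scores.items(), key=lambda item: -item[1]):
    print(f"{word}: {score:+.2f}")
```

In this toy setting, words attached only to a positive seed inherit a score near +1, words attached only to a negative seed score near -1, and words lying between the two seeds receive intermediate values. A real lexicon would need a much larger graph and a threshold for leaving ambiguous words unlabelled.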

Corresponding author

Correspondence to Konstanty Haniewicz.

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Haniewicz, K., Kaczmarek, M., Adamczyk, M., Rutkowski, W. (2014). Polarity Lexicon for the Polish Language: Design and Extension with Random Walk Algorithm. In: Świątek, J., Grzech, A., Świątek, P., Tomczak, J. (eds) Advances in Systems Science. Advances in Intelligent Systems and Computing, vol 240. Springer, Cham. https://doi.org/10.1007/978-3-319-01857-7_17

  • DOI: https://doi.org/10.1007/978-3-319-01857-7_17

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-01856-0

  • Online ISBN: 978-3-319-01857-7

  • eBook Packages: Engineering, Engineering (R0)
