Do We Need Word Sense Disambiguation for LCM Tagging?

Wawer, Aleksander; Sarzyńska, Justyna

doi:10.1007/978-3-030-00794-2_21


Part of the book series: Lecture Notes in Computer Science (LNAI, volume 11107)

Included in the following conference series:

International Conference on Text, Speech, and Dialogue

Abstract

Observing the current state of natural language processing, especially for the Polish language, one notices that sense-level dictionaries are becoming increasingly popular. For instance, the largest manually annotated sentiment dictionary for Polish is now based on plWordNet (the Polish WordNet) [13], and a significant part of the Polish Linguistic Category Model (LCM-PL) dictionary [10] is annotated at the sense level. Our paper addresses an important question: what is the influence of word sense disambiguation in real-world scenarios, and how does it compare to the simpler baseline of labeling each word with the tag of its most frequent sense? We evaluate both approaches on data sets compiled for studies on fake opinion detection and on predicting levels of self-esteem in the area of social psychology. Our conclusion is that the baseline method vastly outperforms its competitor.
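
The two labeling strategies compared in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the dictionary layout, the sense identifiers, the example lemmas, and the function names are all hypothetical, and the LCM tags assigned to each sense are toy values chosen only to show how the two strategies can disagree.

```python
# Hypothetical sense-level dictionary: lemma -> list of (sense_id, LCM tag).
# Senses are assumed to be ordered by frequency, most frequent first.
# All entries are toy data, not taken from LCM-PL or plWordNet.
SENSE_TAGS = {
    "grać": [("grać-1", "DAV"), ("grać-2", "IAV")],
    "lubić": [("lubić-1", "SV")],
}


def tag_most_frequent(lemmas):
    """Baseline: label each known lemma with the tag of its first-listed sense."""
    return [SENSE_TAGS[lemma][0][1] for lemma in lemmas if lemma in SENSE_TAGS]


def tag_with_wsd(lemmas, predicted_senses):
    """Label each lemma with the tag of the sense picked by a WSD step.

    `predicted_senses` stands in for the output of an external WSD tool;
    lemmas whose predicted sense is missing from the dictionary are skipped.
    """
    tags = []
    for lemma, sense_id in zip(lemmas, predicted_senses):
        tag_by_sense = dict(SENSE_TAGS.get(lemma, []))
        if sense_id in tag_by_sense:
            tags.append(tag_by_sense[sense_id])
    return tags


lemmas = ["grać", "lubić", "dom"]
wsd_output = ["grać-2", "lubić-1", None]  # assumed WSD predictions
baseline_tags = tag_most_frequent(lemmas)
wsd_tags = tag_with_wsd(lemmas, wsd_output)
```

In this toy case the two strategies differ only for the ambiguous lemma, which is where any gain or loss from word sense disambiguation would have to come from.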

Notes

1. Our data sets were processed in March 2018. We have no information about the version of the WSD module available at that time, nor about any open bugs that might have influenced sense annotation.
2. The study was conducted by members of the Warsaw Evaluative Learning Lab, headed by Professor Robert Balas.

References

1. Beukeboom, C., Tanis, M., Vermeulen, I.: The language of extraversion: extraverted people talk more abstractly, introverts are more concrete. J. Lang. Soc. Psychol. 32(2), 191–201 (2013)
2. Hoorens, V.: What’s really in a name-letter effect? Name-letter preferences as indirect measures of self-esteem. Eur. Rev. Soc. Psychol. 25(1), 228–262 (2014)
3. Kędzia, P., Piasecki, M., Orlińska, M.: Word sense disambiguation based on large scale Polish Clarin heterogeneous lexical resources. Cogn. Stud. 15, 269–292 (2015)
4. Robins, R.W., Hendin, H.M., Trzesniewski, K.H.: Measuring global self-esteem: construct validation of a single-item measure and the Rosenberg self-esteem scale. Pers. Soc. Psychol. Bull. 27(2), 151–161 (2001)
5. Rubikowski, M., Wawer, A.: The scent of deception: recognizing fake perfume reviews in Polish. In: Kłopotek, M.A., Koronacki, J., Marciniak, M., Mykowiecka, A., Wierzchoń, S.T. (eds.) IIS 2013. LNCS, vol. 7912, pp. 45–49. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-38634-3_6
6. Rubini, M., Sigall, H.: Taking the edge off of disagreement: linguistic abstractness and self-presentation to a heterogeneous audience. Eur. J. Soc. Psychol. 32(3), 343–351 (2002)
7. Semin, G.R., Fiedler, K.: The cognitive functions of linguistic categories in describing persons: social cognition and language. J. Pers. Soc. Psychol. 54(4), 558 (1988)
8. Smith, E.R., Mackie, D.M., Claypool, H.M.: Social Psychology. Psychology Press, Hove (2014)
9. Wawer, A., Ogrodniczuk, M.: Results of the PolEval 2017 competition: sentiment analysis shared task. In: 8th Language and Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics (2017)
10. Wawer, A., Sarzyńska, J.: The linguistic category model in Polish (LCM-PL). In: Chair, N.C.C., et al. (eds.) Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018). European Language Resources Association (ELRA), Paris, France, May 2018
11. Wigboldus, D.H., Semin, G.R., Spears, R.: How do we communicate stereotypes? Linguistic bases and inferential consequences. J. Pers. Soc. Psychol. 78(1), 5 (2000)
12. Youyou, W., Kosinski, M., Stillwell, D.: Computer-based personality judgments are more accurate than those made by humans. Proc. Natl. Acad. Sci. 112(4), 1036–1040 (2015)
13. Zaśko-Zielińska, M., Piasecki, M., Szpakowicz, S.: A large wordnet-based sentiment lexicon for Polish. In: Proceedings of the International Conference Recent Advances in Natural Language Processing, pp. 721–730. INCOMA Ltd., Shoumen, Bulgaria, Hissar, Bulgaria, September 2015. http://www.aclweb.org/anthology/R15-1092

Author information

Authors and Affiliations

Institute of Computer Science, Polish Academy of Sciences, Jana Kazimierza 5, 01-248, Warszawa, Poland
Aleksander Wawer
Institute of Psychology, Polish Academy of Sciences, Jaracza 1, 00-378, Warszawa, Poland
Justyna Sarzyńska

Corresponding author

Correspondence to Aleksander Wawer.

Editor information

Editors and Affiliations

Faculty of Informatics, Masaryk University, Brno, Czech Republic
Petr Sojka
Faculty of Informatics, Masaryk University, Brno, Czech Republic
Aleš Horák
Faculty of Informatics, Masaryk University, Brno, Czech Republic
Ivan Kopeček
Faculty of Informatics, Masaryk University, Brno, Czech Republic
Karel Pala

About this paper

Cite this paper

Wawer, A., Sarzyńska, J. (2018). Do We Need Word Sense Disambiguation for LCM Tagging? In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech, and Dialogue. TSD 2018. Lecture Notes in Computer Science, vol 11107. Springer, Cham. https://doi.org/10.1007/978-3-030-00794-2_21

DOI: https://doi.org/10.1007/978-3-030-00794-2_21
Published: 08 September 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00793-5
Online ISBN: 978-3-030-00794-2
eBook Packages: Computer Science, Computer Science (R0)
