Do We Need Word Sense Disambiguation for LCM Tagging?
Observing the current state of natural language processing, especially in the Polish language, one notices that sense-level dictionaries are becoming increasingly popular. For instance, the largest manually annotated sentiment dictionary for Polish is now based on plWordNet (the Polish WordNet) , also the Polish Linguistic Category Model (LCM-PL)  dictionary has its significant part annotated on sense level. Our paper addresses the important question: what is the influence of word sense disambiguation in real-world scenarios and how it compares to the simpler baseline of labeling using just the tag of the most frequent sense. We evaluate both approaches on data sets compiled for studies on fake opinion detection and predicting levels of self-esteem in the area of social psychology. Our conclusion is that the baseline method vastly outperforms its competitor.
KeywordsLinguistic Category Model LCM LCM-PL Polish Word sense disambiguation Sense-level tagging
- 3.Kędzia, P., Piasecki, M., Orlińska, M.: Word sense disambiguation based on large scale Polish Clarin heterogeneous lexical resources. Cogn. Stud. 15, 269–292 (2015)Google Scholar
- 5.Rubikowski, M., Wawer, A.: The scent of deception: recognizing fake perfume reviews in polish. In: Kłopotek, M.A., Koronacki, J., Marciniak, M., Mykowiecka, A., Wierzchoń, S.T. (eds.) IIS 2013. LNCS, vol. 7912, pp. 45–49. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-38634-3_6CrossRefGoogle Scholar
- 9.Wawer, A., Ogrodniczuk, M.: Results of the PolEval 2017 competition: sentiment analysis shared task. In: 8th Language and Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics (2017)Google Scholar
- 10.Wawer, A., Sarzyńska, J.: The linguistic category model in polish (LCM-PL). In: Chair, N.C.C., et al. (eds.) Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018). European Language Resources Association (ELRA), Paris, France, May 2018Google Scholar
- 13.Zaśko-Zielińska, M., Piasecki, M., Szpakowicz, S.: A large wordnet-based sentiment lexicon for Polish. In: Proceedings of the International Conference Recent Advances in Natural Language Processing, pp. 721–730. INCOMA Ltd., Shoumen, BULGARIA, Hissar, Bulgaria, September 2015. http://www.aclweb.org/anthology/R15-1092