Evaluation of Knowledge-Based Recognition of Spatial Expressions for Polish

Part of the Lecture Notes in Computer Science book series (LNAI, volume 12496)

Abstract

In this paper, we address the problem of spatial expression recognition. The goal of the task is to recognize, in text, information structures that represent a relative spatial relationship between two objects (a trajector and a landmark) indicated by a preposition of location, for example, "a book on the table". We used the Corpus of Polish Spatial Texts (PST) to evaluate a knowledge-based approach to spatial expression recognition. We focused on evaluating the recall of the method for filtering candidate spatial expressions. Our goal was to identify the bottlenecks of the existing preprocessing pipeline and of the knowledge-based approach. We show that three main areas need attention: coreference resolution (relations from implied subjects and pronouns to nouns and named entities), word sense disambiguation, and cognitive schemas.
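As an illustration only, the structure described in the abstract (a trajector, a preposition of location, and a landmark) and the recall of a candidate-filtering step could be sketched as follows. The class name `SpatialExpression`, the function `recall`, and the toy data are assumptions made for this sketch; they are not taken from the paper or from the PST corpus.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SpatialExpression:
    """Hypothetical container for one spatial expression."""
    trajector: str    # the object being located, e.g. "book"
    preposition: str  # the preposition of location, e.g. "on"
    landmark: str     # the reference object, e.g. "table"


def recall(gold, candidates):
    """Fraction of gold expressions that survive candidate filtering."""
    gold_set = set(gold)
    if not gold_set:
        return 0.0
    return len(gold_set & set(candidates)) / len(gold_set)


# Toy data (invented for illustration, not from the corpus).
gold = [
    SpatialExpression("book", "on", "table"),
    SpatialExpression("cat", "under", "chair"),
]
candidates = [
    SpatialExpression("book", "on", "table"),
    SpatialExpression("idea", "in", "mind"),
]

print(recall(gold, candidates))
```

In this toy case one of the two gold expressions is kept by the filter, so the recall is 0.5. Expressions lost at this stage cannot be recovered later in a pipeline, which is why a recall-oriented evaluation of the filter is informative.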

Keywords

  • Information extraction
  • Evaluation
  • Spatial information
  • Ontology
  • SUMO
  • Polish spatial texts
  • Polish

Work financed as part of the investment in the CLARIN-PL research infrastructure funded by the Polish Ministry of Science and Higher Education.

Notes

  1. https://github.com/CLARIN-PL/PolDeepNer

Corresponding author

Correspondence to Michał Marcińczuk.

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Cite this paper

Marcińczuk, M., Oleksy, M., Wieczorek, J. (2020). Evaluation of Knowledge-Based Recognition of Spatial Expressions for Polish. In: Nguyen, N.T., Hoang, B.H., Huynh, C.P., Hwang, D., Trawiński, B., Vossen, G. (eds) Computational Collective Intelligence. ICCCI 2020. Lecture Notes in Computer Science, vol 12496. Springer, Cham. https://doi.org/10.1007/978-3-030-63007-2_53

  • DOI: https://doi.org/10.1007/978-3-030-63007-2_53

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-63006-5

  • Online ISBN: 978-3-030-63007-2

  • eBook Packages: Computer Science, Computer Science (R0)