Discovery of Common Nominal Facts for Coreference Resolution: Proof of Concept

  • Maciej Ogrodniczuk
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8284)

Abstract

This paper reports on a preliminary experiment that tests whether nominal facts corresponding to world knowledge can be extracted effectively from both structured and unstructured data, and whether the results can serve as a source of pragmatic knowledge for coreference resolution in Polish. As a proof of concept only, the approach is work in progress and is intended to be validated further in a full-scale project.

Copyright information

© Springer International Publishing Switzerland 2013

Authors and Affiliations

  • Maciej Ogrodniczuk (1)
  1. Institute of Computer Science, Polish Academy of Sciences, Poland
