Manócska: A Unified Verb Frame Database for Hungarian

  • Ágnes Kalivoda
  • Noémi Vadász
  • Balázs Indig
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11107)

Abstract

This paper presents Manócska, a verb frame database for Hungarian. It is called unified because it was built by merging all available verb frame resources. To merge them, we had to reconcile their structural and conceptual differences. We then converted the merged data into two easy-to-use formats: a TSV file and an XML file. Manócska is open access: the whole resource and the scripts used to create it are available in a GitHub repository. This makes Manócska reproducible and easy to access, version, fix and develop further. Several errors came to light during the merging process, and we corrected them as systematically as possible. By integrating and harmonizing the resources, we thus produced a Hungarian verb frame database of higher quality.
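The merging step described in the abstract can be illustrated with a minimal sketch. It is not the authors' actual pipeline: the resource names, the case labels, the order-insensitive frame key and the three TSV columns are all assumptions made for illustration only.

```python
import csv
import io
from collections import defaultdict


def merge_frames(resources):
    """Merge {resource_name: [(verb, frame), ...]} into one table.

    Returns {(verb, frame): sorted list of resource names attesting it}.
    The data layout is hypothetical, not Manócska's real schema.
    """
    merged = defaultdict(set)
    for name, entries in resources.items():
        for verb, frame in entries:
            # Sort the arguments so that differing argument order in two
            # resources does not produce two separate entries (an assumption).
            key = (verb, tuple(sorted(frame)))
            merged[key].add(name)
    return {key: sorted(names) for key, names in merged.items()}


def to_tsv(merged):
    """Serialise the merged table as TSV text with a header row."""
    buf = io.StringIO()
    writer = csv.writer(buf, delimiter="\t", lineterminator="\n")
    writer.writerow(["verb", "frame", "sources"])
    for (verb, frame), names in sorted(merged.items()):
        writer.writerow([verb, " ".join(frame), ",".join(names)])
    return buf.getvalue()


if __name__ == "__main__":
    # Toy input: two invented resources that list the same frame of "ad"
    # ("give") with the arguments in a different order.
    toy = {
        "resource_a": [("ad", ["NOM", "ACC", "DAT"]), ("fut", ["NOM"])],
        "resource_b": [("ad", ["DAT", "ACC", "NOM"])],
    }
    print(to_tsv(merge_frames(toy)), end="")
```

Recording which resources attest each frame keeps the provenance of every entry visible after the merge, which is one simple way to make conflicts between resources easy to spot.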

Keywords

Verb frame database; Lexical resource; Corpus linguistics; Hungarian

Copyright information

© Springer Nature Switzerland AG 2018

Authors and Affiliations

  • Ágnes Kalivoda (1, 2, 3)
  • Noémi Vadász (1, 3)
  • Balázs Indig (1, 2)

  1. Pázmány Péter Catholic University, Budapest, Hungary
  2. MTA–PPKE Hungarian Language Technology Research Group, Budapest, Hungary
  3. Research Institute for Linguistics, Hungarian Academy of Sciences, Budapest, Hungary
