Skip to main content

Identification of Croatian Light Verb Constructions with NooJ

  • Conference paper
  • First Online:
Formalizing Natural Languages with NooJ and Its Natural Language Processing Applications (NooJ 2017)

Abstract

This paper deals with the identification of verbal multiword expressions in Croatian using NooJ. We focus on light verb constructions, i.e. constructions consisting of a light verb and a nominal part (e.g. donijeti odluku ‘to make a decision’ or ‘to reach a decision’). The nominal part in light verb constructions (LVCs) can be either a noun phrase (e.g. dati udarac ‘to give a blow’) or a prepositional phrase (staviti na raspolaganje ‘to put at disposal’). Light verbs are entirely or partially deprived of their lexical meaning and the meaning of the whole construction is conveyed by their complimentary NPs or PPs. All elements in these constructions act as a single unit at some level of linguistic analysis, particularly at the level of semantics and, more recently, of syntax.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    However, although this is one of our final objectives, we will deal with this part in future work.

  2. 2.

    It is important to notice that the preposition in Croatian language do not have a case. However, each one only allows NPs in some cases to follow them. This information is incoded in the dictionary of pronouns as a Case attribute and it is what we refer to at this place.

  3. 3.

    The same logic present in graphs for the recognition of Types 1 and 5 is followed for the recognition of Types 2, 3 and 4.

References

  1. Agić, Ž., Ljubešić, N.: Universal dependencies for Croatian (that work for Serbian, too). In: Proceedings of the 5th Workshop on Balto-Slavic Natural Language Processing, Hissar, Bulgaria, pp. 1–8 (2015)

    Google Scholar 

  2. Baldwin, T., Kim, S.N.: Multiword expressions. In: Indurkhya, N., Damerau, F.J. (eds.) Handbook of Natural Language Processing, 2nd edn, pp. 267–292. CRC Press, Boca Raton (2010)

    Google Scholar 

  3. Kocijan, K., Librenjak, S.: Recognizing verb-based Croatian idiomatic MWUs. In: Okrut, T., Hetsevich, Y., Silberztein, M., Stanislavenka, H. (eds.) NooJ 2015. CCIS, vol. 607, pp. 96–106. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-42471-2_9

    Chapter  Google Scholar 

  4. Gulić, A.: Klasifikacija perifraznih glagola u hrvatskom jeziku. M.A. thesis. Faculty of Humanities and Social Sciences, University of Zagreb (2015)

    Google Scholar 

  5. Katunar, D., Srebačić, M., Raffaelli, I., Šojat, K.: Arguments for phrasal verbs in Croatian and their influence on semantic relations in Croatian WordNet. In: Proceedings of the Eight International Conferences on Language Resources and Evaluation (2012)

    Google Scholar 

  6. Menac, A., Fink Arsovski, Ž., Venturin, R.: Hrvatski frazeološki rječnik. Naklada Ljevak, Zagreb (2014)

    Google Scholar 

  7. Rosén, V., et al.: MWEs in treebanks: from survey to guidelines. In: Proceedings of the 10th International Conference on Language Resources and Evaluation, Portorož, Slovenia, pp. 179–193 (2016)

    Google Scholar 

  8. Sag, I.A., Baldwin, T., Bond, F., Copestake, A., Flickinger, D.: Multiword expressions: a pain in the neck for NLP. In: Gelbukh, A. (ed.) CICLing 2002. LNCS, vol. 2276, pp. 1–15. Springer, Heidelberg (2002). https://doi.org/10.1007/3-540-45715-1_1

    Chapter  Google Scholar 

  9. Silberztein, M.: Formalizing Natural Languages: The NooJ Approach. Wiley, USA (2016)

    Book  Google Scholar 

  10. Silić, J., Pranjković, I.: Gramatika hrvatskoga jezika – za gimnazije i visoka učilišta. Školska knjiga, Zagreb (2005)

    Google Scholar 

  11. Šojat, K., Filko, M., Farkaš, D.: Verbal multiword expressions in Croatian. In: Proceedings of the Second International Conference Computational Linguistics in Bulgaria. Institute for Bulgarian Language, Bulgarian Academy of Science, pp. 78–85 (2016)

    Google Scholar 

  12. Vučković, K.: Model parsera za hrvatski jezik. Ph.D. dissertation. Faculty of Humanities and Social Sciences, Zagreb (2009)

    Google Scholar 

  13. Vučković, K., Tadić, M., Bekavac, B.: Croatian language resources for NooJ. CIT. J. Comput. Inf. Technol. 18, 295–301 (2010)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kristina Kocijan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Šojat, K., Kocijan, K., Bekavac, B. (2018). Identification of Croatian Light Verb Constructions with NooJ. In: Mbarki, S., Mourchid, M., Silberztein, M. (eds) Formalizing Natural Languages with NooJ and Its Natural Language Processing Applications. NooJ 2017. Communications in Computer and Information Science, vol 811. Springer, Cham. https://doi.org/10.1007/978-3-319-73420-0_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-73420-0_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-73419-4

  • Online ISBN: 978-3-319-73420-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics