Enriching WordNet with Derivational Subnets

  • Conference paper
Computational Linguistics and Intelligent Text Processing (CICLing 2005)

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 3406)

Abstract

In this paper, we deal with derivational (word-formation) relations as they are handled by the Czech morphological module Ajka. First, we show that they represent empirically well-founded semantic relations that form small semantic networks; we then address the problem of how to integrate them into a lexical database such as the (Czech) WordNet. To this end, we examine the relation between derivational relations and the semantic roles (deep cases) defined as Internal Language Relations in EuroWordNet (EWN), and we attempt to match the EWN inventory of semantic roles with the derivational (semantic) relations. We also use a tool called SAFT, which processes raw corpus text and uses the Ajka module to find links relating WordNet senses to the noun and verb lemmata obtained from that text. This technique allows us to enrich the Czech WordNet with derivational subnets and to represent them in an XML format. The result is a new kind of semantic network consisting of two layers, upper and lower, which offers a more powerful and efficient resource for applications such as WSD tools, web search, and information extraction.
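
The two-layer idea described in the abstract can be pictured with a small sketch: an upper layer of WordNet synsets, each optionally carrying a lower-layer derivational subnet whose links are labelled with a semantic role, serialised to XML. This is only an illustration under stated assumptions, not the authors' format. The class names, the XML element and attribute names (`WN`, `SYNSET`, `LITERAL`, `DERIV_SUBNET`, `DERIV`), and the synset identifier are hypothetical. The pair učit (to teach) and učitel (teacher) is a common Czech agentive derivation chosen for illustration and is not taken from the paper; labelling it with the EWN-style role `ROLE_AGENT` is likewise an assumption.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass, field
from typing import List


@dataclass
class DerivLink:
    """One lower-layer link: base lemma -> derived lemma, with a role label."""
    base: str
    derived: str
    relation: str  # assumed EWN-style role label, e.g. "ROLE_AGENT"


@dataclass
class Synset:
    """One upper-layer node: a synset with an optional derivational subnet."""
    synset_id: str  # hypothetical identifier
    lemmas: List[str]
    deriv_links: List[DerivLink] = field(default_factory=list)


def to_xml(synsets: List[Synset]) -> str:
    """Serialise synsets and their derivational subnets (hypothetical schema)."""
    root = ET.Element("WN")
    for synset in synsets:
        syn_el = ET.SubElement(root, "SYNSET", id=synset.synset_id)
        for lemma in synset.lemmas:
            ET.SubElement(syn_el, "LITERAL").text = lemma
        if synset.deriv_links:
            subnet_el = ET.SubElement(syn_el, "DERIV_SUBNET")
            for link in synset.deriv_links:
                ET.SubElement(
                    subnet_el,
                    "DERIV",
                    base=link.base,
                    derived=link.derived,
                    rel=link.relation,
                )
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    teach = Synset(
        synset_id="hypothetical-v-1",
        lemmas=["učit"],
        deriv_links=[DerivLink("učit", "učitel", "ROLE_AGENT")],
    )
    print(to_xml([teach]))
```

Keeping the derivational links inside the synset element keeps the two layers separate while still letting an application walk from a sense to its derived lemmata in one lookup; a real resource could equally store the subnets in a separate file keyed by synset identifier.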




Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Pala, K., Sedláček, R. (2005). Enriching WordNet with Derivational Subnets. In: Gelbukh, A. (ed.) Computational Linguistics and Intelligent Text Processing. CICLing 2005. Lecture Notes in Computer Science, vol 3406. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30586-6_33

  • DOI: https://doi.org/10.1007/978-3-540-30586-6_33

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-24523-0

  • Online ISBN: 978-3-540-30586-6

  • eBook Packages: Computer Science, Computer Science (R0)
