Description and acquisition of multiword lexemes

  • Acquiring Lexical Data
  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 898))

Abstract

This paper deals with multiword lexemes (MWLs), focussing on two types of verbal MWLs: verbal idioms and support verb constructions. We discuss the characteristic properties of MWLs, namely nonstandard compositionality, restricted substitutability of components, and restricted morpho-syntactic flexibility, and we show how these properties may cause serious problems during the analysis, generation, and transfer steps of machine translation (MT) systems. To cope with these problems, MT lexicons need to provide detailed descriptions of MWL properties. We list the types of information that we consider the necessary minimum for successful processing of MWLs, and report on some feasibility studies aimed at the automatic extraction of German verbal multiword lexemes from text corpora and machine-readable dictionaries.

Editor information

Petra Steffens

Copyright information

© 1995 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Storrer, A., Schwall, U. (1995). Description and acquisition of multiword lexemes. In: Steffens, P. (ed.) Machine Translation and the Lexicon. WMTL 1993. Lecture Notes in Computer Science, vol 898. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-59040-4_19

  • DOI: https://doi.org/10.1007/3-540-59040-4_19

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-59040-8

  • Online ISBN: 978-3-540-49174-3

  • eBook Packages: Springer Book Archive