Abstract
This paper deals with multiword lexemes (MWLs), focussing on two types of verbal MWLs: verbal idioms and support verb constructions. We discuss the characteristic properties of MWLs, namely nonstandard compositionality, restricted substitutability of components, and restricted morpho-syntactic flexibility, and we show how these properties may cause serious problems during the analysis, generation, and transfer steps of machine translation (MT) systems. To cope with these problems, MT lexicons need to provide detailed descriptions of MWL properties. We list the types of information that we consider the necessary minimum for successful processing of MWLs, and we report on some feasibility studies aimed at the automatic extraction of German verbal multiword lexemes from text corpora and machine-readable dictionaries.
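As a rough illustration of the kind of lexicon description the abstract calls for, the following Python sketch records the three MWL properties (compositionality, substitutability of components, morpho-syntactic flexibility) in a single entry. The class name, field names, operation labels, and the values in the two German example entries are assumptions made for illustration only; they are not the representation proposed in the paper.

```python
from dataclasses import dataclass, field


@dataclass
class MWLEntry:
    """Hypothetical lexicon record for one verbal multiword lexeme."""

    lemma: str                   # citation form of the whole expression
    mwl_type: str                # "idiom" or "support_verb_construction"
    components: tuple[str, ...]  # lexical components in citation order
    compositional: bool          # False if the meaning is not derivable from the parts
    # Permitted alternatives per component; a component without an entry is fixed.
    substitutable: dict[str, set[str]] = field(default_factory=dict)
    # Morpho-syntactic operations the expression tolerates (illustrative labels).
    allowed_operations: set[str] = field(default_factory=set)

    def permits(self, operation: str) -> bool:
        """Return True if the given operation is recorded as allowed."""
        return operation in self.allowed_operations

    def accepts_component(self, slot: str, word: str) -> bool:
        """Return True if `word` may fill the position of component `slot`."""
        if slot not in self.components:
            return False
        return word == slot or word in self.substitutable.get(slot, set())


# Illustrative values only: a verbal idiom treated as fully frozen.
idiom = MWLEntry(
    lemma="ins Gras beißen",
    mwl_type="idiom",
    components=("ins", "Gras", "beißen"),
    compositional=False,
)

# Illustrative values only: a support verb construction whose verb may alternate.
svc = MWLEntry(
    lemma="zur Verfügung stellen",
    mwl_type="support_verb_construction",
    components=("zur", "Verfügung", "stellen"),
    compositional=True,
    substitutable={"stellen": {"stehen"}},
    allowed_operations={"passive"},
)
```

With entries of this shape, an analysis or generation component could call `svc.permits("passive")` or `idiom.accepts_component("Gras", "Heu")` before applying a syntactic rule or substituting a word, rather than treating the MWL as a freely combinable phrase.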
Copyright information
© 1995 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Storrer, A., Schwall, U. (1995). Description and acquisition of multiword lexemes. In: Steffens, P. (eds) Machine Translation and the Lexicon. WMTL 1993. Lecture Notes in Computer Science, vol 898. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-59040-4_19
DOI: https://doi.org/10.1007/3-540-59040-4_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-59040-8
Online ISBN: 978-3-540-49174-3
eBook Packages: Springer Book Archive