Abstract
Essential aspects of a verb’s usage reside in its valence environments. The Norwegian valence resource here presented, called NorVal, has 6,300 verb lemmas. About 3,360 of them are associated with sets of frames, and the organization of entries is divided into one enumeration of the total number of frame-specific entries, which is about 15,750, and one enumeration of lemmas, counting 6,300. About 300 frame types are distinguished inducing the 15,750 frame specific entries, taking into account most grammatical factors distinguishing verb frames and verb-headed construction types. Both the frame types and the two dimensions of entries are represented in string-based formalisms, enabling simple procedures for comparing individual valence frames, frame-specific entries, and entries representing lemmas, and for doing statistics over types and combinations of all of these. The paper illustrates the resources relative to their representation of light reflexives, verb particles, and frames including sentential constituents.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
Universal Dependency Grammar: https://universaldependencies.org/.
- 3.
Apparent countercases like ut porten ‘out the gate’ as in Han løp ut porten ‘He ran out through the gate’ can be analyzed as generally representing an understood ‘through’ or ‘along’ (cf. Jørgensen [29]), and thus in principle following the pattern of (ran) out of the house. (The opposite position may lead to regarding adverbs as ‘intransitive prepositions’, a notion we thus reject.).
- 4.
The 5 ‘aberrant’ verbs are ergre (‘annoy’), kjøpe (‘buy’), skjenke (apart from senses ‘give, donate, endow’, here meaning ‘pour’), spyle (‘flush’) and stue (‘stow’), and one may try to find a factor distinguishing those to be worked into the prediction.
- 5.
In presentational constructions with an LR (as in Det setter seg en katt ‘there seats itself a cat’), the LR also, in our analysis, counts as object; here, the expletive det carrying subject status, en katt is assigned the function of ‘presented’, following Hellan and Beermann [21], Hellan [16]. These constructions have not yet been fully registered in the catalogue, see Sect. 5.1.
- 6.
In valence assignments in the valence corpus sustained by the grammar Norsource, such ‘derived’ structures are accordingly assigned valence frame reflecting their ‘base’ structure.
- 7.
Rather than (8) above, the valpod thus would be:
spise:{V-av__intrObl-oblN-ACTIVITY & V-på__intrObl-oblN-ACTIVITY & V__tr & V-innpå__trObl-obRefl-oblN & V-opp__trPrtcl & V-i__trScpr-scPPrefl}.
- 8.
A factor here is also that the semantics of ‘input’ and ‘output’ must be consistently related (synonymy in the cases of passives etc., addition of causation between defined entities in the previous case). So, in Nei, han er opptatt, han sitter og spiser (‘No, he is occupied, he is sitting eating’), the verb spise ‘eat’ is arguably used as a one-participant concept; would this be a relevant proposal in many of the 1000 cases? Or how many are analyzable in terms of causativization (or ‘anticausativization’)?
A related concern is when a transitive and an intransitive frame are ‘collapsed’ using symbols for optionality, such as in the use of parentheses in notations like ‘NP V (NP)’ for expressing ‘optionality’ of an object: a constant semantic relation of meaning must be defined between such options.
- 9.
Also for løpe there is a frame which can be used only for one of the senses, viz. the frame with a caused secondary predicate, which is available only for the reading involving actual movement: a. Hun løper seg frisk. ‘She runs herself healthy’. b. *Linjen løper seg krum.’ The line runs itself curved’. The circumstance alluded to in (b) can be expressed, e.g., by Linjen krummer seg’ The line curves’, just not with the valence frame in question.
- 10.
A matter closely related to senses are multi-word expressions (MWEs), including idioms. They mostly follow the patterns of non-MWEs as far as the syntax is concerned, and so their forms can in principle be specified within the existing frame repertory, while their meanings would be encoded once senses would be eventually encoded. The form specification would be an extension of the format already used for ‘selected’ items. For a preliminary outline concerning the type of MWEs called Light Verb Constructions, see Hellan [14, 18].
- 11.
- 12.
- 13.
- 14.
- 15.
- 16.
- 17.
http://clip.ipipan.waw.pl/Walenty; cf. Przepiórkowski et al. [18].
- 18.
A device for displaying the feature structures associated with each frame type label is also envisaged, cf. https://typecraft.org/tc2wiki/NorVal_resources.
- 19.
- 20.
References
Beermann, D., Hellan, L.: Enhancing grammar and valence resources for Akan and Ga. In: West African Languages. Linguistic Theory and Communication. Wydawnictwa Uniwersytetu Warszawskiego, Warzawa, pp. 166–185 (2020). ISBN 978-83-235-4623-8
Bresnan, J.: Lexical Functional Grammar. Blackwell, Oxford (2001)
Calzolari, N., et al. (eds.): Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), Reykjavík, Iceland. ELRA (2014)
Carpenter, B.: The Logic of Typed Feature Structures. Cambridge University Press, Cambridge (1992)
Copestake, A.: Implementing Typed Feature Structure Grammars. CSLI Publications, Stanford (2002)
Copestake, A., Flickinger, D., Sag, I., Pollard, C.: Minimal recursion semantics: an introduction. J. Res. Lang. Comput. 3, 281–332 (2005)
Creissels, D.: Transitivity, valency, and voice. Ms. European Summer School in Linguistic Typology. Porquerolles (2016)
Dakubu, M.E.K.: Ga-English Dictionary with English-Ga Index. Black Mask Publishers, Accra (2009)
Dakubu, M.E.K.: Ga Toolbox project expanded with Construction Labeling valence information. Ms (2010). https://typecraft.org/tc2wiki/Ga_Valence_Profile
Dakubu, M.E.K.: Ga Verbs and their constructions. Monograph ms, Univ. of Ghana (2011)
Dakubu, M.E.K., Hellan, L.: A labeling system for valency: linguistic coverage and applications. In: Hellan, L., Malchukov, A., Cennamo, M. (eds.) Contrastive Studies in Valency. John Benjamins Publ. Co., Amsterdam (2017)
Dalrymple, M., Lødrup, H.: The grammatical functions of complement clauses. In: Proceedings of the LFG00 Conference. CSLI Publications (2000)
Haugen, T.A.: Polyvalent adjectives in Norwegian: aspects of their semantics and complementation patterns. Ph.D. dissertation, University of Oslo (2012)
Hellan, L.: Construction-based compositional grammar. J. Logic Lang. Inform. 28(2), 101–130 (2019). https://doi.org/10.1007/s10849-019-09284-5
Hellan, L.: Interoperable semantic annotation. In: LREC workshop ISA-16, 6th Joint ACL-ISO Workshop on Interoperable Semantic Annotation (2020)
Hellan, L.: Representing Light Reflexives in a valence resource for Norwegian. Presentation at SLE 2020 (2020)
Hellan, L.: Supplementary data for: ‘a valence catalogue for Norwegian. In: Loukanova (ed.) Natural Language Processing in Artificial Intelligence, NLPinAI 2021. Springer (2021). https://doi.org/10.18710/8U3L2U. (Further resources are displayed at: https://typecraft.org/tc2wiki/NorVal_resources)
Hellan, L.: Unification and selection in Light Verb Constructions. A study of Norwegian. In: Pompei, A., Mereu, L., Piunno, V. (eds.) Light verb constructions as complex verbs. Features, typology and function (Series “Trends in Linguistics. Studies and Monographs”), Mouton de Gruyter (to appear)
Hellan, L., Dakubu, M.E.K.: Identifying verb constructions cross-linguistically. In: Studies in the Languages of the Volta Basin 6.3. Legon: Linguistics Department, University of Ghana (2010). https://typecraft.org/tc2wiki/Verbconstructions_cross-linguistically_-_Introduction
Hellan, L., Bruland, T.: A cluster of applications around a Deep Grammar. In: Vetulani, Z., et al. (eds.) Proceedings from The Language & Technology Conference (LTC) 2015, Poznan (2015). Web server version at http://regdili.hf.ntnu.no:8081/linguisticAce/parse
Hellan, L., Beermann, D.: Presentational and related constructions in Norwegian with reference to German. In: Abraham, W., Leiss, E., Fujinawa, Y. (eds.) Thetics and Categoricals. [LA 262], John BenjaminsPublishing Company (2020)
Hellan, L., Johnsen, L.G., Pitz, A.: TROLL. Ms, University of Trondheim. (Downloadable at Nasjonalbiblioteket) (1989)
Hellan, L., Beermann, D., Bruland, T., Dakubu, M.E.K., Marimon, M.:. MultiVal: towards a multilingual valence lexicon. In: Calzolari, et al. (eds.) Proceedings of LREC 2014 (2014). (web demo: https://typecraft.org/tc2wiki/Multilingual_Verb_Valence_Lexicon and http://regdili.hf.ntnu.no:8081/multilanguage_valence_demo/multivalence)
Hellan, L., Malchukov, A.L., Cennamo, M. (eds): Contrastive studies in Valency. John Benjamins Publ. Co., Amsterdam & Philadelphia (2017)
Hellan, L., Beermann, D., Bruland, T., Haugland, T., Aamot, E.: Creating a Norwegian valence corpus from a deep grammar. In: Vetulani, Z., Paroubek, P., Kubis, M. (eds.) Human Language Technology. Challenges for Computer Science and Linguistics. 8th Language & Technology Conferene, LTC 2017. LNCS, vol. 12598. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-66527-2_1. ISBN 978-3-030-66526-5, https://typecraft.org/tc2wiki/Norwegian_Valency_Corpus)
Holen, G.I.: Automatic anaphora resolution for Norwegian. In: Branco, A. (ed.) 6th Discourse Anaphora and Anaphor Resolution Colloquium, DAARC 2007, Lagos, Portugal, pp. 151–166. Springer, Berlin. (2007) https://doi.org/10.1007/978-3-540-71412-5_11
Jespersen, O.: Analytic Syntax. Holt, Rinehart and Winston, New York (1969, orig. edition 1937)
Jespersen, O.: The Philosophy of Grammar. Routledge, London. (2010, orig. edition 1924)
Jørgensen, F.: The semantic representation of location in machine translation. Cand. Philol. thesis, University of Oslo (2004)
Korhonen, A., Briscoe, T.: extended lexical-semantic classification of english verbs. In: Proceedings of the HLT/NAACL Workshop on Computational Lexical Semantics, Boston, MA (2004)
Levin, B.: English Verb Classes and Alternations. University of Chicago Press, Chicago (1991)
Loukanova, R.: An approach to functional formal models of constraint-based lexicalized grammar (CBLG). Fund. Inform. 152(4), 341–372 (2017). https://doi.org/10.3233/FI-2017-1524
Malchukov, A.L., Comrie, B. (eds.): Valency Classes in the World’s Languages. Mouton De Gruyter, Berlin (2015)
Marantz, A.: Grammatical Relations. MIT Press, Cambridge (1985)
Marneffe, M.-C., Manning, C.D., Nivre, J., Zeman, D.: Universal Dependencies. Computational Linguistics (2021). https://doi.org/10.1162/COLI_a_00402
Nordgård, T.: Norwegian Computational Lexicon (NorKompLeks). In: Proceedings of NoDaLiDa 1998 (1998)
Pollard, C., Sag, I.A.: Head-Driven Phrase Structure Grammar. Chicago University Press, Chicago (1994)
Przepiórkowski, A., Hajnicz, E., Patejuk, A., Woliński, M., Skwarski, F., Świdziński, M.: Walenty: towards a comprehensive valence dictionary of Polish. In: Calzolari et al. (eds.) (2014)
Quasthoff, U., Hellan, L., Körner, E., Eckart, T., Goldhahn, D., Beermann, D.: Typical Sentences as a Resource for Valence. LREC 2020 (2020). http://www.lrec-conf.org/proceedings/lrec2020/index.html
Ross, J.R.: Constraints on variables in syntax. PhD dissertation, MIT (1967)
Tesnière, L.: Éleménts de syntaxe structurale. Klincksieck, Paris (1959)
Acknowledgments
I am grateful to Dorothee Beermann, the editor Roussanka Loukanova, and the reviewers of this chapter, for comments and advice.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendices
Appendix 1 Overview of Frame Types
The first column in the following table lists lexvals, with one lexval for each frame type. The ordering of the rows reflects a standard ordering of frame types (alphabetically, and according to internal DTD, cf. Sect. 3. Each frame type is presented as part of a lexval, and thus with a lemma to its left, so that when searching according to the alphabetical order of frame types, one must ignore what is to the left of the ‘__’.
The English translations often do not quite match the valence pattern of the source sentence, and the points of deviance are marked in this way: VR,P,L,T,I,NS means that V (mostly the verb, but also a preposition) differs in valence from the Norwegian counterpart with respect to, respectively, reflexive, preposition, particle, finite complementizer, infinitival marker, non-split predicate (in most cases the factor is missing in the translation, but in some cases in the original).
To see the logical forms associated with the various frame types, the example sentences in column 2 can be entered into the online grammar parse window at http://regdili.hf.ntnu.no:8081/linguisticAce/parse, where in most cases an MRS (‘Minimal Recursion Semantics’; cf. Copestake et al. [6]) representation, close to a standard predicate logic representation, is displayed for each parse. Further supporting facilities are described at https://typecraft.org/tc2wiki/NorVal_resources (Table 15).
Appendix 2 Verbs Allowing for All Three Types of Clausal Arguments: Declaratives, Interrogatives and Infinitives
See Table 16.
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Hellan, L. (2022). A Valence Catalogue for Norwegian. In: Loukanova, R. (eds) Natural Language Processing in Artificial Intelligence — NLPinAI 2021. Studies in Computational Intelligence, vol 999. Springer, Cham. https://doi.org/10.1007/978-3-030-90138-7_3
Download citation
DOI: https://doi.org/10.1007/978-3-030-90138-7_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-90137-0
Online ISBN: 978-3-030-90138-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)