Abstract
CompounDic is an Arabic MWEs dictionary that lists many entries, divided into more than 20 domains. It lists only MWEs in their base form. With regard to syntactic and morphological flexibility, the lexicon covers 2 types of MWEs: Fixed MWEs (no variation allowed) and semi-fixed MWEs (variation in their structural pattern). Arabic presents distinctive features to deal with MWEs processing. A lot of possible derivations are possible (plural or dual forms, multiple irregular plurals). In addition, we need to process agglutination forms. In this paper, we will study the structural variability of semi-fixed multiword expressions in Arabic language in order to recognize the morphological and inflectional variations. We will adopt a recognition approach based on the use of a cascade of local grammars.
The recognition system is based on NooJ’s local grammars as well as an Arabic MWEs dictionary covering more than 20 domains. The inflectional and derivational rules, which concern semi-fixed MWEs, use some specific morphological operators that will be described as well. Finally, we present new results showing the experimentation scores of morpho-lexical coverage enhancement.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
- 2.
Electronic Dictionary for Arabic “El-DicAr” resources (Mesfar et al. 2008), developed using NooJ’s linguistic engine.
- 3.
NooJ’s syntactic, inflectional and semantic categories are detailed in Annex.
References
Najar, D., Mesfar, S., Ghezela, H.B.: A large terminological dictionary of Arabic compound words. In: Okrut, T., Hetsevich, Y., Silberztein, M., Stanislavenka, H. (eds.) Automatic Processing of Natural-Language Electronic Texts with NooJ, pp. 16–28. Springer, Cham (2015)
Mesfar, S.: Analyse Morpho-syntaxique Automatique et Reconnaissance Des Entités Nommées En Arabe Standard. Thesis, Graduate School - Languages, Space, Time, Societies. Paris, France (2008)
Silberztein, M.: Nooj’s dictionaries. In: Vetulani, Z. (ed.): Proceedings of the 2nd Language and Technology Conference. Wydawnictvo Poznańskie Sp. z o.o., Poznan (2005)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Najar, D., Mesfar, S., Ben Ghezala, H. (2016). Inflectional and Morphological Variation of Arabic Multi-word Expressions. In: Barone, L., Monteleone, M., Silberztein, M. (eds) Automatic Processing of Natural-Language Electronic Texts with NooJ. NooJ 2016. Communications in Computer and Information Science, vol 667. Springer, Cham. https://doi.org/10.1007/978-3-319-55002-2_4
Download citation
DOI: https://doi.org/10.1007/978-3-319-55002-2_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-55001-5
Online ISBN: 978-3-319-55002-2
eBook Packages: Computer ScienceComputer Science (R0)