Abstract
Lexical analysis can be a way to remove ambiguities in the Arabic language. So, their resolution is an important task in several domains of Natural Language Processing (NLP). In this context, this paper is inscribed. Our proposed resolution method is based essentially on the use of transducers on text automata. Indeed, these transducers specify the lexical rules of the Arabic language allowing corpus disambiguation. In order to achieve our resolution method, different types of lexical ambiguities are identified and studied. Then, an appropriate set of rules is proposed. After that, we represent all specified rules in NooJ. In addition, we present experimentation with NooJ platform conducted through various linguistic resources to obtain disambiguated syntactic structures suitable for the analysis. The results obtained are ambitious and can be improved by adding other rules and heuristics.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Attia, M.: Handlinh Arabic morphological and syntactic ambiguity within the LFG framework with a view to machine translation. Ph.D. thesis in the University of Manchester (2008)
Beesley, K.: Finite-state morphological analysis and generation of arabic at xerox research: status and plans. In: ACL/EACL, Toulouse, France, 6 July 2001
Dichy, J., Alrahabi, M.: Levée d’ambiguité par la methode d’exploration contextuelle: la sequence ‘alif-nûn’ en arabic. In: SIIE, Hammamet, Tunisia (2009)
Diab, M.: Second generation tools (AMIRA 2.0): fast and robust tokenization, POS tagging, and base phrase chunking. In: MEDAR 2nd International Conference on Arabic Language Resources and Tools, April, Cairo, Egypt (2009)
Ellouze, S., Haddar, K., Abdelhamid, A.: Etude et analyse du pluriel brisé arabe avec la plateforme NooJ. In: NooJ Conference and Workshop. Tozeur, Tunisia (2009)
Fehri, H., Haddar, K., Abdelhamid, A.: Recognition and translation of Arabic named entities with NooJ using a new representation model. In: FSMNLP, pp. 134–142 (2011)
Habash, N., Rambow, O., Roth, R.: MADAÂ +Â TOKAN: a toolkit for Arabic tokenization, diacritization, morphological disambiguation, POS tagging, stemming ald lemmatization. In: Proceedings of the 2nd International Conference on Arabic Language Resources and Tools (MEDAR), Cairo, Egypt (2009)
Othman, E., Shaalan, K., Rafea, A.: Towards resolving ambiguity in understanding Arabic sentence. In: International Conference on Arabic Language Resources and Tools (2006)
Mesfar, S.: Morphological grammars for standard Arabic tokenization. In: Proceedings of the International NooJ Conference, pp. 114–127. Cambridge Scholars Publishing, Newcastle (2010)
Mesfar, S.: Analyse morpho-syntaxique automatique et reconnaissance des entités nommées en arabe strandard. Ph.D. thesis in the University of Franche Comté, 235 (2008)
Silberztein, M.: Disambiguation tools for NooJ. In: Proceedings of the 2008 International NooJ Conference, pp. 158–171. Cambridge Scholars Publishing, Newcastle (2010)
Shaalan, K., Othman, E., Rafea, A.: Towards resolving ambiguity in understanding Arabic sentence. In: The Proceedings of the International Conference on Arabic Language Resources and Tools, NEMLAR, Cairo, Egypt, 22–23 September 2004, pp. 118–122 (2004)
Zalila, I, Haddar, K.: Construction of an HPSG grammar for the Arabic relative sentences. In: The Proceedings of RANLP, Hissar, Bulgaria (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Ghezaiel, N., Haddar, K. (2016). Study and Resolution of Arabic Lexical Ambiguity Through Transduction on Text Automaton. In: Okrut, T., Hetsevich, Y., Silberztein, M., Stanislavenka, H. (eds) Automatic Processing of Natural-Language Electronic Texts with NooJ. NooJ 2015. Communications in Computer and Information Science, vol 607. Springer, Cham. https://doi.org/10.1007/978-3-319-42471-2_11
Download citation
DOI: https://doi.org/10.1007/978-3-319-42471-2_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-42470-5
Online ISBN: 978-3-319-42471-2
eBook Packages: Computer ScienceComputer Science (R0)