Skip to main content

Study and Resolution of Arabic Lexical Ambiguity Through Transduction on Text Automaton

  • Conference paper
  • First Online:
Automatic Processing of Natural-Language Electronic Texts with NooJ (NooJ 2015)

Abstract

Lexical analysis can be a way to remove ambiguities in the Arabic language. So, their resolution is an important task in several domains of Natural Language Processing (NLP). In this context, this paper is inscribed. Our proposed resolution method is based essentially on the use of transducers on text automata. Indeed, these transducers specify the lexical rules of the Arabic language allowing corpus disambiguation. In order to achieve our resolution method, different types of lexical ambiguities are identified and studied. Then, an appropriate set of rules is proposed. After that, we represent all specified rules in NooJ. In addition, we present experimentation with NooJ platform conducted through various linguistic resources to obtain disambiguated syntactic structures suitable for the analysis. The results obtained are ambitious and can be improved by adding other rules and heuristics.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Attia, M.: Handlinh Arabic morphological and syntactic ambiguity within the LFG framework with a view to machine translation. Ph.D. thesis in the University of Manchester (2008)

    Google Scholar 

  2. Beesley, K.: Finite-state morphological analysis and generation of arabic at xerox research: status and plans. In: ACL/EACL, Toulouse, France, 6 July 2001

    Google Scholar 

  3. Dichy, J., Alrahabi, M.: Levée d’ambiguité par la methode d’exploration contextuelle: la sequence ‘alif-nûn’ en arabic. In: SIIE, Hammamet, Tunisia (2009)

    Google Scholar 

  4. Diab, M.: Second generation tools (AMIRA 2.0): fast and robust tokenization, POS tagging, and base phrase chunking. In: MEDAR 2nd International Conference on Arabic Language Resources and Tools, April, Cairo, Egypt (2009)

    Google Scholar 

  5. Ellouze, S., Haddar, K., Abdelhamid, A.: Etude et analyse du pluriel brisé arabe avec la plateforme NooJ. In: NooJ Conference and Workshop. Tozeur, Tunisia (2009)

    Google Scholar 

  6. Fehri, H., Haddar, K., Abdelhamid, A.: Recognition and translation of Arabic named entities with NooJ using a new representation model. In: FSMNLP, pp. 134–142 (2011)

    Google Scholar 

  7. Habash, N., Rambow, O., Roth, R.: MADA + TOKAN: a toolkit for Arabic tokenization, diacritization, morphological disambiguation, POS tagging, stemming ald lemmatization. In: Proceedings of the 2nd International Conference on Arabic Language Resources and Tools (MEDAR), Cairo, Egypt (2009)

    Google Scholar 

  8. Othman, E., Shaalan, K., Rafea, A.: Towards resolving ambiguity in understanding Arabic sentence. In: International Conference on Arabic Language Resources and Tools (2006)

    Google Scholar 

  9. Mesfar, S.: Morphological grammars for standard Arabic tokenization. In: Proceedings of the International NooJ Conference, pp. 114–127. Cambridge Scholars Publishing, Newcastle (2010)

    Google Scholar 

  10. Mesfar, S.: Analyse morpho-syntaxique automatique et reconnaissance des entités nommées en arabe strandard. Ph.D. thesis in the University of Franche Comté, 235 (2008)

    Google Scholar 

  11. Silberztein, M.: Disambiguation tools for NooJ. In: Proceedings of the 2008 International NooJ Conference, pp. 158–171. Cambridge Scholars Publishing, Newcastle (2010)

    Google Scholar 

  12. Shaalan, K., Othman, E., Rafea, A.: Towards resolving ambiguity in understanding Arabic sentence. In: The Proceedings of the International Conference on Arabic Language Resources and Tools, NEMLAR, Cairo, Egypt, 22–23 September 2004, pp. 118–122 (2004)

    Google Scholar 

  13. Zalila, I, Haddar, K.: Construction of an HPSG grammar for the Arabic relative sentences. In: The Proceedings of RANLP, Hissar, Bulgaria (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nadia Ghezaiel .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Ghezaiel, N., Haddar, K. (2016). Study and Resolution of Arabic Lexical Ambiguity Through Transduction on Text Automaton. In: Okrut, T., Hetsevich, Y., Silberztein, M., Stanislavenka, H. (eds) Automatic Processing of Natural-Language Electronic Texts with NooJ. NooJ 2015. Communications in Computer and Information Science, vol 607. Springer, Cham. https://doi.org/10.1007/978-3-319-42471-2_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-42471-2_11

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-42470-5

  • Online ISBN: 978-3-319-42471-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics