Jabalín: A Comprehensive Computational Model of Modern Standard Arabic Verbal Morphology Based on Traditional Arabic Prosody

  • Alicia González Martínez
  • Susana López Hervás
  • Doaa Samy
  • Carlos G. Arques
  • Antonio Moreno Sandoval
Part of the Communications in Computer and Information Science book series (CCIS, volume 380)

Abstract

The computational handling of Modern Standard Arabic is a challenge in the field of natural language processing due to its highly rich morphology. However, several authors have pointed out that the Arabic morphological system is in fact extremely regular. The existing Arabic morphological analyzers have exploited this regularity to variable extent, yet we believe there is still some scope for improvement. Taking inspiration in traditional Arabic prosody, we have designed and implemented a compact and simple morphological system which in our opinion takes further advantage of the regularities encountered in the Arabic morphological system. The output of the system is a large-scale lexicon of inflected forms that has subsequently been used to create an Online Interface for a morphological analyzer of Arabic verbs. The Jabalín Online Interface is available at http://elvira.lllf.uam.es/jabalin/, hosted at the LLI-UAM lab. The generation system is also available under a GNU GPL 3 license.

Keywords

Computational morphology Arabic Arabic morphological system Modern Standard Arabic traditional Arabic prosody 

References

  1. 1.
    Habash, N.Y.: Introduction to Arabic Natural Language Processing. Morgan & Claypool, San Rafael (2010)Google Scholar
  2. 2.
    Kaye, A.S.: Formal vs. Informal Arabic: Diglossia, Triglossia, Tetraglossia, etc., Polyglossia viewed as a continuum. In: Comrie, B. (ed.) The World’s Major Languages, pp. 664–685. Oxford University Press, Oxford (1990)Google Scholar
  3. 3.
    Ferrando Frutos, I.: El plural fracto en semítico: nuevas perspectivas. Estudios de Dialectología Norteafricana y Andalusí 4, 7–24 (1999)Google Scholar
  4. 4.
    Holes, C.: Modern Arabic: Structures, Functions, and Varieties. Georgetown University Press, Washington, D.C (2004)Google Scholar
  5. 5.
    Danks, W.: The Arabic Verb: Form and Meaning in the Vowel-Lengthening Patterns. John Benjamins, Amsterdam (2011)Google Scholar
  6. 6.
    Lieber, R.: Introducing Morphology. Cambridge University Press, Cambridge (2009)CrossRefGoogle Scholar
  7. 7.
    Robin, C.: L’Arabie antique de Karib’il à Mahomet: nouvelles données sur l’histoire des Arabes grâce aux inscriptions. Édisud, Aix-en-Provence (1992)Google Scholar
  8. 8.
    Beesley, K.R.: Arabic Finite-State Morphological Analysis and Generation. In: Proceedings of COLING 1996 (1996)Google Scholar
  9. 9.
    Ratcliffe, R.R.: The “Broken” Plural Problem in Arabic and Comparative Semitic: Allomorphy and Analogy in Non-Concatenative Morphology. John Benjamins, Amsterdam (1998)Google Scholar
  10. 10.
    Cowan, D.: An Introduction to Modern Literary Arabic. Cambridge University Press, Cambridge (1958)Google Scholar
  11. 11.
    Versteegh, K.: The Arabic language. Edinburgh University Press, Edinburgh (2001)Google Scholar
  12. 12.
    Shimron, J.: Language Processing and Acquisition in Languages of Semitic, Root-Based, Morphology. John Benjamins, Amsterdam (2003)Google Scholar
  13. 13.
    Pierrehumbert, J.: Dissimilarity in the Arabic Verbal Roots. In: Proceedings of the 23rd Meeting of the Northeastern Linguistic Society, pp. 367–381. Graduate Student Association, U. Mass. Amherst (1993), http://faculty.wcas.northwestern.edu/~jbp/publications/arabic_roots.pdf
  14. 14.
    Attia, M., Pecina, P., Toral, A., Tounsi, L., van Genabith, J.: An Open-Source Finite State Morphological Transducer for Modern Standard Arabic. In: Proceedings of the International Workshop on Finite State Methods and Natural Language Processing (FSNLP), pp. 125–136 (2011)Google Scholar
  15. 15.
    Beesley, K.R.: Finite-state Morphological Analysis and Generation of Arabic at Xerox Research: Status and plans in 2001. In: ACL Workshop on Arabic Language Processing: Status and Perspective, pp. 1–8 (2001)Google Scholar
  16. 16.
    Beesley, K.R.: Arabic Morphology Using only Finite-State Operations. In: Proceedings of the Workshop on Computational Approaches to Semitic Languages, pp. 50–57 (1998)Google Scholar
  17. 17.
    McCarthy, J.J.: A Prosodic Theory of Nonconcatenative Morphology. Linguistic Inquiry 12, 373–418 (1981)Google Scholar
  18. 18.
    Abu-Chacra, F.: Arabic: An Essential Grammar. Taylor & Francis, New York (2007)Google Scholar
  19. 19.
    Kiraz, G.A.: Computational Analyses of Arabic Morphology (1994)Google Scholar
  20. 20.
    Soudi, A., Eisele, A.: Generating an Arabic Full-Form Lexicon for Bidirectional Morphology Lookup. In: Proceedings of LREC 2004 (2004)Google Scholar
  21. 21.
    Kiraz, G.A.: Computational Nonlinear Morphology: With Emphasis on Semitic Languages. Cambridge University Press, Cambridge (2001)CrossRefGoogle Scholar
  22. 22.
    Kiraz, G.A.: Computing Prosodic Morphology. In: Proceedings of the 16th Conference on Computational Linguistics, vol. 2, pp. 664–669 (1996)Google Scholar
  23. 23.
    Wright, W., Smith, W.R., de Goeje, M.J.: A Grammar of the Arabic Language. Cambridge University Press, Cambridge (1896)Google Scholar
  24. 24.
    Ryding, K.C.: A Reference Grammar of Modern Standard Arabic. Cambridge University Press, Cambridge (2005)CrossRefGoogle Scholar
  25. 25.
    Khashan, K.M.: Al-Khalil Ibn Ahmad and Numerical Prosody I. Journal of Arabic Linguistic Tradition 1, 25–34 (2003)Google Scholar
  26. 26.
    Habash, N.: Large-Scale Lexeme-Based Arabic Morphological Generation. In: Proceedings of Traitement Automatique du Langage Naturel, TALN 2004 (2004)Google Scholar
  27. 27.
    Attia, M., Pecina, P., Toral, A., Tounsi, L., van Genabith, J.: A Lexical Database for Modern Standard Arabic Interoperable with a Finite State Morphological Transducer. In: Mahlow, C., Piotrowski, M. (eds.) SFCM 2011. CCIS, vol. 100, pp. 98–118. Springer, Heidelberg (2011)CrossRefGoogle Scholar
  28. 28.
    Sawalha, M., Atwell, E.S.: Comparative Evaluation of Arabic Language Morphological Analysers and Stemmers. In: Proceedings of COLING 2008 (Poster Volume), pp. 107–110 (2008), http://eprints.whiterose.ac.uk/42635/
  29. 29.
    Al Shamsi, F., Guessoum, A.: A Hidden Markov Model-Based POS Tagger for Arabic. In: Proceedings of the 8th International Conference on the Statistical Analysis of Textual Data, pp. 31–42 (2006)Google Scholar
  30. 30.
    El-Dahdah, A.: A Dictionary of Arabic Verb Conjugation. Librairie du Liban, Beirut (1991)Google Scholar
  31. 31.
    Owens, J.: A Linguistic History of Arabic. Oxford University Press, Oxford (2006)CrossRefGoogle Scholar
  32. 32.
    Smrž, O.: Functional Arabic Morphology. Formal System and Implementation (2007)Google Scholar
  33. 33.
    Rodrigues, P., Cavar, D.: Learning Arabic Morphology Using Statistical Constraint-Satisfaction Models. In: Benmamoun, E. (ed.) Papers from the 19th Annual Symposium on Arabic Linguistics, Urbana, Illinois, pp. 63–76. John Benjamins, Amsterdam (2007)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Alicia González Martínez
    • 1
  • Susana López Hervás
    • 1
  • Doaa Samy
    • 2
  • Carlos G. Arques
    • 3
  • Antonio Moreno Sandoval
    • 1
  1. 1.LLI-UAMUniversidad Autónoma de MadridSpain
  2. 2.Spanish Language DepartmentCairo UniversityEgypt
  3. 3.Dept. of Development & DifferentiationSpanish National Center for Molecular Biology (CBM-SO)Spain

Personalised recommendations