Abstract
This paper proposes and experiments a new approach for morphological feature disambiguation of non-vocalized Arabic texts using a possibilistic classifier. The main idea is to learn contextual dependencies between features from vocalized texts and exploit this knowledge to disambiguate non-vocalized ones. We use possibility theory as a means to model imprecision in the training and testing steps, since the context is itself ambiguous. We also investigate the dependency between various features focusing on the Part-Of-Speech (POS).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Buckwalter, T.: Buckwalter Arabic Morphological Analyzer Version 2.0. Linguistic Data Consortium (LDC) catalogue number LDC2004L02 (2004) ISBN 1-58563-324-0
Haouari, B., Ben Amor, N., Elouedi, Z., Mellouli, K.: Naïve Possibilistic Network Classifiers. Fuzzy Sets and Systems 160, 3224–3238 (2009)
Harrag, F., Hamdi-Cherif, A., Al-Salman, A.M.S., Elyas El-Qawasmeh, E.: Experiments in Improvement of Arabic Information Retrieval. In: 3rd International Conference on Arabic Language Processing (CITALA), Rabat, Morocco, pp. 71–81 (2009)
Bounhas, I., Elayeb, B., Evrard, F., Slimani, Y.: Toward a Computer Study of The Reliability of Arabic Stories. Journal of the American Society for Information Science and Technology 61, 1686–1705 (2010)
Al-Echikh, A.A.: Encyclopedia of The Six Major Citation Collections, Dar-esselem, Ryadh, KSA (1998)
Roth, R., Rambow, O., Habash, N., Diab, M., Rudin, C.: Arabic Morphological Tagging, Diacritization, and Lemmatization Using Lexeme Models and Feature Ranking. In: Proceedings of Association for Computational Linguistics (ACL), Columbus, Ohio, USA, pp. 573–580 (2008)
Debili, F., Achour, H., Souissi, E.: La langue arabe et l’ordinateur: de l’étiquetage grammatical à la voyellation automatique. In: IRMC (2002) (in French)
Dubois, D., Prade, H.: Possibility Theory: An Approach to Computerized Processing of Uncertainty. Plenum Press, New York (1994)
Merialdo, B.: Tagging English Text with A Probabilistic Model. Computational Linguistics 20, 155–171 (1994)
Daoud, D.: Synchronized Morphological and Syntactic Disambiguation for Arabic. Advances in Computational Linguistics 41, 73–86 (2009)
Khoja, S.: APT: Arabic Part-of-speech Tagger. In: Proc. Student Workshop at the Second Meeting of the North American Association for Computational Linguistics, Carnegie Mellon University, Pennsylvania, USA (2001)
Diab, M., Hacioglu, K., Jurafsky, D.: Automatic Tagging of Arabic Text: From Raw Text to Base Phrase Chunks. In: Proc. Human Language Technology Conference / North American Chapter of the Association for Computational Linguistics Annual Meeting (HLT-NAACL 2004), Boston, USA, pp. 149–152 (2004)
Habash, N., Rambow, O.: Arabic Diacritization Through Full Morphological Tagging. In: Proc. Human Language Technology Conference / North American Chapter of the Association for Computational Linguistics Annual Meeting (HLT-NAACL 2007), Companion Volume, Rochester, NY, USA, pp. 53–56 (2007)
Habash, N., Rambow, O., Roth, R.: MADA+TOKAN: A Toolkit for Arabic Tokenization, Diacritization, Morphological Disambiguation, POS Tagging, Stemming and Lemmatization. In: Proceedings of the 2nd International Conference on Arabic Language Resources and Tools (MEDAR), Cairo, Egypt, pp. 102–109 (2009)
Hoceini, Y., Cheragui, M.A., Abbas, M.: Towards a New Approach for Disambiguation in NLP by Multiple Criterian Decision-Aid. The Prague Bulletin of Mathematical Linguistics 95, 19–32 (2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ayed, R., Bounhas, I., Elayeb, B., Evrard, F., Saoud, N.B.B. (2012). Arabic Morphological Analysis and Disambiguation Using a Possibilistic Classifier. In: Huang, DS., Ma, J., Jo, KH., Gromiha, M.M. (eds) Intelligent Computing Theories and Applications. ICIC 2012. Lecture Notes in Computer Science(), vol 7390. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31576-3_36
Download citation
DOI: https://doi.org/10.1007/978-3-642-31576-3_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31575-6
Online ISBN: 978-3-642-31576-3
eBook Packages: Computer ScienceComputer Science (R0)