Abstract
Domain terminology extraction is an important step in many applications such as ontology building and information retrieval. Analyzing a corpus to automatically extract key terms is a difficult task, especially in the case of Arabic language. The complexity of spelling, morphology and semantics of Arabic makes natural language processing tasks quite difficult. In addition to the complexity of Arabic, the challenges related to domain terminology extraction are caused by the inherent difficulty in determining whether a word or a phrase represents or not a given text. All these problems have not restricted the multitude of Arabic terminology extraction approaches in the ontology building process. Therefore, this article presents a literature review in the field of Arabic terminology extraction focusing on the specificities of this language.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bounhas, I., Elayeb, B., Evrard, F., Slimani, Y.: ArabOnto: Experimenting a new distributional approach for building arabic ontological resources. International Journal of Metadata, Semantics and Ontologies (IJMSO) 6(2), 91–95 (2011)
Bounhas, I., Elayeb, B., Evrard, F., Slimani, Y.: Organizing contextual knowledge for arabic text disambiguation and terminology extraction. Knowledge Organization 38(6), 473–490 (2011)
Bougouin, A.: État de l’art des méthodes d’extraction automatique de termes-clés. Actes de la conférence TALN-RECITAL, 17-21 Juin, Sables d’Olonne, France (2013)
Rey A.: La terminologie: noms et notions. France : Presses. Université de France (No. 1780) (1979)
Jacquemin, C.: Variation terminologique: Reconnaissance et acquisition automatiques de termes et de leurs variantes en corpus. Thèse d’habilitation, Université de Nantes, France (1997)
Roberts, A., Al-Sulaiti, L., Atwell, E.: aConCorde: Towards an open-source, extendable concordancer for Arabic. Corpora 1(1), 39–60 (2006)
Diab, M.T., Kadri, H., Jurafsky, D.: Automatic tagging of arabic text: From raw text to base phrase chunks. In: Proceedings of The 5th Meeting of the North American Chapter of the Association for Computational Linguistics/Human Language Technologies Conference (HLT-NAACL04), Boston, USA, May 2-7, pp. 149–152 (2004)
AlGahtani, S., Black, W., Mc-Naught, J.: Arabic part-of-speech-tagging using transformation-based learning. In: Proceedings of the 2nd International Conference on Arabic Language Resources and Tools, April 22-23, pp. 66–70. The MEDAR Consortium, Cairo (2009)
El-beltagy, S.R.: A Framework for the Rapid Development of Dictionary Based Domain Specific Arabic Stemmers, Rapport Technique (TR/COE_WM/12/12/2007), Center of Excellence for Data Mining and Computer modeling, Egypt (2007)
Hajic, J., Smrz, O., Buckwalter, T., Jin, H.: Feature-based tagger of approximations of functional arabic morphology. In: The Fourth Workshop on Treebanks and Linguistic Theories, December 9-10, pp. 53–64. University of Barcelona, Spain (2005)
Roth, R., Rambow, O., Habash, N., Diab, M., Rudin, C.: Arabic Morphological Tagging, Diacritization, and Lemmatization Using Lexeme Models and Feature Ranking. In: Proceedings of the Association for Computational Linguistics conference (ACL), Columbus, Ohio, USA, pp. 117–120 (2008)
Ayed, R., Bounhas, I., Elayeb, B., Evrard, F.: Benllamine Ben Saoud N. Arabic Morphological Analysis and Disambiguation Using a Possibilistic Classifier. In: Intelligent Computing Theories and Applications, Proceedings of the 8th International Conference on Intelligent Computing (ICIC), China, pp. 274–279 (2012)
El-beltagy, S.R., Rafea, A.: KP-Miner: A keyphrase extraction system for English and Arabic documents. Information Systems 34(1), 132–144 (2001)
Zaidi, S., Laskri, M.T., Bechkoun, K.: A Cross-language Information Retrieval Based on an Arabic Ontology in the Legal Domain. In: Signal Image Technology and Internet based-systems, Yaoundé Cameroun (2005)
Yousif, A., Khurshid, A.: LoLo: A System for Extracting Statistical Information and Lexical Resources from Arabic Corpora. In: Proceedings of the Workshop on Arabic Natural Language Processing, Information and Communication Technologies International Symposium, Fes, Morrocco, July 2007, pp. 394–398. IEEE (2007)
Boulaknadel, S., Daille, B., Aboutajdine, D.: A multi-word term extraction program for arabic language. In: Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC), Marrakech, Morocco, May 17-23, pp. 1485–1488 (2008)
El Mahdaouy, A., El Alaoui Ouatik, S., Gaussier, E.: A Study of Association Measures and their Combination for Arabic MWT Extraction. In: 10th International Conference on Terminology and Artificial Intelligence, Paris, France (2013)
Mazari, A.C., Aliane, H., Alimazighi, Z.: Automatic Construction of Ontology from Arabic Texts. In: Proceedings of International Conference on Web and Information Technologies ICWIT, pp. 193–202 (2012)
Belkredim, F.Z., El Sebai, A.: An Ontology Based Formalism for the Arabic Language Using Verbs and their Derivatives. Communications of the IBIMA 11(5), 44–52 (2009)
Attia, M., Toral, A., Tounis, L., Pecina, P., Van Genabith, J.: tomatic Extraction of Arabic Multiword Expressions. In: Proceedings of the 2010 Workshop on Multiword Expressions: from Theory to Applications, Beijing, China, pp. 19–27 (2010)
El-shishtawy, T., Al-sammak, A.: Arabic Keyphrase Extraction using Linguistic knowledge and Machine Learning Techniques. In: Proceedings of the Second International Conference on Arabic Language Resources and Tools, the MEDAR Consortium, Cairo, Egypt (2012)
Mashaan Abed, A., Tiun, S., Albared, M.: Arabic term extraction using combined approach on Islamic document. Journal of Theoretical and Applied Information Technology 58(3), 601–608 (2013)
Zaidi, S., Adbelali, A., Laskri, M.T., Al Shenify, M.: Extracting Simple and Compound Terms from Arabic Texts: An Application on The Quranic Text. Communications of the Arab Computer Society 4(1) (2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bounhas, I., Lahbib, W., Elayeb, B. (2014). Arabic Domain Terminology Extraction: A Literature Review. In: Meersman, R., et al. On the Move to Meaningful Internet Systems: OTM 2014 Conferences. OTM 2014. Lecture Notes in Computer Science, vol 8841. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-45563-0_51
Download citation
DOI: https://doi.org/10.1007/978-3-662-45563-0_51
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-45562-3
Online ISBN: 978-3-662-45563-0
eBook Packages: Computer ScienceComputer Science (R0)