Arabic Domain Terminology Extraction: A Literature Review

(Short Paper)

  • Conference paper
On the Move to Meaningful Internet Systems: OTM 2014 Conferences (OTM 2014)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8841))

Abstract

Domain terminology extraction is an important step in many applications, such as ontology building and information retrieval. Analyzing a corpus to extract key terms automatically is a difficult task, especially for the Arabic language, whose complex spelling, morphology and semantics make natural language processing tasks quite difficult. Beyond the complexity of Arabic, domain terminology extraction is challenging because of the inherent difficulty of determining whether a word or a phrase is representative of a given text. Despite these problems, a multitude of Arabic terminology extraction approaches have been proposed as part of the ontology building process. This article therefore presents a literature review of the field of Arabic terminology extraction, focusing on the specificities of this language.

Copyright information

© 2014 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bounhas, I., Lahbib, W., Elayeb, B. (2014). Arabic Domain Terminology Extraction: A Literature Review. In: Meersman, R., et al. On the Move to Meaningful Internet Systems: OTM 2014 Conferences. OTM 2014. Lecture Notes in Computer Science, vol 8841. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-45563-0_51

  • DOI: https://doi.org/10.1007/978-3-662-45563-0_51

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-662-45562-3

  • Online ISBN: 978-3-662-45563-0

  • eBook Packages: Computer Science, Computer Science (R0)
