Arabic Tag Sets: Review

  • Marwah Alian
  • Arafat Awajan
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 868)


Labeling each word with a suitable tag, chosen according to its context and grammatical category, is a major step in many natural language processing applications. There is a continuing effort to design such tag sets for the Arabic language. This paper reviews the existing Arabic tag sets and describes their features and limitations.


Keywords: tag, tag set, Arabic tag set



Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  1. Hashemite University, Zarqa, Jordan
  2. Princess Sumaya University for Technology, Amman, Jordan
