Skip to main content

A Survey of Arabic Text Mining

  • Chapter
  • First Online:

Part of the book series: Studies in Computational Intelligence ((SCI,volume 740))

Abstract

Recently, text mining has become an interesting research field due to the huge amount of existing text on the web. Text mining is an essential field in the context of data mining for discovering interesting patterns in textual data. Examining and extracting of such information patterns from huge datasets is considered as a crucial process. A lot of survey studies were conducted for the purpose of using various text mining methods for unstructured datasets. It has been noticed that comprehensive survey studies in the Arabic context were neglected. This study aims to give a broad review of various studies related to the Arabic text mining with more focus on the Holy Quran, sentiment analysis, and web documents. Furthermore, the synthesis of the research problems and methodologies of the surveyed studies will help the text mining scholars in pursuing their future studies.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   229.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   299.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   299.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Hung, J.L., Zhang, K.: Examining mobile learning trends 2003–2008: a categorical meta-trend analysis using text mining techniques. J. Comput. High. Educ. 24(1), 1–17 (2012)

    Article  Google Scholar 

  2. Zaza, S., Al-Emran, M.:. Mining and exploration of credit cards data in UAE. In: 2015 Fifth International Conference on e-Learning (econf),  pp. 275–279. IEEE (2015, October)

    Google Scholar 

  3. Gök, A., Waterworth, A., Shapira, P.: Use of web mining in studying innovation. Scientometrics 102(1), 653–671 (2015)

    Article  Google Scholar 

  4. Fan, W., Wallace, L., Rich, S., Zhang, Z.: Tapping into the power of text mining (2005)

    Google Scholar 

  5. Zhang, J.Q., Craciun, G., Shin, D.: When does electronic word-of-mouth matter? A study of consumer product reviews. J. Bus. Res. 63(12), 1336–1341 (2010)

    Article  Google Scholar 

  6. Shaalan, K.: A survey of arabic named entity recognition and classification. Comput. Linguist. 40(2), 469–510 (2014)

    Article  Google Scholar 

  7. Ray, S.K., Shaalan, K.: A review and future perspectives of arabic question answering systems. IEEE Trans. Knowl. Data Eng. 28(12), 3169–3190 (2016)

    Article  Google Scholar 

  8. Oudah, M., Shaalan, K.: NERA 2.0: improving coverage and performance of rule-based named entity recognition for Arabic. Nat. Lang. Eng. 1–32 (2016)

    Google Scholar 

  9. Salloum, S.A., Al-Emran, M., Shaalan, K.: A survey of lexical functional grammar in the Arabic context. Int. J. Com. Net. Tech, 4(3)

    Google Scholar 

  10. Al Emran, M., Shaalan, K.: A survey of intelligent language tutoring systems. In: 2014 International Conference on Advances in Computing, Communications and Informatics (ICACCI), pp. 393–399. IEEE (2014)

    Google Scholar 

  11. Al-Emran, M., Zaza, S., Shaalan, K.: Parsing modern standard Arabic using Treebank resources. In: 2015 International Conference on Information and Communication Technology Research (ICTRC), pp. 80–83. IEEE (2015)

    Google Scholar 

  12. Al-Emran, M.: Hierarchical reinforcement learning: a survey. Int. J. Comput. Dig. Syst. 4(2), (2015)

    Google Scholar 

  13. Al-Emran, M., Malik, S.I.: The impact of google apps at work: higher educational perspective. Int. J. Interact. Mob. Technol. (iJIM) 10(4), 85–88 (2016)

    Article  Google Scholar 

  14. Al-Emran, M., Shaalan, K.: Learners and educators attitudes towards mobile learning in higher education: state of the art. In: 2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI), pp. 907–913. IEEE (2015, August)

    Google Scholar 

  15. Chen, X., Vorvoreanu, M., Madhavan, K.: Mining social media data for understanding students’ learning experiences. IEEE Trans. Learn. Technol. 7(3), 246–259 (2014)

    Article  Google Scholar 

  16. Al-Radaideh, Q.A., Twaiq, L.M.: Rough set theory for Arabic sentiment classification. In: 2014 International Conference on Future Internet of Things and Cloud (FiCloud), pp. 559–564. IEEE (2014, August)

    Google Scholar 

  17. Gupta, V., Lehal, G.S.: A survey of text mining techniques and applications. J. Emerg. Technol. Web Intell. 1(1), 60–76 (2009)

    Google Scholar 

  18. Navathe, S.B., Ramez, E.: Data warehousing and data mining. Fundam. Database Syst. 841–872 (2000)

    Google Scholar 

  19. Sukanya, M., Biruntha, S.: Techniques on text mining. In: 2012 IEEE International Conference on Advanced Communication Control and Computing Technologies (ICACCCT), pp. 269–271. IEEE (2012, August)

    Google Scholar 

  20. Witten, I.H.: Text mining. Practical handbook of internet computing, 14-1 (2005)

    Google Scholar 

  21. Salloum, S.A., Al-Emran, M., Monem, A.A., Shaalan, K.: A survey of text mining in social media: facebook and twitter perspectives. Adv. Sci. Technol. Eng. Syst. J. (2017)

    Google Scholar 

  22. Schoder, D., Gloor, P.A., Metaxas, P.T.: Spec. Issue Soc. Med. KI 27(1), 5–8 (2013)

    Google Scholar 

  23. Steinberger, R.: A survey of methods to ease the development of highly multilingual text mining applications. Lang. Resour. Eval. 46(2), 155–176 (2012)

    Article  Google Scholar 

  24. Abdul-Baquee, S., Atwell, E.S.: Knowledge representation of the Quran through frame semantics: a corpus-based approach. In: Proceedings of the Fifth Corpus Linguistics Conference. University of Liverpool (2009)

    Google Scholar 

  25. Farghaly, A., Shaalan, K.: Arabic natural language processing: challenges and solutions. ACM Trans. Asian Lang. Inf. Process. (TALIP) 8(4), 14 (2009)

    Google Scholar 

  26. Alhawarat, M., Hegazi, M., Hilal, A.: Processing the text of the Holy Quran: a text mining study. Int. J. Adv. Comput. Sci. Appl. (IJACSA) 6(2), 262–267 (2015)

    Google Scholar 

  27. Muhammad, A.B.: Annotation of conceptual co-reference and text mining the Qur’an. University of Leeds (2012)

    Google Scholar 

  28. Sharaf, A.M.: The Qur’an annotation for text mining. First year transfer report. School of Computing, Leeds University, December 2009

    Google Scholar 

  29. Aldayel, H.K., Azmi, A.M.: Arabic tweets sentiment analysis–a hybrid scheme. J. Inf. Sci. 42(6), 782–797 (2016)

    Article  Google Scholar 

  30. Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends® Inf. Retr. 2(1–2), 1–135 (2008)

    Google Scholar 

  31. Liu, B.: Sentiment analysis and opinion mining. Synth. Lect. Hum. Lang. Technol. 5(1), 1–167 (2012)

    Article  Google Scholar 

  32. Cherif, W., Madani, A., Kissi, M.: A new modeling approach for Arabic opinion mining recognition. In: Intelligent Systems and Computer Vision (ISCV), pp. 1–6. IEEE (2015, March)

    Google Scholar 

  33. Mahyoub, F.H., Siddiqui, M.A., Dahab, M.Y.: Building an Arabic sentiment lexicon using semi-supervised learning. J. King Saud Univ. Comput. Inf. Sci. 26(4), 417–424 (2014)

    Google Scholar 

  34. Duwairi, R.M., Qarqaz, I.: Arabic sentiment analysis using supervised classification. In: 2014 International Conference on Future Internet of Things and Cloud (FiCloud), pp. 579–583. IEEE (2014, August)

    Google Scholar 

  35. Soliman, T.H., Elmasry, M.A., Hedar, A., Doss, M.M.: Sentiment analysis of Arabic slang comments on facebook. Int. J. Comput. Technol. 12(5), 3470–3478 (2014)

    Google Scholar 

  36. Al-Kabi, M., Gigieh, A., Alsmadi, I., Wahsheh, H., Haidar, M.: An opinion analysis tool for colloquial and standard Arabic. In: The Fourth International Conference on Information and Communication Systems (ICICS 2013), pp. 23–25 (2013)

    Google Scholar 

  37. Rushdi-Saleh, M., Martín-Valdivia, M.T., Ureña-López, L.A., Perea-Ortega, J.M.: OCA: opinion corpus for Arabic. J. Am. Soc. Inform. Sci. Technol. 62(10), 2045–2054 (2011)

    Article  Google Scholar 

  38. Hedar, A.R., Doss, M.: Mining social networks Arabic slang comments. In: IEEE Symposium on Computational Intelligence and Data Mining (CIDM) (2013)

    Google Scholar 

  39. Atlam, E.S., Morita, K., Fuketa, M., Aoe, J.I.: A new approach for Arabic text classification using Arabic field-association terms. J. Am. Soc. Inform. Sci. Technol. 62(11), 2266–2276 (2011)

    Article  Google Scholar 

  40. Selamat, A., Ng, C.C.: Arabic script web page language identifications using decision tree neural networks. Pattern Recogn. 44(1), 133–144 (2011)

    Article  MATH  Google Scholar 

  41. Brahmi, A., Ech-Cherif, A., Benyettou, A.: Arabic texts analysis for topic modeling evaluation. Inf. Retr. 15(1), 33–53 (2012)

    Article  Google Scholar 

  42. Alghamdi, H.M., Selamat, A., Karim, N.S.A.: Arabic web pages clustering and annotation using semantic class features. J. King Saud Univ. Comput. Inf. Sci. 26(4), 388–397 (2014)

    Google Scholar 

  43. Khorsheed, M.S., Al-Thubaity, A.O.: Comparative evaluation of text classification techniques using a large diverse Arabic dataset. Lang. Resour. Eval. 47(2), 513–538 (2013)

    Article  Google Scholar 

  44. Al-Anzi, F.S., AbuZeina, D.: Toward an enhanced Arabic text classification using cosine similarity and latent semantic indexing. J. King Saud Univ. Comput. Inf. Sci.

    Google Scholar 

  45. Eldos, T.M.: Arabic text data mining: a root-based hierarchical indexing model. Int. J. Model. Simul. 23(3), 158–166 (2003)

    Google Scholar 

  46. Nehar, A., Benmessaoud, A., Cherroun, H., Ziadi, D.: Subsequence kernels-based Arabic text classification. In: 2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA), pp. 206–213. IEEE (2014, November)

    Google Scholar 

  47. Zayene, O., Seuret, M., Touj, S.M., Hennebert, J., Ingold, R., Amara, N.E.B.: Text detection in Arabic news video based on swt operator and convolutional auto-encoders. In: 12th IAPR Workshop on Document Analysis Systems (DAS), IEEE, pp. 13–18, 2016

    Google Scholar 

  48. Wahsheh, H., Alsmadi, I., Al-Kabi, M.: Analyzing the popular words to evaluate spam in Arabic web pages. IJJ Res. Bull. JORDAN ACM–ISWSA, 2(2), 22–26

    Google Scholar 

  49. Harrag, F.: Text mining approach for knowledge extraction in Sahîh Al-Bukhari. Comput. Hum. Behav. 30, 558–566 (2014)

    Article  Google Scholar 

  50. A-Brahimi, B., Touahria, M., Tari, A.: Data and text mining techniques for classifying Arabic tweet polarity. J. Dig. Inf. Manage. 14(1), 15

    Google Scholar 

  51. Zubi, Z.S.: Using some web content mining techniques for Arabic text classification. Recent Advances on Data Networks, Communications, Computers (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Said A. Salloum .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Salloum, S.A., AlHamad, A.Q., Al-Emran, M., Shaalan, K. (2018). A Survey of Arabic Text Mining. In: Shaalan, K., Hassanien, A., Tolba, F. (eds) Intelligent Natural Language Processing: Trends and Applications. Studies in Computational Intelligence, vol 740. Springer, Cham. https://doi.org/10.1007/978-3-319-67056-0_20

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-67056-0_20

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-67055-3

  • Online ISBN: 978-3-319-67056-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics