Abstract
Recently, text mining has become an interesting research field due to the huge amount of existing text on the web. Text mining is an essential field in the context of data mining for discovering interesting patterns in textual data. Examining and extracting of such information patterns from huge datasets is considered as a crucial process. A lot of survey studies were conducted for the purpose of using various text mining methods for unstructured datasets. It has been noticed that comprehensive survey studies in the Arabic context were neglected. This study aims to give a broad review of various studies related to the Arabic text mining with more focus on the Holy Quran, sentiment analysis, and web documents. Furthermore, the synthesis of the research problems and methodologies of the surveyed studies will help the text mining scholars in pursuing their future studies.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Hung, J.L., Zhang, K.: Examining mobile learning trends 2003–2008: a categorical meta-trend analysis using text mining techniques. J. Comput. High. Educ. 24(1), 1–17 (2012)
Zaza, S., Al-Emran, M.:. Mining and exploration of credit cards data in UAE. In: 2015 Fifth International Conference on e-Learning (econf), pp. 275–279. IEEE (2015, October)
Gök, A., Waterworth, A., Shapira, P.: Use of web mining in studying innovation. Scientometrics 102(1), 653–671 (2015)
Fan, W., Wallace, L., Rich, S., Zhang, Z.: Tapping into the power of text mining (2005)
Zhang, J.Q., Craciun, G., Shin, D.: When does electronic word-of-mouth matter? A study of consumer product reviews. J. Bus. Res. 63(12), 1336–1341 (2010)
Shaalan, K.: A survey of arabic named entity recognition and classification. Comput. Linguist. 40(2), 469–510 (2014)
Ray, S.K., Shaalan, K.: A review and future perspectives of arabic question answering systems. IEEE Trans. Knowl. Data Eng. 28(12), 3169–3190 (2016)
Oudah, M., Shaalan, K.: NERA 2.0: improving coverage and performance of rule-based named entity recognition for Arabic. Nat. Lang. Eng. 1–32 (2016)
Salloum, S.A., Al-Emran, M., Shaalan, K.: A survey of lexical functional grammar in the Arabic context. Int. J. Com. Net. Tech, 4(3)
Al Emran, M., Shaalan, K.: A survey of intelligent language tutoring systems. In: 2014 International Conference on Advances in Computing, Communications and Informatics (ICACCI), pp. 393–399. IEEE (2014)
Al-Emran, M., Zaza, S., Shaalan, K.: Parsing modern standard Arabic using Treebank resources. In: 2015 International Conference on Information and Communication Technology Research (ICTRC), pp. 80–83. IEEE (2015)
Al-Emran, M.: Hierarchical reinforcement learning: a survey. Int. J. Comput. Dig. Syst. 4(2), (2015)
Al-Emran, M., Malik, S.I.: The impact of google apps at work: higher educational perspective. Int. J. Interact. Mob. Technol. (iJIM) 10(4), 85–88 (2016)
Al-Emran, M., Shaalan, K.: Learners and educators attitudes towards mobile learning in higher education: state of the art. In: 2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI), pp. 907–913. IEEE (2015, August)
Chen, X., Vorvoreanu, M., Madhavan, K.: Mining social media data for understanding students’ learning experiences. IEEE Trans. Learn. Technol. 7(3), 246–259 (2014)
Al-Radaideh, Q.A., Twaiq, L.M.: Rough set theory for Arabic sentiment classification. In: 2014 International Conference on Future Internet of Things and Cloud (FiCloud), pp. 559–564. IEEE (2014, August)
Gupta, V., Lehal, G.S.: A survey of text mining techniques and applications. J. Emerg. Technol. Web Intell. 1(1), 60–76 (2009)
Navathe, S.B., Ramez, E.: Data warehousing and data mining. Fundam. Database Syst. 841–872 (2000)
Sukanya, M., Biruntha, S.: Techniques on text mining. In: 2012 IEEE International Conference on Advanced Communication Control and Computing Technologies (ICACCCT), pp. 269–271. IEEE (2012, August)
Witten, I.H.: Text mining. Practical handbook of internet computing, 14-1 (2005)
Salloum, S.A., Al-Emran, M., Monem, A.A., Shaalan, K.: A survey of text mining in social media: facebook and twitter perspectives. Adv. Sci. Technol. Eng. Syst. J. (2017)
Schoder, D., Gloor, P.A., Metaxas, P.T.: Spec. Issue Soc. Med. KI 27(1), 5–8 (2013)
Steinberger, R.: A survey of methods to ease the development of highly multilingual text mining applications. Lang. Resour. Eval. 46(2), 155–176 (2012)
Abdul-Baquee, S., Atwell, E.S.: Knowledge representation of the Quran through frame semantics: a corpus-based approach. In: Proceedings of the Fifth Corpus Linguistics Conference. University of Liverpool (2009)
Farghaly, A., Shaalan, K.: Arabic natural language processing: challenges and solutions. ACM Trans. Asian Lang. Inf. Process. (TALIP) 8(4), 14 (2009)
Alhawarat, M., Hegazi, M., Hilal, A.: Processing the text of the Holy Quran: a text mining study. Int. J. Adv. Comput. Sci. Appl. (IJACSA) 6(2), 262–267 (2015)
Muhammad, A.B.: Annotation of conceptual co-reference and text mining the Qur’an. University of Leeds (2012)
Sharaf, A.M.: The Qur’an annotation for text mining. First year transfer report. School of Computing, Leeds University, December 2009
Aldayel, H.K., Azmi, A.M.: Arabic tweets sentiment analysis–a hybrid scheme. J. Inf. Sci. 42(6), 782–797 (2016)
Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends® Inf. Retr. 2(1–2), 1–135 (2008)
Liu, B.: Sentiment analysis and opinion mining. Synth. Lect. Hum. Lang. Technol. 5(1), 1–167 (2012)
Cherif, W., Madani, A., Kissi, M.: A new modeling approach for Arabic opinion mining recognition. In: Intelligent Systems and Computer Vision (ISCV), pp. 1–6. IEEE (2015, March)
Mahyoub, F.H., Siddiqui, M.A., Dahab, M.Y.: Building an Arabic sentiment lexicon using semi-supervised learning. J. King Saud Univ. Comput. Inf. Sci. 26(4), 417–424 (2014)
Duwairi, R.M., Qarqaz, I.: Arabic sentiment analysis using supervised classification. In: 2014 International Conference on Future Internet of Things and Cloud (FiCloud), pp. 579–583. IEEE (2014, August)
Soliman, T.H., Elmasry, M.A., Hedar, A., Doss, M.M.: Sentiment analysis of Arabic slang comments on facebook. Int. J. Comput. Technol. 12(5), 3470–3478 (2014)
Al-Kabi, M., Gigieh, A., Alsmadi, I., Wahsheh, H., Haidar, M.: An opinion analysis tool for colloquial and standard Arabic. In: The Fourth International Conference on Information and Communication Systems (ICICS 2013), pp. 23–25 (2013)
Rushdi-Saleh, M., Martín-Valdivia, M.T., Ureña-López, L.A., Perea-Ortega, J.M.: OCA: opinion corpus for Arabic. J. Am. Soc. Inform. Sci. Technol. 62(10), 2045–2054 (2011)
Hedar, A.R., Doss, M.: Mining social networks Arabic slang comments. In: IEEE Symposium on Computational Intelligence and Data Mining (CIDM) (2013)
Atlam, E.S., Morita, K., Fuketa, M., Aoe, J.I.: A new approach for Arabic text classification using Arabic field-association terms. J. Am. Soc. Inform. Sci. Technol. 62(11), 2266–2276 (2011)
Selamat, A., Ng, C.C.: Arabic script web page language identifications using decision tree neural networks. Pattern Recogn. 44(1), 133–144 (2011)
Brahmi, A., Ech-Cherif, A., Benyettou, A.: Arabic texts analysis for topic modeling evaluation. Inf. Retr. 15(1), 33–53 (2012)
Alghamdi, H.M., Selamat, A., Karim, N.S.A.: Arabic web pages clustering and annotation using semantic class features. J. King Saud Univ. Comput. Inf. Sci. 26(4), 388–397 (2014)
Khorsheed, M.S., Al-Thubaity, A.O.: Comparative evaluation of text classification techniques using a large diverse Arabic dataset. Lang. Resour. Eval. 47(2), 513–538 (2013)
Al-Anzi, F.S., AbuZeina, D.: Toward an enhanced Arabic text classification using cosine similarity and latent semantic indexing. J. King Saud Univ. Comput. Inf. Sci.
Eldos, T.M.: Arabic text data mining: a root-based hierarchical indexing model. Int. J. Model. Simul. 23(3), 158–166 (2003)
Nehar, A., Benmessaoud, A., Cherroun, H., Ziadi, D.: Subsequence kernels-based Arabic text classification. In: 2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA), pp. 206–213. IEEE (2014, November)
Zayene, O., Seuret, M., Touj, S.M., Hennebert, J., Ingold, R., Amara, N.E.B.: Text detection in Arabic news video based on swt operator and convolutional auto-encoders. In: 12th IAPR Workshop on Document Analysis Systems (DAS), IEEE, pp. 13–18, 2016
Wahsheh, H., Alsmadi, I., Al-Kabi, M.: Analyzing the popular words to evaluate spam in Arabic web pages. IJJ Res. Bull. JORDAN ACM–ISWSA, 2(2), 22–26
Harrag, F.: Text mining approach for knowledge extraction in Sahîh Al-Bukhari. Comput. Hum. Behav. 30, 558–566 (2014)
A-Brahimi, B., Touahria, M., Tari, A.: Data and text mining techniques for classifying Arabic tweet polarity. J. Dig. Inf. Manage. 14(1), 15
Zubi, Z.S.: Using some web content mining techniques for Arabic text classification. Recent Advances on Data Networks, Communications, Computers (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this chapter
Cite this chapter
Salloum, S.A., AlHamad, A.Q., Al-Emran, M., Shaalan, K. (2018). A Survey of Arabic Text Mining. In: Shaalan, K., Hassanien, A., Tolba, F. (eds) Intelligent Natural Language Processing: Trends and Applications. Studies in Computational Intelligence, vol 740. Springer, Cham. https://doi.org/10.1007/978-3-319-67056-0_20
Download citation
DOI: https://doi.org/10.1007/978-3-319-67056-0_20
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-67055-3
Online ISBN: 978-3-319-67056-0
eBook Packages: EngineeringEngineering (R0)