Abstract
Twitter is one of the most famous applications of social networks that allow users to communicate with each other and share their opinions and feelings in all types of topics: economics, business, science, social, religion, and politics in a very short message of information called Tweets. Users are usually written using colloquial Arabic and include a lot of slang. In this Paper, we studied sentiment analysis of Arabic text retrieved from a twitter focus on presidential elections in Egypt 2012. We are using Naïve Bayes (NB) which is a machine learning algorithm, one time by using N-Gram (unigram and bigram) and another time by using feature selection. The main objective of this paper is to measure the accuracy of each method and determine which method is more accurate for Arabic text classification. The results show that unigram and information gain attribute selection achieves the highest accuracy and the lowest error rate.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Diab, M., Habash, N., Rambow, O., Altantawy, M., Benajiba, Y.: COLABA: Arabic dialect annotation and processing. In: Lrec Workshop on Semitic Language Processing, pp. 66–74 (2010)
Farid, D.: Egypt has the largest number of Facebook users in the Arab world: report. Daily News Egypt (2013)
Tumasjan, A., Sprenger, T.O., Sandner, P.G., Welpe, I.M.: Predicting elections with twitter: What 140 characters reveal about political sentiment. ICWSM 10(1), 178–185 (2010)
ElSahar, H., El-Beltagy, S.R.: A fully automated approach for arabic slang lexicon extraction from microblogs. In: International Conference on Intelligent Text Processing and Computational Linguistics, pp. 79–91. Springer (2014)
Liu, B.: Sentiment analysis and opinion mining. Synth. Lect. Hum. Lang. Technol. 5(1), 1–167 (2012)
Elhawary, M., Elfeky, M.: Mining Arabic business reviews. In: IEEE International Conference on Data Mining Workshops (ICDMW), 2010, pp 1108–1113. IEEE (2010)
Parikh, R., Movassate, M.: Sentiment analysis of user-generated twitter updates using various classification techniques. CS224 N Final Report: 1–18 (2009)
Rushdi-Saleh, M., Martín-Valdivia, M.T., Ureña-López, L.A., Perea-Ortega, J.M.: OCA: Opinion corpus for Arabic. J. Am. Soc. Inform. Sci. Technol. 62(10), 2045–2054 (2011)
El Kourdi, M., Bensaid, A., Rachidi, T-e.: Automatic Arabic document categorization based on the Naïve Bayes algorithm. In: Proceedings of the Workshop on Computational Approaches to Arabic Script-based Languages, 2004. Association for Computational Linguistics, pp 51–58
Larkey, L.S., Ballesteros, L., Connell, M.E.: Improving stemming for Arabic information retrieval: light stemming and co-occurrence analysis. In: Proceedings of the 25th Annual International ACM SIGIR Conference On Research And Development in Information Retrieval, pp. 275–282. ACM (2002)
Ibrahim, A., Elghazaly, T.: Arabic text summarization using Rhetorical Structure Theory. In: 2012 8th International Conference on Informatics and Systems (INFOS)
Elghazaly, T., Mahmoud, A., Hefny, H.A.: Political sentiment analysis using twitter data. In: Proceedings of the International Conference on Internet of things and Cloud Computing, p. 11. ACM (2016)
Witten, I.H., Frank, E., Hall, M.A., Pal, C.J.: Data Mining: Practical Machine Learning Tools And Techniques. Morgan Kaufmann (2016)
Ibrahim, A., Elghazaly, T., Gheith, M.: A novel Arabic text summarization model based on rhetorical structure theory and vector space model. Int. J. Comput. Linguist. Nat. Lang. Process. 2(8), 480–485 (2013)
Savoy, J., Rasolofo, Y.: Report on the TREC 11 Experiment: Arabic, Named Page and Topic Distillation Searches. In: TREC (2002)
Saad, M.K., Ashour, W.: Arabic morphological tools for text mining. Corpora 18, 19 (2010)
Elghazaly, T.A., Fahmy, A.A.: English/Arabic cross language information retrieval (CLIR) for Arabic OCR-degraded text. Commun. IBIMA 9(25), 208–218 (2009)
Taghva, K., Elkhoury, R., Coombs, J.: Arabic stemming without a root dictionary. In: International Conference on Information Technology: Coding and Computing, 2005 (ITCC 2005). IEEE (2005)
Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL-02 conference on empirical methods in natural language processing vol. 10. Association for Computational Linguistics, pp 79–86 (2002)
Saeys, Y., Inza, I., Larrañaga, P.: A review of feature selection techniques in bioinformatics. Bioinformatics 23(19), 2507–2517 (2007)
Liu, H., Yu, L.: Toward integrating feature selection algorithms for classification and clustering. IEEE Trans. Knowl. Data Eng. 17(4), 491–502 (2005)
Li, Y., Hsu, D.F., Chung, S.M.: Combining multiple feature selection methods for text categorization by using rank-score characteristics. In: 21st International Conference on Tools with Artificial Intelligence, 2009 (ICTAI’09), pp. 508–517. IEEE (2009)
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification, vol. 2. Wiley, New York (1973)
Alsaleem, S.: Automated Arabic text categorization using SVM and NB. Int. Arab. J. e-Technol. 2(2), 124–128 (2011)
Sokolova, M., Japkowicz, N., Szpakowicz, S.: Beyond accuracy, F-score and ROC: a family of discriminant measures for performance evaluation. In: Australasian Joint Conference on Artificial Intelligence, pp. 1015–1021, 2006. Springer
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this chapter
Cite this chapter
Mahmoud, A., Elghazaly, T. (2018). Using Twitter to Monitor Political Sentiment for Arabic Slang. In: Shaalan, K., Hassanien, A., Tolba, F. (eds) Intelligent Natural Language Processing: Trends and Applications. Studies in Computational Intelligence, vol 740. Springer, Cham. https://doi.org/10.1007/978-3-319-67056-0_4
Download citation
DOI: https://doi.org/10.1007/978-3-319-67056-0_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-67055-3
Online ISBN: 978-3-319-67056-0
eBook Packages: EngineeringEngineering (R0)