Abstract
With the outbreak of social networks, blogs, and forums, classifying subjective text influenced by personal feelings and opinions has become an interesting research area. Many techniques have been proposed to solve the problem of analyzing and classifying sentiments held in those reviews and recommendations. Recently, deep learning models showed promising outcomes in many fields, including sentiment analysis. Therefore in this study, we propose a sentiment analysis deep learning-based model to predict the polarity of opinions and sentiments. Two types of recurrent neural networks are leveraged to learn higher-level representations. Then to mitigate the data dependency problem and to increase the model robustness, three distinct classification algorithms were utilized to produce the final output. Experimental results proved that our model prevailed in all the selected datasets with an accuracy ranging between 81.11 and 94.32%. Moreover, the model reduced the relative classification error rate by up to 26% compared to state-of-the-art models.
Similar content being viewed by others
Notes
the W terms denote the weight matrices, the b terms denote the bias vectors.
The tensorflow software library is available at https://www.tensorflow.org
The full code for DeepASA and the used datasets and other resources are available at https://zenodo.org/record/3864879#.XtDBjMBRU2x
References
Singh, J.; Singh, G.; Singh, R.: A review of sentiment analysis techniques for opinionated web text. CSI Trans. ICT 4(2–4), 241–247 (2016)
Aydoğan, E.; Akcayol, M.A.: A comprehensive survey for sentiment analysis tasks using machine learning techniques. In: International Symposium on INnovations in Intelligent SysTems and Applications (INISTA). IEEE 2016, 1–7 (2016)
Internet world users by language: Top 10 languages. https://www.internetworldstats.com/stats7.htm, Accessed 4 Oct 2019
Cho, K.; van Merriënboer, B.; Gulcehre, C.; Bahdanau, D.; Bougares, F.; Schwenk, H.; Bengio, Y.: Learning phrase representations using RNN encoder—decoder for statistical machine translation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Association for Computational Linguistics, Doha, Qatar, pp. 1724–1734 (2014) https://doi.org/10.3115/v1/D14-1179. https://www.aclweb.org/anthology/D14-1179
Hochreiter, S.; Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Alharbi, A., Taileb, M., Kalkatawi, M.: Deep learning in arabic sentiment analysis: an overview. J. Inf. Sci. 0165551519865488 (2019)
Al Sallab, A.; Hajj, H.; Badaro, G.; Baly, R.; El Hajj, W.; Shaban, K. B.: Deep learning models for sentiment analysis in Arabic. In: Proceedings of the Second Workshop on Arabic Natural Language Processing, pp. 9–17 (2015)
Badaro, G.; Baly, R.; Hajj, H.; Habash, N.; El-Hajj, W.: A large scale Arabic sentiment lexicon for Arabic opinion mining. In: Proceedings of the EMNLP 2014 Workshop on Arabic Natural Language Processing (ANLP), pp. 165–173 (2014)
Al-Sallab, A.; Baly, R.; Hajj, H.; Shaban, K. B.; El-Hajj, W.; Badaro, G.: Aroma: a recursive deep learning model for opinion mining in Arabic as a low resource language. ACM Trans. Asian Low Resour. Lang. Inform. Process. (TALLIP) 16(4): 25 (2017)
Baly, R.; Badaro, G.; Hamdi, A.; Moukalled, R.; Aoun, R.; El-Khoury, G.; Al Sallab, A.; Hajj, H.; Habash, N.; Shaban, K.; et al.: Omam at semeval-2017 task 4: evaluation of English state-of-the-art sentiment analysis models for Arabic and a new topic-based model. In: Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017), pp. 603–610 (2017)
Abdelhade, N.; Soliman, T. H. A.; Ibrahim, H. M.: Detecting twitter users’ opinions of Arabic comments during various time episodes via deep neural network. In: The Proceedings of International Conference on Advanced Intelligent Systems and Informatics, Springer, pp. 232–246 (2017)
Baly, R.; Hajj, H.; Habash, N.; Shaban, K. B.; El-Hajj, W.: A sentiment treebank and morphologically enriched recursive deep models for effective sentiment analysis in Arabic. ACM Trans. Asian Low Res. Lang. Inf. Process. (TALLIP) 16(4): 23 (2017)
Alayba, A. M.; Palade, V.; England, M.; Iqbal, R.: A combined cnn and lstm model for Arabic sentiment analysis. In: International Cross-Domain Conference for Machine Learning and Knowledge Extraction, Springer, pp. 179–191 (2018)
Heikal, M.; Torki, M.; El-Makky, N.: Sentiment analysis of Arabic tweets using deep learning. Proced. Comput. Sci. 142, 114–122 (2018)
Dahou, A.; Xiong, S.; Zhou, J.; Haddoud, M. H.; Duan, P.: Word embeddings and convolutional neural network for Arabic sentiment classification. In: The Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pp. 2418–2427 (2016)
Liu, J.; Shang, J.; Wang, C.; Ren, X.; Han, J.: Mining quality phrases from massive text corpora. In: The Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, ACM, pp. 1729–1744 (2015)
Mikolov, T.; Chen, K.; Corrado, G.; Dean, J.: Efficient estimation of word representations in vector space, arXiv preprint arXiv:1301.3781
Le, Q.; Mikolov, T.: Distributed representations of sentences and documents. In: International conference on machine learning, pp. 1188–1196 (2014)
Bojanowski, P.; Grave, E.; Joulin, A.; Mikolov, T.: Enriching word vectors with subword information. Trans. Assoc. Comput. Linguist. 5, 135–146 (2017)
Bergstra, J.; Bengio, Y.: Random search for hyper-parameter optimization. J. Mach. Learn. Resear. 13, 281–305 (2012)
Nowak, J.; Taspinar, A.; Scherer, R.: Lstm recurrent neural networks for short text and sentiment classification. In: International Conference on Artificial Intelligence and Soft Computing, Springer, pp. 553–562 (2017)
Aly, M.; Atiya, A.: Labr: A large scale arabic book reviews dataset. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Vol. 2, pp. 494–498. (2013)
ElSahar, H.; El-Beltagy, S. R.: Building large Arabic multi-domain resources for sentiment analysis. In: International Conference on Intelligent Text Processing and Computational Linguistics, Springer, pp. 23–34 (2015)
Abdulla, N. A.; Ahmed, N. A.; Shehab, M. A.; Al-Ayyoub, M.: Arabic sentiment analysis: Lexicon-based and corpus-based. In: The Proceeding of 2013 IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies (AEECT), IEEE, pp. 1–6 (2013)
Nabil, M.; Aly, M.; Atiya, A.: Astd: Arabic sentiment tweets dataset. In: The Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 2515–2519 (2015)
Chollet, F.; et al.: Keras. https://keras.io (2015).
Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; Vanderplas, J.; Passos, A.; Cournapeau, D.; Brucher, M.; Perrot, M.; Duchesnay, E.: Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011)
Al-Azani, S.; El-Alfy, E.-S. M.: Hybrid deep learning for sentiment polarity determination of arabic microblogs. In: The Proceeding of the International Conference on Neural Information Processing, Springer, pp. 491–500 (2017)
Elshakankery, K.; Ahmed, M. F.: Hilatsa: A hybrid incremental learning approach for Arabic tweets sentiment analysis. Egy. Inf. J.
Kalkatawi, M.; Magana-Mora, A.; Jankovic, B.; Bajic, V. B.: DeepGSR:an optimized deep-learning structure for the recognition of genomicsignals and regions. Bioinformatics 35(7): 1125–1132 (2018) https://academic.oup.com/bioinformatics/articlepdf/35/7/1125/28277811/bty752.pdf, https://doi.org/10.1093/bioinformatics/bty752.
Devlin, J.; Chang, M.-W.; Lee, K.; Toutanova, K.: Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805.
Peters, M. E.; Neumann, M.; Iyyer, M.; Gardner, M.; Clark, C.; Lee, K.; Zettlemoyer, L.: Deep contextualized word representations. arXiv preprint arXiv:1802.05365
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Alharbi, A., Kalkatawi, M. & Taileb, M. Arabic Sentiment Analysis Using Deep Learning and Ensemble Methods. Arab J Sci Eng 46, 8913–8923 (2021). https://doi.org/10.1007/s13369-021-05475-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13369-021-05475-0