Skip to main content
Log in

Arabic Sentiment Analysis Using Deep Learning and Ensemble Methods

  • Research Article-Computer Engineering and Computer Science
  • Published:
Arabian Journal for Science and Engineering Aims and scope Submit manuscript

Abstract

With the outbreak of social networks, blogs, and forums, classifying subjective text influenced by personal feelings and opinions has become an interesting research area. Many techniques have been proposed to solve the problem of analyzing and classifying sentiments held in those reviews and recommendations. Recently, deep learning models showed promising outcomes in many fields, including sentiment analysis. Therefore in this study, we propose a sentiment analysis deep learning-based model to predict the polarity of opinions and sentiments. Two types of recurrent neural networks are leveraged to learn higher-level representations. Then to mitigate the data dependency problem and to increase the model robustness, three distinct classification algorithms were utilized to produce the final output. Experimental results proved that our model prevailed in all the selected datasets with an accuracy ranging between 81.11 and 94.32%. Moreover, the model reduced the relative classification error rate by up to 26% compared to state-of-the-art models.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

Notes

  1. the W terms denote the weight matrices, the b terms denote the bias vectors.

  2. The tensorflow software library is available at https://www.tensorflow.org

  3. The full code for DeepASA and the used datasets and other resources are available at https://zenodo.org/record/3864879#.XtDBjMBRU2x

References

  1. Singh, J.; Singh, G.; Singh, R.: A review of sentiment analysis techniques for opinionated web text. CSI Trans. ICT 4(2–4), 241–247 (2016)

    Article  Google Scholar 

  2. Aydoğan, E.; Akcayol, M.A.: A comprehensive survey for sentiment analysis tasks using machine learning techniques. In: International Symposium on INnovations in Intelligent SysTems and Applications (INISTA). IEEE 2016, 1–7 (2016)

  3. Internet world users by language: Top 10 languages. https://www.internetworldstats.com/stats7.htm, Accessed 4 Oct 2019

  4. Cho, K.; van Merriënboer, B.; Gulcehre, C.; Bahdanau, D.; Bougares, F.; Schwenk, H.; Bengio, Y.: Learning phrase representations using RNN encoder—decoder for statistical machine translation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Association for Computational Linguistics, Doha, Qatar, pp. 1724–1734 (2014) https://doi.org/10.3115/v1/D14-1179. https://www.aclweb.org/anthology/D14-1179

  5. Hochreiter, S.; Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)

    Article  Google Scholar 

  6. Alharbi, A., Taileb, M., Kalkatawi, M.: Deep learning in arabic sentiment analysis: an overview. J. Inf. Sci. 0165551519865488 (2019)

  7. Al Sallab, A.; Hajj, H.; Badaro, G.; Baly, R.; El Hajj, W.; Shaban, K. B.: Deep learning models for sentiment analysis in Arabic. In: Proceedings of the Second Workshop on Arabic Natural Language Processing, pp. 9–17 (2015)

  8. Badaro, G.; Baly, R.; Hajj, H.; Habash, N.; El-Hajj, W.: A large scale Arabic sentiment lexicon for Arabic opinion mining. In: Proceedings of the EMNLP 2014 Workshop on Arabic Natural Language Processing (ANLP), pp. 165–173 (2014)

  9. Al-Sallab, A.; Baly, R.; Hajj, H.; Shaban, K. B.; El-Hajj, W.; Badaro, G.: Aroma: a recursive deep learning model for opinion mining in Arabic as a low resource language. ACM Trans. Asian Low Resour. Lang. Inform. Process. (TALLIP) 16(4): 25 (2017)

  10. Baly, R.; Badaro, G.; Hamdi, A.; Moukalled, R.; Aoun, R.; El-Khoury, G.; Al Sallab, A.; Hajj, H.; Habash, N.; Shaban, K.; et al.: Omam at semeval-2017 task 4: evaluation of English state-of-the-art sentiment analysis models for Arabic and a new topic-based model. In: Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017), pp. 603–610 (2017)

  11. Abdelhade, N.; Soliman, T. H. A.; Ibrahim, H. M.: Detecting twitter users’ opinions of Arabic comments during various time episodes via deep neural network. In: The Proceedings of International Conference on Advanced Intelligent Systems and Informatics, Springer, pp. 232–246 (2017)

  12. Baly, R.; Hajj, H.; Habash, N.; Shaban, K. B.; El-Hajj, W.: A sentiment treebank and morphologically enriched recursive deep models for effective sentiment analysis in Arabic. ACM Trans. Asian Low Res. Lang. Inf. Process. (TALLIP) 16(4): 23 (2017)

  13. Alayba, A. M.; Palade, V.; England, M.; Iqbal, R.: A combined cnn and lstm model for Arabic sentiment analysis. In: International Cross-Domain Conference for Machine Learning and Knowledge Extraction, Springer, pp. 179–191 (2018)

  14. Heikal, M.; Torki, M.; El-Makky, N.: Sentiment analysis of Arabic tweets using deep learning. Proced. Comput. Sci. 142, 114–122 (2018)

    Article  Google Scholar 

  15. Dahou, A.; Xiong, S.; Zhou, J.; Haddoud, M. H.; Duan, P.: Word embeddings and convolutional neural network for Arabic sentiment classification. In: The Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pp. 2418–2427 (2016)

  16. Liu, J.; Shang, J.; Wang, C.; Ren, X.; Han, J.: Mining quality phrases from massive text corpora. In: The Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, ACM, pp. 1729–1744 (2015)

  17. Mikolov, T.; Chen, K.; Corrado, G.; Dean, J.: Efficient estimation of word representations in vector space, arXiv preprint arXiv:1301.3781

  18. Le, Q.; Mikolov, T.: Distributed representations of sentences and documents. In: International conference on machine learning, pp. 1188–1196 (2014)

  19. Bojanowski, P.; Grave, E.; Joulin, A.; Mikolov, T.: Enriching word vectors with subword information. Trans. Assoc. Comput. Linguist. 5, 135–146 (2017)

    Article  Google Scholar 

  20. Bergstra, J.; Bengio, Y.: Random search for hyper-parameter optimization. J. Mach. Learn. Resear. 13, 281–305 (2012)

    MathSciNet  MATH  Google Scholar 

  21. Nowak, J.; Taspinar, A.; Scherer, R.: Lstm recurrent neural networks for short text and sentiment classification. In: International Conference on Artificial Intelligence and Soft Computing, Springer, pp. 553–562 (2017)

  22. Aly, M.; Atiya, A.: Labr: A large scale arabic book reviews dataset. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Vol. 2, pp. 494–498. (2013)

  23. ElSahar, H.; El-Beltagy, S. R.: Building large Arabic multi-domain resources for sentiment analysis. In: International Conference on Intelligent Text Processing and Computational Linguistics, Springer, pp. 23–34 (2015)

  24. Abdulla, N. A.; Ahmed, N. A.; Shehab, M. A.; Al-Ayyoub, M.: Arabic sentiment analysis: Lexicon-based and corpus-based. In: The Proceeding of 2013 IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies (AEECT), IEEE, pp. 1–6 (2013)

  25. Nabil, M.; Aly, M.; Atiya, A.: Astd: Arabic sentiment tweets dataset. In: The Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 2515–2519 (2015)

  26. Chollet, F.; et al.: Keras. https://keras.io (2015).

  27. Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; Vanderplas, J.; Passos, A.; Cournapeau, D.; Brucher, M.; Perrot, M.; Duchesnay, E.: Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011)

    MathSciNet  MATH  Google Scholar 

  28. Al-Azani, S.; El-Alfy, E.-S. M.: Hybrid deep learning for sentiment polarity determination of arabic microblogs. In: The Proceeding of the International Conference on Neural Information Processing, Springer, pp. 491–500 (2017)

  29. Elshakankery, K.; Ahmed, M. F.: Hilatsa: A hybrid incremental learning approach for Arabic tweets sentiment analysis. Egy. Inf. J.

  30. Kalkatawi, M.; Magana-Mora, A.; Jankovic, B.; Bajic, V. B.: DeepGSR:an optimized deep-learning structure for the recognition of genomicsignals and regions. Bioinformatics 35(7): 1125–1132 (2018) https://academic.oup.com/bioinformatics/articlepdf/35/7/1125/28277811/bty752.pdf, https://doi.org/10.1093/bioinformatics/bty752.

  31. Devlin, J.; Chang, M.-W.; Lee, K.; Toutanova, K.: Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805.

  32. Peters, M. E.; Neumann, M.; Iyyer, M.; Gardner, M.; Clark, C.; Lee, K.; Zettlemoyer, L.: Deep contextualized word representations. arXiv preprint arXiv:1802.05365

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Amal Alharbi.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Alharbi, A., Kalkatawi, M. & Taileb, M. Arabic Sentiment Analysis Using Deep Learning and Ensemble Methods. Arab J Sci Eng 46, 8913–8923 (2021). https://doi.org/10.1007/s13369-021-05475-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13369-021-05475-0

Keywords

Navigation