Skip to main content

BAN-ABSA: An Aspect-Based Sentiment Analysis Dataset for Bengali and Its Baseline Evaluation

  • Conference paper
  • First Online:
Proceedings of International Joint Conference on Advances in Computational Intelligence

Abstract

Due to the breathtaking growth of social media or newspaper user comments, online product reviews comments, sentiment analysis (SA) has captured substantial interest from the researchers. With the fast increase of domain, SA work aims not only to predict the sentiment of a sentence or document but also to give the necessary detail on different aspects of the sentence or document (i.e. aspect-based sentiment analysis). A considerable number of datasets for SA and aspect-based sentiment analysis (ABSA) have been made available for English and other well-known European languages. In this paper, we present a manually annotated Bengali dataset of high quality, BAN-ABSA, which is annotated with aspect and its associated sentiment by three native Bengali speakers. The dataset consists of 2619 positive, 4721 negative and 1669 neutral data samples from 9009 unique comments gathered from some famous Bengali news portals. In addition, we conducted a baseline evaluation with a focus on deep learning model, achieved an accuracy of 78.75% for aspect term extraction and accuracy of 71.08% for sentiment classification. Experiments on the BAN-ABSA dataset show that the CNN model is better in terms of accuracy though Bi-LSTM significantly outperforms CNN model in terms of average F1-score.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://www.kaggle.com/mahfuzahmed/banabsa.

References

  1. Pang B, Lee L (2005) Seeing stars: exploiting class relationships for sentiment categorization with respect to rating scales. In: Proceedings of the 43rd annual meeting on association for computational linguistics. Association for Computational Linguistics, pp 115–124

    Google Scholar 

  2. Liu B (2012) Sentiment analysis and opinion mining. Synthesis Lectures Human Lang Technol 5(1):1–167

    Article  Google Scholar 

  3. Mohammad SM, Kiritchenko S, Zhu X (2013) Nrc-canada: building the state-of-the-art in sentiment analysis of tweets. arXiv preprint arXiv:1308.6242

  4. Rao D, Ravichandran D (2009) Semi-supervised polarity lexicon induction. In: Proceedings of the 12th conference of the European chapter of the ACL (EACL 2009), pp 675–682

    Google Scholar 

  5. Al-Smadi M, Qawasmeh O, Talafha B, Quwaider M (2015) Human annotated arabic dataset of book reviews for aspect based sentiment analysis. In: 2015 3rd international conference on future internet of things and cloud. IEEE, pp 726–730

    Google Scholar 

  6. Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780

    Article  Google Scholar 

  7. Ma Y, Peng H, Cambria E (2018) Targeted aspect-based sentiment analysis via embedding commonsense knowledge into an attentive lstm. In: Thirty-second AAAI conference on artificial intelligence

    Google Scholar 

  8. Wang Y, Huang M, Zhu X, Zhao L (2016) Attention-based lstm for aspect-level sentiment classification. In: Proceedings of the 2016 conference on empirical methods in natural language processing, pp. 606–615

    Google Scholar 

  9. Xu J, Chen D, Qiu X, Huang X (2016) Cached long short-term memory neural networks for document-level sentiment classification. arXiv preprint arXiv:1610.04989

  10. Kumar R, Pannu HS, Malhi AK (2019) Aspect-based sentiment analysis using deep networks and stochastic optimization. Neural Comput Appl 1–15

    Google Scholar 

  11. Xue W, Li T (2018) Aspect based sentiment analysis with gated convolutional networks. arXiv preprint arXiv:1805.07043

  12. Thet TT, Na JC, Khoo CS (2010) Aspect-based sentiment analysis of movie reviews on discussion boards. J Inform Sci 36(6):823–848

    Article  Google Scholar 

  13. Devlin J, Chang MW, Lee K, Toutanova K (2018) Bert: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805

  14. Peters ME, Neumann, M., Iyyer, M., Gardner, M., Clark, C., Lee, K., Zettlemoyer, L.: Deep contextualized word representations. arXiv preprint arXiv:1802.05365 (2018)

  15. Radford A, Narasimhan K, Salimans T, Sutskever I (2018) Improving language understanding by generative pre-training

    Google Scholar 

  16. Sun C, Huang L, Qiu X (2019) Utilizing bert for aspect-based sentiment analysis via constructing auxiliary sentence. arXiv preprint arXiv:1903.09588 (2019)

  17. Saeidi M, Bouchard G, Liakata M, Riedel S (2016) Sentihood: targeted aspect based sentiment analysis dataset for urban neighbourhoods. arXiv preprint arXiv:1610.03771

  18. Akhtar MS, Ekbal A, Bhattacharyya P (2016) Aspect based sentiment analysis in hindi: resource creation and evaluation. In: Proceedings of the tenth international conference on language resources and evaluation (LREC’16), pp 2703–2709

    Google Scholar 

  19. Pontiki M, Galanis D, Papageorgiou H, Androutsopoulos I, Manandhar S, Al-Smadi M, Al-Ayyoub M, Zhao Y, Qin B, De Clercq O et al (2016) Semeval-2016 task 5: aspect based sentiment analysis. In: 10th international workshop on semantic evaluation (SemEval 2016)

    Google Scholar 

  20. Tamchyna A, Fiala O, Veselovská K (2015) Czech aspect-based sentiment analysis: a new dataset and preliminary results. In: ITAT, pp 95–99

    Google Scholar 

  21. Rahman M, Kumar Dey E et al (2018) Datasets for aspect-based sentiment analysis in bangla and its baseline evaluation. Data 3(2):15

    Article  Google Scholar 

  22. Rahman MA, Dey EK (2018) Aspect extraction from bangla reviews using convolutional neural network. In: 2018 Joint 7th international conference on informatics, electronics & vision (ICIEV) and 2018 2nd international conference on imaging, vision & pattern recognition (icIVPR). IEEE, pp 262–267

    Google Scholar 

  23. Daily Prothom Alo, https://www.prothomalo.com. Last accessed 31 July, 2020

  24. Daily Prothom Alo Facebook Page, https://www.facebook.com/DailyProthomAlo. Last accessed 31 July, 2020

  25. Daily Jugantor, https://www.jugantor.com. Last accessed 31 July, 2020

  26. Daily Jugantor Facebook Page, https://web.facebook.com/TheDailyJugantor. Last accessed 31 July, 2020

  27. Kaler Kantho, https://www.kalerkantho.com. Last accessed 31 July, 2020

  28. Kaler Kantho Facebook Page, https://web.facebook.com/kalerkantho. Last accessed 31 July, 2020

  29. Powers DM (1998) Applications and explanations of zipf’s law. In: New methods in language processing and computational natural language learning

    Google Scholar 

  30. Schuster M, Paliwal KK (1997) Bidirectional recurrent neural networks. IEEE Trans Signal Process 45(11):2673–2681

    Article  Google Scholar 

  31. Haque S, Rahman T, khan Shakir A, Arman MS, Been K, Biplob B, Himu FA, Das D (2020) Aspect based sentiment analysis in Bangla dataset based on aspect term extraction

    Google Scholar 

Download references

Acknowledgements

We are very grateful to the SUST NLP Group and to the previous researchers who have worked in Bengali SA and ABSA. We are also very grateful to the researchers who have paved the way for NLP and neural networks.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mahfuz Ahmed Masum .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ahmed Masum, M., Junayed Ahmed, S., Tasnim, A., Saiful Islam, M. (2021). BAN-ABSA: An Aspect-Based Sentiment Analysis Dataset for Bengali and Its Baseline Evaluation. In: Uddin, M.S., Bansal, J.C. (eds) Proceedings of International Joint Conference on Advances in Computational Intelligence. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-16-0586-4_31

Download citation

Publish with us

Policies and ethics