Abstract
Driven by the rapid growth of user comments on social media and news sites and of online product reviews, sentiment analysis (SA) has attracted substantial research interest. As the field has expanded, SA work aims not only to predict the sentiment of a sentence or document but also to provide detail on the different aspects it discusses (i.e., aspect-based sentiment analysis). A considerable number of datasets for SA and aspect-based sentiment analysis (ABSA) are available for English and other widely studied European languages. In this paper, we present BAN-ABSA, a high-quality, manually annotated Bengali dataset, annotated with aspects and their associated sentiment by three native Bengali speakers. The dataset consists of 2619 positive, 4721 negative, and 1669 neutral samples from 9009 unique comments collected from several well-known Bengali news portals. In addition, we conducted a baseline evaluation focused on deep learning models, achieving an accuracy of 78.75% for aspect term extraction and 71.08% for sentiment classification. Experiments on the BAN-ABSA dataset show that the CNN model achieves higher accuracy, whereas the Bi-LSTM significantly outperforms the CNN in terms of average F1-score.
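The class counts reported in the abstract are imbalanced, which is one reason accuracy and an averaged F1-score can rank two models differently. The following is a minimal sketch of that effect, not the authors' evaluation code. It assumes that "average F1-score" means the macro average (the abstract does not say which average is used), and it scores a hypothetical classifier that always predicts the majority class. The helper names `accuracy` and `macro_f1` are illustrative.

```python
# Class counts as reported in the abstract (9009 samples in total).
counts = {"positive": 2619, "negative": 4721, "neutral": 1669}


def accuracy(confusion):
    """Fraction of samples on the diagonal of confusion[true][pred]."""
    total = sum(sum(row.values()) for row in confusion.values())
    correct = sum(confusion[label][label] for label in confusion)
    return correct / total


def macro_f1(confusion):
    """Unweighted mean of per-class F1 (assumed meaning of 'average F1')."""
    labels = list(confusion)
    scores = []
    for label in labels:
        tp = confusion[label][label]
        fp = sum(confusion[t][label] for t in labels if t != label)
        fn = sum(confusion[label][p] for p in labels if p != label)
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class is never
        # predicted and never present.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Hypothetical baseline: every sample is assigned the majority class.
majority = max(counts, key=counts.get)
confusion = {
    true: {pred: (n if pred == majority else 0) for pred in counts}
    for true, n in counts.items()
}

baseline_accuracy = accuracy(confusion)
baseline_macro_f1 = macro_f1(confusion)

print(f"majority class:    {majority}")
print(f"baseline accuracy: {baseline_accuracy:.4f}")
print(f"baseline macro-F1: {baseline_macro_f1:.4f}")
```

Under these assumptions the baseline is correct on more than half of the samples yet has a much lower macro-F1, because the two minority classes each contribute an F1 of zero. A model that trades some majority-class accuracy for better minority-class recall can therefore lose on accuracy while winning on averaged F1.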
Acknowledgements
We are very grateful to the SUST NLP Group and to the previous researchers who have worked on Bengali SA and ABSA. We are also very grateful to the researchers who have paved the way for NLP and neural networks.
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Ahmed Masum, M., Junayed Ahmed, S., Tasnim, A., Saiful Islam, M. (2021). BAN-ABSA: An Aspect-Based Sentiment Analysis Dataset for Bengali and Its Baseline Evaluation. In: Uddin, M.S., Bansal, J.C. (eds) Proceedings of International Joint Conference on Advances in Computational Intelligence. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-16-0586-4_31
DOI: https://doi.org/10.1007/978-981-16-0586-4_31
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-0585-7
Online ISBN: 978-981-16-0586-4
eBook Packages: Intelligent Technologies and Robotics (R0)