Abstract
Aspect sentiment classification is an important branch of sentiment classification that has gained increasing attention recently. Existing aspect sentiment classification methods typically use different network branches to encode context and aspect words separately, and then use an attention mechanism to capture their associations. This attention-based approach cannot completely ignore the contexts unrelated to the current aspect words, which brings noise interference. In this paper, a gated filtering network based on BERT is suggested as a solution to this issue. We employ BERT to encode the text semantics of contexts and sentence pairs consisting of context and aspect words respectively, and to extract lexical features as well as associative features of context and aspect words. Based on this, we designed a gating module that, unlike the attention mechanism, uses association features to precisely filter irrelevant contexts. Additionally, because the BERT network parameters are so big, there is a tendency to over-fitting during training. To effectively combat this problem, we developed a loss function with a threshold. We carried out extensive experiments using three benchmark datasets to verify the performance of our proposed model. The experimental results show that the method improves the accuracy by 0.5%, 1.39% and 2.57% on the Laptop, Restaurant and Twitter datasets respectively, and 1.564%, 2.36% and 4.144% on Macro-F1 respectively, compared to the recent RA-CNN (BERT), proving that our method is effective in improving the presentation of aspect sentiment classification in comparison to other cutting-edge sentiment classification methods.
Similar content being viewed by others
Data availability
The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.
References
Abel, J., & Lantow, B. (2019). A methodological framework for dictionary and rule-based text classification. In: 11th International Conference on Knowledge Discovery and Information Retrieval, pp. 330–337. SciTePress. https://doi.org/10.5220/0008121503300337
Chen, P., Sun, Z., Bing, L., et al. (2017). Recurrent attention network on memory for aspect sentiment analysis. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp. 452–461. Association for Computational Linguistics. https://doi.org/10.18653/v1/D17-1047
Chung, J., Gulcehre, C., Cho, K., et al. (2014). Empirical evaluation of gated recurrent neural networks on sequence modeling. In: NIPS 2014 Workshop on Deep Learning, pp. 1–9. MIT Press. https://doi.org/10.48550/arXiv.1412.3555
Colbrook, M. J., Antun, V., & Hansen, A. C. (2022). The difficulty of computing stable and accurate neural networks: On the barriers of deep learning and smale’s 18th problem. Proceedings of the National Academy of Sciences of the United States of America, 119, 2107151119. https://doi.org/10.1073/pnas.2107151119
Dauphin, Y. N., Fan, A., Auli, M., et al. (2017). Language modeling with gated convolutional networks. In: Proceedings of the 34th International Conference on Machine Learning, vol. 70, pp. 933–941. JMLR.org. https://doi.org/10.48550/arXiv.1612.08083
Dong, L., Wei, F., Tan, C., et al. (2014). Adaptive recursive neural network for target-dependent twitter sentiment classification. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, vol. 2, pp. 49–54. Association for Computational Linguistics. https://doi.org/10.3115/v1/P14-2009
Gan, S., Shao, S., Chen, L., et al. (2021). Adapting hidden naive bayes for text classification. Mathematics, 9, 2378. https://doi.org/10.3390/math9192378
Gong, C., Du, Y., Li, X., et al. (2020). Structural hole-based approach to control public opinion in a social network. Engineering Applications of Artificial Intelligence, 93, 103690. https://doi.org/10.1016/j.engappai.2020.103690
Guia, M., Silva, R., & Bernardino, J. (2019). Comparison of naïve bayes, support vector machine, decision trees and random forest on sentiment analysis. In: International Conference on Knowledge Discovery and Information Retrieval, pp. 525–531. SciTePress. https://doi.org/10.5220/0008364105250531
He, R., Lee, W. S., Ng, H. T., et al. (2018). Exploiting document knowledge for aspect-level sentiment classification. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, vol. 2, pp. 579–585. Association for Computational Linguistics. https://doi.org/10.18653/v1/P18-2092
Hochreiter, S., & Schmidhuber, J. (1997). Long short-term memory. Neural Computation, 9, 1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735
Huang, B., & Carley, K. (2018). Parameterized convolutional neural networks for aspect level sentiment classification. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 1091–1096. Association for Computational Linguistics. https://doi.org/10.18653/v1/D18-1136
Huang, B., Ou, Y., & Carley, K. M. (2018). Aspect level sentiment classification with attention-over-attention neural networks. In: Social, Cultural, and Behavioral Modeling, vol. 10899, pp. 197–206. Springer. https://doi.org/10.1007/978-3-319-93372-6_22
Imani, M., & Noferesti, S. (2022). Aspect extraction and classification for sentiment analysis in drug reviews. Journal of Intelligent Information Systems, 59, 613–633. https://doi.org/10.1007/s10844-022-00712-w
Jaiswal, S., Mehta, A., & Nandi, G. (2018). Investigation on the effect of l1 an l2 regularization on image features extracted using restricted boltzmann machine. In: 2018 Second International Conference on Intelligent Computing and Control Systems (ICICCS), pp. 1548–1553. IEEE. https://doi.org/10.1109/ICCONS.2018.8663071
Karimi, A., Rossi, L., & Prati, A. (2021). Adversarial training for aspect-based sentiment analysis with bert. In: 2020 25th International Conference on Pattern Recognition, pp. 8797–8803. IEEE. https://doi.org/10.1109/ICPR48806.2021.9412167
Karimi, A., Rossi, L., & Prati, A. (2021). Improving bert performance for aspect-based sentiment analysis. In: Proceedings of the 4th International Conference on Natural Language and Speech Processing, pp. 39–46. Association for Computational Linguistics. https://aclanthology.org/2021.icnlsp-1.5
Kim Phung, T., An Te, N., & Thi Thu Ha, T. (2021). A machine learning approach for opinion mining online customer reviews. In: 2021 21st ACIS International Winter Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD-Winter), pp. 243–246. IEEE. https://doi.org/10.1109/SNPDWinter52325.2021.00059
Lan, Z., Chen, M., Goodman, S., et al. (2020). Albert: A lite bert for self-supervised learning of language representations. In: 8th International Conference on Learning Representations, pp. 1–17. OpenReview.net. https://doi.org/10.48550/arXiv.1909.11942
Li, X., Bing, L., Lam, W., et al. (2018). Transformation networks for target-oriented sentiment classification. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, vol. 1, pp. 946–956. Association for Computational Linguistics. https://doi.org/10.18653/v1/P18-1087
Liu, P., Qiu, X., & Huang, X. (2016). Recurrent neural network for text classification with multi-task learning. In: Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, pp. 2873–2879. AAAI Press. https://doi.org/10.48550/arXiv.1605.05101
Liu, J., & Zhang, Y. (2017). Attention modeling for targeted sentiment. In: Conference of the European Chapter of the Association for Computational Linguistics, vol. 2, pp. 572–577. Association for Computational Linguistics. https://doi.org/10.18653/v1/E17-2091
Loshchilov, I., & Hutter, F. (2017). Fixing weight decay regularization in adam. Computing Research Repository abs/1711.05101, 1–13 (2017). https://doi.org/10.48550/arXiv.1711.05101
Ma, D., Li, S., Zhang, X., et al. (2017). Interactive attention networks for aspect-level sentiment classification. In: Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, pp. 4068–4074. AAAI Press. https://doi.org/10.24963/ijcai.2017/568
Ma, D., Wang, G., Han, C., et al. (2018). A memristive neural network model with associative memory for modeling affections. IEEE Access, 6, 61614–61622. https://doi.org/10.1109/ACCESS.2018.2875433
Meng, W., Wei, Y., Liu, P., et al. (2019). Aspect based sentiment analysis with feature enhanced attention cnn-bilstm. IEEE Access, 7, 167240–167249. https://doi.org/10.1109/ACCESS.2019.2952888
Mewada, A., & Dewang, R. K. (2022). Sa-asba: a hybrid model for aspect-based sentiment analysis using synthetic attention in pre-trained language bert model with extreme gradient boosting. The Journal of Supercomputing, 79, 5516–5551. https://doi.org/10.1007/s11227-022-04881-x
Morente-Molinera, J. A., Kou, G., Peng, Y., et al. (2018). Analysing discussions in social networks using group decision making methods and sentiment analysis. Information Sciences, 447, 157–168. https://doi.org/10.1016/j.ins.2018.03.020
Pontiki, M., Galanis, D., Papageorgiou, H., et al. (2016). Semeval-2016 task 5: Aspect based sentiment analysis. In: International Workshop on Semantic Evaluation, pp. 19–30. Association for Computational Linguistics. https://doi.org/10.18653/v1/S16-1002
Pontiki, M., Galanis, D., Pavlopoulos, J., et al. (2014). Semeval-2014 task 4: Aspect based sentiment analysis. In: Proceedings of the 8th International Workshop on Semantic Evaluation, pp. 27–35. Association for Computational Linguistics. https://doi.org/10.3115/v1/S14-2004
Rajpurkar, P., Jia, R., & Liang, P. (2018). Know what you don’t know: Unanswerable questions for squad. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, vol. 2, pp. 784–789. Association for Computational Linguistics. https://doi.org/10.18653/v1/P18-2124
Ramaswamy, S. L., & Chinnappan, J. (2022). Recognet-lstm+cnn: a hybrid network with attention mechanism for aspect categorization and sentiment classification. Journal of Intelligent Information Systems, 58, 379–404. https://doi.org/10.1007/s10844-021-00692-3
Santucci, V., Forti, L., Santarelli, F., et al. (2020). Learning to classify text complexity for the italian language using support vector machines. In: Computational Science and Its Applications - ICCSA 2020, vol. 12250, pp. 367–376. Springer. https://doi.org/10.1007/978-3-030-58802-1_27
Singh, L. G., & Singh, S. R. (2021). Empirical study of sentiment analysis tools and techniques on societal topics. Journal of Intelligent Information Systems, 56, 379–407. https://doi.org/10.1007/s10844-020-00616-7
Song, Y., Wang, J., Jiang, T., et al. (2019). Targeted sentiment classification with attentional encoder network. In: Artificial Neural Networks and Machine Learning – ICANN 2019: Text and Time Series, vol. 11730, pp. 93–103. Springer. https://doi.org/10.1007/978-3-030-30490-4_9
Song, W., Wen, Z., Xiao, Z., et al. (2021). Semantics perception and refinement network for aspect-based sentiment analysis. Knowledge-Based Systems, 214, 106755. https://doi.org/10.1016/j.knosys.2021.106755
Sukhbaatar, S., Szlam, A., Weston, J., et al. (2015). End-to-end memory networks. In: Proceedings of the 28th International Conference on Neural Information Processing Systems, vol. 2, pp. 2440–2448. MIT Press. https://doi.org/10.48550/arXiv.1503.08895
Sun, C., Huang, L., & Qiu, X. (2019). Utilizing bert for aspect-based sentiment analysis via constructing auxiliary sentence. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol. 1, pp. 380–385. Association for Computational Linguistics. https://doi.org/10.18653/v1/N19-1035
Suzuki, M., Itoh, N., Nagano, T., et al. (2019). Improvements to n-gram language model using text generated from neural language model. In: ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 7245–7249. IEEE. https://doi.org/10.1109/ICASSP.2019.8683481
Tang, D., Qin, B., Feng, X., et al. (2016). Effective lstms for target-dependent sentiment classification. In: Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pp. 3298–3307. The COLING 2016 Organizing Committee. https://doi.org/10.48550/arXiv.1512.01100
Tang, D., Qin, B., & Liu, T. (2016). Aspect level sentiment classification with deep memory network. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 214–224. Association for Computational Linguistics. https://doi.org/10.18653/v1/D16-1021
Tran, N. K., & Niedereée, C. (2018). Multihop attention networks for question answer matching. In: The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, pp. 325–334. Association for Computing Machinery. https://doi.org/10.1145/3209978.3210009
Wang, Y., Huang, M., Zhu, X., et al. (2016). Attention-based lstm for aspect-level sentiment classification. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 606–615. Association for Computational Linguistics. https://doi.org/10.18653/v1/D16-1058
Wang, Q., Xu, J., Chen, H., et al. (2017). Two improved continuous bag-of-word models. In: 2017 International Joint Conference on Neural Networks (IJCNN), pp. 2851–2856. IEEE. https://doi.org/10.1109/IJCNN.2017.7966208
Weston, J., Chopra, S., & Bordes, A. (2015). Memory networks. In: 3rd International Conference on Learning Representations, pp. 1–15. Ithaca, NY: arXiv.org. https://doi.org/10.48550/arXiv.1410.3916
Wu, C., Xiong, Q., Yang, Z., et al. (2021). Residual attention and other aspects module for aspect-based sentiment analysis. Neurocomputing, 435. https://doi.org/10.1016/j.neucom.2021.01.019
Xu, H., Liu, B., Shu, L., et al. (2019). Bert post-training for review reading comprehension and aspect-based sentiment analysis. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol. 1, pp. 2324–2335. Association for Computational Linguistics. https://doi.org/10.18653/v1/N19-1242
Xu, H., Liu, B., Shu, L., et al. (2019). Bert post-training for review reading comprehension and aspect-based sentiment analysis. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol. 1, pp. 2324–2335. Association for Computational Linguistics. https://doi.org/10.18653/v1/N19-1242
Xue, W., & Li, T. (2018). Aspect based sentiment analysis with gated convolutional networks. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, vol. 1, pp. 2514–2523. Association for Computational Linguistics (2018). https://doi.org/10.18653/v1/P18-1234
Zhang, Y., Ding, Q., Zhu, Z., et al. (2022). Enhancing aspect and opinion terms semantic relation for aspect sentiment triplet extraction, 59, 523–542. https://doi.org/10.1007/s10844-022-00710-y
Zhang, Z., Han, X., Liu, Z., et al. (2019). Ernie: Enhanced language representation with informative entities. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 1441–1451. Association for Computational Linguistics. https://doi.org/10.18653/v1/P19-1139
Zhang, L., Wang, S., & Liu, B. (2018). Deep learning for sentiment analysis: A survey. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 8, 1253. https://doi.org/10.1002/widm.1253
Zhou, J., Huang, X., Hu, Q., et al. (2020). Sk-gcn: Modeling syntax and knowledge via graph convolutional network for aspect-level sentiment classification. Knowledge-Based Systems, 205, 106292. https://doi.org/10.1016/j.knosys.2020.106292
Zhu, P., & Qian, T. (2018). Enhanced aspect level sentiment classification with auxiliary memory. In: Proceedings of the 27th International Conference on Computational Linguistics, pp. 1077–1087. Association for Computational Linguistics. https://aclanthology.org/C18-1092/
Zhu, L., Zhu, X., Guo, J., et al. (2022). Exploring rich structure information for aspect-based sentiment classification. Journal of Intelligent Information Systems, 60, 1–21. https://doi.org/10.1007/s10844-022-00729-1
Acknowledgements
Not Applicable.
Funding
This paper was supported by the Open Fund of Hubei Key Laboratory of Intelligent Geo-Information Processing (ZRIGIP-201801).
Author information
Authors and Affiliations
Contributions
Not Applicable.
Corresponding author
Ethics declarations
Ethical approval
Not Applicable.
Competing interests
The authors affirm that they do not have any competing interests. The founders were not involved in the design of the study, data collecting, analysis, or interpretation, article preparation, or decision-making.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Wang, J., Wu, W. & Ren, J. BERT-PG: a two-branch associative feature gated filtering network for aspect sentiment classification. J Intell Inf Syst 60, 709–730 (2023). https://doi.org/10.1007/s10844-023-00785-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10844-023-00785-1