Abstract
Previous studies show the effectiveness of pre-trained language models for sentiment analysis. However, most of these studies ignore the importance of sentiment information for pre-trained models. We therefore investigate sentiment information for pre-trained models and enhance pre-trained language models with semantic graphs for sentiment analysis. In particular, we introduce Semantic Graphs based Pre-training (SGPT), which uses semantic graphs to obtain synonym knowledge for aspect-sentiment pairs and similar aspect/sentiment terms. We then optimize the pre-trained language model with the semantic graphs. Empirical studies on several downstream tasks show that the proposed model outperforms strong pre-trained baselines. The results also demonstrate the effectiveness of the proposed semantic graphs for pre-trained models.
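As a rough illustration of how a semantic graph can supply synonym knowledge for aspect-sentiment pairs, the toy sketch below builds a small synonym graph and enumerates the pairs reachable by swapping in synonyms. It is not the SGPT construction itself: the edge list and the names `SYNONYM_EDGES`, `build_graph`, `neighbours` and `similar_pairs` are hypothetical and chosen only for this example.

```python
from collections import defaultdict
from itertools import product

# Hypothetical toy synonym edges between aspect/sentiment terms
# (illustrative only; not taken from the paper).
SYNONYM_EDGES = [
    ("price", "cost"),
    ("cheap", "inexpensive"),
    ("screen", "display"),
    ("bright", "vivid"),
]


def build_graph(edges):
    """Build an undirected adjacency map from synonym edges."""
    graph = defaultdict(set)
    for a, b in edges:
        graph[a].add(b)
        graph[b].add(a)
    return graph


def neighbours(graph, term):
    """Return the term itself plus its direct synonyms, sorted."""
    return sorted({term} | graph.get(term, set()))


def similar_pairs(graph, aspect, sentiment):
    """List aspect-sentiment pairs obtained by swapping in synonyms."""
    candidates = product(neighbours(graph, aspect), neighbours(graph, sentiment))
    return [pair for pair in candidates if pair != (aspect, sentiment)]


graph = build_graph(SYNONYM_EDGES)
pairs = similar_pairs(graph, "price", "cheap")
print(pairs)
```

In a pre-training setup, pairs like these could plausibly serve as positives for a similarity objective, but how SGPT actually uses its graphs is described in the full paper, not in this sketch.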
Funding
not applicable
Author information
Contributions
Yong Qian, Chen Chen and Zhongqing Wang wrote the main manuscript text, and Yong Qian ran all experiments. Chen Chen prepared Figures 1-3. All authors reviewed the manuscript.
Ethics declarations
Ethics approval
not applicable
Competing interests
not applicable
Additional information
Availability of data and materials
All the data can be found online:
SST-2: http://nlp.stanford.edu/sentiment
MPQA2: http://mpqa.cs.pitt.edu/corpora/mpqa_corpus/mpqa_corpus_2_0/
Amazon-2: https://snap.stanford.edu/data/web-Amazon.html
SRL4ORL: http://alt.qcri.org/semeval2014/task4/data/uploads/
The Taobao dataset cannot be provided at present because Alibaba has not yet approved its release; once it is approved, we will release the dataset.
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article belongs to the Topical Collection: Special Issue on Spatiotemporal Data Management and Analytics for Recommend. Guest Editors: Shuo Shang, Xiangliang Zhang and Panos Kalnis
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Yong, Q., Chen, C., Wang, Z. et al. SGPT: Semantic graphs based pre-training for aspect-based sentiment analysis. World Wide Web 26, 2201–2214 (2023). https://doi.org/10.1007/s11280-022-01123-1
DOI: https://doi.org/10.1007/s11280-022-01123-1