Abstract
Debate websites are valuable social media platforms for discussing controversial issues and gaining insights into diverse perspectives. However, with thousands of arguments on popular topics, browsing through entire discussions can be too time-consuming to extract a summary of the main viewpoints. To address this challenge, natural language processing techniques have been proposed to automatically identify argument structures and determine a user's stance on an issue. In addition to detecting stances, identifying topics that remain contentious between factions is especially important for policy-makers. However, the topics discussed by users with the same stance may vary in importance. To extract potential text topics, models based on probability and neural topic models have been proposed, with the latter being more effective due to improved computational cost and the ability to extract more consistent topics. This research proposes a two-stage stance detection model, which combines a neural topic model based on variational autoencoder and a recurrent neural network to learn the characteristics of an argument and enhance the model with subtopic features. The experimental results show that the model can obtain more coherent subtopics with a coherence of 0.42, and achieve an accuracy of nearly 70% in stance detection, demonstrating the effectiveness of the subtopic feature detection process.
Similar content being viewed by others
Data availability
Data sharing does not apply to this article, as no datasets were generated or analyzed during the current study.
References
Justo R et al (2014) Extracting relevant knowledge for the detection of sarcasm and nastiness in the social web. Knowl-Based Syst 69:124–133
Rostami M et al (2023) Community Detection Algorithms in Healthcare Applications: A Systematic Review. IEEE Access 11:30247–30272
Jain PK et al (2021) Consumer recommendation prediction in online reviews using Cuckoo optimized machine learning models. Comput Electr Eng 95:107397
Jain PK, Quamer W, Pamula R (2023) Consumer sentiment analysis with aspect fusion and GAN-BERT aided adversarial learning. Expert Syst 40(4):13247
Jain PK et al (2022) Predicting airline customers’ recommendations using qualitative and quantitative contents of online reviews. Multimed Tools Applic 81(5):6979–6994
Jain PK et al. (2022) Employing BERT-DCNN with sentic knowledge base for social media sentiment analysis. J Ambient Intell Human Comput, p 10429
Jelodar H et al (2021) A NLP framework based on meaningful latent-topic detection and sentiment analysis via fuzzy lattice reasoning on youtube comments. Multimed Tools Applic 80(3):4155–4181
Augenstein I et al (2016) Stance detection with bidirectional conditional encoding. Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing. Austin, Texas, USA, pp 876–885
Abbott R et al (2016) Internet argument corpus 2.0: An sql schema for dialogic social media and the corpora to go with it. Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC’16). Portorož, Slovenia, pp 4445–4452
Srivastava A, Sutton C (2017) Autoencoding variational inference for topic models. in arXiv preprint arXiv:1703.01488
ALDayel A, Magdy W, (2021) Stance detection on social media: State of the art and trends. Inf Process Manag 58(4):102597
Sasaki A et al (2018) Predicting stances from social media posts using factorization machines. Proceedings of the 27th International Conference on Computational Linguistics. Santa Fe, New Mexico, USA, pp 3381–3390
Wang ZQ et al (2020) Neural Stance Detection With Hierarchical Linguistic Representations. IEEE-ACM Trans Audio Speech Lang Process 28:635–645
Ng LHX, Carley KM (2022) Is my stance the same as your stance? A cross validation study of stance detection datasets. Inf Process Manag 59(6):103070
Blei DM, Ng AY, Jordan MI (2003) Latent Dirichlet allocation. J Mach Learn Res 3(4–5):993–1022
Vilares D, He Y (2017) Detecting perspectives in political debates. Proceedings of the 2017 conference on empirical methods in natural language processing, pp 1573–1582
Rezende DJ, Mohamed S, Wierstra D (2014) Stochastic backpropagation and approximate inference in deep denerative models. International Conference on Machine Learning. Beijing, China, pp 1401–4082
Zhu Q, Feng Z, Li X (2018) GraphBTM: Graph enhanced autoencoded variational inference for biterm topic model. Proceedings of the 2018 conference on empirical methods in natural language processing. Brussels, Belgium, pp 4663–4672
Jin O et al (2011) Transferring topical knowledge from auxiliary long texts for short text clustering. Proceedings of the 20th ACM international conference on Information and knowledge management. Glasgow Scotland, UK, pp 2063576–2063689
Zhao YY et al (2016) Social sentiment sensor: a visualization system for topic detection and topic sentiment analysis on microblog. Multimed Tools Applic 75(15):8843–8860
Kumaran P, Chitrakala S (2023) Topic adaptive sentiment classification based community detection for social influential gauging in online social networks. Multimed Tools Applic 40:8943–8982
Thonet T et al (2016) VODUM: a topic model unifying viewpoint, topic and opinion discovery. European conference on information retrieval. Springer, Padua, Italy, p 9626
Miao Y, Yu L, Blunsom P (2016) Neural variational inference for text processing. In: International conference on machine learning. New York, USA, pp 1511–06038
Sethuraman J (1994) A Constructive Definition of Dirichlet Priors. Stat Sin 4(2):639–650
Ding R, Nallapati R, Xiang B (2018) Coherence-aware neural topic modeling. in arXiv preprint arXiv:1809.02687
Sun Q et al (2016) Exploring various linguistic features for stance detection. In: Natural Language Understanding and Intelligent Applications. Springer, pp 840–847
Ebrahimi J, Dou D, Lowd D (2016) A joint sentiment-target-stance model for stance classification in tweets. Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics. Osaka, Japan, p 1250
Wei W et al (2016) pkudblab at semeval-2016 task 6: A specific convolutional neural network system for effective stance detection. Proceedings of the 10th international workshop on semantic evaluation (SemEval-2016). San Diego, California, USA, pp 384–388
Wei-Fan Chen L-WK (2016) UTCNN: a deep learning model of stance classification on social media text. Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers. Osaka, Japan, pp 1635–1645
Du J et al (2017) Stance classification with target-specific neural attention networks. International Joint Conferences on Artificial Intelligence. Melbourne, Australia, pp 3988–3994.
Xu C et al (2018) Cross-target stance classification with self-attention networks. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. Melbourne, Australia, pp 778–783
Zulqarnain M et al (2022) An efficient two-state GRU based on feature attention mechanism for sentiment analysis. Multimedia Tools and Applications, p 26
Seok M et al (2016) Named entity recognition using word embedding as a feature. Int J Softw Eng Applic 10(2):93–104
Ghannay S et al (2016) Word embedding evaluation and combination. Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16). Portorož, Slovenia, pp 300–305
Pennington J, Socher R, Manning CD (2014) Glove: global vectors for word representation. Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP). Doha, Qatar, pp 1532–1543
Mikolov T et al (2013) Efficient estimation of word representations in vector space. In: arXiv preprint arXiv:1301.3781
Kingma DP, Welling M (2013) Auto-encoding variational bayes. In: arXiv preprint arXiv:1312.6114
Miao Y, Grefenstette E, Blunsom P (2018) Discovering discrete latent topics with neural variational inference. In: arXiv preprint arXiv:1706.00359
Feng J et al (2020) Context Reinforced Neural Topic Modeling over Short Texts. In: arXiv preprint arXiv:2008.04545
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780
He R et al (2017) An unsupervised neural attention model for aspect extraction. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Vancouver, Canada, pp 388–397
Chang J et al (2009) Reading tea leaves: How humans interpret topic models. In: Neural information processing systems. Citeseer, Vancouver, Canada
Norouzi R et al (2022) Developing tourism users’ profiles with data-driven explicit information. Mathe Probl Eng 2022:6536908
Hamad O et al (2022) StEduCov: an explored and benchmarked dataset on stance detection in tweets towards online education during COVID-19 pandemic. Big Data Cognit Comput 6(3):88
Sharaf et al (2022) An efficient hybrid stock trend prediction system during COVID-19 pandemic based on stacked-LSTM and news sentiment analysis. Multimed Tools Applic 1–33
Sheikhpour R, Berahmand K, Forouzandeh S (2023) Hessian-based semi-supervised feature selection using generalized uncorrelated constraint. Knowl-Based Syst 269
Funding
The research is based on work supported by Taiwan Ministry of Science and Technology under Grant No. MOST 107–2410-H-006 040-MY3 and MOST 108–2511-H-0 06–009. We would like to thank partially research grant supported by "Higher Education SPROUT Project" and "Center for Innovative FinTech Business Models" of National Cheng Kung University (NCKU), sponsored by the Ministry of Education, Taiwan.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Wang, H.C., Putra, C.D. & Wu, CY. A recurrent stick breaking topic model for argument stance detection. Multimed Tools Appl 83, 38241–38266 (2024). https://doi.org/10.1007/s11042-023-16829-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-16829-1