Skip to main content
Log in

A recurrent stick breaking topic model for argument stance detection

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Debate websites are valuable social media platforms for discussing controversial issues and gaining insights into diverse perspectives. However, with thousands of arguments on popular topics, browsing through entire discussions can be too time-consuming to extract a summary of the main viewpoints. To address this challenge, natural language processing techniques have been proposed to automatically identify argument structures and determine a user's stance on an issue. In addition to detecting stances, identifying topics that remain contentious between factions is especially important for policy-makers. However, the topics discussed by users with the same stance may vary in importance. To extract potential text topics, models based on probability and neural topic models have been proposed, with the latter being more effective due to improved computational cost and the ability to extract more consistent topics. This research proposes a two-stage stance detection model, which combines a neural topic model based on variational autoencoder and a recurrent neural network to learn the characteristics of an argument and enhance the model with subtopic features. The experimental results show that the model can obtain more coherent subtopics with a coherence of 0.42, and achieve an accuracy of nearly 70% in stance detection, demonstrating the effectiveness of the subtopic feature detection process.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

Data availability

Data sharing does not apply to this article, as no datasets were generated or analyzed during the current study.

References

  1. Justo R et al (2014) Extracting relevant knowledge for the detection of sarcasm and nastiness in the social web. Knowl-Based Syst 69:124–133

    Article  Google Scholar 

  2. Rostami M et al (2023) Community Detection Algorithms in Healthcare Applications: A Systematic Review. IEEE Access 11:30247–30272

    Article  Google Scholar 

  3. Jain PK et al (2021) Consumer recommendation prediction in online reviews using Cuckoo optimized machine learning models. Comput Electr Eng 95:107397

  4. Jain PK, Quamer W, Pamula R (2023) Consumer sentiment analysis with aspect fusion and GAN-BERT aided adversarial learning. Expert Syst 40(4):13247

    Article  Google Scholar 

  5. Jain PK et al (2022) Predicting airline customers’ recommendations using qualitative and quantitative contents of online reviews. Multimed Tools Applic 81(5):6979–6994

    Article  Google Scholar 

  6. Jain PK et al. (2022) Employing BERT-DCNN with sentic knowledge base for social media sentiment analysis. J Ambient Intell Human Comput, p 10429

  7. Jelodar H et al (2021) A NLP framework based on meaningful latent-topic detection and sentiment analysis via fuzzy lattice reasoning on youtube comments. Multimed Tools Applic 80(3):4155–4181

    Article  Google Scholar 

  8. Augenstein I et al (2016) Stance detection with bidirectional conditional encoding. Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing. Austin, Texas, USA, pp 876–885

  9. Abbott R et al (2016) Internet argument corpus 2.0: An sql schema for dialogic social media and the corpora to go with it. Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC’16). Portorož, Slovenia, pp 4445–4452

  10. Srivastava A, Sutton C (2017) Autoencoding variational inference for topic models. in arXiv preprint arXiv:1703.01488

  11. ALDayel A, Magdy W, (2021) Stance detection on social media: State of the art and trends. Inf Process Manag 58(4):102597

  12. Sasaki A et al (2018) Predicting stances from social media posts using factorization machines. Proceedings of the 27th International Conference on Computational Linguistics. Santa Fe, New Mexico, USA, pp 3381–3390

  13. Wang ZQ et al (2020) Neural Stance Detection With Hierarchical Linguistic Representations. IEEE-ACM Trans Audio Speech Lang Process 28:635–645

    Article  Google Scholar 

  14. Ng LHX, Carley KM (2022) Is my stance the same as your stance? A cross validation study of stance detection datasets. Inf Process Manag 59(6):103070

  15. Blei DM, Ng AY, Jordan MI (2003) Latent Dirichlet allocation. J Mach Learn Res 3(4–5):993–1022

    Google Scholar 

  16. Vilares D, He Y (2017) Detecting perspectives in political debates. Proceedings of the 2017 conference on empirical methods in natural language processing, pp 1573–1582

  17. Rezende DJ, Mohamed S, Wierstra D (2014) Stochastic backpropagation and approximate inference in deep denerative models. International Conference on Machine Learning. Beijing, China, pp 1401–4082

  18. Zhu Q, Feng Z, Li X (2018) GraphBTM: Graph enhanced autoencoded variational inference for biterm topic model. Proceedings of the 2018 conference on empirical methods in natural language processing. Brussels, Belgium, pp 4663–4672

  19. Jin O et al (2011) Transferring topical knowledge from auxiliary long texts for short text clustering. Proceedings of the 20th ACM international conference on Information and knowledge management. Glasgow Scotland, UK, pp 2063576–2063689

  20. Zhao YY et al (2016) Social sentiment sensor: a visualization system for topic detection and topic sentiment analysis on microblog. Multimed Tools Applic 75(15):8843–8860

    Article  Google Scholar 

  21. Kumaran P, Chitrakala S (2023) Topic adaptive sentiment classification based community detection for social influential gauging in online social networks. Multimed Tools Applic 40:8943–8982

    Article  Google Scholar 

  22. Thonet T et al (2016) VODUM: a topic model unifying viewpoint, topic and opinion discovery. European conference on information retrieval. Springer, Padua, Italy, p 9626

  23. Miao Y, Yu L, Blunsom P (2016) Neural variational inference for text processing. In: International conference on machine learning. New York, USA, pp 1511–06038

  24. Sethuraman J (1994) A Constructive Definition of Dirichlet Priors. Stat Sin 4(2):639–650

    MathSciNet  Google Scholar 

  25. Ding R, Nallapati R, Xiang B (2018) Coherence-aware neural topic modeling. in arXiv preprint arXiv:1809.02687

  26. Sun Q et al (2016) Exploring various linguistic features for stance detection. In: Natural Language Understanding and Intelligent Applications. Springer, pp 840–847

  27. Ebrahimi J, Dou D, Lowd D (2016) A joint sentiment-target-stance model for stance classification in tweets. Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics. Osaka, Japan, p 1250

  28. Wei W et al (2016) pkudblab at semeval-2016 task 6: A specific convolutional neural network system for effective stance detection. Proceedings of the 10th international workshop on semantic evaluation (SemEval-2016). San Diego, California, USA, pp 384–388

  29. Wei-Fan Chen L-WK (2016) UTCNN: a deep learning model of stance classification on social media text. Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers. Osaka, Japan, pp 1635–1645

  30. Du J et al (2017) Stance classification with target-specific neural attention networks. International Joint Conferences on Artificial Intelligence. Melbourne, Australia, pp  3988–3994.

  31. Xu C et al (2018) Cross-target stance classification with self-attention networks. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. Melbourne, Australia, pp 778–783

  32. Zulqarnain M et al (2022) An efficient two-state GRU based on feature attention mechanism for sentiment analysis. Multimedia Tools and Applications, p 26

  33. Seok M et al (2016) Named entity recognition using word embedding as a feature. Int J Softw Eng Applic 10(2):93–104

    Google Scholar 

  34. Ghannay S et al (2016) Word embedding evaluation and combination. Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16). Portorož, Slovenia, pp 300–305

  35. Pennington J, Socher R, Manning CD (2014) Glove: global vectors for word representation. Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP). Doha, Qatar, pp 1532–1543

  36. Mikolov T et al (2013) Efficient estimation of word representations in vector space. In: arXiv preprint arXiv:1301.3781

  37. Kingma DP, Welling M (2013) Auto-encoding variational bayes. In: arXiv preprint arXiv:1312.6114

  38. Miao Y, Grefenstette E, Blunsom P (2018) Discovering discrete latent topics with neural variational inference. In: arXiv preprint arXiv:1706.00359

  39. Feng J et al (2020) Context Reinforced Neural Topic Modeling over Short Texts. In: arXiv preprint arXiv:2008.04545

  40. Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780

    Article  Google Scholar 

  41. He R et al (2017) An unsupervised neural attention model for aspect extraction. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Vancouver, Canada, pp 388–397

  42. Chang J et al (2009) Reading tea leaves: How humans interpret topic models. In: Neural information processing systems. Citeseer, Vancouver, Canada

  43. Norouzi R et al (2022) Developing tourism users’ profiles with data-driven explicit information. Mathe Probl Eng 2022:6536908

  44. Hamad O et al (2022) StEduCov: an explored and benchmarked dataset on stance detection in tweets towards online education during COVID-19 pandemic. Big Data Cognit Comput 6(3):88

  45. Sharaf et al (2022) An efficient hybrid stock trend prediction system during COVID-19 pandemic based on stacked-LSTM and news sentiment analysis. Multimed Tools Applic 1–33

  46. Sheikhpour R, Berahmand K, Forouzandeh S (2023) Hessian-based semi-supervised feature selection using generalized uncorrelated constraint. Knowl-Based Syst 269

Download references

Funding

The research is based on work supported by Taiwan Ministry of Science and Technology under Grant No. MOST 107–2410-H-006 040-MY3 and MOST 108–2511-H-0 06–009. We would like to thank partially research grant supported by "Higher Education SPROUT Project" and "Center for Innovative FinTech Business Models" of National Cheng Kung University (NCKU), sponsored by the Ministry of Education, Taiwan.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hei -Chia Wang.

Ethics declarations

Competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, H.C., Putra, C.D. & Wu, CY. A recurrent stick breaking topic model for argument stance detection. Multimed Tools Appl 83, 38241–38266 (2024). https://doi.org/10.1007/s11042-023-16829-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-023-16829-1

Keywords

Navigation