Advertisement

Stance Prediction for Russian: Data and Analysis

  • Nikita LozhnikovEmail author
  • Leon DerczynskiEmail author
  • Manuel MazzaraEmail author
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 925)

Abstract

Stance detection is a critical component of rumour and fake news identification. It involves the extraction of the stance a particular author takes related to a given claim, both expressed in text. This paper investigates stance classification for Russian. It introduces a new dataset, RuStance, of Russian tweets and news comments from multiple sources, covering multiple stories, as well as text classification approaches to stance detection as benchmarks over this data in this language. As well as presenting this openly-available dataset, the first of its kind for Russian, the paper presents a baseline for stance prediction in the language.

References

  1. 1.
    Rapoza K (2017) These two Russian ‘fake news’ outfits get billions of hits on Facebook. https://www.forbes.com/sites/kenrapoza/2017/09/22/these-two-russian-fake-news-outfits-get-billions-of-hits-on-facebook
  2. 2.
    Zubiaga A, Aker A, Bontcheva K, Liakata M, Procter R (April 2017) Detection and resolution of rumours in social media: a survey. ArXiv e-printsGoogle Scholar
  3. 3.
    Mrowca D, Wang E, Kosson A (2017) Stance detection for fake news identificationGoogle Scholar
  4. 4.
    Ferreira W, Vlachos A (2016) Emergent: a novel data-set for stance classification. In: Proceedings of the 2016 conference of the North American chapter of the Association for Computational Linguistics: human language technologies. ACLGoogle Scholar
  5. 5.
    Anta AF, Chiroque LN, Morere P, Santos A (2013) Sentiment analysis and topic detection of spanish tweets: a comparative study of of NLP techniques. Procesamiento del lenguaje natural 50:45–52Google Scholar
  6. 6.
    Taulé M, Martí MA, Rangel FM, Rosso P, Bosco C, Patti V et al (2017) Overview of the task on stance and gender detection in tweets on Catalan independence at IberEval 2017. In: 2nd workshop on evaluation of human language technologies for Iberian languages, IberEval 2017, vol 1881, CEUR-WS, pp 157–177Google Scholar
  7. 7.
    Enikolopov R, Petrova M, Zhuravskaya E (2011) Media and political persuasion: evidence from Russia. Am Econ Rev 101(7):3253–85CrossRefGoogle Scholar
  8. 8.
    Zubiaga A, Kochkina E, Liakata M, Procter R, Lukasik M (2016) Stance classification in rumours as a sequential task exploiting the tree structure of social media conversations. arXiv preprint arXiv:1609.09028
  9. 9.
    Qazvinian V, Rosengren E, Radev DR, Mei Q (2011) Rumor has it: identifying misinformation in microblogs. In: Proceedings of the conference on empirical methods in natural language processing. Association for Computational Linguistics, pp 1589–1599Google Scholar
  10. 10.
    Lukasik M, Srijith P, Vu D, Bontcheva K, Zubiaga A, Cohn T (2016) Hawkes processes for continuous time sequence classification: an application to rumour stance classification in Twitter. In: Proceedings of 54th annual meeting of the Association for Computational Linguistics. Association for Computational Linguistics, pp 393–398Google Scholar
  11. 11.
    Meduza (2018) http://meduza.io
  12. 12.
    Channel RT TV (2018) Russia Today. https://rt.com
  13. 13.
    The Guardian (2017) Russia’s ‘irrefutable evidence’ of us help for ISIS appears to be video game still. https://www.theguardian.com/world/2017/nov/14/russia-us-isis-syria-video-game-still
  14. 14.
  15. 15.
  16. 16.
    Channel RT TV (2017) Russia Today: on Russian president candidates 2018. https://russian.rt.com/inotv/2017-11-14/Rukovoditel-internet-kampanii-Sobchak-eyo-uchastie
  17. 17.
    Derczynski L, Bontcheva K (2014) PHEME: veracity in digital social networks. In: UMAP workshopsGoogle Scholar
  18. 18.
    Pomerleau D, Rao D (2017) Fake news challenge. http://www.fakenewschallenge.org
  19. 19.
    Anonymous (2018) Data statements for NLP: toward mitigating system bias and enabling better science. OpenReview.net
  20. 20.
    Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Advances in neural information processing systems, pp 3111–3119Google Scholar
  21. 21.
  22. 22.
    Chollet F et al (2015) Keras. https://github.com/keras-team/keras
  23. 23.
    Liu X, Nourbakhsh A, Li Q, Fang R, Shah S (2015) Real-time rumor debunking on Twitter. In: Proceedings of the 24th ACM international on conference on information and knowledge management. ACM, pp 1867–1870Google Scholar
  24. 24.
    Kutuzov A, Kuzmenko E (2017) WebVectors: a toolkit for building web interfaces for vector semantic models. Springer, Cham, pp 155–161Google Scholar
  25. 25.
    Ghulati D (2016) Introducing factmata—artificial intelligence for political fact-checking. https://medium.com/factmata/introducing-factmata-artificial-intelligence-for-political-fact-checking-db8acdbf4cf1
  26. 26.
    Baird YPS, Sibley D (2017) Talos targets disinformation with fake news challenge victory. http://blog.talosintelligence.com/2017/06/talos-fake-news-challenge.html
  27. 27.
    Řehůřek R, Sojka P (May 2010) Software framework for topic modelling with large corpora. In: Proceedings of the LREC 2010 workshop on new challenges for NLP frameworks. ELRA, Valletta, pp 45–50. http://is.muni.cz/publication/884893/en
  28. 28.
    Derczynski L, Bontcheva K, Liakata M, Procter R, Hoi GWS, Zubiaga A (2017) SemEval-2017 task 8: RumourEval: determining rumour veracity and support for rumours. arXiv preprint arXiv:1704.05972
  29. 29.
    Mohammad SM, Sobhani P, Kiritchenko S (2017) Stance and sentiment in tweets. ACM Trans. Internet Technol. (TOIT) 17(3):26CrossRefGoogle Scholar
  30. 30.
    Aker A, Derczynski L, Bontcheva K (2017) Simple open stance classification for rumour analysis. In: Proceedings of RANLPGoogle Scholar
  31. 31.
    Ruder S, Glover J, Mehrabani A, Ghaffari P (2018) 360\({}^\circ \) stance detection. In: Proceedings of the 2018 conference of the North American chapter of the Association for Computational Linguistics: demonstrations. Association for Computational Linguistics, pp 31–35Google Scholar
  32. 32.
    Thorne J, Vlachos A, Christodoulopoulos C, Mittal A (2018) Fever: a large-scale dataset for fact extraction and verification. In: Proceedings of the 2018 conference of the North American chapter of the Association for Computational Linguistics: human language technologies (long papers), vol 1. Association for Computational Linguistics, pp 809–819Google Scholar
  33. 33.
    Kochkina E, Liakata M, Augenstein I (2017) Turing at SemEval-2017 task 8: sequential approach to rumour stance classification with branch-LSTM. In: Proceedings of SemEval (2017)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2020

Authors and Affiliations

  1. 1.Innopolis UniversityInnopolisRussian Federation
  2. 2.ITU CopenhagenCopenhagenDenmark

Personalised recommendations