Abstract
Argumentation mining is the automatic extraction of arguments and their relations from texts. The field has evolved rapidly in recent years, but almost no research exists for the Russian language. The present study attempts to close this gap. First, we create the first argument-annotated corpus of Russian, based on the Argumentative Microtext Corpus, and make it publicly available. Second, we study the importance of various feature types; contextual and lexical features turn out to be the most significant. Third, we evaluate the performance of various classifiers for argumentation mining; Bagging and XGBoost classifiers give the best results. Fourth, we assess whether several machine translation systems (Google Translate, Yandex.Translate and Promt) can be used to create argument-annotated corpora automatically. Google Translate appears to be the best system for this purpose.
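The idea of creating an argument-annotated corpus through machine translation can be illustrated with a minimal sketch. It assumes the source corpus is already split into segments, each carrying one annotation label, so that translating segment by segment lets the labels carry over unchanged. The `Segment` type, the label names, the `project_annotations` helper and the toy lexicon are all hypothetical; the abstract does not describe the authors' actual pipeline, and a real setup would call a machine translation system in place of `toy_translate`.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class Segment:
    """One annotated text segment (hypothetical structure)."""
    text: str
    role: str  # illustrative label set, e.g. "claim" or "premise"


def project_annotations(
    segments: List[Segment],
    translate: Callable[[str], str],
) -> List[Segment]:
    """Translate each segment on its own and copy its label unchanged."""
    return [Segment(text=translate(seg.text), role=seg.role) for seg in segments]


# Toy stand-in for a machine translation system (assumption: a real
# system would be called here instead of a lookup table).
TOY_LEXICON = {
    "Smoking should be banned in public places.":
        "Курение следует запретить в общественных местах.",
    "It harms bystanders.": "Оно вредит окружающим.",
}


def toy_translate(text: str) -> str:
    """Return the toy translation, or the input itself if it is unknown."""
    return TOY_LEXICON.get(text, text)


source = [
    Segment("Smoking should be banned in public places.", "claim"),
    Segment("It harms bystanders.", "premise"),
]
target = project_annotations(source, toy_translate)

for seg in target:
    print(f"{seg.role}: {seg.text}")
```

Translating segments in isolation keeps the alignment between text and labels trivial, at the cost of giving the translation system less context than a whole-text translation would.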
Acknowledgments
The reported study was jointly financed by the German Academic Exchange Service (DAAD) and the Ministry of Education and Science of the Russian Federation within the “Michail Lomonosov” programme (2018).
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Fishcheva, I., Kotelnikov, E. (2019). Cross-Lingual Argumentation Mining for Russian Texts. In: van der Aalst, W., et al. Analysis of Images, Social Networks and Texts. AIST 2019. Lecture Notes in Computer Science, vol 11832. Springer, Cham. https://doi.org/10.1007/978-3-030-37334-4_12
DOI: https://doi.org/10.1007/978-3-030-37334-4_12
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-37333-7
Online ISBN: 978-3-030-37334-4
eBook Packages: Computer Science, Computer Science (R0)