Argument Extraction from News, Blogs, and Social Media

  • Theodosis Goudas
  • Christos Louizos
  • Georgios Petasis
  • Vangelis Karkaletsis
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8445)


Argument extraction is the task of identifying arguments, along with their components in text. Arguments can be usually decomposed into a claim and one or more premises justifying it. Among the novel aspects of this work is the thematic domain itself which relates to Social Media, in contrast to traditional research in the area, which concentrates mainly on law documents and scientific publications. The huge increase of social media communities, along with their user tendency to debate, makes the identification of arguments in these texts a necessity. Argument extraction from Social Media is more challenging because texts may not always contain arguments, as is the case of legal documents or scientific publications usually studied. In addition, being less formal in nature, texts in Social Media may not even have proper syntax or spelling. This paper presents a two-step approach for argument extraction from social media texts. During the first step, the proposed approach tries to classify the sentences into “sentences that contain arguments” and “sentences that don’t contain arguments”. In the second step, it tries to identify the exact fragments that contain the premises from the sentences that contain arguments, by utilizing conditional random fields. The results exceed significantly the base line approach, and according to literature, are quite promising.


Support Vector Machine Social Medium Conditional Random Field Argumentation Scheme Thematic Domain 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Besnard, P., Hunter, A.: Elements of Argumentation. MIT Press (2008)Google Scholar
  2. 2.
    Blair, J., Anthony Tindale, C.W.: Groundwork in the Theory of Argumentation. Argumentation library, vol. 21. Springer (2012)Google Scholar
  3. 3.
    Cohen, C., Copi, I.M.: Introduction to Logic, 11th edn. Pearson Education (2001)Google Scholar
  4. 4.
    Palau, R.M., Moens, M.F.: Argumentation mining: the detection, classification and structure of arguments in text. In: ICAIL, pp. 98–107. ACM (2009)Google Scholar
  5. 5.
    Wyner, A., Schneider, J., Atkinson, K., Bench-Capon, T.: Semi-automated argumentative analysis of online product reviews. In: Proceedings of the 4th International Conference on Computational Models of Argument, COMMA 2012 (2012)Google Scholar
  6. 6.
    Schneider, J., Groza, T., Passant, A.: A review of argumentation for the social semantic web. Semantic Web 4(2), 159–218 (2013)Google Scholar
  7. 7.
    Moens, M.F., Boiy, E., Palau, R.M., Reed, C.: Automatic detection of arguments in legal texts. In: ICAIL, pp. 225–230. ACM (2007)Google Scholar
  8. 8.
    Berger, A.L., Pietra, V.J.D., Pietra, S.A.D.: A maximum entropy approach to natural language processing. Comput. Linguist. 22(1), 39–71 (1996)Google Scholar
  9. 9.
    Nir Friedman, D.G., Goldszmidt, M.: Bayesian network classifiers. Machine Learning 29, 131–163 (1997)CrossRefzbMATHGoogle Scholar
  10. 10.
    Cortes, C., Vapnik, V.: Support-vector networks. Machine Learning 20(3), 273–297 (1995)zbMATHGoogle Scholar
  11. 11.
    Mochales, R., Ieven, A.: Creating an argumentation corpus: Do theories apply to real arguments?: A case study on the legal argumentation of the echr. In: Proceedings of the 12th International Conference on Artificial Intelligence and Law, ICAIL 2009, pp. 21–30. ACM, New York (2009)Google Scholar
  12. 12.
    Angrosh, M.A., Cranefield, S., Stanger, N.: Ontology-based modelling of related work sections in research articles: Using crfs for developing semantic data based information retrieval systems. In: Proceedings of the 6th International Conference on Semantic Systems, I-SEMANTICS 2010, pp. 14:1–14:10. ACM, New York (2010)Google Scholar
  13. 13.
    Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: Proc. 18th International Conf. on Machine Learning, pp. 282–289. Morgan Kauffmann (2001)Google Scholar
  14. 14.
    Schneider, J., Wyner, A.: Identifying consumers’ arguments in text. In: Maynard, D., van Erp, M., Davis, B. (eds.) SWAIE. CEUR Workshop Proceedings, vol. 925, pp. 31–42. (2012)Google Scholar
  15. 15.
    Colosimo, M.S.B.: Logistic regression analysis for experimental determination of forming limit diagrams. International Journal of Machine Tools and Manufacture 46(6), 673–682 (2006)CrossRefGoogle Scholar
  16. 16.
    Leo, B.: Random forests. Machine Learning 45(1), 5–32 (2001)CrossRefzbMATHMathSciNetGoogle Scholar
  17. 17.
    Manning, C.D.: Prabhakar Raghavan, H.S.: Introduction to Information Retrieval. Cambridge University Press (2008)Google Scholar
  18. 18.
    Reed, C., Rowe, G.: Araucaria: Software for argument analysis, diagramming and representation. International Journal of AI Tools 14, 961–980 (2004)CrossRefGoogle Scholar
  19. 19.
    Palau, R.M., Moens, M.F.: Argumentation mining. Artif. Intell. Law 19(1), 1–22 (2011)CrossRefGoogle Scholar
  20. 20.
    Petasis, G., Karkaletsis, V., Paliouras, G., Androutsopoulos, I., Spyropoulos, C.: Ellogon: A new text engineering platform. In: Third International Conference on Language Resources and Evaluation (2002)Google Scholar
  21. 21.
    Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The weka data mining software: An update. SIGKDD Explorations 11(1) (2009)Google Scholar
  22. 22.
    Florou, E., Konstantopoulos, S., Koukourikos, A., Karampiperis, P.: Argument extraction for supporting public policy formulation. In: Proceedings of the 7th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, pp. 49–54. Association for Computational Linguistics, Sofia (August 2013)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Theodosis Goudas
    • 1
  • Christos Louizos
    • 2
  • Georgios Petasis
    • 3
  • Vangelis Karkaletsis
    • 3
  1. 1.Department of Digital SystemsUniversity of PiraeusAthensGreece
  2. 2.Department of Informatics & TelecommunicationsUniversity of AthensAthensGreece
  3. 3.Software and Knowledge Engineering Laboratory, Institute of Informatics and TelecommunicationsNational Centre for Scientific Research (N.C.S.R.) “Demokritos”AthensGreece

Personalised recommendations