Automatic Classification and Linguistic Analysis of Extremist Online Material

  • Juan Soler-CompanyEmail author
  • Leo Wanner
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11296)


The growth of the Internet in the last decade has created great opportunities for sharing content and opinions at a global scale. While this may look like a completely positive feature, it also facilitates the dissemination of discriminative material, propaganda calling for violence, etc. We present a system for recognition, classification and inspection of this kind of material in terms of different characteristics and identification of its authors. The system is illustrated using different sources – including Jihadist magazines and White Supremacist forum posts. We show experiments on the detection of offensive content, on its classification and provide a visualization and enrichment of extremist data.


Extremist material Abusive content Hate speech Classification 


  1. 1.
    Abbasi, A.: Affect intensity analysis of dark web forums. In: 2007 IEEE Intelligence and Security Informatics, pp. 282–288. IEEE (2007)Google Scholar
  2. 2.
    Corcoglioniti, F., Rospocher, M., Palmero Aprosio, A.: Frame-based ontology population with pikes. IEEE Trans. Knowl. Data Eng. 28(12), 3261–3275 (2016)CrossRefGoogle Scholar
  3. 3.
    Djuric, N., Zhou, J., Morris, R., Grbovic, M., Radosavljevic, V., Bhamidipati, N.: Hate speech detection with comment embeddings. In: Proceedings of the 24th International Conference on World Wide Web, WWW 2015 Companion, pp. 29–30. ACM, New York (2015).
  4. 4.
    Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, pp. 3111–3119 (2013)Google Scholar
  5. 5.
    Rudinac, S., Gornishka, I., Worring, M.: Multimodal classification of violent online political extremism content with graph convolutional networks. In: Proceedings of the on Thematic Workshops of ACM Multimedia 2017, Thematic Workshops 2017, pp. 245–252. ACM, New York (2017).
  6. 6.
    Smedt, T.D., Pauw, G.D., Ostaeyen, P.V.: Automatic detection of online jihadist hate speech. CoRR (2018)Google Scholar
  7. 7.
    Soler-Company, J., Wanner, L.: On the relevance of syntactic and discourse features for author profiling and identification. In: Proceedings of the Conference of the European Chapter of the Association for Computational Linguistics, pp. 681–687 (2017)Google Scholar
  8. 8.
    Xu, J., Lu, T.C., et al.: Automated classification of extremist twitter accounts using content-based and network-based features. In: 2016 IEEE International Conference on Big Data (Big Data), pp. 2545–2549. IEEE (2016)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  1. 1.TALN GroupPompeu Fabra UniversityBarcelonaSpain
  2. 2.ICREABarcelonaSpain

Personalised recommendations