A Semantic Graph-Based Approach for Radicalisation Detection on Social Media

  • Hassan Saif
  • Thomas Dickinson
  • Leon Kastler
  • Miriam Fernandez
  • Harith Alani
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10249)

Abstract

From its start, the so-called Islamic State of Iraq and the Levant (ISIL/ISIS) has been successfully exploiting social media networks, most notoriously Twitter, to promote its propaganda and recruit new members, resulting in thousands of social media users adopting a pro-ISIS stance every year. Automatic identification of pro-ISIS users on social media has, thus, become the centre of interest for various governmental and research organisations. In this paper we propose a semantic graph-based approach for radicalisation detection on Twitter. Unlike previous works, which mainly rely on the lexical representation of the content published by Twitter users, our approach extracts and makes use of the underlying semantics of words exhibited by these users to identify their pro/anti-ISIS stances. Our results show that classifiers trained from semantic features outperform those trained from lexical, sentiment, topic and network features by 7.8% on average F1-measure.
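The results above are reported in F1-measure. As a reminder of what that metric captures, the following is a minimal sketch of a per-class F1 and its unweighted average over the pro/anti-ISIS stance labels. The label strings, the toy data, and the choice of a macro average over the two classes are assumptions made for illustration; the abstract does not specify how the average is taken.

```python
from typing import Sequence


def f1_score(gold: Sequence[str], predicted: Sequence[str], positive: str) -> float:
    """F1 for one class: the harmonic mean of precision and recall."""
    pairs = list(zip(gold, predicted))
    tp = sum(1 for g, p in pairs if g == positive and p == positive)
    fp = sum(1 for g, p in pairs if g != positive and p == positive)
    fn = sum(1 for g, p in pairs if g == positive and p != positive)
    if tp == 0:
        # No true positives: precision or recall is zero (or undefined).
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def average_f1(gold: Sequence[str], predicted: Sequence[str],
               labels: Sequence[str] = ("pro", "anti")) -> float:
    """Unweighted (macro) average of per-class F1 (an assumed averaging scheme)."""
    return sum(f1_score(gold, predicted, label) for label in labels) / len(labels)


# Hypothetical stance labels for six users (illustrative only, not the paper's data).
gold = ["pro", "pro", "anti", "anti", "pro", "anti"]
predicted = ["pro", "anti", "anti", "anti", "pro", "pro"]

print(f"F1 (pro):   {f1_score(gold, predicted, 'pro'):.3f}")
print(f"F1 (anti):  {f1_score(gold, predicted, 'anti'):.3f}")
print(f"Average F1: {average_f1(gold, predicted):.3f}")
```

Comparing two feature sets under this metric amounts to computing the average F1 of each trained classifier on the same gold labels and taking the difference.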

Keywords

Radicalisation detection, Semantics, Feature engineering, Twitter

Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  • Hassan Saif (1)
  • Thomas Dickinson (1)
  • Leon Kastler (2)
  • Miriam Fernandez (1)
  • Harith Alani (1)

  1. Knowledge Media Institute, The Open University, Milton Keynes, UK
  2. University of Koblenz Landau, Mainz, Germany
