Big Five Personality Traits and Ensemble Machine Learning to Detect Cyber-Violence in Social Media

  • Randa ZarnoufiEmail author
  • Mounia Abik
Conference paper
Part of the Learning and Analytics in Intelligent Systems book series (LAIS, volume 7)


Cyber-violence is a largely addressed problem in e-health researches, its focus is the detection of harmful behavior from online user-generated content in order to prevent and protect victims. In this work, we show how big five personality traits are correlated to the violent behavior of the cyber-violence perpetrator. We use a set of ensemble learning algorithms with engineered features related to the vocabulary used in each Big Five personality trait namely, Agreeableness, Conscientiousness, Extraversion, Neuroticism and Openness. The findings show a significant association between the individuals’ personality state and the harmful intention. This result can be a good indicator of online users’ susceptibility to cyber-violence and therefore can help in dealing with it.


E-Health Cyber-violence Social networks Harmful behavior Big Five personality Ensemble Machine Learning Features engineering 


  1. 1.
    Van Geel, M., Goemans, A., Toprak, F., Vedder, P.: Which personality traits are related to traditional bullying and cyberbullying? A study with the Big Five, Dark Triad and sadism. PAID (2016)Google Scholar
  2. 2.
    Salawu, S., He, Y., Lumsden, J.: Approaches to automated detection of cyberbullying: a survey. IEEE Trans. Affect. Comput. 3045, 1–20 (2017)CrossRefGoogle Scholar
  3. 3.
    Balakrishnan, V., Khan, S., Fernandez, T., Arabnia, H.R.: Personality and individual differences cyberbullying detection on Twitter using Big Five and Dark Triad features. Pers. Individ. Dif. 141, 252–257 (2019)CrossRefGoogle Scholar
  4. 4.
    Dadvar, M., Ordelman, R., De Jong, F., Trieschnigg, D.: Towards user modelling in the combat against cyberbullying. In: Natural Language Processing and Information Systems, pp. 277–283 (2012)CrossRefGoogle Scholar
  5. 5.
    Al-garadi, M.A., Varathan, K.D., Ravana, S.D.: Cybercrime detection in online communications: the experimental case of cyberbullying detection in the Twitter network. Comput. Human Behav. 63, 433–443 (2016)CrossRefGoogle Scholar
  6. 6.
    Chatzakou, D., Kourtellis, N., Blackburn, J., De Cristofaro, E., Stringhini, G., Vakali, A.: Mean birds: detecting aggression and bullying on Twitter. In: Proceedings of the 2017 ACM on Web Science Conference, New York, USA, pp. 13–22 (2017)Google Scholar
  7. 7.
    Dadvar, M., de Jong, F., Ordelman, R., Trieschnigg, D.: Improved cyberbullying detection using gender information. In: DIR 2012 (2012)Google Scholar
  8. 8.
    Hosseinmardi, H., Mattson, S.A., Rafiq, R.I., Han, R., Lv, Q., Mishra, S.: Detection of cyberbullying incidents on the Instagram social network. In: 13th Annual International Conference on Mobile Systems, Applications, and Services, Florence. 18–22 May 2015, p. 481. ACM (2015)Google Scholar
  9. 9.
    Robinson, D., Zhang, Z., Tepper, J.: Hate speech detection on Twitter: feature engineering vs feature selection. In: ESWC, pp. 1–4 (2018)Google Scholar
  10. 10.
    Stillwell, D., Matz, S.: Latent human traits in the language of social media: an open-vocabulary approach. PLoS ONE 13(11), e0201703 (2018)CrossRefGoogle Scholar
  11. 11.
    Waseem, Z., Hovy, D.: Hateful symbols or hateful people? Predictive features for hate speech detection on Twitter. In: Proceedings of NAACL-HLT, pp. 88–93 (2016)Google Scholar
  12. 12.
    Schwartz, H.A., Eichstaedt, J.C., Kern, M.L., Dziurzynski, L., Ramones, S.M., Agrawal, M., et al.: Personality, gender, and age in the language of social media: the open-vocabulary approach. PLoS ONE 8, e73791 (2013)CrossRefGoogle Scholar
  13. 13.
    Pennebaker, J.W., Boyd, R.L., Jordan, K., Blackburn, K.: The Development and Psychometric Properties of LIWC2015 (2015)Google Scholar
  14. 14.
    Chawla, N.V., Bowyer, K.W., Hall, L.O.: SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002)CrossRefGoogle Scholar
  15. 15.
    Rezvan, M., Shalin, V.L., Sheth, A.: A quality type-aware annotated corpus and lexicon for harassment research. In: WebSci 2018, Web Science. ACM (2018)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2020

Authors and Affiliations

  1. 1.IPSS Research Team, FSRMohammed V University in RabatRabatMorocco
  2. 2.IPSS Research Team, ENSIASMohammed V University in RabatRabatMorocco

Personalised recommendations