Skip to main content

Machine Learning for Suicidal Ideation Identification on Twitter for the Portuguese Language

  • Conference paper
  • First Online:
Intelligent Systems (BRACIS 2020)

Abstract

Suicidal ideation is one of the main predictors of the risk of suicide attempt and can be described as thoughts, ideas, planning, and desire to commit suicide. Fast detection of such ideation in early stages is essential for effective treatment. Many expressions of suicidal ideation can be found in publications in social networks, especially by young people. Previous works explore the automatic detection of suicidal ideation in social networks for the English language using machine learning algorithms. In this work, we present the first exploration of machine learning algorithms for suicidal ideation detection for the Portuguese language. We compared three classifiers in Twitter data: SVM, LSTM, and BERT (multilingual and Portuguese). Results suggest that BERT is effective for suicidal ideation identification in Portuguese data, achieving 79% of F1 score and less than 9% false negative score.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://developer.twitter.com/.

  2. 2.

    https://github.com/viniciosfaustino/suicide-detection.

  3. 3.

    https://github.com/viniciosfaustino/tweet-collector.

References

  1. Agarwal, A., Xie, B., Vovsha, I., Rambow, O., Passonneau, R.J.: Sentiment analysis of twitter data. In: Proceedings of the Workshop on Language in Social Media (LSM 2011), pp. 30–38 (2011)

    Google Scholar 

  2. Aggarwal, C.C.: Neural Networks and Deep Learning. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-94463-0

    Book  MATH  Google Scholar 

  3. Araújo, L.D.C., Vieira, K.F.L., Coutinho, M.D.P.D.L.: Ideação suicida na adolescência: um enfoque psicossociológico no contexto do ensino médio. Psico-USF 15(1), 47–57 (2010)

    Google Scholar 

  4. Berman, A.L., Silverman, M.M., Bongar, B.M.: Comprehensive Textbook of Suicidology. Guilford Press, New York (2000)

    Google Scholar 

  5. Birjali, M., Beni-Hssane, A., Erritali, M.: Machine learning and semantic sentiment analysis based algorithms for suicide sentiment prediction in social networks. Procedia Comput. Sci. 113, 65–72 (2017)

    Article  Google Scholar 

  6. Boden, M.: A guide to recurrent neural networks and back propagation. The Dallas project (2002)

    Google Scholar 

  7. Borges, V.R., Werlang, B.S.G.: Estudo de ideação suicida em adolescentes de 15 a 19 anos. Estudos de Psicologia (Natal) 11(3), 345–351 (2006)

    Article  Google Scholar 

  8. Brasil, Ministério da Saúde, S.D.V.E.S.: Perfil epidemiológico das tentativas e óbitos por suicídio no brasil ea rede de atenção à saúde. Bol Epidemiol 48 (2017)

    Google Scholar 

  9. Burnap, P., Colombo, W., Scourfield, J.: Machine classification and analysis of suicide-related communication on Twitter. In: Proceedings of the 26th ACM Conference on Hypertext and Social Media, pp. 75–84 (2015)

    Google Scholar 

  10. Cho, K., Van Merriënboer, B., Bahdanau, D., Bengio, Y.: On the properties of neural machine translation: encoder-decoder approaches. arXiv preprint arXiv:1409.1259 (2014)

  11. Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., Kuksa, P.: Natural language processing (almost) from scratch. J. Mach. Learn. Res. 12, 2493–2537 (2011)

    MATH  Google Scholar 

  12. Colombo, G.B., Burnap, P., Hodorog, A., Scourfield, J.: Analysing the connectivity and communication of suicidal users on Twitter. Comput. Commun. 73, 291–300 (2016)

    Article  Google Scholar 

  13. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 4171–4186 (2019)

    Google Scholar 

  14. Du, J.: Extracting psychiatric stressors for suicide from social media using deep learning. BMC Med. Inform. Decis. Mak. 18(2), 43 (2018)

    Article  Google Scholar 

  15. Goldberg, Y.: Neural Network Methods in Natural Language Processing. Morgan and Claypool Publishers, San Rafael (2017)

    Book  Google Scholar 

  16. Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. The MIT Press, Cambridge (2016)

    MATH  Google Scholar 

  17. Graves, A.: Long short-term memory. In: Supervised Sequence Labelling with Recurrent Neural Networks, pp. 37–45. Springer (2012). https://doi.org/10.1007/978-3-642-24797-2_4

  18. Hartmann, N., Fonseca, E., Shulby, C., Treviso, M., Rodrigues, J., Aluisio, S.: Portuguese word embeddings: evaluating on word analogies and natural language tasks. arXiv preprint arXiv:1708.06025 (2017)

  19. Hochreiter, S., Bengio, Y., Frasconi, P., Schmidhuber, J., et al.: Gradient flow in recurrent nets: the difficulty of learning long-term dependencies (2001)

    Google Scholar 

  20. Ji, S., Yu, C.P., Fung, S.F., Pan, S., Long, G.: Supervised learning for suicidal ideation detection in online user content. Complexity 2018 (2018)

    Google Scholar 

  21. Joachims, T.: Text categorization with support vector machines: learning with many relevant features. In: Nédellec, C., Rouveirol, C. (eds.) ECML 1998. LNCS, vol. 1398, pp. 137–142. Springer, Heidelberg (1998). https://doi.org/10.1007/BFb0026683

    Chapter  Google Scholar 

  22. Manning, C.D., Schütze, H.: Foundations of Statistical Natural Language Processing. MIT Press, Cambridge (1999)

    MATH  Google Scholar 

  23. Merity, S., Keskar, N.S., Socher, R.: Regularizing and optimizing LSTM language models. arXiv preprint arXiv:1708.02182 (2017)

  24. Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Burges, C.J.C., Bottou, L., Welling, M., Ghahramani, Z., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems 26, pp. 3111–3119. Curran Associates, Inc. (2013)

    Google Scholar 

  25. Mitchell, T.M.: Machine Learning, 1st edn. McGraw-Hill Inc, New York (1997)

    MATH  Google Scholar 

  26. Nadeem, M.: Identifying depression on twitter. arXiv preprint arXiv:1607.07384 (2016)

  27. O’dea, B., Wan, S., Batterham, P.J., Calear, A.L., Paris, C., Christensen, H.: Detecting suicidality on Twitter. Internet Interv. 2(2), 183–188 (2015)

    Google Scholar 

  28. Orabi, A.H., Buddhitha, P., Orabi, M.H., Inkpen, D.: Deep learning for depression detection of Twitter users. In: Proceedings of the Fifth Workshop on Computational Linguistics and Clinical Psychology: From Keyboard to Clinic, pp. 88–97 (2018)

    Google Scholar 

  29. Organization, W.H., et al.: Preventing suicide: a global imperative. World Health Organization (2014)

    Google Scholar 

  30. Pereira, C.C.M., Botti, N.C.L.: O suicídio na comunicação das redes sociais virtuais: revisão integrativa da literatura. Revista Portuguesa de Enfermagem de Saúde Mental 17, 17–24 (2017)

    Google Scholar 

  31. Qi, Y., Sachan, D.S., Felix, M., Padmanabhan, S.J., Neubig, G.: When and why are pre-trained word embeddings useful for neural machine translation? arXiv preprint arXiv:1804.06323 (2018)

  32. Recupero, P.R., Harms, S.E., Noble, J.M.: Googling suicide: surfing for suicide information on the internet. J. Clin. Psychiatry (2008)

    Google Scholar 

  33. Robertson, L., Skegg, K., Poore, M., Williams, S., Taylor, B.: An adolescent suicide cluster and the possible role of electronic communication technology. Crisis (2012)

    Google Scholar 

  34. Sawhney, R., Manchanda, P., Singh, R., Aggarwal, S.: A computational approach to feature extraction for identification of suicidal ideation in tweets. In: Proceedings of ACL 2018, Student Research Workshop, pp. 91–98 (2018)

    Google Scholar 

  35. Sebastiani, F.: Machine learning in automated text categorization. ACM Comput. Surv. 34(1), 1–47 (2002)

    Article  Google Scholar 

  36. Sher, L.: Preventing suicide. QJM 97(10), 677–680 (2004)

    Article  Google Scholar 

  37. Souza, F., Nogueira, R., Lotufo, R.: Portuguese named entity recognition using BERT-CRF. arXiv preprint arXiv:1909.10649 (2019)

  38. Souza, L.D.D.M., et al.: Ideação suicida na adolescência: prevalência e fatores associados. Jornal Brasileiro de Psiquiatria 59(4), 286–292 (2010)

    Article  Google Scholar 

  39. Turian, J., Ratinov, L., Bengio, Y.: Word representations: a simple and general method for semi-supervised learning. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp. 384–394. Association for Computational Linguistics (2010)

    Google Scholar 

  40. Vasconcelos-Raposo, J., Soares, A.R., Silva, F., Fernandes, M.G., Teixeira, C.M.: Níveis de ideação suicida em jovens adultos. Estudos de psicologia 33(2), 345–354 (2016)

    Article  Google Scholar 

  41. Vaswani, A., et al.: Attention is all you need. arXiv preprint arXiv:1706.03762 10 (2017)

  42. Werlang, B.S.G., Borges, V.R., Fensterseifer, L.: Fatores de risco ou proteção para a presençade ideação suicida na adolescência. Interam. J. Psychol. 39(2), 259–266 (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Bruno Magalhães Nogueira .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

de Carvalho, V.F., Giacon, B., Nascimento, C., Nogueira, B.M. (2020). Machine Learning for Suicidal Ideation Identification on Twitter for the Portuguese Language. In: Cerri, R., Prati, R.C. (eds) Intelligent Systems. BRACIS 2020. Lecture Notes in Computer Science(), vol 12319. Springer, Cham. https://doi.org/10.1007/978-3-030-61377-8_37

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-61377-8_37

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-61376-1

  • Online ISBN: 978-3-030-61377-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics