Skip to main content

Automatic Classification of Stigmatizing Articles of Mental Illness: The Case of Portuguese Online Newspapers

  • Conference paper
  • First Online:
New Trends in Database and Information Systems (ADBIS 2022)

Abstract

The stigma related to mental health continues to be present in online newspapers, where mental diseases are often used metaphorically to refer to entities or situations outside the clinical of mental health. This project explores the implementation of Artificial Intelligence and Natural Language Processing techniques for the task of automatically classifying stigmatizing articles with references to the mental disorders of schizophrenia and psychosis. This work is implemented in Portuguese online news articles, collected from the Arquivo.pt repository, a public repository of archived Portuguese web pages, and can be adapted to other languages or similar problems. Nine machine and deep learning algorithms were implemented, most of them yielding results with a precision above 90%. In addition, the automatic detection of articles topics was also performed, through topic modeling with the top2vec model, which allowed concluding that the stigmatization of mental health occurs, essentially, in Economics and Politics related news. The results confirm the existence of stigma in Portuguese newspapers (52% of the 978 articles collected) and the effectiveness of the use of Artificial Intelligence models to detect it. Additionally, a set of 978 articles collected and manually classified with the classes [“stigmatizing”, “literal”] is obtained.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://github.com/alina-yanchuk02/stigmaClassification.

  2. 2.

    https://www.arquivo.pt/.

  3. 3.

    https://github.com/arquivo/pwa-technologies/wiki/Arquivo.pt-API.

  4. 4.

    https://www.publico.pt/.

  5. 5.

    https://observador.pt/.

  6. 6.

    https://www.dn.pt/.

  7. 7.

    https://expresso.pt/.

  8. 8.

    https://www.cmjornal.pt/.

  9. 9.

    https://www.jn.pt/.

  10. 10.

    https://www.sabado.pt/.

  11. 11.

    https://visao.sapo.pt/.

  12. 12.

    https://www.abola.pt/.

  13. 13.

    https://www.nltk.org/howto/portuguese_en.html.

  14. 14.

    http://nilc.icmc.usp.br/embeddings.

  15. 15.

    http://143.107.183.175:21380/portlex/index.php/pt/projetos/liwc.

References

  1. Abadi, M., et al.: Tensorflow: large-scale machine learning on heterogeneous distributed systems. ArXiv (2016). https://doi.org/10.48550/ARXIV.1603.04467, https://tensorflow.org/

  2. Aggarwal, C.C., Zhai, C.: A Survey of text classification algorithms. In: Aggarwal, C.C., Zhai, C. (eds.) Mining Text Data, pp. 163–222. Springer, New York (2012). https://doi.org/10.1007/978-1-4614-3223-4_6

  3. Ahmed, J., Ahmed, M.: Online news classification using machine learning techniques. IIUM Eng. J. 22(2), 210–225 (2021). https://doi.org/10.31436/iiumej.v22i2.1662

  4. Angelov, D.: Top2vec: listributed representations of topics. ArXiv abs/2008.09470 (2020). https://doi.org/10.48550/ARXIV.2008.09470

  5. Aragonès, E., López-Muntaner, J., Ceruelo, S., Basora, J.: Reinforcing stigmatization: lover. age of mental illness in Spanish newspapers. J. Health Commun. 19(11), 1248–1258 (2014). https://doi.org/10.1080/10810730.2013.872726

    Article  Google Scholar 

  6. Athanasopoulou, C., Välimäki, M.: ’Schizophrenia’ as a metaphor in Greek newspaper websites. Stud. Health Technol. Inform. 202, pp. 275–278. (2014). https://doi.org/10.3233/978-1-61499-423-7-275

  7. Bevilacqua Guarniero, F., Bellinghini, R.H., Gattaz, W.F.: The schizophrenia stigma and mass media: l search for news published by wide circulation media in Brazil. Int. Rev. Psychiatr. (Abingdon, England) 29(3), 241–247 (2017). https://doi.org/10.1080/09540261.2017.1285976

    Article  Google Scholar 

  8. Bird, Steven, E.L., Klein, E.: Natural Language Processing with Python. O’Reilly Media Inc (2009). https://www.nltk.org/

  9. Chollet, F., et al.: Keras (2015). https://keras.io

  10. Chopra, A., Doody, G.: Schizophrenia, an illness and a metaphor: analysis of the use of the term ’schizophrenia’ in the UK national newspapers. J. R Soc. Med. 100, 423–426 (2007). https://doi.org/10.1258/jrsm.100.9.423

  11. para a Comunicação Social, E.R.: Públicos e consumos de média - o consumo de notícias e as plataformas digitais em portugal e em mais dez países (2014). https://www.erc.pt/pt/estudos-e-publicacoes/consum os-de-media/estudo-publicos-e-consumos-de-media

  12. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Vol. 1 (Long and Short Papers), pp. 4171–4186. Association for Computational Linguistics (2019). https://doi.org/10.18653/v1/N19-1423

  13. Duckworth, K., Halpern, J.H., Schutt, R.K., Gillespie, C.: Use of schizophrenia as a metaphor in US newspapers. Psychiatr. Serv. (Washington, D.C.) 54(10), 1402–1404 (2003). https://doi.org/10.1176/appi.ps.54.10.1402

  14. Fundação para a Ciência e Tecnologia: Recolha de conteúdos - sobre.arquivo.pt, https://sobre.arquivo.pt/pt/ajuda/recolha-e-arquivo-de-conteudos/

  15. Gao, G., Choi, E., Choi, Y., Zettlemoyer, L.: Neural metaphor detection in context. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 607–613. Association for Computational Linguistics (2018). https://doi.org/10.18653/v1/D18-1060

  16. Hsu, B.M.: Comparison of supervised classification models on textual data. Mathematics 8(5) (2020). https://doi.org/10.3390/math8050851

  17. O’Malley, T., et al.: Kerastuner (2019). https://github.com/keras-team/keras-tuner

  18. Onan, A., Togoclu, M.: Satire identification in Turkish news articles based on ensemble of classifiers. Turk. J. Electr. Eng. Comput. Sci. 28, 1086–1106 (2020). https://doi.org/10.3906/elk-1907-11

    Article  Google Scholar 

  19. Ou-Yang, L.: Newspaper3k: aArticle scraping & curation. https://newspaper.readthedocs.io/en/latest/

  20. Paszke, A., et al.: Pytorch: An imperative style, high-performance deep learning library. In: Wallach, H., Larochelle, H., Beygelzimer, A., d’Alché-Buc, F., Fox, E., Garnett, R. (eds.) Advances in Neural Information Processing Systems 32, pp. 8024–8035. Curran Associates, Inc. (2019). https://papers.neurips.cc/paper/9015-pytorch-an-imperative-style-high-performance-deep-learning-library.pdf

  21. Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011). https://scikit-learn.org

  22. Pennebaker, J., Francis, M.: Linguistic Inquiry and Word Count. Lawrence Erlbaum Associates, Incorporated (1999). https://books.google.pt/books?id=6FnuAAAACAAJ

  23. Pennington, J., Socher, R., Manning, C.: GloVe: global vectors for word representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1532–1543. Association for Computational Linguistics, Doha, Qatar, October 2014. https://doi.org/10.3115/v1/D14-1162

  24. dos Psicólogos Portugueses, O.: Desenvolvimento sustentável e sustentabilidade dos cuidados de saúde primários (2021)

    Google Scholar 

  25. Rodrigues-Silva, N., Falcão de Almeida, T., Araújo, F., Molodynski, A., Venâncio, Bouça, J.: Use of the word schizophrenia in Portuguese newspapers. J. Mental Health (Abingdon, England) 26(5), 426–430 (2017). https://doi.org/10.1080/09638237.2016.1207231

  26. Sociedade Portuguesa de Psiquiatria e Saúde Mental: Os media e a saúde mental - análise de conteúdo de notícias publicadas por meios de comunicação social portugueses (2016). https://www.sppsm.org/informemente/apresentacao/

  27. Souza, F., Nogueira, R., Lotufo, R.: BERTimbau: pretrained BERT models for Brazilian Portuguese. In: 9th Brazilian Conference on Intelligent Systems, BRACIS, Rio Grande do Sul, Brazil, October 20–23 (to appear 2020)

    Google Scholar 

  28. Wolf, T., et al.: Transformers: State-of-the-Art Natural Language Processing, pp. 38–45. Association for Computational Linguistics, October 2020

    Google Scholar 

Download references

Acknowledgment

This work was supported by FCT - Fundação para a Ciência e Tecnologia within project DSAIPA/AI/0088/2020.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alina Yanchuk .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Yanchuk, A., Trifan, A., Fajarda, O., Oliveira, J.L. (2022). Automatic Classification of Stigmatizing Articles of Mental Illness: The Case of Portuguese Online Newspapers. In: Chiusano, S., et al. New Trends in Database and Information Systems. ADBIS 2022. Communications in Computer and Information Science, vol 1652. Springer, Cham. https://doi.org/10.1007/978-3-031-15743-1_31

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-15743-1_31

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-15742-4

  • Online ISBN: 978-3-031-15743-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics