Abstract
The stigma related to mental health continues to be present in online newspapers, where mental diseases are often used metaphorically to refer to entities or situations outside the clinical of mental health. This project explores the implementation of Artificial Intelligence and Natural Language Processing techniques for the task of automatically classifying stigmatizing articles with references to the mental disorders of schizophrenia and psychosis. This work is implemented in Portuguese online news articles, collected from the Arquivo.pt repository, a public repository of archived Portuguese web pages, and can be adapted to other languages or similar problems. Nine machine and deep learning algorithms were implemented, most of them yielding results with a precision above 90%. In addition, the automatic detection of articles topics was also performed, through topic modeling with the top2vec model, which allowed concluding that the stigmatization of mental health occurs, essentially, in Economics and Politics related news. The results confirm the existence of stigma in Portuguese newspapers (52% of the 978 articles collected) and the effectiveness of the use of Artificial Intelligence models to detect it. Additionally, a set of 978 articles collected and manually classified with the classes [“stigmatizing”, “literal”] is obtained.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
- 10.
- 11.
- 12.
- 13.
- 14.
- 15.
References
Abadi, M., et al.: Tensorflow: large-scale machine learning on heterogeneous distributed systems. ArXiv (2016). https://doi.org/10.48550/ARXIV.1603.04467, https://tensorflow.org/
Aggarwal, C.C., Zhai, C.: A Survey of text classification algorithms. In: Aggarwal, C.C., Zhai, C. (eds.) Mining Text Data, pp. 163–222. Springer, New York (2012). https://doi.org/10.1007/978-1-4614-3223-4_6
Ahmed, J., Ahmed, M.: Online news classification using machine learning techniques. IIUM Eng. J. 22(2), 210–225 (2021). https://doi.org/10.31436/iiumej.v22i2.1662
Angelov, D.: Top2vec: listributed representations of topics. ArXiv abs/2008.09470 (2020). https://doi.org/10.48550/ARXIV.2008.09470
Aragonès, E., López-Muntaner, J., Ceruelo, S., Basora, J.: Reinforcing stigmatization: lover. age of mental illness in Spanish newspapers. J. Health Commun. 19(11), 1248–1258 (2014). https://doi.org/10.1080/10810730.2013.872726
Athanasopoulou, C., Välimäki, M.: ’Schizophrenia’ as a metaphor in Greek newspaper websites. Stud. Health Technol. Inform. 202, pp. 275–278. (2014). https://doi.org/10.3233/978-1-61499-423-7-275
Bevilacqua Guarniero, F., Bellinghini, R.H., Gattaz, W.F.: The schizophrenia stigma and mass media: l search for news published by wide circulation media in Brazil. Int. Rev. Psychiatr. (Abingdon, England) 29(3), 241–247 (2017). https://doi.org/10.1080/09540261.2017.1285976
Bird, Steven, E.L., Klein, E.: Natural Language Processing with Python. O’Reilly Media Inc (2009). https://www.nltk.org/
Chollet, F., et al.: Keras (2015). https://keras.io
Chopra, A., Doody, G.: Schizophrenia, an illness and a metaphor: analysis of the use of the term ’schizophrenia’ in the UK national newspapers. J. R Soc. Med. 100, 423–426 (2007). https://doi.org/10.1258/jrsm.100.9.423
para a Comunicação Social, E.R.: Públicos e consumos de média - o consumo de notícias e as plataformas digitais em portugal e em mais dez países (2014). https://www.erc.pt/pt/estudos-e-publicacoes/consum os-de-media/estudo-publicos-e-consumos-de-media
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Vol. 1 (Long and Short Papers), pp. 4171–4186. Association for Computational Linguistics (2019). https://doi.org/10.18653/v1/N19-1423
Duckworth, K., Halpern, J.H., Schutt, R.K., Gillespie, C.: Use of schizophrenia as a metaphor in US newspapers. Psychiatr. Serv. (Washington, D.C.) 54(10), 1402–1404 (2003). https://doi.org/10.1176/appi.ps.54.10.1402
Fundação para a Ciência e Tecnologia: Recolha de conteúdos - sobre.arquivo.pt, https://sobre.arquivo.pt/pt/ajuda/recolha-e-arquivo-de-conteudos/
Gao, G., Choi, E., Choi, Y., Zettlemoyer, L.: Neural metaphor detection in context. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 607–613. Association for Computational Linguistics (2018). https://doi.org/10.18653/v1/D18-1060
Hsu, B.M.: Comparison of supervised classification models on textual data. Mathematics 8(5) (2020). https://doi.org/10.3390/math8050851
O’Malley, T., et al.: Kerastuner (2019). https://github.com/keras-team/keras-tuner
Onan, A., Togoclu, M.: Satire identification in Turkish news articles based on ensemble of classifiers. Turk. J. Electr. Eng. Comput. Sci. 28, 1086–1106 (2020). https://doi.org/10.3906/elk-1907-11
Ou-Yang, L.: Newspaper3k: aArticle scraping & curation. https://newspaper.readthedocs.io/en/latest/
Paszke, A., et al.: Pytorch: An imperative style, high-performance deep learning library. In: Wallach, H., Larochelle, H., Beygelzimer, A., d’Alché-Buc, F., Fox, E., Garnett, R. (eds.) Advances in Neural Information Processing Systems 32, pp. 8024–8035. Curran Associates, Inc. (2019). https://papers.neurips.cc/paper/9015-pytorch-an-imperative-style-high-performance-deep-learning-library.pdf
Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011). https://scikit-learn.org
Pennebaker, J., Francis, M.: Linguistic Inquiry and Word Count. Lawrence Erlbaum Associates, Incorporated (1999). https://books.google.pt/books?id=6FnuAAAACAAJ
Pennington, J., Socher, R., Manning, C.: GloVe: global vectors for word representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1532–1543. Association for Computational Linguistics, Doha, Qatar, October 2014. https://doi.org/10.3115/v1/D14-1162
dos Psicólogos Portugueses, O.: Desenvolvimento sustentável e sustentabilidade dos cuidados de saúde primários (2021)
Rodrigues-Silva, N., Falcão de Almeida, T., Araújo, F., Molodynski, A., Venâncio, Bouça, J.: Use of the word schizophrenia in Portuguese newspapers. J. Mental Health (Abingdon, England) 26(5), 426–430 (2017). https://doi.org/10.1080/09638237.2016.1207231
Sociedade Portuguesa de Psiquiatria e Saúde Mental: Os media e a saúde mental - análise de conteúdo de notícias publicadas por meios de comunicação social portugueses (2016). https://www.sppsm.org/informemente/apresentacao/
Souza, F., Nogueira, R., Lotufo, R.: BERTimbau: pretrained BERT models for Brazilian Portuguese. In: 9th Brazilian Conference on Intelligent Systems, BRACIS, Rio Grande do Sul, Brazil, October 20–23 (to appear 2020)
Wolf, T., et al.: Transformers: State-of-the-Art Natural Language Processing, pp. 38–45. Association for Computational Linguistics, October 2020
Acknowledgment
This work was supported by FCT - Fundação para a Ciência e Tecnologia within project DSAIPA/AI/0088/2020.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Yanchuk, A., Trifan, A., Fajarda, O., Oliveira, J.L. (2022). Automatic Classification of Stigmatizing Articles of Mental Illness: The Case of Portuguese Online Newspapers. In: Chiusano, S., et al. New Trends in Database and Information Systems. ADBIS 2022. Communications in Computer and Information Science, vol 1652. Springer, Cham. https://doi.org/10.1007/978-3-031-15743-1_31
Download citation
DOI: https://doi.org/10.1007/978-3-031-15743-1_31
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-15742-4
Online ISBN: 978-3-031-15743-1
eBook Packages: Computer ScienceComputer Science (R0)