Automatic Classification of Stigmatizing Articles of Mental Illness: The Case of Portuguese Online Newspapers

Yanchuk, Alina; Trifan, Alina; Fajarda, Olga; Oliveira, José Luís

doi:10.1007/978-3-031-15743-1_31

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1652))

Included in the following conference series:

European Conference on Advances in Databases and Information Systems

943 Accesses

Abstract

The stigma related to mental health continues to be present in online newspapers, where mental diseases are often used metaphorically to refer to entities or situations outside the clinical of mental health. This project explores the implementation of Artificial Intelligence and Natural Language Processing techniques for the task of automatically classifying stigmatizing articles with references to the mental disorders of schizophrenia and psychosis. This work is implemented in Portuguese online news articles, collected from the Arquivo.pt repository, a public repository of archived Portuguese web pages, and can be adapted to other languages or similar problems. Nine machine and deep learning algorithms were implemented, most of them yielding results with a precision above 90%. In addition, the automatic detection of articles topics was also performed, through topic modeling with the top2vec model, which allowed concluding that the stigmatization of mental health occurs, essentially, in Economics and Politics related news. The results confirm the existence of stigma in Portuguese newspapers (52% of the 978 articles collected) and the effectiveness of the use of Artificial Intelligence models to detect it. Additionally, a set of 978 articles collected and manually classified with the classes [“stigmatizing”, “literal”] is obtained.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Abadi, M., et al.: Tensorflow: large-scale machine learning on heterogeneous distributed systems. ArXiv (2016). https://doi.org/10.48550/ARXIV.1603.04467, https://tensorflow.org/
Aggarwal, C.C., Zhai, C.: A Survey of text classification algorithms. In: Aggarwal, C.C., Zhai, C. (eds.) Mining Text Data, pp. 163–222. Springer, New York (2012). https://doi.org/10.1007/978-1-4614-3223-4_6
Ahmed, J., Ahmed, M.: Online news classification using machine learning techniques. IIUM Eng. J. 22(2), 210–225 (2021). https://doi.org/10.31436/iiumej.v22i2.1662
Angelov, D.: Top2vec: listributed representations of topics. ArXiv abs/2008.09470 (2020). https://doi.org/10.48550/ARXIV.2008.09470
Aragonès, E., López-Muntaner, J., Ceruelo, S., Basora, J.: Reinforcing stigmatization: lover. age of mental illness in Spanish newspapers. J. Health Commun. 19(11), 1248–1258 (2014). https://doi.org/10.1080/10810730.2013.872726
Article Google Scholar
Athanasopoulou, C., Välimäki, M.: ’Schizophrenia’ as a metaphor in Greek newspaper websites. Stud. Health Technol. Inform. 202, pp. 275–278. (2014). https://doi.org/10.3233/978-1-61499-423-7-275
Bevilacqua Guarniero, F., Bellinghini, R.H., Gattaz, W.F.: The schizophrenia stigma and mass media: l search for news published by wide circulation media in Brazil. Int. Rev. Psychiatr. (Abingdon, England) 29(3), 241–247 (2017). https://doi.org/10.1080/09540261.2017.1285976
Article Google Scholar
Bird, Steven, E.L., Klein, E.: Natural Language Processing with Python. O’Reilly Media Inc (2009). https://www.nltk.org/
Chollet, F., et al.: Keras (2015). https://keras.io
Chopra, A., Doody, G.: Schizophrenia, an illness and a metaphor: analysis of the use of the term ’schizophrenia’ in the UK national newspapers. J. R Soc. Med. 100, 423–426 (2007). https://doi.org/10.1258/jrsm.100.9.423
para a Comunicação Social, E.R.: Públicos e consumos de média - o consumo de notícias e as plataformas digitais em portugal e em mais dez países (2014). https://www.erc.pt/pt/estudos-e-publicacoes/consum os-de-media/estudo-publicos-e-consumos-de-media
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Vol. 1 (Long and Short Papers), pp. 4171–4186. Association for Computational Linguistics (2019). https://doi.org/10.18653/v1/N19-1423
Duckworth, K., Halpern, J.H., Schutt, R.K., Gillespie, C.: Use of schizophrenia as a metaphor in US newspapers. Psychiatr. Serv. (Washington, D.C.) 54(10), 1402–1404 (2003). https://doi.org/10.1176/appi.ps.54.10.1402
Fundação para a Ciência e Tecnologia: Recolha de conteúdos - sobre.arquivo.pt, https://sobre.arquivo.pt/pt/ajuda/recolha-e-arquivo-de-conteudos/
Gao, G., Choi, E., Choi, Y., Zettlemoyer, L.: Neural metaphor detection in context. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 607–613. Association for Computational Linguistics (2018). https://doi.org/10.18653/v1/D18-1060
Hsu, B.M.: Comparison of supervised classification models on textual data. Mathematics 8(5) (2020). https://doi.org/10.3390/math8050851
O’Malley, T., et al.: Kerastuner (2019). https://github.com/keras-team/keras-tuner
Onan, A., Togoclu, M.: Satire identification in Turkish news articles based on ensemble of classifiers. Turk. J. Electr. Eng. Comput. Sci. 28, 1086–1106 (2020). https://doi.org/10.3906/elk-1907-11
Article Google Scholar
Ou-Yang, L.: Newspaper3k: aArticle scraping & curation. https://newspaper.readthedocs.io/en/latest/
Paszke, A., et al.: Pytorch: An imperative style, high-performance deep learning library. In: Wallach, H., Larochelle, H., Beygelzimer, A., d’Alché-Buc, F., Fox, E., Garnett, R. (eds.) Advances in Neural Information Processing Systems 32, pp. 8024–8035. Curran Associates, Inc. (2019). https://papers.neurips.cc/paper/9015-pytorch-an-imperative-style-high-performance-deep-learning-library.pdf
Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011). https://scikit-learn.org
Pennebaker, J., Francis, M.: Linguistic Inquiry and Word Count. Lawrence Erlbaum Associates, Incorporated (1999). https://books.google.pt/books?id=6FnuAAAACAAJ
Pennington, J., Socher, R., Manning, C.: GloVe: global vectors for word representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1532–1543. Association for Computational Linguistics, Doha, Qatar, October 2014. https://doi.org/10.3115/v1/D14-1162
dos Psicólogos Portugueses, O.: Desenvolvimento sustentável e sustentabilidade dos cuidados de saúde primários (2021)
Google Scholar
Rodrigues-Silva, N., Falcão de Almeida, T., Araújo, F., Molodynski, A., Venâncio, Bouça, J.: Use of the word schizophrenia in Portuguese newspapers. J. Mental Health (Abingdon, England) 26(5), 426–430 (2017). https://doi.org/10.1080/09638237.2016.1207231
Sociedade Portuguesa de Psiquiatria e Saúde Mental: Os media e a saúde mental - análise de conteúdo de notícias publicadas por meios de comunicação social portugueses (2016). https://www.sppsm.org/informemente/apresentacao/
Souza, F., Nogueira, R., Lotufo, R.: BERTimbau: pretrained BERT models for Brazilian Portuguese. In: 9th Brazilian Conference on Intelligent Systems, BRACIS, Rio Grande do Sul, Brazil, October 20–23 (to appear 2020)
Google Scholar
Wolf, T., et al.: Transformers: State-of-the-Art Natural Language Processing, pp. 38–45. Association for Computational Linguistics, October 2020
Google Scholar

Download references

Acknowledgment

This work was supported by FCT - Fundação para a Ciência e Tecnologia within project DSAIPA/AI/0088/2020.

Author information

Authors and Affiliations

IEETA, DETI, University of Aveiro, Aveiro, Portugal
Alina Yanchuk, Alina Trifan, Olga Fajarda & José Luís Oliveira
LASI - Intelligent Systems Associate Laboratory, Guimarães, Portugal
Alina Trifan, Olga Fajarda & José Luís Oliveira

Authors

Alina Yanchuk
View author publications
You can also search for this author in PubMed Google Scholar
Alina Trifan
View author publications
You can also search for this author in PubMed Google Scholar
Olga Fajarda
View author publications
You can also search for this author in PubMed Google Scholar
José Luís Oliveira
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Alina Yanchuk .

Editor information

Editors and Affiliations

Politecnico di Torino, Turin, Italy
Silvia Chiusano
Politecnico di Torino, Turin, Italy
Tania Cerquitelli
Poznań University of Technology, Poznań, Poland
Robert Wrembel
Norwegian University of Science and Technology, Trondheim, Norway
Kjetil Nørvåg
University of Genoa, Genoa, Italy
Barbara Catania
CNRS, Villeurbanne Cedex, France
Genoveva Vargas-Solar
University of Calabria, Rende, Italy
Ester Zumpano

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yanchuk, A., Trifan, A., Fajarda, O., Oliveira, J.L. (2022). Automatic Classification of Stigmatizing Articles of Mental Illness: The Case of Portuguese Online Newspapers. In: Chiusano, S., et al. New Trends in Database and Information Systems. ADBIS 2022. Communications in Computer and Information Science, vol 1652. Springer, Cham. https://doi.org/10.1007/978-3-031-15743-1_31

Download citation

DOI: https://doi.org/10.1007/978-3-031-15743-1_31
Published: 29 August 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-15742-4
Online ISBN: 978-3-031-15743-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Automatic Classification of Stigmatizing Articles of Mental Illness: The Case of Portuguese Online Newspapers