Abstract
Background: Cervical cancer is the second most common cancer among women under 45 years of age. To deal with the decrease of smear test coverage in the UK, a Twitter campaign called #SmearForSmear has been launched in 2015 for the European Cervical Cancer Prevention Week. Its aim was to encourage women to take a selfie showing their lipstick going over the edge and post it on Twitter with a raising awareness message promoting cervical cancer screening. The estimated audience was 500 million people. In a previous study (Lenoir et al., J Med Internet Res 19(10):e344, 2017, https://doi.org/10.2196/jmir.8421, http://www.jmir.org/2017/10/e344/), we identified the tweets delivering a raising awareness message promoting cervical cancer screening (sensitizing tweets) and understood the characteristics of Twitter users posting about this campaign.
Objective: The objective of this new study is to investigate the interest of deep learning methods to automatically categorize tweets according to themes and users’ status.
Methods: We conducted a 4-step content analysis of the English tweets tagged #SmearForSmear and posted on Twitter for the 2015 European Cervical Cancer Prevention Week. 18,292 messages were collected using the Twitter Streaming API between the period of January 2017 and November 2017. In order to produce training and test data sets, we annotated the messages according to themes and users’ statuses.
These messages have been analyzed by two independent researchers using a thematic analysis, validated by a strong Cohen kappa coefficient. A total of seven themes were coded for sensitizing tweets and seven for Twitter users’ status. Based on this annotation, we compared by cross validation the predictive performances of traditional classification techniques against more advanced deep learning methods.
Results: Deep learning models were able to predict efficiently the seven themes and seven users’ status. More specifically, the deep learning models performed better than traditional approaches.
Conclusions: Deep learning methods can efficiently predict themes and users’ status. These predictive models could be used as a powerful tool to automatically analyze social data such as twitter streams for medical perspectives. This study also demonstrates that the success of a public health campaign using a social media platform depends on its ability to get its targets involved. It also suggests the need to use social marketing based on efficient predictive approaches to help its dissemination. The clinical impact of this Twitter campaign to increase cervical cancer screening is yet to be evaluated.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
https://twitter.com/caradelevingne. Accessed 2018/02/15.
References
Bragazzi N, Martini M, Igwe S, Vecchio I, Barberis I, Tornali C. Has the ice bucket challenge really increased people’s awareness of amyotrophic lateral sclerosis? Insights and ethical implications from Google Trends and Wikipedia: a 2 years-follow up. Acta Medica Mediterr 2017;33(5):875–8.
Crutzen R, de Nooijer J, Brouwer W, Oenema A, Brug J, de Vries NK. Strategies to facilitate exposure to internet-delivered health behavior change interventions aimed at adolescents or young adults: a systematic review. Health Educ Behav 38(1), 49–62 (2011). https://doi.org/10.1177/1090198110372878. PMID: 21189422.
Ducoffe M, Precioso F. QBDC: query by dropout committee for training deep supervised architecture. CoRR abs/1511.06412 (2015). http://arxiv.org/abs/1511.06412.
Elo S, Kyngäs H. The qualitative content analysis process. J Adv Nurs 2008;62(1):107–11.
Ferlay J, Soerjomataram I, Dikshit R, Eser S, Mathers C, Rebelo M, Parkin DM, Forman D, Bray F. Cancer incidence and mortality worldwide: sources, methods and major patterns in GLOBOCAN 2012. Int J Cancer 2015;136(5):E359–86. http://dx.doi.org/10.1002/ijc.29210.
Glaser B, Strauss A. The discovery of grounded theory: strategies for qualitative research. New Jersey: Transaction Publishers;1967.
Hsieh H, Shannon S. Three approaches to qualitative content analysis. Qual Health Res 2005;15(9):1277–88.
Kondracki N, Wellman N, Amundson D. Content analysis: review of methods and their applications in nutrition education. J Nutr Educ Behav 2002;34(4):224–30.
Koohy H, Koohy B. A lesson from the ice bucket challenge: using social networks to publicize science. Front Genet 2014;5:430. https://doi.org/10.3389/fgene.2014.00430. https://www.frontiersin.org/article/10.3389/fgene.2014.00430.
Lenoir P, Moulahi B, Azé J, Bringay S, Mercier G, Carbonnel F. Raising awareness about cervical cancer using twitter: content analysis of the 2015 #SmearForSmear campaign. J Med Internet Res 2017;19(10):e344. https://doi.org/10.2196/jmir.8421. http://www.jmir.org/2017/10/e344/.
Liu W, Wang Z, Liu X, Zeng N, Liu Y, Alsaadi FE. A survey of deep neural network architectures and their applications. Neurocomputing 2017;234:11–26. https://doi.org/10.1016/j.neucom.2016.12.038. http://www.sciencedirect.com/science/article/pii/S0925231216315533.
Marlow LAV, Wardle J, Waller J. Understanding cervical screening non-attendance among ethnic minority women in England. Br J Cancer 2015;113(5):833–9. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4559824/.
Nakov P, Ritter A, Rosenthal S, Sebastiani F, Stoyanov V. Semeval-2016 task 4: sentiment analysis in twitter. In: Proceedings of the 10th International Workshop on Semantic Evaluation; 2016. pp. 1–18.
Neiger BL, Thackeray R, Burton SH, Giraud-Carrier CG, Fagen MC. Evaluating social media’s capacity to develop engaged audiences in health promotion settings: use of twitter metrics as a case study. Health Promot Pract 2013;14(2):157–62. https://doi.org/10.1177/1524839912469378. PMID: 23271716.
Nowak J, Taspinar A, Scherer R. LSTM recurrent neural networks for short text and sentiment classification. Cham: Springer; 2017. pp. 553–562. https://doi.org/10.1007/978-3-319-59060-8_50.
Rosenthal S, Nakov P, Kiritchenko S, Mohammad S, Ritter A, Stoyanov V. Semeval-2015 task 10: sentiment analysis in twitter. In: Proceedings of the 9th International Workshop on Semantic Evaluation; 2015. pp. 451–463
Schmidhuber J. Deep learning in neural networks: an overview. Neural Netw 2015;61:85–117. https://doi.org/10.1016/j.neunet.2014.09.003. Published online 2014; based on TR arXiv:1404.7828 [cs.NE].
Shwartz-Ziv R, Tishby N. Opening the black box of deep neural networks via information. CoRR abs/1703.00810 (2017). http://arxiv.org/abs/1703.00810.
Strecher JV, McClure J, Alexander G, Chakraborty B, Nair V, Konkel J, Greene S, Couper M, Carlier C, Wiese C, Little R, Pomerleau C, Pomerleau O. The role of engagement in a tailored web-based smoking cessation program: randomized controlled trial. J Med Internet Res 2008;10(5):e36. https://doi.org/10.2196/jmir.1002. http://www.jmir.org/2008/5/e36/.
Thun MJ, DeLancey JO, Center MM, Jemal A, Ward EM. The global burden of cancer: priorities for prevention. Carcinogenesis 2010;31(1):100–10. http://dx.doi.org/10.1093/carcin/bgp263.
Valle I, Tramalloni D, Bragazzi N. Cancer prevention: state of the art and future prospects. J Prev Med Hyg 2015;56(1):E21–7. http://europepmc.org/articles/PMC4718348.
Walboomers JMM, Jacobs MV, Manos MM, Bosch FX, Kummer JA, Shah KV, Snijders PJF, Peto J, Meijer CJLM, Muñoz N. Human papillomavirus is a necessary cause of invasive cervical cancer worldwide. J Pathol 1999;189(1):12–9. http://dx.doi.org/10.1002/(SICI)1096-9896(199909)189:1<12::AID-PATH431>3.0.CO;2-F.
Wang S, Paul MJ, Dredze M. Exploring health topics in Chinese social media: an analysis of Sina Weibo. In: AAAI Workshop on the World Wide Web and Public Health Intelligence. Vol. 23; 2014. pp. 20–3.
Zeng QT, Tse T. Viewpoint paper: exploring and developing consumer health vocabularies. JAMIA 2006;13(1):24–9. https://doi.org/10.1197/jamia.M1761.
Zhan Y, Liu R, Li Q, Leischow S, Zeng D. Identifying topics for e-cigarette user-generated contents: a case study from multiple social media platforms. J Med Internet Res 2017;19(1):e24.
Acknowledgements
The authors would like to thank Gérard Bourrel, Professor at the University of Montpellier, for his help and wise advice, reviewing this paper. They would also like to thank and congratulate the Jo’s Cervical Cancer Trust for their work on this inspiring campaign and its participants for their commitment to combat cervical cancer. This study did not require ethics approval as the authors only used publically available Twitter content.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Mercadier, Y. et al. (2019). Analysis by Multiclass Multilabel Classification of the 2015 #SmearForSmear Campaign Using Deep Learning. In: Bian, J., Guo, Y., He, Z., Hu, X. (eds) Social Web and Health Research. Springer, Cham. https://doi.org/10.1007/978-3-030-14714-3_10
Download citation
DOI: https://doi.org/10.1007/978-3-030-14714-3_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-14713-6
Online ISBN: 978-3-030-14714-3
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)