Analysis by Multiclass Multilabel Classification of the 2015 #SmearForSmear Campaign Using Deep Learning

Mercadier, Yves; Moulahi, Bilel; Bringay, Sandra; Azé, Jérôme; Lenoir, Philippe; Mercier, Grégoire; Carbonnel, François

doi:10.1007/978-3-030-14714-3_10

Yves Mercadier⁵,
Bilel Moulahi⁵,
Sandra Bringay^5,6,
Jérôme Azé⁵,
Philippe Lenoir⁷,
Grégoire Mercier^8,9,10,11 &
…
François Carbonnel^{7,11,12,13,14}

681 Accesses

Abstract

Background: Cervical cancer is the second most common cancer among women under 45 years of age. To deal with the decrease of smear test coverage in the UK, a Twitter campaign called #SmearForSmear has been launched in 2015 for the European Cervical Cancer Prevention Week. Its aim was to encourage women to take a selfie showing their lipstick going over the edge and post it on Twitter with a raising awareness message promoting cervical cancer screening. The estimated audience was 500 million people. In a previous study (Lenoir et al., J Med Internet Res 19(10):e344, 2017, https://doi.org/10.2196/jmir.8421, http://www.jmir.org/2017/10/e344/), we identified the tweets delivering a raising awareness message promoting cervical cancer screening (sensitizing tweets) and understood the characteristics of Twitter users posting about this campaign.

Objective: The objective of this new study is to investigate the interest of deep learning methods to automatically categorize tweets according to themes and users’ status.

Methods: We conducted a 4-step content analysis of the English tweets tagged #SmearForSmear and posted on Twitter for the 2015 European Cervical Cancer Prevention Week. 18,292 messages were collected using the Twitter Streaming API between the period of January 2017 and November 2017. In order to produce training and test data sets, we annotated the messages according to themes and users’ statuses.

These messages have been analyzed by two independent researchers using a thematic analysis, validated by a strong Cohen kappa coefficient. A total of seven themes were coded for sensitizing tweets and seven for Twitter users’ status. Based on this annotation, we compared by cross validation the predictive performances of traditional classification techniques against more advanced deep learning methods.

Results: Deep learning models were able to predict efficiently the seven themes and seven users’ status. More specifically, the deep learning models performed better than traditional approaches.

Conclusions: Deep learning methods can efficiently predict themes and users’ status. These predictive models could be used as a powerful tool to automatically analyze social data such as twitter streams for medical perspectives. This study also demonstrates that the success of a public health campaign using a social media platform depends on its ability to get its targets involved. It also suggests the need to use social marketing based on efficient predictive approaches to help its dissemination. The clinical impact of this Twitter campaign to increase cervical cancer screening is yet to be evaluated.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

eBook: USD 16.99; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Disaster Tweets Classification for Multilingual Tweets Using Machine Learning Techniques

Roe v Wade in Twitter: Sentiment Analysis with Machine Learning

Analysis of Public Perceptions Towards the COVID-19 Vaccination Drive: A Case Study of Tweets with Machine Learning Classifiers

Notes

1.
http://www.alsa.org/.
2.
http://scikit-learn.org/stable/.
3.
https://keras.io/.
4.
https://www.investopedia.com/articles/markets/100215/twitter-vs-facebook-vs-instagram-who-target-audience.asp. Accessed 2018/02/15.
5.
https://www.investopedia.com/articles/markets/100215/twitter-vs-facebook-vs-instagram-who-target-audience.asp. Accessed 2018/02/15.
6.
https://twitter.com/caradelevingne. Accessed 2018/02/15.

References

Bragazzi N, Martini M, Igwe S, Vecchio I, Barberis I, Tornali C. Has the ice bucket challenge really increased people’s awareness of amyotrophic lateral sclerosis? Insights and ethical implications from Google Trends and Wikipedia: a 2 years-follow up. Acta Medica Mediterr 2017;33(5):875–8.
Google Scholar
Crutzen R, de Nooijer J, Brouwer W, Oenema A, Brug J, de Vries NK. Strategies to facilitate exposure to internet-delivered health behavior change interventions aimed at adolescents or young adults: a systematic review. Health Educ Behav 38(1), 49–62 (2011). https://doi.org/10.1177/1090198110372878. PMID: 21189422.
Article Google Scholar
Ducoffe M, Precioso F. QBDC: query by dropout committee for training deep supervised architecture. CoRR abs/1511.06412 (2015). http://arxiv.org/abs/1511.06412.
Elo S, Kyngäs H. The qualitative content analysis process. J Adv Nurs 2008;62(1):107–11.
Article Google Scholar
Ferlay J, Soerjomataram I, Dikshit R, Eser S, Mathers C, Rebelo M, Parkin DM, Forman D, Bray F. Cancer incidence and mortality worldwide: sources, methods and major patterns in GLOBOCAN 2012. Int J Cancer 2015;136(5):E359–86. http://dx.doi.org/10.1002/ijc.29210.
Article CAS Google Scholar
Glaser B, Strauss A. The discovery of grounded theory: strategies for qualitative research. New Jersey: Transaction Publishers;1967.
Google Scholar
Hsieh H, Shannon S. Three approaches to qualitative content analysis. Qual Health Res 2005;15(9):1277–88.
Article Google Scholar
Kondracki N, Wellman N, Amundson D. Content analysis: review of methods and their applications in nutrition education. J Nutr Educ Behav 2002;34(4):224–30.
Article Google Scholar
Koohy H, Koohy B. A lesson from the ice bucket challenge: using social networks to publicize science. Front Genet 2014;5:430. https://doi.org/10.3389/fgene.2014.00430. https://www.frontiersin.org/article/10.3389/fgene.2014.00430.
Lenoir P, Moulahi B, Azé J, Bringay S, Mercier G, Carbonnel F. Raising awareness about cervical cancer using twitter: content analysis of the 2015 #SmearForSmear campaign. J Med Internet Res 2017;19(10):e344. https://doi.org/10.2196/jmir.8421. http://www.jmir.org/2017/10/e344/.
Article Google Scholar
Liu W, Wang Z, Liu X, Zeng N, Liu Y, Alsaadi FE. A survey of deep neural network architectures and their applications. Neurocomputing 2017;234:11–26. https://doi.org/10.1016/j.neucom.2016.12.038. http://www.sciencedirect.com/science/article/pii/S0925231216315533.
Article Google Scholar
Marlow LAV, Wardle J, Waller J. Understanding cervical screening non-attendance among ethnic minority women in England. Br J Cancer 2015;113(5):833–9. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4559824/.
Article CAS Google Scholar
Nakov P, Ritter A, Rosenthal S, Sebastiani F, Stoyanov V. Semeval-2016 task 4: sentiment analysis in twitter. In: Proceedings of the 10th International Workshop on Semantic Evaluation; 2016. pp. 1–18.
Google Scholar
Neiger BL, Thackeray R, Burton SH, Giraud-Carrier CG, Fagen MC. Evaluating social media’s capacity to develop engaged audiences in health promotion settings: use of twitter metrics as a case study. Health Promot Pract 2013;14(2):157–62. https://doi.org/10.1177/1524839912469378. PMID: 23271716.
Article Google Scholar
Nowak J, Taspinar A, Scherer R. LSTM recurrent neural networks for short text and sentiment classification. Cham: Springer; 2017. pp. 553–562. https://doi.org/10.1007/978-3-319-59060-8_50.
Google Scholar
Rosenthal S, Nakov P, Kiritchenko S, Mohammad S, Ritter A, Stoyanov V. Semeval-2015 task 10: sentiment analysis in twitter. In: Proceedings of the 9th International Workshop on Semantic Evaluation; 2015. pp. 451–463
Google Scholar
Schmidhuber J. Deep learning in neural networks: an overview. Neural Netw 2015;61:85–117. https://doi.org/10.1016/j.neunet.2014.09.003. Published online 2014; based on TR arXiv:1404.7828 [cs.NE].
Article Google Scholar
Shwartz-Ziv R, Tishby N. Opening the black box of deep neural networks via information. CoRR abs/1703.00810 (2017). http://arxiv.org/abs/1703.00810.
Strecher JV, McClure J, Alexander G, Chakraborty B, Nair V, Konkel J, Greene S, Couper M, Carlier C, Wiese C, Little R, Pomerleau C, Pomerleau O. The role of engagement in a tailored web-based smoking cessation program: randomized controlled trial. J Med Internet Res 2008;10(5):e36. https://doi.org/10.2196/jmir.1002. http://www.jmir.org/2008/5/e36/.
Article Google Scholar
Thun MJ, DeLancey JO, Center MM, Jemal A, Ward EM. The global burden of cancer: priorities for prevention. Carcinogenesis 2010;31(1):100–10. http://dx.doi.org/10.1093/carcin/bgp263.
Article CAS Google Scholar
Valle I, Tramalloni D, Bragazzi N. Cancer prevention: state of the art and future prospects. J Prev Med Hyg 2015;56(1):E21–7. http://europepmc.org/articles/PMC4718348.
CAS PubMed PubMed Central Google Scholar
Walboomers JMM, Jacobs MV, Manos MM, Bosch FX, Kummer JA, Shah KV, Snijders PJF, Peto J, Meijer CJLM, Muñoz N. Human papillomavirus is a necessary cause of invasive cervical cancer worldwide. J Pathol 1999;189(1):12–9. http://dx.doi.org/10.1002/(SICI)1096-9896(199909)189:1<12::AID-PATH431>3.0.CO;2-F.
Article CAS Google Scholar
Wang S, Paul MJ, Dredze M. Exploring health topics in Chinese social media: an analysis of Sina Weibo. In: AAAI Workshop on the World Wide Web and Public Health Intelligence. Vol. 23; 2014. pp. 20–3.
Google Scholar
Zeng QT, Tse T. Viewpoint paper: exploring and developing consumer health vocabularies. JAMIA 2006;13(1):24–9. https://doi.org/10.1197/jamia.M1761.
PubMed Google Scholar
Zhan Y, Liu R, Li Q, Leischow S, Zeng D. Identifying topics for e-cigarette user-generated contents: a case study from multiple social media platforms. J Med Internet Res 2017;19(1):e24.
Article Google Scholar

Download references

Acknowledgements

The authors would like to thank Gérard Bourrel, Professor at the University of Montpellier, for his help and wise advice, reviewing this paper. They would also like to thank and congratulate the Jo’s Cervical Cancer Trust for their work on this inspiring campaign and its participants for their commitment to combat cervical cancer. This study did not require ethics approval as the authors only used publically available Twitter content.

Author information

Authors and Affiliations

LIRMM, UMR 5506, Montpellier University, Montpellier, France
Yves Mercadier, Bilel Moulahi, Sandra Bringay & Jérôme Azé
AMIS, Paul Valéry University, Montpellier, France
Sandra Bringay
Department of General Practice, Montpellier University, Montpellier, France
Philippe Lenoir & François Carbonnel
Public Health Department, Montpellier University Hospital, Montpellier, France
Grégoire Mercier
Center for Political Studies of Latin Europe UMR 5112, Montpellier University, Montpellier, France
Grégoire Mercier
Scientific Research National Center CNRS, Paris, France
Grégoire Mercier
Center for Evaluation of Health Prevention Programs, Paul Valéry University, Montpellier, France
Grégoire Mercier & François Carbonnel
Avicenne Multiprofessional Health Center, Cabestany, France
François Carbonnel
Montpellier Cancer Institute, Montpellier, France
François Carbonnel
EPSYLON EA4556, Paul Valéry University, Montpellier, France
François Carbonnel

Authors

Yves Mercadier
View author publications
You can also search for this author in PubMed Google Scholar
Bilel Moulahi
View author publications
You can also search for this author in PubMed Google Scholar
Sandra Bringay
View author publications
You can also search for this author in PubMed Google Scholar
Jérôme Azé
View author publications
You can also search for this author in PubMed Google Scholar
Philippe Lenoir
View author publications
You can also search for this author in PubMed Google Scholar
Grégoire Mercier
View author publications
You can also search for this author in PubMed Google Scholar
François Carbonnel
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yves Mercadier .

Editor information

Editors and Affiliations

Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL, USA
Jiang Bian
Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL, USA
Yi Guo
School of Information, Florida State University, Tallahassee, FL, USA
Zhe He
Computer Science and Engineering Department, Texas A&M University, College Station, TX, USA
Xia Hu

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Mercadier, Y. et al. (2019). Analysis by Multiclass Multilabel Classification of the 2015 #SmearForSmear Campaign Using Deep Learning. In: Bian, J., Guo, Y., He, Z., Hu, X. (eds) Social Web and Health Research. Springer, Cham. https://doi.org/10.1007/978-3-030-14714-3_10

Download citation

DOI: https://doi.org/10.1007/978-3-030-14714-3_10
Published: 29 June 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-14713-6
Online ISBN: 978-3-030-14714-3
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)

Publish with us

Policies and ethics

Analysis by Multiclass Multilabel Classification of the 2015 #SmearForSmear Campaign Using Deep Learning

Abstract

Access this chapter

Similar content being viewed by others

Disaster Tweets Classification for Multilingual Tweets Using Machine Learning Techniques

Roe v Wade in Twitter: Sentiment Analysis with Machine Learning

Analysis of Public Perceptions Towards the COVID-19 Vaccination Drive: A Case Study of Tweets with Machine Learning Classifiers

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Navigation

Analysis by Multiclass Multilabel Classification of the 2015 #SmearForSmear Campaign Using Deep Learning

Abstract

Access this chapter

Similar content being viewed by others

Disaster Tweets Classification for Multilingual Tweets Using Machine Learning Techniques

Roe v Wade in Twitter: Sentiment Analysis with Machine Learning

Analysis of Public Perceptions Towards the COVID-19 Vaccination Drive: A Case Study of Tweets with Machine Learning Classifiers

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation