Skip to main content

Analysis by Multiclass Multilabel Classification of the 2015 #SmearForSmear Campaign Using Deep Learning

  • Chapter
  • First Online:
Social Web and Health Research

Abstract

Background: Cervical cancer is the second most common cancer among women under 45 years of age. To deal with the decrease of smear test coverage in the UK, a Twitter campaign called #SmearForSmear has been launched in 2015 for the European Cervical Cancer Prevention Week. Its aim was to encourage women to take a selfie showing their lipstick going over the edge and post it on Twitter with a raising awareness message promoting cervical cancer screening. The estimated audience was 500 million people. In a previous study (Lenoir et al., J Med Internet Res 19(10):e344, 2017, https://doi.org/10.2196/jmir.8421, http://www.jmir.org/2017/10/e344/), we identified the tweets delivering a raising awareness message promoting cervical cancer screening (sensitizing tweets) and understood the characteristics of Twitter users posting about this campaign.

Objective: The objective of this new study is to investigate the interest of deep learning methods to automatically categorize tweets according to themes and users’ status.

Methods: We conducted a 4-step content analysis of the English tweets tagged #SmearForSmear and posted on Twitter for the 2015 European Cervical Cancer Prevention Week. 18,292 messages were collected using the Twitter Streaming API between the period of January 2017 and November 2017. In order to produce training and test data sets, we annotated the messages according to themes and users’ statuses.

These messages have been analyzed by two independent researchers using a thematic analysis, validated by a strong Cohen kappa coefficient. A total of seven themes were coded for sensitizing tweets and seven for Twitter users’ status. Based on this annotation, we compared by cross validation the predictive performances of traditional classification techniques against more advanced deep learning methods.

Results: Deep learning models were able to predict efficiently the seven themes and seven users’ status. More specifically, the deep learning models performed better than traditional approaches.

Conclusions: Deep learning methods can efficiently predict themes and users’ status. These predictive models could be used as a powerful tool to automatically analyze social data such as twitter streams for medical perspectives. This study also demonstrates that the success of a public health campaign using a social media platform depends on its ability to get its targets involved. It also suggests the need to use social marketing based on efficient predictive approaches to help its dissemination. The clinical impact of this Twitter campaign to increase cervical cancer screening is yet to be evaluated.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    http://www.alsa.org/.

  2. 2.

    http://scikit-learn.org/stable/.

  3. 3.

    https://keras.io/.

  4. 4.

    https://www.investopedia.com/articles/markets/100215/twitter-vs-facebook-vs-instagram-who-target-audience.asp. Accessed 2018/02/15.

  5. 5.

    https://www.investopedia.com/articles/markets/100215/twitter-vs-facebook-vs-instagram-who-target-audience.asp. Accessed 2018/02/15.

  6. 6.

    https://twitter.com/caradelevingne. Accessed 2018/02/15.

References

  1. Bragazzi N, Martini M, Igwe S, Vecchio I, Barberis I, Tornali C. Has the ice bucket challenge really increased people’s awareness of amyotrophic lateral sclerosis? Insights and ethical implications from Google Trends and Wikipedia: a 2 years-follow up. Acta Medica Mediterr 2017;33(5):875–8.

    Google Scholar 

  2. Crutzen R, de Nooijer J, Brouwer W, Oenema A, Brug J, de Vries NK. Strategies to facilitate exposure to internet-delivered health behavior change interventions aimed at adolescents or young adults: a systematic review. Health Educ Behav 38(1), 49–62 (2011). https://doi.org/10.1177/1090198110372878. PMID: 21189422.

    Article  Google Scholar 

  3. Ducoffe M, Precioso F. QBDC: query by dropout committee for training deep supervised architecture. CoRR abs/1511.06412 (2015). http://arxiv.org/abs/1511.06412.

  4. Elo S, Kyngäs H. The qualitative content analysis process. J Adv Nurs 2008;62(1):107–11.

    Article  Google Scholar 

  5. Ferlay J, Soerjomataram I, Dikshit R, Eser S, Mathers C, Rebelo M, Parkin DM, Forman D, Bray F. Cancer incidence and mortality worldwide: sources, methods and major patterns in GLOBOCAN 2012. Int J Cancer 2015;136(5):E359–86. http://dx.doi.org/10.1002/ijc.29210.

    Article  CAS  Google Scholar 

  6. Glaser B, Strauss A. The discovery of grounded theory: strategies for qualitative research. New Jersey: Transaction Publishers;1967.

    Google Scholar 

  7. Hsieh H, Shannon S. Three approaches to qualitative content analysis. Qual Health Res 2005;15(9):1277–88.

    Article  Google Scholar 

  8. Kondracki N, Wellman N, Amundson D. Content analysis: review of methods and their applications in nutrition education. J Nutr Educ Behav 2002;34(4):224–30.

    Article  Google Scholar 

  9. Koohy H, Koohy B. A lesson from the ice bucket challenge: using social networks to publicize science. Front Genet 2014;5:430. https://doi.org/10.3389/fgene.2014.00430. https://www.frontiersin.org/article/10.3389/fgene.2014.00430.

  10. Lenoir P, Moulahi B, Azé J, Bringay S, Mercier G, Carbonnel F. Raising awareness about cervical cancer using twitter: content analysis of the 2015 #SmearForSmear campaign. J Med Internet Res 2017;19(10):e344. https://doi.org/10.2196/jmir.8421. http://www.jmir.org/2017/10/e344/.

    Article  Google Scholar 

  11. Liu W, Wang Z, Liu X, Zeng N, Liu Y, Alsaadi FE. A survey of deep neural network architectures and their applications. Neurocomputing 2017;234:11–26. https://doi.org/10.1016/j.neucom.2016.12.038. http://www.sciencedirect.com/science/article/pii/S0925231216315533.

    Article  Google Scholar 

  12. Marlow LAV, Wardle J, Waller J. Understanding cervical screening non-attendance among ethnic minority women in England. Br J Cancer 2015;113(5):833–9. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4559824/.

    Article  CAS  Google Scholar 

  13. Nakov P, Ritter A, Rosenthal S, Sebastiani F, Stoyanov V. Semeval-2016 task 4: sentiment analysis in twitter. In: Proceedings of the 10th International Workshop on Semantic Evaluation; 2016. pp. 1–18.

    Google Scholar 

  14. Neiger BL, Thackeray R, Burton SH, Giraud-Carrier CG, Fagen MC. Evaluating social media’s capacity to develop engaged audiences in health promotion settings: use of twitter metrics as a case study. Health Promot Pract 2013;14(2):157–62. https://doi.org/10.1177/1524839912469378. PMID: 23271716.

    Article  Google Scholar 

  15. Nowak J, Taspinar A, Scherer R. LSTM recurrent neural networks for short text and sentiment classification. Cham: Springer; 2017. pp. 553–562. https://doi.org/10.1007/978-3-319-59060-8_50.

    Google Scholar 

  16. Rosenthal S, Nakov P, Kiritchenko S, Mohammad S, Ritter A, Stoyanov V. Semeval-2015 task 10: sentiment analysis in twitter. In: Proceedings of the 9th International Workshop on Semantic Evaluation; 2015. pp. 451–463

    Google Scholar 

  17. Schmidhuber J. Deep learning in neural networks: an overview. Neural Netw 2015;61:85–117. https://doi.org/10.1016/j.neunet.2014.09.003. Published online 2014; based on TR arXiv:1404.7828 [cs.NE].

    Article  Google Scholar 

  18. Shwartz-Ziv R, Tishby N. Opening the black box of deep neural networks via information. CoRR abs/1703.00810 (2017). http://arxiv.org/abs/1703.00810.

  19. Strecher JV, McClure J, Alexander G, Chakraborty B, Nair V, Konkel J, Greene S, Couper M, Carlier C, Wiese C, Little R, Pomerleau C, Pomerleau O. The role of engagement in a tailored web-based smoking cessation program: randomized controlled trial. J Med Internet Res 2008;10(5):e36. https://doi.org/10.2196/jmir.1002. http://www.jmir.org/2008/5/e36/.

    Article  Google Scholar 

  20. Thun MJ, DeLancey JO, Center MM, Jemal A, Ward EM. The global burden of cancer: priorities for prevention. Carcinogenesis 2010;31(1):100–10. http://dx.doi.org/10.1093/carcin/bgp263.

    Article  CAS  Google Scholar 

  21. Valle I, Tramalloni D, Bragazzi N. Cancer prevention: state of the art and future prospects. J Prev Med Hyg 2015;56(1):E21–7. http://europepmc.org/articles/PMC4718348.

    CAS  PubMed  PubMed Central  Google Scholar 

  22. Walboomers JMM, Jacobs MV, Manos MM, Bosch FX, Kummer JA, Shah KV, Snijders PJF, Peto J, Meijer CJLM, Muñoz N. Human papillomavirus is a necessary cause of invasive cervical cancer worldwide. J Pathol 1999;189(1):12–9. http://dx.doi.org/10.1002/(SICI)1096-9896(199909)189:1<12::AID-PATH431>3.0.CO;2-F.

    Article  CAS  Google Scholar 

  23. Wang S, Paul MJ, Dredze M. Exploring health topics in Chinese social media: an analysis of Sina Weibo. In: AAAI Workshop on the World Wide Web and Public Health Intelligence. Vol. 23; 2014. pp. 20–3.

    Google Scholar 

  24. Zeng QT, Tse T. Viewpoint paper: exploring and developing consumer health vocabularies. JAMIA 2006;13(1):24–9. https://doi.org/10.1197/jamia.M1761.

    PubMed  Google Scholar 

  25. Zhan Y, Liu R, Li Q, Leischow S, Zeng D. Identifying topics for e-cigarette user-generated contents: a case study from multiple social media platforms. J Med Internet Res 2017;19(1):e24.

    Article  Google Scholar 

Download references

Acknowledgements

The authors would like to thank Gérard Bourrel, Professor at the University of Montpellier, for his help and wise advice, reviewing this paper. They would also like to thank and congratulate the Jo’s Cervical Cancer Trust for their work on this inspiring campaign and its participants for their commitment to combat cervical cancer. This study did not require ethics approval as the authors only used publically available Twitter content.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yves Mercadier .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Mercadier, Y. et al. (2019). Analysis by Multiclass Multilabel Classification of the 2015 #SmearForSmear Campaign Using Deep Learning. In: Bian, J., Guo, Y., He, Z., Hu, X. (eds) Social Web and Health Research. Springer, Cham. https://doi.org/10.1007/978-3-030-14714-3_10

Download citation

Publish with us

Policies and ethics