Skip to main content

An Ensemble Neural Network Model for Benefiting Pregnancy Health Stats from Mining Social Media

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11292))

Abstract

Extensive use of social media for communication has made it a desired resource in human behavior intensive tasks like product popularity, public polls and more recently for public health surveillance tasks such as lifestyle associated diseases and mental health. In this paper, we exploited Twitter data for detecting pregnancy cases and used tweets about pregnancy to study trigger terms associated with maternal physical and mental health. Such systems can enable clinicians to offer a more comprehensive health care in real time. Using a Twitter-based corpus, we have developed an ensemble Long-short Term Memory (LSTM) – Recurrent Neural Networks (RNN) and Convolution Neural Networks (CNN) network representation model to learn legitimate pregnancy cases discussed online. These ensemble representations were learned by a SVM classifier, which can achieve F1-score of 95% in predicting pregnancy accounts discussed in tweets. We also further investigate the words most commonly associated with physical disease symptoms ‘Distress’ and negative emotions ‘Annoyed’ sentiment. Results from our sentiment analysis study are quite encouraging, identifying more accurate triggers for pregnancy sentiment classes.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    https://zephoria.com/twitter-statistics-top-ten/.

  2. 2.

    https://newsroom.fb.com/company-info/.

  3. 3.

    https://www.soas.ac.uk/blogs/study/twitter-study-un-real-world-issues/.

  4. 4.

    http://www.businesswire.com/news/home/20121120005872/en/Twenty-percent-online-adults discuss-health-information, 2012.

  5. 5.

    https://nlp.stanford.edu/projects/glove/.

  6. 6.

    https://github.com/keras-team/keras.

  7. 7.

    https://github.com/scikit-learn/scikit-learn/tree/master/sklearn/svm.

  8. 8.

    http://nlp.tmu.edu.tw/wordcloud/wordcloud.html.

  9. 9.

    https://www.csie.ntu.edu.tw/~cjlin/libshorttext/.

References

  1. Woodall, J., Calisher, C.H.: ProMED-mail: background and purpose. Emerg. Infect. Dis. 7(3 Suppl.), 563 (2001)

    Article  Google Scholar 

  2. Mykhalovskiy, E., Weir, L.: The global public health intelligence network and early warning outbreak detection: a Canadian contribution to global public health. Can. J. Public Health 97(1), 42–44 (2006)

    Google Scholar 

  3. Brownstein, J.S., Freifeld, C.C., Madoff, L.C.: Digital disease detection — harnessing the web for public health surveillance. New England J. Med. 360(21), 2153–2157 (2009). https://doi.org/10.1056/NEJMp0900702

    Article  Google Scholar 

  4. Huang, Y., et al.: Incorporating dependency trees improve identification of pregnant women on social media platforms. In: 2017 Proceedings of the International Workshops on Digital Disease Detection using Social Media (DDDSM-2017), Taipei, pp. 26–32 (2017)

    Google Scholar 

  5. Gomide, J.: Dengue surveillance based on a computational model of spatio-temporal locality of Twitter. In: Proceedings of the 3rd International Web Science Conference, p. 3. ACM (2011)

    Google Scholar 

  6. Diaz-Aviles, Stewart, A.: Tracking Twitter for epidemic intelligence: case study: Ehec/hus outbreak in Germany. In: 2011 Proceedings of the 4th Annual ACM Web Science Conference, pp. 82–85. ACM (2012)

    Google Scholar 

  7. Odlum, M.: How Twitter can support early warning systems in Ebola outbreak surveillance. In: 143rd APHA Annual Meeting and Exposition, 31 October–4 November 2015. APHA (2015)

    Google Scholar 

  8. McGough, S.F., Brownstein, J.S., Hawkins, J.B., Santillana, M.: Forecasting zika incidence in the 2016 latin america outbreak combining traditional disease surveillance with search, social media, and news report data. PLoS Negl. Trop. Dis. 11(1), e0005295 (2017). https://doi.org/10.1371/journal.pntd.0005295

    Article  Google Scholar 

  9. Mejova, Y., Haddadi, H., Noulas, A., Weber, I.: # foodporn: obesity patterns in culinary interactions. In: Proceedings of the 5th International Conference on Digital Health 2015, pp. 51–58. ACM (2015)

    Google Scholar 

  10. Aphinyanaphongs, Y., Ray, B., Statnikov, A., Krebs, P.: Text classification for automatic detection of alcohol use-related tweets. In: International Workshop on Issues and Challenges in Social Computing (2014)

    Google Scholar 

  11. Leaman, R., Wojtulewicz, L., Sullivan, R., Skariah, A., Yang, J., Gonzalez, G.: Towards internet-age pharmacovigilance: extracting adverse drug reactions from user posts to health-related social networks. In: Proceedings of the 2010 Workshop on Biomedical Natural Language Processing, pp. 117–125 (2010)

    Google Scholar 

  12. Bian, J., Topaloglu, U., Yu, F.: Towards large-scale twitter mining for drug-related adverse events. In: Proceedings of the 2012 International Workshop on Smart Health and Wellbeing, pp. 25–32. ACM (2012)

    Google Scholar 

  13. Sarker, A., Gonzalez, G.: Portable automatic text classification for adverse drug reaction detection via multi-corpus training. J. Biomed. Inf. 53, 196–207 (2015)

    Article  Google Scholar 

  14. Dai, H.J., Touray, M., Jonnagaddala, J., Syed-Abdul, S.: Feature engineering for recognizing adverse drug reactions from Twitter posts. Information 7(2), 27 (2016). https://doi.org/10.3390/info7020027

    Article  Google Scholar 

  15. Banjari, I., Kenjeri, D., Šolić K., Mandić , M.L.: Cluster analysis as a prediction tool for pregnancy outcomes. Collegium Antropol. 39(1), 247–252 (2015)

    Google Scholar 

  16. Laopaiboon, M.: ̈lmezoglu. Advanced maternal age and pregnancy outcomes: a multicountry assessment. BJOG: Int. J. Obstet. Gynaecol. 121(s1), 49–56 (2014)

    Article  Google Scholar 

  17. Wettach, C., Thomann, J., Lambrigger-Steiner, C., Buclin, T., Desmeules, J., von Mandach, U.: Pharmacovigilance in pregnancy: adverse drug reactions associated with fetal disorders. J. Perinat. Med. 41(3), 301–307 (2013)

    Article  Google Scholar 

  18. De Choudhury, M., Counts, S., Horvitz, E.: Predicting postpartum changes in emotion and behavior via social media. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 3267–3276. ACM (2013)

    Google Scholar 

  19. Chandrashekar, P.B., Magge, A., Sarker, A., Gonzalez, G.: Social media mining for identification and exploration of health-related information from pregnant women (2017). CoRR, abs/1702.02261

    Google Scholar 

  20. Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997). https://doi.org/10.1162/neco.1997.9.8.1735

    Article  Google Scholar 

  21. Lecun, Y., et al.: Comparison of learning algorithms for handwritten digit recognition. In: Fogelman, F., Gallinari, P. (eds.) International Conference on Artificial Neural Networks, Paris, pp. 53–60. EC2 & Cie (1995)

    Google Scholar 

  22. Kim, Y.: Convolutional neural networks for sentence classification. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, EMNLP 2014, October 2014, Doha, Qatar, A meeting of SIGDAT, a Special Interest Group of the ACL pp. 1746–1751 (2014)

    Google Scholar 

  23. Joachims, T.: Text categorization with Support Vector Machines: Learning with many relevant features. In: Nédellec, C., Rouveirol, C. (eds.) ECML 1998. LNCS, vol. 1398, pp. 137–142. Springer, Heidelberg (1998). https://doi.org/10.1007/BFb0026683

    Chapter  Google Scholar 

Download references

Acknowledgments

We are grateful to the anonymous reviewers for their insightful comments. This research was supported by the Ministry of Science and Technology of Taiwan under grant number MOST 106-2218-E-038-004-MY2, MOST 107-2634-F-001-005, and MOST 107-2319-B-400-001.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yung-Chun Chang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Warikoo, N., Chang, YC., Dai, HJ., Hsu, WL. (2018). An Ensemble Neural Network Model for Benefiting Pregnancy Health Stats from Mining Social Media. In: Tseng, YH., et al. Information Retrieval Technology. AIRS 2018. Lecture Notes in Computer Science(), vol 11292. Springer, Cham. https://doi.org/10.1007/978-3-030-03520-4_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-03520-4_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-03519-8

  • Online ISBN: 978-3-030-03520-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics