Analysis of Social Media Posts for Early Detection of Mental Health Conditions

  • Antoine Briand
  • Hayda Almeida
  • Marie-Jean MeursEmail author
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10832)


This paper presents a multipronged approach to predict early risk of mental health issues from user-generated content in social media. Supervised learning and information retrieval methods are used to estimate the risk of depression for a user given the content of its posts in reddit. The approach presented here was evaluated on the CLEF eRisk 2017 pilot task. We describe the details of five systems submitted to the task, and compare their performance. The comparisons show that combining information retrieval and machine learning methods gives the best results.


Artificial intelligence Classification Information retrieval Machine learning Mental health Natural language processing Social media Text mining 


  1. 1.
    Almeida, H., Queudot, M., Meurs, M.J.: Automatic triage of mental health online forum posts: CLPsych 2016 system description. In: Proceedings of the Third Workshop on Computational Linguistics and Clinical Psychology, pp. 183–187 (2016)Google Scholar
  2. 2.
    Ayers, J.W., Althouse, B.M., Allem, J.P., Rosenquist, J.N., Ford, D.E.: Seasonality in seeking mental health information on Google. Am. J. Prev. Med. (AJPM) 44(5), 520–525 (2013)CrossRefGoogle Scholar
  3. 3.
    Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)CrossRefzbMATHGoogle Scholar
  4. 4.
    Brunborg, G.S., Mentzoni, R.A., Frøyland, L.R.: Is video gaming, or video game addiction, associated with depression, academic achievement, heavy episodic drinking, or conduct problems? J. Behav. Addict. 3(1), 27–32 (2014)CrossRefGoogle Scholar
  5. 5.
    Cambria, E., Olsher, D., Rajagopal, D.: SenticNet 3: a common and common-sense knowledge base for cognition-driven sentiment analysis. In: Proceedings of the 28th AAAI Conference on Artificial Intelligence, pp. 1515–1521. AAAI Press (2014)Google Scholar
  6. 6.
    Coppersmith, G., Dredze, M., Harman, C., Hollingshead, K., Mitchell, M.: CLPsych 2015 shared task: depression and PTSD on Twitter. In: Proceedings of the 2nd Workshop on Computational Linguistics and Clinical Psychology (CLPsych): From Linguistic Signal to Clinical Reality, pp. 31–39 (2015)Google Scholar
  7. 7.
    Coppersmith, G., Ngo, K., Leary, R., Wood, A.: Exploratory analysis of social media prior to a suicide attempt. In: Proceedings of the 3rd Workshop on Computational Lingusitics and Clinical Psychology (CLPSych), pp. 106–117 (2016)Google Scholar
  8. 8.
    De Choudhury, M., Gamon, M., Counts, S., Horvitz, E.: Predicting depression via social media. In: Proceedings of the 7th International AAAI Conference on Weblogs and Social Media (ICWSM), p. 2 (2013)Google Scholar
  9. 9.
    Granic, I., Lobel, A., Engels, R.C.: The benefits of playing video games. Am. Psychol. 69(1), 66 (2014)CrossRefGoogle Scholar
  10. 10.
    Hammond, K.W., Laundry, R.J., OLeary, T.M., Jones, W.P.: Use of text search to effectively identify lifetime prevalence of suicide attempts among veterans. In: Proceedings of the 46th Hawaii International Conference on System Sciences (HICSS), pp. 2676–2683. IEEE (2013)Google Scholar
  11. 11.
    Hollingshead, K., Ireland, M.E., Loveys, K.: Proceedings of the Fourth Workshop on Computational Linguistics and Clinical Psychology—From Linguistic Signal to Clinical Reality (2017)Google Scholar
  12. 12.
    Hutto, C.J., Gilbert, E.: VADER: a parsimonious rule-based model for sentiment analysis of social media text. In: Proceedings of the 8th International AAAI Conference on Weblogs and Social Media (ICWSM), June 2014Google Scholar
  13. 13.
    Jones, K.S., Walker, S., Robertson, S.E.: A probabilistic model of information retrieval: development and comparative experiments: Part 2. Inf. Process. Manag. 36(6), 809–840 (2000)CrossRefGoogle Scholar
  14. 14.
    Kwak, H., Lee, C., Park, H., Moon, S.: What is Twitter, a social network or a news media? In: Proceedings of the 19th International Conference on World Wide Web (WWW), pp. 591–600. ACM (2010)Google Scholar
  15. 15.
    Landwehr, N., Hall, M., Frank, E.: Logistic model trees. Mach. Learn. 59(1–2), 161–205 (2005)CrossRefzbMATHGoogle Scholar
  16. 16.
    Lin, H., Jia, J., Guo, Q., Xue, Y., Li, Q., Huang, J., Cai, L., Feng, L.: User-level psychological stress detection from social media using deep neural network. In: Proceedings of the 22nd ACM International Conference on Multimedia, pp. 507–516. ACM (2014)Google Scholar
  17. 17.
    Losada, D.E., Crestani, F.: A test collection for research on depression and language use. In: Fuhr, N., Quaresma, P., Gonçalves, T., Larsen, B., Balog, K., Macdonald, C., Cappellato, L., Ferro, N. (eds.) CLEF 2016. LNCS, vol. 9822, pp. 28–39. Springer, Cham (2016). Scholar
  18. 18.
    Losada, D.E., Crestani, F., Parapar, J.: eRISK 2017: CLEF lab on early risk prediction on the internet: experimental foundations. In: Jones, G.J.F., Lawless, S., Gonzalo, J., Kelly, L., Goeuriot, L., Mandl, T., Cappellato, L., Ferro, N. (eds.) CLEF 2017. LNCS, vol. 10456, pp. 346–360. Springer, Cham (2017). Scholar
  19. 19.
    McClellan, C., Ali, M.M., Mutter, R., Kroutil, L., Landwehr, J.: Using social media to monitor mental health discussions - evidence from Twitter. J. Am. Med. Inform. Assoc. (JAMIA) (2016).
  20. 20.
    Milne, D.N., Pink, G., Hachey, B., Calvo, R.A.: CLPsych 2016 shared task: triaging content in online peer-support forums. In: CLPsych@ HLT-NAACL, pp. 118–127 (2016)Google Scholar
  21. 21.
    Moreno, M.A., Ton, A., Selkie, E., Evans, Y.: Secret society 123: understanding the language of self-harm on Instagram. J. Adolesc. Health 58(1), 78–84 (2016)CrossRefGoogle Scholar
  22. 22.
    Nguyen, T., Phung, D., Dao, B., Venkatesh, S., Berk, M.: Affective and content analysis of online depression communities. IEEE Trans. Affect. Comput. 5(3), 217–226 (2014)CrossRefGoogle Scholar
  23. 23.
    Platt, J.: Sequential minimal optimization: a fast algorithm for training support vector machines. Technical report MSR-TR-98-14, Microsoft, April 1998Google Scholar
  24. 24.
    Ramrakha, S., Paul, C., Bell, M.L., Dickson, N., Moffitt, T.E., Caspi, A.: The relationship between multiple sex partners and anxiety, depression, and substance dependence disorders: a cohort study. Arch. Sex. Behav. 42(5), 863–872 (2013)CrossRefGoogle Scholar
  25. 25.
    Rice, S.M., Goodall, J., Hetrick, S.E., Parker, A.G., Gilbertson, T., Amminger, G.P., Davey, C.G., McGorry, P.D., Gleeson, J., Alvarez-Jimenez, M.: Online and social networking interventions for the treatment of depression in young people: a systematic review. J. Med. Internet Res. (JMIR) 16(9), e206 (2014)CrossRefGoogle Scholar
  26. 26.
    Santorini, B.: Part-of-speech tagging guidelines for the Penn Treebank project, 3rd revision. Technical reports (CIS), p. 570 (1990)Google Scholar
  27. 27.
    Schou Andreassen, C., Billieux, J., Griffiths, M.D., Kuss, D.J., Demetrovics, Z., Mazzoni, E., Pallesen, S.: The relationship between addictive use of social media and video games and symptoms of psychiatric disorders: a large-scale cross-sectional study. Psychol. Addict. Behav. 30(2), 252 (2016)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing AG, part of Springer Nature 2018

Authors and Affiliations

  • Antoine Briand
    • 1
  • Hayda Almeida
    • 1
  • Marie-Jean Meurs
    • 1
    Email author
  1. 1.Université du Québec à MontréalMontréalCanada

Personalised recommendations