Utilizing Temporal Psycholinguistic Cues for Suicidal Intent Estimation

Mathur, Puneet; Sawhney, Ramit; Chopra, Shivang; Leekha, Maitree; Ratn Shah, Rajiv

doi:10.1007/978-3-030-45442-5_33

Puneet Mathur¹⁵,
Ramit Sawhney¹⁶,
Shivang Chopra¹⁷,
Maitree Leekha¹⁷ &
…
Rajiv Ratn Shah¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12036))

Included in the following conference series:

European Conference on Information Retrieval

6652 Accesses
9 Citations

Abstract

Temporal psycholinguistics can play a crucial role in studying expressions of suicidal intent on social media. Current methods are limited in their approach in leveraging contextual psychological cues from online user communities. This work embarks in a novel direction to explore historical activities of users and homophily networks formed between Twitter users for extracting suicidality trends. Empirical evidence proves the advantages of incorporating historical user profiling and temporal graph convolutional modeling for automated detection of suicidal connotations on Twitter.

P. Mathur and R. Sawhney—Equal contribution

You have full access to this open access chapter, Download conference paper PDF

Detection of Suicidal Tendency in Users by Analysing the Twitter Posts

DARE to Care: A Context-Aware Framework to Track Suicidal Ideation on Social Media

An Approach to Analyse a Hashtag-Based Topic Thread in Twitter

1 Introduction

Suicidal ideation detection is a well studied problem in social media analysis. Various works have tried to identify linguistic patterns correlated with suicidality intent. Despite the sustained efforts from the community, most approaches ignore the psychological relevance of temporal characteristics of suicidal behaviour. Moreover, there has been limited explorations in the space of homophily networks to identify collusive depressive users. We hypothesize that the contextual information embedded in social media engagement and historical activities of users can lead to substantial improvements in automated identification of suicidal ideation. We look beyond linguistic cues into temporal signals throughout this work, with the help of a publicly available dataset given by [14] of 34,306 tweets on suicidality detection.

2 Related Work

2.1 Challenges on Social Media

The growth of social media websites hosts a number of challenges such as cyberbullying, suicide pacts, and radicalism that motivate suicidal behavior and impact the mental health of the users [10]. The associativity of suicide-related verbalizations on social media websites has been found to be strongly related to potential suicidal attempts. Prior studies show how suicidal intent declarations were significantly more assortative than chance, at times connected till 6 degrees of separation [5]. A patient’s social media profile can help medical experts gain perspective into their mental health status and identify those at critical risk for suicide attempts [15]. The potential of technological interventions for suicidal risk assessment and mitigation needs to be explored in detail.

2.2 Text-Based Approaches

Various works have been recently proposed with an objective of automating the detection of social media posts expressing suicide ideation using textual information [3, 7, 17]. [4] performed a semi-automated content-based analysis on a small number of tweets related to depression in order to derive certain qualitative insights into the behavior of users displaying suicidal behavior. Self-disclosure helps to facilitate psychological well being in individuals with mental illness [2]. Textual descriptions of social media disclosures have been extensively studied in the past [7]. [19] explored deep learning based supervised classifiers for suicidal ideation detection.

2.3 Psycho-Linguistic Analysis

[13] used social graph based features and gained considerable improvement in the task of abuse detection. [16] performed a psycho-linguistic analysis of online users for a similar task. [1] tried to link users’ psychological features such as personality traits including personalities, sentiment and emotion for cyberbulling and trolling. The contributions that we make in this work are different from previous efforts as there has been hardly any attempt to take a combined multi-faceted approach for solving the task of suicidal ideation in Twitter.

2.4 Signals from Temporal Data

Temporal graphs can capture the relationships in data with time so as to model new events and comparison to related entities and historical states [18]. [9] detected groups based on interesting features of the time-evolving networks. It studied several clustering frameworks for time-evolving networks for detecting group structure. [6] performed temporal sentiment analysis for early detection of cyberbulling and suicide ideation of a user through graph-based data mining approaches.

3 Methodology

The proposed methodology looks beyond text classifiers and leverages tweeting history of users as well as their social network communication patterns. User-based features were extracted from the historical tweeting activity and inter-user interactions was modeled as a social graph. The methodology is two-fold consisting of historical signal modeling and temporal graph convolutional modeling.

3.1 Classification Network

In order to learn from the textual information available in the raw tweets, we trained a BLSTM + Attention network [20]. We train a BLSTM model with 100 LSTM units, dropout rate of 0.25 and a recurrent dropout rate of 0.2. The attention layer was followed by another dropout layer of 0.2. This was followed by two dense layers having 256 units and 2 units, respectively.

3.2 Temporal Modeling of Suicidal Tendency

Motivation: The idea of temporal modeling of suicidal tendencies is inspired by [11] with additions. According to [11], a representation for the historical activity can be formulated as a temporal weighting scheme $\phi _i$ which is a sum of two independent time varying functions of suicidality - ideation build-up $\lambda _i(t)$ and sinusoidal episodes $\mu _i(t)$. Extrapolating from this, we add a third independent time-varying function - white Gaussian noise $z_{i}(t)$. Let $\varDelta t_i$ be the time offset from the original tweet and the temporal representation function z be given by Eq. 1.

$$\begin{aligned} z(u,H)=\sum _{h_i \in H}{} \phi _i(\varDelta t) f(h_i) \end{aligned}$$

(1)

Table 1. Hyper parameters for Eq. 3 [11]

Full size table

Suicidal Ideation Build-Up: Each user’s historical tweets can be modeled as an exponential function in time given by Eq. 2 where $\alpha $ and $\beta $ are hyper parameters tuned over training data.

$$\begin{aligned} \lambda _i(\varDelta t) = \alpha e^{\beta \varDelta t_i} \end{aligned}$$

(2)

Suicidal Episodes: Phased changes in suicidal intent are mathematically represented by Eq. 3. As per [12], the hyper parameters for the same are given by Table 1.

$$\begin{aligned} \mu _i(\varDelta t) = \sum _{1}^{Q}(a_{q}cos(\frac{2\pi q\varDelta t_i}{U}) + b_{q}sin(\frac{2\pi q \varDelta t_i}{U})) \end{aligned}$$

(3)

Temporal Surprise: Similar to any channel medium, social media platforms are prone to noise that adds randomness to the temporal suicidal patterns. The white Gaussian noise is modeled as being derived from a normal distribution with the expectation value of the noise term $\zeta _(t)$ equal to 1.

$$\begin{aligned} \phi _i(\varDelta t)=\lambda _i(\varDelta (t) + \mu _i(\varDelta t) + \zeta _{i}(\varDelta t) \end{aligned}$$

(4)

For each of the tweet samples, the historical activity representation was an input to logistic regression model to learn temporal embeddings from these features which was used as an input to the final model.

3.3 Graph Convolutional Networks for User Profiling

Learning user representations can be significantly enriched by leveraging information derived from the inter-user interactions in social media channels. For this purpose, Graph Convolutional Networks (GCN) [8] can be effectively utilized that are capable of modeling social interactions in the form of features of nodes in the graph and allow contextual learning of information with respect to a node’s neighbourhood.

Temporal GCN: We tried to incorporate the historical views into the extended graph by constructing time weighted TF-IDF vectors of the historical tweets. The author nodes were modified to consist of temporal weighting of TF-IDF representation of tweets. Let the TF-IDF vector $f_{k}^{t}$ of tweet at timestamp t for $k^{th}$ author be defined by Eq. 5, where $C_{k}$ is the global noise parameter, $\hbar $ controls the margin of influence of a user on its neighbours social activity and $\omega $ is the rate of decay of the suicidal sentiment. The external parameters $C_{k}$, $\hbar _{k}$ and $\omega _{k}$ are learnt from the training portion of the dataset in an unsupervised fashion.

$$\begin{aligned} f^{t} = \hbar _{k} \exp ^{\omega _{k} \varDelta t} \end{aligned}$$

(5)

4 Experiments

4.1 Data Description and Setup

To gauge the effectiveness of our proposed approach, we use the dataset from SNAP-BATNET [14] which consists of 34, 306 tweets with 3, 984 of them suicidal ideations. For each of these users, the tweet timelines were also collected to create the set of historical tweets. 10-fold stratified cross-validation was employed to evaluate models on each of the 10 train-val splits. The hyper parameters for the temporal weighted combination were tuned using a grid search over the grid $\alpha =\{0.1, 0.5, 1.0\}$, $\beta = \{0, 0.01, 0.1, 1\}$, $U = \{1, 2 ..., 7\}$ yielding $\alpha = 0.5, \beta = 1, U = 7$. $t_0$ was assigned to time series points with values equal to $argmax (|\mu |)_i$.

Table 2. Performance analysis

Full size table

5 Results and Ablation Analysis

The ablation study of experimented features presented in Table 2 highlights the significance of temporal features extracted from social media in suicide ideation risk assessment. Temporal GCN provides a substantial gain over text in prediction confidence due to the user interactions. Additionally, it is interesting to observe the ability of the GCN model to better represent historical suicidal signals in comparison to naive historical and textual features to a sufficient degree. Empirically, temporal features help suppress false positives induced by text classifiers that try to overfit on the presence of anecdotal suicidal phrases such as “kill me...hahaha !!” that may be considered as noise in non-suicidal text. The most optimal weights for temporal signal modeling Text + Builtup + Episodic + Surprise were derived to be 0.52, 0.04, 0.04 and 0.32 through cross-validation experiments.

Figure 1 elucidates the impact of including psychological contextual cues on a small sample of connected users from the test dataset. It is evident from the historic trends of Users B and C that they follow a nearly episodic nature with scattered surprises. Analysing the trend plots for Users A and D reveals an inverse build-up thereby demonstrating that there can be either a positive or negative build-up in the suicidal intent of users. All these aspects when captured by our model has led to a statistically significant increase in the model’s performance.

6 Conclusion

In spite of high importance of suicidal ideation identification on social media, little research has focused on looking beyond linguistic patterns. Through our work, we demonstrate that user interactions and past user behaviour are strong indicators of a potentially concerning mental state of online users. In this study, employing both qualitative and quantitative methods, we address this gap by investigating the impact of augmenting text based suicidal ideation detection models with contextual cues based on historical tweeting behavior and social media engagement.

References

Balakrishnan, V., Khan, S., Arabnia, H.R.: Improving cyberbullying detection using Twitter users’ psychological features and machine learning. Comput. Secur. 101710 (2020)
Google Scholar
Balani, S., De Choudhury, M.: Detecting and characterizing mental health related self-disclosure in social media. In: Proceedings of the 33rd Annual ACM Conference Extended Abstracts on Human Factors in Computing Systems, pp. 1373–1378. ACM (2015)
Google Scholar
Benton, A., Mitchell, M., Hovy, D.: Multi-task learning for mental health using social media text. arXiv preprint arXiv:1712.03538 (2017)
Cavazos-Rehg, P.A., et al.: A content analysis of depression-related tweets. Comput. Hum. Behav. 54, 351–357 (2016)
Article Google Scholar
Cero, I., Witte, T.K.: Assortativity of suicide-related posting on social media. Am. Psychol. (2019)
Google Scholar
Chatterjee, A., Das, A.: Temporal sentiment analysis of the data from social media to early detection of cyberbullicide ideation of a victim by using graph-based approach and data mining tools. In: Bhattacharyya, S., Mitra, S., Dutta, P. (eds.) Intelligence Enabled Research. AISC, vol. 1109, pp. 107–112. Springer, Singapore (2020). https://doi.org/10.1007/978-981-15-2021-1_12
Chapter Google Scholar
De Choudhury, M., Gamon, M., Counts, S., Horvitz, E.: Predicting depression via social media. In: Seventh International AAAI Conference on Weblogs and Social Media (2013)
Google Scholar
Kipf, T.N., Welling, M.: Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 (2016)
Lee, K.H., Xue, L., Hunter, D.R.: Model-based clustering of time-evolving networks through temporal exponential-family random graph models. J. Multivar. Anal. 175, 104540 (2020)
Article MathSciNet Google Scholar
Lopez-Castroman, J., et al.: Mining social networks to improve suicide prevention: a scoping review. J. Neurosci. Res. (2019)
Google Scholar
Mathur, P., Sawhney, R., Shah, R.R.: Suicide risk assessment via temporal psycholinguistic modeling (student abstract). In: 2020 Proceedings of the 34th AAAI Conference on Artificial Intelligence. AAAI (2020)
Google Scholar
Mathur, P., Shah, R., Sawhney, R., Mahata, D.: Detecting offensive tweets in Hindi-English code-switched language. In: Proceedings of the Sixth International Workshop on Natural Language Processing for Social Media, pp. 18–26 (2018)
Google Scholar
Mishra, P., Del Tredici, M., Yannakoudakis, H., Shutova, E.: Author profiling for abuse detection. In: Proceedings of the 27th International Conference on Computational Linguistics, pp. 1088–1098 (2018)
Google Scholar
Mishra, R., Sinha, P.P., Sawhney, R., Mahata, D., Mathur, P., Shah, R.R.: SNAP-BATNET: cascading author profiling and social network graphs for suicide ideation detection on social media. In: Proceedings of the 2019 NAACL Student Research Workshop, pp. 147–156 (2019)
Google Scholar
Pourmand, A., Roberson, J., Caggiula, A., Monsalve, N., Rahimi, M., Torres-Llenza, V.: Social media and suicide: a review of technology-based epidemiology and risk assessment. Telemed. e-Health 25(10), 880–888 (2019)
Article Google Scholar
Qian, J., ElSherief, M., Belding, E.M., Wang, W.Y.: Leveraging intra-user and inter-user representation learning for automated hate speech detection. arXiv preprint arXiv:1804.03124 (2018)
Sawhney, R., Manchanda, P., Mathur, P., Shah, R., Singh, R.: Exploring and learning suicidal ideation connotations on social media with deep learning. In: Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, pp. 167–175 (2018)
Google Scholar
Steer, B., Cuadrado, F., Clegg, R.: Raphtory: streaming analysis of distributed temporal graphs. Future Gener. Comput. Syst. 102, 453–464 (2020)
Article Google Scholar
Tadesse, M.M., Lin, H., Xu, B., Yang, L.: Detection of suicide ideation in social media forums using deep learning. Algorithms 13(1), 7 (2020)
Article Google Scholar
Zhou, P., Qi, Z., Zheng, S., Xu, J., Bao, H., Xu, B.: Text classification improved by integrating bidirectional lstm with two-dimensional max pooling. arXiv preprint arXiv:1611.06639 (2016)

Download references

Author information

Authors and Affiliations

University of Maryland, College Park, USA
Puneet Mathur
Netaji Subhas Institute of Technology, Delhi, India
Ramit Sawhney
Delhi Technological University, Delhi, India
Shivang Chopra & Maitree Leekha
MIDAS Labs, IIIT Delhi, New Delhi, India
Rajiv Ratn Shah

Authors

Puneet Mathur
View author publications
You can also search for this author in PubMed Google Scholar
Ramit Sawhney
View author publications
You can also search for this author in PubMed Google Scholar
Shivang Chopra
View author publications
You can also search for this author in PubMed Google Scholar
Maitree Leekha
View author publications
You can also search for this author in PubMed Google Scholar
Rajiv Ratn Shah
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Puneet Mathur .

Editor information

Editors and Affiliations

University of Glasgow, Glasgow, UK
Joemon M. Jose
University College London, London, UK
Emine Yilmaz
Universidade NOVA de Lisboa, Lisbon, Portugal
João Magalhães
Universidad Autónoma de Madrid, Madrid, Spain
Pablo Castells
University of Padua, Padua, Italy
Nicola Ferro
Universidade de Lisboa, Lisbon, Portugal
Mário J. Silva
Universidade NOVA de Lisboa, Lisbon, Portugal
Flávio Martins

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mathur, P., Sawhney, R., Chopra, S., Leekha, M., Ratn Shah, R. (2020). Utilizing Temporal Psycholinguistic Cues for Suicidal Intent Estimation. In: Jose, J., et al. Advances in Information Retrieval. ECIR 2020. Lecture Notes in Computer Science(), vol 12036. Springer, Cham. https://doi.org/10.1007/978-3-030-45442-5_33

Download citation

DOI: https://doi.org/10.1007/978-3-030-45442-5_33
Published: 08 April 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-45441-8
Online ISBN: 978-3-030-45442-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Utilizing Temporal Psycholinguistic Cues for Suicidal Intent Estimation

Abstract

Similar content being viewed by others

Detection of Suicidal Tendency in Users by Analysing the Twitter Posts

DARE to Care: A Context-Aware Framework to Track Suicidal Ideation on Social Media

An Approach to Analyse a Hashtag-Based Topic Thread in Twitter

1 Introduction

2 Related Work

2.1 Challenges on Social Media

2.2 Text-Based Approaches

2.3 Psycho-Linguistic Analysis

2.4 Signals from Temporal Data

3 Methodology

3.1 Classification Network

3.2 Temporal Modeling of Suicidal Tendency

3.3 Graph Convolutional Networks for User Profiling

4 Experiments

4.1 Data Description and Setup

5 Results and Ablation Analysis

6 Conclusion

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Utilizing Temporal Psycholinguistic Cues for Suicidal Intent Estimation

Abstract

Similar content being viewed by others

Detection of Suicidal Tendency in Users by Analysing the Twitter Posts

DARE to Care: A Context-Aware Framework to Track Suicidal Ideation on Social Media

An Approach to Analyse a Hashtag-Based Topic Thread in Twitter

1 Introduction

2 Related Work

2.1 Challenges on Social Media

2.2 Text-Based Approaches

2.3 Psycho-Linguistic Analysis

2.4 Signals from Temporal Data

3 Methodology

3.1 Classification Network

3.2 Temporal Modeling of Suicidal Tendency

3.3 Graph Convolutional Networks for User Profiling

4 Experiments

4.1 Data Description and Setup

5 Results and Ablation Analysis

6 Conclusion

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation