Facial image analysis for automated suicide risk detection with deep neural networks

Rashed, Amr E. Eldin; Atwa, Ahmed E. Mansour; Ahmed, Ali; Badawy, Mahmoud; Elhosseini, Mostafa A.; Bahgat, Waleed M.

doi:10.1007/s10462-024-10882-4

Facial image analysis for automated suicide risk detection with deep neural networks

Open access
Published: 03 September 2024

Volume 57, article number 274, (2024)
Cite this article

Download PDF

You have full access to this open access article

Artificial Intelligence Review Aims and scope Submit manuscript

Facial image analysis for automated suicide risk detection with deep neural networks

Download PDF

Amr E. Eldin Rashed¹,
Ahmed E. Mansour Atwa²,
Ali Ahmed³,
Mahmoud Badawy^4,6,
Mostafa A. Elhosseini^5,6 &
…
Waleed M. Bahgat^4,7

1 Altmetric

Abstract

Accurately assessing suicide risk is a critical concern in mental health care. Traditional methods, which often rely on self-reporting and clinical interviews, are limited by their subjective nature and may overlook non-verbal cues. This study introduces an innovative approach to suicide risk assessment using facial image analysis. The Suicidal Visual Indicators Prediction (SVIP) Framework leverages EfficientNetb0 and ResNet architectures, enhanced through Bayesian optimization techniques, to detect nuanced facial expressions indicating mental state. The models’ interpretability is improved using GRADCAM, Occlusion Sensitivity, and LIME, which highlight significant facial regions for predictions. Using datasets DB1 and DB2, which consist of full and cropped facial images from social media profiles of individuals with known suicide outcomes, the method achieved 67.93% accuracy with EfficientNetb0 on DB1 and up to 76.6% accuracy with a Bayesian-optimized Support Vector Machine model using ResNet18 features on DB2. This approach provides a less intrusive, accessible alternative to video-based methods and demonstrates the substantial potential for early detection and intervention in mental health care.

Artificial intelligence assisted tools for the detection of anxiety and depression leading to suicidal ideation in adolescents: a review

Article 22 November 2022

Early detection of depression through facial expression recognition and electroencephalogram-based artificial intelligence-assisted graphical user interface

Article 15 February 2024

FacialCueNet: unmasking deception - an interpretable model for criminal interrogation using facial expressions

Article Open access 07 September 2023

Discover the latest articles, news and stories from top researchers in related subjects.

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Annually, 703,000 individuals lose their lives to suicide, with countless others making attempts. Each suicide is a heart-wrenching tragedy that casts a profound shadow over families, communities, and entire nations, leaving lasting scars on those who remain. Suicide knows no age boundaries, and in 2019, it stood as the fourth major cause of mortality among individuals aged 15–29 on a global scale (WHO 2023).

This tragic phenomenon is frequently intertwined with mental health challenges, encompassing conditions like depression, anxiety, and substance misuse. It is noteworthy that a substantial portion of those who succumb to suicide grapple with underlying mental health disorders. Consequently, the implementation of robust suicide prevention measures and the provision of comprehensive mental health support are of paramount importance. In response to this public health concern, numerous entities, including governmental bodies and various organizations, have launched extensive campaigns and initiatives to tackle the issue head-on.

Preventing suicide has risen to the forefront of global priorities, inspiring extraordinary endeavors focused on raising awareness, advancing research, and improving access to care. Despite these advancements, suicide remains a leading cause of mortality, accompanied by inherent complexities in its prevention. While there is a wealth of well-studied screening and assessment tools available to clinicians, researchers, and educators, the absence of a consensus on a gold-standard for suicide risk assessment and management, coupled with the lack of standardized terminology, presents challenges in accurately identifying risks and effectively preventing suicide outcomes (Silverman et al. 2007).

In our modern, interconnected world, digital platforms, particularly social media, have become the primary means for individuals to express themselves, including their emotions and experiences. However, this shift also brings forth a significant concern: identifying mental health issues, particularly those related to suicidal thoughts and emotional distress, from the images shared on these platforms. This issue is of great importance due to the vast amount of visual content available online, which may contain subtle yet crucial indicators of emotional turmoil and self-harm contemplation. Detecting these signs is complex, often relying on meticulous human observation. Given the sheer volume of online content, there is an urgent need for automated solutions to address this challenge.

Conducting clinical assessments for individuals at risk of self-harm, whether fatal or non-fatal, represents the initial step in suicide prevention efforts. Guidelines have been created to assist healthcare professionals in this demanding task (Jacobs et al. 2010), and assessment tools can complement these clinical evaluations (Bernert et al. 2014; Fochtmann and Jacobs 2015). Several recent evaluations have explored the predictive accuracy of suicide assessment tools, revealing subpar performance in anticipating future suicide attempts and suicides (Herrman et al. 2022). However, these assessments lacked detailed descriptions of selection processes and neglected to consider potential biases.

Furthermore, in recent years, the domain of deep learning has emerged as an indispensable tool, particularly in the realm of face image classification. Deep learning, powered by architectures such as Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs), has showcased exceptional proficiency in extracting facial features, recognizing emotional expressions, and the precise differentiation of individuals. This confluence of deep learning’s capabilities with the intricate domain of face image classification opens up novel horizons in technology and applications. It holds the promise of delivering systems that are not only more accurate and efficient but also exceptionally versatile in the domains of face recognition and image analysis.Recent advancements in AI and Machine Learning have significantly impacted the field of facial medicine, demonstrating the potential for these technologies to enhance diagnostic accuracy and patient outcomes. For instance, Umirzakova et al. (2023) presented a deep learning-driven diagnosis approach for segmenting stroke and Bell’s palsy, showcasing the utility of AI in medical diagnostics.

The scope and implications of this research extend far beyond its technical contributions. It underscores the potential of artificial intelligence, particularly deep learning, in bolstering mental health support and suicide prevention. In a world where digital platforms continue to reshape the landscape of human connection and sharing, it is our imperative duty to harness technology to preserve lives and extend timely intervention where it is most needed.

This study presents a novel approach to suicide risk assessment through the analysis of facial features using deep learning techniques. Unlike previous studies that have relied on video and verbal cues, this work is pioneering in its focus on static images, which presents a unique set of challenges and opportunities. By leveraging advanced machine learning models and Bayesian optimization, the study aims to extract meaningful patterns from facial expressions that could indicate suicidal tendencies. Our principal objective is to provide a scalable, proactive, and effective solution for detecting suicide risk through image analysis within the digital landscape. The contributions of this study can be summarized as follows:

Unprecedented Research Point: Emphasize the originality of the research, focusing on the rarity of studies addressing suicide detection through facial analysis. Highlight the limited literature on this subject, making our work a pioneering contribution.
Novel Public Dataset Creation: Elaborate on creating an exclusive public dataset containing 319 images drawn from social media sources. Highlight the dataset’s remarkable diversity and delve into the intricacies encountered during the collection process, with a specific emphasis on ensuring image authenticity and rigorous case verification.
Introducing Suicidal Visual Indicators Prediction (SVIP) Framework: Present the SVIP framework designed to predict suicidal behavior through facial analysis. Explain the two dataset versions and the primary approaches, combining pre-trained deep learning and machine learning models to maximize predictive accuracy.
Feature extraction: Implement feature extraction techniques using state-of-the-art neural networks such as ResNet18 and EfficientNetb0, which are instrumental in identifying subtle facial cues associated with suicidal tendencies.
Hyperparameter Optimization: Employment of Bayesian optimization for fine-tuning Support Vector Machine (SVM) models has achieved state-of-the-art accuracy in classification tasks.
Automated Machine Learning: The incorporation of Automated Machine Learning (AutoML) techniques, such as Lazy Predict, TPOT (Tree-based Pipeline Optimization Tool), and Orange AutoML, to streamline the model selection process and enhance predictive performance without extensive manual intervention.
Enhancing Model Interpretability: Utilization of explainable AI techniques, such as Gradient-weighted Class Activation Mapping (Grad-CAM), Local Interpretable Model-agnostic Explanations (LIME), and Occlusion Sensitivity, to provide insights into the decision-making process of the models, enhancing the interpretability of the predictions.
Performance Evaluation: Comprehensive performance evaluation using two distinct datasets, demonstrating the robustness and effectiveness of the proposed approach across different data scenarios. A comparative analysis with existing state-of-the-art techniques showcasing the superior performance of the proposed method in the context of suicide risk assessment through facial analysis.

The paper’s structure is as follows: Sect. 2 delves into the relevant literature. Section 3 provides background. Section 4 outlines the methodology applied and in-depth description of the materials employed. Section 5 presents the results obtained using the proposed approach. Section 6 compares the proposed method with state-of-the-art approaches. Section 7 offers the concluding remarks and future work.

2 Literature review

As Fig. 1 depicts, suicide analysis can be conducted using both verbal and non-verbal methods in the quest to understand and prevent self-harm and suicidal tendencies comprehensively. An overview of these dual approaches is as follows:

2.1 Verbal analysis can be categorized into:

Text-Based Analysis: This involves the scrutiny of written or spoken content like social media posts, text messages, or speech transcripts. Its purpose is to identify linguistic cues and sentiments associated with suicidal thoughts.
Natural Language Processing (NLP): Leveraging NLP techniques, experts uncover patterns and sentiments in the language that may signal suicidal ideation.
Sentiment Analysis: The emotional tenor and sentiment conveyed in written or spoken text are analyzed to unveil signs of depression, hopelessness, or suicidal ideation.
Topic Modeling: Common themes or topics within textual data, especially those tied to distress, self-harm, or suicide, are identified.

2.2 Non-verbal analysis can be categorized into:

Facial Analysis: By utilizing computer vision techniques, facial expressions are evaluated, shedding light on emotions like sadness, hopelessness, or agitation, which are indicative of emotional distress.
Eye Movement Analysis: Eye movement patterns and gaze direction are explored, offering insights into signs of discomfort or avoidance when discussing suicide-related subjects.
Voice Analysis: Acoustic features in speech, such as pitch, tone, and speech rate, are scrutinized to detect variations linked to distress or suicidal ideation.
Physiological Measurements: Monitoring physiological cues, including heart rate, skin conductance, and pupil dilation, helps identify emotional states associated with suicidal tendencies.

Utilizing machine learning and data analysis methodologies enables the creation of models for early detection and intervention, potentially saving lives. Nevertheless, it is of utmost importance to maintain a conscientious awareness of ethical and privacy considerations while dealing with sensitive mental health and suicide-related data. The following subsections delve into the research directions within verbal and non-verbal analysis.

2.3 Verbal analysis research directions

In recent years, there has been a considerable focus on understanding the intricate interplay between language usage and mental health, intending to shed light on the early detection and prevention of suicidal ideation. The following sections provide an extensive review of research related to verbal analysis for suicide prediction, encompassing the evaluation of suicide notes and other pertinent studies in this domain.

Well-established linguistic features commonly employed in the field of psychiatry, such as LIWC (Robinson and Lumontod 2020), emotion-based indicators (Masuda et al. 2013), and insights from suicide notes (Pestian et al. 2010). While these linguistic approaches offer valuable tools for analyzing language in isolation, it’s important to recognize their limitations when dealing with extensive and diverse datasets. Nobles et al. (2018) developed a model to identify periods of suicidality by analyzing text messages from individuals with a history of suicidal thoughts. Their unique SMS dataset allowed them to identify distinct communication patterns preceding suicide attempts. They used a deep neural network (DNN) to model these patterns, achieving notable performance with accuracy, F1-Score, Recall, and Precision rates of 70.0, 75.0, 81.0, and 71.4, respectively.

Jingcheng et al. (2018) used deep learning and transfer learning to identify psychiatric stressors related to suicide in Twitter data. They employed a convolutional neural network (CNN) for the binary classification of suicide-related tweets. Recurrent neural networks (RNN) were used for psychiatric stressor recognition. Deep learning surpassed traditional methods, with CNN achieving 78% precision and an F-1 measure of 83%. The RNN-based recognition achieved an F-1 measure of 53.25% (exact match) and 67.94% (inexact match). Transfer learning from clinical notes improved the F-1 measure to 54.9% (exact match). Tadesse et al. (2019) aimed to detect suicidal ideation in social media posts, focusing on early identification. They utilized deep learning and machine learning, working with Reddit content. Their model combined Long Short-Term Memory (LSTM) and Convolutional Neural Network (CNN) components, outperforming other classification models. The LSTM-CNN architecture, especially when integrated with word embeddings, achieved remarkable performance with accuracy, F1-Score, Recall, and Precision rates of 93.8, 93.4, 94.1, and 93.2, respectively.

Aldhyani et al. (2022) developed a machine-learning system to detect suicidal ideation and behavioral changes in social media posts. They used Reddit datasets, utilizing TF-IDF and Word2Vec for text representation. Their approach combined deep learning (CNN-BiLSTM) and machine learning (XGBoost) algorithms. In two experiments, the CNN-BiLSTM model outperformed XGBoost, achieving 95% accuracy for suicidal ideation detection, surpassing XGBoost’s 91.5%. Conversely, using LIWC features, XGBoost outperformed CNN-BiLSTM. Baghdadi et al. (2022) proposed tracking depression and mental disorders in Arabic social media data. This presented unique challenges due to the Arabic language’s wide usage and complex syntax. The study involved scraping and annotating Arabic tweets, introducing a classification framework for categories like Normal or Suicidal. They presented an Arabic tweet preprocessing algorithm, comparing lemmatization, stemming, and various lexical analysis methods. The study conducted experiments on Twitter data with five annotators and reported various performance metrics. The best-performing model achieved a WSM of 95.26% with Arabic BERT models.

Renjith et al. (2022) developed an automated method for detecting behavioral shifts by analyzing social media interactions, employing deep learning and machine learning, including LSTM and CNN models. Their LSTM-Attention-CNN model achieved 90.3% accuracy and a 92.6% F1-score, surpassing baseline models.

Chadha and Kaushik (2022) tokenized a dataset of 20,000 Reddit posts using word2vec techniques and proposed the ACL (Attention Convolution Long Short-Term Memory) model, achieving an 88.48% accuracy, 87.36% precision, 90.82% F1 score, and 79.23% specificity. The ACL model with Random embedding reached a 94.94% recall. Ghosal and Jain (2023) presented a framework to differentiate depression-related content and suicidal risk, utilizing fastText embeddings, TF-IDF vectors, and the XGBoost classifier. Their approach achieved an accuracy of 71.05%, an AUC of 78%, and a weighted F1-score of 71% on a Reddit dataset.

Burkhardt et al. (2023) developed an innovative model using Bidirectional Encoder Representations from Transformers (BERT) to identify suicide risk signals in social media posts. Their multi-stage transfer learning approach optimized the model’s performance and introduced a novel metric for assessing its ability to expedite response times for distressed individuals. This method achieved an enhanced F1 score of 79.7% from the initial 73.4%. Implementing this model could reduce response times by 15 min for urgent messages.

2.4 Non-verbal analysis research directions

Suicidal intent is strongly associated with co-occurring mental health conditions, particularly depression and anxiety. Individuals displaying suicidal tendencies exhibit symptoms commonly linked to depression and anxiety, such as elevated stress levels, persistent fatigue, and a tendency to withdraw from social interactions Waern et al. (2016). Furthermore, advanced research involving statistical analyses and machine learning-based assessments of facial and ocular movement patterns has unearthed specific patterns. For instance, patients diagnosed with depression frequently manifest narrower eye openings, extended blink durations, and slower, less frequent head movements, which may be indicative of fatigue and a preference for avoiding eye contact (Alghowinem et al. 2013a; Waxer 1977; Alghowinem et al. 2013b). These nuanced findings offer valuable insights into the intricate relationship between suicidal ideation and these concurrent mental health conditions. An expanding body of research in mental disorder detection has demonstrated that facial appearance and expressions can convey meaningful non-verbal cues (Dhelim et al. 2023). These cues can be effectively analyzed to evaluate a range of mental disorders, including depression (Pampouchidou et al. 2017), bipolar disorder (Venn et al. 2004), and social anxiety (Silvia et al. 2006).

In a study by Laksana et al. (2017), facial behaviors in interviews from three hospitals were analyzed to differentiate patients with suicidal ideation and mental health disorders. Significant differences were observed, particularly in Duchenne smile occurrences. The study also noted the influence of interview stages on these behaviors. Facial expressions hold promise as markers for identifying such issues. Three predictive models were compared: SVM, Random Forest, and Multinomial Naïve Bayes, with Random Forest achieving the highest accuracy at 39.4%.

In their study, Eigbe et al. (2018) explored dynamics in smiles to differentiate genuine from fake smiles in individuals with suicidal ideation and investigated smile frequency across various conversational contexts. They also assessed gaze aversion as a potential behavioral marker. Analyzing 74 interviews with hospital patients, they developed predictive models that showed promise in distinguishing between mental health conditions, particularly identifying those with suicidal tendencies. This research underscores the potential of behavioral cues in detecting mental health conditions during clinical interviews, achieving 69% accuracy in classification.

Shah and colleagues (Shah et al. 2019) investigated behavioral markers associated with expressed suicidal intent in social media videos using a unique annotated dataset. Their research involved statistical hypothesis testing to validate hypotheses from the literature. Employing multimodal predictive modeling, they identified significant markers like silences, slouched shoulders, rapid hand movements, and profanity indicating suicidal intent. Combining visual, acoustic, and language elements, the trimodal approach achieved an AUC of around 71%."

Liu et al. (2022) explore suicide risk assessment and nonverbal behavioral interpretation using statistical analysis, feature selection, and machine learning. They analyze unique data on eye and head signals, discovering that high-risk individuals display psychomotor retardation, anxiety, and depression symptoms as behavioral cues like eye contact avoidance, slower blinks, and downward eye gaze. Classification methods effectively distinguish levels of suicide risk, consistently achieving over 98% accuracy. Table 1 compares conducted research studies on suicide risk detection.

Table 1 Comparative studies on suicide risk detection

Facial image analysis for automated suicide risk detection with deep neural networks

Abstract

Similar content being viewed by others

Artificial intelligence assisted tools for the detection of anxiety and depression leading to suicidal ideation in adolescents: a review

Early detection of depression through facial expression recognition and electroencephalogram-based artificial intelligence-assisted graphical user interface

FacialCueNet: unmasking deception - an interpretable model for criminal interrogation using facial expressions

Explore related subjects

1 Introduction

2 Literature review

2.1 Verbal analysis can be categorized into:

2.2 Non-verbal analysis can be categorized into:

2.3 Verbal analysis research directions

2.4 Non-verbal analysis research directions

2.5 Research gap and motivation

3 Background

3.1 Dataset utilization in suicide detection

3.2 Artificial intelligent suicide detection models

3.3 Automated machine learning (AutoML) in data science

3.4 Lazy predict

3.5 Tree-based Pipeline Optimization Tool (TPOT)

3.6 Orange AutoML

3.6.1 The Bayesian Optimization Algorithm (BOA)

3.7 Performance metrics

4 Materials and methods

4.1 Used datasets

4.2 Ethical considerations and methodology in utilizing image-based data for suicide research

4.3 Dataset preprocessing

4.4 The proposed Suicidal Visual Indicators Prediction (SVIP) framework

5 Results and discussion

5.1 Performance evaluation of the first phase

5.1.1 Performance evaluation results using the original dataset (DB1)

5.1.2 Performance evaluation results for dataset after cropping images (DB2)

5.2 Performance evaluation of the second phase

5.3 Performance evaluation of the third phase

6 Comparison with state-of-the-art techniques

7 Conclusions and future work

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation