1 Introduction

Mental stress is a widespread and persistent problem faced by people worldwide, regardless of ethnicity, age, religion, or gender [1, 2]. The Global Organization for Stress reported mental stress as the number one health problem among high school students. In addition, the American Institute of Stress stated that 48% of people have difficulty sleeping because of stress [3]. Stress is a condition that limits a person's ability to function and disrupts daily routines. In psychology, stress is defined as a process of two consecutive phases: the perception of a stressor or a situation, and the body's response to it [4, 5]. Stress can be triggered when a person encounters adverse stimuli, which may be mental (tasks that test cognitive capacity or rapidly changing tasks), physical (such as sleep deprivation or painful stimuli), or emotional (such as emotional videos). These stressors can be internal, depending on the individual's personality, thinking, and perception, or external, such as financial debt or relationship problems [7]. The body's reaction to a stimulus is known as the stress response, in which the body elicits a pattern of behavioral, cognitive, and affective responses to cope with the situation [6].

Mental stress is categorized into two main types: acute and chronic stress. When a person is exposed to a stressor for a short duration, such as a job interview or public speaking, the result is acute stress. Several changes in the body's physiological processes occur to help the person cope with the stressful situation, including the release of stress hormones such as adrenaline, noradrenaline, and cortisol that supply the body with instant energy. Later, the parasympathetic nervous system helps regulate the body back to homeostasis (its normal condition) without any significant harm [8]. On the other hand, frequent and long-term exposure to stressors, including bad relationships, stressful jobs, and poor sleep habits, may result in chronic stress. Continuous exposure to stressful situations has deleterious consequences, affecting an individual's mental and physical health [9].

Stress is a major threat that leads to a wide variety of health problems [10]. Several studies have shown that mental stress contributes to diseases such as hypertension [11], stroke [12], coronary artery disease [13], cardiac arrest [14], exhaustion of the muscular system and persistent pain [15], as well as psychological disorders such as anxiety and depression [16]. According to [17], around 35% of the somatic symptoms patients report cannot be explained by any physical cause; the authors argued that stress may be the leading driver of these symptoms, underscoring how dangerous stress is to one's health. For these reasons, researchers have proposed and developed several ways of assessing stress levels early on to prevent the harmful health consequences associated with it.

Mental stress activates the sympathetic branch of the autonomic nervous system, which affects the body psychologically, behaviorally, and physiologically [18]. Clinicians and psychiatrists tend to evaluate an individual's mental stress by assessing its psychological effects with self-report questionnaires, which are the most prevalent approach to evaluation. Several questionnaires have been used to assess stress, including the Perceived Stress Scale [19,20,21], the Daily Stress Inventory, and the Relative Stress Scale. Nevertheless, their use is debated, as they are subjective and prone to invalid answers and errors arising from social desirability bias and response bias [22]. Alternatively, behavioral responses, which can be visual, vocal, or nonverbal indications such as rapid eye movement and body gestures, have been used to assess stress [23]. However, such behavior can be altered under conscious control.

Physiological changes in the body, on the contrary, are involuntary in the sense that they are influenced directly by the autonomic nervous system [23]. They can therefore provide an objective means of evaluating mental stress compared to the methods stated above. Physiological measurements include pupil diameter [24], skin temperature [25], eye gaze [26], voice [27], heart rate variability [28], blood volume pressure [29], and electrodermal conductance [30]. Note that these measurements have limitations, as they are influenced by many factors other than mental stress, such as the person's health and environmental conditions. For example, electrodermal conductance is highly sensitive to skin diseases and to environmental conditions such as humidity and temperature [31, 32]. Cortisol levels are also highly variable, as they are easily influenced by several factors such as the circadian rhythm (i.e., fluctuating during the day), physical activity level, eating, specific medications, and certain diseases [33, 34].

In addition, many researchers have employed neuroimaging techniques to evaluate mental stress directly or indirectly. These methods include functional near-infrared spectroscopy [35,36,37], electroencephalography (EEG) [38,39,40,41], positron emission tomography [42, 43], and functional magnetic resonance imaging [44, 45]. Among these methods, EEG is the most prevalent technique used to study the brain's condition and function, in clinical applications as well as research studies. The main advantages of EEG over other neuroimaging techniques are its high temporal resolution, modest set-up cost, and simplicity of use. EEG is a non-invasive method (i.e., it does not require surgery) that measures and records the oscillations generated by the electrical activity of the brain via electrodes placed over the scalp [46]. EEG recordings have a peak-to-peak amplitude of no more than 100 µV; any recorded signal of a higher amplitude is an artifact. There are standard EEG frequency bands, each corresponding to a specific mental state: delta (1–4 Hz), theta (4–8 Hz), alpha (8–13 Hz), beta (13–30 Hz), and gamma (> 30 Hz) [47].

To perform mental stress assessment using the EEG modality, two major steps are undertaken: feature extraction and selection, followed by stress classification. Features extracted from EEG can be roughly categorized into three main types: time-domain (also known as temporal) features, frequency-domain (also known as spectral) features, and statistical features. These features are then fed to various machine learning classifiers to assess an individual's level of stress.

Machine learning is a domain of artificial intelligence that uses past data patterns and trends to make predictions about future data. Machine learning offers a variety of algorithms, including Decision Trees, Polynomial Classifiers, Random Forests (RF), Support Vector Machines (SVMs), Naive Bayes (NB), boosted classifiers, and more. For detecting mental stress from biological signals, the most common machine-learning techniques include K-nearest neighbors (KNN), Logistic Regression, SVM, and Random Forest [48]. SVM [49], Logistic Regression (LR) [50], Naive Bayes [51], KNN [52], and Linear Discriminant Analysis (LDA) [53] have been the most significant when dealing with EEG signals specifically. However, the most critical task in traditional machine learning is feature selection, which has a great effect on the classification results [54]. For instance, Saeed et al. [55] reported an accuracy of 65.96% when applying KNN with combined features such as alpha asymmetry, beta, and gamma waves, whereas Darzi et al. [56] achieved an accuracy of 90.0% when power spectral density, laterality index, correlation coefficient, and phase-slope index were used as features for the KNN classifier. In recent years, there has been an increasing trend in the use of deep learning architectures for evaluating mental stress [57]. Not only does deep learning eliminate the need for time-consuming feature selection, but it also does not require human intervention or prior expert knowledge [58]. Deep learning offers several models, such as Recurrent Neural Networks (RNNs), Convolutional Neural Networks (CNNs), Deep Belief Networks (DBNs), and Long Short-Term Memory networks (LSTMs) [59]. This paper aims to fill a knowledge gap by reviewing EEG-related deep learning algorithms for the evaluation of mental stress, with a focus on CNNs and LSTMs, as they are the most widely used deep learning models for this task.

Research studies applying deep learning classification methods to EEG signals for assessing mental stress have yielded inconsistent findings. The same deep learning technique, such as CNN, has produced a wide range of accuracies across different studies; for example, raw EEG data produced classification accuracies of around 62%, while spectral and topographical representations produced accuracies of up to 88%. In light of this challenge, our work conducts a comprehensive review of published papers related to mental stress assessment and classification using deep learning architectures. Our goal is not only to assess the existing body of research but also to propose potential avenues for future investigations.

While prior studies have commendably provided comprehensive reviews of EEG signal classification through machine learning techniques [31, 60], this paper distinguishes itself as the pioneering effort to deliver a dedicated review centered solely on the application of deep learning methodologies for EEG-based mental stress assessment. This endeavor is of profound significance, as it holds the potential to significantly advance our understanding of the neural underpinnings of mental stress, while also shedding light on the analytical intricacies associated with the fusion of both machine and deep learning techniques. Within the scope of this review, we meticulously scrutinize each paper's model architecture, parameter configurations, and their compatibility with the specific EEG input representations employed, emphasizing the pivotal role that deep learning plays in this critical domain of research.

In summary, our paper represents pioneering research as it explores the relationship between various categories of deep learning methods and the most suitable EEG input formulations. Additionally, we offer recommendations concerning the most common and optimal layer counts and the choice of activation functions based on our comprehensive analysis of the reviewed literature. Lastly, our work puts forth suggestions for future research directions, including the exploration of hybrid models and other innovative approaches. This review promises to be of great interest in the field of stress detection, offering valuable insights and recommendations to researchers.

Thus, the contributions of this review paper can be summarized as follows:

  1. This review paper emphasizes the application of Deep Learning (DL) techniques for classifying mental stress using EEG data, distinguishing it from conventional Machine Learning (ML) approaches.

  2. The review follows the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines, employing strict criteria for including and excluding studies.

  3. A bias risk analysis was also performed to evaluate the quality and reliability of the included studies.

  4. The paper explored various DL model architectures and their associated EEG input formulations, providing a comprehensive understanding.

  5. Based on the findings from the review and analysis, recommendations are provided to guide researchers in selecting DL models and suitable EEG input formulations for optimal performance.

  6. The review identifies critical knowledge gaps within the existing literature and suggests areas for future research and advancements in the field of EEG-based mental stress classification using DL.

The rest of the paper is organized as follows. The materials and methods are described in Sect. 2, where the search strategy, the inclusion and exclusion strategy, and the variables of interest are stated. EEG signal extraction and pre-processing are presented in Sect. 3. Section 4 provides theoretical background on deep learning as well as an overview of the basic CNN and LSTM architectures. Section 5 reviews different CNN, LSTM, and hybrid models that have been used to quantify stress levels. The discussion of the findings of the reviewed papers is given in Sect. 6. Finally, Sects. 7 and 8 summarize the main challenges and conclusions of research on EEG-based stress estimation.

2 Materials and methods

2.1 Search strategy

This review was conducted following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. Different databases were used to search for publications, namely Google Scholar, PubMed, Science Direct, and IEEE Xplore. The following combinations of keywords were used:

  • ‘EEG’ AND ‘deep learning’

  • ‘EEG’ AND ‘deep learning’ AND ‘mental stress’ OR ‘mental workload’

  • ‘EEG’ AND ‘mental stress’ OR ‘mental workload’ AND (‘CNN’ OR ‘convolution’ OR ‘deep learning’)

  • ‘EEG’ AND ‘mental stress’ OR ‘mental workload’ AND (‘LSTM’ OR ‘deep learning’)

  • ‘EEG’ AND ‘deep learning’ AND ‘mental stress’ OR ‘mental workload’ AND (‘LSTM’ OR ‘neural network’ OR ‘CNN’ OR ‘convolution’).

In addition, the citation and reference lists of each paper were examined for any additional relevant studies. Figure 1 shows the search strategy used.

Fig. 1. PRISMA study selection flow chart diagram

2.2 Inclusion and exclusion strategy

Duplicates across databases were eliminated, as were studies that did not meet the following inclusion criteria:

  • Only neural networks with at least two hidden layers were considered deep learning and included in this review.

  • Only studies employing deep learning models for classification were included; those using alternative classifiers such as traditional machine learning were excluded.

  • This review focused exclusively on the use of EEG data to classify tasks performed by human subjects. All other studies, such as power analyses, non-human studies, and feature-selection work with no final classification step, were excluded.

  • Studies were included if they pertained to the classification of mental stress, while those related to depression, anxiety, emotional behavior, or suicidal tendencies were excluded.

  • Studies with fewer than 10 subjects were excluded from the analysis and the comparison between architectures and models because, with such small sample sizes, their potential significance in the context of mental stress cannot be definitively established.

  • Due to the fast development of this field of research, only papers published between January 2018 and November 2022 were considered in this review article.

2.3 Risk of bias assessment

The risk of bias in the included papers has been evaluated as demonstrated in Fig. 2.

Fig. 2. Risk of bias graph of the included studies in this review article

2.4 Data of interest

The main variable categories collected from each paper were as follows:

  1. Experimental environment:

    • Stressor type

    • Number of participants/subjects

    • Experiment duration

  2. EEG preprocessing techniques

  3. Type of EEG input:

    • EEG signal feature type

    • Number of electrodes/channels used

    • Electrode location

  4. Deep learning algorithms:

    • Deep learning architecture used

    • Hyperparameters, such as the number of layers, activation functions, etc.

Figure 3 shows the number of papers published over the last five years on evaluating mental stress using EEG and deep learning. It can be seen that applying deep learning to mental stress assessment has attracted increasing interest in recent years.

Fig. 3. Number of publications per year on deep learning, mental stress, and EEG

3 Electroencephalogram

Electroencephalography (EEG) is a neuroimaging technique used to monitor and measure the electrical activity and function of the brain over time. Montages of 32 or 64 electrodes are the most common settings for EEG [47]. In the human brain, different regions are associated with different, specific types of activity and function; thus, the placement of the electrodes on the scalp is essential. The standard method for localizing the electrodes is the international 10–20 electrode system, shown in Fig. 4, in which electrodes are named with letters and numbers referring to the brain lobe and hemisphere over which they are placed [61], as shown in Fig. 5.

Fig. 4. The international 10–20 electrode system localization [62, 63]

Fig. 5. Electrode naming using letters and numbers

Usually, EEG amplitude ranges between 10 and 100 µV. Higher amplitudes or anomalous patterns are considered artifacts, which can be either technical or physiological [64]. This is a concern because artifacts may mimic cognitive activity or mental disorders and thus bias diagnoses or clinical research results [65]. Before EEG signals can be used to evaluate mental stress, they must therefore go through considerable preprocessing. According to [66], non-physiological artifacts can be easily eliminated by carefully controlling the experiment and applying linear filters such as high-pass, band-pass, and notch filters. Physiological artifacts, on the other hand, are difficult to handle because they usually overlap with the recorded EEG signal, requiring more advanced preprocessing methods such as ensemble averaging, optimum filtering, and Independent Component Analysis (ICA).

Table 1 shows some of the most common filtering techniques utilized by the reviewed studies, which mainly include ICA, a high-pass filter, and a low-pass or band-pass filter. Therefore, it is recommended to start with these initial filtering techniques to clean the EEG signals; if needed, more sophisticated methods can then be tested.

Table 1 Filtering techniques used by the reviewed research papers
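As a concrete illustration of this initial cleaning stage, the snippet below applies a band-pass and a notch filter with SciPy. It is a minimal sketch, not taken from any reviewed study; the sampling rate, cut-off frequencies, and 50 Hz mains frequency are illustrative assumptions.

```python
# Minimal sketch of initial EEG cleaning: band-pass plus notch filtering.
import numpy as np
from scipy import signal

fs = 256.0                                   # assumed sampling rate (Hz)
eeg = np.random.randn(8, 10 * int(fs))       # placeholder data: 8 channels, 10 s

# Band-pass 1-45 Hz (4th-order Butterworth, applied forward-backward for zero phase)
b, a = signal.butter(4, [1.0, 45.0], btype="bandpass", fs=fs)
eeg_bp = signal.filtfilt(b, a, eeg, axis=-1)

# Notch filter at 50 Hz (assumed mains frequency) to suppress power-line interference
b_n, a_n = signal.iirnotch(w0=50.0, Q=30.0, fs=fs)
eeg_clean = signal.filtfilt(b_n, a_n, eeg_bp, axis=-1)
```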

In order to perform mental stress assessment using EEG, there are several main steps to follow, namely EEG data acquisition, pre-processing, feature extraction, feature selection, and classification [51]. EEG data acquisition and preprocessing have been discussed previously. Various alternative approaches may be employed in the feature extraction step to extract several distinct features from the same raw data. Features extracted from EEG can be classified into three main categories: time-domain (also known as temporal features), frequency-domain (also known as spectral features), and time–frequency features.

Time-domain features provide temporal information about the EEG signal. Various time-domain features have been used in quantifying mental stress, including Hjorth parameters [78], Higuchi's fractal dimension [79], and entropies such as Shannon entropy [52], approximate entropy [80], and wavelet sum of entropy [81]. Frequency-domain features, on the other hand, provide useful information about the pattern and characteristics of the signal. They can be extracted from the filtered signal using techniques such as the discrete wavelet transform [82] or the short-time Fourier transform [83], which decompose the signal into the clinical EEG frequency bands: delta (1–4 Hz), theta (4–8 Hz), alpha (8–13 Hz), beta (13–30 Hz), and gamma (> 30 Hz). Each of these bands has different frequencies and amplitudes corresponding to specific brain states (such as awake, alert, or asleep). Several spectral features have been implemented for evaluating mental stress, such as power spectral density [84], absolute power [85], relative power [86], wavelet transforms [87], and Gaussian mixtures [88]. Moreover, some papers have utilized EEG time–frequency features in assessing mental stress. This approach allows information to be extracted from both domains, time and frequency, simultaneously [89]. Time–frequency features can be used to produce spectrogram images or topography maps [90], which can then serve as EEG input features for the classifiers [91, 92].
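For illustration, the sketch below computes absolute band powers in the standard clinical bands from Welch's power spectral density. The band edges follow the text above; the sampling rate, window length, and the approximation of band power as a summed PSD are assumptions of this example, not of any reviewed study.

```python
# Minimal sketch of spectral (frequency-domain) feature extraction: band powers.
import numpy as np
from scipy.signal import welch

BANDS = {"delta": (1, 4), "theta": (4, 8), "alpha": (8, 13),
         "beta": (13, 30), "gamma": (30, 45)}

def band_powers(eeg, fs=256.0):
    """Return a (channels x bands) array of absolute band powers."""
    freqs, psd = welch(eeg, fs=fs, nperseg=int(2 * fs), axis=-1)
    df = freqs[1] - freqs[0]                       # frequency resolution
    feats = []
    for lo, hi in BANDS.values():
        mask = (freqs >= lo) & (freqs < hi)
        feats.append(psd[..., mask].sum(axis=-1) * df)   # integrate PSD over the band
    return np.stack(feats, axis=-1)

features = band_powers(np.random.randn(8, 2560))   # 8 channels -> shape (8, 5)
```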

4 Deep learning in mental stress

Deep learning is now one of the most prominent research trends, owing to its tremendous success [85, 93]. The tangible edge of deep learning algorithms over traditional machine learning is the ability to learn and select features jointly with classifier training. Deep learning also shines when handling large amounts of data compared to traditional machine learning, provided the hyperparameters are chosen wisely and the additional data carries more information. Several deep learning architectures—including artificial neural networks, autoencoders, convolutional neural networks, Recurrent Neural Networks (RNNs), and their combinations—have been implemented in several fields. Nevertheless, when it comes to evaluating mental stress using EEG, the two most common models are Convolutional Neural Networks (CNNs) and Long Short-Term Memory networks (LSTMs).

CNNs excel in extracting spatial features, which is crucial in analyzing EEG data where spatial patterns play a significant role. On the other hand, LSTMs can capture temporal information, allowing for the modeling of sequential patterns in EEG signals over time. The decision to focus on CNNs and LSTMs in this review is based on their proven effectiveness in handling the unique characteristics of EEG data related to mental stress. This strategic focus aligns with the broader trend observed among researchers, where CNNs and LSTMs have consistently proven their efficiency in mental stress assessment. By leveraging CNNs for spatial information and LSTMs for temporal dynamics, this approach aims to capture a comprehensive representation of the complex EEG patterns associated with stress states, ultimately contributing to a more enhanced understanding of mental stress assessment.

4.1 Convolutional neural networks (CNNs)

A CNN is a type of deep neural network capable of recognizing and classifying certain features in a signal or an image. Its major benefit over its predecessors is that it introduces convolution operations into neural networks, which help to learn equivariant features. CNNs are good at automatically recognizing important features without the need for human intervention, making them the most widely utilized architecture. The simplest CNN architecture is composed of three main layer types: convolutional layers, pooling layers, and fully connected (FC) layers [94], as shown in Fig. 6. These layers are stacked and organized to perform two consecutive basic functions, namely feature extraction and classification.

Fig. 6. A typical architecture of a convolutional neural network [95]

In the feature extraction operation, two main layer types are responsible for the process: convolutional layers and pooling layers. In the classification phase, a Fully Connected (FC) layer with an activation function is employed on the features extracted in the previous phase to perform class prediction.

Convolutional layers: A convolutional layer is made up of several convolutional filters, also known as kernels, which are convolved with the input image [96]. Convolution of an input with a kernel can be thought of as shifting the filter from one corner (e.g., top left) to the other corner (e.g., bottom right) in steps/strides. During each shift, the dot product of the kernel coefficients with the overlapping input is computed and placed at the output. One can choose different stride values, which influence the output size and reduce the overlapping of receptive fields. In addition, the input can be padded with zeros before convolution. This keeps the output size consistent with the input [97]. An example of convolution where a 5 × 5 image is convolved using a 3 × 3 filter can be found in [98].
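The toy example below (not the example from [98]) makes this sliding dot product explicit for a 5 × 5 input and a 3 × 3 kernel; it implements the cross-correlation used in CNN layers, with the stride as a parameter.

```python
# Worked illustration of a "valid" 2-D convolution (as computed in CNNs).
import numpy as np

def conv2d(x, k, stride=1):
    kh, kw = k.shape
    oh = (x.shape[0] - kh) // stride + 1
    ow = (x.shape[1] - kw) // stride + 1
    out = np.zeros((oh, ow))
    for i in range(oh):
        for j in range(ow):
            patch = x[i * stride:i * stride + kh, j * stride:j * stride + kw]
            out[i, j] = np.sum(patch * k)        # dot product of kernel and patch
    return out

x = np.arange(25, dtype=float).reshape(5, 5)     # 5x5 "image"
k = np.ones((3, 3)) / 9.0                        # simple averaging kernel
print(conv2d(x, k).shape)                        # (3, 3); with stride=2 it would be (2, 2)
```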

Pooling layers: A pooling layer performs sub-sampling on its input to produce smaller feature maps while retaining the dominant features. There is a wide variety of pooling methods [93]; however, the most prevalent are max pooling and global average pooling.

Fully connected layers: The output feature map is flattened and used as an input to the Fully Connected layer. Each neuron in this layer is connected to every neuron in the previous layer, thus the name. They are usually found at the end of a CNN [99].

Activation function: The activation function adds non-linearities to CNNs, which increase the expressive power of the model. There are several commonly used activation functions. The Rectified Linear Unit (ReLU), hyperbolic tangent (tanh), and sigmoid functions are usually applied in the hidden layers (after the convolutional layers), while the SoftMax function is mostly used on the output after the FC layer [99, 100] in classification problems. The SoftMax function also ensures that the outputs can be regarded as probability values, by making them non-negative and summing to one; these probabilities can be regarded as a confidence measure for an input belonging to a certain class. Detailed descriptions of these activation functions can be found in [93, 99, 100].
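Putting these pieces together, the following is a minimal, hypothetical PyTorch sketch of such a CNN: stacked convolution-pooling blocks for feature extraction and a fully connected layer with SoftMax for classification. The channel counts, kernel sizes, and input shape are illustrative assumptions and are not taken from any reviewed study.

```python
# Minimal sketch of a CNN for EEG classification (assumed shapes and sizes).
import torch
import torch.nn as nn

class SimpleEEGCNN(nn.Module):
    def __init__(self, n_channels=8, n_samples=512, n_classes=2):
        super().__init__()
        self.features = nn.Sequential(                     # feature extraction stage
            nn.Conv1d(n_channels, 16, kernel_size=7, padding=3), nn.ReLU(),
            nn.MaxPool1d(2),
            nn.Conv1d(16, 32, kernel_size=5, padding=2), nn.ReLU(),
            nn.MaxPool1d(2),
            nn.Conv1d(32, 64, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool1d(2),
        )
        self.classifier = nn.Linear(64 * (n_samples // 8), n_classes)   # classification stage

    def forward(self, x):                       # x: (batch, channels, samples)
        z = self.features(x).flatten(1)         # flatten the feature maps
        return torch.softmax(self.classifier(z), dim=1)

probs = SimpleEEGCNN()(torch.randn(4, 8, 512))  # -> (4, 2) class probabilities
```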

4.2 Long short-term memory networks

Long Short-Term Memory networks (LSTMs) are a special kind of Recurrent Neural Network, which in turn is an extension of neural networks with recurrent connections added in the hidden layers. The recurrent connections produce temporal memory, as shown in Fig. 7. It is worth noting that RNNs face limitations during the backpropagation of errors used to update the network weights, namely the problems of vanishing or exploding gradients. These limit RNNs to learning only short temporal dependencies [101].

Fig. 7. A recurrent neural network (left) and its unfolded architecture (right). U, V, and W are the weights of the hidden layer, the output layer, and the hidden state, respectively; xt and Ot are the input vector and the output at time t, respectively [102]

LSTMs introduce gates to solve these issues. LSTMs have memory cell state blocks through which signals flow and are guided by input, forget, and output gates. These gates regulate what is to be saved, read, and written on the cell [103]:

a. Forget gate

The forget gate is the first step, which involves determining what information should be wiped out from the cell state. Mathematically it can be expressed as follows:

$$f_{t} = \sigma \left( W_{f} \cdot [h_{t-1}, x_{t}] + b_{f} \right)$$

where ft is the output of the forget gate at time instant t, ht−1 and xt are the previous hidden state and the input vector, respectively, b is the bias of each gate, σ is the sigmoid function, and the W's are the learnable weight parameters.

b. Input and update gates

This layer decides what values are going to be stored or added to the cell state. Mathematically this can be expressed as follows:

$$i_{t} = \sigma \left( W_{i} \cdot [h_{t-1}, x_{t}] + b_{i} \right)$$
$$\widetilde{C}_{t} = \tanh \left( W_{C} \cdot [h_{t-1}, x_{t}] + b_{C} \right)$$

where it represents the output of the input gate and \(\widetilde{C}_{t}\) is the candidate cell state, computed from the current input and the previous hidden state.

To update the cell state \(C_{t}\), the outputs of the forget and input gates are combined with the candidate state:

$$C_{t} = f_{t} * C_{t-1} + i_{t} * \widetilde{C}_{t}$$

where \(C_{t}\) is the internal memory of the unit, obtained by multiplying the previous memory \(C_{t-1}\) by the forget gate ft and adding the product of the candidate state \(\widetilde{C}_{t}\) and the input gate it.

c. Output gate

A sigmoid layer first decides which parts of the cell state are going to be output; the cell state is then passed through a tanh and scaled by this gate. This can be expressed as follows:

$$o_{t} = \sigma \left( W_{o} \cdot [h_{t-1}, x_{t}] + b_{o} \right)$$
$$h_{t} = o_{t} * \tanh \left( C_{t} \right)$$

A bidirectional LSTM, denoted BiLSTM, propagates the signal in both directions, forward as well as backward [104]. A typical internal structure of the LSTM model can be found in [105], while each gate has been extensively explained in many studies [103, 104, 106,107,108].
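To make the gate equations above concrete, the following NumPy sketch implements a single LSTM cell step; the dimensions and random weights are purely illustrative assumptions.

```python
# Minimal sketch of one LSTM cell step, implementing the gate equations above.
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, c_prev, W, b):
    """One time step. Each W maps the concatenated [h_{t-1}, x_t] to one gate."""
    z = np.concatenate([h_prev, x_t])
    f_t = sigmoid(W["f"] @ z + b["f"])          # forget gate
    i_t = sigmoid(W["i"] @ z + b["i"])          # input gate
    c_tilde = np.tanh(W["c"] @ z + b["c"])      # candidate cell state
    c_t = f_t * c_prev + i_t * c_tilde          # cell state update
    o_t = sigmoid(W["o"] @ z + b["o"])          # output gate
    h_t = o_t * np.tanh(c_t)                    # new hidden state
    return h_t, c_t

d_in, d_hid = 4, 8                              # assumed input and hidden sizes
rng = np.random.default_rng(0)
W = {k: rng.standard_normal((d_hid, d_hid + d_in)) for k in "fico"}
b = {k: np.zeros(d_hid) for k in "fico"}
h, c = lstm_step(rng.standard_normal(d_in), np.zeros(d_hid), np.zeros(d_hid), W, b)
```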

5 Architecture design choices

This portion of the review focuses on identifying patterns in the development of various deep learning architectures. In general, the factors to keep in mind during design are the architecture type and the hyperparameters, such as the number of hidden layers, the kind of activation functions, and the type of end classifier. In this review, we focus on the type of architecture. The most predominant architecture types observed in the reviewed papers were CNNs, RNNs (LSTMs), and some hybrid architectures. The architecture design choices of each paper reviewed in this section and their results are summarized in Table 2.

Table 2 Summary of all studies reviewed in this paper

5.1 Convolutional neural networks

CNNs are the most popular architectural framework applied to detect mental stress [109]. They involve alternating convolutional and pooling layers. The type and number of layers, the structure of the layers, and the type of final classifier are the most important design features of CNNs. Several papers have proposed classifying mental stress from raw EEG data and have achieved competitive outcomes, as reviewed below.

5.1.1 Raw EEG with CNN

Jebelli et al. [67] proposed the use of a deep CNN to predict the stress level of 10 construction workers exposed to several stressors at actual construction sites. The deep CNN architecture used had four blocks of convolution and pooling layers, while a final fully connected dense layer with two softmax units was used for classification. Figure 8 shows the architecture developed. The CNN detected two levels of stress (low and high) from raw EEG signals with an accuracy of 64.20%. The results were compared with a state-of-the-art fully connected deep neural network with two hidden layers of 83 and 23 neurons, one input layer, and one output layer, which achieved an accuracy of 86.62%. The paper claimed that increasing the number of hidden layers of the architecture would not necessarily improve the accuracy of stress detection.

Fig. 8. Deep convolutional neural network architecture to recognize construction workers' stress levels based on their EEG signals [67]

In another study on detecting mental state, Zeng et al. [69] constructed two novel classifiers, EEG-Conv and EEG-Conv-R, to differentiate between two mental states, vigilance and boredom. EEG-Conv is based on a traditional deep CNN composed of eight layers: an input layer, three convolutional layers, a pooling layer, a local response normalization layer, a fully connected layer of 2048 neurons (to which a dropout strategy was applied to prevent overfitting), and an output layer. Each neuron in the CNN had a ReLU activation function, which in this paper was stated to be more efficient than the sigmoid and tanh functions. On the output layer, logistic regression was used as a linear, probabilistic classifier. The architecture of EEG-Conv is illustrated in Fig. 9.

Fig. 9. Proposed traditional EEG-Conv classifier architecture to differentiate between two mental states (vigilance and boredom), using logistic regression for classification [69]

To enhance classification accuracy, this study integrated Convolutional Neural Networks (CNNs) with state-of-the-art deep residual learning techniques, resulting in the creation of a novel classifier known as EEG-Conv-R. The EEG-Conv-R architecture is established by the incorporation of two residual blocks into the existing EEG-Conv structure, as visually represented in Fig. 10. In this configuration, each layer within the network employing residual blocks propagates its output not only to the immediate subsequent layer but also directly to layers positioned two to three steps further ahead. These introduced "skip" or "shortcut" connections serve as a strategic solution to combat the vanishing gradient issue, thus enabling the effective training of significantly deeper networks.

Fig. 10. Proposed EEG-Conv-R classifier with residual learning [69]

Raw EEG data from people subjected to a driving simulation were fed directly to the EEG-Conv and EEG-Conv-R architectures, achieving accuracies of 82.95% and 84.38%, respectively. These results outperformed traditional LSTM- and SVM-based classifiers.

Penchina et al. [70] investigated the use of EEGNet (a CNN) to discriminate between anxious and non-anxious states in neurotypical people and people on the autism spectrum. The subjects were given arithmetic tasks to induce stress, while relaxation was achieved through guided and unguided breathing periods. The EEGNet architecture developed consisted of eight 2D convolution filters, one depth-wise convolution layer, one separable convolution layer, and a final dense layer of four neurons with a SoftMax activation function for classification, as shown in Fig. 11. EEGNet was able to detect stress from raw EEG signals with an accuracy of 60.21%. As stated by the paper, the limited accuracy may be due to several reasons: the tasks were simplified so as not to overstimulate people with autism, and the guided/unguided breathing periods were longer than the stress periods, causing an unbalanced dataset.

Fig. 11. Proposed EEGNet CNN classifier architecture to discriminate between anxious and non-anxious states in neurotypical people [70]

Sundaresan et al. [72] conducted further studies to compare the efficiency of the previously proposed EEGNet with deep ConvNet and shallow ConvNet architectures. The deep ConvNet consisted of four convolution-max-pooling blocks: the first block contained 25 2D temporal convolutional filters, 25 2D spatial convolutional filters, and a max-pooling layer, while the subsequent three blocks each contained a 2D convolutional layer and a max-pooling layer, with 50, 100, and 200 filters per block, respectively. All neurons except those in the final layer used the Exponential Linear Unit (ELU) activation function, while a SoftMax activation function was used on the final dense layer for classification.

On the other hand, the shallow ConvNet is a modified version of the deep ConvNet, consisting of a single convolution-pooling block, a squaring non-linearity, an average pooling layer, and a logarithmic activation function. The model architectures used are shown in Fig. 12. The EEGNet, deep ConvNet, and shallow ConvNet architectures were able to detect stress with accuracies of 61.18%, 58.80%, and 62.84%, respectively.

Fig. 12. Proposed CNN classifier of 4 convolution layers vs. CNN classifier of a single convolution layer [72]

In a more recent study [110], Fu et al. proposed a novel deep learning model that differentiates between four mental states (relaxed, medium stress, high stress, and stress recovery). The model, named Symmetric Deep Convolutional Adversarial Network (SDCAN), merges CNNs with adversarial theory, which helps to automatically extract invariant and discriminative features from raw EEG to enhance classification and subject generalization. The model is composed of two symmetrical CNNs that serve as the generator and discriminator, as shown in Fig. 13; the discriminator consisted of four convolution-max-pooling blocks attached to five deconvolution layers. The subjects were placed under the Trier Social Stress Test (TSST) stressor while raw EEG was collected. In this comparison, the SDCAN model achieved an accuracy of 87.62%, outperforming a traditional CNN. In addition, Abhishek and Nallavan proposed mental stress assessment in sports [111].

Fig. 13. Proposed SDCAN model that merges CNN and adversarial theory [110]

5.1.2 Spectral images or topological maps with CNN

Apart from the aforementioned CNN architectures that used raw EEG signals as input, there are a few papers that proposed the use of spectral images or topological maps and achieved competitive outcomes, as reviewed below.

EEG data can be represented as 2D or 3D images in the topological map input formulation, depending on the spatial topology of the electrodes, i.e., their positions on the scalp [112]. Martínez-Rodrigo et al. [73] proposed a CNN model that used EEG signals represented as topological maps of the scalp, which were fed as images to the CNN. The power of every EEG channel was obtained individually for each frequency sub-band (alpha, beta, gamma, and theta) and for the whole band (4 Hz to 45 Hz). The resulting powers were then normalized.

The obtained powers Pt, Pθ, Pα, Pβ, and Pγ were initially transformed into 2D images using three different mapping approaches, namely Direct Matrix Distribution (DMD), Direct Matrix Distribution interpolated (DMDi), and Azimuthal Equidistant Projection (AEP). All three mapping approaches used a jet colormap with 256 colors, ranging from dark red (maximum value) to dark blue (minimum value), to represent the spectral power values. The 2D maps obtained from the power parameters were then stacked to form 3D images for each of the three mapping approaches separately. The authors proposed an AlexNet-based CNN model to differentiate two mental states, distress and calm. The 2D AlexNet-based CNN model proposed in this study consisted of five convolution layers, three max-pooling layers, and three FC layers (containing two drop-out layers to prevent overfitting and improve generalization error) with 4096 neurons in each layer; a ReLU activation function was used after every convolutional and FC layer. The 2D AlexNet-based CNN model is shown in Fig. 14. To handle 3D images as input data, the same CNN architecture was used, but the size of the convolution and max-pooling layers was extended. The authors found the combination of DMD with the 3D CNN to be slightly better than the other combinations, with an accuracy of 86.12%.

Fig. 14. The proposed 2D AlexNet CNN model architecture with five convolution layers to differentiate between distress and calm [73]

Kamińska et al. [77] applied the Morlet wavelet transform to the values averaged across all electrodes to generate time–frequency representation images, considering the frequency range from 1 to 28 Hz. These time–frequency images were used as input to a CNN-based deep learning model, achieving an accuracy of 87.5% in detecting stressed and relaxed states. The CNN architecture, shown in Fig. 15, had three convolution layers with 250, 250, and 100 neurons, respectively, and a final dense FC layer with 100 neurons. It was stated that the stressful state could only be recorded after the first relaxation period, which may have been because participants did not yet feel comfortable with the situation or were stressed by the new experience.

Fig. 15. Structure of the CNN classifier using a heat-map image to detect stress [77]

A more recent study by Mane et al. [113] investigated the use of a 2D azimuthal projection to create 2D images as input to a CNN model. Raw EEG signals were first segmented into frames and a Hanning window was applied; the Fast Fourier Transform (FFT) was then used to obtain the frequency-domain representation. Next, frequency binning was performed to group the signal into three frequency ranges. Finally, RGB channel values were used to represent the alpha, beta, and theta values and form images. The authors proposed a CNN model consisting of 10 2D convolution layers and a max-pooling layer feeding a final flattened dense layer for classification between stressed and normal states. This study achieved an accuracy of 93.0%.
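As a simplified, hypothetical illustration of this kind of image formulation (without the azimuthal equidistant projection or inter-electrode interpolation used in the actual studies), the sketch below writes normalized per-electrode theta, alpha, and beta powers into the R, G, and B channels of a small image at assumed grid positions.

```python
# Simplified sketch: map per-electrode band powers to an RGB image for a CNN.
import numpy as np

def band_image(theta, alpha, beta, positions, size=32):
    """theta/alpha/beta: (n_electrodes,) band powers; positions: (n_electrodes, 2) in [0, 1]."""
    img = np.zeros((size, size, 3))
    stacked = np.stack([theta, alpha, beta], axis=-1)                    # (n_electrodes, 3)
    stacked = (stacked - stacked.min(axis=0)) / (np.ptp(stacked, axis=0) + 1e-12)
    for (x, y), rgb in zip(positions, stacked):
        r, c = int(y * (size - 1)), int(x * (size - 1))
        img[r, c] = rgb                                                  # one colored pixel per electrode
    return img

n_el = 14                                        # assumed electrode count and positions
rng = np.random.default_rng(1)
img = band_image(rng.random(n_el), rng.random(n_el), rng.random(n_el),
                 rng.random((n_el, 2)))          # -> (32, 32, 3) image
```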

5.2 Long short-term memory networks

As with the CNN architecture, designing an LSTM architecture mainly focuses on the type and number of layers as well as the type of final classifier. LSTM models are considered the second most prevalent deep neural network models, after CNNs, in classifying mental stress with EEG signals.

Penchina et al. [70] proposed the use of an LSTM network with two layers to detect anxiety in neurotypical people and people with autism. According to this study, several papers have suggested using LSTMs with two layers, stating that this improves classification accuracy compared to using only one layer or more than two. They achieved an accuracy of 93.27%. The proposed LSTM design, shown in Fig. 16, consisted of two LSTM layers with 50 neurons in the first layer and 40 neurons in the second, two dropout layers with a rate of 0.5, two hidden dense layers (the first with 20 neurons and a sigmoid activation function, the second with 10 neurons and ReLU), followed by one output dense layer with a SoftMax function for classification. The number of neurons in both LSTM layers was chosen according to the literature [114].

Fig. 16. The proposed two-layered LSTM model to detect anxiety in neurotypical people and people with autism [70]
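A hypothetical PyTorch sketch mirroring this two-layer LSTM design is given below; the layer sizes and dropout rate follow the description above, while the input shape, number of classes, and use of the last time step as the sequence summary are assumptions.

```python
# Sketch of a two-layer LSTM classifier of the kind described above.
import torch
import torch.nn as nn

class TwoLayerLSTM(nn.Module):
    def __init__(self, n_features=8, n_classes=2):
        super().__init__()
        self.lstm1 = nn.LSTM(n_features, 50, batch_first=True)   # first LSTM layer (50 units)
        self.lstm2 = nn.LSTM(50, 40, batch_first=True)           # second LSTM layer (40 units)
        self.drop = nn.Dropout(0.5)
        self.head = nn.Sequential(
            nn.Linear(40, 20), nn.Sigmoid(),                     # dense layer with sigmoid
            nn.Linear(20, 10), nn.ReLU(),                        # dense layer with ReLU
            nn.Linear(10, n_classes),
        )

    def forward(self, x):                    # x: (batch, time, features)
        x, _ = self.lstm1(x)
        x, _ = self.lstm2(self.drop(x))
        z = self.drop(x[:, -1, :])           # last time step as the sequence summary
        return torch.softmax(self.head(z), dim=1)

probs = TwoLayerLSTM()(torch.randn(4, 256, 8))   # -> (4, 2) class probabilities
```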

A more recent study [72] was performed after [70] to further compare the results of the aforementioned two-layered LSTM model with additional CNN architectures such as EEGNet (CNN), deep ConvNet, and shallow ConvNet, which achieved accuracies of 61.18%, 58.80%, and 62.84%, respectively. In this paper, the LSTM model outperformed the CNNs in evaluating mental states from raw EEG signals. The authors also asserted that the performance of the LSTM network remained consistent regardless of the presence of mental disorders, producing similar results; it therefore proves effective in detecting stress in both neurotypical individuals and those with autism. In a more recent study [115], Phutela et al. analyzed two levels of mental stress using a two-layer LSTM model, achieving an accuracy of approximately 99.71%. Stress was induced by watching emotional video clips, while raw EEG signals were collected from the subjects and fed to the LSTM model. The authors implemented an LSTM model containing an initial 8-neuron LSTM layer (LSTM 1) followed by a second 16-neuron LSTM layer (LSTM 2). A dropout layer was then added to prevent overfitting and noise learning. Finally, for classification purposes, a fully connected dense layer with a sigmoid activation function was used.

5.3 Hybrid architectures

Hybrid deep learning models combine two or more deep learning models into a single architecture [112]. Researchers have attempted to integrate several deep learning networks, including the standalone deep learning models discussed above (CNNs and LSTMs), obtaining promising findings for identifying mental stress.

For instance, Kuanar et al. [71] implemented the concept of hybrid architecture where a CNN (ConvNet) model and an LSTM model were integrated to preserve the spectral, spatial, and temporal structure of EEG data. The authors also focused on extracting spatial-frequency images, or spectral images, from EEG signals to be fed to the hybrid system. Fast Fourier Transform was applied to estimate the EEG’s power spectrum, and only three sub-bands, theta (4–8 Hz), alpha (8–13 Hz), and beta (13–30 Hz) were chosen, as several literature studies recommended. Later, the sum of the squared absolute power values for each sub-band associated with each electrode was calculated and transformed into 2D images with corresponding color channels to represent the spectral dimensions. The resulting topographical map is fed as an input to the ConvNet (CNN) model that extracts feature vectors which are fed to the LSTM model, as shown in Fig. 17.

The proposed hybrid model in Fig. 17 consisted of a CNN model with nine convolutional layers, three max-pooling layers, and one fully connected layer, all using the ReLU activation function. For the LSTM part, three models were suggested: a two-layered LSTM network, a two-layered LSTM network with a 1D convolutional layer, and a bidirectional LSTM network. Note that bidirectional LSTMs can process EEG data in both directions using two separate hidden layers. For the final layer of each model, a fully connected layer with a SoftMax function was used to perform classification. When their performances were compared, ConvNet + LSTM, ConvNet + LSTM + 1D Conv, and ConvNet + Bi-LSTM achieved accuracies of 84.48%, 87.68%, and 92.5%, respectively, in detecting four mental states. The proposed ConvNet + Bi-LSTM model is shown in Fig. 18.

Fig. 17. The proposed model utilizes 2D topographical images as input to the hybrid ConvNet BiLSTM model [71]

Fig. 18. Hybrid LSTM model used in the study: ConvNet + Bi-LSTM [71]

Chakladar et al. [75] analyzed three levels of mental stress using a hybrid of bidirectional LSTM and LSTM networks. The authors extracted a hybrid of frequency, statistical, and non-linear features, and the Grey Wolf Optimization technique was used to choose the best ones. These features were fed to the hybrid deep learning model shown in Fig. 19, which was proposed to have a single bidirectional LSTM layer followed by two LSTM layers. Each layer was followed by a dropout layer with a rate of 0.2 and a batch normalization layer, to prevent overfitting and normalize the output. Finally, for classification purposes, two consecutive dense layers were used. The classification was performed on the STEW dataset, yielding an accuracy of 82.57%.

Fig. 19. The proposed bidirectional LSTM-LSTM classifier model for detecting three levels of mental stress [75]
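For illustration, the following is a hypothetical PyTorch sketch of a BiLSTM + LSTM stack of this kind, operating on sequences of pre-extracted feature vectors; the feature dimension, hidden sizes, and sequence length are assumptions, while the 0.2 dropout rate and batch normalization follow the description above.

```python
# Sketch of a BiLSTM followed by two LSTM layers for three-level stress classification.
import torch
import torch.nn as nn

class BiLSTMLSTM(nn.Module):
    def __init__(self, n_features=20, hidden=64, n_classes=3):
        super().__init__()
        self.bilstm = nn.LSTM(n_features, hidden, batch_first=True, bidirectional=True)
        self.lstm1 = nn.LSTM(2 * hidden, hidden, batch_first=True)
        self.lstm2 = nn.LSTM(hidden, hidden, batch_first=True)
        self.drop = nn.Dropout(0.2)
        self.bn = nn.BatchNorm1d(hidden)
        self.head = nn.Sequential(nn.Linear(hidden, 32), nn.ReLU(), nn.Linear(32, n_classes))

    def forward(self, x):                    # x: (batch, time, features)
        x, _ = self.bilstm(x)
        x, _ = self.lstm1(self.drop(x))
        x, _ = self.lstm2(self.drop(x))
        z = self.bn(x[:, -1, :])             # normalize the last-step representation
        return torch.softmax(self.head(z), dim=1)

probs = BiLSTMLSTM()(torch.randn(4, 30, 20))   # -> (4, 3) for three stress levels
```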

In another study [72], Sundaresan et al. proposed a hybrid of an LSTM model and a fully convolutional network to detect anxiety in neurotypical people and people with autism. The raw EEG signal was fed simultaneously to two blocks—an LSTM block and a fully convolutional network—as shown in Fig. 20. The LSTM block was composed of a single LSTM layer followed by a dropout layer with a rate of 0.8, while the fully convolutional network block had three 1D convolutional layers of different sizes followed by a pooling layer. Finally, the output of each block was concatenated and fed into a 4-neuron dense output layer with SoftMax activation for classification. The proposed model achieved a relatively low accuracy of 62.97%, which can be explained by the simplified stressor task intended not to overstimulate the subjects with autism. It is noteworthy, however, that the results were not affected by the presence of mental disorders, producing similar outcomes. Moreover, the dataset collected was unbalanced because of longer breathing periods compared to the stressor periods. Finally, LSTM-based deep learning models are usually over-reliant on large datasets.

Fig. 20. Diagram of the proposed LSTM-fully convolutional network architecture, where the outputs of the convolution block and the LSTM block are concatenated for classification [72]

In a more recent study [116], Malviya et al. investigated the use of a hybrid CNN-bidirectional LSTM for detecting mental stress. The subjects performed mental arithmetic tasks to induce stress. The Discrete Wavelet Transform (DWT) was used to filter the raw EEG signals and divide them into five frequency bands. The proposed hybrid architecture, shown in Fig. 21, consisted of a CNN model with two convolution layers, two max-pooling layers, and one FC layer, while the BiLSTM model was composed of three LSTM layers with two cells in each layer and 64 neurons. The output was fed into a dropout layer and then a dense layer with a SoftMax unit for classification. The hybrid system was able to discriminate stress from the relaxation state with an accuracy of 99.2%.

Fig. 21. Diagram of the proposed hybrid CNN-BLSTM used to classify stress from DWT-filtered EEG [116]
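A hypothetical sketch of this kind of hybrid CNN-BiLSTM is given below: a small convolutional front end extracts local features and a three-layer bidirectional LSTM models their temporal evolution before a SoftMax output. Filter counts, kernel sizes, and the input shape are assumptions and do not reproduce the exact model of [116].

```python
# Sketch of a hybrid CNN-BiLSTM stress classifier (assumed sizes).
import torch
import torch.nn as nn

class CNNBiLSTM(nn.Module):
    def __init__(self, n_channels=8, n_classes=2):
        super().__init__()
        self.cnn = nn.Sequential(                       # convolutional front end
            nn.Conv1d(n_channels, 32, kernel_size=5, padding=2), nn.ReLU(), nn.MaxPool1d(2),
            nn.Conv1d(32, 64, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool1d(2),
        )
        self.bilstm = nn.LSTM(64, 64, num_layers=3, batch_first=True,
                              bidirectional=True, dropout=0.2)
        self.drop = nn.Dropout(0.3)
        self.fc = nn.Linear(2 * 64, n_classes)          # 2x for the two directions

    def forward(self, x):                               # x: (batch, channels, samples)
        z = self.cnn(x).transpose(1, 2)                 # -> (batch, time, features)
        z, _ = self.bilstm(z)
        z = self.drop(z[:, -1, :])                      # last time step
        return torch.softmax(self.fc(z), dim=1)

probs = CNNBiLSTM()(torch.randn(4, 8, 512))             # -> (4, 2)
```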

Meanwhile, Xia et al. [117] investigated the use of a multi-branch LSTM merged with hierarchical temporal attention (MuLHiTA). Raw EEG signals collected from subjects performing mental arithmetic tasks were fed to the proposed model, which consisted of two complementary branches: an Intra-LTAM layer (an Intra-BLSTM and a temporal attention mechanism) and an Inter-LTAM layer (an Inter-BLSTM and a temporal attention mechanism), where each layer contains an attention module and a fully connected layer. Note that 'intra' and 'inter' in the LTAM networks refer to intraslice and interslice feature extraction, respectively. A concatenation layer is then used to aggregate the outputs of the two branches, and finally an FC layer with SoftMax is used for classification. The classification between two mental states (rest and stress) was performed on the DMAT dataset, yielding an accuracy of 99.71%.

A summary of all the studies that have been reviewed in this paper can be found in Table 2.

6 Discussion

In this section, we engage in a comprehensive analysis of the statistical findings derived from the data extracted from the reviewed papers. We aim to provide a clear and insightful interpretation of these statistics, shedding light on key trends and patterns observed in the field of mental stress classification using EEG signals.

Furthermore, we extend our focus beyond the immediate statistical insights. We offer valuable recommendations for designing robust deep-learning architectures and optimizing essential parameters.

However, our commitment to advancing the field does not cease in the present. Recognizing that research is continually evolving and expanding, we present forward-looking recommendations. These suggestions outline the potential future directions that could take the study of stress to new levels of understanding. Our aim is to promote a more advanced, comprehensive, and multidimensional approach to stress research, deepening our understanding of this area of study.

6.1 Deep learning architecture

6.1.1 Input formulation

Choosing the type of EEG input formulation depended to a great extent on the type of deep learning architecture utilized to detect mental stress as shown in Fig. 22.

Fig. 22. The number of papers that used different input formulations

For CNN-based deep learning architectures, two common types of input formulation were used by the studies reviewed in this paper to classify mental stress: raw EEG signals and spectral/topological maps. The two were used equally often in the reviewed papers; however, the average accuracy achieved by CNNs with spectral/topological maps across all reviewed papers (84.73%) outperformed that achieved by CNNs using raw EEG signals (67.79%), as shown in Fig. 23.

Fig. 23. Accuracy of using raw EEG signals or spectral/topological maps as input to a CNN model

This result can be linked to CNNs' ability to extract higher representations or features from image content. Moreover, using raw EEG signals as input to a CNN model did not yield consistent accuracy across the reviewed papers; there was large variability in the results, ranging from 60 to 84%, raising the question of whether factors other than the raw EEG input influenced the classification accuracy.

On the other hand, LSTM-based deep learning architectures used only raw EEG signals as the input formulation, achieving accuracies ranging from 93 to 95%. This suggests that the accuracy of LSTM models in classifying mental stress from raw EEG signals is stable at around 94%, although further investigation is required.

Meanwhile, hybrid-based deep learning architectures investigated the use of three different types of input formulation. Spectral/topological maps were used in 60% of the hybrid-based studies, followed by raw EEG signals and hybrid input signals in 30% (each). The use of different input formulations reflects the ability of these hybrid architectures to process a variety of data inputs depending on the deep learning architectures combined. For instance, in [71], the CNN and LSTM models were integrated, allowing spectral topography to be used as input, whereas Chakladar et al. [75] utilized a hybrid of statistical and spectral features as input to the merged bidirectional LSTM and LSTM network. Figure 24 shows the average accuracy for the different types of input formulation with hybrid deep learning architectures.

Fig. 24. Accuracy of different types of input fed into hybrid deep learning architectures

As shown above, the type of input formulation used with a given deep learning model plays an important role in the classification result: the same deep learning technique, for example CNN, produced very different accuracies across studies that used different types of input formulation.

6.1.2 Deep learning architecture and activation function

Based on the reviewed studies, the most prevalent architecture design framework adopted was the CNN, found in 67% of the papers, as shown in Fig. 25. Several factors explain the widespread use of CNNs. First, a distinctive property of CNNs is that they do not require prior feature selection, meaning that they can extract deep, distinct features and spatial patterns directly from raw EEG signals; they can thus perform feature extraction as well as classification. Furthermore, CNN architectures can process and classify various forms of EEG input, including raw signals as well as 2D images such as spectral and topological maps.

Fig. 25. Most used deep learning models in classifying mental stress

Meanwhile, none of the reviewed papers specifically compared the use of different numbers of convolution layers in CNN models. However, a common trend can be seen in the literature: 56% of the studies used three convolution layers, followed by four layers (22%), while the remaining studies used five or ten convolution layers in equal proportion (11% each), as shown in Fig. 26. Given the large number of studies that used three convolution layers, it is recommended to employ three convolution layers in the first design of a CNN model; the potential for performance improvement can then be investigated with four layers, followed by the layer counts used less often.

Fig. 26. Number of convolution layers utilized in CNN models

In terms of activation functions, ReLU was the most used in convolutional layers (64.7%), followed by the Exponential Linear Unit (ELU) (23.5%). Meanwhile, one study explicitly investigated the effect of using a squaring non-linearity and a logarithm as activation functions [72]; no other studies have investigated their use yet, and the accuracy achieved in [72] was relatively low, at 62.84%.

For the classification layer, half of the reviewed studies have proposed one fully connected layer in the CNN models, while the other half proposed three. Therefore, it is recommended to test both while designing a CNN model for detecting mental stress, to determine which one works best. For classification purposes, all the studies utilized the SoftMax function except [69], where Zeng et al. proposed the use of logistic regression as a probabilistic classifier, achieving a classification accuracy of 84.38%.

The LSTM-based deep learning model was proposed in only 12% of the papers, which is far below expectations given that LSTMs have proven effective and have outperformed other deep learning models in the research reviewed. For instance, [70] compared the performance of LSTM and CNN models of different architectures and found that LSTMs significantly outperformed CNNs in classifying stress. Furthermore, LSTM-based deep learning models give outstanding performance when raw EEG signals are used as the input formulation, providing end-to-end learning and reducing processing time by eliminating the need for feature extraction.

Most of the reviewed studies that specified the number of LSTM layers proposed two, in line with the suggestions of several previous studies. Sundaresan et al. [72] investigated and compared the performance of three LSTM models with different numbers of LSTM layers. Accuracy improved markedly, by about 20%, when using two layers instead of one; however, using three LSTM layers instead of two reduced the accuracy by 18%, indicating that two LSTM layers yield the best performance.
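As a concrete illustration of the two-layer configuration, the following PyTorch sketch applies a stacked LSTM directly to raw EEG sequences; the channel count, hidden size, and number of classes are hypothetical.

```python
import torch
import torch.nn as nn

class StressLSTM(nn.Module):
    """Two-layer LSTM over raw EEG (sketch): input shape (batch, time, channels)."""
    def __init__(self, n_channels=19, hidden=64, n_classes=2):
        super().__init__()
        self.lstm = nn.LSTM(input_size=n_channels, hidden_size=hidden,
                            num_layers=2, batch_first=True)   # two stacked LSTM layers
        self.classifier = nn.Linear(hidden, n_classes)

    def forward(self, x):
        out, _ = self.lstm(x)                # out: (batch, time, hidden)
        return self.classifier(out[:, -1])   # classify from the last time step

model = StressLSTM()
logits = model(torch.randn(4, 1280, 19))     # e.g. 10 s of 19-channel EEG at 128 Hz
```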

Finally, hybrid architectures were proposed in 20% of the reviewed papers for detecting mental stress. The reviewed studies did not directly compare hybrid models with standalone deep learning models, but hybrid architectures achieved high performance in several studies, with the exception of [72], where an LSTM model integrated with a fully convolutional network block yielded an accuracy of 62.97%. This relatively low accuracy is thought to be due to the use of a single LSTM layer in the model.
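As an illustration of the general CNN-LSTM pattern (not a reproduction of any specific reviewed model), the sketch below encodes each short EEG segment with a small 1D CNN and aggregates the segment embeddings with an LSTM; all sizes are hypothetical.

```python
import torch
import torch.nn as nn

class HybridCNNLSTM(nn.Module):
    """Sketch of a hybrid model: per-segment CNN encoder + LSTM over segments."""
    def __init__(self, n_channels=19, n_classes=2, hidden=64):
        super().__init__()
        self.encoder = nn.Sequential(   # 1D CNN over each raw EEG segment
            nn.Conv1d(n_channels, 32, kernel_size=7, padding=3), nn.ReLU(), nn.MaxPool1d(4),
            nn.Conv1d(32, 64, kernel_size=7, padding=3), nn.ReLU(),
            nn.AdaptiveAvgPool1d(1),
        )
        self.lstm = nn.LSTM(64, hidden, batch_first=True)
        self.classifier = nn.Linear(hidden, n_classes)

    def forward(self, x):               # x: (batch, segments, channels, time)
        b, s, c, t = x.shape
        feats = self.encoder(x.reshape(b * s, c, t)).squeeze(-1)  # (b*s, 64)
        out, _ = self.lstm(feats.reshape(b, s, -1))
        return self.classifier(out[:, -1])

model = HybridCNNLSTM()
logits = model(torch.randn(4, 10, 19, 256))  # 10 segments of 2 s at 128 Hz (assumed)
```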

6.2 Limitations

This review deliberately confines its scope to Convolutional Neural Networks (CNNs), Long Short-Term Memory networks (LSTMs), and their hybrid architectures, omitting other deep learning models. The emphasis is primarily on the impact of various input formulations on the performance and accuracy of these specific models. Consequently, less common models within the deep learning landscape are not explicitly addressed. While this focused approach provides an in-depth analysis of the specified models and their input variations, it inherently limits the broader applicability of the findings to a specific subset of deep learning architectures. Furthermore, the review spans January 2017 to November 2022 and reflects the research landscape during that period. The limitations imposed by these exclusions and temporal constraints may affect the overall comprehensiveness of the review, and readers are advised to interpret the findings within these boundaries.

6.3 Future recommendations

This paper has helped clarify the relationship between different deep learning models and EEG input formulations for classifying mental stress. Moving forward, several areas can be considered for further research and exploration. A visual representation of the recommended paths is presented in the flowchart in Fig. 27.

Fig. 27. Flowchart of recommendations for future work

Literature Gaps and Recommendations.

1. The review findings indicate that CNNs tend to perform best with EEG presented as spectral or topographical images, while LSTM models are typically applied to raw EEG data. However, previous studies have predominantly used spectrograms or connectivity maps as the input images for CNNs. To advance the field, we propose exploring 3D spatiotemporal images as input. By converting EEG connectivity features into images, we anticipate a better capture of spatial and temporal dynamics, potentially leading to improved classification results.

2. Although hybrid designs have been recommended in only a few studies, these instances have demonstrated high performance. Surprisingly, there is a lack of comparative research on the performance of hybrid models versus standalone models. We recommend conducting comprehensive investigations to assess the effectiveness of hybrid architectures in EEG classification tasks as this is an underexplored area with potential benefits.

3. We also recommend exploring novel practices such as attention-based deep learning models and the application of graph neural networks such as Graph Convolutional Networks (GCNs); a minimal sketch of a graph convolution layer is given after this list. The GCN approach holds promise for advancing our understanding of brain topography and connectivity while improving classification accuracy, particularly in scenarios with irregular or non-Euclidean data such as EEG signals. Investigating attention-based deep learning models as a means of incorporating additional information into the classification process can also help, as these models can identify the elements of the data that matter most for classification. While initial investigations have begun in a few papers [118,119,120], there is room for further exploration of these approaches in various EEG classification scenarios.

4. Exploring innovative approaches such as LSTM-ALO and CNN-INFO holds significant promise. CNN-INFO couples CNNs with an optimization algorithm based on the weighted mean of vectors, offering a promising avenue for stress detection. By integrating the CNN's ability to process visual data with this weighted-vector optimization, CNN-INFO aims to capture comprehensive representations of stress-related patterns while remaining adaptable to varying input conditions. The approach facilitates enhanced feature aggregation, robustness to data variability, and improved interpretability, making it well suited for real-world stress detection applications. As future research progresses, evaluating CNN-INFO across different datasets and exploring its potential extensions could further solidify its role in advancing stress detection methodologies [121]. Similarly, the LSTM-ALO algorithm, a hybrid model combining the Long Short-Term Memory (LSTM) network with the Ant Lion Optimizer (ALO), has showcased its efficacy in optimizing learning rates and facilitating rapid convergence in various optimization tasks [122].
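To make recommendation 3 more tangible, the following is a minimal sketch of a single graph convolution layer operating on EEG electrodes as graph nodes. The adjacency matrix (e.g. derived from a functional connectivity estimate) and all dimensions are hypothetical, and the sketch follows the standard GCN propagation rule rather than any specific reviewed implementation.

```python
import torch
import torch.nn as nn

class GraphConv(nn.Module):
    """One GCN layer: H' = act( D^-1/2 (A + I) D^-1/2 H W ) -- a sketch."""
    def __init__(self, in_features, out_features):
        super().__init__()
        self.linear = nn.Linear(in_features, out_features)

    def forward(self, h, adj):
        # h:   (nodes, in_features) node features, e.g. band powers per electrode
        # adj: (nodes, nodes) adjacency, e.g. thresholded connectivity (assumed)
        a_hat = adj + torch.eye(adj.size(0))           # add self-loops
        deg_inv_sqrt = a_hat.sum(dim=1).pow(-0.5)
        norm = deg_inv_sqrt.unsqueeze(1) * a_hat * deg_inv_sqrt.unsqueeze(0)
        return torch.relu(self.linear(norm @ h))

# Hypothetical usage: 19 electrodes, 5 band-power features each.
h = torch.randn(19, 5)
adj = (torch.rand(19, 19) > 0.7).float()               # placeholder connectivity graph
adj = ((adj + adj.t()) > 0).float()                    # make it symmetric
out = GraphConv(5, 16)(h, adj)                         # (19, 16) node embeddings
```

Stacking such layers and pooling over nodes would yield a graph-level representation that a standard classification head could map to stress labels.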

7 Conclusion

In conclusion, this paper has conducted an extensive review of deep learning models applied to classifying mental stress from EEG signals. We have thoroughly examined how deep network designs vary with the type of input formulation and the deep learning technique used. It is evident that the choice of input formulation significantly impacts classification outcomes when paired with specific deep learning models; as a result, this paper has focused primarily on the relationship between EEG input formulation and deep learning models. Our findings highlight the effectiveness of CNNs with spectral and topographical images, while LSTM models have demonstrated their capability with raw EEG data.

In addition to the type of input formulation, architectural details such as the most common number of layers for each type of deep learning model and the choice of activation function were also explored. The resulting recommendations are based on a comprehensive analysis of the design choices made across the reviewed studies. With these observations, we aim to offer practical guidance for researchers undertaking similar studies in selecting the most suitable model and adjusting its parameters to the nature of their data, and thus achieving better results.

We have also highlighted the potential of hybrid architectures and emphasized the need for more in-depth research in this area, since it is not widely covered and its performance relative to standalone models has not been investigated. We further proposed the application of Graph Convolutional Networks (GCNs) to improve classification results, as they are well suited to the non-Euclidean structure of EEG data. As the field continues to evolve, it is crucial to explore novel approaches and combinations to enhance accuracy, interpretability, and our overall understanding of EEG classification. By exploring GCNs, researchers can unlock new dimensions in mental stress classification, leading to more effective diagnostic and therapeutic applications. We anticipate that this paper will serve as a solid foundation for future research in deep learning and EEG-based mental stress classification, contributing to advancements in mental health assessment and treatment and ultimately benefiting individuals dealing with stress-related conditions.