Feature engineering of EEG applied to mental disorders: a systematic mapping study

García-Ponsoda, Sandra; García-Carrasco, Jorge; Teruel, Miguel A.; Maté, Alejandro; Trujillo, Juan

doi:10.1007/s10489-023-04702-5

Feature engineering of EEG applied to mental disorders: a systematic mapping study

Open access
Published: 06 July 2023

Volume 53, pages 23203–23243, (2023)
Cite this article

Download PDF

You have full access to this open access article

Applied Intelligence Aims and scope Submit manuscript

Feature engineering of EEG applied to mental disorders: a systematic mapping study

Download PDF

3470 Accesses
3 Citations
4 Altmetric
Explore all metrics

Abstract

Around a third of the total population of Europe suffers from mental disorders. The use of electroencephalography (EEG) together with Machine Learning (ML) algorithms to diagnose mental disorders has recently been shown to be a prominent research area, as exposed by several reviews focused on the field. Nevertheless, previous to the application of ML algorithms, EEG data should be correctly preprocessed and prepared via Feature Engineering (FE). In fact, the choice of FE techniques can make the difference between an unusable ML model and a simple, effective model. In other words, it can be said that FE is crucial, especially when using complex, non-stationary data such as EEG. To this aim, in this paper we present a Systematic Mapping Study (SMS) focused on FE from EEG data used to identify mental disorders. Our SMS covers more than 900 papers, making it one of the most comprehensive to date, to the best of our knowledge. We gathered the mental disorder addressed, all the FE techniques used, and the Artificial Intelligence (AI) algorithm applied for classification from each paper. Our main contributions are: (i) we offer a starting point for new researchers on these topics, (ii) we extract the most used FE techniques to classify mental disorders, (iii) we show several graphical distributions of all used techniques, and (iv) we provide critical conclusions for detecting mental disorders. To provide a better overview of existing techniques, the FE process is divided into three parts: (i) signal transformation, (ii) feature extraction, and (iii) feature selection. Moreover, we classify and analyze the distribution of existing papers according to the mental disorder they treat, the FE processes used, and the ML techniques applied. As a result, we provide a valuable reference for the scientific community to identify which techniques have been proven and tested and where the gaps are located in the current state of the art.

Graphical Abstract

Deep learning techniques for classification of electroencephalogram (EEG) motor imagery (MI) signals: a review

Article 25 August 2021

MICROSTATELAB: The EEGLAB Toolbox for Resting-State Microstate Analysis

Article Open access 11 September 2023

Emerging Trends in EEG Signal Processing: A Systematic Review

Article 09 April 2024

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

During 2019, around 968 million people suffered from some form of mental disorder, that is 1 out of 8 people around the world. One year later, because of the COVID-19 pandemic, this percentage increased significantly, rising to 26% for Anxiety disorder and 28% for Major Depressive Disorder (MDD) [1]. One of the most common disorders suffered by the population is MDD. Indeed, suicide is the third cause of death among 15-29-year-olds, influenced by MDD. Regarding young people, around 1 out of 5 children and adolescents suffer from some mental health issue. In addition, people with severe mental disorders have more chances to die prematurely in comparison with neurotypicals, specifically 10 to 20 years earlier in high-income countries and up to 30 years earlier in low-income countries [1]. Therefore, it is paramount to make a reliable diagnosis as early as possible. To the best of our knowledge, there is still a gap between people needing to be diagnosed, and access to effective and low-cost healthcare. With this work, we aim to help create an accurate, reliable, and accessible diagnosis, through the collection of Feature Engineering (FE) techniques, as well as Artificial Intelligence (AI) models that other researchers have used mainly to diagnose mental disorders.

The Diagnostic and Statistical Manual of Mental Disorders Fifth Edition (DSM-5) acts as a standard reference for psychiatry and it includes more than 450 different definitions of mental disorders [2]. This fact highlights the impact that mental disorders have on individuals and society in general. According to [3], over a third of the total European population suffers from mental disorders nowadays, and only a third of all cases receive some kind of treatment, concluding that the burden of mental disorders has been considerably underestimated. Therefore, having efficient methods for an early diagnosis of mental disorders would be extremely helpful.

There are several procedures to diagnose mental disorders such as neurological exams [4], neuropsychological assessments [5] and neuroimaging modalities [6]. Physicians specialized in mental disorders usually turn to neuroimaging approaches for help and to improve the efficacy of the treatments. Neuroimages can be divided into two categories, depending on what type of data they collect: functional and structural. Functional neuroimages show information about the activity of the brain, while structural neuroimages capture the interior structures of the brain. Some of the most used functional neuroimages are Magnetoencephalography [7], functional Magnetic Resonance Imaging (fMRI) [8], Electroencephalogram (EEG) [9], and Positron Emission Tomography (PET) [10]. On the other side, part of the most usual structural neuroimaging modalities is structural MRI (sMRI) [11], Diffusion Tensor Imaging (DTI) [12] and Computed Tomography (CT) [13]. This study assesses papers that deal with functional data, specifically with EEG modalities, for the reasons set out below.

Electroencephalography consists in recording brain activity by measuring voltage fluctuations of brain regions via the placement of small electrodes around the scalp. This record is called EEG and it is widely used in the study and diagnosis of brain disorders such as Epilepsy [14], Dementia [15], Schizophrenia [16], and Alzheimer’s Disease (AD) [17]. The most remarkable advantages of EEG are the following. First, EEG devices are relatively portable, easy to set up and non-invasive. Second, these devices are characterized by their high temporal resolution, being capable of recording brain signals with up to 1 millisecond of resolution, even though their spatial resolution is worse compared with other methods such as MRI. Finally, EEG devices are relatively inexpensive, compared to other technology devices used to collect brain data, such as CT scanners or fMRI and PET devices. As a result, the use of EEG is a good candidate for an efficient and affordable diagnosis of mental disorders; especially since it is easy to use in underdeveloped countries, where quality healthcare is not fully accessible.

Traditionally, EEG was visually interpreted by highly specialized experts, and it was characterized as being a difficult and time-consuming task, as the volume of information that EEG data provides is considerably large. Because of this, the use of AI techniques has been proposed to automate the process and to aid in the diagnosis and study of mental disorders. Such techniques fall into two subsets of AI itself, defined as Machine Learning (ML), and a subset of ML, Deep Learning (DL). One of the most common tasks in the field of EEG and mental disorders is classification, i.e., an ML model takes several features derived from EEG data as input and outputs a prediction, e.g., whether a patient has a mental disorder or not. The input features are extracted from the raw EEG by applying FE. Extracting and choosing the right set of features for a given problem is one of the most relevant factors, as it can make the difference between an unusable ML model and a simple, effective model. In other words, it can be said that the FE is crucial, especially when using data such as EEG.

Indeed, properly applying FE on EEG data to train AI models related to brain disorders is still a challenging task, as there is no general FE pipeline that performs well on every task. For example, the authors of [18] showed that the beta band power was a relevant feature for detecting individuals with Insomnia, as they had significant and robust increases in that feature, whereas [19] showed that features such as the Variance, Energy, Nonlinear Energy and Shannon Entropy of the raw EEG signals were relevant for the task of epileptic seizure detection. In other words, the set of relevant features depends on the task and/or dataset, and properly applying FE remains a challenging task.

Given the importance of FE in the diagnosis of mental disorders by means of ML, it is clear that a secondary study that compiles the works in this area would foster the development of new techniques and lead to improvements in the diagnosis. Therefore, in this paper, we present a Systematic Mapping Study (SMS) with the purpose of clearly showing which FE techniques and ML models have been applied to each mental disorder in order to provide a way to easily find new research opportunities within the field. There are some secondary studies (reviews, surveys, and similar studies) on the topic of EEG and ML models applied to brain disorders, such as [20, 21]. Nevertheless, our work contains more significant contributions, as we can see in Table 1. Moreover, we will also share some insights and issues that we found after carefully analyzing the results of the SMS, as well as providing recommendations related to research directions. In order to help researchers to introduce new research opportunities discovered via this SMS, it is also included a brief description of other secondary studies that we found when collecting papers which can act as a starting point for future investigation gaps. It is worth noting that we do not report the efficiency achieved in each paper. That is mainly because it would not be correct to compare the accuracy obtained with different databases, since almost every paper selected uses a different one. In addition, as we present an SMS, we have only analyzed the abstract of each paper due to the number of works selected and we were not able to gather the databases used by reading only the abstract.

Table 1 Main contributions of our work in comparison with the closest related works [20, 21]

Feature engineering of EEG applied to mental disorders: a systematic mapping study

Abstract

Graphical Abstract

Similar content being viewed by others

Deep learning techniques for classification of electroencephalogram (EEG) motor imagery (MI) signals: a review

MICROSTATELAB: The EEGLAB Toolbox for Resting-State Microstate Analysis

Emerging Trends in EEG Signal Processing: A Systematic Review

1 Introduction

2 Brain disorders, EEG, FE and ML

2.1 Brain disorders

2.2 EEG

2.3 Feature Engineering

2.4 Machine Learning

3 Related works

4 Methodology

4.1 Definition of research questions

4.2 Conducted search

4.3 Screening of papers

4.4 Keywording of full text

4.5 Data extraction and mapping of studies

4.6 Research question 4

4.6.1 Signal transform related works

4.6.2 Feature extraction related works

4.6.3 Feature selection related works

4.6.4 Classification techniques related works

4.6.5 Brain disorders related works

4.6.6 Other related works

5 Discussion

6 Conclusions and future works

6.1 Conclusions

6.2 Limitations

6.3 Future work

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's Note

Appendix A Labels and categories

Appendix A Labels and categories

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation