Introduction

A neurological disease is a disorder in nervous system in the human’s body, and it is composed of three parts: brain, human body, and spinal cord. Causes of neurological problems are varied, but they could include a genetic disorder, congenital abnormalities, brain injury, or environmental health problems. The most significant symptom in neurological diseases is chronic pain, which affects 20–40% of patients and reaches 60% in the case of Parkinson’s disease [1]. From a medical perspective, diagnosing neurological diseases is challenging, because the nervous system is complex. Many types of neurological diseases exist, and thankfully, all of them have treatments that can allow the patients to live a normal life and lead them to a comfortable level. The second most common disease after Alzheimer’s is epilepsy. This is a type of neurological disorder characterized by a disorder of the brain causing recurrent seizures [2]. Epilepsy’s history begins in the twentieth century when American Canadian neurologists noted a connection between brain and personality changes in the 1950s. The first antiepileptic, phenytoin, was introduced in that time, and it is still used alongside other drugs [3]. It is thus very important to find this correlation that could potentially result in an early identification and severity of the disease. In this regard, several studies have been conducted for epilepsy identification in the literature. Out of which machine learning (ML)-based approaches are getting popularity due to their promising nature and effectiveness. This study focuses on ML-based approaches in disease identification, especially for epilepsy and further dig down for applications of ML-based approaches in pediatric epilepsy identification. This area is important in many ways; for instance, an earlier identification of the disease in the minors could help in prevention and their better care after. Moreover, it is a long and sequential process involving several aspects that can affect the overall classification results. For instance, dataset, the way signals were obtained (health of BCI), the way signals were processed (a range of options available like DWT, IWT etc.), types of filters applied, the way missing details were added, investigating wide range of classifiers and their effectiveness, and finally applying various evaluation metrics such as accuracy, precision, recall, etc. to contrast the performance of various classifiers. Therefore, analyzing the best combination for a specific dataset is really a challenging task. This study aims to uncover such aspects in the pediatric epilepsy identification.

The organization of paper as follows: the background of pediatric epilepsy is given in the section “Background”; the section “Literature Review” contains the comprehensive literature review. Analysis by contrasting outcomes of various techniques is given in the section “Analysis of Feature Extraction Methods of EEG Signals”, while the section “Conclusion” concludes the paper.

Background

Epilepsy in Adult and Children

According to the “International League against Epilepsy” (ILAE) in 2014, epilepsy definition is “a transient occurrence of signs and/or symptoms due to abnormal, excessive or synchronous neuronal activity in the brain” [4]. Epilepsy can be diagnosed by three main symptoms: at least two seizures having occurred over 24 h, reflex seizures having occurred twice or more over 10 years, or/and epilepsy syndrome [4]. The most common symptom is called “epileptic seizure” [5], which is a seizure caused by a disrupting episode with the brain’s activities, and not all epileptic patients have seizures [6]. Approximately 50 million people over the world have epilepsy [7], or 1% of the population. According to the Saudi Epilepsy Society, 0.654% of people in Saudi Arabia suffer from this disease. Epilepsy affects all ages; the symptoms and signs differ by age group. For example, in newborns, the symptoms involve a lack of oxygen during delivery and/or abnormal brain development, in which the symptoms in infants are brain tumors and/or genetic disorders [2]. Therefore, an accurate diagnosis is important. As childhood is the stage of brain formation, epilepsy occurs within a more dynamic nervous system and might thus be interfering with brain development that could affect the individual in several ways, such as the failure to develop skills, a slower rate of development, and the possibility of losing previously developed skills [8]. Every 4.8/1000 children worldwide have epilepsy. In the US, 0.94/1000 children under 18 have epilepsy [9]. In recent years, epilepsy diagnosis in infants and children has improved, and new methods to identify epilepsy have become hot topics for researchers. Electroencephalogram (EEG) signals can reflect the state of brain activities with time. It is a complex and non-linear interconnection between a billion neurons [10]. In addition, it is frequently the way to diagnose epilepsy by analyzing the EEG data; in this regard, several studies have focused on aiding epileptic patients to find suitable treatments.

Brain–Computer Interface

Brain–computer interface (BCI) is a fast-growing technology that helps in the medical field in terms of diagnosis. It contains hardware and software that can monitor the brain's activities to diagnose neurological diseases [11]. The BCI translate the brain signals into commands to send it into an output device to make decision. In addition, it can also store the output data [12]. The cycle of BCI can be summarized in five steps, as illustrated in Fig. 1, and details of each step provided afterward. After acquiring, the signal should be processed for further steps, such as feature extraction and classification, and it is eventually demonstrated on the application interface [13]. There are many applications of BCI (e.g., artificial hands using a brain-machine interface, systems to recognize driver drowsiness, brain fingerprint, and EEG) [13]. The challenges that BCI faces are many, especially when it is employed in the real world. The primary challenge is extracting the signals from the brain, which is not easy, because the strength of signals and the rate of data (bandwidth) are low [13], but in general, it is useful for researchers, because it does not require deep knowledge of programming languages. In a neurological disorder, BCI can connect the brain with the devices, and it allows the specialist to observe the brain activities and evaluate the disease.

Fig. 1
figure 1

Cycle of brain–computer interface

Interfacing and Signal Acquisition

The first step in EEG signal processing is to remove artifacts. To do so, specific software must be used, such as MATLAB or Python. Then, the feature from the signals data must be extracted using feature extraction methods, such as wavelet transform. To remove artifacts from signals, FIR filter was applied. The FIR filter consisted of “high-pass” and “low-pass” filters. The “high-pass” filter was used to “allow frequencies higher than the border to pass through it while blocking low frequencies”, and the “low-pass” filter was used to allow frequencies lower than the border to pass through it while blocking high frequencies”. In this work, the high-pass filter border was 0.5 Hz, and the low-pass filter border was 40 Hz. These boundary frequencies were selected, because 0.5–40 Hz represents the range of the five frequency bands, from delta to gamma (Table 1). However, any frequencies lower than 0.5 Hz and higher than 40 Hz are regarded as noisy signals.

Table 1 Frequency bands of EEG signals

Preprocessing

The second step of EEG data processing is to determine the channel location on the EEG scalp. Determining the location of the channels is significant to plot the EEG scalp map in 2D or 3D or to plot the data component in the brain area [97]. The location channels file is in the location format that should add to an EEG signal in case the file does not have any electrode locations. By knowing the locations of the channels, identifying the epileptic area becomes clear in addition to the types of seizures. To add the channel location file to the EEG data, the number of channels might differ from one patient to another depending about the EEG recording. In this study, the numbers of the channels in each patient could differ; for example, patient number 16 has 28 channels, while patient number 8 has 23 channels.

Feature Extraction

Since the EEG data are non-stationary signals, using DWT in the “wavelet transform” is a suitable method for an EEG signal, since it captures features in both the “time domain” and “frequency domain”. To select a convenient level of decomposition in the Daubechies family, wavelet decomposition splits the original signal into a different band of frequencies called the A’s and D’s, which are Approximations and Details of coefficient information, respectively. In each stage, two types of coefficients exist: details as high-pass frequencies and approximations as low-pass frequencies, together with the number for the level (e.g., in level one there are D1 and A1). This procedure is repeated on the approximation side until it reaches the low-pass frequency. Statistics over the sets of coefficients are used to decrease the dimensionality of the extracted feature vectors; therefore, statistics features are used to represent the distribution of the time–frequency of the EEG signals. The statistics features selected in this study are “maximum”, “minimum”, “mean”, “median”, and “standard deviation” (STD) of the wavelet coefficients in each original single signal. Based on the feature extraction, 9-dimensional feature sets (from D1 to D9 and A9) in the five statistics over the sets are calculated. The average of the statistic features of the wavelet coefficients for each channel is calculated.

Classification

The classification technique is used when the data have an output, and the researcher is attempting to predict. The classification problem appears as when signals were classified as an epileptic pediatric seizure or not. This type of classification is called binary classification, in which there are two types of classes.

EEG Signals

The electroencephalogram (EEG) is a clinical method for recording the brain’s activities signals from the brain surface. Based on the state of the brain, the characteristics of the EEG change [14], as presented in Fig. 2 [15]. The recording of high-quality EEG signals depends on the selection of positions of electrodes in the brain surface based on the brain regions. The brain can be divided into four regions based on functionality, as depicted in Fig. 3. These regions are the “occipital lobe”, “temporal lobe”, “parietal lobe”, and “frontal lobe”. The occipital region is the back region of the brain and is responsible for vision; the temporal region is on both side of the brain above the ears, and it is responsible for the brain skills such as hearing, memory, meaning, and language; the frontal region is up the “temporal lobes”, and it is responsible for movement, speech, and emotion. The parietal regions are behind the “frontal lobes” in the top back of the brain and are responsible for the senses, such as pain, touch, and taste [15].

Fig. 2
figure 2

EEG electrode measures the signal through the brain surface [15]  

Fig. 3
figure 3

Brain regions [15]

An EEG is recorded by placing electrodes in different positions on the brain surface, and each electrode has a name based on its place on the brain surface, as illustrated in Fig. 4 [16]. This arrangement is called the 10–20 system; it is a relationship between the locations of each electrode measuring the total distances between points on the head [10]. For example, from the bridge of the nose to the lowest point on the back of the head [12], each electrode has a number and letter to identify its location on the head. Here, “F” is frontal region, “T” is temporal region, “P” is parietal region, and “O” is occipital region, while even numbers mean the right half of the head, and odd numbers mean the left half of the head [16].

Fig. 4
figure 4

Electrode location [16]

The equipments needed to record EEG signals are electrodes, amplifiers, and filters, an analog-to-digital converter, and recording hardware and software. Figure 5 illustrates the EEG recording set-up [17].

Fig. 5
figure 5

EEG recording set-up [17]

An EEG signal can be divided into five frequency bands based on rhythm [18]: “delta”, “theta”, “alpha”, “beta”, and “gamma”. The frequency bands range of “delta” between 0.5 and 4 Hz, “theta” between 4 and 8 Hz, “alpha” between 8 and 13 Hz, “beta” between 13 and 30 Hz, and “gamma” more than 30 Hz (see Table 1). The amplitudes of alpha waves are low; these waves are recorded primarily from the occipital and parietal regions. For beta waves, the amplitudes are low and are typically recorded in the temporal and frontal regions. By contrast, delta waves are high where theta waves are slow–medium, and gamma waves are the fastest frequency bands and could reach up to 100 Hz [14].

In an actual EEG signal, there is a mixture of these frequencies, and it is a complex signal. Many techniques exist for extracting features from signals; the two most known are the Fourier transform and the wavelet transform.

Classification of Epileptic Seizures

Classifying epileptic seizures is important for researchers to conduct investigation, for clinicians to enhance the treatment and drugs, and for the patient [4]. According to the classification by the ILAE performed in 2017, seizure types are divided into three sections, illustrated in Fig. 6. Based on the area of onset, seizures are classified into three major sections: “focal”, “generalized”, and “unknown onset”. Focal onset is a new term for “partial,” and can occur in any region of the brain [13]. Generalized onset occurs when seizures are activated by patient behavior and EEG, and unknown onset refers to an “unknown” onset, where other symptoms are “known,” meaning that seizures cannot be diagnosed as having focal or generalization onset [4, 13]. Epilepsy is classified into types as well as a seizure types, creating a fourth type that is a combination of focal and generalized onset [19].

Fig. 6
figure 6

Classification of seizure types based on ILAE [13]

Many techniques have been used to classify epileptic seizures, most of which are reviewed in the subsequent sections. The goal of this study is to review the application of machine learning techniques in epileptic seizures identification. The paper reviews the epilepsy approaches systematically. From general epilepsy identification in adults using AI and ML to specifically children. To highlight the research gaps and shortcoming in terms of dataset availability, signal preprocessing models, and selection of appropriate model.

Literature Review

In the past decade, many studies have published on EEG signal classification for detecting epilepsy and other neurological diseases. In this section, some of the previous studies are discussed, and it is divided into several sections based on area of specialization.

Machine Learning Techniques in Epilepsy

Several studies have applied data mining techniques to detect epilepsy in humans and animals, and they achieved high accuracy. In Jaiswal et al. [20], two techniques are proposed: “sub-pattern” and “cross-sub-pattern correlation-based principal component analysis” (“SpPCA” and “SubXPCA”) with the support vector machine algorithm. The authors compared their result with other studies that used different algorithms. The results of the proposed techniques achieved the highest accuracy, 100%, and this average was better than that of other techniques, such as “k-nearest neighbor”, “support vector machine” (SVM), “decision tree”, and “naïve Bayes”. The authors in [20] proposed method for detect seizures of epilepsy, the type of the problem is a binary classification, which the EEG signals are classified into two classes: seizure and non-seizure signals. The EEG signals can be classified as seizure and non-seizure using a key point computation-based “local binary pattern” (LBP) [21]. Processing of the EEG signals is divided into three phases, the first phase called “key point localization”, the second called “key point-based local binary pattern computation”, and the third called histogram feature. Afterward, the “support vector machine” classifier categorizes the signals into (seizure/non-seizure). These methodologies were easy to use and simple to implement; it was achieved good performance compared with existing methods in other studies.

In Tharayil et al., [22], the authors developed a method to predict epileptic seizures in adult and child patients together. This method called linear mixed model; they applied these techniques to more than 1.2 million seizures recorded. The primary discovery was that all developed models achieved higher accuracy in adults than in children. The authors stated the reasons for this, such as that seizure patterns of children and adults are different, or that undercounted seizures in children not being available. The early discover of epileptic seizures can help the patient to avoid any side effects in the brain. In Usman et al. [23], data preprocessing converted 23 EEG signals channels into a single signal channel to improve the “signal-to-noise ratio” (SNR) and then applied “empirical mode decomposition” (EMD) to increase the SNR. The authors selected SVM as a classifier, and the result revealed that the model predicted the seizure prior to its occurrence by 23.6 min, up to a maximum prediction time of 33 min. According to Kabir et al. [24], the proposed approach is performed by dividing the EEG signals into groups based on time period and then drawing a sample from each group of the class via optimum-allocated techniques (OTA) before combining all the samples; afterward, the features are extracted from the OTA set.

The authors in [25] used three techniques in learning: “logistic model trees” (LMTs), “multinomial logistic regression” (MLR), and SVM. The results reveal that the LMT classifier achieved higher accuracy than the others, and that the OAT can be used as an ideal scheme for extracting the features. A few other algorithms were applied in the epilepsy data, such as SVM and neural network, relevance vector machine (RVM), and fractal dimension [26]. In addition, there are random forest, Gaussian mixture models, k-means, and other methods [27]. The SVM algorithm is efficient in the binary classification problem. In research by Sohaib [28], the authors evaluated different machine learning techniques to classify EEG signals. The study extracted four features: “minimum”, “maximum”, “mean”, and “standard deviation” (STD). The result reveals that the SVM classifier was the best classifier for EEG signals, with 85.81%, while the regression tree was second best, with 83.50%. In Patrick et al., [29], applied machine learning techniques were used to predict and detect an epileptic seizure by applying three techniques. The first was the phase-space adjacency spectrum, which used the graph of phase-space adjacency as a biomarker of seizure prediction; this technique achieved 97% accuracy. The second technique was the phase-space Laplacian spectrum, which used the metrics of the phase-space Laplacian spectrum as a biomarker of seizure prediction; this achieved 93% accuracy. The last technique was the hypergraph analysis of phase-space graph, which analyzed a subset of edges of phase graph as a hyperedge in hypergraph and used it as a biomarker for seizure prediction and training accuracy; this technique achieved 93% and 80% for testing accuracy. For seizure detection, the author combined the phase-space analysis method with deep learning using the CNN algorithm to detect the onset of seizures, and this achieved 100% accuracy. An ensemble classifier was used in many studies to classify epileptic seizures.

In Abualsaud et al. [30], the ensemble classifier was applied in incomplete EEG data to examine the efficiency of the classifier. In a comparison with other experiments, the ensemble classifier achieved 90%, while the other experiments achieved 85%, 85.9%, and 89.5%, respectively. In Raghu et al. [31], a new tool called “Computerized Automated Detection of Focal Epileptic Seizure” (CADFES) was introduced. Its primary function was to preprocess EEG data to detect focal and non-focal epileptic seizures. After extracting 28 features from the data set, the authors optimized the number of features using “Neighborhood Component Analysis” (NCA), and then, Four classifiers used to evaluate algorithm performance were SVM, KNN, random forest, and the AdaBoost classifier. The result demonstrated that SVM achieved the best accuracy, with 95.9%. In [32], the authors established a new method to detect an epileptic seizure. They used hybrid SVM by combining a “genetic algorithm” (GA) with “particle swarm optimization” (PSO) to determine the SVM parameters; this model achieved 99.38% accuracy. Lekshmy et al. [84] provided a comparative analysis of the ML methods in epileptic seizure prediction along with their effectiveness. It was concluded that the Random Forest (RF) and long short-term memory (LSTM) algorithms achieved the highest accuracies as 97% and 98%, respectively. Similarly, Nair et al. [85] concluded that the AI-based approaches have been tremendously contributed to epilepsy detection, prediction, and management for an improved healthcare society. Likewise, Natu et al. [86] presented a wholistic view of AI and ML applications in epilepsy detection. It encompasses, data preprocessing approaches, channel selection methods followed by the classifier or the prediction model. They further provided the research gap and the limitations in this regard. For instance, the dataset related issues and shortcomings as well as the incurred computation cost of the prediction model. As a remedy, they suggested dataset labeling, deep learning algorithms investigation alongside the appropriate feature selection. In [87], authors proposed a deep learning (DL) approach to seizer detection by investigating the reconstructed phase space (RPS) instead of direct EEG signals that exhibits a chaotic and non-linear behavior make them inadequate for analysis. The approach exhibits accuracies of 98.5% and 95% for binary and tertiary classification, respectively. In [88] authors proposed a novel ML approach to epilepsy prediction using correlation dimension (CD) and achieved 100% accuracy. The proposed model exhibits a much faster convergence in contrast to similar approach in the literature over the same dataset due to a small set of features and specific combination of subsets. Table 2 presents a summary of the studies investigating AI and ML in epilepsy. It includes, type of dataset used, algorithm investigated and the obtained metrics like accuracy, precision, and recall. It shows that ML remain quite promising in terms of accuracy and other metrics for epilepsy identification.

Table 2 Summary of machine learning in epilepsy

Machine Learning Techniques in Pediatric Epilepsy

In Shoeb [33] and [34], the researchers applied the SVM algorithm to classify seizure onset in pediatric epilepsy patients. The data set from the Boston children’s Hospital, it contains 23 patient’s files of EEG recording, and all patients under 18 years. It consisted of 18 channels that contained 163 seizures and was separated into records of one hour each. The records that did not contain a seizure were called “non-seizure” records, and those that had a seizure were called “seizure” records. For 163 seizures [33], the algorithm achieved 96% [33] and 173 seizures as well [34] with a 3-s delay. In [35], the authors designed an “Advance Seizure Prediction via Preictal Relabeling” (ASPPR) to predict epileptic pediatric seizures; the model achieved 96.30% for predicting seizures between 1 and 6 min, 96.13% for 8 and 13 min, 94% for 14 and 19 min, and 94.2% for 20 and 25 min.

In [36], the authors proposed a technique to extract features from the epileptic pediatric data set from the CHB-MIT EEG database. In their proposed method, EEG signals are mapped into 2D space that leads to texture image and the gray-level image domain. Furthermore, they compared their method with other methods and achieved 97.74% using the SVM classifier with linear kernels. In addition, a study was conducted using the CHB-MIT EEG database [37]. The data mining of EEG signals was analyzed using a time-series approach, which calculated peak points (the lowest point in a part of the signals) and valley points (the highest point in a part of the signals) and then calculated the distances between them. The study consisted of three experiments; the first experiment was conducted of the basis of an optimized setting of one patient file, while the second experiment was performed by identifying common sub-settings to predict the onset of the seizure for one patient seizure file. The third experiment was applied to the setting of the first experiment on three other patients. All the experiments predicted onset seizures early with minimum latency. Identifying seizures in children is different than in adults, because the seizures in children have lack characteristics of EEG features. The study in [38] proposed two linear methods to detect seizures in newborns (39–42 weeks) namely correlation-based feature methods and a relief F method. The feature selector operated in several steps: select subset feature, relevance and redundancy evaluator, and optimized feature vector prior to classification. The average detection achieved 93%. Some studies that focused on newborn data proposed alternative methods for detecting seizures ([39,40,41,42,43], and [44]) and achieved a high-level accuracy compared with previous studies. The number of electrodes may affect pediatric seizure detection. The authors in [42] attempted to decrease the electrode’s number in the terms to achieve a high-quality result of detecting seizures; the experiment revealed that 19 electrodes had higher agreement than less than 19. When using a “collective network of binary classifiers” (CNBC) using MD PSO, the proposed method achieved 93% accuracy [45], this method was applied in long-term EEG data and extracted the seizures from the signals data. Table 3 shows a summary of studies that reviewed in this section are presented. The authors in [46] detected cognitive impairment in children with epilepsy using network analysis, and their proposed method achieved 85% accuracy. Authors in [74] investigated ensemble learning approaches to identify epilepsy in the children and achieve 100% accuracy over CHB-MIT children epilepsy dataset. Similarly, authors in [89] proposed a novel deep learning approach to detect epilepsy in children on CHB-MIT dataset. They proposed a 2D deep convolution autoencoder (2D-DCAE) linked to a neural network-based classifier to form a unified system that is trained in a supervised deep convolutional autoencoder (SDCAE) using LSTM. The scheme exhibits 98.79 ± 0.53% accuracy, 98.72 ± 0.77% sensitivity, 98.86 ± 0.53% specificity, 98.86 ± 0.53% precision, and F1-score of 98.79 ± 0.53%. It can be seen that ML-based techniques are promising in the pediatric epilepsy identification as of Table 3.

Table 3 Summary of machine learning in pediatric epilepsy

Machine Learning Techniques in Other Fields

In Mohamed et al. [47], the authors applied data mining techniques on data from the Egypt institute in Giza to build a model that helps students to select the track based on their features. Three types of decision tree algorithms were applied: J48, reducer error pruning (REP) tree, and random tree. The results revealed that the J48 tree achieved higher accuracy, with 87.64%. The authors in [48] applied the J48 tree algorithm using AdaBoost and bagging techniques in the Diabetes data set. The results demonstrated that AdaBoost is superior to bagging. The authors in [49] applied five classifiers (after hit and trial) to data sets and took the average. They used an ensemble classifier to increase the accuracy of detecting the disease, develop five models, and compared the models’ performance. These algorithms included support vector machine (SVM), C5.0, extreme learning machine (ELM), multivariate adaptive regression splines (MARS), and random forest (RF). Figure 7 presents various steps of the proposed techniques.

Fig. 7
figure 7

Flowchart of the proposed techniques in [49]

Using more than one classifier in the ensemble technique, the study in [50] achieved 98.07% accuracy for diagnosing breast cancer. The authors proposed two layers of the ensemble model by applying three Meta classifiers rather than single classifiers such as Bayes-Net and Naïve Bayes to improve the accuracy of detecting breast cancer. Table 4 shows a summary of studies that reviewed in this section are presented. In [51], the authors tested the ability of machine learning techniques to predict which tree in a forest might be damaged during a storm. An “artificial neural network” and “random forest” were used as models; the random forest algorithm achieved better accuracy than NN, with more than 75%. In [75][75], authors proposed supervised machine learning methods for COVID-19 and achieved very high accuracy. Similarly, ML-based approaches have been investigated for classification in other areas [90][90]. Approaches in [92] and [93] investigate ML for heart disease prediction and diabetes type II, respectively.

Table 4 Summary of machine learning in other diseases

Survey Studies in Machine Learning Techniques

The authors in [52] divided data mining techniques into two main categories: the predictive and the descriptive [52]. Each category has its own techniques. The predictive category has classification, predication, regression, time-series, and fractal coding. The descriptive category has association rules, summarization, clustering, and sequence discovery. According to the IEEE conference on data mining techniques in December 2006 [53], the top 10 algorithms tested were “AdaBoost”, “Apriori”, “Bagging”, “C4.5”, “CART”, “EM”,” k-means”, “KNN”, “NB”, and “SVM”. No one algorithm was better than another; the best result of the experimental models determined which achieved the best accuracy. In Siuly et al. [54], the authors presented the survey in neurological diseases diagnosis, in which they compared between the most important computer-aided diagnosis (CAD) approaches to neurological diseases. They reviewed many articles on epilepsy and applied the classification methods in EEG signal classification, such as RBFNN, “support vector machine”, “k-nearest neighbor”, “artificial neural network”, and “multiclass least square support vector machine” (MLS-SVM), among other methods. All these studies proved that computer-aided diagnosis can help to diagnose all these types of diseases. Many processing techniques exist for processing EEG signals before applying machine learning techniques. The most popular technique is time analysis, and it consists of two sections: “time domain” and “frequency domain”. The wavelet technique is an analysis of signals into sub-signals that contain activity at different times [7], and it can be a combination of two or more techniques. The primary challenge in signal processing is selecting analysis methods; there are many, and each has pros and cons [55]. The most two methods are the Fourier transform, the wavelet transform, and the eigenvector have characteristics that help researchers to select a suitable method for their research. Table 5 presents a summary of the studies that reviewed in this section. The paper in [56] surveyed various supervised machine learning techniques of text classification. The results revealed that the most effective method for classifying a text is to combine related information into the classification process, because it enhances the result quality of the classification.

Table 5 Summary of survey studies in machine learning techniques

Analysis of Feature Extraction Methods of EEG Signals

An EEG signals has many features; therefore, many studies have proposed techniques to extract this feature in terms to classify them. The authors in [57] used various methods to extract features: “time analysis”, “frequency analysis”, “time–frequency analysis”, and “time–frequency space analysis”. The features were classified using an artificial neuronal network algorithm, and 99% accuracy was achieved between two tasks and 96% between three tasks. Another study described how to extract features from EEG signals using the fast Fourier transform filter [58], a short-time Fourier transform [59].

The authors in [60] used two methods for extract features from EEG signals. The first method called “local neighbor descriptive pattern” (LNDP), and the second was “one-dimensional local gradient pattern” (1D-LGP), which achieved an accuracy of 99.82% and 99.80%, respectively. The authors in [61] described the wavelet transform filter to extract features, which achieved 98%. The first step in terms of classifying EEG signals is reprocessing. Many approaches are available to do so, each of which has different results. In [62], the authors proposed an interesting approach for removing artifact signals called “fully automated statistical thresholding for EEG artifact rejection” (FASTER). This approach divides EEG signals into five aspects: channels, single-channel, epochs, single epoch, ICs, and aggregated data, each of which were calculated data of statistical parameters and defined the metrics. This method achieved more than 90% accuracy. Furthermore, the authors in [63] proposed a successful method for removing artifacts from signals. The method is known as ADJUST; it is automatic algorithm which identifies artifact components by combined temporal features with stereotype artifact-specific spatial and achieved 95.2% accuracy.

Electroencephalogram signals have noise signals such as eye movements called electrooculography (EOG), and to process EEG signals and achieve quality results in classification, a noise should be removed from signals. The authors in [64] proposed automated methods to removed artifacts from EEG signals by implementing an open-source software library called BioSig for processing a signal, which reduces artifact by 80%. The contribution of the authors in [65] was to achieve clean EEG signal data from the artifact, and these data were made available for the public to allow the researchers to manually add an artifact for their purposes. A method called “independent component analysis” (ICA) combined with dipole was applied in [66]. The advantages of ICA in terms of removed artifacts are that it is efficient; the technique can separate EEG signals and artifacts without needing references. According to a review the authors presented in [18], no perfect solution exists for cleaning EEG signals from the artifacts, and some of the methods have an application, while others do not, according to the comparison. Many studies have proposed methods to reduce the noisy signal from EEG signal, such as combining the wavelet transform with an adaptive thresholding mechanism [67] or independent component analysis [68]. In terms of using ICA to remove artifact signals from EEG, the study in [69] reveals that a good selection of variables in ICA results in enhanced EEG signals and reduces artifacts as accurately as possible for these variables. The advantage of using the ICA is the ability to examine information directly in the data [70]. Although it is an efficient method for rejecting artifacts, it has disadvantages; one of which is that ICA is a complex algorithm that requires suitable data to address [71]. The authors in [72] proposed new methods by combining two techniques, namely the “information in frequency domain” technique and the “information gain” technique. After extracting the features, SVM is used to classify the EEG. These proposed methods achieved 95.62% accuracy. A technique in [94] proposed an improved feature extraction algorithm for EEG signals obtained from motor imagery BCI. They investigated ICA, WT, and common spatial pattern (CSP) jointly. For analysis purposes, they investigated Bayesian Linear Discriminative Analysis (BLDA), SVM, LDA, and Bagging Tree (BT). The accuracy obtained as BLDA: 87.42%, SVM: 81.75%, BT: 79.42, and LDA: 78.41%, respectively. Table 6 presents a summary of the studies reviewed in this section.

Table 6 Summary of feature extraction methods of EEG signals

Research Findings

The undergoing study reviews the epilepsy detection, prediction, and identification approaches using AI, ML, and DL, systematically. From general epilepsy identification in adults using AI and ML to specifically children. To highlight the research shortcoming in terms of dataset availability, signal preprocessing models, and selection of appropriate model. Following research gap has been identified with suggestions to overcome in the future studies.

  1. 1.

    Dataset the dataset availability is big challenge in epilepsy in general and for children in specific. This has many aspects: first, the time period and frequency of the seizer occurrence and its proper recording. This plays a critical role in disease prediction and varies for patient to patient and the way data were recorded. Second, the equipment used can also be another issue. Third, there is a need to combine, normalize, and process multiple practices datasets around the globe to have a holistic, comprehensive, and uniform dataset.

  2. 2.

    Dataset preprocessing the preprocessing methods play a crucial role in the accuracy of prediction and success of the model. There are various methods with their own pros and cons. There is a need to find out the best preprocessing approach for the unified and normalized dataset concept state in point 1.

  3. 3.

    Feature selection method there are several feature selection methods from EEG signals in the literature with their own strengths and weaknesses. An investigation of such methods for the holistic dataset (as state in points 1 and 2) is much needed for the better understanding the nature of the disease in a particular age group. It is worth mentioning that there is still much attention needed to improve the accuracy of feature selection methods. Moreover, this selection plays and crucial role in seizer prediction.

  4. 4.

    Selection of the model instead of having a single model or method, an ensemble of ML and DL method should be investigated in contrast to the learning methods and aligned to the points 1–3. It is better to come up with a framework in this regard, which should comprehend all the aspects from dataset selection to the evaluation.

Conclusion

This paper presents a comprehensive review on machine learning (ML) techniques for identification of pediatric epilepsy, comprising of last decade (2009–2022) especially in children. The techniques are further categorized in terms of approaches for general disease identification, epilepsy identification in general and finally pediatric epilepsy identification. It further tabulates and classifies the results chronologically, based on the schemes/approaches/algorithms investigated, datasets, and highest benchmark achieved. That is in terms of accuracy mainly and/or precision, F-score, and recall rate. Moreover, this study segregates the analysis and feature extraction approaches investigated on electroencephalogram (EEG) signals, specifically. It further elaborates that one scheme may be better in way, while another be in another way. By summarizing this systematic literature review, the researcher, scholars, and academician in the field of applied machine learning for disease identification can have a clear overview about the total work done in the field of epilepsy identification in general and pediatric epilepsy identification in specific. Furthermore, research gaps have been identified and potential solution is suggested for the researchers as their future work.