Abstract
Motor imagery (MI) is a mental process that produces two types of event-related potentials called event-related desynchronization (ERD) and event-related synchronization (ERS). We can record ERD and ERS in an electroencephalogram (EEG) and use them to identify a MI execution. However, the classification of MI is a challenging task because ERD and ERS exhibit inter- and intra-subject variability. Recently, researchers have proposed deep learning models to solve this problem. Although they achieve cutting-edge results, the amount of data available for training constrains their learning ability. To address this issue, we propose to incorporate hand-crafted features, which have a strong inductive bias, into deep learning models at different levels of depth, which have a soft inductive bias, without making them lose their ability to discover new features from data. Our approach enables the design of models that benefit from deep learning and traditional machine learning models for MI EEG-based classification. In this manner, it is possible to build compact machine learning models that perform better than pure deep learning models in a small data setting. Results of experiments on the public datasets 2a and 2b of the BCI Competition IV demonstrate that a model built following our proposed strategy achieves state-of-the-art accuracy on MI EEG-based classification.
Similar content being viewed by others
Data availability
The electroencephalogram data that support the findings of this study are available from BCI Competition IV, https://www.bbci.de/competition/iv/.
Notes
The phase locking value is a measure of the phase synchrony between signals from two EEG channels
References
Mulder T (2007) Motor imagery and action observation: Cognitive tools for rehabilitation. J Neural Transm 114(10):1265–1278. https://doi.org/10.1007/s00702-007-0763-z
Lotte F, Bougrain L, Cichocki A, Clerc M, Congedo M, Rakotomamonjy A, Yger F (2018) A review of classification algorithms for eeg-based brain-computer interfaces: a 10 year update. J Neural Eng 15:031005. https://doi.org/10.1088/1741-2552/aab2f2
Bashashati A, Fatourechi M, Ward RK, Birch GE (2007) A survey of signal processing algorithms in brain-computer interfaces based on electrical brain signals. J Neural Eng 4(2):32–57. https://doi.org/10.1088/1741-2560/4/2/R03
Nicolas-Alonso LF, Gomez-Gil J (2012) Brain computer interfaces, a review. Sensors (Basel, Switzerland)12(2):1211–79. https://doi.org/10.3390/s120201211
Sun S, Zhou J (2014) A review of adaptive feature extraction and classification methods for EEG-based brain-computer interfaces. In: 2014 International joint conference on neural networks (IJCNN), pp 1746–1753. https://doi.org/10.1109/IJCNN.2014.6889525
Al-Saegh A, Dawwd SA, Abdul-Jabbar JM (2021) Deep learning for motor imagery EEG-based classification: A review. Biomed Signal Process Control 63:102172. https://doi.org/10.1016/j.bspc.2020.102172
Tangermann M, Müller K-R, Aertsen A, Birbaumer N, Braun C, Brunner C, Leeb R, Mehring C, Miller KJ, Müller-Putz GR, Nolte G, Pfurtscheller G, Preissl H, Schalk G, Schlögl A, Vidaurre C, Waldert S, Blankertz B (2012) Review of the BCI competition IV. Front Neurosci 6:55. https://doi.org/10.3389/fnins.2012.00055
Hyvärinen A, Oja E (2000) Independent component analysis: algorithms and applications. Neural Netw 13(4–5):411–430. https://doi.org/10.1016/S0893-6080(00)00026-5
Koles ZJ, Lazar MS, Zhou SZ (1990) Spatial patterns underlying population differences in the background EEG. Brain Topogr 2(4):275–284. https://doi.org/10.1007/BF01129656
Ang KK, Chin ZY, Zhang H, Guan C (2008) Filter bank common spatial pattern (FBCSP) in brain-computer interface. In: International joint conference on neural networks (IJCNN), pp 2390–2397. https://doi.org/10.1109/IJCNN.2008.4634130
Ang KK, Chin ZY, Wang C, Guan C, Zhang H (2012) Filter bank common spatial pattern algorithm on BCI competition IV datasets 2a and 2b. Front Neurosci 6:39. https://doi.org/10.3389/fnins.2012.00039
Blankertz B, Tomioka R, Lemm S, Kawanabe M, Müller K-R (2008) Optimizing spatial filters for robust EEG single-trial analysis. IEEE Signal Proc Mag 25:41–56. https://doi.org/10.1109/MSP.2008.4408441
Caramia N, Lotte F, Ramat S (2014) Optimizing spatial filter pairs for EEG classification based on phase-synchronization. In: 2014 IEEE International conference on acoustics, speech and signal processing (ICASSP), pp 2049–2053. https://doi.org/10.1109/ICASSP.2014.6853959
Bustios P, Rosa JL (2017) Restricted exhaustive search for frequency band selection in motor imagery classification. In: 2017 International joint conference on neural networks (IJCNN), pp 4208–4213, https://doi.org/10.1109/IJCNN.2017.7966388
Brodu N, Lotte F, Lecuyer A (2011) Comparative study of band-power extraction techniques for motor imagery classification. In: 2011 IEEE Symposium on computational intelligence, cognitive algorithms, mind, and brain (CCMB), pp 1–6. https://doi.org/10.1109/CCMB.2011.5952105
He K, Zhang X, Ren S, Sun J (2015) Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. In: Proceedings of the IEEE international conference on computer vision, pp 1026–1034. https://doi.org/10.1109/ICCV.2015.123
Craik A, He Y, Contreras-Vidal JL (2019) Deep learning for electroencephalogram (EEG) classification tasks: A review. J Neural Eng 16(3):031001. https://doi.org/10.1088/1741-2552/ab0ab5
Schirrmeister RT, Springenberg JT, Fiederer LDJ, Glasstetter M, Eggensperger K, Tangermann M, Hutter F, Burgard W, Ball T (2017) Deep learning with convolutional neural networks for EEG decoding and visualization. Hum Brain Mapp 38(11):5391–5420. https://doi.org/10.1002/hbm.23730
Lawhern VJ, Solon AJ, Waytowich NR, Gordon SM, Hung CP, Lance BJ (2018) EEGNet: A compact convolutional network for eeg-based brain-computer interfaces. J Neural Eng 15(5):056013. https://doi.org/10.1088/1741-2552/aace8c
Borra D, Fantozzi S, Magosso E (2018) Interpretable and lightweight convolutional neural network for EEG decoding: Application to movement execution and imagination. Neural Networks 129:55–74. https://doi.org/10.1016/j.neunet.2020.05.032
Ravanelli M, Bengio Y (2018) Speaker recognition from raw waveform with sincnet. In: 2018 IEEE Spoken language technology workshop (SLT), pp 1021–1028. https://doi.org/10.1109/SLT.2018.8639585
Bustios P (2022) Source code repository. https://github.com/bustios/miconvnet
Kingma D, Ba J (2015) Adam: A Method for Stochastic Optimization. In: International conference on learning representations, ICLR
Abadi M, Agarwal A, Barham P, Brevdo E, Chen Z, Citro C, Corrado G.S, Davis A, Dean J, Devin M, Ghemawat S, Goodfellow I, Harp A, Irving G, Isard M, Jia Y, Jozefowicz R, Kaiser L, Kudlur M, Levenberg J, Mané D, Monga R, Moore S, Murray D, Olah C, Schuster M, Shlens J, Steiner B, Sutskever I, Talwar K, Tucker P, Vanhoucke V, Vasudevan V, Viégas F, Vinyals O, Warden P, Wattenberg M, Wicke M, Yu Y, Zheng X (2015) TensorFlow: large-scale machine learning on heterogeneous systems. Software available from tensorflow.org
Diez PF, Mut V, Laciar E, Avila E (2010) A comparison of monopolar and bipolar eeg recordings for SSVEP detection. In: 2010 Annual international conference of the ieee engineering in medicine and biology, pp 5803–5806. https://doi.org/10.1109/IEMBS.2010.5627451
Hameed A, Fourati R, Ammar B, Ksibi A, Alluhaidan AS, Ayed MB, Khleaf HK (2024) Temporal-spatial transformer based motor imagery classification for BCI using independent component analysis. Biomedical Signal Processing and Control 87:105359. https://doi.org/10.1016/j.bspc.2023.105359
Liu X, Shi R, Hui Q, Xu S, Wang S, Na R, Sun Y, Ding W, Zheng D, Chen X (2022) TCACNet: Temporal and channel attention convolutional network formotor imagery classification of EEG-based BCI. Inf Process Manag 59(5):103001. https://doi.org/10.1016/j.ipm.2022.103001
Medhi K, Hoque N, Dutta SK, Hussain MI (2022) An efficient EEG signal classification technique for brain-computer interface using hybrid deep learning. Biomedical Signal Processing and Control 78:104005. https://doi.org/10.1016/j.bspc.2022.104005
Amin SU, Alsulaiman M, Muhammad G, Mekhtiche MA, Shamim Hossain M (2019) Deep learning for eeg motor imagery classification based on multi-layer cnns feature fusion. Futur Gener Comput Syst 101:542–554. https://doi.org/10.1016/j.future.2019.06.027
Shi X, Li B, Wang W, Qin Y, Wang H, Wang X (2023) Classification algorithm for electroencephalogram-based motor imagery using hybrid neural network with spatio-temporal convolution and multi-head attention mechanism. Neuroscience 527:64–73. https://doi.org/10.1016/j.neuroscience.2023.07.020
Grosse-Wentrup M, Buss M (2008) Multiclass common spatial patterns and information theoretic feature extraction. IEEE Transactions on Biomedical Engineering 55(8):1991–2000. https://doi.org/10.1109/TBME.2008.921154
Acknowledgements
This study was financed in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix A common spatial patterns
Appendix A common spatial patterns
The CSP mathematical procedure to find out spatial filters for a two-class problem is described below.
First, calculate the covariance matrices of training epochs of each class \(k \in \{a, b\}\):
where:
-
\(X_{k,i} \in \mathbb {R}^{c \times t}\) is the i-th epoch of class k, composed of t samples from c channels.
-
\(n_k\) is the number of epochs of class k.
-
\(R_k \in \mathbb {R}^{c \times c}\) is the covariance matrix of training epochs of class k.
Then, with the covariance matrices \(R_a\) and \(R_b\), solve the generalized eigenvalue problem in the matrix form:
where:
-
\(W \in \mathbb {R}^{c \times c}\) (with \(c =\) number of channels) contains the generalized eigenvectors \(w_{j= \{1,...,c\}}\) of \(R_a\) and \(R_a + R_b\).
-
\(\Lambda \in \mathbb {R}^{c \times c}\) is a diagonal matrix which contains the generalized eigenvalues \(\lambda _{j= \{1, ...,c\}}\), corresponding to the eigenvectors \(w_j\), of \(R_a\) and \(R_a + R_b\).
Finally, each eigenvector \(w_j\) is a spatial filter. Grosse-Wentrup and Buss [31] proposed an extension of the CSP mathematical procedure for multiclass problems.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Bustios, P., Garcia Rosa, J.L. Incorporating hand-crafted features into deep learning models for motor imagery EEG-based classification. Appl Intell 53, 30133–30147 (2023). https://doi.org/10.1007/s10489-023-05134-x
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-023-05134-x