Electric powered wheelchair control using user-independent classification methods based on surface electromyography signals

Iqbal, Hassam; Zheng, Jinchuan; Chai, Rifai; Chandrasekaran, Sivachandran

doi:10.1007/s11517-023-02921-z

Electric powered wheelchair control using user-independent classification methods based on surface electromyography signals

Original Article
Open access
Published: 26 September 2023

Volume 62, pages 167–182, (2024)
Cite this article

Download PDF

You have full access to this open access article

Medical & Biological Engineering & Computing Aims and scope Submit manuscript

Electric powered wheelchair control using user-independent classification methods based on surface electromyography signals

Download PDF

Hassam Iqbal ORCID: orcid.org/0000-0002-1438-9403¹,
Jinchuan Zheng¹,
Rifai Chai¹ &
…
Sivachandran Chandrasekaran²

1503 Accesses
1 Citation
Explore all metrics

Abstract

Wheelchairs are one of the most popular assistive technology (AT) among individuals with motor impairments due to their comfort and mobility. People with finger problems may find it difficult to operate wheelchairs using the conventional joystick control method. Therefore, in this research study, a hand gesture-based control method is developed for operating an electric-powered wheelchair (EPW). This study selected a comfort-based hand position to determine the stop maneuver. An additional exploration was undertaken to investigate four gesture recognition methods: linear regression (LR), regularized linear regression (RLR), decision tree (DT), and multi-class support vector machine (MC-SVM). The first two methods, LR and RLR, have promising accuracy values of 94.85% and 95.88%, respectively, but each new user must be trained. To overcome this limitation, this study explored two user-independent classification methods: MC-SVM and DT. These methods effectively addressed the finger dependency issue and demonstrated remarkable success in recognizing gestures across different users. MC-SVM has about 99.05% of both precision and accuracy, and the DT has about 97.77% accuracy and precision. All six participants were successful in controlling the EPW without any collisions. According to the experimental results, the proposed approach has high accuracy and can address finger dependency issues.

sEMG Classification of Upper Limb Movements Under Different Loads

Hand Gesture Recognition Based Omnidirectional Wheelchair Control Using IMU and EMG Sensors

Article 14 October 2017

Real-Time Electromyographic Hand Gesture Signal Classification Using Machine Learning Algorithm Based on Bispectrum Feature

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

There is a growing problem of aging in many countries, including Australia, China, and the USA [1]. The shortage of nursing resources for the elderly makes it difficult to meet the needs of disabled people. Fortunately, assistive technology (AT) is on the verge of solving this issue. Whenever an unfortunate event impairs a person’s ability to walk, AT becomes essential. A large contingent of people is physically disabled as a result of health problems or accidents. A smart assistive wheelchair can make a world of difference for people who have neck paralysis, quadriplegia, congenital gait disorders, or finger dependencies [2].

A wheelchair is a great AT for people with special needs. However, some wheelchair users with finger disabilities face dilemmas when operating conventional joystick control wheelchairs. Quadriplegics who lack control over their legs and arms are examples of this. As a result, they have trouble eating and going to the restroom every day. Furthermore, their fingers make it difficult for them to control traditional joystick wheelchairs [3]. By using hand gestures, the indirect control method operates the wheelchair. An accelerometer, a gyroscope, and a camera are usually used for recognizing hand gestures [4].

Several researchers have described methods for detecting and categorizing hand gestures in the literature. In general, these gestures can be divided into stationary gestures and dynamic gestures [5]. Systems based on centroid points are developed to control mouse cursors and video players [6]. Leap motion can be used to control robotic devices, simulate flights, and detect hand tremors [7]. Finger signs can also be recognized using orientation features in addition to head gestures [8]. An accelerometer-based neural network can be developed for high-accuracy hand gesture classification [9]. Hand gestures can be recognized using machine learning techniques as an alternative and user-friendly option. Controlling mobile robots or wheelchairs can be done through mild forced commonly used hand gestures.

The utilization of surface electromyography (sEMG) has become a prevalent technique for the recognition of hand gestures. The sEMG approach measures biosignal currents produced by motor units during muscle contraction [10]. The summation of motor unit action potentials is detected over the skin using sEMG. Due to its noninvasive and low-cost characteristics, sEMG-based gesture recognition systems are widely employed in human–machine interfaces, robot control, speech detection, and rehabilitation studies [11]. Amputees can also benefit from sEMG, as it allows them to control electrical powered wheelchair (EPW). Despite the benefits of sEMG-controlled prosthetics, a majority of amputees in the USA do not use this technology [12]. The low acceptance rate is typically attributed to the challenge of intuitive control over prostheses, which remains a significant hurdle for researchers to overcome [13].

This research study focuses on developing hand gesture recognition (HGR) methods to control EPW for individuals with disabilities, the elderly, and patients with multiple sclerosis. In literature, touch screens and graphical user interfaces have also been used to control wheelchairs. However, individuals with finger-related problems may still face difficulties while controlling the wheelchair effectively. To address this issue, this study employed an existing control system for an EPW that was originally controlled with a joystick.

This research study presents two user-dependent classification methods including linear regression (LR) and regularized linear regression (RLR) and two user-independent classification methods including multi-class support vector machine (MC-SVM) and decision tree (DT). Every time a new user is introduced, user-dependent classification methods need to be trained [14, 15]. To overcome this limitation, this study also investigates user-independent classification methods. Moreover, this study compares the performance of user-independent and user-dependent-based classification models.

This research study focuses on the evaluation of different algorithms using real-time sEMG data. In Section 2, the methodology is detailed, covering participant recruitment, hand gesture selection, training, and evaluation. Section 3 provides a description of the analysis conducted using the MC-SVM, DT, LR, and RLR methods, the graphical abstract is shown in Fig. 1.

2 Methodology

This study conducted a research that entailed acquiring sEMG signals through the Myo armband and leveraged real-time sEMG data to assess both user-dependent and user-independent methods. Following this, this study applied preprocessing techniques to the sEMG signals and extracted the root mean square (RMS) feature. Subsequently, a model was trained using the RMS features, facilitating the accurate classification of sEMG signals.

2.1 General structure and participant recruitment

The research study, approved by the Swinburne University of Technology Human Ethics Committee, employed the reliable and portable Myo armband for sEMG data acquisition, which has been extensively utilized for capturing sEMG signals. The general structure of this study is shown in Fig. 2. The primary data source consisted of sEMG signals collected through the Myo armband sensor. To ensure accurate analysis, a preprocessing step was applied, involving a data processing class implemented in MATLAB. The class effectively removed the DC offset and normalized the sEMG data using z-score normalization, standardizing the data for further analysis. Additionally, the class extracted the RMS feature from the normalized sEMG signals, providing valuable insights into the magnitude and intensity of muscle activities. This crucial information enabled the distinction between different muscle movements. The research study primarily focused on recognizing five hand gestures: fist, spread fingers, wave-in, wave-out, and rest gesture. The main objective was to develop a precise system capable of classifying and distinguishing these hand gestures based on sEMG signals. The study involved six healthy and able-bodied participants, all aged 18 years and above, with no history of neurological disorders or injuries to the shoulder, elbow, or wrist. Table 1 offers a concise summary of the participants’ demographic information, including age and height.

2.2 Sensor placement protocol

The participants in the study were equipped with a Myo armband on their dominant limb, which included an inertial measurement unit (IMU) and eight dry electrodes, as shown in Fig. 3a. The Myo armband was carefully positioned approximately one inch distal to the elbow joint of the participant’s dominant limb, following the manufacturer’s guidelines. To ensure consistency and minimize variability between participants, the fourth sensor was specifically placed above the extensor carpi ulnaris muscle, as depicted in Fig. 3b. This deliberate positioning allowed for the precise placement of the IMU sensor on the dorsal side of the forearm in Sensor 4, in accordance with established recommendations [15].

2.3 Gestures

During the experimental protocol, participants were instructed to execute five distinct hand gestures while wearing the Myo armband, as depicted in Fig. 4. Each gesture was performed in two iterations with moderate force and maintained for a duration of 5 s, followed by a two-second rest period. The sequence of gestures carried out by the participants followed the order of right, rest, left, rest, forward, rest, and reverse movements. To enhance the effectiveness of user-independent classification, each gesture was extended to a 5-s duration during the data collection process. This extension allowed for the collection of a larger number of data points for all gestures, except for the rest gesture, thereby optimizing the training time. As a result, each gesture was recorded for approximately 5 s, resulting in approximately 258 samples per gesture. However, for the rest gesture, data was collected only for a duration of 2 s, which consequently resulted in nearly half the number of data samples compared to other gestures.

Table 1 Participants data

Full size table

2.4 Data collection

The surface electromyography (sEMG) data were collected at a sampling frequency of 200 Hz. The recorded data was then transmitted to a computer via Bluetooth 4.0, where it was processed using a data acquisition graphical user interface (GUI) developed in MATLAB R2022b. The GUI was designed using the app designer toolbox and integrated with the Myo SDK MATLAB Mex wrapper toolbox [16]. In the preprocessing step for sEMG data, the DC offset was removed. The DC offset represents the average value of the signal, which can vary due to factors like myo armband placement and skin impedance. Removing the DC offset is essential to ensure that the EMG signal is centered around zero and to eliminate any potential drift in the signal over time. This step helps in enhancing the accuracy of the subsequent analysis and classification of the sEMG data. Figure 5 demonstrates the effect of DC offset removal on the sEMG signal.

2.5 Feature extraction

sEMG signals are non-stationary, and therefore require conversion into a low-dimension feature set, as illustrated in Fig. 6. These features are derived from subsets of the raw signals referred to as windows. The processing time is influenced by the window’s length, and a longer window may result in a longer delay between signal generation and prosthesis actuation. In practice, a delay between 100 and 259 ms is often chosen to maintain the total delay under 300 ms. An RMS feature set has been extracted from the raw sEMG signals with over-lapping signal windows of 200 ms, which is updated every 40 ms.

It will be important to note that as the window moves forward in time, it may overlap or disjoint according to the gap between consecutive windows. Overlapping windows are those with intervals shorter than the window’s length, while disjoint windows are those with intervals equal to the window’s length.

A representation of the overlapping window can be seen in Fig. 7. A disjoint window causes a longer delay, so the overlapping window is often preferred. To extract a feature set, various features are available for each window, including the time domain, frequency domain, and time-scale domain. Power spectrum parameters are commonly used to study muscle exhaustion, but they may not offer enough information about signals with non-stationary or transitory properties in the time domain. In contrast, wavelet analysis provides a time-domain feature that incorporates both frequency and time information, even though it has low computational efficiency. Despite the lack of frequency information, time-domain features are most widely used in myoelectric control due to their quick processing time and high computational efficiency. The RMS of an sEMG signal is an example of a time-domain feature. A comprehensive review can be found at [17]; the RMS is explained below:

2.5.1 Root mean square

The square root of the mean value of a squared function is the RMS value. RMS is one of the technologies commonly used to evaluate sEMG. The formula is as follows:

$$\begin{aligned} x_{RMS} = \sqrt{\frac{(x_1^2+x_2^2+...x_n^2)}{n}} \end{aligned}$$

(1)

where $x_1 + x_2+...x_n$ are the data points and n is the samples of the sEMG data.

2.6 Study design

In this research study, participants were seated in front of a computer that was attached to the powered wheelchair while wearing the Myo Armband on their dominant limb. To indicate the origin, participants were asked to adopt a natural posture. Participants were instructed to control the horizontal and vertical velocity of the cursor using the fist and spread fingers gesture and the wave in and wave out gesture, respectively, as depicted in Fig. 8a. The trajectory layout is presented in Fig. 8b, and all participants were required to navigate the wheelchair from point 1 to point 4 according to the trajectory layout.

2.7 Training session

After wearing the Myo armband, participants were instructed to perform a series of five specific gestures. These gestures, commonly used in daily tasks, were categorized using four machine learning methods: two user-dependent algorithms based on LR and RLR and two user-independent algorithms based on DT and MC-SVM. Each trial involved participants executing the five gestures with moderate force. Two repetitions of each gesture were performed with mild force, with a two-second pause between each repetition. To capture as much data as possible, the gestures were held for 5 s during data segmentation.

The training consisted of participants following the target in two DOFs cartesian coordinates using hand gestures in four directions displayed one at a time on the screen as shown in Fig. 9. The end-point is reached in 5 s, stayed at for 2 s, and then returned to the origin in 5 s. In this research study, the vertical position of the target was controlled by wrist flexion and extension, while the horizontal position was controlled by the fist and spread fingers gestures. Participants were required to maintain the gestures until the cursor returned to its origin after the researcher had described the correlation between each gesture and the degree of freedom. In contrast to traditional approaches, this research study used a training method where the time and memory allocation for each target were pre-determined. Based on the real-time learning performance of each participant, this research study determined the amount of time and memory allocated to each target Table 2.

Table 2 Table of Performance Metrics during training

Full size table

2.8 Evaluation

The evaluation process in this study was divided into two sections: a training section and a testing section. The training section was devoted to the evaluation of training methods, while the testing section focused on measuring the performance of machine learning models in terms of accuracy, precision, recall, and F1 score. During sEMG data collection, five targets were presented individually, and participants’ performance was evaluated using performance metrics as presented in Table 3. If a target is incomplete, it is counted as a missing target. Completing a target requires the cursor to be moved to the target and held there until the cursor returns to the original target within 5 s. If the time exceeds 5 s, the target is identified as missed. The completion ratio is calculated by dividing the number of reached targets by the total number of targets. After completing a target, the cursor must return to the origin before proceeding to the next target. Each target’s completion time and the time taken to complete it are recorded. All six participants successfully hit the 5 targets within 112 s, which was the designated completion time according to the training protocol. As a result, the completion ratio for all participants was $100\%$, indicating that each participant achieved the goal of hitting all targets within the specified time frame during training. This study evaluated each participant once.

2.9 Performance metrics for user-independent classification methods

An accuracy performance metric is defined as the ratio of correctly predicted observations to total observations. In machine learning (ML), accuracy is a metric that measures the overall correctness of a model’s predictions. It is defined as the ratio of the number of correct predictions to the total number of predictions made by the model. Accuracy is calculated using the following formula in Eq. (2). ML classification models are commonly assessed by accuracy metrics, where TP is true positive, TN is true negative, FP is false positive, and FN is false negative [18].

$$\begin{aligned} Accuracy = \frac{TP+TN}{TP+TN+FP+FN} \end{aligned}$$

(2)

2.9.1 Precision

In ML, precision is a metric that measures the accuracy of positive predictions made by a model. It is defined as the ratio of true positive predictions to the total number of positive predictions made by the model. Precision is calculated using the following formula in Eq. (3):

$$\begin{aligned} Precision =\frac{TP}{TP+FP} \end{aligned}$$

(3)

2.9.2 Recall sensitivity

In ML, recall is a metric that measures the ability of a model to correctly identify positive instances from a dataset. It is defined as the ratio of true positive predictions to the total number of positive instances in the dataset. A recall is calculated using the following formula in Eq. (4):

$$\begin{aligned} Recall =\frac{TP}{TP+FN} \end{aligned}$$

(4)

2.9.3 F1 score

F1 score is a metric that combines precision and recall into a single score. It is a measure of a model’s accuracy that takes into account both the number of true positive predictions and the number of false positive and false negative predictions. The F1 score is calculated as the harmonic mean of precision and recall as shown in Eq. (5). The F1 score ranges from 0 to 1, with a higher score indicating better performance. A perfect F1 score of 1 indicates that the model has achieved both high precision and high recall. The F1 score is commonly used as a performance metric for classification tasks, especially in cases where the dataset is imbalanced or the cost of false positives and false negatives is different. By taking into account both precision and recall, the F1 score provides a more balanced evaluation of an ML model’s performance than accuracy alone.

$$\begin{aligned} F1 score = \frac{2TP}{2TP+FP+FN} \end{aligned}$$

(5)

Table 3 Comparison between proposed and state-of-the-art-work in literature

Full size table

2.10 Performance metrics for user-dependent classification methods

The following metrics are widely used to assess the performance of a regression model f. The sum of squares is shown below:

$$\begin{aligned} SS_{tot} = \sum _{i-1}^{m} (y_i - \bar{y})^2 \end{aligned}$$

(6)

The second is the residual sum of squares:

$$\begin{aligned} SS_{res} = \sum _{i-1}^{m}( (y_i)- f(x_i))^2 \end{aligned}$$

(7)

The hyperparameter optimizations were performed based on 10-fold cross-validation. Randomly selected data samples were used for evenly distributed training samples in each fold. $R^2$ was used as a performance metric, which is a common regression metric. It is defined as follows:

$$\begin{aligned} R^2 = 1- \frac{SS_{res}}{SS_{tot}} \end{aligned}$$

(8)

The second term in Eq. (8) is the variance of the labelled data divided by the mean square error. A value of $R^2=1$ is at its maximum, and the closer to it, the higher the performance. If the mean square error is greater than the variance of labels, $R^2$ could be negative, which represents poor model performance. The hyperparameter value that resulted in the highest cross-validation $R^2$ was selected during a hyperparameter optimization.

3 Classification methods

To classify the data sets obtained during training, two user-independent and two user-dependent classification methods were implemented. More specifically, Sections 3.1 and 3.2 discussed the multi-class support vector machine (SVM) and the decision tree (DT) classification methods. A linear regression (LR) method and a regularized linear regression (RLR) method were also used as the third and fourth classification methods. All these methods have been used to classify five hand gestures. This study employed user-dependent classification models, LR and RLR, which are specifically tailored to individual users. These models do not depend on data from other users, making them effective for recognizing gestures for a single participant at a time. However, when introducing a new user, these models require training with data from that particular user to ensure accurate gesture recognition. In order to overcome the user-dependent nature, this study further investigated two user-independent ML models including MU-SVM and DT. Following is an explanation of how these classification methods are implemented.

3.1 User-independent multi-class support vector machine method

Support vector machines (SVM) are widely used for classification tasks in supervised machine learning. By finding the hyperplane that maximizes the margin between the two classes, it divides the data into two classes. However, the SVM algorithm needs to be modified for multi-class classification problems. The one-vs-all (OvA) method, also called one-vs-rest, is one approach to solving multi-class classification problems using SVM.

Multiple binary classifiers are trained using the OvA method, each responsible for differentiating one class from the rest. Based on the predicted score, the highest class is selected for the final prediction. One-vs-one (OvO) is another approach to solving multi-class classification problems using SVM. Using this method, multiple binary classifiers are trained, each of which separates two classes. The final prediction is based on a voting scheme, where each binary classifier votes for one class. OvA approach for hand gesture recognition is the main focus of this study.

Both OvA and OvO have advantages and disadvantages. In comparison to OvO, OvA is computationally more efficient and requires fewer classifiers to be trained, but it can be less accurate. OvO, on the other hand, trains more classifiers, making it computationally more expensive, but it can result in better accuracy since it takes all classes into account. Multi-class SVM is a learning method in which linear functions are mapped into high-dimensional feature spaces instead of modelling probabilities through training data. A support vector kernel is used to map the input data to a high-dimensional feature space, allowing the problem to be processed linearly. In the optimization process, support vectors are samples whose multipliers are not zero.

In SVMs, the global minimum is always found since the objective is to decrease the bound on the structural risk rather than the empirical risk. For MC-SVM-based gesture recognition, the SVM should be extended to a k-class problem as it is a binary classifier. This study adopted the one-vs-all strategy, which is a pairwise approach and needs to train $k(k-\frac{1}{2})$ SVM classifiers. Matlab classification learner includes many classification algorithms. This research tested each of these classifiers during non-real-time classification. In this study, SVM algorithms were emphasized since they achieved the highest accuracy as compared to decision trees, linear regression, and regularized linear regression algorithms.

A support vectors (SVs) kernel is used to map the data from the input space to a high-dimensional feature space, which facilitates the linear processing of the problem. At the end of optimization, SVs have multipliers that are not zero. In order to achieve this goal, SVMs minimize the following Lagrange formulation, Lagrange is a function of the model parameters W weight vector and b bias term as well as Lagrange multipliers $\alpha $ as shown as in (9), where the first term is a regularization term that encourages the weights to be small, second is the margin term that measures the quality of the classification margin, and third term $\alpha _i$ is a constraint term enforces the constraint that the Lagrange multiplier is non-negative.

$$\begin{aligned} L_{p}\equiv {1\over 2}\Vert w\Vert ^{2}-\sum \limits _{i=1}^{l}\alpha _{i}y_{i}(x_{i}w+b)+\sum \limits _{i=1}^{l}\alpha _{i} \end{aligned}$$

(9)

$$\begin{aligned} f(x)=\sum \limits _{i=1}^{N}\alpha _{i}y_{i}k(x, x_{i})+b \end{aligned}$$

(10)

The decision function can be seen in (10), where k represents the kernel function, and this study has used the linear kernel function in this gesture recognition problem, $x_i$ represents the training samples, $y_i$ represents their class labels, and b represents the model parameters. Using SVM information for binary classifiers, one solution for k-class pattern classification is to extend it to k-class pattern classification. Multiple pattern recognition problems can be classified and identified using SVMs with supervised learning classifiers. SVMs with multiple classes can be used to identify gestures based on their trajectory. The high-dimensional data is separated by SVM so that errors are reduced since it is linearly incomparable data. A one-vs-all approach is used for gesture recognition in this paper. This method uses k SVMs, each of which separates one class from all other classes in the training set [19].

3.2 User independent decision tree method

A decision tree consists of a root node, internal nodes, and leaf nodes, where leaf nodes represent classes, and non-leaf nodes indicate attributes of classes. In the root node, sample data, including values for different attributes, are placed. As a result of the rules in non-leaf nodes, the decision tree splits values into multiple branches corresponding to different attributes. Finally, leaf nodes assign which class input data belong to. It is easy to understand and interpret decision trees. Multi-feature pattern classification is well suited to their ability to fusion diverse information. Additionally, they reduce the search range between classes for classification by using their sequential structure of branches. A decision tree can perform well on large data in a short amount of time, which is a significant advantage for implementing real-time classification systems [20].

The decision tree can be used to predict responses to data, also called a classification tree or regression tree. Trees begin at the root and are divided into leaf nodes as they progress down the tree. During response prediction, the decision starts at the beginning node and moves down to the leaf node. The leaf node then stores the response. Classification trees provide output in the form of true or false, whereas regression trees give numerical results. As compared to other classifiers, the training and testing accuracy obtained in this case is lower than multi-class SVM. DT has the highest accuracy of $97.77\%$ among 6 different participants.

3.3 User dependent regression methods

Supervised learning, a branch of machine learning, involves the use of algorithms such as linear regression to model the relationship between two variables. Linear regression specifically utilizes a linear equation to fit observed data, with one variable acting as the independent variable and the other as the dependent variable. The aim of linear regression is to predict the dependent variable, denoted as y, from the independent variable, denoted as x. To accomplish this, the algorithm seeks to establish a linear relationship between input x and output y, and ultimately utilize the best-fitting line to predict the continuous variable outcome. The best-fitting line represents the relationship between the independent variable x and the dependent variable, and the algorithm endeavors to minimize the sum of the squared differences between the data points and the regression line to obtain the optimal fit.

$$\begin{aligned} y_i = \beta _0 + \beta _1 x_{i_1} + ...+\beta _p x{i_p} +\epsilon _i y_i = x_i^T\beta + \epsilon _i \end{aligned}$$

(11)

Noting that T denotes the transpose operator, the hypothesis function for linear regression is given by $x_i\beta $, where $x_i$ and $\beta $ represent vectors, and $x_i\beta $ represents the inner product between the two vectors.

$$\begin{aligned} y=\theta _1 + \theta _2x \end{aligned}$$

(12)

In the context of linear regression, where x denotes the input training data and y represents the corresponding labels, the cost function J is defined as the root mean squared error (RMSE) between the predicted y value and the true y value.

$$\begin{aligned} J= \frac{1}{n} \sum _{i=1}^n (pred_i - y_i)^2 \end{aligned}$$

(13)

In order to achieve the best-fit line and minimize the cost function (i.e., minimize RMSE) in linear regression, the model employs gradient descent to update the values of $\theta _1$ and $\theta _2$ iteratively. The initial values of $\theta _1$ and $\theta _2$ are set randomly and are then updated iteratively until the minimum cost function is reached. Specifically, the feature set is represented as ${\textbf {X}} \in \mathbb {R}^{P\times n}$, where P is the dimension of the feature and n is the number of samples, and the labels are represented as ${\textbf {Y}} \in \mathbb {R}^{n \times Q}$, where Q is the dimension of the output target. The ultimate objective is to estimate $\hat{y}_{n+1}$, which represents the prediction for a new observation $x_{n+1}$.

3.3.1 Linear regression

The linear regression assumes a linear relationship between X and Y as follows:

$$\begin{aligned} \hat{Y} = X^TW \end{aligned}$$

(14)

where $W \epsilon R^{P \times Q}$ is a weight matrix. The cost function of least squares in $q^{th}$ DOF out of Q DOFs is

$$\begin{aligned} J_q = \sum _{t=1}^n |e_q(t) |^2 \end{aligned}$$

(15)

where e is the error term defined as $e=Y-X^TW$. The weight matrix that minimizes (14) is shown below:

$$\begin{aligned} W= (XX^T)^{-1} XY \end{aligned}$$

(16)

3.3.2 Regularized linear regression

Regularized linear regression has the same linear model as (15) with an additional term in the cost function. The cost function is given as follows:

$$\begin{aligned} J_q = \sum _{t=1}^n |e_q(t) |^2 + \lambda W_q^T W_q \end{aligned}$$

(17)

where the additional term is the $l_2$ regularization term with the positive constant $\lambda $. $l_2$ normalization is a computational way to avoid overfitting and instigate its general ability. The weight matrix that minimizes (16) is

$$\begin{aligned} W= (XX^T+\lambda I)^{-1} XY \end{aligned}$$

(18)

where I is an identity matrix. The regularization constant $\lambda $ is selected within a logarithmically spaced vector $[10^{-3},....,10^{3}]$ by a grid search based on a k-fold cross-validation accuracy.

4 Results

The evaluation of diverse machine learning techniques for the classification of sEMG data gathered from the library of five gestures through the utilization of the Myo armband sensor involved assessing their accuracy. Additionally, precision, recall, and F1 score were computed to gain deeper insights into the models’ performance. Precision gauges the model’s capacity to exclusively yield relevant data or accurately identify gestures associated with a specific class. On the contrary, recall quantifies the ratio of true positive classifications (accurately classified samples) to false negatives (samples mistakenly classified as a different class). The F1 score represents a measure of the harmonic mean between precision and recall, offering a balanced perspective on the model’s precision and recall performance [18].

The MC-SVM, DT, LR, and RLR achieved accuracy rates of $99.05\%$, $97.77\%$, $95.88\%$, and $94.85\%$ respectively. Assessment was based on criteria including accuracy and the confusion matrix. Figure 10 provides insights into the accuracy, precision, recall, and F1 score across all methods. Notably, when utilizing the MC-SVM, the overall precision reached $99.15\%$, with recall at $98.47\%$, and an F1 score of $98.81\%$ significantly outperforming other algorithms scrutinized in this research study. A graphical representation of the mean squared error and coefficient of determination for user-dependent methods in both training and testing can be found in Fig. 11.

Multi-class SVM achieved the highest accuracy of $99.05\%$ which is also higher when compared with other existing studies. In this work, a comparison has been made between MC-SVM, DT, LR, and RLR, all of these methods applied to control an electric-powered wheelchair. The confusion matrix for each of the algorithms analyzed in this study can be found in Fig. 12, and real-time implementation of all the algorithms on a wheelchair is shown in Fig. 13. Upon the introduction of a new user for wheelchair control, the LR and RLR user-dependent models require training. On the other hand, the user-independent models, DT and MC-SVM, eliminate the need for training with new users. A new user simply needs to calibrate the Myo armband and navigate the powered wheelchair.

Table 3 provides a comparison of various classification models used for gesture recognition, including convolutional neural network (CNN) [22], feed-forward ANN classifier [23], dynamic time warping and affinity propagation [24], and adaptive least square SVM [25] methods, and the CNN model achieved a recognition accuracy of $90\%$ for 6 gestures. The feed-forward ANN classifier achieved a slightly higher accuracy of $92.45\%$ for 5 gestures. Dynamic time warping and affinity propagation achieved a recognition accuracy of $94.60\%$ for a larger set of 18 gestures. The adaptive least square SVM achieved an accuracy of $92.90\%$ for 7 gestures. This research study proposed four methods for hand gesture recognition. The proposed LR model achieved an accuracy of $94.85\%$ for 5 gestures. The proposed RLR improved the accuracy further to $95.88\%$ for the same set of gestures. The proposed DT model achieved an accuracy of $97.77\%$. Nevertheless, the MC-SVM model attained the peak accuracy of $99.05\%$ for five hand gestures. Comparing the outcomes of this research study with those from the studies outlined in Table 3, the proposed MC-SVM model showcases superior performance over other models, including CNN, ANN, DTW, and least square SVM methods, in terms of recognition accuracy. This underscores the effectiveness and exceptional capabilities of the presented model for gesture recognition tasks, even with a smaller set of gestures. In summary, this study demonstrates the successful application of the proposed MC-SVM model, achieving an impressive recognition accuracy of 99.05% for gesture classification. This accomplishment surpasses the performance of existing models and underscores its potential for practical deployment in real-world gesture recognition systems.

To conclude, the machine learning models employed in this study successfully classified five distinct gestures: fist, wave-in (left), wave-out (right), spread-fingers, and hand at rest. Notably, the MC-SVM model exhibited enhanced accuracy in comparison to other models investigated in this study. The study conducted real-time experimental validation using an EPW, yielding promising outcomes. By harnessing sEMG data and the RMS feature, a confusion matrix was derived for the complete dataset, encompassing approximately 258 samples for each gesture, except for the rest gesture, which had half the sample count. This discrepancy originates from the experimental protocol, wherein the rest gesture was recorded for 2 s, whereas other gestures were captured for 5 s, contributing to the observed difference in sample quantities.

5 Discussion

5.1 Linear regression and regularized linear regression methods

Using linear regression (LR) and regularized linear regression (RLR), the results showed that it is possible to achieve recognition accuracy of up to $95.88\%$ for the six subjects when using sEMG data to classify five hand gestures. This study trained and tested two different ML models. The RLR model achieved a recognition accuracy of $95.88\%$, higher than the LR model’s $94.85\%$. As a result, RLR produced more accurate results when dealing with the complexity of the hand gesture recognition task.

Overfitting could be one reason for the worse performance of LR. Essentially, overfitting occurs when a model fits the training data too well, resulting in poor generalization to new data once the model has been trained. By adding a regularized term $\lambda $ to the cost function, RLR reduces the complexity of the model. By doing so, the model can avoid overfitting and produce more accurate results. Regularized regression could also improve performance because it effectively addresses multi-collinearity. Multi-collinearity occurs when two or more predictors are highly correlated in regression analysis. As a result, regression coefficient estimates can be unstable and inconsistent. This issue can be mitigated by regularizing regression to reduce the magnitude of coefficients for highly correlated predictors. This study concluded that RLR is a promising approach for hand gesture recognition with high accuracy and low mean squared error, indicating that this approach can handle the complexity of the task and produce more accurate results. The potential for RLR in other applications in this field needs to be explored further. Whenever a new user is introduced, it becomes essential to train these models. In the upcoming section, this study will explore two algorithms that are independent of user-specific training.

5.2 Decision tree method

A decision tree (DT) classifier, a widely used ML algorithm for multi-class classification issues, was implemented to identify five hand gestures. Study objectives included evaluating the performance of the decision tree for recognizing hand gestures for controlling the powered wheelchair’s movement. Model accuracy, precision, recall, and F1 score were evaluated using real-time sEMG data, which was used to train and test the model. According to the results, the decision tree achieved an accuracy of 97.77% and a precision of 97.11%. The results show that the model recognizes hand gestures for controlling the powered wheelchair very effectively. Additionally, six able-bodied participants were used in a real-time application test using an EPW. Based on a set of hand gestures performed by the six participants, the DT was able to recognize the gestures accurately in real time. As a result, the model is effective in practical applications, such as controlling the movement of PW. In real-world applications, a DT model is an attractive option due to its simplicity and computational efficiency. It is possible to use this model in practical applications for hand gesture recognition in powered mobility devices because of its high accuracy and precision.

Based on DT results, it appears that the decision tree classifier captured the relationships between features and classes effectively, resulting in accurate predictions. It is important to note that decision trees are prone to overfitting. In other words, they may fit the training data too closely, resulting in poor generalization performance. There are several ways to address this issue, including pruning, assembling, or using a random forest classifier, which is an extension of a decision tree. A decision tree classifier can be a useful tool for solving multi-class classification problems, and it was able to effectively capture data relationships with an accuracy rate of 97.77%. The risk of overfitting must, however, be considered and techniques employed to prevent it. The performance of sEMG data-based gesture recognition was improved after classification, but the significant drawback was the low recognition accuracy observed after the classification of the data. To improve the accuracy, the next section will study MC-SVM method.

5.3 Multi-class support vector machine method

The outcome of utilizing the Myo armband muscle sensor in the MC-SVM classification process shows not only an improvement in performance but also the capacity of the model to identify each gesture individually. Additionally, the MC-SVM technique is user-independent, meaning it doesn’t require any adjustment or training when a new user is introduced. The objectives included evaluating the performance of the MC-SVM model in recognizing hand gestures for controlling EPW. Training and testing were conducted on real-time sEMG data of hand gestures, and accuracy, precision, recall, and F1 score were used to assess performance. Results showed that the multiclass SVM achieved a precision of $99.15\%$ and an accuracy of $99.05\%$. According to these results, the model was highly effective at recognizing hand gestures.

In order to further validate the model, six able-bodied participants and a powered wheelchair were used in real-time application tests. Participants performed a set of hand gestures, and the multi-class SVM accurately recognized the gestures in real time. Consequently, the model can be utilized in practical applications, such as controlling the movement of an electric-powered wheelchair. This study has demonstrated that the MC-SVM is effective for recognizing hand gestures in an electric-powered wheelchair. The high accuracy and precision achieved during training and testing, as well as the successful real-time application, support the potential of this model for practical applications for controlling powered mobility devices.

6 Conclusions and future work

As a result of this research, the following contributions were made. With an overall average accuracy of $98.61\%$ for 6 different able-bodied users, the proposed MC-SVM machine learning model demonstrated a remarkable response time of less than 1 s for hand gesture commands, processing, and execution. Additionally, this study achieved the best recognition accuracy of approximately $99.05\%$. The user-independent MC-SVM-based ML classification model enables new users to control the EPW without the need for re-training. It has been experimentally demonstrated that an electric-powered wheelchair can be controlled by users to complete all possible movements efficiently. This study aimed to enhance the accuracy of hand gesture recognition using sEMG data by developing four classification models. Two models were user-independent, and two were user-dependent. The study compares the different models and aims to determine which of them is most suitable for use in real-time applications by comparing the different models. The goal was to enhance the interaction between humans and computers through improved hand gesture recognition. The study utilized LR, RLR, DT, and MC-SVM approaches. The most accurate results were achieved using a MC-SVM with an accuracy of 99.05%. The DT method also had favorable results with an accuracy of 97.77%. The proposed system was found to be comfortable to use based on six participants. Future work will include comparisons with advanced deep learning-based gesture recognition techniques and an analysis of subject variability among more users, and recruit non-able-bodied participants to validate the experiment. The study also demonstrated the viability of the proposed system for human–machine interaction through its successful application to control an electric-powered wheelchair.

References

World Health Organization and International Spinal Cord Society (2013) International perspectives on spinal cord injury, World Health Organization
Rusydi MI, Hadi K, Setiawan AW, Reni I, Nugroho H, Windasari N (2022) Electric wheelchair control using wrist rotation based on analysis of muscle fatigue. IEEE Access, Voulme, p 10
Google Scholar
Machangpa Jigmee Wangchuk, Chingtham Tejbanta Singh (2018) Head gesture controlled wheelchair for quadriplegic patients, Procedia Computer Science, vol 132. Elsevier
Google Scholar
Cardoso Tiago, Delgado João, Barata José (2015) Hand gesture recognition towards enhancing accessibility, Procedia Computer Science, vol 67. Elsevier
Google Scholar
Ameur Safa, Khalifa Anouar Ben, Bouhlel Med Salim (2020) A novel hybrid bidirectional unidirectional LSTM network for dynamic hand gesture recognition with leap motion, Entertainment Computing Journal, vol 35. Elsevier
Google Scholar
Joy, Eldhose and Chandran, Sruthy and George, Chikku and Sabu, Abhijith A and Madhu, Divya (2018), Gesture controlled video player a non-tangible approach to develop a video player based on human hand gestures using convolutional neural networks, Second International Conference on Intelligent Computing and Control Systems (ICICCS), IEEE, 2018
Mendes Nuno, Ferrer João, Vitorino João, Safeea Mohammad, Neto Pedro (2017) Human behaviour and hand gesture classification for smart human-robot interaction, Procedia Manufacturing Journal, vol 11. Elsevier
Google Scholar
Kincaid, Clay J and Vaterlaus, Austin C and Stanford, Nathan R and Charles, Steven K (2019) Frequency response of the leap motion controller and its suitability for measuring tremor, Medical engineering & Physics, volume 63, Elsevier
Xie, Renqiang and Cao, Juncheng (2016) Accelerometer-based hand gesture recognition by neural network and similarity matching, IEEE Sensors Journal, Volume 16, IEEE
De Luca, Carlos J (1995) Decomposition of the EMG signal into constituent motor unit action potentials, Muscle & Nerve Journal, Volume 18
Castiblanco, Jenny C and Ortmann, Steffen and Mondragon, Ivan F and Alvarado-Rojas, Catalina and Jöbges, Michael and Colorado, Julian D (2020) Myoelectric pattern recognition of hand motions for stroke rehabilitation, Biomedical Signal Processing and Control Journal, Volume 57, Elsevier
Farina, Dario and Merletti, Roberto and Enoka, Roger M (2004) The extraction of neural strategies from the surface EMG, Journal of Applied Physiology, American Physiological Society
Jiang Ning, Dosen Strahinja, Muller Klaus-Robert, Farina Dario (2012) Myoelectric control of artificial limbs-is there a need to change focus? IEEE Signal Processing Magazine Journal, IEEE, Volume, p 29
Google Scholar
Iqbal, Hassam and Zheng, Jinchuan and Chai, Rifai and Chandrasekaran, Sivachandran (2022) Regression-based real-time hand gesture recognition and control for electric powered wheelchair, Australasian Conference on Robotics and Automation (ACRA 2022)
Hassan, Uzair and Mughal, Hassam and Mohsin, Inamullah and Khan, Zeashan Hameed (2018) Real-time control of a mobile robot using electrooculogram based eye tracking system, 2018 5th International Multi-Topic ICT Conference (IMTIC)
Höglund Gustav, Grip Helena, Öhberg Fredrik (2021) The importance of inertial measurement unit placement in assessing upper limb motion, Medical Engineering & Physics Journal, vol 92. Elsevier
Tomaszewski Mark (2016) Myo SDK MATLAB MEX wrapper, https://github.com/mark-toma/MyoMex
Iqbal, Nisheena V and Subramaniam, Kamalraj (2018) A review on upper-limb myoelectric prosthetic control, IETE Journal of Research, Taylor & Francis
Goutte, Cyril and Gaussier, Eric (2005) A probabilistic interpretation of precision, recall and F-score, with implication for evaluation, Advances in Information Retrieval: 27th European Conference on IR Research, ECIR 2005, Santiago de Compostela, Spain, March 21-23, 2005. Proceedings 27, Springer
Wang Zhe, Xue Xiangyang (2014) Multi-class support vector machine. Springer, Support Vector Machines Applications Journal
Fang, Gaolin and Gao, Wen and Zhao, Debin (2004) Large vocabulary sign language recognition based on fuzzy decision trees, IEEE Transactions on Systems, Man, and Cybernetics-Part A: Systems and Humans, Volume 34, IEEE
Kim Seo Yul, Han Hong Gul, Kim Jin Woo, Lee Sanghoon, Kim Tae Wook (2017) A hand gesture recognition sensor using reflected impulses. IEEE Sensors Journal, IEEE
Book Google Scholar
Benalcázar, Marco E and González, José and Jaramillo-Yánez, Andrés and Anchundia, Carlos E and Zambrano, Patricio and Segura, Marco (2020) A model for real-time hand gesture recognition using electromyography (EMG), covariances and feed-forward artificial neural networks, 2020 IEEE ANDESCON, IEEE
Akl Ahmad, Feng Chen, Valaee Shahrokh (2011) A novel accelerometer-based gesture recognition system. IEEE Transactions on Signal Processing, IEEE, Volume, p 59
Google Scholar
Colli Alfaro, Jose Guillermo and Trejos, Ana Luisa (2022) User-independent hand gesture recognition classification models using sensor fusion, Sensors, Volume 22, MDPI

Download references

Funding

Open Access funding enabled and organized by CAUL and its Member Institutions

Author information

Authors and Affiliations

Department of Engineering Technologies, Swinburne University of Technology, John Street, 3122, Melbourne, Victoria, Australia
Hassam Iqbal, Jinchuan Zheng & Rifai Chai
Department of Computing Technologies, Swinburne University of Technology, Melbourne, Australia
Sivachandran Chandrasekaran

Authors

Hassam Iqbal
View author publications
You can also search for this author in PubMed Google Scholar
Jinchuan Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Rifai Chai
View author publications
You can also search for this author in PubMed Google Scholar
Sivachandran Chandrasekaran
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hassam Iqbal.

Ethics declarations

Ethical approval

The Swinburne University of Technology Human Ethics Committee approved the study.

Conflict of interest

The authors declare no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Iqbal, H., Zheng, J., Chai, R. et al. Electric powered wheelchair control using user-independent classification methods based on surface electromyography signals. Med Biol Eng Comput 62, 167–182 (2024). https://doi.org/10.1007/s11517-023-02921-z

Download citation

Received: 13 March 2023
Accepted: 22 August 2023
Published: 26 September 2023
Issue Date: January 2024
DOI: https://doi.org/10.1007/s11517-023-02921-z

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Electric powered wheelchair control using user-independent classification methods based on surface electromyography signals

Abstract

Similar content being viewed by others

sEMG Classification of Upper Limb Movements Under Different Loads

Hand Gesture Recognition Based Omnidirectional Wheelchair Control Using IMU and EMG Sensors

Real-Time Electromyographic Hand Gesture Signal Classification Using Machine Learning Algorithm Based on Bispectrum Feature

1 Introduction

2 Methodology

2.1 General structure and participant recruitment

2.2 Sensor placement protocol

2.3 Gestures

2.4 Data collection

2.5 Feature extraction

2.5.1 Root mean square

2.6 Study design

2.7 Training session

2.8 Evaluation

2.9 Performance metrics for user-independent classification methods

2.9.1 Precision

2.9.2 Recall sensitivity

2.9.3 F1 score

2.10 Performance metrics for user-dependent classification methods

3 Classification methods

3.1 User-independent multi-class support vector machine method

3.2 User independent decision tree method

3.3 User dependent regression methods

3.3.1 Linear regression

3.3.2 Regularized linear regression

4 Results

5 Discussion

5.1 Linear regression and regularized linear regression methods

5.2 Decision tree method

5.3 Multi-class support vector machine method

6 Conclusions and future work

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Ethical approval

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation