Predicting sit-to-stand motions with a deep reinforcement learning based controller under idealized exoskeleton assistance

Ratnakumar, Neethan; Akbaş, Kübra; Jones, Rachel; You, Zihang; Zhou, Xianlian

doi:10.1007/s11044-024-10009-1

Predicting sit-to-stand motions with a deep reinforcement learning based controller under idealized exoskeleton assistance

Research
Open access
Published: 22 July 2024

(2024)
Cite this article

Download PDF

You have full access to this open access article

Multibody System Dynamics Aims and scope Submit manuscript

Predicting sit-to-stand motions with a deep reinforcement learning based controller under idealized exoskeleton assistance

Download PDF

Neethan Ratnakumar¹,
Kübra Akbaş¹,
Rachel Jones¹,
Zihang You¹ &
…
Xianlian Zhou¹

196 Accesses
Explore all metrics

Abstract

Maintaining the capacity for sit-to-stand transitions is paramount for preserving functional independence and overall mobility in older adults and individuals with musculoskeletal conditions. Lower limb exoskeletons have the potential to play a significant role in supporting this crucial ability. In this investigation, a deep reinforcement learning (DRL) based sit-to-stand (STS) controller is developed to study the biomechanics of STS under both exoskeleton assisted and unassisted scenarios. Three distinct conditions are explored: 1) Hip joint assistance (H-Exo), 2) Knee joint assistance (K-Exo), and 3) Hip-knee joint assistance (H+K-Exo). By utilizing a generic musculoskeletal model, the STS joint trajectories generated under these scenarios align with unassisted experimental observations. We observe substantial reductions in muscle activations during the STS cycle, with an average decrease of 68.63% and 73.23% in the primary hip extensor (gluteus maximus) and primary knee extensor (vasti) muscle activations, respectively, under H+K-Exo assistance compared to the unassisted STS scenario. However, the H-Exo and K-Exo scenarios reveal unexpected increases in muscle activations in the hamstring and gastrocnemius muscles, potentially indicating a compensatory mechanism for stability. In contrast, the combined H+K-Exo assistance demonstrates a noticeable reduction in the activation of these muscles. These findings underscore the potential of sit-to-stand assistance, particularly in the combined hip-knee exoskeleton scenario, and contribute valuable insights for the development of robust DRL-based controllers for assistive devices to improve functional outcomes.

Momentum-based trajectory planning for lower-limb exoskeletons supporting sit-to-stand transitions

Article 07 February 2018

Human sit-to-stand transfer modeling towards intuitive and biologically-inspired robot assistance

Article 25 February 2016

Three dimensional unassisted sit-to-stand prediction for virtual healthy young and elderly individuals

Article 18 September 2019

Find the latest articles, discoveries, and news in related topics.

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Sit-to-stand (STS) transitions are one of the most challenging movement activities required for independent daily living that is often compromised by aging and other neurological and physical conditions such as strokes, spinal cord injuries, osteoarthritis, and muscular dystrophy. A healthy adult on average performs 60 (±22) STS transitions each day [1], and this action serves as the fundamental initial step necessary for ambulation. Furthermore, STS transition tests are also often used as functional performance measures in clinical practice and to estimate one’s muscle strength in the lower extremities [2–4]. An STS transition cycle from a static seated position to a stable upright stance position in healthy individuals typically takes less than 3 seconds [5] and is divided into two to five phases for the purpose of analysis depending on the specific criteria or perspective of the study [6–9]. STS is initiated by leaning forwards with the torso, then the momentum is transferred from the upper body to the whole body, the hips and knees extend to reach a standing position, and finally STS is complete when standing stability is achieved. The gluteus maximus (hip extensors), the quadriceps (knee extensors), and to a considerable extent the hamstrings are some of the primary muscles contributing to the STS transition [10], while the muscles surrounding the ankle joint contribute towards balance and stability. We believe that reducing the strain on the major muscle groups by assisting the knee and hip joints may contribute to effortless STS transitions.

Musculoskeletal (MSK) modeling of the STS motion allows us to analyze the parameters that are otherwise inherently difficult to measure noninvasively, such as the individual muscle forces and joint reaction forces. Caruthers et al. created a 3D custom MSK model and used it to identify and study the individual muscle contributions in accelerating the whole-body center of mass during STS [11]. Likewise, Smith et al. used an MSK modeling approach to explain the STS difficulties experienced by older adults [12]. Furthermore, several studies have also utilized predictive musculoskeletal simulations to understand the high-level physiological controllers in the human body for STS motions. Norman-Gerum et al. developed a three-link planar model and used Bézier curves to prescribe STS trajectories [13]. Kumar et al. used an open loop single shooting optimization framework to generate STS trajectories under different lower limb muscle strength deficits [14]. The cost function was designed as a linear combination of ten different cost terms that encouraged a stable STS trajectory, including minimization of control effort, penalties for breaching joint limits, penalties for any joint movement at the end time of the simulation, penalties for excessive body accelerations, and others. The use of hip vs. knee reserve actuators during severe strength deficits was explored supplementarily to identify the muscle responsible for STS failure (vasti). Munoz et al. utilized vestibular and muscle length reflexes to simulate STS within the SCONE software environment, which also utilizes a shooting-based optimization approach. The cost function utilized in this study did not include any effort measure and was set as a combination of only different degree of freedom (DOF) measures for each STS phase to emphasize the role of vestibular input in STS [15]. Gordon et al. utilized an inverse MSK optimal control (a bi-level optimization technique used to identify cost functions) framework to learn personalized STS motion strategies during perturbed STS [16] and contributed valuable insights showing that humans modulate STS strategies under instabilities in a subject-specific way.

Numerous researchers have also studied STS motions with external devices such as robotic devices that apply forces to different parts of the body [17] or wearable exoskeletons that apply joint torques with the primary goal of helping individuals regain their independence when confronted with mobility constraints. Many of these devices focus on applying volitionally-triggered desired STS trajectories through position control or impedance control, which is adequate when assisting severely impaired individuals, while others have also utilized a model-based approach. However, providing optimum assistance with optimal timing for maximal biomechanical benefit, while also ensuring stability throughout the movement, remains a challenging task. Choi et al. explored the effect of peak assistance timing in reducing knee extensor muscle activations and concluded that a peak knee assistive torque applied between 25% and 40% of the STS cycle was most effective [18]. Utilizing functional electrical stimulation on the knee extensors, combined with synchronized knee assistive torque, as exemplified in Alouane et al. [19], represents a hybrid approach also employed by various researchers. A pilot experimental STS study performed while wearing an active hip exoskeleton revealed minimal differences in the joint kinematics, while the hip assistance provided was capable of reducing muscle activations in the gluteus maximus and hamstrings [20]. Similarly, Myosuit, a hybrid cable-driven exosuit with elastic bands between limbs developed by Schmidt et al., was able to reduce the muscle activation levels in the gluteus maximus muscles up to 60% [21]. A model-based control for STS transitions that considered the exoskeleton dynamics and its contacts with the environment was developed and implemented to apply assist-as-needed torques to the lower-limb joints by Vantilt et al. [22].

Recently deep reinforcement learning (DRL) has gained popularity in MSK model [23–26] and exoskeleton control [27, 28] due to advancements in computational capabilities. The redundant nature of the motor control problem makes DRL an ideal tool for devising robust controllers for exoskeletons. Utilizing such an approach, Jamali et al. used a Q-Learning method to find the optimal joint moments during STS movement for a simple torque-driven dynamic human representative linkage model [29]. Additionally, a DRL controller for a robot-assisted standing seat was also tested to optimize STS transitions, with user satisfaction serving as the reward metric [30]. In this study, we aim to obtain a biologically realistic representation of the unassisted human STS movement by employing a muscle-driven model and extending it to exoskeleton-assisted STS.

The primary objective of this paper is to propose a DRL framework for idealized STS assistances from an exoskeleton, specifically targeting the hip and knee joints due to their critical roles in facilitating STS movements. The hip is essential for initiating STS movements and pushing the body vertically upwards [11], while limitations in the force production capabilities of the knee extensor muscles are often the primary constraint in STS performance [31]. This framework is designed to deliver tailored, idealized torque profiles that synchronize seamlessly with human movements during STS maneuvers. Additionally, we explore the biomechanical changes associated with the use of these STS controllers. Four scenarios of STS are simulated and studied: 1) without exoskeleton assistance, 2) with hip assistance (H-Exo), 3) with knee assistance (K-Exo), and 4) with hip plus knee assistance (H+K-Exo). Subsequent sections of this research article discuss the methodologies employed in the simulations and the results obtained.

2 Methods

In this study, we have adapted a DRL framework to train STS controllers with and without exoskeleton assistance. These controllers are engineered to mimic a target STS motion while fulfilling additional task objectives, steered by specifically designed rewards. An overview of the DRL controller framework used in this study is shown in Fig. 1. Our DRL training environment adapts the two-level imitation learning structure initially developed by Lee et al. [32]. It consists of a trajectory mimicking control policy network that outputs desired joint angles and a muscle coordination neural network that produces individual muscle excitations to generate desired torques. A key innovation in our work is the integration of exoskeleton assistive torques into the control networks. These assistive torques work in tandem with the muscle forces to generate the desired torques. Additionally, we have implemented specially designed balance rewards to enhance the assisted STS motion’s performance.

2.1 Musculoskeletal model and idealized torque assistance

The MSK model utilized in this study was adapted from the gait10dof18.osim model, retrieved from OpenSim model repository [33]. For faster computational speed and efficient learning, the model was modified by removing the left lower extremity and its associated muscles, assuming symmetry. This adjustment reduced the model to include seven degrees of freedom (DOF): a 3-DOF planar pelvis joint, a 1-DOF lumbar joint, and 1-DOF for the hip, knee, and ankle joints each and nine muscles on the right limb. Additionally, the torso mass was halved to account for the removed lower extremity. The original Millard muscles in the model were converted to MuJoCo-type muscles [34] for computational efficiency, where the tendons are modeled as rigid components. The physical muscle parameters (such as fiber length and maximum muscle force) of each muscle were loaded from the original OpenSim model without modifications. The maximum isometric forces of each of these muscles as used in the simulations is shown in Table 1. The knee joint was simplified to a single revolute joint, and the attachment points for the vasti and rectus femoris muscles were adjusted to maintain comparable moment arms, especially when the knee is highly flexed. Further, the erector spinae longissimus and rectus abdominis muscles were added to the lumbar joint to act as simplified versions of the lumbar musculature to enable control over trunk orientation. Three contact spheres were added to the foot, one at the heel and two at the toes, to model the contact between the foot and the ground. Additionally, one contact sphere placed close to the ischial tuberosity of the pelvis and one sphere on the thigh were utilized to model the contact between the buttocks and the chair. A depiction of the final model in a standing and seated position is shown in Fig. 2.

Table 1 The maximum isometric force for each muscle included in the musculoskeletal model (unit: Newtons)

Full size table

The muscle activation ($a$) in all the 11 human muscles of the model is governed by the first order excitation-activation dynamics equation as follows:

$$ \dot{a} = \frac{u-a}{\tau(u,a)}, \tau(u,a) = \textstyle\begin{cases} \tau_{act}(0.5+1.5a) & u -a >0\\ \tau_{deact}/(0.5+1.5a)&u-a\leq0 \end{cases}\displaystyle , $$

(1)

where $u$ is the muscle excitation (control signal obtained from the muscle network output) and $\tau $ is the delay time. $\tau _{act}$ and $\tau _{deact}$ are muscle activation and de-activation time constants with the values set to (0.01, 0.04). This equation describing the dynamics is solved through integration with both excitation and activation values ranging within [0,1].

The dynamics of the human musculoskeletal model is represented in the joint space and is governed by the Euler–Lagrangian equations utilizing generalized coordinates:

$$ \boldsymbol{M} \left ( \boldsymbol{q} \right ) \ddot{\boldsymbol{q}} + \boldsymbol{C} \left ( \boldsymbol{q}, \dot{\boldsymbol{q}} \right ) = \boldsymbol{J}_{\boldsymbol{M}}^{\boldsymbol{T}} \boldsymbol{F}_{\boldsymbol{M}} + \boldsymbol{J}_{\boldsymbol{ext}}^{\boldsymbol{T}} \boldsymbol{F}_{\boldsymbol{ext}} + \boldsymbol{\tau}_{exo}. $$

(2)

Here, $\boldsymbol{q}$, $\dot{\boldsymbol{q}}$, $\ddot{\boldsymbol{q}}$ are the joint angles, angular velocity, and angular accelerations, respectively. $\boldsymbol{F}_{M}$ are the muscle forces that depend on the muscle activations and $\boldsymbol{F}_{ext}$ are the external forces (e.g., contact forces) acting on the musculoskeletal model. $\boldsymbol{M} ( \boldsymbol{q} )$ is the generalized mass matrix, and $\boldsymbol{C} ( \boldsymbol{q}, \dot{\boldsymbol{q}} )$ accounts for the Coriolis and gravitational forces. The Jacobian matrices $\boldsymbol{J}_{M}$ and $\boldsymbol{J}_{ext}$ convert the muscle forces and external forces into generalized joint torques. $\boldsymbol{\tau}_{exo}$ is the idealized exoskeleton assistance torque. In our implementation, the assistance torque at each joint is modeled as a pair of agonistic and antagonistic actuators that provide either flexion or extension assistance.

The dynamics of the musculoskeletal model are integrated using a forward dynamics approach with the muscle excitations as obtained from the muscle coordination neural network as part of the DRL framework. Kinematic constraints such as the hip and knee joint limits are imposed, and the contact forces are solved using the open-source Dynamic Animation and Robotics Toolkit (DART) simulation environment during the forward simulations [35].

2.2 Reinforcement learning for sit to stand muscle control

The MSK model interacts with the ground and the seating box in the learning environment, which is the dynamic simulator. The control of this environment is realized through a combination of two multilayer perceptron (MLP) neural networks as shown in Fig. 1: the control policy network (CPN) and the muscle coordination network (MCN). The agent (CPN) takes the human body state information as input, then outputs the desired joint angles as the action. The desired joint angles are thereafter converted into desired joint torques ($\boldsymbol{\tau}_{d}$) through a proportional-derivative (PD) controller [36].

The MCN neural network used for learning muscle excitations is a deterministic policy $a = \pi _{\psi} ( \boldsymbol{\tau}_{d}, s_{muscle} )$, where the network parameters $\psi $ are learned through regression by supervised learning. The muscle coordination network is defined with three hidden layers (n = 512, 256, 256 nodes) and the loss function is given by

$$ Loss(a(\psi ))= \mathbb{E} \left [ \left \Vert \boldsymbol{\tau}_{\boldsymbol{d}} - \boldsymbol{\tau}_{\boldsymbol{m}} - \boldsymbol{\tau}_{\boldsymbol{exo}} \right \Vert ^{2} + w_{reg} \left \Vert \boldsymbol{a} (\psi ) \right \Vert ^{2} \right ]. $$

(3)

Here, the first term minimizes the difference between the desired torques ($\boldsymbol{\tau}_{d}$) and the sum of the biological joint torques ($\boldsymbol{\tau}_{m}$) and the exoskeleton torques ($\boldsymbol{\tau}_{exo}$). The second term is a regularization term that reduces large muscle activations. The MCN predicted $\boldsymbol{a} ( \psi )$ is fed to the simulation environment as the muscle excitation instead of activation since the activation must obey Eq. (1).

The CPN acts as the main RL agent controlling the MSK model’s actions based on its accumulated rewards. As the RL agent interacts with its environment, its actions are scored using a reward, and the agent is updated based on the action’s reward. At each time step $t$, the agent’s state $s_{t}$ is observed and an action $a_{t}$ is selected according to its control policy $\pi _{\theta} ( a_{t} \mid s_{t} )$, with $\theta $ being the weights and bias of the neural network. The control policy is learned by maximizing the discounted sum of reward ($r_{t}$).

The DRL framework is trained with the proximal policy optimization (PPO) algorithm [37], which is a model-free policy gradient algorithm widely used for continuous control problems. PPO updates the control policy’s parameters ($\theta $) using the expected return’s gradient with respect to the parameters. The agent learns to increase its reward by modifying the parameters $\theta $ of the network. The CPN is defined as an MLP with two hidden layers with 256 nodes each. A desired target trajectory is provided as reference.

The total reward function $r_{t}$ for the RL algorithm is designed to drive the MSK model to reach the target state by including primarily a torque reward $r_{t}^{torq}$, a tracking reward $r_{t}^{track}$, and an extrapolated center of mass (XcoM) [38] stability reward $r_{t}^{xcom}$. Additionally, the simulation introduces a reward for maintaining an upright posture $r_{t}^{upright}$ and a reward for minimizing velocity at the end of the movement $r_{t}^{vel}$. Both rewards are activated at the 2-second mark to ensure the posture is upright and the movement velocity is minimized at the conclusion of the simulation as follows:

$$ r_{t} = w^{torq} r_{t}^{torq} + w^{track} r_{t}^{track} + w^{xcom} r_{t}^{xcom} \left ( + w^{up} r_{t}^{up} + w^{vel} r_{t}^{vel} \right )_{ift \geq 2}, $$

(4)

where $w^{torq} =0.1$, $w^{xcom} =0.1$, $w^{track} =1.0$, $w^{up} =1.0$, and $w^{vel} =0.5$. The torque reward is included to help reduce the energy consumption of the joints by minimizing the torques:

$$ \textstyle\begin{array}{l} r_{t}^{torq} = exp \left ( - \sigma _{torq} \sum _{j} \left \Vert \tau _{j} \right \Vert ^{2} \right ), \end{array} $$

(5)

where $\sigma _{torq} =0.001$. The tracking reward minimizes the difference between the reference trajectories and the controller prescribed angles. The experimental reference STS motion capture joint angle data used in this study for trajectory mimicking is obtained from inverse kinematics solutions as determined by Caruthers et al. [11]. Our goal was not to achieve perfect tracking but to use the tracking data as general guidance and demonstrate that our RL based simulation can generate physically feasible and realistic motion even when a nonspecific motion is used for tracking. The tracking reward is defined as follows:

$$ \textstyle\begin{array}{l} r_{t}^{track} = exp \left ( - \sigma _{p} \sum _{j} \left \Vert \hat{q}_{t}^{j} - q_{t}^{j} \right \Vert ^{2} \right ), \end{array} $$

(6)

where $\sigma _{q} =2.0$, $q_{t}^{j}$ is the DOF value of the jth joint, and $\hat{q}_{t}^{j}$ is the corresponding DOF value for the tracking motion. The XcoM reward is defined as

$$ r_{t}^{xcom} = exp \left ( - \sigma _{xcom} \left \Vert xco m_{t} - xco m_{t arg e t} \right \Vert ^{2} \right ), $$

(7)

where $\sigma _{xcom} =\ 40$ and $xcom_{target}$ is set as the $x$ (horizontal) position of the foot’s COM. The upright posture reward is defined as follows:

$$ r_{t}^{up} = exp \left ( - \sigma _{up} \left \Vert p_{h ead}^{x} - p_{pelvis}^{x} \right \Vert ^{2} \right ), $$

(8)

where $\sigma _{up} =\ 100$ and $p_{pelvis}^{x} $ is the $x$ position of the pelvis, $p_{head}^{x} $ is the $x$ position of the head, which equals to $p_{pelvis}^{x} $ when the torso is totally upright. The velocity at target posture reward ensures stability when standing is achieved and is defined as follows:

$$ \textstyle\begin{array}{l} r_{t}^{vel} = exp \left ( - \sigma _{vel} \sum _{j} \left \Vert \dot{\hat{q}}_{t}^{j} - \dot{q}_{t}^{j} \right \Vert ^{2} \right ), \end{array} $$

(9)

where $\sigma _{vel} =2$ and $j$ is the joint index. $\dot{\hat{q}}_{t}^{j}$ is the joint velocity value for the standing posture, which is set to zero. All the reward terms used in the optimization framework are summarized in Table 2 with their respective weights.

Table 2 Reward terms used in the DRL framework and respective weights

Full size table

Moreover, early termination conditions [39] are imposed to ensure faster learning. The termination conditions include the detection of a fall (imposed by specifying a lower bound to the vertical position for the pelvis), as well as the detection of a toe or heel lift or a large enough foot sliding scenario. All training is executed on a Linux machine equipped with Intel Xeon CPUs (2.30 GHz) and a 16 GB Nvidia Quadro RTX 5000 GPU. Each training session involved a maximum of 50,000 iterations, typically requiring approximately 40 hours to reach completion. It is noteworthy that the rewards often plateaued well before reaching the 50,000 iteration mark, indicating the expected convergence during the training process.

3 Results

After obtaining trained controllers for each of the four cases, we conducted forward dynamic simulations of STS with these controllers to test their performance. Since the CPN is a stochastic control policy, we conducted 100 dynamic simulations for each case to obtain the mean responses. The initial state for these dynamic simulations in the tests were the same as the one used in the training (IK results @ t = 0 s as obtained from Caruthers et al. [11]). The variance in the tests was negligible; for example, the standard deviations in hip, knee, and joint angles for the unassisted case were all below 0.25 degrees. And we did not observe failed cases (resulting in a fall) from these dynamic simulations, underscoring the control’s robustness.

One example test case dynamic simulation for the STS motion for the unassisted condition is illustrated in Fig. 3 through a time-lapse sequence of screenshots of the model performing the motion.

The mean exoskeleton assistive hip and knee joint torques obtained for the assisted STS scenarios from the selected 100 dynamic simulations each is presented in Fig. 4. An upper limit of 50 Nm assistive torque per joint was imposed during the simulations. We chose 50 Nm for the knee and hip joints in our simulations to provide substantial but partial support, as evidenced by biomechanical studies such as Roebroeck et al. [10] and Yoshioka et al. [40], which demonstrated the typical joint torque requirements during STS movements. This level of torque strikes a balance between enabling necessary movement and encouraging user effort to prevent muscle atrophy, while also considering the state-of-the-art torque capabilities of the lightweight and compliant motors for future physical implementation. The hip and knee joint assistive torque stayed completely below this limit for the H+K-Exo scenario. Saturated maximum assistive extension torques equal to the upper limit of 50 Nm are observed during brief time periods following the occurrence of maximum hip flexion/lumbar flexion (i.e., the time point at which the hip and knee are starting to extend to go into standing position) in both the H-Exo and K-Exo assistive scenarios. All the assistive torque profiles obtained across all assistive scenarios showed a relatively smooth progression with a few small abrupt changes.

The average joint angles observed for each scenario are presented in Fig. 5. The angles are compared with experimental IK results obtained from Caruthers et al. [11]. A delayed maximum hip flexion timing compared to experimental data was observed across all the four scenarios. Further, the lumbar flexion peak was shifted for the H-Exo and H+K-Exo scenarios with large lumbar flexion angle as high as twice the observed experimental mean. The knee angles across all scenarios were comparable to the experimental trajectories, while the occurrence of the peak dorsi-flexion and plantar-flexion angles in the ankle were both delayed, and the overall ankle angle range was much smaller.

The trajectory of the whole-body COM for all four scenarios, presented in Fig. 6(a), shows a slight backward shift in overall body position at the end of the STS for the K-Exo and H+K-Exo scenarios. The kinematic and COM trajectories indicate a slightly larger forward lean in the assisted cases compared to the unassisted baseline. The contact forces observed at the ground and the seat are shown in Fig. 6 (b,c). These forces are relatively smooth and consistent across unassisted and assisted scenarios. The contact forces obtained at the seat are comparable to simulated observations of Munoz et al., the ground reaction forces are more comparable to experimental results from them, while their simulation results follow a sharper S profile with missing clear midlevel/halfway peak.

The mean muscle activation patterns for all 11 muscles in all four STS control scenarios are presented in Fig. 7. The peak activations in the GMAX and VAS muscle are reduced respectively by [−77% (increases), 0%], [3.6%, 12.1%], and [60.1%, 55.7%] for the hip, knee, and hip-knee scenarios. The muscle activation in the SOL muscle is near minimal, except for very small activations during the stabilizing phase of the unassisted scenario. Similarly, the activation in the BFSH muscle is also very low other than for a small portion of the unassisted scenario. Interestingly, the H-Exo assistance increases the muscle activation levels in the GMAX, GAS, and TA muscles compared to the baseline. The K-Exo assistance case also results in increased muscle activations in some muscles (HAMS & GAS).

The total percentage reductions in the muscle activations in the muscles compared to the unassisted condition over the whole 3-second time period are shown in Table 3. These values are obtained by comparing the average muscle activations (computed as the areas below the activation curves divided by the total time) in Fig. 7. Moreover, no considerable passive forces are noted to be developing, indicating that the muscles are not undergoing any undue passive stretching even under large knee flexion angle.

Table 3 Percentage reductions in overall (mean) muscle activations compared to the unassisted STS condition. Negative values are indicative of a percentage increase in comparison to the unassisted baseline. The soleus muscle activations were very low and therefore are not presented in this table

Full size table

The progression of each individual reward component (which is inclusive of raw reward value and weight) over the simulation period is presented in Fig. 8. The tracking, upward, and velocity rewards contribute more significantly to the total reward during the upright standing stabilization phase, whereas the torque and XcoM rewards have a lesser impact but also increased their rewards right before and during standing. It is evident that the XcoM reward plays an important role during the rapid transition phase, while tracking is less prioritized relative to other portions of the movement during this segment. The variability in the rewards is minimal across conditions and stabilization in the total reward is observed at the end of the simulation period. The near-zero values for torque minimization reward right before liftoff are indicative of large control demand in the muscles and the idealized actuators. Further, the controllers’ difficulty in imitating the joint kinematics (resulting in extensive lumbar flexion) in the H-Exo and H+K-Exo conditions is visible from the lower tracking reward values observable during the rising phase. The controller struggling to achieve complete static standing stability is also evident from the changing XcoM and velocity reward values in the assisted conditions.

4 Discussions

STS transition is undeniably an important component of daily life and results in severe loss of independence when impaired. Identifying the ideal level of joint assistance necessary to minimize effort and still preserve adequate stability in the STS motion is highly beneficial. In this study, we have engineered and presented four distinct DRL-based controllers for facilitating both unassisted and assisted (hip, knee, and combined hip-knee) STS transitions in a controlled simulation environment. Each controller underwent extensive evaluation through one hundred dynamic simulations for each scenario. Future work should involve implementing these controllers on a physical hip-knee assistive device and studying the biomechanical changes to conclusively verify their effectiveness and robustness through comprehensive hardware-equipped experimental validation.

Our predicted STS joint angles for the unassisted case generally align with those reported by Caruthers et al. [11], with two notable exceptions: hip flexion is more excessive and dorsiflexion at the ankle joint is less pronounced, though both are still within observed ranges from other studies. Specifically, while our hip flexion exceeds the levels reported by Caruthers et al., it remains within the 130 degrees maximum hip flexion angles reported by Schenkman et al. [41], with our maximum angles not surpassing 110 degrees. Our results for ankle dorsiflexion, albeit lower than Caruthers et al., are still comparable to other STS ankle trajectories, such as those reported by Kumar et al. [14]. We believe that the considerable variability in ankle angles reported across studies could likely be due to differences in experimental conditions, such as seat height or initial foot position [42]. It is plausible that the increased hip flexion observed serves as a compensatory adaptation, enhancing the alignment of the body’s center of mass over the feet when ankle dorsiflexion is relatively small.

While tracking the provided reference STS motion is one of the primary rewards, the DRL policy also accounts for other competing balance rewards, albeit at the potential expense of tracking accuracy. Given the differences between our generic MSK model and Caruthers’ model, strictly tracking the reference trajectory is unlikely to be feasible. The MSK model was not scaled anthropometrically, and the maximum isometric forces of the muscles were not updated to match the Caruthers source subject of the tracking data. Notably, the ankle joint angle of the reference trajectory is outside the mean ± SD region of Caruthers’ experimental data, but our predicted trajectory adjusted for this during the standing phase of unassisted motion. Utilizing the reference trajectory as general guidance, we showcase that our RL-based simulation can generate physically feasible motion for the current MSK model. This is achieved despite differences in anthropometry and muscle characteristics between the MSK model and the real subject, which is an important factor for tracking accuracy. It is plausible that the controller may produce very different motion or even infeasible motion if the muscle capacity is insufficient (e.g., in cases of severe muscle weakness or disability).

The observed muscle activations during unassisted STS in the soleus (SOL) muscle are notably minimal and corroborated by Caruthers et al.’s [11] findings, which also revealed low levels of SOL activations in their static optimization results. Munoz et al. in their studies noted higher experimental activations in the SOL muscle, while the activations yielded by their reflexive-controller based simulations were low, mirroring our DRL based observations [15]. Similarly, Kumar et al. also reported very minimal SOL activations in their simulations [14]. Exploring the activation behavior of the TA muscle reveals that our TA activation pattern and level closely align with existing literature [11] despite the slightly reduced dorsiflexion angle. In the unassisted scenario, short-duration saturation in activations were observed for the IL and VAS muscles. The VAS is noted to reach high activation levels during the peak of the extension phase [14, 15], whereas no corresponding experimental STS data was found for the IL activation. The RA and ESL muscles exhibit prolonged full activation in our simulation, likely due to their oversimplified modeling which was only intended to produce the necessary lumbar moments. Practically, activation levels for the RA and ESL should be lower with mean activations typically up to 55% and 70%, respectively, during fast STS maneuvers [43]. This suggests that the current RA and ESL muscles in our model might be too weak, and a higher isometric force might be more appropriate. It is plausible that the high activation of ESL could also be a result of the predicted motion with fast movement of COM during the extension phase. With hip assistance, the ESL activation is increased compared to the unassisted case. One possible explanation for this could be that, while hip assistance reduced hip muscle torques, the ESL may need to increase its activation to control the movement of the pelvis and torso.

In general, all the assistive cases resulted in a delayed occurrence of the maximum lumbar and hip flexion angle, as well as a deeper flexion before seat-off. Hip assistive torques both in the H-Exo and H+K-Exo scenarios increase the maximum lumbar flexion angle by around 50% compared to baseline and resulted in slightly lower peak hip flexion angles. Many studies have identified increased lumbar flexion as a stabilization strategy in STS [14], hinting that the larger lumbar motion we observe could be a compensatory mechanism to maintain balance when hip assistive torque is provided. The muscle activations in the two lumbar muscles, the ESL and RA, were observed to be saturating, indicative of potentially insufficient muscle strength that resulted in large lumbar flexion. Similarly, the hip assistive torques in these two assistive scenarios also resulted in a higher ankle plantar flexion angle when rising from the chair to a standing pose compared to the unassisted STS ankle angle. The trajectory through which the knee angle extended remained relatively consistent across the unassisted and all the assistive STS scenarios.

The H+K-Exo assistive case is effective in reducing the activations in all the muscles in comparison to baseline, except for the TA, RA, and ESL. Particularly, considerable activation percentage reductions ([VAS, 73.23%], [GMAX, 68.63%], [HAMS, 58.21%], and [RF, 92.32%]) in the major STS hip and knee extension contributors are noticed. However, the H-Exo and K-Exo scenarios result in increased muscle activations in the HAMS, GAS, and TA muscles. Assisting the knee joint alone (K-Exo) seems to introduce some instability after seat-off that is countered by additional muscle activations in the GAS and TA muscles (major ankle plantarflexion and dorsiflexion muscles). Similar observations where muscles surrounding the unassisted joints increasing in activation level for higher assistance levels have been noticed in emulator-based ankle assisted experimental gait studies [44]. The biarticular hamstrings and the rectus femoris are known to exhibit significant co-contraction during STS [45]. This is also observable to some extent in our biarticular HAMS and RF in the model right after seat-off for the unassisted and K-Exo scenarios.

The torque profiles acquired from our simulations demonstrated peak assistive torque values approximately at the midpoint of the STS motion and displayed a generally smooth profile with some oscillations and few abrupt direction changes. These profiles can be readily parameterized after minor adjustments and implemented on a real-world hip-knee exoskeleton through sim-to-real transfer as required. Similarly, parameterized smooth torque profiles such as cubic splines or linear portions are often used as the basis for specifying assistive torques in exoskeletons during gait [46] or other activities such as squats [47]. This is particularly common in exoskeleton prototype emulator-based human-in-the-loop studies, where such torque profiles are optimized by altering these parameters based on human physiological objectives [48] or user preferences [49]. Our framework generates generic torque profiles that could potentially be used as initial references for subject-specific assistance optimization in human-in-the-loop experiments, pending validation on real-world hardware. By serving as a starting point, these profiles enable the customization of assistive strategies to meet individualized biomechanical needs of users during STS transitions. This structured approach presents a promising avenue for developing more personalized and effective exoskeleton assistance for STS in the future.

The presented framework could potentially be adapted to develop generic STS assistive controllers for clinical populations, such as individuals with neuromuscular disorders or the elderly, by incorporating factors like muscle strength deficits, activation-deactivation delays, and neuromuscular noise. A more complex MSK model and additional refinements to the rewards will likely be necessary, along with validation to ensure effectiveness before clinical application. As an example, modeling conditions that are characterized by asymmetry, such as hemiparesis, requires a two-legged model with the degrees of freedom represented in all three dimensions. In addition, this approach can be broadened to account for different chair heights, unforeseen environmental interactions, such as slippery floors or incorporating the use of arm supports and crutches, or even unexpected perturbations through a more comprehensive training of the controller.

The presented methodology has revealed promising avenues for the development of DRL-based STS assistance controllers with several notable limitations and opportunities for future enhancement. Firstly, we modeled the STS assistance as idealized joint torques, omitting the consideration of exoskeleton inertial properties and the interactive forces between the exoskeleton and human. Nonetheless, this allows us to focus on the DRL and generating assistance profiles without tailoring them to any specific exoskeleton design. Secondly, the DRL controller was only trained on one generic symmetrical MSK model. While this approach provides initial insights, extending it to subject-specific MSK models with more realistic muscle models could potentially yield personalized assistive torque profiles. This limitation can potentially be further alleviated by incorporating domain randomization in the simulation, which would simulate a broader range of model conditions and variables, improving the realism and generalizability of the results [28]. Furthermore, our preliminary insights into the influence of hip and knee joint assistance on STS motion are encouraging, though they highlight areas where simulation fidelity could be improved and interpretation of results could be strengthened with statistical analysis. Enhancing the model by increasing maximum isometric force values in specific muscles (such as RA, ESL, VAS, and IL) could lead to more precise simulations and better reflect realistic STS motion. Moreover, adopting multiple trained controllers, each trained with different rewards and variations of the MSK model, could generate more diverse motions and responses to assistance. Reporting their mean and standard deviations will provide a more comprehensive statistical analysis of possible solutions. Lastly, the current method of selecting the relative weights for reward terms during the learning process relied on a trial-and-error approach. Adopting a more systematic approach, such as a grid search or an inverse reinforcement learning [50] approach for reward shaping, could potentially further improve the learning of the controllers. Despite these limitations and potential improvements, our approach sets a strong foundation for future research. Rigorous real-world testing and experimental validation will be crucial in confirming the practical effectiveness and feasibility of these DRL-based controllers in STS assistance.

5 Conclusions

This study presents a DRL framework for training robust muscle controllers and generating muscle-controlled STS movements while co-optimizing joint assistive torques. The resulting controllers demonstrate the in-situ ability to substantially reduce muscle activations in major lower limb muscles without introducing significant changes to kinematics or compromising stability, especially when both the hip and knee joints are conjointly assisted. Although the practical effectiveness of the predicted assistance torque profiles in assisting STS requires real-world testing and validation, the findings of this research provide valuable insights for the development of similar DRL methodologies to develop robust controllers for assistive technologies, including exoskeletons and prosthetic devices.

Data Availability

No datasets were generated or analysed during the current study.

References

Dall, P.M., Kerr, A.: Frequency of the sit to stand task: an observational study of free-living adults. Appl. Ergon. 41(1), 58–61 (2010). https://doi.org/10.1016/j.apergo.2009.04.005
Article Google Scholar
Alcazar, J., Losa-Reyna, J., Rodriguez-Lopez, C., Alfaro-Acha, A., Rodriguez-Mañas, L., Ara, I., et al.: The sit-to-stand muscle power test: an easy, inexpensive and portable procedure to assess muscle power in older people. Exp. Gerontol. 112, 38–43 (2018). https://doi.org/10.1016/j.exger.2018.08.006
Article Google Scholar
Csuka, M., McCarty, D.J.: Simple method for measurement of lower extremity muscle strength. Am. J. Med. 78(1), 77–81 (1985). https://doi.org/10.1016/0002-9343(85)90465-6
Article Google Scholar
Lord, S.R., Murray, S.M., Chapman, K., Munro, B., Tiedemann, A.: Sit-to-stand performance depends on sensation, speed, balance, and psychological status in addition to strength in older people. J. Gerontol., Ser. A, Biol. Sci. Med. Sci. 57(8), M539–M543 (2002)
Article Google Scholar
Norman-Gerum, V., McPhee, J.: Comprehensive description of sit-to-stand motions using force and angle data. J. Biomech. 112, 110046 (2020). https://doi.org/10.1016/j.jbiomech.2020.110046
Article Google Scholar
Xue, Q., Wang, T., Yang, S., Zhou, B., Zhang, H.: Experimental study on sit-to-stand (sts) movement: a systematic review. Int. J. Intell. Robot. Appl. 6(1), 152–170 (2022). https://doi.org/10.1007/s41315-021-00188-x
Article Google Scholar
Etnyre, B., Thomas, D.Q.: Event standardization of sit-to-stand movements. Phys. Ther. 87(12), 1651–1666 (2007). https://doi.org/10.2522/ptj.20060378
Article Google Scholar
Millington, P.J., Myklebust, B.M., Shambes, G.M.: Biomechanical analysis of the sit-to-stand motion in elderly persons. Arch. Phys. Med. Rehabil. 73(7), 609–617 (1992)
Google Scholar
Lindemann, U., Claus, H., Stuber, M., Augat, P., Muche, R., Nikolaus, T., et al.: Measuring power during the sit-to-stand transfer. Eur. J. Appl. Physiol. Occup. Physiol. 89(5), 466–470 (2003). https://doi.org/10.1007/s00421-003-0837-z
Article Google Scholar
Roebroeck, M., Doorenbosch, C., Harlaar, J., Jacobs, R., Lankhorst, G.: Biomechanics and muscular activity during sit-to-stand transfer. Clin. Biomech. 9(4), 235–244 (1994)
Article Google Scholar
Caruthers, E.J., Thompson, J.A., Chaudhari, A.M., Schmitt, L.C., Best, T.M., Saul, K.R., et al.: Muscle forces and their contributions to vertical and horizontal acceleration of the center of mass during sit-to-stand transfer in young, healthy adults. J. Appl. Biomech. 32(5), 487–503 (2016)
Article Google Scholar
Smith, S.H., Reilly, P., Bull, A.M.: A musculoskeletal modelling approach to explain sit-to-stand difficulties in older people due to changes in muscle recruitment and movement strategies. J. Biomech. 98, 109451 (2020)
Article Google Scholar
Norman-Gerum, V., McPhee, J.: Constrained dynamic optimization of sit-to-stand motion driven by Bézier curves. J. Biomech. Eng. 140(12), 121011 (2018). https://doi.org/10.1115/1.4041527
Article Google Scholar
Kumar, V., Yoshiike, T., Shibata, T.: Predicting sit-to-stand adaptations due to muscle strength deficits and assistance trajectories to complement them. Front. Bioeng. Biotechnol. 10, 799836 (2022). https://doi.org/10.3389/fbioe.2022.799836
Article Google Scholar
Muñoz, D., De Marchis, C., Gizzi, L., Severini, G.: Predictive simulation of sit-to-stand based on reflexive-controllers. PLoS ONE 17(12), e0279300 (2022)
Article Google Scholar
Gordon, D., Christou, A., Stouraitis, T., Gienger, M., Vijayakumar, S.: Learning personalised human sit-to-stand motion strategies via inverse musculoskeletal optimal control. In: 2023 IEEE International Conference on Robotics and Automation. IEEE, New York (2023)
Google Scholar
Geravand, M., Korondi, P.Z., Werner, C., Hauer, K., Peer, A.: Human sit-to-stand transfer modeling towards intuitive and biologically-inspired robot assistance. Auton. Robots 41(3), 575–592 (2017). https://doi.org/10.1007/s10514-016-9553-5
Article Google Scholar
Choi, G., Lee, D., Kang, I., Young, A.J.: Effect of assistance timing in knee extensor muscle activation during sit-to-stand using a bilateral robotic knee exoskeleton. Proc. Annu. Int. Conf. IEEE Eng. Med. Biol. Soc. 2021, 4879–4882 (2021). https://doi.org/10.1109/embc46164.2021.9629965
Article Google Scholar
Alouane, M.A., Huo, W., Rifai, H., Amirat, Y., Mohammed, S.: Hybrid fes-exoskeleton controller to assist sit-to-stand movement. IFAC-PapersOnLine 51(34), 296–301 (2019). https://doi.org/10.1016/j.ifacol.2019.01.032
Article Google Scholar
Zhou, J., Zeng, Q., Tang, B., Luo, J., Xiang, K., Pang, M.: A hip active lower limb support exoskeleton for load bearing sit-to-stand transfer. In: International Conference on Intelligent Robotics and Applications, pp. 24–35. Springer, Berlin (2022)
Chapter Google Scholar
Schmidt, K., Duarte, J.E., Grimmer, M., Sancho-Puchades, A., Wei, H., Easthope, C.S., et al.: The myosuit: bi-articular anti-gravity exosuit that reduces hip extensor activity in sitting transfers. Front. Neurorobot. 11, 57 (2017). https://doi.org/10.3389/fnbot.2017.00057
Article Google Scholar
Vantilt, J., Tanghe, K., Afschrift, M., Bruijnes, A.K.B.D., Junius, K., Geeroms, J., et al.: Model-based control for exoskeletons with series elastic actuators evaluated on sit-to-stand movements. J. NeuroEng. Rehabil. 16(1), 65 (2019). https://doi.org/10.1186/s12984-019-0526-8
Article Google Scholar
Weng, J., Hashemi, E., Arami, A.: Natural walking with musculoskeletal models using deep reinforcement learning. IEEE Robot. Autom. Lett. 1(1), 4156–4162 (2021). https://doi.org/10.1109/LRA.2021.3067617
Article Google Scholar
Nowakowski, K., El Kirat, K., Dao, T.-T.: Deep reinforcement learning coupled with musculoskeletal modelling for a better understanding of elderly falls. Med. Biol. Eng. Comput. 60(6), 1745–1761 (2022). https://doi.org/10.1007/s11517-022-02567-3
Article Google Scholar
Denizdurduran, B., Markram, H., Gewaltig, M.-O.: Optimum trajectory learning in musculoskeletal systems with model predictive control and deep reinforcement learning. Biol. Cybern. 116(5–6), 711–726 (2022)
Article Google Scholar
Song, S., Kidziński, Ł., Peng, X.B., Ong, C., Hicks, J., Levine, S., et al.: Deep reinforcement learning for modeling human locomotion control in neuromechanical simulation. J. NeuroEng. Rehabil. 18(1), 126 (2021). https://doi.org/10.1186/s12984-021-00919-y
Article Google Scholar
Kayan, O., Yalcin, H.: Learning to walk on a human musculoskeletal model wearing a knee orthosis via deep reinforcement learning. In: 2023 5th International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA), pp. 1–4 (2023)
Google Scholar
Luo, S., Androwis, G., Adamovich, S., Nunez, E., Su, H., Zhou, X.: Robust walking control of a lower limb rehabilitation exoskeleton coupled with a musculoskeletal model via deep reinforcement learning. J. NeuroEng. Rehabil. 20(1), 34 (2023). https://doi.org/10.1186/s12984-023-01147-2
Article Google Scholar
Jamali, S., Taghvaei, S., Haghpanah, S.A.: Optimal strategy for sit-to-stand movement using reinforcement learning. J. Rehabil. Sci. Res. 4(3), 70–75 (2017). https://doi.org/10.30476/jrsr.2017.41122
Article Google Scholar
Tian, R., Sun, W.: Assistive standing seat based on reinforcement learning. In: Proceedings of the 2023 3rd International Conference on Robotics and Control Engineering, pp. 76–80 (2023)
Chapter Google Scholar
Van der Heijden, M.M., Meijer, K., Willems, P.J., Savelberg, H.H.: Muscles limiting the sit-to-stand movement: an experimental simulation of muscle weakness. Gait Posture 30(1), 110–114 (2009). https://doi.org/10.1016/j.gaitpost.2009.04.002
Article Google Scholar
Lee, S., Park, M., Lee, K., Lee, J.: Scalable muscle-actuated human simulation and control. ACM Trans. Graph. 38(4), 1–13 (2019). https://doi.org/10.1145/3306346.3322972
Article Google Scholar
Delp, S.L., Anderson, F.C., Arnold, A.S., Loan, P., Habib, A., John, C.T., et al.: Opensim: open-source software to create and analyze dynamic simulations of movement. IEEE Trans. Biomed. Eng. 54(11), 1940–1950 (2007). https://doi.org/10.1109/TBME.2007.901024
Article Google Scholar
Todorov, E., Erez, T., Tassa, Y.: Mujoco: a physics engine for model-based control. In: 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 5026–5033. IEEE, New York (2012)
Chapter Google Scholar
Lee, J., Grey, M.X., Ha, S., Kunz, T., Jain, S., Ye, Y., et al.: Dart: dynamic animation and robotics toolkit. J. Open Sour. Softw. 3(22), 500 (2018)
Article Google Scholar
Tan, J., Liu, K., Turk, G.: Stable proportional-derivative controllers. IEEE Comput. Graph. Appl. 31(4), 34–44 (2011). https://doi.org/10.1109/MCG.2011.30
Article Google Scholar
Schulman, J., Wolski, F., Dhariwal, P., Radford, A., Klimov, O.: Proximal policy optimization algorithms (2017). arXiv preprint arXiv:1707.06347
Hof, A.L., Gazendam, M., Sinke, W.: The condition for dynamic stability. J. Biomech. 38(1), 1–8 (2005)
Article Google Scholar
Peng, X.B., Abbeel, P., Levine, S., Panne, M.V.D.: Deepmimic: example-guided deep reinforcement learning of physics-based character skills. ACM Trans. Graph. 37(4), 1–14 (2018). https://doi.org/10.1145/3197517.3201311
Article Google Scholar
Yoshioka, S., Nagano, A., Hay, D.C., Fukashiro, S.: Peak hip and knee joint moments during a sit-to-stand movement are invariant to the change of seat height within the range of low to normal seat height. Biomed. Eng. Online 13(1), 27 (2014). https://doi.org/10.1186/1475-925x-13-27
Article Google Scholar
Schenkman, M., Berger, R.A., Riley, P.O., Mann, R.W., Hodge, W.A.: Whole-body movements during rising to standing from sitting. Phys. Ther. 70(10), 638–648 (1990). Discussion 648–651. https://doi.org/10.1093/ptj/70.10.638
Article Google Scholar
Jeon, W., Hsiao, H.Y., Griffin, L.: Effects of different initial foot positions on kinematics, muscle activation patterns, and postural control during a sit-to-stand in younger and older adults. J. Biomech. 117, 110251 (2021). https://doi.org/10.1016/j.jbiomech.2021.110251
Article Google Scholar
Tebbache, N., Hamaoui, A.: Effect of seat backrest inclination on the muscular pattern and biomechanical parameters of the sit-to-stand. Front. Human Neurosci. 15, 678302 (2021). https://doi.org/10.3389/fnhum.2021.678302
Article Google Scholar
Poggensee, K.L., Collins, S.: Lower limb biomechanics of fully trained exoskeleton users reveal complex mechanisms behind the reductions in energy cost with human-in-the-loop optimization. Front. Robot. AI 11, 1283080 (2024)
Article Google Scholar
Roebroeck, M.E., Doorenbosch, C.A.M., Harlaar, J., Jacobs, R., Lankhorst, G.J.: Biomechanics and muscular activity during sit-to-stand transfer. Clin. Biomech. 9(4), 235–244 (1994). https://doi.org/10.1016/0268-0033(94)90004-3
Article Google Scholar
Zhang, J., Fiers, P., Witte, K.A., Jackson, R.W., Poggensee, K.L., Atkeson, C.G., et al.: Human-in-the-loop optimization of exoskeleton assistance during walking. Science 356(6344), 1280–1284 (2017). https://doi.org/10.1126/science.aal5054
Article Google Scholar
Kantharaju, P., Jeong, H., Ramadurai, S., Jacobson, M., Jeong, H., Kim, M.: Reducing squat physical effort using personalized assistance from an ankle exoskeleton. IEEE Trans. Neural Syst. Rehabil. Eng. 30, 1786–1795 (2022)
Article Google Scholar
Ma, L., Ba, X., Xu, F., Leng, Y., Fu, C.: Emg-based human-in-the-loop optimization of ankle plantar-flexion assistance with a soft exoskeleton. In: 2022 International Conference on Advanced Robotics and Mechatronics (ICARM), pp. 453–458 (2022)
Chapter Google Scholar
Ingraham, K.A., Remy, C.D., Rouse, E.J.: The role of user preference in the customized control of robotic exoskeletons. Sci. Robot. 7(64), eabj3487 (2022). https://doi.org/10.1126/scirobotics.abj3487
Article Google Scholar
Liu, W., Zhong, J., Wu, R., Fylstra, B.L., Si, J., Huang, H.H.: Inferring human-robot performance objectives during locomotion using inverse reinforcement learning and inverse optimal control. IEEE Robot. Autom. Lett. 7(2), 2549–2556 (2022). https://doi.org/10.1109/LRA.2022.3143579
Article Google Scholar

Download references

Funding

The authors did not receive support from any organization for the submitted work.

Author information

Authors and Affiliations

Department of Biomedical Engineering, New Jersey Institute of Technology, Newark, NJ, 07102, USA
Neethan Ratnakumar, Kübra Akbaş, Rachel Jones, Zihang You & Xianlian Zhou

Authors

Neethan Ratnakumar
View author publications
You can also search for this author in PubMed Google Scholar
Kübra Akbaş
View author publications
You can also search for this author in PubMed Google Scholar
Rachel Jones
View author publications
You can also search for this author in PubMed Google Scholar
Zihang You
View author publications
You can also search for this author in PubMed Google Scholar
Xianlian Zhou
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Neethan Ratnakumar contributed to simulations, data analysis, and manuscript writing. Kübra Akbaş contributed to the conceptualization and methodology development. Rachel Jones and Zihang You contributed to methodology discussion and manuscript writing. Xianlian Zhou contributed to the conceptualization, simulations, and design of the study, providing supervision, writing, and revising the manuscript. All authors contributed to the writing, revision and approved the final manuscript.

Corresponding author

Correspondence to Xianlian Zhou.

Ethics declarations

Ethical clearances

Non-applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ratnakumar, N., Akbaş, K., Jones, R. et al. Predicting sit-to-stand motions with a deep reinforcement learning based controller under idealized exoskeleton assistance. Multibody Syst Dyn (2024). https://doi.org/10.1007/s11044-024-10009-1

Download citation

Received: 16 February 2024
Accepted: 24 June 2024
Published: 22 July 2024
DOI: https://doi.org/10.1007/s11044-024-10009-1

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Predicting sit-to-stand motions with a deep reinforcement learning based controller under idealized exoskeleton assistance

Abstract

Similar content being viewed by others

Momentum-based trajectory planning for lower-limb exoskeletons supporting sit-to-stand transitions

Human sit-to-stand transfer modeling towards intuitive and biologically-inspired robot assistance

Three dimensional unassisted sit-to-stand prediction for virtual healthy young and elderly individuals

1 Introduction

2 Methods

2.1 Musculoskeletal model and idealized torque assistance

2.2 Reinforcement learning for sit to stand muscle control

3 Results

4 Discussions

5 Conclusions

Data Availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethical clearances

Competing interests

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Predicting sit-to-stand motions with a deep reinforcement learning based controller under idealized exoskeleton assistance

Abstract

Similar content being viewed by others

Momentum-based trajectory planning for lower-limb exoskeletons supporting sit-to-stand transitions

Human sit-to-stand transfer modeling towards intuitive and biologically-inspired robot assistance

Three dimensional unassisted sit-to-stand prediction for virtual healthy young and elderly individuals

Explore related subjects

1 Introduction

2 Methods

2.1 Musculoskeletal model and idealized torque assistance

2.2 Reinforcement learning for sit to stand muscle control

3 Results

4 Discussions

5 Conclusions

Data Availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethical clearances

Competing interests

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation