Variational integrators and graph-based solvers for multibody dynamics in maximal coordinates

Brüdigam, Jan; Sosnowski, Stefan; Manchester, Zachary; Hirche, Sandra

doi:10.1007/s11044-023-09949-x

Variational integrators and graph-based solvers for multibody dynamics in maximal coordinates

Research
Open access
Published: 03 November 2023

Volume 61, pages 381–414, (2024)
Cite this article

Download PDF

You have full access to this open access article

Multibody System Dynamics Aims and scope Submit manuscript

Variational integrators and graph-based solvers for multibody dynamics in maximal coordinates

Download PDF

Jan Brüdigam¹,
Stefan Sosnowski¹,
Zachary Manchester² &
…
Sandra Hirche¹

1132 Accesses
2 Citations
1 Altmetric
Explore all metrics

Abstract

Multibody dynamics simulators are an important tool in many fields, including learning and control in robotics. However, many existing dynamics simulators suffer from inaccuracies when dealing with constrained mechanical systems due to unsuitable integrators with bad energy behavior and problematic constraint violations, for example in contact interactions. Variational integrators are numerical discretization methods that can reduce physical inaccuracies when simulating mechanical systems, and formulating the dynamics in maximal coordinates allows for easy and numerically robust incorporation of constraints such as kinematic loops or contacts. Therefore, this article derives a variational integrator for mechanical systems with equality and inequality constraints in maximal coordinates. Additionally, efficient graph-based sparsity-exploiting algorithms for solving the integrator are provided and implemented as an open-source simulator. The evaluation of the simulator shows improved physical accuracy due to the variational integrator and the advantages of the sparse solvers. Comparisons to minimal-coordinate algorithms show improved numerical robustness, and application examples of a walking robot and an exoskeleton with explicit constraints demonstrate the necessity and capabilities of maximal coordinates.

Linear-Time Variational Integrators in Maximal Coordinates

A Linear-Time Variational Integrator for Multibody Systems

Efficient Computation of Higher-Order Variational Integrators in Robotic Simulation and Trajectory Optimization

Discover the latest articles, news and stories from top researchers in related subjects.

Automotive Engineering

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Simulators for mechanical systems are widely used, for example, in testing and verification [1, 2], in model-based control strategies [3, 4], or in learning-based methods [5, 6]. However, many common simulators have numerical difficulties with more complex mechanical systems involving constraints [7]. Such constraints can represent joints connecting rigid bodies, which may form kinematic loops, for example, in exoskeletons. Constraints can also be used to confine the movement of bodies, for example, to model joint limits in robotic arms or to describe contact with other bodies or the environment in walking and grasping. Exactly enforcing such constraints can cause numerical issues, for example, due to the stiff nature of contact interactions. To alleviate these numerical issues, simulators often allow small constraint violations by representing all constraints as spring-damper elements, as in MuJoCo [8] and Brax [9], or by accepting interpenetration of bodies, as in Drake [10] and Bullet [11]. Small violations can sometimes be acceptable, for example, contact interpenetration in the order of micrometers for meter-scale walking robots. But millimeter or centimeter violations, for example in MuJoCo, can be considered too large. Employing these methods and accepting larger constraint violations for stable simulations contributes to the sim-to-real gap, which is a major issue in robotics [12].

Therefore, this article addresses physically accurate simulations of mechanical systems. Specifically, we focus on good modeling of the energy behavior of such systems, as this quantity is important for the stability of dynamical systems from a control theory perspective. Moreover, we treat the correct enforcement of constraints, since allowing for constraint violations can lead to problematic results. For example, allowing softness in contact interactions can lead to wrong contact points, which can cause a robot to fail to properly grasp an object when the simulation results are directly transferred to a real-world application.

Most simulators, including the ones listed above except Brax, use minimal coordinates (generalized coordinates) as a mechanism’s state representation. Given a system with $m$ degrees of freedom and a set of constraints that removes $c$ degrees of freedom, the system is parameterized by $n=m-c$ independent coordinates. In minimal coordinates, the constraints are implicitly incorporated into the dynamics equations, and only the smallest possible number of variables is retained, which leads to dimensionally small problem sizes. However, eliminating constraints requires specialized treatment of each constraint type and is not possible for nonholonomic constraints such as contacts. Furthermore, the algorithms typically used for minimal coordinates can have numerical issues such as ill-conditioning [13, 14], which may be increased by their recursive nature, although a rigorous study of the numerical effects and stability of minimal-coordinate dynamics algorithms remains to be done. We provide some empirical evidence for these numerical issues in Sect. 5.

It is also possible to explicitly retain all or part of the constraints on a mechanical system by introducing constraint forces to adhere to these constraints. With this approach, redundant coordinates are obtained since the constraints make part of the coordinates mathematically dependent. There are many ways to parameterize mechanical systems in redundant coordinates. Natural coordinates use three points in Cartesian coordinates to parameterize the configuration of each body [15]. Alternatively, the position of the center of mass of a body and a director frame for the orientation can be used [16, 17]. These methods have in common that they aim at circumventing the inherent intricacies of parameterizing rotations. In contrast, we use so-called maximal coordinates [18], which represent each body in a mechanism with its three degrees of freedom for the position of the center of mass and three degrees of freedom for the orientation by purposefully building on the group structure of rotations. All kinematic relations, such as joints or contacts, are formulated as explicit constraints. The use of maximal coordinates allows for very modular modeling of dynamics, which makes it possible to describe many different kinds of constraints in a unified and non-specialized manner. Additionally, resorting to maximal and redundant coordinates enables the use of different algorithms than for minimal coordinates, specifically direct matrix methods, which generally have well-investigated numerical properties and enable the formulation of stable and well-conditioned algorithms [19]. The core idea of direct matrix methods is to modify the underlying matrix before, during, and after it is used in an algorithm to improve the stability and accuracy of an algorithm. Examples of such modifications are scaling certain matrix entries or iterative solution refinement. The few existing works on simulation, specifically in maximal coordinates, have explored algorithms for continuous-time forward dynamics [18] and discrete-time algorithms for mechanisms without kinematic loops or contacts [20]. Besides their application in simulation, maximal coordinates have shown promising results in control applications [21, 22]. The simulator Brax is also based on maximal coordinates, but suffers from the aforementioned issue of soft constraint handling.

Besides the numerical issues related to constraints, all of the packaged simulators above use integrators for the dynamics that are not well suited for mechanical systems due to the unrealistic energy behavior of certain explicit or implicit Runge–Kutta methods. Deriving a perfect integrator is generally not trivial [23, 24], but certain classes of integrators exhibit excellent properties, among them symmetric and symplectic methods [25]. An elegant way to derive symplectic integrators for mechanical systems is through a variational perspective, leading to variational integrators. These integrators are derived by discretizing the derivation of the mechanism’s differential equations instead of the differential equations themselves. Thus, certain properties such as energy and momentum conservation are maintained [26]. Compared to classical discretizations such as explicit Runge–Kutta methods, larger time steps can be taken due to the increased physical accuracy, and constraint drift is avoided entirely. Computationally efficient variational integrators have been theoretically investigated in minimal coordinates [27–29]. However, the implementations are limited to simple mechanical systems with few joints types and no contact interactions, potentially due to numerical difficulties. Variational and other structure-preserving integrators have also been investigated in redundant coordinates. Here, much effort has been put into variational integrators for systems with explicit constraints [17, 26, 30]. Another investigated approach is nullspace methods which eliminate constraints while remaining structure-preserving [31, 32]. Redundant coordinates are also well suited for structure-preserving integration of flexible multibody systems [33–35]. These presented works deal with advancing the theoretical properties of structure-preserving integrators. In this article, we derive a unified and modular integrator and develop methods for the numerically efficient implementation of such an integrator.

More specifically, the contribution of this article to maximal-coordinate simulators is as follows. Firstly, we derive a variational integrator in maximal coordinates for physically accurate simulations, including good energy behavior and rigid, non-drifting constraints. While variational integrators are generally not new, we derive this integrator in a unified framework for typical dynamics components, including rigid bodies, joints, contacts, friction, actuators and external forces, springs, and dampers. Secondly, we provide an efficient graph-based solver algorithm for the system of nonlinear equations that form the integrator. The solver exploits the sparsity of the nonlinear system of equations to account for the increased number of variables in maximal coordinates. It achieves linear computational complexity in the number of links and joints for mechanisms without kinematic loops and reduces the complexity for mechanisms with kinematic loops. For environment contacts and friction, the solver also achieves linear computational complexity in the number of links and contact points, while reducing the complexity for inter-mechanism contacts. Besides these theoretical contributions for efficient maximal-coordinate simulators based on variational integrators, we also provide an open-source implementation of such a simulator (see Sect. 5), which achieves competitive timing results compared to state-of-the-art simulators.

This article is structured as follows. In Sect. 2, the desired simulator components are mathematically formalized. Based on these components, the variational integrator is derived in Sect. 3, resulting in a system of nonlinear equations. The solver for this system of equations is presented in Sect. 4. An evaluation of the theoretical and numerical properties and application examples are given in Sect. 5, and conclusions are drawn in Sect. 6. The appendices provide background information, including our quaternion notation in Appendix A.

2 Dynamics components

This section formulates the dynamics components for which the variational integrator is derived in Sect. 3. Unit quaternions are used for rotations and orientations due to their computational efficiency (see Appendix A for our quaternion notation). However, other representations, such as rotation matrices, of which the group of unit quaternions is a double cover [36], could be used as well. Figure 1 shows a mechanism with the treated components.

Rigid body

In maximal coordinates, each of the $n_{\mathrm{b}}$ rigid bodies in a mechanism has a position $\boldsymbol{x}\in \mathbb{R}^{3}$ and orientation $\boldsymbol{q}\in \mathbb{H}$ ($\boldsymbol{q}^{ \mathrm{T}}\boldsymbol{q}=1$), as well as a translational velocity $\boldsymbol{v}\in \mathbb{R}^{3}$ and angular velocity $\boldsymbol{\omega }\in \mathbb{R}^{3}$. The configuration of a body is denoted as $\boldsymbol{z} = [\boldsymbol{x}^{\mathsf{T}}\ \boldsymbol{q}^{ \mathsf{T}}]^{\mathsf{T}}$, and the velocity is denoted as $\dot{\boldsymbol{z}} = [\boldsymbol{v}^{\mathsf{T}}\ \boldsymbol{\omega }^{\mathsf{T}}]^{\mathsf{T}}$. Each body has a mass $m\in \mathbb{R}$ and a symmetric moment of inertia matrix $\boldsymbol{J}\in \mathbb{R}^{3\times 3}$. All quantities refer to the center of mass of a body. Given $\boldsymbol{M}=m\boldsymbol{I}_{3\times 3}$, the kinetic energy of each body is $\mathcal{T}(\dot{\boldsymbol{z}}) = \frac{1}{2}\boldsymbol{v}^{ \mathsf{T}}\boldsymbol{M}\boldsymbol{v} + \frac{1}{2} \boldsymbol{\omega }^{\mathsf{T}}\boldsymbol{J}\boldsymbol{\omega }$.

Conservative forces, potentials, and springs

Conservative forces in the dynamics are derived from potential functions $\mathcal{V}(\boldsymbol{z}_{\mathrm{a}}, \boldsymbol{z}_{\mathrm{b}}, \boldsymbol{z}_{\mathrm{c}}, \ldots )\in \mathbb{R}$ involving one or multiple bodies. Such potentials can represent, for example, gravity or springs.

Non-conservative forces, actuators, and dampers

Non-conservative forces, including actuators and dampers, are directly added to the dynamics as external forces $\boldsymbol{\mathrm{f}}\in \mathbb{R}^{3}$ or torques $\boldsymbol{\tau }\in \mathbb{R}^{3}$. Forces are described in the global frame and torques in the body frame. The wrench on a body is denoted as $\boldsymbol{\mathrm{w}} = [\boldsymbol{\mathrm{f}}^{\mathsf{T}}\ \boldsymbol{\tau }^{\mathsf{T}}]^{\mathsf{T}}$. These wrenches are added for each body individually. Actuators fixed at joints are formulated by expressing their wrench in the frames of the connected bodies.

As an example, for an actuator with wrench $\boldsymbol{\mathrm{w}}_{\mathrm{act}} = [\boldsymbol{\mathrm{f}}_{ \mathrm{act}}^{\mathsf{T}}\ \boldsymbol{\tau }_{\mathrm{act}}^{ \mathsf{T}}]^{\mathsf{T}}$ at a joint between bodies a (parent) and b (child), the resulting wrenches for the bodies are

(1a)

(1b)

where $\mathrm{N}$ is the global frame, $\mathrm{A}$ and $\mathrm{B}$ are reference frames of bodies a and b, respectively, and $\boldsymbol{p}$ is a vector from center of mass to actuator.

Joints and equality constraints

The joints of a mechanism consisting of one or multiple rigid bodies are represented by differentiable equality constraints $\boldsymbol{g}(\boldsymbol{z}_{\mathrm{a}}, \boldsymbol{z}_{ \mathrm{b}}, \boldsymbol{z}_{\mathrm{c}}, \ldots )=\boldsymbol{0}\in \mathbb{R}^{n_{\mathrm{e}}}$, where $n_{\mathrm{e}}$ is the number of equality constraints on the mechanism. Typically, equality constraints are formulated for all $i$ kinematically connected pairs of rigid bodies independently and subsequently stacked into one constraint function $\boldsymbol{g} = [\boldsymbol{g}_{1}^{\mathsf{T}}\cdots \boldsymbol{g}_{i}^{\mathsf{T}}]^{\mathsf{T}}$. In maximal coordinates, two generic equality constraint functions, one for the translational and one for the rotational movement of the two connected bodies, can be combined to create most of the common joints encountered in mechanisms. This insight greatly simplifies analytic gradient calculations for fast computations and gives direct access to minimal coordinates as well (see Appendix B for a list of possible joints and the recovery of minimal coordinates).

Contacts and inequality constraints

Rigid contacts between multiple bodies and with the environment are represented by differentiable inequality constraints $\boldsymbol{\phi }(\boldsymbol{z}_{\mathrm{a}}, \boldsymbol{z}_{ \mathrm{b}}, \boldsymbol{z}_{\mathrm{c}}, \ldots )\geq \boldsymbol{0} \in \mathbb{R}^{n_{\mathrm{i}}}$, where $n_{\mathrm{i}}$ is the number of inequality constraints on the mechanism. As for joints, inequality constraints are typically formulated independently for all $j$ pairs of rigid bodies from their signed distance function and subsequently stacked into one constraint function $\boldsymbol{\phi } = [\phi _{1}^{\mathsf{T}}\cdots \phi _{j}^{ \mathsf{T}}]^{\mathsf{T}}$.

As an example, ground contact of a single point contact on a body in maximal coordinates always has the form $\phi (\boldsymbol{z}) = \boldsymbol{e}_{\mathrm{z}}^{\mathsf{T}}( \boldsymbol{x} + \boldsymbol{q}\cdot \boldsymbol{p}\cdot \boldsymbol{q}^{-1}) \geq 0$, where $\boldsymbol{e}_{\mathrm{z}}$ is the z-axis unit vector in the global frame and $\boldsymbol{p}\in \mathbb{R}^{3}$ points from center of mass to contact point in the body’s frame. As before, this consistent structure allows for the calculation of analytic gradients to reduce computation time.

Static and sliding friction

In the dynamics, static and sliding friction are added as external forces on the bodies. These forces are calculated by solving an optimization problem derived from the maximum dissipation principle [37]. We are using a linearized friction cone [38], although a nonlinear friction cone could be used as well. The maximum dissipation principle states that the energy dissipation rate of the bodies in contact is maximized by a friction force $\boldsymbol{\beta }\in \mathbb{R}^{n_{\mathrm{c}}n_{\mathrm{f}}}$, where $n_{\mathrm{c}}$ is the number of contacts and $n_{\mathrm{f}}$ is the even number of basis vectors of the friction cone. The basis vectors $\boldsymbol{b}_{i}\in \mathbb{R}^{3}$ of the linearized friction cone are depicted in Fig. 1 (b). A detailed derivation of the optimization problem is stated in Appendix C. The optimization problem resulting from the maximum dissipation principle for the bodies of a mechanism is

min_{β} [\begin{array}{c} {\dot{z}}_{a}^{T} & {\dot{z}}_{b}^{T} & {\dot{z}}_{c}^{T} & \dots \end{array}] B {(z_{a}, z_{b}, z_{c}, \dots)}^{T} β,

(2a)

s.t. E^{T} β \leq C_{f} γ,

(2b)

β \geq 0,

(2c)

where $\boldsymbol{B}\in \mathbb{R}^{6n_{\mathrm{b}}\times n_{\mathrm{c}}n_{ \mathrm{f}}}$ maps the $n_{\mathrm{f}}$-dimensional friction forces at each of the $n_{\mathrm{c}}$ contact point to a six-dimensional wrench on the respective bodies. The constraint $\boldsymbol{E}^{\mathsf{T}}\boldsymbol{\beta } \leq \boldsymbol{C}_{ \mathrm{f}}\boldsymbol{\gamma }$ describes the limit on the friction forces with $\boldsymbol{E}=\mathrm{diag}(\pmb{1}, \ldots , \pmb{1})\in \mathbb{R}^{n_{\mathrm{c}}n_{\mathrm{f}}\times n_{\mathrm{c}}}$, $\pmb{1} = [1 \ \cdots \ 1]^{\mathsf{T}}\in \mathbb{R}^{n_{\mathrm{f}}}$, normal forces $\boldsymbol{\gamma }\in \mathbb{R}^{n_{\mathrm{c}}}$, and the friction coefficient matrix $\boldsymbol{C}_{\mathrm{f}} = \mathrm{diag}(c_{\mathrm{f},1},\ldots ,c_{ \mathrm{f},n_{\mathrm{c}}})\in \mathbb{R}^{n_{\mathrm{c}}\times n_{ \mathrm{c}}}$ containing the friction coefficients $c_{\mathrm{f}}$ for each contact point. The friction wrenches for each body resulting from the optimization are added as external wrenches to the dynamics.

As an example, the optimization problem for ground contact of a single point contact on a body in maximal coordinates always has the form

min_{β} {\dot{z}}^{T} B^{T} β,

(3a)

(3b)

β \geq 0,

(3c)

with $\boldsymbol{B}^{\mathsf{T}}=[\boldsymbol{B}_{\boldsymbol{x}} \ \boldsymbol{B}_{\boldsymbol{q}}]^{\mathsf{T}}$ consisting of

$$\begin{aligned} \boldsymbol{B}_{\boldsymbol{x}}^{\mathsf{T}}&= \begin{bmatrix} \boldsymbol{b}_{1} ~ -\boldsymbol{b}_{1} ~ \cdots ~ \boldsymbol{b}_{ \frac{n_{\mathrm{f}}}{2}} ~ -\boldsymbol{b}_{\frac{n_{\mathrm{f}}}{2}} \end{bmatrix}\in \mathbb{R}^{3\times n_{\mathrm{f}}}, \end{aligned}$$

(4a)

$$\begin{aligned} \boldsymbol{B}_{\boldsymbol{q}}^{\mathsf{T}}&= \boldsymbol{p}^{ \times }\boldsymbol{Q}(\boldsymbol{q})\boldsymbol{B}_{\boldsymbol{x}}^{ \mathsf{T}}\in \mathbb{R}^{3\times n_{\mathrm{f}}}, \end{aligned}$$

(4b)

where the vector from the body’s center of mass to the contact point is denoted $\boldsymbol{p}$, the ^× operator creates the skew-symmetric matrix from this vector, and $\boldsymbol{Q}(\boldsymbol{q})$ is the rotation matrix for quaternion $\boldsymbol{q}$.

3 Mathematical integrator

A first-order variational (symplectic) integrator [26] is derived for the simulator components described in the previous section. This integrator discretizes the rigid-body dynamics while maintaining energy and momentum conservation properties as well as constraint satisfaction. Higher-order variational integrators are possible [39, 40], and we restrict the derivation to the first order for clarity. First, the derivation for unconstrained dynamics is provided. Afterward, equality and inequality constraints are added to the integrator, and finally, friction dynamics are incorporated.

3.1 Unconstrained dynamics

The derivation of the integrator for unconstrained dynamics is split into translational and rotational components for clarity. Note that the derivation also holds for coupled translational and rotational dynamics. Variational integrators are based on the principle of least action, which states that a mechanical system takes the path of least action when going from a fixed starting point to a fixed ending point. External forces and torques are incorporated with the Lagrange-d’Alembert principle. Action has the dimensions $[\text{Energy}]\times [\text{Time}]$ and the unconstrained action integral $S_{0}$ with is defined as

$$ S_{0}(\boldsymbol{z},\dot{\boldsymbol{z}}) = \int _{t_{0}}^{t_{N}} \mathcal{L}(\boldsymbol{z},\dot{\boldsymbol{z}}) ~ \mathrm{d}t + \int _{t_{0}}^{t_{N}} \begin{bmatrix} \boldsymbol{\mathrm{f}}^{\mathsf{T}}& 2\boldsymbol{L}(\boldsymbol{q}) \boldsymbol{V}^{\mathsf{T}}\boldsymbol{\tau }\end{bmatrix} ^{\mathsf{T}}\boldsymbol{z} ~ \mathrm{d}t, $$

(5)

where $\mathcal{L} = \mathcal{T}-\mathcal{V}$ is the Lagrangian with kinetic energy $\mathcal{T}$ and potential energy $\mathcal{V}$. A brief explanation of the external force and torque components in (5), as well as our quaternion notation, are given in Appendix A. Further details on virtual work for quaternions can be found in [41, 42].

Translational component

The translational component of the action integral (5) is

$$ S_{0,\mathrm{T}}(\boldsymbol{x},\boldsymbol{v}) = \int _{t_{0}}^{t_{N}} \mathcal{L}_{\mathrm{T}}(\boldsymbol{x},\boldsymbol{v}) ~ \mathrm{d}t + \int _{t_{0}}^{t_{N}}\boldsymbol{\mathrm{f}}^{\mathsf{T}} \boldsymbol{x} ~ \mathrm{d}t. $$

(6)

For numerical integration, (6) is discretized. A first-order discretization of the integral and a first-order approximation of the velocity,

$$ \boldsymbol{v}_{k} = \frac{\boldsymbol{x}_{k+1}-\boldsymbol{x}_{k}}{\Delta t}, $$

(7)

with step size $\Delta t$ is used to obtain the discrete action sum

$$ S_{\mathrm{d}, 0, \mathrm{T}}(\boldsymbol{x}_{k},\boldsymbol{v}_{k}) = \sum _{k=0}^{N-1}\left ( \mathcal{L}_{\mathrm{T}}(\boldsymbol{x}_{k}, \boldsymbol{v}_{k}) + \boldsymbol{\mathrm{f}}_{k}^{\mathsf{T}} \boldsymbol{x}_{k}\right ) \Delta t. $$

(8)

For a first-order integrator, the principle of least action must be fulfilled for trajectories consisting of three knot points, i.e., from 0 to $N=2$. Since $\boldsymbol{x}_{0}$ and $\boldsymbol{x}_{2}$ are fixed start and end points of the trajectory, only $\boldsymbol{x}_{1}$ can vary. Therefore, the action sum is minimized with respect to the position $\boldsymbol{x}_{1}$:

$$ \nabla _{\boldsymbol{x}_{1}}S_{\mathrm{d}, 0, \mathrm{T}} = - \boldsymbol{d}_{\mathrm{0,T}}\Delta t = \boldsymbol{0}, $$

(9)

where $\boldsymbol{d}_{\mathrm{0,T}}$ are the resulting implicit discretized translational dynamics. In other words, if Equation (9) is fulfilled, we have found the discrete approximation of the physically correct trajectory consisting of the knot points $\boldsymbol{x}_{0}$, $\boldsymbol{x}_{1}$, and $\boldsymbol{x}_{2}$. Note that the derivative with respect to $\boldsymbol{x}_{1}$ is taken for each body in a mechanism.

An example for a single body with potential function $\mathcal{V}(\boldsymbol{x})$ yields

$$\begin{aligned} \boldsymbol{d}_{\mathrm{0,T}}(\boldsymbol{v}_{1}) &= -\nabla _{ \boldsymbol{x}_{1}}\left (\mathcal{L}_{\mathrm{T}}(\boldsymbol{x}_{0}, \boldsymbol{v}_{0}) + \boldsymbol{\mathrm{f}}_{0}^{\mathsf{T}} \boldsymbol{x}_{0} + \mathcal{L}_{\mathrm{T}}(\boldsymbol{x}_{1}, \boldsymbol{v}_{1}) + \boldsymbol{\mathrm{f}}_{1}^{\mathsf{T}} \boldsymbol{x}_{1}\right ) \\ &= -\left (\boldsymbol{M} \frac{\boldsymbol{x}_{1}-\boldsymbol{x}_{0}}{\Delta t} - \boldsymbol{M}\frac{\boldsymbol{x}_{2}-\boldsymbol{x}_{1}}{\Delta t} - \nabla _{\boldsymbol{x}_{1}}\mathcal{V}(\boldsymbol{x}_{1}) + \boldsymbol{\mathrm{f}}_{1}\right ) \\ &= \boldsymbol{M} \frac{\boldsymbol{v}_{1}-\boldsymbol{v}_{0}}{\Delta t} + \nabla _{ \boldsymbol{x}_{1}}\mathcal{V}(\boldsymbol{x}_{1}) - \boldsymbol{\mathrm{f}}_{1} = \boldsymbol{0}, \end{aligned}$$

(10)

which resembles the discretized version of Newton’s second law, $\boldsymbol{M}\dot{\boldsymbol{v}}-\boldsymbol{\mathrm{f}}= \boldsymbol{0}$.

The physically accurate dynamics are obtained by varying $\boldsymbol{x}_{1}$ with fixed $\boldsymbol{x}_{0}$ and $\boldsymbol{x}_{2}$. However, when integrating dynamics forward in time, we start from a known initial state $\boldsymbol{x}_{0}$ and $\boldsymbol{v}_{0}$. From (7), the position at the next time step, $\boldsymbol{x}_{1}$, is calculated as

$$ \boldsymbol{x}_{1} = \boldsymbol{x}_{0} + \boldsymbol{v}_{0}\Delta t. $$

(11)

Then, the implicit dynamics equations (9) are solved to obtain the velocity $\boldsymbol{v}_{1}$. This resulting integration scheme, consisting of (11) and (9), is the symplectic Euler method [25].

Rotational component

The rotational component of the integrator can be derived similarly to the translation case. A description for an unconstrained floating single rigid body is presented in [43], and we later extend the derivation to constrained multibody systems.

The action integral for the rotational component is

$$ S_{0,\mathrm{R}}(\boldsymbol{q},\boldsymbol{\omega }) = \int _{t_{0}}^{t_{N}} \mathcal{L}_{\mathrm{R}}(\boldsymbol{q},\boldsymbol{\omega })~ \mathrm{d}t + \int _{t_{0}}^{t_{N}}2\boldsymbol{\tau }^{\mathsf{T}} \boldsymbol{V} \boldsymbol{L}(\boldsymbol{q})^{\mathsf{T}} \boldsymbol{q}~\mathrm{d}t. $$

(12)

To maintain unit norm in the quaternion update (see Appendix A), the discrete quaternion angular velocity is defined as

$$ \bar{\boldsymbol{\omega }}_{k} = \begin{bmatrix} \sqrt{\left (\frac{2}{\Delta t}\right )^{2} - \boldsymbol{\omega }_{k}^{ \mathsf{T}}\boldsymbol{\omega }_{k}}~ \\ \boldsymbol{\omega }_{k}\end{bmatrix} = \frac{2}{\Delta t}\boldsymbol{L}(\boldsymbol{q}_{k})^{ \mathsf{T}}\boldsymbol{q}_{k+1}. $$

(13)

As before, the action integral (12) is discretized to obtain the action sum

$$ S_{\mathrm{d},0,\mathrm{R}}(\boldsymbol{q}_{k},\boldsymbol{\omega }_{k}) = \sum _{k=0}^{N-1}\left (\mathcal{L}_{\mathrm{R}}(\boldsymbol{q}_{k}, \boldsymbol{\omega }_{k}) + 2\boldsymbol{\tau }_{k}^{\mathsf{T}} \boldsymbol{V} \boldsymbol{L}(\boldsymbol{q}_{k})^{\mathsf{T}} \boldsymbol{q}_{k}\right ) \Delta t. $$

(14)

The principle of least action is fulfilled by minimizing the discrete action sum from 0 to $N=2$ over the orientation $\boldsymbol{q}_{1}$:

$$ \nabla ^{\mathrm{r}}_{\boldsymbol{q}_{1}}S_{\mathrm{d},0,\mathrm{R}} = -\boldsymbol{d}_{\mathrm{0,R}}\Delta t = \boldsymbol{0}, $$

(15)

where $\boldsymbol{d}_{\mathrm{0,R}}$ are the implicit discretized rotational dynamics. Note that we have used the rotational gradient $\nabla ^{\mathrm{r}}$ (see Appendix A) and that the derivative with respect to $\boldsymbol{q}_{1}$ is taken for each body in a mechanism.

An example for a single body with potential function $\mathcal{V}(\boldsymbol{q})$ yields

\begin{matrix} d_{0, R} (ω_{1}) = J ω_{1} \sqrt{\frac{4}{Δ t^{2}} - ω_{1}^{T} ω_{1}} + ω_{1}^{\times} J ω_{1} - \\ J ω_{0} \sqrt{\frac{4}{Δ t^{2}} - ω_{0}^{T} ω_{0}} + ω_{0}^{\times} J ω_{0} + \nabla_{q_{1}}^{r} V (q_{1}) - 2 τ_{2} = 0, \end{matrix}

(16)

which—analogous to the translational case—bears resemblance to Euler’s equations for rotations $J\dot{\boldsymbol{\omega }}+\boldsymbol{\omega }^{\times}J \boldsymbol{\omega } - \boldsymbol{\tau } = \boldsymbol{0}$.

An integration step given $\boldsymbol{q}_{0}$ and $\boldsymbol{\omega }_{0}$ is performed by first calculating

$$ \boldsymbol{q}_{1} = \frac{\Delta t}{2}\boldsymbol{L}(\boldsymbol{q}_{0}) \bar{\boldsymbol{\omega }}_{0}, $$

(17)

and subsequently solving (15) for the angular velocity $\boldsymbol{\omega }_{1}$.

3.2 Equality constrained dynamics

Equality constraint functions $\boldsymbol{g}(\boldsymbol{z})$ with Lagrange multiplier $\boldsymbol{\lambda }\in \mathbb{R}^{n_{\mathrm{e}}}$ are added to the integrator by appending them to the action integral:

$$ S(\boldsymbol{z},\dot{\boldsymbol{z}},\boldsymbol{\lambda }) = S_{0}( \boldsymbol{z},\dot{\boldsymbol{z}}) + \int _{t_{0}}^{t_{N}} \boldsymbol{\lambda }^{\mathsf{T}}\boldsymbol{g}(\boldsymbol{z}) ~ \mathrm{d}t. $$

(18)

Accordingly, the discrete action sum changes to

$$ S_{\mathrm{d}}(\boldsymbol{z}_{k},\dot{\boldsymbol{z}}_{k}, \boldsymbol{\lambda }_{k}) = S_{\mathrm{d},0}(\boldsymbol{z}_{k}, \dot{\boldsymbol{z}}_{k})+\sum _{k=0}^{N-1}\boldsymbol{\lambda }_{k}^{ \mathsf{T}}\boldsymbol{g}(\boldsymbol{z}_{k}) \Delta t. $$

(19)

Taking the gradient of (19) with respect to $\boldsymbol{x}_{1}$ and $\boldsymbol{q}_{1}$ yields the constrained implicit discretized dynamics

$$\begin{aligned} \boldsymbol{d}(\dot{\boldsymbol{z}}_{1},\boldsymbol{\lambda }_{1}) = \boldsymbol{d}_{0}(\dot{\boldsymbol{z}}_{1}) - \boldsymbol{G}( \boldsymbol{z}_{1})^{\mathsf{T}}\boldsymbol{\lambda }_{1} &= \boldsymbol{0}, \end{aligned}$$

(20a)

$$\begin{aligned} \boldsymbol{g}\left (\boldsymbol{z}_{2}\left (\dot{\boldsymbol{z}}_{1} \right )\right ) &= \boldsymbol{0}, \end{aligned}$$

(20b)

where

$$ \boldsymbol{G}(\boldsymbol{z}) = \begin{bmatrix} \frac{\partial \boldsymbol{g}(\boldsymbol{z})}{\partial \boldsymbol{x}} & \frac{\partial \boldsymbol{g}(\boldsymbol{z})}{\partial ^{\mathrm{r}} \boldsymbol{q}} \end{bmatrix} . $$

(21)

Physically, the constraint forces $\boldsymbol{G}(\boldsymbol{z}_{1})^{\mathsf{T}}\boldsymbol{\lambda }_{1}$ act on the rigid bodies to guarantee satisfaction of constraints $\boldsymbol{g}=\boldsymbol{0}$. Mathematically, $\boldsymbol{\lambda }$ serves a similar purpose as Lagrange multipliers in constrained optimization.

The integration step starting from $\boldsymbol{z}_{0}$ and $\dot{\boldsymbol{z}}_{0}$ is calculated as follows. First, $\boldsymbol{z}_{1}$ is calculated from the update rules (11) and (17). Then, the nonlinear system of equations (20a)–(20b) is solved for $\dot{\boldsymbol{z}}_{1}$ and $\boldsymbol{\lambda }_{1}$. Note that the constraints are fulfilled for $\boldsymbol{z}_{2}$, which depends on $\dot{\boldsymbol{z}}_{1}$ through update rules (11) and (17). Therefore, the resulting velocity $\dot{\boldsymbol{z}}_{1}$ always ensures constraint satisfaction for the next position $\boldsymbol{z}_{2}$.

3.3 Inequality constrained dynamics

Inequality constraint functions $\boldsymbol{\phi }(\boldsymbol{z})$ with Lagrange multipliers $\boldsymbol{\gamma }\in \mathbb{R}^{n_{\mathrm{i}}}$ are added to the integrator in a similar fashion. Physically, the multipliers $\boldsymbol{\gamma }$ are the magnitudes of the normal forces at the contacts. To add the constraints to the dynamics, they are discretized and formulated as a nonlinear complementarity problem (NCP)

$$\begin{aligned} \boldsymbol{\phi }(\boldsymbol{z}_{k}) &\geq \boldsymbol{0}, \end{aligned}$$

(22a)

$$\begin{aligned} \boldsymbol{\gamma }_{k} &\geq \boldsymbol{0}, \end{aligned}$$

(22b)

$$\begin{aligned} \boldsymbol{\phi }(\boldsymbol{z}_{k})^{\mathsf{T}} \boldsymbol{\gamma }_{k} &= \boldsymbol{0}, \end{aligned}$$

(22c)

with element-wise ≥, for which we use the standard shorthand notation

$$ \boldsymbol{0}\leq \boldsymbol{\phi }(\boldsymbol{z}_{k})\perp \boldsymbol{\gamma }_{k}\geq \boldsymbol{0}. $$

(23)

The resulting dynamics are

$$\begin{aligned} \boldsymbol{d}(\dot{\boldsymbol{z}}_{1},\boldsymbol{\gamma }_{1}) = \boldsymbol{d}_{0}(\dot{\boldsymbol{z}}_{1}) - \boldsymbol{N}( \boldsymbol{z}_{1})^{\mathsf{T}}\boldsymbol{\gamma }_{1} &= \boldsymbol{0}, \end{aligned}$$

(24a)

$$\begin{aligned} \boldsymbol{0}\leq \boldsymbol{\phi }(\boldsymbol{z}_{2}( \dot{\boldsymbol{z}}_{1}))\perp \boldsymbol{\gamma }_{1}&\geq \boldsymbol{0}, \end{aligned}$$

(24b)

where

$$ \boldsymbol{N}(\boldsymbol{z}) = \begin{bmatrix} \frac{\partial \boldsymbol{\phi }(\boldsymbol{z})}{\partial \boldsymbol{x}} & \frac{\partial \boldsymbol{\phi }(\boldsymbol{z})}{\partial ^{\mathrm{r}} \boldsymbol{q}} \end{bmatrix} . $$

(25)

The integration step starting from $\boldsymbol{z}_{0}$ and $\dot{\boldsymbol{z}}_{0}$ is again performed by first calculating $\boldsymbol{z}_{1}$ from (11) and (17), and then solving (24a)–(24b) for $\dot{\boldsymbol{z}}_{1}$ and $\boldsymbol{\gamma }_{1}$.

3.4 Friction dynamics

The friction dynamics are included in the variational integrator by discretizing the maximum dissipation principle (2a)–(2c):

min_{β} {\dot{z}}_{k}^{T} B {(z_{k})}^{T} β_{k},

(26a)

s.t. E^{T} β_{k} \leq C_{f} γ_{k},

(26b)

β_{k} \geq 0 .

(26c)

Note that the normal force multipliers $\boldsymbol{\gamma }$ result from contact inequality constraints in the form of (22a)–(22c).

As with the contact constraint, we formulate (26a)–(26c) as an NCP with Lagrange multipliers $\boldsymbol{\psi }_{k}\in \mathrm{R}^{n_{\mathrm{c}}}$—the tangential velocities at the contact points—and $\boldsymbol{\eta }_{k}\in \mathbb{R}^{n_{\mathrm{c}}n_{\mathrm{f}}}$:

$$\begin{aligned} \boldsymbol{B}(\boldsymbol{z}_{k})\dot{\boldsymbol{z}}_{k} + \boldsymbol{E}\boldsymbol{\psi }_{k} - \boldsymbol{\eta }_{k} &= \boldsymbol{0}, \end{aligned}$$

(27a)

$$\begin{aligned} \boldsymbol{0} \leq \boldsymbol{C}_{\mathrm{f}}\boldsymbol{\gamma }_{k} - \boldsymbol{E}^{\mathsf{T}}\boldsymbol{\beta }_{k} \perp \boldsymbol{\psi }_{k} &\geq \boldsymbol{0}, \end{aligned}$$

(27b)

$$\begin{aligned} \boldsymbol{0} \leq \boldsymbol{\beta }_{k} \perp \boldsymbol{\eta }_{k} &\geq \boldsymbol{0}. \end{aligned}$$

(27c)

Given a mechanism with $n_{\mathrm{c}}$ contact inequality constraints with friction, the resulting dynamics are

$$\begin{aligned} \boldsymbol{d}(\dot{\boldsymbol{z}}_{1},\boldsymbol{\gamma }_{1}, \boldsymbol{\beta }_{1},\boldsymbol{\psi }_{1},\boldsymbol{\eta }_{1}) = \boldsymbol{d}_{0}(\dot{\boldsymbol{z}}_{1}) - \boldsymbol{N}( \boldsymbol{z}_{1})^{\mathsf{T}}\boldsymbol{\gamma }_{1} - \boldsymbol{B}(\boldsymbol{z}_{1})^{\mathsf{T}}\boldsymbol{\beta }_{1} &= \boldsymbol{0}, \end{aligned}$$

(28a)

$$\begin{aligned} \boldsymbol{0}\leq \boldsymbol{\phi }(\boldsymbol{z}_{2}( \dot{\boldsymbol{z}}_{1}))\perp \boldsymbol{\gamma }_{1}&\geq \boldsymbol{0}, \end{aligned}$$

(28b)

$$\begin{aligned} \boldsymbol{B}(\boldsymbol{z}_{1})\dot{\boldsymbol{z}}_{1} + \boldsymbol{E}\boldsymbol{\psi }_{1} - \boldsymbol{\eta }_{1} &= \boldsymbol{0}, \end{aligned}$$

(28c)

$$\begin{aligned} \boldsymbol{0} \leq \boldsymbol{C}_{\mathrm{f}}\boldsymbol{\gamma }_{1} - \boldsymbol{E}^{\mathsf{T}}\boldsymbol{\beta }_{1} \perp \boldsymbol{\psi }_{1} &\geq \boldsymbol{0}, \end{aligned}$$

(28d)

$$\begin{aligned} \boldsymbol{0} \leq \boldsymbol{\beta }_{1} \perp \boldsymbol{\eta }_{1} &\geq \boldsymbol{0}, \end{aligned}$$

(28e)

and the integration step starting from $\boldsymbol{z}_{0}$ and $\dot{\boldsymbol{z}}_{0}$ is performed by first calculating $\boldsymbol{z}_{1}$ from (11) and (17), and then solving (28a)–(28e) for $\dot{\boldsymbol{z}}_{1}$, $\boldsymbol{\gamma }_{1}$, $\boldsymbol{\beta }_{1}$, $\boldsymbol{\psi }_{1}$, and $\boldsymbol{\eta }_{1}$.

3.5 Complete integrator

Putting all components together, the simulation of one time step given $\boldsymbol{z}_{0}$ and $\dot{\boldsymbol{z}}_{0}$ is performed by first calculating $\boldsymbol{z}_{1}$ from (11) and (17). Subsequently, the constrained implicit dynamics must be solved:

$$\begin{aligned} \boldsymbol{d}(\dot{\boldsymbol{z}}_{1},\boldsymbol{\lambda }_{1}, \boldsymbol{\gamma }_{1},\boldsymbol{\beta }_{1} ,\boldsymbol{\psi }_{1},\boldsymbol{\eta }_{1}) = \boldsymbol{d}_{0}(\dot{\boldsymbol{z}}_{1}) - \boldsymbol{G}^{ \mathsf{T}}\boldsymbol{\lambda }_{1} - \boldsymbol{N}^{\mathsf{T}} \boldsymbol{\gamma }_{1} - \boldsymbol{B}^{\mathsf{T}} \boldsymbol{\beta }_{1} &= \boldsymbol{0}, \end{aligned}$$

(29a)

$$\begin{aligned} \boldsymbol{g}\left (\boldsymbol{z}_{2}(\dot{\boldsymbol{z}}_{1}) \right ) &= \boldsymbol{0}, \end{aligned}$$

(29b)

$$\begin{aligned} \boldsymbol{0}\leq \boldsymbol{\phi }\left (\boldsymbol{z}_{2}( \dot{\boldsymbol{z}}_{1})\right )\perp \boldsymbol{\gamma }_{1}&\geq \boldsymbol{0}, \end{aligned}$$

(29c)

$$\begin{aligned} \boldsymbol{B}\dot{\boldsymbol{z}}_{1} + \boldsymbol{E} \boldsymbol{\psi }_{1} - \boldsymbol{\eta }_{1} &= \boldsymbol{0}, \end{aligned}$$

(29d)

$$\begin{aligned} \boldsymbol{0} \leq \boldsymbol{C}_{\mathrm{f}} \boldsymbol{\gamma }_{1} - \boldsymbol{E}^{\mathsf{T}} \boldsymbol{\beta }_{1} \perp \boldsymbol{\psi }_{1} & \geq \boldsymbol{0}, \end{aligned}$$

(29e)

$$\begin{aligned} \boldsymbol{0} \leq \boldsymbol{\beta }_{1} \perp \boldsymbol{\eta }_{1} &\geq \boldsymbol{0} . \end{aligned}$$

(29f)

Note that (29a)–(29f) contains the equations and constraints for all bodies of a mechanism.

If the initial velocity $\dot{\boldsymbol{z}}_{0}$ is unknown, and a non-constraint-fulfilling one is chosen, an error occurs at the first time step. The magnitude of the error depends on the magnitude of the initial constraint violation. The discrete Legendre transform can be used to determine a constraint-fulfilling initial velocity. We refer to the extensive explanation in [30] for details on initializing a simulation with the Legendre transform.

4 Numerical solver

The variational integrator (29a)–(29f) can be summarized as a system of nonlinear equations with inequality constraints:

$$\begin{aligned} \boldsymbol{f}(\boldsymbol{s})&=\boldsymbol{0}, \end{aligned}$$

(30a)

$$\begin{aligned} \boldsymbol{h}(\boldsymbol{s})&\geq \boldsymbol{0}, \end{aligned}$$

(30b)

with solution vector $\boldsymbol{s} = [\dot{\boldsymbol{z}}_{1}^{\mathsf{T}}\ \boldsymbol{\lambda }_{1}^{\mathsf{T}}\ \boldsymbol{\gamma }_{1}^{ \mathsf{T}}\ \boldsymbol{\beta }_{1}^{\mathsf{T}}\ \boldsymbol{\psi }_{1}^{ \mathsf{T}}\ \boldsymbol{\eta }_{1}^{\mathsf{T}}]^{\mathsf{T}}$. In this section, the algorithms derived for solving the system (30a)–(30b) are applicable to the class of Newton-based root-finding methods. Certain methods of this class, for example, interior-point methods [44], introduce slack variables $\boldsymbol{\sigma }$ and additional constraints $\boldsymbol{h}(\boldsymbol{s})=\boldsymbol{\sigma }$ to facilitate the numerical treatment of the inequality constraints. Since the slack variables match the inequality constraints up to the desired solution tolerance, the theoretical properties of the integrator still hold. The solution vector $\boldsymbol{s}$ and the system $\boldsymbol{f}(\boldsymbol{s})$ would be extended by these slack variables and constraints, but the computational complexity does not change if each inequality constraint and its associated slack constraint are considered as a single node in a graph. Moreover, the graph-based argument is not limited to mechanical systems and can be implemented for any graph-based system, but we will restrict the discussion to mechanical systems.

At the core, Newton-based methods iteratively produce solution approximations for (30a)–(30b) with the procedure

$$ \boldsymbol{s}^{(i+1)} = \boldsymbol{s}^{(i)} - \boldsymbol{F}( \boldsymbol{s}^{(i)})^{-1}\boldsymbol{f}(\boldsymbol{s}^{(i)}), $$

(31)

where

$$ \boldsymbol{F}(\boldsymbol{s}) = \frac{\partial \boldsymbol{f}(\boldsymbol{s})}{\partial \boldsymbol{s}}. $$

(32)

Numerically, (31) is formulated as a linear system of equations

$$ \boldsymbol{F}(\boldsymbol{s}^{(i)})\Delta \boldsymbol{s}^{(i)} = - \boldsymbol{f}(\boldsymbol{s}^{(i)}), $$

(33)

where the result $\Delta \boldsymbol{s}^{(i)}$ is used to obtain $\boldsymbol{s}^{(i+1)} = \boldsymbol{s}^{(i)} + \Delta \boldsymbol{s}^{(i)}$.

Linear systems of the form (33) are solved with decomposition and backsubstitution with overall cubic computational complexity $\mathcal{O}(n^{3})$. For general integrators, including the variational integrator derived in the previous section, (33) is neither symmetric nor block symmetric. Therefore, the LDU decomposition [45] for asymmetric systems is chosen as the foundation of the algorithms. While (33) is asymmetric, its sparsity pattern, i.e., the zero and non-zero entries, is block-symmetric, and the following algorithms exploit this sparsity to improve complexity.

4.1 Linear-complexity algorithm

For mechanisms without kinematic loops, the LDU decomposition can be modified to obtain decomposition and backsubstitution with linear computational complexity $\mathcal{O}(n)$, where $n$ is the number of nodes in the corresponding graph. This modification is achieved by taking into account the graph representing the components of a mechanism and its associated $\boldsymbol{F}$ matrix. Consider, for example, the mechanism and graph in Fig. 2.

According to [46], the $\boldsymbol{F}$ matrix corresponding to an acyclic graph, i.e., a mechanism without kinematic loops, can be decomposed with linear complexity by traversing the graph from leaves to root.

A depth-first search (DFS) starting from the (arbitrary) root is performed to find the correct processing order. The found nodes are stored in a list with the root as the last element and the last-found node as the first element. This list is then used in the modified LDU decomposition (Algorithm 1) and backsubstitution (Algorithm 2). Note that in the algorithms, vector indices $i$ stand for the respective rows of node $i$, and matrix indices $i$, $j$ stand for the respective rows of node $i$ and columns of node $j$.

The decomposition in Algorithm 1 processes the matrix $\boldsymbol{F}$ according to the graph structure from leaves to root. Additionally, for each node, computations are only performed for connected components, since computations for disconnected components are zero in the LDU decomposition. The linear complexity is a direct result.

Decomposition complexity: In an acyclic graph with $n$ nodes, each node has at most one parent, so there are $\mathcal{O}(n)$ children (and $\mathcal{O}(n)$ parents). Therefore, a total of $\mathcal{O}(n)$ evaluations of the for-loop on line 2 of Algorithm 1 are required. The result is a linear complexity $\mathcal{O}(n)$.

Backsubstitution complexity: In an acyclic graph with $n$ nodes, there are $\mathcal{O}(n)$ children and $\mathcal{O}(n)$ parents. Therefore, a total of $\mathcal{O}(n)$ evaluations of the for-loops on lines 3 and 9 of Algorithm 2 are required. The result is a linear complexity $\mathcal{O}(n)$.

Articulated mechanism example

The comparison of dense and sparse LDU decomposition for the example in Fig. 2 is shown in Fig. 3.

The matrices in Fig. 3 have off-diagonal entries only at the intersection of directly connected nodes. For the example mechanism in Fig. 2, the diagonal entries $\boldsymbol{D}_{1}$ to $\boldsymbol{D}_{5}$ are the derivatives of the dynamics $\boldsymbol{d}$ of each body, and the off-diagonal entries $\boldsymbol{c}_{ij}$ are the equality constraint derivatives representing joints between two bodies. Note that when processing the matrix in the wrong order, a so-called fill-in is created. This fill-in occurs at the off-diagonals of nodes indirectly connected through a node that is processed before it becomes a leaf. Since fill-in must also be processed in the decomposition and backsubstitution, linear complexity is no longer achieved.

Environment contact example

A direct result of the linear-complexity property for acyclic graphs is the following. Mechanisms without kinematic loops and only environment contact, i.e., no contact between bodies, correspond to acyclic graphs and, therefore, have linear complexity in the number of bodies and contact points. Consider the exemplary mechanism, graph, and matrix in Fig. 4.

Since all environment contacts are leaves in the graph, adding contact points leads to linear scaling with the correct processing order. This result is especially interesting for bipedal or quadrupedal walking robots.

4.2 Reduced-fill-in algorithm

If the graph of a mechanism has cycles, fill-in can generally no longer be avoided entirely, since nodes of a cycle are never leaves. However, by processing the leaves attached to a cycle first, the amount of fill-in is reduced. As an example, the mechanism from Fig. 2 is modified to contain a kinematic loop, displayed in Fig. 5.

Note that for computational reasons, two joints are combined into node 6, since these two joints form the beginning of the kinematic loop. Such loop-openers are also found with the depth-first search, since they are simply the first and last joints in a detected loop. In the algorithms, a distinction is made between nodes that are part of a cycle and nodes that are not. This distinction is also determined by the depth-first search.

Since any square matrix can be represented by a (potentially cyclic) graph, the following algorithms are applicable to all such matrices. In the case of mechanical systems, they can be used for all components defined in Sect. 2. The sparse decomposition for such systems is formulated in Algorithm 3 and the backsubstitution in Algorithm 4.

Note that all lists in Algorithms 3 and 4 are sorted in the same order as the depth-first search list.

Decomposition complexity: In a cyclic graph with $n$ nodes and $k$ cycles, there are $\mathcal{O}(n)$ acyclic children and $\mathcal{O}(n)$ acyclic parents. Additionally, there are $\mathcal{O}(n)$ cyclic children and $\mathcal{O}(n)$ loop-opening parents per cycle, since each cycle contains at most all nodes and each of these nodes has the same loop-opening parent. Therefore, a total of $\mathcal{O}(n)$ evaluations of the for-loop on line 2 of Algorithm 3 and a total of $\mathcal{O}(kn^{2})$ evaluations of the for-loop on line 8 are required. Resulting is a complexity $\mathcal{O}(n+kn^{2})$—quadratic in $n$ and linear in $k$.

Backsubstitution complexity: In a cyclic graph with $n$ nodes and $k$ cycles, there are $\mathcal{O}(n)$ acyclic children, $\mathcal{O}(n)$ acyclic parents, $\mathcal{O}(n)$ cyclic children per cycle, and $\mathcal{O}(n)$ loop-opening parents per cycle. Therefore, a total of $\mathcal{O}(n+nk)$ evaluations of the for-loop on line 3 of Algorithm 4 and a total of $\mathcal{O}(n+nk)$ evaluations of the for-loop on line 9 are required. This results in complexity $\mathcal{O}(n+nk)$—linear in $n$ and linear in $k$.

In the worst case of a fully connected graph, i.e., a fully dense matrix, each node has $\mathcal{O}(n)$ cyclic children and parents, resulting in a complexity of $\mathcal{O}(n+n^{3})$. However, for real systems, the theoretical complexities are often too conservative. For a system without intersecting cycles, i.e., each node belongs to at most a single cycle, there are a total of $\mathcal{O}(n)$ cyclic children and parents and, therefore, the overall complexity is $\mathcal{O}(n+n^{2})$. In the case of a constant cycle size, there is a total of $\mathcal{O}(k)$ cyclic children and parents, and a linear complexity $\mathcal{O}(n + k)$ is obtained. In a combined setting with non-intersecting cycles of fixed size, for example, a mechanism with disconnected identical legs made of kinematic loops, there are $\mathcal{O}(n)$ cycles with $\mathcal{O}(1)$ cyclic children and parents each, resulting in a linear complexity $\mathcal{O}(n)$. Improvements in the complexity are theoretically possible, but finding a processing order that creates the minimum fill-in is generally NP-hard [46].

The comparison of dense and sparse decomposition for the example in Fig. 4 with additional dampers at each joint is shown in Fig. 6.

The dampers depend on the relative velocity between the two connected bodies, leading to additional off-diagonal entries. As Fig. 6 shows, with a bad processing order, an almost fully dense matrix is obtained due to fill-in, while the correct processing order creates fill-in only at the off-diagonals of nodes that are part of cycles and the loop-openers.

5 Evaluation

The evaluation of the simulator is comprised of four parts. First, the physical accuracy of the variational integrator is analyzed. Then, the runtime and computational complexity of the graph-based algorithms are investigated. Next, the numerical robustness of the simulator is tested. Lastly, two application examples for the simulator are given. Comparisons are made to different simulators and integrators. We compare to the dynamics simulator RigidBodyDynamics [47] as it is written in the same programming language as our integrator, Julia [48], and to the widely used simulator MuJoCo, which is representative for soft constraint handling. Variational integrator comparisons are made with two minimal-coordinate integrators [28, 29] and an integrator in redundant coordinates for constrained systems [17]. In order to solve the integrator equations (29a)–(29f), a basic interior-point method is used and described in Appendix D. Note that the focus of the implementation is on the theoretical properties and not runtime optimization. Nonetheless, even this basic implementation achieves reasonable timing results, benefitting from the easy and modular implementation in maximal coordinates. Code for the simulator^{Footnote 1} and the graph-based system solver^{Footnote 2} including all experiments and additional examples is available in Julia. All experiments are carried out on an Asus ZenBook with an i7-8565U CPU and 16 GB RAM.

5.1 Physical accuracy

The physical accuracy of the simulator is examined in four scenarios: constraint drift, energy conservation, energy dissipation, and contact violation. A comparison to commonly used alternative implementations is provided for reference. The results are displayed in Fig. 7.

For the constraint drift in Fig. 7 (a), a three-link mechanism forming a kinematic loop is used with link lengths $l_{1}=1\text{ m}$, $l_{2}={\frac{\sqrt{2}}{2}}\text{ m}$, $l_{3}=1\text{ m}$, masses $m_{1}=1\text{ kg}$, $m_{2}={\frac{\sqrt{2}}{2}}\text{ kg}$, $m_{3}=1\text{ kg}$, and a step size $\Delta t=0.01\text{ s}$. In minimal coordinates, the loop-closure constraints must be explicitly enforced. In non-variational integrators, the dynamics and constraints are formulated on an acceleration level. Without constraint stabilization [49], i.e., a spring-damper connection, the links of the mechanism start to drift apart. The explicit 4th-order Runge–Kutta–Munte–Kaas integrator [25] exhibits constraint drift without such stabilization. The 1st-order variational integrator prevents constraint drift entirely and does not require constraint stabilization.

The energy conservation in Fig. 7 (b) is evaluated on a frictionless double pendulum with link lengths $l=1\text{ m}$, masses $m=1\text{ kg}$, and a step size $\Delta t=0.01\text{ s}$. Unlike variational integrators, strictly explicit or implicit Runge–Kutta integrators do not generally have tight bounds on the energy error for conservative mechanical systems, although certain methods, for example symmetric ones, do [25]. The explicit 2nd-order Runge–Kutta method (Heun’s method) injects energy into the system, while the energy error stays bounded for the variational integrator.

Accurate energy dissipation is an important property, for example, in passivity-based control approaches [50]. The dissipation behavior in Fig. 7 (c) is evaluated on a damped pendulum with link length $l=1\text{ m}$, mass $m=1\text{ kg}$, joint damping $d=\frac{1}{2}\text{N}\frac{\text{s}}{\text{m}}$ and a step size $\Delta t=0.1\text{ s}$. Euler’s method used for comparison shows poor dissipation behavior in drastically underdamping the pendulum, whereas the variational integrator demonstrates good dissipation performance after a small initial error.

Correct simulation of rigid contacts is crucial for transferring learned or optimized control policies from simulation to real systems. To compare rigid and soft contacts, a cube with edge length $l=0.5\text{ m}$, mass $m=1\text{ kg}$, and step size $\Delta t=0.01\text{ s}$ is dropped from a height (bottom-to-ground) $h=0.4\text{ m}$. MuJoCo’s default solver parameters ($\mathrm{solimp} = (0.9, 0.95, 0.001, 0.5, 2)$, $\mathrm{solref} = (0.02, 1)$) and default Euler integrator are used and result in a ground violation of $2.7\text{ cm}$. We also analyzed drops from other heights up to 1 m, which resulted in similar violations. In contrast, the rigid contact formulation with the variational integrator in maximal coordinates stops 43$\mathrm{\mu}$m above the ground due to the interior-point formulation, which practically satisfies the constraint.

5.2 Computational complexity

The evaluation of the computational complexity serves two purposes. We show that the complexity of the graph-based algorithms holds in practice, and by using these algorithms, maximal coordinates can achieve competitive timing results compared to minimal coordinates, despite their larger dimension. For all timings, the best result of 100 samples is used to diminish right-skewing computer noise. The linear complexity of the simulator and a comparison to minimal coordinates are shown in Fig. 8. The performance for systems with kinematic loops and comparisons to a dense solver are displayed in Fig. 9.

Figure 8 (a) compares the computation time of our simulator with the RigidBodyDynamics simulator for 1000 time steps of $n$-link pendulums with link length $l=1\text{ m}$, mass $m=1\text{ kg}$, and step size $\Delta t=0.01\text{ s}$. The comparison is made for revolute and spherical (ball-and-socket) joints between the links. The main result is that despite the higher dimension of maximal coordinates, comparable computation times are achieved. Minimal coordinates are naturally faster for revolute joints, as they have the smallest number of degrees of freedom in minimal coordinates and the most constraints in maximal coordinates. On the other hand, spherical joints perform worse in minimal coordinates as these joints increase their state dimension and reduce the number of constraints in maximal coordinates. The linear complexity for both minimal and maximal coordinates becomes clearly visible.

In Fig. 8 (b), the linear complexity for contacts is demonstrated for a chain of spheres with radius $r=0.25\text{ m}$, mass $m=1\text{ kg}$, and a step size $\Delta t=0.01\text{ s}$. The spheres are connected by spherical joints. A comparison to RigidBodyDynamics is not possible due to the limited support of contacts. Besides the complexity, this experiment demonstrates the numerical robustness of the maximal-coordinate approach for treating contacts. We also implemented the experiment in MuJoCo. While faster, MuJoCo with default solver parameters consistently has contact violations of 10 cm or more when simulating more than 20 spheres and fails to compute chains of more than 44 spheres. Therefore, a fair comparison is difficult.

The computation time for simulating 1000 time steps of a chain of 4-link segments is shown in Fig. 9 (a). The links have a length $l=1\text{ m}$, mass $m=1\text{ kg}$, and the step size is $\Delta t=0.01\text{ s}$. While no longer fully linear, the increase in computation time for this system is modest due to the reduction of fill-in. A simulation of this system with the RigidBodyDynamics simulator failed for more than one 4-link segment, even for smaller step sizes. Numerically, MuJoCo can simulate this system successfully, but with the default solver parameters, the explicit loop-closure constraints introduce high damping into the system, resulting in bad energy conservation behavior.

Comparisons of the sparse graph-based system solver with a dense one are shown in Figs. 9 (c)–(d). The ladder graph consists of cycles with four nodes each, and two nodes are shared by two cycles, except for the first and last two nodes. The net graph is a square net of nodes, where inner nodes are part of four cycles, edge nodes part of two, and corner nodes part of one. The crystal graph is a three-dimensional cubic structure of nodes, where inner nodes are part of twelve cycles, surface nodes part of eight, edge nodes part of five, and corner nodes part of three. The graphs with $n$ nodes are represented by matrices with $6n$ entries. The sparse algorithms outperform the dense ones in most cases, often by more than two orders of magnitude. The crystal graph relates to a rather dense matrix, resulting in decreasing advantage of the sparse approach over the dense one and slightly better performance for $6^{3} = 216$ nodes. This last example is an extreme case. The algorithms are aimed at robotic systems, which typically have significantly fewer nodes and cycles, and the sparse performance is convincing for such systems.

5.3 Numerical robustness

We investigate three different scenarios regarding the numerical robustness of the simulator. A comparison with two state-of-the-art variational integrators in minimal coordinates is made to demonstrate the ability to simulate systems with decreasing solution tolerance. The results are displayed in Fig. 10. Since constrained mechanical systems are known to become ill-conditioned with decreasing step sizes [51, 52], we show the ability to simulate systems for reasonable step sizes in Fig. 11. A double-four-bar linkage—a commonly-used benchmark problem that exhibits singular constraint configurations—is simulated, and the results are compared to another variational integrator for constrained systems in Fig. 12.

In Fig. 10, we compare our algorithm to state-of-the-art second- and third-order variational integrators with the same n-link pendulum we use for the timing comparison in Sect. 5.2. Only revolute joints and loop-free structures are tested as ball-and-socket joints, and loop-closures were, to the best of our knowledge, not part of the implementations of these integrators. To demonstrate the robustness of our algorithm, we run simulations with solution tolerances of $10^{-6}$, $10^{-8}$, and $10^{-10}$ for the Newton methods in all algorithms. All three algorithms display the theoretical linear computational complexity, and despite the higher dimensionality of the maximal-coordinate algorithm, we achieve comparable timing results. However, while our algorithm successfully simulates the n-link pendulums for all tolerances, the minimal-coordinate integrators fail for smaller tolerances and an increasing number of links, hinting at potential numerical issues in minimal coordinates.

Generally, large simulation step sizes are desirable for fast simulations. However, smaller step sizes are required for physically smaller systems to capture their high-frequency motions. Moreover, smaller step sizes may be necessary to achieve highly accurate results. By simulating a physically scaled pendulum with different step sizes, we determine the time-step range for which we can successfully simulate the systems. The pendulum of scale $s\in [1,0.5,0.1]$ has a mass $m=s$, length $l=s^{2}$, joint inertia $J=\frac{1}{3}ml^{2}$, and gravitational acceleration $g=9.81$. Accordingly, the period of the pendulum is $T=2\pi \sqrt{\frac{2}{3}\frac{l}{g}}=2\pi \sqrt{\frac{2}{3} \frac{1}{g}}s$, i.e., proportional to the scale $s$. The pendulum is initialized at an angle and angular velocity $[\theta _{0},\dot{\theta }_{0}]=[\frac{\pi }{2},0]$ and the simulation time is $\frac{T}{2}$, i.e., a half swing. The results in Fig. 11 are plotted for a step size scaled by $s$ to account for the different frequencies of the systems. An increase in the average iteration number per time step for smaller step sizes can be seen in Fig. 11 (a), which is consistent with the increased ill-conditioning of the systems. Nonetheless, the simulation is successful and achieves accurate results even for small step sizes. The accuracy is measured as $\mathrm{norm}(-\frac{\pi }{2}-\theta _{T})$, i.e., the error of the angle at the last time step.

We simulate the double four-bar linkage, a common benchmark system [7], to evaluate the performance of our simulator since the system encounters configuration singularities during simulation when the bars are in parallel. We compare our algorithm to the GGL method in [17] and use the same notation and parameters for the system. The simulation step size is $\Delta t=0.01\text{ s}$. As Fig. 12 (a) and (b) show, our method simulates the system with a bounded error on the energy, whereas the GGL method incurs an increasing energy error, which also leads to an increasingly wrong trajectory. Regarding the constraint satisfaction shown in Fig. 12 (c), both algorithms achieve very low constraint violations, although GGL performs better, potentially because the GGL method enforces the constraints also on a velocity level. Figure 12 (d) shows the kinetic, potential, and total energy of the system as a reference for other benchmarks. We also ran the benchmark for a step size $\Delta t=0.001\text{ s}$, where almost no difference exists between the two methods. A computation time comparison between uncompiled Matlab and compiled Julia code is difficult, and the GGL method is not designed for numerical efficiency, but it is still worth mentioning that our algorithm was more than 100 times faster than the GGL method.

5.4 Application examples

This article focuses on the theoretical foundation for building efficient and physically accurate maximal-coordinate simulators. However, an implementation with a basic interior-point method can already be used for real-world applications. As such application examples, we show that the control parameters for a quadrupedal robot can be learned with a simple sampling-based approach and how controller gains of an exoskeleton can be personalized to impaired patients in simulation to avoid injury. The quadrupedal robot and learning progress are displayed in Fig. 13. The exoskeleton and resulting torques for original and adapted controller gains are visualized in Fig. 14.

We use the Unitree A1 quadrupedal robot for sampling-based learning. The simulation step size is $\Delta t = 0.001\text{ s}$. The learning algorithm is stated in Appendix E. For each episode, a random set of control parameters in the vicinity of the current set of parameters is chosen, and a simulation rollout is performed for 10 seconds. In case of progress, i.e., the walking distance increased, this set of parameters is selected as the starting point for new sample draws. The robot learns to walk a distance of more than 12 m (average velocity of $1.2~\frac{\text{m}}{\text{s}}$) in 100 episodes. Due to the rigid contacts of the simulation, a successful transfer of the learned control parameters to a real system is more likely than with an incorrectly soft contact model.

For the gain tuning, a 4-degrees-of-freedom (DoF) exoskeleton for an arm is used. Three DoF actuate the shoulder and upper arm, and one DoF the elbow and lower arm. The attachments of the upper and lower arm are modeled as two 3-DoF spring-damper joints with two translational and one rotational DoF. As a result, there are two connected kinematic loops in the mechanism. Commonly, attachments between the exoskeleton and the human body are modeled as pure spring-damper connections without joints to avoid such kinematic loops, for example, in [2]. However, such models are not necessarily correct. For the exoskeleton application, we assume a healthy and a spastic patient that perform a rehabilitation routine in the exoskeleton. The shoulder and elbow flexion/extension joints are supposed to follow sinusoidal trajectories (see Appendix E). A limit of 5 Nm is assumed to be a comfortable elbow torque for the patients. While the original gains stay within these limits for the healthy patient, they are exceeded for the spastic patient. By tuning the gains in simulation, an adapted set of gains adhering to the torque limits for both patients is found. With this approach, uncomfortable and potentially harmful tuning on a real patient can be avoided or at least reduced. The modeling accuracy, including proper kinematic loops, should provide closer estimates of the controller gains for the real system.

6 Conclusions

This article introduces a maximal-coordinate variational integrator and efficient graph-based solver for simulating mechanical systems with common components such as springs and dampers, actuated joints, and contacts with friction. Besides the theoretical formulation of the integrator and solver algorithms, an application-ready implementation of the simulator is provided as open-source code.

Building maximal-coordinate simulators on variational integrators is useful not just for the conservation properties and physical accuracy, but also for avoiding constraint drift in the naturally constrained formulation. The increased state dimension is treated with efficient numerical solver algorithms, which reduce the computational complexity in theory and render the simulator useable in practice for various applications. Additionally, it appears that the formulation in maximal coordinates increases the numerical robustness and allows the simulation of systems with contacts or kinematic loops that other simulators in minimal coordinates fail to compute.

Because of the simple modular formulation in maximal coordinates, additional components can be added to the integrator, such as a nonlinear friction model, joint limits, or physically correct elastic contacts. The graph-based nature of the solver algorithms also opens up the possibility of parallelizing computations on different branches of a graph.

Notes

References

Agarwal, P., Narayanan, M.S., Lee, L.-F., Mendel, F., Krovi, V.N.: Simulation-based design of exoskeletons using musculoskeletal analysis. In: Computers and Information in Engineering Conference, pp. 1357–1364. ASMEDC, Montreal (2010)
Google Scholar
Kuhn, J., Hu, T., Schappler, M., Haddadin, S.: Dynamics simulation for an upper-limb human-exoskeleton assistance system in a latent-space controlled tool manipulation task. In: International Conference on Simulation, Modeling, and Programming for Autonomous Robots (SIMPAR), pp. 158–165. IEEE, Brisbane (2018)
Google Scholar
Koenemann, J., Del Prete, A., Tassa, Y., Todorov, E., Stasse, O., Bennewitz, M., Mansard, N.: Whole-body model-predictive control applied to the HRP-2 humanoid. In: International Conference on Intelligent Robots and Systems (IROS), pp. 3346–3351. IEEE, Hamburg (2015)
Google Scholar
Erez, T., Lowrey, K., Tassa, Y., Kumar, V., Kolev, S., Todorov, E.: An integrated system for real-time model predictive control of humanoid robots. In: International Conference on Humanoid Robots (Humanoids), pp. 292–299. IEEE, Atlanta (2013)
Google Scholar
Andrychowicz, O.M., Baker, B., Chociej, M., Józefowicz, R., McGrew, B., Pachocki, J., Petron, A., Plappert, M., Powell, G., Ray, A., Schneider, J., Sidor, S., Tobin, J., Welinder, P., Weng, L., Zaremba, W.: Learning dexterous in-hand manipulation. Int. J. Robot. Res. 39(1), 3–20 (2020)
Google Scholar
Lee, J., Hwangbo, J., Wellhausen, L., Koltun, V., Hutter, M.: Learning quadrupedal locomotion over challenging terrain. Sci. Robot. 5(47), eabc5986 (2020)
Google Scholar
González, M., Dopico, D., Lugrís, U., Cuadrado, J.: A benchmarking system for MBS simulation software: problem standardization and performance measurement. Multibody Syst. Dyn. 16, 179–190 (2006)
Google Scholar
Todorov, E., Erez, T., Tassa, Y.: MuJoCo: a physics engine for model-based control. In: International Conference on Intelligent Robots and Systems (IROS), pp. 5026–5033. IEEE, Vilamoura-Algarve (2012)
Google Scholar
Freeman, C.D., Frey, E., Raichuk, A., Girgin, S., Mordatch, I., Bachem, O.: Brax – a differentiable physics engine for large scale rigid body simulation (2021). http://github.com/google/brax
Tedrake, R.: (2019). The Drake Development Team: Drake: model-based design and verification for robotics. https://drake.mit.edu
Coumans, E., Bai, Y.: PyBullet, a Python module for physics simulation for games, robotics and machine learning (2016). http://pybullet.org
Zhao, W., Queralta, J.P., Westerlund, T.: Sim-to-real transfer in deep reinforcement learning for robotics: a survey. In: Symposium Series on Computational Intelligence (SSCI), pp. 737–744. IEEE, Canberra (2020)
Google Scholar
Featherstone, R.: Rigid Body Dynamics Algorithms. Springer, Boston (2008)
Google Scholar
Featherstone, R.: An empirical study of the joint space inertia matrix. Int. J. Robot. Res. 23(9), 859–871 (2004)
Google Scholar
de Jalón, J.G.: Twenty-five years of natural coordinates. Multibody Syst. Dyn. 18, 15–33 (2007)
MathSciNet Google Scholar
Betsch, P., Steinmann, P.: Constrained integration of rigid body dynamics. Comput. Methods Appl. Mech. Eng. 191(3–5), 467–488 (2001)
MathSciNet Google Scholar
Kinon, P.L., Betsch, P., Schneider, S.: The ggl variational principle for constrained mechanical systems. Multibody Syst. Dyn. 57(3–4), 211–236 (2023)
MathSciNet Google Scholar
Baraff, D.: Linear-time dynamics using Lagrange multipliers. In: Proceedings of the 23rd Annual Conference on Computer Graphics and Interactive Techniques – SIGGRAPH ’96, pp. 137–146. ACM Press, New Orleans (1996)
Google Scholar
Higham, N.J.: Accuracy and Stability of Numerical Algorithms, 2nd edn. SIAM, Philadelphia (2002)
Google Scholar
Brüdigam, J., Manchester, Z.: Linear-time variational integrators in maximal coordinates. In: Workshop on the Algorithmic Foundations of Robotics (WAFR), pp. 194–209. Springer, Cham (2020)
Google Scholar
Brüdigam, J., Manchester, Z.: Linear-quadratic optimal control in maximal coordinates. In: International Conference on Robotics and Automation (ICRA), pp. 9775–9781. IEEE, Xi’an (2021)
Google Scholar
Shield, S., Patel, A.: Minor change, major gains II: are maximal coordinates the fastest choice for trajectory optimization? In: International Conference on Intelligent Robots and Systems (IROS), pp. 12963–12970. IEEE, Kyoto (2022)
Google Scholar
Zhong, G., Marsden, J.E.: Lie-Poisson Hamilton-Jacobi theory and Lie-Poisson integrators. Phys. Lett. A 133(3), 134–139 (1988)
MathSciNet Google Scholar
Chartier, P., Faou, E., Murua, A.: An algebraic approach to invariant preserving integators: the case of quadratic and Hamiltonian invariants. Numer. Math. 103, 575–590 (2006)
MathSciNet Google Scholar
Hairer, E., Lubich, C., Wanner, G.: Geometric Numerical Integration: Structure-Preserving Algorithms for Ordinary Differential Equations, 2nd edn. Springer, Berlin (2006)
Google Scholar
Marsden, J., West, M.: Discrete mechanics and variational integrators. Acta Numer. 10, 357–514 (2001)
MathSciNet Google Scholar
Johnson, E.R., Murphey, T.D.: Scalable variational integrators for constrained mechanical systems in generalized coordinates. IEEE Trans. Robot. 25(6), 1249–1261 (2009)
Google Scholar
Lee, J., Liu, C., Park, F., Srinivasa, S.: A linear-time variational integrator for multibody systems. In: Workshop on the Algorithmic Foundations of Robotics (WAFR), pp. 352–367. Springer, San Francisco (2016)
Google Scholar
Fan, T., Schultz, J., Murphey, T.: Efficient computation of higher-order variational integrators in robotic simulation and trajectory optimization. In: Workshop on the Algorithmic Foundations of Robotics (WAFR), pp. 689–706. Springer, Mérida (2018)
Google Scholar
Leyendecker, S., Marsden, J.E., Ortiz, M.: Variational integrators for constrained dynamical systems. Z. Angew. Math. Mech. 88(9), 677–708 (2008)
MathSciNet Google Scholar
Betsch, P.: The discrete null space method for the energy consistent integration of constrained mechanical systems: part I: holonomic constraints. Comput. Methods Appl. Mech. Eng. 194(50–52), 5159–5190 (2005)
Google Scholar
Betsch, P., Leyendecker, S.: The discrete null space method for the energy consistent integration of constrained mechanical systems. Part II: multibody dynamics. Int. J. Numer. Methods Eng. 67(4), 499–552 (2006)
Google Scholar
Betsch, P., Steinmann, P.: A dae approach to flexible multibody dynamics. Multibody Syst. Dyn. 8, 365–389 (2002)
MathSciNet Google Scholar
Leyendecker, S., Betsch, P., Steinmann, P.: The discrete null space method for the energy-consistent integration of constrained mechanical systems. Part III: flexible multibody dynamics. Multibody Syst. Dyn. 19, 45–72 (2008)
MathSciNet Google Scholar
Brugnoli, A., Alazard, D., Pommier-Budinger, V., Matignon, D.: Port-Hamiltonian flexible multibody dynamics. Multibody Syst. Dyn. 51(3), 343–375 (2021)
MathSciNet Google Scholar
Sola, J., Deray, J., Atchuthan, D.: A micro Lie theory for state estimation in robotics. arXiv preprint (2018). arXiv:1812.01537
Preclik, T., Eibl, S., Rüde, U.: The maximum dissipation principle in rigid-body dynamics with inelastic impacts. Comput. Mech. 62(1), 81–96 (2018)
MathSciNet Google Scholar
Stewart, D.E., Trinkle, J.C.: An implicit time-stepping scheme for rigid body dynamics with inelastic collisions and Coulomb friction. Int. J. Numer. Methods Eng. 39(15), 2673–2691 (1996)
MathSciNet Google Scholar
Wenger, T., Ober-Blöbaum, S., Leyendecker, S.: Constrained Galerkin variational integrators and modified constrained symplectic Runge-Kutta methods. In: International Conference of Numerical Analysis and Applied Mathematics (ICNAAM), Rhodes, Greece (2017)
Google Scholar
Wenger, T., Ober-Blöbaum, S., Leyendecker, S.: Construction and analysis of higher order variational integrators for dynamical systems with holonomic constraints. Adv. Comput. Math. 43(5), 1163–1195 (2017)
MathSciNet Google Scholar
Baruh, H.: Analytical Dynamics. WCB, McGraw-Hill, Boston (1999)
Google Scholar
Shivarama, R., Fahrenthold, E.P.: Hamilton’s equations with Euler parameters for rigid body dynamics modeling. J. Dyn. Syst. Meas. Control 126(1), 124–130 (2004)
Google Scholar
Manchester, Z., Peck, M.: Quaternion variational integrators for spacecraft dynamics. J. Guid. Control Dyn. 39(1), 69–76 (2016)
Google Scholar
Nocedal, J., Wright, S.: Numerical Optimization. Springer, New York (2006)
Google Scholar
Kwak, J., Hong, S.: Linear Algebra, 2nd edn. Birkhäuser, Boston (2004)
Google Scholar
Duff, I., Erisman, A., Reid, J.: Direct Methods for Sparse Matrices, 2nd edn. Oxford University Press, Oxford (2017)
Google Scholar
Koolen, T., Deits, R.: Julia for robotics: simulation and real-time control in a high-level programming language. In: International Conference on Robotics and Automation (ICRA), pp. 604–611. IEEE, Montreal (2019)
Google Scholar
Bezanson, J., Edelman, A., Karpinski, S., Shah, V.: Julia: a fresh approach to numerical computing. SIAM Rev. 59(1), 65–98 (2017)
MathSciNet Google Scholar
Baumgarte, J.: Stabilization of constraints and integrals of motion in dynamical systems. Comput. Methods Appl. Mech. Eng. 1(1), 1–16 (1972)
MathSciNet Google Scholar
Music, S., Hirche, S.: Passive noninteracting control for human-robot team interaction. In: Conference on Decision and Control (CDC), pp. 421–427. IEEE, Miami Beach (2018)
Google Scholar
Petzold, L., Lötstedt, P.: Numerical solution of nonlinear differential equations with algebraic constraints II: practical implications. SIAM J. Sci. Stat. Comput. 7(3), 720–733 (1986)
MathSciNet Google Scholar
Cardenal, J., Cuadrado, J., Morer, P., Bayo, E.: A multi-index variable time step method for the dynamic simulation of multibody systems. Int. J. Numer. Methods Eng. 44(11), 1579–1598 (1999)
Google Scholar
Jackson, B.E., Tracy, K., Manchester, Z.: Planning with attitude. IEEE Robot. Autom. Lett. 6(3), 5658–5664 (2021)
Google Scholar

Download references

Acknowledgements

The authors would like to thank Petar Bevanda for his help in preparing the manuscript, as well as Marko Galic and Jana Janeva for their help in the implementation.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

School of Computation, Information and Technology, Technical University of Munich, Barer Str. 21, Munich, 80333, Germany
Jan Brüdigam, Stefan Sosnowski & Sandra Hirche
The Robotics Institute, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, 15213, PA, USA
Zachary Manchester

Authors

Jan Brüdigam
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Sosnowski
View author publications
You can also search for this author in PubMed Google Scholar
Zachary Manchester
View author publications
You can also search for this author in PubMed Google Scholar
Sandra Hirche
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.B. and Z.M. conceptualized the work. J.B. developed the methodology and software. J.B. wrote the main manuscript text. All authors reviewed and edited the manuscript.

Corresponding author

Correspondence to Jan Brüdigam.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix A: Quaternions

We use quaternions as rotation representations since they are globally non-singular, as opposed to three-parameter representations, and numerically efficient, having only four parameters, unlike, for example, rotation matrices with nine parameters.

1.1 A.1 Notation

We write quaternions as a stacked vector,

$$ \boldsymbol{q} = \begin{bmatrix} q_{s} \\ q_{v_{1}} \\ q_{v_{2}} \\ q_{v_{3}} \end{bmatrix} = \begin{bmatrix} q_{s} \\ \boldsymbol{q}_{v} \end{bmatrix} \in \mathbb{H}, $$

(A.1)

where $q_{s}$ and $\boldsymbol{q}_{v}$ are the scalar and vector parts, respectively. We follow the Hamilton convention with a local-to-global rotation action. In this convention, a quaternion $\boldsymbol{q}$ maps vectors from the local to the global frame, whereas its inverse maps from the global to the local frame.

Notation (A.1) allows for a simple formulation of the basic operations conjugate, inverse, and multiplication:

Conjugate: q^{C} = [\begin{array}{c} q_{s} \\ - q_{v} \end{array}],

(A.2a)

Inverse: q^{- 1} = \frac{q^{C}}{∥ q ∥},

(A.2b)

Multiplication: q \cdot p = [\begin{array}{c} q_{s} p_{s} - q_{v}^{T} p_{v} \\ q_{s} p_{v} + p_{s} q_{v} + q_{v} \times p_{v} \end{array}] .

(A.2c)

Note that, for unit quaternions, $\boldsymbol{q}^{\mathsf{C}}= \boldsymbol{q}^{-1}$, and that the × operator indicates the standard cross product of two vectors.

Three other common operations are expanding a vector $\boldsymbol{x} \in \mathbb{R}^{3}$ into a quaternion, retrieving the vector part from a quaternion, and constructing a skew-symmetric matrix from a vector $\boldsymbol{x} \in \mathbb{R}^{3}$ to form the cross product as a matrix-vector product:

Expand vector: x^{\land} = [\begin{array}{c} 0 \\ x \end{array}],

(A.3a)

Retrieve vector: q^{\lor} = q_{v},

(A.3b)

Skew-symmetric matrix: x^{\times} = [\begin{array}{c} 0 & - x_{3} & x_{2} \\ x_{3} & 0 & - x_{1} \\ - x_{2} & x_{1} & 0 \end{array}] .

(A.3c)

The cross product of two vectors can then be written as $\boldsymbol{x}_{1}\times \boldsymbol{x}_{2} = \boldsymbol{x}_{1}^{ \times}\boldsymbol{x}_{2}$.

To simplify calculations with quaternions, we introduce the following four matrices with the identity matrix $\boldsymbol{I}_{3\times 3} \in \mathbb{R}^{3\times 3}$:

T = [\begin{array}{c} 1 & 0^{T} \\ 0 & - I_{3 \times 3} \end{array}] \in R^{4 \times 4},

(A.4a)

L (q) = [\begin{array}{c} q_{s} & - q_{v}^{T} \\ q_{v} & q_{s} I_{3 \times 3} + q_{v}^{\times} \end{array}] \in R^{4 \times 4},

(A.4b)

R (q) = [\begin{array}{c} q_{s} & - q_{v}^{T} \\ q_{v} & q_{s} I_{3 \times 3} - q_{v}^{\times} \end{array}] \in R^{4 \times 4},

(A.4c)

V = [\begin{array}{c} 0 & I_{3 \times 3} \end{array}] \in R^{3 \times 4} .

(A.4d)

With these matrices all required quaternion operations can be written as matrix-vector products for which the standard rules of linear algebra hold:

$$\begin{aligned} \boldsymbol{q}_{1} \cdot \boldsymbol{q}_{2} &= \boldsymbol{L}( \boldsymbol{q}_{1})\boldsymbol{q}_{2} = \boldsymbol{R}(\boldsymbol{q}_{2}) \boldsymbol{q}_{1}, \end{aligned}$$

(A.5a)

$$\begin{aligned} \boldsymbol{q}^{-1} &= \boldsymbol{T}\boldsymbol{q}, \end{aligned}$$

(A.5b)

$$\begin{aligned} \boldsymbol{q}^{\vee} &= \boldsymbol{V}\boldsymbol{q}, \end{aligned}$$

(A.5c)

$$\begin{aligned} \boldsymbol{x}^{\wedge} &= \boldsymbol{V}^{\mathsf{T}}\boldsymbol{x}. \end{aligned}$$

(A.5d)

The rotation of a vector $\boldsymbol{x}$ can be expressed as

$$ \left (\boldsymbol{q}\cdot \boldsymbol{x}^{\wedge}\cdot \boldsymbol{q}^{-1}\right )^{\vee} = \boldsymbol{Q}(\boldsymbol{q}) \boldsymbol{x}, $$

(A.6)

where $\boldsymbol{Q}(\boldsymbol{q})=\boldsymbol{V} \boldsymbol{R}( \boldsymbol{q})^{\mathsf{T}} \boldsymbol{L}(\boldsymbol{q}) \boldsymbol{V}^{\mathsf{T}}$ is the rotation matrix formed from $\boldsymbol{q}$.

1.2 A.2 Derivatives

While quaternions have four parameters, rotations only have three degrees of freedom, and so we use specialized derivatives for quaternion functions following [20, 53].

The rotational gradient of a quaternion-dependent scalar function $f(\boldsymbol{q})$ is defined as

$$ \nabla _{\boldsymbol{q}}^{\mathrm{r}}f(\boldsymbol{q}) = \boldsymbol{V}\boldsymbol{L}(\boldsymbol{q})^{\mathsf{T}}\nabla _{ \boldsymbol{q}}f(\boldsymbol{q}), $$

(A.7)

and the rotational Jacobian of a vector-valued function $\boldsymbol{f}(\boldsymbol{q})$ as

$$ \frac{\partial \boldsymbol{f}(\boldsymbol{q})}{\partial ^{\mathrm{r}}\boldsymbol{q}} = \frac{\partial \boldsymbol{f}(\boldsymbol{q})}{\partial \boldsymbol{q}} \boldsymbol{L}(\boldsymbol{q})\boldsymbol{V}^{\mathsf{T}}. $$

(A.8)

1.3 A.3 Properties for dynamics descriptions

Using quaternions in the description of dynamical systems requires some attention.

Angular velocity

In continuous time, the quaternion angular velocity is defined as

$$ \bar{\boldsymbol{\omega }} = \begin{bmatrix} \bar{\omega}_{s} \\ \boldsymbol{\omega } \end{bmatrix} = 2\boldsymbol{L}(\boldsymbol{q})^{\mathsf{T}}\dot{\boldsymbol{q}}, $$

(A.9)

where $\bar{\omega}_{s} = 0$. However, using a first-order approximation of $\dot{\boldsymbol{q}}$,

$$ \dot{\boldsymbol{q}}_{k} = \frac{\boldsymbol{q}_{k+1}-\boldsymbol{q}_{k}}{\Delta t}, $$

(A.10)

we obtain a discretized quaternion angular velocity $\bar{\boldsymbol{\omega }}_{k}$, generally with a scalar part $\bar{\omega}_{k,s}\neq 0$. Therefore, $\bar{\boldsymbol{\omega }}_{k}$ is defined so that given $\boldsymbol{q}_{k}$ and $\boldsymbol{\omega }_{k}$, $\boldsymbol{q}_{k+1}$ maintains unit norm.

Given the discretized angular velocity

$$\begin{aligned} \boldsymbol{\omega }_{k} &= \left (2 \boldsymbol{L}(\boldsymbol{q}_{k})^{\mathsf{T}} \frac{\boldsymbol{q}_{k+1}-\boldsymbol{q}_{k}}{\Delta t}\right )^{ \vee} \\ &=\frac{2}{\Delta t}\left ( \boldsymbol{L}(\boldsymbol{q}_{k})^{\mathsf{T}} \boldsymbol{q}_{k+1}\right )^{\vee}, \end{aligned}$$

(A.11)

we define the discretized quaternion angular velocity as

$$ \bar{\boldsymbol{\omega }}_{k} = \frac{2}{\Delta t}\boldsymbol{L}( \boldsymbol{q}_{k})^{\mathsf{T}}\boldsymbol{q}_{k+1}. $$

(A.12)

Since $\boldsymbol{L}(\boldsymbol{q}_{k})^{\mathsf{T}} \boldsymbol{q}_{k+1}$ is an orientation and must have unit norm, the constraint on $\bar{\boldsymbol{\omega }}_{k}$ is

$$ \left \lVert \frac{\Delta t}{2}\bar{\boldsymbol{\omega }}_{k}\right \rVert ^{2} = \left (\frac{\Delta t}{2}\right )^{2}\bar{\omega}_{k,s}^{2} + \left (\frac{\Delta t}{2}\right )^{2}\boldsymbol{\omega }_{k}^{ \mathsf{T}}\boldsymbol{\omega }_{k} = 1. $$

(A.13)

As a result,

$$ \bar{\boldsymbol{\omega }}_{k}= \begin{bmatrix} \bar{\omega}_{k,s} \\ \boldsymbol{\omega }_{k}\end{bmatrix} = \begin{bmatrix} \sqrt{\left (\tfrac{2}{\Delta t}\right )^{2} - \boldsymbol{\omega }_{k}^{ \mathsf{T}}\boldsymbol{\omega }_{k}}~ \\ \boldsymbol{\omega }_{k}\end{bmatrix} . $$

(A.14)

Note that $\bar{\omega }_{k,s} = \frac{2}{\Delta t}$ for $\boldsymbol{\omega }=0$. This difference to the continuous-time definition simplifies the integrator derivation and implementation.

Virtual work

In the Lagrange–d’Alembert principle, the virtual work for external forces and torques is

$$ \delta W = \begin{bmatrix} \boldsymbol{\mathrm{f}}(\boldsymbol{z}) & \boldsymbol{\tau }( \boldsymbol{z}) \end{bmatrix} ^{\mathsf{T}}\delta \boldsymbol{z}, $$

(A.15)

where $\delta \boldsymbol{z}$ represents variations of the trajectory. In our case, the force $\boldsymbol{\mathrm{f}}(\boldsymbol{z})=\boldsymbol{\mathrm{f}}$ is not state-dependent, and the torque $\boldsymbol{\tau }(\boldsymbol{z}) = 2\boldsymbol{L}(\boldsymbol{q}) \boldsymbol{V}^{\mathsf{T}}\boldsymbol{\tau }$ depends on a body’s current orientation. In the integrator derivation, gradients represent variations $\delta \boldsymbol{z}$.

Appendix B: Kinematic joints

Most of the common joints can be defined by composing a general translational constraint function and a general rotational constraint function. A visualization of the two constraints is given in Fig. B.1. This representation, based on two general constraint functions, also allows easy extraction of minimal coordinates.

2.1 B.4 Translational constraint

The general translational constraint function describes the relative distance of two points relative to the origins of the local frames of two bodies:

$$ \boldsymbol{g}_{\mathrm{T}} = \boldsymbol{Q}(\boldsymbol{q}_{ \mathrm{a}})^{\mathsf{T}}\left (\left (\boldsymbol{x}_{\mathrm{b}} + \boldsymbol{Q}(\boldsymbol{q}_{\mathrm{b}})\boldsymbol{p}_{\mathrm{b}} \right ) - \left (\boldsymbol{x}_{\mathrm{a}} + \boldsymbol{Q}( \boldsymbol{q}_{\mathrm{a}})\boldsymbol{p}_{\mathrm{a}}\right ) \right ). $$

(B.1)

The vectors $\boldsymbol{p}_{\mathrm{a}}$ and $\boldsymbol{p}_{\mathrm{b}}$, pointing from the center of mass to the joint, are defined in the respective local frames, and the resulting relative distance is defined in body $\mathrm{a}$’s frame.

In case of only a single body directly connected to the global frame, i.e., $\boldsymbol{x}_{\mathrm{a}}=\boldsymbol{0}$ and $\boldsymbol{q}_{\mathrm{a}}=\mathbbm{1}$ (identity quaternion), we obtain

$$ \boldsymbol{g}_{\mathrm{T}} = \boldsymbol{x}_{\mathrm{b}} + \boldsymbol{Q}(\boldsymbol{q}_{\mathrm{b}}) \boldsymbol{p}_{\mathrm{b}} - \boldsymbol{p}_{\mathrm{a}}. $$

(B.2)

2.2 B.5 Rotational constraint

The general rotational constraint function describes the relative distance of two local frames of two bodies, including a possible offset:

$$ \boldsymbol{g}_{\mathrm{R}} = \boldsymbol{V}\boldsymbol{L}(\boldsymbol{q}_{ \mathrm{off}})^{\mathsf{T}}\boldsymbol{L}(\boldsymbol{q}_{\mathrm{a}})^{ \mathsf{T}}\boldsymbol{q}_{\mathrm{b}}. $$

(B.3)

The offset quaternion $\boldsymbol{q}_{\mathrm{off}}$ is defined in body $\mathrm{a}$’s frame, and the resulting relative rotation is defined in body $\mathrm{a}$’s frame as well.

In case of only a single body directly connected to the global frame, i.e., $\boldsymbol{q}_{\mathrm{a}}=\mathbbm{1}$, we obtain

$$ \boldsymbol{g}_{\mathrm{R}} = \boldsymbol{V}\boldsymbol{L}(\boldsymbol{q}_{ \mathrm{off}})^{\mathsf{T}}\boldsymbol{q}_{\mathrm{b}}. $$

(B.4)

2.3 B.6 Composite constraints and minimal coordinates

To obtain actual joint constraints, the general constraint functions are multiplied with a selection matrix $\boldsymbol{D}$ indicating the desired constraints. Multiplying the constraint functions with the nullspace matrix $\boldsymbol{C}$ of $\boldsymbol{D}$ yields the corresponding minimal coordinates.

The selection matrix $\boldsymbol{D}$ is calculated by performing singular value decomposition on a skew-symmetric matrix formed from a vector $\boldsymbol{V}_{3}$:

$$ \mathrm{svd}(\boldsymbol{V}_{3}^{\times}) = \boldsymbol{U} \boldsymbol{\Sigma } \boldsymbol{V}^{\mathrm{T}}. $$

(B.5)

The matrix $V$ contains both the original vector $\boldsymbol{V}_{3}$ (sign-adjustment might be necessary) and two perpendicular vectors $\boldsymbol{V}_{1}$ and $\boldsymbol{V}_{2}$:

$$ \boldsymbol{V} = \begin{bmatrix} \boldsymbol{V}_{1} ~& \boldsymbol{V}_{2} ~& \boldsymbol{V}_{3} \end{bmatrix} = \begin{bmatrix} \boldsymbol{V}_{1:2} ~& \boldsymbol{V}_{3} \end{bmatrix}. $$

(B.6)

Using the matrices $\boldsymbol{V}_{1:2}$ and $\boldsymbol{V}_{3}$, a number of constraint functions $\boldsymbol{D}^{\mathrm{T}}\boldsymbol{g}$ can be created, where the meaning depends on $\boldsymbol{D}$. The different options for $\boldsymbol{D}$ and the corresponding nullspace matrices $\boldsymbol{C}$ are stated in Table B.1.

Table B.1 Constraint selection and nullspace matrices

Full size table

Mechanical joints are created by using different selection and nullspace matrices for the translational and rotational constraints and stacking everything as

$$\begin{aligned} \boldsymbol{D}^{\mathrm{T}}\boldsymbol{g} &= \begin{bmatrix} \boldsymbol{D}_{\mathrm{T}} & 0 \\ 0 & \boldsymbol{D}_{\mathrm{R}} \end{bmatrix}^{\mathrm{T}} \begin{bmatrix} \boldsymbol{g}_{\mathrm{T}} \\ \boldsymbol{g}_{\mathrm{R}} \end{bmatrix}= \begin{bmatrix} \boldsymbol{D}_{\mathrm{T}}\boldsymbol{g}_{\mathrm{T}} \\ \boldsymbol{D}_{\mathrm{R}}\boldsymbol{g}_{\mathrm{R}} \end{bmatrix}, \end{aligned}$$

(B.7a)

$$\begin{aligned} \boldsymbol{C}^{\mathrm{T}}\boldsymbol{g} &= \begin{bmatrix} \boldsymbol{C}_{\mathrm{T}} & 0 \\ 0 & \boldsymbol{C}_{\mathrm{R}} \end{bmatrix}^{\mathrm{T}} \begin{bmatrix} \boldsymbol{g}_{\mathrm{T}} \\ \boldsymbol{g}_{\mathrm{R}} \end{bmatrix}= \begin{bmatrix} \boldsymbol{C}_{\mathrm{T}}\boldsymbol{g}_{\mathrm{T}} \\ \boldsymbol{C}_{\mathrm{R}}\boldsymbol{g}_{\mathrm{R}} \end{bmatrix}. \end{aligned}$$

(B.7b)

A list of mechanical joints that can be created with this composition is given in Table B.2.

Table B.2 List of joints made of the two general constraint functions

Full size table

Joints with $\boldsymbol{D}_{\mathrm{R}} = \boldsymbol{V}_{3}$ do not appear to have any physical meaning. There are also joints that cannot be described by this composition, for example, helical joints (screws). Nonetheless, they can still be formulated as an equality constraint.

Appendix C: Friction dynamics

The friction forces in a mechanism are derived from the maximum dissipation principle [37] with a linearized friction cone [38]. The principle states that the energy dissipation rate of the bodies in contact is maximized through friction.

Since friction acts on moving bodies, the energy dissipation rate is the time derivative of the kinetic energy. As the kinetic energy for all bodies of a mechanism is the sum of the kinetic energies of all individual bodies,

$$ \mathcal{T}(\dot{\boldsymbol{z}}) = \sum _{n=1}^{n_{\mathrm{b}}} \mathcal{T}(\dot{\boldsymbol{z}}_{\mathrm{n}}), $$

(C.1)

the energy dissipation is the sum of the dissipation of all individual bodies. The energy dissipation for the contact point on a single body is

$$\begin{aligned} \frac{\mathrm{d}}{\mathrm{d}t}\mathcal{T}(\dot{\boldsymbol{z}}) &= \frac{\mathrm{d}}{\mathrm{d}t}\left (\frac{1}{2}\boldsymbol{v}^{ \mathsf{T}}\boldsymbol{M}\boldsymbol{v} + \frac{1}{2} \boldsymbol{\omega }^{\mathsf{T}}\boldsymbol{J}\boldsymbol{\omega } \right ) \end{aligned}$$

(C.2a)

$$\begin{aligned} &=\boldsymbol{v}^{\mathsf{T}}\boldsymbol{M}\dot{\boldsymbol{v}} + \boldsymbol{\omega }^{\mathsf{T}}\boldsymbol{J} \dot{\boldsymbol{\omega }} \end{aligned}$$

(C.2b)

$$\begin{aligned} &=\boldsymbol{v}^{\mathsf{T}}\boldsymbol{\mathrm{f}} + \boldsymbol{\omega }^{\mathsf{T}}\boldsymbol{\tau } \end{aligned}$$

(C.2c)

$$\begin{aligned} &=\boldsymbol{v}^{\mathsf{T}}\boldsymbol{\mathrm{f}} + \boldsymbol{\omega }^{\mathsf{T}}\boldsymbol{p}^{\times } \boldsymbol{Q}(\boldsymbol{q}^{-1})\boldsymbol{\mathrm{f}} ~~ \text{($\boldsymbol{\tau }$ from $\boldsymbol{\mathrm{f}}$ in body frame)} \end{aligned}$$

(C.2d)

$$\begin{aligned} &=\boldsymbol{z}^{\mathsf{T}} \begin{bmatrix} \boldsymbol{I}_{3\times 3} \\ \boldsymbol{p}^{\times }\boldsymbol{Q}(\boldsymbol{q^{-1}}) \end{bmatrix}\boldsymbol{\mathrm{f}} \end{aligned}$$

(C.2e)

$$\begin{aligned} &=\boldsymbol{z}^{\mathsf{T}} \begin{bmatrix} \boldsymbol{I}_{3\times 3} \\ \boldsymbol{p}^{\times }\boldsymbol{Q}(\boldsymbol{q}^{-1}) \end{bmatrix} \begin{bmatrix} \boldsymbol{b}_{1} ~ -\boldsymbol{b}_{1} ~ \cdots ~ \boldsymbol{b}_{ \frac{n_{\mathrm{f}}}{2}} ~ -\boldsymbol{b}_{\frac{n_{\mathrm{f}}}{2}} \end{bmatrix}^{\mathsf{T}}\boldsymbol{\beta } \\ \end{aligned}$$

(C.2f)

$$\begin{aligned} &=\boldsymbol{z}^{\mathsf{T}}\boldsymbol{B}^{\mathsf{T}} \boldsymbol{\beta }, \end{aligned}$$

(C.2g)

where $\boldsymbol{\mathrm{f}}$ and $\boldsymbol{\tau }$ are the force and torque resulting from the friction at the contact point, $\boldsymbol{p}$ is the vector from the center of mass to the contact point in the body frame, and the ^× operator creates the skew-symmetric matrix from a vector. The rotation matrix $\boldsymbol{Q}(\boldsymbol{q}^{-1})$ maps the force $\boldsymbol{\mathrm{f}}$ from the global frame to the body frame in order to obtain the torque $\boldsymbol{\tau }=\boldsymbol{p}^{\times }\boldsymbol{Q}( \boldsymbol{q}^{-1})\boldsymbol{\mathrm{f}}$ in the body frame. The force $\boldsymbol{\mathrm{f}}$ at the contact point is the linearized friction force, consisting of the basis vectors $\boldsymbol{b}_{i}$ of the friction cone and the magnitude $\boldsymbol{\beta }$. The basis vectors $\boldsymbol{b}_{i}$ of a linearized friction cone are depicted in Fig. 1 (b).

The dissipation derivation (C.2a)–(C.2g) can be trivially extended to multiple bodies and contact points. Each contact point involves at most two bodies. Two bodies in contact share the same friction magnitude $\boldsymbol{\beta }$ but have different (flipped) basis vectors $\boldsymbol{b}_{i}$. Accordingly, the mapping matrix $\boldsymbol{B}$ differs for each body and contact point. Accordingly, the dissipation for multiple bodies and contact points takes on the same form

$$ \frac{\mathrm{d}}{\mathrm{d}t}\mathcal{T}(\dot{\boldsymbol{z}}) = \boldsymbol{z}^{\mathsf{T}}\boldsymbol{B}^{\mathsf{T}} \boldsymbol{\beta }, $$

(C.3)

but now $\dot{\boldsymbol{z}}$, $\boldsymbol{z}$, and $\boldsymbol{\beta }$ are the stacked quantities for all bodies and magnitudes, and $\boldsymbol{B}$ consists of the individual $\boldsymbol{B}_{i,j}$ matrices of body $i$ and contact point $j$.

Pairs of basis vectors point in opposite directions. Therefore, all elements of the friction magnitude must be positive. The magnitude is also limited by the friction coefficient $c_{\mathrm{f}}$ according to Coulomb friction, yielding the constraints

$$\begin{aligned} \pmb{1}^{\mathsf{T}}\boldsymbol{\beta } &\leq c_{\mathrm{f}}\gamma , \end{aligned}$$

(C.4a)

$$\begin{aligned} \boldsymbol{\beta }&\geq \boldsymbol{0}. \end{aligned}$$

(C.4b)

Maximum dissipation can, therefore, be stated as a constrained optimization problem

min_{β} z^{T} B^{T} β,

(C.5a)

s.t. E^{T} β \leq C_{f} γ,

(C.5b)

β \geq 0,

(C.5c)

which yields the friction forces.

Appendix D: Simulator algorithms

The simulator computes the forward dynamics by solving the system of equations (29a)–(29f) at each time step. An interior-point method is implemented as a solver for this system. The implementation follows Algorithm 19.1 in [44], which provides more detailed explanations. Pseudo code for the implementation is given in Algorithm 5, and code is available in the open-source implementation (see Sect. 5).

Appendix E: Application details

Details on the application examples and pseudo code are given in this appendix. Code is available in the open-source implementation (see Sect. 5).

Walking quadruped

The legs of the quadruped follow a sinusoidal trajectory with five parameters. The front right and back left leg follow the same trajectory, and the front left and back right leg follow the same trajectory offset by a period of $\pi $. The leg trajectories are tracked with proportional-derivative (PD) controllers. Pseudo code for controlling the gait of the quadrupedal robot is stated in Algorithm 6.

The idea of the sampling-based learning algorithm is to randomly pick the five gait parameters. If progress is made with these parameters, i.e., the robot walks further than before, the next sampling is biased in the successful parameter direction. If no progress is made, new parameters are picked randomly in the vicinity of the current parameters. The pseudo-code of the sampling-based learning algorithm for the quadruped is stated in Algorithm 7.

Exoskeleton

The exoskeleton tracks a sinusoidal rehabilitation trajectory in the shoulder flexion/extension joint and in the elbow flexion/extension joint. A PD-controller with variable scaling is used for tracking, and this scaling is manually tuned with the simulation to adhere to the torque limit. The pseudo-code of the tracking controller is stated in Algorithm 8.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Brüdigam, J., Sosnowski, S., Manchester, Z. et al. Variational integrators and graph-based solvers for multibody dynamics in maximal coordinates. Multibody Syst Dyn 61, 381–414 (2024). https://doi.org/10.1007/s11044-023-09949-x

Download citation

Received: 12 February 2023
Accepted: 20 October 2023
Published: 03 November 2023
Issue Date: July 2024
DOI: https://doi.org/10.1007/s11044-023-09949-x

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Variational integrators and graph-based solvers for multibody dynamics in maximal coordinates

Abstract

Similar content being viewed by others

Linear-Time Variational Integrators in Maximal Coordinates

A Linear-Time Variational Integrator for Multibody Systems

Efficient Computation of Higher-Order Variational Integrators in Robotic Simulation and Trajectory Optimization

Explore related subjects

1 Introduction

2 Dynamics components

Rigid body

Conservative forces, potentials, and springs

Non-conservative forces, actuators, and dampers

Joints and equality constraints

Contacts and inequality constraints

Static and sliding friction

3 Mathematical integrator

3.1 Unconstrained dynamics

Translational component

Rotational component

3.2 Equality constrained dynamics

3.3 Inequality constrained dynamics

3.4 Friction dynamics

3.5 Complete integrator

4 Numerical solver

4.1 Linear-complexity algorithm

Articulated mechanism example

Environment contact example

4.2 Reduced-fill-in algorithm

5 Evaluation

5.1 Physical accuracy

5.2 Computational complexity

5.3 Numerical robustness

5.4 Application examples

6 Conclusions

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher’s Note

Appendices

Appendix A: Quaternions

1.1 A.1 Notation

1.2 A.2 Derivatives

1.3 A.3 Properties for dynamics descriptions

Angular velocity

Virtual work

Appendix B: Kinematic joints

2.1 B.4 Translational constraint

2.2 B.5 Rotational constraint

2.3 B.6 Composite constraints and minimal coordinates

Appendix C: Friction dynamics

Appendix D: Simulator algorithms

Appendix E: Application details

Walking quadruped

Exoskeleton

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation