Abstract
Different representations of linear dissipative Hamiltonian and port-Hamiltonian differential–algebraic equation (DAE) systems are presented and compared. Using global geometric and algebraic points of view, translations between the different representations are presented. Characterizations are also derived for when a general DAE system can be transformed into one of these structured representations. Approaches for computing the structural information and the described transformations are derived that can be directly implemented as numerical methods. The results are demonstrated with a large number of examples.
1 Introduction
Since the original introduction of the energy-based modeling concept of port-Hamiltonian (pH) systems in [28, 35], see also [6, 22, 24, 33, 34, 38, 40], various novel definitions and formulations have been made to incorporate on the one hand systems defined on manifolds, see [14, 41], and on the other hand systems with algebraic constraints defined in the form of differential–algebraic equations (DAEs), see [4, 31]. Constraints in pH systems typically arise in electrical or transport networks, where Kirchhoff’s laws constrain the models at network nodes, as balance equations in chemical engineering, or as holonomic or non-holonomic constraints in mechanical multibody systems. Furthermore, they arise in the interconnection of pH systems when the interface conditions are explicitly formulated and enforced via Lagrange multipliers, see [4, 31, 39,40,41,42] for a variety of examples, or the recent survey [32].
In this paper, we recall different model representations of pHDAEs and dissipative Hamiltonian DAEs (dHDAEs), respectively, and develop a systematic geometric and algebraic theory that also provides a 'translation' between the different representations. We also discuss approaches for explicitly computing the structural information that can be directly implemented as numerical methods. We mainly restrict ourselves to finite-dimensional linear time-invariant systems without inputs and outputs, but indicate in several places where extensions to general systems are possible. We also discuss only real systems, although most of the results can be formulated for complex problems in a similar way.
For general linear time-invariant homogeneous differential–algebraic equations
$$\begin{aligned} E\dot{z} = Az, \end{aligned}$$ (1)
where \(E,A\in {\mathbb {R}}^{n,n}\) (without additional geometric and algebraic structures), as well as for their extensions to linear time-varying and nonlinear systems, the theory is well understood [25]. On the other hand, in contrast with general DAEs (1), the DAEs that arise in energy-based modeling have extra structure and symmetries, and it is natural to identify and exploit this extra structure for purposes of analysis, simulation and control.
In this paper, we will restrict our attention to regular systems, i.e., systems with \(\det (\lambda E-A)\) not identically zero, although many of the results are expected to extend to over- and underdetermined systems; cf. Conclusions. The structural properties for regular systems are characterized via the (real) Weierstraß canonical form of the matrix pair (E, A), see, e.g., [20]. To determine this canonical form, one computes nonsingular matrices U, W such that
where \({\mathcal {J}}\) is in real Jordan canonical form and \({\mathcal {N}}\) is nilpotent and in Jordan canonical form; it is associated with the eigenvalue \(\infty \).
An important quantity that will be used throughout the paper is the size of the largest Jordan block in \({\mathcal {N}}\) (the index of nilpotency), which is called the (differentiation) index \(\nu \) of the pair (E, A) as well as of the associated DAE; by convention, \(\nu =0\) if E is invertible.
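For illustration (not part of the original development), the following Python sketch checks regularity of a pair (E, A) by sampling \(\det (\lambda E-A)\) and reads off the nilpotency index of a given nilpotent block; the matrices are illustrative and the pencil is already assumed to be in Weierstraß form.

```python
# Sketch (illustrative matrices, already in Weierstrass form): a heuristic
# regularity test via det(lambda E - A) at random sample points, and the
# nilpotency index nu of a given nilpotent block N.
import numpy as np

def is_regular(E, A, n_samples=5, tol=1e-12, seed=0):
    lams = np.random.default_rng(seed).standard_normal(n_samples)
    return any(abs(np.linalg.det(lam * E - A)) > tol for lam in lams)

def nilpotency_index(N, tol=1e-12):
    """Smallest nu with N^nu = 0."""
    nu, Nk = 0, np.eye(N.shape[0])
    while np.linalg.norm(Nk) > tol:
        Nk, nu = Nk @ N, nu + 1
    return nu

Jb = np.array([[-1.0]])                        # finite eigenvalue -1
Nb = np.array([[0.0, 1.0], [0.0, 0.0]])        # one Jordan block at infinity
E = np.block([[np.eye(1), np.zeros((1, 2))], [np.zeros((2, 1)), Nb]])
A = np.block([[Jb, np.zeros((1, 2))], [np.zeros((2, 1)), np.eye(2)]])
print(is_regular(E, A), nilpotency_index(Nb))  # True 2  (the DAE has index nu = 2)
```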
The goal of this paper is to study in detail the relationships and differences between different classes of (extended) Hamiltonian DAEs, without making any a priori invertibility assumptions. Furthermore, we exploit at the same time a geometric point of view on Hamiltonian DAEs using Lagrange and Dirac structures (as well as maximally monotone subspaces; cf. Sect. 4.2), and an algebraic point of view using (computationally feasible) condensed forms. From a DAE analysis point of view, similar to the result already known for Hamiltonian DAEs of the form (2) from [29], we also show that the index of an (extended) dissipative Hamiltonian DAE can be at most two. Moreover, we will show that index two algebraic constraints may only arise from singularity of P, i.e., from the Lagrange structure, and thus are strictly linked to singular Hamiltonians.
Another important question that we will answer is the characterization of when a general DAE is equivalent to a structured DAE of the form (2) or (10).
The paper is organized as follows.
In Sect. 2, we discuss dissipative Hamiltonian DAEs and present several examples. The concept of extended Hamiltonian DAEs is discussed in Sect. 3. In Sect. 4, we will present the geometric theory of dissipative Hamiltonian DAEs; we introduce Dirac and Lagrange structures as well as maximally monotone subspaces to incorporate dissipation. Section 5 presents different coordinate representations and their relation. To derive algebraic characterizations, in Sect. 6 we present different types of equivalence transformations and corresponding condensed forms for different classes of (extended dissipative) Hamiltonian DAEs. In Sect. 7, it is analyzed when general linear DAEs can be represented as (extended dissipative) Hamiltonian systems. In all sections, we present examples that illustrate the properties of different representations. In several appendices, we present proofs whose constructions can be directly implemented as numerically stable algorithms.
2 Dissipative Hamiltonian DAEs
A well-established representation of DAE systems with symmetry structure is that of dissipative Hamiltonian DAEs (dHDAEs), which has been studied in detail in [4, 29, 30]. These systems have the form
$$\begin{aligned} E\dot{z} = (J-R)Qz, \end{aligned}$$ (2)
where \(J=-J^\top , R=R^\top \in {\mathbb {R}}^{\ell ,\ell }\), and \(E,Q\in {\mathbb {R}}^{\ell ,n}\) with \(E^\top Q=Q^\top E\). In this representation, the associated Hamilton function (Hamiltonian) is given by the quadratic form
$$\begin{aligned} {\mathcal {H}}^z(z)=\frac{1}{2} z^\top E^\top Q z, \end{aligned}$$
which (as an energy) is typically nonnegative for all z, but more general Hamiltonians also arise in practice. Note that in the ODE case, i.e., if \(E=I\), then this reduces to the standard quadratic Hamiltonian \({\mathcal {H}}^z(z)=\frac{1}{2} z^\top Q z\).
Remark 1
The equivalent formulations \(\frac{d}{dt}(Ez)\) and \(E\dot{z}\) are both used in the literature and obviously lead to the same results in the linear constant coefficient case, but no longer in the linear time-varying or nonlinear case. Actually, for the structured DAEs considered in this paper, E is the product \(E=KP\) of two matrices K and P, and for generalizations it is appropriate to consider \(K\frac{d}{dt}(Pz)\).
Furthermore, the dissipation term R is typically positive semidefinite, denoted as \(R\ge 0\), but we will first discuss the case that \(R=0\) which we call Hamiltonian DAE (HDAE).
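As a small numerical illustration (hypothetical data, not taken from the paper), the following Python sketch checks the structural conditions of (2) and the resulting dissipation inequality \(\frac{d}{dt}{\mathcal {H}}^z(z) = (Qz)^\top E\dot{z} = -(Qz)^\top R (Qz) \le 0\).

```python
# Sketch (hypothetical data): checking the structural conditions of (2) and the
# dissipation inequality d/dt H^z(z) = (Qz)^T E zdot = -(Qz)^T R (Qz) <= 0.
import numpy as np

def check_dhdae(E, J, R, Q, tol=1e-10):
    ok_J = np.allclose(J, -J.T, atol=tol)
    ok_R = np.allclose(R, R.T, atol=tol) and np.linalg.eigvalsh((R + R.T) / 2).min() >= -tol
    ok_EQ = np.allclose(E.T @ Q, Q.T @ E, atol=tol)
    return ok_J, ok_R, ok_EQ

# small example with E singular (so the system contains an algebraic constraint)
E = np.diag([1.0, 1.0, 0.0])
J = np.array([[0.0, 1.0, 0.0], [-1.0, 0.0, 1.0], [0.0, -1.0, 0.0]])
R = np.diag([0.0, 0.5, 0.0])
Q = np.eye(3)
print(check_dhdae(E, J, R, Q))                 # (True, True, True)

z = np.random.default_rng(1).standard_normal(3)
Ezdot = (J - R) @ Q @ z                        # right-hand side of E zdot = (J - R) Q z
print((Q @ z) @ Ezdot <= 1e-12)                # power balance: -(Qz)^T R (Qz) <= 0
```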
Remark 2
The definition of dHDAEs in (2) can be easily extended to systems with ports, in which case it takes the form
with
and
see [4]. We will call these systems dissipative pHDAE (dpHDAE) systems.
Remark 3
Note that by the symmetry of \(E^\top Q\) the Hamiltonian \({\mathcal {H}}^z(z)=\frac{1}{2} z^\top E^\top Q z\) can be also written as a function of Qz or Ez.
Remark 4
In standard port-Hamiltonian modeling, the matrix R in (2) is always assumed to be symmetric (as well as positive semi-definite). Alternatively, one could start from a general matrix R only satisfying \(R+R^\top \ge 0\). Then, by splitting R into its symmetric and skew-symmetric part, one could add the skew-symmetric part to the skew-symmetric structure matrix J and continue as in (2) with the symmetric part \(\frac{1}{2} (R+R^\top )\) of R.
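A minimal sketch of the splitting described in Remark 4 (with hypothetical data):

```python
# Sketch of Remark 4 (hypothetical data): a non-symmetric R with R + R^T >= 0 is
# split into its symmetric and skew-symmetric parts; the skew part is absorbed into J.
import numpy as np

J = np.array([[0.0, 2.0], [-2.0, 0.0]])
R = np.array([[1.0, 0.5], [-0.5, 1.0]])     # not symmetric, but R + R^T >= 0

R_sym, R_skew = (R + R.T) / 2, (R - R.T) / 2
J_new = J - R_skew                          # absorb the skew part of R into J

assert np.allclose(J - R, J_new - R_sym)    # the right-hand side J - R is unchanged
assert np.allclose(J_new, -J_new.T)
assert np.linalg.eigvalsh(R_sym).min() >= -1e-12
```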
Structured DAEs of the form (2) arise naturally in all physical domains.
Example 5
The subclass of RLC networks as considered in, e.g., [4, 12, 19] has the form
where the real positive definite diagonal matrices L, C, G describe inductances, capacitances and conductances, respectively. The matrices \(D_C, D_L, D_R, D_S\) are the parts of the incidence matrix corresponding, respectively, to the capacitors, inductors, resistors (conductors) and current sources of the circuit graph, where \(D_S\) is of full column rank. Furthermore, V are the node potentials and \(I_L, I_S\) denote the currents through the inductors and sources, respectively. This system has the form (2), with \(E=E^\top \ge 0\), Q equal to the identity matrix, and where J and \(-R\) are defined to be the skew-symmetric and symmetric part, respectively, of the matrix on the right hand side of (5). The Hamiltonian is given by \({\mathcal {H}}(V,I_L)= \frac{1}{2} V^\top D_C C D_C^\top V+ \frac{1}{2} I_L^\top L I_L\), and it does not involve the variables \(I_S\), which are in the kernel of E.
Example 6
Space discretization of the Stokes equation in fluid dynamics, see, e.g., [18], leads to a dissipative Hamiltonian system
where \(A=A^\top \) is a positive semidefinite discretization of the negative Laplace operator, B is a discretized gradient and \(M=M^\top \) is a positive definite mass matrix. The homogeneous system has the form (2), with
The Hamiltonian is given by \({\mathcal {H}}= \frac{1}{2} v^\top _h M v_h\); it does not involve the variables \(p_h\).
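The following sketch assembles one such block structure; the concrete layout \(E=\mathrm{diag}(M,0)\), \(J=\left[ \begin{array}{cc} 0 &{}\quad -B \\ B^\top &{}\quad 0 \end{array}\right] \), \(R=\mathrm{diag}(A,0)\), \(Q=I\) is our assumption (chosen to be consistent with the stated Hamiltonian), and the matrices are random illustrative data.

```python
# Sketch (our assumed block layout for Example 6; A, B, M are random illustrative
# data): the discretized Stokes system as a dHDAE E zdot = (J - R) Q z with
# z = (v_h, p_h), Q = I, E = diag(M, 0), J = [[0, -B], [B^T, 0]], R = diag(A, 0).
import numpy as np

rng = np.random.default_rng(0)
nv, npr = 4, 2
F = rng.standard_normal((nv, nv)); Mmat = F @ F.T + nv * np.eye(nv)   # M = M^T > 0
H = rng.standard_normal((nv, nv)); Amat = H @ H.T                     # A = A^T >= 0
B = rng.standard_normal((nv, npr))                                    # discrete gradient

Zvp, Zpp = np.zeros((nv, npr)), np.zeros((npr, npr))
E = np.block([[Mmat, Zvp], [Zvp.T, Zpp]])
J = np.block([[np.zeros((nv, nv)), -B], [B.T, Zpp]])
R = np.block([[Amat, Zvp], [Zvp.T, Zpp]])

assert np.allclose(J, -J.T) and np.allclose(R, R.T)
assert np.linalg.eigvalsh(E).min() >= -1e-10 and np.linalg.eigvalsh(R).min() >= -1e-10
```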
Example 7
Space discretization of the Euler equation describing the acoustic wave propagation in a gas pipeline network [15, 16] leads to a DAE
where \(p_h\) is the discretized pressure, \(q_h\) is a discretized flux and \(\lambda \) is a Lagrange multiplier that penalizes the violation of the conservation of mass and momentum at the pipeline nodes.
The homogeneous system has the form (2) with \(Q=I\) and Hamiltonian \( {\mathcal {H}}(p_h,q_h) =\frac{1}{2} ( p_h^\top M_1 p_h+ q_h^\top M_2 q_h)\). The Hamiltonian does not involve the Lagrange multiplier \(\lambda \).
Example 8
Consider a linear mechanical system (with q denoting the vector of position coordinates) \(M\ddot{q} +D\dot{q} + Wq= f\), together with kinematic constraints \(G\dot{q}=0\), see, e.g., [17]. The constraints give rise to constraint forces \(G^\top \lambda \), with \(\lambda \) a vector of Lagrange multipliers. This yields the dynamics \(M\ddot{q} +D\dot{q} + Wq = f+G^\top \lambda \), and the resulting system can be written in first-order form as
Here, \(E=E^\top \) and \(Q=Q^\top \) are commuting matrices, E, W, and \(R=R^\top \) are positive semidefinite, so the homogeneous system is of the form (2). The Hamiltonian is given by \(\frac{1}{2} ( \dot{q}^\top M \dot{q} + q^\top W q)\) (kinetic plus potential energy) and does not involve the Lagrange multiplier \(\lambda \).
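One common first-order realization consistent with Example 8 uses the state \(z=(q,\dot{q},\lambda )\); the block layout below is our assumption and is not quoted from the paper.

```python
# Sketch (our assumed block layout, consistent with Example 8, hypothetical data):
# first-order dHDAE E zdot = (J - R) Q z for M qddot + D qdot + W q = G^T lam,
# G qdot = 0, with state z = (q, qdot, lam).
import numpy as np

def mechanical_dhdae(M, D, W, G):
    n, m = M.shape[0], G.shape[0]
    I, Z = np.eye(n), np.zeros
    E = np.block([[I,         Z((n, n)), Z((n, m))],
                  [Z((n, n)), M,         Z((n, m))],
                  [Z((m, n)), Z((m, n)), Z((m, m))]])
    J = np.block([[Z((n, n)), I,         Z((n, m))],
                  [-I,        Z((n, n)), G.T      ],
                  [Z((m, n)), -G,        Z((m, m))]])
    R = np.block([[Z((n, n)), Z((n, n)), Z((n, m))],
                  [Z((n, n)), D,         Z((n, m))],
                  [Z((m, n)), Z((m, n)), Z((m, m))]])
    Q = np.block([[W,         Z((n, n)), Z((n, m))],
                  [Z((n, n)), I,         Z((n, m))],
                  [Z((m, n)), Z((m, n)), np.eye(m)]])
    return E, J, R, Q

M = np.diag([2.0, 1.0]); D = np.diag([0.1, 0.0])
W = np.array([[2.0, -1.0], [-1.0, 2.0]]); G = np.array([[1.0, 1.0]])
E, J, R, Q = mechanical_dhdae(M, D, W, G)
assert np.allclose(J, -J.T) and np.allclose(E.T @ Q, Q.T @ E)
assert np.allclose(E @ Q, Q @ E)   # E and Q commute, as stated in Example 8
```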
As indicated in the presented examples, singularity of E, and thus the presence of algebraic constraints, implies that the Hamiltonian (the total stored energy) \({\mathcal {H}}^z(z)=\frac{1}{2} z^\top E^\top Q z\) does not involve all of the variables contained in the vector z. In fact in some of the examples, the variables that do not show up in the Hamiltonian are Lagrange multipliers. Conversely, singularity of E may arise as a limiting situation of otherwise regular Hamiltonians, as shown by the following simple example.
Example 9
Consider a standard mass–spring–damper system with model equation
and Hamiltonian \({\mathcal {H}}(q,p) = \frac{1}{2} kq^2 + \frac{p^2}{2\,m}\).

To compute the limit \(m \rightarrow 0\), we rewrite the system in coordinates q and \(v:=\frac{p}{m}\) as
with Hamiltonian \({\mathcal {H}}(q,v)= \frac{1}{2} kq^2 + \frac{1}{2}mv^2\). For \(m \rightarrow 0\), this converges to the DAE system
which is of (differentiation) index one if \(d \ne 0\), and of index two if \(d=0\). The limiting Hamiltonian \({\mathcal {H}}(q,v)= \frac{1}{2} kq^2\) is not a function of v anymore.
Alternatively, we can compute the limit for \(k \rightarrow \infty \). For this, we rewrite the system in coordinates \(F:=kq\) and p as
with Hamiltonian \({\mathcal {H}}(F,p)= \frac{1}{2k} F^2 + \frac{1}{2\,m}p^2\). For \(k \rightarrow \infty \), this converges to the DAE system
which is of index two for any d, with Hamiltonian \({\mathcal {H}}(F,p)= \frac{1}{2m}p^2\), which does not involve F.
We may also take the limits \(m \rightarrow 0\) and \(k \rightarrow \infty \) simultaneously by rewriting the system in the variables F, v, with Hamiltonian \({\mathcal {H}}(F,v)= \frac{1}{2k} F^2 + \frac{1}{2}mv^2\). This leads to the purely algebraic system
with zero Hamiltonian, and having a single solution \(v=0\), \(F=0\), irrespective of the damper.
3 Extended Hamiltonian DAEs
Classically, Hamiltonian systems and also their extension to systems with inputs and outputs, called port-Hamiltonian systems, see, e.g., [13, 22, 28, 35, 36, 38, 40], are defined by a Dirac structure, representing the power-conserving interconnection structure, an energy-dissipation relation and a Hamilton function (Hamiltonian), capturing the total energy storage in the system. The Dirac structure formalizes the generalized junction structure known from port-based modeling theory. Importantly, the Dirac structure may entail linear constraints on the so-called effort variables, called effort constraints. Through the gradient vector of the Hamilton function, these effort constraints induce algebraic constraints on the state variables, see especially [31, 39] for a discussion of general nonlinear port-Hamiltonian DAEs. In the special case of a linear autonomous system without energy dissipation and without inputs and outputs, and after choosing a basis such that \({\mathcal {X}}={\mathbb {R}}^{n}\), the equations of motion take the DAE form
$$\begin{aligned} K\dot{x} = LQx, \end{aligned}$$ (6)
where the pair of matrices \(K,L\in {\mathbb {R}}^{n,n}\), satisfying \(KL^\top + LK^\top =0\) and \({{\,\textrm{rank}\,}}\left[ \begin{array}{cc} K&L \end{array}\right] = n\), describes the Dirac structure, and \({\mathcal {H}}^x(x) = \frac{1}{2}x^\top Q x\) defines the Hamilton function for some \(Q=Q^\top \). Denoting by \(\ker \) and \(\mathrm{im\,}\) the kernel and image of a linear map or its matrix representation, geometrically the Dirac structure is defined by the subspace \({\mathcal {D}} \subset {{\mathcal {X}}} \times {\mathcal {X}}^*\) given as \(\ker \left[ \begin{array}{cc} K&\quad L \end{array}\right] \), where \({\mathcal {X}}\) is the linear state space, and \({\mathcal {X}}^*\) its dual space.
Remark 10
More precisely, \({\mathcal {D}} \subset \hat{{\mathcal {X}}} \times {\mathcal {X}}^*\), where \(\hat{{\mathcal {X}}}\) is the tangent space to \({\mathcal {X}}\) at \(x \in {\mathcal {X}}\). However, since \({\mathcal {X}}\) is linear, tangent spaces at any \(x\in {\mathcal {X}}\) can be identified with each other and with \({\mathcal {X}}\).
Remark 11
In this paper, we frequently switch between coordinate-free representations with state space \({\mathcal {X}}\) and coordinate representations that are obtained for the case \({\mathcal {X}}={\mathbb {R}}^{n}\) after choosing a basis. Furthermore, in this case it will be tacitly assumed that the dual basis is chosen for the dual space \({\mathcal {X}}^*\). Although this is not the most general setup (from an abstract linear algebraic or functional analytic point of view), it makes the presentation of results much more convenient.
Algebraic constraints occur if the matrix K is singular, and are represented via
Singularity of K often results from the network structure as the following example demonstrates.
Example 12
Consider a general linear LC-electrical circuit. Let the circuit graph be determined by an incidence matrix D, defining Kirchhoff’s current laws \(I \in \ker D\) and voltage laws \(V \in \mathrm{im\,}D^\top \). Split the currents I into currents \(I_C\) through the capacitors and currents \(I_L\) through the inductors, and the voltages V into voltages \(V_C\) across the capacitors and voltages \(V_L\) across the inductors. Furthermore, let \(I_C=-{\dot{q}}_C\) and \(V_L= - {\dot{\varphi }}\), with q the vector of charges at the capacitors and \(\varphi \) the flux linkages of the inductors. Split the incidence matrix D accordingly as \(D= \left[ \begin{array}{cc} D_C&\quad D_L \end{array}\right] \). Then, Kirchhoff’s current laws take the form
Furthermore, let F be a maximal annihilator of \(D^\top \), i.e., \(\ker F= \mathrm{im\,}D^\top \). Then, Kirchhoff’s voltage laws are given as \(F \left[ \begin{array}{c} V_C \\ V_L \end{array}\right] = 0\), and after splitting \(F= \left[ \begin{array}{cc} F_C&F_L \end{array}\right] \) accordingly, we have
Writing the linear constitutive equations for the capacitors as \(q=C V_C\) for some positive definite diagonal capacitance matrix C and those for the inductors as \(\varphi =L I_L\) for some positive definite diagonal inductance matrix L, we finally obtain the system of equations
which is in the form (6) with Hamilton function \({\mathcal {H}}(q,\varphi )= \frac{1}{2}q^\top C^{-1} q + \frac{1}{2}\varphi ^\top L^{-1} \varphi \). It is easily checked that singularity of \(K =\left[ \begin{array}{cc} D_C &{}\quad 0 \\ 0 &{}\quad F_L \end{array}\right] \) corresponds to parallel interconnection of capacitors or series interconnection of inductors. (Note that since \(\ker F= \mathrm{im\,}D^\top \) the linear space \(\mathrm{im\,}F^\top = \ker D\) is spanned by the cycles of the circuit graph.) See, e.g., [23] for pHDAE modeling of electrical circuits.
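A small Python sketch of this construction (with illustrative circuit data and a grounded, reduced incidence matrix assumed) builds K and L from \(D=\left[ \begin{array}{cc} D_C&\quad D_L \end{array}\right] \) and a maximal annihilator F of \(D^\top \), and verifies the Dirac-structure conditions.

```python
# Sketch of Example 12 (illustrative circuit data; a grounded, reduced incidence
# matrix is assumed): K and L are built from D = [D_C D_L] and a maximal
# annihilator F of D^T, and the Dirac-structure conditions are verified.
import numpy as np
from scipy.linalg import null_space

# two capacitor branches and one inductor branch, all between node 1 and ground
D_C = np.array([[1.0, 1.0]])
D_L = np.array([[1.0]])
D = np.hstack([D_C, D_L])

F = null_space(D).T                            # F D^T = 0, i.e. ker F = im D^T
nC, nL = D_C.shape[1], D_L.shape[1]
F_C, F_L = F[:, :nC], F[:, nC:]

K = np.block([[D_C, np.zeros((D_C.shape[0], nL))],
              [np.zeros((F.shape[0], nC)), F_L]])
L = np.block([[np.zeros((D_C.shape[0], nC)), D_L],
              [F_C, np.zeros((F.shape[0], nL))]])

n = nC + nL
assert np.allclose(K @ L.T + L @ K.T, 0)       # generalized skew-symmetry
assert np.linalg.matrix_rank(np.hstack([K, L])) == n
```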
Motivated by [4], the port-Hamiltonian point of view on DAE systems was extended in [41] by replacing the gradient vector Qx of the Hamiltonian function \({\mathcal {H}}(x)=\frac{1}{2} x^\top Qx\) by a general Lagrangian subspace, called a Lagrange structure in the present paper, since we want to emphasize the similarity with Dirac structures. As we will see, this extension allows one to bridge the gap with the (dissipative) Hamiltonian formulation (4) of Sect. 2.
In the linear homogeneous case without dissipation, one starts with a Dirac structure \({\mathcal {D}} \subset {{\mathcal {X}}} \times {\mathcal {X}}^*\) and a Lagrange structure \({\mathcal {L}} \subset {\mathcal {X}} \times {\mathcal {X}}^*\). The composition of the Dirac structure \({\mathcal {D}}\) and the Lagrange structure \({\mathcal {L}}\), over the shared variables \(e \in {\mathcal {X}}^*\), is then defined as
Substituting \(f=-{\dot{x}}\), this leads to the coordinate-free definition of the dynamics
In order to obtain a coordinate representation of the dynamics (8), the simplest option (later on in Sect. 5.1 we will discuss another one) is to start from the image representation of the Lagrange structure \({\mathcal {L}}\), defined by a pair of matrices \(P,S\in {\mathbb {R}}^{n,n}\) satisfying \(P^\top S=S^\top P\) and \({{\,\textrm{rank}\,}}\left[ \begin{array}{cc} P^\top&\quad S^\top \end{array}\right] = n\). Taking coordinates x for \({\mathcal {X}}={\mathbb {R}}^{n}\) and dual coordinates e for its dual space \({\mathcal {X}}^*={\mathbb {R}}^{n}\), the image representation of \({\mathcal {L}}\) is given by
for some parameterizing vector z in a space \({\mathcal {Z}}\) (of the same dimension as \({\mathcal {X}}\)). Analogously, we consider a kernel representation of the Dirac structure \({\mathcal {D}}\) given by matrices K, L satisfying \(KL^\top =-LK^\top \) and \({{\,\textrm{rank}\,}}\left[ \begin{array}{cc} K&L \end{array}\right] = n\), such that \({\mathcal {D}}= \{(f,e) \in {\mathcal {X}} \times {\mathcal {X}}^* \mid Kf + Le=0 \}\). (More details are given in Sect. 4.)
Substituting \(f=-{\dot{x}} = P{\dot{z}}\) and \(e=Sz\), this leads to the coordinate representation
$$\begin{aligned} KP\dot{z} = LSz. \end{aligned}$$ (10)
We will call this class extended Hamiltonian differential–algebraic systems (extended HDAEs). If dissipation is incorporated, see Sect. 4.2, then the system is called an extended dHDAE.
Importantly, the presence of algebraic constraints in (10) may arise both by singularity of K (as was already the case for (6)) as well as by singularity of P (as was the case for (2)). This motivated the introduction of the notions of Dirac algebraic constraints (corresponding to singularity of K) and of Lagrange algebraic constraints (corresponding to singularity of P) in [41].
Similar to dHDAE systems (2), the Hamiltonian of the extended HDAE system (10) is specified by the Lagrange structure and is given by
$$\begin{aligned} {\mathcal {H}}^z(z)= \frac{1}{2} z^\top S^\top P z. \end{aligned}$$ (11)
Indeed, one immediately has the energy conservation property
since \((f,e) \in {\mathcal {D}}\) and thus \(e^\top f=0\).
While singularity of K in physical systems modeling typically arises from interconnection due to the network structure, singularity of P often arises as a limiting situation. An elaborate example will be provided later as Example 17.
Remark 13
Note that if K is invertible, then by multiplying (10) with \(K^{-1}\) from the left, we obtain the lossless version of the dHDAE system (2) in Sect. 2 with \(E=P\), \(Q=S\), \(J=-J^\top = K^{-1} L\) (and \(R=0\)). Thus, the replacement of the Hamiltonian \({\mathcal {H}}(x)=\frac{1}{2} x^\top Q x\) by a Lagrange subspace (9) constitutes a first step toward an overarching formulation of (dissipative) Hamiltonian DAE systems.
4 Geometric theory of (dissipative) Hamiltonian DAEs
In this section, we take a systematic geometric view on Hamiltonian DAE systems, extending the existing geometric treatment of extended HDAE systems, as already discussed in Sect. 3. We also incorporate the discussed classes of dHDAE systems from Sect. 2.
Consider an n-dimensional linear state space \({\mathcal {X}}\) with elements denoted by x. Let \(\hat{{\mathcal {X}}}\) denote the tangent space to \({\mathcal {X}}\) at \(x \in {\mathcal {X}}\), with elements denoted by f and called flow vectors. As mentioned before, since \({\mathcal {X}}\) is linear, tangent spaces at different \(x \in {\mathcal {X}}\) can be identified with each other and with \({\mathcal {X}}\), implying that \(f \in {\mathcal {X}}\) as well. Furthermore, let \({\mathcal {X}}^*\) be the dual space of \({\mathcal {X}}\), with elements denoted by e and called effort vectors.
4.1 Dirac and Lagrange structures
The product space \({\mathcal {X}} \times {\mathcal {X}}^*\) is endowed with the two canonical bilinear forms
represented by the two matrices
where we recognize \(\Pi _-\) as the standard symplectic form on \({\mathcal {X}}\).
Definition 14
A subspace \({\mathcal {D}} \subset {\mathcal {X}} \times {\mathcal {X}}^*\) is called a Dirac structure if the bilinear form \(\langle \cdot , \cdot \rangle _+\) is zero on \({\mathcal {D}}\) and moreover \({\mathcal {D}}\) is maximal with respect to this property. A subspace \({\mathcal {L}} \subset {\mathcal {X}} \times {\mathcal {X}}^*\) is a Lagrange structure if the bilinear form \(\langle \cdot , \cdot \rangle _-\) is zero on \({\mathcal {L}}\) and moreover \({\mathcal {L}}\) is maximal with respect to this property. A Lagrange structure \({\mathcal {L}} \subset {\mathcal {X}} \times {\mathcal {X}}^*\) is called nonnegative if the quadratic form defined by \(\Pi _+\) is nonnegative on \({\mathcal {L}}\).
Remark 15
In this paper, we have chosen the terminology 'Lagrange structure,' instead of the more common terminology 'Lagrangian subspace,' in order to emphasize the similarity to Dirac structures. Also note that the definition of Dirac structures can be extended to manifolds instead of linear state spaces \({\mathcal {X}}\); in this context, Dirac structures on linear spaces are often referred to as constant Dirac structures.
We have the following characterizations of Lagrange and Dirac structures, see, e.g., [13, 40].
Proposition 16
Consider an n-dimensional linear state space \({\mathcal {X}}\) and its dual space \({\mathcal {X}}^*\).
-
(i)
A subspace \({\mathcal {D}} \subset {\mathcal {X}} \times {\mathcal {X}}^*\) is a Dirac structure if and only if \({\mathcal {D}}={\mathcal {D}}^{\perp \!\!\!\perp _+}\), where \({}^{\perp \!\!\!\perp _+}\) denotes the orthogonal complement with respect to the bilinear form \(\langle \cdot , \cdot \rangle _+\). Furthermore, \({\mathcal {D}} \subset {\mathcal {X}} \times {\mathcal {X}}^*\) is a Dirac structure if and only if \(e^\top f=0\) for all \((f,e) \in {\mathcal {D}}\) and \(\dim {\mathcal {D}}=n\).
-
(ii)
A subspace \({\mathcal {L}} \subset {\mathcal {X}} \times {\mathcal {X}}^*\) is a Lagrange structure if and only if \({\mathcal {L}}={\mathcal {L}}^{\perp \!\!\!\perp _-}\), where \({}^{\perp \!\!\!\perp _-}\) denotes the orthogonal complement with respect to the bilinear form \(\langle \cdot , \cdot \rangle _-\). Any Lagrange structure satisfies \(\dim {\mathcal {L}} =n\).
Dirac and Lagrange structures admit structured coordinate representations, see, e.g., [40, 41]. For this paper, the following representations are most relevant. Using matrices \(K,L \in {\mathbb {R}}^{n,n}\), any Dirac structure \({\mathcal {D}} \subset {{\mathcal {X}}} \times {\mathcal {X}}^*\) admits the kernel/image representation
with K, L satisfying \({{\,\textrm{rank}\,}}\left[ \begin{array}{cc} K&\quad L\end{array}\right] =n\) and the generalized skew-symmetry condition
Conversely any such pair K, L defines a Dirac structure.
Analogously, any Lagrange structure \({\mathcal {L}} \subset {\mathcal {X}} \times {\mathcal {X}}^*\) can be represented as
for certain matrices \(S,P \in {\mathbb {R}}^{n,n}\) satisfying \({{\,\textrm{rank}\,}}\left[ \begin{array}{c} P \\ S \end{array}\right] =n\) as well as the generalized symmetry condition
A Lagrange structure is, furthermore, nonnegative if and only if \(S^\top P \ge 0\).
As already described in Sect. 3, by using the image representation \(x=Pz\), \(e=Sz\) of the Lagrange structure \({\mathcal {L}}\) and the kernel representation \(Kf + Le=0\) of the Dirac structure \({\mathcal {D}}\) one is led to the representation (10) of the extended HDAE system defined by \({\mathcal {D}}\) and \({\mathcal {L}}\).
The following is a physical example where both K and P turn out to be singular. The singularity of K is due to the presence of kinematic constraints, while the singularity of P is caused by a limiting argument in the energy expression.
Example 17
Consider two masses \(m_1\) and \(m_2\) connected by a spring with spring constant k, where the right mass \(m_2\) is subject to the kinematic constraint \(v_2=0\) (velocity is zero). With positions \(q_1,q_2\) and momenta \(p_1,p_2\), the Hamiltonian is given by
Denoting by \(e_{q1}, e_{q2}\) the spring forces at both ends of the spring and by \(e_{p1}, e_{p2}\) the velocities of the two masses, we obtain the relation
To consider the limit \(k \rightarrow \infty \), meaning that the spring is replaced by a rigid connection, we first express the system in different coordinates.
This yields the transformed representation
Taking the limit \(k \rightarrow \infty \) yields the Lagrange structure \({\mathcal {L}}\) in image representation
and the limiting Hamiltonian is just the kinetic energy
The system has a Lagrange algebraic constraint due to the linear dependency in the rows of P. The Dirac structure \({\mathcal {D}}\) is given as
After elimination of the Lagrange multiplier \(\lambda \), this yields
and hence
Finally subtracting the second equation from the first equation, we obtain the HDAE system
Here the first equation is the Lagrange algebraic constraint \({\tilde{z}}_3=0\) (and eventually \({\tilde{z}}_1=0\)) obtained by letting \(k \rightarrow \infty \) (corresponding to singularity of P), and the last equation is the Dirac algebraic constraint \(- \frac{{\tilde{z}}_3}{m_2} + \frac{{\tilde{z}}_4}{m_1 + m_2}=0\), i.e., \({\tilde{z}}_4=0\) resulting from the kinematic constraint, leading to singularity of K and resulting in the trivial dynamics \(\dot{{\tilde{z}}}_2 = 0 \).
Remark 18
Instead of using a parametrization of the Lagrange structure \({\mathcal {L}}\), one can also use a parameterization of the Dirac structure \({\mathcal {D}}\)
with \(v \in {\mathcal {V}}\), where \({\mathcal {V}}\) is an n-dimensional parameter space. This yields an extended dHDAE system (but now in the parameter vector v) given by
which is the adjoint system of (10). See [26] for a detailed discussion of adjoint systems of DAEs.
4.2 Incorporation of dissipation
As noted in Sect. 4, extended HDAE systems (10), geometrically defined by a Dirac and Lagrange structure, already include HDAE systems (2) without dissipation. Conversely, any HDAE system with K invertible can be rewritten into the form (2) with \(R=0\).
In order to complete the geometric viewpoint toward the inclusion of dissipation (and thus to (2)), we recall the geometric definition of a port-Hamiltonian system [35, 37, 40]. By replacing the Hamiltonian function by a Lagrange structure as in [41], and specializing to the case without external variables (inputs and outputs), such systems will be called extended dHDAE systems.
Definition 19
Consider a state space \({\mathcal {X}}\) with linear coordinates x and a linear space of resistive flows \({\mathcal {F}}_R\). Furthermore, consider a Dirac structure \({\mathcal {D}}\) on \({\mathcal {X}} \times {\mathcal {F}}_R\), a Lagrange structure \({\mathcal {L}} \subset {\mathcal {X}} \times {\mathcal {X}}^*\) and a nonnegative Lagrange structure \({\mathcal {R}} \subset {\mathcal {F}}_R \times {\mathcal {F}}^*_R\). Then, an extended dissipative Hamiltonian DAE (extended dHDAE) system is defined as the tuple \(({\mathcal {X}},{\mathcal {F}}_R,{\mathcal {D}},{\mathcal {L}},{\mathcal {R}})\) with
If \({\mathcal {L}}\) is represented as in (15), i.e.,
then it immediately follows from the properties of the Dirac structure \({\mathcal {D}}\) and the nonnegative Lagrange structure \({\mathcal {R}}\) that the dynamics of the extended dHDAE satisfies
More generally, we will now introduce the notion of a maximally monotone subspace, which overarches the notions of a Dirac structure \({\mathcal {D}}\) and a nonnegative Lagrange structure \({\mathcal {R}}\).
Definition 20
Consider a linear space \({\mathcal {X}}\). A subspace \({\mathcal {M}} \subset {\mathcal {X}} \times {\mathcal {X}}^*\) is called a monotone subspace if
for all \((f,e) \in {\mathcal {M}}\), and it is maximally monotone if additionally \({\mathcal {M}}\) is maximal with respect to this property (i.e., there does not exist a monotone subspace \({\mathcal {M}}' \subset {\mathcal {X}} \times {\mathcal {X}}^*\) with \({\mathcal {M}} \subsetneq {\mathcal {M}}'\)).
Remark 21
The definition of a monotone subspace is a special case of the notion of a monotone relation \({\widetilde{M}}\), which is defined as a subset of \({\mathcal {X}} \times {\mathcal {X}}^*\) satisfying
for all \((f_1,e_1), (f_2,e_2) \in {\widetilde{M}}\). Clearly if \({\widetilde{M}}\) is a subspace then (22) reduces to (21). (Maximally) monotone subspaces with a sign change were employed before in [21], using the terminology of ’(maximally) linear dissipative relations.’ Nonlinear port-Hamiltonian systems with respect to a general (maximally) monotone relation were coined as incremental port-Hamiltonian systems in [10]; see [9] for further developments.
Obviously, a subspace \({\mathcal {M}}\) is monotone if and only if the quadratic form defined by \(\Pi _+\) is nonnegative on \({\mathcal {M}}\), since \(\langle (f,e), (f,e) \rangle _+ = 2 e^\top f\). This yields
Proposition 22
Consider a state space \({\mathcal {X}}\) with \(\dim {\mathcal {X}}=n\). Then, any monotone subspace of \({\mathcal {X}} \times {\mathcal {X}}^*\) has dimension less than or equal to n, and any maximally monotone subspace of \({\mathcal {X}} \times {\mathcal {X}}^*\) has dimension n. Any maximally monotone subspace \({\mathcal {M}} \subset {\mathcal {X}} \times {\mathcal {X}}^*\) can be represented as
for \(M,N\in {\mathbb {R}}^{n,n}\) satisfying \({{\,\textrm{rank}\,}}\left[ \begin{array}{cc} N&\quad M \end{array}\right] =n\) and
Conversely, any subspace defined by M, N satisfying (24) is a maximally monotone subspace.
Proof
The proof follows, since \(\Pi _+\) has n positive and n negative eigenvalues. \(\square \)
Obviously, any Dirac structure \({\mathcal {D}}\) given by a pair of matrices K, L is maximally monotone (take \(M=K\) and \(N=L\)). In a similar way, any nonnegative Lagrange structure \({\mathcal {R}}\) given by a pair of matrices P, S with \(S^\top P \ge 0\) is maximally monotone by taking \(N^\top =P\), \(M^\top =S\).
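A small numerical check of these statements (hypothetical data) tests the conditions of Proposition 22, \({{\,\textrm{rank}\,}}\left[ \begin{array}{cc} N&\quad M \end{array}\right] =n\) and \(MN^\top + NM^\top \ge 0\), for a Dirac pair and for a nonnegative Lagrange pair.

```python
# Sketch (hypothetical data): testing the conditions of Proposition 22 for a pair
# (M, N), namely rank [N M] = n and M N^T + N M^T >= 0 (cf. (24)).
import numpy as np

def is_maximally_monotone(M, N, tol=1e-10):
    n = M.shape[0]
    full_rank = np.linalg.matrix_rank(np.hstack([N, M])) == n
    sym = M @ N.T + N @ M.T
    return full_rank and np.linalg.eigvalsh((sym + sym.T) / 2).min() >= -tol

# a Dirac structure pair (take M = K, N = L): K L^T + L K^T = 0
K = np.array([[1.0, 0.0], [0.0, 0.0]])
L = np.array([[0.0, 0.0], [0.0, 1.0]])
print(is_maximally_monotone(K, L))               # True

# a nonnegative Lagrange structure pair (take N^T = P, M^T = S with S^T P >= 0)
P, S = np.diag([1.0, 0.0]), np.diag([2.0, 1.0])
print(is_maximally_monotone(S.T, P.T))           # True
```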
Importantly, also the composition of two maximally monotone subspaces is again maximally monotone. In order to prove this, we first state the following lemma.
Lemma 23
Let \(A: {\mathcal {F}} \rightarrow {\mathcal {G}}\) be a linear map between two linear spaces \({\mathcal {F}},{\mathcal {G}}\). Let \({\mathcal {M}}_{{\mathcal {G}}} \subset {\mathcal {G}} \times {\mathcal {G}}^*\) be a maximally monotone subspace. Then, the pull-back of \({\mathcal {M}}_{{\mathcal {G}}}\) via A, defined as
is maximally monotone. Furthermore, let \({\mathcal {M}}_{{\mathcal {F}}} \subset {\mathcal {F}} \times {\mathcal {F}}^*\) be a maximally monotone subspace. Then, the push-forward of \({\mathcal {M}}_{{\mathcal {F}}}\) via A, defined as
is maximally monotone.
Proof
It is immediately checked that \(b_A ({\mathcal {M}}_{{\mathcal {G}}})\) is monotone. Furthermore,
and thus, \(b_A ({\mathcal {M}}_{{\mathcal {G}}})\) is maximally monotone. The proof to show that \(f_A ({\mathcal {M}}_{{\mathcal {F}}})\) is maximally monotone is analogous. \(\square \)
Using Lemma 23, we can show that maximally monotone subspaces satisfy the following composition property. This same property was recently derived for maximally monotone relations in [9], assuming additional regularity conditions.
Proposition 24
Consider an extended dHDAE system as in (19) and let \({\mathcal {M}}_a\) and \({\mathcal {M}}_b\) be maximally monotone subspaces
with \({\mathcal {E}}={\mathcal {F}}^*, {\mathcal {E}}_a={\mathcal {F}}_a^*, {\mathcal {E}}_b={\mathcal {F}}_b^*\). Define the composition
Then, \({\mathcal {M}}_a \circ {\mathcal {M}}_b \subset {\mathcal {F}}_a \times {\mathcal {F}}_b \times {\mathcal {E}}_a \times {\mathcal {E}}_b\) is again maximally monotone.
Proof
Let \({\mathcal {V}}_a:={\mathcal {F}}\) and \({\mathcal {V}}_b:={\mathcal {F}}\). Define the linear maps
Then, for the maximally monotone subspace
it can be readily checked that
where \({\mathcal {M}}_a \times {\mathcal {M}}_I \times {\mathcal {M}}_b\) is clearly maximally monotone. Then, the proof finishes by applying Lemma 23. \(\square \)
We immediately have the following corollary.
Corollary 25
Consider a Dirac structure \({\mathcal {D}} \subset {\mathcal {X}} \times {\mathcal {F}}_R \times {\mathcal {X}}^* \times {\mathcal {F}}^*_R\), together with a nonnegative Lagrangian subspace \({\mathcal {R}} \subset {\mathcal {F}}_R \times {\mathcal {F}}^*_R\). Then, the composition of \({\mathcal {D}}\) and \({\mathcal {R}}\) defined via
is maximally monotone. In particular, for any \((f,e) \in {\mathcal {D}} \circ {\mathcal {R}}\), one has
Remark 26
We conjecture that conversely any maximally monotone subspace \({\mathcal {M}}\) can be generated this way, i.e., as the composition of a certain Dirac structure \({\mathcal {D}}\) and a certain nonnegative Lagrangian subspace \({\mathcal {R}}\).
The presented analysis of maximally monotone subspaces leads to the following geometric definition of an extended dHDAE system, covering both dHDAE systems (2) and extended HDAE systems (10). See [21] for related results (using the terminology of (maximally) dissipative linear relations).
Definition 27
Consider a linear state space \({\mathcal {X}}\) with coordinates x, a maximally monotone subspace \({\mathcal {M}} \subset {\mathcal {X}} \times {\mathcal {X}}^*\) and a Lagrange structure \({\mathcal {L}} \subset {\mathcal {X}} \times {\mathcal {X}}^*\). Then, an extended dHDAE system is a system \(({\mathcal {X}},{\mathcal {M}},{\mathcal {L}})\) satisfying
A coordinate representation of an extended dHDAE system is obtained as follows. Consider a coordinate expression (23) of the maximally monotone subspace \({\mathcal {M}}\) (with M, N satisfying (24)). This means that any element \((f,e) \in {\mathcal {M}}\) can be represented as
for some \(v \in {\mathbb {R}}^n\). Furthermore, any \((x,e) \in {\mathcal {L}}\) can be represented as in (20). Substituting \(-f={\dot{x}}=P{\dot{z}}\), this yields
Now construct matrices C, D satisfying
Then, pre-multiplication by such a maximal annihilator \(\left[ \begin{array}{cc} C&\quad D \end{array}\right] \) eliminates the auxiliary variables v, and one obtains the coordinate representation
$$\begin{aligned} CP\dot{z} = DSz. \end{aligned}$$ (27)
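A minimal sketch of this elimination step (hypothetical data): the annihilator \(\left[ \begin{array}{cc} C&\quad D \end{array}\right] \) is obtained from a null-space computation, and the coefficients of \(CP\dot{z} = DSz\) are assembled.

```python
# Sketch (hypothetical data): a maximal annihilator [C D] with C N^T + D M^T = 0 is
# obtained from a null-space computation; premultiplying the relations
# -P zdot = N^T v, S z = M^T v by [C D] eliminates v and yields C P zdot = D S z.
import numpy as np
from scipy.linalg import null_space

def left_annihilator(X):
    """Rows of the result span the left null space of X."""
    return null_space(X.T).T

n = 2
N = np.array([[0.5, 1.0], [-1.0, 0.0]])    # N + N^T >= 0, so (M, N) below is monotone
M = np.eye(n)
P = np.diag([1.0, 0.0])                    # singular P: a Lagrange algebraic constraint
S = np.eye(n)

CD = left_annihilator(np.vstack([N.T, M.T]))      # [C D], here of size n x 2n
C, D = CD[:, :n], CD[:, n:]
assert np.allclose(C @ N.T + D @ M.T, 0)
E_coef, A_coef = C @ P, D @ S              # coefficients of (C P) zdot = (D S) z
```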
Remark 28
The geometric construction of extended dissipative Hamiltonian system can be immediately generalized to extended dissipative port-Hamiltonian DAE (dpHDAE) systems with external port variables (inputs and outputs), by extending the maximally monotone subspace \({\mathcal {M}} \subset {\mathcal {X}} \times {\mathcal {X}}^*\) to a maximally monotone subspace \({\mathcal {M}}_e \subset {\mathcal {X}} \times {\mathcal {X}}^* \times {\mathcal {F}}_P \times {\mathcal {F}}_P^*\), where \({\mathcal {F}}_P \times {\mathcal {F}}_P^*\) is the space of external port variables.
Two particular cases of Definition 27 are of special interest. The first one is where the maximally monotone subspace \({\mathcal {M}}\) is actually a Dirac structure as in (6) with K, L satisfying (14). In this case, one can take \(\left[ \begin{array}{cc} C&\quad D \end{array}\right] =\left[ \begin{array}{cc} K&\quad L \end{array}\right] \), and thus, the extended dHDAE system reduces to the extended HDAE system
The other special case is where the maximally monotone subspace \({\mathcal {M}}\) in (23) is such that M is invertible. In this case, without loss of generality \(M^\top \) can be taken to be the identity matrix, and the maximal annihilator \(\left[ \begin{array}{cc} C&\quad D \end{array}\right] \) can be taken to be of the form \(\left[ \begin{array}{cc} I&\quad D \end{array}\right] \). Hence, \(D=-N^\top \), and thus, (27) reduces to
Furthermore, \(MN^\top + NM^\top \ge 0\) reduces to \(N^\top + N \ge 0\), and hence,
with \(J=-J^\top \) and \(R=R^\top \ge 0\). Thus, in this case the extended dpHDAE system takes the familiar form (2) with \(E=P\), \(S=Q\) expressed as
Remark 29
Similar to the theory exposed in [41] for HDAE systems (10), the algebraic constraints of the dpHDAE system (27) can be split into two classes: one corresponding to singularity of P (Lagrange algebraic constraints in [41]) and one corresponding to singularity of C. In case of (10) the second class of algebraic constraints are called Dirac algebraic constraints in [41], but now they correspond to the maximally monotone subspace.
Furthermore, mimicking the developments in [41], one can transform algebraic constraints associated with index one belonging to one class into algebraic constraints in the other, by the use of additional state variables (serving as Lagrange multipliers).
5 Representation of DAE systems generated by Dirac and Lagrange structures in the state variables x
The representation (10) of an extended HDAE system as discussed in the previous sections does not use the state variables x of the state space \({\mathcal {X}}\), but instead an equally dimensioned vector \(z\in {\mathcal {Z}}\) parameterizing the Lagrange structure, cf. (9) and (15). In this section, we show how a different DAE representation involving the original state vector \(x \in {\mathcal {X}}\) can be obtained. Furthermore, we discuss in what sense this representation in x is equivalent with the representation (10) involving z.
5.1 A coordinate representation in the original state variables x
Consider a Dirac structure \({\mathcal {D}} \subset {\mathcal {X}} \times {\mathcal {X}}^*\), a Lagrange structure \({\mathcal {L}} \subset {\mathcal {X}} \times {\mathcal {X}}^*\) and the resulting dynamics specified (in coordinate-free form) as \({\mathcal {D}} \circ {\mathcal {L}} \subset {{\mathcal {X}}} \times {\mathcal {X}}\). Let x be coordinates for the state space \({\mathcal {X}}\), and let the Dirac structure be represented by a pair of matrices K, L and the Lagrange structure by a pair of matrices P, S. To derive a coordinate representation employing directly the state vector x, we first consider the combined representations of \({\mathcal {D}}\) and \({\mathcal {L}}\), both in kernel representation, i.e.,
where e are dual coordinates for \({\mathcal {X}}^*\). In order to obtain a DAE system only involving x we need to eliminate the variables e. This can be done by considering a maximal annihilator (left null-space) \(\left[ \begin{array}{cc} M&\quad N \end{array}\right] \) of \(\left[ \begin{array}{c} L \\ -P^\top \end{array}\right] \), i.e.,
and thus, in particular,
Since
premultiplication of the equations (29) by \(\left[ \begin{array}{cc} M&\quad N \end{array}\right] \) thus yields
Hence, the resulting DAE system is given by
$$\begin{aligned} MK\dot{x} = NS^\top x. \end{aligned}$$ (32)
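A minimal sketch of this elimination (the routine name is ours; the data below is that of Example 33 in Sect. 5.2 with \(1\times 1\) blocks): the annihilator \(\left[ \begin{array}{cc} M&\quad N \end{array}\right] \) is computed from the null space of \(\left[ \begin{array}{cc} L^\top&\quad -P \end{array}\right] \) and the coefficients of (32) are assembled.

```python
# Sketch (the routine name is ours; data as in Example 33 below with 1x1 blocks):
# a maximal annihilator [M N] of [L; -P^T] eliminates the effort variables e and
# yields the DAE M K xdot = N S^T x of (32) in the original state x.
import numpy as np
from scipy.linalg import null_space

def representation_in_x(K, L, P, S):
    n = K.shape[0]
    MN = null_space(np.hstack([L.T, -P])).T    # each row [m, n] satisfies m L = n P^T
    M, N = MN[:, :n], MN[:, n:]
    return M @ K, N @ S.T                      # coefficients of M K xdot = N S^T x

K = np.eye(2)
L = np.array([[0.0, 1.0], [-1.0, 0.0]])
P = np.diag([1.0, 0.0])                        # singular P
S = np.eye(2)
E_x, A_x = representation_in_x(K, L, P, S)
print(E_x); print(A_x)
```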
Remark 30
Also for extended dHDAE systems (including dissipation), we can consider, instead of the coordinate representation (27) involving the parameterizing vector z, a representation that is using the original state x. In fact, let as before, cf. (26), \(\left[ \begin{array}{cc} C&\quad D \end{array}\right] \) denote a maximal annihilator of \(\left[ \begin{array}{cc} N&\quad M \end{array}\right] ^\top \), i.e., \(\ker \left[ \begin{array}{cc} C&\quad D \end{array}\right] =\mathrm{im\,}\left[ \begin{array}{cc} N&\quad M \end{array}\right] ^\top \). Then, consider, similar to (29), the stacked matrix
and a maximal annihilator \(\left[ \begin{array}{cc} V&\quad W \end{array}\right] \) to \(\left[ \begin{array}{cc} D^\top&-P \end{array}\right] ^\top \), that is, \(\ker \left[ \begin{array}{cc} V&\quad W \end{array}\right] = \mathrm{im\,}[D^\top -P]^\top \). Then, premultiplication by \(\left[ \begin{array}{cc} V&\quad W \end{array}\right] \) yields the representation
The analysis performed in the current subsection for (32) can be performed, mutatis mutandis, for (33) as well.
Recall that in the coordinate representation (10), we have the expression \({\mathcal {H}}^z(z)= \frac{1}{2} z^\top S^\top Pz\) for the Hamiltonian. In the representation (32), we do not yet have a Hamiltonian associated with the extended HDAE system. To define such a Hamiltonian \({\mathcal {H}}^x\) in (32), we would need that P is invertible, in which case it is given by
If P is invertible, then there is a direct relation between the Hamiltonians (11) and (34). In fact, substituting \(x=Pz\), we immediately obtain
Alternatively, if S is invertible, then one can use the co-energy (Legendre transform) of \(H^x\) given by
for which \({\mathcal {H}}^e(e) = {\mathcal {H}}^z(z)\) with \(e=Sz\).
Note that if P is invertible, then also M is invertible. This follows, since then \(ML=NP^\top \) implies that the columns of N are in \(\mathrm{im\,}M\), and since \({{\,\textrm{rank}\,}}\left[ \begin{array}{cc} M&\quad N \end{array}\right] =n\), this means M is invertible. The converse that M invertible implies P invertible follows analogously. In a similar fashion, it follows that L is invertible if and only if N is invertible. These observations imply the following simplifications of the representation (32) under additional assumptions.
1. (a) If P is invertible, then \(N=MLP^{-\top }\), and by multiplying (32) from the left by \(M^{-1}\) we obtain the system
$$\begin{aligned} K {\dot{x}} = LP^{-\top }S^\top x = L \left( SP^{-1}\right) ^\top x = L SP^{-1} x, \end{aligned}$$ (35)
where the last equality follows from \(S^\top P = P^\top S\). This is exactly the form of a Hamiltonian DAE system in case of a general Dirac structure and a Lagrange structure that is given as the graph of a symmetric matrix \(Q:=SP^{-1}\), see [35, 38,39,40]. Indeed, the Lagrange structure simplifies to the gradient of the Hamiltonian function \({\mathcal {H}}^x(x)= \frac{1}{2}x^\top SP^{-1}x\).

   (b) If in this case, additionally, K is invertible, then we obtain the Poisson formulation of Hamiltonian systems, see, e.g., [2],
$$\begin{aligned} {\dot{x}} = \left( K^{-1} L\right) \left( SP^{-1}\right) x= JQx, \end{aligned}$$ (36)
with \(J=-J^\top =K^{-1} L\) and \(Q=Q^\top \).

2. (a) If L and thus also N are invertible, then by (31) we have \(M=NP^\top L^{-1}\), and multiplying with \(N^{-1}\) from the left we get the DAE
$$\begin{aligned} P^\top J\dot{x}=P^\top L^{-1}K {\dot{x}} = S^\top x \end{aligned}$$ (37)
with \(J:=L^{-1}K = -\left( L^{-1}K \right) ^\top \).

   (b) If additionally P is invertible, then with \(Q=SP^{-1}= P^{-\top } S^\top \) this may be rewritten as
$$\begin{aligned} J{\dot{x}} = Qx, \end{aligned}$$ (38)
which is the standard symplectic formulation of a Hamiltonian system in case additionally J is invertible, see, e.g., [2]. (A small numerical illustration of these special cases is sketched after this list.)
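The following sketch (hypothetical data constructed so that \(KL^\top =-LK^\top \) and \(P^\top S=S^\top P\), with K and P invertible) illustrates the special cases above: \(Q=SP^{-1}\) is symmetric, \(J=K^{-1}L\) is skew-symmetric, and the Poisson form (36) applies.

```python
# Sketch (hypothetical data constructed so that K L^T = -L K^T and P^T S = S^T P,
# with K and P invertible): Q := S P^{-1} is symmetric, J := K^{-1} L is
# skew-symmetric, and the Poisson form (36), xdot = J Q x, is recovered.
import numpy as np

K = np.array([[1.0, 0.5], [0.0, 1.0]])
L = np.array([[-1.0, 2.0], [-2.0, 0.0]])       # L = K J0 with J0 skew
P = np.array([[1.0, 1.0], [0.0, 1.0]])
S = np.array([[2.0, 3.0], [1.0, 4.0]])         # S = Q0 P with Q0 symmetric

assert np.allclose(K @ L.T + L @ K.T, 0) and np.allclose(P.T @ S, S.T @ P)

Q = S @ np.linalg.inv(P)                       # symmetric, since P^T S = S^T P
J = np.linalg.inv(K) @ L                       # skew-symmetric, since K L^T = -L K^T
assert np.allclose(Q, Q.T) and np.allclose(J, -J.T)
# Poisson formulation (36): xdot = J @ Q @ x
```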
5.2 Relation between representations (10) and (32)
An immediate question that arises is how the representations (10) and (32) are related. We have already seen that if P is invertible then the relationship is obvious, since in this case \(x=Pz\) defines an ordinary state space transformation. However, if P is not invertible, then the representations are not state space equivalent, as the following simple example demonstrates.
Example 31
For \(P=[0]\), \(S=[1]\), \(K=[1]\), \(L=[0]\), we have that (10) is the singular system \(0 \cdot {\dot{z}} = 0 \cdot z\). On the other hand,
and \({\mathcal {D}} \circ {\mathcal {L}}\) is the origin in \({{\mathcal {X}}} \times {\mathcal {X}}\), defining the degenerate DAE system \({\dot{x}}=x=0\).
However, representations (10) (in the parameterizing z variables) and (32) (in the original state variables x) can be shown to be equivalent in the following generalized sense. First note that for any representation P, S of a Lagrange structure there exist nonsingular matrices V, W such that
This is a direct consequence of Lemma 37 that will be presented in the next section. Setting \(\left[ \begin{array}{c} z_1 \\ z_2 \end{array}\right] = W^{-1} z\) and \(\left[ \begin{array}{c} x_1 \\ x_2 \end{array}\right] =V^{-1}x\), it follows that \(z_1=x_1\) and \(z_2=e_2\). After such a transformation, the system \(KP {\dot{z}} = LSz\) takes the form
If we add to the vector \(\left[ \begin{array}{c} x_1 \\ e_2 \end{array}\right] \) the subvector \(x_2\), and if we consider the equations (40) together with the original Lagrange algebraic constraint \(x_2=0\), then the so extended system can be rewritten as
On the other hand, as shown in Sect. 5.1, the extended dHDAE system defined by the Dirac structure \({\mathcal {D}}\) and the Lagrangian structure \({\mathcal {L}}\) in the state space variables x can be expressed as
with \(e_1,e_2\) serving as auxiliary variables. Instead of eliminating \(e_1,e_2\) from these equations, as discussed in Sect. 4.1, we can only eliminate \(e_1\) by premultiplication of (42) by the full row rank matrix
which directly leads to the system (41). This extended equivalence between (10) and (32) is summarized in the following proposition.
Proposition 32
Consider the pHDAE representations (10) and (32) defined by the same Lagrange structure \({\mathcal {L}}\) represented by matrices P, S, and by the same Dirac structure \({\mathcal {D}}\) represented by K, L. Consider a transformation such that P, S and K, L are transformed into the form (39) with corresponding partitioning
where \(x_1=z_1\). Adding to (10) the Lagrange algebraic constraint \(x_2=0\) corresponding to \(x=Pz\), the resulting dHDAE system is given by (41). This system is equivalent to the representation (29) of (32) after elimination of the variables \(e_1\).
Note that the subvector \(e_2=z_2\) can be regarded as the Lagrange multiplier vector corresponding to the constraint \(x_2=0\). As such, \(e_2=z_2\) does not contribute to the expression of the Hamiltonian \({\mathcal {H}}^z(z)\).
Let us illustrate the previous discussion with some further examples.
Example 33
Consider a system in the form (10) with \(K= \left[ \begin{array}{cc} I &{}\quad 0 \\ 0 &{}\quad I \end{array}\right] \), \(L= \left[ \begin{array}{cc} 0 &{}\quad I \\ -I &{}\quad 0 \end{array}\right] \), \(P= \left[ \begin{array}{cc} I &{}\quad 0 \\ 0 &{}\quad 0 \end{array}\right] \), \(S= \left[ \begin{array}{cc} I &{}\quad 0 \\ 0 &{}\quad I \end{array}\right] \) which is the general DAE
In order to compute the representation (32), we solve
and with
we get the system
Example 34
Consider the system of the form (10) with \(K= 0\), \(L= I\), \(P= I\), \(S= I\), i.e., \(0 \cdot {\dot{z}} = z\). Solving
yields \(M=I, N=I\), and we obtain the system (32) given by \(0 \cdot {\dot{x}} = x\).
Example 35
Consider the system of the form (10) with \(K= 0\), \(L= I\), \(P= 0\), \(S= I\), i.e., \(0 \cdot {\dot{z}} = z\). Solving
yields \(M=0, N=I\), and thus, the representation (32) is \(0 \cdot {\dot{x}} = x\).
Example 36
Consider the system of the form (10) with \(K= 0\), \(L= \begin{bmatrix}0 &{}\quad I \\ -I &{}\quad 0 \end{bmatrix}\), \(P= I\), \(S= I\), i.e., \(0 \cdot {\dot{z}} = \begin{bmatrix}0 &{}\quad I \\ -I &{}\quad 0 \end{bmatrix}z\). Solving
yields \(M= I, N= \left[ \begin{array}{cc} 0 &{}\quad I \\ -I &{}\quad 0 \end{array}\right] \), and thus, the representation (32) is given by \(0 \cdot {\dot{x}} = \left[ \begin{array}{cc} 0 &{}\quad I \\ -I &{}\quad 0 \end{array}\right] x\).
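Applying the same null-space based elimination to Examples 34 and 35 (scalar case, \(n=1\); the helper below is ours) reproduces the stated representations up to an invertible scaling from the left.

```python
# Sketch: the same null-space elimination applied to Examples 34 and 35 (n = 1).
# Up to an invertible scaling from the left it reproduces 0 * xdot = x in both cases.
import numpy as np
from scipy.linalg import null_space

def representation_in_x(K, L, P, S):           # our helper, cf. (32)
    n = K.shape[0]
    MN = null_space(np.hstack([L.T, -P])).T
    return MN[:, :n] @ K, MN[:, n:] @ S.T      # (M K, N S^T)

Z, I = np.zeros((1, 1)), np.eye(1)
print(representation_in_x(Z, I, I, I))         # Example 34: (0, +-0.707), i.e. 0*xdot = x
print(representation_in_x(Z, I, Z, I))         # Example 35: (0, +-1),     i.e. 0*xdot = x
```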
6 Equivalence transformations and condensed forms
To characterize the properties of extended dHDAEs, we use transformations to condensed forms from which the properties can be read off.
For general DAEs (1) given by matrix pairs (E, A), \(E,A\in {\mathbb {R}}^{\ell ,n}\) (or the representation via matrix pencils \(\lambda E-A\)), we can perform equivalence transformations of the coefficients of the form
with \(U\in {\mathbb {R}}^{\ell ,\ell }\), \(W\in {\mathbb {R}}^{n,n}\) nonsingular. This corresponds to a scaling of the equation with \(U^\top \) and a change of variables \(z= W{{\tilde{z}}}\). Under such transformations, there is a one-to-one relationship between the solution spaces, see [25], and the canonical form is the Weierstraß canonical form.
For structured systems of the form (2), the associated equivalence transformation that preserves the structure is of the form
with \(U\in {\mathbb {R}}^{\ell ,\ell }\), \(W\in {\mathbb {R}}^{n,n}\) nonsingular. A condensed form for this case has been presented in [29].
Finally, for systems of the form (10), the equivalence transformations have the form
where \(U\in {\mathbb {R}}^{\ell ,\ell }\), \(V \in {\mathbb {R}}^{n,n}\), \(W\in {\mathbb {R}}^{m,m}\) are nonsingular.
The geometric interpretation of the set of transformations in (44) is clear: V defines a coordinate transformation on the state space \({\mathcal {X}}\) while \(V^{-\top }\) is the corresponding dual transformation on the dual state space \({\mathcal {X}}^*\). Also note that the combination of V and \(V^{-\top }\) on the product space \({\mathcal {X}} \times {\mathcal {X}}^*\) leaves the canonical bilinear forms defined by the matrices \(\Pi _-\) and \(\Pi _+\) invariant. (In fact, it can be shown that any transformation on \({\mathcal {X}} \times {\mathcal {X}}^*\) that leaves both canonical bilinear forms invariant is necessarily of this form for some invertible V.) Finally, \(U^\top \) is an invertible transformation on the equation space for the kernel representation of the Dirac structure \({\mathcal {D}}\), while W is an invertible transformation on the parametrization space \({\mathcal {Z}}\) for the Lagrange structure \({\mathcal {L}}\).
In all three cases, in view of an implementation of the transformations as numerically stable procedures, we are also interested in the case that U, V, W are real orthogonal matrices. We then have that \(V^{-1}=V^\top \) and for both pairs (K, L) and (P, S) this is a classical orthogonal equivalence transformation.
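The following sketch illustrates such a structure-preserving transformation numerically; the explicit transformation rule used below, \((K,L,P,S)\mapsto (U^\top KV,\ U^\top LV^{-\top },\ V^{-1}PW,\ V^\top SW)\), is our reading of the geometric description above and should be treated as an assumption to be checked against (44).

```python
# Sketch (hypothetical data; the explicit rule below is our reading of the
# geometric description of (44), treat it as an assumption): the transformation
# K -> U^T K V, L -> U^T L V^{-T}, P -> V^{-1} P W, S -> V^T S W preserves the
# structure conditions and maps the pencil lambda K P - L S to U^T(lambda K P - L S)W.
import numpy as np

rng = np.random.default_rng(2)
n = 3
K = rng.standard_normal((n, n))
J0 = rng.standard_normal((n, n)); J0 = J0 - J0.T
L = K @ J0                                     # ensures K L^T = -L K^T
P = rng.standard_normal((n, n))
Q0 = rng.standard_normal((n, n)); Q0 = Q0 + Q0.T
S = Q0 @ P                                     # ensures P^T S = S^T P

U, V, W = (np.linalg.qr(rng.standard_normal((n, n)))[0] for _ in range(3))
Kt, Lt = U.T @ K @ V, U.T @ L @ np.linalg.inv(V).T
Pt, St = np.linalg.inv(V) @ P @ W, V.T @ S @ W

assert np.allclose(Kt @ Lt.T + Lt @ Kt.T, 0)   # skew-symmetry condition preserved
assert np.allclose(Pt.T @ St, St.T @ Pt)       # symmetry condition preserved
assert np.allclose(Kt @ Pt, U.T @ K @ P @ W)   # pencil transformed by U^T ( . ) W
```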
Using the described equivalence transformations, we can derive condensed forms for pencils \(\lambda P -S\) with \(P^\top S= S^\top P\) associated with Lagrange subspaces (or isotropic subspaces if the dimension is not n, see, e.g., [29]). Here we slightly modify the representation and also give a constructive proof that can be implemented as a numerically stable algorithm in Appendix A.
Lemma 37
Let \(P,S\in {\mathbb {R}}^{n,m}\) be such that \(P^\top S=S^\top P\). Then, there exist invertible matrices \(V\in {\mathbb {R}}^{n,n}\), \(W\in {\mathbb {R}}^{m,m}\) such that
with \(\left[ \begin{array}{ccc} S_{51}&\quad S_{52}&\quad S_{53}\end{array}\right] \) of full row rank \(n_5\). (Note that block sizes may be zero.) Moreover, if the pencil \(\lambda P-S\) is regular, then the condensed form is unique, except for the order of blocks, and just contains the first four block rows and columns.
Proof
See Appendix A. \(\square \)
Note that the condensed form is in general not unique in the fifth block row, but the block sizes \(m_1,m_2,m_3,m_4\) and the row dimension \(n_5\) are.
Corollary 38
Let \(P,S\in {\mathbb {R}}^{n,m}\) be such that \(P^\top S=S^\top P\). Then, there exist real orthogonal matrices \(V\in {\mathbb {R}}^{n,n}\), \(W\in {\mathbb {R}}^{m,m}\) such that
with \(P_{11},S_{11}\in {\mathbb {R}}^{m_1+m_2,m_1+m_2}\), \(P_{22} \in {\mathbb {R}}^{m_3,m_3}\), \(S_{33} \in {\mathbb {R}}^{m_4,m_4}\) invertible, \(\left[ \begin{array}{cc} S_{41}&\quad S_{42} \end{array}\right] \) of full row rank \(n_5\), and \(P_{11}^\top S_{11}=S_{11}^\top P_{11}\). Here the block sizes \(m_1+m_2\), \(m_3\), \(m_4\), and \(n_5\) are as in (45).
Proof
The proof follows by performing Steps 1. and 2. of the proof of Lemma 37, see Appendix A, which yields
followed by a singular value decomposition \({{\hat{V}}}^\top _3{{\hat{S}}}_{11} {{\hat{W}}}_3= \left[ \begin{array}{cc} {\check{S}}_{11} &{}\quad 0 \\ 0 &{} 0 \end{array}\right] \) with \({\check{S}}_{11}\) nonsingular diagonal and a full rank decomposition \({\check{V}}_4^\top {{\hat{S}}}_{31}{{\hat{W}}}_3 =\left[ \begin{array}{cc} S_{41}&\quad S_{42} \end{array}\right] \). \(\square \)
Corollary 38 shows that the characteristic quantities \(m_1+m_2\), \(m_3\) and \(m_4\), as well as \(n_5\) can be obtained by purely real orthogonal transformations. The quantities \(m_1,m_2\) can then be determined from the real orthogonal staircase form of the symmetric pencil \(\lambda P_{11}^\top S_{11} -S_{11}^\top P_{11}\) which has been presented in [8] and implemented as production software in [7].
There is an analogous condensed form for pencils of the form \(\lambda K-L\) satisfying \(KL^\top =-LK^\top \). For the case of regular pairs, this directly follows from the canonical form presented in [11], but again we present the construction so that it can be directly implemented as a numerical method, see Appendix B.
Lemma 39
Let \(K,L\in {\mathbb {R}}^{\ell ,n}\) be such that \(KL^\top =-LK^\top \). Then, there exist invertible matrices \(U\in {\mathbb {R}}^{\ell ,\ell }\), \(V\in {\mathbb {R}}^{n,n}\) such that
with \(\left[ \begin{array}{c} L_{15} \\ L_{25} \\ L_{35}\end{array}\right] \) of full column rank \(n_5\). (Note that block sizes may be zero.) Moreover, if the pencil \(\lambda K-L\) is regular, then the condensed form is unique except for the order of blocks and just contains the first four block rows and columns.
Proof
See Appendix B. \(\square \)
Note again that the form (47) is not unique in general, but the block sizes \(\ell _1,\ell _2,\ell _3,\ell _4\) and the column dimension \(n_5\) are.
Corollary 40
Let \(K,L\in {\mathbb {R}}^{\ell ,n}\) be such that \(LK^\top =-KL^\top \). Then, there exist real orthogonal matrices \(U\in {\mathbb {R}}^{\ell ,\ell }\), \(V\in {\mathbb {R}}^{n,n}\) such that
with \(K_{11},L_{11}\in {\mathbb {R}}^{2\ell _1,2\ell _1}\), \(K_{22} \in {\mathbb {R}}^{\ell _3,\ell _3}\), \(L_{33} \in {\mathbb {R}}^{\ell _4,\ell _4}\) invertible, \(\left[ \begin{array}{c} L_{14} \\ L_{24} \end{array}\right] \) of full column rank \(n_5\), and \(K_{11}^\top L_{11}=-L_{11}^\top K_{11}\). Here the block sizes \(\ell _1\), \(\ell _3\), \(\ell _4\), and \(n_5\) are as in (47).
Proof
The proof follows by performing Steps 1. and 2. of the proof of Lemma 39, which yields
followed by a singular value decomposition \({{\hat{U}}}^\top _3{{\hat{L}}}_{11} {{\hat{V}}}_4= \left[ \begin{array}{cc} {\check{L}}_{11} &{}\quad 0 \\ 0 &{} 0 \end{array}\right] \) with \({\check{L}}_{11}\) nonsingular diagonal and a full rank decomposition \({{\hat{U}}}_3^\top {{\hat{L}}}_{13}{{\hat{V}}}_3 =\left[ \begin{array}{c} L_{14} \\ L_{24} \end{array}\right] \). \(\square \)
Corollary 40 shows that the characteristic quantities \(\ell _1\), \(\ell _3\), \(\ell _4\), as well as \(n_5\) can be obtained by purely real orthogonal transformations.
The presented condensed forms can now be used in generating a condensed form for systems of the form (10).
Lemma 41
Consider a system of the form (10) with \(K,P,L,S\in {\mathbb {R}}^{n,n}\) and regular pencil \(\lambda KP-LS\). Then, there exist invertible matrices U, V, W as in (44) such that
where \(S_{11}=S_{11}^\top \) and \(K_{11} L_{11}^\top =-L_{11} K_{11}^\top \).
Proof
Since the pencil \(\lambda KP-LS\) is square and regular, the pencil \(\lambda P-S\) is also regular; otherwise, by Lemma 37, \(P\) and \(S\) would have a common kernel, which would imply that the pencil \(\lambda KP-LS\) is singular.
Thus, by Lemma 37 there exist nonsingular matrices \(W_1, V_1\in {\mathbb {R}}^{n,n}\) such that
with \(n_1= m_1+m_2+m_3\), \(n_2 =m_4\) and \({{\tilde{S}}}_{11}\) symmetric. The regularity of the pencil \(\lambda KP-LS\) implies that
has full column rank, and hence, there exist invertible matrices \(U_2\in {\mathbb {R}}^{n,n}\), \({{\tilde{V}}}_2\in {\mathbb {R}}^{n_2,n_2}\), and
such that
With
we then get that
has the desired form with \(U=U_2\), \(V=V_1V_2V_3\), \(W=W_1\) and where \(S_{11}=S_{11}^\top \) and \(K_{11} L_{11}^\top =-L_{11} K_{11}^\top \). \(\square \)
Transforming the system as in (49) and setting
partitioned accordingly, from the first block row of the coefficient matrices we obtain a reduced system given by
with \({\bar{P}}=I_{n_1}\), \({\bar{S}} ={\bar{S}}^\top =S_{11}\), \({\bar{K}}=K_{11}\) and \({\bar{L}}= L_{11}\), together with an equation \(z_2=K_{21} \dot{z}_1\), where \(z_2\) does not contribute to the Hamiltonian \({\mathcal {H}}^z(z)=\frac{1}{2} z^\top P^\top S z\). Note that the second equation is an index two constraint, because it uses the derivative of \(z_1\), [25]. It arises from the Lagrange structure due to the singularity of P.
An analogous representation can be constructed from the condensed form of Lemma 39.
Lemma 42
Consider a system of the form (10) with \(K,P,L,S\in {\mathbb {R}}^{n,n}\) and regular pencil \(\lambda KP-LS\). Then, there exist invertible matrices U, V, W as in (44) such that
where \(L_{11}=-L_{11}^\top \) and \(P_{11}^\top S_{11}=S_{11}^\top P_{11}\).
Proof
Since the pencil \(\lambda KP-LS\) is square and regular, the pencil \(\lambda K-L\) is also regular; otherwise, by Lemma 39, \(K\) and \(L\) would have a common left nullspace, which would imply that the pencil \(\lambda KP-LS\) is singular.
Thus, by Lemma 39, there exist nonsingular matrices \(U_1, V_1\in {\mathbb {R}}^{n,n}\) such that
with \(n_1= 2\ell _1+\ell _3\), \(n_2 =\ell _4\) and \({{\tilde{L}}}_{11}\) skew-symmetric. The regularity of the pencil \(\lambda KP-LS\) implies that
has full row rank and hence there exist invertible matrices \(W_2\in {\mathbb {R}}^{n,n}\), \({{\tilde{V}}}_2\in {\mathbb {R}}^{n_2,n_2}\), and
such that
With
we then get that
has the desired form with \(W=W_2\), \(V=V_1V_2V_3\), \(U=U_1\) and where \(L_{11}=-L_{11}^\top \) and \(P_{11}^\top S_{11}=S_{11}^\top P_{11}\). \(\square \)
Transforming \(KP\dot{z}=LSz\) as in (51) and setting
partitioned accordingly, from the first block row of the coefficient matrices we obtain a reduced system given by
with \({\bar{P}}=P_{11}\), \({\bar{S}} =S_{11}\), \({\bar{K}}=I_{n_1}\) and \({\bar{L}}= L_{11}=-L_{11}^\top \), together with an algebraic equation \(0=z_2\), so that \(z_2\) does not contribute to the Hamiltonian.
Remark 43
System (50) is a dHDAE of the form (35) in which the Lagrange structure is spanned by the columns of
See also [30, 32] for similar constructions in the context of removing the factor Q in systems of the form (2).
Similarly, System (52) is a pHDAE of the form (2) (with \(R=0\)) and the Dirac structure is spanned by the columns of
We also perform a similar construction for systems of the form (27). Since C and D are chosen such that \(\left[ \begin{array}{cc} C&\quad D \end{array}\right] \) is a maximal annihilator satisfying \(C N^\top + D M^\top =0\) in (26), and since \(MN^\top +NM^\top \ge 0\) with \({{\,\textrm{rank}\,}}\left[ \begin{array}{cc} N&\quad M \end{array}\right] =n\), we can use the same construction as in the proof of Lemma 39 to first transform N and M in such a way that
This implies that we may choose C and D such that
If \(\lambda CP-DS\) is regular, then it follows that the last \(n_2\) rows of CP have full row rank, and hence, altogether we have the following condensed form.
Lemma 44
Consider a system of the form (27) with \(C,P,D,S\in {\mathbb {R}}^{n,n}\) and regular pencil \(\lambda CP-DS\) and \(\left[ \begin{array}{cc} C&D\end{array}\right] \) a maximal annihilator as in (26). Then, there exist invertible matrices U, V, W as in (44) such that
where \(C_{11}=-C_{11}^\top \) and \(P_{11}^\top S_{11}= S_{11}^\top P_{11}\).
As a consequence, by transforming \(CP\dot{z}=DSz\) as in (53) and setting
partitioned accordingly, from the second block row of the coefficient matrices we obtain \(\dot{z}_2=0\), i.e., \(z_2\) is a constant function and the first block row gives an inhomogeneous reduced system
with \({\bar{P}}=P_{11}\), \({\bar{S}} =S_{11}\), \({\bar{P}}^\top {\bar{S}}={\bar{S}}^\top {\bar{P}}\), \({\bar{C}}=-M_{11}^\top \) and \({\bar{D}}= I_{n_1}\). It then depends on the initial condition for \(z_2\) whether \(z_2=0\), in which case it does not contribute to the Hamiltonian; otherwise, the Hamiltonian is still a quadratic function in \(z_1\) plus some linear and constant terms.
Remark 45
The condensed forms in this section require rank decisions. Even if these are computed in a numerically stable way using singular value decompositions, they can come out wrong in finite precision arithmetic. A common strategy in case of doubt is to assume the worst-case scenario, which for the condensed forms considered here means treating the problem as a DAE of index two.
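A minimal sketch of such an SVD-based rank decision is given below; the relative tolerance and the gap diagnostic returned for flagging doubtful decisions are illustrative choices and not prescribed by the constructions above.

```python
import numpy as np

def numerical_rank(A, rtol=None):
    """Rank decision via the singular value decomposition (illustrative sketch).

    Returns the numerical rank and the gap between the smallest accepted and
    the largest rejected singular value; a small gap signals an unreliable
    decision, in which case the worst case (index two) should be assumed."""
    s = np.linalg.svd(A, compute_uv=False)
    if s.size == 0 or s[0] == 0.0:
        return 0, np.inf
    if rtol is None:
        rtol = max(A.shape) * np.finfo(float).eps
    r = int(np.sum(s > rtol * s[0]))
    gap = s[r - 1] / s[r] if 0 < r < s.size else np.inf
    return r, gap
```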
In this section, we have derived structured condensed forms and shown that these can also be used to identify a subsystem which is of one of the well-established forms plus an algebraic constraint whose solution does not contribute to the Hamiltonian. In the next section, we analyze when general DAEs can be transformed into the forms (2) or (10).
7 Representation of DAEs into the form \(KP\dot{z}=LS z\) or \(\frac{d}{dt}(Ez) =(J-R)Qz\)
For general DAE systems \(E\dot{x}=Ax\), it was characterized in [30] when such a system is equivalent to a dHDAE system of the form (2). Here we present a simplified result for the regular case.
Theorem 46
i) A regular pencil \(L(\lambda )=\lambda {{\hat{E}}}-{{\hat{A}}}\) is equivalent to a pencil of the form \(\lambda E-(J-R)Q\) as in (2) with \(\lambda E-Q\) being regular if and only if the following conditions are satisfied:
1. The spectrum of \(L(\lambda )\) is contained in the closed left half plane.
2. The finite nonzero eigenvalues on the imaginary axis are semisimple, and the partial multiplicities of the eigenvalue zero are at most two.
3. The index of \(L(\lambda )\) is at most two.
ii) A regular pencil \(L(\lambda )=\lambda {{\hat{E}}}-{{\hat{A}}}\) is equivalent to a pencil of the form \(\lambda E-(J-R)\) as in (2) (i.e., with \(Q=I\)) if and only if the following conditions are satisfied:
1. The spectrum of \(L(\lambda )\) is contained in the closed left half plane.
2. The finite eigenvalues on the imaginary axis (including zero) are semisimple.
3. The index of \(L(\lambda )\) is at most two.
As a corollary for the case without dissipation, we have the following result.
Corollary 47
A regular pencil \(L(\lambda ) =\lambda {{\hat{E}}} -{{\hat{A}}}\) is equivalent to a pencil of the form \(\lambda E-J\) as in (2) (with \(Q=I,R=0\)) if and only if the following conditions are satisfied:
1. All finite eigenvalues are on the imaginary axis and semisimple.
2. The index of \(L(\lambda )\) is at most two.
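The eigenvalue conditions in Theorem 46 ii) and Corollary 47 can be tested numerically, at least heuristically. The following sketch checks the location of the finite spectrum and the semisimplicity of the eigenvalues on the imaginary axis, but not the index condition, which requires, e.g., a staircase algorithm; the tolerances and the crude eigenvalue clustering are ad hoc choices.

```python
import numpy as np
from scipy.linalg import eigvals

def check_spectral_conditions(E, A, tol=1e-8):
    """Heuristic test of conditions 1.-2. of Theorem 46 ii) for a regular
    pencil lambda*E - A (sketch only; the index condition is not tested).

    Returns (in_closed_left_half_plane, imaginary_eigenvalues_semisimple)."""
    n = E.shape[0]
    lam = eigvals(A, E)
    lam = lam[np.isfinite(lam)]                     # discard the eigenvalue infinity
    in_clhp = bool(np.all(lam.real <= tol))
    semisimple = True
    imag_axis = lam[np.abs(lam.real) <= tol]
    for mu in np.unique(np.round(imag_axis, 6)):    # crude clustering of eigenvalues
        alg = int(np.sum(np.abs(imag_axis - mu) <= 1e-6))
        geo = n - np.linalg.matrix_rank(A - mu * E)
        semisimple = semisimple and (alg == geo)
    return in_clhp, semisimple
```

For Corollary 47, one additionally checks that every finite eigenvalue lies (numerically) on the imaginary axis.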
To study when general regular DAEs of the form (1) can be expressed as extended dHDAEs of the form (10), we first consider a condensed form under orthogonal equivalence.
Theorem 48
Consider a regular pencil \(\lambda E-A\) with \(E,A\in {\mathbb {R}}^{n,n}\) of index at most two. Then, there exist real orthogonal matrices \(U\in {\mathbb {R}}^{n,n}\) and \(V\in {\mathbb {R}}^{n,n}\) such that
with \(A_{14}\in {\mathbb {R}}^{n_1,n_1}\), \(A_{41}\in {\mathbb {R}}^{n_1,n_1}\), \(E_{22}\in {\mathbb {R}}^{n_2,n_2}\), and \(A_{33}\in {\mathbb {R}}^{n_3,n_3}\) invertible.
Proof
The proof is presented in Appendix C. \(\square \)
Transforming the DAE (1) as \(U^\top EV V^\top \dot{x}= U^\top A V V^\top x\) and setting \(V^\top x= [x_1^\top , \ldots , x_4^\top ]^\top \), it follows that \(x_1=0\), \(x_2\) is determined from the implicit ordinary differential equation (note that \(E_{22}\) is invertible)
\(x_3= -A_{33}^{-1} A_{32} x_2\), and \(x_4\) is uniquely determined in terms of \(x_2,\dot{x}_2, x_3\). Initial conditions can be prescribed freely for \(x_2\) only.
Corollary 49
Consider a general regular pencil \(\lambda E-A\) with \(E,A\in {\mathbb {R}}^{n,n}\) that is of index at most two and for which all finite eigenvalues are in the closed left half plane and those on the imaginary axis are semi-simple. Then, there exist invertible matrices \(U\in {\mathbb {R}}^{n,n}\) and \(V\in {\mathbb {R}}^{n,n}\) such that
where
Proof
The proof follows by considering the condensed form (55), and using block elimination with the invertible matrices \(A_{33}\), \(A_{14}\), \(A_{41}\), \(E_{22}\) to transform the pencil in (55) to the form
Let \(U_1^\top {{\tilde{E}}}_{11} V_1= \left[ \begin{array}{cc} I_{{\hat{n}}_1}&{} 0 \\ 0 &{}\quad 0 \end{array}\right] \) be the echelon form of \({{\tilde{E}}}_{11}\). We scale the first block row with \(U_1^\top \), the fourth block row with \(V_1^{-1}\), the first block column with \(V_1\), and the fourth block column with \(U_1^{-\top }\), and obtain a form
For any positive definite solution X of the Lyapunov inequality
one can multiply the second block row by X and obtain that \(E_{33}=X\) and \(A_{33} =X{{\hat{A}}}_{33}\) has the desired form, see, e.g., [1, 3]. \(\square \)
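As an illustration of this last step, the following sketch constructs a dissipative Hamiltonian representation for the block \({{\hat{A}}}_{33}\) in the simplest situation that all its eigenvalues lie in the open left half plane, so that a strict Lyapunov solution exists; the function name and the right-hand side \(-I\) are illustrative choices and not part of the proof.

```python
import numpy as np
from scipy.linalg import solve_continuous_lyapunov

def dh_form_from_stable_block(Ahat):
    """Given Ahat with spectrum in the open left half plane, return (E33, J, R)
    with E33 = X symmetric positive definite, J skew-symmetric, R symmetric
    positive semidefinite and J - R = X @ Ahat, as in the proof of Corollary 49
    (sketch for the strictly stable case only)."""
    n = Ahat.shape[0]
    # solve Ahat^T X + X Ahat = -I, a strict version of the Lyapunov inequality
    X = solve_continuous_lyapunov(Ahat.T, -np.eye(n))
    X = (X + X.T) / 2                  # symmetrize against roundoff
    A33 = X @ Ahat                     # = J - R
    J = (A33 - A33.T) / 2              # skew-symmetric part
    R = -(A33 + A33.T) / 2             # = I/2, positive semidefinite by construction
    return X, J, R
```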
Note that the transformation to a system of the form (2) can also be achieved in a similar way for singular pencils with zero minimal indices.
If there is no dissipation, i.e., if \(R_{33}=0\), then \(A_{33}\) is skew-symmetric.
In Corollary 49, we have shown that general systems \(E\dot{x}=Ax\) can be transformed to a very special canonical form and the following remark shows that for the case \(R_{33}=0\) each of the blocks in the canonical form can be expressed as a pencil of the form \(\lambda KP-LS\).
Remark 50
Consider a regular pencil \(\lambda E-A\) in the form (57). Then, after a permutation, one obtains four blocks, each of which can be written in the form \(\lambda KP-LS\) as in (10).
(1) We have
$$\begin{aligned} \left[ \begin{array}{cc} I_{{{{\hat{n}}}}_1} &{}\quad 0 \\ 0 &{}\quad 0 \end{array}\right] \left[ \begin{array}{c} \dot{z}_1 \\ \dot{z}_5 \end{array}\right] - \left[ \begin{array}{cc} 0 &{}\quad I_{{{{\hat{n}}}}_1} \\ -I_{{{{\hat{n}}}}_1} &{}\quad 0 \end{array}\right] \left[ \begin{array}{c} z_1 \\ z_5 \end{array}\right] = K_{1}P_{1}\left[ \begin{array}{c} \dot{z}_1 \\ \dot{z}_5 \end{array}\right] - L_{1}S_{1}\left[ \begin{array}{c} z_1 \\ z_5 \end{array}\right] \end{aligned}$$with
$$\begin{aligned} K_1=\left[ \begin{array}{cc} I_{{{{\hat{n}}}}_1} &{}\quad 0 \\ 0 &{}\quad I_{{{{\hat{n}}}}_1} \end{array}\right] ,\ L_1= \left[ \begin{array}{cc} 0 &{}\quad I_{{{{\hat{n}}}}_1} \\ -I_{{{{\hat{n}}}}_1} &{}\quad 0 \end{array}\right] ,\ P_1= \left[ \begin{array}{cc} I_{{{{\hat{n}}}}_1} &{}\quad 0 \\ 0 &{}\quad 0 \end{array}\right] , \ S_1= \left[ \begin{array}{cc} I_{{{{\hat{n}}}}_1} &{}\quad 0 \\ 0 &{}\quad I_{{{{\hat{n}}}}_1} \end{array}\right] \end{aligned}$$and Hamiltonian \({\mathcal {H}}^z_1= \frac{1}{2} \left[ \begin{array}{c} z_1 \\ z_5\end{array}\right] ^\top \left[ \begin{array}{cc} I_{{{{\hat{n}}}}_1} &{}\quad 0 \\ 0 &{}\quad 0 \end{array}\right] \left[ \begin{array}{c} z_1 \\ z_5 \end{array}\right] ={ \frac{1}{2} z_1^\top z_1}\) which is actually 0 along solutions, since the second block equation forces \(z_1=0\). In this case, we can insert the derivative of the second equation into the first (index reduction) and obtain
$$\begin{aligned} \left[ \begin{array}{cc} 0 &{}\quad 0 \\ 0 &{}\quad 0 \end{array}\right] \left[ \begin{array}{c} \dot{z}_1 \\ \dot{z}_5 \end{array}\right] = \left[ \begin{array}{cc} 0 &{}\quad I_{{{{\hat{n}}}}_1} \\ -I_{{{{\hat{n}}}}_1} &{}\quad 0 \end{array}\right] \left[ \begin{array}{c} z_1 \\ z_5 \end{array}\right] \end{aligned}$$without changing the Hamiltonian.
(2) We have
$$\begin{aligned} \left[ \begin{array}{cc} 0 &{}\quad 0 \\ 0 &{}\quad 0 \end{array}\right] \left[ \begin{array}{c} \dot{z}_2 \\ \dot{z}_6 \end{array}\right] -\left[ \begin{array}{cc} 0 &{}\quad I_{{{{\hat{n}}}}_2} \\ -I_{{{{\hat{n}}}}_2} &{}\quad 0 \end{array}\right] \left[ \begin{array}{c} z_2 \\ z_6 \end{array}\right] = K_{2}P_{2}\left[ \begin{array}{c} \dot{z}_2 \\ \dot{z}_6 \end{array}\right] - L_{2}S_{2}\left[ \begin{array}{c} z_2 \\ z_6 \end{array}\right] \end{aligned}$$with different possibilities of representation, e.g., a)
$$\begin{aligned} K_2=\left[ \begin{array}{cc} I_{{{{\hat{n}}}}_2} &{}\quad 0 \\ 0 &{}\quad I_{{{{\hat{n}}}}_2} \end{array}\right] ,\ L_2= \left[ \begin{array}{cc} 0 &{}\quad I_{{{{\hat{n}}}}_2} \\ -I_{{{{\hat{n}}}}_2} &{}\quad 0 \end{array}\right] ,\ P_2= \left[ \begin{array}{cc} 0 &{}\quad 0 \\ 0 &{}\quad 0 \end{array}\right] , \ S_2= \left[ \begin{array}{cc} I_{{{{\hat{n}}}}_2} &{}\quad 0 \\ 0 &{}\quad I_{{{{\hat{n}}}}_2} \end{array}\right] \end{aligned}$$and Hamiltonian \({\mathcal {H}}^z_2= \frac{1}{2} \left[ \begin{array}{c} z_2 \\ z_6\end{array}\right] ^\top \left[ \begin{array}{cc} 0 &{}\quad 0 \\ 0 &{}\quad 0 \end{array}\right] \left[ \begin{array}{c} z_2 \\ z_6 \end{array}\right] =0\), or b)
$$\begin{aligned} K_2=\left[ \begin{array}{cc} 0 &{}\quad 0 \\ 0 &{}\quad 0 \end{array}\right] ,\ L_2= \left[ \begin{array}{cc} 0 &{}\quad I_{{{{\hat{n}}}}_2} \\ -I_{{{{\hat{n}}}}_2} &{}\quad 0 \end{array}\right] ,\ P_2= \left[ \begin{array}{cc} 0 &{}\quad 0 \\ 0 &{}\quad 0 \end{array}\right] , \ S_2= \left[ \begin{array}{cc} I_{{{{\hat{n}}}}_2} &{}\quad 0 \\ 0 &{}\quad I_{{{{\hat{n}}}}_2} \end{array}\right] \end{aligned}$$and Hamiltonian \({\mathcal {H}}^z_2= \frac{1}{2} \left[ \begin{array}{c} z_2 \\ z_6\end{array}\right] ^\top \left[ \begin{array}{cc} 0 &{}\quad 0 \\ 0 &{}\quad 0 \end{array}\right] \left[ \begin{array}{c} z_2 \\ z_6 \end{array}\right] =0\).
(3) We have \(\lambda E_{33}-A_{33}= \lambda K_{3}P_{3}- L_{3}S_{3}\) with \(K_3=I_{{{{\hat{n}}}}_3}\), \(P_3=E_{33}\), \(L_3= A_{33}\), \(S_{3}= I_{{{{\hat{n}}}}_3}\). Here the Hamiltonian is \({\mathcal {H}}^z_3= \frac{1}{2} z_3^\top E_{33} z_3\).
(4) We have \(\lambda 0 -I_{{{{\hat{n}}}}_4}= \lambda K_{4}P_{4}- L_{4}S_{4}\) with \(K_4=0\), \(P_4=I_{{{{\hat{n}}}}_4}\), \(L_4=I_{{{{\hat{n}}}}_4}\), \(S_4=I_{{{{\hat{n}}}}_4}\). Here the Hamiltonian is \({\mathcal {H}}^z_4= \frac{1}{2} z_4^\top z_4\), which also vanishes along solutions, since the algebraic equation forces \(z_4=0\).
Note that the presented representations are in no way unique, but if the condensed form is available or computable, and the properties of Theorem 48 hold, then we can express the general DAE in the representation (10) or (2).
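For completeness, the structural conditions \(KL^\top =-LK^\top \) and \(P^\top S=S^\top P\) underlying the form (10) can be verified directly for these representations; the following minimal numerical check does this for case (1) of Remark 50 (the block size \({{\hat{n}}}_1=2\) is an arbitrary choice for illustration).

```python
import numpy as np

# Minimal numerical check of the structural conditions K L^T = -L K^T and
# P^T S = S^T P for the representation in case (1) of Remark 50
# (illustration only; n1 = 2 is an arbitrary choice).
n1 = 2
I, Z = np.eye(n1), np.zeros((n1, n1))
K1 = np.block([[I, Z], [Z, I]])
L1 = np.block([[Z, I], [-I, Z]])
P1 = np.block([[I, Z], [Z, Z]])
S1 = np.block([[I, Z], [Z, I]])

assert np.allclose(K1 @ L1.T, -(L1 @ K1.T))   # Dirac-type structure
assert np.allclose(P1.T @ S1, (P1.T @ S1).T)  # Lagrange-type structure
# the pencil lambda*K1@P1 - L1@S1 reproduces lambda*[[I,0],[0,0]] - [[0,I],[-I,0]]
assert np.allclose(K1 @ P1, np.block([[I, Z], [Z, Z]]))
assert np.allclose(L1 @ S1, np.block([[Z, I], [-I, Z]]))
```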
This discussion yields the following useful corollary.
Corollary 51
Consider a regular pencil of the form \(\lambda KP-LS\) associated with the dHDAE (10). Then, it has index at most two, and index two can only occur if the system has a singular Lagrange structure.
Proof
Consider the representations in Remark 50. Then, the index two structure occurs only in the first case, where \(K_1,L_1\) are invertible, but the product \(P_1^\top S_1\) is singular. Thus, index two arises only from a singular Lagrange structure. \(\square \)
8 Conclusion and outlook
Different definitions of (extended, dissipative) Hamiltonian or port-Hamiltonian differential–algebraic systems lead to different representations. We have collected all the known representations as well as a few new ones and analyzed them from a geometric as well as an algebraic point of view. The latter leads to condensed forms that can be directly implemented in numerical algorithms to compute the structural properties of the systems. We have also studied the effect that different representations have on the index of the differential–algebraic system as well as on the associated Hamiltonian function. In general, certain algebraic constraints do not contribute to the Hamiltonian and can therefore be separated from the system in an appropriate coordinate system. We have also characterized when a general differential–algebraic system can be transformed into the different representations. Several important tasks remain open. These include extensions to the case of non-regular systems, which can be based on the results and methods in Appendices A and B that are already proved for the non-square case. Extensions to systems with inputs and outputs, as well as to linear time-varying and nonlinear systems, are currently under consideration.
References
Achleitner F, Arnold A, Mehrmann V (2021) Hypocoercivity and controllability in linear semi-dissipative ODEs and DAEs. ZAMM Z Angew Math Mech (in press)
Arnol’d VI (2013) Mathematical methods of classical mechanics, vol 60. Springer, New York
Beattie C, Mehrmann V, Van Dooren P (2019) Robust port-Hamiltonian representations of passive systems. Automatica 100:182–186
Beattie C, Mehrmann V, Xu H, Zwart H (2018) Port-Hamiltonian descriptor systems. Math Control Signals Syst 30(17):1–27
Benner P, Byers R, Faßbender H, Mehrmann V, Watkins D (2000) Cholesky-like factorizations of skew-symmetric matrices. Electron Trans Numer Anal 11:85–93
Breedveld PC (2008) Modeling and simulation of dynamic systems using bond graphs. EOLSS Publishers Co. Ltd./UNESCO, Oxford, pp 128–173
Brüll T, Mehrmann V (2007) STCSSP: A FORTRAN 77 routine to compute a structured staircase form for a (skew-)symmetric/(skew-)symmetric matrix pencil. Preprint 31-2007, Institut für Mathematik, TU Berlin
Byers R, Mehrmann V, Xu H (2007) A structured staircase algorithm for skew-symmetric/symmetric pencils. Electron Trans Numer Anal 26:1–13
Camlibel MK, van der Schaft A (2022) Port-Hamiltonian systems and monotonicity. arXiv:2206.09139
Camlibel MK, van der Schaft AJ (2013) Incrementally Port-Hamiltonian systems. In: 52nd IEEE conference on decision and control. IEEE, pp 2538–2543
Courant TJ (1990) Dirac manifolds. Trans Am Math Soc 319(2):631–661
Dai L (1989) Singular control systems, vol 118. Lecture Notes in Control and Inform. Sci. Springer-Verlag, Berlin, Heidelberg
Duindam V, Macchelli A, Stramigioli S, Bruyninckx H (2009) Modeling and control of complex physical systems: the port-Hamiltonian approach. Springer-Verlag, Berlin, Heidelberg
Eberard D, Maschke B, Van Der Schaft A (2007) An extension of pseudo-Hamiltonian systems to the thermodynamic space: towards a geometry of non-equilibrium thermodynamics. Rep Math Phys 60(2):175–198
Egger H, Kugler T (2018) Damped wave systems on networks: exponential stability and uniform approximations. Numer Math 138(4):839–867
Egger H, Kugler T, Liljegren-Sailer B, Marheineke N, Mehrmann V (2018) On structure preserving model reduction for damped wave propagation in transport networks. SIAM J Sci Comput 40:A331–A365
Eich-Soellner E, Führer C (1998) Numerical methods in multibody dynamics. Vieweg+Teubner Verlag, Wiesbaden
Emmrich E, Mehrmann V (2013) Operator differential-algebraic equations arising in fluid dynamics. Comput Methods Appl Math 13(4):443–470
Freund RW (2011) The SPRIM algorithm for structure-preserving order reduction of general RLC circuits. In: Benner P, Hinze M, ter Maten EJW (eds) Model reduction for circuit simulation. Springer-Verlag, Dordrecht, pp 25–52
Gantmacher FR (1959) Theory of matrices, vol 1. Chelsea, New York
Gernandt H, Haller FE, Reis T (2021) A linear relation approach to port-Hamiltonian differential-algebraic equations. SIAM J Matrix Anal Appl 42(2):1011–1044
Golo G, van der Schaft AJ, Breedveld PC, Maschke BM (2003) Hamiltonian formulation of bond graphs. In: Rantzer A, Johansson R (eds) Nonlinear and hybrid systems in automotive control. Springer, Heidelberg, pp 351–372
Günther M, Bartel A, Jacob B, Reis T (2021) Dynamic iteration schemes and port-Hamiltonian formulation in coupled differential-algebraic equation circuit simulation. Int J Circ Theor Appl 49(2):430–452
Jacob B, Zwart H (2012) Linear port-Hamiltonian systems on infinite-dimensional spaces. Operator Theory: Advances and Applications. Birkhäuser, Basel
Kunkel P, Mehrmann V (2006) Differential-algebraic equations. Analysis and Numerical Solution. European Mathematical Society, Zürich
Kunkel P, Mehrmann V (2011) Formal adjoints of linear DAE operators and their role in optimal control. Electron J Linear Algebra 22:672–693
Liesen J, Mehrmann V (2015) Linear algebra. Springer Undergraduate Mathematics Series. Springer-Verlag, Cham
Maschke BM, van der Schaft A (1992) Port-controlled Hamiltonian systems: modelling origins and system theoretic properties. IFAC Proc Vol 25(13):359–365
Mehl C, Mehrmann V, Wojtylak M (2018) Linear algebra properties of dissipative Hamiltonian descriptor systems. SIAM J Matrix Anal Appl 39(3):1489–1519
Mehl C, Mehrmann V, Wojtylak M (2021) Distance problems for dissipative Hamiltonian systems and related matrix polynomials. Linear Algebra Appl, pp 335–366
Mehrmann V, Morandin R (2019) Structure-preserving discretization for port-Hamiltonian descriptor systems. In: 58th IEEE conference on decision and control (CDC), Nice, France, pp 6863–6868
Mehrmann V, Unger B (2023) Control of port-Hamiltonian differential-algebraic systems and applications. Acta Numerica, To appear
Ortega R, van der Schaft AJ, Mareels Y, Maschke BM (2001) Putting energy back in control. Control Syst Mag 21:18–33
van der Schaft AJ (2006) Port-Hamiltonian systems: an introductory survey. In: Verona JL, Sanz-Sole M, Verdura J (eds) Proceedings of the International Congress of Mathematicians, vol. III, Invited Lectures. Madrid, Spain, pp 1339–1365
van der Schaft AJ, Maschke BM (1995) The Hamiltonian formulation of energy conserving physical systems with external ports. Arch. Elektron. Übertragungstech. 45:362–371
van der Schaft AJ, Maschke BM (2013) Port-Hamiltonian systems on graphs. SIAM J Control Optim 51:906–937
Scholz L (2017) Condensed forms for linear port-Hamiltonian descriptor systems. Preprint 09–2017, Institut für Mathematik, Technische Universität Berlin
Van der Schaft A (2017) L2-gain and passivity techniques in nonlinear control, 3rd edn. Springer, Cham
van der Schaft A (2013) Port-Hamiltonian differential-algebraic systems. In: Ilchmann A, Reis T (eds) Surveys in Differential-Algebraic Equations I. Differential-Algebraic Equations Forum. Springer-Verlag, Berlin, Heidelberg, pp 173–226
van der Schaft A, Jeltsema D (2014) Port-Hamiltonian systems theory: An introductory overview. Foundations and Trends in Systems and Control 1(2–3):173–378
van der Schaft A, Maschke B (2018) Generalized port-Hamiltonian DAE systems. Systems Control Lett 121:31–37
van der Schaft A, Maschke B (2020) Dirac and Lagrange algebraic constraints in nonlinear port-Hamiltonian systems. Vietnam J. Mathematics 48(4):929–939
Acknowledgements
We thank two anonymous referees for very valuable comments and suggestions that helped to substantially improve the paper.
Funding
Open Access funding enabled and organized by Projekt DEAL. Research of the first author supported by Deutsche Forschungsgemeinschaft (DFG) through priority Program SPP 1984 within project 790 39-2: Distributed dynamic control of network security.
Contributions
Both authors did the research together; there is no distinction that can be specified.
Ethics declarations
Conflict of interest
There are no competing interests of a financial or personal nature.
Appendix
Appendix A: Proof of Lemma 37
Proof
We present a proof in the form of an algorithmic procedure that can be implemented as a numerical algorithm.
Step 1. Let \(V_1\) and \(W_1\) be real orthogonal matrices such that
where \({{\tilde{P}}}_{11}\in {\mathbb {R}}^{{{\tilde{m}}}_1,{{\tilde{m}}}_1}\) is diagonal with positive diagonal elements. This transformation can be constructed via a singular value decomposition of P and a numerical rank decision.
Then, using the structure (16), it follows that \({{\tilde{P}}}_{11}^\top {{\tilde{S}}}_{11}={{\tilde{S}}}_{11}^\top {{\tilde{P}}}_{11}\) and \({{\tilde{S}}}_{12}=0\).
Step 2. Let \({{\tilde{V}}}_2\), \({{\tilde{W}}}_2\) be real orthogonal matrices such that
where \({{\hat{S}}}_{22}\in {\mathbb {R}}^{m_4,m_4}\) is diagonal with positive diagonal elements. This transformation can be constructed again via a singular value decomposition and a numerical rank decision. Set
and form
with \({{\hat{P}}}_{11}^\top ={{\tilde{P}}}_{11}\).
Step 3. Let
and form
where by the structure (16) now \({\check{S}}_{11}\) is symmetric. Note that although we are working with nonorthogonal transformation matrices in this step, the numerical errors can be controlled, since we are inverting diagonal matrices.
Step 4. Let
be the canonical form of the symmetric matrix \({\check{S}}_{11}\) under congruence which can be obtained by first computing the spectral decomposition and then scaling the nonsingular diagonal parts by congruence to be \(\pm I\), see, e.g., [27].
Furthermore, let
be a full rank decomposition partitioned accordingly, with \({\check{V}}_3\) real orthogonal. Then, set
and form
which is as claimed. \(\square \)
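The congruence transformation used in Step 4 can be realized, for instance, as follows; this is a sketch only, with an ad hoc tolerance, and the returned inertia counts are included purely for illustration.

```python
import numpy as np

def symmetric_congruence_canonical(S, tol=1e-12):
    """Sketch of Step 4: for symmetric S return an invertible X with
    X.T @ S @ X = diag(I_p, -I_q, 0), via the spectral decomposition (cf. [27])."""
    lam, Q = np.linalg.eigh(S)                       # S = Q @ diag(lam) @ Q.T
    scale = np.max(np.abs(lam))
    pos = lam > tol * scale
    neg = lam < -tol * scale
    zer = ~(pos | neg)
    idx = np.concatenate([np.where(pos)[0], np.where(neg)[0], np.where(zer)[0]])
    d = np.ones_like(lam)
    d[pos | neg] = 1.0 / np.sqrt(np.abs(lam[pos | neg]))
    X = Q[:, idx] * d[idx]                           # scale the columns
    return X, int(pos.sum()), int(neg.sum())
```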
Appendix B: Proof of Lemma 39
Proof
The proof is similar to that of Lemma 37, adapted to the different symmetries and transformation structure. The following algorithmic procedure can be directly implemented as a numerical algorithm.
Step 1. Let \(U_1\) and \(V_1\) be real orthogonal matrices such that
where \({{\tilde{K}}}_{11}\in {\mathbb {R}}^{{{\tilde{\ell }}}_1,{{\tilde{\ell }}}_1}\) is diagonal with positive diagonal elements. This transformation can be constructed via a singular value decomposition of K and a numerical rank decision.
Then, using the structure (14), it follows that \({{\tilde{K}}}_{11} {{\tilde{L}}}_{11}^\top =-{{\tilde{L}}}_{11} {{\tilde{K}}}_{11}^\top \) and \({{\tilde{L}}}_{21}=0\).
Step 2. Let \({{\tilde{U}}}_2\), \({{\tilde{V}}}_2\) be real orthogonal matrices such that
where \({{\hat{L}}}_{22}\in {\mathbb {R}}^{\ell _4,\ell _4}\) is diagonal with positive diagonal elements. This transformation can be constructed again via a singular value decomposition and a numerical rank decision. Set
and form
with \({{\hat{K}}}_{11}={{\tilde{K}}}_{11}\).
Step 3. Let
and form
where by the structure (14) now \({\check{L}}_{11}\) is skew-symmetric. Note that although we are working with nonorthogonal transformation matrices in this step, the numerical errors can be controlled since we are inverting diagonal matrices.
Step 4. Let
be the canonical form of the skew-symmetric matrix \({\check{L}}_{11}\) under congruence which can be obtained by first computing the spectral decomposition and then scaling the nonsingular diagonal parts by congruence to be \(\pm I\). This procedure is implemented in a numerically robust way in [5]. Furthermore, let
be a full rank decomposition partitioned accordingly, with \({\check{V}}_3\) real orthogonal. Then, set
and form
which is as claimed. \(\square \)
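Analogously to the symmetric case, the congruence transformation in Step 4 of this proof can be sketched numerically. Instead of the Cholesky-like factorization of [5], the following illustration uses the real Schur form, which for a skew-symmetric matrix is block diagonal with \(2\times 2\) blocks; scaling turns each nonsingular block into one with zero diagonal and off-diagonal entries \(\pm 1\) (equivalently, the form \(\left[ \begin{array}{cc} 0 &{}\quad I \\ -I &{}\quad 0 \end{array}\right] \) after a permutation). The tolerance is an ad hoc choice.

```python
import numpy as np
from scipy.linalg import schur

def skew_congruence_canonical(L, tol=1e-12):
    """Sketch of Step 4 for skew-symmetric L: return X invertible such that
    X.T @ L @ X is block diagonal with 2x2 blocks having zero diagonal and
    off-diagonal entries +-1 on the nonsingular part, and zeros elsewhere.
    (The numerically robust route is the Cholesky-like factorization of [5];
    this Schur-based variant is for illustration only.)"""
    T, Z = schur(L, output='real')            # L = Z @ T @ Z.T, T quasi-diagonal
    n = L.shape[0]
    d = np.ones(n)
    scale = np.max(np.abs(T)) if n else 1.0
    j = 0
    while j < n - 1:
        b = T[j, j + 1]
        if abs(b) > tol * scale:              # a nonsingular 2x2 block
            d[j] = d[j + 1] = 1.0 / np.sqrt(abs(b))
            j += 2
        else:                                 # a (numerically) zero eigenvalue
            j += 1
    X = Z * d                                 # scale the columns of Z
    return X
```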
Appendix C: Proof of Theorem 48
Proof
We again present a proof that can be implemented as a numerical algorithm.
Step 1. Let \(U_1\) and \(V_1\) be real orthogonal matrices such that
where \({{\tilde{E}}}_{11}\in {\mathbb {R}}^{{{\tilde{n}}}_1,{{\tilde{n}}}_1}\) is diagonal with positive diagonal elements. This transformation can be constructed via a singular value decomposition of E and a numerical rank decision.
Step 2. Let \({{\tilde{U}}}_2\), \({{\tilde{V}}}_2\) be real orthogonal matrices such that
where \({{\hat{A}}}_{22}\in {\mathbb {R}}^{n_3,n_3}\) is diagonal with positive diagonal elements. This transformation can be constructed again via a singular value decomposition and a numerical rank decision. Set
and form
with \({{\hat{E}}}_{11}={{\tilde{E}}}_{11}\).
The regularity of the pencil implies that \(A_{13}\) has full column rank \(n_1=n-{{\tilde{n}}}_1-n_3\) and that \(A_{31}\) has full row rank \(n_1=n-{{\tilde{n}}}_1-n_3\), because otherwise there would be a common right or left nullspace, respectively.
Step 3. Let \({{\tilde{U}}}_{31}\), \({{\tilde{V}}}_{31}\), \({{\tilde{U}}}_{13}\), \({{\tilde{V}}}_{13}\) be real orthogonal matrices of appropriate dimensions such that
where \({{\hat{A}}}_{41}\in {\mathbb {R}}^{n_1,n_1}\) and \({{\hat{A}}}_{14}\in {\mathbb {R}}^{n_1,n_1}\) are diagonal with positive diagonal elements. This transformation can be constructed again via a singular value decomposition and numerical rank decisions. Set
then \(U_3^\top U_2^\top U_1^\top E V_1 V_2 V_3\) and \(U_3^\top U_2^\top U_1^\top A V_1 V_2 V_3\) are as claimed in (55). The invertibility of \(E_{22}\) then follows from the assumption that the pencil has index at most two, see [25]. \(\square \)
Keywords
- Port-Hamiltonian system
- Dissipative Hamiltonian system
- Differential–algebraic equation
- Lagrange structure
- Dirac structure
- Matrix pencil