Abstract
Isospectral flows are abundant in mathematical physics; the rigid body, the the Toda lattice, the Brockett flow, the Heisenberg spin chain, and point vortex dynamics, to mention but a few. Their connection on the one hand with integrable systems and, on the other, with Lie–Poisson systems motivates the research for optimal numerical schemes to solve them. Several works about numerical methods to integrate isospectral flows have produced a large varieties of solutions to this problem. However, many of these algorithms are not intrinsically defined in the space where the equations take place and/or rely on computationally heavy transformations. In the literature, only few examples of numerical methods avoiding these issues are known, for instance, the spherical midpoint method on \({{\mathfrak {s}}}{{\mathfrak {o}}}(3)\). In this paper we introduce a new minimal-variable, second order, numerical integrator for isospectral flows intrinsically defined on quadratic Lie algebras and symmetric matrices. The algorithm is isospectral for general isospectral flows and Lie–Poisson preserving when the isospectral flow is Hamiltonian. The simplicity of the scheme, together with its structure-preserving properties, makes it a competitive alternative to those already present in literature.
Similar content being viewed by others
1 Introduction
The numerical integration of isospectral flows is a classical subject of study in numerical analysis [7, 10]. The interest in this problem is motivated by the numerical simulation of integrable systems, which are deeply related to isospectral flows via the Lax pair formulation. The quasi-periodic dynamics of integrable systems depends on the presence of a large number of first integrals. In the Lax pair formulation, some of these first integrals can be presented as a linear combination of the eigenvalues of the dynamical variable. Therefore, the preservation of the spectrum of the dynamical variable is a key feature of a numerical scheme applied to isospectral flows, in order to expect the right qualitative behaviour of the discrete approximate solutions [7]. Furthermore, as a special case, Lie–Poisson systems on the dual of reductive Lie algebras can be seen as isospectral flows [18]. A reductive Lie algebra is defined as the direct sum of a semisimple Lie algebra and an abelian Lie algebra. In this paper, the crucial property of real semisimple Lie algebras is that they can be represented as matrix Lie algbras which are closed under conjugate transpose [12, Prop. 6.28]. Moreover, any real matrix Lie algebra which is closed under conjugate transpose is reductive [12, Prop. 1.56].Footnote 1 Because of this, Lie–Poisson systems on the dual of a reductive Lie algebra can be equivalently seen as isospectral flows of the form (1.2) below. Lie–Poisson systems originate from the Poisson reduction of canonical Hamiltonian systems on the cotangent bundle of a Lie group [15]. Classical examples of Lie–Poisson systems are the rigid body [21], the heavy top and the incompressible Euler equations [1]. It is known from the backward error analysis, that Lie–Poisson preserving numerical schemes are superior to standard methods when applied to Lie–Poisson systems, especially for long-time simulations. As state-of-the-art, well established theories on numerical methods for both isospectral and Lie–Poisson systems exist in the literature (see for example [4, 10, 20]). For Lie–Poisson systems various symplectic algorithms have been developed (see [6] for a recent survey). However, few examples of numerical schemes that are intrinsically defined in the space where the dynamics takes place are known (e.g. [16]). This issue often causes a lack of efficiency for these schemes, which rely on group to algebra maps (e.g. the matrix exponential or the Cayley map) or a large number of unknowns. Before presenting our result, let us introduce the mathematical setup used throughout the paper.
Isospectral flows are first order ODEs of the form:
Here, \([\cdot ,\cdot ]\) denotes the matrix commutator, V is a linear subspace of the Lie algebra \({{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})\), and the function \(B:V\rightarrow {\mathfrak {n}}(V)\) maps V into its \({{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})\)-normalizer algebra \({\mathfrak {n}}(V)\).Footnote 2 The most studied case in literature is when \(V={{\,\mathrm{Sym}\,}}(n,{\mathbb {R}})\) is the space of symmetric real matrices, for which the normalizer is the Lie algebra of skew-symmetric real matrices \({\mathfrak {n}}(V) = {{\mathfrak {s}}}{{\mathfrak {o}}}(n)\). For Lie–Poisson systems on the dual of a reductive Lie algebra, we have that \(V={\mathfrak {g}}^*\) is the dual of a reductive Lie subalgebra of \({{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})\), for which the \({{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})-\)normalizer is \({\mathfrak {n}}(V) = {\mathfrak {g}}_0\oplus {\mathfrak {c}}({\mathfrak {g}})\) (see Definition 2 and Lemma 2, in Section 2). Throughout the paper, we identify \({\mathfrak {g}}^*\) with \({\mathfrak {g}}\), via the Frobenius inner product \(\langle A,B\rangle ={{\,\mathrm{Tr}\,}}(A^\dagger B)\), where \(^\dagger \) is the conjugate transpose. Via this identifications, Lie–Poisson systems on the dual of a reductive Lie algebra \({\mathfrak {g}}\) take the form:
for \(H:{\mathfrak {g}}\rightarrow {\mathbb {C}}\), a smooth function called Hamiltonian.
A class of numerical methods to solve (1.1)–(1.2), called Isospectral Symplectic Runge–Kutta (IsoSyRK), has been introduced in [18]. In the case of Lie–Poisson systems these schemes are symplectic. In this paper, we focus on the IsoSyRK associated to the implicit midpoint method, which turns out to have a specially nice structure. On the one hand, we provide a simpler proof (avoiding the use of the B-series theory) that for the implicit midpoint method, the respective IsoSyRK defined in [18] is isospectral for any \(B=B(W)\). On the other hand, we derive a simpler scheme reducing the number of unknowns up to minimality, revealing an intrinsic relation between the implicit midpoint method and the Cayley transform. The resulting integrator, although implicit, is second order, isospectral and symplectic when the isospectral flow is Lie–Poisson. The scheme is also intrinsically defined on V, for a large class of isospectral flows (see section 2 for details) and, when the isospectral flow is Lie–Poisson, it preserves the coadjoint orbits. Furthermore, only one evaluation of \(B(\cdot )\) and two matrix multiplications per iteration are required, making the scheme very efficient. In the last section of this paper, we show some numerical examples of our scheme and we compare it with the spherical midpoint method, which is another minimal-variable Lie–Poisson integrator on \({\mathbb {R}}^3\). Finally, we show how our scheme looks on \({{\mathfrak {s}}}{{\mathfrak {l}}}(2,{\mathbb {R}})\), defining what we call the hyperbolic midpoint method.
2 Main result
Let us consider an isospectral flow of the form (1.1). In order to present our result, we need a short detour on some concepts and basic results on Lie algebras. As already mentioned in section 1, for (1.1) to be well defined, we have to require \(B(\cdot )\) to take values in the \({{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})\)-normalizer algebra of V. We recall here the definition of the normalizer Lie group and normalizer Lie algebra:
Definition 1
Let G be a Lie group and \({\mathfrak {g}}\) its Lie algebra. Furthermore, let \(V\subseteq {\mathfrak {g}}\) be a linear subspace. Then the two sets
are respectively called the G-normalizer and the \({\mathfrak {g}}\)-normalizer of V. Notice that N(V) is a subgroup of G and \({\mathfrak {n}}(V)\) is a Lie subalgebra of \({\mathfrak {g}}\).
A related concept to normalizer is the centralizer Lie algebra.
Definition 2
Let \({\mathfrak {g}}\) be a Lie algebra and let \(V\subseteq {\mathfrak {g}}\) be a linear subspace. Then the set
is called the \({\mathfrak {g}}\)-centralizer of V. Notice that \({\mathfrak {c}}(V)\) is a Lie subalgebra of \({\mathfrak {g}}\).
We now recall the definition of a J-quadratic Lie algebra.
Definition 3
A Lie subalgebra \({\mathfrak {g}}\) of \({{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})\) is called J-quadratic Lie algebra if there exists an invertible matrix J, such that
Lemma 1
Let \({\mathfrak {g}}\) be a Lie subalgebra of \({{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})\) such that there exists a matrix J for which
Then \({\mathfrak {g}}={\mathfrak {g}}^\dagger \) implies \(J^2 \in {\mathfrak {c}}({\mathfrak {g}}) \). Moreover, if J is invertible and \(J^2 \in {\mathfrak {c}}({\mathfrak {g}}) \), then \({\mathfrak {g}}={\mathfrak {g}}^\dagger \).
Proof
Suppose \({\mathfrak {g}}={\mathfrak {g}}^\dagger \). Then, for all \(W\in {\mathfrak {g}}\), both the following identities hold:
The second of these implies that \(W J^2 + J W^\dagger J = 0\) and the first one \(J W^\dagger J + J^2 W = 0\). Subtracting these identities, we get \([W,J^2]=0\). Hence \(J^2 \in {\mathfrak {c}}({\mathfrak {g}}) \).
Now assume that \({\mathfrak {g}}\) is \(J-\)quadratic and \(J^2 \in {\mathfrak {c}}({\mathfrak {g}})\). Then, for all \(W\in {\mathfrak {g}}\) we have \(0 = JW^\dagger J + J^2 W = JW^\dagger J + WJ^2\) and hence \(W J + J W^\dagger = 0\), being J invertible. Therefore \({\mathfrak {g}}\) and \({\mathfrak {g}}^\dagger \) are defined by the same identity and they coincide. \(\square \)
Lemma 2
Let \({\mathfrak {g}}\) be a Lie subalgebra of \({{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})\) such that \({\mathfrak {g}}={\mathfrak {g}}^\dagger \). Then the \({{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})-\)normalizer of \({\mathfrak {g}}\) is \({\mathfrak {n}}({\mathfrak {g}})={\mathfrak {g}}_0\oplus {\mathfrak {c}}({\mathfrak {g}})\), where \({\mathfrak {g}}_0\) is the semisimple ideal of \({\mathfrak {g}}\) such that \({\mathfrak {g}}={\mathfrak {g}}_0 \oplus {\mathfrak {z}}({\mathfrak {g}})\), for \({\mathfrak {z}}({\mathfrak {g}})\) the center of \({\mathfrak {g}}\).Footnote 3
Proof
Let \({\mathfrak {g}}^\perp \) be the orthogonal complement of \({\mathfrak {g}}\) in \({{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})\) with respect to the Frobenius inner product. It is not hard to check that the following properties hold:
Hence, if \(A\in {\mathfrak {n}}({\mathfrak {g}})\cap {\mathfrak {g}}^\perp \) it must be \([{\mathfrak {g}},A]=0\). Therefore, A has to be in \({\mathfrak {c}}({\mathfrak {g}})\). Moreover, we have that the following inclusions always hold \({\mathfrak {z}}({\mathfrak {g}})\subset {\mathfrak {c}}({\mathfrak {g}})\subset {\mathfrak {n}}({\mathfrak {g}})\). Therefore, \({\mathfrak {n}}({\mathfrak {g}})={\mathfrak {g}}_0\oplus {\mathfrak {c}}({\mathfrak {g}})\), being \({\mathfrak {g}}_0\) centerless.\(\square \)
Notice that, since always \({\mathfrak {n}}({\mathfrak {g}})^\dagger = {\mathfrak {n}}({\mathfrak {g}}^\perp )\), we have that \({\mathfrak {n}}({\mathfrak {g}}^\perp )={\mathfrak {g}}_0\oplus {\mathfrak {c}}({\mathfrak {g}})^\dagger \), whenever \({\mathfrak {g}}={\mathfrak {g}}^\dagger \). In conclusion, we have the following corollary:
Corollary 1
Let \({\mathfrak {g}}\) be a J-quadratic Lie subalgebra of \({{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})\) such that \(J^2 \in {\mathfrak {c}}({\mathfrak {g}})\). Then the \({{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})-\)normalizer of \({\mathfrak {g}}\) and \({\mathfrak {g}}^\perp \) are respectively \({\mathfrak {n}}({\mathfrak {g}})={\mathfrak {g}}_0\oplus {\mathfrak {c}}({\mathfrak {g}})\) and \({\mathfrak {n}}({\mathfrak {g}}^\perp )={\mathfrak {g}}_0\oplus {\mathfrak {c}}({\mathfrak {g}})^\dagger \). In particular, under the identification of \({\mathfrak {g}}^*\) with \({\mathfrak {g}}\) via the Frobenius inner product, any Lie–Poisson system on \({\mathfrak {g}}^*\) can be written in the form (1.2).
Notice that by [12, Prop. 1.56] any \({\mathfrak {g}}\) such as in Corollary 1 is a reductive Lie algebra. As mention in the introduction, Lie–Poisson systems on the dual of reductive Lie algebras can be written in the form (1.2). In particular, this is true for Lie–Poisson systems on the dual of \({\mathfrak {g}}\oplus {\mathfrak {Z}}\), where \({\mathfrak {Z}}\) is an Abelian Lie algebra. We can now state the main result of this paper.
Theorem 1
Let \(W_k\in D\subset V\), for a domain D in the linear subspace \(V\subset {{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})\). Assume that the normalizer splits as \({\mathfrak {n}}(V)={\mathfrak {g}}_0\oplus {\mathfrak {c}}(V)\), for some Lie algebra \({\mathfrak {g}}_0\), which satisfies
for some constant matrix P. Furthermore, let \(B:D\subset {{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})\rightarrow {\mathfrak {n}}(V)\) be continuously differentiable. Then, for some \(h> 0\), there exists \({\widetilde{W}}\in V\) such that the numerical scheme \(W_k\rightarrow W_{k+1}\), implicitly defined by:
is a second order isospectral integrator for (1.1), for any \(k\ge 0\).Footnote 4. Moreover, when (1.1) is a Lie–Poisson system on \({{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})^*\) or on the dual of some J-quadratic Lie algebra \({\mathfrak {g}}\) such that \(J^2 \in {\mathfrak {c}}({\mathfrak {g}}) \) (or even on \({\mathfrak {g}}\oplus {\mathfrak {Z}}\), where \({\mathfrak {Z}}\) is an Abelian Lie algebra), then (2.3) is a Lie–Poisson integrator for (1.1) which preserves the coadjoint orbits in \({\mathfrak {g}}^*\).
Remark 1
The main contribution of Theorem 1, with respect to the results presented in [18], is that the scheme (2.3) is a minimal-variable isospectral (Lie–Poisson) integrator. Minimal-variable means here that the only unknown is \({\widetilde{W}}\), which lives in a vector space of dimension \(\dim (V)\). Hereafter, we will refer to the scheme (2.3) as the isospectral minimal midpoint. Moreover, the proof of the properties of (2.3), unlikely to [18, Cor. 1], does not require any application of the B-series theory and reveals a deep connection with the Cayley transform (see the proof of Lemma 4). The latter is a quite interesting fact because the Cayley transform arises as a necessary consequence of the use of the implicit midpoint scheme and not, as it has always appeared in literature, as a prescribed choice to construct a certain numerical scheme. We also emphasize that the condition (2.2) and the ones on J in Theorem 1 to get a Lie–Poisson integrator is slightly more general than the one considered in [18, Thm. 1–2].
We will give the proof of Theorem 1 in some lemmas.
Lemma 3
Let \(B:D\subset {{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})\rightarrow {{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})\) continuously differentiable in the domain D. Then, for every \(Y\in D\), there exist \({\overline{h}}>0\) such that the equation
has a solution \(X\in {{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})\) for any \(0\le h<{\overline{h}}\).
Proof
In order to get a solution to (2.4), we consider the function \(F_h(X) := Y + \frac{h}{2}[B(X),X] + \frac{h^2}{4}B(X)XB(X)\), such that (2.4) is equal to \(X = F_h(X)\). In order to determine \({\overline{h}}\), we consider the initial value problem:
Since \(\dfrac{\partial F_h(X)}{\partial h}\) is continuous, if we prove that the operator \((Id - D F_h(X))\) is continuous and invertible, then the Peano existence theorem will ensure a solution X(h), for any h in some interval \([0,{\overline{h}})\), for \({\overline{h}}>0\). Indeed, \(D F_h(X)=h G(D B (X),B(X),X,h)\), where G is polynomial in its variables and DB(X) is continuous by hypothesis. Hence, \(D F_h(X)\rightarrow 0\), for \(h\rightarrow 0\), therefore there exist some \({\overline{h}}>0\) such that \((Id - D F_h(X))\) is invertible, for any \(0\le h<{\overline{h}}\). \(\square \)
Lemma 4
Let \(B:D\subset {{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})\rightarrow {{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})\) continuously differentiable in the domain D and let \(0\le h<{\overline{h}}\) as in Lemma 3. Then, for every \(W_k\in D\), the numerical scheme \(W_k\rightarrow W_{k+1}\) is isospectral. Moreover, if \(D\subset V\), for D domain in the linear subspace \(V\subset {{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})\), and \(B:D\subset {{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})\rightarrow {\mathfrak {n}}(V)\), where \({\mathfrak {n}}(V)={\mathfrak {g}}_0\oplus {\mathfrak {c}}(V)\), for some Lie algebra \({\mathfrak {g}}_0\) which satisfies (2.2), then \(W_{k+1}\in V\). Furthermore, when V is a J-quadratic Lie algebra such that \(J^2 \in {\mathfrak {c}}({\mathfrak {g}}) \), then \(W_{k+1}\in {\mathscr {O}}_{W_k}\subset V\), where \({\mathscr {O}}_{W_k}\) is the coadjoint orbit of which \(W_k\) belongs.
Proof
Clearly \(W_{k+1} = (Id + \frac{h}{2}B({\widetilde{W}}))(Id - \frac{h}{2}B({\widetilde{W}}))^{-1}W_k(Id - \frac{h}{2}B({\widetilde{W}}))(Id + \frac{h}{2}B({\widetilde{W}}))^{-1}\) and hence \(W_{k+1}\) and \(W_k\) are similar. Furthermore, we notice that \((Id - \frac{h}{2}B({\widetilde{W}}))(Id + \frac{h}{2}B({\widetilde{W}}))^{-1}=\text{ Cay }(\frac{h}{2}B({\widetilde{W}}))\), where \(\text{ Cay }\) is the Cayley transform. Therefore we have that:
Assuming \({\mathfrak {n}}(V)={\mathfrak {g}}_0\oplus {\mathfrak {c}}(V)\), for some Lie algebra \({\mathfrak {g}}_0\) which satisfies (2.2), by [7, Lemma IV.8.7], \(\text{ Cay }\left( \frac{h}{2}B({\widetilde{W}})\right) \) is in the normalizer group N(V) of V and therefore \(W_{k+1}\) is in V as well. When \(V = {\mathfrak {g}}^*\) for \({\mathfrak {g}}\) a J-quadratic Lie algebra such that \(J^2 \in {\mathfrak {c}}({\mathfrak {g}}) \), the transformation (2.5) coincides with the coadjoint action of G on \({\mathfrak {g}}^*\), where G is the respective connected component to the identity of a Lie group with Lie algebra \({\mathfrak {g}}\). Therefore, (2.5) fixes the coadjoint orbits. \(\square \)
Remark 2
We point out that the equation (2.5) reveals an interesting relation between the Cayley transform and the implicit midpoint method. Indeed, the fact that the Cayley transform appears as a consequence of the reduction of the implicit midpoint from \(T^*GL(n,{\mathbb {C}})\) to \({{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})^*\) may indicate a deeper, perhaps canonical, relation between symplectic Runge–Kutta methods and the Cayley transform. The former are associated to conservation of quadratic first integrals of ODEs and the latter to transforming quadratic Lie algebras in quadratic Lie groups.
Corollary 2
Let the hypotesis of Lemma 3 hold. Then, if \({\mathfrak {n}}(V)={\mathfrak {g}}_0\oplus {\mathfrak {c}}(V)\) and \({\mathfrak {g}}_0\) is a compact Lie algebra, there exists \({\overline{h}}>0\) independent from k such that the scheme (2.3) has solution.
Proof
Since \({\mathfrak {g}}_0\) is a compact Lie algebra, the associate connected Lie group G is compact and therefore the orbits of the action (2.5) are compact. Hence, we can find a minimum \({\overline{h}}>0\) in Lemma 3 independent from the iteration \(k\ge 0\). \(\square \)
Lemma 5
Let \(B:D\subset {{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})\rightarrow {{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})\) continuously differentiable in the domain D and let \(0\le h<{\overline{h}}\) as in Lemma 3. Then, for every \(W_k\in D\), the numerical scheme \(W_k\rightarrow W_{k+1}\) in (2.3) descends from the method defined in [18, Def. 1] associated with the implicit midpoint method. In particular, if \(B = \nabla H^\dagger \) for some function \(H:D\subset {{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})^*\rightarrow {\mathbb {R}}\), then the method is Lie–Poisson in \({{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})^*\).
Proof
Consider the second order method as defined in [18, Def. 1] associated with the implicit midpoint method:
for \(k\ge 0\) with unknowns \(X,Y,K\in {{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})\). It is not hard to check that the following identities hold:
Applying these to (2.6) we get, after some computations, the scheme (2.3). In [18, Thm. 3], it has been proven that when \(B(W)=\nabla H(W)^\dagger \), for some functions \(H:{{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})^*\rightarrow {\mathbb {R}}\), the method is a Lie-Poisson integrator in \({{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})^*\). The scheme \(W_k\rightarrow W_{k+1}\) defined in (2.3) coincides with (2.6), but with the elimination of the intermediate variables X, Y, K. Therefore, (2.3) is a Lie–Poisson integrator. \(\square \)
Proof. (Theorem 1)
The proof simply follows from the lemmas. Lemma 3 says that the method (2.3) has solution for h sufficiently small. Under the assumptions of Lemma 4 proves the isospectrality of the scheme and its intrinsic restriction V, when \({\mathfrak {n}}(V)={\mathfrak {g}}_0\oplus {\mathfrak {c}}(V)\), for some Lie algebra \({\mathfrak {g}}_0\) which satisfies (2.2). Finally, in Lemma 5 is shown that the scheme descends from the isospectral midpoint method defined in [18] and therefore it is a second order Lie–Poisson integrator in \({{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})^*\), when (1.1) is. Putting together Lemma 4, Lemma 5 and Corollary 1, we have that when a V is the dual of a J-quadratic Lie algebra such that \(J^2 \in {\mathfrak {c}}({\mathfrak {g}})\) (possibly plus a commutative Lie algebra) and (1.1) is Lie–Poisson, the scheme (2.3) is a Lie–Poisson integrator on V, which preserves the coadjoint orbits. \(\square \)
Remark 3
We notice that the isospectral minimal midpoint (2.3) is somehow similar to the modified implicit midpoint rule introduced in [4]. However, in their scheme, \({\widetilde{W}}\) was set to be \(\frac{W_{k+1}+W_k}{2}\) which does not hold in general while solving the isospectral minimal midpoint (2.3). In fact, even though the scheme in [4] is isospectral, it is not symplectic.
Remark 4
The isospectral minimal midpoint (2.3) can be derived in a different way, as proposed in [6]. The construction there is more general and (2.3) can be recovered choosing as a retraction map the Cayley transform instead of the exponential map. This surprising connection opens up a question about a geometrical description of the methods proposed in [18, Def. 1]. Let us consider for any \(s=1,2,\ldots \) and a\(s\times s\) real matrix, a retraction map \(\tau _a:{\mathfrak {g}}^{\oplus s}\rightarrow G^{\times s}\). Then, similarly to [6], for each \(i=1,\ldots ,s\), it is implicitly defined by the differential of the retraction map \(d\tau _a\) a discrete map \(W_k\mapsto {\widetilde{W}}_i\ni {\mathfrak {g}}\). Finally, we define our integrator \(\varPsi _h:W_k\mapsto W_{k+1}\) as:
for some real numbers \(b_i\). The question is whether, for any s-stage symplectic Runge–Kutta method, there exists a retraction map \(\tau _a:{\mathfrak {g}}^{\oplus s}\rightarrow G^{\times s}\), such that any Lie–Poisson integrator defined in [18] can be obtained in this way.
3 Numerical examples
In this section we present some applications of the isospectral minimal midpoint (2.3) on isospectral flows and Lie–Poisson systems found in literature. We also compare our method in the case of \({{\mathfrak {s}}}{{\mathfrak {o}}}(3)\cong ({\mathbb {R}}^3,\times )\) with the spherical midpoint method, showing that the isospectral minimal midpoint (2.3) has the same computational cost. Finally, we show explicitly how the isospectral minimal midpoint (2.3) looks on \({{\mathfrak {s}}}{{\mathfrak {l}}}(2,{\mathbb {R}})\), applying it to the the point vortex equations on the hyperbolic plane. For the example considered in this section, we plot the variation of the first integrals of (1.1). As expected, we get exact conservation (up to round-off errors) of the Casimir functions and, when the flow is Hamiltonian, near conservation of the Hamiltonian.
3.1 The generalized rigid body
A classical example among Hamiltonian isospectral systems is the generalized rigid body. It represents a class of completely integrable systems on \({{\mathfrak {s}}}{{\mathfrak {o}}}(n)\), for every \(n\ge 1\), [14]. The Hamiltonian is given by
where \({\mathscr {I}}:{{\mathfrak {s}}}{{\mathfrak {o}}}(n)\rightarrow {{\mathfrak {s}}}{{\mathfrak {o}}}(n)\) is a symmetric positive definite inertia tensor. The equations of motion are then
We discretize this system for \(n=10\). Our implementation uses Newton iterations for the non-linear system.Footnote 5 The inertia tensor is given by
and we use the stepsize \(h=0.1\). The initial conditions are given by
As shown in Fig. 1, the Hamiltonian is nearly conserved and the Casimir functions are conserved up to the accuracy of the Newton iterations.
3.2 The Brockett flow
In this section we specify the isospectral minimal midpoint (2.3) for the Brockett flow, or double bracket flow:
where N, W are \(n\times n\) self-adjoint complex matrices. In [3], Brockett shows that for N diagonal matrix with distinct entries and \(W_0\) self-adjoint matrix with distinct eigenvalues, for \(t\rightarrow \infty \), W(t) converges exponentially fast to a diagonal matrix with the eigenvalues sorted accordingly to the order of the entries of N. In Fig. 2, we plot the eigenvalue variation for a randomly generatedFootnote 6 self-adjoint initial matrix \(W_0\) of dimension \(10\times 10\) and \(N=diag(1,2,\ldots ,10)\). The asymptotic stationarity of the eigenvalues variation reflects the fact that W is close to a diagonal matrix.
3.3 Lie–Poisson systems on \(({\varvec{\mathbb {R}}}^3,\times )\)
On \(({\mathbb {R}}^3,\times )\) the isospectral minimal midpoint (2.3) can be written as:
for \({\widetilde{w}},w_k,w_{k+1}\in {\mathbb {R}}^3\) and \(B:{\mathbb {R}}^3\rightarrow {\mathbb {R}}^3\). We want to compare the isospectral minimal midpoint (3.6) with another minimal-variable symplectic integrator on \({\mathbb {R}}^3\) introduced in [17], i.e. the spherical midpoint method:
Remark 5
Let \(B(\cdot )\) be orthogonal with respect to the rays, i.e. \(B(w)\cdot w=0\), for every \(w\in {\mathbb {R}}^3\). It is immediate to check that then (3.6) coincides with the classical midpoint scheme:
In [17] it is shown that also (3.7) coincides with (3.8) when \(B(\cdot )=\nabla H(\cdot )\), for some Hamiltonian function \(H:{\mathbb {R}}^3\rightarrow {\mathbb {R}}\) constant on the rays (which implies \(B(\cdot )\) to be orthogonal to the rays). In this case, (3.8) is known to be symplectic, whereas this fails for general Hamiltonian H. Therefore, (3.6) can be seen as the second order correction of (3.8) to be symplectic for any Hamiltonian H.
Let us now consider the two schemes (3.6) and (3.7). Both methods are implicit and therefore an implicit solver has to be used. Here we show that they exhibit the same computational cost. The example we consider is the Heisenberg spin chain on \({\mathbb {R}}^{3N}\). For this one has to extend both the isospectral minimal midpoint (3.6) and the spherical midpoint (3.7) to direct products of \({\mathbb {R}}^3\) (see [17, 18]).
The Heisenberg spin chain of micromagnetics is defined as:
where \(w_i\in {\mathbb {S}}^2\), for \(i=1,\ldots ,N\) and \(w_{N+1}=w_1\). It corresponds, up to scaling, to spatial discretization of the Landau–Lifschitz PDE:
for \(w:{\mathbb {S}}^1\rightarrow {\mathbb {S}}^2\) smooth. We notice that (3.9) is a Lie–Poisson system on \({\mathbb {R}}^{3N}\), with Hamiltonian:
Clearly, to get a good approximation of (3.10), N has to be large. In Fig. 3 we show the average time cost for time-step with respect to the number of spin particles for both the isospectral minimal midpoint (3.6) and the spherical midpoint (3.7). We conclude that the complexity grows similarly.
In terms of conservation properties, the two schemes exactly preserve the linear invariants and nearly conserve the the quadratic first integrals of the form \(\sum _{i,j}w_i^\dagger A w_j\). Moreover, the spherical midpoint has the advantage of exactly conserving all the quadratic first integrals of the form \(w_i^\dagger A w_i\), for some square matrix A, whereas the isospectral minimal midpoint (3.6) exactly conserves only the quadratic invariants \(w_i^\dagger w_i\). In Fig. 4 we compare the isospectral minimal midpoint (3.6) and the spherical midpoint (3.7) for the initial data given as am equispaced discretization in \(N=100\) points of the closed spherical curve:
for \(x\in [0,1]\). We can conclude from Fig. 4 that the spherical midpoint performs slightly better than the isospectral minimal midpoint (3.6). However, both the schemes show the desired conservation properties due to their symplecticity. Finally, in Fig. 5, we present the error diagram for the isospectral minimal midpoint (3.6) and the spherical midpoint (3.7). The curves in Fig. 5 show the expected second order of the schemes, with no significant difference.
3.4 Lie–Poisson systems on \(\varvec{{{\mathfrak {s}}}{{\mathfrak {l}}}}(2,{\varvec{\mathbb {R}}})^*\)
In this section we specify the isospectral minimal midpoint (2.3) in the case of the Lie algebra \({{\mathfrak {s}}}{{\mathfrak {l}}}(2,{\mathbb {R}})\), i.e. the \(2\times 2\) matrices with zero trace. The first observation is that \({{\mathfrak {s}}}{{\mathfrak {l}}}(2,{\mathbb {R}})\cong {{\mathfrak {s}}}{{\mathfrak {p}}}(2,{\mathbb {R}})\), which is a non-compact J-quadratic Lie algebra with respect to \(J=\begin{bmatrix} 0&\quad -1\\ 1&\quad 0 \end{bmatrix}.\) From this, it is straightforward to see that, when \(B:{{\mathfrak {g}}}{{\mathfrak {l}}}(2,{\mathbb {R}})\rightarrow {{\mathfrak {s}}}{{\mathfrak {l}}}(2,{\mathbb {R}})\) and \(W_k\in {{\mathfrak {s}}}{{\mathfrak {l}}}(2,{\mathbb {R}})\), \({\widetilde{W}}\) in (2.3) is also in \({{\mathfrak {s}}}{{\mathfrak {l}}}(2,{\mathbb {R}})\). We notice that this is no longer true for \({{\mathfrak {s}}}{{\mathfrak {l}}}(n,{\mathbb {R}})\), for \(n>2\). On the other hand, any element in \({{\mathfrak {s}}}{{\mathfrak {l}}}(2,{\mathbb {R}})\) can be written as a vector in \({\mathbb {R}}^3\), via the vector spaces isomorphism:
for any \(x,y,z\in {\mathbb {R}}\). In this coordinates, we can express the isospectral minimal midpoint (2.3) for \({{\mathfrak {s}}}{{\mathfrak {l}}}(2,{\mathbb {R}})\) as:
for \({\widetilde{w}},w_k,w_{k+1}\in {\mathbb {R}}^3\) and \(B:{\mathbb {R}}^3\rightarrow {\mathbb {R}}^3\), where:
and
for any \(a,b,c\in {\mathbb {R}}^3\).
We notice that the map (3.11) is a Lie algebra isomorphism from \({{\mathfrak {s}}}{{\mathfrak {l}}}(2,{\mathbb {R}})\) to \(({\mathbb {R}}^3,\times _L)\), where \(a\times _L b = 2L (a\times b)\), for any \(a,b\in {\mathbb {R}}^3\). Furthermore, the tensor L defines also the hyperbolic inner product \(a\cdot _L b = a\cdot (Lb)\), for any \(a,b\in {\mathbb {R}}^3\). We recall that the coadjoint orbits in \({{\mathfrak {s}}}{{\mathfrak {l}}}(2,{\mathbb {R}})^*\) are hyperboloids of the form \(\lbrace x^2+y^2-z^2=const\rbrace \). Hence, in analogy with the spherical midpoint method in [16], we will call (3.12) the hyperbolic midpoint method.
We illustrate an application of the hyperbolic midpoint method (3.12) on the point vortex equations on the hyperbolic plane (see for example [8, 9, 19]). The interest in this equations is motivated also by the studies on ideal hydrodynamics on hyperbolic spaces [5, 11], and in particular on the Euler equations, for which the point vortices can be seen as a finite dimensional approximation [2]. These equations are a Lie–Poisson system on \(({{\mathfrak {s}}}{{\mathfrak {l}}}(2,{\mathbb {R}})^*)^N\cong ({\mathbb {R}}^3,\times _L)^N\), with initial values on the coadjoint orbit determined by the equations \(w_i\cdot _L w_i=-1\), for \(i=1,2,\ldots ,N\). The Hamiltonian is given by:
The equations of motion are then:
Equations (3.14) constrain the vortices to move on the hyperboloid \(x^2+y^2-z^2=-1\). Furthermore, the \(SL(2,{\mathbb {R}})\) symmetry of (3.14) gives the conservation of the momentum vector:
Equations (3.14) and their \(SL(2,{\mathbb {R}})\)-relative equilibria have been studied in [8, 9, 19]. In particular, for two and three vortices most of the stability issues have been worked out in [19]. However, unlike to point vortex equations on a sphere [13], it is still unknown a general result on the stability of a relative equilibrium of point vortices on the hyperbolic plane.
Here we present the results of some numerical simulations of (3.14) with the hyperbolic midpoint method (3.12). In particular, we consider as initial values some of the relative equilibria found in [9, 19]. Let us take two different initial values in \(({\mathbb {R}}^3)^3\), \(w_0^1\) and \(w_0^2\), whose columns represent the initial position of three vortices with strengths respectively equal to \(\varGamma ^1\) and \(\varGamma ^2\), as defined here below:
The initial condition \(w_0^1\) is an equilateral relative equilibrium, whereas \(w_0^2\) is a geodesic relative equilibrium, as defined in [19]. The fist initial condition is known to be stable, whereas it is not known for the second one. However, from Fig. 6 we can see that both the initial conditions evolve in close trajectories, which proves numerically the stability for both of them.
We conclude showing in Fig. 7 the conservation properties for the hyperbolic midpoint method (3.12), concerning the first integrals (3.15) and (3.13).
Notes
In view of [12, Sec. I.8], there is no restriction in looking at real matrix Lie algebras, since the complex ones can be seen as real matrix Lie algebras of double dimension.
\({\mathfrak {n}}(V)\) is the largest Lie subalgebra of \({{\mathfrak {g}}}{{\mathfrak {l}}}(n,{\mathbb {C}})\) such that \([{\mathfrak {n}}(V),V]\subset V\) (see Definition 1 in Section 2).
Such a decomposition always exists being \({\mathfrak {g}}\) reductive by [12, Prop. 1.56].
Here Id denotes the \(n\times n\) identity matrix.
Fixed-point iteration could be used as well, no numerical issues have arisen in our experiments with it.
Throughout the whole paper, a randomly generated matrix is understood as a matrix whose components are defined as pseudorandom values drawn from the standard uniform distribution on the open interval (0, 1), generated via the MATLAB function rand.
References
Arnold, V.I., Khesin, B.A.: Topological Methods in Hydrodynamics. Applied Mathematical Sciences, vol. 125. Springer, New York (1998)
Beale, J.T., Majda, A.: Vortex methods. ii: higher order accuracy in two and three dimensions. Math. Comput. 39(159), 29–52 (1982)
Brockett, R.W.: Dynamical systems that sort lists, diagonalize matrices, and solve linear programming problems. Linear Algebra Appl. 146, 79–91 (1991)
Calvo, M.P., Iserles, A., Zanna, A.: Numerical solution of isospectral flows. Math. Comput., 66:1461–1486, 10 (1997). https://doi.org/10.1090/S0025-5718-97-00902-2
Chan, C.H., Czubak, M.: Non-uniqueness of the Leray–Hopf solutions in the hyperbolic setting. Dyn Partial Differ. Equ. 10(1), 43–77 (2013)
de Diego, D.M.: Lie-Poisson integrators. In: Rev. de la Academia Canaria de Ciencias, vol. XXX, pp. 9–30 (2018)
Hairer, E., Lubich, C., Wanner, G.: Geometric Numerical Integration, vol. 31. Springer, Berlin (2006)
Hwang, S., Kim, S.-C.: Point vortices on hyperbolic sphere. J. Geom. Phys. 59, 475–488 (2009)
Hwang, S., Kim, S.-C.: Relative equilibria of point vortices on the hyperbolic sphere. J. Geom. Phys. 54:063101/1–15 (2013)
Iserles, A., Munthe-Kaas, H., Nørsett, S., Zanna, A.: Lie-group methods. Acta Numerica 9, 215–365 (2000)
Khesin, B., Misiołek, G.: Euler and Navier–Stokes equations on the hyperbolic plane. Proc. Natl. Acad. Sci 109(45):18324–18326 (2012). ISSN 0027-8424. https://doi.org/10.1073/pnas.1210350109
Knapp, A.W.: Lie groups beyond an introduction. Birkhäuser (1996). ISBN 978-1-4757-2453-0
Laurent-Polz, F., Montaldi, J., Roberts, M.: Point vortices on the sphere: stability of symmetric relative equilibria. J. Geom. Mech. 3(4), 439–486 (2011)
Manakov, S.V.: Note on the integration of the Euler’s equations of the dynamics of a n-dimensional rigid body. Funct. Anal. Appl. 10, 328–329 (1976)
Marsden, J.E., Ratiu, T.S.: Introduction to Mechanics and Symmetry, volume vol. 17 of Texts in Applied Mathematics. Springer, New York (1999). ISBN 978-0-387-21792-5
McLachlan, R., Modin, K., Verdier, O.: Symplectic integrators for spin systems. Phys. Rev. E 89, 061301 (2014)
McLachlan, R., Modin, K., Verdier, O.: A minimal-variable symplectic integrator on spheres. Math. Comput. 86(307), 2325–2344 (2017)
Modin, K., Viviani, M.: Lie–Poisson methods for isospectral flows. Foundations of Computational Mathematics (2019). ISSN 1615-3383. https://doi.org/10.1007/s10208-019-09428-w
Montaldi, J., Nava-Gaxiola, C.: Point vortices on the hyperbolic plane. J. Math. Phys. 55, 03 (2014). https://doi.org/10.1063/1.4897210
Munthe-Kaas, H.: Lie–Butcher theory for Runge–Kutta methods. BIT Numer. Math. 35, 06 (1996). https://doi.org/10.1007/BF01739828
Poincaré, H.: Sur une forme nouvelle des équations de la mécanique. C. R. Acad. Sci. 132, 369–371 (1901)
Acknowledgements
Open access funding provided by Chalmers University of Technology. The author was supported by EU Horizon 2020 Grant No. 691070, by the Swedish Foundation for International Cooperation in Research and Higher Education (STINT) Grant No. PT2014-5823, by the Swedish Foundation for Strategic Research Grant ICA12-0052, and by the Swedish Research Council (VR) Grant No. 2017-05040. The author would like to thank Klas Modin for the support and the enlightening discussions during the work on this paper.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The author was supported by EU Horizon 2020 Grant No. 691070, by the Swedish Foundation for International Cooperation in Research and Higher Education (STINT) Grant No. PT2014-5823, by the Swedish Foundation for Strategic Research Grant ICA12-0052, and by the Swedish Research Council (VR) Grant No. 2017-05040.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Viviani, M. A minimal-variable symplectic method for isospectral flows. Bit Numer Math 60, 741–758 (2020). https://doi.org/10.1007/s10543-019-00792-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10543-019-00792-1
Keywords
- Isospectral flow
- Lie–Poisson integrator
- Symplectic Runge–Kutta methods
- Generalized rigid body
- Brockett flow
- Heisenberg spin chain
- Point-vortex on the hyperbolic plane