On the Lagrangian Structure of Integrable Hierarchies

Suris, Yuri B.; Vermeeren, Mats

doi:10.1007/978-3-662-50447-5_11

Yuri B. Suris² &
Mats Vermeeren²

29k Accesses
6 Citations
1 Altmetric

Abstract

We develop the concept of pluri-Lagrangian structures for integrable hierarchies. This is a continuous counterpart of the pluri-Lagrangian (or Lagrangian multiform) theory of integrable lattice systems. We derive the multi-time Euler Lagrange equations in their full generality for hierarchies of two-dimensional systems, and construct a pluri-Lagrangian formulation of the potential Korteweg-de Vries hierarchy.

You have full access to this open access chapter, Download chapter PDF

An Ablowitz-Ladik Integrable Lattice Hierarchy with Multiple Potentials

Article 19 May 2020

Second-order integrable Lagrangians and WDVV equations

Article Open access 26 April 2021

On a Lagrangian Reduction and a Deformation of Completely Integrable Systems

Article 21 April 2016

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 Introduction

In this paper, our departure point are two developments which have taken place in the field of discrete integrable systems in recent years.

Firstly, multi-dimensional consistency of lattice systems has been proposed as a notion of integrability [8, 15]. In retrospect, this notion can be seen as a discrete counterpart of the well-known fact that integrable systems never appear alone but are organized into integrable hierarchies. Based on the notion of multi-dimensional consistency, a classification of two-dimensional integrable lattice systems (the so called ABS list) was given in [1]. Moreover, for all equations of the ABS list, considered as equations on $\mathbb Z^2$, a variational interpretation was found in [1].
Secondly, the idea of the multi-dimensional consistency was blended with the variational formulation in [13], where it was shown that solutions of any ABS equation on any quad surface $\varSigma $ in $\mathbb {Z}^N$ are critical points of a certain action functional $\int _\varSigma \mathscr {L}$ obtained by integration of a suitable discrete Lagrangian two-form $\mathscr {L}$. Moreover, it was observed in [13] that the critical value of the action remains invariant under local changes of the underlying quad-surface, or, in other words, that the 2-form $\mathscr {L}$ is closed on solutions of quad-equations, and it was suggested to consider this as a defining feature of integrability. However, later research [10] revealed that $\mathscr {L}$ is closed not only on solutions of (non-variational) quad-equations, but also on general solutions of the corresponding Euler-Lagrange equations. Therefore, at least for discrete systems, the closedness condition is implicitly contained in the variational formulation.

A general theory of multi-time one-dimensional Lagrangian systems, both discrete and continuous, has been developed in [20]. A first attempt to formulate the theory for continuous two-dimensional systems was made in [21]. For such systems, a solution is a critical point of the action functional $\int _S \mathscr {L}$ on any two-dimensional surface S in $\mathbb {R}^N$, where $\mathscr {L}$ is a suitable differential two-form. The treatment in [21] was restricted to second order Lagrangians, i.e. to two-forms $\mathscr {L}$ that only depend on the second jet bundle. In the present work we will extend this to Lagrangians of any order.

As argued in [10], the unconventional idea to consider the action on arbitrary two-dimensional surfaces in the multi-dimensional space of independent variables has significant precursors. These include:

Theory of pluriharmonic functions and, more generally, of pluriharmonic maps [11, 17, 19]. By definition, a pluriharmonic function of several complex variables $f:\mathbb {C}^N\rightarrow \mathbb {R}$ minimizes the Dirichlet functional $E_\varGamma =\int _\varGamma |(f\circ \varGamma )_z|^2dz\wedge d\bar{z}$ along any holomorphic curve in its domain $\varGamma :\mathbb {C}\rightarrow \mathbb {C}^N$. Differential equations governing pluriharmonic functions,
$$ \frac{\partial ^2 f}{\partial z_i\partial \bar{z}_j}=0 \qquad \mathrm{for\;all}\qquad i,j=1,\ldots ,N, $$
are heavily overdetermined. Therefore it is not surprising that pluriharmonic functions (and maps) belong to the theory of integrable systems.

This motivates the term pluri-Lagrangian systems, which was proposed in [9, 10].
Baxter’s Z-invariance of solvable models of statistical mechanics [3, 4]. This concept is based on invariance of the partition functions of solvable models under elementary local transformations of the underlying planar graphs. It is well known (see, e.g., [7]) that one can identify planar graphs underlying these models with quad-surfaces in $\mathbb {Z}^N$. On the other hand, the classical mechanical analogue of the partition function is the action functional. This suggests the relation of Z-invariance to the concept of closedness of the Lagrangian 2-form, at least at the heuristic level. This relation has been made mathematically precise for a number of models, through the quasiclassical limit [5, 6].
The classical notion of variational symmetry, going back to the seminal work of E. Noether [16], has been shown to be directly related to the closedness of the Lagrangian form in the multi-time [21].

The main goal of this paper is two-fold: to derive the Euler Lagrange equations for two-dimensional pluri-Lagrangian problems of arbitrary order, and to state the (potential) KdV hierarchy as a pluri-Lagrangian system. We will also discuss the closedness of the Lagrangian two-form, which turns out to be related to the Hamiltonian theory of integrable hierarchies.

Note that the influential monograph [12], according to the foreword, is “about hierarchies of integrable equations rather than about individual equations”. However, its Lagrangian part (Chaps. 19, 20) only deals with individual equations. The reason for this is apparently the absence of the concept of pluri-Lagrangian systems. We hope that this paper opens up the way for a variational approach to integrable hierarchies.

2 Pluri-Lagrangian Systems

2.1 Definition

We place our discussion in the formalism of the variational bicomplex as presented in [12, Chap. 19] (and summarized, for the reader’s convenience, in Appendix A). Slightly different versions of this theory can be found in [18] and in [2].

Consider a vector bundle $X: \mathbb {R}^N \rightarrow \mathbb {R}$ and its nth jet bundle $J^n X$. Let $\mathscr {L}\in \mathscr {A}^{(0,d)}(J^n X)$ be a smooth horizontal d-form. In other words, $\mathscr {L}$ is a d-form on $\mathbb R^N$ whose coefficients depend on a function $u: \mathbb {R}^N \rightarrow \mathbb {R}$ and its partial derivatives up to order n. We call $\mathbb {R}^N$ the multi-time, u the field, and $\mathscr {L}$ the Lagrangian d-form. We will use coordinates $(t_1, \ldots , t_N)$ on $\mathbb {R}^N$. Recall that in the standard calculus of variations the Lagrangian is a volume form, so that $d=N$.

Definition 2.1

We say that the field u solves the pluri-Lagrangian problem for $\mathscr {L}$ if u is a critical point of the action $\int _S \mathscr {L}$ simultaneously for all d-dimensional surfaces S in $\mathbb {R}^N$. The equations describing this condition are called the multi-time Euler-Lagrange equations. We say that they form a pluri-Lagrangian system and that $\mathscr {L}$ is a pluri-Lagrangian structure for these equations.

To discuss critical points of a pluri-Lagrangian problem, consider the vertical derivative $\delta \mathscr {L}$ of the (0,d)-form $\mathscr {L}$ in the variational bicomplex, and a variation $\mathscr {V}$. Note that we consider variations $\mathscr {V}$ as vertical vector fields; such a restriction is justified by our interest, in the present paper, in autonomous systems only. Besides, in the context of discrete systems only vertical vector fields seem to possess a natural analogs. The criticality condition of the action, $\delta \int _S \mathscr {L}=0$, is described by the equation

$$\begin{aligned} \int _S {{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta \mathscr {L}= 0, \end{aligned}$$

(1)

which has to be satisfied for any variation $\mathscr {V}$ on S that vanishes at the boundary $\partial S$. Recall that ${{\mathrm{pr}}}\mathscr {V}$ is the nth jet prolongation of the vertical vector field $\mathscr {V}$, and that ${{\mathrm{\iota }}}$ stands for the contraction. One fundamental property of critical points can be established right from the outset.

Proposition 2.2

The exterior derivative $\mathrm{{d}}\mathscr {L}$ of the Lagrangian is constant on critical points u.

Proof

Consider a critical point u and a small $(d+1)$-dimensional ball B. Because $S := \partial B$ has no boundary, Eq. (1) is satisfied for any variation $\mathscr {V}$. Using Stokes’ theorem and the properties that $\delta \mathrm{{d}}+ \mathrm{{d}}\delta = 0$ and ${{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \mathrm{{d}}+ \mathrm{{d}}{{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} = 0$ (Propositions A.1 and A.4 in Appendix A), we find that

$$ 0 = \int _{\partial B} {{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta \mathscr {L}= \int _B \mathrm{{d}}({{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta \mathscr {L}) = -\int _B {{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \mathrm{{d}}(\delta \mathscr {L}) = \int _B {{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta (\mathrm{{d}}\mathscr {L}). $$

Since this holds for any ball B it follows that ${{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta (\mathrm{{d}}\mathscr {L}) = 0$ for any variation $\mathscr {V}$ of a critical point u. Therefore, $\delta (\mathrm{{d}}\mathscr {L})=0$, so that $\mathrm{{d}}\mathscr {L}$ is constant on critical points u. Note that here we silently assume that the space of critical points is connected. It would be difficult to justify this property in any generality, but it is usually clear in applications, where the critical points are solutions of certain well-posed systems of partial differential equations. $\square $

We will take a closer look at the property $\mathrm{{d}}\mathscr {L}=\mathrm{const}$ in Sect. 6, when we discuss the link with Hamiltonian theory. It will be shown that vanishing of this constant, i.e., closedness of $\mathscr {L}$ on critical points, is related to integrability of the multi-time Euler-Lagrange equations.

2.2 Approximation by Stepped Surfaces

For computations, we will use the multi-index notation for partial derivatives. For any multi-index $I=(i_1,\ldots ,i_N)$ we set

$$ u_I = \frac{\partial ^{|I|} u}{(\partial t_1)^{i_1} \ldots (\partial t_N)^{i_N}}, $$

where $|I| = i_1 + \ldots + i_N$. The notations Ik and $Ik^\alpha $ will represent the multi-indices $(i_1,\ldots ,i_k + 1, \ldots i_N)$ and $(i_1,\ldots ,i_k + \alpha , \ldots i_N)$ respectively. When convenient we will also use the notations $It_k$ and $It_k^\alpha $ for these multi-indices. We will write $k \not \in I$ if $i_k = 0$ and $k \in I$ if $i_k > 0$. We will denote by ${{\mathrm{D}}}_i$ or ${{\mathrm{D}}}_{t_i}$ the total derivative with respect to coordinate direction $t_i$,

$$ {{\mathrm{D}}}_i := {{\mathrm{D}}}_{t_i} := \sum _I u_{I i} \frac{\partial {}}{\partial {u_I}} $$

and by ${{\mathrm{D}}}_I := {{\mathrm{D}}}_{t_1}^{i_1} \ldots {{\mathrm{D}}}_{t_N}^{i_N}$ the corresponding higher order derivatives.

Our main general result is the derivation of the multi-time Euler-Lagrange equations for two-dimensional surfaces ($d = 2$). That will allow us to study the KdV hierarchy as a pluri-Lagrangian system. However, it is instructive to first derive the multi-time Euler-Lagrange equations for curves ($d = 1$).

The key technical result used to derive multi-time Euler-Lagrange equations is the observation that it suffices to consider a very specific type of surface.

Definition 2.3

A stepped d-surface is a d-surface that is a finite union of coordinate d-surfaces. A coordinate d-surface of the direction $(i_1, \ldots , i_d)$ is a d-surface lying in an affine d-plane $\{(t_1,\ldots ,t_N) \mid t_j = c_j\; \; \mathrm{for} \;\; j \ne i_1,\ldots ,i_d\}$.

Lemma 2.4

If the action is stationary on any stepped surface, then it is stationary on any smooth surface.

The proof of this Lemma can be found in appendix B.

2.3 Multi-time Euler-Lagrange Equations for Curves

Theorem 2.5

Consider a Lagrangian 1-form $\mathscr {L} = \sum _{i=1}^N L_i \,\mathrm{{d}}t_i$. The multi-time Euler-Lagrange equations for curves are:

$$\begin{aligned}&\frac{\delta _{i} {L_i}}{\delta {u_I}} = 0 \qquad \forall I \not \ni i, \end{aligned}$$

(2)

$$\begin{aligned}&\frac{\delta _{i} {L_i}}{\delta {u_{Ii}}} = \frac{\delta _{j} {L_j}}{\delta {u_{Ij}}} \qquad \forall I, \end{aligned}$$

(3)

where i and j are distinct, and the following notation is used for the variational derivative corresponding to the coordinate direction i:

$$ \frac{\delta _{i} {L_i}}{\delta {u_{I}}} = \sum _{\alpha \ge 0} (-1)^\alpha {{\mathrm{D}}}_i^\alpha \frac{\partial {L_i}}{\partial {u_{I i^\alpha }}} = \frac{\partial {L_i}}{\partial {u_I}} - {{\mathrm{D}}}_i \frac{\partial {L_i}}{\partial {u_{I i}}} + {{\mathrm{D}}}_i^2 \frac{\partial {L_i}}{\partial {u_{I i^2}}} - \ldots . $$

Remark 2.6

In the special case that $\mathscr {L}$ only depends on the first jet bundle, system (2)–(3) reduces to the equations found in [20]:

$$\begin{aligned}&\frac{\delta _{i} {L_i}}{\delta {u}} = 0 \quad \Leftrightarrow \quad \frac{\partial L_i}{\partial u}- {{\mathrm{D}}}_i \frac{\partial L_i}{\partial u_i} =0, \\&\frac{\delta _{i} {L_i}}{\delta {u_j}} = 0 \quad \Leftrightarrow \quad \frac{\partial L_i}{\partial u_j}=0\quad \mathrm{for}\quad i\ne j,\\&\frac{\delta _{i} {L_i}}{\delta {u_{i}}} = \frac{\delta _{j} {L_j}}{\delta {u_{j}}} \quad \Leftrightarrow \quad \frac{\partial L_i}{\partial u_i}=\frac{\partial L_j}{\partial u_j} \quad \mathrm{for}\quad i\ne j. \end{aligned}$$

Proof

(of Theorem 2.5) According to Lemma 2.4, it is sufficient to look at a general L-shaped curve $S = S_i \cup S_j$, where $S_i$ is a line segment of the coordinate direction i and $S_j$ is a line segment of the coordinate direction j. Denote the cusp by $p := S_i \cap S_j$. We orient the curve such that $S_i$ induces the positive orientation on the point p and $S_j$ the negative orientation. There are four cases, depending on how the L-shape is rotated. They are depicted in Fig. 1. To each case we associate a pair $(\varepsilon _i, \varepsilon _j) \in \{-1,+1\}^2$, where the positive value is taken if the respective piece of curve is oriented in the coordinate direction, and negative if it is oriented opposite to the coordinate direction.

The variation of the action is

$$\begin{aligned} \int _{S} {{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta \mathscr {L}&= \varepsilon _i \int _{S_i} ({{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta L_i) \,\mathrm{{d}}t_i + \varepsilon _j \int _{S_j} ({{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta L_j) \,\mathrm{{d}}t_j \\&= \varepsilon _i \int _{S_i} \sum _{I} \frac{\partial {L_i}}{\partial {u_I}} \delta u_I(\mathscr {V}) \,\mathrm{{d}}t_i + \varepsilon _j \int _{S_j} \sum _{I} \frac{\partial {L_j}}{\partial {u_I}} \delta u_I(\mathscr {V}) \,\mathrm{{d}}t_j. \end{aligned}$$

Note that these sums are actually finite. Indeed, since $\mathscr {L}$ depends on the nth jet bundle all terms with $|I| := i_1 + \ldots + i_N > n$ vanish.

Now we expand the sum in the first of the integrals and perform integration by parts.

$$\begin{aligned} \varepsilon _i&\int _{S_i} ({{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta L_i) \,\mathrm{{d}}t_i \\&= \varepsilon _i \int _{S_i} \sum _{I \not \ni i} \bigg ( \frac{\partial {L_i}}{\partial {u_I}} \delta u_I(\mathscr {V}) + \frac{\partial {L_i}}{\partial {u_{I i}}} \delta u_{I i}(\mathscr {V}) + \frac{\partial {L_i}}{\partial {u_{I i^2}}} \delta u_{I i^2}(\mathscr {V}) + \ldots \bigg ) \mathrm{{d}}t_i\\&= \varepsilon _i \int _{S_i} \sum _{I \not \ni i} \bigg (\frac{\partial {L_i}}{\partial {u_I}} - {{\mathrm{D}}}_i \frac{\partial {L_i}}{\partial {u_{I i}}} + {{\mathrm{D}}}_i^2 \frac{\partial {L_i}}{\partial {u_{I i^2}}} - {{\mathrm{D}}}_i^3 \frac{\partial {L_i}}{\partial {u_{I i^3}}} + \ldots \bigg ) \delta u_I(\mathscr {V}) \, \mathrm{{d}}t_i\\&\quad + \sum _{I \not \ni i} \bigg ( \frac{\partial {L_i}}{\partial {u_{I i}}} \delta u_I(\mathscr {V}) + \frac{\partial {L_i}}{\partial {u_{I i^2}}} \delta u_{I i}(\mathscr {V}) - {{\mathrm{D}}}_i \frac{\partial {L_i}}{\partial {u_{I i^2}}} \delta u_{I}(\mathscr {V}) \\&\qquad \quad + \frac{\partial {L_i}}{\partial {u_{I i^3}}} \delta u_{I i^2}(\mathscr {V}) - {{\mathrm{D}}}_i \frac{\partial {L_i}}{\partial {u_{I i^3}}}(\mathscr {V}) \delta u_{I i}(\mathscr {V}) + {{\mathrm{D}}}_i^2 \frac{\partial {L_i}}{\partial {u_{I i^3}}} \delta u_{I}(\mathscr {V}) + \ldots \bigg ) \bigg |_p. \end{aligned}$$

Using the language of variational derivatives, this reads

$$\begin{aligned} \varepsilon _i \int _{S_i} ({{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta L_i) \,\mathrm{{d}}t_i&= \varepsilon _i \int _{S_i} \sum _{I \not \ni i} \frac{\delta _{i} {L_i}}{\delta {u_I}} \delta u_I(\mathscr {V}) \,\mathrm{{d}}t_i \\&\qquad + \sum _{I \not \ni i} \bigg ( \frac{\delta _{i} {L_i}}{\delta {u_{I i}}} \delta u_I(\mathscr {V}) + \frac{\delta _{i} {L_i}}{\delta {u_{I i^2}}} \delta u_{I i}(\mathscr {V}) + \ldots \bigg ) \bigg |_p\\&= \varepsilon _i \int _{S_i} \sum _{I \not \ni i} \frac{\delta _{i} {L_i}}{\delta {u_I}} \delta u_I(\mathscr {V}) \,\mathrm{{d}}t_i + \sum _{I}\bigg ( \frac{\delta _{i} {L_i}}{\delta {u_{I i}}} \delta u_I(\mathscr {V}) \bigg )\bigg |_p. \end{aligned}$$

The other piece, $S_j$, contributes

$$ \varepsilon _j \int _{S_j} {{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta L_j \,\mathrm{{d}}t_j = \varepsilon _j \int _{S_j} \sum _{I \not \ni j} \frac{\delta _{j} {L_j}}{\delta {u_I}} \delta u_I(\mathscr {V}) \,\mathrm{{d}}t_j - \sum _{I} \bigg ( \frac{\delta _{j} {L_j}}{\delta {u_{I j}}} \delta u_I(\mathscr {V}) \bigg )\bigg |_p, $$

where the minus sign comes from the fact that $S_j$ induces negative orientation on the point p. Summing the two contributions, we find

$$\begin{aligned} \int _{S} {{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta \mathscr {L}&= \varepsilon _i \int _{S_i} \sum _{I \not \ni i} \frac{\delta _{i} {L_i}}{\delta {u_I}} \delta u_I(\mathscr {V}) \,\mathrm{{d}}t_i + \varepsilon _j \int _{S_j} \sum _{I \not \ni j} \frac{\delta _{j} {L_j}}{\delta {u_I}} \delta u_I(\mathscr {V}) \,\mathrm{{d}}t_j \nonumber \\&\qquad + \sum _{I} \bigg ( \frac{\delta _{i} {L_i}}{\delta {u_{I i}}} \delta u_I(\mathscr {V}) - \frac{\delta _{j} {L_j}}{\delta {u_{I j}}} \delta u_I(\mathscr {V}) \bigg )\bigg |_p. \end{aligned}$$

(4)

Now require that the variation (4) of the action is zero for any variation $\mathscr {V}$. If we consider variations that vanish on $S_j$, then we find for every multi-index I which does not contain i that

$$ \frac{\delta _{i} {L_i}}{\delta {u_I}} = 0. $$

Given this equation, and its analogue for the index j, only the last term remains in the right hand side of Eq. (4). Considering variations around the cusp p we find for every multi-index I that

$$ \frac{\delta _{i} {L_i}}{\delta {u_{Ii}}} = \frac{\delta _{j} {L_j}}{\delta {u_{Ij}}}. $$

It is clear these equations combined are also sufficient for the action to be critical. $\square $

2.4 Multi-time Euler-Lagrange Equations for Two-Dimensional Surfaces

The two-dimensional case ($d = 2$) covers many known integrable hierarchies, including the potential KdV hierarchy which we will discuss in detail later on. We consider a Lagrangian two-form $\mathscr {L} = \sum _{i < j} L_{ij} \,\mathrm{{d}}t_i \wedge \mathrm{{d}}t_j$ and we will use the notational convention $L_{ji} = - L_{ij}$.

Theorem 2.7

The multi-time Euler-Lagrange equations for two-dimensional surfaces are

$$\begin{aligned}&\frac{\delta _{ij} {L_{ij}}}{\delta {u_I}} =0,&\forall I \not \ni i,j, \end{aligned}$$

(5)

$$\begin{aligned}&\frac{\delta _{ij} {L_{ij}}}{\delta {u_{I j}}} = \frac{\delta _{ik} {L_{ik}}}{\delta {u_{I k}}}&\forall I \not \ni i, \end{aligned}$$

(6)

$$\begin{aligned}&\frac{\delta _{ij} {L_{ij}}}{\delta {u_{I i j}}} + \frac{\delta _{jk} {L_{jk}}}{\delta {u_{I j k}}} + \frac{\delta _{ki} {L_{ki}}}{\delta {u_{I k i}}} = 0&\forall I, \end{aligned}$$

(7)

where i, j and k are distinct, and the following notation is used for the variational derivative corresponding to the coordinate directions i, j:

$$ \frac{\delta _{ij} {L_{ij}}}{\delta {u_{I}}} := \sum _{\alpha ,\beta \ge 0} (-1)^{\alpha +\beta } {{\mathrm{D}}}_i^\alpha {{\mathrm{D}}}_j^\beta \frac{\partial {L_{ij}}}{\partial {u_{I i^\alpha j^\beta }}}. $$

Remark 2.8

In the special case that $\mathscr {L}$ only depends on the second jet bundle, this system reduces to the equations stated in [21].

Before proceeding with the proof of Theorem 2.7, we introduce some terminology and prove a lemma. A two-dimensional stepped surface consisting of q flat pieces intersecting at some point p is called a q-flower around p, the flat pieces are called its petals. If the action is stationary on every q-flower, it is stationary on any stepped surface. By Lemma 2.4 the action will then be stationary on any surface. The following Lemma shows that it is sufficient to consider 3-flowers.

Lemma 2.9

If the action is stationary on every 3-flower, then it is stationary on every q-flower for any $q > 3$.

Proof

Let F be a q-flower. Denote its petals corresponding to coordinate directions $(t_{i_1},t_{i_2})$, $(t_{i_2},t_{i_3})$, $\ldots , (t_{i_q},t_{i_1})$ by $S_{12}$, $S_{23}, \ldots , S_{q1}$ respectively. Consider the 3-flower $F_{123} = S_{12} \cup S_{23} \cup S_{31}$, where $S_{31}$ is a petal in the coordinate direction $(t_{i_3},t_{i_1})$ such that $F_{123}$ is a flower around the same point as F. Similarly, define $F_{134}, \ldots , F_{1\,q-1\,q}$. Then (for any integrand)

$$\begin{aligned} \int _{F_{123}} + \int _{F_{134}}&+ \ldots + \int _{F_{1\,q-1\,q}} \\ =&\int _{S_{12}} + \int _{S_{23}} + \int _{S_{31}} + \int _{S_{13}} + \int _{S_{34}} + \int _{S_{41}} + \ldots + \int _{S_{1\,q-1}} + \int _{S_{q-1\,q}} + \int _{S_{q1}}. \end{aligned}$$

Here, $S_{21}$, $S_{32}$, ... are the petals $S_{12}$, $S_{23}$, ... but with opposite orientation (see Fig. 2). Therefore all terms where the index of S contains 1 cancel, except for the first and last, leaving

$$ \int _{F_{123}} + \ldots + \int _{F_{1\,q-1\,q}} = \int _{S_{12}} + \int _{S_{23}} + \int _{S_{34}} + \ldots + \int _{S_{q-1\,q}} + \int _{S_{q1}} = \int _F. $$

By assumption the action is stationary on every 3-flower, so

$$ \int _F {{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta \mathscr {L}= \int _{F_{123}} {{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta \mathscr {L}+ \ldots + \int _{F_{1\,q-1\,q}} {{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta \mathscr {L}= 0. $$

$\square $

Proof

(of Theorem 2.7) Consider a 3-flower $S = S_{ij} \cup S_{jk} \cup S_{ki}$ around the point $p = S_{ij} \cap S_{jk} \cap S_{ki}$. Denote its interior edges by

$$\partial S_i:= S_{ij} \cap S_{ki} ,\qquad \partial S_j := S_{jk} \cap S_{ij} ,\qquad \partial S_k := S_{ki} \cap S_{jk}.$$

On $\partial S_i$, $\partial S_j$ and $\partial S_k$ we choose the orientations that induce negative orientation on p. We consider the case where these orientations correspond to the coordinate directions, as in Fig. 3. The cases where one or more of these orientations are opposite to the corresponding coordinate direction (see Fig. 4) can be treated analogously and yield the same result.

We choose the orientation on the petals in such a way that the orientations of $\partial S_i$, $\partial S_j$ and $\partial S_k$ are induced by $S_{ij}$, $S_{jk}$ and $S_{ki}$ respectively. Then the orientations of $\partial S_i$, $\partial S_j$ and $\partial S_k$ are the opposite of those induced by $S_{ki}$, $S_{ij}$ and $S_{jk}$ respectively (see Fig. 3).

We will calculate

$$\begin{aligned} \int _{S} {{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta \mathscr {L} = \int _{S_{ij}} {{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta \mathscr {L} + \int _{S_{jk}} {{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta \mathscr {L} + \int _{S_{ki}} {{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta \mathscr {L} \end{aligned}$$

(8)

and require it to be zero for any variation $\mathscr {V}$ which vanishes on the (outer) boundary of S. This will give us the multi-time Euler-Lagrange equations.

For the first term of Eq. (8) we find

$$\begin{aligned} \int _{S_{ij}} {{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta \mathscr {L}&=\int _{S_{ij}} \sum _I \frac{\partial L_{ij}}{\partial u_{I}} \delta u_{I}(\mathscr {V}) \, \mathrm{{d}}t_i \wedge \mathrm{{d}}t_j \\&= \int _{S_{ij}} \sum _{I \not \ni i,j} \sum _{\lambda ,\mu \ge 0} \frac{\partial L_{ij}}{\partial u_{I i^\lambda j^\mu }} \delta u_{I i^\lambda j^\mu }(\mathscr {V}) \, \mathrm{{d}}t_i \wedge \mathrm{{d}}t_j. \end{aligned}$$

First we perform integration by parts with respect to $t_i$ as many times as possible.

$$\begin{aligned} \int _{S_{ij}} {{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta \mathscr {L}&=\int _{S_{ij}} \sum _{I \not \ni i,j} \sum _{\lambda ,\mu \ge 0} (-1)^\lambda {{\mathrm{D}}}_i^\lambda \frac{\partial L_{ij}}{\partial u_{I i^\lambda j^\mu }} \delta u_{I j^\mu }(\mathscr {V}) \, \mathrm{{d}}t_i \wedge \mathrm{{d}}t_j \\&\qquad - \int _{\partial S_j} \sum _{I \not \ni i,j} \sum _{\lambda ,\mu \ge 0} \sum _{\pi = 0}^{\lambda -1} (-1)^\pi {{\mathrm{D}}}_i^\pi \frac{\partial L_{ij}}{\partial u_{I i^\lambda j^\mu }} \delta u_{I i^{\lambda - \pi -1}j^\mu }(\mathscr {V}) \, \mathrm{{d}}t_j. \end{aligned}$$

Next integrate by parts with respect to $t_j$ as many times as possible.

$$\begin{aligned} \int _{S_{ij}} {{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta \mathscr {L}&=\int _{S_{ij}} \sum _{I \not \ni i,j} \sum _{\lambda ,\mu \ge 0} (-1)^{\lambda +\mu } {{\mathrm{D}}}_i^\lambda {{\mathrm{D}}}_j^\mu \frac{\partial L_{ij}}{\partial u_{I i^\lambda j^\mu }} \delta u_{I}(\mathscr {V}) \, \mathrm{{d}}t_i \wedge \mathrm{{d}}t_j \end{aligned}$$

(9)

$$\begin{aligned}&\quad - \int _{\partial S_j} \sum _{I \not \ni i,j} \sum _{\lambda ,\mu \ge 0} \sum _{\pi = 0}^{\lambda -1} (-1)^\pi {{\mathrm{D}}}_i^\pi \frac{\partial L_{ij}}{\partial u_{I i^\lambda j^\mu }} \delta u_{I i^{\lambda - \pi -1}j^\mu }(\mathscr {V}) \, \mathrm{{d}}t_j \end{aligned}$$

(10)

$$\begin{aligned}&\quad - \int _{\partial S_i} \sum _{I \not \ni i,j} \sum _{\lambda ,\mu \ge 0} \sum _{\rho = 0}^{\mu -1} (-1)^{\lambda + \rho } {{\mathrm{D}}}_i^\lambda {{\mathrm{D}}}_j^\rho \frac{\partial L_{ij}}{\partial u_{I i^\lambda j^\mu }} \delta u_{I j^{\mu - \rho -1}}(\mathscr {V}) \, \mathrm{{d}}t_i. \end{aligned}$$

(11)

The signs of (10) and (11) are due to the choice of orientations (see Fig. 3). We can rewrite the integral (9) as

$$\int _{S_{ij}} \sum _{I \not \ni i,j} \frac{\delta _{ij} {L_{ij}}}{\delta {u_I}} \delta u_I(\mathscr {V}) \, \mathrm{{d}}t_i \wedge \mathrm{{d}}t_j.$$

The last integral (11) takes a similar form if we replace the index $\mu $ by $\beta = \mu - \rho - 1$.

$$\begin{aligned} -\int _{\partial S_i} \sum _{I \not \ni i,j}&\sum _{\lambda ,\mu \ge 0} \sum _{\rho = 0}^{\mu -1} (-1)^{\lambda + \rho } {{\mathrm{D}}}_i^\lambda {{\mathrm{D}}}_j^\rho \frac{\partial L_{ij}}{\partial u_{I i^\lambda j^\mu }} \delta u_{I j^{\mu - \rho -1}}(\mathscr {V}) \, \mathrm{{d}}t_i \\&= -\int _{\partial S_i} \sum _{I \not \ni i,j} \sum _{\beta ,\lambda ,\rho \ge 0} (-1)^{\lambda + \rho } {{\mathrm{D}}}_i^\lambda {{\mathrm{D}}}_j^\rho \frac{\partial L_{ij}}{\partial u_{I i^\lambda j^{\beta + \rho + 1}}} \delta u_{I j^\beta }(\mathscr {V}) \, \mathrm{{d}}t_i \\&= -\int _{\partial S_i} \sum _{I \not \ni i,j} \sum _{\beta \ge 0} \frac{\delta _{ij} {L_{ij}}}{\delta {u_{I j^{\beta +1}}}} \delta u_{I j^\beta }(\mathscr {V}) \,\mathrm{{d}}t_i. \end{aligned}$$

To write the other boundary integral (10) in this form we first perform integration by parts.

$$\begin{aligned} - \int _{\partial S_j}&\sum _{I \not \ni i,j} \sum _{\lambda ,\mu \ge 0} \sum _{\pi = 0}^{\lambda -1} (-1)^\pi {{\mathrm{D}}}_i^\pi \frac{\partial L_{ij}}{\partial u_{I i^\lambda j^\mu }} \delta u_{I i^{\lambda - \pi -1} j^\mu }(\mathscr {V}) \, \mathrm{{d}}t_j \\&= - \int _{\partial S_j} \sum _{I \not \ni i,j} \sum _{\lambda ,\mu \ge 0} \sum _{\pi = 0}^{\lambda -1} (-1)^{\pi + \mu } {{\mathrm{D}}}_i^\pi {{\mathrm{D}}}_j^\mu \frac{\partial L_{ij}}{\partial u_{I i^\lambda j^\mu }} \delta u_{I i^{\lambda - \pi -1}}(\mathscr {V}) \, \mathrm{{d}}t_j \\&\qquad + \sum _{I \not \ni i,j} \sum _{\lambda ,\mu \ge 0} \sum _{\pi = 0}^{\lambda -1} \sum _{\rho = 0}^{\mu -1} (-1)^{\pi + \rho } \bigg ( {{\mathrm{D}}}_i^\pi {{\mathrm{D}}}_j^\rho \frac{\partial L_{ij}}{\partial u_{I i^\lambda j^\mu }} \delta u_{I i^{\lambda - \pi -1} j^{\mu - \rho -1}}(\mathscr {V}) \bigg ) \bigg |_p . \end{aligned}$$

Then we replace $\lambda $ by $\alpha = \lambda - \pi -1$ and in the last term $\mu $ by $\beta = \mu - \rho - 1$.

$$\begin{aligned} -&\int _{\partial S_j} \sum _{I \not \ni i,j} \sum _{\lambda ,\ge 0} \sum _{\pi = 0}^{\lambda -1} (-1)^\pi {{\mathrm{D}}}_i^\pi \frac{\partial L_{ij}}{\partial u_{I i^\lambda j^\mu }} \delta u_{I i^{\lambda - \pi -1} j^\mu }(\mathscr {V}) \, \mathrm{{d}}t_j \\&= - \int _{\partial S_j} \sum _{I \not \ni i,j} \sum _{\alpha ,\mu ,\pi \ge 0} (-1)^{\pi + \mu } {{\mathrm{D}}}_i^\pi {{\mathrm{D}}}_j^\mu \frac{\partial L_{ij}}{\partial u_{I i^{\alpha + \pi + 1} j^\mu }} \delta u_{I i^\alpha }(\mathscr {V}) \, \mathrm{{d}}t_j \\&\qquad + \sum _{I \not \ni i,j} \sum _{\alpha ,\beta ,\pi ,\rho \ge 0} \bigg ( (-1)^{\pi + \rho } {{\mathrm{D}}}_i^\pi {{\mathrm{D}}}_j^\rho \frac{\partial L_{ij}}{\partial u_{I i^{\alpha + \pi + 1} j^{\beta + \rho + 1}}} \delta u_{I i^\alpha j^\beta }(\mathscr {V}) \bigg ) \bigg |_p \\&= - \int _{\partial S_j} \sum _{I \not \ni i,j} \sum _{\alpha \ge 0} \frac{\delta _{ij} {L_{ij}}}{\delta {u_{I i^{\alpha + 1}}}} \delta u_{I i^\alpha }(\mathscr {V}) \, \mathrm{{d}}t_j + \sum _{I \not \ni i,j} \sum _{\alpha ,\beta \ge 0} \bigg ( \frac{\delta _{ij} {L_{ij}}}{\delta {u_{I i^{\alpha + 1} j^{\beta + 1}}}} \delta u_{I i^\alpha j^\beta }(\mathscr {V}) \bigg )\bigg |_p . \end{aligned}$$

Putting everything together we find

$$\begin{aligned} \int _{S_{ij}} {{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta \mathscr {L}&= \int _{S_{ij}} \sum _{I \not \ni i,j} \frac{\delta _{ij} {L_{ij}}}{\delta {u_I}} \delta u_I(\mathscr {V}) \, \mathrm{{d}}t_i \wedge \mathrm{{d}}t_j - \int _{\partial S_i} \sum _{I \not \ni i} \frac{\delta _{ij} {L_{ij}}}{\delta {u_{Ij}}} \delta u_{I}(\mathscr {V}) \mathrm{{d}}t_i \\&\qquad - \int _{\partial S_j} \sum _{I \not \ni j} \frac{\delta _{ij} {L_{ij}}}{\delta {u_{Ii}}} \delta u_{I}(\mathscr {V}) \, \mathrm{{d}}t_j + \bigg ( \sum _{I} \frac{\delta _{ij} {L_{ij}}}{\delta {u_{I i j}}} \delta u_{I}(\mathscr {V}) \bigg )\bigg |_p . \end{aligned}$$

Expressions for the integrals over $S_{jk}$ and $S_{ki}$ are found by cyclic permutation of the indices. Finally we obtain

$$\begin{aligned} \int _{S} {{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta \mathscr {L}&= \int _{S_{ij}} \sum _{I \not \ni i,j} \frac{\delta _{ij} {L_{ij}}}{\delta {u_I}} \delta u_I(\mathscr {V}) \, \mathrm{{d}}t_i \wedge \mathrm{{d}}t_j \nonumber \\&\qquad - \int _{\partial S_i} \bigg (\sum _{I \not \ni i} \frac{\delta _{ij} {L_{ij}}}{\delta {u_{I j}}} \delta u_{I}(\mathscr {V}) + \sum _{I \not \ni i} \frac{\delta _{ki} {L_{ki}}}{\delta {u_{I k}}} \delta u_{I}(\mathscr {V}) \bigg ) \, \mathrm{{d}}t_i \nonumber \\&\qquad + \sum _{I} \bigg ( \frac{\delta _{ij} {L_{ij}}}{\delta {u_{I i j}}} \delta u_{I}(\mathscr {V}) \bigg ) \bigg |_p \qquad + \text { cyclic permutations in } i,j,k. \end{aligned}$$

(12)

From this we can read off the multi-time Euler-Lagrange equations. $\square $

3 Pluri-Lagrangian Structure of the Sine-Gordon Equation

We borrow our first example of a pluri-Lagrangian system from [21].

The Sine-Gordon equation $u_{xy}=\sin u$ is the Euler-Lagrange equation for

$$L = \frac{1}{2}u_xu_y-\cos u.$$

Consider the vector field $\varphi \frac{\partial {}}{\partial {u}}$ with

$$\varphi =u_{xxx}+\frac{1}{2}u_x^3$$

and its prolongation ${{\mathrm{D}}}_\varphi := \sum _I \varphi _I \frac{\partial {}}{\partial {u_I}}.$ It is known that ${{\mathrm{D}}}_\varphi $ is a variational symmetry for the sine-Gordon equation [18, p. 336]. In particular, we have that

$$\begin{aligned} {{\mathrm{D}}}_\varphi L= {{\mathrm{D}}}_x N + {{\mathrm{D}}}_y M \end{aligned}$$

(13)

with

$$\begin{aligned} M&= \dfrac{1}{2}\varphi u_x-\dfrac{1}{8}u_x^4+\dfrac{1}{2}u_{xx}^2, \\ N&= \dfrac{1}{2}\varphi u_y -\dfrac{1}{2}u_x^2\cos u - u_{xx}(u_{xy}-\sin u). \end{aligned}$$

Now we introduce a new independent variable z corresponding to the “flow” of the generalized vector field ${{\mathrm{D}}}_\varphi $, i.e. $u_z = \varphi $. Consider simultaneous solutions of the Euler-Lagrange equation $\frac{\delta _{} {L}}{\delta {u}} = 0$ and of the flow $u_z = \varphi $ as functions of three independent variables x, y, z. Then Eq. (13) expresses the closedness of the two-form

$$\mathscr {L}= L\, dx \wedge dy - M\, dz \wedge dx - N\, dy\wedge dz.$$

The fact that $\mathrm{{d}}\mathscr {L}= 0$ on solutions is consistent with Proposition 2.2. Hence $\mathscr {L}$ is a reasonable candidate for a Lagrangian two-form.

Theorem 3.1

The multi-time Euler-Lagrange equations for the Lagrangian two-form

$$\mathscr {L}= L_{12}\ dx\wedge dy + L_{13}\ dx\wedge dz + L_{23}\ dy\wedge dz$$

with the components

$$\begin{aligned} L_{12}&= \frac{1}{2}u_xu_y-\cos u, \end{aligned}$$

(14)

$$\begin{aligned} L_{13}&= \frac{1}{2}u_xu_z-\dfrac{1}{8}u_x^4+\dfrac{1}{2}u_{xx}^2, \end{aligned}$$

(15)

$$\begin{aligned} L_{23}&= -\frac{1}{2}u_yu_z+\dfrac{1}{2}u_x^2\cos u+u_{xx}(u_{xy}-\sin u), \end{aligned}$$

(16)

consist of the sine-Gordon equation

$$u_{xy} = \sin u,$$

the modified KdV equation

$$u_z = u_{xxx} + \frac{1}{2} u_x^3,$$

and corollaries thereof. On solutions of either of these equations the two-form $\mathscr {L}$ is closed.

Proof

Let us calculate the multi-time Euler-Lagrange Eqs. (5)–(7) one by one:

The equation $\displaystyle \frac{\delta _{12} {L_{12}}}{\delta {u}} = 0$ yields .

For any $\alpha > 0$ the equation $\displaystyle \frac{\delta _{12} {L_{12}}}{\delta {u_{z^\alpha }}} = 0$ yields $0=0$.
The equation $\displaystyle \frac{\delta _{13} {L_{13}}}{\delta {u}} = 0$ yields .

For any $\alpha > 0$ the equation $\displaystyle \frac{\delta _{13} {L_{13}}}{\delta {u_{y^\alpha }}} = 0$ yields $0=0$.
The equation $\displaystyle \frac{\delta _{23} {L_{23}}}{\delta {u}} = 0$ yields .

The equation $\displaystyle \frac{\delta _{23} {L_{23}}}{\delta {u_x}} = 0$ yields . The equation $\displaystyle \frac{\delta _{23} {L_{23}}}{\delta {u_{xx}}} = 0$ yields .

For any $\alpha > 2$, the equation $\displaystyle \frac{\delta _{23} {L_{23}}}{\delta {u_{x^\alpha }}} = 0$ yields $0=0$.
The equation $\displaystyle \frac{\delta _{13} {L_{13}}}{\delta {u_x}} = \frac{\delta _{23} {L_{23}}}{\delta {u_y}}$ yields .

The equation $\displaystyle \frac{\delta _{13} {L_{13}}}{\delta {u_{xx}}} = \frac{\delta _{23} {L_{23}}}{\delta {u_{xy}}}$ yields $u_{xx} = u_{xx}$.

For any other I the equation $\displaystyle \frac{\delta _{13} {L_{13}}}{\delta {u_{Ix}}} = \frac{\delta _{23} {L_{23}}}{\delta {u_{Iy}}}$ yields $0=0$.
The equation $\displaystyle \frac{\delta _{12} {L_{12}}}{\delta {u_y}} = \frac{\delta _{13} {L_{13}}}{\delta {u_z}}$ yields $\frac{1}{2}u_x = \frac{1}{2}u_x$.

For any nonempty I, the equation $\displaystyle \frac{\delta _{12} {L_{12}}}{\delta {u_{Iy}}} = \frac{\delta _{13} {L_{13}}}{\delta {u_{Iz}}}$ yields $0=0$.
The equation $\displaystyle \frac{\delta _{12} {L_{12}}}{\delta {u_x}} = \frac{\delta _{23} {L_{32}}}{\delta {u_z}}$ yields $\frac{1}{2}u_y = \frac{1}{2}u_y$.

For any nonempty I, the equation $\displaystyle \frac{\delta _{12} {L_{12}}}{\delta {u_{Ix}}} = \frac{\delta _{23} {L_{32}}}{\delta {u_{Iz}}}$ yields $0=0$.
For any I the equation $\displaystyle \frac{\delta _{12} {L_{12}}}{\delta {u_{Ixy}}} + \frac{\delta _{23} {L_{23}}}{\delta {u_{Iyz}}} + \frac{\delta _{13} {L_{31}}}{\delta {u_{Izx}}} = 0$ yields $0=0$.

It remains to notice that all nontrivial equations in this list are corollaries of the equations $u_{xy} = \sin u$ and $u_z = u_{xxx} + \frac{1}{2} u_x^3$, derived by differentiation.

The closedness of $\mathscr {L}$ can be verified by direct calculation:

$$\begin{aligned} {{\mathrm{D}}}_z L_{12} - {{\mathrm{D}}}_y L_{13} + {{\mathrm{D}}}_x L_{23}&= \frac{1}{2} (u_{yz} u_x + u_{xz} u_y) + u_z \sin u \\&\qquad - \frac{1}{2} u_{yz} u_x - \frac{1}{2} u_z u_{xy} + \frac{1}{2} u_x^3 u_{xy} - u_{xx} u_{xxy} \\&\qquad - \frac{1}{2} u_{xz} u_y - \frac{1}{2} u_z u_{xy} + u_x u_{xx} \cos u - \frac{1}{2} u_x^3 \sin u \\&\qquad + u_{xxx} (u_{xy} - \sin u) + u_{xx} (u_{xxy} - u_x \cos u) \\&= - \left( u_z - \frac{1}{2} u_x^3 - u_{xxx}\right) (u_{xy} - \sin u). \end{aligned}$$

$\square $

Remark 3.2

The Sine-Gordon equation and the modified KdV equation are the simplest equations of their respective hierarchies. Furthermore, those hierarchies can be seen as the positive and negative parts of one single hierarchy that is infinite in both directions [14, Sect. 3c and 5k]. It seems likely that this whole hierarchy possesses a pluri-Lagrangian structure.

4 The KdV Hierarchy

Our second and the main example of a pluri-Lagrangian system will be the (potential) KdV hierarchy. This section gives an overview of the relevant known facts about KdV, mainly following Dickey [12, Sect. 3.7]. The next section will present its pluri-Lagrangian structure.

One way to introduce the Korteweg-de Vries (KdV) hierarchy is to consider a formal power series

$$ R = \sum _{k=0}^\infty r_k z^{-2k-1}, $$

with the coefficients $r_k=r_k[u]$ being polynomials of u and its partial derivatives with respect to x, satisfying the equation

$$\begin{aligned} R_{xxx} + 4uR_x + 2u_x R - z^2 R_x = 0. \end{aligned}$$

(17)

Multiplying this equation by R and integrating with respect to x we find

$$\begin{aligned} R R_{xx} - \frac{1}{2} R_x^2 + 2 \left( u - \frac{1}{4} z^2 \right) R^2 = C(z), \end{aligned}$$

(18)

where $C(z) = \sum _{k = 0}^\infty c_k z^{-2k}$ is a formal power series in $z^{-2}$, with coefficients $c_k$ being constants. Different choices of C(z) correspond to different normalizations of the KdV hierarchy. We take $C(z) = \frac{1}{8}$, i.e. $c_0 = \frac{1}{8}$ and $c_k = 0$ for $k >0$. The first few coefficients of the power series $R = r_0 z^{-1} + r_1 z^{-3} + r_2 z^{-5} + \ldots $ are

$$ r_0 = \frac{1}{2}, \quad r_1 = u, \quad r_2 = u_{xx} + 3 u^2, \quad r_{3} = u_{xxxx} + 10 u u_{xx} + 5 u_x^2 + 10 u^3. $$

The Korteweg-de Vries hierarchy is defined as follows.

Definition 4.1

The KdV hierarchy is the family of equations
$$ u_{t_k} = ( r_k[u] )_x. $$
Write $g_k[v] := r_k[v_x]$. The potential KdV (PKdV) hierarchy is the family of equations
$$ v_{t_k} = g_k[v] . $$
The differentiated potential KdV (DPKdV) hierarchy is the family of equations
$$ v_{x t_k} = ( g_k[v] )_x.$$

The right-hand sides of first few PKdV equations are

$$ g_1 = v_x, \quad g_2 = v_{xxx} + 3 v_x^2, \quad g_{3} = v_{xxxxx} + 10 v_x v_{xxx} + 5 v_{xx}^2 + 10 v_x^3. $$

Remark 4.2

The first KdV and PKdV equations, $u_{t_1} = u_x$, resp. $v_{t_1}=v_x$, allow us to identify x with $t_1$.

Proposition 4.3

The differential polynomials $r_k[u]$ satisfy

$$ \frac{\delta _{} {r_k}}{\delta {u}} = \left( 4 k - 2\right) r_{k-1}, $$

where $\frac{\delta _{} {}}{\delta {u}}$ is shorthand notation for $\frac{\delta _{1} {}}{\delta {u}}$.

A proof of this statement can be found in [12, 3.7.11–3.7.14].

Corollary 4.4

Set $h_k [v]:= \frac{1}{4 k + 2}g_{k+1}[v]$, then the differential polynomials $g_k$ and $h_k$ satisfy

$$\frac{\delta _{} {g_k}}{\delta {v_x}} = \left( 4 k - 2\right) g_{k-1} \qquad \text {and} \qquad \frac{\delta _{} {h_k}}{\delta {v_x}} = g_k.$$

Before we proceed, let us formulate a simple Lemma.

Lemma 4.5

For any multi-index I and for any differential polynomial f[v] we have:

$$ {{\mathrm{D}}}_{x} \left( \frac{\delta _{} {f}}{\delta {v_{Ix}}} \right) = \frac{\partial {f}}{\partial {v_I}} - \frac{\delta _{} {f}}{\delta {v_I}}.$$

Proof

By direct calculation:

$$\begin{aligned} {{\mathrm{D}}}_{x} \left( \frac{\delta _{} {f}}{\delta {v_{Ix}}} \right)&= {{\mathrm{D}}}_x \bigg ( \frac{\partial {f}}{\partial {v_{Ix}}} - {{\mathrm{D}}}_x \frac{\partial {f}}{\partial {v_{Ix^2}}} + {{\mathrm{D}}}_x^2 \frac{\partial {f}}{\partial {v_{Ix^3}}} - \ldots \bigg ) \\&= {{\mathrm{D}}}_x \frac{\partial {f}}{\partial {v_{Ix}}} - {{\mathrm{D}}}_x^2 \frac{\partial {f}}{\partial {v_{Ix^2}}} + {{\mathrm{D}}}_x^2 \frac{\partial {f}}{\partial {v_{Ix^3}}} - \ldots =\frac{\partial {f}}{\partial {v_I}} - \frac{\delta _{} {f}}{\delta {v_I}}. \end{aligned}$$

$\square $

We can now find Lagrangians for the the DPKdV equations.

Proposition 4.6

The DPKdV equations are Lagrangian, with the Lagrange functions

$$ L_k[v] = \frac{1}{2}v_x v_{t_k} - h_k[v]. $$

Proof

Since $h_k = \frac{1}{4k + 2}g_{k+1}$ does not depend on v directly, it follows from Lemma 4.5 and Corollary 4.4 that

$$ \frac{\delta _{} {L_k}}{\delta {v}} = -v_{t_k x} - \frac{\delta _{} {h_k}}{\delta {v}} = - v_{t_k x} + {{\mathrm{D}}}_x\frac{\delta _{} {h_k}}{\delta {v_x}} = -v_{t_k x} + (g_k)_x. $$

$\square $

5 Pluri-Lagrangian Structure of PKdV Hierarchy

Since the individual KdV and PKdV equations are evolutionary (not variational), it seems not very plausible that they could have a pluri-Lagrangian structure. However, it turns out that the PKdV hierarchy as a whole is pluri-Lagrangian. Let us stress that this structure is only visible if one considers several PKdV equations simultaneously and not individually. We consider a finite-dimensional multi-time $\mathbb {R}^N$ parametrized by $t_1,t_2,\ldots , t_N$ supporting the first N flows of the PKdV hierarchy. Recall that the first PKdV equation reads $v_{t_1} = v_{x}$, which allows us to identify $t_1$ with x.

The formulation of the main result involves certain differential polynomials introduced in the following statement.

Lemma 5.1

There exist differential polynomials $b_{ij}[v]$ depending on v and $v_{x^\alpha }$, $\alpha >0$, such that
$$\begin{aligned} {{\mathrm{D}}}_x(g_i) g_j = {{\mathrm{D}}}_x(b_{ij}). \end{aligned}$$
(19)
These polynomials satisfy
$$\begin{aligned} b_{ij} + b_{ji} = g_i g_j. \end{aligned}$$
(20)
The differential polynomials $a_{ij}[v]$ (depending on $v_{x^\alpha }$ and $v_{x^\alpha t_j}$, $\alpha \ge 0$) defined by
$$\begin{aligned} a_{ij} := v_{t_j} \frac{\delta _{1} {h_i}}{\delta {v_x}} + v_{x t_j} \frac{\delta _{1} {h_i}}{\delta {v_{xx}}} + v_{xx t_j} \frac{\delta _{1} {h_i}}{\delta {v_{xxx}}} + \ldots \end{aligned}$$
(21)
satisfy
$$\begin{aligned} {{\mathrm{D}}}_j(h_i) + {{\mathrm{D}}}_{x}(g_i) v_{t_j} = {{\mathrm{D}}}_x(a_{ij}). \end{aligned}$$
(22)

Proof

The existence of polynomials $b_{ij}$ is shown in [12, 3.7.9]. Since

$${{\mathrm{D}}}_x ( b_{ij} + b_{ji} ) = {{\mathrm{D}}}_x (g_i) g_j + g_i {{\mathrm{D}}}_x (g_j) = {{\mathrm{D}}}_x(g_i g_j),$$

and since neither $b_{ij} + b_{ji}$ nor $g_i g_j$ contain constant terms, Eq. (20) follows. The last claim is a straightforward calculation using Lemma 4.5:

$$\begin{aligned} {{\mathrm{D}}}_x(a_{ij})&= {{\mathrm{D}}}_x \left( v_{t_j} \frac{\delta _{1} {h_i}}{\delta {v_x}} + v_{x t_j} \frac{\delta _{1} {h_i}}{\delta {v_{xx}}} + v_{xx t_j} \frac{\delta _{1} {h_i}}{\delta {v_{xxx}}} + \ldots \right) \\&= v_{x t_j} \frac{\delta _{1} {h_i}}{\delta {v_x}} + v_{xx t_j} \frac{\delta _{1} {h_i}}{\delta {v_{xx}}} + v_{xxx t_j} \frac{\delta _{1} {h_i}}{\delta {v_{xxx}}} + \ldots \\&\quad + v_{t_j} {{\mathrm{D}}}_x \left( \frac{\delta _{1} {h_i}}{\delta {v_x}}\right) + v_{x t_j} {{\mathrm{D}}}_x \left( \frac{\delta _{1} {h_i}}{\delta {v_{xx}}}\right) + v_{xx t_j} {{\mathrm{D}}}_x \left( \frac{\delta _{1} {h_i}}{\delta {v_{xxx}}}\right) + \ldots \\&= v_{x t_j} \frac{\delta _{1} {h_i}}{\delta {v_x}} + v_{xx t_j} \frac{\delta _{1} {h_i}}{\delta {v_{xx}}} + v_{xxx t_j} \frac{\delta _{1} {h_i}}{\delta {v_{xxx}}} + \ldots \\&\quad - v_{t_j} \frac{\delta _{1} {h_i}}{\delta {v}} + v_{t_j} \frac{\partial {h_i}}{\partial {v}} - v_{x t_j} \frac{\delta _{1} {h_i}}{\delta {v_{x}}} + v_{x t_j} \frac{\partial {h_i}}{\partial {v_{x}}} - v_{xx t_j} \frac{\delta _{1} {h_i}}{\delta {v_{xx}}} + v_{xx t_j} \frac{\partial {h_i}}{\partial {v_{xx}}} - \ldots \\&= {{\mathrm{D}}}_j h_i - v_{t_j} \frac{\delta _{1} {h_i}}{\delta {v}} = {{\mathrm{D}}}_j h_i + {{\mathrm{D}}}_x(g_i) v_{t_j} . \end{aligned}$$

$\square $

Now we are in a position to give a pluri-Lagrangian formulation of the PKdV hierarchy.

Theorem 5.2

The multi-time Euler-Lagrange equations for the Lagrangian two-form $\mathscr {L}= \sum _{i<j} L_{ij} \,\mathrm{{d}}t_i \wedge \mathrm{{d}}t_j$, with coefficients given by

$$\begin{aligned} L_{1i} := L_i = \frac{1}{2} v_x v_{t_i} - h_i \end{aligned}$$

(23)

and

$$\begin{aligned} L_{ij} := \frac{1}{2} (v_{t_i} g_j - v_{t_j} g_i) + (a_{ij} - a_{ji}) - \frac{1}{2}(b_{ij} - b_{ji}) \quad \text {for} \quad j>i>1 \end{aligned}$$

(24)

are the first $N-1$ nontrivial PKdV equations

$$v_{t_2} = g_2, \quad v_{t_3} = g_3, \quad \ldots \quad v_{t_N} = g_N,$$

and equations that follow from these by differentiation.

5.1 Variational Symmetries and the Pluri-Lagrangian Form

Before proving Theorem 5.2, let us give an heuristic derivation of expression (24) for $L_{ij}$. The ansatz is that different flows of the PKdV hierarchy should be variational symmetries of each other. (We are grateful to V. Adler who proposed this derivation to us in a private communication.)

Fix two distinct integers $i,j \in \{ 2, 3, \ldots , N \}$. Consider the the ith DPKdV equation, which is nothing but the conventional two-dimensional variational system generated in the $(x,t_i)$-plane by the Lagrange function

$$ L_{1i} [v]= \frac{1}{2} v_x v_{t_i} - h_i[v]. $$

Consider the evolutionary equation $v_{t_j} = g_j[v]$, i.e., the jth PKdV equation, and the corresponding generalized vector field

$$ {{\mathrm{D}}}_{g_j} := \sum _{I\not \ni j} ({{\mathrm{D}}}_I g_j)\frac{\partial }{\partial v_I}. $$

We want to show that ${{\mathrm{D}}}_{g_j}$ is a variational symmetry of $L_{1i}$. For this end, we look for $L_{ij}$ such that

$$\begin{aligned} {{\mathrm{D}}}_{g_j} (L_{1i}) - {{\mathrm{D}}}_i \left( L_{1j}^{(g_j)}\right) + {{\mathrm{D}}}_x (L_{ij})= 0. \end{aligned}$$

(25)

Here, $L_{1j}^{(g_j)}$ is the Lagrangian defined by (23) but with $v_{t_j}$ replaced by $g_j$:

$$ L_{1j}^{(g_j)} := \frac{1}{2} v_x g_j - h_j. $$

We have:

$$\begin{aligned}&{{\mathrm{D}}}_i \left( L_{1j}^{(g_j)}\right) = \frac{1}{2} v_{t_i x} g_j + \frac{1}{2} v_x (g_j)_{t_i} - {{\mathrm{D}}}_i (h_j),\\&{{\mathrm{D}}}_{g_j} (L_{1i}) = \frac{1}{2} (g_j)_x v_{t_i} + \frac{1}{2} v_x (g_j)_{t_i} - {{\mathrm{D}}}_{g_j}(h_i). \end{aligned}$$

Upon using (22) and (19), and introducing the polynomial

$$ a_{ij}^{(g_j)} := g_j \frac{\delta _{1} {h_i}}{\delta {v_x}} + (g_j)_x \frac{\delta _{1} {h_i}}{\delta {v_{xx}}} + (g_j)_{xx} \frac{\delta _{1} {h_i}}{\delta {v_{xxx}}} + \ldots $$

obtained from $a_{ij}$ through the replacement of $v_{t_j}$ by $g_j$, we find:

$$\begin{aligned} {{\mathrm{D}}}_i \left( L_{1j}^{(g_j)}\right) - {{\mathrm{D}}}_{g_j} (L_{1i})&= \frac{1}{2} v_{t_i x} g_j - \frac{1}{2} (g_j)_x v_{t_i} - {{\mathrm{D}}}_i (h_j) + {{\mathrm{D}}}_{g_j}(h_i) \\&= \frac{1}{2} v_{t_i x} g_j - \frac{1}{2} (g_j)_x v_{t_i} - (a_{ji})_x + (g_j)_x v_{t_i} + \left( a_{ij}^{(g_j)} \right) _x - (g_i)_x g_j \\&= \frac{1}{2} v_{t_i x} g_j + \frac{1}{2} (g_j)_x v_{t_i} - \left( a_{ji} - a_{ij}^{(g_j)} \right) _x - (g_i)_x g_j \\&= \frac{1}{2} (v_{t_i}g_j)_x + \left( a_{ij}^{(g_j)} - a_{ji} \right) _x - (b_{ij})_x. \end{aligned}$$

We denote the antiderivative with respect to x of this quantity by

$$L_{ij}^{(i)} := \frac{1}{2} v_{t_i}g_j + \left( a_{ij}^{(g_j)} - a_{ji} \right) - b_{ij}.$$

The analogous calculation with coordinates x and $t_j$ yields

$$ {{\mathrm{D}}}_{g_i} (L_{1j}) - {{\mathrm{D}}}_j\left( L_{1i}^{(g_i)}\right) = -\frac{1}{2} (v_{t_j} g_i)_x + \left( a_{ij} - a_{ji}^{(g_i)} \right) _x + (b_{ji})_x.$$

We denote its antiderivative by

$$L_{ij}^{(j)} := -\frac{1}{2} v_{t_j} g_i + \left( a_{ij} - a_{ji}^{(g_i)} \right) + b_{ji}. $$

Now we look for a differential polynomial $L_{ij}[v]$ depending on the partial derivatives of v with respect to x, $t_i$ and $t_j$ that reduces to $L_{ij}^{(i)}$ and to $L_{ij}^{(j)}$ after the substitutions $v_{t_j} = g_j$ and $v_{t_i} = g_i$, respectively. It turns out that there is a one-parameter family of such functions, given by

$$ L_{ij} = c v_{t_i}v_{t_j} + (a_{ij} - a_{ji}) + \left( \frac{1}{2} - c \right) v_{t_i} g_j - \left( \frac{1}{2} + c \right) v_{t_j} g_i + \frac{1}{2}(b_{ji} - b_{ij}) + c g_i g_j$$

for $c \in \mathbb {R}$. Checking this is a straightforward calculation using Eq. (20). Our theory does not depend in any essential way on the choice of $L_{ij}$ within this family. For aesthetic reasons we chose $c=0$, which gives us Eq. (24).

Remark 5.3

We could also take $\mathscr {L}$ to be the c-linear part of the form we have just obtained, i.e. $\mathscr {L}=\sum _{1<i<j} (v_{t_i} - g_i)(v_{t_j} - g_j) \,\mathrm{{d}}t_i \wedge \mathrm{{d}}t_j$. One can think of this as choosing $c = \infty $. Such a two-form $\mathscr {L}$ can be considered for any family of evolutionary equations $v_{t_i}=g_i[v]$. However, due to the vanishing components $L_{1i}$, this form $\mathscr {L}$ has no relation to the classical variational formulation of the individual differential equations $v_{xt_i}=(g_i)_x$.

Eventually, Eq. (25) leads to the following closedness property.

Proposition 5.4

The two-form $\mathscr {L}= \sum _{i<j} L_{ij} \,\mathrm{{d}}t_i \wedge \mathrm{{d}}t_j$, with coefficients given by (23) and (24), is closed as soon as v solves all but one of the PKdV equations $v_{t_2} = g_2, \ldots , v_{t_N} = g_N$.

Proof

We use the notation

$$\begin{aligned} \mathrm{{d}}\mathscr {L}= \sum _{i< j < k} M_{ijk} \, \mathrm{{d}}t_i \wedge \mathrm{{d}}t_j \wedge \mathrm{{d}}t_k, \quad M_{ijk}={{\mathrm{D}}}_k L_{ij} - {{\mathrm{D}}}_j L_{ik} + {{\mathrm{D}}}_i L_{jk} \end{aligned}$$

(26)

We start by showing that $M_{1jk}={{\mathrm{D}}}_k L_{1j} - {{\mathrm{D}}}_j L_{1k} + {{\mathrm{D}}}_{x} L_{jk}$ vanishes as soon as either $v_{t_j} = g_j$ or $v_{t_k} = g_k$ is satisfied. Indeed, we have:

$$\begin{aligned} M_{1jk}&= {{\mathrm{D}}}_k L_{1j} - {{\mathrm{D}}}_j L_{1k} + {{\mathrm{D}}}_x L_{jk} \nonumber \\&= \frac{1}{2} v_{t_j t_k} v_x + \frac{1}{2} v_{t_j} v_{x t_k} - {{\mathrm{D}}}_k h_j - \frac{1}{2} v_{t_j t_k} v_x - \frac{1}{2} v_{t_k} v_{x t_j} + {{\mathrm{D}}}_j h_k \nonumber \\&\qquad + \frac{1}{2} \left( v_{x t_j} g_k + v_{t_j} {{\mathrm{D}}}_x g_k - v_{x t_k} g_j - v_{t_k} {{\mathrm{D}}}_x g_j \right) \nonumber \\&\qquad + {{\mathrm{D}}}_k h_j + v_{t_k} {{\mathrm{D}}}_x g_j - {{\mathrm{D}}}_j h_k - v_{t_j} {{\mathrm{D}}}_x g_k - \frac{1}{2}( g_k {{\mathrm{D}}}_x g_j - g_j {{\mathrm{D}}}_x g_k ) \nonumber \\&= \frac{1}{2} \Big (v_{t_j} v_{x t_k} - v_{t_k} v_{x t_j} + v_{x t_j} g_k - v_{t_j} {{\mathrm{D}}}_x g_k \nonumber \\&\qquad \quad - v_{x t_k} g_j + v_{t_k} {{\mathrm{D}}}_x g_j - g_k {{\mathrm{D}}}_x g_j + g_j {{\mathrm{D}}}_x g_k \Big ) \nonumber \\&= \frac{1}{2} \left( v_{t_j} - g_j \right) {{\mathrm{D}}}_x \left( v_{t_k} - g_k \right) - \frac{1}{2} \left( v_{t_k} - g_k \right) {{\mathrm{D}}}_x \left( v_{t_j} - g_j \right) . \end{aligned}$$

(27)

For the case $i,j,k > 1$, we assume without loss of generality that $v_{t_i} = g_i$ and $v_{t_j} = g_j$ are satisfied. We do not assume that $v_{t_k}=g_k$ holds, and correspondingly we do not make any identification involving $v_{t_k}$, $v_{x t_k}$, .... Using Eq. (27), we find:

$$\begin{aligned} {{\mathrm{D}}}_x M_{ijk}&= {{\mathrm{D}}}_x \left( {{\mathrm{D}}}_k L_{ij} - {{\mathrm{D}}}_j L_{ik} + {{\mathrm{D}}}_i L_{jk} \right) \\&= {{\mathrm{D}}}_k \left( {{\mathrm{D}}}_i L_{1j} - {{\mathrm{D}}}_j L_{1i} \right) - {{\mathrm{D}}}_j \Big ( {{\mathrm{D}}}_i L_{1k} - {{\mathrm{D}}}_k L_{1i} \Big ) + {{\mathrm{D}}}_i \left( {{\mathrm{D}}}_j L_{1k} - {{\mathrm{D}}}_k L_{1j} \right) \\&= 0. \end{aligned}$$

Since these polynomials do not contain constant terms, it follows that

$$ {{\mathrm{D}}}_k L_{ij} - {{\mathrm{D}}}_j L_{ik} + {{\mathrm{D}}}_i L_{jk} = 0. $$

$\square $

Remark 5.5

Assuming that the statement of Theorem 5.2 holds true, one can easily prove a somewhat weaker claim than Proposition 5.4, namely that the two-form $\mathscr {L}$ is closed on simultaneous solutions of all the PKdV equations. Indeed, by Proposition 2.2, $\mathrm{{d}}\mathscr {L}$ is constant on solutions of the multi-time Euler-Lagrange equations $v_{t_i} = g_i$. Vanishing of this constant follows from the fact that $\mathrm{{d}}\mathscr {L}=0$ on the trivial solution $v \equiv 0$.

5.2 The Multi-time Euler-Lagrange Equations

Proof

(of Theorem 5.2) We check all multi-time Euler-Lagrange Eqs. (5)–(7) individually. If $N>3$, we fix $k> j> i >1$. If $N=3$, we take $j=3$, $i=2$, and in the following ignore all equations containing k. We use the convention $L_{ji} = - L_{ij}$, etc.

Equations (7)

The equations
$$\frac{\delta _{1i} {L_{1i}}}{\delta {v_{Ix t_i}}} + \frac{\delta _{ij} {L_{ij}}}{\delta {v_{It_i t_j}}} + \frac{\delta _{1j} {L_{j1}}}{\delta {v_{It_j x}}} = 0$$
and
$$\frac{\delta _{ij} {L_{ij}}}{\delta {v_{It_i t_j}}} + \frac{\delta _{jk} {L_{jk}}}{\delta {v_{It_j t_k}}} + \frac{\delta _{ki} {L_{ki}}}{\delta {v_{It_k t_i}}}= 0$$
are trivial because all terms vanish.

Equations (6)

The equation
$$\frac{\delta _{1i} {L_{1i}}}{\delta {v_{x}}} = \frac{\delta _{ij} {L_{ji}}}{\delta {v_{t_j}}}$$
yields
$$\begin{aligned} \frac{1}{2} v_{t_i} - \frac{\delta _{1i} {h_i}}{\delta {v_x}}&= \frac{1}{2} g_i - \frac{\delta _{ij} {a_{ij}}}{\delta {v_{t_j}}} \\&= \frac{1}{2} g_i - \frac{\delta _{ij} {}}{\delta {v_{t_j}}} \left( v_{t_j} \frac{\delta _{1} {h_i}}{\delta {v_x}} + v_{t_j x} \frac{\delta _{1} {h_i}}{\delta {v_{xx}}} + v_{t_j xx} \frac{\delta _{1} {h_i}}{\delta {v_{xxx}}} +\ldots \right) \\&= \frac{1}{2} g_i - \frac{\delta _{1} {h_i}}{\delta {v_x}}. \end{aligned}$$
This simplifies to the PKdV equation
$$\begin{aligned} v_{t_i} = g_i. \end{aligned}$$
(28)
For $\alpha > 0$, the equation
$$\frac{\delta _{1i} {L_{1i}}}{\delta {v_{x^{\alpha +1}}}} = \frac{\delta _{ij} {L_{ji}}}{\delta {v_{t_j x^\alpha }}}$$
yields
$$\begin{aligned} -\frac{\delta _{1i} {h_i}}{\delta {v_{x^{\alpha +1}}}}&= -\frac{\delta _{ij} {}}{\delta {v_{t_j x^\alpha }}} \left( v_{t_j} \frac{\delta _{1} {h_i}}{\delta {v_x}} + v_{t_j x} \frac{\delta _{1} {h_i}}{\delta {v_{xx}}} + v_{t_j xx} \frac{\delta _{1} {h_i}}{\delta {v_{xxx}}} +\ldots \right) \\&= -\frac{\delta _{1} {h_i}}{\delta {v_{x^{\alpha +1}}}}, \end{aligned}$$
which is trivial.
Similarly, the equation
$$\frac{\delta _{1j} {L_{1j}}}{\delta {v_{x}}} = \frac{\delta _{ij} {L_{ij}}}{\delta {v_{t_i}}}$$
yields PKdV equation
$$\begin{aligned} v_{t_j} = g_j, \end{aligned}$$
(29)
and for $\alpha > 0$, the equation
$$\frac{\delta _{1j} {L_{1j}}}{\delta {v_{x^{\alpha +1}}}} = \frac{\delta _{ij} {L_{ij}}}{\delta {v_{t_i x^\alpha }}}$$
is trivial.
All equations of the form
$$\frac{\delta _{1i} {L_{1i}}}{\delta {v_{x I}}} = \frac{\delta _{ij} {L_{ji}}}{\delta {v_{t_j I}}} \quad (t_i \not \in I) \qquad \text {and} \qquad \frac{\delta _{1j} {L_{1j}}}{\delta {v_{x I}}} = \frac{\delta _{ij} {L_{ij}}}{\delta {v_{t_i I}}} \quad (t_j \not \in I)$$
where I contains any $t_l$ ($l > 1$) are trivial because each term is zero.
The equations
$$\frac{\delta _{1i} {L_{1i}}}{\delta {v_{It_i}}} = \frac{\delta _{1j} {L_{1j}}}{\delta {v_{It_j}}} \qquad (x \not \in I)$$
and
$$\frac{\delta _{ij} {L_{ij}}}{\delta {v_{It_j}}} = \frac{\delta _{ik} {L_{ik}}}{\delta {v_{It_k}}} \qquad (t_i \not \in I)$$
are easily seen to be trivial as well.

Equations (5)

By construction, the equations $\displaystyle \frac{\delta _{1i} {L_{1i}}}{\delta {v}} = 0$ for $i >1$ are the equations
$$\begin{aligned} v_{x t_i} = {{\mathrm{D}}}_x g_i. \end{aligned}$$
(30)
For I containing any $t_l$, $l > 1$, $l \ne i$, the equations $\displaystyle \frac{\delta _{1i} {L_{1i}}}{\delta {v_{t_I}}} = 0 $ are trivial.
The last family of equations we discuss as a lemma because its calculation is far from trivial.

Lemma 5.6

The equations $\displaystyle \frac{\delta _{ij} {L_{ij}}}{\delta {v_{x^\alpha }}} = 0$ are corollaries of the PKdV equations.

Proof

(of Lemma 5.6) From Eq. (24) we see that the variational derivative of $L_{ij}$ contains only three nonzero terms,

$$\begin{aligned} \frac{\delta _{ij} {L_{ij}}}{\delta {v_{x^\alpha }}} = \frac{\partial {L_{ij}}}{\partial {v_{x^\alpha }}} - {{\mathrm{D}}}_{i} \left( \frac{\partial {L_{ij}}}{\partial {v_{x^\alpha t_i}}} \right) - {{\mathrm{D}}}_{j} \left( \frac{\partial {L_{ij}}}{\partial {v_{x^\alpha t_j}}} \right) . \end{aligned}$$

(31)

In particular, the equation $\displaystyle \frac{\delta _{ij} {L_{ij}}}{\delta {v}} = 0$ yields ${{\mathrm{D}}}_i g_j - {{\mathrm{D}}}_j g_i = 0$, that is, the compatibility condition of the flows $v_{t_i}=g_i$ and $v_{t_j}=g_j$. To determine the first term on the right hand side of Eq. (31) for an arbitrary $\alpha >0$, we use an indirect method. Assume that the dimension of multi-time N is at least 4 and fix $k > 1$ distinct from i and j. Let v be a solution of all PKdV equations except $v_{t_k} = g_k$. By Proposition 5.4 we have

$$\begin{aligned} \sum _I \frac{\partial {L_{ij}}}{\partial {v_I}} v_{I t_k} = {{\mathrm{D}}}_{k} L_{ij} = {{\mathrm{D}}}_{j} L_{ik} - {{\mathrm{D}}}_{i} L_{jk}. \end{aligned}$$

(32)

Since $\frac{\partial {L_{ij}}}{\partial {v_I}}$ does not contain any derivatives with respect to $t_k$, we can determine $\frac{\partial {L_{ij}}}{\partial {v_{x^\alpha }}}$ by looking at the terms in the right hand side of Eq. (32) containing $v_{x^\alpha t_k}$. These are

$$\begin{aligned}&\quad {{\mathrm{D}}}_{j} \bigg ( - \frac{1}{2} g_i v_{t_k} + v_{t_k} \frac{\delta _{1} {h_i}}{\delta {v_x}} + v_{x t_k} \frac{\delta _{1} {h_i}}{\delta {v_{xx}}} + \ldots \bigg ) \\&- {{\mathrm{D}}}_{i} \bigg ( - \frac{1}{2} g_j v_{t_k} + v_{t_k} \frac{\delta _{1} {h_j}}{\delta {v_x}} + v_{x t_k} \frac{\delta _{1} {h_j}}{\delta {v_{xx}}} + \ldots \bigg ). \end{aligned}$$

Now we expand the brackets. By again throwing out all terms that do not contain any $v_{x^\alpha t_k}$, and those that cancel modulo $v_{t_i} = g_i$ or $v_{t_j} = g_j$, we get

$$\begin{aligned}&- v_{t_k} {{\mathrm{D}}}_{j}\left( \frac{\delta _{1} {h_i}}{\delta {v_x}} \right) + v_{x t_k} {{\mathrm{D}}}_{j}\left( \frac{\delta _{1} {h_i}}{\delta {v_{xx}}} \right) + v_{xx t_k} {{\mathrm{D}}}_{j}\left( \frac{\delta _{1} {h_i}}{\delta {v_{xxx}}} \right) + \ldots \\&+ v_{t_k} {{\mathrm{D}}}_{i}\bigg ( \frac{\delta _{1} {h_j}}{\delta {v_x}} \bigg ) - v_{x t_k} {{\mathrm{D}}}_{i}\bigg ( \frac{\delta _{1} {h_j}}{\delta {v_{xx}}} \bigg ) - v_{xx t_k} {{\mathrm{D}}}_{i}\bigg ( \frac{\delta _{1} {h_j}}{\delta {v_{xxx}}} \bigg ) - \ldots . \end{aligned}$$

Comparing this to Eq. (32), we find that

$$\frac{\partial {L_{ij}}}{\partial {v_{x^\alpha }}} = -{{\mathrm{D}}}_{i}\left( \frac{\delta _{1} {h_j}}{\delta {v_{x^{\alpha +1}}}} \right) + {{\mathrm{D}}}_{j}\left( \frac{\delta _{1} {h_i}}{\delta {v_{x^{\alpha +1}}}} \right) . $$

On the other hand we have

$$- {{\mathrm{D}}}_{i} \left( \frac{\partial {L_{ij}}}{\partial {v_{x^\alpha t_i}}} \right) - {{\mathrm{D}}}_{j} \left( \frac{\partial {L_{ij}}}{\partial {v_{x^\alpha t_j}}} \right) = {{\mathrm{D}}}_{i} \left( \frac{\delta _{1} {h_j}}{\delta {v_{x^{\alpha +1}}}} \right) - {{\mathrm{D}}}_{j} \left( \frac{\delta _{1} {h_i}}{\delta {v_{x^{\alpha +1}}}} \right) ,$$

so Equation (31) implies that $\displaystyle \frac{\delta _{ij} {L_{ij}}}{\delta {v_{x^\alpha }}} = 0$ for any $\alpha $.

Since $\displaystyle \frac{\delta _{23} {L_{23}}}{\delta {v_{x^\alpha }}} = 0$ does not depend on the dimension $N \geqslant 3$, the result for $N \geqslant 4$ implies the claim for $N = 3$. $\square $

This concludes the proof of Theorem 5.2. $\square $

6 Relation to Hamiltonian Formalism

In this last section, we briefly discuss the connection between the closedness of $\mathscr {L}$ and the involutivity of the corresponding Hamiltonians.

In Proposition 2.2 we saw that $\mathrm{{d}}\mathscr {L}$ is constant on solutions. For the one–dimensional case ($d=1$) with $\mathscr {L}$ depending on the first jet bundle only, it has been shown in [20] that this is equivalent to the commutativity of the corresponding Hamiltonian flows. If the constant is zero then the Hamiltonians are in involution. Now we will prove a similar result for the two-dimensional case.

We will use a Poisson bracket on formal integrals, i.e. equivalence classes of functions modulo x-derivatives [12, Chap. 1–2]. In this section, the integral sign $\int $ will always denote an equivalence class, not an integration operator. The Poisson bracket due to Gardner-Zakharov-Faddeev is defined by

$$ \left\{ {\textstyle \int }F, {\textstyle \int }G \right\} = \int \left( {{\mathrm{D}}}_x\frac{\delta _{1} {F}}{\delta {u}} \right) \frac{\delta _{1} {G}}{\delta {u}}.$$

Using integration by parts, we see that this bracket is anti-symmetric. Less obvious is the fact that it satisfies the Jacobi identity [18, Chap. 7]. As we did when studying the KdV hierarchy, we introduce a potential v that satisfies $v_x = u$, and we identify the space-coordinate x with the first coordinate $t_1$ of multi-time. We can now re-write the Poisson bracket as

$$\begin{aligned} \left\{ {\textstyle \int }F, {\textstyle \int }G \right\} = \int \left( {{\mathrm{D}}}_x \frac{\delta _{1} {F}}{\delta {v_x}} \right) \frac{\delta _{1} {G}}{\delta {v_x}} = - \int \frac{\delta _{1} {F}}{\delta {v}} \frac{\delta _{1} {G}}{\delta {v_x}}, \end{aligned}$$

(33)

for functions F and G that depend on the x-derivatives of v but not on v itself.

Assume that the coefficients $L_{1j}$ of the Lagrangian two-from $\mathscr {L}$ are given by

$$L_{1j} = \frac{1}{2} v_x v_{t_j} - h_j,$$

where $h_j$ is a differential polynomial in $v_x, v_{xx}, \ldots $. This is the case for the PKdV hierarchy. The $L_{1j}$ are Lagrangians of the equations

$$v_{x t_j} = {{\mathrm{D}}}_x g_j \qquad \text {or} \qquad u_{t_j} = {{\mathrm{D}}}_x g_j,$$

where $g_j := \frac{\delta _{1} {h_j}}{\delta {v_x}}$, hence $\frac{\delta _{1} {h_j}}{\delta {v}} = - {{\mathrm{D}}}_x g_j$. It turns out that the formal integral ${\textstyle \int }h_j$ is the Hamilton functional for the equation $u_{t_j} = {{\mathrm{D}}}_x g_j$ with respect to the Poisson bracket (33). Formally:

$$ \left\{ {\textstyle \int }h_j, u(y) \right\} =\left\{ {\textstyle \int }h_j, {\textstyle \int }u \,\delta (\cdot - y) \right\} = -\int \frac{\delta _{1} {h_j}}{\delta {v}} \delta (x - y) \ = {{\mathrm{D}}}_x g_j (y),$$

where $\delta $ denotes the Dirac delta.

Theorem 6.1

If $\mathrm{{d}}\mathscr {L}= 0$ on solutions, then the Hamiltonians are in involution,

$$\left\{ {\textstyle \int }h_i, {\textstyle \int }h_j \right\} = 0.$$

Proof

Recall notation (26). We have

$$\begin{aligned} {\textstyle \int }M_{1jk}&= \int \big ( {{\mathrm{D}}}_x L_{jk} - {{\mathrm{D}}}_j L_{1k} + {{\mathrm{D}}}_k L_{1j} \big ) \\&= \int \big ( - {{\mathrm{D}}}_j L_{1k} + {{\mathrm{D}}}_k L_{1j} \big ) \\&= \int \left( -\frac{1}{2} v_{x t_j} v_{t_k} - \frac{1}{2} v_x v_{t_k t_j} + {{\mathrm{D}}}_j h_k + \frac{1}{2} v_{x t_k} v_{t_j} + \frac{1}{2} v_x v_{t_j t_k} - {{\mathrm{D}}}_k L_{1j} \right) \\&= \int \left( \frac{1}{2} \big ( v_{x t_k} v_{t_j} - v_{x t_j} v_{t_k} \big ) - {{\mathrm{D}}}_k L_{1j} + {{\mathrm{D}}}_j h_k \right) \end{aligned}$$

Using Eq. (21) (which, as opposed to Eq. (19), is independent of the form of $h_i$ and $g_i$), the evolution equations $v_{t_j} = g_j$, and integration by parts, we find that

$$\begin{aligned} {\textstyle \int }M_{1jk}&= \int \left( \frac{1}{2} \big (v_{x t_k} v_{t_j} - v_{x t_j} v_{t_k} \big ) - {{\mathrm{D}}}_x a_{jk} + v_{t_k} {{\mathrm{D}}}_x g_j + {{\mathrm{D}}}_x a_{kj} - v_{t_j} D_x g_k \right) \\&= \int \left( -\frac{1}{2} \big ( g_j {{\mathrm{D}}}_x g_k - g_k {{\mathrm{D}}}_x g_j \big ) - {{\mathrm{D}}}_x a_{jk} + {{\mathrm{D}}}_x a_{kj} \right) \\&= \int g_k {{\mathrm{D}}}_x g_j \\&= - \int \frac{\delta _{1} {h_j}}{\delta {v}} \frac{\delta _{1} {h_k}}{\delta {v_x}} \\&= \left\{ {\textstyle \int }h_j, {\textstyle \int }h_k \right\} . \end{aligned}$$

Hence if $\mathrm{{d}}\mathscr {L}= 0$ on solutions of the evolution equations $v_{t_j} = g_j$, then the Hamilton functionals are in involution. $\square $

7 Conclusion

We have formulated the pluri-Lagrangian theory of integrable hierarchies, and propose it as a definition of integrability. The motivation for this definition comes from the discrete case [10, 13, 20] and the fact that we have established a relation with the Hamiltonian side of the theory. For the Hamiltonians to be in involution, we need the additional fact that the Lagrangian two-form is closed. However, we believe that the essential part of the theory is inherently contained in the pluri-Lagrangian formalism.

Since the KdV hierarchy is one of the most important examples of an integrable hierarchy, our construction of a pluri-Lagrangian structure for the PKdV hierarchy is an additional indication that the existence of a pluri-Lagrangian structure is a reasonable definition of integrability.

It is remarkable that multi-time Euler-Lagrange equations are capable of producing evolutionary equations. This is a striking difference from the discrete case, where the evolution equations (quad equations) imply the multi-time Euler–Lagrange equations (corner equations), but are themselves not variational [10].

References

Adler, V., Bobenko, A., Suris, Y.: Classification of integrable equations on quad-graphs. the consistency approach. Commun. Math. Phys. 233, 513–543 (2003)
Article MathSciNet MATH Google Scholar
Anderson, I.: The variational bicomplex. Preprint (1989)
Google Scholar
Baxter, R.: Solvable eight-vertex model on an arbitrary planar lattice. Philos. Trans. R. Soc. Lond. 289, 315–346 (1978)
Article MathSciNet Google Scholar
Baxter, R.: Free-fermion, checkerboard and Z-invariant lattice models in statistical mechanics. Proc. R. Soc. Lond. A 404, 1–33 (1986)
Article MathSciNet Google Scholar
Bazhanov, V., Mangazeev, V., Sergeev, S.: A master solution of the quantum Yang-Baxter equation and classical discrete integrable equations. Adv. Theor. Math. Phys. 16, 65–95 (2012)
Article MathSciNet MATH Google Scholar
Bazhanov, V., Mangazeev, V., Sergeev, S.: Faddeev-Volkov solution of the Yang-Baxter equation and discrete conformal symmetry. Nucl. Phys. B 784, 234–258 (2007)
Article MathSciNet MATH Google Scholar
Bobenko, A., Mercat, C., Suris, Y.: Linear and nonlinear theories of discrete analytic functions. Intergable structure and isomonodromic Green’s function. J. Reine Angew. Math. 583, 117–161 (2005)
Article MathSciNet MATH Google Scholar
Bobenko, A., Suris., Y.: Integrable systems on quad-graphs. Intern. Math. Res. Not. 2002(11), 573–611 (2002)
Google Scholar
Bobenko, A., Suris, Y.: Discrete pluriharmonic functions as solutions of linear pluri-Lagrangian systems. Commun. Math. Phys. 336, 199–215 (2015)
Article MathSciNet MATH Google Scholar
Boll, R., Petrera, M., Suris, Y.: What is integrability of discrete variational systems? Proc. R. Soc. A 470, 20130, 550 (2014)
Google Scholar
Burstall, F., Ferus, D., Pedit, F., Pinkall, U.: Harmonic tori in symmetric spaces and commuting Hamiltonian systems on loop algebras. Ann. Math. 138, 173–212 (1993)
Article MathSciNet MATH Google Scholar
Dickey, L.: Soliton equations and Hamiltonian systems, 2nd edn. World Scientific (2003)
Google Scholar
Lobb, S., Nijhoff., F.: Lagrangian multiforms and multidimensional consistency. J. Phys. A: Math. Theor. 42, 454,013 (2009)
Google Scholar
Newell, A.: Solitons in mathematics and physics. SIAM (1985)
Google Scholar
Nijhoff, F.: Lax pair for the Adler (lattice Krichever-Novikov) system. Phys. Lett. A 297, 49–58 (2002)
Article MathSciNet MATH Google Scholar
Noether, E.: Invariante Variationsprobleme. Nachrichten von der Gesellschaft der Wissenschaften zu Göttingen, Math.-Phys. Kl. 235–257 (1918)
Google Scholar
Ohnita, Y., Valli, G.: Pluriharmonic maps into compact Lie groups and factorization into unitons. Proc. Lond. Math. Soc. 61, 546–570 (1990)
Article MathSciNet MATH Google Scholar
Olver, P.: Applications of Lie groups to differential equations, 2nd edn. Springer (1993)
Google Scholar
Rudin, W.: Function theory in polydiscs. Benjamin (1969)
Google Scholar
Suris, Y.: Variational formulation of commuting Hamiltonian flows: multi-time Lagrangian 1-forms. J. Geom. Mech. 5, 365–379 (2013)
Article MathSciNet MATH Google Scholar
Suris, Y.: Variational symmetries and pluri-Lagrangian systems. In: Dynamical Systems, Number Theory and Applications. A Festschrift in Honor of Armin Leutbecher’s 80th Birthday, Hagen, Th., Rupp, F., Scheurle, J. (eds.) World Scientific, pp. 255–266 (2016)
Google Scholar

Download references

Acknowledgments

This research is supported by the Berlin Mathematical School and the DFG Collaborative Research Center TRR 109 “Discretization in Geometry and Dynamics”.

Author information

Authors and Affiliations

Inst. für Mathematik, Technische Universität Berlin, Straße des 17. Juni 136, 10623, Berlin, Germany
Yuri B. Suris & Mats Vermeeren

Authors

Yuri B. Suris
View author publications
You can also search for this author in PubMed Google Scholar
Mats Vermeeren
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yuri B. Suris .

Editor information

Editors and Affiliations

Technical University of Berlin, Berlin, Germany
Alexander I. Bobenko

Appendices

A. A very short introduction to the variational bicomplex

Here we introduce the variational bicomplex and derive the basic results that we use in the text. We follow Dickey, who provides a more complete discussion in [12, Chap. 19]. Another good source on a (subtly different) variational bicomplex is Anderson’s unfinished manuscript [2]. For ease of notation we restrict to real fields $u: \mathbb {R}^N \rightarrow \mathbb {R}$, rather than vector-valued fields.

The space of (p, q)-forms $\mathscr {A}^{(p,q)}$ consists of all formal sums

$$\begin{aligned} \omega ^{p,q} = \sum f \,\delta u_{I_1} \wedge \ldots \wedge \delta u_{I_p} \wedge \mathrm{{d}}t_{j_1} \wedge \ldots \wedge \mathrm{{d}}t_{j_q}, \end{aligned}$$

where f is a polynomial in u and partial derivatives of u of arbitrary order with respect to any coordinates. The vertical one-forms $\delta u_I$ are dual to the vector fields $\frac{\partial {}}{\partial {u_I}}$. The action of the derivative ${{\mathrm{D}}}_i$ on $\omega ^{p,q}$ is

$$\begin{aligned} {{\mathrm{D}}}_i \omega ^{p,q}&= \sum ({{\mathrm{D}}}_i f) \,\delta u_{I_1} \wedge \ldots \wedge \delta u_{I_p} \wedge \mathrm{{d}}t_{j_1} \wedge \ldots \wedge \mathrm{{d}}t_{j_q} \\&\qquad \quad +\, f \,\delta u_{I_1i} \wedge \ldots \wedge \delta u_{I_p} \wedge \mathrm{{d}}t_{j_1} \wedge \ldots \wedge \mathrm{{d}}t_{j_q} \\&\qquad \quad + \,\ldots + f \,\delta u_{I_1} \wedge \ldots \wedge \delta u_{I_pi} \wedge \mathrm{{d}}t_{j_1} \wedge \ldots \wedge \mathrm{{d}}t_{j_q}. \end{aligned}$$

The integral of $\omega ^{p,q}$ over an q-dimensional manifold is the (p, 0)-form defined by

$$\begin{aligned} \int \omega ^{p,q} = \sum \left( \int f \, \mathrm{{d}}t_{j_1} \wedge \ldots \wedge \mathrm{{d}}t_{j_q} \right) \delta u_{I_1} \wedge \ldots \wedge \delta u_{I_p}. \end{aligned}$$

We call (0, q)-forms horizontal and (p, 0)-forms vertical. The horizontal exterior derivative $\mathrm{{d}}: \mathscr {A}^{(p,q)} \rightarrow \mathscr {A}^{(p,q+1)}$ and the vertical exterior derivative $\delta : \mathscr {A}^{(p,q)} \rightarrow \mathscr {A}^{(p+1,q)}$ are defined by the anti-derivation property

(a)
$\displaystyle \mathrm{{d}}\left( \omega _1^{p_1,q_1} \wedge \omega _2^{p_2,q_2} \right) = \mathrm{{d}}\omega _1^{p_1,q_1} \wedge \omega _2^{p_2,q_2} + (-1)^{p_1+q_1} \, \omega _1^{p_1,q_1} \wedge \mathrm{{d}}\omega _2^{p_2,q_2}$,

$\displaystyle \delta \left( \omega _1^{p_1,q_1} \wedge \omega _2^{p_2,q_2} \right) = \delta \omega _1^{p_1,q_1} \wedge \omega _2^{p_2,q_2} + (-1)^{p_1+q_1} \, \omega _1^{p_1,q_1} \wedge \delta \omega _2^{p_2,q_2}$,

and by the way they act on (0, 0)-, (1, 0)-, and (0, 1)-forms:

(b)
$\displaystyle \mathrm{{d}}f = \sum _j {{\mathrm{D}}}_j f \,\mathrm{{d}}t_j = \sum _j \bigg (\frac{\partial {f}}{\partial {t_j}} + \sum _I \frac{\partial {f}}{\partial {u_I}}u_{Ij} \bigg ) \mathrm{{d}}t_j$, $\displaystyle \delta f = \sum _I \frac{\partial {f}}{\partial {u_I}} \delta u_{I}$,
(c)
$\displaystyle \mathrm{{d}}(\delta u_I )= - \sum _j \delta u_{Ij} \wedge \mathrm{{d}}t_j,$ $\delta (\delta u_I) = 0$,
(d)
$\mathrm{{d}}(\mathrm{{d}}t_j) = 0$, $\delta (\mathrm{{d}}t_j) = 0$, $\displaystyle \delta (\mathrm{{d}}u_I )= - \mathrm{{d}}(\delta u_I )= \sum _j \delta u_{Ij} \wedge \mathrm{{d}}t_j$.

Properties (a)–(d) determine the action of $\mathrm{{d}}$ and $\delta $ on any form. The corresponding mapping diagram is known as the variational bicomplex.

$$\begin{aligned} \begin{matrix} \vdots &{} &{} \vdots &{} &{} &{} &{} \vdots &{} &{} \vdots \\ \uparrow {\scriptstyle \delta } &{} &{} \uparrow {\scriptstyle \delta } &{} &{} &{} &{} \uparrow {\scriptstyle \delta } &{} &{} \uparrow {\scriptstyle \delta } \\ \mathscr {A}^{(1,0)} &{} \xrightarrow {\mathrm{{d}}} &{} \mathscr {A}^{(1,1)} &{} \xrightarrow {\mathrm{{d}}} &{} \ldots &{} \xrightarrow {\mathrm{{d}}} &{} \mathscr {A}^{(1,n-1)} &{} \xrightarrow {\mathrm{{d}}} &{} \mathscr {A}^{(1,n)} \\ \uparrow {\scriptstyle \delta } &{} &{} \uparrow {\scriptstyle \delta } &{} &{} &{} &{} \uparrow {\scriptstyle \delta } &{} &{} \uparrow {\scriptstyle \delta } \\ \mathscr {A}^{(0,0)} &{} \xrightarrow {\mathrm{{d}}} &{} \mathscr {A}^{(0,1)} &{} \xrightarrow {\mathrm{{d}}} &{} \ldots &{} \xrightarrow {\mathrm{{d}}} &{} \mathscr {A}^{(0,n-1)} &{} \xrightarrow {\mathrm{{d}}} &{} \mathscr {A}^{(0,n)} \\ \end{matrix} \end{aligned}$$

The following claims follow immediately from the definitions.

Proposition A.1

We have $\mathrm{{d}}^2 = \delta ^2 = 0$ and $\mathrm{{d}}\delta + \delta \mathrm{{d}}= 0$.

Remark A.2

This implies that $\mathrm{{d}}+ \delta : \mathscr {A}^k \rightarrow \mathscr {A}^{k+1}$, where $\mathscr {A}^k := \bigcup _{i=0}^k \mathscr {A}^{(i,k-i)}$, is an exterior derivative as well.

Proposition A.3

We have ${{\mathrm{D}}}_i \delta = \delta {{\mathrm{D}}}_i$.

Proposition A.4

For a differential polynomial h, define the corresponding vertical generalized vector field by $\partial _h := \sum _I h_I \frac{\partial {}}{\partial {u_I}}$. We have $\mathrm{{d}}{{\mathrm{\iota }}}_{\partial _h} + {{\mathrm{\iota }}}_{\partial _h} \mathrm{{d}}= 0$.

Proof

It suffices to show this for (0,0)-forms (polynomials f in u and partial derivatives of u), for (0,1)-forms $\mathrm{{d}}t_j$, and for (1,0)-forms $\delta u_I$. For (0,0)-forms, both terms of the claimed identity are zero:

$$ \mathrm{{d}}\Big ( {{\mathrm{\iota }}}_{\partial _h} f \Big ) = 0, \quad {{\mathrm{\iota }}}_{\partial _h} ( \mathrm{{d}}f )= {{\mathrm{\iota }}}_{\partial _h} \bigg ( \sum _j {{\mathrm{D}}}_j f \,\mathrm{{d}}t_j\bigg ) = 0. $$

Likewise for (0,1)-forms:

$$ \mathrm{{d}}\Big ( {{\mathrm{\iota }}}_{\partial _h} \mathrm{{d}}t_j \Big ) = 0, \quad {{\mathrm{\iota }}}_{\partial _h} ( \mathrm{{d}}\mathrm{{d}}t_j ) = 0. $$

For (1,0)-forms we find:

$$ {{\mathrm{\iota }}}_{\partial _h} ( \mathrm{{d}}\delta u_I ) = {{\mathrm{\iota }}}_{\partial _h} \bigg ( - \sum _j \delta u_{Ij} \wedge \mathrm{{d}}t_j \bigg ) = - \sum _j h_{Ij} \, \mathrm{{d}}t_j = - \mathrm{{d}}h_{I} = - \mathrm{{d}}\Big ( {{\mathrm{\iota }}}_{\partial _h} \delta u_I \Big ). $$

$\square $

B. Proof of Lemma 2.4

Assume that the action is stationary on all d-dimensional stepped surfaces in $\mathbb {R}^N$. Let S be a smooth d-dimensional surface in $\mathbb {R}^N$. Partition the space $\mathbb {R}^N$ into hypercubes $C_i$ of edge length $\varepsilon $. We can choose this partitioning in such a way that the surface S does not contain the center of any of the hypercubes. Denote $S_i^N := S \cap C_i$.

We give each hypercube its own coordinate system $[-1,1]^N \rightarrow C_i$ and identify the hypercube with its coordinates. In each punctured hypercube $[-1,1]^N \setminus \{0\}$ we define a family of balloon maps

$$ \mathscr {B}_\alpha ^{N}: [-1,1]^N \setminus \{0\} \rightarrow [-1,1]^N \setminus \{0\}: x \mapsto {\left\{ \begin{array}{ll} \dfrac{\alpha x}{\Vert x \Vert _\mathrm {max}} &{} \text {if } \Vert x \Vert _\mathrm {max} < \alpha \\ x &{} \text {if } \Vert x \Vert _\mathrm {max} \geqslant \alpha \end{array}\right. }$$

for $\alpha \in [0,1]$. Here, $ \Vert x \Vert _\mathrm {max} := \max (|x_1|, \ldots |x_N|)$ denotes the maximum norm with respect to the local coordinates. The idea is that from the center of each hypercube, we inflate a square balloon which pushes the curve away from the center, until it lies on the boundary of the hypercube.

Indeed, the deformed surface $S_i^{N-1} := \mathscr {B}_1^{N} (S_i^N) = \mathscr {B}_1^{N} (S \cap C_i)$ lies on the boundary of the hypercube, i.e. within the $(N-1)$-faces of the hypercube. We want it to lie within the d-faces of the hypercube, which would imply that it is a stepped surface. To achieve this, we introduce a balloon map

$$ \mathscr {B}_\alpha ^{N-1,j}: [-1,1]^{N-1} \setminus \{0\} \rightarrow [-1,1]^{N-1} \setminus \{0\}: x \mapsto {\left\{ \begin{array}{ll} \dfrac{\alpha x}{\Vert x \Vert _\mathrm {max}} &{} \text {if } \Vert x \Vert _\mathrm {max} < \alpha \\ x &{} \text {if } \Vert x \Vert _\mathrm {max} \geqslant \alpha \end{array}\right. }$$

in each of the $(N-1)$-faces $C_i^j$ of the hypercube $C_i$, which pushes the surface into the $(N-2)$-faces. We denote the surface we obtain this way by $S_i^{N-2}$. If the surface happens to contain the center of a $(N-1)$-face, we can slightly perturb the surface without affecting the argument. By iterating this procedure, using balloon maps $\mathscr {B}_\alpha ^{k,j}$ in each k-face $C_i^j$ ($N \geqslant k \geqslant d+1$), we obtain a surface $S_i^d$ that lies in the d-faces (Figs. 5 and 6).

Consider the $(d+1)$-dimensional surface

$$ M_i := \bigcup _{k=d+1}^N \bigcup _{\begin{array}{c} j:\ C_i^j \text { is a} \\ k\text { -face of } C_i \end{array}} \bigcup _{\alpha \in [0,1]} \mathscr {B}_\alpha ^{k,j} (S_i^k \cap C_i^j)$$

that is swept out by the consecutive application of the balloon maps to $S_i^N := S \cap C_i$. Assuming that $\varepsilon $ is small compared to the curvature of S, the $(d+1)$-dimensional volume of each of the $\bigcup _{\alpha \in [0,1]} \mathscr {B}_\alpha ^{k,j} (S_i^k \cap C_i^j)$ is of the order $\varepsilon ^{d+1}$. The number of such volumes making up $M_i$ only depends on the dimensions N and d, not on $\varepsilon $, so the $(d+1)$-dimensional volume $|M_i|$ of $M_i$ is of the order $|M_i| = \mathscr {O}(\varepsilon ^{d+1})$.

Now consider a variation $\mathscr {V}$ with compact support and restrict the surface S to this support. Denote by $\widehat{S} := \bigcup _i S_i^d$ the stepped surface obtained from S by repeated application of balloon maps in all the hypercubes, and by $M := \bigcup _i M_i$ the $(d+1)$-dimensional surface swept out by these balloon maps. The bounary of M consists of S, $\widehat{S}$, and a small strip of area $\mathscr {O}(\varepsilon )$ connecting the boundaries of S and $\widehat{S}$ (the dotted line in Fig. 5). The number of hypercubes intersecting S is of order $\varepsilon ^{-d}$, so $| M | = \mathscr {O}(\varepsilon ^{-d}) \mathscr {O}(\varepsilon ^{d+1}) = \mathscr {O}(\varepsilon )$. It follows that

$$\begin{aligned} \left| \int _{\widehat{S}} {{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta \mathscr {L}- \int _{S} {{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta \mathscr {L}\right|&= \left| \int _{\partial M} {{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta \mathscr {L}\right| +\mathscr {O}(\varepsilon ) \\&= \left| \int _{M} \mathrm{{d}}({{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta \mathscr {L}) \right| +\mathscr {O}(\varepsilon ) \rightarrow 0 \end{aligned}$$

as $\varepsilon \rightarrow 0$. By assumption, $\int _{\widehat{S}} {{\mathrm{\iota }}}_{{{\mathrm{pr}}}\mathscr {V}} \delta \mathscr {L}= 0$ for all $\varepsilon $, so the action on S will be stationary as well. $\square $

Rights and permissions

Open Access This chapter is distributed under the terms of the Creative Commons Attribution-Noncommercial 2.5 License (http://creativecommons.org/licenses/by-nc/2.5/) which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

The images or other third party material in this chapter are included in the work’s Creative Commons license, unless indicated otherwise in the credit line; if such material is not included in the work’s Creative Commons license and the respective action is not permitted by statutory regulation, users will need to obtain permission from the license holder to duplicate, adapt or reproduce the material.

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Suris, Y.B., Vermeeren, M. (2016). On the Lagrangian Structure of Integrable Hierarchies. In: Bobenko, A. (eds) Advances in Discrete Differential Geometry. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-50447-5_11

Download citation

DOI: https://doi.org/10.1007/978-3-662-50447-5_11
Published: 13 August 2016
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-50446-8
Online ISBN: 978-3-662-50447-5
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics

On the Lagrangian Structure of Integrable Hierarchies

Abstract

Similar content being viewed by others

An Ablowitz-Ladik Integrable Lattice Hierarchy with Multiple Potentials

Second-order integrable Lagrangians and WDVV equations

On a Lagrangian Reduction and a Deformation of Completely Integrable Systems

Keywords

1 Introduction

2 Pluri-Lagrangian Systems

2.1 Definition

Definition 2.1

Proposition 2.2

Proof

2.2 Approximation by Stepped Surfaces

Definition 2.3

Lemma 2.4

2.3 Multi-time Euler-Lagrange Equations for Curves

Theorem 2.5

Remark 2.6

Proof

2.4 Multi-time Euler-Lagrange Equations for Two-Dimensional Surfaces

Theorem 2.7

Remark 2.8

Lemma 2.9

Proof

Proof

3 Pluri-Lagrangian Structure of the Sine-Gordon Equation

Theorem 3.1

Proof

Remark 3.2

4 The KdV Hierarchy

Definition 4.1

Remark 4.2

Proposition 4.3

Corollary 4.4

Lemma 4.5

Proof

Proposition 4.6

Proof

5 Pluri-Lagrangian Structure of PKdV Hierarchy

Lemma 5.1

Proof

Theorem 5.2

5.1 Variational Symmetries and the Pluri-Lagrangian Form

Remark 5.3

Proposition 5.4

Proof

Remark 5.5

5.2 The Multi-time Euler-Lagrange Equations

Proof

Lemma 5.6

Proof

6 Relation to Hamiltonian Formalism

Theorem 6.1

Proof

7 Conclusion

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Appendices

A. A very short introduction to the variational bicomplex

Proposition A.1

Remark A.2

Proposition A.3

Proposition A.4

Proof

B. Proof of Lemma 2.4

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation