Abstract
We introduce a variational setting for the action functional of an autonomous and indefinite Lagrangian on a finite dimensional manifold M. Our basic assumption is the existence of an infinitesimal symmetry whose Noether charge is the sum of a one-form and a function on M. Our setting includes different types of Lorentz–Finsler Lagrangians admitting a timelike Killing vector field.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
The principle of least or stationary-action in Lagrangian mechanics has been at the heart of the development of the variational calculus. It has given rise to different methods for solving the problem of finding (or at least establishing the existence of) a path of evolution between two points of a dynamical system described by a finite number of variables (see, e.g., [12, 42, 49]). The techniques developed to get solutions have been proved to be useful in the study of general Lagrangian systems with an infinite number of degrees of freedom (see, e.g., [28, 47]). A very classical field of application of these methods is the geodesic problem in Riemannian and Finsler geometry. In this case, completeness of the metric is enough to get a solution with fixed end points and topological arguments give multiplicity of geodesics. The landscape is quite different for the analogous problem on a Lorentzian manifold where (geodesic) completeness is not enough to get compactness properties on the space of paths between two points and other geometric conditions as global hyperbolicity have been considered as a replacement [4, 46]. Only recently the features underlying global hyperbolicity, in connection with the geodesic problem and more generally with causality, start to find a field of applications beyond classical Lorentzian geometry (see [11, 27, 35, 41]).
On the other hand, the existence of a symmetry that leaves invariant the action functional of a Lagrangian is a source of information about its stationary points through the Noether’s theorem. The impact of this result in variational calculus can be hardly overestimated. A nice application of it to the geodesic problem of a Lorentzian manifold can be found in [31], where a stationary spacetime \((M,g_L)\) (i.e. a spacetime endowed with a timelike Killing vector field) is considered. In this case, the Noether charge associated to the Killing field is used to get a reduction of the Sobolev manifold of paths between two points p and q in M, where the energy functional of the Lorentz metric is defined, to the infinite dimensional submanifold \({\mathcal {N}}_{p,q}\) of the curves with a.e. constant Noether charge. This reduction resembles the classical Routh reduction for Lagrangian systems (see, e.g. [23, 39]) but it involves merely the paths space and not the phase space. The roots of the idea of this infinite dimensional reduction are in a couple of papers about geodesic connectedness of static and stationary spacetimes admitting a global splitting [9, 30] and, indeed, some local computation in [31] and in the present paper (see Theorem 7.6) are based on those papers.
Our goal is to show that the full variational setting in [31] admits a generalization for an indefinite \(C^1\) Lagrangian L on a smooth finite dimensional manifold M. We assume that L is invariant by a one-parameter group of local diffeomorphisms whose infinitesimal generator is a vector field K and that the associated Noether charge is a \(C^1\) function N on TM, which is affine in each tangent space \(T_xM\):
where Q and d are a one-form and a function on M, respectively. We assume also that d is invariant by the flow of K and \(Q(K)<0\) (see Assumption 2.2). Notice that in the case of a stationary Lorentzian manifold, \(d=0\) and Q coincides with the one-form metrically equivalent to the timelike Killing field K.
In Theorems 5.7 and 6.3 we obtain existence and multiplicity of weak solutions to the Euler-Lagrange equation of the action functional of L connecting two given points on M. The regularity of solutions is analysed in Appendix A. A key assumption in Theorem 5.7 is c-boundedness (Definition 5.1) of \({\mathcal {N}}_{p,q}\). Under conditions contained in Assumptions 2.2–2.9, c-boundedness implies that the reduced action functional \(\mathcal {J}\) (differently from the action) is bounded from below (Proposition 5.2) and satisfies the Palais-Smale condition (Theorem 5.6). We show in Sect. 7 that c-boundedness is essentially equivalent to c-precompactness of \({\mathcal {N}}_{p,q}\), a condition introduced in [31] which is a compactness property of the set of paths in a sublevel of the reduced action functional. Actually, on a stationary Lorentzian manifold M, if \({\mathcal {N}}_{p,q}\) is c-precompact for all \(c\in {\mathbb {R}}\) then M is globally hyperbolic (see [31, Proposition B1] in the case when the timelike Killing vector field is complete and [15, Section 6.4-(a)] for any timelike Killing vector field). On the converse, if M is globally hyperbolic with a complete smooth Cauchy hypersurface then \({\mathcal {N}}_{p,q}\) is c-precompact for all \(c\in {\mathbb {R}}\) (see [15, Theorem 5.1]). Thus, if c-precompactness is satisfied for all \(c\in {\mathbb {R}}\), the spacetime M cannot be compact. Inspired by Proposition A.3 in [31], we give a condition that implies c-precompactness of \({\mathcal {N}}_{p,q}\), for all \(c\in {\mathbb {R}}\) and all \(p,q\in M\), in our setting, and that cannot be satisfied if M is compact (see Proposition 8.1).
The Lagrangians that we consider (see Sect. 3) include, but are not limited to, \(C^1\) stationary Lorentzian metrics, electromagnetic type Lagrangians on a stationary Lorentzian manifold with a Killing vector field K and K-invariant potentials (see, e.g. [6, 16, 19, 50]) and some stationary Lorentz–Finsler metrics. Loosing speaking, a Lorentz–Finsler metric is an indefinite, positively homogeneous of degree two in the velocities, Lagrangian that generalizes the quadratic form of a Lorentzian metric in the same way as the square of a Finsler metric generalizes the square of the norm of a Riemannian metric. They were studied by K. Beem [7] following some work by H. Busemann. Although considered from time to time in works about anisotropy in special and general relativity (even if they often appear as the square of a more fundamental function, positively homogeneous of degree one in the velocities, see e.g. [13, 34, 45]), there has been a growing interest about them (or their possible generalizations as non-degenerate Lagrangians defined on a cone bundle on M) in the last decade, see for example [1, 10, 17, 29, 32, 33, 36, 38, 40, 44].
Some explicit examples that are covered by our present setting are Beem’s Lorentz–Finsler metrics endowed with a timelike Killing vector field K including also their sum with a potential function and a one-form, both invariant by the flow of K (see Example 3.6). In particular, this class includes Lagrangians defined as
introduced in [35], where F and \(\omega \) are, respectively, a Finsler metric and a one-form on M both invariant by the one-parameter group of local diffeomorphisms generated by K, provided a sign assumption on \(F^2(K)-\omega ^2(K)\) is satisfied, see Example 3.9. Other examples are given by Lagrangians L that locally, i.e. on a neighborhood of the type \(S\times (a,b)\subset M\), can be expressed as
where \(L_0\) is a \(C^1\) Tonelli Lagrangian on S, with quadratic growth in the velocities, \(\omega \), d and \(\beta \) are respectively a \(C^1\) one-form on S and two \(C^1\) functions on S with \(\beta >0\) (see Example 3.1 and Proposition 7.4). We include the possibility that the Lagrangian \(L_0\) might not be twice differentiable on the zero section of TS, but we require that it is pointwise strongly convex (see Assumption 2.7-(ii)). Notice that the possible lack of twice differentiability of \(L_0\) at the zero section implies that L is not twice differentiable along the line bundle defined by \(K=\partial _t\), being t the natural coordinate on the interval (a, b). Lagrangians of the type (1.2) on a global splitting \(S\times {\mathbb {R}}\), with \(L_0\) being the square of a Finsler metric and \(d=0\), were introduced in [36] when \(\omega = 0\) (see also [20]) and in [21] for \(\omega \ne 0\).
Let us point out a comment about the regularity of the objects we consider in this work. We consider a smooth, finite dimensional manifold M; the Lagrangian L and the vector field K are of class \(C^1\) on TM. Lorentz–Finsler Lagrangians are not twice differentiable at the zero section of TM, hence assuming that L is \(C^1\) is motivated by that wide class of indefinite Lagrangians. We are confident that both the regularity of L and the linearity of the Noether charge can be further relaxed at least for the existence of a global minimizer of the reduced action functional. This is clearly suggested by the fact that L is the sum of a Lagrangian which is strongly convex in the velocities and a \(C^1\) Lagrangian related to the Noether charge (see (2.7)), and that some computations of this work are more related to the sublinearity of the Noether charge than to its expression (1.1).
2 Notations, assumptions and preliminary results
Let M be a smooth, connected, \((m+1)\)-dimensional manifold, with \(m \ge 1\); let us denote by TM the tangent bundle of M. Throughout the paper, we consider a (auxiliary) complete Riemannian metric g on M and we denote by \(\Vert \cdot \Vert :TM \rightarrow \mathbb {R}\) its induced norm, i.e. \(\Vert v\Vert ^2 = g(v,v)\) for all \(v \in TM\).
We will often denote an element of TM as a couple (x, v), \(x\in M\), \(v\in T_xM\) (for example we use such a notation in connection with the variables of an autonomous Lagrangian \(L:TM\rightarrow {\mathbb {R}}\), i.e. we will write \(L=L(x,v)\)). On the other hand, we will avoid specifying the point x where a one-form \(\omega \) or a vector field K on M is applied, and we will write, for example \(\omega (v)\), \(v\in TM\) or also \(\omega (K)\). Some exceptions are possible for the sake of clarity, and we will write then, e.g., \(K_x\) or \(\omega _x(v)\), \(v\in T_xM\) and also \(\omega _x(K)\). We will often explicitly write the variable of a function on M, like in d(x), C(x), \(\lambda (x)\), etc. When a vector field K on M is evaluated along a curve \(z:[0,1]\rightarrow M\), we will write K(z). In some cases we will look at a one-form Q on M also as a function on TM writing then Q(x, v).
Let \(L:TM\rightarrow {\mathbb {R}}\) be a Lagrangian on M. For any \((x,v)\in TM\), we denote by \(\partial _v L(x,v)[\cdot ]\) the vertical derivative of L, i.e. for all \(x\in M\) and all \(v,w\in T_xM\)
We need also a notion of horizontal derivative of the Lagrangian L (a derivative w.r.t. x). Let \((x^0,\dots ,x^m)\) be coordinates on M and let \((x^0,\dots ,x^m, v^{0},\dots ,v^m)\) be the induced ones on TM. Let \((x,v)\in TM\), with coordinates values \((x^0,\dots ,x^m, v^{0},\dots ,v^m)\); we define \(\partial _x L(x,v)[\cdot ]\) as the v-depending one-form on M locally given by
Remark 2.3
Even though, differently from the vertical derivative, this definition is not intrinsic, it fits our purposes (in the following, we will make extensively use of local arguments in computations involving L). In particular, we denote by \(\Vert \partial _x L_c(x,v)\Vert \) and \(\Vert \partial _v L_c(x,v)\Vert \) the two scalar fields on TM which are pointwise the norm of the above two linear operators w.r.t. g.
Assumption 2.2
The Lagrangian \(L:TM \rightarrow \mathbb {R}\) satisfies the following conditions:
-
(i)
\(L\in C^1(TM)\);
-
(ii)
there exists a \(C^1\) vector field K on M such that L is invariant by the one-parameter group of local \(C^1\) diffeomorphisms generated by K (we call K an infinitesimal symmetry of L); moreover the Noether charge, i.e. the map \((x,v)\in TM\mapsto \partial _vL(x,v)[K]\in \mathbb {R}\), is a function N on TM which is the sum of a \(C^1\) one-form Q on M and a \(C^1\) function \(d:M\rightarrow {\mathbb {R}}\), i.e.
$$\begin{aligned} N(x,v):=\partial _vL(x,v)[K]=Q(v)+d(x); \end{aligned}$$(2.1) -
(iii)
the function d in (2.1) is invariant by the flow of K (in particular the case when d is a constant function is compatible); moreover,
$$\begin{aligned} Q(K)<0. \end{aligned}$$(2.2)
Remark 2.4
Vector fields K which are infinitesimal symmetries for L can be characterized similarly to Killing vector fields for Finsler metrics (see, e.g., [21]). We denote by \(K^c\) the complete lift of K to TM, which, using Einstein summation convention, is locally defined as:
It follows that, if \(\psi \) is a local flow of K, then for any \((x,v)\in TM\) the local flow \(\psi ^c\) of \(K^c\) on TM is given by \(\psi ^c(t,x,v)=\big (\psi (t,x),\partial _x\psi (t,x)[v]\big )\). Hence,
and, since
we get
Remark 2.6
Since K is an infinitesimal symmetry of L, by Noether’s theorem, the Noether charge is constant for any weak solution z of the Euler-Lagrange equation
of the Lagrangian L, independently from the boundary conditions. This can be seen by recalling that a weak solution \(z=z(s)\) of the Euler-Lagrange equation is a \(C^1\) curve (for fixed end points boundary conditions, see Appendix A) that locally (i.e. in natural local coordinates of TM) satisfies the system of equations
hence from (2.5) we get
Let us introduce a Lagrangian \(L_c\) on TM defined as
Proposition 3.2
The following statements hold:
-
(i)
\(L_c\in C^1(TM)\);
and, for all \((x,v)\in TM\):
-
(ii)
$$\begin{aligned} Q_x(K) = 2\left( L(x,K) - L(x,0)-d(x) \right) ; \end{aligned}$$(2.8)
-
(iii)
$$\begin{aligned}&L_c(x,0)=L(x,0),\nonumber \\&L(x,K) + L_c(x,K) = 2 \big (L(x,0)+d(x)\big ); \end{aligned}$$(2.9)
and
$$\begin{aligned} \partial _v L_c(x,v)[K]=-Q(v)+d(x), \end{aligned}$$(2.10) -
(iv)
the flow of K preserves also \(L_c\), i.e. \(K^c(L_c)=0\).
Proof
Statement (i) comes immediately from (2.7) and Assumption 2.2-(i). Let us prove (ii). Let \(x \in M\) be a given point and let \(l:\mathbb {R} \rightarrow \mathbb {R}\) be defined as
Hence, \(l(0) = 0\) and, by Assumption 2.2-(ii), we obtain
As a consequence, the function l is equal to
Therefore, noticing that \(Q_x(K) = 2\big ( l(1)-d(x))\), we obtain (2.8). Now (iii) is a simple consequence of (2.7) and (2.8).
Let us prove (iv). From (2.7) it is enough to prove that Q and Q(K) are invariant by the flow of \(K^c\) and K, respectively. Let us consider Q as a function on TM, i.e. \(Q(x,v):=Q(v)\), thus we have to show that \(K^c(Q)=0\). By (2.2), \(K_x\ne 0\) for all \(x\in M\), thus for each \({\bar{x}}\in M\) we can take a neighborhood U of \({\bar{x}}\) and a coordinate system \((x^0, x^1, \ldots , x^m)\) defined in U such that \(\frac{\partial }{\partial x^0}=K|_U\). Therefore, in such a coordinate system,
Since Q and d are \(C^1\), we know that \(\frac{\partial L}{\partial v^0}\) admits continuous partial derivatives w.r.t. the coordinates \((x^0,x^1,\ldots , x^m, v^0,v^1,\ldots ,v^m)\) in TU. Notice also that, from (2.5), \(K^c(L)=0\) is equivalent to \(\frac{\partial L}{\partial x^0}(x,v)=0\). Being then a constant function, \(\frac{\partial L}{\partial x^0}\) admits zero partial derivatives w.r.t. the coordinates \((x^0,x^1,\ldots , x^m, v^0,v^1,\ldots ,v^m)\) as well. As d is invariant by the flow of K, we have \(\frac{\partial d}{\partial x^0}=0\) on U. Thus, from (2.3), we then get
Since \(Q(x,K)=\frac{\partial L}{\partial v^0}\big (x,(1,0,\ldots ,0)\big )\), we also have
\(\square \)
Remark 2.8
From (2.10) and (iv) in Proposition 2.5, we have that, like L, \(L_c\) has affine Noether charge as well.
Recalling Remark 2.1, the following assumption ensures some growth conditions on \(L_c\), often used in critical point theory for the action functional of a Lagrangian (see, e.g., [2, 8]), and its pointwise strong convexity.
Assumption 2.7
The Lagrangian \(L_c:TM \rightarrow \mathbb {R}\), defined as in (2.7), satisfies the following assumptions:
-
(i)
there exists a continuous function \(C:M\rightarrow (0,+\infty )\) such that for all \((x,v)\in TM\), the following inequalities hold
$$\begin{aligned} L_c(x,v)&\le C(x)\big (\Vert v\Vert ^2+1\big ); \end{aligned}$$(2.11)$$\begin{aligned} \Vert \partial _x L_c(x,v)\Vert&\le C(x)\big (\Vert v\Vert ^2+1\big ); \end{aligned}$$(2.12)$$\begin{aligned} \Vert \partial _v L_c(x,v)\Vert&\le C(x)\big (\Vert v\Vert +1\big ); \end{aligned}$$(2.13) -
(ii)
there exists a continuous function \(\lambda :M \rightarrow (0,+\infty )\) such that for each \(x\in M\) and for all \(v_1, v_2\in T_xM\), the following inequality holds:
$$\begin{aligned} \big (\partial _vL_c(x,v_2)-\partial _vL_c(x,v_1)\big )[v_2-v_1] \ge \lambda (x)\Vert v_2 - v_1\Vert ^2; \end{aligned}$$(2.14)
Remark 3.3
We notice that from (2.14) and (2.10) we obtain
Moreover, for all \((x,v)\in TM\) we have
Thus, \(L_c\) satisfies the growth condition
and since \(L_c(x,0)=L(x,0)\) and \(\partial _vL_c(x,0)=\partial _vL(x,0)\) we get
The next and final assumption is needed to get a compactness condition on the sublevels of the reduced action functional (see Lemma 5.3) and then in the proof of the Palais-Smale condition for the same functional.
Assumption 2.9
There exist four constants, \(c_1,c_2,c_3, k_1, k_2\) such that, for all \(x \in M\), the following inequalities hold:
3 Some classes of examples
In this section we present various type of Lagrangians that satisfy Assumptions 2.2–2.9. We start with a generalization of the Lorentz–Finsler Lagrangians studied in [21].
Example 3.1
Let S be a smooth m–dimensional manifold and \(M = S \times \mathbb {R}\). Let \(g_S\) be a complete auxiliary Riemannian metric on S, whose associated norm is denoted by \(\Vert \cdot \Vert _S\), and let g be the product metric \(g=g_S\oplus dt^2\). Let \(L:TM \rightarrow \mathbb {R}\) be a Lagrangian on M defined as
where
-
(i)
\(L_0:TS \rightarrow \mathbb {R}\) belongs to \(C^1(TS)\) and there exists a continuous positive function \(\ell : S \rightarrow (0,+\infty )\) such that
$$\begin{aligned} L_0(x,\nu )&\le \ell (x)\big (\Vert \nu \Vert _S^2+1\big ); \end{aligned}$$(3.2)$$\begin{aligned} \Vert \partial _x L_0(x,\nu )\Vert _S&\le \ell (x)\big (\Vert \nu \Vert _S^2+1\big ); \end{aligned}$$(3.3)$$\begin{aligned} \Vert \partial _\nu L_0(x,\nu )\Vert _S&\le \ell (x)\big (\Vert \nu \Vert _S+1\big ); \end{aligned}$$(3.4) -
(ii)
\(L_0\) is pointwise strongly convex, i.e. there exists a continuous function \(\lambda _0:S\rightarrow (0,+\infty )\) such that, for all \(x\in S\) and all \(\nu _1,\nu _2\in T_xS\), (2.14) holds with \(L_c\) replaced by \(L_0\), \(\lambda \) by \(\lambda _0\) and \(\Vert \cdot \Vert \) by \(\Vert \cdot \Vert _S\);
-
(iii)
\(\omega \) is a \(C^1\) one-form on S, \(d:S\rightarrow \mathbb {R}\) is a \(C^1\) function and \(\beta :S \rightarrow (0,+\infty )\) is a \(C^1\) positive function.
In this case, the field \(K=\partial _t\equiv (0,1)\) is an infinitesimal symmetry of L and d is invariant by the flow of K, because it is a function on S. Notice that if \(L_0\) is the square of a Riemannian norm on S and \(d=0\) then L is the quadratic form associated with the Lorentzian metric of a standard stationary spacetime (see, e.g., [30]). Moreover, if \(L_0\) is the square of the norm of a Riemannian metric plus a one-form \(\omega _0\) on S, then they include electromagnetic type Lagrangians on a standard stationary Lorentzian manifold with an exact electromagnetic field on \(S\times {\mathbb {R}}\) having a potential one-form \(\omega _0\oplus d(x)\textrm{d}t\), see Remark 3.3 below.
In the next result we show that L defined as in (3.1) satisfies Assumptions 2.2–2.7 and we give some further conditions ensuring that it also satisfies Assumption 2.9.
Proposition 3.5
A Lagrangian L defined as in (3.1), such that (i)–(iii) above hold, satisfies Assumptions 2.2 and 2.7. Moreover, if there exist some constants \(b, \ell _1, \ell _2, \ell _3, \ell _4\), such that \(\beta (x) \ge b>0\), \(\lambda _0(x)\ge \ell _1>0\), \(L_0(x,0)\ge \ell _2\), \(\Vert \partial _vL_0 (x,0)\Vert \le \ell _3\) and \(|d(x)|\le \ell _4\), for every \(x\in S\), then L satisfies Assumption 2.9.
Proof
As remarked above, the vector field \(\partial _t\equiv (0,1)\) is an infinitesimal symmetry for L; moreover, since by hypotheses \(L_0\), \(\omega \) and \(\beta \) are of class \(C^1\), \(L \in C^1(TM)\) as well. A direct computation shows that
which is an affine function on TM that we denote by N. Let \(Q:=2(\omega -\beta \textrm{d}t)\), hence
and thus the conditions in Assumption 2.2 are satisfied. Using (2.7) with (3.6), we see that \(L_c:TM \rightarrow \mathbb {R}\) is given by
Let us show that \(L_c\) satisfies (2.11), (2.12) and (2.13). By (3.2) we have
so setting
(2.11) holds. Let us compute \(\partial _{(x,t)}L_c\):
Hence,
By (3.3) and recalling that \(\Vert (\nu ,\tau )\Vert ^2 = \Vert \nu \Vert _S^2 + |\tau |^2\), we infer the existence of a function \(C:M \rightarrow (0,+\infty )\) such that (2.12) holds. Similarly, using (3.4) we obtain (2.13).
Let us show that \(L_c\) satisfies (2.14). From (3.7) we have
hence using that \(L_0\) is pointwise strongly convex we get
thus (2.14) holds by taking
It remains to prove Assumption 2.9. Of course, (2.19) is trivially satisfied and if \(\beta \ge b > 0\) then by (3.6) we obtain (2.18). By (3.8), we also have (2.16) with \(c_1=\min \{\ell _1,b\}\). As \(L\big ((x,t), 0\big )=L_0(x,0)\) and \(\partial _{(\nu ,\tau )}L\big ((x,t),0\big )=\partial _\nu L_0(x,0)\), (2.17) is satisfied as well with \(c_2=\ell _2\) and \(c_3=\ell _3\). \(\square \)
Remark 4.1
A special case of a Lagrangian in Example 3.1 that satisfies our assumptions is given by (3.1) with
where \(V:S \rightarrow \mathbb {R}\) is a \(C^1\) function bounded from below, \(\omega _0\) is a \(C^1\) one-form on S, such that \(\sup _{x\in S}\Vert (\omega _0)_x\Vert _S<+\infty \), and \(F:TS \rightarrow [0,+\infty )\) is a \(C^1\) Finsler metric on S, i.e it is a non-negative, \(C^1\) Lagrangian on TS, positively homogeneous of degree 1 w.r.t. \(\nu \), such that \(F^2\) is pointwise strongly convex, i.e. it satisfies (2.14) on TS. We remark that usually in the definition of a Finsler metric it is assumed that \(F^2\in C^2(TS{\setminus } 0)\) (where 0 denotes the zero section of TS) and its vertical Hessian, the so-called fundamental tensor \(g_F\),
for all \((x,v)\in TM{\setminus } 0\) and all \(u,w \in T_xM\), is assumed to be positively homogeneous of degree 0 in v and positive definite for all \((x,v)\in TM\setminus 0\) (see, e.g., [5]). Inequality (2.14) for \(F^2\) on TS follows by the mean value theorem applied to the function \(\nu \in TS\mapsto \partial _vF^2(x,v)[\nu _2-\nu _1]\), when \(\nu _1, \nu _2\) are not collinear vectors with opposite directions or when one of them is 0; for collinear vectors with opposite directions it follows by a continuity argument. Notice that \(\lambda _0(x)\) in (2.14) for \(F^2\) is then equal to
and that (3.2)–(3.4) are ensured by the homogeneity of degree 2 of \(F^2\) w.r.t. \(\nu \).
We notice that Lagrangians L satisfying Assumptions 2.2–2.7 are generated by Lagrangians \(L_b\) satisfying (2.11)–(2.14) and admitting a vector field K as an infinitesimal symmetry with affine Noether charge \(N_b=Q_b+d\) such that \(Q_b(K)>0\). Indeed, arguing as in Proposition 2.5-(iv), \(L:=L_b-\frac{Q_b^2}{Q_b(K)}\) admits K as infinitesimal symmetry and its Noether charge is
hence \(L_c=L_b\) and then, of course, \(L_c\) satisfies Assumption 2.7. This observation gives rise to the following families of examples.
Example 3.4
Let M be a smooth \((m+1)\)-dimensional manifold endowed with a complete Riemannian metric g and F, \(\omega _0\), V be respectively a Finsler metric, a one-form and function on M, all of \(C^1\) class and invariant by the flow of a nowhere vanishing vector field K on M. Let us assume that the Noether charge associated with \(F^2\) and K is a one-form of class \(C^1\). Let \(L:TM\rightarrow {\mathbb {R}}\) be given by
and \(\lambda (x)\) be the positive continuous function in (2.14) for \(F^2\) on TM.
Proposition 3.7
Assume that there exist two constants \(c_1>0\) and \(a\ge 0\) such that \(\lambda (x)>c_1\), and \(\Vert (\omega _0)_x\Vert \le a\), for all \(x\in M\), V is bounded from below, \(\inf _{x\in M}\Vert K_x\Vert >0\) and \(\sup _{x\in M}\Vert K_x\Vert <+\infty \). Then L in (3.9) satisfies Assumptions 2.2–2.9.
Proof
Set \(L_b=F^2+\omega _0+V\), then \(L_b\) admits K as an infinitesimal symmetry with affine Noether charge
Hence, the same holds for L and the one-form appearing in its Noether charge is \(Q=-Q_F\). Since \(\omega _0\) is invariant by the flow of K, the Lie derivative \({\mathcal {L}}_K\omega _0\) vanishes. In particular, \(0={\mathcal {L}}_K\omega _0(K)=K\big (\omega _0(K)\big )\), i.e. \(\omega _0(K)\) is invariant by the flow of K. We also notice that \((Q_F)_x(K)=2F^2(x,K)>0\) for all \(x\in M\) since, by assumption, \(K_x\ne 0\) for all \(x\in M\). Thus L satisfies Assumption 2.2. As \(L_c=L_b\), it satisfies (2.11)–(2.13) because \(F^2\) is positively homogeneous of degree two; moreover, it satisfies also (2.14) as \(F^2\) is pointwise strongly convex. Since \(L(x,0)=V(x)\), and \(\partial _vL(x,0)=\omega _0\), (2.17) holds; being
and \(d = \omega _0(K)\), (2.18) and (2.19) hold as well. \(\square \)
The next class of examples involves Lorentz–Finsler metrics \(L_F\) as defined by J. K. Beem in [7] (see also [29, 40, 44]).
Example 3.6
Let M be a smooth manifold of dimension \(m+1\), and g an auxiliary complete Riemannian metric on M. Let \(L_F:TM\rightarrow {\mathbb {R}}\) be a Lagrangian which satisfies the following conditions:
-
(i)
\(L_F \in C^1(TM) \cap C^2(TM\setminus 0)\), where 0 denotes the zero section of TM;
-
(ii)
\(L_F(x,\lambda v)=\lambda ^2 L_F(x,v)\) for all \(v \in TM\) and all \(\lambda >0\);
-
(iii)
for any \((x,v) \in TM \setminus 0\), the vertical Hessian of \(L_F\), i.e. the symmetric matrix
$$\begin{aligned} (g_F)_{\alpha \beta }(x,v):= \frac{\partial ^2 L_F}{\partial v^\alpha \partial v^\beta }(x,v), \quad \alpha ,\beta =0,1,\ldots ,m, \end{aligned}$$is non-degenerate with index 1.
Let us assume that \(L_F\) admits a nowhere vanishing vector field K as an infinitesimal symmetry and that its Noether charge is equal to \(N_{L_F} = Q_{L_F}\), where \(Q_{L_F}\) is a \(C^1\) one-form such that \(Q_{L_F}(K) < 0\). Let \(L=L_F+\omega _1+V\) where \(\omega _1\) and V are, respectively, a \(C^1\) one-form on M, such that \(\sup _{x\in M}\Vert (\omega _1)_x\Vert <+\infty \), and a \(C^1\) function, bounded from below on M. We assume that both \(\omega _1\) and V are invariant by the flow of K. Then the Noether charge of L is \(N=N_{L_F}+\omega _1(K)=Q_{L_F}+d_{L_F}+\omega _1(K)\). Thus, L satisfies Assumption 2.2. Let \(Q:=Q_{L_F}\); so \(L_c\) is equal to \(L_c= L-\frac{Q^2}{Q(K)}\).
Proposition 3.10
If conditions (i)–(iii) of Example 3.6 hold, then \(L_c\) satisfies Assumption 2.7.
Proof
Let us show that \(L_c\) admits vertical Hessian at any \((x,v)\in TM{\setminus } 0\), which is a positive definite bilinear form on \(T_xM\). We observe that for any \((x,v)\in TM\setminus 0\), we have
As for each \(u\in T_x M\) we have
from (2.2) and (3.10), we get \(\partial _{vv}L_c(x,v)[K,K]=-Q(K)>0\). Let \(w\in \ker Q\). From (3.11), we have \(\partial _{vv}L(x,v)[w,K]=0\), and since \(\partial _{vv}L(x,v)\) has index 1, we also have
hence \(\partial _{vv}L_c(x,v)[\cdot ,\cdot ]\) is positive definite. Reasoning as in the last part of Remark 3.3, we deduce that (2.14) holds. From (3.10), since \(\partial _{vv}L_F(x,v)-\frac{2}{Q(K)}Q\otimes Q\) is continuous on \(TM\setminus 0\) and positively homogeneous of degree 0 in v, we deduce (2.11)
and
where
Up to redefine C(x), (2.12) and (2.13) can be obtained analogously. \(\square \)
In particular, Lagrangians in Example 3.6 include the class of \(C^2\) stationary Lorentzian metrics. We also want to consider the \(C^1\) case.
Example 3.8
Let \((M,g_L)\) be a Lorentzian manifold of dimension \(m+1\) with \(C^1\) metric tensor \(g_L\). Let K be a timelike Killing vector field for \(g_L\), i.e. K is a Killing vector field such that \(g_L(K,K)<0\). Then \((M,g_L)\) is called a stationary Lorentzian manifold. Let \(L(x,v):=g_L(v,v)\); we notice that \(L\in C^1(TM)\) and \(\partial _vL(x,v)[K]=(g_L)_x(\cdot ,K)\), thus \(Q(K)=2g_L(K,K)<0\). The Lagrangian \(L_c\) is equal to
and then it is equal to the square of the norm of a Riemannian metric \(g_R\) (as in [31]). Thus, Assumption 2.7 is satisfied as well (by using the same metric \(g_R\) as auxiliary Riemannian metric g), provided that \(g_R\) is complete with
Finally, Assumption 2.9 is satisfied provided that there exists a constant \(k_1\) such that \(-g_L(K,K)\ge k_1>0\).
The following example of Lagrangians are the Lorentz–Finsler Lagrangians studied in [35] and they can be included in the class of Example 3.6 (see Proposition 3.10).
Example 3.9
Let M be a smooth manifold and
where F and \(\omega \) are, respectively, a Finsler metric of class \(C^1(TM) \cap C^2(TM\setminus 0)\) and a one-form of class \(C^1\) on M, both invariant by the flow of a nowhere vanishing vector field K and such that the Noether charge \(N_F\) associated with \(F^2\) and K is a \(C^1\) one-form, \(N_F = Q_F\). Then, \(L_F\) admits K as an infinitesimal symmetry and \(N_{L_F} = Q_F - 2\omega (K)\omega \). Notice that \((Q_{L_F})_x(K)=2\big (F^2(x, K)-\omega ^2(K)\big )\).
Proposition 4.2
Assume that \(Q_{L_F}(K)<0\), then \(\omega _x(K)\ne 0\) for all \(x\in M\) and \(L_F\) in (3.12) is a Lagrangian of the type in Example 3.6.
Proof
The non-trivial part of the statement is to prove (iii) in Example 3.6. For all \((x,v)\in TM\setminus 0\) we have:
thus in particular we get that \(\omega _x(K)\ne 0\), for all \(x\in M\). Moreover for all \(w\in \ker (\omega _x)\), \(w \ne 0\), we have
Thus, being K transversal to \(\ker (\omega )\), we deduce that \(\partial _{vv}L_F(x,v)\) has index 1 for all \((x,v)\in TM{\setminus } 0\). \(\square \)
4 The reduced manifold of paths and action
Let \(L:TM\rightarrow {\mathbb {R}}\) be a Lagrangian satisfying Assumptions 2.2 and 2.7. Recalling that M is endowed with an auxiliary complete Riemannian metric g, let us consider the set
and, for any two fixed points \(p,q\in M\), its subset
It is well known that since (M, g) is complete, \(W^{1,2}([0,1],M)\) is a smooth, infinite dimensional, complete Riemannian manifold and \(\Omega _{p,q}^{1,2}\) is a smooth closed (hence complete) submanifold (see, e.g., [25, Lemma 6.2]). For every \(z \in \Omega _{p,q}^{1,2}\), the tangent space \(T_z\Omega ^{1,2}_{p,q}\) is equal to
Weak solutions of (2.6) connecting the points p, \(q\in M\) are by definition the critical points of the action functional \({\mathcal {A}}:\Omega _{p,q}^{1,2} \rightarrow \mathbb {R}\), defined as
Remark 4.6
From (2.7), we have that \(L=L_c+Q^2/Q(K)\) and hence from Assumption 2.7 we get that \(\mathcal {A}\) is \(C^1\) on \(\Omega _{p,q}^{1,2}\) (see, e.g., the first part of the proof of Proposition 3.1 in [3]), with differential \(\text {d}{\mathcal {A}}(z):T_z\Omega _{p,q}^{1,2} \rightarrow \mathbb {R}\) at a curve \(z\in \Omega _{p,q}^{1,2}\) equal to
Let \(\xi \in T_z\Omega _{p,q}\) such that \(\xi =X\circ z\), with X a smooth vector field in M, then in natural coordinates \((x^0,\dots ,x^m,v^{0},\dots ,v^m)\), of TM, the integrand function in (4.1) is given by
In the following, by an abuse of notation, we also denote by \(\dot{X}\) the derivative of X(z), i.e. \(\dfrac{\partial X^i}{\partial x^h}(z)\dot{z}^h\).
From (2.5) we then get
for all \(z \in \Omega _{p,q}^{1,2}\).
The main goal of this section is to prove that the critical points of \({\mathcal {A}}\) lay on the following subset of \(\Omega _{p,q}^{1,2}\):
For every \(z \in \Omega _{p,q}^{1,2}\), let us define
Proposition 4.3
Proof
For all \(\xi \in {\mathcal {W}}_z\), from (4.2) we have
As a consequence, \(\text {d}{\mathcal {A}}(z)[\xi ] = 0\) for all \(\xi \in \mathcal {W}_z\) if and only if
namely if and only if \( \partial _v L(z,{\dot{z}})[K]=N(z,\dot{z}) \) is constant a.e. on [0, 1]. \(\square \)
Proposition 5.2
The set \(\mathcal {N}_{p,q}\) is a \(C^1\) closed submanifold of \(\Omega _{p,q}^{1,2}\). Moreover, for every \(z \in \mathcal {N}_{p,q}\), the tangent space \(T_z\mathcal {N}_{p,q}\) is given by
Proof
Let \(F:\Omega _{p,q}^{1,2}\rightarrow L^{2}([0,1],\mathbb {R})\) be defined as
and \(\mathcal {C}\subset L^2([0,1],\mathbb {R})\) be defined as
By the definition of \(\mathcal {N}_{p,q}\) given in (4.3), we have
The map F is \(C^1\) and its differential is
By [37, Proposition 3, p. 28], it is enough to show that for all \(z \in \mathcal {N}_{p,q}\) and \(h \in L^2([0,1],\mathbb {R})\) there exist \(\xi \in T_z\Omega _{p,q}^{1,2}\) and \(c \in \mathbb {R}\) such that
Therefore, let us fix \(z \in \mathcal {N}_{p,q}\) and \(h \in L^2([0,1],\mathbb {R})\). Let us consider \(\xi \in \mathcal {W}_z \subset T_z\Omega _{p,q}^{1,2}\), so there exists \(\mu \in W^{1,2}_0([0,1],\mathbb {R})\) such that \(\xi (s) = \mu (s)K(z(s))\). By (4.5), recalling that d is invariant by the flow of K and then \(\textrm{d}d(K)=0\), we obtain
Using (4.7) and recalling that by Assumption 2.2-(iii), \(Q_x(K) \ne 0\) for all \(x\in M\), (4.6) becomes an ODE in normal form with respect to \(\mu \), namely
where
Setting \( A(s)=\int _{0}^{s}a(\tau )\text {d}\tau , \) and
a solution of (4.8) which satisfies the boundary conditions \(\mu (0) = \mu (1) = 0\) is given by
Thus, for every \(z \in \mathcal {N}_{p,q}\) and \(h\in L^{2}([0,1],\mathbb {R})\), there exist \(\xi \in T_z\Omega _{p,q}^{1,2}\) and \(c \in \mathbb {R}\) such that (4.6) holds, hence \(\mathcal {N}_{p,q}\) is a \(C^1\) submanifold of \(\Omega ^{1,2}_{p,q}\).
By the previous part of the proof, for all \(z\in {\mathcal {N}}_{p,q}\), \(T_z\mathcal {N}_{p,q}\) is identified with the set of all \(\zeta \) such that \(\textrm{d}F(z)[\zeta ] \in T_{F(z)}\mathcal {C}\). Then, (4.4) follows from (4.5) and the fact that \(T_{F(z)}\mathcal {C}\) is identified with the set of constant functions on [0, 1].
It remains to show that \(\mathcal {N}_{p,q}\) is closed. Let \((z_n)_n \subset \mathcal {N}_{p,q} \subset \Omega ^{1,2}_{p,q}\) be a sequence converging to \(z\in \Omega ^{1,2}_{p,q}\). Up to considering a subsequence, we have that \(N(z_n,{\dot{z}}_n)\) converges pointwise to \(N(z,{\dot{z}})\), so \(N(z,{\dot{z}})\) is constant a.e. on [0, 1], i.e. \(z \in \mathcal {N}_{p,q}\). \(\square \)
Lemma 4.4
For each \(z\in {\mathcal {N}}_{p,q}\), \(T_z\Omega _{p,q}^{1,2} = \mathcal {W}_z \oplus T_z\mathcal {N}_{p,q}\).
Proof
It is enough to show that for each \(\zeta \in T_z\Omega ^{1,2}_{p,q}\) there exists \(\mu \in W^{1,2}_0([0,1],\mathbb {R})\) such that
By (4.4), this amounts to prove that there exist \(\mu \in W^{1,2}_0([0,1],\mathbb {R})\) and a constant \(c\in {\mathbb {R}}\) such that
which is equivalent to
a.e. on [0, 1]. Arguing as in the proof of Proposition 4.3, we see that (4.9) admits a solution \(\mu \in W^{1,2}_0([0,1],\mathbb {R})\) for a certain constant c, and we are done. \(\square \)
Definition 4.5
The reduced action functional \({\mathcal {J}}\) is the restriction of the functional \({\mathcal {A}}\) to the manifold \(\mathcal {N}_{p,q}\), i.e. \({\mathcal {J}}:\mathcal {N}_{p,q} \rightarrow \mathbb {R}\), \({\mathcal {J}} = {\mathcal {A}} \big |_{\mathcal {N}_{p,q}}\).
Remark 5.5
Being \(\mathcal {A}\in C^1(\Omega ^{1,2}_{p,q})\), we get that \(\mathcal {J}\) is \(C^1\) on \({\mathcal {N}}_{p,q}\) as well.
Theorem 4.7
A curve \(z \in \Omega _{p,q}^{1,2}\) is a critical point for \(\mathcal {A}\) if and only if \(z \in \mathcal {N}_{p,q}\) and z is a critical point for \(\mathcal {J}\).
Proof
Let us assume that z is a critical point for \(\mathcal {A}\). Then \(\text {d}\mathcal {A}(z)[\xi ] = 0\) for all \(\xi \in \mathcal {W}_z \subset T_z\Omega ^{1,2}_{p,q}\) and by Proposition 4.2 we have \(z \in \mathcal {N}_{p,q}\). Since \(T_z\mathcal {N}_{p,q}\subset T_z\Omega ^{1,2}_{p,q}\),
so z is a critical point for \(\mathcal {J}\).
Now, let us assume that \(z\in \mathcal {N}_{p,q}\) and z is a critical point for \(\mathcal {J}\). By Lemma 4.4, for every \(\zeta \in T_z\Omega _{p,q}^{1,2}\) there exist \(\xi \in T_z\mathcal {N}_{p,q}\) and \(\psi \in \mathcal {W}_z\) such that \(\zeta = \psi + \xi \). By Proposition 4.2, we have \(\text {d}\mathcal {A}(z)[\psi ] = 0\), while \(\text {d}\mathcal {A}(z)[\xi ] = \text {d}\mathcal {J}(z)[\xi ] = 0\) because z is a critical point for \(\mathcal {J}\). Therefore, \(\text {d}\mathcal {A}(z)[\zeta ] = \text {d}\mathcal {A}(z)[\psi ] + \text {d}\mathcal {A}(z)[\xi ] = 0\), namely z is a critical point for \(\mathcal {A}\). \(\square \)
5 Lower boundedness and Palais-Smale condition for the reduced action
Let us give a condition on the manifold \({\mathcal {N}}_{p,q}\) implying that \(\mathcal {J}\) is bounded from below and satisfies the Palais-Smale condition. For every \(c\in \mathbb {R}\), we denote by \(\mathcal {J}^c\) the sublevel of \(\mathcal {J}\), namely
Definition 5.1
We say that \({\mathcal {N}}_{p,q}\) is c-bounded if \(\mathcal {J}^c\ne \emptyset \) and
Proposition 6.4
Under Assumptions 2.2–2.9, let \(c\in {\mathbb {R}}\) such that \({\mathcal {N}}_{p,q}\) is c-bounded. Then, \(\mathcal {J}\) is bounded from below.
Proof
By (2.7) and (2.15), we obtain
Since \(\mathcal {N}_{p,q}\) is c-bounded and using (2.19), for every \(z \in \mathcal {J}^c\) we have
thus, using (2.17) and (2.18) we have
and the thesis follows. \(\square \)
We show now that c-boundedness and Assumptions 2.2–2.9 imply a compactness condition for the sublevels of \(\mathcal {J}\).
Lemma 5.3
Let \(c \in \mathbb {R}\) be such that \(\mathcal {N}_{p,q}\) is c-bounded. If Assumptions 2.2–2.9 hold, then every sequence \((z_n)_n \subset \mathcal {J}^c\) admits a uniformly convergent subsequence.
Proof
From (5.1) and Assumption 2.9, if \(\mathcal {N}_{p,q}\) is c-bounded we have
hence the sequence \(\Vert {\dot{z}}_n\Vert \) is bounded in \(L^2([0,1])\). Then, denoting by \(d_g\) the distance induced by the metric g, by the Cauchy-Schwarz inequality we have
for all \(0\le s_1 \le s_2\le 1\). Thus, \((z_n)\) is uniformly bounded and uniformly equicontinuous and, being (M, g) complete, by the Ascoli-Arzelà theorem there exists a uniformly convergent subsequence. \(\square \)
Definition 5.4
A sequence \((z_n)_n\subset \mathcal {J}^c\) is said a Palais-Smale sequence for \(\mathcal {J}\) if \(\text {d}\mathcal {J}(z_n)\rightarrow 0\) strongly. We say that \(\mathcal {J}\) satisfies the Palais-Smale condition on \(\mathcal {J}^c\) if every Palais-Smale sequence \((z_n)_n\subset \mathcal {J}^c\) admits a strongly converging subsequence.
Remark 5.8
We point out that if Assumptions 2.2–2.9 hold and \(\mathcal {N}_{p,q}\) is c-bounded, then \(\mathcal {J}\) is bounded on any sequence \((z_n)\subset \mathcal {J}^c\) by Proposition 5.2, as it is required in the usual definition of the Palais-Smale condition.
Theorem 5.6
Under Assumptions 2.2–2.9, assume also that \(\mathcal {N}_{p,q}\) is c-bounded. Then \(\mathcal {J}\) satisfies the Palais-Smale condition on \(\mathcal {J}^c\).
Proof
Let \((z_n)_{n}\subset \mathcal {J}^c\) be a Palais-Smale sequence for \(\mathcal {J}\). By Lemma 5.3, there exists a subsequence, still denoted by \((z_n)_n\), which uniformly converges to a continuous curve \(z:[0,1]\rightarrow M\) such that \(z(0)=p\) and \(z(1)=q\).
Let us now notice that by Lemma 4.4, and taking into account that the supports of the curves \(z_n\) are in a compact subset of M, if \(\zeta _n\in T_{z_n}\Omega ^{1,2}_{p,q}\) is bounded in \(H^1\) norm then there exist two bounded sequences \(\xi _n \in T_{z_n}\mathcal {N}_{p,q}\) and \(\mu _n \in H^1_0([0,1],\mathbb {R})\) such that \(\zeta _n = \xi _n + \mu _n K_{z_n}\). By Proposition 4.2 and since \(z_n\) is a Palais-Smale sequence, we obtain
We now apply a localization argument as in [2, Appendix A]; thus, we can assume that the Lagrangian L is defined on \([0,1]\times U \times {\mathbb {R}}^{m+1}\), with U an open neighborhood of 0 in \({\mathbb {R}}^{m+1}\). Moreover, we can identify \((z_n)_n\) with a sequence in the Sobolev space \(H^1([0,1],U)\). By Lemma 5.3, taking into account that the curves \(z_n\) have fixed end-points, we get that \((z_n)_n\) is bounded in \(H^1([0,1],U)\) and so it admits a subsequence, still denoted by \((z_n)\), which weakly and uniformly converges to a curve \(z\in H^1([0,1],{\mathbb {R}}^{m+1})\) which also satisfies the same fixed end-points boundary conditions. Thus, being \(z_n-z\) bounded in \(H^1\), we have \(\text {d}\mathcal {A}(z_n)[z_n-z]\rightarrow 0\), i.e.
where \(\Lambda (x):=-Q_x(K)\). From (2.12),
thus, recalling that C is continuous and \(z_n-z\) uniformly converges to 0, the first integral in the above expression converges to 0. Since the sequence \(Q(\dot{z}_n )\) is uniformly bounded on [0, 1] (recall (5.2)) and, from (2.18), \(0<1/\Lambda (z_n)<1/k_1\), the third term above converges to 0 because \(\dot{z}_n\) is bounded in \(L^1\) and \(z_n-z\rightarrow 0\) uniformly. Analogously the fourth term goes to 0 since \(z_n\) converges uniformly to z and \(\dot{z}_n-\dot{z}\rightarrow 0\) weakly in \(H^1\). For estimating the fifth term, taking into account that \(Q^2(\dot{z}_n)\) is uniformly bounded on [0, 1], we observe that \(\text {d}\Lambda (z_n)\rightarrow \text {d}\Lambda (z)\) in operator norm and then
and both the above integrals goes to 0, because \(\dot{z}_n-\dot{z}\), in the first one, is bounded in \(L^1\) and, in the second one, weakly converges to 0 in \(H^1\). Thus, we have obtained that
Using that \(z_n\) pointwise converges to z and \(\dot{z}_n\) is bounded in \(L^1\), from (2.13) and Lebesgue’s dominated convergence theorem we get
As \(\dot{z}_n-\dot{z}\rightarrow 0\) weakly in \(H^1\), also \(\int _0^1\partial _v L_c(z,\dot{z})[\dot{z}_n-\dot{z}]\textrm{d}s\rightarrow 0\), and then from the above limit
From (2.16), (5.3) and (5.4) we then get
which implies that \(z_n\rightarrow z\) strongly in \(H^1\). Moreover, there exists a subsequence such that \(\dot{z}_n(s)\rightarrow \dot{z}(s)\) a.e. on [0, 1] and then
so that also \(N(z,\dot{z})\) is constant a.e. on [0, 1], i.e. \(z\in {\mathcal {N}}_{p,q}\) as required. \(\square \)
From Propositions 5.2 and Theorem 5.6, \(\mathcal {J}\) is bounded from below and satisfies the Palais-Smale condition on \(\mathcal {J}^c\). Since \({\mathcal {N}}_{p,q}\) is only a \(C^1\) submanifold of \(\Omega _{p,q}^{1,2}\) (recall Proposition 4.3) then the exponential map of its infinite dimensional Riemannian structure is not well-defined, and we cannot invoke Ekeland’s variational principle to conclude that a minimizer of \(\mathcal {J}\) exists (see [25, Proposition 5.1]). Anyway, from [48, Theorem 3.1] (which, nevertheless, is based on Ekeland’s variational principle) or as a straightforward consequence of the noncritical interval theorem (see [22, Theorem (2.15)]), we actually get the existence of a minimizer of \(\mathcal {J}\). Summing up, we have the following result:
Theorem 5.7
Let \(L:TM \rightarrow \mathbb {R}\) be an indefinite Lagrangian satisfying Assumptions 2.2–2.9. Assume also that \(\mathcal {N}_{p,q}\) is c-bounded, for some \(c\in {\mathbb {R}}\). Then there exists a curve \(z \in {\mathcal {N}}_{p,q}\) which minimizes \(\mathcal {J}\) and it is then a critical point of \(\mathcal {A}\) on \(\Omega _{p,q}^{1,2}\).
Remark 6.2
The critical points of \(\mathcal {A}\) on \(\Omega _{p,q}^{1,2}\), whose existence is ensured by Theorem 5.7, satisfy the Euler-Lagrange equation (2.6) in weak sense. We will show in Appendix A that they also satisfy it in classical sense.
6 Multiplicity of critical points
In this section we obtain a multiplicity result for critical points of the functional \(\mathcal {A}\) by using Ljusternik-Schnirelmann theory, provided that M is a not contractible. Let us recall the definition of Ljusternik-Schnirelmann category. Let A be a non-empty subset of a topological space B; the Lusternik-Schnirelman category of a A, denoted by \(\textrm{cat}_B(A)\), is the least integer n such that A can be covered n closed contractible (in B) subsets of B. If no such a number exists then \(\text {cat}_B(A)=+\infty \). If \(A = \emptyset \), we set \(\text {cat}_B{A} = 0\). We denote \(\text {cat}_B(B)\) with \(\text {cat}(B)\).
By [26, Proposition 3.2], we know that if M is a non-contractible manifold then \(\text {cat}(\Omega _{p,q}^{1,2}) = + \infty \). This fact can be exploited together with the following proposition, which is a straightforward corollary of [22, Theorem (3.6)] and allows to prove the multiplicity of critical points for a functional of class \(C^1\) defined on a manifold with the same regularity, as it is in our setting.
Theorem 6.1
(Corvellec-Degiovanni-Marzocchi) Let \({\mathcal {M}}\) be a (possibly infinite dimensional) \(C^1\) Riemannian manifold and \(f:{\mathcal {M}} \rightarrow {\mathbb {R}}\) be a bounded from below \(C^1\) functional satisfying the Palais-Smale condition.
Then f has at least \(\text {cat}(\mathcal {M})\) critical points. Moreover, if \(\text {cat}(\mathcal {M})=+\infty \) then \(\sup f=+\infty \) and there exists a sequence \((c_m)_m\) of critical values such that \(c_m\rightarrow +\infty \).
Remark 7.5
Actually [22, Theorem (3.6)] is stated for a continuous functional f on a complete metric space X with a critical point defined by using the notion of weak slope introduced in [24]. Points with vanishing weak slope are standard critical points if f is a \(C^1\) functional on a Riemannian manifold. The metric space must also be weakly locally contractible, meaning that each \(x\in X\) admits a neighborhood contractible in X. Notice that if X is weakly locally contractible then, for each \(x\in X\), \(\text {cat}_X(\{x\})=1\). A \(C^1\) Riemannian manifold is clearly weakly locally contractible (it is enough to take a small neighborhood of x diffeomorphic to a ball in the model Hilbert space). Thus, for example, both \(\Omega _{p,q}^{1,2}\) and \({\mathcal {N}}_{p,q}\) are weakly locally contractible, the latter a fortiori being also a strong deformation retract of \(\Omega _{p,q}^{1,2}\) if K is complete (see Proposition 6.4). Finally, we notice that in [22] the definition of Ljusternik-Schnirelman category is given with open coverings instead of closed one. This is equivalent to the definition with closed coverings in every ANR space; since metrizable manifolds are ANR (see [43, Theorem 5]), the two definitions are then equivalent for \({\mathcal {N}}_{p,q}\).
Let us now state the main result of this section.
Theorem 6.3
Let M be a non-contractible manifold and \(L:TM \rightarrow \mathbb {R}\) a Lagrangian that satisfies Assumptions 2.2–2.9. If K is a complete vector field and \(\mathcal {N}_{p,q}\) is c-bounded for all \(c \in \mathbb {R}\), then there exists a sequence \((z_n)_{n\in \mathbb {N}}\subset \Omega _{p,q}^{1,2}\) of critical points of \(\mathcal {A}\) such that
Like in the existence result given in Theorem 5.7, we cannot work directly on \(\Omega _{p,q}^{1,2}\) to prove Theorem 6.3, where \(\mathcal {A}\) is not bounded from below and does not satisfy the Palais-Smale condition, but we have to restrict our analysis on \(\mathcal {N}_{p,q}\).
Let us first show that when K is complete then \({\mathcal {N}}_{p,q}\) is a strong deformation retract of \(\Omega ^{1,2}_{p,q}\) (so that the Ljusternik-Schnirelmann category is preserved), namely there exists a homotopy \(H:\Omega _{p,q}^{1,2}\times [0,1] \rightarrow \Omega _{p,q}^{1,2}\) such that, for all \(z \in \Omega _{p,q}^{1,2}\), \(w \in \mathcal {N}_{p,q}\) and \(t \in [0,1]\), we have \(H(z,0) = z\), \(H(z,1)\in \mathcal {N}_{p,q}\) and \(H(w,t) = w\). Next proposition extends [31, Proposition 5.9] from stationary Lorentzian manifold to our setting.
Proposition 7.2
Assume that K is a complete vector field, then \(\mathcal {N}_{p,q}\) is a strong deformation retract of \(\Omega _{p,q}^{1,2}\).
In the proof of Proposition 6.4, it will be useful the following preliminary result.
Lemma 6.5
Let the vector field K be complete and let \(\psi :\mathbb {R}\times M \rightarrow M \) be its flow. Then, for every \(z \in \Omega ^{1,2}_{p,q}\) there exists a uniquely defined function \(\phi \in H^{1}_0([0,1], {\mathbb {R}})\) such that
Moreover, defining \(\Psi :\Omega _{p,q}^{1,2} \rightarrow \mathcal {N}_{p,q}\) as
the function \(\Psi \) is \(C^1\).
Proof
Let \(z\in \Omega _{p,q}^{1,2}\) and, for each \(\phi \in H^{1}_0([0,1], {\mathbb {R}})\), let us denote by \(w:[0,1] \rightarrow M\) the curve
We want to find \(\phi \in H^{1}_0([0,1], {\mathbb {R}})\) such that \(w \in \mathcal {N}_{p,q}\), hence \(w(0) = p\), \(w(1) = q\) and
for some constant \(C \in \mathbb {R}\). By differentiating (6.2), we get
Substituting this expression in (6.3) and recalling that \(N=Q+d\), we get
which, for each \(z\in \Omega ^{1,2}_{p,q}\), can be seen as a differential equation for \(\phi \).
Let us rewrite (6.4) in order to get a simpler equation. Since \(\psi =\psi (t,x)\) is the flow generated by K, we have
Using the group property \(\psi (t_1,\psi (t_2,x)) = \psi (t_1 + t_2,x)\), and (6.5) we also obtain
Moreover, recalling (2.4), for every \(v \in T_{z(s)}M\) we have
thus
By (2.1) and (6.6), the last equality becomes
Substituting v with \(\dot{z}\) in (6.7), we get
Recalling that Q(K) is invariant by the flow (see the proof of Proposition 2.5-(iv)), by (6.5) we have
Thus, from (6.8) and (6.9), (6.4) becomes
and since by Assumption 2.2, Q(K(z)) is different from 0, we get
Hence, \(\phi \) can be obtained as the solution of (6.11) with initial condition \(\phi (0) = 0\) and, by setting \(\int _{0}^{1}\phi '(s)\text {d}s = 0\), we can ensure that \(\phi (1) = 0\) by taking
The fact that \(\Psi \) is \(C^1\) is a simple consequence of the \(C^1\)-regularity of N and (6.10). \(\square \)
Proof of Proposition 6.4
By Lemma 6.5, for every \(z \in \Omega _{p,q}^{1,2}\), we consider \(\phi \in H^{1}_0([0,1],{\mathbb {R}})\) (depending on z and univocally defined as shown in Lemma 6.5), such that (6.1) holds. Then, let us define \(H:\Omega _{p,q}^{1,2}\times [0,1] \rightarrow \Omega _{p,q}^{1,2}\) as
Notice that \(H(\cdot ,0)\) is the identity map on \(\Omega _{p,q}^{1,2}\), and \(H(\Omega _{p,q}^{1,2},1)\subset \mathcal {N}_{p,q}\). If \(w \in \mathcal {N}_{p,q}\), then \(N(w,{\dot{w}})\) is constant and recalling that \(\phi \) satisfies (6.10) with C given by (6.11), we get that the corresponding \(\phi \) is the zero function, hence \( H(w,t) = w\) for all \(t \in [0,1]\). \(\square \)
We can now prove Theorem 6.3.
Proof of Theorem 6.3
Since the Ljusternik-Schnirelmann category is a homotopy invariant, by Proposition 6.4 and [26, Proposition 3.2], we have \(\text {cat}(\mathcal {N}_{p,q}) = \text {cat}(\Omega _{p,q}^{1,2}) = + \infty \). From Theorem 5.6, \(\mathcal {J}\) satisfies the Palais-Smale condition on \({\mathcal {J}}^c\) for every \(c\in {\mathbb {R}}\) and, then, on \({\mathcal {N}}_{p,q}\) (recall Remark 5.5). Hence, by Theorem 6.1, there exists a sequence \((z_n)_n \subset \mathcal {N}_{p,q}\) of critical points of \(\mathcal {J}\) such that \(\mathcal {J}(z_n)\rightarrow +\infty \). By Theorem 4.7, every critical point of \(\mathcal {J}\) is a critical point of \(\mathcal {A}\), and \({\mathcal {A}}(z_n)=\mathcal {J}(z_n)\). \(\square \)
7 c-precompactness and c-boundedness
In light of Theorems 5.7 and 6.3, it becomes important to give conditions ensuring the c-boundedness of \({\mathcal {N}}_{p,q}\). We firstly need the following definition, introduced in [31].
Definition 7.1
Let c be a real number. The set \(\mathcal {N}_{p,q}\) is said to be c-precompact if every sequence \((z_n)_n \subset \mathcal {J}^c\) has a uniformly convergent subsequence. We say that \(\mathcal {J}\) is pseudocoercive if \({\mathcal {N}}_{p,q}\) is c-precompact for all \(c\in {\mathbb {R}}\).
We are going to show that c-boundedness and c-precompactness are essentially equivalent properties for Lagrangians admitting a local expression of “product” type (see (7.1) below). As a first step, we notice that Lemma 5.3 immediately gives one of the implications in the equivalence.
Proposition 7.4
Let Assumptions 2.2–2.9 hold. If \(\mathcal {N}_{p,q}\) is c-bounded, then it is c-precompact.
The converse implication holds if L admits a local structure of the type in (3.1), so we give the following definition.
Definition 7.3
We say that L admits a stationary product type local structure if for every point \(p \in M\) there exist an open precompact neighborhood \(U_p \subset M\) of p, a manifold with boundary \(S_p\), an open interval \(I_p = (-\epsilon _p,\epsilon _p)\subset \mathbb {R}\), and a diffeomorphism \(\phi :S_p \times I_p \rightarrow U_p\) such that, named t the natural coordinate of \(I_p\),
and for all \(\big ((x,t),(\nu ,\tau )\big ) \in T(S_p \times I_p)\) we have
where
-
\(L_0 \in C^1(TS_p)\) is a Lagrangian on \(S_p\) which satisfies (3.2)–(3.4) with respect to the norm \(\Vert \cdot \Vert _{S_p}\) of the metric induced on \(S_p\) by the auxiliary Riemannian metric on M, and it is pointwise strongly convex, i.e. it satisfies (2.14) on \(TS_p\) (with \(L_c\) replaced by \(L_0\) and \(\Vert \cdot \Vert \) by \(\Vert \cdot \Vert _{S_p}\)), for a continuous function \(\lambda :S_p\rightarrow (0,+\infty )\);
-
\(\omega \) is a \(C^1\) one-form on \(S_p\);
-
\(d:S_p \rightarrow \mathbb {R}\) is a \(C^1\) function;
-
\(\beta :S_p \rightarrow (0,+\infty )\) is a positive \(C^1\) function.
Notice that Definition 7.3 is satisfied for \(L(v)=g_L(v,v)\), where \(g_L\) is a \(C^1\) Lorentzian metric on M having a timelike Killing vector field K; in such a case \(S_p\) is a spacelike hypersurface in M, \(L_0\) is the Riemannian metric induced on it by \(g_L\), \(\beta (x)=-g_L(K_x,K_x)\) and \(\omega \) is the one-form metrically equivalent to the orthogonal projection of K on \(TS_p\) and \(d \equiv 0\) (see, e.g., [31, Appendix C]). The next result shows that it is satisfied as well by a Lagrangian fulfilling Assumptions 2.2–2.7.
Proposition 8.1
Let \(L:TM\rightarrow {\mathbb {R}}\) satisfy Assumptions 2.2 and 2.7. Then it admits a stationary product type local structure.
Proof
Let us denote by \(\mathcal {D}\) the distribution in TM generated by the kernel of Q, i.e. for all \(z\in M\), \(\mathcal {D}_z=\ker Q_z\). Notice that by (2.2), \(\mathcal {D}\) has constant rank equal to m (recall that \(\dim (M)=m+1\)). Let \( {\bar{z}}\in M\) and \(S_{{\bar{z}}}\) be a smooth hypersurface (with boundary) in M such that \( {\bar{z}}\in S_{{\bar{z}}}\) and \(T_{{\bar{z}}}S_{{\bar{z}}}= {\mathcal {D}}_{{\bar{z}}}\). We endow \(S_{{\bar{z}}}\) with the Riemannian metric induced by the auxiliary Riemannian metric g on M and let us denote its norm with \(\Vert \cdot \Vert _{S_{{\bar{z}}}}\). From (2.2), up to shrink \(S_{{\bar{z}}}\), we can assume that for all \(x\in S_{{\bar{z}}}\), \(K_x\) is transversal to \(S_{{\bar{z}}}\), i.e. \(T_x M=T_xS_{{\bar{z}}}\oplus [K_x]\). Using (2.7), we get
for all \((x,\nu )\in TS_{{\bar{z}}}\). In particular, \(\partial _{v}L({\bar{z}},\nu )=\partial _{v}L_c({\bar{z}},\nu )\) for all \(\nu \in T_{{\bar{z}}}S_{{\bar{z}}}\). Considering a smaller hypersurface \(S_{{\bar{z}}}\) such that
for all \((x,\nu _1),(x,\nu _2)\in TS_{{\bar{z}}}\) we have
Let \(L_0=L|_{TS_{{\bar{z}}}}\); the above inequality gives then (2.14) for \(L_0\) on \(TS_{{\bar{z}}}\). Since \(L_0(x, \nu )=L_c(x,\nu )+\frac{Q_x^2(\nu )}{Q_x(K_x)}\) and \(L_c\) satisfies (2.11)–(2.13), we also have that \(L_0\) satisfies (3.2)–(3.4). Let us now evaluate \(\frac{\textrm{d}}{\textrm{d}s} L(x,y+s\tau K)\), for any \(y\in T_x M\), \(x\in M\), and \(\tau \in {\mathbb {R}}\):
Hence, integrating w.r.t. s between 0 and 1 we get
Let now \(w\in T_x M\), \(x\in S_{{\bar{z}}}\), and \(w_S\in T_{x}S_{{\bar{z}}}\), \(\tau _w\in {\mathbb {R}}\) such that \(w=w_S+\tau _wK_x\). From (7.3) we get
Thus, we get an expression of the type at the right-hand side of (7.1) on \(S_{{\bar{z}}}\) by defining \(\omega \) as the one-form induced by Q/2 on \(S_{{\bar{z}}}\) and \(\beta (x):=-Q(K)/2\). Since L is invariant by the flow of \(K^c\) we then obtain (7.1) on \(S_{{\bar{z}}}\times I_{{\bar{z}}}\), for some open interval \(I_{{\bar{z}}}\) containing 0, by taking \(\phi \) as the restriction to \(S_{{\bar{z}}}\times I_{{\bar{z}}}\) of the flow \(\psi \) of K adapted to \(S_{{\bar{z}}}\), i.e. such that \(S_{{\bar{z}}}=\psi (S_{{\bar{z}}}\times \{0\})\). \(\square \)
Remark 7.7
Notice that if the distribution \(\mathcal {D}\) generated by the kernel of Q is integrable then we can take in the above proof \(S_{{\bar{z}}}\) equal to an integral manifold of \(\mathcal {D}\). In this case the local expression of L simplifies to
This can be considered as a generalization of the notion of a static Lorentzian metric to an indefinite Lagrangian admitting an infinitesimal symmetry satisfying Assumptions 2.2–2.7 (compare also with [20, 21]).
By Proposition 7.4 we obtain the following generalization of [31, Lemma 4.1].
Theorem 7.6
Let Assumptions 2.2 and 2.7 hold. If \({\mathcal {N}}_{p,q}\) is c-precompact then it is c-bounded.
Proof
Let \((z_n)_n \subset \mathcal {J}^c\) be a sequence such that
Moreover, let \((C_{z_n})_n \subset \mathbb {R}\) be the sequence of real numbers such that for all n
To obtain the thesis, it suffices to prove that \(C_{z_n}\) is bounded. Since \({\mathcal {N}}_{p,q}\) is c-precompact we can assume, up to pass to a subsequence, that \(z_n\) converges uniformly to a curve \(z\in \mathcal {J}^c\). We can then assume that there exists a finite number of neighborhoods \(U_k\), with \(k = 1,\dots ,N\), that cover z([0, 1]) such that, for some finite sequence \(0 = a_0< a_1< \dots < a_N = 1\), \(z_n([a_{k-1},a_k])\subset U_k\), for all n sufficiently large and for all \(k = 1,\dots ,N\). Moreover, by Proposition 7.4, in each domain \(U_k\) we can identify L with \(L\circ (\phi _k)_*\) so that L, evaluated along a curve \(z(s)=\big (x(s),t(s)\big )\) contained in \(U_k\), is given by
(here we are not writing the point where the one-forms \(\omega _k\) are applied). Up to replace each \(U_k\) by a precompact open subset, we can assume that
and
reminding that \(\Vert y\Vert = \sqrt{g(y,y)}\), where g is the auxiliary Riemannian metric. Analogously, we have
and there also exist two constants \(\nu ,\ \mu \) such that
In the following, we write \(L_{0,k}\), \(\omega _k\), \(d_k\) and \(\beta _k\) without the index k. In this local charts, let \(z_n(s) = \big (x_n(s),t_n(s)\big )\). As for (3.1) and (3.5), we have \(N(z_n, \dot{z}_n) = 2\omega ({\dot{x}}_n)-2\beta {\dot{t}}_n+d(x_n)\). Hence,
Defining \(T_n^k = t_n(a_k)- t_n(a_{k-1})\), we have
Therefore, the quantity
is well-defined and finite. Moreover,
From (7.8) we obtain
By (7.5), we have \(|w({\dot{x}}_n)|\le D_0 \Vert {\dot{x}}_n\Vert \). As a consequence, using also (7.6), \(|T_n^k|\le \Delta \) and (7.9), from (7.10) we have
By (7.11), to prove that \(C_{z_n}\) is bounded, and thus to prove the theorem, it suffices to show that
To this end, recall that by (7.7) we have
therefore, using also (7.10) we obtain
By the Schwartz inequality in \(L^2\), we obtain
Hence, from (7.13) we obtain
Since \(L_0\) is the Lagrangian in a stationary product type local structure, as for (2.15), we deduce that there exist two positive constants \(\ell _1,\ell _2 \in \mathbb {R}\) such that, for all the domains \(U_k\) of the charts, we have
Since \(d_k\), \(T_n^k\) and \(1/b_{n}^k\) are bounded for each k, we obtain the existence of two positive constants \(E_1,E_2\) (depending on \(\nu ,\mu ,\Delta ,D_0, D_1,\ell _2\)) such that
As a consequence, (7.12) holds and by (7.10) we conclude that \(\mathcal {N}_{p,q}\) is c-precompact. \(\square \)
Remark 7.10
If \(\mathcal {N}_{p,q}\) is c-precompact, then there exists a compact subset of M that contains the images of all curves in \(\mathcal {J}^c\). Therefore, Assumption 2.9 holds on such a compact set.
From Theorem 7.6, Remark 7.7, Theorems 5.7, 6.3 we deduce the following corollary.
Corollary 7.8
Let \(L:TM \rightarrow \mathbb {R}\) satisfy Assumptions 2.2 and 2.7. If \(\mathcal {N}_{p,q}\) is c-precompact for some \(c\in {\mathbb {R}}\) such that \(\mathcal {J}^c\ne \emptyset \), then \(\mathcal {J}^c\) is bounded from below and it admits a minimizer which is critical point of \(\mathcal {A}\).
Moreover, if \(\mathcal {J}\) is pseudocoercive, K is complete, and M is a non-contractible manifold, then \({\mathcal {N}}_{p,q}\ne \emptyset \) and there exists a sequence \((z_n)_{n\in \mathbb {N}}\subset \Omega _{p,q}^{1,2}\) of critical points of \(\mathcal {A}\) such that \(\lim _{n\rightarrow \infty }\mathcal {A}(z_n) = + \infty \).
Recalling Example 3.8, by Corollary 7.8 we then obtain the following extension of [31, Theorems 1.2 and 1.3] to \(C^1\) stationary Lorentzian manifolds.
Corollary 7.9
Let (M, g) be a Lorentzian manifold such that g is a \(C^1\) metric endowed with a timelike Killing vector field K. If \(\mathcal {N}_{p,q}\) is c-precompact for some \(c\in {\mathbb {R}}\) such that \(\mathcal {J}^c\ne \emptyset \), then there exists a geodesic connecting p to q. Moreover, if \(\mathcal {J}\) is pseudocoercive, K is complete, and M is a non-contractible manifold, then \({\mathcal {N}}_{p,q}\ne \emptyset \) and there exists a sequence of geodesics \((z_n)_{n\in \mathbb {N}}\subset \Omega _{p,q}^{1,2}\) with unbounded energy.
Remark 8.2
Apart from completeness of K, whenever \(d=0\), a condition ensuring that \({\mathcal {N}}_{p,q}\) is non-empty for all p and q in M is that the distribution \({\mathcal {D}}\) defined by the kernel of Q is not integrable through any point in M. Indeed by Chow-Rashevskii Theorem, there exists then a horizontal \(C^1\) curve \(\gamma \) connecting p to q. Hence, such curve belongs to \({\mathcal {N}}_{p,q}\) with constant \(Q({{\dot{\gamma }}})=0\). We recall that, in the case when L is the quadratic form associated with a stationary Lorentzian metric \(g_L\) with Killing vector field K, the non-integrability of \({\mathcal {D}}\) through any point is equivalent to the fact that K is not static in any region of M. Geodesic connectedness of a smooth static Lorentzian manifold was studied in [18]; we point out that, thanks to Theorem 6.1, the results in [18] can be extended to a \(C^1\) static Lorentzian metric.
8 Dynamic conditions for pseudocoercivity
Inspired by Appendix A in [31], we give some conditions that ensure that \({\mathcal {J}}\) is pseudocoercive.
Let us assume that there exists a \(C^1\) function \(\varphi :M\rightarrow {\mathbb {R}}\) which satisfies the monotonicity condition \(\textrm{d}\varphi (K)>0\).
If K is complete, this implies that M is foliated by level sets of the function \(\varphi \), and it splits as \(\Sigma \times {\mathbb {R}}\), where \(\Sigma \) is one of this level set. Notice that [31, Assumption (4.11)] implies the completeness of the timelike Killing vector field there, so the setting leading to [31, Proposition A.3] is actually analogous to ours (compare also with [15, Theorem 2.3]). Some differences, on the other hand, are that the splitting \(\Sigma \times {\mathbb {R}}\) is only \(C^1\) and there is no simple link between convexity properties of the induced Lagrangian \(L_0\) and the level set \(\Sigma \) (see Remark 8.3).
Since \(\Sigma \) is transversal to K, using Assumption 2.2 and arguing as in the proof of Proposition 7.4, we get that L is given by (3.1) in \(\Sigma \times {\mathbb {R}}\) for a \(C^1\) Lagrangian \(L_0:T\Sigma \rightarrow {\mathbb {R}}\). Let us denote by \(g_\Sigma \) the \(C^1\) Riemannian metric on \(\Sigma \) induced by g. We assume that the one-form \(\omega \) induced by Q on \(\Sigma \) has sublinear growth w.r.t. the distance \(d_{\Sigma }\) induced by \(g_\Sigma \), i.e. there exist \(\alpha \in [0,1)\) and two non-negative constants \(k_0\) and \(k_1\) such that
for some \(x_0\in \Sigma \) and all \(x\in \Sigma \). We recall that \(\beta \) in an expression like (3.1) for L is equal to \(-Q(K)/2\) (see (7.4)).
Proposition 1.50 Let L satisfy Assumption 2.2 with d in (2.1) bounded and K complete. Let \(\varphi :M\rightarrow {\mathbb {R}}\) be a \(C^1\) function such that \(\textrm{d}\varphi (K)>0\). Let \(\Sigma \) a level set of \(\varphi \) and \(L_0\) be the Lagrangian induced by L on \(\Sigma \). Assume that
-
\(L_0\) satisfies (i) and (ii) in Example 3.1 (namely it satisfies the growth conditions and the pointwise convexity) and there exist three constants \(\ell _1, \ell _2, \ell _3\) such that \(\lambda _0(x)\ge \ell _1>0\), \(L_0(x,0)\ge \ell _2\) and \(\Vert \partial _vL_0(x,0)\Vert _{\Sigma }\le \ell _3\);
-
\(\omega \) satisfies (8.1);
-
there exist two constant \(b_1\) and \(b_2\) such that \(0<b_1\le \beta (x)\le b_2\), for all \(x\in \Sigma \).
Then \(\mathcal {J}\) is pseudocoercive.
Proof
Recalling that, by Definition 7.1, \(\mathcal {J}\) is pseudocoercive if \(\mathcal {N}_{p,q}\) is c-precompact for all \(c \in \mathbb {R}\), the thesis follows from Proposition 7.2 by showing that that \({\mathcal {N}}_{p,q}\) is c-bounded for all \(c\in {\mathbb {R}}\).
Let us set \(\Delta :=t(q)-t(p)\) and let \(z_n=z_n(s)=\big (x_n(s), t_n(s)\big )\in \mathcal {J}^c\) be a sequence such that \(|N(z_n,\dot{z}_n)|\rightarrow \sup _{z\in \mathcal {J}^c} |N(z,\dot{z})|\). As for (7.14), we then get
where \(b_n=\int _0^1\frac{1}{\beta (x_n(s))}\textrm{d}s\). Then taking into account that d is bounded, \(0<b_1\le \beta (x) \le b_2\), \(\lambda _0(x)\ge \ell _1>0\), \(L_0(x,0)\ge \ell _2\) and \(\Vert \partial _vL_0(x,0)\Vert _{\Sigma }\le \ell _3\), for all \(x\in \Sigma \), using (2.15) for \(L_0\) and (8.1), we obtain from (8.2) that \(\int _0^1\Vert \dot{x}_n\Vert ^2_\Sigma \textrm{d}s\) is bounded. Analogously to (7.7) we have then
and hence \(N(z_n, \dot{z}_n)\) is bounded as well. \(\square \)
Remark 8.3
The proof of Proposition 8.1 also shows that the manifold \({\mathcal {N}}_{p,q}\) associated to the Lagrangian in Example 3.1 is c-bounded for all \(c\in {\mathbb {R}}\) provided that \(L_0\) satisfies (i) and (ii) in Example 3.1, d is bounded, \(\omega \) has sublinear growth on S (hence (8.1) holds) and there exist some constants \(b_1, b_2, \ell _1, \ell _2, \ell _3\) such that \(0<b_1\le \beta (x)\le b_2\), \(\lambda _0(x)\ge \ell _1>0\), \(L_0(x,0)\ge \ell _2\) and \(\Vert \partial _vL_0(x,0)\Vert _{\Sigma }\le \ell _3\), for all \(x\in S\).
Remark 1.52 The strong convexity condition for \(L_0\) holds if \(L_0\), satisfying (2.14) on \(T\Sigma \), satisfies also (7.2) on \(\Sigma \). This condition can be considered as a replacement of being \(\Sigma \) a spacelike and complete hypersurface when L is the quadratic form of a stationary Lorentzian manifold (in our setting the Riemannian metric on \(\Sigma \), induced by the auxiliary one g, is complete because g is complete by assumptions). Indeed, in such a case, it is enough to assume that \(\nabla \varphi \) is timelike (i.e. \(\varphi \) is a \(C^1\) time function) to get that a level set \(\Sigma \) of \(\varphi \) is spacelike. The existence of such a \(\varphi \) is guaranteed if there exists a spacelike hypersurface that intersects once every flow line of the complete timelike Killing vector field K (see [31, Appendix A]).
References
Aazami, A.B., Javaloyes, M.A.: Penrose’s singularity theorem in a Finsler spacetime. Class. Quant. Grav. 33, 025003 (2016). https://doi.org/10.1088/0264-9381/33/2/025003
Abbondandolo, A., Figalli, A.: High action orbits for Tonelli Langrangians and superlinear Hamiltonians on compact configuration spaces. J. Differ. Equ. 234, 626–653 (2007). https://doi.org/10.1016/j.jde.2006.10.015
Abbondandolo, A., Schwarz, M.: A smooth pseudo-gradient for the Lagrangian action functional. Adv. Nonlinear Stud. 9, 597–623 (2009). https://doi.org/10.1515/ans-2009-0402
Avez, A.: Essais de géométrie riemannienne hyperbolique globale. Applications à la relativité générale. Ann. Inst. Fourier 13, 105–190 (1963). https://doi.org/10.5802/aif.144
Bao, D., Chern, S.-S., Shen, Z.: An Introduction to Riemann-Finsler geometry. Graduate Texts in Mathematics. Springer, New York (2000)
Bartolo, R.: Trajectories connecting two events of a Lorentzian manifold in the presence of a vector field. J. Differ. Equ. 153, 82–95 (1999). https://doi.org/10.1006/jdeq.1998.3521
Beem, J.K.: Indefinite Finsler spaces and timelike spaces. Canad. J. Math. 22, 1035–1039 (1970). https://doi.org/10.4153/CJM-1970-119-7
Benci, V.: Periodic solutions of Lagrangian systems on a compact manifold. J. Differ. Equ. 63, 135–161 (1986). https://doi.org/10.1016/0022-0396(86)90045-8
Benci, V., Fortunato, D., Giannoni, F.: On the existence of multiple geodesics in static space-times. Ann. Inst. H. Poincaré C Anal. Non Linéaire 8, pp. 79–102 (1991). https://doi.org/10.1016/S0294-1449(16)30278-5
Bernal, A.N., Javaloyes, M.A., Sánchez, M.: Foundations of Finsler spacetimes from the observers’ viewpoint. Universe 6, 55 (2020). https://doi.org/10.3390/universe6040055
Bernard, P., Suhr, S.: Lyapounov functions of closed cone fields: from Conley theory to time functions. Commun. Math. Phys. 359, 467–498 (2018). https://doi.org/10.1007/s00220-018-3127-7
Bolza, O.: Lectures on the Calculus of Variations. University of Chicago Press, Chicago, (1904). http://www.hti.umich.edu/cgi/t/text/text-idx?c=umhistmath;idno=ACM2513
Brandt, H.E.: Finsler-spacetime tangent bundle. Found. Phys. Lett. 5, 221–248 (1992). https://doi.org/10.1007/BF00692801
Buttazzo, G., Giaquinta, M., Hildebrandt, S.: One-Dimensional Variational Problems: An Introduction. The Clarendon Press, Oxford University Press, New York (1998)
Candela, A.M., Flores, J.L., Sánchez, M.: Global hyperbolicity and Palais-Smale condition for action functionals in stationary spacetimes. Adv. Math. 218, 515–556 (2008). https://doi.org/10.1016/j.aim.2008.01.004
Caponio, E., Masiello, A.: Trajectories for relativistic particles under the action of an electromagnetic field in a stationary space-time. Nonlinear Anal. Theory Meth. Appl. 50, 71–89 (2002). https://doi.org/10.1007/978-88-470-2101-3_28
Caponio, E., Masiello, A.: On the analyticity of static solutions of a field equation in Finsler gravity. Universe 6, 59 (2020). https://doi.org/10.3390/universe6040059
Caponio, E., Masiello, A., Piccione, P.: Some global properties of static spacetimes. Math. Z. 244, 457–468 (2003). https://doi.org/10.1007/s00209-003-0488-0
Caponio, E., Masiello, A., Piccione, P.: Maslov index and Morse theory for the relativistic Lorentz force equation. Manuscripta Math. 113, 471–506 (2004). https://doi.org/10.1007/s00229-004-0441-5
Caponio, E., Stancarone, G.: Standard static Finsler spacetimes. Int. J. Geom. Methods Mod. Phys. 13, 1650040 (2016). https://doi.org/10.1142/S0219887816500407
Caponio, E., Stancarone, G.: On Finsler spacetimes with a timelike Killing vector field. Class. Quant. Grav. 35, 085007 (2018). https://doi.org/10.1088/1361-6382/aab0d9
Corvellec, J.-N., Degiovanni, M., Marzocchi, M.: Deformation properties for continuous functionals and critical point theory. Topol. Methods Nonlinear Anal. 1, 151–171 (1993). https://doi.org/10.12775/TMNA.1993.012
Crampin, M., Mestdag, T.: Routh’s procedure for non-abelian symmetry groups. J. Math. Phys. 49, 032901 (2008). https://doi.org/10.1063/1.2885077
Degiovanni, M., Marzocchi, M.: A critical point theory for nonsmooth functional. Ann. Mat. Pura Appl. 167, 73–100 (1994). https://doi.org/10.1007/BF01760329
Ekeland, I.: On the variational principle. J. Math. Anal. Appl. 47, 324–353 (1974). https://doi.org/10.1016/0022-247X(74)90025-0
Fadell, E., Husseini, S.: Category of loop spaces of open subsets in Euclidean space. Nonlinear Anal. Theory Meth. Appl. 17, 1153–1161 (1991). https://doi.org/10.1016/0362-546X(91)90234-R
Fathi, A., Siconolfi, A.: On smooth time functions. Math. Proc. Cambridge Philos. Soc. 152, 303–339 (2012). https://doi.org/10.1017/S0305004111000661
Fonseca, I., Leoni, G.: Modern Methods in Calculus of Variations. \(L^p\) Spaces. Springer New York, NY, 2007 https://doi.org/10.1007/978-0-387-69006-3
Gallego Torromé, R., Piccione, P., Vitório, H.: On Fermat’s principle for causal curves in time oriented Finsler spacetimes. J. Math. Phys. 53, 123511 (2012). https://doi.org/10.1063/1.4765066
Giannoni, F., Masiello, A.: On the existence of geodesics on stationary Lorentz manifolds with convex boundary. J. Funct. Anal. 101, 340–369 (1991). https://doi.org/10.1016/0022-1236(91)90162-X
Giannoni, F., Piccione, P.: An intrinsic approach to the geodesical connectedness of stationary Lorentzian manifolds. Commun. Anal. Geom. 7, 157–197 (1999). https://doi.org/10.4310/CAG.1999.v7.n1.a6
Hasse, W., Perlick, V.: Redshift in Finsler spacetimes. Phys. Rev. D 100, 024033 (2019). https://doi.org/10.1103/PhysRevD.100.024033
Hohmann, M., Pfeifer, C., Voicu, N.: Finsler gravity action from variational completion. Phys. Rev. D 100, 064035 (2019). https://doi.org/10.1103/PhysRevD.100.064035
Horváth, J.I.: New geometrical methods of the theory of physical fields. Il Nuovo Cimento 9, 444–496 (1958). https://doi.org/10.1007/BF02747685
Javaloyes, M.A., Sánchez, M.: On the definition and examples of cones and Finsler spacetimes. Rev. R. Acad. Cienc. Exactas Fís. Nat. Ser. A Mat. RACSAM, 114 (2020), 30. https://doi.org/10.1007/s13398-019-00736-y
Lämmerzahl, C., Perlick, V., Hasse, W.: Observable effects in a class of spherically symmetric static Finsler spacetimes. Phys. Rev. D 86, 104042 (2012). https://doi.org/10.1142/9789814623995_0255
Lang, S.: Differential Manifolds. Springer, Berlin (1985)
Lu, Y., Minguzzi, E., Ohta, S.-I.: Geometry of weighted Lorentz-Finsler manifolds I: singularity theorems. J. Lond. Math. Soc. 104, 362–393 (2021). https://doi.org/10.1112/jlms.12434
Marsden, J.E., Ratiu, T.S., Scheurle, J.: Reduction theory and the Lagrange-Routh equations. J. Math. Phys. 41, 3379–3429 (2000). https://doi.org/10.1063/1.533317
Minguzzi, E.: Light cones in Finsler spacetime. Commun. Math. Phys. (2015). https://doi.org/10.1007/s00220-014-2215-6
Minguzzi, E.: Causality theory for closed cone structures with applications. Rev. Math. Phys. 31, 1930001 (2019). https://doi.org/10.1142/S0129055X19300012
Morse, M.: The foundations of a theory in the calculus of variations in the large. Trans. Amer. Math. Soc. 30, 213–274 (1928). https://doi.org/10.2307/1989122
Palais, R.S.: Homotopy theory of infinite dimensional manifolds. Topology 5, 1–16 (1966). https://doi.org/10.1016/0040-9383(66)90002-4
Perlick, V.: Fermat principle in Finsler spacetimes. Gen. Relat. Gravit. 38, 365–380 (2006). https://doi.org/10.1007/s10714-005-0225-6
Rutz, S.F.: A Finsler generalisation of Einstein’s vacuum field equations. Gen. Relat. Gravit. 25, 1139–1158 (1993). https://doi.org/10.1007/BF00763757
Seifert, H.-J.: Global Connectivity by Timelike Geodesics. Zeitschrift für Naturforschung A 22, 1356–1360 (1967). https://doi.org/10.1515/zna-1967-0912
Struwe, M.: Variational Methods. Applications to Nonlinear Partial Differential Equations and Hamiltonian Systems. Springer, Berlin, (2008). https://doi.org/10.1007/978-3-540-74013-1
Szulkin, A.: Ljusternik-Schnirelmann theory on \(C^1\)-manifolds. Annales de l’I.H.P. Analyse non linéaire, 5, pp. 119–139 (1988). https://doi.org/10.1016/S0294-1449(16)30348-1
Tonelli, L.: The calculus of variations. Bull. Amer. Math. Soc. 31, 163–172 (1925). https://doi.org/10.1090/S0002-9904-1925-04002-1
Vitório, H.: On the Maslov index in a symplectic reduction and applications. Proc. Amer. Math. Soc. 148, 3517–3526 (2020). https://doi.org/10.1090/proc/14985
Funding
Open access funding provided by Politecnico di Bari within the CRUI-CARE Agreement.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by A. Mondino.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Data sharing not applicable to this article as no datasets were generated or analysed during the current study.
Appendix A: Regularity of the critical points
Appendix A: Regularity of the critical points
In this section we show that a critical point of the action functional \(\mathcal {A}\) on \(\Omega _{p,q}\) is actually a curve of class \(C^1\). This is a quite standard result in relation with Assumptions 2.2 and 2.7, but we give the details for the reader convenience.
Proposition A.1
Let \(L:TM \rightarrow \mathbb {R}\) be a Lagrangian satisfying Assumptions 2.2 and 2.7 and let z be a critical point of the action functional \(\mathcal {A}:\Omega ^{1,2}_{p,q}\rightarrow {\mathbb {R}}\), \(\mathcal {A}(z)=\int _0^1L(z,\dot{z})\text {d}s\). Then, both z and \(\partial _v L(z,{\dot{z}})\) are of class \(C^1\), the Euler-Lagrange equation (2.6) holds in classical sense, namely for all \(i\in \{0,\ldots ,m\}\),
and z satisfies the conservation law
for some constant \(E\in {\mathbb {R}}\).
Proof
As regularity of a critical curve is a local result, by Proposition 7.4 we can assume, without loosing generality, that L is a Lagrangian on \(U\times I\), where U is a precompact open neighborhood of \({\mathbb {R}}^m\) and \(I\subset {\mathbb {R}}\) an open interval, defined as
for all \(\big ((x,t),(\nu ,\tau )\big )\in (U\times I)\times ({\mathbb {R}}^m\times {\mathbb {R}})\). Arguing as in the proof of Proposition 7.4, for any point \({\bar{z}} \in M\), we can take U as a hypersurface in M passing through \({\bar{z}}\) such that \(\omega \) vanishes at \({\bar{z}}\). Let \(z:[0,1]\rightarrow U\times I\), \(z(s)=(x(s), t(s))\) be a critical point for \(\mathcal {A}\) then for all \((\xi ,\eta )\in H^1_0([0,1], {\mathbb {R}}^m)\times H^1_0([0,1], {\mathbb {R}})\) we have
Since \(z=(x,t)\) is a critical point of \(\mathcal {A}\), there exists a constant \(C_z\in \mathbb {R}\) such that
hence we have
Moreover, from (A.4), for all \(\xi \in C^{\infty }_{0}([0,1],\mathbb {R}^m)\) we have
hence,
Then, there exists an \(L^1\) map \(h:[0,1] \rightarrow (\mathbb {R}^m)^*\) such that
Denoting by H a primitive of \(- h\), we obtain the existence of a constant \(A \in (\mathbb {R}^m)^*\) such that
Using (A.5) we obtain
where the right-hand side is an absolute continuous function.
Let us consider the continuous maps \({\mathcal {L}}:U\times \mathbb {R}^m \rightarrow (\mathbb {R}^m)^*\) and \(\mathcal {P}:U\times \mathbb {R}^m \rightarrow U\times (\mathbb {R}^m)*\) defined respectively as
and
As in the proof of Proposition 7.4, recalling how U has been chosen, and up to take a smaller U, we can state that there exists \(C>0\) such that for any \(x\in U\) and for all \(\nu _1,\nu _2\in {\mathbb {R}}^m\)
Notice that (A.7) implies that for each \(x\in U\), \({\mathcal {L}}(x,\cdot )\) is injective with inverse which is continuous on the image of \({\mathcal {L}}(x,\cdot )\). Using again (A.7) together with the continuity of \({\mathcal {L}}\) on \(U\times {\mathbb {R}}^m\), we get that the map \({\mathcal {P}}\) is injective with continuous inverse as well. Hence,by (),
and so x is of class \(C^1\). By (A.5), even \({\dot{t}}\) is continuous, so z is of class \(C^1\) in the coordinate system where the stationary product type local structure (A.3) of L holds and then in any other coordinate system. Hence, the function h is actually a continuous function, and the right-hand side of (A.7) is of class \(C^1\). Since \(\omega \) is a \(C^1\) one-form, (A.7) shows that also \(\partial _\nu L_0(x,{\dot{x}})\) is of class \(C^1\). Being \(\partial _vL(z,\dot{z})\) identifiable with
we deduce that the function \(s\in [0,1]\mapsto \partial _vL\big (z(s),\dot{z}(s)\big )\) is \(C^1\) as well, and then (A.1) holds. Moreover, by standard arguments, z satisfies the conservation law (A.2) for some constant \(E \in \mathbb {R}\) (see, e.g., Proposition 1.16 of [14]). \(\square \)
Remark A.2
We notice that, if \(L_0\) admits positive definite vertical Hessian at some vector in \(\nu \in TU\), then \({\mathcal {L}}\) admits a bijective fiberwise derivative, so it is a local \(C^1\)-diffeomorphism in a neighborhood \({\mathcal {V}}\) of \(\nu \) in TU. Hence, \({\mathcal {P}}\) has a \(C^1\) inverse on \({\mathcal {P}}({\mathcal {V}})\) and then from (A.8) we get that \(\dot{x}\) is \(C^1\) on an open interval J containing the instant \(s_0\) such that \(\dot{x}(s_0)=\nu \). From (A.5), \(\dot{t}\) is \(C^1\) as well on J and then \(z\in C^2(J,M)\). We observe that this holds in particular when L is the quadratic form associated with \(C^1\) stationary Lorentzian metric \(g_L\) (see Example 3.8), hence its critical curves are \(C^2\) on the interval where they are defined and then they are classical geodesics.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Caponio, E., Corona, D. A variational setting for an indefinite Lagrangian with an affine Noether charge. Calc. Var. 62, 39 (2023). https://doi.org/10.1007/s00526-022-02379-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00526-022-02379-1