Fixed energy solutions to the Euler-Lagrange equations of an indefinite Lagrangian with affine Noether charge

Caponio, Erasmo; Corona, Dario; Giambò, Roberto; Piccione, Paolo

doi:10.1007/s10231-024-01424-4

Fixed energy solutions to the Euler-Lagrange equations of an indefinite Lagrangian with affine Noether charge

Open access
Published: 23 April 2024

Volume 203, pages 1819–1850, (2024)
Cite this article

Download PDF

You have full access to this open access article

Annali di Matematica Pura ed Applicata (1923 -) Aims and scope Submit manuscript

Fixed energy solutions to the Euler-Lagrange equations of an indefinite Lagrangian with affine Noether charge

Download PDF

Erasmo Caponio¹^na1,
Dario Corona ORCID: orcid.org/0000-0002-7575-710X²^na1,
Roberto Giambò^2,3^na1 &
…
Paolo Piccione⁴^na1

383 Accesses
1 Altmetric
Explore all metrics

Abstract

We consider an autonomous, indefinite Lagrangian L admitting an infinitesimal symmetry K whose associated Noether charge is linear in each tangent space. Our focus lies in investigating solutions to the Euler-Lagrange equations having fixed energy and that connect a given point p to a flow line $\gamma =\gamma (t)$ of K that does not cross p. By utilizing the invariance of L under the flow of K, we simplify the problem into a two-point boundary problem. Consequently, we derive an equation that involves the differential of the “arrival time” t, seen as a functional on the infinite dimensional manifold of connecting paths satisfying the semi-holonomic constraint defined by the Noether charge. When L is positively homogeneous of degree 2 in the velocities, the resulting equation establishes a variational principle that extends the Fermat’s principle in a stationary spacetime. Furthermore, we also analyze the scenario where the Noether charge is affine.

The topological entropy for autonomous Lagrangian systems on compact manifolds whose fundamental groups have exponential growth

Article 12 July 2019

A variational setting for an indefinite Lagrangian with an affine Noether charge

Article Open access 24 December 2022

An Eulerian–Lagrangian Form for the Euler Equations in Sobolev Spaces

Article 03 June 2016

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

In the problem of finding solutions with fixed energy for an autonomous Lagrangian system with a finite number of degrees of freedom, subject to two-point or periodic boundary conditions, one viable approach is to allow for a free interval of parametrization for the involved curves. This method entails employing the action functional with fixed energy, as originally defined by Mañé (see [1] and the survey [2]). Specifically, given a pair of distinct points on a compact manifold M and a fiberwise convex and superlinear Lagrangian L, there always exists a solution connecting the two points with a fixed energy value $\kappa $, provided that $\kappa $ is strictly greater than the so-called Mañé critical value c(L) [3, Theorem X]. This approach has also demonstrated its effectiveness in addressing the challenging problem of establishing the existence of periodic solutions, as explored in works such as [4,5,6,7].

A more basic approach involves fixing the parameter interval and the initial point p, while allowing the final point to traverse along a given curve $\gamma $. This is a common framework employed in General Relativity when studying causal geodesics, which represent the paths of light rays (photons) or the worldlines of massive particles. In this scenario, the Lagrangian energy coincides with the conserved quantity of the geodesic, namely the square of the norm of its velocity vector. Consequently, the values $\kappa =0, -1$ correspond to the energy levels of light rays and massive particles, respectively. The curve $\gamma $ represents the worldline of an observer, while p signifies the event of emission or, in an alternative perspective, $\gamma $ symbolizes the worldline of a source of light signals or massive particles, and p represents the event of detecting those signals (in the latter case, the geodesics originate from $\gamma $ and terminate at p). In the former scenario, the parameter value of $\gamma $ (more precisely, its “proper time”) at the intersection point with a lightlike future-oriented curve z from p to $\gamma $ is referred to as the arrival time of z [8]. The future-oriented lightlike geodesics are then all and only the stationary points of the arrival time with respect to any smooth variation $z_\epsilon $ made by smooth future-oriented lightlike curves between p and $\gamma $ [9]. This general statement is recognized as Fermat’s principle in General Relativity.

In fact, when the spacetime is static or stationary, Levi-Civita first introduced Fermat’s principle in local coordinates in [10] and [11] and observed that the geometry of lightlike geodesics in the spacetime can be linked to a metric in a spacelike slice, called optical metric, which is Riemannian, when the spacetime is static, and Finslerian in the stationary case, and that allows describing and calculating various geometric and causal properties of the spacetime through it [12,13,14,15,16]. Subsequently, other variations of Fermat’s principle have emerged, sharing fundamental elements while possessing distinct technical characteristics (see, e.g., [17,18,19,20,21,22,23]); moreover, it has been generalized to massive particles in [15, 24]. Light rays and massive particles via variational methods have also been studied in [25,26,27,28,29,30]. The use of Fermat’s principle also emerges in the study of motion around black holes [31], as well as in the well-known phenomenon known as gravitational lensing [32,33,34,35,36], which has been specialized for particular exact solutions of the Einstein field equations, such as Schwarzschild [37] and NUT spacetimes [38]. Interestingly, the utilization of least-time principles in observational cosmology opens up the possibility of interpreting observed instances of gravitational lensing without the need to invoke the existence of dark matter [39]. We recommend referring readers to [40] for a comprehensive review on gravitational lensing in a relativistic context.

Additionally, it is worth noting that the application of variational principles related with or inspired by Fermat’s principle has been extended and generalized beyond Euclidean or Lorentzian geometry as evidenced in [23, 41,42,43,44,45,46,47,48,49,50].

This work contributes to the latest type of research. We consider an indefinite Lagrangian L on a manifold M that is invariant under a one-dimensional group of local diffeomorphisms generated by a complete vector field K. The Noether charge associated with L is assumed to be linear in each tangent space $T_xM$. Our focus lies on solutions to the Euler-Lagrange equations of the action functional of L that connect a point p to a flow line $\gamma $ of K and having fixed energy $\kappa $. Our approach is based on the variational setting in [51], which is inspired by [52]. The main result in this work, Theorem 5.1, at least when L is 2-homogeneous in the velocities, extends Fermat’s principle as established in [53], that was specifically tailored to the framework explored in [52]. Leveraging this extension, we provide proof of existence (Theorem 6.9-(a)) and a multiplicity result (Theorem 6.9-(b)) for such solutions. Additionally, we delve into the analysis of the case where the Noether charge is an affine function in Appendix 1. In Appendix B we give some results that link an assumption on the manifold of curves that we consider in our variational setting, called pseudocoercivity, with the notion of global hyperbolicity for the cone structure associated with L in the 2-homogeneous case.

2 Notations, assumptions and a class of examples

Let M be a smooth, connected manifold of dimension $(m+1)$, where $m \ge 1$. We denote the tangent bundle of M as TM. In this paper, we consider a Riemannian metric g on M as an auxiliary metric, and we use $\left\| \cdot \right\| :TM \rightarrow {\mathbb {R}}$ to represent its induced norm; specifically, for any $v \in TM$, we have $\left\| v\right\| ^2 = g(v,v)$. We represent an element of TM as a pair (x, v), where x belongs to M and v belongs to the tangent space $T_xM$.

Let $L:TM \rightarrow {\mathbb {R}}$ be a Lagrangian on M. For any $(x,v) \in TM$, we denote the vertical derivative of L as $\partial _vL(x,v)[\cdot ]$, which is is defined as follows:

$$\begin{aligned} \partial _vL(x,v)[\xi ] = \frac{\text {d}}{\text {d}s}L(x,v + s\xi )\bigg |_{s = 0}, \quad \forall \xi \in T_xM. \end{aligned}$$

We also need a derivative w.r.t. x, denoted by $\partial _xL(x,v)$. This is defined only locally (in a system of coordinates) as:

$$\begin{aligned} \partial _xL(x,v)[\xi ] = \sum _{i = 0}^{m} \frac{\partial L}{\partial x^i}(x,v)\xi ^i, \quad \forall \xi \in T_xM, \end{aligned}$$

where $(x^0,\dots ,x^{m})$ is a local coordinate system in a neighbourhood of x, and consequently, $(x^0,\dots ,x^{m},v^0,\dots ,v^m)$ are the induced coordinates on TM. With this notation, the Euler-Lagrange equations for a curve $z:[0,1] \rightarrow M$ of class $C^1$ are given by:

$$\begin{aligned} \frac{\text {d}}{\text {d}s}\partial _vL(z,\dot{z}) - \partial _xL(z,\dot{z}) = 0, \quad \forall s \in [0,1], \end{aligned}$$

(2.1)

where $\dot{z}$ denotes the derivative of z with respect to the parameter s. It is well-known that the energy function $E:TM \rightarrow {\mathbb {R}}$, defined as:

$$\begin{aligned} E(x,v) = \partial _vL(x,v)[v] - L(x,v), \end{aligned}$$

is a first integral of the Lagrangian system. Therefore, if $z:[0,1] \rightarrow M$ is a solution of the Euler-Lagrange equations, there exists a constant $\kappa \in {\mathbb {R}}$ such that:

$$\begin{aligned} E(z,\dot{z}) = \kappa , \quad \forall s \in [0,1]. \end{aligned}$$

(2.2)

Assumption 1

The Lagrangian $L:TM \rightarrow {\mathbb {R}}$ satisfies the following conditions:

L is a $C^1$ function on TM;
There exists a complete $C^3$ vector field K on M such that L is invariant under the one-parameter group of $C^3$ diffeomorphisms of M generated by K (we refer to K as an infinitesimal symmetry of L);
The Noether charge, i.e., the map $(x,v)\in TM\mapsto \partial _v L(x,v)[K]\in {\mathbb {R}}$, is a $C^1$ one-form Q on M, namely
$$\begin{aligned} \partial _vL(x,v)[K]=Q(v); \end{aligned}$$
(2.3)
For every $x \in M$, the following equality holds:
$$\begin{aligned} Q(K) = -1 \end{aligned}$$
(2.4)

Remark 2.1

In [51], the Noether charge was assumed to be an affine function on each tangent space. For the sake of simplicity, we present the main results under the more restrictive assumption of linearity. A discussion about the affine case is given in Appendix 1.

Remark 2.2

The $C^3$ regularity condition on K is needed to get that a certain map constructed by using the flow of K is a diffeomorphism (see Proposition 4.1). We don’t know if the regularity of K can be lowered there to $C^1$.

Assumption 2

The Lagrangian $L_c:TM \rightarrow {\mathbb {R}}$, defined by

$$\begin{aligned} L_c(x,v) {:}{=}L(x,v) + Q^2(v), \end{aligned}$$

(2.5)

satisfies the following conditions:

there exists a continuous function $C:M\rightarrow (0,+\infty )$ such that for all $(x,v)\in TM$, the following inequalities hold:
$$\begin{aligned} L_c(x,v)&\le C(x)\big (\left\| v\right\| ^2+1\big ); \end{aligned}$$
(2.6)
$$\begin{aligned} |\partial _x L_c(x,v)|&\le C(x)\big (\left\| v\right\| ^2+1\big );\end{aligned}$$
(2.7)
$$\begin{aligned} |\partial _v L_c(x,v)|&\le C(x)\big (\left\| v\right\| +1\big ); \end{aligned}$$
(2.8)
there exists a continuous function $\lambda :M \rightarrow (0,+\infty )$ such that for each $x\in M$ and for all $v_1, v_2\in T_xM$, the following inequality holds:
$$\begin{aligned} \big (\partial _v L_c(x,v_2)-\partial _v L_c(x,v_1)\big )[v_2-v_1] \ge \lambda (x)\left\| v_2 - v_1\right\| ^2; \end{aligned}$$
(2.9)

Remark 2.3

As proven in [51, Proposition 2.5], K is also an infinitesimal symmetry of $L_c$, and a simple computation shows that

$$\begin{aligned}\partial _v L_c(x,v)[K] = - Q(v).\end{aligned}$$

Remark 2.4

From [51, Proposition 7.4], if L satisfies Assumptions 1 and 2, it admits a stationary product type local structure. This means that for each point $p \in M$, there exists a neighbourhood $U_p \subset M$, an open neighborhood $S_p$ of ${\mathbb {R}}^{m}$, an open interval $I_p$ of ${\mathbb {R}}$, and a diffeomorphism $\phi :S_p \times I_p \rightarrow U_p$ such that, denoting t as the natural coordinate of $I_p$,

$$\begin{aligned} \phi (\partial _t) = K|_{U_p}, \end{aligned}$$

and the function L can be expressed as follows:

$$\begin{aligned} L(x,v) = L\circ \phi \big ((y,t),(\nu ,\tau )\big ) = L_0(y,\nu ) + \omega _y(\nu )\tau - \frac{1}{2}\tau ^2, \end{aligned}$$

(2.10)

where

$(y,t)\in S_p \times I_p$, $(\nu ,\tau )\in {\mathbb {R}}^{m}\times {\mathbb {R}}$, and $(x,v) = \phi \big ((y,t),(\nu ,\tau )\big )$;
$L_0 \in C^1(S_p)$ is a Lagrangian that satisfies the growth conditions (2.6)–(2.8) with respect to the norm $\left\| \cdot \right\| _{S_p}$ and it is fiberwise strongly convex, i.e., (2.9) holds (with $L_c$ replaced by $L_0$) for some function $\lambda :S_p \rightarrow (0,+\infty )$;
$\omega _y$ is the $C^1$ one-form induced by Q on $S_p$.

Using this notation, we have the following equalities:

$$\begin{aligned} Q(v)&= Q\circ \phi (\nu ,\tau ) = \omega _y(\nu ) - \tau ; \nonumber \\ L_c(x,v)&= L_c\circ \phi _* \big ((y,t),(\nu ,\tau )\big ) = L_0(y,\nu ) + \omega _y^2(\nu ) - \omega _y(\nu )\tau + \frac{1}{2}\tau ^2; \end{aligned}$$

(2.11)

and

$$\begin{aligned} E(x,v) = E\circ \phi _{*}\big ((y,t),(\nu ,\tau )\big ) = E_0(y,\nu ) + \omega _y(\nu )\tau - \frac{1}{2}\tau ^2, \end{aligned}$$

(2.12)

where $E_0(y,\nu ) = \partial _\nu L_0(y,\nu )[\nu ] - L_0(y,\nu )$.

Remark 2.5

For every $p \in M$, let $\phi _p:S_p \times I_p \rightarrow M$ be a mapping that satisfies (2.10). Since $L_0$ is fiberwise strongly convex, we can conclude that

$$\begin{aligned} E_0(y,\nu ) > E_0(y,0) = - L_0(y,0), \qquad \forall \nu \ne 0, \end{aligned}$$

(2.13)

Indeed, from the strict convexity of $L_0$ we have

$$\begin{aligned} L_0(y,0) > L(y,\nu ) + \partial _\nu L(y,\nu )[-\nu ], \qquad \forall \nu \ne 0. \end{aligned}$$

Assumption 3

We require:

$$\begin{aligned} \sup _{x\in M} L(x,0)<+\infty . \end{aligned}$$

(2.14)

Remark 2.6

We need the last assumption to guarantee the existence of $\kappa \in {\mathbb {R}}$ satisfying (4.19), which is a key condition for our main result.

2.1 Lorentz-Finsler metrics

Provided the existence of an infinitesimal symmetry, an important kind of Lagrangians that satisfy the above assumptions is given by Lorentz-Finsler metrics, introduced by J. K. Beem in [54].

Definition 2.7

Let M be a smooth, connected manifold of dimension $m+1$. A Lagrangian $L_F:TM \rightarrow {\mathbb {R}}$ is called Lorentz-Finsler metric if it satisfies the following conditions:

(a)
$L_F \in C^1(TM) \cap C^2(TM \setminus 0)$, where 0 denotes the zero section of TM;
(b)
$L_F(x,\lambda v) = \lambda ^2 L_F(x,v)$, for all $\lambda > 0$;
(c)
for any $(x,v)\in TM \setminus 0$, the vertical Hessian of $L_F$, i.e. the symmetric matrix
$$\begin{aligned} (g_F)_{\alpha \beta }(x,v){:}{=}\frac{\partial ^2 L_F}{\partial v^\alpha \partial v^\beta }(x,v), \quad \alpha ,\beta = 0,\dots ,m, \end{aligned}$$
is non-degenerate with index 1.

Remark 2.8

The regularity conditions required on $L_F$ are sometimes too rigid and we relax them to include some interesting classes of Lagrangians (see [46, 55]). The first and the last conditions above will be replaced by:

(a’)
Let M be a smooth, connected, manifold of dimension $m+1$, $m\ge 1$, and $L_F\in C^{1}(TM)\cap C^2({\mathcal {O}}$), where ${\mathcal {O}}\subset TM\setminus 0$ is such that $\mathcal O_x{:}{=}{\mathcal {O}}\cap T_xM\ne \emptyset $ for all $x\in M$, and ${\mathcal {O}}_x$ is an open set in $T_xM$ which is a linear cone (i.e. $\lambda v\in {\mathcal {O}}_x$, for all $\lambda >0$, if $v\in {\mathcal {O}}_x$); moreover, for any $v_1, v_2\in T_xM$ there exist two sequences of vectors $v_{1k}, v_{2k}$ such that, for all $k\in {\mathbb {N}}$, the segment with extreme points $v_{1k}$ and $v_{2k}$ is entirely contained in ${\mathcal {O}}_x$ and $v_{ik}\rightarrow v_i$, $i=1,2$.
(c’)
Condition (c) is valid for each $(x,v)\in {\mathcal {O}}$; moreover the eigenvalues $\lambda _i(x,v)$ of $(g_F)_{\alpha ,\beta }(x,v)$ are bounded away from 0 on $\mathcal O_x$, i.e. there exists $\lambda _+(x)>0$ such that
$$\begin{aligned} |\lambda _i(x,v)|\ge \lambda _+(x), \end{aligned}$$
(2.15)
for all $i \in \{0,\ldots ,m\}$ and $v\in {\mathcal {O}}_x$.

If $L_F$ is a Lorentz-Finsler metric, then the couple $(M,L_F)$ is called a Finsler spacetime.

The study of the notion of a Finsler spacetime has received renewed impetus from various sources. V. Perlick’s work [41], which explores Fermat’s principle, was particularly influential. Subsequent contributions came from [56] (also see [57]), which revived the research initiated by G. Y. Bogoslowsky [58,59,60], and from [61]. Additional momentum was provided by the works of V. A. Kostelecký and collaborators [62,63,64,65], as well as C. Pfeifer, N. Voicu, and their coworkers (refer to [66,67,68,69,70,71] for further details). Notable mathematical contributions include [72,73,74], which have influenced the field in a different manner. For a comprehensive historical overview, diverse definitions of a Finsler spacetime, and additional references, interested readers are directed to [44, 48, 75, 76].

As we will see later, the significance of Lorentz-Finsler metrics relies on the 2-homogeneity assumption. This homogeneity ensures that the solutions of the Euler-Lagrange equations, with a suitably prescribed energy value (in this case, less than or equal to 0), connecting a point to a flow line of the infinitesimal symmetry vector field, are the ones for which the time of arrival is critical. Therefore, Fermat’s principle holds (see Remark 5.2).

Proposition 2.9

Let $L_F:TM \rightarrow {\mathbb {R}}$ be a Lorentz-Finsler metric satisfying $(a')$, (b) and $(c')$ above, and assume there exists an infinitesimal symmetry $K:M \rightarrow TM$ such that (2.3) and (2.4) hold. Then Assumptions 2 and 3 hold.

Proof

Assumption 3 is ensured by the 2–homogeneity of $L_F$, since $L_F(x,0) = 0$ for every $x \in M$. Let us show that Assumption 2 holds. As a first step, we notice that the Lagrangian $L_c:TM \rightarrow {\mathbb {R}}$, defined by

$$\begin{aligned} L_c(x,v) = L_F(x,v) + Q^2(v), \end{aligned}$$

admits vertical Hessian at any $(x,v)\in {\mathcal {O}}$ that is a positive definite bilinear form on $T_xM$. For any $(x,v) \in {\mathcal {O}}$, we have

$$\begin{aligned} \partial _{vv}L_c(x,v) = \partial _{vv}L_F(x,v) + 2 Q \otimes Q. \end{aligned}$$

(2.16)

For each $w \in T_x M$, we have, thanks to (2.3),

$$\begin{aligned}{} & {} \partial _{vv}L_F(x,v)[K,w] =\frac{\partial ^2L_F}{\partial s \partial t} (x,v + tK + sw)\Big |_{(s,t) = (0,0)}\nonumber \\{} & {} \quad = \frac{\partial (\partial _v L_F(x,v + s w)[K])}{\partial s}\Big |_{s = 0} = \frac{\partial Q(v + sw)}{\partial s}\Big |_{s = 0} = Q(w), \end{aligned}$$

(2.17)

hence we obtain

$$\begin{aligned} \partial _{vv}L_c(x,v)[K,K] = \partial _{vv}L_F(x,v)[K,K] + 2 Q^2(K) = Q(K) + 2 = 1 > 0. \end{aligned}$$

Now consider $w \in \textrm{ker}\, Q$; from (2.17) we have $\partial _{vv}L_F(x,v)[w,K] = 0$, and since $\partial _{vv}L_F(x,v)$ has index 1 we obtain that $\partial _{vv}L_c(x,v)[w,w] = \partial _{vv}L_F(x,v)[w,w]>0$, for all $w\in \ker Q$, from which we conclude that $\partial _{vv}L_c(x,v)[\cdot ,\cdot ]$ is positive definite.

Let

$$\begin{aligned} \lambda (x):\inf _{v\in {\mathcal {O}}_x}\min _{w\in T_x M, \Vert w\Vert =1} \partial _{vv}L_c(x,v)[w,w]. \end{aligned}$$

(2.18)

As any $w\in \ker Q$ is orthogonal to $K_x$ with respect to both bilinear forms $\partial _{vv}L_c(x,v)$ and $\partial _{vv}L_F(x,v)$, by (2.16) we deduce that the determinants of $(g_F)_{\alpha \beta }$ and $(g_c)_{\alpha \beta }$ are opposite numbers and then, from (2.15) we conclude that $\lambda (x)>0$, for all $x\in M$.

Inequality (2.9) then follows by the mean value theorem applied to the function $v \in {\mathcal {O}}_x \mapsto \partial _vL_c(x,v)[v_2 - v_1] $, when $v_1$ and $v_2$ both belong to ${\mathcal {O}}_x$ and the segment having them as extreme points is contained in ${\mathcal {O}}_x$ as well. Then, for each $x \in M$, (2.9) follows by continuity due to the property of approximation by segments in (a’). The inequalities (2.6), (2.7) and (2.8) are ensured by the fact that $L_c$ is $C^1$ on TM and it is positive homogeneous of degree 2 w.r.t. v. $\square $

Remark 2.10

As shown in the above proof, the vertical Hessian of $L_c$ is positive definite on ${\mathcal {O}}$, the last being dense in TM. Hence, by homogeneity, $L_c$ is a non-negative fiberwise strongly function on TM. Moreover, the vertical Hessian of $F_c{:}{=}\sqrt{L_c}$ at any $(x,v)\in {\mathcal {O}}$ is positive semi-definite (see, e.g., [86, p. 8]). Hence, for any $v_1$ and $v_2$ belonging to ${\mathcal {O}}_x$ defining a segment contained in ${\mathcal {O}}_x$, we get by Taylor’s theorem,

$$\begin{aligned} F_c(x,v_2)\ge F_c(x,v_1)+\partial _{v}F_c(x,v_1)[v_2-v_1]. \end{aligned}$$

By continuity and the approximation by segments property in (a’), the above inequality holds on TM, hence $F_c$ is fiberwise convex and therefore it is a Finsler metric on M, (i.e., $F_c(x,\cdot )$ is non-negative, positively homogeneous, and satisfies the triangle inequality on $T_xM$, for each $x\in M$) whose square is only of class $C^1$ on TM.

As a consequence of Proposition 2.9, if $L_F:TM \rightarrow M$ is a Lorentz-Finsler metric and there exists a complete vector field K such that Assumption 1 holds, then Remark 2.4 ensures that $L_F$ can be locally expressed as follows:

$$\begin{aligned} L_F(x,v) = L_F\circ \phi ((y,t),(\nu ,\tau )) = F^2(y,\nu ) + \omega _y(\nu )\tau - \frac{1}{2}\tau ^2, \end{aligned}$$

(2.19)

where $F:TS \rightarrow {\mathbb {R}}$ is a Finsler metric on S, with $F^2\in C^1(TM)$. Whenever $L_F$ is not twice differentiable only at the line sub-bundle of TM defined by K, F becomes a classical Finsler metric on S, (i.e. $F^2 \in C^2(TS{\setminus } 0)$ and, for each $y\in S$, $F(y,\cdot )$ is a Minkowski norm on $T_yS$, see e.g. [86, §1.2]).

Since in this case K is a timelike Killing vector field, namely it is an infinitesimal symmetry of $L_F$ such that $L_F(x,K) < 0$ for every x, $(M,L_F)$ is called stationary Finsler spacetime. In particular, if $L_F$ is twice differentiable on $TM\setminus 0$, then $F^2(y,\cdot )$ in (2.19) must be the square of the norm of a positive definite inner product on $T_yS$. We thank the referee for this observation. In fact, a special kind of stationary Finsler spacetimes are the stationary Lorentzian manifolds, namely those Lorentzian manifolds $(M,g_L)$ for which $g_L$ is a Lorentzian metric and there exists a timelike Killing vector field for $g_L$. In this case, the stationary product type local structure is given by

$$\begin{aligned} g_L(v,v) = g_R(\nu ,\nu )+\omega (\nu )\tau -\frac{1}{2}\tau ^2, \end{aligned}$$

where $g_R$ is a Riemannian metric on an open neighbourhood S of ${\mathbb {R}}^m$. In this direction, the results in this paper improve previous results about stationary Lorentzian metrics (see, [15, 19, 53]), since just $C^1$ stationary metrics with a $C^3$ timelike Killing vector field are allowed and both lightlike and timelike geodesics can be considered in an unified setting.

3 Variational setting

Let us fix a point $p \in M$ and consider a flow line $\gamma :{\mathbb {R}} \rightarrow M$ of K that does not pass through p, i.e., $p \notin \gamma ({\mathbb {R}})$. We are interested in finding solutions of the Euler-Lagrange equations that connect p to points on $\gamma $ with a fixed energy $\kappa \in {\mathbb {R}}$. Specifically, we seek to characterize curves $z\in C^1([0,1],M)$ that satisfy (2.1), with $z(0) = p$, $z(1) \in \gamma ({\mathbb {R}})$, and $E(z(s),\dot{z}(s)) = \kappa $ for all $s \in [0,1]$.

We define the action functional ${\mathcal {L}}:H^1([0,1],M) \rightarrow {\mathbb {R}}$ as follows:

$$\begin{aligned} {\mathcal {L}}(z) {:}{=}\int _{0}^{1} L(z,\dot{z})\, \textrm{d}s. \end{aligned}$$

Similarly, we define the energy functional:

$$\begin{aligned} {\mathcal {E}}(z) {:}{=}\int _{0}^{1} E(z,\dot{z})\, \textrm{d}s. \end{aligned}$$

We note that both ${\mathcal {L}}$ and ${\mathcal {E}}$ are well-defined on $H^1([0,1],M)$ and they are respectively a $C^{1}$ and a $C^{0}$ functional due to (2.5), the growth conditions (2.6)–(2.8) and the fiberwise convexity of $L_c$ (2.9) (see, e.g., the first part of the proof of Proposition 3.1 in [78]).

Remark 3.1

Henceforth, we will assume that ${\mathcal {E}}$ is a $C^1$ functional. This holds if L is positively homogeneous of degree 2 in the velocities, since in that case ${\mathcal {E}}={\mathcal {L}}$; moreover it holds if $L_c$ is a $C^2$, strongly convex Lagrangian on TM with second derivatives satisfying assumptions (L1’) in [78, p. 605].

Recalling that we have chosen a fixed point $p \in M$, we define the set $\Omega _{p,r}(M)$ for every $r \in M$ as follows:

$$\begin{aligned} \Omega _{p,r}(M) {:}{=}\big \{ z \in H^1([0,1],M): z(0)=p, z(1) = r \big \}, \end{aligned}$$

and we denote by ${\mathcal {L}}_{p,r}$ the restriction of ${\mathcal {L}}$ to $\Omega _{p,r}(M)$.

Remark 3.2

According to [51, Proposition A.1], if z is a critical point of ${\mathcal {L}}_{p,r}$, then both z and the function

$$\begin{aligned} s \mapsto \partial _vL(z(s),\dot{z}(s))[\dot{z}(s)] \end{aligned}$$

are of class $C^1$. As a consequence, z is a critical point of ${\mathcal {L}}_{p,r}$ if and only if equation (2.1) holds and there exists $\kappa \in {\mathbb {R}}$ such that equation (2.2) holds.

3.1 Preliminary results

Recalling that K is a complete vector field, we denote by $\psi :{\mathbb {R}} \times M \rightarrow M$ the flow of K, and by $\partial _u\psi $ and $\partial _x\psi $ the partial derivatives of $\psi (t,x)$ with respect to $t \in {\mathbb {R}}$ and $x \in M$, respectively.

Let us denote by $K^c$ the complete lift of K to TM (see, e.g., [46]). Then, for any $(x,v)\in TM$, the flow $\psi ^c$ of $K^c$ on TM is given by $\psi ^c(t,x,v)=\big (\psi (t,x),\partial _x\psi (t,x)[v]\big )$, and we have

$$\begin{aligned} K^c(L)\big (\psi ^c(t,x,v)\big )= \dfrac{\partial \big (L\circ \psi ^c\big )}{\partial t}(t,x,v). \end{aligned}$$

Since K is an infinitesimal symmetry of L, we have

$$\begin{aligned} \dfrac{\partial \big (L\circ \psi ^c\big )}{\partial t}(t,x,v)=0, \end{aligned}$$

(3.1)

which implies

$$\begin{aligned} K^c(L)(x,v) = \frac{\partial L }{\partial x^h}(x,v)K^h(x) + \frac{\partial L}{\partial v^h}(x,v)\frac{\partial K^h}{\partial x^i}(x)v^i =0. \end{aligned}$$

(3.2)

Moreover, from (3.1) we also obtain

$$\begin{aligned} L(x,v) = L\big (\psi (t,x),\partial _x\psi (t,x)[v]\big ), \quad \forall (x,v) \in TM,\, t \in {\mathbb {R}}, \end{aligned}$$

(3.3)

and consequently

$$\begin{aligned} \partial _v L(x,v)[\xi ] = \partial _vL\big (\psi (t,x),\partial _x\psi (t,x)[v]\big )\big [\partial _x\psi (t,x)[\xi ]\big ]. \end{aligned}$$

(3.4)

Lemma 3.3

If $z:[0,1]\rightarrow M$ is a weak solution of the Euler-Lagrange equation (2.1) (i.e. a critical point of ${\mathcal {L}}$ on $\Omega _{z(0),z(1)}(M)$ $)$, then it is a $C^1$ curve and its Noether charge is constant, namely there exists $c \in {\mathbb {R}}$ such that

$$\begin{aligned} \partial _vL(z(s),\dot{z}(s))[K(z(s))] = c, \quad \forall s \in [0,1]. \end{aligned}$$

Proof

By [51, Proposition A.1], both z and $\partial _vL(z,\dot{z})$ are of class $C^1$. Therefore, it suffices to prove that, for every $s \in [0,1]$, we have

$$\begin{aligned} \frac{\textrm{d}}{\textrm{d}s}\Big ( \partial _vL(z(s),\dot{z}(s))[K(z(s))]\Big ) = 0. \end{aligned}$$

Therefore, we can work on a local coordinate system $(x^0,\dots ,x^{m},v^0,\dots ,v^m)$ of TM and, using (2.1) and (3.2), we obtain the following chain of equalities:

$$\begin{aligned}&{\frac{\textrm{d}}{\textrm{d}s}\left( \frac{\partial L}{\partial v^i}\big (z(s),\dot{z}(s)\big )K^i(z(s))\right) }&\\&\quad =\frac{\textrm{d}}{\textrm{d}s}\left( \frac{\partial L}{\partial v^i}\big (z(s),\dot{z}(s)\big )\right) K^i(z(s)) +\frac{\partial L}{\partial v^i}\big (z(s),\dot{z}(s)\big )\frac{\partial K^i}{\partial x^h}(z(s))\dot{z}^h(s)\\&\quad =\frac{\partial L}{\partial x^i}\big (z(s),\dot{z}(s)\big )K^i(z(s)) +\frac{\partial L}{\partial v^i}\big (z(s),\dot{z}(s)\big )\frac{\partial K^i}{\partial x^h}(z(s))\dot{z}^h(s) =0. \end{aligned}$$

$\square $

On the basis of Lemma 3.3, the curves with a constant Noether charge are the only ones that can be critical points of the action functional. The following results ensure that this subset of curves is indeed a closed manifold of class $C^1$, allowing for a simplification of the variational setting by considering only these curves. A detailed proof can be found in [51] and relies on the linearity assumption of the Noether charge.

Let us define the following sets:

$$\begin{aligned} {\mathcal {N}}_{p,r} {:}{=}\{z \in \Omega _{p,r}(M): \exists c \in {\mathbb {R}} \text { such that } Q(\dot{z}) = c \text { a.e. on } [0,1]\} \subset \Omega _{p,r}(M), \end{aligned}$$

and

$$\begin{aligned} {\mathcal {W}}_z:= \big \{ \eta \in T_z\Omega _{p,r}(M): \exists \mu \in H^{1}_0([0,1],{\mathbb {R}})\\ \text { such that } \eta (s)=\mu (s)K(z(s)), \text { a.e. on } [0,1] \big \}. \end{aligned}$$

Since L is invariant under the one-parameter group of local $C^1$ diffeomorphisms generated by K, we have the following result.

Proposition 3.4

The space ${\mathcal {N}}_{p,r}$ is non-empty, it is a $C^1$ closed submanifold of $\Omega _{p,r}(M)$ and satisfies

$$\begin{aligned} {\mathcal {N}}_{p,r} = \left\{ z \in \Omega _{p,r}(M): \textrm{d}{\mathcal {L}}_{p,r}(z)[\eta ] = 0, \forall \eta \in {\mathcal {W}}_z \right\} . \end{aligned}$$

(3.5)

Moreover, for every $z \in {\mathcal {N}}_{p,r}$, the tangent space of ${\mathcal {N}}_{p,r}$ at z is given by

$$\begin{aligned} T_z{\mathcal {N}}_{p,r} = \left\{ \xi \in T_z\Omega _{p,r}(M): \exists c\in {\mathbb {R}} \text { such that } \partial _xQ(\dot{z})[\xi ]+Q({\dot{\xi }})=c \text { a.e.} \right\} , \end{aligned}$$

(3.6)

and

$$\begin{aligned} T_z\Omega _{p,r}(M) = T_z{\mathcal {N}}_{p,r} +{\mathcal {W}}_z. \end{aligned}$$

(3.7)

Proof

The fact that ${\mathcal {N}}_{p,r}\ne \emptyset $, for all $p, r\in M$, follows from [51, proposition 6.4]. Equality (3.5) is proved in [51, Proposition 4.2], and (3.6) is a particular case of [51, Proposition 4.3]. Finally, (3.7) comes from [51, Lemma 4.4],^{Footnote 1}. $\square $

The above result gives the following variational principle for the critical points of ${\mathcal {L}}_{p,r}$, which extends a result by F. Giannoni and P. Piccione (see [52]).

Proposition 3.5

Let ${\mathcal {J}}_{p,r}:{\mathcal {N}}_{p,r} \rightarrow {\mathbb {R}}$ be the restriction of ${\mathcal {L}}_{p,r}$ to ${\mathcal {N}}_{p,r}$. Then, z is a critical point for ${\mathcal {L}}_{p,r}$ if and only if $z \in {\mathcal {N}}_{p,r}$ and z is a critical point for ${\mathcal {J}}_{p,r}$.

Proof

See [51, Theorem 4.7]. $\square $

4 The variational structure of the action in relation with the flow of K

In this section, we consider the flow of the complete vector field K and its relationship with the variational structure of the action. More precisely, let $\psi :{\mathbb {R}} \times M \rightarrow M$ denote the flow generated by the vector field K. Given a flow line $\gamma :{\mathbb {R}} \rightarrow M$ of K, there exists a point $q \in M$ such that $\gamma (t) = \psi (t,q)$.

Our goal is to prove that for each $t \in {\mathbb {R}}$, there is a diffeomorphism between ${\mathcal {N}}_{p,q}$ and ${\mathcal {N}}_{p,\gamma (t)}$. This enables us to define a functional (see (4.9)) on ${\mathcal {N}}_{p,q}\times {\mathbb {R}}$ and obtain an alternative equation for solutions of the Euler-Lagrange equations connecting p and $\gamma $ (see (4.13)). Furthermore, recalling that we seek the solutions of Euler-Lagrange equations with a fixed energy $\kappa $, we show that for any $z\in {\mathcal {N}}_{p,q}$, there are two values of $t\in {\mathbb {R}}$ such that ${\mathcal {E}}(z^t) = \kappa $, where $\kappa $ satisfies (4.19) and $z^t\in {\mathcal {N}}_{p,\gamma (t)}$ is the curve corresponding to z via the diffeomorphism. Therefore, we can simplify the problem and study a couple of functionals defined only on ${\mathcal {N}}_{p,q}$ (see (4.20)).

Let us define the map $F^t:\Omega _{p,q}(M) \rightarrow \Omega _{p,\gamma (t)}(M)$ as follows:

$$\begin{aligned} \big (F^t(z)\big )(s) {:}{=}\psi (ts,z(s)). \end{aligned}$$

(4.1)

To simplify the notation, we write

$$\begin{aligned} z^t = F^t(z) \end{aligned}$$

for any $z \in \Omega _{p,q}(M)$.

Proposition 4.1

The map $F^t$ is a diffeomorphism with its inverse being $F^{-t}$. Furthermore, $F^t|_{{\mathcal {N}}_{p,q}}$ is a diffeomorphism from ${\mathcal {N}}_{p,q}$ to ${\mathcal {N}}_{p,\gamma (t)}$. Therefore, for every $z \in \Omega _{p,q}(M)$, we have the following equivalences:

$$\begin{aligned} \textrm{d}F^t(z)[\xi ] \in T_{z^t}{\mathcal {N}}_{p,\gamma (t)} \quad \mathrm{{if}}\,\, \mathrm{{and}}\,\, \mathrm{{only}}\,\, \mathrm{{if}} \quad \xi \in T_z{\mathcal {N}}_{p,q}, \end{aligned}$$

(4.2)

and

$$\begin{aligned} \textrm{d}F^t(z)[\eta ] \in {\mathcal {W}}_{z^t} \quad \mathrm{{if}}\,\, \mathrm{{and}}\,\, \mathrm{{only}}\,\, \mathrm{{if}} \quad \eta \in {\mathcal {W}}_{z}. \end{aligned}$$

(4.3)

Proof

By utilizing a result by R. Palais [77] and considering that the flow of K is $C^3$, we can conclude that $F^t$ is a diffeomorphism (cf. [53, Proposition 2.2]). Recalling that $\partial _u\psi $ is the differential of $\psi $ with respect to the first variable, we can derive the following equalities:

$$\begin{aligned} \partial _u \psi (ts,z(s))[1] = K(\psi (ts,z(s))), \end{aligned}$$

and

$$\begin{aligned} \partial _x\psi (ts,z(s))[K(z(s))] = K(\psi (ts,z(s))). \end{aligned}$$

(4.4)

Consequently, we obtain the velocity of $z^t$ as:

$$\begin{aligned} \frac{\textrm{d}}{\textrm{d}s}z^t(s) = \dot{z}^t(s) = \partial _u\psi (ts,z(s))[t] + \partial _x\psi (ts,z(s))[\dot{z}(s)]. \end{aligned}$$

(4.5)

Now, considering that $Q(K) \equiv -1$, we deduce:

$$\begin{aligned} \partial _vL(z^t,\dot{z}^t)[K(z^t)] = Q(\dot{z}^t) = -t + Q\left( \partial _x \psi (ts,z(s))[\dot{z}(s)]\right) . \end{aligned}$$

Hence, from (3.4), we have:

$$\begin{aligned} Q(\dot{z}^t) = Q(\dot{z}) - t, \end{aligned}$$

(4.6)

which implies that $z^t \in {\mathcal {N}}_{p,\gamma (t)}$ if and only if $z \in {\mathcal {N}}_{p,q}$. Therefore, this implies (4.2). Finally, (4.3) follows from $\textrm{d}F^t(z)[\nu ]=\partial _x\psi (ts,z(s))[\nu (s)]$ and (4.4). $\square $

We introduce the functional ${\mathcal {H}}_{p,q}:\Omega _{p,q}(M)\times {\mathbb {R}} \rightarrow {\mathbb {R}}$ defined as follows:

$$\begin{aligned} {\mathcal {H}}_{p,q}(z,t) {:}{=}{\mathcal {L}}_{p,\gamma (t)}(F^t(z)). \end{aligned}$$

(4.7)

Using (4.5) and observing that $\partial _u\psi (ts,z(s))[t]=t\partial _x\psi (ts,z(s))[K(z(s)]$, we can deduce the expression:

$$\begin{aligned} \dot{z}^t = \partial _x\psi (ts,z(s))\big [\dot{z} + tK(z(s))\big ], \end{aligned}$$

(4.8)

so that, by applying also (3.3), we can rewrite ${\mathcal {H}}_{p,q}(z,t)$ as

$$\begin{aligned} {\mathcal {H}}_{p,q}(z,t) = \int _{0}^{1}L\big (z,\dot{z}+ t K(z)\big )\textrm{d}s. \end{aligned}$$

(4.9)

Considering that $F^t|_{{\mathcal {N}}_{p,q}}$ is a diffeomorphism, we obtain the following result, which allows us to focus our study on critical curves of ${\mathcal {H}}_{p,q}$ within ${\mathcal {N}}_{p,q}$.

Proposition 4.2

For $(z,t) \in \Omega _{p,q} (M)\times {\mathbb {R}}$, the following statements hold:

$$\begin{aligned} \partial _z{\mathcal {H}}_{p,q}(z,t)[\xi ] = 0, \qquad \forall \xi \in T_z\Omega _{p,q}(M), \end{aligned}$$

(4.10)

if and only if $z \in {\mathcal {N}}_{p,q}$ and

$$\begin{aligned} \partial _z{\mathcal {H}}_{p,q}(z,t)[\xi ] = 0, \qquad \forall \xi \in T_z{\mathcal {N}}_{p,q}. \end{aligned}$$

(4.11)

Proof

If (4.10) holds, we can use (4.7) and Proposition 4.1 to conclude that $z^t=F^t(z)$ is a critical point of ${\mathcal {L}}_{p,\gamma (t)}$, and by Proposition 3.5, $z^t$ belongs to $\mathcal N_{p,\gamma (t)}$. Consequently, we have $z=F^{-t}(z^t)\in \mathcal N_{p,q}$, and (4.11) trivially follows from (4.10).

For the other implication, we need to show that if $z \in {\mathcal {N}}_{p,q}$, then

$$\begin{aligned} \partial _z{\mathcal {H}}_{p,q}(z,t)[\eta ] = 0, \qquad \forall \eta \in {\mathcal {W}}_{z}. \end{aligned}$$

(4.12)

By contradiction, let’s assume that $z\in {\mathcal {N}}_{p,q}$ and (4.12) does not hold. According to the definition of ${\mathcal {H}}_{p,q}$, there exists $\eta \in {\mathcal {W}}_z$ such that

$$\begin{aligned} \partial _z{\mathcal {H}}_{p,q}(z,t)[\eta ] = \textrm{d}{\mathcal {L}}_{p,\gamma (t)}(F^t(z))\big [ \textrm{d}F^t(z)[\eta ]\big ] \ne 0. \end{aligned}$$

Using (4.3), we know that $\textrm{d}F^t(z)[\eta ] \in {\mathcal {W}}_{z^t}$. Applying Proposition 3.4, we can conclude that $F^t(z) \notin {\mathcal {N}}_{p,\gamma (t)}$, which contradicts Proposition 4.1. $\square $

Corollary 4.3

If (z, t) satisfies (4.10), then $z^t$ is a critical point for ${\mathcal {L}}_{p,\gamma (t)}$. The following Euler-Lagrange equations (in local coordinates) hold:

$$\begin{aligned}{} & {} \frac{\partial L}{\partial x^i}\big (z,\dot{z} + tK(z)\big ) -\frac{\textrm{d}}{\textrm{d}s}\frac{\partial L}{\partial v^i}\big (z,\dot{z}+ tK(z)\big )\nonumber \\{} & {} \qquad + t\frac{\partial L}{\partial v^j}\big (z,\dot{z} + tK(z)\big )\frac{\partial K^j}{\partial x^i}(z) = 0, \quad \forall s \in [0,1], \end{aligned}$$

(4.13)

and there exists $\kappa \in {\mathbb {R}}$ such that

$$\begin{aligned} E\big (z,\dot{z} + tK(z)\big ) = \kappa , \quad \forall s \in [0,1]. \end{aligned}$$

(4.14)

Proof

According to Proposition 4.2, if (4.10) holds, then (4.13) is an immediate consequence of (4.9) and the du Bois-Reymond lemma. By (4.7), $z^t=F^t(z)$ is a critical point of ${\mathcal {L}}$ on $\Omega _{p,\gamma (t)}(M)$. Hence, using Remark 3.2, we can conclude that there exists a constant $\kappa $ such that $E(z^t, \dot{z}^t) = \kappa $. Combining (3.3), (3.4), and (4.8), we obtain (4.14). $\square $

Proposition 4.4

For every $(x,v) \in TM$ and every $t \in {\mathbb {R}}$, the following two equations hold:

$$\begin{aligned} L\big (x,v + tK(x)\big ) = L\big (x,v\big ) + t Q(v) - \frac{1}{2}t^2, \end{aligned}$$

(4.15)

and

$$\begin{aligned} E\big (x,v + tK(x)\big ) = E\big (x,v\big ) + t Q(v) - \frac{1}{2}t^2. \end{aligned}$$

(4.16)

As a consequence, for every $(x,v)\in TM$, we have

$$\begin{aligned} L\big (x,v + tK(x)\big ) - E\big (x,v + tK(x)\big ) = L\big (x,v\big ) - E\big (x,v\big ). \end{aligned}$$

(4.17)

Proof

We will prove (4.15); the computations for (4.16) are analogous. Since the result has a local nature, we can use (2.10). For every $(x,v) \in TM$, we can write

$$\begin{aligned} \begin{aligned} L(x,v + tK)&= L\circ \phi _* \big ((y,t),(\nu ,\tau + t)\big ) \\&= L_0(y,\nu ) + \omega _y(\nu )(\tau + t) - \frac{1}{2}(\tau + t)^2 \\&= \Big (L_0(y,\nu ) + \omega _y(\nu )\tau - \frac{1}{2}\tau ^2\Big ) + \big (\omega _y(\nu ) - \tau \big )t - \frac{1}{2}t^2\\&= L(x,v) + t Q(v) - \frac{1}{2}t^2. \end{aligned} \end{aligned}$$

This completes the proof. $\square $

Using (4.15) and recalling that $Q(\dot{z})$ is constant for all $z \in {\mathcal {N}}_{p,q}$, the functional ${\mathcal {H}}_{p,q}$ can be written as:

$$\begin{aligned} {\mathcal {H}}_{p,q}(z,t) = \int _{0}^{1}L\big (z,\dot{z}\big )\textrm{d}s + tQ(\dot{z}) - \frac{1}{2}t^2 = {\mathcal {L}}(z) +t Q(\dot{z}) - \frac{1}{2}t^2. \end{aligned}$$

(4.18)

Proposition 4.5

Let

$$\begin{aligned} \kappa \le - \sup _{x \in M}L(x,0) \end{aligned}$$

(4.19)

(recall (2.14)). Then the functionals $t_+^{\kappa },t_-^{\kappa }:{\mathcal {N}}_{p,q} \rightarrow {\mathbb {R}}$ defined by

$$\begin{aligned} t_{\pm }^{\kappa }(z) = Q(\dot{z}) \pm \sqrt{Q^2(\dot{z}) + 2\big ( {\mathcal {E}}(z)- \kappa \big )}, \end{aligned}$$

(4.20)

are well-defined, and they satisfy the following equation:

$$\begin{aligned} {\mathcal {E}}(F^{t_{\pm }^{\kappa }(z)}(z)) = \kappa . \end{aligned}$$

(4.21)

Proof

Since $Q(\dot{z})$ is constant for every $z \in {\mathcal {N}}_{p,q}$, from (4.16) we have that $t_{\pm }^{\kappa }(z)$ are the only two solutions of

$$\begin{aligned} {\mathcal {E}}(F^t(z)) = {\mathcal {E}}(z^t) = {\mathcal {E}}(z) + t Q(\dot{z}) - \frac{1}{2}t^2 = \kappa . \end{aligned}$$

Hence, it remains to prove that for every $z \in {\mathcal {N}}_{p,q}$, we have

$$\begin{aligned} {\mathcal {E}}(z) + \frac{1}{2}Q^2(\dot{z}) \ge \kappa , \end{aligned}$$

provided that $\kappa $ satisfies (4.19). As a consequence, it suffices to prove that

$$\begin{aligned} E(x,v) + \frac{1}{2}Q^2(v) \ge \kappa , \qquad \forall (x,v) \in TM. \end{aligned}$$

(4.22)

Using the expression of L in a local chart in a neighbourhood of $x\in M$, in particular (2.11) and (2.12), and setting $(x,v) = \phi _*\big ((y,t),(\nu ,\tau )\big )$, we obtain the following equalities:

$$\begin{aligned} E(x,v) + \frac{1}{2}Q^2(v)= & {} E_0(y,\nu ) + \omega _y(\nu )\tau - \frac{1}{2}\tau ^2 + \frac{1}{2}\big (\omega _y(\nu ) - \tau \big )^2 \nonumber \\= & {} E_0(y,\nu ) + \frac{1}{2}\omega _y^2(\nu ), \end{aligned}$$

(4.23)

where $E_0(y,\nu )$ is the energy function of the Lagrangian $L_0$. As a consequence, using (2.13), we obtain

$$\begin{aligned} E_0(y,\nu ) + \frac{1}{2}\omega _y^2(\nu ) \ge E_0(y,0) = - L_0(y,0) = - L(x,0). \end{aligned}$$

Since $\kappa $ satisfies (4.19), we infer

$$\begin{aligned} E(x,v) + \frac{1}{2}Q^2(v) \ge - L(x,0) \ge \kappa , \qquad \forall (x,v) \in TM, \end{aligned}$$

and we are done. $\square $

Remark 4.6

Our problem naturally leads to the condition (4.19). For a Finsler spacetime (M, L) (see Sect. 2.1), this condition means $\kappa \le 0$. Therefore, we only consider the energy values that correspond to causal geodesics (timelike or lightlike geodesics).

Lemma 4.7

If $\kappa $ satisfies (4.19) then

$$\begin{aligned} {\mathcal {E}}(z) +\frac{1}{2} Q^2(z)>\kappa , \quad \forall z\in \mathcal N_{p,q}. \end{aligned}$$

Proof

From (4.22), it is enough to prove that

$$\begin{aligned} {\mathcal {E}}(z) + \frac{1}{2}Q^2(\dot{z}) \ne \kappa . \end{aligned}$$

By contradiction assume that ${\mathcal {E}}(z) + \frac{1}{2}Q^2(\dot{z}) = \kappa $. Using (4.23) and (2.13), we conclude that in any neighbourhood $U_{z({\bar{s}})}$, ${\bar{s}}\in [0,1]$, as in Remark 2.4, and for a.e. s in a neighbourhood of ${\bar{s}}$, the vector $\dot{z}(s)$ corresponds through $\phi _*$ to a vector whose component in $TS_{z({\bar{s}})}$ vanishes. This is equivalent to the existence of a function $\alpha :[0,1] \rightarrow {\mathbb {R}}$ such that

$$\begin{aligned} \dot{z}(s) = \alpha (s)K(z(s)), \qquad \text {for a.e. } s \in [0,1]. \end{aligned}$$

Since $Q(\dot{z})$ is constant a.e. and $Q(\alpha (s)K(z(s)))=-\alpha (s)$, we deduce that $\alpha $ is constant a.e. and $\dot{z}$ is equivalent to a continuous TM-valued function on [0, 1]. Hence p and q are on the same flow line of K, which is a contradiction. $\square $

Remark 4.8

As a consequence of Lemma 4.7, $t_{\pm }^{\kappa }$ in (4.20) are $C^1$ functionals on ${\mathcal {N}}_{p,q}$.

Corollary 4.9

If $\kappa $ satisfies (4.19), then

$$\begin{aligned} \partial _t{\mathcal {H}}_{p,q}(z,t_+^{\kappa }(z)) \ne 0, \quad \forall z \in {\mathcal {N}}_{p,q}, \end{aligned}$$

and the same holds replacing $t_+^{\kappa }(z)$ with $t_-^{\kappa }(z)$.

Proof

By (4.18) and (4.20), we have

$$\begin{aligned} \partial _t{\mathcal {H}}_{p,q}(z,t_+^{\kappa }(z)) = Q(\dot{z}) - t_+^{\kappa }(z) = - \sqrt{ Q^2(\dot{z}) + 2\big ( {\mathcal {E}}(z)- \kappa \big ) }. \end{aligned}$$

(4.24)

Then, the thesis follows by Lemma 4.7. $\square $

5 Main result

We are ready to proof our main result:

Theorem 5.1

Let $L:TM\rightarrow {\mathbb {R}}$ satisfy Assumptions 1, 2, and 3, and let $\kappa \in {\mathbb {R}}$ satisfy (4.19). A curve $\ell :[0,1] \rightarrow M$ is a solution of the Euler-Lagrange equations (2.1) joining p and $\gamma $ with energy $\kappa $ if and only if there exists $z \in {\mathcal {N}}_{p,q}$ such that $\ell = F^{t_+^{\kappa }(z)}(z)$ or $\ell = F^{t_-^{\kappa }(z)}(z)$, and the following equality holds:

$$\begin{aligned} \textrm{d}t_{\pm }^{\kappa }(z) = \frac{\textrm{d}{\mathcal {E}}(z) - \textrm{d}{\mathcal {L}}(z)}{\sqrt{Q^2(\dot{z})+ 2\big ( {\mathcal {E}}(z)- \kappa \big )}}, \end{aligned}$$

(5.1)

or

$$\begin{aligned} \textrm{d}t_-^{\kappa }(z) = \frac{\textrm{d}{\mathcal {L}}(z) - \textrm{d}{\mathcal {E}}(z)}{\sqrt{Q^2(\dot{z})+ 2\big ( {\mathcal {E}}(z)- \kappa \big )}}. \end{aligned}$$

(5.2)

Proof

Consider a critical curve $\ell \in {\mathcal {N}}_{p,\gamma (t)}$ with energy $\kappa $. We know that $F^t$ is a diffeomorphism, so there exists $z \in {\mathcal {N}}_{p,q}$ such that $F^{-t}(\ell ) = z$ and $t = t_+^{\kappa }(z)$ or $t = t_-^{\kappa }(z)$. For this proof, we will focus on the case where $t = t_+^{\kappa }(z)$.

Since $\ell $ is a critical curve for ${\mathcal {L}}_{p,\gamma (t)}$, by the definition of ${\mathcal {H}}_{p,q}$ (see (4.7)), we have $\partial _z{\mathcal {H}}_{p,q}(z,t) = 0$. Furthermore, using (4.17) and the definition of $t_+^{\kappa }(z)$, we obtain the following equation:

$$\begin{aligned} \begin{aligned} {\mathcal {H}}_{p,q}(z,t_+^{\kappa }(z))&= \int _{0}^{1} L\big (z,\dot{z} + t_+^{\kappa }(z)K(z)\big ) \textrm{d}s\\&= {\mathcal {L}}(z) - {\mathcal {E}}(z) + k, \qquad \forall z \in {\mathcal {N}}_{p,q}. \end{aligned} \end{aligned}$$

(5.3)

By differentiating both sides of (5.3), we obtain:

$$\begin{aligned} \partial _z{\mathcal {H}}_{p,q}(z,t_+^{\kappa }(z)) + \partial _t{\mathcal {H}}_{p,q}(z,t_+^{\kappa }(z))\, \textrm{d}t_+^{\kappa }(z) = \textrm{d}{\mathcal {L}}(z) - \textrm{d}{\mathcal {E}}(z). \end{aligned}$$

(5.4)

Since we know that $\partial _z{\mathcal {H}}_{p,q}(z,t_+^{\kappa }(z)) = 0$, substituting this into (5.4), we get:

$$\begin{aligned} \partial _t{\mathcal {H}}_{p,q}(z,t_+^{\kappa }(z))\, \textrm{d}t_+^{\kappa }(z) = \textrm{d}{\mathcal {L}}(z) - \textrm{d}{\mathcal {E}}(z). \end{aligned}$$

According to Corollary 4.9, we have $\partial _t{\mathcal {H}}_{p,q}(z,t_+^{\kappa }(z)) \ne 0$ for every $z \in {\mathcal {N}}_{p,q}$. Using equation (4.24), we obtain ().

For the converse, if $z \in {\mathcal {N}}_{p,q}$ satisfies (), we can use (5.4) to conclude that $\partial _z{\mathcal {H}}_{p,q}(z,t_+^{\kappa }(z)) = 0$. By Proposition 4.2 and Corollary 4.3, we then deduce that $\ell = F^{t_+^{\kappa }(z)}(z)$ is a critical point of ${\mathcal {L}}_{p,\gamma (t)}$. Hence the thesis follows from Proposition 3.5 and Remark 3.2. $\square $

Remark 5.2

If L is homogeneous of degree 2 in the velocities (i.e., L is a Lorentz-Finsler metric, Definition 2.7), then ${\mathcal {L}}(z) = {\mathcal {E}}(z)$ for every z and, consequently, () and () are equivalent to $\textrm{d}t_+^{\kappa }(z) = 0$ and $\textrm{d}t_-^{\kappa }(z) = 0$, respectively. Hence, in this case we re-obtain, for $\kappa \le 0$, the Fermat’s principle in a stationary spacetime that globally splits [19] (also known as standard stationary spacetime) as well as in a stationary spacetime that may not globally split [53]. Furthermore, we also obtain a Fermat’s principle in a stationary Finsler spacetime that is not necessarily a stationary splitting one (compare with [46, Appendix B]), including also timelike geodesics.

6 An existence and multiplicity result

In this section we assume that L is a Lorentz-Finsler metric as in Sect. 2.1, satisfying Assumption 1. By Theorem 5.1 and Remark 5.2, the critical points z of the functionals $t^\kappa _\pm :\mathcal N_{p,q}\rightarrow {\mathbb {R}}$ give all and only the solutions $\ell $ of (2.1) connecting p to $\gamma $ and having fixed energy $\kappa \le 0$ (recall Remark 4.6) through the relation $\ell =F^{t^\kappa _\pm (z)}(z)$. We are going to show that $t^\kappa _\pm $ satisfy the Palais-Smale condition provided that ${\mathcal {J}}_{p,\gamma (t)}$ (recall Proposition 3.5) is pseudocoercive, for all $t\in {\mathbb {R}}$. Pseudocoercivity is a compactness assumption introduced in [52] and recently revived in [51]. Let us recall it:

Definition 6.1

Let $t, c\in {\mathbb {R}}$; the manifold ${\mathcal {N}}_{p,\gamma (t)}$ is said to be c-precompact if every sequence $(z_n)_n \subset {\mathcal {J}}_{p,\gamma (t)}^c{:}{=}\{z\in \mathcal N_{p,\gamma (t)}:{\mathcal {J}}_{p,\gamma (t)}(z)\le c\}$ has a uniformly convergent subsequence. We say that ${\mathcal {J}}_{p,\gamma (t)}$ is pseudocoercive if ${\mathcal {N}}_{p,\gamma (t)}$ is c-precompact for all $c\in {\mathbb {R}}$.

Remark 6.2

A sufficient condition ensuring that ${\mathcal {J}}_{p,r}$ is pseudocoercive, for all $p,r\in M$, is based on the existence of a $C^1$ function $\varphi :M\rightarrow {\mathbb {R}}$ such that $\textrm{d}\varphi (K)>0$, see [51, Proposition 8.1]. It is then natural to look at this result in the framework of causality properties of a Finsler spacetime as global hyperbolicity. We analyze this question in Appendix B.

Remark 6.3

We point out that if ${\mathcal {J}}_{p,r}$ is pseudocoercive then, for each $c\in {\mathbb {R}}$,

$$\begin{aligned} \sup _{z\in {\mathcal {J}}_{p,r}^c} |Q(\dot{z})|<+\infty , \end{aligned}$$

see [51, Theorem 7.6].

Henceforth, our attention turns to the functional $t_+^\kappa $, recognizing that all the subsequent considerations can be replicated comparably for $t_-^\kappa $.

Lemma 6.4

Let L be a Lorentz-Finsler metric, $p\in M$ and $\gamma =\gamma (t)$ be a flow line of K such that $p\not \in \gamma ({\mathbb {R}})$. Assume that ${\mathcal {J}}_{p,\gamma (t)}$ is pseudocoercive for all $t\in {\mathbb {R}}$. Let $(z_n)\subset {\mathcal {N}}_{p,q}$ such that $t^\kappa _+(z_n)$ is bounded, then

$$\begin{aligned} \sup _{m} |Q(\dot{z}_n)|<+\infty . \end{aligned}$$

(6.1)

Proof

Assume by contradiction that $\sup _{m} |Q(\dot{z}_n)|=+\infty $. Since $q\in \gamma ({\mathbb {R}})$ and ${\mathcal {J}}_{p,\gamma (t)}$ is pseudocoercive for all $t\in {\mathbb {R}}$, from Remark 6.3 necessarily ${\mathcal {J}}_{p,q}(z_n)\rightarrow +\infty $. Since $t^\kappa _+(z_n)$ is bounded and

$$\begin{aligned} t_{+}^{\kappa }(z_n)&= Q(\dot{z}_n) + \sqrt{Q^2(\dot{z}_n) + 2\big ( {\mathcal {E}}(z_n)- \kappa \big )} \nonumber \\&=Q(\dot{z}_n) + \sqrt{Q^2(\dot{z}_n) + 2\big ( {\mathcal {L}}(z_n)- \kappa \big )}, \end{aligned}$$

(6.2)

we get that, up to pass to a subsequence,

$$\begin{aligned} Q(\dot{z}_n)\rightarrow -\infty . \end{aligned}$$

(6.3)

Let $C\ge 0$ such that $|t^\kappa _+(z_n)|\le C$, for all $m\in {\mathbb {N}}$. From (6.2) we then get

$$\begin{aligned} 2{\mathcal {L}}(z_n) \le C^2 -2 C\, Q(\dot{z}_n)+2\kappa . \end{aligned}$$

(6.4)

Take ${\bar{t}}>C$ and consider $z_n^{{\bar{t}}}{:}{=}F^{{\bar{t}}} (z_n)$. Recalling (4.7) and (4.18), we then get from (6.4):

$$\begin{aligned} {\mathcal {L}}(z_n^{{\bar{t}}})= {\mathcal {L}}(z_n) +{\bar{t}} Q(\dot{z}_n) - \frac{1}{2}\bar{t}\,^2\le \frac{C^2}{2} +({\bar{t}} -C)Q(\dot{z}_n)+\kappa \rightarrow -\infty . \end{aligned}$$

By Remark 6.3, we deduce that $\sup _m |Q(\dot{z}^{\bar{t}}_m)|<+\infty $. Then from (4.6),

$$\begin{aligned} \sup _m |Q(\dot{z}_n)|<+\infty , \end{aligned}$$

in contradiction with (6.3). $\square $

Let $z\in {\mathcal {N}}_{p,q}$ and $\zeta \in T_z\Omega _{p,q}(M)$; we recall that the $H^1$-norm of $\zeta $ is given by $\int _0^1g_z(\zeta ', \zeta ')\textrm{d}s$ where $\zeta '$ denotes the covariant derivative of $\zeta $ along z defined by the Levi-Civita connection of the auxiliary Riemannian metric g.

Lemma 6.5

Let $(z_n)_n\subset {\mathcal {N}}_{p,q}$ be a bounded sequence (w.r.t. the topology induced on ${\mathcal {N}}_{p,q}$ by the topology of $\Omega _{p,q}(M)$ $)$ such that their images $z_n([0,1])$ are contained in a compact subset of M and, for each $n\in {\mathbb {N}}$, let $\zeta _n\in T_{z_n}\Omega _{p,q}(M)$. If

$$\begin{aligned} \sup _{n} \int _0^1g_{z_n}(\zeta _n', \zeta _n')\textrm{d}s<+\infty , \end{aligned}$$

then there exist bounded sequences $\xi _n\in T_{z_n}\mathcal N_{p,q}$ and $\mu _n\in H^1_0([0,1],{\mathbb {R}})$ such that $\zeta _n=\xi _n + \mu _n K(z_n)$.

Proof

Since the images of the curves $z_n$ are contained in a compact subset W of M, we can assume that the field K is bounded and the covariant derivatives of the fields $K(z_n)$ along $z_n$ are uniformly bounded in the $L^2$-norm. Thus, it suffices to show that there exists a bounded sequence $(\mu _n)_{n} \subset H^1_{0}([0,1],{\mathbb {R}})$ such that

$$\begin{aligned} \xi _n {:}{=}\zeta _n - \mu _n K(z_n) \in T_{z_n}{\mathcal {N}}_{p,q}, \quad \forall n \in {\mathbb {N}}. \end{aligned}$$

By (3.6), we need to prove that, for each $n\in {\mathbb {N}}$, there exists $c_n \in {\mathbb {R}}$ such that

$$\begin{aligned} \partial _xQ(\dot{z}_n)[\xi _n] + Q(\dot{\xi }_{n}) = c_n, \text { a.e.}, \end{aligned}$$

so we need to solve, with respect to $c_n\in {\mathbb {R}}$ and $\mu _n \in H^1_0([0,1],{\mathbb {R}})$, the following ODE:

$$\begin{aligned} \partial _xQ(\dot{z}_{n})[\zeta _n] + Q(\dot{\zeta }_{n}) - \mu _n \big (\partial _xQ(\dot{z}_n)[K(z_n)] + Q(\dot{K}_n)\big ) + \mu _n' = c_n, \end{aligned}$$

(6.5)

where, $\dot{K}_n$ denotes $\frac{\partial K^i}{\partial x^j}(z_n(s))\dot{z}_n^j(s)\frac{\partial }{\partial x^i}|_{z_n(s)}$. Let us re-write (6.5) as

$$\begin{aligned} \mu _n'(s) - a_n(s)\mu _n(s) = b_n(s), \end{aligned}$$

(6.6)

where

$$\begin{aligned} a_n(s)&= \partial _{x}Q(\dot{z}_n)[K(z_n)] +Q(\dot{K}_n) \end{aligned}$$

and

$$\begin{aligned} b_n(s)&= c_n-h_n(s), \quad \quad h_n(s){:}{=}\partial _xQ(\dot{z}_{n})[\zeta _{n}] +Q(\dot{\zeta }_{n}) \end{aligned}$$

Setting $ A_n(s)=\int _{0}^{s}a_n(\tau )\textrm{d}\tau , $ and

$$\begin{aligned} c_n = \left( \int _{0}^{1} e^{-A_n(s)}\textrm{d}s \right) ^{-1} \left( \int _{0}^{1} e^{A_n(s)}h_n(s)\textrm{d}s \right) , \end{aligned}$$

a solution of (6.6) which satisfies the boundary conditions $\mu _n(0) = \mu _n(1) = 0$ is given by

$$\begin{aligned} \mu _n(s)=e^{A_n(s)}\int _{0}^{s}b_n(\tau )e^{-A_n(\tau )}\textrm{d}\tau . \end{aligned}$$

We notice that the sequence $A_n(s):[0,1]\rightarrow {\mathbb {R}}$ is uniformly bounded in $L^\infty $ since

$$\begin{aligned} |A_n(s)|\le \int _0^1|a_n|\textrm{d}s \le C_1\int _0^1\sqrt{g(\dot{z}_n,\dot{z}_n)}\textrm{d}s, \end{aligned}$$

where $C_1$ is a positive constant depending on the maxima of the absolute values of the components of Q and K and their derivatives, in each coordinate system used to cover W, and on a constant that bounds from above the Euclidean norm with the norm associated with g in each of the same of coordinate system. This implies that the sequence of functions $e^{\pm A_n(s)}$ is also uniformly bounded in $L^\infty $ and then $\left( \int _{0}^{1} e^{-A_n(s)}\textrm{d}s \right) ^{-1}$ is bounded as well. Analogously,

$$\begin{aligned} |h_n(s)|\le C_2\sqrt{g(\dot{z}_n,\dot{z}_n)}, \end{aligned}$$

where now $C_2\ge 0$ is independent of K but depend on an upper bound for the $L^\infty $-norms of the fields $\zeta _n$. Hence $c_n$ is bounded and $b_n$ satisfies then

$$\begin{aligned} |b_n(s)|\le C_3 +C_2\sqrt{g(\dot{z}_n,\dot{z}_n)}, \end{aligned}$$

for some non-negative constant $C_3$. Since

$$\begin{aligned} \mu '_n(s)=a_n(s)e^{A_n(s)}\int _{0}^{s}b_n(\tau )e^{-A_n(\tau )}\textrm{d}\tau + b_n(s) \end{aligned}$$

we get

$$\begin{aligned} |\mu '_n(s)|\le C_4|a_n(s)|+|b_n(s)|, \end{aligned}$$

for some non-negative constant $C_4$, depending also on an upper bound of the sequence $\int _0^1\sqrt{g(\dot{z}_n,\dot{z}_n)}\textrm{d}s$. Hence, $\mu _n$ is bounded in $H^1_0$-norm. $\square $

Lemma 6.6

Let $z\in {\mathcal {N}}_{p,q}$ and $\eta \in {\mathcal {W}}_z$, then $\textrm{d}t^\kappa _+(z)[\eta ]=0$.

Proof

From (5.4), since ${\mathcal {L}}={\mathcal {E}}$, we get

$$\begin{aligned} \partial _z{\mathcal {H}}_{p,q}(z,t_+^{\kappa }(z)) + \partial _t{\mathcal {H}}_{p,q}(z,t_+^{\kappa }(z))\, \textrm{d}t_+^{\kappa }(z)=0. \end{aligned}$$

As showed in the proof of Proposition 4.2, $\partial _z{\mathcal {H}}_{p,q}(z,t_+^{\kappa }(z))[\eta ]=0$, and since, from Corollary 4.9, $\partial _t{\mathcal {H}}_{p,q}(z,t_+^{\kappa }(z))\ne 0$, we get the thesis. $\square $

We are now ready to prove the Palais-Smale condition for $t^\kappa _+$. We recall that a $C^1$ functional $f:\mathcal M\rightarrow {\mathbb {R}}$, defined on a manifold ${\mathcal {M}}$, satisfies the Palais-Smale condition if every sequence $z_n\subset {\mathcal {M}}$ such that $f(z_n)$ is bounded and $\textrm{d}f(z_n)\rightarrow 0$, admits a converging subsequence.

Proposition 6.7

Under the assumptions in Lemma 6.4, $t^\kappa _+:{\mathcal {N}}_{p,q}\rightarrow {\mathbb {R}}$ satisfies the Palais-Smale condition.

Proof

Let $(z_n)_n\subset {\mathcal {N}}_{p,q}$ and $C\ge 0$ such that $|t^\kappa _+(z_n)|\le C$ and $\textrm{d}t^\kappa _+(z_n)\rightarrow 0$. From Lemma 6.4, we have that (6.1) holds. Hence, from (6.2) we deduce that ${\mathcal {L}}(z_n)$ is bounded from above. By the pseudocoercivity assumption, there exists then a subsequence, still denoted by $z_n$, which uniformly converge to a continuous curve $z:[0,1]\rightarrow M$ connecting p to q. Thus, the curves $z_n$ are contained in a compact subset W of M. Hence, from Remark 2.10 there exists a positive constant $\alpha $, depending on W, such that $L_c(x,v)\ge \alpha g(v,v)$, for all $x\in W$ and $v\in T_x M$. Let ${\mathcal {L}}_c$ denote the action functional of $L_c$ and

$$\begin{aligned} {\mathcal {S}}(z){:}{=}\sqrt{Q^2(\dot{z}) + 2\big ( {\mathcal {L}}(z)- \kappa \big )} =\sqrt{2\big ( {\mathcal {L}}_c(z)- \kappa \big )-Q^2(\dot{z})}. \end{aligned}$$

Since $Q(\dot{z}_n)$ is bounded, ${\mathcal {L}}(z_n)$ is bounded from above and

$$\begin{aligned} \alpha \int _0^1g(\dot{z}_n,\dot{z}_n)\le {\mathcal {L}}_c(z_n) =\mathcal L(z_n)+Q^2(\dot{z}_n), \end{aligned}$$

(6.7)

we deduce that ${\mathcal {S}}(z_n)$ and $\int _0^1\,g(\dot{z}_n,\dot{z}_n)\textrm{d}s$ are bounded as well. Moreover, for $z\in \mathcal N_{p,q}$, let us see $Q(\dot{z})$ as a functional ${\mathcal {Q}}$ on ${\mathcal {N}}_{p,q}$ (recall that $Q(\dot{z})$ is constant a.e. on [0, 1]).

Let then $\zeta _n\in T_{z_n}\Omega ^{1,2}_{p,q}(M)$ be a bounded sequence; from Lemma 6.5 there exist two bounded sequences $\xi _n \in T_{z_n}{\mathcal {N}}_{p,q}$ and $\mu _n \in H^1_0([0,1],{\mathbb {R}})$ such that $\zeta _n = \xi _n + \mu _n K_{z_n}$. As $z_n$ is a Palais-Smale sequence, from Lemma 6.6 we obtain

$$\begin{aligned} \textrm{d}t^\kappa _+(z_n)[\zeta _n] =\textrm{d}t^\kappa _+(z_n)[\xi _n]+\textrm{d}t^\kappa _+(z_n)[\mu _nK_{z_n}] = \textrm{d}t^\kappa _+(z_n)[\xi _n]\rightarrow 0. \end{aligned}$$

We now apply a localization argument as in [78] (see also the proof of [51, Theorem 5.6]). Thus, we can assume that L is defined on $[0,1]\times U \times {\mathbb {R}}^{m+1}$, with U an open neighbourhood of 0 in $\mathbb R^{m+1}$. Analogously, we associate to $L_c$ and Q, a time-dependent fiberwise strongly convex Lagrangian $L_{cs}$ in U and a $C^1$ family of linear forms $Q_s$. Moreover, we can identify $(z_n)_n$ with a sequence in the Sobolev space $H^1([0,1],U)$. By (6.7), taking into account that the curves $z_n$ have fixed end-points, we get that $(z_n)_n$ is bounded in $H^1([0,1],U)$ and so it admits a subsequence, still denoted by $(z_n)_n$, which weakly and uniformly converges to a curve $z\in H^1([0,1],{\mathbb {R}}^{m+1})$ which also satisfies the same fixed end-points boundary conditions. The differential at $z_n$ of the localized functional obtained, that we still denote with $t_+^\kappa $, is given by

$$\begin{aligned} {\text {d}}t_{ + }^{\kappa } (z_{n} ) = {\text {d}}{\mathcal {Q}}_{s} (z_{n} ) + \frac{{{\text {d}}{\mathcal {L}}_{{cs}} (z_{n} ) - {\mathcal {Q}}_{s} (z_{n} ){\text {d}}{\mathcal {Q}}_{s} (z_{n} )}}{{{\mathcal {S}}_{s} (z_{n} )}} \end{aligned}$$

where the index s is used to denote the localized functionals. Since ${\mathcal {S}}_s(z_n)$ is bounded we get

$$\begin{aligned} 0 \leftarrow {\mathcal {S}}_s(z_n)\textrm{d}t^\kappa _+(z_n) =\big (\mathcal S_s(z_n)-{\mathcal {Q}}_s(z_n)\big )\textrm{d}{\mathcal {Q}}_s(z_n) +\textrm{d}\mathcal L_{cs}(z_n). \end{aligned}$$

In particular, since $z_n-z$ is bounded in $H^1_0$, we obtain

$$\begin{aligned} \big ({\mathcal {S}}_s(z_n)-{\mathcal {Q}}_s(z_n)\big ) \textrm{d}{\mathcal {Q}}_s(z_n)[z_n-z] +\textrm{d}{\mathcal {L}}_{cs}(z_n)[z_n-z]\rightarrow 0. \end{aligned}$$

(6.8)

Since $z_n\rightarrow z$ uniformly and weakly, we deduce that $\textrm{d}{\mathcal {Q}}_s(z_n)[z_n-z]\rightarrow 0$. As $\mathcal S_s(z_n)-{\mathcal {Q}}_s(z_n)$ is bounded then $\big (\mathcal S_s(z_n)-{\mathcal {Q}}_s(z_n)\big )\textrm{d}{\mathcal {Q}}_s(z_n)[z_n-z]\rightarrow 0$ as well. From (6.8), we then get $\textrm{d}\mathcal L_{cs}(z_n)[z_n-z]\rightarrow 0$. We can then conclude that $z_n\rightarrow z$ in $H^1$-norm thanks to the convexity of $L_{cs}$ as in the proof of [51, Theorem 5.6]. There exists then a subsequence $z_{n_k}$ such that $\dot{z}_{n_k}\rightarrow \dot{z}$, a.e. on [0, 1]. As $Q(\dot{z}_{n_k})=c_k$ a.e., for some $c_k\in {\mathbb {R}}$, we get that also $Q(\dot{z})$ is constant a.e., i.e. $z\in \mathcal N_{p,q}$. $\square $

Lemma 6.8

Under the assumptions of Lemma 6.4, the functional $t^{\kappa }_+:{\mathcal {N}}_{p,q} \rightarrow {\mathbb {R}}$ is bounded from below.

Proof

By contradiction, let us assume the existence of a sequence $(z_n)_n \subset {\mathcal {N}}_{p,q}$ such that $\lim _{n \rightarrow \infty }t^{\kappa }_+(z_n)= -\infty $. From (6.2), this implies that

$$\begin{aligned} \lim _{n \rightarrow \infty } Q(\dot{z}_n) = -\infty , \end{aligned}$$

hence from Remark 6.3, up to pass to a subsequence, ${\mathcal {L}}(z_n)={\mathcal {J}}_{p,q}(z_n) \rightarrow +\infty $. Therefore, from (6.2), $t^{\kappa }_+(z_n)\ge 0$, for n big enough. $\square $

We are now ready to present an existence and multiplicity results for solutions of the Euler-Lagrange equations (2.1). Previous existence results, in the case $\kappa =0$, based on causality techniques were obtained in [46, Proposition 6.2 and Proposition B.2] for Finsler spacetimes that admit a global splitting $S\times {\mathbb {R}}$ endowed with a Lorentz-Finsler metric of the type (2.19) and in [47, Theorem 2.49] in the more general setting of a manifold with a proper cone structure.

Theorem 6.9

Let M be a smooth, connected finite dimensional manifold, $L:TM \rightarrow {\mathbb {R}}$ be a Lorentz-Finsler metric on M satisfying Assumption 1, $p\in M$ and $\gamma :{\mathbb {R}}\rightarrow M$ be a flow line of K such that $p\not \in \gamma ({\mathbb {R}})$. Let us assume that ${\mathcal {J}}_{p,\gamma (t)}$ is pseudocoercive for all $t\in {\mathbb {R}}$. Let $\kappa \le 0$. Then,

(a)
There exists a curve $z:[0,1] \rightarrow M$ that is a solution of Euler-Lagrange equations (2.1) with energy $\kappa $, joining p and $\gamma ({\mathbb {R}})$ and minimizes $t^\kappa _{+}$;
(b)
If M is a non-contractible manifold, then there exists a sequence of curves $z_n:[0,1] \rightarrow M$ that are solutions of Euler-Lagrange equations (2.1) with energy $\kappa $ joining p and $\gamma ({\mathbb {R}})$ and such that $\lim _{n \rightarrow \infty }t^\kappa _{+}(w_n) = + \infty $.

Proof

Since $t^\kappa _{+}$ is a bounded from below, $C^1$ functional defined on a $C^1$ manifold and it satisfies the Palais-Smale condition, both part (a) and (b) follows from [79, Theorem (3.6)], Theorem 5.1 and Remark 5.2, taking into account, for part (b), that if M is non-contractible then the Lusternik-Schnirelmann category of ${\mathcal {N}}_{p,q}$ is $+\infty $ as follows from [51, Proposition 6.4] and [80, Proposition 3.2]. $\square $

Remark 6.10

Assumption (2.4) could be considered quite restrictive; however, for solutions with energy $\kappa =0$, that is not the case for the following reasons:

(1)
Since L is a Lorentz-Finsler metric, solutions $z:[0,1]\rightarrow M$ of the Euler-Lagrange equations (2.1) with $\kappa =0$ satisfy $L\big (z(s),\dot{z}(s)\big )=0$ for all $s\in [0,1]$ and therefore they are lightlike geodesics (see, e.g., [41, 81, 82]).
(2)
According to [83, Proposition 4.4] (see also [82, Proposition 3.4] and [81, Proposition 12]) for any smooth function $\varphi :M\rightarrow (0,+\infty )$ and for any lightlike geodesic $z:[0,1]\rightarrow M$ of L, there exists a reparametrization of z (on some interval $[0,a_z]$) which is a lightlike geodesic of the Lorentz-Finsler metric $\varphi L$.
(3)
Let ${\tilde{L}}$ be a Lorentz-Finsler metric on M which satisfies Assumption 1 with (2.4) replaced by ${\tilde{Q}}(K)<0$ (where ${\tilde{Q}}$ is the Noether charge of $\tilde{L}$). Hence, $L{:}{=}-{\tilde{L}}/{\tilde{Q}}(K)$ satisfies (2.4).
(4)
The infinitesimal symmetry K of ${\tilde{L}}$ remains an infinitesimal symmetry for L. This is a consequence of the fact that the flow $\psi $ of K preserves ${\tilde{Q}}(K)$ (see the proof of [51, Proposition 2.5-(iv)]), and then
$$\begin{aligned} \dfrac{\partial \big (L\circ \psi ^c\big )}{\partial t}(t,x,v)= & {} K^c(L)\big (\psi ^c(t,x,v)\big )=K^c\big (-{\tilde{L}}/{\tilde{Q}}(K)\big )\big (\psi ^c(t,x,v)\big )\\= & {} -\dfrac{\partial \big ({\tilde{L}}\circ \psi ^c/({\tilde{Q}}(K)\circ \psi )\big )}{\partial t}(t,x,v)\\= & {} \Big (\big ({\tilde{Q}}(K)\circ \psi \big )^{-2}\dfrac{\partial \big ({\tilde{Q}}(K)\circ \psi \big )}{\partial t}{\tilde{L}}\circ \psi ^c\\{} & {} -\big ({\tilde{Q}}(K)\circ \psi \big )^{-1}\dfrac{\partial \big ({\tilde{L}}\circ \psi ^c\big )}{\partial t}\Big )(t,x,v)=0, \end{aligned}$$
(recall the beginning of Sect. 3.1).

Summing up, Theorem 6.9 also holds (replacing [0, 1] with unknown interval of parametrizations $[0, a_z]$) for a Lorentz-Finsler metric ${\tilde{L}}$ on M which satisfies Assumption 1 with (2.4) replaced by $\tilde{Q}(K)<0$.

Notes

We would like to draw attention to a misprint in [51, Lemma 4.4] where we note that the “direct sum” should be corrected to “sum” as it appears there.

References

Mañé, R.: Lagrangian flows: the dynamics of globally minimizing orbits. Bol. Soc. Brasil. Mat. (N.S.) 28(2), 141–153 (1997). https://doi.org/10.1007/BF01233389
Article MathSciNet Google Scholar
Abbondandolo, A.: Lectures on the free period Lagrangian action functional. J. Fixed Point Theory Appl. 13(2), 397–430 (2013). https://doi.org/10.1007/s11784-013-0128-1
Article MathSciNet Google Scholar
Contreras, G., Delgado, J., Iturriaga, R.: Lagrangian flows: the dynamics of globally minimizing orbits II. Bol. Soc. Brasil. Mat. (N.S.) 28(2), 155–196 (1997). https://doi.org/10.1007/BF01233390
Article MathSciNet Google Scholar
Contreras, G.: The Palais-Smale condition on contact type energy levels for convex Lagrangian systems. Calc. Var. Partial Diff. Equ. 27(3), 321–395 (2006). https://doi.org/10.1007/s00526-005-0368-z
Article MathSciNet Google Scholar
Corona, D.: A multiplicity result for euler-lagrange orbits satisfying the conormal boundary conditions. J. Fixed Point Theory Appl. 22(3), 60 (2020). https://doi.org/10.1007/s11784-020-00795-4
Article MathSciNet Google Scholar
Corona, D., Giannoni, F.: A new approach for Euler-Lagrange orbits on compact manifolds with boundary. Symmetry 12(11), 1917 (2020). https://doi.org/10.3390/sym12111917
Article Google Scholar
Asselle, L., Benedetti, G., Mazzucchelli, M.: Minimal boundaries in Tonelli Lagrangian systems. Int. Math. Res. Not. IMRN 2021(20), 15746–15787 (2021). https://doi.org/10.1093/imrn/rnz246
Article MathSciNet Google Scholar
Kovner, I.: Fermat principles for arbitrary space-times. Astrophys. J. 351, 114–120 (1990). https://doi.org/10.1086/168450
Article Google Scholar
Perlick, V.: On Fermat’s principle in general relativity. I. The general case. Class. Quantum Gravity 7(8), 1319–1331 (1990). https://doi.org/10.1088/0264-9381/7/8/011
Article MathSciNet Google Scholar
Levi-Civita, T.: Statica einsteiniana. Atti della Reale Accademia dei Lincei. Rendiconti 26, 458–470 (1917). https://doi.org/10.1007/bf02959761
Article Google Scholar
Levi-Civita, T.: La teoria di Einstein e il principio di Fermat. Nuovo Cimento 16, 105–114 (1918). https://doi.org/10.1007/bf02959761
Article Google Scholar
Pham, M.Q.: Inductions électromagnétiques en relativité générale et principe de Fermat. Arch. Ration. Mech. Anal. 1, 54–80 (1957). https://doi.org/10.1007/BF00297996
Article Google Scholar
Gibbons, G.W., Werner, M.C.: Applications of the Gauss-Bonnet theorem to gravitational lensing. Class. Quantum Gravity 25(23), 235009 (2008). https://doi.org/10.1088/0264-9381/25/23/235009
Article MathSciNet Google Scholar
Gibbons, G.W., Herdeiro, C.A.R., Warnick, C.M., Werner, M.C.: Stationary metrics and optical Zermelo-Randers-Finsler geometry. Phys. Rev. D 79(4), 044022 (2009). https://doi.org/10.1103/PhysRevD.79.044022
Article MathSciNet Google Scholar
Caponio, E., Javaloyes, M.A., Masiello, A.: On the energy functional on Finsler manifolds and applications to stationary spacetimes. Math. Ann. 351(2), 365–392 (2011). https://doi.org/10.1007/s00208-010-0602-7
Article MathSciNet Google Scholar
Caponio, E., Javaloyes, M.A., Sánchez, M.: On the interplay between Lorentzian causality and Finsler metrics of randers type. Rev. Mat. Iberoam. 27(3), 919–952 (2011). https://doi.org/10.4171/RMI/658
Article MathSciNet Google Scholar
Pham, M.Q.: Projections des géodésiques de longueur nulle et rayons électromagnétiques dans un milieu en mouvement permanent. C. R. Acad. Sci. Paris 242, 875–878 (1956)
MathSciNet Google Scholar
Uhlenbeck, K.: A Morse theory for geodesics on a Lorentz manifold. Topology 14, 69–90 (1975). https://doi.org/10.1016/0040-9383(75)90037-3
Article MathSciNet Google Scholar
Fortunato, D., Giannoni, F., Masiello, A.: A Fermat principle for stationary space-times and applications to light rays. J. Geom. Phys. 15(2), 159–188 (1995). https://doi.org/10.1016/0393-0440(94)00011-R
Article MathSciNet Google Scholar
Antonacci, F., Piccione, P.: A Fermat principle on Lorentzian manifolds and applications. Appl. Math. Lett. 9(2), 91–95 (1996). https://doi.org/10.1016/0893-9659(96)00019-5
Article MathSciNet Google Scholar
Perlick, V., Piccione, P.: A general-relativistic Fermat principle for extended light sources and extended receivers. Gen. Relativ. Gravit. 30(10), 1461–1476 (1998). https://doi.org/10.1023/A:1018861024445
Article MathSciNet Google Scholar
Frolov, V.P.: Generalized Fermat’s principle and action for light rays in a curved spacetime. Phys. Rev. D 88, 064039 (2013). https://doi.org/10.1103/PhysRevD.88.064039
Article Google Scholar
Caponio, E., Javaloyes, M.A., Sánchez, M.: Wind Finslerian structures: from Zermelo’s navigation to the causality of spacetimes. Memoirs AMS (in press) arXiv:1407.5494v5 [math.DG]
Giannoni, F., Masiello, A., Piccione, P.: A timelike extension of Fermat’s principle in General Relativity and applications. Calc. Var. Partial Diff. Equ. 6(3), 263–283 (1998). https://doi.org/10.1007/s005260050091
Article MathSciNet Google Scholar
Giannoni, F., Masiello, A., Piccione, P.: A variational theory for light rays in stably causal Lorentzian manifolds: regularity and multiplicity results. Commun. Math. Phys. 187(2), 375–415 (1997). https://doi.org/10.1007/s002200050141
Article MathSciNet Google Scholar
Giannoni, F., Masiello, A., Piccione, P.: A Morse theory for light rays on stably causal Lorentzian manifolds. Ann. Inst. H. Poincaré Phys. Théor. 69(4), 359–412 (1998)
MathSciNet Google Scholar
Giannoni, F.: Global variational methods in general relativity with applications to gravitational lensing. Ann. Phys. 8(10), 849–859 (1999)
Article MathSciNet Google Scholar
Giannoni, F., Masiello, A., Piccione, P.: A Morse theory for massive particles and photons in general relativity. J. Geom. Phys. 35(1), 1–34 (2000). https://doi.org/10.1016/S0393-0440(99)00045-5
Article MathSciNet Google Scholar
Giannoni, F., Masiello, A., Piccione, P.: The Fermat principle in general relativity and applications. J. Math. Phys. 43(1), 563–596 (2002). https://doi.org/10.1063/1.1415428
Article MathSciNet Google Scholar
Caponio, E., Javaloyes, M.A., Masiello, A.: Morse theory of causal geodesics in a stationary spacetime via Morse theory of geodesics of a Finsler metric. Ann. Inst. H. Poincaré C Anal. Non Linéaire 27(3), 857–876 (2010). https://doi.org/10.1016/j.anihpc.2010.01.001
Article MathSciNet Google Scholar
Hod, S.: Fermat’s principle in black-hole spacetimes. Int. J. Mod. Phys. D 27(14), 1847025 (2018). https://doi.org/10.1142/S0218271818470259
Article Google Scholar
Faraoni, V.: Nonstationary gravitational lenses and the Fermat principle. Astrophys. J. 398(2), 425–428 (1992). https://doi.org/10.1086/171866
Article Google Scholar
Nandor, M.J., Helliwell, T.M.: Fermat’s principle and multiple imaging by gravitational lenses. Am. J. Phys. 64(1), 45–49 (1996). https://doi.org/10.1119/1.18291
Article MathSciNet Google Scholar
Frittelli, S., Kling, T.P., Newman, E.T.: Fermat potentials for nonperturbative gravitational lensing. Phys. Rev. D 65(12), 123007 (2002). https://doi.org/10.1103/PhysRevD.65.123007
Article MathSciNet Google Scholar
Sereno, M.: Gravitational lensing in metric theories of gravity. Phys. Rev. D 67(6), 064007 (2003). https://doi.org/10.1103/PhysRevD.67.064007
Article MathSciNet Google Scholar
Giambò, R., Giannoni, F., Piccione, P.: Gravitational lensing in general relativity via bifurcation theory. Nonlinearity 17(1), 117–132 (2004). https://doi.org/10.1088/0951-7715/17/1/008
Article MathSciNet Google Scholar
Virbhadra, K.S., Ellis, G.F.R.: Schwarzschild black hole lensing. Phys. Rev. D 62(8), 084003 (2000). https://doi.org/10.1103/PhysRevD.62.084003
Article MathSciNet Google Scholar
Halla, M., Perlick, V.: Application of the Gauss-Bonnet theorem to lensing in the NUT metric. Gen. Relativ. Gravit. 52(11), 1 (2020). https://doi.org/10.1007/s10714-020-02766-z
Article MathSciNet Google Scholar
Annila, A.: Least-time paths of light. Mon. Not. R. Astron. Soc. 416(4), 2944–2948 (2011). https://doi.org/10.1111/j.1365-2966.2011.19242.x
Article Google Scholar
Perlick, V.: Gravitational lensing from a spacetime perspective. Living Rev. Relat. 7(9) (2004). https://doi.org/10.12942/lrr-2004-9
Perlick, V.: Fermat principle in Finsler spacetimes. Gen. Relativ. Gravit. 38(2), 365–380 (2006). https://doi.org/10.1007/s10714-005-0225-6
Article MathSciNet Google Scholar
Duval, C.: Finsler spinoptics. Commun. Math. Phys. 283(3), 701–727 (2008). https://doi.org/10.1007/s00220-008-0573-7
Article MathSciNet Google Scholar
Masiello, A.: An alternative variational principle for geodesies of a randers metric. Adv. Nonlinear Stud. 9(4), 783–801 (2009). https://doi.org/10.1515/ans-2009-0410
Article MathSciNet Google Scholar
Gallego Torromé, R., Piccione, P., Vitório, H.: On Fermat’s principle for causal curves in time oriented Finsler spacetimes. J. Math. Phys. 53(12), 123511 (2012). https://doi.org/10.1063/1.4765066
Article MathSciNet Google Scholar
Caponio, E., Stancarone, G.: Standard static Finsler spacetimes. Int. J. Geom. Methods Mod. Phys. 13(4), 1650040 (2016). https://doi.org/10.1142/S0219887816500407
Article MathSciNet Google Scholar
Caponio, E., Stancarone, G.: On Finsler spacetimes with a timelike killing vector field. Class. Quantum Gravity 35(8), 085007 (2018). https://doi.org/10.1088/1361-6382/aab0d9
Article MathSciNet Google Scholar
Minguzzi, E.: Causality theory for closed cone structures with applications. Rev. Math. Phys. 31(5), 1930001–139 (2019). https://doi.org/10.1142/S0129055X19300012
Article MathSciNet Google Scholar
Javaloyes, M.A., Sánchez, M.: On the definition and examples of cones and Finsler spacetimes. Rev. R. Acad. Cienc. Exactas Fís Nat. Ser. A Mat. 114(1), 30 (2020). https://doi.org/10.1007/s13398-019-00736-y
Article MathSciNet Google Scholar
Herrera, J., Javaloyes, M.A.: Stationary-complete spacetimes with non-standard splittings and pre-randers metrics. J. Geom. Phys. 163, 104120 (2021). https://doi.org/10.1016/j.geomphys.2021.104120
Article MathSciNet Google Scholar
Caponio, E., Giannoni, F., Masiello, A., Suhr, S.: Connecting and closed geodesics of a Kropina metric. Adv. Nonlinear Stud. 21(3), 683–695 (2021). https://doi.org/10.1515/ans-2021-2133
Article MathSciNet Google Scholar
Caponio, E., Corona, D.: A variational setting for an indefinite Lagrangian with an affine Noether charge. Calc. Var. Partial Diff. Equ. 62(2), 39 (2023). https://doi.org/10.1007/s00526-022-02379-1
Article MathSciNet Google Scholar
Giannoni, F., Piccione, P.: An intrinsic approach to the geodesical connectedness of stationary Lorentzian manifolds. Commun. Anal. Geom. 7(1), 157–197 (1999). https://doi.org/10.4310/CAG.1999.v7.n1.a6
Article MathSciNet Google Scholar
Caponio, E.: An intrinsic Fermat principle on stationary Lorentzian manifolds and applications. Diff. Geom. Appl. 16(3), 245–265 (2002). https://doi.org/10.1016/S0926-2245(02)00069-4
Article MathSciNet Google Scholar
Beem, J.K.: Indefinite Finsler spaces and timelike spaces. Can. J. Math. 22, 1035–1039 (1970). https://doi.org/10.4153/CJM-1970-119-7
Article MathSciNet Google Scholar
Lämmerzahl, C., Perlick, V., Hasse, W.: Observable effects in a class of spherically symmetric static Finsler spacetimes. Phys. Rev. D 86, 104042 (2012). https://doi.org/10.1103/PhysRevD.86.104042
Article Google Scholar
Gibbons, G.W., Gomis, J., Pope, C.N.: General very special relativity is Finsler geometry. Phys. Rev. D 76(8), 081701 (2007). https://doi.org/10.1103/PhysRevD.76.081701
Article MathSciNet Google Scholar
Kouretsis, A.P., Stathakopoulos, M., Stavrinos, P.C.: General very special relativity in Finsler cosmology. Phys. Rev. D 79(10), 104011 (2009). https://doi.org/10.1103/PhysRevD.79.104011
Article MathSciNet Google Scholar
Bogoslovsky, G.Y.: A special-relativistic theory of the locally anisotropic space-time. I: the metric and group of motions of the anisotropic space of events. Il Nuovo Cimento B 40, 99–115 (1977). https://doi.org/10.1007/BF02739183
Article Google Scholar
Bogoslovsky, G.Y.: A special-relativistic theory of the locally anisotropic space-time II mechanics and electrodynamics in the anisotropic space. Nuovo Cimento B 40, 116–134 (1977). https://doi.org/10.1007/BF02739184
Article Google Scholar
Bogoslovsky, G.Y.: A viable model of locally anisotropic space-time and the Finslerian generalization of the relativity theory. Fortschr. Phys. 42(2), 143–193 (1994). https://doi.org/10.1002/prop.2190420203
Article MathSciNet Google Scholar
Girelli, F., Liberati, S., Sindoni, L.: Planck-scale modified dispersion relations and Finsler geometry. Phys. Rev. D 75(6), 064015 (2007). https://doi.org/10.1103/PhysRevD.75.064015
Article MathSciNet Google Scholar
Kostelecký, V.A.: Riemann-Finsler geometry and Lorentz-violating kinematics. Phys. Lett. B 701(1), 137–143 (2011). https://doi.org/10.1016/j.physletb.2011.05.041
Article MathSciNet Google Scholar
Colladay, D., McDonald, P.: Classical Lagrangians for momentum dependent Lorentz violation. Phys. Rev. D 85, 044042 (2012). https://doi.org/10.1103/PhysRevD.85.044042
Article Google Scholar
Kostelecký, V.A., Russell, N., Tso, R.: Bipartite Riemann-Finsler geometry and Lorentz violation. Phys. Lett. B 716(3–5), 470–474 (2012). https://doi.org/10.1016/j.physletb.2012.09.002
Article MathSciNet Google Scholar
Russell, N.: Finsler-like structures from Lorentz-breaking classical particles. Phys. Rev. D 91(4), 045008 (2015). https://doi.org/10.1103/PhysRevD.91.045008
Article MathSciNet Google Scholar
Pfeifer, C., Wohlfarth, M.N.R.: Causal structure and electrodynamics on Finsler spacetimes. Phys. Rev. D (2011). https://doi.org/10.1103/PhysRevD.84.0440391104.1079
Article Google Scholar
Fuster, A., Pabst, C.: Finsler $pp$-waves. Phys. Rev. D 94(10), 104072 (2016). https://doi.org/10.1103/physrevd.94.104072
Article MathSciNet Google Scholar
Hohmann, M., Pfeifer, C.: Geodesics and the magnitude-redshift relation on cosmologically symmetric Finsler spacetimes. Phys. Rev. D 95(10), 104021 (2017). https://doi.org/10.1103/physrevd.95.104021
Article MathSciNet Google Scholar
Voicu, N.: Volume forms for time orientable Finsler spacetimes. J. Geom. Phys. 112, 85–94 (2017). https://doi.org/10.1016/j.geomphys.2016.11.005
Article MathSciNet Google Scholar
Fuster, A., Pabst, C., Pfeifer, C.: Berwald spacetimes and very special relativity. Phys. Rev. D 98(8), 084062 (2018). https://doi.org/10.1103/physrevd.98.084062
Article MathSciNet Google Scholar
Hohmann, M., Pfeifer, C., Voicu, N.: Finsler gravity action from variational completion. Phys. Rev. D 100(6), 064035 (2019). https://doi.org/10.1103/physrevd.100.064035
Article MathSciNet Google Scholar
Javaloyes, M.A., Sánchez, M.: Finsler metrics and relativistic spacetimes. Int. J. Geom. Methods Mod. Phys. 11(9), 1460032 (2014). https://doi.org/10.1142/S0219887814600329
Article MathSciNet Google Scholar
Minguzzi, E.: Light cones in Finsler spacetime. Commun. Math. Phys. 334(3), 1529–1551 (2015). https://doi.org/10.1007/s00220-014-2215-6
Article MathSciNet Google Scholar
Minguzzi, E.: Convex neighborhoods for Lipschitz connections and sprays. Monatsh. Math. 177, 569–625 (2015). https://doi.org/10.1007/s00605-014-0699-y
Article MathSciNet Google Scholar
Caponio, E., Masiello, A.: On the analyticity of static solutions of a field equation in Finsler gravity. Universe 6, 59 (2020). https://doi.org/10.3390/universe6040059
Article Google Scholar
Minguzzi, E.: An equivalence of Finslerian relativistic theories. Math. Phys. 77, 45–55 (2016). https://doi.org/10.1016/S0034-4877(16)30004-0
Article MathSciNet Google Scholar
Palais, R.S.: Morse theory on Hilbert manifolds. Topology 2, 299–340 (1963). https://doi.org/10.1016/0040-9383(63)90013-2
Article MathSciNet Google Scholar
Abbondandolo, A., Schwarz, M.: A smooth pseudo-gradient for the Lagrangian action functional. Adv. Nonlinear Stud. 9, 597–623 (2009). https://doi.org/10.1515/ans-2009-0402
Article MathSciNet Google Scholar
Corvellec, J.-N., Degiovanni, M., Marzocchi, M.: Deformation properties for continuous functionals and critical point theory. Topol. Methods Nonlinear Anal. 1(1), 151 (1993). https://doi.org/10.12775/TMNA.1993.012
Article MathSciNet Google Scholar
Fadell, E., Husseini, S.: Category of loop spaces of open subsets in Euclidean space. Nonlinear Anal. 17(12), 1153–1161 (1991). https://doi.org/10.1016/0362-546X(91)90234-R
Article MathSciNet Google Scholar
Minguzzi, E.: Affine sphere relativity. Commun. Math. Phys. 350, 749–801 (2017). https://doi.org/10.1007/s00220-016-2802-9
Article MathSciNet Google Scholar
Javaloyes, M.A., Soares, B.L.: Anisotropic conformal invariance of lightlike geodesics in pseudo-Finsler manifolds. Class. Quantum Gravity 38(2), 16 (2021). https://doi.org/10.1088/1361-6382/abc225Id/No025002
Article MathSciNet Google Scholar
Javaloyes, M.A., Soares, B.L.: Geodesics and Jacobi fields of pseudo-Finsler manifolds. arXiv:1401.8149v1 [math.DG] (2014) https://doi.org/10.48550/arXiv.1401.8149
Candela, A.M., Flores, J.L., Sánchez, M.: Global hyperbolicity and Palais-Smale condition for action functionals in stationary spacetimes. Adv. Math. 218, 515–556 (2008). https://doi.org/10.1016/j.aim.2008.01.004
Article MathSciNet Google Scholar
Fathi, A., Siconolfi, A.: On smooth time functions. Math. Proc. Camb. Philos. Soc. 152, 303–339 (2012). https://doi.org/10.1017/S0305004111000661
Article MathSciNet Google Scholar
Bao, D., Chern, S.S., Shen, Z.: An Introduction to Riemann-Finsler Geometry. Springer-Verlag, New York (2000)
Book Google Scholar

Download references

Acknowledgements

D. Corona is partially supported by the FAPESP (São Paulo, Brazil) grant n. 2022/13010-3. E. Caponio is partially supported by European Union - Next Generation EU - PRIN 2022 PNRR “P2022YFAJH Linear and Nonlinear PDE’s: New directions and Applications”. E. Caponio, D. Corona, R. Giambòthank the partial support of GNAMPA INdAM - Italian National Institute of High Mathematics, project CUP-E55F22000270001. P. Piccione is partially sponsored by FAPESP (São Paulo, Brazil), grant n. 2022/16097-2, and by CNPq, Brazil. We would like to thank a referee for providing us with valuable comments.

Author information

Erasmo Caponio, Dario Corona, Roberto Giambò and Paolo Piccione have contributed equally to this work.

Authors and Affiliations

Dipartimento di Meccanica, Matematica e Management, Politecnico di Bari, Bari, Italy
Erasmo Caponio
School of Science and Technology, University of Camerino, Camerino, Italy
Dario Corona & Roberto Giambò
Istituto Nazionale di Fisica Nucleare, Sezione di Perugia, Perugia, Italy
Roberto Giambò
Institute of Mathematics and Statistics, Universidade de São Paulo, São Paulo, Brazil
Paolo Piccione

Authors

Erasmo Caponio
View author publications
You can also search for this author in PubMed Google Scholar
Dario Corona
View author publications
You can also search for this author in PubMed Google Scholar
Roberto Giambò
View author publications
You can also search for this author in PubMed Google Scholar
Paolo Piccione
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Erasmo Caponio.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix A Affine Noether charge

In this section, we briefly show that Theorem 5.1 holds even if the Noether charge is an affine function with respect to v. Specifically, there exists a $C^1$ one-form Q on M and a $C^1$ function $d:M \rightarrow {\mathbb {R}}$ such that (2.3) is replaced by

$$\begin{aligned} N(x,v) {:}{=}\partial _vL(x,v)[K] = Q(v) + d(x); \end{aligned}$$

(A1)

and d is invariant under the one-parameter group of $C^3$ diffeomorphisms generated by K. In such a case, the stationary type local structure is given by

$$\begin{aligned} L(x,v) = L\circ \phi _* \big ((y,t),(\nu ,\tau )\big ) = L_0(y,\nu ) + \big (\omega _y(\nu ) + d(y)\big )\tau - \frac{1}{2}\tau ^2, \end{aligned}$$

(A2)

so we have

$$\begin{aligned} E(x,v) = E\circ \phi _* \big ((y,t),(\nu ,\tau )\big ) = E_0(y,\nu ) + \omega _y(\nu ) \tau - \frac{1}{2}\tau ^2. \end{aligned}$$

(A3)

Moreover, the set ${\mathcal {N}}_{p,r}$ is given by

$$\begin{aligned} {\mathcal {N}}_{p,r} {:}{=}\{z \in \Omega _{p,r}(M): \exists c \in {\mathbb {R}} \text { such that } N(z,\dot{z}) = c, \text { a.e. on } [0,1]\}\subset \Omega _{p,r}(M), \end{aligned}$$

and Proposition 3.5 still holds. Moreover, defining $F^t:\Omega _{p,q}(M) \rightarrow \Omega _{p,\gamma (t)}(M)$ as (4.1) and ${\mathcal {H}}_{p,q}:\Omega _{p,q}(M) \times {\mathbb {R}}\rightarrow {\mathbb {R}}$ as in (4.7), it is possible to prove both Proposition 4.2 and Corollary 4.3. The main difference with the linear case is that Proposition 4.4 doesn’t hold and it is replaced by the following result, whose proof is a based on a computation in local charts which employs (A2).

Proposition A.1

For every $(x,v) \in TM$ and for every $t \in {\mathbb {R}}$, the following two equations holds:

$$\begin{aligned} L\big (x,v + tK(x)\big ) = L\big (x,v\big ) + t N(x,v) - \frac{1}{2}t^2, \end{aligned}$$

and

$$\begin{aligned} E\big (x,v + tK(x)\big ) = E\big (x,v\big ) + t Q(v) - \frac{1}{2}t^2. \end{aligned}$$

As a consequence, for every $(x,v)\in TM$ we have

$$\begin{aligned} L\big (x,v + tK(x)\big ) - E\big (x,v + tK(x)\big ) = L\big (x,v\big ) - E\big (x,v\big ) + t d(x). \end{aligned}$$

(A4)

Since on any curve $z \in {\mathcal {N}}_{p,q}$ the quantities $Q(\dot{z})$ and d(z) are not necessarily constant, let us introduce the functionals ${\mathcal {Q}}:{\mathcal {N}}_{p,q} \rightarrow {\mathbb {R}}$ and ${\mathcal {D}}:{\mathcal {N}}_{p,q} \rightarrow {\mathbb {R}}$ as follows:

$$\begin{aligned} {\mathcal {Q}}(z) {:}{=}\int _{0}^{1}Q(\dot{z})\textrm{d}s, \quad \text {and}\quad {\mathcal {D}}(z) {:}{=}\int _0^1 d(z)\textrm{d}s. \end{aligned}$$

Using this notation, for every $z\in {\mathcal {N}}_{p,q}$ the two quantities $t_{\pm }^{\kappa }(z)$ that satisfy (4.21) are given by

$$\begin{aligned} t_{\pm }^{\kappa }(z) = {\mathcal {Q}}(z) \pm \sqrt{ {\mathcal {Q}}^2(z) + 2\big ( {\mathcal {E}}(z)- \kappa \big ) }, \end{aligned}$$

and they are still well defined if $\kappa $ satisfies (4.19). Moreover, since the function d doesn’t appear in the expression of E(x, v) in local coordinates (see (A3)), Corollary 4.9 still holds. Because of the difference between (A4) and (4.17), we have that in the affine case the equation analogous to (5.3) is

$$\begin{aligned} {\mathcal {H}}_{p,q}(z,t_+^{\kappa }(z)) = {\mathcal {L}}(z) -{\mathcal {E}}(z) + \kappa + t_+^{\kappa }(z)\big (N(z,\dot{z}) - {\mathcal {Q}}(z)\big )\\ = {\mathcal {L}}(z) -{\mathcal {E}}(z) + \kappa + t_+^{\kappa }(z){\mathcal {D}}(z). \end{aligned}$$

As a consequence, using a similar proof of the one of Theorem 5.1, we obtain the following result.

Theorem A.2

Let $L:TM \rightarrow {\mathbb {R}}$ satisfy assumptions 1, 2, and 3, with (A1) instead of (2.3), and let $\kappa \in {\mathbb {R}}$ satisfy (4.19). A curve $\ell $ is a solution of the Euler-Lagrange equations (2.1) joining p and $\gamma $ with energy $\kappa $ if and only if there exists $z \in {\mathcal {N}}_{p,q}$ such that $\ell = F^{t_+^{\kappa }(z)}(z)$ or $\ell = F^{t_-^{\kappa }(z)}(z)$, and the following equality holds:

$$\begin{aligned} \textrm{d}t_+^{\kappa }(z) = \frac{\textrm{d}{\mathcal {E}}(z) - \textrm{d}{\mathcal {L}}(z) - t_+^{\kappa }(z)\textrm{d}{\mathcal {D}}(z)}{\sqrt{{\mathcal {Q}}^2(\dot{z})+ 2\big ( {\mathcal {E}}(z)- \kappa \big )}}, \end{aligned}$$

or

$$\begin{aligned} \textrm{d}t_-^{\kappa }(z) = \frac{\textrm{d}{\mathcal {L}}(z) - \textrm{d}{\mathcal {E}}(z) + t_-^{\kappa }(z)\textrm{d}{\mathcal {D}}(z)}{\sqrt{{\mathcal {Q}}^2(\dot{z})+ 2\big ( {\mathcal {E}}(z)- \kappa \big )}}. \end{aligned}$$

Corollary A.3

Let $L:TM \rightarrow {\mathbb {R}}$ satisfy assumptions 1, 2, and 3, with (A1) instead of (2.3), and let $\kappa \in {\mathbb {R}}$ satisfy (4.19). Moreover, assume that $d:M \rightarrow {\mathbb {R}}$ is a constant function. Then, a curve $\ell $ is a solution of the Euler-Lagrange equations (2.1) joining p and $\gamma $ with energy $\kappa $ if and only if there exists $z \in {\mathcal {N}}_{p,q}$ such that $\ell = F^{t_+^{\kappa }(z)}(z)$ or $\ell = F^{t_-^{\kappa }(z)}(z)$, and () or () holds.

Appendix B Pseudocoercivity and global hyperbolicity

In this appendix we show that pseudocoercivity and global hyperbolicity of a Finsler spacetime (M, L) as defined in Subsection 2.1, are connected notions. We refer to [47, 73] for the needed notions of causality, and in particular of global hyperbolicity and of a Cauchy hypersurface, in Finsler spacetimes and in the more general framework of proper cone structures (see [47, Definition 2.4]). We notice indeed that M is endowed with a continuous cone structure ${\mathcal {C}}{:}{=}\{(x,v)\in TM: L(x,v)\le 0, Q(v)<0\}$. In fact, from the local expression of L (2.19), we deduce that $(\nu ,\tau )\in {\mathcal {C}}_{(y,t)}{:}{=}{\mathcal {C}}\cap T_{(y,t)}M$ if and only if

$$\begin{aligned} \tau \ge \omega (\nu )+\sqrt{\omega ^2(\nu ) +2F^2(y,\nu )},\ \end{aligned}$$

(B1)

and since $F^2(y,\cdot )$ is strongly convex, we deduce that ${\mathcal {C}}_{(y,t)}\cup \{0\}$ is a closed, convex, sharp cone with non-empty interior.

Our first aim would be to extend [84, Theorem 5.1], which states that if a stationary Lorentzian manifold is globally hyperbolic with a complete Cauchy hypersurface then it is pseudocoercive. We obtain a result in that direction, namely Proposition B.2, that ensures pseudocoerciveness from the global hyperbolicity in our setting requiring some other technical assumptions that are trivially satisfied in the Lorentzian setting.

Lemma B.1

Let (M, L) be a Finsler spacetime (i.e. $L_F :TM \rightarrow {\mathbb {R}}$ satisfies $(a')$, (b) and $(c')$ in Definition 2.7) such that Assumption 1 holds. If (M, L) is globally hyperbolic (i.e. the cone structure ${\mathcal {C}}$ associated to L is globally hyperbolic) then M globally splits as $S\times \mathbb R$ and L is given on $S\times {\mathbb {R}}$ by an expression of the type (2.10), with $L_0{:}{=}L|_{TS}$ and $\omega $ the one-form induced by Q on S.

Proof

From [85, Theorem 1.3], we have that there exists a smooth Cauchy time function $T:M\rightarrow {\mathbb {R}}$. Let then $S{:}{=}T^{-1}(0)$. Being $K_x\in {\mathcal {C}}_x$, for all $x\in M$, we have that $\textrm{d}T(K)>0$ by definition of a smooth time function, and then K is transversal to S. Thus, for any vector $(x,w)\in TM$ with $x\in S$, we can write $w=w_S+\tau _w K_x$ where $w_S\in T_x S$. Since

$$\begin{aligned} \frac{\textrm{d}}{\textrm{d}s} L(x,w_S + s\tau _w K)=\tau _w\partial _vL(x, w_S+s\tau _w K)[K]=\tau _w Q(w_S)-s\tau _w^2, \end{aligned}$$

by integrating w.r.t. s between 0 and 1, we get

$$\begin{aligned} L(x, w)=L(x, w_S+\tau _w K)=L(x,w_S)+\tau _w Q(w_S)-\frac{1}{2} \tau _w^2, \end{aligned}$$

which gives the required expression for L restricted to vectors $(x,w)\in TM$ with $x\in S$. Let $\phi $ be the restriction to $S\times {\mathbb {R}}$ of the flow of K. Since T is a Cauchy time function, it is strictly increasing on the flow lines $\gamma $ of K and it satisfies $\lim _{s\rightarrow \pm \infty }T(\gamma (s))=\pm \infty $. Therefore, $\phi :S\times {\mathbb {R}}\rightarrow M$ is a diffeomorphism. Using that L is invariant by the flow of $K^c$ we obtain

$$\begin{aligned} L \circ \phi _*\big ((x,t),(\nu ,\tau )\big ) = L_0(x,\nu ) + \omega (\nu )\tau - \frac{1}{2} \tau ^2, \end{aligned}$$

(B2)

where $L_0=L|_{TS}$ and $\omega $ is the one-form induced by Q on S. $\square $

Let us denote by $g_S$ the $C^1$ Riemannian metric on S induced by g. We assume that the one-form $\omega $ has sublinear growth w.r.t. the distance $d_{S}$ induced by $g_S$, i.e. there exist $\alpha \in [0,1)$ and two non-negative constants $k_0$ and $k_1$ such that

$$\begin{aligned} \Vert \omega \Vert \le k_0+k_1\big (d_S(x,x_0)\big )^\alpha , \end{aligned}$$

(B3)

for some $x_0\in S$ and all $x\in S$. From [51, Proposition 8.1], we immediately obtain the following result.

Proposition B.2

Under the assumptions of Lemma B.1, assume also that g is complete, (B3) holds, $L_0$ is non-negative and satisfies

$$\begin{aligned} \big (\partial _v L_0(x,v_2)-\partial _0 L_0(x,v_1)\big )[v_2-v_1] \ge \lambda _0(x)\left\| v_2 - v_1\right\| ^2, \end{aligned}$$

(B4)

for each $x\in S$, and all $v_1, v_2\in T_xS$. If $\displaystyle \inf _{x\in S}\lambda _0(x)>0$, then ${\mathcal {J}}_{p,r}$ is pseudocoercive for all $p, r\in M$.

Remark B.3

We notice that the condition (B4) is always satisfied in the Lorentzian setting, since S can be taken to be a smooth spacelike Cauchy hypersurface; moreover if S is complete then a possible auxiliary Riemannian metric g on $S\times {\mathbb {R}}$ is the natural product metric which is then also complete. Therefore, $\lambda _0(x)=1$, for each $x\in S$, in the Lorentzian setting. We point out that in [84, Theorem 5.1] the completeness of S is a required assumption. The more technical assumption in Proposition B.2 is (B3). It is needed to get the boundedness of the constants $Q(\dot{z})$, for all z in a fixed sublevel $\mathcal J_{p,r}^c$, a property called c-boundedness in [51], that implies pseudocoerciveness if satisfied for each $c\in {\mathbb {R}}$ (see [51, Proposition 7.2]). Actually, when L is a 2-positive homogeneous Lagrangian and $L_0\in C^1(TS)$ is the square of a Finsler metric on S, a close inspection of the proof of [84, Theorem 5.1] makes clear that (B3) can be removed, and an analogous proof can be repeated by using the action functional of $L_0$ instead of the energy functional of the Riemannian metric on S. In fact, using the global splitting $S\times {\mathbb {R}}$ and (B2), the arrival time functional of a lightlike curve $z(s)=\big (x(s), t(s)\big )$, (i.e., a causal curve $z:[0,1]\rightarrow M$ such that $L\big (z(s), \dot{z}(s)\big )=0$, a.e. on [0, 1]) between $p=(x_0, 0)\in S\times \{0\}$ and a flow line of K, $\gamma (t)=(x_1, t)$, is given by

$$\begin{aligned} J:\Omega _{x_0,x_1}(S)\rightarrow {\mathbb {R}}, \qquad J(x)=\int _0^1 \left( \omega (\dot{x})+\sqrt{\omega ^2(\dot{x}) +2L_0(x, \dot{x}\big )}\right) \textrm{d}s, \end{aligned}$$

and this is a key point in the proof of [84, Theorem 5.1] (refer to [84, Lemma 5.4]). Moreover, the completeness of the Riemannian metric on S can be replaced by the forward or backward completeness of $\sqrt{L_0}$. Another fundamental point is the compactness of $S\cap J^-(q)$, for any $q\in M$, (see (B6) for the definition of $J^-(q)$), used in the proof of [84, Lemma 5.5]. In our setting, this is an immediate consequence of [47, Theorem 2.44]. Summing up, the following result extending [84, Theorem 5.1] holds:

Theorem B.4

Under the assumptions of Lemma B.1, assume also that $L_0\in C^1(TS)$ is the square of a forward or backward complete Finsler metric on S. Then ${\mathcal {J}}_{p,r}$ is pseudocoercive for all $p, r\in M$.

Remark B.5

In light of Theorem B.4, it becomes important to give conditions ensuring that $L_0$ is the square of a Finsler metric on S. A first observation is that $L_0$ is non-negative and (B4) holds if, for each $x\in S$

$$\begin{aligned} \lambda (x) -\max _{\nu \in T_x S, \Vert \nu \Vert =1} 2Q^2_x(\nu )>0, \end{aligned}$$

where $\lambda (x)$ is defined in (2.9) (see [51, Remark 2.14]).

We also notice that, if ${\mathcal {O}}_0:={\mathcal {O}}\cap TS$, satisfies, relatively to TS, the same properties satisfied by ${\mathcal {O}}$ in Remark 2.8-(a’), then (B4) holds if

$$\begin{aligned} \inf _{v\in {\mathcal {O}}_x}\left( \min _{\nu \in T_x S, \Vert \nu \Vert =1}\big (\partial _{vv}L_c(x,v)[\nu ,\nu ]- 2 Q^2_x(\nu )\big )\right) >0. \end{aligned}$$

(B5)

Moreover, in this case, $\sqrt{L_0}$ in (B2) is a Finsler metric on S such that $L_0$ is of class $C^1$. Indeed, from (2.16) and (B5) we immediately get that $\partial _{vv}L(x,v)|_{T_xS\times T_x S}$ is a positive definite bilinear form, for every $v \in T_xS\cap {\mathcal {O}}_0$. Therefore, recalling that $L_0=L|_{TS}$ and it is fiberwise positively homogeneous, we have that $L_0(v)\ge 0$ for all $v\in \mathcal O_0$ and then on TS by density of ${\mathcal {O}}_0$ in TS. Arguing as in Remark 2.10, we then conclude that $\sqrt{L_0}$ is a Finsler metric.

Actually, in this last setting, (B5) is also a necessary condition for $L_0$ being the square of a Finsler metric. In fact, let $\{e_1,\dots ,e_m\}\subset T_xS$ be an orthonormal basis of $T_xS$ with respect to the auxiliary Riemannian metric g. Using this basis, we can write the one-form $\omega :T_xS \rightarrow {\mathbb {R}}$ given by $Q|_{T_xS}$ as $(\omega _1,\dots ,\omega _m)$. Let us denote by $g_0(v)_{ij}$ the vertical Hessian matrix of $L_0$ in $v\in T_xS\cap {\mathcal {O}}_x$ with respect to this basis. Similarly, we denote by $g_c(v)_{ij}$ the vertical Hessian matrix of $L_c$ restricted to $T_xS$. With this notation, first we notice that, $g_0(v)_{ij}$ has $m-1$ positive eigenvalues, since it coincides with $g_c(v)_{ij}$ on $\textrm{ker}(\omega )$. By [86, Proposition 11.2.1], applied to the vector $i\sqrt{2}(w_1, \ldots , w_m)\in {\mathbb {C}}^m$, we have

$$\begin{aligned} \textrm{det}(g_0(v)_{ij})=\textrm{det}\big (g_c(v)_{ij} -2\omega _i\omega _j\big ) =\big (1 - 2g_c(v)^{ih}\omega _h\omega _i\big )\textrm{det}(g_c(v)_{ij}), \end{aligned}$$

where $g_c(v)^{ij}$ denotes the inverse matrix of $g_c(v)_{ij}$. Since $g_c(v)_{ij}$ is positive definite, then $g_0(v)_{ij}$ is positive definite if and only if $1-2g_c(v)^{ih}\omega _h\omega _i > 0$, namely if and only if the norm of $\omega $ with respect to $g_c(v)$ is strictly less than 1/2 for every $v\in T_xS\cap {\mathcal {O}}_x$.

Let us now analyze the converse situation, i.e. we assume now that ${\mathcal {J}}_{p,r}$ is pseudocoercive for all $p, r\in M$ and we prove that global hyperbolicity holds. We recall (see, e.g., [47, §2.1]) that an absolutely continuous curve $\gamma :[a,b] \rightarrow M$ is causal if ${\dot{\gamma }} (t)\in {\mathcal {C}}_{\gamma (t)}$, for a.e. $t\in [a,b]$. For any $p \in M$, we set

$$\begin{aligned} J^+(p) {:}{=}\left\{ \begin{aligned} r \in M: r = p&\text { or there exists a causal curve } \gamma :[0,1] \rightarrow M\\&\text {such that } \gamma (0) = p \text { and } \gamma (1) = r \end{aligned} \right\} , \end{aligned}$$

and, analogously, we define

$$\begin{aligned} J^-(p) {:}{=}\left\{ \begin{aligned} r \in M: r = p&\text { or there exists a causal curve } \gamma :[0,1] \rightarrow M\\&\text {such that } \gamma (0) = r \text { and } \gamma (1) = p \end{aligned} \right\} . \end{aligned}$$

(B6)

We call causal diamond a set given by $J^+(p)\cap J^-(r)$, for some $p,r \in M$.

According to [47, Corollary 2.4], global hyperbolicity on a proper cone structure ${\mathcal {C}}$ is equivalent to the non-existence of absolutely continuous closed causal curves plus compactness of every causal diamond. We use this characterization to prove the next result that extends to Lorentz-Finsler stationary spacetimes [52, Proposition B1].

Theorem B.6

Let (M, L) be a Finsler spacetime such that Assumption 1 holds. If ${\mathcal {J}}_{p,r}$ is pseudocoercive for all $p, r\in M$, then (M, L) is globally hyperbolic.

Before proving the above result we need the following lemma.

Lemma B.7

Any absolutely continuous causal curve $\gamma :[a,b]\rightarrow M$ admits a reparametrization on [0, 1] as an $H^1$ curve with $Q(\dot{\gamma }(s))=\mathrm {const.}$.

Proof

By the local splitting and homogeneity in (B1), we can use the locally defined functions t to parametrize locally $\gamma $ as $\gamma (t)=(x(t),t)$, so that $t\mapsto \left\| \dot{x}(t)\right\| $ is locally bounded. As the support of $\gamma $ is compact, we can patch together the locally defined reparametrization to get an $H^1$ curve defined on an interval [0, c], and a further reparametrization gives the thesis. $\square $

Proof of Proposition B.6

From Lemma B.7, there is no loss of generality in considering just $H^1$ curves parametrized on [0, 1] with $Q(\dot{\gamma }(s))=\mathrm {const.}$. Assume that there exists a closed causal curve $\gamma :[0,1]\rightarrow M$. We take the sequence $\gamma _n$, $n\ge 1$, defined by concatenating the n curves $\gamma _j(s){:}{=}\gamma (n(s-j/n))$ for $s\in [j/n, (j+1)/n]$, $j=0, \dots , n-1$. The sequence satisfies ${\mathcal {J}}(\gamma _n)\le 0$ but it does not admit any uniformly converging subsequence in contradiction with pseucoercivity of ${\mathcal {J}}_{\gamma (0), \gamma (0)}$, hence (M, L) must be causal. Let us now assume by contradiction that $J^+(p)\cap J^-(r)$ is not compact. Then there exists a sequence of points $(q_n)_{n \in {\mathbb {N}}}\subset J^+(p)\cap J^-(r)$ that does not admit any subsequence converging to a point in $J^+(p)\cap J^-(r)$. We take then a sequence of causal curves $(\gamma _n)_{n \in {\mathbb {N}}}\subset J^+(p)\cap J^-(r)$ such that $q_n\in \gamma _n([0,1])$, for each $n\in {\mathbb {N}}$. Moreover, by Lemma B.7 we can assume that the sequence $(\gamma _n)_{n \in {\mathbb {N}}}$ belongs to ${\mathcal {J}}_{p,r}$. Since ${\mathcal {J}}(\gamma _n)\le 0$ for every $n \in {\mathbb {N}}$, by pseudocoercivity $(\gamma _n)_{n \in {\mathbb {N}}}$ admits a uniformly converging subsequence $(\gamma _{n_k})_{k}$. The uniform limit is then a causal curve $\gamma :[0,1]\rightarrow M$ connecting p to r, by theorem [47, Theorem 2.12]. This implies that $(q_{n_k})_{k}$ must admit a converging subsequence to a point in $J^+(p)\cap J^-(r)$, which is a contradiction. $\square $

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Caponio, E., Corona, D., Giambò, R. et al. Fixed energy solutions to the Euler-Lagrange equations of an indefinite Lagrangian with affine Noether charge. Annali di Matematica 203, 1819–1850 (2024). https://doi.org/10.1007/s10231-024-01424-4

Download citation

Received: 14 July 2023
Accepted: 05 January 2024
Published: 23 April 2024
Issue Date: August 2024
DOI: https://doi.org/10.1007/s10231-024-01424-4

Keywords

Mathematics Subject Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Fixed energy solutions to the Euler-Lagrange equations of an indefinite Lagrangian with affine Noether charge

Abstract

Similar content being viewed by others

The topological entropy for autonomous Lagrangian systems on compact manifolds whose fundamental groups have exponential growth

A variational setting for an indefinite Lagrangian with an affine Noether charge

An Eulerian–Lagrangian Form for the Euler Equations in Sobolev Spaces

1 Introduction

2 Notations, assumptions and a class of examples

Assumption 1

Remark 2.1

Remark 2.2

Assumption 2

Remark 2.3

Remark 2.4

Remark 2.5

Assumption 3

Remark 2.6

2.1 Lorentz-Finsler metrics

Definition 2.7

Remark 2.8

Proposition 2.9

Proof

Remark 2.10

3 Variational setting

Remark 3.1

Remark 3.2

3.1 Preliminary results

Lemma 3.3

Proof

Proposition 3.4

Proof

Proposition 3.5

Proof

4 The variational structure of the action in relation with the flow of K

Proposition 4.1

Proof

Proposition 4.2

Proof

Corollary 4.3

Proof

Proposition 4.4

Proof

Proposition 4.5

Proof

Remark 4.6

Lemma 4.7

Proof

Remark 4.8

Corollary 4.9

Proof

5 Main result

Theorem 5.1

Proof

Remark 5.2

6 An existence and multiplicity result

Definition 6.1

Remark 6.2

Remark 6.3

Lemma 6.4

Proof

Lemma 6.5

Proof

Lemma 6.6

Proof

Proposition 6.7

Proof

Lemma 6.8

Proof

Theorem 6.9

Proof

Remark 6.10

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Appendices