Partial regularity for minimizers of a class of discontinuous Lagrangians

We study a one dimensional Lagrangian problem including the variational reformulation, derived in a recent work of Ambrosio-Baradat-Brenier, of the discrete Monge-Amp\`ere gravitational model, which describes the motion of interacting particles whose dynamics is ruled by the optimal transport problem. The more general action type functional we consider contains a discontinuous potential term related to the descending slope of the opposite squared distance function from a generic discrete set in R^{d}. We exploit the underlying geometrical structure provided by the associated Voronoi decomposition of the space to obtain C^{1,1} regularity for local minimizers out of a finite number of shock times.


Introduction
In recent years, action functionals of the form have received the attention of many authors, due to their appearence in several areas of Mathematics.
In the theory of gradient flows, for instance, they correspond to the integral form of the energy dissipation (see [5]), and they are also related to the so called entropic regularization of the Wasserstein distance, when f is a multiple of the logarithmic entropy, defined on the space of probability measures with finite quadratic moment (see [9]).In all these cases, the main obstruction to the application of standard results of Calculus of Variations stems from the lack of differentiability, even continuity, of the Lagrangian with respect to γ.
When f : X → (−∞, +∞] is a λ-convex function defined on a metric space (X, d), the term |∇f (x)| has to be interpreted as the descending slope of f at x, namely and, if X is a Hilbert space, it also coincides with the norm of the minimal selection in the subdifferential ∂f (x), known as the extended gradient of f at x (see [5]).In this general framework, stability of the functionals I f with respect to Γ-convergence of the functions f was investigated in [2], [3] and [4].
In particular, [2] addressed a rigorous derivation, along the lines of [8], of a dynamical system of interacting particles strictly related to the optimal transport problem, known as the discrete Monge-Ampère gravitational (MAG) model (see subsection 2.2).What emerged from this work is that the dynamics of MAG can be conveniently studied as the Euler-Lagrange equation associated to an action functional of type (1.1), where f = f K is the (−1)-convex function given by the opposite squared distance from a specific discrete set K ⊂ R d , namely Clearly, f K is not everywhere differentiable in general, so that ∇f K , denoting the extended gradient of f K , is not even continuous, and standard results of Calculus of Variations are not directly applicable in this case.The present work aims at a systematic analysis of the properties of local minimizers for the functional I f K , where K is a generic discrete set K in R d .Our results apply in particular to solutions of the discrete MAG model, thereby addressing the general n-dimensional case, left open in [2], where the most involved part of the analysis was carried out only in dimension 1.
The plan of the paper is the following.In section 2 we present the general framework and the motivations of our work.More precisely, in subsection 2.1 we provide a contextualization of the problem in the general Hilbertian setting, with particular emphasis on the variational properties of functionals of type (1.1), when f is a λ-convex function.The main references for this part are [3] and [5].Then, in subsection 2.2, we introduce the Monge-Ampère gravitational model in the flat torus T n as a modification of the classical Newtonian gravitation in which the linear Poisson equation is replaced by the fully nonlinear Monge-Ampère equation (see (2.7)).Following the ideas of [8], we underline the intriguing link of this dynamical system with the optimal transport problem, whose powerful tools can be used in order to derive a Lagrangian reformulation of MAG particularly meaningful in the discrete setting, where additional foundation to the model is given by the results in [2].By means of a least action principle, we then interpret solutions of the discrete MAG model as local minimizers of the functional I f K , where f K is the opposite squared distance function from a discrete set K ⊂ R d , as defined in (1.3).Section 3, being the core of the paper, is then devoted to the analysis of local minimizers for the Lagrangian problem associated to I f K , when K is a finite collection of points in R d : We crucially consider the Voronoi partition of the space carried by K, which encodes the underlying geometrical structure of the problem, and exploit it in order to obtain, in Proposition 3.8, the existence of some specific directions along which momentum is locally conserved by the dynamics.As a byproduct, we show in Corollary 3.9 that a local minimizer γ is regular as long as it stays in a single Voronoi cell, possibly developing singularities only at those times in which the optimality class changes.As it is shown later, the set K also carries a partition of R d into "potential zones" (see Proposition 3.6).This second partition is in general less fine than the Voronoi one, and coincides with it when the set K is "balanced", like for instance a cubic lattice.We then define S(γ) to be the set of "shock times", at which the curve γ jumps from a Voronoi cell to another, and N DS(γ) ⊆ S(γ) the set of "nondegenerate shock times", at which γ not only changes the Voronoi cell, but also the potential zone.With this in mind, our main regularity results Theorem 3.15 and Corollary 3.16 can be collected in a single statement as follows: Theorem (Partial regularity).Let γ be a local minimizer of I f K with endpoints constraints.Then i) γ has a finite number of nondegenerate shock times out of which it is C 1,1 .
ii) Under the additional assumption that K is balanced, γ has a finite number of shock times out of which it is C ∞ .
This result in particular provides an extention to any space dimension of [2, Theorem 13], where regularity out of a finite number of shock times was proved for minimizers of a one dimentional version of the MAG model.
Acknowledgements.I wish to thank Prof. Luigi Ambrosio for introducing me to the present research topic and for providing me with very useful suggestions throughout the preparation of this work.
2 General framework and motivations It is easily seen that λ-convex functions are precisely those functions that satisfy the perturbed convexity inequality By ∂f (x) we denote the Gateaux subdifferential of f at x ∈ dom(f ), namely the (possibly empty) closed convex set We denote by dom(∂f ) the domain of the subdifferential.For a λ-convex function, we can exploit the monotonicity of difference quotients to derive the equivalent non asymptotic definition of the subdifferential Whenever x ∈ dom(∂f ), there exists a unique element ξ with minimal norm in ∂f (x), obtained by projecting 0 on ∂f (x).This element is called the extended gradient of f at x, and is denoted by ∇f (x).
The concept of extended gradient is strictly related to the one of descending slope of f at x ∈ dom(f ), namely In fact, for λ-convex functions, it can be proved that ∂f (x) is not empty if and only if |∇ − f |(x) < +∞, and that, in this case, the following equalities hold (see [5]): In this paper we deal with the following specialization of the above setting.Given a closed set K ⊆ H, we consider the opposite squared distance function from K, namely 2) The infimum is not attained in general, unless K is either convex or compact, or H is finite dimensional.By defining the convex function we derive from the equality g K A class of action functionals.We now introduce, in the general Hilbertian setting, the class of action functionals that we are going to study throughout the paper.We fix a function h : [0, +∞] → [0, +∞] representing a "potential shape".Then, for δ > 0 and f : H → (−∞, +∞] proper, λ-convex and lower semicontinuous, we consider the functional I δ f : C([0, δ], H) → [0, +∞] defined by Compared to the one of type (1.1) studied so far in the literature, we consider here the enriched class of functionals in which the potential shape h is allowed to be different from the identity.In the sequel we assume h to be continuous, and C 1 when restricted to [0, +∞).
As we have in mind to study this type of functionals from the variational point of view, it is crucial to realize that (2.3) is lower semicontinuous with respect to the C([0, δ], H) topology.This in fact easily follows from the lower semicontinuity of the classical action and the above characterization of the extended gradient (2.1).Then, for x 0 , x δ ∈ H, the infimum is attained under suitable coercivity conditions.Note in particular that this is the case if H is finite dimensional.
Due to the lack of continuity of the potential term, however, very little is known about the regularity for minimizers of this type of functionals, even in the finite dimensional case.We could ask for instance whether some higher regularity or at least a sort of Euler-Lagrange equation like formally could be derived for local minimizers.In the very specific case in which f is the opposite squared distance function from a discrete set in R d , we will prove in the sequel that local minimizers are piecewise C 1,1 , and that, out of a finite number of singularities, (2.4) holds taking the modulus on both sides and replacing the equality with a ≤ sign (see Theorem 3.15).Nevertheless, in the general setting, one can exploit the fact that the functional I δ f is autonomous in order to perform "horizontal" variations of the independent variable, and eventually derive the Dubois-Reymond equation for a local minimizer γ (see [1]): in the sense of distributions in (0, δ).Equivalently, there exists a constant c ∈ R such that a.e. in (0, δ).This implies in particular that every local minimizer of I δ f is Lipschitz continuous, provided that |∇f | is bounded on bounded sets.
We end this part by quoting a result from [3] addressing the matter of stability for the class of functionals considered so far.By adding endpoints constraints x 0 , x δ ∈ H, we define the functional Theorem 2.1 (Stability, [3]).Let f j , f be uniformly λ-convex functions, and let x j,0 , x j,δ , x 0 , x δ ∈ H. Suppose that i) f j → f w.r.t.Mosco convergence.
Then I δ f j ,x j,0 ,x j,δ Γ-converge to I δ f,x 0 ,x δ in the C([0, δ], H) topology.As a byproduct, under an additional equi-coercivity assumption, this theorem grants convergence of minimal values to minimal values and of minimizers to minimizers.Notice that Theorem 2.1 is stated in [3] for h = id, but the same proof is seen to work in the general case with only minor modifications.

The Monge-Ampère gravitational model
In a periodic spatial domain like the flat torus T n = R n /Z n , we can describe classical Newtonian gravitation of a unity of mass in a "parametric" way as follows.We first choose a reference metric probability space (A, λ) of labels for the gravitating particles.Then we assign to each particle a ∈ A its position X t (a) ∈ T n at time t.Tipical choices for the reference space are the unit cube [0, 1] n with the n-dimensional Lebesgue measure in the continuous case, and a finite set of points with the renormalized counting measure in the discrete case.Denoting by µ t := (X t ) # λ the image measure of λ by X t , the Newtonian model can be written as ( Here φ t is the gravitational potential generated by µ t , defined on T n .Note that due to the periodicity of the space, the average density 1 has been removed from the right hand side of the Poisson equation, in order to let the uniform measure L n be a stationary solution of the system.This is a perfectly meaningful assumption, because by symmetry, the attractive force of the uniform density has to be zero everywhere on T n .In this section we are interested in the related Monge-Ampère gravitational model (MAG in short), which is simply obtained from (2.6) by replacing the Poisson equation with the fully nonlinear Monge-Ampère equation: (2.7) Notice that (2.6) can be recovered from (2.7) by expanding the determinant in the Monge-Ampère equation and keeping only the linear term: We refer to [8] and the references therein for a broader introduction to this dynamical system, as well as for a comparison with the classical Newtonian model.
The MAG model in optimal transportation terms.System (2.7) appears to have an intriguing geometrical interpretation if we look at it from the optimal transportation point of view.
In order to better illustrate this link, we first quote the following specialization to the flat torus T n of the classical Brenier-McCann theorem on the existence and uniqueness of optimal transport maps on Riemannian manifolds (see [7], [10], [12]).Let us begin with some notation.We denote by π : R n → T n the projection to the quotient.We say that a vector field If this is the case we make a little abuse of notation by considering F also as a vector field from T n to itself.Given a Borel probability measure λ on T n , we consider the Hilbert space and its closed subset K λ given by all the λ-preserving vector fields Finally, we recall that f K λ denotes the opposite squared distance function from K λ , as defined in (2.2).
Theorem 2.2 (Existence and uniqueness of optimal transport maps in T n ).Let µ and λ be Borel probability measures on T n , and suppose that µ ≪ L n .Then i) There exists a locally Lipschitz convex function ψ : , and T := ∇ψ : T n → T n is the unique optimal transport map from µ to λ.
ii) If µ = ρL n and λ = ηL n are both absolutely continuous w.r.t. the Lebesgue measure, then φ solves the Monge-Ampère equation in the almost everywhere sense.Furthermore, if ρ and η are of class C 0,α , then φ is of class C 2,β , for 0 < β < α, and solves (2.8) in the classical sense. iii where W 2 is the Wasserstein distance in the space P 2 (T n ).Moreover, the map In order to reformulate the Monge-Ampère gravitational model in optimal transportation terms, we look to the continuous case, in which the reference space is given by (T n , L n ).Fix then λ = L n in the Theorem above, and consider a parametrization X t : T n → T n such that (X t ) # λ = µ t and µ t = ρ t L n is absolutely continuous w.r.t. the Lebesgue measure.If Y t ∈ H λ is any lifting of X t , that is to say a map that satisfies π • Y t = X t , then Theorem 2.2 grants that the Kantorovich potential φ t solves the Monge-Ampère equation and , where T t is the unique optimal transport map from µ t to λ.So we see that (2.7) reduces to with T t equal to the unique optimal transport map from suggesting an interpretation of (2.11) as the Euler-Lagrange equation associated to the functional This variational reformulation appears natural in the attempt to give a meaning to system (2.7) also in the discrete setting, where, as it is well known, Theorem 2.2 fails.
The discrete MAG model.As already mentioned before, one of the aims of this work is to go deeper in the analysis of the discrete version of the Monge-Ampère gravitational model, first introduced in [8] and then formalized in [2].Here we choose as reference measure where the a i 's are distinct points on T n (think for instance to a regular lattice approximating the uniform measure).In this case, the space H λ is easily seen to be finite dimensional, and isomorphic to R nm , through the identification of a map Y ∈ H λ with the m-uple By regarding, a bit improperly, the a i 's as elements of [0, 1) n , the set K λ can be written as the union of m! cubic lattices in R nm : In this discrete scenario, the MAG model describes the motion of m particles of equal mass 1/m in the torus T n , whose dynamics is ruled by the optimal transport problem as follows.The position of the i-th particle at time t is denoted by x i (t) = X t (a i ), and a lifting of The equivalent of (2.11) in this setting is, at least formally, where The system (2.13) is easily seen to be ill posed, because of the general non uniqueness of the projection on K λ , ultimately due to the nonuniqueness of the optimal transport map in the discrete setting, in contrast with the absolutely continuous one.As already pointed out in [2], in order to fix this problem, it is convenient to switch to a variational reformulation of the dynamical system, by considering an action functional of type (2.12).Therefore, relying on a least action principle, we say that y ∈ AC([0, δ], R nm ) is a solution of the discrete MAG model if it is a local minimizer of the functional subject to endpoints constraints.In the next secion, we are going to study a more general functional in which K λ is replaced by a generic discrete set K in R d .
Before concluding this part, we would like to briefly turn the attention of the reader to an analoguous Lagrangian problem in the space of probability measures (P(T n ), W 2 ).This can be obtained from MAG by dropping the parametric description of the gravitating matter, required by the Hilbertian setting of subsection 2.1, and directly considering the evolution of a probability measure µ t in T n .
A related Lagrangian problem in (P(T n ), W 2 ).Far from being limited to the Hilbertian context, functionals of type (1.1) can be considered in a much more general metrict setting, provided that we interpret |∇f (x)| as the descending slope |∇ − f |(x) defined in (1.2), and | γ| as the metric derivative of an absolutely continuous curve γ : [0, 1] → X.We avoid to repeat all the constructions in this new scenario (see [4] for a systematic introduction), and prefer to immediately specialize to our case of interest.We take (X, d) = (P(T n ), W 2 ) the space of probability measures on T n endowed with the Wasserstein distance induced by the optimal transport problem with quadratic cost.As it is well known, (X, d) is compact, geodesic and positively curved (see [5]).Given a "reference" probability measure λ ∈ P(T n ), we consider the opposite squared distance function from λ, namely Since (X, d) is positively curved, we easily deduce that f λ is (−1)-convex (in the metric setting, convexity has to be intended along geodesics).Moreover, we can bound the descending slope of f λ at µ as follows: Theorem 10.4.12], and involves the minimal L 2 norm of the barycentric projection of optimal transport plans.Inspired by the MAG model, and in particular by formulas (2.9) and (2.10), one could study the Lagrangian problem associated to the lower semicontinuous functional I δ f λ ,µ 0 ,µ δ : C([0, δ], X) → [0, +∞] defined by From the compactness of X, we immediately obtain the existence of minimizers of I δ f λ ,µ 0 ,µ δ .In addition, by exploiting a generalization of Theorem 2.1 to the general metric setting provided by [4,Theorem 17], as well as the bound on the descending slope (2.14), we obtain the following stability result: Proposition 2.3.Let λ j , λ ∈ P(T d ) be reference measures, and µ j,0 , µ j,δ , µ 0 , µ δ ∈ P(T d ) be endpoints.Suppose that i) Then I δ f λ j ,µ j,0 ,µ j,δ Γ-converge to I δ f λ ,µ 0 ,µ δ in the C([0, δ], X) topology.Moreover, we have convergence of minimal values to minimal values and of minimizers to minimizers.

The case of the opposite squared distance function in R d
In this section the main results of the paper will be derived.We study functionals of type (2.1) in the special case in which H = R d and f = f K is the opposite squared distance function from a closed subset K ⊆ R d .Motivated by the variational reformulation of the discrete MAG model, derived in the previous section, we will in particular focus on the case in which K is a discrete collection of points in R d .In this last setting, we will exploit the geometrical structure given by the associated Voronoi decomposition of the space in order to get regularity for local minimizers out of a finite number of "shock times".
Given a closed set K ⊆ R d , we consider the opposite squared distance function from K, defined by Notice that the infimum in (2.2) is always attained here, due to the local compactness of the ambient space.The convex function , thus implying the (−1)-convexity of f K .We fix a potential shape h : [0, +∞) → [0, +∞) of class C 1 and consider the action functional We stress that ∇f K has to be intended as an extended gradient, because f K is differentiable only at those points in which the projection on K is unique.
In order to get a useful characterization of ∇f K , we need a well known Lemma of convex analysis providing an explicit formula for the subdifferential at x of the maximum of a family of convex functions, under suitable assumptions (see [11]).Lemma 3.1 (Subdifferential of the sup function).Let g α : R d → R α∈A be a collection of convex functions indexed on a compact metric space A, and suppose that α → g α (x) is upper semicontinuous for every x ∈ R d .We consider the supremum function Then, if the supremum in the definition of g(x) is attained, the following formula holds for the subdifferential of g at x: We call opt K (x) the compact subset of K containing all the points that minimize the distance from x: In the sequel we will refer to opt K (x) as the optimality class of x.Applying Lemma 3.1 we get: Proposition 3.2 (Subdifferential of the opposite squared distance function).Let K ⊆ R d be a closed set, and let f K , g K be defined as in (3.1) and (3.2).Then i) The subdifferential of g K at x is given by ii) The subdifferential of f K at x is given by Moreover, denoting by η K (x) the unique projection of x on the closed convex set conv(opt K (x)), the following formula holds for the extended gradient of f K at x: iii) The point η K (x) depends only on the optimality class of x.That is to say, η K (x) = η K (y), whenever opt K (x) = opt K (y).
Proof.Point i) easily follows from Lemma 3.1 if K is compact.To deal with the general case it is enough to notice that for every x ∈ R d and every radius R > dist K (x), we have g K = g K∩B R (x) in a neighborhood of x.The formula for ∂f K (x) is a consequence of the rule for the subdifferential of the sum of two functions, one of which smooth.Then, by definition, ∇f K (x) is the projection of 0 on ∂f K (x) = conv(opt K (x)) − x, and formula (3.3) follows after a translation of x.Let us now address point iii).Suppose that x and y share the same optimality class, opt K (x) = opt K (y).Consider the affine space A spanned by opt K (x) and its orthogonal space B passing through x.From the hypothesis on x and y we deduce that also y belongs to B. Then, denoting by p the point of intersection of A and B, by orthogonality we have: Hence, both distances are minimized by the point z obtained by projecting p on conv(opt K (x)), thus η K (x) = η K (y).
Remark 3.3.From (3.3) we deduce that the potential term |∇f K (x)| 2 is always less than or equal to −2f K (x) = dist 2 K (x), and equality holds if and only if x has a unique projection on K.It is interesting to see what this means for the MAG model.Using the notation of subsection 2.2, the potential of a configuration (y 1 , . . ., y m ) ∈ R nm is always smaller than W 2 (µ, λ), where, setting and W 2 is the Wasserstein distance in P 2 (T n ).Moreover, equality holds if and only if there exists a unique optimal transport map from µ to λ.So we see that in the context of the MAG model, the potential term should be interpreted as a "measure of the ambiguity in the optimal transport problem".Tipical manifestations of ambiguity in the discrete scenario appear when two or more particles collapse, thus sharing the same position in T n .Compare also this phenomenon with the continuous framework of Theorem 2.2, where this ambiguity does not occur.
To end this part, we briefly come back to the matter of stability in this specialized context, stating the following Corollary of Theorem 2.1: Corollary 3.4.Let K j , K be closed subsets of R d , and let x j,0 , x j,δ , x 0 , x δ ∈ R d .Suppose that i) K j → K in the sense of Hausdorff in every compact set.ii) x j,i → x i for i = 0, δ.
Then I δ f K j ,x j,0 ,x j,δ Γ-converge to I δ f K ,x 0 ,x δ in the C([0, δ], R d ) topology.Moreover, we have convergence of minimal values to minimal values and of minimizers to minimizers.
As a consequence, the functional associated to a closed set K can be approximated by functionals associated to K j , where each K j is a finite collection of points in R d .It is the scope of the following subsection to focus on this simpler situation.

The discrete case
From now on, we restrict our analysis to the case in which K is given by a collection of N distinct points in R d : K = {p 1 , . . ., p N } .
We are particularly interested in studying properties of local minimizers for I δ f K ,x 0 ,x δ because of the link with the variational reformulation of the discrete MAG model (see the discussion above).There, K was an infinite discrete set, but we can clearly restrict our analysis, which is essentially local, to the case in which K is finite, due to the compactness of the range of every continuous curve γ : [0, δ] → R d .Let us fix K, so as to be allowed to omit all the pedices involving it.Then, for instance, we will write f, g, η, opt in the place of f K , g K , η K , opt K .
Polyhedra, Voronoi cells and potential zones.We say that P ⊆ R d is a polyhedron if it is a nonempty closed convex set admitting a representation of the form where ℓ ∈ N and T j : R d → R are affine functions.A bounded polyhedron is called a polytope.
The Voronoi partition associated to a finite collection of points K = {p 1 , . . ., p N } is the finite decomposition {V H } H∈P(K) of R d , indexed by the set P(K) of the parts of K, and such that We call V H the Voronoi cell corresponding to the optimality class H.The following are well known facts about this remarkable cellular decomposition of the space (see [6]).Proposition 3.5 (Properties of the Voronoi partition).Let H ∈ P(K) be such that the Voronoi cell V H is nonempty.Then i) V H is a convex set.Moreover, denoting by A H the affine space spanned by H, and by B H the affine space ii) The closure V H is a polyhedron, whose relative boundary in B H is precisely given by the disjoint union of all the Voronoi cells V L with H L.
In order to introduce the second fundamental decomposition associated to K, we also need to define, for every η ∈ R d , the sets: The following proposition encodes the underlying geometrical structure conferred to our variational problem by the particular choice we made for the potential.It will be of fundamental importance in deriving regularity results for local minimizers of the functional I δ f K ,x 0 ,x δ .Proposition 3.6 (Voronoi cells and potential zones).The following facts hold: i) The map η is constant in each Voronoi cell, and hence has a finite range, that we denote by E.
ii) {Q η } η∈E is a partition of R d , and x ∈ Q η if and only if ∇f (x) = η − x.In the sequel we will refer to the Q η 's as potential zones.
iii) For every η ∈ E, both Q η and P η are union of Voronoi cells and it holds Q η ⊆ P η .iv) For every η ∈ E, P η is a polyhedron.Proof.Point i), ii) and iii) are direct consequences of Proposition 3.2.To prove point iv) it is enough to notice that P η is a closed convex set that can be written as the union of a finite number of polyhedra (the closures of the Voronoi cells contained in P η ).Finally, point v) easily follows from the fact that η is the projection of x on the closed convex set ∂g(x) containing η.
So we see that K carries two partitions of R d , one finer than the other: the first into Voronoi cells and the second into potential zones.Simple examples show that for a general K they do not coincide (see for instance Example 3.14 hereafter).If they coincide, we say that K is balanced.In such a case, the map η defines a bijection between Voronoi cells and potential zones, that is to say, η(x) = η(y) ⇐⇒ opt(x) = opt(y).
Clearly, a sufficient condition for K to be balanced is given by opt(η(x)) = opt(x) for every x ∈ R d . (3.7) It is worth noting that in dimension d = 1 every K is balanced, and that the same is true in any dimension for cubic lattices.
Conserved quantities.In this paragraph we underline the presence of some conserved quantities for local minimizers of our variational problem.They naturally arise by testing the local minimality against variations along some specific directions.The following Lemma collects two crucial observations in order to suitably perform such variations: Lemma 3.7 (Local properties of the Voronoi diagram).The following facts hold: i) If x ∈ V H , for some H ⊆ K, then there exists a neighborhood U of x such that opt(y) ⊆ H, for every y ∈ U .
ii) If x ∈ V H , then x + ǫv ∈ V H , provided that the vector v is parallel to B H , and ǫ is sufficiently small.
Proof.Point i) follows from the fact that a point of K which is not optimal for x is neither optimal for y, provided that y is chosen close enough to x. Point ii) is instead a direct consequence of the fact that V H is relatively open in the affine space B H .
By using very classical variational arguments as well as Lemma 3.7 we derive the following Proposition 3.8 (Conservation laws).Let γ be a local minimizer of I δ f K ,x 0 ,x δ .Then i) (Conservation of the energy).There exists a constant c ∈ R such that In particular, γ is Lipschitz continuous.
ii) (Local conservation of momentum).Let (t 1 , t 2 ⊆ [0, δ] be a time interval.Suppose that there exists an optimality class H ⊆ K, with V H = ∅, such that for every s ∈ (t 1 , t 2 ) the inclusion opt(γ(s)) ⊆ H holds. Let γ H be the curve obtained by projecting γ on the affine space B H . Then γ H is a C 1,1 curve in (t 1 , t 2 ) and, in this interval, it satisfies In particular, for each time t ∈ (0, δ), denoting H = opt(γ(t)), there exists a neighborhood of t in which the component γH of the momentum parallel to B H is continuous.
Proof.Point i) states that γ solves the Dubois-Reymond equation (2.5).This can be shown by testing the local minimality through "horizontal" variations of the independent variable of the form γ ǫ = γ•ρ −1 ǫ , where ρ ǫ = id +ǫϕ, ϕ ∈ C ∞ c ((0, δ)), and ǫ is small enough so that ρ ǫ is a diffeomorphism.To get point ii), instead, we need to perform "vertical" variations of the form γ ǫ = γ + ǫϕv, where ϕ ∈ C ∞ c ((t 1 , t 2 )) and v is any vector parallel to the affine space B H .We then use point ii) of Lemma 3.7 to get η(γ ǫ ) = η(γ), for ǫ sufficiently small.Point ii) of the previous proposition implies in particular that γ is regular as long as it stays in a single Voronoi cell.More precisely: Corollary 3.9 (Regularity inside a Voronoi cell).Let γ be a local minimizer of As a natural consequence, any singularity for a local minimizer of the functional I δ f K ,x 0 ,x δ appears only when the optimality class "changes".In the next paragraph we will try to give a more precise meaning to this statement.
Shock times and minimal deviation.Given a curve γ : [0, δ] → R d , and a time t ∈ [0, δ], we say that • t is a shock for γ if opt(γ) is not constant on I for every neighborhood I of t in [0, δ].
• t is a nondegenerate shock for γ if η(γ) is not constant on I for every neighborhood I of t in [0, δ].
• t is an effective shock for γ if there are two distinct potential zones Q η , Q η, and two distinct Voronoi cells V H ⊆ Q η , V H ⊆ Q η such that, for some ǫ > 0, one of the following holds: γ((t − ǫ, t)) ⊂ V H and γ([t, t + ǫ)) ⊂ V H .In this case, H H and we say that t is a left effective shock.
γ((t − ǫ, t]) ⊂ V H and γ((t, t + ǫ)) ⊂ V H .In this case, H H and we say that t is a right effective shock.
We denote by S(γ), N DS(γ) and ES(γ) respectively the sets of shocks, nondegenerate shocks and effective shocks for γ.Notice that ES(γ) ⊆ N DS(γ) ⊆ S(γ), and that S(γ) and N DS(γ) are compact.According to the definitions above, during a shock there must be a change of Voronoi cell, while, during a nondegenerate shock there is also a change of potential zone.Clearly we have S(γ) = N DS(γ) provided that K is balanced.Finally, we have an effective shock when a neat passage occurs from a Voronoi cell to an adjacent one with different potential.By conservation of the energy, we expect the dynamics to develop a singularity in the kinetic term here.This is the content of the following proposition, which is a direct consequence of the conservation laws stated in Proposition 3.8.Proposition 3.10 (Minimal deviation during an effective shock).Suppose that the potential shape h is strictly increasing.Let γ be a local minimizer of Clearly an analogous result holds for right effective shocks.
Remark 3.11.Notice that, in the specific case of a superadditive shape h, (for instance when h = id, as in the MAG model), we derive a uniform lower bound on the jump of the derivative during an effective shock: where β was defined in (3.5).
Remark 3.12.In the MAG dynamics, shocks tipically happen when two or more particles collide or separate, generating an instant change in the optimality class.For instance an effective shock occurs when two particles collide and remain sticked together.Notice that Proposition 3.8 tells us that energy and momentum are conserved in a collision.
To end this part, we show, through a couple of simple examples, that all of the three types of shock times defined in this paragraph may occur for a minimizer of I δ f K ,x 0 ,x δ .Example 3.13 (Effective and noneffective shocks).Take It is easily seen that a minimizer γ of I δ f K ,x 0 ,x δ has to be nondecreasing, and thus γ −1 (0) = [t 0 , t 1 ] is a closed interval (possibly degenerate if t 0 = t 1 ), and S(γ) = N DS(γ) = {t 0 , t 1 }.Then, there are two possible qualitatively different behaviours of γ, according to whether t 0 = t 1 or not.If t 0 = t 1 , then γ has a single nondegenerate non effective shock time.If instead t 0 = t 1 , then t 0 and t 1 are respectively left and right effective shocks.Now, direct computations show that the first case occurs if for instance c = 1.On the other hand, we can prove that the second case necessarily occurs if c is chosen sufficiently small.As a matter of fact, for c < 1, by the monotonicity of γ, the minimum value Γ(−c, c) of the functional can be bounded below as follows: At the same time, by Corollary 3.4, we have Therefore, t 1 > t 0 , provided that we choose c small enough.

Regularity results
Here we state and prove our main regularity results.Recall that K = {p 1 , . . ., p N } is a finite collection of points in R d , and h : [0, +∞) → [0, +∞) is a C 1 potential shape.Theorem 3.15 (C 1,1 regularity out of a finite number of nondegenerate shock times).Let γ be a local minimizer of I δ f K ,x 0 ,x δ .Suppose that h is strictly increasing.Then: i) The set N DS(γ) of non degenerate shock times of γ is finite.That is to say, there is a finite number of times ii) Setting t 0 = 0 and t ℓ+1 = δ, then γ is C 1,1 regular in the interval [t i , t i+1 ] for every i ∈ [0 : ℓ].
Moreover, if we let η i ∈ E be such that γ((t i , t i+1 )) ⊆ Q η i , then we can estimate Actually, if K is balanced and h is smooth, Theorem 3.15 can be improved to reach piecewise smooth regularity for any local minimizer γ.In fact, since K is balanced, the equality N DS(γ) = S(γ) holds and point i) of Theorem 3.15 implies that γ has a finite number of shock times.On the other hand, Corollary 3.9 together with the smoothness of h ensures that γ is smooth in each connected component of [0, δ] \ S(γ), where clearly opt(γ(t)) is constant.
Corollary 3.16 (C ∞ regularity out of a finite number of shock times).Let γ be a local minimizer of I δ f K ,x 0 ,x δ .Suppose that K is balanced and h is C ∞ and strictly increasing.Then: i) The set S(γ) of shock times of γ is finite.That is to say, there is a finite number of times ii) Setting t 0 = 0 and t ℓ+1 = δ, then γ is C ∞ in the interval [t i , t i+1 ] for every i ∈ [0 : ℓ].Moreover, if t i+1 > t i , denoting by H i the optimality class of γ in the interval (t i , t i+1 ), and defining Remark 3.17.Corollary 3.16 offers a generalization of [2, Theorem 13], in which smooth regularity out of a finite number of shock times was derived for solutions of a 1-dimensional version of the discrete MAG model.In their framework, h was simply the identity, while K consisted of all the m! points of R d obtainable by permuting the components of a fixed vector Exploiting the rearrangement inequality provided by the order structure of R, it is not difficult to show that hypothesis (3.7) holds in this case, therefore implying that K is balanced.
Let us now give an overview of the proof of Theorem 3.15 before entering in the details of the forthcoming paragraphs.We start discussing how point ii) can be derived from point i).We need to show that if η(γ) = η is constant in an interval (s, t), then γ is C 1,1 regular in [s, t].We clearly have that γ([s, t]) ⊂ P η .Then we observe that for every absolutely continuous curve ρ : [s, t] → P η , due to the monotonicity of h and inequality (3.6), the following inequality holds: Then, by a comparison argument, knowing that γ is a local minimizer of the functional on the left hand side of (3.8), and that ρ = γ saturates the inequality, we get that γ is a local minimizer for the functional on the right hand side of (3.8), if one restricts to curves living in the closed convex set P η .
On the other hand, we are going to prove in Lemma 3.18 that any such constrained local minimizer is necessarily C 1,1 regular.We now pass to the proof of point i) of the Theorem.First notice that we can restrict ourselves to prove the following equivalent local statement: Claim: for each time t ∈ (0, δ], there exists ǫ > 0 such that η(γ) is constant in (t − ǫ, t).
From claim (3.9) and the analogous one for right intervals (obtainable by exploiting the autonomicity of the functional), it clearly descends that N DS(γ) is discrete.Then, by compactness, we derive that N DS(γ) is finite.The proof of claim (3.9) will be elementary accomplished by a quite intricated series of "cut and paste" constructions of competitors.Let us first outline the general heuristic idea.We divide the proof in a few steps: Step 1. Suppose by contradiction that t is a cluster point for the "jumps" in the potential.Then, approaching t from the left, γ would infinitely often visit high and low potential zones.If only one low potential zone were visited asymptotically, then it would be convenient to stay in it definitely.Therefore we can assume that γ infinitely often visits at least two different low potential zones, approaching t from the left.Moreover, in alternating between different low potential zones, γ necessarily spends a non negligible amount of time in high potential ones.
Step 2. Approaching t from the left, the percentage of time spent by γ in high potential zones tends to zero, thus enforcing at least two different low potential zones to be very near to each other in order to make it possible for the Lipschitz curve γ to jump from one to the other in a short time.
Step 3. By Lemma 3.20 we will be eventually allowed to slightly deviate γ to reach an even lower potential zone (the interface between the two), hence contradicting the local optimality of γ and reaching the desired absurdum.
In the following paragraph we prove Lemmas 3.18 and 3.20.As a byproduct of Lemma 3.18, we obtain that a local minimizer is C 1,1 regular as long as it stays in a single potential zone (see Corollary 3.19), hence addressing point ii) of Theorem 3.15.Subsequently, in the final paragraph, we will provide the rigorous proof of claim 3.9 along the lines of the heuristics above, thus concluding the proof of Theorem 3.15.
Two Lemmas.The first Lemma, of indpendent interest, concerns the regularity of local minimizers of action functionals restricted to curves living in a given closed convex set.Because of boundary effects, such constrained minimizers are in general not C 2 , even if the Lagrangian is smooth.Nevertheless, they must be at least C 1,1 , whenever the Lagrangian is C 1 .Lemma 3.18 (Regularity for a problem with a convex constraint).Let P ⊆ R d be a closed convex set, and let Ψ : R d → [0, +∞) be of class C 1 in a neighborhood of P .Given two points x 0 , x δ ∈ P , we consider the functional G : C([0, δ], R d ) → [0, +∞] defined by
Proof.We call π P the projection on P .We start by defining, for each point x ∈ P , the "blow up" of P at x, namely the closed convex cone P x defined by the formula We then call S x the projection on P x , which turns out to be positive homogeneous and 1-Lipschitz.It is not difficult to realize that P − x ǫ in the sense of Hausdorff on every compact set.Since the inclusion (P − x)/ǫ ⊆ P x always holds, in order to show the convergence (3.10), we only need to check, using a separation argument, that, for every ǫ j → 0 + , and every point z ∈ P x , there exists a sequence y j ∈ (P − x)/ǫ j converging to z.Hence, the projection on (P − x)/ǫ pointwise converges to S x as ǫ → 0 + .By exploiting also the homogeneity of S x we eventually obtain that, for every v ∈ R d , Now, let γ be a local minimizer for the functional G.Given a test function ϕ ∈ C ∞ c ((0, δ); R d ), we consider the following competitors: γ ǫ := π P (γ + ǫϕ), for ǫ > 0.
Using the previous pointwise expansion (3.11), as well as the dominated convergence Theorem, we obtain From the local minimality of γ, it follows immediately that the right hand side in (3.12) is nonnegative.Finally, we can use the contractiveness of the projections S x to get the inequality and the thesis follows.
As already shown above, in the comparison argument accompanying inequality (3.8), from Lemma 3.18 descends the following Corollary 3.19 (Regularity in a potential zone).Let h be a nondecreasing potential shape, and let γ be a local minimizer of I δ f K ,x 0 ,x δ .Suppose that there exists an η ∈ E such that γ((s, t)) ⊆ Q η , where (s, t) ⊂ [0, δ].Then γ is C 1,1 regular on [s, t] and the following estimate holds Thus, point ii) of Theorem 3.15 is proved.We can now address the second Lemma, which will turn out to be crucial in the proof of point i) of Theorem 3.15.Lemma 3.20 (Reciprocal distance of intersecting polytopes).Let A, B ⊂ R d be two polytopes with A ∩ B = ∅.Then there exists a sufficiently large constant M > 0 such that dist A∩B (x) ≤ M dist B (x) for every x ∈ A.
Proof.Let P ⊂ R d be a polytope endowed with a representation of the form (3.4), where we assume without loss of generality that each of the affine functions T j has Lipschitz constant equal to 1. Let us consider the associated vector valued function z P : R d → [0, +∞) ℓ defined by z P (x) = T 1 (x) + , . . ., T ℓ (x) + . (3.13) As a first step we show that there exists a constant c P > 0 such that T j (x) + ≤ dist P (x) ≤ c P |z P (x)| 1 for every j ∈ {1, . . ., ℓ} and every x ∈ R d . (3.14) The left inequality follows by the 1-Lipschitz assumption on T j , while the right one can be obtained using a compactness argument as follows.Suppose by contradiction that there exists a sequence of points Up to replacing x n with we can assume that dist P (x n ) = 1.Now, if x is any cluster point for x n , we obtain that dist P (x) = 1 and z P (x) = 0, which is clearly a contradiction.We are now in the position to prove the Lemma.Let ℓ ∈ N be the number of affine functions in a representation of B of the form (3.4), with 1-Lipschitz affine functions T j .By exploiting the estimates in (3.14), for every x ∈ A we can bound where c A∩B is a positive constant.The thesis follows by choosing M = ℓc A∩B .
Proof of the regularity result.In this paragraph we are going to complete the proof of our main regularity result, Theorem 3.15, following the heuristic idea outlined above.It remains to prove point i), and we already restricted to show claim (3.9).Throughout the proof, γ will be a fixed local minimizer of I δ f K ,x 0 ,x δ and L the Lipschitz constant of γ.Moreover, for the sake of simplicity, we indicate I δ f K ,x 0 ,x δ as F. As a preliminary observation, notice that we can choose an R > 0 large enough such that γ([0, δ]) ⊂ B R , and then, by point v) in Proposition 3.6, we can find an α > 0 small enough such that |η − x| ≥ |η − x| + α for every x ∈ B 2R ∩ Q η ∩ P η and every η, η ∈ E. (3.15) Proof of claim 3.9.First of all, by translation invariance, we can assume without loss of generality that γ(t) = 0, thus simplifying some further computations.Then we consider the asymptotic lowest potential threshold a := lim inf We call Ẽ the subset of E indexing the potential zones that are visited infinitely often before t.Namely, Notice that the thesis is equivalent to # Ẽ = 1.The set Ẽ can be partitioned into two subsets Ẽ1 and Ẽ2 defined by The set Ẽ1 , which is clearly nonempty by the definition of a, corresponds to those potential zones which are infinitely often visited by γ before t and that share the asymptotic lowest potential threshold a.
We expect the curve γ to spend most of the time there.We make the following technical choices of constants in order to simplify later arguments.We fix r, µ, ǫ > 0 small enough so that these requirements are satisfied: R1) For every x ∈ B r we have opt(x) ⊆ opt(0).R5) For every choice of η, η ∈ E, exactly one of the following holds: Notice that thanks to R1), for every x ∈ B r the segment [x, 0) will be entirely contained in the Voronoi cell V opt(x) , while R2) assures us that the curve belongs to this good area.Conditions R3), R4) and R5) will be useful to effectively distinguish between different potential Finally, condition R6) implies that the potential zones touched by γ in the interval (t − ǫ, t) are exactly those touched asymptotically.Thus, in particular, we have a = min By condition R6), the interval (t − ǫ, t) can be partitioned into the following two sets: We observe that C 1 is closed in (t − ǫ, t).In fact, if s j ∈ C Remember that the thesis is equivalent to # Ẽ = 1.We now assume that # Ẽ ≥ 2 and try to find a contradiction.
Step 1. (Reduction to the case in which # Ẽ2 ≥ 1 and # Ẽ1 ≥ 2).We first show that # Ẽ2 ≥ 1.If this were not the case, then we would have |η(γ(s))| = a for every s ∈ (t − ǫ, t) and # Ẽ1 ≥ 2. Then we could find two distinct η, η both belonging to Ẽ1 and a time s ∈ (t − ǫ, t) such that x := γ(s) ∈ Q η ∩ P η.Now, by (3.15)We now show that # Ẽ1 ≥ 2. Suppose by contradiction that Ē1 = {η}.Then we can build a better competitor γ by performing arbitrarily small perturbations of γ in the following way.We choose s ∈ (t − ǫ, t) as close to t as we want, such that γ(s) ∈ Q η .Then we modify γ only in the interval [s, t], by replacing it with its projection on the closed convex set P η .Namely, denoting by π η the projection on P η , we define γ(u) = γ(u) for u < s or u > t, π η (γ(u)) for u ∈ [s, t].
Notice that V H and V H are two polyhedra whose intersection contains 0. Possibly replacing them with their intersection with a large d-dimensional cube, we can assume that they are polytopes.Then we can apply Lemma 3.20 to deduce the existence of a constant M > 0 and a sequence of points Step 3. (A slight deviation of γ through V H ∩ V H reduces the action, thus contradicting its local minimality).We call ǫ ℓ := c 4 (t − s ℓ ) and assume that ℓ is large enough so that ǫ ℓ ∈ (0, 1).We crucially consider the following competitor: for u < s ℓ or u > t, γ(s ℓ ) + x ℓ −γ(s ℓ ) ǫ ℓ (t−s ℓ ) (u − s ℓ ) for u ∈ [s ℓ , s ℓ + ǫ ℓ (t − s ℓ )), Notice that in the interval [s ℓ , t], the curve δ ℓ is simply a piecewise linear modification of γ, going from γ(s ℓ ) to x ℓ in time ǫ ℓ (t − s ℓ ), and then from x ℓ to 0 in the remaining time.We will see that δ ℓ has strictly less action than γ for ℓ large enough, thus reaching the desired contradiction.We first want to be sure that V H ∩ V H is a very low potential zone, so that we can lower the action of γ by a slight deviation through it.This can be seen as follows.For sure η and η both belong to ∂f (x ℓ ), thus also η+η 2 ∈ ∂f (x ℓ ).But then where we used in the very last equality that |η| = |η| = a.Therefore, for ℓ large enough, we can assume that x ℓ ∈ B r , and |η(x ℓ )| < a.The last one in particular implies that |η(x)| ≤ a − 3µ for every x ∈ [x ℓ , 0] and ℓ large enough.
We can also estimate Let us then compare the action of δ ℓ with the one of γ.We start from the kinetic part: Whence, integrating: Here ℓ is choosen large enough so that c 0 (t − s ℓ ) ≤ µ 2 .Moreover, we set Finally, collecting all the estimates together, we obtain Now the contradiction comes from the fact that the right term is strictly positive for ℓ large enough.This concludes the proof.
∀p i , p j ∈ H , we have that A H is orthogonal to B H , they have complementary dimensions in R d , and V H is relatively open in B H .We call p H the unique intersection point of A H and B H .