Abstract
In this paper, we investigate the structure of reachable sets from a given point q 0 for a class of analytic control affine systems characterized, among other things, by possessing two singular trajectories initiating at q 0. The aim of the paper is to establish the connection between the minimal number of analytic functions needed for describing reachable sets and the number of geometrically optimal singular trajectories. The paper is written in a language of the sub-Lorentzian geometry. Also, the sub-Lorentzian geometry methods are used to prove theorems.
Similar content being viewed by others
1 Introduction
1.1 Preliminaries
This paper is a continuation of the research started in [8,9] and devoted to the study on reachable sets for noncontact sub-Lorentzian structures on ℝ3, as well as for affine control systems induced by them. Similarly as in [8,9] our objective is to investigate the interrelation of the structure of reachable sets from a given point q 0 for the mentioned systems and geometric optimality of singular trajectories starting at q 0 or—speaking in the sub-Lorentzian language—geometric optimality of timelike abnormal curves starting at q 0 (a trajectory of a control system starting from a point q 0 is said to be geometrically optimal if it is contained in the boundary of the reachable set from q 0; cf. [1]). The paper is arranged in such a way that, in the first four sections, we develop the theory in the sub-Lorentzian setting, and Section 5 contains applications of the obtained results to control affine systems.
For all facts and notions from the sub-Lorentzian geometry, the reader is referred to the previous papers by the author (see [6] and its reference section; see also [10]). Here, we recall only those basic facts that are needed for stating the results. Let M be a smooth manifold, and let H be a smooth distribution on M of constant rank. For a point q ∈ M and an integer k ∈ ℕ, we define \(H_{q}^{k}\) to be the linear subspace in H q generated by all vectors of the form [X 1, [X 2,...,[X i−1, X i ]... ]](q), where X 1,...,X i are smooth (local) sections of H defined in a neighborhood of q, and i ≤ k. We say that H is a bracket generating distribution if for every q ∈ M, there exists an i = i(q) such that \(H_{q}^{i}\) = T q M. Now, by a sub-Lorentzian structure (or a sub-Lorentzian metric) on a manifold M, we mean a pair (H, g), where H is a bracket generating distribution on M, and g is a smooth Lorentzian metric on H. The triple (M, H, g) is called a sub-Lorentzian manifold. Take a point q ∈ M; a vector v ∈ H q is said to be timelike if g(v, v) < 0, nonspacelike if g(v, v) ≤ 0, and null if g(v, v) = 0 but v ≠ 0. A time orientation of (M, H, g) is, by definition, a continuous timelike vector field on M. Suppose (M, H, g) to be time-oriented by X and let v ∈ H q be a nonspacelike vector. We say that v is future directed if g(v, X(q)) < 0. A curve γ : [a, b] → M is called horizontal if it is absolutely continuous, γ̇ (t) ∈ H γ(t) a.e. on [a, b], and γ̇ is square integrable with respect to some Riemannian metric on M. From now on, all curves are supposed to be horizontal. We will also use the following abbreviations: t. for “timelike,” nspc. for “nonspacelike,” and f.d. for “future directed.” We say that a curve γ : [a, b] → M is t.f.d. (resp. nspc.f.d., null f.d.) if so is γ̇ (t) a.e. on [a, b]. Fix a point q 0 ∈ M and its neighborhood U. The (future) timelike (resp. nonspacelike, null) reachable set from q 0 relative toU is defined to be the set of all points in U that can be reached from q 0 by a t.f.d. (resp. nspc. f.d., null f.d.) curve entirely contained in U. They are denoted respectively by I +(q 0, U), J +(q 0, U), and N +(q 0, U). In the general case, all we can say about reachable sets is that intI +(q 0, U) ≠ ∅, and that the three reachable sets have the same interiors and closures with respect to U. In order to be able to say something more, we need to make certain assumptions on U. To this end, let us notice that if U is sufficiently small, then our sub-Lorentzian metric can be extended to a Lorentzian metric, say g͂, on U. So U is said to be a normal neighborhood of q 0 if it is a convex normal neighborhood of q 0 with respect to g͂, and U is contained in some other convex normal neighborhood of q 0 with respect to g͂ (see [8] for a constructive definition of normal neighborhoods). Now, if U is a normal neighborhood of q 0, we know that J +(q 0, U) is closed with respect to U and moreover cl U (intI + (q 0, U)) = cl U (intN +(q 0, U)) = J + (q 0, U), where cl U stands for the closure with respect to U. Note that unlike the Lorentzian case, the boundary ∂͂ J +(q 0, U) (here and below, ∂͂ means the boundary with respect to U) may contain timelike curves starting from q 0. It can be proved [6] that such curves are abnormal curves for the underlaying distribution H (see [11] for a definition); they are also Goh curves (cf. [1]), but we do not need this latter fact in this paper. Let X 0,...,X k be an orthonormal frame for (H, g) defined on an open set U. We define the so-called geodesic (sub-Lorentzian) Hamiltonian ℋ : T ∗ U → ℝ, by formula ℋ(q, p) = \(-\frac {1}{2} \left \langle p,X_{0}(q)\right \rangle +\frac {1}{2}\sum _{i=1}^{k}\left \langle p,X_{i}(q)\right \rangle \) (it is possible to define ℋ in a global and invariant way; see [6]). Now, a curve γ : [a, b] → U is said to be a Hamiltonian geodesic if it can be represented as γ(t) = π ◦ Γ(t) where π : T ∗ M → M is the canonical projection and Γ̇ = ℋ⃗, ℋ⃗ being the Hamiltonian vector field corresponding to the function ℋ. Hamiltonian geodesics do not change their causal character; moreover, null f.d. Hamiltonian geodesics are [6] locally geometrically optimal. Finally, let U be an open subset in (M, H, g), and suppose that φ : U → ℝ is a smooth function. The horizontal gradient of the function φ is defined to be the vector field ∇ H φ such that d q φ(v) = g( ∇ H φ(q), v) for every v ∈ H q , q ∈ U. One easily makes sure that if ∇ H φ is null f.d. on U and γ : [a, b] → U is t.f.d. (nspc.f.d.), then the function [a, b] ∋ t → φ(γ(t)) is decreasing (nonincreasing).
1.2 Statement of the Results
In papers [5,7], we studied on the contact sub-Lorentzian structures on ℝ3. On the other hand, in [8,9] (generalized) Martinet sub-Lorentzian structures of Hamiltonian type of order k were studied, i.e., structures that, among other conditions imposed on them, are not contact on a hypersurface or, speaking in another way, structures whose Martinet surface is smooth. As a next step, it is reasonable to consider structures with the simplest non-smooth Martinet surface S, i.e., where S is a union of transversely intersecting smooth hypersurfaces. In order to formulate necessary assumptions, let us introduce a notion of the hyperbolic angle on a sub-Lorentzian manifold (M, H, g). Let v 1, v 2 ∈ H q , q ∈ M, be t.f.d. vectors. The hyperbolic angle between v 1 and v 2 is the number ∢(v 1, v 2) ≥ 0 defined by
where ∥v∥ = ∣g(v, v)∣1/2; by the reverse Schwarz inequality \(-\frac {g(v_{1},v_{2})}{\left \| v_{1}\right \| \left \| v_{2}\right \| }\geq 1\), so the definition makes sense. If L 1 = Span {v 1}, L 2 = Span {v 2} are 1-dimensional timelike linear subspaces in H q with v 1, v 2 being chosen to be t.f.d., then we put
Now, we come to the precise definition of the type of sub-Lorentzian structures that we are going to consider in this paper. So let H be a bracket generating distribution of constant rank equal to 2, defined in a neighborhood U of 0 ∈ ℝ3. We say that H satisfies the condition (M 2, 2) if the following conditions are satisfied:
-
(i)
There exist smooth hypersurfaces S 1 and S 2 such that the intersection Γ = S 1 ∩ S 2 is smooth of dimension 1, contains the origin, and is transverse to H; moreover, for each q ∈ Γ, dim (T q S i ∩ H q ) = 1, i = 1, 2;
-
(ii)
H defines a contact structure on U∖(S 1 ∪ S 2);
-
(iii)
\(H_{q}^{2}\) = H q and \(H_{q}^{3}\) = T q R 3 whenever q ∈ (S 1 ∪ S 2)∖Γ;
-
(iv)
\(H_{q}^{4}\) = T q R 3 whenever q ∈ Γ.
The set S = S 1 ∪ S 2 will be called the Martinet surface for H. Note that S is foliated by abnormal curves for the distribution H. Next, we chose a Lorentzian metric g on H in the way similar as in [8,9]:
-
(v)
The field of directions S i ∋ q → T q S i ∩ H q is timelike, i = 1, 2;
-
(vi)
The function S 1 ∩ S 2 ∋ q → ∢ (T q S 1 ∩ H q , T q S 2 ∩ H q ) is constant;
-
(vii)
The abnormal curves foliating S are, up to a change of parameter, t.f.d. Hamiltonian geodesics.
As we shall see, the two latter assumptions are used only in the process of constructing our normal form.
We will say that a sub-Lorentzian structure (H, g) is of type M 2, 2 if it satisfies conditions (i)–(vii). The sub-Lorentzian structure (H, g) is analytic if all objects entering its definition (e.g., the Martinet surface) are analytic.
Theorem 1.1
Let (H, g) be a time-oriented analytic sub-Lorentzian structure of type M 2, 2 defined on a neighborhood U of the origin in ℝ3. Then, provided that U is sufficiently small, there exist analytic coordinates x, y, z on U in which (H, g) has an orthonormal frame in the normal form
where X is a time orientation; c 1 and c 2 are constants such that − 1 < c 2 < c 1 < 1, S = {y = c 1 x} ∪ {y = c 2 x} is the Martinet surface for H; and φ and ψ are analytic functions on U, ψ(0, 0, z) = 0.
Using Theorem 1.1, we then investigate the structure of reachable sets. Let
It is not difficult to see what the geometric interpretation of Eq. 1.2 is. Let α = ∢ (T q S 1 ∩ H q , T q S 2 ∩ H q ) for a q ∈ Γ; Eq. 1.1 implies that \(\cosh \alpha =\frac {1-c_{1}c_{2}}{\sqrt {1-c_{1}^{2}}\sqrt {1-c_{2}^{2}}} \) and at the same time \(\sinh \alpha =\frac {c_{1}-c_{2}}{\sqrt {1-c_{1}^{2}} \sqrt {1-c_{2}^{2}}}\). Thus,
is an invariant for metrics of type M 2, 2. In particular, the sign of W has an invariant meaning and does not depend on the choice of coordinates. Note by the way that W > 0 ( W < 0, W = 0) if and only if \(\tanh \alpha > \frac {1}{2}\) (\(\tanh \alpha <\frac {1}{2}\), \(\tanh \alpha =\frac {1}{2}\)). As we are about to see, the sign of W is determinative for the structure of reachable sets. More precisely, we will prove two theorems.
Theorem 1.2
Suppose that (H, g) is a sub-Lorentzian structure defined on a suitably small normal neighborhood U of the origin by an orthonormal frame X, Y in the normal form (1.1) with X being a time orientation. Then, if W(c 1, c 2) > 0, there exist two analytic functions η 1, η 2 : U → ℝ such that the reachable sets from the origin for (H, g) are of the following form:
where
In particular, the three reachable sets are semi-analytic.
Note that in cases covered by Theorem 1.2, there are no timelike curves in the boundary ∂͂ J +(0, U), and the structure of reachable sets is the same as in the contact case.
Theorem 1.3
Suppose that (H, g) is a sub-Lorentzian structure defined on a suitably small normal neighborhood U of the origin by an orthonormal frame X, Y in the normal form (1.1) with X being a time orientation. Then, if W(c 1, c 2) < 0, there exist six analytic functions η 1, η 2, ξ ij : U → ℝ, i, j = 1, 2, and two 2-dimensional semi-analytic sets Σ 1, Σ 2 with the property that U ∩ {x ≥ 0} ∩ {z ≥ 0} ∩ {c 2 x ≤ y ≤ x}∖Σ 1 has two connected components \(\Sigma _{1}^{+}\), \(\Sigma _{1}^{-}\), and U ∩ {x ≥ 0} ∩ {z ≤ 0} ∩ {−x ≤ y ≤ c 1 x}∖Σ 2 has two connected components \( \Sigma _{2}^{+}\), \(\Sigma _{2}^{-}\), such that
where
In particular, the three reachable sets are semi-analytic.
Note that in cases covered by Theorem 1.3, there are two timelike curves on the boundary ∂͂ J +(0, U). It is also seen that in such cases, neither I +(0, U) is open nor N +(0, U) is closed.
The structure of all geometrically optimal curves in cases treated in Theorems 1.2 and 1.3 is described in Section 2. Let us notice that proofs of Theorems 1.2 and 1.3 give a sort of “algorithm” for computing the reachable sets. Moreover, the presented results bear a geometric character, so having proved Theorems 1.2 and 1.3 and applying a remark similar to Remark 4.4 in [9], we no longer have to transform our structure to the normal form in order to compute reachable sets.
Similarly as it was done in some previous papers by the author, all the above results can be applied to the study on reachable sets for control affine systems induced by sub-Lorentzian metrics of type M 2, 2. Let
be a control affine system defined on M, where M is an open subset of ℝ3 (or a 3-manifold). We suppose the fields X, Y to be linearly independent. Fix a point q 0 ∈ M and its neighborhood U ⊂ M. We will consider the reachable set 𝒜[a, b](q 0, U) from q 0 for the system (1.3) which is defined to be the set of endpoints of all trajectories of Eq. 1.3 that are contained in U, initiate at q 0, and are generated by measurable controls u : [0, T] → [a, b]; here, T = T(u) depends on a control. It was proved in [9], Lemma 1.2, that the reachable set 𝒜[a, b](q 0, U) is equal to the future nonspacelike reachable set for the time-oriented sub-Lorentzian structure (H, g a, b) defined on M by declaring the frame
to be orthonormal with a time orientation Z a, b (in [9], we studied also on reachable sets 𝒜(a, b)(q 0, U) and 𝒜{a, b}(q 0, U)). Let us recall a notion of singular trajectories (cf. [4]) for affine control systems. So, a trajectory γ : [0, T] → [U of Eq. 1.3 is called a singular trajectory if it is generated by a control u(t) with values in the open interval (a, b) and is an abnormal curve for the distribution H. It is worth noting that singular trajectories satisfy necessary conditions for geometric optimality of Pontriagin maximum principle. In terms of the sub-Lorentzian metric (H, g a, b), singular trajectories are exactly timelike abnormal curves.
Now, suppose that X, Y is an orthonormal frame for a metric (H, g) of type M 2, 2. We say that the system q̇ = X + uY, u ∈ [a, b], is typical in class M 2, 2 if \(W\left (\frac {2c_{1}-b-a}{b-a},\frac { 2c_{2}-b-a}{b-a}\right )\neq 0\) (cf. 5.9), where c 1 and c 2 are constants obtained by transforming (H, g) to normal form (1.1). Then, we can prove the following:
Theorem 1.4
Suppose that the system (1.3) is typical in class M 2, 2. If it has k geometrically optimal singular trajectories initiating at q 0 (in our case, k = 0, 1, 2), then the minimal number of analytic functions that one needs to describe the reachable set 𝒜[a, b](q 0, U) is 2 + 2k.
The above theorem also holds for all cases treated in [9]. Thus, the presence on the boundary ∂͂ 𝒜[a, b](q 0, U) of a singular trajectory initiating at q 0 increases (at least in the described cases) by two the minimal number of analytic functions needed for describing the reachable set 𝒜[a, b](q 0, U). It would be interesting to know whether this observation can be extended to a more general class of (not necessarily affine) control systems.
1.3 Organizations of the Paper
Section 2 is devoted to computing reachable sets for the so-called flat structures—they correspond to normal form (1.1) with φ and ψ set to be equal to zero. In Section 3, we compute normal forms. More precisely, we prove Theorem 3.1 which gives normal forms in a more general situation than that treated in the present paper and which can be a starting point for further studies. Theorem 1.1 is then a corollary of Theorem 3.1. In Section 4, we generalize global results from Section 2 to local results in a general (i.e., nonflat) situation of type M 2, 2 in cases where W(c 1, c 2) ≠ 0. In Section 5, we apply the results obtained for sub-Lorentzian structures to control affine systems. Proofs of the lemmas from Section 2.2 are contained in Section 6. Section 7 contains 3-dimensional visualizations of examples of reachable sets studied in Section 2. In the Appendix, we state some corollaries concerning the image under exponential mapping and also conjugate and cut loci.
Some proofs of the results presented in the paper are omitted since they are similar to those from [9].
2 Reachable Sets in the Flat Case
In this section, we study on reachable sets from the origin for the sub-Lorentzian structure (Ĥ, ĝ) defined by an orthonormal basis
with a time orientation X̂ where we assume that −1 < c 2 < c 1 < 1. Let S i = {(x, y, z) : y = c i x}. We see that the Martinet surface S in our case is equal to S 1 ∪ S 2. The structure (or a metric) (Ĥ, ĝ) will be called flat. This is because Eq. 2.1 is a particular case of Eq. 1.1 where φ and ψ has been set to zero. Hence, any structure as in Eq. 1.1 can be regarded as a perturbation of the flat structure; see Section 4 for some applications of this observation.
Reachable sets from the origin for Eq. 2.1 will be denoted respectively by Ĵ +(0) = J +(0, ℝ3), Î +(0) = I +(0, ℝ3), and N̂ +(0) = N +(0, ℝ3). First of all, it is obvious that
and
As in the previous papers by the author, the key role in the process of constructing functions describing reachable sets is played by the signs of the z-coordinates of the fields
Let
and
Since (X̂ − Ŷ) (z) > 0 on Γ1 and (X̂ + Ŷ) (z) < 0 on Γ2, similarly as in [7–9], we construct two functions η̂ 1 and η̂ 2. η̂ 1 is the solution to the Cauchy problem
and η̂ 2 is the solution to the Cauchy problem
After calculations, we obtain
and
As in the previous papers, we need to know the horizontal gradient ∇ Ĥ η̂ i of the function η̂ i with respect to (Ĥ, ĝ), i = 1, 2. Clearly, ∇ Ĥ η̂ i = − X̂ (η̂ 1)(X̂ − Ŷ) where
and ∇ Ĥ η̂ 2 = −X̂ (η̂ 2) (X̂ + Ŷ) where
Recall that in (1.2) in Section 1, we defined the polynomial W = c 1 c 2 + 2c 1 − 2c 2 − 1. As it was announced, the sign of W determines the structure of the reachable set for Eq. 2.1. Indeed, it is easy to check that
with
and
with
Let us notice here that c 1 c 2 − 2c 1 + 2c 2 − 1 = (c 1 c 2 − 1) − 2 (c 1 − c 2) < 0 for all c 1 and c 2 such that − 1 < c 2 < c 1 < 1.
2.1 The Case W < 0
This case is the simplest because, as it can be seen from Eqs. 2.6 and 2.8, ∇ Ĥ η̂ i , i = 1, 2 is null f.d. in the whole sector {−x < y < x}. Thus, using similar arguments as, e.g., in [7] or [8], we have
Proposition 2.1
If W < 0 then
and
where
Let us remark that
This means that there are no geometrically optimal timelike (and hence abnormal) curves starting from 0, and in what follows, only two functions suffice to describe the reachable sets.
To visualize better how the reachable set looks like, let us list all geometrically optimal curves. To this end, introduce the following notation. If Z 1, Z 2 are two vector fields on ℝ3, then by Z 1 Z 2, we will mean the curve which is a concatenation of a segment of the trajectory of Z 1 starting from 0, and a segment of a trajectory of Z 2. Using such a notation, every geometrically optimal curve is either (X̂ + Ŷ) (X̂ − Ŷ) or (X̂ − Ŷ) (X̂ + Ŷ). The intersection of ∂Ĵ +(0) with the plane {x = const > 0} is schematically presented in Fig. 1.
Points A and B lie on the plane {z = 0}. A (resp. B) corresponds to half-line {y = −x, z = 0} (resp. {y = −x, z = 0} ). The curve BA located above the straight line joining A and B represents ∂Ĵ +(0) ∩ {z ≥ 0} and is formed by the trajectories (X̂ + Ŷ) (X̂ − Ŷ); the curve AB located below this straight line represents ∂Ĵ +(0) ∩ {z ≤ 0} and is formed by the trajectories (X̂ − Ŷ) (X̂ + Ŷ).
2.2 The Case W > 0
In order to simplify reading of this subsection, proofs of all lemmas are moved to Section 6.
This case is more complicated since now E i , i = 1,...,4 are real. First of all, we must examine constants E 1,..., E 4. Obviously, E 2 < E 1 and E 4 < E 3. Moreover, we have two lemmas.
Lemma 2.1
The following inequalities hold true: − 1 < E 2 < E 1 < 1, E i < c i , i = 1, 2.
Lemma 2.2
The following inequalities hold true: − 1 < E 4 < E 3 < 1, c 2 < E 4, c 1 < E 3.
Using Eqs. 2.6 and 2.8, we conclude that ∇ Ĥ η̂ 1 is null f.d. on {E 1 x < y < x}, while ∇ Ĥ η̂ 2 is null f.d. on {−x < y < E 4 x}. Hence, we need more functions to describe the reachable sets from the origin.
First, we will compute Ĵ +(0) ∩ {z ≥ 0} . Since (X̂ + Ŷ) (z) > 0 for y > c 2 x, and (X̂ − Ŷ) (z) > 0 for y < c 2 x, everything in a neighborhood of the plane {y = c 2 x}, it is natural to consider the following Cauchy problems:
with the solution equal to
and
with the solution equal to
Now, we examine horizontal gradients ∇ Ĥ ξ̂ 1i , i = 1, 2. So ∇ Ĥ ξ̂ 11 = − X̂ (ξ̂ 11)(X̂ + Ŷ) with
and ∇ Ĥ ξ̂ 12 = − X̂ (ξ̂ 12)(X̂ − Ŷ) with
Using Eq. 2.11, it is easy to see that ∇ Ĥ ξ̂ 12 is null f.d. in {−x < y < c 2 x}. Indeed, if c 1 − 3c 2 − 2 ≥ 0, then we have (2c 1 c 2 + 3c 1 − c 2) x + c 1 − 3c 2 − 2 y ≥ 2 (c 2 + 1) (c 1 + 1) x > 0. If, on the other hand, c 1 − 3c 2 − 2 < 0, then (2c 1 c 2 + 3c 1 − c 2) x + (c 1 − 3c 2 − 2) y > (2c 1 c 2 + 3c 1 − c 2) x + c 2(c 1 − 3c 2 − 2) x = 3 (c 2 + 1) (c 1 − c 2) x > 0. Also, since c 1 − 3c 2 + 2 = c 1 − c 2 + 2 (1 − c 2) > 0, we see that ∇ Ĥ ξ̂ 11 is null f.d. for \(y<-\frac {2c_{1}c_{2}-3c_{1}+c_{2}}{ c_{1}-3c_{2}+2}x\).
Lemma 2.3
\(c_{1}<-\frac {2c_{1}c_{2}-3c_{1}+c_{2}}{c_{1}-3c_{2}+2}<1.\)
Now, by Lemmas 2.1 and 2.3, it is clear that \(E_{1}<-\frac { 2c_{1}c_{2}+c_{2}-3c_{1}}{c_{1}+2-3c_{2}}\). We will compute the intersection \(\left \{ \hat {\eta }_{1}=0\right \} \cap \left \{ \hat {\xi }_{11}=0\right \} \cap \left \{ E_{1}x<y<-\frac {2c_{1}c_{2}+c_{2}-3c_{1}}{c_{1}+2-3c_{2}}x\right \}. \) Evidently,
On the other hand,
We need the following:
Lemma 2.4
Let f (x, y) = 4 (x − y)3 + (y − 1)2 (x + 1)2 (xy − 2x + 2y − 1 ) be a function considered on the set D = {(x, y) : −1 < y < x < 1}. Then f < 0 on D.
Lemma 2.4 and Eq. 2.13 give
Moreover, (X + Y)(ξ̂ 11 − η̂ 1) = − (X + Y)(η̂ 1) < 0 on {c 2 x < y < E 1 x}, and (X + Y) (ξ̂ 11 − η̂ 1) > 0 on {E 1 x < y < x}. Now, let us sum up what we already know. By Eq. 2.12, the expression ξ̂ 1 − η̂ 1 (which is a homogeneous polynomial in x, y) is negative on y = c 2 x and decreases along the trajectories of X + Y in {c 2 x < y < E 1 x}. Then, ξ̂ 11 − η̂ 1 starts to increase and for y = c 1 x, it attains a positive value by Eq. 2.14. It follows that \(\left \{ \hat {\eta }_{1}=0\right \} \cap \left \{ \hat {\xi } _{11}=0\right \} \cap \left \{ E_{1}x<y<-\frac {2c_{1}c_{2}+c_{2}-3c_{1}}{ c_{1}+2-3c_{2}}x\right \} \) is of the form y = S 1 x with E 1 < S 1 < c 1. In this way, we arrive at the following:
where
The signs of z-coordinates of the fields X̂ + Ŷ, X̂ − Ŷ needed in the computation of ∂Ĵ +(0) ∩ {z > 0} are schematically illustrated in Fig. 2. Arrows pointing up correspond to the field X̂ + Ŷ, while those pointing down correspond to X̂ − Ŷ.
Now, we examine Ĵ +(0) ∩ {z ≤ 0}. Let us consider two Cauchy problems:
with the solution equal to
and
with the solution equal to
As above, we need to know the regions where horizontal gradients ∇ Ĥ ξ̂ 2i , i = 1, 2 are suitably directed. After calculations,
and
Equation 2.19 immediately yields that ∇ Ĥ ξ̂ 22 is null f.d. in {c 1 x < y < x}. Indeed, (−2c 1 c 2 − c 1 + 3c 2) x + (3c 1 − c 2 − 2) y < −2 (c 2 − 1) (c 1 − 1) x < 0 in this sector whenever 3c 1 − c 2 − 2 ≥ 0. On the other hand, if 3c 1 − c 2 − 2 < 0, then (−2c 1 c 2 − c 1 + 3c 2) x + (3c 1 − c 2 − 2) y < (−2c 1 c 2 − c 1 + 3c 2) x + c 1 (3c 1 − c 2 − 2) x = 3 (c 1 − 1) (c 1 − c 2) x < 0. Also by Eq. 2.18, we know that ∇ H ξ̂ 21 is null f.d. for \(y>-\frac { 2c_{1}c_{2}-c_{1}+3c_{2}}{-3c_{1}+c_{2}-2}x\). Indeed, it is enough to notice that −3c 1 + c 2 − 2 = − (c 1 − c 2) − 2 (c 1 + 1) < 0.
Lemma 2.5
\(-1<-\frac {2c_{1}c_{2}-c_{1}+3c_{2}}{-3c_{1}+c_{2}-2}<c_{2}.\)
Now, it follows that \(-\frac {2c_{1}c_{2}-c_{1}+3c_{2}}{-3c_{1}+c_{2}-2}<E_{4},\) and we will compute the intersection \(\left \{ \hat {\eta }_{2}=0\right \} \cap \left \{ \hat {\xi }_{21}=0\right \} \cap \left \{ -\frac { 2c_{1}c_{2}-c_{1}+3c_{2}}{-3c_{1}+c_{2}-2}x<y<E_{4}x\right \} \). We proceed similarly as above. So, first of all,
Next,
Lemma 2.6
The function f(x, y) = 4 (−x + y)3 − (x + 1)2 (y − 1)2 (xy − 2x + 2y − 1) is positive on D = {(x, y) : −1 < y < x < 1}.
Lemma 2.6 and Eq. 2.21 give
Now, Eqs. 2.20 and 2.22 imply that similarly as above, the set \( \left \{ \hat {\eta }_{2}=0\right \} \cap \left \{ \hat {\xi }_{21}=0\right \} \cap \left \{ -\frac {2c_{1}c_{2}-c_{1}+3c_{2}}{-3c_{1}+c_{2}-2}x<y<E_{4}x\right \} \) is of the form y = S 2 x with c 2 < S 2 < E 4. We deduce that
where
We conclude this section with the following:
Proposition 2.2
If W > 0, then
and
where  1,...,  6 are given by Eqs. 2.15–2.17, 2.23, 2.24, and 2.25, respectively, and A 7 = {y = c 1 x, z = 0, x ≥ 0}, A 8 = {y = c 2 x, z = 0, x ≥ 0}.
In this way, we see that there are two geometrically optimal timelike (abnormal) curves, and we need six analytic functions to describe reachable sets.
The signs of z-cordinates of the fields X̂ + Ŷ, X̂ − Ŷ, needed in the computation of ∂Ĵ +(0) ∩ {z < 0}, are illustrated in Fig. 3. As above, arrows pointing up (resp. down) correspond to the field X̂ + Ŷ (resp.X̂ − Ŷ).
Now, using the notation from the end of Section 2.1, we list all geometrically optimal curves in the case W > 0. They can be divided into two groups:
-
(i) Those forming ∂Ĵ +(0) ∩ {z ≥ 0}, i.e. (X̂ + Ŷ) (X̂ − Ŷ), (X̂ + c 2 Ŷ) (X̂ + Ŷ), (X̂ + c 2 Ŷ) (X̂ − Ŷ); note that (X̂ + Ŷ) (X̂ − Ŷ) and (X̂ + c 2 Ŷ) (X̂ + Ŷ) cease to be geometrically optimal when they reach the plane {y = S 1 x} (then they enter the interior int Ĵ +(0));
-
(ii) Those forming ∂Ĵ +(0) ∩ {z ≤ 0}, i.e. (X̂ − Ŷ) (X̂ + Ŷ), (X̂ + c 1 Ŷ) (X̂ − Ŷ), (X̂ + c 1 Ŷ) (X̂ + Ŷ); note that (X̂ + c 1 Ŷ) (X̂ − Ŷ) and (X̂ − Ŷ) (X̂ + Ŷ) cease to be optimal when they intersect the plane {y = S 2 x}. The intersection of the set ∂Ĵ +(0) with the plane {x = const > 0} is represented in Fig. 4.
The points A, B, C, D lie on the plane {z = 0}; A and B correspond to half-lines {y = − x, z = 0} and {y = x, z = 0}, while B and C to singular trajectories {y = c 2 x, z = 0} and {y = c 1 x, z = 0}, respectively. DE represents the surface formed by the trajectories (X̂ + Ŷ) (X̂ − Ŷ), BE is the surface made up of the trajectories (X̂ + c 2 Ŷ) (X̂ + Ŷ), and the point E corresponds to the set ∂Ĵ +(0) ∩ {y = S 1 x}, i.e., the place where the trajectories (X̂ + Ŷ) (X̂ − Ŷ) and (X̂ + c 2 Ŷ) (X̂ + Ŷ) meet. Finally, part BA of ∂Ĵ +(0) ∩ {z ≥ 0} is filled with trajectories (X̂ + c 2 Ŷ) (X̂ − Ŷ). Similarly, AF (resp. CF) is the surface formed by the trajectories (X̂ − Ŷ) (X̂ + Ŷ) (resp. (X̂ + c 1 Ŷ) (X̂ − Ŷ)), and CD corresponds to part of ∂Ĵ +(0) ∩ {z ≤ 0} which is made up of the trajectories (X̂ + c 1 Ŷ) (X̂ + Ŷ. The point F represents the set ∂Ĵ +(0) ∩ {y = S 2 x}, i.e., the place where the trajectories (X̂ − Ŷ) (X̂ + Ŷ) and (X̂ + c 1 Ŷ) (X̂ − Ŷ) meet.
2.3 The Case W = 0
Using Eqs. 2.6–2.9 and the condition W = 0, we see that in the case under consideration
and in what follows
Thus, ∇ Ĥ η̂ 1 is null f.d. on {−x < y < x} ∩ {y ≠ c 2 x}, and ∇ Ĥ η̂ 2 is null f.d. on {−x < y < x} ∩ {y ≠ c 1 x}. We can also see that η̂ 1(x, c 2 x, 0) = η̂ 2(x, c 1 x, 0) = 0. Moreover, ∇ Ĥ ξ̂ 11 is null f.d. for \(y<-\frac {2c_{1}c_{2}-3c_{1}+c_{2}}{c_{1}-3c_{2}+2}x\), and as above, we make sure that Eq. 2.14 holds together with the following:
Similar reasoning as in Section 2.2 shows that ξ̂ 11 − η̂ 1 is nondecreasing along trajectories of X + Y starting at y = c 2 x. It follows that ξ̂ 11 ≥ η̂ 1, and in turn,
Moreover,
which is in fact clear without calculations, since both functions satisfy the same linear differential equation with the same boundary conditions on the hypersurface y = c 2 x. Similar reasoning shows that
and
We sum up this subsection with the following:
Proposition 2.3
If W = 0 then
and
where
As we see, this case is very exceptional as compared to the previous cases with W ≠ 0. Namely, in spite of the fact that there are two geometrically optimal timelike curves, only two analytic functions suffice for describing reachable sets.
We list all geometrically optimal curves in this case: the curves forming ∂Ĵ +(0) ∩ {z ≥ 0}, i.e., the curves (X̂ + Ŷ) (X̂ − Ŷ), (X̂ + c 2 Ŷ) (X̂ − Ŷ), and the curves forming ∂Ĵ +(0) ∩ {z ≤ 0}, i.e., X̂ − ŶX̂ + Ŷ, X̂ + c 1 ŶX̂ + Ŷ. The set ∂Ĵ +(0) ∩ {x = const > 0} can be depicted similarly as in Fig. 4, but this time, the curves DB and BA correspond to trajectories of X̂ − Ŷ, and the curves AC and CD correspond to trajectories of X̂ + Ŷ.
Three-dimensional visualizations of reachable sets studied in this section are presented in Section 7.
3 Normal Forms
In this section, we consider more general sub-Lorentzian structures (H, g) than those dealt with in Theorem 1.1. At first, we describe the underlaying distribution H. So let H be a rank 2 distribution defined on a neighborhood U of the origin in ℝ3, and let l 1,...,l k ≥ 2 be positive integers. H will be said to satisfy the condition \((M_{l_{1},...,l_{k}})\) if it possesses the following properties:
-
(i) There exist smooth hypersurfaces S 1,...,S k in U, such that the intersection \(\Gamma =\bigcap _{i=1}^{k}S_{i}\) contains the origin, is smooth 1-dimensional, and transverse to H; moreover, for each q ∈ Γ and every i, j = 1,...,k, i ≠ j, dim (T q S i ∩ T q S j ) = 1, dim (T q S i ∩ H q ) = 1;
-
(ii) H defines a contact structure on \(U\backslash \bigcup _{i=1}^{k}S_{i}\);
-
(iii) For any fixed i = 1,...,k, \(H_{q}^{l}\subset H_{q}\), l ≤ l i , \(H_{q}^{l_{i}+1}=T_{q}\mathbb {R}^{3}\) on the set S i ∖Γ.
-
(iv) \(H_{q}^{l}\subset H_{q}\), l ≤ l 1 + ... + l k − k + 1, \(H_{q}^{l_{1}+...+l_{k}-k+2}=T_{q}\mathbb {R}^{3}\) for every q ∈ Γ.
Now, we choose a Lorentzian metric g. We make three assumptions:
-
(v) For each i = 1,...,k, the field of directions S i ∋ q → T q S i ∩ H q is timelike;
-
(vi) For every i, j = 1,...,k, i ≠ j, the function S i ∩ S j ∋ q → ∢(T q S i ∩ H q , T q S j ∩ H q ) is constant; and
-
(vii) The abnormal curves foliating the surfaces S 1,...,S k are, up to a change of parameter, t.f.d. Hamiltonian geodesics.
We will say that a sub-Lorentzian structure (or metric) (H, g) is of type \(M_{l_{1},...,l_{k}}\) on U if (i)–(vii) hold on U. The set \( S=\bigcup _{i=1}^{k}S_{i}\) is again called the Martinet surface for H. Note that structures of type M l are exactly generalized Martinet sub-Lorentzian structures studied in [9].
Our aim is to prove the following:
Theorem 3.1
Let (H, g) be a time-oriented analytic sub-Lorentzian structure of type \( M_{l_{1},...,l_{k}}\) defined on a neighborhood U of the origin in ℝ3. Then, provided that U is sufficiently small, there exist analytic coordinates x, y, z on U in which (H, g) has an orthonormal frame in the normal form
where X is a time orientation, c 1,...,c k are constants such that − 1 < c k < ... < c 1 < 1, S i = {y = c i x}, i = 1,...,k, and finally φ, ψ are analytic functions on U with ψ(0, 0, z) = 0.
We start with the following result.
Lemma 3.1
Suppose that (H, g) is analytic and satisfies the condition M 2,...,2 on a neighborhood U of the origin in ℝ3. Then, provided that U is sufficiently small, there are analytic coordinates x, y, z defined on U in which (H, g) admits an orthonormal frame in the following form
with X being a time orientation. Here, A, B are analytic functions, and S i = {y = c i x}, i = 1,...,k.
Proof
Fix an i, i = 1,...,k. Choose analytic coordinates x̂, ŷ, ẑ defined in a neighborhood of the origin such that S i = {ŷ = 0}, Γ = {(0, 0, ẑ)}, \(H_{|S_{i}}=\ker d \hat {z}\), and \(\frac {\partial }{\partial \hat {x}}_{|S_{i}}\), \(\frac {\partial }{\partial \hat {y}}_{|S_{i}}\) is an orthonormal basis for \(H_{|S_{i}}\) with a time orientation \(\frac {\partial }{\partial \hat {x}}\). Clearly, abnormal curves (which by (vii) are Hamiltonian geodesics) contained in S i satisfy transversality condition with respect to Γ; cf. [8], Lemma 3.1; see also [2]. Since the satisfaction of the transversality condition does not depend on a choice of coordinates, and, moreover, i was arbitrary, we see that every abnormal curve starting from Γ satisfies the transversality condition with respect to Γ. Now, by use of the exponential mapping and assumption (vii), similarly as it was done in [5,8], we are led to the existence of analytic coordinates x, y, z in which (H, g) has an orthonormal frame in the following form:
where S j ∩ {z = 0} = {y = c j x, z = 0}, ।c j । < 1, j = 1,...,k, S i = {y = 0} (i.e., c i = 0), and one can suppose that c k < ... < c 1. Using assumption (vi), we see that S j , j = 1,...,k, are all of the form S j = {y = c j x}.
Now, the z-coordinate of [X, Y] is equal to
By our assumptions \(\left [ X,Y\right ] _{|y=c_{i}x}\) is horizontal, i = 1,...,k. Take i = 1. Then, there exist analytic functions f(x, z), g(x, z) such that \(\left [ X,Y\right ] _{|y=c_{1}x}=f(x,z)X_{|y=c_{1}x}+g(x,z)Y_{|y=c_{1}x}\). This leads us to the equality of the z-coordinates
where A and its derivatives should be evaluated at (x, c 1 x, z). Suppose that A(x, c 1 x, z) does not vanish identically. Then, we can find z such that A(x, c 1 x, z) = a m (z)x m + o(x m) as x → 0 with a m (z) ≠ 0 and m > 0. Since \(x\left ( \frac {\partial A}{\partial x} +c_{1}\frac {\partial A}{\partial y}\right ) _{|y=c_{1}x}=x\left ( X+c_{1}Y\right ) (A)_{|y=c_{1}x}=ma_{m}(z)x^{m}+o(x^{m})\), Eq. 3.2 gives
so we arrive at a m (z) = 0 which is a contradiction. In this way, A may be replaced by the expression (y − c 1 x)A for some other analytic function A.
Repeating the argument for i = 2,...,k, we are lead to Eq. 3.1. □
Now, suppose that our structure (H, g), which is given by Eq. 3.1 on a neighborhood U of the origin, satisfies the condition \( M_{l_{1},...,l_{k}}\), l i ≥ 2, i = 1,...,k.
Fix an index i. If we set A 1 to be the function defined by y (y− c i x) A 1 = y (y − c 1 x) ... (y − c k x) A, the frame (3.1) takes the following form:
Applying the following change of coordinates
with tanh φ = c i (i.e., y − c i x = y͂/coshφ) and rewriting (3.3), we are led to
where
and \(\tilde {A}=\frac {A_{1}}{\cosh \varphi }\). Obviously (cf. [13]) X͂, Y͂ is again an orthonormal frame for (H, g) with a time orientation X͂, and we can apply to it the same method as in [9], Proposition 3.2. As a result, Eq. 3.5 can be written as follows:
with \(\tilde {A}=\tilde {y}^{l_{i}-2}\hat {A}\). Passing again to Eq. 3.4, we obtain the following:
i.e., to say
for some new analytic function A. Repeating the same argument for every i = 1,...., k, we are led to
for yet another analytic A about which we know that does not contain terms of the form (y − c i x)l. Equation 3.6 is, in fact, all we can get without assuming (iv).
Now, we take (iv) into account. To obtain a condition for A, we need l 1 + ... + l k − k + 1 differentiations of X(z) or Y(z), i.e., we have to consider sections of \(H^{l_{1}+...+l_{k}-k+2}\). So let W be a (local) section of \(H^{l_{1}+...+l_{k}-k+2}\) defined near zero. Then, looking at Eq. 3.6, we see that W(z) = CA + O(r), \(r=\sqrt {x^{2}+y^{2}+z^{2}}\), where by (iv) C ≠ 0 for suitable chosen W. Setting x = y = 0, we arrive at A(0, 0, z) ≠ 0. Last stage is to renormalize the z-axis, so as to have \(A(0,0,z)=\frac {1}{2}\). This can be done similarly as, e.g., in [9]. To end the proof, we write φ = − B, ψ = 2A − 1.
4 Reachable Sets in the General Case
In this section, by (H, g) we denote a fixed time-oriented sub-Lorentzian metric of type M 2, 2, defined on a normal neighborhood U of the origin in ℝ3. Throughout this section, we assume that U is as small as we need. We may suppose that (H, g) is already transformed to the normal form. So let X, Y be an orthonormal frame for (H, g) given on U by Eq. 1.1. In cases W > 0, W < 0, we will use the same method to compute local reachable sets as in [7–9]. The mentioned method, however, does not work when W = 0, and it seems impossible to arbitrate in advance what the structure of the reachable set will be in this case.
Let X = X̂ + X 1, Y = Ŷ + Y 2, and X̂, Ŷ be as in Eq. 2.1, and
4.1 The Case W < 0
Similarly as in Section 2, consider two Cauchy problems:
with the solution denoted by η 1, and
with the solution denoted by η 2. We write η 1 = η̂ 1 + R 1, η 2 = η̂ 2 + R 2. It is seen that R 1 and R 2 satisfy, respectively,
and
Clearly, (X 1 − Y 1)(η̂ 1) = Or 4, \(r=\sqrt {x^{2}+y^{2}+z^{2}}\). Since η 1 − z is divisible by x 2 − y 2( y = − x is the trajectory of X − Y starting at (0, 0, 0)), we deduce that R 1 = O (r 5). Similarly, R 2 = O (r 5) which, in view of η̂ i = ± z + O (r 4), means that η i may be regarded as a perturbation of η̂ i , i = 1, 2. Exactly, e.g., as in subsection 4.1 of [9], we prove that X(η 1) is divisible by x − y. It follows that ∇ H η 1 = − X(η 1)(X − Y) where, by using Eq. 2.4, we have
It follows that X(η 1) < 0 on U ∩ {−x < y < x}, and ∇ H η 1 is null f.d. on U ∩ {−x < y < x}. Similarly,
from which X(η 2) < 0 on U ∩ {−x < y < x}, and hence, ∇ H η 2 is null f.d. on U ∩ {−x < y < x}. We finish the proof of Theorem 1.2 as in the Section 2.1.
Geometrically, optimal trajectories in this case are the same as in the corresponding flat case, with X̂ (resp. Ŷ) replaced by X (resp. Y). Also, reachable sets look similarly; cf. Fig. 1.
4.2 The Case W > 0
Here, X(η 1) and X(η 2) are again given by Eqs. 4.2 and 4.3, respectively. This time, however, X(η 1) < 0 on {(E 1 + ε)x < y < x} ∩ U and X(η 2) < 0 on {−x < y < (E 2 − ε)x} ∩ U, where ε > 0 will be supposed to be sufficiently small. Next, we define functions ξ 11, ξ 12 as solutions to the following Cauchy problems:
and
respectively. As above, we write ξ 11 = ξ̂ 11 + R 11, ξ 12 = ξ̂ 12 + R 12, where, e.g., R 11 satisfies (X + Y)(R 11) = − (X 1 + Y 1)ξ̂ 11, R 11(x, c 2 x, z) = 0. It follows that R 11 = Or 5, and similarly R 12 = O(r 5). So again, we may think of ξ 1i as being perturbations of ξ̂ 1i , i = 1, 2. Now, since \(X+c_{2}Y_{|y=c_{2}x}=\frac {\partial }{\partial x}+c_{2} \frac {\partial }{\partial y}\), (X + c 2 Y)(ξ 11)| y = c 2x = 0 by definition of ξ 11. But also (X + Y)(ξ 11) = 0, from which \(X(\xi _{11})_{|y=c_{2}x}=0\), and therefore X(ξ 11) is divisible by y − c 2 x. We prove analogously that also X(ξ 12) is divisible by y − c 2 x. However, this is not enough for our purposes, and we need the following:
Lemma 4.1
X(ξ 11) and X(ξ 12) are divisible by (y − c 2 x)2.
Proof
We prove the first statement. We already know that X(ξ 11) = (y − c 2 x)g for an analytic function g. Since [X, X + Y] = 0 on {y = c 2 x}, it follows that (X + Y)(X(ξ 11)) = X(X + Y)(ξ 11) = 0 on {y = c 2 x}, where
By setting y = c 2 x, we arrive at \((1-c_{2})\left [ 1+(c_{2}+1)x^{2}\varphi \right ] g_{|y=c_{2}x}=0\), and the proof is over since g must be divisible by y − c 2 x (recall that U is as small as we need).
The proof of the second statement is analogous. We notice that [X, X − Y] = 0 on {y = c 2 x}, so (X − Y)(X(ξ 11)) = X(X − Y)(ξ 11) = 0 on {y = c 2 x} and continue in the same manner. □
Making use of Eqs. 2.10, 2.11, and the above lemma, ∇ H ξ 11 = − X(ξ 11)(X + Y) with
and ∇ H ξ 12 = − X(ξ 12)(X − Y) with
This, of course, implies that ∇ H ξ 11 is null f.d. on \(\left \{ c_{2}x<y<\left ( -\frac {2c_{1}c_{2}-3c_{1}+c_{2}}{c_{1}-3c_{2}+2}-\varepsilon \right ) x\right \} \cap U\), and ∇ H ξ 12 is null f.d. on − x < y < c 2 x ∩ U.
Using just the presented considerations and remembering Section 2.2, we may suppose that ξ 11 < η 1 on {c 2 x < y < (E 1 + ε)x} ∩ U, while ξ 11 > η 1 on {(c 1 − ε)x < y < x} ∩ U or, which is more convenient to us, that ξ 11 < η 1 on {c 2 x < y < (S 1 − ε)x} ∩ U, while ξ 11 > η 1 on {(S 1 + ε)x < y < x} ∩ U.
Let us define a set Z 1 by
clearly Z 1 is a semi-analytic set (cf. [12]). As in [8] and [9], we make sure that dimZ 1 = 1 and that Z 1 is made up of a single analytic curve entering the origin. Further, let us define a semi-analytic set by Σ 1 = ρ − 1(ρ(Z 1)) ∩ U where ρ : ℝ3 → ℝ2 is the projection (x, y, z) → (x, y), and let \(\Sigma _{1}^{+}\), \(\Sigma _{1}^{-}\) be the two connected components of U ∩ {x ≥ 0} ∩ {z ≥ 0} ∩ {c 2 x ≤ y ≤ x}∖Σ 1, containing half-lines y = x, x ≥ 0 and y = c 2 x, x ≥ 0, respectively. All what we have just said leads us to
where
as it was announced in Theorem 1.3.
Quite similar considerations can be carried out to describe the set J +(0, U) ∩ {z ≤ 0}. We only note that this time, we define a 1-dimensional semi-analytic set
Then, we set Σ 2 = ρ − 1(ρ(Z 2)) ∩ U and define \(\Sigma _{2}^{+}\), \(\Sigma _{2}^{-}\) to be the two connected components of U ∩ {x ≥ 0} ∩ {z ≤ 0} ∩ {−x ≤ y ≤ c 1 x}∖Σ 2, containing half-lines y = c 1 x, x ≥ 0, and y = − x, x ≥ 0, respectively. Finally, we obtain
where
This terminates the proof of Theorem 1.3.
All geometrically optimal curves are listed in Section 2.2 (again with X̂, Ŷ replaced by X, Y), and ∂͂ J +(0, U) ∩ {x = const > 0} can be depicted as in Fig. 4.
4.3 The Case W = 0
Here, as it was mentioned earlier, we are not able to say what the structure of reachable sets is. This is, for instance, because the relation η̂ 1(x, c 2 x, 0) = 0 may no longer be true after perturbation. Therefore, we cannot predict the sign of the expression η 1(x, c 2 x, 0) even in a small neighborhood of the origin without computing higher-order terms in the expression for η i . We will not do it in this paper.
4.4 Nilpotent Approximations
Note that all the functions describing reachable sets in the flat case, i.e., η̂ i , ξ̂ ij , and i, j = 1, 2, are homogeneous with respect to the family of dilatations δ t (x, y, z) = (tx, ty, t 4 z). In other words, the mentioned functions are homogeneous when we prescribe the following weights to variables: weight (x) = weight(y) = 1, weight (z) = 4. It also means that the flat structure is the nilpotent approximation for structures given by Eq. 1.1; cf. [3].
5 Applications to Control Affine Systems
Let us consider a control affine system
defined on a neighborhood U of the origin in ℝ3, where X and Y are supposed to be an orthonormal frame for the sub-Lorentzian metric of type M 2, 2. We can assume that X and Y are given by Eq. 1.1. As it was mentioned in Section 1, the reachable set 𝒜[a, b](0, U) for Eq. 5.1 coincides with the future nonspacelike reachable set J +(0, U) for the time-oriented sub-Lorentzian structure (H, g a, b) defined by declaring the frame
to be orthonormal with a time orientation Z a, b. Thus, 𝒜[a, b](0, U) = J +(0, U) is just the reachable set for the system
(cf. Lemmas 1.1 and 1.2 in [9]).
The aim of this section is to prove Theorem 1.4. To save space, we will not give exact formulas for functions describing reachable sets. We will restrict ourselves only in examining the structure of reachable sets and its dependence on geometric optimality of singular trajectories.
The first evident observation is that
5.1 The Case c 1, c 2 ∉ (a, b)
In this case, there are no singular trajectories starting from the origin for Eq. 5.2 (and hence for Eq. 5.1 since both systems are equivalent). As in [7,9] or as above, we investigate z-coordinates of Z a, b ± W a, b. So,
and
have opposite signs on {ax < y < bx} ∩ U. In this way, again as in [7,9], 𝒜[a, b](q 0, U) is described by two analytic functions.
5.2 The Case c 2 ≤ a < c 1 < b or a < c 2 < b ≤ c 1
In this case, the system (5.2) has one singular trajectory initiating at the origin. Again, we examine the signs of z-coordinates of Z a, b ± W a, b. Thus, in the first case, (5.4) on {y = ax} and Eq. 5.5 on {y = bx} are both positive, while in the second case, they are both negative. Also, Eqs. 5.4 and 5.5 are negative near {y = c 1 x} in the first case, while Eqs. 5.4 and 5.5 are positive near {y = c 2 x} in the second case. Therefore, we arrive at the similar situation as in [9], and a similar reasoning as in [9] leads to the conclusion that the minimal number of analytic functions needed for describing 𝒜[a, b](q 0, U) is four.
5.3 The Case a < c 2 < c 1 < b
In this case, the system (5.2) has two singular trajectories initiating at the origin. In order to simplify the situation, we make the following change of coordinates in Eq. 5.2:
The resulting system is as follows:
where
with
and
Note that by rescaling the z͂ -axis, we can always get rid of the factor \(\left ( \frac {b-a}{2}\right ) ^{3}\), so we no longer take care of it. Since the change of coordinates (5.6) is bi-analytic, transforms straight lines onto straight lines, and preserves geometric optimality of trajectories, the reachable set for Eq. 5.7 is described by the same number of analytic functions as 𝒜[a, b](q 0, U) and has the same number of geometrically optimal singular trajectories. Now, we repeat the above arguments. Reachable sets for Eq. 5.7 with φ = ψ = 0 in Eq. 5.8 are computed according to Section 2. Reachable sets for Eq. 5.7 with arbitrary φ and ψ in Eq. 5.8 are computed according to Section 4 (here, we should remark that although Eq. 5.8 does not coincide with Eq. 1.1, the argument still works since, as one easily checks, functions describing reachable sets for arbitrary φ and ψ are again perturbations of functions describing reachable sets for φ = ψ = 0). To sum up, 𝒜[a, b](q 0, U) is described by six analytic function whenever W(c͂ 1, c͂ 2) > 0 and by two analytic functions whenever W(c͂ 1, c͂ 2) < 0. We also know that in case W(c͂ 1, c͂ 2) = 0 and φ = ψ = 0, two analytic functions suffice.
In this way the proof of Theorem 1.4 is over.
Remark 5.1
At the end, let us note that the presented method of analysis of reachable sets works for affine control systems induced by any sub-Lorentzian structure described by Theorem 3.1, but because of a large number of constants, it is difficult to state a general theorem.
5.4 Two Examples
To illustrate how the presented methods work in practice, let us consider two examples.
Example 5.1
Suppose that we are interested in the structure of the reachable set from the origin for the following affine control system:
where
This system is equivalent to the sub-Lorentzian structure (H, g − 1, 4) defined by an orthonormal basis Z − 1, 4, W − 1, 4 with Z − 1, 4 being a time orientation, where
According to Eq. 5.6, we change coordinates as follows: x͂ = x, \(\tilde {y}=-\frac {3}{5}\tilde {x}+\frac {2}{5}\tilde {y}\), z͂ = z, and as a result, we have
We end up with \(\tilde {c}_{1}=\frac {3}{5}\), \(\tilde {c}_{2}=\frac {1}{5}\). Since \(W\left (\frac {3}{5},\frac {1}{5}\right )<0\), we know that there are no geometrically optimal singular trajectories; hence, the reachable set from the origin for the system that we started with can be described by two analytic functions. ∂𝒜[−1, 4](0, ℝ) ∩ {x = const > 0} can be depicted as in Fig. 1, where this time A (resp. B) corresponds to half-line {y = − x, z = 0} (resp. {y = 4x, z = 0}). An approximate shape of 𝒜[−1, 4](0, ℝ) can be seen in Fig. 5.
Example 5.2
Consider again the system (5.10) where this time
According to the above procedure, we pass to the sub-Lorentzian structure induced by an orthonormal frame:
Making again the same change of coordinates, we are led to
so \(\tilde {c}_{1}=\frac {3}{5}\), \(\tilde {c}_{2}=-\frac {1}{5}\). But this time, \( W\left (\frac {3}{5},-\frac {1}{5}\right )>0\); therefore, we conclude that there are two geometrically optimal singular trajectories, and that we need six analytic functions to describe the reachable set 𝒜[−1, 4](0, ℝ) for the system we started with. The intersection ∂𝒜[−1, 4](0, ℝ) ∩ x = const > 0 can be represented, with obvious changes in the interpretation, as in Fig. 4. 𝒜[−1, 4](0, ℝ) looks approximately as in Fig. 7.
Remark 5.2
Let us notice in this place that the structure of reachable sets in the case a < c 2 < c 1 < b essentially depends on a and b. For instance, if in the last example with c 1 = 3, c 2 = 1, a = − 1, we let b vary then (cf. 5.9), the expression \(W\left (\frac {2c_{1}-b-a}{b-a},\frac { 2c_{2}-b-a}{b-a}\right )=W\left (\frac {6-b+1}{b+1},\frac {2-b+1}{b+1}\right )=-4\frac {b-7}{\left ( b+1\right ) ^{2}}\) changes sign when b exceeds 7.
6 Proofs of Lemmas from Section 2.2
In this section, we present the proofs of lemmas from Section 2.
Proof of Lemma 2.1
The proof relies on straightforward computations. For instance, E 1 < 1 is equivalent to
where 3c 1 c 2 − 6c 1 − 6c 2 + 9 = 3 (W + 4 − 4c 1) is positive. Squaring both sides of Eq. 6.1, we see that Eq. 6.1 is equivalent to
which is, of course, true. The remaining inequalities are proved analogously. □
Proof of Lemma 2.2
We prove, for instance, that c 2 < E 4. By Eq. 2.9, this is equivalent to
which, in turn, is equivalent to
□
Proof of Lemma 2.3
We will prove the first inequality. Since c 1 − 3c 2 + 2 > 0, the hypothesis is equivalent to − 2c 1 c 2 − 3c 1 + c 2 − c 1 c 1 − 3c 2 + 2 = 1 − c 1 c 1 − c 2 > 0 which is, of course, true. □
Proof of Lemma 2.4
We look for stationary points of f. Any such point (x, y) must satisfy the equality
Clearly, 3xy + 5y − 5x − 3 = 3(xy − 1) − 5(x − y) < 0 in D, thus, either x = − 1 or y = 1, or y = − x. If y = − x in D, we must have x > 0, but then \(\frac {\partial f}{ \partial x}(x,-x)=-3x^{5}-20x^{4}-46x^{3}-23x-4<0\), while \(\frac {\partial f}{ \partial y}(x,-x)=3x^{5}+20x^{4}+46x^{3}+23x+4>0\). It follows that all stationary points of f are contained in ∂D. By direct calculation, we make sure that f | ∂D ≤ 0 which implies f < 0 in D. □
Proof of Lemma 2.5
We prove the second inequality. Since − 3c 1 + c 2 − 2 < 0, the hypo- thesis is equivalent to − 2c 1 c 2 − c 1 + 3c 2 − c 2 − 3c 1 + c 2 − 2 = c 2 + 1 c 1 − c 2 > 0. □
Proof of Lemma 2.6
This is proved analogously as Lemma 2.4. □
7 Pictures
In this section, we present 3-dimensional visualizations of exemplary reachable sets studied in Section 2.
At first, consider the case \(c_{1}=\frac {3}{4}\), \(c_{2}=\frac {1}{2}\). Clearly, \(W\left (\frac {3}{4},\frac {1}{2}\right )=-\frac {1}{8}\), and in this case,
On all the following three figures, the plane z = 0 is marked with white color. The corresponding reachable set is bounded by two hypersurfaces: by {η̂ 1 = 0} from above (darker color) and by {η̂ 2 = 0} from below (lighter color) as it is presented in Fig. 5.
Next, consider the case \(c_{1}=\frac {1}{2}\), c 2 = 0. Now, \(W\left (\frac {1}{2},0\right )=0\), and
The reachable set in this case is presented in Fig. 6. As in the previous case, it is the set bounded by two hypersurfaces {η̂ i = 0}, i = 1, 2, with similarly chosen colors.
Finally, consider the case \(c_{1}=\frac {1}{2}\), \(c_{2}=-\frac {1}{2}\). This time \(W\left (\frac {1}{2},-\frac {1}{2}\right )=\frac {3}{4}\), and
The corresponding reachable set is presented in Fig. 7. It is the set which is bounded by six hypersurfaces {η̂ i = 0}, {ξ̂ ij = 0}, i, j = 1, 2.
References
Agrachev A, Sachkov Y. Control theory from geometric viewpoint. Encyclopedia of Mathematical Science. vol. 87. New York: Springer; 2004.
Agrachev A, Chakir El-A El-H, Gauthier JP. Sub-Riemannian metrics on ℝ3. Canadian Mathematical Society Conference Proceedings, Vol. 25; 1998.
Bellaïche A. The tangent space in the sub-Riemannian geometry. Sub-Riemannian Geometry. Cambridge: Birkhäuser; 1996.
Bonnard B, Chyba M. Singular trajectories and their role in control theory. Berlin: Springer; 2003.
Grochowski M. Normal forms of germs of contact sub-Lorentzian structures on ℝ3. Differentiability of the sub-Lorentzian distance. J Dyn Control Syst. 2003;9(4):531–547.
Grochowski M. Properties of reachable sets in the sub-Lorentzian geometry. J Geom Phys. 2009;59:885–900.
Grochowski M. Reachable sets for contact sub-Lorentzian structures on ℝ3. Application to control affine systems on ℝ3 with a scalar input. J Math Sci. 2011a;177(3):383–394.
Grochowski M. Normal forms and reachable sets for analytic martinet sub-Lorentzian structures of Hamiltonian type. J Dyn Control Syst. 2011b;17(1).
Grochowski M. The structure f reachable sets for affine control systems induced by generalized Martinet sub-Lorentzian metrics. ESAIM: COCV;18:1150–1177 doi:10.1051/cocv/2011202.
Korolko A, Markina I. Nonholonomic Lorentzian geometry on some ℍ-type groups. J Geom Anal. 2009;19(4):864–889.
Liu W, Sussmann H. Shortest paths for sub-Riemannian metrics on rank-two distributions. Mem Am Math Soc. 1995;118(564):x+104.
Łojasiewicz S. Ensembles semi-analytiques. Bures-sur-Yvette: Institut des Hautes études Scientifiques; 1964.
O’Neill B. Semi-Riemannain geometry with applications to relativity, Pure and Applied Series. vol. 103. New York: Academic; 1983.
Acknowledgments
This work was partially supported by the Polish Ministry of Research and Higher Education, grant NN201 607540. This paper was written during my stay in the Institut Mittag-Leffler in autumn 2011.
Author information
Authors and Affiliations
Corresponding author
Appendix
Appendix
In this section, we state some corollaries concerning sub-Lorentzian metrics of type M 2, 2. We start from definitions; cf. [8,9].
Let (M, H, g) be a sub-Lorentzian manifold. Fix a point q 0. Denote by \( \mathcal {D}_{q_{0}}\), the set of all covectors \(\lambda \in T_{q_{0}}^{\ast }M \) such that the curve t → Φ t(λ) is defined on the whole interval [0, 1]. Here, Φ t stands for the (local) flow of the Hamiltonian vector field ℋ⃗. The exponential mapping \(\exp _{q_{0}}:\mathcal {D}_{q_{0}}\longrightarrow M\) is defined to be \(\exp _{q_{0}}(\lambda )=\pi \circ \Phi ^{1}(\lambda )\), where π : T ∗ M → M is the canonical projection. It is seen that if γ(t) is the Hamiltonian geodesic with initial condition \( \lambda \in T_{q_{0}}^{\ast }M\), then \(\gamma (t)=\exp _{q_{0}}(t\lambda )\). Now, we say that a point q is conjugate to q 0 if there exists a covector \(\lambda \in T_{q_{0}}^{\ast }M\) such that \(\exp _{q_{0}}(\lambda )=q\) and \(d_{\lambda }\exp _{q_{0}}\) is singular; in such a case, we say that q is conjugate to q 0 along the geodesic \( t\longrightarrow \exp _{q_{0}}(t\lambda )\). The future timelike (nonspacelike, null) conjugate locus of a point q 0 is defined to be the set of all points q that are conjugate to q 0 along the timelike (nonspacelike, null) f.d. Hamiltonian geodesics; it is denoted by \({\rm {Conj}}_{q_{0}}^{t}\) \(\left ({\rm {Conj}}_{0}^{nspc},{\rm {Conj}}_{0}^{{\rm {null}}}\right )\). Finally, by the future null cut locus of q 0, \({\rm {Cut}}_{q_{0}}^{{\rm {null}}}(M)\), we mean the set of points q ∈ M such that there exists a null f.d. (not necessarily Hamiltonian) geodesic γ : [0, T] → M having the following properties: γ(0) = q 0, γ(t 1) = q with 0 < t 1 < T, \(\gamma _{|[0,t_{1}]}\) is a length maximizer and \(\gamma _{|[0,t_{1}+\varepsilon ]}\) is not a length maximizer for an ε > 0, t 1 + ε ≤ T.
Now, let (H, g) be a sub-Lorentzian structure of type M 2, 2 defined on a normal neighborhood U of 0 ∈ ℝ3. Suppose that (H, g) is given by Eq. 1.1.
Below, we list some properties of exp0, \({\rm {Conj}}_{0}^{t}\), \({\rm {Conj}}_{0}^{{\rm {null}}}\), and \({\rm {Cut}}_{0}^{{\rm {null}}}(U)\). Proofs are omitted since they are similar to those found in [8,9].
1.1 A.1 Image Under the Exponential Mapping
Proposition 8.1
1.2 A.2 Conjugate Locus
Proposition 8.2
\(Conj_{0}^{null}\) is equal to the union of the two null f.d. Hamiltonian geodesics starting at 0, i.e., \(Conj_{0}^{null}=\left \{ y=\pm x\text {, } z=0\right \} \cap U\).
Note that in case W ≤ 0, the future timelike conjugate locus \( {\rm {Conj}}_{0}^{t} \) contains the two abnormal curves starting from 0 (because they are timelike and lie on ∂͂͂ J +(0, U)). These curves are unique maximizers.
1.3 A.3 Future Null Cut Locus
Proposition 8.3
Suppose that W > 0. Then, \(Cut_{0}^{null}(U)=\{0\}\).
Proposition 8.4
Suppose that W < 0. Then, \(Cut_{0}^{null}(U)=\tilde {\partial }J(0,U)\cap \left ( \Sigma _{1}\cup \Sigma _{2}\right ) \).
Proposition 8.5
Suppose that W = 0. Then, \(Cut_{0}^{null}(U)=\left ( \left \{ y=c_{1}x\text {, } z=0\right \} \cup \right .\) y = c 2 x, z = 0 ∩ U.
Note that \({\rm {Cut}}_{0}^{{\rm {null}}}(U)\cap {\rm {Conj}}_{0}^{t}\neq \emptyset \) in the case W = 0.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.
About this article
Cite this article
Grochowski, M. The Structure of Reachable Sets and Geometric Optimality of Singular Trajectories for Certain Affine Control Systems in ℝ3. The Sub-Lorentzian Approach. J Dyn Control Syst 20, 59–89 (2014). https://doi.org/10.1007/s10883-013-9202-7
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10883-013-9202-7
Keywords
- Sub-Lorentzian manifolds
- Geodesics
- Reachable sets
- Geometric optimality
- Vector distributions
- A Martinet surface