1 Introduction

1.1 Preliminaries

This paper is a continuation of the research started in [8,9] and devoted to the study on reachable sets for noncontact sub-Lorentzian structures on ℝ3, as well as for affine control systems induced by them. Similarly as in [8,9] our objective is to investigate the interrelation of the structure of reachable sets from a given point q 0 for the mentioned systems and geometric optimality of singular trajectories starting at q 0 or—speaking in the sub-Lorentzian language—geometric optimality of timelike abnormal curves starting at q 0 (a trajectory of a control system starting from a point q 0 is said to be geometrically optimal if it is contained in the boundary of the reachable set from q 0; cf. [1]). The paper is arranged in such a way that, in the first four sections, we develop the theory in the sub-Lorentzian setting, and Section 5 contains applications of the obtained results to control affine systems.

For all facts and notions from the sub-Lorentzian geometry, the reader is referred to the previous papers by the author (see [6] and its reference section; see also [10]). Here, we recall only those basic facts that are needed for stating the results. Let M be a smooth manifold, and let H be a smooth distribution on M of constant rank. For a point qM and an integer k ∈ ℕ, we define \(H_{q}^{k}\) to be the linear subspace in H q generated by all vectors of the form [X 1, [X 2,...,[X i−1, X i ]... ]](q), where X 1,...,X i are smooth (local) sections of H defined in a neighborhood of q, and ik. We say that H is a bracket generating distribution if for every qM, there exists an i = i(q) such that \(H_{q}^{i}\) = T q M. Now, by a sub-Lorentzian structure (or a sub-Lorentzian metric) on a manifold M, we mean a pair (H, g), where H is a bracket generating distribution on M, and g is a smooth Lorentzian metric on H. The triple (M, H, g) is called a sub-Lorentzian manifold. Take a point qM; a vector vH q is said to be timelike if g(v, v) < 0, nonspacelike if g(v, v) ≤ 0, and null if g(v, v) = 0 but v ≠ 0. A time orientation of (M, H, g) is, by definition, a continuous timelike vector field on M. Suppose (M, H, g) to be time-oriented by X and let vH q be a nonspacelike vector. We say that v is future directed if g(v, X(q)) < 0. A curve γ : [a, b] → M is called horizontal if it is absolutely continuous, γ̇ (t) ∈ H γ(t) a.e. on [a, b], and γ̇ is square integrable with respect to some Riemannian metric on M. From now on, all curves are supposed to be horizontal. We will also use the following abbreviations: t. for “timelike,” nspc. for “nonspacelike,” and f.d. for “future directed.” We say that a curve γ : [a, b] → M is t.f.d. (resp. nspc.f.d., null f.d.) if so is γ̇ (t) a.e. on [a, b]. Fix a point q 0M and its neighborhood U. The (future) timelike (resp. nonspacelike, null) reachable set from q 0 relative toU is defined to be the set of all points in U that can be reached from q 0 by a t.f.d. (resp. nspc. f.d., null f.d.) curve entirely contained in U. They are denoted respectively by I +(q 0, U), J +(q 0, U), and N +(q 0, U). In the general case, all we can say about reachable sets is that intI +(q 0, U) ≠ ∅, and that the three reachable sets have the same interiors and closures with respect to U. In order to be able to say something more, we need to make certain assumptions on U. To this end, let us notice that if U is sufficiently small, then our sub-Lorentzian metric can be extended to a Lorentzian metric, say , on U. So U is said to be a normal neighborhood of q 0 if it is a convex normal neighborhood of q 0 with respect to , and U is contained in some other convex normal neighborhood of q 0 with respect to (see [8] for a constructive definition of normal neighborhoods). Now, if U is a normal neighborhood of q 0, we know that J +(q 0, U) is closed with respect to U and moreover cl U (intI + (q 0, U)) = cl U (intN +(q 0, U)) = J + (q 0, U), where cl U stands for the closure with respect to U. Note that unlike the Lorentzian case, the boundary ∂͂ J +(q 0, U) (here and below, ∂͂ means the boundary with respect to U) may contain timelike curves starting from q 0. It can be proved [6] that such curves are abnormal curves for the underlaying distribution H (see [11] for a definition); they are also Goh curves (cf. [1]), but we do not need this latter fact in this paper. Let X 0,...,X k be an orthonormal frame for (H, g) defined on an open set U. We define the so-called geodesic (sub-Lorentzian) Hamiltonian ℋ : T U → ℝ, by formula ℋ(q, p) = \(-\frac {1}{2} \left \langle p,X_{0}(q)\right \rangle +\frac {1}{2}\sum _{i=1}^{k}\left \langle p,X_{i}(q)\right \rangle \) (it is possible to define ℋ in a global and invariant way; see [6]). Now, a curve γ : [a, b] → U is said to be a Hamiltonian geodesic if it can be represented as γ(t) = π ◦ Γ(t) where π : T MM is the canonical projection and Γ̇ = ℋ⃗, ℋ⃗ being the Hamiltonian vector field corresponding to the function ℋ. Hamiltonian geodesics do not change their causal character; moreover, null f.d. Hamiltonian geodesics are [6] locally geometrically optimal. Finally, let U be an open subset in (M, H, g), and suppose that φ : U → ℝ is a smooth function. The horizontal gradient of the function φ is defined to be the vector field ∇ H φ such that d q φ(v) = g( ∇ H φ(q), v) for every vH q , qU. One easily makes sure that if ∇ H φ is null f.d. on U and γ : [a, b] → U is t.f.d. (nspc.f.d.), then the function [a, b] ∋ tφ(γ(t)) is decreasing (nonincreasing).

1.2 Statement of the Results

In papers [5,7], we studied on the contact sub-Lorentzian structures on ℝ3. On the other hand, in [8,9] (generalized) Martinet sub-Lorentzian structures of Hamiltonian type of order k were studied, i.e., structures that, among other conditions imposed on them, are not contact on a hypersurface or, speaking in another way, structures whose Martinet surface is smooth. As a next step, it is reasonable to consider structures with the simplest non-smooth Martinet surface S, i.e., where S is a union of transversely intersecting smooth hypersurfaces. In order to formulate necessary assumptions, let us introduce a notion of the hyperbolic angle on a sub-Lorentzian manifold (M, H, g). Let v 1, v 2H q , qM, be t.f.d. vectors. The hyperbolic angle between v 1 and v 2 is the number ∢(v 1, v 2) ≥ 0 defined by

$$\cosh \sphericalangle (v_{1},v_{2})=-\frac{g(v_{1},v_{2})}{\left\| v_{1}\right\| \left\| v_{2}\right\| }\text{,}$$

where ∥v∥ = ∣g(v, v)∣1/2; by the reverse Schwarz inequality \(-\frac {g(v_{1},v_{2})}{\left \| v_{1}\right \| \left \| v_{2}\right \| }\geq 1\), so the definition makes sense. If L 1 = Span {v 1}, L 2 = Span {v 2} are 1-dimensional timelike linear subspaces in H q with v 1, v 2 being chosen to be t.f.d., then we put

$$\sphericalangle (L_{1},L_{2})=\sphericalangle (v_{1},v_{2})\text{.}$$

Now, we come to the precise definition of the type of sub-Lorentzian structures that we are going to consider in this paper. So let H be a bracket generating distribution of constant rank equal to 2, defined in a neighborhood U of 0 ∈ ℝ3. We say that H satisfies the condition (M 2, 2) if the following conditions are satisfied:

  1. (i)

    There exist smooth hypersurfaces S 1 and S 2 such that the intersection Γ = S 1S 2 is smooth of dimension 1, contains the origin, and is transverse to H; moreover, for each q ∈ Γ, dim (T q S i H q ) = 1, i = 1, 2;

  2. (ii)

    H defines a contact structure on U∖(S 1S 2);

  3. (iii)

    \(H_{q}^{2}\) = H q and \(H_{q}^{3}\) = T q R 3 whenever q ∈ (S 1S 2)∖Γ;

  4. (iv)

    \(H_{q}^{4}\) = T q R 3 whenever q ∈ Γ.

The set S = S 1S 2 will be called the Martinet surface for H. Note that S is foliated by abnormal curves for the distribution H. Next, we chose a Lorentzian metric g on H in the way similar as in [8,9]:

  1. (v)

    The field of directions S i qT q S i H q is timelike, i = 1, 2;

  2. (vi)

    The function S 1S 2q → ∢ (T q S 1H q , T q S 2H q ) is constant;

  3. (vii)

    The abnormal curves foliating S are, up to a change of parameter, t.f.d. Hamiltonian geodesics.

As we shall see, the two latter assumptions are used only in the process of constructing our normal form.

We will say that a sub-Lorentzian structure (H, g) is of type M 2, 2 if it satisfies conditions (i)–(vii). The sub-Lorentzian structure (H, g) is analytic if all objects entering its definition (e.g., the Martinet surface) are analytic.

Theorem 1.1

Let (H, g) be a time-oriented analytic sub-Lorentzian structure of type M 2, 2 defined on a neighborhood U of the origin in3. Then, provided that U is sufficiently small, there exist analytic coordinates x, y, z on U in which (H, g) has an orthonormal frame in the normal form

$$\left. \begin{array}{l} X=\dfrac{\partial }{\partial x}+y\varphi \left(y\dfrac{\partial }{\partial x}+x \dfrac{\partial }{\partial y}\right)+\dfrac{1}{2}y\left( y-c_{1}x\right) \left( y-c_{2}x\right) (1+\psi )\dfrac{\partial }{\partial z}\medskip \\ Y=\dfrac{\partial }{\partial y}-x\varphi \left(y\dfrac{\partial }{\partial x}+x \dfrac{\partial }{\partial y}\right)-\dfrac{1}{2}x\left( y-c_{1}x\right) \left( y-c_{2}x\right) (1+\psi )\dfrac{\partial }{\partial z} \end{array} \right. \text{,} $$
(1.1)

where X is a time orientation; c 1 and c 2 are constants such that − 1 < c 2 < c 1 < 1, S = {y = c 1 x} ∪ {y = c 2 x} is the Martinet surface for H; and φ and ψ are analytic functions on U, ψ(0, 0, z) = 0.

Using Theorem 1.1, we then investigate the structure of reachable sets. Let

$$W=W(c_{1},c_{2})=c_{1}c_{2}+2c_{1}-2c_{2}-1\text{.} $$
(1.2)

It is not difficult to see what the geometric interpretation of Eq. 1.2 is. Let α = ∢ (T q S 1H q , T q S 2H q ) for a q ∈ Γ; Eq. 1.1 implies that \(\cosh \alpha =\frac {1-c_{1}c_{2}}{\sqrt {1-c_{1}^{2}}\sqrt {1-c_{2}^{2}}} \) and at the same time \(\sinh \alpha =\frac {c_{1}-c_{2}}{\sqrt {1-c_{1}^{2}} \sqrt {1-c_{2}^{2}}}\). Thus,

$$\frac{W}{\sqrt{1-c_{1}^{2}}\sqrt{1-c_{2}^{2}}}=-\cosh \alpha +2\sinh \alpha$$

is an invariant for metrics of type M 2, 2. In particular, the sign of W has an invariant meaning and does not depend on the choice of coordinates. Note by the way that W > 0 ( W < 0, W = 0) if and only if \(\tanh \alpha > \frac {1}{2}\) (\(\tanh \alpha <\frac {1}{2}\), \(\tanh \alpha =\frac {1}{2}\)). As we are about to see, the sign of W is determinative for the structure of reachable sets. More precisely, we will prove two theorems.

Theorem 1.2

Suppose that (H, g) is a sub-Lorentzian structure defined on a suitably small normal neighborhood U of the origin by an orthonormal frame X, Y in the normal form (1.1) with X being a time orientation. Then, if W(c 1, c 2) > 0, there exist two analytic functions η 1, η 2 : U → ℝ such that the reachable sets from the origin for (H, g) are of the following form:

$$J^{+}(0,U)=N^{+}(0,U)=A_{1}\cup A_{2}\text{,}$$
$$I^{+}(0,U)=\mathrm{int}͂\left( A_{1}\cup A_{2}\right) \text{,}$$

where

$$\begin{array}{*{20}l} A_{1}=\left\{ (x,y,z)\in U:\text{ }\eta _{1}(x,y,z)\leq 0\right\} \cap \left\{ x\geq 0\text{, }z\geq 0\right\},\\ A_{2}=\left\{ (x,y,z)\in U:\text{ }\eta _{2}(x,y,z)\leq 0\right\} \cap \left\{ x\geq 0\text{, }z\leq 0\right\}.\\ \end{array}$$

In particular, the three reachable sets are semi-analytic.

Note that in cases covered by Theorem 1.2, there are no timelike curves in the boundary ∂͂ J +(0, U), and the structure of reachable sets is the same as in the contact case.

Theorem 1.3

Suppose that (H, g) is a sub-Lorentzian structure defined on a suitably small normal neighborhood U of the origin by an orthonormal frame X, Y in the normal form (1.1) with X being a time orientation. Then, if W(c 1, c 2) < 0, there exist six analytic functions η 1, η 2, ξ ij : U → ℝ, i, j = 1, 2, and two 2-dimensional semi-analytic sets Σ 1, Σ 2 with the property that U ∩ {x ≥ 0} ∩ {z ≥ 0} ∩ {c 2 xyx}∖Σ 1 has two connected components \(\Sigma _{1}^{+}\), \(\Sigma _{1}^{-}\), and U ∩ {x ≥ 0} ∩ {z ≤ 0} ∩ {−xyc 1 x}∖Σ 2 has two connected components \( \Sigma _{2}^{+}\), \(\Sigma _{2}^{-}\), such that

$$J^{+}(0,U)=A_{1}\cup ...\cup A_{6}\text{,}$$
$$I^{+}(0,U)=\mathrm{int}\left( A_{1}\cup ...\cup A_{6}\right) \cup A_{7}\cup A_{8}\text{,}$$
$$N^{+}(0,U)=\mathrm{int}\left( A_{1}\cup ...\cup A_{6}\right) \cup \left( \left\{ \eta _{1}=0\right\} \cap \Sigma _{1}^{+}\right) \cup \left( \left\{ \eta _{2}=0\right\} \cap \Sigma _{2}^{-}\right)$$

where

$$\begin{array}{@{}rcl@{}} A_{1}&=&\left\{ (x,y,z)\in U:\text{ }\eta _{1}(x,y,z)\leq 0\right\} \cap \Sigma _{1}^{+},\\ A_{2}&=&\left\{ (x,y,z)\in U:\text{ }\xi _{11}(x,y,z)\leq 0\right\} \cap \Sigma _{1}^{-},\\ A_{3}&=&\left\{ (x,y,z)\in U:\text{ }\xi _{12}(x,y,z)\leq 0\right\} \cap \left\{ -x\leq y\leq c_{2}x\right\} \cap \left\{ z\geq 0\right\} ,\\ A_{4}&=&\left\{ (x,y,z)\in U:\text{ }\eta _{2}(x,y,z)\leq 0\right\} \cap \Sigma _{2}^{-}, \\ A_{5}&=&\left\{ (x,y,z)\in U:\text{ }\xi _{21}(x,y,z)\leq 0\right\} \cap \Sigma _{2}^{+}, \\ A_{6}&=&\left\{ (x,y,z)\in U:\text{ }\xi _{22}(x,y,z)\leq 0\right\} \cap \left\{ c_{1}x\leq y\leq x\right\} \cap \left\{ z\leq 0\right\} ,\\ A_{7}&=&\left\{ (x,y,z)\in U:\text{ }y=c_{1}x\text{, }x\geq 0\text{, } z=0\right\} ,\\ A_{8}&=&\left\{ (x,y,z)\in U:\text{ }y=c_{2}x\text{, }x\geq 0\text{, } z=0\right\} .\\ \end{array}$$

In particular, the three reachable sets are semi-analytic.

Note that in cases covered by Theorem 1.3, there are two timelike curves on the boundary ∂͂ J +(0, U). It is also seen that in such cases, neither I +(0, U) is open nor N +(0, U) is closed.

The structure of all geometrically optimal curves in cases treated in Theorems 1.2 and 1.3 is described in Section 2. Let us notice that proofs of Theorems 1.2 and 1.3 give a sort of “algorithm” for computing the reachable sets. Moreover, the presented results bear a geometric character, so having proved Theorems 1.2 and 1.3 and applying a remark similar to Remark 4.4 in [9], we no longer have to transform our structure to the normal form in order to compute reachable sets.

Similarly as it was done in some previous papers by the author, all the above results can be applied to the study on reachable sets for control affine systems induced by sub-Lorentzian metrics of type M 2, 2. Let

$$\dot{q}=X+uY\text{, \ \ }u\in \lbrack a,b]\text{,} $$
(1.3)

be a control affine system defined on M, where M is an open subset of ℝ3 (or a 3-manifold). We suppose the fields X, Y to be linearly independent. Fix a point q 0M and its neighborhood UM. We will consider the reachable set 𝒜[a, b](q 0, U) from q 0 for the system (1.3) which is defined to be the set of endpoints of all trajectories of Eq. 1.3 that are contained in U, initiate at q 0, and are generated by measurable controls u : [0, T] → [a, b]; here, T = T(u) depends on a control. It was proved in [9], Lemma 1.2, that the reachable set 𝒜[a, b](q 0, U) is equal to the future nonspacelike reachable set for the time-oriented sub-Lorentzian structure (H, g a, b) defined on M by declaring the frame

$$\left. \begin{array}{l} Z^{a,b}=X+\frac{1}{2}(b+a)Y\medskip \\ W^{a,b}=\frac{1}{2}(b-a)Y \end{array} \right.$$

to be orthonormal with a time orientation Z a, b (in [9], we studied also on reachable sets 𝒜(a, b)(q 0, U) and 𝒜{a, b}(q 0, U)). Let us recall a notion of singular trajectories (cf. [4]) for affine control systems. So, a trajectory γ : [0, T] → [U of Eq. 1.3 is called a singular trajectory if it is generated by a control u(t) with values in the open interval (a, b) and is an abnormal curve for the distribution H. It is worth noting that singular trajectories satisfy necessary conditions for geometric optimality of Pontriagin maximum principle. In terms of the sub-Lorentzian metric (H, g a, b), singular trajectories are exactly timelike abnormal curves.

Now, suppose that X, Y is an orthonormal frame for a metric (H, g) of type M 2, 2. We say that the system = X + uY, u ∈ [a, b], is typical in class M 2, 2 if \(W\left (\frac {2c_{1}-b-a}{b-a},\frac { 2c_{2}-b-a}{b-a}\right )\neq 0\) (cf. 5.9), where c 1 and c 2 are constants obtained by transforming (H, g) to normal form (1.1). Then, we can prove the following:

Theorem 1.4

Suppose that the system (1.3) is typical in class M 2, 2. If it has k geometrically optimal singular trajectories initiating at q 0 (in our case, k = 0, 1, 2), then the minimal number of analytic functions that one needs to describe the reachable set 𝒜[a, b](q 0, U) is 2 + 2k.

The above theorem also holds for all cases treated in [9]. Thus, the presence on the boundary ∂͂ 𝒜[a, b](q 0, U) of a singular trajectory initiating at q 0 increases (at least in the described cases) by two the minimal number of analytic functions needed for describing the reachable set 𝒜[a, b](q 0, U). It would be interesting to know whether this observation can be extended to a more general class of (not necessarily affine) control systems.

1.3 Organizations of the Paper

Section 2 is devoted to computing reachable sets for the so-called flat structures—they correspond to normal form (1.1) with φ and ψ set to be equal to zero. In Section 3, we compute normal forms. More precisely, we prove Theorem 3.1 which gives normal forms in a more general situation than that treated in the present paper and which can be a starting point for further studies. Theorem 1.1 is then a corollary of Theorem 3.1. In Section 4, we generalize global results from Section 2 to local results in a general (i.e., nonflat) situation of type M 2, 2 in cases where W(c 1, c 2) ≠ 0. In Section 5, we apply the results obtained for sub-Lorentzian structures to control affine systems. Proofs of the lemmas from Section 2.2 are contained in Section 6. Section 7 contains 3-dimensional visualizations of examples of reachable sets studied in Section 2. In the Appendix, we state some corollaries concerning the image under exponential mapping and also conjugate and cut loci.

Some proofs of the results presented in the paper are omitted since they are similar to those from [9].

2 Reachable Sets in the Flat Case

In this section, we study on reachable sets from the origin for the sub-Lorentzian structure (Ĥ, ĝ) defined by an orthonormal basis

$$\left. \begin{array}{l} \hat{X}=\dfrac{\partial }{\partial x}+\dfrac{1}{2}y(y-c_{1}x)(y-c_{2}x) \dfrac{\partial }{\partial z} \\ \hat{Y}=\dfrac{\partial }{\partial y}-\dfrac{1}{2}x(y-c_{1}x)(y-c_{2}x) \dfrac{\partial }{\partial z} \end{array} \right. $$
(2.1)

with a time orientation where we assume that −1 < c 2 < c 1 < 1. Let S i = {(x, y, z) : y = c i x}. We see that the Martinet surface S in our case is equal to S 1S 2. The structure (or a metric) (Ĥ, ĝ) will be called flat. This is because Eq. 2.1 is a particular case of Eq. 1.1 where φ and ψ has been set to zero. Hence, any structure as in Eq. 1.1 can be regarded as a perturbation of the flat structure; see Section 4 for some applications of this observation.

Reachable sets from the origin for Eq. 2.1 will be denoted respectively by Ĵ +(0) = J +(0, ℝ3), Î +(0) = I +(0, ℝ3), and +(0) = N +(0, ℝ3). First of all, it is obvious that

$$\hat{I}^{+}(0)\subset \left\{ -x<y<x\text{, }x>0\right\}$$

and

$$\hat{J}^{+}(0)\subset \left\{ -x\leq y\leq x\text{, }x\geq 0\right\} \text{.}$$

As in the previous papers by the author, the key role in the process of constructing functions describing reachable sets is played by the signs of the z-coordinates of the fields

$$\hat{X}-\hat{Y}=\dfrac{\partial }{\partial x}-\dfrac{\partial }{\partial y}+ \dfrac{1}{2}(y+x)(y-c_{1}x)(y-c_{2}x)\dfrac{\partial }{\partial z}\text{,} $$
(2.2)
$$\hat{X}+\hat{Y}=\dfrac{\partial }{\partial x}+\dfrac{\partial }{\partial y}+ \dfrac{1}{2}(y-x)(y-c_{1}x)(y-c_{2}x)\dfrac{\partial }{\partial z}\text{.} $$
(2.3)

Let

$$\Gamma _{1}=\left\{ (x,x,z):\;x,z\in \mathbb{R}\right\}$$

and

$$\Gamma _{2}=\left\{ (x,-x,z):\;x,z\in \mathbb{R}\right\} \text{.}$$

Since (Ŷ) (z) > 0 on Γ1 and ( + Ŷ) (z) < 0 on Γ2, similarly as in [79], we construct two functions η̂ 1 and η̂ 2. η̂ 1 is the solution to the Cauchy problem

$$(\hat{X}-\hat{Y})(\eta )=0\text{, \ \ }\eta |_{\Gamma _{1}}(x,x,z)=z\text{,}$$

and η̂ 2 is the solution to the Cauchy problem

$$\left(\hat{X}+\hat{Y}\right)(\eta )=0\text{, \ \ }\eta |_{\Gamma _{2}}(x,-x,z)=-z\text{.}$$

After calculations, we obtain

$$\begin{array}{@{}rcl@{}} \hat{\eta}_{1}(x,y,z)&=&z-\frac{x^{2}-y^{2}}{48}\left(\left(7c_{1}c_{2}-2c_{1}-2c_{2}+1\right) x^{2}+4\left(c_{1}c_{2}-2c_{1}-2c_{2}+1\right) xy \right.\\ &&{\kern5.4pc} \left.+\left(c_{1}c_{2}-2c_{2}-2c_{1}+7\right) y^{2}\right) \end{array}$$

and

$$\begin{array}{@{}rcl@{}} \hat{\eta}_{2}(x,y,z)&=&-z-\frac{x^{2}-y^{2}}{48}\left( \left( 7c_{1}c_{2}+2c_{1}+2c_{2}+1\right) x^{2}-4\left( c_{1}c_{2}+2c_{1}+2c_{2}+1\right) xy \right.\\ &&{\kern5pc} \left.+\left( c_{1}c_{2}+2c_{2}+2c_{1}+7\right) y^{2}\right) \text{.} \end{array}$$

As in the previous papers, we need to know the horizontal gradient ∇ Ĥ η̂ i of the function η̂ i with respect to (Ĥ, ĝ), i = 1, 2. Clearly, ∇ Ĥ η̂ i = − (η̂ 1)(Ŷ) where

$$\begin{array}{@{}rcl@{}} \hat{X}(\hat{\eta}_{1})&=&-\frac{1}{12}\left( x-y\right) \left( \left( 7c_{1}c_{2}-2c_{1}-2c_{2}+1\right) x^{2}+4\left( c_{1}c_{2}-2c_{1}-2c_{2}+1\right) xy \right. \notag\\ &&{\kern5pc} \left.+\left( c_{1}c_{2}-2c_{1}-2c_{2}+7\right) y^{2}\right) \text{,} \end{array}$$
(2.4)

and ∇ Ĥ η̂ 2 = − (η̂ 2) ( + Ŷ) where

$$\begin{array}{@{}rcl@{}} \hat{X}(\hat{\eta}_{2})&=&-\frac{1}{12}\left(x+y\right) \left( \left(7c_{1}c_{2}+2c_{1}+2c_{2}+1\right) x^{2}-4\left( c_{1}c_{2}+2c_{1}+2c_{2}+1\right) xy\right. \notag\\ &&{\kern5pc}\left.+\left( c_{1}c_{2}+2c_{1}+2c_{2}+7\right) y^{2}\right) \text{.} \end{array}$$
(2.5)

Recall that in (1.2) in Section 1, we defined the polynomial W = c 1 c 2 + 2c 1 − 2c 2 − 1. As it was announced, the sign of W determines the structure of the reachable set for Eq. 2.1. Indeed, it is easy to check that

$$\hat{X}(\hat{\eta}_{1})=-\frac{1}{12}\left( x-y\right) \left( c_{1}c_{2}-2c_{1}-2c_{2}+7\right) \left( y-E_{1}x\right) \left( y-E_{2}x\right) $$
(2.6)

with

$$\left. \begin{array}{c} E_{1}=\frac{1}{c_{1}c_{2}-2c_{1}-2c_{2}+7}\left( -2c_{1}c_{2}+4c_{1}+4c_{2}-2+\sqrt{-3\left( c_{1}c_{2}-2c_{1}+2c_{2}-1\right) W}\right) \\ E_{2}=\frac{1}{c_{1}c_{2}-2c_{1}-2c_{2}+7}\left( -2c_{1}c_{2}+4c_{1}+4c_{2}-2-\sqrt{-3\left( c_{1}c_{2}-2c_{1}+2c_{2}-1\right) W}\right) \end{array} \right. $$
(2.7)

and

$$\hat{X}(\hat{\eta}_{2})=-\frac{1}{12}\left( x+y\right) \left( c_{1}c_{2}+2c_{1}+2c_{2}+7\right) \left( y-E_{3}x\right) \left( y-E_{4}x\right) $$
(2.8)

with

$$\left. \begin{array}{c} E_{3}=\frac{1}{c_{1}c_{2}+2c_{1}+2c_{2}+7}\left( 2c_{1}c_{2}+4c_{1}+4c_{2}+2+ \sqrt{-3\left( c_{1}c_{2}-2c_{1}+2c_{2}-1\right) W}\right) \\ E_{4}=\frac{1}{c_{1}c_{2}+2c_{1}+2c_{2}+7}\left( 2c_{1}c_{2}+4c_{1}+4c_{2}+2- \sqrt{-3\left( c_{1}c_{2}-2c_{1}+2c_{2}-1\right) W}\right) \end{array} \right. \text{.} $$
(2.9)

Let us notice here that c 1 c 2 − 2c 1 + 2c 2 − 1 = (c 1 c 2 − 1) − 2 (c 1c 2) < 0 for all c 1 and c 2 such that − 1 < c 2 < c 1 < 1.

2.1 The Case W < 0

This case is the simplest because, as it can be seen from Eqs. 2.6 and 2.8, ∇ Ĥ η̂ i , i = 1, 2 is null f.d. in the whole sector {−x < y < x}. Thus, using similar arguments as, e.g., in [7] or [8], we have

Proposition 2.1

If W < 0 then

$$\hat{J}^{+}(0)=\hat{N}^{+}(0)=\hat{A}_{1}\cup \hat{A}_{2}$$

and

$$\hat{I}^{+}(0)=\mathrm{int}(\hat{A}_{1}\cup \hat{A}_{2})\text{,}$$

where

$$\begin{array}{*{20}l} \hat{A}_{1}=\left\{ (x,y,z)\in \mathbb{R}^{3}:\text{ }\hat{\eta} _{1}(x,y,z)\leq 0\right\} \cap \left\{ x\geq 0\text{, }z\geq 0\right\},\\ \hat{A}_{2}=\left\{ (x,y,z)\in \mathbb{R}^{3}:\text{ }\hat{\eta} _{2}(x,y,z)\leq 0\right\} \cap \left\{ x\geq 0\text{, }z\leq 0\right\}. \end{array}$$

Let us remark that

$$\begin{array}{*{20}l} \hat{\eta}_{1}(x,c_{1}x,0)&=\frac{1}{48}x^{4}\left( c_{1}-1\right) ^{2}\left( c_{1}+1\right) ^{2}\left( c_{1}c_{2}-2c_{1}+2c_{2}-1\right) <0,\\ \hat{\eta}_{1}(x,c_{2}x,0)&=\frac{1}{48}x^{4}\left( c_{2}-1\right) ^{2}\left( c_{2}+1\right) ^{2}W<0,\\ \hat{\eta}_{2}(x,c_{1}x,0)&=\frac{1}{48}x^{4}\left( c_{1}-1\right) ^{2}\left( c_{1}+1\right) ^{2}W<0, \text{and} \\ \hat{\eta}_{2}(x,c_{2}x,0)&=\frac{1}{48}x^{4}\left( c_{2}-1\right) ^{2}\left( c_{2}+1\right) ^{2}\left( c_{1}c_{2}-2c_{1}+2c_{2}-1\right) <0. \end{array}$$

This means that there are no geometrically optimal timelike (and hence abnormal) curves starting from 0, and in what follows, only two functions suffice to describe the reachable sets.

To visualize better how the reachable set looks like, let us list all geometrically optimal curves. To this end, introduce the following notation. If Z 1, Z 2 are two vector fields on ℝ3, then by Z 1 Z 2, we will mean the curve which is a concatenation of a segment of the trajectory of Z 1 starting from 0, and a segment of a trajectory of Z 2. Using such a notation, every geometrically optimal curve is either ( + Ŷ) (Ŷ) or (Ŷ) ( + Ŷ). The intersection of ∂Ĵ +(0) with the plane {x = const > 0} is schematically presented in Fig. 1.

Fig. 1
figure 1

The set ∂Ĵ + (0) ∩ {x = const > 0} in the case W < 0

Points A and B lie on the plane {z = 0}. A (resp. B) corresponds to half-line {y = −x, z = 0} (resp. {y = −x, z = 0} ). The curve BA located above the straight line joining A and B represents ∂Ĵ +(0) ∩ {z ≥ 0} and is formed by the trajectories ( + Ŷ) (Ŷ); the curve AB located below this straight line represents ∂Ĵ +(0) ∩ {z ≤ 0} and is formed by the trajectories (Ŷ) ( + Ŷ).

2.2 The Case W > 0

In order to simplify reading of this subsection, proofs of all lemmas are moved to Section 6.

This case is more complicated since now E i , i = 1,...,4 are real. First of all, we must examine constants E 1,..., E 4. Obviously, E 2 < E 1 and E 4 < E 3. Moreover, we have two lemmas.

Lemma 2.1

The following inequalities hold true: − 1 < E 2 < E 1 < 1, E i < c i , i = 1, 2.

Lemma 2.2

The following inequalities hold true: − 1 < E 4 < E 3 < 1, c 2 < E 4, c 1 < E 3.

Using Eqs. 2.6 and 2.8, we conclude that ∇ Ĥ η̂ 1 is null f.d. on {E 1 x < y < x}, while ∇ Ĥ η̂ 2 is null f.d. on {−x < y < E 4 x}. Hence, we need more functions to describe the reachable sets from the origin.

First, we will compute Ĵ +(0) ∩ {z ≥ 0} . Since ( + Ŷ) (z) > 0 for y > c 2 x, and (Ŷ) (z) > 0 for y < c 2 x, everything in a neighborhood of the plane {y = c 2 x}, it is natural to consider the following Cauchy problems:

$$(\hat{X}+\hat{Y})(\eta )=0\text{, \ }\eta (x,c_{2}x,z)=z$$

with the solution equal to

$$\hat{\xi}_{11}(x,y,z)=z-\frac{1}{12}\frac{\left( x-y\right) \left( y-c_{2}x\right) ^{2}}{\left( 1-c_{2}\right) ^{2}}\left( \left( -2c_{1}c_{2}+3c_{1}-c_{2}\right) x+\left( -c_{1}+3c_{2}-2\right) y\right) \text{,}$$

and

$$(\hat{X}-\hat{Y})(\eta )=0\text{, \ \ }\eta (x,c_{2}x,z)=z$$

with the solution equal to

$$\hat{\xi}_{12}(x,y,z)=z-\frac{1}{12}\frac{\left( x+y\right) \left( c_{2}x-y\right) ^{2}}{\left( c_{2}+1\right) ^{2}}\left( \left( 2c_{1}c_{2}+3c_{1}-c_{2}\right) x+\left( c_{1}-3c_{2}-2\right) y\right) \text{.}$$

Now, we examine horizontal gradients ∇ Ĥ ξ̂ 1i , i = 1, 2. So ∇ Ĥ ξ̂ 11 = − (ξ̂ 11)( + Ŷ) with

$$\hat{X}(\hat{\xi}_{11})=\frac{\left( y-c_{2}x\right) ^{2}}{3\left( c_{2}-1\right) ^{2}}\left( \left( 2c_{1}c_{2}-3c_{1}+c_{2}\right) x+\left( c_{1}-3c_{2}+2\right) y\right) $$
(2.10)

and ∇ Ĥ ξ̂ 12 = − (ξ̂ 12)(Ŷ) with

$$\hat{X}(\hat{\xi}_{12})=-\frac{\left( y-c_{2}x\right) ^{2}}{3\left( c_{2}+1\right) ^{2}}\left( \left( 2c_{1}c_{2}+3c_{1}-c_{2}\right) x+\left( c_{1}-3c_{2}-2\right) y\right) \text{.} $$
(2.11)

Using Eq. 2.11, it is easy to see that ∇ Ĥ ξ̂ 12 is null f.d. in {−x < y < c 2 x}. Indeed, if c 1 − 3c 2 − 2 ≥ 0, then we have (2c 1 c 2 + 3c 1c 2) x + c 1 − 3c 2 − 2 y ≥ 2 (c 2 + 1) (c 1 + 1) x > 0. If, on the other hand, c 1 − 3c 2 − 2 < 0, then (2c 1 c 2 + 3c 1c 2) x + (c 1 − 3c 2 − 2) y > (2c 1 c 2 + 3c 1c 2) x + c 2(c 1 − 3c 2 − 2) x = 3 (c 2 + 1) (c 1c 2) x > 0. Also, since c 1 − 3c 2 + 2 = c 1c 2 + 2 (1 − c 2) > 0, we see that ∇ Ĥ ξ̂ 11 is null f.d. for \(y<-\frac {2c_{1}c_{2}-3c_{1}+c_{2}}{ c_{1}-3c_{2}+2}x\).

Lemma 2.3

\(c_{1}<-\frac {2c_{1}c_{2}-3c_{1}+c_{2}}{c_{1}-3c_{2}+2}<1.\)

Now, by Lemmas 2.1 and 2.3, it is clear that \(E_{1}<-\frac { 2c_{1}c_{2}+c_{2}-3c_{1}}{c_{1}+2-3c_{2}}\). We will compute the intersection \(\left \{ \hat {\eta }_{1}=0\right \} \cap \left \{ \hat {\xi }_{11}=0\right \} \cap \left \{ E_{1}x<y<-\frac {2c_{1}c_{2}+c_{2}-3c_{1}}{c_{1}+2-3c_{2}}x\right \}. \) Evidently,

$$\hat{\xi}_{11}\left( x,c_{2}x,z\right) -\hat{\eta}_{1}\left( x,c_{2}x,z\right) =-\frac{1}{48}x^{4}\left( c_{2}-1\right) ^{2}\left( c_{2}+1\right) ^{2}W<0\text{.} $$
(2.12)

On the other hand,

$$\begin{array}{@{}rcl@{}} &&\hat{\xi}_{11}\left( x,c_{1}x,z\right) -\hat{\eta}_{1}\left( x,c_{1}x,z\right) =-\frac{1}{48}\frac{\left( c_{1}-1\right) ^{2}}{\left( c_{2}-1\right) ^{2}}x^{4} \notag\\ &&\left( 4\left( c_{1}-c_{2}\right) ^{3}+\left( c_{2}-1\right) ^{2}\left( c_{1}+1\right) ^{2}\left( c_{1}c_{2}-2c_{1}+2c_{2}-1\right) \right) \text{.} \end{array}$$
(2.13)

We need the following:

Lemma 2.4

Let f (x, y) = 4 (xy)3 + (y − 1)2 (x + 1)2 (xy − 2x + 2y − 1 ) be a function considered on the set D = {(x, y) : −1 < y < x < 1}. Then f < 0 on D.

Lemma 2.4 and Eq. 2.13 give

$$\hat{\xi}_{11}\left( x,c_{1}x,z\right) -\hat{\eta}_{1}\left( x,c_{1}x,z\right) >0\text{.} $$
(2.14)

Moreover, (X + Y)(ξ̂ 11η̂ 1) = − (X + Y)(η̂ 1) < 0 on {c 2 x < y < E 1 x}, and (X + Y) (ξ̂ 11η̂ 1) > 0 on {E 1 x < y < x}. Now, let us sum up what we already know. By Eq. 2.12, the expression ξ̂ 1η̂ 1 (which is a homogeneous polynomial in x, y) is negative on y = c 2 x and decreases along the trajectories of X + Y in {c 2 x < y < E 1 x}. Then, ξ̂ 11η̂ 1 starts to increase and for y = c 1 x, it attains a positive value by Eq. 2.14. It follows that \(\left \{ \hat {\eta }_{1}=0\right \} \cap \left \{ \hat {\xi } _{11}=0\right \} \cap \left \{ E_{1}x<y<-\frac {2c_{1}c_{2}+c_{2}-3c_{1}}{ c_{1}+2-3c_{2}}x\right \} \) is of the form y = S 1 x with E 1 < S 1 < c 1. In this way, we arrive at the following:

$$\hat{J}^{+}(0)\cap \left\{ z\geq 0\right\} =\hat{A}_{1}\cup \hat{A}_{2}\cup \hat{A}_{3}$$

where

$$\hat{A}_{1}=\left\{ \hat{\eta}_{1}\leq 0\right\} \cap \left\{ S_{1}x\leq y\leq x\right\} \cap \left\{ z\geq 0\right\} \text{,} $$
(2.15)
$$\hat{A}_{2}=\left\{ \hat{\xi}_{11}\leq 0\right\} \cap \left\{ c_{2}x\leq y\leq S_{1}x\right\} \cap \left\{ z\geq 0\right\} \text{,} $$
(2.16)
$$\hat{A}_{3}=\left\{ \hat{\xi}_{12}\leq 0\right\} \cap \left\{ -x\leq y\leq c_{2}x\right\} \cap \left\{ z\geq 0\right\} \text{.} $$
(2.17)

The signs of z-coordinates of the fields + Ŷ, Ŷ needed in the computation of ∂Ĵ +(0) ∩ {z > 0} are schematically illustrated in Fig. 2. Arrows pointing up correspond to the field + Ŷ, while those pointing down correspond to Ŷ.

Fig. 2
figure 2

The case W > 0. Lines on the plane {z = 0} along which the fields + Ŷ, Ŷ point toward the positive direction of the z-axis

Now, we examine Ĵ +(0) ∩ {z ≤ 0}. Let us consider two Cauchy problems:

$$(\hat{X}-\hat{Y})(\eta )=0\text{, \ }\eta (x,c_{1}x,z)=-z$$

with the solution equal to

$$\hat{\xi}_{21}(x,y,z)=-z-\frac{1}{12}\frac{\left( x+y\right) \left( c_{1}x-y\right) ^{2}}{\left( 1+c_{1}\right) ^{2}}\left( \left( c_{1}-2c_{1}c_{2}-3c_{2}\right) x+\left( 3c_{1}-c_{2}+2\right) y\right) \text{,}$$

and

$$(\hat{X}+\hat{Y})(\eta )=0\text{, \ }\eta (x,c_{1}x,z)=-z$$

with the solution equal to

$$\hat{\xi}_{22}(x,y,z)=-z+\frac{1}{12}\frac{\left( x-y\right) \left( y-c_{1}x\right) ^{2}}{\left( 1-c_{1}\right) ^{2}}\left( \left( -2c_{1}c_{2}-c_{1}+3c_{2}\right) x+\left( 3c_{1}-c_{2}-2\right) y\right) \text{.}$$

As above, we need to know the regions where horizontal gradients ∇ Ĥ ξ̂ 2i , i = 1, 2 are suitably directed. After calculations,

$$\hat{X}(\hat{\xi}_{21})=\frac{\left( c_{1}x-y\right) ^{2}}{3\left( c_{1}+1\right) ^{2}}\left( \left( 2c_{1}c_{2}-c_{1}+3c_{2}\right) x+\left( -3c_{1}+c_{2}-2\right) y\right) $$
(2.18)

and

$$\hat{X}(\hat{\xi}_{22})=\frac{\left( y-c_{1}x\right) ^{2}}{3\left( 1-c_{1}\right) ^{2}}\left( \left( -2c_{1}c_{2}-c_{1}+3c_{2}\right) x+\left( 3c_{1}-c_{2}-2\right) y\right) \text{.} $$
(2.19)

Equation 2.19 immediately yields that ∇ Ĥ ξ̂ 22 is null f.d. in {c 1 x < y < x}. Indeed, (−2c 1 c 2c 1 + 3c 2) x + (3c 1c 2 − 2) y < −2 (c 2 − 1) (c 1 − 1) x < 0 in this sector whenever 3c 1c 2 − 2 ≥ 0. On the other hand, if 3c 1c 2 − 2 < 0, then (−2c 1 c 2c 1 + 3c 2) x + (3c 1c 2 − 2) y < (−2c 1 c 2c 1 + 3c 2) x + c 1 (3c 1c 2 − 2) x = 3 (c 1 − 1) (c 1c 2) x < 0. Also by Eq. 2.18, we know that ∇ H ξ̂ 21 is null f.d. for \(y>-\frac { 2c_{1}c_{2}-c_{1}+3c_{2}}{-3c_{1}+c_{2}-2}x\). Indeed, it is enough to notice that −3c 1 + c 2 − 2 = − (c 1c 2) − 2 (c 1 + 1) < 0.

Lemma 2.5

\(-1<-\frac {2c_{1}c_{2}-c_{1}+3c_{2}}{-3c_{1}+c_{2}-2}<c_{2}.\)

Now, it follows that \(-\frac {2c_{1}c_{2}-c_{1}+3c_{2}}{-3c_{1}+c_{2}-2}<E_{4},\) and we will compute the intersection \(\left \{ \hat {\eta }_{2}=0\right \} \cap \left \{ \hat {\xi }_{21}=0\right \} \cap \left \{ -\frac { 2c_{1}c_{2}-c_{1}+3c_{2}}{-3c_{1}+c_{2}-2}x<y<E_{4}x\right \} \). We proceed similarly as above. So, first of all,

$$\hat{\xi}_{21}(x,c_{1}x,z)-\hat{\eta}_{2}(x,c_{1}x,z)=-\frac{1}{48} x^{4}\left( c_{1}-1\right) ^{2}\left( c_{1}+1\right) ^{2}W<0\text{.} $$
(2.20)

Next,

$$\begin{array}{@{}rcl@{}} &&\hat{\xi}_{21}(x,c_{2}x,z)-\hat{\eta}_{2}(x,c_{2}x,z)=\frac{\left( c_{2}+1\right) ^{2}}{48\left( c_{1}+1\right) ^{2}}x^{4} \notag\\ &&\left( 4\left(-c_{1}+c_{2}\right) ^{3}-\left( c_{1}+1\right) ^{2}\left( c_{2}-1\right) ^{2}\left( c_{1}c_{2}+2c_{2}-2c_{1}-1\right) \right) \text{.} \end{array}$$
(2.21)

Lemma 2.6

The function f(x, y) = 4 (−x + y)3 − (x + 1)2 (y − 1)2 (xy − 2x + 2y − 1) is positive on D = {(x, y) : −1 < y < x < 1}.

Lemma 2.6 and Eq. 2.21 give

$$\hat{\xi}_{21}(x,c_{2}x,z)-\hat{\eta}_{2}(x,c_{2}x,z)>0\text{.} $$
(2.22)

Now, Eqs. 2.20 and 2.22 imply that similarly as above, the set \( \left \{ \hat {\eta }_{2}=0\right \} \cap \left \{ \hat {\xi }_{21}=0\right \} \cap \left \{ -\frac {2c_{1}c_{2}-c_{1}+3c_{2}}{-3c_{1}+c_{2}-2}x<y<E_{4}x\right \} \) is of the form y = S 2 x with c 2 < S 2 < E 4. We deduce that

$$\hat{J}^{+}(0)\cap \left\{ z\leq 0\right\} =\hat{A}_{4}\cup \hat{A}_{5}\cup \hat{A}_{6}$$

where

$$\hat{A}_{4}=\left\{ \hat{\eta}_{2}\leq 0\right\} \cap \left\{ -x\leq y\leq S_{2}x\right\} \cap \left\{ z\leq 0\right\} \text{,} $$
(2.23)
$$\hat{A}_{5}=\left\{ \hat{\xi}_{21}\leq 0\right\} \cap \left\{ S_{2}x\leq y\leq c_{1}x\right\} \cap \left\{ z\leq 0\right\} \text{,} $$
(2.24)
$$\hat{A}_{6}=\left\{ \hat{\xi}_{22}\leq 0\right\} \cap \left\{ c_{1}x\leq y\leq x\right\} \cap \left\{ z\leq 0\right\} \text{.} $$
(2.25)

We conclude this section with the following:

Proposition 2.2

If W > 0, then

$$\hat{J}^{+}(0)=\hat{A}_{1}\cup ...\cup \hat{A}_{6}\text{,}$$
$$\hat{I}^{+}(0)=\mathrm{int}\left(\hat{A}_{1}\cup ...\cup \hat{A}_{6}\right)\cup A_{7}\cup A_{8}$$

and

$$N^{+}(0)=\mathrm{int}\left(\hat{A}_{1}\cup ...\cup \hat{A}_{6}\right)\cup \left( \left\{ \hat{\eta}_{1}=0\right\} \cap \left\{ S_{1}x\leq y\leq x\right\} \right) \cup \left(\left\{ \hat{\eta}_{2}=0\right\} \cap \left\{ -x\leq y\leq S_{2}x\right\} \right)$$

where  1,...,  6 are given by Eqs. 2.152.172.232.24, and 2.25, respectively, and A 7 = {y = c 1 x, z = 0, x ≥ 0}, A 8 = {y = c 2 x, z = 0, x ≥ 0}.

In this way, we see that there are two geometrically optimal timelike (abnormal) curves, and we need six analytic functions to describe reachable sets.

The signs of z-cordinates of the fields + Ŷ, Ŷ, needed in the computation of ∂Ĵ +(0) ∩ {z < 0}, are illustrated in Fig. 3. As above, arrows pointing up (resp. down) correspond to the field + Ŷ (resp.Ŷ).

Fig. 3
figure 3

The case W > 0. Lines on the plane {z = 0} along which the fields + Ŷ, Ŷ point toward the negative direction of the z-axis

Now, using the notation from the end of Section 2.1, we list all geometrically optimal curves in the case W > 0. They can be divided into two groups:

  • (i)  Those forming ∂Ĵ +(0) ∩ {z ≥ 0}, i.e. ( + Ŷ) (Ŷ), ( + c 2 Ŷ) ( + Ŷ), ( + c 2 Ŷ) (Ŷ); note that ( + Ŷ) (Ŷ) and ( + c 2 Ŷ) ( + Ŷ) cease to be geometrically optimal when they reach the plane {y = S 1 x} (then they enter the interior int Ĵ +(0));

  • (ii)  Those forming ∂Ĵ +(0) ∩ {z ≤ 0}, i.e. (Ŷ) ( + Ŷ), ( + c 1 Ŷ) (Ŷ), ( + c 1 Ŷ) ( + Ŷ); note that ( + c 1 Ŷ) (Ŷ) and (Ŷ) ( + Ŷ) cease to be optimal when they intersect the plane {y = S 2 x}. The intersection of the set ∂Ĵ +(0) with the plane {x = const > 0} is represented in Fig. 4.

Fig. 4
figure 4

The set ∂Ĵ + (0) ∩ {x = const > 0} in the case W > 0

The points A, B, C, D lie on the plane {z = 0}; A and B correspond to half-lines {y = − x, z = 0} and {y = x, z = 0}, while B and C to singular trajectories {y = c 2 x, z = 0} and {y = c 1 x, z = 0}, respectively. DE represents the surface formed by the trajectories ( + Ŷ) (Ŷ), BE is the surface made up of the trajectories ( + c 2 Ŷ) ( + Ŷ), and the point E corresponds to the set ∂Ĵ +(0) ∩ {y = S 1 x}, i.e., the place where the trajectories ( + Ŷ) (Ŷ) and ( + c 2 Ŷ) ( + Ŷ) meet. Finally, part BA of ∂Ĵ +(0) ∩ {z ≥ 0} is filled with trajectories ( + c 2 Ŷ) (Ŷ). Similarly, AF (resp. CF) is the surface formed by the trajectories (Ŷ) ( + Ŷ) (resp. ( + c 1 Ŷ) (Ŷ)), and CD corresponds to part of ∂Ĵ +(0) ∩ {z ≤ 0} which is made up of the trajectories ( + c 1 Ŷ) ( + Ŷ. The point F represents the set ∂Ĵ +(0) ∩ {y = S 2 x}, i.e., the place where the trajectories (Ŷ) ( + Ŷ) and ( + c 1 Ŷ) (Ŷ) meet.

2.3 The Case W = 0

Using Eqs. 2.62.9 and the condition W = 0, we see that in the case under consideration

$$E_{1}=E_{2}=-\frac{2c_{1}-1}{c_{1}-2}=c_{2}\text{,}$$
$$E_{3}=E_{4}=\frac{2c_{2}+1}{c_{2}+2}=c_{1}\text{,}$$
$$\hat{\eta}_{1}(x,y,z)=z+\frac{x^{2}-y^{2}}{12}\left( c_{1}-2\right) \left( y-c_{2}x\right) ^{2}\text{,}$$
$$\hat{\eta}_{2}(x,y,z)=-z-\frac{x^{2}-y^{2}}{12}\left( c_{2}+2\right) \left( y-c_{1}x\right) ^{2}$$

and in what follows

$$\hat{X}(\hat{\eta}_{1})=\frac{1}{3}\left( x-y\right) \left( c_{1}-2\right) \left( y+\frac{2c_{1}-1}{c_{1}-2}x\right) ^{2}=\frac{1}{3}\left( x-y\right) \left( c_{1}-2\right) \left( y-c_{2}x\right) ^{2}\text{,}$$
$$\hat{X}(\hat{\eta}_{2})=-\frac{1}{3}\left( x+y\right) \left( c_{2}+2\right) \left( y-\frac{2c_{2}+1}{c_{2}+2}x\right) ^{2}=-\frac{1}{3}\left( x+y\right) \left( c_{2}+2\right) \left( y-c_{1}x\right) ^{2}\text{.}$$

Thus, ∇ Ĥ η̂ 1 is null f.d. on {−x < y < x} ∩ {yc 2 x}, and ∇ Ĥ η̂ 2 is null f.d. on {−x < y < x} ∩ {yc 1 x}. We can also see that η̂ 1(x, c 2 x, 0) = η̂ 2(x, c 1 x, 0) = 0. Moreover, ∇ Ĥ ξ̂ 11 is null f.d. for \(y<-\frac {2c_{1}c_{2}-3c_{1}+c_{2}}{c_{1}-3c_{2}+2}x\), and as above, we make sure that Eq. 2.14 holds together with the following:

$$\hat{\xi}_{11}\left( x,c_{2}x,z\right) -\hat{\eta}_{1}\left( x,c_{2}x,z\right) =0\text{.} $$
(2.26)

Similar reasoning as in Section 2.2 shows that ξ̂ 11η̂ 1 is nondecreasing along trajectories of X + Y starting at y = c 2 x. It follows that ξ̂ 11η̂ 1, and in turn,

$$\left\{ \hat{\xi}_{11}\leq 0\right\} \cap \left\{ c_{2}x<y<x\right\} \subset \left\{ \hat{\eta}_{1}\leq 0\right\} \cap \left\{ c_{2}x<y<x\right\} \text{.}$$

Moreover,

$$\hat{\xi}_{12}(x,y,z)-\hat{\eta}_{1}(x,y,z)=-\frac{1}{48}\left( x+y\right) ^{4}\left( c_{2}-1\right) ^{2}\frac{W}{\left( c_{2}+1\right) ^{2}}=0$$

which is in fact clear without calculations, since both functions satisfy the same linear differential equation with the same boundary conditions on the hypersurface y = c 2 x. Similar reasoning shows that

$$\left\{ \hat{\xi}_{21}\leq 0\right\} \cap \left\{ -x<y<c_{1}x\right\} \subset \left\{ \hat{\eta}_{2}\leq 0\right\} \cap \left\{ -x<y<c_{1}x\right\}$$

and

$$\hat{\xi}_{22}(x,y,z)-\hat{\eta}_{2}(x,y,z)=-\frac{1}{48}\left( x-y\right) ^{4}\left( c_{1}+1\right) ^{2}\frac{W}{\left( c_{1}-1\right) ^{2}}=0\text{.}$$

We sum up this subsection with the following:

Proposition 2.3

If W = 0 then

$$\hat{J}^{+}(0)=\hat{N}^{+}(0)=\hat{A}_{1}\cup \hat{A}_{2}$$

and

$$\hat{I}^{+}(0)=\mathrm{int}(\hat{A}_{1}\cup \hat{A}_{2})\cup \left\{ y=c_{1}x \text{, }x>0\right\} \cup \left\{ y=c_{2}x\text{, }x>0\right\} \text{,}$$

where

$$\begin{array}{*{20}l} \hat{A}_{1}=\left\{ (x,y,z)\in \mathbb{R}^{3}:\text{ }\hat{\eta} _{1}(x,y,z)\leq 0\right\} \cap \left\{ x\geq 0\text{, }z\geq 0\right\},\\ \hat{A}_{2}=\left\{ (x,y,z)\in \mathbb{R}^{3}:\text{ }\hat{\eta} _{2}(x,y,z)\leq 0\right\} \cap \left\{ x\geq 0\text{, }z\leq 0\right\}.\\ \end{array}$$

As we see, this case is very exceptional as compared to the previous cases with W ≠ 0. Namely, in spite of the fact that there are two geometrically optimal timelike curves, only two analytic functions suffice for describing reachable sets.

We list all geometrically optimal curves in this case: the curves forming ∂Ĵ +(0) ∩ {z ≥ 0}, i.e., the curves ( + Ŷ) (Ŷ), ( + c 2 Ŷ) (Ŷ), and the curves forming ∂Ĵ +(0) ∩ {z ≤ 0}, i.e., ŶX̂ + Ŷ, + c 1 ŶX̂ + Ŷ. The set ∂Ĵ +(0) ∩ {x = const > 0} can be depicted similarly as in Fig. 4, but this time, the curves DB and BA correspond to trajectories of Ŷ, and the curves AC and CD correspond to trajectories of + Ŷ.

Three-dimensional visualizations of reachable sets studied in this section are presented in Section 7.

3 Normal Forms

In this section, we consider more general sub-Lorentzian structures (H, g) than those dealt with in Theorem 1.1. At first, we describe the underlaying distribution H. So let H be a rank 2 distribution defined on a neighborhood U of the origin in ℝ3, and let l 1,...,l k ≥ 2 be positive integers. H will be said to satisfy the condition \((M_{l_{1},...,l_{k}})\) if it possesses the following properties:

  • (i)  There exist smooth hypersurfaces S 1,...,S k in U, such that the intersection \(\Gamma =\bigcap _{i=1}^{k}S_{i}\) contains the origin, is smooth 1-dimensional, and transverse to H; moreover, for each q ∈ Γ and every i, j = 1,...,k, ij, dim (T q S i T q S j ) = 1, dim (T q S i H q ) = 1;

  • (ii)  H defines a contact structure on \(U\backslash \bigcup _{i=1}^{k}S_{i}\);

  • (iii)  For any fixed i = 1,...,k, \(H_{q}^{l}\subset H_{q}\), ll i , \(H_{q}^{l_{i}+1}=T_{q}\mathbb {R}^{3}\) on the set S i ∖Γ.

  • (iv)  \(H_{q}^{l}\subset H_{q}\), ll 1 + ... + l k k + 1, \(H_{q}^{l_{1}+...+l_{k}-k+2}=T_{q}\mathbb {R}^{3}\) for every q ∈ Γ.

Now, we choose a Lorentzian metric g. We make three assumptions:

  • (v)  For each i = 1,...,k, the field of directions S i qT q S i H q is timelike;

  • (vi)  For every i, j = 1,...,k, ij, the function S i S j q → ∢(T q S i H q , T q S j H q ) is constant; and

  • (vii)  The abnormal curves foliating the surfaces S 1,...,S k are, up to a change of parameter, t.f.d. Hamiltonian geodesics.

We will say that a sub-Lorentzian structure (or metric) (H, g) is of type \(M_{l_{1},...,l_{k}}\) on U if (i)–(vii) hold on U. The set \( S=\bigcup _{i=1}^{k}S_{i}\) is again called the Martinet surface for H. Note that structures of type M l are exactly generalized Martinet sub-Lorentzian structures studied in [9].

Our aim is to prove the following:

Theorem 3.1

Let (H, g) be a time-oriented analytic sub-Lorentzian structure of type \( M_{l_{1},...,l_{k}}\) defined on a neighborhood U of the origin in3. Then, provided that U is sufficiently small, there exist analytic coordinates x, y, z on U in which (H, g) has an orthonormal frame in the normal form

$$\left. \begin{array}{l} X=\dfrac{\partial }{\partial x}+y\varphi \left(y\dfrac{\partial }{\partial x}+x \dfrac{\partial }{\partial y}\right)+\dfrac{1}{2}y\left( y-c_{1}x\right) ^{l_{1}-1}...\left( y-c_{k}x\right) ^{l_{k}-1}(1+\psi )\dfrac{\partial }{ \partial z}\medskip \\ Y=\dfrac{\partial }{\partial y}-x\varphi \left(y\dfrac{\partial }{\partial x}+x \dfrac{\partial }{\partial y}\right)-\dfrac{1}{2}x\left( y-c_{1}x\right) ^{l_{1}-1}...\left( y-c_{k}x\right) ^{l_{k}-1}(1+\psi )\dfrac{\partial }{ \partial z} \end{array} \right. \text{,}$$

where X is a time orientation, c 1,...,c k are constants such that − 1 < c k < ... < c 1 < 1, S i = {y = c i x}, i = 1,...,k, and finally φ, ψ are analytic functions on U with ψ(0, 0, z) = 0.

We start with the following result.

Lemma 3.1

Suppose that (H, g) is analytic and satisfies the condition M 2,...,2 on a neighborhood U of the origin in3. Then, provided that U is sufficiently small, there are analytic coordinates x, y, z defined on U in which (H, g) admits an orthonormal frame in the following form

$$\left. \begin{array}{l} X=\dfrac{\partial }{\partial x}-yB\left(y\dfrac{\partial }{\partial x}+x\dfrac{ \partial }{\partial y}\right)-y\left( y-c_{1}x\right) ...\left( y-c_{k}x\right) A \dfrac{\partial }{\partial z}\medskip \\ Y=\dfrac{\partial }{\partial y}+xB\left(y\dfrac{\partial }{\partial x}+x\dfrac{ \partial }{\partial y}\right)+x\left( y-c_{1}x\right) ...\left( y-c_{k}x\right) A \dfrac{\partial }{\partial z} \end{array} \right. $$
(3.1)

with X being a time orientation. Here, A, B are analytic functions, and S i = {y = c i x}, i = 1,...,k.

Proof

Fix an i, i = 1,...,k. Choose analytic coordinates , ŷ, defined in a neighborhood of the origin such that S i = {ŷ = 0}, Γ = {(0, 0, )}, \(H_{|S_{i}}=\ker d \hat {z}\), and \(\frac {\partial }{\partial \hat {x}}_{|S_{i}}\), \(\frac {\partial }{\partial \hat {y}}_{|S_{i}}\) is an orthonormal basis for \(H_{|S_{i}}\) with a time orientation \(\frac {\partial }{\partial \hat {x}}\). Clearly, abnormal curves (which by (vii) are Hamiltonian geodesics) contained in S i satisfy transversality condition with respect to Γ; cf. [8], Lemma 3.1; see also [2]. Since the satisfaction of the transversality condition does not depend on a choice of coordinates, and, moreover, i was arbitrary, we see that every abnormal curve starting from Γ satisfies the transversality condition with respect to Γ. Now, by use of the exponential mapping and assumption (vii), similarly as it was done in [5,8], we are led to the existence of analytic coordinates x, y, z in which (H, g) has an orthonormal frame in the following form:

$$\left. \begin{array}{l} X=\dfrac{\partial }{\partial x}-yB\left(y\dfrac{\partial }{\partial x}+x\dfrac{ \partial }{\partial y}\right)-yA\dfrac{\partial }{\partial z}\medskip \\ Y=\dfrac{\partial }{\partial y}+xB\left(y\dfrac{\partial }{\partial x}+x\dfrac{ \partial }{\partial y}\right)+xA\dfrac{\partial }{\partial z} \end{array} \right. \text{,}$$

where S j ∩ {z = 0} = {y = c j x, z = 0}, ।c j । < 1, j = 1,...,k, S i = {y = 0} (i.e., c i = 0), and one can suppose that c k < ... < c 1. Using assumption (vi), we see that S j , j = 1,...,k, are all of the form S j = {y = c j x}.

Now, the z-coordinate of [X, Y] is equal to

$$\left[ X,Y\right] (z)=\left( 1-y^{2}B\right) \left( A+x\frac{\partial A}{ \partial x}\right) -x^{2}y\frac{\partial A}{\partial y}B+xy^{2}\frac{ \partial A}{\partial x}B+\left( 1+x^{2}B\right) \left( A+y\frac{\partial A}{ \partial y}\right) \text{.}$$

By our assumptions \(\left [ X,Y\right ] _{|y=c_{i}x}\) is horizontal, i = 1,...,k. Take i = 1. Then, there exist analytic functions f(x, z), g(x, z) such that \(\left [ X,Y\right ] _{|y=c_{1}x}=f(x,z)X_{|y=c_{1}x}+g(x,z)Y_{|y=c_{1}x}\). This leads us to the equality of the z-coordinates

$$2A+\left( \frac{\partial A}{\partial x}+c_{1}\frac{\partial A}{\partial y} \right) x+\left( 1-c^{2}\right) ABx^{2}=\left( g(x,z)-c_{1}f(x,z)\right) xA $$
(3.2)

where A and its derivatives should be evaluated at (x, c 1 x, z). Suppose that A(x, c 1 x, z) does not vanish identically. Then, we can find z such that A(x, c 1 x, z) = a m (z)x m + o(x m) as x → 0 with a m (z) ≠ 0 and m > 0. Since \(x\left ( \frac {\partial A}{\partial x} +c_{1}\frac {\partial A}{\partial y}\right ) _{|y=c_{1}x}=x\left ( X+c_{1}Y\right ) (A)_{|y=c_{1}x}=ma_{m}(z)x^{m}+o(x^{m})\), Eq. 3.2 gives

$$\left( 2+m\right) a_{m}(z)=o(1)\text{,}$$

so we arrive at a m (z) = 0 which is a contradiction. In this way, A may be replaced by the expression (yc 1 x)A for some other analytic function A.

Repeating the argument for i = 2,...,k, we are lead to Eq. 3.1. □

Now, suppose that our structure (H, g), which is given by Eq. 3.1 on a neighborhood U of the origin, satisfies the condition \( M_{l_{1},...,l_{k}}\), l i ≥ 2, i = 1,...,k.

Fix an index i. If we set A 1 to be the function defined by y (yc i x) A 1 = y (yc 1 x) ... (yc k x) A, the frame (3.1) takes the following form:

$$\left. \begin{array}{l} X=\dfrac{\partial }{\partial x}-yB\left(y\dfrac{\partial }{\partial x}+x\dfrac{ \partial }{\partial y}\right)-y\left( y-c_{i}x\right) A_{1}\dfrac{\partial }{ \partial z}\medskip \\ Y=\dfrac{\partial }{\partial y}+xB\left(y\dfrac{\partial }{\partial x}+x\dfrac{ \partial }{\partial y}\right)+x\left( y-c_{i}x\right) A_{1}\dfrac{\partial }{ \partial z} \end{array} \right. \text{.} $$
(3.3)

Applying the following change of coordinates

$$\left\{ \begin{array}{l} \tilde{x}=x\cosh \varphi -y\sinh \varphi \\ \tilde{y}=-x\sinh \varphi +y\cosh \varphi \\ \tilde{z}=z \end{array} \right. \text{,}$$

with tanh φ = c i (i.e., yc i x = /coshφ) and rewriting (3.3), we are led to

$$\left. \begin{array}{l} X=\left( \cosh \varphi \right) \tilde{X}-\left( \sinh \varphi \right) \tilde{ Y}\medskip \\ Y=-\left( \sinh \varphi \right) \tilde{X}+\left( \cosh \varphi \right) \tilde{Y} \end{array} \right. \text{,} $$
(3.4)

where

$$\left. \begin{array}{l} \tilde{X}=\dfrac{\partial }{\partial \tilde{x}}-\tilde{y}B\left(\tilde{y}\dfrac{ \partial }{\partial \tilde{x}}+\tilde{x}\dfrac{\partial }{\partial \tilde{y}} \right)-\tilde{y}^{2}\tilde{A}\dfrac{\partial }{\partial \tilde{z}}\medskip \\ \tilde{Y}=\dfrac{\partial }{\partial \tilde{y}}+\tilde{x}B\left(\tilde{y}\dfrac{ \partial }{\partial \tilde{x}}+\tilde{x}\dfrac{\partial }{\partial \tilde{y}} \right)+\tilde{x}\tilde{y}\tilde{A}\dfrac{\partial }{\partial \tilde{z}} \end{array} \right. \text{,} $$
(3.5)

and \(\tilde {A}=\frac {A_{1}}{\cosh \varphi }\). Obviously (cf. [13]) , is again an orthonormal frame for (H, g) with a time orientation , and we can apply to it the same method as in [9], Proposition 3.2. As a result, Eq. 3.5 can be written as follows:

$$\left. \begin{array}{l} \tilde{X}=\dfrac{\partial }{\partial \tilde{x}}-\tilde{y}B\left(\tilde{y}\dfrac{ \partial }{\partial \tilde{x}}+\tilde{x}\dfrac{\partial }{\partial \tilde{y}} \right)-\tilde{y}^{l_{i}}\hat{A}\dfrac{\partial }{\partial \tilde{z}}\medskip \\ \tilde{Y}=\dfrac{\partial }{\partial \tilde{y}}+\tilde{x}B\left(\tilde{y}\dfrac{ \partial }{\partial \tilde{x}}+\tilde{x}\dfrac{\partial }{\partial \tilde{y}} \right)+\tilde{x}\tilde{y}^{l_{i}-1}\hat{A}\dfrac{\partial }{\partial \tilde{z}} \end{array} \right. \text{,}$$

with \(\tilde {A}=\tilde {y}^{l_{i}-2}\hat {A}\). Passing again to Eq. 3.4, we obtain the following:

$$\left. \begin{array}{l} X=\dfrac{\partial }{\partial x}-yB\left(y\dfrac{\partial }{\partial x}+x\dfrac{ \partial }{\partial y}\right)-y\left( y-c_{i}x\right) ^{l_{i}-1}\left( \cosh \varphi \right) ^{l_{i}-1}\hat{A}\dfrac{\partial }{\partial z}\medskip \\ Y=\dfrac{\partial }{\partial y}+xB\left(y\dfrac{\partial }{\partial x}+x\dfrac{ \partial }{\partial y}\right)+x\left( y-c_{i}x\right) ^{l_{i}-1}\left( \cosh \varphi \right) ^{l_{i}-1}\hat{A}\dfrac{\partial }{\partial z} \end{array} \right. \text{,}$$

i.e., to say

$$\left. \begin{array}{l} X=\dfrac{\partial }{\partial x}-yB\left(y\dfrac{\partial }{\partial x}+x\dfrac{ \partial }{\partial y}\right)-y\left( y-c_{1}x\right) ...\left( y-c_{i}x\right) ^{l_{i}-1}...\left( y-c_{k}x\right) A\dfrac{\partial }{\partial z}\medskip \\ Y=\dfrac{\partial }{\partial y}+xB\left(y\dfrac{\partial }{\partial x}+x\dfrac{ \partial }{\partial y}\right)+x\left( y-c_{1}x\right) ...\left( y-c_{i}x\right) ^{l_{i}-1}...\left( y-c_{k}x\right) A\dfrac{\partial }{\partial z} \end{array} \right.$$

for some new analytic function A. Repeating the same argument for every i = 1,...., k, we are led to

$$\left. \begin{array}{l} X=\dfrac{\partial }{\partial x}-yB\left(y\dfrac{\partial }{\partial x}+x\dfrac{ \partial }{\partial y}\right)+y\left( y-c_{1}x\right) ^{l_{1}-1}...\left( y-c_{k}x\right) ^{l_{k}-1}A\dfrac{\partial }{\partial z}\medskip \\ Y=\dfrac{\partial }{\partial y}+xB\left(y\dfrac{\partial }{\partial x}+x\dfrac{ \partial }{\partial y}\right)-x\left( y-c_{1}x\right) ^{l_{1}-1}...\left( y-c_{k}x\right) ^{l_{k}-1}A\dfrac{\partial }{\partial z} \end{array} \right. $$
(3.6)

for yet another analytic A about which we know that does not contain terms of the form (yc i x)l. Equation 3.6 is, in fact, all we can get without assuming (iv).

Now, we take (iv) into account. To obtain a condition for A, we need l 1 + ... + l k k + 1 differentiations of X(z) or Y(z), i.e., we have to consider sections of \(H^{l_{1}+...+l_{k}-k+2}\). So let W be a (local) section of \(H^{l_{1}+...+l_{k}-k+2}\) defined near zero. Then, looking at Eq. 3.6, we see that W(z) = CA + O(r), \(r=\sqrt {x^{2}+y^{2}+z^{2}}\), where by (iv) C ≠ 0 for suitable chosen W. Setting x = y = 0, we arrive at A(0, 0, z) ≠ 0. Last stage is to renormalize the z-axis, so as to have \(A(0,0,z)=\frac {1}{2}\). This can be done similarly as, e.g., in [9]. To end the proof, we write φ = − B, ψ = 2A − 1.

4 Reachable Sets in the General Case

In this section, by (H, g) we denote a fixed time-oriented sub-Lorentzian metric of type M 2, 2, defined on a normal neighborhood U of the origin in ℝ3. Throughout this section, we assume that U is as small as we need. We may suppose that (H, g) is already transformed to the normal form. So let X, Y be an orthonormal frame for (H, g) given on U by Eq. 1.1. In cases W > 0, W < 0, we will use the same method to compute local reachable sets as in [79]. The mentioned method, however, does not work when W = 0, and it seems impossible to arbitrate in advance what the structure of the reachable set will be in this case.

Let X = + X 1, Y = Ŷ + Y 2, and , Ŷ be as in Eq. 2.1, and

$$\begin{array}{l} Y_{1}=y\varphi \left(y\dfrac{\partial }{\partial x}+x\dfrac{\partial }{\partial y} \right)+\dfrac{1}{2}y\left( y-c_{1}x\right) \left( y-c_{2}x\right) \psi \dfrac{ \partial }{\partial z}\medskip \\ Y_{2}=-x\varphi \left(y\dfrac{\partial }{\partial x}+x\dfrac{\partial }{\partial y }\right)-\dfrac{1}{2}x\left( y-c_{1}x\right) \left( y-c_{2}x\right) \psi \dfrac{ \partial }{\partial z} \end{array} \text{.} $$
(4.1)

4.1 The Case W < 0

Similarly as in Section 2, consider two Cauchy problems:

$$(X-Y)(\eta )=0\text{, \ \ }\eta (x,x,z)=z$$

with the solution denoted by η 1, and

$$(X+Y)(\eta )=0\text{, \ \ }\eta (x,-x,z)=-z$$

with the solution denoted by η 2. We write η 1 = η̂ 1 + R 1, η 2 = η̂ 2 + R 2. It is seen that R 1 and R 2 satisfy, respectively,

$$(X-Y)(R_{1})=-(X_{1}-Y_{1})(\hat{\eta}_{1})\text{, \ \ }R_{1}(x,x,z)=0$$

and

$$(X+Y)(R_{2})=-(X_{1}+Y_{1})(\hat{\eta}_{2})\text{, \ \ }R_{2}(x,-x,z)=0\text{.}$$

Clearly, (X 1Y 1)(η̂ 1) = Or 4, \(r=\sqrt {x^{2}+y^{2}+z^{2}}\). Since η 1z is divisible by x 2y 2( y = − x is the trajectory of XY starting at (0, 0, 0)), we deduce that R 1 = O (r 5). Similarly, R 2 = O (r 5) which, in view of η̂ i = ± z + O (r 4), means that η i may be regarded as a perturbation of η̂ i , i = 1, 2. Exactly, e.g., as in subsection 4.1 of [9], we prove that X(η 1) is divisible by xy. It follows that ∇ H η 1 = − X(η 1)(XY) where, by using Eq. 2.4, we have

$$\begin{array}{@{}rcl@{}} X(\eta _{1})&=&-\frac{1}{12}\left( x-y\right) \left(\left( 7c_{1}c_{2}-2c_{1}-2c_{2}+1\right) x^{2}+4\left( c_{1}c_{2}-2c_{1}-2c_{2}+1\right) xy\allowbreak\right. \notag \\ &&{\kern5pc}\left. +\left( c_{1}c_{2}-2c_{1}-2c_{2}+7\right) y^{2}+O(r^{3})\right)\text{.} \end{array}$$
(4.2)

It follows that X(η 1) < 0 on U ∩ {−x < y < x}, and ∇ H η 1 is null f.d. on U ∩ {−x < y < x}. Similarly,

$$\begin{array}{@{}rcl@{}} X(\eta _{2})&=&-\frac{1}{12}\left( x+y\right) \left(\left( 7c_{1}c_{2}+2c_{1}+2c_{2}+1\right) x^{2}-4\left( c_{1}c_{2}+2c_{1}+2c_{2}+1\right) xy \right.\notag\\ && {\kern5pc} \left.+ \left( c_{1}c_{2}+2c_{1}+2c_{2}+7\right) y^{2}+O(r^{3})\right) \end{array}$$
(4.3)

from which X(η 2) < 0 on U ∩ {−x < y < x}, and hence, ∇ H η 2 is null f.d. on U ∩ {−x < y < x}. We finish the proof of Theorem 1.2 as in the Section 2.1.

Geometrically, optimal trajectories in this case are the same as in the corresponding flat case, with (resp. Ŷ) replaced by X (resp. Y). Also, reachable sets look similarly; cf. Fig. 1.

4.2 The Case W > 0

Here, X(η 1) and X(η 2) are again given by Eqs. 4.2 and 4.3, respectively. This time, however, X(η 1) < 0 on {(E 1 + ε)x < y < x} ∩ U and X(η 2) < 0 on {−x < y < (E 2ε)x} ∩ U, where ε > 0 will be supposed to be sufficiently small. Next, we define functions ξ 11, ξ 12 as solutions to the following Cauchy problems:

$$\left( X+Y\right) (\eta )=0\text{, \ \ }\eta (x,c_{2}x,z)=z$$

and

$$\left( X-Y\right) (\eta )=0\text{, \ \ }\eta (x,c_{2}x,z)=z\text{,}$$

respectively. As above, we write ξ 11 = ξ̂ 11 + R 11, ξ 12 = ξ̂ 12 + R 12, where, e.g., R 11 satisfies (X + Y)(R 11) = − (X 1 + Y 1)ξ̂ 11, R 11(x, c 2 x, z) = 0. It follows that R 11 = Or 5, and similarly R 12 = O(r 5). So again, we may think of ξ 1i as being perturbations of ξ̂ 1i , i = 1, 2. Now, since \(X+c_{2}Y_{|y=c_{2}x}=\frac {\partial }{\partial x}+c_{2} \frac {\partial }{\partial y}\), (X + c 2 Y)(ξ 11)| y = c 2x = 0 by definition of ξ 11. But also (X + Y)(ξ 11) = 0, from which \(X(\xi _{11})_{|y=c_{2}x}=0\), and therefore X(ξ 11) is divisible by yc 2 x. We prove analogously that also X(ξ 12) is divisible by yc 2 x. However, this is not enough for our purposes, and we need the following:

Lemma 4.1

X(ξ 11) and X(ξ 12) are divisible by (yc 2 x)2.

Proof

We prove the first statement. We already know that X(ξ 11) = (yc 2 x)g for an analytic function g. Since [X, X + Y] = 0 on {y = c 2 x}, it follows that (X + Y)(X(ξ 11)) = X(X + Y)(ξ 11) = 0 on {y = c 2 x}, where

$$(X+Y)(X(\xi _{11}))=\left( 1-c_{2}-(y-x)(c_{2}y+x)\varphi \right) g+O\left( (y-c_{2}x)\right) \text{.}$$

By setting y = c 2 x, we arrive at \((1-c_{2})\left [ 1+(c_{2}+1)x^{2}\varphi \right ] g_{|y=c_{2}x}=0\), and the proof is over since g must be divisible by yc 2 x (recall that U is as small as we need).

The proof of the second statement is analogous. We notice that [X, XY] = 0 on {y = c 2 x}, so (XY)(X(ξ 11)) = X(XY)(ξ 11) = 0 on {y = c 2 x} and continue in the same manner. □

Making use of Eqs. 2.102.11, and the above lemma, ∇ H ξ 11 = − X(ξ 11)(X + Y) with

$$X(\xi _{11})=\frac{\left( y-c_{2}x\right) ^{2}}{3\left( c_{2}-1\right) ^{2}} \left( \left( 2c_{1}c_{2}-3c_{1}+c_{2}\right) x+\left( c_{1}-3c_{2}+2\right) y+O(r^{2})\right)$$

and ∇ H ξ 12 = − X(ξ 12)(XY) with

$$X(\xi _{12})=-\frac{\left( y-c_{2}x\right) ^{2}}{3\left( c_{2}+1\right) ^{2}} \left( \left( 2c_{1}c_{2}+3c_{1}-c_{2}\right) x+\left( c_{1}-3c_{2}-2\right) y+O(r^{2})\right) \text{.}$$

This, of course, implies that ∇ H ξ 11 is null f.d. on \(\left \{ c_{2}x<y<\left ( -\frac {2c_{1}c_{2}-3c_{1}+c_{2}}{c_{1}-3c_{2}+2}-\varepsilon \right ) x\right \} \cap U\), and ∇ H ξ 12 is null f.d. on − x < y < c 2 xU.

Using just the presented considerations and remembering Section 2.2, we may suppose that ξ 11 < η 1 on {c 2 x < y < (E 1 + ε)x} ∩ U, while ξ 11 > η 1 on {(c 1ε)x < y < x} ∩ U or, which is more convenient to us, that ξ 11 < η 1 on {c 2 x < y < (S 1ε)x} ∩ U, while ξ 11 > η 1 on {(S 1 + ε)x < y < x} ∩ U.

Let us define a set Z 1 by

$$Z_{1}=\left\{ \eta _{1}=0\right\} \cap \left\{ \xi _{11}=0\right\} \cap \left\{ (S_{1}-\varepsilon )x<y<\left( S_{1}+\varepsilon \right) x\right\} \cap U\text{;}$$

clearly Z 1 is a semi-analytic set (cf. [12]). As in [8] and [9], we make sure that dimZ 1 = 1 and that Z 1 is made up of a single analytic curve entering the origin. Further, let us define a semi-analytic set by Σ 1 = ρ − 1(ρ(Z 1)) ∩ U where ρ : ℝ3 → ℝ2 is the projection (x, y, z) → (x, y), and let \(\Sigma _{1}^{+}\), \(\Sigma _{1}^{-}\) be the two connected components of U ∩ {x ≥ 0} ∩ {z ≥ 0} ∩ {c 2 xyx}∖Σ 1, containing half-lines y = x, x ≥ 0 and y = c 2 x, x ≥ 0, respectively. All what we have just said leads us to

$$J^{+}(0,U)\cap \left\{ z\geq 0\right\} =A_{1}\cup A_{2}\cup A_{3}$$

where

$$A_{1}=\left\{ \eta _{1}\leq 0\right\} \cap \Sigma _{1}^{+}\text{,}$$
$$A_{2}=\left\{ \xi _{11}\leq 0\right\} \cap \Sigma _{1}^{-}\text{,}$$
$$A_{3}=\left\{ \xi _{12}\leq 0\right\} \cap \left\{ -x\leq y\leq c_{2}x\right\} \cap \left\{ z\geq 0\right\} \cap U$$

as it was announced in Theorem 1.3.

Quite similar considerations can be carried out to describe the set J +(0, U) ∩ {z ≤ 0}. We only note that this time, we define a 1-dimensional semi-analytic set

$$Z_{2}=\left\{ \eta _{2}=0\right\} \cap \left\{ \xi _{21}=0\right\} \cap \left\{ (S_{2}-\varepsilon )x<y<\left( S_{2}+\varepsilon \right) x\right\} \cap U\text{.}$$

Then, we set Σ 2 = ρ − 1(ρ(Z 2)) ∩ U and define \(\Sigma _{2}^{+}\), \(\Sigma _{2}^{-}\) to be the two connected components of U ∩ {x ≥ 0} ∩ {z ≤ 0} ∩ {−xyc 1 x}∖Σ 2, containing half-lines y = c 1 x, x ≥ 0, and y = − x, x ≥ 0, respectively. Finally, we obtain

$$J^{+}(0,U)\cap \left\{ z\leq 0\right\} =A_{4}\cup A_{5}\cup A_{6}$$

where

$$A_{4}=\left\{ \eta _{2}\leq 0\right\} \cap \Sigma _{2}^{-}\text{,}$$
$$A_{5}=\left\{ \xi _{21}\leq 0\right\} \cap \Sigma _{2}^{+}\text{,}$$
$$A_{6}=\left\{ \xi _{22}\leq 0\right\} \cap \left\{ c_{1}x\leq y\leq x\right\} \cap \left\{ z\leq 0\right\} \cap U\text{.}$$

This terminates the proof of Theorem 1.3.

All geometrically optimal curves are listed in Section 2.2 (again with , Ŷ replaced by X, Y), and ∂͂ J +(0, U) ∩ {x = const > 0} can be depicted as in Fig. 4.

4.3 The Case W = 0

Here, as it was mentioned earlier, we are not able to say what the structure of reachable sets is. This is, for instance, because the relation η̂ 1(x, c 2 x, 0) = 0 may no longer be true after perturbation. Therefore, we cannot predict the sign of the expression η 1(x, c 2 x, 0) even in a small neighborhood of the origin without computing higher-order terms in the expression for η i . We will not do it in this paper.

4.4 Nilpotent Approximations

Note that all the functions describing reachable sets in the flat case, i.e., η̂ i , ξ̂ ij , and i, j = 1, 2, are homogeneous with respect to the family of dilatations δ t (x, y, z) = (tx, ty, t 4 z). In other words, the mentioned functions are homogeneous when we prescribe the following weights to variables: weight (x) = weight(y) = 1, weight (z) = 4. It also means that the flat structure is the nilpotent approximation for structures given by Eq. 1.1; cf. [3].

5 Applications to Control Affine Systems

Let us consider a control affine system

$$\dot{q}=X+uY\text{, \ \ }u\in \lbrack a,b]\text{,} $$
(5.1)

defined on a neighborhood U of the origin in ℝ3, where X and Y are supposed to be an orthonormal frame for the sub-Lorentzian metric of type M 2, 2. We can assume that X and Y are given by Eq. 1.1. As it was mentioned in Section 1, the reachable set 𝒜[a, b](0, U) for Eq. 5.1 coincides with the future nonspacelike reachable set J +(0, U) for the time-oriented sub-Lorentzian structure (H, g a, b) defined by declaring the frame

$$\left. \begin{array}{l} Z^{a,b}=X+\frac{1}{2}(b+a)Y\medskip \\ W^{a,b}=\frac{1}{2}(b-a)Y \end{array} \right.$$

to be orthonormal with a time orientation Z a, b. Thus, 𝒜[a, b](0, U) = J +(0, U) is just the reachable set for the system

$$\dot{q}=Z^{a,b}+uW^{a,b}\text{, \ \ }u\in \lbrack -1,1]\text{,} $$
(5.2)

(cf. Lemmas 1.1 and 1.2 in [9]).

The aim of this section is to prove Theorem 1.4. To save space, we will not give exact formulas for functions describing reachable sets. We will restrict ourselves only in examining the structure of reachable sets and its dependence on geometric optimality of singular trajectories.

The first evident observation is that

$$\mathcal{A}_{[a,b]}(q_{0},U)\subset \left\{ ax\leq y\leq bx\right\} \cap U \text{.}$$
(5.3)

5.1 The Case c 1, c 2 ∉ (a, b)

In this case, there are no singular trajectories starting from the origin for Eq. 5.2 (and hence for Eq. 5.1 since both systems are equivalent). As in [7,9] or as above, we investigate z-coordinates of Z a, b ± W a, b. So,

$$\left( Z^{a,b}+W^{a,b}\right) (z)=\frac{1}{2}\left( y-c_{1}x\right) \left( y-c_{2}x\right) \left( y-bx\right) \left( 1+\psi \right) $$
(5.4)

and

$$\left( Z^{a,b}-W^{a,b}\right) (z)=\frac{1}{2}\left( y-c_{1}x\right) \left( y-c_{2}x\right) \left( y-ax\right) \left( 1+\psi \right) $$
(5.5)

have opposite signs on {ax < y < bx} ∩ U. In this way, again as in [7,9], 𝒜[a, b](q 0, U) is described by two analytic functions.

5.2 The Case c 2a < c 1 < b or a < c 2 < bc 1

In this case, the system (5.2) has one singular trajectory initiating at the origin. Again, we examine the signs of z-coordinates of Z a, b ± W a, b. Thus, in the first case, (5.4) on {y = ax} and Eq. 5.5 on {y = bx} are both positive, while in the second case, they are both negative. Also, Eqs. 5.4 and 5.5 are negative near {y = c 1 x} in the first case, while Eqs. 5.4 and 5.5 are positive near {y = c 2 x} in the second case. Therefore, we arrive at the similar situation as in [9], and a similar reasoning as in [9] leads to the conclusion that the minimal number of analytic functions needed for describing 𝒜[a, b](q 0, U) is four.

5.3 The Case a < c 2 < c 1 < b

In this case, the system (5.2) has two singular trajectories initiating at the origin. In order to simplify the situation, we make the following change of coordinates in Eq. 5.2:

$$\left\{ \begin{array}{l} \tilde{x}=x \\ \tilde{y}=-\frac{b+a}{b-a}x+\frac{2}{b-a}y \\ \tilde{z}=z \end{array} \right. \text{.} $$
(5.6)

The resulting system is as follows:

$$\dot{q}=\tilde{Z}^{a,b}+u\tilde{W}^{a,b}\text{, \ \ }u\in \lbrack -1,1]\text{,} $$
(5.7)

where

$$\begin{array}{l} \tilde{Z}^{a,b}=\frac{\partial }{\partial \tilde{x}}+\varphi A^{a,b}+\frac{1 }{2}\left( \frac{b-a}{2}\right) ^{3}\tilde{y}\left( \tilde{y}-\tilde{c}_{1} \tilde{x}\right) \left( \tilde{y}-\tilde{c}_{2}\tilde{x}\right) \left( 1+\psi \right) \frac{\partial }{\partial \tilde{z}}\medskip \\ \tilde{W}^{a,b}=\frac{\partial }{\partial \tilde{y}}-\varphi B^{a,b}-\frac{1 }{2}\left( \frac{b-a}{2}\right) ^{3}\tilde{x}\left( \tilde{y}-\tilde{c}_{1} \tilde{x}\right) \left( \tilde{y}-\tilde{c}_{2}\tilde{x}\right) \left( 1+\psi \right) \frac{\partial }{\partial \tilde{z}} \end{array} $$
(5.8)

with

$$\begin{array}{l} A^{a,b}=\frac{1}{2}\tilde{y}\left( \frac{1}{2}\left( b^{2}-a^{2}\right) \tilde{x}+\frac{1}{2}\left( b-a\right) ^{2}\tilde{y}\right) \frac{\partial }{ \partial \tilde{x}}+\frac{1}{2}\tilde{y}\left( \frac{1}{2}\left( 4-\left( a+b\right) ^{2}\right) \tilde{x}-\frac{1}{2}\left( b^{2}-a^{2}\right) \tilde{ y}\right) \frac{\partial }{\partial \tilde{y}}\medskip \\ B^{a,b}=\frac{1}{2}\tilde{x}\left( \frac{1}{2}\left( b^{2}-a^{2}\right) \tilde{x}+\frac{1}{2}\left( b-a\right) ^{2}\tilde{y}\right) \frac{\partial }{ \partial \tilde{x}}+\frac{1}{2}\tilde{x}\left( \frac{1}{2}\left( 4-\left( a+b\right) ^{2}\right) \tilde{x}-\frac{1}{2}\left( b^{2}-a^{2}\right) \tilde{ y}\right) \frac{\partial }{\partial \tilde{y}} \end{array} \text{,}$$

and

$$\tilde{c}_{1}=\frac{2c_{1}-b-a}{b-a}\text{, \ \ }\tilde{c}_{2}=\frac{ 2c_{2}-b-a}{b-a}\text{.} $$
(5.9)

Note that by rescaling the -axis, we can always get rid of the factor \(\left ( \frac {b-a}{2}\right ) ^{3}\), so we no longer take care of it. Since the change of coordinates (5.6) is bi-analytic, transforms straight lines onto straight lines, and preserves geometric optimality of trajectories, the reachable set for Eq. 5.7 is described by the same number of analytic functions as 𝒜[a, b](q 0, U) and has the same number of geometrically optimal singular trajectories. Now, we repeat the above arguments. Reachable sets for Eq. 5.7 with φ = ψ = 0 in Eq. 5.8 are computed according to Section 2. Reachable sets for Eq. 5.7 with arbitrary φ and ψ in Eq. 5.8 are computed according to Section 4 (here, we should remark that although Eq. 5.8 does not coincide with Eq. 1.1, the argument still works since, as one easily checks, functions describing reachable sets for arbitrary φ and ψ are again perturbations of functions describing reachable sets for φ = ψ = 0). To sum up, 𝒜[a, b](q 0, U) is described by six analytic function whenever W( 1, 2) > 0 and by two analytic functions whenever W( 1, 2) < 0. We also know that in case W( 1, 2) = 0 and φ = ψ = 0, two analytic functions suffice.

In this way the proof of Theorem 1.4 is over.

Remark 5.1

At the end, let us note that the presented method of analysis of reachable sets works for affine control systems induced by any sub-Lorentzian structure described by Theorem 3.1, but because of a large number of constants, it is difficult to state a general theorem.

5.4 Two Examples

To illustrate how the presented methods work in practice, let us consider two examples.

Example 5.1

Suppose that we are interested in the structure of the reachable set from the origin for the following affine control system:

$$\dot{q}=X+uY\text{, \ \ }u\in \lbrack -1,4]\text{,} $$
(5.10)

where

$$\begin{array}{l} X=\frac{\partial }{\partial x}+\frac{1}{2}y(y-2x)(y-3x)\frac{\partial }{ \partial z}\medskip \\ Y=\frac{\partial }{\partial y}-\frac{1}{2}x(y-2x)(y-3x)\frac{\partial }{ \partial z} \end{array} \text{.}$$

This system is equivalent to the sub-Lorentzian structure (H, g − 1, 4) defined by an orthonormal basis Z − 1, 4, W − 1, 4 with Z − 1, 4 being a time orientation, where

$$\begin{array}{l} Z^{-1,4}=\frac{\partial }{\partial x}+\frac{3}{2}\frac{\partial }{\partial y} +\frac{1}{2}\left( y-\frac{3}{2}x\right) \left( y-2x\right) \left( y-3x\right) \frac{\partial }{\partial z}\medskip \\ W^{-1,4}=\frac{5}{2}\frac{\partial }{\partial y}-\frac{5}{4}x(y-2x)(y-3x) \frac{\partial }{\partial z} \end{array} \text{.}$$

According to Eq. 5.6, we change coordinates as follows: = x, \(\tilde {y}=-\frac {3}{5}\tilde {x}+\frac {2}{5}\tilde {y}\), = z, and as a result, we have

$$\begin{array}{l} Z^{-1,4}=\frac{\partial }{\partial \tilde{x}}+\frac{1}{2}\left( \frac{5}{2} \right) ^{3}\tilde{y}\left( \tilde{y}-\frac{3}{5}\tilde{x}\right) \left( \tilde{y}-\frac{1}{5}\tilde{x}\right) \frac{\partial }{\partial \tilde{z}} \\ W^{-1,4}=\frac{\partial }{\partial \tilde{y}}-\frac{1}{2}\left( \frac{5}{2} \right) ^{3}\tilde{x}\left( \tilde{y}-\frac{3}{5}\tilde{x}\right) \left( \tilde{y}-\frac{1}{5}\tilde{x}\right) \frac{\partial }{\partial \tilde{z}} \end{array} \text{.} $$
(5.11)

We end up with \(\tilde {c}_{1}=\frac {3}{5}\), \(\tilde {c}_{2}=\frac {1}{5}\). Since \(W\left (\frac {3}{5},\frac {1}{5}\right )<0\), we know that there are no geometrically optimal singular trajectories; hence, the reachable set from the origin for the system that we started with can be described by two analytic functions. 𝒜[−1, 4](0, ℝ) ∩ {x = const > 0} can be depicted as in Fig. 1, where this time A (resp. B) corresponds to half-line {y = − x, z = 0} (resp. {y = 4x, z = 0}). An approximate shape of 𝒜[−1, 4](0, ℝ) can be seen in Fig. 5.

Fig. 5
figure 5

The surfaces bounding the reachable set in the case W < 0

Example 5.2

Consider again the system (5.10) where this time

$$\begin{array}{l} X=\frac{\partial }{\partial x}+\frac{1}{2}y(y-x)(y-3x)\frac{\partial }{ \partial z}\medskip \\ Y=\frac{\partial }{\partial y}-\frac{1}{2}x(y-x)(y-3x)\frac{\partial }{ \partial z} \end{array} \text{.}$$

According to the above procedure, we pass to the sub-Lorentzian structure induced by an orthonormal frame:

$$\begin{array}{l} Z^{-1,4}=\frac{\partial }{\partial x}+\frac{3}{2}\frac{\partial }{\partial y} +\frac{1}{2}\left( y-x\right) \left( y-3x\right) \left( y-\frac{3}{2} x\right) \frac{\partial }{\partial z}\medskip \\ W^{-1,4}=\frac{5}{2}\frac{\partial }{\partial y}-\frac{5}{4}x(y-x)(y-3x) \frac{\partial }{\partial z} \end{array} \text{.}$$

Making again the same change of coordinates, we are led to

$$\begin{array}{l} Z^{-1,4}=\frac{\partial }{\partial \tilde{x}}+\frac{1}{2}\left( \frac{5}{2} \right) ^{3}\tilde{y}\left( \tilde{y}-\frac{3}{5}\tilde{x}\right) \left( \tilde{y}+\frac{1}{5}\tilde{x}\right) \frac{\partial }{\partial \tilde{z}} \\ W^{-1,4}=\frac{\partial }{\partial \tilde{y}}-\frac{1}{2}\left( \frac{5}{2} \right) ^{3}\tilde{x}\left( \tilde{y}-\frac{3}{5}\tilde{x}\right) \left( \tilde{y}+\frac{1}{5}\tilde{x}\right) \frac{\partial }{\partial \tilde{z}} \end{array} \text{,}$$

so \(\tilde {c}_{1}=\frac {3}{5}\), \(\tilde {c}_{2}=-\frac {1}{5}\). But this time, \( W\left (\frac {3}{5},-\frac {1}{5}\right )>0\); therefore, we conclude that there are two geometrically optimal singular trajectories, and that we need six analytic functions to describe the reachable set 𝒜[−1, 4](0, ℝ) for the system we started with. The intersection 𝒜[−1, 4](0, ℝ) ∩ x = const > 0 can be represented, with obvious changes in the interpretation, as in Fig. 4. 𝒜[−1, 4](0, ℝ) looks approximately as in Fig. 7.

Remark 5.2

Let us notice in this place that the structure of reachable sets in the case a < c 2 < c 1 < b essentially depends on a and b. For instance, if in the last example with c 1 = 3, c 2 = 1, a = − 1, we let b vary then (cf. 5.9), the expression \(W\left (\frac {2c_{1}-b-a}{b-a},\frac { 2c_{2}-b-a}{b-a}\right )=W\left (\frac {6-b+1}{b+1},\frac {2-b+1}{b+1}\right )=-4\frac {b-7}{\left ( b+1\right ) ^{2}}\) changes sign when b exceeds 7.

6 Proofs of Lemmas from Section 2.2

In this section, we present the proofs of lemmas from Section 2.

Proof of Lemma 2.1

The proof relies on straightforward computations. For instance, E 1 < 1 is equivalent to

$$\sqrt{-3\left( c_{1}c_{2}-2c_{1}+2c_{2}-1\right) \left( c_{1}c_{2}+2c_{1}-2c_{2}-1\right) }<3c_{1}c_{2}-6c_{1}-6c_{2}+9\text{,} $$
(6.1)

where 3c 1 c 2 − 6c 1 − 6c 2 + 9 = 3 (W + 4 − 4c 1) is positive. Squaring both sides of Eq. 6.1, we see that Eq. 6.1 is equivalent to

$$12\left( c_{2}-1\right) \left( c_{1}-1\right) \left( c_{1}c_{2}-2c_{1}-2c_{2}+7\right) >0$$

which is, of course, true. The remaining inequalities are proved analogously. □

Proof of Lemma 2.2

We prove, for instance, that c 2 < E 4. By Eq. 2.9, this is equivalent to

$$-\left( c_{2}+2\right) \left( 2c_{2}+c_{1}c_{2}-1-2c_{1}\right) >\sqrt{ -3\left( c_{1}c_{2}-2c_{1}+2c_{2}-1\right) \left( c_{1}c_{2}+2c_{1}-2c_{2}-1\right) }$$

which, in turn, is equivalent to

$$\begin{array}{@{}rcl@{}} &&\left( c_{2}+2\right) ^{2}\left( 2c_{2}+c_{1}c_{2}-1-2c_{1}\right) ^{2}+3\left( c_{1}c_{2}-2c_{1}+2c_{2}-1\right) \left( c_{1}c_{2}+2c_{1}-2c_{2}-1\right) \\ &=&\left( c_{2}-1\right) \left( c_{2}+1\right) \left( c_{1}c_{2}+2c_{1}+2c_{2}+7\right) \left( 2c_{2}+c_{1}c_{2}-1-2c_{1}\right) >0 \text{.} \end{array}$$

Proof of Lemma 2.3

We will prove the first inequality. Since c 1 − 3c 2 + 2 > 0, the hypothesis is equivalent to − 2c 1 c 2 − 3c 1 + c 2c 1 c 1 − 3c 2 + 2 = 1 − c 1 c 1c 2 > 0 which is, of course, true. □

Proof of Lemma 2.4

We look for stationary points of f. Any such point (x, y) must satisfy the equality

$$\frac{\partial f}{\partial x}(x,y)+\frac{\partial f}{\partial y}(x,y)=\left( x+1\right) \left( y-1\right) \left( x+y\right) \left( 3xy+5y-5x-3\right) =0 \text{.}$$

Clearly, 3xy + 5y − 5x − 3 = 3(xy − 1) − 5(xy) < 0 in D, thus, either x = − 1 or y = 1, or y = − x. If y = − x in D, we must have x > 0, but then \(\frac {\partial f}{ \partial x}(x,-x)=-3x^{5}-20x^{4}-46x^{3}-23x-4<0\), while \(\frac {\partial f}{ \partial y}(x,-x)=3x^{5}+20x^{4}+46x^{3}+23x+4>0\). It follows that all stationary points of f are contained in ∂D. By direct calculation, we make sure that f | ∂D ≤ 0 which implies f < 0 in D. □

Proof of Lemma 2.5

We prove the second inequality. Since − 3c 1 + c 2 − 2 < 0, the hypo- thesis is equivalent to − 2c 1 c 2c 1 + 3c 2c 2 − 3c 1 + c 2 − 2 = c 2 + 1 c 1c 2 > 0. □

Proof of Lemma 2.6

This is proved analogously as Lemma 2.4. □

7 Pictures

In this section, we present 3-dimensional visualizations of exemplary reachable sets studied in Section 2.

At first, consider the case \(c_{1}=\frac {3}{4}\), \(c_{2}=\frac {1}{2}\). Clearly, \(W\left (\frac {3}{4},\frac {1}{2}\right )=-\frac {1}{8}\), and in this case,

$$\begin{array}{@{}rcl@{}} \hat{\eta}_{1}(x,y,z)&=&-\frac{1}{128}\left( x-y\right) \left( x+y\right) \left( -12xy+3x^{2}+13y^{2}\right) +z, \\ \hat{\eta}_{2}(x,y,z)&=&-\frac{1}{384}\left( x-y\right) \left( x+y\right) \left( -124xy+49x^{2}+79y^{2}\right) \allowbreak -z. \end{array}$$

On all the following three figures, the plane z = 0 is marked with white color. The corresponding reachable set is bounded by two hypersurfaces: by {η̂ 1 = 0} from above (darker color) and by {η̂ 2 = 0} from below (lighter color) as it is presented in Fig. 5.

Next, consider the case \(c_{1}=\frac {1}{2}\), c 2 = 0. Now, \(W\left (\frac {1}{2},0\right )=0\), and

$$\begin{array}{@{}rcl@{}} \hat{\eta}_{1}(x,y,z)&=&-\frac{1}{8}y^{2}\left( x-y\right) \left( x+y\right) +z, \\ \hat{\eta}_{2}(x,y,z)&=&-\frac{1}{24}\left( x-y\right) \left( x+y\right) \left( x-2y\right) ^{2}-z. \end{array}$$

The reachable set in this case is presented in Fig. 6. As in the previous case, it is the set bounded by two hypersurfaces {η̂ i = 0}, i = 1, 2, with similarly chosen colors.

Fig. 6
figure 6

The surfaces bounding the reachable set in the case W = 0

Finally, consider the case \(c_{1}=\frac {1}{2}\), \(c_{2}=-\frac {1}{2}\). This time \(W\left (\frac {1}{2},-\frac {1}{2}\right )=\frac {3}{4}\), and

$$\begin{array}{@{}rcl@{}} \hat{\eta}_{1}(x,y,z)&=&\frac{1}{64}\left( x-y\right) \left( x+y\right) \left( -4xy+x^{2}-9y^{2}\right) +z,\\ \hat{\eta}_{2}(x,y,z)&=&\frac{1}{64}\left( x-y\right) \left( x+y\right) \left( 4xy+x^{2}-9y^{2}\right) -z, \\ \hat{\xi}_{11}(x,y,z)&=&-\frac{1}{216}\left( 5x-8y\right) \left( x-y\right) \left( x+2y\right) ^{2}+z, \\ \hat{\xi}_{12}(x,y,z)&=&-\frac{1}{8}x\left( x+y\right) \left( x+2y\right) ^{2}+z, \\ \hat{\eta}_{2}(x,y,z)&=&\frac{1}{64}\left( x-y\right) \left( x+y\right) \left( 4xy+x^{2}-9y^{2}\right) -z, \\ \hat{\xi}_{21}(x,y,z)&=&\allowbreak -\frac{1}{216}\left( x+y\right) \left( 5x+8y\right) \left( x-2y\right) ^{2}-z,\\ \hat{\xi}_{22}(x,y,z)&=&\allowbreak -\frac{1}{8}x\left( x-y\right) \left( x-2y\right) ^{2}-z. \end{array}$$

The corresponding reachable set is presented in Fig. 7. It is the set which is bounded by six hypersurfaces {η̂ i = 0}, {ξ̂ ij = 0}, i, j = 1, 2.

Fig. 7
figure 7

The surfaces bounding the reachable set in the case W > 0