1 Introduction

The addition of mid-course corrections (or deep space maneuvers, DSM) along an interplanetary transfer has the advantage of decreasing the overall cost of a trajectory. In fact, applying a DSM will modify not only the trajectory but also the boundary conditions (BC) of the orbit (Bernelli-Zazzera et al. 2007). In an impulsive thrust case, this implies modifying the departure and arrival \(\varDelta \mathbf {V}\)s since the cost of a mission can be defined as sum of impulses (magnitudes of the \(\varDelta \mathbf {V}\)s). Generally, this process is done in the scenario of the patched conics method, which means considering a sequence of Keplerian arcs linked together (Battin 1999). The inclusion of a DSM along a transfer orbit has the main drawback of increasing the search space complexity of an optimization problem (Vasile and De Pascale 2006). For this reason, it can be useful to have a method that gives a first guess on where to perturb the orbit for the possible addition of a mid-course correction.

In the early 1960s, Lawden introduced the concept of the primer vector in Lawden (1963). Derived with an indirect optimal control method, this variable is associated with the equation of motion and it has to satisfy a set of necessary conditions (NC) for a transfer to be optimal. In particular if the NCs are not fulfilled, it is energetically favourable to add a DSM, see Conway (2010). For a linear system, Prussing (1995) proved that these conditions are also sufficient.

Later studies such as Lion and Handelsman (1968), Jezewski and Rozendaal (1968), Vinh (1972) and Jezewski (1975) extensively showed where and how to perturb the trajectory for the addition of the correction (with gradient based optimization methods). At present, the primer vector theory is mainly exploited in global optimization scenario as a first estimation of the position of the DSM and therefore it allows mission planners to reduce the size of the search space (Olympio et al. 2006). However, due to its computational complexity, its usage is narrowed also to simple three-body problem cases as in Luo et al. (2010).

The main disadvantage of Lawden’s theory is that it is based on solving a boundary value problem (BVP) that involves the state transition matrix (STM) of the motion. Currently, there are no known analytic solutions to the problem and, subsequently, no insights into what determines whether a given transfer will be optimal or not. In papers like Goodyear (1965), a formal solution to the differential equations used to derive the STM is provided. Even if the method presented in Goodyear (1965) is valid for every type of Keplerian transfer, it does not give a complete understanding of the structure of the problem. In fact, it is defined in a general inertial frame and therefore no simplifications in the equations (36 terms expressed as sums and multiplications of series) are possible.

This paper presents a novel approach to the theory which simplifies the structure of the problem. In particular, the case where departure, arrival and transfer orbits are co-planar is analyzed.

2 Primer vector theory procedure

The equations of motion can be written as

$$\begin{aligned} \dot{\mathbf {x}}(t)=\begin{bmatrix} \dot{\mathbf {r}}(t)\\ \dot{\mathbf {v}}(t)\\ \end{bmatrix}=\begin{bmatrix} \mathbf {v}(t)\\ \mathbf {g}[\mathbf {r}(t)]+\varGamma \mathbf {u}(t) \end{bmatrix} \end{aligned}$$
(1)

where \(\mathbf {x}(t)\) is the state, \(\mathbf {r}(t)\) is the position vector, \(\mathbf {v}(t)\) is the velocity vector, \(\mathbf {g}(\mathbf {r})\) is the gravitational acceleration, \(\mathbf {u}(t)\) represents the unit vector in the direction of the thrust and \(\varGamma \) is the impulsive thrust acceleration (Conway 2010).

As described in Bryson and Ho (1975), there exists a Hamiltonian that defines the system given in Eq. 1:

$$\begin{aligned} H = \varGamma + {\varvec{\lambda }_{r}}^{T}\mathbf {v}(t) + {\varvec{\lambda }_{v}}^{T}[\mathbf {g}(\mathbf {r})+\varGamma \mathbf {u}], \end{aligned}$$
(2)

where \(\varvec{\lambda }_{r/v}\) are the co-state variables associated, respectively, to the position and the velocity of Eq. 1. In a classic optimization problem, the cost function can be defined as the total propellant consumption. For an impulsive thrust case, the cost is simply the sum of the magnitudes of the N impulses fired along the transfer arc (Conway 2010)

$$\begin{aligned} J = \sum _{i=1}^{N} \varDelta V_{i}. \end{aligned}$$
(3)

The adjoint equations referred to Eq. 2 which minimize the cost are

$$\begin{aligned} \dot{\varvec{\lambda }_{r}}^{\mathrm{T}}= & {} -\frac{\partial H}{\partial \mathbf {r}}=-\varvec{\lambda }_{v}\mathbf {G}(\mathbf {r})\end{aligned}$$
(4)
$$\begin{aligned} \dot{\varvec{\lambda }_{v}}^{\mathrm{T}}= & {} -\frac{\partial H}{\partial \mathbf {v}}=-{\varvec{\lambda }_{r}}^{\mathrm{T}} \end{aligned}$$
(5)

where \(\mathbf {G}(\displaystyle \mathbf {r}) \triangleq \frac{\partial \mathbf {g}(\mathbf {r})}{\partial \mathbf {r}}\) is the \(3\times 3\) gravity gradient matrix.

The primer vector \(\mathbf {p}(t)\) and its derivative \(\dot{\mathbf {p}}(t)\) have been defined in Lawden (1963) as

$$\begin{aligned} \mathbf {p}(t)&\triangleq - \varvec{\lambda }_{v}(t)\end{aligned}$$
(6)
$$\begin{aligned} \dot{\mathbf {p}}(t)&=\varvec{\lambda }_{r}(t) . \end{aligned}$$
(7)

Equations 4, 5, 6 and 7 combined together, give the ‘primer vector equation’:

$$\begin{aligned} \ddot{\mathbf {p}}=\mathbf {G}(\mathbf {r})\mathbf {p}. \ \end{aligned}$$
(8)

Furthermore, Lawden characterized the theory with a set of NC for the impulsive thrust case, when the cost is simply expressed as a sum of the magnitudes of \(\varDelta \mathbf {V}\)s (as in Eq. 3). The satisfaction of the NCs defines the optimality of a trajectory [extensive explanation in Jezewski (1975)]:

  1. 1.

    The primer vector and its derivative are continuous everywhere.

  2. 2.

    The magnitude of the primer vector is always below 1, aside from the instants where the impulse occurs, where \(p=1\).

  3. 3.

    At the impulse times, the primer vector is a unit vector in the direction of the thrust.

  4. 4.

    As a consequence of the above conditions, \(dp/dt=\dot{p}=\dot{\mathbf {p}}^{T}\mathbf {p} =0\) at an intermediate impulse.

Equation 7 needs to be integrated to obtain the primer vector profile along the transfer orbit and then to verify if the NC are satisfied. Its solution can be expressed in the form of

$$\begin{aligned} \begin{bmatrix} \mathbf {p}(t) \\ \dot{\mathbf {p}}(t) \end{bmatrix} = \varvec{\varPhi }(t, t_{0}) \begin{bmatrix} \mathbf {p}(t_{0}) \\ \dot{\mathbf {p}}(t_{0}) \end{bmatrix} \end{aligned}$$
(9)

where \(\varvec{\varPhi }\) is the STM associated to the state (Jezewski 1975). The third NC gives the BC on the primer vector, defining it as unit vector in the direction of the thrust:

$$\begin{aligned} \begin{array}{cc} \mathbf {p}_{0} \equiv \mathbf {p}(t_{0}) = \displaystyle \frac{\varDelta \mathbf {V}_{0}}{\Vert \varDelta \mathbf {V}_{0} \Vert } \\ \mathbf {p}_{f} \equiv \mathbf {p}(t_{f}) = \displaystyle \frac{\varDelta \mathbf {V}_{f}}{\Vert \varDelta \mathbf {V}_{f} \Vert } \end{array}. \end{aligned}$$
(10)

However, both initial conditions on \(\mathbf {p}(t)\) and \(\dot{\mathbf {p}}(t)\) are required for the propagation of Eq. 9. \(\mathbf {p}_{0}\) is known from Eq. 10 and the initial condition of \(\dot{\mathbf {p}}\) comes inverting Eq. 9 as:

$$\begin{aligned} \dot{\mathbf {p}}(t_{0})= {\varvec{\varPhi }_{12}}^{-1}(t_{f},t_{0}) \cdot [ \mathbf {p}(t_{f})- \varvec{\varPhi }_{11}(t_{f},t_{0}) \cdot \mathbf {p}(t_{0})] \end{aligned}$$
(11)

where \({\varvec{\varPhi }_{11}}\) and \({\varvec{\varPhi }_{12}}\) represent partition matrices of \(\varvec{\varPhi }\), and they are the only unknowns of Eq. 11. As a consequence, this procedure implies solving a BVP where firstly the STM is integrated along the arc to solve Eq. 11 for \(\dot{\mathbf {p}}(t_{0})\) and, afterwards, Eq. 9 is propagated to verify Lawden’s NCs.

3 Polar coordinates transformation

This paper presents a novel procedure framed in a local polar coordinate system defined by \(\hat{\mathbf {e}}_{r}, \hat{\mathbf {e}}_{\vartheta }\) and \(\hat{\mathbf {e}}_{h}\). They are, respectively, unit vectors in the direction of the local position vector, of the derivative of \(\hat{\mathbf {e}}_{r}\) with respect to the true anomaly (\(\nu \)) and of the orbital momentum vector (\(\mathbf {h}\)):

$$\begin{aligned} \hat{\textstyle \mathbf {e}}_{r}=\frac{{\textstyle \mathbf {r}}}{\Vert {\textstyle \mathbf {r}}\Vert };\ \ \ \ \hat{\textstyle \mathbf {e}}_{\vartheta }= -\hat{\textstyle \mathbf {e}}_{r} \times \hat{\textstyle \mathbf {e}}_{h};\ \ \ \ \hat{\textstyle \mathbf {e}}_{h}=\frac{\textstyle \mathbf {h}}{\Vert {\textstyle \mathbf {h}}\Vert } . \end{aligned}$$
(12)

It can be demonstrated that this coordinate system is an eigenvector basis of the gravity gradient matrix \(\mathbf {G}\) (Eq. 8), defined to first order approximation as

$$\begin{aligned} \mathbf {G} = -\frac{\mu }{r^3}\mathbf {I}_{3}+\frac{3\mu }{r^5}\mathbf {r}\cdot \mathbf {r}^{T}. \end{aligned}$$
(13)

The corresponding eigenvalues of \(\mathbf {G}\), with respect to the unit vectors of Eq. 12, are \( 2\mu /r^{3} \), of algebraic multiplicity 1, and \( - \mu /r^{3} \), of algebraic multiplicity 2.

Combining together Eq. 8 and the eigen-decomposition of \(\mathbf {G}\), the primer vector equation can be expressed in polar form as

$$\begin{aligned} \ddot{\mathbf {p}}=\mathbf {G}\mathbf {p}=\frac{\mu }{r^{3}}(2p_{r}\hat{\mathbf {e}}_{r}-p_{\vartheta }\hat{\mathbf {e}}_{\vartheta }-p_{h}\hat{\mathbf {e}}_{h}). \end{aligned}$$
(14)

Finally the first derivative of the primer vector with respect to time in polar coordinates can be derived:

$$\begin{aligned} \frac{d\mathbf {p}}{dt} = \underbrace{\left( \dot{p}_{r}-\frac{h}{r^{2}}p_{\vartheta }\right) \hat{\mathbf {e}}_{r}\ +\ \left( \dot{p}_{\vartheta }+\frac{h}{r^{2}}p_{r}\right) \hat{\mathbf {e}}_{\vartheta }}_{{} \text {IN PLANE} } \quad + \underbrace{\dot{p}_{h}\hat{\mathbf {e}}_{h}}_{{} \text {OUT OF PLANE}} . \end{aligned}$$
(15)

From Eq. 15, it is clear that the out-of-plane component is decoupled from the in-plane components.

The analysis related to the out-of-plane component is treated in a complementary paper from the authors, now in press.

3.1 In-plane components

Concerning the in-plane part of Eq. 15, \(p_{r}\) and \(p_{\vartheta }\) do not have formal analytic solutions. Even so, it is possible to find an analytic approximation for which the time-variant variable is considered to be constant for a small integration step size.

From Eq. 15, two new variables can be defined:

$$\begin{aligned} \left\{ \begin{array}{ll} P_{r} \triangleq \dot{p}_{r}-\displaystyle \frac{h}{r^{2}}p_{\vartheta } \\ P_{\vartheta } \triangleq \dot{p}_{\vartheta }+\displaystyle \frac{h}{r^{2}}p_{r} \end{array} \right. . \end{aligned}$$
(16)

\(P_r\) and \(P_{\vartheta }\) are the components of the first derivative of the primer vector with respect to time in the local orbital reference frame. Therefore, the in-plane part of Eq. 15 becomes

$$\begin{aligned} \dot{\mathbf {p}}=P_{r}\hat{\mathbf {e}}_{r}+P_{\vartheta }\hat{\mathbf {e}}_{\vartheta } . \end{aligned}$$
(17)

Taking into account the derivatives with respect to time of the eigenvectors:

$$\begin{aligned} \left\{ \begin{array}{ll} \dot{\hat{\mathbf {e}}}_{r} = \dot{\nu } \hat{\mathbf {e}}_{\vartheta }\\ \dot{\hat{\mathbf {e}}}_{\vartheta } = -\dot{\nu } \hat{\mathbf {e}}_{r}\\ \dot{\hat{\mathbf {e}}}_{h} = 0\\ \end{array} \right. , \end{aligned}$$
(18)

the second derivative of the primer vector can be easily derived as

$$\begin{aligned} \ddot{\mathbf {p}}= \bigl ( \dot{P}_{r} - \frac{h}{r^2} P_{\vartheta }\bigr )\hat{\mathbf {e}}_{r} + \bigl ( \dot{P}_{\vartheta }+\frac{h}{r^2}P_{r}\bigr )\hat{\mathbf {e}}_{\vartheta }. \end{aligned}$$
(19)

The primer vector equation in polar form (Eq. 14) combined with Eq. 16 and Eq. 19 gives the derivatives of \(P_r\) and \(P_{\vartheta }\)

$$\begin{aligned} \left\{ \begin{array}{ll} \dot{P}_{r}=\displaystyle \frac{2\mu }{r^{3}}p_{r}+\displaystyle \frac{h}{r^{2}}P_{\vartheta } \\ \dot{P}_{\vartheta }=-\displaystyle \frac{\mu }{r^{3}}p_{\vartheta }-\displaystyle \frac{h}{r^{2}}P_{r} \end{array} \right. . \end{aligned}$$
(20)

To simplify the structure of the equations, two new variables can be defined:

$$\begin{aligned} \begin{array}{ll} \sigma &{}\triangleq \displaystyle \frac{r}{l} \equiv \frac{1}{1+e\cos \nu } \\ \omega &{}\triangleq \displaystyle \frac{h}{r^2} = \frac{h}{(l\sigma )^{2}} \end{array} , \end{aligned}$$
(21)

which satisfy also the following equivalence

$$\begin{aligned} \sigma \cdot \omega ^{2} = \frac{r}{l}\frac{h^{2}}{r^{4}} = \frac{\mu }{r^{3}} \frac{r}{r} \frac{l}{l} = \frac{\mu }{r^{3}} \end{aligned}$$
(22)

where \({l=a(1-e^2)}\) is the semi-latus rectum of the orbit (Battin 1999).

The new variables, linked together, give a new first order ODE set in \([p_r,p_{\vartheta },P_{r}, P_{\vartheta }]\):

$$\begin{aligned} \begin{bmatrix} \dot{p}_{r} \\ \dot{p}_{\vartheta } \\ \dot{P}_{r} \\ \dot{P}_{\vartheta } \end{bmatrix} = \begin{bmatrix} 0&\quad \omega&\quad 1&\quad 0 \\ -\omega&\quad 0&\quad 0&\quad 1 \\ 2\sigma \omega ^{2}&\quad 0&\quad 0&\quad \omega \\ 0&\quad -\sigma \omega ^{2}&\quad -\omega&\quad 0 \end{bmatrix} \begin{bmatrix} p_{r} \\ p_{\vartheta } \\ P_{r} \\ P_{\vartheta } \end{bmatrix}. \end{aligned}$$
(23)

Equation 23 can be represented in compact matrix format as

$$\begin{aligned} \dot{\mathbf {p}}_{\mathrm{polar}}(t)=\mathbf {M}(t)\mathbf {p}_{\mathrm{polar}}(t). \end{aligned}$$
(24)

3.2 Palmer coordinates conversion and integration scheme

The system of Eq. 23 can be written in terms of the Hamiltonian:

$$\begin{aligned} H_{\mathrm{PV}}=\frac{1}{2}({P_{r}}^{2}+{P_{\vartheta }}^{2})+\omega (p_{\vartheta }P_{r}-p_{r}P_{\vartheta })-\frac{1}{2}\sigma \omega ^{2}(2{p_{r}}^{2}-{p_{\vartheta }}^{2}). \end{aligned}$$
(25)

If \(H_{\mathrm{PV}}\) is compared to the one of Hill’s problem, \(H_{\mathrm{Hill}}\) of Quinn et al. (2010), there is a noticeable similarity:

$$\begin{aligned} H_{\mathrm{Hill}}=\frac{1}{2}({p_{x}}^{2}+{p_{y}}^{2})+\varOmega (yp_{x}-xp_{y})-\frac{1}{2}\varOmega ^{2}(2{x}^{2}-y^{2}). \end{aligned}$$
(26)

The substantial difference between the two Hamiltonian forms is that, while in \(H_{\mathrm{Hill}}\) the coefficient \(\varOmega \) is constant during the motion, in \(H_{\mathrm{PV}}\) both \(\sigma \) and \(\omega \) vary in time. However, for the circular orbit case (Hill’s problem), \(\sigma =1\) and \(\omega =\varOmega \), which implies a one-to-one correspondence between the two Hamiltonians. This fact validates a fundamental result: for a circular orbit transfer there is an analytic solution to the planar primer vector problem, which is exactly the Hill’s solution.

For Hill’s problem, the motion of the in-plane state vector of Eq. 26, \(\mathbf {z}=(\varOmega x, \dot{x}, \varOmega y, \dot{y})^{\mathrm{T}}\), can be separated into two parts: an oscillation and a linear drift. This result is presented in Palmer (2007), where the transformation of coordinates \(\tilde{\mathbf {p}} = \mathcal {P}\mathbf {z}=(p_{1}, p_{2}, p_{3}, p_{4})^{T}\) on \(\mathbf {z}\) allows this separation to be explicit (as a note, the original notation of Palmer (2007) has been changed to avoid confusion).

The combination of Eqs. 16, 25 and 26 results in the relationship between the state vector \(\mathbf {z}\) and \(\mathbf {p}_{\mathrm{polar}}\):

$$\begin{aligned} \left\{ \begin{array}{ll} x=p_{r} \\ y=p_{\vartheta } \end{array} \right. \text { ; } \left\{ \begin{array}{ll} p_{x} = P_{r} = \dot{x} - \varOmega y\\ p_{y} = P_{\vartheta } = \dot{y} + \varOmega x \end{array} \right. \Rightarrow \left\{ \begin{array}{ll} \dot{x} = p_{x} + \varOmega y = P_{r} + \omega p_{\vartheta } \\ \dot{y} = p_{y} - \varOmega x = P_{\vartheta } - \omega p_{r} \end{array} \right. \end{aligned}$$
(27)

where the frequency associated with \(\mathbf {z}\) is \(\varOmega \), while the one of \(\mathbf {p}_{\mathrm{polar}}\) is \(\omega \).

If the transformation of coordinates presented in Palmer (2007) is applied to the primer vector problem (Eq. 27), it becomes

$$\begin{aligned} \left[ \begin{array}{c} p_{1} \\ p_{2} \\ p_{3} \\ p_{4} \end{array}\right] = \left[ \begin{array}{c@{\quad }c@{\quad }c@{\quad }c} -3 &{} 0 &{} 0 &{} -2 \\ 0 &{} 1 &{} 0 &{} 0 \\ 0 &{} 2 &{} -1 &{} 0 \\ 2 &{} 0 &{} 0 &{} 1 \end{array}\right] \begin{bmatrix} \omega p_{r} \\ P_{r} + \omega p_{\vartheta } \\ \omega p_{\vartheta } \\ P_{\vartheta } - \omega p_{r} \end{bmatrix}. \end{aligned}$$
(28)

3.2.1 Integration of the Palmer coordinates

The coordinates \(\tilde{\mathbf {p}}\) (also called Palmer coordinates) can be expressed as functions of (\(p_{r},p_{\vartheta },P_{r},P_{\vartheta }\)) as shown in Eq. 28, where two couplings are evident: one between \(p_{1}\) and \(p_{4}\) and the other between \(p_{2}\) and \(p_{3}\).

If \(\tilde{\mathbf {p}}\) is first combined with the system of Eq. 23 and then differentiated with respect to time (while \(\sigma \), and therefore \(\omega \), are kept constant for small intervals of time), it yields

$$\begin{aligned} \begin{bmatrix} \dot{p}_{1} \\ \dot{p}_{2} \\ \dot{p}_{3} \\ \dot{p}_{4} \end{bmatrix} = \omega \begin{bmatrix} 0&\quad 4\sigma -3&\quad -2(\sigma -1)&\quad 0 \\ 2\sigma - 3&\quad 0&\quad 0&\quad 4(\sigma -1) \\ 4(\sigma -1)&\quad 0&\quad 0&\quad 8\sigma -5 \\ 0&\quad -2(\sigma - 1)&\quad \sigma -1&\quad 0 \end{bmatrix} \begin{bmatrix} p_{1} \\ p_{2} \\ p_{3} \\ p_{4} \end{bmatrix}. \end{aligned}$$
(29)

From this system of equations, it is then possible to differentiate even further to a second order system as

$$\begin{aligned} \begin{bmatrix} \ddot{p}_{1} \\ \ddot{p}_{2} \\ \ddot{p}_{3} \\ \ddot{p}_{4} \end{bmatrix} = \omega ^{2} \begin{bmatrix} -(2\sigma -1)&\quad 0&\quad 0&\quad -2(\sigma -1) \\ 0&\quad -(2\sigma -1)&\quad 2(\sigma -1)&\quad 0 \\ 0&\quad -2(\sigma -1)&\quad 3(\sigma -1)&\quad 0 \\ 2(\sigma -1)&\quad 0&\quad 0&\quad 3(\sigma -1) \end{bmatrix} \begin{bmatrix} p_{1} \\ p_{2} \\ p_{3} \\ p_{4} \end{bmatrix}. \end{aligned}$$
(30)

In Eq. 30, the variables are coupled again (\(p_{1}\) with \(p_{4}\) and \(p_{2}\) with \(p_{3}\)).

The structure of the problem allows to have a 4th order ODE in \(p_1\) in the form of

$$\begin{aligned} {p_{1}}^{\mathrm{IV}}-\omega ^{2}(\sigma -2){{p}_{1}}^{\mathrm{II}}-\omega ^{4}(\sigma -1)(2\sigma +1)=0. \end{aligned}$$
(31)

Assuming that the coefficients of Eq. 31 are constant for a small interval of time, the solution will be \(p_{1} \approx e^{ \omega \lambda t}\).

The characteristic equation in \(\lambda \) of Eq. 31 is

$$\begin{aligned} \lambda ^{4}-\omega ^{2}(\sigma -2)\lambda ^{2}-\omega ^{4}(\sigma -1)(2\sigma +1)=0, \end{aligned}$$
(32)

whose solutions are

$$\begin{aligned} \begin{array}{ll} \lambda _{1}^{\pm }&{}=\displaystyle \frac{\sqrt{2}}{2\omega }\sqrt{(\sigma -2)\pm \sqrt{9\sigma ^{2}- 8\sigma }} \\ \lambda _{2}^{\pm }&{}=-\displaystyle \frac{\sqrt{2}}{2\omega }\sqrt{(\sigma -2)\pm \sqrt{9\sigma ^{2}- 8\sigma }} \end{array}. \end{aligned}$$
(33)

The nature of \({\lambda _{1,2}}^{\pm }\) depends upon the value of \(\sigma \). If \(\varDelta _{\sigma } \triangleq 9\sigma ^{2}-8\sigma \), there are three cases (the interval \(\sigma <0\) is not taken into consideration because it does not have any physical meaning):

  • \(\varvec{\varDelta _{\sigma }>0}\): \(\sigma >8/9.\,{\lambda _{1,2}}^{\pm }\) can be either real (if \({{\mathrm{sign}}}\bigl ({(\sigma -2) \pm \varDelta _{\sigma } } \bigr )> 0\)) or imaginary (if \({{\mathrm{sign}}}\bigl ({(\sigma -2) \pm \varDelta _{\sigma } }\bigr ) < 0\));

  • \(\varvec{\varDelta _{\sigma } = 0}\): \(\sigma \) can be either 0 or 8 / 9 and \({\lambda _{1,2}}^{\pm }\) are always imaginary;

  • \(\varvec{\varDelta _{\sigma } <0}\): \(0<\sigma <8/9\) and \({\lambda _{1,2}}^{\pm }\) are generally complex.

In the case of an elliptic orbit the nature of the eigenvalues can be summarized, as in Fig. 1, into three distinguished areas:

Fig. 1
figure 1

Elliptical transfer orbit divided into three regions (I, II, III) according to the nature of the eigenvalues \({\lambda _{1,2}}^{\pm }\)

  1. Area (I)

    \(0<\sigma <8/9\): corresponds to the case of \(\varDelta _{\sigma } <0 \), and includes the periapsis of the transfer orbit, which is the point where \(\sigma \) has its minimum value, \(\sigma _{\mathrm{min}}=(1+e)^{-1}\). In this interval the eigenvalues are complex.

  2. Area (II)

    \(8/9\le \sigma <1\): corresponds to the case of \(\varDelta _{\sigma }>0\) with \({{\mathrm{sign}}}\bigl ((\sigma -2) \pm \varDelta _{\sigma } \bigr ) < 0\), and the eigenvalues are imaginary. For \(\sigma =1\), that is \(r=l, {\lambda _{1,2}}^{+}=0\) while \({\lambda _{1,2}}^{-}\) are imaginary.

  3. Area (III)

    \(\sigma >1\): corresponds to the case of \(\varDelta _{\sigma }>0; {\lambda _{1,2}}^{+}\) are real, while \({\lambda _{1,2}}^{-}\) are imaginary. At the apoapsis (included in this area) \(\sigma \) reaches its maximum value \(\sigma _{\mathrm{max}}=(1-e)^{-1}\).

For values of the eccentricity smaller than 1/8, \(\varDelta \ge 0\) and \({\sigma \ge 8/9}\), therefore there is no Area (I). In other words, the eigenvalues are either purely imaginary or real (Area (II) and Area (III)).

After the aforementioned considerations, the complete solution of Eq. 31 can be derived, and it simply is

$$\begin{aligned} p_{1}=A e^{\omega \lambda _{1}^{+}t}+B e^{\omega \lambda _{2}^{+}t}+Ce^{\omega \lambda _{1}^{-}t}+De^{\omega \lambda _{2}^{-}t} \end{aligned}$$
(34)

where \(\mathbf {A}=(A, B, C, D)^{\mathrm{T}}\) is a vector of coefficients given by the initial conditions of \(p_{1}\).

The other components of \(\tilde{\mathbf {p}}\) can be evaluated as for \(p_{1}\), resulting in a formal matrix expression

$$\begin{aligned} \tilde{\mathbf {p}}={\varvec{\varSigma }}(\sigma )^{-1}\tilde{\mathbf {E}}(t,\sigma )^{-1}\mathbf {A}. \end{aligned}$$
(35)

\(\tilde{\mathbf {E}}(t,\sigma )\) is a diagonal matrix with independent exponential functions as elements of the diagonal, functions of time t and \(\omega (\sigma )\):

$$\begin{aligned} \tilde{\mathbf {E}} = \begin{bmatrix} e^{\omega \lambda _{2}^{+}t}&\quad 0&\quad 0&\quad 0 \\ 0&\quad e^{\omega \lambda _{1}^{+}t}&\quad 0&\quad 0 \\ 0&\quad 0&\quad e^{\omega \lambda _{2}^{-}t}&\quad 0 \\ 0&\quad 0&\quad 0&\quad e^{\omega \lambda _{1}^{-}t} \end{bmatrix}, \end{aligned}$$
(36)

while \(\varvec{\varSigma }(\sigma )\) is a polynomial matrix in \(\sigma \), written as

$$\begin{aligned} \varvec{\varSigma } = \frac{(\sigma -1)}{(2\sigma +1)\sqrt{\varDelta _{\sigma }}} \left[ \begin{array}{c@{\quad }c@{\quad }c@{\quad }c} (2\sigma +1)\xi ^{-} &{} -\lambda _{1}^{+}\gamma ^{-} &{} \lambda _{1}^{+}\delta ^{-} &{} -(2\sigma +1) \\ (2\sigma +1)\xi ^{-} &{} \lambda _{1}^{+}\gamma ^{-} &{} -\lambda _{1}^{+}\delta ^{-} &{} -(2\sigma +1) \\ -(2\sigma +1)\xi ^{+} &{} \lambda _{1}^{-}\gamma ^{+} &{} -\lambda _{1}^{-}\delta ^{+} &{} (2\sigma +1) \\ -(2\sigma +1)\xi ^{+} &{} -\lambda _{1}^{-}\gamma ^{+} &{} \lambda _{1}^{-}\delta ^{+} &{} (2\sigma +1) \end{array}\right] . \end{aligned}$$
(37)

The parameters \(\xi ^{\pm }, \gamma ^{\pm }\) and \(\delta ^{\pm }\) are also only functions of \(\sigma \) and they are defined as

$$\begin{aligned} \left\{ \begin{array}{lc} \delta ^{\pm }\triangleq [ (-3\sigma +1) \mp \sqrt{\varDelta _{\sigma }} ] \\ \gamma ^{\pm }\triangleq \displaystyle \frac{1}{4(1-\sigma )} [ (24\sigma ^{2}-25\sigma +4) \pm (8\sigma -5)\sqrt{\varDelta _{\sigma }} ]\\ \xi ^{\pm }\triangleq \displaystyle \frac{(5\sigma -4)\pm \sqrt{\varDelta _{\sigma }})}{4(1-\sigma )} \end{array} \right. . \end{aligned}$$
(38)

3.2.2 Conversion from time to true anomaly

The propagation process presented so far, is based on time as the integration variable. In orbital dynamics it is sometimes more convenient to use true anomaly rather than time; therefore, the variable conversion is

$$\begin{aligned} \frac{d\nu }{dt}=\omega \Rightarrow \frac{d\nu }{\omega }=dt \Rightarrow \frac{\nu }{\omega }= t. \end{aligned}$$
(39)

Since both \(\sigma \) and \(\omega \) have been considered constant along an integration step, the conversion from time to true anomaly does not affect the solutions of Eq. 35, a part from a factor \(\omega \).

The ODE of Eq. 29 can be converted into true anomaly as

$$\begin{aligned} {\tilde{\mathbf {p}}'} = \begin{bmatrix} 0&\quad 4\sigma -3&\quad -2(\sigma -1)&\quad 0 \\ 2\sigma - 3&\quad 0&\quad 0&\quad 4(\sigma -1) \\ 4(\sigma -1)&\quad 0&\quad 0&\quad 8\sigma -5 \\ 0&\quad -2(\sigma - 1)&\quad \sigma -1&\quad 0 \end{bmatrix} \tilde{\mathbf {p}} , \end{aligned}$$
(40)

where the symbol ( \('\) ) represents the derivative with respect to \(\nu \), and in compact form it is equivalent to

$$\begin{aligned} {\tilde{\mathbf {p}}'}=\varvec{\varLambda }\tilde{\mathbf {p}}. \end{aligned}$$
(41)

This implies that the solution shown in Eq. 35 remains in the same form:

$$\begin{aligned} \mathbf {A}={\mathbf {E}}(\nu ,\sigma ){\varvec{\varSigma }}(\sigma )\tilde{\mathbf {p}}. \end{aligned}$$
(42)

The only difference appears in \(\mathbf {E}\), as

$$\begin{aligned} \mathbf {E}(\nu ,\sigma )= \begin{bmatrix} e^{{\lambda }_{2}^{+}\nu }&\quad 0&\quad 0&\quad 0 \\ 0&\quad e^{{\lambda }_{1}^{+}\nu }&\quad 0&\quad 0 \\ 0&\quad 0&\quad e^{{\lambda }_{2}^{-}\nu }&\quad 0 \\ 0&\quad 0&\quad 0&\quad e^{{\lambda }_{1}^{-}\nu } \end{bmatrix} \end{aligned}$$
(43)

where the exponents are explicit functions in \(\nu \). Through the transformation from time to true anomaly, the importance of the Palmer coordinates is even more clear. In fact, using \(\tilde{\mathbf {p}}\) allows a solution which is a linear combination of the eigenvalues of the matrix \(\mathbf {M}\) (Eq.  24). This solution depends only on \(\sigma \) through the polynomial matrix \(\varvec{\varSigma }\) and the diagonal matrix \({\mathbf {E}}\), with an explicit propagation variable \(\nu \).

3.2.3 Matrices propagation scheme

Since Eq. 42 assumes that \(\sigma \) is constant for small interval of \(\nu \), it is required to apply a ‘double’ propagation for the Palmer coordinates. This means that every time that the true anomaly is propagated to the next step, the corresponding value of \(\sigma \) must be updated, as shown in the scheme of Fig.  2.

Fig. 2
figure 2

Step Propagation scheme of \(\nu \) and \(\sigma \), with \(\sigma = \text {const}\) along every interval of \(\nu \). A general kth state is highlighted with the two limiting values (\(k^{\pm }\)) in red

At the initial time the coefficients depend upon the initial value of the Palmer coordinates as

$$\begin{aligned} \mathbf {A}_{0}=\mathbf {E}(\nu _{0},\sigma _{0})\varvec{\varSigma }(\sigma _{0})\tilde{\mathbf {p}}_{0} . \end{aligned}$$
(44)

Knowing \(\mathbf {A}_{0}, \tilde{\mathbf {p}}\) can be propagated to the step ‘1’, but only just before the update of \(\sigma \). In fact the value of \(\tilde{\mathbf {p}}\) after the update will be different. To have a formal expression, this condition can be considered at a generic kth propagation state (see Fig.  2), where the values of the Palmer coordinates just before (\(\tilde{\mathbf {p}}_{k^{-}}\)) and after (\(\tilde{\mathbf {p}}_{k^{+}}\)) the variation of \(\sigma \) are

$$\begin{aligned} \tilde{\mathbf {p}}_{k^{-}}= & {} \varvec{\varSigma }^{-1}(\sigma _{k-1})\mathbf {E}^{-1}(\nu _{k},\sigma _{k-1})\mathbf {A}_{k-1}\end{aligned}$$
(45)
$$\begin{aligned} \tilde{\mathbf {p}}_{k^{+}}= & {} \varvec{\varSigma }^{-1}(\sigma _{k})\mathbf {E}^{-1}(\nu _{k},\sigma _{k})\mathbf {A}_{k} \end{aligned}$$
(46)

In the limit towards \(k, \tilde{\mathbf {p}}_{k^{\pm }}\) are not the same because they depend on two different \(\sigma \)s. However, for the primer vector the continuity must be guaranteed along the whole transfer arc. For this reason, a simple propagation scheme (Fig.  2) has to be used, instead of a more complex one (e.g. leapfrog method). In fact, it guarantees that a ‘continuity check’ on the primer vector is executed at every update of \(\sigma \).

From Eq. 28 a relationship between \(\mathbf {p}_{polar}\) and \(\tilde{\mathbf {p}}\) can be derived

$$\begin{aligned} \tilde{\mathbf {p}}=\left[ \begin{array}{c@{\quad }c@{\quad }c@{\quad }c} -\omega &{} 0 &{} 0 &{} -2 \\ 0 &{} \omega &{} 1 &{} 0 \\ 0 &{} \omega &{} 2 &{} 0 \\ \omega &{} 0 &{} 0 &{} 1 \end{array}\right] \mathbf {p}_{polar} \equiv \mathbf {K}(\omega )\mathbf {p}_{polar}. \end{aligned}$$
(47)

The matrix \(\mathbf {K}(\omega )\), depending on \(\omega \), will be different before and after the update. Subsequently, the relations between the Palmer coordinate and the primer vector at a generic kth step are

$$\begin{aligned} {\mathbf {p}_{polar}}_{k}= & {} \mathbf {K}^{-1}(\omega _{k-1}) \tilde{\mathbf {p}}_{k^{-}}\end{aligned}$$
(48)
$$\begin{aligned} \tilde{\mathbf {p}}_{k^{+}}= & {} \mathbf {K}(\omega _{k}){\mathbf {p}_{polar}}_{k} . \end{aligned}$$
(49)

Combining Eqs. 48 and 49 gives the matrix which guarantees the continuity of the primer vector, defined as \(\varvec{\mathcal {I}}_{\omega _{k}}\):

$$\begin{aligned} \tilde{\mathbf {p}}_{k^{+}}=\mathbf {K}(\omega _{k})\mathbf {K}^{-1}(\omega _{k-1})\tilde{\mathbf {p}}_{k^{-}}\triangleq \varvec{\mathcal {I}}_{\omega _{k}}\tilde{\mathbf {p}}_{k^{-}}. \end{aligned}$$
(50)

The procedure for the propagation of the Palmer coordinates from its initial conditions \(\tilde{\mathbf {p}}_{0}\) to a generic kth state can be then formalized as

$$\begin{aligned} \tilde{\mathbf {p}}_{k}=\Bigl ( \prod _{i=0}^{k-1}\varvec{\mathcal {I}}_{\omega _{i}}\varvec{\varSigma }^{-1}(\sigma _{i})\mathbf {E}^{-1}(\nu _{i+1},\sigma _{i})\mathbf {E}(\nu _{i},\sigma _{i}) \varvec{\varSigma }(\sigma _{i})\Bigr )\tilde{\mathbf {p}}_{0} . \end{aligned}$$
(51)

In summary, Eq. 51 can be written in matrix form as

$$\begin{aligned} \tilde{\mathbf {p}}_{k} = \varvec{\varPsi }(\nu _{k},\nu _{0}) \tilde{\mathbf {p}}_{0}. \end{aligned}$$
(52)

where \(\varvec{\varPsi }(\nu _{k},\nu _{0})\) represents the propagation matrix of the Palmer coordinates.

The final step of the scheme is to convert the Palmer coordinates back into primer vector at the BC. Combining the definitions of \(P_{r}\) and \(P_{\vartheta }\) (Eq. 16) and Eq.  28, there is (where \(\mathbf {p}_{tot}\) clearly refers to the the primer vector and its derivative with respect to time):

$$\begin{aligned} \tilde{\mathbf {p}} = \begin{bmatrix} -3\omega&\quad 0&\quad 0&\quad -2 \\ 0&\quad 0&\quad 1&\quad 0 \\ 0&\quad -\omega&\quad 2&\quad 0 \\ 2\omega&\quad 0&\quad 0&\quad 1 \end{bmatrix} \begin{bmatrix} p_{r} \\ p_{\vartheta } \\ \dot{p}_{r} \\ \dot{p}_{\vartheta } \end{bmatrix} \equiv \mathbf {B}(\omega ) \mathbf {p}_{tot}. \end{aligned}$$
(53)

Equations 52 and 53 together yield to

$$\begin{aligned} {\mathbf {p}_{tot}}_{f} = \mathbf {B}^{-1}(\omega _{f})\varvec{\varPsi }(\nu _{f},\nu _{0})\mathbf {B}(\omega _{0}){\mathbf {p}_{tot}}_{0}\triangleq \tilde{\varvec{\varPhi }}_{(f,0)}{\mathbf {p}_{tot}}_{0} \end{aligned}$$
(54)

where \(\tilde{\varvec{\varPhi }}_{(f,0)}\) has been defined as the total propagation matrix of the primer vector.

Equation 54 summarizes the whole propagation scheme for the primer vector.

3.2.4 Independence from the semi-major axis of the transfer orbit

The value of the ‘continuity’ matrix, \(\varvec{\mathcal {I}}_{\omega }\), defined in Eq.  50, is actually independent from the explicit value of \(\omega \) (and therefore from a) and it is equal to

$$\begin{aligned} \varvec{\mathcal {I}}_{\omega _{k}} = \begin{bmatrix} (\omega _{k}/\omega _{k-1}+2)&\quad 0&\quad 0&\quad (-2\omega _{k}/\omega _{k-1} + 2) \\ 0&\quad (2\omega _{k}/\omega _{k-1} -1)&\quad (-\omega _{k}/\omega _{k-1}+1)&\quad 0 \\ 0&\quad (2\omega _{k}/\omega _{k-1}-2)&\quad (-\omega _{k}/\omega _{k-1}+2)&\quad 0 \\ (\omega _{k}/\omega _{k-1}-1)&\quad 0&\quad 0&\quad (2\omega _{k}/\omega _{k-1}-1) \end{bmatrix} \end{aligned}$$
(55)

where the ratio \(\omega _{k}/\omega _{k-1}\) can be found to be dependent only on the true anomalies before and after the update (see definitions of \(\omega \) and \(\sigma \) from Eq.  21)

$$\begin{aligned} \frac{\omega _{k}}{\omega _{k-1}}=\Bigl (\frac{l \cdot \sigma _{k-1}}{l \cdot \sigma _{k} } \Bigr )^2=\Bigl (\frac{1+e\cos (\nu _{k})}{1+e\cos (\nu _{k-1})}\Bigr )^2 . \end{aligned}$$
(56)

Therefore, since l does not appear in any other terms of the matrices of Eq. 51, \(\varvec{\varPsi }\) is not a function of the semi-major axis of the transfer orbit.

On the contrary, by the definition of Eq. 53, \(\mathbf {B}\) is linearly dependent on \(\omega \). This makes \(\tilde{\varvec{\varPhi }}_{(f,0)}\) to be a function of the initial and final values of \(\omega \) in a ‘block-form’ as

$$\begin{aligned} \tilde{\varvec{\varPhi }}_{(f,0)} = \mathbf {B}^{-1}(\omega _{f})\varvec{\varPsi }(\nu _{f},\nu _{0})\mathbf {B}(\omega _{0}) \propto \begin{bmatrix} 1&\quad 1&\quad (1/{\omega _{f}})&\quad (1/{\omega _{f}}) \\ 1&\quad 1&\quad (1/{\omega _{f}})&\quad (1/{\omega _{f}}) \\ \omega _{0}&\quad \omega _{0}&\quad 1&\quad 1 \\ \omega _{0}&\quad \omega _{0}&\quad 1&\quad 1 \\ \end{bmatrix} . \end{aligned}$$
(57)

For the final solution of Eq. 54, it is necessary to have the complete initial state \({\mathbf {p}_{0}}_{tot}\), but at this stage of the procedure, the part concerning the derivative is not yet known.

A transformation of the primer vector components from the derivative with respect to time, to the derivative with respect to true anomaly (as in Eq. 39) gives

$$\begin{aligned} \frac{dp}{dt}=\frac{dp}{d\nu }\frac{d\nu }{dt}=\frac{dp}{d\nu }\omega . \end{aligned}$$
(58)

The product between the primer vector in true anomaly, which is \(\mathbf {p}_{tot, \nu _{0}} = (p_{r 0} , p_{\vartheta 0}, \omega _{0}p'_{r 0}, \omega _{0} p'_{\vartheta 0})^{T}\) and \(\tilde{\varvec{\varPhi }}_{(f,0)}\) depends on \(\omega \) as

$$\begin{aligned} \tilde{\varvec{\varPhi }}_{(f,0)}\cdot \mathbf {p}_{tot, \nu _{0}} = \mathbf {p}_{tot, \nu _{f}} \propto \begin{bmatrix} 1&\quad 1&\quad ({\omega _{0}}/{\omega _{f}})&\quad ({\omega _{0}}/{\omega _{f}}) \\ 1&\quad 1&\quad ({\omega _{0}}/{\omega _{f}})&\quad ({\omega _{0}}/{\omega _{f}}) \\ \omega _{0}&\quad \omega _{0}&\quad {\omega _{0}}&\quad {\omega _{0}} \\ \omega _{0}&\quad \omega _{0}&\quad {\omega _{0}}&\quad {\omega _{0}} \\ \end{bmatrix} . \end{aligned}$$
(59)

Having \(\mathbf {p}_{tot, \nu _{f}} = (p_{r f }, p_{\vartheta f}, \omega _{f}p'_{r f}, \omega _{f} p'_{\vartheta f})^{T}\), from Eq.  59 clearly \(p_{rf}\) and \(p_{\vartheta f}\) are independent from the semi-major axis, since the ratio between the initial and final \(\omega \)s of Eq. 59 is of the same form of Eq. 56. The last two rows of \(\tilde{\varvec{\varPhi }}_{(f,0)}\cdot \mathbf {p}_{tot, \nu _{0}}\) of Eq. 59 are linear in \(\omega _{0}\). Since they respectively correspond to \(\omega _{f}p'_{r f}\) and \(\omega _{f} p'_{\vartheta f}\), this implies that the derivatives in true anomaly of the primer vector are also both independent from a.

This is an important conclusion because it states that the primer vector and its derivative are not influenced by the value of the semi-major axis of the transfer orbit. As a result, the size of the search space is reduced and the optimization problem is simplified.

4 Fixed transfer orbit and variable departure/arrival orbits

From the definition at the BC of the primer vector (Eq.  10), \(\mathbf {p}_{0}\) and \(\mathbf {p}_{f}\) depend only on the directions of the initial and final \(\varDelta \mathbf {V}\)s, but they are independent from their magnitudes. Therefore, a ‘natural’ application is to consider a problem where the \(\varDelta \mathbf {V}\)s are parametrized by their directions. In the case where the transfer, departure and arrival orbits are coplanar, the in-plane components of the primer vector can be defined with respect to the orientation of the two local polar reference frames at initial and final position, expressed through two angles \(\alpha \) and \(\beta \):

$$\begin{aligned} \begin{array}{c@{\quad }c} \mathbf {p}_{0} = \displaystyle \begin{bmatrix} \cos \alpha \\ \sin \alpha \end{bmatrix} \\ \\ \mathbf {p}_{f} = \displaystyle \begin{bmatrix} \cos \beta \\ \sin \beta \end{bmatrix} \\ \end{array}. \end{aligned}$$
(60)
Fig. 3
figure 3

Geometric representation of a co-planar departure (\({e_{D}}, {a_{D}}\)), transfer (\({e_{T}}, {a_{T}}\)) and arrival (\({e_{A}}, {a_{A}}\)) orbits case

If the transfer orbit is fixed (therefore also the velocities \(\mathbf {v}_{0}\) and \(\mathbf {v}_{f}\)), the departure and arrival orbits vary according to the orientation (and magnitude) of the \(\varDelta \mathbf {V}\)s (see Fig. 3).

In the local boundary polar reference frames there is

$$\begin{aligned} \begin{array}{c@{\quad }c} \mathbf {v}_{0} = v_{r0}\hat{e}_{r0}+v_{\vartheta 0}\hat{e}_{\vartheta 0} \\ \mathbf {v}_{f} = v_{rf}\hat{e}_{rf}+v_{\vartheta f}\hat{e}_{\vartheta f} \end{array}, \end{aligned}$$
(61)

while the \(\varDelta \)Vs are

$$\begin{aligned} \begin{array}{cc} \varDelta \mathbf {V}_{0} = \varDelta V_{0}(\cos \alpha \hat{e}_{r0}+\sin \alpha \hat{e}_{\vartheta 0}) \\ \varDelta \mathbf {V}_{f} = \varDelta V_{f}(\cos \beta \hat{e}_{r f}+\sin \beta \hat{e}_{\vartheta f}) \end{array}. \end{aligned}$$
(62)

Defining \(\mathbf {v}_{D}\) and \(\mathbf {v}_{A}\) respectively as the velocity at \(\mathbf {r}_{0}\) on the departure orbit and the velocity at \(\mathbf {r}_{f}\) on the arrival orbit, they will be

$$\begin{aligned} \begin{array}{c@{\quad }c} \mathbf {v}_{D} = \mathbf {v}_{0} - \varDelta \mathbf {V}_{0} = (v_{r0}-\varDelta V_{0}\cos \alpha )\hat{e}_{r0}+(v_{\vartheta 0}- \varDelta V_{0}\sin \alpha )\hat{e}_{\vartheta 0} \\ \mathbf {v}_{A} = \mathbf {v}_{f} + \varDelta \mathbf {V}_{f} = (v_{rf}+\varDelta V_{f}\cos \beta )\hat{e}_{rf}+(v_{\vartheta f} + \varDelta V_{f} \sin \beta )\hat{e}_{\vartheta f} \end{array}. \end{aligned}$$
(63)

With a fixed transfer orbit, the total state transition matrix \(\tilde{\varvec{\varPhi }}_{(f,0)}\) of Eq. 54 is fixed as well. In addition, fixing the transfer orbit means fixing its eccentricity and its semi-major axis. This implies that for this scenario the optimality can vary only if the directions of the \(\varDelta \mathbf {V}\)s, \(\alpha \) and \(\beta \), change. In an optimization process, such a problem has a 5 dimensional search space: \(e, \nu _{0}\) and \(\nu _{f}\), which define the transfer orbit, and \(\alpha \) and \(\beta \), which define the BC on the \(\varDelta \mathbf {V}\)s. Hence, the analysis can be approached differently, having restricted the problem to those 5 dimensions.

Lawden’s NCs state that the primer vector is equal to 1 at the BC and at each intermediate impulse (Lawden 1963). In other words, for a trajectory to be optimal, p must be always smaller than 1 along the arc and equal to 1 at the boundaries, where the \(\varDelta \mathbf {V}\)s are fired. Within this scenario, a set of transfer orbits with the same eccentricity can be parametrized by, for example, fixing the initial true anomaly and varying \(\nu _f\).

Figure 4 illustrates the case where a fixed transfer orbit and fixed \(\nu _0\) are considered. The maps show the maximum value of the primer vector according to gradual increasing values of \(\nu _f\), for fixed \(e=0.8\) and \(\nu _0 =\) 5 deg. They are plotted in a complete interval (0–360 deg) of \(\alpha \) and \(\beta \) with a computational step of 5 deg.

As already explained, an optimal transfer will have an overall maximum value of the primer vector equal to 1 (dark blues area in Fig. 4). Non-optimal trajectories have \(p>1\) and therefore in Fig. 4 they are represented in colours (‘non-optimality islands’).

Fig. 4
figure 4

Sequence of maximum values of primer vector maps for a fixed transfer orbit and increasing value of \({\nu _f}\), versus \({\alpha }\) and \({\beta }\). The fixed parameters of the transfer orbit are e \(=\) 0.8, a \(=\) 0.95 DU, \({\nu _0} =\) 5 deg

The problem is very well structured with a precise geometry: for every specific \(\nu _f\) case, the ‘non-optimal’ islands have always the same dimension, shape and orientation. In particular for small span of true anomalies the ‘non-optimal’ islands are very stretched but small in size, therefore almost the whole map of Fig.  4a represents an optimal case. There are two main motivations behind this result, one is that for small \({\varDelta \nu }\), the modulus of the primer vector does not change significantly. In fact, it starts and ends in 1 but its variation along the interval is minimal due to the small span between the initial and final true anomaly. The other reason can be found in the accuracy of the numerical computation (\(10^{-16}\)). Therefore, in Fig. 4a it can be seen that even ‘non-optimal’ islands have very low peak values.

Along the different plots shown in Fig. 4, the islands rotate and grow in size and eventually merge for increasingly higher spans of \(\varDelta \nu \). The maximum value of the primer vector is always located in the central area of the islands and it increases with an increasing interval of true anomalies. In fact, while in the case where \(\varDelta \nu =\) 15 deg there is \(p_{max}\approx 1\), in the last case (\(\varDelta \nu =\) 175 deg), \(p_{max} \approx 3\). These kind of maps allow mission planners not only to visualize the possibility to be in an optimal region but also to change the BC, represented either by the direction of the thrusts (\(\alpha \) and \(\beta \)) or by the true anomalies, to obtain an optimal transfer.

With the same kind of approach, a different scenario can be to exploit the boundary derivatives of the primer vector. It is clear that for an optimal transfer, since the profile of the primer vector has to be smaller than 1, the initial derivative must be negative while the final derivative must be positive. This condition does not ensure that overall the primer vector will be optimal, but it gives information on the possibility of having an optimal trajectory and on the BC of the problem. On the other hand, if \(\dot{p}_0\) is positive and \(\dot{p}_f\) is negative they definitely define non-optimal trajectories. The expansions at first order of the initial and final derivatives are

$$\begin{aligned} \left\{ \begin{array}{l@{\quad }l} \dot{p}_{0} \approx \dot{p}_{r0}p_{r0}+\dot{p}_{\vartheta 0}p_{\vartheta 0} \\ \dot{p}_{f} \approx \dot{p}_{rf}p_{rf}+\dot{p}_{\vartheta f}p_{\vartheta f} \end{array}, \right. \end{aligned}$$
(64)
Fig. 5
figure 5

Sequence of initial and final derivatives maps for a fixed transfer orbit and increasing value of \({\nu _f}\), versus \({\alpha }\) and \({\beta }\). The characteristics of the transfer orbit are e = 0.8, a = 0.95 DU, \({\nu _0} =\) 5 deg

Fig. 6
figure 6

Departure (left-side) and arrival (right-side) orbits Keplerian elements (eccentricity and semi-major axis) versus magnitudes of initial and final \({\varDelta }\mathbf {V}\)s and semi-major axis of the transfer orbits

and they are shown in Fig. 5; specifically, \(\dot{p}_{0}\) is represented in Fig. 5a–c, whereas \(\dot{p}_{f}\) in Fig. 5d–f. As in Fig. 4, the structure of the maps of Fig. 5 is very well defined. These plots are evaluated (as in the previous case) for fixed values of \(e = 0.8\) and \(\nu _0 =\) 5 deg, while \(\nu _f\) is increasing and \(\alpha \) and \(\beta \) are defined in the interval from 0 deg to 360 deg.

For small true anomaly spans (\(\nu _f=\) 20.0 deg), both the derivatives have a sort of ‘stripe’ configuration, with opposite values. In fact, it is clear from Fig. 5a, d that \(\dot{p}_0>0\) and \(\dot{p}_f<0\) and they reach a minimum value of \(\approx -800\) and \(\approx +800\) respectively. Therefore, the two derivatives define the same boundary optimality areas. On the other hand, for larger \(\varDelta \nu , \dot{p}_{0}\) and \(\dot{p}_{f}\) tend to smaller absolute values and they both have positive and negative values. Furthermore, as \(\nu _f\) increases, the two structures are no longer comparable: they move and stretch tending, in the end, into two completely different scenario. At \(\nu _f =\) 180 deg, in fact, the initial derivative has a vertical stripes structure (Fig. 5c), while the final derivative has well-defined rounded non-optimal areas (negative value, represented by dark blue and green regions) which have the same size (Fig. 5f).

For both cases (Figs. 45), the most interesting area of the map is represented when the variables cross their critical boundary value (that is 1 for the primer vector and 0 for its derivative). This aspect represents, in fact, the marginal conditions from which a trajectory changes its optimality properties.

The important characteristic of the results presented so far is that they encompass an entire set of orbits with same primer vector profile but different transfer orbits and BC. Specifically, what is varying are the semi-major axis of the transfer orbit (\(a_{\mathrm{transf}}\)) and the magnitudes of the \(\varDelta \mathbf {V}\)s at the beginning and at the end (therefore the departure and arrival orbits themselves).

Fig. 7
figure 7

Primer vector profile, in common to all departure and arrival orbits of Fig. 6

An interesting complete scenario is represented in Fig.  6, where it is shown how e and a of the departure and arrival orbits vary according to different magnitudes of \(\varDelta \mathbf {V}_{0,f}\) and \(a_{\mathrm{transf}}\). The fixed parameters are the 5 aforementioned dimensions of the search space: BC in direction of \(\varDelta \mathbf V \)s (\(\alpha \) and \(\beta \)), eccentricity of the transfer orbit and BC in position (\(\nu _0\) and \(\nu _f\)). In this specific case the values are \(\alpha =\) 30 deg, \(\beta =\) 60 deg, \(e = 0.5, \nu _0 = 5\) deg and \(\nu _f = 180\) deg. The simulations have been run with the Sun as central body, a set of semi-major axes of the transfer orbit that goes from 0.01 AU to 1.0 AU and a set of magnitudes of \(\varDelta \mathbf V _{0,f}\) that go from 1 to 80 % of the magnitudes of \(\mathbf {v}_{0,f}\) (see Eq. 63).

On the left-hand side of Fig. 7 are the Keplerian elements of the possible departure orbits, while on the right-hand side are the equivalent parameters for the arrival orbits. With the chosen intervals of \(a_{\mathrm{transf}}\) and \(\varDelta \hbox {V}_{0,f}\), the eccentricities of the departure orbits result in a possible variation from roughly 0.5 to 0.87, while the ones of the arrival orbits have higher values (0.8–0.96). On the other hand the values of the semi-major axes of the two orbits are roughly comparable, but their variation is different with respect to \(a_{\mathrm{transf}}\) and \(\varDelta \mathbf V _{0,f}\).

The main conclusion of this result is that all kinds of departure and arrival orbits summarized in Fig. 6 will meet the same objective, in fact they all correspond to the same primer vector profile (shown in Fig. 7), that in this case is non-optimal.

In summary, this procedure gives a different approach to the primer vector theory and allows one to pursue a complete analysis on entire classes of departure and arrival orbits with common characteristics for specific fixed parameters of the transfer trajectory.

5 Conclusions

This paper shows how applying a reference system transformation to the primer vector equation causes the de-coupling between the in-plane and out-of-plane coordinates. The case when the transfer, departure and arrival orbits are co-planar is analyzed. Furthermore, a coordinates transformation is applied with the support of the Hamiltonian of the equations. The Palmer coordinates allow the simplification of the structure of the analysis and give a complete analytical approximation which, given the boundary conditions of the problem, can propagate the primer vector along the arc. For the circular transfer case, Hill’s solution is also a solution for the primer vector problem. The primer vector is shown to be independent from the semi-major axis of the transfer orbit and the mathematical complexity and computational time of the problem are reduced. Furthermore, fixing the transfer orbit, it is established that the dimensions of the search space can be reduced to 5. Two approaches to understand the likelihood of the optimality of a specific set of transfers are examined in the paper, with related optimality maps. Finally it is shown that a single profile of primer vector is related to a set of departure and arrival orbits through the directions of the initial and final \(\varDelta \mathbf {V}\)s and the semi-major axis of the transfer trajectory. The simplicity of the new procedure allows the exploitation of many other properties of the problem, not presented in this paper.