1 Introduction

Only about \(60\%\) of the Moon surface can be seen from the Earth. The first successful attempt to explain this fact was made by Cassini (1693), who borrowed the kinematic model of the ‘triple Earth motion’ from Copernicus (1543, Lib. 1,Cap. 11), and applied it to the Moon with some important amendments. Retaining the postulate of the fixed angle between the rotation axis and the orbital plane, Cassini postulated the equality of orbital period and the sidereal rotation period, which protected the far side from being seen from the Earth. By additionally postulating that the rotation axis, the ecliptic pole, and the lunar orbit pole remain coplanar, Cassini suppressed the possibility of revealing the complete polar caps over one lunar axis precession cycle. Two centuries later, Tisserand (1891) rephrased these postulates as laws. The three Cassini laws of Tisserand state that: (1) rotation and orbital periods are equal, (2) the rotation axis has a constant inclination to the ecliptic, and (3) the three axes are coplanar. The second law differs from the original Cassini’s statement, but in view of the third law, the difference is unimportant.

With the advent of the Newtonian dynamics, the question arose if the Cassini’s model is consistent with equations of motion. This was a part of the problem issued by the French Académie Royale des Sciences for the Prize of 1764. In his prize dissertation and in a later work, Lagrange (1764, 1780) demonstrated that the state described by Cassini is an equilibrium of the associated differential system and studied small librations in its vicinity.

The work of Colombo (1966) brought a new understanding of the Cassini laws and motion near the stationary configuration which they describe in the specific case of the Moon. In particular, Colombo demonstrated that the second and the third laws are conceptually independent from the first law and themselves serve as a basis of an interesting dynamical problem which describes the long-term evolution of the spin axis of an arbitrary rotating body. He thus dropped from his analysis the assumption of the direct spin-orbit resonance, but kept the assumption of general precession of the orbital plane due to perturbations (either caused by the oblate central body, or by other masses in the system). He showed that the long-term dynamics of the spin axis can be described by a simple, one-dimensional problem assuming the orbital node performs a uniform precession and the inclination remains constant. Its stationary points represent generalizations of the Cassini second and third laws.

It was soon understood that the Colombo problem is a very suitable starting point for analysis of the obliquity evolution of terrestrial planets, even when the orbital node and inclination undergo more complex evolution. A fascinating example are studies of Mars obliquity variations in relation to this planet’s past paleoclimate, starting with Ward (1973, 1974). Tides or internal process may additionally change some of the system’s parameters, a situation relevant to all terrestrial planet studies, including the Moon—see Peale (1974), Ward (1975, 1982), or Ward and de Campli (1979), to mention just few examples of early works. Later studies of Laskar and colleagues made a masterful use of detailed knowledge of planetary long-term dynamics and its implications on secular evolution of their spin axes (e.g., Laskar and Robutel 1993; Laskar et al. 1993; Correia and Laskar 2001, and many other with more technical details). Following earlier hints, mentioned already in Harris and Ward (1982), applications to giant planets were also developed in the past two decades (e.g., Ward and Hamilton 2004; Hamilton and Ward 2004; Ward and Canup 2006; Boué et al. 2009; Vokrouhlický and Nesvorný 2015; Brasser and Lee 2015; Rogoszinski and Hamilton 2020).

Beyond planets and satellites, studies of secular spin evolution of asteroids flourished recently, especially after Vokrouhlický et al. (2003) applied it to explain space parallelism of spin axes of large members in the Koronis family (see also Vokrouhlický et al. (2006)). Further applications include spin states of exoplanets (e.g., Atobe et al. 2004; Atobe and Ida 2007; Saillenfest et al. 2019), or artificial satellites and space debris (e.g., Efimov et al. 2018).

Taken altogether, we note that the backbone of all these studies is the basic Colombo model. Interestingly, a systematic mathematical treatment of this elegant Hamiltonian problem has not been significantly advanced beyond the state of art dating back to Henrard and Murigande (1987), and Henrard (1993). These classical works focused on the aspects directly related with the probability of capture into different phase space zones when a slow parameter evolution drives the system across the homoclinic orbits; hence, they paid no attention to regular orbits. One can also recognize the Colombo top in an anonymous quadratic Hamiltonian treated by Lanchares and Elipe (1995), who focused on the qualitative study of its parametric bifurcations.

The main reason to seek for the complete analytical solution of the Colombo top is its significance for further studies of more realistic, perturbed problems. Be it analytical perturbation techniques, or numerical splitting methods, the knowledge of explicit time dependence of the Colombo top motion is crucial. The present work is divided in two principal parts: Sect. 2 explores the problem using purely geometric and algebraic tools, whereas the integration of equations of motion is considered in Sect. 3. In other words, Sect. 2 concerns integral curves, that become time-dependent trajectories in Sect. 3.

The formulation the problem is given in Sect. 2.1, where we introduce two sets of variables: traditional xyz and shifted XYZ. Throughout the text, we switch between the two sets, depending on convenience. In Sect. 2.2, the geometric construction of the integral curves is shown; intersections of the curves with the \(x=X=0\) meridians (their extremities in z) are found in Sect. 2.3 , expressed in terms of the energy constant and of the two parameters ab. Some of these can be critical points (the Cassini states), which allows the expression in terms of the parameters only—given in Sect. 2.4. The information gained allows to partition the phase space and distinguish three types of the Colombo top problem.

In Sect. 3, we first provide a universal solution in terms of the Weierstrass elliptic function \(\wp \) (Sect. 3.1), comparing various formulations of the same result. But since the Weierstrass function behaves differently in various domains of the phase space, we reformulate the solution in terms of the more common Jacobi elliptic functions (Sect. 3.2). Finally, the specific trajectories that admit solutions in terms of elementary functions are presented in Sect. 3.3. The closing Sect. 4 summarizes the results and their implications.

2 The Colombo top model

2.1 Equations of motion

The basic assumptions leading to the Colombo top problem involve a rigid body on an orbit around some distant primary. The orbital motion might be called Keplerian, but with one notable addition: the orbital plane rotates uniformly around some fixed axis in the inertial space with the angular rate \(\mu \). The body is assumed to rotate in the lowest energy state, namely about the shortest principal axis of its inertia tensor. Following the work of Colombo (1966), we assume the rotation period is not in resonance with the revolution period about the primary (see, e.g., Peale 1969 for generalizations to this situation). Considering the quadrupole torque due to the gravitational field of the primary, and averaging over both orbital revolution about the center and rotation cycle about the spin axis, one easily realizes that the total angular momentum of rotation (or, equivalently, the angular velocity \(\omega \) of rotation) is conserved. The whole dynamical problem then reduces to the analysis of the motion of the unit vector \(\mathbf{r}\) of rotation pole in space.

In order to introduce fundamental astronomical parameters of relevance, let us for a moment assume \(\mu =0\), i.e., the orbital plane is fixed in the inertial space. In this simple case, \(\mathbf{r}\) performs regular precession about the fixed direction \(\mathbf{K}\) of the orbital angular momentum with a frequency

$$\begin{aligned} \mu _r = - \alpha \, \cos \varepsilon , \end{aligned}$$
(1)

where \(\alpha \) is the precession constant and \(\varepsilon \) is the obliquity, namely the angle between \(\mathbf{r}\) and \(\mathbf{K}\) (\(\cos \varepsilon =\mathbf{r} \cdot \mathbf{K}\)). Note the minus sign in Eq. (1) which indicates polar regression in the inertial space. In this model, \(\varepsilon \) stays constant and

$$\begin{aligned} \alpha = \frac{3}{2} \frac{n^2}{\omega }\frac{E_b}{(1-e^2)^{3/2}}, \end{aligned}$$
(2)

where n is the orbital mean motion about the primary, \(\omega \) the angular rotational frequency, \(E_b\) the dynamical ellipticity, and e the orbital eccentricity. The dynamical ellipticity expresses degree of non-sphericity of the body, and it is defined using principal moments \(A\le B\le C\) of the inertia tensor as

$$\begin{aligned} E_b = \frac{C-(A+B)/2}{C}. \end{aligned}$$
(3)

Things become more interesting in the case where the orbital plane about the primary is not constant. As mentioned above, the Colombo top model describes the situation when it performs a uniform precession in the inertial space. In particular, \(\mathbf{K}\) revolves uniformly on a cone about a fixed direction \(\mathbf{K}'\) in space, such that the orbital inclination I with respect to the reference plane normal to \(\mathbf{K}'\) is constant. The magnitude of the precession rate is \(\mu \), though most often the orbital plane performs again regression in the inertial space (assuming \(I<90^\circ \)). The interest and complexity of this model revolves about a possibility of a resonance between the two precession frequencies \(-\mu \) and \(\mu _r\). In order to describe it using a simple Hamiltonian model, Colombo (1966) observed it is useful to refer \(\mathbf{r}\) to the reference frame following precession of the orbital plane, thus representing \(\mathbf{r}^\mathrm {T} = (\sin \varepsilon \cos (h-\pi /2),\sin \varepsilon \sin (h-\pi /2),\cos \varepsilon )\). Here, \(h-\pi /2\) is a longitude reckoned from the ascending node of the orbital plane and \(\varepsilon \) is a colatitude measured from \(\mathbf{K}\) as above. It is actually an advantage to introduce \(H=\cos \varepsilon \) (and \(\sin \varepsilon =\sqrt{1-H^2}\)). This is because in terms of the symplectic variables (hH), the Colombo top is a one degree of freedom, conservative problem with a Hamiltonian function

$$\begin{aligned} {\mathscr {H}}_s(h,H) = -\frac{(H-b)^2}{2} - a \sqrt{1-H^2} \, \cos {h} = E_\mathrm {H}, \end{aligned}$$
(4)

and two new nondimensional parameters are defined as

$$\begin{aligned} a = \frac{\mu }{\alpha }\,\sin I, \qquad b = \frac{\mu }{\alpha }\,\cos I. \end{aligned}$$
(5)

As observed by Henrard and Murigande (1987), the discussion can be confined to nonnegative constants a and b thanks to the symmetries \((a,h,H) \rightarrow (-a, h+\pi , H)\), and \((b,h,H) \rightarrow (-b,-h,-H)\) admitted by \({\mathscr {H}}_s\).

Since the equations of motion

$$\begin{aligned} {\dot{h}} = \frac{\partial {\mathscr {H}}_s}{\partial H} = b - H + \frac{a H \cos {h}}{\sqrt{1-H^2}}, \qquad {\dot{H}} = - \frac{\partial {\mathscr {H}}_s}{\partial h} = - a \sqrt{1-H^2} \sin {h}, \end{aligned}$$
(6)

are singular at \(H^2 = 1\), it is better to use the Cartesian coordinates of the unit momentum vector

$$\begin{aligned} {\mathbf {r}}= \left( \begin{array}{c} x \\ y \\ z \\ \end{array} \right) = \left( \begin{array}{c} \sqrt{1-H^2} \sin {h} \\ -\sqrt{1-H^2} \cos {h} \\ H \\ \end{array} \right) . \end{aligned}$$
(7)

Then, similarly to Henrard and Murigande (1987), we obtain the Hamiltonian function

$$\begin{aligned} {\mathscr {H}}_\mathrm {HM}(x,y,z) = - \frac{(z-b)^2}{2} + a y = E_\mathrm {H}, \end{aligned}$$
(8)

that generates equations of motion

$$\begin{aligned} {\dot{x}}= & {} (z-b)(y+a) + a b, \nonumber \\ {\dot{y}}= & {} -(z-b)\,x,\nonumber \\ {\dot{z}}= & {} - a x. \end{aligned}$$
(9)

For the sake of minor simplification, let us propose the shifted variables

$$\begin{aligned} X=x, \quad Y = y+a, \quad Z = z-b, \quad \mathbf {R} = (X,Y,Z)^\mathrm {T} = \mathbf {r}+(0,a,-b)^\mathrm {T}. \end{aligned}$$
(10)

Adding a constant \(a^2\) to the Hamiltonian (8), we simplify it to

$$\begin{aligned} {\mathscr {H}} = - \frac{Z^2}{2} + a Y = E, \end{aligned}$$
(11)

with the new energy constant

$$\begin{aligned} E = E_\mathrm {H}+a^2. \end{aligned}$$
(12)

Equations of motion for the shifted variables are

$$\begin{aligned} {\dot{X}}= & {} Z\,Y + a b, \nonumber \\ {\dot{Y}}= & {} -Z\,X, \nonumber \\ {\dot{Z}}= & {} - a X. \end{aligned}$$
(13)

Notably, when \(a=0\), the problem is simplified to the symmetric free top, with constant Z and uniform rotation of \(\mathbf {r}\) around the third axis, with the frequency \(\sqrt{- 2 E} = |Z |\).

2.2 Geometric interpretation

It is customary to represent the integral curves of Eq. (9) as the intersections of two surfaces:

  1. S1

    a sphere \(x^2+y^2+z^2=1\),

  2. S2

    a parabolic cylinder \((z-b)^2 - 2 a (y+a) + 2 E = 0\), implied by the energy integral.

The symmetry plane of S2 is \(z=b\), and its vertex line is parallel to the x-axis, passing through \(y= (E-a^2)/a\).

Observe that combining S1 and S2 we can also obtain another surface (see Fig. 1):

  1. S3

    a paraboloid of revolution \(x^2+(y+a)^2+ 2 b z - \left( 2E - a^2 + b^2+1 \right) = 0\).

Actually, the S3-based function

$$\begin{aligned} {\mathscr {H}}_\mathrm {a}(\mathbf {r}) = \frac{x^2}{2} + \frac{(y+a)^2}{2} + b z = E' = E + \frac{1-a^2 + b^2}{2}, \end{aligned}$$
(14)

is an alternative Hamiltonian of the Colombo top, leading to the same equations of motion (9) as the Hamiltonian (8).

Fig. 1
figure 1

Surfaces S1, S2, S3 (left) and their section at \(x=0\) (right), for \(a=0.2\), \(b=0.25\), \(E=-0.29\)

The paraboloid S3 has the symmetry axis parallel to the z-axis, and passing through the points \(x=0\), \(y=-a\). Its vertex is located at \(z = E'/b\). The advantage of S3 appears when discussing the limit of \(a=0\). Then, the paraboloid does not change the shape and the intersections of S1 and S3 are circles because of coincidence of the symmetry axes. Contrarily to this, setting \(a=0\) in S2 results in degeneracy: the cylinder breaks in two parallel planes. On the other hand, \(b=0\) turns the S3 paraboloid into a (circular) cylinder, whereas S2 retains its shape. Thus, S2 and S3 (or \({\mathscr {H}}\) and \({\mathscr {H}}_\mathrm {a}\)) can be considered complementary, although the regularity at \(a=0\) seems to us more favorable (e.g., if some perturbation approach is based upon the small inclination assumption).

An integral curve can be represented as a parametric curve in a number of ways. Of course, the best is to solve Eq. (9) and find \(\mathbf {r}(t)\). But before we accomplish it (and, actually, in order to do it), let us consider parameterizations, where two coordinates are expressed in terms of the third. The most straightforward is to solve the system of S1 and S2 equations, using z as a parameter variable, which leads to

$$\begin{aligned} \mathbf {r}(z) = \left( \begin{array}{c} \pm \sqrt{ 1-z^2 - y(z)^2} \\ \frac{ (z-b)^2 + 2 E-2 a^2}{2a} \\ z \\ \end{array} \right) , \text{ or } \mathbf {R}(Z) = \left( \begin{array}{c} \pm \sqrt{ 1-(Z+b)^2 - \left( Y(Z)-a\right) ^2} \\ \frac{ Z^2 + 2 E}{2a} \\ Z \\ \end{array} \right) .\quad \end{aligned}$$
(15)

All three invariant surfaces S1, S2, S3 intersect the plane \(x=0\), which is their common plane of symmetry. The integral curves, as the lines of intersection of S1, S2, and S3 also pass through \(x=0\) and are symmetric with respect to this plane. Moreover, both y and z coordinates of a given integral curve attain their local extremes at \(x=0\), according to equations of motion (13).

2.3 Intersections with the plane \(x=0\)

To find the intersection points of an integral curve with the plane \(x=0\) for some specified energy E, it is enough to find either y or z coordinate. Knowing one of them, one might recover the other from the relation \(y^2+z^2=1\). But this involves the ambiguity of sign; thus, it is better to find z, and then use the parametric equation (15) for y(z). To benefit from minor simplifications, we first find Z and then Y(Z).

According to Eq. (15), the relation between X and Z is

$$\begin{aligned} X = \pm \frac{1}{2a}\,\sqrt{W(Z)}, \end{aligned}$$
(16)

where W(Z) is a polynomial of degree 4

$$\begin{aligned} W(Z) = - Z^4 - 4 E Z^2 - 8 a^2 b Z - 4 \left( E - a^2\right) ^2 + 4 a^2 \left( 1- b^2\right) . \end{aligned}$$
(17)

Note the absence of the cubic term thanks to the use of the shifted variable Z.

Solving the quartic equation \(W(Z)=0\) is a tedious task; the details can be found in “Appendix A”. Briefly, the four roots \(Z_j\) are given in terms of the three roots \(e_i\) of the reduced cubic resolvent equation (additionally modified to match the standard cubic polynomial appearing in the Weierstrass elliptic integrals). Depending on the parameters a and b, selecting some energy value E, the number of meridian intersection points is 4, 3, 2, 1, or 0.

Four intersection points mean that there are two distinct integral curves with the same energy, each intersecting \(x=0\) in two points:

$$\begin{aligned} z_1= & {} b - \sqrt{e_1-\frac{2}{3}E} - \sqrt{e_2-\frac{2}{3}E} - \sqrt{e_3-\frac{2}{3}E}, \nonumber \\ z_2= & {} b - \sqrt{e_1-\frac{2}{3}E} + \sqrt{e_2-\frac{2}{3}E} + \sqrt{e_3-\frac{2}{3}E}, \end{aligned}$$
(18)

for a lower curve, and

$$\begin{aligned} z_3= & {} b + \sqrt{e_1-\frac{2}{3}E} - \sqrt{e_2-\frac{2}{3}E} + \sqrt{e_3-\frac{2}{3}E}, \nonumber \\ z_4= & {} b + \sqrt{e_1-\frac{2}{3}E} + \sqrt{e_2-\frac{2}{3}E} - \sqrt{e_3-\frac{2}{3}E}, \end{aligned}$$
(19)

for the upper curve, with \(e_j\) defined in Eq. (141) as functions of E, a, and b, such that \(z_1< z_2< z_3 < z_4\).

Three intersection points mean that either one of the two curves contracts to a single point, or two curves share the same intersection point. These situations are distinguished by the sign of invariant \(g_3\) from Eq. (135).

  • If \(g_3 > 0\), then the upper curve shrinks into a point with

    $$\begin{aligned} z_{34} = b + \sqrt{e_1- \frac{2}{3}E}. \end{aligned}$$
    (20)

    The remaining intersection points of the lower curve are

    $$\begin{aligned} z_1= 2 b - z_{34} - 2 \sqrt{e_{23}-\frac{2}{3}E}, \qquad z_2=2 b - z_{34} + 2 \sqrt{e_{23}-\frac{2}{3}E}. \end{aligned}$$
    (21)

    The expressions of \(e_1\) and \(e_{23}\) are given in Eq. (145).

  • If \(g_3 < 0\), then the upper and the lower curves meet at

    $$\begin{aligned} z_{23} = b + \sqrt{e_3 -\frac{2}{3}E}, \end{aligned}$$
    (22)

    with the remaining intersection points at

    $$\begin{aligned} z_1= 2b - z_{23} - 2 \sqrt{e_{23}-\frac{2}{3}E}, \qquad z_4 = 2b - z_{23} + 2 \sqrt{e_{23}-\frac{2}{3}E}. \end{aligned}$$
    (23)

If \(e_1=e_2=e_3\), then \(z_2\) from Eq. (21) becomes equal to \(z_{34}\) and only two intersection points remain

$$\begin{aligned} z_1 = b - 3 \root 3 \of {a^2 b}, \qquad z_{234} = b+ \root 3 \of {a^2 b}, \end{aligned}$$
(24)

with \(a^\frac{2}{3}+b^\frac{2}{3}=1\). This case requires a unique value of energy with a simple expression (154).

A more generic situation with two intersection points occurs when E defines only one trajectory, with \(z_1 < z_2\) given by

$$\begin{aligned} z_1= & {} b - \sqrt{e_1-\frac{2}{3}E} - \sqrt{ e_1- 2 E + \sqrt{\nu }}, \nonumber \\ z_2= & {} b - \sqrt{e_1-\frac{2}{3}E} + \sqrt{ e_1 - 2 E + \sqrt{\nu } }, \nonumber \\ \nu= & {} \left( e_1+ \frac{4}{3}E\right) ^2+e_\mathrm {c}^2, \end{aligned}$$
(25)

where \(e_1\) and \(e_\mathrm {c}\) are defined in Eqs. (142) and (144). Two other roots of \(W(z-b)=0\), i.e., \(z_3= Z_3+b\) and \(z_4 = {\overline{z}}_3\) are complex, so they do not define the intersections—see Eq. (151).

Finally, one intersection point means that for the energy E there is only one integral curve that contracted to a point with

$$\begin{aligned} z_{12} = b - \sqrt{e_1 - \frac{2}{3} E}, \end{aligned}$$
(26)

and \(e_1\) given by Eq. (145).

In further discussion, the point of intersection of a regular curve with the \(x=0\) plane will be named a turning point, unless it is an equilibrium from the dynamical point of view.

2.4 Cassini states and homoclinic orbits

Finding the Cassini states can be approached from different points of view. Geometrically, they are the points of tangency of the surfaces S1, S2, and S3. Algebraically, they are the multiple roots of \(x(z,E)=0\), and \(x(y,E)=0\). Dynamically, they are the fixed points of equations of motion (9) or, equivalently, the local extremes and saddle points of the Hamiltonian on a sphere S1.

In principle, the multiple roots have been found in Sect. 2.3. But they are given in terms of E, which is an implicit function of a and b as a root of \(\varDelta (E,a,b)=0\), where \(\varDelta \) is the discriminant of W(Z), defined in Eq. (136). Hence, one possible way is to find the roots of \(\varDelta =0\), which is a quartic equation in E, and substitute them into \(z_{12}\), \(z_{23}\), or \(z_{34}\) from Sect. 2.3 (we already did it for \(z_{234}\)). The alternative is to find the stationary points (xyz) of the Hamiltonian and use them to evaluate the energy by the substitution into \({\mathscr {H}}(x,y,z)=E\). The latter path is more convenient and was recently taken by Saillenfest et al. (2019).

Discussing the Cassini states as the multiple roots of \(W(Z)=0\), we should include the information, that they are also the roots of its derivative \(W'(Z)=0\). Since two polynomials have a common root if and only if their resultant equals zero, we ask about the solutions of \(R(W(Z),W'(Z))=0\). If we treat W(Z) and \(W'(Z)\) as polynomials in Z, the result is simply the discriminant \(\varDelta \) from Eq. (136), multiplied by a constant (negative) factor. Yet, we can also treat W(Z) and \(W'(Z)\) as polynomials of E with degrees 2 and 1, respectively, whose coefficients depend on Z. So, using

$$\begin{aligned}&W(Z) = U_1(E) = - 4 E^2 + 4 \left( 2 a^2 - Z^2 \right) E - Z^4 - 4 a^2 \left( 2 b Z + a^2 + b^2 - 1\right) , \end{aligned}$$
(27)
$$\begin{aligned}&W'(Z) = U_2(E) = - 8 Z E - 4 Z^3 - 8 a^2 b, \end{aligned}$$
(28)

we obtain the resultant \(R(U_1(E),U_2(E)) = 256 a^2 w(Z)\), where

$$\begin{aligned} w(Z) = - Z^4 - 2 b Z^3 + 3 \rho \, Z^2 - 2 a^2 b Z - a^2 b^2, \end{aligned}$$
(29)

and

$$\begin{aligned} \rho = \frac{1-a^2-b^2}{3}. \end{aligned}$$
(30)

For easier reference to Saillenfest et al. (2019), we abandon the shifted variables and solve \(w(z-b)=0\), which is the same equation as the one they used. However, thanks to considering the Cartesian variables on a sphere, we do not have to warn about any loss of sign, because it does not occur neither in evaluating \(R(U_1,U_2)\), nor in obtaining W(Z). The value of y associated with a given z results directly from the condition \({\dot{x}}=0\) in Eq. (9), being

$$\begin{aligned} y = -a - \frac{ a b}{z-b} = - \frac{a z}{z-b}. \end{aligned}$$
(31)

Solving \(w(z-b)=0\), we can recycle the procedure used for \(W(Z)=0\). More details can be found in “Appendix B” so here we only summarize the final results. When considering the problem on a sphere, there are only two generic situations: either there are two Cassini states \(C_2\) and \(C_3\), or there are four of them: \(C_1\), \(C_2\), \(C_3\) and \(C_4\). Let us call the former case ‘the Colombo top problem of type II,’ and the latter—‘type IV.’ The special case when there are three Cassini states will be called ‘the Colombo top problem of type III’ (see Fig. 2).

In the following subsections, we briefly review each type, providing the coordinates of the Cassini states \(C_j\) as the functions of a and b parameters. To avoid confusion with the \(z_i\) expressions of Sect. 2.3, we label the coordinates of \(C_j\) as \(z^*_j\), and \(y^*_j\) (of course, \(x^*_j=0\) for all the points). We only provide \(z^*_j\) that allows to find \(y^*_j\) from Eq. (31), and then, the energy \(E_j\) of the Cassini state is

$$\begin{aligned} E_j = - \frac{\left( z^*_j-b\right) ^2}{2} - \frac{ a^2 b}{z^*_j-b}. \end{aligned}$$
(32)

Substituting this energy value into an appropriate \(z_{12}\), \(z_{23}\), \(z_{34}\), or \(z_{234}\) from Sect. 2.3, should result in returning to the expression of \(z^*_j\) in terms of a and b.

Fig. 2
figure 2

Three types of the Colombo top established by the values of a and b, shown at the \(x \ge 0\) hemisphere of S1. The discriminant \(\varDelta \) (136) is positive in the shaded area, negative in white zones, and null on the thick curves \(\varGamma \) or at the Cassini states C. Exemplary parameters values are: \(a=b=0.5\) (type II), \(a= (3/4)^3, b = (7/16)^\frac{3}{2}\) (type III), and \(a=b=0.2\) (type IV)

2.4.1 Type II

When \(a^{\frac{2}{3}}+b^{\frac{2}{3}} > 1\), the dynamics on a sphere is relatively simple. There are two stable equilibria (elliptic points): \(C_3\) at the lower (\(z<0\)) hemisphere and \(C_2\) at the upper hemisphere, located at

$$\begin{aligned} z^*_3 = \frac{b}{2} - \sqrt{ A_2 } - \sqrt{ B_2}, \qquad z^*_2 = \frac{b}{2} - \sqrt{A_2} + \sqrt{B_2}, \end{aligned}$$
(33)

where

$$\begin{aligned} A_2= & {} - P'_2 +\frac{1}{2}\,\left( \root 3 \of {g'_3+ \sqrt{-\varDelta '} } + \root 3 \of {g'_3 - \sqrt{-\varDelta '} } \right) , \nonumber \\ B_2= & {} -3 P'_2 - A_2 + 2 \sqrt{A_2^2 + 3 P'_2 \left( P'_2+A_2\right) - \frac{3 \rho ^2}{16}}, \end{aligned}$$
(34)

and according to Eqs. (164) and (163)

$$\begin{aligned} \sqrt{-\varDelta '} = \frac{ab}{4}\sqrt{a^2b^2 - \rho ^3},\quad g'_3 = \frac{a^2 b^2}{4} - \left( \frac{\rho }{2}\right) ^3, \quad P'_2 = - \frac{\rho }{2} - \frac{b^2}{4}. \end{aligned}$$
(35)

We can identify the Cassini states coordinates as \(z^*_3 = z_{12}(E_3)\), and \(z^*_2 = z_{12}(E_2)\), where \(z_{12}\) is given by Eq. (26).

The energy values in the type II problem are bounded by \(E_3 \le E \le E_2\). All trajectories with energy \(E_3< E < E_2\) are simple periodic curves (one for each value of E), oscillating in z between \(z_1\) and \(z_2\), as given by Eq. (25). Extending and modifying the domains labeling of Henrard and Murigande (1987), let us label the entire sphere surface (with the two Cassini states excluded) as \(\mathrm {D}_{23}\) (Fig. 2, left).

2.4.2 Type IV

When \(a^{\frac{2}{3}}+b^{\frac{2}{3}}<1\), the flow is shaped by the presence of three elliptic fixed points \(C_1\), \(C_2\), \(C_3\), and a hyperbolic point \(C_4\) (Fig. 2, right). The latter is accompanied by two homoclinic orbits—upper \(\varGamma _{1}\) and lower \(\varGamma _{2}\). Thus, the sphere is first partitioned into three domains: \(\mathrm {D}_1\) between \(C_1\) and \(\varGamma _{1}\), \(\mathrm {D}_2\) between \(C_2\), \(\varGamma _{1}\), \(\varGamma _{2}\), and the third area bounded by \(C_3\) and \(\varGamma _2\). But the last of the three domains is further divided by the curve whose energy equals that of \(C_1\)—the thick dashed curve \(\varGamma _3\) in Fig. 2. The subdomains \(\mathrm {D}_3\) and \(\mathrm {D}_4\) may look similar from the geometrical point of view, but note the fact that each curve in \(\mathrm {D}_4\) has a companion in \(\mathrm {D}_1\) with the same energy, which is not the case in \(\mathrm {D}_3\).

Equation (171) from “Appendix B” can be used as they are, but here we add an alternative form, based upon the transformation (129):

$$\begin{aligned} z^*_1= & {} \frac{b}{2} + \sqrt{A_4} + \sqrt{B_4} - \sqrt{B'_4} = \frac{b}{2} + \sqrt{A_4} + \sqrt{B_4+B'_4 - 2 \sqrt{B_4 B'_4}}, \nonumber \\ z^*_2= & {} \frac{b}{2} - \sqrt{A_4} + \sqrt{B_4} + \sqrt{B'_4} = \frac{b}{2} - \sqrt{A_4} + \sqrt{B_4+B'_4 + 2 \sqrt{B_4 B'_4}}, \nonumber \\ z^*_3= & {} \frac{b}{2} - \sqrt{A_4} -\sqrt{B_4} - \sqrt{B'_4} = \frac{b}{2} - \sqrt{A_4} - \sqrt{B_4+B'_4 + 2 \sqrt{B_4 B'_4}}, \nonumber \\ z^*_4= & {} \frac{b}{2} + \sqrt{A_4} - \sqrt{B_4} + \sqrt{B'_4} = \frac{b}{2} + \sqrt{A_4} - \sqrt{B_4+B'_4 - 2 \sqrt{B_4 B'_4}}, \end{aligned}$$
(36)

where

$$\begin{aligned} A_4 = \frac{b^2}{4}+ \rho \cos ^2\frac{\phi _4}{3}, \qquad B_4 = \frac{b^2}{4}+ \rho \cos ^2\frac{\phi _4-\pi }{3} , \qquad B'_4 = \frac{b^2}{4}+ \rho \cos ^2\frac{\phi _4+\pi }{3}, \end{aligned}$$
(37)

and

$$\begin{aligned} \cos {\phi _4} = a b \rho ^{-\frac{3}{2}}. \end{aligned}$$
(38)

From \(A_4< B_4 < B'_4\), we infer \(z^*_3< 0< z^*_2< b< z^*_4 < z^*_1\). The associated y coordinates satisfy inequalities \(y^*_4< y^*_1< -a< y^*_3< 0 < y^*_2\).

The energy values are \(E_3< E_1< E_4< 0 < E_2\). Referring to Sect. 2.3, we identify \(z^*_1 = z_{34}(E_1)\), \(z^*_2 = z_{12}(E_2)\), \(z^*_3 = z_{12}(E_3)\), and \(z^*_4 = z_{23}(E_4)\). The unstable equilibrium energy \(E_4\) is of special importance, because it serves to determine the turning points of the separatrices from Eq. (23): \(z_4(E_4)\) for \(\varGamma _1\), and \(z_1(E_4)\) for \(\varGamma _2\).

For the energy values \(E_1< E < E_4\), when \(\varDelta > 0\), each E refers to two periodic curves: one in \(\mathrm {D}_4\) with the turning points \((z_1,z_2)\) given by (18), and one in \(\mathrm {D}_1\) with \((z_3,z_4)\) given by Eq. (19). An energy in \(E_3< E < E_1\) defines only one periodic trajectory in \(\mathrm {D}_3\), and each \(E_4< E < E_2\) defines one periodic curve in \(\mathrm {D}_2\). Since \(\varDelta < 0\) in both cases, the turning points are given by \((z_1,z_2)\) from Eq. (25).

2.4.3 Type III

The case of \(a^\frac{2}{3}+b^\frac{2}{3}=1\) is specific, but it has to be included to understand the bifurcation between the two neighboring types. Increasing a and/or b from the type IV, we observe that the two Cassini states \(C_1\) and \(C_4\) merge into a single point \(C_{14}\) of neutral stability, the homoclinic orbit \(\varGamma _1\) contracts to a point, and the separatrix \(\varGamma _2\) merges with the specific periodic orbit \(\varGamma _{3}\) into \(\varGamma _{23}\). Thus, the domains \(\mathrm {D}_1\) and \(\mathrm {D}_4\) disappear. In course of transition from type III to type II, the division between \(\mathrm {D}_{2}\) and \(\mathrm {D}_3\) disappears; hence, we merge them into a single \(\mathrm {D}_{23}\). Lanchares and Elipe (1995) refer to this transition under the name of teardrop bifurcation.

The coordinates of the Cassini states can be obtained from Eq. (174) and are expressible in terms of a or b alone, resulting in

$$\begin{aligned} z^*_{14} = \sqrt{1-A_3}= b^\frac{1}{3}, \qquad y^*_{14} = - \sqrt{A_3} = - a^\frac{1}{3}, \end{aligned}$$
(39)

and

$$\begin{aligned} z^*_3 = - z^*_{14} \left( A_3 + \sqrt{B_3} \right) , \qquad z^*_2 = - z^*_{14} \left( A_3 - \sqrt{B_3} \right) , \end{aligned}$$
(40)

where

$$\begin{aligned} A_3 = 1- b^\frac{2}{3} = a^\frac{2}{3}, \qquad B_3 = A^2_3-A_3+1. \end{aligned}$$
(41)

The expression of the energy at \(C_{14}\) is so simple, that we write it explicitly

$$\begin{aligned} E_{14} = - \frac{3}{2} \root 3 \of {a^4 b^2}= - \frac{3 \left( z^*_{14}\right) ^2 \left( y^*_{14}\right) ^4}{2} = - \frac{3 A_3^2 \left( 1-A_3\right) }{2}. \end{aligned}$$
(42)

It is useful also in finding the turning point of \(\varGamma _{23}\) at \(x=0\), which is

$$\begin{aligned} z_\mathrm {m} = z^*_{14} \left( 1-4A_3\right) , \qquad y_\mathrm {m} = - y^*_{14} \left( 3-4 A_3\right) . \end{aligned}$$
(43)

For regular trajectories with \(E_3< E < E_2\), and \(E \ne E_{14}\), their turning points are given by \(z_1\) and \(z_2\) from Eq. (25), which also means \(z_1 < z_2\).

3 Analytical solution

3.1 Weierstrass form

3.1.1 General framework

Equations of motion (9) or (13) admit an exact analytical solution in terms of the elliptic functions. Actually, the solution hinges upon the fact that the variable Z can be considered separately from the remaining two. To see it, we can differentiate \({\dot{Z}}=-aX\), substitute \({\dot{X}}\) from the first of Eq. (13), and use the energy integral (11) to eliminate Y, obtaining

$$\begin{aligned} {\ddot{Z}} = - \frac{1}{2} Z^3 - E Z - a^2 b. \end{aligned}$$
(44)

Equation (44) defines a 1 degree of freedom, conservative system with the potential

$$\begin{aligned} V_z(Z) = \frac{Z^4}{8} + \frac{E}{2} Z^2 + a^2 b Z, \end{aligned}$$
(45)

and the energy integral

$$\begin{aligned} \frac{{\dot{Z}}^2}{2} + V_z(Z) = E_z = \mathrm {const}. \end{aligned}$$
(46)

But the potential \(V_z(Z)\) and the energy \(E_z\) are closely related to the polynomial W(Z):

$$\begin{aligned} E_z - V_z(Z) = \frac{1}{8} W(Z), \text{ with } 2 E_z = \frac{W(0)}{4} = - \left( E - a^2\right) ^2 + a^2 \left( 1- b^2\right) . \end{aligned}$$
(47)

Thus, whether we use Eq. (46), or the squared Eqs. (13) and (16), the outcome is \(4 {\dot{Z}}^2 = W(Z)\), amenable to the separation of variables method.

Since W(Z) is a quartic polynomial, finding Z(t) amounts to inverting the elliptic integral

$$\begin{aligned} \sigma _z \intop _{Z_{0}}^{Z}\frac{\mathrm {d}Z}{\sqrt{W(Z)}} = \frac{1}{2} \intop _{t_0}^{t} \mathrm {d}t, \end{aligned}$$
(48)

where \(\sigma _z = \mathrm {sgn}{{\dot{Z}}} = - \mathrm {sgn}{X}\), and the right-hand side evaluates to

$$\begin{aligned} \tau _0 = \frac{t-t_0}{2}. \end{aligned}$$
(49)

The integral to the left of (48) should be a monotonous function of Z to allow the inversion (solving for Z(t)). In the admissible range of Z between two turning points (or one turning point and an unstable equilibrium), the sign of \({\dot{Z}}\) is constant, which allows to establish its value from the initial condition and to pull \(\sigma _z\) out of the integrand.

The integral to the left of Eq. (48) is an elliptic integral, and it can be reduced to the Weierstrass normal form by an appropriate transformation \(Z \rightarrow s\), so that

$$\begin{aligned} \sigma _z \intop _{Z_0}^{Z}\frac{\mathrm {d}Z}{\sqrt{W(Z)}} = \intop _{s}^{\infty } \frac{\mathrm {d}s}{\sqrt{S(s)}} = \tau _0, \end{aligned}$$
(50)

where \(S(s)= 4 s^3 - g_2 s - g_3\) is the cubic polynomial of the Weierstrass resolvent (133) for \(W(Z)=0\), with the invariants \(g_2\), \(g_3\) defined in “Appendix A.2”—Eqs. (134) and (135). Note that the initial value \(Z_0\) is always mapped to \(s \rightarrow \infty \), regardless of the ordering of the integration limits \(Z_0\) and Z, which explains the presence of \(\sigma _z\) in the forthcoming transformations.

Solving the rightmost part of Eq. (50) amounts to the substitution of the Weierstrass elliptic function

$$\begin{aligned} s = \wp (\tau _0; g_2, g_3), \text{ hence } \mathrm {d}s = \wp '(\tau _0; g_2, g_3) \mathrm {d}\tau _0. \end{aligned}$$
(51)

In all further instances, we will use the abbreviated notation for the Weierstrass \(\wp \) function \(\wp (u) = \wp (u; g_2, g_3)\), whenever the invariants \(g_2\) and \(g_3\) from “Appendix A.2” are used. Only the invariants different than \(g_2\) and \(g_3\) will be added to the list of arguments if needed. The derivative of the Weierstrass \(\wp \) function obeys

$$\begin{aligned} \left| \wp '(\tau _0 ) \right| = \sqrt{S(\wp (\tau _0 ))}, \end{aligned}$$
(52)

and in the first half-period \(0< \tau _0 < \omega _1 = \wp ^{-1}(e_1)\), we have \(\wp '(\tau _0 )<0\). Thus, indeed

$$\begin{aligned} \intop _{s}^{\infty } \frac{\mathrm {d}s}{\sqrt{S(s)}} = \intop _{\tau _0}^{0} \frac{\wp '(\tau )}{\left| \wp '(\tau )\right| }\,\mathrm {d}\tau = - \intop _{\tau _0}^{0} \mathrm {d}\tau = \tau _0, \end{aligned}$$
(53)

as expected from Eq. (50).

Once the solution for the first half-period is found, its continuation can be investigated by the substitution into Eq. (44) which is free from the restrictions imposed by the inversion procedure.

3.1.2 Initial conditions at turning point

Let us begin with the easiest situation, when the initial condition is \(Z_0 = Z_j = z_j-b\), and \(t_0=t_j\) is the epoch of crossing the turning point \(X_0=0\), i.e., \({\dot{Z}}=0\). Then, the integral to the left of (50) is reduced to the Weierstrass normal form in variable s through the rational transformation (Enneper 1890; Bianchi 1901)

$$\begin{aligned} Z = Z_j + \frac{\frac{1}{4} W'(Z_j)}{s - \frac{1}{24} W''(Z_j)}, \qquad s = \frac{W'(Z_j)}{4 \left( Z-Z_j\right) } + \frac{1}{24} W''(Z_j). \end{aligned}$$
(54)

Replacing the subscript 0 with j in the formulae of Sect. 3.1.1, we find

$$\begin{aligned} Z = Z_j + \frac{\frac{1}{4} W'(Z_j)}{\wp (\tau _j) - \frac{1}{24} W''(Z_j)}, \end{aligned}$$
(55)

where

$$\begin{aligned} \tau _j = \frac{t-t_j}{2}. \end{aligned}$$
(56)

Thanks to the symmetry of trajectory with respect to the turning point, the solution does not depend on \(\sigma _z\) and remains valid for all values of \(\tau _j\).

Given the initial conditions \(X(t_j) = 0\), \(Y(t_j)=Y_j\), \(Z(t_j)=Z_j\), and knowing Z(t), we may complete the solution for \(\mathbf {R}(t)\). The two missing variables come from the energy integral (11), with

$$\begin{aligned} E = - \frac{Z^2}{2} + a Y = - \frac{Z_j^2}{2} + a Y_j, \end{aligned}$$
(57)

and from the third of the equations of motion (13), i.e., \(X = - {\dot{Z}}/a\), so

$$\begin{aligned} X= & {} - \left( \frac{Z -Z_j}{2 a}\right) ^2 \frac{\wp '(\tau _j)}{a b + Y_j Z_j}, \nonumber \\ Y= & {} Y_j + \frac{Z^2-Z_j^2}{2 a}, \nonumber \\ Z= & {} Z_j - \frac{ 2 a^2 b + 2 E Z_j + Z_j^3}{\wp (\tau _j) + \frac{1}{3} E + \frac{1}{2} Z_j^2} = Z_j - \frac{6 a \left( a b + Y_j Z_j\right) }{3 \wp (\tau _j) + a Y_j + Z_j^2}. \end{aligned}$$
(58)

The coordinates on the unit sphere are, as usually, \(x=X\), \(y=Y-a\), and \(z=Z+b\).

3.1.3 Arbitrary initial conditions: Weierstrass–Biermann form

If the initial conditions are not at the turning point, i.e., \(Z_0 \ne Z_j\), the transformation \(Z \mapsto s\) is more cumbersome than (54). Whittaker and Watson (1927) quote two alternative forms of the final solution: one due to Weierstrass, published by Biermann (1865), and one by Mordell (1915). Actually, there is yet another, formally elegant form—the secondo metodo d’inversione of Bianchi (1901), but the relation of its constants to initial conditions is rather awkward, so we do not consider it here.

The Weierstrass–Biermann form results from the substitution (Enneper 1890)

$$\begin{aligned} Z= & {} Z_0 + \frac{\sigma _z \sqrt{W_0} \sqrt{S(s)} +\frac{1}{2} W_1 \left( s - \frac{1}{12} W_2\right) + \frac{1}{4} W_0 W_3}{2 \left( s-\frac{1}{12} W_2\right) ^2 - \frac{1}{2} W_0 W_4 }, \end{aligned}$$
(59)
$$\begin{aligned} s= & {} \frac{\sqrt{W_0} \sqrt{W(Z)} + W_0 + \frac{1}{2} W_1 \left( Z-Z_0\right) + \frac{1}{6} W_2 \left( Z-Z_0\right) ^2 }{2 \left( Z-Z_0 \right) ^2}, \end{aligned}$$
(60)

where \(\sigma _z\) is the sign of \(Z-Z_0\), and

$$\begin{aligned} W_k = \frac{1}{k!} \left[ \frac{\mathrm {d}^k W(Z)}{\mathrm {d}Z^k}\right] _{Z_0}, \end{aligned}$$
(61)

are the Taylor coefficients in

$$\begin{aligned} W(Z) = \sum _{k=0}^4 W_k (Z-Z_0)^k. \end{aligned}$$
(62)

After expressing E in terms of the initial conditions, they take the form

$$\begin{aligned}&W_0 = 4 a^2 X_0^2, \qquad \qquad \, \, W_1 = - 8 a \left( a b+Y_0 Z_0\right) , \nonumber \\&W_2 = - 4 \left( a Y_0 + Z_0^2\right) , \quad W_3 = - 4 Z_0, \quad W_4 = -1. \end{aligned}$$
(63)

Substituting \(s = \wp (\tau _0)\), and recalling that \(\sqrt{S(s)} = - \wp '(\tau _0)\) over the first half-period, we obtain

$$\begin{aligned} Z = Z_0 + \frac{-\sigma _z \sqrt{W_0} \wp '(\tau _0) +\frac{1}{2} W_1 \left( \wp (\tau _0) - \frac{1}{12} W_2\right) + \frac{1}{4} W_0 W_3}{ 2 \left( \wp (\tau _0) -\frac{1}{12} W_2\right) ^2 - \frac{1}{2} W_0 W_4 }. \end{aligned}$$
(64)

Then, accounting for Eq. (63) and the fact that \(\sqrt{W_0} = 2 a |X_0| = - 2 \sigma _z a X_0\),

$$\begin{aligned} Z = Z_0 + a \, \frac{X_0 \wp '(\tau _0) - 2 \left( a b + Y_0 Z_0 \right) \left[ \wp (\tau _0) + \frac{1}{3} \left( a Y_0 + Z_0^2\right) \right] - 2 a X_0^2 Z_0}{ \left[ \wp (\tau _0) + \frac{1}{3} \left( a Y_0 + Z_0^2\right) \right] ^2 + a^2 X_0^2 }. \end{aligned}$$
(65)

The substitution into Eq. (44) with the initial condition \({\dot{Z}}_0 = - a X_0\) shows that the formula (65) is actually valid for all values of \(\tau _0\), so the initial restriction to the first half-period can be abolished.

Like before, the remaining two variables YX are found from the energy integral and from the equations of motion, respectively. Thus, for Y, we have simply

$$\begin{aligned} Y = Y_0 + \frac{Z^2-Z_0^2}{2 a}, \end{aligned}$$
(66)

whereas \(X= - {\dot{Z}}/a\), requires the differentiation and some manipulations leading to

$$\begin{aligned} X {=} \frac{- X_0 \left( 3 \wp (\tau _0)- \frac{1}{4}g_2\right) {+} \left( a b + Y_0 Z_0 \right) \wp '(\tau _0) {+} \frac{(Z-Z_0)}{a} \left[ \wp (\tau _0) + \frac{1}{3} \left( a Y_0 + Z_0^2\right) \right] \wp '(\tau _0)}{ \left[ \wp (\tau _0) + \frac{1}{3} \left( a Y_0 + Z_0^2\right) \right] ^2 + a^2 X_0^2 }, \end{aligned}$$
(67)

where the second derivative has been removed using the identity \(\wp ''=6 \wp ^2-(g_2/2)\).

3.1.4 Arbitrary initial conditions: Safford form

If \(X_0=0\), so \(\tau _0=\tau _j\), the above solution simplifies to Eq. (58) in a straightforward manner. On the other hand, as pointed out by Safford (1919), the general solution (64) can be derived from the particular solution (55) by assuming \(\tau _j = \tau _0 + \phi _j\), and making use of the addition theorem for the Weierstrass function \(\wp \). Given the initial conditions \(X_0,Y_0,Z_0\) at \(t=t_0\), we can find the turning point coordinates \(Z_j, Y_j\) for this trajectory using the formulae of Sect. 2.3. Then, according to Safford (1919), the phase \(\phi _j\) is defined through Eq. (60) at \(Z=Z_j\), giving \(s=s_j = \wp (\phi _j)\). Thus, as an alternative to using Eqs. (65) and (66) for arbitrary initial conditions, one can first compute \(\phi _j = \wp ^{-1}(s_j)\), and then apply the particular solution (58) with \(\tau _j = \tau _0 + \phi _j\).

But Safford (1919) cared solely about the reduction of the integrand form, so he paid no attention to the problem that \(\wp (\phi _j)\) is uniquely invertible only in the domain \(0 \le \phi _j \le \omega _1\), i.e., within the first half-period. In order to properly place the phase in the full period range \(- \omega _1 \le \phi _j \le \omega _1\), one needs the information about the sign of the derivative \(\wp '(\phi _j)\). To this end, we take a slightly different approach, that actually leads to a simpler expression for \(s_j\). Substituting \(\tau _j=\phi _j\), and \(\mathbf {R}=\mathbf {R}_0\) in Eq. (58), we can solve them to find

$$\begin{aligned} \wp (\phi _j)= & {} - 2 a \frac{a b + Y_j Z_j}{Z_0-Z_j} - \frac{a Y_j + Z_j^2}{3}, \end{aligned}$$
(68)
$$\begin{aligned} \wp '(\phi _j)= & {} - X_0 \left( a b + Y_j Z_j\right) \, \left[ \frac{2a}{Z_0-Z_j}\right] ^2, \end{aligned}$$
(69)

and this set allows the unique determination of the phase. Then, we can apply Eq. (58) to obtain

$$\begin{aligned} X= - \left( \frac{Z -Z_j}{2 a}\right) ^2 \frac{\wp '(\tau _0+\phi _j)}{a b + Y_j Z_j}, \qquad Z = Z_j - \frac{6 a \left( a b + Y_j Z_j\right) }{3 \wp (\tau _0+\phi _j) + a Y_j + Z_j^2}, \end{aligned}$$
(70)

with \(Y_j\) and Y derived from the energy integral.

Compared to the Weierstrass–Biermann solution, we gain the simplicity at the expense of pre-computing the turning point coordinates and the phase for the given (arbitrary) initial conditions.

3.1.5 Arbitrary initial conditions: Mordell form

The inversion formula of Mordell (1915) was formulated in the language of homogeneous binary forms; in order to apply it to the Colombo top problem, let us translate it to the univariate polynomials framework. The link is simple: the quartic polynomial W(Z) from Eq. (122), and the quartic binary form

$$\begin{aligned} V(\xi ,\eta ) = a_0 \xi ^4 + 4 a_1 \xi ^3 \eta + 6 a_2 \xi ^2 \eta ^2 + 4 a_3 \xi \eta ^3 + a_4 \eta ^4, \end{aligned}$$
(71)

can be matched by

$$\begin{aligned} W(Z) = V(Z,1), \qquad V(\xi ,\eta ) = \eta ^4 W(\xi \eta ^{-1}). \end{aligned}$$
(72)

Thus, after the substitution \(Z=\xi \eta ^{-1}\), Eq. (50) is equivalent to

$$\begin{aligned} \sigma _z \intop _{(\xi _0,\eta _0)}^{(\xi ,\eta )}\frac{\eta \mathrm {d}\xi - \xi \mathrm {d}\eta }{\sqrt{V(\xi ,\eta )}} = \intop _{s}^{\infty } \frac{\mathrm {d}s}{\sqrt{S(s)}} = \tau _0, \end{aligned}$$
(73)

where the definite integral in Z has been replaced by a path-independent line integral. Skipping the intermediate steps described in Mordell (1914, 1915), the inversion of (73) results in

$$\begin{aligned} \xi= & {} -\sigma _z \xi _0 \sqrt{V_0} \wp '(\tau _0) + \frac{1}{2} \frac{\partial V_0}{\partial \eta _0} \wp (\tau _0) + \frac{1}{2} \frac{\partial {\tilde{h}}_0}{\partial \eta _0}, \nonumber \\ \eta= & {} -\sigma _z \eta _0 \sqrt{V_0} \wp '(\tau _0) - \frac{1}{2} \frac{\partial V_0}{\partial \xi _0} \wp (\tau _0) - \frac{1}{2} \frac{\partial {\tilde{h}}_0}{\partial \xi _0}, \end{aligned}$$
(74)

where \(V_0 = V(\xi _0,\eta _0)\), and \({\tilde{h}}_0\) is the Hessian covariantFootnote 1 of V.

From the correspondence rules (72), we derive

$$\begin{aligned}&\frac{\partial V_0}{\partial \xi _0} = \eta _0^3 W_1, \nonumber \\&\frac{\partial V_0}{\partial \eta _0} = \eta _0^2 \left( 4 \eta _0 W_0 - \xi _0 W_1\right) , \nonumber \\&{\tilde{h}}_0 = \frac{1}{144} \left( \frac{\partial ^2 V_0}{\partial \xi _0^2} \frac{\partial ^2 V_0}{\partial \eta _0^2} - \left( \frac{\partial ^2 V_0}{\partial \xi _0 \partial \eta _0} \right) ^2\right) = \frac{\eta _0^4 \left( 8 W_0 W_2 - 3 W_1^2\right) }{48}, \nonumber \\&\frac{\partial {\tilde{h}}_0}{\partial \xi _0} = \frac{\eta _0^3 \left( 6 W_0 W_3 - W_1 W_2\right) }{12}, \nonumber \\&\frac{\partial {\tilde{h}}_0}{\partial \eta _0} = \frac{\eta _0^2 \left( 8 \eta _0 W_0 W_2 - 3 \eta _0 W_1^2 + \xi _0 W_1 W_2 - 6 \xi _0 W_0 W_3\right) }{12}, \end{aligned}$$
(75)

where \(W_n\) are defined in Eq. (63). By letting \(Z=\xi /\eta \), and \(Z_0=\xi _0/\eta _0\), we find that (74) and (75) amount to

$$\begin{aligned} Z = \frac{\xi }{\eta } = Z_0 + \frac{48 W_0 \wp (\tau _0) + 8 W_0 W_2 - 3 W_1^2}{ - 24 \sigma _z \sqrt{W_0} \wp '(\tau _0) - 12 W_1 \wp (\tau _0)+ W_1 W_2 - 6 W_0 W_3}. \end{aligned}$$
(76)

Proceeding like in Sect. 3.1.3, we obtain the final form

$$\begin{aligned} Z = Z_0 + 4 a \, \frac{ X_0^2 \left[ \wp (\tau _0) - \frac{2}{3} \left( a Y_0 + Z_0^2\right) \right] - \left( a b + Y_0 Z_0\right) ^2 }{ X_0 \wp '(\tau _0) + 2 \left( a b + Y_0 Z_0\right) \left[ \wp (\tau _0) + \frac{1}{3}\left( a Y_0 + Z_0^2\right) \right] + 2 a X_0^2 Z_0}, \end{aligned}$$
(77)

looking different from the Weierstrass–Biermann solution (65), yet providing the same values of Z. Notably, the Mordell solution involves only the first power of \(\wp \).

In order to demonstrate the equivalence of the Weierstrass–Biermann and the Mordell solutions, one can multiply the numerator and the denominator in Eq. (65) by the factor

$$\begin{aligned} \left( 48 a X_0 \wp '(\tau _0) - \left( 12 \wp (\tau _0) - W_2 \right) W_1 - 6 W_0 W_3 \right) , \end{aligned}$$

use the identity \(\left( \wp '(\tau _0) \right) ^2 = S(\wp (\tau _0))\), and substitute the expressions of the invariants in terms of \(W_n\)

$$\begin{aligned} g_2 = \frac{W_2^2}{12} - \frac{W_2 W_3}{4} + W_0 W_4, \qquad g_3 = \left| \begin{array}{ccc} W_4 &{} \frac{1}{4} W_3 &{} \frac{1}{6} W_2 \\ \frac{1}{4} W_3 &{} \frac{1}{6} W_2 &{} \frac{1}{4} W_1 \\ \frac{1}{6} W_2 &{} \frac{1}{4} W_1 &{} W_0 \\ \end{array} \right| , \end{aligned}$$
(78)

by analogy with the definitions (134) and (135). The result of this procedure is the Mordell solution (77). Obviously, both (65) and (77) admit the same limit expression (58) when \(X_0=0\) and \(\tau _0=\tau _j\).

As usually, the solution for X and Y can be derived from the equations of motion and the energy integral, like in Sect. 3.1.3.

3.2 Solution in terms of the Jacobi elliptic functions

3.2.1 Weierstrass functions in terms of the Jacobi functions

Although the solution in terms of the Weierstrass \(\wp \) function presented in Sect. 3.1 may look universal, its qualitative properties depend on the values of the invariants through the sign of the discriminant \(\varDelta \). Indeed, the sign plays the central role in expressing the solution in terms of the Jacobian elliptic functions. In this section, only the generic, \(\varDelta \ne 0\) cases are to be discussed.

The basic relation between the Weierstrass and Jacobi functions is formally universal (Byrd and Friedman 1971)

$$\begin{aligned} \wp (\tau )= & {} e_3 + \left( \frac{\gamma _p}{\mathrm {sn}(u_p,k_p)}\right) ^2 = e_3 + \gamma _p^2 \, \frac{1+\mathrm {dn}(2 u_p,k_p)}{1- \mathrm {cn}(2 u_p,k_p)} , \end{aligned}$$
(79)
$$\begin{aligned} \wp '(\tau )= & {} - 2 \left( \frac{\gamma _p}{\mathrm {sn}(u_p,k_p)}\right) ^3 \mathrm {cn}(u_p ,k_p)\,\mathrm {dn}(u_p ,k_p) \nonumber \\= & {} - 2 \gamma _p^3 \, \frac{\left( \mathrm {cn}(2 u_p,k_p)+\mathrm {dn}(2 u_p,k_p)\right) \left( 1+\mathrm {dn}(2 u_p,k_p)\right) }{\mathrm {sn}(2 u_p,k_p) \left( 1- \mathrm {cn}(2 u_p,k_p)\right) }, \end{aligned}$$
(80)

where

$$\begin{aligned} u_p = \gamma _p \tau , \qquad \gamma _p = \sqrt{e_1-e_3}, \qquad k_p = \sqrt{\frac{e_2-e_3}{e_1-e_3}}. \end{aligned}$$
(81)

But if we restrict considerations to the real arguments and moduli of the elliptic functions, the above expressions are valid only if the discriminant \(\varDelta \) from Eq. (136) is positive.

When \(\varDelta < 0\), which means complex \(e_2\) and \(e_3\), the appropriate form is (Abramowitz and Stegun 1972)

$$\begin{aligned} \wp (\tau )= & {} = e_1 + \gamma _n \,\left( \frac{\mathrm {cn}(u_n,k_n)}{ \mathrm {sn}(u_n, k_n) \mathrm {dn}(u_n, k_n)}\right) ^{2} = e_1 + \gamma _n \, \frac{1+\mathrm {cn}(2 u_n, k_n)}{1-\mathrm {cn}(2 u_n, k_n)}, \end{aligned}$$
(82)
$$\begin{aligned} \wp '(\tau )= & {} -2 \left( \frac{ \sqrt{\gamma _n} }{\mathrm {sn}(u_n, k_n) \mathrm {dn}(u_n, k_n)} \right) ^3 \, \mathrm {cn}(u_n,k_n) \, \left( 1-k_n^2 + k_n^2 \mathrm {cn}^4(u_n,k_n) \right) \nonumber \\= & {} - 4 \gamma _n^\frac{3}{2} \, \frac{\mathrm {sn}(2 u_n, k_n) \mathrm {dn}(2 u_n, k_n)}{\left( 1 - \mathrm {cn}(2 u_n,k_n)\right) ^2}, \end{aligned}$$
(83)

where

$$\begin{aligned} u_n = \sqrt{\gamma _n} \tau , \qquad \gamma _n = \gamma _p \sqrt{e_1-e_2} = \frac{1}{2} \sqrt{12 e_1^2 - g_2}, \qquad k_n = \frac{1}{2} \sqrt{2 - \frac{3 e_1}{\gamma _n} }. \end{aligned}$$
(84)

The two cases are linked by the complex modulus transformation—see Byrd and Friedman (1971, formula 165.07).

Let the initial conditions at the epoch \(t_0\) be \((X_0,Y_0,Z_0)\). The equations relating the Jacobi and Weierstrass functions can be substituted in to any of the solution forms provided in Sect. 3.1. For the Weierstrass–Biermann or the Mordell form, it is enough to compute the energy \(E=E(Y_0,Z_0)\) and the discriminant \(\varDelta \) to choose the appropriate set (79,80) or (82,83). Then, from the invariants \(g_2, g_3\) the roots \(e_1, e_2, e_3\) are found, which allows the computation of \(\mathbf {R}(t)\) or \(\mathbf {r}(t)\) for any epoch t.

Below, we discuss the Safford form from Sect. 3.1.4, which allows to use the initial conditions at any \(t_0\), but requires the turning point coordinates \((X_j=0, Y_j=y_j+a, Z_j = z_j-b)\) as supplementary parameters. Having computed two appropriate turning points, such that either \(Z_1< Z_0 < Z_2\), or \(Z_3< Z_0 < Z_4\), we pick one of them as the reference point \(Z_j\) to be used in Eq. (70). Then, after determining phase \(\phi _j\) with respect to the turning point \(Z_j\), the motion can be computed for any epoch t, using \(\tau _j = \tau _0+\phi _j\).

3.2.2 Motion in \(\mathrm {D}_1\) and \(\mathrm {D}_4\)

The case \(\varDelta > 0\) occurs only in type IV, when the energy is bounded by \(E_4< E < E_1\): either in the domain \(\mathrm {D}_1\), where turning points \(Z_3, Z_4\) are given by Eq. (19) with \(Z_4^*< Z_3 < Z_4\), or in the domain \(D_4\), where the turning points are \(Z_1, Z_2\) given by Eq. (18) and \(Z_1< Z_2 < Z_4^*\). Then, selecting any appropriate \(Z_j\) as the reference point, we can use the expressions

$$\begin{aligned} X= & {} \frac{A_p \gamma _p \mathrm {sn}(u_p,k_p) \mathrm {cn}(u_p,k_p) \mathrm {dn}(u_p,k_p)}{\left( 1+ B_p \mathrm {sn}^2(u_p,k_p)\right) ^2} \end{aligned}$$
(85)
$$\begin{aligned} Y= & {} Y_j + \frac{Z^2-Z_j^2}{2 a } = Y_j - \frac{A_p \mathrm {sn}^2(u_p,k_p)}{1+ B_p \mathrm {sn}^2(u_p,k_p)} \left( Z_j - \frac{a A_p \mathrm {sn}^2(u_p,k_p)}{ 1+ B_p \mathrm {sn}^2(u_p,k_p)}\right) \end{aligned}$$
(86)
$$\begin{aligned} Z= & {} Z_j - \frac{a A_p \mathrm {sn}^2(u_p,k_p)}{1+ B_p \mathrm {sn}^2(u_p,k_p)} \end{aligned}$$
(87)

where

$$\begin{aligned} A_p = \frac{2 (a b + Y_j Z_j)}{e_1-e_3}, \quad B_p = \frac{\frac{1}{3} \left( a Y_j + Z_j^2 \right) + e_3}{e_1-e_3}, \end{aligned}$$
(88)

and the phase \(\phi _j\) in

$$\begin{aligned} u_p = \gamma _p (\tau _0 + \phi _j), \end{aligned}$$
(89)

can be computed from the incomplete elliptic function of the first kind F:

$$\begin{aligned} \gamma _p \phi _j= & {} \mathrm {F}(v_j,k_p), \end{aligned}$$
(90)
$$\begin{aligned} v_j= & {} \mathrm {am}(\gamma _p \phi _j, k_p) = \mathrm {sgn}{(A_p X_0)}\,\arcsin \sqrt{\frac{Z_j-Z_0}{a A_p - (Z_j-Z_0) B_p}}. \end{aligned}$$
(91)

The motion is periodic, and the period \(P_t\) (with respect to time t) is given by the complete elliptic integral of the first kind K

$$\begin{aligned} P_t = \frac{4 \mathrm {K}(k_p)}{\gamma _p}. \end{aligned}$$
(92)

3.2.3 Motion in \(\mathrm {D}_2\), \(\mathrm {D}_3\), and \(\mathrm {D}_{23}\)

The common feature of trajectories in domains \(\mathrm {D}_2\), \(\mathrm {D}_3\), and \(\mathrm {D}_{23}\) is the negative discriminant \(\varDelta \). Thus, given the initial conditions and resulting energy, we pick \(Z_1\) or \(Z_2\) computed from Eq. (25) as the reference point \(Z_j\) and compute, for any t

$$\begin{aligned} X= & {} \frac{A_n \sqrt{\gamma _n} (1-B_n) \mathrm {sn}(2 u_n,k_n) \mathrm {dn}(2 u_n,k_n)}{\left( 1- B_n \mathrm {cn}(2u_n,k_n)\right) ^2} \end{aligned}$$
(93)
$$\begin{aligned} Y= & {} Y_j + \frac{Z^2-Z_j^2}{2 a } = Y_j - A_n \frac{ 1-\mathrm {cn}(2u_n,k_n) }{1 - B_n \mathrm {cn}(2u_n,k_n)} \left( Z_j - \frac{a A_n}{2} \,\frac{ 1-\mathrm {cn}(2u_n,k_n)}{ 1 - B_n \mathrm {cn}(2u_n,k_n)}\right) , \end{aligned}$$
(94)
$$\begin{aligned} Z= & {} Z_j - a A_n\, \frac{ 1-\mathrm {cn}(2u_n,k_n)}{ 1 - B_n \mathrm {cn}(2u_n,k_n)}, \end{aligned}$$
(95)

where

$$\begin{aligned} A_n = \frac{2 (a b + Y_j Z_j)}{e_1+ \gamma _n + \frac{1}{3} \left( a Y_j + Z_j^2 \right) }, \quad B_n = 1- \frac{2 \gamma _n}{e_1+ \gamma _n + \frac{1}{3} \left( a Y_j + Z_j^2 \right) }. \end{aligned}$$
(96)

The phase in

$$\begin{aligned} u_n = \sqrt{\gamma _n} ( \tau _0 + \phi _j), \end{aligned}$$
(97)

results from

$$\begin{aligned} 2 \sqrt{\gamma _n} \phi _j= & {} \mathrm {F}(v_j,k_n), \end{aligned}$$
(98)
$$\begin{aligned} v_j= & {} \mathrm {am}(2 \sqrt{\gamma _n} \phi _j, k_n) = \mathrm {sgn}{(A_n X_0)}\,\arccos {\left( \frac{a A_n + Z_0-Z_j}{a A_n + (Z_0-Z_j) B_n} \right) }. \end{aligned}$$
(99)

The related period is

$$\begin{aligned} P_t = \frac{4 \mathrm {K}(k_n)}{\sqrt{\gamma _n}}. \end{aligned}$$
(100)

The above formulation is universal, i.e., appropriate in types II, III, and IV, provided \(\varDelta < 0\).

3.3 Special cases

3.3.1 Reduction rules

By special cases, we mean the ones where elliptic functions reduce to the elementary ones. They could be studied by taking limits of the Jacobi functions at \(k_n\) or \(k_p\) tending to 0 or 1. But it is more direct to observe that each of the special cases, be it the Cassini states, separatrices, or the special curve \(\varGamma _3\), results from the reduction of the Weierstrass function \(\wp (u,g_2,g_3)\) to the special case

$$\begin{aligned} \wp (u;3,1) = 1 + \frac{3}{2 \tan ^2{\left( \sqrt{\frac{3}{2}} u\right) }}. \end{aligned}$$
(101)

Since \(\varDelta =0\) implies \((g_2/3)^3 = g_3^2\), reduction to the above form is always possible by the homogeneity relations (Abramowitz and Stegun 1972)

$$\begin{aligned} \wp ( u ; g_2,g_3) = \lambda ^{-2} \wp (\lambda ^{-1} u; \lambda ^4 g_2 , \lambda ^6 g_3), \end{aligned}$$
(102)

except for \(g_2=g_3=0\), when

$$\begin{aligned} \wp (u;0,0) = u^{-2}. \end{aligned}$$
(103)

For the derivative \(\wp '\), respective equations are obtained by straightforward differentiation.

Depending on the sign of \(g_3\), two procedures are available. For \(g_3 > 0\), the substitution of \(\lambda =g_3^{-\frac{1}{6}}\) leads straight to

$$\begin{aligned} \wp ( u ; g_2,g_3) = e_1 \wp (\sqrt{e}_1\, u; 3 , 1), \end{aligned}$$
(104)

where \(g_3=e_1^3\), according to Eq. (145).

If \(g_3 <0\), two steps are taken. First, letting \(\lambda =\mathrm {i}\), we convert

$$\begin{aligned} \wp ( u ; g_2,g_3) = - \wp (\mathrm {i}\, u; g_2,-g_3). \end{aligned}$$
(105)

Then, with \(\lambda = (-g_3)^{-\frac{1}{6}}\), we recover

$$\begin{aligned} \wp ( u ; g_2,g_3) = - 2 e_{12} \wp (\mathrm {i}\,\sqrt{2 e_{12}} \, u; 3, 1), \end{aligned}$$
(106)

according to Eq. (146).

3.3.2 Cassini states \(C_2\) and \(C_3\)

When discussing the Cassini states, we can use the simplest form (58) for Z. Although the time dependence at the Cassini states does vanish due to \(a b+ Y_0 Z_0 = -W'(Z_0)/(8a) = 0\), and \(X_0=0\), it remains of interest to inspect \(\wp (\tau )\) in the numerator, because its period is the period of small oscillations around the stable equilibrium.

Given a and b, one should first establish the problem type in order to compute the appropriate coordinate \(Z^*_2 = z^*_2-b\), or \(Z^*_3 = z^*_3-b\). These are Eqs. (33), (40), and (36) for the types II, III, and IV, respectively. Starting from this point, the procedure is common: \(Z^*_j\) gives the energy of the Cassini state

$$\begin{aligned} E_j = - \frac{\left( Z^*_j\right) ^3 + a^2 b}{2 Z^*_j}, \qquad j=2,3, \end{aligned}$$
(107)

which substituted in Eq. (134) or (135) gives the invariants \(g_2(E_j)\) or \(g_3(E_j)\)—both positive.

Resorting to Eqs. (104) and (101), we find that Z in solution (58) depends on the squared tangent of \(\sqrt{3 e_1/8}\, t\) which implies the period

$$\begin{aligned} P_j = 2 \pi \sqrt{\frac{2}{3 e_1(E_j)}}, \qquad j=2,3, \end{aligned}$$
(108)

where \(e_1(E_j) = \root 3 \of {g_3}= \sqrt{g_2/2}\). The same result can be obtained by taking the limit of (100) at \(k_n \rightarrow 0\).

3.3.3 Cassini state \(C_4\) and homoclinic orbits \(\varGamma _1\), \(\varGamma _2\)

The energy of the unstable Cassini state \(C_4\) can be evaluated from Eq. (107) with \(j=4\), and \(Z^*_4\) given by (36). However, we need it not for the Cassini state itself, but rather to describe the motion on homoclinic orbits having the energy \(E_4\), which are \(\varGamma _1\) and \(\varGamma _2\). To this end, we will use the Safford form (70) with the reference points given by Eq. (23), namely \(Z_4\) for \(\varGamma _1\), and \(Z_1\) for \(\varGamma _2\).

Since \(g_3<0\), the reduction (106) leads to

$$\begin{aligned} X= & {} \frac{{\tilde{A}} \sqrt{3 e_{12}} \sinh (2 {\tilde{u}})}{\left( 1+{\tilde{B}} \, \sinh ^2{\tilde{u}} \right) ^2}, \nonumber \\ Y= & {} Y_j + \frac{Z^2-Z_j^2}{2 a } = Y_j - \frac{ 2 {\tilde{A}} \, \sinh ^2{\tilde{u}} }{1+{\tilde{B}} \, \sinh ^2{\tilde{u}} } \left( Z_j - \frac{ a {\tilde{A}} \, \sinh ^2{\tilde{u}} }{1+{\tilde{B}} \, \sinh ^2{\tilde{u}} }\right) , \nonumber \\ Z= & {} Z_j - \frac{ 2 a {\tilde{A}} \, \sinh ^2{\tilde{u}} }{1+{\tilde{B}} \, \sinh ^2{\tilde{u}} }, \end{aligned}$$
(109)

with the coefficients

$$\begin{aligned} {\tilde{A}} = \frac{ a b + Y_j Z_j}{3 e_{12}}, \quad {\tilde{B}} = \frac{e_{12}+ \frac{1}{3} \left( a Y_j + Z_j^2 \right) }{3 e_{12}}, \end{aligned}$$
(110)

where \(e_{12}\) is given by Eq. (146). The argument

$$\begin{aligned} {\tilde{u}} = \sqrt{3 e_{12}} ( \tau _0 + \phi _j), \end{aligned}$$
(111)

whose phase with respect to \(Z_j\) at the initial epoch \(t_0\) is given by

$$\begin{aligned} \phi _j = \frac{\mathrm {sgn}{(X_0 \, {\tilde{A}})}}{\sqrt{3 e_{12}}} \, \mathrm {arsinh} \sqrt{\frac{Z_j-Z_0}{2 a {\tilde{A}} - {\tilde{B}} (Z_j-Z_0)}}. \end{aligned}$$
(112)

Note that the time rate of argument \({\tilde{u}}\) is the same at both the separatrices, and as expected the solution tends to the Cassini state \(C_4\) asymptotically at \(t \rightarrow \pm \infty \).

3.3.4 Cassini state \(C_1\) and special orbit \(\varGamma _3\)

The Cassini state \(C_1\), being a stable equilibrium with \(g_3>0\), is characterized by the period of small oscillations given directly by the formula (108) with the energy \(E_1\) evaluated at \(Y^*_1\), \(Z^*_1\) deduced from Eq. (36). What makes the difference, compared to \(C_2\) or \(C_3\), is the presence of another trajectory having the energy \(E_1\)—the special curve \(\varGamma _3\).

The motion along \(\varGamma _3\) can be described using the Safford form solution (70) with respect to the turning points \(Z_1\) or \(Z_2\), given by Eq. (21). Then, performing the reduction (104), we obtain the solution in terms of trigonometric functions

$$\begin{aligned} X= & {} \frac{{\bar{A}} \gamma _3 \sin {2 {\bar{u}}}}{\left( 1+ {\bar{B}} \sin ^2{\bar{u}}\right) ^2}, \nonumber \\ Y= & {} Y_j + \frac{Z^2-Z_j^2}{2 a } = Y_j - \frac{ {\bar{A}} \sin ^2{\bar{u}}}{1+ {\bar{B}} \sin ^2{\bar{u}}} \left( Z_j - \frac{a {\bar{A}} \sin ^2{\bar{u}}}{2 \left( 1+ {\bar{B}} \sin ^2{\bar{u}} \right) }\right) , \nonumber \\ Z= & {} Z_j - \frac{a {\bar{A}} \sin ^2{\bar{u}}}{1+ {\bar{B}} \sin ^2{\bar{u}}}, \end{aligned}$$
(113)

where

$$\begin{aligned} {\bar{A}} = \frac{4 (a b + Y_j Z_j)}{3 e_1}, \quad {\bar{B}} = \frac{\frac{2}{3} \left( a Y_j + Z_j^2 \right) -e_1}{3 e_1}. \end{aligned}$$
(114)

The argument \({\bar{u}}\) is

$$\begin{aligned} {\bar{u}} = \gamma _3 (\tau _0 + \phi _j), \end{aligned}$$
(115)

where \(\gamma _3 = \sqrt{3 e_1/2}\), as in Eq. (81) for \(e_3= -e_1/2 \); hence, the time period of the solution is the same as that of small oscillations around \(C_1\), i.e., \(P_1\) given by Eq. (108). The phase \(\phi _j\) can be computed from

$$\begin{aligned} \gamma _3 \phi _j = \mathrm {sgn}{({\bar{A}} X_0)}\,\arcsin \sqrt{\frac{Z_j-Z_0}{a {\bar{A}} - (Z_j-Z_0) {\bar{B}}}}. \end{aligned}$$
(116)

The above solution can be obtained either from (86) with \(k_p=0\), and \(e_3 = -e_1/2 \), or—in a different form—from (94) with \(k_n=0\), and \(g_2=3 e_1^2\).

3.3.5 Cassini state \(C_{14}\) and special orbit \(\varGamma _{23}\)

The most degenerate case occurs in type III, where \(C_{14}\) is the cusp (parabolic) equilibrium with energy \(E_{14}\) given by Eq. (42). The homoclinic curve \(\varGamma _{23}\) with this energy can be parameterized using rational functions of time, as indicated by the reduction formula (103). For the sake of using the Safford form, we introduce

$$\begin{aligned} u_{23} = t-t_0 + \tau _{23}, \qquad \tau _{23} = t_0-t_\mathrm {m}, \end{aligned}$$
(117)

where \(\tau _{23}\) is the time interval between the initial epoch \(t_0\) and the epoch of crossing the reference turning point with coordinates \(Y_\mathrm {m} = y_\mathrm {m}+a\), and \(Z_\mathrm {m} = z_\mathrm {m}-b\), as given by Eq. (43). Then, with \(\beta = \root 3 \of {a^4 b^2}\),

$$\begin{aligned} X= & {} - \frac{8 a b u_{23}}{\left( 1+\beta u_{23}^2 \right) ^2}, \end{aligned}$$
(118)
$$\begin{aligned} Y= & {} Y_\mathrm {m} + \frac{4 a b u_{23}^2}{1+\beta u_{23}^2 } \left( Z_m + \frac{2 a^2 b u_{23}^2}{1+\beta u_{23}^2 } \right) , \end{aligned}$$
(119)
$$\begin{aligned} Z= & {} Z_\mathrm {m} + \frac{4 a^2 b u_{23}^2}{1+\beta u_{23}^2 }. \end{aligned}$$
(120)

The time offset \(\tau _{23}\) is given by

$$\begin{aligned} \tau _{23} = - \mathrm {sgn}{X_0} \sqrt{\frac{Z_0-Z_\mathrm {m}}{4 a^2 b - \beta \left( Z_0-Z_\mathrm {m}\right) }}. \end{aligned}$$
(121)

4 Conclusions

It is common to describe the integral curves of the Colombo top problem as an intersection of a parabolic cylinder and a unit sphere, the latter being centered at the origin of the xyz coordinate system. We have proposed a new point of view: the curves can be tracked along the intersection of two out of the three invariant surfaces: a parabolic cylinder, a sphere, and a paraboloid of revolution. This thread has been merely signaled, but it can be of possible interest when designing geometric integrators for the numerical treatment of the Colombo top motion. Moreover, using the shifted coordinates XYZ, one introduces the symmetry to the parabolic cylinder on the paraboloid (at the expense of having an off-centered sphere) which does simplify a number of expressions given in this work.

When partitioning the phase space of the Colombo top problem, we have completed the landscape, well known from earlier works, with an interesting but hitherto overlooked feature: the trajectory \(\varGamma _3\) which is unique by being periodic, yet expressible in terms of elementary functions of time. Its presence calls for the distinction of \(\mathrm {D}_3\) and \(\mathrm {D}_4\) domains even though qualitatively they look similar. It also adds to a better understanding of the parametric bifurcation associated with the passage from type II, to type IV.

The analytical expressions for the turning points of the Colombo top trajectories as functions of energy, given in Sect. 2.3, had not been reported so far. The expressions for the location of the Cassini states from Sect. 2.4 depend only on parameters ab. Up to some rearrangement of terms, they are similar to those of Saillenfest et al. (2019) in type II or III. For type IV, when \(a^\frac{2}{3}+b^\frac{2}{3} < 1\), the Cardano form provided by Saillenfest et al. (2019) is formally correct, but it gives real values only as the sums of two complex conjugates (casus irreducibilis of the resolvent cubic). In the present work, we have preferred to use expressions based on the purely real trigonometric form whenever the quartic has a positive discriminant.

The differential equation for \({\dot{Z}}\), with its right-hand side proportional to the square root of the degree 4 polynomial, is not a novelty in celestial mechanics. The same form pops up while discussing the second fundamental model of resonance (Henrard and Lemaitre 1983). Its solution in terms of the Weierstrass elliptic function has always been given either in the simplified form of Eq. (55), as in Ferraz-Mello (2007), or in the Biermann-Weierstrass form (Nesvorný and Vokrouhlický 2016). We have taken an opportunity to recall other possibilities (Safford and Mordell forms) than can be of use in other applications as well.

We hope that the results of the present work will facilitate the study of perturbed Colombo top problems. They should be useful either as the kernel of analytical perturbation procedures, or as a building block of numerical integrators based upon composition methods.