Analytical solution of the Colombo top problem

Haponiak, J.; Breiter, S.; Vokrouhlický, D.

doi:10.1007/s10569-020-09963-7

Analytical solution of the Colombo top problem

Original Article
Open access
Published: 29 April 2020

Volume 132, article number 24, (2020)
Cite this article

Download PDF

You have full access to this open access article

Celestial Mechanics and Dynamical Astronomy Aims and scope Submit manuscript

Analytical solution of the Colombo top problem

Download PDF

J. Haponiak¹,
S. Breiter¹ &
D. Vokrouhlický²

2178 Accesses
7 Citations
8 Altmetric
1 Mention
Explore all metrics

Abstract

The Colombo top is a basic model in the rotation dynamics of a celestial body moving on a precessing orbit and perturbed by a gravitational torque. The paper presents a detailed study of analytical solution to this problem. By solving algebraic equations of degree 4, we provide the expressions for the extreme points of trajectories as functions of their energy. The location of stationary points (known as the Cassini states) is found as the function of the two parameters of the problem. Analytical solution in terms of the Weierstrass and the Jacobi elliptic functions is given for regular trajectories. Some trajectories are expressible through elementary functions: not only the homoclinic orbits, as expected, but also a special periodic solution whose energy is equal to that of the first Cassini state (unnoticed in previous studies).

We’re sorry, something doesn't seem to be working properly.

Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Only about $60\%$ of the Moon surface can be seen from the Earth. The first successful attempt to explain this fact was made by Cassini (1693), who borrowed the kinematic model of the ‘triple Earth motion’ from Copernicus (1543, Lib. 1,Cap. 11), and applied it to the Moon with some important amendments. Retaining the postulate of the fixed angle between the rotation axis and the orbital plane, Cassini postulated the equality of orbital period and the sidereal rotation period, which protected the far side from being seen from the Earth. By additionally postulating that the rotation axis, the ecliptic pole, and the lunar orbit pole remain coplanar, Cassini suppressed the possibility of revealing the complete polar caps over one lunar axis precession cycle. Two centuries later, Tisserand (1891) rephrased these postulates as laws. The three Cassini laws of Tisserand state that: (1) rotation and orbital periods are equal, (2) the rotation axis has a constant inclination to the ecliptic, and (3) the three axes are coplanar. The second law differs from the original Cassini’s statement, but in view of the third law, the difference is unimportant.

With the advent of the Newtonian dynamics, the question arose if the Cassini’s model is consistent with equations of motion. This was a part of the problem issued by the French Académie Royale des Sciences for the Prize of 1764. In his prize dissertation and in a later work, Lagrange (1764, 1780) demonstrated that the state described by Cassini is an equilibrium of the associated differential system and studied small librations in its vicinity.

The work of Colombo (1966) brought a new understanding of the Cassini laws and motion near the stationary configuration which they describe in the specific case of the Moon. In particular, Colombo demonstrated that the second and the third laws are conceptually independent from the first law and themselves serve as a basis of an interesting dynamical problem which describes the long-term evolution of the spin axis of an arbitrary rotating body. He thus dropped from his analysis the assumption of the direct spin-orbit resonance, but kept the assumption of general precession of the orbital plane due to perturbations (either caused by the oblate central body, or by other masses in the system). He showed that the long-term dynamics of the spin axis can be described by a simple, one-dimensional problem assuming the orbital node performs a uniform precession and the inclination remains constant. Its stationary points represent generalizations of the Cassini second and third laws.

It was soon understood that the Colombo problem is a very suitable starting point for analysis of the obliquity evolution of terrestrial planets, even when the orbital node and inclination undergo more complex evolution. A fascinating example are studies of Mars obliquity variations in relation to this planet’s past paleoclimate, starting with Ward (1973, 1974). Tides or internal process may additionally change some of the system’s parameters, a situation relevant to all terrestrial planet studies, including the Moon—see Peale (1974), Ward (1975, 1982), or Ward and de Campli (1979), to mention just few examples of early works. Later studies of Laskar and colleagues made a masterful use of detailed knowledge of planetary long-term dynamics and its implications on secular evolution of their spin axes (e.g., Laskar and Robutel 1993; Laskar et al. 1993; Correia and Laskar 2001, and many other with more technical details). Following earlier hints, mentioned already in Harris and Ward (1982), applications to giant planets were also developed in the past two decades (e.g., Ward and Hamilton 2004; Hamilton and Ward 2004; Ward and Canup 2006; Boué et al. 2009; Vokrouhlický and Nesvorný 2015; Brasser and Lee 2015; Rogoszinski and Hamilton 2020).

Beyond planets and satellites, studies of secular spin evolution of asteroids flourished recently, especially after Vokrouhlický et al. (2003) applied it to explain space parallelism of spin axes of large members in the Koronis family (see also Vokrouhlický et al. (2006)). Further applications include spin states of exoplanets (e.g., Atobe et al. 2004; Atobe and Ida 2007; Saillenfest et al. 2019), or artificial satellites and space debris (e.g., Efimov et al. 2018).

Taken altogether, we note that the backbone of all these studies is the basic Colombo model. Interestingly, a systematic mathematical treatment of this elegant Hamiltonian problem has not been significantly advanced beyond the state of art dating back to Henrard and Murigande (1987), and Henrard (1993). These classical works focused on the aspects directly related with the probability of capture into different phase space zones when a slow parameter evolution drives the system across the homoclinic orbits; hence, they paid no attention to regular orbits. One can also recognize the Colombo top in an anonymous quadratic Hamiltonian treated by Lanchares and Elipe (1995), who focused on the qualitative study of its parametric bifurcations.

The main reason to seek for the complete analytical solution of the Colombo top is its significance for further studies of more realistic, perturbed problems. Be it analytical perturbation techniques, or numerical splitting methods, the knowledge of explicit time dependence of the Colombo top motion is crucial. The present work is divided in two principal parts: Sect. 2 explores the problem using purely geometric and algebraic tools, whereas the integration of equations of motion is considered in Sect. 3. In other words, Sect. 2 concerns integral curves, that become time-dependent trajectories in Sect. 3.

The formulation the problem is given in Sect. 2.1, where we introduce two sets of variables: traditional x, y, z and shifted X, Y, Z. Throughout the text, we switch between the two sets, depending on convenience. In Sect. 2.2, the geometric construction of the integral curves is shown; intersections of the curves with the $x=X=0$ meridians (their extremities in z) are found in Sect. 2.3 , expressed in terms of the energy constant and of the two parameters a, b. Some of these can be critical points (the Cassini states), which allows the expression in terms of the parameters only—given in Sect. 2.4. The information gained allows to partition the phase space and distinguish three types of the Colombo top problem.

In Sect. 3, we first provide a universal solution in terms of the Weierstrass elliptic function $\wp $ (Sect. 3.1), comparing various formulations of the same result. But since the Weierstrass function behaves differently in various domains of the phase space, we reformulate the solution in terms of the more common Jacobi elliptic functions (Sect. 3.2). Finally, the specific trajectories that admit solutions in terms of elementary functions are presented in Sect. 3.3. The closing Sect. 4 summarizes the results and their implications.

2 The Colombo top model

2.1 Equations of motion

The basic assumptions leading to the Colombo top problem involve a rigid body on an orbit around some distant primary. The orbital motion might be called Keplerian, but with one notable addition: the orbital plane rotates uniformly around some fixed axis in the inertial space with the angular rate $\mu $. The body is assumed to rotate in the lowest energy state, namely about the shortest principal axis of its inertia tensor. Following the work of Colombo (1966), we assume the rotation period is not in resonance with the revolution period about the primary (see, e.g., Peale 1969 for generalizations to this situation). Considering the quadrupole torque due to the gravitational field of the primary, and averaging over both orbital revolution about the center and rotation cycle about the spin axis, one easily realizes that the total angular momentum of rotation (or, equivalently, the angular velocity $\omega $ of rotation) is conserved. The whole dynamical problem then reduces to the analysis of the motion of the unit vector $\mathbf{r}$ of rotation pole in space.

In order to introduce fundamental astronomical parameters of relevance, let us for a moment assume $\mu =0$, i.e., the orbital plane is fixed in the inertial space. In this simple case, $\mathbf{r}$ performs regular precession about the fixed direction $\mathbf{K}$ of the orbital angular momentum with a frequency

$$\begin{aligned} \mu _r = - \alpha \, \cos \varepsilon , \end{aligned}$$

(1)

where $\alpha $ is the precession constant and $\varepsilon $ is the obliquity, namely the angle between $\mathbf{r}$ and $\mathbf{K}$ ($\cos \varepsilon =\mathbf{r} \cdot \mathbf{K}$). Note the minus sign in Eq. (1) which indicates polar regression in the inertial space. In this model, $\varepsilon $ stays constant and

$$\begin{aligned} \alpha = \frac{3}{2} \frac{n^2}{\omega }\frac{E_b}{(1-e^2)^{3/2}}, \end{aligned}$$

(2)

where n is the orbital mean motion about the primary, $\omega $ the angular rotational frequency, $E_b$ the dynamical ellipticity, and e the orbital eccentricity. The dynamical ellipticity expresses degree of non-sphericity of the body, and it is defined using principal moments $A\le B\le C$ of the inertia tensor as

$$\begin{aligned} E_b = \frac{C-(A+B)/2}{C}. \end{aligned}$$

(3)

Things become more interesting in the case where the orbital plane about the primary is not constant. As mentioned above, the Colombo top model describes the situation when it performs a uniform precession in the inertial space. In particular, $\mathbf{K}$ revolves uniformly on a cone about a fixed direction $\mathbf{K}'$ in space, such that the orbital inclination I with respect to the reference plane normal to $\mathbf{K}'$ is constant. The magnitude of the precession rate is $\mu $, though most often the orbital plane performs again regression in the inertial space (assuming $I<90^\circ $). The interest and complexity of this model revolves about a possibility of a resonance between the two precession frequencies $-\mu $ and $\mu _r$. In order to describe it using a simple Hamiltonian model, Colombo (1966) observed it is useful to refer $\mathbf{r}$ to the reference frame following precession of the orbital plane, thus representing $\mathbf{r}^\mathrm {T} = (\sin \varepsilon \cos (h-\pi /2),\sin \varepsilon \sin (h-\pi /2),\cos \varepsilon )$. Here, $h-\pi /2$ is a longitude reckoned from the ascending node of the orbital plane and $\varepsilon $ is a colatitude measured from $\mathbf{K}$ as above. It is actually an advantage to introduce $H=\cos \varepsilon $ (and $\sin \varepsilon =\sqrt{1-H^2}$). This is because in terms of the symplectic variables (h, H), the Colombo top is a one degree of freedom, conservative problem with a Hamiltonian function

$$\begin{aligned} {\mathscr {H}}_s(h,H) = -\frac{(H-b)^2}{2} - a \sqrt{1-H^2} \, \cos {h} = E_\mathrm {H}, \end{aligned}$$

(4)

and two new nondimensional parameters are defined as

$$\begin{aligned} a = \frac{\mu }{\alpha }\,\sin I, \qquad b = \frac{\mu }{\alpha }\,\cos I. \end{aligned}$$

(5)

As observed by Henrard and Murigande (1987), the discussion can be confined to nonnegative constants a and b thanks to the symmetries $(a,h,H) \rightarrow (-a, h+\pi , H)$, and $(b,h,H) \rightarrow (-b,-h,-H)$ admitted by ${\mathscr {H}}_s$.

Since the equations of motion

$$\begin{aligned} {\dot{h}} = \frac{\partial {\mathscr {H}}_s}{\partial H} = b - H + \frac{a H \cos {h}}{\sqrt{1-H^2}}, \qquad {\dot{H}} = - \frac{\partial {\mathscr {H}}_s}{\partial h} = - a \sqrt{1-H^2} \sin {h}, \end{aligned}$$

(6)

are singular at $H^2 = 1$, it is better to use the Cartesian coordinates of the unit momentum vector

$$\begin{aligned} {\mathbf {r}}= \left( \begin{array}{c} x \\ y \\ z \\ \end{array} \right) = \left( \begin{array}{c} \sqrt{1-H^2} \sin {h} \\ -\sqrt{1-H^2} \cos {h} \\ H \\ \end{array} \right) . \end{aligned}$$

(7)

Then, similarly to Henrard and Murigande (1987), we obtain the Hamiltonian function

$$\begin{aligned} {\mathscr {H}}_\mathrm {HM}(x,y,z) = - \frac{(z-b)^2}{2} + a y = E_\mathrm {H}, \end{aligned}$$

(8)

that generates equations of motion

$$\begin{aligned} {\dot{x}}= & {} (z-b)(y+a) + a b, \nonumber \\ {\dot{y}}= & {} -(z-b)\,x,\nonumber \\ {\dot{z}}= & {} - a x. \end{aligned}$$

(9)

For the sake of minor simplification, let us propose the shifted variables

$$\begin{aligned} X=x, \quad Y = y+a, \quad Z = z-b, \quad \mathbf {R} = (X,Y,Z)^\mathrm {T} = \mathbf {r}+(0,a,-b)^\mathrm {T}. \end{aligned}$$

(10)

Adding a constant $a^2$ to the Hamiltonian (8), we simplify it to

$$\begin{aligned} {\mathscr {H}} = - \frac{Z^2}{2} + a Y = E, \end{aligned}$$

(11)

with the new energy constant

$$\begin{aligned} E = E_\mathrm {H}+a^2. \end{aligned}$$

(12)

Equations of motion for the shifted variables are

$$\begin{aligned} {\dot{X}}= & {} Z\,Y + a b, \nonumber \\ {\dot{Y}}= & {} -Z\,X, \nonumber \\ {\dot{Z}}= & {} - a X. \end{aligned}$$

(13)

Notably, when $a=0$, the problem is simplified to the symmetric free top, with constant Z and uniform rotation of $\mathbf {r}$ around the third axis, with the frequency $\sqrt{- 2 E} = |Z |$.

2.2 Geometric interpretation

It is customary to represent the integral curves of Eq. (9) as the intersections of two surfaces:

S1
a sphere $x^2+y^2+z^2=1$,
S2
a parabolic cylinder $(z-b)^2 - 2 a (y+a) + 2 E = 0$, implied by the energy integral.

The symmetry plane of S2 is $z=b$, and its vertex line is parallel to the x-axis, passing through $y= (E-a^2)/a$.

Observe that combining S1 and S2 we can also obtain another surface (see Fig. 1):

S3
a paraboloid of revolution $x^2+(y+a)^2+ 2 b z - \left( 2E - a^2 + b^2+1 \right) = 0$.

Actually, the S3-based function

$$\begin{aligned} {\mathscr {H}}_\mathrm {a}(\mathbf {r}) = \frac{x^2}{2} + \frac{(y+a)^2}{2} + b z = E' = E + \frac{1-a^2 + b^2}{2}, \end{aligned}$$

(14)

is an alternative Hamiltonian of the Colombo top, leading to the same equations of motion (9) as the Hamiltonian (8).

The paraboloid S3 has the symmetry axis parallel to the z-axis, and passing through the points $x=0$, $y=-a$. Its vertex is located at $z = E'/b$. The advantage of S3 appears when discussing the limit of $a=0$. Then, the paraboloid does not change the shape and the intersections of S1 and S3 are circles because of coincidence of the symmetry axes. Contrarily to this, setting $a=0$ in S2 results in degeneracy: the cylinder breaks in two parallel planes. On the other hand, $b=0$ turns the S3 paraboloid into a (circular) cylinder, whereas S2 retains its shape. Thus, S2 and S3 (or ${\mathscr {H}}$ and ${\mathscr {H}}_\mathrm {a}$) can be considered complementary, although the regularity at $a=0$ seems to us more favorable (e.g., if some perturbation approach is based upon the small inclination assumption).

An integral curve can be represented as a parametric curve in a number of ways. Of course, the best is to solve Eq. (9) and find $\mathbf {r}(t)$. But before we accomplish it (and, actually, in order to do it), let us consider parameterizations, where two coordinates are expressed in terms of the third. The most straightforward is to solve the system of S1 and S2 equations, using z as a parameter variable, which leads to

$$\begin{aligned} \mathbf {r}(z) = \left( \begin{array}{c} \pm \sqrt{ 1-z^2 - y(z)^2} \\ \frac{ (z-b)^2 + 2 E-2 a^2}{2a} \\ z \\ \end{array} \right) , \text{ or } \mathbf {R}(Z) = \left( \begin{array}{c} \pm \sqrt{ 1-(Z+b)^2 - \left( Y(Z)-a\right) ^2} \\ \frac{ Z^2 + 2 E}{2a} \\ Z \\ \end{array} \right) .\quad \end{aligned}$$

(15)

All three invariant surfaces S1, S2, S3 intersect the plane $x=0$, which is their common plane of symmetry. The integral curves, as the lines of intersection of S1, S2, and S3 also pass through $x=0$ and are symmetric with respect to this plane. Moreover, both y and z coordinates of a given integral curve attain their local extremes at $x=0$, according to equations of motion (13).

2.3 Intersections with the plane $x=0$

To find the intersection points of an integral curve with the plane $x=0$ for some specified energy E, it is enough to find either y or z coordinate. Knowing one of them, one might recover the other from the relation $y^2+z^2=1$. But this involves the ambiguity of sign; thus, it is better to find z, and then use the parametric equation (15) for y(z). To benefit from minor simplifications, we first find Z and then Y(Z).

According to Eq. (15), the relation between X and Z is

$$\begin{aligned} X = \pm \frac{1}{2a}\,\sqrt{W(Z)}, \end{aligned}$$

(16)

where W(Z) is a polynomial of degree 4

$$\begin{aligned} W(Z) = - Z^4 - 4 E Z^2 - 8 a^2 b Z - 4 \left( E - a^2\right) ^2 + 4 a^2 \left( 1- b^2\right) . \end{aligned}$$

(17)

Note the absence of the cubic term thanks to the use of the shifted variable Z.

Solving the quartic equation $W(Z)=0$ is a tedious task; the details can be found in “Appendix A”. Briefly, the four roots $Z_j$ are given in terms of the three roots $e_i$ of the reduced cubic resolvent equation (additionally modified to match the standard cubic polynomial appearing in the Weierstrass elliptic integrals). Depending on the parameters a and b, selecting some energy value E, the number of meridian intersection points is 4, 3, 2, 1, or 0.

Four intersection points mean that there are two distinct integral curves with the same energy, each intersecting $x=0$ in two points:

$$\begin{aligned} z_1= & {} b - \sqrt{e_1-\frac{2}{3}E} - \sqrt{e_2-\frac{2}{3}E} - \sqrt{e_3-\frac{2}{3}E}, \nonumber \\ z_2= & {} b - \sqrt{e_1-\frac{2}{3}E} + \sqrt{e_2-\frac{2}{3}E} + \sqrt{e_3-\frac{2}{3}E}, \end{aligned}$$

(18)

for a lower curve, and

$$\begin{aligned} z_3= & {} b + \sqrt{e_1-\frac{2}{3}E} - \sqrt{e_2-\frac{2}{3}E} + \sqrt{e_3-\frac{2}{3}E}, \nonumber \\ z_4= & {} b + \sqrt{e_1-\frac{2}{3}E} + \sqrt{e_2-\frac{2}{3}E} - \sqrt{e_3-\frac{2}{3}E}, \end{aligned}$$

(19)

for the upper curve, with $e_j$ defined in Eq. (141) as functions of E, a, and b, such that $z_1< z_2< z_3 < z_4$.

Three intersection points mean that either one of the two curves contracts to a single point, or two curves share the same intersection point. These situations are distinguished by the sign of invariant $g_3$ from Eq. (135).

If $g_3 > 0$, then the upper curve shrinks into a point with
$$\begin{aligned} z_{34} = b + \sqrt{e_1- \frac{2}{3}E}. \end{aligned}$$
(20)
The remaining intersection points of the lower curve are
$$\begin{aligned} z_1= 2 b - z_{34} - 2 \sqrt{e_{23}-\frac{2}{3}E}, \qquad z_2=2 b - z_{34} + 2 \sqrt{e_{23}-\frac{2}{3}E}. \end{aligned}$$
(21)
The expressions of $e_1$ and $e_{23}$ are given in Eq. (145).
If $g_3 < 0$, then the upper and the lower curves meet at
$$\begin{aligned} z_{23} = b + \sqrt{e_3 -\frac{2}{3}E}, \end{aligned}$$
(22)
with the remaining intersection points at
$$\begin{aligned} z_1= 2b - z_{23} - 2 \sqrt{e_{23}-\frac{2}{3}E}, \qquad z_4 = 2b - z_{23} + 2 \sqrt{e_{23}-\frac{2}{3}E}. \end{aligned}$$
(23)

If $e_1=e_2=e_3$, then $z_2$ from Eq. (21) becomes equal to $z_{34}$ and only two intersection points remain

$$\begin{aligned} z_1 = b - 3 \root 3 \of {a^2 b}, \qquad z_{234} = b+ \root 3 \of {a^2 b}, \end{aligned}$$

(24)

with $a^\frac{2}{3}+b^\frac{2}{3}=1$. This case requires a unique value of energy with a simple expression (154).

A more generic situation with two intersection points occurs when E defines only one trajectory, with $z_1 < z_2$ given by

$$\begin{aligned} z_1= & {} b - \sqrt{e_1-\frac{2}{3}E} - \sqrt{ e_1- 2 E + \sqrt{\nu }}, \nonumber \\ z_2= & {} b - \sqrt{e_1-\frac{2}{3}E} + \sqrt{ e_1 - 2 E + \sqrt{\nu } }, \nonumber \\ \nu= & {} \left( e_1+ \frac{4}{3}E\right) ^2+e_\mathrm {c}^2, \end{aligned}$$

(25)

where $e_1$ and $e_\mathrm {c}$ are defined in Eqs. (142) and (144). Two other roots of $W(z-b)=0$, i.e., $z_3= Z_3+b$ and $z_4 = {\overline{z}}_3$ are complex, so they do not define the intersections—see Eq. (151).

Finally, one intersection point means that for the energy E there is only one integral curve that contracted to a point with

$$\begin{aligned} z_{12} = b - \sqrt{e_1 - \frac{2}{3} E}, \end{aligned}$$

(26)

and $e_1$ given by Eq. (145).

In further discussion, the point of intersection of a regular curve with the $x=0$ plane will be named a turning point, unless it is an equilibrium from the dynamical point of view.

2.4 Cassini states and homoclinic orbits

Finding the Cassini states can be approached from different points of view. Geometrically, they are the points of tangency of the surfaces S1, S2, and S3. Algebraically, they are the multiple roots of $x(z,E)=0$, and $x(y,E)=0$. Dynamically, they are the fixed points of equations of motion (9) or, equivalently, the local extremes and saddle points of the Hamiltonian on a sphere S1.

In principle, the multiple roots have been found in Sect. 2.3. But they are given in terms of E, which is an implicit function of a and b as a root of $\varDelta (E,a,b)=0$, where $\varDelta $ is the discriminant of W(Z), defined in Eq. (136). Hence, one possible way is to find the roots of $\varDelta =0$, which is a quartic equation in E, and substitute them into $z_{12}$, $z_{23}$, or $z_{34}$ from Sect. 2.3 (we already did it for $z_{234}$). The alternative is to find the stationary points (x, y, z) of the Hamiltonian and use them to evaluate the energy by the substitution into ${\mathscr {H}}(x,y,z)=E$. The latter path is more convenient and was recently taken by Saillenfest et al. (2019).

Discussing the Cassini states as the multiple roots of $W(Z)=0$, we should include the information, that they are also the roots of its derivative $W'(Z)=0$. Since two polynomials have a common root if and only if their resultant equals zero, we ask about the solutions of $R(W(Z),W'(Z))=0$. If we treat W(Z) and $W'(Z)$ as polynomials in Z, the result is simply the discriminant $\varDelta $ from Eq. (136), multiplied by a constant (negative) factor. Yet, we can also treat W(Z) and $W'(Z)$ as polynomials of E with degrees 2 and 1, respectively, whose coefficients depend on Z. So, using

$$\begin{aligned}&W(Z) = U_1(E) = - 4 E^2 + 4 \left( 2 a^2 - Z^2 \right) E - Z^4 - 4 a^2 \left( 2 b Z + a^2 + b^2 - 1\right) , \end{aligned}$$

(27)

$$\begin{aligned}&W'(Z) = U_2(E) = - 8 Z E - 4 Z^3 - 8 a^2 b, \end{aligned}$$

(28)

we obtain the resultant $R(U_1(E),U_2(E)) = 256 a^2 w(Z)$, where

$$\begin{aligned} w(Z) = - Z^4 - 2 b Z^3 + 3 \rho \, Z^2 - 2 a^2 b Z - a^2 b^2, \end{aligned}$$

(29)

and

$$\begin{aligned} \rho = \frac{1-a^2-b^2}{3}. \end{aligned}$$

(30)

For easier reference to Saillenfest et al. (2019), we abandon the shifted variables and solve $w(z-b)=0$, which is the same equation as the one they used. However, thanks to considering the Cartesian variables on a sphere, we do not have to warn about any loss of sign, because it does not occur neither in evaluating $R(U_1,U_2)$, nor in obtaining W(Z). The value of y associated with a given z results directly from the condition ${\dot{x}}=0$ in Eq. (9), being

$$\begin{aligned} y = -a - \frac{ a b}{z-b} = - \frac{a z}{z-b}. \end{aligned}$$

(31)

Solving $w(z-b)=0$, we can recycle the procedure used for $W(Z)=0$. More details can be found in “Appendix B” so here we only summarize the final results. When considering the problem on a sphere, there are only two generic situations: either there are two Cassini states $C_2$ and $C_3$, or there are four of them: $C_1$, $C_2$, $C_3$ and $C_4$. Let us call the former case ‘the Colombo top problem of type II,’ and the latter—‘type IV.’ The special case when there are three Cassini states will be called ‘the Colombo top problem of type III’ (see Fig. 2).

In the following subsections, we briefly review each type, providing the coordinates of the Cassini states $C_j$ as the functions of a and b parameters. To avoid confusion with the $z_i$ expressions of Sect. 2.3, we label the coordinates of $C_j$ as $z^*_j$, and $y^*_j$ (of course, $x^*_j=0$ for all the points). We only provide $z^*_j$ that allows to find $y^*_j$ from Eq. (31), and then, the energy $E_j$ of the Cassini state is

$$\begin{aligned} E_j = - \frac{\left( z^*_j-b\right) ^2}{2} - \frac{ a^2 b}{z^*_j-b}. \end{aligned}$$

(32)

Substituting this energy value into an appropriate $z_{12}$, $z_{23}$, $z_{34}$, or $z_{234}$ from Sect. 2.3, should result in returning to the expression of $z^*_j$ in terms of a and b.

2.4.1 Type II

When $a^{\frac{2}{3}}+b^{\frac{2}{3}} > 1$, the dynamics on a sphere is relatively simple. There are two stable equilibria (elliptic points): $C_3$ at the lower ($z<0$) hemisphere and $C_2$ at the upper hemisphere, located at

$$\begin{aligned} z^*_3 = \frac{b}{2} - \sqrt{ A_2 } - \sqrt{ B_2}, \qquad z^*_2 = \frac{b}{2} - \sqrt{A_2} + \sqrt{B_2}, \end{aligned}$$

(33)

where

$$\begin{aligned} A_2= & {} - P'_2 +\frac{1}{2}\,\left( \root 3 \of {g'_3+ \sqrt{-\varDelta '} } + \root 3 \of {g'_3 - \sqrt{-\varDelta '} } \right) , \nonumber \\ B_2= & {} -3 P'_2 - A_2 + 2 \sqrt{A_2^2 + 3 P'_2 \left( P'_2+A_2\right) - \frac{3 \rho ^2}{16}}, \end{aligned}$$

(34)

and according to Eqs. (164) and (163)

$$\begin{aligned} \sqrt{-\varDelta '} = \frac{ab}{4}\sqrt{a^2b^2 - \rho ^3},\quad g'_3 = \frac{a^2 b^2}{4} - \left( \frac{\rho }{2}\right) ^3, \quad P'_2 = - \frac{\rho }{2} - \frac{b^2}{4}. \end{aligned}$$

(35)

We can identify the Cassini states coordinates as $z^*_3 = z_{12}(E_3)$, and $z^*_2 = z_{12}(E_2)$, where $z_{12}$ is given by Eq. (26).

The energy values in the type II problem are bounded by $E_3 \le E \le E_2$. All trajectories with energy $E_3< E < E_2$ are simple periodic curves (one for each value of E), oscillating in z between $z_1$ and $z_2$, as given by Eq. (25). Extending and modifying the domains labeling of Henrard and Murigande (1987), let us label the entire sphere surface (with the two Cassini states excluded) as $\mathrm {D}_{23}$ (Fig. 2, left).

2.4.2 Type IV

When $a^{\frac{2}{3}}+b^{\frac{2}{3}}<1$, the flow is shaped by the presence of three elliptic fixed points $C_1$, $C_2$, $C_3$, and a hyperbolic point $C_4$ (Fig. 2, right). The latter is accompanied by two homoclinic orbits—upper $\varGamma _{1}$ and lower $\varGamma _{2}$. Thus, the sphere is first partitioned into three domains: $\mathrm {D}_1$ between $C_1$ and $\varGamma _{1}$, $\mathrm {D}_2$ between $C_2$, $\varGamma _{1}$, $\varGamma _{2}$, and the third area bounded by $C_3$ and $\varGamma _2$. But the last of the three domains is further divided by the curve whose energy equals that of $C_1$—the thick dashed curve $\varGamma _3$ in Fig. 2. The subdomains $\mathrm {D}_3$ and $\mathrm {D}_4$ may look similar from the geometrical point of view, but note the fact that each curve in $\mathrm {D}_4$ has a companion in $\mathrm {D}_1$ with the same energy, which is not the case in $\mathrm {D}_3$.

Equation (171) from “Appendix B” can be used as they are, but here we add an alternative form, based upon the transformation (129):

$$\begin{aligned} z^*_1= & {} \frac{b}{2} + \sqrt{A_4} + \sqrt{B_4} - \sqrt{B'_4} = \frac{b}{2} + \sqrt{A_4} + \sqrt{B_4+B'_4 - 2 \sqrt{B_4 B'_4}}, \nonumber \\ z^*_2= & {} \frac{b}{2} - \sqrt{A_4} + \sqrt{B_4} + \sqrt{B'_4} = \frac{b}{2} - \sqrt{A_4} + \sqrt{B_4+B'_4 + 2 \sqrt{B_4 B'_4}}, \nonumber \\ z^*_3= & {} \frac{b}{2} - \sqrt{A_4} -\sqrt{B_4} - \sqrt{B'_4} = \frac{b}{2} - \sqrt{A_4} - \sqrt{B_4+B'_4 + 2 \sqrt{B_4 B'_4}}, \nonumber \\ z^*_4= & {} \frac{b}{2} + \sqrt{A_4} - \sqrt{B_4} + \sqrt{B'_4} = \frac{b}{2} + \sqrt{A_4} - \sqrt{B_4+B'_4 - 2 \sqrt{B_4 B'_4}}, \end{aligned}$$

(36)

where

$$\begin{aligned} A_4 = \frac{b^2}{4}+ \rho \cos ^2\frac{\phi _4}{3}, \qquad B_4 = \frac{b^2}{4}+ \rho \cos ^2\frac{\phi _4-\pi }{3} , \qquad B'_4 = \frac{b^2}{4}+ \rho \cos ^2\frac{\phi _4+\pi }{3}, \end{aligned}$$

(37)

and

$$\begin{aligned} \cos {\phi _4} = a b \rho ^{-\frac{3}{2}}. \end{aligned}$$

(38)

From $A_4< B_4 < B'_4$, we infer $z^*_3< 0< z^*_2< b< z^*_4 < z^*_1$. The associated y coordinates satisfy inequalities $y^*_4< y^*_1< -a< y^*_3< 0 < y^*_2$.

The energy values are $E_3< E_1< E_4< 0 < E_2$. Referring to Sect. 2.3, we identify $z^*_1 = z_{34}(E_1)$, $z^*_2 = z_{12}(E_2)$, $z^*_3 = z_{12}(E_3)$, and $z^*_4 = z_{23}(E_4)$. The unstable equilibrium energy $E_4$ is of special importance, because it serves to determine the turning points of the separatrices from Eq. (23): $z_4(E_4)$ for $\varGamma _1$, and $z_1(E_4)$ for $\varGamma _2$.

For the energy values $E_1< E < E_4$, when $\varDelta > 0$, each E refers to two periodic curves: one in $\mathrm {D}_4$ with the turning points $(z_1,z_2)$ given by (18), and one in $\mathrm {D}_1$ with $(z_3,z_4)$ given by Eq. (19). An energy in $E_3< E < E_1$ defines only one periodic trajectory in $\mathrm {D}_3$, and each $E_4< E < E_2$ defines one periodic curve in $\mathrm {D}_2$. Since $\varDelta < 0$ in both cases, the turning points are given by $(z_1,z_2)$ from Eq. (25).

2.4.3 Type III

The case of $a^\frac{2}{3}+b^\frac{2}{3}=1$ is specific, but it has to be included to understand the bifurcation between the two neighboring types. Increasing a and/or b from the type IV, we observe that the two Cassini states $C_1$ and $C_4$ merge into a single point $C_{14}$ of neutral stability, the homoclinic orbit $\varGamma _1$ contracts to a point, and the separatrix $\varGamma _2$ merges with the specific periodic orbit $\varGamma _{3}$ into $\varGamma _{23}$. Thus, the domains $\mathrm {D}_1$ and $\mathrm {D}_4$ disappear. In course of transition from type III to type II, the division between $\mathrm {D}_{2}$ and $\mathrm {D}_3$ disappears; hence, we merge them into a single $\mathrm {D}_{23}$. Lanchares and Elipe (1995) refer to this transition under the name of teardrop bifurcation.

The coordinates of the Cassini states can be obtained from Eq. (174) and are expressible in terms of a or b alone, resulting in

$$\begin{aligned} z^*_{14} = \sqrt{1-A_3}= b^\frac{1}{3}, \qquad y^*_{14} = - \sqrt{A_3} = - a^\frac{1}{3}, \end{aligned}$$

(39)

and

$$\begin{aligned} z^*_3 = - z^*_{14} \left( A_3 + \sqrt{B_3} \right) , \qquad z^*_2 = - z^*_{14} \left( A_3 - \sqrt{B_3} \right) , \end{aligned}$$

(40)

where

$$\begin{aligned} A_3 = 1- b^\frac{2}{3} = a^\frac{2}{3}, \qquad B_3 = A^2_3-A_3+1. \end{aligned}$$

(41)

The expression of the energy at $C_{14}$ is so simple, that we write it explicitly

$$\begin{aligned} E_{14} = - \frac{3}{2} \root 3 \of {a^4 b^2}= - \frac{3 \left( z^*_{14}\right) ^2 \left( y^*_{14}\right) ^4}{2} = - \frac{3 A_3^2 \left( 1-A_3\right) }{2}. \end{aligned}$$

(42)

It is useful also in finding the turning point of $\varGamma _{23}$ at $x=0$, which is

$$\begin{aligned} z_\mathrm {m} = z^*_{14} \left( 1-4A_3\right) , \qquad y_\mathrm {m} = - y^*_{14} \left( 3-4 A_3\right) . \end{aligned}$$

(43)

For regular trajectories with $E_3< E < E_2$, and $E \ne E_{14}$, their turning points are given by $z_1$ and $z_2$ from Eq. (25), which also means $z_1 < z_2$.

3 Analytical solution

3.1 Weierstrass form

3.1.1 General framework

Equations of motion (9) or (13) admit an exact analytical solution in terms of the elliptic functions. Actually, the solution hinges upon the fact that the variable Z can be considered separately from the remaining two. To see it, we can differentiate ${\dot{Z}}=-aX$, substitute ${\dot{X}}$ from the first of Eq. (13), and use the energy integral (11) to eliminate Y, obtaining

$$\begin{aligned} {\ddot{Z}} = - \frac{1}{2} Z^3 - E Z - a^2 b. \end{aligned}$$

(44)

Equation (44) defines a 1 degree of freedom, conservative system with the potential

$$\begin{aligned} V_z(Z) = \frac{Z^4}{8} + \frac{E}{2} Z^2 + a^2 b Z, \end{aligned}$$

(45)

and the energy integral

$$\begin{aligned} \frac{{\dot{Z}}^2}{2} + V_z(Z) = E_z = \mathrm {const}. \end{aligned}$$

(46)

But the potential $V_z(Z)$ and the energy $E_z$ are closely related to the polynomial W(Z):

$$\begin{aligned} E_z - V_z(Z) = \frac{1}{8} W(Z), \text{ with } 2 E_z = \frac{W(0)}{4} = - \left( E - a^2\right) ^2 + a^2 \left( 1- b^2\right) . \end{aligned}$$

(47)

Thus, whether we use Eq. (46), or the squared Eqs. (13) and (16), the outcome is $4 {\dot{Z}}^2 = W(Z)$, amenable to the separation of variables method.

Since W(Z) is a quartic polynomial, finding Z(t) amounts to inverting the elliptic integral

$$\begin{aligned} \sigma _z \intop _{Z_{0}}^{Z}\frac{\mathrm {d}Z}{\sqrt{W(Z)}} = \frac{1}{2} \intop _{t_0}^{t} \mathrm {d}t, \end{aligned}$$

(48)

where $\sigma _z = \mathrm {sgn}{{\dot{Z}}} = - \mathrm {sgn}{X}$, and the right-hand side evaluates to

$$\begin{aligned} \tau _0 = \frac{t-t_0}{2}. \end{aligned}$$

(49)

The integral to the left of (48) should be a monotonous function of Z to allow the inversion (solving for Z(t)). In the admissible range of Z between two turning points (or one turning point and an unstable equilibrium), the sign of ${\dot{Z}}$ is constant, which allows to establish its value from the initial condition and to pull $\sigma _z$ out of the integrand.

The integral to the left of Eq. (48) is an elliptic integral, and it can be reduced to the Weierstrass normal form by an appropriate transformation $Z \rightarrow s$, so that

$$\begin{aligned} \sigma _z \intop _{Z_0}^{Z}\frac{\mathrm {d}Z}{\sqrt{W(Z)}} = \intop _{s}^{\infty } \frac{\mathrm {d}s}{\sqrt{S(s)}} = \tau _0, \end{aligned}$$

(50)

where $S(s)= 4 s^3 - g_2 s - g_3$ is the cubic polynomial of the Weierstrass resolvent (133) for $W(Z)=0$, with the invariants $g_2$, $g_3$ defined in “Appendix A.2”—Eqs. (134) and (135). Note that the initial value $Z_0$ is always mapped to $s \rightarrow \infty $, regardless of the ordering of the integration limits $Z_0$ and Z, which explains the presence of $\sigma _z$ in the forthcoming transformations.

Solving the rightmost part of Eq. (50) amounts to the substitution of the Weierstrass elliptic function

$$\begin{aligned} s = \wp (\tau _0; g_2, g_3), \text{ hence } \mathrm {d}s = \wp '(\tau _0; g_2, g_3) \mathrm {d}\tau _0. \end{aligned}$$

(51)

In all further instances, we will use the abbreviated notation for the Weierstrass $\wp $ function $\wp (u) = \wp (u; g_2, g_3)$, whenever the invariants $g_2$ and $g_3$ from “Appendix A.2” are used. Only the invariants different than $g_2$ and $g_3$ will be added to the list of arguments if needed. The derivative of the Weierstrass $\wp $ function obeys

$$\begin{aligned} \left| \wp '(\tau _0 ) \right| = \sqrt{S(\wp (\tau _0 ))}, \end{aligned}$$

(52)

and in the first half-period $0< \tau _0 < \omega _1 = \wp ^{-1}(e_1)$, we have $\wp '(\tau _0 )<0$. Thus, indeed

$$\begin{aligned} \intop _{s}^{\infty } \frac{\mathrm {d}s}{\sqrt{S(s)}} = \intop _{\tau _0}^{0} \frac{\wp '(\tau )}{\left| \wp '(\tau )\right| }\,\mathrm {d}\tau = - \intop _{\tau _0}^{0} \mathrm {d}\tau = \tau _0, \end{aligned}$$

(53)

as expected from Eq. (50).

Once the solution for the first half-period is found, its continuation can be investigated by the substitution into Eq. (44) which is free from the restrictions imposed by the inversion procedure.

3.1.2 Initial conditions at turning point

Let us begin with the easiest situation, when the initial condition is $Z_0 = Z_j = z_j-b$, and $t_0=t_j$ is the epoch of crossing the turning point $X_0=0$, i.e., ${\dot{Z}}=0$. Then, the integral to the left of (50) is reduced to the Weierstrass normal form in variable s through the rational transformation (Enneper 1890; Bianchi 1901)

$$\begin{aligned} Z = Z_j + \frac{\frac{1}{4} W'(Z_j)}{s - \frac{1}{24} W''(Z_j)}, \qquad s = \frac{W'(Z_j)}{4 \left( Z-Z_j\right) } + \frac{1}{24} W''(Z_j). \end{aligned}$$

(54)

Replacing the subscript 0 with j in the formulae of Sect. 3.1.1, we find

$$\begin{aligned} Z = Z_j + \frac{\frac{1}{4} W'(Z_j)}{\wp (\tau _j) - \frac{1}{24} W''(Z_j)}, \end{aligned}$$

(55)

where

$$\begin{aligned} \tau _j = \frac{t-t_j}{2}. \end{aligned}$$

(56)

Thanks to the symmetry of trajectory with respect to the turning point, the solution does not depend on $\sigma _z$ and remains valid for all values of $\tau _j$.

Given the initial conditions $X(t_j) = 0$, $Y(t_j)=Y_j$, $Z(t_j)=Z_j$, and knowing Z(t), we may complete the solution for $\mathbf {R}(t)$. The two missing variables come from the energy integral (11), with

$$\begin{aligned} E = - \frac{Z^2}{2} + a Y = - \frac{Z_j^2}{2} + a Y_j, \end{aligned}$$

(57)

and from the third of the equations of motion (13), i.e., $X = - {\dot{Z}}/a$, so

$$\begin{aligned} X= & {} - \left( \frac{Z -Z_j}{2 a}\right) ^2 \frac{\wp '(\tau _j)}{a b + Y_j Z_j}, \nonumber \\ Y= & {} Y_j + \frac{Z^2-Z_j^2}{2 a}, \nonumber \\ Z= & {} Z_j - \frac{ 2 a^2 b + 2 E Z_j + Z_j^3}{\wp (\tau _j) + \frac{1}{3} E + \frac{1}{2} Z_j^2} = Z_j - \frac{6 a \left( a b + Y_j Z_j\right) }{3 \wp (\tau _j) + a Y_j + Z_j^2}. \end{aligned}$$

(58)

The coordinates on the unit sphere are, as usually, $x=X$, $y=Y-a$, and $z=Z+b$.

3.1.3 Arbitrary initial conditions: Weierstrass–Biermann form

If the initial conditions are not at the turning point, i.e., $Z_0 \ne Z_j$, the transformation $Z \mapsto s$ is more cumbersome than (54). Whittaker and Watson (1927) quote two alternative forms of the final solution: one due to Weierstrass, published by Biermann (1865), and one by Mordell (1915). Actually, there is yet another, formally elegant form—the secondo metodo d’inversione of Bianchi (1901), but the relation of its constants to initial conditions is rather awkward, so we do not consider it here.

The Weierstrass–Biermann form results from the substitution (Enneper 1890)

$$\begin{aligned} Z= & {} Z_0 + \frac{\sigma _z \sqrt{W_0} \sqrt{S(s)} +\frac{1}{2} W_1 \left( s - \frac{1}{12} W_2\right) + \frac{1}{4} W_0 W_3}{2 \left( s-\frac{1}{12} W_2\right) ^2 - \frac{1}{2} W_0 W_4 }, \end{aligned}$$

(59)

$$\begin{aligned} s= & {} \frac{\sqrt{W_0} \sqrt{W(Z)} + W_0 + \frac{1}{2} W_1 \left( Z-Z_0\right) + \frac{1}{6} W_2 \left( Z-Z_0\right) ^2 }{2 \left( Z-Z_0 \right) ^2}, \end{aligned}$$

(60)

where $\sigma _z$ is the sign of $Z-Z_0$, and

$$\begin{aligned} W_k = \frac{1}{k!} \left[ \frac{\mathrm {d}^k W(Z)}{\mathrm {d}Z^k}\right] _{Z_0}, \end{aligned}$$

(61)

are the Taylor coefficients in

$$\begin{aligned} W(Z) = \sum _{k=0}^4 W_k (Z-Z_0)^k. \end{aligned}$$

(62)

After expressing E in terms of the initial conditions, they take the form

$$\begin{aligned}&W_0 = 4 a^2 X_0^2, \qquad \qquad \, \, W_1 = - 8 a \left( a b+Y_0 Z_0\right) , \nonumber \\&W_2 = - 4 \left( a Y_0 + Z_0^2\right) , \quad W_3 = - 4 Z_0, \quad W_4 = -1. \end{aligned}$$

(63)

Substituting $s = \wp (\tau _0)$, and recalling that $\sqrt{S(s)} = - \wp '(\tau _0)$ over the first half-period, we obtain

$$\begin{aligned} Z = Z_0 + \frac{-\sigma _z \sqrt{W_0} \wp '(\tau _0) +\frac{1}{2} W_1 \left( \wp (\tau _0) - \frac{1}{12} W_2\right) + \frac{1}{4} W_0 W_3}{ 2 \left( \wp (\tau _0) -\frac{1}{12} W_2\right) ^2 - \frac{1}{2} W_0 W_4 }. \end{aligned}$$

(64)

Then, accounting for Eq. (63) and the fact that $\sqrt{W_0} = 2 a |X_0| = - 2 \sigma _z a X_0$,

$$\begin{aligned} Z = Z_0 + a \, \frac{X_0 \wp '(\tau _0) - 2 \left( a b + Y_0 Z_0 \right) \left[ \wp (\tau _0) + \frac{1}{3} \left( a Y_0 + Z_0^2\right) \right] - 2 a X_0^2 Z_0}{ \left[ \wp (\tau _0) + \frac{1}{3} \left( a Y_0 + Z_0^2\right) \right] ^2 + a^2 X_0^2 }. \end{aligned}$$

(65)

The substitution into Eq. (44) with the initial condition ${\dot{Z}}_0 = - a X_0$ shows that the formula (65) is actually valid for all values of $\tau _0$, so the initial restriction to the first half-period can be abolished.

Like before, the remaining two variables Y, X are found from the energy integral and from the equations of motion, respectively. Thus, for Y, we have simply

$$\begin{aligned} Y = Y_0 + \frac{Z^2-Z_0^2}{2 a}, \end{aligned}$$

(66)

whereas $X= - {\dot{Z}}/a$, requires the differentiation and some manipulations leading to

$$\begin{aligned} X {=} \frac{- X_0 \left( 3 \wp (\tau _0)- \frac{1}{4}g_2\right) {+} \left( a b + Y_0 Z_0 \right) \wp '(\tau _0) {+} \frac{(Z-Z_0)}{a} \left[ \wp (\tau _0) + \frac{1}{3} \left( a Y_0 + Z_0^2\right) \right] \wp '(\tau _0)}{ \left[ \wp (\tau _0) + \frac{1}{3} \left( a Y_0 + Z_0^2\right) \right] ^2 + a^2 X_0^2 }, \end{aligned}$$

(67)

where the second derivative has been removed using the identity $\wp ''=6 \wp ^2-(g_2/2)$.

3.1.4 Arbitrary initial conditions: Safford form

If $X_0=0$, so $\tau _0=\tau _j$, the above solution simplifies to Eq. (58) in a straightforward manner. On the other hand, as pointed out by Safford (1919), the general solution (64) can be derived from the particular solution (55) by assuming $\tau _j = \tau _0 + \phi _j$, and making use of the addition theorem for the Weierstrass function $\wp $. Given the initial conditions $X_0,Y_0,Z_0$ at $t=t_0$, we can find the turning point coordinates $Z_j, Y_j$ for this trajectory using the formulae of Sect. 2.3. Then, according to Safford (1919), the phase $\phi _j$ is defined through Eq. (60) at $Z=Z_j$, giving $s=s_j = \wp (\phi _j)$. Thus, as an alternative to using Eqs. (65) and (66) for arbitrary initial conditions, one can first compute $\phi _j = \wp ^{-1}(s_j)$, and then apply the particular solution (58) with $\tau _j = \tau _0 + \phi _j$.

But Safford (1919) cared solely about the reduction of the integrand form, so he paid no attention to the problem that $\wp (\phi _j)$ is uniquely invertible only in the domain $0 \le \phi _j \le \omega _1$, i.e., within the first half-period. In order to properly place the phase in the full period range $- \omega _1 \le \phi _j \le \omega _1$, one needs the information about the sign of the derivative $\wp '(\phi _j)$. To this end, we take a slightly different approach, that actually leads to a simpler expression for $s_j$. Substituting $\tau _j=\phi _j$, and $\mathbf {R}=\mathbf {R}_0$ in Eq. (58), we can solve them to find

$$\begin{aligned} \wp (\phi _j)= & {} - 2 a \frac{a b + Y_j Z_j}{Z_0-Z_j} - \frac{a Y_j + Z_j^2}{3}, \end{aligned}$$

(68)

$$\begin{aligned} \wp '(\phi _j)= & {} - X_0 \left( a b + Y_j Z_j\right) \, \left[ \frac{2a}{Z_0-Z_j}\right] ^2, \end{aligned}$$

(69)

and this set allows the unique determination of the phase. Then, we can apply Eq. (58) to obtain

$$\begin{aligned} X= - \left( \frac{Z -Z_j}{2 a}\right) ^2 \frac{\wp '(\tau _0+\phi _j)}{a b + Y_j Z_j}, \qquad Z = Z_j - \frac{6 a \left( a b + Y_j Z_j\right) }{3 \wp (\tau _0+\phi _j) + a Y_j + Z_j^2}, \end{aligned}$$

(70)

with $Y_j$ and Y derived from the energy integral.

Compared to the Weierstrass–Biermann solution, we gain the simplicity at the expense of pre-computing the turning point coordinates and the phase for the given (arbitrary) initial conditions.

3.1.5 Arbitrary initial conditions: Mordell form

The inversion formula of Mordell (1915) was formulated in the language of homogeneous binary forms; in order to apply it to the Colombo top problem, let us translate it to the univariate polynomials framework. The link is simple: the quartic polynomial W(Z) from Eq. (122), and the quartic binary form

$$\begin{aligned} V(\xi ,\eta ) = a_0 \xi ^4 + 4 a_1 \xi ^3 \eta + 6 a_2 \xi ^2 \eta ^2 + 4 a_3 \xi \eta ^3 + a_4 \eta ^4, \end{aligned}$$

(71)

can be matched by

$$\begin{aligned} W(Z) = V(Z,1), \qquad V(\xi ,\eta ) = \eta ^4 W(\xi \eta ^{-1}). \end{aligned}$$

(72)

Thus, after the substitution $Z=\xi \eta ^{-1}$, Eq. (50) is equivalent to

$$\begin{aligned} \sigma _z \intop _{(\xi _0,\eta _0)}^{(\xi ,\eta )}\frac{\eta \mathrm {d}\xi - \xi \mathrm {d}\eta }{\sqrt{V(\xi ,\eta )}} = \intop _{s}^{\infty } \frac{\mathrm {d}s}{\sqrt{S(s)}} = \tau _0, \end{aligned}$$

(73)

where the definite integral in Z has been replaced by a path-independent line integral. Skipping the intermediate steps described in Mordell (1914, 1915), the inversion of (73) results in

$$\begin{aligned} \xi= & {} -\sigma _z \xi _0 \sqrt{V_0} \wp '(\tau _0) + \frac{1}{2} \frac{\partial V_0}{\partial \eta _0} \wp (\tau _0) + \frac{1}{2} \frac{\partial {\tilde{h}}_0}{\partial \eta _0}, \nonumber \\ \eta= & {} -\sigma _z \eta _0 \sqrt{V_0} \wp '(\tau _0) - \frac{1}{2} \frac{\partial V_0}{\partial \xi _0} \wp (\tau _0) - \frac{1}{2} \frac{\partial {\tilde{h}}_0}{\partial \xi _0}, \end{aligned}$$

(74)

where $V_0 = V(\xi _0,\eta _0)$, and ${\tilde{h}}_0$ is the Hessian covariant^{Footnote 1} of V.

From the correspondence rules (72), we derive

$$\begin{aligned}&\frac{\partial V_0}{\partial \xi _0} = \eta _0^3 W_1, \nonumber \\&\frac{\partial V_0}{\partial \eta _0} = \eta _0^2 \left( 4 \eta _0 W_0 - \xi _0 W_1\right) , \nonumber \\&{\tilde{h}}_0 = \frac{1}{144} \left( \frac{\partial ^2 V_0}{\partial \xi _0^2} \frac{\partial ^2 V_0}{\partial \eta _0^2} - \left( \frac{\partial ^2 V_0}{\partial \xi _0 \partial \eta _0} \right) ^2\right) = \frac{\eta _0^4 \left( 8 W_0 W_2 - 3 W_1^2\right) }{48}, \nonumber \\&\frac{\partial {\tilde{h}}_0}{\partial \xi _0} = \frac{\eta _0^3 \left( 6 W_0 W_3 - W_1 W_2\right) }{12}, \nonumber \\&\frac{\partial {\tilde{h}}_0}{\partial \eta _0} = \frac{\eta _0^2 \left( 8 \eta _0 W_0 W_2 - 3 \eta _0 W_1^2 + \xi _0 W_1 W_2 - 6 \xi _0 W_0 W_3\right) }{12}, \end{aligned}$$

(75)

where $W_n$ are defined in Eq. (63). By letting $Z=\xi /\eta $, and $Z_0=\xi _0/\eta _0$, we find that (74) and (75) amount to

$$\begin{aligned} Z = \frac{\xi }{\eta } = Z_0 + \frac{48 W_0 \wp (\tau _0) + 8 W_0 W_2 - 3 W_1^2}{ - 24 \sigma _z \sqrt{W_0} \wp '(\tau _0) - 12 W_1 \wp (\tau _0)+ W_1 W_2 - 6 W_0 W_3}. \end{aligned}$$

(76)

Proceeding like in Sect. 3.1.3, we obtain the final form

$$\begin{aligned} Z = Z_0 + 4 a \, \frac{ X_0^2 \left[ \wp (\tau _0) - \frac{2}{3} \left( a Y_0 + Z_0^2\right) \right] - \left( a b + Y_0 Z_0\right) ^2 }{ X_0 \wp '(\tau _0) + 2 \left( a b + Y_0 Z_0\right) \left[ \wp (\tau _0) + \frac{1}{3}\left( a Y_0 + Z_0^2\right) \right] + 2 a X_0^2 Z_0}, \end{aligned}$$

(77)

looking different from the Weierstrass–Biermann solution (65), yet providing the same values of Z. Notably, the Mordell solution involves only the first power of $\wp $.

In order to demonstrate the equivalence of the Weierstrass–Biermann and the Mordell solutions, one can multiply the numerator and the denominator in Eq. (65) by the factor

$$\begin{aligned} \left( 48 a X_0 \wp '(\tau _0) - \left( 12 \wp (\tau _0) - W_2 \right) W_1 - 6 W_0 W_3 \right) , \end{aligned}$$

use the identity $\left( \wp '(\tau _0) \right) ^2 = S(\wp (\tau _0))$, and substitute the expressions of the invariants in terms of $W_n$

$$\begin{aligned} g_2 = \frac{W_2^2}{12} - \frac{W_2 W_3}{4} + W_0 W_4, \qquad g_3 = \left| \begin{array}{ccc} W_4 &{} \frac{1}{4} W_3 &{} \frac{1}{6} W_2 \\ \frac{1}{4} W_3 &{} \frac{1}{6} W_2 &{} \frac{1}{4} W_1 \\ \frac{1}{6} W_2 &{} \frac{1}{4} W_1 &{} W_0 \\ \end{array} \right| , \end{aligned}$$

(78)

by analogy with the definitions (134) and (135). The result of this procedure is the Mordell solution (77). Obviously, both (65) and (77) admit the same limit expression (58) when $X_0=0$ and $\tau _0=\tau _j$.

As usually, the solution for X and Y can be derived from the equations of motion and the energy integral, like in Sect. 3.1.3.

3.2 Solution in terms of the Jacobi elliptic functions

3.2.1 Weierstrass functions in terms of the Jacobi functions

Although the solution in terms of the Weierstrass $\wp $ function presented in Sect. 3.1 may look universal, its qualitative properties depend on the values of the invariants through the sign of the discriminant $\varDelta $. Indeed, the sign plays the central role in expressing the solution in terms of the Jacobian elliptic functions. In this section, only the generic, $\varDelta \ne 0$ cases are to be discussed.

The basic relation between the Weierstrass and Jacobi functions is formally universal (Byrd and Friedman 1971)

$$\begin{aligned} \wp (\tau )= & {} e_3 + \left( \frac{\gamma _p}{\mathrm {sn}(u_p,k_p)}\right) ^2 = e_3 + \gamma _p^2 \, \frac{1+\mathrm {dn}(2 u_p,k_p)}{1- \mathrm {cn}(2 u_p,k_p)} , \end{aligned}$$

(79)

$$\begin{aligned} \wp '(\tau )= & {} - 2 \left( \frac{\gamma _p}{\mathrm {sn}(u_p,k_p)}\right) ^3 \mathrm {cn}(u_p ,k_p)\,\mathrm {dn}(u_p ,k_p) \nonumber \\= & {} - 2 \gamma _p^3 \, \frac{\left( \mathrm {cn}(2 u_p,k_p)+\mathrm {dn}(2 u_p,k_p)\right) \left( 1+\mathrm {dn}(2 u_p,k_p)\right) }{\mathrm {sn}(2 u_p,k_p) \left( 1- \mathrm {cn}(2 u_p,k_p)\right) }, \end{aligned}$$

(80)

where

$$\begin{aligned} u_p = \gamma _p \tau , \qquad \gamma _p = \sqrt{e_1-e_3}, \qquad k_p = \sqrt{\frac{e_2-e_3}{e_1-e_3}}. \end{aligned}$$

(81)

But if we restrict considerations to the real arguments and moduli of the elliptic functions, the above expressions are valid only if the discriminant $\varDelta $ from Eq. (136) is positive.

When $\varDelta < 0$, which means complex $e_2$ and $e_3$, the appropriate form is (Abramowitz and Stegun 1972)

$$\begin{aligned} \wp (\tau )= & {} = e_1 + \gamma _n \,\left( \frac{\mathrm {cn}(u_n,k_n)}{ \mathrm {sn}(u_n, k_n) \mathrm {dn}(u_n, k_n)}\right) ^{2} = e_1 + \gamma _n \, \frac{1+\mathrm {cn}(2 u_n, k_n)}{1-\mathrm {cn}(2 u_n, k_n)}, \end{aligned}$$

(82)

$$\begin{aligned} \wp '(\tau )= & {} -2 \left( \frac{ \sqrt{\gamma _n} }{\mathrm {sn}(u_n, k_n) \mathrm {dn}(u_n, k_n)} \right) ^3 \, \mathrm {cn}(u_n,k_n) \, \left( 1-k_n^2 + k_n^2 \mathrm {cn}^4(u_n,k_n) \right) \nonumber \\= & {} - 4 \gamma _n^\frac{3}{2} \, \frac{\mathrm {sn}(2 u_n, k_n) \mathrm {dn}(2 u_n, k_n)}{\left( 1 - \mathrm {cn}(2 u_n,k_n)\right) ^2}, \end{aligned}$$

(83)

where

$$\begin{aligned} u_n = \sqrt{\gamma _n} \tau , \qquad \gamma _n = \gamma _p \sqrt{e_1-e_2} = \frac{1}{2} \sqrt{12 e_1^2 - g_2}, \qquad k_n = \frac{1}{2} \sqrt{2 - \frac{3 e_1}{\gamma _n} }. \end{aligned}$$

(84)

The two cases are linked by the complex modulus transformation—see Byrd and Friedman (1971, formula 165.07).

Let the initial conditions at the epoch $t_0$ be $(X_0,Y_0,Z_0)$. The equations relating the Jacobi and Weierstrass functions can be substituted in to any of the solution forms provided in Sect. 3.1. For the Weierstrass–Biermann or the Mordell form, it is enough to compute the energy $E=E(Y_0,Z_0)$ and the discriminant $\varDelta $ to choose the appropriate set (79,80) or (82,83). Then, from the invariants $g_2, g_3$ the roots $e_1, e_2, e_3$ are found, which allows the computation of $\mathbf {R}(t)$ or $\mathbf {r}(t)$ for any epoch t.

Below, we discuss the Safford form from Sect. 3.1.4, which allows to use the initial conditions at any $t_0$, but requires the turning point coordinates $(X_j=0, Y_j=y_j+a, Z_j = z_j-b)$ as supplementary parameters. Having computed two appropriate turning points, such that either $Z_1< Z_0 < Z_2$, or $Z_3< Z_0 < Z_4$, we pick one of them as the reference point $Z_j$ to be used in Eq. (70). Then, after determining phase $\phi _j$ with respect to the turning point $Z_j$, the motion can be computed for any epoch t, using $\tau _j = \tau _0+\phi _j$.

3.2.2 Motion in $\mathrm {D}_1$ and $\mathrm {D}_4$

The case $\varDelta > 0$ occurs only in type IV, when the energy is bounded by $E_4< E < E_1$: either in the domain $\mathrm {D}_1$, where turning points $Z_3, Z_4$ are given by Eq. (19) with $Z_4^*< Z_3 < Z_4$, or in the domain $D_4$, where the turning points are $Z_1, Z_2$ given by Eq. (18) and $Z_1< Z_2 < Z_4^*$. Then, selecting any appropriate $Z_j$ as the reference point, we can use the expressions

$$\begin{aligned} X= & {} \frac{A_p \gamma _p \mathrm {sn}(u_p,k_p) \mathrm {cn}(u_p,k_p) \mathrm {dn}(u_p,k_p)}{\left( 1+ B_p \mathrm {sn}^2(u_p,k_p)\right) ^2} \end{aligned}$$

(85)

$$\begin{aligned} Y= & {} Y_j + \frac{Z^2-Z_j^2}{2 a } = Y_j - \frac{A_p \mathrm {sn}^2(u_p,k_p)}{1+ B_p \mathrm {sn}^2(u_p,k_p)} \left( Z_j - \frac{a A_p \mathrm {sn}^2(u_p,k_p)}{ 1+ B_p \mathrm {sn}^2(u_p,k_p)}\right) \end{aligned}$$

(86)

$$\begin{aligned} Z= & {} Z_j - \frac{a A_p \mathrm {sn}^2(u_p,k_p)}{1+ B_p \mathrm {sn}^2(u_p,k_p)} \end{aligned}$$

(87)

where

$$\begin{aligned} A_p = \frac{2 (a b + Y_j Z_j)}{e_1-e_3}, \quad B_p = \frac{\frac{1}{3} \left( a Y_j + Z_j^2 \right) + e_3}{e_1-e_3}, \end{aligned}$$

(88)

and the phase $\phi _j$ in

$$\begin{aligned} u_p = \gamma _p (\tau _0 + \phi _j), \end{aligned}$$

(89)

can be computed from the incomplete elliptic function of the first kind F:

$$\begin{aligned} \gamma _p \phi _j= & {} \mathrm {F}(v_j,k_p), \end{aligned}$$

(90)

$$\begin{aligned} v_j= & {} \mathrm {am}(\gamma _p \phi _j, k_p) = \mathrm {sgn}{(A_p X_0)}\,\arcsin \sqrt{\frac{Z_j-Z_0}{a A_p - (Z_j-Z_0) B_p}}. \end{aligned}$$

(91)

The motion is periodic, and the period $P_t$ (with respect to time t) is given by the complete elliptic integral of the first kind K

$$\begin{aligned} P_t = \frac{4 \mathrm {K}(k_p)}{\gamma _p}. \end{aligned}$$

(92)

3.2.3 Motion in $\mathrm {D}_2$, $\mathrm {D}_3$, and $\mathrm {D}_{23}$

The common feature of trajectories in domains $\mathrm {D}_2$, $\mathrm {D}_3$, and $\mathrm {D}_{23}$ is the negative discriminant $\varDelta $. Thus, given the initial conditions and resulting energy, we pick $Z_1$ or $Z_2$ computed from Eq. (25) as the reference point $Z_j$ and compute, for any t

$$\begin{aligned} X= & {} \frac{A_n \sqrt{\gamma _n} (1-B_n) \mathrm {sn}(2 u_n,k_n) \mathrm {dn}(2 u_n,k_n)}{\left( 1- B_n \mathrm {cn}(2u_n,k_n)\right) ^2} \end{aligned}$$

(93)

$$\begin{aligned} Y= & {} Y_j + \frac{Z^2-Z_j^2}{2 a } = Y_j - A_n \frac{ 1-\mathrm {cn}(2u_n,k_n) }{1 - B_n \mathrm {cn}(2u_n,k_n)} \left( Z_j - \frac{a A_n}{2} \,\frac{ 1-\mathrm {cn}(2u_n,k_n)}{ 1 - B_n \mathrm {cn}(2u_n,k_n)}\right) , \end{aligned}$$

(94)

$$\begin{aligned} Z= & {} Z_j - a A_n\, \frac{ 1-\mathrm {cn}(2u_n,k_n)}{ 1 - B_n \mathrm {cn}(2u_n,k_n)}, \end{aligned}$$

(95)

where

$$\begin{aligned} A_n = \frac{2 (a b + Y_j Z_j)}{e_1+ \gamma _n + \frac{1}{3} \left( a Y_j + Z_j^2 \right) }, \quad B_n = 1- \frac{2 \gamma _n}{e_1+ \gamma _n + \frac{1}{3} \left( a Y_j + Z_j^2 \right) }. \end{aligned}$$

(96)

The phase in

$$\begin{aligned} u_n = \sqrt{\gamma _n} ( \tau _0 + \phi _j), \end{aligned}$$

(97)

results from

$$\begin{aligned} 2 \sqrt{\gamma _n} \phi _j= & {} \mathrm {F}(v_j,k_n), \end{aligned}$$

(98)

$$\begin{aligned} v_j= & {} \mathrm {am}(2 \sqrt{\gamma _n} \phi _j, k_n) = \mathrm {sgn}{(A_n X_0)}\,\arccos {\left( \frac{a A_n + Z_0-Z_j}{a A_n + (Z_0-Z_j) B_n} \right) }. \end{aligned}$$

(99)

The related period is

$$\begin{aligned} P_t = \frac{4 \mathrm {K}(k_n)}{\sqrt{\gamma _n}}. \end{aligned}$$

(100)

The above formulation is universal, i.e., appropriate in types II, III, and IV, provided $\varDelta < 0$.

3.3 Special cases

3.3.1 Reduction rules

By special cases, we mean the ones where elliptic functions reduce to the elementary ones. They could be studied by taking limits of the Jacobi functions at $k_n$ or $k_p$ tending to 0 or 1. But it is more direct to observe that each of the special cases, be it the Cassini states, separatrices, or the special curve $\varGamma _3$, results from the reduction of the Weierstrass function $\wp (u,g_2,g_3)$ to the special case

$$\begin{aligned} \wp (u;3,1) = 1 + \frac{3}{2 \tan ^2{\left( \sqrt{\frac{3}{2}} u\right) }}. \end{aligned}$$

(101)

Since $\varDelta =0$ implies $(g_2/3)^3 = g_3^2$, reduction to the above form is always possible by the homogeneity relations (Abramowitz and Stegun 1972)

$$\begin{aligned} \wp ( u ; g_2,g_3) = \lambda ^{-2} \wp (\lambda ^{-1} u; \lambda ^4 g_2 , \lambda ^6 g_3), \end{aligned}$$

(102)

except for $g_2=g_3=0$, when

$$\begin{aligned} \wp (u;0,0) = u^{-2}. \end{aligned}$$

(103)

For the derivative $\wp '$, respective equations are obtained by straightforward differentiation.

Depending on the sign of $g_3$, two procedures are available. For $g_3 > 0$, the substitution of $\lambda =g_3^{-\frac{1}{6}}$ leads straight to

$$\begin{aligned} \wp ( u ; g_2,g_3) = e_1 \wp (\sqrt{e}_1\, u; 3 , 1), \end{aligned}$$

(104)

where $g_3=e_1^3$, according to Eq. (145).

If $g_3 <0$, two steps are taken. First, letting $\lambda =\mathrm {i}$, we convert

$$\begin{aligned} \wp ( u ; g_2,g_3) = - \wp (\mathrm {i}\, u; g_2,-g_3). \end{aligned}$$

(105)

Then, with $\lambda = (-g_3)^{-\frac{1}{6}}$, we recover

$$\begin{aligned} \wp ( u ; g_2,g_3) = - 2 e_{12} \wp (\mathrm {i}\,\sqrt{2 e_{12}} \, u; 3, 1), \end{aligned}$$

(106)

according to Eq. (146).

3.3.2 Cassini states $C_2$ and $C_3$

When discussing the Cassini states, we can use the simplest form (58) for Z. Although the time dependence at the Cassini states does vanish due to $a b+ Y_0 Z_0 = -W'(Z_0)/(8a) = 0$, and $X_0=0$, it remains of interest to inspect $\wp (\tau )$ in the numerator, because its period is the period of small oscillations around the stable equilibrium.

Given a and b, one should first establish the problem type in order to compute the appropriate coordinate $Z^*_2 = z^*_2-b$, or $Z^*_3 = z^*_3-b$. These are Eqs. (33), (40), and (36) for the types II, III, and IV, respectively. Starting from this point, the procedure is common: $Z^*_j$ gives the energy of the Cassini state

$$\begin{aligned} E_j = - \frac{\left( Z^*_j\right) ^3 + a^2 b}{2 Z^*_j}, \qquad j=2,3, \end{aligned}$$

(107)

which substituted in Eq. (134) or (135) gives the invariants $g_2(E_j)$ or $g_3(E_j)$—both positive.

Resorting to Eqs. (104) and (101), we find that Z in solution (58) depends on the squared tangent of $\sqrt{3 e_1/8}\, t$ which implies the period

$$\begin{aligned} P_j = 2 \pi \sqrt{\frac{2}{3 e_1(E_j)}}, \qquad j=2,3, \end{aligned}$$

(108)

where $e_1(E_j) = \root 3 \of {g_3}= \sqrt{g_2/2}$. The same result can be obtained by taking the limit of (100) at $k_n \rightarrow 0$.

3.3.3 Cassini state $C_4$ and homoclinic orbits $\varGamma _1$, $\varGamma _2$

The energy of the unstable Cassini state $C_4$ can be evaluated from Eq. (107) with $j=4$, and $Z^*_4$ given by (36). However, we need it not for the Cassini state itself, but rather to describe the motion on homoclinic orbits having the energy $E_4$, which are $\varGamma _1$ and $\varGamma _2$. To this end, we will use the Safford form (70) with the reference points given by Eq. (23), namely $Z_4$ for $\varGamma _1$, and $Z_1$ for $\varGamma _2$.

Since $g_3<0$, the reduction (106) leads to

$$\begin{aligned} X= & {} \frac{{\tilde{A}} \sqrt{3 e_{12}} \sinh (2 {\tilde{u}})}{\left( 1+{\tilde{B}} \, \sinh ^2{\tilde{u}} \right) ^2}, \nonumber \\ Y= & {} Y_j + \frac{Z^2-Z_j^2}{2 a } = Y_j - \frac{ 2 {\tilde{A}} \, \sinh ^2{\tilde{u}} }{1+{\tilde{B}} \, \sinh ^2{\tilde{u}} } \left( Z_j - \frac{ a {\tilde{A}} \, \sinh ^2{\tilde{u}} }{1+{\tilde{B}} \, \sinh ^2{\tilde{u}} }\right) , \nonumber \\ Z= & {} Z_j - \frac{ 2 a {\tilde{A}} \, \sinh ^2{\tilde{u}} }{1+{\tilde{B}} \, \sinh ^2{\tilde{u}} }, \end{aligned}$$

(109)

with the coefficients

$$\begin{aligned} {\tilde{A}} = \frac{ a b + Y_j Z_j}{3 e_{12}}, \quad {\tilde{B}} = \frac{e_{12}+ \frac{1}{3} \left( a Y_j + Z_j^2 \right) }{3 e_{12}}, \end{aligned}$$

(110)

where $e_{12}$ is given by Eq. (146). The argument

$$\begin{aligned} {\tilde{u}} = \sqrt{3 e_{12}} ( \tau _0 + \phi _j), \end{aligned}$$

(111)

whose phase with respect to $Z_j$ at the initial epoch $t_0$ is given by

$$\begin{aligned} \phi _j = \frac{\mathrm {sgn}{(X_0 \, {\tilde{A}})}}{\sqrt{3 e_{12}}} \, \mathrm {arsinh} \sqrt{\frac{Z_j-Z_0}{2 a {\tilde{A}} - {\tilde{B}} (Z_j-Z_0)}}. \end{aligned}$$

(112)

Note that the time rate of argument ${\tilde{u}}$ is the same at both the separatrices, and as expected the solution tends to the Cassini state $C_4$ asymptotically at $t \rightarrow \pm \infty $.

3.3.4 Cassini state $C_1$ and special orbit $\varGamma _3$

The Cassini state $C_1$, being a stable equilibrium with $g_3>0$, is characterized by the period of small oscillations given directly by the formula (108) with the energy $E_1$ evaluated at $Y^*_1$, $Z^*_1$ deduced from Eq. (36). What makes the difference, compared to $C_2$ or $C_3$, is the presence of another trajectory having the energy $E_1$—the special curve $\varGamma _3$.

The motion along $\varGamma _3$ can be described using the Safford form solution (70) with respect to the turning points $Z_1$ or $Z_2$, given by Eq. (21). Then, performing the reduction (104), we obtain the solution in terms of trigonometric functions

$$\begin{aligned} X= & {} \frac{{\bar{A}} \gamma _3 \sin {2 {\bar{u}}}}{\left( 1+ {\bar{B}} \sin ^2{\bar{u}}\right) ^2}, \nonumber \\ Y= & {} Y_j + \frac{Z^2-Z_j^2}{2 a } = Y_j - \frac{ {\bar{A}} \sin ^2{\bar{u}}}{1+ {\bar{B}} \sin ^2{\bar{u}}} \left( Z_j - \frac{a {\bar{A}} \sin ^2{\bar{u}}}{2 \left( 1+ {\bar{B}} \sin ^2{\bar{u}} \right) }\right) , \nonumber \\ Z= & {} Z_j - \frac{a {\bar{A}} \sin ^2{\bar{u}}}{1+ {\bar{B}} \sin ^2{\bar{u}}}, \end{aligned}$$

(113)

where

$$\begin{aligned} {\bar{A}} = \frac{4 (a b + Y_j Z_j)}{3 e_1}, \quad {\bar{B}} = \frac{\frac{2}{3} \left( a Y_j + Z_j^2 \right) -e_1}{3 e_1}. \end{aligned}$$

(114)

The argument ${\bar{u}}$ is

$$\begin{aligned} {\bar{u}} = \gamma _3 (\tau _0 + \phi _j), \end{aligned}$$

(115)

where $\gamma _3 = \sqrt{3 e_1/2}$, as in Eq. (81) for $e_3= -e_1/2 $; hence, the time period of the solution is the same as that of small oscillations around $C_1$, i.e., $P_1$ given by Eq. (108). The phase $\phi _j$ can be computed from

$$\begin{aligned} \gamma _3 \phi _j = \mathrm {sgn}{({\bar{A}} X_0)}\,\arcsin \sqrt{\frac{Z_j-Z_0}{a {\bar{A}} - (Z_j-Z_0) {\bar{B}}}}. \end{aligned}$$

(116)

The above solution can be obtained either from (86) with $k_p=0$, and $e_3 = -e_1/2 $, or—in a different form—from (94) with $k_n=0$, and $g_2=3 e_1^2$.

3.3.5 Cassini state $C_{14}$ and special orbit $\varGamma _{23}$

The most degenerate case occurs in type III, where $C_{14}$ is the cusp (parabolic) equilibrium with energy $E_{14}$ given by Eq. (42). The homoclinic curve $\varGamma _{23}$ with this energy can be parameterized using rational functions of time, as indicated by the reduction formula (103). For the sake of using the Safford form, we introduce

$$\begin{aligned} u_{23} = t-t_0 + \tau _{23}, \qquad \tau _{23} = t_0-t_\mathrm {m}, \end{aligned}$$

(117)

where $\tau _{23}$ is the time interval between the initial epoch $t_0$ and the epoch of crossing the reference turning point with coordinates $Y_\mathrm {m} = y_\mathrm {m}+a$, and $Z_\mathrm {m} = z_\mathrm {m}-b$, as given by Eq. (43). Then, with $\beta = \root 3 \of {a^4 b^2}$,

$$\begin{aligned} X= & {} - \frac{8 a b u_{23}}{\left( 1+\beta u_{23}^2 \right) ^2}, \end{aligned}$$

(118)

$$\begin{aligned} Y= & {} Y_\mathrm {m} + \frac{4 a b u_{23}^2}{1+\beta u_{23}^2 } \left( Z_m + \frac{2 a^2 b u_{23}^2}{1+\beta u_{23}^2 } \right) , \end{aligned}$$

(119)

$$\begin{aligned} Z= & {} Z_\mathrm {m} + \frac{4 a^2 b u_{23}^2}{1+\beta u_{23}^2 }. \end{aligned}$$

(120)

The time offset $\tau _{23}$ is given by

$$\begin{aligned} \tau _{23} = - \mathrm {sgn}{X_0} \sqrt{\frac{Z_0-Z_\mathrm {m}}{4 a^2 b - \beta \left( Z_0-Z_\mathrm {m}\right) }}. \end{aligned}$$

(121)

4 Conclusions

It is common to describe the integral curves of the Colombo top problem as an intersection of a parabolic cylinder and a unit sphere, the latter being centered at the origin of the x, y, z coordinate system. We have proposed a new point of view: the curves can be tracked along the intersection of two out of the three invariant surfaces: a parabolic cylinder, a sphere, and a paraboloid of revolution. This thread has been merely signaled, but it can be of possible interest when designing geometric integrators for the numerical treatment of the Colombo top motion. Moreover, using the shifted coordinates X, Y, Z, one introduces the symmetry to the parabolic cylinder on the paraboloid (at the expense of having an off-centered sphere) which does simplify a number of expressions given in this work.

When partitioning the phase space of the Colombo top problem, we have completed the landscape, well known from earlier works, with an interesting but hitherto overlooked feature: the trajectory $\varGamma _3$ which is unique by being periodic, yet expressible in terms of elementary functions of time. Its presence calls for the distinction of $\mathrm {D}_3$ and $\mathrm {D}_4$ domains even though qualitatively they look similar. It also adds to a better understanding of the parametric bifurcation associated with the passage from type II, to type IV.

The analytical expressions for the turning points of the Colombo top trajectories as functions of energy, given in Sect. 2.3, had not been reported so far. The expressions for the location of the Cassini states from Sect. 2.4 depend only on parameters a, b. Up to some rearrangement of terms, they are similar to those of Saillenfest et al. (2019) in type II or III. For type IV, when $a^\frac{2}{3}+b^\frac{2}{3} < 1$, the Cardano form provided by Saillenfest et al. (2019) is formally correct, but it gives real values only as the sums of two complex conjugates (casus irreducibilis of the resolvent cubic). In the present work, we have preferred to use expressions based on the purely real trigonometric form whenever the quartic has a positive discriminant.

The differential equation for ${\dot{Z}}$, with its right-hand side proportional to the square root of the degree 4 polynomial, is not a novelty in celestial mechanics. The same form pops up while discussing the second fundamental model of resonance (Henrard and Lemaitre 1983). Its solution in terms of the Weierstrass elliptic function has always been given either in the simplified form of Eq. (55), as in Ferraz-Mello (2007), or in the Biermann-Weierstrass form (Nesvorný and Vokrouhlický 2016). We have taken an opportunity to recall other possibilities (Safford and Mordell forms) than can be of use in other applications as well.

We hope that the results of the present work will facilitate the study of perturbed Colombo top problems. They should be useful either as the kernel of analytical perturbation procedures, or as a building block of numerical integrators based upon composition methods.

Notes

In this paper, we use the Hessian covariant as defined by Janson (2011), which is the same as in Whittaker and Watson (1927). Its sign is opposite to the one originally applied by Mordell (1915).
In this approach, $\root 3 \of {x}$ of a real argument x is used as a real-valued function for $x<0$, i.e., $\root 3 \of {-1} = -1$.

References

Abramowitz, M., Stegun, I.A.: Handbook of Mathematical Functions. Applied Mathematics Series, vol. 55. National Bureau of Standards, Washington DC (1972)
MATH Google Scholar
Atobe, K., Ida, S.: Obliquity evolution of extrasolar terrestrial planets. Icarus 188(1), 1–17 (2007). https://doi.org/10.1016/j.icarus.2006.11.022. arXiv:astrop-ph/0611669
Article ADS Google Scholar
Atobe, K., Ida, S., Ito, T.: Obliquity variations of terrestrial planets in habitable zones. Icarus 168(2), 223–236 (2004). https://doi.org/10.1016/j.icarus.2003.11.017
Article ADS Google Scholar
Bianchi, L.: Lezioni Sulla Teoria delle Funzioni di Variabile Complessa e delle Funzioni Ellittiche. Enrico Spoerri, Pisa (1901)
MATH Google Scholar
Biermann, W.G.A.: Problemata Quaedam Mechanica Functionum Ellipticarum Ope Soluta. Carl Schulze, Berlin (1865)
Google Scholar
Boué, G., Laskar, J., Kuchynka, P.: Speed limit on Neptune migration imposed by Saturn tilting. Astrophys. J. Lett. 702(1), L19–L22 (2009). https://doi.org/10.1088/0004-637X/702/1/L19. arXiv:0909.0332
Article ADS Google Scholar
Brasser, R., Lee, M.H.: Tilting Saturn without tilting Jupiter: constraints on giant planet migration. Astron. J. 150(5), 157 (2015). https://doi.org/10.1088/0004-6256/150/5/157. arXiv:1509.06834
Article ADS Google Scholar
Brizard, A.J.: Notes on the Weierstrass Elliptic Function. arXiv e-prints arXiv:1510.07818 (2015)
Byrd, P.F., Friedman, M.D.: Handbook of Elliptic Integrals for Engineers and Scientists. Springer, Berlin (1971)
Book Google Scholar
Cassini, G.D.: De l’origine et du progrés de l’Astronomie, et de son usage dans la Geographie et dans la Navigation. In: Recueil d’observations faites en plusieurs voyages par ordre de Sa Majesté pour perfectionner l’astronomie et la geographie, l’Imprimerie Royale, Paris, pp. 1–43 (1693)
Colombo, G.: Cassini’s second and third laws. Astron. J. 71, 891–896 (1966). https://doi.org/10.1086/109983
Article ADS Google Scholar
Copernicus, N.: De Revolutionibus Orbium Coelestium. Johannes Petreius, Nuremberg (1543)
MATH Google Scholar
Correia, A.C.M., Laskar, J.: The four final rotation states of Venus. Nature 411(6839), 767–770 (2001). https://doi.org/10.1038/35081000
Article ADS Google Scholar
Efimov, S., Pritykin, D., Sidorenko, V.: Long-term attitude dynamics of space debris in Sun-synchronous orbits: Cassini cycles and chaotic stabilization. Celest Mech. Dyn. Astron. 130(10), 62 (2018). https://doi.org/10.1007/s10569-018-9854-4. arXiv:1712.08596
Article ADS MathSciNet Google Scholar
Enneper, A.: Elliptische Functionen. Verlag von Luis Nebert, Halle, Theorie und Geschichte (1890)
Ferraz-Mello, S.: Canonical Perturbation Theories—Degenerate Systems and Resonance. Springer, New York (2007). https://doi.org/10.1007/978-0-387-38905-9
Book MATH Google Scholar
Hamilton, D.P., Ward, W.R.: Tilting Saturn II. Numerical model. Astron. J. 128(5), 2510–2517 (2004). https://doi.org/10.1086/424534
Article ADS Google Scholar
Harris, A.W., Ward, W.R.: Dynamical constraints on the formation and evolution of planetary bodies. Ann. Rev. Earth Planet Sci. 10, 61 (1982). https://doi.org/10.1146/annurev.ea.10.050182.000425
Article ADS Google Scholar
Henrard, J.: Dynamics Reported, vol 2, Springer, Berlin, Heidelberg, chap The Adiabatic Invariant in Classical Mechanics, pp. 117–235. https://doi.org/10.1007/978-3-642-61232-9_4 (1993)
Chapter Google Scholar
Henrard, J., Lemaitre, A.: A second fundamental model for resonance. Celest Mech. 30(2), 197–218 (1983). https://doi.org/10.1007/BF01234306
Article ADS MathSciNet MATH Google Scholar
Henrard, J., Murigande, C.: Colombo’s top. Celest Mech. 40, 345–366 (1987). https://doi.org/10.1007/BF01235852
Article ADS MathSciNet MATH Google Scholar
Janson, S.: Invariants of polynomials and binary forms. arXiv e-prints arXiv:1102.3568 (2011)
Lagrange, J.L.: Recherches sur la libration de la Lune. Prix de l’Academie Royale des Sciences de Paris IX:1–50 (1764)
Lagrange, J.L.: Théorie de la libration de la Lune. Nouveau Mémoires de l’Academie Royale des Sciences et Belles-Lettres de Berlin (unnumbered):203–308 (1780)
Lanchares, V., Elipe, A.: Bifurcations in biparametric quadratic potentials. Chaos 5(2), 367–373 (1995). https://doi.org/10.1063/1.166107
Article ADS MathSciNet MATH Google Scholar
Laskar, J., Robutel, P.: The chaotic obliquity of the planets. Nature 361(6413), 608–612 (1993). https://doi.org/10.1038/361608a0
Article ADS Google Scholar
Laskar, J., Joutel, F., Robutel, P.: Stabilization of the Earth’s obliquity by the Moon. Nature 361(6413), 615–617 (1993). https://doi.org/10.1038/361615a0
Article ADS Google Scholar
Mordell, L.J.: Indeterminate equations of the third and fourth degrees. Q. J. Pure Appl. Math. 45, 170–186 (1914)
MATH Google Scholar
Mordell, L.J.: The inversion of the integral (...). Messenger Math. 44, 138–141 (1915)
Google Scholar
Nesvorný, D., Vokrouhlický, D.: Dynamics and transit variations of resonant exoplanets. Astrophys. J. 823(2), 72 (2016). https://doi.org/10.3847/0004-637X/823/2/72. arXiv:1603.07306
Article ADS Google Scholar
Neumark, S.: Solution of Cubic and Quartic Equations. Pergamon Press, Oxford (1965)
MATH Google Scholar
Peale, S.J.: Possible histories of the obliquity of Mercury. Astron. J. 79, 722 (1974). https://doi.org/10.1086/111604
Article ADS Google Scholar
Rogoszinski, Z., Hamilton, D.P.: Tilting ice giants with a spin-orbit resonance. Astrophys. J. 888(2), 60 (2020). https://doi.org/10.3847/1538-4357/ab5d35. arXiv:1908.10969
Article ADS Google Scholar
Safford, F.H.: Reduction of the elliptic element to the Weierstrass form. Bull. Am. Math. Soc. 26(1), 13–16 (1919)
Article MathSciNet Google Scholar
Saillenfest, M., Laskar, J., Boué, G.: Secular spin-axis dynamics of exoplanets. Astron. Astroph. 623, A4 (2019). https://doi.org/10.1051/0004-6361/201834344. arXiv:1901.02831
Article ADS Google Scholar
Tisserand, F.: Traité de Mécanique Céleste, vol. 2. Gauthier-Villars, Paris (1891)
MATH Google Scholar
Vokrouhlický, D., Nesvorný, D.: Tilting Jupiter (a bit) and Saturn (a lot) during planetary migration. Astrophys. J. 806(1), 143 (2015). https://doi.org/10.1088/0004-637X/806/1/143. arXiv:1505.02938
Article ADS Google Scholar
Vokrouhlický, D., Nesvorný, D., Bottke, W.F.: The vector alignments of asteroid spins by thermal torques. Nature 425(6954), 147–151 (2003). https://doi.org/10.1038/nature01948
Article ADS Google Scholar
Vokrouhlický, D., Nesvorný, D., Bottke, W.F.: Secular spin dynamics of inner main-belt asteroids. Icarus 184(1), 1–28 (2006). https://doi.org/10.1016/j.icarus.2006.04.007
Article ADS Google Scholar
Ward, W.R.: Large-scale variations in the obliquity of Mars. Science 181(4096), 260–262 (1973). https://doi.org/10.1126/science.181.4096.260
Article ADS Google Scholar
Ward, W.R.: Climatic variations on Mars: 1. Astronomical theory of insolation. J. Geophys. Res. 79(24), 3375–3386 (1974). https://doi.org/10.1029/JC079i024p03375
Article ADS Google Scholar
Ward, W.R.: Past orientation of the Lunar spin axis. Science 189(4200), 377–379 (1975). https://doi.org/10.1126/science.189.4200.377
Article ADS Google Scholar
Ward, W.R.: Comments on the long-term stability of the Earth’s obliquity. Icarus 50(2–3), 444–448 (1982). https://doi.org/10.1016/0019-1035(82)90134-8
Article ADS Google Scholar
Ward, W.R., Canup, R.M.: The obliquity of Jupiter. Astrophys. J. Lett. 640(1), L91–L94 (2006). https://doi.org/10.1086/503156
Article ADS Google Scholar
Ward, W.R., de Campli, W.M.: Comments on the Venus rotation pole. Astrophys. J. Lett. 230, L117–L121 (1979). https://doi.org/10.1086/182974
Article ADS Google Scholar
Ward, W.R., Hamilton, D.P.: Tilting Saturn. I. Analytic model. Astron. J. 128(5), 2501–2509 (2004). https://doi.org/10.1086/424533
Article ADS Google Scholar
Whittaker, E.T., Watson, G.N.: A Course of Modern Analysis, 4th edn. Cambridge University Press, Cambridge (1927)
MATH Google Scholar

Download references

Acknowledgements

The work of DV was funded by the Czech Science Foundation (Grant 18-06083S).

Author information

Authors and Affiliations

Astronomical Observatory Institute, Faculty of Physics, Adam Mickiewicz University, Sloneczna 36, 61-286, Poznan, Poland
J. Haponiak & S. Breiter
Institute of Astronomy, Charles University, Prague, V Holešovičkách 2, 180 00, Prague 8, Czech Republic
D. Vokrouhlický

Authors

J. Haponiak
View author publications
You can also search for this author in PubMed Google Scholar
S. Breiter
View author publications
You can also search for this author in PubMed Google Scholar
D. Vokrouhlický
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to S. Breiter.

Ethics declarations

Conflict of interest

The authors S. Breiter, J. Haponiak, and D. Vokrouhlický declare that they have no conflict of interest. The article is partially based upon the doctoral thesis prepared by J. Haponiak, but it includes the results obtained independently by the co-authors.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

A The roots of $W(Z)=0$

1.1 A.1 Factorization

In order to find the zeroes of the W(Z), let us first write it in the ‘classical’ polynomial form

$$\begin{aligned} W(Z) = a_0 Z^4 + 4 a_1 Z^3 + 6 a_2 Z^2 + 4 a_3 Z + a_4, \end{aligned}$$

(122)

where according to Eq. (17) the coefficients are

$$\begin{aligned} a_0 = - 1, \quad a_1 = 0, \quad a_2 = - \frac{2 E}{3}, \quad a_3 = - 2 a^2 b, \quad a_4 = - 4 E^2 + 8 a^2 E - 4 a^2 \left( a^2+b^2-1\right) . \end{aligned}$$

(123)

Note the absence of the cubic term ($a_1=0$), meaning that equation $W(Z)=0$ is already in the reduced form.

Solving equation $W(Z)=0$, we essentially follow a simplified and slightly reformulated procedure of Neumark (1965). In particular, the absence of the third power of Z allows the factorization

$$\begin{aligned} W(Z) = a_0\, W_+(Z) \, W_-(Z) = a_0\, \left( Z^2 + 2 \sqrt{\xi }\, Z + h_+ \right) \left( Z^2 - 2 \sqrt{\xi }\, Z + h_- \right) , \end{aligned}$$

(124)

with only three parameters ($\xi $, $h_+$, $h_-$), and three conditions resulting from equating the coefficients of W and $W_+ W_-$:

$$\begin{aligned} h_- h_+ = - a_4, \qquad h_- - h_+ = \frac{4 a^2 b}{\sqrt{\xi }}, \qquad h_- +h_+ = 4 \left( E + \xi \right) . \end{aligned}$$

(125)

The last two equations are easily solved for $h_-$ and $h_+$

$$\begin{aligned} h_\pm = 2 \left( E + \xi \mp \frac{a^2 b}{\sqrt{\xi }} \right) , \end{aligned}$$

(126)

so the first of Eq. (125), after the substitution of Eq. (126), is actually the resolvent cubic equation

$$\begin{aligned} \xi ^3 + 2 E \xi ^2 + a^2 \left( 3 \rho + 2 E \right) \xi - a^4 b^2 = 0, \end{aligned}$$

(127)

where $3\rho = 1-a^2-b^2$, according to Eq. (30). The Descartes rule guarantees that (for nonzero a and b) the resolvent has at least one positive real root to be used in factorization (124).

Before we proceed to solving the resolvent, let us make some important remarks. The four roots of the quartic equation $W(Z)=0$ come in two pairs of the roots of $W_+(Z)=0$ and $W_-(Z)=0$, i.e.,

$$\begin{aligned} W_+(Z)=0:&~~&Z_1 = - \sqrt{\xi } - \sqrt{\xi -h_+}, \qquad Z_2 = - \sqrt{\xi } + \sqrt{\xi -h_+}, \nonumber \\ W_-(Z)=0:&~~&Z_3 = \sqrt{\xi } - \sqrt{\xi -h_-}, \qquad Z_4 = \sqrt{\xi } + \sqrt{\xi -h_-}. \end{aligned}$$

(128)

Let $\xi _1$ be the only, or the greatest positive root of the resolvent (127). Then, from the Vieta’s formulas, we find for the remaining two roots $\xi _2+\xi _3 = - 2 E - \xi _1$, and $\xi _2 \xi _3 = a^4 b^2 \xi _1^{-1}$, which allows to see that

$$\begin{aligned} \sqrt{\xi _2} \pm \sqrt{\xi _3}= \sqrt{\xi _2+\xi _3 \pm 2 \sqrt{\xi _2 \xi _3}} =\sqrt{ -\xi _1-2 E \pm \frac{ 2 a^2 b}{\sqrt{ \xi _1}}} = \sqrt{\xi _1 - h_\pm }. \end{aligned}$$

(129)

This leads to the Euler form of the solution

$$\begin{aligned}&Z_1 = - \sqrt{\xi _1} - \sqrt{\xi _2} -\sqrt{\xi _3}, \quad Z_2 = - \sqrt{\xi _1} + \sqrt{\xi _2} + \sqrt{\xi _3}, \nonumber \\&Z_3 = \sqrt{\xi _1} - \sqrt{\xi _2} + \sqrt{\xi _3}, \quad Z_4 = \sqrt{\xi _1} + \sqrt{\xi _2}- \sqrt{\xi _3}. \end{aligned}$$

(130)

Assuming for the real roots $0 < \xi _3 \le \xi _2 \le \xi _1$, we guarantee a number of properties like the ordering $Z_1 \le Z_2 \le Z_3 \le Z_4$, the fact that a given trajectory contains only a pair $(Z_1,Z_2)$ or $(Z_3,Z_4)$, and that if $\xi _2,\xi _3$ are complex conjugates, then $Z_1$ and $Z_2$ remain real, whereas $Z_3$ and $Z_4$ become complex.

1.2 A.2 Weierstrass resolvent and its roots

The cubic resolvent equation (127) can be brought to a reduced form without the square term in a number of ways. We choose the substitution based upon the seminvariant (Janson 2011)

$$\begin{aligned} P_2 = a_0 a_2 - a_1^2 = \frac{2}{3} E, \end{aligned}$$

(131)

with

$$\begin{aligned} \xi = s - P_2 = s - \frac{2}{3} E. \end{aligned}$$

(132)

Applying it to Eq. (127), and multiplying both sides by 4, we obtain

$$\begin{aligned} S(s) = 4 s^3 - g_2 s - g_3 = 0. \end{aligned}$$

(133)

The cubic polynomial S(s) plays a special role in the theory of the Weierstrass elliptic functions; thus, let us call it the Weierstrass resolvent. The symbols $g_2$ and $g_3$ that appear in Eq. (133) are the two, algebraically independent, basis invariants of the quartic W(Z):

The apolar invariant of degree 2
$$\begin{aligned} g_2 = a_0 a_4 + 3 a_2^2 - 4 a_1 a_3 = 12 \left[ \left( \frac{2 E}{3}\right) ^2 - a^2 \left( \frac{2 E}{3}\right) - a^2 \rho \right] , \end{aligned}$$
(134)
The Hankel determinant of degree 3
$$\begin{aligned} g_3 = \left| \begin{array}{ccc} a_0 &{} a_1 &{} a_2 \\ a_1 &{} a_2 &{} a_3 \\ a_2 &{} a_3 &{} a_4 \\ \end{array} \right| = -8 \left( \frac{2 E}{3}\right) ^3 + 12 a^2 \left( \frac{2 E}{3}\right) ^2 +12 a^2 \rho \left( \frac{2 E}{3}\right) + 4 a^4 b^2, \end{aligned}$$
(135)
known also as a catalecticant (Janson 2011).

Notably, both the discriminants: $\varDelta _4$ of the quartic W(Z) and $\varDelta _3$ of the cubic S(s) are not only expressible in terms of $g_2$ and $g_3$, but they are equal up to a constant factor. If

$$\begin{aligned} \varDelta = \left( \frac{g_2}{3}\right) ^3- g_3^2, \end{aligned}$$

(136)

then $\varDelta _4 = 16 \varDelta _3 = 4^4\,3^3 \varDelta $.

Let the roots of the Weierstrass resolvent equation $S(s)=0$ be $s=e_1$, $s=e_2$, and $s=e_3$. By the Vieta’s formulae, they satisfy

$$\begin{aligned} e_1+e_2+e_3=0, \qquad e_1e_2+e_1 e_3 + e_2 e_3 = - \frac{g_2}{4}, \qquad e_1 e_2 e_3 = \frac{g_3}{4}, \end{aligned}$$

(137)

and by the definition of the scaled discriminant (136)

$$\begin{aligned} \left( e_1-e_2\right) ^2\left( e_1-e_3\right) ^2\left( e_2-e_3\right) ^2= \frac{27\varDelta }{16}. \end{aligned}$$

(138)

Introducing auxiliary quantities $\beta $ and $\phi $, such that

$$\begin{aligned} g_2 = 3 \beta ^2, \qquad g_3 = \beta ^3 \cos {\phi }, \qquad \varDelta = \beta ^6 \sin ^2\phi ; \end{aligned}$$

(139)

hence,

$$\begin{aligned} \beta = \sqrt{\frac{g_2}{3}}, \qquad \cos \phi = g_3 \left( \frac{3}{g_2}\right) ^\frac{3}{2}, \end{aligned}$$

(140)

we can establish the universal formula for the roots (Brizard 2015)

$$\begin{aligned} e_1= & {} \beta \cos {\frac{\phi }{3}}, \nonumber \\ e_2= & {} \beta \cos {\frac{\phi -2\pi }{3}} = - \frac{e_1}{2} + \frac{\sqrt{3}}{2} \beta \sin {\frac{\phi }{3}} = - \frac{e_1}{2} + \frac{\sqrt{g_2-3 e_1^2}}{2}, \nonumber \\ e_3= & {} \beta \cos {\frac{\phi +2\pi }{3}} = - \frac{e_1}{2}- \frac{\sqrt{3}}{2} \beta \sin {\frac{\phi }{3}} = - \frac{e_1}{2} - \frac{\sqrt{g_2-3 e_1^2}}{2}. \end{aligned}$$

(141)

If $\varDelta > 0$ (hence $g_2>0$), there are three simple real roots $e_3< e_2 < e_1$ given directly by Eq. (141). When $\varDelta < 0$, there is one real root $e_1$ and two complex ones, with $e_2 = {\overline{e}}_3$. Equations (139) and (141) remain valid in principle, but they involve complex quantities and require distinguishing the sign of $g_2$. In these circumstances, it is more convenient to use the Cardano form for the real root^{Footnote 2}

$$\begin{aligned} e_1 = \frac{1}{2} \left( \root 3 \of {g_3 + \sqrt{ -\varDelta }} + \root 3 \of {g_3 - \sqrt{ -\varDelta }}\right) . \end{aligned}$$

(142)

and

$$\begin{aligned} e_2 = - \frac{e_1- \mathrm {i} e_\mathrm {c}}{2}, \qquad e_3 = {\bar{e}}_2 = - \frac{e_1 + \mathrm {i} e_\mathrm {c}}{2}, \end{aligned}$$

(143)

with

$$\begin{aligned} e_\mathrm {c} = \sqrt{3 e_1^2- g_2} = \frac{\sqrt{3}}{2} \left( \root 3 \of {g_3 + \sqrt{ -\varDelta }} - \root 3 \of {g_3 - \sqrt{ -\varDelta }} \right) , \end{aligned}$$

(144)

for the complex roots.

Finally, the degeneracy $\varDelta =0$ implies real roots: one simple and one double, or one triple root. The former case requires $g_3 \ne 0$ and $g_2 > 0$; then, according to the sign of $g_3$, either

$$\begin{aligned} e_1 = \root 3 \of {g_3}=\sqrt{\frac{g_2}{3}}, \qquad e_2=e_3=e_{23} = - \frac{e_1}{2}, \text{ for } g_3 > 0 \end{aligned}$$

(145)

or

$$\begin{aligned} e_3 = \root 3 \of {g_3}= -\sqrt{\frac{g_2}{3}}, \qquad e_1 = e_2 = e_{12} = - \frac{e_3}{2}, \text{ for } g_3 < 0. \end{aligned}$$

(146)

The ordering of roots in (146) is exceptional ($e_3$ is the greatest), but helps to maintain a coherent notation in further applications.

The triple root $e_{123}=0$ may appear only for $g_2=g_3=0$, which is possible only when $a^\frac{2}{3} + b^\frac{2}{3} = 1$.

1.3 A.3 The roots $Z_j$

Although the roots of W(Z) are to be expressed in terms of the roots of S(s), we need to include in the discussion not only the invariants $\varDelta $, $g_2$, and $g_3$, but also seminvariants $P_2$ and

$$\begin{aligned} Q_2 = 2 a_0^2 a_3 - 6 a_0 a_1 a_2 + 4 a_1^3 = - 4 a^2 b. \end{aligned}$$

(147)

This is due to the fact that although formally it is enough to substitute

$$\begin{aligned} \xi _j = e_j - \frac{2}{3} E = e_j - P_2, \end{aligned}$$

(148)

into (130), the signs of $\xi _j$ play a significant role in determining which of the roots are real and which are complex, and there are various ways to create multiple roots.

In the following discussion, we will refer to the Theorem 9.3 of Janson (2011), adjusted to the different scaling of our invariants and seminvariant (namely, his $P= 48 P_2$, $Q=16 Q_2$, $J= 432 g_3$, and $I=12 g_2$).

1.3.1 A.3.1 Four simple real roots ($\varDelta >0$ and all $\xi _j \ge 0$)

If the four real roots exist, they take the form (130) with $\xi _j$ defined in Eq. (148) and $e_j$ as in (141). This requires not only $\varDelta > 0$ to have three simple real roots $e_j$, but also that $\xi _j \ge 0$ for each $j \in \{1,2,3\}$. The latter is secured by $P_2 < 0$ and $12 P_2^2 - a_0^2 g_2 \ge 0$ (Janson 2011). Substituting (131), (134), and (123), we obtain

$$\begin{aligned} \varDelta > 0, \text{ and } - \rho \le \frac{2E}{3} < 0, \end{aligned}$$

(149)

as the condition for the real quadruple $Z_1< Z_2< Z_3 < Z_4$. If $\varDelta $ is positive, but the second condition in (149) is not fulfilled, there are no real roots, and $Z_j$ form two distinct pairs of complex conjugate numbers.

1.3.2 A.3.2 Two simple real roots ($\varDelta < 0$)

When $e_2$ and $e_3$ are complex, the pair $(Z_3,Z_4)$ is complex, whereas $(Z_1,Z_2)$ in the formula (130) formally remain real-valued, yet only by canceling the imaginary parts. Using $\xi _1 = e_1 - P_2$, with $e_1$ given by Eq. (142), we can obtain $Z_1$ and $Z_2$ directly from Eqs. (128) and (126). Alternatively, we can find the expressions for $\sqrt{\xi _2} \pm \sqrt{\xi _3}$, which results in

$$\begin{aligned} Z_1 = - \sqrt{\xi _1} - \sqrt{2|\xi _2| - \xi _1 - 3 P_2}, \qquad Z_2 = - \sqrt{\xi _1} + \sqrt{2 |\xi _2| - \xi _1 - 3 P_2}, \end{aligned}$$

(150)

and

$$\begin{aligned} Z_3 = \sqrt{\xi _1} - \mathrm {i}\, \sqrt{ 2 |\xi _2| + \xi _1 + 3 P_2}, \qquad Z_4 = {\overline{Z}}_3, \end{aligned}$$

(151)

where

$$\begin{aligned} \xi _1 = e_1 - P_2, \qquad 2 |\xi _2| = 2 |\xi _3| = |e_1 + 2 P_2 + \mathrm {i} e_\mathrm {c}| = \sqrt{\left( e_1+ 2 P_2\right) ^2 + e_\mathrm {c}^2}, \end{aligned}$$

(152)

with $e_1$ given by Eq. (142).

1.3.3 A.3.3 Multiple roots ($\varDelta = 0$)

The statement $\varDelta =0$ means only that at least one of the roots is at least a double root. Further distinction is based upon the signs and values of $g_2$, $g_3$ and $P_2$. Let us inspect five possibilities involving multiple real roots from the Theorem 9.3 of Janson (2011).

A quadruple real root is not possible, because it requires $P_2=g_2=g_3=0$, whereas substituting $E=0$ we obtain $g_3= 4 a^4 b^2 \ne 0$. Two real double roots are also impossible, because they require (among other conditions) that $Q_2 = 0$, which is not the case. The remaining three cases are the following.

1.
A triple real root and one single real root appear when $g_2=g_3=0$, and $P_2 < 0$. Taking the resultant of $g_2$ and $g_3$ considered as the polynomials in E, one finds that both the invariants admit the common root if
$$\begin{aligned} a^\frac{2}{3} + b^\frac{2}{3} = 1, \end{aligned}$$
(153)
the relation well known from Henrard and Murigande (1987). With this constraint, $g_2=0$ can be solved to give a unique negative root
$$\begin{aligned} E = - \frac{3}{2} \left( a^2 b\right) ^\frac{2}{3}. \end{aligned}$$
(154)
According to the statement below Eq. (146), $g_2=g_3=0$ refers to the triple root $e_{123}=0$; hence, with $\xi _1=\xi _2=\xi _3= -\frac{2}{3}E$, we obtain
$$\begin{aligned} Z_1 = - 3 \root 3 \of {a^2 b}, \qquad Z_{234} = \root 3 \of {a^2 b}, \end{aligned}$$
(155)
where $Z_1$ is the single, and $Z_{234}$ is the triple root.
2.
Two simple real roots $Z_1$, $Z_2$ and double real root $Z_{34}$ appear when $g_2 > 0$, $P_2<0$, and $12 P_2^2-g_2 > 0$. So, if E is a real root of $\varDelta = 0$ in the interval
$$\begin{aligned} -\rho< \frac{2E}{3} < 0, \end{aligned}$$
(156)
then either
$$\begin{aligned} Z_1 = - \sqrt{\xi _1} - 2 \sqrt{\xi _{23}}, \qquad Z_2 = - \sqrt{\xi _1} + 2 \sqrt{\xi _{23}}, \qquad Z_{34} = \sqrt{\xi _1}, \qquad \text{ for } g_3 > 0, \end{aligned}$$
(157)
or
$$\begin{aligned} Z_1 = - \sqrt{\xi _3} - 2 \sqrt{\xi _{12}}, \qquad Z_{23} = \sqrt{\xi _3}, \qquad Z_{4} = - \sqrt{\xi _3} + 2 \sqrt{\xi _{12}}, \qquad \text{ for } g_3 < 0, \end{aligned}$$
(158)
where $\xi _i = e_i - P_2$, and $\xi _{ij}=e_{ij}-P_2$, with the Weierstrass resolvent roots given by Eq. (145) or (146), according to the sign of $g_3$.
3.
If the energy E is a real root of $\varDelta =0$ outside the interval (156), i.e.,
$$\begin{aligned} \frac{2E}{3} < -\rho , \text{ or } \left( E > 0, \text{ and } \frac{2E}{3} \ne -\rho \right) , \end{aligned}$$
(159)
then a double real root $Z_{12}$ is accompanied by two simple complex roots $Z_3$ and $Z_4$. This case appears when $e_\mathrm {c}=0$ in Eq. (143). Accordingly,
$$\begin{aligned} Z_{12} = - \sqrt{ \xi _1 } = - \sqrt{e_1-P_2}, \end{aligned}$$
(160)
where $e_1$ is given by Eq. (145).

B Cassini states coordinates $z^*_j$

The left-hand side of the quartic equation $w(z-b)=0$ is the polynomial

$$\begin{aligned} w(z-b) = a'_0 z^4 + 4 a'_1 z^3 + 6 a'_2 z^2 + 4 a'_3 z + a'_4, \end{aligned}$$

(161)

where

$$\begin{aligned} a'_0 = -1, \quad a'_1 = \frac{b}{2}, \quad a'_2 = \frac{1-a^2-b^2}{6} = \frac{\rho }{2} , \quad a'_3 = - \frac{b}{2}, \quad a'_4 = b^2. \end{aligned}$$

(162)

Evaluating the invariants and seminvariants from the primed coefficients, we find

$$\begin{aligned} g'_2 = \frac{3\rho ^2}{4} \ge 0, \qquad g'_3 = -\left( \frac{\rho }{2}\right) ^3 + \frac{a^2 b^2}{4}, \qquad P'_2 = - \frac{\rho }{2} - \frac{b^2}{4},\qquad Q'_2 = - \frac{ \left( 1 + a^2\right) b}{2} < 0, \end{aligned}$$

(163)

ant the scaled discriminant is

$$\begin{aligned} \varDelta ' = \left( \frac{g'_2}{3}\right) ^3 - \left( g'_3\right) ^2 = \frac{a^2 b^2}{16}\left( \rho ^3 - a^2 b^2 \right) . \end{aligned}$$

(164)

The transformation

$$\begin{aligned} z = z_\mathrm {r}+\frac{b}{2}, \end{aligned}$$

(165)

converts the equation $w(z-b)=0$ into $w(z_\mathrm {r}-b/2)=0$, which is free of the $z_\mathrm {r}^3$ term, and so is ready for the factorization from Sect. A.1. Fortunately, we do not need to know the coefficients of the equation in $z_\mathrm {r}$, because we require only the invariants (163) which are conserved under the simple transformation (165). Thus, tracing backward the procedure from “Appendix A”, we start with solving the Weierstrass resolvent

$$\begin{aligned} 4 s^3 - g'_2 s - g'_3 = 0, \end{aligned}$$

(166)

finding the roots $e'_1$, $e'_2$, and $e'_3$. These define

$$\begin{aligned} \xi '_j = e'_j - P'_2 = e'_j + \frac{\rho }{2} + \frac{b^2}{4}, \end{aligned}$$

(167)

as in Eqs. (132) and (148). Finally, four roots $z_{\mathrm {r},i}$ are given by Eq. (130) with $Z_i \rightarrow z_{\mathrm {r},i}, \xi _j \rightarrow \xi '_j$, and then $z^*_i = z_{\mathrm {r},i} + (b/2)$. Each real solution $z^*_i$ is the z coordinate of some Cassini state.

The number of real roots of $w(z-b)=0$ depends on the sign of $\varDelta '$.

1.
$\varDelta ' > 0$ is equivalent to $a^\frac{2}{3}+b^\frac{2}{3} < 1$. Three real roots $e'_1>e'_2>e'_3$ of the resolvent (166) are, by analogy with (141),
$$\begin{aligned} e'_1= & {} \frac{\rho }{2} \cos {\frac{\phi '}{3}}, \nonumber \\ e'_2= & {} \frac{\rho }{2} \cos {\frac{\phi '-2\pi }{3}} = - \frac{e'_1}{2} + \frac{\sqrt{g'_2-3 (e'_1)^2}}{2}, \nonumber \\ e'_3= & {} \frac{\rho }{2} \cos {\frac{\phi '+2\pi }{3}} = - \frac{e'_1}{2} - \frac{\sqrt{g'_2-3 ( e'_1)^2}}{2}, \end{aligned}$$
(168)
where
$$\begin{aligned} \phi ' = \arccos \left( \frac{2 a^2 b^2}{\rho ^3}-1\right) = \pi - 2 \arcsin \left( \frac{a b}{\rho ^\frac{3}{2}}\right) . \end{aligned}$$
(169)
They always define four real roots $z^*_i$, because all $\xi '_j$ are positive. Indeed
$$\begin{aligned} \xi '_1= & {} \frac{b^2}{4} + \frac{\rho }{2}\left( 1+\cos {\frac{\phi '}{3}}\right) =\frac{b^2}{4}+ \rho \cos ^2\frac{\phi '}{6}> 0, \nonumber \\ \xi '_2= & {} \frac{b^2}{4} + \frac{\rho }{2}\left( 1+\cos {\frac{\phi '-2\pi }{3}}\right) = \frac{b^2}{4}+ \rho \cos ^2\frac{\phi '-2\pi }{6}> 0, \nonumber \\ \xi '_3= & {} \frac{b^2}{4} + \frac{\rho }{2}\left( 1+\cos {\frac{\phi '+2\pi }{3}}\right) = \frac{b^2}{4}+ \rho \cos ^2\frac{\phi '+2\pi }{6} > 0. \end{aligned}$$
(170)
Unlike in eq. (130), we label the roots $z^*_i$ not according to their ordering in magnitude, but so that the subscript i matches the Cassini state label $C_i$ according to Colombo (1966), it is
$$\begin{aligned}&z^*_1 = \frac{b}{2} + \sqrt{\xi '_1} + \sqrt{\xi '_2} - \sqrt{\xi '_3}, \quad z^*_2 = \frac{b}{2} - \sqrt{\xi '_1} + \sqrt{\xi '_2} + \sqrt{\xi '_3}, \nonumber \\&z^*_3 = \frac{b}{2} - \sqrt{\xi '_1} - \sqrt{\xi '_2} - \sqrt{\xi '_3}, \quad z^*_4 = \frac{b}{2} + \sqrt{\xi '_1} - \sqrt{\xi '_2} + \sqrt{\xi '_3}, \end{aligned}$$
(171)
with $z^*_3< z^*_2< z^*_4 < z^*_1$, as expected.
2.
$\varDelta ' < 0$ means $a^\frac{2}{3}+b^\frac{2}{3} > 1$. The Weierstrass resolvent has one real root $e'_1$ and two complex roots $e'_2 = \overline{e'}_3$ given by Eqs. (142), (143), and (144) with $g'_2$, $g'_3$, and $\varDelta '$. Accordingly, we obtain two real roots $z^*_i$—the ones involving $\sqrt{\xi '_2}+\sqrt{\xi '_3}$, where the imaginary part cancels out. Adapting the expressions (150) and (152), and adjusting the subscripts of $z^*_i$ to the Cassini states $C_3$ and $C_2$, we obtain two real roots of $w(z-b)=0$ as
$$\begin{aligned} z^*_3 = \frac{b}{2} - \sqrt{\xi '_1} - \sqrt{2|\xi '_2| - \xi '_1 - 3 P'_2}, \qquad z^*_2 =\frac{b}{2} - \sqrt{\xi '_1} + \sqrt{2 |\xi '_2| - \xi '_1 - 3 P'_2}, \end{aligned}$$
(172)
with $z^*_3 < z^*_2$. The final substitution is made in Sect. 2.4.
3.
$\varDelta '=0$, hence $a^\frac{2}{3}+b^\frac{2}{3} = 1$, implies one single and one double real root of the Weierstrass resolvent. A triple root is excluded, because $g'_2 = \frac{3}{4}(ab)^\frac{4}{3} \ne 0$. Thus, observing that $g'_3 = a^2 b^2/8 > 0$, and $\rho = (ab)^\frac{2}{3}$,
$$\begin{aligned} e'_{1} = \frac{(ab)^\frac{2}{3}}{2}, \qquad e'_{23} = - \frac{ e'_{1}}{2}, \qquad \xi '_{1}= \frac{4 a^\frac{2}{3}b^\frac{2}{3} + b^2}{4}, \qquad \xi '_{23} = \frac{a^\frac{2}{3}b^\frac{2}{3} + b^2}{4}, \end{aligned}$$
(173)
provide three Cassini states: two usual $C_3$, $C_2$ and one degenerate $C_{14}$, with
$$\begin{aligned} z^*_3 = \frac{b}{2}- \sqrt{\xi '_1} - 2 \sqrt{\xi '_{23}}, \qquad z^*_2 = \frac{b}{2} - \sqrt{\xi '_1} + 2 \sqrt{\xi '_{23}}, \qquad z^*_{14} = \frac{b}{2} + \sqrt{\xi '_1}, \end{aligned}$$
(174)
listed in the ascending order.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Haponiak, J., Breiter, S. & Vokrouhlický, D. Analytical solution of the Colombo top problem. Celest Mech Dyn Astr 132, 24 (2020). https://doi.org/10.1007/s10569-020-09963-7

Download citation

Received: 22 January 2020
Accepted: 28 March 2020
Published: 29 April 2020
DOI: https://doi.org/10.1007/s10569-020-09963-7

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Analytical solution of the Colombo top problem

Abstract

1 Introduction

2 The Colombo top model

2.1 Equations of motion

2.2 Geometric interpretation

2.3 Intersections with the plane \(x=0\)

2.4 Cassini states and homoclinic orbits

2.4.1 Type II

2.4.2 Type IV

2.4.3 Type III

3 Analytical solution

3.1 Weierstrass form

3.1.1 General framework

3.1.2 Initial conditions at turning point

3.1.3 Arbitrary initial conditions: Weierstrass–Biermann form

3.1.4 Arbitrary initial conditions: Safford form

3.1.5 Arbitrary initial conditions: Mordell form

3.2 Solution in terms of the Jacobi elliptic functions

3.2.1 Weierstrass functions in terms of the Jacobi functions

3.2.2 Motion in \(\mathrm {D}_1\) and \(\mathrm {D}_4\)

3.2.3 Motion in \(\mathrm {D}_2\), \(\mathrm {D}_3\), and \(\mathrm {D}_{23}\)

3.3 Special cases

3.3.1 Reduction rules

3.3.2 Cassini states \(C_2\) and \(C_3\)

3.3.3 Cassini state \(C_4\) and homoclinic orbits \(\varGamma _1\), \(\varGamma _2\)

3.3.4 Cassini state \(C_1\) and special orbit \(\varGamma _3\)

3.3.5 Cassini state \(C_{14}\) and special orbit \(\varGamma _{23}\)

4 Conclusions

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Appendices

A The roots of \(W(Z)=0\)

1.1 A.1 Factorization

1.2 A.2 Weierstrass resolvent and its roots

1.3 A.3 The roots \(Z_j\)

1.3.1 A.3.1 Four simple real roots (\(\varDelta >0\) and all \(\xi _j \ge 0\))

1.3.2 A.3.2 Two simple real roots (\(\varDelta < 0\))

1.3.3 A.3.3 Multiple roots (\(\varDelta = 0\))

B Cassini states coordinates \(z^*_j\)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation