Abstract
A normal form theory for non-quasiperiodic systems is combined with the special properties of the partially averaged Newtonian potential pointed out in Pinzari (Celest Mech Dyn Astron 131(5):22, 2019) to prove, in the averaged, planar three-body problem, the existence of a plenty of motions where, periodically, the perihelion of the inner body affords librations about one equilibrium position and its ellipse squeezes to a segment before reversing its direction and again decreasing its eccentricity (perihelion librations).
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
This paper deals with certain motions of three point masses undergoing Newtonian attraction. More precisely, we study the case of two light bodies orbiting their common center of mass (a “binary asteroid system”) while interacting with a heavier mass (a “planet”), whose position is external to their trajectories. As no Newtonian interaction can be neglected, there is no reason to claim that the system undergoes the “Keplerian” approximation, successfully used in the so-called planetary modelFootnote 1 (Arnold 1963b; Féjoz 2004; Laskar and Robutel 1995; Pinzari 2009; Chierchia and Pinzari 2011). We think to the case that the time of mutual revolution of the lighter bodies is much shorter than the timescale of the motions of the massive body. We also assume that the ratio \(\epsilon \) between the semi-axis of the ellipse of the asteroids and the position ray of the heavier one keeps to be less than \(\frac{1}{2}\), so that collisions between the asteroids and the planet are not possible. Recall in fact that a body moving on a Keplerian ellipse does not go beyond twice the semimajor axis from its focus. We look at the system from a reference frame centered with one of the asteroids or with their center of mass, so as to deal with an effective two-particle system, given by the other asteroid and the planet. We shall refer as “asteroidal ellipse” the instantaneous ellipse of this asteroid, focused on the center of the reference frame. We are interested to its motions. To simplify the analysis a little bit, we introduce three assumptions. The main one is based on the belief that, as long as the difference between the timescales persists, the system is “well represented” by a certain “averaged” problem, which we call secular problem. We remark that such average is meant with respect to the proper time of the asteroid, so it should not be confused with the homonymous procedure often studied in the literature (e.g., Féjoz and Guardia 2016), where the Keplerian approximation is used for two particles about their common sun, and the average is done with respect to both their mean anomalies.
The secular system is simpler than the original problem, as we lose information concerning the position of the asteroid. In particular, collisions between the lighter particles are not observable. The degrees of freedom of the system are the motions of the eccentricity (or of the angular momentum) and of the pericenter direction of the asteroidal ellipse and the motions of the massive body. We associate with such system a certain “limiting system” which is similarly defined, but with the massive body being firm. From now on, we refer to such limiting problem as “unperturbed” and to the full secular problem as “perturbed.” The terminology is here used with abuse, as we do not assume that the massive body has slow velocity in the full problem. For the unperturbed system, only movements of the asteroidal ellipse occur. By Pinzari (2019), the unperturbed problem turns out to be integrable, in the sense that it possesses a complete family of independent and commuting first integrals. More importantly, it reveals a surprising property, which we named renormalizable integrability. Such property (recalled for completeness in Sect. 2.2) offers a remarkable shortcut to the knowledge of movements of the asteroidal ellipse. By Pinzari (2020), in the case that the interacting particles are constrained on a plane (this is actually our second assumption), and \(\varepsilon <\frac{1}{2}\), there are two stable equilibria such that the pericenter direction of the asteroidal ellipse in the unperturbed problem affords small oscillations about them, while its angular momentum oscillates about zero, affording a periodic change of sign. Physically, this means that the asteroidal ellipse is highly eccentric at any time and, moreover, there are two times (“squeezing times”) in a period of oscillation of the pericenter when the eccentricity is equal to one. Namely, at those times, the ellipse becomes a segment. After the squeezing times, the eccentricity of the ellipse decreases while the sense of the motion is reversed. We call “perihelion librations” such kind of motions. The question which here we address is whether perihelion librations do persist in the full perturbed problem, when the massive body moves. We are able to give a positive answer to this question under our third assumption, which consists in taking the total angular momentum of the system (which is preserved during the motion) equal to zero. Under this assumption, the symmetries of the Hamiltonian ensure that the equilibria persist also in the perturbed problem, even though the integrability is lost. On the other side, such assumption is a source of difficulty, as the angular momenta of the two particles will simultaneously vanish, and hence collisions of the heavy body with the center of the system are to be controlled. We shall formulate our result below, after we have introduced some mathematical tool.
In terms of Jacobi coordinates,Footnote 2 the three-body problem Hamiltonian with masses \(m_0\), \(\mu m_0\), \(\kappa m_0\) is the translation-free function
Here, \(({{\mathbf {y}}}^{\prime }, {{\mathbf {y}}}, {{\mathbf {x}}}^{\prime }, {{\mathbf {x}}})\in ({{\mathbb {R}}}^3)^4\) (or \(({{\mathbb {R}}}^2)^4\)), \(\Vert \cdot \Vert \) denotes Euclidean distance and the gravity constant has been taken equal to one, by a proper choice of the units system. We rescale impulses and positions
multiply the Hamiltonian by \(\frac{1+\mu }{\mu }\) (by a rescaling of time) and obtain
with
Likewise, one might consider the problem written in the so-called \(m_0\)-centricFootnote 3 coordinates, and in this case the Hamiltonian is
We apply an analogue rescaling, but with
We arrive at
We remark that in the case, of our interest, that \(\kappa \gg \mu \sim 1\), the above definition of Jacobi coordinates differs substantially from the usual one, because the barycentric reduction begins with one of the two lighter masses, rather than with the heavier one. A similar observation holds for the \(m_0\)-centric reduction, which here is not centered on the most massive body, contrarily to the usual convention. We look at the Hamiltonians \({\mathrm{H}}_i\) in (2) and (5). The assumptions mentioned above are:
- \((A_1)\):
If \(\ell \) is the mean anomaly associated with the Keplerian motions of the term
$$\begin{aligned} \frac{\Vert {\mathbf {y}}\Vert ^2}{2m_0}-\frac{m_0^2}{\Vert {\mathbf {x}}\Vert }=-\frac{m_0^5}{2{\Lambda }^2}, \end{aligned}$$(6)we replace the Hamiltonians (2) and (5) with their respective \(\ell \)-averages
$$\begin{aligned} \overline{\mathrm{H}}_i=-\frac{m_0^5}{2{\Lambda }^2}+\gamma {\widehat{{\mathrm{H}}}}_i; \end{aligned}$$(7)where \( \widehat{\mathrm{H}}_i\) are the \(\ell \)-averages of the terms inside parentheses in (2), (5).
- \((A_2)\):
The coordinates \({{\mathbf {x}}}\), \({{\mathbf {x}}}^{\prime }\) and the impulses \({{\mathbf {y}}}\), \({{\mathbf {y}}}^{\prime }\) are constrained on the plane \({{\mathbb {R}}}^2\);
- \((A_3)\):
The total angular momentum \({\mathbf {C}}={{\mathbf {x}}}^{\prime }\times {{\mathbf {y}}}^{\prime }+{{\mathbf {x}}}\times {{\mathbf {y}}}\) of the system vanishes.
As mentioned above, the main assumption is \((A_1)\). It allows us to exploit facts highlighted in Pinzari (2019, 2020), as now we describe.
Since the \(\overline{\mathrm{H}}_i\)’s in (7) are \(\ell \)-independent, \(\Lambda \) is a first integral; hence, the term \(-\frac{m_0^5}{2{\Lambda }^2}\) may be neglected. After a further rescaling of time \(t\rightarrow \gamma t\), we are led to look at the Hamiltonians \({\widehat{{\mathrm{H}}}}_i\) in (7), which are given by
where
is the \(\ell \)-average of the Newtonian potential. Remark that \({{\mathbf {y}}}(\ell )\) has vanishing \(\ell \)-average,Footnote 4 so that the last term in (5) does not survive. In the case of the planar problem, after the reduction in rotation invariance, the Hamiltonians \({\widehat{{\mathrm{H}}}}_i\) have two degrees of freedom. We use the following canonical coordinates:
where \({{\mathbf {P}}}\) is the perihelion of (6), and the direction \({{\mathbf {a}}}\), orthogonal to \({{\mathbf {x}}}\times {{\mathbf {y}}}\), will be specified later. Note the coordinates above are fit to describe motions of Keplerian elements for \(({{\mathbf {y}}}, {{\mathbf {x}}})\), but not of \(({{\mathbf {y}}}^{\prime }, {{\mathbf {x}}}^{\prime })\). If \({{\mathbf {y}}}^{\prime }\) is set to zero, the \({\widehat{{\mathrm{H}}}}_i\)’s reduce to sums of averaged Newtonian potentials, which are integrable, as do not depend on \({\mathrm{R}}\). The function \({\mathrm{U}}_\beta |_{\beta =1}\) has been thoroughly studied in Pinzari (2019). Its phase portrait in the plane \(({\mathrm{g}}, {\mathrm{G}})\), while the ratio \(\varepsilon =a/{\mathrm{r}}\) (where \(a={\Lambda }^2/m_0^3\) is the semimajor axis associated with (6)) varies, is as follows.
- (i)
Case \(0<\varepsilon <\frac{1}{2}\). There exist two centers, (0, 0) and \((0, \pi )\), surrounded by librational motions (Fig. 1).
- (ii)
Case \(\frac{1}{2}<\varepsilon <1\). The equilibrium (0, 0) becomes a saddle, with its own separatrix (the light blue curve), while \((0, \pi )\) is still stable. Two more equilibria appear on the \({\mathrm{G}}\)-axis (Fig. 2).
- (iii)
Case \(\varepsilon >1\). The equilibria on the \(\mathrm{G}\)-axis and the saddle persist. There is the birth of rotational motions (Fig. 3).
The purpose of this paper is to prove that motions of the kind (i) do persist in \({\widehat{{\mathrm{H}}}}_i\), when \({{\mathbf {y}}}^{\prime }\ne 0\). Note that we shall not require \(\Vert {{\mathbf {y}}}^{\prime }\Vert \) small.
To state the result, we introduce the following quantities, which will be used as mass parameters, at the place of \(\mu \) and \(\kappa \):
where \(\beta \) and \({\overline{\beta }}\) are as in (3), (4), respectively. Observe that \(\beta _*< \beta ^*\) and the case, of our interest, \(\kappa \gg \mu \sim 1\) corresponds to \(\beta _*\sim \beta ^*/2\gg 1\).
We shall prove the following result.
Theorem 1.1
(Perihelion librations about (0, 0)) Fix an arbitrary neighborhood \({\mathrm{U}}_0\) of (0, 0) and an arbitrary neighborhood \({\mathrm{V}}_0\) of an unperturbed curve \(\gamma _0(t)=({\mathrm{G}}_0(t), {\mathrm{g}}_0(t))\in {\mathrm{U}}_0\) in Fig. 1. Then, it is possible to find six numbers \(0<c<1\), \(0<\beta _-<\beta _+\), \(0<\alpha _-<\alpha _+\)\(T>0\), such that, for any \(\beta _-<\beta _*\le \beta ^*<\beta _+\) the projections \(\Gamma _0(t)=({\mathrm{G}}(t), {\mathrm{g}}(t))\) of all the orbits \(\Gamma (t)=({\mathrm{R}}(t), {\mathrm{G}}(t), {\mathrm{r}}(t), {\mathrm{g}}(t))\) of \(\overline{\mathrm{H}}_1\), \(\overline{\mathrm{H}}_2\) with initial datum \((\mathrm{R}_0, {\mathrm{r}}_0, {\mathrm{G}}_0, {\mathrm{g}}_0)\in [\frac{1}{\sqrt{c\alpha _+}}, \frac{1}{\sqrt{c\alpha _-}}]\times [c\alpha _-, \alpha _+]\times {\mathrm{U}}_0\) belong to \({\mathrm{V}}_0\) for all \(0\le t\le T\). Moreover, the angle \(\gamma (t)\) between the position ray of \(\Gamma _0(t)\) and the \({\mathrm{g}}\)-axis affords a variation larger than \(2\pi \) during the time T.
A similar statement concerning perihelion librations about \((0,\pi )\) holds true. The statement of Theorem 1.1 deserves two remarks. The former regards the motions involved, which are quasi-rectilinear, hence, close to be collisional. Generally speaking, for a three-body system composed of two asteroids and one planet, three kinds of collisions are possible: (1) collisions between the two asteroids; (2) collision between one of the asteroids and the planet; and (3) triple collision. The system under investigation is, as stressed above, an averaged problem derived from the full above problem. For this, averaged problem collisions of kind (1) or (3) do not exist, as the position of the asteroids is treated only in averaged meaning. Collisions of the kind (2) may exist, but they are to be intended as collisions between the planet and the average ellipse, rather than with a single particle on this ellipse. They are prevented by the assumption that the orbit of the planet is sufficiently far from the orbit of the asteroids, namely, with a careful choice of the domain of the coordinates. Under this assumption, the averaged Hamiltonians \({\widehat{{\mathrm{H}}}}_i\) keep finite. Incidentally, their regularity is studied in Proposition 3.3. During the proof of Theorem 1.1, in Sect. A, the trajectory of the massive planet is controlled to keep outside the trajectory of the asteroid for all the time of a perihelion libration.
The second remark concerns the thesis of Theorem 1.1. It holds in an open subset of phase space. In a sense, it recalls the statement of Nekhoroshev’s theorem (Nehorošev 1977). However, differently from it, Theorem 1.1 is not an application of perturbation theory, nor it uses trapping arguments. The reason is the following. In Sect. 4, we shall see that the manifolds
are in fact invariant for \({\widehat{{\mathrm{H}}}}_i\). On such invariant manifold, by \(A_3\), we have
so
Moreover, the functions \({\mathrm{U}}_\beta \), \(\mathrm{U}_{-{\overline{\beta }}}\), \({\mathrm{U}}_{\beta +{\overline{\beta }}}\) in (8) are asymptotic (as \(\varepsilon \rightarrow 0\)) to \( \frac{1}{{\mathrm{r}}}\). Hence, the motion of the coordinates \(({\mathrm{R}}, {\mathrm{r}})\) on \({{\mathcal {M}}}_0\) and \({{\mathcal {M}}}_\pi \) is ruled by an Hamiltonian asymptotic to
This Hamiltonian generates unbounded (hence, non-quasiperiodic) motions, for both positive and negative energies: For positive energies, both \({\mathrm{R}}\) and \({\mathrm{r}}\) are unbounded; for negative energies, only \({\mathrm{R}}\) is so. In any case, these motions are not quasiperiodic and hence we cannot apply the machinery of perturbation theory. In Sect. , we develop a theory suited to the case (see Fortunati and Wiggins 2016 for a result of the same kind). In this theory, no small denominators will arise, which is the reason why no trapping argument is needed.
Before switching to full statements and proofs, we quote three open questions arising from the present setting.
- \(({\mathrm{Q}}_1)\):
Let us consider the cases \(\frac{1}{2}<\varepsilon <1\) or \(\varepsilon >1\) (Figs. 2, 3, respectively). Does the separatrix split so as to produce chaotic dynamics in the partially averaged planar problem?
- \(({\mathrm{Q}}_2)\):
Again in the cases above, let us consider the full three-body problem. It has three degrees of freedom. Does the separatrix split so as to produce Arnold instability (Arnol’d 1964; Delshams et al. 2019)?
- \(({\mathrm{Q}}_3)\):
What is the scenario in the case of the spatial problem?
This paper is organized as follows. In Sect. 2, we provide a review of the main results of Pinzari (2019, 2020). In particular, we recall the mathematical formulation of the mentioned renormalizable integrability and we carry from Pinzari (2020) a set of action-angle like coordinates suited to our needs. In Sect. , we refine the analysis of Pinzari (2019) to the case of the planar secular problem, in the region of phase space where \(0<\varepsilon <\frac{1}{2}\). In this case, we are able to obtain simpler formulae compared to Pinzari (2019) and hence to study the regularity region of \({\widehat{{\mathrm{H}}}}_i\) completely. In Sect. 4, we state a normal form theory without small divisors (Theorem 4.1), suited for aperiodic systems. The proof of Theorem 4.1 is deferred to Appendix A. In Sect. 5, we provide the proof of Theorem 1.1, as well as of a more precise version of it (Theorem 5.1), as an application of Theorem 4.1.
2 Review of the Results of Pinzari (2019, 2020)
2.1 \({{\mathcal {K}}}\) Coordinates
We describe canonical coordinates suited to our problem.
We fix an arbitrary orthonormal frame
in \({{\mathbb {R}}}^3\), that we call inertial frame.
For given \(m_0>0\), fix a region of phase space (i.e., a set of values of \(({{\mathbf {y}}}^{\prime }, {{\mathbf {y}}},{{\mathbf {x}}}^{\prime }, {{\mathbf {x}}})\)) where the Kepler Hamiltonian (6) takes negative values. Consider the motion generated by (6) with initial datum \(({{\mathbf {y}}}, {{\mathbf {x}}})\), and denote:
a the semimajor axis;
\({{\mathbf {P}}}\), with \(\Vert {{\mathbf {P}}}\Vert =1\), the direction of perihelion, assuming the ellipse is not a circle;
\(\ell \): the mean anomaly, defined, mod \(2\pi \), as the area of the elliptic sector spanned by \({{\mathbf {x}}}\) from \({{\mathbf {P}}}\), normalized to \(2\pi \).
Denote also:
where “\(\times \)” denotes skew-product in \({{{\mathbb {R}}}}^3\). Observe the following relations:
Let
and assume
Given three vectors \({{\mathbf {i}}}\), \({{\mathbf {i}}}^{\prime }\) and \({{\mathbf {k}}}\), with \({{\mathbf {i}}}\), \({{\mathbf {i}}}^{\prime }\perp {{\mathbf {k}}}\), we denote as \(\alpha _{{\mathbf {k}}}({{\mathbf {i}}}, {{\mathbf {i}}}^{\prime })\) the oriented angle from \({{\mathbf {i}}}\) to \({{\mathbf {i}}}^{\prime }\) relatively to the positive orientation established by \({{\mathbf {k}}}\).
We define the coordinates
as
The canonical character of \({{{\mathcal {K}}}}\) has been discussed in Pinzari (2019), based on Pinzari (2013).
Using the formulae in the previous section, we provide the expressions of the following functions:
(where \({{\mathbf {x}}}_{{\mathcal {K}}}:={{\mathbf {x}}}\circ {{\mathcal {K}}}\), etc.) which will be mentioned in the next section. They are:
where \(a=a(\Lambda )\), the semimajor axis; \({\mathrm{e}}=\mathrm{e}(\Lambda , {\mathrm{G}})\), the eccentricity of the ellipse; \(\varrho =\varrho (\Lambda , {\mathrm{G}}, \ell )\); \({\mathrm{p}}={\mathrm{p}}( \Lambda , {\mathrm{G}}, \ell , {\mathrm{g}})\) are defined as
with \(\xi =\xi (\Lambda , {\mathrm{G}}, \ell )\) the eccentric anomaly, defined as the solution of Kepler equation
These formulae have been discussed in Pinzari (2020).
2.2 Renormalizable Integrability
We recall some results concerning the functions \({\mathrm{U}}\) and \(\mathrm{E}\) in (16). We refer to Pinzari (2019) for full details.
Definition 2.1
(Pinzari 2019, Definition 1) Let f, g be two functions of the form
where
with \( U\subset {{\mathbb {R}}}^2\), \({{\mathcal {B}}}\subset {{\mathbb {R}}}^{2n}\) open and connected, \((p,q)=\)\((p_1\), \(\dots \), \(p_n\), \(q_1\), \(\ldots \), \(q_n)\) conjugate coordinates with respect to the two-form \({\omega }=dy\wedge \hbox {d}x+\sum _{i=1}^{n}\hbox {d}p_i\wedge \hbox {d}q_i\) and \({\mathrm{I}}(p,q)=({\mathrm{I}}_1(p,q), \dots , {\mathrm{I}}_n(p,q))\), with
pairwise Poisson commuting:
We say that f is renormalizably integrable bygvia\({\widetilde{f}}\) (or renormalizably integrable byg, or simply renormalizably integrable), if there exists a function
such that
for all \((p, q, y, x)\in {{\mathcal {D}}}\).
Proposition 2.1
(Pinzari 2019, Proposition 4) If f is renormalizably integrable by g, then:
- (i)
\({\mathrm{I}}_1\), \(\ldots \), \({\mathrm{I}}_n\) are first integrals to f and g;
- (ii)
f and g Poisson commute.
Observe that, if f is renormalizably integrable via g, then, generically, their respective time laws for the coordinates (y, x) are the same, up to rescaling the time. Formally:
Proposition 2.2
(Pinzari 2019, Proposition 5) Let f be renormalizably integrable via g. Fix a value \(\mathrm{I}_0\) for the integrals \({\mathrm{I}}\) and look at the motion of (y, x) under f and g, on the manifold \({\mathrm{I}}={\mathrm{I}}_0\). For any fixed initial datum \((y_0, x_0)\), let \(g_0:=g({\mathrm{I}}_0, y_0, x_0)\). If \({\omega }({\mathrm{I}}_0, g_0):=\partial _{g}{\tilde{f}}({\mathrm{I}}, g)|_{(\mathrm{I}_0, g_0)}\ne 0\), the motion \((y^f(t), x^f(t))\) with initial datum\((y_0, x_0)\) under f is related to the corresponding motion \((y^g(t), x^g(t))\) under g via
In particular, under this condition, all the fixed points of g in the plane (y, x) are fixed points to f. Values of \(({\mathrm{I}}_0, g_0)\) for which \({\omega }({\mathrm{I}}_0, g_0)= 0\) provide, in the plane (y, x), curves of fixed points for f (which are not necessarily curves of fixed points to g).
We observe that \({\mathrm{U}}\) and \({\mathrm{E}}\) have the form in (19), with \({\mathrm{I}}=({\mathrm{I}}_1, {\mathrm{I}}_2, {\mathrm{I}}_3)=({\mathrm{r}}, \Lambda , \Theta )\) verifying (21) and \((y, x)=({\mathrm{G}}, {\mathrm{g}})\).
Proposition 2.3
(Pinzari 2019, Proposition 6) \({\mathrm{U}}\) is renormalizably integrable via \({\mathrm{E}}\). Namely, there exists a function \({\mathrm{F}}\) such that
The phase portrait of \({\mathrm{E}}\) in the planar case is shown in Figs. 1, 2 and 3, accordingly to the values of \(\varepsilon \).
2.3 Asymptotic Action-Angle Coordinates
In this section, we focus on the planar case, i.e., when \({\mathbf {y}}^{\prime }\), \({\mathbf {y}}\), \({\mathbf {x}}^{\prime }\), \({\mathbf {x}}\in {{\mathbb {R}}}^2\). In that case, the following 8-dimensional diffeomorphism replaces \({{\mathcal {K}}}\) in (15):
\({{\mathcal {K}}}_0\) may be regarded as the natural limit of \({{\mathcal {K}}}\), once \(\Theta \) and \(\vartheta \) are fixed to (0, 0), (0, \(\pi \)) (which are the values they take in the planar case), respectively, and \(({\mathrm{Z}}, {\mathrm{z}})\) are neglected. The functions \({\mathrm{U}}\) and \({\mathrm{E}}\) in (16) become
where \(a(\Lambda )\), \(\varrho ( \Lambda , {\mathrm{G}}, \ell )\), \(\mathrm{p}( \Lambda , {\mathrm{G}}, \ell , {\mathrm{g}})\) are given in (17).
Unfortunately, the action-angle coordinates associated with \({\mathrm{E}}\) are not explicit, since they are defined via inversion of elliptic integrals. However, it is possible to define, explicitly, action-angle coordinates associated with the leading part of \(\mathrm{E}\) in the case of large \({\mathrm{r}}\):
As discussed in Pinzari (2020), these coordinates, denoted asFootnote 5\(({{\mathcal {G}}}, \gamma )\), are defined via the canonicalFootnote 6 change
for any fixed value of \(\Lambda \). Observe that positive values of \({{\mathcal {G}}}\) (hence, \(k=0\)) provide coordinates with the image \(({\mathrm{G}}, {\mathrm{g}})\) in a neighborhood of (0, 0); negative values (\(k=1\)) are for \(({\mathrm{G}}, {\mathrm{g}})\) in a neighborhood of \((0,\pi )\). Using these “approximate” coordinates, one obtains the expression of \({\mathrm{E}}\) as a close-to-be-integrable system for large \(\mathrm{r}\):
The coordinates \(({{\mathcal {G}}}, \gamma )\) will be used in Sect. 5.
3 A Deeper Look at the Planar Case
In the planar case, the relation (23) becomes very special.
Instead of \({\mathrm{U}}\) and \({\mathrm{E}}\), it is convenient to switch to the functions
which are related to the previous ones via
with
if \(a=a(\Lambda )\) is as in (15). We rewrite relation (23) as
Here, we have used that \({\widehat{{\mathrm{F}}}}_\varepsilon \) does not depend explicitly on \(\Lambda \), since both \({\mathrm{U}}\) and \({\mathrm{E}}\) depend on \(\Lambda \) only via \(\frac{{\mathrm{G}}}{\Lambda }\). We claim that
Proposition 3.1
In the planar problem, if \(|\varepsilon | < \frac{1}{2}\) and \(|\widehat{E}_\varepsilon |\le 1\), (30) holds with
To prove Proposition 3.1, we need to recall the following result from Pinzari (2019).
Proposition 3.2
(Pinzari 2019, Theorem 4 and Remark 3) Let \(f({\mathrm{P}}, y, x)\) and \(g({\mathrm{P}}, y, x)\) Poisson commute. Assume that, for any fixed \({\mathrm{P}}\), the level sets \(\{(y, x):\ g({\mathrm{P}}, y, x)={{\mathcal {G}}}\}\) are union of graphs
Then, f renormalizably integrable by g through \({\widetilde{f}}\), and \({\widetilde{f}}\) can be chosen to be
for some fixed \(x_i\), \(y_j\).
Proof of Proposition 3.1
We apply Proposition 3.2 to the functions \({\widehat{{\mathrm{U}}}}_{\varepsilon (\Lambda , {\mathrm{r}})}(\Lambda , {\mathrm{G}}, {\mathrm{g}})\), \({\widehat{{\mathrm{E}}}}_{\varepsilon (\Lambda , {\mathrm{r}})}(\Lambda , {\mathrm{G}}, {\mathrm{g}})\) in (29). Such two functions do commute since \(\mathrm{U}(\Lambda , {\mathrm{G}}, {\mathrm{g}}, {\mathrm{r}})\) and \({\mathrm{E}}(\Lambda , \mathrm{G}, {\mathrm{g}}, {\mathrm{r}})\) do, and they both commute with \({\mathrm{r}}\). Moreover, the level sets \(\{(\Lambda , {\mathrm{G}}, {\mathrm{g}}):\ {\widehat{{\mathrm{E}}}}_{\varepsilon (\Lambda , {\mathrm{r}})}(\Lambda , {\mathrm{G}}, {\mathrm{g}})=t\}\) are graphs
We use the formula in (32) with \({\mathrm{P}}=(\Lambda , {\mathrm{r}})\), \(g_{2, j}=g^\pm _j({\mathrm{r}}, \Lambda , {\mathrm{G}}, t)\), \(f={\mathrm{U}}\) and \(y_j={\mathrm{G}}_j=0\) for all j. When \({\mathrm{G}}=0\), the functions \(\mathrm{g}^\pm _j\) take the value
which is well defined as \(|t|=|\widehat{E}_ \varepsilon |\le 1\). Then, by (32),
\(\square \)
A first consequence of the formula in (31) is underlined in the following.
Remark 3.1
Equation (31) can also be used to provide an expansion of \({\widehat{{\mathrm{U}}}}_\varepsilon \) about the equilibria (0, 0) and \((0, \pi )\). Indeed, \({\widehat{{\mathrm{E}}}}_{\varepsilon }(\Lambda , {\mathrm{G}}, \mathrm{g})\) takes the value \(+1\) at \((\Lambda , {\mathrm{G}}, {\mathrm{g}})=(0, 0)\), and the value \(-1\) at \((\Lambda , {\mathrm{G}}, {\mathrm{g}})=(0, \pi )\). Therefore, an expansion of \({\widehat{{\mathrm{F}}}}_{\varepsilon }(t)\) about \(\pm 1\) provides, via (30), an expansion \({\widehat{{\mathrm{U}}}}_{\varepsilon }\) about the corresponding equilibrium. On the other hand, the value of \({\widehat{{\mathrm{F}}}}_{\varepsilon }(t)\) and of its derivatives at \(t=+ 1\) or \(t=- 1\) can be explicitly computed, using the residue theorem. For example, for \(0<\varepsilon <\frac{1}{2}\),
Another consequence of (31) is:
Proposition 3.3
Let \(|\varepsilon |<\frac{1}{2}\). The complex function \((\varepsilon , t)\rightarrow {\widehat{{\mathrm{F}}}}_{\varepsilon }(t)\) loses its holomorphy if and only if
Proof
We equivalently write
Then, we change variable, letting \(x=1-\cos \xi \). The integral becomes
The only possibility it diverges is that there are two coinciding roots of the denominator on the real interval [0, 2]. The polynomial under the second square root has not coinciding roots on [0, 2] for \(|\varepsilon |<\frac{1}{2}\) and never has the root \(x=0\). The only possibility is that it has the root \(x=2\). This happens when \(t=\varepsilon +\frac{1}{4\varepsilon }\). \(\square \)
Remark 3.2
The formula in (31) (and its consequences below) is pretty specific for the planar case. In Pinzari (2019, Equation 49), we proposed a general formula (holding for the planar and the spatial case), which, unfortunately, does not seem equally exploitable.
4 Set Out and Analytic Tools
For definiteness, we refer to perihelion librations about (0, 0), the case \((0, \pi )\) being specular.
Using the identity in (30), the assumption \(A_3\) in the introduction, which gives
and the relation
we rewrite the Hamiltonians (8) as
having neglected to write (as well as we shall do below) the dependence on \(\Lambda \).
We suddenly remark that \({\widehat{{\mathrm{H}}}}_1\) and \({\widehat{{\mathrm{H}}}}_2\) are both even with respect to \({\mathrm{G}}\) and \({\mathrm{g}}\) separately, because so are the functions \({\widehat{{\mathrm{E}}}}_\varepsilon ({\mathrm{G}}, {\mathrm{g}})\) and theFootnote 7 term \(\frac{{\mathrm{G}}^2}{2m_0\mathrm{r}^2}\). Then, the manifolds
which, by the discussions of Sect. 2, are invariant for \({\widehat{{\mathrm{E}}}}_\varepsilon ({\mathrm{G}}, {\mathrm{g}})\), keep to be so also for \({\widehat{{\mathrm{H}}}}_i\). We focus on \({{\mathcal {M}}}_0\). On \({{\mathcal {M}}}_0\), the motions of the coordinates \(({\mathrm{R}}, \mathrm{r})\) are governed by the Hamiltonians
where (using (33) and dehomogenizating)
The “potentials” \({\mathrm{V}}_1\) and \({\mathrm{V}}_2\) are well defined and increasing from \(-\infty \) to 0 for \({\mathrm{r}}>2\beta a\), \(r>2(\beta +\overline{\beta })a\), respectively, so action-angle coordinates do not exist. In other words, closely to \({{\mathcal {M}}}_0\), \({\widehat{{\mathrm{H}}}}_i\) has not close to an integrable system in the sense of Liouville–Arnold (Arnold (1963a)) and hence standard perturbation theory does not apply. In the next section, we develop an analytic theory suited to this case. It will be used to “decouple” the Hamiltonians.
4.1 A Normal Form Theory Without Quasiperiodic Unperturbed Motions
In this section, we describe a procedure for eliminating the anglesFootnote 8\({\varvec{\varphi }}\) at high orders, given Hamiltonian of the form
which we assume to be holomorphic on the neighborhood
and
Here, \({{\mathbb {I}}}\subset {{\mathbb {R}}}^n\), \({{\mathbb {B}}}\subset {{\mathbb {R}}}^{2m}\), \({{\mathbb {Y}}}\subset {{\mathbb {R}}}\), \({{\mathbb {X}}}\subset {{\mathbb {R}}}\) are open and connected; \({\mathbb {T}}={\mathbb {R}}/(2\pi {\mathbb {Z}})\) is the standard torus.
We denote as \({{\mathcal {O}}}_{\rho , s, \delta , r, \xi }\) the set of complex holomorphic functions
for some \({\hat{\rho }}>\rho \), \({\hat{s}}>s\), \({\hat{\delta }}>\delta \), \({\hat{r}}>r\), \({\hat{\xi }}>\xi \), equipped with the norm
where \(\phi _{khj}({{\mathbf {I}}}, y, x)\) are the coefficients of the Taylor–Fourier expansionFootnote 9
If \(\phi \) is independent of x, we simply write \(\Vert \phi \Vert _{\rho , r}\) for \(\Vert \phi \Vert _{\rho , r,\xi }\).
If \(\phi \in {{\mathcal {O}}}_{\rho , s, \delta , r, \xi }\), we define its “off-average” \({\widetilde{\phi }}\) and “average” \({\overline{\phi }}\) as
with
We decompose
where \({{\mathcal {Z}}}_{\rho , s, \delta , r, \xi }\), \({{\mathcal {N}}}_{\rho , s, \delta , r, \xi }\) are the “zero-average” and the “normal” classes
respectively. We finally let \(\omega _{y,{{\mathbf {I}}},{{\mathbf {J}}}}:=\partial _{y,{{\mathbf {I}}},{{\mathbf {J}}}} {\mathrm{h}}\).
We shall prove the following result. Its peculiarity is that it does not need any non-resonance condition on the frequencies \(\omega _{{\mathbf {I}}}\), which, as a matter of fact, might also be zero.
Theorem 4.1
For any n, m, there exists a number \({\mathrm{c}}_{n,m}\ge 1\) such that, for any \(N\in {{\mathbb {N}}}\) such that, the following inequalities are satisfied:
with \({\mathrm{d}}:=\min \big \{\rho s, r\xi , {\delta }^2\big \}\), \({{\mathcal {X}}}:=\sup \big \{|x|:\ x\in {{\mathbb {X}}}_\xi \big \}\), one can find an operator
which carries \({\mathrm{H}}\) to
where \(g_*\in {{\mathcal {N}}}_{1/3 (\rho , s, \delta , r, \xi )}\), \(f_*\in {{\mathcal {O}}}_{1/3 (\rho , s, \delta , r, \xi )}\) and, moreover, the following inequalities hold:
The transformation \(\Psi _*\) can be obtained as a composition of time-one Hamiltonian flows and satisfies the following. If
the following uniform bounds hold:
Remark 4.1
(Extensions)
- (i)
There is an obvious extension to the case that \({{\mathbb {I}}}_\rho \), \({{\mathbb {T}}}^n_s\) are replaced with \(({{\mathbb {I}}}_1)_{\rho _1}\times \cdots \times ({{\mathbb {I}}}_n)_{\rho _n}\), \({{\mathbb {T}}}_{s_1}\times \cdots \times {{\mathbb {T}}}_{s_n}\). In this case, the number s in the former equation in (44) is to be replaced with \(\min _i\{s_i\}\). Moreover, the product \(\rho \, s\) in the definition of \({\mathrm{d}}\) is to be replaced with \(\min _i\{\rho _i\,s_i\}\). Finally, the bound in (47) is to be changed taking into account the different sizes.
- (ii)
If f does not depend on some angle \(\varphi _1\), \(\dots \), \(\varphi _p\), the vector \(\omega _{{\mathbf {I}}}\) in (44) is to be replaced with \({\widehat{\omega }}_{{\mathbf {I}}}:=(\omega _{{{\mathbf {I}}}_{p+1}},\dots , \omega _{{{\mathbf {I}}}_{n}})\).
4.2 Outline of the Proof
The complete proof of Theorem 4.1 is provided in Appendix A, but here we aim to spend some word, so as to highlight the main ideas.
We proceed by recursion. We assume that, at a certain step, we have a system of the form
where \(f\in {{\mathcal {O}}}_{\rho , s, \delta , r, \xi }\), \(g\in {{\mathcal {N}}}_{\rho , s, \delta , r, \xi }\). At the first step, just take \(g\equiv 0\).
After splitting f on its Taylor–Fourier basis
one looks for a time-one map
generated by a small Hamiltonian \(\phi \) which will be taken in the class \({{\mathcal {Z}}}_{\rho , s, \delta , r, \xi }\) in (42). Here,
denotes the Poisson parentheses of \(\phi \) and f. One lets
The operation
acts diagonally on the monomials in the expansion (49), carrying
Therefore, one defines
The formal application of \(\Phi =e^{{{\mathcal {L}}}_\phi }\) yields:
where the \(\Phi _h\)’s are the tails of \(e^{{{\mathcal {L}}}_\phi }\), defined in Appendix A.
Next, one requires that the residual term \(-D_\omega \phi +f\) lies in the class \({{\mathcal {N}}}_{\rho , s, \delta , r, \xi }\) in (43)
for \(\phi \).
Since we have chosen \(\phi \in {{\mathcal {Z}}}_{\rho , s, \delta , r, \xi }\), by (50), we have that also \(D_\omega \phi \in {{\mathcal {Z}}}_{\rho , s, \delta , r, \xi }\). So, Eq. (52) becomes
In terms of the Taylor–Fourier modes, the equation becomes
In the standard situation, one typically proceeds to solve such equation via Fourier series:
so as to find \(\displaystyle \phi _{khj\ell }=\frac{f_{khj\ell }}{\mu _{khj\ell }}\) with the usual denominators \(\mu _{khj\ell }:=\lambda _{khj}+{\mathrm{i}}\ell \omega _y\) which one requires not to vanish via, e.g., a “diophantine inequality” to be held for all \((k,h,j,\ell )\) with \((k,h-j)\ne (0,0)\). In this standard case, there is not much freedom in the choice of \(\phi \). In fact, such solution is determined up to solutions of the homogenous equation
which, in view of the Diophantine condition, has the only trivial solution \(\phi _0\equiv 0\). The situation is different iffis not periodic inx, or\(\phi \)is not needed so. In such a case, it is possible to find a solution of (53), corresponding to a non-trivial solution of (54), where small divisors do not appear. This is
Multiplying by \(e^{ik\varphi }\) and summing over k, h and j, we obtain
In Appendix A, we shall prove that, under the assumptions (44), this function can be used to obtain a convergent time-one map and that the construction can be iterated so as to provide the proof of Theorem 4.1. The construction of the iterations and the proof of its convergence are obtained adapting the techniques of Pöschel (1993) to the present case.
5 Proof of Theorem 1.1
The purpose of this section is state and prove a more precise version of Theorem 1.1 (Theorem 5.1). We shall obtain it as an application of Theorem 4.1 to the Hamiltonians \({\widehat{{\mathrm{H}}}}_i\). Therefore, we need to introduce a change in new coordinates \({{\mathcal {C}}}\) which put the \({\widehat{{\mathrm{H}}}}_i\)’s in the suited form (41). Since the potentials \({\mathrm{V}}_i\) in (40) are, for large \({\mathrm{r}}\), asymptotic to \(-\frac{m_0^2}{{\mathrm{r}}}\), it is convenient to rewrite the functions \({\widehat{{\mathrm{H}}}}_i\) in (37) as
and take \({{\mathcal {C}}}\) as the composition of two independent and canonical changes
where \({{\mathcal {C}}}_1\) is defined as in (26), with \(k=0\), while \({{\mathcal {C}}}_2\) is defined via the formulae
where \(\xi ^{\prime }(x)\) solves
\({{\mathcal {C}}}_2\) has been chosen so that
Using the new coordinates, we have
having abusively denoted as \(\varepsilon (y, x)\) the function \(\varepsilon ({\mathrm{r}}(y, x))\), and using the formulae in (27)–(29)
We now determine a domain of holomorphy of \({\widehat{{\mathrm{H}}}}_i\). Recall that we use, as mass parameters, the numbers \(\beta _*\), \(\beta ^*\) in (10). For the coordinates (y, x), we choose the complex domains \({{\mathbb {Y}}}_{\sqrt{m_0^3\alpha _-}}\), \({{\mathbb {X}}}_{\sqrt{\varepsilon _0}}\), where
with \(0<\varepsilon _0<1\), \(0<\alpha _-<\alpha _+{/4}\) verifying
with \(\beta ^*\) as in (10) and \(c_0\) being such that for any \(0<\varepsilon _0<1\) and for any \(x\in {{\mathcal {X}}}_{\sqrt{\varepsilon _0}}\), Eq. (58) has a unique solution \(\xi ^{\prime }(x)\) which depends analytically on x and verifies
(The existence of such a number \(c_0\) is well known.) For the coordinates \({{\mathcal {G}}}\), \(\gamma \), we choose the domains \( {{\mathbb {G}}}_\delta \), \({{\mathbb {T}}}_{s_0}\), with \(0<\delta <\Lambda \), \(s_0\) fixed, and \({{\mathbb {G}}}:=\Big \{{{\mathcal {G}}}\in {{\mathbb {R}}}:\ \Lambda -\delta<{{\mathcal {G}}}<\Lambda \Big \}\). Remark that, since, in \({\widehat{{\mathrm{H}}}}_i\), there is no dependence of \({\mathrm{G}}\), but only on \({\mathrm{G}}^2\), \({{\mathcal {G}}}=\Lambda \) is a regular point for \({\widehat{{\mathrm{H}}}}_i\). Then, we let
and
We now check that, under the further assumptions
\({\widehat{{\mathrm{H}}}}_i\) are holomorphic functions on the domain
By (62), the first equation in (60) and the expression of \({\mathrm{r}}(y, x)\) in (57), we have
and hence, because of (61),
By inequalities (65)–(66) and Proposition 3.3, we only need to check that, if \(\varepsilon _*\in \{\beta \varepsilon , -{\overline{\beta }}\varepsilon \}\) for \(i=1\) and \(\varepsilon _*=(\beta +{\overline{\beta }})\varepsilon \) for \(i=2\), then Eq. (34) with \(\varepsilon =\varepsilon _*\) and \(t={\widehat{{\mathrm{E}}}}_{\varepsilon _*}({{\mathcal {G}}}, \gamma )\) has not solutions in \({{\mathbb {D}}}_{\delta , s_0, \sqrt{m_0^3\alpha _-},\sqrt{\varepsilon _0}}\). We prove that any such solution would verify \(|\varepsilon _*|\ge \frac{1}{4}\), which would contradict (66), as \(|\beta ^*\varepsilon |\ge |\varepsilon _*|\). Using (59), Eq. (34) with \(\varepsilon =\varepsilon _*\) and \(t={\widehat{{\mathrm{E}}}}_{\varepsilon _*}({{\mathcal {G}}}, \gamma )\) is
We solve for \(\varepsilon _*\):
with the double determination of the square root. Since \(\left| \frac{{{\mathcal {G}}}}{\Lambda }\right| \ge 1-\frac{\delta }{\Lambda }\ge \frac{3}{4}\) and, if \(c_*(s_0):=\sup _{{{\mathbb {T}}}_{s_0}}|\sin \gamma |\), \(\left| \sin \gamma \sqrt{\frac{{{\mathcal {G}}}^2}{{\Lambda }^2}-1}\right| \le c^*(s_0)\sqrt{\frac{\delta }{\Lambda }}\le \frac{1}{4}\), we have \(|\varepsilon _*|\ge \frac{1}{2}\left( \frac{3}{4}-\frac{1}{4}\right) =\frac{1}{4}\), as claimed.
We are now ready to state the result. Observe that the motions
correspond, using the coordinates \(({\mathrm{G}}, {\mathrm{g}})\), to librations about (0, 0) if \(0<{{\mathcal {G}}}_0<\Lambda \); about \((0, \pi )\) if \(-\Lambda<{{\mathcal {G}}}_0<0\). We shall prove the existence, in \({\widehat{{\mathrm{H}}}}_i\)’s, of motions close to these ones.
Theorem 5.1
Let \(\alpha _-\), \(\alpha _+\), \(\beta \), \({\overline{\beta }}\), \(\delta \), \(\varepsilon _0\) and \(s_0\) be fixed; \(\beta _*\), \(\beta ^*\) as in (10). There exist two numbers \(C^*>C_*>1\), both independent of \(\alpha _-\), \(\alpha _+\), \(\beta \), \({\overline{\beta }}\), \(\delta \), \(\varepsilon _0\), with \(C^*\) possibly depending on \(s_0\), while \(C_*\) independent of \(s_0\), such that, if the following inequalities are satisfied
it is possible to find new coordinates \(({{\mathcal {G}}}_*, \gamma _*, y_*, x_*)\) and a time T such that any solution \(\Gamma _*(t)=({{\mathcal {G}}}_*(t), \gamma _*(t), y_*(t), x_*(t)) \) of \({\widehat{{\mathrm{H}}}}_i\) with initial datum \(\Gamma _*(0)\)\(=\)\(({{\mathcal {G}}}_*(0)\), \(\gamma _*(0)\), \(y_*(0)\), \(x_*(0))\)\(\in {{\mathbb {D}}}\) such that
stays in \({{\mathbb {D}}}\) for all \(0\le t\le T\) and \({{\mathcal {G}}}_*\) varies a little in the course of such time:
If, in addition,
then motions close to librations occur, in the sense that, also
The time T can be taken to be
Finally, the change
is real-analytic and close to the identity, in the sense that
Remark 5.1
(Proof of Theorem 1.1) Inequalities (67), (68) and (69) are simultaneously satisfied provided that the following holds. Fix \(0<\varepsilon _0<1\) and \(0<\delta \le \frac{\Lambda }{4}\) once forever. Then, identify \(\delta \) as the size of \({\mathrm{U}}_0\) and \(2^{-N_0}\) as the size of \({\mathrm{V}}_0\). Take
For short, we have written “\(a\rhd b\)” if there exist \(c>1\), independent of \(\delta \), \(\Lambda \), \(\alpha _-\), \(\alpha _+\), a and \(s_0\) such that \(a>cb\). Note that here it is essential that \(\alpha _-\), \(s_0\), \(\beta _*\) and \(\beta ^*\) can be chosen arbitrarily large.
Proof
During the proof, we shall make extensive use of CauchyFootnote 10 inequalities.
We aim to apply Theorem 4.1, with \({{\mathbf {I}}}={{\mathcal {G}}}\), \({\varvec{\varphi }}=\gamma \), (y, x) as in (57), \({\mathrm{h}}(y)=-\frac{m_0^5}{2y^2}\) and, finally
In our case, \({{\mathbf {p}}}\), \({{\mathbf {q}}}\) do not exist and the unperturbed term \({\mathrm{h}}\) does not depend on \({{\mathbf {I}}}={{\mathcal {G}}}\). Therefore, we have only to verify the last condition in (44). We have
(having used \(\varepsilon _0<1\)) and
withFootnote 11\(C_1\), \(C_2\), \(C_*\) independent of \(\alpha _-\), \(\alpha _+\), \(\delta \), \(\beta _*\), \(\beta ^*\) but \(C_1\) possibly depending on \(s_0\), while \(C_2\), \(C_*\) independent of \(s_0\). We have chosen the number \(C^*\) in (67) larger than or equal to \(2C_1/C_2\) and the number \(C_*\) larger than or equal to \(3C_2/2\), so that \(\left( C_1\frac{\delta }{\Lambda } +C_2\beta _*\right) \le \frac{3}{2}C_2\beta _*{\le C_*\beta _*}\). We have
Therefore, the last condition in (44) is immediately implied by \(N<N_0\), with \(N_0\) as in (67). We then find a real-analytic transformation
which leads \({\widehat{{\mathrm{H}}}}_i\) to
where \(g_*\) and \(f_*\) satisfy the following bounds:
with \({\overline{f}}(y_*, x_*, {{\mathcal {G}}}_*)\) the \(\gamma _*\)-average of \(f(y_*, x_*, {{\mathcal {G}}}_*, \gamma _*)\) and \(\Delta \) as in (72). Let now \(\Gamma _*(t)=({{\mathcal {G}}}_*(t), \gamma _*(t), y_*(t), x_*(t)) \) be a solution of \({\widehat{{\mathrm{H}}}}_i\) with initial datum \(\Gamma _*(0)\)\(=\)\(({{\mathcal {G}}}_*(0)\), \(\gamma _*(0)\), \(y_*(0)\), \(x_*(0))\)\( \in {{\mathbb {D}}}\) and verifying (68). We look for a time \(T>0\) such that \(\Gamma _*(t)\in {{\mathbb {D}}}\) for all \(0\le t\le T\). We show that we can take T as in (70), which, for convenience, we rewrite as
where \(\Delta \) is as in (72). Equation (73) implies
So, for \(t\le \frac{\sqrt{m_0^3\alpha _-\varepsilon _0}}{\Delta }\), we have
namely, \(y_*(t)\in {{\mathbb {Y}}}\) for \(0\le t\le T\). Since \(|y|\ge \sqrt{m_0^3 \alpha _-}\) for all this time, we also have
Inequalities \(0<\varepsilon _0<1\) and \(t\le \min \left\{ \sqrt{\frac{\alpha _-^3}{m_0}},\ \frac{\sqrt{m_0^3\alpha _-}}{\Delta } \right\} \) imply
and hence \(x_*(t)\in {{\mathbb {X}}}\) for \(0\le t\le T\). Since \(0\le t\le 2^N\frac{s_0\delta }{\Delta }\), we have
and hence \({{\mathcal {G}}}_*(t)\in {{\mathbb {G}}}\). Let us now evaluate the variation of \(\gamma _*\) during the time T. We have
Proceeding as in (72) and using Cauchy inequalities, one sees that \(\inf |\partial _{{{\mathcal {G}}}_*}{\overline{f}}|\ge c^*\beta _*{m_0^2}\frac{a}{\Lambda \alpha _+^2}\). So, using (72) and \(N_0^{-1}<\frac{c_0^2\varepsilon _0^2\alpha _-^2}{2\alpha _+^2}\),
with \(c^*\), \(c^\circ \) independent of \(\alpha _-\), \(\alpha _+\), \(\beta \), \({\overline{\beta }}\), \(\delta \), \(\varepsilon _0\) and \(s_0\). We then see that \(|\gamma _*(T)-\gamma _*(0)|\) is lower-bounded by \(3\pi \) as soon as the condition in (69) is satisfied. \(\square \)
Notes
The planetary model is the dynamical system of \(1+n\) (\(n\ge 2\)) point masses undergoing Newtonian attraction, in the case when one of the masses (“sun”) is much heavier than the remaining ones (“planets”), which are of comparable size.
If (\({{\mathbf {y}}}_0\), \({{\mathbf {x}}}_0\)), (\({{\mathbf {y}}}_1\), \({{\mathbf {x}}}_1\)), (\({{\mathbf {y}}}_2\), \({{\mathbf {x}}}_2)\) are impulse–position coordinates of \(m_0\), \(\mu m_0\), \(\kappa m_0\), respectively, by “Jacobi coordinates” one usually means a linear, canonical change (\({{\mathbf {y}}}_0\), \({{\mathbf {y}}}_1\), \({{\mathbf {y}}}_2\), \({{\mathbf {x}}}_0\), \({{\mathbf {x}}}_1\), \({{\mathbf {x}}}_2)\in ({{\mathbb {R}}}^3)^6\rightarrow ({{\mathbf {y}}}_{\mathrm{c}}\), \({{\mathbf {y}}}\), \({{\mathbf {y}}}^{\prime }\), \({{\mathbf {x}}}_{\mathrm{c}}\), \({{\mathbf {x}}}\), \({{\mathbf {x}}}^{\prime })\in ({{\mathbb {R}}}^3)^6\) defined so that \({{\mathbf {x}}}_\mathrm{c}=({{\mathbf {x}}}_0+\mu {{\mathbf {x}}}_1 +\kappa {{\mathbf {x}}}_2)(1+\mu +\kappa )^{-1}\) is the center of mass of the system, while \({{\mathbf {x}}}\), \({{\mathbf {x}}}^{\prime }\) are, respectively, the mutual position of \({{\mathbf {x}}}_1\) with respect to \({{\mathbf {x}}}_0\) and the position of \({{\mathbf {x}}}_2\) with respect to the center of mass of \({{\mathbf {x}}}_0\) and \({{\mathbf {x}}}_1\): \({{\mathbf {x}}}={{\mathbf {x}}}_1-{{\mathbf {x}}}_0\), \({{\mathbf {x}}}^{\prime }={{\mathbf {x}}}_2-\frac{\mu }{1+\mu }{{\mathbf {x}}}_1-\frac{1}{1+\mu }{{\mathbf {x}}}_0\). The new impulses \(({{\mathbf {y}}}_{\mathrm{c}}\), \({{\mathbf {y}}}\), \({{\mathbf {y}}}^{\prime }\)) are uniquely defined by the constraint of symplecticity. The new Hamiltonian turns to be \({{\mathbf {x}}}_\mathrm{c}\)-independent, due to the conservation of \({{\mathbf {y}}}_\mathrm{c}\). The dependence on \({{\mathbf {y}}}_{\mathrm{c}}\) can be eliminated choosing (as it is always possible to do) a reference frame where \({{\mathbf {x}}}_{\mathrm{c}}\equiv {{\mathbf {0}}}\). See e.g., Giorgilli (2008, §5.2–5.3) for more details.
If (\({{\mathbf {y}}}_0\), \({{\mathbf {y}}}_1\), \({{\mathbf {y}}}_2\), \({{\mathbf {x}}}_0\), \({{\mathbf {x}}}_1\), \({{\mathbf {x}}}_2)\) are impulse–position coordinates of \(m_0\), \(\mu m_0\), \(\kappa m_0\), respectively, the \(m_0\)-centric coordinates \(({{\mathbf {Y}}}_0, {{\mathbf {y}}}, {{\mathbf {y}}}^{\prime }, {{\mathbf {X}}}_0, {{\mathbf {x}}}, {{\mathbf {x}}}^{\prime })\) are defined via \({{\mathbf {X}}}_0={{\mathbf {x}}}_0\), \({{\mathbf {x}}}={{\mathbf {x}}}_1-{{\mathbf {x}}}_0\), \({{\mathbf {x}}}^{\prime }={{\mathbf {x}}}_2-{{\mathbf {x}}}_0\) and by the symplecticity constraint.
For the unperturbed Keplerian motions, it is \({{\mathbf {y}}}=\mathrm{m}\dot{{\mathbf {x}}}\), so
$$\begin{aligned} \frac{1}{2\pi }\int _0^{2\pi }{{\mathbf {y}}}\hbox {d}\ell =\frac{1}{2\pi }\int _0^{2\pi }m_0\dot{{\mathbf {x}}}\hbox {d}\ell =\frac{1}{T}\int _0^{T}m_0 \dot{{\mathbf {x}}}\hbox {d}t=0 \end{aligned}$$where \(T=\frac{2\pi }{\omega _{\mathrm{K}}}\) and \(\omega _\mathrm{K}=\frac{m_0^5}{{\Lambda }^3}\) is the Keplerian frequency.
The change of coordinates discussed in Pinzari (2020) is a little more general than (26), since it is a four-dimensional map composed by (26) and
$$\begin{aligned} \left\{ \begin{array}{llll}\displaystyle \Lambda ={{\mathcal {L}}}\\ \\ \displaystyle \ell =\lambda +\arg \left( \cos \gamma , \ \frac{{{\mathcal {L}}}}{|{{\mathcal {G}}}|}\sin \gamma \right) \end{array}\right. . \end{aligned}$$In this paper, we shall only use the two-dimensional projection (26).
Note that this circumstance would not hold without assuming \(A_3\), since, in that case, instead of the term \(\frac{{\mathrm{G}}^2}{{\mathrm{r}}^2}\), we would have \(\frac{(\mathrm{C}-{\mathrm{G}})^2}{{\mathrm{r}}^2}\) or \(\frac{({\mathrm{C}}+{\mathrm{G}})^2}{\mathrm{r}^2}\) (depending on the verse of rotation of \({{\mathbf {x}}}^{\prime }\)), which would break the symmetry.
Note that the procedure described in this section does not seem to be related to Arnold (2006, §6.4.4), for the lack of slow–fast couples.
We denote as \({{\mathbf {x}}}^h:=x_1^{h_1}\dots x_n^{h_n}\), where \({{\mathbf {x}}}=(x_1, \dots , x_n)\in {{{\mathbb {R}}}}^n\) and \(h=(h_1, \dots , h_n)\in {{{\mathbb {N}}}}^n\).
Split f in (71) as \(f=f_1+f_2\), where \(f_1=\frac{m_0^2}{{\mathrm{r}}(y, x)}\varepsilon (y, x)\frac{({\Lambda }^2-{{\mathcal {G}}}^2)}{2{\Lambda }^2}\cos ^2\gamma \). The term proportional to \(C_i\) corresponds to be a upper bound of \(\left\| \frac{f_i}{\omega _y}\right\| \). \(C_2\) can be chosen to be independent of \(s_0\) because \(\Vert f_2\Vert \) goes to zero as \(s_0\rightarrow \infty \).
nqp stands for “non-quasiperiodic.”
The three first inequalities in (86) are immediately seen to be stronger than the corresponding three first inequalities in (79). On the other hand, rewriting the second inequality in (86) as \(\frac{\delta ^{\prime }}{\delta }<1-\frac{2\delta ^{\prime }}{\delta }\) and using the inequality (which holds for all \(x\ge 1\)) \(\log x\ge 1-\frac{1}{x}\) with \(x=\frac{\delta }{2\delta ^{\prime }}\), we have also \(\frac{\delta ^{\prime }}{\delta }<\log \frac{\delta }{2\delta ^{\prime }} \).
References
Arnold, V.I.: A theorem of Liouville concerning integrable problems of dynamics. Sib. Mat. Ž. 4, 471–474 (1963a)
Arnold, V.I.: Small denominators and problems of stability of motion in classical and celestial mechanics. Russ. Math. Surv. 18(6), 85–191 (1963b)
Arnold, V.I.: Instability of dynamical systems with many degrees of freedom. Dokl. Akad. Nauk SSSR 156, 9–12 (1964)
Arnold, V.I., Kozlov, V.V., Neishtadt, A.I.: Mathematical aspects of classical and celestial mechanics, volume 3 of Encyclopedia of Mathematical Sciences, 3rd edn. Springer, Berlin (2006) [Dynamical systems. III], Translated from the Russian original by E. Khukhro
Chierchia, L., Pinzari, G.: The planetary \(N\)-body problem: symplectic foliation, reductions and invariant tori. Invent. Math. 186(1), 1–77 (2011)
Delshams, A., Kaloshin, V., de la Rosa, A., Seara, T.M.: Global instability in the restricted planar elliptic three body problem. Commun. Math. Phys. 366(3), 1173–1228 (2019)
Féjoz, J.: Démonstration du ‘théorème d’Arnold’ sur la stabilité du système planétaire (d’après Herman). Ergod. Theory Dyn. Syst. 24(5), 1521–1582 (2004)
Féjoz, J., Guardia, M.: Secular instability in the three-body problem. Arch. Ration. Mech. Anal. 221(1), 335–362 (2016)
Fortunati, A., Wiggins, S.: Negligibility of small divisor effects in the normal form theory for nearly-integrable Hamiltonians with decaying non-autonomous perturbations. Celest. Mech. Dyn. Astronom. 125(2), 247–262 (2016)
Giorgilli, A.: Appunti di Meccanica Celeste (Italian) (2008). http://www.mat.unimi.it/users/antonio/meccel/Meccel_5.pdf. Accessed 5 June 2016
Laskar, J., Robutel, P.: Stability of the planetary three-body problem. I. Expansion of the planetary Hamiltonian. Celest. Mech. Dyn. Astron. 62(3), 193–217 (1995)
Nehorošev, N.N.: An exponential estimate of the time of stability of nearly integrable Hamiltonian systems. Usp. Mat. Nauk 32(6(198)), 5–66, 287 (1977)
Pinzari, G.: On the Kolmogorov set for many-body problems. Ph.D Thesis, Università Roma Tre (April 2009)
Pinzari, G.: Aspects of the planetary Birkhoff normal form. Regul. Chaotic Dyn. 18(6), 860–906 (2013)
Pinzari, G.: A first integral to the partially averaged newtonian potential of the three-body problem. Celest. Mech. Dyn. Astron. 131(5), 22 (2019)
Pinzari, G.: Euler integral and perihelion librations. Discrete Contin. Dyn. Syst. (2020) https://doi.org/10.3934/dcds.2020165
Pöschel, J.: Nekhoroshev estimates for quasi-convex Hamiltonian systems. Math. Z. 213(2), 187–216 (1993)
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Alain Goriely.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
I wish to thank Ugo Locatelli for highlighting discussions; Qinbo Chen, Jerome Daquin and Sara Di Ruzza for their interest and doubly Qinbo Chen for pointing out a mistake in a previous formulation of Proposition 3.3. This work has been supported by the European Research Council [Grant Number 677793 Stable and Chaotic Motions in the Planetary Problem]. Figures 1, 2 and 3 have been produced using the software mathematica\(^{\textregistered }\).
Proof of Theorem 4.1
Proof of Theorem 4.1
In this section, we use the definitions and the notations introduced in Sect. 4.1, plus the following further ones.
Definition A.1
Given \({\mathrm{h}}\) and f as in (41), we callFootnote 12 the function (56) nqp-primitive offrelatively to\({\mathrm{h}}\) or, simply, nqp-primitive off.
Example A.1
Let \(n=1\) and \(f(I, \varphi )=a(I)\cos \varphi \). The nqp-primitive of f is
having changed variable \(\tau \rightarrow \psi =\frac{\omega _{ I}}{\omega _y}(\tau -x) \).
Definition A.2
(Time-one flow) Let \({{\mathcal {L}}}_\phi (\cdot ):=\big \{\phi , \cdot \big \}\). For a given \(\phi \in {{\mathcal {O}}}_{\rho , s, \delta , r, \xi }\) and \(h\in {{\mathbb {N}}}\), we denote as \(\Phi _h\) the formal series
We call \(\Psi :=\Phi _0:=e^{{{\mathcal {L}}}_\phi }\)time-one flow generated by \(\phi \).
Definition A.3
We call nqp-homological transformation the time-one flow \(\Psi :=\Phi _0=e^{{{\mathcal {L}}}_\phi }\) generated by \(\phi \) in (49), with \(\phi _{khj}\) as in (55).
Below, we prove that the nqp-homological transformation is well defined. Later, we shall prove that the composition of \(N+1\)nqp-homological transformations \(\Psi _1=e^{{{\mathcal {L}}}_{\phi _1}}\), \(\dots \), \(\Psi _{N+1}=e^{{{\mathcal {L}}}_{\phi _{N+1}}}\), where
with \({\mathrm{H}}_0={\mathrm{H}}\) provides the transformation \(\Psi _*\) in (45).
Lemma A.1
(Pöschel 1993) There exists an integer number \(\overline{\mathrm{c}}_{n,m}\) such that, for any \(\phi \in {{\mathcal {O}}}_{\rho , s, \delta , r, \xi }\) and any \(r^{\prime }<r\), \(s^{\prime }<s\), \(\rho ^{\prime }<\rho \), \(\xi ^{\prime }<\xi \), \(\delta ^{\prime }<\delta \) such that
then the series in (75) converge uniformly so as to define the family \(\{\Phi _h\}_{h=0,1,\dots }\) of operators
Moreover, the following bound holds (showing, in particular, uniform convergence):
for all \(g\in {{\mathcal {O}}}_{\rho , s, \delta , r, \xi }\).
Remark A.1
(Pöschel 1993) The bound (76) immediately implies
Lemma A.2
(Iterative lemma) There exists a number \({\widetilde{{\mathrm{c}}}}_{n,m}>1\) such that the following holds. For any choice of positive numbers \(r^{\prime }\), \(\rho ^{\prime }\), \(s^{\prime }\), \(\xi ^{\prime }\), \(\delta ^{\prime }\) satisfying
and provided that the following inequality holds true
one can find an operator
with
which carries the Hamiltonian \({\mathrm{H}}\) in (48) to
where
with
for a suitable \(\phi \in {{\mathcal {O}}}_{\rho _1, s_1, \delta _1, r_1, \xi _1}\) verifying
Furthermore, if
the following uniform bounds hold:
Remark A.2
The right-hand side of (81) benefits of the absence of small divisors, as no “ultraviolet” (i.e., with size \(\sim e^{-Ns}\Vert f\Vert \)) term appears.
Proof
Let \(\overline{\mathrm{c}}_{n,m}\) be as in Lemma A.1. We shall choose \({\widetilde{{\mathrm{c}}}}_{n,m}\) suitably large with respect to \(\overline{\mathrm{c}}_{n,m}\).
Let \(\phi _{khj}\) as in (55). Let us fix
and assume that
Then, we have
Since
we have
which yields (after multiplying by \(e^{|k|{\overline{s}}}({\overline{\delta }})^{j+h}\) and summing over k, j, h with \((k,h-k)\ne (0,0)\)) to
Note that the right-hand side is well defined because of (85). In the case of the choice
(which, in view of the two latter inequalities in (79), satisfies (84)–(85)) the inequality becomes (82). An application of Lemma ,with r, \(\rho \), \(\xi \), s, \(\delta \) replaced by \(r_1-r^{\prime }\), \(\rho _1-\rho ^{\prime }\), \(\xi _1-\xi ^{\prime }\), \(s_1-s^{\prime }\), \(\delta _1-\delta ^{\prime }\), concludes with a suitable choice of \({\widetilde{{\mathrm{c}}}}_{n,m}>\overline{\mathrm{c}}_{n,m}\) and (by (51))
Observe that the bound (81) follows from Eqs. (77) and (76) and the identities
with \(g_1:={{\mathcal {L}}}_\phi (g)=\{\phi , g\}\). The bounds in (83) are a consequence of equalities of the kind
(and similar). \(\square \)
The proof of the Theorem 4.1 goes through iterate applications of Lemma A.2.
We premise the following
Remark A.3
Replacing conditions in (79) with the stronger onesFootnote 13
[and keeping (78), (80) unvaried] one can take, for \(s_+\), \(\delta _+\), \(s_1\), \(\delta _1\) the simpler expressions
(while keeping \(r_+\), \(\rho _+\), \(\xi _+\), \(r_1\), \(\rho _1\), \(\xi _1\) unvaried). Indeed, since \(1-e^{-x}\le x\) for all x,
This also implies \(\xi _+=\delta _1-\delta ^{\prime }\ge \delta -2\delta ^{\prime }=\xi _{+\mathrm new}\). That \(s_+\ge s_{+\mathrm new}\), \(s_1\ge s_{1\mathrm new}\) is even more immediate.
Now, we can proceed with the
Proof of Theorem 4.1
Let \({\widetilde{{\mathrm{c}}}}_{n,m}\) be as in Lemma A.2. We shall choose \(\overline{\mathrm{c}}_{n,m}\) suitably large with respect to \({\widetilde{{\mathrm{c}}}}_{n,m}\).
We apply Lemma A.2 with
We make use of the stronger formulation described in Remark A.3. Conditions in (78) and the two former conditions in (86) are trivially true. The two latter inequalities in (86) reduce to
and they are certainly satisfied by assumption (44), for \(N>2\). Since
we have that condition (80) is certainly implied by the last inequality in (44), once one chooses \({\mathrm{c}}_{n,m}>{81} {\widetilde{{\mathrm{c}}}}_{n,m}\). By Lemma A.2, it is then possible to conjugate \(\mathrm{H}\) to
with \(f_1\in {{\mathcal {O}}}_{\rho ^{(1)},s^{(1)},\delta ^{(1)}, r^{(1)},\xi ^{(1)},}\), where \((\rho ^{(1)},s^{(1)},\delta ^{(1)}, r^{(1)},\xi ^{(1)},):=2/3 (\rho , s, \delta , r, \xi )\) and
since \({\mathrm{c}}_{n,m}\ge {162} {\widetilde{{\mathrm{c}}}}_{n,m}\) and \(N\ge 1\). Now, we aim to apply Lemma A.2N times, again as described in Remark A.3, each time with parameters
Therefore, we find, at each step
with \(1\le j\le N\).
We assume that for a certain \(1\le i\le N\) and all \(1\le j\le i\), we have \({\mathrm{H}}_j\in {{\mathcal {O}}}_{\rho ^{(j)},s^{(j)},\delta ^{(j)}, r^{(j)},\xi ^{(j)},}\) of the form
with \(g_{-1}\equiv 0\), \(g_0={\overline{f}}\). We want to prove that, if \(i<N\), Lemma A.2 can be applied once again, so as to conjugate \({\mathrm{H}}_i\) to a suitable \({\mathrm{H}}_{i+1}\) such that (90)–(91) are true with \(j=i+1\). To this end, according to the discussion in Remark A.3, we check the stronger inequalities
where \(d_i:=\min \{ \rho _i^{\prime } s_i^{\prime }, r_i^{\prime }\xi _i^{\prime }, {\delta ^{\prime }}_i^2 \}\). Conditions (92) and (93) are certainly verified, since in fact they are implied by the definitions above (using also \(\delta _i\le \frac{2}{3}\delta \), \({{\mathcal {X}}}_i\le {{\mathcal {X}}}\)) and the two former inequalities in (44). To check the validity of (94), we firstly observe that
Using then \({\mathrm{c}}_{n,m}>{162} {\widetilde{{\mathrm{c}}}}_{n,m}\),\({{\mathcal {X}}}_i<{{\mathcal {X}}}\), Eq. (87), the inequality in (91) with \(j=i\) and the last inequality in (44), we easily conclude
which implies (94).
Then, the iterative lemma is applicable to \({\mathrm{H}}_i\), which is conjugated to
with \(g_{i}\), \(f_{i+1}\) satisfying (90) with \(j=i+1\). We prove that (91) holds with \(j=i+1\), so as to complete the inductive step. By the thesis of the iterative lemma,
On the other hand, using (87), (95) and the last assumption in (44) and (91) with \(j=i\), we obtain
Furthermore, using (76) with \(j=1\), (82) and
and, with \(j=1\), \(\dots \), \(i-1\)
we obtain
where the last summand is to be taken into account only when \(i\ge 2\) and we have used that, when \(j=0\), \(d_i\) can be replaced with \(N d_i\). Collecting the bounds above, we find (91) holds with \(j=i+1\). Then, the iterative lemma can be applied N times, and we get (46) as a direct consequence of (87) and (97) and of the validity of (91) for all \(1\le i\le N\). The inequalities in (47) follow from (83) and (88). For example,
etc. \(\square \)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Pinzari, G. Perihelion Librations in the Secular Three-Body Problem. J Nonlinear Sci 30, 1771–1808 (2020). https://doi.org/10.1007/s00332-020-09624-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00332-020-09624-x