Abstract
We develop KAM theory close to an elliptic fixed point for quasi-linear Hamiltonian perturbations of the dispersive Degasperis–Procesi equation on the circle. The overall strategy in KAM theory for quasi-linear PDEs is based on Nash–Moser nonlinear iteration, pseudo differential calculus and normal form techniques. In the present case the complicated symplectic structure, the weak dispersive effects of the linear flow and the presence of strong resonant interactions require a novel set of ideas. The main points are to exploit the integrability of the unperturbed equation, to look for special wave packet solutions and to perform a very careful algebraic analysis of the resonances. Our approach is quite general and can be applied also to other 1d integrable PDEs. We are confident for instance that the same strategy should work for the Camassa–Holm equation.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction and Main Result
In this paper we prove existence and stability of Cantor families of quasi-periodic, small amplitude, solutions for quasi-linear Hamiltonian perturbations of the Degasperis–Procesi (DP) equation
under periodic boundary conditions \(x\in \mathbb {T}:=\mathbb {R}/2\pi \mathbb {Z}\), where
the “Hamiltonian density” f belongs to \(C^{\infty }(\mathbb {R}, \mathbb {R})\) and is such that
where \(O(u^9)\) denotes a function with a zero of order at least nine at the origin. The Eq. (1.1) is a Hamiltonian PDE of the form \(u_t=J\,\nabla H(u)\) where \(\nabla H\) is the \(L^2(\mathbb {T},\mathbb {R})\) gradient and the function
is defined on the phase space \(H_0^1(\mathbb {T}):=\left\{ u\in H^1(\mathbb {T}, \mathbb {R}) : \int _{\mathbb {T}} u\,dx=0\right\} \). The Eq. (1.1) for \(f=0\) is the DP equation which was first proposed in [29] in the form
where \(c_0,c_1,c_2,\gamma ,\alpha \in \mathbb {R}\), \(\alpha \ne 0\). By applying Galilean boosts, translations and time rescaling to (1.5) one obtains Eq. (1.1) with \(f=0\).
The DP equation can be regarded as a model for nonlinear shallow water dynamics and its asymptotic accuracy is the same as for the Camassa–Holm equation and a degree more than the KdV equation [23]. There is a rather large literature on this equation starting form the paper [28] in which the complete integrability is proved. The local and global well-posedness, for instance, have been extensively studied as well as existence of wave breaking phenomena (peakons, N-peakons solutions). Without trying to be exhaustive we quote [18, 20,21,22, 48, 54] and we refer to [32] and references therein for more literature about Degasperis–Procesi equation.
Actually many of these results (notably the wave breaking) are studied in the dispersionless case, which corresponds to (1.1) with \(f=0\) and \(u\rightsquigarrow u+1\). In the present paper the presence of the dispersive terms \(-4 u_{x}+ u_{xxx}\) is fundamental. Our main purpose is to prove existence of quasi-periodic solutions in high Sobolev regularity by following a KAM approach. In this setting a quasi-periodic solution with \(\nu \in \mathbb {N}\) frequencies is defined by an embedding
and a frequency vector \(\omega \in \mathbb {R}^{\nu }\), with rationally independent entries, such that \(u(t,x)=U(\omega t,x)\) is a solution of (1.1) and \(U({\varphi },x)\in H^p(\mathbb {T}^{\nu +1},\mathbb {R})\) for some p sufficiently large.
Notice that, in a neighbourhood of \(u=0\), (1.1) can be seen as a perturbation of the linear PDE
whose bounded solutions have the form
where \(j\mapsto \lambda (j)\) is the linear dispersion law. It is easily seen that all solutions of (1.7) with compact Fourier support are periodic, but with period depending on the support. In this context it is natural to investigate whether Eq. (1.1) has periodic or quasi-periodic solutions close to to small amplitude linear solutions (1.8). We remark that, since the solutions of (1.8) are all periodic, the existence of quasi-periodic solutions, if any, strongly relies on the presence of the quadratic nonlinearity in (1.1).
In the present paper we construct quasi-periodic solutions mainly supported in Fourier space at \(\nu \ge 2\) distinct tangential sites
where, without loss of generality, we shall always assume that \(\overline{\jmath }_1=\max _{i=1,\ldots , \nu }\overline{\jmath }_i\). We denote by
the linear frequencies of oscillations related to the tangential sites. More precisely our solutions will have the form
where \(o(\sqrt{|\xi |})\) is meant in the \(H^{s}\)-topology with s large. It is well know that in looking for quasi-periodic solutions “small divisors” problems arise. To overcome such problems we shall require that \(S^+\) satisfies a wave packet condition and that the unperturbed amplitudes\(\xi \) belong to an appropriate Cantor-like set of positive measure.
The following definition quantifies the wave packet condition.
Definition 1.1
For \(\mathtt {r}\in (0, 1)\), we say that a set of natural numbers \(S^+=\{\overline{\jmath }_1, \ldots , \overline{\jmath }_{\nu }\}\) is in \(\mathcal {V}(\mathtt {r})\) if
Denoting by \(B(0,\varrho )\) the ball centred at the origin of \(\mathbb {R}^{\nu }\) of radius \(\varrho >0\), our result can be stated as follows.
Theorem 1
Let \(\nu \in \mathbb {N}\), \(\nu \ge 2\), and consider \(f\in C^{\infty }(\mathbb {R},\mathbb {R})\) satisfying (1.3). There exists a constant \(\mathtt {r}_0>0\) such that, for any choice of \(S^+\) in \(\mathcal {V}(\mathtt {r})\), with \(0<\mathtt {r}\le \mathtt {r}_0\), there exist \(s\gg 1\), \(0<\varrho \ll 1\) and a positive measure Cantor-like set \(\mathfrak {A}\subseteq B(0,\varrho )\) such that the following holds. For any \(\xi \in \mathfrak {A}\), the Eq. (1.1) possesses a small amplitude quasi-periodic solution \(u(t,x; \xi )=U(\omega t,x; \xi )\) of the form (1.11) where \(U({\varphi },x)\in H^{s}(\mathbb {T}^{\nu +1},\mathbb {R})\) and \(\omega :=\omega (\xi )\in \mathbb {R}^{\nu }\) is a diophantine frequency vector. Moreover for \(0<\varepsilon \le \sqrt{\varrho }\), the set \(\mathfrak A\) has asymptotically full relative measure in \([\varepsilon ^2,2\varepsilon ^2]^\nu \).
Moreover we have the following stability result.
Theorem 2
(Linear stability). The quasi-periodic solutions (1.11) \(u(t, x)=U(\omega t, x)\) of Eq. (1.1) are linearly stable and reducible in the following sense. Consider Eq. (1.1) linearized at the embedded torus \(U({\varphi },x)\), then the corresponding operator has purely imaginary spectrum and there exists a change of variables \(H^s(\mathbb {T}, \mathbb {R}) \rightarrow H^s(\mathbb {T}, \mathbb {R})\), quasi periodic in time with frequency \(\omega \), which diagonalizes it in the directions normal to the torus. As a consequence the Cauchy problem of the linearized equation is stable, i.e. the Sobolev norms are uniformly bounded in t.
Theorems 1, 2 are formulated in the typical style of results on reducible KAM tori for PDEs. For the proof we use the overall strategy of [4], which however has to be substantially developed to deal with (1.1). Let us briefly explain the main new issues.
-
The dispersion law is asymptotically linear as for the Klein–Gordon equation, studied for instance in [6, 7]. As explained in those papers, the fact that the dispersive effects are very weak (essentially time and space play the same role) creates a number of difficulties even in the study of KAM theory for semi-linear PDEs. Of course, since (1.1) is quasi-linear, there are additional serious difficulties coming from the strong perturbative effects of the nonlinearity.
-
The DP equation is resonant at zero and does not depend on any external parameters. This is a fundamental difference w.r.t. the Klein–Gordon equation, where one modulates the mass in order to avoid resonances. Moreover the DP has non-trivial resonances already at order four (see Sect. 1.3), differently from the previous KAM results for quasi-linear PDEs. As a further difficulty the algebraic structure of the resonances is quite complicated. In order to avoid the inherent problems we rely on the presence of “many” (precisely eight) approximate constants of motion of (1.1) coming from the integrable structure of the DP equation. Dealing with the problems related to resonances is the core of this paper and requires a set of new ideas and a careful analysis.
-
The very strong restriction of the tangential sites \(S^{+}\) is exploited several times to simplify the problems arising from the rational and asymptotically linear dispersion law. Physically we are looking for solutions mainly supported in Fourier space on modes which are relatively close to each other. It seems reasonable that such condition could be weakened, but it is not clear to us how to deal with the technical difficulties which would arise.
-
As in other resonant cases, the diophantine constant \(\gamma \) is related to the size of the solution one is looking for (see (1.11)). Moreover, due to the linear dispersion law, we are forced to impose very “weak” non-degeneracy conditions on the linear frequencies of oscillations. As a consequence we need a refined bifurcation analysis in order to find a very good first approximate solution and fulfil the smallness conditions required for the Nash–Moser scheme.
Some comments on Eq. (1.1) and on Theorems 1, 2 are in order.
1.1 The unperturbed DP equation
We look at (1.1) as a perturbation of the linear equation (1.7), in order to fit the typical perturbative setting of KAM for PDEs , we refer to Sect. 1.1 for more details.
Actually, since the Degasperis–Procesi equation is completely integrable (see [28]) it would be very natural to try to construct solutions of (1.1) which bifurcate from quasi-periodic solutions of the unperturbed DP equation
which corresponds to (1.1) with \(f=0\). Indeed, near zero, the (1.1) can be seen also as a perturbation of (1.14). Unfortunately even though algebro-geometric finite-gap solutions have been already constructed in literature for the DP equation (see [42]) it is not clear to us whether they are real quasi-periodic solutions in the sense of (1.6). Of course if one were able to bifurcate from finite-gap solutions of (1.14) then it would be possible to prove existence of large quasi-periodic solutions, by requiring that f is small. Such a strategy has been followed successfully for the KdV and cubic NLS equation on the circle. Actually for those equations one can prove the existence of Birkhoff coordinates [41, 43] (the Cartesian version of action-angle variables), which trivialize the dynamics (in the sense that the solutions turn out to be all periodic, quasi-periodic or almost periodic) and provide a fundamental tool for investigating the dynamical consequences of small perturbative effects, also far from the origin, see [14].
For 1d integrable PDEs one would expect this to be the typical scenario at least in a neighborhood of zero, see [5, 46]; however, as far as we know, up to now such results are available only for the KdV, the NLS and the Toda system. Theorem 1 provides, again as far as we know, the first existence result of quasi-periodic solutions, in the sense of (1.6), for (1.14).
It would be interesting to apply our KAM approach to the Camassa–Holm equation, which is a well-known integrable PDE with an asymptotically linear dispersion law, but with a different symplectic structure. Even though we have not performed the computations, we expect to be able to prove the equivalent of Theorems 1, 2 also for this equation. We remark that in this case, the finite gap solutions are known to be quasi-periodic tori, see [20].
One could start by comparing them with the solutions predicted by our method and then possibly develop KAM theory close to large finite gap solutions.
1.2 Approximate constants of motion of (1.1)
Even though we do not fully exploit the integrability of (1.14) it is fundamental for us that (the non integrable) (1.1) has at least eight approximate constants of motion (up to an error of order \(O(u^9)\)). It is interesting to notice that, as shown in [29], no other equation with the same dispersion law, and the same symplectic structure, has eight approximate conserved quantities. This means that in (1.1) we cannot consider any quadratic nonlinearity, but we really need the DP structure.
The request of the presence of such approximate conserved quantities it is not only a technical matter. In order to implement a Nash/Moser-KAM algorithm one looks for a family of approximately invariant tori of (1.1) (with a sufficiently good approximation) such that the dynamics on the tori is integrable and non-degenerate, while the dynamics normal to the torus is non-degenerate at the linear level and satisfies the Melnikov conditions. If there are external parameters modulating the linear frequencies, then we can consider as approximate solutions the linear ones. Otherwise the modulation must come from the initial data and, hopefully, this can be achieved by means of Birkhoff normal form (BNF), see for instance [4, 39]. In this case, where the the dispersion law in (1.8) is a rational number and is asymptotically linear, such procedure is very difficult. One has to explicitly compute some potentially dangerous resonant terms in the Hamiltonian and show that they vanish. This is the same type of computations which have been done for water waves, see Craig–Worfolk [27] where the authors verify (by computing them) the vanishing of the coefficients of fourth order resonant interactions, the so called Benjamin–Feir resonances. In our case we have to deal with higher order resonances (up to eight), so this would be computationally extremely heavy. Our approach is to use the approximate constant of motions. This will be explained more in detail in Sect. 1.3. Once we have constructed the approximate invariant tori we have to impose the non-degeneracy and Melnikov conditions. Differently form the KdV case, this will not be possible for any choice of the tangential set, and it is where we will use the condition \(S^+\in \mathcal {V}(\mathtt {r})\), see Definition 1.1.
1.2.1 Linear stability
The linear stability result of Theorem 2 is of course a relevant dynamical information in the study of evolutionary PDEs, but it is also the consequence of a fundamental ingredient of our proof: the reducibility of the linearized equation at any quasi-periodic approximate solution. Reducibility for the Degasperis–Procesi equation linearized at a quasi-periodic function has been obtained in [33], under some appropriate diophantine conditions on the frequencies. Unfortunately, due to the resonances, our case does not fit such hypotheses, and a major point will be to overcome this difficulty. Here we shall use such result (appropriately adapted) inside a nonlinear algorithm to prove the existence of quasi-periodic solutions. This is a classical feature of the literature of KAM theory.
1.3 Some literature
Proving existence and stability for quasi-periodic solutions for PDEs close to an elliptic fixed point is a natural extension of the classical KAM theory for lower dimensional tori [51]. The first results in this direction were for model PDEs on an interval with no derivatives in the nonlinearity and with either Dirichlet, [44, 47, 51, 53] or periodic, [16, 19, 26], boundary conditions. For extension of KAM theory to higher spatial dimension we mention [8, 11, 17, 25, 30, 34, 52]. While KAM methods for constructing quasi-periodic solutions for PDEs on the circle with no derivatives in the nonlinearity are by now well established, generalizing to cases with derivatives is in general not at all trivial, even in the semi-linear cases (where the derivatives in the nonlinearity are of lower order w.r.t. the linear terms). We mention [45] for the KdV, [49] for the derivative NLS, and [6, 7] for the derivative NLW. Recently an innovative strategy was proposed, [3, 4] to deal with quasi-linear and fully nonlinear PDEs on the circle. This approach was first developed for the KdV equation but can be applied to many equations of interest in hydrodynamics, such as NLS, [37, 38] Kirchhoff [50] or directly the water wave equation [2, 15]. While these methods were first thought for PDEs on the circle, of course a very interesting point is the generalization to higher dimensions.
Equation (1.1) is a quasi-linear PDE on the circle and in our study we shall follow the general strategy of [4], extended and adapted to our case. Let us briefly explain the point of view of [4], referring also to [2] for more details.
1.4 The general strategy
We describe the strategy to prove existence and linear stability for small, reducible quasi-periodic solutions of completely resonant quasi-linear PDEs.
- (i):
-
The starting point is a Nash–Moser theorem of hypothetical conjugation following [9]. The strategy is to construct quadratically convergent sequence of families of approximately invariant (isotropic) tori. Such construction is based on tame estimates on the inverse of the operator associated to the Eq. (1.1) linearized at an approximate torus and restricted to the normal direction. This is proved by exploiting the Hamiltonian structure and exhibiting symplectic variables adapted to each approximate invariant torus, which essentially decouple the linearized dynamics. Then the bounds on the inverse are achieved by removing all the “bad” values of the parameters. We mention also [24] for a parallel strategy which does not rely on the Hamiltonian structure.
- (ii):
-
To construct the sequence of item (i) we need a good starting point, i.e. a first family of approximately invariant tori parametrized by real vectors \(\xi \in \mathbb {R}^{\nu }\).
As explained before this is achieved by BNF techniques. In particular, in the quasi-linear context, it is convenient to perform a Weak BNF, i.e. to exhibit a change of variables, close to the identity up to a finite rank operator, such that the following holds. The Hamiltonian H transforms to \(H_\mathrm{Birk}+ R\) where R is a small remainder, and
-
1.
The finite dimensional subspace \(U_S:=\{u_j=0,\;\forall j\notin S\}\) is invariant for \(H_\mathrm{Birk}\);
-
2.
The Hamiltonian restricted to \(U_S\) is integrable and non-degenerate in the sense that the “frequency-to-amplitude” map is invertible.
In order to describe in a simpler way the dynamics in a neighborhood of \(U_S\) it is convenient to define action-angle variables. This allows to distinguish the tangential and normal dynamics to the approximately invariant tori.
We remark that, for semi-linear PDEs, typically one performs a stronger BNF preliminary step, in order to “normalize” also the linearized dynamics normal to the torus, i.e. the terms in the Hamiltonian which are quadratic in the normal directions. In this case the Birkhoff map is close to the identity up to a bounded operator (at most one-smoothing), see for instance [47, 51]. Compared to the latter approach, the weak procedure has the disadvantage that the normal form depends on the angles; on the other hand we do not have to address well-posedness issues, since these changes of coordinates are time-one flow maps of an ODE. Note that the recent papers, [10, 35, 36] directly study the full Birkhoff normal form for quasi-linear PDES.
- (iii):
-
The third key point is to study the invertibility of the linearized operator restricted to the normal directions. Thanks to the very “mild” conjugation procedure of item (ii) (with a map = identity+finite rank) it turns out that such linear operator is pseudo differential (with non constant coefficients) up to a finite rank remainder. This is the most important reason for adopting the weak procedure described in (ii).
The invertibility of the linearized operator, with appropriate tame estimates, is based on a reducibility argument which is divided into two parts:
-
(a)
A reduction in decreasing order procedure which conjugates the linearized operator to a pseudo differential one with constant coefficients up to a remainder which is a bounded/regularizing term i.e. maps \(H^{s}(\mathbb {T}, \mathbb {R}) \) to \(H^{s+\rho }(\mathbb {T}, \mathbb {R})\), \(\rho \ge 0\). The choice of \(\rho \) depends of course on the problem one is studying;
-
(b)
A quadratic KAM scheme (for bounded operators) which completely diagonalizes the bounded/smoothing remainder of the previous step.
We want to point out the following:
-
The step (a) strongly relies on the pseudo differential structure of the operator;
-
The normal form contains angle-dependent terms and some of them turn out to be not perturbative for the KAM scheme (b). The conjugation to constant coefficients of such terms relies on purely algebraic arguments. We refer to this procedure as linear Birkhoff normal form;
-
As a consequence of having applied the weak and the linear Birkhoff procedure, the normal form around the approximately invariant tori has constant coefficients also in the normal directions.
In order to perform the diagonalization procedure of step (b) one needs the second Melnikov conditions, which essentially amount to requiring that the operator has simple eigenvalues with a lower bound on the differences. Once one has diagonalized the operator, the bounds on the inverse follow trivially from lower bounds on the eigenvalues, i.e. first Melnikov conditions.
- (iv):
-
In the scheme above, at each step we have removed some bad values of the parameters \(\xi \) where the Melnikov conditions do not hold. Hence the last (but not least) step is to prove that at the end of the procedure one has still a positive measure set of parameters. Note that often it is more convenient to express such conditions in terms of the frequency of the quasi-periodic solution. This can be done thanks to the invertibility of the frequency-to-amplitude map.
1.5 Main novelties and scheme of the proof
We describe the structure of the paper following Sect. 1.2, and with particular attention to the main novelties.
In Sect. 2 we introduce the Hamiltonian formalism for the DP equation and the functional spaces on which we shall work.
In Sect. 3 we perform the weak Birkhoff normal form explained in item (ii) of the previous section. The result is stated in Proposition 3.2. In order to reach a sufficiently good first approximate solution we need to perform 6-BNF steps. As is well-known, at the n-th step of this procedure one has to take into account the denominators (recall (1.8))
We say that a \((n+2)\)-uple of integer indices \((j_1, \ldots , j_{n+2})\) is a resonance, and hence may appear in \(H_\mathrm{Birk}\), if (1.15)\(=0\) and the momentum condition holds, namely \(\sum _{i=1}^{n+2} j_i=0\). We say that a resonance is trivial if it has the form \((i, -i, j, -j, \ldots )\) so that the corresponding monomial is integrable.
As mentioned before a major difficulty comes from the fact that the DP equation has many non-trivial resonances (already at order four) and in principle there is no reason why the Birkhoff Hamiltonian restricted to \(U_S\) should be integrable. By the fact that the Hamiltonian density f is of order \(O(u^9)\) the perturbation does not affect the leading terms of the Birkhoff Hamiltonian and we can exploit the integrability of the DP equation. Indeed the same Birkhoff transformation should normalize simultaneously all the commuting Hamiltonians. This means that a resonant monomial contributes to \(H_\mathrm{Birk}\) if and only if it is resonant for all the constants of motion. This was proved in detail in [32] at the level of formal power series. Here we adapt this result to the Eq. (1.1) which is only approximately integrable (close to the origin) and we reformulate it in a way better suited to the weak Birkhoff normal form context, see Proposition 3.6.
Once we have shown that the \(H_\mathrm{Birk}\)-dynamics restricted to \(U_S\) is integrable, in Sect. 4, we prove that it is non-degenerate, i.e. that the frequency to amplitude map is a diffeomorphism. We have a very explicit description of this map and hence this step amounts to proving that the matrix \(\mathbb {A}\) in (4.6) (which depends only on \(S^+\)) has determinant bounded away from zero (the so-called twist condition), see Lemma 4.1. A big difference with [4] is that, in our case, the determinant of \(\mathbb {A}\) is a rational function of several variables \(\overline{\jmath }_i\) that could accumulate to zero as \(|\overline{\jmath }_i |\rightarrow \infty \). By imposing the wave packet condition we restrict the study of its asymptotic behaviour to regions in which it behaves like a one variable function. Then we use continuity arguments to guarantee the invertibility of \(\mathbb {A}\) for every choice of \(S^+\in \mathcal {V}({\mathtt {r}})\) (see Definition 1.1) for \(\mathtt {r}\) small enough. Outside \(\mathcal {V}({\mathtt {r}})\) the proof of lower bounds for \(\det \mathbb {A}\) should rely on purely algebraic arguments and not on perturbative ones.
In Sect. 5 we introduce the Nash Moser hypothetical conjugation theorem (see Theorem 5.4) and in Sect. 6 we explain how to prove the invertibility of the linearized operator at an approximate solution by only studying it in the normal direction. Since there is no difference with [4] we only give a synopsis.
In Sects. 7 and 7.3 we prove the Theorems 7.1 and 7.13 which provide the reducibility of the linearized operator following item (iii) of Sect. 1.2. As we already mentioned, in [33] we provide a reducibility result for the DP equation (1.1) linearized at sufficiently small quasi-periodic functions under appropriate diophantine conditions on the frequencies . Unfortunately in our case the diophantine constant \(\gamma \) is related to the size of the approximate solutions (see (5.3)) and then the smallness and diophantine conditions above cannot be met.
In [4] this issue appears only in the step (b) of the strategy, where it is solved by the linear Birkhoff normal form method. A first difficulty in our case is that this problem appears also in step (a). So that we first need to perform some preliminary steps (see Sect. 7.1), more precisely we need changes of coordinates, preserving the pseudo differential structure, that conjugate the leading order of the linearized operator to a diagonal one plus a correction, which is unbounded but perturbative in the sense of [33]. In such steps the provided changes of coordinates are similar in structure to those of step (a) but they are proved to be well-defined not by using perturbative arguments, but by algebraic computations involving the Birkhoff resonances (see Lemma A.1). These difficulties appear also for the quasi-linear generalized KdV [39], but here we have several further problems due to the complexity of the symplectic structure of the DP equation. The first step, removing terms of order \(\varepsilon \), is straightforward. Already at the second step we encounter the difficulties arising form the presence of non-trivial resonances of order 4, and a priori there is no reason why the normal form should be integrable. Here it does not appear simple to apply the strategy of the weak BNF, using the constants of motion. On the other hand, computing the normal form explicitly by hand, as done in [39], is unmanageable. To bypass this problem we take a different point of view, based on an a posteriori identification argument of normal forms. More precisely in Theorem 7.9 we prove that the normal form obtained after the weak BNF, the preliminary steps and the linear BNF coincides with the one that we would obtain by performing the full formal BNF and then projecting on the quadratic terms in the normal variables. This result strongly relies on the fact that all the resonances contributing to the formal normal form are trivial. A similar identification argument has been used, for instance, in [12, 13].
A further point is that, due to the rational dispersion law \(\lambda (j)\), it is possible that a denominator in the linear BNF is not zero but is still uncontrollably small. In the third step, in order to deal with this problem we need to take into account in the unperturbed Hamiltonian also the integrable terms of order \(\varepsilon ^2\) coming from the previous steps of linear BNF. For this reason it is important to know the exact expression of the main order of the correction at the eigenvalues given by the perturbation, see for instance (5.5). This is also needed in the KAM scheme (b), in order to impose the second Melnikov conditions. Computing these corrections by hand would be a very difficult task, but this comes for free from Theorem 7.9.
In the first part of Sect. 8 we show the convergence of the Nash–Moser algorithm (see Theorem 8.1), which requires the ratio between the size of \(R=H-H_\mathrm{Birk}\) and \(\gamma ^{7/2}\) to be small (see the smallness condition (8.5)); in the second part we prove that the set of “bad” parameters, i.e. the frequencies which do not meet the first and second Melnikov conditions, has small measure (see (8.25), note that such sets are indexed by three parameters \(\ell ,j,k\)).
In Lemma 8.4 we provide the measure of the single bad set. Here we use the algebraic arguments provided by Lemma A.1, which guarantees the non-degeneracy of the leading terms of the small divisors. In Sect. 8.1.2 we deal with the summability of the bad sets in j, k for fixed \(\ell \).
The key difficulty is that the spectral gap \(\lambda (j)-\lambda (k)\) is asymptotically constant, hence there is a bad separation property of the eigenvalues. The same occurs for the wave equation [6, 7]. Due to the asymptotically constant spectral gap, these sets are infinitely many. Then the key ingredient is to show that for j, k sufficiently large the second Melnikov conditions are implied by the first ones. This is possible provided that we consider two different diophantine constants. More precisely we have to impose second order Melnikov conditions with \(\gamma ^{3/2}\) (see (8.6)), which is clearly much smaller than \(\gamma \). This is why we have to perform many steps of Birkhoff normal form in order to obtain a very good first approximate solution.
We point out that, differently from [2], our Melnikov conditions do not imply a loss of regularity in space. In [2] this loss is acceptable, since in the regularization step ((a) p. 5) the diagonalization is performed up to a very smoothing remainder. In this procedure it is fundamental that the diophantine constant \(\gamma \) is independent of the size of the solution. Of course in our case this is not true and thus in the regularization step we end up with a remainder of order \(-1\), and then in the measure estimates we put some extra efforts to prove second Melnikov conditions without loss of regularity.
2 Functional Setting
Hamiltonian formalism of the Degasperis–Procesi equation For any u, v in the space
we define the non-degenerate symplectic form
where J is defined in (1.4) and \((\cdot ,\cdot )_{L^{2}}\) is the \(L^{2}(\mathbb {T}, \mathbb {R})\) scalar product. To any \(C^{1}\) function \(H : H_0^{1}(\mathbb {T})\rightarrow \mathbb {R}\) we associate a vector field \(X_{H}\) by requiring
The Hamiltonian vector field \(X_{H}\) is uniquely determined since the symplectic form \(\Omega \) in (2.1) is non-degenerate, in particular \(X_{H}(u)=J\nabla H(u)\). The Poisson bracket between two \(C^{1}\) functions \(F,G : H_0^{1}(\mathbb {T})\rightarrow \mathbb {R}\) is
In this way
Finally, given a Hamiltonian H we define its adjoint action as the operator
Consider now two Hamiltonians H, G and let \(\Phi _{G}\) be the time-1 flow map of the vector field \(X_{G}\). Then we have (formally)
where \(\mathrm {ad}_{G}^{0}:=\mathrm {I}\) is the identity map.
Functional space We consider functions \(u(\varphi , x)\) defined on \(\mathbb {T}^{\nu }\times \mathbb {T}\). Passing to the Fourier representation
We define the scale of Sobolev spaces
where \(\langle \ell , j \rangle :=\max \{ 1, |\ell |, |j |\}\), \(|\ell |:=\sum _{i=1}^{\nu } |\ell _i |\). We shall work on the phase space \(H^s\cap H_0^1(\mathbb {T}, \mathbb {R})\). We denote by \(\mathfrak {B}_{r}(0, X)\) the ball of radius r centered at the origin of a Banach space X.
Lipschitz norm Fix \(\nu \in \mathbb {N}^{*}:=\mathbb {N}{\setminus }\{0\}\) and let \(\mathcal {O}\) be a compact subset of \(\mathbb {R}^{\nu }\). For a function \(u:\mathcal {O}\rightarrow E\), where \((E, \Vert \cdot \Vert _E)\) is a Banach space, we define the sup-norm and the lip-seminorm of u as
If E is finite dimensional, for any \(\gamma >0\) we introduce the weighted Lipschitz norm
If E is a scale of Banach spaces, say \(E= H^s\), for \(\gamma >0\) we introduce the weighted Lipschitz norm
where we denoted by [r] the integer part of \(r\in \mathbb {R}\).
Linear operators Let \(A:\mathbb {T}^{\nu }\rightarrow \mathcal {L}(L^2(\mathbb {T},\mathbb {R}))\), \(\varphi \mapsto A(\varphi )\), be a \(\varphi \)-dependent family of linear operators acting on \(L^2(\mathbb {T},\mathbb {R})\). We consider A as an operator acting on \(H^{s}(\mathbb {T}^{\nu +1},\mathbb {R})\) by setting
This action is represented in Fourier coordinates as
Conversely, given a Töpliz in time operator A, namely such that its matrix coefficients (with respect to the Fourier basis in \({\varphi },x\)) satisfy
we can associate it a time dependent family of operators acting on \(H^s(\mathbb {T})\) by setting
For \(m=1, \ldots , \nu \) we define the operators \(\partial _{\varphi _m} A(\varphi )\) as
We say that A is a real operator if it maps real valued functions in real valued functions. For the matrix coefficients this means that
Hamiltonian linear operators In the paper we shall deal with operators which are Hamiltonian according to the following Definition.
Definition 2.1
We say that a linear map is symplectic if it preserves the 2-form \(\Omega \) in (2.1); similarly we say that a linear operator M is Hamiltonian if Mu is a linear Hamiltonian vector field w.r.t. \(\Omega \) in (2.1). This means that each \(J^{-1} M\) is symmetric respect to the real \(L^2\)-scalar product. Similarly, we call a family of maps \({\varphi }\rightarrow A({\varphi })\) symplectic if, for each fixed \({\varphi }\), \(A({\varphi })\) is symplectic, same for the Hamiltonians. We shall say that an operator of the form \(\omega \cdot \partial _{{\varphi }}+M({\varphi })\) is Hamiltonian if \(M({\varphi })\) is Hamiltonian.
Notation We use the notation \(A\lesssim B\) to denote \(A\le C B\) where C is a positive constant possibly depending on fixed parameters given by the problem. We use the notation \(A\lesssim _y B\) to denote \(A\le C(y) B\) if we wish to highlight the dependence on the variable y of the constant \(C(y)>0\).
Linear Tame operators Here we introduce rigorously the spaces and the classes of operators on which we work.
Definition 2.2
(\(\sigma \)-Tame operators). Given \(\sigma \ge 0\) we say that a linear operator A is \(\sigma \)-tame w.r.t. a non-decreasing sequence \(\{\mathfrak M_A(\sigma ,s)\}_{s=s_0}^\mathcal {S}\) (with possibly \(\mathcal {S}=+\infty \)) if
for any \(s_0\le s\le \mathcal {S}\). We call \(\mathfrak {M}_A(\sigma ,s)\) a tame constant for the operator A. When the index \(\sigma \) is not relevant we write \(\mathfrak {M}_{A}(\sigma ,s)=\mathfrak {M}_{A}(s)\).
Definition 2.3
(Lip-\(\sigma \)-Tame operators). Let \(\sigma \ge 0\) and \(A=A(\omega )\) be a linear operator defined for \(\omega \in \mathcal {O}\subset \mathbb {R}^{\nu }\). Let us define
Then A is Lip-\(\sigma \)-tame w.r.t. a non-decreasing sequence \(\{\mathfrak M_A(\sigma ,s)\}_{s=s_0}^\mathcal {S}\) if the following estimate holds
We call \(\mathfrak {M}^{\gamma }_A(\sigma ,s)\) a Lip-tame constant of the operator A. When the index \(\sigma \) is not relevant we write \(\mathfrak {M}^{\gamma }_{A}(\sigma ,s)=\mathfrak {M}^{\gamma }_{A}(s)\).
Modulo-tame operators and majorant norms The modulo-tame operators are introduced in Sect. 2.2 of [15]. Note that we are interested only in the Lipschitz variation of the operators respect to the parameters of the problem, whereas in [15] the authors need to control also higher order derivatives.
Definition 2.4
Let \(u\in H^s\), \(s\ge 0\), we define the majorant function \( \underline{u} (\varphi , x):=\sum _{\ell \in \mathbb {Z}^{\nu }, j\in \mathbb {Z}} |u_{\ell j} |e^{\mathrm {i}(\ell \cdot \varphi +j x)}. \) Note that \(\Vert u \Vert _s=\Vert \underline{u} \Vert _s\).
Definition 2.5
(Majorant operator). Let \(A\in \mathcal {L}(H^s)\) and recall its matrix representation (2.11). We define the majorant matrix \(\underline{A}\) as the matrix with entries
We consider the majorant operatorial norms
We have a partial ordering relation in the set of the infinite dimensional matrices, i.e. if
Since we are working on a majorant norm we have the continuity of the projections on monomial subspace, in particular we define the following functor acting on the matrices
Finally we define for \(\mathtt {b}_0\in \mathbb {N}\)
In the sequel let \(1>{\gamma }>{\gamma }^{3/2}>0\) be fixed constants.
Definition 2.6
(Lip-\(\sigma \)-modulo tame). Let \(\sigma \ge 0\). A linear operator \( A := A(\omega ) \), \(\omega \in \mathcal {O}\subset \mathbb {R}^{\nu }\), is Lip-\(\sigma \)-modulo-tame w.r.t. a non-decreasing sequence \(\{ {\mathfrak M}_{A}^{\sharp , {\gamma }^{3/2}} (\sigma , s) \}_{s=s_0}^{\mathcal {S}}\) if the majorant operators \( \underline{ A }, \underline{\Delta _{\omega ,\omega '} A}\) are Lip-\(\sigma \)-tame w.r.t. these constants, i.e. they satisfy the following weighted tame estimates: for \(\sigma \ge 0\), for all \( s \ge s_0 \) and for any \(u \in H^{s} \),
The constant \( {\mathfrak M}_A^{{\sharp , {\gamma }^{3/2}}} (\sigma ,s) \) is called the modulo-tame constant of the operator A. When the index \(\sigma \) is not relevant we write \( {\mathfrak M}_{A}^{{\sharp , {\gamma }^{3/2}}} (\sigma , s) = {\mathfrak M}_{A}^{{\sharp , {\gamma }^{3/2}}} (s) \).
Definition 2.7
We say that A is Lip-\(-1\)-modulo tame if \(\langle D_x\rangle ^{1/2}{A} \langle D_x\rangle ^{1/2}\) is Lip-0-modulo tame. We denote
In the following we shall systematically use \(-1\) modulo-tame operators. We refer the reader to the “Appendix” of [33] for the properties of Tame and Modulo-tame operators.
Pseudo differential operators Following [15] we give the following definitions.
Definition 2.8
Let \(m\in \mathbb {R}\). A linear operator A is called pseudo differential of order \(\le m\) if its action on any \(H^s(\mathbb {T}, \mathbb {R})\) with \(s\ge m\) is given by
where a(x, j), called the symbol of A, is the restriction to \(\mathbb {T}\times \mathbb {Z}\) of a complex valued function a(x, y) which is \(C^{\infty }\) smooth on \(\mathbb {T}\times \mathbb {R}\), \(2\pi \)-periodic in x and satisfies
We denote by \(A[\cdot ]=\mathrm{Op}(a)[\cdot ]\) the pseudo operator with symbol \(a:=a(x, j)\). We call \(OPS^m\) the class of the pseudo differential operator of order less or equal to m and \(OPS^{-\infty }:=\bigcap _m OPS^m\). We define the class \(S^m\) as the set of symbols which satisfies (2.22).
We will consider mainly operators acting on \(H^s(\mathbb {T},\mathbb {R})\) with a quasi-periodic time dependence. In the case of pseudo differential operators this correspondsFootnote 1 to considering symbols \(a(\varphi , x, y)\) with \(\varphi \in \mathbb {T}^{\nu }\). Clearly these operators can be thought as acting on functions \(u({\varphi },x)=\sum _{j\in \mathbb {Z}}u_{j}({\varphi })e^{\mathrm{i}jx}\) in \(H^s(\mathbb {T}^{\nu +1},\mathbb {R})\) in the following sense:
The symbol \(a(\varphi , x, y)\) is \(C^{\infty }\) smooth also in the variable \(\varphi \). We still denote \(A:=A(\varphi )=\mathrm{Op}(a(\varphi , \cdot ))=\mathrm{Op}(a)\).
Definition 2.9
Let \(a:=a({\varphi },x, y)\in S^{m}\) and set \(A:=\mathrm{Op}(a)\in OPS^{m}\),
We will use also the notation \(|a |_{m, s, \alpha }:=|A|_{m,s,\alpha }\).
Note that the norm \(|\cdot |_{m,s,\alpha }\) is non-decreasing in s and \(\alpha \). Moreover given a symbol \(a({\varphi },x)\) independent of y, the norm of the associated multiplication operator \(\mathrm{Op}(a)\) is just the \(H^{s}\) norm of the function a. If on the contrary the symbol a(y) depends only on y, then the norm of the corresponding Fourier multipliers \(\mathrm{Op}(a(y))\) is just controlled by a constant.
As in formula (2.10), if \(A=\mathrm{Op}(a(\omega ,{\varphi },x, y))\in OPS^{m}\) is a family of pseudo differential operators with symbols \(a(\omega ,{\varphi },x, y)\) belonging to \(S^{m}\) and depending in a Lipschitz way on some parameter \(\omega \in \mathcal {O}\subset \mathbb {R}^{\nu }\), we set
For the properties of compositions, adjointness and quantitative estimates of the actions on the Sobolev spaces \(H^{s}\) of pseudo differential operators we refer to “Appendix B” of [33].
3 Weak Birkhoff Normal Form
The aim of this section is to construct a \(\xi \)-parameter family of approximately invariant, finite dimensional tori supporting quasi-periodic motions with frequency \(\omega (\xi )\). We will impose the map \(\xi \mapsto \omega (\xi )\) to be a diffeomorphism and we will consider such approximate solutions as the starting point for the Nash–Moser algorithm.
In order to state the main result of this section, we need some preliminary definitions.
We write the DP Hamiltonian in (1.4) in the following way:
Recall S in (1.9) and define \(S^{c}:=\mathbb {Z}{\setminus }\big (S\cup \{0\}\big )\). We decompose the phase space as
and we denote by \(\Pi _S, \Pi _S^{\perp }\) the corresponding orthogonal projectors. The subspaces \(H_S\) and \(H_S^{\perp }\) are symplectic orthogonal respect to the 2-form \(\Omega \) (see (2.1)). We write
For a finite dimensional space
let \(\Pi _E\) denote the corresponding \(L^2\)-projector on E. The notation \(R(v^{k-q} z^q)\) indicates a homogeneous polynomial of degree k in (v, z) of the form
We denote with \(H^{(n, \ge k)}, H^{(n, k)}, H^{(n, \le k)}\) the terms of type \(R(v^{n-s}\, z^s)\), where, respectively, \(s\ge k, s=k, s\le k\), that appear in the homogeneous polynomial \(H_n\) of degree n in the variables (v, z). Given an n-uple \(\{j_1,\ldots , j_n\}\subset \mathbb Z{\setminus }\{0\}\) and a set \(B\subset \mathbb Z{\setminus }\{0\}\) we define
Now we start the “weak” Birkhoff normal form procedure, i.e. we look for a change of coordinates which normalizes the terms in (3.1) independent and linear in the normal variable z.
As it is well known, one of the main problem of the Birkhoff normal form procedures is to deal with the resonances given by the equations (1.15) = 0 which arise from considering the kernel of the adjoint action \(\mathrm {ad}_{H^{(2)}}\) (see (2.4)). It turns out that when \(n\ge 2\) there are many non-trivial solutions of (1.15) = 0. A way to deal with this problem is to exploit the integrability of the DP equation.
In [32] the authors construct an infinite number of conserved quantities \(K_n\) for the Eq. (1.1) with \(f=0\) starting from the ones given in [28]. By an explicit characterization of the quadratic part of each \(K_n\), they deduce that, at a purely formal level, the Birkhoff normal form of the Degasperis–Procesi equation is action preserving (or integrable). Here we rename these constants of motion in the following way, writing only the quadratic parts (which are fundamental for the study of the Birkhoff resonances at \(u=0\))
where we denoted by
We remark that \(K_1\) is the momentum Hamiltonian arising from the translation invariance of the equation.
Definition 3.1
Given a quadratic diagonal Hamiltonian \(Q(u)=\sum _j \mathfrak {l}(j) |u_j |^2\), we define \(\Pi _{\mathrm{Ker}(Q)}\) as the projection on the kernel of the adjoint action (recall (2.2) and \(J=\mathrm {diag}_{j\in \mathbb {Z}}(\lambda (j))\))
We define the projector on the range of the adjoint action as \(\Pi _{\mathrm{Rg}(Q)}:=\mathrm {I}-\Pi _{{\text{ Ker }(Q)}}\).
We say that K, as in (3.6), “preserves” momentum if and only if
The main result of this section is the following.
Proposition 3.2
There exist \(r>0\), depending on S (see (1.9)), and an analytic symplectic change of coordinates
where E is a finite dimensional space as in (3.3), such that the Hamiltonian H in (3.1) transforms into
where
and \(\mathcal {H}^{(k, 0)}=\Pi _{\mathrm{Ker}(H^{(2)})} \mathcal {H}^{(k, 0)}\) with \(k=4, 6, 8\) depend only on \(|u_j |^2\). The same change of variables \(\Phi _B\) puts all the Hamiltonians in (3.4) in weak Birkhoff normal form up to order eight as in (3.8). In particular we have \(K_1\circ \Phi _B=K_1\).
In order to prove the Proposition 3.2 above we need some preliminary results proved in detail in [32].
Definition 3.3
(M-resonances). Fix \(M\in \mathbb {N}\), \(M\ge 3\). We recall that the quadratic part of H and \(K_r\), \(2\le r\le M\), in (3.4) are
We say that an n-uple \(\{j_1,\ldots ,j_{n}\}\subset \mathbb {Z}{\setminus }\{0\}\), with \(n\le M\), is a M-resonance of order n for the DP hierarchy if
Proposition 3.4
Fix \(M\in \mathbb {N}\), \(M\ge 3\). All the M-resonances of the DP equations (3.4) are trivial, namely there are no resonances of odd order and the even ones are, up to permutations, of the form
Proof
Since this Proposition is proved in [32] with different notations, for completeness we restate here a concise proof by induction on M. For \(M=3\) the thesis follows trivially: indeed direct computations show that
up to permutations, and this solution is incompatible with \(j_i\in \mathbb {Z}{\setminus }\{0\}\).
Let us now suppose that the thesis is true up to \(M-1\ge 3\) and prove it for M. We start by noticing that if \(n<M\) then (3.10) with \(r\le M-1\) can hold only if \(\{j_1,\ldots ,j_{n}\}\) is a \(M-1\) resonance of order n. The inductive hypothesis then says that \(\{j_1,\ldots ,j_{n}\}\) is trivial. Similarly if \(\{j_1,\ldots ,j_{n}\}\) contains a trivial resonance, i.e. if \(j_{i_1}+ j_{i_2}=0\) for \(1\le i_1,i_2\le n\), then \(j_{i_1}, j_{i_2}\) do not appear in (3.10) and hence \(\{j_1,\ldots ,j_{n}\} \) is an M-resonance of order n if and only if
Without loss of generality we assume that \(n=M\) and that \(j_{i_1}+ j_{i_2}\ne 0\) for any \(1\le i_1,i_2\le M\).
Up to a permutation we can assume that for some \(M\ge k\ge 1\) and \(\alpha _1,\ldots ,\alpha _k\ge 1\) one has
Consequently rewrite the third equation in (3.10) as \( \sum _{i=1}^{k} \alpha _i (1+\widehat{\jmath }_i{}^2)^2\, \widehat{\jmath }_i{}^{2 (r-2)}{\lambda }(\widehat{\jmath }_i)=0\), \(\forall r=2, \ldots , M+1\). Then we can extract k equations from these ones and write them in the form
The determinant of the Vandermonde matrix in (3.12) is \( \prod _{i\ne h} (\widehat{\jmath }_i{}^2-\widehat{\jmath }_h{}^2)\ne 0, \) since, by hypothesis, \(\widehat{\jmath }_i\ne \pm \widehat{\jmath }_h\). Then the only possible solution corresponds to \(\widehat{\jmath }_i=0\) for all i, which is not compatible with \(\widehat{\jmath }_i\in \mathbb {Z}{\setminus }\{0\}\). \(\quad \square \)
Remark 3.5
Notice that if \(j_1, \ldots , j_N\in \mathbb {Z}{\setminus }\{ 0\}\), \(j_1+\cdots + j_N=0\) and \(\#(\{ j_1, \ldots , j_N \}, S^c)\le 1\), then \(\max _{i=1,\ldots , N} |j_i |\le (N-1) \overline{\jmath }_1\). Thus, the vector field \(X_{F^{(N,\le 1)}}\), generated by the finitely supported Hamiltonian
is finite rank, and, in particular, it vanishes outside the finite dimensional subspace \(E:=E_{(N-1) \overline{\jmath }_1}\) (see (3.3) ) and it has the form
Therefore its flow \(\Phi ^{(N)}\) is analytic and invertible on the phase space \(H_0^1(\mathbb {T})\), provided that \(| \Pi _E u|\) is appropriately small.
In order to prove Proposition 3.2 we need the following result.
Proposition 3.6
Fix \(M\in \mathbb {N}\), \(M\ge 2\) and consider H in (1.4) and \(K_m\), \(m=1, \ldots , M\), in (3.4). Then, for any \(N\le M-2\), there exists \(r>0\) and an analytic symplectic change of coordinates \(\Phi _N^{\pm 1} :\mathcal B_r(0,H_0^1(\mathbb {T}))\rightarrow H_0^1(\mathbb {T})\) of the form
where E is a finite dimensional space as in (3.3), such that
where \(Z_N^{(\le N+2, 0)}, W^{(\le N+2, 0)}_{m, N}\in \bigcap _{m=1}^M \text{ Ker }(K_m^{(2)}) \cap \text{ Ker }(H^{(2)})\).
Proof
The terms of degree at most 2 in the variable z are not affected by the procedure that we are going to describe.
We argue the result by induction on the number of steps N. For \(N=0\) it is trivial since \(\Phi _0\) is the identity map.
Suppose that we have performed N steps. By the fact that \(\{ H, K_m \}=0\) then \(\{ H, K_m\}\circ \Phi _N^{-1}=0\). For the latter, we are interested in the corresponding equations for the terms of homogeneity at most \(N+3\) and degree in the variable z less or equal than one. So we consider the projection \(\Pi ^{(\le N+3, \le 1)}\Big (\{ H, K_m\}\circ \Phi _N^{-1}\Big )=0\) and we get, for any \(m=1, \ldots , M\), the following system of equations \(\{ H^{(2)}, K_m^{(2)}\}=0\,\) and
By the inductive hypothesis \(W_{N, m}^{(N+2, 0)}, Z_N^{(N+2, 0)}\in \bigcap _{m=1}^M \text{ Ker }(K_m^{(2)}) \cap \text{ Ker }(H^{(2)})\), hence
and
since \(\{ H^{(2)}, Q_{m,N}^{(N+3, \le 1)}\}\in \text{ Rg }(H^{(2)})\) and \(\{ R_N^{(N+3, \le 1)}, K_m^{(2)} \}\in \text{ Rg }(K_m^{(2)})\).
We note the following fact, which derives from the Jacobi identity: if \(f\in \text{ Ker }(H^{(2)})\) then \(\{ f, K_m^{(2)}\}\in \text{ Ker }(H^{(2)})\).
Then we have that \(\{ \Pi _{\mathrm{Ker}(H^{(2)})} R_N^{(N+3, \le 1)}, K_m^{(2)} \} \in \text{ Ker }(H^{(2)})\) and by (3.16)
Thus \(\{ \Pi _{\mathrm{Ker}(H^{(2)})} R_N^{(N+3, \le 1)}, K_m^{(2)} \}=0\) and
By symmetry \(\Pi _{\mathrm{Ker}(K_m^{(2)})} Q_{m, N}^{(N+3, \le 1)} =\Pi _{\mathrm{Ker}(H^{(2)})} \Pi _{\mathrm{Ker}(K_m^{(2)})} Q_{m, N}^{(N+3, \le 1)}\). Hence
In order to obtain the Birkhoff normal form at order \(N+3\) we consider a Birkhoff transformation \(\Phi _{F^{(N+3,\le 1)}}\) with generator \(F^{(N+3),\le 1}\) of the form (3.13) ( with \(N\rightsquigarrow N+3\)) and we define \(\Phi _{N+1}:= \Phi _{F^{(N+3,\le 1)}} \circ \Phi _N\). By Remark 3.5 the flow \(\Phi _{F^{(N+3,\le 1)}}\) is well defined in an appropriately small ball and it has the form Identity plus a finite rank operator. Note that, since \(F^{(N+3,\le 1)}\) is Fourier supported on \((j_1, \ldots , j_{N+3})\) such that \(j_1+\cdots +j_{N+3}=0\), the Hamiltonian \(K_1\) commutes with \(F^{(N+3,\le 1)}\) and, by the inductive hypothesis, \(K_1\circ \Phi _{N+1}^{-1}=K_1\). The function \(F^{(N+3,\le 1)}\) is chosen in order to solve the homological equation
We now show that \(F^{(N+3,\le 1)}\) solves also the homological equation for the commuting Hamiltonians \(K_m\circ \Phi _N^{-1}\), \(m=1, \ldots , M\). Indeed, by the fact that \(\mathrm {ad}_{H^{(2)}}^{-1}\) commutes with \(\mathrm {ad}_{K_m^{(2)}}\) on the intersection \(\text{ Rg }(H^{(2)})\cap \text{ Rg }(K_m^{(2)})\), we have
By (3.17) we have that the resonant term \(Z^{(N+3, \le 1)}_{N+1}:=\Pi _{\mathrm{Ker}(H^{(2)})} R_N^{(N+3, \le 1)}\) belongs to the intersection of the kernels and by Proposition 3.4 these terms are supported only on n-ples of indices of the form \((i, -i, j, -j, k, -k, \ldots )\). By the symmetry of the tangential set S this is possible for a set of indices with at most one outside S if and only if all the indices belong to S. Hence \(Z^{(N+3, 1)}_{N+1}=0\) and we define \(Z_{N+1}^{(\le N+3, 0)}:=Z_{N+1}^{(N+3, 0)}+Z_N^{(\le N+2, 0)}\). We do not compute explicitly the radius r of the ball in which we can perform the Birkhoff change of variables, however one can easily check that \(r\rightarrow 0\) as \(N\rightarrow \infty \) or as \(\mathtt {r}\rightarrow 0\) in Definition 1.9. \(\quad \square \)
Proof of Proposition 3.2
We apply Proposition 3.6 with \(N=6\) and \(M=8\) and we obtain (3.7), (3.8) by setting \(\Phi _B:=\Phi _N^{-1}\). To prove (3.9) we have to show explicitly the computations of the first step of Birkhoff normal form.
First we remove the cubic terms independent of z and linear in z from the Hamiltonian
We consider \(\Phi _1:=(\Phi ^t_{F^{(3,\le 1)}})_{|_{t=1}}\) as the time-1 flow map generated by the Hamiltonian vector field \(X_{F^{(3,\le 1)}}\), with an auxiliary Hamiltonian \(F^{(3,\le 1)}\) of the form (3.13) with \(N=3\). The transformed Hamiltonian is \(H_1:=H\circ \Phi _1^{-1}=H^{(2)}+H_1^{(3)}+H_1^{(4)}+H_1^{(\ge 5)}\) with
and where \(H_1^{(\ge 5)}\) collects all the terms of order at least five in (v, z). We choose \(F^{(3,\le 1)}\) such that the following homological equation holds
Recalling (2.2) and (3.18), the solution of the Eq. (3.20) is given by \(F^{(3,\le 1)}\) as in (3.13) with \(N=3\) with coefficients defined as
The Hamiltonian \(F^{(3,\le 1)}\) is well defined since, by Proposition 3.4, there are no non-trivial 3-resonances of order 3. Since \(\Pi _{\mathrm{Rg}(H^{(2)})} H^{(3, \le 1)}=H^{(3, \le 1)}\) we get (see (3.19), (3.20))
In the second step we normalize the terms of total degree 4 and \(\le 1\) in the variable z. The term \(\Pi _{\mathrm{Ker}(H^{(2)})}H_1^{(4,\le 1)}\) is Fourier supported on the set of 4-resonances of order 4, which are trivial by Proposition 3.4. By Proposition 3.6\(\Pi _{\mathrm{Ker}(H^{(2)})} H_1^{(4, 1)}=0\). Thus we have to compute only \(\Pi _{\mathrm{Ker}(H^{(2)})} H_1^{(4, 0)}\). We have
The remaining steps of this procedure do not affect the terms with degree of homogeneity less or equal than 4. Hence by (3.23), the fact that \({\lambda }(-j)=-{\lambda }(j)\) (see (1.8)) and the symmetry of S we obtain (3.9). \(\quad \square \)
4 Action-Angle Variables
On the submanifold \(\{ z=0\}\) we put the following action-angle variables
Note that this change of coordinates is real if and only if \(I_{-j}=I_j\) and \(\theta _{-j}=-\theta _j\). The symplectic form in (2.1) restricted to the subspace \(H_S\) transforms into the 2-form \( \sum _{j\in S^+}\frac{1}{{\lambda }(j)} d\theta _j\wedge \, d I_j{.} \) We have that the Hamiltonian \(\mathcal {H}^{(\le 8)}(\theta , I, 0)=\sum _{j\in S^+} I_j +\mathcal {H}^{(4, 0)}(I)+\mathcal {H}^{(6, 0)}(I)+\mathcal {H}^{(8, 0)}(I)\) depends only by the actions I and its equations of motion read as
where, by (3.9),
In order to highlight the fact that we are working close to zero, we introduce a small parameter \(\varepsilon >0\) and we rescale \(I\mapsto \varepsilon ^2 I\), so that the frequency–amplitude map can be written as
where \(\overline{\omega }\) is the vector of the linear frequencies (see (1.10)),
The submanifold \(\{z=0\}\) is foliated by tori, parameterized by the actions, supporting small amplitude quasi-periodic solutions for the truncated system with Hamiltonian \(\mathcal {H}^{(\le 8)}\). We shall select some of them as starting point of the Nash–Moser scheme, by fixing \(I=\xi \) (here \(\xi \) is a parameter), so that appropriate non-resonance conditions on the frequency \(\alpha (I)\) hold.
In order to work in a small neighbourhood of the prefixed torus \(\{ I\equiv \xi \}\) it is advantageous to introduce a set of coordinates \((\theta , y, z)\in \mathbb {T}^{\nu }\times \mathbb {R}^{\nu }\times H_S^{\perp }\) adapted to it, defined by
with \(b>1\) and where (recall \(\overline{u}_j=u_{-j}\))
The parameter b will be chosen close to one, to this purpose we shall set
and fix \(a>0\) appropriately small. For the tangential sites \(S^+:=\{ \overline{\jmath }_1,\ldots , \overline{\jmath }_{\nu }\}\) we will also denote \(\theta _{\overline{\jmath }_i}:=\theta _i\), \(y_{\overline{\jmath }_i}:=y_{i}\), \(\xi _{\overline{\jmath }_i}:=\xi _i\), \( i=1,\ldots , \nu \). The symplectic 2-form \(\Omega \) in (2.1), up to rescaling of time, becomes
where \(\Omega _{S^{\perp }}\) is the symplectic form \(\Omega \) in (2.1) restricted to the subspace \(H_{S}^{\perp }\) in (3.2). The Hamiltonian system generated by \(\mathcal {H}\) in (3.8) becomes
In the following lemma we prove that, under an appropriate choice of the tangential set (1.9), the function (4.5) is a diffeomorphism for \(\varepsilon \) small enough and then the system (4.2) is integrable and non-isochronous.
Lemma 4.1
(Twist condition). There exist \(\mathtt {r}_0, \mathtt {c}_{*}>0\) such that, for any choice of the tangential sites \(S^+\in \mathcal {V}(\mathtt {r})\) with \(0<\mathtt {r}\le \mathtt {r}_0 \) (see Definition 1.1), one has \(|\det \mathbb {A} |\ge \mathtt {c}_* \,\, \overline{\jmath }_1^{3 \nu }{.} \)
Proof
The proof is postponed in “Appendix A”. \(\quad \square \)
As a consequence of the non-degeneracy condition in Lemma 4.1 the map in (4.5) is invertible and we denote
5 The Nonlinear Functional Setting
We write the Hamiltonian in (4.10) (possibly eliminating constant terms depending only on \(\xi \) which are irrelevant for the dynamics) as
where \(\mathcal {N}\) describes the linear dynamics normal to the torus, and \(P:=H_{\varepsilon }-\mathcal {N}\) collects the nonlinear perturbative effects. Note that both N and P depend on \(\omega \) through the map \(\omega \mapsto \xi (\omega )\).
We consider \(H_{\varepsilon }\) as a \((\omega , \varepsilon )\)-parameter family of Hamiltonians and we note that, for \(P=0\), \(H_{\varepsilon }\) possess an invariant torus at the origin with frequency \(\omega \), which we want to continue to an invariant torus for the full system.
We will select the frequency parameters from the following set (recall (4.11))
Setting (see (4.7))
we define the non-resonant sets
for some constant C depending on S, where \(\mathbb {A}\) is defined in (4.6) and
We require that
Lemma 5.1
We have that \(|\Omega _{\varepsilon }{\setminus }\mathcal {G}_0 |\le C_{*}\varepsilon ^{2(\nu -1)} \gamma \,\) for some \(C_*=C_*(S)>0\,\).
Proof
The proof is postponed in “Appendix A”. \(\quad \square \)
Remark 5.2
The diophantine condition \(\omega \in \mathcal {G}_0^{(0)}\) is typical of KAM scheme. The lower bound in \(\mathcal {G}_0^{(1)}\) involves resonances of order five with two normal modes. As explained in the introduction, in order to impose such lower bounds we need to take into account also the corrections of order \(\varepsilon ^{2}\). The matrix \(\mathbb {A}\) comes from the weak BNF of Sect. 3. The terms \(\mathfrak {l}_{j}\) come from the linear BNF procedure of Sect. 7.2. In particular they are evaluated explicitly using the identification argument of Theorem 7.9.
Remark 5.3
Note that the definition of \(\gamma \) in (5.3) is slightly stronger than the minimal condition for which is possible to prove that \(\mathcal {G}_0^{(0)}\) has large measure, namely \(\gamma \le c\,\varepsilon ^2\), with \(c>0\) small enough. Our choice turns out to be useful for proving that the Cantor set of frequencies of the expected quasi-periodic solutions has asymptotically full measure (as \(\varepsilon \rightarrow 0\)).
We look for an embedded invariant torus
of the Hamiltonian vector field \(X_{H_{\varepsilon }}\) (see (5.1)) supporting quasi-periodic solutions with diophantine frequency \(\omega \in \mathcal {G}_0\).
For technical reason, it is useful to consider the modified Hamiltonian
More precisely, we introduce \(\zeta \) in order to control the average in the y-component in our Nash Moser scheme. The vector \(\zeta \) has no dynamical consequences since an invariant torus for the Hamiltonian vector field \(X_{H_{\varepsilon , \zeta }}\) is actually invariant for \(X_{H_{\varepsilon }}\) itself.
Thus, we look for zeros of the nonlinear operator \(\mathcal {F}(i, \zeta )\equiv \mathcal {F}(i, \zeta , \omega , \varepsilon ):= \omega \cdot \partial _{\varphi } i(\varphi )-X_{\mathcal {N}}(i(\varphi ))-X_P(i(\varphi ))+(0, \zeta , 0)\) defined as
where \(\Theta (\varphi ):=\theta (\varphi )-\varphi \) is \((2\pi )^{\nu }\)-periodic. We define the Sobolev norm of the periodic component of the embedded torus
where \( z \in H^s_{S^{\perp }}:= H^s \cap H_{S}^\perp \) (recall (3.2)) with norm defined in (2.7) and with abuse of notation, we are denoting by \(\Vert \cdot \Vert _{s}\) the Sobolev norms of functions in \(H^{s}(\mathbb {T}^{\nu },\mathbb {R}^{\nu })\). From now on we fix \(s_0:=[\nu /2]+4\).
Notice that in the coordinates (4.7), a quasi-periodic solution corresponds to an embedded invariant torus (5.8). Therefore we can reformulate the main Theorem 1 as follows.
Theorem 5.4
There exists a small constant \(\mathtt {r}>0\) such that, for any \(S^{+}\in \mathcal {V}(\mathtt {r})\) (see (1.9) and Definition 1.1), there exists \(\varepsilon _0>0\), small enough, such that the following holds. For all \(\varepsilon \in (0, \varepsilon _0)\) there exist positive constants \(C=C(\nu )\), \(\mu =\mu (\nu )\) and a Cantor-like set \(\mathcal {C}_{\varepsilon }\subseteq \Omega _{\varepsilon }\) (see (5.2)), with asymptotically full measure as \(\varepsilon \rightarrow 0\), namely
such that, for all \(\omega \in \mathcal {C}_{\varepsilon }\), there exists a solution \(i_{\infty }(\varphi ):=i_{\infty }(\omega , \varepsilon )(\varphi )\) of the equation \(\mathcal {F}(i_{\infty }, 0, \omega , \varepsilon )=0\) (see (5.10)). Hence the embedded torus \(\varphi \mapsto i_{\infty }(\varphi )\) is invariant for the Hamiltonian vector field \(X_{H_{\varepsilon }}\), and it is filled by quasi-periodic solutions with frequency \(\omega \). The torus \(i_{\infty }\) satisfies
Moreover the torus \(i_{\infty }\) is linearly stable.
We can deduce Theorem 1 from Theorem 5.4, indeed the quasi-periodic solution u in (1.11) is
for \(\omega =\omega (\xi )\in \mathcal {C}_{\varepsilon }\), where \(\omega (\xi )\) is the frequency amplitude map (4.5).
The rest of the paper is devoted to the proof of Theorem 5.4.
5.1 Tame estimates of the nonlinear vector field
We give tame estimates for the composition operator induced by the Hamiltonian vector fields \(X_{\mathcal {N}}\) and \(X_{P}\) in (5.10). Since the functions \(y\rightarrow \sqrt{\xi +\varepsilon ^{2(b-1)} y}, \theta \rightarrow e^{\mathrm {i}\,\theta }\) are analytic for \(\varepsilon \) small enough and \(|y |\le C\), classical composition results (see for instance Lemma 6.2 in [3]) imply that, for all \(\Vert \mathfrak {I} \Vert _{s_0}^{\gamma , \mathcal {O}}\le 1\),
In the following lemma we collect tame estimates for the Hamiltonian vector fields \(X_{\mathcal {N}}, X_{P}, X_{H_{\varepsilon }}\), see (5.1). These bounds rely on tame estimates for composition operators and their proof is completely analogous to the one in Sect. 5 of [4].
Lemma 5.5
Let \(\mathfrak {I}(\varphi )\) in (5.11) satisfy \(\Vert \mathfrak {I} \Vert _{s_0+1}^{\gamma , \mathcal {O}}\lesssim \,\varepsilon ^{9-2 b}\gamma ^{-1}\). Then we have
and for all \(\widehat{\imath }:=(\widehat{\Theta }, \widehat{y}, \widehat{z})\),
In the sequel we will use that, by the diophantine condition (5.7), the operator \((\omega \cdot \partial _{{\varphi }})^{-1}\) is defined for all functions u with zero \(\varphi \)-average, and satisfies
6 Approximate Inverse
We want to solve the nonlinear functional equation (see (5.10))
by applying a Nash–Moser scheme. It is well known that the main issue in implementing this algorithm concerns the approximate inversion of the linearized operator of \(\mathcal {F}\) at any approximate solution \((i_n, \zeta _n)\), namely \(\mathtt {D}\mathcal {F}(i_n, \zeta _n)\). Note that \(\mathtt {D}\mathcal {F}(i_n, \zeta _n)\) is independent of \(\zeta _n\). One of the main problems is that the \((\theta , y, z)\)-components of \(\mathtt {D}\mathcal {F}(i_n, \zeta _n)\) are coupled and then the linear system
is quite involved. In order to approximately solve (6.2) we follow the scheme developed by Berti–Bolle in [9] which describe a way to approximately triangularize (6.2). This method has been applied in [4, 39]. Since the strategy is identical to [39] we only summarize it and underline the differences which mainly come from the symplectic structure. For a fully detailed expository presentation see [40].
We now study the solvability of Eq. (6.2) at an approximate solution, which we denote by \((i_0, \zeta _0)\), \(i_0(\varphi )=(\theta _0(\varphi ), y_0(\varphi ), z_0(\varphi ))\) in order to keep the notations of [4, 39] . Assume the following hypothesis, which we shall verify at any step of the Nash–Moser iteration,
-
Assumption The map \(\omega \mapsto i_0(\omega )\) is a Lipschitz function defined on some subset \(\mathcal {O}_0\subseteq \mathcal {G}_0 \subseteq \Omega _{\varepsilon }\) (recall (5.7),(5.2)) and, for some \(\mathfrak {p}_0:=\mathfrak {p}_0( \nu )>0\),
$$\begin{aligned} \Vert \mathfrak {I}_0 \Vert _{s_0+\mathfrak {p}_0}^{\gamma , \mathcal {O}_0}\le \varepsilon ^{9-2b} \gamma ^{-1}, \quad \Vert Z \Vert _{s_0+\mathfrak {p}_0}^{\gamma , \mathcal {O}_0}\le \varepsilon ^{9-2 b}, \quad \gamma =\varepsilon ^{2b}, \end{aligned}$$(6.3)where \(\mathfrak {I}_0(\varphi ):=i_0(\varphi )-(\varphi , 0, 0)\) and Z is the error function
$$\begin{aligned} Z(\varphi ):=(Z_1, Z_2, Z_3)(\varphi ):=\mathcal {F}(i_0, \zeta _0)(\varphi )=\omega \cdot \partial _{\varphi } i_0 (\varphi )-X_{H_{\varepsilon , \zeta _0}}(i_0(\varphi )). \end{aligned}$$(6.4)
By estimating the Sobolev norm of the function Z we can measure how the embedding \(i_0\) is close to being invariant for \(X_{H_{\varepsilon , \zeta _0}}\). If \(Z=0\) then \(i_0\) is a solution. In general we say that \(i_0\) is “approximately invariant” up to order O(Z). We observe that by Lemma 6.1 in [4] we have that if \(i_0\) is a solution, then the parameter \(\zeta _0\) has to be naught, hence the embedded torus \(i_0\) supports a quasi-periodic solution of the “original” system with Hamiltonian \(H_{\varepsilon }\) (see (5.1)).
By [9] we know that it is possible to construct an embedded torus \(i_{\delta }(\varphi )=(\theta _0(\varphi ), y_{\delta }(\varphi ), z_0(\varphi ))\), which differs from \(i_0\) only for a small modification of the y-component, such that the 2-form \(\mathcal {W}\) (recall (4.9)) vanishes on the torus \(i_{\delta }(\mathbb {T}^{\nu })\), namely \(i_{\delta }\) is isotropic. In particular \(i_{\delta }(\varphi )\) is approximately invariant up to order O(Z) (see Lemma 7 in [9]) and, more precisely, there exists \(\tilde{\mathfrak {p}}:=\tilde{\mathfrak {p}}(\nu )>0\) such that
The strategy is to construct an approximate inverse for \(\mathtt {D}\mathcal {F}(i_0, \zeta _0)\) by starting from an approximate inverse for the linear operator \(\mathtt {D}\mathcal {F}(i_\delta , \zeta _0)\). The advantage of analyzing the linearized problem at \(i_{\delta }\) is that it is possible to construct a symplectic change of variable which approximately triangularizes the linear system thanks to the isotropicity of \(i_{\delta }\). For the details we refer to [9] and [4], here we only give the relevant definitions and state the main result. We define the symplectic change of coordinates
where \(\tilde{z}_0:=z_0 (\theta _0^{-1} (\theta ))\). We denote the transformed Hamiltonian by \(K:=K(\varphi , \eta , w, \zeta _0)\). We then define
where \(K_{02}\) is the linear operator representing the terms quadratic in w of K, i.e.
\(\mathcal {L}_{\omega }\) corresponds to the w-component of the linearized operator after the change of variable \(G_\delta \).
In [9] (see also [4, 39]) the following result is proved.
Theorem 6.1
Assume (6.3) and the following
Inversion Assumption There exist \(\mathfrak {p}_1:=\mathfrak {p}_1(\nu )>0\) and a set \(\Omega _{\infty }\subset \mathcal {G}_0\subseteq \Omega _{\varepsilon }\) such that for all \(\omega \in \Omega _{\infty }\) and every function \(g\in H^{s+2\tau +1} \cap H_{S}^\perp \), there exists a solution \(h:=\mathcal {L}_{\omega }^{-1} g\) of the linear equation \(\mathcal {L}_{\omega } h=g\) which satisfies
Then there exists \(\mu :=\mu ( \nu )\) such that, for all \(\omega \in \Omega _{\infty }\) there exists a linear operator \(\mathbf {T}_0\) such that:
-
1.
For all \(g:=(g^{(\theta )}, g^{(y)}, g^{(z)})\), one has
$$\begin{aligned} \Vert \mathbf {T}_0 g \Vert _s^{\gamma , \Omega _{\infty }}\lesssim _s\gamma ^{-1}(\Vert g \Vert _{s+\mu }^{\gamma , \Omega _{\infty }} +\varepsilon \gamma ^{-5/2} \Vert \mathfrak {I}_0 \Vert _{s+\mu }^{\gamma , \mathcal {O}_0} \Vert g \Vert _{s_0+\mu }^{\gamma , \Omega _{\infty }}){.} \end{aligned}$$(6.10) -
2.
\(\mathbf {T}_0\) is an approximate inverse of \(\mathtt {D}\mathcal {F}(i_0)\), namely
$$\begin{aligned}&\Vert (\mathtt {D}\mathcal {F}(i_0)\circ \mathbf {T}_0-\mathrm {I}) g \Vert _s^{\gamma , \Omega _{\infty }} \lesssim _s \varepsilon ^{2 b-1}{\gamma }^{-2} \Big ( \Vert \mathcal {F}(i_0, \zeta _0) \Vert _{s_0+\mu }^{\gamma , \mathcal {O}_0} \Vert g \Vert _{s+\mu }^{\gamma , \Omega _{\infty }}\nonumber \\&\qquad \qquad \qquad +\{\Vert \mathcal {F}(i_0, \zeta _0)\Vert _{s+\mu }^{\gamma , \mathcal {O}_0} +\varepsilon \gamma ^{-5/2} \Vert \mathcal {F}(i_0, \zeta _0) \Vert _{s_0+\mu }^{\gamma , \mathcal {O}_0} \Vert \mathfrak {I}_0 \Vert _{s+\mu }^{\gamma , \mathcal {O}_0}\} \Vert g \Vert _{s_0+\mu }^{\gamma , \Omega _{\infty }} \Big ){.} \end{aligned}$$(6.11)
6.1 The linearized operator in the normal directions
Recalling the assumption (6.3), in the sequel we assume that \(\mathfrak {I}_{\delta }:=\mathfrak {I}_{\delta }(\varphi ; \omega ) =i_{\delta }(\varphi ;\, \omega )-(\varphi ,\, 0,\, 0)\) satisfies, for some \(\mathfrak {p}_1>0\),
We note moreover that \(G_\delta \) in (6.6) is the identity plus a translation plus a finite rank linear operator; moreover, assuming (6.12), one has that \(G_{\delta }\) is \(O(\varepsilon ^{9-2 b}\gamma ^{-1})\)-close to the identity in low norm. Returning to the initial variables we set (see (4.7),(6.6))
and we have, for some \(\sigma :=\sigma (\nu )>0\),
By following Sect. 7 in [4] (see Lemma 7.1), \(K_{02}\) in (6.8) has rather explicit estimates.
Proposition 6.2
Assume (6.12). Then there exists \(\sigma _0=\sigma _0(\nu )>0\) such that the following holds. The Hamiltonian operator \(\mathcal {L}_{\omega }\) in (6.7) has the form
Recall that \(T_{\delta }\) is defined in (6.13), \(\Phi _B\) is the Birkhoff map given in Proposition 3.2, f is the Hamiltonian density in (1.3). The operator \(\mathcal {Q}_0\) is finite rank and has the form
In particular we divide \(\mathcal {Q}_0= \sum _{i=1}^5 \varepsilon ^i\mathcal {R}_i +\mathcal {R}_{>5}\), where the \(\mathcal {R}_i\), \(\mathcal {R}_{>5}\) are finite rank operators. Moreover we have
The remainders \(\mathcal {R}_i\) do not depend on \(\mathfrak {I}_\delta \) and satisfy
while \(\mathcal {R}_{>5}\) satisfies
Finally, recalling the Definition 2.3, we have
Proof
The expression (6.15) follows from the definition (6.8) by remarking that \(G_\delta \) and the weak BNF transformation \(\Phi _B\) is the identity plus a finite rank operator, while the action angle change of coordinates is a rescaling plus a finite rank operator (acting only on the v). Then, in applying the chain rule, we get
where the finite rank part contains all the terms where a derivative falls on the change of variables. Then (6.15) follows from the definition of H in (1.4). Regarding the estimates, (6.17) follows from (6.14); regarding the bounds (6.18), (6.19), we split the finite rank part \(R_1+R_2\) as follows. The operator \(R_1\) contains all terms arising form derivatives of \(G_\delta \). By tame estimates on the map \(G_{\delta }\) (see for instance Lemma 6.7 in [4]), it satisfies the bounds (6.19) and we put it in \(\mathcal {R}_{>5}\). The finite rank term \(R_2\) comes from the Birkhoff map. This is an analytic map so we consider the Taylor expansion
where each \(\Psi _i(u)\) is homogeneous of degree i in u, while \(\Psi _{\ge 6}=O(u^6)\) and all map \(H^1_0(\mathbb {T})\) in itself. We have to evaluate \(\Phi _B\) and its derivatives (up to order two) at \(u=T_{\delta }=\varepsilon v_{\delta }+\varepsilon ^b z_0\). We denote by \(\overline{v}\) the function
where \(\mathtt {l}(\overline{\jmath }_i)\) is the i-th vector of the canonical basis of \(\mathbb {Z}^{\nu }\) and is such that \(\mathtt {l}(-\overline{\jmath }_i)=-\mathtt {l}(\overline{\jmath }_i)\).
We observe thatFootnote 2
and hence we can expand
where \(\tilde{q}\) is a remainder which satisfies
Then in \(\mathcal {R}_i\) we include all the terms homogeneous of degree i coming from derivatives of \(\Phi _B-\mathrm {I}\) , evaluated at \(\tilde{q}=0\); we put in \(\mathcal {R}_{>5}\) all the rest. The (6.21), (6.22) follows by (6.18), (6.19) and (6.20). \(\quad \square \)
Remark 6.3
The motivation for separating the \(\mathcal {R}_i\) and \(\mathcal {R}_{>5}\) is the following. Consider the Hamiltonian \(H_\varepsilon \) as a function of \(\xi \) instead of \(\omega \). Then in all our expressions we can, and shall, evidence a purely polinomial term \(\sum _{i=0}^5 \varepsilon ^i f_i\) (where the \(f_i\) are \(\varepsilon \) independent) plus a remainder, which is not analytic in \(\varepsilon \), of size \(\varepsilon ^6+\varepsilon \Vert \mathfrak {I}_{\delta } \Vert _s^{\gamma , \mathcal {O}_0}\). By the assumption (6.3), this means that in low norm\(s=s_0+\mathfrak {p}_1\) all these remainders are negligible w.r.t. terms of order \(\varepsilon ^5\). This distinction is needed because, due to the resonant nature of the DP equation, we need to perform (see Sects. 7.1 and 7.2) five steps of the order reduction and of the linear BNF by hand, before entering in a perturbative regime.
In this framework \(\mathcal {R}_{>5}\) is purely a remainder, while the \(\mathcal {R}_i\) are homogeneous polynomial terms. One could apply the same division to the non finite rank terms, one would get
where g satisfies the same estimates as (6.19).
6.1.1 Hamiltonian of the linearized operator
Following Remark 6.3, we evidence the terms homogeneous in the Hamiltonian of \(\mathcal {L}_{\omega }\), let us call it \(\mathsf {H}\), whose Hamiltonian vector fields have degree \(\le 5\), since they are NOT perturbative. As explained in (6.26) this entails expanding the map \(\Phi _B(T_{\delta })\) in powers of \(\varepsilon \) up to order five plus a small remainder\(\tilde{q}\).
We consider the symplectic form in the extended phase space \(({\varphi },\eta ,z)\in \mathbb {R}^{\nu }\times \mathbb {R}^{\nu }\times H_S^{\perp }\)
with the Poisson brackets (recalling \(\{ \cdot , \cdot \}\) defined in (2.2))
The Hamiltonian of the operator (6.15) respect to the symplectic form (6.29) is (see (6.28))
with
for some \(\sigma >0\). The functions \(\mathsf {H}_{\mathcal R_i}\), \(\mathsf {H}_{\mathcal {R}_{>5}}\) are the quadratic forms associated to the corresponding linear operators, thus the estimates on the Hamiltonian vector fields can be deduced from (6.19), (6.18). It is easily seen that
Of course one can be even more explicit and write everything in terms of the original Hamiltonian (1.4) and of the generating functions of the weak BNF, for example one has
The terms \(\mathsf {H}_i\) can be computed explicitly, however we only need to prove that they fit the following definitions.
Definition 6.4
We say that a matrix \(\mathtt {B}:=\Big ((\mathtt {B})_{j}^{j'}(l-l')\Big )_{j, j'\in \mathbb {Z}, l, l'\in \mathbb {Z}^{\nu }}\) is almost diagonal if there exists a constant \(\mathtt {C}>0\) such that, if \((\mathtt {B})_{j}^{j'}(l-l')\ne 0\), then \(\langle j-j', l-l'\rangle \le \mathtt {C}\,\) for all \(j, j'\in S^c\), \(l, l'\in \mathbb {Z}^{\nu }\).
Let \(B(\varphi ):H^s(\mathbb {T})\rightarrow H^s(\mathbb {T})\) be a Töpliz in time operator (recall (2.12)). We say that \(B(\varphi )\) is almost diagonal if its associated matrix is almost diagonal.
Let \(H:=H(\varphi )\) be a quadratic Hamiltonian of the form \(H=( A(\varphi ) z, z)_{L^2}\), where \(A(\varphi )\) is a Töpliz in time operator. We say that H and its vector field are almost diagonal if \(A(\varphi )\) is almost diagonal.
Remark 6.5
It is easy to verify that if X and Y are almost diagonal operators then \(X+Y\), \(X \circ Y\) are almost diagonal.
Definition 6.6
Let \(p\in \mathbb {N}\) and \(m\in \mathbb {R}\). We say that a pseudo differential operator \(\mathfrak {B}=\mathrm{Op}(b({\varphi },x,j))\) (recall Definition 2.8) is homogenenous of degree p in the function \(\overline{v}\) in (6.25) if its symbol \(b({\varphi },x,j)\in S^{m}\) has the form
Definition 6.7
Let \(p\in \mathbb {N}\). We say that a Hamiltonian is pseudo differential and p-homogeneous if it has the form
where \(f_{p}\) is a homogeneous real valued function of \(\overline{v}\) (of degree p) of the form
\(\mathfrak {B}_p\in OPS^{-2}\) is a p-homogeneous pseudo differential operator according to Definition 6.6 which is self-adjoint w.r.t. to \((\cdot ,\cdot )_{L^2}\); finally \(\mathcal {R}_{p}\) is a finite dimensional operator of the form (6.16) with \(g_j, \chi _j\)p-homogeneous functions of \(\overline{v}\).
Lemma 6.8
The Hamiltonians \(\mathsf {H}_i + \mathsf {H}_{\mathcal {R}_i}\) in (6.31) are almost diagonal according to Definition 6.4 and pseudo differential homogeneous Hamiltonians according to Definition 6.7.
Proof
It follows by Proposition 6.2 and Remark 6.3. \(\quad \square \)
Lemma 6.9
Let \(p,q\in \mathbb {N}\) and consider \(H_p\), \(G_q\) two pseudo differential and homogenous Hamiltonians of degree respectively p and q. Then there is a pseudo differential and \((p+q)\)-homogeneous Hamiltonian \(\widetilde{H}\) such that \(X_{\widetilde{H}}=X_{\{H_p,G_{q}\}_{e}}\), where \(\{\cdot ,\cdot \}_e\) are defined in (6.30) and \(X_{H}\) denotes the Hamiltonian vector field generated by H.
Proof
By assumption \(H_p\) and \(G_{q}\) have the form (6.35) for some \(f_{p},f_q\) real valued and some self-adjoint pseudo differential homogenenous operators \(\mathfrak {B}_p\) and \(\mathfrak {B}_q\). Then we have (recalling (2.2), (1.4) and (3.5))
One has that the Hamiltonian
is equivalent to \(\{H_p,G_q\}_e\) in the sense that they generate the same vector field. Here \(A_i^{*}\), \(i=1,2\), denotes the adjoint of \(A_i\) w.r.t. the \(L^{2}\) scalar product. Notice that
which is an homogeneous function of \(\overline{v}\) of degree \(p+q\). Using the results on compositions of pseudo differential operators in Sect. 2 of [33], the fact that J is skew-self-adjoint, \(\mathfrak {B}_i\), \(i=p,q\), are self-adjoint, and \(f_q,f_p\) are real valued, we deduce that the operator \(A_2\) is a skew-self-adjoint operator in \(OPS^{-1}\). Hence, using the formula (2.13) in [33] for the adjoint, we have that \({A}_2+{A}_2^*\) is pseudo differential homogeneous operator (according to Definition 6.6) in \(OPS^{-2}\). \(\quad \square \)
7 Reduction and Inversion of the Linearized Operator
The aim of the section is to prove the claim in (6.9). As explained in the introduction, first one should reduce the unbounded parts of \(\mathcal {L}_{\omega }\) and then use classical KAM reducibility results to diagonalize. The difficulties arise from the fact that a few steps of this procedure must be done by hand, since they do not fit the typical smallness conditions, see [33].
The key result of this section is the following.
Theorem 7.1
Consider \(\mathcal {L}_{\omega }=\mathcal {L}_{\omega }(\mathfrak {I}_{\delta })\) in (6.15) and fix
There exist \(\mathcal S>s_0\) and \(\mu _1=\mu _1(\nu )>0\) such that, if condition (6.12) is satisfied with \(\mathfrak {p}_1=\mu _1\), then the following holds. There exists a constant \(m(\omega )\) defined for \(\omega \in \Omega _{\varepsilon }\) with
Footnote 3 such that for all \(\omega \) in the set \(\mathcal {O}_{\infty }^{2\gamma }\), where (recall that \(\mathcal {O}_0\subseteq \mathcal {G}_0\), see (5.7))
there exists a real, bounded linear operator \(\Upsilon =\Upsilon (\omega ) : H^{s}_{S^{\perp }}\rightarrow H^{s}_{S^{\perp }}\), for all \(s_0\le s\le \mathcal {S}\), such that
where \(\mathfrak {D}(\omega )\) is the diagonal operator of order \(-1\) defined as \(\mathfrak {D}:=\mathfrak {D}(\omega )= \mathrm {diag}(\mathrm {i}\kappa _j)_{j\in S^c} \), with
where \(\mathfrak {l}_{j}\) is defined in (5.6). The constant m depends on i and for \(\omega \in \mathcal {O}_{\infty }^{2\gamma }(i_1)\cap \mathcal {O}_{\infty }^{2\gamma }(i_2)\) one has
where \(\Delta _{12}m:=m(i_1)-m(i_2)\). The remainder \(\mathcal {P}_0\) in (7.4) is defined and Lipschitz in \(\omega \) belonging to the set \(\mathcal {O}^{2\gamma }_\infty \) and is Lip-\(-1\)-modulo tame (see Definition 2.7) with
for all \(\omega \in \mathcal {O}_{\infty }^{2{\gamma }}(i_1)\cap \mathcal {O}_{\infty }^{2{\gamma }}(i_2)\). Moreover if \(u=u(\omega )\) depends on the parameter \(\omega \in \mathcal {O}_{\infty }^{2 \gamma }\) in a Lipschitz way then
The result above has two relevant consequences. Firstly it shows that the operator \(\mathcal {L}_{\omega }\) in (6.15) can be conjugated to an operator (see (7.4)) which is “diagonal”, at the highest order of derivatives, plus a remainder which is \(-1\)-smoothing. In addition to this, thanks to a linear BNF procedure (performed in Sect. 7.2), the non-diagonal term \(\mathcal {P}_0\) in (7.4) has a size much smaller than \(\varepsilon \) (see estimates (7.7), (7.8)). In particular it is “perturbative” w.r.t. the constant \(\gamma \) in (5.3). This allows us to apply the reducibility scheme of [33] in order to complete the diagonalization of the operator \(\mathcal {L}\) (see Theorem 7.13). Then the inversion assumption (6.9) follows directly from Proposition 7.14).
Strategy of the Proof of Theorem7.1.
-
Reduction at the highest order The first step is to exploit the pseudo differential structure of the operator \(\mathcal {L}_{\omega }\) in order to conjugate it to an operator which has constant coefficients up to a smoothing remainder of order \(-1\). To this purpose we use changes of variables generated as the time-one flow map \(\Phi ^{\tau }\vert _{\tau =1}\) of Hamiltonians of the form
$$\begin{aligned} S(\tau ,{\varphi },z)= & {} \int b(\tau ,{\varphi },x) z^2 dx{,} \qquad b(\tau ,{\varphi },x):=\frac{\beta ({\varphi },x)}{1+\tau \beta _{x}({\varphi },x)}, \end{aligned}$$(7.10)$$\begin{aligned} \partial _{\tau } \Phi ^{\tau } u= & {} \Pi _S^{\perp }[(J\circ b) \Pi _S^{\perp }[\Phi ^{\tau }u]]{,}\qquad \Phi ^0 u=u{,} \end{aligned}$$(7.11)where \(\beta \) is some smooth function. In Proposition C.2 we show that \(\Phi ^{\tau }\) is well defined as symplectic map on \(H^{s}_{S^{\perp }}\) (see Lemma C.1) and study the structure of \(\Phi ^{\tau }\mathcal {L}_\omega (\Phi ^{\tau })^{-1}\). Proposition C.2 gives an explicit formula for the new coefficient at the highest order (see (C.17)). Then Corollary 3.6 of [31] (see also Proposition 3.6 in [33]) provides the solution for the Eq. (C.17)=const provided that some smallness condition is satisfied. This smallness condition has the form
$$\begin{aligned} C(s_1)\gamma ^{-1}\Vert a_0\Vert _{s_1}^{\gamma ,\mathcal {O}}\ll 1 \end{aligned}$$(7.12)
for some \(s_0+\mathfrak {p}_1>s_1>s_0\) and some constant \(C(s_1)>0\). As shown in [33], due to the Hamiltonian structure, this reduces \(\mathcal {L}_{\omega }\) to constant coefficients up to a correction of order \(-1\).
Unfortunately, since here \(\gamma = \varepsilon ^{2+a}\), \(a>0\), by (6.17), the coefficient \(a_0({\varphi },x)\) in \(\mathcal {L}_{\omega }\) does not satisfy (7.12). This is why we have to perform some preliminary steps in order to enter in the perturbative regime where we apply the scheme described in the proof of Corollary 3.6 in [31].
We first “regularize” the purely polynomial terms \(\mathsf {H}_i\) (see (6.32)) by hand, by exploiting their homogeneity according to Definition 6.7. After that we are left with only unbounded terms which satisfy the smallness conditions of [33]. We “regularize” them by applying the results of [33] adapted to our slightly more general setting, see Proposition C.2.
Remark 7.2
In order to determine the correct change of variables in the regularization of \(\mathsf {H}_i\), it will be convenient to use the Formal Lie expansions. We recall that \(H\circ (\Phi ^{\tau })^{-1}\) satisfies, for \(\tau \in [0,1]\),
By setting \(S:=S(0)\), the Lie expansion of the conjugated Hamiltonian \(H\circ \Phi ^{-1}\) is the following:
where the Poisson brackets \(\{\cdot ,\cdot \}_{e}\) are in (6.30). Recall that \(\Phi \) is a \(C^k\) map from \(H^s \) to \(H^{s-k}\). Therefore the Taylor expansion of the conjugated Hamiltonian coincides with the Lie series of the generator up to any order \(\tau ^k\).
-
Linear BNF The second step is to diagonalize the bounded terms. Here we diagonalize “by hand” the terms up to order \(\varepsilon ^3\), by exploiting the fact that they are almost diagonal according to Definition 6.4 and applying a linear BNF. Once this is done, the full diagonalization follows by a standard KAM reducibility theorem (see Theorem 7.13).
7.1 Reduction at the highest order
In the following we shall assume that the (6.12) holds with some \(\mathfrak {p}_1\gg 1\). The loss of regularity \(\mathfrak {p}_1\) will be determined explicitly at the end of the section. In order to perform the non-perturbative steps, we construct changes of coordinates \({\mathcal {B}}_{i}\), \(i=1,2,3,4,5\), as the time-one flow maps generated by Hamiltonians as in (7.10). Then we set \(\mathcal L_0:= \mathcal L_{\omega }\) and define iteratively \(\mathcal {L}_i:={\mathcal {B}}_i \mathcal {L}_{i-1} {\mathcal {B}}_i^{-1}\). Note that \(\mathcal L_0\) is pseudo differential plus a finite rank operator. Even though the \({\mathcal {B}}_{i}\) preserve the pseudo differential structure, in order to have a good quantitative control on the symbols we shall fix appropriate values
and write
where \(c_i(\omega )\) is a constant, \(a_i,\mathtt {q}_i\) are symbols, \(\mathrm{Op}(\mathtt {q}_i)\) is of order \(-1\) and \(\widehat{\mathcal {Q}}_i\in \mathfrak {L}_{\rho ,p}\). This is a class of operators of order \(-\rho \) which we introduced in [33] (we recall it in Definition C.5). Note that by Lemma C.7\(\mathcal {Q}_0=:\widehat{\mathcal {Q}}_0\) belongs to \(\mathcal {L}_{\rho , p}\) for all \(\rho ,p\), with bounds on \(\mathbb {M}^{\gamma }_{\widehat{\mathcal {Q}}_0}(s, \mathtt {b})\) given in the same lemma. Then one proves iteratively that
Note that the size of \(a_{i}\) (in the low norm) is decreasing in i. Regarding the remainders, the numbers \(\mathbb {M}^{\gamma }_{\widehat{\mathcal {Q}}_i}(s, \mathtt {b})\) control the norm of the corresponding operator, see Definition C.5. We have
with \(\sigma _0\) defined in Proposition 6.2 and \(\sigma _{i+3}>0\)\(i=1,\ldots ,5\), depending only on \(\nu \) (essentially \(\sigma _{i+3}\) are the losses coming from the application of Proposition C.2). Note that we can obtain (7.16) for any \(\rho ,p\) satisfying (7.15); however, if we want (7.18) to hold for some given p, we have to assume a smallness condition (6.12) with \(p+\sigma _0+\sigma _{i+3}<\mathfrak {p}_1\).
Step (\(\varepsilon \)). Consider the Hamiltonian
with \(S_4(\tau )\sim O(\tau ^3\varepsilon ^{4})\) and for some function \(\beta _1\) of the form (6.36) with \(p=1\) and some coefficients \((\beta _1)_{j}\), \(j\in S\), to be determined. The Hamiltonian system associated to \(S(\tau )\) is of the form (7.11) with \(b\rightsquigarrow b_1\). We call \({\mathcal {B}}_1\) the flow at time-one generated by \(S(\tau )\), then the Hamiltonian of the conjugated linearized operator \({\mathcal {B}}_1 \mathcal {L}_{\omega } {\mathcal {B}}_1^{-1}\) is (recall (6.31), (6.32) and Remark 7.2)
where, by Lemma 6.9, \(\mathsf {H}_3^{(1)}\) is some pseudo differential 3-homogeneous Hamiltonian of the form (6.35). Notice that also \(H_1^{(1)}\) and \(\mathsf {H}_2^{(1)}\) (for \(\eta =0\)) are pseudo differential and 1-homogeneous, resp. 2-homogenenous, Hamiltonians according to Definition 6.7. We want to solve the following equation
where \(\mathfrak {B}_1\) is some pseudo differential operator of order \(-2\). Recalling (6.32), and expanding \(\{S_1,\mathsf {H}_0\}\) as in the proof of Lemma 6.9, we note that the Eq. (7.24) is equivalent to the following one
Hence we choose \( \beta _1=\frac{1}{3} (\Lambda \partial _x)^{-1} \overline{v} \) and we note that
With the choice in (7.25) we have
In this way the Hamiltonians in (7.22), (7.23) become
By (7.26) the smallness assumption of Lemma C.1 is satisfied. By (6.17), (6.21), (6.22) and using the assumption (6.12) with \(\mathfrak {p}_1\) sufficiently large the condition (C.15) holds. In this case \(\mathtt {q}\rightsquigarrow 0\) (see (C.13)).
Then Proposition C.2 applies and the new linearized operator \(\mathcal {L}_1:={\mathcal {B}}_1 \mathcal {L}_{\omega } {\mathcal {B}}_1^{-1}\) has the form in (7.16) with \(i=1\) and \(c_1=0\). By (7.26), (6.21), (C.19), (C.17), (6.15), we have that \(\widehat{\mathcal {Q}}_1\in \mathfrak {L}_{\rho , p}(\mathcal {O})\) (see Definition C.5) (with \(\rho \), p satisfying (7.15)) and (7.18), (7.19) hold for \(i=1\).
The only estimates that are not given by Lemma C.2 are (7.17). The coefficient \(a_1\) is given by (C.17) with \(m\rightsquigarrow 1\), \(a\rightsquigarrow a_0\), \(a_+\rightsquigarrow a_1\) and \(\tilde{\beta }\) such that \(x\mapsto x+\tilde{\beta }\) is the inverse of \(x\mapsto x+\beta _1\). By the choice of \(\beta _1\) in (7.25) we have eliminated the \(\varepsilon \)-terms from \(a_1\). Hence by (7.26) and (6.17) we get (7.17) for \(i=1\).
Step (\(\varepsilon ^2\)). Now we deal with the terms of order \(\varepsilon ^2\) of the Hamiltonian (6.31). We consider the auxiliary Hamiltonian
where \(\tilde{S}_4(\tau ):=\tilde{S}(\tau )-\varepsilon ^2\tilde{S}_2 \sim O(\tau \varepsilon ^{4})\) and \(\beta _2\) is some function of the form (6.34), with \(p=2\), to be determined. Notice that \((\partial _{\tau }\tilde{S})(0)\sim O(\varepsilon ^{4})\). The Hamiltonian system associated to \(\tilde{S}(\tau )\) is of the form (7.11) with \(b\rightsquigarrow b_2\). If \({\mathcal {B}}_2\) is the flow at time-one generated by \(\tilde{S}(\tau )\), then the Hamiltonian of the conjugated linearized operator \(\mathcal {L}_2:={\mathcal {B}}_2\mathcal {L}_1 {\mathcal {B}}_2^{-1}\) is (recall (7.22), (7.28), (7.14))
We want to solve the equation
where \(\mathfrak {B}_2\) is some pseudo differential and 2-homogeneous operator of order \(-2\) (see Definition 6.6), c is some constant to be determined and \(\mathsf {H}_{\mathcal {R}_2}\) (possibly different from the one in (6.31)) is a Hamiltonian with the form (6.35). By Lemma 6.9 we have that \(\mathsf {H}^{(1)}_2 +\{\tilde{S}_2 ,\mathsf {H}_0\}\) can be written in the form (6.35) with, in particular (see (6.24) for the definition of \(\Psi _2\))
and some \(\mathfrak {B}_2\in OPS^{-2}\), as in Definition 6.6, up to a finite rank remainder. Hence the Eq. (7.31) is equivalent to
Since \(f_2\) in (7.32) has the form (6.36) with \(p=2\), we look for a function \(\beta _2\) of the same form in (6.36) with some coefficients \((\beta _2)_{j_1,j_2}\in \mathbb {C}\). Hence Eq. (7.33) reads
We have that, for \(j_1, j_2\in S\), \( {\lambda }(j_1)+{\lambda }(j_2)-(j_1+j_2)=0 \) if and only if \(j_1+j_2=0\), since \(j_1j_2\ne -1\). The terms with \(j_1=-j_2\) corresponds to the average in x of the function \(f_2(\overline{v})\). Hence we set
and we evaluate explicitly it. The functions \(\Psi _2(\overline{v})\) and \(\partial _{xx}(\beta _1^2)\) do not contribute since they have zero average in space.
Recalling that \(\beta _1=\frac{1}{3} (\Lambda \partial _x)^{-1} \overline{v}\) we have
Then the constant \(c=c(\omega )\) (recall the (4.11)) in (7.35) is given by
By noting that
by (7.18)–(7.17) with \(i=1\) and using the assumption (6.12) with \(\mathfrak {p}_1\) sufficiently large the smallness assumption of Lemma C.1 and the condition (C.15) are satisfied. In this case \(\mathtt {q}\rightsquigarrow \mathtt {q}_1\), hence by (7.18), (7.19) the bounds (C.13), (C.14) hold with \(\mathtt {k}_1\rightsquigarrow \varepsilon \), \(\mathtt {k}_2\rightsquigarrow \varepsilon \), \(\mathtt {k}_3\rightsquigarrow \varepsilon \), \(f\rightsquigarrow \mathfrak {I}_{\delta }\). Then Proposition C.2 applies and \(\mathcal {L}_2:=\Phi _2 \mathcal {L}_1 \Phi _2^{-1}\) with \(i=2\) and \(c_2(\omega ):=c(\omega )\) given in (7.36). By (7.37), (6.21), (C.19), (C.17) we have that \(\widehat{\mathcal {Q}}_2\in \mathfrak {L}_{\rho , p}(\mathcal {O})\) (with \(\rho \), p as in (7.15)) and (7.18), (7.19) hold for \(i=2\). By (7.37), (7.18)–(7.17) for \(i=1\), (C.20), we have that (7.18)–(7.17) holds for \(i=2\).
Steps\((\varepsilon ^{3})\)-\((\varepsilon ^4)\)-\((\varepsilon ^5)\). Consider \(i=3,4, 5\). We proceed exactly as in the previous steps. We consider a change of coordinates \({\mathcal {B}}_i\) as the time-one flow map of
for some smooth function \(\beta _i\) of the form (6.36) (with \(p=i\)) to be determined. Using Lemma 6.9 for the Hamiltonians of order \(\varepsilon ^{i}\), \(i=3,4,5\), we can choose \(\beta _i\) in order to solve an equation like the following
where \(f_i\) is a homogeneous function as in (6.36) (with \(p=i\)). The condition (1.13) implies that the Eq. (7.39) for \(i=4\) is solved up to remainders of the form
By (A.3) there are no small divisors for (7.39) if \(i=3\) or \(i=5\). By (7.18), (7.19) and by noting that
the smallness assumption of Lemma C.1 and the condition (C.15) are satisfied for the system (7.38). Arguing as in the previous steps we obtain that \(\mathcal {L}_5:={\mathcal {B}}_5{\mathcal {B}}_4 \mathcal {L}_3 {\mathcal {B}}_4^{-1}{\mathcal {B}}_5^{-1}\) has the form (7.16) with \(i=5\) and \(c_5=c_2+\varepsilon ^2 d(\xi )\). Moreover the bounds (7.18)–(7.17) hold for \(i=5\).
Remark 7.3
Since the symplectic maps \(\mathcal {B}_i\), \(i=1, \ldots , 5\) are smooth in \(\varepsilon \,\overline{v}\) (see (6.25) and Remark 7.2) and the Hamiltonian \(\mathsf {H}\) has the Taylor expansion (6.31), then the operators \(\mathrm{Op}(\mathtt {q}_5), \widehat{\mathcal {Q}_5}\) in (7.16) may be expanded, in degree of homogeneity of \(\varepsilon \overline{v}\) , in the following way (see Remark C.3)
with
for some \(\sigma >0\). Following Remark C.3 the \(\widehat{\mathcal {Q}}^{(i)}_5, \widehat{\mathcal {Q}}^{(\ge 4)}_5\) are in \(\mathfrak {L}_{\rho ', p'}\), as is habitual we rename them \(\rho ,p\).
By (6.31) and the fact that the generators \(\beta _i\) in (7.10) are \(\mathfrak {I}_{\delta }\)-independent, it is clear that \(\mathtt {q}_5^{(\ge 4)}\) and \(\widehat{\mathcal {Q}}^{(\ge 4)}_5\) contain terms of size \(\varepsilon ^4\), which are functions just of \(\overline{v}\), and terms dependent also on \(\mathfrak {I}_{\delta }\) of "size" \(O(\varepsilon \Vert \mathfrak {I}_{\delta } \Vert _{s+\sigma } )\), see the estimates (7.17), (7.18), (7.19). By the uniqueness of the Taylor expansion we have that \(\sum _{i=1}^3 \varepsilon ^i \,\big ( \,\mathtt {q}_5^{(i)}+\widehat{\mathcal {Q}}^{(i)}_5\big )\) coincide with the vector field \(-\sum _{i=1}^3 \varepsilon ^i\,J\,\nabla \,\mathcal {K}_i\) where, recalling (7.22), (7.30), (7.23), (7.36),
and \(\mathcal {K}_3\) is some pseudo differential 3-homogeneous Hamiltonian as in (6.35) with the corresponding function \(f_{3}(\overline{v})=0\).
Now we apply Proposition 3.6 in [33] (or Corollary 3.6 in [31]) in order to make constant the coefficient \(a_5\) of the linearized operator \(\mathcal {L}_5\), namely we find \(\beta \) such that
Note that, by (7.17) with \(i=5\) and (6.12), the smallness condition (7.12) is satisfied by the function \(a_{5}\). We have the following.
Proposition 7.4
There exists \(\beta ^{(\infty )}(\varphi , x)\) such that \((\varphi , x)\mapsto (\varphi , x+\beta ^{(\infty )}(\varphi , x))\) is a diffeomorphism of the torus \(\mathbb {T}^{\nu +1}\) with the following estimates (recall (7.3)),
for some \(\widehat{\sigma }>0\), and the following holds. If \(\beta \) is the function such that \((\varphi , x)\mapsto (\varphi , x+\beta (\varphi , x))\) is the inverse of the above diffeomorphism and \({\mathcal {B}}_6\) is the flow of the Hamiltonian PDE
then the conjugated of the operator \(\mathcal {L}_5\) in (7.16) with \(i=5\) is
where \(\mathcal {Q}_6=\mathrm{Op}(\mathtt {q}_6)+\widehat{\mathcal {Q}}_6\) is of order \(-1\), as in Proposition C.2, and m is a constant such that
Moreover, for any \(s\ge s_0\),
and \(\widehat{\mathcal {Q}}_{6}\in \mathfrak {L}_{\rho , p}\), for \(s_0\le s\le \mathcal {S}\), satisfies
with \(\widehat{\sigma }=\sigma _0+\sigma _9+\rho +s_1-s_0\) for some \(\sigma _9\), possibly larger than \(\sigma _8\) (recall (7.16) with \(i=5\) and \(s_1\) given in Proposition 3.6 in [33]).
Proof
The first order linear differential operator (recall (7.36), (7.40))
defined on \(H_{S^{\perp }}^s(\mathbb {T}^{\nu +1})\) is associated to the vector field on \(\mathbb {T}^{\nu +1}\)
For \(\mathfrak {p}_{1}\) in (6.12) large enough, i.e. if \(\mathfrak {p}_{1}\gg \sigma _0+s_1+\sigma _8\), and by (7.17) with \(i=5\), we have
provided that \(\varepsilon \) is small enough. This is the condition (7.12), hence Proposition 3.6 in [33] applies to the vector field (7.53). Thus there exist \(\beta ^{(\infty )}\) and m such that the bounds (7.45), (7.48) hold. In particular the second bound in (7.45) follows by Lemma 3.7 in [33]. Moreover the operator (7.52) conjugated by the transformation
is associated to the vector field
and by Proposition 3.6 in [33] we have that
By Lemma 11.4 in [1] the function \(\beta ^{(\infty )}\) satisfies the bound (7.45). By (7.45) and \(\mathfrak {p}_1\) large enough, for \(\varepsilon \) small enough, the function \(\beta ^{(\infty )}\) satisfies the smallness condition of Lemma C.1, indeed
Hence \({\mathcal {B}}_6\) is well defined. By (7.17), (7.18), (7.19), \(i=5\), the bounds (C.13), (C.14) hold with \(\mathtt {k}_1\rightsquigarrow \varepsilon ^6\), \(\mathtt {k}_2\rightsquigarrow \varepsilon \), \(\mathtt {k}_3\rightsquigarrow \varepsilon \gamma ^{-1}\) and Proposition C.2 applies and the thesis follows. \(\quad \square \)
Let us define
Then the Hamiltonian of the operator \(\mathcal {L}_6\) is (recall (6.31), (7.43) and (7.47))
Notice also that \(\{\mathtt {H}_0, \mathtt {Z}_0\}_e=0\). The expansion (7.57) allows us, together with Remark 7.3, to give a more precise expression of the remainder \(\mathcal {Q}_{6}\) in (7.47). This is the content of Lemma 7.6 in the next section.
7.2 Linear Birkhoff normal form
The aim of this section is to eliminate \(\mathcal {K}_1\), \(\mathcal {K}_3\) and normalize the Hamiltonian \(\mathcal {K}_2\) from (7.57). Our first point is that the \(-1\) smoothing remainder \(o(\varepsilon ^3)\) belongs to a special class of operators defined in Definition C.6 and denoted by \(\mathfrak {C}_{-1}\) . It turns out that this class is preserved under the changes of variables used in the linear Birkhoff normal form procedure (see Lemmata C.8, C.9).
Remark 7.5
In the following steps of linear Birkhoff normal form we shall use the relation
which holds by the conservation of momentum.
Lemma 7.6
Recall (7.47). We have
where \(X_{\mathcal {K}_i}:=J \nabla \mathcal {K}_i\), \(i=1, 2, 3, \) are almost diagonal and in \(\mathfrak {C}_{-1}(\mathcal {O}^{2\gamma }_{\infty })\) (recall Definitions 6.4 and C.6) satisfying
The remainder \(\mathfrak {R}\) belongs to \(\mathfrak {C}_{-1}(\mathcal {O}^{2\gamma }_{\infty })\) and satisfies
for \(\widehat{\sigma }\) given in Proposition 7.4.
Proof
By the discussion of Sect. 7.1\(\mathcal {K}_i\), \(i=1,2,3\), are of the form (6.35) with \(f_{i}=0\) for \(i=1,2,3\). Hence the vector field \(X_{\mathcal {K}_i}\) are pseudo differential of order \(-1\) up to a finite rank term. In addition, they are almost diagonal by (6.34) and the momentum condition (7.58). By Lemma C.8-(ii) \(\varepsilon X_{\mathcal {K}_1}, \varepsilon ^2 X_{\mathcal {K}_2}, \varepsilon ^3 X_{\mathcal {K}_3}\) belong to \(\mathfrak {C}_{-1}\) and, by (7.27), (7.26), (7.35), (7.37), (7.41), satisfy (7.60). By Proposition 7.4, the choice of \(\rho \) as in (7.15) and by Lemma C.8-(i) taking \(p=s_0\) and \(\mathfrak {p}_1\) large enough, \(\mathcal {Q}_6\in \mathfrak {C}_{-1}\). Thus \(\mathfrak {R}\in \mathfrak {C}_{-1}\).
Note that only \(\mathcal {Q}_6\) in (7.61) depend on the torus embedding \(i_{\delta }\), then the second bound in (7.62) follows by Lemma C.8-(i), (ii), (7.49) and (7.51). To prove the first bound in (7.62) we reason as follows.
By (7.45) and (7.17) with \(i=5\) we have that
Then the map \(\mathcal {B}_6\) leaves invariant (using Remark 7.2) the terms of size \(\varepsilon , \varepsilon ^2, \varepsilon ^3\) in \(\mathcal {L}_5\), and hence, by Remark 7.3, those terms in \(\mathcal {Q}_6\) are given by \( -\varepsilon J \nabla \, \mathcal {K}_1, -\varepsilon ^2 J \nabla \, \mathcal {K}_2, -\varepsilon ^3 J \nabla \, \mathcal {K}_3\).
From the proof of the bounds (C.18), (C.20) in Proposition C.2 one can notice that the operators \(\mathrm{Op}(\mathtt {q}_6)\) and \(\widehat{\mathcal {Q}}_6\) admit a “formal” expansion in \(\beta ^{(\infty )}\) (by expanding the flow in \(\tau \)). Of course, by the discussion above, the biggest term in \(\mathfrak {R}\) are the ones which are linear in \(\beta ^{(\infty )}\). Such term comes from the conjugation of \(\mathcal {L}_5\) under the map \(\mathcal {B}_6\), more precisely from the conjugation of
We refer to the formula (3.11) in Proposition 3.1 of [33] to see the term bounded by the norm of \(\beta ^{(\infty )}\). Comparing the bounds (7.63) and (7.42) one can deduce the first bound in (7.62). \(\quad \square \)
In order to normalize the vector fields \(\varepsilon ^i J \nabla \mathcal {K}_i\) we will look for changes of coordinates \(\Upsilon _i\) generated as one-time flow of quadratic Hamiltonians \(H_{\mathtt {A}_i}\) described by almost diagonal matrices \(\mathtt {A}_i\) (see (C.47), (C.48) ,(C.49), (C.50)). We remark that the Hamiltonian \(\varepsilon ^2\mathtt {Z}_0\) is left invariant by these changes of coordinates, since \(\{ \mathtt {H}_0, \mathtt {Z}_0\}_e=0\). At any step of the procedure we shall verify that \(J \mathtt {B}_i\) (see (C.49), (C.50)) are almost diagonal and belong to \(\mathfrak {C}_{-1}\) in order to apply Lemma C.13, which guarantees well-posedness and tame estimates of \(\Upsilon _i\).
Step one (order\(\varepsilon \)) At this step we want to eliminate \(\varepsilon X_{\mathcal {K}_1}\) from (7.59). We have
We choose \(\mathtt {A}_1\) such that
Recalling that \(\mathcal {K}_1:=\mathsf {H}_1^{(1)}\), we have (see (7.27), (7.28))
Then we choose \(\mathtt {B}_1=\mathfrak {B}_1\) in (C.49). By recalling the definition of \(\mathfrak {B}_1\) in (7.27) it is easy to see that \(J \mathtt {B}_1\in \mathfrak {C}_{-1}\), since it is a pseudo differential operator of order \(-1\). Moreover it is almost diagonal because J, \(3\Lambda \partial _x\) are diagonal operators and \(\beta _1\) is a function supported on the finite set S.
Given \(X\in \mathfrak {C}_{-1}\) to shorten the notation in the following lemma we write (recall (2.3))
Under this notation we have the following lemma.
Lemma 7.7
The transformed operator is (recall (7.59), (7.64))
where
the operator \(\mathcal {R}_{7}\in \mathfrak {C}_{-1}\) with
for some \(\tilde{\sigma }\ge \widehat{\sigma }\) (recall the loss of regularity in (7.49), (7.50), (7.51)).
Proof
By using (C.58) we have that formulæ (7.70), (7.71) hold and that
For \(Y \in \mathfrak {C}_{-1}\) define \(\mathcal {Z}_n:=\sum _{k\ge n} \frac{\varepsilon ^k}{k!}\mathrm {ad}_{X_{\mathtt {A}_1}}^k[Y]\) for any \(n\ge 1\). By Lemma C.9, and using (C.54), we deduce that
for \(\varepsilon \) small enough. In (7.73) there are terms of the form \(\mathrm {ad}^k_{X_{\mathtt {A}_1}}[Y]\), for some \(k\ge 1\), with \(Y=X_{\mathcal {K}_1}, X_{\mathcal {K}_2}, X_{\mathcal {K}_3}, \mathfrak {R}\) which belong to \(\mathfrak {C}_{-1}\) by Lemmata C.8 and 7.6. We note that by (7.66)
since \(\mathtt {A}_1\) is almost diagonal. Hence \((\omega -\overline{\omega })\cdot \partial _{\varphi }X_{\mathtt {A}_1}, [X_{\mathtt {A}_1}, J] \in \mathfrak {C}_{-1}\) (see the proof of Lemma C.13) and by Lemma C.8-(iii) the remainder \(\mathcal {R}_7\in \mathfrak {C}_{-1}\). By (7.73), (C.43), (C.44), (7.74), (7.75), (7.60), (7.62) and the fact that \(|\omega -\overline{\omega } |\lesssim \varepsilon ^2\) we get the bounds (7.72). \(\quad \square \)
Step two (order\(\varepsilon ^2\)) At this step we want to normalize \(\varepsilon ^2 X_{\mathcal {K}_2^{(1)}}\) from (7.69). We have
where \(\mathcal {K}_2^{(1)}\) is given in (7.65) (see also (7.70)). We choose \(\mathtt {A}_2\) in order to solve the following equation
Hence we choose \(\mathtt {B}_2= \nabla \Pi _{\mathrm{Rg}(\mathsf {H}_0)} \mathcal {K}^{(1)}_2(u)\) in (C.49). Note that \(X_{\mathcal {K}_2}\) is pseudo differential of order \(-1\) and \(J \mathtt {A}_1\), \(X_{\mathcal {K}_1}\) belong to \(\mathfrak {C}_{-1}\) and so also their Poisson brackets. Hence \(J \mathtt {B}_2\in \mathfrak {C}_{-1}\). By Remark 6.5 we have that \(J \mathtt {B}_2\) is also almost diagonal.
In order to perform the third step in the linear BNF we need to explicitly compute the corrections \(O(\varepsilon ^2)\) coming from \(\Pi _{\mathrm{Ker}(\mathsf {H}_0)}\mathcal {K}^{(1)}_2\). The point is that a priori, it is not clear whether the resonant terms \(\Pi _{\mathrm{Ker}(\mathsf {H}_0)}\mathcal {K}^{(1)}_2\) are supported only on trivial resonances. Our approach is then to show that the normal form we obtain must necessarily coincide with the formal one, which is relatively easy to compute.
Definition 7.8
Recalling the notations used in Sect. 3, we denote by \(\Pi ^{d_z\le k}\), respectively \(\Pi ^{d_z=k}\), the projector of a homogenous Hamiltonian of degree n on the monomials with degree less or equal than k, respectively equal k, in the normal variable z, i.e.
We denote by \(\Pi _{\mathrm {triv}}\) the projection onto trivial resonances (of the form (3.11)), i.e. monomials of the form
The following proposition allows to easily compute the resonant terms \(\Pi _{\mathrm{Ker}(\mathsf {H}_0)}\mathcal {K}^{(1)}_2\) in (B.17).
Theorem 7.9
(Normal form identification). Consider the symplectic change of coordinates \(A_{\varepsilon }\) in (4.7). Then
where \(A_1:=A_{{\varepsilon }_{|_{\varepsilon =1}}}\), \(\mathtt {H}_0\) is in (6.32) and we set (recalling (2.4)) \(\mathfrak {F}^{(3)}:=[\mathrm {ad}_{H^{(2)}}]^{-1}H^{(3)}\) with \(H^{(3)}\) in (3.1).
Proof
The proof is postponed to the “Appendix B”. \(\quad \square \)
As a consequence of the identification above, we have, by (7.78), (3.21), (3.1),
where (recall (1.8))
We define the diagonal operator (recall (7.36))
Lemma 7.10
We have
for an appropriate constant \(C>0\) depending on the set S.
Proof
Recalling the definitions (7.36) and (7.80) we have, for \(j\in S^c\),
It is easy to prove that \(|{\lambda }(j) |\le 4 |j |\), \(3+j_0^2+j^2\pm j_0 j\ge \frac{3}{4} j^2 \) and \((1+j_0^2)(7+5 j_0^2+j_0^4+3 j^2)\le 14 j_0^6 j^2\). Hence \( |\kappa (j) |\lesssim \frac{|j |}{j^2} \sum _{j_0\in S^+} j_0^6. \)\(\quad \square \)
Lemma 7.11
The transformed operator is (recall (7.69))
where \(\mathcal {K}_3^{(2)}=\mathcal {K}_3^{(1)}\), \(\mathfrak {D}(\xi )\) is the diagonal operator of order \(-1\) defined in (7.81), \(\mathcal {R}_8\in \mathfrak {C}_{-1}\) satisfies
for some \(\tilde{\sigma }\) possibly larger than the one in Lemma 7.7.
Proof
The proof follows by using the same arguments of the proof of Lemma 7.7. In particular, expanding the left hand side of (7.84) using (C.58) we get
By (7.77) and Theorem 7.9 we have that
By (7.10) \(\mathfrak {D}(\xi )\in \mathfrak {C}_{-1}\) and by the fact that \(\mathtt {A}_2\) is almost diagonal we have that \((\omega -\overline{\omega })\cdot \partial _{\varphi }X_{\mathtt {A}_2}, [X_{\mathtt {A}_2}, J]\in \mathfrak {C}_{-1}\). Then \(\mathcal {R}_{8}\in \mathfrak {C}_{-1}\). The bounds (7.85) are obtained by using the estimates (C.43), (C.44), (7.74), (C.54) and (7.72). \(\quad \square \)
Step three (order\(\varepsilon ^3\)) At this step we eliminate \(\varepsilon ^3 X_{\mathcal {K}^{(2)}_3}\) from (7.84). Recalling that \(\mathcal {K}^{(2)}_3\) is given in Lemma 7.11, we have
Note that we consider in the normal form also the \(\varepsilon ^2\)-terms. We want to solve the equation
Hence we choose the matrix \(\mathtt {B}_3:=\nabla \mathcal {K}^{(1)}_3(u)\) (note that \(\mathcal {K}^{(1)}_3=\mathcal {K}^{(2)}_3\)). Recalling (7.71) it is easy to see that \(J \mathtt {B}_3\) is sum of Lie brackets of elements of \(\mathfrak {C}_{-1}\), hence by Lemma C.8 it belongs to \(\mathfrak {C}_{-1}\). By the fact that \(\mathtt {A}_1\) is almost diagonal and by Remark 6.5 we have that \(J \mathtt {B}_3\) is almost diagonal.
Lemma 7.12
The transformed operator is (recall (7.69))
where \(\mathfrak {D}(\xi )\) is the diagonal operator of order \(-1\) defined in (7.81), \(\mathcal {R}_9\in \mathfrak {C}_{-1}\) satisfies
for some \(\tilde{\sigma }\) possibly larger than the one in Lemma 7.11.
Proof
The proof follows the same arguments used for proving Lemma 7.7. By (C.58) we deduce
We note that by (7.88) we have (recall (4.5) and (7.36))
since \(\mathtt {A}_3\) is almost diagonal . Hence the bounds (7.90) follows by (C.54), (7.85) and by using Lemma C.9. \(\quad \square \)
Proof of Theorem 7.1
We choose \(\mu _1=\tilde{\sigma }\) given in Lemma 7.12. We consider p and \(\mathfrak {p}_1\) so that
where \(\tilde{\sigma }\) is the loss of regularity in Lemma 7.12, \(\sigma _0\) has been introduced in Sect. 7, see estimates (6.17)–(6.22), \(\sigma _1>0\) and \(s_1\) are given respectively in Lemma C.1 and in Proposition 3.6 in [33].
We define the map (recall (C.48), (7.56))
By Proposition 7.4 the map \(\mathcal {B}\) is defined for \(\omega \in \mathcal {O}^{2{\gamma }}_{\infty }\) (see (7.3)), and so also \(\Upsilon \). By (7.26), (7.37), (7.41), (C.55), (C.54), (7.45), (7.17) (with \(i=5\)) and Lemma C.1 we have (7.9).
The result follows by setting \(\mathcal {L}:=\mathcal {L}_9\) (see (7.89)), \(\mathcal {P}_0:=\mathcal {R}_9\), and m is the constant given by Proposition 7.4. Indeed (7.48) implies (7.2) and (7.6). Moreover, by Lemma 7.12, we have that \(\mathcal {P}_0\in \mathfrak {C}_{-1}\) and satisfies (7.90). Lemma C.10 implies that \(\mathcal {P}_0\) is \(-1\)-modulo tame together with \(\mathtt {b}_0\) derivatives in the variable \(\varphi \) (recall the Definition C.32 and the fact that \(\gamma ^{3/2}<{\gamma }\)). The bounds (7.7), (7.8) follow from (C.45)–(C.46) , the definition of \(\mathbb {B}^{{\gamma }}_{\mathcal {P}_0}(s)\) (see (C.32),(C.34)) and (7.90). By Lemma 7.10 we deduce (7.5).
\(\square \)
7.3 KAM reducibilty and Inversion of the linearized operator
In this subsection we prove the claim (6.9) by diagonalizing the operator \(\mathcal {L}\) in (7.4). We first write
Notice that (by the smallness condition (7.94)) Proposition 4.1 in [33] applies to the operator \(\mathcal {L}\) in (7.4). Hence by following almost word by word the proof of Theorem 1.7 in [33] one has the following.
Theorem 7.13
(Reducibility). Fix \(\tilde{{\gamma }}\in [\gamma ^{3/2}/4, 4\gamma ^{3/2}]\). Assume that \(\omega \mapsto i_{\delta }(\omega )\) is a Lipschitz function defined on \(\mathcal {O}_0\subseteq \Omega _{\varepsilon }\) (recall (6.3)), satisfying (6.12) with \(\mathfrak {p}_1 \ge \mu _1\) where \(\mu _1:=\mu _1(\nu )\) is given in Proposition 7.1. There exist \(\delta _0\in (0, 1)\), \(N_0>0\), \(C_0>0\), such that, if (recall that by (4.8) \({\gamma }= \varepsilon ^{2+a}\))
then the following holds.
-
(i)
(Eigenvalues) For all \(\omega \in \Omega _{\varepsilon }\) there exists a sequenceFootnote 4
$$\begin{aligned} d_j^{\infty }(\omega ):=d_j^{\infty }(\omega , i_{\delta }(\omega )):=m(\omega , i_{\delta }(\omega ))\,{\lambda }(j)+\varepsilon ^2 \kappa _j (\omega ) + r_j^{\infty }(\omega , i_{\delta }(\omega )), \quad j\in S^c, \end{aligned}$$(7.95)with m and \(\kappa _j\) in (7.48) and (7.81) respectively. Furthermore, for all \(j\in S^c\)
$$\begin{aligned} \sup _j \langle j\rangle |r_j^{(\infty )}|^{\gamma ^{3/2}} \lesssim \varepsilon ^{4-3 a}, \qquad r_j^{\infty }=-r^{\infty }_{-j}. \end{aligned}$$(7.96)All the eigenvalues \(\mathrm {i} d_j^{\infty }\) are purely imaginary.
-
(ii)
(Conjugacy) For all \(\omega \) in the set
$$\begin{aligned}&\Omega ^{2 {\tilde{\gamma }}}_{\infty } :=\Omega ^{2 \tilde{\gamma }}_{\infty }(i_{\delta })\nonumber \\&\quad :=\left\{ \omega \in \mathcal {O}^{2\gamma }_{\infty } : |\omega \cdot \ell \!+\! d_j^{\infty }(\omega )\!-\! d_k^{\infty }(\omega )|\ge \frac{2 {\tilde{\gamma }}}{\langle \ell \rangle ^{\tau }}, \,\,\forall \ell \!\in \!\mathbb {Z}^{\nu }, \,\,\forall j, k\in S^c,\, j\ne k \right\} \nonumber \\ \end{aligned}$$(7.97)there is a real, bounded, invertible, linear operator \(\Phi _{\infty }(\omega ):H^s_{S^{\perp }}(\mathbb {T}^{\nu +1})\rightarrow H_{S^{\perp }}^s(\mathbb {T}^{\nu +1})\), with bounded inverse \(\Phi _{\infty }^{-1}(\omega )\), that conjugates \(\mathcal {L}\) in (7.4) to constant coefficients, namely
$$\begin{aligned} \mathcal {L}_{\infty }(\omega )&:=\Phi _{\infty } (\omega ) \circ \mathcal {L} \circ \Phi _{\infty }^{-1}(\omega )=\omega \cdot \partial _{\varphi }-\mathcal {D}_{\infty }(\omega ),\nonumber \\ \mathcal {D}_{\infty }(\omega )&:=\mathrm {diag}_{j\in S^c} \{ \mathrm {i} d_j^{\infty }(\omega ) \}. \end{aligned}$$(7.98)The transformations \(\Phi _{\infty }, \Phi _{\infty }^{-1}\) are tame and they satisfy for \(s_0\le s\le \mathcal {S}\) (recall \(\mu _1\) in Theorem 7.1)
$$\begin{aligned} \Vert (\Phi ^{\pm 1}_{\infty }-\mathrm {I}) h \Vert ^{\gamma ^{3/2}, \Omega _{\infty }^{2 \tilde{\gamma }}}_{s}&\lesssim _s \big (\varepsilon ^{4-3a} \gamma ^{-3/2}+\varepsilon \gamma ^{-5/2} \Vert \mathfrak {I}_{\delta } \Vert _{s+\mu _1}^{\gamma , \mathcal {O}_0}\big ) \Vert h \Vert ^{\gamma ^{3/2}, \Omega _{\infty }^{2 \tilde{\gamma }}}_{s_0}\nonumber \\&\quad +\varepsilon ^{4-3a} \gamma ^{-3/2} \Vert h \Vert ^{\gamma ^{3/2}, \Omega _{\infty }^{\tilde{\gamma }}}_s. \end{aligned}$$(7.99)Moreover \(\Phi _{\infty }, \Phi _{\infty }^{-1}\) are symplectic, and \(\mathcal {L}_{\infty }\) is a Hamiltonian operator.
-
(iii)
(Dependence on\(i_{\delta }(\omega )\)) Let \(i_1(\omega )\) and \(i_2(\omega )\) be two Lipschitz maps satisfying (6.12) with \(\mathfrak {I}_{\delta }\rightsquigarrow i_k(\varphi )-(\varphi , 0, 0)\), \(k=1, 2\), and such that
$$\begin{aligned} \Vert i_1-i_2 \Vert _{s_0+\mu _1}\lesssim \rho N^{-(\tau +1)} \end{aligned}$$(7.100)for N sufficiently large and \(0<\rho <\gamma ^{3/2}/4\). Fix \({\gamma }_1\in [\gamma ^{3/2}/2, 2 \gamma ^{3/2}]\) and \({\gamma }_2:={\gamma }_1-\rho \). Let \(r_j^{(\infty )}(\omega , i_k(\omega ))\) be the sequence in (7.95) with \(\tilde{{\gamma }}\rightsquigarrow {\gamma }_k\) for \(k=1, 2\). Then for all \(\omega \in \Omega _{\infty }^{{\gamma }_1}(i_1)\) we have, for some \(\kappa >(3/2) \tau \),
$$\begin{aligned} {\gamma }^{-1}|\Delta _{12} m |+ \sup _j \langle j\rangle |\Delta _{12}r_j^{(\infty )}| \le \varepsilon {\gamma }^{-1} \Vert i_1-i_2 \Vert _{s_0+\mu _1}+\varepsilon ^{4- 3 a} N^{-\kappa }. \end{aligned}$$(7.101)
Proof
The proof of items (i)–(ii) follow by Theorem 1.9 in [33]. The only point left to prove is item (iii). We apply Theorems 1.4, 1.5 in [33]. We have that
where \(\kappa >\tau \). Here \(m^{(N)}(i_2)\), \(r_j^{(N)}(i_2)\) are defined in (1.39) of [33] and they are an approximation of \(m(i_2)\) and \(r_j^{(\infty )}(i_2)\) satisfying
The bounds (7.102), (7.103) imply the (7.101). \(\quad \square \)
7.4 Proof of the inversion assumption (6.9)
We are in position to give estimates on the inverse of the operator \(\mathcal {L}_{\omega }\) in (6.15). Let us now define (recall (5.3))
We deduce the inversion assumption (6.9) by the following result.
Proposition 7.14
Assume the hypothesis of Theorem 7.13, (6.12) with \(\mathfrak {p}_1\ge \mu _1+2\tau +1\), where \(\mu _1\) is given in Theorem 7.1. Then for all \(\omega \in \Omega _{\infty }:=\Omega _{\infty }^{2 \gamma ^{3/2}}(i_{\delta }) \cap \mathcal {F}_{\infty }^{2\gamma }(i_{\delta })\) (see (7.97)), for any function \(g\in H^{s+2\tau +1}_{S^{\perp }}(\mathbb {T}^{\nu +1})\) the equation \(\mathcal {L}_{\omega } h=g\) has a solution \(h=\mathcal {L}_{\omega }^{-1} g\in H_{S^{\perp }}^s(\mathbb {T}^{\nu +1})\), satisfying
Proof
We conjugated the operator \(\mathcal {L}_{\omega }\) in (6.15) to a diagonal operator \(\mathcal {L}_{\infty }=\chi \mathcal {L}_{\omega } \chi ^{-1}\), see (7.98), with (recall (7.99) and Theorem 7.1) \(\chi :=\Phi _{\infty }\circ \Upsilon \). Moreover, by (7.9) and (7.99) we have the following estimates
We have
and then \( \Vert \mathcal {L}_{\infty }^{-1} g \Vert ^{\gamma , \Omega _{\infty }}_s\le \gamma ^{-1} \Vert g \Vert _{s+2\tau +1}^{\gamma , \Omega _{\infty }}. \) Thus we get the estimate (7.105). \(\quad \square \)
8 The Nash–Moser Nonlinear Iteration
In this section we prove Theorem 5.4. It will be a consequence of the Nash–Moser theorem 8.1.
Consider the finite-dimensional subspaces
where
and \(\Pi _n\) are the projectors (which, with a small abuse of notation, we denote with the same symbol)
We define \(\Pi _n^{\perp }=\mathrm {I}-\Pi _n\). The classical smoothing properties hold, namely, for all \(\alpha , s\ge 0\),
Recall (5.3), (4.8) for the definition of b we set \(a:=2b-2\). We define the following constants
where \(\mu :=\mu ( \nu )>0\) is the “loss of regularity” given by the Theorem 6.1 and \(C_1\) is fixed below.
Theorem 8.1
(Nash–Moser). Assume that \(f\in C^{\infty }\) (see (1.3)). Let \(\tau :=2\nu +6\). Then there exist \(C_1>\max \{ \alpha _0+\alpha , C_0 \}\) (where \(C_0:=C_0( \nu )\) is the one in Theorem 7.13), \(\delta _0:=\delta _0( \nu )>0\) such that, if
then there exists \(C_{*}=C_{*}(S)>0\) such that for all \(n\ge 0\) the following holds:
- \((\mathcal {P}1)_n\):
-
there exists a function \((\mathfrak {I}_n, \zeta _n):\mathcal {G}_n \subseteq \Omega _{\varepsilon } \rightarrow E_{n-1}\times \mathbb {R}^{\nu }, \omega \mapsto (\mathfrak {I}_n(\omega ), \zeta _n(\omega )), (\mathfrak {I}_0, \zeta _0):=(0, 0), E_{-1}:=\{0\}\), where the set \(\mathcal {G}_0\) is defined in (5.7) and the sets \(\mathcal {G}_n\) for \(n\ge 1\) are defined inductively by:
$$\begin{aligned}&\mathcal {G}_{n+1}:= \bigcap _{i=0}^2 \Lambda _{n+1}^{(i)}{,}\;\;\mathrm{with}\;\;\; \Lambda ^{(0)}_{n+1}:=\left\{ \omega \in \mathcal {G}_n : |\omega \cdot \ell +m(i_n) j |\ge \frac{2\,\gamma _n}{\langle \ell \rangle ^{\tau }}, \,\,\forall j\in S^c , \ell \in \mathbb {Z}^{\nu } \right\} {,} \nonumber \\&\Lambda ^{(1)}_{n+1}:=\left\{ \omega \in \mathcal {G}_n : |\omega \cdot \ell +d_j^{\infty }(i_n) |\ge \frac{2\,\gamma _n}{\langle \ell \rangle ^{\tau }}, \,\, \forall j\in S^c , \ell \in \mathbb {Z}^{\nu } \right\} {,}\nonumber \\&\Lambda ^{(2)}_{n+1}:=\left\{ \omega \in \mathcal {G}_n : |\omega \cdot \ell +d_j^{\infty }(i_n)-d_k^{\infty }(i_n) |\ge \frac{2\,\gamma ^{3/2}_n\,}{\langle \ell \rangle ^{\tau }}, \,\, \forall j, k\in S^c, j\ne k , \ell \in \mathbb {Z}^{\nu } \right\} {,} \end{aligned}$$(8.6)where \(\gamma _n:=\gamma (1+2^{-n})\), \(\gamma ^{*}_n:=\gamma ^{3/2}(1+2^{-n})\) and \(d_j^{\infty }(\omega ):=d_j^{\infty }(\omega , i_n(\omega ))\) are defined in (7.95) (and \(d_0^{\infty }(\omega )=0\)). Moreover \(|\zeta _n |^{\gamma , \mathcal {G}_n} \lesssim \Vert \mathcal {F}(U_n) \Vert _{s_0}^{\gamma , \mathcal {G}_n}\) and
$$\begin{aligned} \Vert \mathfrak {I}_n \Vert _{s_0+\mu }^{\gamma , \mathcal {G}_n} \le C_* \varepsilon ^{b_*} \gamma ^{-1}{,} \quad \Vert \mathcal {F}(U_n) \Vert _{s_0+\mu +3}^{\gamma , \mathcal {G}_n} \le C_* \varepsilon ^{b_*}{,} \end{aligned}$$(8.7)where \(U_n:=(i_n, \zeta _n)\) with \(i_n(\varphi )=(\varphi , 0, 0)+\mathfrak {I}_n(\varphi )\). The differences \(\widehat{\mathfrak {I}}_n:=\mathfrak {I}_n-\mathfrak {I}_{n-1}\) (where we set \(\widehat{\mathfrak {I}}_0:=0\)) is defined on \(\mathcal {G}_n\), and satisfy
$$\begin{aligned} \Vert \widehat{\mathfrak {I}}_1 \Vert _{s_0+\mu }^{\gamma , \mathcal {G}_1} \le C_* \varepsilon ^{b_*} \gamma ^{-1}{,} \quad \Vert \widehat{\mathfrak {I}}_n \Vert _{s_0+\mu }^{\gamma , \mathcal {G}_{n}} \le C_* \varepsilon ^{b_*} \gamma ^{-1} N_{n-1}^{-\alpha }{,} \quad \forall n\ge 2{.} \end{aligned}$$(8.8) - \((\mathcal {P} 2)_n\):
-
\(\Vert \mathcal {F}(U_n) \Vert _{s_0}^{\gamma , \mathcal {G}_n} \le C_* \varepsilon ^{b_*} N_{n-1}^{-\alpha }\) where we set \(N_{-1}:=1\).
- \((\mathcal {P}3)_n\):
-
(High Norms) \(\Vert \mathfrak {I}_n \Vert _{s_0+\beta _1}^{\gamma , \mathcal {G}_n} \le C_* \varepsilon ^{b_*} \gamma ^{-1} N_{n-1}^k\) and \(\Vert \mathcal {F}(U_n) \Vert _{s_0+\beta _1}^{\gamma , \mathcal {G}_n} \le C_* \varepsilon ^{b*} N_{n-1}^k\).
- \((\mathcal {P} 4)_n\):
-
(Measure) The measure of the “Cantor-like” sets \(\mathcal {G}_n\) satisfies
$$\begin{aligned} |\Omega _{\varepsilon }{\setminus } \mathcal {G}_0 |\le C_* \varepsilon ^{2 (\nu -1)} \gamma {,} \quad |\mathcal {G}_n {\setminus } \mathcal {G}_{n+1} |\le C_* \varepsilon ^{2 (\nu -1)} \gamma N_{n-1}^{-1}{.} \end{aligned}$$(8.9)
Proof
To simplify the notations we omit the index \({\gamma }, \mathcal {G}_n\) on the norm \(\Vert \cdot \Vert _s\).
Proof of\((\mathcal {P}_1)_0, (\mathcal {P}_2)_0, (\mathcal {P}_3)_0\). Recalling (5.10), we have, by the second estimate in (5.13),
Hence the smallness conditions in \((\mathcal {P}_1)_0, (\mathcal {P}_2)_0, (\mathcal {P}_3)_0\) hold taking \(C_*\) large enough.
Assume that\((\mathcal {P}_1)_n, (\mathcal {P}_2)_n, (\mathcal {P}_3)_n\)hold for some\(n\ge 0\), and prove\((\mathcal {P}_1)_{n+1}, (\mathcal {P}_2)_{n+1}, (\mathcal {P}_3)_{n+1}\). By (8.4) and (8.5)
for \(\varepsilon \) small enough. If we take \(C_1\) bigger than \(C_0\) in Theorem 7.13 then (7.94) holds. In (6.12) we consider \(\mathfrak {p}_0=\mathfrak {p}_1+\tilde{\mathfrak {p}}\), where \(\mathfrak {p}_1:=\mu _1+2\tau +1\) and \(\tilde{\mathfrak {p}}\) appears in (6.5). Since \(\mu \gg \mathfrak {p}_0\) (8.7) implies (6.3) and so (6.12), and Proposition 7.14 applies. Hence the operator \(\mathcal {L}_{\omega }:=\mathcal {L}_{\omega }(\omega , i_n (\omega ))\) in (6.15) is defined on \(\mathcal {O}_0= \mathcal {G}_n\) and is invertible for all \(\omega \in \mathcal {G}_{n+1}\) since \(\mathcal {G}_{n+1}\subseteq \Omega ^{2\gamma ^{*}_n}_{\infty }(i_n) \cap \mathcal {F}_{\infty }^{2{\gamma }_n}(i_n)\) and the (7.105) holds. This means that the assumption (6.9) of Theorem 6.1 is verified with \(\Omega _{\infty }=\mathcal {G}_{n+1}\). By Theorem 6.1 there exists an approximate inverse \(\mathbf{T} _n(\omega ):=\mathbf{T} _0(\omega , i_n(\omega ))\) of the linearized operator \(L_n(\omega ):=\mathtt {D}\mathcal {F}(\omega , i_n(\omega ),\zeta _n) \equiv \mathtt {D}\mathcal {F}(\omega , i_n(\omega ),0)\), satisfying (6.10). By (8.5), (8.7)
and, by (6.11), using also (8.5), (8.7), (8.3),
Now, for all \(\omega \in \mathcal {G}_{n+1}\), we can define, for \(n\ge 0\),
where \(\tilde{\Pi }_n(\mathfrak {I}, \zeta ):=(\Pi _n \mathfrak {I}, \zeta )\) with \(\Pi _n\) defined in (8.2). By construction we have
Then, by the definition of \(H_{n+1}\) in (8.15), using \([L_n, \Pi _n]\) and writing \(\tilde{\Pi }_n^{\perp }(\mathfrak {I}, \zeta ):=(\Pi _n^{\perp } \mathfrak {I}, 0)\) we have
where
Lemma 8.2
Define
Then there exists \(K:=K(s_0, \beta _1)>0\) such that, for all \(n \ge 0\), setting \(\alpha _0:=3\mu +3\)
The proof of Lemma 8.2 follows almost word by word the proof of Lemma 9.2 in [4].
Proof of\((\mathcal {P}_3)_{n+1}\). By (8.18) and \((\mathcal {P}_3)_n\)
provided \(2 K N_n^{\alpha _0+\rho ^{-1}-k}N_{n-1}^k\le 1, \forall n\ge 0\). Choosing k as in (8.4) and \(N_0\) large enough, i.e. for \(\varepsilon \) small enough. By (8.17) and the bound (8.19) \((\mathcal {P}_3)_{n+1}\) holds.
Proof of\((\mathcal {P}_2)_{n+1}\). Using (8.17), (8.18) and \((\mathcal {P}_2)_n, (\mathcal {P}_3)_n\), we get
and \(w_{n+1}\le C_* \varepsilon ^{b_*+1} \gamma ^{-2} N_n^{-\alpha }\) provided that
The inequalities in (8.20) hold by (8.5), taking \(\alpha \) as in (8.4), \(C_1>\alpha _0+\alpha \) and \(\delta _0\) in (8.5) small enough. By (8.17), the inequality \(w_{n+1}\le C_* \varepsilon ^{b_*+1} \gamma ^{-2} N_n^{-\alpha }\) implies \((\mathcal {P}_2)_{n+1}\).
Proof of\((\mathcal {P}_1)_{n+1}\). The bound (8.8) for \(\widehat{\mathfrak {I}}_1\) follows by (8.15), (8.10) (for \(s=s_0+\mu \)) and \( \Vert \mathcal {F}(U_0) \Vert _{s_0+2 \mu } \lesssim _{s_0+2\mu } \varepsilon ^{b_*}{.} \) The bound (8.8) for \(\widehat{\mathfrak {I}}_{n+1}\) follows by (8.2), \((\mathcal {P}_2)_n\) and (8.4). It remains to prove that (8.7) holds at the step \(n+1\). We have
taking \(\alpha _1\) as in (8.4) and \(N_0\) large enough, i.e. \(\varepsilon \) small enough. Moreover, using (8.2), \((\mathcal {P}_2)_{n+1}, (\mathcal {P}_3)_{n+1}\), (8.4) we get
which is the second inequality in (8.7) at the step \(n+1\). The bound \(|\zeta _{n+1} |^{\gamma }\lesssim \Vert \mathcal {F}(U_{n+1}) \Vert _{s_0}^{\gamma }\) is a consequence of Lemma 6.1 in [39].
To conclude the proof of Theorem (8.1) it remains to show the bounds (8.9). This is done in the next section. \(\quad \square \)
8.1 Measure estimates
Let us define for \(0<\eta \le \overline{\jmath }_1^4\,\sqrt{\varepsilon }\), \(\sigma \ge 1\) and \(n\in \mathbb {N}\)
Recalling (8.6) we can write, setting \(\eta \rightsquigarrow \gamma _n\) for the sets \(Q_{\ell j }(\eta ,\sigma )\) and \(P_{\ell j }(\eta ,\sigma )\), \(\eta \rightsquigarrow \gamma ^{*}_n\) for the set \(R_{\ell jk}(\eta ,\sigma )\), and \(\sigma \rightsquigarrow \tau \),
Since, by (5.7) and \(\gamma >\gamma ^{3/2}\) (see (8.5)), \(R_{\ell j k}(i_n)=\emptyset \) for \(j=k\), in the sequel we assume that \(j\ne k\). We start with a preliminary lemma, which gives a first relation between \(\ell ,j,k\) which must be satisfied in order to have non empty resonant sets.
Lemma 8.3
Let \(n \ge 0\). There is a constant \(C>0\) dependent of the tangential set and independent of \(\ell , j, k, n, i_n, \omega \) such that the following holds:
-
If \(R_{\ell j k}(i_n,\eta ,\sigma )\ne \emptyset \) then \(|\ell |\ge C |{\lambda }(j)-{\lambda }(k) |\ge \frac{C}{2}\,|j-k |\);
-
If \(Q_{\ell j}(i_n,\eta ,\sigma )\ne \emptyset \) then \(|\ell |\ge C |j |\);
-
If \(P_{\ell j}(i_n,\eta ,\sigma )\ne \emptyset \) then \(|\ell |\ge C |j |\).
Proof
If \(R_{\ell j k}(i_n)\ne \emptyset \), then there exists \(\omega \) such that
Moreover, using (7.95), (7.96), (7.2), (7.5), we get \(|d_j^{\infty }(\omega , i_n(\omega ))-d_k^{\infty }(\omega , i_n(\omega )) |\ge \frac{1}{3} |{\lambda }(j)-{\lambda }(k) |. \) Thus, for \(\varepsilon \) small enough
and this proves the first claim on \(R_{\ell j k}\). If \(Q_{\ell j}\ne \emptyset \) then we have \(|m j|< {2 \eta }{\langle \ell \rangle ^{-\sigma }} + |{\omega }\cdot \ell |\). Hence, for \(\varepsilon \) small enough, we have
Following the same arguments and by using that \(|d_j |\ge C |j |\) for some constant \(C>0\) we get the last statement. \(\quad \square \)
8.1.1 Measure of a resonant set
The aim of this subsection is to prove the following lemma.
Lemma 8.4
There is \(\mathtt {r}_0>0\) such that, for any \(0<\mathtt {r}\le \mathtt {r}_0\), and any choice of \(S^{+}\in \mathcal {V}(\mathtt {r})\) we have that
for some \(K=K(S)\). The same holds for \(Q_{\ell j}(\eta ,\sigma )\) and \(P_{\ell j}(\eta ,s)\).
The proof of the lemma above involves many arguments and we split it into several steps.
In several bounds we will evidence the dependence of the constants on the tangential set S in order to highlight that the smallness of the amplitudes \(\xi \) depends on the choice of the tangential sites.
Let us first consider the set \(R_{\ell j k}\), which is the most difficult case. We study the sub-levels of the function \(\omega \mapsto \phi _R(\omega )\) defined by (recall (4.5),(7.95))
We recall that (see (7.2), (7.48))
where \(\kappa _{j}\) is defined in (7.5) (see also (7.40)) and
We first study some properties of the function \(\phi _{R}(\omega )\) in (8.27).
The small divisor\(\phi _{R}(\omega )\)as an affine function of\(\omega \) It will be useful to consider \(\phi _R(\omega )\) in (8.27) as a small perturbation of an affine function in \(\omega \)
where
and the remainder \(q_{j k}(\omega )\) satisfies
Lemma 8.5
Denoting \(\vec {p}_j = {\lambda }(j)\vec {v}+ \vec {w}_j\), we have the following bounds:
Proof
The first bound follows by the fact that \(3+\overline{\jmath }_i^2+j^2 \pm j_2 \overline{\jmath }_i\ge 3+\frac{j^2+\overline{\jmath }_i^2}{2} \ge \frac{j^2}{2}\) and \((1+j^2)(1+\overline{\jmath }_i^2)(2+j^2 +\overline{\jmath }_i^2)\lesssim \overline{\jmath }_1^4 \,j^4\). The others follow similarly. \(\quad \square \)
Fix \(\alpha \in (0,1/2)\) and let
We have the following estimates for sets \(R_{\ell j k}\) with \(|\ell |\) “large”.
Lemma 8.6
Let \(|\ell |> \varepsilon ^{-\beta }\). Then \(R_{\ell jk}(\eta ,\sigma )\) satisfies (8.26).
Proof
In this proof we shall denote by C(S) a running positive constant depending on the set S. Suppose that \(|j-k |\le c_0 |\ell |\) with \(c_0=c_0(S)\) small. By Lemmata 8.5 and 4.1 we have \(|\mathbb {A}^{-T}(\vec {p}_j -\vec {p}_k)|\lesssim \overline{\jmath }_1^4 |j-k |< {|\ell |}/{2} \) for \(c_0\) sufficiently small. This means that \(|b_{\ell jk}| \gtrsim |\ell |/2\). Now suppose that \(|j-k |> c_0 |\ell |\). Then
for \(\varepsilon \) small enough. By (8.22), (8.33)
for \(\varepsilon \) small enough and \(\sigma \ge 1\). Again we have shown that \(|b_{\ell jk}|>\delta |\ell |\) with \(\delta := c_0/ 2|\overline{\omega } |\). Split \(\omega =s \widehat{b}+b^\perp \) where \(\widehat{b}:=b/|b |\) and \(b^\perp \cdot b=0\). Let \(\Psi _R(s):=\phi _R(s \widehat{b}+b^\perp )\). For \(\varepsilon \) small enough, by (8.33), we get
The lemma follows by Fubini’s theorem. \(\quad \square \)
We now prove that if the main term (in size) of \(\phi _{R}(\overline{\omega })\) is big enough and \(|\ell |\) is bounded by some constant then the bad set \(R_{\ell j k}(\eta , \sigma )=\emptyset \). We remark that
Lemma 8.7
If \(|\ell |\le \varepsilon ^{-\beta }\) and \(|\overline{\omega }\cdot \ell +{\lambda }(j)-{\lambda }(k) |\ge \gamma _0 \langle \ell \rangle ^{-\sigma }\) , where \(\gamma _0=\varepsilon ^{\alpha }\), then \(R_{\ell j k}(\eta , \sigma )=\emptyset \).
Proof
By definition
By Lemma 8.3 (recall (8.27)) we have \(|j-k |\lesssim C |\ell |\) and so
for some \(C(S)>0\), for \(\varepsilon \) small enough, by (8.34). \(\quad \square \)
Lemma 8.8
Let \(|\ell |\le \varepsilon ^{-\beta }\) and \(|\overline{\omega }\cdot \ell +{\lambda }(j)-{\lambda }(k) |\le {\gamma _0}{\langle \ell \rangle ^{-\sigma }}\). Then \(R_{\ell jk}(\eta ,\sigma )\) satisfies (8.26).
Proof
Let us call \(p:=\overline{\omega }\cdot \ell +{\lambda }(j)-{\lambda }(k)\) and note that \(|p |\lesssim {\gamma }_0=\varepsilon ^{\alpha }\). We also remark that \(\ell \ne 0\) since for \(j\ne k\) one has \( |{\lambda }(j)-{\lambda }(k)|>1/2\). We substitute p in the definition of \(b_{\ell jk}\) (see (8.32))
Then, using (A.5) in Lemma (A.1) we have \(|b_{\ell jk}|\ge |\ell |\delta /2\) for \(\varepsilon \) small enough. The thesis follows reasoning as in Lemma (8.6). \(\quad \square \)
Proof of Lemma 8.4
For the sets \(R_{\ell j k}\) the lemma follows by Lemmata 8.6, 8.7, 8.8.
The proof for the sets \(Q_{l j}\) and \(P_{\ell j}\) follows using the same arguments used for \(R_{\ell j k}\). Lemmata 8.6, 8.7 are identical, with the only difference that the non-resonance condition now reads respectively \( |\overline{\omega }\cdot \ell + j|\ge \gamma _0\langle \ell \rangle ^{-\sigma }{,} |\overline{\omega }\cdot \ell +{\lambda }(j) |\ge \gamma _0\langle \ell \rangle ^{-\sigma } \) in the case of \(Q_{\ell j}\) and \(P_{\ell j}\). Regarding Lemma 8.8, it follows from (A.4) in the case of \(Q_{\ell j}\) and from (A.6) in the case of \(P_{\ell j}\). \(\quad \square \)
8.1.2 Summability
Lemma 8.9
For \(n\ge 1, |\ell |\le N_{n-1}\), one has \(R_{\ell j k}(i_n),Q_{\ell j}(i_n),P_{\ell j}(i_n)=\emptyset \).
Proof
We first note that, by Lemma 8.3, if \(|{\lambda }(j)-{\lambda }(k)|> C_1^{-1}|\ell |\) (for some \(C_1=C_1(S)\)) then \(R_{\ell j k}(i_n)= R_{\ell j k}(i_{n-1})=\emptyset \), so that our claim is trivial. Otherwise, if
By (7.101) (with \(i_{\delta }^{(1)}\rightsquigarrow i_n\) and \(i_{\delta }^{(2)}\rightsquigarrow i_{n-1}\), \(N\rightsquigarrow N_{n-1}\)) and (8.8) we have for all \(j, k\in S^c\)
where \(\mathtt {a}:=\min \{\kappa ,\alpha \}\) (recall \(\alpha \) in (8.4) and \(\kappa \) in (7.13)). Now for all \(j\ne k\), \(|\ell |\le N_{n-1}\), \(\omega \in \mathcal {G}_n\) by (8.36)
since \(\varepsilon ^{4-3 a}\gamma ^{-3/2}N_n^{\tau -(2/3)\mathtt {a}}2^{n+1}\le 1\). Since by definition \(R_{\ell j k}(i_n)\subseteq \mathcal G_n\) then \(R_{\ell j k}(i_n)=\emptyset \) .
Now we prove that \(Q_{\ell j}(i_{n-1}) \subseteq Q_{\ell j}(i_n)\). We have
and then
since \(|\ell |\le N_{n-1}\).
As before, by (7.101), for all \(j, k\in S^c\)
For all \(j\ne k\), \(|\ell |\le N_{n-1}\), \(\omega \in \mathcal {G}_n\) by (8.36)
since \(\varepsilon ^{4-3 a}\gamma ^{-1}N_n^{\tau -(2/3)\mathtt {a}}2^{n+1}\le 1\). \(\quad \square \)
We have proved that
Lemma 8.10
There exists \(\mathtt {C}>0\) such that if \(|j |, |k |\ge \mathtt {C} \langle \ell \rangle ^{\nu +2}\gamma ^{-(1/2)}\) then (recall that \(\tau =2\nu +6>\nu +2\))
Proof
We have that
for \(\mathtt {C}\) big enough and since \(\varepsilon ^2 (\sqrt{\gamma })^{-1}\ll 1\). \(\quad \square \)
We are in position to prove (8.9). We have, by (8.42),
On one hand we have that, using Lemmata 8.4 and 8.10,
On the other hand
The discussion above implies estimates (8.9).
8.2 Conclusion of the Proof of Theorem 5.4
Theorem 8.1 implies that the sequence \((\mathfrak {I}_n, \zeta _n)\) is well defined for \(\omega \in \mathcal {G}_{\infty }:=\cap _{n\ge 0} \mathcal {G}_n\), \(\mathfrak {I}_n\) is a Cauchy sequence in \(\Vert \cdot \Vert _{s_0+\mu }^{{\gamma },\mathcal {G}_{\infty }}\) (see (8.8)) and \(|\zeta _n |^{\gamma }\rightarrow 0\). Therefore \(\mathfrak {I}_n\) converges to a limit \(\mathfrak {I}_{\infty }\) in norm \(\Vert \cdot \Vert _{s_0+\mu }^{\gamma ,\mathcal {G}_{\infty }}\) and, by \((\mathcal {P} 2)_n\), for all \(\omega \in \mathcal {G}_{\infty }, i_{\infty }(\varphi ):=(\varphi , 0, 0)+\mathfrak {I}_{\infty }(\varphi )\) is a solution of
by (8.7), (8.5). Therefore \(\varphi \mapsto i_{\infty }(\varphi )\) is an invariant torus for the Hamiltonian vector field \(X_{H_{\varepsilon }}\) (recall (5.1)). By (8.9),
The set \(\Omega _{\varepsilon }\) in (5.2) has measure \(|\Omega _{\varepsilon } |=O(\varepsilon ^{2 \nu })\). Hence \(|\Omega _{\varepsilon }{\setminus } \mathcal {G}_{\infty } |/|\Omega _{\varepsilon } |\rightarrow 0\) as \(\varepsilon \rightarrow 0\) because \(\gamma =o(\varepsilon ^2)\), and therefore the measure of \(\mathcal {C}_{\varepsilon }:=\mathcal {G}_{\infty }\) satisfies (5.12).
It remains to show the linear stability of the embedding \(i_{\infty }({\varphi })\). By the discussion of Sect. 6 (see also [4] for further details) and Sect. 7, since \(i_{\infty }({\varphi })\) is isotropic and solves the Eq. (8.45), it is possible to find a change of coordinates \(G_{\infty }\) (of the form (6.6)), so that in the linearized system of the Hamiltonian \(H_{\varepsilon }\circ G_{\infty }(\varphi , \eta , w)\) the equation for the actions is given by \(\dot{\eta }=0\). Moreover, by Sect. 7 the linear equation for the normal variables w is conjugated, by setting \(w=\Upsilon \circ \Phi _{\infty }(z)\) to the diagonal system \(\dot{z}_j-\mathrm{i}d_j^{\infty }(\omega )z_j=f_{j}(\omega t), j\in S^{c}\), where \(f(\omega t)\) is a forcing term.
Since \(d_j^{\infty }\in \mathbb {R}\) a standard argument shows that the Sobolev norms of w do not increase in time. For further details see [2, 4, 40].
Notes
Since \(\omega \) is diophantine we can replace the time variable with angles \(\varphi \in \mathbb {T}^{\nu }\). The time dependence is recovered by setting \(\varphi =\omega t\).
The function \(\varepsilon \overline{v}\) represents a torus supporting a quasi-periodic motion which is invariant for the system (5.1) with \(P=0\), namely it is the approximate solution from which we bifurcate.
Notice that \(\xi =\xi (\omega )\), recall (4.11).
Whenever it is not strictly necessary, we shall drop the dependence on \(i_{\delta }\).
References
Baldi, P.: Periodic solutions of fully nonlinear autonomous equations of Benjamin–Ono type. Ann. Inst. H. Poincaré (C) Anal. Non Linéaire 30, 33–77 (2013)
Baldi, P., Berti, M., Haus, E., Montalto, R.: Time quasi-periodic gravity water waves in finite depth. Invent. Math. 214(2), 739–911 (2018)
Baldi, P., Berti, M., Montalto, R.: KAM for quasi-linear and fully nonlinear forced perturbations of Airy equation. Math. Ann. 359, 471–536 (2014)
Baldi, P., Berti, M., Montalto, R.: KAM for autonomous quasi-linear perturbations of KdV. Ann. I. H. Poincaré (C) Anal. Non Linéaire 33, 1589–1638 (2016)
Bambusi, D., Maspero, A.: Birkhoff coordinates for the Toda lattice in the limit of infinitely many particles with an application to FPU. J. Funct. Anal. 270(5), 1818–1887 (2016)
Berti, M., Biasco, L., Procesi, M.: KAM theory for the Hamiltonian derivative wave equation. Ann. Sci. de l’ENS 46(2), 299–371 (2013)
Berti, M., Biasco, L., Procesi, M.: KAM theory for reversible derivative wave equations. Arch. Ration. Mech. Anal. 212(3), 905–955 (2014)
Berti, M., Bolle, P.: Quasi-periodic solutions with Sobolev regularity of NLS on \(\mathbb{T}^d\) with a multiplicative potential. Eur. J. Math. 15, 229–286 (2013)
Berti, M., Bolle, P.: A Nash–Moser approach to KAM theory. In: Guyenne, P., Nicholls, D., Sulem, C. (eds.) Hamiltonian PDEs and Applications. Fields Institute Communications, vol. 75, pp. 255–284. Springer, Berlin (2014)
Berti, M., Delort, J.M.: Almost global solutions of capillarity-gravity water waves equations on the circle. UMI Lecture Notes (2017)
Berti, M., Corsi, L., Procesi, M.: An abstract Nash–Moser theorem and quasi-periodic solutions for NLW and NLS on compact Lie groups and homogeneous manifolds. Commun. Math. Phys. 334(3), 1413–1454 (2015)
Berti, M., Feola, R., Pusateri, F.: Birkhoff normal form and long time existence for periodic gravity water waves (2018). Preprint arXiv:1810.11549
Berti, M., Feola, R., Pusateri, F.: Birkhoff normal form for gravity water waves. Water Waves (2020). https://doi.org/10.1007/s42286-020-00024-y
Berti, M., Kappeler, Th, Montalto, R.: KAM Tori for perturbations of the defocusing NLS equation. Astérisque 403, viii+148 (2018)
Berti, M., Montalto, R.: Quasi-Periodic Standing Wave Solutions of Gravity-Capillary Water Waves. Memoirs of the American Mathematical Society, American Mathematical Society, Providence, RI (2016)
Bourgain, J.: Construction of quasi-periodic solutions for Hamiltonian perturbations of linear equations and applications to nonlinear PDE. Int. Math. Res. Not. 11, 475–497 (1994)
Bourgain, J.: Quasi-periodic solutions of Hamiltonian perturbations of 2D linear Schrödinger equations. Ann. Math. (2) 148(2), 363–439 (1998)
Camassa, R., Holm, D.: An integrable shallow water equation with peaked solution. Phys. Rev. Lett. 71(11), 1661–1664 (1993)
Chierchia, L., You, J.: KAM Tori for 1D nonlinear wave equations with periodic boundary conditions. Commun. Math. Phys. 211, 497–525 (2000)
Constantin, A.: Global existence and breaking waves for a shallow water equation: a geometric approach. Ann. Inst. Fourier (Grenoble) 50, 321–362 (2000)
Constantin, A., Escher, J.: Wave breaking for nonlinear nonlocal shallow water equations. Acta Math. 181(2), 229–243 (1998)
Constantin, A., Ivanov, R.: Dressing method for the Degasperis–Procesi equation. Stud. Appl. Math. 138(2), 205–226 (2017)
Constantin, A., Lannes, D.: The hydrodynamical relevance of the Camassa–Holm and Degasperis–Procesi equations. Arch. Ration. Mech. Anal. 192, 165–186 (2009)
Corsi, L., Feola, R., Procesi, M.: Finite dimensional invariant KAM Tori for tame vector fields. Trans. Am. Math. Soc. 372, 1913–1983 (2019)
Corsi, L., Montalto, R.: Quasi-periodic solutions for the forced Kirchhoff equation on \(\mathbb{T}^{d}\). Nonlinearity 31(11), 5075–5109 (2018)
Craig, W., Wayne, C.E.: Newton’s method and periodic solutions of nonlinear wave equations. Commun. Pure Appl. Math. 46(11), 1409–1498 (1993)
Craig, W., Worfolk, P.: An integrable normal form for water waves in infinite depth. Physica D 84(3–4), 513–531 (1995)
Degasperis, A., Holm, D.D., Hone, A.N.W.: A new integrable equation with peakon solutions. Theor. Math. Phys. 133(2), 1463–1474 (2002)
Degasperis, A., Procesi, M.: Asymptotic integrability. In: Degasperis, A., Gaeta, G. (eds.) Symmetry and Perturbation Theory, pp. 23–37. World Scientific Publishing, Singapore (1999)
Eliasson, H., Kuksin, S.: KAM for the nonlinear Schrödinger equation. Ann. Math. (2) 172(1), 371–435 (2010)
Feola, R., Giuliani, F., Montalto, R., Procesi, M.: Reducibility of first order linear operators on tori via Moser’s theorem. J. Funct. Anal. 276(3), 932–970 (2019)
Feola, R., Giuliani, F., Pasquali, S.: On the integrability of Degasperis–Procesi equation: control of the Sobolev norms and Birkhoff resonances. J. Differ. Equ. 266(6), 3390–3437 (2018)
Feola, R., Giuliani, F., Procesi, M.: Reducibility for a class of weakly dispersive linear operators arising from the Degasperis Procesi equation. Dyn. Partial Differ. Equ. 16(1), 25–94 (2019)
Feola, R., Grébert, B.: Reducibility of Schrödinger equation on the sphere. Int. Math. Res. Not. (2019). https://doi.org/10.1093/imrn/rnz344
Feola, R., Iandoli, F.: Local well-posedness for quasi-linear NLS with large Cauchy data on the circle. Annales de l’Institut Henri Poincaré C. Analyse non linéaire 36(1), 119–164 (2019)
Feola, R., Iandoli, F.: Long time existence for fully nonlinear NLS with small Cauchy data on the circle. Annali della Scuola normale superiore di Pisa, Classe di scienze (2018). https://doi.org/10.2422/2036-2145.201811003
Feola, R., Procesi, M.: Quasi-periodic solutions for fully nonlinear forced reversible Schrödinger equations. J. Differ. Equ. 259(7), 3389–3447 (2014)
Feola, R., Procesi, M.: KAM for quasi-linear autonomous NLS. Preprint 2017, arXiv:1705.07287
Giuliani, F.: Quasi-periodic solutions for quasi-linear generalized KdV equations. J. Differ. Equ. 262(10), 5052–5132 (2017)
Giuliani, F.: KAM for quasi-linear PDEs. Ph.D. thesis. http://hdl.handle.net/20.500.11767/57306
Grébert, B., Kappeler, T.: The Defocusing NLS Equation and Its Normal Form, EMS Series of Lectures in Mathematics. European Mathematical Society Publishing House, Zürich (2014)
Hou, Y., Zhao, P., Fan, E., Qiao, Z.: Algebro-geometric solutions for the Degasperis–Procesi hierarchy. SIAM J. Math. Anal. 45(3), 1216–1266 (2013)
Kappeler, T., Pöschel, J.: KAM and KdV. Springer, Berlin (2003)
Kuksin, S.: Hamiltonian perturbations of infinite-dimensional linear systems with imaginary spectrum. Funktsional. Anal. i Prilozhen. 21(3), 192–205 (1987)
Kuksin, S.: A KAM theorem for equations of the Korteweg–de Vries type. Rev. Math. Phys. 10(3), 1–64 (1998)
Kuksin, S., Perelman, G.: Vey theorem in infinite dimensions and its application to KdV. Discrete Contin. Dyn. Syst. 27(1), 1–24 (2010)
Kuksin, S., Pöschel, J.: Invariant Cantor manifolds of quasi-periodic oscillations for a nonlinear Schrödinger equation. Ann. Math. 143(1), 149–179 (1996)
Liu, Y., Yin, Z.: Global existence and blow-up phenomena for the Degasperis–Procesi equation. Commun. Math. Phys. 267(3), 801–820 (2006)
Liu, J., Yuan, X.: A KAM theorem for Hamiltonian partial differential equations with unbounded perturbations. Commun. Math. Phys 307, 629–673 (2011)
Montalto, R.: Quasi-periodic solutions of forced Kirchhoff equation. Nonlinear Differ. Equ. Appl. 24(1), 9 (2017)
Pöschel, J.: A KAM-theorem for some nonlinear partial differential equations. Ann. Scuola Norm. Sup. Pisa Cl. Sci. (4) 23(1), 119–148 (1996)
Procesi, C., Procesi, M.: A KAM algorithm for the non-linear Schrödinger equation. Adv. Math. 272, 399–470 (2015)
Wayne, E.: Periodic and quasi-periodic solutions of nonlinear wave equations via KAM theory. Commun. Math. Phys. 127, 479–528 (1990)
Yin, Z.: On the Cauchy problem for an integrable equation with peakon solutions. Ill. J. Math. 47(3), 649–666 (2003)
Acknowledgements
The authors wish to thank Andy Hone, Luca Biasco, Livia Corsi and Marcel Guardia for useful discussions and comments. The authors also thank the anonymous referees for their helpful remarks. This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme under Grant Agreement No. 757802.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by W. Schlag.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This research was supported by Miur-PRIN 2015 “Variational methods, with applications to problems in mathematical physics and geometry” n. 2015KB9WPT-008, by the ERC Grant “Hamiltonian PDEs and small divisor problems: a dynamical systems approach” n. 306414 under FP7 and by the ERC Project “FAnFArE” n. 637510. This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme under Grant Agreement No. 757802.
Appendices
Appendix A. Non-degeneracy conditions
Proof of Lemma 4.1
Recalling (4.6) we introduce a matrix \(\mathbb K\) so that \(\mathbb {A}=: (2/9)\,\, \mathrm {diag}\Big ({\lambda }(\overline{\jmath }_i)(1+\overline{\jmath }_i^2)\Big ) \mathbb K \). Now we show that the entries of \(\mathbb K\) are bounded by some constant independent of the \(\overline{\jmath }_i\). After some direct computations we have that
Obviously \(|\mathbb {K}_{j j} |\le 2\); regarding the off-diagonal terms, we note that \(0\le 2 k j \le k^2+j^2\), hence \(|\mathbb {K}_{j k} |\le 12\) if \(j\ne k\).
We consider the variables \(x, p_2, \ldots , p_{\nu }\) defined as
so that \(P(x,p_i)=\det (\mathbb K)\) is a rational function. It is easily seen that \(\mathbb K\) computed at \(p_i=1\) for all i, coincides with the matrix
where U is the matrix with components \(U_{ij}=1\) for any \( i,j=1, \ldots , \nu \). Its determinant is
We note that the absolute value of (A.2) is \(\ge 1\) at \(x=0\). We conclude that there exists \(0<\mathtt {r}_0<1\) such that
This implies the thesis. \(\quad \square \)
Lemma A.1
There exists \(0<\mathtt {r}_0<1\) such that, for any \(S^+\in \mathcal {V}(\mathtt {r})\) with \(0<\mathtt {r}\le \mathtt {r}_0 \) (see Definition 1.1), the following holds true:
Here \(\vec {v}\) and \(\vec {w}_j\) are defined in (8.28) (see also (7.5)), \(\overline{\omega }\) in (1.10) and \(\delta \) is some appropriately small pure constant.
Proof of (A.3)
The case \(|\ell |=1\) is trivial. For \(|\ell |=2\) we use the fact that the \(\overline{\jmath }_i\) are all distinct. For \(|\ell |=3,5\) we pass to the variables (A.1) and we get
We notice that \(L(0,1)=|\sum _i \ell _i|\ge 1\) (since \(\sum _i \ell _i\) has the same parity of \(|\ell |\)) so, by continuity, there exists \(0<\mathtt {r}_0<1\) such that \(L(x,p)>1/2\) for all \( 0\le x<\mathtt {r}_0,\,\,\, |p_i-1|\le \mathtt {r}_0 \). This implies the result. \(\quad \square \)
Proof of (A.4)
We first note that (recall (8.28))
Consider the change of variables (A.1). One can note that the matrix \(\mathbb {A}\) in (4.6) at \(p_i=1\), \(i=2, \ldots , \nu \), is given by
By (A.8) at \(p_i=1\) we have
We note that, for \(x=0\), one has \(|\det (\mathrm {I} -\mathbb {A}^{-1}\vec {v}\overline{\omega }^{T}) |=3\nu -1\ge 2\). Thus there is \(0<\mathtt {r}_0<1\) so that for all \(0\le x<\mathtt {r}_0\), \(|p_i-1|\le \mathtt {r}_0\) one has \(|\det (\mathrm {I} -\mathbb {A}^{-1}\vec {v}\overline{\omega }^{T})|\ge 1\).
\(\square \)
Proof of (A.5),(A.6)
We systematically use the variables (A.1). We define \(\Omega \!=\!\,\text{ diag }\,({\overline{\omega }_i})\), \(V=\,\text{ diag }\,(\vec {v}_i)\) and write \(\mathbb {A}=\Omega \mathbb {H}\,V\) where
Then \(\vec {w}_j\) in (8.28) can be written as \(\vec {w}_j=\Omega \,V\,b_j\) with
where \(\mathtt {e}_i\) is the i-th vector of the canonical basis of \(\mathbb {R}^{\nu }\). In the new coordinates (x, p) in (A.1), and setting \(t=j (\overline{\jmath }_1)^{-1}\), this reads as
We claim that
where the term \( O(|p-1|, |x |)\) is uniform in t.
By direct computations we have
with C independent of t, and for x, p in a neighborhood of (0, 1). Moreover \(\sup _{t} |f(t, x) |\le 3|x|^{2}/2\). Thus, for x, p sufficiently close to (0, 1), we have that
which implies the claim (A.10). Hence, setting \(s=k (\overline{\jmath }_1)^{-1}\), we have
where
We note that \(|t\, (t^4+t^{2}+1)^{-1} |\le 0.41\), hence each single component satisfies
provided that \(|p-1|,|x| \le \mathtt {r}_0\) small enough. Hence, if \(\nu \ge 2\), one has \( |\ell _i- \big (\frac{3 h(t, s)}{1+\nu }\,\vec {1}+O(|p-1 |, |x |)\big )\cdot \mathtt {e}_i |\ge \delta |\ell _i|{.} \) This concludes the proof of (A.5). The proof of (A.6) is the same just setting \(h(t)= t\, (t^4+t^2+1)^{-1}\). \(\quad \square \)
Proof of Lemma 5.1
It is well known that, thanks to the choice of \(\tau \) in (5.3), \(|\Omega _{\varepsilon }{\setminus }\mathcal {G}_0^{(0)} |\le C \varepsilon ^{2(\nu -1)}\gamma \) for some \(C=C(S)>0\). Thus we focus on the estimate for the measure of \(\mathcal {G}_0^{(1)}\). For indexes \(\ell \in \mathbb {Z}{\setminus }\{0\}\), \(j,k\in S^{c}\) satisfying
we define the sets \( T_{\ell j k}:=\{ \omega \in \Omega _{\varepsilon } : |\overline{\omega }\cdot \ell +\varepsilon ^2 \mathbb {A}\xi \cdot \ell + {\lambda }(j)-{\lambda }(k)+\varepsilon ^2 ({\lambda }(j)\mathfrak {l}_{j}-{\lambda }(k)\mathfrak {l}_k) |\le C \gamma \}\). Recalling (5.5) we have that \(\Omega _{\varepsilon }{\setminus }\mathcal {G}^{(1)}_{0}=\bigcup T_{\ell j k}\) where the union is restricted to \(\ell ,j,k\) satisfying (A.11).
Let us first study the \(\varepsilon \)-independent part of our small divisor. By (1.8) and (A.11) we have
By Lemma A.1 (see (A.3)) if \(|j|\ge \widehat{C}(S)\), which is easily computed in terms of \(\mathtt {r}\), (A.12) is bounded from below by \(\mathtt {r}/4\). By (A.11), \(|{\lambda }(j)-{\lambda }(k) |\le C(S)\). Since \(0<|\ell |\le 3\) and substituting \(\mathfrak {l}_j=c(\omega )+\kappa _j/{\lambda }(j)\) (recall Lemma 7.10), where \(c(\omega ),\kappa _j\) are defined in (7.2), (7.5))
for \(\varepsilon \) small enough (depending on \(\mathtt {r}\)) and by using the fact that \(\kappa _j-\kappa _k\) is uniformly bounded in j, k. This implies that \(T_{\ell jk}=\emptyset \) for \(\varepsilon \) small enough.
We are left to deal with the case \(|j|\le \widehat{C}(S)\). We write (A.12) as \(P(\overline{\jmath }_i,j)/Q(\overline{\jmath }_i, j)\) where P, Q are polynomials with integer coefficients and Q has no real zeros. We remark that \(1<Q< C(S)\) due to the condition \(|j|\le \widehat{C}(S)\). If \(P\ne 0\) then \(|P|\ge 1\) and again (A.12) is larger than some K(S). We conclude that \(T_{\ell jk}=\emptyset \) by reasoning as in the case j large. Now we study the case in which \(P=0\). Fixed \(\ell \) in (A.12), then P has degree four in j and so the condition \(P=0\) fixes at most four choices of j that we call \(\widehat{\jmath }_1, \widehat{\jmath }_2, \widehat{\jmath }_3, \widehat{\jmath }_4\). For \(P=0\) (which is (A.12) \(=\,0\)) we have
where \(\vec {v}\) is in (8.28) and \(\kappa _j=w_j\cdot \xi \) with \(\kappa _j\) in (7.5). These are a finite number (depending only on \(\nu \)) of linear functions of \(\xi \). We compute the derivative in \(\xi \) which is
where \(\widehat{\jmath }\in \{ \widehat{\jmath }_1, \widehat{\jmath }_2, \widehat{\jmath }_3, \widehat{\jmath }_4 \}\). Now (A.5) implies that the quantity (A.14) is bounded from below by a constant depending on S. This lower bound and Fubini Theorem imply that \(|T_{\ell jk}|\le C(S)\varepsilon ^{2(\nu -1)}{\gamma }\) for some \(C(S)>0\) depending on S. By the discussion above we have
where \(K(S)>0\) and \(C(S)>0\) are constant depending on the set S. This implies the thesis. \(\quad \square \)
Appendix B. Normal form identification
Proof of Theorem 7.9
The core of Theorem 7.9 is to show that the terms in the l.h.s. of (7.78), which are obtained through a rather complicated sets of bounded changes of coordinates, coincide with the ones obtained by a purely formal full Birkhoff Normal Form procedure. In [32] it has been shown that, at purely formal level, the latter is well-defined and not resonant, i.e. the resonant Hamiltonian is supported only on trivial resonances as in (3.11). We procede as follows.
Step 1 The first step is to show that resonant terms at order \(\varepsilon ^{2}\) of the Full Birkhoff normal form coincide with the ones obtained by using the weak BNF procedure in Sect. 3, passing to action-angle variables and finally using a formal linear BNF.
Step 2 In order to conclude we note that the bounded maps we applied in Sects. 7.1 and 7.2 are, as functions of \(\varepsilon \), \(C^{3}\) with values in \(\mathcal {L}(H^{s}, H^{s-3})\) . Therefore the Taylor expansion of the Hamiltonian associated to the operators in (7.69) coincides with the Lie series of the generator up to order \(\varepsilon ^2\) (see also (7.71)). Then we show that the Lie series coincides (up to order \(\varepsilon ^2\)) to the one obtained in step 1. Even though we taylor our proof to the particular set of changes of variables that we use in Sect. 7, the argument is quite general and is essentially that the linear BNF up to order \(\varepsilon ^2\) is coordinate independent.
1.1 Formal equivalence between weak \(+\)linear and full BNF procedure
One step of formal BNF means that we apply the formal change of variables generated by
which removes completely \(H^{(3)}\) and conjugates H (using (2.5)) to \(H\circ {\Phi }_{(\mathfrak {F}^{(3)})^{-1}}=H^{(2)}+\frac{1}{2} \{\mathfrak {F}^{(3)}, H^{(3)}\} +\) h.o.t.
The scecond step of formal BNF removes all the non-resonant terms of degree four thus we get
In [32] it has been proved that \(\big (\Pi _{{\text{ Ker }}(H^{(2)})} -\Pi _{\mathrm{triv}}\big ) \{\mathfrak {F}^{(3)}, H^{(3)}\}= 0\) (see the notation of Definition 7.8) and hence
We claim that
This implies that
Proof of the claim (B.3)
Let us consider the term \(\{\mathfrak {F}^{(3,1)},H^{(3,3)}\}\), where
for some coefficients \({C}_{j_1j_2j_3}, \widetilde{C}_{j_1j_2j_3}\in \mathbb {C}\). Using (2.2) one gets
A monomial \(u_{j_1}u_{j_2}u_{k_1}u_{k_2}\) is supported on trivial resonances (3.11) only if \(j_1=-j_2\) and \(k_1=-k_2\), since \(j_i\in S\), \(k_i\in S^{c}\), \(i=1,2\). On the other hand, by the momentum condition, we have that \(j_1+j_2=-(k_1+k_2)\), which is not possible since \(0\notin S\cup S^{c}\) (see (1.9)). Hence the (B.3) holds. The others equalities in (B.3) in follows in the same way. \(\quad \square \)
Let us now perform the same Birkhoff procedure by first cancelling the terms of degree \(\le 1\) in z (weak BNF) and then the terms of degree two (linear BNF).
By the discussion in Sect. 3, recall the notations of Proposition 3.6, we have that, after two steps of weak Birkhoff normal form, the Hamiltonian of degree less or equal than 4 is
Here \(Z^{(4,0)}_2\) is defined in (3.23),
where \(H_1^{(3)}, H_1^{(4)}\) are defined in (3.19). The monomials of degree greater than 4 will be not involved in this computation, so we omit them. It is important to notice that, by direct inspection, \(\mathfrak {F}^{(3,\le 1)}= {F}^{(3,\le 1)}\) defined in formula (3.21). By Proposition 3.6, we know that the same change of variables puts one of the constants of motion (lets say \(K_3\) and drop the subindex 3) into normal form,
The step of formal linear BNF entails applying the formal change of variables generated by
Again, by direct inspection, one can note that \(F^{(3,2)}\equiv \mathfrak {F}^{(3,2)}:=\Pi ^{d_{x}=2}\mathfrak {F}^{(3)}\), where \(\mathfrak {F}^{(3)}\) is given in (B.1). Thus obtaining the Hamiltonian
Since (B.8) puts in normal form also \(K\circ \Phi ^{-1}_2\) following the same reasoning as in [32] and Proposition 3.6, we get that
We now want to pass to the action-angle variables introduced in (4.7). Since the rescaling with the parameter \(\varepsilon \) is covariant under the change of variables that we use, we consider instead of \(A_{\varepsilon }\) the symplectic change of variables \(A_1:=A_{\varepsilon _{|_{\varepsilon =1}}}\). By recalling that
and setting
we have that (B.10) reads
1.2 The rigorous procedure of subsections 7.1, 7.2 and the linear BNF
Since the r.h.s. of (B.12) is the r.h.s. of (7.78) it remains to show that the \(\varepsilon ^{2}\)-terms of the Hamiltonian associated to the operator \(\mathcal {L}_{7}\) in (7.69) coincides with the l.h.s. in (B.12). We remark that the operator \(\mathcal {L}_7\) has been constructed through a rigorous procedure providing also “tame” estimates of the remainder of higher order in \(\varepsilon \). As already explained the maps \(\mathcal {B}\), \(\Upsilon _i\), \(i=1,2\), are, as functions of \(\varepsilon \), \(C^{3}\) with values in \(\mathcal {L}(H^{s}, H^{s-3})\) . Therefore the Taylor expansion of the Hamiltonian associated to the operators in (7.69) coincides with the Lie series of the generator up to order \(\varepsilon ^2\) (see also (7.71)).
Let us then Taylor expand the Hamiltonian \(\mathcal {K}^{(2)}=\mathsf {H}\circ \mathcal {B}^{-1}\circ \Upsilon _1^{-1}\circ {\Upsilon _2}^{-1}\) (see (7.56), (7.57), (7.65) and (7.76)) up to order \(\varepsilon ^{2}\). It is sufficient to consider \(\mathcal {B}_1\) the flow generated by the Hamiltonian in (7.20) and \(\Upsilon _1\) the flow of the Hamiltonian \(H_{\mathtt {A_1}}\) (which has the form (C.47) and satisfies (7.66)). We have that
Using the Jacobi identity we have
Moreover, setting \(\mathsf {F}_1:=S_1+H_{\mathtt {A}_1}\), we note that
Since \(\Pi _{\mathrm{Ker}(\mathtt {H}_0)}\{S_2,\mathsf {H}_0\}_{e}\equiv 0\), by (B.13), (B.15), and (B.16) we get
Since \(\text{ Ker }(H^{(2)})\) is trivial on cubic monomials, we deduce that
and this concludes the proof. \(\quad \square \)
Appendix C. Technical Lemmata
1.1 C.1 Flows of hyperbolic PDEs
In this subsection we study some properties of the flow of (7.11). We start by studying the time-one flow map of the pseudo differential PDE
and how this differs from \(\Phi ^{\tau }\). Proposition 3.1 in [33] guarantees that the flow of (C.1) is the composition of the diffeomorphism of the torus
where \(\tilde{\beta }(\tau ;x,\xi )\) is such that
with a pseudo differential operator of order \(-1\) up to smoothing remainders belonging to the class \(\mathfrak {L}_{\rho ,p}\) (see Definition 2.8 in [33]) for any \(\rho \in \mathbb {N}\), \(\rho \ge 3\), \(p\ge s_0\). For completeness we restate the definition of \(\mathfrak {L}_{\rho ,p}\) in Appendix C.2 (see Definition C.5). The class \(\mathfrak {L}_{\rho ,p}\) has the property of being closed w.r.t. changes of variables as \(\Phi ^\tau \) (see Lemma B.10 in [33]). Moreover, in Lemma C.8, we show that \(\mathfrak {L}_{\rho ,p}\) is contained into \(\mathfrak {C}_{-1}\) which is another class of “tame” operators. Such class is introduced in Definition C.6 and is included in \(-1\)-tame operators, see Lemma C.10.
We refer to Proposition 3.1, Corollary 3.2 and Appendix A in [33] for properties and estimates of \(\Psi ^{\tau }\) and \(\mathcal {A}^{\tau }\) in (C.2).
Lemma C.1
Fix \(\rho \ge 3\) and \(p\ge s_0\). There exist \(\delta \ll 1\), \(\sigma _1:=\sigma _1(\rho ,p)\) such that if
then the following holds. Let \(\Psi ^{\tau }\) be the flow of the system (C.1), then the flow of (7.11) \(\Phi ^\tau \) is well defined for \(|\tau |\le 1\) and one has \(\Phi ^1= \Pi _S^\perp \Psi ^1\Pi _S^\perp \circ (\mathrm {I} +\mathcal {R})\) where \(\mathcal {R}\) is an operator with the form (6.16). Moreover \(\mathcal {R}\) belongs to \(\mathfrak {L}_{\rho ,p}(\mathcal {O})\) and satisfies
Proof
Proposition 3.1 in [33] provides \(\delta , \sigma _1\) such that if the smallness condition (C.3) is satisfied with such parameters, then the flow \(\Psi ^{\tau }\) is well-defined for \(|\tau |\le 1\). Let us define \(\Upsilon ^{\tau }\) as the flow of the following Cauchy problem
with
Equation (C.5) is well posed on \(H^{s}\) since its vector field is finite rank. By the following computations
we have that \(\Phi ^{\tau }=\Pi _S^{\perp }\circ \Psi ^{\tau }\circ \Upsilon ^{\tau }\) is well-defined on \(H^s\) and solves (7.11).
Now we show that \(\Upsilon ^{\tau }-\mathrm {I}\) is of the form (6.16). By Taylor expansion at \(\tau =0\) we get
Note that
has already the form (6.16) and \(\big (\Psi ^{\tau }_* \mathtt {Z} (\Upsilon ^{\tau } u)\big )_{|_{\tau =0}}=\mathtt {Z} u\). We denoted by \((\Psi ^{\tau })^*\), \((\Phi ^{\tau })^*\) the flows of the adjoint PDEs
We have
where
Thus by (C.6), for \(\tau =1\) we get
where \(\mathcal {R}_1\) has the finite dimensional form (6.16) and \(\mathcal {R}_2\) has the form (6.16). Hence by Lemma C.7 we have
By using the estimates in Proposition 3.1 and Corollary 3.2 in [33] we have
In the same way, the bounds for the variation on the i-variable (the second in (C.4)) follows by the estimates on the derivatives of the coefficients \(\mathtt {g}_j, \tilde{\mathtt {g}_j}, \chi _j, \tilde{\chi _j}\) whose depend on the variation \(\Delta _{1 2}\) of the flows \(\Phi ^{\tau }\), \(\Psi ^{\tau }\) and their adjoints. We have proved that \(\Upsilon ^1u=u+\mathcal {R} u\) and hence \(\Phi ^1 u=\Psi ^{1}\circ (\mathrm {I}+\mathcal {R})u\). By (C.9), (C.10) and (C.11) we obtain (C.4). \(\quad \square \)
The system (7.11) is a Hyperbolic PDE, thus we shall use a version of Egorov Theorem to study how pseudo differential operators change under the flow \(\Phi ^{\tau }\). This is the content of Theorem 3.4 in [33] which provides precise estimates for the transformed pseudo differential operators.
The following proposition is the counter-part of Proposition 3.5 in [33]. It describes the structure of an operator like \(\mathcal {L}_{\omega }\) conjugated by a flow of a system like (7.11).
Proposition C.2
(Conjugation). Let \(\mathcal {O}\) be a subset of \(\mathbb {R}^\nu \). Fix \(\rho \ge 3\), \(\alpha \in \mathbb {N}\), \(p\ge s_0\) and consider a linear operator
where \(m=m(\omega )\) is a real constant, \(a=a(\omega ,i(\omega ))\in C^\infty (\mathbb {T}^{\nu +1})\) is real valued, both are Lipschitz in \(\omega \in \mathcal {O}\) and a is Lipschitz in the variable i. Moreover \(\mathcal {Q}=\mathrm{Op}(\mathtt {q}(\varphi , x, \xi ))+\widehat{\mathcal {Q}}\) with \(\widehat{\mathcal {Q}}\in \mathfrak {L}_{\rho ,p}(\mathcal {O})\) and \(\mathtt {q}=\mathtt {q}(\omega ,i(\omega ))\in S^{-1}\) satisfying
Here \(\mathtt {k}_1, \mathtt {k}_2, \mathtt {k}_3,\sigma >0\) are constants depending on \(\mathtt {q}\) while \(f=f(\omega ,i(\omega ))\in C^\infty (\mathbb {T}^{\nu +1})\), is Lipschitz in \(\omega \) and in the variable i . There exist \(\tilde{\sigma }=\tilde{\sigma }(\rho )>0\) and \(\delta _*:=\delta _*(\rho )\) such that, if
the following holds for \(p+\sigma \le s_0+\tilde{\sigma }\). Consider \(\Phi :=\Phi ^{1}\) the flow at time one of the system (7.11), where b is defined in (7.10). Then we have
where
with \(\tilde{\beta }\) the function such that \(x+\tilde{\beta }(\varphi , x)\) is the inverse of the diffeomorphism of the torus \(x\mapsto x+\beta (\varphi , x)\). The operator \(\mathcal {Q}_+:=\mathrm{Op}(\mathtt {q}_+(\varphi , x, \xi )) +\widehat{\mathcal {Q}}_+\), with
and \(\widehat{\mathcal {Q}}_+\in \mathfrak {L}_{\rho ,p}(\mathcal {O})\) with, for \(s_0\le s \le \mathcal {S}\),
for any \(0\le \mathtt {b}\le \rho -2\) and
for any \(0\le \mathtt {b}\le \rho -3\).
Proof
The strategy is the following.
-
We conjugate
$$\begin{aligned} \mathcal {L}^0:= \omega \cdot \partial _{\varphi }- J \circ (m+a(\varphi , x)) +\mathcal {Q} \end{aligned}$$(C.21)by the flow \(\Psi ^{\tau }\) in (C.1). In order to do this we use Proposition 3.5 in [33].
-
The operator \(\mathcal {L}^0\) differs from \(\mathcal {L}\) by a infinitely regularizing operator of the form (6.16). By using this fact and Lemma C.1 we estimate the difference between \(\Phi ^{\tau } \mathcal {L} (\Phi ^{\tau })^{-1}\) and \(\Psi ^{\tau } \mathcal {L}^0 (\Psi ^{\tau })^{-1}\).
Let \(\Psi ^{\tau }\) be the flow in (C.1). The (C.13)-(C.14) imply that \(\mathcal {L}_0\) in (C.21) satisfies the hypotheses of Proposition 3.5 in [33]. Hence, if \(y:=x+\beta (\varphi , x)\), then we have
In particular \(\mathtt {q}_+\) and \(\widehat{\mathcal {Q}}_{\star } \) satisfy bounds like (C.18)-(C.20). By Lemma C.1 we have (recall (C.7), (C.8))
We define the remainder \(\widehat{\mathcal {Q}}_+:=\widehat{\mathcal {Q}}_{\star }+\widehat{\mathcal {Q}}_{\star \star }\). To conclude the proof we show that \(\widehat{\mathcal {Q}}_{\star \star }\) satisfies the bounds (C.19) and (C.20). We note that
Thus by Lemma C.7, Corollary 3.2 in [33] (for the estimates on \(\Psi \)) and (C.21) we get the bounds (C.19) and (C.20) for the operators (C.22). The bounds on \(\Psi \mathcal {L} \Gamma \Psi ^{-1}, \Psi \mathcal {L} \mathcal {R} \Psi ^{-1}, \Psi \mathcal {R} \mathcal {L} \Gamma \Psi ^{-1}\) follow by Proposition 3.1 in [33] and (C.4). \(\quad \square \)
Remark C.3
Assume that the symbols \(a, \mathtt {q}\) and the smoothing operator \(\widehat{\mathcal {Q}}\) in (C.12) admits an expansion in \(\varepsilon \) (actually in degree of homogeneity of \(\overline{v}\) in (6.25)) of the form
where \(a^{(i)}, \mathtt {q}^{(i)}\) have the form respectively (6.36), (6.34), and \(a^{(\ge 4)}\), \(\mathtt {q}^{(\ge 4)}\) satisfy estimates like (7.42). The remainders \(\widehat{\mathcal {Q}}^{(i)}\) are almost-diagonal (see Def. 6.4) and \(\widehat{\mathcal {Q}}^{(\ge 4)}\) satisfies estimates like (7.42). Assume also that \(\beta \) in (7.10) has an expansion as in (C.23). Then the symbols \(a^{+}, \mathtt {q}^{+}\) and the operator \(\mathcal {Q}^{+}\) in (C.16) admit the same expansion in \(\varepsilon \) as in (C.23). This fact can be deduced by following the proof of Proposition 3.5 in [33]. More precisely one reasons as follows. First of all, by linearity, the conjugate of a sum as in (C.23) is the sum of the conjugates. The conjugate of a \(\widehat{\mathcal {Q}}^{(i)}\) in (C.23) under the flow of (7.11) is a smoothing remainder by applying Lemma B.10 in [33]. Of course in order to obtain homogeneous terms of degree \(\le 3\) in \(\varepsilon \) we must Taylor expand the flow, following Remark 7.2 this implies that the remainders are in \(\mathfrak {L}_{\rho -3,p}\) (of course since \(\rho \) is arbitrary this is not a problem).
The conjugation of the pseudo differential operators \(J\circ a(\varphi ,x)\) and \(\mathrm{Op}(\mathtt {q}(\varphi , x, \xi ))\) is based on the Egorov Theorem 3.4 in [33]. This is a constructive perturbation scheme so we can Taylor expand up to order three. In conclusion (C.18) holds for each term in the homogeneity expansion, possibly with a larger \(\tilde{\sigma }\). On the other hand expanding the remainder \(\mathcal {Q}^{+}\) gives an estimate as (C.19) but with \(\rho \rightsquigarrow \rho -3\).
Remark C.4
We point out that the remainder in Proposition C.2 is of the order of \(\beta \), i.e. of the generator of the torus diffeomorphism. This will create problems in fulfilling the smallness conditions in the KAM reducibility scheme of Sect. 7.3, where a term is perturbative if it is small w.r.t. \({\gamma }^{3/2}\) ( and \({\gamma }\ll \varepsilon ^2\) ).
1.2 C.2 Classes of “smoothing” operators
In the first step of our reduction procedure (see Theorem 7.1) we need to work with operators which are pseudo differential up to a remainder in the class \(\mathfrak {L}_{\rho ,p}\) defined as follows. This class of smoothing (in space) operators has been introduced in [33].
Definition C.5
Fix \(s_0\ge (\nu +1)/2\) and \(p,\,\mathcal {S}\in \mathbb {N}\) with \(s_0\le p< \mathcal {S}\) with possibly \(\mathcal {S}=+\infty \). Fix \(\rho \in \mathbb {N}\), with \(\rho \ge 3\) and consider any subset \(\mathcal {O}\) of \(\mathbb {R}^\nu \). We denote by \(\mathfrak {L}_{\rho , p}=\mathfrak {L}_{\rho , p}(\mathcal {O})\) the set of the linear operators \(A=A(\omega ):H^s(\mathbb {T}^{\nu +1})\rightarrow H^{s}(\mathbb {T}^{\nu +1})\), \(\omega \in \mathcal {O}\) with the following properties:
-
The operator A is Lipschitz in \(\omega \),
-
The operators \(\partial _{\varphi }^{\vec {\mathtt {b}}} A\), \([\partial _{\varphi }^{\vec {\mathtt {b}}} A, \partial _x]\), for all \(\vec {\mathtt {b}}=(\mathtt {b}_1,\ldots ,\mathtt {b}_\nu )\in \mathbb {N}^{\nu }\) with \(0\le |\vec {\mathtt {b}}| \le \rho -2\) have the following properties, for any \(s_0\le s\le \mathcal {S}\), with possibly \(\mathcal {S}=+\infty \):
-
(i)
For any \(m_{1},m_{2}\in \mathbb {R}\), \(m_{1},m_{2}\ge 0\) and \(m_{1}+m_{2}=\rho -|\vec {\mathtt {b}}|\) one has that the operator \(\langle D_{x}\rangle ^{m_{1}} \partial _{{\varphi }}^{\vec {\mathtt {b}}}A \langle D_{x}\rangle ^{m_{2}}\) is Lip-0-tame according to Definition 2.3 and we set
$$\begin{aligned} \mathfrak {M}^{\gamma }_{\partial _{{\varphi }}^{\vec {\mathtt {b}}}A}(-\rho +|\vec {\mathtt {b}}|,s):= \sup _{\begin{array}{c} m_{1}+m_{2}=\rho -|\vec {\mathtt {b}}|\\ m_{1},m_{2}\ge 0 \end{array}}\mathfrak {M}^{\gamma }_{\langle D_{x}\rangle ^{m_{1}} \partial _{{\varphi }}^{\vec {\mathtt {b}}}A \langle D_{x}\rangle ^{m_{2}}}(0,s)\,; \end{aligned}$$(C.24) -
(ii)
For any \(m_{1},m_{2}\in \mathbb {R}\), \(m_{1},m_{2}\ge 0\) and \(m_{1}+m_{2}=\rho - |\vec {\mathtt {b}}|-1\) one has that \(\langle D_{x}\rangle ^{m_{1}} [\partial _{{\varphi }}^{\vec {\mathtt {b}}}A,\partial _{x}] \langle D_{x}\rangle ^{m_{2}}\) is Lip-0-tame according to Definition 2.3 and we set
$$\begin{aligned} \mathfrak {M}^{\gamma }_{[\partial _{{\varphi }}^{\vec {\mathtt {b}}}A,\partial _{x}]}(-\rho +|\vec {\mathtt {b}}|+1,s):= \sup _{\begin{array}{c} m_{1}+m_{2}=\rho -|\vec {\mathtt {b}}|-1\\ m_{1},m_{2}\ge 0 \end{array}}\mathfrak {M}^{\gamma }_{\langle D_{x}\rangle ^{m_{1}} [\partial _{{\varphi }}^{\vec {\mathtt {b}}}A,\partial _{x}] \langle D_{x}\rangle ^{m_{2}}}(0,s){.} \end{aligned}$$(C.25)
We define for \(0\le \mathtt {b}\le \rho -2\)
$$\begin{aligned} \begin{aligned} \mathbb {M}^{\gamma }_{A}(s, \mathtt {b}):=&\max _{0\le |\vec {\mathtt {b}}|\le \mathtt {b}}\max \left( \mathfrak {M}_{\partial _{{\varphi }}^{\vec {\mathtt {b}}}A}^{\gamma }(-\rho +|\vec {\mathtt {b}}|,s), \mathfrak {M}_{\partial _{{\varphi }}^{\vec {\mathtt {b}}}[A,\partial _{x}]}^{\gamma }(-\rho +|\vec {\mathtt {b}}|+1,s)\right) . \end{aligned} \end{aligned}$$(C.26)Assume now that the set \(\mathcal {O}\) and the operator A depend on \(i=i(\omega )\), and are well defined for \(\omega \in \mathcal {O}_0\subseteq \Omega _{\varepsilon }\) for all i satisfying (6.12). We consider \(i_{1}=i_{1}(\omega )\), \(i_{2}=i_{2}(\omega )\) and for \(\omega \in \mathcal {O}(i_1)\cap \mathcal {O}(i_2)\) we define
$$\begin{aligned} \Delta _{12}A:=A(i_1)-A(i_2){.} \end{aligned}$$(C.27)We require the following:
-
(i)
-
The operators \(\partial _{{\varphi }}^{\vec {\mathtt {b}}}\Delta _{12}A \), \([\partial _{{\varphi }}^{\vec {\mathtt {b}}}\Delta _{12}A ,\partial _{x}]\), for \(0\le |\vec {\mathtt {b}}|\le \rho -3\), have the following properties, for any \(s_0\le s\le \mathcal {S}\), with possibly \(\mathcal {S}=+\infty \):
-
(iii)
For any \(m_{1},m_{2}\in \mathbb {R}\), \(m_{1},m_{2}\ge 0\) and \(m_{1}+m_{2}=\rho -|\vec {\mathtt {b}}|-1\) one has that \(\langle D_{x}\rangle ^{m_{1}} \partial _{{\varphi }}^{\vec {\mathtt {b}}} \Delta _{12}A \langle D_{x}\rangle ^{m_{2}}\) is bounded on \(H^{p}\) into itself. More precisely there is a positive constant \(\mathfrak {N}_{\partial _{{\varphi }}^{\vec {\mathtt {b}}}\Delta _{12}A }(-\rho +|\vec {\mathtt {b}}|+1,p)\) such that, for any \(h\in H^{p}\), we have
$$\begin{aligned} \sup _{\begin{array}{c} m_{1}+m_{2}=\rho -|\vec {\mathtt {b}}|-1\\ m_{1},m_{2}\ge 0 \end{array}} \Vert \langle D_{x}\rangle ^{m_{1}} \partial _{{\varphi }}^{\vec {\mathtt {b}}}\Delta _{12}A\langle D_{x}\rangle ^{m_{2}} h\Vert _{p}\le \mathfrak {N}_{\partial _{{\varphi }}^{\vec {\mathtt {b}}}\Delta _{12}A }(-\rho +|\vec {\mathtt {b}}|+1,p)\Vert h\Vert _{p}\,; \end{aligned}$$(C.28) -
(iv)
For any \(m_{1},m_{2}\in \mathbb {R}\), \(m_{1},m_{2}\ge 0\) and \(m_{1}+m_{2}=\rho -|\vec {\mathtt {b}}|-2\) one has that \(\langle D_{x}\rangle ^{m_{1}} [\partial _{{\varphi }}^{\vec {\mathtt {b}}} \Delta _{12}A ,\partial _{x}] \langle D_{x}\rangle ^{m_{2}}\) is bounded on \(H^{p}\) into itself. More precisely there is a positive constant \(\mathfrak {N}_{[\partial _{{\varphi }}^{\vec {\mathtt {b}}}\Delta _{12}A,\partial _{x}] }(-\rho +|\vec {\mathtt {b}}|+2,p)\) such that for any \(h\in H^{p}\) one has
$$\begin{aligned} \sup _{\begin{array}{c} m_{1}+m_{2}=\rho -|\vec {\mathtt {b}}|-2\\ m_{1},m_{2}\ge 0 \end{array}} \Vert \langle D_{x}\rangle ^{m_{1}}[ \partial _{{\varphi }}^{\vec {\mathtt {b}}}\Delta _{12}A,\partial _{x}]\langle D_{x}\rangle ^{m_{2}} h\Vert _{p}\le \mathfrak {N}_{[\partial _{{\varphi }}^{\vec {\mathtt {b}}}\Delta _{12}A, \partial _{x}] }(-\rho +|\vec {\mathtt {b}}|+2,p)\Vert h\Vert _{p}{.} \end{aligned}$$(C.29)
-
(iii)
We define for \(0\le \mathtt {b}\le \rho -3\)
By construction one has that \(\mathbb {M}^{\gamma }_{A}(s, \mathtt {b}_1)\le \mathbb {M}^{\gamma }_{A}(s, \mathtt {b}_2)\) if \(\mathtt {b}_1\le \mathtt {b}_2\le \rho -2\) and \(\mathbb {M}_{\Delta _{12}A }(p, \mathtt {b}_1)\le \mathbb {M}_{\Delta _{12}A }(p, \mathtt {b}_2)\) if \(\mathtt {b}_1\le \mathtt {b}_2\le \rho -3\).
We shall also deal with “tame” operators in the following class.
Definition C.6
Fix \(\mathcal {S}\in \mathbb {N}\) with possibly \(\mathcal {S}=+\infty \). Fix \(\mathtt {b}=s_0+6\tau +6\) and consider \(\mathcal {O}\subseteq \mathbb {R}^\nu \). We denote by \(\mathfrak {C}_{-1}:=\mathfrak {C}_{-1}(\mathcal {O})\) the set of the linear operators \(A=A(\omega ):H^s(\mathbb {T}^{\nu +1})\rightarrow H^{s}(\mathbb {T}^{\nu +1})\), \(\omega \in \mathcal {O}\) which satisfy the following for any \(s_0\le s\le \mathcal {S}\):
-
\(\langle D_x \rangle ^{1/2} A \langle D_x \rangle ^{1/2}\), \(\langle D_x \rangle ^{1/2}\partial _{\varphi _m}^{\mathtt {b}} A \langle D_x \rangle ^{1/2}\), \(\langle D_x \rangle ^{1/2}[\partial _{\varphi _m}^{\mathtt {b}} A, \partial _x]\langle D_x \rangle ^{1/2}\), for \(m=1, \ldots , \nu \), \(0\le \mathtt {b}_1\le \mathtt {b}\) are Lip-0-tame operators (see Definition 2.3) and we define
$$\begin{aligned} \mathfrak {M}^{{\gamma }}_{\partial _{\varphi _m}^{\mathtt {b}_1}A}(-1, s)&:= \mathfrak {M}^{{\gamma }}_{\langle D_x \rangle ^{1/2} \partial _{\varphi _m}^{\mathtt {b}_1}A \langle D_x \rangle ^{1/2}}(0, s),\nonumber \\ \mathfrak {M}^{{\gamma }}_{\partial _{\varphi _m}^{\mathtt {b}_1}[A, \partial _x]}(-1, s)&:=\mathfrak {M}^{{\gamma }}_{\langle D_x \rangle ^{1/2}\partial _{\varphi _m}^{\mathtt {b}_1} [A, \partial _x]\langle D_x \rangle ^{1/2}}(0, s) {,} \end{aligned}$$(C.31)$$\begin{aligned} \mathbb {B}^{{\gamma }}_A(s)&:=\max _{\begin{array}{c} 0\le \mathtt {b}_1\le \mathtt {b}\\ m=1,\ldots , \nu \end{array}} \max \Big (\mathfrak {M}^{{\gamma }}_{\partial _{\varphi _m}^{\mathtt {b}_1}A}(-1, s), \mathfrak {M}^{{\gamma }}_{\partial _{\varphi _m}^{\mathtt {b}_1}[A, \partial _x]}(-1, s) \Big ){.} \end{aligned}$$(C.32)Assume now that the set \(\mathcal {O}\) and the operator A depend on \(i=i(\omega )\), and are well defined for \(\omega \in \mathcal {O}\subseteq \mathbb {R}^\nu \) for all i satisfying (6.12). We consider \(i_{1}=i_{1}(\omega )\), \(i_{2}=i_{2}(\omega )\) and for \(\omega \in \mathcal {O}(i_1)\cap \mathcal {O}(i_2)\) let \(\Delta _{12}A\) as in (C.27). We require the following:
-
\(\langle D_x \rangle ^{1/2} \partial _{\varphi _m}^{\mathtt {b}_1} \Delta _{12} A \langle D_x \rangle ^{1/2}\), \(\langle D_x \rangle ^{1/2}[\partial _{\varphi _m}^{\mathtt {b}_1} \Delta _{12} A, \partial _x]\langle D_x \rangle ^{1/2}\) for \(m=1, \ldots , \nu \), \(0\le \mathtt {b}_1\le \mathtt {b}\) are bounded operators on \(H^{s_0}\) into itself. More precisely there are positive constants \(\mathfrak {N}_{\partial _{\varphi _m}^{\mathtt {b}_1}\Delta _{12} A}(-1, s_0)\), and \(\mathfrak {N}_{[\partial _{\varphi _m}^{\mathtt {b}_1}\Delta _{12} A,\partial _{x}]}(-1, s_0)\) such that, for any \(h\in H^{s_0}\), we have
$$\begin{aligned} \Vert \langle D_x \rangle ^{1/2} \partial _{\varphi _m}^{\mathtt {b}_1} \Delta _{12} A \langle D_x \rangle ^{1/2}h\Vert _{s_0}&\le \mathfrak {N}_{\partial _{\varphi _m}^{\mathtt {b}_1}\Delta _{12} A}(-1, s_0)\Vert h\Vert _{s_0}{,}\nonumber \\ \Vert \langle D_x \rangle ^{1/2} [\partial _{\varphi _m}^{\mathtt {b}_1} \Delta _{12} A,\partial _{x}] \langle D_x \rangle ^{1/2}h\Vert _{s_0}&\le \mathfrak {N}_{[\partial _{\varphi _m}^{\mathtt {b}_1}\Delta _{12} A,\partial _{x}]}(-1, s_0)\Vert h\Vert _{s_0}{.} \end{aligned}$$(C.33)We define
$$\begin{aligned} \mathbb {B}_{\Delta _{12} A}(s_0):= \max _{\begin{array}{c} 0\le \mathtt {b}_1\le \mathtt {b}\\ m=1,\ldots , \nu \end{array}} \max \Big (\mathfrak {N}_{\partial _{\varphi _m}^{\mathtt {b}_1}\Delta _{12}A}(-1, s_0), \mathfrak {N}_{\partial _{\varphi _m}^{\mathtt {b}_1}[\Delta _{12}A, \partial _x]}(-1, s_0) \Big ){.} \end{aligned}$$(C.34)
The next Lemma shows that the finite rank operators of the form (6.16) are in \(\mathfrak {L}_{\rho , p}\).
Lemma C.7
Fix \(\rho \ge 3\). Let \(\mathcal {R}\) be an operator of the form (6.16), where the functions \(g_j(\tau ), \chi _j(\tau )\) belong to \(H^s\) for \(\tau \in [0, 1]\) and depend in a Lipschitz way on the parameter \(\omega \in \mathcal {O}\subset \mathbb {R}^{\nu }\). Then there exists \(\sigma _1=\sigma _1(\rho )>0\) such that \(\mathcal {R}\) belongs to \(\mathfrak {L}_{\rho , p}\) and
Proof
The Lemma follows by reasoning as in the proof of Lemma B.2 in [33] and using the explicit formula (6.16). \(\quad \square \)
We conclude this section by showing the connection between the class \(\mathfrak {L}_{\rho ,p}\) and the class \(\mathfrak {C}_{-1}\) in Definition C.6.
Lemma C.8
Consider \(\mathtt {b}\in \mathbb {N}\) and \(\rho \in \mathbb {N}\) with \(\rho \ge \mathtt {b}+3\). The following holds.
-
(i)
If \(A\in \mathfrak {L}_{\rho ,p}\) (see Definition C.5) then \(A\in \mathfrak {C}_{-1}\) (see Definition C.6) with
$$\begin{aligned} \mathbb {B}^{\gamma }_{A}(s,\mathtt {b}) \le _{\rho ,s}\mathbb {M}^{\gamma }_{A}(s,\rho -2){,} \qquad \mathbb {B}_{\Delta _{12}A }(p,\mathtt {b}) \le _{\rho ,p}\mathbb {M}_{\Delta _{12}A }(p,\rho -3){.} \end{aligned}$$(C.37) -
(ii)
Consider a symbol \(a=a(\omega , i(\omega ))\) in \(S^{m}\) with \(m\le -1\) depending on \(\omega \in \mathcal {O}_0\subset \mathbb {R}^{\nu }\) in a Lipschitz way and on i in a Lipschitz way and let \(A:=\mathrm{Op}(a(x,\xi ))\). Then one has that \(A\in \mathfrak {C}_{-1}\) with
$$\begin{aligned} \mathbb {B}^{\gamma }_{A}(s,\mathtt {b}) \le _{s}|a|^{{\gamma },\mathcal {O}_0}_{m,s+\mathtt {b}+2, 0}{,} \quad \mathbb {B}_{\Delta _{12} A}(p,\mathtt {b}) \le _{s}|\Delta _{12} a|_{m,p+\mathtt {b}+3, 0}. \end{aligned}$$(C.38) -
(iii)
Let \(A,B\in \mathfrak {C}_{-1}\). Then \(A\circ B\in \mathfrak {C}_{-1}\) with
$$\begin{aligned} \mathbb {B}^{\gamma }_{A\circ B}(s,\mathtt {b})&\le _{s} \mathbb {B}^{\gamma }_{A}(s,\mathtt {b})\mathbb {B}^{\gamma }_{A}(s_0,\mathtt {b})+ \mathbb {B}^{\gamma }_{A}(s_0,\mathtt {b})\mathbb {B}^{\gamma }_{A}(s,\mathtt {b}) \end{aligned}$$(C.39)$$\begin{aligned} \mathbb {B}_{\Delta _{12} (A\circ B) }(p, \mathtt {b})&\le _{p, \rho } \mathbb {B}_{\Delta _{12} A }(p,\mathtt {b})\mathbb {B}_B(p,\mathtt {b}){,} +\mathbb {B}_{\Delta _{12} B }(p,\mathtt {b})\mathbb {B}_A(p,\mathtt {b}){.} \end{aligned}$$(C.40)
Proof
Let us check item (i). The fact that \(\langle D_{x}\rangle ^{1/2}A\langle D_x \rangle ^{1/2}\) is Lip-0-tame follows by (C.24) since \(\rho \ge 1\). Indeed \(\langle D_{x}\rangle ^{-\rho +1}\) is bounded in x and for any \(h\in H^{s}\)
By studying the tameness constant of the operators \(\partial _{\varphi }^{\vec {\mathtt {b}}} A, [A, \partial _x], [\partial _{\varphi }^{\vec {\mathtt {b}}} A, \partial _x] \Delta _{12}A ,\partial _{{\varphi }}^{\vec {\mathtt {b}}}\Delta _{12}A , [\Delta _{12}A ,\partial _{x}]\), \([\partial _{{\varphi }}^{\vec {\mathtt {b}}}\Delta _{12}A ,\partial _{x}]\) for \(\vec {\mathtt {b}}\in \mathbb {N}^{\nu }\), \(|\vec {\mathtt {b}}|=\mathtt {b}\), following the same reasoning as above, one gets the (C.37).
In order to prove item (ii) one can follow almost word by word the proof of Lemma A.4 in [33].
Let us check (C.39). Let \(\mathtt {b}\in \mathbb {N}\) and consider \(0\le \mathtt {b}_1\le \mathtt {b}\), \(m=1, \ldots , \nu \). One has
We show that each summand in (C.41) is a Lip-0-tame operator. We have for \(h\in H^{s}\)
In (C.42) we used the fact that \(\langle D_{x}\rangle ^{\frac{1}{2}} (\partial _{{\varphi }_m}^{\mathtt {c}_1}A)\langle D_{x}\rangle ^{\frac{1}{2}}\) and \(\langle D_{x}\rangle ^{\frac{1}{2}} (\partial _{{\varphi }_m}^{\mathtt {c}_2}B)\langle D_{x}\rangle ^{\frac{1}{2}}\) are 0-tame by hypothesis (see Definition (C.6)). This proves (C.39) for the operators \(A\circ B\) and \(\partial _{{\varphi }_m}^{\mathtt {b}_1}(A\circ B)\) for any \(0\le \mathtt {b}_1\le \mathtt {b}\), \(m=1, \ldots , \nu \). One concludes the proof of (C.39) and (C.40) followings the same ideas used above. For further details we refer to the proof of Lemma B.1 in [33]. \(\quad \square \)
Lemma C.9
Let \(X,Y\in \mathfrak {C}_{-1}\) then \(\mathrm {ad}^k_{X}[Y]\in \mathfrak {C}_{-1}\) for any \(k\ge 1\) (recall (7.68)). Moreover for any \(k\ge 1\) we have
Moreover if X, Y depend on some parameter i we have
Proof
It follows by using the formula
and applying iteratively the estimates of Lemma C.8. \(\quad \square \)
Lemma C.10
Let \(A\in \mathfrak {C}_{-1}\) then A is a Lip-\(-1\)-modulo tame operator according to Definition 2.7. Moreover
Proof
It follows arguing as in the proof of Lemma A.4 in Appendix A of [33] using the definition of \(\mathfrak {C}_{-1}\) in Definition C.6. \(\quad \square \)
1.3 C.3 Linear Birkhoff normal form
The next lemma regards almost diagonal vector fields which belong to \(\mathfrak {C}_{-1}\).
Lemma C.11
Let \(X_A:=J A\in \mathfrak {C}_{-1}\) with A almost diagonal in the sense of Definition 6.4 . Then \(|A_{j, l}^{j', l'} |\le C\langle j,j'\rangle ^{-2}\) for some constant \(C>0\).
Proof
The operator \(B:=\langle D_x \rangle ^{1/2} J A \langle D_x \rangle ^{1/2}\) belongs to \(\mathcal {L}(H^s)\) for any s. Then \(|B_{j, l}^{j', l'} |\le C\) for all \(j, j'\in \mathbb {Z}\), \(l, l'\in \mathbb {Z}^{\nu }\). The thesis follows by the fact that
and \(|j |\gtrsim \langle j \rangle ^{1/2}\langle j' \rangle ^{1/2}\), since A is almost diagonal. \(\quad \square \)
We now study the flows generated by operators belonging to the class \(\mathfrak {C}_{-1}\) given in Definition C.6.
In Sect. 7.2 we look for symplectic changes of variable \(\Upsilon _i:H^s_{S^{\perp }}(\mathbb {T}^{\nu +1})\rightarrow H^s_{S^{\perp }}(\mathbb {T}^{\nu +1})\), \(i=1,2, 3\), that are the time-1 flow of quadratic Hamiltonians
where \(\mathtt {A}_i(\varphi )\) is a self-adjoint operator \(\forall \varphi \in \mathbb {T}^{\nu }\) and thus
Given linear operators \(\mathtt {B}_i\), \(i=1,2,3\) define the matrices \(\mathtt {A}_i\), \(i=1,2,3\) as
where, recall (1.8), (4.6),(5.6),
Lemma C.12
Let \(j, j'\in S^c\), \(j\ne j'\). If \(\sum _{i=1}^{\nu } \overline{\jmath }_i\,\ell _i+j-j'=0\), \(0<|\ell |\le 2\), \(\delta _{\ell j j'}\ne 0\), where \(\delta _{\ell j j'}\) are given in (C.51), then there exists a constant C depending on the set S such that \( |\delta _{\ell j j'} |\ge C\).
Proof
If \(|\ell |=1\) we have by the preservation of momentum
It is easy to verify that (recall that \(|j-j'|\le 2 \overline{\jmath }_1\))
Which implies the thesis for \(|\ell |=1\). Now suppose \(|\ell |=2\) and consider \(j_1, j_2\in S\). We can write \(\delta _{\ell j j'}={\lambda }(j_1)+{\lambda }(j_2)+{\lambda }(j)-{\lambda }(j')\) and by the conservation of momentum
and P is the rational function
If \(|j |> N\), where \(N=N(S)\) is a large constant to be fixed and which depends on the set S, then
for some constant \(C_1:=C_1(N)>0\) (possibly small), provided that N is large enough. Moreover
for some small constant \(C_2:=C_2(N)>0\), provided that N is large enough, and some big constant \(C_3:=C_3(S)>0\). Thus \( |{\lambda }(j_1)+{\lambda }(j_2)+{\lambda }(j)-{\lambda }(j') |\ge C_1 C_2 C_{3}^{-1}>0\).
Now consider \(|j |\le N\). Then we have
for some constant \(M_1, M_2>0\) depending on S. Set \(C_4:=M_1/ M_2\).
Therefore \(|{\lambda }(j_1)+{\lambda }(j_2)+{\lambda }(j)-{\lambda }(j')|\ge C_4>0\). At the end we choose \(C\ge \max \{ C_4, C_1 C_2/ C_3 \}\). \(\quad \square \)
Lemma C.13
Assume that \(\mathtt {B}_{i}\), \(i=1,2,3\) are such that \(J\mathtt {B}_{i}\in \mathfrak {C}_{-1} \) and that they are almost diagonal (see Definition 6.4). Then, for any \(\omega \in \mathcal {G}^{(1)}_0\) (see (5.5)), the following holds true:
(i) The linear vector fields \(X_{\mathtt {A}_i}:=J \mathtt {A}_i\), with \(\mathtt {A}_i\) defined as in (C.49),(C.50), belongs to the class \(\mathfrak {C}_{-1}\), in particular it satisfies the following:
Note that \(X_{\mathtt {A}_i}\) does not depend on \(i(\omega )\).
(ii) The transformation \(\Upsilon _i:H^s(\mathbb {T}^{\nu +1})\rightarrow H^s(\mathbb {T}^{\nu +1})\), \(i=1,2,3\) defined in (C.48) is invertible and satisfies, for any \(u=u(\omega )\in H^s\) Lipschitz in \(\omega \in \mathcal {O}_{\infty }^{2{\gamma }}\),
Proof
First of all notice that, by Lemmata C.11, C.12 and the fact that \(\omega \in \mathcal {G}_0^{(1)}\) (see (5.5)) we will have
for some constant \(C>0\) depending on the set S.
Proof of item (i). First we note that that \(B:=\langle D_x \rangle ^{1/2} J \mathtt {A}_i \langle D_x \rangle ^{1/2}\) maps \(H^s\) to itself for all \(s\ge 0\). Indeed it is sufficient to exploit the fact that the matrix entries \(B_j^{j'}(l-l')\) are uniformly bounded by a constant and B is almost diagonal. The matrix elements of \(\partial _{\varphi _m}^{\mathtt {b}} X_{\mathtt {A}_1}\), \([X_{\mathtt {A}_1}, \partial _x]\), \([\partial _{\varphi _m}^{\mathtt {b}} X_{\mathtt {A}_1}, \partial _x]\) are respectively
Note that by the definition of \(\mathtt {A}_1\) in (C.49)
for some constant C depending on the set S. Thus arguing as above one can easily prove that \(\partial _{\varphi _m}^{\mathtt {b}} X_{\mathtt {A}_1}\), \([X_{\mathtt {A}_1}, \partial _x]\), \([\partial _{\varphi _m}^{\mathtt {b}} X_{\mathtt {A}_1}, \partial _x]\) are \(-1\)-Lip-tame operators. This concludes the proof of the (C.54)
Proof of item (ii). By (C.48) we have
By using iteratively the property (iii) of Lemma C.8 and item (i) we have that
By using this relation to estimate the Lip-Sobolev norm of (C.58) and by noting that \(\varepsilon ^{n} C(s_0)^n\) is a summable sequence, for \(\varepsilon \) small enough, we prove the thesis. \(\quad \square \)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Feola, R., Giuliani, F. & Procesi, M. Reducible KAM Tori for the Degasperis–Procesi Equation. Commun. Math. Phys. 377, 1681–1759 (2020). https://doi.org/10.1007/s00220-020-03788-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00220-020-03788-z