Abstract
The derivative nonlinear Schrödinger (DNLS) equation can be derived as an amplitude equation via multiple scaling perturbation analysis for the description of the slowly varying envelope of an underlying oscillating and traveling wave packet in dispersive wave systems. It appears in the degenerated situation when the cubic coefficient of the similarly derived NLS equation vanishes. It is the purpose of this paper to prove that the DNLS approximation makes correct predictions about the dynamics of the original system under rather weak assumptions on the original dispersive wave system if we assume that the initial conditions of the DNLS equation are analytic in a strip of the complex plane. The method is presented for a Klein–Gordon model with a cubic nonlinearity.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
The nonlinear Schrödinger (NLS) equation
with coefficients \( \nu _1, {\widetilde{\nu }}_2 \in {\mathbb {R}} \), can be derived for the description of small modulations in time and space of oscillatory wave packets in dispersive wave systems, such as the quadratic (\( f(u) = u^2 \)) or cubic (\( f(u) = u^3 \)) Klein–Gordon equation
the water wave problem and systems from nonlinear optics. For the cubic Klein–Gordon equation, the ansatz for the derivation of the NLS equation is given by
where \( c_g\) is the linear group velocity, \( k_0\) the basic spatial wave number, \( \omega _0 \) the basic temporal wave number and \( 0 < \varepsilon \ll 1 \) a small perturbation parameter.
Various NLS approximation results have been proved in the last decades. Such a result can trivially be established for a dispersive wave system with no quadratic terms by using Gronwall’s inequality, cf. [18]. However, in case of quadratic nonlinearities such a result is non-trivial since terms of order \( {\mathcal {O}}(\varepsilon ) \) have to be controlled on the long \( {\mathcal {O}}(1/\varepsilon ^2) \)-timescale. The idea to get rid of this problem is to use so-called normal form transformations. By a near identity change of variables, the terms of order \( {\mathcal {O}}(\varepsilon ) \) can be eliminated if non-resonance conditions are satisfied, cf. [13]. The last years saw various attempts to weaken these non-resonance conditions in order to control appearing resonances, cf. [22], and to make the theory applicable to quasilinear systems, cf. [4, 7, 28], such as the water wave problem, cf. [6, 8, 11, 27].
It turned out that in case of initial conditions for the NLS equation which are analytic in a strip of the complex plane almost no non-resonance conditions are necessary, cf. [5, 21]. It is the purpose of this paper to explain that the method developed in [5, 21] can be used in the justification of the derivative nonlinear Schrödinger (DNLS) approximation, too. Interestingly, it allows us to get rid of a problem which is not present in the justification of the NLS approximation, see below.
The DNLS equation
with \( T \ge 0 \), \( X \in {\mathbb {R}}\), \( A(X,T) \in {\mathbb {C}}\), and coefficients \( \nu _j \in {\mathbb {R}} \) for \( j = 1,\ldots ,5 \), appears if the cubic coefficient \( {\widetilde{\nu }}_2 = {\widetilde{\nu }}_2 (k_0) \) in (1) vanishes for the chosen basic spatial wave number \( k_0 \). This situation appears for instance in the water wave problem for one-dimensional sets in the parameter plane of surface tension and basic spatial wave number \( k_0 \), cf. [1]. The DNLS equation can be derived with an ansatz
The justification is more difficult and from a mathematical point of view even more interesting than that for the NLS approximation since for original dispersive wave systems with a quadratic nonlinearity, in the equation for the error, terms of order \( {\mathcal {O}}(\varepsilon ^{1/2}) \) have to be controlled on a long \( {\mathcal {O}}(1/\varepsilon ^2) \)-timescale. Even for dispersive wave systems with a cubic nonlinearity, in the equation for the error, terms of order \( {\mathcal {O}}(\varepsilon ) \) have to be controlled on a long \( {\mathcal {O}}(1/\varepsilon ^2) \)-timescale, and so, as a first step in establishing an approximation theory for the DNLS approximation we start with the most simple toy problem, namely a nonlinear Klein–Gordon equation with a special cubic nonlinearity,
Herein, \( x \in {\mathbb {R}} \), \( t \in {\mathbb {R}} \), \( u(x,t) \in {\mathbb {R}} \), and
Plugging the ansatz (3) with \( k_0 = 1 \) into (4) and equating the coefficients in front of \( e^{i(k_0 x - \omega _0 t)} \) to zero gives at \( {\mathcal {O}}(\varepsilon ^{1/2})\) the linear dispersion relation \( \omega _0^2 = k_0^2 + 1 \) and at \( {\mathcal {O}}(\varepsilon ^{3/2}) \) the linear group velocity \( c_g = k_0/\omega _0 \). Using the expansion
gives at \( {\mathcal {O}}(\varepsilon ^{5/2}) \) the DNLS equation
Remark 1.1
The Fourier transform of the DNLS approximation (3) is given by
Hence, the Fourier transform is strongly concentrated at the wave numbers \( \pm k_0 = \pm 1 \) and so the evolution of \( {\widehat{A}} \) is determined by the form of the dispersion relation and of the nonlinearity at the wave number \( k_0 = 1 \).
It is the goal of this paper to prove that the DNLS equation (5) makes through the ansatz (3) correct predictions about the dynamics of the Klein–Gordon model (4).
For the formulation of our approximation theorem, we need.
Definition 1.2
For \( \sigma , s \ge 0 \), we define the Gevrey spaces
Then our approximation theorem is as follows.
Theorem 1.3
Let \( s_A \ge 12 \), \( \sigma _0 > 0 \), and \( A\in C([0,T_0],G_{\sigma _0}^{s_A}) \) be a solution of the DNLS equation (5). Then there exist \( \varepsilon _0 > 0 \), \( T_1 \in (0,T_0] \), and \( C > 0 \) such that for all \( \varepsilon \in (0,\varepsilon _0) \) we have solutions u of the Klein–Gordon model (4) such that
Remark 1.4
As already said, such an approximation result is non-trivial since solutions of order \( {\mathcal {O}}(\varepsilon ^{1/2}) \) of (4) have to be controlled on an \( {\mathcal {O}}(1/\varepsilon ^{2}) \)-timescale. Although we have a cubic nonlinearity, a simple application of Gronwall’s inequality would only give estimates on an \( {\mathcal {O}}(1/\varepsilon ) \)-timescale.
Remark 1.5
Such an approximation result should not be taken for granted. There are counterexamples, cf. [10, 19, 23], showing that there are amplitude equations which are derived in a formally correct way, but fail to make correct predictions about the original system on the natural timescale of the approximation.
Remark 1.6
We recall that for \( \sigma , s \ge 0 \) due to the Paley–Wiener theorem functions \( u \in G_{\sigma }^{s} \) can be extended to a strip \( \{ z \in {{\mathbb {C}}}: |\text {Im} z| < \sigma \}\) in the complex plane, cf. [15].
Remark 1.7
The approximation result is not optimal in the sense that error estimates can only be proved on the correct timescale, namely for \( t \in [0,T_1/\varepsilon ^2] \), but not necessarily for all \( t \in [0,T_0/\varepsilon ^2] \). Hence, we can only guarantee that parts of the DNLS dynamics can be seen in the original system.
It turns out that there are two new difficulties which were not present in the justification analysis of other modulation equations so far and which have to be overcome, namely the problem of a total resonance and the problem of a second-order resonance, see below and Sect. 4 for detailed definitions of these resonances. We get rid of the second-order resonance by adapting a method developed in [5, 21] for justifying the NLS approximation under rather weak non-resonance conditions, however, with the drawback that the initial conditions for the NLS equation have to be chosen to be analytic in a strip of the complex plane. This approach will be combined with some energy estimates in order to get rid of the total resonance which is also not present in the justification analysis of the NLS approximation. For a detailed outline of the proof, see Sect. 2.
Remark 1.8
We call the following approach to justify the DNLS approximation robust since our approximation result holds under rather weak non-resonance conditions.
Remark 1.9
For completeness, we remark that the DNLS equation is a well-studied nonlinear dispersive system. Local well-posedness of smooth solutions in Sobolev spaces \(H^s\) with \(s>3/2\) was established by Tsutsumi and Fukuda [26]. See also [3, 9, 25, 29, 30] for further improvements. The complete integrability of the DNLS equation has been established in [16]. For a recent overview, see [12].
Notation
The Fourier transform of a function \( u: {\mathbb {R}} \mapsto {\mathbb {C}} \) is given by
The inverse Fourier transform of a function \( {\widehat{u}}: {\mathbb {R}} \mapsto {\mathbb {C}} \) is given by
Multiplication \((uv)(x)=u(x)v(x)\) in physical space corresponds in Fourier space to the convolution
The weighted Lebesgue space \( L^2_s \) which will be used a few times in Fourier space is equipped with the norm \( \Vert {\widehat{u}} \Vert _{L^2_s} = \Vert {\widehat{u}}\rho _0^s \Vert _{L^2} \) with \( \rho _0(k)=(1+k^2)^{1/2} \).
In the following, many possibly different constants are denoted with the same symbol C if they can be chosen independent of the small perturbation parameter \( 0 < \varepsilon \ll 1 \).
2 Plan of the paper
The plan of the paper is as follows. In Sect. 3, we write the Klein–Gordon model (4) as a first-order system in Fourier space and derive the equations for the error made by an improved DNLS approximation. The improved DNLS approximation is \( {\mathcal {O}}(\varepsilon ^{3/2}) \)-close to the original DNLS approximation (3), but has the advantage that it makes the residual smaller, i.e., the terms which do not cancel after inserting the ansatz in the original system. The improved DNLS approximation is constructed and remaining residual terms are estimated in “Appendix A.”
There are terms of order \( {\mathcal {O}}(\varepsilon ) \) in the equations for the error which prevents us to obtain bounds on the long \( {\mathcal {O}}(1/\varepsilon ^2) \)-timescale. Therefore, in Sect. 4 we perform some normal form transformations in order to get rid of these terms in the equations for the error. It turns out that this elimination is a non-trivial task since resonances are present in the system, and therefore, not all terms of order \( {\mathcal {O}}(\varepsilon ) \) can be eliminated by these transformations.
There are cubic terms which cannot be eliminated for any wave number \( k \in {{\mathbb {R}}}\). We call the associated resonance a total resonance. Moreover, there is another resonance at the wave numbers \( \pm k_0 = \pm 1 \). The denominator in the normal from transformation vanishes of second order for these wave numbers and so this resonance will be called second-order resonance in the following. Since the nonlinear terms which appear in the nominator only vanish linearly at the wave numbers \( \pm k_0 = \pm 1 \), the associated part of the normal form transform would be unbounded. Therefore, in Sect. 4 we only eliminate all terms which are not associated with the total resonance or second-order resonance. It turns out that the total resonant terms can be controlled with a simple energy estimate, and so, we concentrate on the handling of the terms associated with the second-order resonance by the use of Gevrey spaces in the following.
As a preparation we recall some estimates from the local existence and uniqueness proof for the DNLS equation in Gevrey spaces in Sect. 5. By lowering the decay rates \( \sigma \) in the definition of the norm \( \Vert \cdot \Vert _{G^s_{\sigma }} \) with time, we obtain an artificial smoothing which allows us to get rid of first-order derivatives in the nonlinearity.
The transfer of the analysis made in Sect. 5 to Fourier space is the basis of our approach to get rid of the terms associated with the second-order resonance. The transfer to Fourier space corresponds to giving up the exponential localization of the solutions in Fourier space with time which will allow us to use the derivative in front of the nonlinear term in (5) to come to the correct \( {\mathcal {O}}(1/\varepsilon ^2) \)-timescale. However, in order to use this idea on the original system (4) we have to get rid of the fact that the DNLS modes are concentrated at the wave number \( k_0=1 \) and that the nonlinear term vanishes at this wave number, too. Hence, in Sect. 6 we introduce a space where the Fourier modes are located at integer multiples of \( k_0 \) with an exponential decay proportional to \( |k-mk_0| \). Again by lowering the decay rates with time, we rebuilt the construction from Sect. 5 for (4), cf. Fig. 1. This allows us to come with our error estimates to the natural \( {\mathcal {O}}(1/\varepsilon ^{2}) \)-timescale of the DNLS approximation. This construction originally was used in [5, 21] to get rid of resonances which are bounded away from the integer multiples of the basic wave number \( k_0 \). The solutions at such resonant wave numbers will grow like \( e^{ {\mathcal {O}}(\varepsilon ) t} \). However, by the chosen spaces the solutions are initially \( e^{ -{\mathcal {O}}(1/\varepsilon )} \) small. Thus, the error will stay \( {\mathcal {O}}(1) \)-bounded on a long \( {\mathcal {O}}(1/\varepsilon ^2) \)-timescale in t.
In Sect. 7, we introduce so-called mode filters which allow us to separate the error function. These mode filters allow us to handle different parts of the error function in Fourier space differently. At the wave number \( k = \pm k_0 \) and except for small neighborhoods around integer multiples of \( k_0 \), we use the time-dependent weighted \( L^2 \)-spaces to control the magnitude of the error. In the small neighborhoods around the integer multiples of \( k_0 \), without the neighborhoods around \( k = \pm k_0 \), we use normal form transformations. Estimates for the normal form transformations in the chosen spaces can be found in Sect. 8. The final error estimates can be found in Sect. 9. We use energy estimates for the transformed system since we still have to get rid of the totally resonant terms. All ideas from the previous sections can be incorporated in these energy estimates. We close the paper in Sect. 10 with a discussion about possible improvements, generalizations and about the possible transfer to more complicated systems.
3 Equations for the error
The Fourier transformed cubic Klein–Gordon model (4) is given by
where \( \omega (k) = \text {sign}(k) \sqrt{1+k^2} \) and \( \rho (k)=- \frac{\varrho (i k)}{\omega (k)} \). By this choice of \( \omega \) and \( \rho \), the subsequent variables will be real-valued in physical space. With \( u = u_1 \), we write (7) as a first-order system
This system is diagonalized with
With \( V = (v_{-1},v_1) \), we obtain
where in Fourier space
is a skew-symmetric operator and
a symmetric trilinear mapping. The DNLS approximation is of the form
cf. (30), with \( a_j \) concentrated at the wave number \( k = j \) and higher-order approximation terms \( \varepsilon ^{3/2} \psi _{s,\pm 1} \). See “Appendix A” for the detailed construction. The error \( \varepsilon ^{{\widetilde{\beta }}} R = V - \varepsilon ^{1/2} \psi \) with \( {\widetilde{\beta }} > 3/2 \) made by the DNLS approximation satisfies
where \( \varepsilon L_{c}(R) \) are terms which are linear in R but have too few powers of \( \varepsilon \) in front and thus make difficulties to obtain \( {\mathcal {O}}(1) \)-bounds for R on the long \( {\mathcal {O}}(1/\varepsilon ^2) \)-timescale, where \( \varepsilon ^2 L_{s}(R) \) are terms which are linear in R and have enough powers of \( \varepsilon \) in front, and where \( \varepsilon ^{{\widetilde{\beta }}+1/2} L_r(R) \) are terms which are nonlinear in R (and have enough powers of \( \varepsilon \) in front), in detail
Moreover, \( \varepsilon ^{-{\widetilde{\beta }}} \text {Res}(\varepsilon ^{1/2} \psi ) \) are the so-called residual terms. These are the terms which do not cancel after inserting the DNLS approximation into the nonlinear Klein–Gordon equation (4).
In the following, we concentrate on estimating the error made by the DNLS approximation and postpone the standard construction of an improved approximation and the estimates for the residual to Appendix A. The improved approximation will be chosen in such a way that the term \( \varepsilon ^{-{\widetilde{\beta }}} \text {Res}(\varepsilon ^{1/2} \psi ) \) is of order \( {\mathcal {O}}(\varepsilon ^2) \).
In the following, in our notation we keep \( {\widetilde{\beta }} > 3/2 \) in order to show that by using improved approximations the error can be made arbitrarily small. All terms on the right-hand side of (10) are at least of order \( {\mathcal {O}}(\varepsilon ^2)\) except for the first two terms. Since \( \Lambda \) is skew-symmetric, the first term on the right-hand side of (10) makes no problems, too. However, the second term \( \varepsilon L_{c}(R) \) which is of order \( {\mathcal {O}}(\varepsilon ) \) makes serious problems in estimating the error on the long \({\mathcal {O}}(1/\varepsilon ^{2}) \)-timescale.
4 Normal form transformations and the resonance structure
The approach to get rid of the dangerous term \( \varepsilon L_{c}(R) \) in (10), which is a sum of trilinear mappings of \( a_{j_1} \), \( a_{j_2} \) and \( R_{j_3} \), with \( j_1,j_2,j_3 \in \{ -1,1\} \), are normal form transformations. By these near identity change of variables
with M a trilinear mapping, the \( {\mathcal {O}}(\varepsilon ) \)-terms can be transformed into \( {\mathcal {O}}(\varepsilon ^2) \)-terms, if a number of non-resonance conditions are satisfied.
Remark 4.1
In order to eliminate a trilinear term \( \varepsilon B(a_{j_1},a_{j_2},R_{j_3}) \) of the form
from the equation of \( R_j \) by a near identity transformation \( w_j = R_j + \varepsilon M(a_{j_1},a_{j_2},R_{j_3}) \), using the fact that the \( {\widehat{a}}_{j} \) in (9) are strongly concentrated at the wave numbers \( k = j \), we have to choose
with
cf. [24, §11]. For the terms which will be eliminated, the nominator b is bounded and the denominator is bounded away from zero.
Thus, the non-resonance condition
has to be satisfied for all \( k \in {{\mathbb {R}}}\) for the elimination of a term \( {\widehat{a}}_{j_1}*{\widehat{a}}_{j_2}*R_{j_3} \) from the equation for \( R_{j} \), cf. [24, §11] or Remark 4.1. The non-resonance conditions (11) can be analyzed graphically. We find no resonances except for
- (TR):
-
For \((j,j_1,j_2,j_3)=(j,j_1,-j_1,j)\), the resonance function \( r_{jj_1j_2j_3}(k) \) vanishes identically. Thus, the associated terms in \( \varepsilon L_{c}(R) \) cannot be eliminated by a normal form transformation.
- (SOR):
-
For \((j,j_1,j_2,j_3)=(-1,j_1,j_1,-1)\), cf. Fig. 2, there is a resonance at \(k=j_1\), which is of second order, in detail
$$\begin{aligned} \omega (k)-2\omega (j_1) -\omega (k-2j_1)= 2 \omega ''(j_1) (k-j_1)^2 +{\mathcal {O}}(|k-j_1|^3) \, \end{aligned}$$for k near \(j_1\). This second-order resonance would appear in the denominator of the normal form transformation. It cannot be balanced by the term \( \rho \) in the nominator of the normal form transformation which only vanishes linearly at \( k = \pm 1 \). Thus, the normal form transformation would be singular near the wave numbers \( k = \pm 1 \).
Thus, besides the normal form transformations which we use to get rid of the non-resonant terms, we need an idea to get rid of the terms which cannot be eliminated at all due to the total resonance (TR), and we need an idea to get rid of the terms which cannot be eliminated in a small neighborhood of the wave numbers \( k = \pm 1 \) due to the second-order resonance (SOR).
It turned out that the problem with the total resonance (TR) can be solved rather easily by using energy estimates. For the handling of the second-order resonance (SOR), we use the fact that in lowest order the system near the wave numbers \( k = \pm 1 \) is given by the DNLS equation. Our approach to solve this problem is similar to the approach chosen for instance in [17, 20] for the justification of the KdV approximation. By this approach, we not only get rid of the quasilinearity of the DNLS equation but also gain the missing \( {\mathcal {O}}(\varepsilon ) \) order to come to the long \( {\mathcal {O}}(1/\varepsilon ^2) \)-timescale. In order to explain this approach, we have a look at the solution theory of the DNLS equation in Gevrey spaces in Sect. 5 first.
Solving the NLS equation in Gevrey spaces was also the basis of the approach which has been used in [5, 21] for justifying the NLS approximation under rather weak non-resonance conditions. This underlying idea of the approach is introduced in Sect. 6. Interestingly, it also allows us to get rid of the second-order resonances which are not present in the justification of the NLS approximation.
And so in the end the transfer of the method developed in [5, 21] from the NLS approximation to the DNLS approximation not only gains the missing \( {\mathcal {O}}(\varepsilon ) \) order in order to come to the long \({\mathcal {O}}(1/\varepsilon ^2) \)-timescale but also allows us to justify the DNLS approximation under rather weak non-resonance conditions. Since we need to control the total resonant terms, too, we use energy estimates instead of the variation of constant formula, and so the \( L^1 \)-based spaces in Fourier space from [5] are replaced here by \( L^2 \)-based spaces.
5 The DNLS equation in Gevrey spaces
As already said, we solve the DNLS equation in Gevrey spaces \(G_\sigma ^s\) equipped with the norm
In our presentation of the properties of these spaces, we follow [2]. For the local existence and uniqueness of solutions, we use that \(G_\sigma ^s\) is an algebra for \(s>1/2\) and \( \sigma \ge 0\), i.e., if \(u,v\in G_\sigma ^s\), then \(uv\in G_\sigma ^s\) and
where the constant \(C_s\) is independent of \(\sigma \ge 0\). Since the DNLS equation is a quasilinear system, we need the following improved, so-called tame, estimate
which holds for all \( \sigma \ge 0 \), \(\kappa \ge 0\) and \(s>1/2\). The elements of \( G^s_\sigma \) form a proper subset of the space of functions which are analytic in a strip of the complex plane of width \( < 2 \sigma \), symmetric around the real axis, equipped with the sup-norm due to the Paley–Wiener theorem, cf. [15].
For the DNLS equation, we have the following local existence and uniqueness result.
Theorem 5.1
Let \(s>1\) and \(\sigma _A>0\). Then, for every \(R_0>0\), there exist \(\eta =\eta (R_0,s,\sigma _A)\) such that for every \(A_0\in G_{\sigma _A}^s\), with \(\Vert A_0\Vert _{G_{\sigma _A}^{s}}\le R_0\), there exists a unique local solution \(A(T)\in G^s_{\sigma (T)}\) of the DNLS equation (5) with \(\sigma (T):=\sigma _A-\eta T\), \(T\in [0,\sigma _A/\eta ]\), and \( \sup _{T\in [0,\sigma _A/\eta ]} \Vert A(T) \Vert _{G^s_{\sigma (T)}} \le R_0 \).
Proof
By rescaling T, X and A, the DNLS equation (5) is brought in its normal form
Next we set
where \(M=\sqrt{-\partial _x^2}\). Then B satisfies
We denote the scalar product in \( H^s \) with \( ( \cdot ,\cdot )_s \) and obtain
where
such that
Hence, \( (B,B)_s \) decays in time if \( \eta > 0 \) is chosen so large that initially
With these a priori estimates, the existence and uniqueness of solutions follow standard arguments, cf. [14]. \(\square \)
The existence of the solutions of the DNLS equation (5) which is assumed in Theorem 1.3 is guaranteed by the following corollary.
Corollary 5.2
Let \( s_A \ge 12 \) and \(\sigma _A>0\). Then, for every \(R_0>0\), there exist \(\eta =\eta (R_0,s_A,\sigma _A)\) such that for every \(A_0\in G_{\sigma _A}^{s_A}\), with \(\Vert A_0\Vert _{G_{\sigma _A}^{s_A}}\le R_0\) and \( \sigma _0 \in [0,\sigma _A) \) there exists a \( T_0 > 0 \) and unique local solution \(A\in C([0,T_0],G^{s_A}_{\sigma _0})\) of the DNLS equation (5) with \( \sup _{T\in [0,\sigma _A/\eta ]} \Vert A(T) \Vert _{G^s_{\sigma (T)}} \le R_0 \).
Proof
As above choose \(\sigma (T):=\sigma _A-\eta T\) and stop for \(\sigma (T_0) = \sigma _0 \). \(\square \)
6 Modulational Gevrey spaces
In the last section, we have seen that with an initial exponential decay in Fourier space for wave numbers \( |K| \rightarrow \infty \) we can create an artificial smoothing which allows us to control the derivative in front of the nonlinear terms of the DNLS equation. For the nonlinear Klein–Gordon equation (4), the DNLS equation is the lowest order approximation for the modes located at the wave number \( k = 1 \); in particular, the derivative in front of the nonlinear terms of the DNLS equation corresponds to the vanishing of the nonlinear terms of the nonlinear Klein–Gordon equation (4) at the wave number \( k = 1 \). For the DNLS approximation, the associated modes decay with an exponential rate around the wave number \( k = 1 \), cf. Fig. 1. However, by nonlinear interaction small peaks with width of order \( {\mathcal {O}}(\varepsilon ) \) are created at odd integer multiples of the basic wave number \( k_0 = 1\). See the right panel of Fig. 1. This means that the solutions of the nonlinear Klein–Gordon equation (4) will have a Fourier mode distribution which is bounded from above by a multiple of \(1/\vartheta _{\beta }\), where
or equivalently
for \(\beta \ge 0\). We define a number of spaces to combine these facts with the ideas from Sect. 5 for the DNLS equation (2) in order to handle the nonlinear Klein–Gordon equation (4). For estimating the solutions of the original system, we use energy estimates, and so, the nonlinear Klein–Gordon equation (4) will be solved in the \( L^2 \)-based space
As a consequence for \( u \in {\mathcal {M}}^s_{\beta } \), with \( \beta > 0 \), the modes bounded away from integer multiples of the basic wave number \( {k}_0 = 1\) are exponentially small w.r.t. \( \varepsilon \), i.e., these modes are of order \( {\mathcal {O}}(e^{-r/\varepsilon }) \) for \( 0 < \varepsilon \ll 1 \) with an \( {\mathcal {O}}(1) \)-bound \( r > 0 \). Due to the \( L^2 \)-scaling properties, the DNLS approximation is of order \( {\mathcal {O}}(1) \) in the \( {\mathcal {M}}^s_{\beta } \)-spaces and not of the formal order \( {\mathcal {O}}(\varepsilon ^{1/2}) \). Therefore, we additionally define the spaces
for which the DNLS approximation is of order \( {\mathcal {O}}(\varepsilon ^{1/2}) \).
For the subsequent error estimates, we need that these spaces are closed under point-wise multiplication.
Lemma 6.1
For all \( \beta \ge 0 \) and \( s > 1/2 \), we have
Moreover, for all \( \beta , s \ge 0 \) we have
Proof
The estimates immediately follow from
due to Young’s inequality for convolutions, Sobolev’s embedding
for \( \delta > 1/2 \), and the inequality
\(\square \)
Remark 6.2
The inequalities of (15) will be used to estimate combinations of the approximation with the error subsequently. The DNLS approximation will be estimated in the space \( {\mathcal {W}}^s_\beta \) where it is of order \( {\mathcal {O}}(\varepsilon ^{1/2}) \). The error will be estimated in the space \( {\mathcal {M}}^s_\beta \).
The initial value problem for (4) for initial conditions of order \( {\mathcal {O}}(\varepsilon ^{1/2}) \) can be solved in these spaces on a time interval of length \( {\mathcal {O}}(1/\varepsilon ) \) using the variation of constant formula, using the fact that we have a cubic nonlinearity and the fact that these spaces are closed under multiplication. In order to bound the error not only on the short \( {\mathcal {O}}(1/\varepsilon ) \)-timescale but also on the natural \( {\mathcal {O}}(1/\varepsilon ^2) \)-timescale of the DNLS approximation, we use the spaces \( {\mathcal {M}}^s_{\beta } \), but now with time-dependent \( \beta \). Similar to above, we choose
with constants \( \sigma _0,\, \eta >0\), which can be chosen independently of \( 0 < \varepsilon \ll 1 \). Note that (16) is the scaled version of \(\sigma (T)=\sigma _0-\eta T\) defined in Theorem 5.1 and that \( T_1 = \sigma _0/\eta \). If A is initially in a space \( G^{s+1}_{\sigma _0} \), then according to Remark 1.1 the DNLS approximation is initially in a space \( {\mathcal {W}}^s_{\sigma _0/\varepsilon } \). This is the reason why \( \beta (t) \) starts with \( \sigma _0/\varepsilon \). The decay \( - \eta \varepsilon t \) allows us to consider t on the natural \( {\mathcal {O}}(1/\varepsilon ^2) \)-timescale of the DNLS approximation. It turns out that subsequently choosing \( \eta = {\mathcal {O}}(1) \) is sufficient for our purposes.
In the subsequent sections, we explain in detail how with this approach all problems to come to the long \( {\mathcal {O}}(1/\varepsilon ^2) \)-timescale, found in Sect. 4, can be solved.
7 Separation of the modes
In order to obtain a bound for the error on the long \({\mathcal {O}}(1/\varepsilon ^2)\)-timescale, independently of the small perturbation parameter \( 0 < \varepsilon \ll 1 \), we have to get rid of the term \( \varepsilon L_c(R) \) in (10). Except at the resonant wave numbers, this term is oscillatory and can be removed by a near identity change of variables. In the last sections, we explained our strategy to get rid of the total resonance (TR) and of the second-order resonance (SOR).
Hence, for the error estimates we separate the modes in three parts. The first part which is denoted by \( R_n \) has a support near the odd integer multiples of the basic wave number \( k_0= 1 \) excluding neighborhoods around the basic wave numbers \( \pm k_0= \pm 1 \). It will be handled with normal form transformations and energy estimates. The second part which is denoted by \( R_r \) has a support which is bounded away from the odd integer multiples of the basic wave number \( k_0= 1 \) and will linearly be exponentially damped by our choice of time-dependent weights. The third part which is denoted by \( R_c \) has support near the basic wave numbers \( \pm k_0= \pm 1 \) and will be handled with the ideas which have been explained above in Sect. 5 and Sect. 6 and with energy estimates. The index n stands for normal form, r for rest, and c for critical.
In detail, we define for a small \(\delta _r>0\), but independent of \( 0 < \varepsilon \ll 1 \), the mode filter
the mode filter
and finally the mode filter \({\widehat{E}}_n=1-{\widehat{E}}_r- {\widehat{E}}_c\).
We use these projections to separate the error \( R=R_r+R_n + R_c\) in three parts, namely \(R_r=E_rR_r\), \(R_c=E_cR_c\) and \(R_n=E_nR_n\). These new variables satisfy
where
8 The normal form transform
As already said, in order to come to the long \({\mathcal {O}}(1/\varepsilon ^2)\)-timescale, we have to get rid of the terms
for \( j = n,r,c \) in (17)–(19). In a first step, we simplify the \( \varepsilon E_j L_{c}(R) \) for \( j = n,r,c \) by eliminating all non-resonant terms by normal form transformations. The \( \varepsilon E_j L_{c}(R) \) for \( j = n,r,c \) are sums of trilinear mappings w.r.t. \( a_{j_1} \), \( a_{j_2} \) and \( R_{j_3} \), with \( j_1,j_2,j_3 \in \{ -1,1\} \).
We recall from Remark 4.1 that to eliminate a trilinear term \( \varepsilon B(a_{j_1},a_{j_2},R_{j_3}) \) of the form
from the equation of \( R_j \) by a near identity transformation \( w_j = R_j + \varepsilon M(a_{j_1},a_{j_2},R_{j_3}) \) we have to choose
with
By the analysis of the denominator, made in Sect. 4, we can eliminate all terms except for the total resonant terms and second-order resonant terms.
After the transformation, we obtain a system
where the \( B_j \) are smooth trilinear mappings in their arguments and \( \varepsilon ^2 H_{r,n,c} = {\mathcal {O}}(\varepsilon ^2)\) with properties specified below.
\( \bullet \) The totally resonant term \( B_1(a_1,a_{-1},w_n) \) in (21) and the totally resonant term \( B_2(a_1,a_{-1},w_c) \) in (22) will be controlled by energy estimates in the following.
\( \bullet \) For the second-order resonant terms \( B_3(a_{-1},a_{-1},w_c) \) and \( B_4(a_1,a_{1},w_c) \) in (22), the denominator in the above normal form transformation would vanish quadratically for \( k = \pm 1 \) as we have seen in Sect. 4. Since the nominator only vanishes linearly at these wave numbers, the normal form transform would be unbounded. Therefore, the second-order resonant terms will be handled with the ideas presented in Sects. 5 and 6.
\( \bullet \) The term \( \varepsilon E_r L_{c}(w_r) \) contains totally resonant terms, too, i.e., it is of the form \( B_0(a_1,a_{-1},w_r) + {\mathcal {O}}(\varepsilon ^2) \). In order to explain subsequently a few possible improvements to our approach, we handle the totally resonant terms in \( \varepsilon E_r L_{c}(w_r) \) differently than the other totally resonant terms. The reason for this is, as already said, that the term \( \varepsilon E_r L_{c}(w_r) \) will be exponentially small w.r.t \( \varepsilon \) initially, and that it will take an \({\mathcal {O}}(1/\varepsilon ^2)\)-timescale to grow to an order \({\mathcal {O}}(\varepsilon )\) in any case. This observation allows us to reduce the number of necessary non-resonance conditions for general dispersive systems subsequently.
The properties of the normal form transformation are summarized in the following lemma.
Lemma 8.1
Let \( s > 1/2 \) and \( \sigma _0 \ge 0 \). The transformation
is a small perturbation of identity. For all \( \sigma \in [0,\sigma _0 ] \), the mapping is analytic. For all \( C_1 > 0 \), there exists an \( \varepsilon _0 > 0 \) such for all \( \varepsilon \in (0,\varepsilon _0) \) and all \( \sigma \in [0,\sigma _0 ] \) the following holds. For all \( (w_n,w_r,w_c) \) with \( \Vert (w_n,w_r,w_c) \Vert _{{\mathcal {M}}^s_{\sigma /\varepsilon } } \le C_1 \), there exists an analytic inverse. All bounds are independent of \( \varepsilon \in (0,\varepsilon _0) \) and \( \sigma \in [0,\sigma _0 ] \).
Proof
The estimate (15), the fact that \( |r_{jj_1j_2j_3}(k)| \) is uniformly bounded away from zero for the terms considered in the normal form transformation, and Neumann’s series imply the statements. \(\square \)
With this lemma, we immediately have
Corollary 8.2
Let \( s > 1/2 \) and
with \( \beta (t) \) defined in (16). There exist constants \( C_1 \), \( C_3>0 \) independent of \( {\check{M}} \) and \( \varepsilon \in (0,\varepsilon _0] \), with \( \varepsilon _0 > 0 \) from Lemma 8.1, and a monotonically increasing function \( C_2( {\check{M}} )>0 \), independent of \( \varepsilon \in (0,\varepsilon _0] \), such that
for \( j = r,n,c \).
9 The final error estimates
In order to estimate the solutions of Eqs. (20)–(22) for the error, we use the modulational Gevrey spaces introduced in Sect. 6. Introducing the new weighted variables
for \( j = r,n,c \) allows us to work in classical Sobolev spaces. We find
where the operator \( \Gamma \) is defined in Fourier space by
The trilinear mappings \( B_j \) from (20)–(22) transform into the \( {\widetilde{B}}_j \) which are again smooth trilinear mappings in their arguments. They are estimated below in detail. The remaining terms \( H_j \) from (20)–(22) transform into the \( {\widetilde{H}}_j \) whose properties are specified in the subsequent lemma.
Lemma 9.1
Let \( s > 1/2 \) and
There are constants \( C_1 \), \( C_3>0 \) independent of M and \( \varepsilon \in (0,\varepsilon _0] \), with \( \varepsilon _0 > 0 \) from Lemma 8.1, and a monotonically increasing function \( C_2(M)>0 \), independent of \( \varepsilon \in (0,\varepsilon _0] \), such that
for \( j = r,n,c \).
Proof
The lemma is mainly a reformulation of Corollary 8.2. \(\square \)
In order to estimate the solutions \( W_j \) of Eqs. (23)–(25), we use energy estimates, i.e., we multiply the equation for \( W_j \) with \( W_j \) for \( j =r,n,c \) and take the \( H^s \)-scalar product \( (\cdot ,\cdot )_s \). We find
with
with
and
Among the terms \( s_1,\ldots ,s_{14} \), there are terms which vanish identically and terms which do not make any difficulties since they have an \( \varepsilon ^2 \) in front. The dangerous terms are the ones which only have an \( \varepsilon \) in front, namely \( s_3,s_7,s_{11},s_{12},s_{13} \). They will be estimated through integration by parts, such as the totally resonant terms \( s_7 \) and \( s_{11} \), or by the damping terms \( s_2, s_6, s_{10} \), such as \( s_3 \) or the second-order resonant terms \(s_{12},s_{13} \).
We start with the terms which vanish identically, namely
\( \textbf{s}_{\textbf{1}}, \textbf{s}_{\textbf{5}}, \textbf{s}_{\textbf{9}}\): Due to the skew symmetry of \( \Lambda \), we immediately have
For the terms which have an \( \varepsilon ^2 \) in front, we proceed as follows and find:
\( \textbf{s}_{\textbf{4}}, \textbf{s}_{\textbf{8}}, \textbf{s}_{\textbf{14}}\): Using Lemma 9.1, the Cauchy–Schwarz inequality and \( a \le 1+ a^2 \) in the last term in \( {\widetilde{H}}_r \), \( {\widetilde{H}}_c \) and \( {\widetilde{H}}_c \), respectively, yields
for \( j = 4,8,14 \) using the notation from Lemma 9.1.
Next we go on with the rest of the \( W_r \)-equation.
\( \textbf{s}_{\textbf{2}}\): is the good term in the \( W_r \)-equation. There is a \( \sigma > 0 \) independent of \( 0 < \varepsilon ^2 \ll 1 \) such that
We have \( \alpha (\eta ) \rightarrow \infty \) for \( \eta \rightarrow \infty \), cf. (26).
\( \textbf{s}_{\textbf{3}}\): is the dangerous term in the \( W_r \)-equation which, however, can be estimated by the \( s_2 \)-term. For the \( s_3 \)-term, we have
for a constant \( C_{s_3} = C_{s_3}(\psi ) \) independent of \( 0 < \varepsilon ^2 \ll 1 \).
Next we go on with the rest of the \( W_n \)-equation.
\( \textbf{s}_{\textbf{6}}\): is the good term in the \( W_n \)-equation. However, for our purposes it is sufficient to have \( 2 \text {Re}(s_6) \le 0 \).
\( \textbf{s}_{\textbf{7}}\): comes from the totally resonant terms. In Fourier space, we have to control terms of the form
with
with
and where we used that the product \( a_{j_1}a_{-j_1} \) is real. We have that \((k,l) \mapsto \varrho _1(k,l) \) is smooth and satisfies \( \varrho _1(k,k) = 0 \) such that finally \( | \varrho _1(k,l) | \le C | k -l | \). Since \( \mathop {\int }\limits _{{\mathbb {R}}}{\widehat{a}}_{j_1}(k-m){\widehat{a}}_{-j_1}(m-l) dm \) is strongly concentrated at \( k-l= 0 \), we gain another power of \( \varepsilon \). In detail, we estimate
due to the Cauchy–Schwarz inequality and Young’s inequality for convolutions, such that finally
Finally we come to the remaining terms of the \( W_c \)-equation.
\( \textbf{s}_{\textbf{11}}\): The totally resonant terms \( \mathbf {s_{11}} \) are handled line for line as the totally resonant terms \( \mathbf {s_{7}} \), and so, we also have
\( \textbf{s}_{\textbf{10}}\): is the good term which allows us to handle the second-order resonant terms. We have
\( \textbf{s}_{\textbf{12}}, \textbf{s}_{\textbf{13}}\): In the following, \( W_{c,1} \) denotes the part of \( W_c \) located at \( k = 1 \) and \( W_{c,-1} \) the part of \( W_c \) located at \( k = -1 \). Then the second-order resonant terms are written as
and estimated by
The term \( \Vert \rho ^{1/2} \widetilde{{\widetilde{B}}}_4(a_1,a_{1},W_{c,-1}) \Vert _{L^2}^2 \) is in Fourier space of the form
with
We use that
that \( \int {\widehat{a}}_1(k-l) {\widehat{a}}_1(l-m) dl \) is strongly concentrated at \( k-m \approx 2 \) and Young’s inequality to obtain a bound
Less complicated is the bound
since \( \rho (m+2) = {\mathcal {O}}(|m+1|) \). Thus, we finally obtain
where we used \( \varepsilon ^{3/2} ab \le \varepsilon a^2 + \varepsilon ^{2} b^2 \). This defines the constant \( C_{\psi } \) and we may increase the original constant \( C_1 \).
Summary: Collecting all estimates gives for
that
The third and fourth line can be made negative by choosing \( \eta \) sufficiently large, but independent of the small perturbation parameter \( 0 < \varepsilon \ll 1 \) such that we finally have
Choosing \( C_2(M) \varepsilon ^{{\widetilde{\beta }}-3/2} E_s^{1/2}\le 1 \) yields
Applying Gronwall’s inequality yields for all \( t \in [0,T_1/\varepsilon ^2] \) that
independent of \(\varepsilon \in (0,\varepsilon _0)\) where \(\varepsilon _0>0\) had to be chosen so small that \( C_2(M) \varepsilon ^{{\widetilde{\beta }}-3/2} M \le 1 \). Therefore, we are done. \(\square \)
10 Discussion
In this section, we make a number of remarks about possible improvements and generalizations.
Remark 10.1
For an arbitrary, but fixed, \( {\widetilde{\beta }} > 3/2 \), the improved approximation can be constructed in such a way that the residual term \( \varepsilon ^{-{\widetilde{\beta }}} \text {Res}(\varepsilon ^{1/2} \psi ) \) is of order \( {\mathcal {O}}(\varepsilon ^2) \), cf. “Appendix A.” Hence, the error made by this higher-order approximation is of order \( {\mathcal {O}}(\varepsilon ^{{\widetilde{\beta }}}) \) in some Sobolev norm.
Remark 10.2
It is obvious by the proof that the assumption on the solutions of the DNLS equation (5), namely \( A\in C([0,T_0],G_{\sigma _0}^{s_A}) \), can be replaced by the weaker assumption that we take a solution constructed in Theorem 5.1 with \( s = s_A \), \( \sigma _A = \sigma _0 \) and \(A(T)\in G^s_{\sigma (T)}\) with \(\sigma (T)=\sigma _0-\eta T\).
Remark 10.3
From (17) to (20), we have eliminated all terms of order \( {\mathcal {O}}(\varepsilon ) \) except for the totally resonant ones. This is not necessary since finally \( \varepsilon E_r {\widetilde{L}}_{c}(W_r) \) appears in an equation where \( W_r \) is exponentially damped. Hence, other terms can be kept and other resonances can be handled as long as they are bounded away from odd integer multiples of the basic wave number \( k_0 = 1 \). This follows line for line as in [5]. Obviously our approach breaks down if a resonance falls on an integer multiple of the basic wave number \( k_0 \).
Remark 10.4
We have demonstrated that the DNLS approximation makes correct predictions about the dynamics of our chosen nonlinear Klein–Gordon equation (4). The question about possible generalizations and about the possible transfer to more complicated systems occurs. First of all, we would like to mention that the problems with the total resonance and the second-order resonance occur for all non-trivial systems for which the DNLS approximation can be derived. On the one hand, with this respect our system is not more complicated than necessary. On the other hand, the chosen nonlinear Klein–Gordon equation (4) is sufficiently complicated to contain all principle difficulties which have to be overcome.
Remark 10.5
Other additional difficulties one could think of have been handled in our situation before. For instance, quadratic terms in the original systems can be eliminated completely with a normal form transform for Klein–Gordon models. For other more complicated original systems, additional quadratic or quartic resonances can occur. It is not obvious how existing methods to handle such resonances interplay with the presented approach of this paper. The same is true for quasilinear systems such as the water wave problem. This will be the topic of future research.
Remark 10.6
It is the topic of parallel research to prove a DNLS approximation result for initial conditions which are not analytic in a strip of the complex plane but only live in a Sobolev space. In this case, the totally resonant terms have to be handled with energy estimates again. New ideas are needed to handle the second-order resonant terms. Moreover, all other terms of order \( {\mathcal {O}}(\varepsilon ) \) in the error Eq. (10) have to be eliminated by normal form transformations, i.e., no other resonances can be allowed. This is different to the situation in this paper where additional resonances bounded away from integer multiples of the basic wave number \( k_0 = 1 \) can be allowed due to the exponential smallness of these modes initially, cf. Remark 10.3.
References
Ablowitz, M.J., Segur, H.: Solitons and the Inverse Scattering Transform, vol. 4. Society for Industrial and Applied Mathematics (SIAM), Philadelphia (1981)
Bridges, T.J., Kostianko, A., Schneider, G.: A proof of validity for multiphase Whitham modulation theory. Proc. R. Soc. Lond. A Math. Phys. Eng. Sci. 476(2243), 19 (2020)
Colliander, J.E., Keel, M., Staffilani, G., Takaoka, H., Tao, T.C.: A refined global well-posedness result for Schrödinger equations with derivative. SIAM J. Math. Anal. 34(1), 64–86 (2002)
Duell, W.-P., Heß, M.: Existence of long time solutions and validity of the nonlinear Schrödinger approximation for a quasilinear dispersive equation. J. Differ. Equ. 264(4), 2598–2632 (2018)
Duell, W.-P., Hermann, A., Schneider, G., Zimmermann, D.: Justification of the 2D NLS equation for a fourth order nonlinear wave equation—quadratic resonances do not matter much in case of analytic initial conditions. J. Math. Anal. Appl. 436(2), 847–867 (2016)
Duell, W.-P., Schneider, G., Eugene Wayne, C.: Justification of the nonlinear Schrödinger equation for the evolution of gravity driven 2D surface water waves in a canal of finite depth. Arch. Ration. Mech. Anal. 220(2), 543–602 (2016)
Duell, W.-P.: Justification of the nonlinear Schrödinger approximation for a quasilinear Klein–Gordon equation. Commun. Math. Phys. 355(3), 1189–1207 (2017)
Duell, W.-P.: Validity of the nonlinear Schrödinger approximation for the two-dimensional water wave problem with and without surface tension in the arc length formulation. Arch. Ration. Mech. Anal. 239(2), 831–914 (2021)
Hayashi, N., Ozawa, T.: On the derivative nonlinear Schrödinger equation. Physica D 55(1–2), 14–36 (1992)
Haas, T., Schneider, G.: Failure of the \(N\)-wave interaction approximation without imposing periodic boundary conditions. ZAMM Z. Angew. Math. Mech. 100(6), e201900230 (2020)
Ifrim, M., Tataru, D.: The NLS approximation for two dimensional deep gravity waves. Sci. China Math. 62(6), 1101–1120 (2019)
Jenkins, R., Liu, J., Perry, P., Sulem, C.: The derivative nonlinear Schrödinger equation: global well-posedness and soliton resolution. Q. Appl. Math. 78(1), 33–73 (2020)
Kalyakin, L.A.: Asymptotic decay of a one-dimensional wave-packet in a nonlinear dispersive medium. Math. USSR Sb. 60(2), 457–483 (1988)
Kato, T.: The Cauchy problem for quasi-linear symmetric hyperbolic systems. Arch. Ration. Mech. Anal. 58, 181–205 (1975)
Katznelson, Y.: An Introduction to Harmonic Analysis. Cambridge University Press, Cambridge (2004)
Kaup, D.J., Newell, A.C.: An exact solution for a derivative nonlinear Schrödinger equation. J. Math. Phys. 19, 798–801 (1978)
Kano, T., Nishida, T.: A mathematical justification for Korteweg–de Vries equation and Boussinesq equation of water surface waves. Osaka J. Math. 23, 389–413 (1986)
Kirrmann, P., Schneider, G., Mielke, A.: The validity of modulation equations for extended systems with cubic nonlinearities. Proc. R. Soc. Edinb. Sect. A Math. 122(1–2), 85–91 (1992)
Schneider, G.: Validity and limitation of the Newell–Whitehead equation. Math. Nachr. 176, 249–263 (1995)
Schneider, G.: Limits for the Korteweg–de Vries-approximation. Z. Angew. Math. Mech. 76, 341–344 (1996)
Schneider, G.: Justification of modulation equations for hyperbolic systems via normal forms. NoDEA Nonlinear Differ. Equ. Appl. 5(1), 69–82 (1998)
Schneider, G.: Justification and failure of the nonlinear Schrödinger equation in case of non-trivial quadratic resonances. J. Differ. Equ. 216(2), 354–386 (2005)
Schneider, G., Sunny, D.A., Zimmermann, D.: The NLS approximation makes wrong predictions for the water wave problem in case of small surface tension and spatially periodic boundary conditions. J. Dyn. Differ. Equ. 27(3–4), 1077–1099 (2015)
Schneider, G., Uecker, H.: Nonlinear PDEs. A Dynamical Systems Approach, vol. 182. American Mathematical Society (AMS), Providence (2017)
Takaoka, H.: Well-posedness for the one-dimensional nonlinear Schrödinger equation with the derivative nonlinearity. Adv. Differ. Equ. 4(4), 561–580 (1999)
Tsutsumi, M., Fukuda, I.: On solutions of the derivative nonlinear Schrödinger equation. Existence and uniqueness theorem. Funkc. Ekvacioj Ser. Int. 23, 259–277 (1980)
Totz, N., Sijue, W.: A rigorous justification of the modulation approximation to the 2D full water wave problem. Commun. Math. Phys. 310(3), 817–883 (2012)
Eugene Wayne, C., Cummings, P.: Modified energy functionals and the NLS approximation. Discrete Contin. Dyn. Syst. 37(3), 1295–1321 (2017)
Yifei, W., Guo, Z.: Global well-posedness for the derivative nonlinear Schrödinger equation in \(H^{\frac{1}{2}} ({\mathbb{R} })\). Discrete Contin. Dyn. Syst. 37(1), 257–264 (2017)
Yifei, W.: Global well-posedness on the derivative nonlinear Schrödinger equation. Anal. PDE 8(5), 1101–1112 (2015)
Acknowledgements
The work of Guido Schneider is partially supported by the Deutsche Forschungsgemeinschaft DFG through the SFB 1173 ”Wave phenomena” Project-ID 258734477.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Higher-order DNLS approximation and estimates for the residual
Higher-order DNLS approximation and estimates for the residual
In the previous sections, we concentrated on estimating the error made by a DNLS approximation. In order to do so, we used an improved approximation introduced in (9) with the property that the residual term \( \varepsilon ^{-{\widetilde{\beta }}} \text {Res}(\varepsilon ^{1/2} \psi ) \) is of order \( {\mathcal {O}}(\varepsilon ^2) \) in the equations for the error (10). Since similar constructions can be found in several papers, cf. [24], we took this part out of the line of proof. Nevertheless, we will provide a few details in this appendix.
First we show how to construct an higher-order approximation. We define the residual
for the nonlinear Klein–Gordon equation (4). For the computation of a higher-order approximation, we need a Taylor expansion of
at \( k_0 = 1 \) and at other odd integer multiples of \( k_0 = 1 \). We find, for instance,
The improved ansatz is given
with \( {\textbf{E}} = e^{i(k_0 x - \omega _0 t)} \), \( X = \varepsilon (x-c_gt) \), \( T = \varepsilon ^2 t \), and \( p(n) = (||n|-1|+1)/2 \), with a fixed chosen \( N \in {{\mathbb {N}}}\). For expository reasons, we restrict ourselves in the following to the three terms explicitly displayed in the ansatz. They represent the three essential types of approximation equations which occur.
Plugging the ansatz into (28) and equating the coefficients in front of \( {\textbf{E}} \) to zero give as before the linear dispersion relation \( \omega _0^2 = k_0^2 + 1 \) at \( {\mathcal {O}}(\varepsilon ^{1/2})\) and the linear group velocity \( c_g = k_0/\omega _0 \) at \( {\mathcal {O}}(\varepsilon ^{3/2}) \). At \( {\mathcal {O}}(\varepsilon ^{5/2}) \), we again obtain the DNLS equation
At \( {\mathcal {O}}(\varepsilon ^{7/2}) \), we obtain the equation for \( A_{1,1} \), namely
This is a linearized DNLS equation with some inhomogeneity. All \( A_{1,j} \) for \( j \ge 2 \) satisfy linearized DNLS equations with inhomogeneities, too, whose solutions exist in Gevrey spaces as long as the solutions of the DNLS equation (29) do exist.
Equating the coefficients in front of \( {\textbf{E}}^3 \) to zero gives the determining approximation equation for \( A_{3,0} \) at \( {\mathcal {O}}(\varepsilon ^{3/2}) \), namely
which can be solved w.r.t. \( A_{3,0} \) since \( 9 \omega _0^2 - 9 k_0^2 - 1 \ne 0 \). All \( A_{n,j} \) with \( |n| \ge 3 \) satisfy algebraic equations which are linear in the \( A_{n,j} \) with a nonvanishing coefficient in front. Again the solutions exist in Gevrey spaces as long as the solutions of the DNLS equation (29) do exist.
The set of equations is structured in such a way that they can be solved one after the other. Therefore, more and more terms cancel in the residual, and so, the residual \( \text {Res}_u \) can be made of order \( {\mathcal {O}}(\varepsilon ^{{\widetilde{\beta }}+2}) \).
The approximation for the first-order system (8) is obtained by first setting
and then by setting
From
using \( \omega (1 + \varepsilon K) = \omega _0 + {\mathcal {O}}(\varepsilon )\), we find
and so
which defines the \( a_j \) in (9).
Since in exactly the same way we obtain the residual term \( \varepsilon ^{-{\widetilde{\beta }}} \text {Res}(\varepsilon ^{1/2} \psi ) \) in the equations for the error (10) from \( \text {Res}_u(\varepsilon ^{1/2} \psi _u) \), we find
for the term in (10) as stated in Corollary 8.2. Moreover, we have
More details can be added following the existing literature about the validity of the NLS approximation, cf. [24] for an overview.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Heß, M., Schneider, G. A robust way to justify the derivative NLS approximation. Z. Angew. Math. Phys. 74, 224 (2023). https://doi.org/10.1007/s00033-023-02121-7
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00033-023-02121-7