Abstract
We look for travelling wave fields
satisfying Maxwell’s equations in a nonlinear medium which is not necessarily cylindrically symmetric. The nonlinearity of the medium enters Maxwell’s equations by postulating a nonlinear material law \(D=\varepsilon E+\chi (x,y, \langle |E|^2\rangle )E\) between the electric field E, its time averaged intensity \(\langle |E|^2\rangle \) and the electric displacement field D. We derive a new semilinear elliptic problem for the profiles \(U,{\widetilde{U}}:\mathbb {R}^2\rightarrow \mathbb {R}^3\)
where \(f(x,y,u)=\omega ^2\chi (x,y, |u|^2)u\). Solving this equation we can obtain exact travelling wave solutions of the underlying nonlinear Maxwell equations. We are able to deal with super quadratic and subcritical focusing effects, e.g. in the Kerr-like materials with the nonlinear susceptibility of the form \(\chi (x,y,\langle |E^2|\rangle E) = \chi ^{(3)}(x,y)\langle |E|^2\rangle E\). A variational approach is presented for the semilinear problem. The energy functional associated with the equation is strongly indefinite, since L contains an infinite dimensional kernel. The methods developed in this paper may be applicable to other strongly indefinite elliptic problems and other nonlinear phenomena.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
We are looking for travelling wave fields
solving in the absence of charges and currents the Maxwell system \(\nabla \times E+\partial _t B=0\) (Faraday’s law), \(\nabla \times H=\partial _t D\) (Ampére’s law) together with \(\mathop {\textrm{div}\,}D=0\) and \(\mathop {\textrm{div}\,}B=0\). We require the linear magnetic material law \(B=\mu (x,y)H\) and the nonlinear electric material law \(D=\varepsilon (x,y) E+\chi (x,y, \langle |E|^2\rangle )E\) where \(\langle |E(x,y,z)|^2\rangle = \frac{1}{T}\int _0^T |E(x,y,z,t)|^2\,dt\) is the average intensity of a time-harmonic electric field over one period \(T=2\pi /\omega \). Taking the curl of Faraday’s law and inserting the material laws for B and D together with Ampére’s law we find that E has to satisfy the nonlinear electromagnetic wave equation
Here \(U,\widetilde{U}:\mathbb {R}^2\rightarrow \mathbb {R}^3\) are the profiles of the travelling waves, \(\omega >0\) is the temporal frequency and \(k\in \mathbb {R}\setminus \{0\}\) the spatial wave number in the direction of propagation, \(\varepsilon =\varepsilon (x,y)\) is the permittivity of the medium, \(\mu =\mu (x,y)\) is the magnetic permeability, and \(\chi \) is the scalar nonlinear susceptibility which depends on (x, y) and on the time averaged intensity of E only.
Note that having solved the nonlinear electromagnetic wave equation, one obtains the electric displacement field D directly from the constitutive relation
Moreover the magnetic induction B may be obtained by time integrating Faraday’s law with divergence free initial conditions, which are then preserved in time. Finally the magnetic field H is given by \(H=\mu ^{-1} B\). From (1.2) we get \(\mathop {\textrm{div}\,}(\partial _{tt} D)=0\) and the Gauss law \(\mathop {\textrm{div}\,}(D)=0\) is satisfied due to the travelling wave ansatz of the form (1.1). Altogether, we find exact propagation of the electromagnetic field in the nonlinear medium according to the Maxwell equations with the time-averaged material law (1.3), see also [23, 25, 26].
In physics and mathematical literature there are several simplifications relying on approximations of the nonlinear electromagnetic wave equation. The most prominent one is the scalar or vector nonlinear Schrödinger equation. For instance, one assumes that the term \(\nabla (\mathop {\textrm{div}\,}E)\) in \(\nabla \times (\nabla \times E)= \nabla (\mathop {\textrm{div}\,}E)-\Delta E\) is negligible and can be dropped, or one can use the so-called slowly varying envelope approximation. However, this approach may produce non-physical solutions; see e.g. [1, 11] and references therein. Therefore, in this paper, we are interested in exact travelling wave solutions of the Maxwell equations.
We would like to mention that exact propagation of travelling waves of the nonlinear electromagnetic wave problem (1.2) have been studied analytically so far only in cylindrically symmetric media. Namely, if E is an axisymmetric TE-mode of the form
with
and the scalar function u only depends on r, then solutions of (1.2) have been considered in a series of papers by Stuart and Zhou [25, 27,28,29,30,31,32] for asymptotically constant susceptibilities and by McLeod, Stuart and Troy [14] for a cubic nonlinear polarization. Clearly, E has the form of (1.1) with \({{\tilde{U}}}=0\). The search for these solutions reduces to a one-dimensional variational problem or an ODE for u(r), which simplifies the problem considerably. However, if the medium is not necessarily cylindrically symmetric, then it is not clear how to find travelling waves (1.1) with \({{\tilde{U}}}=0\) analytically and whether any variational approach can be provided with this constraint.
The aim of this work is to provide an analysis of the travelling waves of the general form (1.1) propagating in media, which are not necessarily cylindrically symmetric. To the best of our knowledge it is the first analytical study of travelling waves of (1.2), i.e., the Maxwell system with material law (1.3), where the medium is not supposed to be cylindrically symmetric. We present a variational approach which allows to treat (1.2) and to find ground state solutions of the problem with the least possible energy as well as infinitely many geometrically distinct bound states.
Let us briefly comment on the problem of finding time-harmonic standing wave solutions of (1.2) of the form
This leads to the so-called nonlinear curl–curl problem and has been recently studied e.g. in [4, 5] on a bounded domain and in [7, 15, 18] on \(\mathbb {R}^3\), see also the survey [6] and references therein. In the curl-curl problem, however, U is required to be localized in all space directions, i.e., it is supposed to lie in some Lebesgue space over \(\mathbb {R}^3\). Since travelling waves of the form (1.1) are not localized, they have not been taken into account in these works. We would like to also mention that the study of time-harmonic standing waves in \(\mathbb {R}^3\) in the nonsymmetric case has been presented only in [15, 18]. The methods used there required assumptions about the vanishing properties of the permittivity \(\varepsilon \), or even \(\varepsilon =0\), and the double power behaviour of the nonlinear effect, e.g. \(\chi (x,y,z,E)= \Gamma (x)\min \{|\langle E \rangle |^{p-2},|\langle E \rangle |^{q-2}\}\) with \(2<p<6<q\). Note that \(p=4\) corresponds to the Kerr-type effect, but only for sufficiently strong fields \(|\langle E\rangle |>1\). In this work, however, we are able to treat the probably most common type of nonlinearity in the physics and engineering literature, the Kerr nonlinearity
and we no longer require that the permittivity vanishes at infinity.
The search for travelling waves of the form (1.1) leads to a new nonlinear elliptic problem. Namely, in order to solve the nonlinear electromagnetic wave equation, we observe that the profiles \(U, \widetilde{U}\) of the travelling wave satisfy the elliptic problem
where
For simplicity we have assumed that the magnetic permeability is a constant given by \(\mu =1\). Let us define
where \(L((U,\widetilde{U})^T)\) is defined in the sense of distributions. One verifies that the second-order differential operator
is elliptic and self-adjoint, see Sect. 2 for details.
Our aim is to find solutions \(u:\mathbb {R}^2\rightarrow \mathbb {R}^6\) to the following slight generalization of (1.5) given by
involving the operator L, where we assume \(f(x,u)=\partial _u F(x,u)\). From now on the space variable in \(\mathbb {R}^2\) will be denoted by x instead of (x, y).
Observe that, if
then (1.7) leads to (1.5) and we obtain the exact propagation of the travelling electromagnetic waves (E, B), where E is given by (1.1) and B is provided by Faraday’s law and a subsequent time integration. On the other hand, (1.7) is more general, since the nonlinear potential F may depend on the direction of u and not necessarily on \(|u|=\sqrt{U^2+\widetilde{U}^2}\) as in anisotropic media.
Before we describe our results in more detail, let us comment on the main difference of the current work and previous works [25,26,27,28,29,30] in cylindrically symmetric media. In these previous works the reduction of the curl-curl-operator to the Laplacian when acting on \(E(x,y,z,t)=U(x,y)\cos (kz+\omega t)\) was possible by building-in the constraint \(\mathop {\textrm{div}\,}(U)=0\) into the ansatz as in (1.4). In our work we admit two profiles \(U,{\tilde{U}}\) in the ansatz (1.1) and thus reduce the curl-curl-operator to the operator L above. If we would obtain a one-profile solution (U, 0) of (1.5) with \(k\not =0\) then automatically \(\mathop {\textrm{div}\,}(U)=0\) follows. However, finding such a one-profile solution might be restricted to cylindrically symmetric media. Two-profile solutions like in our ansatz (1.1) allow us to treat more general non-cylindrically symmetric media and a large class of nonlinearities.
The model nonlinearity which we have in mind is
with \(\Gamma \in L^{\infty }(\mathbb {R}^2)\) positive, bounded away from 0 and \(p>2\). In particular if \(p=4\) we deal with the Kerr nonlinearity.
We want to find weak solutions of (1.7) which correspond to critical points of the following functional
defined on a Banach space \(X\subset L^2(\mathbb {R}^2)^6\cap L^p(\mathbb {R}^2)^6\) given later. Here \(b_L(\cdot ,\cdot )\) is the bilinear form associated with L such that \(b_L(u,\varphi )= \int _{\mathbb {R}^2}\langle Lu,\varphi \rangle \, dx\) for all \(u\in {{\,\textrm{dom}\,}}(L)\) and all \(\varphi \in {{\mathcal {C}}}_0^\infty (\mathbb {R}^2)^6\).
Now let us enlist several difficulties underlying the problem. For \(\alpha ,\widetilde{\alpha }\in {{\mathcal {C}}}_0^{\infty }(\mathbb {R}^2)\) and for \(\beta ,\widetilde{\beta }\in {{\mathcal {C}}}_0^\infty (\mathbb {R}^2)^3\) let us denote
and observe that \(L=\overset{\circ }{\nabla }\times \overset{\circ }{\nabla }\times \), \(\overset{\circ }{\nabla }\times \overset{\circ }{\nabla }=0\) and thus \(w:=\overset{\circ }{\nabla }\begin{pmatrix} \alpha \\ \widetilde{\alpha }\end{pmatrix}\in \ker (L)\). Moreover J may be unbounded from above and from below and its critical points may have infinite Morse index. This is due to the infinite-dimensional kernel of L. In addition to these problems related to the strongly indefinite geometry of \(J'\), we also have to deal with the lack of compactness issues. Namely, the functional J is not (sequentially) weak-to-weak\(^*\) continuous, i.e. the weak convergence \(u_n\rightharpoonup u\) in X does not imply that \(J'(u_n)\rightharpoonup J'(u)\) in \(X^*\). In particular, if \(J'(u_n)\rightarrow 0\), we do not know whether u is a critical point of J and solves (1.7).
In Theorem 2.3 we show that the spectrum \(\sigma (L)=\{0\}\cup [k^2,\infty )\), where 0 is an eigenvalue of infinite multiplicity and \([k^2,\infty )\) consists of absolutely continuous spectrum. This allows us to consider the following general assumptions.
-
(V)
\(V\in L^{\infty }(\mathbb {R}^2)\) and \(0<{{\,\mathrm{ess\,inf}\,}}V\le {{\,\mathrm{ess\,sup}\,}}V<k^2\). Moreover there is \(V_0\in \mathbb {R}\) such that \(V-V_0\in L^{\frac{p}{p-2}}(\mathbb {R}^2)\) for some \(p>2\).
-
(F1)
\(F:\mathbb {R}^2\times \mathbb {R}^2\rightarrow [0,\infty )\) is differentiable with respect to the second variable \(u\in \mathbb {R}^2\), and \(f=\partial _uF:\mathbb {R}^2\times \mathbb {R}^2\rightarrow \mathbb {R}^2\) is measurable in \(x\in \mathbb {R}^2\), continuous in \(u\in \mathbb {R}^2\) for a.e. \(x\in \mathbb {R}^2\). Moreover f is \(\mathbb {Z}^2\)-periodic in x i.e. \(f(x,u)=f(x+y,u)\) for \(x,u\in \mathbb {R}^2\), \(y\in \mathbb {Z}^2\).
-
(F2)
\(|f(x,u)|=o(1)\) as \(u\rightarrow 0\) uniformly in \(x\in \mathbb {R}^2\).
-
(F3)
There are \(p>2\) and \(c_1>0\) such that
$$\begin{aligned} |f(x,u)|\le c_1(1+|u|^{p-1})\quad \hbox {for all }u\in \mathbb {R}^2\hbox { and a.e. }x\in \mathbb {R}^2. \end{aligned}$$ -
(F4)
There is \(c_2>0\) such that
$$\begin{aligned} \liminf _{|u|\rightarrow \infty }F(x,u)/|u|^p\ge c_2\quad \hbox {for a.e. }x\in \mathbb {R}^2. \end{aligned}$$
Assumptions (V), (F1)–(F3) are sufficient to show that J is of class \({{\mathcal {C}}}^1\). (F4) provides a lower bound estimate for large |u|, however we will need also its stronger variant:
-
(F4’)
There is \(c_2>0\) such that
$$\begin{aligned} F(x,u)\ge c_2 |u|^p\quad \hbox {for all }u\in \mathbb {R}^2\hbox { and a.e. }x\in \mathbb {R}^2. \end{aligned}$$
In order to deal with ground states one has to assume the following assumption:
-
(F5)
If \( \langle f(x,u),v\rangle = \langle f(x,v),u\rangle >0\), then \(\ \displaystyle F(x,u) - F(x,v) \le \frac{\langle f(x,u),u\rangle ^2-\langle f(x,u),v\rangle ^2}{2\langle f(x,u),u\rangle }\). Moreover \(\langle f(x,u),u\rangle \ge 2F(x,u)\) for every \(x\in \mathbb {R}^2\) and \(u\in \mathbb {R}^2\).
Condition (F5) was introduced in [4, 16]. Conditions (F4) and (F5) imply in particular the convexity of F in u, cf. Remark 6.2. Observe that if F is isotropic in u, i.e. \(F(x,u)=\chi (x,\frac{1}{2}|u|^2)\), \(\chi (x,s)\ge 0\), \(\partial _s\chi (x,s)\) is nondecreasing and \(\chi (x,0)=0\), then (F5) is satisfied. In general, (F5) does not imply the Ambrosetti-Rabinowitz condition (F6) given below, cf. [2]. For instance, if we take \(0<a<b\) then we may find a function \(\chi \) of class \({{\mathcal {C}}}^1\) such that \(\partial _s\chi (s)=s^{\frac{p-2}{2}}\) for \(0\le s<a\), \(\partial _s\chi (s)\) is constant for \(s\in [a,b]\), and \(\partial _s\chi (s)=cs^{\frac{p-2}{2}}\) for \(s>b\) and a suitable \(c>0\). Then (F1)–(F5) are satisfied, but the Ambrosetti-Rabinowitz condition (F6) does not hold. Note also that if f and \(\widetilde{f}\) satisfy (F1)–(F4), then also \(\alpha f+\beta \widetilde{f}\) satisfies (F1)–(F4) for \(\alpha ,\beta >0\). It is not clear if (F5) considered alone has this positive additivity property, however similarly as in [16, Remark 3.3 (b)] we check that if f and \(\widetilde{f}\) satisfy (F1)–(F5) simultaneously, then \(\alpha f+\beta \widetilde{f}\) satisfies the same assumptions.
The main result reads as follows and note that all our assumptions are satisfied for the model nonlinearity given by (1.9).
Theorem 1.1
Suppose that (V) and (F1)–(F5) hold. If \(V=V_0\), or \(V(x)>V_0\) for a.e. \(x\in \mathbb {R}^3\) and (F4’) holds, then there is a nontrivial solution to (1.7) of the form \(u_0=v+w\) with \(v\in H^1(\mathbb {R}^2)^6\setminus \{0\}\) and \(w\in L^2(\mathbb {R}^2)^6\cap L^p(\mathbb {R}^2)^6\) such that \(Lw=0\). Moreover \(u_0\) is a ground state solution, i.e., \(J(u_0)=\inf _{{{\mathcal {N}}}}J>0\), where
If, in addition, f is odd in u and \(V=V_0\), then there is an infinite sequence \((u_n)\) of \(\mathbb {Z}^2\)-distinct solutions, i.e., \((\mathbb {Z}^2*u_n)\cap (\mathbb {Z}^2*u_m)=\emptyset \) for \(n\ne m\), where \(\mathbb {Z}^2*u:=\{u(\cdot +z):z\in \mathbb {Z}^2\}\) for \(u\in X\).
Observe that for our model (1.8), condition (V) provides the relation between the wave number k and the frequency \(\omega \).
Note that \({{\mathcal {N}}}\) contains all nontrivial critical points and \({{\mathcal {N}}}\) is contained in the classical Nehari manifold \(\{u\in X\setminus \{0\}: J'(u)(u)=0\}\), see [20] and Sect. 3 for further properties. Condition (F5) is important to obtain a bounded Palais-Smale sequence for J at level \(\inf _{{{\mathcal {N}}}} J\). However it is not clear how to obtain a bounded Palais-Smale sequence for J under (F1)–(F4) and the classical Ambrosetti-Rabinowitz condition
-
(F6)
There is \(\gamma >2\) such that for every \(u\in \mathbb {R}^2\) and a.e. \(x\in \mathbb {R}^2\)
$$\begin{aligned} \langle f(x,u),u\rangle \ge \gamma F(x,u). \end{aligned}$$
Instead, we need to consider (F4’) together with (F6) in order to prove the boundedness of Palais-Smale sequences, see Lemmas 6.3 and 6.4.
Theorem 1.2
Suppose that F is convex in \(u\in \mathbb {R}^2\), (F1)–(F3), (F4’), (F6) and that (V) holds for a constant function V, i.e., \(V(x)=V_0\) with \(0<V_0<k^2\). Then there is a nontrivial solution to (1.7) of the form \(u_0=v+w\) with \(v\in H^1(\mathbb {R}^2)^6\setminus \{0\}\) and \(w\in L^2(\mathbb {R}^2)^6\cap L^p(\mathbb {R}^2)^6\) such that \(Lw=0\). Moreover \(u_0\) is a least energy solution, i.e., \(J(u_0)=c_0\), where
If, in addition, f is odd in u, then there is an infinite sequence \((u_n)\) of \(\mathbb {Z}^2\)-distinct solutions.
We show that if \(u\in X\) solves (1.7), then the total electromagnetic energy per unit interval in \(x_3\)-direction given by
is finite; see Corollary 2.4. We do not know, however, whether the fields E, D, B and H are localized, i.e. decay to zero as \(|(x_1,x_2)|\rightarrow \infty \), however X lies in \(L^2(\mathbb {R}^2)^6\cap L^p(\mathbb {R}^2)^6\). Therefore E has a weaker decay property and cannot travel in the \((x_1,x_2)\)-plane. The finiteness of the total electromagnetic energy and the localization problem attract a strong attention in the study of self-guided beams of light in nonlinear media; see e.g. [25, 26]. The question of the regularity of the fields remains open.
The first crucial step in our approach is to build the functional and variational setting for the new operator L and the problem (1.7), which will be demonstrated in the next Sect. 2. Next we recognize the strongly indefinite nature of (1.10) and show that it is of the form of the critical point theory presented in [4, 5, 18] and built for curl-curl problems. However, we work in a different functional setting and under a different set of assumptions, so that we have to slightly refine the results, in particular we do not always assume condition (I8) given in Sect. 3. In Sect. 4 we deal with the lack of compactness issue, in particular with the lack of weak-to-weak\(^*\) continuity of \(J'\). By means of a profile decomposition result (Theorem 5.1), we are able to prove this regularity in some weakly closed topological constraint \({{\mathcal {M}}}\subset X\), so that a weak limit point of a bounded Palais-Smale sequence of \({{\mathcal {M}}}\) is a critical point of J. In the last Sect. 6 we show that a variant of Cerami’s condition holds [10]. In particular we show that any Cerami sequence is bounded, and we emphasize that the proof the boundedness is considerably nonstandard and not straightforward, even if (F6) holds. Some technical inequalities have to be worked out, see details in Lemmas 6.3, 6.4. Finally we complete the proof of Theorems 1.1 and 1.2.
2 Variational setting
We introduce the following notation. If \(u=\begin{pmatrix} U\\ {{\tilde{U}}}\end{pmatrix}\in \mathbb {R}^6\), then \(|u|:=\big (|U|^2+|{{\tilde{U}}}|^2\big )^{1/2}\) and \(\langle \cdot ,\cdot \rangle \) denotes the Euclidean inner product in \(\mathbb {R}^N\), \(N\ge 1\). In the sequel \(\langle \cdot ,\cdot \rangle _2\) denotes the inner product in \(L^2(\mathbb {R}^2)^6\) and \(|\cdot |_q\) denotes the usual \(L^q\)-norm for \(q\in [1,+\infty ]\). Furthermore we denote by C a generic positive constant which may vary from one inequality to the next. We always assume that \(k\ne 0\).
Let us introduce the following space
and note that it is a closed subspace of \(H^1(\mathbb {R}^2)^6\). Let us consider the following norm in \({{\mathcal {V}}}\)
which is equivalent to the standard \(H^1\)-norm. Let \({{\mathcal {W}}}\) be the completion of vector fields \(w=\overset{\circ }{\nabla }\begin{pmatrix} \Phi \\ \widetilde{\Phi }\end{pmatrix}\), \(\Phi ,\widetilde{\Phi }\in {{\mathcal {C}}}_0^{\infty }(\mathbb {R}^2)^3\) with respect to the following norm
so that \({{\mathcal {W}}}\subset L^2(\mathbb {R}^2)^6\cap L^p(\mathbb {R}^2)^6\). Note that \({{\mathcal {V}}}\cap {{\mathcal {W}}}=\{0\}\) and we may define a norm on \({{\mathcal {V}}}\oplus {{\mathcal {W}}}\) as follows
Theorem 2.1
The spaces \({{\mathcal {V}}}\) and \({{\mathcal {W}}}\) are closed subspaces of \(L^2(\mathbb {R}^2)^6\) and orthogonal with respect to \(\langle \cdot ,\cdot \rangle _2\) and \(X:={{\mathcal {V}}}\oplus {{\mathcal {W}}}\) is the completion of \({{\mathcal {C}}}_0^{\infty }(\mathbb {R}^2)^6\) with respect to the norm \(\Vert \cdot \Vert \).
Proof
It is clear that \({{\mathcal {V}}}\) and \({{\mathcal {W}}}\) are closed subspaces of \(L^2(\mathbb {R}^2)^6\) and orthogonal with respect to \(\langle \cdot ,\cdot \rangle _2\). Let \(\varphi =\begin{pmatrix} \Phi \\ \widetilde{\Phi }\end{pmatrix}\in {{\mathcal {C}}}_0^{\infty }(\mathbb {R}^2)^6\) and let \(\alpha , \widetilde{\alpha }\in H^{1}(\mathbb {R}^2)\cap {{\mathcal {C}}}^{\infty }(\mathbb {R}^2)\) be the unique solutions to
and
respectively. Since \(\alpha \) is the Bessel potential of the \({{\mathcal {C}}}_0^\infty \)-function \(-(\partial _{x_1} \Phi _1 +\partial _{x_2} \Phi _2 +k \widetilde{\Phi }_3)\) we find that \(\alpha \in W^{l,s}(\mathbb {R}^2)\) for any \(l\in \mathbb {N}\) and \(1<s<\infty \), cf. [24, Chapter V Sect. 3]. Arguing similarly we see that the same also holds for \( \widetilde{\alpha }\). Taking standard mollifiers and applying a cutoff argument, we find \( \alpha _n,\widetilde{\alpha }_n\in {{\mathcal {C}}}_0^{\infty }(\mathbb {R}^2)\) such that \(\alpha _n\rightarrow \alpha \) and \(\widetilde{\alpha }_n\rightarrow \widetilde{\alpha }\) in \(W^{l,s}(\mathbb {R}^2)\) for any \(l\in \mathbb {N}\) and \(1<s<\infty \). In particular this implies
and \(\varphi _{{\mathcal {W}}}\in W^{l,s}(\mathbb {R}^2)\) for all \(l\in \mathbb {N}\) and \(1<s<\infty \). Moreover, \({{\mathcal {W}}}\) is clearly contained in the completion of \({{\mathcal {C}}}_0^{\infty }(\mathbb {R}^2)^6\) with respect to the norm \(\Vert \cdot \Vert \). Since in particular \(\varphi _{{\mathcal {W}}}\in H^1(\mathbb {R}^2)^6\) we obtain by integration by parts
for every \(\beta ,\widetilde{\beta }\in {{\mathcal {C}}}_0^{\infty }(\mathbb {R}^2)\). Therefore \(\varphi _{{\mathcal {V}}}:=\varphi -\varphi _{{\mathcal {W}}}\in {{\mathcal {V}}}\) and we obtain the following Helmholtz-type decomposition
and moreover we have shown that \(\varphi \in X\). It remains to show that also \({{\mathcal {V}}}\) is contained in the completion of \({{\mathcal {C}}}_0^\infty (\mathbb {R}^2)^6\) with respect to \(\Vert \cdot \Vert \). To see this, let \(v\in {{\mathcal {V}}}\) and take \((\varphi _n)\subset {{\mathcal {C}}}_0^{\infty }(\mathbb {R}^2)^6\) such that \(\varphi _n\rightarrow v\) in \(H^1(\mathbb {R}^2)^6\). Let us decompose \(\varphi _n=\varphi _{{\mathcal {V}}}^n+\varphi _{{\mathcal {W}}}^n\in {{\mathcal {V}}}\oplus {{\mathcal {W}}}\) and let \(\mathrm{cl\,}_{L^2\cap L^p}{{\mathcal {V}}}\) denote the closure of \({{\mathcal {V}}}\) in \(L^2(\mathbb {R}^2)^6\cap L^p(\mathbb {R}^2)^6\). Observe that \(\big (\mathrm{cl\,}_{L^2\cap L^p}{{\mathcal {V}}}\big )\cap {{\mathcal {W}}}=\{0\}\), so that there is a continuous \(L^2(\mathbb {R}^2)^6\cap L^p(\mathbb {R}^2)^6\)-projection of \(\big (\mathrm{cl\,}_{L^2\cap L^p}{{\mathcal {V}}}\big )\oplus {{\mathcal {W}}}\) onto \({{\mathcal {W}}}\). Since \(\varphi _n\rightarrow v\) in \(L^2(\mathbb {R}^2)^6\cap L^p(\mathbb {R}^2)^6\), we infer that \(\varphi _{{\mathcal {W}}}^n\rightarrow 0\) in \({{\mathcal {W}}}\). Similarly, arguing with the closure of \({{\mathcal {W}}}\) in \(H^1(\mathbb {R}^2)^6\) we get \(\varphi _{{\mathcal {V}}}^n\rightarrow v\) in \({{\mathcal {V}}}\). Therefore
as \(n\rightarrow \infty \) and we conclude that \({{\mathcal {V}}}\oplus {{\mathcal {W}}}\) is the completion of \({{\mathcal {C}}}_0^{\infty }(\mathbb {R}^2)^6\) with respect to the norm \(\Vert \cdot \Vert \). \(\square \)
Now we investigate the differential operator L and its spectrum in the following two results.
Proposition 2.2
The second-order differential operator (1.6) is elliptic and self-adjoint on the domain \({{\,\textrm{dom}\,}}(L)\). Its associated bilinear form \(b_L: {{\,\textrm{dom}\,}}(b_L)\times {{\,\textrm{dom}\,}}(b_L)\rightarrow \mathbb {R}\) is given by
with \({{\,\textrm{dom}\,}}(b_L)= \{u\in L^2(\mathbb {R}^2)^6 \text{ s.t. } \overset{\circ }{\nabla }\times u\in L^2(\mathbb {R}^2)^6\}\).
Proof
Let us show that \({{\,\textrm{dom}\,}}(b_L)\) is closed in \(L^2(\mathbb {R}^2)^6\). First note the symmetry property \(\langle \overset{\circ }{\nabla }\times u,v\rangle = \langle u,\overset{\circ }{\nabla }\times v\rangle \) for all \(u,v \in {{\,\textrm{dom}\,}}(b_L)\). If \((u_n)_{n\in \mathbb {N}}\in {{\,\textrm{dom}\,}}(b_L)\) is a sequence such \(u_n\rightarrow u\) in \(L^2(\mathbb {R}^2)^6\) and \(\overset{\circ }{\nabla }\times u_n\rightarrow \psi \) in \(L^2(\mathbb {R}^2)^6\) for some \(\psi \in L^2(\mathbb {R}^2)^6\) then the symmetry property implies \(\langle \overset{\circ }{\nabla }\times u_n,\phi \rangle = \langle u_n,\overset{\circ }{\nabla }\times \phi \rangle \rightarrow \langle u,\overset{\circ }{\nabla }\times \phi \rangle \) for all \(\phi \in {{\,\textrm{dom}\,}}(b_L)\). Thus \(\psi =\overset{\circ }{\nabla }\times u\) and therefore \(b_L\) is a closed symmetric bilinear form. It is therefore the associated bilinear form of a unique selfadjoint operator, cf. [22, Theorem VIII.15]. For all \(u,v \in {{\mathcal {C}}}_0^\infty (\mathbb {R}^2)^6\) we see that
Hence \(b_L\) is the bilinear form of the operator \(L: {{\,\textrm{dom}\,}}(L) \rightarrow L^2(\mathbb {R}^2)^6\) with \({{\,\textrm{dom}\,}}(L)=\{u\in L^2(\mathbb {R}^2)^6 \text{ s.t. } Lu\in L^2(\mathbb {R}^2)^6\}\). \(\square \)
Theorem 2.3
The operator \(L:{{\,\textrm{dom}\,}}(L) \subset L^2(\mathbb {R}^2)^6 \rightarrow L^2(\mathbb {R}^3)^6\) has spectrum \(\sigma (L)=\{0\}\cup [k^2,\infty )\), where 0 is an eigenvalue of infinite multiplicity and \([k^2,\infty )\) consists of absolutely continuous spectrum.
Proof
Let us consider the symbol \({\hat{L}}(\xi )\) which is a complex hermitian \(6\times 6\) matrix for \(\xi \in \mathbb {R}^2\). Let us denote by \(\sigma ({\hat{L}}(\xi ))\) the matrix eigenvalues of \({\hat{L}}(\xi )\). We will verify \(\sigma (L)=\overline{\bigcup _{\xi \in \mathbb {R}^2}\sigma ({\hat{L}}(\xi ))}\) by two steps:
-
(i)
\(\lambda \) is a matrix eigenvalue of \({\hat{L}}(\xi )\) for some \(\xi \in \mathbb {R}^2\) \(\Rightarrow \) \(\lambda \in \sigma (L)\)
-
(ii)
\(\text { dist}(\lambda , \overline{\bigcup _{\xi \in \mathbb {R}^2}\sigma ({\hat{L}}(\xi ))})>0\) \(\Rightarrow \) \(\lambda \) lies in the resolvent set of L
Since the spectrum of L is closed, (i) and (ii) imply the claimed representation of \(\sigma (L)\).
Proof of (i): Let \(a\in \mathbb {R}^6\) be a unit vector with \({\hat{L}}(\xi _0)a = \lambda a\). If \(\rho \in {{\mathcal {C}}}_0^\infty (\mathbb {R}^2)\) is a function with \(\Vert \rho \Vert _{L^2(\mathbb {R}^2)}=1\) then let \({\hat{u}}_k(\xi ) = k \rho (k(\xi -\xi _0)) a\). One finds that \(\Vert {\hat{u}}_k\Vert _{L^2(\mathbb {R}^2)^6}=1\) and
by continuity of the symbol and dominated convergence. By Plancherel’s theorem we have that \((L-\lambda )u_k\rightarrow 0\) as \(k\rightarrow \infty \) in \(L^2(\mathbb {R}^2)^6\) with \(\Vert u_k\Vert _{L^2(\mathbb {R}^2)^6}=1\) so that \(\lambda \in \sigma (L)\) by Weyl’s criterion.
Proof of (ii): Let \(|\lambda -\lambda _j(\xi )|\ge \delta >0\) for all \(\xi \in \mathbb {R}^2\), all eigenvalues \(\lambda _1(\xi ),\ldots ,\lambda _6(\xi )\) of the matrix \({\hat{L}}(\xi )\) and let us denote the corresponding orthonormal basis of eigenvectors by \(v_1(\xi ),\ldots ,v_6(\xi )\in \mathbb {C}^6\). Then, for given \(f\in L^2(\mathbb {R}^2)^6\), solving the problem \((L-\lambda {{\,\textrm{Id}\,}})u=f\) is equivalent to computing the inverse Fourier transform of
By Plancherel’s theorem this amounts to
This shows that \(\lambda \) belongs to the resvolent set of L.
The remaining part of the proof is dedicated to the computation of the matrix eigenvalus \(\lambda _1(\xi ),\ldots ,\lambda _6(\xi )\). It turns out that the characteristic polynomial of \({\hat{L}}(\xi )\) is given by \(\lambda ^2(|\xi |^2+k^2-\lambda )^4\) and hence \(\sigma (L)=\{0\}\cup [k^2,\infty )\). Indeed, taking the Fourier-transform we find the symbol of L, whose characteristic polynomial is given by
Interchanging column 3 and 6 as well as line 3 and 6 we get a block structure which leads to
since both matrices have the same determinant \(-\lambda (|\xi |^2+k^2-\lambda )^2\). Thus the matrix eigenvalues are given by \(\sigma ({\hat{L}}(\xi ))=\{0, |\xi |^2+k^2\}\). The zero eigenvalue of infinite multiplicity if generated by all vector fields \(\overset{\circ }{\nabla }\begin{pmatrix} \alpha \\ \widetilde{\alpha }\end{pmatrix}\) with \(\alpha ,\widetilde{\alpha }\in C_c^\infty (\mathbb {R}^2)\). \(\square \)
Let \(p_{{{\mathcal {V}}}}:X\rightarrow {{\mathcal {V}}}\), \(p_{{{\mathcal {W}}}}:X\rightarrow {{\mathcal {W}}}\) denote the projections of X onto \({{\mathcal {V}}}\), \({{\mathcal {W}}}\), respectively. Usually we just write \(u=v+w\in {{\mathcal {V}}}\oplus {{\mathcal {W}}}\), where \(v=p_{{{\mathcal {V}}}}(u)\in {{\mathcal {V}}}\) and \(w=u-v=p_{{{\mathcal {W}}}}(u)\in {{\mathcal {W}}}\). Observe that \({{\mathcal {V}}}\) and \({{\mathcal {W}}}\) are both subspaces of \({{\,\textrm{dom}\,}}(b_L)\) and that for \(v+w\in X={{\mathcal {V}}}\oplus {{\mathcal {W}}}\) we have
For \(u\in {{\,\textrm{dom}\,}}(b_L)\) using the duality pairing between \({{\,\textrm{dom}\,}}(b_L)'\) and \({{\,\textrm{dom}\,}}(b_L)\) we can define Lu by setting \(\langle Lu,\phi \rangle _{{{\,\textrm{dom}\,}}(b_L)'\times {{\,\textrm{dom}\,}}(b_L)}:= b_L(u,\phi )\) for all \(\phi \in {{\,\textrm{dom}\,}}(b_L)\). In this way \(L: X\rightarrow {{\,\textrm{dom}\,}}(b_L)'\) is well-defined and has the kernel \({{\mathcal {W}}}\). Note that L restricted to \({{\mathcal {V}}}\) coincides with the vector Schrödinger operator \(-\Delta +k^2\) acting diagonally on elements of \({{\mathcal {V}}}\).
We say that \(u\in \begin{pmatrix} U \\ \widetilde{U} \end{pmatrix}\in X\) is a weak solution to (1.7) provided that
for any \(\varphi \in {{\mathcal {C}}}_0^{\infty }(\mathbb {R}^2)^6\). From now on we assume that (F1)–(F4) and (V) are satisfied. Observe that for \(u=v+w\in {{\mathcal {V}}}\oplus {{\mathcal {W}}}\) we get
Corollary 2.4
\(J:X\rightarrow \mathbb {R}\) is of class \({{\mathcal {C}}}^1\) and \(u\in X\) is a critical point of J if and only if u is a weak solution to (1.7). Moreover, if (1.8) holds and E of the form (1.1) is a travelling wave field with the profiles U and \({{\tilde{U}}}\), where \(u= \begin{pmatrix} U \\ \widetilde{U} \end{pmatrix}\in X\) is a critical point of J, then the total electromagnetic energy per unit interval on the \(x_3\)-axis is finite, i.e.
Proof
The first statement is a consequence of Theorem 2.1. According to the material laws
and by the Faraday’s law
Therefore
\(\square \)
3 Abstract variational approach
Our approach is based on the critical point theory from [4, 5, 18], however sometimes we do not assume the monotonicity assumption (I8) given below and we present results with emphasis on where this condition is crucial.
Let \(J:X\rightarrow \mathbb {R}\) be a functional of the form
such that X is a reflexive Banach space with the norm \(\Vert \cdot \Vert \) and a topological direct sum decomposition \(X=X^+\oplus X^-\), where \(X^+\) is a Hilbert space with a scalar product \(\langle \cdot , \cdot \rangle \). For \(u\in X\) we denote by \(u^+\in X^+\) and \(u^- \in X^-\) the corresponding summands so that \(u = u^++u^-\). We may assume \(\langle u,u \rangle = \Vert u\Vert ^2\) for any \(u\in X^+\) and \(\Vert u\Vert ^2 = \Vert u^+\Vert ^2+\Vert u^- \Vert ^2\). We define the topology \({{\mathcal {T}}}\) on X as the product of the norm topology in \(X^+\) and the weak topology in \(X^-\). Hence \(u_n{\mathop {\longrightarrow }\limits ^{{{\mathcal {T}}}}}u\) if and only if \(u_n^+\rightarrow u^+\) and \(u_n^-\rightharpoonup u^-\). Let us define the set
Clearly \({{\mathcal {M}}}\) contains all critical points of J and we assume the following conditions introduced in [4, 5]:
-
(I1)
\(I\in {{\mathcal {C}}}^1(X,\mathbb {R})\) and \(I(u)\ge I(0)=0\) for any \(u\in X\).
-
(I2)
I is \({{\mathcal {T}}}\)-sequentially lower semicontinuous: if \(u_n{\mathop {\longrightarrow }\limits ^{{{\mathcal {T}}}}}u\quad \textrm{then} \quad \liminf I(u_n)\ge I(u)\).
-
(I3)
If \(u_n{\mathop {\longrightarrow }\limits ^{{{\mathcal {T}}}}}u\) and \(I(u_n)\rightarrow I(u)\) then \(u_n\rightarrow u\).
-
(I4)
\(\Vert u^+\Vert +I(u)\rightarrow \infty \) as \(\Vert u\Vert \rightarrow \infty \).
-
(I5)
If \(u\in {{\mathcal {M}}}\) then \(I(u)<I(u+v)\) for every \(v\in X^- \setminus \{0\}\).
-
(I6)
There exists \(r>0\) such that \(a:=\inf \limits _{u\in X^+,\Vert u\Vert =r} J(u)>0\).
-
(I7)
\(I(t_nu_n)/t_n^2\rightarrow \infty \) if \(t_n\rightarrow \infty \) and \(u_n^+\rightarrow u^+\ne 0\) as \(n\rightarrow \infty \).
Observe that if I is strictly convex, continuous and satisfies (I4), then (I2) and (I5) are clearly satisfied. Moreover, for any \(u\in X^+\) we find \(m(u)\in {{\mathcal {M}}}\) which is the unique global maximizer of \(J|_{u+X^-}\). From now on we assume (I1)–(I7). Note that m needs not be \({{\mathcal {C}}}^1\), and \({{\mathcal {M}}}\) needs not be a differentiable manifold since \(I'\) is only continuous, however from [5, Proof of Theorem 4.4] we observe \(m:X^+\rightarrow {{\mathcal {M}}}\) is a homeomorphism and \(\widetilde{J}:=J\circ m: X^+\rightarrow \mathbb {R}\) is of class \({{\mathcal {C}}}^1\).
We introduce the following min-max level
where
If we look for ground state solutions with the least possible energy, we consider a Nehari-type constraint \({{\mathcal {N}}}\) and the following condition
-
(I8)
\(\frac{t^2-1}{2}I'(u)[u]+I(u) - I(tu+v)=\frac{t^2-1}{2}I'(u)[u] + tI'(u)[v] + I(u) - I(tu+v) \le 0\) for every \(u\in {{\mathcal {N}}}\), \(t\ge 0\), \(v\in X^-\), where
$$\begin{aligned} {{\mathcal {N}}}:= \{u\in X\setminus X^-: J'(u)|_{\mathbb {R}u\oplus X^-}=0\} = \{u\in {{\mathcal {M}}}\setminus X^-: J'(u)[u]=0\} \subset {{\mathcal {M}}}. \end{aligned}$$
The constraint \({{\mathcal {N}}}\) is a generalized Nehari-type manifold inspired by a constrained considered in [21] for the Schrödinger problem. We state the following result obtained in [18, Theorem 3.3] under assumptions (I1)–(I8).
Theorem 3.1
Suppose J of the form (3.1) satisfies (I1)–(I7). Then J has a sequence \((u_n)\subset {{\mathcal {M}}}\) such that \(J(u_n)\rightarrow c\ge a>0\) and \(J'(u_n)(1+\Vert u_n^+\Vert )\rightarrow 0\) as \(n\rightarrow \infty \). If, in addition, (I8) holds, then \(c=\inf _{{{\mathcal {N}}}}J\).
Proof
For the reader’s convenience, we shortly sketch the proof. Recall that by (I5)–(I7), \(\widetilde{J}(u)\ge J(u)\ge a\) for \(u\in X^+\) and \(\Vert u\Vert =r\), and \(\widetilde{J}(tu)/t^2\rightarrow -\infty \) as \(t\rightarrow \infty \). Thus \(\widetilde{J}\) has the mountain pass geometry and similarly as in [5, Theorem 4.4] we may define the mountain pass level
where
By the mountain pass theorem there exists a Cerami sequence \((v_n)\) for \(\widetilde{J}\) at the level \(c_{{\mathcal {M}}}\), see [3, 10]. Since \(\widetilde{J}'(v_n)=J'(m(v_n))\) (see [5, 18]), then setting \(u_n:=m(v_n)\) we obtain
If (I8) holds, then by [18, Theorem 3.3 (b)], \(c=\inf _{{{\mathcal {M}}}}J\). \(\square \)
Now we present a multiplicity result. Recall that for a topological group acting on X, the orbit of \(u\in X\) is often denoted by \(G*u\), i.e.,
A set \(A\subset X\) is called G-invariant if \(gA\subset A\) for all \(g\in G\). A functional \(J: X\rightarrow \mathbb {R}\) is called G-invariant and a map \(T: X\rightarrow X^*\) is called G-equivariant if \(J(gu)=J(u)\) and \(T(gu)=gT(u)\) for all \(g\in G\), \(u\in X\). In our application we use the action given by \(\mathbb {Z}^2\)-translations.
Assume that G is a topological group such that
-
(G)
G acts on X by isometries and discretely in the sense that for each \(u\ne 0\), \((G*u)\setminus \{u\}\) is bounded away from u. Moreover, J is G-invariant and \(X^+, X^-\) are G-invariant.
Observe that \({{\mathcal {M}}}\) is G-invariant and \(m:X^+\rightarrow {{\mathcal {M}}}\) is G-equivariant.
We shall use the notation
and call \(G*u\) a critical orbit whenever \(u\in {{\mathcal {K}}}\). Moreover the following variant of the Cerami condition between the levels \(\alpha , \beta \in \mathbb {R}\) has been introduced in [18]:
- \((M)_\alpha ^\beta \):
-
- (a):
-
Let \(\alpha \le \beta \). There exists \(M_\alpha ^\beta \) such that \(\limsup _{n\rightarrow \infty }\Vert u_n\Vert \le M_\alpha ^{\beta }\) for every \((u_n)\subset X^+\) satisfying \(\alpha \le \liminf _{n\rightarrow \infty }\widetilde{J}(u_n)\le \limsup _{n\rightarrow \infty }\widetilde{J}(u_n)\le \beta \) and \((1+\Vert u_n\Vert )\widetilde{J}'(u_n)\rightarrow 0\).
- (b):
-
Suppose in addition that the number of critical orbits in \(\widetilde{J}_\alpha ^\beta \) is finite. Then there exists \(m_\alpha ^\beta >0\) such that if \((u_n), (v_n)\) are two sequences as above and \(\Vert u_n-v_n\Vert <m_\alpha ^\beta \) for all n large, then \(\liminf _{n\rightarrow \infty }\Vert u_n-v_n\Vert =0\).
Note that if J is even, then m is odd and \({{\mathcal {M}}}\) is symmetric, i.e. \({{\mathcal {M}}}=-{{\mathcal {M}}}\). Note also that \((M)_\alpha ^\beta \) is a condition on \(\widetilde{J}\). We slightly generalize [18, Theorem 3.5 (b)].
Theorem 3.2
Suppose J of the form (3.1) satisfies (I1)–(I7), J is even and \(\textrm{dim}\,(X^+)=\infty \). If \((M)_0^{\beta }\) holds for every \(\beta >0\), then J has infinitely many distinct critical orbits.
In fact, in [18, Theorem 3.5 (b)] condition (I8) has been assumed as well, which is redundant for this part of the result. Indeed, by the inspection of the proof of [18, Theorem 3.5 (b)], the following min-max values
are well-defined and finite, where \(\Sigma := \{A\subset X^+: A=-A \text { and } A \text { is compact}\}\), and for \(A\in \Sigma \), we define a variant of Benci’s pseudoindex [3, 18].
where r is as in (I6), \(S(0,r):=\{u\in X^+:\Vert u\Vert =r\}\), \(\gamma \) is Krasnoselskii’s genus and
We observe also that \(\beta _k\ge a\). Clearly (I6) implies that
for any \(A\in \Sigma \) such that \(i^*(A)\ge k\). Hence, if we assume for contradiction that there is a finite number of distinct orbits \(\{G*u: u\in {{\mathcal {K}}}\}\), then arguing as in the proof of [18, Theorem 3.5 (b)], \(\beta _k\) are critical values and \(a\le \beta _1<\beta _2<\ldots \), which is impossible by the assumption. Therefore Theorem 3.2 holds true.
4 Application of the abstract variational approach
Recall that we assume (F1)–(F4) and (V). We assume, in addition, that F is convex in \(u\in \mathbb {R}^2\). Let \(X^+:={{\mathcal {V}}}\) and \(X^-:={{\mathcal {W}}}\). Let us define \(I:X\rightarrow \mathbb {R}\) such that
hence \(J(v+w)=\frac{1}{2}\Vert v\Vert ^2-I(v+w)\) for \(v\in {{\mathcal {V}}}\) and \(w\in {{\mathcal {W}}}\) so that J is of the form (3.1). In this section we check that (I1)–(I7) are satisfied.
Let us define the following map
which is continuous and convex.
Lemma 4.1
Conditions (I1)–(I4) are satisfied.
Proof
In view of (F1), (F2), (F3) and (V) we easily get (I1) and by the convexity assumption on F also (I2). To see that (I3) holds let us suppose that \(u_n{\mathop {\longrightarrow }\limits ^{{{\mathcal {T}}}}}u\) and \(I(u_n)\rightarrow I(u)\). Then by the fact that \(F\ge 0\) and weak lower semicontinuity
and passing to a subsequence \(\big (V(x)^{1/2}u_n\big )\rightharpoonup \big (V(x)^{1/2}u\big )\) in \(L^2(\mathbb {R}^2)^6\). Thus \(u_n\rightarrow u\) in \(L^2(\mathbb {R}^2)^6\) and \(u_n(x)\rightarrow u(x)\) for a.e. \(x\in \mathbb {R}^2\). Then, by Vitali’s convergence theorem
as \(n\rightarrow \infty \). From (4.1) we infer that \(I_F(u_n-u)\rightarrow 0\). Observe that, by (F4) and \(F\ge 0\), for any \(\varepsilon >0\) we find a constant \({\tilde{c}}_\varepsilon >0\) such that
Therefore
and we get \(u_n\rightarrow u\) in \(L^p(\mathbb {R}^2)^6\), which completes the proof of (I3). Now note that if \(I(v_n+w_n)\) is bounded with \((v_n)\subset {{\mathcal {V}}}\) and \(w_n\subset {{\mathcal {W}}}\) such that \(v_n\) is bounded, then \(w_n\) is bounded in \(L^2(\mathbb {R}^2)^6\) and by (4.2), \(w_n\) is bounded in \(L^p(\mathbb {R}^2)^6\). Thus (I4) holds. \(\square \)
Let \(v\in {{\mathcal {V}}}\). Since \({{\mathcal {W}}}\ni w\mapsto I(v+w)\in \mathbb {R}\) is strictly convex and coercive, \(I(u)\ge I(0)=0\), then \(I(v+\cdot )\) attains a unique global minimum at some \(w(v)\in {{\mathcal {W}}}\). Hence the set
obviously contains all critical points of J and there holds:
-
(I5)
If \(u\in {{\mathcal {M}}}\) then \(I(u)<I(u+w)\) for every \(w\in {{\mathcal {W}}}\setminus \{0\}\).
Since \(I(v+w(v))\le I(v)\) for \(v\in {{\mathcal {V}}}\), we see that w maps bounded sets into bounded sets and in view of (I2) and (I3) one can easily show the continuity of w.
Now we show the following properties which imply the linking geometry in the spirit of Benci and Rabinowitz [8].
Lemma 4.2
The following conditions are satisfied.
(I6) There exists \(r>0\) such that \(a:=\inf \limits _{v\in {{\mathcal {V}}},\Vert v\Vert =r} J(v)>0\).
(I7) \(I(t_nu_n)/t_n^2\rightarrow \infty \) if \(t_n\rightarrow \infty \) and \(p_{{{\mathcal {V}}}}(u_n)\rightarrow v\ne 0\) as \(n\rightarrow \infty \).
Proof
Observe that (F2) and (F3) imply that for any \(\varepsilon >0\) there is \(c_\varepsilon >0\) such that
Then, taking \(0<\varepsilon <k^2-{{\,\mathrm{ess\,sup}\,}}V\), by (V) we get
for \(v\in {{\mathcal {V}}}\), and we obtain (I6) by the Sobolev embedding of \(H^1(\mathbb {R}^2)^6\) into \(L^p(\mathbb {R}^2)^6\) and by choosing \(r>0\) sufficiently small. Now suppose that \(I(t_nu_n)/t_n^2\) is bounded, \(t_n\rightarrow \infty \) and \(p_{{{\mathcal {V}}}}(u_n)\rightarrow v\ne 0\) as \(n\rightarrow \infty \). Note that if \(u_n=v_n+w_n\) with \(v_n=p_{{{\mathcal {V}}}}(u_n)\in {{\mathcal {V}}}\) and \(w_n\in {{\mathcal {W}}}\), then by (4.2)
Since \(I(t_nu_n)/t_n^2\) is bounded and taking \(0<\varepsilon <{{\,\mathrm{ess\,inf}\,}}V\) we obtain that \(v_n+w_n\rightarrow 0\) in \(L^p(\mathbb {R}^2)^6\). Since \(v_n+w_n\) is also bounded in \(L^2(\mathbb {R}^2)^6\) it has a weakly convergent subsequence and the weak \(L^2\)-limit of this subsequence has to coincide with the the strong \(L^p\)-limit. Thus, dropping subsequence indices, we get in total that \(w_n\rightharpoonup -v\not =0\) in \({{\mathcal {W}}}\) and obtain a contradiction. \(\square \)
In view of Theorem 3.1, there is a sequence \(u_n=v_n+w_n\in {{\mathcal {M}}}\) such that \(J'(u_n)\rightarrow c\ge a>0\) and \(J'(u_n)(1+\Vert v_n\Vert )\rightarrow 0\) as \(n\rightarrow \infty \). We will show that \((u_n)\) is bounded if, in addition, (F5) or (F6) holds, see Lemma 6.4 in Sect. 6. The next section however is devoted to profile decompositions of bounded sequences in \({{\mathcal {M}}}\), which will be important to show that there is a nontrivial weak limit point of the sequence \((u_n)\) up to translations, which is a critical point.
5 Profile decompositions in \({{\mathcal {M}}}\)
In addition to (F1)–(F4) and (V) we assume also that \(V=V_0\) or \(V(x)>V_0\) for a.e. \(x\in \mathbb {R}^3\). Let us define \(I_0:X\rightarrow \mathbb {R}\) such that
Clearly \(I_0\) satisfies analogous conditions (I1)–(I4) and for \(v\in {{\mathcal {V}}}\), \(I_0(v+\cdot )\) attains a unique global minimum at some \(w_0(v)\in {{\mathcal {W}}}\). Thus, similarly as in (3.2), we define
Recall that profile decompositions of bounded sequences in \(H^1(\mathbb {R}^N)\) have been obtained for instance by Nawa [19], Hmidi and Keraani [13], and by Gérard [12] in \(\dot{H}^s(\mathbb {R}^N)\). A similar result cannot be obtained in \(X={{\mathcal {V}}}\oplus {{\mathcal {W}}}\), since \({{\mathcal {W}}}\) is not locally compactly embedded into \(L^p(\mathbb {R}^2)\) for \(p\ge 1\). However we prove the following decomposition result in the topological constraints \({{\mathcal {M}}},{{\mathcal {M}}}_0\subset X\).
Theorem 5.1
If \((u_n)\) is bounded in \({{\mathcal {M}}}\), then, passing to a subsequence, there is \(K\in \mathbb {N}\cup \{\infty \}\) and there are \(\widetilde{u}_0\in {{\mathcal {M}}}\) and sequences \((\widetilde{u}_i)_{i\ge 1}\subset {{\mathcal {M}}}_0\), \((y_n^i)_{n\ge i}\subset \mathbb {Z}^2\) such that \(y_n^0=0\), \(|y_n^i-y_n^j|\rightarrow \infty \) as \(n\rightarrow \infty \) for \(i\ne j\), and the following conditions hold:
-
(a)
If \(K<\infty \), then \(\widetilde{u}_i\ne 0\) for \(1\le i\le K\) and \(\widetilde{u}_i=0\) for \(i>K\). If \(K=\infty \), then \(\widetilde{u}_i\ne 0\) for all \(i\ge 1\).
-
(b)
\(u_n(\cdot +y_n^i)\rightharpoonup \widetilde{u}_i\hbox { in }X\) for any \(0\le i < K+1\) (Footnote 1).
-
(c)
\(u_n(\cdot +y_n^i)\rightarrow \widetilde{u}_i\hbox { in }L_{loc}^{p}(\mathbb {R}^2)^6\) and a.e. in \(\mathbb {R}^2\) for any \(0\le i < K+1\).
-
(d)
There holds
$$\begin{aligned} \lim _{n\rightarrow \infty }\Vert p_{{{\mathcal {V}}}}(u_n)\Vert ^2\ge & {} \sum _{i=0}^{\infty } \Vert p_{{{\mathcal {V}}}}(\widetilde{u}_i)\Vert ^2, \end{aligned}$$(5.1)$$\begin{aligned} \lim _{n\rightarrow \infty }I(u_n)\ge & {} I(\widetilde{u}_0)+\sum _{i=1}^{\infty } I_0(\widetilde{u}_i), \end{aligned}$$(5.2)$$\begin{aligned} \lim _{n\rightarrow \infty }I_F(u_n)= & {} \sum _{i=0}^{\infty } I_F(\widetilde{u}_i), \end{aligned}$$(5.3)$$\begin{aligned}{} & {} \lim _{k\rightarrow \infty }\lim _{n\rightarrow \infty } I_F\left( u_n-\sum _{i=0}^k\widetilde{u}_i \left( \cdot -y_n^i \right) \right) =0. \end{aligned}$$(5.4)
Proof
For a measurable set \(A\subset \mathbb {R}^2\) we use the notation \(\chi _A\) to denote the characteristic function of A. Let \(u_n=v_n+w(v_n)\in {{\mathcal {M}}}\), where \(v_n=p_{{{\mathcal {V}}}}(u_n)\in {{\mathcal {V}}}\). Passing to a subsequence \(\lim _{n\rightarrow \infty }\Vert v_n\Vert \), \(\lim _{n\rightarrow \infty }I(u_n)\) and \(\lim _{n\rightarrow \infty }I_F(u_n)\) exist and are finite.
Part 1. Profile decomposition for \((v_n)\). Take \(r>\sqrt{2}\). Since \((v_n)\subset H^1(\mathbb {R}^2)\) is bounded, we claim that, passing to a subsequence, there is \(K\in \mathbb {N}\cup \{\infty \}\) and there is a sequence \((\widetilde{v}_i)_{i=0}^K\subset H^1(\mathbb {R}^2)\), for \(0\le i <K+1\) there are a sequence \((y_n^i)\subset \mathbb {Z}^2\) and positive numbers \((c_i)_{i=1}^{K}\) such that \(y_n^0=0\) and for any \(0\le i<K+1\) one has
where
Indeed, passing to a subsequence we may assume that
The latter convergence follows from the fact that for any n, \(H^1(B(0, n))\) is compactly embedded into \(L^p(B(0, n))\) and we find sufficiently large \(k_n\) such that
The subsequence \((u_{k_n})\) is then relabelled by \((u_n)\).
Take \(\phi _n^0:=v_n-\widetilde{v}_0\) and if
then we finish the proof of our claim with \(K=0\). Otherwise, passing to a subsequence, we find \((y_n^1)\subset \mathbb {Z}^2\) and a constant \(c_1>0\) such that
Note that \((y_n^1)\) is unbounded and we may assume that \(|y_n^1|\ge n-r\). Since \((v_n(\cdot +y_n^1))\) is bounded in \(H^1(\mathbb {R}^2)\), we find \(\widetilde{v}_1\in H^1(\mathbb {R}^2)\) such that up to a subsequence \(v_n(\cdot +y_n^1)\rightharpoonup \widetilde{v}_1\). In view of (5.9), we get \(\widetilde{v}_1\ne 0\), and again we may assume that \(v_n(\cdot +y_n^1)\chi _{B(0,n)}\rightarrow \widetilde{v}_1\) in \(L^{2}(\mathbb {R}^2)^6\cap L^p(\mathbb {R}^2)^6\). We set \(\phi _n^1:=\phi _n^0-\widetilde{v}_1(\cdot -y_n^1)=v_n-\widetilde{v}_0-\widetilde{v}_1(\cdot -y_n^1)\) and observe that if
then we finish the proof of our claim with \(K=1\). Otherwise, passing to a subsequence, we find \((y_n^2)\subset \mathbb {Z}^2\) and a constant \(c_2>0\) such that
and \(|y_n^2|\ge n-r\). Moreover \(|y_n^2-y_n^1|\ge n-2r\). Otherwise \(B(y_n^2,r)\subset B(y_n^1,n)\) and the convergence \(\widetilde{v}_0\chi _{B(y_n^2,r)}\rightarrow 0\) and \(v_n(\cdot +y_n^1)\chi _{B(0,n)}\rightarrow \widetilde{v}_1\) in \(L^2(\mathbb {R}^2)^6\) contradict (5.10). Then we find \(\widetilde{v}_2\ne 0\) such that passing to a subsequence
Again, if
where \(v_n^2:=\phi _n^1-\widetilde{v}_2(\cdot -y_n^2)\), then we finish proof with \(K=2\). Continuing the above procedure we finally find \(K\in \mathbb {N}\cup \{\infty \}\) such that for \(0\le i<K+1\), (5.5)–(5.8) hold. In view of the Brezis-Lieb Lemma [9],
for \(i\ge 0\). If there is \(i\ge 0\) such that
then \(K=i\) and setting \(\widetilde{v}_k=0\) for \(k>i\) we get by Lions’ Lemma [33, Lemma 1.21]
Since \(\phi _n^k=\phi _n^i\) for \(k\ge i\), we find in particular
In the case \(K=\infty \) we want to show that (5.11) is still satisfied. We argue as in [17, Proof of Theorem 1.4]. Suppose, for a contradiction, that \(\limsup _{k\rightarrow \infty }\limsup _{n\rightarrow \infty } |\phi _{n}^k|_{p}>0\). Then we find \(\delta >0\) and increasing sequences \((i_k), (n_k)\subset \mathbb {N}\) such that
and
Note that by (5.8) we have
for any \(0\le i<k\). Taking into account (5.7) and letting \(n\rightarrow \infty \) we get \(c_{k+1}\le 4 c_{i+1}\). Take \(k\ge 1\) and \(n>4r\). Again by (5.8) and (5.7) we obtain
Observe that by (5.7) and since \(n>4r\) we have
and
as \(n\rightarrow \infty \). Hence
Therefore we obtain from (5.12)
and in view of [33, Lemma 1.21] we obtain that \(\phi _{n_k} ^{i_k}\rightarrow 0\) in \(L^p(\mathbb {R}^2)^6\) as \(k\rightarrow \infty \), which is a contradiction. Therefore passing to a subsequence (5.11) holds.
Part 2. Profile decomposition for \((u_n)\). Note that \((w(v_n))_{n\in \mathbb {N}}\) and \((w_0(v_n))_{n\in \mathbb {N}}\) are bounded and we may assume
for some \({\tilde{w}}_i\in {{\mathcal {W}}}\), \(i\ge 0\). Let us define \(\widetilde{u}_0:=\widetilde{v}_0+w(\widetilde{v}_0)\) and \(\widetilde{u}_i:=\widetilde{v}_i+w_0(\widetilde{v}_i)\) if \(i\ge 1\).
Part 2. Claim 1. There holds
Indeed, observe that for \(i\ge 1\)
so that the weak lower semicontinuity of \(I_0\) implies
for any \(k\ge 0\). Since \((u_n)\) and hence \((I_0(u_n))\) is bounded, (5.15) holds.
Part 2. Claim 2. Up to a subsequence, there holds
Observe that for given \(k\ge 0\)
Concerning the first sum on the right hand side of (5.17) note that, for given \(0\le j\le k\)
as \(n\rightarrow \infty \), since
for \(i\ne j\). The convergence in (5.18) also holds for \(p=2\). Taking into account the uniform continuity of \(I_F\) on bounded sets we obtain
as \(n\rightarrow \infty \). Finally, concerning the second sum on the right hand side of (5.17) observe that for any \(i\ge 0\)
as \(n\rightarrow \infty \). The same convergence also holds for \(p=2\). Taking the \(\limsup _{n\rightarrow \infty }\) and the \(\sup _{k\ge 0}\) in (5.17) we conclude the proof of (5.16).
Part 2. Claim 3. For any \(k\ge 0\)
as \(n\rightarrow \infty \). Indeed, observe that
Note that for any \(i\ge 1\) and \(R>0\)
as \(n\rightarrow \infty \). Letting \(R\rightarrow \infty \) we obtain \(\int _{\mathbb {R}^2}|\langle w(\widetilde{v}_0), w_0(\widetilde{v}_i)(\cdot -y_n^i)\rangle |\,dx=o(1)\) and we conclude the claim, since \(w(v_n)(\cdot +y_n^i)\chi _{B(0,\frac{n-2r}{2})}\rightharpoonup \widetilde{w}_i\) in \(L^2(\mathbb {R}^2)^6\).
Part 2. Claim 4. For any \(k\ge 0\)
Note that if \(i\ge j\) and \(i\ge 1\), then for any \(R>0\) we have
as \(n\rightarrow \infty \). Letting \(R\rightarrow \infty \) we conclude the claim.
Part 2. Claim 5. Up to a subsequence we have
as \(n\rightarrow \infty \). Let us denote \(\nu _n^k:=v_n+w(\widetilde{v}_0)+\sum _{i=1}^kw_0(\widetilde{v}_i)(\cdot -y_n^i)\). Since the map
is uniformly continuous on bounded sets, then in view of (5.11) for every \(\varepsilon >0\) there is \(k_0\ge 0\) such that for any \(k\ge k_0\) there is \(n_0=n_0(k)\) such that for \(n\ge n_0\) one obtains
Moreover
and
for any \(k\ge 0\). Then
where the last inequality follows from (5.19).
On the other hand, taking into account (5.23) and then (5.20), (5.16) we get
hence
Therefore we get equalities in the above considerations, in particular \(\widetilde{u}_i=\widetilde{v}_i+\widetilde{w}_i\) for \(i\ge 0\) and in (5.19) equality holds for \(k=\infty \), so that \(|w(v_n)(\cdot +y_n^i)\chi _{B(0,\frac{n-2r}{2})}|_2\rightarrow |\widetilde{w}_i|_2\) passing to a subsequence, and then \(w(v_n)(\cdot +y_n^i)\chi _{B(0,\frac{n-2r}{2})}\rightarrow \widetilde{w}_i\) in \(L^2(\mathbb {R}^2)^6\) for \(i\ge 0\). Moreover from equality in (5.24) we get
for \(i\ge 1\). Taking into account (5.5) and arguing as in the proof of (I3) from Lemma 4.1, passing to a subsequence, we obtain
Therefore (5.21) and (5.22) are satisfied.
Part 3. Proof of (d). Now we complete the proof. Note that we have already proved (a)–(c). Since \(v_n(\cdot +y_n^i)\chi _{B(0,\frac{n-2r}{2})}\rightharpoonup \widetilde{v}_i\) and arguing similarly as in (5.24) we obtain (5.1). Taking into account (5.25) we find (5.2) and using the fact that the equality holds in (5.24) for \(k=\infty \) we conclude (5.3).
In order to show (5.4), note that \(u_n(\cdot +y_n^i)\rightarrow \widetilde{u}_i\) for a.e. in \(\mathbb {R}^2\) and for every \(0\le i<K+1\). Starting with \(i=0\) and arguing similarly as in proof of Lemma 4.1 we get
Since we know from the beginning of the proof that \(\lim _{n\rightarrow \infty } I_F(u_n)\) exists and is finite, we therefore can rearrange and find
Next we define for \(0\le j < K\)
and observe that \(r_n^j(\cdot +y_n^{j+1})\rightarrow {\tilde{u}}_{j+1}\) and \(r_n^j(\cdot +y_n^{j+1})-r_n^{j+1}(\cdot +y_n^{j+1})={\tilde{u}}_{j+1}\). Starting with \(j=0\) we obtain again, similarly as in proof of Lemma 4.1, that
and since \(r_n^0=u_n-{\tilde{u}}_0\) and by the shift-invariance of \(I_F\) this implies
Now, for any \(0\le j<K\)
and hence
Collecting (5.26), (5.27) and (5.28) for \(0\le j<K\) we obtain
Then by (5.3)
and we finally obtain (5.4). \(\square \)
Corollary 5.2
The map \(J'\) is weak-to-weak\(^*\) continuous on \({{\mathcal {M}}}\), i.e., if \((u_n)\subset {{\mathcal {M}}}\) and \(u_n\rightharpoonup u\) for some \(u\in X\), then \(J'(u_n)(\varphi )\rightarrow J'(u)(\varphi )\) for any \(\varphi \in X\). Similarly, \(J_0'\) is weak-to-weak\(^*\) continuous on \({{\mathcal {M}}}_0\) where \(J_0(u)=\frac{1}{2}\Vert p_{{{\mathcal {V}}}}(u)\Vert ^2-I_0(u)\) .
Proof
Since \((u_n)\) is bounded, in view of Theorem 5.1(c) for \(i=0\), up to a subsequence \(u_n(x)\rightarrow u(x)\) for a.e. \(x\in \mathbb {R}^2\). By Vitaly’s convergence theorem we infer that \(J'(u_n)(\varphi )\rightarrow J'(u)(\varphi )\) and \(J'_0(u_n)(\varphi )\rightarrow J_0'(u)(\varphi )\) for any \(\varphi \in X\).
\(\square \)
6 Proof of Theorem 1.1 and Theorem 1.2
Lemma 6.1
Suppose that (F5) holds. If \(u\in X\), \(\psi \in \mathcal {W}\) and \(t\ge 0\), then
In particular (I8) holds.
Proof
We define a map \(\varphi :[0,+\infty )\times \mathbb {R}^2\rightarrow \mathbb {R}\) such that
If \(u(x)\ne 0\), then (F5) implies that \(\varphi (0,x)\le 0\) and if \(u(x)=0\) then also \(\varphi (0,x)=0\). Next, (F4) implies that \(\varphi (t,x)\rightarrow -\infty \) as \(t\rightarrow \infty \). Then, for every fixed \(x\in \mathbb {R}^2\), there is a global maximum point \(t_0=t_0(x)\ge 0\) of \(\varphi (x,\cdot )\). Thus \(\varphi '(t_0,x)=0\) and by (F5) we obtain that \(\varphi (t_0,x)\le 0\); see similar arguments in proof of [15][Proposition 4.1]. Therefore
and we conclude. \(\square \)
Remark 6.2
Observe that the inequality \(\varphi (1,x)\le 0\) implies the convexity of F in u. Therefore this assumption is not explicitly stated in Theorem 1.1.
Lemma 6.3
Suppose that (F6) holds. For any \(\varepsilon >0\) there is a constant \(c_\varepsilon >0\) such that if \(u=v+w\in {{\mathcal {V}}}\oplus {{\mathcal {W}}}\), then
for any \(0\le t\le \sqrt{1-\frac{2}{\gamma }}\).
Proof
Observe that by (F6)
Then we obtain
where the last inequality follows from (4.3) and we conclude. \(\square \)
Lemma 6.4
If (F5), or (F4’) and (F6) hold, then \((M)_0^\beta \) is satisfied for every \(\beta >0\).
Proof
(a) Take \(\beta >0\) and suppose that \(u_n=v_n+w_n\in {{\mathcal {M}}}\) is such that
as \(n\rightarrow \infty \). Observe that by (4.2)
and if we take \(\varepsilon =\frac{1}{4}{{\,\mathrm{ess\,inf}\,}}V\) then
Let \(s_n:=\Big (\frac{1}{2}\Vert v_n\Vert ^2-\frac{1}{4}{{\,\mathrm{ess\,inf}\,}}V|v_n|_2^2+\frac{1}{4}{{\,\mathrm{ess\,inf}\,}}V|w_n|_2^2+\tilde{c}_\varepsilon |u_n|_p^p\Big )^{1/2}\) and suppose by a contradiction that \((u_n)\) is unbounded, that is, passing to a subsequence \(s_n\rightarrow \infty \). Let \(\widetilde{v}_n:=v_n/s_n\) and we may assume that \(\widetilde{v}_n\rightharpoonup \widetilde{v}\) in X for some \(\widetilde{v}\in {{\mathcal {V}}}\), and \(\widetilde{v}_n(x)\rightarrow \widetilde{v}(x)\) a.e. in \(\mathbb {R}^2\).
Let us show that \(\inf _{n\in \mathbb {N}}|\widetilde{v}_n|_p>0\) by a contradiction argument both in the case where (F5) and (F4’), (F6) holds. Therefore assume (passing to a subsequence) that \(\widetilde{v}_n\rightarrow 0\) in \(L^p(\mathbb {R}^2)^6\). By (4.3) we obtain
First assume that (F5) is satisfied. Take \(\gamma \in (0,1)\) such that
and \(s=2\sqrt{\beta /\gamma }\). In view of Lemma 6.1, the fact that \(J'(u_n)(1+\Vert v_n\Vert )\rightarrow 0\) and (6.3) we obtain the following estimate
Using \(\Vert \widetilde{v}_n\Vert ^2-\frac{1}{2}{{\,\mathrm{ess\,inf}\,}}V |\widetilde{v}_n|_2^2\ge 1\) by (6.2) and (6.4) we obtain the contradiction
Next, assume that (F4’) and (F6) are satisfied. Then
and \((u_n)\) is bounded in \(L^p(\mathbb {R}^3)^6\). Hence \((w_n)\) is bounded in \(L^p(\mathbb {R}^3)^6\). Note that \(s_n\ge \delta |u_n|_2\) for some constant \(\delta >0\). Hence also \(s_n\ge \delta |w_n|_2\) and taking \(\gamma \in (0,1)\) as in (6.4), \(s=2\sqrt{\beta /\gamma }\) and \(\varepsilon =\frac{1}{8}\gamma \delta ^2\), in view of Lemma 6.3, the fact that \(J'(u_n)(1+\Vert v_n\Vert \rightarrow 0\) and (6.3) we obtain the following estimate
Using as before \(\Vert {\tilde{v}}_n\Vert -\frac{1}{2}{{\,\mathrm{ess\,inf}\,}}V |{\tilde{v}}_n|_2\ge 1\), the definition (6.4) of \(\gamma \) and the definition of \(\epsilon \) we obtain the contradiction
In both cases we have led the assumption that \(\widetilde{v}_n\rightarrow 0\) in \(L^p(\mathbb {R}^2)^6\) to a contradiction. Hence, we continue by having \(\inf _{n\in \mathbb {N}}|\widetilde{v}_n|_p>0\). Let \(\mathrm{cl\,}_{L^p}{{\mathcal {V}}}\) denote the closure of \({{\mathcal {V}}}\) in \(L^p(\mathbb {R}^2)^6\). Observe that \(\mathrm{cl\,}_{L^p}{{\mathcal {V}}}\cap {{\mathcal {W}}}=\{0\}\). Indeed let \(w\in \mathrm{cl\,}_{L^p}{{\mathcal {V}}}\cap {{\mathcal {W}}}\), \((\varphi _n)\subset {{\mathcal {V}}}\) and \(\varphi _n\rightarrow w\) in \(L^p(\mathbb {R}^2)^6\), Then
for any \(\alpha ,\widetilde{\alpha }\in {{\mathcal {C}}}_0^{\infty }(\mathbb {R}^2)\). By the density argument we infer that \(w=0\). Hence using the continuity of the projection \(\mathrm{cl\,}_{L^p}{{\mathcal {V}}}\oplus {{\mathcal {W}}}\rightarrow \mathrm{cl\,}_{L^p}{{\mathcal {V}}}\) there is a constant \(c'>0\) such that
Then by (4.2) we get
Therefore taking \(\varepsilon =\frac{1}{2}{{\,\mathrm{ess\,inf}\,}}V\) we see that
Thus we get a contradiction, and therefore \((u_n)\) must be bounded so that part (a) of \((M)_0^\beta \) holds.
Now let us show part (b) of \((M)_0^\beta \). This part only applies in the case where \(V(x)\equiv V_0\). So let us take another sequence \(u_n'=v_n'+w_n'\in {{\mathcal {M}}}\) is such that
as \(n\rightarrow \infty \). Suppose in addition that the number of critical orbits in \(J_0^\beta \) is finite. Let
and since \(G=\mathbb {Z}^2\) is discrete (i.e., condition (G) holds), \(m>0\). Now we split the proof into two cases.
Case 1 \(|v_n-v_n'|_p\rightarrow 0\). Then
Since \((u_n), (u_n')\) are bounded Palais-Smale sequences the first term on the right hand side converges to 0 and by (4.3) the last integral on the right hand side tends to 0. Hence, using \({{\,\mathrm{ess\,sup}\,}}V<k^2\) we see that \(\Vert v_n-v_n'\Vert ^2\rightarrow 0\) as \(n\rightarrow \infty \) and the proof is finished.
Case 2 \(\limsup |v_n-v_n'|_p>0\). Then by Lion’s Lemma [33, Lemma 1.21], up to a \(\mathbb {Z}^2\) translation, \(v_n\rightharpoonup v\) and \(v_n'\rightharpoonup v'\) for some \(v,v'\in {{\mathcal {V}}}\) such that \(v\ne v'\). We may assume that \(w_n\rightharpoonup w\) and \(w_n'\rightharpoonup w'\) for some \(w,w'\in {{\mathcal {W}}}\). In view of Corollary 5.2, \(J'(v+w)=J'(v'+w')=0\) and thus \(\Vert v-v'\Vert \ge m\). In other words: if \(\Vert v-v'\Vert < m\) then we are in the previous Case 1 and the proof is completed. \(\square \)
Proof of Theorem 1.1
Suppose that \((u_n)\subset {{\mathcal {M}}}\) sequence obtained in Theorem 3.1. By Lemma 6.4 it is a bounded Palais-Smale sequence for the functional J. Passing to a subsequence, we find \(\widetilde{u}_0\in {{\mathcal {M}}}\) and sequences \((\widetilde{u}_i)_{i\ge 0}\subset {{\mathcal {M}}}_0\), \((y_n^i)_{n\ge i}\subset \mathbb {Z}^2\) such that the statements of Theorem 5.1 hold. Observe that by Corollary 5.2, \(\widetilde{u}_0\) is a critical point of J. Similarly we show that \(\widetilde{u}_i\) is a critical point of \(J_0\) for \(i\ge 1\). Indeed, take any \(\varphi \in X\) and since \(|y_n^i|\rightarrow \infty \) and \(V-V_0\in L^{\frac{p}{p-2}}(\mathbb {R}^2)\) we obtain \(|V(\cdot +y_n^i)-V_0|_{\frac{p}{p-2}}\rightarrow 0\) as \(n\rightarrow \infty \) and
Since \(u_n(x+y_n^i)\rightarrow \widetilde{u}_i(x)\) for a.e. \(x\in \mathbb {R}^2\) by Vitaly’s convergence theorem we infer that \( J_0'(\widetilde{u}_i)(\varphi )=0\). We may assume that \(\lim _{n\rightarrow \infty }I_F(u_n)\) exists and is positive, since otherwise (for a subsequence) \(u_n\rightarrow 0\) in \(L^p(\mathbb {R}^2)^6\) and
we get a contradiction. Therefore, having \(\lim _{n\rightarrow \infty }I_F(u_n)>0\), we know that \(\widetilde{u}_0\ne 0\) or \(\widetilde{u}_1\ne 0\).
We first treat the case where \(V=V_0\), hence \(J=J_0\). Due to shift-invariance in this case, we may assume w.l.o.g. that \(\widetilde{u}_0\ne 0\). Clearly, by Fatou’s lemma
thus \(J_0(\widetilde{u}_0)=\inf _{{{\mathcal {N}}}_0}J_0\) and \(\widetilde{u}_0\in {{\mathcal {N}}}_0\) is a ground state solution. If, in addition, f is odd in u, then in view of Lemma 6.4 and Theorem 3.2 there is an infinite sequence of \(\mathbb {Z}^2\)-distinct solutions.
Next we consider the case that \(V(x)>V_0\) for a.e. \(x\in \mathbb {R}^2\) and (F4’) is satisfied. Again we apply Theorem 5.1. Let \(v_n:=p_{{{\mathcal {V}}}}(u_n)\) and \(\widetilde{v}_i:=p_{{{\mathcal {V}}}}(\widetilde{u}_i)\) for \(n\ge 1\) and \(0\le i<K+1\). In view of (5.1) from Theorem 5.1, for any \(0\le k<K+1\) we get
hence \(K<\infty \). Observe that
since \(v_n(\cdot +y_n^j)\rightharpoonup \widetilde{v}_j\), and for \(i\ne j\), \(b_{L}(\widetilde{v}_j,\widetilde{v}_i(\cdot -(y_n^i-y_n^j)))\rightarrow 0\) as \(n\rightarrow \infty \). Since (F4’) holds, then by (5.4) we get \(u_n-\sum _{i=0}^K\widetilde{u}_i(\cdot -y_n^i)\rightarrow 0\) in \(L^p(\mathbb {R}^2)^6\), hence \(v_n-\sum _{i=0}^K\widetilde{v}_i(\cdot -y_n^i)\rightarrow 0\) in \(L^p(\mathbb {R}^2)^6\). Then
and we get
and since \((u_n)\) is a bounded Palais-Smale sequence we obtain
Arguing similarly as above we get
and
Hence, using the divergence property \(|y_n^i-y_n^j|\rightarrow \infty \) as \(n\rightarrow \infty \) for \(i\not = j\), we get
and in view of (5.3) we finally get
Let us denote by \(\xi \) the ground state solutions of \(J_0\) obtained above. Observe that \(J(t\xi +w(t\xi ))\rightarrow -\infty \) as \(t\rightarrow \infty \), \(J(0+w(0))=0\), hence we find \(t>0\) such that
where c is given by (3.3). Then, in view of Lemma 6.1 and Theorem 3.1
Therefore \(K=0\) and \(\widetilde{u}_0\ne 0\) is a critical point of J. Observe that \(\widetilde{u}_0\in {{\mathcal {N}}}\) and (using once more Fatou’s Lemma)
so that \(J(\widetilde{u}_0)=\inf _{{{\mathcal {N}}}}J\) and \(\widetilde{u}_0\) is a ground state solution. \(\square \)
Proof of Theorem 1.2
Here \(V=V_0\) and we argue as in proof of Theorem 1.1 and we use the notation introduced therein. Suppose that \((u_n)\subset {{\mathcal {M}}}_0\) is the bounded Palais-Smale sequence obtained in Sect. 4. Passing to a subsequence, we find sequences \((\widetilde{u}_i)_{i\ge 0}\subset {{\mathcal {M}}}_0\), \((y_n^i)_{n\ge i}\subset \mathbb {Z}^2\) such that all the statements of Theorem 5.1 hold. Observe that by Corollary 5.2, \(\widetilde{u}_0\) is a critical point of J and \(\widetilde{u}_i\) is a critical point of \(J_0\). Since \(\lim _{n\rightarrow \infty }I_F(u_n)>0\), \(\widetilde{u}_0\ne 0\) or \(\widetilde{u}_1\ne 0\). Note that
and we already know that \({{\mathcal {K}}}_0\ne \emptyset \) since \({\tilde{u}}_0\not =0\) or \({\tilde{u}}_1\not =0\). It is easy to show that \(c_0:=\inf _{{{\mathcal {K}}}_0}J_0>0\) and, since \({{\mathcal {K}}}\) is nonempty, we may take any minimizing sequence \((\xi _n)\subset {{\mathcal {K}}}_0\) such that \(J_0(\xi _n)\rightarrow c_0\). Similarly as above applying the profile decomposition from Theorem 5.1 to \((\xi _n)\), passing to a subsequence we find \((z_n)\subset \mathbb {Z}^2\) such that \(\xi _n(\cdot +z_n)\rightharpoonup \xi \in {{\mathcal {K}}}_0\) and \(\xi _n(\cdot +z_n)\rightarrow \xi \) a.e. on \(\mathbb {R}^2\). Due to Fatou’s Lemma we have
\(J_0(\xi )=c_0\) and we conclude. If, in addition, f is odd in u, then in view of Lemma 6.4 and Theorem 3.2 there is an infinite sequence of \(\mathbb {Z}^2\)-distinct solutions. \(\square \)
Notes
If \(K=\infty \), then \(K+1=\infty \) as well.
References
Akhmediev, N.N., Ankiewicz, A., Soto-Crespo, J.M.: Does the nonlinear Schrödinger equation correctly describe beam propagation? Opt. Lett. 18, 411 (1993)
Ambrosetti, A., Rabinowitz, P.H.: Dual variational methods in critical point theory and applications. J. Funct. Anal. 14, 349–381 (1973)
Bartolo, P., Benci, V., Fortunato, D.: Abstract critical point theorems and applications to some nonlinear problems with “strong’’ resonance at infinity. Nonlinear Anal. 7, 981–1012 (1983)
Bartsch, T., Mederski, J.: Ground and bound state solutions of semilinear time-harmonic Maxwell equations in a bounded domain. Arch. Ration. Mech. Anal. 215(1), 283–306 (2015)
Bartsch, T., Mederski, J.: Nonlinear time-harmonic Maxwell equations in an anisotropic bounded medium. J. Funct. Anal. 272(10), 4304–4333 (2017)
Bartsch, T., Mederski, J.: Nonlinear time-harmonic Maxwell equations in domains. J. Fixed Point Theory Appl. 19(1), 959–986 (2017)
Bartsch, T., Dohnal, T., Plum, M., Reichel, W.: Ground states of a nonlinear curl-curl problem in cylindrically symmetric media. NoDEA Nonlinear Diff. Equ. Appl. 23(5), 34 (2016)
Benci, V., Rabinowitz, P.H.: Critical point theorems for indefinite functionals. Invent. Math. 52(3), 241–273 (1979)
Brézis, H., Lieb, E.: A relation between pointwise convergence of functions and convergence of functionals. Proc. Am. Math. Soc. 88(3), 486–490 (1983)
Cerami, G.: An existence criterion for the critical points on unbounded manifolds. Ist. Lomb. Accad. Sci. Lett. Rend. Sci. Mat. Fis. Chim. Geol. A 112(2), 332–336 (1978)
Ciattoni, A., Crossignani, B., Di Porto, P., Yariv, A.: Perfect optical solitons: spatial Kerr solitons as exact solutions of Maxwell’s equations. J. Opt. Soc. Am. B 22, 1384–94 (2005)
Gérard, P.: Description du défaut de compacité de l’injection de Sobolev. ESAIM Control Optim. Calc. Var. 3, 213–233 (1998)
Hmidi, T., Keraani, S.: Remarks on the blow-up for the \(L^2\)-critical nonlinear Schrödinger equations. SIAM J. Math. Anal. 38(4), 1035–1047 (2006)
McLeod, J.B., Stuart, C.A., Troy, W.C.: An exact reduction of Maxwell’s equations. In: Nonlinear Diffusion Equations and Their Equilibrium States, vol. 3, pp. 391–405 (1992)
Mederski, J.: Ground states of time-harmonic semilinear Maxwell equations in \({\mathbb{R} }^3\) with vanishing permittivity. Arch. Ration. Mech. Anal. 218(2), 825–861 (2015)
Mederski, J.: Ground states of a system of nonlinear Schrödinger equations with periodic potentials. Comm. Partial Differ. Equ. 41(9), 1426–1440 (2016)
Mederski, J.: Nonradial solutions of nonlinear scalar field equations. Nonlinearity 33, 6349–6380 (2020)
Mederski, J., Schino, J., Szulkin, A.: Multiple solutions to a nonlinear curl-curl problem in \(\mathbb{R} ^3\). Arch. Ration. Mech. Anal. 236, 253–288 (2020)
Nawa, H.: Mass concentration phenomenon for the nonlinear Schrödinger equation with the critical power nonlinearity. II Kodai Math. J. 13(3), 333–348 (1990)
Nehari, Z.: Characteristic values associated with a class of non-linear second-order differential equations. Acta Math. 105, 141–175 (1961)
Pankov, A.: Periodic nonlinear Schrödinger equation with application to photonic crystals. Milan J. Math. 73, 259–287 (2005)
Reed, M., Simon, B.: Methods of Modern Mathematical Physics. I. Functional Analysis, 2nd edn. Academic Press, New York (1980)
Saleh, B.E.A., Teich, M.C.: Fundamentals of Photonics, 2nd edn. Wiley, New Jersey (2007)
Stein, E.: Singular Integrals and Differentiability Properties of Functions. Princeton Mathematical Series, No. 30, Princeton University Press, Princeton, N.J. (1970)
Stuart, C.A.: Self-trapping of an electromagnetic field and bifurcation from the essential spectrum. Arch. Ration. Mech. Anal. 113(1), 65–96 (1991)
Stuart, C.A.: Guidance properties of nonlinear planar waveguides. Arch. Ration. Mech. Anal. 125(1), 145–200 (1993)
Stuart, C.A.: Modelling axi-symmetric travelling waves in a dielectric with nonlinear refractive index. Milan J. Math. 72, 107–128 (2004)
Stuart, C.A., Zhou, H.S.: A variational problem related to self-trapping of an electromagnetic field. Math. Methods Appl. Sci. 19(17), 1397–1407 (1996)
Stuart, C.A., Zhou, H.S.: Existence of guided cylindrical TM-modes in a homogeneous self-focusing dielectric. Ann. Inst. H. Poincaré Anal. Non Linéaire 18(1), 69–96 (2001)
Stuart, C.A., Zhou, H.S.: A constrained minimization problem and its application to guided cylindrical TM-modes in an anisotropic self-focusing dielectric. Calc. Var. Partial Differ. Equ. 16(4), 335–373 (2003)
Stuart, C.A., Zhou, H.S.: Axisymmetric TE-modes in a self-focusing dielectric. SIAM J. Math. Anal. 37(1), 218–237 (2005)
Stuart, C.A., Zhou, H.S.: Existence of guided cylindrical TM-modes in an inhomogeneous self-focusing dielectric. Math. Models Methods Appl. Sci. 20(9), 1681–1719 (2010)
Willem, M.: Minimax Theorems. Birkhäuser Verlag, Basel (1996)
Acknowledgements
Funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) – Project-ID 258734477 – SFB 1173. J. Mederski was also partially supported by the Alexander von Humboldt Foundation (Germany) during the stay at Karlsruhe Institute of Technology and by the National Science Centre, Poland (Grant No. 2020/37/B/ST1/02742). He would like to express his deep gratitude to these institutions for their support and warm hospitality.
Author information
Authors and Affiliations
Contributions
Both authors wrote and reviewed the manuscript.
Corresponding author
Ethics declarations
Conflicts of interest
The authors declare that they have no conflict of interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Mederski, J., Reichel, W. Travelling waves for Maxwell’s equations in nonlinear and nonsymmetric media. Nonlinear Differ. Equ. Appl. 30, 22 (2023). https://doi.org/10.1007/s00030-022-00824-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00030-022-00824-w