Abstract
Anisotropic generalized Orlicz spaces have been investigated in many recent papers, but the basic assumptions are not as well understood as in the isotropic case. We study the greatest convex minorant of anisotropic \(\Phi \)-functions and prove the equivalence of two widely used conditions in the theory of generalized Orlicz spaces, usually called (A1) and (M). This provides a more natural and easily verifiable condition for use in the theory of anisotropic generalized Orlicz spaces for results such as Jensen’s inequality which we obtain as a corollary.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
This paper deals with generalized Orlicz spaces, also known as Musielak–Orlicz spaces. This is a very active field recently [2, 5, 6, 14, 16, 19, 29, 31, 32], boosted by work on the double phase problem by Baroni, Colombo and Mingione, e.g. [4, 13]. The generalized Orlicz case unifies the study of the double phase problem and the variable exponent growth, widely researched over the last 20 years [15]. Many studies deal with isotropic energies of the type
but recently also the anisotropic case
has been considered e.g. in [1, 7,8,9, 24, 27]. As an example of an anisotropic energy with non-standard growth we could take a double-phase functional where the q-phase is directional:
here only variation in the \(x_1\)-direction makes a contribution to the energy in the q-phase \(\{a>0\}\).
Counter examples (see, e.g., [3, 15]) show that more advanced results such as the boundedness of averaging operators or density of smooth functions require connecting \(\Phi (x, \xi )\) for different values of x. To this end, we developed in the isotropic case the (A1) condition [18, 22] (see also [28]), which is essentially optimal for the boundedness of the maximal operator. In the anisotropic case Chlebicka, Gwiazda, Zatorska-Goldstein and co-authors [1, 7,8,9,10,11,12, 17] have developed a theory based on their (M) condition. To state and compare these conditions, let us define
In essence, the (A1) conditions says that \(\Phi _B^+\) can be bounded by \(\Phi _B^-\) in small balls \(B\subset \mathbb {R}^n\) in a quantitative way, whereas (M) say that it can be similarly bounded by the least convex minorant \((\Phi _B^-)^\mathrm{conv}\) of \(\Phi _B^-\) (see Definition 3.2). Obviously, the latter is a stronger condition, and it is also more difficult to verify, since the relationship between \((\Phi _B^-)^\mathrm{conv}\) and \(\Phi _B^-\) may be complicated in the anisotropic case.
In the isotropic case \(\Phi _B^- \lesssim (\Phi _B^-)^\mathrm{conv}\) so (M) and (A1) are equivalent. In the anisotropic case this inequality does not hold (see Example 4.1), but we are nevertheless able to prove the equivalence of the conditions by a more careful analysis.
Theorem 1.2
Let \(\Phi :\Omega \times \mathbb {R}^m\rightarrow [0,\infty ]\) be a strong \(\Phi \)-function. Then (A1) and (M) are equivalent.
This result and the techniques introduced in this paper will allow for the development of a theory of anisotropic generalized Orlicz spaces with more natural assumptions. As an example we prove the following Jensen-type inequality.
Corollary 1.3
(Jensen-type inequality) Let \(\Phi \) satisfy (A1) and \(f\in L^\Phi _\mu (\Omega ; \mathbb {R}^m)\). Then there exists \(\beta >0\) such that
when \(\varrho _\Phi (f)\leqslant 1\) and \(\mu (B)\leqslant 1\).
Although the extra assumption \(\varrho _\Phi (f)\leqslant 1\) in the corollary may seem strange, it follows naturally for instance when dealing with local regularity and it is known that the anisotropic Jensen-inequality does not hold without restrictions.
Let us next define precisely the concepts we are using and characterize functions \(\Phi \) for which the equivalence \(\Phi ^\mathrm{conv}\simeq \Phi \) holds (Corollary 2.4). In Sect. 3, we define the conditions (A1) and (M) and give preliminary remarks regarding the definitions. Finally, in Sect. 4 we prove the main results mentioned above.
2 Almost Convexity and the Greatest Convex Minorant
I refer to the monographs [9, 18] for background on isotropic and anisotropic generalized Orlicz spaces, respectively. We consider functions \(\Phi :\Omega \times \mathbb {R}^m\rightarrow [0,\infty ]\); the capital letter \(\Phi \) is used to highlight the distinction from the isotropic case \(L^\varphi \) in [18] where \(\varphi :\Omega \times [0,\infty )\rightarrow [0,\infty ]\). The idea is to define
for a vector field \(v\in L^1_\mu (\Omega ; \mathbb {R}^m)\). The space \(L^\Phi _\mu (\Omega ; \mathbb {R}^m)\) is defined by the requirement \(\Vert v\Vert _\Phi <\infty \). We use the equivalence relation \(\Phi \simeq \Psi \) which means that there exists \(\beta >0\) such that
Here and in the rest of the paper \(\beta \) denotes a parameter which is given by by one or more conditions; if the conditions hold with different \(\beta _k\), then we can use \(\beta :=\min _k\beta _k\) for all the conditions so that we may just as well use only the one common \(\beta \). Since the parameter \(\lambda \) is inside \(\Phi \) in the definition of \(\Vert v\Vert _\Phi \), this is the natural way to compare functions \(\Phi \) (cf. Example 2.2). To ensure that the integral in \(\varrho _\Phi \) makes sense and \(\Vert \cdot \Vert _\Phi \) is a norm we require some conditions.
Definition 2.1
Let \(\Omega \subset \mathbb {R}^n\) be an open set. We say that \(\Phi :\Omega \times \mathbb {R}^m\rightarrow [0,\infty ]\) is a strong \(\Phi \)-function, and write \(\Phi \in \Phi _{\mathrm{s}}(\Omega )\), if the following four conditions hold:
-
(1)
\(x \mapsto \Phi (x, \xi )\) is measurable for every \(\xi \in \mathbb {R}^m\).
-
(2)
\(\displaystyle \Phi (x, 0) = \lim _{\xi \rightarrow 0} \Phi (x,\xi ) =0\) and \(\displaystyle \lim _{\xi \rightarrow \infty }\Phi (x,\xi )=\infty \) for a.e. \(x\in \Omega \).
-
(3)
\(\xi \mapsto \Phi (x,\xi )\) is continuous in the topology of \([0,\infty ]\) for a.e. \(x\in \Omega \).
-
(4)
\(\Phi \) is convex for a.e. \(x\in \Omega \):
$$\begin{aligned} \Phi (x, \alpha \xi + \alpha '\xi ') \leqslant \alpha \Phi (x, \xi ) + \alpha ' \Phi (x, \xi '), \quad \alpha , \alpha '\geqslant 0,\ \alpha +\alpha '=1. \end{aligned}$$
With these conditions, \(\Vert \cdot \Vert _\Phi \) is a norm. Note that continuity in \(\xi \) follows from convexity if \(\Phi \) is real-valued and (3) is only needed to ensure that \(\Phi \) does not jump to \(\infty \). Note also that this class of strong \(\Phi \)-functions is broader than that studied [9] since we do not require that upper and lower bounds in terms of N-functions independent of x. For instance, this definition allows for \(L^1\)- and \(L^\infty \)-type growth. In [18] in the isotropic case we relaxed (3) and (4) further, and so used “strong” for this class, even though it is still less restrictive than N-functions.
For the study of \(\Phi \)-functions depending on the space-variable x, we use local approximations with the functions \(\Phi _B^+\) and \(\Phi _B^-\) from (1.1) [18, 22]. However, \(\Phi _B^-\) need not be convex even if each \(\Phi (x,\cdot )\) is (just think of \(\min \{t,t^2\}\)). In the isotropic case, \(\varphi _B^-\) nevertheless satisfies the following weaker variant of (4) above:
-
(W4)
\(\Phi \) is almost convex if there exists \(\beta >0\) such that
$$\begin{aligned} \Phi \big (x, \beta (\alpha \xi + \alpha '\xi ')\big ) \leqslant \alpha \Phi (x, \xi ) + \alpha ' \Phi (x, \xi '), \end{aligned}$$for a.e. \(x\in \Omega \) and \(\alpha , \alpha '\geqslant 0\) with \(\alpha +\alpha '=1\).
Unfortunately, even this does not hold for \(\Phi _B^-\) in the anisotropic case (see Example 4.1).
The constant \(\beta \) in the almost convexity condition (W4) should be inside the function since we do not assume doubling, or even finite, functions, as the following example illustrates. A constant outside is possible, but too restrictive.
Example 2.2
Let \(\varphi _\infty (t):=\infty \chi _{(1,\infty )}(t)\) be the function generating the space \(L^\infty (\Omega )\). Define \(\Phi (\xi ) := \varphi _\infty (\Vert \xi \Vert _{1/2} ) = \varphi _\infty (|\xi _1| + 2\sqrt{|\xi _1 \xi _2|}+|\xi _2|)\) in \(\mathbb {R}^2\). Consider \(\alpha =\frac{1}{2}\) and the basis vectors \(\xi =e_1\) and \(\xi '=e_2\) in (W4). Then \(\Vert \tfrac{e_1+e_2}{2}\Vert _{1/2}=2\) so \(\Phi (\tfrac{e_1+e_2}{2}) = \infty \) and the inequality
does not hold for any \(L<\infty \). However, the almost convexity inequality (W4)
holds for \(\beta \leqslant \frac{1}{2}\) since in this case \(\Phi \left( \beta \tfrac{e_1+e_2}{2}\right) = 0\).
If we choose \(\xi '=0\) in the almost convexity condition (W4), then we obtain (aInc)\(_1\):
In the special case \(\beta =1\), i.e. for a convex function, we have (Inc)\(_1\):
These inequalities mean that the function \(t\mapsto \frac{\Phi (x, t\xi )}{t}\) is almost increasing or increasing, hence the notation aInc\(_1\) and Inc\(_1\). In [18] we showed that these inequalities are useful substitutes for convexity in the isotropic case. In particular, it is easy to see that \(\Phi _B^+\) and \(\Phi _B^-\) satisfy aInc\(_1\) or Inc\(_1\) if \(\Phi \) does. For the anisotropic case the almost convexity is more appropriate since it also carries information about non-radial behavior.
Let us denote by \(\Phi ^\mathrm{conv}\) the greatest convex minorant of \(\Phi \). This function is often denoted by \(\Phi ^{**}\), since it can be obtained by applying the conjugation operation \(^*\) twice [9, Corollary 2.1.42], but we will not use this fact here. We next show a connection between the greatest convex minorant and the almost convexity condition. The following is a version of Carathéodory’s Theorem from convex analysis. Probably it is known, but a proof is included for completeness, since I could not find a reference.
Lemma 2.3
Let \(\Phi :\mathbb {R}^m\rightarrow [0,\infty ]\). Then
Proof
Consider the epigraph of \(\Phi \),
By Carathéodory’s Theorem (see, e.g., [30, Theorem 2.1.3]), every point in the convex hull of E can be represented as a convex combination of at most \(m+2\) points \(\xi _k\) from E. Furthermore, we observe that if any of the points \(\xi _k\) are from the interior of E, then the convex combination is also in the interior of the convex hull. Thus the points of the boundary, i.e. the graph of \(\Phi ^\mathrm{conv}\), are given as a convex combination of points in the boundary of E, i.e. on the graph of \(\Phi \). Hence
This is the claim, except with one extra point \(\xi _{m+2}\).
However, \(\xi \) lies in the convex hull of \(\xi _1,\ldots , \xi _{m+2}\in \mathbb {R}^m\). Thus by Carathéodory’s Theorem in \(\mathbb {R}^m\), \(\xi \) can be expressed as the convex combination of at most \(m+1\) of the points \(\xi _1\), ..., \(\xi _{m+2}\). By re-labeling if necessary, we obtain
Since \(\big (\xi _k, \Phi (\xi _k)\big )_{k=1}^{m+2}\), all lie on the same hyper-plane for a boundary point, we also have
\(\square \)
We can now show that \(\Phi \simeq \Phi ^\mathrm{conv}\) for almost convex functions.
Corollary 2.4
The function \(\Phi :\mathbb {R}^m\rightarrow [0, \infty ]\) is almost convex if and only if \(\Phi \simeq \Phi ^\mathrm{conv}\).
Proof
Clearly, \(\Phi ^\mathrm{conv}\leqslant \Phi \) since \(\Phi ^\mathrm{conv}\) is defined as a minorant of \(\Phi \). Let \(2^i\geqslant m+1\) and set \(\alpha _k:=0\) and \(\xi _k:=0\) for \(k>m+1\). By the almost convexity condition,
where
Iterating this i times, we obtain that
By Lemma 2.3 and this inequality,
Thus the almost convexity implies that \(\Phi \simeq \Phi ^\mathrm{conv}\).
If, on the other hand, \(\Phi \simeq \Phi ^\mathrm{conv}\) with constant \(\beta \), then we directly obtain
\(\square \)
For almost convex functions we easily obtain a Jensen inequality with an extra constant.
Corollary 2.5
(Jensen’s inequality) Let \(E\subset \mathbb {R}^m\) have positive, finite measure \(\mu (E)\). If \(\Phi \in C(E; [0, \infty ])\) is almost convex, then there exists \(\beta \) such that
for every \(f\in L^\Phi _\mu (E; \mathbb {R}^m)\).
Proof
By Corollary 2.4 and Jensen’s inequality for the convex function \(\Phi ^\mathrm{conv}\),
\(\square \)
3 Definition of and Remarks on Conditions
The aInc\(_1\) and almost convexity (W4) conditions connect \(\Phi (x, \xi )\) for different values of \(\xi \) with x fixed. However, more advanced results such as the density of smooth functions in Sobolev spaces require connecting \(\Phi (x, \xi )\) for different values of x, cf. [6]. This is the purpose of the conditions (A1-\(\Psi \)) and (M-\(\Psi \)), which generalize (A1) and (M).
However, let us first start with the more elementary condition (A0): there exists \(\beta >0\) such that
for all \(\xi \in \mathbb {R}^m\) with \(|\xi |=1\) and all \(x\in \Omega \). Note that (A0) is implicit in the assumption \(m_1(|\xi |)\leqslant \Phi (x,\xi )\leqslant m_2(|\xi |)\) for N-functions \(m_1\) and \(m_2\) used in [1, 7,8,9,10,11, 17]. This property is inherited by other versions of \(\Phi \):
Lemma 3.1
If \(\Phi \in \Phi _{\mathrm{s}}(\Omega )\) satisfies (A0), then so do \(\Phi _B^+\), \(\Phi _B^-\) and \((\Phi _B^-)^\mathrm{conv}\).
Proof
Taking the supremum or infimum over \(x\in \Omega \) in (A0) of \(\Phi \) gives (A0) for \(\Phi _B^+\) and \(\Phi _B^-\). Since \((\Phi _B^-)^\mathrm{conv}\leqslant \Phi _B^-\), the left inequality of (A0) follows for \((\Phi _B^-)^\mathrm{conv}\). If \(|\xi |\geqslant \tfrac{1}{\beta }\), then by Inc\(_1\) and (A0) we conclude that
Hence, for all \(\xi \in \mathbb {R}^m\), \(\Phi (x, \xi ) \geqslant (\beta \,|\xi |-1)_+\) (since \((\beta \,|\xi |-1)_+=0\) when \(|\xi |\leqslant \tfrac{1}{\beta }\)) and so \(\Phi _B^-(\xi ) \geqslant (\beta \,|\xi |-1)_+\). But the right-hand side is a convex function, so it follows that \((\Phi _B^-)^\mathrm{conv}(\xi ) \geqslant (\beta \,|\xi |-1)_+\) since \((\Phi _B^-)^\mathrm{conv}\) is defined as the greatest convex minorant. Consequently,
when \(\xi \in \mathbb {R}^m\) with \(|\xi |=1\), so \((\Phi _B^-)^\mathrm{conv}\) satisfies (A0) with constant \(\tfrac{\beta }{2}\). \(\square \)
The condition (A1) was introduced in [22] (see also [18, 28]) and is essentially optimal for the boundedness of the maximal operator in isotropic generalized Orlicz spaces. It also implies the Hölder continuity of solutions and (quasi)minimizers [5, 20, 21]. For higher regularity, we introduced in [23] a vanishing-(A1) condition along the same lines. These previous studies apply to the isotropic case, i.e. \(m=1\). In [24, 25] we generalized the (A1)-conditions to the anisotropic case, although only the quasi-isotropic case was considered in the main results.
Chlebicka, Gwiazda, Zatorska-Goldstein and co-authors [1, 7,8,9,10,11,12, 17] considered the assumption (M) in the anisotropic case; in the next definition their condition is reformulated to make it easier to compare with the (A1) condition (see also Lemma 3.4); also note that some of the earlier works included additional restrictions in the condition.
Definition 3.2
Let \(\Phi , \Psi \in \Phi _{\mathrm{s}}(\Omega )\). We say that \(\Phi \) satisfies (A1-\(\Psi \)) or (M-\(\Psi \)) if for any \(K>0\) there exists \(\beta >0\) such that
or
for all balls \(B\subset \mathbb {R}^n\) with \(\mu (B)\leqslant 1\) and \(\xi \in \mathbb {R}^n\).
When \(\Psi (t):=t^s\) and \(\Psi :=\Phi \) we use the abbreviations (A1-s), (A1), (M-s) and (M).
The role of \(\Psi \) is to calibrate the almost continuity requirement with the information on the function we are interested in and was developed from the initial condition (A1) over the course of several studies [5, 20, 21]. For instance, we showed in [5, Theorem 3.9] that the weak Harnack inequality holds for non-negative supersolutions of \(\mathrm{div}\left( \varphi '(|\nabla u|)\frac{\nabla u}{|\nabla u|}\right) =0\) if the isotropic \(\Phi \)-function \(\varphi \) satisfies (A1-\(\Psi \)) and the supersolution satisfies \(u\in W^{1,\psi }(\Omega )\), where \(\psi \in \Phi _{\mathrm{w}}(\Omega )\) is a potentially different function. Note that this involves a trade-off, since larger \(\psi \) means more restriction on u and less restriction on \(\varphi \).
As far as I know, Chlebicka, Gwiazda, Zatorska-Goldstein and co-authors considered (M) only in the case \(\Psi (t):=t\) and \(\Psi (t):=t^p\) (i.e. (M-1) and (M-p) in the notation above). However, the next example illustrates why this does not lead to optimal results.
Example 3.3
(Variable exponent double phase) Let \(\varphi (x, t):= t^{p(x)} + a(x) t^{q(x)}\) where \(a\in C^{0, \alpha }(\Omega )\), \(a\geqslant 0\) and \(1<p \leqslant q\). Now the (A1) or (M) conditions reduce to
Let \(p^-:= \inf _{x\in \Omega } p(x)\) and \(p^+:= \sup _{x\in \Omega } p(x)\). If we only use fixed exponent gauges such as (A1-\(p^-\)) or (M-\(p^-\)), then we instead end up with the condition
which is worse, and quite unnatural as the largest value of q is bounded by the smallest value of p.
As a final remark about the formulation, we note that earlier papers used a form without the “+1” and instead restricted the range of \(\Psi _B^-\). However, if (A0) holds, then these formulations are equivalent. We prove it for (M), the same applies to (A1).
Lemma 3.4
Let \(\Phi \in \Phi _{\mathrm{s}}(\Omega )\) satisfy (A0). Then (M) holds if and only if
Proof
If the condition of the lemma holds, then (M) needs only to be checked when \((\Phi _B^-)^\mathrm{conv}(\xi )\leqslant 1\). This inequality and (A0) imply that \(|\xi |\leqslant \frac{1}{\beta }\). Thus \(\Phi _B^+(\beta ^2 \xi )\leqslant 1\) by (A0), so (M) holds with constant \(\beta ^2\).
Assume conversely that (M) holds and \((\Phi _B^-)^\mathrm{conv}(\xi )\in \left[ 1, \tfrac{K}{\mu (B)}\right] \). Then it follows that
Then Inc\(_1\) implies that \(\Phi _B^+\left( \frac{\beta }{2} \xi \right) \leqslant \frac{1}{2}\Phi _B^+(\beta \xi )\leqslant (\Phi _B^-)^\mathrm{conv}(\xi )\), so the condition of the lemma holds with constant \(\frac{\beta }{2}\). \(\square \)
4 Equivalence of Conditions
In the previous section we introduced and motivated the conditions (A1) and (M) and their variants. We now move on to the main result, and consider their relation to one another.
Since \((\Phi _{B}^-)^\mathrm{conv}\leqslant \Phi _{B}^-\), (M-\(\Psi \)) implies (A1-\(\Psi \)). If \(\varphi \) is isotropic and satisfies aInc\(_1\), then I showed in [22] that \(\varphi _{B}^-(\beta t) \leqslant (\varphi _{B}^-)^\mathrm{conv}(t)\). Hence the two conditions are equivalent in this case. However, as pointed out in [9, Remarks 2.3.14 and 3.7.6], this approach is not possible in the anisotropic case. Since I did not understand the examples implicit in these remarks without consulting the authors, I include here an explicit example based on ideas of Piotr Nayar communicated to me by Iwona Chlebicka.
Example 4.1
Let \(m=2\) and \(\Phi _k(\xi _1e_1 + \xi _2e_2):= \xi _k^2\). Then both \(\Phi _1\) and \(\Phi _2\) are convex and \(\Phi _1(e_2)=\Phi _2(e_1)=0\). Denote \(\Phi :=\min \{\Phi _1,\Phi _2\}\). It follows that
where \(\alpha _1+\alpha _2=1\) and \(\alpha _1,\alpha _2\geqslant 0\). Thus we see that \(\Phi ^\mathrm{conv}\equiv 0\). Since \(\Phi (\beta (e_1+e_2)) = \Phi _1(\beta (e_1+e_2))=\beta ^2\) but \(\Phi ^\mathrm{conv}\equiv 0\), the relation \(\Phi \simeq \Phi ^\mathrm{conv}\) does not hold.
Even though \(\Phi _{B}^-(\beta \xi ) \leqslant (\Phi _{B}^-)^\mathrm{conv}(\xi )\) does not hold in general, we next construct an almost convex minorant which is comparable to \(\Phi _B^-\) when (A1) holds which can be used in (M). We prove that (A1) implies (M) in the main case \(\Psi :=\Phi \), which corresponds to the natural energy space \(L^\Phi \) or \(W^{1,\Phi }\). The implication for (A1-\(\Psi \)) and (M-\(\Psi \)) when \(\Psi \ne \Phi \) remains an open problem.
Let \(\Phi :\mathbb {R}^m\rightarrow [0,\infty ]\) be a strong \(\Phi \)-function independent of x. Denote \(K_s:= \{ \Phi \leqslant s\}\) and observe that it is a convex compact set which includes 0 in its interior. Define
Here \(\Vert \cdot \Vert _{K_s} \) is the Minkowski functional of the set \(K_s\), first studied by Kolmogorov [26]. The Luxemburg norm \(\Vert \cdot \Vert _\Phi \) defined previously is another example of a Minkowski functional. Note that \(N_s\) is a convex function with \(\{N_s \leqslant s\} = K_s\). Since \(\Phi \) is convex, \(\Phi (\lambda \xi ) \leqslant \lambda \Phi (\xi )\) for \(\lambda \leqslant 1\). Thus \(N_s \leqslant \Phi \) outside \(K_s\), \(N_s\geqslant \Phi \) in \(K_s\) and \(N_s=\Phi \) on the boundary \(\partial K_s\). In other words, we take the s-level set of \(\Phi \) and replace \(\Phi \) outside of it by the function \(N_s\) which grows linearly.
not be even almost convex. However, in the next proposition we show that \(\min \{\Phi , N_s\}\) is almost convex, since the two functions are somehow compatible. This will be used to construct a convex minorant of \(\Phi _B^-\). The proposition also demonstrates the utility of the almost convexity condition, as it seems much more difficult to choose \(N_s\) to make the minimum convex while still being a minorant of \(\Phi _B^-\).
Proposition 4.2
Let \(\Phi :\mathbb {R}^m\rightarrow [0,\infty ]\) be a strong \(\Phi \)-function. Then \(M_s := \min \{\Phi , N_s\}\) is almost convex.
Proof
Note that \(M_s=\Phi \chi _{K_s} + N_s \chi _{\mathbb {R}^m\setminus K_s}\) and let \(\alpha ,\alpha '>0\) with \(\alpha +\alpha '=1\). If \(\xi ,\xi ' \not \in K_s\), then the convexity of \(N_s\) implies that
If \(\xi ,\xi '\in K_s\), then the inequality follows from the convexity of \(\Phi \), which holds by assumption. Therefore it suffices to show that
when \(\xi \in K_s\) and \(\xi '\not \in K_s\). Define \({\tilde{\xi }} := \alpha \xi + \alpha ' \xi '\) and \({\tilde{\zeta }} :=\frac{1}{2} {\tilde{\xi }}\). We will show that
Observe that \(M_s\) satisfies Inc\(_1\), since \(N_s\) and \(\Phi \) do. By Inc\(_1\), (4.3) implies the previous inequality with constant \(\beta :=\frac{1}{2C}\) and concludes the proof. We consider two cases to prove (4.3).
- Case 1:
-
\({\tilde{\zeta }} \in K_s\). Then \(M_s({\tilde{\zeta }}) \leqslant s \leqslant N_s(\xi ')\) and so (4.3) holds with \(C=2\) when \(\alpha ' > \frac{1}{2}\). Thus we may assume that \(\alpha \geqslant \frac{1}{2}\). Now if \(M_s({\tilde{\zeta }})\leqslant 2 \Phi (\xi )\), then (4.3) holds with \(C=4\). Hence we further assume that \(M_s({\tilde{\zeta }})> 2\Phi (\xi )\). We may assume that \(\xi \), \(\xi '\) and 0 are not collinear since in the collinear case we can choose \(\xi '_k\rightarrow \xi '\) such that \(\xi \), \(\xi '_k\) and 0 are not collinear and use the continuity of \(M_s\) and \(N_s\). Let \(\zeta '\) be the intersection of the segment \([0, \xi ']\) and the line through \(\xi \) and \({\tilde{\zeta }}\) (see Fig. 1). If \(\zeta '\in K_s\), then \({\tilde{\zeta }} = \theta \xi + (1-\theta )\zeta '\) for some \(\theta \in (0,1)\). By the convexity of \(\Phi \) and \(M_s({\tilde{\zeta }})> 2\Phi (\xi )\) we have
$$\begin{aligned} M_s({\tilde{\zeta }}) = \Phi ({\tilde{\zeta }}) \leqslant \theta \Phi (\xi ) + (1-\theta ) \Phi (\zeta ') \leqslant \tfrac{1}{2} M_s({\tilde{\zeta }}) + M_s(\zeta '). \end{aligned}$$Thus \(M_s(\zeta ')=\Phi (\zeta ') \geqslant \frac{1}{2}M_s({\tilde{\zeta }})\). If, on the other hand, \(\zeta '\not \in K_s\), then \(M_s(\zeta ')\geqslant s\geqslant M_s({\tilde{\zeta }})\). In either case, we have \(M_s(\zeta ')\geqslant \frac{1}{2} M_s({\tilde{\zeta }})\). Consider the parallelogram \((0, \xi , {\tilde{\xi }}, {\tilde{\xi }}-\xi )\). Let \(\eta '\) be the intersection of the segments \([{\tilde{\xi }}, {\tilde{\xi }}-\xi ]\) and \([0,\xi ']\) (see Fig. 1). From \({\tilde{\xi }}=(1-\alpha ')\xi +\alpha '\xi '\) we observe that
$$\begin{aligned} \alpha ' = \frac{|{\tilde{\xi }}-\xi |}{|\xi -\xi '|} = \frac{|\eta '|}{|\xi '|} \geqslant \frac{|\zeta '|}{|\xi '|} =: \nu \in (0,1); \end{aligned}$$the second equality follows since the triangles \((\xi ', {\tilde{\xi }}, \eta ')\) and \((\xi ', \xi , 0)\) are similar. Thus, by \(M_s(\zeta ')\geqslant \frac{1}{2} M_s({\tilde{\zeta }})\) from the previous paragraph and Inc\(_1\) of \(M_s\),
$$\begin{aligned} N_s(\xi ') = M_s(\xi ') \geqslant \tfrac{1}{\nu }M_s( \nu \xi ' ) = \tfrac{1}{\nu }M_s( \zeta ' ) \geqslant \tfrac{1}{2\nu } M_s({\tilde{\zeta }}) \geqslant \tfrac{1}{2\alpha '} M_s({\tilde{\zeta }}), \end{aligned}$$where we used the conclusion of the previous paragraph in the penultimate step. The inequality
$$\begin{aligned} M_s\big ({\tilde{\zeta }}\big ) \leqslant 2\alpha ' N_s(\xi ') \end{aligned}$$follows, so (4.3) holds with \(C=2\).
- Case 2 :
-
\({\tilde{\zeta }} \not \in K_s\). Let \(\nu := \Vert {\tilde{\zeta }} \Vert _{K_s}^{-1} < 1\). Since \(K_s\) is closed, it follows from the definition of \(\Vert \cdot \Vert _{K_s}\) that \(\Phi (\nu {\tilde{\zeta }}) = s\) and \(\nu {\tilde{\zeta }} \in \partial K_s\). Furthermore, \(N_s(\nu {\tilde{\zeta }})=s=M_s(\nu {\tilde{\zeta }})\) and so
$$\begin{aligned} M_s({\tilde{\zeta }}) = s \Vert {\tilde{\zeta }}\Vert _{K_s} = \tfrac{1}{\nu }s&= \tfrac{1}{\nu }M_s(\nu {\tilde{\zeta }}) \leqslant \tfrac{4}{\nu }(\alpha \Phi (\nu \xi ) + \alpha ' N_s(\nu \xi ')) \leqslant 4(\alpha \Phi (\xi ) + \alpha ' N_s(\xi ')), \end{aligned}$$where we used the previous case for \(\nu {\tilde{\zeta }} \in K_s\) in the first inequality and Inc\(_1\) for the last step.
We are ready to prove the main theorem, i.e. the equivalence of (A1) and (M).
Proof of Theorem 1.2
Since \((\Phi _{B}^-)^\mathrm{conv}\leqslant \Phi _B^-\), it follows from (M) that
when \(\xi \in \mathbb {R}^m\) with \(\Phi _B^-(\xi )\leqslant \frac{K}{\mu (B)}\), where \(B\subset \mathbb {R}^n\) is a ball, which gives (A1).
Assume now conversely that (A1) holds and let \(s:=\frac{K}{\mu (B)}+1\) for a ball \(B\subset \mathbb {R}^n\) with \(\mu (B)\leqslant 1\). Define \(N_s\) as before based on \(K_s:=\{\xi \in \mathbb {R}^m\,|\, \Phi _B^+(\beta \xi ) \leqslant s\}\) and set \(M_s(\xi ) := \min \{\Phi _B^+(\beta \xi ), N_s(\xi )\}\). By Proposition 4.2, \(M_s\) is almost convex so \(M_s(\beta '\xi ) \leqslant (M_s)^\mathrm{conv}(\xi )\) by Corollary 2.4.
If \(\xi \in K_s\), then \(M_s(\xi ) = \Phi _B^+(\beta \xi )\leqslant s\). Now either \(\Phi _B^-(\xi )\leqslant \frac{K}{\mu (B)}\) in which case (A1) implies that \(\Phi _B^+(\beta \xi ) \leqslant \Phi _{B}^-(\xi )+1\), or \(\Phi _B^-(\xi )> \frac{K}{\mu (B)}\) in which case \(\Phi _B^+(\beta \xi )\leqslant s \leqslant \Phi _{B}^-(\xi )+1\). Combining the two cases, we find that
If \(\xi \not \in K_s\), then \(\nu := \Vert \xi \Vert _{K_s}^{-1} < 1\). As in Case 2 of the previous proof \(\nu \xi \in \partial K_s\) and \(\Phi _B^+(\beta \nu \xi )=s\). If \(\Phi _B^-(\nu \xi )< \frac{K}{\mu (B)}\), then (A1) implies that \(\Phi _B^+(\beta \nu \xi ) \leqslant \Phi _B^-(\nu \xi ) + 1< s\), which is a contradiction. Therefore \(\Phi _B^-(\nu \xi )\geqslant \frac{K}{\mu (B)} = s-1\) and so
where we used Inc\(_1\) of \(\Phi _B^-\) in the last step. Note that \(\frac{s}{s-1}=1+\frac{\mu (B)}{K}\leqslant 1+\frac{1}{K}\) since we assumed that \(\mu (B)\leqslant 1\).
In the previous paragraph we have shown that \(M_s\leqslant \left( 1+\frac{1}{K}\right) \Phi _B^- + 1\). Therefore, the convex minorant of \(M_s\) is also a convex minorant of \(\left( 1+\frac{1}{K}\right) \Phi _B^-+1\), and we conclude that \((M_s)^\mathrm{conv}\leqslant \left( 1+\tfrac{1}{K}\right) (\Phi _B^-)^\mathrm{conv}+1\) since \((\Phi _B^-)^\mathrm{conv}\) is the greatest convex minorant of \(\Phi _B^-\). We noted above that \(M_s(\beta '\xi ) \leqslant (M_s)^\mathrm{conv}(\xi )\). Therefore,
Let us show that (M) holds. Assume that \((\Phi _B^-)^\mathrm{conv}(\xi )\leqslant \frac{K}{\mu (B)}\). By Inc\(_1\) and the conclusion of the previous paragraph,
Therefore \(\tfrac{K}{K+1}\beta ' \xi \in K_s\) and \(M_s\left( \tfrac{K}{K+1}\beta ' \xi \right) =\Phi _B^+\left( \tfrac{K}{K+1}\beta \beta ' \xi \right) \). Thus
and we have established (M) with constant \(\frac{K}{K+1} \beta \beta '\). \(\square \)
Remark 4.4
From the proof of Proposition 4.2, we see that the almost convexity constant equals 8. From the proof of Corollary 2.4, we see that \(\beta '=8^{-i}\), where i is the smallest integer with \(2^i\geqslant m+1\). Thus \(2^{-i}>\frac{1}{2(m+1)}\) so that \(\beta '>(2(m+1))^{-3}\). Then we see from the proof or the previous theorem that the constant in (M) can be chosen as \((2(m+1))^{-3} \frac{K}{K+1} \beta \) when \(\beta \) is the constant from (A1). In other words, the constants from the two conditions are comparable up to a constant depending on the dimension when \(K\geqslant 1\).
The assumption \(\Phi _B^-(\xi )\leqslant \frac{K}{\mu (B)}\) from (A1) is somewhat difficult to verify. In the isotropic case, if we assume that \(\varrho _\varphi (f)\leqslant 1\), then we can conclude from Jensen’s inequality that
Thus we may apply (A1) to conclude that
This argument is not possible in the anisotropic case, since \((\Phi _B^-)^\mathrm{conv}\) is not comparable to \(\Phi _B^-\). Fortunately, the condition of (M) is easier to use.
Proof of Corollary 1.3
By Theorem 1.2, \(\Phi \) satisfies (M). Since \((\Phi _B^-)^\mathrm{conv}\) is convex, it follows by Jensen’s inequality that
Therefore we can use (M) with and the previous inequality to conclude that
\(\square \)
References
Ahmida, Y., Chlebicka, I., Gwiazda, P., Youssfi, A.: Gossez’s approximation theorems in Musielak–Orlicz–Sobolev spaces. J. Funct. Anal. 275(9), 2538–2571 (2018)
Baasandorj, S., Byun, S.-S.: Irregular obstacle problems for Orlicz double phase. J. Math. Anal. Appl. 507(1), 125791 (2022)
Balci, A., Diening, L., Surnachev, M.: New examples on Lavrentiev gap using fractals. Calc. Var. Partial Differ. Equ. 59, 180 (2020)
Baroni, P., Colombo, M., Mingione, G.: Regularity for general functionals with double phase. Calc. Var. Partial Differ. Equ. 57, 62 (2018)
Benyaiche, A., Harjulehto, P., Hästö, P., Karppinen, A.: The weak Harnack inequality for unbounded supersolutions of equations with generalized Orlicz growth. J. Differ. Equ. 275, 790–814 (2021)
Borowski, M., Chlebicka, I.: Modular density of smooth functions in inhomogeneous and fully anisotropic Musielak–Orlicz–Sobolev spaces. J. Funct. Anal. (2022) (To appear)
Buliček, M., Gwiazda, P., Skrzeczkowski, J.: Parabolic equations in Musielak–Orlicz spaces with discontinuous in time function. J. Differ. Equ. 290, 17–56 (2021)
Chlebicka, I.: A pocket guide to nonlinear differential equations in Musielak–Orlicz spaces. Nonlinear Anal. 175, 1–27 (2018)
Chlebicka, I., Gwiazda, P., Świerczewska-Gwiazda, A., Wróblewska-Kamińska, A.: Partial Differential Equations in Anisotropic Musielak–Orlicz Spaces. Springer, Cham (2021)
Chlebicka, I., Gwiazda, P., Zatorska-Goldstein, A.: Well-posedness of parabolic equations in the non-reflexive and anisotropic Musielak–Orlicz spaces in the class of renormalized solutions. J. Differ. Equ. 265(11), 5716–5766 (2018)
Chlebicka, I., Gwiazda, P., Zatorska-Goldstein, A.: Parabolic equation in time and space dependent anisotropic Musielak–Orlicz spaces in absence of Lavrentiev’s phenomenon. Ann. Inst. Henri Poincaré (C) Anal. Non Linéaire 36(5), 1431–1465 (2019)
Chlebicka, I., Gwiazda, P., Zatorska-Goldstein, A.: Renormalized solutions to parabolic equations in time and space dependent anisotropic Musielak–Orlicz spaces in absence of Lavrentiev’s phenomenon. J. Differ. Equ. 267(2), 1129–1166 (2019)
Colombo, M., Mingione, G.: Regularity for double phase variational problems. Arch. Ration. Mech. Anal. 215(2), 443–496 (2015)
De Filippis, C., Mingione, G.: Lipschitz bounds and nonautonomous integrals. Arch. Ration. Mech. Anal. 242(2), 973–1057 (2021)
Diening, L., Harjulehto, P., Hästö, P., Růžička, M.: Lebesgue and Sobolev Spaces with Variable Exponents. Lecture Notes in Mathematics, vol. 2017. Springer, Heidelberg (2011)
Hadzhy, O.V., Voitovych, M.V.: Continuity at a boundary point of solutions to quasilinear elliptic equations with generalized Orlicz growth and non-logarithmic conditions. Preprint at arxiv:2109.08643 (2021)
Gwiazda, P., Skrzypczak, I., Zatorska-Goldstein, A.: Existence of renormalized solutions to elliptic equation in Musielak–Orlicz space. J. Differ. Equ. 264(1), 341–377 (2018)
Harjulehto, P., Hästö, P.: Orlicz Spaces and Generalized Orlicz Spaces. Lecture Notes in Mathematics, vol. 2236. Springer, Cham (2019)
Harjulehto, P., Hästö, P.: Double phase image restoration. J. Math. Anal. Appl. 501(1), 123832 (2021)
Harjulehto, P., Hästö, P., Lee, M.: Hölder continuity of quasiminimizers and \(\omega \)-minimizers of functionals with generalized Orlicz growth. Annali della Scuola Normale Superiore di Pisa. Classe di Scienze XXII(2), 549–582 (2021)
Harjulehto, P., Hästö, P., Toivanen, O.: Hölder regularity of quasiminimizers under generalized growth conditions. Calc. Var. Partial Differ. Equ. 56(2), 22 (2017)
Hästö, P.: The maximal operator on generalized Orlicz spaces. J. Funct. Anal. 269(12), 4038–4048 (2015)
Hästö, P., Ok, J.: Maximal regularity for local minimizers of non-autonomous functionals. J. Eur. Math. Soc. 24(4), 1285–1334 (2022)
Hästö, P., Ok, J.: Regularity theory for non-autonomous partial differential equations without Uhlenbeck structure. Arch. Ration. Mech. Anal. 245(3), 1401–1436 (2022)
Hästö, P., Ok, J.: Regularity theory for non-autonomous problems with a priori assumptions
Kolmogoroff, A.: Zur Normierbarkeit eines allgemeinen topologischen linearen Raumes. Stud. Math. 5, 29–33 (1934)
Li, Y., Yao, F., Zhou, S.: Entropy and renormalized solutions to the general nonlinear elliptic equations in Musielak–Orlicz spaces. Nonlinear Anal. Real World Appl. 61, 103330 (2021)
Maeda, F.Y., Mizuta, Y., Ohno, T., Shimomura, T.: Boundedness of maximal operators and Sobolev’s inequality on Musielak–Orlicz–Morrey spaces. Bull. Sci. Math. 137, 76–96 (2013)
Maeda, F.Y., Mizuta, Y., Ohno, T., Shimomura, T.: Trudinger’s inequality for double phase functionals with variable exponents. Czechoslovak Math. J. 71, 511–528 (2021)
Niculescu, C.P., Persson, L.-E.: Convex Functions and Their Applications: A Contemporary Approach, CMS Books in Mathematics, 2nd edn. Springer Nature, Cham (2018)
Papageorgiou, N.S., Rădulescu, V.D., Zhang, Y.: Resonant double phase equations. Nonlinear Anal. Real World Appl. 64, 103454 (2022)
Skrypnik, I.I., Voitovych, M.V.: \({\cal{B} }_1\) classes of De Giorgi–Ladyzhenskaya–Ural’tseva and their applications to elliptic and parabolic equations with generalized Orlicz growth conditions. Nonlinear Anal. 202, 112135 (2021)
Acknowledgements
I would like to thank Iwona Chlebicka for comment and help in explaining [9, Remarks 2.3.14 and 3.7.6] and the Antti and Jenny Wihuri Foundation for financial support.
Funding
Open Access funding provided by University of Turku (UTU) including Turku University Central Hospital.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Hästö, P.A. A Fundamental Condition for Harmonic Analysis in Anisotropic Generalized Orlicz Spaces. J Geom Anal 33, 7 (2023). https://doi.org/10.1007/s12220-022-01052-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s12220-022-01052-5