1 Introduction

In this paper, we consider the regularity of solutions to the divergence form elliptic equation

$$\begin{aligned} {\left\{ \begin{array}{ll} Lu = -{{\mathrm{div}}}A\nabla u = f &{} \quad \text {in }\quad \Omega \\ u = 0 &{} \quad \text {on }\quad \partial \Omega \end{array}\right. } \end{aligned}$$

where \(\Omega \subset \mathbb {R}^n\), \(n\ge 2\), is a bounded open set whose boundary \(\partial \Omega \) is \(C^2\), and \(A=A(x)=(a_{ij}(x))\) is an \(n\times n\) matrix of real-valued, measurable functions that satisfies the ellipticity condition

$$\begin{aligned} \lambda |\xi |^2 \le \langle A\xi ,\xi \rangle \le \Lambda |\xi |^2, \qquad 0< \lambda < \Lambda , \quad \xi \in \mathbb {R}^n. \end{aligned}$$

We derive \(L^p\) estimates, \(p<2\), for solutions of this equation when \(A\) has discontinuous coefficients and \(f\in L^p(\Omega )\).

This and related problems have a long history. If \(A\) is continuous and \(\partial \Omega \) is \(C^{2,\alpha }\), then these results are classical: see Gilbarg and Trudinger [19]. Miranda [28] showed that if \(n\ge 3\), \(\partial \Omega \) is \(C^3\), and \(A\in W^{1,n}(\Omega )\), then any weak solution of \(L u = f\), \(f\in L^q(\Omega )\), \(q\ge 2\), is a strong solution and \(\Vert D^2u \Vert _{L^2(\Omega )} \le C(\Vert f\Vert _{L^q(\Omega )}+\Vert u\Vert _{L^1(\Omega )})\). This result is false when \(n=2\): for a counter-example, see Example 1.5 below.

A similar problem for non-divergence form elliptic operators was considered by Chiarenza and Franciosi [5]. They proved that if \(n\ge 3\), \(\Omega \) is bounded and \(\partial \Omega \) is \(C^{2}\), then the non-divergence form equation \(\mathrm {tr}(A D^2u) =f\), with \(f\in L^2(\Omega )\) and \(A\) in a certain vanishing Morrey class (a generalization of \(VMO\)), has a unique solution \(u\) satisfying \(\Vert u\Vert _{W^{2,2}(\Omega )} \le C\Vert f\Vert _{L^2(\Omega )}\). This was generalized by Chiarenza et al. [6], who showed that if \(f\in L^p\), \(1<p<\infty \), then the same equation has a unique solution satisfying \(\Vert u\Vert _{W^{2,p}(\Omega )} \le C\Vert f\Vert _{L^p(\Omega )}\). These results in turn were further generalized by Vitanza [3133].

Divergence form equations of the form \({{\mathrm{div}}}A\nabla u = {{\mathrm{div}}}F\) were considered by Di Fazio [15] on bounded domains with \(\partial \Omega \in C^{1,1}\) and Iwaniec and Sbordone [24] on \(\mathbb {R}^n\); they showed that if \(A\in VMO\), then there exists a unique weak solution that satisfies \(\Vert \nabla u \Vert _{L^p(\Omega )} \le C\Vert F\Vert _{L^p(\Omega )}\), \(1<p<\infty \). The results for bounded domains were improved by Auscher and Qafsaoui [4], who showed that it suffices to assume \(\partial \Omega \) is \(C^1\) and that \(A\) does not need to be real symmetric. For a generalization to nonlinear equations, see [18]. In [27], Meyers considered the more general equation \({{\mathrm{div}}}A\nabla u = {{\mathrm{div}}}F+f\) on a bounded domain with a smooth boundary. He showed that if \(A\) satisfies (1.2), then there exists \(p_0<2\) such that for all \(p_0<p<p_0'\), there exists a weak solution that satisfies \(\Vert \nabla u \Vert _{L^p(\Omega )} \le C(\Vert F\Vert _{L^p(\Omega )}+\Vert f\Vert _{L^p(\Omega )})\) (see Theorem 3.3 below).

Our main result is a generalization of the result of Miranda to \(p< 2\) and \(n\ge 2\).

Theorem 1.1

Let \(\Omega \subset \mathbb {R}^n\), \(n\ge 2\), be a bounded open set such that \(\partial \Omega \) is \(C^1\). Let \(A\) be an \(n\times n\) real-valued matrix that satisfies (1.2). If \(A\in W^{1,n}(\Omega )\), then there exists \(p_0\in (1,2)\) so that for all \(p\in (p_0,2)\) and \(f\in L^p(\Omega )\) there exists a unique solution \(u\) of (1.1) that satisfies a local regularity estimate: given any open set \(\Omega '\Subset \Omega \),

$$\begin{aligned} \Vert D^2 u \Vert _{L^p(\Omega ')} \le D^{-1}C\Vert f\Vert _{L^p(\Omega )}, \end{aligned}$$

where \(C\) is independent of both \(u\) and \(f\) and \(D=d(\Omega ',\partial \Omega )\). If we further assume that \(\partial \Omega \) is \(C^2\), then

$$\begin{aligned} \Vert D^2 u \Vert _{L^p(\Omega )} \le C\Vert f\Vert _{L^p(\Omega )}. \end{aligned}$$

where \(C\) is independent of both \(u\) and \(f\).

Remark 1.2

To compare Theorem 1.1 to the work of Di Fazio et al. described above, note that if \(A\in W^{1,n}\) then \(A\in VMO\): see, for instance, [9].

Remark 1.3

Our techniques actually allow us to assume that \(A\) is a complex matrix that satisfies

$$\begin{aligned} |\langle A\xi ,\eta \rangle |\le \Lambda |\xi ||\eta |, \qquad \lambda |\xi |^2\le \mathrm {Re}\langle A\xi ,\xi \rangle , \quad \xi ,\eta \in \mathbb {C}^n. \end{aligned}$$

Details are left to the interested reader.

The lower bound \(p_0\) in Theorem 1.1 is intrinsic to our method of proof. It is an open question whether our results can be extended to the full range \(1<p<2\). The stronger assumptions on the boundary to get global regularity in Theorem 1.1 are not unexpected: There exist examples that show that for \(n\ge 2\) and \(p>1\), there exists a bounded \(C^1\) domain \(\Omega \) and \(f\in C^\infty (\overline{\Omega })\) such that the solution \(u\) to \(\Delta u = f\) in \(\Omega \), \(u=0\) on \(\partial \Omega \), is not in \(W^{2,1}(\Omega )\) (see [14, 25]).

When \(n\ge 3\), an examination of the constants shows that we can take \(p=2\) in our proof. This lets us give a new proof of the result of Miranda referred to above, as well as a local regularity result.

Corollary 1.4

Let \(\Omega \subset \mathbb {R}^n\), \(n\ge 3\), be a bounded open set such that \(\partial \Omega \) is \(C^1\). Let \(A\) be an \(n\times n\) real-valued matrix that satisfies (1.2). If \(A\in W^{1,n}(\Omega )\), then for all \(f\in L^2(\Omega )\), there exists a unique solution \(u\) of (1.1) that satisfies

$$\begin{aligned} \Vert D^2 u \Vert _{L^2(\Omega ')} \le D^{-1}C\Vert f\Vert _{L^2(\Omega )}, \end{aligned}$$

where \(C\) is independent of both \(u\) and \(f\), \(\Omega ' \Subset \Omega \) and \(D=d(\Omega ',\partial \Omega )\). If we further assume that \(\partial \Omega \) is \(C^2\), then

$$\begin{aligned} \Vert D^2 u \Vert _{L^2(\Omega )} \le C\Vert f\Vert _{L^2(\Omega )}. \end{aligned}$$

We now consider the case \(p=n=2\). In this case, Corollary 1.4 is false, as the next example shows.

Example 1.5

Let \(B=B_{1/2}(0)\subset \mathbb {R}^2\) be the open ball of radius \(1/2\) centered at the origin. Then there exists a matrix \(A\in W^{1,2}(B)\) satisfying (1.2) and a solution to

$$\begin{aligned} -{{\mathrm{div}}}(A\nabla u)=0 \end{aligned}$$

such that \(u\in W^{2,p}(B)\) for all \(p<2\), but \(u\notin W^{2,2}(B)\).

We can adapt the proofs of Theorem 1.1 to the case \(p=n=2\) if we assume that \(\nabla A\) satisfies stronger integrability conditions. We state these in the scale of Orlicz spaces—for a precise definition, see Sect. 2 below. For brevity, we only state the global regularity result.

Theorem 1.6

Let \(\Omega \subset \mathbb {R}^2\) be a bounded open set such that \(\partial \Omega \) is \(C^2\). Let \(A\) be a \(2\times 2\) real-valued matrix that satisfies (1.2). Suppose further that for some \(\delta >0\),

$$\begin{aligned} \Vert \nabla A\Vert _{L^2(\log L)^{1+\delta }(\Omega )}<\infty . \end{aligned}$$

If \(f\in L^2(\Omega )\) then there exists a unique solution \(u\) of (1.1) that satisfies

$$\begin{aligned} \Vert D^2u\Vert _{L^2(\Omega )}\le C \Vert \nabla A\Vert _{L^2(\log L)^{1+\delta }(\Omega )} \Vert f\Vert _{L^2(\Omega )}. \end{aligned}$$

Our second result gives information in the end point case when \(\delta =0\). In this case, we need to impose an additional regularity condition. Recall (cf. [35]) that if \(\Omega \subset \mathbb {R}^2\), a function \(u\) is contained in the Morrey space \(L^{2,\lambda }(\Omega )\) if

$$\begin{aligned} \Vert u\Vert _{L^{2,\lambda }(\Omega )} = \sup _{Q} \left( |Q|^{-\frac{\lambda }{2}} \int _{Q\cap \Omega } u^2\,\hbox {d}x\right) ^{1/2} < \infty . \end{aligned}$$

Theorem 1.7

Let \(\Omega \subset \mathbb {R}^2\) be a bounded open set such that \(\partial \Omega \) is \(C^2\). Let \(A\) be a \(2\times 2\) real-valued matrix that satisfies (1.2) and

$$\begin{aligned} \Vert A\Vert _{L^2(\log L)(\Omega )}<\infty . \end{aligned}$$

Suppose further that for some \(1<r<2\), \(\nabla A \in L^{2,\frac{4}{r}-2}(\Omega )\). If \(f\in L^2(\Omega )\) then there exists a unique solution \(u\) of (1.1) that satisfies

$$\begin{aligned} \Vert D^2u\Vert _{L^2(\Omega )}\le C(r,\Omega ) \Vert \nabla A\Vert _{L^{2,\frac{4}{r}-2}(\Omega )} \Vert f\Vert _{L^2(\Omega )}. \end{aligned}$$

Unfortunately, both of these results are weaker than they appear. In two dimensions, (1.5) implies that \(\nabla A\) is continuous: see Cianchi [7, 8]. Similarly, if we assume that \(\nabla A \in L^{2,\frac{4}{r}-2}(\Omega )\), then we also have that \(A\) is Hölder continuous: see [19, p. 298]. Thus, both of these results follow from classical Schauder estimates [19]. Nevertheless, since our proofs are different from the classical ones they are of some interest.

It remains open whether anything can be said when \(p=n=2\) and \( A \in W^{1,2}(\Omega )\) or even when \(\Vert \nabla A\Vert _{L^2(\log L)(\Omega )}<\infty \). We conjecture that in this endpoint case, \(D^2u \in L^{2)}(\Omega )\), where \(L^{2)}\) denotes the grand Lebesgue space with norm

These spaces were introduced in [22] and have proved useful in the study of endpoint estimates in PDEs [20, 21]. As evidence for this conjecture, we note that the solution \(u\) given in Example 1.5 is in \(L^{2)}(B)\). A stronger conjecture, also satisfied by our example, is that \(D^2u\) lies in the Orlicz space \(L^2(\log L)^{-1}(\Omega )\) (This space is a proper subset of \(L^{2)}\): see [20]). In both cases, our proof techniques are not sharp enough to produce these estimates and a different approach will be required.

The remainder of this paper is organized as follows. In Sect. 2, we state some preliminary definitions and weighted Fefferman–Phong type inequalities that are central to our proofs. These results depend on recent work on two-weight norm inequalities for the Riesz potential [13]. In Sect. 3, we prove Theorem 1.1. Our proof uses ideas from [5]. In Sect. 4, we consider the special case when \(n=2\): We construct Example 1.5 and sketch the proofs of Theorems 1.6 and 1.7. Throughout our notation will be standard or defined as needed. Given a vector matrix function, if way say that it belongs to a scalar function space (e.g., \(A\in W^{1,n}(\Omega )\)), we mean that each component function is an element of the function space; to compute the norm we first take the \(\ell ^2\) norm of the components. Constants \(C\), \(C(n)\), etc., may change in value at each appearance.

2 Preliminary results

In this section, we give conditions on a weight \(w\) for the two-weight Sobolev inequality

$$\begin{aligned} \Vert fw\Vert _{L^p(\Omega )}\le C\Vert \nabla f\Vert _{L^p(\Omega )} \end{aligned}$$

to hold. Such inequalities are sometimes referred to as Fefferman–Phong inequalities: see [17]. Given the classical pointwise inequality

$$\begin{aligned} |f(x)| \le C(n)I_1(|\nabla f|)(x), \qquad f \in C_0^\infty , \end{aligned}$$

it suffices to prove two-weight estimates for the Riesz potential of order one:

$$\begin{aligned} I_1f(x)=\Delta ^{-\frac{1}{2}}f(x)=c\int _{\mathbb {R}^n}\frac{f(y)}{|x-y|^{n-1}}\,\hbox {d}y. \end{aligned}$$

In Theorems 1.1 and 1.6, we will apply a sharp sufficient condition for the Riesz potential to be bounded that was proved by Pérez [29]; we will use the version from [13, Theorem 3.6] as this gives precise values for the constants. To state this result, we need to make some definitions; for additional information on Orlicz spaces and two-weight inequalities, see  [12, 13]. A convex, strictly increasing function \(\Phi :[0,\infty ]\rightarrow [0,\infty ]\) is said to be a Young function if \(\Phi (0)=0\) and \(\Phi (\infty )=\infty \). Given a Young function, there exists another Young function, \(\bar{\Phi }\), called the associate function, such that \(\Phi ^{-1}(t)\bar{\Phi }^{-1}(t)\simeq t\). For our purposes, there are two particularly important examples of Young functions that we will use. First, if \(\Phi (t)=t^r\), \(r>1\), then \(\bar{\Phi }(t)=t^{r'}\). If \(\Phi (t)=t^r\log (e+t)^a\), then \(\bar{\Phi }(t)\simeq t^{r'}\log (e+t)^{-\frac{a}{r-1}}\).

Given \(1<p<\infty \) and a Young function \(\Phi \), define

$$\begin{aligned} \alpha _{p,\Phi } = \left( \int _1^\infty \frac{\Phi (t)}{t^p}\frac{\hbox {d}t}{t}\right) ^{1/p}. \end{aligned}$$

Our conditions on weights are defined using a normalized Orlicz norm: given Young function \(\Phi \) and a cube \(Q\), let

Given a pair of weights \((u, v)\) (i.e., non-negative, locally integrable functions) and Young functions \(\Phi \) and \(\Psi \), let

$$\begin{aligned}{}[u,v]_{A^1_{p,\Psi ,\Phi }} = \sup _Q |Q|^{\frac{1}{n}} \Vert u^{1/p}\Vert _{\Psi ,Q}\Vert v^{-1/p}\Vert _{\Phi ,Q}. \end{aligned}$$

where the supremum is taken over all cubes in \(\mathbb {R}^n\) with sides parallel to the coordinate axes.

Theorem 2.1

[13, Theorem 3.6] Given \(1<p<\infty \), a pair of weights \((u,v)\), and Young functions \(\Phi \) and \(\Psi \), we have that

$$\begin{aligned} \Vert I_1 \Vert _{L^p(v)\rightarrow L^p(u)} \le C(n,p)[u,v]_{A^1_{p,\Psi ,\Phi }} \, \alpha _{p,\bar{\Phi }}\, \alpha _{p',\bar{\Psi }}. \end{aligned}$$

Remark 2.2

In Theorem 2.1, we need to apply the integral condition in (2.1) to the associate functions \(\bar{\Phi },\,\bar{\Psi }\). If \(\Phi \) and \(\Psi \) are doubling (i.e., \(\Phi (2t)\le C\Phi (t)\), \(t>0\), and similarly for \(\Psi \)), then by a change of variables, this condition can be restated in terms of \(\Phi \) and \(\Psi \). See [12, Prop. 5.10] for further information.

We can now give the Sobolev inequalities needed for our results.

Lemma 2.3

Fix \(n\ge 2\) and \(1<p<n\). Let \(\Omega \subset \mathbb {R}^n\). Then, for any \(f\in W_0^{1,p}(\Omega )\) and \(w\in L^n(\Omega )\),

$$\begin{aligned} \Vert fw\Vert _{L^p(\Omega )} \le C(n) (p'-n')^{-1/p'}\Vert w\Vert _{L^n(\Omega )} \Vert \nabla f\Vert _{L^p(\Omega )}. \end{aligned}$$


Extend \(w\) to a function on all of \(\mathbb {R}^n\) by setting it equal to \(0\) outside of \(\Omega \). Let \(\Psi (t)=t^{n}\) and \(\Phi (t)=t^r\), \(1<r<p\); the exact value of \(r\) is not significant. Then

$$\begin{aligned} \alpha _{p',\bar{\Psi }} = (p'-n')^{-1/p'}, \qquad \alpha _{p,\bar{\Phi }} = (p-r)^{-1/p}, \end{aligned}$$

and so we have that

Therefore, by Lemma 2.1 we have that for all \(f\in C_0^\infty (\Omega )\),

$$\begin{aligned} \Vert fw\Vert _{L^p(\mathbb {R}^n)}\le \Vert I_1(|\nabla f|)w\Vert _{L^p(\mathbb {R}^n)} \le C(n,p,r)(p'-n')^{-1/p'}\Vert w\Vert _{L^n(\Omega )}\Vert \nabla f\Vert _{L^p(\mathbb {R}^n)}. \end{aligned}$$

The desired inequality follows for all \(f\) by a standard approximation argument.\(\square \)

When \(n\ge 3\), we see that \(w\in L^n(\Omega )\) implies the Sobolev inequality for \(p=2\). When \(n=2\), we only get the Sobolev inequality for \(1<p<2\), and the constant blows up as \(p\) tends to \(2\) (and also as it tends to \(1\)). In general, \(w\in L^2(\Omega )\) will not be a sufficient condition for the Sobolev inequality when \(p=n=2\).

To prove Theorem 1.6, we can use the full power of Theorem 2.1 to prove a substitute for Lemma 2.3. To state it, we define the non-normalized Orlicz norm: given an open set \(\Omega \) and an Orlicz function \(\Psi \),

$$\begin{aligned} \Vert f\Vert _{ L^\Psi (\Omega )} = \inf \left\{ \lambda > 0 : \int _\Omega \Psi \left( \frac{|f(x)|}{\lambda }\right) \hbox {d}x\le 1 \right\} . \end{aligned}$$

When \(\Psi (t)=t^2\log (e+t)^{1+\delta }\), then we write \(L^\Phi (\Omega )= L^2(\log L)^{1+\delta }(\Omega )\).

Lemma 2.4

Given a bounded open set \(\Omega \subset \mathbb {R}^2\) and \(w\in L^2(\log L)^{1+\delta }(\Omega )\), if \(f\in W_0^{1,2}(\Omega )\), then

$$\begin{aligned} \Vert fw\Vert _{L^2(\Omega )} \le C\delta ^{-1/2} [1+{{\mathrm{diam}}}(\Omega )]\Vert w\Vert _{L^2(\log L)^{1+\delta }(\Omega )} \Vert \nabla f\Vert _{L^2(\Omega )}. \end{aligned}$$


We begin as in the proof of Lemma 2.3, but we now take \(\Psi (t)=t^2\log (e+t)^{1+\delta }\). Then

$$\begin{aligned} \alpha _{2,\bar{\Psi }} = \left( \int _1^\infty \frac{\hbox {d}t}{t\log (e+t)^{1+\delta }}\right) ^{1/2}= C\delta ^{-1/2}< \infty , \end{aligned}$$


$$\begin{aligned}{}[ w^2,1]_{A^1_{2,\Phi ,\Psi }} = \sup _Q |Q|^{1/2}\Vert w\Vert _{\Psi ,Q}. \end{aligned}$$

Since we may assume \({{\mathrm{supp}}}(w)\subset \Omega \), we may restrict the supremum to cubes \(Q\) such that \(|Q|\le {{\mathrm{diam}}}(\Omega )^2\). Then by the definition of the norm, we have that

The desired inequality now follows as before.\(\square \)

To prove Theorem 1.7, we need an “off-diagonal” inequality for the Riesz potential. There is a version of Theorem 2.1 in this case, but we will use a stronger result due to D. R. Adams [1] (see also [34, Theorem 4.7.2]).

Theorem 2.5

Given \(1<p<n\), \(p<q<\infty \) and a weight \(u\), if

$$\begin{aligned} u(Q) \le K|Q|^{\frac{a}{n}}, \end{aligned}$$

where \(a= \frac{q(n-p)}{p}\), then

$$\begin{aligned} \Vert I_1 f \Vert _{L^q(u)} \le C(p,q,n) K^{1/q}\Vert f\Vert _{L^p}. \end{aligned}$$

Lemma 2.6

Given an open set \(\Omega \subset \mathbb {R}^2\), suppose that for \(1<r<2\), \(w\in L^{2,\frac{4}{r}-2}(\Omega )\) If \(f\in W_0^{1,2}(\Omega )\), then

$$\begin{aligned} \Vert fw\Vert _{L^2(\Omega )} \le C(r)\Vert w\Vert _{L^{2,\frac{4}{r}-2}(\Omega )} \Vert \nabla f\Vert _{L^r(\Omega )}. \end{aligned}$$


Extend \(w\) to a function on all of \(\mathbb {R}^2\) by setting it equal to \(0\) outside of \(\Omega \). Define \(a\) as in Theorem 2.5 with \(p=r\), \(q=n=2\). Then for all cubes \(Q\) that intersect \(\Omega \),

$$\begin{aligned} |Q|^{-\frac{a}{2}}\int _Q w(x)^2\,\hbox {d}x = |Q|^{1-\frac{2}{r}}\int _{Q\cap \Omega } w(x)^2\,\hbox {d}x \le \Vert w\Vert _{L^{2,\frac{4}{r}-2}(\Omega )}^2. \end{aligned}$$

Inequality (2.4) now follows from Theorem 2.5 and an approximation argument.\(\square \)

3 Proof of Theorem 1.1

We begin with two results due to Meyers [27]. The first is a coercivity condition.

Theorem 3.1

Given a bounded open set \(\Omega \subset \mathbb {R}^n\) with \(C^1\) boundary, let \(A\) be an \(n\times n\) real-valued matrix that satisfies (1.2). Define the sesquilinear form

$$\begin{aligned} \mathfrak {a}(u,v) = \int _\Omega A\nabla u \cdot \nabla v\,\hbox {d}x. \end{aligned}$$

Then there exists \(p_0=p_0(n,\lambda ,\Lambda ,\Omega )\), \(1<p_0<2\), such that for all \(p\), \(p_0<p\le 2\), and all \(u \in W^{1,p}_0(\Omega )\),

$$\begin{aligned} \Vert u \Vert _{W^{1,p}_0(\Omega )} \approx \sup _{\Vert v\Vert _{W^{1,p'}_0(\Omega )}=1} |\mathfrak {a}(u,v)|. \end{aligned}$$

Moreover, the constants in this equivalence depend on \(\lambda \), \(\Lambda \), \(p\), \(n\), and \(\Omega \). They are independent of the specific matrix \(A\).


The upper estimate for \(\mathfrak {a}(u,v)\) is just Hölder’s inequality; it is the lower estimate that is non-trivial. From the proof of Theorem 1 in [27] we have the existence of \(p_0<2\) and \(\kappa =\kappa (\lambda ,\Lambda ,p,\Omega )>0\) such that

$$\begin{aligned} \inf _{\Vert u\Vert _{W^{1,p}_0}=1}\sup _{\Vert v\Vert _{W^{1,p'}_0}=1}|\mathfrak {a}(u,v)|\ge \kappa . \end{aligned}$$

A key hypothesis in the proof is the existence of \(q>p_0'\) such that for every \(F\in L^q(\Omega )\), there exists a unique weak solution \(\Phi \) to the equation \(\Delta \Phi ={{\mathrm{div}}}F\) on \(\Omega \) and the estimate

$$\begin{aligned} \Vert \nabla \Phi \Vert _{q}\le C\Vert F\Vert _{q}, \end{aligned}$$

holds. Since \(\partial \Omega \) is \(C^1\), by Auscher and Qafsaoui [4], we have that such a solution exists and (3.2) holds for all \(q\), \(1<q<\infty \).\(\square \)

Remark 3.2

The value of \(p_0\) is difficult to estimate from Meyer’s proof. There is an elegant proof of this result in [30] that uses the Hodge decomposition. In [23] a careful estimate is given for the resulting constants; though again the exact value is not easy to determine. In passing we note that in [30], Theorem 3.1 is proved for “regular” domains, which are defined abstractly in [23]. However, regular domains include Lipschitz domains: see [21].

For our existence results, we also need the following result of Meyers [27, Theorem 1].

Theorem 3.3

Let \(\Omega \subset \mathbb {R}^n\), \(n\ge 2\), be a bounded open set such that \(\partial \Omega \) is \(C^1\), and let \(A\) be an \(n\times n\) real-valued matrix that satisfies (1.2). Then the equation

$$\begin{aligned} Lu={{\mathrm{div}}}( A\nabla u) =f \end{aligned}$$

has a unique solution in \(W_0^{1,p}(\Omega )\) for every \(f\in L^p(\Omega )\), provided \(p_0<p<p_0'\), where \(p_0\) is the constant from Theorem 3.1. The solution satisfies the estimate

$$\begin{aligned} \Vert \nabla u\Vert _{L^p(\Omega )}\le C\Vert f\Vert _{L^p(\Omega )}, \end{aligned}$$

where \(C=C(n,\lambda ,\Lambda ,p,\Omega )\).

Proof of Theorem 1.1

Fix a matrix \(A\) satisfying (1.2), and fix \(p\), \(p_0<p<2\), where \(p_0\) is as in Theorem 3.1. By Theorem 3.3, for any \(f\in L^p(\Omega )\) the equation \(Lu=f\) has a unique solution \(u \in W^{1,p}_0(\Omega )\) such that

$$\begin{aligned} \Vert \nabla u \Vert _{L^p(\Omega )} \le C \Vert f \Vert _{L^p(\Omega )}, \end{aligned}$$

with \(C\) independent of \(f\).

We first prove the desired estimate on \(D^2 u\) in the special case when \(f\in C^\infty (\Omega )\) and \(A\in C^\infty (\Omega )\); afterward, we will prove the general case by a double approximation argument. Let \(u\) be the solution of (1.1). Then \(u\in C^\infty (\Omega )\): see Evans [16, Th. 3,Sec. 6.3.1]. (Note that in this result, there is an implicit assumption on the regularity of the boundary because of an appeal to a Poincaré-Sobolev type inequality for functions without compact support in \(\Omega \); \(C^1\) is more than sufficient for this purpose.) We now have the pointwise identity

$$\begin{aligned} f = - {{\mathrm{div}}}A\nabla u = -\sum _{i,j} \big ( a_{ij} u_{x_j}\big )_{x_i}. \end{aligned}$$

Fix \(s\) with \(1\le s \le n\) and \(\eta \in C^\infty _0(\Omega )\) with \(0\le \eta \le 1\). Then \(\eta u_{x_s}\in W_0^{1,p}(\Omega )\), so by Theorem 3.1 there exists \(v\in C_0^2(\Omega )\), \(\Vert v\Vert _{W^{1,p'}_0}=1\), and \(\kappa =\kappa (n,\lambda ,\Lambda ,\Omega )>0\) such that

$$\begin{aligned}&|\mathfrak {a}(\eta u_{x_s},v)| \ge \kappa \Vert \eta u_{x_s}\Vert _{W^{1,p}_0} \ge \kappa \Vert \nabla \big (\eta u_{x_s}\big )\Vert _{L^p(\Omega )} \nonumber \\&\qquad \ge \kappa \Vert \eta \nabla u_{x_s}\Vert _{L^p(\Omega )} - \kappa \Vert u_{x_s} \nabla \eta \Vert _{L^p(\Omega )}. \end{aligned}$$

If we multiply \(f\) by \(\eta v_{x_s}\), integrate over \(\Omega \) and then integrate by parts twice we get

$$\begin{aligned} \int _\Omega f\eta v_{x_s}\,\hbox {d}x&= - \int _\Omega \sum _{i,j} \big (a_{ij} u_{x_j}\big )_{x_i} \eta v_{x_s}\,\hbox {d}x \\&= \int _\Omega \sum _{i,j} a_{ij} u_{x_j} \big ( \eta v_{x_s}\big )_{x_i}\,\hbox {d}x\nonumber \\&= -\int _\Omega \sum _{i,j} \big (\eta a_{ij} u_{x_j}\big )_{x_s} v_{x_i}\,\hbox {d}x + \int _\Omega v_{x_s} A\nabla u\cdot \nabla \eta \, \hbox {d}x\nonumber \\&= - \int _\Omega \sum _{i,j} \eta \big (a_{ij} \big )_{x_s}u_{x_j} v_{x_i}\,\hbox {d}x - \int _\Omega \eta _{x_s} A\nabla u\cdot \nabla v\,\hbox {d}x\\&\qquad - \int _\Omega \eta A \nabla \big ( u_{x_s}\big )\cdot \nabla v\,\hbox {d}x + \int _\Omega v_{x_s} A\nabla u\cdot \nabla \eta \,\hbox {d}x.\nonumber \end{aligned}$$

Therefore, if we take absolute values, rearrange terms, and combine this with inequality (3.4), we get

$$\begin{aligned} \kappa \Vert \eta \nabla ( u_{x_s})\Vert _{L^p(\Omega )}&\le |\mathfrak {a}(\eta u_{x_s},v)| + \kappa \Vert u_{x_s}\nabla \eta \Vert _{L^p(\Omega )}\\&\le \left| \int _\Omega \eta A\nabla (u_{x_s})\cdot \nabla v\,\hbox {d}x\right| + \left| \int _\Omega u_{x_s} A\nabla \eta \cdot \nabla v\,\hbox {d}x\right| + \kappa \Vert u_{x_s}\nabla \eta \Vert _{L^p(\Omega )}\\&\le \int _\Omega \bigg |\sum _{i,j} \eta \big (a_{ij}\big )_{x_s} u_{x_j}v_{x_i}\bigg |\,\hbox {d}x +\int _\Omega |\eta _{x_s}|\,|A\nabla u\cdot \nabla v|\; \hbox {d}x\\&\qquad +\, \int _\Omega |v_{x_s}|\,|A\nabla u \cdot \nabla \eta |\; \hbox {d}x + \int _\Omega |u_{x_s}|\,|A\nabla \eta \cdot \nabla v|\; \hbox {d}x\nonumber \\&\qquad + \int _\Omega |f\eta v_{x_s}|\,\hbox {d}x + \kappa \Vert u_{x_s}\nabla \eta \Vert _{L^p(\Omega )} \\&= I_1 + I_2 + I_3 + I_4 + I_5 + I_6. \end{aligned}$$

We estimate each separately. The bound for \(I_5\) is straightforward: by Hölder’s inequality,

$$\begin{aligned} I_5 \le \Vert f\Vert _{L^p(\Omega )}\Vert v_{x_s}\Vert _{L^{p'}(\Omega )} \le \Vert f\Vert _{L^p(\Omega )}\Vert v\Vert _{W_0^{1,p'}(\Omega )} = \Vert f\Vert _{L^p(\Omega )}. \end{aligned}$$

Similarly, using Hölder’s inequality together with (1.2) we see that

$$\begin{aligned} I_2+I_3+I_4+I_6 \le C(\kappa ,\Lambda )(\sup _\Omega |\nabla \eta |)\Vert \nabla u\Vert _{L^p(\Omega )}. \end{aligned}$$

The key estimate is for \(I_1\). Define

$$\begin{aligned} A_s = \big ( \big (a_{ij}\big )_{x_s}\big ), \qquad U = |\nabla A| = \bigg (\sum _{i,j,s} \big (a_{ij})_{x_s}^2\bigg )^{1/2}, \end{aligned}$$

and fix \(\epsilon >0\); the exact value of \(\epsilon \) will be given below. Since \(U\in L^n(\Omega )\), there exists \(K=K(\epsilon ,U)\) such that

$$\begin{aligned} \left( \int _{\{x : U(x)>K\}} U(x)^n\,\hbox {d}x\right) ^{1/n} < \epsilon . \end{aligned}$$

Let \(U_1= U\chi _{\{x : U(x)>K\}}\) and \(U_2=U-U_1\). Then, by Hölder’s inequality and Lemma 2.3, here using that \(\eta \nabla u\in W^{1,p}_0(\Omega )\), we can estimate as follows:

$$\begin{aligned} I_1= & {} \int _\Omega |\eta A_s \nabla u \cdot \nabla v|\,\hbox {d}x \\&\le \int _\Omega U|\eta \nabla u||\nabla v|\,\hbox {d}x \\&\le \left( \int _\Omega |\eta \nabla u\,U|^p\,\hbox {d}x\right) ^{1/p} \left( \int _\Omega |\nabla v|^{p'}\,\hbox {d}x\right) ^{1/p'} \\&\le \left( \int _\Omega |\eta \nabla u\, U_1|^p \,\hbox {d}x\right) ^{1/p} + \left( \int _\Omega |\eta \nabla u\, U_2|^p \,\hbox {d}x\right) ^{1/p}\\&\le C(n)(p'-n')^{-1/p'}\epsilon \left( \int _\Omega |\nabla (\eta \nabla u)|^p\,\hbox {d}x\right) ^{1/p} + K(\epsilon ,U)\left( \int _\Omega |\nabla u|^p \,\hbox {d}x\right) ^{1/p}\\&\le C(n,p)\epsilon \left( \int _\Omega |\eta D^2 u|^p\,\hbox {d}x\right) ^{1/p} +C(n,p)\epsilon \left( \int _\Omega |\nabla \eta \cdot \nabla u|^p\,\hbox {d}x\right) ^{1/p}\\&\quad + {K}\left( \int _\Omega |\nabla u|^p \,\hbox {d}x\right) ^{1/p} \\&\le C(n,p)\epsilon \left( \int _\Omega |\eta D^2 u|^p\,\hbox {d}x\right) ^{1/p} + \tilde{K}(1+\Vert \nabla \eta \Vert _\infty )\left( \int _\Omega |\nabla u|^p \,\hbox {d}x\right) ^{1/p}, \end{aligned}$$

where \(\tilde{K} =\tilde{K}(n,p,\varepsilon ,K)\).

Each of the above estimates hold for all values of \(s\). Therefore, by Minkowski’s inequality, if we sum over all \(s\) and combine these estimates, we get that

$$\begin{aligned} \kappa \Vert \eta D^2u\Vert _{L^p(\Omega )}\le & {} \sum _s \kappa \Vert \eta \nabla (u_{x_s})\Vert _{L^p(\Omega )} \\\le & {} C(n,p)\epsilon \Vert \eta D^2 u\Vert _{L^p(\Omega )}+ \overline{K}(1+\Vert \nabla \eta \Vert _\infty )\Vert \nabla u\Vert _{L^p(\Omega )} + n\Vert f\Vert _{L^p(\Omega )}, \end{aligned}$$

where \(\overline{K}=\overline{K}((n,p,\Lambda , \epsilon ,K)\). Since \(\epsilon >0\) is arbitrary, we can fix \(\epsilon =\kappa /2C(n,p)\) and then rearrange terms to get

$$\begin{aligned}&\Vert \eta D^2 u\Vert _{L^p(\Omega )} \nonumber \\&\qquad \le {2\overline{K}\kappa ^{-1}}(1+\Vert \nabla \eta \Vert _\infty )\Vert \nabla u\Vert _{L^p(\Omega )} +{2n}{\kappa ^{-1}}\Vert f\Vert _{L^p(\Omega )} \le C_0 (1+\Vert \nabla \eta \Vert _\infty ) \Vert f\Vert _{L^p(\Omega )},\nonumber \\ \end{aligned}$$

where the last inequality follows from (3.3), and \(C_0=C_0(p,n,\lambda ,\Lambda ,\Omega ,K)\).

To complete the proof, fix \(\Omega '\Subset \Omega \) and choose \(\eta \in C^\infty _0(\Omega )\) such that \(\eta (x) = 1\) in \(\Omega '\) and so that \(\Vert \nabla \eta \Vert \approx D^{-1}\), where \(D=\text {dist}(\Omega ',\partial \Omega )\). Without loss of generality, we may assume \(D>1\). With this choice of \(\eta \), inequality (3.6) yields the local \(W^{2,p}(\Omega )\) estimate

$$\begin{aligned} \Vert D^2 u \Vert _{L^p(\Omega ')} \le D^{-1}C\Vert f\Vert _{L^p(\Omega )}, \end{aligned}$$

where \(C=C(n, p, \lambda ,\Lambda ,K)\). Finally, if we assume that \(\partial \Omega \) is \(C^2\), we can apply the argument given in [19, p. 187] to obtain a constant \(C>0\) depending on \(K,\; p,\; n,\; \lambda \), and \(\Lambda \) so that

$$\begin{aligned} \Vert D^2 u\Vert _{L^p(\Omega )} \le C \Vert f\Vert _{L^p(\Omega )}. \end{aligned}$$

This completes the proof of inequality (1.4) when \(f\) and \(A\) are sufficiently smooth.

We will now prove that inequalities (3.7) and (3.8) hold for general \(f\) and \(A\) satisfying the hypotheses. We will only consider the latter equation as the proof of the former is essentially the same.

We will first show that we can take an arbitrary \(f\). Fix \(f\in L^p(\Omega )\), and fix a sequence of functions \(\{f_j\}\) in \(C^\infty (\Omega )\) that converge to \(f\) in \(L^p(\Omega )\). Fix \(A \in C^\infty (\Omega )\) and let \(u_j\in W^{1,p}_0(\Omega )\) be the solution to \(L u_j = f_j\), and let \(u\in W_0^{1,p}\) be the solution to \(L u = f\). By inequality (3.3) and the Sobolev inequality, we have that

$$\begin{aligned} \Vert u-u_j\Vert _{L^p(\Omega )} \le C\Vert \nabla (u-u_j)\Vert _{L^p(\Omega )} \le C\Vert f-f_j\Vert _{L^p(\Omega )}. \end{aligned}$$

Therefore, \(u_j\rightarrow u\) in \(W_0^{1,p}(\Omega )\).

Since \(f_j\) and \(A\) have the requisite smoothness, we can apply (3.8) to \(u_i-u_j\) to get

$$\begin{aligned} \Vert D^2(u_i-u_j)\Vert _{L^p(\Omega )} \le C\Vert f_i-f_j\Vert _{L^p(\Omega )}. \end{aligned}$$

Thus, the sequence \(\{u_j\}\) is Cauchy in \(W^{2,p}(\Omega )\). For \(1\le r,\,s\le n\), let \(v_{r,s}\) denote the limit of \(\{(u_j)_{x_r,x_s}\}\). Then for any \(\phi \in C_0^\infty (\Omega )\),

$$\begin{aligned} \int _\Omega u_{x_s} \phi _{x_r}\,\hbox {d}x = \lim _{j\rightarrow \infty } \int _\Omega (u_j)_{x_s} \phi _{x_r}\,\hbox {d}x = \lim _{j\rightarrow \infty } \int _\Omega (u_j)_{x_r,x_s} \phi \,\hbox {d}x = \int _\Omega v_{r,s} \phi \,\hbox {d}x. \end{aligned}$$

Therefore, \(u\in W^{2,p}(\Omega )\) and \(u_j\rightarrow u\) in \(W^{2,p}(\Omega )\). Inequality (1.4) for \(u\) now follows immediately.

Finally, we prove that we can take arbitrary \(A\in W^{1,n}(\Omega )\). Fix such an \(A\), and let \(\{A_j\}\) be a sequence of matrices in \(C^\infty (\Omega )\) that converges to \(A\) in \(W^{1,n}(\Omega )\). It follows at once from the standard construction of the \(A_j\) (cf. Adams and Fournier [2]) that we may assume that the \(A_j\) are elliptic with the same ellipticity constants as \(A\). Finally, let \(U_j=|\nabla A_j|\); then \(U_j \rightarrow U=|\nabla A|\) in \(L^n(\Omega )\). By the converse to the dominated convergence theorem (see Lieb and Loss [26, Th. 2.7]), if we pass to a subsequence, then we may assume that \(U_j \rightarrow U\) pointwise a.e., and there exists \(g\in L^n(\Omega )\) such that \(U_j(x)\le g(x)\) a.e. Therefore, by the dominated convergence theorem (again passing to a subsequence), we may assume that (3.5) holds (with fixed \(\epsilon \)) for each \(U_j\) with a constant \(K\) independent of \(j\).

Fix \(f\in L^p(\Omega )\) and let \(u_j\in W_0^{1,p}(\Omega )\) be the solution of \(-{{\mathrm{div}}}A_j \nabla u_j = f\) and let \(u\in W_0^{1,p}(\Omega )\) be the solution of \(Lu= -{{\mathrm{div}}}A \nabla u = f\). Then for any \(\phi \in C_0^\infty (\Omega )\),

$$\begin{aligned} \int _\Omega A_j\nabla u_j\cdot \nabla \phi \,\hbox {d}x = - \int _\Omega f\phi \,\hbox {d}x = \int _\Omega A\nabla u\cdot \nabla \phi \,\hbox {d}x. \end{aligned}$$


$$\begin{aligned} \int _\Omega \big (A\nabla u - A\nabla u_j+A\nabla u_j - A_j\nabla u_j\big )\nabla \phi \,\hbox {d}x = 0, \end{aligned}$$

and so by rearranging terms we have that

$$\begin{aligned} |\mathfrak {a}( u - u_j, \phi )| = \left| \int _\Omega A(\nabla u -\nabla u_j)\cdot \nabla \phi \,\hbox {d}x\right| \ \le \int _\Omega |(A-A_j)\nabla u_j\cdot \nabla \phi |\,\hbox {d}x. \end{aligned}$$

By Theorem 3.1 there exists \(\phi \) such that \(\Vert \phi \Vert _{W_0^{1,p'}(\Omega )}=1\) and \(\kappa >0\) such that

$$\begin{aligned} \kappa \Vert u-u_j\Vert _{W_0^{1,p}}\le & {} \int _\Omega |(A-A_j)\nabla u_j\cdot \nabla \phi |\,\hbox {d}x \nonumber \\\le & {} \Vert A-A_j\Vert _{L^n(\Omega )} \Vert \nabla u_j\Vert _{L^{\frac{np}{n-p}}(\Omega )} \Vert \nabla \phi \Vert _{L^{p'}(\Omega )}. \end{aligned}$$

The last estimate follows by Hölder’s inequality, since

$$\begin{aligned} \frac{1}{n}+ \frac{n-p}{np} +\frac{1}{p'} = 1. \end{aligned}$$

The last term on the right-hand side of (3.10) is at most \(1\). By our choice of the \(A_j\), the first term tends to \(0\) as \(j\rightarrow \infty \). And by the Sobolev inequality,

$$\begin{aligned} \Vert \nabla u_j\Vert _{L^{\frac{np}{n-p}}(\Omega )} \le C\Vert D^2 u_j\Vert _{L^p(\Omega )} \le C\Vert f\Vert _{L^p(\Omega )}; \end{aligned}$$

the final inequality holds since by our choice of the \(A_j\), inequality (1.4) holds for each \(u_j\) with a constant independent of \(j\). Therefore, the middle term on the righthand side of (3.10) is uniformly bounded. Hence, \(u_j\rightarrow u\) in \(W_0^{1,p}(\Omega )\).

It remains to show \(D^2u\) exists and estimate its norm. By inequality (1.4), the sequence \(\{D^2u_j\}\) is uniformly bounded in \(L^p(\Omega )\), and so has a weakly convergent subsequence. Passing to this subsequence, we can repeat the argument at (3.9) to conclude that \(u\in W^{2,p}(\Omega )\) and \(D^2u_j\) converges weakly to \(D^2u\). But then we have that

$$\begin{aligned} \Vert D^2u\Vert _{L^p(\Omega )} \le \liminf _{j\rightarrow \infty } \Vert D^2u_j\Vert _{L^p(\Omega )} \le C\Vert f\Vert _{L^p(\Omega )}, \end{aligned}$$

and this completes the proof.\(\square \)

4 The case \(n=2\)

In this section, we consider the two-dimensional case. We first construct Example 1.5 and then prove Theorems 1.6 and 1.7.

Construction of Example 1.5

Our example is adapted from one given by Clop et al. [10, p. 205] and is based on the theory of quasiregular mappings. Let \(B=B_{1/2}(0)\) and let \(z=x+iy\). Define \(f(z)=z(1-2\log |z|)\). Then

$$\begin{aligned} \partial f(z)=-2\log |z| \quad \text {and} \quad \bar{\partial }f(z)=\frac{z}{\bar{z}}, \end{aligned}$$

and so \(f\) satisfies the Beltrami equation \(\bar{\partial }f=\mu \,\partial f\) with Beltrami coefficient

$$\begin{aligned} \mu (z)=\frac{z}{\bar{z}\log (|z|^{-2})}=\frac{z^2}{|z|^2\log (|z|^{-2})}. \end{aligned}$$

If we let let \(u= {{\mathrm{Re}}}f\), that is,

$$\begin{aligned} u(x,y)=x(1-\log (x^2+y^2)), \end{aligned}$$

then \(u\) satisfies the equation

$$\begin{aligned} -{{\mathrm{div}}}(A\nabla u)=0 \end{aligned}$$

where \(A\) is the symmetric, real-valued matrix

$$\begin{aligned} A =\left[ \begin{array}{ll}\displaystyle \frac{|1-\mu |^2}{1-|\mu |^2} &{} \displaystyle \frac{-2{{\mathrm{Im}}}\mu }{1-|\mu |^2} \\ \displaystyle \frac{-2{{\mathrm{Im}}}\mu }{1-|\mu |^2}&{}\displaystyle \frac{|1+\mu |^2}{1-|\mu |^2} \end{array}\right] =\frac{1+\sigma ^2}{1-\sigma ^2}\mathbf{Id}-\frac{2}{1-\sigma ^2}\left[ \begin{array}{cc}\alpha &{} \beta \\ \beta &{} -\alpha \end{array}\right] , \end{aligned}$$


$$\begin{aligned} \sigma =|\mu |=\frac{-1}{\log (x^2+y^2)}, \quad \alpha ={{\mathrm{Re}}}\mu =\frac{x^2-y^2}{x^2+y^2}\sigma , \quad \beta ={{\mathrm{Im}}}\mu =\frac{2xy}{x^2+y^2}\sigma . \end{aligned}$$

This follows from a straightforward calculation: for the details, see [3, p. 412].

We claim that \(A\) is elliptic and in \(W^{1,2}(B)\), and that \(u\in W^{2,p}(B)\) for \(p<2\) but not when \(p=2\). By our choice of domain, \(0\le \sigma \le k=(\log 4)^{-1}\). Let \(\xi =(\xi _1,\xi _2)\in \mathbb {R}^2\); then

$$\begin{aligned} \langle A\xi ,\xi \rangle = \frac{1+\sigma ^2}{1-\sigma ^2}|\xi |^2-\frac{2\alpha (\xi _1^2-\xi _2^2) +4\beta \xi _1\xi _2}{1-\sigma ^2}. \end{aligned}$$


$$\begin{aligned} \alpha (\xi _1^2-\xi _2^2)+4\beta \xi _1\xi _2=2(\alpha ,\beta )\cdot (\xi _1^2-\xi _2^2,2\xi _1\xi _2), \end{aligned}$$

by the Cauchy-Schwarz inequality we have that

$$\begin{aligned} |2\alpha (\xi _1^2-\xi _2^2)+4\beta \xi _1\xi _2| \le 2\sqrt{\alpha ^2+\beta ^2}\sqrt{(\xi _1^2-\xi _2^2)^2+4\xi _1^2\xi _2^2}=2\sigma |\xi |^2. \end{aligned}$$


$$\begin{aligned} -2\sigma |\xi |^2\le 2\alpha (\xi _1^2-\xi _2^2)+4\beta \xi _1\xi _2\le 2\sigma |\xi |^2, \end{aligned}$$

and if we combine this with inequality (4.1), we get

$$\begin{aligned} \frac{1-k}{1+k}|\xi |^2 \le \frac{1-\sigma }{1+\sigma }|\xi |^2 \le \langle A\xi ,\xi \rangle \le \frac{1+\sigma }{1-\sigma }|\xi |^2 \le \frac{1+k}{1-k}|\xi |^2. \end{aligned}$$

Thus, \(A\) is elliptic with \(\lambda =\frac{1-k}{1+k}\) and \(\Lambda =\frac{1+k}{1-k}\).

To see that \(A=(a_{ij})\in W^{1,2}(B)\), a lengthy (and Mathematica assisted) calculation shows that

$$\begin{aligned}&\!\!\!\! \frac{\partial a_{11}}{ \partial x} =\\&-\frac{4x\left[ x^2{-}y^2{-}2y^2\log ^3(x^2{+}y^2)+(x^2{-}y^2)\log ^2(x^2{+}y^2) {+}2(x^2{+}2y^2)\log (x^2{+}y^2)\right] }{(x^2+y^2)^2(\log ^2(x^2+y^2)-1)^2} \end{aligned}$$

and the derivatives \(\frac{\partial }{\partial x} a_{ij}\) and \(\frac{\partial }{\partial y} a_{ij}\) are similar. It follows that

$$\begin{aligned} \left| \frac{\partial }{\partial x} a_{ij}\right| , \left| \frac{\partial }{\partial y} a_{ij}\right| \le C\frac{|\log ^3(x^2+y^2)|}{(x^2+y^2)^{\frac{1}{2}}(\log ^2(x^2+y^2)-1)^2} \in L^2(B). \end{aligned}$$

Finally to see that \(u\in W^{2,p}(B)\) for \(p<2\) but not in \(W^{2,2}(B)\), another calculation shows that

$$\begin{aligned} u_{xx}(x,y)=\frac{-2x(x^2+3y^2)}{(x^2+y^2)^2}, \ \ u_{xy}(x,y)=\frac{-2y(y^2-x^2)}{(x^2+y^2)^2}, \ \ u_{yy}(x,y)=\frac{-2x(x^2-y^2)}{(x^2+y^2)^2}. \end{aligned}$$

Thus, each second derivative is bounded by a constant multiple of \((x^2+y^2)^{-\frac{1}{2}}\in L^p(B)\), so \(u\in W^{2,p}\). On the other hand,

$$\begin{aligned} \int _B |u_{xx}|^2\,\hbox {d}x\hbox {d}y=\infty , \end{aligned}$$

so \(u\not \in W^{2,2}(B)\). \(\square \)

Proof of Theorem 1.6

Most of the proof is identical to the proof of Theorem 1.1, setting \(n=p=2\). However, in two places, we need to make specific changes to the proof. The proof for \(f\) and \(A\) smooth is the same up to inequality (3.5). We again split \(U\), but now we fix \(\epsilon \) (to be determined below) and find \(K\) such that

$$\begin{aligned} \Vert U\chi _{\{U>K\}}\Vert _{L^\Psi (\Omega )}<\epsilon , \end{aligned}$$

where \(\Psi (t)=t^2\log (e+t)^{1+\delta }\). (This is again possible by the dominated convergence theorem in the context of Orlicz spaces.) Let \(U=U_1+U_2=U\chi _{\{U>K\}}+U\chi _{\{U\le K\}}\); then by Lemma 2.4,

$$\begin{aligned} \left( \int _\Omega (|\eta \nabla u|U)^2\,\hbox {d}x\right) ^{1/2}\le & {} \left( \int _\Omega (|\eta \nabla u|U_1)^2\,\hbox {d}x\right) ^{1/2} +K\left( \int _\Omega |\nabla u|^2\,\hbox {d}x\right) ^{1/2}\nonumber \\\le & {} \epsilon \, C(\delta ,\Omega ) \left( \int _\Omega |D^2 u|^2\,\hbox {d}x\right) ^{1/2} +\overline{K}(1\!+\!\Vert \nabla \eta \Vert _\infty )\Vert f\Vert _{L^2(\Omega )}.\qquad \quad \end{aligned}$$

The argument now proceeds as before, yielding

$$\begin{aligned} \Vert \eta D^2u\Vert _{L^2(\Omega )}\le C_0(1+\Vert \nabla \eta \Vert _\infty )\Vert f\Vert _{L^2(\Omega )}, \end{aligned}$$

where again the constant \(C_0=C_0(n,p,\lambda ,\Lambda ,\Omega ,K)\).

The proof for arbitrary \(f\in L^2(\Omega )\) goes through exactly as before. For the proof for arbitrary \(\nabla A \in L^\Psi (\Omega )\), note first that by the Sobolev embedding theorem, we have \(A\in L^\Psi (\Omega )\). We now fix smooth \(A_j\rightarrow A\) in \(W^{1,\Psi }(\Omega )\) (the Sobolev space defined with respect to the \(L^\Psi \) norm), and we may again assume that the \(A_j\) have the same ellipticity constants and that we may choose \(K\) such that (4.2) holds for all \(U_j=|\nabla A_j|\) with a constant \(K\) independent of \(j\). This is possible since all the arguments for \(W^{1,p}(\Omega )\) extend to \(W^{1,\Psi }(\Omega )\) with almost no change. Smooth functions are dense (see  [2]), and the proof of density again shows that ellipticity constants are preserved. The converse of dominated convergence also holds in this setting; the proof is implicit in the literature. For a proof in a different context that readily adapts to Orlicz spaces, see [11, Prop. 2.67].

The proof now continues as before until inequality (3.10). Here, we need to apply the generalized Hölder’s inequality in the scale of Orlicz spaces (see [12, Lemma 5.2]). If we let \(\Phi (t)= \exp (t^{\frac{2}{1+\delta }})-1\), then

$$\begin{aligned} \Psi ^{-1}(t)\Phi ^{-1}(t) \approx \frac{t^{1/2}}{\log (e+t)^{\frac{1+\delta }{2}}} \log (e+t)^{\frac{1+\delta }{2}} \lesssim t^{1/2}. \end{aligned}$$

Therefore, we can estimate as follows:

$$\begin{aligned}&\left| \int _\Omega A(\nabla u-\nabla u_j)\cdot \nabla \phi \,\hbox {d}x\right| \le \int _\Omega |(A-A_j)\nabla u_j\cdot \nabla \phi |\,\hbox {d}x \\&\quad \le \Vert (A-A_j)\nabla u_j\Vert _{L^2(\Omega )}\Vert \nabla \phi \Vert _{L^2(\Omega )} \le C\Vert A-A_j\Vert _{L^\Psi (\Omega )} \Vert \nabla u_j\Vert _{L^\Phi (\Omega )} \Vert \nabla \phi \Vert _{L^2(\Omega )}. \end{aligned}$$

As in the previous argument, we have chosen \(\phi \) so that \(\Vert \nabla \phi \Vert _{L^2(\Omega )}\le 1\). We also have that \( \Vert A-A_j\Vert _{L^\Psi (\Omega )}\rightarrow 0\) as \(j\rightarrow \infty \). Therefore, we could complete the proof as before if we can show that

$$\begin{aligned} \Vert \nabla u_j\Vert _{L^\Phi (\Omega )} \le C\Vert f\Vert _{L^2(\Omega )} \end{aligned}$$

with a constant independent of \(j\).

Let \(\Phi _0(t) = \exp (t^2)-1\). Then for \(t\ge 1\), \(\Phi (t)\le \Phi _0(t)\), and so by the properties of Orlicz norms (see [12, Sec.5.2]) there exists a constant depending on \(\delta \) and \(\Omega \) such that \(\Vert \nabla u_j\Vert _{L^\Phi (\Omega )}\le C \Vert \nabla u_j\Vert _{L^{\Phi _0}(\Omega )}\). But by Trudinger’s inequality [34, Thm. 2.9.1] we have the endpoint Sobolev inequality:

$$\begin{aligned} \Vert \nabla u_j\Vert _{L^{\Phi _0}(\Omega )} \le C\Vert D^2u_j\Vert _{L^2(\Omega )}. \end{aligned}$$

By the first part of the proof, we have that \(\Vert D^2u_j\Vert _{L^2(\Omega )}\le C\Vert f\Vert _{L^2(\Omega )}\) with a constant independent of \(j\); combining these inequalities, we get the desired estimate and this completes the proof.\(\square \)

Proof of Theorem 1.7

The proof is nearly identical to the proof of Theorem 1.6. Let \(\Psi (t)=t^2\log (e+t)\). The first half of the proof for smooth \(f\) and \(A\) is the same until (4.3). Here, we use the off-diagonal estimate in Lemma 2.6 and Hölder’s inequality to get

$$\begin{aligned}&\left( \int _\Omega (|\eta \nabla u|U)^2\,\hbox {d}x\right) ^{1/2} \\&\qquad \qquad \le \left( \int _\Omega (|\eta \nabla u|U_1)^2\,\hbox {d}x\right) ^{1/2} +K\left( \int _\Omega |\nabla u|^2\,\hbox {d}x\right) ^{1/2}\\&\qquad \qquad \le \epsilon \, C(\delta ,\Omega ) \left( \int _\Omega |\eta D^2 u|^r\,\hbox {d}x\right) ^{1/r} +\overline{K}(1+\Vert \nabla \eta \Vert _\infty )\Vert f\Vert _{L^2(\Omega )} \\&\qquad \qquad \le \epsilon \, C(\delta ,\Omega ) |\Omega |^{\frac{1}{(2/r)'}}\left( \int _\Omega |\eta D^2 u|^2\,\hbox {d}x\right) ^{1/2} +\overline{K}(1+\Vert \nabla \eta \Vert _\infty )\Vert f\Vert _{L^2(\Omega )}. \end{aligned}$$

We can now complete the proof of the smooth case as before.

The remainder of the proof goes through as before, only now we apply the generalized Hölder’s inequality with \(\Psi (t)\) and \(\Phi (t)=\exp (t^2)-1\) and then directly apply Trudinger’s inequality.\(\square \)

Remark 4.1

Note that in the proof of Theorem 1.7 we use the regularity assumption on \(\nabla A\) in the proof of the smooth case, and use the higher integrability assumption on \(A\) in the density argument to prove the general case.