Abstract
For a well-posed non-selfadjoint indefinite second-order linear elliptic PDE with general coefficients \({\mathbf {A}}, {\mathbf {b}},\gamma \) in \(L^\infty \) and symmetric and uniformly positive definite coefficient matrix \({\mathbf {A}}\), this paper proves that mixed finite element problems are uniquely solvable and the discrete solutions are uniformly bounded, whenever the underlying shape-regular triangulation is sufficiently fine. This applies to the Raviart-Thomas and Brezzi-Douglas-Marini finite element families of any order and in any space dimension and leads to the best-approximation estimate in \(H({{\,\mathrm{div}\,}})\times L^2\) as well as in in \(L^2\times L^2\) up to oscillations. This generalises earlier contributions for piecewise Lipschitz continuous coefficients to \(L^\infty \) coefficients. The compactness argument of Schatz and Wang for the displacement-oriented problem does not apply immediately to the mixed formulation in \(H({{\,\mathrm{div}\,}})\times L^2\). But it allows the uniform approximation of some \(L^2\) contributions and can be combined with a recent \(L^2\) best-approximation result from the medius analysis. This technique circumvents any regularity assumption and the application of a Fortin interpolation operator.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
This section introduces the non-selfadjoint indefinite second-order linear elliptic PDE and its mixed formulations. A brief review of earlier results is followed by the assertion of the stability and the best-approximation results.
1.1 Non-selfadjoint indefinite second-order linear elliptic PDEs
The strong formulations for second-order elliptic problems with coefficients \(\mathbf{A},\) b, \(\gamma \) componentwise in \(L^\infty (\Omega )\) and \(f\in L^2(\Omega )\) read \( {\mathcal {L}}_j u_j =f \) a.e. in a polyhedral bounded Lipschitz domain \(\Omega \subset \mathbb {R}^n\) with homogeneous Dirichlet boundary condition \(u_j= 0\) on \(\partial \Omega \) for \(j=1,2\) and any dimension \(n\ge 2\). For all \(v\in H^1_0(\Omega )\), the two differential operators (referred to as conservative resp. divergence form throughout this paper) read
The assumption on ellipticity means that the \(n\times n\) coefficient matrix \({\mathbf {A}}(x)\) is symmetric and positive definite with eigenvalues in one universal compact interval of positive reals for a.e. \(x\in \Omega \). This makes \({\mathcal {L}}_1, \mathcal {L}_2:H^1_0(\Omega )\rightarrow H^{-1}(\Omega )\) Fredholm operators of index zero and their weak formulations \(a(v,w):= {\langle {\mathcal {L}}_1v},{w\rangle }_{H^{-1}(\Omega )\times H^1_0(\Omega )} ={\langle {\mathcal {L}}_2 w},{v\rangle }_{H^{-1}(\Omega )\times H^1_0(\Omega )} \), for all \(v,w\in H^1_0(\Omega )\), are dual to each other in the duality bracket \({\langle \bullet ,\bullet },{_\rangle }{H^{-1}(\Omega )\times H^1_0(\Omega )}\) of \(H^{-1}(\Omega )\), the dual of \(H^1_0(\Omega )\).
Throughout this paper, zero eigenvalues are excluded and the kernel (of one of these operators) \({\mathcal {L}}_j\) is supposed to be trivial, so that \( {\mathcal {L}}_1\) and \({\mathcal {L}}_2\) are bijections. It is known from the theory of bilinear forms in reflexive Banach spaces [2, 3] that this implies well-posedness and the continuous \(\inf \)-\(\sup \) condition (e.g., when \(H^1_0(\Omega )\) is endowed with the norm \(\Vert \nabla \bullet \Vert \))
The \(\inf \)-\(\sup \) constant is the same for the original and the dual problem; a(v, w) could be replaced by a(w, v) with the same \(\alpha \). The finite element error analysis is enormously simplified under additional conditions on the coefficients that lead to an ellipticity of \(a(\bullet ,\bullet )\) and allow an application of the Lax-Milgram lemma [2,3,4]. The present situation of a general non-selfadjoint indefinite second-order linear elliptic PDE avoids any of those assumptions and examines coefficients in \(L^\infty \), which satisfy the following.
Assumption(A)
There exist two global constants \({0<\underline{\alpha }\leqslant \overline{\alpha }<\infty }\) such that \(\mathbf{A}\in L^\infty (\Omega ;{\mathbb {R}}^{n\times n})\) satisfies \({\underline{\alpha }\leqslant \lambda _1(\mathbf{A}(x))\leqslant \cdots \leqslant \lambda _n(\mathbf{A}(x))\leqslant \overline{\alpha }}\) for the eigenvalues \({\lambda _1(\mathbf{A}(x))\leqslant \dots \leqslant \lambda _n(\mathbf{A}(x))}\) of the SPD \(\mathbf{A}(x)\) for a.e. \(x\in \Omega \). The functions \({\mathbf {b}}, {\mathbf {b}}_1,{\mathbf {b}}_2 \in L^\infty (\Omega ;\mathbb {R}^n)\) and \(\gamma \in L^\infty (\Omega )\) are componentwise bounded in the bounded polyhedral Lipschitz domain \(\Omega \subset \mathbb {R}^n\).
Given the various applications to porous media and ground-water flow with rough and oscillating coefficients merely bounded in a well-posed PDE, this contribution gives an affirmative answer to the fundamental question whether the mixed finite element method be used (and then is stable and provides best-approximation property at least for fine triangulations).
1.2 Earlier contributions
For conforming finite element discretizations and sufficiently small mesh sizes, [17] establishes the existence and uniqueness of conforming finite element solutions under assumption (A). The mixed formulation for the conservation (resp. divergence) equation \( {\mathcal {L}}_1 u =f \) (resp. \({\mathcal {L}}_2 u =f \)) introduces the flux variable \(\sigma = -{\mathbf {A}}\nabla u- u\,{\mathbf {b}}\) (resp. \(\sigma = -{\mathbf {A}}\nabla u\)) and seeks the solution \(x=(\sigma ,u) \in H\) to
with \({\mathbf {b}}_1:= {\mathbf {A}}^{-1} {\mathbf {b}}\), \({\mathbf {b}}_2:=0\) (resp. \({\mathbf {b}}_1:= 0\), \({{\mathbf {b}}}_2:= {\mathbf {b}} \cdot {\mathbf {A}}^{-1} \)) and \((\sigma ,\tau )_{{\mathbf {A}}^{-1}}:=({\mathbf {A}}^{-1}\sigma ,\tau )_{L^2(\Omega )}\) in
The equivalence to the boundary value problems associated with the linear differential operators in (1.1) and their well-posedness on the continuous level can be found in [5, Sect. 2]. This implies the continuous \(\inf \)-\(\sup \) conditions [2, 3]
The existence and uniqueness of discrete solutions and optimal \(L^2\) error estimates were introduced in [9] for sufficiently fine triangulations in two and three space dimensions under high regularity assumptions, where the pair \((\sigma _h,u_h)\) is approximated in \(RT_k(\mathcal {T})\times P_k(\mathcal {T})\) with the Raviart-Thomas (RT) for 2D (resp. Raviart-Thomas-Nedelec for 3D) finite elements. Global \(L^{\infty }\) and global \(L^2\) and negative norm estimates for the conservation form were discussed in [10, 14] for smooth coefficients.
Provided the coefficients \({\mathbf {A}}\) and \({\mathbf {b}}\) are Lipschitz continuous, \(\gamma \) is piecewise Lipschitz continuous and \(H^2\) regularity of the adjoint system, an interesting convergence phenomenon for the BDM finite element family is clarified in the fairly general framework of [8].
Let \(M_k(\mathcal {T})\) be any RT or BDM finite element space of degree \(k\in {\mathbb {N}}_0\) and define the discrete space \(V(\mathcal {T}):=M_k(\mathcal {T})\times P_k(\mathcal {T})\subset H\) based on a shape-regular triangulation \(\mathcal {T}\) with mesh-sizes \({\leqslant \delta }\), written \(\mathcal {T}\in \mathbb {T}(\delta )\).
In case \({\mathbf {A}}\) and \({\mathbf {b}}\) are globally Lipschitz continuous and \(\gamma \) is piecewise Lipschitz continuous, the convergence results in [8] also establish stability in the sense
for some positive \(\delta \) and \(\beta _0\). With extra work and refined arguments along the lines of [8], but with reduced elliptic regularity and solution \(u_j\in H^1_0(\Omega )\cap H^{1+s}(\Omega )\) to \( \mathcal {L}_j u_j =f \) for some \(s>0\). Those arguments are not valid under Assumption (A).
Modern trends in the mathematics of mixed finite element schemes include local stable projections with commuting properties [11,12,13]; those techniques do not seem to allow the proof of discrete stability and best-approximation under assumption (A).
Piecewise Lipschitz continuous coefficients with regularity in \(H^{1+s}\) (for some positive s) lead in [5] to stability for the lowest-order RT FEM. The equivalence to nonconforming Crouzex-Raviart finite elements holds more generally [1] and the combination with the arguments from [17] and [5] might lead to stability results under the assumption (A) for more examples. In comparison, the methodology of this paper provides stability for any degree k and any dimension n (RT and BDM merely serve as popular model examples).
1.3 Contribution of this paper
Under the Assumption (A) and for any RT or BDM finite element space \(V(\mathcal {T}):=M_k(\mathcal {T})\times P_k(\mathcal {T})\subset H:= H({{\,\mathrm{div}\,}},\Omega )\times L^2(\Omega )\) of degree \(k\in {\mathbb {N}}_0\) [2,3,4], the discrete stability (1.6) is established for small mesh-sizes, where either \({\mathbf {b}}_1:= {\mathbf {A}}^{-1} {\mathbf {b}}\) and \({\mathbf {b}}_2:=0\) or \({\mathbf {b}}_1:= 0\) and \({{\mathbf {b}}}_2:= {\mathbf {b}} \cdot {\mathbf {A}}^{-1} \) in (1.4).
Theorem 1.1
(discrete stability) For each (positive) \(\beta _0<\beta \) with \(\beta \) from (1.5), there exists \(\delta >0\) such that (1.6) holds.
This theorem implies [2, 3] that the mixed finite element problems for the RT and the BDM finite element families of any degree k and in any space dimension n are (i) uniquely solvable, (ii) uniformly bounded in H, and (iii) fullfil quasi-optimal error estimates in the norm of H, whenever the underlying shape-regular triangulation is sufficiently fine.
Theorem 1.1 and the tools of this paper lead to \(L^2\) best-approximation up to oscillations.
Theorem 1.2
(\(L^2\) best approximation) Suppose \(\delta >0\) satisfies (1.6) with \(b(\bullet ,\bullet ) \) defined for general \({\mathbf {b}}_1,{\mathbf {b}}_2\in L^{\infty }(\Omega ;\mathbb {R}^n)\) under Assumption (A). Assume \(\mathcal {T}\in \mathbb {T}(\delta )\) and that \(x:=(\sigma ,u)\in H\) (resp. \(x_h\equiv (\sigma _h,u_h)\in V_h:=V(\mathcal {T}):= M_k(\mathcal {T})\times P_k(\mathcal {T})\)) satisfy \(b(x-x_h,y_h)=0\) for all \(y_h\in V_h\). Then the following results (a) and (b) hold.
(a) There exists a positive constant \(C_1\), which exclusively depends on \(\beta _0>0\), the \(L^\infty \) norms of (all the components of) \({\mathbf {A}}^{1/2}{\mathbf {b}}_1\), \({\mathbf {A}}^{1/2}{\mathbf {b}}_2\), and \(\gamma \), as well as on the shape-regularity of \({\mathbb {T}}\), such that the piecewise mesh size \(h_\mathcal {T}\) in \(\mathcal {T}\) and the \(L^2\) projection \(\Pi _k\) onto \(P_k(\mathcal {T})\) satisfy
(b) Suppose \({\mathbf {b}}_1=0\) and that the scalar \(\gamma (x)\) is Lipschitz continuous in \(x\in {{\,\mathrm{int}\,}}(T)\), the interior of \(T\in \mathcal {T}\), with a Lipschitz constant smaller than or equal to \({{\,\mathrm{\mathcal {L}ip}\,}}(\gamma )\). Then there exists a positive constant \(C_2\), which depends exclusively depends on \(\beta _0>0\), \(\Vert {\mathbf {A}}^{1/2}{\mathbf {b}}_2\Vert _{L^\infty (\Omega )}\), \({{\,\mathrm{\mathcal {L}ip}\,}}(\gamma )\), and the shape-regularity of \({\mathbb {T}}\), such that
The additional oscillations \(\Vert h_\mathcal {T}(u-\Pi _ku)\Vert \) and \(\Vert h_\mathcal {T}( 1-\Pi _k)( {{\,\mathrm{div}\,}}\sigma )\Vert \) can be higher-order contributions and then these terms explain the improved convergence of one variant for the BDM finite element family in [8] under Assumption (A).
This article, thus, generalises earlier contributions [5, 8,9,10, 12,13,14] for smooth or piecewise Lipschitz continuous coefficients to \(L^\infty \) coefficients without any further assumptions. The compactness argument of Schatz and Wang [17] for the displacement-oriented problem does not apply immediately to the mixed formulation in \(H({{\,\mathrm{div}\,}})\times L^2\). Remark 12 below explains that no uniform \(L^2\) approximation of the divergence component holds. This paper therefore compensates the lack of compactness by the computation and analysis of an optimal test function (the dual solution y in (1.7) of Sect. 1.4 below). Recent best-approximation for the flux in \(L^2\) from the medius analysis [6, 15] combines with the compactness for the (dual) PDE. This and a careful shift of the discrete divergence circumvents the aforementioned lack of compactness in the divergence variable. In fact, this new methodology avoids any regularity argument and any Fortin interpolation at all.
1.4 Motivation
This subsection outlines the proof of the discrete \(\inf \)-\(\sup \) stability (1.6) in an abstract framework to guide the reader through the arguments. Suppose \(X_h\times Y_h\) is a finite dimensional subspace of \(H\times H\) with dual \(X_h^* \times Y_h^*\) and let \(x_h \in S(X_h)\), i.e., \(x_h\) belongs to \(X_h\) and has norm \(\Vert x_h\Vert _H=1\). Recall (1.5) and the well-posedness of the problem (1.3). Then, the dual problem is well-posed as well and \({\langle x_h},{\bullet \rangle }_H=b(\bullet ,y)\) has a unique dual solution y in the Hilbert space \((H, {\langle \bullet },{\bullet \rangle }_H)\). The continuous \(\inf \)-\(\sup \) condition (1.5) shows
is bounded. Suppose that \(y_h\in Y_h\) is a close approximation to y with \(\Vert y-y_h\Vert _H\leqslant \varepsilon \) for some positive \(\varepsilon < 1/\Vert b\Vert \), where \(\Vert b\Vert \) is the operator norm of the bilinear form \(b(\bullet ,\bullet )\). Since
it remains to bound \(\Vert y_h\Vert _H\), e.g., with the triangle inequality
The combination of the previous two displayed formulas gives a lower bound for \(\Vert b(x_h,\bullet )\Vert _{Y_h^*}\). Under the assumption that \(\varepsilon \) is independent of y and so of \(x_h\), this estimate reads
This proves \(\beta _0\leqslant \beta ( 1- \varepsilon \Vert b\Vert )/(1+\varepsilon \beta )\) provided the approximation error \(\Vert y-y_h\Vert _H\) is small independently of \(X_h\times Y_h\) and \(x_h\in S(X_h)\). A detailed investigation in Sect. 3.3 below reveals that the above strong form of a uniform approximation appears neither available in the norm of \(H=H({{\,\mathrm{div}\,}},\Omega )\times L^2(\Omega )\) (cf. Remark 3.4) nor necessary for the stability under assumption (A). Recent results from a medius analysis [6, 15] and a careful shift of the discrete divergence variable successfully circumvent a uniform approximation in H.
1.5 Structure of the paper
Section 2 starts with the pre-compactness for uniform approximation and the precise assumptions on the set of admissible triangulations \(\mathbb {T}\). The other two preliminary subsections concern the \(L^2\) best-approximation of the fluxes and some discrete approximation result for the RT finite element family.
The stability analysis in Sect. 3 is based on the dual solution y in the conservative formulation characterised in Sect. 3.1. One contribution of y involves the PDE \(\mathcal {L}_2\phi =g\) and allows for some pre-compacness and uniform approximation in Sect. 3.2. The proof of Theorem 1.1 concludes Sect. 3. A combination of the stability result (1.6) with the approximation arguments leads in Sect. 4 to Theorem 1.2, which generalises [6, 15] to non-selfadjoint indefinite second-order linear elliptic problems.
2 Preliminaries
This section introduces notations used in the paper, fixes the assumptions on the admissible triangulation \(\mathbb {T}\), discusses an abstract version of compactness argument in [17], and then recalls some \(L^2\) best-approximation property and concludes with an observation for the RT finite element family.
2.1 Notation
Standard notation on Lebesgue and Sobolev spaces \(L^2(\Omega )\), \(L^\infty (\Omega )\), \(H_0^1(\Omega )\), \(H^{-1}(\Omega )\equiv H_0^1(\Omega )^*\), and \(H({{\,\mathrm{div}\,}},\Omega )\) apply throughout this paper. The \(L^2\) scalar product \((\bullet ,\bullet )_{L^2(\Omega ) }\) induces the norm \(\Vert \bullet \Vert := \Vert \bullet \Vert _{L^2(\Omega ) }\) and the orthogonality relation \(\perp \).
Whereas \(\Vert \bullet \Vert \) denotes the norm in \(L^2(\Omega )\) with the exception of the abbreviation \(\Vert b\Vert \) for the bound of the bilinear form \(b(\bullet ,\bullet )\), the vector space \(L^2(\Omega ;\mathbb {R}^n)\) is endowed with the weighted scalar product \((\bullet ,\bullet )_{{\mathbf {A}}^{-1}}:= ({{\mathbf {A}}^{-1}}\bullet ,\bullet )_{L^2(\Omega )}\) and induced norm \(\Vert \bullet \Vert _{{\mathbf {A}}^{-1}}:= \Vert {\mathbf {A}}^{-1/2}\bullet \Vert \) and so, for any \(\tau \in L^2(\Omega ;\mathbb {R}^n)\), is its distance \({{\,\mathrm{dist}\,}}(\tau ,M_h):=\min _{\tau _h\in M_h}\Vert \tau -\tau _h\Vert _{{\mathbf {A}}^{-1}}\) to any subspace \(M_h\) of \(L^2(\Omega ;\mathbb {R}^n)\). The norm \(\Vert (\tau ,v)\Vert _H\) in the Hilbert space \(H({{\,\mathrm{div}\,}},\Omega )\) is weighted with \({\mathbf {A}}^{-1}\) in \(L^2(\Omega ;\mathbb {R}^n)\) for the flux variable so the Hilbert space \(H\equiv H({{\,\mathrm{div}\,}},\Omega )\times L^2(\Omega )\) has the weighted scalar product \({\langle \bullet ,\bullet },{_\rangle }H\) with the induced norm \(\Vert (\tau ,v)\Vert _H\),
Duality brackets have the dual pairing as an index as in \({\langle \bullet ,\bullet },{_\rangle }{H^{-1}(\Omega )\times H^1_0(\Omega )}\) above. To abbreviate the definition of \(\inf \)-\(\sup \) constants throughout this paper, let \(S(V):=\{ v\in V: \Vert v\Vert _V=1\}\) for any normed linear space \((V, \Vert \bullet \Vert _V)\).
All emerging generic positive constants \(C_1, \dots , C_8\) in this paper exclusively depend on \(\underline{\alpha }, \overline{\alpha }, \Vert {\mathbf {b}}\Vert _{L^\infty }, \Vert {\mathbf {b}}_1\Vert _{L^\infty },\Vert {\mathbf {b}}_2\Vert _{L^\infty }, \) and \(\Vert \gamma \Vert _{L^\infty }\) as well as on \(\alpha \) in (1.2) and \(\beta \) in (1.5) and on the class of admissible triangulations \(\mathbb {T}\) specified in Subsection 2.2.
2.2 Assumptions on the discretization
The finite element spaces are based on admissible triangulations, the set \(\mathbb {T}\) of all of those has certainly infinite cardinality; the point is that the constants in standard interpolation error estimates become universal through uniform shape regularity.
Definition 2.1
(admissible triangulations) The set of admissible triangulations \(\mathbb {T}\) is a set of shape-regular triangulations of the polyhedral bounded Lipschitz domain \(\Omega \subset \mathbb {R}^n\) into simplices with uniform shape regularity and arbitrary small mesh sizes. Let \(h_{\max }(\mathcal {T}):= \max h_\mathcal {T}\) for the piecewise constant mesh-size \(h_\mathcal {T}\) for \(\mathcal {T}\in \mathbb {T}\), defined by \(h_\mathcal {T}|_T:=\mathrm{diam}(T)\) in \(T\in \mathcal {T}\), and abbreviate \(\mathcal {T}(\delta ):=\{\mathcal {T}\in \mathbb {T}: h_{\max }(\mathcal {T})\leqslant \delta \}\).
Given \(\mathcal {T}\in \mathbb {T}\), let \(P_k(T)\) denote the polynomials of total degree at most \(k\in {\mathbb {N}}_0\) seen as functions on \(T\in \mathcal {T}\in \mathbb {T}\) and set \(P_k(\mathcal {T}):=\{ v_k\in L^\infty (\Omega ): \forall T\in \mathcal {T},\; v_k|_T\in P_k(T)\}\). Let \(\Pi _k:L^2(\Omega )\rightarrow L^2(\Omega )\) be the \(L^2\) projection onto \(P_k(\mathcal {T})\) with respect to \(\mathcal {T}\in \mathbb {T}\).
Definition 2.2
(discrete spaces) Any \(\mathcal {T}\in \mathbb {T}\) is associated to the finite-dimensional subspace \(V(\mathcal {T})=M_k(\mathcal {T})\times P_k(\mathcal {T})\) of \(V:= L^2(\Omega ;\mathbb {R}^n)\times L^2(\Omega )\) with \(M_k(\mathcal {T}):=RT_k(\mathcal {T})\) or \(M_k(\mathcal {T}):=BDM_k(\mathcal {T})\) of order \(k\in {\mathbb {N}}_0\) from [2].
The best-approximation error reads \({{\,\mathrm{dist}\,}}(v, V(\mathcal {T})):= \inf \{ \Vert v-v_h\Vert : v_h \in V(\mathcal {T})\}\) with the weighted \(L^2\) norm, \(\Vert v\Vert ^2=\Vert \tau \Vert ^2_{{\mathbf {A}}^{-1}} +\Vert w\Vert ^2\) for \(v=(\tau ,w)\in V\). The density of smooth functions and standard approximation results for smooth functions proves the well-known pointwise convergence in the sense that each \(v\in V\) satisfies [2]
2.3 Pre-compactness
This subsection adopts the key argument of [17].
Lemma 2.3
(uniform approximation on compact sets) Suppose that K is a non-empty pre-compact subset of \(V:= L^2(\Omega ;\mathbb {R}^n)\times L^2(\Omega )\) with (2.2) for each \(v\in K\). Then
Proof
Given any \(\varepsilon >0\) and \(v\in K\), let \(B(v,\varepsilon /2)\) be the open ball in V with center v and radius \( \varepsilon /2\). The open cover \(\{B(v,\varepsilon /2):v\in K\}\) of the compact set \(\overline{K}\) contains a finite sub-cover and so there exist \(k_1,\dots ,k_J\in K\) with \(K\subset \bigcup _{j=1,\dots ,J} B(k_j,\varepsilon /2)\). For each \(k_j\in K\), (2.2) leads to \(\delta _j>0\) such that \(\mathcal {T}\in \mathcal {T}(\delta _j)\) implies \({{\,\mathrm{dist}\,}}(k_j,V(\mathcal {T}))<\varepsilon /2 \). Then \(\delta :=\min \{ \delta _1, \dots ,\delta _J\} \) implies \(\mathbb {T}(\delta )\subset \bigcap _{j=1,\dots ,J}\mathbb {T}( \delta _j)\). Given any \(\mathcal {T}\in \mathbb {T}(\delta )\) and any \(v\in K\subset \bigcup _{j=1,\dots ,J} B(k_j,\varepsilon /2)\), there exists \(j\in \{1,\dots ,J\}\) with \(\Vert v-k_j \Vert <\varepsilon /2\). Since \(\mathcal {T}\in \mathbb {T}( \delta _j)\), \({{\,\mathrm{dist}\,}}(k_j,V(\mathcal {T}))<\varepsilon /2 \). This and a triangle inequality show \( {{\,\mathrm{dist}\,}}(v,V(\mathcal {T}))\leqslant \Vert v-k_j \Vert +{{\,\mathrm{dist}\,}}(k_j,V(\mathcal {T}))<\varepsilon /2+\varepsilon /2 =\varepsilon . \) \(\square \)
The application of the previous lemma to the finite element approximation of the solution of the PDE reads as follows.
Lemma 2.4
(uniform approximation of solutions) For any \(\varepsilon >0\) there exists some \(\delta >0\) such that, given any \(g\in L^2(\Omega )\) and the weak solution \(\phi \in H^1_0(\Omega ) \) to \(\mathcal {L}_2\phi = g \) (with \({\mathcal {L}}_2 \) from (1.1)), the vector \(v:= ({\mathbf {A}} \nabla \phi , {\mathbf {b}}\cdot \nabla \phi +\gamma \phi )\in V\) satisfies \( \sup _{\mathcal {T}\in \mathbb {T}(\delta )} {{\,\mathrm{dist}\,}}( v , V(\mathcal {T})) \leqslant \epsilon \,\Vert g\Vert . \)
Proof
The linear and bounded bijective differential operator \(\mathcal {L}_2:H^1_0(\Omega )\rightarrow H^{-1}(\Omega )\) has a bounded inverse. The embedding \(\iota : L^2(\Omega )\hookrightarrow H^{-1}(\Omega )\) is compact and so is the composition \({\mathcal {L}}_2^{-1} \circ \iota :L^2(\Omega )\rightarrow H^1_0(\Omega )\). Define the operator \(T: L^2(\Omega )\rightarrow L^2(\Omega ;\mathbb {R}^n)\times L^2(\Omega )\) for any \(g\in L^2(\Omega )\) by
Since \({\mathcal {L}}_2^{-1}\circ \iota \) is compact, \(K:= T(S(L^2(\Omega ))\) is pre-compact in \( V= L^2(\Omega ;\mathbb {R}^n)\times L^2(\Omega )\). Given any \(\varepsilon >0\) the approximation result (2.2) and Lemma 2.3 lead to a positive \(\delta \) with (2.3). Consequently, the assertion \(\sup _{\mathcal {T}\in \mathbb {T}(\delta )} {{\,\mathrm{dist}\,}}( v , V(\mathcal {T})) \leqslant \epsilon \,\Vert g\Vert \) holds for all \(g\in S(L^2(\Omega ))\) and corresponding \(v:= ({\mathbf {A}} \nabla \phi , {\mathbf {b}}\cdot \nabla \phi +\gamma \phi )\in V\). A rescaling proves the result for all \(g\in L^2(\Omega )\).
2.4 \(L^2\) best-approximation of the fluxes
The medius analysis of mixed finite element methods employs arguments from a priori and a posteriori error analysis [6, 15] to prove new \(L^2\) best-approximation results. Recall that \(\Pi _k\) is the \(L^2\) projection onto \(P_k(\mathcal {T})\) and \(h_\mathcal {T}\) is the mesh-size associated to \(\mathcal {T}\).
Lemma 2.5
(flux \(L^2\) best-approximation) There exists a constant \(C_3\), which depends on the shape-regularity in \(\mathcal {T}\), on \(\Omega \) and on \(\underline{\alpha }, \overline{\alpha }\), such for any \({\mathbf {p}}\in H({{\,\mathrm{div}\,}},\Omega )\) and any \(\mathcal {T}\in \mathbb {T}\), there exists \({\mathbf {p}}_h\in M_k(\mathcal {T})\) such that \({{\,\mathrm{div}\,}}{\mathbf {p}}_h =\Pi _k {{\,\mathrm{div}\,}}{\mathbf {p}}\) and
This is the \(L^2\) best-approximation result from [15, Lemma 5.1] for mixed finite element approximations for the unit matrix \({\mathbf {A}}\). Although with a different focus, the paper [6] introduces a general framework with a mesh-dependent norm \(\Vert \bullet \Vert _h\) in \(P_k(\mathcal {T})\); while [11, Eq (3.6)] presents a localized refinement of this lemma.
Proof
Given \({\mathbf {p}}\in H({{\,\mathrm{div}\,}},\Omega )\), the right-hand sides \(F(w):=(w,{{\,\mathrm{div}\,}}{\mathbf {p}})_{L^2(\Omega ) }\) and \(G({\mathbf {q}}):= ({\mathbf {p}},{\mathbf {q}}) \) lead in the elliptic mixed formulation (for the Laplacian)
to the unique solution \((\sigma ,u)\equiv ({\mathbf {p}},0)\in H\). Its straight-forward mixed finite element discretisation substitutes H by \(V_h:=M_k(\mathcal {T})\times P_k(\mathcal {T})\) and leads to a unique discrete solution \(({\mathbf {p}}_h,v_h)\in V_h\) with \({{\,\mathrm{div}\,}}{\mathbf {p}}_h=\Pi _k {{\,\mathrm{div}\,}}{\mathbf {p}}\). This and [6, Thm 2.2] lead to the asserted best-approximation result (in terms of (non-weighted) \(L^2\) norms)
The constant \(C_4\) from [6, 15] does not depend on the coefficients \(\mathbf{A},\) b, \(\gamma \) but depends on the shape-regularity in \(\mathcal {T}\) and on \(\Omega \). The equivalence of norms concludes the proof and leads to the asserted constant \(C_3\), which depends on \(C_4\) and \(\underline{\alpha }, \overline{\alpha }\). \(\square \)
2.5 A discrete approximation result for Raviart-Thomas functions
In any space-dimension n and degree k, the RT functions satisfy a rather particular approximation estimate with the componentwise \(L^2\) projection \(\Pi _k\) onto \(P_k(\mathcal {T})\).
Lemma 2.6
Any \(\tau _{RT}\in RT_k(\mathcal {T})\) satisfies \( \Vert \tau _{RT}- \Pi _k\tau _{RT}\Vert \leqslant \frac{n}{(n+1)(n+k)} \Vert h_\mathcal {T}{{\,\mathrm{div}\,}}\tau _{RT}\Vert . \)
The proof will be postponed to the appendix because of its focus on the RT finite element shape functions. The statement of the above lemma fails for the BDM finite element family.
3 Stability analysis
This section deals with approximation of fluxes and stability result. The design of a test function in the proof of a discrete \(\inf \)-\(\sup \) condition is based on the characterisation and approximation of a dual solution.
3.1 Dual solution and conservative formulation
The inner structure of the dual solution y exploits the elliptic PDE and generates some compactness argument in the subsequent subsection. Recall that the operator \(\mathcal {L}_2 :H^1_0(\Omega )\rightarrow H^{-1}(\Omega )\) from (1.1) is bijective.
Theorem 3.1
(dual solution in conservative formulation) Suppose \({\mathbf {b}}_1:= {\mathbf {A}}^{-1} {\mathbf {b}}\) and \({\mathbf {b}}_2\equiv 0\) a.e. in \(\Omega \) in (1.4). Then \(x=(\sigma ,u)\in H\) and \(y=(\zeta ,z)\in H\) satisfy \({\langle x},{\bullet \rangle }_{H}=b(\bullet , y)\) in H if and only if
for the weak solution \(\phi \in H^1_0(\Omega )\) to \( {\mathcal {L}}_2\phi =g:= {\mathbf {b}} \cdot {\mathbf {A}}^{-1} \sigma +(\gamma -1)\,{{\,\mathrm{div}\,}}\sigma -u\in L^2(\Omega )\).
The function \(\phi \) originates from a known \(L^2\) orthogonal decomposition
with \( H(\mathrm{div}{=}0,\Omega ):=\{ \tau \in H({{\,\mathrm{div}\,}},\Omega ):\, {{\,\mathrm{div}\,}}\tau =0\text { a.e. in }\Omega \} \). The decomposition (3.1) is also useful in the proof of equivalence of the displacement formulation with the differential operators in (1.1) to the mixed formulations with (1.3).
Proof of Theorem 3.1
For the general version of the bilinear form \(b(\bullet ,\bullet )\), the equation \({\langle x},{\bullet \rangle }_H=b(\bullet , y)\) is equivalent to \(u = {\mathbf { b}}_1\cdot \zeta +\gamma \,z-{{\,\mathrm{div}\,}}\zeta \) a.e. in \(\Omega \) and
The test with \(\tau \in H(\mathrm{div}{=}0,\Omega )\) proves that \({\mathbf {A}}^{-1} (\sigma -\zeta ) + z {\mathbf {b}}_2\perp H(\mathrm{div}{=}0,\Omega )\) and so (3.1) leads to \(\phi \in H^1_0(\Omega )\) with
This identity allows the substitution of \({\mathbf {A}}^{-1} (\zeta -\sigma ) -z {\mathbf {b}}_2\) in the above formula (3.2) with general \(\tau \in H({{\,\mathrm{div}\,}},\Omega )\). Then, an integration by parts shows the resulting identity \((\phi +z-{{\,\mathrm{div}\,}}\sigma ,{{\,\mathrm{div}\,}}\tau )=0\). The surjectivity of \({{\,\mathrm{div}\,}}:H({{\,\mathrm{div}\,}},\Omega )\rightarrow L^2(\Omega )\) proves
The combination of the three preceding identities leads to the PDE
in the sense of distributions. Since \({\mathbf {b}}_2=0\), the right-hand side g belongs to \(L^2\). This proves one direction of the assertion; the direct proof of the converse is omitted. \(\square \)
Remark 3.2
(no divergence formulation) The proof shows the extra term \(-{{\,\mathrm{div}\,}}( z {\mathbf {A}} {\mathbf {b}}_2)\in H^{-1}(\Omega )\) in case (1.4) is considered for non-zero \({\mathbf {b}}_2\in L^\infty (\Omega ;\mathbb {R}^n)\). This term does not belong to \(L^2(\Omega )\) under Assumption (A) and is, therefore, excluded.
3.2 Approximation of the fluxes
The subsequent lemma describes the uniform approximation of the flux variable by a combination of the compactness argument and the \(L^2\) best-approximation of Subsections 2.3 and 2.4.
Lemma 3.3
(flux approximation) Given any \(\varepsilon >0\), there exists \(\delta >0\) such that the following holds for all \(\mathcal {T}\in \mathbb {T}(\delta )\) and \(g\in L^2(\Omega )\). There exists some \({\mathbf {p}}_h\in M_k(\mathcal {T})\) that approximates \({\mathbf {p}} := {\mathbf {A}}\nabla \phi \in H({{\,\mathrm{div}\,}},\Omega )\) for the weak solution \(\phi \in H^1_0(\Omega )\) to \({\mathcal {L}}_2\phi = g\) with
Proof
Given any \(\varepsilon >0\) and the constant \(C_3\) from Lemma 2.5, Lemma 2.4 leads to a positive \(\delta \leqslant \min \{1, 2^{-1} \epsilon / C_3\}\) with
(the distance is with respect to the weighted norm \(\Vert \bullet \Vert _{{\mathbf {A}}^{-1}}\) in \(L^2(\Omega ;\mathbb {R}^n)\) and \(\Vert \bullet \Vert \) in \(L^2(\Omega )\)). Lemma 2.5 applies to \({\mathbf {p}}:={\mathbf {A}}\nabla \phi \) with \({{\,\mathrm{div}\,}}{\mathbf {p}}= {\mathbf {b}}\cdot \nabla \phi +\gamma \phi -g\in L^2(\Omega )\) and, for any \(\mathcal {T}\in \mathbb {T}(\delta )\), leads to some approximation \({\mathbf {p}}_h\in M_k(\mathcal {T})\) with \({{\,\mathrm{div}\,}}{\mathbf {p}}_h =\Pi _k {{\,\mathrm{div}\,}}{\mathbf {p}}\) and
This concludes the proof. \(\square \)
Remark 3.4
(no uniform approximation in \(H({{\,\mathrm{div}\,}})\)) Lemma 3.3 does not state a uniform approximation estimate for the divergence and, in fact, an estimate of the form \( \Vert {{\,\mathrm{div}\,}}({\mathbf {p}}-{\mathbf {p}}_h)\Vert \leqslant \epsilon \Vert g\Vert \) cannot hold in general. To see this, adopt the notation of the proof of Lemma 3.3 and a reverse triangle inequality for
The flux \({\mathbf {p}}= {\mathbf {A}}\nabla \mathcal {L}_2^{-1}g\) depends on \(g\in S(L^2(\Omega ))\) and so does the crucial term \( \Vert {{\,\mathrm{div}\,}}({\mathbf {p}}- {\mathbf {p}}_h)\Vert = \Vert (1-\Pi _k){{\,\mathrm{div}\,}}{\mathbf {p}}\Vert \). Hence, \(\sup \{ \Vert g-\Pi _kg \Vert : g\in S(L^2(\Omega ))\}=1\) implies
Therefore, the approximation error \( \Vert {{\,\mathrm{div}\,}}({\mathbf {p}}- {\mathbf {p}}_h)\Vert \) will not tend to zero uniformly for all \(g\in S(L^2(\Omega ))\) as \(\epsilon \) and \(\delta \) tend to zero. \(\square \)
Example 3.5
(RT approximation in \(H({{\,\mathrm{div}\,}})\) for particular g) The stability analysis in Subsection 1.4 concerns a discrete \(x_h:=(\sigma _h,u_h)\) with norm \(\Vert (\sigma _h,u_h)\Vert _H=1\) and leads in Theorem 3.1 to the particular right-hand side \(g:= {\mathbf {b}} \cdot {\mathbf {A}}^{-1} \sigma _h +(\gamma -1)\,{{\,\mathrm{div}\,}}\sigma _h-u_h\) with \(\Vert g\Vert \leqslant C_5<\infty \) for the essential supremum \(C_5^2\) of \(|{\mathbf {A}}^{-1/2}{\mathbf {b}}|^2 + |\gamma -1|^2+1\) in \(\Omega \). This g allows for a uniform approximation of \({\mathbf {p}}\) by \({\mathbf {p}}_h\) in \(H({{\,\mathrm{div}\,}},\Omega )\) for the RT finite element family.
For instance, in the extreme case of piecewise constant coefficients, \(g-\Pi _kg= {\mathbf {b}} \cdot {\mathbf {A}}^{-1}(1-\Pi _k)\sigma _h\). With \(C_6:=\Vert {\mathbf {A}}^{-1}{\mathbf {b}} \Vert _{L^\infty (\Omega )} \), Lemma 2.6 shows \( \Vert g-\Pi _kg\Vert \leqslant \delta C_6\). The combination with Lemma 3.3 lead to \({\mathbf {p}}_h\) with \( \Vert {\mathbf {p}}-{\mathbf {p}}_h\Vert _{H({{\,\mathrm{div}\,}},\Omega )}\leqslant \epsilon \,C_5+ \delta \,C_6. \) This and the arguments of Subsection 1.4 lead to the discrete stability (1.6).
3.3 Proof of theorem 1.1
Given any \(0< \varepsilon < \beta /\Vert b\Vert \), choose \(\delta >0\) as in Lemma 3.3. Suppose \(\mathcal {T}\in \mathbb {T}(\delta )\) and let \(x_h=(\sigma _h,u_h)\in V_h:=M_k(\mathcal {T})\times P_k(\mathcal {T})\) have norm \(\Vert x_h\Vert _H=1\) and define \(g:= {\mathbf {b}} \cdot {\mathbf {A}}^{-1} \sigma _h +(\gamma -1)\,{{\,\mathrm{div}\,}}\sigma _h-u_h\). Replace x by \(x_h\) in Theorem 3.1 and let \(\phi \in H^1_0(\Omega )\) solve \( {\mathcal {L}}_2\phi =g\) to define \(\zeta = \sigma _h-{\mathbf {A}} \nabla \phi \) and \(z= {{\,\mathrm{div}\,}}\sigma _h-\phi \). Then \(y=(\zeta ,z)\in H\) is the dual solution and solves \({\langle x_h},{\bullet \rangle }_{H}=b(\bullet , y)\) in H for the bilinear form (1.4) (with \({\mathbf {b}}_1:= {\mathbf {A}}^{-1} {\mathbf {b}}\) and \({\mathbf {b}}_2\equiv 0\) a.e. in \(\Omega \)). Lemma 3.3 applies to \({\mathbf {p}}:= {\mathbf {A}}\nabla \phi \) and leads to \({\mathbf {p}}_h\) with \({{\,\mathrm{div}\,}}{\mathbf {p}}_h=\Pi _k{{\,\mathrm{div}\,}}{\mathbf {p}}\) and \(\Vert {\mathbf {p}}-{\mathbf {p}}_h\Vert _{{\mathbf {A}}^{-1}}\leqslant \varepsilon \, C_5\). Hence, \(\zeta _h:=\sigma _h- {\mathbf {p}}_h \) and \(z_h:=\Pi _kz\) define \(y_h:=(\zeta _h,z_h)\in V_h\) with
as \(\Vert {{\,\mathrm{div}\,}}\zeta _h\Vert =\Vert \Pi _k{{\,\mathrm{div}\,}}\zeta \Vert \leqslant \Vert {{\,\mathrm{div}\,}}\zeta \Vert \), \(\Vert z_h\Vert =\Vert \Pi _kz\Vert \leqslant \Vert z\Vert \) and \({{\,\mathrm{div}\,}}\sigma _h\in P_k(\mathcal {T})\). Moreover,
Piecewise Poincaré inequalities (with the Payne-Weinberger constant \(1/\pi \) for convex domains [16]) show \(\Vert \phi -\Pi _k\phi \Vert \leqslant \delta /\pi \Vert \nabla \phi \Vert \). Recall that \(H^1_0(\Omega )\) is endowed with the seminorm \(\Vert \nabla \bullet \Vert \) and let \(C_F\) denote the constant in the Friedrichs inequality \(\Vert \bullet \Vert \leqslant C_F\Vert \nabla \bullet \Vert \) in \(H^1_0(\Omega )\). Note that (1.2) leads to \(\alpha \, \Vert \nabla \phi \Vert \leqslant \sup \{ a(\psi ,\phi ) : \psi \in H^1_0(\Omega ) , \; \Vert \nabla \psi \Vert =1\}\). Hence \( a(\psi ,\phi )={\langle \mathcal {L}_2 \phi },{\psi \rangle }_{H^{-1}(\Omega )\times H^1_0(\Omega )} = {\langle g},{\psi \rangle }_{H^{-1}(\Omega )\times H^1_0(\Omega )} \leqslant C_F\Vert g\Vert \) implies \(\alpha \, \Vert \nabla \phi \Vert \leqslant C_F\,C_5\). The combination with Poincaré inequalities shows \(\Vert \phi -\Pi _k\phi \Vert \leqslant \delta C_F\,C_5/(\alpha \,\pi )\) and so
Since \(u_h\perp {{\,\mathrm{div}\,}}(\zeta -\zeta _h)\) and \(z-z_h\perp {{\,\mathrm{div}\,}}\sigma _h\) (\(\perp \) denotes orthogonality in \(L^2(\Omega )\)),
with the constant \( C_7^2:= \max \{ 2 , 2 \Vert {\mathbf {A}}^{-1/2}{\mathbf {b}} \Vert _{L^\infty (\Omega ) }^2 + \Vert \gamma \Vert _{L^\infty (\Omega ) }^2\}\). The arguments of Sect. 1.4 lead to
Since \(\beta \Vert y\Vert _H \leqslant \Vert b(\bullet , y)\Vert _{H^*}=\Vert x_h\Vert _H=1\) implies \(\Vert y\Vert _H \leqslant 1/\beta \), (3.3) reads \(\Vert y_h\Vert _H \leqslant 1/\beta + \varepsilon \, C_5\). This and (3.4) verify
Since \(x_h\) was arbitrary in \(S(V_h)\) (with \(V_h\) endowed with the norm in H),
Relabelling \(\epsilon \) and \(\delta \) proves the assertion: For any \(0< \beta _0<\beta \), there exists \(\delta >0\) with (1.6). This establishes the theorem for the conservative version of the mixed finite element discretisation with \({\mathbf {b}}_1:= {\mathbf {A}}^{-1} {\mathbf {b}}\) and \({\mathbf {b}}_2\equiv 0\) a.e. in \(\Omega \).
To deduce the same \(\inf \)-\(\sup \) constant for the other variant, define \(b_1(\bullet ,\bullet ) \) (resp. \(b_2(\bullet ,\bullet ) \)) by (1.6) for \({\mathbf {b}}_1:= {\mathbf {A}}^{-1} {\mathbf {b}}\) and \({\mathbf {b}}_2\equiv 0\) (resp. \({\mathbf {b}}_1:= 0\) and \({\mathbf {b}}_2:= {\mathbf {b}} \cdot {\mathbf {A}}^{-1} \)) a.e. in \(\Omega \). The above proof shows \(0<\beta _0\leqslant \beta _h\) and it is elementary to see that
A direct calculation shows \(b_1((\tau , -v),(\sigma ,-u))=b_2((\sigma ,u), (\tau ,v))\) for all \((\sigma ,u), (\tau ,v)\in H\). This and a duality argument (singular values of a square matrix coincide with those of its transposed) in the last equality show
Hence, the divergence formulation has the same discrete \(\inf \)-\(\sup \) constant \(\beta _h\). \(\square \)
Remark 3.6
(\(\delta \) dependence) The size of \(\delta \) in (1.6) is hidden behind a compactness argument of Lemma 3.3. Besides the norms and parameters mentioned in Assumption (A), the mapping properties of \({\mathcal {L}}_2^{-1}\) are of relevance as well. A review of the proofs of this paper shows that there is a finite sub-cover of \(S(L^2(\Omega ))\) with small balls in \(H^{-1}(\Omega )\) that leads to a finite number of (without loss of generality) smooth functions \(k_1,\dots , k_J\) as in the proof of Lemma 2.3. The size of \(\delta \) is related to the approximation properties of the weak solutions \(\Phi _j\) to \({\mathcal {L}}_2\Phi _j=k_j\) a.e. The regularity properties of \(\Phi _j\in H^1_0(\Omega )\) are not characterised for Assumption (A): In fact, it is unknown whether \(\Phi _j\) belongs to any \(H^{1+s}(\Omega )\) for any \(s>0\). Under Assumption (B) and reduced elliptic regularity, however, the afore mentioned approximation properties could be quantified more and reveal further information on \(\delta \).
4 \(L^2\) best-approximation
The notation of Theorem 1.2 applies throughout this section with continuous and discrete solutions \(x=(\sigma ,u)\) and \(x_h=(\sigma _h,u_h)\).
4.1 Proof of theorem 1.2.a
Given \({\mathbf {p}}:=\sigma \in H({{\,\mathrm{div}\,}},\Omega )\) and \(\mathcal {T}\in \mathbb {T}(\delta )\), choose \(\sigma _h^*:={\mathbf {p}}_h\in M_k(\mathcal {T})\) as in Lemma 2.5, and define \(e_h:=(\sigma _h- \sigma _h^*, u_h-\Pi _ku)\in V_h\). Given \(\beta _0>0\) in (1.6) there exists some \(y_h=(\tau _h,v_h)\in V_h\) with \(\Vert y_h\Vert _H=1\) and
Since \(u-\Pi _ku\perp {{\,\mathrm{div}\,}}\tau _h\) and \(v_h\perp {{\,\mathrm{div}\,}}(\sigma -\sigma ^*_h)\), the last term is equal to
in terms of the weighted \(L^2\) norm \(\Vert \bullet \Vert _L\leqslant \Vert \bullet \Vert _H\) in H with \(\Vert (\tau _h,v_h )\Vert _L^2:=\Vert \tau _h\Vert _{{\mathbf {A}}^{-1}}^2+ \Vert v_h\Vert ^2 \leqslant 1\) and with \(C_8^2= 1+\Vert {\mathbf {A}}^{1/2}{\mathbf {b}}_1\Vert _{L^\infty (\Omega )}^2 +\Vert {\mathbf {A}}^{1/2}{\mathbf {b}}_2\Vert _{L^\infty (\Omega )}^2 + \Vert \gamma \Vert _{L^\infty (\Omega )}^2 \). Consequently, \(\Vert e_h\Vert _H\leqslant \Big (C_8/\beta _0\Big )\, \Vert (\sigma -\sigma ^*_h, u-\Pi _ku) \Vert _L \). Lemma 2.5 shows
This and the distance \({{\,\mathrm{dist}\,}}_L\) (measured in the norm \(\Vert \bullet \Vert _L\)) lead to
This, (4.1), and a triangle inequality conclude the proof. \(\square \)
4.2 Proof of theorem 1.2.b
Throughout this subsection, let \({\mathbf {b}}_1\equiv 0\) and \({\mathbf {b}}_2:= {\mathbf {A}}^{-1} {\mathbf {b}}\) a.e. in \(\Omega \) in (1.4) and let \(f\in L^2(\Omega )\) be a fixed right-hand side for the continuous and discrete problem \({\mathcal {L}}_2u=f\).
Return to the proof of the previous subsection with \(e_h\) and follow the first lines until
Abbreviate \(\overline{\gamma }:=\Pi _0\gamma \) and utilize the Lipschitz continuity of the coefficients \(\gamma \) on each simplex
This controls the above term \((u-\Pi _ku, \gamma \, v_h)_{L^2(\Omega ) }\leqslant \Vert \Pi _k(\gamma (u-\Pi _ku)) \Vert \,\Vert v_h\Vert \) and leads to
A triangle inequality in \(L^2(\Omega ;\mathbb {R}^n)\) is followed by (4.1) in the proof of
This completes the rest of the proof. \(\square \)
4.3 Conservative formulation
Theorem 2.a includes an error estimate for the conservative formulation with \({\mathbf {b}}_1:= {\mathbf {A}}^{-1} {\mathbf {b}}\) and \({\mathbf {b}}_2:=0\) in (1.4) and \(\sigma = -{\mathbf {A}}\nabla u- u\,{\mathbf {b}}\) with \({{\,\mathrm{div}\,}}\sigma =f-\gamma u\). The refined analog of Theorem 2.b is not expected because of an extra term exemplified in the extreme case of piecewise constant coefficients \({\mathbf {b}}_1\) and \(\gamma \). The arguments of Subsection 4.1 lead to
The last term is not of higher order for the BDM finite element family as pointed out in [8] through numerical evidence. For the RT finite element family, however, Lemma 2.6 shows \(\Vert \tau _h -\Pi _k\tau _h\Vert { \lesssim } \Vert h_\mathcal {T}\, {{\,\mathrm{div}\,}}\tau _h\Vert \) and then leads to a higher-order contribution in the asserted inequality of Theorem 1.2.b as the final result. \(\square \)
The arguments could be generalised, but those result are of limited relevance as the convergence order is not generally improved in comparison with Theorem 2.a. An exception is the example of [7, Sect 3.5] (with \({\mathbf {b}}=0=\gamma \) on the unit ball) when Theorem 1.2.b guarantees \(O(\delta ^2)\) for the \(L^2\) flux error for \(k=0\).
References
Arbogast, T., Chen, Z.: On the implementation of mixed methods as nonconforming methods for second-order elliptic problems. Math. Comput. 64(211), 943–972 (1995)
Boffi, D., Brezzi, F., Fortin, M.: Mixed Finite Element Methods and Applications, Springer Series in Computational Mathematics, vol. 44. Springer, Heidelberg (2013)
Braess, D.: Finite Elements, 3rd edn. Cambridge University Press, Cambridge (2007)
Brenner, S.C., Scott, L.R.: The mathematical theory offinite element methods. In: Marsden, J.E., Sirovich, L., Antman S.S. (eds.) Texts in Applied Mathematics, third ed., vol. 15. Springer, New York (2008)
Carstensen, C., Dond, A.K., Nataraj, N., Pani, A.K.: Error analysis of nonconforming and mixed FEMs for second-order linear non-selfadjoint and indefinite elliptic problems. Numer. Math. 133(3), 557–597 (2016)
Carstensen, C., Gallistl, D., Schedensack, M.: \(L^2\) best approximation of the elastic stress in the Arnold-Winther FEM. IMA J. Numer. Anal. 36(3), 1096–1119 (2016)
Carstensen, C., Peterseim, D., Schedensack, M.: Comparison results of finite element methods for the Poisson model problem. SIAM J. Numer. Anal. 50(6), 2803–2823 (2012)
Demlow, A.: Suboptimal and optimal convergence in mixed finite element methods. SIAM J. Numer. Anal. 39(6), 1938–1953 (2002)
Douglas, J., Jr., Roberts, J.E.: Mixed finite element methods for second order elliptic problems. Mat. Apl. Comput. 1(1), 91–103 (1982)
Douglas, J., Jr., Roberts, J.E.: Global estimates for mixed methods for second order elliptic equations. Math. Comp. 44(169), 39–52 (1985)
Ern, A., Gudi, T., Smears, I., Vohralik, M.: Equivalence of local- and global-best approximations, a simple stable local commuting projector, and optimal hp approximation estimates in H(div). IMA J. Numer. Anal. 00, 1–27 (2021). https://doi.org/10.1093/imanum/draa103
Ern, A., Guermond, J.-L.: Theory and Practices of Finite Elements, 1st edn. Springer-Verlag, New York (2004)
Ern, A., Guermond, J.-L.: Mollification in strongly Lipschitz domains with application to continuous and discrete de Rham complexes. Comput. Methods Appl. Math. 16(1), 51–75 (2016)
Gastaldi, L., Nochetto, R.H.: On \(L^\infty \)-accuracy of mixed finite element methods for second order elliptic problems. Mat. Apl. Comput. 7(1), 13–39 (1988)
Huang, J., Xu, Y.: Convergence and complexity of arbitrary order adaptive mixed element methods for the Poisson equation. Sci. China Math. 55(5), 1083–1098 (2012)
Payne, L.E., Weinberger, H.F.: An optimal Poincaré inequality for convex domains. Arch. Ration. Mech. Anal. 5(1), 286–292 (1960)
Schatz, A.H., Wang, J.P.: Some new error estimates for Ritz-Galerkin methods with minimal regularity assumptions. Math. Comput. 65(213), 19–27 (1996)
Acknowledgements
The research of the first author has been supported by the Deutsche Forschungsgemeinschaft in the Priority Program 1748 under the project “foundation and application of generalied mixed FEM towards nonlinear problems in solid mechanics” (CA 151/22-2). The finalization of this paper has been supported by DST SERB MATRICS grant No. MTR/2017/000199 (NN), MATRICS grant No. MTR/201S/000309 (AKP) and SPARC project (id 235) entitled the mathematics and computation of plates.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix: proof of Lemma 2.6
Appendix: proof of Lemma 2.6
For any simplex \(T\subset \mathbb {R}^n\), let \(P_k(T;\mathbb {R}^n)\) be the linear space of vector-valued polynomials \(q_k\) of degree at most k in any component and let \(\Vert \bullet \Vert \) abbreviate the \(L^2\) norm \(\Vert \bullet \Vert _{L^2(T)}\) on T. The particular structure of the RT function \(\tau _{RT}\) leads to some polynomial \(g\in P_k(T)\) and
The argument x (will always belong to T) is often neglected as in \(\tau _{RT}:=\tau _{RT}(x)\) or \(p_k=p_k(x)\), while (with a small inconsistency, but the right emphasis) written out in the leading term \(g(x)\, x\). The latter polynomial is either identically zero or of exact degree \(k+1\) in the sense that g is a sum of monomials of exact degree k. Adopt a multi-index notation with \(\alpha =(\alpha _1,\dots ,\alpha _n)\in \mathbb {N}_0^n\) and the monomial \(x^\alpha :=x_1^{\alpha _1}x_2^{\alpha _2}\cdots x_n^{\alpha _n}\) of degree \(k=|\alpha |:=\alpha _1+\dots +\alpha _n\) for any \(x=(x_1,\dots ,x_n)\in T\). With real coefficients \(c_\alpha \) for any \(\alpha \in {\mathbb {N}}_0^n\) of degree \(|\alpha |=k\),
(The symbol \(|\alpha |=k\) under the sum sign abbreviates the set of all multi-indices \(\alpha =(\alpha _1,\dots ,\alpha _n)\in {\mathbb {N}}_0^n\) of degree k). The divergence
of the vector-valued polynomial \(g(x)\, x\) of degree \(k+1\) with respect to x is computed with the observation that, for any \(\alpha \in {\mathbb {N}}_0^n\) with \(|\alpha |=k\),
Consequently, \(x\cdot \nabla g(x)= k\, g(x)\) and
This proves \({{\,\mathrm{div}\,}}\tau _{RT}={{\,\mathrm{div}\,}}(g(x)\, x)+q_{k-1}= (n+k)\, g(x)+q_{k-1}\) for some \(q_{k-1}\in P_{k-1}(T)\). The comparison with \(\tau _{RT}= g(x)\, x + p_{k} \) leads to some polynomial remainder \(r_{k}\in P_{k}(T; \mathbb {R}^n)\) in
In other words, since \({{\,\mathrm{div}\,}}\tau _{RT}\in P_k(T)\),
for any constant vector c. For instance, the center of inertia \(c={{\,\mathrm{mid}\,}}(T)\) of T with diameter \(h_T\) satisfies \(|x-{{\,\mathrm{mid}\,}}(T)|\leqslant \left( n/(n+1)\right) h_T\) for all \(x\in T\). This leads to
This proves \(\Vert \tau _{RT}- \Pi _k\tau _{RT}\Vert _{L^2(T)} \leqslant \frac{n \, \ h_T}{(n+1)(n+k)} \Vert {{\,\mathrm{div}\,}}\tau _{RT}\Vert _{L^2(T)}\). \(\square \)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Carstensen, C., Nataraj, N. & Pani, A.K. Stability of mixed FEMs for non-selfadjoint indefinite second-order linear elliptic PDEs. Numer. Math. 150, 975–992 (2022). https://doi.org/10.1007/s00211-022-01282-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00211-022-01282-3