Stability of mixed FEMs for non-selfadjoint indefinite second-order linear elliptic PDEs

Carstensen, C.; Nataraj, Neela; Pani, Amiya K.

doi:10.1007/s00211-022-01282-3

Stability of mixed FEMs for non-selfadjoint indefinite second-order linear elliptic PDEs

Open access
Published: 01 April 2022

Volume 150, pages 975–992, (2022)
Cite this article

Download PDF

You have full access to this open access article

Numerische Mathematik Aims and scope Submit manuscript

Stability of mixed FEMs for non-selfadjoint indefinite second-order linear elliptic PDEs

Download PDF

C. Carstensen^1,2,
Neela Nataraj² &
Amiya K. Pani²

1670 Accesses
Explore all metrics

Abstract

For a well-posed non-selfadjoint indefinite second-order linear elliptic PDE with general coefficients ${\mathbf {A}}, {\mathbf {b}},\gamma $ in $L^\infty $ and symmetric and uniformly positive definite coefficient matrix ${\mathbf {A}}$, this paper proves that mixed finite element problems are uniquely solvable and the discrete solutions are uniformly bounded, whenever the underlying shape-regular triangulation is sufficiently fine. This applies to the Raviart-Thomas and Brezzi-Douglas-Marini finite element families of any order and in any space dimension and leads to the best-approximation estimate in $H({{\,\mathrm{div}\,}})\times L^2$ as well as in in $L^2\times L^2$ up to oscillations. This generalises earlier contributions for piecewise Lipschitz continuous coefficients to $L^\infty $ coefficients. The compactness argument of Schatz and Wang for the displacement-oriented problem does not apply immediately to the mixed formulation in $H({{\,\mathrm{div}\,}})\times L^2$. But it allows the uniform approximation of some $L^2$ contributions and can be combined with a recent $L^2$ best-approximation result from the medius analysis. This technique circumvents any regularity assumption and the application of a Fortin interpolation operator.

A priori error estimates of expanded mixed FEM for Kirchhoff type parabolic equation

Article 07 February 2019

Error analysis of nonconforming and mixed FEMs for second-order linear non-selfadjoint and indefinite elliptic problems

Article 18 July 2015

Stability to a class of doubly nonlinear very singular parabolic equations

Article 10 April 2021

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

This section introduces the non-selfadjoint indefinite second-order linear elliptic PDE and its mixed formulations. A brief review of earlier results is followed by the assertion of the stability and the best-approximation results.

1.1 Non-selfadjoint indefinite second-order linear elliptic PDEs

The strong formulations for second-order elliptic problems with coefficients $\mathbf{A},$ b, $\gamma $ componentwise in $L^\infty (\Omega )$ and $f\in L^2(\Omega )$ read $ {\mathcal {L}}_j u_j =f $ a.e. in a polyhedral bounded Lipschitz domain $\Omega \subset \mathbb {R}^n$ with homogeneous Dirichlet boundary condition $u_j= 0$ on $\partial \Omega $ for $j=1,2$ and any dimension $n\ge 2$. For all $v\in H^1_0(\Omega )$, the two differential operators (referred to as conservative resp. divergence form throughout this paper) read

$$\begin{aligned} {\mathcal {L}}_1 v := -\nabla \cdot ({\mathbf {A}}\nabla v+v\, {\mathbf {b}})+ \gamma \, v \quad \text {and}\quad {\mathcal {L}}_2 v := -\nabla \cdot ({\mathbf {A}}\nabla v) + {\mathbf {b}}\cdot \nabla v+ \gamma \, v.\quad \nonumber \\ \end{aligned}$$

(1.1)

The assumption on ellipticity means that the $n\times n$ coefficient matrix ${\mathbf {A}}(x)$ is symmetric and positive definite with eigenvalues in one universal compact interval of positive reals for a.e. $x\in \Omega $. This makes ${\mathcal {L}}_1, \mathcal {L}_2:H^1_0(\Omega )\rightarrow H^{-1}(\Omega )$ Fredholm operators of index zero and their weak formulations $a(v,w):= {\langle {\mathcal {L}}_1v},{w\rangle }_{H^{-1}(\Omega )\times H^1_0(\Omega )} ={\langle {\mathcal {L}}_2 w},{v\rangle }_{H^{-1}(\Omega )\times H^1_0(\Omega )} $, for all $v,w\in H^1_0(\Omega )$, are dual to each other in the duality bracket ${\langle \bullet ,\bullet },{_\rangle }{H^{-1}(\Omega )\times H^1_0(\Omega )}$ of $H^{-1}(\Omega )$, the dual of $H^1_0(\Omega )$.

Throughout this paper, zero eigenvalues are excluded and the kernel (of one of these operators) ${\mathcal {L}}_j$ is supposed to be trivial, so that $ {\mathcal {L}}_1$ and ${\mathcal {L}}_2$ are bijections. It is known from the theory of bilinear forms in reflexive Banach spaces [2, 3] that this implies well-posedness and the continuous $\inf $-$\sup $ condition (e.g., when $H^1_0(\Omega )$ is endowed with the norm $\Vert \nabla \bullet \Vert $)

$$\begin{aligned} 0<\alpha := \inf _{v\in H^1_0(\Omega )\setminus \{0\}} \sup _{v\in H^1_0(\Omega )\setminus \{0\}} \frac{ a(v,w)}{ \Vert \nabla v\Vert \, \Vert \nabla w\Vert } . \end{aligned}$$

(1.2)

The $\inf $-$\sup $ constant is the same for the original and the dual problem; a(v, w) could be replaced by a(w, v) with the same $\alpha $. The finite element error analysis is enormously simplified under additional conditions on the coefficients that lead to an ellipticity of $a(\bullet ,\bullet )$ and allow an application of the Lax-Milgram lemma [2,3,4]. The present situation of a general non-selfadjoint indefinite second-order linear elliptic PDE avoids any of those assumptions and examines coefficients in $L^\infty $, which satisfy the following.

Assumption(A)

There exist two global constants ${0<\underline{\alpha }\leqslant \overline{\alpha }<\infty }$ such that $\mathbf{A}\in L^\infty (\Omega ;{\mathbb {R}}^{n\times n})$ satisfies ${\underline{\alpha }\leqslant \lambda _1(\mathbf{A}(x))\leqslant \cdots \leqslant \lambda _n(\mathbf{A}(x))\leqslant \overline{\alpha }}$ for the eigenvalues ${\lambda _1(\mathbf{A}(x))\leqslant \dots \leqslant \lambda _n(\mathbf{A}(x))}$ of the SPD $\mathbf{A}(x)$ for a.e. $x\in \Omega $. The functions ${\mathbf {b}}, {\mathbf {b}}_1,{\mathbf {b}}_2 \in L^\infty (\Omega ;\mathbb {R}^n)$ and $\gamma \in L^\infty (\Omega )$ are componentwise bounded in the bounded polyhedral Lipschitz domain $\Omega \subset \mathbb {R}^n$.

Given the various applications to porous media and ground-water flow with rough and oscillating coefficients merely bounded in a well-posed PDE, this contribution gives an affirmative answer to the fundamental question whether the mixed finite element method be used (and then is stable and provides best-approximation property at least for fine triangulations).

1.2 Earlier contributions

For conforming finite element discretizations and sufficiently small mesh sizes, [17] establishes the existence and uniqueness of conforming finite element solutions under assumption (A). The mixed formulation for the conservation (resp. divergence) equation $ {\mathcal {L}}_1 u =f $ (resp. ${\mathcal {L}}_2 u =f $) introduces the flux variable $\sigma = -{\mathbf {A}}\nabla u- u\,{\mathbf {b}}$ (resp. $\sigma = -{\mathbf {A}}\nabla u$) and seeks the solution $x=(\sigma ,u) \in H$ to

$$\begin{aligned} b(x,y)=(f,v)_{L^2(\Omega )} \quad \text {for all }y=(\tau ,v) \in H :=H({{\,\mathrm{div}\,}}, \Omega ) \times L^2(\Omega ) \end{aligned}$$

(1.3)

with ${\mathbf {b}}_1:= {\mathbf {A}}^{-1} {\mathbf {b}}$, ${\mathbf {b}}_2:=0$ (resp. ${\mathbf {b}}_1:= 0$, ${{\mathbf {b}}}_2:= {\mathbf {b}} \cdot {\mathbf {A}}^{-1} $) and $(\sigma ,\tau )_{{\mathbf {A}}^{-1}}:=({\mathbf {A}}^{-1}\sigma ,\tau )_{L^2(\Omega )}$ in

$$\begin{aligned} b(x,y)&=(\sigma ,\tau )_{{\mathbf {A}}^{-1}} - (u,{{\,\mathrm{div}\,}}\tau )_{L^2(\Omega )} + (v,{{\,\mathrm{div}\,}}\sigma )_{L^2(\Omega )} \nonumber \\&\quad +(u, {\mathbf {b}}_1\cdot \tau )_{L^2(\Omega )} -(v,{\mathbf { b}}_2\cdot \sigma )_{L^2(\Omega )} +(\gamma \, u,v)_{L^2(\Omega )}. \end{aligned}$$

(1.4)

The equivalence to the boundary value problems associated with the linear differential operators in (1.1) and their well-posedness on the continuous level can be found in [5, Sect. 2]. This implies the continuous $\inf $-$\sup $ conditions [2, 3]

$$\begin{aligned} 0<\beta := \inf _{x\in H \setminus \{0\}} \sup _{y\in H\setminus \{0\}} \frac{ b(x,y)}{ \Vert x\Vert _{H} \Vert y\Vert _H } = \inf _{y\in H \setminus \{0\}} \sup _{x\in H\setminus \{0\}} \frac{ b(x,y)}{ \Vert x\Vert _{H} \Vert y\Vert _H } . \end{aligned}$$

(1.5)

The existence and uniqueness of discrete solutions and optimal $L^2$ error estimates were introduced in [9] for sufficiently fine triangulations in two and three space dimensions under high regularity assumptions, where the pair $(\sigma _h,u_h)$ is approximated in $RT_k(\mathcal {T})\times P_k(\mathcal {T})$ with the Raviart-Thomas (RT) for 2D (resp. Raviart-Thomas-Nedelec for 3D) finite elements. Global $L^{\infty }$ and global $L^2$ and negative norm estimates for the conservation form were discussed in [10, 14] for smooth coefficients.

Provided the coefficients ${\mathbf {A}}$ and ${\mathbf {b}}$ are Lipschitz continuous, $\gamma $ is piecewise Lipschitz continuous and $H^2$ regularity of the adjoint system, an interesting convergence phenomenon for the BDM finite element family is clarified in the fairly general framework of [8].

Let $M_k(\mathcal {T})$ be any RT or BDM finite element space of degree $k\in {\mathbb {N}}_0$ and define the discrete space $V(\mathcal {T}):=M_k(\mathcal {T})\times P_k(\mathcal {T})\subset H$ based on a shape-regular triangulation $\mathcal {T}$ with mesh-sizes ${\leqslant \delta }$, written $\mathcal {T}\in \mathbb {T}(\delta )$.

In case ${\mathbf {A}}$ and ${\mathbf {b}}$ are globally Lipschitz continuous and $\gamma $ is piecewise Lipschitz continuous, the convergence results in [8] also establish stability in the sense

$$\begin{aligned} { 0<\beta _0\leqslant \inf _{ \mathcal {T}\in \mathbb {T}(\delta )} \inf _{x_h\in V(\mathcal {T})\setminus \{0\}}\sup _{y_h\in V(\mathcal {T})\setminus \{0\}} \frac{ b(x_h,y_h) }{ \Vert x_h\Vert _H\Vert y_h\Vert _H } (=: \beta _h)} \end{aligned}$$

(1.6)

for some positive $\delta $ and $\beta _0$. With extra work and refined arguments along the lines of [8], but with reduced elliptic regularity and solution $u_j\in H^1_0(\Omega )\cap H^{1+s}(\Omega )$ to $ \mathcal {L}_j u_j =f $ for some $s>0$. Those arguments are not valid under Assumption (A).

Modern trends in the mathematics of mixed finite element schemes include local stable projections with commuting properties [11,12,13]; those techniques do not seem to allow the proof of discrete stability and best-approximation under assumption (A).

Piecewise Lipschitz continuous coefficients with regularity in $H^{1+s}$ (for some positive s) lead in [5] to stability for the lowest-order RT FEM. The equivalence to nonconforming Crouzex-Raviart finite elements holds more generally [1] and the combination with the arguments from [17] and [5] might lead to stability results under the assumption (A) for more examples. In comparison, the methodology of this paper provides stability for any degree k and any dimension n (RT and BDM merely serve as popular model examples).

1.3 Contribution of this paper

Under the Assumption (A) and for any RT or BDM finite element space $V(\mathcal {T}):=M_k(\mathcal {T})\times P_k(\mathcal {T})\subset H:= H({{\,\mathrm{div}\,}},\Omega )\times L^2(\Omega )$ of degree $k\in {\mathbb {N}}_0$ [2,3,4], the discrete stability (1.6) is established for small mesh-sizes, where either ${\mathbf {b}}_1:= {\mathbf {A}}^{-1} {\mathbf {b}}$ and ${\mathbf {b}}_2:=0$ or ${\mathbf {b}}_1:= 0$ and ${{\mathbf {b}}}_2:= {\mathbf {b}} \cdot {\mathbf {A}}^{-1} $ in (1.4).

Theorem 1.1

(discrete stability) For each (positive) $\beta _0<\beta $ with $\beta $ from (1.5), there exists $\delta >0$ such that (1.6) holds.

This theorem implies [2, 3] that the mixed finite element problems for the RT and the BDM finite element families of any degree k and in any space dimension n are (i) uniquely solvable, (ii) uniformly bounded in H, and (iii) fullfil quasi-optimal error estimates in the norm of H, whenever the underlying shape-regular triangulation is sufficiently fine.

Theorem 1.1 and the tools of this paper lead to $L^2$ best-approximation up to oscillations.

Theorem 1.2

($L^2$ best approximation) Suppose $\delta >0$ satisfies (1.6) with $b(\bullet ,\bullet ) $ defined for general ${\mathbf {b}}_1,{\mathbf {b}}_2\in L^{\infty }(\Omega ;\mathbb {R}^n)$ under Assumption (A). Assume $\mathcal {T}\in \mathbb {T}(\delta )$ and that $x:=(\sigma ,u)\in H$ (resp. $x_h\equiv (\sigma _h,u_h)\in V_h:=V(\mathcal {T}):= M_k(\mathcal {T})\times P_k(\mathcal {T})$) satisfy $b(x-x_h,y_h)=0$ for all $y_h\in V_h$. Then the following results (a) and (b) hold.

(a) There exists a positive constant $C_1$, which exclusively depends on $\beta _0>0$, the $L^\infty $ norms of (all the components of) ${\mathbf {A}}^{1/2}{\mathbf {b}}_1$, ${\mathbf {A}}^{1/2}{\mathbf {b}}_2$, and $\gamma $, as well as on the shape-regularity of ${\mathbb {T}}$, such that the piecewise mesh size $h_\mathcal {T}$ in $\mathcal {T}$ and the $L^2$ projection $\Pi _k$ onto $P_k(\mathcal {T})$ satisfy

$$\begin{aligned}C_1^{-1}\left( \Vert \sigma -\sigma _h\Vert _{{\mathbf {A}}^{-1}}+\Vert u - u_h \Vert \right)\leqslant & {} \min _{\tau _h \in M_k(\mathcal {T})} \Vert \sigma -\tau _h \Vert _{{\mathbf {A}}^{-1}}\\&+\Vert u-\Pi _ku \Vert +\Vert h_\mathcal {T}( 1-\Pi _k){{\,\mathrm{div}\,}}\sigma \Vert . \end{aligned}$$

(b) Suppose ${\mathbf {b}}_1=0$ and that the scalar $\gamma (x)$ is Lipschitz continuous in $x\in {{\,\mathrm{int}\,}}(T)$, the interior of $T\in \mathcal {T}$, with a Lipschitz constant smaller than or equal to ${{\,\mathrm{\mathcal {L}ip}\,}}(\gamma )$. Then there exists a positive constant $C_2$, which depends exclusively depends on $\beta _0>0$, $\Vert {\mathbf {A}}^{1/2}{\mathbf {b}}_2\Vert _{L^\infty (\Omega )}$, ${{\,\mathrm{\mathcal {L}ip}\,}}(\gamma )$, and the shape-regularity of ${\mathbb {T}}$, such that

$$\begin{aligned} {C_2^{-1} \Vert \sigma -\sigma _h \Vert _{{\mathbf {A}}^{-1}} \leqslant \min _{\tau _h\in M_k(\mathcal {T}) } \Vert \sigma -\tau _h \Vert _{{\mathbf {A}}^{-1}} + \Vert h_\mathcal {T}(u-\Pi _ku)\Vert +\Vert h_\mathcal {T}( 1-\Pi _k) {{\,\mathrm{div}\,}}\sigma \Vert .} \end{aligned}$$

The additional oscillations $\Vert h_\mathcal {T}(u-\Pi _ku)\Vert $ and $\Vert h_\mathcal {T}( 1-\Pi _k)( {{\,\mathrm{div}\,}}\sigma )\Vert $ can be higher-order contributions and then these terms explain the improved convergence of one variant for the BDM finite element family in [8] under Assumption (A).

This article, thus, generalises earlier contributions [5, 8,9,10, 12,13,14] for smooth or piecewise Lipschitz continuous coefficients to $L^\infty $ coefficients without any further assumptions. The compactness argument of Schatz and Wang [17] for the displacement-oriented problem does not apply immediately to the mixed formulation in $H({{\,\mathrm{div}\,}})\times L^2$. Remark 12 below explains that no uniform $L^2$ approximation of the divergence component holds. This paper therefore compensates the lack of compactness by the computation and analysis of an optimal test function (the dual solution y in (1.7) of Sect. 1.4 below). Recent best-approximation for the flux in $L^2$ from the medius analysis [6, 15] combines with the compactness for the (dual) PDE. This and a careful shift of the discrete divergence circumvents the aforementioned lack of compactness in the divergence variable. In fact, this new methodology avoids any regularity argument and any Fortin interpolation at all.

1.4 Motivation

This subsection outlines the proof of the discrete $\inf $-$\sup $ stability (1.6) in an abstract framework to guide the reader through the arguments. Suppose $X_h\times Y_h$ is a finite dimensional subspace of $H\times H$ with dual $X_h^* \times Y_h^*$ and let $x_h \in S(X_h)$, i.e., $x_h$ belongs to $X_h$ and has norm $\Vert x_h\Vert _H=1$. Recall (1.5) and the well-posedness of the problem (1.3). Then, the dual problem is well-posed as well and ${\langle x_h},{\bullet \rangle }_H=b(\bullet ,y)$ has a unique dual solution y in the Hilbert space $(H, {\langle \bullet },{\bullet \rangle }_H)$. The continuous $\inf $-$\sup $ condition (1.5) shows

$$\begin{aligned} \beta \, \Vert y\Vert _H\leqslant \Vert b(\bullet ,y)\Vert _{H^*}=\Vert x_h\Vert _H=1,\quad \text {whence } \Vert y\Vert _H\leqslant 1/\beta \end{aligned}$$

(1.7)

is bounded. Suppose that $y_h\in Y_h$ is a close approximation to y with $\Vert y-y_h\Vert _H\leqslant \varepsilon $ for some positive $\varepsilon < 1/\Vert b\Vert $, where $\Vert b\Vert $ is the operator norm of the bilinear form $b(\bullet ,\bullet )$. Since

$$\begin{aligned} 1= b(x_h,y)=b(x_h,y_h)+ b(x_h,y-y_h)\leqslant \Vert b(x_h,\bullet )\Vert _{Y_h^*} \Vert y_h\Vert _H +\varepsilon \Vert b\Vert , \end{aligned}$$

it remains to bound $\Vert y_h\Vert _H$, e.g., with the triangle inequality

$$\begin{aligned} \Vert y_h\Vert _H\leqslant \Vert y\Vert _H+\Vert y-y_h\Vert _H\leqslant 1/\beta +\varepsilon . \end{aligned}$$

The combination of the previous two displayed formulas gives a lower bound for $\Vert b(x_h,\bullet )\Vert _{Y_h^*}$. Under the assumption that $\varepsilon $ is independent of y and so of $x_h$, this estimate reads

$$\begin{aligned} \beta \frac{ 1- \varepsilon \Vert b\Vert }{1+\varepsilon \beta } \leqslant \beta _h:= \inf _{x_h\in X_h \setminus \{0\}} \sup _{y_h\in Y_h\setminus \{0\}} \frac{ b(x_h,y_h)}{ \Vert x_h\Vert _{H} \Vert y_h\Vert _H }. \end{aligned}$$

(1.8)

This proves $\beta _0\leqslant \beta ( 1- \varepsilon \Vert b\Vert )/(1+\varepsilon \beta )$ provided the approximation error $\Vert y-y_h\Vert _H$ is small independently of $X_h\times Y_h$ and $x_h\in S(X_h)$. A detailed investigation in Sect. 3.3 below reveals that the above strong form of a uniform approximation appears neither available in the norm of $H=H({{\,\mathrm{div}\,}},\Omega )\times L^2(\Omega )$ (cf. Remark 3.4) nor necessary for the stability under assumption (A). Recent results from a medius analysis [6, 15] and a careful shift of the discrete divergence variable successfully circumvent a uniform approximation in H.

1.5 Structure of the paper

Section 2 starts with the pre-compactness for uniform approximation and the precise assumptions on the set of admissible triangulations $\mathbb {T}$. The other two preliminary subsections concern the $L^2$ best-approximation of the fluxes and some discrete approximation result for the RT finite element family.

The stability analysis in Sect. 3 is based on the dual solution y in the conservative formulation characterised in Sect. 3.1. One contribution of y involves the PDE $\mathcal {L}_2\phi =g$ and allows for some pre-compacness and uniform approximation in Sect. 3.2. The proof of Theorem 1.1 concludes Sect. 3. A combination of the stability result (1.6) with the approximation arguments leads in Sect. 4 to Theorem 1.2, which generalises [6, 15] to non-selfadjoint indefinite second-order linear elliptic problems.

2 Preliminaries

This section introduces notations used in the paper, fixes the assumptions on the admissible triangulation $\mathbb {T}$, discusses an abstract version of compactness argument in [17], and then recalls some $L^2$ best-approximation property and concludes with an observation for the RT finite element family.

2.1 Notation

Standard notation on Lebesgue and Sobolev spaces $L^2(\Omega )$, $L^\infty (\Omega )$, $H_0^1(\Omega )$, $H^{-1}(\Omega )\equiv H_0^1(\Omega )^*$, and $H({{\,\mathrm{div}\,}},\Omega )$ apply throughout this paper. The $L^2$ scalar product $(\bullet ,\bullet )_{L^2(\Omega ) }$ induces the norm $\Vert \bullet \Vert := \Vert \bullet \Vert _{L^2(\Omega ) }$ and the orthogonality relation $\perp $.

Whereas $\Vert \bullet \Vert $ denotes the norm in $L^2(\Omega )$ with the exception of the abbreviation $\Vert b\Vert $ for the bound of the bilinear form $b(\bullet ,\bullet )$, the vector space $L^2(\Omega ;\mathbb {R}^n)$ is endowed with the weighted scalar product $(\bullet ,\bullet )_{{\mathbf {A}}^{-1}}:= ({{\mathbf {A}}^{-1}}\bullet ,\bullet )_{L^2(\Omega )}$ and induced norm $\Vert \bullet \Vert _{{\mathbf {A}}^{-1}}:= \Vert {\mathbf {A}}^{-1/2}\bullet \Vert $ and so, for any $\tau \in L^2(\Omega ;\mathbb {R}^n)$, is its distance ${{\,\mathrm{dist}\,}}(\tau ,M_h):=\min _{\tau _h\in M_h}\Vert \tau -\tau _h\Vert _{{\mathbf {A}}^{-1}}$ to any subspace $M_h$ of $L^2(\Omega ;\mathbb {R}^n)$. The norm $\Vert (\tau ,v)\Vert _H$ in the Hilbert space $H({{\,\mathrm{div}\,}},\Omega )$ is weighted with ${\mathbf {A}}^{-1}$ in $L^2(\Omega ;\mathbb {R}^n)$ for the flux variable so the Hilbert space $H\equiv H({{\,\mathrm{div}\,}},\Omega )\times L^2(\Omega )$ has the weighted scalar product ${\langle \bullet ,\bullet },{_\rangle }H$ with the induced norm $\Vert (\tau ,v)\Vert _H$,

$$\begin{aligned} \Vert (\tau ,v)\Vert _H^2:= \Vert \tau \Vert _{{\mathbf {A}}^{-1}}^2 +\Vert {{\,\mathrm{div}\,}}\tau \Vert ^2+\Vert v\Vert ^2 \quad \text {for all }(\tau ,v)\in H. \end{aligned}$$

(2.1)

Duality brackets have the dual pairing as an index as in ${\langle \bullet ,\bullet },{_\rangle }{H^{-1}(\Omega )\times H^1_0(\Omega )}$ above. To abbreviate the definition of $\inf $-$\sup $ constants throughout this paper, let $S(V):=\{ v\in V: \Vert v\Vert _V=1\}$ for any normed linear space $(V, \Vert \bullet \Vert _V)$.

All emerging generic positive constants $C_1, \dots , C_8$ in this paper exclusively depend on $\underline{\alpha }, \overline{\alpha }, \Vert {\mathbf {b}}\Vert _{L^\infty }, \Vert {\mathbf {b}}_1\Vert _{L^\infty },\Vert {\mathbf {b}}_2\Vert _{L^\infty }, $ and $\Vert \gamma \Vert _{L^\infty }$ as well as on $\alpha $ in (1.2) and $\beta $ in (1.5) and on the class of admissible triangulations $\mathbb {T}$ specified in Subsection 2.2.

2.2 Assumptions on the discretization

The finite element spaces are based on admissible triangulations, the set $\mathbb {T}$ of all of those has certainly infinite cardinality; the point is that the constants in standard interpolation error estimates become universal through uniform shape regularity.

Definition 2.1

(admissible triangulations) The set of admissible triangulations $\mathbb {T}$ is a set of shape-regular triangulations of the polyhedral bounded Lipschitz domain $\Omega \subset \mathbb {R}^n$ into simplices with uniform shape regularity and arbitrary small mesh sizes. Let $h_{\max }(\mathcal {T}):= \max h_\mathcal {T}$ for the piecewise constant mesh-size $h_\mathcal {T}$ for $\mathcal {T}\in \mathbb {T}$, defined by $h_\mathcal {T}|_T:=\mathrm{diam}(T)$ in $T\in \mathcal {T}$, and abbreviate $\mathcal {T}(\delta ):=\{\mathcal {T}\in \mathbb {T}: h_{\max }(\mathcal {T})\leqslant \delta \}$.

Given $\mathcal {T}\in \mathbb {T}$, let $P_k(T)$ denote the polynomials of total degree at most $k\in {\mathbb {N}}_0$ seen as functions on $T\in \mathcal {T}\in \mathbb {T}$ and set $P_k(\mathcal {T}):=\{ v_k\in L^\infty (\Omega ): \forall T\in \mathcal {T},\; v_k|_T\in P_k(T)\}$. Let $\Pi _k:L^2(\Omega )\rightarrow L^2(\Omega )$ be the $L^2$ projection onto $P_k(\mathcal {T})$ with respect to $\mathcal {T}\in \mathbb {T}$.

Definition 2.2

(discrete spaces) Any $\mathcal {T}\in \mathbb {T}$ is associated to the finite-dimensional subspace $V(\mathcal {T})=M_k(\mathcal {T})\times P_k(\mathcal {T})$ of $V:= L^2(\Omega ;\mathbb {R}^n)\times L^2(\Omega )$ with $M_k(\mathcal {T}):=RT_k(\mathcal {T})$ or $M_k(\mathcal {T}):=BDM_k(\mathcal {T})$ of order $k\in {\mathbb {N}}_0$ from [2].

The best-approximation error reads ${{\,\mathrm{dist}\,}}(v, V(\mathcal {T})):= \inf \{ \Vert v-v_h\Vert : v_h \in V(\mathcal {T})\}$ with the weighted $L^2$ norm, $\Vert v\Vert ^2=\Vert \tau \Vert ^2_{{\mathbf {A}}^{-1}} +\Vert w\Vert ^2$ for $v=(\tau ,w)\in V$. The density of smooth functions and standard approximation results for smooth functions proves the well-known pointwise convergence in the sense that each $v\in V$ satisfies [2]

$$\begin{aligned} \lim _{\delta \rightarrow 0^+} \sup _{\mathcal {T}\in \mathbb {T}(\delta )} {{\,\mathrm{dist}\,}}(v, V(\mathcal {T}))=0. \end{aligned}$$

(2.2)

2.3 Pre-compactness

This subsection adopts the key argument of [17].

Lemma 2.3

(uniform approximation on compact sets) Suppose that K is a non-empty pre-compact subset of $V:= L^2(\Omega ;\mathbb {R}^n)\times L^2(\Omega )$ with (2.2) for each $v\in K$. Then

$$\begin{aligned} \lim _{\delta \rightarrow 0^+} \sup _{v\in K}\sup _{\mathcal {T}\in \mathbb {T}(\delta )} {{\,\mathrm{dist}\,}}(v, V(\mathcal {T}))=0. \end{aligned}$$

(2.3)

Proof

Given any $\varepsilon >0$ and $v\in K$, let $B(v,\varepsilon /2)$ be the open ball in V with center v and radius $ \varepsilon /2$. The open cover $\{B(v,\varepsilon /2):v\in K\}$ of the compact set $\overline{K}$ contains a finite sub-cover and so there exist $k_1,\dots ,k_J\in K$ with $K\subset \bigcup _{j=1,\dots ,J} B(k_j,\varepsilon /2)$. For each $k_j\in K$, (2.2) leads to $\delta _j>0$ such that $\mathcal {T}\in \mathcal {T}(\delta _j)$ implies ${{\,\mathrm{dist}\,}}(k_j,V(\mathcal {T}))<\varepsilon /2 $. Then $\delta :=\min \{ \delta _1, \dots ,\delta _J\} $ implies $\mathbb {T}(\delta )\subset \bigcap _{j=1,\dots ,J}\mathbb {T}( \delta _j)$. Given any $\mathcal {T}\in \mathbb {T}(\delta )$ and any $v\in K\subset \bigcup _{j=1,\dots ,J} B(k_j,\varepsilon /2)$, there exists $j\in \{1,\dots ,J\}$ with $\Vert v-k_j \Vert <\varepsilon /2$. Since $\mathcal {T}\in \mathbb {T}( \delta _j)$, ${{\,\mathrm{dist}\,}}(k_j,V(\mathcal {T}))<\varepsilon /2 $. This and a triangle inequality show $ {{\,\mathrm{dist}\,}}(v,V(\mathcal {T}))\leqslant \Vert v-k_j \Vert +{{\,\mathrm{dist}\,}}(k_j,V(\mathcal {T}))<\varepsilon /2+\varepsilon /2 =\varepsilon . $ $\square $

The application of the previous lemma to the finite element approximation of the solution of the PDE reads as follows.

Lemma 2.4

(uniform approximation of solutions) For any $\varepsilon >0$ there exists some $\delta >0$ such that, given any $g\in L^2(\Omega )$ and the weak solution $\phi \in H^1_0(\Omega ) $ to $\mathcal {L}_2\phi = g $ (with ${\mathcal {L}}_2 $ from (1.1)), the vector $v:= ({\mathbf {A}} \nabla \phi , {\mathbf {b}}\cdot \nabla \phi +\gamma \phi )\in V$ satisfies $ \sup _{\mathcal {T}\in \mathbb {T}(\delta )} {{\,\mathrm{dist}\,}}( v , V(\mathcal {T})) \leqslant \epsilon \,\Vert g\Vert . $

Proof

The linear and bounded bijective differential operator $\mathcal {L}_2:H^1_0(\Omega )\rightarrow H^{-1}(\Omega )$ has a bounded inverse. The embedding $\iota : L^2(\Omega )\hookrightarrow H^{-1}(\Omega )$ is compact and so is the composition ${\mathcal {L}}_2^{-1} \circ \iota :L^2(\Omega )\rightarrow H^1_0(\Omega )$. Define the operator $T: L^2(\Omega )\rightarrow L^2(\Omega ;\mathbb {R}^n)\times L^2(\Omega )$ for any $g\in L^2(\Omega )$ by

$$\begin{aligned} T(g):=( {\mathbf {A}} \nabla \phi , {\mathbf {b}}\cdot \nabla \phi +\gamma \phi ) \quad \text {with}\quad \phi :={\mathcal {L}}_2^{-1} g. \end{aligned}$$

Since ${\mathcal {L}}_2^{-1}\circ \iota $ is compact, $K:= T(S(L^2(\Omega ))$ is pre-compact in $ V= L^2(\Omega ;\mathbb {R}^n)\times L^2(\Omega )$. Given any $\varepsilon >0$ the approximation result (2.2) and Lemma 2.3 lead to a positive $\delta $ with (2.3). Consequently, the assertion $\sup _{\mathcal {T}\in \mathbb {T}(\delta )} {{\,\mathrm{dist}\,}}( v , V(\mathcal {T})) \leqslant \epsilon \,\Vert g\Vert $ holds for all $g\in S(L^2(\Omega ))$ and corresponding $v:= ({\mathbf {A}} \nabla \phi , {\mathbf {b}}\cdot \nabla \phi +\gamma \phi )\in V$. A rescaling proves the result for all $g\in L^2(\Omega )$.

2.4 $L^2$ best-approximation of the fluxes

The medius analysis of mixed finite element methods employs arguments from a priori and a posteriori error analysis [6, 15] to prove new $L^2$ best-approximation results. Recall that $\Pi _k$ is the $L^2$ projection onto $P_k(\mathcal {T})$ and $h_\mathcal {T}$ is the mesh-size associated to $\mathcal {T}$.

Lemma 2.5

(flux $L^2$ best-approximation) There exists a constant $C_3$, which depends on the shape-regularity in $\mathcal {T}$, on $\Omega $ and on $\underline{\alpha }, \overline{\alpha }$, such for any ${\mathbf {p}}\in H({{\,\mathrm{div}\,}},\Omega )$ and any $\mathcal {T}\in \mathbb {T}$, there exists ${\mathbf {p}}_h\in M_k(\mathcal {T})$ such that ${{\,\mathrm{div}\,}}{\mathbf {p}}_h =\Pi _k {{\,\mathrm{div}\,}}{\mathbf {p}}$ and

$$\begin{aligned} C_3^{-1} \Vert {\mathbf {p}}-{\mathbf {p}}_h\Vert _{{\mathbf {A}}^{-1}} \leqslant {{\,\mathrm{dist}\,}}( {\mathbf {p}},M_k(\mathcal {T}))+ \Vert h_\mathcal {T}(1-\Pi _k){{\,\mathrm{div}\,}}{\mathbf {p}}\Vert . \end{aligned}$$

This is the $L^2$ best-approximation result from [15, Lemma 5.1] for mixed finite element approximations for the unit matrix ${\mathbf {A}}$. Although with a different focus, the paper [6] introduces a general framework with a mesh-dependent norm $\Vert \bullet \Vert _h$ in $P_k(\mathcal {T})$; while [11, Eq (3.6)] presents a localized refinement of this lemma.

Proof

Given ${\mathbf {p}}\in H({{\,\mathrm{div}\,}},\Omega )$, the right-hand sides $F(w):=(w,{{\,\mathrm{div}\,}}{\mathbf {p}})_{L^2(\Omega ) }$ and $G({\mathbf {q}}):= ({\mathbf {p}},{\mathbf {q}}) $ lead in the elliptic mixed formulation (for the Laplacian)

$$\begin{aligned} (\sigma , {\mathbf {q}}) - (u, {{\,\mathrm{div}\,}}{\mathbf {q}})_{L^2(\Omega ) } + (w ,{{\,\mathrm{div}\,}}{\mathbf {p}})_{L^2(\Omega ) } = G({\mathbf {q}})+F(w) \quad \text {for all } ({\mathbf {q}},w)\in H \end{aligned}$$

to the unique solution $(\sigma ,u)\equiv ({\mathbf {p}},0)\in H$. Its straight-forward mixed finite element discretisation substitutes H by $V_h:=M_k(\mathcal {T})\times P_k(\mathcal {T})$ and leads to a unique discrete solution $({\mathbf {p}}_h,v_h)\in V_h$ with ${{\,\mathrm{div}\,}}{\mathbf {p}}_h=\Pi _k {{\,\mathrm{div}\,}}{\mathbf {p}}$. This and [6, Thm 2.2] lead to the asserted best-approximation result (in terms of (non-weighted) $L^2$ norms)

$$\begin{aligned} C_4^{-1} \Vert {\mathbf {p}}-{\mathbf {p}}_h\Vert \leqslant \inf _{{ \mathbf {q}}_h\in M_k(\mathcal {T})} \Vert {\mathbf {p}}-{ \mathbf {q}}_h\Vert + \Vert h_\mathcal {T}(1-\Pi _k){{\,\mathrm{div}\,}}{\mathbf {p}}\Vert . \end{aligned}$$

The constant $C_4$ from [6, 15] does not depend on the coefficients $\mathbf{A},$ b, $\gamma $ but depends on the shape-regularity in $\mathcal {T}$ and on $\Omega $. The equivalence of norms concludes the proof and leads to the asserted constant $C_3$, which depends on $C_4$ and $\underline{\alpha }, \overline{\alpha }$. $\square $

2.5 A discrete approximation result for Raviart-Thomas functions

In any space-dimension n and degree k, the RT functions satisfy a rather particular approximation estimate with the componentwise $L^2$ projection $\Pi _k$ onto $P_k(\mathcal {T})$.

Lemma 2.6

Any $\tau _{RT}\in RT_k(\mathcal {T})$ satisfies $ \Vert \tau _{RT}- \Pi _k\tau _{RT}\Vert \leqslant \frac{n}{(n+1)(n+k)} \Vert h_\mathcal {T}{{\,\mathrm{div}\,}}\tau _{RT}\Vert . $

The proof will be postponed to the appendix because of its focus on the RT finite element shape functions. The statement of the above lemma fails for the BDM finite element family.

3 Stability analysis

This section deals with approximation of fluxes and stability result. The design of a test function in the proof of a discrete $\inf $-$\sup $ condition is based on the characterisation and approximation of a dual solution.

3.1 Dual solution and conservative formulation

The inner structure of the dual solution y exploits the elliptic PDE and generates some compactness argument in the subsequent subsection. Recall that the operator $\mathcal {L}_2 :H^1_0(\Omega )\rightarrow H^{-1}(\Omega )$ from (1.1) is bijective.

Theorem 3.1

(dual solution in conservative formulation) Suppose ${\mathbf {b}}_1:= {\mathbf {A}}^{-1} {\mathbf {b}}$ and ${\mathbf {b}}_2\equiv 0$ a.e. in $\Omega $ in (1.4). Then $x=(\sigma ,u)\in H$ and $y=(\zeta ,z)\in H$ satisfy ${\langle x},{\bullet \rangle }_{H}=b(\bullet , y)$ in H if and only if

$$\begin{aligned} \zeta = \sigma -{\mathbf {A}} \nabla \phi \quad \text {and}\quad z= {{\,\mathrm{div}\,}}\sigma -\phi \quad \text { a.e. in }\Omega \end{aligned}$$

for the weak solution $\phi \in H^1_0(\Omega )$ to $ {\mathcal {L}}_2\phi =g:= {\mathbf {b}} \cdot {\mathbf {A}}^{-1} \sigma +(\gamma -1)\,{{\,\mathrm{div}\,}}\sigma -u\in L^2(\Omega )$.

The function $\phi $ originates from a known $L^2$ orthogonal decomposition

$$\begin{aligned} L^2(\Omega ;\mathbb {R}^n)= \nabla H^1_0(\Omega ) \oplus H(\mathrm{div}{=}0,\Omega ) \end{aligned}$$

(3.1)

with $ H(\mathrm{div}{=}0,\Omega ):=\{ \tau \in H({{\,\mathrm{div}\,}},\Omega ):\, {{\,\mathrm{div}\,}}\tau =0\text { a.e. in }\Omega \} $. The decomposition (3.1) is also useful in the proof of equivalence of the displacement formulation with the differential operators in (1.1) to the mixed formulations with (1.3).

Proof of Theorem 3.1

For the general version of the bilinear form $b(\bullet ,\bullet )$, the equation ${\langle x},{\bullet \rangle }_H=b(\bullet , y)$ is equivalent to $u = {\mathbf { b}}_1\cdot \zeta +\gamma \,z-{{\,\mathrm{div}\,}}\zeta $ a.e. in $\Omega $ and

$$\begin{aligned} (\tau ,{\mathbf {A}}^{-1} (\zeta -\sigma ) -z {\mathbf {b}}_2 )_{L^2(\Omega ) } + (z-{{\,\mathrm{div}\,}}\sigma ,{{\,\mathrm{div}\,}}\tau )_{L^2(\Omega ) }=0 \quad \text {for all } \tau \in H({{\,\mathrm{div}\,}},\Omega ). \end{aligned}$$

(3.2)

The test with $\tau \in H(\mathrm{div}{=}0,\Omega )$ proves that ${\mathbf {A}}^{-1} (\sigma -\zeta ) + z {\mathbf {b}}_2\perp H(\mathrm{div}{=}0,\Omega )$ and so (3.1) leads to $\phi \in H^1_0(\Omega )$ with

$$\begin{aligned} {\mathbf {A}} \nabla \phi = \sigma -\zeta + z {\mathbf {A}} {\mathbf {b}}_2 \quad \text { a.e. in }\Omega . \end{aligned}$$

This identity allows the substitution of ${\mathbf {A}}^{-1} (\zeta -\sigma ) -z {\mathbf {b}}_2$ in the above formula (3.2) with general $\tau \in H({{\,\mathrm{div}\,}},\Omega )$. Then, an integration by parts shows the resulting identity $(\phi +z-{{\,\mathrm{div}\,}}\sigma ,{{\,\mathrm{div}\,}}\tau )=0$. The surjectivity of ${{\,\mathrm{div}\,}}:H({{\,\mathrm{div}\,}},\Omega )\rightarrow L^2(\Omega )$ proves

$$\begin{aligned} {{\,\mathrm{div}\,}}\sigma =\phi +z \quad \text { a.e. in }\Omega . \end{aligned}$$

The combination of the three preceding identities leads to the PDE

$$\begin{aligned} -{{\,\mathrm{div}\,}}({\mathbf {A}} \nabla \phi ) + {\mathbf {b}} _1\cdot {\mathbf {A}} \nabla \phi + \gamma \phi= & {} -{{\,\mathrm{div}\,}}( z {\mathbf {A}} {\mathbf {b}}_2)+ ({\mathbf { b}}_1\cdot {\mathbf {A}} {\mathbf {b}}_2)\, z\\&+ {\mathbf {b}}_1\cdot \sigma +(\gamma -1)\, {{\,\mathrm{div}\,}}\sigma -u \end{aligned}$$

in the sense of distributions. Since ${\mathbf {b}}_2=0$, the right-hand side g belongs to $L^2$. This proves one direction of the assertion; the direct proof of the converse is omitted. $\square $

Remark 3.2

(no divergence formulation) The proof shows the extra term $-{{\,\mathrm{div}\,}}( z {\mathbf {A}} {\mathbf {b}}_2)\in H^{-1}(\Omega )$ in case (1.4) is considered for non-zero ${\mathbf {b}}_2\in L^\infty (\Omega ;\mathbb {R}^n)$. This term does not belong to $L^2(\Omega )$ under Assumption (A) and is, therefore, excluded.

3.2 Approximation of the fluxes

The subsequent lemma describes the uniform approximation of the flux variable by a combination of the compactness argument and the $L^2$ best-approximation of Subsections 2.3 and 2.4.

Lemma 3.3

(flux approximation) Given any $\varepsilon >0$, there exists $\delta >0$ such that the following holds for all $\mathcal {T}\in \mathbb {T}(\delta )$ and $g\in L^2(\Omega )$. There exists some ${\mathbf {p}}_h\in M_k(\mathcal {T})$ that approximates ${\mathbf {p}} := {\mathbf {A}}\nabla \phi \in H({{\,\mathrm{div}\,}},\Omega )$ for the weak solution $\phi \in H^1_0(\Omega )$ to ${\mathcal {L}}_2\phi = g$ with

$$\begin{aligned} {{\,\mathrm{div}\,}}{\mathbf {p}}_h=\Pi _k{{\,\mathrm{div}\,}}{\mathbf {p}} \quad \text {and} \quad \Vert {\mathbf {p}}-{\mathbf {p}}_h\Vert _{{\mathbf {A}}^{-1}}\leqslant \epsilon \Vert g\Vert . \end{aligned}$$

Proof

Given any $\varepsilon >0$ and the constant $C_3$ from Lemma 2.5, Lemma 2.4 leads to a positive $\delta \leqslant \min \{1, 2^{-1} \epsilon / C_3\}$ with

$$\begin{aligned} \sup _{\mathcal {T}\in \mathbb {T}(\delta )} {{\,\mathrm{dist}\,}}\left( ({\mathbf {A}} \nabla \phi , {\mathbf {b}}\cdot \nabla \phi +\gamma \phi ), V(\mathcal {T}) \right) \leqslant 2^{-3/2}\epsilon / C_3\,\Vert g\Vert \end{aligned}$$

(the distance is with respect to the weighted norm $\Vert \bullet \Vert _{{\mathbf {A}}^{-1}}$ in $L^2(\Omega ;\mathbb {R}^n)$ and $\Vert \bullet \Vert $ in $L^2(\Omega )$). Lemma 2.5 applies to ${\mathbf {p}}:={\mathbf {A}}\nabla \phi $ with ${{\,\mathrm{div}\,}}{\mathbf {p}}= {\mathbf {b}}\cdot \nabla \phi +\gamma \phi -g\in L^2(\Omega )$ and, for any $\mathcal {T}\in \mathbb {T}(\delta )$, leads to some approximation ${\mathbf {p}}_h\in M_k(\mathcal {T})$ with ${{\,\mathrm{div}\,}}{\mathbf {p}}_h =\Pi _k {{\,\mathrm{div}\,}}{\mathbf {p}}$ and

$$\begin{aligned} C_3^{-1} \Vert {\mathbf {p}}-{\mathbf {p}}_h\Vert _{{\mathbf {A}}^{-1}}&\leqslant {{\,\mathrm{dist}\,}}( {\mathbf {p}},M_k(\mathcal {T}))+ \delta \Vert (1-\Pi _k)( {\mathbf {b}}\cdot \nabla \phi +\gamma \phi -g ) \Vert \\&\leqslant {{\,\mathrm{dist}\,}}( {\mathbf {p}},M_k(\mathcal {T}))+ {{\,\mathrm{dist}\,}}( {\mathbf {b}}\cdot \nabla \phi +\gamma \phi , P_k(\mathcal {T})) + \delta \, \Vert g\Vert \\&\leqslant 2^{1/2} {{\,\mathrm{dist}\,}}(( {\mathbf {p}},{\mathbf {b}}\cdot \nabla \phi +\gamma \phi ) , V(\mathcal {T})) + \delta \, \Vert g\Vert \leqslant C_3^{-1} \epsilon \, \Vert g\Vert . \end{aligned}$$

This concludes the proof. $\square $

Remark 3.4

(no uniform approximation in $H({{\,\mathrm{div}\,}})$) Lemma 3.3 does not state a uniform approximation estimate for the divergence and, in fact, an estimate of the form $ \Vert {{\,\mathrm{div}\,}}({\mathbf {p}}-{\mathbf {p}}_h)\Vert \leqslant \epsilon \Vert g\Vert $ cannot hold in general. To see this, adopt the notation of the proof of Lemma 3.3 and a reverse triangle inequality for

$$\begin{aligned} \Vert g-\Pi _kg \Vert - \Vert {{\,\mathrm{div}\,}}({\mathbf {p}}- {\mathbf {p}}_h)\Vert \leqslant \Vert (1-\Pi _k)( {\mathbf {b}}\cdot \nabla \phi +\gamma \phi )\Vert \leqslant \epsilon /(2C_3)\, \Vert g\Vert . \end{aligned}$$

The flux ${\mathbf {p}}= {\mathbf {A}}\nabla \mathcal {L}_2^{-1}g$ depends on $g\in S(L^2(\Omega ))$ and so does the crucial term $ \Vert {{\,\mathrm{div}\,}}({\mathbf {p}}- {\mathbf {p}}_h)\Vert = \Vert (1-\Pi _k){{\,\mathrm{div}\,}}{\mathbf {p}}\Vert $. Hence, $\sup \{ \Vert g-\Pi _kg \Vert : g\in S(L^2(\Omega ))\}=1$ implies

$$\begin{aligned} 1- \epsilon /(2C_3) \leqslant \sup \{ \Vert (1-\Pi _k) {{\,\mathrm{div}\,}}({\mathbf {A}}\nabla {\mathcal {L}}_2^{-1}g)\Vert : g\in S(L^2(\Omega )) \}. \end{aligned}$$

Therefore, the approximation error $ \Vert {{\,\mathrm{div}\,}}({\mathbf {p}}- {\mathbf {p}}_h)\Vert $ will not tend to zero uniformly for all $g\in S(L^2(\Omega ))$ as $\epsilon $ and $\delta $ tend to zero. $\square $

Example 3.5

(RT approximation in $H({{\,\mathrm{div}\,}})$ for particular g) The stability analysis in Subsection 1.4 concerns a discrete $x_h:=(\sigma _h,u_h)$ with norm $\Vert (\sigma _h,u_h)\Vert _H=1$ and leads in Theorem 3.1 to the particular right-hand side $g:= {\mathbf {b}} \cdot {\mathbf {A}}^{-1} \sigma _h +(\gamma -1)\,{{\,\mathrm{div}\,}}\sigma _h-u_h$ with $\Vert g\Vert \leqslant C_5<\infty $ for the essential supremum $C_5^2$ of $|{\mathbf {A}}^{-1/2}{\mathbf {b}}|^2 + |\gamma -1|^2+1$ in $\Omega $. This g allows for a uniform approximation of ${\mathbf {p}}$ by ${\mathbf {p}}_h$ in $H({{\,\mathrm{div}\,}},\Omega )$ for the RT finite element family.

For instance, in the extreme case of piecewise constant coefficients, $g-\Pi _kg= {\mathbf {b}} \cdot {\mathbf {A}}^{-1}(1-\Pi _k)\sigma _h$. With $C_6:=\Vert {\mathbf {A}}^{-1}{\mathbf {b}} \Vert _{L^\infty (\Omega )} $, Lemma 2.6 shows $ \Vert g-\Pi _kg\Vert \leqslant \delta C_6$. The combination with Lemma 3.3 lead to ${\mathbf {p}}_h$ with $ \Vert {\mathbf {p}}-{\mathbf {p}}_h\Vert _{H({{\,\mathrm{div}\,}},\Omega )}\leqslant \epsilon \,C_5+ \delta \,C_6. $ This and the arguments of Subsection 1.4 lead to the discrete stability (1.6).

3.3 Proof of theorem 1.1

Given any $0< \varepsilon < \beta /\Vert b\Vert $, choose $\delta >0$ as in Lemma 3.3. Suppose $\mathcal {T}\in \mathbb {T}(\delta )$ and let $x_h=(\sigma _h,u_h)\in V_h:=M_k(\mathcal {T})\times P_k(\mathcal {T})$ have norm $\Vert x_h\Vert _H=1$ and define $g:= {\mathbf {b}} \cdot {\mathbf {A}}^{-1} \sigma _h +(\gamma -1)\,{{\,\mathrm{div}\,}}\sigma _h-u_h$. Replace x by $x_h$ in Theorem 3.1 and let $\phi \in H^1_0(\Omega )$ solve $ {\mathcal {L}}_2\phi =g$ to define $\zeta = \sigma _h-{\mathbf {A}} \nabla \phi $ and $z= {{\,\mathrm{div}\,}}\sigma _h-\phi $. Then $y=(\zeta ,z)\in H$ is the dual solution and solves ${\langle x_h},{\bullet \rangle }_{H}=b(\bullet , y)$ in H for the bilinear form (1.4) (with ${\mathbf {b}}_1:= {\mathbf {A}}^{-1} {\mathbf {b}}$ and ${\mathbf {b}}_2\equiv 0$ a.e. in $\Omega $). Lemma 3.3 applies to ${\mathbf {p}}:= {\mathbf {A}}\nabla \phi $ and leads to ${\mathbf {p}}_h$ with ${{\,\mathrm{div}\,}}{\mathbf {p}}_h=\Pi _k{{\,\mathrm{div}\,}}{\mathbf {p}}$ and $\Vert {\mathbf {p}}-{\mathbf {p}}_h\Vert _{{\mathbf {A}}^{-1}}\leqslant \varepsilon \, C_5$. Hence, $\zeta _h:=\sigma _h- {\mathbf {p}}_h $ and $z_h:=\Pi _kz$ define $y_h:=(\zeta _h,z_h)\in V_h$ with

$$\begin{aligned} \Vert y_h\Vert _H \leqslant \Vert y\Vert _H + \Vert \zeta -\zeta _h\Vert _{ {\mathbf {A}}^{-1}}\leqslant \Vert y\Vert _H+ \varepsilon \, C_5\end{aligned}$$

(3.3)

as $\Vert {{\,\mathrm{div}\,}}\zeta _h\Vert =\Vert \Pi _k{{\,\mathrm{div}\,}}\zeta \Vert \leqslant \Vert {{\,\mathrm{div}\,}}\zeta \Vert $, $\Vert z_h\Vert =\Vert \Pi _kz\Vert \leqslant \Vert z\Vert $ and ${{\,\mathrm{div}\,}}\sigma _h\in P_k(\mathcal {T})$. Moreover,

$$\begin{aligned} \Vert \zeta -\zeta _h\Vert _{{\mathbf {A}}^{-1}}^2+ \Vert z-z_h\Vert ^2\leqslant \varepsilon ^2 \, C_5^2+ \Vert \phi -\Pi _k\phi \Vert ^2. \end{aligned}$$

Piecewise Poincaré inequalities (with the Payne-Weinberger constant $1/\pi $ for convex domains [16]) show $\Vert \phi -\Pi _k\phi \Vert \leqslant \delta /\pi \Vert \nabla \phi \Vert $. Recall that $H^1_0(\Omega )$ is endowed with the seminorm $\Vert \nabla \bullet \Vert $ and let $C_F$ denote the constant in the Friedrichs inequality $\Vert \bullet \Vert \leqslant C_F\Vert \nabla \bullet \Vert $ in $H^1_0(\Omega )$. Note that (1.2) leads to $\alpha \, \Vert \nabla \phi \Vert \leqslant \sup \{ a(\psi ,\phi ) : \psi \in H^1_0(\Omega ) , \; \Vert \nabla \psi \Vert =1\}$. Hence $ a(\psi ,\phi )={\langle \mathcal {L}_2 \phi },{\psi \rangle }_{H^{-1}(\Omega )\times H^1_0(\Omega )} = {\langle g},{\psi \rangle }_{H^{-1}(\Omega )\times H^1_0(\Omega )} \leqslant C_F\Vert g\Vert $ implies $\alpha \, \Vert \nabla \phi \Vert \leqslant C_F\,C_5$. The combination with Poincaré inequalities shows $\Vert \phi -\Pi _k\phi \Vert \leqslant \delta C_F\,C_5/(\alpha \,\pi )$ and so

$$\begin{aligned} \Vert \zeta -\zeta _h\Vert _{{\mathbf {A}}^{-1}}^2+ \Vert z-z_h\Vert ^2\leqslant \varepsilon ^2 \, C_5^2+ \delta ^2 C_F^2\,C_5^2/(\alpha ^2\,\pi ^2)=:(\varepsilon ')^2. \end{aligned}$$

Since $u_h\perp {{\,\mathrm{div}\,}}(\zeta -\zeta _h)$ and $z-z_h\perp {{\,\mathrm{div}\,}}\sigma _h$ ($\perp $ denotes orthogonality in $L^2(\Omega )$),

$$\begin{aligned} b(x_h,y-y_h)&=(\sigma _h+ {\mathbf {b}} u_h,\zeta -\zeta _h)_{{\mathbf {A}}^{-1}} +(\gamma \, u_h,z-z_h)\\&\leqslant \varepsilon '\left( \Vert \sigma _h +{\mathbf {b}} u_h\Vert ^2_{{\mathbf {A}}^{-1}} + \Vert (1-\Pi _k)( \gamma \, u_h)\Vert ^2\right) ^{1/2} \\&\leqslant \varepsilon '\left( 2 \Vert \sigma _h \Vert ^2_{{\mathbf {A}}^{-1}} +2 \Vert {\mathbf {A}}^{-1/2}{\mathbf {b}} \Vert _{L^\infty (\Omega ) }^2 \Vert u_h\Vert ^2 + \Vert \gamma \Vert _{L^\infty (\Omega ) }^2 \Vert u_h\Vert ^2\right) ^{1/2}\\&\leqslant \varepsilon ' \, C_7\end{aligned}$$

with the constant $ C_7^2:= \max \{ 2 , 2 \Vert {\mathbf {A}}^{-1/2}{\mathbf {b}} \Vert _{L^\infty (\Omega ) }^2 + \Vert \gamma \Vert _{L^\infty (\Omega ) }^2\}$. The arguments of Sect. 1.4 lead to

$$\begin{aligned} 1=\Vert x_h\Vert ^2_H=b(x_h,y)=b(x_h,y_h)+b(x_h,y-y_h) \leqslant \Vert b(x_h,\bullet )\Vert _{V_h^*}\Vert y_h\Vert _H + \varepsilon ' \, C_7.\nonumber \\ \end{aligned}$$

(3.4)

Since $\beta \Vert y\Vert _H \leqslant \Vert b(\bullet , y)\Vert _{H^*}=\Vert x_h\Vert _H=1$ implies $\Vert y\Vert _H \leqslant 1/\beta $, (3.3) reads $\Vert y_h\Vert _H \leqslant 1/\beta + \varepsilon \, C_5$. This and (3.4) verify

$$\begin{aligned} 1- \varepsilon '\, C_7\leqslant \Vert b(x_h,\bullet )\Vert _{V_h^*}\Vert y_h\Vert _H\leqslant \Vert b(x_h,\bullet )\Vert _{V_h^*} (1/\beta + \varepsilon \, C_5). \end{aligned}$$

Since $x_h$ was arbitrary in $S(V_h)$ (with $V_h$ endowed with the norm in H),

$$\begin{aligned} \beta \frac{1- \varepsilon '\, C_7}{1+ \varepsilon \,\beta \, C_5} \leqslant \beta _h:= \inf _{x_h \in S(V_h)} \sup _ {y_h \in S(V_h)} b(x_h,y_h ). \end{aligned}$$

Relabelling $\epsilon $ and $\delta $ proves the assertion: For any $0< \beta _0<\beta $, there exists $\delta >0$ with (1.6). This establishes the theorem for the conservative version of the mixed finite element discretisation with ${\mathbf {b}}_1:= {\mathbf {A}}^{-1} {\mathbf {b}}$ and ${\mathbf {b}}_2\equiv 0$ a.e. in $\Omega $.

To deduce the same $\inf $-$\sup $ constant for the other variant, define $b_1(\bullet ,\bullet ) $ (resp. $b_2(\bullet ,\bullet ) $) by (1.6) for ${\mathbf {b}}_1:= {\mathbf {A}}^{-1} {\mathbf {b}}$ and ${\mathbf {b}}_2\equiv 0$ (resp. ${\mathbf {b}}_1:= 0$ and ${\mathbf {b}}_2:= {\mathbf {b}} \cdot {\mathbf {A}}^{-1} $) a.e. in $\Omega $. The above proof shows $0<\beta _0\leqslant \beta _h$ and it is elementary to see that

$$\begin{aligned} \beta _h&=\inf _{ (\tau _h,v_h) \in S(V_h)} \sup _ {(\sigma _h, u_h) \in S(V_h)} b_1((\tau _h, -v_h),(\sigma _h,-u_h)). \end{aligned}$$

A direct calculation shows $b_1((\tau , -v),(\sigma ,-u))=b_2((\sigma ,u), (\tau ,v))$ for all $(\sigma ,u), (\tau ,v)\in H$. This and a duality argument (singular values of a square matrix coincide with those of its transposed) in the last equality show

$$\begin{aligned} \beta _h=\inf _{ (\tau _h,v_h) \in S(V_h)} \sup _ {(\sigma _h, u_h) \in S(V_h)} b_2((\sigma _h,u_h),(\tau _h,v_h))= \inf _{x_h \in S(V_h)} \sup _ {y_h \in S(V_h)} b_2(x_h,y_h ). \end{aligned}$$

Hence, the divergence formulation has the same discrete $\inf $-$\sup $ constant $\beta _h$. $\square $

Remark 3.6

($\delta $ dependence) The size of $\delta $ in (1.6) is hidden behind a compactness argument of Lemma 3.3. Besides the norms and parameters mentioned in Assumption (A), the mapping properties of ${\mathcal {L}}_2^{-1}$ are of relevance as well. A review of the proofs of this paper shows that there is a finite sub-cover of $S(L^2(\Omega ))$ with small balls in $H^{-1}(\Omega )$ that leads to a finite number of (without loss of generality) smooth functions $k_1,\dots , k_J$ as in the proof of Lemma 2.3. The size of $\delta $ is related to the approximation properties of the weak solutions $\Phi _j$ to ${\mathcal {L}}_2\Phi _j=k_j$ a.e. The regularity properties of $\Phi _j\in H^1_0(\Omega )$ are not characterised for Assumption (A): In fact, it is unknown whether $\Phi _j$ belongs to any $H^{1+s}(\Omega )$ for any $s>0$. Under Assumption (B) and reduced elliptic regularity, however, the afore mentioned approximation properties could be quantified more and reveal further information on $\delta $.

4 $L^2$ best-approximation

The notation of Theorem 1.2 applies throughout this section with continuous and discrete solutions $x=(\sigma ,u)$ and $x_h=(\sigma _h,u_h)$.

4.1 Proof of theorem 1.2.a

Given ${\mathbf {p}}:=\sigma \in H({{\,\mathrm{div}\,}},\Omega )$ and $\mathcal {T}\in \mathbb {T}(\delta )$, choose $\sigma _h^*:={\mathbf {p}}_h\in M_k(\mathcal {T})$ as in Lemma 2.5, and define $e_h:=(\sigma _h- \sigma _h^*, u_h-\Pi _ku)\in V_h$. Given $\beta _0>0$ in (1.6) there exists some $y_h=(\tau _h,v_h)\in V_h$ with $\Vert y_h\Vert _H=1$ and

$$\begin{aligned} \beta _0\, \Vert e_h\Vert _H\leqslant b(e_h,y_h)=b(( \sigma _h-\sigma _h^*,u_h-\Pi _ku) , y_h)= b(( \sigma -\sigma _h^*,u-\Pi _ku) , y_h). \end{aligned}$$

Since $u-\Pi _ku\perp {{\,\mathrm{div}\,}}\tau _h$ and $v_h\perp {{\,\mathrm{div}\,}}(\sigma -\sigma ^*_h)$, the last term is equal to

$$\begin{aligned}&(\sigma -\sigma ^*_h ,\tau _h - v_h \,{\mathbf {A}} {\mathbf {b}}_2)_{{\mathbf {A}}^{-1}} +(u-\Pi _ku, {\mathbf {b}}_1\cdot \tau _h+\gamma \, v_h)_{L^2(\Omega ) }\\&\quad \leqslant C_8\, \Vert (\sigma -\sigma ^*_h, u-\Pi _ku) \Vert _L \end{aligned}$$

in terms of the weighted $L^2$ norm $\Vert \bullet \Vert _L\leqslant \Vert \bullet \Vert _H$ in H with $\Vert (\tau _h,v_h )\Vert _L^2:=\Vert \tau _h\Vert _{{\mathbf {A}}^{-1}}^2+ \Vert v_h\Vert ^2 \leqslant 1$ and with $C_8^2= 1+\Vert {\mathbf {A}}^{1/2}{\mathbf {b}}_1\Vert _{L^\infty (\Omega )}^2 +\Vert {\mathbf {A}}^{1/2}{\mathbf {b}}_2\Vert _{L^\infty (\Omega )}^2 + \Vert \gamma \Vert _{L^\infty (\Omega )}^2 $. Consequently, $\Vert e_h\Vert _H\leqslant \Big (C_8/\beta _0\Big )\, \Vert (\sigma -\sigma ^*_h, u-\Pi _ku) \Vert _L $. Lemma 2.5 shows

$$\begin{aligned} C_3^{-1}\Vert \sigma -\sigma ^*_h\Vert _{{\mathbf {A}}^{-1}} \leqslant {{\,\mathrm{dist}\,}}(\sigma ,M_k(\mathcal {T})) +\Vert h_\mathcal {T}(1-\Pi _k){{\,\mathrm{div}\,}}\sigma \Vert . \end{aligned}$$

(4.1)

This and the distance ${{\,\mathrm{dist}\,}}_L$ (measured in the norm $\Vert \bullet \Vert _L$) lead to

$$\begin{aligned} \Vert e_h\Vert _H \leqslant C_8\max \{1,C_3\} /\beta _0\, \left( {{\,\mathrm{dist}\,}}_L((\sigma ,u),V_h) +\Vert h_\mathcal {T}(1-\Pi _k){{\,\mathrm{div}\,}}\sigma \Vert \right) . \end{aligned}$$

This, (4.1), and a triangle inequality conclude the proof. $\square $

4.2 Proof of theorem 1.2.b

Throughout this subsection, let ${\mathbf {b}}_1\equiv 0$ and ${\mathbf {b}}_2:= {\mathbf {A}}^{-1} {\mathbf {b}}$ a.e. in $\Omega $ in (1.4) and let $f\in L^2(\Omega )$ be a fixed right-hand side for the continuous and discrete problem ${\mathcal {L}}_2u=f$.

Return to the proof of the previous subsection with $e_h$ and follow the first lines until

$$\begin{aligned} \beta _0\, \Vert e_h\Vert _H\leqslant b(e_h,y_h)= (\sigma -\sigma ^*_h ,\tau _h - v_h \,{\mathbf {A}} {\mathbf {b}}_2)_{{\mathbf {A}}^{-1}} +(u-\Pi _ku, \gamma \, v_h)_{L^2(\Omega ) }. \end{aligned}$$

Abbreviate $\overline{\gamma }:=\Pi _0\gamma $ and utilize the Lipschitz continuity of the coefficients $\gamma $ on each simplex

$$\begin{aligned} \Vert \Pi _k(\gamma (u-\Pi _ku)) \Vert = \Vert \Pi _k((\gamma -\overline{\gamma })(u-\Pi _ku)) \Vert \leqslant {{\,\mathrm{\mathcal {L}ip}\,}}(\gamma ) \Vert h_\mathcal {T}(u-\Pi _ku)\Vert . \end{aligned}$$

This controls the above term $(u-\Pi _ku, \gamma \, v_h)_{L^2(\Omega ) }\leqslant \Vert \Pi _k(\gamma (u-\Pi _ku)) \Vert \,\Vert v_h\Vert $ and leads to

$$\begin{aligned} \Vert \sigma _h^*-\sigma _h\Vert _{{\mathbf {A}}^{-1/2}}\leqslant & {} \Vert e_h\Vert _L\leqslant \Vert e_h\Vert _H \leqslant C_8/\beta _0\, \Vert \sigma -\sigma ^*_h\Vert _{{\mathbf {A}}^{-1}}\\&+ {{\,\mathrm{\mathcal {L}ip}\,}}(\gamma )/\beta _0\, \Vert h_\mathcal {T}(u-\Pi _ku)\Vert . \end{aligned}$$

A triangle inequality in $L^2(\Omega ;\mathbb {R}^n)$ is followed by (4.1) in the proof of

$$\begin{aligned} \Vert \sigma -\sigma _h\Vert _{{\mathbf {A}}^{-1/2}}&\leqslant C_3(1+C_8/\beta _0) \Bigl ( {{\,\mathrm{dist}\,}}(\sigma ,M_h)+\Vert h_\mathcal {T}(1-\Pi _k){{\,\mathrm{div}\,}}\sigma \Vert \Bigr ) \\&\qquad + {{\,\mathrm{\mathcal {L}ip}\,}}(\gamma )/\beta _0\, \Vert h_\mathcal {T}(u-\Pi _ku)\Vert . \end{aligned}$$

This completes the rest of the proof. $\square $

4.3 Conservative formulation

Theorem 2.a includes an error estimate for the conservative formulation with ${\mathbf {b}}_1:= {\mathbf {A}}^{-1} {\mathbf {b}}$ and ${\mathbf {b}}_2:=0$ in (1.4) and $\sigma = -{\mathbf {A}}\nabla u- u\,{\mathbf {b}}$ with ${{\,\mathrm{div}\,}}\sigma =f-\gamma u$. The refined analog of Theorem 2.b is not expected because of an extra term exemplified in the extreme case of piecewise constant coefficients ${\mathbf {b}}_1$ and $\gamma $. The arguments of Subsection 4.1 lead to

$$\begin{aligned} \beta _0\, \Vert e_h\Vert _H\leqslant & {} b(( \sigma -\sigma _h^*,u-\Pi _ku) , y_h)\\= & {} (\sigma -\sigma ^*_h ,\tau _h )_{{\mathbf {A}}^{-1}} +((u-\Pi _ku) {\mathbf {b}}_1, \tau _h -\Pi _k\tau _h)_{L^2(\Omega ) }. \end{aligned}$$

The last term is not of higher order for the BDM finite element family as pointed out in [8] through numerical evidence. For the RT finite element family, however, Lemma 2.6 shows $\Vert \tau _h -\Pi _k\tau _h\Vert { \lesssim } \Vert h_\mathcal {T}\, {{\,\mathrm{div}\,}}\tau _h\Vert $ and then leads to a higher-order contribution in the asserted inequality of Theorem 1.2.b as the final result. $\square $

The arguments could be generalised, but those result are of limited relevance as the convergence order is not generally improved in comparison with Theorem 2.a. An exception is the example of [7, Sect 3.5] (with ${\mathbf {b}}=0=\gamma $ on the unit ball) when Theorem 1.2.b guarantees $O(\delta ^2)$ for the $L^2$ flux error for $k=0$.

References

Arbogast, T., Chen, Z.: On the implementation of mixed methods as nonconforming methods for second-order elliptic problems. Math. Comput. 64(211), 943–972 (1995)
MathSciNet MATH Google Scholar
Boffi, D., Brezzi, F., Fortin, M.: Mixed Finite Element Methods and Applications, Springer Series in Computational Mathematics, vol. 44. Springer, Heidelberg (2013)
Book Google Scholar
Braess, D.: Finite Elements, 3rd edn. Cambridge University Press, Cambridge (2007)
Book Google Scholar
Brenner, S.C., Scott, L.R.: The mathematical theory offinite element methods. In: Marsden, J.E., Sirovich, L., Antman S.S. (eds.) Texts in Applied Mathematics, third ed., vol. 15. Springer, New York (2008)
Carstensen, C., Dond, A.K., Nataraj, N., Pani, A.K.: Error analysis of nonconforming and mixed FEMs for second-order linear non-selfadjoint and indefinite elliptic problems. Numer. Math. 133(3), 557–597 (2016)
Article MathSciNet Google Scholar
Carstensen, C., Gallistl, D., Schedensack, M.: $L^2$ best approximation of the elastic stress in the Arnold-Winther FEM. IMA J. Numer. Anal. 36(3), 1096–1119 (2016)
Article MathSciNet Google Scholar
Carstensen, C., Peterseim, D., Schedensack, M.: Comparison results of finite element methods for the Poisson model problem. SIAM J. Numer. Anal. 50(6), 2803–2823 (2012)
Article MathSciNet Google Scholar
Demlow, A.: Suboptimal and optimal convergence in mixed finite element methods. SIAM J. Numer. Anal. 39(6), 1938–1953 (2002)
Article MathSciNet Google Scholar
Douglas, J., Jr., Roberts, J.E.: Mixed finite element methods for second order elliptic problems. Mat. Apl. Comput. 1(1), 91–103 (1982)
MathSciNet MATH Google Scholar
Douglas, J., Jr., Roberts, J.E.: Global estimates for mixed methods for second order elliptic equations. Math. Comp. 44(169), 39–52 (1985)
Article MathSciNet Google Scholar
Ern, A., Gudi, T., Smears, I., Vohralik, M.: Equivalence of local- and global-best approximations, a simple stable local commuting projector, and optimal hp approximation estimates in H(div). IMA J. Numer. Anal. 00, 1–27 (2021). https://doi.org/10.1093/imanum/draa103
Ern, A., Guermond, J.-L.: Theory and Practices of Finite Elements, 1st edn. Springer-Verlag, New York (2004)
Book Google Scholar
Ern, A., Guermond, J.-L.: Mollification in strongly Lipschitz domains with application to continuous and discrete de Rham complexes. Comput. Methods Appl. Math. 16(1), 51–75 (2016)
Article MathSciNet Google Scholar
Gastaldi, L., Nochetto, R.H.: On $L^\infty $-accuracy of mixed finite element methods for second order elliptic problems. Mat. Apl. Comput. 7(1), 13–39 (1988)
MathSciNet MATH Google Scholar
Huang, J., Xu, Y.: Convergence and complexity of arbitrary order adaptive mixed element methods for the Poisson equation. Sci. China Math. 55(5), 1083–1098 (2012)
Article MathSciNet Google Scholar
Payne, L.E., Weinberger, H.F.: An optimal Poincaré inequality for convex domains. Arch. Ration. Mech. Anal. 5(1), 286–292 (1960)
Article Google Scholar
Schatz, A.H., Wang, J.P.: Some new error estimates for Ritz-Galerkin methods with minimal regularity assumptions. Math. Comput. 65(213), 19–27 (1996)
Article MathSciNet Google Scholar

Download references

Acknowledgements

The research of the first author has been supported by the Deutsche Forschungsgemeinschaft in the Priority Program 1748 under the project “foundation and application of generalied mixed FEM towards nonlinear problems in solid mechanics” (CA 151/22-2). The finalization of this paper has been supported by DST SERB MATRICS grant No. MTR/2017/000199 (NN), MATRICS grant No. MTR/201S/000309 (AKP) and SPARC project (id 235) entitled the mathematics and computation of plates.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Humboldt-Universität zu Berlin, Berlin, Germany
C. Carstensen
Department of Mathematics, Indian Institute of Technology Bombay, Powai,Mumbai, 400076, India
C. Carstensen, Neela Nataraj & Amiya K. Pani

Authors

C. Carstensen
View author publications
You can also search for this author in PubMed Google Scholar
Neela Nataraj
View author publications
You can also search for this author in PubMed Google Scholar
Amiya K. Pani
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to C. Carstensen.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix: proof of Lemma 2.6

For any simplex $T\subset \mathbb {R}^n$, let $P_k(T;\mathbb {R}^n)$ be the linear space of vector-valued polynomials $q_k$ of degree at most k in any component and let $\Vert \bullet \Vert $ abbreviate the $L^2$ norm $\Vert \bullet \Vert _{L^2(T)}$ on T. The particular structure of the RT function $\tau _{RT}$ leads to some polynomial $g\in P_k(T)$ and

$$\begin{aligned} \tau _{RT}= g(x)\, x + p_{k} \quad \text {for all }x\in T \quad \text { and some } p_k\in P_k(T; \mathbb {R}^n). \end{aligned}$$

The argument x (will always belong to T) is often neglected as in $\tau _{RT}:=\tau _{RT}(x)$ or $p_k=p_k(x)$, while (with a small inconsistency, but the right emphasis) written out in the leading term $g(x)\, x$. The latter polynomial is either identically zero or of exact degree $k+1$ in the sense that g is a sum of monomials of exact degree k. Adopt a multi-index notation with $\alpha =(\alpha _1,\dots ,\alpha _n)\in \mathbb {N}_0^n$ and the monomial $x^\alpha :=x_1^{\alpha _1}x_2^{\alpha _2}\cdots x_n^{\alpha _n}$ of degree $k=|\alpha |:=\alpha _1+\dots +\alpha _n$ for any $x=(x_1,\dots ,x_n)\in T$. With real coefficients $c_\alpha $ for any $\alpha \in {\mathbb {N}}_0^n$ of degree $|\alpha |=k$,

$$\begin{aligned} g(x)=\sum _{|\alpha |=k} c_\alpha x^\alpha \quad \text {for all }x\in T. \end{aligned}$$

(The symbol $|\alpha |=k$ under the sum sign abbreviates the set of all multi-indices $\alpha =(\alpha _1,\dots ,\alpha _n)\in {\mathbb {N}}_0^n$ of degree k). The divergence

$$\begin{aligned} {{\,\mathrm{div}\,}}(g(x)\, x) = n\, g(x)+ x\cdot \nabla g(x) \end{aligned}$$

of the vector-valued polynomial $g(x)\, x$ of degree $k+1$ with respect to x is computed with the observation that, for any $\alpha \in {\mathbb {N}}_0^n$ with $|\alpha |=k$,

$$\begin{aligned} x\cdot \nabla (x^\alpha ) = \sum _{j=1}^n x_j \partial (x^\alpha )/\partial x_j= \sum _{j=1}^n \alpha _j x^\alpha = k \, x^\alpha . \end{aligned}$$

Consequently, $x\cdot \nabla g(x)= k\, g(x)$ and

$$\begin{aligned} {{\,\mathrm{div}\,}}(g(x)\, x) = (n+k)\, g(x)\quad \text {for all }x\in T. \end{aligned}$$

This proves ${{\,\mathrm{div}\,}}\tau _{RT}={{\,\mathrm{div}\,}}(g(x)\, x)+q_{k-1}= (n+k)\, g(x)+q_{k-1}$ for some $q_{k-1}\in P_{k-1}(T)$. The comparison with $\tau _{RT}= g(x)\, x + p_{k} $ leads to some polynomial remainder $r_{k}\in P_{k}(T; \mathbb {R}^n)$ in

$$\begin{aligned} \tau _{RT}= (n+k)^{-1}\, ({{\,\mathrm{div}\,}}\tau _{RT})\, x + r_{k} \quad \text {for all }x\in T . \end{aligned}$$

In other words, since ${{\,\mathrm{div}\,}}\tau _{RT}\in P_k(T)$,

$$\begin{aligned} (n+k)(1-\Pi _k) \tau _{RT}=(1-\Pi _k)\left( ({{\,\mathrm{div}\,}}\tau _{RT})\, x\right) =(1-\Pi _k)\left( ({{\,\mathrm{div}\,}}\tau _{RT})\, (x-c)\right) \end{aligned}$$

for any constant vector c. For instance, the center of inertia $c={{\,\mathrm{mid}\,}}(T)$ of T with diameter $h_T$ satisfies $|x-{{\,\mathrm{mid}\,}}(T)|\leqslant \left( n/(n+1)\right) h_T$ for all $x\in T$. This leads to

$$\begin{aligned} (n+k)\Vert&\tau _{RT}-\Pi _k\tau _{RT}\Vert = \Vert (1-\Pi _k) \left( ({{\,\mathrm{div}\,}}\tau _{RT}) (x-{{\,\mathrm{mid}\,}}(T))\right) \Vert \\&\quad \leqslant \Vert ({{\,\mathrm{div}\,}}\tau _{RT}) (x-{{\,\mathrm{mid}\,}}(T))\Vert \leqslant \left( n/(n+1)\right) h_T\, \Vert {{\,\mathrm{div}\,}}\tau _{RT}\Vert . \end{aligned}$$

This proves $\Vert \tau _{RT}- \Pi _k\tau _{RT}\Vert _{L^2(T)} \leqslant \frac{n \, \ h_T}{(n+1)(n+k)} \Vert {{\,\mathrm{div}\,}}\tau _{RT}\Vert _{L^2(T)}$. $\square $

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Carstensen, C., Nataraj, N. & Pani, A.K. Stability of mixed FEMs for non-selfadjoint indefinite second-order linear elliptic PDEs. Numer. Math. 150, 975–992 (2022). https://doi.org/10.1007/s00211-022-01282-3

Download citation

Received: 27 October 2020
Revised: 01 March 2022
Accepted: 02 March 2022
Published: 01 April 2022
Issue Date: April 2022
DOI: https://doi.org/10.1007/s00211-022-01282-3

Mathematics Subject Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Stability of mixed FEMs for non-selfadjoint indefinite second-order linear elliptic PDEs

Abstract

Similar content being viewed by others

A priori error estimates of expanded mixed FEM for Kirchhoff type parabolic equation

Error analysis of nonconforming and mixed FEMs for second-order linear non-selfadjoint and indefinite elliptic problems

Stability to a class of doubly nonlinear very singular parabolic equations

1 Introduction

1.1 Non-selfadjoint indefinite second-order linear elliptic PDEs

Assumption(A)

1.2 Earlier contributions

1.3 Contribution of this paper

Theorem 1.1

Theorem 1.2

1.4 Motivation

1.5 Structure of the paper

2 Preliminaries

2.1 Notation

2.2 Assumptions on the discretization

Definition 2.1

Definition 2.2

2.3 Pre-compactness

Lemma 2.3

Proof

Lemma 2.4

Proof

2.4 \(L^2\) best-approximation of the fluxes

Lemma 2.5

Proof

2.5 A discrete approximation result for Raviart-Thomas functions

Lemma 2.6

3 Stability analysis

3.1 Dual solution and conservative formulation

Theorem 3.1

Proof of Theorem 3.1

Remark 3.2

3.2 Approximation of the fluxes

Lemma 3.3

Proof

Remark 3.4

Example 3.5

3.3 Proof of theorem 1.1

Remark 3.6

4 \(L^2\) best-approximation

4.1 Proof of theorem 1.2.a

4.2 Proof of theorem 1.2.b

4.3 Conservative formulation

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Appendix: proof of Lemma 2.6

Appendix: proof of Lemma 2.6

Rights and permissions

About this article

Cite this article

Share this article

Mathematics Subject Classification

Search

Navigation