1 Introduction

Linear, second-order, nonvariational partial differential equations (PDEs) are those which are given in the form

$$\begin{aligned} -{\varvec{A}}{:}{\mathrm {D}^2u} = f, \end{aligned}$$
(1)

where \({{\varvec{X}}}{:}{{\varvec{Y}}} = {\text {trace}}\!\left( {{{\varvec{X}}}^{{\varvec{\intercal }}} {\varvec{Y}}}\right)\) is the Frobenius inner product between matrices. If the matrix \(\varvec{A}\) is differentiable, then there is an equivalence between this problem and its variational sibling

$$\begin{aligned} -{\text {div}}\!\left( {\varvec{A}\nabla u}\right) + \mathrm {D}{\varvec{A}}\nabla u = f, \end{aligned}$$
(2)

where

$$\begin{aligned} \mathrm {D}\varvec{A}= \!\left( {\sum _{i=1}^d \partial _i a_{i,1}(\varvec{x}) , \cdots , \sum _{i=1}^d \partial _i a_{i,d}(\varvec{x})}\right) , \end{aligned}$$
(3)

and d is the dimension under consideration. Rewriting in this form is sometimes undesirable. For example, if the coefficient matrix \(\varvec{A}\) has nearly singular derivatives, the problem becomes advection dominated and possibly unstable for conforming finite element methods. There is a wealth of material on the treatment of advection-dominated problems [cf. 18, 19]. If \(\varvec{A}\) is not differentiable, then the problem has no variational structure, and standard finite element methods cannot be applied.
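The equivalence of (1) and (2) for differentiable \(\varvec{A}\) is a direct computation, which can be checked symbolically. The sketch below (assuming sympy is available; the sample \(u\) and \(\varvec{A}\) are purely illustrative) verifies the identity with the definition (3) in two dimensions.

```python
# Symbolic check that -A : D^2 u = -div(A grad u) + (D A) . grad u for
# smooth A, with (D A)_j = sum_i d/dx_i a_{ij} as in (3).
# The sample u and A below are illustrative, not from the paper.
import sympy as sp

x, y = sp.symbols("x y")
u = sp.sin(x) * sp.exp(y)                          # a sample smooth function
A = sp.Matrix([[2 + x**2, x*y], [x*y, 3 + y**2]])  # a sample smooth SPD coefficient

grad_u = sp.Matrix([sp.diff(u, x), sp.diff(u, y)])
hess_u = sp.hessian(u, (x, y))

# left-hand side of (1): -A : D^2 u
lhs = -sum(A[i, j] * hess_u[i, j] for i in range(2) for j in range(2))

# right-hand side of (2): -div(A grad u) + (D A) . grad u
flux = A * grad_u
div_flux = sp.diff(flux[0], x) + sp.diff(flux[1], y)
DA = sp.Matrix([sp.diff(A[0, j], x) + sp.diff(A[1, j], y) for j in range(2)])
rhs = -div_flux + DA.dot(grad_u)

assert sp.simplify(lhs - rhs) == 0
```

The assertion confirms, for this sample, that the advection term \(\mathrm {D}\varvec{A}\nabla u\) exactly compensates the difference between the nonvariational and divergence forms.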

In a previous work [32], a finite element method for the approximation of the nonvariational problem (1) was introduced. This involved the introduction of a finite element Hessian represented in the same finite element space as the solution (modulo boundary conditions). The applications of the discrete representation of the Hessian of a piecewise function are becoming broader; for example, it can be used to drive anisotropic adaptive algorithms [1, 40], as a notion of discrete convexity [2] and in the design of finite element methods for nonlinear fourth-order problems [37].

The algebraic formulation of the \({\textit {C}} ^{0}\) Galerkin approximation of the nonvariational problem requires the solution of a large, sparse \(\left( {d+1}\right) ^2 N^2\) linear system [32, Lem 3.3], where N is the number of degrees of freedom. Equivalently, using a Schur complement argument, this can be reduced to a dense \(N^2\) linear system. The reason that this system is dense is the global nature of the \({\text {L}} ^{2}(\varOmega )\) projection operator into a continuous finite element space. The motivation for extending the nonvariational finite element method to the discontinuous setting is the considerable gain in computational efficiency over the continuous case. Indeed, due to the local representation of the projection operators in discontinuous spaces, the system matrix becomes sparse and is the same size as a standard discontinuous Galerkin stiffness matrix for the Laplacian.

A plethora of other finite element methods have been posed for linear nonvariational PDEs, including [38], in which the authors pose a stable discontinuous Galerkin scheme for linear nonvariational problems using discrete analogues of the classical techniques used to prove existence of strong solutions; see also [29], where similar techniques are used over curved domains. In [22,23,24] the author considers a least-squares formulation; see also [31, 35] for related formulations. Many of these formulations share the same disadvantage in the conditioning of the system matrix: the condition number scales like that of a fourth-order problem. As already mentioned, one of the desirable features of the formulation presented here is that it scales in the same fashion as a discontinuous Galerkin method for a second-order variational problem, a property shared by the method in [25] under the Cordes condition.

We are particularly interested in nonvariational PDEs due to their relation to general fully nonlinear PDEs

$$\begin{aligned} \mathscr {F} \!\left( { \mathrm {D}^2u }\right) = 0, \end{aligned}$$
(4)

which are the subject of significant current research. In the literature, finite element methods have been presented to solve this general class of problem. For example, in [8] the author presents a \({\textit {C}} ^{1}\) finite element method and shows stability and consistency (hence convergence) of the scheme, which requires a high degree of smoothness of the exact solution. In [20, 21] the authors give a method in which they approximate the general second-order fully nonlinear PDE by a sequence of fourth-order quasilinear PDEs. This is reminiscent of the vanishing viscosity method introduced for the classical study of first-order fully nonlinear PDEs. The efficiency of any method used to approximate such a problem is key. Each of these methods is computationally costly due to its reliance on \({\textit {C}} ^{1}\) finite elements [8, 21] or mixed methods [20].

In [5], a generic framework was set up to prove convergence of numerical approximations to the viscosity solutions of degenerate elliptic fully nonlinear PDEs. This involved constructing monotone sequences of approximations, which are typically applied to finite difference approximations of the nonlinear problem [cf. 36]. The assumption of consistency made in the framework of [5] is incompatible with finite element methods; however, an extremely important observation made in [28] is that the consistency condition may be weakened to incorporate the finite element case using a localisation argument (in the case of isotropic diffusion).

A posteriori analysis of linear nonvariational problems is less standard than for those of variational type. Typically, results are based on “closeness” conditions of Cordes type [12], which guarantee the existence of a unique strong \({\text {H}} ^{2}\) solution. Under these assumptions, it is relatively straightforward to derive a posteriori bounds in the natural norm for the problem, \({\text {H}} ^{2}\), or a mesh-dependent equivalent. For other methods, this has been done in [24]. It has even been shown that adaptive methods for these problems converge optimally in terms of the number of degrees of freedom [30].

We show similar a posteriori bounds that make it straightforward to incorporate the method into well-developed software packages for finite element methods, demonstrated here using Dune [6, 7]. In this work, we are interested in the asymptotic behaviour of the discontinuous approximation. In a subsequent work, we will study the computational gains of the discontinuous framework presented here over the continuous one given in [32], as well as exploit the powerful parallelisation capabilities of the package.

The rest of the paper is set out as follows. In Sect. 2, we formally introduce the model problem and give a brief review of known classical facts about nonvariational PDEs. In Sect. 3, we examine the discretisation of the nonvariational method in the discontinuous Galerkin framework, making use of the unified framework set out in [4] to derive a very general formulation of the finite element Hessian represented as a discontinuous object. We present some examples and examine the natural question of what happens when we try to eliminate the finite element Hessian from the formulation. In Sect. 4, we conduct an a posteriori analysis of the method and show upper and lower bounds on the error in an \({\text {H}} ^{2}\)-like mesh-dependent norm. Finally, in Sect. 5, we summarise extensive numerical experiments aimed at examining the convergence and robustness of the method presented.

2 Problem Formulation

In this section, we formulate the model problem, fix notation and give some basic assumptions. In addition, we review existence and uniqueness results for nonvariational problems. Let \(\varOmega \subset \mathbb R ^d\), \(d=2,3\), be a connected domain with polygonal boundary. The Lebesgue spaces are defined as

$$\begin{aligned} {\text {L}} ^{2}(\varOmega ) = \left\{ \phi :\;\int _\varOmega \left| \phi (\varvec{x})\right| ^2 \,\mathrm {d}\varvec{x}< \infty \right\} \;\; \text { and }\;\; {\text {L}} ^{\infty }(\varOmega ) = \left\{ \phi :\;\sup _{\varvec{x} \in \varOmega } \left| \phi (\varvec{x})\right| < \infty \right\} , \end{aligned}$$
(5)

and the Sobolev and Hilbert spaces

$$\begin{aligned} {\text {W}} ^{k,p}(\varOmega ) = \left\{ \phi \in {\text {L}} ^{p}(\varOmega ):\;\mathrm {D}^{{\varvec{\alpha }}}\phi \in {\text {L}} ^{p}(\varOmega ), \quad \text { for } \left| \varvec{\alpha }\right| \leqslant k \right\} \;\; \text { and }\;\;{\text {H}} ^{k}(\varOmega ) := {\text {W}} ^{k,2}(\varOmega ). \end{aligned}$$
(6)

These are equipped with the norms

$$\begin{aligned} \left\| \phi \right\| ^2_{{\text {L}} ^{2}(\varOmega )}= & {} \int _\varOmega \left| \phi \right| ^2 \,\mathrm {d}\varvec{x} , \qquad \left\| \phi \right\| _{{\text {L}} ^{\infty }(\varOmega )} = \sup _{\varvec{x} \in \varOmega } \left| \phi (\varvec{x})\right| , \end{aligned}$$
(7)
$$\begin{aligned} \left\| v\right\| _{{\text {W}} ^{k,p}(\varOmega )}^p= & {} \sum _{\left| {\varvec{\alpha }}\right| \leqslant k}\left\| \mathrm {D}^{{\varvec{\alpha }}} v\right\| _{{\text {L}} ^{p}(\varOmega )}^p \quad \text { and }\quad \left| v\right| _{{\text {W}} ^{k,p}(\varOmega )}^p = \sum _{\left| {\varvec{\alpha }}\right| = k}\left\| \mathrm {D}^{{\varvec{\alpha }}} v\right\| _{{\text {L}} ^{p}(\varOmega )}^p, \end{aligned}$$
(8)

where \({\varvec{\alpha }}= \{ \alpha _1,\cdots,\alpha _d\}\) is a multi-index, \(\left| {\varvec{\alpha }}\right| = \sum _{i=1}^d\alpha _i\) and derivatives \(\mathrm {D}^{{\varvec{\alpha }}}\) are understood in a weak sense. We pay particular attention to the cases \(k = 1,2\) and

$$\begin{aligned} {\text {H}} ^{1}_0(\varOmega ) := \text {closure of }{\textit {C}} ^{\infty }_0(\varOmega ) \text { in } {\text {H}} ^{1}(\varOmega ). \end{aligned}$$
(9)

The model problem in strong form is: find \(u\in {\text {H}} ^{2}(\varOmega )\cap {\text {H}} ^{1}_0(\varOmega )\) such that

$$\begin{aligned} \begin{aligned} \left\langle {\mathscr {L} u,\phi }\right\rangle&= \left\langle {f,\phi }\right\rangle, \quad \quad \,\forall \,\phi \in {\text {H}} ^{1}_0(\varOmega ), \\ \end{aligned} \end{aligned}$$
(10)

where the data \(f\in {\text {L}} ^{2}(\varOmega )\) are prescribed and \(\mathscr {L}\) is a general linear, second-order, uniformly elliptic partial differential operator. Let \(\varvec{A}\in {\text {L}} ^{\infty }(\varOmega )^{d\times d}\), we then define

$$\begin{aligned} \begin{array}{rccl} {\mathscr {L}}: &{} {{\text {H}} ^{2}(\varOmega )\cap {\text {H}} ^{1}_0(\varOmega )} &{} \rightarrow &{} {{\text {L}} ^{2}(\varOmega )}, \\ &{} \quad \quad\quad\quad \quad{u} &{}\mapsto &{} {\mathscr {L} u:= -{\varvec{A}}{:}{\mathrm {D}^2u}.} \end{array}\quad \end{aligned}$$
(11)

We assume that \(\varvec{A}\) is uniformly positive definite, i.e., there exists a \(\gamma >0\) such that for all \({\varvec{x}}\)

$$\begin{aligned} {{\varvec{y}}}^{{\varvec{\intercal }}} \varvec{A}({\varvec{x}}) {\varvec{y}} \geqslant \gamma \left| {\varvec{y}}\right| ^2, \quad \,\forall \,{\varvec{y}}\in \mathbb R ^d, \end{aligned}$$
(12)

and we call \(\gamma\) the ellipticity constant.

Definition 1

(Strong solution) A strong solution of (1) is a function \(u\in {\text {H}} ^{2}(\varOmega ) \cap {\text {H}} ^{1}_0(\varOmega )\), that is a twice weakly differentiable function, which satisfies the problem almost everywhere.

Theorem 1

(Existence and regularity of a strong solution of (1) [12]) Let \(\varOmega \subset \mathbb R ^d\) be a convex polytope. Suppose \(\varvec{A}\in {\text {L}} ^{\infty }(\varOmega )^{d\times d}\) is uniformly elliptic and \(f\in {\text {L}} ^{2}(\varOmega )\). Suppose further that \(\varvec{A}\) satisfies the Cordes condition, that is, there exist \(0 < \alpha \in {\text {L}} ^{\infty }(\varOmega )\) and \(0 < \beta ,\gamma \in \mathbb R\) with \(\beta +\gamma < 1\) such that

$$\begin{aligned} \left| {\mathbf{I }}: {\varvec{Y}} - \alpha (x) \varvec{A}:{\varvec{Y}}\right| {} \leqslant \beta \left| {\varvec{Y}}\right| {} + \gamma \left| {\mathbf{I }}: {\varvec{Y}}\right|, {} \quad \,\forall \,{\varvec{Y}} \in {\text {Sym}}^+(\mathbb R ^{d\times d}). \end{aligned}$$
(13)

Then, (1) admits a unique strong solution. There also exists a constant independent of u such that

$$\begin{aligned} \left\| u\right\| _{{\text {H}} ^{2}(\varOmega )}\leqslant C\left\| f\right\| _{{\text {L}} ^{2}(\varOmega )}. \end{aligned}$$
(14)
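The Cordes-type inequality (13) can be spot-checked numerically for a fixed coefficient matrix by sampling random symmetric positive definite test matrices. The sketch below is purely illustrative: the choices of \(\alpha\), \(\beta\) and \(\gamma\) are plausible sample values, not derived from [12].

```python
# Hedged numerical spot-check of the Cordes-type inequality (13) for a fixed
# constant SPD coefficient A, sampling random SPD matrices Y.  All parameter
# choices below (A, alpha, beta, gamma) are illustrative.
import numpy as np

rng = np.random.default_rng(0)
d = 2
A = np.array([[2.0, 0.5], [0.5, 1.0]])         # sample SPD coefficient
alpha = d / np.trace(A)                        # a common normalising choice
beta, gamma = 0.5, 0.4                         # must satisfy beta + gamma < 1

def cordes_holds(Y):
    # |I:Y - alpha A:Y| <= beta |Y|_F + gamma |I:Y|
    lhs = abs(np.trace(Y) - alpha * np.tensordot(A, Y))
    return lhs <= beta * np.linalg.norm(Y, "fro") + gamma * abs(np.trace(Y))

ok = True
for _ in range(10_000):
    B = rng.standard_normal((d, d))
    Y = B @ B.T + 1e-12 * np.eye(d)            # random SPD sample
    ok &= cordes_holds(Y)
print(ok)
```

Such a sampling check is of course not a proof: (13) must hold for every \({\varvec{Y}} \in {\text {Sym}}^+(\mathbb R ^{d\times d})\), so a failed sample disproves the condition while passing samples merely fail to refute it.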

Remark 1

(Less regular solutions) Note that the theory of viscosity solutions has been developed for non-classical solutions of (10) when the problem data do not satisfy the regularity assumed above; see [27].

Assumption 1

(Inf-sup condition) From here on, we assume that the problem satisfies an inf-sup condition; that is, with

$$\begin{aligned} a(w,v) := \int _\varOmega -{\varvec{A}}{:}{\mathrm {D}^2w} v \,\mathrm {d}x, \end{aligned}$$
(15)

then, for all \(w\in {\text {H}} ^{2}(\varOmega )\cap {\text {H}} ^{1}_0(\varOmega )\)

$$\begin{aligned} \sup _{v\in {\text {L}} ^{2}(\varOmega )} \frac{a(w,v)}{\left\| v\right\| _{{\text {L}} ^{2}(\varOmega )}} \geqslant C\left\| \Delta w\right\| _{{\text {L}} ^{2}(\varOmega )}. \end{aligned}$$
(16)

This holds under a variety of conditions, for example, those of Theorem 1. Note that for \(d=2\), this criterion is satisfied for all essentially bounded, symmetric positive definite \(\varvec{A}\).

3 Discretisation

Let \(\mathscr {T} ^{}\) be a shape regular triangulation of \(\varOmega\), namely, \(\mathscr {T} ^{}\) is a finite family of sets such that

  1) \(K\in \mathscr {T} ^{}\) implies K is an open simplex (segment for \(d=1\), triangle for \(d=2\), tetrahedron for \(d=3\)),

  2) for any \(K,J\in \mathscr {T} ^{}\), the intersection \(\overline{K}\cap \overline{J}\) is a full subsimplex of both \(\overline{K}\) and \(\overline{J}\) (i.e., it is either \(\emptyset\), a vertex, an edge, a face, or the whole of \(\overline{K}\) and \(\overline{J}\)) and

  3) \(\bigcup _{K\in \mathscr {T} ^{}}\overline{K}=\overline{\varOmega }\).

We use the convention where \(h:\varOmega \rightarrow \mathbb R\) denotes the mesh size function of \(\mathscr {T} ^{}\), i.e.,

$$\begin{aligned} h({\varvec{x}}):=\max _{\overline{K}\ni {\varvec{x}}}h_K, \end{aligned}$$
(17)

where \(h_K\) is the diameter of K. We let \(\mathscr {E} {}\) be the skeleton (set of common interfaces) of the triangulation \(\mathscr {T} ^{}\) and write \(e\in \mathscr {E}\) if e lies in the interior of \(\varOmega\) and \(e\subset \partial \varOmega\) if e lies on the boundary \(\partial \varOmega\).
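For a concrete picture of the mesh size function (17), the following sketch (with illustrative names and a tiny two-triangle mesh of the unit square) computes \(h_K\) as the element diameter and the value of h at each vertex as the maximum over elements whose closure contains it.

```python
# Illustrative computation of the mesh size function (17) at mesh vertices:
# h_K is the diameter of element K; h(x) is the max of h_K over elements
# whose closure contains x.  Two triangles sharing the diagonal of a square.
import numpy as np

verts = np.array([[0.0, 0.0], [1.0, 0.0], [1.0, 1.0], [0.0, 1.0]])
tris = np.array([[0, 1, 2], [0, 2, 3]])        # two triangles sharing edge 0-2

def diameter(tri):
    # diameter of a simplex = longest edge length
    p = verts[tri]
    return max(np.linalg.norm(p[i] - p[j]) for i in range(3) for j in range(i))

h_K = np.array([diameter(t) for t in tris])    # both sqrt(2) here

h_vertex = np.zeros(len(verts))
for K, tri in enumerate(tris):
    for v in tri:
        h_vertex[v] = max(h_vertex[v], h_K[K])

print(h_vertex)   # every vertex touches a triangle of diameter sqrt(2)
```

Evaluating h only at vertices suffices for this illustration; on a general point of \(\varOmega\), the same max-over-containing-elements rule applies.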

Let \(\mathbb P ^{k}(\mathscr {T} ^{})\) denote the space of piecewise polynomials of degree k over the triangulation \(\mathscr {T} ^{}\), \(\text { i.e., }\)

$$\begin{aligned} \mathbb P ^{k} (\mathscr {T} ^{}) = \left\{ \phi :\;\phi |_K \in \mathbb P ^{k} (K) \right\} , \end{aligned}$$
(18)

and introduce the finite element spaces

$$\begin{aligned} \mathbb {V}= & {} \mathbb {V}\!\left( {\mathscr {T} ^{},k}\right) := \mathbb P ^{k}(\mathscr {T} ^{}), \end{aligned}$$
(19)
$$\begin{aligned} \mathbb {V}_{0}= & {} \mathbb {V}_{0}\!\left( {\mathscr {T} ^{},k}\right) := \left\{ \phi \in \mathbb P ^{k}(\mathscr {T} ^{}):\;\phi \vert _{\partial \varOmega } = 0 \right\} , \end{aligned}$$
(20)

to be the usual spaces of discontinuous piecewise polynomial functions.

Remark 2

(Generalised Hessian) Assume \(v\in {\text {H}} ^{2}(\varOmega )\), let \(\varvec{n}:\partial \varOmega \rightarrow \mathbb R ^d\) be the outward pointing normal of \(\varOmega\), then the Hessian \(\mathrm {D}^2v\) of v, satisfies the following identity:

$$\begin{aligned} \int _\varOmega {\mathrm {D}^2v} \ {\phi } \,\mathrm {d}\varvec{x} = - \int _\varOmega {\nabla v}\otimes {\nabla \phi } \,\mathrm {d}\varvec{x} + \int _{\partial \varOmega }{\nabla v}\otimes {\varvec{n} \ \phi } \,\mathrm {d}s, \quad \,\forall \,\phi \in {\text {H}} ^{1}(\varOmega ). \end{aligned}$$
(21)

If \(v\in {\text {H}} ^{1}(\varOmega )\), the right-hand side of (21) is still well defined in view of duality; in this case, we set

$$\begin{aligned} \left\langle \mathrm {D}^2v\,\vert \,\phi \right\rangle = - \int _\varOmega {\nabla v}\otimes {\nabla \phi }\,\mathrm {d}\varvec{x} + \int _{\partial \varOmega }{\nabla v}\otimes {\varvec{n} \ \phi } \,\mathrm {d}s, \quad \,\forall \,\phi \in {\text {H}} ^{1}(\varOmega ), \end{aligned}$$
(22)

where the last term is understood as a duality pairing.

Definition 2

(Broken Sobolev spaces, trace spaces) We introduce the broken Sobolev space

$$\begin{aligned} {\text {H}} ^{k}(\mathscr {T} ^{}) := \left\{ \phi :\;\phi |_K\in {\text {H}} ^{k}(K), \text { for each } K \in \mathscr {T} ^{} \right\} . \end{aligned}$$
(23)

We also make use of functions defined in these broken spaces restricted to the skeleton of the triangulation. This requires an appropriate trace space

$$\begin{aligned} \mathcal T \!\left( {\mathscr {E}}\right) := \prod _{K\in \mathscr {T} ^{}} {\text {L}} ^{2}(\partial K) \supset \prod _{K\in \mathscr {T} ^{}} {\text {H}} ^{\frac{1}{2}}(\partial K). \end{aligned}$$
(24)

Definition 3

(Jumps, averages and tensor jumps) We define average, jump and tensor jump operators for arbitrary scalar functions \(v\in \mathcal T \!\left( {\mathscr {E}}\right)\), vectors \({\varvec{v}}\in \mathcal T \!\left( {\mathscr {E}}\right) ^d\) and matrices \({\varvec{V}}\in \mathcal T \!\left( {\mathscr {E}}\right) ^{d\times d}\) as

$$\begin{aligned} \lbrace v \rbrace= & {} {\frac{1}{2}\!\left( {v|_{K_1} + v|_{K_2}}\right) }, \qquad \lbrace {\varvec{v}} \rbrace = {\frac{1}{2}\!\left( {{\varvec{v}}|_{K_1} + {\varvec{v}}|_{K_2}}\right) }, \end{aligned}$$
(25)
$$\begin{aligned} \llbracket v\rrbracket= & {} {v}|_{K_1} \varvec{n}_{K_1} + {v}|_{K_2} \varvec{n}_{K_2}, \qquad \llbracket {\varvec{v}}\rrbracket = {\varvec{v}}|_{K_1} \cdot \varvec{n}_{K_1} + {\varvec{v}}|_{K_2} \cdot \varvec{n}_{K_2}, \end{aligned}$$
(26)
$$\begin{aligned} \llbracket {\varvec{V}}\rrbracket= & {} {\varvec{V}}|_{K_1}\varvec{n}_{K_1} + {\varvec{V}}|_{K_2}\varvec{n}_{K_2}, \qquad \llbracket {\varvec{v}}\rrbracket _\otimes = {\varvec{v}}|_{K_1} \otimes \varvec{n}_{K_1} + {\varvec{v}}|_{K_2} \otimes \varvec{n}_{K_2}. \end{aligned}$$
(27)

Note that on the boundary of the domain \(\partial \varOmega\), the jump and average operators are defined as

$$\begin{aligned}&\lbrace v \rbrace \Big \vert _{\partial \varOmega } := v, \qquad \lbrace \varvec{v} \rbrace \Big \vert _{\partial \varOmega } := \varvec{v}, \end{aligned}$$
(28)
$$\begin{aligned}&\llbracket v\rrbracket \Big \vert _{\partial \varOmega } := v\varvec{n}, \qquad \llbracket \varvec{v}\rrbracket \Big \vert _{\partial \varOmega } := {\varvec{v}} \cdot \varvec{n}, \end{aligned}$$
(29)
$$\begin{aligned}&\llbracket {\varvec{V}}\rrbracket \Big \vert _{\partial \varOmega } := {\varvec{V}} \varvec{n}, \qquad \llbracket {\varvec{v}}\rrbracket _\otimes \Big \vert _{\partial \varOmega } := {\varvec{v}}\otimes \varvec{n}. \end{aligned}$$
(30)
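The operators of Definition 3 are purely algebraic in the traces. A minimal numpy sketch on a single interior face (illustrative names; the face normal is arbitrary) reads:

```python
# A minimal numpy sketch of the trace operators in Definition 3 on a single
# interior face shared by elements K1, K2 with outward normals n1 = -n2.
import numpy as np

n1 = np.array([1.0, 0.0]); n2 = -n1             # outward normals on the face

def average(a1, a2):                            # {v}, componentwise
    return 0.5 * (np.asarray(a1) + np.asarray(a2))

def jump_scalar(v1, v2):                        # [[v]] = v1 n1 + v2 n2, a vector
    return v1 * n1 + v2 * n2

def jump_vector(p1, p2):                        # [[p]] = p1.n1 + p2.n2, a scalar
    return np.dot(p1, n1) + np.dot(p2, n2)

def tensor_jump(p1, p2):                        # [[p]]_ox = p1 (x) n1 + p2 (x) n2
    return np.outer(p1, n1) + np.outer(p2, n2)

# A continuous trace has zero jump; a discontinuity shows up only in the jump.
print(jump_scalar(3.0, 3.0))        # [0. 0.]
print(jump_scalar(3.0, 1.0))        # [2. 0.]
print(jump_vector([1.0, 2.0], [1.0, 2.0]))   # 0.0
```

Note that the jump of a scalar is vector valued and the jump of a vector is scalar valued, exactly as in (25)–(27).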

We will often use the following proposition, which we state in full for clarity; its proof follows directly from the identities in Definition 3.

Proposition 1

(Elementwise integration) For a generic vector-valued function \(\varvec{p}\) and scalar-valued function \(\phi\), we have

$$\begin{aligned} \begin{aligned} \sum _{K\in \mathscr {T} ^{}} \int _K {\text {div}}\!\left( {\varvec{p}}\right) \phi \,\mathrm {d}\varvec{x} = \sum _{K\in \mathscr {T} ^{}} \!\left( { - \int _K {\varvec{p}} \cdot \nabla _h \phi \,\mathrm {d}\varvec{x} + \int _{\partial K} \phi {\varvec{p}} \cdot \varvec{n}_K \,\mathrm {d}s }\right) , \end{aligned} \end{aligned}$$
(31)

where \(\nabla _h = {\!\left( {\mathrm {D}_h }\right) }^{{\varvec{\intercal }}}\) is the elementwise spatial gradient. Furthermore, if \(\varvec{p} \in \mathcal T \!\left( {\mathscr {E} \cup \partial \varOmega }\right) ^d\) and \(\phi \in \mathcal T \!\left( {\mathscr {E} \cup \partial \varOmega }\right)\), the following identity holds

$$\begin{aligned} \sum _{K\in \mathscr {T} ^{}} \int _{\partial K} \phi {{\varvec{p}}} \cdot \varvec{n}_K \,\mathrm {d}s = \int _\mathscr {E} \llbracket \varvec{p}\rrbracket \lbrace \phi \rbrace \,\mathrm {d}s + \int _{\mathscr {E} \cup \partial \varOmega } {\llbracket \phi \rrbracket } \cdot \lbrace \varvec{p} \rbrace \,\mathrm {d}s = \int _{\mathscr {E} \cup \partial \varOmega } \llbracket \varvec{p} \phi \rrbracket \,\mathrm {d}s. \end{aligned}$$
(32)

An equivalent tensor formulation of (31)–(32) is

$$\begin{aligned} \begin{aligned} \sum _{K\in \mathscr {T} ^{}} \int _K \mathrm {D}_h {\varvec{p}} \phi \,\mathrm {d}\varvec{x} = \sum _{K\in \mathscr {T} ^{}} \!\left( { - \int _K {\varvec{p}} \otimes \nabla _h \phi \,\mathrm {d}\varvec{x} + \int _{\partial K} \phi {\varvec{p}} \otimes \varvec{n}_K \,\mathrm {d}s }\right) , \end{aligned} \end{aligned}$$
(33)

where

$$\begin{aligned} \sum _{K\in \mathscr {T} ^{}} \int _{\partial K} \phi {\varvec{p}} \otimes \varvec{n}_K \,\mathrm {d}s = \int _\mathscr {E} \llbracket \varvec{p}\rrbracket _\otimes \lbrace \phi \rbrace \,\mathrm {d}s + \int _{\mathscr {E} \cup \partial \varOmega } {\llbracket \phi \rrbracket }\otimes \lbrace \varvec{p} \rbrace \,\mathrm {d}s = \int _{\mathscr {E} \cup \partial \varOmega } \llbracket \varvec{p} \phi \rrbracket _\otimes \,\mathrm {d}s. \end{aligned}$$
(34)

In addition for matrix-valued \({\varvec{V}},\) we have that

$$\begin{aligned} \sum _{K\in \mathscr {T} ^{}} \int _{K} { \!\left( {\mathrm {D}_h {\varvec{p}}}\right) }{:}{{\varvec{V}}} \,\mathrm {d}{\varvec{x}} = \sum _{K\in \mathscr {T} ^{}} \!\left( { -\int _{K} {\varvec{p}} \cdot \mathrm {D}_h {\varvec{V}}\,\mathrm {d}{\varvec{x}} + \int _{\partial K} {\!\left( {{\varvec{V}} {\varvec{p}}}\right) } \cdot {\varvec{n}}_K \,\mathrm {d}s }\right) \end{aligned}$$
(35)

and

$$\begin{aligned} \sum _{K\in \mathscr {T} ^{}} \int _{\partial K} {\!\left( {{\varvec{V}} {\varvec{p}}}\right) } \cdot {\varvec{n}}_K \,\mathrm {d}s = \int _\mathscr {E} {\llbracket {\varvec{V}}\rrbracket } \cdot \lbrace {\varvec{p}} \rbrace \,\mathrm {d}s + \int _{\mathscr {E} \cup \partial \varOmega } {\llbracket {\varvec{p}}\rrbracket _\otimes }{:}{\lbrace {\varvec{V}} \rbrace } \,\mathrm {d}s = \int _{\mathscr {E} \cup \partial \varOmega } \llbracket {\varvec{V}}\varvec{p}\rrbracket \,\mathrm {d}s. \end{aligned}$$
(36)
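The skeleton identity (32) reduces, face by face, to a pointwise algebraic identity in the traces, since on a single interior face the elementwise boundary contributions are \(\phi _1 \varvec{p}_1\cdot \varvec{n}_1 + \phi _2 \varvec{p}_2\cdot \varvec{n}_2\). The following sketch spot-checks this with random trace data (illustrative, one interior face):

```python
# Numerical spot-check of the pointwise identity behind (32): on an interior
# face, phi1 p1.n1 + phi2 p2.n2 = [[p]]{phi} + [[phi]].{p}.  Random traces.
import numpy as np

rng = np.random.default_rng(1)
n1 = rng.standard_normal(2); n1 /= np.linalg.norm(n1); n2 = -n1

for _ in range(1000):
    p1, p2 = rng.standard_normal(2), rng.standard_normal(2)
    phi1, phi2 = rng.standard_normal(2)
    lhs = phi1 * np.dot(p1, n1) + phi2 * np.dot(p2, n2)
    jump_p = np.dot(p1, n1) + np.dot(p2, n2)           # [[p]]
    avg_phi = 0.5 * (phi1 + phi2)                      # {phi}
    jump_phi = phi1 * n1 + phi2 * n2                   # [[phi]]
    avg_p = 0.5 * (p1 + p2)                            # {p}
    rhs = jump_p * avg_phi + np.dot(jump_phi, avg_p)
    assert abs(lhs - rhs) < 1e-12
print("identity verified")
```

The cross terms cancel precisely because \(\varvec{n}_2 = -\varvec{n}_1\) on an interior face, which is the mechanism behind the elementwise-to-skeleton rearrangement used throughout Proposition 1.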

3.1 Construction of an Appropriate Discrete Hessian

We now use the framework set out in [4] to construct a general notion of discrete Hessian. We first give a definition using a flux formulation.

Definition 4

(Generalised finite element Hessian: flux formulation) Let \(u\in {\text {H}} ^{2}(\mathscr {T} ^{})\), \({\widehat{U}} : {\text {H}} ^{1}(\mathscr {T} ^{}) \rightarrow \mathcal T \!\left( {\mathscr {E} \cup \partial \varOmega }\right)\) be a linear form and \(\widehat{\varvec{p}} : {\text {H}} ^{2}(\mathscr {T} ^{}) \times {\text {H}} ^{1}(\mathscr {T} ^{})^d \rightarrow \mathcal T \!\left( {\mathscr {E} \cup \partial \varOmega }\right) ^d\) a bilinear form representing approximations to u and \(\nabla u\) over the skeleton of the triangulation. Then, we define the generalised finite element Hessian \(\varvec{H}[u]\) as the solution of

$$\begin{aligned} \int _K {\varvec{H}[u]} \ {\varPhi } \,\mathrm {d}\varvec{x} = - \int _K \varvec{p} \otimes \nabla _h \varPhi \,\mathrm {d}\varvec{x} + \int _{\partial K} \varvec{\widehat{p}}_K \otimes \varvec{n} \ \varPhi \,\mathrm {d}s, \quad \,\forall \,\varPhi \in \mathbb {V}, \end{aligned}$$
(37)
$$\begin{aligned} \int _K \varvec{p} \otimes \varvec{q} \,\mathrm {d}\varvec{x} = - \int _K u \ \mathrm {D}_h \varvec{q} \,\mathrm {d}\varvec{x} + \int _{\partial K} \varvec{q} \otimes \varvec{n} \ \widehat{U}_K \,\mathrm {d}s \end{aligned}$$
(38)

for all \(\varvec{q}\in \mathbb {V}^d\).

We now present the primal formulation for the generalised finite element Hessian.

Theorem 2

(Generalised finite element Hessian: primal form) Let \(u\in {\text {H}} ^{2}(\mathscr {T} ^{})\) and let \({\widehat{U}}\) and \(\widehat{\varvec{p}}\) be defined as in Definition 4. Then, the generalised finite element Hessian H[u] is given for each \(\varPhi \in \mathbb {V}\) as

$$\begin{aligned} \begin{aligned} \int _{\varOmega } \varvec{H}[u] \ \varPhi \,\mathrm {d}\varvec{x}&= - \int _\varOmega \nabla _h u \otimes \nabla _h \varPhi \,\mathrm {d}\varvec{x} + \int _{\mathscr {E} \cup \partial \varOmega } \llbracket \varPhi \rrbracket \otimes \lbrace \varvec{{\widehat{p}}} \rbrace \,\mathrm {d}s + \int _\mathscr {E} \lbrace \varPhi \rbrace \llbracket \varvec{{\widehat{p}}}\rrbracket _\otimes \,\mathrm {d}s \\&\qquad - \int _{\mathscr {E}} \lbrace {\widehat{U}} - u \rbrace \llbracket \nabla _h \varPhi \rrbracket _\otimes \,\mathrm {d}s - \int _{\mathscr {E} \cup \partial \varOmega } \llbracket {\widehat{U}} - u\rrbracket \otimes \lbrace \nabla _h \varPhi \rbrace \,\mathrm {d}s. \end{aligned} \end{aligned}$$
(39)

Proof

Note that, in view of Definition 3, for generic \(\varvec{q} \in {\text {H}} ^{1}(\mathscr {T} ^{})^d\) and \(v \in {\text {H}} ^{1}(\mathscr {T} ^{})\), we have the following identity:

$$\begin{aligned} \sum _{K\in \mathscr {T} ^{}} \int _{\partial K} v \varvec{q} \otimes \varvec{n} \,\mathrm {d}s = \int _{\mathscr {E} \cup \partial \varOmega } \llbracket v\rrbracket \otimes \lbrace \varvec{q} \rbrace \,\mathrm {d}s + \int _\mathscr {E} \lbrace v \rbrace \llbracket \varvec{q}\rrbracket _\otimes \,\mathrm {d}s. \end{aligned}$$
(40)

Then summing (37) over \(K\in \mathscr {T} ^{}\) and making use of the identity (40) we see

$$\begin{aligned} \begin{aligned} \int _\varOmega {\varvec{H}[u]} \ {\varPhi } \,\mathrm {d}\varvec{x}&= \sum _{K\in \mathscr {T} ^{}} \int _K {\varvec{H}[u]} \ {\varPhi } \,\mathrm {d}\varvec{x} = \sum _{K\in \mathscr {T} ^{}} \!\left( {- \int _K \varvec{p} \otimes \nabla _h \varPhi \,\mathrm {d}\varvec{x} + \int _{\partial K} \varvec{\widehat{p}}_K \otimes \varvec{n} \ \varPhi \,\mathrm {d}s}\right) \\&= - \int _\varOmega \varvec{p} \otimes \nabla _h \varPhi \,\mathrm {d}\varvec{x} + \int _{\mathscr {E} \cup \partial \varOmega } \llbracket \varPhi \rrbracket \otimes \lbrace \varvec{\widehat{p}} \rbrace \,\mathrm {d}s + \int _\mathscr {E} \lbrace \varPhi \rbrace \llbracket \varvec{\widehat{p}}\rrbracket _\otimes \,\mathrm {d}s. \end{aligned} \end{aligned}$$
(41)

Using the same argument for (38)

$$\begin{aligned} \begin{aligned} \int _\varOmega \varvec{p} \otimes \varvec{q} \,\mathrm {d}\varvec{x}&= \sum _{K\in \mathscr {T} ^{}} \int _K \varvec{p} \otimes \varvec{q} \,\mathrm {d}\varvec{x} = \sum _{K\in \mathscr {T} ^{}} \!\left( { - \int _K u \ \mathrm {D}_h \varvec{q} \,\mathrm {d}\varvec{x} + \int _{\partial K} \varvec{q} \otimes \varvec{n} \ \widehat{U}_K \,\mathrm {d}s }\right) \\&= - \int _\varOmega u \ \mathrm {D}_h \varvec{q} \,\mathrm {d}\varvec{x} + \int _{\mathscr {E} \cup \partial \varOmega } \llbracket \widehat{U}\rrbracket \otimes \lbrace \varvec{q} \rbrace \,\mathrm {d}s + \int _\mathscr {E} \lbrace \widehat{U} \rbrace \llbracket \varvec{q}\rrbracket _\otimes \,\mathrm {d}s. \end{aligned} \end{aligned}$$
(42)

Note that, again making use of (40), we have for each \(\varvec{q}\in {\text {H}} ^{1}(\mathscr {T} ^{})^d\) and \(v\in {\text {H}} ^{1}(\mathscr {T} ^{})\) that

$$\begin{aligned} \int _\varOmega \varvec{q} \otimes \nabla _h v \,\mathrm {d}\varvec{x} = - \int _\varOmega \mathrm {D}_h \varvec{q} v\,\mathrm {d}\varvec{x} + \int _{\mathscr {E} \cup \partial \varOmega } \lbrace \varvec{q} \rbrace \otimes \llbracket v\rrbracket \,\mathrm {d}s + \int _\mathscr {E} \llbracket \varvec{q}\rrbracket _\otimes \lbrace v \rbrace \,\mathrm {d}s. \end{aligned}$$
(43)

Taking \(v=u\) in (43) and substituting into (38), we see

$$\begin{aligned} \int _\varOmega \varvec{p}\otimes \varvec{q}\,\mathrm {d}\varvec{x} = \int _\varOmega \varvec{q} \otimes \nabla _h u\,\mathrm {d}\varvec{x} + \int _{\mathscr {E} \cup \partial \varOmega } \llbracket {\widehat{U}} - u\rrbracket \otimes \lbrace \varvec{q} \rbrace \,\mathrm {d}s + \int _\mathscr {E} \lbrace {\widehat{U}} - u \rbrace \llbracket \varvec{q}\rrbracket _\otimes \,\mathrm {d}s. \end{aligned}$$
(44)

Now choosing \(\varvec{q} = \nabla _h \varPhi\) and substituting (44) into (37), we arrive at the fully generalised finite element Hessian given by (39).

Remark 3

(Consistent representations of the gradient operator) If one were interested in consistent representations of other derivatives, for example the gradient operator, one would need to modify the proof of Theorem 2. Examples of consistent gradient representations can be found in [4]. See also [10, 11, 17]. Using this methodology, it should be possible to construct an entire hierarchy of derivatives.

Example 1

An example of a DG formulation for the approximation to the Hessian, \(\mathrm {D}^2u\), can be derived by taking the fluxes in the following way:

$$\begin{aligned} {\widehat{U}}= & {} {\left\{ \begin{array}{ll} \lbrace u_h \rbrace &{}\quad \text { over } \mathscr {E}, \\ 0 &{}\quad \text { on } \partial \varOmega , \end{array}\right. } \end{aligned}$$
(45)
$$\begin{aligned} \widehat{\varvec{p}}= & {} \lbrace \nabla _h u_h \rbrace \text { on } \mathscr {E} \cup \partial \varOmega . \end{aligned}$$
(46)

The result is a discrete representation of the Hessian \(\varvec{H}[u_h]\) as a unique element of \(\mathbb {V}^{d\times d}\) such that

$$\begin{aligned} \int _{\varOmega } \varvec{H}[u_h] \ \varPhi \,\mathrm {d}\varvec{x} = - \int _\varOmega \nabla _h u_h \otimes \nabla _h \varPhi \,\mathrm {d}\varvec{x} + \int _{\mathscr {E} \cup \partial \varOmega } \llbracket u_h\rrbracket \otimes \lbrace \nabla _h \varPhi \rbrace + \llbracket \varPhi \rrbracket \otimes \lbrace \nabla _h u_h \rbrace \,\mathrm {d}s. \end{aligned}$$

We can also derive unsymmetric approximations with fluxes similar to those used for the NIPG method. This is given by taking \(\theta =-1\) in the following, while the symmetric version is given by \(\theta =1\):

$$\begin{aligned} \begin{aligned} \int _{\varOmega } \varvec{H}[u_h] \ \varPhi \,\mathrm {d}\varvec{x}&= - \int _\varOmega \nabla _h u_h \otimes \nabla _h \varPhi \,\mathrm {d}\varvec{x} \\&\quad + \int _{\mathscr {E} \cup \partial \varOmega } \theta \llbracket u_h\rrbracket \otimes \lbrace \nabla _h \varPhi \rbrace + \llbracket \varPhi \rrbracket \otimes \lbrace \nabla _h u_h \rbrace \,\mathrm {d}s \\&= \int _\varOmega \mathrm {D}^2_h u_h \varPhi \,\mathrm {d}\varvec{x} - \int _\mathscr {E} \llbracket \nabla _h u_h\rrbracket _\otimes \lbrace \varPhi \rbrace \,\mathrm {d}s \\&\quad + \int _{\mathscr {E} \cup \partial \varOmega } \theta \llbracket u_h\rrbracket \otimes \lbrace \nabla _h \varPhi \rbrace \,\mathrm {d}s, \quad \,\forall \,\varPhi \in \mathbb {V}. \end{aligned} \end{aligned}$$
(47)
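To make the construction concrete, the following one-dimensional sketch (illustrative, not the paper's implementation) evaluates the symmetric (\(\theta = 1\)) finite element Hessian (47) for a continuous piecewise-linear \(u_h\) vanishing at the endpoints, with the fluxes of Example 1. Under these assumptions, all jump terms in \(u_h\) drop out and only the averages of the broken gradient survive, with one-sided averages at the boundary; each element then requires only a local 2-by-2 mass solve.

```python
# A 1D sketch of the finite element Hessian (47) with theta = 1 for a
# continuous piecewise-linear u_h with u_h = 0 at the endpoints: jump terms
# in u_h vanish, so only the averages of the broken gradient contribute.
import numpy as np

def fe_hessian_p1(u_vals, x):
    """Per-element P1 coefficients of H[u_h] on a 1D mesh with nodes x."""
    h = np.diff(x)
    s = np.diff(u_vals) / h                        # broken gradient, constant per element
    s_ext = np.concatenate(([s[0]], s, [s[-1]]))   # one-sided averages at the boundary
    H = np.empty((len(h), 2))
    for k in range(len(h)):
        # right-hand side: each face contributes half the slope jump at its node
        b = np.array([(s_ext[k + 1] - s_ext[k]) / 2.0,
                      (s_ext[k + 2] - s_ext[k + 1]) / 2.0])
        M = (h[k] / 6.0) * np.array([[2.0, 1.0], [1.0, 2.0]])  # local P1 mass matrix
        H[k] = np.linalg.solve(M, b)               # local solve: no global coupling
    return H

x = np.linspace(0.0, 1.0, 11)
u = x * (1.0 - x)                                  # u'' = -2, u = 0 on the boundary
H = fe_hessian_p1(u, x)
print(H[1:-1])                                     # interior elements recover -2
```

The interior elements reproduce the exact Hessian \(-2\), while the two boundary elements deviate, reflecting the one-sided boundary averages and the choice \({\widehat{U}} = 0\) on \(\partial \varOmega\) in (45).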

3.2 The Discontinuous Nonvariational Finite Element Method

We are now in a position to state the numerical method for the approximation of (1). We look to find \(u_h\in \mathbb {V}_{0}\) together with \(\varvec{H}[u_h]\in \mathbb {V}^{d\times d}\) such that

$$\begin{aligned} \mathscr {A} _h\!\left( {u_h,\varPsi }\right) = l(\varPsi ), \quad \,\forall \,\varPsi \in \mathbb {V}_{0}\end{aligned}$$
(48)

with

$$\begin{aligned} \mathscr {A} _h\!\left( {u_h,\varPsi }\right):= & {} \int _\varOmega -{\varvec{A}}{:}{\varvec{H}[u_h]} \varPsi \,\mathrm {d}\varvec{x} + \int _{\mathscr {E} \cup \partial \varOmega } \sigma h^{-1} {\llbracket u_h\rrbracket } \cdot \llbracket \varPsi \rrbracket \,\mathrm {d}s, \end{aligned}$$
(49)
$$\begin{aligned} l(\varPsi ):= & {} \int _\varOmega f \varPsi \,\mathrm {d}\varvec{x}, \end{aligned}$$
(50)

where the penalisation parameter \(\sigma >0\) is to be chosen sufficiently large.

Using the \({\text {L}} ^{2}\) projection operator \({\text {P}}_{k} :{\text {L}} ^{2}(\varOmega ) \rightarrow \mathbb {V}\) defined for \(v\in {\text {L}} ^{2}(\varOmega )\) through

$$\begin{aligned} \int _\varOmega {\text {P}}_{k} \!\left( {v}\right) \varPsi \,\mathrm {d}\varvec{x} = \int _\varOmega v\varPsi \,\mathrm {d}\varvec{x},\quad \forall \varPsi \in \mathbb {V}, \end{aligned}$$
(51)

it is possible to eliminate the finite element Hessian from the bilinear form for sufficiently smooth \(\varvec{A}\).

Lemma 1

(Elimination of the finite element Hessian in a general setting) If the fluxes are chosen as in Example 1, then

$$\begin{aligned} \begin{aligned} \mathscr {A} _h\!\left( {u_h,\varPsi }\right)&= \int _\varOmega D_h\!\left( {{\text {P}}_{k} \!\left( {\varPsi \varvec{A}}\right) }\right) \nabla _h u_h\,\mathrm {d}\varvec{x} - \int _{\mathscr {E} \cup \partial \varOmega } \theta {\llbracket u_h\rrbracket } \cdot \lbrace D_h\!\left( {{\text {P}}_{k} \!\left( {\varPsi \varvec{A}}\right) }\right) \rbrace \,\mathrm {d}s \\&\quad - \int _{\mathscr {E} \cup \partial \varOmega } {\llbracket {\text {P}}_{k} \!\left( {\varPsi \varvec{A}}\right) \rrbracket } \cdot \lbrace \nabla _h u_h \rbrace \,\mathrm {d}s + \int _{\mathscr {E} \cup \partial \varOmega } \sigma h^{-1} {\llbracket u_h\rrbracket } \cdot \llbracket \varPsi \rrbracket \,\mathrm {d}s. \end{aligned} \end{aligned}$$
(52)

Proof

This follows from the following identity:

$$\begin{aligned} \begin{aligned} \int _\varOmega -{\varvec{A}}{:}{\varvec{H}[u_h]} \varPsi \,\mathrm {d}\varvec{x}&= \int _\varOmega -{\varvec{H}[u_h]}{:}{\!\left( {\varPsi \varvec{A}}\right) } \,\mathrm {d}\varvec{x} = \int _\varOmega -{\varvec{H}[u_h]}{:}{{\text {P}}_{k} \!\left( {\varPsi \varvec{A}}\right) } \,\mathrm {d}\varvec{x} \\&= \int _\varOmega D_h\!\left( {{\text {P}}_{k} \!\left( {\varPsi \varvec{A}}\right) }\right) \nabla _h u_h\,\mathrm {d}\varvec{x} - \int _{\mathscr {E} \cup \partial \varOmega } \theta {\llbracket u_h\rrbracket } \cdot \lbrace D_h\!\left( {{\text {P}}_{k} \!\left( {\varPsi \varvec{A}}\right) }\right) \rbrace \,\mathrm {d}s \\&\quad - \int _{\mathscr {E} \cup \partial \varOmega } {\llbracket {\text {P}}_{k} \!\left( {\varPsi \varvec{A}}\right) \rrbracket } \cdot \lbrace \nabla _h u_h \rbrace \,\mathrm {d}s. \end{aligned} \end{aligned}$$
(53)

Remark 4

The solution of the problem in this form is nontrivial due to the global \({\text {L}} ^{2}(\varOmega )\) projection appearing in the formulation. However, in the discontinuous setting, the global \({\text {L}} ^{2}(\varOmega )\) projection is in fact computable locally. We exploit this fact to optimise the efficiency of our scheme, and we discuss it further in the sequel.
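To make the locality concrete: in a discontinuous space the basis functions are supported on single elements, so the mass matrix is block-diagonal and the \({\text {L}} ^{2}\) projection (51) decouples into independent element-local solves. The following sketch is our own illustration in Python/NumPy for a 1D mesh with a monomial basis (not the Dune-Fem implementation used later):

```python
import numpy as np

def l2_project_dg(f, nodes, degree):
    """L2 projection onto discontinuous piecewise polynomials.

    Because each basis function is supported on a single element, the global
    mass matrix is block-diagonal and the projection reduces to one small
    dense solve per element -- no global system is ever assembled."""
    # Gauss-Legendre rule with degree+1 points: exact to degree 2*degree + 1
    q, w = np.polynomial.legendre.leggauss(degree + 1)
    coeffs = []
    for a, b in zip(nodes[:-1], nodes[1:]):
        x = 0.5 * (b - a) * q + 0.5 * (a + b)          # map rule to [a, b]
        wx = 0.5 * (b - a) * w                         # scaled weights
        V = np.vander(x, degree + 1, increasing=True)  # basis 1, x, ..., x^degree
        M = V.T @ (wx[:, None] * V)                    # local mass matrix
        rhs = V.T @ (wx * f(x))                        # local load vector
        coeffs.append(np.linalg.solve(M, rhs))         # element-local solve
    return np.array(coeffs)

# projecting a function already in the space reproduces it exactly
nodes = np.linspace(0.0, 1.0, 5)
c = l2_project_dg(lambda x: x**2, nodes, degree=2)
```

Each local solve involves only a small dense system; this locality is precisely what keeps the system matrix of the discontinuous method sparse.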

Example 2

(Laplacian formulation) Note that if \(\varvec{A}= \varvec{I}\) in (1), then

$$\begin{aligned} f = -{\varvec{A}}{:}{\mathrm {D}^2u} = -\Delta u \end{aligned}$$
(54)

and our bilinear form reduces to

$$\begin{aligned} \begin{aligned} \mathscr {A} _h\!\left( {u_h,\varPsi }\right)&= \int _\varOmega {\!\left( {\nabla _h\varPsi }\right) } \cdot \nabla _h u_h\,\mathrm {d}\varvec{x} - \int _{\mathscr {E} \cup \partial \varOmega } \theta {\llbracket u_h\rrbracket } \cdot \lbrace \nabla _h\varPsi \rbrace \,\mathrm {d}s \\&\quad - \int _{\mathscr {E} \cup \partial \varOmega } {(\llbracket \varPsi \rrbracket \cdot \lbrace \nabla _h u_h \rbrace } - \sigma h^{-1} {\llbracket u_h\rrbracket } \cdot \llbracket \varPsi \rrbracket )\,\mathrm {d}s, \end{aligned} \end{aligned}$$
(55)

since \({\text {P}}_{k} \!\left( {\varPsi \varvec{A}}\right) =\varPsi {\varvec{I}}\).

The nonvariational finite element method, thus, coincides with the classical (symmetric) interior penalty method for the Laplacian [16].

Remark 5

(Relation to standard DG methods) It is not difficult to prove that choosing the numerical fluxes as presented in [4, Table 3.2] recovers the corresponding DG methods summarised in that paper in the case that \(\varvec{A}\) is constant. For brevity, we do not prove this here.

Note that when \(\varvec{A}\) is not constant, the nonvariational finite element method does not coincide with its standard variational counterpart. Indeed, the method is able to cope with classes of advection-dominated problems not falling under the variational framework without special treatment [32, §4.2], as illustrated by the result of Lemma 1.

We conclude this section with a proof of consistency of the method and then show that Galerkin orthogonality holds.

Lemma 2

(Consistency) Let \(u\in {\text {H}} ^{2}(\varOmega )\) and assume that the numerical fluxes are chosen in a consistent fashion in the sense of [4, §3.1], that is,

$$\begin{aligned} {\widehat{U}}= & {} u|_{\mathscr {E} \cup \partial \varOmega }, \end{aligned}$$
(56)
$$\begin{aligned} \widehat{\varvec{p}}= & {} \nabla u|_{\mathscr {E} \cup \partial \varOmega }. \end{aligned}$$
(57)

Then for \(\varPhi \in \mathbb {V}\)

$$\begin{aligned} \begin{aligned} \int _\varOmega \varvec{H}[u] \varPhi \,\mathrm {d}\varvec{x}&= \int _\varOmega \mathrm {D}^2u \varPhi \,\mathrm {d}\varvec{x}. \end{aligned} \end{aligned}$$
(58)

Therefore, we have that \(\varvec{H}[u] = {\text {P}}_{k} \!\left( {\mathrm {D}^2u}\right)\).

Proof

Applying Proposition 1 to the first term in the definition of \(\varvec{H}[u]\) yields

$$\begin{aligned} \begin{aligned} \int _\varOmega \varvec{H}[u] \varPhi \,\mathrm {d}\varvec{x}&= \int _\varOmega \mathrm {D}^2u \varPhi \,\mathrm {d}\varvec{x} + \int _\mathscr {E} \llbracket \widehat{{\varvec{p}}}-\nabla u\rrbracket _\otimes \lbrace \varPhi \rbrace \,\mathrm {d}s + \int _{\mathscr {E} \cup \partial \varOmega } \lbrace \varvec{{\widehat{p}}}-\nabla u \rbrace \otimes \llbracket \varPhi \rrbracket \,\mathrm {d}s \\&\quad - \int _{\mathscr {E}} \lbrace {\widehat{U}} - u \rbrace \llbracket \nabla _h \varPhi \rrbracket _\otimes \,\mathrm {d}s - \int _{\mathscr {E} \cup \partial \varOmega } \llbracket {\widehat{U}} - u\rrbracket \otimes \lbrace \nabla _h \varPhi \rbrace \,\mathrm {d}s \\&= \int _\varOmega \mathrm {D}^2u \varPhi \,\mathrm {d}\varvec{x},\quad \,\forall \,\varPhi \in \mathbb {V}, \end{aligned} \end{aligned}$$
(59)

which proves the result under the consistency conditions on the fluxes.

Lemma 3

(Galerkin orthogonality) Let \(u\in {\text {H}} ^{2}(\varOmega )\cap {\text {H}} ^{1}_0(\varOmega )\) be a strong solution to the problem (1) and let \(u_h\in \mathbb {V}_{0}\) be its nonvariational finite element approximation. Assume that the numerical fluxes \({\widehat{U}}\) and \(\widehat{\varvec{p}}\) are consistent. Then we have the following orthogonality result:

$$\begin{aligned} \mathscr {A} _h\!\left( {u_h-u,\varPsi }\right) = J(\varPsi ), \quad \,\forall \,\varPsi \in \mathbb {V}_{0}, \end{aligned}$$
(60)

with the error functional given by

$$\begin{aligned} J(\varPsi ) = \int _\varOmega {\!\left( {\varvec{H}[u]-\mathrm {D}^2u}\right) }{:}{\!\left( {\varvec{A}\varPsi }\right) } \,\mathrm {d}\varvec{x}. \end{aligned}$$
(61)

Proof

Using the consistency result and that \(\llbracket u\rrbracket =0\), we conclude

$$\begin{aligned} \mathscr {A} _h\!\left( {u_h-u,\varPsi }\right)&= \mathscr {A} _h\!\left( {u_h,\varPsi }\right) + \int _\varOmega {\varvec{A}}{:}{\varvec{H}[u]} \varPsi \,\mathrm {d}\varvec{x} = l(\varPsi ) + \int _\varOmega {\varvec{H}[u]}{:}{\!\left( {\varvec{A}\varPsi }\right) } \,\mathrm {d}\varvec{x}\\&= -\int _\varOmega {\varvec{A}}{:}{\mathrm {D}^2u}\varPsi - {\varvec{H}[u]}{:}{\!\left( {\varvec{A}\varPsi }\right) }\,\mathrm {d}\varvec{x} = J(\varPsi ), \end{aligned}$$

concluding the proof.

Remark 6

If \(\varvec{A}\) is piecewise constant, then since \(\varvec{H}[u]={\text {P}}_{k} \!\left( {\mathrm {D}^2u}\right)\) we have \(J(\varPsi ) = 0\) and we recover the usual Galerkin orthogonality \(\mathscr {A} _h\!\left( {u_h-u,\varPsi }\right) = 0\).

Definition 5

(\({\text {H}} ^{1}(\mathscr {T} ^{})\) and \({\text {H}} ^{2}(\mathscr {T} ^{})\) norms) We introduce the broken \({\text {H}} ^{1}(\mathscr {T} ^{})\) and \({\text {H}} ^{2}(\mathscr {T} ^{})\) norms as

$$\begin{aligned}&\left\| u_h\right\| _{{\text{DG}},{1}}^2 := \left\| \nabla _h u_h\right\| _{{\text {L}} ^{2}(\varOmega )}^2 + h^{-1} \left\| \llbracket u_h\rrbracket \right\| _{{\text {L}} ^{2}(\mathscr {E})}^2, \end{aligned}$$
(62)
$$\begin{aligned}&\left\| u_h\right\| _{{\text{DG}},{2}}^2 := \left\| \mathrm {D}^2_h u_h\right\| _{{\text {L}} ^{2}(\varOmega )}^2 + h^{-1} \left\| \llbracket \nabla _h u_h\rrbracket \right\| _{{\text {L}} ^{2}(\mathscr {E})}^2 + h^{-3} \left\| \llbracket u_h\rrbracket \right\| _{{\text {L}} ^{2}(\mathscr {E})}^2. \end{aligned}$$
(63)

For functions in \(\mathbb {V}\), these norms are equivalent to the corresponding continuous Sobolev norms.
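The broken norms above are sums of element integrals plus mesh-weighted jump terms at faces. As a toy illustration, a 1D analogue of \(\left\| \cdot \right\| _{{\text{DG}},{1}}\) for piecewise-linear functions given by their endpoint values on each element can be sketched as follows (our own example; the choice of weighting each interior face by the left element's size is an assumption for the sketch):

```python
import numpy as np

def broken_h1_norm_sq(nodes, elem_vals):
    """1D analogue of ||u_h||_{DG,1}^2: the piecewise gradient in L2 plus
    h^{-1}-weighted squared jumps at interior faces."""
    h = np.diff(nodes)
    grads = (elem_vals[:, 1] - elem_vals[:, 0]) / h
    volume_part = np.sum(grads**2 * h)               # sum of element integrals
    jumps = elem_vals[1:, 0] - elem_vals[:-1, 1]     # jumps at interior faces
    face_part = np.sum(jumps**2 / h[:-1])            # weight by left element size
    return volume_part + face_part

# u_h = x on (0, 0.5) and u_h = x + 1 on (0.5, 1):
# gradient part = 1, jump of size 1 at x = 0.5 contributes 1 / 0.5 = 2
val = broken_h1_norm_sq(np.array([0.0, 0.5, 1.0]),
                        np.array([[0.0, 0.5], [1.5, 2.0]]))
```

Note how a discontinuity contributes through the \(h^{-1}\)-scaled jump even when the broken gradient is small.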

Proposition 2

(Projection approximation in \(\mathbb {V}\)) Let \({\text {P}}_{k} :{\text {L}} ^{2}(\varOmega ) \rightarrow \mathbb {V}\) be the \({\text {L}} ^{2}(\varOmega )\) orthogonal projection operator defined by (51). Using standard approximation arguments, we have that

$$\left\{\begin{aligned} \begin{aligned} \left\| v - {\text {P}}_{k} v\right\| _{{\text{DG}},{1}}&\leqslant C h^k\left| v\right| _{{\text {H}} ^{k+1}(\varOmega )},\\ \left\| v - {\text {P}}_{k} v\right\| _{{\text {L}} ^{2}(\varOmega )}&\leqslant C h^{k+1}\left| v\right| _{{\text {H}} ^{k+1}(\varOmega )}. \end{aligned} \end{aligned}\right.$$
(64)

Lemma 4

(Stability of \(\varvec{H}\) [37, Theorem 4.10]) Let \(\varvec{H}\) be defined as in Example 1. Then the DG Hessian is stable in the sense that

$$\begin{aligned} \begin{aligned} \left\| \mathrm {D}^2_h v_h - \varvec{H}[v_h]\right\| _{{\text {L}} ^{2}(\varOmega )}^2&\leqslant C\!\left( {\int _\mathscr {E} h^{-1} \left| \llbracket \nabla _h v_h\rrbracket \right| ^2 + h^{-3} \left| \llbracket v_h\rrbracket \right| ^2 \,\mathrm {d}s }\right) . \end{aligned} \end{aligned}$$
(65)

Consequently, we have

$$\begin{aligned} \left\| \varvec{H}[v_h]\right\| _{{\text {L}} ^{2}(\varOmega )}^2 \leqslant C \left\| v_h\right\| _{{\text{DG}},{2}}^2. \end{aligned}$$
(66)

4 A Posteriori Analysis

The discrete bilinear form (49) only makes sense over \({\text {H}} ^{2}\!\left( {\mathscr {T} ^{}}\right) \times {\text {H}} ^{2}\!\left( {\mathscr {T} ^{}}\right)\). To allow for an a posteriori bound, we extend the bilinear form to \({\text {H}} ^{2}\!\left( {\mathscr {T} ^{}}\right) \times {\text {L}} ^{2}\!\left( {\varOmega }\right)\) so that the appropriate stability arguments can be applied. To that end, for \(\left( {u,v}\right) \in {\text {H}} ^{2}\!\left( {\mathscr {T} ^{}}\right) \times {\text {L}} ^{2}\!\left( {\varOmega }\right)\), we define

$$\begin{aligned} \mathscr {A} _h\!\left( {u,v}\right) := \int _\varOmega -{\varvec{A}}{:}{\varvec{H}(u)} v + \int _\mathscr {E} \sigma h_e^{-1} \llbracket u\rrbracket \cdot \llbracket {\text {P}}_{k} v\rrbracket . \end{aligned}$$
(67)

Remark 7

Notice that the modified bilinear form (67) coincides with (49) over \(\mathbb {V}\times \mathbb {V}\) and that it satisfies

$$\begin{aligned} \mathscr {A} _h\!\left( {u,v}\right) \leqslant C \,|\!|\!| {{u}} |\!|\!|_{{\text{DG}},{2}} \left\| v\right\| _{{\text {L}} ^{2}(\varOmega )} \text { for } \!\left( {u,v}\right) \in \!\left( {{\text {H}} ^{2}\!\left( {\mathscr {T} ^{}}\right) \times {\text {L}} ^{2}(\varOmega )}\right) . \end{aligned}$$
(68)

Assumption 2

(Existence of an \({\text {H}} ^{2}\) reconstruction) We will assume there exists an operator \(\mathscr {R}: \mathbb {V}\rightarrow {\text {H}} ^{2}(\varOmega )\) such that

$$\begin{aligned} \,|\!|\!| {{\mathscr {R} (u_h) - u_h}} |\!|\!|_{{\text{DG}},{2}}^2 \leqslant C \!\left( { \left\| h^{-1/2}\llbracket \nabla u_h\rrbracket \right\| _{{\text {L}} ^{2}(\mathscr {E})}^2 + \left\| h^{-3/2}\llbracket u_h\rrbracket \right\| _{{\text {L}} ^{2}(\mathscr {E})}^2 }\right) . \end{aligned}$$
(69)

One such operator is constructed, for \(d=2\), in [26, Lem 3.1] via averaging onto macro-elements of the Hsieh-Clough-Tocher space, and, for \(d=3\), in [9] using virtual element spaces.

Proposition 3

(Abstract upper bound) Let \(u\in {\text {H}} ^{2}(\varOmega )\) solve (1), \(u_h\in \mathbb {V}\) be the finite element approximation given by (48) and \(\mathscr {R} (u_h)\in {\text {H}} ^{2}(\varOmega )\) be a post-processor satisfying (69). Then,

$$\begin{aligned}&\left\| u - \mathscr {R} (u_h)\right\| _{{\text {H}} ^{2}(\varOmega )} \nonumber \\& \leqslant \frac{1}{C} \sup _{v\in {\text {L}} ^{2}(\varOmega )} \frac{\bigg ( l(v) - \mathscr {A} _h\!\left( {u_h,v}\right) + \mathscr {A} _h\!\left( {u_h - \mathscr {R} (u_h),v}\right) + \mathscr {A} _h\!\left( {\mathscr {R} (u_h),v}\right) - \mathscr {A} \!\left( {\mathscr {R} (u_h),v}\right) \bigg ) }{\left\| v\right\| _{{\text {L}} ^{2}(\varOmega )}}. \end{aligned}$$
(70)

Proof

Making use of the Miranda-Talenti inequality [34, 39] and the inf-sup condition (16), we see that

$$\begin{aligned} C\left\| u - \mathscr {R} (u_h)\right\| _{{\text {H}} ^{2}(\varOmega )} \leqslant {\tilde{C}}\left\| \Delta \!\left( {u - \mathscr {R} (u_h)}\right) \right\| _{{\text {L}} ^{2}(\varOmega )} \leqslant \sup _{v\in {\text {L}} ^{2}(\varOmega )} \frac{\mathscr {A} \!\left( {u - \mathscr {R} (u_h),v}\right) }{\left\| v\right\| _{{\text {L}} ^{2}(\varOmega )}}. \end{aligned}$$
(71)

Now, adding and subtracting appropriate terms, we have

$$\begin{aligned} \mathscr {A} \!\left( {u - \mathscr {R} (u_h),v}\right)&= l(v - v_h) - \mathscr {A} _h\!\left( {u_h,v-v_h}\right) \nonumber \\&\quad +\mathscr {A} _h\!\left( {u_h - \mathscr {R} (u_h),v}\right) + \mathscr {A} _h\!\left( {\mathscr {R} (u_h),v}\right) - \mathscr {A} \!\left( {\mathscr {R} (u_h),v}\right), \end{aligned}$$
(72)

and the result follows.

Theorem 3

(A posteriori bound) Let \(u\in {\text {H}} ^{2}(\varOmega )\) solve (1) and \(u_h\in \mathbb {V}\) be the finite element approximation given by (48). Then

$$\begin{aligned} \,|\!|\!| {{u - u_h}} |\!|\!|_{{\text{DG}},{2}} \leqslant C \!\left( {\sum _{K\in \mathscr {T} ^{}}\eta _K^2 + \sum _{e\in \mathscr {E}}\eta _e^2}\right) ^{1/2}, \end{aligned}$$
(73)

where

$$\begin{aligned}&\eta _K^2 := \left\| f + {\varvec{A}}{:}{\varvec{H}(u_h)}\right\| _{{\text {L}} ^{2}(K)}^2, \end{aligned}$$
(74)
$$\begin{aligned}&\eta _e^2 := h_e^{-1} \left\| \llbracket \nabla u_h\rrbracket \right\| _{{\text {L}} ^{2}(e)}^2 + h_e^{-3} \left\| \llbracket u_h\rrbracket \right\| _{{\text {L}} ^{2}(e)}^2. \end{aligned}$$
(75)

Proof

Beginning from Proposition 3 we note that the bound can be split into a residual component \(\mathscr {I} _1\), a nonconformity component \(\mathscr {I} _2\) and an inconsistency component \(\mathscr {I} _3\) as follows:

$$\begin{aligned} \begin{aligned} \left\| u - \mathscr {R} (u_h)\right\| _{{\text {H}} ^{2}(\varOmega )}&\leqslant \frac{1}{C} \left( \sup _{v\in {\text {L}} ^{2}(\varOmega ), \left\| v\right\| _{{\text {L}} ^{2}(\varOmega )}\leqslant 1} \!\left( { l(v) - \mathscr {A} _h\!\left( {u_h,v}\right) }\right) \right. \\&\quad + \sup _{v\in {\text {L}} ^{2}(\varOmega ), \left\| v\right\| _{{\text {L}} ^{2}(\varOmega )}\leqslant 1} \mathscr {A} _h\!\left( {u_h - \mathscr {R} (u_h),v}\right) \\&\quad \left. + \sup _{v\in {\text {L}} ^{2}(\varOmega ), \left\| v\right\| _{{\text {L}} ^{2}(\varOmega )}\leqslant 1} \mathscr {A} _h\!\left( {\mathscr {R} (u_h),v}\right) - \mathscr {A} \!\left( {\mathscr {R} (u_h),v}\right) \right) \\&=: \mathscr {I} _1 + \mathscr {I} _2 + \mathscr {I} _3. \end{aligned} \end{aligned}$$
(76)

Now, we proceed to bound these term by term. In view of the Cauchy-Schwarz inequality, we have

$$\begin{aligned} \begin{aligned} \mathscr {I} _1&\leqslant \sup _{v\in {\text {L}} ^{2}(\varOmega ), \left\| v\right\| _{{\text {L}} ^{2}(\varOmega )}\leqslant 1}\int _\varOmega \!\left( {f + {\varvec{A}}{:}{\varvec{H}(u_h)}}\right) v \\&\leqslant \!\left( {\sum _{K\in \mathscr {T} ^{}} \left\| f + {\varvec{A}}{:}{\varvec{H}(u_h)}\right\| _{{\text {L}} ^{2}(K)}^2}\right) ^{1/2} . \end{aligned} \end{aligned}$$
(77)

For the nonconformity term, we note that

$$\begin{aligned} \mathscr {I} _2 \leqslant C \,|\!|\!| {{\mathscr {R} (u_h) - u_h}} |\!|\!|_{{\text{DG}},{2}} \leqslant C \!\left( { \sum _{e \in \mathscr {E}} h_e^{-1} \left\| \llbracket \nabla _h u_h\rrbracket \right\| ^2_{{\text {L}} ^{2}(e)} + h_e^{-3} \left\| \llbracket u_h\rrbracket \right\| ^2_{{\text {L}} ^{2}(e)} }\right) ^{1/2}, \end{aligned}$$
(78)

by the properties of the reconstruction given in (69).

For the inconsistency term, we have that

$$\begin{aligned} \begin{aligned} \mathscr {I} _3&= \sup _{v\in {\text {L}} ^{2}(\varOmega ), \left\| v\right\| _{{\text {L}} ^{2}(\varOmega )}\leqslant 1} \int _\varOmega {\varvec{A}}{:}{\!\left( {\varvec{H}(\mathscr {R} (u_h)) - \mathrm {D}^2\mathscr {R} (u_h)}\right) } v \\&\leqslant C \left\| \varvec{A}\right\| _{{\text {L}} ^{\infty }(\varOmega )} \left\| \varvec{H}(\mathscr {R} (u_h)) - \mathrm {D}^2\mathscr {R} (u_h)\right\| _{{\text {L}} ^{2}(\varOmega )} \\&\leqslant C \left\| \varvec{A}\right\| _{{\text {L}} ^{\infty }(\varOmega )} \bigg ( \left\| \varvec{H}(\mathscr {R} (u_h)) - \varvec{H}(u_h)\right\| _{{\text {L}} ^{2}(\varOmega )} + \left\| \varvec{H}(u_h) - \mathrm {D}^2_h u_h\right\| _{{\text {L}} ^{2}(\varOmega )} \\&\quad + \left\| \mathrm {D}^2_h u_h - \mathrm {D}^2\mathscr {R} (u_h)\right\| _{{\text {L}} ^{2}(\varOmega )} \bigg ). \end{aligned} \end{aligned}$$
(79)

Making use of Lemma 4 and the properties of the reconstruction (69), we see that

$$\begin{aligned} \begin{aligned} \left\| \varvec{H}(\mathscr {R} (u_h)) - \varvec{H}(u_h)\right\| _{{\text {L}} ^{2}(\varOmega )}&\leqslant C\,|\!|\!| {{\mathscr {R} (u_h) - u_h}} |\!|\!|_{{\text{DG}},{2}} \\&\leqslant C \!\left( { \sum _{e \in \mathscr {E}} h_e^{-1} \left\| \llbracket \nabla _h u_h\rrbracket \right\| ^2_{{\text {L}} ^{2}(e)} + h_e^{-3} \left\| \llbracket u_h\rrbracket \right\| ^2_{{\text {L}} ^{2}(e)} }\right) ^{1/2}. \end{aligned} \end{aligned}$$
(80)

Again from Lemma 4 we have that

$$\begin{aligned} \left\| \varvec{H}(u_h) - \mathrm {D}^2_h u_h\right\| _{{\text {L}} ^{2}(\varOmega )} \leqslant C \!\left( { \sum _{e \in \mathscr {E}} h_e^{-1} \left\| \llbracket \nabla _h u_h\rrbracket \right\| ^2_{{\text {L}} ^{2}(e)} + h_e^{-3} \left\| \llbracket u_h\rrbracket \right\| ^2_{{\text {L}} ^{2}(e)} }\right) ^{1/2}, \end{aligned}$$
(81)

and finally (69) gives that

$$\begin{aligned} \left\| \mathrm {D}^2_h u_h - \mathrm {D}^2\mathscr {R} (u_h)\right\| _{{\text {L}} ^{2}(\varOmega )} \leqslant C \!\left( { \sum _{e \in \mathscr {E}} h_e^{-1} \left\| \llbracket \nabla _h u_h\rrbracket \right\| ^2_{{\text {L}} ^{2}(e)} + h_e^{-3} \left\| \llbracket u_h\rrbracket \right\| ^2_{{\text {L}} ^{2}(e)} }\right) ^{1/2}. \end{aligned}$$
(82)

Hence, we have that

$$\begin{aligned} \begin{aligned} \mathscr {I} _3&\leqslant C \!\left( { \sum _{e \in \mathscr {E}} h_e^{-1} \left\| \llbracket \nabla _h u_h\rrbracket \right\| ^2_{{\text {L}} ^{2}(e)} + h_e^{-3} \left\| \llbracket u_h\rrbracket \right\| ^2_{{\text {L}} ^{2}(e)} }\right) ^{1/2}. \end{aligned} \end{aligned}$$
(83)

Collecting (77), (78) and (83) yields the desired result.

Proposition 4

(A posteriori lower bound) Using the notation of Theorem 3, through standard a posteriori techniques, we have a lower bound of the form

$$\begin{aligned} \eta _K + \sum _{e\in \partial K} \eta _e \leqslant C \!\left( {\left\| \mathrm {D}^2u - \varvec{H}[u_h]\right\| _{{\text {L}} ^{2}(\hat{K})} + \sum _{K'\subset \hat{K}} \eta _{I,K'} }\right) , \end{aligned}$$
(84)

where

$$\begin{aligned} \eta _{I,K}^2 := \left\| \!\left( {P_{k-2}\varvec{A}- \varvec{A}}\right) : {\varvec{H}(u_h)}\right\| _{{\text {L}} ^{2}(K)}^2 + \left\| f - P_{k-2}f\right\| _{{\text {L}} ^{2}(K)}^2. \end{aligned}$$
(85)

5 Numerical Experiments

In this section, we detail numerical experiments carried out in the finite element package Dune-Fem [15] which is based on the Dune software framework [6, 7]. The code makes use of the newly developed Python frontend [15] and the unified form language [3] is used to provide the problem data. The code will be made freely available within the Dune-Fem-tutorial in a future release [13].

We present some benchmark problems designed such that the exact solution is known. In each of the experiments, the domain is \(\varOmega = [0,1]^2\) and we consider the coefficient matrix

$$\begin{aligned} \varvec{A}({\varvec{x}}) = \begin{bmatrix} 1 & b({\varvec{x}}) \\ b({\varvec{x}}) & a({\varvec{x}}) \end{bmatrix} \end{aligned}$$
(86)

varying \(a({\varvec{x}})\) and \(b({\varvec{x}})\). We study three different choices using \(\varvec{x} = \!\left( {x_1,x_2}\right)\) as follows.

  1)

    (Coercive) In this test, we take the components of \(\varvec{A}\) such that the differential operator can be written in variational form and is coercive, fitting into a standard analytical framework:

    $$\begin{aligned} a(\varvec{x})&= 1-\ln \!\left( {\!\left( {x_1-1/2}\right) ^2+10^{{-4}}}\right), \end{aligned}$$
    (87)
    $$\begin{aligned} b(\varvec{x})&= 0. \end{aligned}$$
    (88)
  2)

    (Continuous not \(H^2\)) In this test, we take \(\varvec{A}\) comparable to [32, §4.4]:

    $$\begin{aligned} a(\varvec{x})&= 2, \end{aligned}$$
    (89)
    $$\begin{aligned} b(\varvec{x})&= \!\left( {x_1^2x_2^2}\right) ^{1/3}. \end{aligned}$$
    (90)
  3)

    (Discontinuous not \(W^{1,\infty }\)) In our third test, a is discontinuous and not in \(W^{1,\infty }\). We take \(b\equiv 0\) and choose

    $$\begin{aligned} a(\varvec{x})=\alpha (|\varvec{x}|_\infty ) \end{aligned}$$
    (91)

    with

    $$\begin{aligned} \alpha (s)&= \frac{1}{100} + 1\,000 {\left\{ \begin{array}{ll} \sqrt{\frac{1}{4}-s}, &{} \text { if } s<\frac{1}{4}, \\ \cos {\uppi s}, &{} \text { otherwise. } \end{array}\right. } \end{aligned}$$
    (92)

    Note that the initial grid is chosen so that it aligns with the discontinuity.
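The coefficient (91)-(92) is straightforward to transcribe; the following direct Python implementation (our own, for reference) makes the discontinuity at \(s=1/4\) explicit, where the square-root branch tends to zero while the cosine branch does not:

```python
import numpy as np

def alpha(s):
    """Radial profile (92): square-root branch for s < 1/4, cosine branch
    otherwise; the branches do not meet at s = 1/4, so a is discontinuous."""
    s = np.asarray(s, dtype=float)
    branch = np.where(s < 0.25,
                      np.sqrt(np.maximum(0.25 - s, 0.0)),  # clip guards the sqrt
                      np.cos(np.pi * s))
    return 0.01 + 1000.0 * branch

def a(x):
    # a(x) = alpha(|x|_inf) as in (91); x has shape (..., d)
    return alpha(np.max(np.abs(x), axis=-1))
```

For instance, \(\alpha(0) = 1/100 + 1\,000\sqrt{1/4} = 500.01\), while just beyond \(s=1/4\) the value jumps to roughly \(1\,000\cos(\uppi/4)\approx 707\).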

We study the behaviour of our method for polynomial degrees \(k=1,2,3\), choosing the forcing such that the exact solution is given by the following two choices.

  1)

    (Smooth solution): \(u(\varvec{x}) = \sin {2\uppi x_1} \sin {2\uppi x_2}\).

  2)

    (\(H^2{\setminus } H^3\) solution): \(u(\varvec{x}) = {\left\{ \begin{array}{ll} \frac{1}{4}\Big (\cos {8\uppi \left| \varvec{x}-\frac{1}{2}\right| ^2}+1\Big ), &{} \text { if } \left| \varvec{x}-\frac{1}{2}\right| ^2 \leqslant \frac{1}{8}, \\ 0, &{} \text { otherwise. } \end{array}\right. }\)

The penalty parameter is chosen as \(\sigma = \lambda _{\varvec{A},{\mathrm {max}}} 5k(k+1)\) where \(\lambda _{\varvec{A},{\mathrm {max}}}\) is the maximum eigenvalue of \(\varvec{A}\).
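This penalty rule is a one-line computation once the largest eigenvalue of \(\varvec{A}\) is known; a small sketch follows (estimating \(\lambda _{\varvec{A},{\mathrm {max}}}\) from a finite set of sample points of \(\varvec{A}\) is our assumption, since the text does not specify where the eigenvalue is evaluated):

```python
import numpy as np

def penalty(A_samples, k):
    """sigma = lambda_{A,max} * 5 k (k + 1), estimating lambda_{A,max} as the
    largest eigenvalue over the symmetric samples A_samples, shape (n, d, d)."""
    lam_max = np.linalg.eigvalsh(A_samples).max()  # batched symmetric eigensolve
    return lam_max * 5.0 * k * (k + 1)

# for A = diag(1, 2) and quadratics (k = 2): sigma = 2 * 5 * 2 * 3 = 60
sigma = penalty(np.array([[[1.0, 0.0], [0.0, 2.0]]]), k=2)
```

Scaling the penalty with \(\lambda _{\varvec{A},{\mathrm {max}}}\) keeps the jump stabilisation comparable to the size of the second-order term as the diffusion coefficient varies.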

Remark 8

(Compatibility of spaces for \(u_h\) and \(\varvec{H}(u_h)\)) Note that it is not actually required that \(u_h\) and \(\varvec{H}(u_h)\) be represented in the same finite element space. Computationally, we observe similar results when the space for \(\varvec{H}(u_h)\) is one degree lower than that of \(u_h\), and suboptimal convergence rates when it is two degrees lower. Other choices do not appear to be stable.

Fig. 1
figure 1

Convergence rates for the error measured in \({\text {H}} ^{1}(\varOmega )\) and \({\text {L}} ^{2}(\varOmega )\) for \(k=1\), testing the case when u is either a prescribed smooth solution or \(u\in {\text {H}} ^{2}(\varOmega )\setminus {\text {H}} ^{3}(\varOmega )\). We also test three different choices for the diffusion coefficient \(\varvec{A}\): coercive, continuous but not \(H^2\), and discontinuous. The rates are optimal in \({\text {H}} ^{1}(\varOmega )\) in all cases and in \({\text {L}} ^{2}(\varOmega )\) in most cases. Note we present neither the \({\text {H}} ^{2}(\varOmega )\) rates nor the estimate, since neither converges for \(k=1\)

Fig. 2
figure 2

Convergence rates for the error measured in \({\text {H}} ^{2}(\varOmega )\), \({\text {H}} ^{1}(\varOmega )\) and \({\text {L}} ^{2}(\varOmega )\) for \(k=2\), together with the estimator given in Theorem 3. We test the case when u is either a prescribed smooth solution or \(u\in {\text {H}} ^{2}(\varOmega )\setminus {\text {H}} ^{3}(\varOmega )\). We also test three different choices for the diffusion coefficient \(\varvec{A}\): coercive, continuous but not \(H^2\), and discontinuous. The rates are optimal in \({\text {H}} ^{2}(\varOmega )\) and \({\text {H}} ^{1}(\varOmega )\) in all cases and in \({\text {L}} ^{2}(\varOmega )\) in most cases. When the solution is not smooth, the rates slow to an expected rate in line with the regularity of u. In all cases, the estimate is efficient and robust

Fig. 3
figure 3

Convergence rates for the error measured in \({\text {H}} ^{2}(\varOmega )\), \({\text {H}} ^{1}(\varOmega )\) and \({\text {L}} ^{2}(\varOmega )\) for \(k=3\), together with the estimator given in Theorem 3. We test the case when u is either a prescribed smooth solution or \(u\in {\text {H}} ^{2}(\varOmega )\setminus {\text {H}} ^{3}(\varOmega )\). We also test three different choices for the diffusion coefficient \(\varvec{A}\): coercive, continuous but not \(H^2\), and discontinuous. The rates are optimal in \({\text {H}} ^{2}(\varOmega )\) and \({\text {H}} ^{1}(\varOmega )\) in all cases and in \({\text {L}} ^{2}(\varOmega )\) in most cases. When the solution is not smooth, the rates slow to an expected rate in line with the regularity of u. In all cases, the estimate is efficient and robust

Benchmarking for these tests is shown in Figs. 1, 2 and 3. For linear polynomials, the \({\text {H}} ^{2}(\mathscr {T} ^{})\)-error does not converge since the piecewise Hessian of \(u_h\) vanishes, so we only show the errors in the \({\text {L}} ^{2}(\varOmega )\) and \({\text {H}} ^{1}(\mathscr {T} ^{})\) norms. For the smooth solution (left column), we clearly see that the method converges optimally in both norms, while for the less smooth solution convergence is slowed, as would be expected when approximating an \({\text {H}} ^{2}(\varOmega )\) solution. For polynomial orders 2 and 3, convergence is optimal in all norms studied here when approximating a smooth solution, and between 1 and 1.5 for the \({\text {H}} ^{2}(\varOmega )\) solution. The convergence rate hardly depends on the smoothness of \(\varvec{A}\). The only case where a clear dependency is visible is in the \({\text {L}} ^{2}(\varOmega )\) errors for quadratic polynomials with the discontinuous \(\varvec{A}\). In this case, the order of the \({\text {L}} ^{2}(\varOmega )\) error seems to equal the convergence rate in the \({\text {H}} ^{1}(\mathscr {T} ^{})\) norm, i.e., it is not optimal (see Fig. 2).

We also study the condition number of the system matrix generated when assembling (49). The results are shown in Fig. 4. In contrast to many other methods, which rewrite the nonvariational model as a fourth-order problem, the condition number depends on the grid spacing in the same way as it does for a second-order variational problem, i.e., it is \(O(h^{-2})\).
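The \(O(h^{-2})\) growth is the familiar second-order scaling; it can be checked on the simplest model problem, the 1D finite difference Laplacian (an illustrative stand-in of ours, not the paper's DG system matrix):

```python
import numpy as np

def laplacian_cond(n):
    """Condition number of the n x n tridiagonal 1D Laplacian (h = 1/(n + 1))."""
    A = 2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)
    return np.linalg.cond(A)

# roughly halving h should roughly quadruple the condition number: O(h^{-2})
ratio = laplacian_cond(32) / laplacian_cond(16)
```

A fourth-order reformulation would instead exhibit \(O(h^{-4})\) growth, which is the practical advantage the paragraph above points out.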

Fig. 4
figure 4

Condition number estimates of the system matrix given by (49). Notice that the condition number grows in the same asymptotic fashion as for the IP-DG method for the Laplacian (\(\varvec{A}= \varvec{I}\)). Also, the complexity does not change asymptotically as k is increased

Finally, we study the behaviour of the residual error indicator and an adaptive scheme for the approximation of solutions to these problems. The indicator is included in Figs. 2 and 3, where its reliability is clearly visible. A comparison of these globally refined simulations with an adaptive simulation, using an equal-distribution strategy for locally refining the grid, is given in Figs. 5 and 6. As expected, no advantage is gained when approximating a smooth solution. For the \({\text {H}} ^{2}(\varOmega )\) solution, the adaptive simulation approximately doubles the convergence rate of the scheme.

Fig. 5
figure 5

Comparison of simulations carried out on adaptive and globally refined grids, driven by the estimate from Theorem 3. We test the case when u is either a prescribed smooth solution or \(u\in {\text {H}} ^{2}(\varOmega )\setminus {\text {H}} ^{3}(\varOmega )\) and when \(\varvec{A}\) is discontinuous. For the smooth solution, we find the uniform and adaptive schemes behave similarly; for the solution that is not \({\text {H}} ^{3}(\varOmega )\), the adaptive scheme clearly outperforms the uniform one, although there is no gain from using \(k=3\)

Fig. 6
figure 6

Convergence of estimator under adaptive and globally refined grids. Here, we consider the case when \(\varvec{A}\) is discontinuous, shown in Fig. 7a, and the forcing is chosen \(f=100\). No exact solution is known for this case. There is a clear indication that the adaptive scheme outperforms the uniform one

We conclude with a simulation for which we do not have an exact solution. We use the discontinuous \(\varvec{A}\) and choose a constant forcing \(f\equiv 1\,000\). As before, the boundary conditions are zero. Results are shown in Fig. 6, where we show the convergence of the residual indicator under global and local refinement. A visualisation of the function a, the resulting discrete solution and the adaptive grid is given in Fig. 7.

Fig. 7
figure 7

Visualisation of the problem coefficient, the solution and the underlying adaptive mesh refinement level. Notice the algorithm refines where the problem data are discontinuous as well as near boundary layers caused by the anisotropy of the diffusion tensor

6 Conclusions and Outlook

In this work, we have extended the framework from [32] for linear nonvariational problems to incorporate discontinuous approximations. We have derived a posteriori bounds for this problem and shown that they are useful for driving adaptive algorithms. We would like to point out that the approach presented here can be applied directly to the case where the approximation \(u_h\) is chosen in a continuous finite element space but the finite element Hessian is defined in the discontinuous fashion described here.

In the numerical experiments, we note that the method is well posed and converges optimally even for discontinuous \(\varvec{A}\). The method is also well suited to nonlinear problems [33]; this will be the topic of ongoing research.