Weighted Error Estimates for Transient Transport Problems Discretized Using Continuous Finite Elements with Interior Penalty Stabilization on the Gradient Jumps

Burman, Erik

doi:10.1007/s10013-022-00550-x

Weighted Error Estimates for Transient Transport Problems Discretized Using Continuous Finite Elements with Interior Penalty Stabilization on the Gradient Jumps

Original Article
Open access
Published: 08 March 2022

Volume 50, pages 833–866, (2022)
Cite this article

Download PDF

You have full access to this open access article

Vietnam Journal of Mathematics Aims and scope Submit manuscript

Weighted Error Estimates for Transient Transport Problems Discretized Using Continuous Finite Elements with Interior Penalty Stabilization on the Gradient Jumps

Download PDF

Erik Burman ORCID: orcid.org/0000-0003-4287-7241¹

1721 Accesses
3 Citations
1 Altmetric
Explore all metrics

Abstract

In this paper we consider the semi-discretization in space of a first order scalar transport equation. For the space discretization we use standard continuous finite elements with a stabilization consisting of a penalty on the jump of the gradient over element faces. We recall some global error estimates for smooth and rough solutions and then prove a new local error estimate for the transient linear transport equation. In particular we show that for the stabilized method the effect of non-smooth features in the solution decay exponentially from the space time zone where the solution is rough so that smooth features will be transported unperturbed. Locally the L²-norm error converges with the expected order $O(h^{k+\frac 12})$, if the exact solution is locally smooth. We then illustrate the results numerically. In particular we show the good local accuracy in the smooth zone of the stabilized method and that the standard Galerkin fails to approximate a solution that is smooth at the final time if underresolved features have been present in the solution at some time during the evolution.

Difference scheme for an initial–boundary value problem for a singularly perturbed transport equation

Article 28 November 2017

Error analysis for discretizations of parabolic problems using continuous finite elements in time and mixed finite elements in space

Article Open access 20 June 2017

An adaptive algorithm for the transport equation with time dependent velocity

Article Open access 28 August 2020

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

The discretization of transport problems has traditionally been dominated by discontinuous Galerkin methods or finite volume methods, typically of low order, since the continuous Galerkin method is known to have robustness problems for first order partial differential equations (see [23, Chapter 5]), or convection–diffusion equations in the convection dominated regime. In certain situations the use of high order continuous Galerkin methods is appealing, for instance in the case of convection–diffusion equations, in particular where the diffusion is nonlinear, or more complex situations such as large eddy simulation of turbulent flows, where the pressure-velocity coupling can be decoupled using a pressure projection method and the convective part handled explicitly. In such situations, if continuous finite element spaces are used, one must resort to a stabilized method to avoid a reduction of accuracy due to spurious oscillations. There is a very wide literature on stabilized methods and for an overview of the topic see for example [24]. In the high order case, the Spectral Vanishing Velocity method has been a popular choice [34,35,36], but other methods have also been designed to work for high order, see the discussion in [17]. In this work we will focus on the continuous interior penalty (CIP) stabilization, that was shown to allow for close to hp-optimal error estimates in the high Peclet regime in [10]. Recently [37] this method was applied to under resolved simulations of turbulent flows using high order polynomial approximation and shown to perform very well in this context. Therein an eigenanalysis was performed which showed that the CIP finite element method has similar advantageous dispersion properties as the discontinuous Galerkin method (see also the report [19]) and in the computations it was verified that its numerical dissipation was less important than that of the spectral vanishing viscosity.

Ideally stability of the finite element method should match that of the continuous problem. This is typically, by and large, true for elliptic pde, but much harder to achieve in the hyperbolic case. Indeed, this would mean satisfaction of a discrete maximum principle and stability and error estimates in L¹. Both which typically remain open questions. Herein we will only consider the stability in the L²-norm for continuous finite element approximations and linear symmetric stabilization of gradient penalty type applied to the transient scalar, linear first order equation. The analysis will mainly focus on semi discretization in space on periodic domains, but the extension to the fully discrete case and weakly imposed boundary conditions will be sketched. The classical estimate for smooth solutions that is proven for stabilized finite element methods is on the form

$$ \|(u- u_{h})(\cdot,T)\|_{{\varOmega}} \leq C(u) h^{k+{\frac12}}, $$

(1.1)

where C(u) is a constant that depends on Sobolev norms of the exact solution and on equation data, h is the mesh-size and k the polynomial order. This estimate that is suboptimal by $h^{\frac 12}$ is known to be sharp on general meshes [38] (see also [7] for the sharpness of the estimate for the CIP method). The continuous Galerkin method without stabilization, however, only admits a bound of order h^k. The lost factor $h^{\frac 12}$ is of little consequence for smooth solutions, and high polynomial order. However for low polynomial order or rough solutions it becomes significant. In Section 4 below, we prove this type of error estimate and some variations in weak norm for rough solutions. This analysis uses ideas from [9, 12]. Some remarks on the time discretization will be added in Section 4.2. In particular we will point out the situations where the stabilization actually improves the stability of time stepping methods.

The estimate (1.1) is a weak result, but it has become a proxy for stronger estimates that give convergence also of the material derivative (see [13, 27] and Theorem 2 below) and importantly, local estimates, using weighted norms, well known in the stationary case [14, 28, 32, 33]. In the context of time dependent problems such a weighted estimate takes the form

$$ \|\varpi (u - u_{h})(\cdot,T)\|_{{\varOmega}} \leq C h^{k+\frac12} \left( {{\int}_{0}^{T}} \|\varpi D^{k+1} u\|_{{\varOmega}}^{2} ~\mathrm{d} t \right)^{\frac12}, $$

(1.2)

where D^m is a multi-index differential operator of order m and the ϖ is a weight function that is aligned with the characteristics and decays exponentially away from some zone of interest. This means that if ϖ = 1 in some zone where the solution is smooth the influence of locally large derivatives and underresolution at some distance d from this zone will be damped with a factor $e^{-d/\sqrt {h}}$. We prove such an estimate in Section 5 for the space semi-discretized stabilized formulation. To the best of my knowledge there are no previous such estimates for continuous finite element methods using symmetric stabilization. For earlier works on Streamline Upwind Petrov–Galerkin methods (SUPG) in this direction see [20, 44]. The approach in [44] relies strongly on the space time finite element discretisation and an additional artificial viscosity term and in [20] the authors consider the SUPG method together with a first order backward differentiation in time, on a form that can not easily be extended to higher order time-discretizations. In neither case can the arguments be applied independently of the time discretization. In this paper we apply the ideas from [14] where weighted estimates were proved for the stationary convection–diffusion equation with CIP-stabilization and [16], where they were applied to an inverse boundary value problem subject to a convection–diffusion equation. The result is presented in detail for the semi-discretized case only, but the extension to standard stable time discretizations is sketched. The results can also be extended to the case of convection–diffusion equations with Neumann conditions on the outflow boundary, by straightforward addition of the diffusive terms and following the argument of [14].

In the numerical section (Section 6) we will illustrate this localization property of the error and show that it is not shared by the standard (unstabilized) Galerkin finite element method. Indeed, as we shall see, without stabilization Galerkin FEM fails to approximate even smooth solutions satisfactory in case the solution has had non-smooth features at any time during the computation. Indeed it appears that the standard Galerkin method does not propagate underresolved features of the solution with the right speed, making it impossible for the method to evacuate high frequency content from the computational domain. For the stabilized method on the other hand the weighted estimate (1.2) guarantees that smooth components of the solution are untainted by spurious high frequency content at all times, since perturbations are damped exponentially when crossing the characteristics.

2 Model Problem and Finite Element Discretization

We will discuss a first order hyperbolic problem in a periodic domain, Ω = [−L,L]ⁿ, where n ≥ 1 is the space dimension. Let $\boldsymbol {\beta } \in C^{0}([0,T];[C^{m}(\bar {\varOmega })]^{n})$, m ≥ 1, be a periodic vector field satisfying ∇⋅β = 0 and consider the first order hyperbolic problem

$$ \begin{array}{@{}rcl@{}} \mathcal{L} u := \partial_{t} u + \boldsymbol{\beta} \cdot \nabla u & = &f \quad~\text{ in } (0,T) \times {\varOmega}, \end{array} $$

(2.1)

$$ \begin{array}{@{}rcl@{}} u(\cdot,0) & =& u_{0} \quad \text{ in } {\varOmega}. \end{array} $$

(2.2)

For smooth data β, u₀ and f there exists a unique solution by the method of characteristics, but the problem admits a unique solution also for more rough data [26]. The solution satisfies the following regularity estimate (a proof of this can be obtained after minor modifications of [5, Lemma 2]),

$$ \|u(t)\|_{H^{j}({\varOmega})} \leq C_{\beta} \left( \|f\|_{L^{2}((0,T);H^{j}({\varOmega}))} + \|u_{0}\|_{H^{j}({\varOmega})}\right), \quad t>0, \quad j \ge 0 \text{ when $m\ge j$}. $$

(2.3)

Below we will always assume that β is smooth enough for (2.3) to hold. The constant C_β grows exponentially in time, with coefficient dependent on the sup-norm of β, and its derivatives of order up to j. Below the notation $\beta _{\infty } = \sup _{x \in \bar {\varOmega }}|\boldsymbol {\beta }(x)|$ will be used. The L²-norm over a domain X ⊂Ω will be denoted by $\|\cdot \|_{X} = (\cdot ,\cdot )_{X}^{\frac 12}$, where $(\cdot ,\cdot )_{X}^{\frac 12}$ is the L²-scalar product over X, also $\|\cdot \|_{\infty }$ will denote the norm on $C^{0}(\bar {\varOmega })$.

Let $\{\mathcal {T}\}_{h}$ be a family of shape regular decompositions of Ω in simplices S, $\mathcal {T} = \{S\}$, indexed by the (uniform) mesh size h. Let $\mathcal {F}$ denote the set of faces of $\mathcal {T}$. C will denote a generic constant that can have different value at each appearance, but is always independent of the mesh-parameter h. Now define the finite element space

$$ V_{h} := \left\{v \in H_{per}^{1}({\varOmega}): v\vert_{S} \in \mathbb{P}_{k}(S), \text{ for all } S \in \mathcal{T}\right\}, $$

where $\mathbb {P}_{k}(S)$ denotes the set of polynomials of degree less than or equal to k on S and $H^{1}_{per}({\varOmega })$ denotes the set of periodic functions in H¹ on Ω. We may then write a semi-discretization in space, for t > 0 find u_h(t) ∈ V_h, with u_h(0) = π_hu₀, such that

$$ (\mathcal{L} u_{h}(t),v_{h})_{{\varOmega}} =F(v_{h}), \quad \forall v_{h} \in V_{h} $$

(2.4)

where F(v_h) := (f,v_h)_Ω. Above π_h denotes the L²-projection onto the finite element space V_h. For all v ∈ L²(Ω), π_hv ∈ V_h satisfies

$$ (\pi_{h} v,w_{h})_{{\varOmega}} = (v,w_{h})_{{\varOmega}},\quad \forall w_{h} \in V_{h}. $$

It is well known that on locally quasi-uniform meshes the L²-projection satisfies the approximation bound,

$$ \|v - \pi_{h} v\|_{{\varOmega}} + h \|\nabla (v - \pi_{h} v) \|_{{\varOmega}} \leq C h^{k+1} |v|_{H^{k+1}({\varOmega})}, \quad \forall v \in H^{k+1}({\varOmega}). $$

The formulation (2.4) defines a dynamical system that admits a unique solution for m ≥ 0 using standard techniques. Taking v_h = u_h in (2.4) and integrating in time we see that (2.4) satisfies the bound (2.3) with j = 0

$$ \|u_{h}(t)\|_{{\varOmega}} \leq C_{\beta} \|f\|_{L^{2}((0,T);{\varOmega})} + \|u_{0}\|_{{\varOmega}}, \quad t>0. $$

(2.5)

Since ∇⋅β = 0 the bound holds with $C_{\beta } = T^{\frac 12}$. Actually a stronger results holds for the L²-norm when the norm on f is weakened. Indeed one may use that

$$ {{\int}_{0}^{T}} (f,u_{h})_{{\varOmega}} ~\mathrm{d}t + \|u_{0}\|_{{\varOmega}}^{2}\leq \sup_{t \in (0,T)} \|u_{h}(t)\|_{{\varOmega}} \left( \|f\|_{L^{1}((0,T);L^{2}({\varOmega}))}+\|u_{0}\|_{{\varOmega}}\right) $$

to show that

$$ \sup_{t \in (0,T)} \|u_{h}(t)\|_{{\varOmega}} \leq \|f\|_{L^{1}((0,T);L^{2}({\varOmega}))} + \|u_{0}\|_{{\varOmega}}. $$

However (2.3) does not hold for u_h for j = 1. A natural question to ask is then if the solution to (2.4) gives any control of the derivatives. In case f ∈ L²((0,T);Ω) the immediate control offered by (2.1) is ${\mathscr{L}} u\in L^{2}((0,T);{\varOmega })$, that is the material derivative is bounded in L². For (2.4) we get the corresponding bound $\pi _{h} {\mathscr{L}} u_{h}\in L^{2}((0,T);{\varOmega })$. Since ${\mathscr{L}} u_{h}$ may be discontinuous over element faces (due to the presence of derivatives in space) and V_h ∈ C⁰(Ω), we see that $\pi _{h} {\mathscr{L}} u_{h} \ne {\mathscr{L}} u_{h}$. It follows that not even this weakest measure of derivatives of u is controlled by (2.4). However since we are looking for control in a discrete space we can use norm equivalence on discrete spaces in the form of the inverse inequality [3, Lemma 4.5.3],

$$ \|\nabla u_{h}\|_{S} \leq C h^{-1}\|u_{h}\|_{S},\quad \forall S \in \mathcal{T} $$

(2.6)

and observing that ∂_tu_h ∈ V_h, we see that

$$ \|\mathcal{L} u_{h}\|_{{\varOmega}} \leq \|\pi_{h} \mathcal{L} u_{h} \|_{{\varOmega}} + C\beta_{\infty} h^{-1} \|u_{h}\|_{{\varOmega}}. $$

(2.7)

Combining (2.7) with the bound (2.5)

$$ \|\mathcal{L} u_{h}\|_{L^{2}((0,T);{\varOmega})} \leq (1+C_{\beta} h^{-1}) \left( \|f\|_{L^{2}((0,T);{\varOmega})} + \|u_{0}\|_{{\varOmega}}\right). $$

So the constant in the control of the material derivative grows as O(h^− 1) under mesh refinement. Hence there is no improvement compared to obtaining an H¹ estimate by combining the L²-stability of (2.5) with (2.6).

The rationale for the addition of stabilizing terms is to improve the control of derivatives of u_h. As an example of stabilization we here propose the gradient penalty term, introduced in [21] and shown to result in improved robustness and error estimates for convection dominated flows in [15],

$$ s(w_{h},v_{h}) = {\sum}_{F \in \mathcal{F}} \left<{h_{F}^{2}} |\boldsymbol{\beta}| [\![\nabla u_{h}]\!], [\![\nabla v_{h}]\!] \right>_{F} $$

(2.8)

where $\left <u,v\right >_{F} = {\int \limits }_{F} uv ~\mathrm {d} s$, $[\![\nabla v_{h}]\!]\vert _{F} = \nabla v_{h}\vert _{F \cap \partial S_{1}} \cdot n_{1} +\nabla v_{h}\vert _{F \cap \partial S_{2}} \cdot n_{2}$ for $F = \bar S_{1} \cap \bar S_{2}$ and n₁ and n₂ denote the outward pointing normals of the simplices S₁ and S₂ respectively. To reduce the amount of crosswind diffusion the |β| factor may be replaced by |β ⋅ n|. Define the stabilization semi norm by

$$ |w_{h}|_{s} := s(w_{h},w_{h})^{\frac12}. $$

Also recall the following inverse inequality

$$ |w_{h}|_{s} \leq C h^{-\frac12} \beta_{\infty}^{\frac12} \|w_{h}\|_{{\varOmega}}, \quad \forall w_{h} \in V_{h} $$

(2.9)

which is a consequence of the scaled trace inequality, [3, Theorem 1.6.6],

$$ \|v\|_{\partial S} \leq C_{S} \left( h^{-\frac12} \|v\|_{S} + h^{\frac12}\|\nabla v\|_{S}\right), \quad \forall v \in H^{1}(S) $$

(2.10)

and (2.6).

The enhanced control of derivatives offered by this stabilization term can be expressed as

$$ \inf_{v_{h} \in V_{h}} \|h^{\frac12}(\boldsymbol{\beta} \cdot \nabla u_{h} - v_{h})\|_{{\varOmega}}^{2} \leq C_{s}\left( \beta_{\infty} |u_{h}|^{2}_{s} + h \|\nabla \boldsymbol{\beta}\|^{2}_{\infty} \|u_{h}\|_{{\varOmega}}^{2}\right). $$

(2.11)

This is an immediate consequence of the local estimate of [10, Lemma 5.3] and local approximation of β using lowest order Raviart–Thomas functions (for details see the discussion [13, Page 4]). In particular this implies (since ∂_tu_h ∈ V_h) that

$$ \|\mathcal{L} u_{h}\|_{{\varOmega}} \leq C \|\pi_{h} \mathcal{L} u_{h}\|_{{\varOmega}} + C^{\frac12}_{s} \left( h^{-\frac12} \beta_{\infty}^{\frac12} |u_{h}|_{s} + \|\nabla \boldsymbol{\beta}\|_{\infty} \|u_{h}\|_{{\varOmega}}\right). $$

(2.12)

It follows that when the finite element method has the additional stability offered by the operator s, the constant in the bound for ${\mathscr{L}} u_{h}$ will grow at the rate $O(h^{-\frac 12})$ under mesh refinement. Therefore we propose the stabilized method, find u_h(t) ∈ V_h, with u_h(0) = π_hu₀, such that

$$ (\mathcal{L} u_{h}(t),v_{h})_{{\varOmega}} + \gamma s(u_{h},v_{h}) = F(v_{h}), \quad \forall v_{h} \in V_{h} $$

(2.13)

for γ > 0. Clearly for γ = 0 we recover the standard Galerkin method.

Remark 1

Although we only consider continuous FEM below all the results holds true for dG methods if the standard Galerkin method (without stabilization) is replaced by the standard dG method with central flux and the stabilized finite element method is replaced by the standard dG method with upwind flux. There is indeed a common misconception that the enhanced stability of the dG methods (space discretization) is due to the discontinuity of the element. The discontinuity only allows for the improved control of the material derivative if there is sufficent control on the solution jump. This can be introduced through upwind fluxes, or otherwise. Indeed it is easy to see that the upwind flux formulation is obtained from the central flux formulation by adding the following stabilization term [4]

$$ s_{up}(v_{h},w_{h}):= \frac12 {\sum}_{F \in \mathcal{F}} \left<|\boldsymbol{\beta} \cdot n_{F}| [v_{h}],[w_{h}] \right>_{F}, $$

where [⋅] simply denotes the jump of the function over the element face F. In general the full jump needs to be penalized, but the minimal stabilization needed to make the dG method satisfy the bound (2.12) depends on the mesh geometry and the polynomial order [6, 39].

3 Stability Estimate of the Finite Element Method

Here we will formalize the discussion of the previous section to obtain a stability estimate that will be useful for the subsequent error analysis. First define the operator norms

$$ \|F\|_{0} := \sup_{v_{h} \in V_{h}} \frac{|F(v_{h})|}{\|v_{h}\|_{{\varOmega}}} \quad \text{ and }\quad \|F\|_{h} := \sup_{v_{h} \in V_{h}} \frac{|F(v_{h})|}{\|v_{h}\|_{{\varOmega}} + |v_{h}|_{s}}. $$

(3.1)

With these definitions the arguments discussed in the previous section may be written as follows.

Theorem 1

Let u_h solve (2.13) with γ > 0 then for all τ ∈ [0,T]

$$ \|u_{h}(\tau)\|_{{\varOmega}}^{2} + \gamma {\int}_{0}^{\tau}|u_{h}|_{s}^{2} ~\mathrm{d}t \leq C_{\beta} \left( {\int}_{0}^{\tau} \|F\|^{2}_{h}~\mathrm{d}t + \|u_{h}(0)\|_{{\varOmega}}^{2}\right) $$

where C_β = O(γ^− 1 + T).

Proof

First take v_h = u_h in (2.13) to obtain using the skew symmetry of the convective operator

$$ (\mathcal{L} u_{h}, u_{h})_{{\varOmega}} = \frac12 \frac{d}{dt} \|u_{h}(t)\|_{{\varOmega}}^{2} $$

and therefore after integration in time over (0,τ)

$$ \begin{array}{@{}rcl@{}} \frac12 \|u_{h}(\tau)\|_{{\varOmega}}^{2} +\gamma {\int}_{0}^{\tau} |u_{h}(t)|_{s}^{2} ~\mathrm{d}t &\leq& \frac12 \|u_{h}(0)\|_{{\varOmega}}^{2}+{\int}_{0}^{\tau} F(u_{h}) ~\mathrm{d}t\\ &\leq& \frac12 \|u_{h}(0)\|_{{\varOmega}}^{2}+{\int}_{0}^{\tau}\|F\|_{h} (\|u_{h}(t)\|_{{\varOmega}} + |u_{h}(t)|_{s}) ~\mathrm{d}t. \end{array} $$

Using the arithmetic-geometric inequality $ab \leq \frac 12 a^{2} + \frac 12 b^{2}$ it follows that $\|F\|_{h} (\|u_{h}(t)\|_{{\varOmega }} + |u_{h}(t)|_{s}) \leq (\gamma ^{-1} +T) \|F\|_{h}^{2} + \frac 12 T^{-1} \|u_{h}(t)\|^{2}_{{\varOmega }}+\gamma \frac 12 |u_{h}(t)|_{s}^{2}$ leading to

$$ \|u_{h}(\tau)\|_{{\varOmega}}^{2} +\gamma {\int}_{0}^{\tau} |u_{h}(t)|_{s}^{2} ~\mathrm{d}t \leq \|u_{h}(0)\|_{{\varOmega}}^{2} + (\gamma^{-1}+ T) {\int}_{0}^{\tau} \|F\|_{h}^{2} ~\mathrm{d}t + {\int}_{0}^{\tau} T^{-1} \|u_{h}(t)\|^{2}_{{\varOmega}}~\mathrm{d}t. $$

By Gronwall’s inequality we have

$$ \begin{array}{@{}rcl@{}} \|u_{h}(\tau)\|_{{\varOmega}}^{2} &\leq& \left( \exp{{\int}_{0}^{\tau} T^{-1} ~\mathrm{d}t} \right)\left( \|u_{h}(0)\|_{{\varOmega}}^{2}+ (\gamma^{-1}+T) {\int}_{0}^{\tau}\|F\|_{h}^{2} ~\mathrm{d}t \right)\\ &\leq& C \left( \|u_{h}(0)\|_{{\varOmega}}^{2} + (\gamma^{-1}+T){\int}_{0}^{\tau}\|F\|_{h}^{2} ~\mathrm{d}t\right). \end{array} $$

We may then bound

$$ \begin{array}{@{}rcl@{}} \gamma {\int}_{0}^{\tau} |u_{h}(t)|_{s}^{2} ~\mathrm{d}t &\leq& \|u_{h}(0)\|_{{\varOmega}}^{2} + (\gamma^{-1}+T) {\int}_{0}^{\tau} \|F\|_{h}^{2} ~\mathrm{d}t + {\int}_{0}^{\tau} T^{-1} \|u_{h}(t)\|^{2}_{{\varOmega}}~\mathrm{d}t \\ &\leq& C\left( \|u_{h}(0)\|_{{\varOmega}}^{2} + (\gamma^{-1}+T) {\int}_{0}^{\tau}\|F\|_{h}^{2} ~\mathrm{d}t \right) \end{array} $$

which concludes the proof. □

For the material derivative we can prove the similar bound

Corollary 1

Let u_h solve (2.13) with γ > 0 then there holds

$$ {{\int}_{0}^{T}} \|h^{\frac12} \mathcal{L} u_{h}\|_{{\varOmega}}^{2} ~\mathrm{d}t \leq C_{\beta} \zeta(\gamma)^{2} \left (\|u_{h}(0)\|_{{\varOmega}}^{2} + {{\int}_{0}^{T}} (h \|F\|_{0}^{2} + (\beta_{\infty} + h \|\nabla \boldsymbol{\beta}\|^{2}_{\infty} T ) \|F\|_{h}^{2} ) ~\mathrm{d}t\right), $$

where $\zeta (\gamma ) = \gamma ^{\frac 12} + \gamma ^{-\frac 12}$.

Proof

$$ {{\int}_{0}^{T}} \|h^{\frac12} \mathcal{L} u_{h}\|_{{\varOmega}}^{2} ~\mathrm{d}t = {{\int}_{0}^{T}} (\mathcal{L} u_{h}, h\pi_{h} \mathcal{L} u_{h})_{{\varOmega}} ~\mathrm{d}t + {{\int}_{0}^{T}} \|h^{\frac12}(I - \pi_{h}) \mathcal{L} u_{h}\|_{{\varOmega}}^{2} ~\mathrm{d}t = T_{1} + T_{2}. $$

To bound the term T₁ we use the formulation (2.13) to obtain

$$ (\mathcal{L} u_{h}, h \pi_{h} \mathcal{L} u_{h})_{{\varOmega}} = F(h \pi_{h} \mathcal{L} u_{h}) - \gamma s(u_{h},h \pi_{h} \mathcal{L} u_{h}). $$

For the first term on the right hand side we see that using the first definition of (3.1) and the stability of the L²-projection there holds

$$ F(h \pi_{h} \mathcal{L} u_{h}) \leq\|F\|_{0} \|h \pi_{h} \mathcal{L} u_{h}\|_{{\varOmega}}\leq h^{\frac12} \|F\|_{0} \|h^{\frac12} \mathcal{L} u_{h}\|_{{\varOmega}}. $$

For the second term we use (2.9) and the L²-stability of the projection to get

$$ \gamma s(u_{h},h \pi_{h} \mathcal{L} u_{h}) \leq \gamma s(u_{h},u_{h})^{\frac12} s(h \pi_{h} \mathcal{L} u_{h},h \pi_{h} \mathcal{L} u_{h})^{\frac12} \leq C \gamma \beta_{\infty}^{\frac12} |u_{h}|_{s} \|h^{\frac12} \mathcal{L} u_{h}\|_{{\varOmega}}. $$

Observe that in the last inequality a factor $h^{\frac 12}$ is lost due to the application of (2.9). Collecting these bounds we see that

$$ T_{1} \leq {{\int}_{0}^{T}} \left( h \|F\|_{0}^{2} + C^{2} \gamma^{2} \beta_{\infty} |u_{h}|^{2}_{s} + \frac12 \|h^{\frac12} \mathcal{L} u_{h}\|_{{\varOmega}}^{2}\right) ~\mathrm{d} t. $$

To bound T₂ we note that by the definition of the L²-projection $\|h^{\frac 12}(I - \pi _{h}) {\mathscr{L}} u_{h}\|_{{\varOmega }} \leq \|h^{\frac 12}({\mathscr{L}} u_{h} - v_{h})\|_{{\varOmega }} $ for all v_h ∈ V_h and apply (2.11) and the fact that ∂_tu_h ∈ V_h, leading to

$$ T_{2} = {{\int}_{0}^{T}} \inf_{v_{h} \in V_{h}} \|h^{\frac12}(\boldsymbol{\beta} \cdot \nabla u_{h} - v_{h})\|_{{\varOmega}}^{2} ~\mathrm{d} t\leq C_{s}{{\int}_{0}^{T}} \left( \beta_{\infty} |u_{h}|^{2}_{s} + h \|\nabla \boldsymbol{\beta}\|^{2}_{\infty} \|u_{h}\|_{{\varOmega}}^{2}\right) ~\mathrm{d} t. $$

The claim follows by the bounds on T₁ and T₂ and the result of Theorem 1. □

Remark 2

Observe that the presence of both positive and negative powers of γ in ζ, shows that the estimate degenerates both for vanishing stabilization and for too strong stabilization. If γ goes to infinity the solution has to become C¹ and the solution will in this case coincide with the standard Galerkin approximation in the C¹-subspace, which is unstable, see discussion in [18].

4 Error Estimates for the Stabilized Formulation (2.13)

Using the stability estimates of Theorem 1 it is straightforward to derive the error estimate (1.1) for smooth solutions. Below we will also use Corollary 1 to obtain an optimal order O(h^k) error estimate for the material derivative.

Then we will assume that f ∈ L²(0,T;Ω) in (2.3) so that we only have u ∈ L²(0,T;Ω). In this case we will show that the stabilized finite element method still converges in a weaker norm.

Theorem 2

Let u₀ ∈ H^k+ 1(Ω), f ∈ L²(0,T;H^k+ 1(Ω)), let u be the solution of (2.1) and u_h the solution of (2.13). Then there holds, for all T > 0

$$ \|(u - u_{h})(\cdot,T)\|_{{\varOmega}} + \gamma \left( {{\int}_{0}^{T}} |u_{h}|_{s}^{2} ~\mathrm{d}t\right)^{\frac12} \leq C_{\beta} \zeta(\gamma) h^{k+\frac12} (\|f\|_{L^{2}(0,T;H^{k+1}({\varOmega}))} + \|u_{0}\|_{H^{k+1}({\varOmega})}) $$

and

$$ \left( {{\int}_{0}^{T}} \|\mathcal{L}(u - u_{h})\|_{{\varOmega}}^{2} ~\mathrm{d}t\right)^{\frac12} \leq C_{\beta} \zeta(\gamma)^{2} h^{k} \|u\|_{H^{1}(0,T;H^{k+1}({\varOmega}))}, $$

where $\zeta (\gamma ) := \gamma ^{\frac 12} + \gamma ^{-\frac 12}$ and C_β depends on $\beta _{\infty }$ and $\|\nabla \boldsymbol {\beta }\|_{\infty }$ and T.

Proof

This result is a consequence of the stability of Theorem 1, the consistency and (2.11). It is standard material (see [24, Section 76.4]) however for completeness we include the short proof.

Using standard approximation estimates there holds [10, Lemma 5.6]

$$ \|\beta_{\infty}^{\frac12} h^{-\frac12}(u - \pi_{h} u)\|_{{\varOmega}} + |u- \pi_{h} u|_{s} \leq C \beta_{\infty}^{\frac12} h^{k+\frac12} |u|_{H^{k+1}({\varOmega})}. $$

(4.1)

Hence by applying a triangle inequality we only need to consider the discrete error e_h = π_hu − u_h. Injecting it in (2.1) and using (2.13) we see that

$$ (\mathcal{L} e_{h}, v_{h}) + \gamma s(e_{h},v_{h}) = F_{\pi}(v_{h}) $$

with F_π(v_h) = (∂_t(π_hu − u),v_h)_Ω + (β ⋅∇(π_hu − u),v_h)_Ω + γs(π_hu,v_h). Applying Theorem 1 we see that

$$ \|e_{h}(T)\|_{{\varOmega}}^{2} +\gamma {{\int}_{0}^{T}} |e_{h}|_{s}^{2} ~\mathrm{d}t\leq C_{\beta} {{\int}_{0}^{T}}\|F_{\pi}\|_{h}^{2} ~\mathrm{d}t + \|e_{h}(0)\|_{{\varOmega}}^{2}. $$

By the definition of u_h(0), e_h(0) = 0. Since ∂_tπ_hu = π_h∂_tu we have using L²-orthogonality and intergration by parts

$$ F_{\pi}(v_{h}) =(u - \pi_{h} u, \boldsymbol{\beta} \cdot \nabla v_{h} - w_{h})_{{\varOmega}} + \gamma s(\pi_{h} u, v_{h}), \quad \forall w_{h} \in V_{h}. $$

It now follows using the Cauchy–Schwarz inequality, (2.11) and (4.1) and recalling that under the regularity assumptions on data u(t) ∈ H²(Ω), that

$$ \|F_{\pi}\|_{h} \leq C_{\beta} \zeta(\gamma) h^{k+\frac12} |u|_{H^{k+1}({\varOmega})}. $$

(4.2)

The first claim then follows after an application of (2.3).

For the second inequality we apply Corollary 1 to see that, since e_h(0) = 0,

$$ {{\int}_{0}^{T}} \|h^{\frac12} \mathcal{L} e_{h}\|_{{\varOmega}}^{2} ~\mathrm{d}t \leq C \zeta(\gamma)^{2} {{\int}_{0}^{T}} (h \|F_{\pi}\|_{0}^{2} + \left( \beta_{\infty} + h \|\nabla \boldsymbol{\beta}\|_{\infty}^{2} T) \|F_{\pi}\|_{h}^{2}\right) ~\mathrm{d}t. $$

(4.3)

It follows that we only need to bound F in the stronger topology ∥⋅∥₀ to conclude. Using the Cauchy–Schwarz inequality and the inverse inequalities (2.6) and (2.9)

$$ \begin{array}{@{}rcl@{}} F_{\pi}(v_{h}) & = &(u - \pi_{h} u, \boldsymbol{\beta} \cdot \nabla v_{h})_{{\varOmega}} + \gamma s(\pi_{h} u, v_{h}) \\ &\leq& C \beta_{\infty} \|h^{-1} (u - \pi_{h} u)\|_{{\varOmega}} \|v_{h}\|_{{\varOmega}} + C \gamma h^{-\frac12} \beta_{\infty}^{\frac12} |\pi_{h} u|_{s} \|v_{h}\|_{{\varOmega}}. \end{array} $$

It follows from (4.1) that

$$ \|F\|_{0} \leq C_{\beta} (1 +\gamma) h^{k} |u|_{H^{k+1}({\varOmega})}. $$

Combining this bound for ∥F∥₀ with the bound (4.2) in (4.3) we see that

$$ {{\int}_{0}^{T}} \|h^{\frac12} \mathcal{L} e_{h}\|_{{\varOmega}}^{2} ~\mathrm{d}t \leq C_{\beta} \zeta(\gamma)^{4} h^{2 k+1} {{\int}_{0}^{T}} |u|^{2}_{H^{k+1}({\varOmega})} ~\mathrm{d}t $$

and we conclude using the approximation bound

$$ \|\mathcal{L} (u - \pi_{h} u) \|_{{\varOmega}} \leq C \left( h^{k+1} \|\partial_{t} u\|_{H^{k+1}({\varOmega})} + \beta_{\infty} h^{k} \|u\|_{H^{k+1}({\varOmega})}\right) $$

and the triangle inequality. □

Remark 3

Note that the error estimate on the material derivative is optimal compared with the approximation properties of the finite element space. In the corresponding analysis for (2.4) only ∥F∥₀ may be used for the upper bound in Theorem 1, resulting in a bound that is suboptimal by $O(h^{\frac 12})$.

4.1 Rough Solutions: Convergence in Weak Norms

Assume now that we have f ∈ L²((0,T);Ω) in (2.13) and u₀ ∈ L²(Ω). Then u ∈ L²((0,T);Ω) is the best we can hope for, making the error estimates of Theorem 2 invalid. However if we estimate the error in a weaker norm, we can still obtain an error bound with convergence order, provided a stabilized method is used. For $\psi \in H^{1}_{per}({\varOmega })$ consider the adjoint problem

$$ \begin{array}{@{}rcl@{}} -\mathcal{L} \varphi & =& 0,\\ \varphi(\cdot,T) & =& \psi. \end{array} $$

This problem admits a unique solution and by (2.3)

$$ \sup_{t \in (0,T)} \|\varphi(t)\|_{H^{1}({\varOmega})} \leq C_{\beta} \|\psi\|_{H^{1}({\varOmega})}. $$

(4.4)

Let $V:= H^{1}_{per} ({\varOmega })$ and introduce the dual norm

$$ \|v\|_{V^{\prime}} := \sup_{w \in V\setminus 0} \frac{\left<v,w\right>_{V^{\prime},V}}{\|w\|_{V}}, $$

where $\left <v,w\right >_{V^{\prime },V}$ is a space duality pairing that we can identify with the L²-scalar product for v ∈ L²(Ω). We now proceed using duality to prove an a posteriori bound

Proposition 1 (A posteriori error bound)

Let u be the solution of (2.1) with f ∈ L²(0,T;Ω) and u₀ ∈ L²(Ω) and u_h the solution of (2.13), with γ ≥ 0. Then there holds, for all T > 0 and for all ψ ∈ V,

$$ \begin{array}{@{}rcl@{}} \frac{ ((u - u_{h})(\cdot,T),\psi)_{{\varOmega}} }{\|\psi\|_{V}}&\leq& C_{\beta} h \| u_{0} - \pi_{h} u_{0}\|_{{\varOmega}} \\ && + C_{\beta}{{\int}_{0}^{T}} \left( \inf_{v_{h} \in V_{h}} h \|f - \boldsymbol{\beta} \cdot \nabla u_{h} - v_{h}\|_{{\varOmega}} + \gamma h^{\frac12} |u_{h}|_{s}\right)~\mathrm{d}t. \end{array} $$

Proof

Using the adjoint equation and integration by parts we see that for any $\psi \in H^{1}_{per}({\varOmega })$,

$$ \begin{array}{@{}rcl@{}} ((u - u_{h})(\cdot,T),\psi)_{{\varOmega}} & =& ((u - u_{h})(\cdot,T),\psi)_{{\varOmega}} +{{\int}_{0}^{T}} (u - u_{h},-\mathcal{L} \varphi)_{{\varOmega}} ~\mathrm{d}t \\ & =& (u_{0} - \pi_{h} u_{0}, \varphi(\cdot,0))_{{\varOmega}} + {{\int}_{0}^{T}} (\mathcal{L} (u - u_{h}), \varphi)_{{\varOmega}} ~\mathrm{d}t \\ &=& (u_{0} - \pi_{h} u_{0}, (I- \pi_{h}) \varphi(\cdot,0))_{{\varOmega}}\\ && + {{\int}_{0}^{T}} ((\mathcal{L} (u - u_{h}), \varphi - \pi_{h} \varphi)_{{\varOmega}} +\gamma s(u_{h},\pi_{h} \varphi))~\mathrm{d}t. \end{array} $$

Considering the terms of the right hand side we see that

$$ \begin{array}{@{}rcl@{}} (u_{0} - \pi_{h} u_{0}, (I- \pi_{h}) \varphi(\cdot,0))_{{\varOmega}} &\leq& C h \|u_{0} - \pi_{h} u_{0}\|_{{\varOmega}} \|\nabla \varphi(\cdot,0)\|_{{\varOmega}},\\ ((\mathcal{L} (u - u_{h}), \varphi - \pi_{h} \varphi)_{{\varOmega}} &\leq& C h \inf_{v_{h} \in V_{h}} \|f - \mathcal{L} u_{h} - v_{h}\|_{{\varOmega}} \|\nabla \varphi\|_{{\varOmega}}\\ & =& C h \inf_{v_{h} \in V_{h}} \|f - \boldsymbol{\beta} \cdot \nabla u_{h} - v_{h}\|_{{\varOmega}} \|\nabla \varphi\|_{{\varOmega}} \end{array} $$

and

$$ s(u_{h},\pi_{h} \varphi) \leq |u_{h}|_{s} h^{\frac12} \beta^{\frac12}_{\infty} \|\nabla \varphi\|_{{\varOmega}}. $$

It follows that

$$ \begin{array}{@{}rcl@{}} &&(u_{0} - \pi_{h} u_{0}, (I- \pi_{h}) \varphi(\cdot,0))_{{\varOmega}} + {{\int}_{0}^{T}} ((\mathcal{L} (u - u_{h}), \varphi - \pi_{h} \varphi)_{{\varOmega}} +\gamma s(u_{h},\pi_{h} \varphi))~\mathrm{d}t\\ &&\leq C \left( h \|u_{0} - \pi_{h} u_{0}\|+ {{\int}_{0}^{T}} (\inf_{v_{h} \in V_{h}} h \|f - \boldsymbol{\beta} \cdot \nabla u_{h} - v_{h}\|_{{\varOmega}} + \gamma \beta_{\infty}^{\frac12} h^{\frac12} |u_{h}|_{s} )~\mathrm{d}t\right)\\ &&\quad\times \sup_{t \in (0,T)} \|\varphi(t)\|_{H^{1}({\varOmega})}. \end{array} $$

We end the proof by applying the stability (4.4). □

Remark 4

A posteriori error estimates in negative norms for stationary first order pde was introduced in [30] and the case of transient problems using stabilized FEM in [9]. Observe that this a posteriori error estimate can not in general be sharp, indeed for a smooth solution, by Theorem 2 we get O(h^k+ 1) convergence in the dual norm. This follows by observing that since we may take v_h = ∂_tu_h and $f = {\mathscr{L}} u$,

$$ \inf_{v_{h} \in V_{h}} h \|f - \boldsymbol{\beta} \cdot \nabla u_{h} - v_{h}\|_{{\varOmega}} \leq h \|\mathcal{L} (u - u_{h})\|_{{\varOmega}} $$

and then applying the second bound of Theorem 2. We see that compared to the L²-estimate we have lost another power $h^{\frac 12}$. Sharp residual type a posteriori error estimates in the L²-norm for transport equations in dimension > 1, so far to the best of my knowledge, have only been obtained under a saturation assumption and using a stabilized finite element method, or a dG method with upwind flux [8].

Theorem 3 (A priori error estimate for rough solutions)

Let u be the solution of (2.1) with f ∈ L²(0,T;L²(Ω)) and u₀ ∈ L²(Ω) and u_h that of (2.13) with γ > 1. Then there holds

$$ \sup_{t \in [0,T)} \|(u-u_{h})(\cdot,t)\|_{V^{\prime}} \leq C_{\beta} (\zeta(\gamma)+1) h^{\frac12} \left( \|f\|_{L^{2}(0,T;L^{2}({\varOmega}))} + \|u_{0}\|_{{\varOmega}}\right), $$

with $\zeta (\gamma ) = \gamma ^{\frac 12} + \gamma ^{-\frac 12}$.

Proof

By definition

$$ \|u-u_{h}\|_{V^{\prime}} = \sup_{w \in V\setminus 0} \frac{(u - u_{h},w)_{{\varOmega}}}{\|w\|_{V}}. $$

Applying Proposition 1 we see that, after a Cauchy–Schwarz inequality in time, for any T > 0,

$$ \begin{array}{@{}rcl@{}} \|(u-u_{h}(\cdot,T)\|_{V^{\prime}} &\leq& C_{\beta} h \| u_{0} - \pi_{h} u_{0}\|_{{\varOmega}} \\ && + C_{\beta} h^{\frac12} T^{\frac12} \left( {{\int}_{0}^{T}} \left( \inf_{v_{h} \in V_{h}} h \|f - \boldsymbol{\beta} \cdot \nabla u_{h} - v_{h}\|^{2}_{{\varOmega}} + \gamma^{2} |u_{h}|^{2}_{s}\right)~\mathrm{d}t\right)^{\frac12}. \end{array} $$

Then noting that by (2.11) there holds

$$ \inf_{v_{h} \in V_{h}} h \|f - \boldsymbol{\beta} \cdot \nabla u_{h} - v_{h}\|^{2}_{{\varOmega}} \leq h \|f\|_{{\varOmega}}^{2} +C_{s}(|u_{h}|_{s}^{2} + h \|\nabla \boldsymbol{\beta}\|^{2}_{\infty} \|u_{h}\|^{2}_{{\varOmega}}) $$

we see that all the a posteriori terms depending on u_h are either on the form |u_h|_s or on the form $\|u_{h}\|^{2}_{{\varOmega }}$ and we conclude by applying Theorem 1. □

4.2 Time Discretization and Stabilized Methods

As a rule of thumb any time integrator with non-trivial imaginary stability boundary extending into the complex plane will be stable and accurate in the sense (1.1), possibly under a CFL condition depending on β and γ. In particular any time discretization method allowing for a time discrete version of an energy estimate of the type in Theorem 1 may be applied and will lead to optimal error estimates similar to those above. This includes all A-stable schemes, backward differentiation methods of first and second order, the Crank–Nicolson method. Explicit methods with good stability properties such as explicit strongly stable Runge–Kutta (RK) methods of order higher than, or equal to, 3 are stable [12, 40,41,42,43]. Similar stability results are expected to hold for Adams–Bashforth (AB) methods of order 3, 4, 7, 8 under standard hyperbolic CFL, δt ≤ Coh, where δt denotes the timestep and Co the Courant number. See for instance [31] for a discussion of time-discretization of advection–diffusion equation, [25] for a discussion of the stability boundaries of AB methods and [13] for numerical experiments using AB3. All these methods are energy stable regardless of whether or not stabilization is added. The second order RK method is energy stable under hyperbolic CFL only for piecewise affine approximation and with added stabilization of the form (2.8) [12] (for dG FEM and affine approximation upwind stabilization must be added [42]). In the general case (no stabilization, higher polynomial approximation) the RK2 method is stable only under a slightly more strict CFL condition, indeed one needs to assume $dt \leq Co h^{\frac 43}$, with Co fixed, but small enough. This condition is the same for both cG and dG methods (see [12, 42]). Recently an analysis of the second order backward differentiation formula and the Crank–Nicolson method (AB2) with convection extrapolated to second order from previous time steps was proposed for the discretization of (2.13) [13]. It was shown that these schemes are stable under similar conditions as the RK2 scheme. Such multi step schemes are particularly appealing in the context of IMEX methods for convection–diffusion and hence provide a one-stage alternative to the RK2 IMEX method analysed in [11].

5 Weighted Error Estimates

In this section we will consider the slightly more technically advanced case of weighted estimates. The idea is to show that stabilization makes information follow the characteristics similarly as in the physics. This means that for solutions with a localized sharp layer, the dependence of a local error in the smooth zone on the regularity of the exact solution decreases exponentially with the distance to the singularity. Hence locally large gradients in the solution can not destroy the solution globally. This is not the case for approximations produced using cG FEM without stabilization. These results touch at the very essence of stabilized FEM, unfortunately their proofs are quite technical and therefore these results in my opinion have received less attention than they deserve. Here we try to give the simplest possible exposition of these ideas, without striving for optimality of exponential decay or generality of meshes. We let the domain be infinite ($L = \infty $) and let u₀ have compact support. To simplify the discussion assume that β ≡ e_x, where e_x is the Cartesian unit vector in the x-direction, so that β ⋅∇u = ∂_xu. Since here $\beta _{\infty } = 1$, below the dependence on the speed will not be tracked. First the case of a globally smooth solution will be considered (Theorem 4). The objective is to obtain an estimate for the error in some subdomain Ω₀(t) ⊂Ω defined as

$$ {\varOmega}_{0}(t) := \{\boldsymbol{x} \in {\varOmega}: |\boldsymbol{x}_{0} + \boldsymbol{\beta} t - \boldsymbol{x} | < r_{0} \} $$

for some x₀ ∈Ω and some r₀ > 0. The derivatives of u are assumed to be moderate in a neighbourhood of Ω₀ and we will prove that the accuracy in this subdomain is independent of large derivatives in other parts of the domain, provided they are sufficiently far away, relative to the mesh size. This is achieved using weights so that the effect of portions of the domain where locally the Sobolev norm is large decays exponentially with the distance to Ω₀. Then we will show how the arguments of the smooth case can be used to prove accuracy in Ω₀ in the case where the solution is locally only L² in the far field (Corollary 2). The key message is that the local accuracy of the approximation depends only on the local smoothness of the exact solution and that perturbations due to roughness in the solution is exponentially damped, except along characteristics. Finally we will discuss how the arguments can be extended to bounded domains with weakly imposed boundary condition and time discretization.

Let φ ∈ C^k+ 1(Ω) be a smooth positive function defined using polar/spherical coordinates, depending only on r(x) = |x₀ −x|, with $\varphi ^{\prime }(r) \leq 0$, φ(r) = 1, r ≤ r₀, $\varphi (r) \sim \exp (-(r-r_{0})/\sigma )$, r > r₀, with $\sigma = K\sqrt {h}$, K > 1, and for some C > 0,

$$ |{\partial_{r}^{l}} \varphi(r)| \leq C \sigma^{-l} \varphi(r), \quad l \ge 1. $$

Remark 5

For the case k = 1 we only require φ ∈ C¹(Ω). An example of such a function with r₀ = 1 and σ = 5 is given in Fig. 1 for illustration.

Define ϖ(x,t) = φ(r(x −βt)) then, since ϖ follows the characteristics ${\mathscr{L}} \varpi = 0$, and

$$ |D^{l}\varpi| \leq C \sigma^{-l} \varpi, \quad l \ge 1 $$

(5.1)

where the derivatives are taken with respect to space or time. The objective is to prove stability and error estimates in the weighted norm

$$ \|v\|_{\varpi}:= \|\varpi v\|_{{\varOmega}}. $$

The same notation will be used occasionally below with different weight functions. The rationale for the design of the weight function is that for all $v \in L^{\infty }(0,T;L^{2}({\varOmega }))$ with ${\mathscr{L}} v \in L^{2}(0,T;{\varOmega })$, by partial integration in space and time,

$$ {{\int}_{0}^{T}} (\partial_{t} v , \varpi^{2} v)_{{\varOmega}} = \|v(\cdot, T)\|_{\varpi}^{2} - \|v(\cdot, 0)\|_{\varpi}^{2} - {{\int}_{0}^{T}} (v, \partial_{t}\varpi^{2} v+\varpi^{2} \partial_{t} v)_{{\varOmega}} ~\mathrm{d}t $$

and

$$ (\boldsymbol{\beta} \cdot \nabla v, \varpi^{2} v)_{{\varOmega}} = - (v,(\boldsymbol{\beta} \cdot \nabla \varpi^{2}) v +\varpi^{2} \boldsymbol{\beta} \cdot \nabla v)_{{\varOmega}}, $$

there holds

$$ \begin{array}{@{}rcl@{}} {{\int}_{0}^{T}} (\mathcal{L} v, \varpi^{2} v)_{{\varOmega}} ~\mathrm{d}t & =&\|v(\cdot,T)\|_{\varpi}^{2} - \|v(\cdot,0)\|_{\varpi}^{2}\\ && - {{\int}_{0}^{T}} (v, \underbrace{(\mathcal{L}\varpi^{2})}_{=0} v)_{{\varOmega}} + (v, \varpi^{2} \mathcal{L} v)_{{\varOmega}} ~\mathrm{d}t. \end{array} $$

Hence

$$ {{\int}_{0}^{T}} (\mathcal{L} v, \varpi^{2} v)_{{\varOmega}} ~\mathrm{d}t = \frac12\|v(\cdot,T)\|_{\varpi}^{2} - \frac12\|v(\cdot,0)\|_{\varpi}^{2} $$

(5.2)

and therefore the following stability is satisfied by the continuous equation, (2.1), ∀σ > 0,

$$ \frac12\|u(\cdot,T)\|_{\varpi}^{2} \leq \frac12 \|u(\cdot,0)\|_{\varpi}^{2} + {{\int}_{0}^{T}} \|f\|_{\varpi} \|u\|_{\varpi} ~\mathrm{d}t $$

from which we conclude

$$ \sup_{t \in (0,T)} \|u(\cdot,t)\|_{\varpi} \leq \|u(\cdot,0)\|_{\varpi} + 2 {{\int}_{0}^{T}} \|f\|_{\varpi} ~\mathrm{d}t. $$

This relation expresses that the solution is transported along the characteristics. The influence across characteristics will be damped exponentially as $\exp (-d/\sigma )$. However in the continuous case, since the bound holds for all σ > 0 the cut-off is sharp.

The aim is to make the error analysis for the solution of (2.13) reproduce this type of localization. For the purposes of analysis we introduce the weighted stabilization operator

$$ s_{\varpi}(v_{h},w_{h}) = {\sum}_{F \in \mathcal{F}} {\int}_{F} {h_{F}^{2}} \varpi^{2} [\![\nabla v_{h}]\!] [\![\nabla w_{h}]\!] ~\mathrm{d} s,\text{ with semi-norm } |w|_{s,\varpi} := s_{\varpi}(w,w)^{\frac12} $$

and note that s(v_h,ϖ²w_h) = s_ϖ(v_h,w_h). Also recall the following weighted versions of (2.11) from [14, Lemma 3.1, equation (3.1) and (3.2)], here $\boldsymbol {\beta }_{0}\vert _{S} \in \mathbb {R}^{n}$ is some piecewise constant per element,

$$ \|h^{\frac12} (\boldsymbol{\beta}_{0} \cdot \nabla v_{h} - \pi_{h} \boldsymbol{\beta}_{0} \cdot \nabla v_{h})\|^{2}_{\varpi} \leq C_{ws} ||\boldsymbol{\beta}_{0}| v_{h}|^{2}_{s,\varpi} $$

(5.3)

and

$$ \|h^{\frac12} (\boldsymbol{\beta} \cdot \nabla (\varpi^{2} v_{h}) - \pi_{h}(\boldsymbol{\beta} \cdot \nabla (\varpi^{2} v_{h})))\|^{2}_{\varpi^{-1}} \leq C_{ws} |v_{h}|^{2}_{s,\varpi} + C_{\beta} K^{-2} \|v_{h}\|^{2}_{\varpi}. $$

(5.4)

The second bound differs from the bound in [14], since there the derivative of v_h appears in the second term of the right hand side. The proof however is similar. For completeness we detail it in ??. We will need to use approximation in the weighted norm and therefore collect some results on the L²-projection in the following lemmas. The first one is taken from [2] and we refer to this reference for the proof. The following two are variations on results from [14] and for completeness we give the proofs in ??. We note that all the above inequalities hold both for the weight ϖ and ϖ^− 1, since by the construction of the weight,

$$ |\nabla \varpi^{-1}| = |\varpi^{-2} \nabla \varpi| \leq C \varpi^{-2} \sigma^{-1} \varpi = C \sigma^{-1} \varpi^{-1}. $$

It follows that (5.1) is satisfied also for ϖ^− 1.

Lemma 1 (Stability L ²-projection)

Let π_h denote the L²-projection onto V_h. Then, if ϕ is a function satisfying

$$ |\nabla \phi(x)| \leq \nu h^{-1} |\phi(x)|, $$

for some ν > 0, sufficiently small then there holds

$$ \| \pi_{h} v\|_{\phi} \leq C\|v \|_{\phi}, $$

(5.5)

$$ \|\nabla\pi_{h} v\|_{\phi} \leq C \|\nabla v \|_{\phi} $$

(5.6)

and

$$ \|\nabla \pi_{h} v\|_{\phi} \leq C h^{-1} \|v\|_{\phi} , \quad \forall v \in H^{1}({\varOmega}). $$

(5.7)

Proof

The estimates (5.5)–(5.7) are taken verbatim from [2, bounds (1.7)–(1.9)] (see also [22, Appendix]). □

The above stability estimates allows us to prove bounds on the L²-error in the weighted norm.

Lemma 2 (Weighted approximation)

Let π_h denote the L²-projection onto V_h. Then for $h^{\frac 12}/K$ sufficiently small and I_δ = [t − δt,t + δt] ∩ [0,T] with $\delta t \in \mathbb {R}^{+}$, $\delta t \sim h$, there holds

$$ \max_{(x,t) \in S \times I_{\delta}} \varpi(x,t) \|v\|_{S} \leq 2 \min_{t \in I_{\delta} }\| v \varpi(\cdot,t)\|_{S}, \quad \forall v \in L^{2}(S), $$

(5.8)

$$ \|(v - \pi_{h} v)\|_{\varpi} + h \|\nabla(v - \pi_{h} v)\|_{\varpi} \leq C h^{k+1} \|D^{k+1} v\|_{\varpi}, \quad \forall v \in H^{k+1}({\varOmega}) $$

(5.9)

and

$$ |v - \pi_{h} v|_{s,\varpi} \leq C h^{k+\frac12} \|D^{k+1} v\|_{\varpi}, \quad \forall v \in H^{k+1}({\varOmega}). $$

(5.10)

For the analysis we also need the following interpolation estimates on weighted discrete functions.

Lemma 3 (Super approximation)

Let v_h ∈ V_h. Assume that $h^{\frac 12}/K$ is sufficiently small. Then there holds

$$ \|\varpi^{2} v_{h} - \pi_{h} (\varpi^{2} v_{h})\|_{\varpi^{-1}} + h \|\nabla (\varpi^{2} v_{h} - \pi_{h} (\varpi^{2} v_{h}))\|_{\varpi^{-1}}\leq C h^{\frac12} K^{-1} \|v_{h} \|_{\varpi} $$

(5.11)

and

$$ \left( {\sum}_{S \in \mathcal{T}} \|\varpi^{-1}\nabla(\varpi^{2} v_{h} - \pi_{h} (\varpi^{2} v_{h}))\|_{\partial S}^{2} \right)^{\frac12} \leq C h^{-1} K^{-1} \|v_{h}\|_{\varpi}. $$

(5.12)

We will now derive a weighted stability estimate for the finite element formulation (2.13). First use similar arguments as for (5.2) to obtain for any v_h ∈ C¹(0,T;V_h),

$$ {{\int}_{0}^{T}} (\mathcal{L} v_{h}, \varpi^{2} v_{h})_{{\varOmega}} ~\mathrm{d}t = \frac12 \|v_{h}(\cdot,T)\|_{\varpi}^{2} - \frac12 \|v_{h}(\cdot,0)\|_{\varpi}^{2} $$

and, since ϖ ∈ C¹(Ω) we see that

$$ s(v_{h},\varpi^{2} v_{h}) = |v_{h}|_{s,\varpi}^{2}. $$

Therefore,

$$ \|v_{h}(\cdot,T)\|_{\varpi}^{2} + 2 {\gamma{\int}_{0}^{T}} |v_{h}|_{s,\varpi}^{2}~\mathrm{d}t = 2 {{\int}_{0}^{T}}((\mathcal{L} v_{h},\varpi^{2} v_{h})_{{\varOmega}} + \gamma s(v_{h},\varpi^{2} v_{h}) )~\mathrm{d}t + \|v_{h}(\cdot,0)\|_{\varpi}^{2}. $$

(5.13)

However, since ϖ²v_h∉V_h the equality can not be used directly for the finite element formulation. We need to show that stability similar to (5.13) can be obtained by testing by some interpolant of ϖ²v_h.

Proposition 2 (Weighted stability)

Let γ > 0, K > 1. Assume that $h^{\frac 12}/K$ is sufficiently small. For all v_h ∈ C¹(0,T;V_h) there holds

$$ \begin{array}{@{}rcl@{}} \|v_{h}(\cdot,T)\|_{\varpi}^{2} + \gamma {{\int}_{0}^{T}} |v_{h}|_{s,\varpi}^{2} ~\mathrm{d}t &\leq& C/K^{2} {{\int}_{0}^{T}} \|v_{h}\|_{\varpi}^{2} ~\mathrm{d}t \\ &&+ 2 {{\int}_{0}^{T}}((\mathcal{L} v_{h}, w_{h})_{{\varOmega}} + \gamma s(v_{h},w_{h}) )~\mathrm{d}t + \|v_{h}(\cdot,0)\|_{\varpi}^{2}, \end{array} $$

where w_h = π_hϖ²v_h and the constant $C \sim \gamma + \gamma ^{-1}$.

Proof

Starting from the equality (5.13) we add and subtract the finite element formulation tested with some function w_h,

$$ \begin{array}{@{}rcl@{}} \|v_{h}(\cdot,T)\|_{\varpi}^{2} + 2 \gamma {{\int}_{0}^{T}} |v_{h}|_{s,\varpi}^{2} ~\mathrm{d}t & =& 2{{\int}_{0}^{T}}((\mathcal{L} v_{h}, \varpi^{2} v_{h}- w_{h})_{{\varOmega}} + \gamma s(v_{h},\varpi^{2} v_{h} - w_{h}) )~\mathrm{d}t \\ &&+ 2 {{\int}_{0}^{T}}((\mathcal{L} v_{h}, w_{h})_{{\varOmega}} + \gamma s(v_{h},w_{h}) )~\mathrm{d}t + \|v_{h}(\cdot,0)\|_{\varpi}^{2}. \end{array} $$

We choose w_h = π_h(ϖ²v_h) to obtain, for an arbitrary y_h ∈ V_h

$$ \begin{array}{@{}rcl@{}} (\mathcal{L} v_{h}, \varpi^{2} v_{h} - \pi_{h} (\varpi^{2} v_{h}))_{{\varOmega}} &=& (\boldsymbol{\beta} \cdot \nabla v_{h} - y_{h}, \varpi^{2} v_{h} - \pi_{h} (\varpi^{2} v_{h}))_{{\varOmega}} \\ &\leq& \!\!\inf_{y_{h} \in V_{h}} \|h^{\frac12} (\boldsymbol{\beta} \cdot \nabla v_{h} - y_{h})\|_{\varpi} h^{-\frac12}\| (\varpi^{2} v_{h} - \pi_{h} (\varpi^{2} v_{h}))\|_{\varpi^{-1}}. \end{array} $$

Considering the stabilization term we see that

$$ s(v_{h},\varpi^{2} v_{h} -\pi_{h} (\varpi^{2} v_{h})) \leq |v_{h}|_{s,\varpi} h \beta_{\infty}^{\frac12} \left( {\sum}_{F \in \mathcal{F}} \|\varpi^{-1} [\![{\nabla (\varpi^{2} v_{h} -\pi_{h} (\varpi^{2} v_{h}))}]\!]\|_{F}^{2}\right)^{\frac12}. $$

(5.14)

Using the arithmetic-geometric inequality ab ≤ (2𝜖)^− 1a² + (𝜖2^− 1)b², to split the terms in the right hand side, with 𝜖 = 2 in (5.14), we obtain

$$ \begin{array}{@{}rcl@{}} \|v_{h}(\cdot,T)\|_{\varpi}^{2} + \frac74\gamma {{\int}_{0}^{T}} |v_{h}|_{s,\varpi}^{2} ~\mathrm{d}t & \leq& \epsilon^{-1} \gamma^{-1} h^{-1}{{\int}_{0}^{T}} \underbrace{\| (\varpi^{2} v_{h} - \pi_{h} (\varpi^{2} v_{h}))\|_{\varpi^{-1}}^{2}}_{T_{1}} ~\mathrm{d}t \\ &&+ \gamma h^{2} \beta_{\infty}{{\int}_{0}^{T}} \!\!\underbrace{{\sum}_{F \in \mathcal{F}} \!\!\|\varpi^{-1} [\![{\nabla (\varpi^{2} v_{h} - \pi_{h} (\varpi^{2} v_{h}))}]\!]\|_{F}^{2}}_{T_{2}}~\mathrm{d}t \\ &&+ \epsilon \gamma {{\int}_{0}^{T}} \underbrace{\inf_{y_{h} \in V_{h}} \|h^{\frac12} (\boldsymbol{\beta} \cdot \nabla v_{h} - y_{h})\|_{\varpi}^{2}}_{T_{3}}~\mathrm{d}t \\ &&+ 2 {{\int}_{0}^{T}}((\mathcal{L} v_{h}, w_{h})_{{\varOmega}} + \gamma s(v_{h},w_{h}) )~\mathrm{d}t + \|v_{h}(\cdot,0)\|_{\varpi}^{2}. \end{array} $$

We need to bound the contributions T₁, T₂ and T₃ in terms of the quantities of the left hand side and ∥v_h∥_ϖ. Using (5.11) immediately yields

$$ T_{1} = \|(\varpi^{2} v_{h} - \pi_{h} (\varpi^{2} v_{h}))\|_{\varpi^{-1}}^{2} \leq C K^{-2} h \|v_{h}\|_{\varpi}^{2}. $$

By distribution of the integrals over the faces on simplices, splitting the jumps on the contributions from the two sides and applying (5.12) there holds

$$ T_{2} \leq C{\sum}_{S \in \mathcal{T}} \|\varpi^{-1} \nabla (\varpi^{2} v_{h} -\pi_{h} (\varpi^{2} v_{h}))\|_{\partial S}^{2} \leq C/K^{2} h^{-2} \|v_{h}\|_{\varpi}^{2}. $$

Finally for the term T₃ apply the weighted stabilization bound (5.3), with β₀ ≡ e_x, where e_x is the Cartesian unit vector in the x-direction

$$ T_{3} = \inf_{y_{h} \in V_{h}} \|h^{\frac12} (\boldsymbol{\beta} \cdot \nabla v_{h} - y_{h})\|_{\varpi}^{2} \leq C_{ws} |v_{h}|_{s,\varpi}^{2}. $$

Collecting the bounds for T₁-T₃ and choosing 𝜖 = (2C_ws)^− 1 we see that

$$ \begin{array}{@{}rcl@{}} \|v_{h}(\cdot,T)\|_{\varpi}^{2} + \gamma {{\int}_{0}^{T}} |v_{h}|_{s,\varpi}^{2} ~\mathrm{d}t & \leq& (\gamma^{-1} + \gamma) C/K^{2} {{\int}_{0}^{T}} \|v_{h}\|_{\varpi}^{2} ~\mathrm{d}t \\ &&+ 2 {{\int}_{0}^{T}}((\mathcal{L} v_{h}, w_{h})_{{\varOmega}} + \gamma s(v_{h},w_{h}) )~\mathrm{d}t + \|v_{h}(\cdot,0)\|_{\varpi}^{2}. \end{array} $$

□

Theorem 4

Assume that the hypothesis of Proposition 2 are satisfied. Let $u \in L^{\infty }(0,T; H^{k+1}({\varOmega }))$ be the solution of (2.1) and u_h the solution of (2.13). Then for all T > 0 there holds

$$ \|(u - u_{h})(\cdot,T)\|_{\varpi} \leq C_{K} h^{k+\frac12} \left( h \|D^{k+1} u (\cdot,T)\|_{\varpi}^{2} + (\gamma + \gamma^{-1}) {{\int}_{0}^{T}} \|D^{k+1} u\|_{\varpi}^{2} ~\mathrm{d} t \right)^{\frac12}. $$

The constant C_K grows exponentially in time with coefficient proportional to (γ + γ^− 1)K^− 2.

First note that we may split the error as $u - u_{h} = \underbrace {u - \pi _{h} u}_{=-\eta } + \underbrace {\pi _{h} u - u_{h}}_{=e_{h}}$ and by (5.9),

$$ \|(u - \pi_{h} u)(\cdot,T)\|_{\varpi} \leq C h^{k+1} \|D^{k+1} u (\cdot,T)\|_{\varpi}. $$

By the triangle inequality we only need to prove the bound on ∥e_h(⋅,T)∥_ϖ.

Using the stability of Proposition 2 we see that, since e_h(⋅,0) = 0,

$$ \begin{array}{@{}rcl@{}} \|e_{h}(\cdot,T)\|_{\varpi}^{2} + \gamma {{\int}_{0}^{T}} |e_{h}|_{s,\varpi}^{2} ~\mathrm{d}t & \leq& C/K^{2} {{\int}_{0}^{T}} \|e_{h}\|_{\varpi}^{2} ~\mathrm{d}t \\ &&+ 2 {{\int}_{0}^{T}}((\mathcal{L} e_{h}, w_{h})_{{\varOmega}} + \gamma s(e_{h},w_{h}) )~\mathrm{d}t \end{array} $$

with w_h = π_h(ϖ²e_h). Now observe that the following consistency property holds

$$ {{\int}_{0}^{T}} (\mathcal{L} (e_{h} - \eta), v_{h})_{{\varOmega}} - \gamma s(u_{h}, v_{h}) ~\mathrm{d}t= 0,\quad \forall v_{h} \in V_{h} $$

and hence

$$ {{\int}_{0}^{T}}((\mathcal{L} e_{h}, w_{h})_{{\varOmega}} + \gamma s(e_{h},w_{h}) )~\mathrm{d}t = {{\int}_{0}^{T}}((\mathcal{L} \eta, w_{h})_{{\varOmega}} + \gamma s(\pi_{h} u_{h},w_{h}))~\mathrm{d}t. $$

This leads to a perturbation equation on the form

$$ \begin{array}{@{}rcl@{}} \|e_{h}(\cdot,T)\|_{\varpi}^{2} + \gamma {{\int}_{0}^{T}} |e_{h}|_{s,\varpi}^{2} ~\mathrm{d}t & \leq& C K^{-2} {{\int}_{0}^{T}} \|e_{h}\|_{\varpi}^{2} ~\mathrm{d}t \\ &&+ 2 {{\int}_{0}^{T}}((\mathcal{L} \eta, w_{h})_{{\varOmega}} + \gamma s(\pi_{h} u_{h},w_{h}))~\mathrm{d}t. \end{array} $$

(5.15)

Considering the first term of the second integral in the right hand side we have using that time derivation and the L²-projection commute and the L²-orthogonality of η

$$ \begin{array}{@{}rcl@{}} (\mathcal{L} \eta, w_{h})_{{\varOmega}} & =& -(\eta, \boldsymbol{\beta} \cdot \nabla w_{h} - y_{h})_{{\varOmega}} \leq h^{-\frac12} \|\eta\|_{\varpi} h^{\frac12} \inf_{y_{h} \in V_{h}} \|\boldsymbol{\beta} \cdot \nabla w_{h} - y_{h}\|_{\varpi^{-1}} \\ &\leq& h^{-1} \gamma^{-1} C \|\eta\|_{\varpi}^{2} + \frac14 \gamma |e_{h}|^{2}_{s,\varpi} + C\gamma /K^{2} \|e_{h}\|^{2}_{\varpi}. \end{array} $$

Here we used the inequality ab ≤ 4^− 1a² + b² and that by the triangle inequality followed by the bounds (5.11), and (5.4) there holds

$$ \begin{array}{@{}rcl@{}} h^{\frac12} \inf_{y_{h} \in V_{h}}\|\boldsymbol{\beta} \cdot \nabla w_{h} - y_{h}\|_{\varpi^{-1}} &\leq& h^{\frac12} \|\boldsymbol{\beta} \cdot \nabla \pi_{h} (\varpi^{2} e_{h}) - \boldsymbol{\beta} \cdot \nabla (\varpi^{2} e_{h})\|_{\varpi^{-1}}\\ && + h^{\frac12} \inf_{y_{h} \in V_{h}}\|\boldsymbol{\beta} \cdot \nabla (\varpi^{2} e_{h}) - y_{h}\|_{\varpi^{-1}} \\ &\leq& h^{\frac12} \beta_{\infty} \|\nabla (\pi_{h} (\varpi^{2} e_{h}) - \varpi^{2} e_{h})\|_{\varpi^{-1}}\\ && + (C_{ws} |e_{h}|^{2}_{s,\varpi} + C_{\beta} K^{-2} \|e_{h}\|^{2}_{\varpi})^{\frac12} \\ &\leq& C K^{-1} \|e_{h}\|_{\varpi}+ C_{ws} |e_{h}|_{s,\varpi} . \end{array} $$

For the last term in the right hand side of (5.15) we have

$$ \begin{array}{@{}rcl@{}} s(\pi_{h} u_{h} ,w_{h})& =& s(\pi_{h} u_{h},\pi_{h} (\varpi^{2} e_{h}) - \varpi^{2} e_{h}) + s(\pi_{h} u_{h}, \varpi^{2} e_{h}) \\ & \leq& C |\pi_{h} u_{h}|^{2}_{s,\varpi} + \frac14 |e_{h}|^{2}_{s,\varpi} \\ &&+ h^{2} \beta_{\infty}^{2} {\sum}_{F \in \mathcal{F}} \|\varpi^{-1} [\![{\nabla (\varpi^{2} e_{h} -\pi_{h} (\varpi^{2} e_{h}))}]\!]\|_{F}^{2}. \end{array} $$

Applying the bound (5.12) to each term of the jump separately in the last term in the right hand side and collecting the estimates it follows that

$$ (\mathcal{L} \eta, w_{h})_{{\varOmega}} + \gamma s(\pi_{h} u_{h},w_{h}) \leq C (\gamma |\pi_{h} u_{h}|^{2}_{s,\varpi} + h^{-1} \gamma^{-1} \|\eta\|_{\varpi}^{2}) + \frac12\gamma |e_{h}|^{2}_{s,\varpi} + \gamma C/K^{2} \|e_{h}\|_{\varpi}^{2}. $$

Applying this bound in (5.15) we have

$$ \begin{array}{@{}rcl@{}} \|e_{h}(\cdot,T)\|_{\varpi}^{2} + \frac12 {\gamma{\int}_{0}^{T}} |e_{h}|^{2}_{s,\varpi} ~\mathrm{d}t & \leq& C(\gamma + \gamma^{-1})/K^{2} {{\int}_{0}^{T}} \|e_{h}\|_{\varpi}^{2} ~\mathrm{d}t \\ &&+ C {{\int}_{0}^{T}}\!\!\left( \gamma |\pi_{h} u_{h}|^{2}_{s,\varpi} + h^{-1} \gamma^{-1} \|\eta\|_{\varpi}^{2}\right) \!\!~\mathrm{d}t. \end{array} $$

(5.16)

Since the solution is assumed regular, $u(\cdot ,t) \in H^{\frac 32+\epsilon }({\varOmega })$, 𝜖 > 0 we have $|\pi _{h} u_{h}|^{2}_{s,\varpi } = |\eta |^{2}_{s,\varpi }$. Applying Lemma 2 yields

$$ {{\int}_{0}^{T}}\left( \gamma |\eta|^{2}_{s,\varpi} + h^{-1} \gamma^{-1} \|\eta\|_{\varpi}^{2}\right)~\mathrm{d}t\leq C h^{2 k+1} (\gamma + \gamma^{-1}) {{\int}_{0}^{T}}\|D^{k+1} u\|^{2}_{\varpi} ~\mathrm{d}t. $$

The claim now follows by an application of Gronwall’s inequality.

5.1 Discussion of Estimates for Rough Solutions

Consider the following subsets of Ω, Ω₀(t) := {x ∈Ω : ϖ(x,t) = 1} and Ω_p(t) := {x ∈Ω : ϖ(x,t) ≤ h^p,p > 0}. Then denoting d = dist(Ω₀,Ω_p) it follows by the construction of ϖ that

$$ d \sim K p \sqrt{h} |\log (h)|, $$

and the following bound holds

$$ \|(u-u_{h})(\cdot,T)\|_{{\varOmega}_{0}} \leq C h^{k+\frac12} \left( \max_{t \in [0,T]} \|D^{k+1} u\|_{L^{2}({\varOmega} \setminus {\varOmega}_{p})} + h^{p} \max_{t \in [0,T]} \|D^{k+1} u\|_{L^{2}({\varOmega}_{p})}\right). $$

It follows that D^k+ 1u can be large, O(h^−p), in Ω_p without destroying the solution in Ω₀. To apply the argument to u₀ that is only piecewise in H^k+ 1 one can use the weighted L²-stability in the error analysis above and still obtain estimates. We present a sketch of this result in a corollary.

Corollary 2

Assume that the hypothesis of Proposition 2 are satisfied. Let p = k + 1. Assume that $u \in L^{\infty }(0,T;L^{2}({\varOmega }))$, with $u\vert _{{\varOmega } \setminus {\varOmega }_{p}} \in H^{k+1}({\varOmega } \setminus {\varOmega }_{p})$, for all t ∈ [0,T] is the solution of (2.1) and u_h the solution of (2.13). Then there holds (omitting for simplicity the dependence on γ).

$$ \|(u-u_{h})(\cdot,T)\|_{{\varOmega}_{0}} \leq C_{K} h^{k+\frac12} \left( \max_{t \in [0,T]} \| u\|_{H^{k+1}({\varOmega} \setminus {\varOmega}_{p})} + \max_{t \in [0,T]} \|u\|_{L^{2}({\varOmega}_{p})}\right). $$

Proof

The proof follows that of Theorem 4 closely. We only need to substitute the L²-projection for an interpolant with more local properties before applying approximation. Let the domain Ω_p,ih(t) be defined by the union of all the elements that intersect Ω_p(T) and an integer i layers of nearest neighbours. The norm over Ω_p,ih(t) will be denoted $\|\cdot \|_{{\varOmega }_{p,ih}}$. Let C_h denote the Clément interpolant defined using local projections. It is well known [23, Lemma 1.127] that if for a given $S \in \mathcal {T}$, Δ_S denotes the set of simplices sharing at least one vertex with S and for a face F, Δ_F denotes the set of simplices sharing at least one vertex with F, then

$$ \begin{array}{@{}rcl@{}} &&\|v - C_{h} v\|_{H^{m}(S)} \leq C h^{l-m} \|v\|_{H^{l}({\Delta}_{S})}, \quad\|v - C_{h} v\|_{H^{m}(F)} \leq C h^{l-m-\frac12} \|v\|_{H^{l}({\Delta}_{F})}, \\ && 0 \leq m \leq l \leq k+1. \end{array} $$

(5.17)

It is then straightforward to use the approximation properties of C_h in Ω ∖Ω_p,1h and the local stability of C_h in Ω_p,1h to show the estimates

$$ \begin{array}{@{}rcl@{}} \|(u - C_{h} u)(\cdot,t)\|_{\varpi} &\leq& C\left( h^{k+1} \|D^{k+1} u(\cdot,t)\|_{{\varOmega} \setminus {\varOmega}_{p}} + h^{p} \|u(\cdot,t)\|_{{\varOmega}_{p,2h}}\right) \\ &\leq& C h^{k+1} \left( \|u(\cdot,t)\|_{H^{k+1}({\varOmega} \setminus {\varOmega}_{p})} + \|u(\cdot,t)\|_{{\varOmega}_{p}}\right) \end{array} $$

(5.18)

and

$$ \begin{array}{@{}rcl@{}} |C_{h} u(\cdot,t)|_{s,\varpi} &\leq& C\left( h^{k+\frac12} \|D^{k+1} u(\cdot,t)\|_{{\varOmega} \setminus {\varOmega}_{p}} + h^{-\frac12+p} \|u(\cdot,t)\|_{{\varOmega}_{p,2h}}\right) \\ &\leq& C h^{k+\frac12} \left( \|u(\cdot,t)\|_{H^{k+1}({\varOmega} \setminus {\varOmega}_{p})} + \|u(\cdot,t)\|_{{\varOmega}_{p}}\right). \end{array} $$

(5.19)

For the second inequality we divide |C_hu(⋅,t)|_s,ϖ into the sum over faces in Ω ∖Ω_p,1h and Ω_p,1h. The two different sets are treated differently. For faces in Ω ∖Ω_p,1h we proceeded as usual using that $u(\cdot ,t)\vert _{{\varOmega }\setminus {\varOmega }_{p}} \in H^{\frac 32+\epsilon }({\varOmega }\setminus {\varOmega }_{p})$ and apply the local approximation properties on faces of C_h (second inequality of (5.17)). For faces in Ω_p,1h we can not use approximation and instead apply (2.10) and (2.6). We also used that $\varpi \vert _{{\varOmega }_{p,1h}} \leq C h^{p}$ by construction. Observe that by the weighted L²-stability (5.5) we have

$$ \|(u - \pi_{h} u)(\cdot,T)\|_{\varpi} \leq \|\pi_{h} (u - C_{h} u)(\cdot,T)\|_{\varpi} + \|(u - C_{h} u)(\cdot,T)\|_{\varpi} \leq C \|(u - C_{h} u)(\cdot,T)\|_{\varpi} $$

(5.20)

and hence as before we only need to prove the bound for ∥e_h(⋅,T)∥_ϖ. The inequality (5.16) still holds. To conclude we observe that using (5.20)

$$ {{\int}_{0}^{T}}h^{-1} \|\eta\|^{2}_{\varpi} ~\mathrm{d}t \leq C {{\int}_{0}^{T}} h^{-1} \|u - C_{h} u\|^{2}_{\varpi} ~\mathrm{d}t . $$

(5.21)

By combining the inequality

$$ |v_{h}|_{s,\varpi} \leq C h^{-\frac12} \|v_{h}\|_{\varpi} $$

(that is immediate by (2.10), (2.6) and (5.8)) with (5.20) we also have

$$ \begin{array}{@{}rcl@{}} {{\int}_{0}^{T}} |\pi_{h} u|^{2}_{s,\varpi} ~\mathrm{d}t &\leq& 2 {{\int}_{0}^{T}} \left( |\pi_{h} u- C_{h} u|^{2}_{s,\varpi} +|C_{h} u_{h}|^{2}_{s,\varpi}\right) ~\mathrm{d}t\\ & \leq& C{{\int}_{0}^{T}} \left( h^{-1} \|u - C_{h} u\|^{2}_{\varpi} +|C_{h} u_{h}|^{2}_{s,\varpi}\right) ~\mathrm{d}t. \end{array} $$

(5.22)

We conclude as before after applying (5.18) and (5.19) in (5.21) and (5.22). □

5.2 Time Discretization and Weakly Imposed Boundary Conditions

In practice and in the numerical section below of course we need to include boundary conditions and time discretizations in the above arguments. Depending on the time-discretization this can be a challenging exercise, but we will here focus on the 𝜃-scheme and the main steps of its analysis using the ideas above in the case of the backward Euler scheme (𝜃 = 1). Boundary conditions are imposed weakly using the standard upwind technique known from discontinuous Galerkin methods. We consider a polygonal domain Ω and denote its boundary by Γ := ∂Ω with outward pointing normal n. We decompose Γ into an inflow part

$$ {\Gamma}_{-} := \{x \in {\Gamma}: \boldsymbol{\beta}(x) \cdot n<0\} $$

and an outflow part Γ₊ := ∂Ω ∖Γ₋. The space V_h will here denote the standard finite element space of continuous piecewise polynomial functions, without boundary conditions defined on $\mathcal {T}$. We are now interested in the the solution of (2.1) with the additional inflow boundary condition

$$ u = g~\text{ on }~{\Gamma}_{-}, $$

where $g \in L^{2}(0,T;L^{2}_{\beta \cdot n}({\Gamma }_{-}))$ with $L^{2}_{\boldsymbol {\beta } \cdot n}({\Gamma }_{-}) := \{ v:{\Gamma }_{-} \mapsto \mathbb {R}: \||\boldsymbol {\beta }\cdot n|^{\frac 12} v\|_{L^{2}({\Gamma }_{-})} < \infty \}$. We will assume that the g, Γ₋ and Γ₊ are such that the exact solution is smooth enough for our purposes. The timestep δt := T/N for some $N \in \mathbb {N}^{+}$ will be assumed to satisfy δt ≤ Ch for some C > 0, and the discrete solution $u_{h} := \{{u_{h}^{n}}\}_{n=0}^{N}$ collects the finite element approximations on the discrete time levels tⁿ = nδt. The so-called 𝜃-scheme takes the form: find ${u_{h}^{n}} \in V_{h}$ such that for $ n = 1,2,3 {\dots } N$,

$$ (\mathcal{L}_{\theta}^{n} u_{h}, v_{h})_{{\varOmega}} + \left< |\boldsymbol{\beta} \cdot n| u^{n_{\theta}}_{h},v_{h} \right>_{{\Gamma}_{-}} + s\left( u_{h}^{n_{\theta}},v_{h}\right) = (f^{n_{\theta}},v_{h})_{{\varOmega}} + \left<|\boldsymbol{\beta} \cdot n| g^{n_{\theta}},v_{h} \right>_{{\Gamma}_{-}},~\forall v_{h} \in V_{h}, $$

(5.23)

where $u_{h}^{n_{\theta }} := \theta {u_{h}^{n}} + (1-\theta ) u_{h}^{n-1}$, $g^{n_{\theta }} := g(\cdot , t^{n}+\theta \delta t)$, $f^{n_{\theta }}:=f(\cdot , t^{n}+\theta \delta t)$,

$$ \mathcal{L}_{\theta}^{n} u_{h} := \delta t^{-1} \left( {u_{h}^{n}} - u_{h}^{n-1}\right) + \boldsymbol{\beta} \cdot \nabla u_{h}^{n_{\theta}}, \quad \theta \in [1/2,1] $$

and ${u_{h}^{0}} = \pi _{h} u_{0}$. Compared to the time continuous analysis we have two additional points to study

1.
the time discrete character of the equation,
2.
the boundary penalty term.

We recall that the theta scheme includes the well-known backward Euler scheme (𝜃 = 1) and the Crank–Nicolson scheme (𝜃 = 1/2). A complete analysis of the 𝜃 scheme is beyond the scope of the present paper. To give some insight in the validity of the above arguments in the fully discrete case we will show the modifications necessary to prove Proposition 2 in the time discrete case with weakly imposed boundary conditions, for 𝜃 = 1. Theorem 4 then follows using the arguments above and standard truncation error analysis. We will then show numerically that also the Crank–Nicolson scheme enjoys the local accuracy property. For further evidence of the local accuracy property we refer to [12, Section 5.2 and Fig. 1] for examples using explicit Runge–Kutta methods and [13, Section 6] for examples using explicit extrapolated multistep methods. For the analysis we need the following Lemma the proof of which is given in the ??.

Lemma 4

Let ϖ_n(x) = ϖ(x,t_n), where ϖ is a weightfunction satisfying (5.1) and v_h ∈ V_h, then for δt small enough there holds

$$ \left\|v_{h}{\int}_{t_{n-1}}^{t_{n}} \partial_{t} \varpi ~\mathrm{d}t\right\|_{{\varOmega}} + \left\|v_{h}\left|{\int}_{t_{n-1}}^{t_{n}} {\int}_{t}^{t_{n}}{\partial_{t}^{2}} \varpi^{2} ~\mathrm{d}s \mathrm{d}t\right|^{\frac12}\right\|_{{\varOmega}}\leq C K^{-1}\delta t^{\frac12} \|v_{h}\|_{\varpi_{n}}. $$

The following weighted L²-stability estimate is the key ingredient of the analysis of the fully discrete scheme.

Proposition 3

Consider the scheme (5.23) with 𝜃 = 1, then assuming δt < 1 small enough there holds, with ${w_{h}^{n}} = \pi _{h} \varpi ^{2} {v_{h}^{n}}$,

$$ \begin{array}{@{}rcl@{}} &&\|{v_{h}^{N}}\|_{\varpi_{N}}^{2} + {\sum}_{n=1}^{N} \|{v_{h}^{n}} - v_{h}^{n-1}\|_{\varpi_{n}}^{2}+\delta t {\sum}_{n=1}^{N}\left( \||\boldsymbol{\beta} \cdot n|^{\frac12} {v_{h}^{n}} \varpi_{n}\|_{\Gamma}^{2} + \gamma |{v_{h}^{n}}|_{s,\varpi_{n}}^{2}\right) \\ &&\qquad \leq C_{K} \left( \|{v_{h}^{0}}\|_{\varpi_{0}}^{2}+\delta t {\sum}_{n=1}^{N} \left( (\mathcal{L}_{\theta}^{n} v_{h}, {w_{h}^{n}})_{{\varOmega}} + \left< |\boldsymbol{\beta} \cdot n| {v^{n}_{h}},{w^{n}_{h}} \right>_{{\Gamma}_{-}} + \gamma s({v_{h}^{n}},{w^{n}_{h}})\right)\right). \end{array} $$

The constant C_K grows exponentially in time with exponential coefficient 1/K².

Proof

First we observe that using standard partial integration and ∇⋅β = 0 we have

$$ \begin{array}{@{}rcl@{}} &&(\boldsymbol{\beta} \cdot \nabla v_{h}, \varpi^{2} v_{h})_{{\varOmega}} +\left< |\boldsymbol{\beta} \cdot n| v_{h},\varpi^{2} v_{h} \right>_{{\Gamma}_{-}}\\ &&\qquad= - (\boldsymbol{\beta} \cdot \nabla v_{h}, \varpi^{2} v_{h})_{{\varOmega}}- (v_{h},(\boldsymbol{\beta} \cdot \nabla \varpi^{2}) v_{h})_{{\varOmega}} + \left< |\boldsymbol{\beta} \cdot n| v_{h}, \varpi^{2} v_{h}\right>_{{\Gamma}_{+}}. \end{array} $$

As a consequence

$$ (\boldsymbol{\beta} \cdot \nabla v_{h}, \varpi^{2} v_{h})_{{\varOmega}} +\left< |\boldsymbol{\beta} \cdot n| v_{h},\varpi^{2} v_{h} \right>_{{\Gamma}_{-}}= - \frac12 (v_{h},(\boldsymbol{\beta} \cdot \nabla \varpi^{2}) v_{h})_{{\varOmega}} + \frac12 \left< |\boldsymbol{\beta} \cdot n| v_{h}, \varpi^{2} v_{h}\right>_{\Gamma}. $$

We also have

$$ \left( {v_{h}^{n}} - v_{h}^{n-1},{\varpi^{2}_{n}} {v^{n}_{h}}\right)_{{\varOmega}} = \frac12 \|{v_{h}^{n}}\|_{\varpi_{n}}^{2} + \frac12 \|{v_{h}^{n}} - v_{h}^{n-1}\|_{\varpi_{n}}^{2}- \frac12 \|v_{h}^{n-1}\|_{\varpi_{n}}^{2}. $$

It follows that

$$ \begin{array}{@{}rcl@{}} &&\delta t {\sum}_{n=1}^{N} ((\mathcal{L}_{\theta}^{n} v_{h}, {\varpi^{2}_{n}} {v^{n}_{h}})_{{\varOmega}} + \left< |\boldsymbol{\beta} \cdot n|{v^{n}_{h}},{\varpi^{2}_{n}} {v^{n}_{h}} \right>_{{\Gamma}_{-}} + \gamma s({v_{h}^{n}},{\varpi^{2}_{n}} {v^{n}_{h}}) ) \\ &&\quad = \frac12 \|{v_{h}^{N}}\|_{\varpi_{N}}^{2} + \frac12 {\sum}_{n=1}^{N} \left( \|{v_{h}^{n}} - v_{h}^{n-1}\|_{\varpi_{n}}^{2}- ((v_{h}^{n-1})^{2},{\varpi_{n}^{2}}-\varpi_{n-1}^{2})_{{\varOmega}}\right) - \frac12 \|{v_{h}^{0}}\|_{\varpi_{0}}^{2}\\ &&\qquad - \frac12 \delta t {\sum}_{n=1}^{N} (({v_{h}^{n}})^{2},\boldsymbol{\beta} \cdot \nabla {\varpi^{2}_{n}})_{{\varOmega}} + \frac12 \delta t {\sum}_{n=1}^{N}\left( \||\boldsymbol{\beta} \cdot n|^{\frac12} {v_{h}^{n}} \varpi_{n}\|_{\Gamma}^{2} + 2 \gamma s({v_{h}^{n}},{\varpi_{n}^{2}} {v_{h}^{n}})\right). \end{array} $$

Identifying the terms in the right hand side that do not have a sign we see that we need to control

$$ {\sum}_{n=1}^{N} ((v_{h}^{n-1})^{2},{\varpi_{n}^{2}}-\varpi_{n-1}^{2})_{{\varOmega}}+\delta t (({v_{h}^{n}})^{2},\boldsymbol{\beta} \cdot \nabla {\varpi^{2}_{n}})_{{\varOmega}}). $$

We rewrite the first term

$$ ((v_{h}^{n-1})^{2}, ({\varpi_{n}^{2}}-\varpi_{n-1}^{2}) )_{{\varOmega}} = ((v_{h}^{n-1})^{2} - ({v_{h}^{n}})^{2},{\varpi_{n}^{2}}-\varpi_{n-1}^{2})_{{\varOmega}} + (({v_{h}^{n}})^{2}, ({\varpi_{n}^{2}}-\varpi_{n-1}^{2}))_{{\varOmega}}. $$

For the first term on the right hand side we develop a² − b² = (a + b)(a − b) and apply Cauchy–Schwarz inequality and the arithmetic-geometric inequality, followed by Lemma 4 and the inequality (5.8) to obtain the bound

$$ \begin{array}{@{}rcl@{}} &&((v_{h}^{n-1})^{2} - ({v_{h}^{n}})^{2},{\varpi_{n}^{2}}-\varpi_{n-1}^{2})_{{\varOmega}} = ((v_{h}^{n-1}+{v_{h}^{n}})(v_{h}^{n-1}-{v_{h}^{n}}),{\varpi_{n}^{2}}-\varpi_{n-1}^{2})_{{\varOmega}}\\ &&\qquad = \left( (v_{h}^{n-1}+{v_{h}^{n}})(v_{h}^{n-1}-{v_{h}^{n}}),(\varpi_{n}+\varpi_{n-1}){\int}_{t_{n-1}}^{t_{n}} \partial_{t} \varpi(\cdot,t) ~\mathrm{d}t\right)_{{\varOmega}}\\ &&\qquad\ge - \epsilon^{-1} \left\|({v_{h}^{n}}+v_{h}^{n-1}) {\int}_{t_{n-1}}^{t_{n}} \partial_{t} \varpi(\cdot,t) ~\mathrm{d}t\right\|_{{\varOmega}}^{2} - \frac{\epsilon}{2} (({v_{h}^{n}}-v_{h}^{n-1})^{2}, {\varpi_{n}^{2}}+\varpi_{n-1}^{2})_{{\varOmega}} \\ &&\qquad\ge - C K^{-2} \epsilon^{-1} \delta t \left( \|{v_{h}^{n}}\|_{\varpi_{n}}^{2} + \|v_{h}^{n-1}\|_{\varpi_{n-1}}^{2}\right) - \frac{C \epsilon}{2} \|{v_{h}^{n}}-v_{h}^{n-1}\|_{\varpi_{n}}^{2}. \end{array} $$

Considering the remaining terms, using the relation ${\mathscr{L}} \varpi ^{2} = 0$, and applying once again Lemma 4, yields the bound

$$ \begin{array}{@{}rcl@{}} (({v_{h}^{n}})^{2},{\varpi_{n}^{2}}-\varpi_{n-1}^{2})_{{\varOmega}} + \delta t (({v_{h}^{n}})^{2},\boldsymbol{\beta} \cdot \nabla {\varpi^{2}_{n}})_{{\varOmega}} & =& \left( ({v_{h}^{n}})^{2},{\int}_{t_{n-1}}^{t_{n}} \partial_{t} \varpi^{2} ~\mathrm{d}t- \delta t\partial_{t} {\varpi^{2}_{n}}\right)_{{\varOmega}} \\ & =& \left( ({v_{h}^{n}})^{2},{\int}_{t_{n-1}}^{t_{n}} {\int}_{t_{n}}^{t} \partial_{tt} \varpi^{2} ~\mathrm{d}s ~\mathrm{d}t\right)_{{\varOmega}}\\ & \ge& - \delta t C/K^{2} \|{v_{h}^{n}}\|_{\varpi_{n}}^{2}. \end{array} $$

Taking 𝜖 sufficiently small so that C𝜖/2 ≤ 1/4 it follows that

$$ \begin{array}{@{}rcl@{}} &&\|{v_{h}^{N}}\|_{\varpi_{N}}^{2} + {\sum}_{n=1}^{N} (\|{v_{h}^{n}} - v_{h}^{n-1}\|_{\varpi_{n}}^{2}+\delta t {\sum}_{n=1}^{N}(\||\boldsymbol{\beta} \cdot n|^{\frac12} {v_{h}^{n}} \varpi_{n}\|_{\Gamma}^{2} + \gamma |{v_{h}^{n}}|_{s,\varpi_{n}}^{2}) \\ && \leq C \left( \|{v_{h}^{0}}\|_{\varpi_{0}}^{2}+\delta t {\sum}_{n=1}^{N} ((\mathcal{L}_{\theta}^{n} v_{h}, {\varpi^{2}_{n}} {v^{n}_{h}})_{{\varOmega}} + \left< |\boldsymbol{\beta} \cdot n| {v^{n}_{h}},{\varpi^{2}_{n}} {v^{n}_{h}} \right>_{{\Gamma}_{-}}\right.\\ &&\quad + \gamma s({v_{h}^{n}},{\varpi^{2}_{n}} {v^{n}_{h}}) + C K^{-2} \|{v_{h}^{n}}\|_{\varpi_{n}}^{2})\Bigg). \end{array} $$

Proceeding as before we add and subtract ${w_{h}^{n}} := \pi _{h} ({\varpi ^{2}_{n}} {v^{n}_{h}})$ in the right slot of the bilinear forms of the right hand side

$$ \begin{array}{@{}rcl@{}} &&\!\!\!\!\!\!\|{v_{h}^{N}}\|_{\varpi_{N}}^{2} + {\sum}_{n=1}^{N} \|{v_{h}^{n}} - v_{h}^{n-1}\|_{\varpi_{n}}^{2}+\delta t {\sum}_{n=1}^{N}\left( \||\boldsymbol{\beta} \cdot n|^{\frac12} {v_{h}^{n}} \varpi_{n}\|_{\Gamma}^{2} + \gamma |{v_{h}^{n}}|_{s,\varpi_{n}}^{2}\right) \\ && \leq C\left( \|{v_{h}^{0}}\|_{\varpi_{0}}^{2} +\delta t {\sum}_{n=1}^{N} \left( (\mathcal{L}_{\theta}^{n} v_{h}, w_{h})_{{\varOmega}} + \left< |\boldsymbol{\beta} \cdot n| {v^{n}_{h}},w_{h} \right>_{{\Gamma}_{-}} + s({v_{h}^{n}},w_{h}) +\delta t C \|{v_{h}^{n}}\|_{\varpi_{n}}^{2}\right)\right.\\ &&\left. +\delta t {\sum}_{n=1}^{N} \left( (\mathcal{L}_{\theta}^{n} v_{h}, {\varpi^{2}_{n}} {v^{n}_{h}} - w_{h})_{{\varOmega}} + \left< |\boldsymbol{\beta} \cdot n| {v^{n}_{h}},{\varpi^{2}_{n}} {v^{n}_{h}} - w_{h} \right>_{{\Gamma}_{-}} + \gamma s({v_{h}^{n}},{\varpi^{2}_{n}} {v^{n}_{h}} - w_{h})\right)\right). \end{array} $$

Only the term introduced for the weak imposition of boundary conditions differs from the time-continuous analysis. For this term we observe that

$$ \left< |\boldsymbol{\beta} \cdot n| {v^{n}_{h}},{\varpi^{2}_{n}} {v^{n}_{h}} - w_{h} \right>_{{\Gamma}_{-}} \ge -\epsilon \||\boldsymbol{\beta} \cdot n|^{\frac12} {v_{h}^{n}} \varpi_{n}\|_{\Gamma}^{2} - \frac{\beta_{\infty}}{4 \epsilon} \|\varpi_{n}^{-1} \left( {\varpi^{2}_{n}} {v^{n}_{h}} - \pi_{h} {\varpi^{2}_{n}} {v^{n}_{h}}\right)\|_{{\Gamma}_{-}}^{2}. $$

For the second term on the right hand side we have the bound

$$ \|\varpi_{n}^{-1} \left( {\varpi^{2}_{n}} {v^{n}_{h}} - \pi_{h} {\varpi^{2}_{n}} {v^{n}_{h}}\right)\|_{{\Gamma}_{-}}^{2} \leq C/K^{2} \|{v^{n}_{h}} \|_{\varpi}^{2}. $$

This follows by applying the trace inequality (2.10), the properties of ϖ and the inequality (5.11). Proceeding as in the time-continuous case we then obtain the bound

$$ \begin{array}{@{}rcl@{}} &&\|{v_{h}^{N}}\|_{\varpi_{N}}^{2} + {\sum}_{n=1}^{N} \|{v_{h}^{n}} - v_{h}^{n-1}\|_{\varpi_{n}}^{2}+\delta t {\sum}_{n=1}^{N}\left( \||\boldsymbol{\beta} \cdot n|^{\frac12} {v_{h}^{n}} \varpi_{n}\|_{\Gamma}^{2} + \gamma |{v_{h}^{n}}|_{s,\varpi_{n}}^{2}\right) \\ &&\leq C\left( \|{v_{h}^{0}}\|_{\varpi_{0}}^{2} + \delta t {\sum}_{n=1}^{N} \left( (\mathcal{L}_{\theta}^{n} v_{h}, w_{h})_{{\varOmega}} + \left< |\boldsymbol{\beta} \cdot n| {v^{n}_{h}},w_{h} \right>_{{\Gamma}_{-}} + \gamma s({v_{h}^{n}},w_{h}) + K^{-2} \|{v_{h}^{n}}\|_{\varpi_{n}}^{2}\right)\right). \end{array} $$

Choosing δt sufficiently small the term $\delta t C K^{-2} \|{v_{h}^{N}}\|_{\varpi _{N}}^{2}$ in the right hand side can be absorbed in the left hand side and we conclude by an application of the discrete Gronwall’s inequality. □

Remark 6

A consequence of the previous analysis is that the proposed method can be used in the context of problems, where the boundary or initial data is unknown or partially known. Assume for example that g is unknown and replaced by zero. Then, since the effect of the erroneous boundary condition is damped exponentially for non-characteristic directions, the solution can still be approximated with good accuracy in subsets Ω₀ whose domain of dependence is sufficiently far from the boundary. Similarly if the initial data is unknown in some parts of the domain, the solution will still remain accurate in subdomains where the initial data in the domain of dependence is known. This result is a time-dependent analogue to the analysis of [16].

6 Numerical Examples

All numerical examples were produced using the package FreeFEM++ [29]. The method (5.23) is considered with 𝜃 = 1/2, corresponding to the second order Crank–Nicolson scheme. This choice was made to minimize the perturbation of the global energy estimate by the time-discretization. The consistent mass matrix is used and exact quadrature is applied to all the forms. We first consider transport in the disc ${\varOmega }:= \{(x,y) \in \mathbb {R}^{2} : x^{2}+y^{2} < 1\}$ under the velocity field β = (y,−x). Approximations are computed on a series of unstructured meshes. We set f = 0 and consider two different functions u₀ as initial data. One is smooth

$$ u_{0} = e^{-30((x-0.5)^{2}+y^{2})} $$

(6.1)

and one is rough

$$ \tilde u_{0} = \left\{\begin{array}{l} 1, \quad \sqrt{(x+0.5)^{2} +y^{2}} <0.2,\\ 0 \quad \text{otherwise}. \end{array} \right. $$

The velocity field simply turns the disc with the initial data and one full turn is computed so that the final solution should be equal to the initial data. Two numerical experiments are considered where the solution is approximated for the initial data u₀ and $u_{0} + \tilde u_{0}$.

We report the global error in the material derivative over the space time domain, the global L²-norm of the error at the final time, and in the case where both the rough and the smooth initial data are combined, the error obtained in the smooth part, i.e. the L²-norm over {(x,y) ∈Ω : x > 0}. The discretization parameters for piecewise affine (P₁ below) approximation have been chosen as $dt = \tfrac 12 h = \pi /nele$, where nele is the number of cell faces on the disc perimeter. For piecewise quadratic (P₂ below) approximation h = 2π/nele and $dt = \tfrac 12 h^{\frac 32}$, to make the error of the time and space discretization similar. In the left panel of Fig. 2 the smooth and rough initial data, interpolated on a very fine mesh, are presented. In the middle panel the solution after one turn without stabilization and in the right panel the solution after one turn with stabilization for P₁, on the mesh resolution nele = 80 are reported. We see that the sharp layers are smeared on this coarse mesh when the stabilized method is used, but contrary to the unstabilized case the smooth part of the solution is accurately captured.

In Fig. 3 the convergence of stabilized and unstabilized methods with P₁ and P₂ elements are compared for the smooth initial data. We observe that when the solution is globally smooth both methods perform well in the L²-norm. Nevertheless, the improvement of the convergence rate for the stabilized method is clearly visible for both approximation spaces, both in the L²-error and in the material derivative. The results when part of the solution is rough (initial data from Fig. 2, left plot) are reported in Fig. 4. Note that both methods have similar global error in the L²-norm. The stabilized method on the other hand still has optimal convergence in the part where the solution is smooth, in accordance with the theory of Section 5. Its material derivative is also more stable under refinement. The unstabilized method has equally poor convergence in the smooth and in the rough part of the solution.

6.1 An Example with Inflow and Outflow and Weakly Imposed Boundary Conditions

Here we consider transport in the unit square with β = (1,0)^T. We use a structured mesh with nele cell faces on the side of the square. The initial data consists of a cylinder of radius r = 0.2 centered in the middle of the square and a Gaussian centered on the left boundary (see Fig. 5, left plot). The exact shapes are the same as those of the previous example. The solution is approximated over the time interval (0,1] so that the cylinder leaves the domain at t = 0.7 and at t = 1 the Gaussian is centered on the right boundary. The time dependent inflow boundary condition u = g on Γ₋ is imposed weakly as described in (5.23) (g is chosen as the trace of the known exact solution). In Fig. 5, the final time approximation is reported in the middle plot without stabilization and the in right plot with stabilization. Observe that from t = 0.7 the solution is smooth. Nevertheless the unstabilized Galerkin method fails to produce an accurate approximation of the smooth final time solution. Spurious oscillations from the discontinuity have spread over the whole computational domain and remain also when the rough part of the solution has left. The convergence of the L²-error at final times for the stabilized and unstabilized approaches is shown in Fig. 6 (h = 1/nele, nele = 40,80,160,320). We see that for the stabilized method both the P₁ and P₂ approximations have optimal convergence to the smooth solution. The unstabilized method converges approximately as $O(h^{\frac 12})$ in both cases and its material derivative diverges.

6.2 Long Term Stability

To see the effect of perturbations on the solution for long time we revisit the computational example of the previous section, but extend the time interval to (0,3). The cylinder leaves the domain at t = 0.7 and at the final time the solution is very small. One would then expect the error of the method to go to zero with machine precision, since the solution to approximate is very close to the trivial zero solution. In Fig. 7 the global L²-norm is reported, for two consecutive meshes (nele = 40 and nele = 80) and both the stabilized (full line) and the unstabilized (dashed line) methods. In the stabilized case the improvement of the approximation at t = 0.7, when the cylinder leaves the domain, is clearly visible and the solution also improves as the Gaussian is evacuated. We see convergence to zero at machine precision of the error and also convergence under mesh refinement. In the unstabilized case the change at time t = 0.7 is barely visible, the error decreases only very slowly in time and not noticeably under mesh refinement. Similarly as in the previous example, we conclude that the standard Galerkin method with weakly imposed boundary conditions in our simulations fails to evacuate the high frequency perturbations produced by the discontinuous initial data on the two meshes considered.

Code Availability

Codes used to produce approximate solutions can be made available upon reasonable request.

References

Bertoluzza, S: The discrete commutator property of approximation spaces. C. R. Acad. Sci. Paris Sr. I Math. 329, 1097–1102 (1999)
Article MathSciNet Google Scholar
Boman, M: Estimates for the l₂-projection onto continuous finite element spaces in a weighted l_p-norm. BIT Number. Math. 46, 249–260 (2006)
Article Google Scholar
Brenner, SC, Scott, LR: The Mathematical Theory of Finite Element Methods, 3rd edn. Texts in Applied Mathematics, vol. 15. Springer, New York (2008)
Book Google Scholar
Brezzi, F, Marini, LD, Süli, E: Discontinuous Galerkin methods for first-order hyperbolic problems. Math. Models Methods Appl. Sci. 14, 1893–1903 (2004)
Article MathSciNet Google Scholar
Burman, E, Gillissen, JJJ, Oksanen, L: Stability estimate for scalar image velocimetry. arXiv:2008.09451 (2020)
Burman, E, Stamm, B: Minimal stabilization for discontinuous Galerkin finite element methods for hyperbolic problems. J. Sci. Comput. 33, 183–208 (2007)
Article MathSciNet Google Scholar
Burman, E: A unified analysis for conforming and nonconforming stabilized finite element methods using interior penalty. SIAM J. Numer. Anal. 43, 2012–2033 (2005)
Article MathSciNet Google Scholar
Burman, E: A posteriori error estimation for interior penalty finite element approximations of the advection-reaction equation. SIAM J. Numer. Anal. 47, 3584–3607 (2009)
Article MathSciNet Google Scholar
Burman, E: Robust error estimates in weak norms for advection dominated transport problems with rough data. Math. Models Methods Appl. Sci. 24, 2663–2684 (2014)
Article MathSciNet Google Scholar
Burman, E, Ern, A: Continuous interior penalty hp-finite element methods for advection and advection-diffusion equations. Math. Comput. 76, 1119–1140 (2007)
Article MathSciNet Google Scholar
Burman, E, Ern, A: Implicit-explicit Runge–Kutta schemes and finite elements with symmetric stabilization for advection-diffusion equations. ESAIM Math. Model. Numer. Anal. 46, 681–707 (2012)
Article MathSciNet Google Scholar
Burman, E, Ern, A, Fernández, MA: Explicit Runge–Kutta schemes and finite elements with symmetric stabilization for first-order linear PDE systems. SIAM J. Numer. Anal. 48, 2019–2042 (2010)
Article MathSciNet Google Scholar
Burman, E, Guzmán, J: Implicit-explicit multistep formulations for finite element discretisations using continuous interior penalty. arXiv:2012.05727. ESAIM Math. Model. Numer. Anal. (to appear) (2020)
Burman, E, Guzmán, J, Leykekhman, D: Weighted error estimates of the continuous interior penalty method for singularly perturbed problems. IMA J. Numer. Anal. 29, 284–314 (2009)
Article MathSciNet Google Scholar
Burman, E, Hansbo, P: Edge stabilization for Galerkin approximations of convection–diffusion–reaction problems. Comput. Methods Appl. Mech. Eng. 193, 1437–1453 (2004)
Article MathSciNet Google Scholar
Burman, E, Nechita, M, Oksanen, L: A stabilized finite element method for inverse problems subject to the convection-diffusion equation, II: convection-dominated regime. arXiv:2006.13201 (2020)
Burman, E, Quarteroni, A, Stamm, B: Stabilization strategies for high order methods for transport dominated problems. Boll. Unione Mat. Ital. Ser. (9) 1, 57–77 (2008)
Burman, E, Quarteroni, A, Stamm, B: Interior penalty continuous and discontinuous finite element approximations of hyperbolic equations. J. Sci. Comput. 43, 293–312 (2010)
Article MathSciNet Google Scholar
Moura, R.C., da Silva, A.F.C., Burman, E., Sherwin, S.J.: Eigenanalysis of gradient-jump penalty (GJP) stabilisation for CG. Technical report. https://doi.org/10.13140/RG.2.2.32887.85924 (2020)
de Frutos, J, García-Archilla, B, Novo, J: Local error estimates for the SUPG method applied to evolutionary convection–reaction–diffusion equations. J. Sci. Comput. 66, 528–554 (2016)
Article MathSciNet Google Scholar
Douglas, J, Dupont, T: Interior penalty procedures for elliptic and parabolic Galerkin methods. In: Glowinski, R, Lions, JL (eds.) Computing Methods in Applied Sciences (Second International Symposium, Versailles, 1975). Lecture Notes in Physics, vol. 58, pp 207–216. Springer, Berlin (1976)
Eriksson, K, Johnson, C: Adaptive finite element methods for parabolic problems. II. Optimal error estimates in $l_{\infty }l_{2}$ and $l_{\infty } l_{\infty }$. SIAM J. Numer. Anal. 32, 706–740 (1995)
Article MathSciNet Google Scholar
Ern, A, Guermond, J-L: Theory and Practice of Finite Elements. Applied Mathematical Sciences, vol. 159. Springer, New York (2004)
Book Google Scholar
Ern, A, Guermond, J-L: Finite Elements, vol. III. Springer, Cham (2021)
Book Google Scholar
Ghrist, ML, Fornberg, B, Reeger, JA: Stability ordinates of Adams predictor-corrector methods. BIT Numer. Math. 55, 733–750 (2015)
Article MathSciNet Google Scholar
Girault, V, Scott, LR: On a time-dependent transport equation in a Lipschitz domain. SIAM J. Math. Anal. 42, 1721–1731 (2010)
Article MathSciNet Google Scholar
Guermond, J-L: Subgrid stabilization of Galerkin approximations of linear contraction semi-groups of class c⁰ in Hilbert spaces. Numer. Methods Partial Differ. Equ. 17, 1–25 (2001)
Article Google Scholar
Guzmán, J: Local analysis of discontinuous Galerkin methods applied to singularly perturbed problems. J. Numer. Math. 14, 41–56 (2006)
Article MathSciNet Google Scholar
Hecht, F: New development in FreeFem++. J. Numer. Math. 20, 251–265 (2013)
MathSciNet MATH Google Scholar
Houston, P, Mackenzie, JA, Süli, E., Warnecke, G: A posteriori error analysis for numerical approximations of Friedrichs systems. Numer. Math. 82, 433–470 (1999)
Article MathSciNet Google Scholar
Hundsdorfer, W, Verwer, J: Numerical Solution of Time-Dependent Advection-Diffusion-Reaction Equations. Springer Series in Computational Mathematics, vol. 33. Springer, Berlin (2003)
Book Google Scholar
Johnson, C, Schatz, AH, Wahlbin, LB: Crosswind smear and pointwise errors in streamline diffusion finite element methods. Math. Comput. 49, 25–38 (1987)
Article MathSciNet Google Scholar
Johnson, C, Nävert, U, Pitkäranta, J: Finite element methods for linear hyperbolic problems. Comput. Methods Appl. Mech. Eng. 45, 285–312 (1984)
Article MathSciNet Google Scholar
Karamanos, G. -S., Karniadakis, GE: A spectral vanishing viscosity method for large-eddy simulations. J. Comput. Phys. 163, 22–50 (2000)
Article MathSciNet Google Scholar
Maday, Y, Tadmor, E: Analysis of the spectral vanishing viscosity method for periodic conservation laws. SIAM J. Numer. Anal. 26, 854–870 (1989)
Article MathSciNet Google Scholar
Moura, RC, Aman, M, Peiró, J, Sherwin, SJ: Spatial eigenanalysis of spectral/hp continuous Galerkin schemes and their stabilisation via DG-mimicking spectral vanishing viscosity for high Reynolds number flows. J. Comput. Phys. 406, 109112 (2020)
Article MathSciNet Google Scholar
Moura, RC, Cassinelli, A, da Silva, A.F.C., Burman, E, Sherwin, SJ: Gradient jump penalty stabilisation of spectral/hp element discretisation for under-resolved turbulence simulations. Comput. Methods Appl. Mech. Eng. 388, 114200 (2022)
Article MathSciNet Google Scholar
Peterson, TE, Shuster, DB: Non-optimal behaviour of finite element methods for first order hyperbolic problems. Appl. Math. Comput. Sci. 5, 579–596 (1995)
MathSciNet MATH Google Scholar
Wang, H, Liu, Y, Zhang, Q, Shu, C-W: Local discontinuous Galerkin methods with implicit-explicit time-marching for time-dependent incompressible fluid flow. Math. Comput. 88, 91–121 (2019)
Article MathSciNet Google Scholar
Xu, Y, Shu, C-W, Zhang, Q: Error estimate of the fourth-order Runge–Kutta discontinuous Galerkin methods for linear hyperbolic equations. SIAM J. Numer. Anal. 58, 2885–2914 (2020)
Article MathSciNet Google Scholar
Xu, Y, Zhang, Q, Shu, C-W, Wang, H: The l²-norm stability analysis of Runge–Kutta discontinuous Galerkin methods for linear hyperbolic equations. SIAM J. Numer. Anal. 57, 1574–1601 (2019)
Article MathSciNet Google Scholar
Zhang, Q, Shu, C-W: Error estimates to smooth solutions of Runge–Kutta discontinuous Galerkin methods for scalar conservation laws. SIAM J. Numer. Anal. 42, 641–666 (2004)
Article MathSciNet Google Scholar
Zhang, Q, Shu, C-W: Stability analysis and a priori error estimates of the third order explicit Runge–Kutta discontinuous Galerkin method for scalar conservation laws. SIAM J. Numer. Anal. 48, 1038–1063 (2010)
Article MathSciNet Google Scholar
Zhou, GH: A local l²-error analysis of the streamline diffusion method for nonstationary convection-diffusion systems. RAIRO Model. Math. Anal. Numér. 29, 577–603 (1995)
Article MathSciNet Google Scholar

Download references

Funding

The author acknowledges funding from EPSRC grants EP/P01576X/1 and EP/T033126/1.

Author information

Authors and Affiliations

Department of Mathematics, University College London, Gower Street, London, UK–WC1E 6BT, UK
Erik Burman

Authors

Erik Burman
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Erik Burman.

Additional information

Availability of data and material

The data used to produce figures can be made available upon reasonable request.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Dedicated to Professor Alfio Quarteroni on his 70th birthday.

Appendix

Here we give the proofs of the approximation results for the L²-projection, Lemmas 2 and 3 and finally the weighted discrete interpolation result (5.4).

First we give a simple super approximation result for the Lagrange interpolant i_h that will be useful for the proofs of inequalities (5.11) and (5.12). For a general discussion of discrete commutator properties we refer to [1].

Lemma 5

Let $\phi \in W^{k+1,\infty }({\varOmega })$ satisfying (5.1) with K > 1 and h < 1. Then for $h^{\frac 12}/K$ sufficiently small, there holds for all v_h ∈ V_h, $S \in \mathcal {T}$,

$$ |\phi v_{h} - i_{h} (\phi v_{h}) |_{H^{s}(S)} \leq C h^{\frac12-s}/K \|\phi v_{h}\|_{S},\quad 0 \leq s \leq 2. $$

Proof

By the approximation properties of i_h there holds

$$ |\phi v_{h} - i_{h} (\phi v_{h})|_{H^{s}(S)} \leq C h^{k+1-s} \|D^{k+1} (\phi v_{h})\|_{S}. $$

(6.2)

Using the product rule and the fact that D^k+ 1v_h = 0 since $v_{h} \vert _{S} \in \mathbb {P}_{k}(S)$, we see that

$$ \|D^{k+1} (\phi v_{h})\|_{S} \leq C{\sum}_{l=1}^{k+1} |\phi|_{W^{l,\infty}(S)} |v_{h}|_{H^{k+1-l}(S)}. $$

By applying the inverse inequality (2.6) repeatedly the derivatives on v_h can be eliminated at the price of factors of the inverse of h,

$$ h^{k+1-s} \|D^{k+1} (\phi v_{h})\|_{S} \leq C h^{1-s} \|v_{h}\|_{S} {\sum}_{l=1}^{k+1} h^{l-1} |\phi|_{W^{l,\infty}(S)}. $$

(6.3)

Using the bound (5.1) it then follows that

$$ {\sum}_{l=1}^{k+1} h^{l-1} |\phi|_{W^{l,\infty}(S)} \leq C {\sum}_{l=1}^{k+1} h^{l-1} (K h^{\frac12})^{-l}\|\phi\|_{L^{\infty}(S)} \leq C (K h^{\frac12})^{-1}\|\phi\|_{L^{\infty}(S)}. $$

(6.4)

Where we used the assumption that h < 1 and K > 1 in the last inequality. Combining the bounds (6.2), (6.3) and (6.4) it follows that

$$ \|\phi v_{h} - i_{h} (\phi v_{h}) \|_{H^{s}(S)} \leq C h^{1-s} (K h^{\frac12})^{-1} \|\phi\|_{L^{\infty}(S)} \|v_{h}\|_{S}. $$

The claim now follows by applying (5.8). □

Proof

(Lemma 2) First note that by the construction of ϖ there holds

$$ |\nabla \varpi| \leq C (\sqrt{h} K)^{-1} \varpi \leq (C \sqrt{h}/K) h^{-1} \varpi $$

and we see that we may apply (5.5)–(5.7) with ϕ = ϖ for $ (C \sqrt {h}/K)$ small enough. □

Proof of (5.8).

To prove (5.8), consider a triangle S, assume that the max value in $\max \limits _{(x,t) \in S\times I_{\delta }} \varpi (x,t)$ is taken at (x^∗,t^∗) ∈ S × I_δ. Then

$$ \begin{array}{@{}rcl@{}} \max_{(x,t) \in S \times I_{\delta}} \varpi(x,t) \|v\|_{S} = \|\varpi(x^{\ast},t^{\ast}) v\|_{S} &\leq& \|(\varpi(x^{\ast},t^{\ast}) - \varpi(\cdot, \tilde t)) v\|_{S} + \| \varpi v\|_{S} \\ &\leq& C h^{\frac12} K^{-1} \varpi(x^{\ast},t^{\ast}) \|v\|_{S} + \| \varpi(\cdot, \tilde t) v\|_{S}, \end{array} $$

for any $\tilde t \in I_{\delta }$. Assuming that $C h^{\frac 12} K^{-1} \leq \frac 12$ we see that

$$ \max_{(x,t) \in S\times I_{\delta}} \varpi(x,t) \|v\|_{S} \leq 2 \|\varpi(\cdot, \tilde t) v\|_{S},\quad \forall \tilde t \in I_{\delta}. $$

Proof of (5.9).

For the proof of (5.9) first apply the stabilities (5.5)–(5.6). For the L²-norm this yields

$$ \|\varpi (v - \pi_{h} v)\|_{{\varOmega}} \leq \|\varpi (v - i_{h} v)\|_{{\varOmega}} + \|\varpi \pi_{h} (i_{h} v- v)\|_{{\varOmega}} \leq C \|\varpi (v - i_{h} v)\|_{{\varOmega}}. $$

Then apply interpolation locally and (5.8).

$$ \begin{array}{@{}rcl@{}} \|\varpi (v - i_{h} v)\|_{S} \leq \max_{x \in S} \varpi(x) \|v - i_{h} v\|_{S} &\leq& C \max_{x \in S} \varpi(x) h^{k+1} \|D^{k+1} v\|_{S}\\ & \leq& 2 C h^{k+1} \|\varpi D^{k+1} v\|_{S}. \end{array} $$

The claim follows by summing over $S \in \mathcal {T}$. The bound on the H¹-norm is identical.

Proof of (5.10).

The stabilization operator is defined by the sum of the jumps of the gradient over the faces of the element. The first step is to split that jump using the triangle inequality over each face. Given a face F = ∂S₁ ∩ ∂S₂ for elements S₁ and S₂ this takes the form.

$$ \|[\![{\nabla (v - \pi_{h} v)}]\!]\|_{F}^{2} \leq 2 \left( \|\nabla (v - \pi_{h} v)\|_{\partial S_{1} \cap F}^{2} + \|\nabla (v - \pi_{h} v)\|_{\partial S_{2} \cap F}^{2}\right). $$

By breaking up the jumps on the contributions from respective element faces in this was we have

$$ s_{\varpi}(v - \pi_{h} v,v - \pi_{h} v) \leq C {\sum}_{S \in \mathcal{T}} \left( \max_{x \in S} \varpi(x)\right)^{2} h^{2}\beta_{\infty} \|\nabla (v- \pi_{h} v) \|_{\partial S}^{2}. $$

Now apply the trace inequality (2.10) on each element to see that

$$ \|\nabla (v- \pi_{h} v) \|_{\partial S} \leq C\left( h^{\frac12} |\nabla (v- \pi_{h} v) |_{H^{1}(S)} + h^{-\frac12} \|\nabla (v- \pi_{h} v) \|_{S}\right). $$

For the first term in the right hand side add and subtract i_hu, split it using a triangle inequality and use an inverse inequality in one of the terms and interpolation in the other to see that

$$ \begin{array}{@{}rcl@{}} |\nabla (v- \pi_{h} v) |_{H^{1}(S)} &\leq& C\left( |\nabla (v- i_{h} v) |_{H^{1}(S)} + |\nabla (i_{h} v- \pi_{h} v) |_{H^{1}(S)} \right)\\ &\leq& C h^{k-1} \|D^{k+1} v\|_{S} + C h^{-1} \|\nabla (v-\pi_{h} v) \|_{S}. \end{array} $$

It follows using (5.8) that

$$ {\sum}_{S \in \mathcal{T}} \varpi(x)^{2} h^{2} \beta_{\infty}\|\nabla (v- \pi_{h} v) \|^{2}_{\partial S} \leq C \beta_{\infty} h^{2k+1} \| D^{k+1} v\|^{2}_{\varpi} + C \beta_{\infty} h \|\nabla (v- \pi_{h} v) \|_{\varpi}^{2}. $$

The claim now follows by applying (5.9) to the second term of the right hand side.

Proof

(Lemma 3) □

Proof of (5.11).

To prove (5.11) recall that

$$ |\nabla \varpi^{-1}| = |\varpi^{-2} \nabla \varpi| \leq C (\sqrt{h} K)^{-1}\varpi^{-1} $$

and we may apply (5.5) with ϕ = ϖ^− 1 to get

$$ \|\varpi^{-1}(\varpi^{2} v_{h} - \pi_{h} (\varpi^{2} v_{h}))\|_{{\varOmega}} \leq C \|\varpi^{-1}(\varpi^{2} v_{h} - i_{h} (\varpi^{2} v_{h}))\|_{{\varOmega}}. $$

Consider one simplex S, take out the weight and then apply Lemma 5 followed by (5.8)

$$ \|\varpi^{-1}(\varpi^{2} v_{h} - i_{h} (\varpi^{2} v_{h}))\|_{S} \leq \left( \max_{x \in S} \varpi^{-1}\right) \|\varpi^{2} v_{h} - i_{h} (\varpi^{2} v_{h})\|_{S} \leq C h^{\frac12}/K \|\varpi v_{h}\|_{S}. $$

Finally take the square of both sides and sum over the simplices. The H¹-norm estimate follows using similar arguments.

Proof of (5.12).

For the inequality (5.12) we consider one element of the sum and apply the trace inequality (2.10),

$$ \begin{array}{@{}rcl@{}} {}\|\varpi^{-1}\nabla(\varpi^{2} v_{h} - \pi_{h} (\varpi^{2} v_{h}))\|_{\partial S} & \leq& C\left( \max_{x \in S} \varpi^{-1} h^{\frac12}|\nabla(\varpi^{2} v_{h} - \pi_{h} (\varpi^{2} v_{h}))|_{H^{1}(S)}\right. \\ && + \left.\max_{x \in S} \varpi^{-1} h^{-\frac12}\|\nabla(\varpi^{2} v_{h} - \pi_{h} (\varpi^{2} v_{h}))\|_{S}\right). \end{array} $$

(6.5)

In the first term, add and subtract ∇i_h(ϖ²v_h) and use the triangle inequality followed by an inverse inequality to obtain

$$ \begin{array}{@{}rcl@{}} &&\max_{x \in S} \varpi^{-1} h^{\frac12}|\nabla(\varpi^{2} v_{h} - \pi_{h} (\varpi^{2} v_{h}))|_{H^{1}(S)} \\ &&\quad\leq C\max_{x \in S} \varpi^{-1} h^{\frac12} \left( |\nabla(\varpi^{2} v_{h} - i_{h} (\varpi^{2} v_{h}))|_{H^{1}(S)} + h^{-1} \|\nabla(i_{h} \varpi^{2} v_{h} - \pi_{h} (\varpi^{2} v_{h}))\|_{S}\right). \end{array} $$

For the first term in the right hand side we use Lemma 5, with s = 2,

$$ \begin{array}{@{}rcl@{}} h^{\frac12} \max_{x \in S} \varpi^{-1}|\nabla(\varpi^{2} v_{h} - i_{h} (\varpi^{2} v_{h}))|_{H^{1}(S)} &\leq& C \max_{x \in S}\varpi^{-1}K^{-1} h^{-1} \|\varpi^{2} v_{h}\|_{S}\\ & \leq& C K^{-1} h^{-1} \|\varpi v_{h}\|_{S}. \end{array} $$

(6.6)

To bound the second term we use (5.8), sum over $S \in \mathcal {T}$ and use the stability of the L²-projection (5.6) to get

$$ {\sum}_{S \in \mathcal{T}} \left( \max_{x \in S} \varpi^{-2}\right)h^{-1} \|\nabla(i_{h} \varpi^{2} v_{h} - \pi_{h} (\varpi^{2} v_{h}))\|_{S}^{2} \leq C h^{-1} \|\varpi^{-1} \nabla(i_{h} \varpi^{2} v_{h} - \varpi^{2} v_{h})\|_{{\varOmega}}^{2}. $$

We see that after summation over S the second term in the right hand side of (6.5) also is on this form.

On every S take out the factor $\max \limits _{x \in S} \varpi ^{-1}$ and apply Lemma 5 followed by (5.8) to arrive at

$$ h^{-\frac12} \|\varpi^{-1} \nabla(i_{h} \varpi^{2} v_{h} - \varpi^{2} v_{h})\|_{{\varOmega}}\leq C K^{-1} h^{-1} \|\varpi v_{h}\|_{{\varOmega}} $$

which together with (6.6), summed over S, concludes the proof of (5.12).

Proof

(Inequality (5.4)). For simplicity consider the form β ⋅∇u_h = ∂_xu_h. Using the product rule ∂_x(ϖ²v_h) = (∂_xϖ²)v_h + ϖ²∂_xv_h and the triangle inequality it follows that

$$ \begin{array}{@{}rcl@{}} \|h^{\frac12} (\partial_{x} (\varpi^{2} v_{h}) - \pi_{h}(\partial_{x} (\varpi^{2} v_{h})))\|_{\varpi^{-1}}^{2} &\leq& 2h \|(\partial_{x} \varpi^{2}) v_{h} - \pi_{h}(\partial_{x} \varpi^{2} v_{h})\|_{\varpi^{-1}}^{2} \\ &&+ 2h \|(\varpi^{2} \partial_{x} v_{h}) - \pi_{h}(\varpi^{2}\partial_{x} v_{h})\|_{\varpi^{-1}}^{2}. \end{array} $$

(6.7)

Noting that by the L²-stability of π_h, the bound of ϖ, Lemma 5, (5.1) and (5.8)

$$ h \|(\partial_{x} \varpi^{2}) v_{h} - \pi_{h}(\partial_{x} \varpi^{2} v_{h})\|_{\varpi^{-1}}^{2} \leq C h \|(\partial_{x} \varpi^{2}) v_{h} - i_{h}(\partial_{x} \varpi^{2} v_{h})\|_{\varpi^{-1}}^{2} \leq C K^{-2} \|v_{h}\|^{2}_{\varpi}. $$

It only remains to bound the second term of (6.7). We add and subtract π₀ϖ² defined by

$$ \pi_{0} \varpi^{2} \vert_{S} = |S|^{-1} {\int}_{S} \varpi^{2} $$

and use the triangle inequality to obtain

$$ \begin{array}{@{}rcl@{}} h \|(\varpi^{2} \partial_{x} v_{h}) - \pi_{h}(\varpi^{2} \partial_{x} v_{h})\|_{\varpi^{-1}}^{2} & \leq& Ch \|(\varpi^{2} \partial_{x} v_{h} - (\pi_{0} \varpi^{2}) \partial_{x} v_{h} )\|_{\varpi^{-1}}^{2}\\ && +C h \|((\pi_{0} \varpi^{2}) \partial_{x} v_{h} - \pi_{h}((\pi_{0} \varpi^{2}) \partial_{x} v_{h} )\|_{\varpi^{-1}}^{2}\\ &&+ Ch \| (\pi_{h}((\pi_{0} \varpi^{2}) \partial_{x} v_{h} ) - \pi_{h}(\varpi^{2} \partial_{x} v_{h}))\|_{\varpi^{-1}}^{2}\\ & =& T_{1}+T_{2}+T_{3}. \end{array} $$

First, for T₃, observe that by the stability of the L²-projection (5.5) we have

$$ h \| (\pi_{h}((\pi_{0} \varpi^{2}) \partial_{x} v_{h}) - \pi_{h}(\varpi^{2} \partial_{x} v_{h}))\|_{\varpi^{-1}}^{2} \leq C h \|(\varpi^{2} \partial_{x} v_{h} - (\pi_{0} \varpi^{2}) \partial_{x} v_{h})\|_{\varpi^{-1}}^{2} \leq C T_{1}, $$

(6.8)

so only T₁ and T₂ need to be bounded. For T₁, by the approximation $\|\varpi ^{2} - \pi _{0} \varpi ^{2}\|_{L^{\infty }(S)} \leq C h^{\frac 12}/K \|\varpi \|_{L^{\infty }(S)}^{2}$ and applying (5.8) repeatedly, we have for one simplex S,

$$ \begin{array}{@{}rcl@{}} \|\varpi^{-1} (\varpi^{2} \partial_{x} v_{h} - (\pi_{0} \varpi^{2}) \partial_{x} v_{h})\|_{S} &\leq& h^{\frac12}/K \max_{x \in S}\varpi^{2} \max_{x \in S} \varpi^{-1} \|\partial_{x} v_{h}\|_{S}\\ &\leq& C h^{-\frac12} K^{-1} \|\varpi v_{h}\|_{S}. \end{array} $$

Taking the square of both sides and summing over all simplices yields the bound for T₁,

$$ h \|(\varpi^{2} \partial_{x} v_{h} - (\pi_{0} \varpi^{2}) \partial_{x} v_{h})\|_{\varpi^{-1}}^{2} \leq C K^{-2} \|v_{h}\|_{\varpi}^{2}. $$

Finally for the term T₂ we use (5.3) with β₀ = (π₀ϖ²)e_x. This leads to

$$ h \|((\pi_{0} \varpi^{2}) \partial_{x} v_{h} - \pi_{h}((\pi_{0} \varpi^{2}) \partial_{x} v_{h} )\|_{\varpi^{-1}}^{2} \leq C_{ws} s_{\varpi^{-1}}\left( (\pi_{0} \varpi^{2}) v_{h},(\pi_{0} \varpi^{2}) v_{h}\right). $$

Adding and subtracting ϖ² and using the triangle inequality and the fact that ϖ² is smooth leads to

$$ s_{\varpi^{-1}}\left( (\pi_{0} \varpi^{2}) v_{h},(\pi_{0} \varpi^{2}) v_{h}\right) \!\leq\! 2 s_{\varpi}(v_{h},v_{h}) + 2 s_{\varpi^{-1}}\left( (\varpi^{2} - \pi_{0} \varpi^{2}) v_{h},(\varpi^{2} -\pi_{0} \varpi^{2}) v_{h}\right). $$

For the second term of the right hand side consider the boundary of one triangle and apply the trace inequality (2.10), followed by the approximation of π₀ to get

$$ \begin{array}{@{}rcl@{}} \|h (\varpi^{2} -\pi_{0} \varpi^{2}) \nabla v_{h}\|_{\partial S} &\leq& C \max_{x \in S} \varpi^{2} K^{-1} h^{\frac32} \left( h^{\frac12} |\nabla v_{h}|_{H^{1}(S)} + h^{-\frac12} \|\nabla v_{h}\|_{S}\right) \\ &\leq& C K^{-1} \|\varpi^{2} v_{h}\|_{S}. \end{array} $$

The last step followed using the inverse inequality (2.6) and (5.8). Proceeding by applying the previous bound to all triangle faces, it follows that

$$ \begin{array}{@{}rcl@{}} &&\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!s_{\varpi^{-1}}((\varpi^{2} - \pi_{0} \varpi^{2}) v_{h},(\varpi^{2} - \pi_{0} \varpi^{2}) v_{h}) \!\leq\! C {\sum}_{S \in \mathcal{T}} \max_{x \in S} \varpi^{-2} \|h (\varpi^{2} - \pi_{0} \varpi^{2}) \nabla v_{h}\|_{\partial S}^{2} \\ &&\leq C{\sum}_{S \in \mathcal{T}} \max_{x \in S} \varpi^{-2} K^{-2} \|\varpi^{2} v_{h}\|_{S}^{2} \leq C K^{-2} \| v_{h}\|_{\varpi}^{2}, \end{array} $$

(6.9)

where the last step follows using (5.8). The proof is now finished by collecting the bounds (6.8)–(6.9). □

Proof

(Lemma 4). Using δt ≤ Ch and (5.1) there holds

$$ \begin{array}{@{}rcl@{}} \left\|v_{h}{\int}_{t_{n-1}}^{t_{n}} \partial_{t} \varpi ~\mathrm{d}t\right\|_{{\varOmega}} &\leq& C\delta t/(K h^{\frac12}) \left( {\sum}_{S \in \mathcal{T}} \max_{(x,t) \in S \times [t_{n-1},t_{n}]} \varpi(x,t)^{2} \|v_{h} \|_{S}^{2}\right)^{\frac12}\\ &\leq& \delta t^{\frac12} C/K \|v_{h}\|_{\varpi_{n}}. \end{array} $$

For the second inequality we applied (5.8) elementwise and then upper bounded $\min \limits _{t \in [t_{n-1},t_{n}]} \|v_{h} \varpi (\cdot ,t)\|_{S}$ by $\|v_{h}\|_{\varpi _{n}}$. For the bound of the second term observe that, estimating

$$ \left|{\int}_{t_{n-1}}^{t_{n}} {\int}_{t}^{t_{n}}\partial_{tt} \varpi^{2} ~\mathrm{d}s \mathrm{d}t\right| \leq \delta t^{2} \max_{t \in [t_{n-1},t_{n}] }|\partial_{tt} \varpi^{2} | $$

and then applying (5.1) repeatedly with l = 1 and 2, to show

$$ \max_{t \in [t_{n-1},t_{n}] }|\partial_{tt} \varpi^{2} | \leq C^{2} h^{-1} K^{-2} \max_{t \in [t_{n-1},t_{n}] }\varpi^{2}. $$

It follows that for all $S \in \mathcal {T}$,

$$ \left\|v_{h}\left|{\int}_{t_{n-1}}^{t_{n}} {\int}_{t}^{t_{n}}\partial_{tt} \varpi^{2} ~\mathrm{d}s \mathrm{d}t\right|^{\frac12}\right\|_{S} \leq \delta t^{\frac12} C K^{-1}\max_{(x,t) \in S \times [t_{n-1},t_{n}] }\varpi \|v_{h}\|_{S}. $$

Applying (5.8) we conclude that

$$ \begin{array}{@{}rcl@{}} {\sum}_{S \in \mathcal{T}} \left\|v_{h}\left|{\int}_{t_{n-1}}^{t_{n}} {\int}_{t}^{t_{n}}\partial_{tt} \varpi^{2} ~\mathrm{d}s \mathrm{d}t\right|^{\frac12}\right\|_{S}^{2} &\leq& \delta t C^{2} K^{-2} {\sum}_{S \in \mathcal{T}} \min_{t \in [t_{n-1},t_{n}]} \|v_{h} \varpi(\cdot,t)\|^{2}_{S}\\ &\leq& \delta t C^{2} K^{-2} \|v_{h}\|^{2}_{\varpi_{n}}. \end{array} $$

□

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Burman, E. Weighted Error Estimates for Transient Transport Problems Discretized Using Continuous Finite Elements with Interior Penalty Stabilization on the Gradient Jumps. Vietnam J. Math. 50, 833–866 (2022). https://doi.org/10.1007/s10013-022-00550-x

Download citation

Received: 14 April 2021
Accepted: 08 October 2021
Published: 08 March 2022
Issue Date: October 2022
DOI: https://doi.org/10.1007/s10013-022-00550-x

Keywords

Mathematics Subject Classification (2010)

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Weighted Error Estimates for Transient Transport Problems Discretized Using Continuous Finite Elements with Interior Penalty Stabilization on the Gradient Jumps

Abstract

Similar content being viewed by others

Difference scheme for an initial–boundary value problem for a singularly perturbed transport equation

Error analysis for discretizations of parabolic problems using continuous finite elements in time and mixed finite elements in space

An adaptive algorithm for the transport equation with time dependent velocity

1 Introduction

2 Model Problem and Finite Element Discretization

Remark 1

3 Stability Estimate of the Finite Element Method

Theorem 1

Proof

Corollary 1

Proof

Remark 2

4 Error Estimates for the Stabilized Formulation (2.13)

Theorem 2

Proof

Remark 3

4.1 Rough Solutions: Convergence in Weak Norms

Proposition 1 (A posteriori error bound)

Proof

Remark 4

Theorem 3 (A priori error estimate for rough solutions)

Proof

4.2 Time Discretization and Stabilized Methods

5 Weighted Error Estimates

Remark 5

Lemma 1 (Stability L 2-projection)

Proof

Lemma 2 (Weighted approximation)

Lemma 3 (Super approximation)

Proposition 2 (Weighted stability)

Proof

Theorem 4

5.1 Discussion of Estimates for Rough Solutions

Corollary 2

Proof

5.2 Time Discretization and Weakly Imposed Boundary Conditions

Lemma 4

Proposition 3

Proof

Remark 6

6 Numerical Examples

6.1 An Example with Inflow and Outflow and Weakly Imposed Boundary Conditions

6.2 Long Term Stability

Code Availability

References

Funding

Author information

Authors and Affiliations

Corresponding author

Additional information

Availability of data and material

Publisher’s Note

Appendix

Appendix

Lemma 5

Proof

Proof

Proof of (5.8).

Proof of (5.9).

Proof of (5.10).

Proof

Proof of (5.11).

Proof of (5.12).

Proof

Proof

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification (2010)

Search

Navigation

Lemma 1 (Stability L ²-projection)