Boundary Treatment and Multigrid Preconditioning for Semi-Lagrangian Schemes Applied to Hamilton–Jacobi–Bellman Equations

Reisinger, Christoph; Rotaetxe Arto, Julen

doi:10.1007/s10915-016-0351-1

Boundary Treatment and Multigrid Preconditioning for Semi-Lagrangian Schemes Applied to Hamilton–Jacobi–Bellman Equations

Open access
Published: 13 January 2017

Volume 72, pages 198–230, (2017)
Cite this article

Download PDF

You have full access to this open access article

Journal of Scientific Computing Aims and scope Submit manuscript

Boundary Treatment and Multigrid Preconditioning for Semi-Lagrangian Schemes Applied to Hamilton–Jacobi–Bellman Equations

Download PDF

1553 Accesses
9 Citations
Explore all metrics

Abstract

We analyse two practical aspects that arise in the numerical solution of Hamilton–Jacobi–Bellman equations by a particular class of monotone approximation schemes known as semi-Lagrangian schemes. These schemes make use of a wide stencil to achieve convergence and result in discretization matrices that are less sparse and less local than those coming from standard finite difference schemes. This leads to computational difficulties not encountered there. In particular, we consider the overstepping of the domain boundary and analyse the accuracy and stability of stencil truncation. This truncation imposes a stricter CFL condition for explicit schemes in the vicinity of boundaries than in the interior, such that implicit schemes become attractive. We then study the use of geometric, algebraic and aggregation-based multigrid preconditioners to solve the resulting discretised systems from implicit time stepping schemes efficiently. Finally, we illustrate the performance of these techniques numerically for benchmark test cases from the literature.

Auxiliary Space Preconditioners for a $$C^{0}$$ Finite Element Approximation of Hamilton–Jacobi–Bellman Equations with Cordes Coefficients

Article 03 August 2022

Robust Preconditioners for DG-Discretizations with Arbitrary Polynomial Degrees

A Fast Solver for Boundary Integral Equations of the Modified Helmholtz Equation

Article 19 December 2014

1 Introduction

We consider semi-Lagrangian schemes, as described in [5, 9], for the numerical approximation of solutions to the Hamilton–Jacobi–Bellman (HJB) equation

$$\begin{aligned}&u_t - \inf _{\alpha \in {\mathcal {A}}} \left\{ L^{\alpha }[u](t, x) + c^{\alpha }(t, x) u(t,x) + f^{\alpha }(t, x) \right\} = 0, \quad (t,x) \in (0, T] \times \Omega , \end{aligned}$$

(1.1)

$$\begin{aligned}&u(0, x) = g(x), \quad x \in {{\bar{\Omega }}}, \end{aligned}$$

(1.2)

$$\begin{aligned}&u(t, x) = \psi (x), \quad (t,x) \in (0, T] \times \partial \Omega , \end{aligned}$$

(1.3)

where $\Omega $ is a domain, $Q_T :=(0, T] \times {\bar{\Omega }}$ with ${\bar{\Omega }} :=\Omega \cup \partial \Omega \subseteq {\mathbb {R}}^d$, ${\mathcal {A}}$ is a compact set,

$$\begin{aligned} L^{\alpha }[u](t, x) = \text {tr}[a^{\alpha }(t, x) D^2u(t, x)] + b^{\alpha }(t,x)Du(t,x) \end{aligned}$$

(1.4)

is a second order differential operator, and $\psi $ and g are the Dirichlet and initial conditions.

The coefficients $a^{\alpha } = \frac{1}{2} \sigma ^{\alpha } \sigma ^{\alpha , T}$, $b^{\alpha }$, $c^{\alpha }$, $f^{\alpha }$, the initial data g and the boundary conditions $\psi $ take their values, respectively, in ${\mathbb {S}}^d$, the space of $d \times d$ symmetric matrices, ${\mathbb {R}}^d$, ${\mathbb {R}}$, ${\mathbb {R}}$, ${\mathbb {R}}$, and ${\mathbb {R}}$, and $\sigma ^{\alpha } \in {\mathbb {R}}^{d \times P}$ such that $a^{\alpha }$ is positive semi-definite. We also assume the usual well-posedness conditions on the PDE coefficients, i.e. Lipschitz continuous in x uniformly in $\alpha $, Hölder continuous with exponent $\frac{1}{2}$ in time and continuous in $\alpha $ for each $(t, x) \in Q_T$ [18]. The relevant notion of solution for this type of non-linear equations is that of viscosity solutions [7] and the above conditions guarantee existence and uniqueness.

In general, the viscosity solution to (1.1)–(1.3) is unknown, thus it is necessary in practice to compute approximations numerically. Sufficient conditions for a numerical scheme to converge to the unique viscosity solution of (1.1)–(1.3) were proved by Barles and Souganidis [2] in terms of consistency, $L^\infty $-stability and monotonicity. We restrict our attention to finite difference discretizations of the differential operator (1.4).

The requirement of monotonicity drastically affects the properties and construction of finite difference schemes. Theorem 4 in [27] proves that local monotone discretizations have at most first order for first-order equations and second order for second-order equations. What is more, standard fixed stencil methods are monotone only under restrictions on the diffusion matrix, such as diagonal dominance [9, 12]. Results from [6, 21] further illustrate the limitations of such methods for the monotone approximation of second order derivatives.

This implies that generally approximations have to be non-local on the discrete level, i.e. the distance between mesh points involved in the scheme at a given point grows in relation to the mesh width as the mesh is refined. Such schemes are referred to as wide stencils. For general diffusion matrices, first order accurate wide stencils of the type considered here have been proposed in [5, 9], and a mixed fixed- and wide-stencil scheme in [19].

In this article, we analyse two issues arising in practice when numerically solving (1.1)–(1.3) using the class of schemes described in [5, 9, 20] to discretize the second order differential operator (1.4). This approximation combines wide stencils in the directions determined by the columns of the diffusion matrix $\sigma ^\alpha $ and the drift $b^\alpha $, together with (linear) interpolation. Following the notation in [9], we write the matrix $\sigma ^\alpha \in {\mathbb {R}}^{d \times P}$ as $(\sigma ^\alpha _1, \sigma ^\alpha _2, \ldots , \sigma ^\alpha _P)$, where $\sigma ^\alpha _p \in {\mathbb {R}}^d$ for $p \in \{1, 2, \ldots , P\}$ denotes the p-th column of $\sigma ^\alpha $, and observe that for $k > 0$ and any smooth function $\phi $,

$$\begin{aligned} \frac{1}{2} \text {tr} \left[ \sigma ^\alpha \sigma ^{\alpha ~ T} D^2 \phi (x) \right]&= \frac{1}{2} \sum _{p = 1}^P \frac{\phi (x + k \sigma ^\alpha _p ) -2 \phi (x) + \phi (x - k \sigma ^\alpha _p) }{k^2} + {\mathcal {O}}(k^2), \end{aligned}$$

(1.5)

$$\begin{aligned} b^\alpha D\phi (x)&= \frac{\phi (x + k^2 b^\alpha ) - \phi (x)}{k^2} + {\mathcal {O}}(k^2), \end{aligned}$$

(1.6)

where ${\mathcal {O}}(k^2)$ is the local truncation error of the finite difference and for compactness we write $b^\alpha \equiv b^\alpha (t, x)$ and $\sigma ^\alpha \equiv \sigma ^\alpha (t, x)$. As these approximations will be used for points lying on a discrete spatial grid $\Omega _{\Delta x}$ with nodes $\{x_j: 1 \le j \le N\}$, the displaced points $x + k^2 b^\alpha $, $x \pm k \sigma ^\alpha _p$ do not generally coincide with nodes of $\Omega _{\Delta x}$. Therefore, $\phi $ is replaced by an interpolant ${\mathcal {I}}_{\Delta x} \phi $ on that grid. We restrict our attention to linear interpolants, defined by the standard piecewise multilinear non-negative basis functions $\{w_j(\cdot ): 1 \le j \le N\}$ associated with the mesh nodes, such that for any function $\phi $

$$\begin{aligned} ({\mathcal {I}}_{\Delta x} \phi ) (x) = \sum _{j\in {\mathcal {N}}(x)} \phi (x_j) w_j(x), \end{aligned}$$

(1.7)

for all $x \in \Omega $, $x_j \in \Omega _{\Delta x}$, where ${\mathcal {N}}(x)$ is the set of neighbours of x on the mesh $\Omega _{\Delta x}$, i.e. the mesh points with non-zero interpolation weight. The resulting scheme is referred to as the Linear Interpolation Semi-Lagrangian (LISL) scheme.

It is shown in [9] that the leading order terms of the local truncation error are proportional to $k^2$ and $\frac{\Delta x^2}{k^2}$, where the last quantity corresponds to the linear interpolation error in the finite difference formulae (1.5) and (1.6) by replacing $\phi $ by its interpolant. Therefore, by choosing $k = \sqrt{\Delta x}$, the resulting scheme is locally of first order in $\Delta x$.

Following the notation in [9], the LISL finite difference approximations for the differential operator in (1.4) can be expressed as

$$\begin{aligned}&L_{\Delta x}^{\alpha }[{\mathcal {I}}_{\Delta x}\phi ](t, x) \nonumber \\&\quad :=\sum _{p =1}^M \frac{({\mathcal {I}}_{\Delta x}\phi )(t, x + y_{p}^{\alpha , +}(t, x)) - 2 ({\mathcal {I}}_{\Delta x}\phi )(t,x) + ({\mathcal {I}}_{\Delta x}\phi )(t, x + y_{p}^{\alpha , -}(t, x))}{2 \Delta x},\nonumber \\ \end{aligned}$$

(1.8)

for $x \in \Omega _{\Delta x}$, and some $M \ge 1$.

Different schemes can be obtained depending on the values taken by M and $y_{p}^{\alpha , \pm }(t, x)$. In particular, [9] discusses the following three schemes:

Examples of LISL schemes.

1.
Scheme 1: The approximation of Camilli and Falcone [5], corresponding to $y^{\alpha , \pm }_{p} = \pm \sqrt{\Delta x} \sigma ^{\alpha }_{p} + \frac{\Delta x}{P} b^{\alpha }$ and $M = P$.
2.
Scheme 2: The approximation in [9], corresponding to $y^{\alpha , \pm }_{p} = \pm \sqrt{\Delta x} \sigma ^{\alpha }_{p}$ for $p \le P$, $y^{\alpha , \pm }_{P+1} = \Delta x b^{\alpha }$, and $M = P + 1$.
3.
Scheme 3: A more efficient version of the Camilli–Falcone approximation, corresponding to $y^{\alpha , \pm }_{p} = \pm \sqrt{\Delta x} \sigma ^{\alpha }_{p}$ for $p < P$, $y^{\alpha , \pm }_{P} = \pm \sqrt{\Delta x} \sigma ^{\alpha }_{P} + \Delta x b^{\alpha }$, and $M = P$.

The authors show that this family of discretizations of (1.4) is consistent and monotone. Monotonicity of the scheme is fulfilled as the discrete approximation $L_{\Delta x}^{\alpha }\left[ {\mathcal {I}}_{\Delta x} \phi \right] $ is the composition of monotone finite differences and a monotone interpolation operation. Once discretized in space, the final scheme arises from discretising in time using the standard $\theta $-time stepping scheme for $\theta \in [0, 1]$, where $\theta = 0$ corresponds to the explicit Euler time stepping and $\theta = 1$ to the implicit case, on a time grid represented by a strictly increasing sequence of points $\{t_n\}_{n = 0}^{N_t+ 1}$ with $t_0 = 0$, $t_{N_t+1} = T$, and $\Delta t_n :=t_n - t_{n-1} \le \Delta t$ for all n. The scheme being monotone, it can be written as described in the following definition, where for any grid function $V:\{t_n\}_{n = 0}^{N_t+ 1} \times \Omega _{\Delta x} \rightarrow {\mathbb {R}}$, $V^n_i \equiv V(t_n, x_i)$.

Definition 1.1

(Equation (4.1) in [9]) A scheme is said to be of positive type, if it can be written as

$$\begin{aligned} \max _{\alpha \in {\mathcal {A}}} \left\{ {\mathcal {B}}^{\alpha , n,n}_{j, j} U^n_j - \sum _{ i \ne j} {\mathcal {B}}^{\alpha , n,n}_{j, i} U^n_i - \sum _{i=1}^N {\mathcal {B}}^{\alpha , n,n-1}_{j, i} U^{n-1}_i - F^{\alpha , n - 1 + \theta }_j \right\} = 0, \end{aligned}$$

(1.9)

for $ j = 1,\ldots , N$, on the discrete domain $\{t_n\}_{n = 0}^{N_t+ 1} \times \Omega _{\Delta x}$, where $U^n_i$ is the numerical solution at node $(t_n, x_i)$ and all the coefficients ${\mathcal {B}}$ are non-negative.

For the convenience of the reader, we reproduce the expressions for ${\mathcal {B}}^{\alpha , n, \cdot }_{ j, \cdot }$ of the LISL schemes as in [9], for all $1\le i \ne j \le N$, $x_i,x_j \notin \partial \Omega $,

$$\begin{aligned}&{\mathcal {B}}^{\alpha , n, n}_{j, j} = 1 + \theta \Delta t_n \left( \frac{M}{2 \Delta x} - l^{\alpha , n}_{j, j} - c^{\alpha , n-1+\theta }_j \right) ,&{\mathcal {B}}^{\alpha , n, n}_{j, i} = \theta \Delta t_n \, l^{\alpha , n}_{j, i},\\&{\mathcal {B}}^{\alpha , n, n-1}_{j, j} = 1 - (1 - \theta ) \Delta t_n \left( \frac{M}{2 \Delta x} - l^{\alpha , n-1}_{j, j} - c^{\alpha , n-1+\theta }_j \right) ,&{\mathcal {B}}^{\alpha , n, n-1}_{j, i} = (1-\theta ) \Delta t_n \, l^{\alpha , n-1}_{j, i}, \end{aligned}$$

where $c^{\alpha , n-1+\theta }_j = c^\alpha (t_{n-1}+\theta \Delta t,x_j)$ and

$$\begin{aligned} l^{\alpha , n}_{ j, i} = \sum ^{M}_{p=1} \frac{w_{i} (x_j + y^{\alpha , +}_{p}(t_{n}, x_j)) + w_{i} (x_j + y^{\alpha , -}_{p} (t_{n}, x_j))}{2 \Delta x}. \end{aligned}$$

The schemes described above have a wide stencil as the length of the stencil, being proportional to the ratio $k/\Delta x \sim 1/\sqrt{\Delta x}$, tends to $\infty $ as $\Delta x \rightarrow 0$. Hence, when applied on a bounded discrete grid, the stencil will generally exceed the domain for points close to its boundary. As discussed in [9], the overstepping may pose a problem depending on the equation and the type of boundary conditions imposed. We consider Dirichlet boundary conditions here.

Our first goal is to present and analyse a modification of the LISL scheme to deal with overstepping for problems on bounded domains with Dirichlet boundary conditions, and general drift and diffusion coefficients. We describe how to truncate the LISL stencil so that the truncation remains consistent and monotone. We prove that the resulting stencil for Scheme 2 above is of positive type (as per Definition 1.1), and since the coefficients ${\mathcal {B}}$ in (1.9) do not depend on U, it is also monotone. This is not the case for Schemes 1 and 3. We also observe that the truncation has both local and global impacts on the properties of the scheme. Locally, the modification of the scheme leads to a loss of accuracy of half an order in the consistency error, i.e. ${\mathcal {O}}(\sqrt{\Delta x})$ instead of ${\mathcal {O}}(\Delta x)$, due to the loss of symmetry. We compare the accuracy of the truncation with extrapolations of the boundary conditions by way of numerical tests for benchmark problems. As the mesh points requiring truncation of the scheme are restricted to an ${\mathcal {O}}(\sqrt{\Delta x})$ layer at the boundary, convergence rates close to ${\mathcal {O}}(\Delta x)$ are observed empirically for the new scheme. The truncation has a global effect in the sense that it modifies the CFL condition of explicit schemes by at least half an order, from $\Delta t = {\mathcal {O}}(\Delta x)$ to $\Delta t = {\mathcal {O}}(\Delta x^{3/2})$. As the empirical error is ${\mathcal {O}}(\Delta t) + {\mathcal {O}}(\Delta x)$ for fully implicit schemes, the computationally most efficient choice is $\Delta t \sim \Delta x$, outside the stability region of explicit schemes.

The second goal is therefore the use of implicit schemes and the efficient solution of the discrete system (1.9) using multigrid preconditioning. For $\theta \ne 0$, the coupling of the optimal control and the coefficients makes (1.9) a non-linear system of algebraic equations,

$$\begin{aligned} \max _{\alpha \in {\mathcal {A}}} \left( A_i^\alpha X - F_i^\alpha \right) = 0, \qquad i=1,\ldots , N, \end{aligned}$$

(1.10)

where $A_i^\alpha $ is the i-th row of a matrix $A^\alpha $ with elements $A_{i,j}^\alpha $, $i,j=1,\ldots , N$, and control $\alpha \in {\mathcal {A}}$. Comparing with (1.9), $A_{i,j}^\alpha = {\mathcal {B}}^{\alpha , n, n}_{i, j}$, $F_i^\alpha = F^{\alpha , n - 1 + \theta }_i$, and $X= (X_i) = (U^n_i)$ is the solution vector for the n-th time step. The maximisation over $\alpha $ in (1.10) is row-wise and usually done by linear search. By construction of the LISL scheme, $A^\alpha $ is an M-matrix with non-negative row sum. Therefore, following results in [4], we can use policy iteration to compute U. Then, within each policy iteration, a linear system $A_i^{\alpha _i} X = F_i^{\alpha _i}$, $i=1,\ldots , N$, with fixed control vector $(\alpha _i)_{1\le i \le N}$ has to be solved. We find (in contrast to [19]) that this last step is the computationally most costly part of the overall algorithm if direct linear solvers or standard iterative solvers are used.^{Footnote 1} We therefore study multigrid preconditioners (see Table 16.)

In the literature on multigrid for HJB equations, two main approaches are observed: on the one hand, multigrid is applied directly to the non-linear problem, as in [3, 13, 14]; and on the other hand, multigrid is applied to a linearised problem, as in [1]. In particular, [3, 14] provide the first multigrid algorithms for HJB equations and prove convergence, while [13] presents a novel smoother for HJB equations based on damped value iteration [17]. These articles have in common the use of standard fixed stencil finite difference approximations and the use of a geometric structure when building the hierarchy of multigrid subspaces.

The novelty of this article is to study the application of multigrid preconditioning to a wide stencil discretization. We will demonstrate, both by Fourier analysis of a model problem and by numerical tests in a more complex application, that standard geometric multigrid does not give mesh-size independent convergence.

We then investigate algebraic multigrid methods. The basis for the specific algorithm we use was introduced in [24] for linear elliptic PDEs. It empirically showed that “aggregation based methods could yield robust^{Footnote 2} and convergent schemes if used as preconditioners of a Krylov method, and were part of an enhanced multigrid cycle, not simple V- or W-cycles” as considered in [31]. By enhanced multigrid cycles, the authors refer to recursive schemes in which at each coarse level the solution to the residual equation is computed using a number of Krylov subspace iterations as in [26] or with a semi-iterative method based on Chebyshev polynomials called the AMLI cycle, see Section 5.6 of [34]. The aggregates were formed using heuristic criteria following coupling in the strongest direction.

In [22] the authors introduced an aggregation-based multigrid method with guaranteed convergence rate for symmetric M-matrices with non-negative row sum. A LISL discretization matrix is only symmetric in very specific cases with limited practical interest. For non-symmetric matrices, in [25] convergence of a simplified two-grid scheme using aggregation is proved for non-singular M-matrices with non-negative row and column sums. This requirement ensures that the symmetric part of the coefficient matrix A given by $A + A^T$ meets the assumptions in [22] and allows the use of its theoretically justified algorithms. We will derive conditions on the coefficients of the HJB equation such that this theory applies, and show empirically that aggregation-based multigrid gives roughly mesh-size independent convergence.

The rest of the article is organised as follows. Section 2 discusses the truncation of the LISL scheme for points whose stencil exceeds the domain and compares its performance to naïve extrapolations of the boundary conditions. Section 3 considers the application of three different multigrid methods to linear systems where the coefficient matrix arises from LISL discretizations. Section 4 contains the final remarks.

2 Boundary Treatment for the LISL Scheme

In this section, we analyse adaptations of Schemes 1–3 for initial-boundary value problems on bounded domains. As described in the introduction, for points x close to the boundaries of the domain, the stencil points $x + y^{\alpha , \pm }_{p} (t, x)$ in (1.8) generally do not lie in a mesh element. In the following, we therefore discuss the truncation of (1.8) so that the resulting scheme remains monotone, consistent, and $L^{\infty }$-stable. The proposed truncation samples the boundary points on the straight lines defined by the point x and $x + y^{\alpha , \pm }_{p}(t, x)$ and adjusts the corresponding finite difference weights for consistency.

2.1 Definition of Truncated Stencils

We take $\Omega \subset {\mathbb {R}}^d$ for $d \ge 2$. We first outline how the method can be defined on a general domain with curved boundary, but later (especially in the numerical tests) focus for simplicity on rectangular domains. We start with a Cartesian mesh on ${\mathbb {R}}^d$ with uniform mesh width $\Delta x$ and then choose $\Omega _{\Delta x}$ as all the points which lie inside $\Omega $. See Fig. 1.

We now fix a mesh node $x \in \Omega _{\Delta x}$ . There are two distinct situations where interpolation at the point $x + y^{\alpha , \pm }_{p}(t, x)$ as per (1.8) is not possible for given $t, \alpha $ and p:

A.
$x + y^{\alpha , \pm }_{p}(t, x) \notin {\bar{\Omega }}$ (bottom left in Fig. 1);
B.
$x + y^{\alpha , \pm }_{p}(t, x) \in {\bar{\Omega }}$, but the element it is contained in has vertices outside ${\bar{\Omega }}$ (top right).

We say the stencil “oversteps”. In such cases, the objective is to find truncated or extended stencil vectors ${\hat{y}}^{\alpha , \pm }_{p}(t, x)$ and corresponding finite difference weights $A^\alpha _p\equiv A^\alpha _p(t, x)$ and $B^\alpha _p \equiv B^\alpha _p(t, x)$, such that $x + {\hat{y}}^{\alpha , \pm }_{p}(t, x) \in \partial \Omega $ and the truncated scheme

$$\begin{aligned}&{\hat{L}}_{\Delta x}^{\alpha }[{\mathcal {I}}_{\Delta x}\phi ](t, x) :=\nonumber \\&\quad \sum _{p =1}^M \frac{A^\alpha _p ({\mathcal {I}}_{\Delta x}\phi )(t, x + {\hat{y}}_{p}^{\alpha , +}(t, x)) - (A^\alpha _p + B^\alpha _p) \phi (t, x) + B^\alpha _p ({\mathcal {I}}_{\Delta x}\phi )(t, x + {\hat{y}}_{p}^{\alpha , -}(t, x)) }{2 \Delta x} \end{aligned}$$

(2.1)

is a consistent approximation of (1.4) as $\Delta x \rightarrow 0$. If the stencil does not overstep, we have that ${\hat{y}}_{p}^{\alpha , \pm }(t, x) = y_{p}^{\alpha , \pm }(t, x)$ and $A^\alpha _p = B^\alpha _p = 1$. If it does, for any t we define

$$\begin{aligned} {\hat{y}}^{\alpha , \pm }_{p}(t, x)= & {} \mu ^{\alpha , \pm }_p(t, x) y_{p}^{\alpha , \pm }(t, x), \quad \text { where }\\ \mu ^{\alpha , \pm }_p(t, x)= & {} \min \left\{ \mu \ge 0 \,: \, x + \mu y_{p}^{\alpha , \pm }(t, x) \in \partial \Omega \right\} . \end{aligned}$$

In case A, this means $\mu <1$, while in case B we have $\mu >1$.

In the remainder of this section we restrict our attention to the truncation of the scheme on rectangular domains, in which case the elements of the Cartesian mesh cover exactly the domain and case B does not occur. Moreover, this means that interior mesh points cannot be arbitrarily close to the boundary, but are always at least $\Delta x$ away.^{Footnote 3} This allows the derivation of CFL conditions for the explicit schemes as given below in Sect. 2.3.

2.2 Consistency Conditions

In the truncated scheme (2.1) there are M pairs of weights, which can be chosen freely, subject to positivity, in order to obtain a consistent scheme. As we will see below, this is only possible for Scheme 2.

In the following, we denote $[[1, j]] \equiv [1, j] \cap {\mathbb {Z}}$ and for a vector $v \in {\mathbb {R}}^d$, $(v)_i$ denotes its i-th element. As in the introduction, we have that $b^{\alpha } \in {\mathbb {R}}^{d}$, and $\sigma ^{\alpha } = (\sigma ^{\alpha }_1, \ldots , \sigma ^{\alpha }_p, \ldots , \sigma ^{\alpha }_P) \in {\mathbb {R}}^{d \times P}$ where $\sigma ^{\alpha }_p \in {\mathbb {R}}^d$ denotes the p-th column vector. For compactness, we omit the dependence of the coefficients and the stencil related functions with respect to the position, that is $b^{\alpha } \equiv b^{\alpha }(t, x)$, $\sigma ^{\alpha }_p \equiv \sigma ^{\alpha }_p(t, x)$, $y_{p}^{\alpha , \pm } \equiv y_{p}^{\alpha , \pm }(t, x)$ and $\mu _{p}^{\alpha , \pm } \equiv \mu _{p}^{\alpha , \pm }(t, x)$. We add a second subscript taking values 1, 2 or 3 to $A^\alpha _{p}$, $B^\alpha _{p}$ and $y_{p}^{\alpha , \pm }$ to make the discretization scheme explicit.

Proposition 2.1

The truncated version of Schemes 1 and 3 is generally not consistent.

Proof

By Taylor expansion of a smooth test function we find that the consistency conditions for Scheme 1 are

$$\begin{aligned} \sum _{p \in {\mathcal {P}}} \left( A^\alpha _{1, p} ({\hat{y}}^{\alpha , +} _{1, p})_i + B^\alpha _{1, p} ({\hat{y}}^{\alpha , -}_{1, p})_i \right)&= 2 \Delta x \frac{|{\mathcal {P}}|}{P} (b^{\alpha })_i + o(\Delta x), \\ \sum _{p \in {\mathcal {P}}} \left( A^\alpha _{1, p} ({\hat{y}}^{\alpha , +}_{1, p})_{i_1} ({\hat{y}}^{\alpha , +}_{1, p})_{i_2} + B^ \alpha _{1, p} ({\hat{y}}^{\alpha , -}_{1, p})_{i_1} ({\hat{y}}^{\alpha , -}_{1, p})_{i_2} \right)&= 2\Delta x \sum _{p \in {\mathcal {P}}} (\sigma ^{\alpha }_{p})_{i_1} (\sigma ^{\alpha }_{p})_{i_2} + o(\Delta x), \end{aligned}$$

where ${\mathcal {P}} \subseteq [[1, P]]$ denotes the set of stencils overstepping the domain and $i, i_1, i_2 \in [[1, d]]$.

In Scheme 1, there are $2 |{\mathcal {P}}| \le 2 d$ variables, but $(d^2 + 3d)/2$ equations, d from the condition on the Jacobian and $(d^2 +d)/2$ from the condition on the Hessian. This overdetermined system has a solution only if there is linear dependence between the equations. Except for special cases, e.g. $|{\mathcal {P}}|=0$ or $\sigma ^\alpha _p$ parallel to $b^\alpha $ for some p, this is not the case. Hence, in general the truncated Scheme 1 is not consistent.

We observe that the same principle applies to Scheme 3 for $y^{\alpha , \pm }_{3, P} = \pm \sqrt{\Delta x} \sigma _P^{\alpha } + \Delta x b^{\alpha }$. $\square $

For example, consider $x_0 = (0, 0)^T$, ${\bar{\Omega }} = [-5, 1]^2$, $\sqrt{\Delta x} \sigma ^\alpha _1(x_0) = (2, 0)^T$, $\sqrt{\Delta x} \sigma _2^\alpha (x_0) = (0, 1)^T$, and $\Delta x b^\alpha (x_0) = (0, 1)^T$, then the truncated version of Scheme 1 is not consistent, but the one for Scheme 3 is. However, if $\Delta x b^\alpha (x_0) = ( 1, 1)^T$ then neither of them is consistent.

We conclude that for points whose stencil oversteps the boundary, the approximations of the first and second derivative should be considered separately, as done in Scheme 2.

Proposition 2.2

For Scheme 2 and all $p \in [[1, P+1]]$, let $\mu ^{\alpha , \pm }_{p} \in (0, 1]$ be the largest constant such that $x + \mu \ y_{2,p}^{\alpha , \pm } \in {\bar{\Omega }}$ for all $\mu \in [0,\mu ^{\alpha , \pm }_{p}]$, and define

$$\begin{aligned} A^\alpha _{2, P+1} = B^\alpha _{2, P+1} = \frac{1}{\mu ^{\alpha , +}_{P+1}} \left( = \frac{1}{\mu ^{\alpha , -}_{P+1}}\right) , \end{aligned}$$

(2.2)

and, for $p \in [[1, P]]$,

$$\begin{aligned} A^\alpha _{2, p} = \frac{2}{(\mu ^{\alpha , +}_{p})^2 + \mu ^{\alpha , +}_{p} \mu ^{\alpha , -}_{p}}, \qquad B^\alpha _{2, p} = \;\; \frac{2}{(\mu ^{\alpha , -}_{p})^2 + \mu ^{\alpha , -}_{p} \mu ^{\alpha , +}_{p}}. \end{aligned}$$

(2.3)

Then the scheme defined by (2.1) is consistent unless both $\mu ^{\alpha , +}_{p}, \mu ^{\alpha , -}_{p} \sim \mathcal {O}(\sqrt{\Delta x})$.

Proof

If the stencil oversteps, then the truncated stencil consists of the point at the intersection between the boundary $\partial \Omega $ and one of the segments $\{x, x+ \sqrt{\Delta x} \sigma ^{\alpha }_p\}$, $\{x, x - \sqrt{\Delta x} \sigma ^{\alpha }_p\}$, or $\{x, x+ \Delta x b^{\alpha }\}$. For each point (t, x) Scheme 2 requires the calculation of at most $2P + 1$ different weights, i.e. 2P for the second order term and one for the first order term. For the latter we have that ${\hat{y}}^{\alpha , +}_{2, P+1} = {\hat{y}}^{\alpha , -}_{2, P+1}$, therefore $A^\alpha _{2, P+1}= B^\alpha _{2, P+1}$. Ignoring the interpolation error for the time being, the coefficients are obtained from the consistency conditions (up to a term $o(\Delta x)$),

$$\begin{aligned}&(A^\alpha _{2, P+1} + B^\alpha _{2, P+1}) ({\hat{y}}^{\alpha , \pm }_{2, P+1})_i = 2 \Delta x (b^{\alpha })_i, \quad \forall i \in [[1, d]], \end{aligned}$$

(2.4)

for the first order term, and

$$\begin{aligned}&A^{\alpha }_{2, p} ({\hat{y}}^{\alpha , +}_{2,p})_i + B^{\alpha }_{2, p} ({\hat{y}}^{\alpha , -}_{2, p})_i = 0,\quad \forall i \in [[1, d]], \end{aligned}$$

(2.5)

$$\begin{aligned}&A^{\alpha }_{2, p} ({\hat{y}}^{\alpha , +}_{2, p})_{i_1} ({\hat{y}}^{\alpha , +}_{2, p})_{i_2} + B^{\alpha }_{2, p} ({\hat{y}}^{\alpha , -}_{2, p})_{i_1} ({\hat{y}}^{\alpha , -}_{2, p})_{i_2} = 2 \Delta x (\sigma ^{\alpha }_{p})_{i_1} (\sigma ^{\alpha }_{p})_{i_2},\quad \forall (i_1, i_2) \in [[1, d]]^2, \end{aligned}$$

(2.6)

for the second order term.

By construction of the truncated stencil (2.4) and (2.5) are linearly dependent across i, and (2.6) across $i_1$ and $i_2$, resulting in one (linearly independent) equation for the first order term weights and two for $A^\alpha _{2, p}$, $B^\alpha _{2, p}$, with solutions given by

$$\begin{aligned} A^\alpha _{2, P+1} = B^\alpha _{2, P+1} = \Delta x \frac{(b^{\alpha })_i}{({\hat{y}}^{\alpha , \pm }_{2, P+1})_i}, \end{aligned}$$

(2.7)

and

$$\begin{aligned} A^\alpha _{2, p} = \frac{2 \Delta x (\sigma ^{\alpha }_{p})^2_{i} }{({\hat{y}}^{\alpha , +}_{2, p})_{i} (({\hat{y}}^{\alpha , +}_{2, p})_{i} - ({\hat{y}}^{\alpha , -}_{2, p})_{i})}, \qquad B^\alpha _{2, p} = \frac{2 \Delta x (\sigma ^{\alpha }_{p})^2_{i} }{({\hat{y}}^{\alpha , -}_{2, p})_{i} (({\hat{y}}^{\alpha , -}_{2, p})_{i} - ({\hat{y}}^{\alpha , +}_{2, p})_{i})}, \end{aligned}$$

(2.8)

which are seen to be equivalent to Eqs. (2.2) and (2.3).

The contribution to the consistency error of (2.1) from the bilinear interpolation operator ${\mathcal {I}}$ is bounded by $(\Delta x)^{-1} \sum _p (|A_p|+|B_p|) (\Delta x)^2$, which is goes to 0 if $|A_p|+|B_p| = o((\Delta x)^{-1})$ for all p, which is violated if and only if $\mu ^{\alpha , +}_{p}, \mu ^{\alpha , -}_{p} \sim \mathcal {O}(\sqrt{\Delta x}).$ $\square $

Corollary 2.3

For the truncated Scheme 2, (2.1), (2.2) and (2.3), the following holds:

(a)
The scheme is of positive type and monotone with $A^{\alpha }_{2, p}, B^{\alpha }_{2, p} \ge 1$ for all $p \in [[1, P+1]]$.
(b)
For points x within a distance ${\mathcal {O}}(\Delta x)$ of the boundary and $p \ne P+1$, as $\Delta x \rightarrow 0$,
$$\begin{aligned} \text {if } |{\hat{y}}^{\alpha , +}_{2,p}|< \sqrt{\Delta x}|\sigma ^{\alpha }_{p}| \text { and } |{\hat{y}}^{\alpha , -}_{2,p}| = \sqrt{\Delta x}|\sigma ^{\alpha }_{p}|&\implies A^{\alpha }_{2, p} \sim {\mathcal {O}}(\Delta x^{-1/2}) \text { and } \lim _{\Delta x \rightarrow 0} B^{\alpha }_{2, p} = 2, \\ \text {if } |{\hat{y}}^{\alpha , -}_{2,p}|< \sqrt{\Delta x}|\sigma ^{\alpha }_{p}| \text { and } |{\hat{y}}^{\alpha , +}_{2,p}| = \sqrt{\Delta x}|\sigma ^{\alpha }_{p}|&\implies \lim _{\Delta x \rightarrow 0} A^{\alpha }_{2, p} = 2 \text { and } B^{\alpha }_{2, p} \sim {\mathcal {O}}(\Delta x^{-1/2}), \\ \text {if } |{\hat{y}}^{\alpha , \pm }_{2,p}| < \sqrt{\Delta x}|\sigma ^{\alpha }_{p}|&\implies A^{\alpha }_{2, p}, B^{\alpha }_{2, p} \sim {\mathcal {O}}(\Delta x^{-1}). \end{aligned}$$
(c)
The local consistency error for points with truncation and $p \ne P+1$ is ${\mathcal {O}}(\sqrt{\Delta x})$ if only one side of the stencil oversteps, and ${\mathcal {O}}(1)$ if both sides overstep.

Proof

The claim in (a) follows from (2.2), (2.3), and the fact that $\mu ^{\alpha , \pm }_{p} \in (0, 1]$ and the coefficients $A^{\alpha }_{2, p}, B^{\alpha }_{2, p}$ do not depend on the numerical solution U. The limits in (b) follow from (2.3) and noting that if the stencil oversteps for a point x lying ${\mathcal {O}}(\Delta x)$ away from the boundary, but at least $\Delta x$ by the assumption made on the mesh, then $\mu ^{\alpha , +}_{p} \sim {\mathcal {O}}(\sqrt{\Delta x})$ and/or $\mu ^{\alpha , -}_{p} \sim {\mathcal {O}}(\sqrt{\Delta x})$, but not $o(\sqrt{\Delta x})$.

To prove (c) we use Taylor expansions for each p and conclude using the limits in b). Let $\phi : {\bar{\Omega }} \rightarrow {\mathbb {R}}$ be a smooth function and for any $p \in ({\mathcal {P}} \cap [[1, P]])$, where ${\mathcal {P}}$ denotes the set of stencils overstepping the domain, then by Taylor expansion and the consistency conditions (2.5)–(2.6) the local consistency error $\tau $ for the p-th addend of (2.1) using multi-index notation is given by

$$\begin{aligned} \tau&:=\frac{A^\alpha _p \phi (t, x + {\hat{y}}_{p}^{\alpha , +}) - (A^\alpha _p + B^\alpha _p) \phi (t, x) + B^\alpha _p \phi (t, x + {\hat{y}}_{p}^{\alpha , -}) }{2 \Delta x} - \frac{1}{2} \mathrm {tr}[\sigma ^\alpha _p \sigma ^{\alpha , T}_p D^2 \phi ] \\&= \frac{1}{2 \Delta x} \sum _{|\beta | \ge 3} \frac{1}{|\beta |!} ( A^\alpha _p ({\hat{y}}_{p}^{\alpha , +})^\beta + B^\alpha _p ({\hat{y}}_{p}^{\alpha , -})^\beta ) D^\beta \phi , \end{aligned}$$

where, due to the truncation of the stencil, the scheme is not central and therefore the terms for odd $|\beta |$ do not cancel out. If only one side of the stencil oversteps then for $|\beta | = 3$

$$\begin{aligned}\frac{A^\alpha _p ({\hat{y}}_{p}^{\alpha , +})^\beta + B^\alpha _p ({\hat{y}}_{p}^{\alpha , -})^\beta }{\Delta x} \sim {\mathcal {O}}( \sqrt{\Delta x}),\end{aligned}$$

whereas if both sides overstep then the error from interpolation dominates and is ${\mathcal {O}}(1)$ for points ${\mathcal {O}}(\Delta x)$ from the boundary, as seen at the end of the proof of Proposition 2.2. $\square $

Remark 2.1

(Two-sided overstepping) We note that it is possible for both sides of the stencil to overstep if the diffusion direction $\sigma ^\alpha _p$ is (almost) parallel to the domain boundary, for points close to a locally convex smooth boundary with high curvature in that direction, as well as close to corners; see Remark 2.4 and Table 5 below.

The scheme is consistent at points with two-sided overstepping if the truncated scheme is not interpolated at the boundary but uses the exact boundary values. In that case, the consistency error for those points is ${\mathcal {O}}( \Delta x)$.

2.3 Properties of the Truncated Stencil

The changes in the finite difference weights of scheme (2.1) introduced by the truncation, modify the positivity conditions given in Lemma 4.1 in [9]. We will show that the scheme remains conditionally $L_\infty $-stable and monotone, but the CFL conditions are more restrictive in the truncated case for time-stepping schemes with $\theta < 1$. We start by writing the scheme on a discrete time-space grid with mesh parameters $\Delta t$ and $\Delta x$ as

$$\begin{aligned}&{\hat{L}}^{\alpha }_{\Delta x} [{\mathcal {I}}_{\Delta x} \phi (t, \cdot )](t_{n}, x_j) \nonumber \\&\quad = \sum ^{M}_{p=1} \frac{1}{2 \Delta x} \left[ A^{\alpha , n}_p ({\mathcal {I}}_{\Delta x} \phi (t_{n}, \cdot ))(x_j + {\hat{y}}^{\alpha , +}_{p}) - (A^{\alpha , n}_p + B^{\alpha , n}_p) \phi (t_{n}, x_j) \right. \nonumber \\&\qquad \left. +\, B^{\alpha , n}_p ({\mathcal {I}}_{\Delta x} \phi (t_{n}, \cdot ))(x_j + {\hat{y}}^{\alpha , -}_{p})\right] \nonumber \\&\quad = \sum ^{M}_{p=1} \Bigg \{ \sum _{i \in {\mathcal {N}}(x_j + {\hat{y}}^{\alpha , +}_{p})} \frac{1}{2 \Delta x} \left[ A^{\alpha , n}_p w_{i} (x_j + {\hat{y}}^{\alpha , +}_{p} )\right] (\phi (t_{n}, x_i) - \phi (t_{n}, x_j)) \, \nonumber \\&\qquad + \sum _{i \in {\mathcal {N}}(x_j + {\hat{y}}^{\alpha , -}_{p})} \frac{1}{2 \Delta x} \left[ B^{\alpha , n}_p w_{ i} (x_j + {\hat{y}}^{\alpha , -}_{p} )\right] (\phi (t_{n}, x_i) - \phi (t_{n}, x_j)) \Bigg \} \nonumber \\&\quad = \sum _{i=1}^N \sum ^{M}_{p=1} \frac{A^{\alpha , n}_p w_{i} (x_j + {\hat{y}}^{\alpha , +}_{p}) + B^{\alpha , n}_p w_{i} (x_j + {\hat{y}}^{\alpha , -}_{p})}{2 \Delta x} (\phi (t_{n}, x_i) - \phi (t_{n}, x_j)) \nonumber \\&\quad = \sum _{i = 1}^N {\hat{l}}^{\alpha , n}_{j, i} (\phi (t_{n}, x_i) - \phi (t_{n}, x_j)), \end{aligned}$$

(2.9)

where ${\mathcal {N}}$ is the set of neighbours as in (1.7), and

$$\begin{aligned} {\hat{l}}^{\alpha , n}_{ j, i} = \sum ^{M}_{p=1} \frac{A^{\alpha , n}_p w_{ i} (x_j + {\hat{y}}^{\alpha , +}_{p}(t_{n}, x_j)) + B^{\alpha , n}_p w_{i} (x_j + {\hat{y}}^{\alpha , -}_{p} (t_{n}, x_j))}{2 \Delta x}. \end{aligned}$$

The first equality follows from (2.1), the second from (1.7) and since for all $1\le i, j \le N$

$$\begin{aligned} w_j(x) \ge 0, \quad w_i(x_j) = \delta _{ij}, \quad \text {and} \quad \sum _{i \in {\mathcal {N}}(x)} w_i (x) \equiv 1, \end{aligned}$$

(2.10)

for multi-linear interpolation. Here,

$$\begin{aligned} \sum _{i=1}^N {\hat{l}}^{\alpha , n}_{j, i} = \sum ^{M}_{p=1} \frac{A^{\alpha , n}_p + B^{\alpha , n}_p}{2\Delta x} \ge \frac{M}{\Delta x}, \end{aligned}$$

with equality only in the absence of domain overstepping for all $p \in [[1, M]]$ at $(t_n, x_j, \alpha )$.

Writing the overall scheme in the form (1.9) of Definition 1.1, we have that

$$\begin{aligned}&\sup _{\alpha } \left\{ \left[ 1 + \theta \Delta t_n \left( \sum ^{M}_{p= 1} \frac{A^{\alpha , n}_p + B^{\alpha , n}_p}{2 \Delta x} - {\hat{l}}^{\alpha , n}_{ j, j} - c^{\alpha , n-1+\theta }_j \right) \right] U^n_j - \theta \Delta t_n \sum _{i \ne j} {\hat{l}}^{\alpha , n}_{ j, i} U^n_i + \right. \nonumber \\&\quad \left. - \,\left[ 1 - (1 - \theta ) \Delta t_n \left( \sum ^{M}_{p= 1} \frac{A^{\alpha , n-1}_p + B^{\alpha , n-1}_p}{2 \Delta x} - {\hat{l}}^{\alpha , n-1}_{j, j} - c^{\alpha , n-1+\theta }_j \right) \right] U^{n-1}_j + \right. \nonumber \\&\quad \left. -\, (1-\theta ) \Delta t_n \sum _{i \ne j} {\hat{l}}^{\alpha , n-1}_{j, i} U^{n-1}_i - \Delta t_n f^{\alpha , n-1+\theta }_{j} \right\} =0. \end{aligned}$$

(2.11)

It is straightforward to write down the expressions for the coefficients in (1.9):

$$\begin{aligned} {\mathcal {B}}^{\alpha , n, n}_{j, j}&= 1 + \theta \Delta t_n \left( \sum ^{M}_{p= 1} \frac{A^{\alpha , n}_p + B^{\alpha , n}_p}{2 \Delta x} - {\hat{l}}^{\alpha , n}_{ j, j} - c^{\alpha , n-1+\theta }_j \right) , \\ {\mathcal {B}}^{\alpha , n, n-1}_{ j, j}&= 1 - (1 - \theta ) \Delta t_n \left( \sum ^{M}_{p= 1} \frac{A^{\alpha , n-1}_p + B^{\alpha , n-1}_p}{2 \Delta x} - {\hat{l}}^{\alpha , n-1}_{j, j} - c^{\alpha , n-1+\theta }_j \right) , \\ {\mathcal {B}}^{\alpha , n, n}_{j, i}&= \theta \Delta t_n \, {\hat{l}}^{\alpha , n}_{j, i}, \qquad {\mathcal {B}}^{\alpha , n, n-1}_{j, i} = (1-\theta ) \Delta t_n \, {\hat{l}}^{\alpha , n-1}_{j, i}. \end{aligned}$$

Remark 2.2

In writing down (2.9), we assumed that the value at the boundary is interpolated from other mesh points, which is feasible on rectangular cuboids, but not for general domain boundaries. In both cases, the Dirichlet boundary value at $x_j + {\hat{y}}^{\alpha , \pm }_{p}$ can be used. This has the advantage that interpolation error is avoided. Moreover, as this value then contributes to the right-hand-side f of Eq. (2.11) instead of the off-diagonal matrix elements, the system matrix becomes more diagonally dominant. This is advantageous for the iterative solution, see Sect. 3.4.

The next proposition contains the positivity conditions for the coefficients ${\mathcal {B}}$ defined above.

Proposition 2.4

The scheme (2.11) is of positive type if the following conditions hold,

$$\begin{aligned} (1 - \theta ) \Delta t_n \left[ \sum ^{M}_{p=1} \frac{A^{\alpha , n-1}_p + B^{\alpha , n-1}_p}{2\Delta x} - c^{\alpha , n-1+\theta }_i \right] \le 1, \quad \text {and } \;\; \theta \Delta t_n c^{\alpha , n-1+\theta }_i \le 1, \end{aligned}$$

(2.12)

for all $\alpha , n, i$.

Corollary 2.5

In the case of overstepping and $\theta < 1$, monotonicity requires that $\Delta t \sim {\mathcal {O}}(\Delta x^{3/2})$ if only one side of the diffusion stencils oversteps, or $\Delta t \sim {\mathcal {O}}(\Delta x^{2})$ if both sides overstep. However, if the stencil is not truncated, the positivity condition remains as in [9], that is $\Delta t \sim {\mathcal {O}}(\Delta x)$.

Proof

From Corollary 2.3, if the corresponding stencil is truncated on one side $A^{\alpha , n-1}_\cdot + B^{\alpha , n-1}_\cdot \sim {\mathcal {O}}(\Delta x^{-1/2})$ for sufficiently small $\Delta x$, $A^{\alpha , n-1}_\cdot + B^{\alpha , n-1}_\cdot \sim {\mathcal {O}}(\Delta x^{-1})$ if both sides are truncated, whereas if there is no overstepping, $A^{\alpha , n-1}_\cdot + B^{\alpha , n-1}_\cdot \sim {\mathcal {O}}(1)$. $\square $

The $L^\infty $-stability follows from the proof of Lemma 4.1 in [9] and the new CFL conditions in Proposition 2.4.

2.4 Numerical Experiments

To test the truncation of the stencil, we consider Problems A and B in Section 9.3 from [9]. Both problems follow the formulation in (1.1)–(1.3) with homogeneous Dirichlet boundary conditions and have smooth solutions.

Problem A (see Section 9.3 from [9]). It has exact solution $u(t, x_1, x_2) = \left( \frac{3}{2} - t \right) \sin x_1 \sin x_2,$ and coefficients and control set are given by

$$\begin{aligned}&f^\alpha = \left( \frac{1}{2} - t \right) \sin x_1 \sin x_2 + \left( \frac{3}{2} - t \right) \Big [ \sqrt{\cos ^2 x_1 \sin ^2 x_2 + \sin ^2 x_1 \cos ^2 x_2} + \\&\quad \qquad ~ - 2 \sin (x_1 + x_2) \cos (x_1 + x_2) \cos x_1 \cos x_2 \Big ], \\&c^\alpha = 0, ~ b^\alpha = \alpha , \quad \sigma ^\alpha = \sqrt{2} \begin{pmatrix} \sin (x_1 + x_2) \\ \cos (x_1 + x_2) \end{pmatrix}, \quad {\mathcal {A}} = \{ \alpha \in {\mathbb {R}}^2 : \alpha ^2_1 + \alpha ^2_2 = 1\}. \end{aligned}$$

Problem B (see Section 9.3 from [9]). It has exact solution $u(t, x_1, x_2) = \left( 2 - t\right) \sin (x_1) \sin (x_2)$, and coefficients and control set

$$\begin{aligned}&f^\alpha = \left( 1 - t \right) \sin x_1 \sin x_2 - 2 \alpha _1 \alpha _2 (2 - t) \cos x_1 \cos x_2, \\&c^\alpha = 0, ~ b^\alpha = 0, \quad \sigma ^\alpha = \sqrt{2} \begin{pmatrix} \alpha _1 \\ \alpha _2 \end{pmatrix}, \quad {\mathcal {A}} = \{ \alpha \in {\mathbb {R}}^2 : \alpha ^2_1 + \alpha ^2_2 = 1\}. \end{aligned}$$

Both problems are solved on the domain $(t, x_1, x_2) \in [0, T] \times [-\pi , \pi ]^2$ with $T = \frac{1}{2}$. We discretize the spatial domain using Cartesian grids with $N_x \times N_x$ equispaced nodes and for the control set ${\mathcal {A}}$ we take $N_\alpha $ equally spaced points. Here, ${\mathcal {I}}_{\Delta x}$ is the usual bilinear interpolator on rectangles.

For illustration of the stencil and its non-locality, the top row of Fig. 2 represents the stencil for Problems A and B on a Cartesian grid of $11 \times 11$ points and 10 points in the control set ${\mathcal {A}}$. Colour coded lines link the stencil points with the node where the numerical solution is computed, the different colours correspond to the different ${\hat{y}}^{\alpha , \cdot }_\cdot $. On top of some of the stencil points we print the value of the finite difference weights, for compactness we set $A \equiv A^\alpha _{2, 1}(x)$, $B \equiv B^\alpha _{2, 1}(x)$ and $C \equiv (\mu ^\alpha _{2, 2}(x))^{-1}$, following the notation in (2.3) and (2.2). The bottom row of Fig. 2 represents the non-locality of the diffusion stencil by counting the number of stencil points at a given distance from the central node. The distance is measured as multiples of $\Delta x$ and given by $\left\lfloor \frac{(\sigma ^\alpha (x))_i}{\sqrt{\Delta x}} \right\rfloor $, where the grid is of size $641 \times 641$ and 10 points in the control set ${\mathcal {A}}$.

Problems A and B were obviously chosen in [9] for their periodic solutions, to be able to analyse the convergence of the scheme without the complication of boundary conditions. Here, we do not make use of the periodicity but only use the values at the boundary and not outside the domain.

We note that the problems being linear in t, a single time step with $\Delta t = T$ suffices to obtain an exact solution in t. However, in order to check the effect of the truncation on the stability, in addition to $\Delta t = T$, we also investigate $\Delta t$ equal to $\frac{\Delta x}{4}$, $\Delta x^{3/2}$, and $\Delta x^2$. We report the $\infty $-norm of the errors over two regions: the first one comprising the whole domain, and the second one comprising part of the interior of the domain.

We consider explicit and implicit time stepping schemes, corresponding to $\theta = 0$ and $\theta = 1$ respectively. For the explicit scheme in the case of overstepping we test the following modifications of the scheme:

1.
truncation of the stencil as discussed in Sect. 2.2 (Table 1 for Problem A and Table 12 for Problem B);
2.
constant extrapolation of the boundary value in the direction of the semi-Lagrangian step (Table 2 for Problem A and Table 13 for Problem B);
3.
linear extrapolation of the boundary value in the direction of the semi-Lagrangian step (Table 3 for Problem A and Table 14 for Problem B).

For the implicit case we only consider the first modification, i.e. truncation of the stencil (Table 4 for Problem A and Table 15 for Problem B).

The results confirm the impact of the truncation on the stability of the scheme, when $\theta = 0$. However, when $\theta = 1$, we do not observe any instability regardless of the size of the time step. When stable, the truncation of the stencil outperforms the two extrapolations of the boundary conditions considered. Furthermore, as the mesh and time steps are refined, only the truncated scheme, if stable, achieves convergence orders close to ${\mathcal {O}}(\Delta x)$ when the error at $t = T$ is measured on the entire spatial grid. This can be explained without rigorous proof by the observation that the truncation error of order $\sqrt{\Delta x}$ is restricted to a boundary layer of width $\sqrt{\Delta x}$. Therefore, as seen from the last two columns in Table 4, choosing $\Delta t$ of order higher than 1 in $\Delta x$ does not improve the accuracy of the numerical results and leads to computational inefficiency.

Remark 2.3

Regarding the discretization of the control set, we take $N_\alpha = 40$ equally spaced points. For this choice, the discretization error of the LISL scheme is found to dominate the control discretization error for the problems and the space-time mesh sizes considered.

Table 1 Results using the truncation of the stencil for explicit method with $N_{\alpha } = 40$ for Problem A

Boundary Treatment and Multigrid Preconditioning for Semi-Lagrangian Schemes Applied to Hamilton–Jacobi–Bellman Equations

Abstract

Similar content being viewed by others

Auxiliary Space Preconditioners for a $$C^{0}$$ Finite Element Approximation of Hamilton–Jacobi–Bellman Equations with Cordes Coefficients

Robust Preconditioners for DG-Discretizations with Arbitrary Polynomial Degrees

A Fast Solver for Boundary Integral Equations of the Modified Helmholtz Equation

1 Introduction

Definition 1.1

2 Boundary Treatment for the LISL Scheme

2.1 Definition of Truncated Stencils

2.2 Consistency Conditions

Proposition 2.1

Proof

Proposition 2.2

Proof

Corollary 2.3

Proof

Remark 2.1

2.3 Properties of the Truncated Stencil

Remark 2.2

Proposition 2.4

Corollary 2.5

Proof

2.4 Numerical Experiments

Remark 2.3

Remark 2.4

Remark 2.5

3 Multigrid Preconditioning

Definition 3.1

Definition 3.2

3.1 On the Spectrum of LISL Matrices

3.2 Local Fourier Analysis of the Smoothers

Definition 3.3

Definition 3.4

Example 3.1

Example 3.2

Example 3.3

3.3 Performance of Geometric Multigrid

3.4 Properties of the LISL Matrix

Remark 3.1

Remark 3.2

Proposition 3.1

Proof

Remark 3.3

3.5 Performance of the Algebraic Approaches

4 Conclusions

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Appendix

Appendix

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation