1 Introduction

Overview on prior work

The matrix exponential and associated φ-functions play a crucial role in some numerical methods for solving systems of differential equations. In practice, this means that the vector \(\mathrm{e}^{tA}v\) for a time step t, a given matrix A and a given vector v, representing the time propagation for a linear initial value problem, is to be approximated. Similarly, the associated φ-functions (see (2.2) below) correspond to solutions of certain inhomogeneous differential equations. In particular, the evaluation of φ-functions is used in exponential integrators [27].

If the matrix A is sparse and large, approximation of the action of these matrix functions in the class of Krylov subspaces is a general and well-established technique. For the matrix exponential and φ-functions, this goes back to early works in the field of chemical physics [39, 44], parabolic problems [20], some nonlinear problems [18], etc. The case of a symmetric or skew-Hermitian matrix A is the most prominent one. Krylov approximations of the matrix exponential were studied early on for the symmetric case in [12, 13, 46], and together with φ-functions in a more general setting [26, 28].

For an overview of different approaches for the numerical approximation of the matrix exponential, see [36]. In [46] it is shown for the symmetric case that the Krylov approximation is equivalent to interpolation of the exponential function at associated Ritz values. This automatically results in a near-best approximation among other choices of interpolation nodes, see also [12, 52] and further works [3] with similar results for the non-symmetric case and general functions including φ-functions. For other polynomial approaches approximating the matrix exponential, we mention truncated Taylor series [2] (and many earlier works), Chebyshev interpolation [54], or the Leja method [8], where [2] also covers φ-functions.

In general, Krylov approximations (or other polynomial approximations) result in an accurate approximation if the time step t in \(\mathrm{e}^{tA}v\) is sufficiently small or the dimension of the Krylov subspace (i.e., the degree of the approximating matrix polynomial) is sufficiently large, see for instance [26]. The dimension of the Krylov subspace is limited in practice, and large time steps require a restart of the iteration generating the Krylov basis. A larger time step t can be split into smaller substeps for which the Krylov approximation can be applied in a nested way. Such a restarting strategy in the sense of a time integrator was already exploited in [44]. In particular we refer to the EXPOKIT package [49]. Similar ideas can be applied for the evaluation of φ-functions [28, 41, 49].

In practice, a posteriori error estimates are used to choose a proper Krylov dimension or proper (adaptive) substeps if the method is restarted as a time integrator. Different approaches for a posteriori error estimation concerning the exponential function make use of a series expansion for the error given in [46] or use a formulation via the defect (also called residual) of the Krylov approximation [5, 9, 11, 28]. A prominent error estimate concerning φ-functions is the generalized residual estimate introduced in [28], which is based on the residual of a matrix inverse. Furthermore, a series expansion of the error concerning φ-functions is given in [49] (similar to the series expansion concerning the exponential in [46]), and leading terms of this series are used for a posteriori error estimation in [41, 49]. Further a priori as well as a posteriori error estimates for the exponential function are given in [3, 10, 30, 31, 34, 37, 56], where [10, 30] also consider φ-functions. Restarting via substeps based on different choices of error estimates is further discussed in [30]. A restart with substeps together with a strategy to choose the Krylov dimension in terms of computational cost was presented in [6, 41]. For various other approaches for restarting (without adapting the time step) we refer to [1, 5, 9, 15, 16, 40, 48, 53].

The influence of round-off errors on the construction of the Krylov basis in floating point arithmetic was studied early on for the symmetric case in [43, 45]. The orthogonalization procedure can be numerically unstable, typically due to a loss of orthogonality. Nevertheless, the near-best approximation property and related a priori convergence results are not critically affected [11, 13]. Following [11], in the symmetric case the defect obtained in floating point arithmetic results in numerically stable error estimates.

Beside the polynomial Krylov method, further studies are devoted to the approximation of matrix functions using so called extended Krylov subspaces [14, 21, 32], rational Krylov subspaces [17, 22, 38], or polynomial Krylov subspaces with a harmonic Ritz approach [25, 48, 57].

Overview on results presented here

In Section 2, we introduce the problem setting and recapitulate basic properties of Krylov subspaces.

In Section 3, we introduce the defect associated with Krylov approximations to φ-functions, including the exponential function as the basic case. Our approach for the defect is different from [57] and is based on an inhomogeneous differential equation for the approximation error. This is used in Theorem 1 to obtain an integral representation of the error, also taking effects of floating point arithmetic into account. In contrast to previous works ([11, 30]), this result is extended to φ-functions here.

This upper bound is further analyzed in Section 4 to obtain computable a posteriori bounds, in particular a new a posteriori bound (Theorem 4). We also study the accuracy of our and other existing defect-based bounds [30] with respect to spectral properties of the Krylov Hessenberg matrix (the representation of A in the orthogonal Krylov basis). To this end we use properties of divided differences including a new asymptotic expansion for these given in Appendix C. In Section 4.1, we consider error estimates based on a quadrature estimate of the defect norm integral: The generalized residual estimate [28] for the approximation of φ-functions which conforms to a quadrature of the defect norm integral (namely, the right-endpoint rectangle rule), and the effective order estimate, which was introduced for the approximation of the matrix exponential in [30] and is extended to φ-functions in the present work. We also discuss cases for which the defect norm behaves in an oscillatory manner and reliable quadrature estimates may be difficult to obtain. In Section 4.2, we specify a stopping criterion for the so-called lucky breakdown in floating point arithmetic which is justified by our a posteriori error bounds.

In Section 5, we illustrate our results via numerical experiments. This includes further remarks on previously known error estimates for the Krylov approximation of φ-functions.

2 Problem statement and Krylov approximation

We discuss the approximation via Krylov techniques for evaluation of the matrix exponential, and in particular of the associated φ-functions, for a step size t > 0 and matrix \(A\in {\mathbb {C}}^{n\times n}\) applied to an initial vector \(v\in {\mathbb {C}}^{n}\). Here,

$$ \mathrm{e}^{tA} v = \sum\limits_{k=0}^{\infty} \frac{(tA)^{k}}{k!} v. $$
(2.1)

The matrix exponential \(u(t) = \mathrm{e}^{tA}v\) is the solution of the differential equation

$$ u^{\prime}(t)=Au(t),~~~u(0)=v. $$

The associated φ-functions are given by

$$ \varphi_{p}(tA)v = \sum\limits_{k=0}^{\infty} \frac{(tA)^{k}}{(k+p)!} v,~~~ p \in \mathbb{N}_{0}. $$
(2.2)

This includes the case \(\varphi _{0} = \exp \). The matrix functions (2.1) and (2.2) are defined according to their scalar counterparts. The following definitions of φp are equivalent to (2.2): For \(z\in {\mathbb {C}}\) we have \( \varphi _{0}(z) = {\mathrm {e}}^{z} \), and

$$ \varphi_{p}(z) = \frac{1}{(p-1)!} {{\int}_{0}^{1}} \mathrm{e}^{(1-\theta) z} \theta^{p-1} \mathrm{d}\theta, \quad p \in \mathbb{N}. $$
(2.3)
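As a quick sanity check, the series (2.2) and the integral representation (2.3) can be evaluated against each other numerically for scalar arguments. The following sketch is purely illustrative (the truncation order K and midpoint-rule resolution N are ad hoc choices, not part of the paper):

```python
# Cross-check of the power series (2.2) against the integral form (2.3)
# for scalar phi_p. Illustrative sketch; K and N are ad hoc choices.
import math

def phi_series(p, z, K=60):
    # phi_p(z) = sum_{k>=0} z^k / (k+p)!
    return sum(z**k / math.factorial(k + p) for k in range(K))

def phi_integral(p, z, N=20000):
    # phi_p(z) = 1/(p-1)! * int_0^1 e^{(1-theta) z} theta^{p-1} dtheta, p >= 1
    h = 1.0 / N
    total = 0.0
    for i in range(N):
        theta = (i + 0.5) * h          # composite midpoint rule
        total += math.exp((1.0 - theta) * z) * theta**(p - 1)
    return total * h / math.factorial(p - 1)

for p in (1, 2, 3):
    assert abs(phi_series(p, -0.7) - phi_integral(p, -0.7)) < 1e-8
```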

(See also [24, Section 10.7.4].) The function \(w_{p}(t) = t^{p}\varphi_{p}(tA)v\) (\( p \in \mathbb {N} \)) is the solution of an inhomogeneous differential equation of the form

$$ w_{p}^{\prime}(t) = A w_{p}(t) + \frac{t^{p-1}}{(p-1)!} v,~~~w_{p}(0)=0, $$
(2.4)

see for instance [41]. This follows from (2.2),

$$ \begin{array}{@{}rcl@{}} \frac{\mathrm{d}}{\mathrm{d} t}\big(t^{p}\varphi_{p}(t A)v\big) &=& \frac{\mathrm{d}}{\mathrm{d} t}\Big(\sum\limits_{k=0}^{\infty} \frac{t^{k+p}A^{k}v}{(k+p)!} \Big) = A \sum\limits_{k=0}^{\infty} \frac{t^{k+p}A^{k}v}{(k+p)!} +\frac{t^{p-1}v}{(p-1)!}\\ &=& A (t^{p}\varphi_{p}(t A)v) + \frac{t^{p-1}v}{(p-1)!}. \end{array} $$
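The relation (2.4) can also be verified numerically. The sketch below (assuming NumPy; the Taylor truncation order and the finite-difference step size are illustrative choices) compares a central difference of \(w_p(t)=t^p\varphi_p(tA)v\) with the right-hand side of (2.4):

```python
# Numerical check of (2.4): w_p(t) = t^p phi_p(tA) v satisfies
# w_p'(t) = A w_p(t) + t^{p-1}/(p-1)! v.
import math
import numpy as np

def phi_mat(p, M, K=60):
    # phi_p(M) = sum_{k>=0} M^k/(k+p)!, via term_k = M term_{k-1}/(k+p)
    term = np.eye(M.shape[0]) / math.factorial(p)
    S = term.copy()
    for k in range(1, K):
        term = M @ term / (k + p)
        S = S + term
    return S

rng = np.random.default_rng(0)
n, p, t, h = 8, 2, 0.7, 1e-5
A = rng.standard_normal((n, n)) / n
v = rng.standard_normal(n)

def w(tau):
    return tau**p * phi_mat(p, tau * A) @ v

lhs = (w(t + h) - w(t - h)) / (2 * h)                 # central difference for w_p'
rhs = A @ w(t) + t**(p - 1) / math.factorial(p - 1) * v
assert np.linalg.norm(lhs - rhs) < 1e-8
```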

The φ-functions appear, for instance, in the field of exponential integrators; see [27].

For the case of A being a large and sparse matrix, e.g., the spatial discretization of a partial differential operator using a localized basis, Krylov subspace techniques are commonly used to approximate (2.2) in an efficient way.

Notation and properties of Krylov subspaces

We briefly recapitulate the usual notation and properties of standard Krylov subspaces, see for instance [47]. For a given matrix \(A\in \mathbb {C}^{n\times n} \), a starting vector \(v\in {\mathbb {C}}^{n}\) and a Krylov dimension \(0 < m \leq n\), the Krylov subspace is given by

$$ \mathscr{K}_{m}(A,v) = {\text{span}}(v,Av,\ldots,A^{m-1}v). $$

Let \( V_{m} \in \mathbb {C}^{n\times m} \) represent the orthonormal basis of \({\mathscr{K}}_{m}(A,v)\) with respect to the Hermitian inner product, constructed by the Arnoldi method and satisfying \(V_{m}^{\ast } V_{m} = I_{m\times m}\). Its first column is \(v/\beta\), i.e., \(V_{m}^{\ast } v = \beta e_{1} \) with \(\beta = \|v\|_{2}\). Here, the matrix

$$ H_{m} = V_{m}^{\ast} A V_{m} \in \mathbb{C}^{m\times m} $$

is upper Hessenberg. We further use the notation \(h_{m+1,m}=(H_{m+1})_{m+1,m}\in \mathbb {R}\), and \(v_{m+1}\in \mathbb {C}^{n}\) for the (m + 1)th column of Vm+ 1, with \(V_{m}^{\ast } v_{m+1}=0\) and ∥vm+ 12 = 1.

The Arnoldi decomposition (in exact arithmetic) can be expressed in matrix form,

$$ AV_{m} = V_{m}H_{m} + h_{m+1,m} v_{m+1} e_{m}^{\ast} . $$
(2.5)
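A minimal Arnoldi iteration illustrating the decomposition (2.5) might look as follows. This is a sketch assuming NumPy, using modified Gram-Schmidt without reorthogonalization or lucky-breakdown handling; it is not the implementation underlying the experiments in this paper:

```python
# Minimal Arnoldi iteration (modified Gram-Schmidt) and a check of (2.5):
# A V_m = V_m H_m + h_{m+1,m} v_{m+1} e_m^*.
import numpy as np

def arnoldi(A, v, m):
    n = v.shape[0]
    V = np.zeros((n, m + 1))
    H = np.zeros((m + 1, m))
    V[:, 0] = v / np.linalg.norm(v)
    for j in range(m):
        w = A @ V[:, j]
        for i in range(j + 1):              # orthogonalize against v_1,...,v_{j+1}
            H[i, j] = V[:, i] @ w
            w = w - H[i, j] * V[:, i]
        H[j + 1, j] = np.linalg.norm(w)     # positive subdiagonal entry
        V[:, j + 1] = w / H[j + 1, j]
    return V, H

rng = np.random.default_rng(0)
n, m = 50, 8
A = rng.standard_normal((n, n))
v = rng.standard_normal(n)
V, H = arnoldi(A, v, m)
e_m = np.eye(m)[:, m - 1]
# residual of the Arnoldi relation (2.5) is at round-off level
resid = A @ V[:, :m] - V[:, :m] @ H[:m, :m] - H[m, m - 1] * np.outer(V[:, m], e_m)
assert np.linalg.norm(resid) < 1e-10
# V_m has orthonormal columns (up to round-off)
assert np.linalg.norm(V[:, :m].T @ V[:, :m] - np.eye(m)) < 1e-10
```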

Remark 1

The numerical range \(\text {W}(A)=\{y^{\ast } A y/y^{\ast } y, 0 \not = y \in \mathbb {C}^{n}\} \) plays a role in our analysis. Note that \( \text {W}(H_{m}) \subseteq \text {W}(A) \) (see (A.1)).

Remark 2

The case \((H_{m})_{k+1,k} = 0\) occurs if \({\mathscr{K}}_{k}(A,v)\) is an invariant subspace of A, whence the Krylov approximation given in (2.9) below is exact. This exceptional case is referred to as a lucky breakdown. In general, we assume that no lucky breakdown occurs, whence the lower subdiagonal entries of Hm are real and positive, \(0 < (H_{m})_{j+1,j}\) for j = 1,…, m − 1, and \(0<h_{m+1,m}\in \mathbb {R}\).

For the special case of a Hermitian or skew-Hermitian matrix A the Arnoldi iteration simplifies to a three-term recurrence, the so-called Lanczos iteration. This case will be addressed in Remark 4 below.

Krylov subspaces in floating point arithmetic

We proceed with some results for the Arnoldi decomposition in computer arithmetic, assuming complex floating point arithmetic with a relative machine precision ε, see also [23]. For practical implementation different variants of the Arnoldi procedure exist, using different ways for the orthogonalization of the Krylov basis. These are based on classical Gram-Schmidt, modified Gram-Schmidt, the Householder algorithm, the Givens algorithm, or variants of Gram-Schmidt with reorthogonalization (see also [47, Algorithm 6.1–6.3] and others). We refer to [7] and references therein for an overview on the stability properties of these different variants.

In the sequel, the notation Vm, Hm, etc., will again be used for the result of the Arnoldi method in floating point arithmetic. We now adapt some statements formulated in the previous paragraph accordingly. By construction, Hm remains upper Hessenberg with positive lower subdiagonal entries. Assuming floating point arithmetic, we use the notation \(U_{m}\in \mathbb {C}^{n\times m}\) for a perturbation of the Arnoldi decomposition (2.5) caused by round-off, i.e.,

$$ AV_{m} = V_{m}H_{m} + h_{m+1,m} v_{m+1} e_{m}^{\ast} + U_{m}. $$
(2.6)

An upper norm bound for Um was first introduced in [43] for the Lanczos iteration in real arithmetic. For different variants of the Arnoldi or Lanczos iteration, this is discussed in [58] and others. We assume that \(\|U_{m}\|_{2}\) is bounded by a constant C1 which can depend on m and n in a moderate way and is sufficiently small in a typical setting,

$$ \|U_{m}\|_{2} \leq C_{1} \varepsilon \|A\|_{2}. $$
(2.7a)

We further assume that the normalization of the columns of Vm is accurate, in particular that the (m + 1)th basis vector \(v_{m+1}\) is normalized correctly up to round-off with a sufficiently small constant C2 (see e.g., [43, (14)]),

$$ |\|v_{m+1}\|_{2}-1|\leq C_{2} \varepsilon. $$
(2.7b)

Concerning \(V_{m+1}\), which represents an orthonormal basis in exact arithmetic, the numerical loss of orthogonality has been well studied. Loss of orthogonality can be significant (see for instance [7, 45] and others), depending on the starting vector v. Reorthogonalization schemes or orthogonalization via the Householder or Givens algorithm can be used to obtain orthogonality of \(V_{m+1}\) on a sufficiently accurate level.

The numerical range of Hm obtained in floating point arithmetic (see (2.6)) can be characterized as

$$ \text{W}(H_{m}) \subseteq U_{C_{3}\varepsilon}(\text{W}(A)), $$
(2.7c)

with \(U_{C_{3}\varepsilon }(\text {W}(A))\) being the neighborhood of W(A) in \(\mathbb {C}\) with a distance C3ε. With the assumption that Vm+ 1 is sufficiently close to orthogonal (e.g., semiorthogonal [50]), the constant C3 in (2.7c) (which also depends on C1 and problem sizes) can be shown to be moderate-sized. Further details on this aspect are given in Appendix A.

Krylov approximation of φ-functions

Let \(V_{m}\in \mathbb {C}^{n\times m}\), \(H_{m}\in \mathbb {C}^{m\times m}\) and \(\beta \in \mathbb {R}\) be the result of the Arnoldi method in floating point arithmetic for \({\mathscr{K}}_{m}(A,v)\) as described above. For a time-step \(0<t\in {\mathbb {R}}\) and p ≥ 0, the vector φp(tA)v can be approximated in the Krylov subspace \({{\mathscr{K}}}_{m}(A,v)\) by the Krylov propagator

$$ u_{p,m}(t):= V_{m} \varphi_{p}(tV_{m}^{\ast} AV_{m})V_{m}^{\ast} v = \beta V_{m} \varphi_{p}(tH_{m}) e_{1} ,~~~p\in\mathbb{N}_{0}. $$
(2.8a)

The special case p = 0 reads

$$ u_{0,m}(t) = \beta V_{m} \mathrm{e}^{tH_{m}} e_{1}. $$
(2.8b)

We remark that the small-dimensional problem \(\varphi _{p}(tH_{m})e_{1}\in \mathbb {C}^{m}\), typically with \(m \ll n\), can be evaluated cheaply by standard methods. In the sequel, we denote

$$ y_{p,m}(t)=\beta \varphi_{p}(tH_{m}) e_{1}\in\mathbb{C}^{m}, \quad \text{i.e.,} \quad u_{p,m}(t) = V_{m} y_{p,m}(t). $$
(2.9)
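The Krylov propagator (2.9) can be illustrated with a small NumPy sketch. Here the φ-functions are evaluated by a plain truncated Taylor series, which is adequate for the modest matrix norms of this example but not a recommended general-purpose method, and A is chosen symmetric negative semidefinite:

```python
# Krylov approximation (2.9) of phi_p(tA) v versus a dense reference.
import math
import numpy as np

def phi_mat(p, M, K=60):
    # phi_p(M) via truncated Taylor series: term_k = M term_{k-1}/(k+p)
    term = np.eye(M.shape[0]) / math.factorial(p)
    S = term.copy()
    for k in range(1, K):
        term = M @ term / (k + p)
        S = S + term
    return S

def arnoldi(A, v, m):
    V = np.zeros((v.shape[0], m + 1))
    H = np.zeros((m + 1, m))
    V[:, 0] = v / np.linalg.norm(v)
    for j in range(m):
        w = A @ V[:, j]
        for i in range(j + 1):
            H[i, j] = V[:, i] @ w
            w = w - H[i, j] * V[:, i]
        H[j + 1, j] = np.linalg.norm(w)
        V[:, j + 1] = w / H[j + 1, j]
    return V, H

rng = np.random.default_rng(1)
n, m, p, t = 40, 15, 1, 0.5
B = rng.standard_normal((n, n))
A = -(B @ B.T) / n                      # symmetric, mu_2(A) <= 0
v = rng.standard_normal(n)
beta = np.linalg.norm(v)
V, H = arnoldi(A, v, m)
u_m = beta * V[:, :m] @ phi_mat(p, t * H[:m, :m])[:, 0]   # propagator (2.9)
u = phi_mat(p, t * A) @ v                                  # dense reference
assert np.linalg.norm(u_m - u) <= 1e-8 * np.linalg.norm(u)
```

Note that only matrix-vector products with the large matrix A enter the Arnoldi loop; the φ-function itself is evaluated on the small m × m matrix only.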

For p = 0, the small-dimensional problem \(y_{0,m}(t) = \beta \mathrm {e}^{t H_{m}} e_{1}\) solves the differential equation

$$ {y}^{\prime}_{0,m}(t) = H_{m} {y}_{0,m}(t),~~~{y}_{0,m}(0)=\beta e_{1}. $$
(2.10)

For later use, we introduce the notation

$$ \widehat{y}_{p,m}(t) = t^{p} y_{p,m}(t), $$
(2.11a)

which for \(p\in \mathbb {N}\) and according to (2.4) satisfies the differential equation

$$ \widehat{y}^{\prime}_{p,m}(t) = H_{m} \widehat{y}_{p,m}(t) + \frac{t^{p-1}}{(p-1)!} \beta e_{1},~~~\widehat{y}_{p,m}(0)=0. $$
(2.11b)

Remark 3

Although we take rounding effects in the Arnoldi decomposition into account, we do not give a full study of round-off errors at this point. Round-off errors in substeps such as the evaluation of yp, m(t) or the matrix-vector multiplication Vmyp, m(t) will be ignored. We refer to [23] for a more general study of these effects.

Remark 4

In the special cases A = B or A = iB for a Hermitian matrix \(B\in \mathbb {C}^{n\times n}\) (with A being skew-Hermitian in the latter case) the orthogonalization of the Krylov basis of \({\mathscr{K}}_{m}(B,v)\) simplifies to a three-term recursion, the so-called Lanczos method. In the skew-Hermitian case (A = iB) the Krylov propagator (2.8a) can be evaluated by βVmφp(itHm)e1, i.e., we approximate the function \(\lambda \mapsto \varphi_{p}(\mathrm{i}t\lambda)\) in the Krylov subspace \({{\mathscr{K}}}_{m}(B,v)\). The advantage is a cheaper computation of the Krylov subspace and better conservation of geometric properties. For details we refer to the notation \(\mathrm{e}^{\sigma t B}\) as introduced in [30], with σ = ±i and a Hermitian matrix B for the skew-Hermitian case.

The error of the Krylov propagator

We denote the error of the Krylov propagator given in (2.9) by

$$ l_{p,m}(t) = \beta V_{m}\varphi_{p}(tH_{m})e_{1} - \varphi_{p}(tA)v,~~~p\in\mathbb{N}_{0}. $$
(2.12)

We are further interested in computable a posteriori estimates for the error norm, \(\zeta_{p,m}(t) \approx \|l_{p,m}(t)\|_2\), which in the best case can be proven to be upper bounds on the error norm, \(\|l_{p,m}(t)\|_2 \leq \zeta_{p,m}(t)\). Norm estimates of the error (2.12) can be used in practice to stop the Krylov iteration after k steps if \(\|l_{p,k}(t)\|_2\) satisfies (2.13) below, or to restrict the time-step t to obtain an accurate approximation and restart the method with the remaining time. For details on the total error with this restarting approach, see also [30, 49].

A prominent task is to test if the error norm per unit step is bounded by a tolerance tol,

$$ \zeta_{p,m}(t) \leq t \cdot \text{tol},~~~\text{which should entail}~~~\|l_{p,m}(t)\|_{2} \leq t \cdot \text{tol}. $$
(2.13)

If ζp, m(t) is an upper bound on the error norm, this results in a reliable stopping criterion.

3 An integral representation for the error of the Krylov propagator

We proceed with discussing the error lp, m of the Krylov propagator. To this end, we first define its scalar defect by

$$ \delta_{p,m}(t) = \beta e_{m}^{\ast} t^{p} \varphi_{p}(t H_{m}) e_{1} = t^{p} \big(y_{p,m}(t)\big)_{m}\in\mathbb{C}, $$
(3.1a)

and the defect integral by

$$ L_{p,m}(t) = \frac{h_{m+1,m}}{t^{p}}{{\int}_{0}^{t}} |\delta_{p,m}(s)| \mathrm{d} s\in\mathbb{R}. $$
(3.1b)

Theorem 1

Let \(\delta _{p,m}(t)\in \mathbb {C}\) be the defect defined in (3.1a). For \(y_{p,m}(t)\in \mathbb {C}^{m}\) defined in (2.9) and a numerical perturbation \(U_{m}\in \mathbb {C}^{n\times m}\) of the Arnoldi decomposition (see (2.6)), we have:

  1. (a)

    The error lp, m(t) of the Krylov propagator (see (2.12)) enjoys the integral representation

    $$ l_{p,m}(t) = -\frac{h_{m+1,m}}{t^{p}}{{\int}_{0}^{t}} \mathrm{e}^{(t-s)A}v_{m+1} \delta_{p,m}(s) \mathrm{d} s - \frac{1}{t^{p}}{{\int}_{0}^{t}} \mathrm{e}^{(t-s)A} U_{m} s^{p} y_{p,m}(s) \mathrm{d} s. $$
    (3.2a)
  2. (b)

    For given machine precision ε and constants C1, C2 representing round-off effects (see (2.7a),(2.7b)), and with \(\kappa _{1} = \max \limits _{s\in [0,t]}\|\mathrm {e}^{sA}\|_{2} \) and \(\kappa _{2} = \max \limits _{s\in [0,t]}\|\mathrm {e}^{sH_{m}}\|_{2} \) the error norm is bounded by

    $$ \|l_{p,m}(t)\|_{2} \leq (1+C_{2} \varepsilon ) \kappa_{1} L_{p,m}(t) + C_{1}\varepsilon \|A\|_{2} \frac{\beta \kappa_{1}\kappa_{2} t}{(p+1)!}, $$
    (3.2b)

    with the defect integral Lp, m(t) defined in (3.1b).

Proof

  1. (a)

    For the exact matrix function, we use the notation

    $$ u_{p}(t)=\varphi_{p}(tA)v, \quad \text{and} \quad w_{p}(t)=t^{p} u_{p}(t). $$

    For the Krylov propagator, we denote

    $$ u_{p,m}(t)=V_{m} y_{p,m}(t) ~~~\text{with}~~ y_{p,m}(t)=\beta \varphi_{p}(tH_{m})e_{1} $$

    (see (2.9)), and we also define

    $$ w_{p,m}(t) = t^{p} u_{p,m}(t) = V_{m} \widehat{y}_{p,m}(t), ~~~\text{with}~~ \widehat{y}_{p,m}(t)=t^{p}{y}_{p,m}(t)~ \text{defined in~(2.11a).} $$
    • For \( p \in \mathbb {N} \), the functions wp(t) and wp, m(t) satisfy the differential equations (see (2.4), (2.11b))

      $$ \begin{array}{@{}rcl@{}} &&w^{\prime}_{p,m}(t) = V_{m} \widehat{y}^{\prime}_{p,m}(t) = V_{m}\left( H_{m} \widehat{y}_{p,m}(t) + \frac{t^{p-1}}{(p-1)!} \beta e_{1}\right),\\ &&w^{\prime}_{p}(t) = A w_{p}(t) + \frac{t^{p-1}}{(p-1)!} v, \quad \text{and}~~~{w}_{p}(0)={w}_{p,m}(0)=0. \end{array} $$
      (3.3)
    • For p = 0, i.e., w0(t) = u0(t) and w0, m(t) = Vmy0, m(t), according to (2.10), we have

      $$ \begin{array}{@{}rcl@{}} &&w^{\prime}_{0}(t) = A w_{0}(t), \quad w^{\prime}_{0,m}(t) = V_{m} H_{m} {y}_{0,m}(t), \\ &&\text{and}~~~w_{0}(0)=v,~~~w_{0,m}(0)=\beta V_{m} e_{1} =v. \end{array} $$

    Local error representation in terms of the defect. We define the re-scaled error

    $$ \widehat{l}_{p,m}(t) = w_{p,m}(t) - w_{p}(t) = t^{p} l_{p,m}(t). $$
    • For \( p \in \mathbb {N} \), this satisfies

      $$ \widehat{l}^{~'}_{p,m}(t) = w^{\prime}_{p,m}(t) - w^{\prime}_{p}(t) = A \widehat{l}_{p,m}(t) + d_{p,m}(t),~~~\widehat{l}_{p,m}(0)=0, $$
      (3.4)

      with the defect of wp, m(t) with respect to the differential equation (3.3),

      $$ \begin{array}{@{}rcl@{}} d_{p,m}(t) &=& w^{\prime}_{p,m}(t) - A w_{p,m}(t) - \frac{t^{p-1}}{(p-1)!} v \\ &=& V_{m}\big(H_{m} \widehat{y}_{p,m}(t) + \frac{t^{p-1}}{(p-1)!} \beta e_{1}\big) - A V_{m} \widehat{y}_{p,m}(t) - \frac{t^{p-1}}{(p-1)!} v \\ &=& \big(V_{m} H_{m} - A V_{m} \big) \widehat{y}_{p,m}(t) + \frac{t^{p-1}}{(p-1)!}(\beta V_{m} e_{1} - v). \end{array} $$

      Together with (2.6) and using \(\beta V_{m} e_{1} = v\), the defect can be written as

      $$ d_{p,m}(t) = - h_{m+1,m} (e_{m}^{\ast} \widehat{y}_{p,m}(t)) v_{m+1} - U_{m} \widehat{y}_{p,m}(t). $$
    • For p = 0, in an analogous way, we obtain

      $$ d_{0,m}(t) = - h_{m+1,m} (e_{m}^{\ast} {y}_{0,m}(t)) v_{m+1} - U_{m} {y}_{0,m}(t). $$

    We conclude

    $$ d_{p,m}(t) = -h_{m+1,m} \delta_{p,m}(t) v_{m+1} - t^{p} U_{m} y_{p,m}(t),~~~p\in\mathbb{N}_{0}, $$
    (3.5)

    with the scalar defect defined in (3.1a). Due to (3.4), we have

    $$ \widehat{l}_{p,m}(t) = {{\int}_{0}^{t}} \mathrm{e}^{(t-s) A} d_{p,m}(s) \mathrm{d} s,~~~p\in\mathbb{N}_{0}, $$

    and for \({l}_{p,m}(t) = t^{-p} \widehat {l}_{p,m}(t)\) together with (3.5) this implies (3.2a).

  2. (b)

    With \(\kappa _{1} =\max \limits _{s\in [0,t]} \|\mathrm {e}^{sA}\|_{2}\), \(\|U_{m}\|_{2} \leq C_{1}\varepsilon \|A\|_{2}\) and \(\|v_{m+1}\|_{2} \leq 1 + C_{2}\varepsilon\), the representation (3.2a) implies the upper bound

    $$ \begin{array}{@{}rcl@{}} \|l_{p,m}(t)\|_{2} &&\leq (1+C_{2}\varepsilon) \kappa_{1} \frac{ h_{m+1,m} }{t^{p}} {{\int}_{0}^{t}} |\delta_{p,m}(s)| \mathrm{d} s\\ &&~~~~~~~~~~+ C_{1}\varepsilon \|A\|_{2} \frac{\kappa_{1}}{t^{p}} {{\int}_{0}^{t}} s^{p} \| y_{p,m}(s)\|_{2} \mathrm{d} s . \end{array} $$
    (3.6)

    With the defect integral Lp, m(t) defined in (3.1b) we obtain the first term in (3.2b). For the second integral term (with yp, m(t) = βφp(tHm)e1), we use the upper bound

    $$ {{\int}_{0}^{t}} s^{p} \| \varphi_{p}(sH_{m})e_{1}\|_{2} \mathrm{d} s \leq \max_{s\in[0,t]}\| \varphi_{p}(sH_{m})e_{1}\|_{2} \frac{t^{p+1}}{p+1}. $$
    (3.7)
    • For \(p \in \mathbb {N}\) we apply the integral representation due to (2.3) for φp(tHm)e1 to obtain the norm bound

      $$ \max_{s\in[0,t]} \|\varphi_{p}(s H_{m}) e_{1}\|_{2} \leq \frac{\max_{s\in[0,t]} \|\mathrm{e}^{s H_{m}}\|_{2}}{(p-1)!} {{\int}_{0}^{1}}\theta^{p-1} \mathrm{d}\theta = \frac{\max_{s\in[0,t]} \|\mathrm{e}^{s H_{m}}\|_{2} }{p!}. $$
      (3.8)
    • For p = 0, we obtain (3.8) in a direct way.

    Combining (3.7) with (3.8) and denoting \(\kappa _{2} = \max \limits _{s\in [0,t]}\|\mathrm {e}^{sH_{m}}\|_{2}\), we obtain

    $$ \frac{\kappa_{1}}{t^{p}} {{\int}_{0}^{t}} s^{p} \| y_{p,m}(s)\|_{2} \mathrm{d} s \leq \frac{\beta \kappa_{1}\kappa_{2} t}{(p+1)!}. $$

    Combining these estimates with (3.6), we conclude (3.2b). □

Remark 5

The error norm of the Krylov propagator scales with \(\kappa _{1} = \max \limits _{s\in [0,t]}\|\mathrm {e}^{sA}\|_{2} \) and \(\kappa _{2} = \max \limits _{s\in [0,t]}\|\mathrm {e}^{sH_{m}}\|_{2} \) in a natural way. It is well known that

$$ \begin{array}{@{}rcl@{}} \|\mathrm{e}^{tA}\|_{2} \leq &&\mathrm{e}^{t \mu_{2}(A)}~~\text{with the logarithmic norm}\\ &&\mu_{2}(A) = \max\{\text{Re}(\text{W}(A))\} = \max \{ \text{spec}(A+A^{\ast})/2\}, \end{array} $$

see for instance [24, Theorem 10.11]. Problems with μ2(A) > 0 can be arbitrarily ill-conditioned and difficult to solve with proper accuracy. (For further results on the stability of the matrix exponential see also [36, 55].) We will not further discuss problems with μ2(A) > 0 and assume μ2(A) ≤ 0. We refer to the case μ2(A) ≤ 0 as the dissipative case, with κ1 = 1.

For the dissipative case with μ2(A) ≤ 0, the error bound (3.2b) from Theorem 1 reads

$$ \|l_{p,m}(t)\|_{2} \leq (1+C_{2} \varepsilon ) L_{p,m}(t) + C_{1}\varepsilon \|A\|_{2} \frac{\beta \kappa_{2} t}{(p+1)!}. $$
(3.9)

The dissipative behavior of \(\mathrm{e}^{tA}\) carries over to the Krylov propagator up to a perturbation which depends on round-off errors, including the loss of orthogonality of Vm. In terms of the numerical range W(Hm), with \({\text {W}}(H_{m})\subseteq U_{C_{3}\varepsilon }({\text {W}}(A))\), we have μ2(Hm) ≤ μ2(A) + C3ε, for a constant C3 depending on round-off effects (see (2.7c)). Thus, μ2(Hm) ≤ C3ε and \(\kappa _{2} \leq {\mathrm {e}}^{t C_{3} \varepsilon }\).

Our aim is to construct an upper norm bound for the error per unit step (2.13) via (3.9). Let the tolerance tol be given and t be a respective time step for (2.13). Then the round-off error terms in (3.9) are negligible if

$$ C_{2} \varepsilon\ll 1,~~~\text{and}~~ C_{1} \varepsilon \|A\|_{2} \beta \mathrm{e}^{t C_{3} \varepsilon} /(p+1)! \ll \text{tol}. $$
(3.10)

Concerning the constants C1, C2 and C3, see (2.7). We recapitulate that C1 and C2 given in (2.7a) and (2.7b) can be considered small enough in a standard Krylov setting. The constant C3 can be larger in the case of a loss of orthogonality of the Krylov basis, which can however be avoided at the cost of additional computational effort. The constant C3 only appears as an exponential prefactor for the round-off term in (3.10) and is less critical compared to C1 and C2.

Taking the previous observations on the round-off errors in (3.9) into account, we consider the following upper bound to be stable in computer arithmetic for a proper value of tol, see (3.10).

Corollary 1

For the case μ2(A) ≤ 0 and with the assumption that round-off error is negligible, the error of the Krylov propagator is bounded by the defect integral Lp, m(t),

$$ \|l_{p,m}(t)\|_{2} \leq \frac{h_{m+1,m}}{t^{p}}{{\int}_{0}^{t}} |\delta_{p,m}(s)| \mathrm{d} s = L_{p,m}(t),~~~p\in\mathbb{N}_{0}. $$

Note that the defect norm |δp, m(s)| cannot be integrated exactly in general. This point will further be studied in the sequel.
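For illustration, the defect integral can be approximated by a fine quadrature and compared with the true error. The following sketch (SciPy's expm, a midpoint rule of illustrative resolution, p = 0, and a symmetric negative semidefinite A, so that μ2(A) ≤ 0) confirms the bound of Corollary 1 numerically:

```python
# Numerical illustration of Corollary 1 for p = 0:
# ||l_{0,m}(t)||_2 <= L_{0,m}(t) = h_{m+1,m} int_0^t |delta_{0,m}(s)| ds.
import numpy as np
from scipy.linalg import expm

def arnoldi(A, v, m):
    V = np.zeros((v.shape[0], m + 1))
    H = np.zeros((m + 1, m))
    V[:, 0] = v / np.linalg.norm(v)
    for j in range(m):
        w = A @ V[:, j]
        for i in range(j + 1):
            H[i, j] = V[:, i] @ w
            w = w - H[i, j] * V[:, i]
        H[j + 1, j] = np.linalg.norm(w)
        V[:, j + 1] = w / H[j + 1, j]
    return V, H

rng = np.random.default_rng(2)
n, m, t = 40, 8, 1.0
B = rng.standard_normal((n, n))
A = -(B @ B.T) / n                        # symmetric, mu_2(A) <= 0
v = rng.standard_normal(n)
beta = np.linalg.norm(v)
V, H = arnoldi(A, v, m)
Hm, h_next = H[:m, :m], H[m, m - 1]

u_m = beta * V[:, :m] @ expm(t * Hm)[:, 0]        # Krylov propagator (2.8b)
err = np.linalg.norm(u_m - expm(t * A) @ v)       # ||l_{0,m}(t)||_2

# midpoint-rule quadrature of the defect integral L_{0,m}(t), see (3.1b)
N = 2000
s = (np.arange(N) + 0.5) * t / N
delta = np.array([abs(beta * expm(si * Hm)[m - 1, 0]) for si in s])
L = h_next * delta.sum() * t / N
assert err <= L * (1 + 1e-6)                      # Corollary 1
```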

Representing the defect in terms of divided differences

Divided differences play an essential role in this work. We use the notation

$$ f[\lambda_{1},\ldots,\lambda_{m}] $$

for the divided differences of a function f over the nodes λ1,…, λm. (This is to be understood in the confluent sense for the case of multiple nodes λj, see for instance [24, Section B.16].)

Theorem 2 (see for instance [9])

Let \(H_{m}\in \mathbb {C}^{m\times m}\) be an upper Hessenberg matrix with positive secondary diagonal entries, \(0<(H_{m})_{j+1,j}\in \mathbb {R}\) for j = 1,…, m − 1, and eigenvalues λ1,…, λm. Let f be an analytic function for which f(Hm) is well defined. Then,

$$ e_{m}^{\ast} f(H_{m})e_{1} = \gamma_{m} f[\lambda_{1},\ldots,\lambda_{m}], $$

with \(\gamma _{m}={\prod }_{j=1}^{m-1} (H_{m})_{j+1,j}\).
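Theorem 2 can be checked numerically for f = exp, e.g., on a symmetric tridiagonal (Jacobi) matrix with well-separated real eigenvalues. The divided differences below are computed by the standard recursive table, which assumes distinct nodes and is adequate for this well-separated example:

```python
# Check of Theorem 2 for f = exp on a Jacobi matrix:
# e_m^* exp(H_m) e_1 = gamma_m exp[lambda_1,...,lambda_m].
import numpy as np
from scipy.linalg import expm

def divdiff(f, nodes):
    # in-place divided-difference table; returns f[x_1,...,x_n]
    d = [f(x) for x in nodes]
    n = len(nodes)
    for k in range(1, n):
        for i in range(n - 1, k - 1, -1):
            d[i] = (d[i] - d[i - 1]) / (nodes[i] - nodes[i - k])
    return d[-1]

m = 5
# symmetric tridiagonal Hessenberg matrix with positive subdiagonal
H = (np.diag([-4.0, -3.0, -2.0, -1.0, 0.0])
     + np.diag(np.ones(m - 1), -1) + np.diag(np.ones(m - 1), 1))
gamma = float(np.prod(np.diag(H, -1)))            # product of subdiagonal entries
lam = np.linalg.eigvalsh(H)                       # real, distinct eigenvalues
lhs = expm(H)[m - 1, 0]
rhs = gamma * divdiff(np.exp, list(lam))
assert abs(lhs - rhs) < 1e-8 * abs(lhs)
```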

For \(f = (\varphi_{p})_{t} : \lambda \mapsto \varphi_{p}(t\lambda)\), we will also make use of the following result.

Theorem 3 (Corollary 1 in [49])

(Expressing φ-functions via dilated \( \exp \)-functions.) For \( t \in \mathbb {R} \),

$$ t^{p} e_{m}^{\ast} \varphi_{p}(t H_{m})e_{1} = e_{m+p}^{\ast} \exp(t\widetilde{H}_{p,m}) e_{1} $$

with

$$ \widetilde{H}_{p,m} = \begin{pmatrix} H_{m} & 0_{m\times p}\\ e_{1} e_{m}^{\ast} & J_{p\times p} \end{pmatrix} \in\mathbb{C}^{(m+p)\times(m+p)}~~~\text{and}~~~ J_{p\times p} = \begin{pmatrix} 0& &&\\ 1 & 0 &&\\ & {\ddots} &\ddots& \\ && 1&0 \end{pmatrix}\in\mathbb{C}^{p\times p}. $$

The matrix \(\widetilde {H}_{p,m}\) in Theorem 3 is block triangular with eigenvalues equal to those of Hm and Jp×p. Therefore, \( \text {spec}(\widetilde {H}_{p,m}) = \{\lambda _{1},\ldots ,\lambda _{m},0,\ldots ,0\}\), with 0 as an eigenvalue of multiplicity p (at least). In our context, \(\widetilde {H}_{p,m}\) is upper Hessenberg with a positive lower secondary diagonal and \(\gamma _{m}={\prod }_{j=1}^{m-1} (H_{m})_{j+1,j} = {\prod }_{j=1}^{m+p-1} (\widetilde {H}_{p,m})_{j+1,j}\). In accordance with Theorem 2, the result of Theorem 3 holds for divided differences in a similar manner,

$$ t^{p} (\varphi_{p})_{t}[\lambda_{1},\ldots,\lambda_{m}] = \exp_{t} [ \lambda_{1},\ldots,\lambda_{m},\underbrace{0,\ldots,0}_{\text{\textit{p} times}} ]. $$
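The matrix identity of Theorem 3 can be verified numerically on a random upper Hessenberg matrix. In the sketch below (NumPy/SciPy), φp is evaluated by a truncated Taylor series, which is an illustrative choice for the modest matrix norms used here:

```python
# Check of Theorem 3: t^p e_m^* phi_p(t H_m) e_1 = e_{m+p}^* exp(t H~_{p,m}) e_1.
import math
import numpy as np
from scipy.linalg import expm

def phi_mat(p, M, K=60):
    # phi_p(M) via truncated Taylor series
    term = np.eye(M.shape[0]) / math.factorial(p)
    S = term.copy()
    for k in range(1, K):
        term = M @ term / (k + p)
        S = S + term
    return S

rng = np.random.default_rng(4)
m, p, t = 6, 2, 0.8
H = np.triu(rng.standard_normal((m, m)), -1) - np.eye(m)  # upper Hessenberg
lhs = t**p * phi_mat(p, t * H)[m - 1, 0]

Ht = np.zeros((m + p, m + p))                 # augmented matrix H~_{p,m}
Ht[:m, :m] = H
Ht[m, m - 1] = 1.0                            # block e_1 e_m^*
for j in range(1, p):
    Ht[m + j, m + j - 1] = 1.0                # subdiagonal of J_{p x p}
rhs = expm(t * Ht)[m + p - 1, 0]
assert abs(lhs - rhs) < 1e-10 * max(1.0, abs(lhs))
```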

With Theorems 2 and 3, the following equivalent formulations can be used to rewrite the scalar defect δp, m(t) defined in (3.1a).

Corollary 2

Let δp, m(t) be the scalar defect given in (3.1a) for the upper Hessenberg matrix \(H_{m}\in \mathbb {C}^{m\times m}\) with positive secondary diagonal entries. Denote \(0<\gamma _{m} = {\prod }_{j=1}^{m-1} (H_{m})_{j+1,j}\). Let \(\widetilde {H}_{p,m}\in {\mathbb {C}}^{(m+p)\times(m+p)}\) be given as in Theorem 3. For the scalar defect, we obtain the following equivalent formulations:

  1. (i)

    \(\delta _{p,m}(t) = \beta e_{m}^{\ast } t^{p} \varphi _{p}(t H_{m}) e_{1}\)

  2. (ii)

    = βγmtp(φp)t[λ1,…, λm]

  3. (iii)

    \( = \beta e_{m+p}^{\ast } \exp (t \widetilde {H}_{p,m}) e_{1}\)

  4. (iv)

    \( = \beta \gamma _{m} \exp _{t}[\lambda _{1},\ldots ,\lambda _{m},0_{p}]\), where \(0_{p}\) stands for the node 0 repeated p times.

We remark that the eigenvalues λ1,…, λm of the Krylov Hessenberg matrix Hm are also referred to as Ritz values (of A) in the literature.

4 Computable a posteriori error bounds for the Krylov propagator

The following two propositions are used for the proof of Theorem 4 below.

Proposition 1

For arbitrary nodes \(\lambda _{j}\in \mathbb {C}\) and \(p\in \mathbb {N}_{0}\),

$$ {{\int}_{0}^{t}} s^{p} (\varphi_{p})_{s}[\lambda_{1},\ldots,\lambda_{k}] \mathrm{d} s = t^{p+1} (\varphi_{p+1})_{t}[\lambda_{1},\ldots,\lambda_{k}]. $$

Proof

See Appendix B. □
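Proposition 1 can be checked numerically for a small set of real nodes. The sketch below uses scalar φ-functions via truncated series, the standard divided-difference table (distinct nodes assumed), and a midpoint rule of illustrative resolution:

```python
# Check of Proposition 1:
# int_0^t s^p (phi_p)_s[l_1,...,l_k] ds = t^{p+1} (phi_{p+1})_t[l_1,...,l_k].
import math

def phi(p, z, K=50):
    return sum(z**k / math.factorial(k + p) for k in range(K))

def divdiff(f, nodes):
    d = [f(x) for x in nodes]
    n = len(nodes)
    for k in range(1, n):
        for i in range(n - 1, k - 1, -1):
            d[i] = (d[i] - d[i - 1]) / (nodes[i] - nodes[i - k])
    return d[-1]

nodes = [-2.0, -1.0, -0.3]
p, t, N = 1, 0.9, 4000
h = t / N
integral = 0.0
for i in range(N):
    s = (i + 0.5) * h                          # midpoint rule on [0, t]
    integral += s**p * divdiff(lambda z: phi(p, s * z), nodes)
integral *= h
rhs = t**(p + 1) * divdiff(lambda z: phi(p + 1, t * z), nodes)
assert abs(integral - rhs) < 1e-6
```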

Proposition 2 (Lemma including (5.1.1) in [35])

For arbitrary nodes \(\lambda _{j}=\xi _{j}+\mathrm {i}\eta _{j} \in \mathbb {C}\),

$$ |\exp_{t}[\lambda_{1},\ldots,\lambda_{k}]| \leq \exp_{t}[\xi_{1},\ldots,\xi_{k}]. $$

Proof

See Appendix B. □
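Proposition 2 is likewise easy to test numerically for a few complex nodes (an illustrative choice of nodes; divided differences via the standard recursive table):

```python
# Check of Proposition 2: |exp_t[l_1,...,l_k]| <= exp_t[Re l_1,...,Re l_k].
import cmath
import math

def divdiff(f, nodes):
    d = [f(x) for x in nodes]
    n = len(nodes)
    for k in range(1, n):
        for i in range(n - 1, k - 1, -1):
            d[i] = (d[i] - d[i - 1]) / (nodes[i] - nodes[i - k])
    return d[-1]

t = 1.3
nodes = [-1.0 + 2.0j, -0.5 - 1.0j, -0.2 + 0.5j, 0.1 + 3.0j]
lhs = abs(divdiff(lambda z: cmath.exp(t * z), nodes))
rhs = divdiff(lambda x: math.exp(t * x), [z.real for z in nodes])
assert lhs <= rhs
```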

We now derive upper bounds for the error via its representation by the defect integral (3.1b).

Theorem 4

Let \(p\in \mathbb {N}_{0}\), μ2(A) ≤ 0, and assume that round-off errors are sufficiently small (see Corollary 1). For the eigenvalues of Hm, we write λj = ξj + iηj, j = 1,…, m. An upper bound on the error norm is given by

$$ \|l_{p,m}(t)\|_{2} \leq \beta h_{m+1,m} \gamma_{m} t (\varphi_{p+1})_{t}[\xi_{1},\ldots,\xi_{m}]. $$
(4.1)

Proof

Due to Corollary 2, (iv),

$$ \delta_{p,m}(t) = \beta \gamma_{m} \exp_{t} [\lambda_{1},\ldots,\lambda_{m},0_{p}]. $$
(4.2a)

The divided differences in (4.2a) range over the complex nodes λ1,…, λm and \( 0_{p}\), with real parts ξ1,…, ξm. Propositions 2 and 1 imply

$$ {{\int}_{0}^{t}} |\exp_{s} [\lambda_{1},\ldots,\lambda_{m},0_{p}]| \mathrm{d} s \!\leq\! {{\int}_{0}^{t}} \exp_{s}[\xi_{1},\ldots,\xi_{m},0_{p}] \mathrm{d} s = t (\varphi_{1})_{t} [\xi_{1},\ldots,\xi_{m},0_{p}]. $$
(4.2b)

From Corollary 2, we obtain

$$ t (\varphi_{1})_{t}[\xi_{1},\ldots,\xi_{m},0_{p}] = \exp_{t}[\xi_{1},\ldots,\xi_{m},0_{p+1}] = t^{p+1} (\varphi_{p+1})_{t}[\xi_{1},\ldots,\xi_{m}]. $$
(4.2c)

Equations (4.2a)–(4.2c) together with Corollary 1 imply (4.1). □

For the case of Hm having real eigenvalues, the assertion of Theorem 4 can be reformulated in the following way (see [30, Proposition 6]).

Corollary 3

Assume μ2(A) ≤ 0 and that round-off errors are sufficiently small (see Corollary 1). For the case of Hm having real eigenvalues \(\lambda _{1},\ldots ,\lambda _{m}\in \mathbb {R} \), the upper bound on the error norm in Theorem 4 yields an exact evaluation of the defect integral. Hence,

$$ \|l_{p,m}(t)\|_{2} \leq L_{p,m}(t) = \beta h_{m+1,m} t \big(e_{m}^{\ast} \varphi_{p+1}(tH_{m}) e_{1} \big). $$

As a further corollary we formulate an upper bound on the error norm which is cheaper to evaluate than the bound from Theorem 4 but may be less tight. Applying the Mean Value Theorem ([24, (B.26)] or [4, (44)]) to the divided differences in (4.1), we obtain the following result, which corresponds to [30, Theorems 1 and 2]. For the exponential of a skew-Hermitian matrix, a similar error estimate was used in [33], based on ideas of [44], but without a complete theoretical justification.

Corollary 4

Let \(p\in \mathbb {N}_{0}\), μ2(A) ≤ 0, and assume that round-off errors are sufficiently small (see Corollary 1). For the eigenvalues \(\lambda _{j}=\xi _{j}+{\mathrm {i}}\eta _{j}\in {\mathbb {C}}\) of Hm, let \({\xi }_{\max \limits } = 0 \) for \(p\in \mathbb {N}\) and \({\xi }_{\max \limits } = \max \limits _{j=1,\ldots ,m} \xi _{j} \leq 0\) for p = 0. An upper bound on the error norm is given by

$$ \|l_{p,m}(t)\|_{2} \leq \beta h_{m+1,m} \frac{ \gamma_{m} t^{m} \mathrm{e}^{t{\xi}_{\max}} }{(m+p)!} \leq \beta h_{m+1,m} \frac{ \gamma_{m} t^{m} }{(m+p)!}. $$

For the case of Hm having purely imaginary eigenvalues, the divided differences in Theorem 4 (see (4.1)) can be evaluated directly via [24, (B.27)],

$$ t (\varphi_{p+1})_{t}[0_{m}] = t^{-p} \exp_{t}[0_{m+p+1}] = \frac{ t^{m} }{(m+p)!}; $$

hence, the assertions of Theorem 4 and Corollary 4 coincide in this case.

Accuracy of the previously specified upper bounds on the error norm

In the following, we again write \(\lambda _{1},\ldots ,\lambda _{m}\in \mathbb {C}\) for the eigenvalues of Hm, with λj = ξj + iηj. For the scalar defect δp, m(t) (see (3.1a)) we recapitulate Corollary 2, in particular

$$ \delta_{p,m}(t) = \beta \gamma_{m} t^{p} (\varphi_{p})_{t}[\lambda_{1},\ldots,\lambda_{m}] = \beta \gamma_{m} \exp_{t}[\lambda_{1},\ldots,\lambda_{m},0_{p}]. $$
(4.3)

Theorem 4 and its corollaries make use of the error bound given in Corollary 1 and of computable upper bounds on the defect integral Lp, m(t). A refinement of the upper bound from Corollary 1 would require further applications of the large-dimensional matrix-vector product with \(A\in {\mathbb {C}}^{n\times n}\) and has been shown to be inefficient in terms of computational cost, see also [30, Remark 7]. The computable upper bounds on the defect integral Lp, m(t) are discussed further in what follows. We recapitulate the upper bound on the divided differences given in Proposition 2,

$$ |\exp_{t}[\lambda_{1},\ldots,\lambda_{m},0_{p}]|\leq \exp_{t}[\xi_{1},\ldots,\xi_{m},0_{p}]. $$
(4.4)

Thus, in the case of Hm having eigenvalues with sufficiently small imaginary parts, the upper bound in Proposition 2 is tight. In the following proposition, this statement is made more precise.

Proposition 3 (Part of a proof in [35, (5.2.3)])

For nodes \(\lambda _{j}=\xi _{j}+\mathrm {i}\eta _{j} \in \mathbb {C}\) and t ≥ 0 with \(\max \limits _{j} t|\eta _{j}| \leq \widetilde {\eta }_{t} < \pi /2\),

$$ 0<\cos(\widetilde{\eta}_{t}) \exp_{t}[\xi_{1},\ldots,\xi_{k}] \leq |\exp_{t}[\lambda_{1},\ldots,\lambda_{k}]|. $$

Proof

See Appendix B. □

Under the assumptions of Proposition 3, we conclude

$$ 0<\cos(\widetilde{\eta}_{t}) \exp_{t}[\xi_{1},\ldots,\xi_{m},0_{p}] \leq |\exp_{t}[\lambda_{1},\ldots,\lambda_{m},0_{p}]|. $$
(4.5)

With (4.3), (4.4), (4.5), and following the proof of Theorem 4, the defect integral in (3.1b) can be enclosed by

$$ \begin{array}{@{}rcl@{}} &&0<\cos(\widetilde{\eta}_{t}) \cdot \beta \gamma_{m} h_{m+1,m} t (\varphi_{p+1})_{t}[\xi_{1},\ldots,\xi_{m}]\\ &&~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~\leq L_{p,m}(t) \leq \beta \gamma_{m} h_{m+1,m} t (\varphi_{p+1})_{t}[\xi_{1},\ldots,\xi_{m}]. \end{array} $$
(4.6)

Hence,

$$ L_{p,m}(t) = \big(1 - \mathscr{O}(|t\eta|^{2}) \big) \beta \gamma_{m} h_{m+1,m} t (\varphi_{p+1})_{t}[\xi_{1},\ldots,\xi_{m}], $$
(4.7)

using the notation \({\mathscr{O}}(|t\eta |^{2})\) with \(|t\eta |= \max \limits _{j} t|\eta _{j}|\), for t|ηj|→ 0. By Proposition 3, the choice of \(\widetilde {\eta }_{t}\) is independent of ξ1,…, ξm, and this carries over to the constant in (4.7).

Summarizing, we see that the defect integral can be computed exactly for the case of Hm having real eigenvalues (Corollary 3), and a computable upper bound can be given which is tight for the case of Hm having eigenvalues sufficiently close to the real axis (Theorem 4 and (4.7)).

The approach underlying Theorem 4 does not enable us to specify the asymptotic constant in (4.7). Therefore, we use the asymptotic expansion of the divided differences, \(|\exp _{t}[\lambda _{1},\ldots ,\lambda _{m},0_{p}]|\) in (4.3), derived in Appendix C, to discuss the asymptotic behavior of the defect norm |δp, m(t)| for t → 0. Theorem 5 from Appendix C implies

$$ \begin{array}{@{}rcl@{}} &&| \exp_{t}[\lambda_{1},\ldots,\lambda_{m},0_{p}] | = \frac{t^{m+p-1}}{(m+p-1)!} \exp\big(\rho_{1} t + \rho_{2} t^{2}/2 + \mathscr{O}(t^{3}) \big),\\ &&\text{with}~~~ \rho_{1} = \text{avg}_{p}(\xi) ~~~\text{and}~~~ \rho_{2} = \frac{\text{var}_{p}(\xi) - \text{var}_{p}(\eta)}{m+p+1}. \end{array} $$
(4.8)

Here, the asymptotics holds for t → 0, \( \text {avg}_{p}(\xi ) = {\sum }_{j=1}^{m} \xi _{j} /(m+p)\) is the average, and \( \text {var}_{p}(\xi ) = \big ({\sum }_{j=1}^{m} (\xi _{j} - \text {avg}_{p}(\xi ))^{2} + p\ \text {avg}_{p}(\xi )^{2} \big )/ (m+p) \) is the variance of the sequence {ξ1,…, ξm,0p} and varp(η) is the variance of the sequence {η1,…, ηm,0p}.

Remark 6

For Hm with purely imaginary eigenvalues (\(\lambda _{j}\in \mathrm {i}\mathbb {R}\)), e.g., in the skew-Hermitian case, the following asymptotic expansion for the defect is obtained from (4.8):

$$ |\delta_{p,m}(t)| =\beta \gamma_{m} \frac{t^{m+p-1}}{(m+p-1)!} \exp\Big(-\frac{\text{var}_{p}(\eta)}{2(m+p+1)}t^{2} + \mathscr{O}(t^{3}) \Big) ~~~\text{for}~~ t\to 0. $$
(4.9)

We use the expansion from (4.8) for \(| \exp _{t}[\lambda _{1},\ldots ,\lambda _{m},0_{p}] |\) and \( \exp _{t}[\xi _{1},\ldots ,\xi _{m},0_{p}] \) to obtain

$$ |\delta_{p,m}(t)| = \exp\Big(-\frac{\text{var}_{p}(\eta)}{2(m+p+1)}t^{2} +\mathscr{O}(t^{3})\Big) \cdot \beta \gamma_{m} t^{p} (\varphi_{p})_{t}[\xi_{1},\ldots,\xi_{m}]. $$
(4.10)

Termwise integration of (4.10), together with the proper prefactor, gives an asymptotic expansion for the defect integral Lp, m(t), similar to (4.7),

$$ L_{p,m}(t) = \Big(1-\frac{\text{var}_{p}(\eta) (m+p)t^{2}}{2(m + p + 1)(m + p + 2)} +\mathscr{O}(t^{3}) \Big) \cdot \beta h_{m+1,m}\gamma_{m} t(\varphi_{p+1})_{t}[\xi_{1},\ldots,\xi_{m}]. $$
(4.11)

Omitting further details, we state that (4.11) is to be understood in an asymptotic sense with a remainder of \({\mathscr{O}}(t^{3}|\xi ||\eta |^{2}+t^{4}|\eta |^{4})\). In contrast to (4.7), this remainder depends on ξ, but (4.11) reveals further constants which can be relevant for practical applications.

Remark 7

With (4.11) we obtain a computable estimate for the relative deviation of the defect integral from the upper bound in (4.6). The criterion

$$ \text{ac.est.}1(t):=\frac{\text{var}_{p}(\eta) (m+p)t^{2}}{2(m+p+1)(m+p+2)} > 0.1, $$

can indicate that a tighter estimate of the defect integral could improve the accuracy of the error bound given in Theorem 4. A possible choice is to use quadrature estimates of the defect integral, see Section 4.1 below.

A similar criterion can be given for the accuracy of the upper bound,

$$ L_{p,m}(t) \leq \beta h_{m+1,m}\gamma_{m} \frac{t^{m}}{(m+p)!}, $$
(4.12)

which appears in Corollary 4 (with \(\xi _{\max \limits }=0\)) and [30, Theorem 1 and 2]. With (4.8), and ρ1 and ρ2 given therein, the defect integral can be written as

$$ L_{p,m}(t) = \beta h_{m+1,m}\gamma_{m} \frac{t^{m}}{(m+p)!} \Big(1+\rho_{1}\frac{(m+p)t}{m+p+1} + ({\rho_{1}^{2}}+\rho_{2})\frac{(m+p)t^{2}}{2(m+p+2)} +\mathscr{O}(t^{3})\Big) $$
(4.13)

for t → 0. In contrast to the error bound in Corollary 4, the formulas for ρ1 and ρ2 in (4.8) require the evaluation of the eigenvalues of Hm. The following proposition gives formulas for ρ1 and ρ2 which do not require computing the eigenvalues of Hm and can be evaluated on the fly.

Proposition 4 (Evaluation of ρ1 and ρ2 in terms of entries of Hm)

The coefficients ρ1 and ρ2 in (4.8) can be rewritten as

$$ \begin{array}{@{}rcl@{}} &&\rho_{1} = \frac{\text{Re}(S_{1})}{m+p},~~~\rho_{2}= \frac{\text{Im}(S_{1})^{2}-\text{Re}(S_{1})^{2}}{(m+p)^{2}}+ \frac{\text{Re}({S_{1}^{2}}+S_{2}) }{(m+p)(m+p+1)},~~~\text{with}\\ &&S_{1}=\sum\limits_{j=1}^{m} (H_{m})_{j,j} ~~~\text{and}~~ S_{2}=\sum\limits_{j=1}^{m} (H_{m})^{2}_{j,j} + 2\sum\limits_{j=1}^{m-1} (H_{m})_{j+1,j}(H_{m})_{j,j+1}. \end{array} $$

Proof

For the coefficients ρ1 and ρ2 we use (C.17) with \( m \leftarrow m + p \) and S1 and S2 from (C.3). For the nodes λ1,…, λm,0p (with λ1,…, λm eigenvalues of Hm) we obtain

$$ \begin{array}{@{}rcl@{}} &&{}S_{1} = \sum\limits_{j=1}^{m} \lambda_{j} = \text{Trace}(H_{m}) = \sum\limits_{j=1}^{m} (H_{m})_{j,j}~~~\text{and}\\ &&{}S_{2} = \sum\limits_{j=1}^{m} {\lambda_{j}^{2}} = \text{Trace}({H_{m}^{2}}) = \sum\limits_{j=1}^{m} (H_{m})^{2}_{j,j} + 2\sum\limits_{j=1}^{m-1} (H_{m})_{j+1,j}(H_{m})_{j,j+1}. \end{array} $$
(4.14)

The identity for \(\text {Trace}({H_{m}^{2}})\) in (4.14) holds true due to the upper Hessenberg structure of Hm. □

Following the proof of Theorem 5 we observe that the case ρ1 = 0 is possible, but it then entails ρ2 ≠ 0.
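Proposition 4 is easy to exercise numerically: the trace-based formulas can be cross-checked against the eigenvalue-based definitions of ρ1 and ρ2 in (4.8). The following sketch (our own helper names; a random complex upper Hessenberg matrix stands in for Hm) performs this comparison.

```python
import numpy as np

def rho_coeffs(Hm, p):
    """rho_1 and rho_2 of (4.8) from the entries of the upper Hessenberg Hm
    (Proposition 4), avoiding an eigenvalue computation."""
    M = Hm.shape[0] + p
    d = np.diag(Hm)
    S1 = d.sum()                                                        # Trace(Hm)
    S2 = (d**2).sum() + 2.0 * (np.diag(Hm, -1) * np.diag(Hm, 1)).sum()  # Trace(Hm^2)
    rho1 = S1.real / M
    rho2 = (S1.imag**2 - S1.real**2) / M**2 + (S1**2 + S2).real / (M * (M + 1))
    return rho1, rho2

# cross-check against the eigenvalue-based formulas in (4.8)
rng = np.random.default_rng(1)
m, p = 6, 2
Hm = np.triu(rng.standard_normal((m, m)) + 1j * rng.standard_normal((m, m)), -1)
lam = np.linalg.eigvals(Hm)
xi = np.concatenate([lam.real, np.zeros(p)])    # sequence {xi_1,...,xi_m, 0_p}
eta = np.concatenate([lam.imag, np.zeros(p)])
rho1_eig, rho2_eig = xi.mean(), (xi.var() - eta.var()) / (m + p + 1)
rho1, rho2 = rho_coeffs(Hm, p)
```

Here `np.var` is the population variance over the m + p nodes, matching the definition of varp above.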

Remark 8

With (4.13) and Proposition 4 we obtain a computable estimate for the relative deviation of the defect integral from the upper bound in (4.12). The criterion

$$ \text{ac.est.}2(t):=\Big| \rho_{1}\frac{(m+p)t}{m+p+1} + ({\rho_{1}^{2}}+\rho_{2})\frac{(m+p)t^{2}}{2(m+p+2)} \Big| > 0.1 $$

can indicate that a tighter estimate of the defect integral could improve the accuracy of the error bound given in Corollary 4. We refer to the error bound in Theorem 4 in case the eigenvalues of Hm have a significant real part (which can be detected via ρ1).

4.1 Quadrature-based error estimates

First we recapitulate some prior results. In the dissipative case, the integral formulation of the error from Theorem 1 can be bounded by the defect integral via Corollary 1, up to round-off. We conclude that the defect integral can be computed exactly in the case of Hm having real eigenvalues (Corollary 3), and a computable upper bound exists which is tight in the case of Hm having eigenvalues sufficiently close to the real axis (Theorem 4 and (4.6)).

For the case of Hm having eigenvalues with a significant imaginary part, tight estimates are more difficult to obtain. It can be favorable to approximate the defect integral (3.1b) by quadrature to obtain an error estimate via Corollary 1. The aim of using quadrature is to obtain an error estimate which is tighter compared to previous upper norm bounds on the error. In contrast to the proven upper error bounds given in Theorem 4, Corollary 3, and Corollary 4, the following quadrature estimates do not result in upper error bounds in general. However, in many practical cases, such quadrature estimates turn out to be still reliable.

Here, some remarks on the defect are in order to explain some subtleties of quadrature estimates for the defect integral Lp, m(t). We discuss a test problem with a skew-Hermitian matrix \(A\in \mathbb {C}^{n\times n}\). Following Remark 4 we choose A = iB with a Hermitian matrix B, in particular \(B=\text {tridiag}(-1,2,-1)\in {\mathbb {R}}^{n\times n}\) with n = 1000. The matrix B is related to a finite difference discretization of the one-dimensional negative Laplacian, and A corresponds to a free Schrödinger-type problem. The eigenvalues σj, for j = 1,…, n, of B are given by

$$ \sigma_{j}=4\sin(j\pi/(2(n+1)))^{2}, ~~\text{with respective eigenvectors}~ \psi_{j}\in\mathbb{R}^{n}. $$
(4.15)

Here, μ2(A) = 0, and the conditions of Corollary 1 hold. For a given starting vector \(v\in \mathbb {C}^{n}\) the time propagation for the discretized free Schrödinger equation is given by \(\exp (tA)v\) and can be approximated by the Krylov propagator with p = 0. The following different cases for the starting vector v will be discussed.

  (a) Choose a random starting vector \(v\in \mathbb {R}^{n}\).

  (b) Start close to a linear combination of eigenvectors, \(v = 10^{6} {\sum }_{j=1}^{25} \psi _{j} + {\sum }_{j=26}^{n} \psi _{j}\) for eigenvectors ψj of the discretized negative Laplacian, see (4.15).

  (c) Start close to a linear combination of eigenvectors which are more spread over the spectrum, \(v = 10^{5} {\sum }_{j=1}^{20} \psi _{j} + 10^{5} {\sum }_{j=n-19}^{n} \psi _{j} \) for eigenvectors ψj of the discretized negative Laplacian, see (4.15).

In addition to the settings (a)–(c), we normalize v so that ∥v∥2 = 1. The defect δp, m(t) for p = 0 is computed in Matlab, using expm to evaluate the matrix exponential of Hm and divided differences for a fixed Krylov dimension m = 20.
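The spectral data in (4.15) can be reproduced with a few lines of NumPy. The sketch below is our own illustration (with a smaller n than in the text, to keep the dense eigenvalue computation cheap):

```python
import numpy as np

n = 300   # smaller than the n = 1000 of the text, for a cheap dense check
# 1D discrete negative Laplacian B = tridiag(-1, 2, -1); the test matrix is A = iB
B = (2.0 * np.eye(n)
     - np.diag(np.ones(n - 1), 1)
     - np.diag(np.ones(n - 1), -1))
j = np.arange(1, n + 1)
sigma = 4.0 * np.sin(j * np.pi / (2.0 * (n + 1)))**2   # eigenvalues per (4.15)
err = np.max(np.abs(sigma - np.linalg.eigvalsh(B)))    # eigvalsh is ascending, as is sigma
```

The maximal deviation `err` is at the level of round-off.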

In Fig. 1 we observe \(|\delta _{p,m}(t)|={\mathscr{O}}(t^{m-1})\) (for t → 0) up to t ≈ 10 for cases (a)–(c). The values of |δp, m(t)| in this time regime vary strongly among these cases. We further remark that in case (b), for t ≥ 40 the defect |δp, m(t)| behaves similarly to the divided differences of the exponential over the first eigenvalues \(\lambda _{1}^{(b)},\ldots ,\lambda _{4}^{(b)}\) of Hm with a proper prefactor. This behavior occurs if eigenvalues of Hm are clustered, in this case \(\lambda _{1}^{(b)},\ldots ,\lambda _{4}^{(b)}\approx 0\), and will be discussed further below, see Fig. 2. In case (c) the eigenvalues of Hm are clustered at ≈ 0 and ≈ 4. Also in this case, there is a time regime in which the defect behaves similarly to a lower-order function in t with some additional oscillations (this may be explained by the existence of different eigenvalue clusters of the same size).

Fig. 1
figure 1

The defect norm |δp, m(t)| (p = 0, m = 20) for the free Schrödinger example with different choices of starting vector case (a) (×), case (b) (∘) and case (c) (\(\Box \)). The table on the right-hand side shows eigenvalues \(\lambda ^{(\ast )}_{1},\ldots ,\lambda ^{(\ast )}_{m}\) of Hm for the different starting vectors, case (a)–(c). For the case (b), the divided differences over the clustered eigenvalues \( \gamma _{m} \big ({\prod }_{j=5}^{20} \lambda _{j}^{(b)}\big )^{-1} |\exp _{t}[\mathrm {i} \lambda ^{(b)}_{1},\ldots ,\mathrm {i} \lambda ^{(b)}_{4}]| \) is illustrated by (+ ). The asymptotic expansion of the divided differences for t → 0 given in (4.9) is illustrated using dashed lines. The dash-dotted line is \({{\mathscr{O}}}(t^{6})\)

Fig. 2
figure 2

The divided differences \(| \exp _{t}[\mathrm {i} a_{1},\mathrm {i} a_{2},\mathrm {i} a_{3}] |\) (∘) and \(| \exp _{t}[\mathrm {i} a_{1},\mathrm {i} a_{2}] |/|a_{3}-a_{1}| \) (+ ) for the choice of a1, a2, a3 given in the text. The asymptotic expansion of the divided differences for t → 0 given in (4.9) is illustrated using dashed lines

As a conclusion from the example illustrated in Fig. 1, we observe that quadrature of the defect can be relevant up to a time t for which the quadrature-based estimate of ∥lp, m(t)∥2 (via the defect integral) equals a given tolerance, see (2.13). This regime of t depends on the choice of tol and on additional factors such as β, hm+1, m, etc., which appear in the error bound from Corollary 1. Depending on the parameters and the starting vector v, the defect can be highly oscillatory for relevant times t, and then a quadrature estimate of the defect integral can be difficult to obtain. Such effects seem to be relevant for special choices of starting vectors v, for example cases (b) and (c). The effect of Hm having clustered eigenvalues, and the prefactor used in Fig. 1 (+), are explained in the following model problem, see Fig. 2.

Divided differences with clustered nodes: an example

Choose m = 3 with nodes a1 = 1.123, a2 = 1.231, a3 = 5.43. With this choice we obtain a cluster of nodes, a1 ≈ a2. For the given example, we obtain \( | \exp _{t}[\mathrm {i} a_{2},{\mathrm {i}} a_{3}] | \ll | \exp _{t}[{\mathrm {i}} a_{1},{\mathrm {i}} a_{2}] | \) for t large enough; hence, using the recursive definition of the divided differences (see, e.g., [24, (B.24)]), we obtain

$$ | \exp_{t}[\mathrm{i} a_{1},\mathrm{i} a_{2},\mathrm{i} a_{3}] | = \Big| \frac{ \exp_{t}[\mathrm{i} a_{2},\mathrm{i} a_{3}] - \exp_{t}[\mathrm{i} a_{1},\mathrm{i} a_{2}] }{a_{3} - a_{1}}\Big| \!\approx\! \Big| \frac{ \exp_{t}[\mathrm{i} a_{1},\mathrm{i} a_{2}] }{a_{3} - a_{1}}\Big|,~~\text{for larger \textit{t}.} $$

This example is illustrated in Fig. 2. This behavior can be generalized for a larger number of nodes and is also observed in Fig. 1.
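The clustering effect is easy to reproduce directly from the recursion. A small sketch of ours, with the node values from the text:

```python
import numpy as np

def dd2(t, a, b):
    """First divided difference of s -> exp(i*t*s) over real nodes a, b."""
    return (np.exp(1j * t * b) - np.exp(1j * t * a)) / (b - a)

def dd3(t, a, b, c):
    """Second divided difference via the standard recursion."""
    return (dd2(t, b, c) - dd2(t, a, b)) / (c - a)

a1, a2, a3 = 1.123, 1.231, 5.43       # a1 and a2 form a cluster
t_vals = (50.0, 200.0)                # "t large enough"
exact = {t: abs(dd3(t, a1, a2, a3)) for t in t_vals}
approx = {t: abs(dd2(t, a1, a2)) / abs(a3 - a1) for t in t_vals}
```

For these values of t the lower-order surrogate `approx` deviates from `exact` by only a few percent.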

Quadrature estimates for the defect integral

With the previous observations on the defect we now discuss different quadrature-based estimates.

The generalized residual estimate, which was introduced in [28] and appeared in a similar manner in [5, 11, 34, 46], conforms to a quadrature on the defect norm integral which is related to the error norm via Corollary 1.

Remark 9 (Generalized residual estimate, see also [28])

Applying the right-endpoint rectangle rule we have

$$ {{\int}_{0}^{t}} |\delta_{p,m}(s)| \mathrm{d} s \approx t|\delta_{p,m}(t)|, $$

and with Corollary 1 (and δp, m(t) given in (3.1a)) we obtain the error estimate

$$ \|l_{p,m}(t)\|_{2} \approx h_{m+1,m} t^{1-p} |\delta_{p,m}(t)| = \beta h_{m+1,m} t |e_{m}^{\ast} \varphi_{p}(t H_{m}) e_{1}| . $$

Assume that \(\max \limits _{s\in [0,t]} |\delta _{p,m}(s)|=|\delta _{p,m}(t)|\), which holds, e.g., if |δp, m(t)| is monotonically increasing in t. Then,

$$ {{\int}_{0}^{t}} |\delta_{p,m}(s)| \mathrm{d} s \leq t \max_{s\in[0,t]} |\delta_{p,m}(s)| = t |\delta_{p,m}(t)|. $$

In this case, the generalized residual estimate from Remark 9 results in an upper bound on the error norm.

In the most general case, the defect is of high order for t → 0, also in a relevant time regime, see Fig. 1, case (a), and the previous remarks. Then the right-endpoint quadrature still results in an upper bound, but this bound is not tight. In this case, we can improve the estimate by a prefactor depending on the effective order defined in Appendix C. If the defect is sufficiently smooth in the relevant time regime, this results in a tight upper bound on the error norm.

Remark 10 (Effective order estimate, see also [30])

Denote \(f(t) = | \exp _{t}[\lambda _{1},\ldots ,\lambda _{m},0_{p}] |\) for the time-dependent part of the defect with eigenvalues λ1,…, λm of Hm. Assume f(t) > 0 for a sufficiently small time regime t > 0. We consider the effective order ρ(t) defined in (C.4a). With the following estimate for the integral of the defect,

$$ {{\int}_{0}^{t}} |\delta_{p,m}(s)| \mathrm{d} s \approx \frac{t}{\rho(t)+1}|\delta_{p,m}(t)|, $$

and from Corollary 1 (with δp, m(t) given in (3.1a)), we obtain

$$ \|l_{p,m}(t)\|_{2} \approx h_{m+1,m} \frac{ t^{1-p} }{ \rho(t)+1 } |\delta_{p,m}(t)| = \beta h_{m+1,m} \frac{ t }{ \rho(t)+1 } |e_{m}^{\ast} \varphi_{p}(t H_{m}) e_{1}| . $$

In [30], the effective order is defined for \(|e_{m}^{\ast } \mathrm {e}^{tH_{m}} e_{1}|\) (p = 0) which is equivalent to the definition via the divided differences of f(t). (This follows from Corollary 2 and the definition of the effective order which is independent of a constant prefactor.)

Some of the following observations already appeared in [30]. The quadrature scheme in Remark 10 is motivated by the following relation between the effective order and the integral of the divided differences f(t). From (C.4a),

$$ f(t)=\frac{f^{\prime}(t) t}{\rho(t)}. $$

Integration and application of the mean value theorem show the existence of \(t^{\ast }\in [0, t]\) with

$$ {{\int}_{0}^{t}} f(s) \mathrm{d} s= \frac{1}{\rho(t^{\ast})} {{\int}_{0}^{t}} f^{\prime}(s) s \mathrm{d} s, $$

and integration by parts gives

$$ {{\int}_{0}^{t}} f(s) \mathrm{d} s = \frac{t f(t)}{1+\rho(t^{\ast})}. $$
(4.16)

This result carries over to the integral of the defect.

Assume the effective order is monotonically decreasing for t small enough, with \(\min \limits _{s\in (0,t]}\rho (s) = \rho (t) \geq 0\). This holds in an asymptotic regime for the dissipative case up to round-off, see also Theorem 5 with the real parts ξ1,…, ξm of the eigenvalues of Hm being non-positive. With (4.16) and the assumption 0 ≤ ρ(t) ≤ ρ(s) ≤ m + p − 1 = ρ(0+) for s ∈ [0, t], we enclose the integral of the defect by

$$ \frac{t}{m+p} |\delta_{p,m}(t)| \leq {{\int}_{0}^{t}} |\delta_{p,m}(s)| \mathrm{d} s \leq \frac{t}{\rho(t)+1} |\delta_{p,m}(t)| \leq t |\delta_{p,m}(t)|. $$
(4.17)

Combining (4.17) and Corollary 1, we obtain the upper bound

$$ \|l_{p,m}(t)\|_{2}\leq \frac{ h_{m+1,m} t^{1-p} }{\rho(t)+1}\cdot|\delta_{p,m}(t)| \leq h_{m+1,m} t^{1-p} \cdot|\delta_{p,m}(t)|. $$

A computable expression for the effective order was given in [30, (6.10)]. This result can be generalized to the case \(p\in \mathbb {N}_{0}\),

$$ \begin{array}{@{}rcl@{}} \rho(t) = \left\{ \begin{array}{ll} t \text{Re}\big((H_{m})_{m,m} + (H_{m})_{m,m-1} (y_{p,m}(t))_{m-1}/(y_{p,m}(t))_{m} \big)~~&\text{for}~ p=0,~~\text{and}\\ \text{Re}((y_{p-1,m}(t))_{m}/(y_{p,m}(t))_{m} )~~&\text{for}~ p\in\mathbb{N}, \end{array}\right. \end{array} $$

with \(y_{p,m}(t)\in \mathbb {C}^{m}\) from (2.9). The expression for the case \(p\in \mathbb {N}\) can be obtained by applying [30, (6.10)] to the representation \(|e_{m+p}^{\ast } \mathrm {e}^{t\widetilde {H}_{m}} e_{1}|\) of the defect (Corollary 2 (iii)) and making use of the special structure of \(\widetilde {H}_{m}\), namely \(\beta e_{m+p}^{\ast } \mathrm {e}^{t\widetilde {H}_{m}} e_{1} = t^{p} (y_{p,m}(t))_{m}\) (see Corollary 2) and \(\beta e_{m+p-1}^{\ast } \mathrm {e}^{t\widetilde {H}_{m}} e_{1} = t^{p-1} (y_{p-1,m}(t))_{m}\) (see [49, Corollary 1]).
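For p = 0 the computable expression can be validated against a finite-difference approximation of ρ(t) = t f′(t)/f(t): the last row of the upper Hessenberg Hm has only the entries (m, m−1) and (m, m), which is what makes the formula cheap. A small sketch of ours (with β = 1 and a random positive Hessenberg matrix, so that the relevant entries stay positive; the identity itself is algebraic):

```python
import numpy as np
from scipy.linalg import expm

rng = np.random.default_rng(2)
m = 8
Hm = np.triu(rng.uniform(0.5, 1.5, (m, m)), -1)    # upper Hessenberg, positive entries
e1 = np.zeros(m); e1[0] = 1.0

def f(t):
    # time-dependent part of the defect for p = 0: |e_m^* exp(t*Hm) e_1| (beta = 1)
    return abs((expm(t * Hm) @ e1)[-1])

t, h = 0.9, 1e-5
y = expm(t * Hm) @ e1                               # y_{0,m}(t)
rho_formula = t * (Hm[-1, -1] + Hm[-1, -2] * y[-2] / y[-1]).real
rho_fd = t * (np.log(f(t + h)) - np.log(f(t - h))) / (2.0 * h)
```

The two values of the effective order agree up to the finite-difference error.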

As illustrated in Fig. 1 the defect can be highly oscillatory in a relevant time regime, especially for specific starting vectors, and in this case the quadrature estimates should be handled with care.

4.2 A stopping criterion for the lucky breakdown

The special case hk+1, k = 0 during the construction of the Krylov subspace is referred to as a lucky breakdown: a breakdown of the Arnoldi or Lanczos iteration with the benefit of an exact approximation of φp(tA)v for any t > 0 via the Krylov subspace \({\mathscr{K}}_{k}(A,v)\). In floating point arithmetic, a lucky breakdown manifests itself as hk+1, k ≈ 0 and can lead to stability issues if the Arnoldi or Lanczos method is not stopped properly. The condition that the Krylov propagator is exact is not exactly determinable in floating point arithmetic, but it can be weakened to the error condition in (2.13) for a given tolerance tol per unit step. With this approach, we introduce a stopping criterion which can be applied on the fly to detect a lucky breakdown and which satisfies an error bound. It does not depend on any a priori information, as long as the tolerance tol is chosen properly so that round-off errors can be neglected; see the remarks before Corollary 1.

Proposition 5

Let μ2(A) ≤ 0 and assume that round-off errors are sufficiently small, see Corollary 1. Let tol be a given tolerance and

$$ \frac{\beta h_{k+1,k}}{(p+1)!}\leq \text{tol} $$
(4.18)

be satisfied at the k-th step of the Arnoldi or Lanczos iteration. Then the iteration can be stopped, and the Krylov subspace \({\mathscr{K}}_{k}(A,v)\) can be used to approximate the vector φp(tA)v with an error satisfying ∥lp, k(t)∥2 ≤ t ⋅tol, i.e., an error per unit step bounded by tol.

Proof

We use the upper bound on the error norm from Corollary 1,

$$ \|l_{p,k}(t)\|_{2} \leq \frac{ h_{k+1,k}}{t^{p}}{{\int}_{0}^{t}} |\delta_{p,k}(s)| \mathrm{d} s. $$
(4.19)

To obtain a uniform bound on the defect integral, we use

$$ |\delta_{p,k}(t)| \leq \beta t^{p} \|e_{k}\|_{2} \|\varphi_{p}(t H_{k}) e_{1}\|_{2} = \beta t^{p} \|\varphi_{p}(t H_{k}) e_{1}\|_{2}. $$
(4.20)
  • For p > 0, we apply the integral representation (2.3) to φp(tHk)e1 to obtain the upper bound

    $$ \|\varphi_{p}(t H_{k}) e_{1}\|_{2} \leq \frac{\max_{s\in[0,t]} \|\mathrm{e}^{s H_{k}}\|_{2}}{(p-1)!} {{\int}_{0}^{1}}\theta^{p-1} \mathrm{d}\theta = \frac{\max_{s\in[0,t]} \|\mathrm{e}^{s H_{k}}\|_{2} }{p!}. $$
    (4.21)
  • For p = 0, the analogous bound \(\|\mathrm{e}^{t H_{k}} e_{1}\|_{2} \leq \max_{s\in[0,t]} \|\mathrm{e}^{s H_{k}}\|_{2}\) holds directly.

Combining (4.20) and (4.21) with \(\|\mathrm {e}^{sH_{k}}\|_{2} \leq \mathrm {e}^{s\mu _{2}(H_{k})} \leq {\mathrm {e}}^{s\mu _{2}(A)} \leq 1\) (up to round-off, using μ2(A) ≤ 0) gives

    $$ |\delta_{p,k}(t)| \leq \beta \frac{t^{p} }{p!},~~~\text{and}~~{{\int}_{0}^{t}} |\delta_{p,k}(s)| \mathrm{d} s \leq \beta \frac{t^{p+1} }{(p+1)!}. $$

Together with (4.19) and (4.18), we conclude ∥lp, k(t)∥2 ≤ t ⋅tol. □
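A minimal dense Arnoldi sketch with the stopping test (4.18) might look as follows (our own code and naming, not a reference implementation). The example starts with a vector supported on a 5-dimensional invariant subspace of a diagonal matrix, so an (approximate) lucky breakdown must occur at k = 5:

```python
import math
import numpy as np

def arnoldi_lucky_stop(A, v, p, tol, m_max):
    """Arnoldi iteration that stops as soon as criterion (4.18),
    beta*h_{k+1,k}/(p+1)! <= tol, detects a (near) lucky breakdown.
    Returns the orthonormal basis V_k, the Hessenberg matrix H_k, and k."""
    n = len(v)
    beta = np.linalg.norm(v)
    V = np.zeros((n, m_max + 1))
    H = np.zeros((m_max + 1, m_max))
    V[:, 0] = v / beta
    fact = math.factorial(p + 1)
    for k in range(m_max):
        w = A @ V[:, k]
        for j in range(k + 1):                  # modified Gram-Schmidt
            H[j, k] = V[:, j] @ w
            w = w - H[j, k] * V[:, j]
        H[k + 1, k] = np.linalg.norm(w)
        if beta * H[k + 1, k] / fact <= tol:    # stopping criterion (4.18)
            return V[:, :k + 1], H[:k + 1, :k + 1], k + 1
        V[:, k + 1] = w / H[k + 1, k]
    return V[:, :m_max], H[:m_max, :m_max], m_max

n = 50
A = np.diag(-np.arange(1.0, n + 1))             # diagonal, mu_2(A) <= 0
v = np.zeros(n)
v[:5] = np.random.default_rng(3).uniform(1.0, 2.0, 5)  # 5-dim invariant subspace
V, H, k = arnoldi_lucky_stop(A, v, p=0, tol=1e-12, m_max=20)
```

The iteration stops at k = 5, where h6,5 vanishes up to round-off.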

5 Numerical experiments

The notation for the error lp, m(t), the estimate of the error norm ζp, m(t), and the tolerance tol was introduced in (2.12) and (2.13). The notation ζp, m will be used for the different choices of error estimates discussed in the previous section. Theorem 4 and Corollary 4 result in upper bounds on the error norm, ∥lp, m(t)∥2 ≤ ζp, m(t). The quadrature-based error estimates given in Remarks 9 and 10 result in estimates for the error norm, ∥lp, m(t)∥2 ≈ ζp, m(t), and under additional conditions they also give upper bounds.

For a fixed tolerance tol, we use the notation t(m) for the smallest time t with ζp, m(t) = t ⋅tol, see (2.13). This choice of t(m) helps us to verify the tested error estimates for a time t which is of the most practical interest. With the help of a reference solution, the true error norm per unit step can be tested by ∥lp, m(t(m))∥2/t(m).

We also consider the following previously known error estimates in our numerical experiments. The generalized residual estimate [28] was recapitulated in Remark 9 and will be discussed in the numerical experiments. Furthermore, we test the performance of the error bound given in [10, Proposition 6]. This upper bound on the error norm applies to the Krylov approximation of φp(−tA)v for \(p\in {\mathbb {N}}_{0}\), a matrix \(A\in {\mathbb {R}}^{n\times n}\) with a numerical range in the right complex half-plane (up to a potential shift), and \(v\in \mathbb {R}^{n}\). In this case, the matrix A can have real and complex eigenvalues, where the latter come in complex conjugate pairs. Concerning the skew-Hermitian case, a similar error bound for the Krylov approximation to φp(−itB)v for a Hermitian matrix \(B\in {\mathbb {R}}^{n\times n}\) and \(p\in {\mathbb {N}}_{0}\) is given separately in [10, Proposition 8]. To evaluate these error bounds the eigenvalues of Hm and the terms hm+ 1, m and γm are used.

A series expansion for the error concerning φ-functions is given in [49, Theorem 2], and the leading terms of this expansion can be used for error estimation, cf. [41, 49]. In general, [49] suggests evaluating more than one term of this series to ensure the reliability of the obtained error estimate, which requires further matrix-vector multiplications in the given large-dimensional space. This can often be inefficient in terms of computational cost, cf. [30, Remark 7], and we avoid this series expansion in the general case. However, when the Ritz values are real, the error bound in Corollary 3 (corresponding to the bound in Theorem 4) coincides with the leading term of the error series in [49, Theorem 2]. Thus, the first term of the error series in [49, Theorem 2] yields a reliable error bound in this case. For the convection-diffusion equation with parameter ν = 100 in Section 5.1 below (the Ritz values have negligible imaginary parts in this case), the error bound of Theorem 4 performs well (comparably to the effective order estimate and better than the other error estimates considered, e.g., the generalized residual estimate), and this potentially carries over to the error estimates in [41, 49].

5.1 Convection-diffusion equation

Consider the following two-dimensional convection-diffusion equation with t ≥ 0 and x ∈ [0,1]2,

$$ \partial_{t} u = L u,~~~\text{with}~~L= {\Delta} + \nu (\partial_{x_{1}} + \partial_{x_{2}} ) ,~~~u=u(t,x),~\nu\in\mathbb{R}. $$
(5.1)

Let \(A\in \mathbb {R}^{n\times n}\) be obtained by the two-dimensional finite difference discretization of the operator L in (5.1) with zero Dirichlet boundary conditions and N = 500 inner mesh points in each spatial direction, hence, n = N2. This test problem is similar to other convection-diffusion equations appearing in the study of Krylov subspace methods, see also [6, 15, 19, 30] and others.

For the convection parameter we choose ν = 100 and ν = 500, which results in a non-normal matrix A. Considering the spectrum of A, the case ν = 100 is closer to the Hermitian case and ν = 500 is closer to the skew-Hermitian case. In both cases, the numerical range of A is contained in the left complex half-plane, i.e., μ2(A) ≤ 0.
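The discretized operator can be assembled via Kronecker products; the sketch below is ours, using SciPy sparse matrices and central differences for the convection term ((5.1) itself does not fix the scheme), with a small N for illustration. The skew-symmetry of the convection part makes the check μ2(A) ≤ 0 immediate: the symmetric part of A is the (negative definite) discrete Laplacian.

```python
import numpy as np
import scipy.sparse as sp
from scipy.sparse.linalg import eigsh

def convection_diffusion_matrix(N, nu):
    """FD discretization of L = Laplacian + nu*(d/dx1 + d/dx2) on the unit
    square, zero Dirichlet BC, N inner points per direction (h = 1/(N+1))."""
    h = 1.0 / (N + 1)
    e = np.ones(N - 1)
    D2 = sp.diags([e, -2.0 * np.ones(N), e], [-1, 0, 1]) / h**2   # second derivative
    D1 = sp.diags([-e, e], [-1, 1]) / (2.0 * h)                   # central first derivative
    T = D2 + nu * D1                                              # 1D operator
    I = sp.identity(N)
    return (sp.kron(I, T) + sp.kron(T, I)).tocsr()                # Kronecker sum

N, nu = 20, 100.0          # N = 500 in the text; small N for illustration
A = convection_diffusion_matrix(N, nu)
S = 0.5 * (A + A.T)        # symmetric part: the convection term cancels
mu2 = eigsh(S, k=1, which='LA', return_eigenvectors=False)[0]  # = mu_2(A) < 0
```

Here `mu2` coincides with twice the largest eigenvalue of the 1D discrete second-derivative operator and is negative independently of ν.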

We discuss error estimates for the Krylov approximation of the matrix exponential (p = 0) and of a φ-function (for which we choose p = 2). For the case p = 0, the action of the matrix exponential etAv is approximated in the Krylov subspace \({{\mathscr{K}}}_{m}(A,v)\), see (2.8b). Analogously, for the case p = 2 we approximate φp(tA)v as given in (2.8a). As a starting vector we choose the normalized vector \(v=(1/N,\ldots ,1/N)^{\ast }\in {\mathbb {R}}^{n}\). In Fig. 3, we compare the error bounds given in Theorem 4, Corollary 4, and [10, Proposition 6], as well as the generalized residual estimate (Remark 9) and the effective order estimate (Remark 10), for the convection-diffusion equation. The error bound of Corollary 4 is applied with \(\xi _{\max \limits }=0\) (the effect of \(\xi _{\max \limits }\) is negligible for the current examples). Concerning the error bound given in [10, Proposition 8], we choose the parameter ε by minimizing [10, (39)], and a = 0.

Fig. 3
figure 3

Convection-diffusion problem (5.1) for the parameter ν = 100 (top) and ν = 500 (bottom). For each choice of ν we consider p = 0 (left) and p = 2 (right). Three rows of plots are addressed to each choice of ν: The first row shows the time t(m), which is the smallest t such that ζp, m(t) = t ⋅tol for tol = \(10^{-6}\) and ζp, m corresponding to the error bound given in Theorem 4 (×), Corollary 4 (∘), the generalized residual estimate given in Remark 9 (+), the effective order estimate given in Remark 10 (\(\Box \)), and the error bound given in [10, Proposition 6] (△). For the second row we choose t⋆(m) as the largest of the time steps t(m) given by the discussed error estimates, and we show t(m)/t⋆(m) for t(m) as chosen above. The third row shows the true error per unit step, ∥lp, m(t(m))∥2/t(m), for the time t(m) as chosen above

For the case ν = 100, the eigenvalues of Hm have negligible imaginary parts, and the upper bound given in Theorem 4 constitutes a tight upper bound on the exact evaluation of the scaled defect integral, which yields a tight error bound. This error bound and the effective order estimate (Remark 10), which is based on a quadrature estimate of the defect integral, yield approximately the same results for the case ν = 100. The performance of the generalized residual estimate (Remark 9) is similar to that of the error bound in [10, Proposition 6], especially for larger choices of m. The error bound in Corollary 4 is only accurate for small m in the current example. The high accuracy of the error bound in Theorem 4 and of the effective order estimate results in time steps t(m) which are larger than the time steps suggested by the generalized residual estimate and the error bound in [10, Proposition 6], and significantly larger than the time steps given by Corollary 4. Comparing the cases p = 0 and p = 2, the time steps suggested by the error bounds of Corollary 4 and [10, Proposition 6] are slightly smaller in relation to the time step prescribed by the effective order estimate for p = 2. Considering the true error for the time steps computed by the error bound in Theorem 4, the effective order estimate, and the generalized residual estimate, the performance of these estimates differs only slightly between the cases p = 0 and p = 2.

For the case ν = 500, the matrix Hm has eigenvalues with larger imaginary parts (especially for larger m). In this case the error bound in Theorem 4 is less tight, and the effective order estimate (Remark 10) performs best compared to the other error estimates. Comparing the cases p = 0 and p = 2, we observe that the time steps suggested by the error bounds of Theorem 4, Corollary 4, and [10, Proposition 6] are slightly smaller relative to the time step of the effective order estimate for p = 2.

The criterion ac.est.1(t) given in Remark 7 is evaluated for ν = 100,500 and p = 0,2 with t(m) corresponding to Theorem 4 (see caption of Fig. 3). For ν = 100 we obtain ac.est.1(t(m)) < 0.1 for every tested m and p = 0,2. For ν = 500 the smallest m with ac.est.1(t(m)) > 0.1 is m = 40 for p = 0 and m = 36 for p = 2. The error bound in Theorem 4 conforms to an upper bound on the scaled defect integral, and in the case ac.est.1(t(m)) > 0.1 a more accurate estimate of the defect integral is likely to perform better. For ν = 500 and m = 40 (p = 0) or m = 36 (p = 2), we observe that this is the case for the effective order estimate. Analogously to the criterion ac.est.1(t), we test ac.est.2(t) given in Remark 8 for t(m) according to Corollary 4. For ν = 100 the smallest m with ac.est.2(t(m)) > 0.1 is m = 7 for both p = 0 and p = 2. For ν = 500 the smallest m with ac.est.2(t(m)) > 0.1 is m = 8 for p = 0 and m = 7 for p = 2.
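The time steps t(m) used throughout these experiments are defined implicitly by the scalar equation ζp, m(t) = t ⋅tol. Assuming ζp, m(t)/t is increasing in t (as for an error estimate of effective polynomial order larger than one), this equation can be solved by simple bisection. The following sketch illustrates this; the function name and the toy estimate ζ(t) = t6 are illustrative and not taken from the paper:

```python
def smallest_admissible_step(zeta, tol=1e-6, t_hi=1.0, max_iter=80):
    """Find the smallest t > 0 with zeta(t) = t * tol, i.e., the largest
    step whose estimated error per unit step stays below tol.
    Assumes zeta(t)/t is increasing, so g(t) = zeta(t) - t*tol has a
    single sign change on (0, infinity)."""
    g = lambda t: zeta(t) - t * tol
    # grow the upper bracket until the sign change is enclosed
    t_lo = 0.0
    while g(t_hi) < 0.0:
        t_lo, t_hi = t_hi, 2.0 * t_hi
    # plain bisection on [t_lo, t_hi]
    for _ in range(max_iter):
        t_mid = 0.5 * (t_lo + t_hi)
        if g(t_mid) < 0.0:
            t_lo = t_mid
        else:
            t_hi = t_mid
    return 0.5 * (t_lo + t_hi)

# toy error estimate zeta(t) = t^6, i.e., effective order 6:
t_star = smallest_admissible_step(lambda t: t**6)
```

For the toy estimate, the exact solution of t6 = t ⋅10−6 is t = 10−6/5, which the bisection recovers to machine precision.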

5.2 Free Schrödinger equation, a skew-Hermitian problem

For the free Schrödinger equation, we let A be a finite difference discretization of the Laplace operator, precisely, we choose A corresponding to L in (5.1) with ν = 0 and N = 500. With A corresponding to a discretized Laplace operator, the vector eitAv yields a solution to a discretized free Schrödinger equation with starting vector v. The free Schrödinger equation represents a skew-Hermitian problem, and following Remark 4 we approximate eitAv in the Krylov subspace \({\mathscr{K}}_{m}(A,v)\) by \(\beta V_{m} \mathrm {e}^{\mathrm {i} t H_{m}}e_{1}\). Analogously to the previous subsection, we choose the normalized starting vector \(v=(1/N,\ldots ,1/N)^{\ast }\in {\mathbb {R}}^{n}\), and we also consider the Krylov approximation to φp(itA)v for p = 2, i.e., βVmφp(itHm)e1.
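The Krylov approximation βVm eitHm e1 described above can be sketched via the Arnoldi process (which reduces to the Lanczos recurrence in the Hermitian case). The helper name krylov_expi is ours, and for the purpose of a sketch we evaluate expm(itHm) by a dense method, which is cheap since m is small compared to n:

```python
import numpy as np
from scipy.linalg import expm

def krylov_expi(A, v, t, m):
    """Approximate e^{itA} v by beta * V_m * expm(i*t*H_m) * e_1,
    with V_m, H_m from the Arnoldi process for K_m(A, v)."""
    n = len(v)
    beta = np.linalg.norm(v)
    V = np.zeros((n, m + 1), dtype=complex)
    H = np.zeros((m + 1, m), dtype=complex)
    V[:, 0] = v / beta
    for j in range(m):
        w = A @ V[:, j]
        for i in range(j + 1):            # modified Gram-Schmidt
            H[i, j] = np.vdot(V[:, i], w)
            w -= H[i, j] * V[:, i]
        H[j + 1, j] = np.linalg.norm(w)
        if H[j + 1, j] < 1e-12:           # lucky breakdown: K_j is invariant
            m = j + 1
            break
        V[:, j + 1] = w / H[j + 1, j]
    Hm = H[:m, :m]
    e1 = np.zeros(m); e1[0] = 1.0
    return beta * V[:, :m] @ (expm(1j * t * Hm) @ e1)
```

For m = n (or in case of a lucky breakdown), the Krylov subspace is invariant under A and the approximation reproduces eitAv exactly, which provides a simple consistency check against a dense evaluation.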

In Fig. 4, the error bounds given in Corollary 4 (which coincides with the error bound given in Theorem 4 in the skew-Hermitian case) and [10, Proposition 8] (the counterpart to [10, Proposition 6] for the skew-Hermitian case), the effective order estimate (Remark 10), and the generalized residual estimate (Remark 9) are evaluated for the current example. For the parameter ε in [10, Proposition 8], we choose ε = m/t as suggested in the numerical experiments therein.

Fig. 4

The skew-Hermitian problem φp(iA)v where A corresponds to the Laplace operator ((5.1) with ν = 0) and v = (1/N,…,1/N). Results are shown for p = 0 (left) and p = 2 (right); for p = 0 this problem is related to the free Schrödinger equation. The top row shows the time t(m), which is the smallest t such that ζp, m(t) = t ⋅tol for tol = 10− 6, with ζp, m corresponding to the error bound given in Theorem 4 (×), Corollary 4 (∘), the generalized residual estimate given in Remark 9 (+ ), the effective order estimate given in Remark 10 (\(\Box \)), and the error bound given in [10, Proposition 8] (△). The error bounds in Theorem 4 (×) and Corollary 4 (∘) coincide in the skew-Hermitian case. For the middle row, we normalize the time steps t(m) shown above by the largest time step among the discussed error estimates. The bottom row shows the true error per unit step, ∥lp, m(t(m))∥2/t(m), for the time t(m) as chosen above

For the skew-Hermitian case, the effective order estimate (Remark 10) yields the largest time steps among the error estimates. The error bound of Corollary 4 performs well for moderate m and better than the error bound in [10, Proposition 8] for all tested m. For larger m, the generalized residual estimate performs better than the error bound of Corollary 4. As in the examples of the previous subsection, the error bound of Corollary 4 performs better for p = 0 than for p = 2; the same holds for the error bound of [10, Proposition 8]. The performance of the effective order estimate and the generalized residual estimate differs only slightly between the cases p = 0 and p = 2.

We test ac.est.2(t) given in Remark 8 for t(m) according to Corollary 4. The smallest m with ac.est.2(t(m)) > 0.1 is m = 15 for p = 0 and m = 13 for p = 2. Following Remark 8, the error bound given in Corollary 4 overestimates the error by a factor 1.1 (in an asymptotic sense) for these values of m, which is consistent with the results shown in Fig. 4.

5.3 Free Schrödinger equation with a double well potential and a Gaussian wave packet as an initial state

In the following numerical experiment, we choose a special starting vector which results in the matrix Hm having clustered eigenvalues, and we observe effects which were previously discussed in Section 4.1. Typically, this is related to regularity properties of the underlying initial state.

We consider the one-dimensional free Schrödinger equation with a double well potential,

$$ \partial_{t} \psi= -\mathrm{i} H\psi,~~~\text{with}~~H = -{\Delta} + V,~~~\psi=\psi(t,x)\in\mathbb{C},~V=V(x)\in\mathbb{R}, $$
(5.2)

for t ≥ 0, x ∈ [− 10,10] and V (x) = x4 − 15x2. Let \(B\in \mathbb {C}^{n\times n}\) be the discretized version of the Hamiltonian operator H in (5.2) with periodic boundary conditions using a finite difference scheme with a mesh of size n = 10000. With B Hermitian, the full problem A = −iB is skew-Hermitian (see Remark 4) with μ2(A) = 0. For the initial state of (5.2) we choose a Gaussian wave packet,

$$ \psi(t=0,x)=(0.2\pi)^{-1/4}\exp(-(x+2.5)^{2}/(0.4)), $$
(5.3)

which is evaluated on the mesh and normalized to obtain a discrete starting vector \(v\in \mathbb {R}^{n}\). This problem also appears in [29, 51].
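The setup of this experiment can be sketched as follows, assuming a standard second-order finite difference discretization; the reduced mesh size n used here is a placeholder for the paper's n = 10000:

```python
import numpy as np

# Discretize H = -Laplacian + V on x in [-10, 10] with periodic
# boundary conditions; V(x) = x^4 - 15 x^2 is the double well potential.
n = 400                                            # placeholder mesh size
x = np.linspace(-10.0, 10.0, n, endpoint=False)    # periodic grid
h = x[1] - x[0]

# second-order discrete Laplacian, wrapped around for periodicity
lap = -2.0 * np.eye(n) + np.eye(n, k=1) + np.eye(n, k=-1)
lap[0, -1] = lap[-1, 0] = 1.0
B = -lap / h**2 + np.diag(x**4 - 15.0 * x**2)      # Hermitian B; A = -iB

# Gaussian wave packet (5.3), evaluated on the mesh and normalized
psi0 = (0.2 * np.pi) ** (-0.25) * np.exp(-((x + 2.5) ** 2) / 0.4)
v = psi0 / np.linalg.norm(psi0)
```

With B Hermitian by construction, A = −iB is skew-Hermitian, and v serves as the discrete starting vector for the Krylov approximation of e−itBv.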

We discuss error estimates for the case p = 0 (Krylov approximation of e−itBv). The implementation of the skew-Hermitian problem is described in Remark 4. In Fig. 5 the upper bound given in Corollary 4 (which coincides with the error bound given in Theorem 4 for the skew-Hermitian case) and the error estimates given in Remark 9 and 10 are compared. Additionally, we consider the error bound given in [10, Proposition 8] with the parameter choice ε = m/t.

Fig. 5

Results for the free Schrödinger problem with a double well potential and the starting vector given by (5.3). This figure shows the time t(m) (bottom left), which is the smallest t such that ζ0, m(t) = t ⋅tol for tol = 10− 6, the true error per unit step ∥l0, m(t(m))∥2/t(m) (top), and the defect norm ∥δm,0(t)∥2 (bottom right) for m ∈ {10,20,30,40,50}. The results for t(m) and ∥l0, m(t(m))∥2/t(m) are given for ζ0, m being the upper norm bound given in Theorem 4 (×), Corollary 4 (∘), the generalized residual estimate given in Remark 9 (+ ), the effective order estimate given in Remark 10 (\(\Box \)), and the error bound given in [10, Proposition 8] (△). The results for Theorem 4 (×) and Corollary 4 (∘) coincide in the skew-Hermitian case

The error bounds given in Corollary 4 and [10, Proposition 8] are reliable but not tight for the current example. Thus, the time steps t(m) suggested by these error bounds are significantly smaller than those suggested by the quadrature-based error estimates (Remarks 9 and 10); comparison with the numerical experiments of the previous subsection indicates that this is strongly affected by the choice of the starting vector. For the error bound in Corollary 4, this can be explained by the loss of order of the defect. Nevertheless, the error bound in Corollary 4 still performs better than the error bound in [10, Proposition 8].

In terms of accuracy, the effective order estimate (Remark 10) performs significantly better than the error bounds in Corollary 4 and [10, Proposition 8], and better than the generalized residual estimate (Remark 9). In terms of reliability, we have argued that the effective order estimate and the generalized residual estimate constitute upper bounds on the error norm when the defect norm behaves sufficiently smoothly. The defect norm ∥δm,0(t)∥2, shown in the lower right corner of Fig. 5, exhibits oscillatory behavior in a specific time regime, which can be related to the starting vector, cf. Section 4.1. For the time steps relevant to the current example, this does not critically affect the quadrature estimates of the defect integral related to Remarks 9 and 10. Under certain conditions, e.g., a different choice of the tolerance tol, this oscillatory behavior of the defect can lead to failure of the error estimates given in Remarks 9 and 10. However, the quadrature of the defect integral can be refined in such cases to ensure a reliable error estimate.
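The refinement mentioned above can be sketched as follows, assuming the defect norm is available as a callable. The composite trapezoidal rule with successive node doubling used here is illustrative and not the specific quadrature of Remarks 9 and 10; its self-checking refinement is one simple way to guard against an oscillatory defect norm fooling a coarse rule:

```python
import numpy as np

def defect_integral(defect_norm, t, nodes=9, rtol=1e-3, max_nodes=100000):
    """Estimate int_0^t ||delta_m(s)|| ds by composite trapezoidal
    quadrature; `defect_norm` is a callable s -> ||delta_m(s)||_2.
    The node count is roughly doubled until the estimate stabilizes."""
    def trap(k):
        s = np.linspace(0.0, t, k)
        f = np.array([defect_norm(si) for si in s])
        return (t / (k - 1)) * (f.sum() - 0.5 * (f[0] + f[-1]))
    est = trap(nodes)
    while nodes < max_nodes:
        nodes = 2 * nodes - 1             # refine the composite rule
        new = trap(nodes)
        if abs(new - est) <= rtol * abs(new):
            return new
        est = new
    return est                            # fallback: coarsest-acceptable value
```

For a smooth defect norm the refinement terminates after a few doublings; an oscillatory defect norm forces additional refinement instead of silently returning an underestimate.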

6 Conclusions and outlook

In this work, various a posteriori bounds and estimates on the error norm, which originate in an integral representation of the error via the defect (residual), are studied. We have characterized the accuracy of these error bounds by the position of the Ritz values (i.e., the eigenvalues of Hm) in the complex plane. The case of real Ritz values is the most favorable one for obtaining a tight error bound via an integral of the defect norm (Corollary 3). A new error bound (Theorem 4) has been shown to be tight if the Ritz values are close to the real axis, and in this case it compares favorably with existing error bounds. We further recapitulate an existing error bound (Corollary 4) which remains relevant, especially in the case of Ritz values with a significant imaginary part. In addition, for the error bounds in Theorem 4 and Corollary 4, we have provided a criterion to quantify the achieved accuracy on the fly. For an illustration of the claims concerning the new error bound, we primarily refer to the numerical example given in Section 5.1. The quadrature-based error estimates in Section 4.1 (e.g., the generalized residual estimate) do not yield proven upper bounds on the error norm, and we have addressed special cases (e.g., the numerical example in Section 5.3) for which the reliability of these estimates can be problematic. These cases are also analyzed in terms of Ritz values in Section 4.1, and this relation can be of further interest for a numerical implementation. Nevertheless, in most cases the quadrature-based estimates remain valid, and the effective order estimate stands out in terms of performance.

We also remark that the theory provided in this work makes it possible to adapt the choice of the error estimate on the fly, in order to obtain an estimate which is as reliable, accurate, and economical as possible. This is the topic of further work.