1 Introduction

The use of computers is essential for predicting and understanding the behaviour of many physical systems. Sensitive dependence on initial conditions is typical in many physical systems. This sensitivity problem raises nontrivial reliability and stability issues regarding any computational approach to such systems. Moreover, it strongly motivates the study of reliable computational methods for understanding statistical properties of physical systems.

In this note we consider the rigorous computation of diffusion coefficients in a class of systems where a central limit theorem holds. Such coefficients are focal in the study of limit theorems and fluctuations for dynamical systems (see [8, 12, 13, 17, 23, 28] and references therein). Given a piecewise expanding map, an observable, and a pre-specified tolerance on error, we approximate in a certified way the diffusion coefficient up to the per-specified error (see Theorem 2.3).

Our rigorous approximation is based on a suitable finite dimensional approximation (discretization) of the system, called Ulam’s method [36]. Ulam’s method is known to provide rigorous approximations of SRB (Sinai-Ruelle-Bowen) measures and other important dynamical quantities for different types of dynamical systems (see [13, 9, 10, 14, 15, 25, 29, 30] and references therein). Moreover, this method was also used to detect coherent structures in geophysical systems (see e.g. [7, 34]).

In [32], following the approach of [18], a Fourier approximation scheme was used to estimate diffusion coefficients for expanding maps. The approach of [32] requires the map to have a Markov partition and to be piecewise analytic. Although the result of [32] provides an order of convergence, it does not compute the constant hiding in the rate of convergence. In our approach, we do not require the map to admit a Markov partition and we only assume it is piecewise \(C^{2}\). More importantly, our approximation is rigorous. To give the reader a flavour of what we mean by rigorous, we close this section by providing in part (b) of the following theorem a prototype result of this paper:Footnote 1

Theorem 1.1

LetFootnote 2

$$\begin{aligned} T(x)=2x+\frac{1}{2}x(1-x) \qquad {\text {(mod } 1)}. \end{aligned}$$
(1.1)
  1. (a)

    T admits a unique absolutely continuous invariant measure \(\nu \) and if \(\psi \) is a function of bounded variation the Central Limit Theorem holds:

    $$\begin{aligned} \frac{1}{\sqrt{n}}\left( \sum _{i=0}^{n-1}\psi (T^ix)-n\int _I\psi d\nu \right) {\overset{\text {law}}{\longrightarrow }}\mathcal N(0, \sigma ^2). \end{aligned}$$
  2. (b)

    For \(\psi =x^2\) the diffusion coefficient \(\sigma ^2\in [0.3458,0.4152]\).

In Sect. 2, we first introduce our framework and the assumptions on it. We then state the problem and introduce the method of approximation. The statement of the general results (Theorems 2.3, 2.5) and an application to expanding maps with a neutral fixed point are also included in Sect. 2. Section 3 contains the proofs and an algorithm. Section 4 contains an example, using Lanford’s map, that illustrates the implementation of the algorithm of Sect. 3 and proves part (b) of Theorem 1.1.

2 The Setting

2.1 The System and Its Transfer Operator

Let \((I, \mathcal {B}, m)\) be the measure space, where \(I:=[0,1]\), \(\mathcal B\) is Borel \(\sigma \)-algebra, and m is the Lebesgue measure on I. Let \(T:I\rightarrow I\) be piecewise \(C^2\) and expanding (see [22, 31] for original referencesFootnote 3 and [6] for a profound background on such systems). The transfer operator (Perron-Frobenius) [4] associated with T, \(P:L^1\rightarrow L^1\) is defined by duality: for \(f\in L^1\) and \(g\in L^{\infty }\)

$$\begin{aligned} \int _I f\cdot g\circ T dm= \int _I P(f)\cdot gdm. \end{aligned}$$

Moreover, for \(f\in L^1\) we have

$$\begin{aligned} Pf(x)=\sum _{y=T^{-1}x}\frac{f(y)}{|T'(y)|}. \end{aligned}$$

For \(f\in L^1\), we define

$$\begin{aligned} Vf=\inf _{\overline{f}}\{\text {var}\overline{f}\, : f=\overline{f} \text { a.e.}\}, \end{aligned}$$

where

$$\begin{aligned} \text {var}\overline{f}=\sup \left\{ \sum _{i=0}^{l-1}|\overline{f}(x_{i+1})-\overline{f}(x_i)|\, :0=x_0<x_1<\dots <x_l=1\right\} . \end{aligned}$$

We denote by BV the space of functions of bounded variation on I equipped with the norm \(||\cdot ||_{BV}=V(\cdot )+||\cdot ||_{1}\). Further, we introduce the mixed operator norm which will play a key role in our approximation:

$$\begin{aligned} |||P|||=\underset{||f||_{BV}\le 1}{\sup }||Pf||_1. \end{aligned}$$

2.2 Assumptions

We assume:Footnote 4

(A1) \(\exists \,\ \alpha \in (0,1)\), and \(B_0\ge 0\) such that \(\forall f\in BV\)

$$\begin{aligned} VPf\le \alpha Vf+B_0||f||_1; \end{aligned}$$

(A2) P, as operator on BV, has 1 as a simple eigenvalue. Moreover P has no other eigenvalues whose modulus is unity.

Remark 2.1

It is important to remark that the constants \(\alpha \) and \(B_0\) in (A1) depend only on the map T and have explicit analytic expressions (see [22]).

The above assumptions imply that T admits a unique absolutely continuous invariant measure \(\nu \), such that \(\frac{d\nu }{dm}:=h\in BV\). Moreover, the system \((I,\mathcal B, \nu , T)\) is mixing and it enjoys exponential decay of correlations for observables in BV (see [4] for a profound background on this topic).

2.3 The Problem

Let \(\psi \in BV\) and define

$$\begin{aligned} \sigma ^2:=\lim _{n\rightarrow \infty }\frac{1}{n}\int _{I}\left( \sum _{i=0}^{n-1}\psi (T^ix)-n\int _I\psi d\nu \right) ^2d\nu . \end{aligned}$$
(2.1)

Under our assumptions the limit in (2.1) exists (see [12]), and by using the summability of the correlation decay and the duality property of P, one can rewrite \(\sigma ^2\) as

$$\begin{aligned} \sigma ^2:=\int _I\hat{\psi }^2 hdm+2\sum _{i=1}^{\infty }\int _IP^i(\hat{\psi }h)\hat{\psi }dm, \end{aligned}$$
(2.2)

where

$$\begin{aligned} \hat{\psi }:=\psi -\mu \text { and }\mu :=\int _I\psi d\nu . \end{aligned}$$

The number \(\sigma ^2\) is called the variance, or the diffusion coefficient, of \(\sum _{i=0}^{n-1}\psi (T^ix)\). In particular, for the systems under consideration, it is well known (see [12]) that the Central Limit Theorem holds:

$$\begin{aligned} \frac{1}{\sqrt{n}}\left( \sum _{i=0}^{n-1}\psi (T^ix)-n\int _I\psi d\nu \right) {\overset{\text {law}}{\longrightarrow }}\mathcal N(0, \sigma ^2). \end{aligned}$$

Moreover, \(\sigma ^2>0\) if and only if \(\psi \not = c+\phi \circ T-\phi \), \(\phi \in BV\), \(c\in \mathbb R\).

The goal of this paper is to provide an algorithm whose output approximates \(\sigma ^2\) with rigorous error bounds. The first step in our approach will be to discretize P as follows:

2.4 Ulam’s Scheme

Let \(\eta :=\{I_k\}_{k=1}^{d(\eta )}\) be a partition of [0, 1] into intervals of size \(\lambda (I_k)\le \varepsilon \). Let \(\mathfrak B_{\eta }\) be the \(\sigma \)-algebra generated by \(\eta \) and for \(f\in L^1\) define the projection

$$\begin{aligned} \Pi _{\varepsilon }f=E(f|\mathfrak B_{\eta }), \end{aligned}$$

and

$$\begin{aligned} P_{\varepsilon }=\Pi _{\varepsilon }\circ P\circ \Pi _{\varepsilon }. \end{aligned}$$

\(P_{\varepsilon }\), which is called Ulam’s approximation of P, is finite rank operator which can be represented by a (row) stochastic matrix acting on vectors in \(\mathbb R^{d(\eta )}\) by left multiplication. Its entries are given by

$$\begin{aligned} P_{kj}=\frac{\lambda (I_k\cap T^{-1}(I_j))}{\lambda (I_k)}. \end{aligned}$$

The following lemma collects well known results on \(P_\varepsilon \). See for instance [25] for proofs of (1)-(4) of the lemma, and [15, 25] and references therein for statement (5) of the lemma.

Lemma 2.2

For \(f\in BV\) we have

  1. (1)

    \(V(\Pi _{\varepsilon }f)\le V(f)\);

  2. (2)

    \(||f-\Pi _{\varepsilon }f||_{1}\le \varepsilon V(f)\);

  3. (3)
    $$\begin{aligned} VP_{\varepsilon }f\le \alpha Vf+B_0||f||_1, \end{aligned}$$

    where \(\alpha \) and \(B_0\) are the same constants that appear in (A1);

  4. (4)

    \(|||P_{\eta }-P|||\le \Gamma \varepsilon ,\) where \(\Gamma =\max \{\alpha +1,B_0\}\);

  5. (5)

    \(P_{\varepsilon }\) has a unique fixed point \(h_{\varepsilon }\in BV\). Moreover, \(\exists \) a computable constant \(K_*\) such that

    $$\begin{aligned} ||h_{\varepsilon }-h||_1\le K_*\varepsilon \ln \varepsilon ^{-1}. \end{aligned}$$

    In particular, for any \(\tau >0\), there exists \(\varepsilon _*\) such that \(||h_{\varepsilon _*}-h||_1\le \tau \).

2.5 Statement of the General Result

Define

$$\begin{aligned} \hat{\psi }_{\varepsilon }:=\psi -\mu _{\varepsilon }\text { and }\mu _{\varepsilon }:=\int _{I}\psi h_{\varepsilon }dm. \end{aligned}$$

Set

$$\begin{aligned} \sigma ^2_{\varepsilon ,l}:=\int _I\hat{\psi }_{\varepsilon }^2 h_\varepsilon dm+2\sum _{i=1}^{l-1}\int _IP_\varepsilon ^i(\hat{\psi }_{\varepsilon }h_\varepsilon )\hat{\psi }_{\varepsilon }dm. \end{aligned}$$

Theorem 2.3

For any \(\tau >0\), \(\exists \) \(l_*>0\) and \(\varepsilon _*>0\) such that

$$\begin{aligned} |\sigma ^2_{\varepsilon _*,l_*}-\sigma ^2|\le \tau . \end{aligned}$$

Remark 2.4

Theorem 2.3 says that given a pre-specified tolerance on error \(\tau >0\), one finds \(l_*>0\) and \(\varepsilon _*>0\) so that \(\sigma ^2_{\varepsilon _*,l_*}\) approximates \(\sigma \) up to the pre-specified error \(\tau \). In Sect. 3.1 we provide an algorithm that can be implemented on a computer to find \(l_*\) and \(\varepsilon _*\), and consequently \(\sigma ^2_{\varepsilon _*,l_*}\).

To illustrate the issue of the rate of convergence and to elaborate on why we define the approximate diffusion by \(\sigma ^2_{\varepsilon ,l}\) as a truncated sum, let us define

$$\begin{aligned} \sigma ^2_{\varepsilon }:=\int _I\hat{\psi }_{\varepsilon }^2 h_\varepsilon dm+2\sum _{i=1}^{\infty }\int _IP_\varepsilon ^i(\hat{\psi }_{\varepsilon }h_\varepsilon )\hat{\psi }_{\varepsilon }dm. \end{aligned}$$

Theorem 2.5

\(\exists \) a computable constant \(\tilde{K}_*\) such that

$$\begin{aligned} |\sigma ^2_{\varepsilon }-\sigma ^2|\le \tilde{K}_*\varepsilon (\ln \varepsilon ^{-1})^2. \end{aligned}$$

Remark 2.6

Note that \(\sigma _{\varepsilon }^2\) can be written as

$$\begin{aligned} \begin{array}{ll} \sigma ^2_{\varepsilon }&{}\displaystyle =\int _I\hat{\psi }_{\varepsilon }^2 h_\varepsilon dm+2\sum \limits _{i=1}^{\infty }\int _IP_\varepsilon ^i(\hat{\psi }_{\varepsilon }h_\varepsilon )\hat{\psi }_{\varepsilon }dm\\ &{}\displaystyle =-\int _I\hat{\psi }_{\varepsilon }^2 h_\varepsilon +2\int _I\hat{\psi }_{\varepsilon }(\mathbf{1}- P_\varepsilon )^{-1}(\hat{\psi }_{\varepsilon }h_\varepsilon ) dm. \end{array} \end{aligned}$$
(2.3)

Since \(P_\varepsilon \) has a matrix representation, and consequently \((I-P_{\varepsilon })^{-1}\) is a matrix, one may think that \(\sigma ^2_{\varepsilon }\) provides a more sensible formula to approximate \(\sigma ^2\) than \(\sigma ^2_{\varepsilon ,l}\). However, from the rigorous computational point of view one has to take into account the errors that arise at the computer level when estimating \((I-P_{\varepsilon })^{-1}\). Indeed \((I-P_\varepsilon )^{-1}\) can be computed rigorously on the computer by estimating it by a finite sum plus an error term coming from estimating the tail of the sum.Footnote 5 This is what we do in Theorem 2.3.

Remark 2.7

In [5] an example of a highly regular expanding map (piecewise affine) was presented where the exact rate of Ulam’s method for approximating the invariant density h is \(\varepsilon \ln \varepsilon ^{-1}\). In Theorem 2.5 the rate for approximating \(\sigma ^2\) is \(\varepsilon (\ln \varepsilon ^{-1})^2\). This is due to the fact that \(||h-h_{\varepsilon }||_1\) is an essential part in estimating \(\sigma ^2\) and the extra \(\ln \varepsilon ^{-1}\) appears because of the infinite sum in the formula of \( \sigma ^2\).

Remark 2.8

By using the representation (2.3) of \(\sigma _{\varepsilon }^2\), it is obvious that the main task in the proof of Theorem 2.5 is to estimate

$$\begin{aligned} |||(\mathbf{1}-P)^{-1}-(\mathbf{1}-P_{\varepsilon })^{-1}|||_{BV_0\rightarrow L^1}, \end{aligned}$$

where \(BV_0=\{f\in BV \text { s.t.} \int fdm=0\}\). Thus, it would be tempting to use estimate (9) in Theorem 1 of [19], which reads:

$$\begin{aligned} \begin{array}{ll} &{}|||(\mathbf{1}-P)^{-1}-(\mathbf{1}-P_{\varepsilon })^{-1}|||_{BV_0\rightarrow L^1}\\ &{}\qquad \le |||P-P_{\varepsilon }|||_{BV_0\rightarrow L^1}^{\theta }(c_1||(\mathbf{1}-P_{\varepsilon })^{-1}||_{BV_0}+c_2||(\mathbf{1}-P_{\varepsilon })^{-1}||_{BV_0}^2), \end{array} \end{aligned}$$
(2.4)

where \(\theta =\frac{\ln (r/\alpha )}{\ln (1/\alpha )}\), \(r\in (\alpha ,1)\), and \(c_1,c_2\) are constants that dependent only on \(\alpha \), \(B_0\) and r. On the one hand, this would lead to a shorter proof than the one we present in Sect. 3; however, estimate (2.4) would lead to a convergence rate of order \(\varepsilon ^{\theta }\), where \(0<\theta <1\) which is slower than the rate obtained in Theorem 2.5. Naturally, this have led us to opt for using the proofs of Sect. 3.

2.6 Approximating the Diffusion Coefficient for Non-uniformly Expanding Maps

We now show that Theorem 2.3 can be used to approximate the diffusion coefficient for non-uniformly expanding maps. We restrict the presentation to the model that was popularized by Liverani–Saussol–Vaienti [27]. Such systems have attracted the attention of both mathematicians [27, 37] and physicists because of their importance in the study of intermittent transition to turbulence [33]. Let

$$\begin{aligned} S(x)=\left\{ \begin{array}{ll} x(1+2^{\gamma }x^{\gamma }) &{}\quad x\in \left[ 0,\frac{1}{2}\right] \\ 2x-1 &{}\quad x\in \left( \frac{1}{2},1\right] \end{array},\right. \end{aligned}$$
(2.5)

where the parameter \(\gamma \in (0, 1)\). S has a neutral fixed point at \(x=0\). It is well known that S admits a unique absolutely continuous probability measure \(\tilde{\nu }\), and the system enjoys polynomial decay of correlation for Hölder observables [37]. For \(\gamma \in (0,\frac{1}{2})\) it is known that the system satisfies the Central Limit Theorem.Footnote 6 To study such systems it is often useful to first induce S on a subset of I where the induced map T is uniformly expanding. In particular for the map (2.5), denoting its first branch by \(S_1\) and the second one by \(S_2\), one can induce S on \(\Delta :=[\frac{1}{2},1]\). For \(n\ge 0\) we define

$$\begin{aligned} x_0:=\frac{1}{2}\text { and }x_{n+1} = S_{1}^{-1}(x_{n}). \end{aligned}$$

Set

$$\begin{aligned} W_{0}:=(x_0,1),\text {and } W_{n}:=(x_{n},x_{n-1}),\, n\ge 1. \end{aligned}$$

For \(n\ge 1\), we define

$$\begin{aligned} Z_n:=S_2^{-1}(W_{n-1}). \end{aligned}$$

Then we define the induced map \(T:\Delta \rightarrow \Delta \) by

$$\begin{aligned} T(x)=S^{n}(x)\text { for }x\in Z_{n} \end{aligned}$$
(2.6)

Observe that

$$\begin{aligned} S(Z_{n})= W_{n-1}\text { and } R_{Z_{n}}=n, \end{aligned}$$

where \(R_{Z_{n}}\) is the first return time of \(Z_{n}\) to \(\Delta \). For \(x\in \Delta \), we denote by R(x) the first return time of x to \(\Delta \). Let f be Hölder with \(\int _{I} fd\tilde{\nu }=0\). Then diffusion coefficient of the system S can be written using the data of the induced map T (see [17]). In particular, for \(x\in \Delta \), writing \(\psi (x)=\sum _{i=0}^{R(x)-1}f(S^ix)\), the diffusion coefficient is given by

$$\begin{aligned} \sigma ^2:=\int _{\Delta }\psi ^2 hdm_{\Delta }+2\sum _{i=1}^{\infty }\int _{\Delta }P^i(\psi h)\psi dm_{\Delta }, \end{aligned}$$

where h is the unique invariant density of induced map T, P is the Perron–Frobenius operator associated with T, and \(m_{\Delta }\) is normalized Lebesgue measure on \(\Delta \). Thus, for \(\psi \in BV\) one can use,Footnote 7 Theorem 2.3 to approximate \(\sigma ^2\).

3 Proofs and an Algorithm

We first prove two lemmas that will be used to prove Theorem 2.3. The explicit estimates of Lemma 3.2 below will also be used in Sect. 3.1 where we present our algorithm to rigorously estimate diffusion coefficients.

Lemma 3.1

For \(\psi \in BV\), we have

  1. (1)

    \(||\hat{\psi }||_{\infty }\le 2||\psi ||_{\infty }\) and \(||\hat{\psi }_{\varepsilon }||_{\infty }\le 2||\psi ||_{\infty }\);

  2. (2)

    \(|\int _I(\hat{\psi }^2 h-\hat{\psi }_{\varepsilon }^2 h_\varepsilon )dm|\le 8||\psi ||_{\infty }^2||h_\varepsilon -h||_1.\)

Proof

Using the definition of \(\hat{\psi }\), \(\hat{\psi }_{\varepsilon }\) we get (1). We now prove (2). We have

$$\begin{aligned} \begin{array}{ll} \displaystyle |\int _I(\hat{\psi }^2_{\varepsilon }-\hat{\psi }^2)hdm|&{}=\displaystyle |\int _I(\hat{\psi }_{\varepsilon }-\hat{\psi })(\hat{\psi }_{\varepsilon }+\hat{\psi })h dm|=|\int _I(\mu -\mu _{\varepsilon })(2\psi -\mu -\mu _{\varepsilon })hdm|\\ &{}\displaystyle \le 4||\psi ||_{\infty }|\mu _{\varepsilon }-\mu |\int _Ih dm\le 4||\psi ||_{\infty }^2||h_\varepsilon -h||_1. \end{array} \end{aligned}$$
(3.1)

We now use (1) and (3.1) to get

$$\begin{aligned} \begin{array}{ll} \displaystyle |\int _I(\hat{\psi }^2 h-\hat{\psi }_{\varepsilon }^2 h_\varepsilon )dm|&{}\displaystyle \le |\int _I(\hat{\psi }^2 h-\hat{\psi }_{\varepsilon }^2 h)dm|+|\int _I(\hat{\psi }_{\varepsilon }^2 h-\hat{\psi }_{\varepsilon }^2 h_\varepsilon )dm|\\ &{}\displaystyle \le 8||\psi ||_{\infty }^2||h_\varepsilon -h||_1. \end{array} \end{aligned}$$

\(\square \)

Lemma 3.2

For any \(l\ge 1\) we have

$$\begin{aligned} \begin{array}{ll} &{}\displaystyle |\sum \limits _{i=1}^{l-1}\int _I\left( P^i_{\varepsilon }(\hat{\psi }_{\varepsilon }h_{\varepsilon })\hat{\psi }_{\varepsilon }-P^i(\hat{\psi }h)\hat{\psi }\right) dm|\le 8(l-1)\cdot ||\psi ||^2_{\infty }\cdot ||h_\varepsilon -h||_1\\ &{}\quad \displaystyle +\,2||\psi ||_{\infty } |||P_{\varepsilon }-P||| \sum \limits _{i=1}^{l-1}\sum \limits _{j=0}^{i-1}\left( 2||\psi ||_{\infty }(B_j+1+\frac{\alpha ^jB_0}{1-\alpha })+\frac{\alpha ^j(B_0+1-\alpha )}{1-\alpha }V\psi \right) , \end{array} \end{aligned}$$

where \(B_{j}=\sum _{k=0}^{j-1}\alpha ^{k}B_0.\)

Proof

$$\begin{aligned} \begin{array}{ll} &{}\displaystyle |\sum \limits _{i=1}^{l-1}\int _I\left( P^i_{\varepsilon }(\hat{\psi }_{\varepsilon }h_{\varepsilon })\hat{\psi }_{\varepsilon }-P^i(\hat{\psi }h)\hat{\psi }\right) dm|\\ &{}\displaystyle \quad \le |\sum \limits _{i=1}^{l-1}\int _I\left( P^i_{\varepsilon }(\hat{\psi }_{\varepsilon }h_{\varepsilon })\hat{\psi }_{\varepsilon }-P^i_\varepsilon (\hat{\psi }h)\hat{\psi }\right) dm|+|\sum \limits _{i=1}^{l-1}\int _I\left( P^i_{\varepsilon }(\hat{\psi }h)\hat{\psi }-P^i(\hat{\psi }h)\hat{\psi }\right) dm|\\ &{}\displaystyle \quad \le |\sum \limits _{i=1}^{l-1}\int _I P^i_{\varepsilon }(\hat{\psi }_{\varepsilon }h_{\varepsilon }-\hat{\psi }h)\psi dm|+|\sum \limits _{i=1}^{l-1}\int _I\left( P^i_{\varepsilon }(\hat{\psi }_{\varepsilon }h_{\varepsilon })\mu _{\varepsilon }-P^i_\varepsilon (\hat{\psi }h)\mu \right) dm|\\ &{}\displaystyle \qquad +\,|\sum \limits _{i=1}^{l-1}\int _I\left( P^i_{\varepsilon }(\hat{\psi }h)\hat{\psi }-P^i(\hat{\psi }h)\hat{\psi }\right) dm|\\ &{}\quad :=(I)+(II) +(III). \end{array} \end{aligned}$$

We have

$$\begin{aligned} \begin{array}{ll} (I)&{}\displaystyle \le ||\psi ||_{\infty }\sum \limits _{i=1}^{l-1}\int _I |\hat{\psi }_{\varepsilon }h_{\varepsilon }-\hat{\psi }h|dm\\ &{}\displaystyle = ||\psi ||_{\infty }\cdot (l-1)\int _I |\hat{\psi }_{\varepsilon }h_{\varepsilon }-\hat{\psi }_{\varepsilon }h+\hat{\psi }_{\varepsilon }h-\hat{\psi }h|dm\\ &{}\displaystyle \le ||\psi ||_{\infty }\cdot ( l-1)\left( ||\hat{\psi }_{\varepsilon }||_{\infty }||h_\varepsilon -h||_1+|\mu -\mu _{\varepsilon }|\right) \\ &{}\displaystyle \le 3||\psi ||^2_{\infty }\cdot ( l-1) \cdot ||h_\varepsilon -h||_1. \end{array} \end{aligned}$$
(3.2)

We know estimate (II):

$$\begin{aligned} \begin{array}{ll} (II)&{}\displaystyle \le |\sum \limits _{i=1}^{l-1}\int _I\left( P^i_{\varepsilon }(\hat{\psi }_{\varepsilon }h_{\varepsilon })\mu _{\varepsilon }-P^i_\varepsilon (\hat{\psi }h)\mu _\varepsilon \right) dm|+|\sum \limits _{i=1}^{l-1}\int _I\left( P^i_\varepsilon (\hat{\psi }h)\mu _\varepsilon -P^i_\varepsilon (\hat{\psi }h)\mu \right) dm|\\ &{}\displaystyle \le (l-1)|\mu _\varepsilon | \int _I\left| \hat{\psi }_{\varepsilon }h_{\varepsilon }-\hat{\psi }h\right| dm +2(l-1)\cdot ||\psi ||_{\infty }|\mu _{\varepsilon }-\mu |\\ &{}\displaystyle \le 3||\psi ||^2_{\infty }\cdot ( l-1) \cdot ||h_\varepsilon -h||_1+2(l-1)\cdot ||\psi ||^2_{\infty }||h_{\varepsilon }-h||_1\\ &{}\displaystyle =5||\psi ||^2_{\infty }\cdot ( l-1) \cdot ||h_\varepsilon -h||_1. \end{array} \end{aligned}$$
(3.3)

Finally we estimate (III)

$$\begin{aligned} \begin{array}{ll} (III)&{}\displaystyle \le 2||\psi ||_{\infty }\sum \limits _{i=1}^{l-1}\sum \limits _{j=0}^{i-1}||P_{\varepsilon }^{i-1-j}(P_\varepsilon -P)P^j(\hat{\psi }h)||_1\\ &{}\displaystyle \le 2||\psi ||_{\infty }\cdot |||P_{\varepsilon }-P|||\cdot \sum \limits _{i=1}^{l-1}\sum \limits _{j=0}^{i-1}||P^{j}(\hat{\psi }h)||_{BV}\\ &{}\displaystyle \le 2||\psi ||_{\infty }\cdot |||P_{\varepsilon }-P|||\cdot \sum \limits _{i=1}^{l-1}\sum \limits _{j=0}^{i-1}\left( \alpha ^jV(\hat{\psi }h)+(B_j+1)||\hat{\psi }h||_1\right) \\ &{}\displaystyle \le 2||\psi ||_{\infty } |||P_{\varepsilon }-P||| \sum \limits _{i=1}^{l-1}\sum \limits _{j=0}^{i-1}\left( 2||\psi ||_{\infty }(B_j+1+\frac{\alpha ^jB_0}{1-\alpha })+\frac{\alpha ^j(B_0+1-\alpha )}{1-\alpha }V\psi \right) , \end{array} \end{aligned}$$
(3.4)

where in the above estimate we have used (A1) and its consequence that \(Vh\le \frac{B_0}{1-\alpha }\). Combining estimates (3.2),(3.3) and (3.4) completes the proof of the lemma.\(\square \)

Proof

(Proof of Theorem 2.3)

$$\begin{aligned} \begin{array}{ll} \displaystyle |\sigma ^2_{\varepsilon ,l}-\sigma ^2|&{}\displaystyle \le \left| \int _I(\hat{\psi }^2 h-\hat{\psi }_{\varepsilon }^2 h_\varepsilon )dm\right| + 2\left| \sum \limits _{i=1}^{l-1}\int _I\left( P^i_{\varepsilon }(\hat{\psi }_{\varepsilon }h_{\varepsilon })\hat{\psi }_{\varepsilon }-P^i(\hat{\psi }h)\hat{\psi }\right) dm\right| \\ &{}\quad \displaystyle +\,4||\psi ||_{\infty }\sum \limits _{i=l}^{\infty }||P^i(\hat{\psi }h)||_{1}\\ &{}:=(I)+(II)+(III). \end{array} \end{aligned}$$

We start with (III). Since \(\int _I\hat{\psi }hdm=0\), there exists a computable constant \(C_*\) and a computable numberFootnote 8 \(\rho _*\), where \(\alpha <\rho _*<1\), such that

$$\begin{aligned} ||P^i(\hat{\psi }h)||_{1}\le ||P^i(\hat{\psi }h)||_{BV}\le ||\hat{\psi }h||_{BV}C_*\rho _*^{i}\le \left( 2||\psi ||_{\infty }+V(\psi )\right) \frac{B_0+1-\alpha }{1-\alpha }C_*\rho _*^{i}. \end{aligned}$$

Consequently,

$$\begin{aligned} (III)\le 4||\psi ||_{\infty }\left( 2||\psi ||_{\infty }+V(\psi )\right) \frac{B_0+1-\alpha }{(1-\alpha )(1-\rho _*)}C_*\rho _*^{l}. \end{aligned}$$

Thus, choosing \( l_*\) such that

$$\begin{aligned} l_*:=\left\lceil \frac{\log (\tau /2)-\log \left( 4||\psi ||_{\infty }\left( 2||\psi ||_{\infty }+V(\psi )\right) \frac{B_0+1-\alpha }{(1-\alpha )(1-\rho _*)}C_*\right) }{\log \rho _*}\right\rceil \end{aligned}$$
(3.5)

implies

$$\begin{aligned} 4||\psi ||_{\infty }\sum _{i=l_*}^{\infty }||P^i(\hat{\psi }h)||_{1}\le \frac{\tau }{2}. \end{aligned}$$

Fix \(l_*\) as in (3.5). Now using Lemmas 2.2, 3.1 and 3.2, we can find \(\varepsilon _*\) such that

$$\begin{aligned} \left| \int _I(\hat{\psi }^2 h-\hat{\psi }_{\varepsilon _*}^2 h_{\varepsilon _*})dm\right| + 2\left| \sum _{i=1}^{l_*-1}\int _I\left( P^i_{\varepsilon _*}(\hat{\psi }_{\varepsilon }h_{\varepsilon _*})\hat{\psi }_{\varepsilon }-P^i(\hat{\psi }h)\hat{\psi }\right) dm\right| \le \frac{\tau }{2}. \end{aligned}$$

This completes the proof of the theorem.\(\square \)

3.1 Algorithm

Theorem 2.3 suggests an algorithm as follows. Given T that satisfies (A1) and (A2) and \(\tau >0\) a tolerance on error:

  1. (1)

    Find \(l_*\) such that

    $$\begin{aligned} 4||\psi ||_{\infty }\sum _{i=l_*}^{\infty }||P^i(\hat{\psi }h)||_{1}\le \frac{\tau }{2}. \end{aligned}$$
  2. (2)

    Fix \(l_*\) from (1).

  3. (3)

    Find \(\varepsilon _*=\text {mesh}(\eta )\) such that

    $$\begin{aligned} \begin{array}{ll} &{}(16(l_*-1)+8)\cdot ||\psi ||^2_{\infty }\cdot ||h_{\varepsilon _*}-h||_1\\ &{}\quad +\,4||\psi ||_{\infty } \sum \limits _{i=1}^{l_*-1}\sum \limits _{j=0}^{i-1}\left( 2||\psi ||_{\infty }(B_j+1+\frac{\alpha ^jB_0}{1-\alpha })+\frac{\alpha ^j(B_0+1-\alpha )}{1-\alpha }V\psi \right) |||P_{\varepsilon _*}-P|||\le \frac{\tau }{2}. \end{array} \end{aligned}$$
  4. (4)

    Output \(\sigma ^2_{\varepsilon _*,l_*}:=\int _I\hat{\psi }_{\varepsilon _*}^2 h_{\varepsilon _*} dm+2\sum _{i=1}^{l_*-1}\int _IP_{\varepsilon _*}^i(\hat{\psi }_{\varepsilon _*} h_{\varepsilon _*})\hat{\psi }_{\varepsilon _*} dm\).

Remark 3.3

Note that the split of \(\frac{\tau }{2}\) between items (1) and (2) in Algorithm 3.1 to lead to an error of at most \(\tau \) can be relaxed in following way. One can compute the error in item (1) to be at most \(\frac{\tau }{k}\) and in item (2) to be \(\frac{k-1}{k}\tau \) for any integer \(k\ge 2\). We exploit this fact in the implementation in Sect. 4.

Proof

(Proof of Theorem 2.5)

$$\begin{aligned} \begin{array}{ll} \displaystyle \left| \sigma ^2_{\varepsilon }-\sigma ^2\right| &{}\displaystyle \le \left| \int _I(\hat{\psi }^2 h-\hat{\psi }_{\varepsilon }^2 h_\varepsilon )dm\right| + 2\left| \sum \limits _{i=1}^{l-1}\int _I\left( P^i_{\varepsilon }(\hat{\psi }_{\varepsilon }h_{\varepsilon })\hat{\psi }_{\varepsilon }-P^i(\hat{\psi }h)\hat{\psi }\right) dm\right| \\ &{}\quad \displaystyle +\,4||\psi ||_{\infty }\sum \limits _{i=l}^{\infty }||P^i(\hat{\psi }h)||_{BV}+4||\psi ||_{\infty }\sum \limits _{i=l}^{\infty }||P_{\varepsilon }^i(\hat{\psi }_{\varepsilon } h_\varepsilon )||_{BV}\\ &{}\quad \displaystyle :=(I)+(II)+(III)+(IV). \end{array} \end{aligned}$$

We first get an estimate on (III) and (IV). There exists a computable constant \(C_*\) and a computable number \(\rho _*\), where \(\alpha <\rho _*<1\), such that

$$\begin{aligned} (III)+(IV)\le 8||\psi ||_{\infty }\left( 2||\psi ||_{\infty }+V(\psi )\right) \frac{B_0+1-\alpha }{(1-\alpha )(1-\rho _*)}C_*\rho _*^{l}. \end{aligned}$$

For (II), as in Lemma 3.2, in particular (3.4), and by using Lemma 2.2, we have

$$\begin{aligned} \begin{array}{ll} (II)&{}\le 4||\psi ||_{\infty }\sum \limits _{i=1}^{l-1}\sum \limits _{j=0}^{i-1}||P_{\varepsilon }^{i-1-j}(P_\varepsilon -P)P^j(\hat{\psi }h)||_1+16(l-1)\cdot ||\psi ||^2_{\infty }\cdot ||h_\varepsilon -h||_1\\ &{}\le 4||\psi ||_{\infty }\Gamma \cdot \left( \alpha V(\psi ) \frac{B_0+1-\alpha }{1-\alpha }+||\psi ||_{\infty }\frac{2B_0+\alpha B_0}{1-\alpha }\right) (l-1)\varepsilon \\ &{}\quad +\,K_*16(l-1)\varepsilon \ln \varepsilon ^{-1}. \end{array} \end{aligned}$$

For (I) we use Lemmas 2.2 and 3.1 to obtain

$$\begin{aligned} (I)\le 8||\psi ||_{\infty }^2||h_\varepsilon -h||_1\le 8||\psi ||_{\infty }^2K_*\varepsilon \ln \varepsilon ^{-1}. \end{aligned}$$

Finally, choosing \(l=\lceil \frac{\ln \varepsilon }{\ln \rho _*}\rceil \) leads to the rate \(\tilde{K}_*\varepsilon (\ln \varepsilon ^{-1})^2\).\(\square \)

4 Implementation of the Algorithm and Estimating the Diffusion Coefficient for Lanford’s Map

Let

$$\begin{aligned} T(x)=2x+\frac{1}{2}x(1-x) \qquad {\text {(mod } 1)}. \end{aligned}$$
(4.1)

The map defined in (4.1) is known as Lanford’s map [21]. In this section we let \(\psi =x^2\) and compute the diffusion coefficient up to a pre-specified error \(\tau =0.035\). A plot of T on [0, 1] and an approximation of its invariant density computed through Ulam’s approximation are plotted in Fig. 1.

Fig. 1
figure 1

The map T in (4.1) to the left, and its approximated invariant density to the right.

4.1 Rigorous Projections on the Ulam Basis

To compute the diffusion coefficient rigorously we have to compute rigorously the projection of an observable on the Ulam basis, i.e., given an observable \(\phi \) in BV, and the projection \(\Pi _{\varepsilon }\) we need to compute rigorously the coefficients \(\{v_0,\ldots ,v_n\}\) such that

$$\begin{aligned} \Pi _{\varepsilon }\phi =\sum _{i=0}^{n-1}v_i\cdot \frac{\chi _{I_i}}{m(I_i)}, \end{aligned}$$

where

$$\begin{aligned} v_i=\int _{I_i}\phi \,dm. \end{aligned}$$

To do so, we will use rigorous integration through interval arithmetics, as explained in the book [35].

Once an observable is projected on the Ulam basis, many operations involved in the computation of the diffusion coefficient become componentwise operations on vectors; we explain this point in more details.

The first operation is the integral with respect to Lebesgue measure of an observable projected on the Ulam basis. This is given by the following formula:

$$\begin{aligned} \int _0^1 \Pi _{\varepsilon }\phi \, dm=\int _0^1\sum _{i=0}^n v_i\frac{\chi _{I_i}}{m(I_i)}\,dm=\sum _i v_i. \end{aligned}$$

Suppose now we have computed an approximation \(h_{\varepsilon }\) of the invariant density with respect to the partition, i.e., \(\int _0^1 h_{\varepsilon } dx=1\). In the following we will denote its coefficients on the Ulam basis by \(\{w_0,\ldots w_n\}\). Note that the i-th component, \(w_i\), is the measure of \(I_i\) with respect to the measure \(h_{\varepsilon }dm\).

The second operation we are interested in is the pointwise product of functions and the relation of the projection \(\Pi _{\varepsilon }\) with this operation. We claim that:

$$\begin{aligned} \Pi _{\varepsilon } (\phi \cdot h_{\varepsilon })(x)=\Pi _{\varepsilon }\phi (x)\cdot h_{\varepsilon }(x). \end{aligned}$$

We will prove this by expressing the components of \(\Pi _{\varepsilon } (\phi \cdot h_{\varepsilon })\) as a function of the components \(\{w_0,\ldots ,w_n\}\) of \(h_{\varepsilon }\) and the components \(\{v_0,\ldots ,v_n\}\) of \(\Pi _{\varepsilon }\phi \). We claim that

$$\begin{aligned} \Pi _{\varepsilon }(\phi \cdot h_{\varepsilon })_i=\frac{v_i\cdot w_i}{m(I_i)}. \end{aligned}$$

First of all recalling that \(\chi _{I_i}^2=\chi _{I_i}\) and that \(\chi _{I_i}\cdot \chi _{I_j}=0\) for \(i\ne j\) we have:

$$\begin{aligned} \sum _i \frac{v_i\cdot w_i}{m(I_i)}\cdot \frac{\chi _{I_i}(x)}{m(I_i)}&=\sum _i v_i \cdot \frac{\chi _{I_i}(x)}{m(I_i)}\sum _i w_j \cdot \frac{\chi _{I_j}(x)}{m(I_j)}=(\Pi _{\varepsilon }\phi )(x)\cdot h_{\varepsilon }(x). \end{aligned}$$

On the right hand side, since \(h_{\varepsilon }\) is constant on each \(I_i\) and equal to \(w_{i}\), we have:

$$\begin{aligned} (\Pi _{\varepsilon } (\phi h_{\varepsilon }))_i=\int _{I_i} h_{\varepsilon }\phi \, dm=\int _{I_i} w_{i}\cdot \frac{\chi _{I_i}}{m(I_i)}\phi \, dm=\frac{w_i}{m(I_i)}\cdot \int _{I_i}\phi \, dm=\frac{w_i\cdot v_i}{m(I_i)}. \end{aligned}$$

These identities simplify the computations when dealing with the Ulam basis. It is worth noting that these identities imply that:

$$\begin{aligned} \int _0^1 \phi \cdot h_{\varepsilon } dm=\sum _i \frac{v_i\cdot w_i}{m(I_i)}. \end{aligned}$$

Moreover, it is worth observing that, if \(P_{\varepsilon }\) is the Ulam approximation and \(\phi \) is an observable:

$$\begin{aligned} P_{\varepsilon }(\phi \cdot h_{\varepsilon })=\Pi _{\varepsilon }P\Pi _{\varepsilon }(\phi \cdot h_{\varepsilon })=\Pi _{\varepsilon }P\Pi _{\varepsilon }\Pi _{\varepsilon }(\phi \cdot h_{\varepsilon })=P_{\varepsilon }(\Pi _{\varepsilon }\phi \cdot h_{\varepsilon }). \end{aligned}$$

4.2 Item (1) in Algorithm 3.1

In this step, we find \(l^{*}\) such that item (1) of Algorithm 3.1 is satisfied. In particular we want to find \(l^{*}\) such that

$$\begin{aligned} 4||\psi ||_{\infty }\sum _{i=l^{*}}^{+\infty }||P^{i}((\hat{\psi }\cdot h))||_{1}\le \frac{\tau }{256}. \end{aligned}$$

As explained in Remark 3.3, instead of verifying item (1) to be smaller than \(\frac{\tau }{2}\), we verify that it is smaller than \(\frac{ \tau }{256}\). This will give us more room in verifying item (2) so that the sum of the errors from both items is smaller than \(\tau \). Since the system satisfies (A2), there exist \(0<\rho _{*}<1\), and \(C_{*}>0\) such that for any \(g\in BV_{0}\), and any \(k\in \mathbb {N}\),

$$\begin{aligned} \Vert P^{k}g\Vert _{1}\le C_{*}\rho _{*}^{k}\Vert g\Vert _{BV}. \end{aligned}$$
(4.2)

We want to find a \(0<\rho _{*}<1\) and a \(C_{*}>0\) so that (4.2) is satisfied.

Once these two numbers are computed, we can easily find \(l_{*}\) (see (3.5)) so that item (1) is satisfied. To compute \(\rho ^{*}\) and \(C_{*}\) we follow [16] whose main idea is to build a system of iterated inequalities governed by a positive matrix \(\mathcal {M}\) such that:

$$\begin{aligned} \left( \! \begin{array}{c} \Vert P ^{in_{1}}g\Vert _{BV} \\ \Vert P ^{in_{1}}g\Vert _{L^{1}} \end{array} \!\right) \preceq \mathcal {M}^{i}\left( \! \begin{array}{c} \Vert g \Vert _{BV} \\ \Vert g \Vert _{L^{1}} \end{array} \!\right) , \end{aligned}$$
(4.3)

where \(\preceq \) means component-wise inequalities, e.g. for vectors \( \overrightarrow{x}=(x_{1},x_{2})\) and \(\overrightarrow{y}=(y_{1},y_{2})\), if \(\overrightarrow{x}\preceq \overrightarrow{y}\), then, \(x_{1}\le y_{1}\) and \( x_{2}\le y_{2}\).

By using Lemma 2.2 and Appendix, we get that, if \( ||P^n_{\varepsilon }|_{BV_0}||_1\le \alpha _2\), the following inequalities are satisfied:

$$\begin{aligned} {\left\{ \begin{array}{ll} \Vert P^{n_1 }f\Vert _{BV} \le \alpha ^{n_1 } \Vert f\Vert _{BV} + (\frac{B_0}{1-\alpha }) \Vert f\Vert _1 \\ \Vert P^{n_1 } f \Vert _1 \le \alpha _2 \Vert f \Vert _1 + \varepsilon M( (\frac{1+\alpha }{ 1-\alpha } ) \Vert f\Vert _{BV} + B_0{n_1 }(1+\alpha +M) \Vert f\Vert _1. \end{array}\right. } \end{aligned}$$
(4.4)

Using the inequalities above we have that:

$$\begin{aligned} \mathcal {M}= \begin{pmatrix} \alpha ^{n_1 } &{} B \\ \varepsilon M(\frac{1+\alpha }{1-\alpha } ) &{} \varepsilon M B_0{n_1 } (1+\alpha +M) + \alpha _2 \end{pmatrix} . \end{aligned}$$

Following the ideas of [16] we have that

$$\begin{aligned} \Vert P^{k n_1} g \Vert _{1} \le \frac{1}{b} \rho _*^{k} \Vert g \Vert _{BV}, \end{aligned}$$
(4.5)

where \(\rho _*\) is the dominant eigenvalue of \(\mathcal {M}\) and (ab) is the corresponding left eigenvector.

Thus, our main task now is to identify all the entries of the above matrix. The first one is M, a bound on the \(L^{1}\) norm of the iterates of P and \(P_{\varepsilon }\); by definition, we have that \(||P^{n}||\le 1\) and \(||P_{\varepsilon }||_{1}\le 1\), therefore \(M=1\). The two constants \(\alpha _{2}\) and \(n_{1}\) in \(\mathcal {M}\) are two constants that give us an estimate of the speed at which \(P_{\varepsilon }\) contracts the space \(BV_{0}\). Let \(P_{\varepsilon }\) be the discretized Ulam operator and fix \(\alpha _{2}<1\); we want to find and \(n_{1}\ge 0\) such that \(\forall v\in BV_{0}\)

$$\begin{aligned} \Vert P_{\varepsilon }^{n_{1}}v\Vert _{1}\le \alpha _{2}\Vert v\Vert _{1} \end{aligned}$$
(4.6)

with \(\alpha _{2}<1\). We follow the idea of [15] and use the computer to estimate \(n_{1}\) with a rigorous computation; we refer to their paper for the algorithm used to certify \(n_{1}\) and the corresponding numerical estimates and methods. Consequently, (4.3) is satisfied with \(n_1=28\) , \(\alpha \le 0.66666667\), \(B\le 1.444444445\), \(\varepsilon =1/16384\), \(M=1\), \( \alpha _2=1/64\); i.e.,

$$\begin{aligned} \mathcal {M}= \begin{pmatrix} 1.18\cdot 10^{-5} &{} 4.3333334 \\ 0.000306 &{} 0.022208 \end{pmatrix} . \end{aligned}$$

Thus, \(\rho _*= 0.05\) and the eigenvector (ab) associated to the eigenvalue \(\rho _*\) is given by \(a\in [0.006,0.007]\), \(b\in [0.993,0.994]\).

Thus, by (4.5), we obtain

$$\begin{aligned} \Vert P^{28k}g \Vert _{L^1} \le (1.007) \times 0.05^{k} \Vert g \Vert _{BV} \end{aligned}$$

Consequently we can compute \(l_*\ge 112\).

Remark 4.1

Using equation (4.5) and supposing \(l_{*}=k\cdot n_{1}\) we see that, for any \(\psi \) in \(BV_{0}\):

$$\begin{aligned} \sum _{i=l_{*}}^{+\infty }||P^{i}(\psi )||_{1}\le ||\psi ||_{BV}\frac{1 }{b}\cdot n_{1}\sum _{i=k}^{+\infty }\rho _{*}^{i}\le ||\psi ||_{BV} \frac{1}{b}n_{1}\frac{\rho _{*}^{k}}{1-\rho _{*}}. \end{aligned}$$

4.3 Item (2) of Algorithm 3.1

From now on \(l_*\) is fixed and it is equal to 112. So far, we executed the first loop of the Algorithm 3.1; i.e.,

$$\begin{aligned} 4 \Vert \psi \Vert _\infty \sum _{i=112}^\infty \Vert P^i (\hat{\psi })\Vert _{1} \le \frac{ \tau }{256}. \end{aligned}$$

Remark 4.2

Note in our computation above we have obtained \(l_*\) such that

$$\begin{aligned} 4||\psi ||_{\infty }\sum _{i=l^{*}}^{+\infty }||P^{i}((\hat{\psi }\cdot h))||_{1}\le \frac{0.01 }{256}\le \frac{\tau }{256}. \end{aligned}$$

4.4 Item (3) of Algorithm 3.1

In this step, we have to find \(\varepsilon _*\), a mesh size of the Ulam discretization, such that

$$\begin{aligned}&(16(l_*-1)+8) \cdot \Vert \psi \Vert _\infty ^2 \cdot \Vert h_{\varepsilon _*}-h\Vert _1 \nonumber \\&\qquad +\,4\Vert \psi \Vert _\infty \sum \limits _{i=1}^{l_*-1} \sum \limits _{j=0}^{i-1} \bigg (2\Vert \psi \Vert _\infty (B_j +1 + \frac{\alpha ^j B_0}{1-\alpha }) + \frac{\alpha ^j (B_0 +1-\alpha )}{1-\alpha } V \psi \bigg ) ||| P_{\varepsilon _*}-P||| \nonumber \\&\quad \le \frac{255}{256}\tau . \end{aligned}$$
(4.7)

To bound this term we need a rigorous approximation of the T-invariant density h, in the \(L^{1}\)-norm; we follow the ideas (and refer to the algorithm) of [15]. Set:

$$\begin{aligned} \kappa :=4\Vert \psi \Vert _{\infty }|||P_{\varepsilon _{*}}-P|||\sum _{i=1}^{l_{*}-1}\sum _{j=0}^{i-1}\bigg (2\Vert \psi \Vert _{\infty }(B_{j}+1+\frac{\alpha ^{j}B_{0}}{1-\alpha })+\frac{\alpha ^{j}(B_{0}+1-\alpha )}{1-\alpha }V\psi \bigg ). \end{aligned}$$
(4.8)

The following table contains, for different mesh sizes \(\varepsilon \), error bounds for the terms in equation (4.7); in particular a bound on \(\kappa \) defined in (4.8):

\(\varepsilon \)

\(2^{-12}\)

\(2^{-24}\)

\(2^{-25}\)

\(\Vert h_{\varepsilon _{*}}-h\Vert _{1}\le \)

0.016

\(3.2\cdot 10^{-5}\)

\(1.7\cdot 10^{-5}\)

\((16(l_{*}-1)+8)\cdot \Vert \psi \Vert _{\infty }^{2}\cdot \Vert h_{\varepsilon _{*}}-h\Vert _{1}\le \)

28.55

0.0571

0.0304

\(\kappa \le \)

8.08

0.0079

0.00395.

4.5 Item (4) in Algorithm 3.1

$$\begin{aligned} |\sigma _{\varepsilon _{*},l_{*}}^{2}-\sigma ^{2}|\le 0.01/256+(0.0304+0.00395)\cdot 255/256\le 0.0342, \end{aligned}$$

and we compute \(\sigma _{\varepsilon _{*},l_{*}}^{2}\)

$$\begin{aligned} \sigma _{\varepsilon _{*},l_{*}}^{2}:=\int _{I}\hat{\psi } _{\varepsilon _{*}}^{2}h_{\varepsilon _{*}}dm+2\sum _{i=1}^{l_{*}-1}\int _{I}P_{\varepsilon _{*}}^{i}(\hat{\psi }_{\varepsilon _{*}}h_{\varepsilon _{*}})\hat{\psi }_{\varepsilon _{*}}dm\in [0.38,0.381]. \end{aligned}$$

Remark 4.3

The code implementing rigorous computation of diffusion coefficients for piecewise uniformly expanding maps is avalaible at the research section of the following personal page:

http://www.im.ufrj.br/nisoli/

4.6 A Non Rigorous Verification

We also perform a non-rigorous experiment to compute \(\sigma ^2\) in the above example. Let \(\mathcal {F}_\zeta \) be the set of floating point numbers in [0, 1] with \(\zeta \) binary digits.

Note that the system has high entropy, so we have to be careful in our computation and choose \(\zeta \) big. Due to high expansion of the system, in few iterations the ergodic average along the simulated orbit may have little in common with the orbit of the real system. So, we have to do computations with a really high number of digits (\(\zeta =1024\) binary digits).

Let \(\{x_0,\ldots ,x_{n-1}\}\) be n random floating points in \(\mathcal {F}_l\); fix k and for each \(i=0,\ldots , n-1\) let

$$\begin{aligned} A_k(x_i)=\frac{1}{k}\sum _{j=0}^{k-1} \phi (T^j(x_i)). \end{aligned}$$

Let \(\mu \) be an approximation of the average of \(\phi \) with respect to the invariant measure, obtained by integrating the observable using the approximation of the invariant density:

$$\begin{aligned} \mu =[0.383,0.384]. \end{aligned}$$

Now, for each point \(\{x_0,\ldots ,x_{n-1}\}\) we compute the value \(A_k(x_0),\ldots ,A_k(x_{n-1})\) and from these we compute the following two estimators:

$$\begin{aligned} \tilde{\mu }&=\frac{1}{n}\sum _{i=0}^{n-1} A_k(x_i)\\ \tilde{\sigma }^2&=\frac{1}{n} \sum _{i=0}^{n-1} \frac{(k\cdot A_k(x_i)-k\mu )^2}{k}. \end{aligned}$$

The estimator \(\tilde{\mu }\) gives a non-rigorous estimate for the average of the observable with respect to the invariant measure, while the estimator \(\tilde{\sigma }^2\) is an estimator for the diffusion coefficient.

Fig. 2
figure 2

Distribution of the averages \(A_{k}(x_i)\), \(i=0,\ldots ,19999\) for Lanford’s map

The table below shows the outcome of the experiment with \(n = 20,000\). In Fig. 2, a histogram plot of the distribution of \(A_k(x_i)\) for \( k=10 \), \(k = 200\), \(n=20,000\). In red we have the normal distribution with average \(\mu \) and variance \(\sigma _{\varepsilon _*,l_*}^2/\sqrt{k}\).

k

\(\tilde{\mu }\)

\(\tilde{\sigma }^2\)

90

[0.383, 0.384]

[0.361, 0.362]

95

[0.383, 0.384]

[0.362, 0.363]

100

[0.383, 0.384]

[0.362, 0.363]

The output of this non-rigourous experiment is in line with the output from our rigorous computation in Sect. 4.5.