Rigorous Approximation of Diffusion Coefficients for Expanding Maps

Bahsoun, Wael; Galatolo, Stefano; Nisoli, Isaia; Niu, Xiaolong

doi:10.1007/s10955-016-1523-y

Rigorous Approximation of Diffusion Coefficients for Expanding Maps

Open access
Published: 26 April 2016

Volume 163, pages 1486–1503, (2016)
Cite this article

Download PDF

You have full access to this open access article

Journal of Statistical Physics Aims and scope Submit manuscript

Rigorous Approximation of Diffusion Coefficients for Expanding Maps

Download PDF

Wael Bahsoun¹,
Stefano Galatolo²,
Isaia Nisoli³ &
…
Xiaolong Niu¹

1479 Accesses
9 Citations
Explore all metrics

Abstract

We use Ulam’s method to provide rigorous approximation of diffusion coefficients for uniformly expanding maps. An algorithm is provided and its implementation is illustrated using Lanford’s map.

Rigorous Computation of Diffusion Coefficients for Expanding Maps

Article Open access 30 November 2017

O. Jenkinson, M. Pollicott & P. Vytnova

Accelerating solutions to diffusion equation

Article 21 June 2021

Felipe A. Asenjo & Sergio A. Hojman

Diffusion Means and Heat Kernel on Manifolds

1 Introduction

The use of computers is essential for predicting and understanding the behaviour of many physical systems. Sensitive dependence on initial conditions is typical in many physical systems. This sensitivity problem raises nontrivial reliability and stability issues regarding any computational approach to such systems. Moreover, it strongly motivates the study of reliable computational methods for understanding statistical properties of physical systems.

In this note we consider the rigorous computation of diffusion coefficients in a class of systems where a central limit theorem holds. Such coefficients are focal in the study of limit theorems and fluctuations for dynamical systems (see [8, 12, 13, 17, 23, 28] and references therein). Given a piecewise expanding map, an observable, and a pre-specified tolerance on error, we approximate in a certified way the diffusion coefficient up to the per-specified error (see Theorem 2.3).

Our rigorous approximation is based on a suitable finite dimensional approximation (discretization) of the system, called Ulam’s method [36]. Ulam’s method is known to provide rigorous approximations of SRB (Sinai-Ruelle-Bowen) measures and other important dynamical quantities for different types of dynamical systems (see [1–3, 9, 10, 14, 15, 25, 29, 30] and references therein). Moreover, this method was also used to detect coherent structures in geophysical systems (see e.g. [7, 34]).

In [32], following the approach of [18], a Fourier approximation scheme was used to estimate diffusion coefficients for expanding maps. The approach of [32] requires the map to have a Markov partition and to be piecewise analytic. Although the result of [32] provides an order of convergence, it does not compute the constant hiding in the rate of convergence. In our approach, we do not require the map to admit a Markov partition and we only assume it is piecewise $C^{2}$. More importantly, our approximation is rigorous. To give the reader a flavour of what we mean by rigorous, we close this section by providing in part (b) of the following theorem a prototype result of this paper:^{Footnote 1}

Theorem 1.1

Let^{Footnote 2}

$$\begin{aligned} T(x)=2x+\frac{1}{2}x(1-x) \qquad {\text {(mod } 1)}. \end{aligned}$$

(1.1)

(a)
T admits a unique absolutely continuous invariant measure $\nu $ and if $\psi $ is a function of bounded variation the Central Limit Theorem holds:
$$\begin{aligned} \frac{1}{\sqrt{n}}\left( \sum _{i=0}^{n-1}\psi (T^ix)-n\int _I\psi d\nu \right) {\overset{\text {law}}{\longrightarrow }}\mathcal N(0, \sigma ^2). \end{aligned}$$
(b)
For $\psi =x^2$ the diffusion coefficient $\sigma ^2\in [0.3458,0.4152]$.

In Sect. 2, we first introduce our framework and the assumptions on it. We then state the problem and introduce the method of approximation. The statement of the general results (Theorems 2.3, 2.5) and an application to expanding maps with a neutral fixed point are also included in Sect. 2. Section 3 contains the proofs and an algorithm. Section 4 contains an example, using Lanford’s map, that illustrates the implementation of the algorithm of Sect. 3 and proves part (b) of Theorem 1.1.

2 The Setting

2.1 The System and Its Transfer Operator

Let $(I, \mathcal {B}, m)$ be the measure space, where $I:=[0,1]$, $\mathcal B$ is Borel $\sigma $-algebra, and m is the Lebesgue measure on I. Let $T:I\rightarrow I$ be piecewise $C^2$ and expanding (see [22, 31] for original references^{Footnote 3} and [6] for a profound background on such systems). The transfer operator (Perron-Frobenius) [4] associated with T, $P:L^1\rightarrow L^1$ is defined by duality: for $f\in L^1$ and $g\in L^{\infty }$

$$\begin{aligned} \int _I f\cdot g\circ T dm= \int _I P(f)\cdot gdm. \end{aligned}$$

Moreover, for $f\in L^1$ we have

$$\begin{aligned} Pf(x)=\sum _{y=T^{-1}x}\frac{f(y)}{|T'(y)|}. \end{aligned}$$

For $f\in L^1$, we define

$$\begin{aligned} Vf=\inf _{\overline{f}}\{\text {var}\overline{f}\, : f=\overline{f} \text { a.e.}\}, \end{aligned}$$

where

$$\begin{aligned} \text {var}\overline{f}=\sup \left\{ \sum _{i=0}^{l-1}|\overline{f}(x_{i+1})-\overline{f}(x_i)|\, :0=x_0<x_1<\dots <x_l=1\right\} . \end{aligned}$$

We denote by BV the space of functions of bounded variation on I equipped with the norm $||\cdot ||_{BV}=V(\cdot )+||\cdot ||_{1}$. Further, we introduce the mixed operator norm which will play a key role in our approximation:

$$\begin{aligned} |||P|||=\underset{||f||_{BV}\le 1}{\sup }||Pf||_1. \end{aligned}$$

2.2 Assumptions

We assume:^{Footnote 4}

(A1) $\exists \,\ \alpha \in (0,1)$, and $B_0\ge 0$ such that $\forall f\in BV$

$$\begin{aligned} VPf\le \alpha Vf+B_0||f||_1; \end{aligned}$$

(A2) P, as operator on BV, has 1 as a simple eigenvalue. Moreover P has no other eigenvalues whose modulus is unity.

Remark 2.1

It is important to remark that the constants $\alpha $ and $B_0$ in (A1) depend only on the map T and have explicit analytic expressions (see [22]).

The above assumptions imply that T admits a unique absolutely continuous invariant measure $\nu $, such that $\frac{d\nu }{dm}:=h\in BV$. Moreover, the system $(I,\mathcal B, \nu , T)$ is mixing and it enjoys exponential decay of correlations for observables in BV (see [4] for a profound background on this topic).

2.3 The Problem

Let $\psi \in BV$ and define

$$\begin{aligned} \sigma ^2:=\lim _{n\rightarrow \infty }\frac{1}{n}\int _{I}\left( \sum _{i=0}^{n-1}\psi (T^ix)-n\int _I\psi d\nu \right) ^2d\nu . \end{aligned}$$

(2.1)

Under our assumptions the limit in (2.1) exists (see [12]), and by using the summability of the correlation decay and the duality property of P, one can rewrite $\sigma ^2$ as

$$\begin{aligned} \sigma ^2:=\int _I\hat{\psi }^2 hdm+2\sum _{i=1}^{\infty }\int _IP^i(\hat{\psi }h)\hat{\psi }dm, \end{aligned}$$

(2.2)

where

$$\begin{aligned} \hat{\psi }:=\psi -\mu \text { and }\mu :=\int _I\psi d\nu . \end{aligned}$$

The number $\sigma ^2$ is called the variance, or the diffusion coefficient, of $\sum _{i=0}^{n-1}\psi (T^ix)$. In particular, for the systems under consideration, it is well known (see [12]) that the Central Limit Theorem holds:

$$\begin{aligned} \frac{1}{\sqrt{n}}\left( \sum _{i=0}^{n-1}\psi (T^ix)-n\int _I\psi d\nu \right) {\overset{\text {law}}{\longrightarrow }}\mathcal N(0, \sigma ^2). \end{aligned}$$

Moreover, $\sigma ^2>0$ if and only if $\psi \not = c+\phi \circ T-\phi $, $\phi \in BV$, $c\in \mathbb R$.

The goal of this paper is to provide an algorithm whose output approximates $\sigma ^2$ with rigorous error bounds. The first step in our approach will be to discretize P as follows:

2.4 Ulam’s Scheme

Let $\eta :=\{I_k\}_{k=1}^{d(\eta )}$ be a partition of [0, 1] into intervals of size $\lambda (I_k)\le \varepsilon $. Let $\mathfrak B_{\eta }$ be the $\sigma $-algebra generated by $\eta $ and for $f\in L^1$ define the projection

$$\begin{aligned} \Pi _{\varepsilon }f=E(f|\mathfrak B_{\eta }), \end{aligned}$$

and

$$\begin{aligned} P_{\varepsilon }=\Pi _{\varepsilon }\circ P\circ \Pi _{\varepsilon }. \end{aligned}$$

$P_{\varepsilon }$, which is called Ulam’s approximation of P, is finite rank operator which can be represented by a (row) stochastic matrix acting on vectors in $\mathbb R^{d(\eta )}$ by left multiplication. Its entries are given by

$$\begin{aligned} P_{kj}=\frac{\lambda (I_k\cap T^{-1}(I_j))}{\lambda (I_k)}. \end{aligned}$$

The following lemma collects well known results on $P_\varepsilon $. See for instance [25] for proofs of (1)-(4) of the lemma, and [15, 25] and references therein for statement (5) of the lemma.

Lemma 2.2

For $f\in BV$ we have

(1)
$V(\Pi _{\varepsilon }f)\le V(f)$;
(2)
$||f-\Pi _{\varepsilon }f||_{1}\le \varepsilon V(f)$;
(3)
$$\begin{aligned} VP_{\varepsilon }f\le \alpha Vf+B_0||f||_1, \end{aligned}$$
where $\alpha $ and $B_0$ are the same constants that appear in (A1);
(4)
$|||P_{\eta }-P|||\le \Gamma \varepsilon ,$ where $\Gamma =\max \{\alpha +1,B_0\}$;
(5)
$P_{\varepsilon }$ has a unique fixed point $h_{\varepsilon }\in BV$. Moreover, $\exists $ a computable constant $K_*$ such that
$$\begin{aligned} ||h_{\varepsilon }-h||_1\le K_*\varepsilon \ln \varepsilon ^{-1}. \end{aligned}$$
In particular, for any $\tau >0$, there exists $\varepsilon _*$ such that $||h_{\varepsilon _*}-h||_1\le \tau $.

2.5 Statement of the General Result

Define

$$\begin{aligned} \hat{\psi }_{\varepsilon }:=\psi -\mu _{\varepsilon }\text { and }\mu _{\varepsilon }:=\int _{I}\psi h_{\varepsilon }dm. \end{aligned}$$

Set

$$\begin{aligned} \sigma ^2_{\varepsilon ,l}:=\int _I\hat{\psi }_{\varepsilon }^2 h_\varepsilon dm+2\sum _{i=1}^{l-1}\int _IP_\varepsilon ^i(\hat{\psi }_{\varepsilon }h_\varepsilon )\hat{\psi }_{\varepsilon }dm. \end{aligned}$$

Theorem 2.3

For any $\tau >0$, $\exists $ $l_*>0$ and $\varepsilon _*>0$ such that

$$\begin{aligned} |\sigma ^2_{\varepsilon _*,l_*}-\sigma ^2|\le \tau . \end{aligned}$$

Remark 2.4

Theorem 2.3 says that given a pre-specified tolerance on error $\tau >0$, one finds $l_*>0$ and $\varepsilon _*>0$ so that $\sigma ^2_{\varepsilon _*,l_*}$ approximates $\sigma $ up to the pre-specified error $\tau $. In Sect. 3.1 we provide an algorithm that can be implemented on a computer to find $l_*$ and $\varepsilon _*$, and consequently $\sigma ^2_{\varepsilon _*,l_*}$.

To illustrate the issue of the rate of convergence and to elaborate on why we define the approximate diffusion by $\sigma ^2_{\varepsilon ,l}$ as a truncated sum, let us define

$$\begin{aligned} \sigma ^2_{\varepsilon }:=\int _I\hat{\psi }_{\varepsilon }^2 h_\varepsilon dm+2\sum _{i=1}^{\infty }\int _IP_\varepsilon ^i(\hat{\psi }_{\varepsilon }h_\varepsilon )\hat{\psi }_{\varepsilon }dm. \end{aligned}$$

Theorem 2.5

$\exists $ a computable constant $\tilde{K}_*$ such that

$$\begin{aligned} |\sigma ^2_{\varepsilon }-\sigma ^2|\le \tilde{K}_*\varepsilon (\ln \varepsilon ^{-1})^2. \end{aligned}$$

Remark 2.6

Note that $\sigma _{\varepsilon }^2$ can be written as

$$\begin{aligned} \begin{array}{ll} \sigma ^2_{\varepsilon }&{}\displaystyle =\int _I\hat{\psi }_{\varepsilon }^2 h_\varepsilon dm+2\sum \limits _{i=1}^{\infty }\int _IP_\varepsilon ^i(\hat{\psi }_{\varepsilon }h_\varepsilon )\hat{\psi }_{\varepsilon }dm\\ &{}\displaystyle =-\int _I\hat{\psi }_{\varepsilon }^2 h_\varepsilon +2\int _I\hat{\psi }_{\varepsilon }(\mathbf{1}- P_\varepsilon )^{-1}(\hat{\psi }_{\varepsilon }h_\varepsilon ) dm. \end{array} \end{aligned}$$

(2.3)

Since $P_\varepsilon $ has a matrix representation, and consequently $(I-P_{\varepsilon })^{-1}$ is a matrix, one may think that $\sigma ^2_{\varepsilon }$ provides a more sensible formula to approximate $\sigma ^2$ than $\sigma ^2_{\varepsilon ,l}$. However, from the rigorous computational point of view one has to take into account the errors that arise at the computer level when estimating $(I-P_{\varepsilon })^{-1}$. Indeed $(I-P_\varepsilon )^{-1}$ can be computed rigorously on the computer by estimating it by a finite sum plus an error term coming from estimating the tail of the sum.^{Footnote 5} This is what we do in Theorem 2.3.

Remark 2.7

In [5] an example of a highly regular expanding map (piecewise affine) was presented where the exact rate of Ulam’s method for approximating the invariant density h is $\varepsilon \ln \varepsilon ^{-1}$. In Theorem 2.5 the rate for approximating $\sigma ^2$ is $\varepsilon (\ln \varepsilon ^{-1})^2$. This is due to the fact that $||h-h_{\varepsilon }||_1$ is an essential part in estimating $\sigma ^2$ and the extra $\ln \varepsilon ^{-1}$ appears because of the infinite sum in the formula of $ \sigma ^2$.

Remark 2.8

By using the representation (2.3) of $\sigma _{\varepsilon }^2$, it is obvious that the main task in the proof of Theorem 2.5 is to estimate

$$\begin{aligned} |||(\mathbf{1}-P)^{-1}-(\mathbf{1}-P_{\varepsilon })^{-1}|||_{BV_0\rightarrow L^1}, \end{aligned}$$

where $BV_0=\{f\in BV \text { s.t.} \int fdm=0\}$. Thus, it would be tempting to use estimate (9) in Theorem 1 of [19], which reads:

$$\begin{aligned} \begin{array}{ll} &{}|||(\mathbf{1}-P)^{-1}-(\mathbf{1}-P_{\varepsilon })^{-1}|||_{BV_0\rightarrow L^1}\\ &{}\qquad \le |||P-P_{\varepsilon }|||_{BV_0\rightarrow L^1}^{\theta }(c_1||(\mathbf{1}-P_{\varepsilon })^{-1}||_{BV_0}+c_2||(\mathbf{1}-P_{\varepsilon })^{-1}||_{BV_0}^2), \end{array} \end{aligned}$$

(2.4)

where $\theta =\frac{\ln (r/\alpha )}{\ln (1/\alpha )}$, $r\in (\alpha ,1)$, and $c_1,c_2$ are constants that dependent only on $\alpha $, $B_0$ and r. On the one hand, this would lead to a shorter proof than the one we present in Sect. 3; however, estimate (2.4) would lead to a convergence rate of order $\varepsilon ^{\theta }$, where $0<\theta <1$ which is slower than the rate obtained in Theorem 2.5. Naturally, this have led us to opt for using the proofs of Sect. 3.

2.6 Approximating the Diffusion Coefficient for Non-uniformly Expanding Maps

We now show that Theorem 2.3 can be used to approximate the diffusion coefficient for non-uniformly expanding maps. We restrict the presentation to the model that was popularized by Liverani–Saussol–Vaienti [27]. Such systems have attracted the attention of both mathematicians [27, 37] and physicists because of their importance in the study of intermittent transition to turbulence [33]. Let

$$\begin{aligned} S(x)=\left\{ \begin{array}{ll} x(1+2^{\gamma }x^{\gamma }) &{}\quad x\in \left[ 0,\frac{1}{2}\right] \\ 2x-1 &{}\quad x\in \left( \frac{1}{2},1\right] \end{array},\right. \end{aligned}$$

(2.5)

where the parameter $\gamma \in (0, 1)$. S has a neutral fixed point at $x=0$. It is well known that S admits a unique absolutely continuous probability measure $\tilde{\nu }$, and the system enjoys polynomial decay of correlation for Hölder observables [37]. For $\gamma \in (0,\frac{1}{2})$ it is known that the system satisfies the Central Limit Theorem.^{Footnote 6} To study such systems it is often useful to first induce S on a subset of I where the induced map T is uniformly expanding. In particular for the map (2.5), denoting its first branch by $S_1$ and the second one by $S_2$, one can induce S on $\Delta :=[\frac{1}{2},1]$. For $n\ge 0$ we define

$$\begin{aligned} x_0:=\frac{1}{2}\text { and }x_{n+1} = S_{1}^{-1}(x_{n}). \end{aligned}$$

Set

$$\begin{aligned} W_{0}:=(x_0,1),\text {and } W_{n}:=(x_{n},x_{n-1}),\, n\ge 1. \end{aligned}$$

For $n\ge 1$, we define

$$\begin{aligned} Z_n:=S_2^{-1}(W_{n-1}). \end{aligned}$$

Then we define the induced map $T:\Delta \rightarrow \Delta $ by

$$\begin{aligned} T(x)=S^{n}(x)\text { for }x\in Z_{n} \end{aligned}$$

(2.6)

Observe that

$$\begin{aligned} S(Z_{n})= W_{n-1}\text { and } R_{Z_{n}}=n, \end{aligned}$$

where $R_{Z_{n}}$ is the first return time of $Z_{n}$ to $\Delta $. For $x\in \Delta $, we denote by R(x) the first return time of x to $\Delta $. Let f be Hölder with $\int _{I} fd\tilde{\nu }=0$. Then diffusion coefficient of the system S can be written using the data of the induced map T (see [17]). In particular, for $x\in \Delta $, writing $\psi (x)=\sum _{i=0}^{R(x)-1}f(S^ix)$, the diffusion coefficient is given by

$$\begin{aligned} \sigma ^2:=\int _{\Delta }\psi ^2 hdm_{\Delta }+2\sum _{i=1}^{\infty }\int _{\Delta }P^i(\psi h)\psi dm_{\Delta }, \end{aligned}$$

where h is the unique invariant density of induced map T, P is the Perron–Frobenius operator associated with T, and $m_{\Delta }$ is normalized Lebesgue measure on $\Delta $. Thus, for $\psi \in BV$ one can use,^{Footnote 7} Theorem 2.3 to approximate $\sigma ^2$.

3 Proofs and an Algorithm

We first prove two lemmas that will be used to prove Theorem 2.3. The explicit estimates of Lemma 3.2 below will also be used in Sect. 3.1 where we present our algorithm to rigorously estimate diffusion coefficients.

Lemma 3.1

For $\psi \in BV$, we have

(1)
$||\hat{\psi }||_{\infty }\le 2||\psi ||_{\infty }$ and $||\hat{\psi }_{\varepsilon }||_{\infty }\le 2||\psi ||_{\infty }$;
(2)
$|\int _I(\hat{\psi }^2 h-\hat{\psi }_{\varepsilon }^2 h_\varepsilon )dm|\le 8||\psi ||_{\infty }^2||h_\varepsilon -h||_1.$

Proof

Using the definition of $\hat{\psi }$, $\hat{\psi }_{\varepsilon }$ we get (1). We now prove (2). We have

$$\begin{aligned} \begin{array}{ll} \displaystyle |\int _I(\hat{\psi }^2_{\varepsilon }-\hat{\psi }^2)hdm|&{}=\displaystyle |\int _I(\hat{\psi }_{\varepsilon }-\hat{\psi })(\hat{\psi }_{\varepsilon }+\hat{\psi })h dm|=|\int _I(\mu -\mu _{\varepsilon })(2\psi -\mu -\mu _{\varepsilon })hdm|\\ &{}\displaystyle \le 4||\psi ||_{\infty }|\mu _{\varepsilon }-\mu |\int _Ih dm\le 4||\psi ||_{\infty }^2||h_\varepsilon -h||_1. \end{array} \end{aligned}$$

(3.1)

We now use (1) and (3.1) to get

$$\begin{aligned} \begin{array}{ll} \displaystyle |\int _I(\hat{\psi }^2 h-\hat{\psi }_{\varepsilon }^2 h_\varepsilon )dm|&{}\displaystyle \le |\int _I(\hat{\psi }^2 h-\hat{\psi }_{\varepsilon }^2 h)dm|+|\int _I(\hat{\psi }_{\varepsilon }^2 h-\hat{\psi }_{\varepsilon }^2 h_\varepsilon )dm|\\ &{}\displaystyle \le 8||\psi ||_{\infty }^2||h_\varepsilon -h||_1. \end{array} \end{aligned}$$

$\square $

Lemma 3.2

For any $l\ge 1$ we have

$$\begin{aligned} \begin{array}{ll} &{}\displaystyle |\sum \limits _{i=1}^{l-1}\int _I\left( P^i_{\varepsilon }(\hat{\psi }_{\varepsilon }h_{\varepsilon })\hat{\psi }_{\varepsilon }-P^i(\hat{\psi }h)\hat{\psi }\right) dm|\le 8(l-1)\cdot ||\psi ||^2_{\infty }\cdot ||h_\varepsilon -h||_1\\ &{}\quad \displaystyle +\,2||\psi ||_{\infty } |||P_{\varepsilon }-P||| \sum \limits _{i=1}^{l-1}\sum \limits _{j=0}^{i-1}\left( 2||\psi ||_{\infty }(B_j+1+\frac{\alpha ^jB_0}{1-\alpha })+\frac{\alpha ^j(B_0+1-\alpha )}{1-\alpha }V\psi \right) , \end{array} \end{aligned}$$

where $B_{j}=\sum _{k=0}^{j-1}\alpha ^{k}B_0.$

Proof

$$\begin{aligned} \begin{array}{ll} &{}\displaystyle |\sum \limits _{i=1}^{l-1}\int _I\left( P^i_{\varepsilon }(\hat{\psi }_{\varepsilon }h_{\varepsilon })\hat{\psi }_{\varepsilon }-P^i(\hat{\psi }h)\hat{\psi }\right) dm|\\ &{}\displaystyle \quad \le |\sum \limits _{i=1}^{l-1}\int _I\left( P^i_{\varepsilon }(\hat{\psi }_{\varepsilon }h_{\varepsilon })\hat{\psi }_{\varepsilon }-P^i_\varepsilon (\hat{\psi }h)\hat{\psi }\right) dm|+|\sum \limits _{i=1}^{l-1}\int _I\left( P^i_{\varepsilon }(\hat{\psi }h)\hat{\psi }-P^i(\hat{\psi }h)\hat{\psi }\right) dm|\\ &{}\displaystyle \quad \le |\sum \limits _{i=1}^{l-1}\int _I P^i_{\varepsilon }(\hat{\psi }_{\varepsilon }h_{\varepsilon }-\hat{\psi }h)\psi dm|+|\sum \limits _{i=1}^{l-1}\int _I\left( P^i_{\varepsilon }(\hat{\psi }_{\varepsilon }h_{\varepsilon })\mu _{\varepsilon }-P^i_\varepsilon (\hat{\psi }h)\mu \right) dm|\\ &{}\displaystyle \qquad +\,|\sum \limits _{i=1}^{l-1}\int _I\left( P^i_{\varepsilon }(\hat{\psi }h)\hat{\psi }-P^i(\hat{\psi }h)\hat{\psi }\right) dm|\\ &{}\quad :=(I)+(II) +(III). \end{array} \end{aligned}$$

We have

$$\begin{aligned} \begin{array}{ll} (I)&{}\displaystyle \le ||\psi ||_{\infty }\sum \limits _{i=1}^{l-1}\int _I |\hat{\psi }_{\varepsilon }h_{\varepsilon }-\hat{\psi }h|dm\\ &{}\displaystyle = ||\psi ||_{\infty }\cdot (l-1)\int _I |\hat{\psi }_{\varepsilon }h_{\varepsilon }-\hat{\psi }_{\varepsilon }h+\hat{\psi }_{\varepsilon }h-\hat{\psi }h|dm\\ &{}\displaystyle \le ||\psi ||_{\infty }\cdot ( l-1)\left( ||\hat{\psi }_{\varepsilon }||_{\infty }||h_\varepsilon -h||_1+|\mu -\mu _{\varepsilon }|\right) \\ &{}\displaystyle \le 3||\psi ||^2_{\infty }\cdot ( l-1) \cdot ||h_\varepsilon -h||_1. \end{array} \end{aligned}$$

(3.2)

We know estimate (II):

$$\begin{aligned} \begin{array}{ll} (II)&{}\displaystyle \le |\sum \limits _{i=1}^{l-1}\int _I\left( P^i_{\varepsilon }(\hat{\psi }_{\varepsilon }h_{\varepsilon })\mu _{\varepsilon }-P^i_\varepsilon (\hat{\psi }h)\mu _\varepsilon \right) dm|+|\sum \limits _{i=1}^{l-1}\int _I\left( P^i_\varepsilon (\hat{\psi }h)\mu _\varepsilon -P^i_\varepsilon (\hat{\psi }h)\mu \right) dm|\\ &{}\displaystyle \le (l-1)|\mu _\varepsilon | \int _I\left| \hat{\psi }_{\varepsilon }h_{\varepsilon }-\hat{\psi }h\right| dm +2(l-1)\cdot ||\psi ||_{\infty }|\mu _{\varepsilon }-\mu |\\ &{}\displaystyle \le 3||\psi ||^2_{\infty }\cdot ( l-1) \cdot ||h_\varepsilon -h||_1+2(l-1)\cdot ||\psi ||^2_{\infty }||h_{\varepsilon }-h||_1\\ &{}\displaystyle =5||\psi ||^2_{\infty }\cdot ( l-1) \cdot ||h_\varepsilon -h||_1. \end{array} \end{aligned}$$

(3.3)

Finally we estimate (III)

$$\begin{aligned} \begin{array}{ll} (III)&{}\displaystyle \le 2||\psi ||_{\infty }\sum \limits _{i=1}^{l-1}\sum \limits _{j=0}^{i-1}||P_{\varepsilon }^{i-1-j}(P_\varepsilon -P)P^j(\hat{\psi }h)||_1\\ &{}\displaystyle \le 2||\psi ||_{\infty }\cdot |||P_{\varepsilon }-P|||\cdot \sum \limits _{i=1}^{l-1}\sum \limits _{j=0}^{i-1}||P^{j}(\hat{\psi }h)||_{BV}\\ &{}\displaystyle \le 2||\psi ||_{\infty }\cdot |||P_{\varepsilon }-P|||\cdot \sum \limits _{i=1}^{l-1}\sum \limits _{j=0}^{i-1}\left( \alpha ^jV(\hat{\psi }h)+(B_j+1)||\hat{\psi }h||_1\right) \\ &{}\displaystyle \le 2||\psi ||_{\infty } |||P_{\varepsilon }-P||| \sum \limits _{i=1}^{l-1}\sum \limits _{j=0}^{i-1}\left( 2||\psi ||_{\infty }(B_j+1+\frac{\alpha ^jB_0}{1-\alpha })+\frac{\alpha ^j(B_0+1-\alpha )}{1-\alpha }V\psi \right) , \end{array} \end{aligned}$$

(3.4)

where in the above estimate we have used (A1) and its consequence that $Vh\le \frac{B_0}{1-\alpha }$. Combining estimates (3.2),(3.3) and (3.4) completes the proof of the lemma.$\square $

Proof

(Proof of Theorem 2.3)

$$\begin{aligned} \begin{array}{ll} \displaystyle |\sigma ^2_{\varepsilon ,l}-\sigma ^2|&{}\displaystyle \le \left| \int _I(\hat{\psi }^2 h-\hat{\psi }_{\varepsilon }^2 h_\varepsilon )dm\right| + 2\left| \sum \limits _{i=1}^{l-1}\int _I\left( P^i_{\varepsilon }(\hat{\psi }_{\varepsilon }h_{\varepsilon })\hat{\psi }_{\varepsilon }-P^i(\hat{\psi }h)\hat{\psi }\right) dm\right| \\ &{}\quad \displaystyle +\,4||\psi ||_{\infty }\sum \limits _{i=l}^{\infty }||P^i(\hat{\psi }h)||_{1}\\ &{}:=(I)+(II)+(III). \end{array} \end{aligned}$$

We start with (III). Since $\int _I\hat{\psi }hdm=0$, there exists a computable constant $C_*$ and a computable number^{Footnote 8} $\rho _*$, where $\alpha <\rho _*<1$, such that

$$\begin{aligned} ||P^i(\hat{\psi }h)||_{1}\le ||P^i(\hat{\psi }h)||_{BV}\le ||\hat{\psi }h||_{BV}C_*\rho _*^{i}\le \left( 2||\psi ||_{\infty }+V(\psi )\right) \frac{B_0+1-\alpha }{1-\alpha }C_*\rho _*^{i}. \end{aligned}$$

Consequently,

$$\begin{aligned} (III)\le 4||\psi ||_{\infty }\left( 2||\psi ||_{\infty }+V(\psi )\right) \frac{B_0+1-\alpha }{(1-\alpha )(1-\rho _*)}C_*\rho _*^{l}. \end{aligned}$$

Thus, choosing $ l_*$ such that

$$\begin{aligned} l_*:=\left\lceil \frac{\log (\tau /2)-\log \left( 4||\psi ||_{\infty }\left( 2||\psi ||_{\infty }+V(\psi )\right) \frac{B_0+1-\alpha }{(1-\alpha )(1-\rho _*)}C_*\right) }{\log \rho _*}\right\rceil \end{aligned}$$

(3.5)

implies

$$\begin{aligned} 4||\psi ||_{\infty }\sum _{i=l_*}^{\infty }||P^i(\hat{\psi }h)||_{1}\le \frac{\tau }{2}. \end{aligned}$$

Fix $l_*$ as in (3.5). Now using Lemmas 2.2, 3.1 and 3.2, we can find $\varepsilon _*$ such that

$$\begin{aligned} \left| \int _I(\hat{\psi }^2 h-\hat{\psi }_{\varepsilon _*}^2 h_{\varepsilon _*})dm\right| + 2\left| \sum _{i=1}^{l_*-1}\int _I\left( P^i_{\varepsilon _*}(\hat{\psi }_{\varepsilon }h_{\varepsilon _*})\hat{\psi }_{\varepsilon }-P^i(\hat{\psi }h)\hat{\psi }\right) dm\right| \le \frac{\tau }{2}. \end{aligned}$$

This completes the proof of the theorem.$\square $

3.1 Algorithm

Theorem 2.3 suggests an algorithm as follows. Given T that satisfies (A1) and (A2) and $\tau >0$ a tolerance on error:

(1)
Find $l_*$ such that
$$\begin{aligned} 4||\psi ||_{\infty }\sum _{i=l_*}^{\infty }||P^i(\hat{\psi }h)||_{1}\le \frac{\tau }{2}. \end{aligned}$$
(2)
Fix $l_*$ from (1).
(3)
Find $\varepsilon _*=\text {mesh}(\eta )$ such that
$$\begin{aligned} \begin{array}{ll} &{}(16(l_*-1)+8)\cdot ||\psi ||^2_{\infty }\cdot ||h_{\varepsilon _*}-h||_1\\ &{}\quad +\,4||\psi ||_{\infty } \sum \limits _{i=1}^{l_*-1}\sum \limits _{j=0}^{i-1}\left( 2||\psi ||_{\infty }(B_j+1+\frac{\alpha ^jB_0}{1-\alpha })+\frac{\alpha ^j(B_0+1-\alpha )}{1-\alpha }V\psi \right) |||P_{\varepsilon _*}-P|||\le \frac{\tau }{2}. \end{array} \end{aligned}$$
(4)
Output $\sigma ^2_{\varepsilon _*,l_*}:=\int _I\hat{\psi }_{\varepsilon _*}^2 h_{\varepsilon _*} dm+2\sum _{i=1}^{l_*-1}\int _IP_{\varepsilon _*}^i(\hat{\psi }_{\varepsilon _*} h_{\varepsilon _*})\hat{\psi }_{\varepsilon _*} dm$.

Remark 3.3

Note that the split of $\frac{\tau }{2}$ between items (1) and (2) in Algorithm 3.1 to lead to an error of at most $\tau $ can be relaxed in following way. One can compute the error in item (1) to be at most $\frac{\tau }{k}$ and in item (2) to be $\frac{k-1}{k}\tau $ for any integer $k\ge 2$. We exploit this fact in the implementation in Sect. 4.

Proof

(Proof of Theorem 2.5)

$$\begin{aligned} \begin{array}{ll} \displaystyle \left| \sigma ^2_{\varepsilon }-\sigma ^2\right| &{}\displaystyle \le \left| \int _I(\hat{\psi }^2 h-\hat{\psi }_{\varepsilon }^2 h_\varepsilon )dm\right| + 2\left| \sum \limits _{i=1}^{l-1}\int _I\left( P^i_{\varepsilon }(\hat{\psi }_{\varepsilon }h_{\varepsilon })\hat{\psi }_{\varepsilon }-P^i(\hat{\psi }h)\hat{\psi }\right) dm\right| \\ &{}\quad \displaystyle +\,4||\psi ||_{\infty }\sum \limits _{i=l}^{\infty }||P^i(\hat{\psi }h)||_{BV}+4||\psi ||_{\infty }\sum \limits _{i=l}^{\infty }||P_{\varepsilon }^i(\hat{\psi }_{\varepsilon } h_\varepsilon )||_{BV}\\ &{}\quad \displaystyle :=(I)+(II)+(III)+(IV). \end{array} \end{aligned}$$

We first get an estimate on (III) and (IV). There exists a computable constant $C_*$ and a computable number $\rho _*$, where $\alpha <\rho _*<1$, such that

$$\begin{aligned} (III)+(IV)\le 8||\psi ||_{\infty }\left( 2||\psi ||_{\infty }+V(\psi )\right) \frac{B_0+1-\alpha }{(1-\alpha )(1-\rho _*)}C_*\rho _*^{l}. \end{aligned}$$

For (II), as in Lemma 3.2, in particular (3.4), and by using Lemma 2.2, we have

$$\begin{aligned} \begin{array}{ll} (II)&{}\le 4||\psi ||_{\infty }\sum \limits _{i=1}^{l-1}\sum \limits _{j=0}^{i-1}||P_{\varepsilon }^{i-1-j}(P_\varepsilon -P)P^j(\hat{\psi }h)||_1+16(l-1)\cdot ||\psi ||^2_{\infty }\cdot ||h_\varepsilon -h||_1\\ &{}\le 4||\psi ||_{\infty }\Gamma \cdot \left( \alpha V(\psi ) \frac{B_0+1-\alpha }{1-\alpha }+||\psi ||_{\infty }\frac{2B_0+\alpha B_0}{1-\alpha }\right) (l-1)\varepsilon \\ &{}\quad +\,K_*16(l-1)\varepsilon \ln \varepsilon ^{-1}. \end{array} \end{aligned}$$

For (I) we use Lemmas 2.2 and 3.1 to obtain

$$\begin{aligned} (I)\le 8||\psi ||_{\infty }^2||h_\varepsilon -h||_1\le 8||\psi ||_{\infty }^2K_*\varepsilon \ln \varepsilon ^{-1}. \end{aligned}$$

Finally, choosing $l=\lceil \frac{\ln \varepsilon }{\ln \rho _*}\rceil $ leads to the rate $\tilde{K}_*\varepsilon (\ln \varepsilon ^{-1})^2$.$\square $

4 Implementation of the Algorithm and Estimating the Diffusion Coefficient for Lanford’s Map

Let

$$\begin{aligned} T(x)=2x+\frac{1}{2}x(1-x) \qquad {\text {(mod } 1)}. \end{aligned}$$

(4.1)

The map defined in (4.1) is known as Lanford’s map [21]. In this section we let $\psi =x^2$ and compute the diffusion coefficient up to a pre-specified error $\tau =0.035$. A plot of T on [0, 1] and an approximation of its invariant density computed through Ulam’s approximation are plotted in Fig. 1.

4.1 Rigorous Projections on the Ulam Basis

To compute the diffusion coefficient rigorously we have to compute rigorously the projection of an observable on the Ulam basis, i.e., given an observable $\phi $ in BV, and the projection $\Pi _{\varepsilon }$ we need to compute rigorously the coefficients $\{v_0,\ldots ,v_n\}$ such that

$$\begin{aligned} \Pi _{\varepsilon }\phi =\sum _{i=0}^{n-1}v_i\cdot \frac{\chi _{I_i}}{m(I_i)}, \end{aligned}$$

where

$$\begin{aligned} v_i=\int _{I_i}\phi \,dm. \end{aligned}$$

To do so, we will use rigorous integration through interval arithmetics, as explained in the book [35].

Once an observable is projected on the Ulam basis, many operations involved in the computation of the diffusion coefficient become componentwise operations on vectors; we explain this point in more details.

The first operation is the integral with respect to Lebesgue measure of an observable projected on the Ulam basis. This is given by the following formula:

$$\begin{aligned} \int _0^1 \Pi _{\varepsilon }\phi \, dm=\int _0^1\sum _{i=0}^n v_i\frac{\chi _{I_i}}{m(I_i)}\,dm=\sum _i v_i. \end{aligned}$$

Suppose now we have computed an approximation $h_{\varepsilon }$ of the invariant density with respect to the partition, i.e., $\int _0^1 h_{\varepsilon } dx=1$. In the following we will denote its coefficients on the Ulam basis by $\{w_0,\ldots w_n\}$. Note that the i-th component, $w_i$, is the measure of $I_i$ with respect to the measure $h_{\varepsilon }dm$.

The second operation we are interested in is the pointwise product of functions and the relation of the projection $\Pi _{\varepsilon }$ with this operation. We claim that:

$$\begin{aligned} \Pi _{\varepsilon } (\phi \cdot h_{\varepsilon })(x)=\Pi _{\varepsilon }\phi (x)\cdot h_{\varepsilon }(x). \end{aligned}$$

We will prove this by expressing the components of $\Pi _{\varepsilon } (\phi \cdot h_{\varepsilon })$ as a function of the components $\{w_0,\ldots ,w_n\}$ of $h_{\varepsilon }$ and the components $\{v_0,\ldots ,v_n\}$ of $\Pi _{\varepsilon }\phi $. We claim that

$$\begin{aligned} \Pi _{\varepsilon }(\phi \cdot h_{\varepsilon })_i=\frac{v_i\cdot w_i}{m(I_i)}. \end{aligned}$$

First of all recalling that $\chi _{I_i}^2=\chi _{I_i}$ and that $\chi _{I_i}\cdot \chi _{I_j}=0$ for $i\ne j$ we have:

$$\begin{aligned} \sum _i \frac{v_i\cdot w_i}{m(I_i)}\cdot \frac{\chi _{I_i}(x)}{m(I_i)}&=\sum _i v_i \cdot \frac{\chi _{I_i}(x)}{m(I_i)}\sum _i w_j \cdot \frac{\chi _{I_j}(x)}{m(I_j)}=(\Pi _{\varepsilon }\phi )(x)\cdot h_{\varepsilon }(x). \end{aligned}$$

On the right hand side, since $h_{\varepsilon }$ is constant on each $I_i$ and equal to $w_{i}$, we have:

$$\begin{aligned} (\Pi _{\varepsilon } (\phi h_{\varepsilon }))_i=\int _{I_i} h_{\varepsilon }\phi \, dm=\int _{I_i} w_{i}\cdot \frac{\chi _{I_i}}{m(I_i)}\phi \, dm=\frac{w_i}{m(I_i)}\cdot \int _{I_i}\phi \, dm=\frac{w_i\cdot v_i}{m(I_i)}. \end{aligned}$$

These identities simplify the computations when dealing with the Ulam basis. It is worth noting that these identities imply that:

$$\begin{aligned} \int _0^1 \phi \cdot h_{\varepsilon } dm=\sum _i \frac{v_i\cdot w_i}{m(I_i)}. \end{aligned}$$

Moreover, it is worth observing that, if $P_{\varepsilon }$ is the Ulam approximation and $\phi $ is an observable:

$$\begin{aligned} P_{\varepsilon }(\phi \cdot h_{\varepsilon })=\Pi _{\varepsilon }P\Pi _{\varepsilon }(\phi \cdot h_{\varepsilon })=\Pi _{\varepsilon }P\Pi _{\varepsilon }\Pi _{\varepsilon }(\phi \cdot h_{\varepsilon })=P_{\varepsilon }(\Pi _{\varepsilon }\phi \cdot h_{\varepsilon }). \end{aligned}$$

4.2 Item (1) in Algorithm 3.1

In this step, we find $l^{*}$ such that item (1) of Algorithm 3.1 is satisfied. In particular we want to find $l^{*}$ such that

$$\begin{aligned} 4||\psi ||_{\infty }\sum _{i=l^{*}}^{+\infty }||P^{i}((\hat{\psi }\cdot h))||_{1}\le \frac{\tau }{256}. \end{aligned}$$

As explained in Remark 3.3, instead of verifying item (1) to be smaller than $\frac{\tau }{2}$, we verify that it is smaller than $\frac{ \tau }{256}$. This will give us more room in verifying item (2) so that the sum of the errors from both items is smaller than $\tau $. Since the system satisfies (A2), there exist $0<\rho _{*}<1$, and $C_{*}>0$ such that for any $g\in BV_{0}$, and any $k\in \mathbb {N}$,

$$\begin{aligned} \Vert P^{k}g\Vert _{1}\le C_{*}\rho _{*}^{k}\Vert g\Vert _{BV}. \end{aligned}$$

(4.2)

We want to find a $0<\rho _{*}<1$ and a $C_{*}>0$ so that (4.2) is satisfied.

Once these two numbers are computed, we can easily find $l_{*}$ (see (3.5)) so that item (1) is satisfied. To compute $\rho ^{*}$ and $C_{*}$ we follow [16] whose main idea is to build a system of iterated inequalities governed by a positive matrix $\mathcal {M}$ such that:

$$\begin{aligned} \left( \! \begin{array}{c} \Vert P ^{in_{1}}g\Vert _{BV} \\ \Vert P ^{in_{1}}g\Vert _{L^{1}} \end{array} \!\right) \preceq \mathcal {M}^{i}\left( \! \begin{array}{c} \Vert g \Vert _{BV} \\ \Vert g \Vert _{L^{1}} \end{array} \!\right) , \end{aligned}$$

(4.3)

where $\preceq $ means component-wise inequalities, e.g. for vectors $ \overrightarrow{x}=(x_{1},x_{2})$ and $\overrightarrow{y}=(y_{1},y_{2})$, if $\overrightarrow{x}\preceq \overrightarrow{y}$, then, $x_{1}\le y_{1}$ and $ x_{2}\le y_{2}$.

By using Lemma 2.2 and Appendix, we get that, if $ ||P^n_{\varepsilon }|_{BV_0}||_1\le \alpha _2$, the following inequalities are satisfied:

$$\begin{aligned} {\left\{ \begin{array}{ll} \Vert P^{n_1 }f\Vert _{BV} \le \alpha ^{n_1 } \Vert f\Vert _{BV} + (\frac{B_0}{1-\alpha }) \Vert f\Vert _1 \\ \Vert P^{n_1 } f \Vert _1 \le \alpha _2 \Vert f \Vert _1 + \varepsilon M( (\frac{1+\alpha }{ 1-\alpha } ) \Vert f\Vert _{BV} + B_0{n_1 }(1+\alpha +M) \Vert f\Vert _1. \end{array}\right. } \end{aligned}$$

(4.4)

Using the inequalities above we have that:

$$\begin{aligned} \mathcal {M}= \begin{pmatrix} \alpha ^{n_1 } &{} B \\ \varepsilon M(\frac{1+\alpha }{1-\alpha } ) &{} \varepsilon M B_0{n_1 } (1+\alpha +M) + \alpha _2 \end{pmatrix} . \end{aligned}$$

Following the ideas of [16] we have that

$$\begin{aligned} \Vert P^{k n_1} g \Vert _{1} \le \frac{1}{b} \rho _*^{k} \Vert g \Vert _{BV}, \end{aligned}$$

(4.5)

where $\rho _*$ is the dominant eigenvalue of $\mathcal {M}$ and (a, b) is the corresponding left eigenvector.

Thus, our main task now is to identify all the entries of the above matrix. The first one is M, a bound on the $L^{1}$ norm of the iterates of P and $P_{\varepsilon }$; by definition, we have that $||P^{n}||\le 1$ and $||P_{\varepsilon }||_{1}\le 1$, therefore $M=1$. The two constants $\alpha _{2}$ and $n_{1}$ in $\mathcal {M}$ are two constants that give us an estimate of the speed at which $P_{\varepsilon }$ contracts the space $BV_{0}$. Let $P_{\varepsilon }$ be the discretized Ulam operator and fix $\alpha _{2}<1$; we want to find and $n_{1}\ge 0$ such that $\forall v\in BV_{0}$

$$\begin{aligned} \Vert P_{\varepsilon }^{n_{1}}v\Vert _{1}\le \alpha _{2}\Vert v\Vert _{1} \end{aligned}$$

(4.6)

with $\alpha _{2}<1$. We follow the idea of [15] and use the computer to estimate $n_{1}$ with a rigorous computation; we refer to their paper for the algorithm used to certify $n_{1}$ and the corresponding numerical estimates and methods. Consequently, (4.3) is satisfied with $n_1=28$ , $\alpha \le 0.66666667$, $B\le 1.444444445$, $\varepsilon =1/16384$, $M=1$, $ \alpha _2=1/64$; i.e.,

$$\begin{aligned} \mathcal {M}= \begin{pmatrix} 1.18\cdot 10^{-5} &{} 4.3333334 \\ 0.000306 &{} 0.022208 \end{pmatrix} . \end{aligned}$$

Thus, $\rho _*= 0.05$ and the eigenvector (a, b) associated to the eigenvalue $\rho _*$ is given by $a\in [0.006,0.007]$, $b\in [0.993,0.994]$.

Thus, by (4.5), we obtain

$$\begin{aligned} \Vert P^{28k}g \Vert _{L^1} \le (1.007) \times 0.05^{k} \Vert g \Vert _{BV} \end{aligned}$$

Consequently we can compute $l_*\ge 112$.

Remark 4.1

Using equation (4.5) and supposing $l_{*}=k\cdot n_{1}$ we see that, for any $\psi $ in $BV_{0}$:

$$\begin{aligned} \sum _{i=l_{*}}^{+\infty }||P^{i}(\psi )||_{1}\le ||\psi ||_{BV}\frac{1 }{b}\cdot n_{1}\sum _{i=k}^{+\infty }\rho _{*}^{i}\le ||\psi ||_{BV} \frac{1}{b}n_{1}\frac{\rho _{*}^{k}}{1-\rho _{*}}. \end{aligned}$$

4.3 Item (2) of Algorithm 3.1

From now on $l_*$ is fixed and it is equal to 112. So far, we executed the first loop of the Algorithm 3.1; i.e.,

$$\begin{aligned} 4 \Vert \psi \Vert _\infty \sum _{i=112}^\infty \Vert P^i (\hat{\psi })\Vert _{1} \le \frac{ \tau }{256}. \end{aligned}$$

Remark 4.2

Note in our computation above we have obtained $l_*$ such that

$$\begin{aligned} 4||\psi ||_{\infty }\sum _{i=l^{*}}^{+\infty }||P^{i}((\hat{\psi }\cdot h))||_{1}\le \frac{0.01 }{256}\le \frac{\tau }{256}. \end{aligned}$$

4.4 Item (3) of Algorithm 3.1

In this step, we have to find $\varepsilon _*$, a mesh size of the Ulam discretization, such that

$$\begin{aligned}&(16(l_*-1)+8) \cdot \Vert \psi \Vert _\infty ^2 \cdot \Vert h_{\varepsilon _*}-h\Vert _1 \nonumber \\&\qquad +\,4\Vert \psi \Vert _\infty \sum \limits _{i=1}^{l_*-1} \sum \limits _{j=0}^{i-1} \bigg (2\Vert \psi \Vert _\infty (B_j +1 + \frac{\alpha ^j B_0}{1-\alpha }) + \frac{\alpha ^j (B_0 +1-\alpha )}{1-\alpha } V \psi \bigg ) ||| P_{\varepsilon _*}-P||| \nonumber \\&\quad \le \frac{255}{256}\tau . \end{aligned}$$

(4.7)

To bound this term we need a rigorous approximation of the T-invariant density h, in the $L^{1}$-norm; we follow the ideas (and refer to the algorithm) of [15]. Set:

$$\begin{aligned} \kappa :=4\Vert \psi \Vert _{\infty }|||P_{\varepsilon _{*}}-P|||\sum _{i=1}^{l_{*}-1}\sum _{j=0}^{i-1}\bigg (2\Vert \psi \Vert _{\infty }(B_{j}+1+\frac{\alpha ^{j}B_{0}}{1-\alpha })+\frac{\alpha ^{j}(B_{0}+1-\alpha )}{1-\alpha }V\psi \bigg ). \end{aligned}$$

(4.8)

The following table contains, for different mesh sizes $\varepsilon $, error bounds for the terms in equation (4.7); in particular a bound on $\kappa $ defined in (4.8):

$\varepsilon $	$2^{-12}$	$2^{-24}$	$2^{-25}$
$\Vert h_{\varepsilon _{*}}-h\Vert _{1}\le $	0.016	$3.2\cdot 10^{-5}$	$1.7\cdot 10^{-5}$
$(16(l_{}-1)+8)\cdot \Vert \psi \Vert _{\infty }^{2}\cdot \Vert h_{\varepsilon _{}}-h\Vert _{1}\le $	28.55	0.0571	0.0304
$\kappa \le $	8.08	0.0079	0.00395.

4.5 Item (4) in Algorithm 3.1

$$\begin{aligned} |\sigma _{\varepsilon _{*},l_{*}}^{2}-\sigma ^{2}|\le 0.01/256+(0.0304+0.00395)\cdot 255/256\le 0.0342, \end{aligned}$$

and we compute $\sigma _{\varepsilon _{*},l_{*}}^{2}$

$$\begin{aligned} \sigma _{\varepsilon _{*},l_{*}}^{2}:=\int _{I}\hat{\psi } _{\varepsilon _{*}}^{2}h_{\varepsilon _{*}}dm+2\sum _{i=1}^{l_{*}-1}\int _{I}P_{\varepsilon _{*}}^{i}(\hat{\psi }_{\varepsilon _{*}}h_{\varepsilon _{*}})\hat{\psi }_{\varepsilon _{*}}dm\in [0.38,0.381]. \end{aligned}$$

Remark 4.3

The code implementing rigorous computation of diffusion coefficients for piecewise uniformly expanding maps is avalaible at the research section of the following personal page:

http://www.im.ufrj.br/nisoli/

4.6 A Non Rigorous Verification

We also perform a non-rigorous experiment to compute $\sigma ^2$ in the above example. Let $\mathcal {F}_\zeta $ be the set of floating point numbers in [0, 1] with $\zeta $ binary digits.

Note that the system has high entropy, so we have to be careful in our computation and choose $\zeta $ big. Due to high expansion of the system, in few iterations the ergodic average along the simulated orbit may have little in common with the orbit of the real system. So, we have to do computations with a really high number of digits ($\zeta =1024$ binary digits).

Let $\{x_0,\ldots ,x_{n-1}\}$ be n random floating points in $\mathcal {F}_l$; fix k and for each $i=0,\ldots , n-1$ let

$$\begin{aligned} A_k(x_i)=\frac{1}{k}\sum _{j=0}^{k-1} \phi (T^j(x_i)). \end{aligned}$$

Let $\mu $ be an approximation of the average of $\phi $ with respect to the invariant measure, obtained by integrating the observable using the approximation of the invariant density:

$$\begin{aligned} \mu =[0.383,0.384]. \end{aligned}$$

Now, for each point $\{x_0,\ldots ,x_{n-1}\}$ we compute the value $A_k(x_0),\ldots ,A_k(x_{n-1})$ and from these we compute the following two estimators:

$$\begin{aligned} \tilde{\mu }&=\frac{1}{n}\sum _{i=0}^{n-1} A_k(x_i)\\ \tilde{\sigma }^2&=\frac{1}{n} \sum _{i=0}^{n-1} \frac{(k\cdot A_k(x_i)-k\mu )^2}{k}. \end{aligned}$$

The estimator $\tilde{\mu }$ gives a non-rigorous estimate for the average of the observable with respect to the invariant measure, while the estimator $\tilde{\sigma }^2$ is an estimator for the diffusion coefficient.

The table below shows the outcome of the experiment with $n = 20,000$. In Fig. 2, a histogram plot of the distribution of $A_k(x_i)$ for $ k=10 $, $k = 200$, $n=20,000$. In red we have the normal distribution with average $\mu $ and variance $\sigma _{\varepsilon _*,l_*}^2/\sqrt{k}$.

k	$\tilde{\mu }$	$\tilde{\sigma }^2$
90	[0.383, 0.384]	[0.361, 0.362]
95	[0.383, 0.384]	[0.362, 0.363]
100	[0.383, 0.384]	[0.362, 0.363]

The output of this non-rigourous experiment is in line with the output from our rigorous computation in Sect. 4.5.

Notes

Part (a) of Theorem 1.1 is well know, see for instance [12]. Sect. 4 contains the application of our method to the Lanford map, which proves Theorem 1.1.
Computer experiments on the orbit structure of this map were performed by Lanford III in [21], and since then it is known as Lanford’s map.
In our work, we do not differentiate between maps with finite number of branches [22] or countable (infinite) number of branches [31]. All that we need is a setting where assumptions (A1) and (A2) are satisfied. In fact, using these assumptions, this work can be extended to the multidimensional case [24] by taking care of the dimension [25] and by working with appropriate observables since the space of functions of bounded variations in higher dimension is not contained in $L^{\infty }$.
It is well known that the systems under consideration satisfy a Lasota-Yorke inequality. What we are assuming in (A1) is that there is no constant in front of $\alpha $. Such an assumption is satisfied for instance when $\inf _{x}|T'(x)|>2$ or when T is piecewise onto. When the original map T does not satisfy the assumption (A1), one can find an iterate of T where (A1) is satisfied, and then apply the results of this paper.
Of course, usual computer software would give an estimated matrix of $(I-P_\varepsilon )^{-1}$, but it does not give the errors it made in its approximation.
See [37] for this result and [17] for a more general result.
Although T has countable (infinite) number of branches, one can still implement the approximation on a computer. One way to do so is as follows: first one may perform an intermediate step by considering a map $\tilde{T}$ identical to T on $I\setminus H$, such that $\tilde{T}$ has finite number of branches on $I\setminus H$ while on H it has, say, one expanding linear branch, with $m(H)\le \delta $ and $\frac{\delta }{\tau }$ is sufficiently small. The diffusion coefficients of T and $\tilde{T}$ can be made arbitrarily close using the result of [20], and then one can apply Ulam’s method and Theorem 2.3 to $\tilde{T}$.
There are many ways to approximate (III). In the implementation in Sect. 4 we follow the work of [16] to estimate (III).

References

Bahsoun, W.: Rigorous numerical approximation of escape rates. Nonlinearity 19(11), 2529–2542 (2006)
Article ADS MathSciNet MATH Google Scholar
Bahsoun, W., Bose, C.: Invariant densities and escape rates: rigorous and computable approximations in the $L^{\infty }$ -norm. Nonlinear Anal. 74, 4481–4495 (2011)
Article MathSciNet MATH Google Scholar
Bahsoun, W., Bose, C., Duan, Y.: Rigorous Pointwise approximations for invariant densities of nonuniformly expanding maps. Ergod. Theory Dyn. Syst. 35(4), 1028–1044 (2015)
Article MathSciNet MATH Google Scholar
Baladi, V.: Positive Transfer Operators and Decay of Correlations. Advanced Series in Nonlinear Dynamics, vol. 16. World Scientific Publishing, River Edge (2000)
MATH Google Scholar
Bose, C., Murray, R.: The exact rate of approximation in Ulam’s method. Discret. Contin. Dyn. Syst. 7(1), 219–235 (2001)
MathSciNet MATH Google Scholar
Boyarsky, A., Góra, P.: Laws of Chaos. nvariant Measures and Dynamical Systems in One Dimension. Birkhäuser, Canton (1997)
MATH Google Scholar
Dellnitz, M., Froyland, G., Horenkamp, C., Padberg-Gehle, K., Gupta, A.S.: Seasonal variability of the subpolar gyres in the SouthernOcean: a numerical investigation based on transfer operators. Nonlinear Process. Geophys. 16, 655–664 (2009)
Article ADS Google Scholar
Dolgopyat, D.: Limit theorems for partially hyperbolic systems. Trans. Am. Math. Soc. 356(4), 1637–1689 (2004)
Article MathSciNet MATH Google Scholar
Froyland, G.: Finite approximation of Sinai-Bowen-Ruelle measures for Anosov systems in two dimensions. Random Comput. Dyn. 3(4), 251–263 (1995)
MathSciNet MATH Google Scholar
Froyland, G.: Using Ulam’s method to calculate entropy and other dynamical invariants. Nonlinearity 12(1), 79–101 (1999)
Article ADS MathSciNet MATH Google Scholar
Higham. : Accuracy and Stability of Numerical Algorithms, 2nd edition (2002) SIAM publishing, Philadela (PA), US, ISBN 0-89871-521-0
Hofbauer, F., Keller, G.: Ergodic properties of invariant measures for piecewise monotonic transformations. Math. Z. 180(1), 119–40 (1982)
Article MathSciNet MATH Google Scholar
Holland, M., Melbourne, I.: Central limit theorems and invariance principles for Lorenz attractors. J. Lond. Math. Soc. 76(2), 345–364 (2007)
Article MathSciNet MATH Google Scholar
Galatolo, S., Nisoli, I., Rigorous computation of invariant measures and fractal dimension for piecewise hyperbolic maps: 2D Lorenz like maps. Ergod. Theory Dyn. Syst. (2015).doi:10.1017/etds.2014.145
Galatolo, S., Nisoli, I.: An elementary approach to rigorous approximation of invariant measures. SIAM J. Appl. Dyn. Syst. 13(2), 958–985 (2014)
Article MathSciNet MATH Google Scholar
Galatolo, S., Nisoli, I., Saussol, S.: An elementary way to rigorously estimate convergence to equilibrium and escape rates. J. Comput. Dyn. 2(1), 51–64 (2015)
Article MathSciNet MATH Google Scholar
Gouëzel, S.: Central limit theorem and stable laws for intermittent maps. Probab. Theory Relat. Fields 128(1), 82–122 (2004)
Article MathSciNet MATH Google Scholar
Jenkinson, O., Pollicott, M.: Orthonormal expansions of invariant densities for expanding maps. Adv. Math. 192, 1–34 (2005)
Article MathSciNet MATH Google Scholar
Keller, G.: Stability of the spectrum for transfer operators. Ann. Scuola Norm. Sup. Pisa Cl. Sci. 28(1), 141–152 (1999)
MathSciNet MATH Google Scholar
Keller, G., Howard, P., Klages, R.: Continuity properties of transport coefficients in simple maps. Nonlinearity 21(8), 1719–1743 (2008)
Article ADS MathSciNet MATH Google Scholar
Lanford III, O.E.: Informal remarks on the orbit structure of discrete approximations to chaotic maps. Exp. Math. 7(4), 317–324 (1998)
Article MathSciNet MATH Google Scholar
Lasota, A., Yorke, J.A.: On the existence of invariant measures for piecewise monotonic transformations. Trans. Am. Math. Soc. 186, 481–488 (1973)
Article MathSciNet MATH Google Scholar
Liverani, C., Central limit theorem for deterministic systems. In: Ledrappier, F., Levovicz, J., Newhouse, S. (eds) International Conference on Dynamical Systems, Montevideo 1995, a tribute to Ricardo Mane, Pitman Research Notes in Mathematics Series, vol. 362 (1996)
Liverani, C.: Multidimensional expanding maps with singularities: a pedestrian approach. Ergod. Theory Dyn. Syst. 33(1), 168–182 (2013)
Article MathSciNet MATH Google Scholar
Liverani, C.: Rigorous numerical investigation of the statistical properties of piecewise expanding maps: A feasibility study. Nonlinearity 14(3), 463–490 (2001)
Article ADS MathSciNet MATH Google Scholar
Liverani, C.: Decay of correlations for piecewise expanding maps. J. Stat. Phys. 78(3–4), 1111–1129 (1995)
Article ADS MathSciNet MATH Google Scholar
Liverani, C., Saussol, B., Vaienti, S.: A probabilistic approach to intermittency. Ergod. Theory Dyn. Syst. 19, 671–685 (1999)
Article MathSciNet MATH Google Scholar
Melbourne, I., Nicol, M.: Large deviations for nonuniformly hyperbolic systems. Trans. Am. Math. Soc. 360(12), 6661–6676 (2008)
Article MathSciNet MATH Google Scholar
Murray, R.: Existence, mixing and approximation of invariant densities for expanding maps on $R^r$. Nonlinear Anal. 45(1), 37–72 (2001)
Article MathSciNet MATH Google Scholar
Murray, R.: Ulam’s method for some non-unoformly expanding maps. Discrete. Contin. Dyn. Syst. 26(3), 1007–1018 (2010)
Article MathSciNet MATH Google Scholar
Pianigiani, G.: First return map and invariant measures. Israel J. Math. 35, 32–48 (1980)
Article MathSciNet MATH Google Scholar
Pollicott, M., Estimating variance for expanding maps. http://homepages.warwick.ac.uk/masdbl/preprints
Pomeau, Y., Manneville, P.: Intermittent transition to turbulence in dissipative dynamical systems. Commun. Math. Phys. 74(2), 189–197 (1980)
Article ADS MathSciNet Google Scholar
Santitissadeekorn, N., Froyland, G., Monahan, A.: Phys. Rev. E 82, 056311 (2010)
Article ADS Google Scholar
Tucker, W.: Auto-Validating Numerical Methods (Frontiers in Mathematics). Birkhäuser, Canton (2010)
Google Scholar
Ulam, S.M.: A Collection of Mathematical Problems (Interscience Tracts in Pure and Applied Mathematics, vol. 8. Interscience, New York (1960)
Google Scholar
Young, L.-S.: Recurrence times and rates of mixing. Israel J. Math. 110, 153–188 (1999)
Article MathSciNet MATH Google Scholar

Download references

Acknowledgments

WB and SG would like to thank The Leverhulme Trust for supporting mutual research visits through the Network Grant IN-2014-021. SG thanks the Department of Mathematical Sciences at Loughborough University for hospitality. WB thanks Dipartimento di Matematica, Universita di Pisa. The research of SG and IN is partially supported by EU Marie-Curie IRSES “Brazilian-European partnership in Dynamical Systems” (FP7-PEOPLE-2012-IRSES 318999 BREUDS).

Author information

Authors and Affiliations

Department of Mathematical Sciences, Loughborough University, Loughborough, Leicestershire, LE11 3TU, UK
Wael Bahsoun & Xiaolong Niu
Dipartimento di Matematica, Università di Pisa, Largo Pontecorvo, Pisa, Italy
Stefano Galatolo
Instituto de Matematica - UFRJ Av. Athos da Silveira Ramos 149, Centro de Tecnologia - Bloco C Cidade Universitaria - Ilha do Fundão., Caixa Postal 68530, Rio de Janeiro, RJ, 21941-909, Brazil
Isaia Nisoli

Authors

Wael Bahsoun
View author publications
You can also search for this author in PubMed Google Scholar
Stefano Galatolo
View author publications
You can also search for this author in PubMed Google Scholar
Isaia Nisoli
View author publications
You can also search for this author in PubMed Google Scholar
Xiaolong Niu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Wael Bahsoun.

Appendix: Proof of Equation 4.4

Lemma 5.1

$$\begin{aligned} \Vert (P^n- P^n_\varepsilon ) f \Vert _1 \le \varepsilon ( \left( \frac{1+\alpha }{1-\alpha } \right) \Vert f\Vert _{BV} + B_0n(2+\alpha ) \Vert f\Vert _1. \end{aligned}$$

Proof

$$\begin{aligned}&\Vert \Pi _\varepsilon \Vert _1 = \frac{\Vert \frac{1}{\lambda (I_k)} \int _{I_k} f d \lambda \Vert _1}{\Vert f\Vert _1}\le 1.\\&\Vert P^n\Vert _1 =\Vert P_{\varepsilon }^n\Vert =1. \end{aligned}$$

$$\begin{aligned} \Vert (P- P_\varepsilon ) f \Vert _1 \le \Vert \Pi _\varepsilon P \Pi _\varepsilon f - \Pi _\varepsilon Pf \Vert _1 + \Vert \Pi _\varepsilon Pf -Pf\Vert _1= & {} \Vert \Pi _\varepsilon P( \Pi _\varepsilon f-f)\Vert _1\\&+ \,\Vert \Pi _\varepsilon Pf -Pf\Vert _1. \end{aligned}$$

$$\begin{aligned} \displaystyle \Vert \Pi _\varepsilon P( \Pi _\varepsilon f-f)\Vert _1\le & {} \Vert \Pi _\varepsilon f-f\Vert _1 \le \varepsilon V(f)\le \varepsilon \Vert f\Vert _{BV};\\ \displaystyle \Vert \Pi _\varepsilon Pf -Pf\Vert _1\le & {} \varepsilon \Vert Pf\Vert _{BV} \le \varepsilon ( \alpha \Vert f\Vert _{BV} + B_0\Vert f\Vert _1).\\ \displaystyle \Vert (P- P_\varepsilon ) f \Vert _1\le & {} \varepsilon \Vert f\Vert _{BV} + \varepsilon ( \alpha \Vert f\Vert _{BV} + B_0\Vert f\Vert _1)\le \varepsilon ((1+\alpha ) \Vert f\Vert _{BV} + B_0\Vert f\Vert _1).\\ \displaystyle \Vert (P^n- P^n_\varepsilon ) f \Vert _1\le & {} \sum _{k=1}^n \Vert P^{n-k}_\varepsilon (P- P_\varepsilon ) P^{k-1} f \Vert _1\le \Vert (P- P_\varepsilon ) P^{k-1} f \Vert _1\\ \displaystyle\le & {} \varepsilon \sum _{k=1}^n ((1+\alpha ) \Vert P^{k-1} f\Vert _{BV} + B_0\Vert P^{k-1} f\Vert _1)\\ \displaystyle\le & {} \varepsilon \sum _{k=1}^n \left( (1+\alpha ) ( \alpha ^{k-1} \Vert f\Vert _{BV} + \left( \frac{B_0}{1-\alpha }\right) \Vert f\Vert _1) + B_0\Vert f\Vert _1\right) \\ \displaystyle\le & {} \varepsilon ( \left( \frac{1+\alpha }{1-\alpha } ) \Vert f\Vert _{BV} + B_0n(2+\alpha \right) \Vert f\Vert _1. \end{aligned}$$

$\square $

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Bahsoun, W., Galatolo, S., Nisoli, I. et al. Rigorous Approximation of Diffusion Coefficients for Expanding Maps. J Stat Phys 163, 1486–1503 (2016). https://doi.org/10.1007/s10955-016-1523-y

Download citation

Received: 15 February 2016
Accepted: 11 April 2016
Published: 26 April 2016
Issue Date: June 2016
DOI: https://doi.org/10.1007/s10955-016-1523-y

Keywords

Mathematics Subject Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

\(\varepsilon \)	\(2^{-12}\)	\(2^{-24}\)	\(2^{-25}\)
\(\Vert h_{\varepsilon _{*}}-h\Vert _{1}\le \)	0.016	\(3.2\cdot 10^{-5}\)	\(1.7\cdot 10^{-5}\)
\((16(l_{}-1)+8)\cdot \Vert \psi \Vert _{\infty }^{2}\cdot \Vert h_{\varepsilon _{}}-h\Vert _{1}\le \)	28.55	0.0571	0.0304
\(\kappa \le \)	8.08	0.0079	0.00395.

Rigorous Approximation of Diffusion Coefficients for Expanding Maps

Abstract

Similar content being viewed by others

Rigorous Computation of Diffusion Coefficients for Expanding Maps

Accelerating solutions to diffusion equation

Diffusion Means and Heat Kernel on Manifolds

1 Introduction

Theorem 1.1

2 The Setting

2.1 The System and Its Transfer Operator

2.2 Assumptions

Remark 2.1

2.3 The Problem

2.4 Ulam’s Scheme

Lemma 2.2

2.5 Statement of the General Result

Theorem 2.3

Remark 2.4

Theorem 2.5

Remark 2.6

Remark 2.7

Remark 2.8

2.6 Approximating the Diffusion Coefficient for Non-uniformly Expanding Maps

3 Proofs and an Algorithm

Lemma 3.1

Proof

Lemma 3.2

Proof

Proof

3.1 Algorithm

Remark 3.3

Proof

4 Implementation of the Algorithm and Estimating the Diffusion Coefficient for Lanford’s Map

4.1 Rigorous Projections on the Ulam Basis

4.2 Item (1) in Algorithm 3.1

Remark 4.1

4.3 Item (2) of Algorithm 3.1

Remark 4.2

4.4 Item (3) of Algorithm 3.1

4.5 Item (4) in Algorithm 3.1

Remark 4.3

4.6 A Non Rigorous Verification

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendix: Proof of Equation 4.4

Appendix: Proof of Equation 4.4

Lemma 5.1

Proof

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation