Abstract
We present a semigroup-theoretic approach to an example being one of the main motivations of the famous Freidlin–Wentzell averaging principle. In this example, as a result of a singular perturbation of the semigroups involved, the generator of the limit semigroup is related to a transmission condition emerging in a place where no transmission conditions were needed before.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
The theory of one-parameter semigroups of operators, of beauty and importance of its own, is deeply rooted in the theories of partial differential equations and stochastic processes [13–15, 23, 24, 32]. Although there are good textbooks on semigroups which successfully avoid the notion of a stochastic process (see e.g. [14]), it it is quite impossible to imagine the early stages of the theory without the impetus it has gained from the giants like Feller [16–18]. In fact, this is not surprising at all, since the very core of both theories lies in understanding dynamics, be it stochastic or deterministic. For example, it has been realized quite early that the main theorems of probability (e.g. the Central Limit Theorem), may be proved in an elegant way using approximation theorems for semigroups [4, 15, 22]. On the other hand, it transpires that if seen from a proper perspective, exponential formulae of the semigroup theory become special instances of limit theorems of probability theory [11].
The theory of semigroups of operators and that of stochastic processes, continue to influence each other with mutual benefit. Modern books on Markov processes (like the old ones, see e.g. [13, 18]) treat semigroups, or at least Feller semigroups, extensively and take advantage of the core semigroup-theoretical results [15, 25]; in particular, since the publication of the treatise [15] it has become evident that convergence of stochastic processes may be efficiently treated with the help of Trotter–Kato-type theorems. On the other hand, phenomena encountered in stochastic processes do not cease to provide new and interesting challenges for semigroup-theorists.
This article is devoted to a semigroup-theoretical approach to a motivating example of the famous averaging principle of Freidlin and Wentzell [19, 21] (see also [20])—although the principle is quite well-known in probabilistic circles, its semigroup-theoretical treatment seems to be still missing. Such a treatment is needed for a clear picture of singular perturbations involving perturbation of boundary and transmission conditions. To explain, the approximation procedure considered here is a close relative of those of [2, 5, 9, 10]. In [5, 9, 10], as a result of stochastic averaging the limit semigroup is generated by a convex combination of generators of original semigroups. In [2] it is the domain of the generator of the limit semigroup that is perturbed: this domain is the kernel of a convex combination of operators describing domains of generators of original semigroups. In other words, the boundary or transmission conditions are changed in the limit, as a result of averaging. Here, in this paper, the averaging process leads to another, related, phenomenon: it forces emergence of tranmission conditions at a point where they were not needed before.
2 Semigroup-theoretical tools
The main semigroup-theoretic devices we need are (a) the semigroups generated by operators related to quadratic forms in Hilbert spaces [12, 26, 27] (a theory that has been intensively developing in recent years, see [1, 3, 31] and the references cited there), and (b) Kurtz’s singular perturbation theorem [15, 28, 29].
To recall, suppose a sesquilinear form \(\mathfrak {a}\) in a Hilbert space \({\mathbb {X}}\) is sectorial and closed, and that its domain \(V=V_{\mathfrak {a}}\) is dense in \({\mathbb {X}}\). Then, there exists a linear, closed operator \(A\) with domain \(D(A)\) satisfying the following conditions:
-
(a)
\(D(A)\subset V_{\mathfrak {a}} \) and for \(x \in D(A), y \in V_{\mathfrak {a}}\),
$$\begin{aligned} \mathfrak {a}(x,y)= - (Ax, y), \end{aligned}$$ -
(b)
if \(x \in V_{\mathfrak {a}} \) and there is a \(z\in X\) such that \( \mathfrak {a}(x,y) = - (z,y)\) for all \(y \in V_{\mathfrak {a}}\), then \(x\in D(A)\) and \(Ax=z.\)
Using the Lax–Milgram lemma it may be shown that the range of \(\lambda - A\) is the whole of \({\mathbb {X}}\) provided \(\mathfrak {R}\lambda >\gamma \) where \(\gamma \) is the vertex of the sector
Also, \(A\) is uniquely determined by the conditions given above. Moreover, \(A\) generates a holomorphic semigroup in \({\mathbb {X}}.\)
Turning to Kurtz’s singular perturbation theorem [15, 28, 29], let \(\left( \epsilon _n \right) _{n\ge 1} \) be a sequence of positive numbers converging to 0. Suppose \(A_n , n \ge 1\) are generators of equi-bounded semigroups \(\{\mathrm {e}^{tA_n}, t \ge 0\}\) in a Banach space \({\mathbb {X}}\), and \(Q\) generates a strongly continuous semigroup \(\left( \mathrm {e}^{t{Q}}\right) _{t \ge 0}\) such that
exists. Then \(P\) is a bounded idempotent operator and
Let
Theorem 2.1
(Kurtz’s theorem) Let \(A\) be an operator in \({\mathbb {X}}\), \(D\) be a subset of its domain, and assume that
-
(a)
for \(x \in D\), there exist \(x_n \in D(A_n)\) such that \(\lim _{n\rightarrow \infty }x_n = x\) and \(\lim _{n\rightarrow \infty }A_nx_n = Ax\),
-
(b)
for \(y \) in a core \(D'\) of \(Q\), there exist \(y_n \in D(A_n)\) such that \(\lim _{n\rightarrow \infty }y_n = y\) and \(\lim _{n\rightarrow \infty }\epsilon _n A_n y_n = Qy\),
-
(c)
the operator \(PA \) with domain \(D \cap {\mathbb {X}}' \) is closable and its closure \(\overline{PA}\) generates a strongly continuous semigroup in \({\mathbb {X}}'.\)
Then
For \(x \in {\mathbb {X}}'\) the same is true for \(t=0\) as well, and the limit is almost uniform in \(t\in [0,\infty );\) for other \(x \) the limit is almost uniform in \(t \in (0,\infty ).\)
3 The example
Let us present the details of the example this paper is devoted to. To repeat: this example was one of the main motivations for the averaging principle of Freidlin and Wentzell [19, 21] (a semigroup theoretical treatment of another motivation for this principle is discussed in the companion article [8]). Imagine \(N\) diffusions (with different diffusion and drift coefficients) on \(N\) copies of an interval with Neumann boundary conditions at the ends, see Fig. 1. Suppose also that, as in e.g. [5, 9, 10], these diffusions are coupled by Markov chains: while at the \(i\)th copy, the process behaves according to the rules governed by operator
but after a random time depending on its position, may jump to another copy of the interval to behave according to the rules described by the operator defined there. In distinction to the situation of [5, 9, 10] we assume, however, that on the left part of the interval no communication is possible: the intensities of jumps are zero here.
Freidlin and Wentzell’s result says that [19], Theorem 5.1] as the intensities of jumps (in the right part of the interval) tend to infinity, the processes involved converge weakly to a diffusion on a graph formed by identifying corresponding points of all the right-parts of the intervals, see Fig. 2. The generator of the limit process is a convex combination of the generators of the involved diffusions—a phenomenon thoroughly studied in [5, 9, 10]. However, a new phenomenon is observed here as well: at the junction \(x^*\) where the intervals meet, transmission conditions need to be introduced. They are of the form:
where \(\pi _i (x^*), i =1, \ldots ,N\) are probabilities of the equilibrium state of the Markov chain at \(x^*\), \(f_{i,-}'(x^*)\) is the left-hand derivative of \(f\) at \(x^*\) calculated on the \(i\)th interval, and \(f_{+}' (x^*) \) is the right-hand derivative of \(f\) at \(x^*\) calculated on the edge formed by amalgamating right parts of the original intervals.
4 Analysis in \(L^2\)
In what follows we drop secondary features of the example to focus on the reason for emergence of transmission condition (3.1) and its unique form. More specifically, we consider the case of two intervals (i.e. we take \(N=2\)), and assume that diffusion coefficients are constant throughout these intervals and there is no drift at all, so that \(a_i(x) = a_i >0\) and \(b_i (x) =0 \) for \(x\in [0,1]\), and some constants \(a_i>0.\) (The effect of convex combination has been studied in [5, 9] for processes much more general than diffusion processes, so we would gain no generality by introducing variable coefficients here.) Moreover, we assume that for some \(x^*\in (0,1)\), the Kolmogorov matrix does not depend on \(x \in [x^*,1]\) and equals
where \(\alpha , \beta > 0\) are given constants (so that the stationary distribution is \((\frac{\alpha }{\alpha + \beta }, \frac{\beta }{\alpha + \beta }))\); for \(x \in [0,x^*)\), the Kolmogorov matrix is 0.
Since the operator related to such a choice of Kolmogorov matrices (see below, Eq. (4.2)) does not leave the space of continuous functions invariant (the original Friedlin–Wentzell theorem [19] concerns a weak convergence, with test functions being continuous), we will work in the Hilbert space \({\mathbb {H}}\) of pairs \(f= (f_1,f_2)\) of square integrable functions on the unit interval, equipped with the scalar product
For \(\kappa >0\), we define a sesquilinear form
where for \(f= (f_1,f_2)\) and \(g= (g_1,g_2) \),
while
Here, the domain \(V\) of the form \(\mathfrak {a}\) is the space of pairs \(f=(f_1,f_2)\) where \(f_i \in H^1[0,1]\), i.e. \(f_i\) are absolutely continuous with square integrable derivatives \(f_i' \in L^2[0,1]\), while the domain of \({\mathfrak {q}}\) is the whole of \({\mathbb {H}}\).
It is clear that both \(\mathfrak {a}\) and \({\mathfrak {q}}\) are symmetric, and
A direct calculation shows that the operator related to \(\mathfrak {a}\) (see Sect. 2) is
with domain composed of pairs \((f_1,f_2) \) where \(f_i \in H^2[0,1]\) (i.e., \(f_i\) are continuously differentiable functions \(f_i \in L^2[0,1]\) with \(f'_i \in H^1[0,1]\)) satisfying \(f_i'(0)=f_i'(1)=0.\) Moreover, the operator related to \({\mathfrak {q}}\) is given by
where \(1_{[x^*,1]}\) is the indicator function of the interval \([x^*,1]\). It follows that both \(A\) and \(Q\) generate semigroups of self-adjoint contraction operators, and the same is true for the operators \(A_\kappa = A + \kappa Q\) related to forms \(\mathfrak {a}_{\kappa }.\)
The problem is that of finding the limit
To this end, we introduce \({\mathbb {H}}_0\) as the subspace of \({\mathbb {H}}\) composed of \((f_1,f_2)\) such that \((f_1- f_2)1_{[x^*,1]} =0\); such pairs may be identified with square integrable functions on a \(Y\)-shaped graph obtained by removing the middle segment in the left-hand part in Fig. 2.
Let \(V_0 = {\mathbb {H}}_0 \cap V\) and let
by definition of \({\mathbb {H}}_0\), \(f_1\) and \(g_1\) in the second integral may be replaced by \(f_2\) and \(g_2\), respectively, without altering \(\mathfrak {b}\). Again,
and \(\mathfrak {b}\) is symmetric. Integration by parts shows that the related operator \(B\) in \({\mathbb {H}}_0\) is given by
where \( \chi _i = a_i 1_{[0,x^*)} + (\frac{\alpha }{\alpha + \beta } a_1 + \frac{\beta }{\alpha + \beta } a_2)1_{[x^*,1]}. \) Its domain is composed of pairs \((f_1,f_2)\) such that
-
(A)
\(f_i\) are continuous on \([0,1]\), and continuously differentiable on \([0,x^*]\) and \([x^*,1]\) separately (one sided derivatives at \(x^*\) may differ),
-
(B)
\((f_i)'_{|[0,x^*]} \in H^1[0,x^*]\) and \((f_i)'_{|[x^*,1]} \in H^1[x^*,1]\),
-
(C)
we have \(f_i'(1)=f_i'(0)=0\) and
$$\begin{aligned} (\alpha a_1 + \beta a_2)f_{1,+}' (x^*) = \alpha a_1 f_{1,-}' (x^*) + \beta a_2 f_{2,-}(x^*) , \end{aligned}$$(4.3)where \(+\) and \(-\) denote the right-sided and left-sided derivatives, respectively. (Again, \(f_{1,+}' (x^*)\) on the left-hand side may be replaced by \(f_{2,+}' (x^*)\).)
Certainly, (4.3) is a counterpart of (3.1): these conditions inform of a flux balance; we stress that along with this transmission condition, the continuity condition at \(x^*\) is tacitly assumed (as implied by (A) above).
Theorem 4.1
We have
strongly and almost uniformly in \(t >0\), where projection \(P \in \mathcal L ({\mathbb {H}})\) is given by \(P(f_1,f_2) = (g_1,g_2)\),
To prove this result, we need Simon’s theorem [33] saying roughly that convergence of positive symmetric forms implies convergence of resolvents of the related operators. A version of this theorem, due to Kato, presented in the classic treatise [26] is slightly weaker as it assumes that the limit ‘upper bound’ form is densely defined (which is not the case in the example we study). We note that Simon’s theorem has recently been generalized by Batty and ter Elst to the case of series of sectorial forms [3].
Theorem 4.2
(Simon’s theorem) Let \(0\le {\mathfrak {c}}_1\le {\mathfrak {c}}_2\le \dots \) be positive symmetric closed sesquilinear forms in a Hilbert space \({\mathbb {H}}\). Define the form \({\mathfrak {c}}\) by
and \({\mathfrak {c}}(x,y) = \lim _{n\rightarrow \infty }{\mathfrak {c}}_n(x,y).\) Then, \({\mathfrak {c}}\) is closed, positive and symmetric. Moreover, denoting by \(C\) the closed operator related to \({\mathfrak {c}}\), defined in \({\mathbb {H}}_0 = \overline{D({\mathfrak {c}})}\), and by \(C_n\) the closed operators related to \({\mathfrak {c}}_n\) defined in \({\mathbb {H}}\), we have
Proof of Theorem 4.1
Since \({\mathfrak {q}} \ge 0\), the forms \(\mathfrak {a}_\kappa \) increase with \(\kappa \), so that we are in the set-up of Simon’s theorem. It is clear that \(\sup _{\kappa >0} \mathfrak {a}_\kappa [f] < \infty \) iff \({\mathfrak {q}} [f] =0,\) i.e. iff \(f \in {\mathbb {H}}_0.\) Therefore, the limit form coincides with \(\mathfrak {a}\) restricted to \(V_0\) and this equals \(\mathfrak {b}\). By Simon’s theorem it follows that
Given \(f \in D(B)\) and \(\lambda >0\), we may find \(g \in {\mathbb {H}}_0\) such that \(f = \left( \lambda - B\right) ^{-1}g\). Taking \(f_\kappa = \left( \lambda - A_\kappa \right) ^{-1}\) we see that \(\lim _{\kappa \rightarrow \infty }f_\kappa = f\) and, using the resolvent equation, \(\lim _{\kappa \rightarrow \infty }A_\kappa f_\kappa = Bf.\) Hence, condition (a) of the Kurtz singular perturbation theorem is satisfied with \(D= D(B).\)
We check the remaining conditions. For all \(f \in D(A)\), \(\lim _{\kappa \rightarrow \infty }\kappa ^{-1} A_\kappa f = Qf\) while \(D(A)\) is dense in \({\mathbb {H}}\) and \(Q\) is bounded, proving condition (b). Next, \(Q\) and \(P\) are related by the following formula:
It follows that \(\lim _{t \rightarrow \infty } \mathrm {e}^{tQ} = P\). Finally, for \(f \in D(B)\) we have
In particular, \(PA\) is is closed, proving assumption (c) in Kurtz’s theorem and completing the proof. \(\square \)
5 Analysis in \(L^1\)
The approach of the previous chapter is quite elegant: the quadratic forms contain all the information needed for the limit theorem. The arguable elegance, however, comes perhaps at the cost of blurring the mechanism of emergence of transmission conditions in the limit. To explain: the information about these conditions is compressed, or, so to say, ‘zipped’ in the quadratic form \(\mathfrak {b}\) corresponding to the operator \(B\). While it seen without a shadow of a doubt that the limit form cannot be anything other than \(\mathfrak {b}\), from the perspective of forms it is still somewhat difficult to grasp the way transmission conditions come into existence. In other words, with forms, the picture is quite clear, but some part of the mystery is still there—unless you master the connection between the operator and the form. Therefore, in this chapter, we present another approach, where calculations are much more explicit, if a bit complex. As we shall see, convergence of semigroups involved may be deduced from convergence of solutions of a linear system of equations with four unknowns (see (5.9)).
To this end, we work with the semigroups of Markov operators (contraction operators that preserve the positive cone, see e.g. [30]) in the space of absolutely integrable functions. These semigroups are responsible for evolution of densities of the Markov processes involved (and are perhaps a bit more natural than those of the previous chapter). More specifically, we work with the space \({\mathbb {X}}= L^1(\mathbb {R}) \times L^1(\mathbb {R})\) identified with the space of integrable functions \(f : \mathbb {R}\times \{1,2\} \rightarrow \mathbb {R}.\) In other words, each pair \((f_1,f_2) \in {\mathbb {X}}\) is identified with such a function, defined by \(f (x, i) = f_i(x), i=1,2, x\in \mathbb {R}.\) The norm in \({\mathbb {X}}\) is given by
Given two diffusion coefficients \(a_1,a_2>0\) we define
for \(f_1,f_2 \in W^{2,1}(\mathbb {R})\). Then, \(\mathcal A\) generates a semigroup of Markov operators in \({\mathbb {X}}\): the reader has noticed probably that we got rid of reflecting (Neumann) boundary conditions (which have no bearing on the phenomenon under study) and have allowed Brownian particles to diffuse freely on two copies of the real axis. Next, for given intensities \(\alpha , \beta >0\) we define bounded linear operators in \({\mathbb {X}}\) by
and \( \mathcal P = \frac{1}{\alpha +\beta } \mathcal Q + I_{{\mathbb {X}}}\), so that
where
It is easy to see that \( \mathcal P \) is a Markov operator; it follows that \(\mathcal Q\) generates a semigroup \(\left( \mathrm {e}^{t{Q}}\right) _{t \ge 0}\) of such operators. This semigroup describes the process in which states \((x,i)\) where \(x< 0\) are absorbing, while \((x,1)\) and \((x,2)\) communicate as states of a Markov chain with intensity matrix (4.1). (In particular, it is the point \(0\) that plays now the role of \(x^*\) of the previous chapter.) Therefore, the Phillips perturbation theorem combined with the Trotter product formula, implies that for each \(\kappa >0\) the operator
generates a semigroup of Markov operators in \({\mathbb {X}}.\) (This semigroup is in a sense dual to that generated by \(A_\kappa \) of the previous chapter.)
Since we want to find the limit
we turn to studying the resolvent equation \( \lambda f - A_\kappa f = g ,\) for \(\lambda >0\) and \(g \in {\mathbb {X}}.\) As we shall see, the solution \(f \in D(\mathcal A)\) (which exists and is unique, \(A_\kappa \) generating the contraction semigroup) may be found in a quite explicit way.
To begin with, we note that on the left half-axis, the resolvent equation takes the form
Basic principles of ordinary differential equations tell us that there are solutions to this equation that are integrable on the left half-axis, and they are given by
where \(l_1,l_2\) are (yet) unknown constants.
On the right half-axis, the resolvent equation takes the form
More precisely, this is (a part of) the resolvent equation for \(\mathcal A_1\); the general case will be recovered later by replacing each instance of \(\alpha \) and \(\beta \) by \(\kappa \alpha \) and \(\kappa \beta \), respectively. Moreover, \(g_1,g_2 \) are now treated as members of \(L^1(\mathbb {R}^+)\) and solutions are also sought in this space. The question of existence of these solutions is answered in Lemma 5.1, below, but we need to make some preparatory remarks for this result.
The quadratic equation:
has precisely two real solutions:
where
Moreover, since \(\sqrt{\Delta }< (\lambda + \alpha ) a_1 + (\lambda + \beta ) a_2,\) these solutions are positive; we note that
We look for (the first coordinate of) solutions of (5.3) of the formFootnote 1
where \(k_1,k_2\) are constants, \(e_t (x) = \mathrm {e}^{-\sqrt{t} x}, x\ge 0\) and, for \(t >0\) and \(h \in L^1(\mathbb {R}^+)\),
and
We note that \(G_t\) is a bounded linear operator in \(L^1(\mathbb {R}^+)\).
Lemma 5.1
The pair \((f_1,f_2) \), where \(f_1\) is given by (5.5), and
solves (5.3) (for any constants \(k_1,k_2\)).
Proof
By the very definition of \(f_2\), the first equality in (5.3) is satisfied, and we are left with proving the second one. To this end, we note that \(G_t h \in W^{2,1} (\mathbb {R}^+)\), and
It follows that
Since
we obtain
Therefore, invoking (5.7) (again) and the definition of \(h_i\) (for the first time),
since \(t_i\)s are roots of the quadratic. For the same reason, the terms involving \(g_1\) cancel out, both being equal to \(\alpha \beta g_1\). To summarize, using (5.4) one more time,
This proves the second equality in (5.3). \(\square \)
Now, the pair \((f_1,f_2) \in {\mathbb {X}}\) defined by (5.2), (5.5) and (5.8) belongs to \(D(\mathcal A)\) iff
The first of these conditions (compatibility of values) may be written as
where
Since
the other condition (compatibility of derivatives) reads
Hence, we have the following linear system of equations for \(l_1,l_2,k_1,k_2\):
where for further simplicity of notation \(s_i = \frac{\lambda }{a_i} .\) The Jordan–Gauss elimination method (stopped one step before completion) yields now
Before continuing, we present a lemma summarizing asymptotic behavior of constants and functions appearing in the definition of \(f_1\) and \(f_2\).
Lemma 5.2
Let
If each occurence of \(\alpha \) and \(\beta \) is replaced by \(\kappa \alpha \) and \(\kappa \beta \), respectively, then
Proof
Except for (vi), all claims are immediate by standard calculus, if proven consequtively. To show (vi), we note that for each \(t >0\),
is a bounded linear functional of norm 1. Moreover, for \(h \in L^1(\mathbb {R}^+)\cap L^\infty (\mathbb {R}^+)\),
implying \(\lim _{t\rightarrow \infty }F_t h =0.\) Therefore,
because \(L^1(\mathbb {R}^+)\cap L^\infty (\mathbb {R}^+)\) is dense in \(L^1(\mathbb {R}^+)\) and the functionals are equibounded. Since \(\sqrt{t_1} C_1 = F_{t_1} \sqrt{t_1} h_1\) and \(\lim _{\kappa \rightarrow \infty }t_1 = \infty \), this combined with (v) completes the proof. \(\square \)
We are finally able to state the first of the two main results of this section: it provides information on convergence of resolvents.
Proposition 5.3
If each occurence of \(\alpha \) and \(\beta \) is replaced by \(\kappa \alpha \) and \(\kappa \beta \), respectively, then:
-
(a)
For \(\kappa \) large enough, (5.9) has a unique solution. Moreover, equations (5.2), (5.5) and (5.8) with \(k_1,k_2,l_1,l_2\) calculated from (5.9), give the solution to the resolvent equation for \(A_\kappa \).
-
(b)
As \(\kappa \rightarrow \infty \), the solutions to the resolvent equations for \(A_\kappa \) converge to \((f_1,f_2)= R_\lambda (g_1,g_2)\) defined by
$$\begin{aligned} f_i (x) = {\left\{ \begin{array}{ll} l_i \mathrm {e}^{\sqrt{\frac{\lambda }{a_{i}}}x} + \frac{1}{2\sqrt{{\lambda }{a_{i}}}} \int _{-\infty }^0 \mathrm {e}^{-\sqrt{\frac{\lambda }{a_{i}}}|y-x| } g_i(y) \, \mathrm {d}y, &{} \quad x < 0, \\ k\mathrm {e}^{-\sqrt{\frac{\lambda }{a_{\mathrm a}}} x } + \frac{\alpha '}{2 \sqrt{\lambda a_{\mathrm a}} }\int _0^\infty \mathrm {e}^{- \sqrt{\frac{\lambda }{a_{\mathrm a}}} |x - y|} (g_1 + g_2) (y) \, \mathrm {d}y, &{}\quad x \ge 0, i=1, \\ \frac{\beta }{\alpha } f_1(x), &{} \quad x \ge 0, i=2, \end{array}\right. } \end{aligned}$$(5.11)
where
Proof
(a) For the main determinant, say \(W\), of the last two equations in (5.9), we have
Since \(\lim _{\kappa \rightarrow \infty }\sqrt{t_1} =\infty \), this shows uniqueness of \(k_1\) and \(k_2\) in (5.9), which in turn implies uniqueness of \(l_1\) and \(l_2\) (by the first two equations). The rest is clear.
(b) Using Cramer’s rule for the last two equations in (5.9) and Lemma 5.2, we see that \(\lim _{\kappa \rightarrow \infty }k_1 =0\) and \(\lim _{\kappa \rightarrow \infty }k_2 =k\) for \(k\) defined in (5.12). Passing to the limit in the first two equations (and using Lemma 5.2) we obtain \(\lim _{\kappa \rightarrow \infty }l_1 = k + C_{2,\infty } - D_1\) and \( \lim _{\kappa \rightarrow \infty }l_2 = \frac{\beta }{\alpha }k + \frac{\beta }{\alpha }C_{2,\infty } - D_2.\) By (5.5) and (5.8), this shows that the solutions of the resolvent equations converge to \((f_1,f_2)\) defined by formula (5.11) because the map \(t \mapsto e_t \in L^1(\mathbb {R}^+)\) is norm-continuous (with \(\lim _{t\rightarrow \infty }e_t =0\)), and the map \(t \mapsto G_t \in \mathcal L(L^1(\mathbb {R}^+))\) is strongly continuous. \(\square \)
All that is left to do now is to interpret, or decipher, this result and provide the link with the Freidlin–Wentzell transmission conditions. To this end, let
This subspace of \({\mathbb {X}}\) is isometrically isomorphic to \(L^1(Y)\), the space of integrable functions on an ‘infinite y’-shaped graph depicted at Fig. 3. The latter space in turn may be identified with
the isometric isomorphism \(I: {\mathbb {X}}_0 \rightarrow \mathbb {Y}\) being given by
We have \( I^{-1} (f_1,f_2,f_3) = (f_1 \cup \alpha ' f_3, f_2\cup \beta ' f_3) ,\) where for \(f \in L^1(\mathbb {R}^-)\) and \(g \in L^1(\mathbb {R}^+)\), \(f \cup g\) is a ‘union of graphs’-function:
Next, let \(B_0\) be the operator in \(\mathbb {Y}\) given by
on the domain composed of \((f_1,f_2,f_3)\) such that \(f_1,f_2 \in W^{2,1}(\mathbb {R}^-), f_3 \in W^{2,1} (\mathbb {R}^+)\) satisfy the transmission conditions
These transmission conditions are dual to those of the previous chapter: to be more precise, they describe the same physical/biological phenomenon, yet in a different, ‘dual’ space.
Now, the isomorphic copy of \(B_0\) in \({\mathbb {X}}_0\) is given by
where \( \chi _i = a_i 1_{(-\infty ,0)} + a_{\mathrm a}1_{[0,\infty )}, \) on the domain composed of \((f_1,f_2)\) such that \(f_1\) and \(f_2\) are continuous on \(\mathbb {R}\) (so that in particular
this corresponds to the first two conditions in (5.14)),
and
(It goes without saying that \((f_1,f_2) \in {\mathbb {X}}_0\).)
It is quite easy to solve the resolvent equation for \(B\). Given \(\lambda >0\) and \((g_1,g_2) \in {\mathbb {X}}\), we have
Here, the constants \(m_1,m_2\) and \(n\) are chosen so that transmission conditions (5.15) and (5.16) are satisfied, i.e.,
where \(C = \frac{1}{2 \sqrt{\lambda a_{\mathrm a}} }\int _0^{\infty }\mathrm {e}^{-\sqrt{\frac{\lambda }{a_{\mathrm a}} } y} g_1(y) \, \mathrm {d}y\) and, as previously,
Finally, for \((g_1,g_2) \in {\mathbb {X}}_0,\) \(\alpha ' (g_1+g_2)_{|\mathbb {R}^+} = {g_1}_{|\mathbb {R}^+} \) so that \(C_{2,\infty }= C\), and (5.17) is the same as (5.11), except perhaps for the constants. Moreover, a bit of algebra shows that for \(k\) of (5.12), we have then
This, however, is just the assertion that \(k\) is the same as \(n\) of (5.18). It follows that \(m_i = l_i,i=1,2\) and so
More generally,
where \(\mathcal P\) is defined in (5.1).
Theorem 5.4
We have
strongly and almost uniformly in \(t >0\).
Proof
Relation (5.20) (in fact, (5.19) suffices) shows that condition (a) of Kurtz’s singular perturbation theorem is satisfied. The rest of the argument is precisely the same as in the proof of Theorem 4.1. \(\square \)
To summarize, we have proved a ‘dual’ version of the main result of the previous section. In contrast to the arguments presented there, here the question of convergence of resolvents is reduced to that of a (singular) convergence of solutions of a system of linear equations in \(\mathbb {R}^4.\) Probabilistically, the main theorem of this section speaks of convergence of densities of the related stochastic processes in \(L^1\) norm.
6 A cosine family
In this (last) section, we want to show that the operator \(B_0\) introduced in (5.13) and (5.14) generates a bounded cosine family of operators. (Of course, this implies that so does its isomorphic image \(B\) in \({\mathbb {X}}_0\).) To begin with, we note that in the definition of \(B_0\), the constants \(\alpha \) and \(\beta \) do not appear directly, but merely via \(\alpha '\) and \(\beta '\). Therefore, from now on and without loss of generality, we assume that \(\alpha + \beta =1\). This will simplify notations: in (5.10) and (5.14) the primes may be dropped.
We use Lord Kelvin’s method of images [6, 7]. To this end, let
(equipped with the norm \(\Vert (f_1,f_2,f_3)\Vert _{\mathbb {Z}} = \sum _{i=1}^3 \Vert f_i\Vert _{\mathbb {R}} \)), and let \(C = \{C(t), t \in \mathbb {R}\} \) be the cosine family in \(\mathbb {Z}\) defined as follows. For \(f = (f_1,f_2,f_3)\in \mathbb {Z},\) \(C(t) f \) is the triple \((k_1,k_2,k_3) \in \mathbb {Z}\) given by
where \(\sigma _i = \sqrt{a_i}, i=1,2\) and \(\sigma _3 = \sqrt{a_{\mathrm a}}.\) Our aim is to show that \(\mathbb {Y}\) (equivalently: \(L^1(Y)\)) is isomorphic to a certain subspace, say \(\mathbb {Z}_0\), of \(\mathbb {Z}\) which is invariant for \(C\) (see Lemma 6.1). This will allow describing \(B_0\) as the generator of the isomorphic image of \(C\) restricted to \(\mathbb {Z}_0\) (see Theorem 6.2).
An \(f =(f_1,f_2,f_3) \) is a member of \(\mathbb {Z}_0\) iff
for almost all \(x\ge 0\), where
In particular, the values of \(f_1\) and \(f_2\) on the left half-axis and those of \(f_3\) on the right half-axis determine the whole triple \(f \in \mathbb {Z}_0\). Therefore, there is a one-to-one correspondence between members of \(\mathbb {Z}_0\) and those of \(\mathbb {Y}\). Let \(J: \mathbb {Z}_0 \rightarrow \mathbb {Y}\) be the appropriate mapping. Since coordinates of \(Jf\) are restrictions of those of \(f\), it is clear that \(\Vert Jf\Vert _{\mathbb {Y}} \le \Vert f\Vert _{\mathbb {Z}_0}.\) By the Open Mapping theorem
and the constant appearing here may be found using (6.1).
More interestingly, (6.1) implies
It follows that
This means that \(J^{-1}\) preserves integral modulo the constant \(\frac{4}{1+c}.\)
Instead of showing directly that \(\mathbb {Z}_0\) is invariant for \(C\) (such a calculation is possible but unrewarding, because it does not explain why \(\mathbb {Z}_0\) should be defined as above), we claim first that coordinates of each triple \(f = (f_1,f_2,f_3) \in D(B_0)\) may be uniquely extended to \(\tilde{f} = (\tilde{f}_1, \tilde{f_2}, \tilde{f}_3) \in \mathbb {Z}_0 \), where the last triple fulfills the following requirement: Fix \(t\in \mathbb {R}\) and let \((k_1,k_2,k_3) = C(t) \tilde{f}.\) Then
To prove the claim we let
to be the unknown parts of \(\tilde{f}_i\)’s. Then condition (6.3) (see transmission conditions (5.14)), is equivalent to the following system of equations for \(g_i\)’s, where \(f_i \)’s are known:
where \(t \ge 0.\) Substituting \(g_1\) and \(g_2\) calculated from the first two equations, after some algebra, we obtain
for almost all \(x \ge 0.\) Taking into account that \(g_3\) is supposed to be integrable,
for almost all \( x \ge 0.\) This leads to the conclusion that \(g_1,g_2\) and \(g_3\) must be given by the right-hand sides of (6.1). Checking that the corresponding \(\tilde{f}\) satisfies the required condition is now straightforward.
Lemma 6.1
\(\mathbb {Z}_0\) is invariant for \(C\).
Proof
Since \(D(B_0)\) is dense in \(\mathbb {Y}\), its image is dense in \(\mathbb {Z}_0\). Hence, it if suffices to show that for all \(s\in \mathbb {R}\), \(C(s) J^{-1} f \) is a member of this image, provided \(f \in D(B_0)\). We will argue that
Thus, our aim is to show that \(C(s) J^{-1} f \), which clearly is an extension of \(JC(s) J^{-1} f \), is the extension given by (6.1) (we have proved that this extension is unique). In other words we need to show that for any \(t \in \mathbb {R}\), \( C(t) C(s) J^{-1} f \) belongs to \(D(B_0)\). However, by definition of \( C(t) C(s) J^{-1} f\), since \(D(B_0)\) is linear,
completing the proof. \(\square \)
Theorem 6.2
\(B_0\) is the generator of a cosine family \(C_{B_0} = \{C_{B_0} (t), t \in \mathbb {R}\} \) of equibounded operators given by the abstract Kelvin formula
These operators preserve the integral, i.e., \(F C_{B_0}(t) f = F f \) for all \(f \in \mathbb {Y}, t \in \mathbb {R}, \) where \(F \in \mathbb {Y}^*\) is defined by
Proof
The d’Alembert formula for \(C_{B_0}\) and strong continuity of \(t \mapsto C_{B_0} (t)\) are direct consequences of the corresponding properties of \(C\)—use (6.4). The operators in the abstract Kelvin formula are equibounded because \(C(t), t \in \mathbb {R}\) are Markov operators in \(\mathbb {Z}\) (and hence are contractions). Preservation of the integral is an immediate consequence of (6.2) coupled with fact that \(C(t) , t \in \mathbb {R}\) preserve the integral in \(\mathbb {Z}.\)
Thus, all we need to show is that \(B_0\) is the generator of the cosine family defined by (6.5). We note that the domain of the generator, say \(G\), of \(C\) is \(W^{2,1}(\mathbb {R}) \times W^{2,1}(\mathbb {R}) \times W^{2,1}(\mathbb {R}) ,\) and \(G (f_i)_{i=1,2,3} = (\sigma _i f_i'')_{i=1,2,3},\) and that the generator of \(C\) restricted to \(\mathbb {Z}_0\) is the part \(G_{\mathrm p}\) of \(G\) in \({\mathbb {X}}_0.\) Since on the right-hand side of (6.5) instead of \(C(t)\) we could write its restriction to \(\mathbb {Z}_0\), we conclude that the generator of the cosine family given by the Kelvin formula is the isomorphic copy of \(G_{\mathrm p}\) (via \(J\)).
Let \(f \in D(G_{\mathrm p})\). Then, the coordinates \(f_1,f_2\) and \(f_3\) of \(f\) are members of \(W^{2,1}(\mathbb {R}).\) Since the right-hand and left-hand limits of \(f_i\)’s at \(x=0\) must coincide, the last equation in (6.1) implies, after simple algebra, that
and then the first two equations yield \(f_1 (0) = \alpha f_3(0)\) and \(f_2(0 ) = \beta f_3(0).\) Similarly a short calculation based on the third equation in (6.1) shows that \(a_{\mathrm a}f_3'(0) = a_1 f_1'(0) + a_2 f_2 '(0) \). This proves that the image of \(D(G_{\mathrm p})\) via \(J\) is contained in \(D(B_0).\) Since for \(f \) in this image \(J G_{\mathrm p} f = B_0 f \), \(B_0\) extends the isomorphic image of \(G_{\mathrm p}\). The extension cannot be proper, because both operators are generators of semigroups. This completes the proof. \(\square \)
Remark 6.3
\(C_{B_0}\) is not a cosine family of Markov operators (even though the related semigroup is composed of Markov operators). For example, if \(\alpha \sigma _1 - \beta \sigma _2 < \sigma _3\), taking \(f_2=0\), \(f_3 =0\) and \(f_1 = 1_{[-1,0]} \in L^1(\mathbb {R}^-)\), we obtain that \(\tilde{f}_1(x) = \frac{2\alpha }{1+c} \frac{\sigma _1}{\sigma _3} - 1 <0 , \) for \(x \in [0,1]\), and \(\tilde{f}_1(x)= 0\) for \( x >1\). Then, the first coordinate of \(C_{B_0}(2\sigma _{1}^{-1} ) f \) equals, for \(x \in [-2,-1]\),
Notes
Such a form is suggested by straightforward, but lengthy analysis involving standard methods of ODEs with constant coefficients, accompanied with the care to obtain solutions in \(L^1(\mathbb {R}^+)\).
References
Arendt, W.: Heat Kernels—Manuscript of the 9th Internet Seminar. (2006). Freely available at http://www.uni-ulm.de/fileadmin/website_uni_ulm/mawi.inst.020/arendt/downloads/internetseminar.pdf
Banasiak, J., Bobrowski, A.: A semigroup related to a convex combination of boundary conditions obtained as a result of averaging other semigroups. J. Evol. Equ. 15(1), 223–237 (2015)
Batty, C.J.K., Elst, A.F.M.ter: On series of sectorial forms. J. Evol. Equ. 14(1), 29–47 (2014)
Bobrowski, A.: Functional Analysis for Probability and Stochastic Processes: An Introduction. Cambridge University Press, Cambridge (2005)
Bobrowski, A.: On a semigroup generated by a convex combination of two Feller generators. J. Evol. Equ. 7(3), 555–565 (2007)
Bobrowski, A.: Generation of cosine families via Lord Kelvin’s method of images. J. Evol. Equ. 10(3), 663–675 (2010)
Bobrowski, A.: Lord Kelvin’s method of images in the semigroup theory. Semigroup Forum 81, 435–445 (2010)
Bobrowski, A.: Singular perturbations involving fast diffusion. J. Math. Anal. Appl. 427(2), 1004–1026 (2015)
Bobrowski, A., Bogucki, R.: Semigroups generated by convex combinations of several Feller generators in models of mathematical biology. Studia Math. 189, 287–300 (2008)
Bobrowski, A., Bogucki, R.: Two theorems on singularly perturbed semigroups with applications to models of applied mathematics. Discr. Cont. Dyn. Syst. B 17(3), 735–757 (2012)
Chung, K.L.: On the exponential formulas of semi-group theory. Math. Scand. 10, 153–162 (1962)
Debnath, L., Mikusiński, P.: Introduction to Hilbert Spaces with Applications, 3rd edn. Elsevier, Amsterdam (2005)
Doob, J.L.: Stochastic Processes. Wiley, New York (1953)
Engel, K.-J., Nagel, R.: One-Parameter Semigroups for Linear Evolution Equations. Springer, New York (2000)
Ethier, S.N., Kurtz, T.G.: Markov Processes. Characterization and Convergence. Wiley, New York (1986)
Feller, W.: Two singular diffusion problems. Ann. Math. 54, 173–182 (1951)
Feller, W.: The parabolic differential equations and the associated semi-groups of transformations. Ann. Math. 55, 468–519 (1952)
Feller, W.: An Introduction to Probability Theory and Its Applications, vol. 2, Wiley, New York, 1966, Second edition (1971)
Freidlin, M.I., Wentzell, A.D.: Diffusion processes on graphs and the averaging principle. Ann. Math. 21, 2215–2245 (1993)
Freidlin, M.I., Wentzell, A.D.: Diffusion processes on an open book and the averaging principle. Stoch. Process. Appl. 113(1), 101–126 (2004)
Freidlin, M.I., Wentzell, A.D.: Random Perturbations of Dynamical Systems, Third ed., Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences], vol. 260, Translated from the 1979 Russian original by Joseph Szücs (2012), Springer, Heidelberg, (2012)
Goldstein, J.A.: Semigroup-theoretic proofs of the central limit theorem and other theorems of analysis. Semigroup Forum 12(3), 189–206 (1976)
Goldstein, J.A.: Semigroups of Linear Operators and Applications. Oxford University Press, New York (1985)
Hille, E., Phillips, R.S.: Functional Analysis and Semi-Groups, Am. Math. Soc. Colloq. Publ. 31. Am. Math. Soc., Providence, RI (1957)
Kallenberg, O.: Foundations of Modern Probability. Springer, New York (1997)
Kato, T.: Perturbation Theory for Linear Operators. Classics in Mathematics Series. Springer, Berlin (1995). reprint of the 1980 edition
Kisyński, J.: Semi-groups of Operators and Some of Their Applications to Partial Differential Equations, Control Theory and Topics in Functional Analysis (Internat. Sem., Internat. Centre Theoret. Phys., Trieste, 1974), vol. III, International Atomic Energy Agency, Vienna, pp. 305–405 (1976)
Kurtz, T.G.: A limit theorem for perturbed operator semigroups with applications to random evolutions. J. Funct. Anal. 12, 55–67 (1973)
Kurtz, T.G.: Applications of an abstract perturbation theorem to ordinary differential equations. Houston J. Math. 3(1), 67–82 (1977)
Lasota, A., Mackey, M.C.: Chaos, Fractals, and Noise. Stochastic Aspects of Dynamics. Springer, New York (1994)
Ouhabaz, E.M.: Analysis of Heat Equations on Domains. Lond. Math. Soc. Monograph Series, vol. 30. Princeton University Press, Princeton (2005)
Pazy, A.: Semigroups of Linear Operators and Applications to Partial Differential Equations. Springer, New York (1983)
Simon, B.: A canonical decomposition for quadratic forms with applications to monotone convergence theorems. J. Funct. Anal. 28(3), 377–385 (1978)
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Jerome A. Goldstein.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Bobrowski, A. Emergence of Freidlin–Wentzell’s transmission conditions as a result of a singular perturbation of a semigroup. Semigroup Forum 92, 1–22 (2016). https://doi.org/10.1007/s00233-015-9729-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00233-015-9729-x