1 Introduction

Recently there has been considerable interest in dynamical problems on graphs, where some evolution operators, such as transport or diffusion, act on the edges of a graph and interact through its nodes. One can mention here quantum graphs, see e.g. [14], diffusion on graphs in a probabilistic context, [2, 5, 6], transport problems, both linear and nonlinear, [7–12], migrations, [13], and several other applications discussed in e.g. [4, 14]. In particular, the recent monograph [14] is a rich source of network models and methods. However, most of these works focus on particular problems. For instance, in quantum graph theory the main interest lies in determining whether the operators defined on the edges of a graph are self-adjoint, and the work is confined to the Hilbert space setting. Most papers on linear transport theory on graphs focus on the long-term dynamics of the flow. Papers such as [2, 5, 6], motivated by probabilistic applications, look at Feller or Markov processes on graphs.

The present paper, which provides the theoretical foundation for [15], is similar in spirit to [5, 6] in the sense that we prove the existence of strongly continuous semigroups, in the space of continuous functions as well as in the space of integrable functions, that solve the diffusion problem on a network. However, we extend the existing results of [5, 6] by considering processes that are more general than diffusion along the edges of a metric graph with Robin boundary conditions at its vertices. More precisely, we allow for communication between domains that are not necessarily physically connected. In fact, the models we analyse can also be interpreted as diffusion on a hypergraph, [16], but we shall not pursue this line of research here. For completeness, we also present similar results for transport problems, generalizing [7] in a similar way.

To explain the idea of our extension, in the next section we consider two examples, see [5, 17].

1.1 Motivation

First, let us introduce basic notation which will help to formulate the problems and results. We will work in a finite dimensional space, say, \({\mathbb {R}}^m.\) The boldface characters will typically denote vectors in \({\mathbb {R}}^m\), e.g. \({\mathbf {u}} = (u_1,\ldots ,u_m).\) We denote \(\mathcal {M} = \{1,\ldots ,m\}.\) Further, for any Banach space X, we will use the notation \({\mathbf {X}} = \underbrace{X\times \ldots \times X}_{m\,times}\), e.g. for \(X=L_1(I), I=[0,1]\) we denote \({\mathbf {L}}_1(I)=\underbrace{L_1(I)\times \ldots \times L_1(I)}_{m\,times}\).

Let \(({\mathbf {A}}, D({\mathbf {A}}))\) be an operator in \({\mathbf {X}}\). If \({\mathbf {A}}\) is a generator, we will denote by \(\{e^{t{\mathbf {A}}}\}_{t \ge 0}\) the semigroup generated by \({\mathbf {A}}\).

1.1.1 Diffusion

We consider a finite metric graph without loops or isolated edges, \(\mathcal {G} = (\mathcal {V}, \mathcal {E})\) with, say, n vertices and m edges. On each edge there is a substance with density \(u_j, j\in \mathcal {M},\) which diffuses along this edge according to

$$\begin{aligned} \partial _t u_j = \sigma _j \partial _{xx}u_j, \quad x\in ]0,1[, \quad j\in \mathcal {M}, \end{aligned}$$
(1)

where \(\sigma _j>0\) are constant diffusion coefficients, and can also enter the adjacent edges. To simplify considerations, each edge is identified with the unit interval I. In the model of [5], the particles can permeate between the edges across the vertices that join them according to a version of the Fick law. To write down its analytical form, first we note that, since diffusion does not have a preferred direction, we can assign the tail, or the left endpoint (that is, 0), and the head, or the right endpoint (that is, 1), to the endpoints of each edge in an arbitrary way. Let \(l_i\) and \(r_i\) be the rates at which the substance leaves \(e_i\) through, respectively, the left and the right endpoint, and let \(l_{ik}\) and \(r_{ik}\) be the rates at which it subsequently enters the edge \(e_k\). Then the Fick law at, respectively, the head and the tail of \(e_i\) gives

$$\begin{aligned} -\partial _x u_{i}(1)= & {} r_{i}u_{i}(1) - \sum _{j\ne i}{r_{ij}u_{j}(v)},\nonumber \\ \partial _x u_{i}(0)= & {} l_iu_i(0) - \sum _{j\ne i}{l_{ij}u_{j}(v)}, \end{aligned}$$
(2)

where we have written \(u_j(v)\) as v may be either the tail or the head of the incident edge \(e_j\). In particular, if there are no edges incident to the tail of \(e_i\), or there are no edges incident to the head of \(e_i\), then the Fick law takes the form

$$\begin{aligned} \partial _xu_{i}(0) = l_iu_i(0), \qquad -\partial _xu_{i}(1) = r_{i}u_{i}(1), \end{aligned}$$
(3)

respectively, where either coefficient on the right hand side can be 0.

It is clear that if \(r_{ij}\ne 0\) or \(l_{ij}\ne 0,\) the edges \(e_i\) and \(e_j\) are incident and thus we can define an \(m\times m\) matrix \({\mathbb {A}}=\{a_{ij}\}_{1\le i\le m, 1\le j\le m}\) by setting \(a_{ij}=1\) if either \(r_{ij}\ne 0\) or \(l_{ij} \ne 0\) and zero otherwise. Then \({\mathbb {A}}\) is the adjacency matrix of the line graph \(L(\mathcal {G})\) of \(\mathcal {G}\), see e.g. [18]. However, it turns out that such a matrix is not easy to use. On the other hand, introducing, for any \(i,j \in \mathcal {M},\)

$$\begin{aligned} k^{00}_{ij}= & {} -l_{ij} \quad \mathrm {if}\; v = 0,\qquad k^{01}_{ij} = -l_{ij}\quad \mathrm {if}\; v=1, \qquad k^{00}_{ii} = l_{i},\nonumber \\ k^{10}_{ij}= & {} r_{ij}\quad \mathrm {if}\; v = 0,\qquad k^{11}_{ij} = r_{ij} \quad \mathrm {if}\; v=1, \qquad k^{11}_{ii} = -r_{i}, \end{aligned}$$
(4)

where v is, respectively, the tail or the head of the edge under consideration, the problem can be written as

$$\begin{aligned} \partial _t {\mathbf {u}}(x,t)= & {} {\mathbb {D}}\partial _{xx}{\mathbf {u}}(x,t), \qquad (x,t) \in ]0,1[\times {\mathbb {R}}_+,\nonumber \\ \partial _x {\mathbf {u}}(0,t)= & {} {\mathbb {K}}^{00}{\mathbf {u}}(0,t) + {\mathbb {K}}^{01}{\mathbf {u}}(1,t),\qquad t>0, \nonumber \\ \partial _x {\mathbf {u}}(1,t)= & {} {\mathbb {K}}^{10} {\mathbf {u}}(0,t)+{\mathbb {K}}^{11} {\mathbf {u}}(1,t), \qquad t>0,\nonumber \\ {\mathbf {u}}(x,0)= & {} \mathring{{\mathbf {u}}}(x), \qquad x\in ]0,1[, \end{aligned}$$
(5)

where \({\mathbf {u}} = (u_1,\ldots ,u_m)\), \({\mathbb {D}} = \mathrm {diag}\{\sigma _{i}\}_{1\le i \le m}\) and \(\mathring{{\mathbf {u}}}\) is the initial distribution. Writing the problem in this form makes it amenable to a unified analysis.
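To make the structure of (5) concrete, here is a minimal numerical sketch of ours that discretizes (5) for \(m=2\) by the method of lines; the diffusion coefficients, the matrices \({\mathbb {K}}^{\omega }\) and the initial data are hypothetical and serve only to illustrate the form of the problem.

```python
import numpy as np
from scipy.integrate import solve_ivp

# Hypothetical data: m = 2 edges, diffusion coefficients sigma_j and
# boundary matrices K^{omega}; only the structure of (5) comes from the text.
m, N = 2, 100
h = 1.0 / N
sigma = np.array([1.0, 0.5])
K00 = np.array([[0.5, -0.2], [-0.3, 0.4]])
K01 = np.array([[0.0, -0.1], [-0.1, 0.0]])
K10 = np.array([[0.0, 0.1], [0.2, 0.0]])
K11 = np.array([[-0.5, 0.2], [0.3, -0.6]])

def rhs(t, y):
    u = y.reshape(m, N + 1)
    du = np.empty_like(u)
    # interior nodes: u_t = sigma * u_xx
    du[:, 1:-1] = sigma[:, None] * (u[:, 2:] - 2 * u[:, 1:-1] + u[:, :-2]) / h**2
    # boundary conditions of (5), imposed through ghost values:
    #   u_x(0) = K00 u(0) + K01 u(1),   u_x(1) = K10 u(0) + K11 u(1)
    g0 = K00 @ u[:, 0] + K01 @ u[:, -1]
    g1 = K10 @ u[:, 0] + K11 @ u[:, -1]
    ghost_l = u[:, 1] - 2 * h * g0        # u(-h) from the centered difference
    ghost_r = u[:, -2] + 2 * h * g1       # u(1+h)
    du[:, 0] = sigma * (u[:, 1] - 2 * u[:, 0] + ghost_l) / h**2
    du[:, -1] = sigma * (ghost_r - 2 * u[:, -1] + u[:, -2]) / h**2
    return du.ravel()

x = np.linspace(0.0, 1.0, N + 1)
u0 = np.vstack([np.sin(np.pi * x) ** 2, np.exp(-10.0 * (x - 0.5) ** 2)])
sol = solve_ivp(rhs, (0.0, 0.2), u0.ravel(), method="BDF", rtol=1e-8, atol=1e-10)
print(sol.y[:, -1].reshape(m, N + 1)[:, ::25])   # snapshot of both densities at t = 0.2
```

The ghost-point construction reproduces the centered-difference approximation of \(\partial _x{\mathbf {u}}\) at the endpoints, so the boundary conditions are enforced to the same order of accuracy as the interior scheme.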

It is also clear that there is no mathematical reason why the matrices \({\mathbb {K}}^{\omega }\), \(\omega \in \Omega = \{00, 01,10,11\}\), in (5) should be restricted to the matrices given by (4) which, indeed, form a strictly smaller class, see [17]. In this paper we study the well-posedness of (5) for arbitrary matrices \({\mathbb {K}}^{\omega }\) in the spaces \({\mathbf {C}}(I)\) and \({\mathbf {L}}_1(I),\) extending and simplifying the results of [5, 6]. We also find necessary and sufficient conditions for the semigroup solving (5) to be positive.

1.1.2 Transport problems

We consider a digraph \(G =(V(G), E(G)) = (\{v_1,\ldots , v_n\}, \{e_1,\ldots , e_m\})\) with n vertices and m edges. We suppose that none of the vertices is isolated. As before, each edge is normalized so as to be identified with I with the head at 1 and the tail at 0. Following [7, 912], we consider a substance of density \(u_j(x,t)\) on the edge \(e_j\), moving with speed \(c_j\) along this edge. The conservation of mass at each vertex is expressed by the Kirchhoff law,

$$\begin{aligned} \sum \limits _{j =1}^{m} \phi ^-_{ij}c_ju_j(0,t)= \sum \limits _{j =1}^{m} \phi ^+_{ij}c_ju_j(1,t), \quad t>0,\ i =1,\ldots ,n, \end{aligned}$$
(6)

where \(\Phi ^{-}= \{\phi _{ij}^{-}\}_{1\le i\le n, 1\le j\le m}\) and \(\Phi ^{+}=\{\phi _{ij}^{+}\}_{1\le i\le n, 1\le j\le m}\) are, respectively, the outgoing and incoming incidence matrices; that is, matrices with the entry \(\phi _{ij}^-\) (resp. \(\phi _{ij}^+\)) equal to 1 if there is an edge \(e_j\) outgoing from (resp. incoming to) the vertex \(v_i\), and zero otherwise. Note that, due to the definitions of the matrices \(\Phi ^-\) and \(\Phi ^+,\) the summation on the right hand side is over all incoming edges of the vertex \(v_i\) and on the left hand side over all outgoing edges of \(v_i\).

In [7] we considered a slightly more general model

$$\begin{aligned}&\partial _t u_{j}(x,t)+ c_j\partial _xu_{j}(x,t)=0,\quad x\in (0,1),\quad t\ge 0,\nonumber \\&u_{j}(x,0) = \mathring{u}_{j}(x),\nonumber \\&\phi _{ij}^{-}\xi _jc_ju_j(0,t) = w_{ij} \sum \limits _{k=1}^{m}{\phi _{ik}^{+}(\gamma _{k}c_ku_k(1,t))}, \end{aligned}$$
(7)

where \(\gamma _j>0\) and \(\xi _j>0\) are the absorption/amplification coefficients at, respectively, the head and the tail of \(e_j.\) Here the matrix \(\{w_{ij}\}_{1\le i\le n, 1\le j\le m}\) describes the distribution of the incoming flow at the vertex \(v_i\) into the edges outgoing from it; it is a column stochastic matrix, [11]. We denote \({\mathbb {C}} = \mathrm {diag} \{c_j\}_{1\le j\le m}, \Xi = \mathrm {diag} \{\xi _j\}_{1\le j\le m}\) and \(\Gamma = \mathrm {diag} \{\gamma _j\}_{1\le j\le m}.\)

It follows, [7], that if G has a sink, then there is no \(C_0\)-semigroup solving (7). Hence we discard this case; then it can be proved, e.g. [9, Proposition 3.1], that (7) can be written as an abstract Cauchy problem

$$\begin{aligned} {\mathbf {u}}_t = {\mathbf {A}}{\mathbf {u}}, \qquad {\mathbf {u}}(0) = \mathring{{\mathbf {u}}}, \end{aligned}$$
(8)

in \({\mathbf {X}} = {\mathbf {L}}_1(I)\), where \({\mathbf {A}}\) is the realization of \(\mathsf{A} = \mathrm {diag}\{-c_j \partial _x\}_{1\le j\le m}\) on the domain

$$\begin{aligned} D({\mathbf {A}}) = \left\{ {\mathbf {u}} \in {\mathbf {W}}^{1}_1(I);\; {\mathbf {u}}(0) = \Xi ^{-1}{\mathbb {C}}^{-1} {\mathbb {B}}\Gamma {\mathbb {C}}{\mathbf {u}}(1)\right\} , \end{aligned}$$
(9)

where \({\mathbb {B}}\) is the (transposed) adjacency matrix of the line graph of G.
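To make (9) concrete, the following sketch of ours assembles the boundary matrix \(\Xi ^{-1}{\mathbb {C}}^{-1}{\mathbb {B}}\Gamma {\mathbb {C}}\) directly from the data of (7) for a hypothetical directed 3-cycle; the speeds and coefficients are illustrative, and each vertex has exactly one outgoing edge so that the weights \(w_{ij}\) are trivial.

```python
import numpy as np

# Hypothetical digraph: e1: v1 -> v2, e2: v2 -> v3, e3: v3 -> v1.
# Phi_minus[i, j] = 1 if e_j leaves v_i; Phi_plus[i, j] = 1 if e_j enters v_i.
Phi_minus = np.array([[1, 0, 0],
                      [0, 1, 0],
                      [0, 0, 1]], dtype=float)
Phi_plus = np.array([[0, 0, 1],
                     [1, 0, 0],
                     [0, 1, 0]], dtype=float)
c = np.array([1.0, 2.0, 0.5])       # speeds c_j
xi = np.array([1.0, 0.9, 1.1])      # tail coefficients xi_j
gamma = np.array([1.0, 0.8, 1.2])   # head coefficients gamma_j
w = Phi_minus.copy()                # flow-distribution weights; trivial here,
                                    # as every vertex has one outgoing edge

# From the third equation of (7): for edge j with tail vertex i,
#   u_j(0) = (w_{ij} / (xi_j c_j)) * sum_k Phi_plus[i, k] gamma_k c_k u_k(1),
# which is row j of the boundary matrix K in u(0) = K u(1).
tails = Phi_minus.argmax(axis=0)    # tails[j] = tail vertex of e_j
K = np.array([w[tails[j], j] * Phi_plus[tails[j], :] * gamma * c / (xi[j] * c[j])
              for j in range(3)])
print(K)    # realizes Xi^{-1} C^{-1} B Gamma C for this particular graph
```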

As with the diffusion problems, there is no mathematical reason to restrict our analysis to boundary conditions with matrices of the form \(\Xi ^{-1}{\mathbb {C}}^{-1} {\mathbb {B}}\Gamma {\mathbb {C}}\), which, in fact, form a strict subset of the set of all matrices, see [17]. Thus, we consider the following generalization of (8), (9),

$$\begin{aligned} \partial _t{\mathbf {u}} = \mathsf{A}{\mathbf {u}}, \qquad {\mathbf {u}}(0,t)={\mathbb {K}} {\mathbf {u}}(1,t),\qquad {\mathbf {u}}(x,0) = \mathring{{\mathbf {u}}}(x), \end{aligned}$$
(10)

where \({\mathbb {K}}\) is an arbitrary matrix.

Example 1.1

The main difference between (10) with arbitrary \({\mathbb {K}}\) and the model with \({\mathbb {K}}\) given in (9) is that in the former the exchange of the substance can occur instantaneously between any edges, while in the latter the edges must be physically connected by vertices for the exchange to take place. So, for instance, if there are connections \(e_1\rightarrow e_2\) and \(e_2\rightarrow e_3\) but \(e_1\) and \(e_3\) do not share a vertex, then there is no direct connection \(e_1\rightarrow e_3\). So, while in general (10) cannot model a flow in a physical network, it can describe e.g. a mutation process. Indeed, let a population of cells be divided into m subpopulations, with \({\mathbf {v}}(x) =(v_1(x),\ldots , v_m(x)),\) where \(v_j(x), j\in \mathcal {M}, x\in [0,1],\) is the density of cells of age x whose genotype belongs to a class j (for instance, having j copies of a gene of a particular type). We assume that cells of class j mature and divide upon reaching maturity at \(x=1,\) with offspring, due to mutations, appearing in class i with probability \(k_{ij}\), \(i\in \mathcal {M}\). In such a case \((k_{ij})_{1\le i,j\le m}\) is a column stochastic matrix. We note that a particular case of this model is the discrete Rotenberg–Rubinov–Lebowitz model, [19], where the cells are divided into classes according to their maturation velocity.

A similar interpretation can be given to (5) where the variable x, instead of the age, denotes the size of the organism, e.g. [20].

Moreover, problems of the form (10) arise e.g. in queuing theory, [21].

2 Well-posedness of the diffusion problem

We shall consider the solvability of (5) in \({\mathbf {X}}={\mathbf {C}}(I)\) and \({\mathbf {X}}= {\mathbf {L}}_1(I).\) For technical reasons, we shall also need the solvability of (5) in \({\mathbf {W}}^1_1(I)\). The norms in these spaces will be denoted, respectively, by \(\Vert \cdot \Vert _\infty , \Vert \cdot \Vert _0, \Vert \cdot \Vert _1\). However, if it does not lead to any misunderstanding, we will use \({\mathbf {X}}\) to denote any of these spaces and \(\Vert \cdot \Vert \) or \(\Vert \cdot \Vert _{\mathbf {X}}\) to denote the norm in \({\mathbf {X}}\). Similarly, by \(|||\cdot |||_{{\mathbf {X}}}\) we denote the operator norm in the space of bounded linear operators from \({\mathbf {X}}\) to \({\mathbf {X}}\).

Our results are based on [22] and thus, when introducing the relevant spaces and operators, we try to keep the notation consistent with op. cit. First, consider

$$\begin{aligned} {\mathbf {X}} \ni {\mathbf {u}} \rightarrow L{\mathbf {u}} = (\gamma _0 \partial _x{\mathbf {u}}, \gamma _1 \partial _x {\mathbf {u}}) \in {\mathbf {Y}} = {\mathbb {R}}^m\times {\mathbb {R}}^m, \end{aligned}$$
(11)

where \(\gamma _i, i=0,1,\) is the trace operator at \(x=i\) (taking the value at \(x=i\) if \({\mathbf {X}} = {\mathbf {C}}(I)\)). The domains of L are \(D(L) = {\mathbf {C}}^1(I)\) if \({\mathbf {X}} ={\mathbf {C}}(I)\) and \(D(L) = {\mathbf {W}}^2_1(I)\) in the two other cases. Then we define the operator

$$\begin{aligned} {\mathbf {X}}\ni {\mathbf {u}} \rightarrow \Phi {\mathbf {u}} = {\mathbb {K}}(\gamma _0 {\mathbf {u}}, \gamma _1{\mathbf {u}}) = \left( \begin{array}{c@{\quad }c} {\mathbb {K}}^{00}&{} {\mathbb {K}}^{01}\\ {\mathbb {K}}^{10}&{} {\mathbb {K}}^{11} \end{array}\right) \left( \begin{array}{c} \gamma _0 {\mathbf {u}}\\ \gamma _1 {\mathbf {u}} \end{array}\right) \in {\mathbb {R}}^m\times {\mathbb {R}}^m. \end{aligned}$$

Further, let us denote

$$\begin{aligned} \Phi ^*{\mathbf {u}} = {\mathbb {K}}^*(\gamma _0 {\mathbf {u}}, \gamma _1{\mathbf {u}}) = \left( \begin{array}{c@{\quad }c} {\mathbb {K}}^{00\,T}&{} -{\mathbb {K}}^{10\,T}\\ -{\mathbb {K}}^{01\,T}&{} {\mathbb {K}}^{11\,T} \end{array}\right) \left( \begin{array}{c} \gamma _0 {\mathbf {u}}\\ \gamma _1 {\mathbf {u}} \end{array}\right) , \end{aligned}$$
(12)

where \({\mathbb {K}}^T\) denotes the transpose of \({\mathbb {K}}\). Clearly \((\Phi ^*)^* = \Phi \).

Let \(\mathsf A\) denote the differential expression \(\mathsf A{\mathbf {u}} := {\mathbb {D}}\partial _{xx}{\mathbf {u}}.\) Then we define the operators \({\mathbf {A}}^\alpha _\Phi \), \(\alpha =\infty , 0, 1\) by the restriction of \(\mathsf A\) to the domains

$$\begin{aligned} D({\mathbf {A}}^\infty _\Phi )= & {} \left\{ {\mathbf {u}} \in {\mathbf {C}}^2(I);\;L{\mathbf {u}} = \Phi {\mathbf {u}}\right\} ,\nonumber \\ D({\mathbf {A}}^0_\Phi )= & {} \left\{ {\mathbf {u}} \in {\mathbf {W}}^2_1(I);\;L{\mathbf {u}} = \Phi {\mathbf {u}}\right\} ,\nonumber \\ D({\mathbf {A}}^1_\Phi )= & {} \left\{ {\mathbf {u}} \in {\mathbf {W}}^3_1(I);\;L{\mathbf {u}} = \Phi {\mathbf {u}}\right\} , \end{aligned}$$
(13)

respectively. As before, we drop the indices from the notation whenever this does not lead to any misunderstanding.

2.1 Basic estimates in the scalar case

Consider the general resolvent equation for (5)

$$\begin{aligned} \lambda {\mathbf {u}} - {\mathbb {D}}\partial _{xx} {\mathbf {u}} = {\mathbf {f}}, \quad x \in ]0,1[. \end{aligned}$$
(14)

Since without the boundary conditions the system is uncoupled, its solution \({\mathbf {u}}\) is given by

$$\begin{aligned} {\mathbf {u}}(x) = {\mathbb {E}}(-{\varvec{\mu }}x) {\mathbf {C}}_1+ {\mathbb {E}}({\varvec{\mu }}x){\mathbf {C}}_2 + {\mathbf {U}}_{{\varvec{\mu }}}(x) \end{aligned}$$
(15)

where \({\mathbb {E}}(\pm {\varvec{\mu }}x) = \mathrm {diag}\{e^{\pm \mu _ix}\}_{1\le i\le m},\) \(0\ne \lambda /\sigma _i = \mu _i^2 = |\lambda /\sigma _i|e^{i\theta }\) with \(\mathfrak {R}\mu _i >0,\) \({\mathbf {U}}_{{\varvec{\mu }}}(x) = (U_{\mu _1}(x),\ldots , U_{\mu _m}(x))\) with

$$\begin{aligned} U_{\mu _i}(x) = \frac{1}{2\mu _i\sigma _i} \int \limits _{0}^{1} e^{-\mu _i|x-s|}f_i(s)ds, \end{aligned}$$
(16)

and the vector constants \({\mathbf {C}}_1\) and \({\mathbf {C}}_2\) are determined by the boundary conditions. Further, denote

$$\begin{aligned} \Sigma _{\alpha ,\theta _0}= \left\{ \lambda = |\lambda |e^{i\theta }\in {\mathbb {C}};\;|\lambda |\ge \alpha , |\theta |\le \theta _0 <\pi \right\} \end{aligned}$$

and \( \Sigma _{0, \theta _0} = \Sigma _{\theta _0}.\)

The starting point is provided by the well-known estimates for the scalar Dirichlet and Neumann problems, e.g. [23]. We briefly recall them here in a slightly more precise form, similarly to [24]. In the scalar case we can use \(\sigma =1,\) as will become clear when the estimate is derived. It follows that for the Dirichlet problem we have

$$\begin{aligned} C_1^D = \frac{U_\mu (1)-U_\mu (0)e^{\mu }}{e^\mu -e^{-\mu }},\qquad C_2^D = \frac{U_\mu (0)e^{-\mu }-U_\mu (1)}{e^\mu -e^{-\mu }}, \end{aligned}$$
(17)

and for the Neumann problem

$$\begin{aligned} C_1^N = \frac{U_\mu (0)e^\mu +U_\mu (1)}{e^\mu -e^{-\mu }},\qquad C_2^N = \frac{U_\mu (1)+U_\mu (0)e^{-\mu }}{e^\mu -e^{-\mu }}. \end{aligned}$$
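As a quick numerical sanity check of (15)–(17) (ours, not part of the original argument; we take \(\sigma =1\) and a sample right-hand side f), the function u built from \(C_1^D, C_2^D\) vanishes at both endpoints, satisfies the resolvent equation (14), and obeys the bound derived in (21) below:

```python
import numpy as np
from scipy.integrate import quad

# Scalar sanity check of (15)-(17) with sigma = 1 (illustrative sample data).
lam = 4.0 + 3.0j
mu = np.sqrt(lam)                     # principal branch, so Re mu > 0
f = lambda s: np.sin(np.pi * s)       # sample f with ||f||_inf = 1

def U(x):                             # U_mu of (16), computed by quadrature
    re = quad(lambda s: (np.exp(-mu * abs(x - s)) * f(s)).real, 0.0, 1.0, points=[x])[0]
    im = quad(lambda s: (np.exp(-mu * abs(x - s)) * f(s)).imag, 0.0, 1.0, points=[x])[0]
    return (re + 1j * im) / (2.0 * mu)

e = np.exp
C1 = (U(1.0) - U(0.0) * e(mu)) / (e(mu) - e(-mu))     # (17)
C2 = (U(0.0) * e(-mu) - U(1.0)) / (e(mu) - e(-mu))
u = lambda x: e(-mu * x) * C1 + e(mu * x) * C2 + U(x)

print(abs(u(0.0)), abs(u(1.0)))                       # Dirichlet data: both ~ 0
x, h = 0.3, 1e-4
u_xx = (u(x + h) - 2.0 * u(x) + u(x - h)) / h**2      # second difference
print(abs(lam * u(x) - u_xx - f(x)))                  # residual of (14): small
sup_u = max(abs(u(t)) for t in np.linspace(0.0, 1.0, 101))
print(sup_u, 2.0 / (abs(lam) * np.cos(np.angle(lam) / 2.0)))  # sup|u| vs bound (21)
```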

Further, [23],

$$\begin{aligned} \Vert U_\mu \Vert _\infty \le \frac{\Vert f\Vert _\infty }{|\mu |\mathfrak {R}\mu } \le \frac{\Vert f\Vert _\infty }{|\lambda |\cos \theta /2}, \qquad \Vert U_\mu \Vert _0 \le \frac{\Vert f\Vert _0}{|\mu |\mathfrak {R}\mu } \le \frac{\Vert f\Vert _0}{|\lambda |\cos \theta /2}, \end{aligned}$$
(18)

and, for \( i=0,1\),

$$\begin{aligned} |U_\mu (i)| \le \frac{\Vert f\Vert _\infty (1-e^{-\mathfrak {R}\mu })}{2|\mu |\mathfrak {R}\mu },\quad |U_\mu (i)| \le \frac{\Vert f\Vert _0}{2|\mu |}. \end{aligned}$$
(19)

The following result plays an essential role in deriving precise estimates,

$$\begin{aligned} \left| \frac{1}{e^\mu -e^{-\mu }}\right| = e^{-\mathfrak {R}\mu }\left| \sum \limits _{n=0}^{\infty }e^{-2n\mu }\right| \le e^{-\mathfrak {R}\mu }\sum \limits _{n=0}^{\infty }e^{-2n\mathfrak {R}\mu } = \frac{e^{-\mathfrak {R}\mu }}{1-e^{-2\mathfrak {R}\mu }}. \end{aligned}$$
(20)

Then, for either the Dirichlet or the Neumann problem in C(I), for \(\lambda \in \Sigma _{\theta _0}\) with any \(\theta _0<\pi \), we have, for \(\omega = D,N\),

$$\begin{aligned} \Vert u^\omega \Vert _\infty&\le |C^\omega _1| +|C^\omega _2| e^{\mathfrak {R}\mu } + \frac{\Vert f\Vert _\infty }{|\lambda |\cos \theta _0/2}\nonumber \\&\le \frac{\Vert f\Vert _\infty }{|\lambda |\cos \theta _0/2}\left( \frac{1}{1-e^{-2\mathfrak {R}\mu }} \left( \frac{1-e^{-2\mathfrak {R}\mu }}{2} + \frac{1-e^{-2\mathfrak {R}\mu }}{2}\right) + 1\right) \nonumber \\&\le \frac{2\Vert f\Vert _\infty }{|\lambda |\cos \theta _0/2}. \end{aligned}$$
(21)

Similarly, in \(L_1(I)\), we have

$$\begin{aligned} \Vert u^\omega \Vert _0&\le |C^\omega _1|\frac{1-e^{-\mathfrak {R}\mu }}{\mathfrak {R}\mu } +|C^\omega _2| \frac{e^{\mathfrak {R}\mu }-1}{\mathfrak {R}\mu } + \frac{\Vert f\Vert _0}{|\lambda |\cos \theta _0/2}\nonumber \\&\le \frac{\Vert f\Vert _0}{|\lambda |\cos \theta _0/2}\left( \frac{1}{1-e^{-2\mathfrak {R}\mu }} \left( \frac{1-e^{-2\mathfrak {R}\mu }}{2} + \frac{1-e^{-2\mathfrak {R}\mu }}{2}\right) + 1\right) \nonumber \\&\le \frac{2\Vert f\Vert _0}{|\lambda |\cos \theta _0/2}. \end{aligned}$$
(22)

Consider now \(A^1_0,\) see (13); that is, the scalar (\(m=1\)) operator corresponding to the Neumann boundary conditions. We have

Proposition 2.1

\(A^1_0\) generates an analytic semigroup in \(W^1_1(I)\) with the resolvent \(R(\lambda , A^1_0)\) satisfying the estimate

$$\begin{aligned} \Vert R(\lambda , A^1_0)f\Vert _{1} \le \frac{2\Vert f\Vert _{1}}{|\lambda |\cos \theta _0/2}, \qquad \lambda \in \Sigma _{\theta _0}. \end{aligned}$$
(23)

Proof

Consider the resolvent equation

$$\begin{aligned} \lambda u - \partial _{xx}u = f, \quad \partial _x u(0)=\partial _x u(1) = 0. \end{aligned}$$

If \(f \in W^1_1(I)\), then \(u\in W^3_1(I)\) and we can differentiate the differential equation, obtaining, for \(v:=\partial _xu\),

$$\begin{aligned} \lambda v - \partial _{xx} v = \partial _x f, \quad v(0)=v(1) =0; \end{aligned}$$
(24)

that is, v satisfies the resolvent equation for the Dirichlet problem. Hence

$$\begin{aligned} \Vert u\Vert _{1} = \Vert u\Vert _0 + \Vert \partial _xu\Vert _0 \le \frac{2\Vert f\Vert _0}{|\lambda |\cos \theta _0/2} + \frac{2\Vert \partial _xf\Vert _0}{|\lambda |\cos \theta _0/2} = \frac{2\Vert f\Vert _{1}}{|\lambda |\cos \theta _0/2}. \end{aligned}$$

\(\square \)

Since \(\lambda u - \sigma \partial _{xx}u = f\) is equivalent to \(\sigma ^{-1}\lambda u - \partial _{xx}u = \sigma ^{-1}f\), we see that the estimates above are independent of \(\sigma \).

2.2 Solvability of (5)

The ideas in this section are based on [5, 6] but the analysis is simplified by using the analyticity of the semigroup and the application of [22, Theorem 2.4].

Since in the vector case and for the Neumann boundary conditions the system of the resolvent equations decouples, we obtain

$$\begin{aligned} \Vert R(\lambda , {\mathbf {A}}_0)f\Vert _{{\mathbf {X}}} \le \frac{2\Vert f\Vert _{{\mathbf {X}}}}{|\lambda |\cos \theta _0/2}, \qquad \lambda \in \Sigma _{\theta _0} \end{aligned}$$
(25)

for any \(\theta _0<\pi \) and \({\mathbf {X}}={\mathbf {C}}(I), {\mathbf {L}}_1(I), {\mathbf {W}}^1_1(I)\). Hence, in particular, \({\mathbf {A}}_0\) generates an analytic semigroup in \({\mathbf {X}}\).

We begin with a straightforward consequence of [22].

Theorem 2.2

Let \({\mathbf {X}}\) be either \({\mathbf {C}}(I)\) or \({\mathbf {W}}^1_1(I)\). Then the operator \({\mathbf {A}}_\Phi \) generates an analytic semigroup in \({\mathbf {X}}\) with the resolvent \(R(\lambda , {\mathbf {A}}_\Phi )\) satisfying the estimate

$$\begin{aligned} |||R(\lambda , {\mathbf {A}}_\Phi )|||_{{\mathbf {X}}} \le 2|||R(\lambda , {\mathbf {A}}_0)|||_{{\mathbf {X}}}, \qquad \lambda \in \Sigma _{\alpha , \theta _0} \end{aligned}$$
(26)

for some \(\alpha \ge 0\).

Proof

The result follows directly from [22, Theorem 2.4], since \({\mathbf {A}}_0\) generates an analytic semigroup in \({\mathbf {X}}\), and L is an unbounded operator on \({\mathbf {X}}\) which is, however, bounded as an operator from \(D({\mathbf {A}})\) to \({\mathbf {Y}}\), and is a surjection (the right inverse in each case is given by \([L^{-1}_r({\mathbf {y}}_0,{\mathbf {y}}_1)](x) = \frac{1}{2}({\mathbf {y}}_1- {\mathbf {y}}_0)x^2 +{\mathbf {y}}_0 x\)). Furthermore, \(\Phi \) is a bounded operator on \({\mathbf {X}}\) (if \({\mathbf {X}} = {\mathbf {W}}^1_1(I)\) this follows since \({\mathbf {W}}^1_1(I)\)-functions are absolutely continuous on I). Hence, the assumptions [22, (1.13)] are satisfied and the theorem follows from [22, Theorem 2.4]. \(\square \)
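The computation behind the right inverse \(L^{-1}_r\) can be confirmed symbolically; a minimal sketch of ours:

```python
import sympy as sp

# Check that u = L_r^{-1}(y0, y1) from the proof satisfies
# L u = (u'(0), u'(1)) = (y0, y1), cf. (11).
x, y0, y1 = sp.symbols('x y0 y1')
u = sp.Rational(1, 2) * (y1 - y0) * x**2 + y0 * x
du = sp.diff(u, x)
print(du.subs(x, 0), sp.simplify(du.subs(x, 1)))   # prints y0 and y1
```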

In \({\mathbf {X}} = {\mathbf {L}}_1(I)\) the situation is more complicated as \(\Phi \) is not bounded on \({\mathbf {X}}\). First we show that \(R(\lambda , {\mathbf {A}}^1_\Phi )\) extends to a resolvent on \({\mathbf {L}}_1(I)\).

Lemma 2.3

We have

$$\begin{aligned} \overline{{\mathbf {A}}^1_\Phi }^{{\mathbf {L}}_1(I)} = {\mathbf {A}}^0_\Phi \end{aligned}$$

and the resolvent set of \({\mathbf {A}}^0_\Phi \) satisfies \(\rho ({\mathbf {A}}^0_\Phi )\cap {\mathbb {R}}_+ \ne \emptyset \).

Proof

Let \(D({\mathbf {A}}^1_\Phi )\ni {\mathbf {u}}_n \rightarrow {\mathbf {u}} \in {\mathbf {L}}_1(I)\) and \({\mathbf {A}}_\Phi ^1 {\mathbf {u}}_n = \mathsf A {\mathbf {u}}_n \rightarrow {\mathbf {v}} \in {\mathbf {L}}_1(I)\). This shows that \({\mathbf {u}}_n\rightarrow {\mathbf {u}}\) in \({\mathbf {W}}^2_1(I)\) and thus \({\mathbf {v}} = \mathsf A{\mathbf {u}}\). Since taking the trace of a \({\mathbf {W}}^2_1(I)\) function and of its derivative is continuous in \({\mathbf {W}}^2_1(I)\), the boundary values of \({\mathbf {u}}_n\) and \(\partial _x{\mathbf {u}}_n\) are preserved in the limit and thus \({\mathbf {u}} \in D({\mathbf {A}}^0_\Phi )\). Hence

$$\begin{aligned} \overline{{\mathbf {A}}^1_\Phi }^{{\mathbf {L}}_1(I)} \subset {\mathbf {A}}^0_\Phi . \end{aligned}$$

On the other hand, let \({\mathbf {u}} \in D({\mathbf {A}}^0_\Phi )\). Then \({\mathbf {v}} ={\mathbf {u}} -{\mathbf {f}} \in \mathring{{\mathbf {W}}}{}^2_1(I)\), where

$$\begin{aligned} {\mathbf {f}}(x)= & {} (2{\mathbf {u}}(0) + {\mathbf {u}}'(0) -2{\mathbf {u}}(1) +{\mathbf {u}}'(1))x^3\\&-(3{\mathbf {u}}(0) + 2{\mathbf {u}}'(0) -3{\mathbf {u}}(1) +{\mathbf {u}}'(1))x^2 + {\mathbf {u}}'(0)x + {\mathbf {u}}(0) \in D({\mathbf {A}}^0_\Phi ). \end{aligned}$$
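The interpolation identities behind this choice of \({\mathbf {f}}\), namely \({\mathbf {f}}^{(k)}(i)={\mathbf {u}}^{(k)}(i)\) for \(i=0,1\), \(k=0,1\), can be verified symbolically; a quick sketch of ours, with u0, du0, u1, du1 standing for the boundary data:

```python
import sympy as sp

# f is the cubic Hermite interpolant of the boundary data of u, so that
# v = u - f and its derivative vanish at x = 0 and x = 1.
x, u0, du0, u1, du1 = sp.symbols('x u0 du0 u1 du1')
f = ((2*u0 + du0 - 2*u1 + du1) * x**3
     - (3*u0 + 2*du0 - 3*u1 + du1) * x**2 + du0 * x + u0)
df = sp.diff(f, x)
print([sp.expand(expr) for expr in
       (f.subs(x, 0) - u0, df.subs(x, 0) - du0,
        f.subs(x, 1) - u1, df.subs(x, 1) - du1)])   # [0, 0, 0, 0]
```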

Thus there is a sequence \(({{\mathbf {h}}}_{n})_{{n}\in {\mathbb {N}}} \subset {\mathbf {C}}^{\infty }_0(I)\) converging to \({\mathbf {v}}\) in \({\mathbf {W}}^2_1(I)\) and hence \({\mathbf {u}}\) is the limit in \({\mathbf {W}}^2_1(I)\) of the functions \({\mathbf {u}}_n = {\mathbf {h}}_n + {\mathbf {f}} \in D({\mathbf {A}}^1_\Phi )\). Since convergence in \({\mathbf {W}}^2_1(I)\) implies the convergence of both \({\mathbf {u}}_n\) and \(\mathsf A{\mathbf {u}}_n\) in \({\mathbf {L}}_1(I)\), we see that also

$$\begin{aligned} \overline{{\mathbf {A}}^1_\Phi }^{{\mathbf {L}}_1(I)} \supset {\mathbf {A}}^0_\Phi . \end{aligned}$$

Finally, we see that, as in the scalar case (15), the solvability of the resolvent problem for (5) is equivalent to the solvability of the linear system

$$\begin{aligned} -{\mathbb {M}} {\mathbf {C}}_1 + {\mathbb {M}} {\mathbf {C}}_2= & {} {\mathbb {K}}^{00}({\mathbf {C}}_1 + {\mathbf {C}}_2) +{\mathbb { K}}^{01}({\mathbb {E}}(-{\varvec{\mu }}) {\mathbf {C}}_1 + {\mathbb {E}}({\varvec{\mu }}) {\mathbf {C}}_2)\nonumber \\&-{\mathbb {M}} {\mathbf {U}}_{{\varvec{\mu }}} (0) + {\mathbb {K}}^{00} {\mathbf {U}}_{{\varvec{\mu }}}(0)+{\mathbb { K}}^{01} {\mathbf {U}}_{{\varvec{\mu }}}(1),\nonumber \\ -{\mathbb {M}} {\mathbb {E}}(-{\varvec{\mu }}){\mathbf {C}}_1 + {\mathbb {M}} {\mathbb {E}}({\varvec{\mu }}) {\mathbf {C}}_2= & {} {\mathbb {K}}^{10}({\mathbf {C}}_1 + {\mathbf {C}}_2) + {\mathbb {K}}^{11}({\mathbb {E}}(-{\varvec{\mu }}) {\mathbf {C}}_1 + {\mathbb {E}}({\varvec{\mu }}) {\mathbf {C}}_2)\nonumber \\&+{\mathbb {M}} {\mathbf {U}}_{{\varvec{\mu }}} (1) +{\mathbb { K}}^{10} {\mathbf {U}}_{{\varvec{\mu }}}(0)+{\mathbb { K}}^{11} {\mathbf {U}}_{{\varvec{\mu }}}(1), \end{aligned}$$
(27)

for the vectors \({\mathbf {C}}_1, {\mathbf {C}}_2\), where \({\mathbb {M}}=\mathrm {diag}\{\mu _i\}_{1\le i\le m}\). Once the constants are found, the solution is given by the vector version of (15) and thus belongs to \({\mathbf {W}}^2_1(I)\subset {\mathbf {L}}_1(I)\). The system above is solvable provided its determinant is nonzero. However, the determinant is clearly an entire function of \({\varvec{\mu }}\) and thus can have only isolated zeros in \({\mathbb {C}}\). Hence, there must be positive values of \(\lambda \) in \(\rho ({\mathbf {A}}^0_\Phi )\). \(\square \)
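To illustrate this last step, one can assemble the \(2m\times 2m\) matrix of the homogeneous part of (27) and observe that its determinant is nonzero at sample positive values of \(\lambda \); the sketch below is ours, with randomly generated, purely hypothetical matrices \({\mathbb {K}}^{\omega }\).

```python
import numpy as np

# Assemble the 2m x 2m matrix of (27) for hypothetical data and check
# that its determinant does not vanish at a few positive lambda.
m = 2
rng = np.random.default_rng(0)
K00, K01, K10, K11 = (rng.standard_normal((m, m)) for _ in range(4))
sigma = np.array([1.0, 0.5])

def system_matrix(lam):
    mu = np.sqrt(lam / sigma)               # Re mu > 0 for lambda > 0
    M = np.diag(mu)
    Em, Ep = np.diag(np.exp(-mu)), np.diag(np.exp(mu))
    row0 = np.hstack([-M - K00 - K01 @ Em, M - K00 - K01 @ Ep])
    row1 = np.hstack([-M @ Em - K10 - K11 @ Em, M @ Ep - K10 - K11 @ Ep])
    return np.vstack([row0, row1])

for lam in (1.0, 10.0, 100.0):
    print(lam, np.linalg.det(system_matrix(lam)))   # nonzero at these samples
```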

Theorem 2.4

The operator \({\mathbf {A}}^0_\Phi \) generates an analytic semigroup in \({\mathbf {L}}_1(I).\)

Proof

We use the formula from [22, Lemma 1.4], which states that

$$\begin{aligned} R(\lambda , {\mathbf {A}}^1_\Phi ) = (I-L_\lambda \Phi )^{-1}R(\lambda , {\mathbf {A}}^1_N) \end{aligned}$$
(28)

in \({\mathbf {X}} = {\mathbf {W}}^1_1(I),\) as \((I-L_\lambda \Phi )^{-1}\) is an isomorphism of \({\mathbf {X}}\) for large \(|\lambda |\); here \(L_\lambda = (L|_{\mathrm {Ker}\, (\lambda - {\mathbf {A}})})^{-1}, \lambda \in \rho ({\mathbf {A}}^1_N)\), see the proof of [22, Theorem 2.4].

However, \(R(\lambda , {\mathbf {A}}^1_N)\) extends by density to \(R(\lambda , {\mathbf {A}}^0_N)\) (the resolvent on \({\mathbf {L}}_1(I)\)), which is a bounded linear operator from \({\mathbf {L}}_1(I)\) to \(D({\mathbf {A}}^0_N)\subset {\mathbf {W}}^2_1(I).\) Since the latter space is continuously embedded in \({\mathbf {W}}^1_1(I),\) \(R(\lambda , {\mathbf {A}}^1_\Phi )\) extends to a bounded linear operator, say \(R_\Phi (\lambda )\), on \({\mathbf {L}}_1(I)\). Since \(R(\lambda , {\mathbf {A}}^1_\Phi )\) is a resolvent on \({\mathbf {W}}^1_1(I)\), we obtain, by density, that \(R_\Phi (\lambda )\) is a pseudoresolvent on \({\mathbf {L}}_1(I)\). Furthermore, \({\mathbf {C}}_0^\infty (]0,1[)\subset D({\mathbf {A}}^1_\Phi ),\) thus it is also a subset of the range of \(R_\Phi (\lambda )\) and therefore the range is dense in \({\mathbf {L}}_1(I)\). Also, since the range of \(R(\lambda , {\mathbf {A}}^0_N)\) is in \({\mathbf {W}}^2_1(I)\subset {\mathbf {W}}^1_1(I),\) there is no need to extend \((I-L_\lambda \Phi )^{-1}\). Hence \(R_\Phi (\lambda )\) is a one-to-one operator as a composition of two injective operators. Thus, by [23, Proposition III.4.6], \(R_\Phi (\lambda )\) is the resolvent of a densely defined operator, say \(\hat{{\mathbf {A}}}_\Phi \). Clearly, \(\hat{{\mathbf {A}}}_\Phi \) is a closed extension of \({\mathbf {A}}^1_\Phi \) and thus \(\hat{{\mathbf {A}}}_\Phi \supset {{\mathbf {A}}}^0_\Phi = \overline{{\mathbf {A}}^1_\Phi }^{{\mathbf {L}}_1(I)}. \) From Lemma 2.3, there is \(\lambda \in \rho (\hat{{\mathbf {A}}}_\Phi ) \cap \rho ({\mathbf {A}}^0_\Phi ),\) thus \(R(\lambda , \hat{{\mathbf {A}}}_\Phi )\supset R(\lambda , {{\mathbf {A}}}^0_\Phi )\) and hence \(\hat{{\mathbf {A}}}_\Phi = {{\mathbf {A}}}^0_\Phi \); consequently, \(R(\lambda , {\mathbf {A}}^0_\Phi )\) is defined in some sector.

To prove that \({\mathbf {A}}^0_\Phi \) is sectorial, we use the idea of [6], but in a somewhat simpler way. We consider the operator \({\mathbf {A}}^\infty _{\Phi ^*}.\) Let \(R^{\#}_\lambda \) denote the adjoint of \(R(\lambda , {\mathbf {A}}^\infty _{\Phi ^*})\) for \(\lambda \in \rho ({\mathbf {A}}^\infty _{\Phi ^*})\), which acts in the space of signed (vector) Borel measures on I, [25]. \(R(\lambda , {\mathbf {A}}^\infty _{\Phi ^*})\) is given by (15) with \({\mathbf {C}}_1, {\mathbf {C}}_2\) given by (27) with the matrices \({\mathbb {K}}^\omega \) replaced by the corresponding matrices in (12). Hence, apart from \({\mathbf {U}}_{{\varvec{\mu }}}(x)\), \(R(\lambda , {\mathbf {A}}^\infty _{\Phi ^*})\) is a composition of an algebraic operator coming from inverting the matrix in (27) with a vector of functionals acting on \({\mathbf {f}}\). Thus, if \({\mathbf {f}}\in {\mathbf {L}}_1(I)\) is the density of an absolutely continuous measure, a standard calculation shows that

$$\begin{aligned} R^{\#}_\lambda {\mathbf {f}} = R(\bar{\lambda }, {\mathbf {A}}^0_\Phi ){\mathbf {f}}. \end{aligned}$$

From the definition of the norm of a signed Borel measure, if the latter is absolutely continuous, its norm is equal to the \(L_1\) norm of its density. Since taking the adjoint preserves the norm of the operator, we obtain

$$\begin{aligned} |||R(\lambda , {\mathbf {A}}^0_\Phi )|||_{{\mathbf {L}}_1(I)} = |||R(\lambda , {\mathbf {A}}^\infty _{\Phi ^*})|||_{{\mathbf {C}}(I)}. \end{aligned}$$
(29)

Hence, by Theorem 2.2, \({\mathbf {A}}^0_\Phi \) is sectorial and, being densely defined, it generates an analytic semigroup on \({\mathbf {L}}_1(I)\). \(\square \)

2.3 Positivity of the semigroup

Let us recall that for an element \({\mathbf {u}}\) of a Banach lattice, we write \({\mathbf {u}} >0\) if \(0\ne {\mathbf {u}}\ge 0.\) We have the following result.

Theorem 2.5

The semigroup \(\{e^{t{\mathbf {A}}^\infty _\Phi }\}_{t \ge 0}\) is positive if and only if

$$\begin{aligned}&-{\mathbb {K}}^{00}, \quad {\mathbb {K}}^{11}\ \text {are nonnegative off-diagonal},\nonumber \\&\quad -{\mathbb {K}}^{01}, \quad {\mathbb {K}}^{10}\ \text {are nonnegative}. \end{aligned}$$
(30)

Proof

To prove the result we use Theorem B-II.1.6 in [26], which states that if an operator A on C(K) (where K is compact) generates a semigroup, then the semigroup is positive (or, equivalently, A is resolvent positive) if and only if A satisfies the positive minimum principle: for every \(0\le f\in D(A)\) and \(x\in K\), if \(f(x)=0\), then \((Af)(x)\ge 0\). However, to rephrase the problem at hand in the language of the positive minimum principle, we have to work in the space of real valued continuous functions. To achieve this, we identify \({\mathbf {C}}(I)\) with the space \(C({\mathbf {I}}),\) where \({\mathbf {I}} = [0_1,1_1]\cup \cdots \cup [0_m,1_m]\); that is, instead of considering a vector function on I, we consider a scalar function on a disconnected compact space composed of m disjoint closed intervals. In particular, each edge \(e_j, j\in \mathcal {M},\) is identified with the closed interval \([0_j,1_j].\) Then \({\mathbf {A}}^\infty _\Phi \) is changed to \(A^\infty _\Phi \), which is the restriction of

$$\begin{aligned} (Au)(x) = \sum \limits _{j\in \mathcal {M}}\chi _{[0_j,1_j]}(x)\sigma _j\partial _{xx}u(x) \end{aligned}$$

to the space of functions twice differentiable on \(\mathrm {Int}\, {\mathbf {I}} = ]0_1,1_1[\cup \cdots \cup ]0_m,1_m[\), differentiable on \({\mathbf {I}}\) and satisfying

$$\begin{aligned} \partial _xu(0_j)= & {} \sum \limits _{k=1}^m k^{00}_{jk}u(0_k) + \sum \limits _{k=1}^m k^{01}_{jk}u(1_k),\\ \partial _xu(1_j)= & {} \sum \limits _{k=1}^m k^{10}_{jk}u(0_k) + \sum \limits _{k=1}^m k^{11}_{jk}u(1_k). \end{aligned}$$

Thus, if \(0\le u \in D(A^\infty _\Phi )\) takes the value 0 at some \(x\in {\mathbf {I}}\), then either \(x \in \mathrm {Int}\, {\mathbf {I}},\) and then classically \(\partial _{xx} u (x)\ge 0,\) or \(x = 0_j\) or \(x=1_j\) for some \(j \in \mathcal {M}\). If \(x=0_j\), then \(\partial _{x} u(0_j) \ge 0\). If \(\partial _{x} u(0_j) = 0,\) then \(\partial _{xx} u(0_j)\ge 0\) follows as in [26, Example B-II.1.24] (or simply by noting that the even extension to \([-1,0]\) gives a \(C^2\) function on \([-1,1]\) with a minimum at \(x=0\)). If \(\partial _x u(0_j)<0\) then, since \(u(0_j)=0,\) such a function cannot be nonnegative on \([0_j,1_j]\). Analogous considerations hold if \( u(1_j)=0\). Therefore the positive minimum principle is satisfied and (30) yields the positivity of \(\{e^{tA^\infty _\Phi }\}_{t \ge 0}\) and thus of \(\{e^{t{\mathbf {A}}^\infty _\Phi }\}_{t \ge 0}\).

To prove the converse we introduce the following notation. Let

$$\begin{aligned} \Xi :=\left( \begin{array}{c@{\quad }c} -{\mathbb {K}}^{00}&{}-{\mathbb {K}}^{01}\\ {\mathbb {K}}^{10}&{}{\mathbb {K}}^{11}\end{array}\right) . \end{aligned}$$
(31)

Accordingly, for \({\varvec{\alpha }}:=({\varvec{\alpha }}^0,{\varvec{\alpha }}^1)\in {\mathbb {R}}^m\times {\mathbb {R}}^m,\) we denote

$$\begin{aligned} (\Xi {\varvec{\alpha }})^r_s = (-1)^{r+1}\sum \limits _{l=0,1}\left( \sum \limits _{j\in \mathcal {M}}k^{rl}_{sj}\alpha ^l_j \right) , \quad r=0,1, s\in \mathcal {M}. \end{aligned}$$

Let us assume that (30) is not satisfied. Then there is an off-diagonal element of \(\Xi \) which is strictly negative. Suppose \((-1)^{r+1}k^{rs}_{ij}<0\) for some \(i\ne j\) and \(r,s =0,1,\) and consider a vector \({\varvec{\alpha }}\) with

$$\begin{aligned} \alpha ^s_j=1, \quad \alpha ^r_i=0, \quad \alpha ^t_l = \delta >0 \quad \mathrm {for}\; t=0,1, \quad l\ne j\;\mathrm {if}\;t = s\;\mathrm {and}\; l\ne i\;\mathrm {if}\;t= r. \end{aligned}$$
(32)

Then for \(r=s\) we obtain

$$\begin{aligned} (\Xi {\varvec{\alpha }})^r_i= (-1)^{r+1}\left( k^{rr}_{ij} +\delta \left( \sum \limits _{l\in \mathcal {M}, l\ne j,i}k^{rr}_{il} + \sum \limits _{l\in \mathcal {M}}k^{rt}_{il}\right) \right) <0, \end{aligned}$$
(33)

where \(t=0\) if \(r=1\) and \(t=1\) if \(r=0\), while for \(r\ne s\)

$$\begin{aligned} (\Xi {\varvec{\alpha }})^r_i= (-1)^{r+1}\left( k^{rt}_{ij} +\delta \left( \sum \limits _{l\in \mathcal {M}, l\ne i}k^{rr}_{il} + \sum \limits _{l\in \mathcal {M}, l\ne j}k^{rt}_{il}\right) \right) <0 \end{aligned}$$
(34)

for sufficiently small \(\delta >0\). We shall prove that there exists a function \(0\le u \in C^\infty ({\mathbf {I}})\) satisfying \(u (r_i) = \alpha ^r_i\) and \((-1)^{r+1}\partial _x u(r_i) = (\Xi {\varvec{\alpha }})^r_i\) which additionally satisfies \(\partial _{xx}u (r_i)<0\).

For given constants \(\alpha _i^r\) and \(\beta _i^r = (\Xi {\varvec{\alpha }})^r_i, r=0,1,\) we consider the auxiliary functions \(f_i^r(x) = \beta _i^rx(x-1) +\alpha _i^r\). We have \(f_i^r(r) = \alpha _i^r, \partial _x f_i^r(r) = (-1)^{r+1}\beta _i^r\) and \(\partial _{xx} f_i^r = 2\beta _i^r\). We observe that as long as \(\alpha _i^r>0\), there is a one-sided interval (\(]0,\omega _i[\) if \(r=0\) and \(]\omega _i, 1[\) if \(r=1\)) where \(f_i^r\ge 0\) irrespective of the sign of \(\beta _i^r\). On the other hand, if \(\alpha _i^r=0\), for local nonnegativity we need \(\beta _i^r\le 0\). Now let \(\phi \) be a nonnegative \(C^\infty \) function which is 1 on \([-a,a]\) and 0 outside \([-2a,2a]\), where \(0<2a <\min _{i\in \mathcal {M}}\omega _i\), and define

$$\begin{aligned} u (x) = \phi (x) f_j^0 (x) + \phi (1-x) f^1_j(x), \quad x \in [0_j,1_j], j \in \mathcal {M}. \end{aligned}$$

Then u is a \(C^\infty ({\mathbf {I}})\) function that satisfies:

  1. \(u\ge 0\);

  2. \( u(0_j) = \alpha ^0_j\) and \(\partial _x u(0_j) = \partial _x f^0_j(0_j) = -\beta ^0_j = -(\Xi {\varvec{\alpha }})^0_j\);

  3. \( u(1_j) =\alpha ^1_j\) and \(\partial _x u(1_j) = \partial _x f^1_j(1_j) = \beta ^1_j = (\Xi {\varvec{\alpha }})^1_j,\)

so that \(0\le u \in D(A^\infty _\Phi ).\) Recalling that we assumed \((-1)^{r+1}k^{rs}_{ij}<0\) for some \(i\ne j\) and \(r,s =0,1,\) we consider coefficients \(\{\alpha ^t_j\}_{t=0,1, j\in \mathcal {M}}\) satisfying (32). Then we have \( u(r_i) =0\). On the other hand, \(\partial _{xx} u(r_i) = 2\beta ^r_i = 2(\Xi {\varvec{\alpha }})^r_i<0\) by (33) or (34). Thus, there is a nonnegative element \(u\in D(A^\infty _\Phi )\) for which \(\partial _{xx} u <0\) at a point where the global minimum of zero is attained, so the positive minimum principle is violated and the semigroup cannot be positive. \(\square \)
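Condition (30) is straightforward to test mechanically. Below is a small helper of ours (illustrative; the rate values are hypothetical) that checks (30) for given blocks, applied to matrices of the Fick-law type (4):

```python
import numpy as np

def satisfies_condition_30(K00, K01, K10, K11):
    """Check (30): -K00 and K11 nonnegative off-diagonal, -K01 and K10 nonnegative."""
    off = ~np.eye(K00.shape[0], dtype=bool)
    return (np.all(-K00[off] >= 0) and np.all(K11[off] >= 0)
            and np.all(-K01 >= 0) and np.all(K10 >= 0))

# Matrices of the type (4), built from nonnegative rates (hypothetical values):
K00 = np.array([[0.4, -0.2], [-0.1, 0.5]])   # l_i on the diagonal, -l_ij off it
K11 = np.array([[-0.7, 0.3], [0.2, -0.6]])   # -r_i on the diagonal, r_ij off it
K01 = np.array([[0.0, -0.1], [-0.1, 0.0]])   # -l_ij
K10 = np.array([[0.0, 0.2], [0.3, 0.0]])     # r_ij
print(satisfies_condition_30(K00, K01, K10, K11))   # True
```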

We can use this result to prove an analogous result in \({\mathbf {L}}_1(I)\).

Corollary 2.6

The operator \({\mathbf {A}}^0_\Phi \) generates a positive semigroup if and only if the assumptions of Theorem 2.5 are satisfied.

Proof

In one direction the result immediately follows by the density of \({\mathbf {C}}(I)\) in \({\mathbf {L}}_1(I)\). Conversely, if \(\{e^{t{\mathbf {A}}^0_\Phi }\}_{t \ge 0}\ge 0\) then, in particular, for any \(0\le \mathring{{\mathbf {u}}}\in {\mathbf {C}}(I)\) we have \(e^{t{\mathbf {A}}^0_\Phi }\mathring{{\mathbf {u}}} = e^{t{\mathbf {A}}^\infty _\Phi }\mathring{{\mathbf {u}}}\ge 0\). \(\square \)

3 Solvability of (8)

We return to the problem (8) where, we emphasize, \({\mathbb {K}}\) is an arbitrary matrix. In this section we restrict our attention to \({\mathbf {X}} = {\mathbf {L}}_1(I)\) as in \({\mathbf {C}}(I)\) the operator \({\mathbf {A}}\) is not densely defined. Let us recall that \({\mathbf {A}}\) is the realization of \(\mathsf{A} = \mathrm {diag}\{-c_j \partial _x\}_{1\le j\le m}\) on the domain \(D({\mathbf {A}}) = \{{\mathbf {u}} \in {\mathbf {W}}^{1}_1(I);\; {\mathbf {u}}(0) = {\mathbb {K}}{\mathbf {u}}(1)\}\).

The following theorem for (7) was proved in [7] (see also [27]), and the proof for an arbitrary nonnegative matrix \({\mathbb {K}}\) is practically the same. Here we extend this result to an arbitrary \({\mathbb {K}}\); however, since the proof in the general case builds on the nonnegative one, we first recall the basic steps of the latter.

Theorem 3.1

The operator \(({\mathbf {A}}, D({\mathbf {A}}))\) generates a \(C_0\)-semigroup on \({\mathbf {L}}_1(I)\). The semigroup is positive if and only if \({\mathbb {K}}\ge 0\).

Proof

Clearly, \({\mathbf {C}}_0^\infty (]0,1[) \subset D({\mathbf {A}})\) and hence \(D({\mathbf {A}})\) is dense in \({\mathbf {X}}\). To find the resolvent of \({\mathbf {A}},\) the first step is to solve

$$\begin{aligned} \lambda u_j + c_j \partial _x u_j = f_j, \quad j=1,\ldots ,m,\quad x \in ]0,1[, \end{aligned}$$
(35)

with \((u_1,\ldots ,u_m)={\mathbf {u}} \in D({\mathbf {A}})\). Integrating, we find

$$\begin{aligned} {\mathbf {u}}(x) = {\mathbb {E}}_\lambda (x){\mathbf {v}} +{\mathbb {C}}^{-1} \int \limits _{0}^{x}{\mathbb {E}}_\lambda (x-s){\mathbf {f}}(s)ds, \end{aligned}$$
(36)

where \({\mathbf {v}} = (v_1,\ldots ,v_m)\) is an arbitrary vector and \({\mathbb {E}}_{\lambda }(s) = \mathrm {diag}\left\{ e^{-\frac{\lambda }{c_j}s}\right\} _{1\le j\le m}.\) To determine \({\mathbf {v}}\) so that \({\mathbf {u}} \in D({\mathbf {A}})\), we use the boundary conditions. At \(x=1\) and at \(x=0\) we obtain, respectively

$$\begin{aligned} {\mathbf {u}}(1) = {\mathbb {E}}_\lambda (1){\mathbf {v}} + {\mathbb {C}}^{-1}\int \limits _{0}^{1}{\mathbb {E}}_\lambda (1-s){\mathbf {f}}(s)ds, \qquad {\mathbf {u}}(0) = {\mathbf {v}}. \end{aligned}$$

Then, using the boundary condition \({\mathbf {u}}(0) = {\mathbb {K}}{\mathbf {u}}(1)\) we obtain

$$\begin{aligned} (\mathbb {I} - {\mathbb {K}}{\mathbb {E}}_\lambda (1)){\mathbf {v}} = {\mathbb {K}}{\mathbb {C}}^{-1} \int \limits _{0}^{1}{\mathbb {E}}_\lambda (1-s){\mathbf {f}}(s)ds. \end{aligned}$$
(37)

Since the norm of \({\mathbb {E}}_\lambda (1)\) can be made as small as one wishes by taking large \(\lambda \), we see that \({\mathbf {v}}\) is uniquely defined by the Neumann series provided \(\lambda \) is sufficiently large and hence the resolvent of \({\mathbf {A}}\) exists.
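Formulas (36) and (37) translate directly into a small linear solve. The following sketch of ours (hypothetical speeds, a column-stochastic \({\mathbb {K}}\) and sample data) computes \(R(\lambda ,{\mathbf {A}}){\mathbf {f}}\) and confirms the boundary condition \({\mathbf {u}}(0)={\mathbb {K}}{\mathbf {u}}(1)\):

```python
import numpy as np
from scipy.integrate import quad

# Resolvent of the transport problem via (36)-(37), for hypothetical data.
c = np.array([1.0, 2.0])                       # speeds c_j
K = np.array([[0.2, 0.7], [0.8, 0.3]])         # a column-stochastic example
lam = 5.0
E1 = np.diag(np.exp(-lam / c))                 # E_lambda(1)
f = [lambda s: np.sin(np.pi * s), lambda s: s * (1.0 - s)]

# g = C^{-1} int_0^1 E_lambda(1-s) f(s) ds, the right-hand side of (37)
g = np.array([quad(lambda s: np.exp(-lam * (1.0 - s) / c[j]) * f[j](s), 0.0, 1.0)[0] / c[j]
              for j in range(2)])
v = np.linalg.solve(np.eye(2) - K @ E1, K @ g)  # solve (37) for v

def u(x):                                       # (36)
    integ = np.array([quad(lambda s: np.exp(-lam * (x - s) / c[j]) * f[j](s), 0.0, x)[0] / c[j]
                      for j in range(2)])
    return np.exp(-lam * x / c) * v + integ

print(np.abs(u(0.0) - K @ u(1.0)))              # boundary condition: ~ 0
```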

Let us first consider \({\mathbb {K}}\ge 0\). Then the Neumann series expansion ensures that \({\mathbf {A}}\) is a resolvent positive operator and hence we can consider only \({\mathbf {f}}\ge 0\). Adding together the rows in (37) we obtain

$$\begin{aligned} \sum \limits _{j=1}^{m} v_j = \sum \limits _{j=1}^{m} \kappa _je^{-\frac{\lambda }{c_j}}v_j + \sum \limits _{j=1}^{m} \frac{\kappa _j}{c_j} \int \limits _{0}^{1}e^{\frac{\lambda }{c_j}(s-1)}f_j(s)ds, \end{aligned}$$
(38)

where \(\kappa _j = \sum \limits _{i=1}^m k_{ij}.\) Renorming \({\mathbf {X}}\) with the norm \(\Vert {\mathbf {u}}\Vert _c = \sum _{j=1}^m c^{-1}_j\Vert u_j\Vert _{L_1(I)}\) and using (36) and (38), we obtain

$$\begin{aligned} \Vert {\mathbf {u}}\Vert _c = \frac{1}{\lambda } \sum \limits _{j=1}^m\! v_je^{-\frac{\lambda }{c_j}}(\kappa _j\!-1) + \frac{1}{\lambda } \sum \limits _{j=1}^m\!\frac{\kappa _j\!-1}{c_j}\!\! \int \limits _{0}^{1} \!\! e^{\frac{\lambda }{c_j}(s-1)}f_j(s)ds + \frac{1}{\lambda } \Vert {\mathbf {f}}\Vert _c. \end{aligned}$$

We consider three cases.

(a) \(\kappa _j \le 1\) for \(j\in \mathcal {M}.\) Then \(\Vert {\mathbb {K}}{\mathbb {E}}_\lambda (1)\Vert <1\) and thus \({\mathbf {v}},\) and hence \(R(\lambda , {\mathbf {A}})\), are defined and positive for any \(\lambda >0\). Further, dropping the first two terms in the above expression for \(\Vert {\mathbf {u}}\Vert _c\), we get

    $$\begin{aligned} \Vert {\mathbf {u}}\Vert _c \le \frac{1}{\lambda } \sum \limits _{j=1}^m \frac{1}{c_j}\int \limits _{0}^{1} f_j(s)ds = \frac{1}{\lambda }\Vert {\mathbf {f}}\Vert _c, \quad \lambda >0. \end{aligned}$$

    Hence \(({\mathbf {A}},D({\mathbf {A}}))\) generates a positive semigroup of contractions in \(({\mathbf {X}},\Vert \cdot \Vert _c)\).

(b) \(\kappa _j \ge 1\) for \(j\in \mathcal {M}.\) Then the above expression for \(\Vert {\mathbf {u}}\Vert _c\) implies that for some \(\lambda >0\) and \(c = 1/\lambda \) we have

    $$\begin{aligned} \Vert R(\lambda ,{\mathbf {A}}){\mathbf {f}}\Vert _c \ge c\Vert {\mathbf {f}}\Vert _c \end{aligned}$$

    and, by the density of \(D({\mathbf {A}})\), the application of the Arendt–Batty–Robinson theorem [28, 29] gives the existence of a positive semigroup generated by \({\mathbf {A}}\) in \(({\mathbf {X}},\Vert \cdot \Vert _c)\). Since \(\Vert \cdot \Vert _c\) is equivalent to \(\Vert \cdot \Vert _{{\mathbf {X}}},\) \({\mathbf {A}}\) generates a positive semigroup in \({\mathbf {X}}\).

(c) \(\kappa _j < 1\) for \(j\in I_1\) and \(\kappa _j \ge 1\) for \(j\in I_2,\) where \(I_1\cap I_2 =\emptyset \) and \(I_1\cup I_2 = \{1,\ldots ,m\}\). Let \({\mathbb {L}} = (l_{ij})_{1\le i,j\le m},\) where \(l_{ij} = k_{ij}\) for \(j\in I_2\) and \(l_{ij}=1\) for \(j\in I_1\). Denoting by \({\mathbf {A}}_{{\mathbb {L}}}\) the operator given by the differential expression \(\mathsf A\) restricted to \( D({\mathbf {A}}_{{\mathbb {L}}}) = \{{\mathbf {u}} \in {\mathbf {W}}^{1}_1(I);\; {\mathbf {u}}(0) = {\mathbb {L}}{\mathbf {u}}(1)\} \) we see, by (37), that

    $$\begin{aligned} 0\le R(\lambda , {\mathbf {A}})\le R(\lambda , {\mathbf {A}}_{{\mathbb {L}}}) \end{aligned}$$
    (39)

    for any \(\lambda \in \rho ({\mathbf {A}}_{{\mathbb {L}}}).\) By item (b), \({\mathbf {A}}_{{\mathbb {L}}}\) generates a positive \(C_0\)-semigroup and thus satisfies the Hille–Yosida estimates. Since clearly (39) yields \(R^k(\lambda , {\mathbf {A}}) \le R^k(\lambda , {\mathbf {A}}_{{\mathbb {L}}})\) for any \(k\in \mathbb {N}\), for some \(\omega >0\) and \(M\ge 1\) we have

    $$\begin{aligned} \Vert R^k(\lambda , {\mathbf {A}})\Vert \le \Vert R^k(\lambda , {\mathbf {A}}_{{\mathbb {L}}})\Vert \le M(\lambda -\omega )^{-k}, \qquad \lambda >\omega , k\in \mathbb {N}, \end{aligned}$$

    and hence we obtain the generation of a semigroup by \({\mathbf {A}}\).

Assume now that \({\mathbb {K}}\) is arbitrary. The analysis up to (37) remains valid. Then (37) can be solved by the Neumann series,

$$\begin{aligned} {\mathbf {v}} = \sum \limits _{n=0}^\infty ({\mathbb {K}}{\mathbb {E}}_\lambda (1))^n {\mathbb {K}}{\mathbb {C}}^{-1} \int \limits _{0}^{1}{\mathbb {E}}_\lambda (1-s){\mathbf {f}}(s)ds \end{aligned}$$
(40)

and hence

$$\begin{aligned} {\mathbf {u}}(x) = {\mathbb {E}}_\lambda (x) \sum \limits _{n=0}^\infty ({\mathbb {K}}{\mathbb {E}}_\lambda (1))^n {\mathbb {K}}{\mathbb {C}}^{-1} \int \limits _{0}^{1}{\mathbb {E}}_\lambda (1-s){\mathbf {f}}(s)ds + {\mathbb {C}}^{-1}\int \limits _{0}^{x}{\mathbb {E}}_\lambda (x-s){\mathbf {f}}(s)ds. \nonumber \\ \end{aligned}$$
(41)

Denoting now \(|{\mathbf {u}}|=(|u_1|,\ldots ,|u_m|)\) and \(|{\mathbb {K}}|=(|k_{ij}|)_{1\le i,j\le m},\) and using the fact that only \({\mathbb {K}}\) may have negative entries, we find

$$\begin{aligned} |{\mathbf {u}}(x)|\le & {} {\mathbb {E}}_\lambda (x) \sum \limits _{n=0}^\infty (|{\mathbb {K}}|{\mathbb {E}}_\lambda (1))^n |{\mathbb {K}}|{\mathbb {C}}^{-1} \int \limits _{0}^{1}{\mathbb {E}}_\lambda (1-s)|{\mathbf {f}}|(s)ds\\&+ {\mathbb {C}}^{-1}\int \limits _{0}^{x}{\mathbb {E}}_\lambda (x-s)|{\mathbf {f}}|(s)ds = R(\lambda , {\mathbf {A}}_{|{\mathbb {K}}|})|{\mathbf {f}}|, \end{aligned}$$

where \({\mathbf {A}}_{|{\mathbb {K}}|}\) denotes \(\mathsf A\) restricted to \( D({\mathbf {A}}_{|{\mathbb {K}}|}) = \{{\mathbf {u}} \in {\mathbf {W}}^{1}_1(I);\; {\mathbf {u}}(0) = |{\mathbb {K}}|{\mathbf {u}}(1)\} \). So, we can write

$$\begin{aligned} |R(\lambda , {\mathbf {A}}_{{\mathbb {K}}}){\mathbf {f}}|\le R(\lambda , {\mathbf {A}}_{|{\mathbb {K}}|})|{\mathbf {f}}| \end{aligned}$$
(42)

and, iterating,

$$\begin{aligned} |R(\lambda , {\mathbf {A}}_{{\mathbb {K}}})^k{\mathbf {f}}| \le R(\lambda , {\mathbf {A}}_{|{\mathbb {K}}|})^k|{\mathbf {f}}|. \end{aligned}$$

Using the fact that taking the modulus does not change the norm, we find

$$\begin{aligned} \Vert R(\lambda , {\mathbf {A}}_{{\mathbb {K}}})^k{\mathbf {f}}\Vert \le \Vert R(\lambda , {\mathbf {A}}_{|{\mathbb {K}}|})^k|{\mathbf {f}}|\Vert \le \frac{M}{(\lambda -\omega )^k}\Vert {\mathbf {f}}\Vert , \end{aligned}$$

with M and \(\omega \) coming from the Hille–Yosida estimates for \({\mathbf {A}}_{|{\mathbb {K}}|}\). Hence, by the Hille–Yosida theorem, \({\mathbf {A}}_{{\mathbb {K}}}\) generates a \(C_0\)-semigroup on \({\mathbf {L}}_1(I)\).
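The domination inequality (42) can also be observed numerically by reusing the resolvent formulas (36), (37) with a sign-changing \({\mathbb {K}}\); the data in the sketch below are again hypothetical.

```python
import numpy as np
from scipy.integrate import quad

# Pointwise check of (42): |R(lambda, A_K) f| <= R(lambda, A_{|K|}) |f|.
c = np.array([1.0, 2.0])
lam = 5.0
f = [lambda s: np.sin(np.pi * s), lambda s: np.cos(np.pi * s)]  # f_2 changes sign

def resolvent(K, f, x):                  # u(x) from (36)-(37)
    E1 = np.diag(np.exp(-lam / c))
    g = np.array([quad(lambda s: np.exp(-lam * (1.0 - s) / c[j]) * f[j](s), 0.0, 1.0)[0] / c[j]
                  for j in range(2)])
    v = np.linalg.solve(np.eye(2) - K @ E1, K @ g)
    integ = np.array([quad(lambda s: np.exp(-lam * (x - s) / c[j]) * f[j](s), 0.0, x)[0] / c[j]
                      for j in range(2)])
    return np.exp(-lam * x / c) * v + integ

K = np.array([[0.2, -0.7], [0.8, 0.3]])  # one negative entry
abs_f = [lambda s: abs(f[0](s)), lambda s: abs(f[1](s))]
for x in (0.0, 0.25, 0.5, 0.75, 1.0):
    assert np.all(np.abs(resolvent(K, f, x)) <= resolvent(np.abs(K), abs_f, x) + 1e-12)
print("pointwise domination (42) holds at the sample points")
```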

The fact that \({\mathbb {K}}\ge 0\) yields the positivity of the semigroup follows from the first part of the proof. To prove the converse, let \(k_{ij}<0\) for some i, j and consider the initial condition \({\mathbf {f}}(x) = (f_1(x),\ldots , f_m(x))\) with \(f_k=0\) for \(k\ne j\) and \(f_j\in C^1([0,1])\) with \(f_j(0)=f_j(1) =0,\) so that \({\mathbf {f}} \in D({\mathbf {A}})\), and \(f_j(x)>0\) for \(0<x<1\). Then, at least for \(t<\min _{1\le j\le m} \{1/c_j\}\), \(u_i\) satisfies

$$\begin{aligned} \partial _t u_i+c_i\partial _x u_i=0, \quad u_i(x,0)=0, \quad u_i(0,t) = k_{ij}f_j(1-c_jt); \end{aligned}$$

that is,

$$\begin{aligned} u_i(x,t) = k_{ij}f_j\left( 1+\frac{c_jx}{c_i}-c_jt\right) , \qquad t\ge \frac{x}{c_i}, \end{aligned}$$

and we see that, since \(k_{ij}<0\) and \(f_j>0\) on \(]0,1[\), the solution is negative for such t. This ends the proof. \(\square \)