Abstract
In this paper, we establish convergence to equilibrium for a drift–diffusion–recombination system modelling the charge transport within certain semiconductor devices. More precisely, we consider a two-level system for electrons and holes which is augmented by an intermediate energy level for electrons in so-called trapped states. The recombination dynamics use the mass action principle by taking into account this additional trap level. The main part of the paper is concerned with the derivation of an entropy–entropy production inequality, which entails exponential convergence to the equilibrium via the so-called entropy method. The novelty of our approach lies in the fact that the entropy method is applied uniformly in a fast-reaction parameter which governs the lifetime of electrons on the trap level. Thus, the resulting decay estimate for the densities of electrons and holes extends to the corresponding quasi-steady-state approximation.
Similar content being viewed by others
1 Introduction and main results
The formulation and mathematical analysis of drift–diffusion type semiconductor models reach back to the middle of the last century, see e.g. [18, 20, 24] and the references therein; yet these models still form a highly relevant workhorse in the simulation of semiconductor devices and batteries.
Physically, drift–diffusion models describe the transport of charge carriers via diffusion and convection governed by electric fields. In semiconductors, charge carriers are electrons and holes (positively charged quasi-particles, which represent the absence of an electron). Pairs of electrons and holes can be “generated” and “destroyed” by recombination processes. Generation of an electron-hole pair occurs when an electron is lifted from a low-energy valence band to a high-energy conduction band, where electrons are mobile—leaving behind an equally mobile hole in the valence band. A pivotal generation–recombination model was formulated by Shockley, Read and Hall [16, 21]. Mathematically, Shockley–Read–Hall recombination introduces quadratic non-linear reaction terms into the drift–diffusion dynamics.
A derivation of the Shockley–Read–Hall model considers a generation–recombination process as sketched in Fig. 1. It assumes that appropriately distributed foreign atoms in the crystal lattice of the semiconductor material facilitate the generation of electron-hole pairs by providing in-between energy levels, requiring smaller amounts of energy for each step. Since electrons are immobile at these in-between energy levels, they are called trapped states. Also their maximal density is limited. The Shockley–Read–Hall model of electron-hole recombination is obtained as a quasi-steady-state approximation of the trapped-state dynamics as detailed in the following.
We denote the charge densities of electrons, holes and trapped states by n, p and \(n_{tr}\) and consider the following PDE–ODE drift–diffusion–recombination system:
with the drift–diffusion fluxes and reaction terms
The constants \(n_0, p_0, \tau _n, \tau _p>0\) are positive recombination parameters and \(\varepsilon \in (0,\varepsilon _0]\) for arbitrary \(\varepsilon _0>0\) is a positive relaxation parameter. \(V_n\) and \(V_p\) represent external time-independent potentials.
The reaction term \(R_n\) models transitions of electrons from the trap level to the conduction band (proportional to \(n_{tr}\)) and vice versa (proportional to \(-n(1-n_{tr})\)), where the maximum capacity of the trap level is normalised to one. The analogue processes with respect to the valence band are described by \(R_p\). Note that the rate of hole generation is equivalent to the rate of electrons moving from the valence band to the trap level, which is proportional to (\(1-n_{tr}\)). Similarly, the annihilation of a hole corresponds to an electron that jumps from the trap level to the valence band, which yields a reaction rate proportional to \(-pn_{tr}\). Moreover, \(n_0, p_0 > 0\) represent reference levels for the charge concentrations n and p, while \(\tau _n, \tau _p > 0\) are inverse reaction parameter. Note that the concentration of trapped states satisfies \(n_{tr} \in [0,1]\) provided this holds true for their initial concentration (cf. Theorem 1.1).
The dynamical equation for \(n_{tr}\) in (1) is an ODE in time and pointwise in space with a right hand side depending on n, p and \(n_{tr}\) via \(R_n\) and \(R_p\). We stress that there is no drift–diffusion term for \(n_{tr}\) since trapped electrons are immobile. This is due to the correlation between foreign atoms and the corresponding trap levels which are locally bound near these crystal impurities.
The parameter \(\varepsilon > 0\) models the lifetime of trapped states, where lifetime refers to the expected time until an electron in a trapped state moves either to the valence or the conduction band. The Shockley–Read–Hall recombination model is obtained in the (formal) limit \(\varepsilon \rightarrow 0\), where the concentration of trapped states is determined from the algebraic relation \(0 = R_p(p, n_{tr}) - R_n(n, n_{tr})\):
In this quasi-steady-state approximation, the density of trapped states \(n_{tr}\) and its evolution are eliminated from system (1), while the evolutions of the charge carriers n and p are subject to the Shockley–Read–Hall recombination terms
A rigorous proof of this quasi-steady-state approximation has been performed in [15], even for more general models. See also [18] for semiconductor models with reaction terms of Shockley–Read–Hall-type.
We complete the mathematical description by considering system (1) on a bounded domain \(\Omega \subset {\mathbb {R}}^m\), \(m \ge 1\), with sufficiently smooth boundary \(\partial \Omega\). Without loss of generality, we suppose that the volume of \(\Omega\) is normalised, i.e. \(|\Omega |=1\), which can be achieved by an appropriate scaling of the spatial variables.
We impose no-flux boundary conditions for \(J_n\) and \(J_p\),
where \({\hat{n}}\) denotes the outer unit normal vector on \(\partial \Omega\), and we prescribe non-negative and bounded initial data \(n_I\), \(p_I\), \(n_{tr,I} \in L^\infty (\Omega )\) together with \(\Vert n_{tr,I} \Vert _{L^\infty (\Omega )} \le 1\). As a consequence, the following charge conservation law holds:
with \(M \in {\mathbb {R}}\). Finally, the potentials \(V_n\) and \(V_p\) are assumed to satisfy
where the last condition means that the potentials are confining.
The main goal of this paper is to prove exponential convergence to equilibrium of system (1)–(4) with explicit bounds on rates and constants, which are independent of the relaxation time \(\varepsilon\). We therefore consider \(\varepsilon \in (0, \varepsilon _0]\) for arbitrary but fixed \(\varepsilon _0 > 0\). Our study also includes the limiting case \(\varepsilon =0\).
The main tool in quantifying the large-time behaviour of global solutions to system (1) is the entropy functional
For n and p, we encounter contributions of the Boltzmann-entropy form \(a \ln a - (a - 1) \ge 0\), whereas \(n_{tr}\) enters the entropy functional via a non-negative integral term. Note that the integral \(\int _{1/2}^{n_{tr}} \ln \bigl ( \frac{s}{1-s} \bigr ) ds\) is non-negative and well-defined for all \(n_{tr}(x)\in [0,1]\). By introducing the entropy production functional
it holds (formally) true along solution trajectories of system (1), (2) that
The entropy production functional consists of two non-negative flux terms and two equally non-negative reaction terms of the form \((a-1) \ln a \ge 0\). Thus, the entropy E and its production P are non-negative functionals, which formally implies the entropy E to be monotonically decreasing in time.
In a rigorous proof of the entropy decay, one has to control the two reaction terms in (7), which are unbounded for \(n_{tr}(t,x)\rightarrow 0,1\) or \(n(t,x),p(t,x)\rightarrow 0\). Hence the entropy production is potentially unbounded even for smooth solutions.
The following Theorem 1.1 comprises sufficient existence and regularity results for solutions to satisfy the weak version of (6). We shall call a global weak solution to system (1)–(4) a triple \((n, p, n_{tr}) : [0, \infty ) \rightarrow H^1(\Omega )^2 \times L^\infty (\Omega )\) such that, first,
for all \(T \in (0, \infty )\) and, second, \((n,p,n_{tr})\) solves (1) where n and p satisfy their dynamic equations and the boundary conditions (2) in the weak sense. From PDE-theory (see e.g. [3]), we further obtain the embedding \(W_2(0, T) \hookrightarrow C([0, T], L^2(\Omega ))\).
Theorem 1.1
(Time-dependent system) Let \(n_0, p_0, \tau _n, \tau _p\) and \(\varepsilon\) be positive constants. Assume that \(V_n\) and \(V_p\) satisfy (4) and that \(\Omega \subset {\mathbb {R}}^m\), \(m \ge 1\), is a bounded, sufficiently smooth domain.
Then, for any non-negative initial datum \((n_I, p_I, n_{tr,I}) \in L^\infty (\Omega )^3\) satisfying \(\Vert n_{tr,I}\Vert _{L^\infty (\Omega )} \le 1\), there exists a unique non-negative global weak solution \((n, p, n_{tr})\) of system (1) with boundary conditions (2). More precisely, we find that for all \(T \in (0, \infty )\)
and
Moreover, there exist positive constants \(C_n(\Vert n_I\Vert _{L^\infty (\Omega )},V_n)\), \(C_p(\Vert p_I\Vert _{L^\infty (\Omega )},V_p)\) and \(K_n(V_n)\), \(K_p(V_p)\) independent of \(\varepsilon\) such that
In addition, the concentration \(n_{tr}(t, x)\) is bounded away from zero and one in the sense that for all times \(\tau >0\) there exist positive constants \(\eta = \eta (\varepsilon _0, \tau , \tau _n, \tau _p)\), \(\theta = \theta (C_n,C_p,K_n,K_p)\) and a sufficiently small constant \(\gamma (\tau ,C_n,C_p,K_n,K_p)>0\) such that
where \(\eta \tau = \frac{\gamma }{1+\theta \tau }\) such that the linear and the inverse linear bound intersect at time \(\tau\). As a consequence of (12), there exist positive constants \(\mu\), \(\Gamma >0\) (depending on \(\tau\), \(\eta\), \(\theta\), \(\gamma\), \(V_n\), \(V_p\)) such that
where \(\mu \frac{\tau ^2}{2} = \frac{\Gamma }{1+\theta \tau }\) such that the quadratic and the inverse linear bound intersect at the same time \(\tau\).
Remark 1.2
(Proof of Theorem 1.1) The existence theory of Theorem 1.1 for the coupled ODE-PDE problem (1) applies standard parabolic methods and pointwise ODE estimates. It relates to previous results like [15] in assuming \(L^\infty\) initial data and proving \(L^\infty\)-bounds in order to control non-linear terms. The proof is therefore postponed to the Appendix.
Our first main result proves exponential convergence of solutions to (1)–(4) to a unique positive equilibrium state \((n_\infty (x), p_\infty (x), n_{tr,\infty })\), which is stated in detail in Theorem 2.1.
Theorem 1.3
(Exponential convergence to equilibrium) Let \((n,p,n_{tr})\) be a global weak solution of system (1)–(4) as given in Theorem 1.1 above corresponding to the non-negative initial data \((n_I,p_I,n_{tr,I}) \in L^\infty (\Omega )^3\) satisfying \(\Vert n_{tr,I}\Vert _{L^{\infty }(\Omega )} \le 1\). Then, this solution satisfies the weak entropy production law
for all \(0< t_0 \le t_1 < \infty\) and the following versions of the exponential decay towards the equilibrium:
where \(E_I\) and \(E_\infty\) denote the initial entropy and the equilibrium entropy of the system, respectively, and the equilibrium \((n_\infty , p_\infty , n_{tr,\infty })\) is given in Theorem 2.1. Moreover,
where \(C := C_{\mathrm {CKP}}^{-1}\) and \(K := C_{\mathrm {EEP}}^{-1}\) (see Theorem 1.5 and Proposition 6.1 for the definition of \(C_\mathrm {EEP}\) and \(C_\mathrm {CKP}\), respectively) are explicitly computable constants independent of \(\varepsilon \in (0, \varepsilon _0]\) for arbitrary but fixed \(\varepsilon _0 > 0\).
Remark 1.4
(Theorem 1.1 proves the weak entropy production law (14)) The regularity of n and p of Theorem 1.1 as well as the lower and upper bounds (12) for \(n_{tr}\) and the lower bounds (13) for n and p allow to prove that any solution of Theorem 1.1 satisfies the weak entropy production law (14).
The proof of Theorem 1.3 applies the so-called entropy method, which derives a functional inequality of the form
where n, p and \(n_{tr}\) are non-negative functions satisfying the same conservation law as solutions to (1)–(4), see below. The proof provides an explicit estimate of the constant \(C>0\). Applying this entropy–entropy production (EEP) inequality to the entropy production law (14) entails exponential decay of the relative entropy via a general Gronwall-argument. A Csiszár–Kullback–Pinsker-type inequality yields then exponential convergence in \(L^1\) as stated in (15).
The key step of the entropy method is to prove (as second main result) a suitable EEP functional inequality independently from solutions to (1)–(4) and independently from \(\varepsilon\).
Theorem 1.5
(Entropy–Entropy Production Inequality) Let \(\varepsilon _0\), \(\tau _n\), \(\tau _p\), \(n_0\), \(p_0\) be positive constants and \(M\in {\mathbb {R}}\). Let \((n_\infty , p_\infty , n_{tr,\infty })\) be the corresponding equilibrium as in Theorem 2.1. Consider an arbitrarily large positive constant \(M_1>0\) and non-negative functions \((n,p,n_{tr}) \in L^1(\Omega )^3\) satisfying the \(L^1\)-bound \(\overline{n}, \overline{p} \le M_1\), the \(L^\infty\)-bound \(\Vert n_{tr}\Vert _{L^{\infty }(\Omega )} \le 1\), and the conservation law
where \(\overline{f} := \int _{\Omega } f(x) \, dx\) (recall \(|\Omega |=1\)).
Then, there exists an explicitly computable constant \(C_{\mathrm {EEP}} > 0\) such that for all \(\varepsilon \in (0, \varepsilon _0]\) the following functional inequality, called entropy–entropy production inequality, holds true:
Remark 1.6
We point out that Theorem 1.5 derives a general functional inequality for admissible functions \((n,p,n_{tr})\), which only share few natural properties like the \(L^1\)-integrability, boundedness of the trapped states and the conservation law with solutions to (1)–(4). It is a nice robustness feature of the entropy method to be based on functional inequalities which can be reused in related contexts, rather than deriving solution-specific estimates. The constant \(C_{\mathrm {EEP}}\) is independent of \(\varepsilon \in (0, \varepsilon _0]\). It only depends on the upper bound \(\varepsilon _0 > 0\), which can be chosen arbitrarily.
Remark 1.7
We emphasise that the EEP inequality (16) does not depend on the lower and upper solution bounds (11)–(13). These bounds are only needed to prove that solutions to (1)–(4) satisfy the weak entropy production law (14), which is neither directly obvious nor part of the existence theory. Therefore, (14) implies that solutions to Theorem 1.1 may only feature singularities of P at time zero due to a lacking regularity of the initial data or due to initial data \(n_{tr,I}(x)\in [0,1]\), \(n_I(x),p_I(x)\in [0,\infty ).\)
The proof of the EEP-inequality of Theorem 1.5 captures in a certain sense the entire non-linear and global dynamics of system (1)–(4). Hence, its derivation is ought to be an involved task. A key step is the proof of a functional EEP-inequality for the special cases of spatially homogeneous concentrations, which fulfil the conservation law (3) and the \(L^1\)-bounds (cf. Proposition 5.3). This core estimate is then extended to the case of arbitrary concentrations satisfying the same assumptions in Proposition 5.5. This extension also forces one to bound \(\sqrt{n} - \overline{\sqrt{n}}\), \(\sqrt{p} - \overline{\sqrt{p}}\), and \(\sqrt{n_{tr}} - \overline{\sqrt{n_{tr}}}\) in \(L^2(\Omega )\) by the entropy production. Due to the diffusive part in the dynamical equations for n and p, this is easily achieved for the expressions involving n and p by applying Poincaré’s inequality (see the Proof of Theorem 1.5 in Sect. 6). However, this is not possible for \(n_{tr}\) as no diffusion is acting on \(n_{tr}\). On the other hand, \(n_{tr}\) is subject to indirect diffusive effects, which allow for a control on \(\sqrt{n_{tr}} - \overline{\sqrt{n_{tr}}}\) in terms of a suitable functional inequality. Indirect diffusive effects occur when a reversible reaction transfers diffusive behaviour from a diffusive species to a non-diffusive species. A first functional inequality which quantifies an indirect diffusion effect was proven in [4] with significant generalisations to reaction–diffusion systems in [8, 12], volume–surface reaction–diffusion systems [11] and reaction–diffusion systems with non-linear diffusion [13]. Here, the corresponding estimate is proven in Proposition 5.6 and might also be of independent interest.
Our two last results on system (1)–(4) combine the exponential convergence to equilibrium as proven in Theorem 1.3 with the solution bounds of Theorem 1.1. This entails uniform-in-time solution bounds for n and p as well as exponential convergence to equilibrium in \(L^\infty (\Omega )\) for n, p, and \(n_{tr}\). As opposed to (15), the convergence result for \(n_{tr}\) in Corollary 1.9 holds true without the coefficient \(\varepsilon\).
Corollary 1.8
The solutions n and p of Theorem 1.1 are uniformly-in-time bounded in \(L^{\infty }\), i.e. there exists a constant \(Z > 0\) independent of \(\varepsilon \in (0, \varepsilon _0]\) such that
Moreover, the bounds (17) allow to improve the bounds (12), (13) and to obtain uniform-in-time bounds in the sense that for all \(\tau >0\), there exist sufficiently small and \(\varepsilon\)-independent constants \(\eta , \gamma , \mu , \Gamma > 0\) such that
and
for all \(t\ge 0\) and a.e. \(x\in \Omega\) where \(\eta t\) and \(\gamma\) as well as \(\mu t^2/2\) and \(\Gamma\) intersect at time \(\tau > 0\).
Corollary 1.9
Under the hypotheses of Theorem 1.3, there exist constants \(0< C, K < \infty\) independent of \(\varepsilon \in (0, \varepsilon _0]\) such that
is valid for all \(t \ge 0\).
The final topic of this paper considers the limit \(\varepsilon \rightarrow 0\), which recovers the well-known Shockley–Read–Hall drift–diffusion–recombination model (see [15, 18]):
where
Remarking that the entropy–entropy production inequality derived in Theorem 1.5 holds uniformly in the fast-reaction parameter \(0<\varepsilon \le \varepsilon _0\), one intuitively expects the entropy method and the convergence result of Theorem 1.3 to extend to system (20). Here, we are interested to make this conjecture rigorous also in view of a better general understanding of the equilibration of systems which are derived as fast-reaction limits or quasi-steady-state approximations. One technical point is how to bypass the \(\varepsilon\)-dependency of the conservation law (3). The details of this singular limit are subject of the last Sect. 7. Altogether, we prove for system (20) the Theorems 7.3, 7.2 and Corollary 7.5 as corresponding versions of Theorems 1.3, 1.5 and Corollary 1.8.
Up to our knowledge, this is a first result in performing the entropy method in a non-linear reaction–diffusion-type system uniformly in a fast-reaction limit. Note that our approach yields global convergence to equilibrium for all initial data rather than just exponential stability of equilibria as proven, for instance, in a related 1D Poisson–Nernst–Planck system uniformly in the permittivity entering Poisson’s equation [17].
The rest of the paper is organised in the following manner. Section 2 proves the existence of a unique equilibrium (Theorem 2.1) as well as uniform-in-\(\varepsilon\) bounds of \(n_\infty\), \(p_\infty\) and \(n_{tr,\infty }\). In Sect. 3, we collect a couple of technical lemmata, and within Sect. 4, we state a preliminary proposition which serves as a first result towards an EEP-inequality. An abstract version of the EEP-estimate is proven in Sect. 5, first for constant concentrations and based on that also for general concentrations. Section 6 is concerned with the proofs of the EEP-inequality from Theorem 1.5, the announced exponential convergence from Theorem 1.3 and the uniform \(L^\infty\)-bounds from Corollary 1.8, whereas Sect. 7 is devoted to the same issues in the situation \(\varepsilon \rightarrow 0\). Finally, the proof of Theorem 1.1 is contained in the Appendix.
2 Properties of the equilibrium
We prove the existence of a unique positive equilibrium \((n_\infty , p_\infty , n_{tr,\infty })\) of system (1)–(3) in a suitable (and natural) function space. Note that uniqueness is only satisfied once the total charge M in (3) is fixed. This equilibrium can either be seen as the unique solution of the below stationary system (21) or as the unique state for which the entropy production (7) vanishes.
Theorem 2.1
(Stationary system and uniformly bounded equilibrium) Let \(M \in {\mathbb {R}}\), \(\varepsilon \in (0,\varepsilon _0]\) for arbitrary \(\varepsilon _0>0\) and \((n_\infty , p_\infty , n_{tr,\infty }) \in X\) where X is defined via
Then, the following statements are equivalent.
-
1.
\((n_\infty , p_\infty , n_{tr,\infty })\in X\) is a solution of the stationary system
$$\begin{aligned} \nabla \cdot J_n(n_\infty ) + R_n(n_\infty ,n_{tr,\infty })&= 0, \end{aligned}$$(21a)$$\begin{aligned} \nabla \cdot J_p(p_\infty ) + R_p(p_\infty ,n_{tr,\infty })&= 0, \end{aligned}$$(21b)$$\begin{aligned} R_p(p_\infty ,n_{tr,\infty }) - R_n(n_\infty ,n_{tr,\infty })&= 0. \end{aligned}$$(21c) -
2.
\(P(n_\infty , p_\infty , n_{tr,\infty }) = 0\).
-
3.
\(J_n(n_\infty ) = J_p(p_\infty ) = R_n(n_\infty ,n_{tr,\infty }) = R_p(p_\infty ,n_{tr,\infty }) = 0\) a.e. in \(\Omega\).
-
4.
The state \((n_\infty , p_\infty , n_{tr,\infty })\) satisfies
$$\begin{aligned} n_\infty = n_*e^{-V_n}, \quad p_\infty = p_*e^{-V_p}, \quad n_{tr,\infty } = \frac{n_*}{n_*+ n_0} = \frac{p_0}{p_*+ p_0} \end{aligned}$$(22)where the positive constants \(n_*, p_*>0\) are uniquely determined by the condition
$$\begin{aligned} n_*p_*= n_0 p_0 \end{aligned}$$(23)and the conservation law
$$\begin{aligned} n_*\overline{\mu _n} - p_*\overline{\mu _p} + \varepsilon \, n_{tr,\infty } = M. \end{aligned}$$(24)
Consequently, the unique positive equilibrium \((n_\infty , p_\infty , n_{tr,\infty }) \in X\) is given by (22)–(24), and
Finally, for all \(M\in {\mathbb {R}}\) and for \(\varepsilon \in (0,\varepsilon _0]\), there exist two constants \(\gamma \in (0, 1/2)\) and \(\Gamma \in (1/2, \infty )\) depending only on \(\varepsilon _0\), \(n_0\), \(p_0\), M, \(V:= \max ( \Vert V_n \Vert _{L^\infty (\Omega )}, \Vert V_p \Vert _{L^\infty (\Omega )})\) such that
Proof of Theorem 2.1
We shall prove the equivalence of the statements in the theorem by a circular reasoning. Assume that \((n_\infty , p_\infty , n_{tr,\infty }) \in X\) is a solution of the stationary system (21). In this case,
We test Eq. (21a) with \(\ln (n_\infty /(n_0 \mu _n))\). Due to \(n_\infty \in H^1(\Omega )\) and \(n_\infty \ge \gamma\) a.e. in \(\Omega\), the test function \(\ln (n_\infty /(n_0 \mu _n))\) belongs to \(H^1(\Omega )\). We find
In the same way, we test Eq. (21b) with \(\ln (p_\infty /(p_0 \mu _p)) \in H^1(\Omega )\). This yields
Moreover, we multiply (21c) with \(\ln (n_{tr,\infty }/(1-n_{tr,\infty })) \in L^2(\Omega )\), integrate over \(\Omega\) and obtain
Taking the sum of the three expressions above, we arrive at
By the non-negativity of the terms in the last two lines, equality holds if and only if \(R_n(n_\infty ,n_{tr,\infty }) = 0=R_p(p_\infty ,n_{tr,\infty })\). Hence, \(P(n_\infty , p_\infty , n_{tr,\infty }) = 0\) readily implies \(J_n(n_\infty ) = J_p(p_\infty ) = R_n(n_\infty ,n_{tr,\infty }) = R_p(p_\infty ,n_{tr,\infty }) = 0\) a.e. in \(\Omega\).
Because of \(J_n(n_\infty ) = \mu _n \nabla \bigl (\frac{n_\infty }{\mu _n}\bigr ) = 0 = J_p(p_\infty ) = \mu _p \nabla \bigl (\frac{p_\infty }{\mu _p}\bigr )\), we know that
with constants \(n_*, p_*\). Moreover, \(R_n(n_\infty ,n_{tr,\infty }) = R_p(p_\infty ,n_{tr,\infty }) = 0\) gives rise to
Consequently, \(n_*p_*= n_0 p_0>0\), which implies \(n_*, p_*>0\) and
Moreover, for M fixed, the constants \(n_*\) and \(p_*\) are uniquely determined by the conservation law
where the uniqueness follows from the strict monotonicity of \(f(n_*) {:}{=}n_*\overline{\mu _n} - \frac{n_0 p_0 }{n_*}\overline{\mu _p} + \varepsilon \, \frac{n_*}{n_*+ n_0}\) on \((0, \infty )\) and the asymptotics \(f(n_*) \rightarrow -\infty\) for \(n_*\rightarrow 0^+\) and \(f(n_*) \rightarrow \infty\) for \(n_*\rightarrow \infty\).
Finally, concluding the circular reasoning \(1. \rightarrow 2. \rightarrow 3. \rightarrow 4. \rightarrow 1.\), we observe that
obviously satisfies \(J_n(n_\infty ) = J_p(p_\infty ) = R_n(n_\infty ,n_{tr,\infty }) = R_p(p_\infty ,n_{tr,\infty }) = 0\) a.e. in \(\Omega\) which proves \((n_\infty , p_\infty , n_{tr, \infty })\) to be a solution of the stationary system.
In order to prove the bounds (26), we observe
the left hand side is strictly monotone increasing from \(-\infty\) to \(+\infty\) as \(n_*\in (0,\infty )\), while the right hand side is strictly monotone decreasing and bounded between \((M,M-\varepsilon _0)\) as \(n_*\in (0,\infty )\). Both monotonicity and unboundedness/boundedness imply uniform positive lower and upper bounds for \(n_*\) as explicitly proven in the following: First, we derive that
for all \(\varepsilon \in (0,\varepsilon _0]\). Note that (27) is not an explicit representation of \(n_*\) since \(n_{tr,\infty }\) depends itself on \(n_*\). Because of \(n_{tr,\infty } \in (0, 1)\), we further observe that
where \(\beta =\beta (\varepsilon _0,n_0,p_0,M,V)\). And as a result of the elementary inequality \(\sqrt{a + b} \ge \sqrt{a} + \frac{b}{2 \sqrt{a} + \sqrt{b}}\) for \(a \ge 0\) and \(b > 0\), we also conclude that
where \(\alpha =\alpha (\varepsilon _0,n_0,p_0,M,V)\). Similar arguments show that corresponding bounds \(\alpha\) and \(\beta\) are also available for \(p_*\). Hence,
Due to \(n_\infty = n_*e^{-V_n}\), \(p_\infty = p_*e^{-V_p}\) and the \(L^\infty\)-bounds on \(V_n\) and \(V_p\), the claim of the proposition follows. \(\square\)
3 Some technical lemmata
A particularly useful relation between the concentrations n, p and \(n_{tr}\) is the following Lemma.
Lemma 3.1
The conservation law \(\overline{n} - \overline{p} + \varepsilon \, \overline{n_{tr}} = M\) and the equilibrium condition (25) imply
Proof
With \(\overline{n_\infty } - \overline{p_\infty } + \varepsilon \, n_{tr,\infty } = M\) (note that \(n_{tr,\infty } = \overline{n_{tr,\infty }}\) is constant), we have \(\overline{p} - \overline{p_\infty } = \overline{n} - \overline{n_\infty } + \varepsilon (\overline{n_{tr}} - n_{tr,\infty })\). We employ this relation to replace \(\overline{p} - \overline{p_\infty }\) on the left hand side of (28) and calculate
Now, the first term on the right hand side vanishes due to \(n_*p_*= n_0 p_0\) while we use \(p_*/p_0 = (1-n_{tr,\infty }) / n_{tr,\infty }\) for the second term and obtain
as claimed above. \(\square\)
Lemma 3.2
(Relative Entropy) The entropy relative to the equilibrium reads
Proof
By the definition of \(E(n,p,n_{tr})\) in (5), we note that
We expand the first integrand as \(n \ln \bigl ( \frac{n}{n_0 \mu _n} \bigr ) = n \ln \bigl ( \frac{n}{n_\infty } \bigr ) + n \ln \bigl ( \frac{n_\infty }{n_0 \mu _n} \bigr ).\) Thus, with \(n_\infty / \mu _n = n_*\), we get
Together with an analogous calculation of the p-terms, we obtain
Lemma 3.1 allows us to reformulate the last two lines as
which proves the claim. \(\square\)
Lemma 3.3
(Csiszár–Kullback–Pinsker inequality) Let \(f, g: \Omega \rightarrow {\mathbb {R}}\) be non-negative measurable functions. Then,
Proof
Following a proof by Pinsker, we start with the elementary inequality \(3 (x-1)^2 \le (2x + 4) (x \ln x - (x-1))\). This allows us to derive the following Csiszár–Kullback–Pinsker-type inequality:
where we applied Hölder’s inequality in the last step. \(\square\)
The subsequent lemma provides \(L^1\)-bounds for n and p in terms of the initial entropy of the system and some further constants.
Lemma 3.4
(\(L^1\)-bounds) Due to the monotonicity of the entropy functional, any entropy producing solution of (1) satisfies
Proof
Employing Lemma 3.3 and Young’s inequality, we find
Solving this inequality for \(\overline{n}\) yields
Therefore, we arrive at
where we used the monotonicity of the entropy functional in the last step. In the same way, we may bound \(\overline{p}\) from above. \(\square\)
At certain points, we will have to estimate the difference between terms like \(\overline{n/n_\infty }\) and \(\overline{n} / \overline{n_{\infty }}\). Using Lemma 3.5 below, we can bound this difference by the \(J_n\)-flux term and, hence, by the entropy production.
Lemma 3.5
Let \(f \in L^1(\Omega )\) and \(g \in L^\infty (\Omega )\) such that \(f \ge 0\), \(g \ge \gamma > 0\) a.e. on \(\Omega\) and f/g is weakly differentiable. Then, there exists an explicit constant \(C(\Vert f\Vert _{L^1(\Omega )}, \Vert g\Vert _{L^\infty (\Omega )}, \gamma ) > 0\) such that
Proof
We define \(\delta := \frac{f}{g} - \overline{\left( \frac{f}{g} \right) }\) and obtain \(f = g \left( \overline{\left( \frac{f}{g} \right) } + \delta \right)\) and
by utilising \(|\Omega | = 1\). Therefore,
by applying Poincaré’s inequality to \(\delta\) with \(\overline{\delta }= 0\) and some constant \(C_P(\Omega ) > 0\). As \(g \ge \gamma > 0\) is uniformly positive on \(\Omega\) and \(\overline{g} \ge \gamma\), we arrive at
Finally, we deduce
employing Hölder’s inequality in the second step. \(\square\)
4 Two preliminary propositions
Notation 4.1
For arbitrary functions f, we define the normalised quantity
The following logarithmic Sobolev inequality on bounded domains was proven in [6] by following an argument of Stroock [22].
Lemma 4.2
(Logarithmic Sobolev inequality on bounded domains) Let \(\Omega\) be a bounded domain in \({\mathbb {R}}^{m}\), \(m \ge 1\), such that the Poincaré (–Wirtinger) and Sobolev inequalities
hold with \(q=\infty\) for \(m=1\) and any \(q<\infty\) for \(m=2\). Then, the logarithmic Sobolev inequality
holds (for some constant \(L(\Omega ,m)>0\)).
The Log-Sobolev inequality allows to bound an appropriate part of the entropy functional by the flux parts of the entropy production. The normalised variables on the left hand side of the subsequent inequality naturally arise when reformulating the flux terms on the right hand side in such a way that we can apply the Log-Sobolev inequality on \(\Omega\).
Proposition 4.3
Recall the assumptions \(\Vert V_n\Vert _{L^\infty (\Omega )}, \Vert V_p\Vert _{L^\infty (\Omega )}\le V\). Then, there exists a constant \(C(V)>0\) such that
Proof
From the definition of \(J_n\) one obtains
We set
and observe that \(\alpha =\frac{\overline{\mu _n}}{\overline{n}} \int _{\Omega } \frac{n}{\mu _n}\,dx \le \overline{\mu _n} e^V\) is bounded independently of n. Next, we introduce the rescaled variable \(y {:}{=}\alpha ^{-\frac{1}{m}} x\) where \(m \ge 1\) denotes the space dimension. Note that \(\Vert \phi \Vert _{L^2(dx)}\) is in general different from one, whereas \(\Vert \phi \Vert _{L^2(dy)} = 1\). We now estimate with \(\Vert V_n\Vert _{L^\infty (\Omega )}\le V\) and the logarithmic Sobolev inequality (31)
The corresponding estimate involving \(J_n\) reads
The same arguments apply to the terms involving p. \(\square\)
The following proposition contains the first step towards an entropy–entropy production inequality. The relative entropy can be controlled by the flux part of the entropy production and three additional terms, which mainly consist of square roots of averaged quantities. The proof that the entropy production also serves as an upper bound for these terms will be the subject of the next section.
Proposition 4.4
There exists an explicit constant \(C(\gamma , \Gamma , M_1) > 0\) such that for \((n_\infty , p_\infty , n_{tr,\infty }) \in X\) from Theorem 2.1 and all non-negative functions \((n,p,n_{tr}) \in L^1(\Omega )^3\) satisfying \(n_{tr} \le 1\), the conservation law
and the \(L^1\)-bound
the following estimate holds true:
(Note that the right hand side of (32) vanishes at the equilibrium \((n_\infty , p_\infty , n_{tr,\infty })\).)
Proof
According to Lemma 3.2, we have
Recall that \(n = \widetilde{n}\,\overline{n}\), \(n_\infty = \widetilde{n_\infty }\,\overline{n_\infty }\) and \(\widetilde{n_\infty } = \widetilde{\mu _n}\). Using these relations, we rewrite the first two integrands as
and analogously for the p-terms. This results in
The terms in the second line of (33) can be estimated using the Log-Sobolev inequality of Proposition 4.3. Moreover, the elementary inequality \(x \ln x - (x-1) \le (x-1)^2\) for \(x>0\) gives rise to
and an analogous estimate for the corresponding expressions involving p. The second term on the right hand side of the previous line can be bounded from above by applying Lemma 3.5, which guarantees a constant \(C(\gamma , \Gamma , M_1) > 0\) such that
for some constant \(c_1(\gamma , \Gamma , M_1) > 0\). Besides,
See (26) and Lemma 3.4 for the bounds on \(n_*\), \(n_\infty\) and \(\overline{n}\). We have thus verified that
with some \(c_2(\gamma , \Gamma , M_1) > 0\). A similar estimate holds true for the corresponding part of (33) involving p.
Considering the last line in (33), we further know that for all \(x \in \Omega\) there exists some mean value
such that
Consequently,
In fact, we will prove that there even exists some constant \(\xi \in (0,1/2)\) such that
for all \(x \in \Omega\). Thus, the function \(\theta (x)\) is uniformly bounded away from 0 and 1 on \(\Omega\). To see this, we first note that \(n_{tr,\infty } \in [\gamma , 1-\gamma ]\) using the constant \(\gamma \in (0, 1/2)\) from (26). In addition,
for all \(x \in \Omega\). Together with (34), this estimate implies
We now choose an arbitrary \(x \in \Omega\) and distinguish two cases. If \(| n_{tr}(x) - n_{tr,\infty } | \ge \gamma /2\), then
As a consequence of \(\ln (s/(1-s)) \rightarrow \infty\) for \(s \rightarrow 1^-\) and \(\ln (s/(1-s)) \rightarrow -\infty\) for \(s \rightarrow 0^+\), there exists some constant \(\xi \in (0,\gamma )\) depending only on \(\gamma\) such that \(\theta (x) \in (\xi , 1-\xi )\). If \(| n_{tr}(x) - n_{tr,\infty } | < \gamma /2\), then \(n_{tr,\infty } \in [\gamma , 1 - \gamma ]\) implies \(n_{tr}(x) \in (\gamma /2, 1-\gamma /2)\) and, hence, \(\theta (x) \in (\gamma /2, 1-\gamma /2)\). Again the constant \(\xi\) depends only on \(\gamma\).
As a result of the calculations above, we may rewrite the last line in (33) as
Applying the mean-value theorem to the expression in brackets and observing that
we find
with some \(\sigma (x) \in (\min \{\theta (x), n_{tr,\infty }\}, \max \{\theta (x), n_{tr,\infty }\})\). Since both \(\theta (x), n_{tr,\infty } \in (\xi , 1-\xi )\) for all \(x \in \Omega\), we also know that \(\sigma (x) \in (\xi , 1-\xi )\) for all \(x \in \Omega\). Thus, \((\sigma (x)(1-\sigma (x)))^{-1}\) is bounded uniformly in \(\Omega\) in terms of \(\xi = \xi (\gamma )\). Consequently,
with a constant \(c_3(\gamma ) > 0\) after applying the estimate \(|\theta (x) - n_{tr,\infty }| \le |n_{tr}(x) - n_{tr,\infty }|\) for all \(x \in \Omega\). Finally, we arrive at
with a constant \(C(\gamma , \Gamma , M_1) > 0\). \(\square\)
5 Abstract versions of the EEP-inequality
Notation 5.1
We set
and define the positive constants
The motivation for introducing the additional variable \(n_{tr}'\) is the possibility to symmetrise expressions like \((n(1-n_{tr}) - n_{tr})^2 + (pn_{tr} - (1-n_{tr}))^2\) as \((nn_{tr}' - n_{tr})^2 + (pn_{tr} - n_{tr}')^2\). Similar terms will appear frequently within the subsequent calculations.
Remark 5.2
We may consider \(n_{tr}'\) as a fourth independent variable within our model. In this case, the reaction–diffusion system features the following two independent conservation laws:
The special formulation of the first conservation law will become clear when looking at the following two Propositions. There, we derive relations for general variables a, b, c and d, which correspond to \(\sqrt{n/(n_0 \mu _n)}\), \(\sqrt{p/(p_0 \mu _p)}\), \(\sqrt{n_{tr}}\) and \(\sqrt{n_{tr}'}\), respectively.
In addition, we have the following \(L^1\)-bound (cf. Lemma 3.4):
The following Proposition 5.3 establishes an upper bound for the terms in the second line of (32) in the case of constant concentrations a, b, c and d. This result is then generalised in Proposition 5.5 to non-constant states a, b, c, d.
Proposition 5.3
(Homogeneous concentrations) Let \(a, b, c, d \ge 0\) be constants such that their squares satisfy the conservation laws
for any \(\varepsilon \in (0,\varepsilon _0]\) and arbitrary \(\varepsilon _0>0\). Moreover, assume
Then, there exists an explicitly computable constant \(C(\varepsilon _0, n_0, p_0, M, M_1, V) > 0\) such that
for all \(\varepsilon \in (0,\varepsilon _0]\).
Proof
We first introduce the following change of variable: Due to the non-negativity of the concentrations a, b, c, d, we define constants \(\mu _1, \mu _2, \mu _3, \mu _4 \in [-1,\infty )\) such that
where \(\nu _\infty\), \(\pi _\infty\), \(\nu _{tr,\infty }\) and \(\nu _{tr,\infty }'\) are uniformly positive and bounded for all \(\varepsilon \in (0,\varepsilon _0]\) in terms of \(\varepsilon _0, n_0, p_0, M\) and V by (26). Thus, the boundedness of a, b, c, d implies the existence of a constant \(K(\varepsilon _0, n_0, p_0, M, M_1, V)>0\), such that \(\mu _i \in [-1,K]\) for all \(1 \le i \le 4\). The left hand side of (35) expressed in terms of the \(\mu _i\) rewrites as
Employing the equilibrium conditions (25), we also find
and
Moreover, the two conservation laws from the hypotheses rewrite as
The relations (36), (37) allow to express \(\varepsilon \mu _3\) and \(\varepsilon \mu _4\) in terms of \(\mu _1\) and \(\mu _2\), although not explicitly:
where the last definition follows from inserting the previous expression (38) for \(\varepsilon \mu _3\) while the factor \(2+\mu _3\) is bounded in \([1,K+2]\) since \(\mu _i \in [-1,K]\) for all \(1 \le i \le 4\). Therefore, all the terms \(f_{i,j}\) are uniformly positive as well as bounded from above:
All constants \(\underline{C_{i,j}}\) and \(\overline{C_{i,j}}\) only depend on \(\varepsilon _0\), \(n_0\), \(p_0\), M, \(M_1\) and V, and there exist corresponding bounds \({\underline{C}} > 0\) and \(\overline{C} > 0\) such that for all i, j
In order to prove (35), we show that under the constraints of the conservation laws (36), (37), respectively, the relations (38), (39), there exists a constant \(C(\varepsilon _0, n_0, p_0, M, {\underline{C}}, \overline{C}) > 0\) for all \(\varepsilon \in (0,\varepsilon _0]\) such that
which is equivalent to
Recall that \(\nu _\infty ^2 \le \Gamma /n_0\), \(\pi _\infty ^2 \le \Gamma /p_0\) and \(\nu _{tr,\infty }^2, \nu _{tr,\infty }'^{\, 2} \in [\gamma , 1-\gamma ]\) with \(\gamma \in (0, 1/2)\) and \(\Gamma \in (1/2, \infty )\) depending on \(\varepsilon _0\), \(n_0\), \(p_0\) and M for all \(\varepsilon \in (0,\varepsilon _0]\) (cf. the proof of Proposition 26). Since numerator and denominator of (40) are sums of quadratic terms, it is sufficient to bound the denominator from below in terms of its numerator omitting the prefactors \(\nu _\infty ^2\), \(\pi _\infty ^2\), \(\nu _{tr,\infty }^2\) and \(\nu _{tr,\infty }'^{\,2}\), i.e. to prove that
More precisely, we will prove that there exists a constant \(c(\varepsilon _0, {\underline{C}}, \overline{C}) > 0\) for all \(\varepsilon \in (0,\varepsilon _0]\) such that
and that
For this reason, we distinguish four cases and we shall frequently use estimates like
since \(\mu _j \ge -1\) for all \(1 \le j \le 4\). We mention already here that all subsequent constants \(c_1\), \(c_2\) are strictly positive and depend only on \(\varepsilon _0\), \({\underline{C}}\) and \(\overline{C}\) uniformly for \(\varepsilon \in (0,\varepsilon _0]\).
-
Case 1: \(\mu _1 \ge 0 \wedge \mu _2 \ge 0\): If \(\mu _3 \ge 0\), then (37) implies \(\mu _4 \le 0\) and \(\mu _2 + \mu _3 + \mu _2 \mu _3 - \mu _4 \ge \mu _2\). Moreover, \(\mu _3 \ge 0\) yields
$$\begin{aligned} f_{2,3} \mu _2 \ge f_{1,3} \mu _1 \ \Rightarrow \ \overline{C_{2,3}} \mu _2 \ge \underline{C_{1,3}} \mu _1 \ \Rightarrow \ \mu _2 \ge \underline{C_{1,3}} / \overline{C_{2,3}} \, \mu _1 \end{aligned}$$and
$$\begin{aligned} \mu _2 + \mu _3 + \mu _2 \mu _3 - \mu _4 \ge \mu _2 \ge \mu _2 / 2 + \underline{C_{1,3}} / ( 2 \, \overline{C_{2,3}} ) \mu _1 \ge c_1 (\mu _1 + \mu _2). \end{aligned}$$Hence, \((*) \ge \left( \mu _2 + \mu _3 + \mu _2 \mu _3 - \mu _4 \right) ^2 \ge c_2 (\mu _1^2 + \mu _2^2)\). Besides, \((*) \ge \left( \mu _2 + \mu _3 + \mu _2 \mu _3 - \mu _4 \right) ^2 \ge \mu _3^2\). If \(\mu _3 < 0\), (37) yields \(\mu _4 > 0\) and \(\mu _1 + \mu _4 + \mu _1 \mu _4 - \mu _3 \ge \mu _1\). Since \(\mu _3 < 0\), (38) implies
$$\begin{aligned} f_{1,3} \mu _1 \ge f_{2,3} \mu _2 \ \Rightarrow \ \overline{C_{1,3}} \mu _1 \ge \underline{C_{2,3}} \mu _2 \ \Rightarrow \ \mu _1 \ge \underline{C_{2,3}} / \overline{C_{1,3}} \, \mu _2 \end{aligned}$$and
$$\begin{aligned} \mu _1 + \mu _4 + \mu _1 \mu _4 - \mu _3 \ge \mu _1 \ge \mu _1 / 2 + \underline{C_{2,3}} / ( 2 \, \overline{C_{1,3}} ) \mu _2 \ge c_1 (\mu _1 + \mu _2). \end{aligned}$$As above, \((*) \ge c_2 (\mu _1^2 + \mu _2^2)\). The signs \(\mu _3 \le 0 \le \mu _1,\mu _4\) yield \((*) \ge \left( \mu _1 + \mu _4 + \mu _1 \mu _4 - \mu _3 \right) ^2 \ge \mu _3^2\).
-
Case 2: \(\mu _1 \ge 0 \wedge \mu _2 < 0\): Equations (38) and (39) imply \(\mu _3 \le 0\) and \(\mu _4 \ge 0\), and we deduce for all \(\varepsilon \in (0,\varepsilon _0]\)
$$\begin{aligned}&\mu _1 + \mu _4 + \mu _1 \mu _4 - \mu _3 \ge \mu _4 - \mu _3 \\&\quad = \varepsilon ^{-1}(f_{1,3} + f_{1,4}) \mu _1 - \varepsilon ^{-1}(f_{2,3} + f_{2,4}) \mu _2 \\&\quad \ge \varepsilon _0^{-1}(\underline{C_{1,3}} + \underline{C_{1,4}}) |\mu _1| + \varepsilon _0^{-1}(\underline{C_{2,3}} + \underline{C_{2,4}}) |\mu _2| \ge c_1 (|\mu _1| + |\mu _2|) \end{aligned}$$and, thus, \((*) \ge (\mu _1 + \mu _4 + \mu _1 \mu _4 - \mu _3)^2 \ge c_2 (\mu _1^2 + \mu _2^2)\). Since \(\mu _2, \mu _3 \le 0 \le \mu _4\), we have
$$\begin{aligned} (*) \ge \left( \mu _4 - \mu _3 - \mu _2 (1 + \mu _3) \right) ^2 \ge \mu _3^2. \end{aligned}$$ -
Case 3: \(\mu _1 < 0 \wedge \mu _2 \ge 0\): Here, \(\mu _3 \ge 0\) due to (38) and, thus, \(\mu _4 \le 0\) by (39), which yields for all \(\varepsilon \in (0,\varepsilon _0]\)
$$\begin{aligned}&\mu _2 + \mu _3 + \mu _2 \mu _3 - \mu _4 \ge \mu _3 - \mu _4 = \varepsilon ^{-1}(f_{2,3} + f_{2,4}) \mu _2 - \varepsilon ^{-1}(f_{1,3} + f_{1,4}) \mu _1 \\&\quad \ge \varepsilon _0^{-1}(\underline{C_{1,3}} + \underline{C_{1,4}}) |\mu _1| + \varepsilon _0^{-1}(\underline{C_{2,3}} + \underline{C_{2,4}}) |\mu _2| \ge c_1 (|\mu _1| + |\mu _2|) \end{aligned}$$and \((*) \ge (\mu _2 + \mu _3 + \mu _2 \mu _3 - \mu _4)^2 \ge c_2 (\mu _1^2 + \mu _2^2)\). And as \(\mu _1,\mu _4 \le 0\le \mu _3\), one has
$$\begin{aligned} (*) \ge \left( \mu _3 - \mu _4 - \mu _1 (1 + \mu _4) \right) ^2 \ge \mu _3^2. \end{aligned}$$ -
Case 4: \(\mu _1< 0 \wedge \mu _2 < 0\): Supposing that \(\mu _3 \ge 0\) and thus \(\mu _4 \le 0\) by (39), we observe
$$\begin{aligned} |\mu _1 + \mu _4 + \mu _1 \mu _4 - \mu _3| = \mu _3 - \mu _1 - \mu _4(1 + \mu _1) \ge -\mu _1. \end{aligned}$$Furthermore, \(\mu _3 \ge 0\) enables us to estimate
$$\begin{aligned} f_{1,3} \mu _1 \le f_{2,3} \mu _2 \ \Rightarrow \ \overline{C_{1,3}} \mu _1 \le \underline{C_{2,3}} \mu _2 \ \Rightarrow \ -\mu _1 \ge -\underline{C_{2,3}} / \overline{C_{1,3}} \, \mu _2. \end{aligned}$$and
$$\begin{aligned} |\mu _1 + \mu _4 + \mu _1 \mu _4 - \mu _3| \ge -\mu _1 \ge -\mu _1 / 2 - \underline{C_{2,3}} / ( 2 \, \overline{C_{1,3}} ) \mu _2 \ge c_1 (|\mu _1| + |\mu _2|). \end{aligned}$$Hence, \((*) \ge (\mu _1 + \mu _4 + \mu _1 \mu _4 - \mu _3)^2 \ge c_2 (\mu _1^2 + \mu _2^2)\). The second estimate in terms of \(\mu _3^2\) follows with \(\mu _1, \mu _4 \le 0 \le \mu _3\) from
$$\begin{aligned} (*) \ge \left( \mu _3 - \mu _4 - \mu _1 (1 + \mu _4) \right) ^2 \ge \mu _3^2. \end{aligned}$$In the opposite case that \(\mu _3 < 0\) and thus \(\mu _4 \ge 0\) due to (39), we estimate
$$\begin{aligned} |\mu _2 + \mu _3 + \mu _2 \mu _3 - \mu _4| = \mu _4 - \mu _2 - \mu _3 (1+\mu _2) \ge -\mu _2 \end{aligned}$$and
$$\begin{aligned} f_{2,3} \mu _2 \le f_{1,3} \mu _1 \ \Rightarrow \ \overline{C_{2,3}} \mu _2 \le \underline{C_{1,3}} \mu _1 \ \Rightarrow \ -\mu _2 \ge -\underline{C_{1,3}} / \overline{C_{2,3}} \, \mu _1. \end{aligned}$$We, thus, arrive at
$$\begin{aligned} |\mu _2 + \mu _3 + \mu _2 \mu _3 - \mu _4| \ge -\mu _2 \ge -\mu _2 / 2 - \underline{C_{1,3}} / ( 2 \, \overline{C_{2,3}} ) \mu _1 \ge c_1 (|\mu _1| + |\mu _2|) \end{aligned}$$and \((*) \ge (\mu _2 + \mu _3 + \mu _2 \mu _3 - \mu _4)^2 \ge c_2 (\mu _1^2 + \mu _2^2)\). The corresponding inequality for \(\mu _3\) reads
$$\begin{aligned} (*) \ge \left( \mu _4 - \mu _3 - \mu _2 (1 + \mu _3) \right) ^2 \ge \mu _3^2, \end{aligned}$$which follows from \(\mu _2,\mu _3 \le 0 \le \mu _4\).
The proof of the proposition is now complete. \(\square\)
Notation 5.4
From now on, \(\Vert \cdot \Vert\) without further specification shall always denote the \(L^2\)-norm in \(\Omega\).
Within the subsequent Proposition 5.5, the expressions \((ad-c)^2\) and \((bc-d)^2\) on the right hand side of (35) will be generalised to \(\Vert ad-c\Vert ^2\) and \(\Vert bc-d\Vert ^2\) in (42). We will later show in the proof of Theorem 1.5 that \(\Vert ad-c\Vert ^2\) (and also \(\Vert bc-d\Vert ^2\)) can be estimated from above via the reaction terms within the entropy production (7) when using the special choices \(\sqrt{n/(n_0 \mu _n)}\), \(\sqrt{p/(p_0 \mu _p)}\), \(\sqrt{n_{tr}}\), and \(\sqrt{n_{tr}'}\) for a, b, c, and d.
Proposition 5.5
(Inhomogeneous concentrations) Let \(a, b, c, d: \Omega \rightarrow {\mathbb {R}}\) be measurable, non-negative functions such that their squares satisfy the conservation laws
for any \(\varepsilon \in (0,\varepsilon _0]\) and arbitrary \(\varepsilon _0>0\). In addition, we assume
Then, there exists an explicitly computable constant \(C(\varepsilon _0, n_0, p_0, M, M_1, V) > 0\) such that
for all \(\varepsilon \in (0,\varepsilon _0]\).
Proof
We divide the proof into two steps. In the first part, we shall derive lower bounds for the reaction terms \(\Vert ad - c \Vert ^2 + \Vert bc - d \Vert ^2\) involving \(( \overline{a} \, \overline{d} - \overline{c} )^2 + ( \overline{b} \, \overline{c} - \overline{d} )^2\). This will allow us to apply Proposition 5.3 in the second step.
Step 1: We show
and
with some explicitly computable constant \(c_1>0\). For this reason, we define
and note that \(\overline{\delta _1} = \overline{\delta _2} = \overline{\delta _3} = \overline{\delta _4} = 0\). Moreover,
due to Young’s inequality, \(\overline{a^2}\), \(\overline{b^2} \le C(n_0, p_0, M_1, V)\) and \(\overline{c^2}\), \(\overline{d^2} \le 1\).
We now define
and split the squares of the \(L^2(\Omega )\)-norm as
and
respectively. In order to estimate the first integral in (43) from below, we write
This yields
where we used Young’s inequality \(2xy \ge -x^2/2 - 2y^2\) for \(x,y \in {\mathbb {R}}\) in the second step and the boundedness of \(\delta _i\), \(1 \le i \le 4\), in the last step. Similarly, we deduce
The second integral in (43) is mainly estimated by deriving an upper bound for the measure of \(\Omega \backslash S\). For all \(i \in \{1,\ldots ,4\}\) we have
and, hence,
As a consequence of \(| \overline{a} \, \overline{d} - \overline{c} | \le C(n_0, p_0, M_1, V)\), we obtain
This implies
and, analogously,
Taking the sum of both contributions to (43), we finally arrive at
and
Step 2: We introduce constants \(\mu _i \ge -1\), \(1 \le i \le 4\), such that
We recall that (26) guarantees the uniform positivity and boundedness of \(\nu _\infty\), \(\pi _\infty\), \(\nu _{tr,\infty }\), and \(\nu _{tr,\infty }'\) for all \(\varepsilon \in (0, \varepsilon _0]\) in terms of \(\varepsilon _0\), \(n_0\), \(p_0\), M, and V. Therefore, the bounds \(\overline{a^2}\), \(\overline{b^2} \le C(n_0, p_0, M_1, V)\) and \(\overline{c^2}\), \(\overline{d^2} \le 1\) give rise to a constant \(K(\varepsilon _0, n_0, p_0, M, M_1, V) > 0\) such that \(\mu _i \in [-1,K]\) for all \(1 \le i \le 4\) uniformly for \(\varepsilon \in (0, \varepsilon _0]\).
We now want to derive a formula for \(\overline{a}\) in terms of \(\delta _1\) and \(\mu _1\). Since \(\overline{a^2} - \overline{a}^2 = \Vert a - \overline{a}\Vert ^2 = \Vert \delta _1\Vert ^2 = \overline{\delta _1^2}\), one finds
and analogous expressions for \(\overline{b}\), \(\overline{c}\) and \(\overline{d}\):
Furthermore,
and, similarly,
One observes that the expansions above in terms of \(\overline{\delta _i^2}\) are singular if, e.g., \(\overline{a^2}\) is zero. We therefore distinguish the following two cases.
Case 1: \(\overline{a^2} \ge \kappa ^2 \wedge \overline{b^2} \ge \kappa ^2 \wedge \overline{c^2} \ge \kappa ^2 \wedge \overline{d^2} \ge \kappa ^2\): The constant \(\kappa > 0\) will be chosen according to the calculations in the other Case 2. Here, we have
and
for all \(\varepsilon \in (0, \varepsilon _0]\) due to the bounds on \(\nu _\infty\) and \(\pi _\infty\) from (26). Equation (46) further implies
with an explicit constant \(c_2\) thanks to \(\nu _\infty \nu _{tr,\infty }' = \nu _{tr,\infty }\) (cf. (25)) and \(|\mu _i|, \overline{\delta _i^2} \le c_1(\varepsilon _0, n_0, p_0, M, M_1, V)\). In a similar fashion using \(\pi _\infty \nu _{tr,\infty } = \nu _{tr,\infty }'\), one obtains
In order to finish the proof, it is—according to Step 1—sufficient to show that
for appropriate constants \(C_1, C_2 > 0\). But due to Step 2 it is sufficient to show that for suitable constants \(C_1, C_2 > 0\),
Collecting all \(\overline{\delta _i^2}\)-terms on the right hand side, one only has to prove that
or, equivalently,
In order to verify (47), we start with the estimate
and a corresponding one involving b. The last term on the right hand side satisfies
due to Lemma 3.5 with a constant \(c(\kappa , n_0, p_0, M_1, V) > 0\). Similarly,
Proposition 5.3 (with \(a^2\), \(b^2\), \(c^2\), and \(d^2\) therein replaced by \(\overline{\mu _n a^2}/\overline{\mu _n}\), \(\overline{\mu _p b^2}/\overline{\mu _p}\), \(\overline{c^2}\), and \(\overline{d^2}\)) tells us that there exists an explicitly computable constant \(C(\varepsilon _0, n_0, p_0, M, M_1, V)>0\) such that
for all \(\varepsilon \in (0, \varepsilon _0]\). Using an analogue expansion as before, we further deduce with \(\overline{d^2} \le 1\),
As a corresponding estimate holds true also for the other expression on the right hand side of (48), we have shown that there exists a constant \(C_1(\kappa , \varepsilon _0, n_0, p_0, M, M_1, V)>0\) independent of \(\varepsilon\) for \(\varepsilon \in (0, \varepsilon _0]\) such that
Choosing \(C_2>0\) now sufficiently large, Eq. (47) holds true.
Case 2: \(\overline{a^2}< \kappa ^2 \vee \overline{b^2}< \kappa ^2 \vee \overline{c^2}< \kappa ^2 \vee \overline{d^2} < \kappa ^2\): In this case, we will not need Proposition 5.3 and we shall directly prove (42) employing only the result of Step 1. In fact, for \(\kappa\) chosen sufficiently small, the states considered in Case 2 are necessarily bounded away from the equilibrium and the following arguments show that consequentially the right hand side of (42) is also bounded away from zero, which allows to close the estimate (42). As a result of the hypotheses \(\overline{a^2}, \, \overline{b^2} \le C(n_0, p_0, M_1, V)\) and \(\overline{c^2}, \, \overline{d^2} \le 1\), we use Young’s inequality to estimate \(\overline{a}, \overline{b}, \overline{c}, \overline{d} \le c(n_0, p_0, M_1, V)\) and
with \(C > 0\) uniformly for \(\varepsilon \in (0, \varepsilon _0]\). We stress that the subsequent cases are not necessarily exclusive.
Case 2.1: \(\overline{c^2} < \kappa ^2\): First, \(\overline{c} = \sqrt{\overline{c}^2} \le \sqrt{\overline{c^2}} < \kappa\). This yields
For \(\kappa > 0\) sufficiently small, we then have \(0 < 1 - C(n_0, p_0, M_1, V) \kappa - \kappa ^2 \le ( \overline{b} \, \overline{c} - \overline{d} )^2 + \overline{\delta _4^2}\) and, hence,
by (45) with some \(K(\kappa , \varepsilon _0, n_0, p_0, M, M_1, V) > 0\). Let us call the parameter \(\kappa\) from above \(\kappa _c\).
Case 2.2: \(\overline{d^2} < \kappa ^2\): Now \(\overline{d} = \sqrt{\overline{d}^2} \le \sqrt{\overline{d^2}} < \kappa\) and
Again \(\kappa > 0\) sufficiently small gives rise to \(0 < 1 - C(n_0, p_0, M_1, V) \kappa - \kappa ^2 \le ( \overline{a} \, \overline{d} - \overline{c} )^2 + \overline{\delta _3^2}\) and
for some constant \(K(\kappa , \varepsilon _0, n_0, p_0, M, M_1, V) > 0\) using (44). This \(\kappa > 0\) shall be denoted by \(\kappa _d\).
Case 2.3: \(\overline{a^2} < \kappa ^2\): We first notice that \(\overline{a} < \kappa\) and \(2 \, \overline{c} \, \overline{d} \le \overline{c}^2 + \overline{d}^2 \le \overline{c^2} + \overline{d^2} = 1\). Now, we choose \(\kappa _a := \kappa > 0\) sufficiently small such that \(2 \kappa < \kappa _c^2\). Then, if \(\overline{c^2} < 2 \kappa\), we have \(\overline{c^2} < \kappa _c^2\), and the estimate
with \(K(\kappa , \varepsilon _0, n_0, p_0, M, M_1, V) > 0\) immediately follows from Case 2.1. And if \(\overline{c^2} \ge 2 \kappa\), then
Consequently, \(0 < \kappa \le ( \overline{a} \, \overline{d} - \overline{c} )^2 + \overline{\delta _3^2}\) and
due to (44) with a constant \(K(\kappa , \varepsilon _0, n_0, p_0, M, M_1, V) > 0\).
Case 2.4: \(\overline{b^2} < \kappa ^2\): Again \(\overline{b} < \kappa\) and \(2 \, \overline{c} \, \overline{d} \le \overline{c}^2 + \overline{d}^2 \le \overline{c^2} + \overline{d^2} = 1\). Here, we choose \(\kappa _b := \kappa > 0\) sufficiently small such that \(2\kappa < \kappa _d^2\). If \(\overline{d^2} < 2\kappa\), we have \(\overline{d^2} < \kappa _d^2\), and due to Case 2.2 there exists some \(K(\kappa , \varepsilon _0, n_0, p_0, M, M_1, V) > 0\) such that
If \(\overline{d^2} \ge 2 \kappa\), then
This implies \(0 < \kappa \le ( \overline{b} \, \overline{c} - \overline{d} )^2 + \overline{\delta _4^2}\) and
with \(K(\kappa , \varepsilon _0, n_0, p_0, M, M_1, V) > 0\) employing (45).
All arguments within Step 2 remain valid, if we finally set \(\kappa := \min (\kappa _a, \kappa _b, \kappa _c, \kappa _d)\). We also observe that the constants \(K > 0\) above are independent of \(\varepsilon \in (0, \varepsilon _0]\). And since \(\kappa\) only depends on \(n_0\), \(p_0\), \(M_1\) and V, we may skip the explicit dependence of \(C_2\) on \(\kappa\) at the end of Case 1. This finishes the proof. \(\square\)
We already pointed out that \(\Vert ad - c\Vert ^2\) and \(\Vert bc - d\Vert ^2\) can be controlled by the reaction terms of the entropy production, if we replace a, b, c, d by \(\sqrt{n/(n_0 \mu _n)}\), \(\sqrt{p/(p_0 \mu _p)}\), \(\sqrt{n_{tr}}\) and \(\sqrt{n_{tr}'}\) (see the proof of Theorem 1.5 in Sect. 6 for details). In this proof, also \(\Vert \nabla a\Vert ^2\), \(\Vert \nabla b\Vert ^2\), \(\Vert a - \overline{a}\Vert ^2\) and \(\Vert b - \overline{b}\Vert ^2\) may be bounded by the entropy production. However, \(\Vert c - \overline{c}\Vert ^2\) and \(\Vert d - \overline{d}\Vert ^2\) may not be estimated with the help of Poincaré’s inequality since this would yield terms involving \(\nabla n_{tr}\), which do not appear in the entropy production.
Instead, we are able to derive the following estimates for \(\Vert c - \overline{c}\Vert ^2\) and \(\Vert d - \overline{d}\Vert ^2\), which describe an indirect diffusion transfer from c to b and from d to a, respectively: Even if c and d are lacking an explicit diffusion term in the dynamical equations, they do experience indirect diffusive effects thanks to the reversible reaction dynamics and the diffusivity of a and b. This is the interpretation of the following functional inequalities.
Proposition 5.6
(Indirect diffusion transfer) Let \(a,b,c,d:\Omega \rightarrow {\mathbb {R}}\) be non-negative functions such that
holds true a.e. in \(\Omega\). Then,
Proof
We only verify the second inequality; the first one can be checked along the same lines. First, we notice that
because of the bound \(0 \le d \le 1\). Besides, we deduce
employing \(0 \le c,d \le 1\). For the subsequent estimates, we need two auxiliary inequalities: For every function \(f: \Omega \rightarrow {\mathbb {R}}\) and all \(\lambda \in {\mathbb {R}}\), we have
And for all \(x \ge 0\), one has
Since \(c^2 + d^2 = 1\), we obtain
where we applied (50) in the last step. Consequently,
and
using (49). \(\square\)
6 EEP-inequality and convergence to the equilibrium
We are now prepared to prove Theorem 1.5.
Proof of Theorem 1.5
Let \((n,p,n_{tr}) \in L^1(\Omega )^3\) be non-negative functions satisfying \(n_{tr} \le 1\), the conservation law \(\overline{n} - \overline{p} + \varepsilon \overline{n_{tr}} = M\) and the \(L^1\)-bound \(\overline{n}, \overline{p} \le M_1\). Keeping in mind that \(\nu _\infty = \sqrt{n_*/n_0}\) and \(\pi _\infty = \sqrt{p_*/p_0}\) (cf. Notation 5.1), Proposition 4.4 guarantees that there exists a positive constant \(C_1(\gamma , \Gamma , M_1) > 0\) such that
Next, we have to bound the second line of (51) in terms of the entropy production. To this end, we apply Proposition 5.5 with the choices \(a := \sqrt{n/(n_0 \mu _n)}\), \(b := \sqrt{p/(p_0 \mu _p)}\), \(c := \sqrt{n_{tr}}\) and \(d := \sqrt{n_{tr}'}\) (as always \(n_{tr}' = 1 - n_{tr}\)). The hypotheses of this proposition are fulfilled as a consequence of the conservation law \(\overline{n} - \overline{p} + \varepsilon \overline{n_{tr}} = M\) and the \(L^1\)-bound \(\overline{n}, \overline{p} \le M_1\). As a result, we obtain
for all \(\varepsilon \in (0, \varepsilon _0]\) with a constant \(C_2(\varepsilon _0, n_0, p_0, M, M_1, V) > 0\). Thanks to Poincaré’s inequality, we are able to bound the second and third line on the right hand side from above:
and
Moreover, the elementary inequality \((\sqrt{x} - 1)^2 \le (x-1) \ln (x)\) for \(x>0\) gives rise to
and similarly
Proposition 5.6 further implies that
Combining the above estimates, we arrive at
with a constant \(C_3(\varepsilon _0, \tau _n, \tau _p, n_0, p_0, M, M_1, V) > 0\) uniformly for \(\varepsilon \in (0, \varepsilon _0]\). With respect to (51), we now find
And since the constant \(C_1\) in (51) only depends on \(\varepsilon _0\), \(n_0\), \(p_0\), M, \(M_1\) and V (via the constants \(\gamma\) and \(\Gamma\)), we have finally proven that
for a constant \(C_4(\varepsilon _0, \tau _n, \tau _p, n_0, p_0, M, M_1, V) > 0\) independent of \(\varepsilon \in (0, \varepsilon _0]\). \(\square\)
Theorem 1.5 provides an upper bound for the relative entropy in terms of the entropy production. This already implies exponential convergence of the relative entropy. The subsequent proposition now yields a lower bound for the relative entropy involving the \(L^1\)-distance of the solution to the equilibrium. This will allow us to establish exponential convergence in \(L^1\).
Proposition 6.1
(Csiszár–Kullback–Pinsker inequality) Let \(\varepsilon _0\), \(n_0\), \(p_0\), M, \(M_1\) and V be positive constants. Then, there exists an explicit constant \(C_{\mathrm {CKP}} > 0\) such that for all \(\varepsilon \in (0, \varepsilon _0]\), the equilibrium \((n_\infty , p_\infty , n_{tr,\infty }) \in X\) from Theorem 2.1 and all non-negative functions \((n,p,n_{tr}) \in L^1(\Omega )^3\) satisfying \(n_{tr} \le 1\), the conservation law
and the \(L^1\)-bound
the following Csiszár–Kullback–Pinsker-type inequality holds true:
Proof
Due to Lemma 3.2, we know that the relative entropy reads
Similar to Proposition 4.4, we employ the mean-value theorem and observe that
for all \(s \in (0,1)\). Thus, there exists some \(\sigma (s)\) between \(n_{tr,\infty }\) and s such that
Moreover, we utilise the Csiszár–Kullback–Pinsker-inequality from Lemma 3.3 to estimate
where \(c(\varepsilon _0, n_0, p_0, M, M_1, V) > 0\) is a positive constant independent of \(\varepsilon \in (0, \varepsilon _0]\). As a corresponding estimate holds true also for p, we have verified that
for some \(C(\varepsilon _0, n_0, p_0, M, M_1, V) > 0\) uniformly for \(\varepsilon \in (0, \varepsilon _0]\). \(\square\)
Now, we are able to prove exponential convergence in relative entropy and \(L^1\).
Proof of Theorem 1.3
We first prove exponential convergence of the relative entropy
using a Gronwall argument as stated in [25]. To this end, we choose \(0< t_0 \le t_1 \le t < T\) and rewrite the entropy production law as
where we applied Theorem 1.5 with \(K := C_{\mathrm {EEP}}^{-1}\) in the second step. Furthermore, we set
and obtain from (52) the estimate \(K \Psi (t_1) \le \psi (t_1) - \psi (t)\) which yields
Integrating this inequality from \(t_1 = t_0\) to \(t_1 = t\) and observing that \(\Psi (t) = 0\) gives rise to
As a consequence of (52) with \(t_1 = t_0\), one has \(- \Psi (t_0) \ge (\psi (t) - \psi (t_0)) / K\) and, hence,
But this is equivalent to
for all \(t\ge t_0>0\). In order to conclude that
for all \(t \ge 0\), we observe that the rate K is independent of \(t_0\) and that the entropy \(E(n,p,n_{tr})(t_0)\) extends in (53) continuously to \(t_0 \rightarrow 0\) since \(n,p \in C([0,T);L^2(\Omega ))\) for all \(T>0\) by Theorem 1.1. This results in the announced exponential decay of the relative entropy, while the exponential convergence in \(L^1\) follows from Proposition 6.1. \(\square\)
Proof of Corollary 1.8
We first prove that the linearly growing \(L^{\infty }\)-bounds together with parabolic regularity for system (1) and assumption (4) entail polynomially growing \(W^{1,q}\)-bounds, \(q \in (1,\infty )\), for n and p. To this end, we consider
and introduce the variable \(w=n \, e^{V_n}\). We observe that \(\nabla \cdot J_n = \nabla \cdot \big (e^{-V_n} \nabla w \big )=e^{-V_n}\left( \Delta w - \nabla V_n \cdot \nabla w\right)\) and thus,
Under the assumptions of Corollary 1.8, this equation is of the form
where \(f_i \in C([0, \infty ), L^\infty (\Omega ))\) for \(i \in \{1,2\}\), \(f_3 \in C([0, \infty ), L^\infty (\Omega )^m)\) and \({\hat{n}} \cdot \nabla w = 0\) on \(\partial \Omega\). Testing this equation with \(-(q-1) |\nabla w|^{q-2} \Delta w\) yields
Using the inequalities \(|ab| \le (a^2 + b^2)/2\) and \((a+b+c)^2 \le 3(a^2 + b^2 + c^2)\) for \(a,b,c \in {\mathbb {R}}\), we find
Together with \(C > 0\) satisfying \(|f_i(t,x)^2| \le C\) for all \(i \in \{1,2,3\}\), \(t \ge 0\) and a.e. \(x \in \Omega\), we derive
An integration by parts and Young’s inequality with \(C_1(C, q) > 0\) give rise to
Hence, there exists a constant \(C_2(C, q) > 0\) such that
where \(A, B > 0\) result from the linearly growing \(L^\infty\)-bounds from (11). For any fixed \(t_0 > 0\) and all \(t \ge t_0\), we now have
A Gronwall lemma (see e.g. [2]) now proves the desired polynomial growth of \(\Vert \nabla w \Vert _{L^q(\Omega )}\) and \(\Vert \nabla n \Vert _{L^q(\Omega )}\):
Next, we use the Gagliardo–Nirenberg–Moser interpolation inequality in \({\mathbb {R}}^m\), \(m \ge 1\) (see e.g. [23]):
Then, interpolating with the exponentially decaying \(L^1\)-norm of \(n-n_\infty\), we obtain
due to the exponential convergence to equilibrium (15). The estimate for p follows in the same way. \(\square\)
Proof of Corollary 1.9
We first notice that exponential convergence of n and p in \(L^q(\Omega )\), \(1< q < \infty\), immediately follows from Theorem 1.3 and Corollary 1.8 via
and an analogous estimate for p where \(0< C_q, K_q < \infty\) are constants independent of \(\varepsilon \in (0, \varepsilon _0]\). Reusing the Gagliardo–Nirenberg–Moser interpolation inequality in \({\mathbb {R}}^m\), \(m \ge 1\), from (56) and the polynomial bound on the growth of \(\Vert \nabla n (t) \Vert _{L^{2m}(\Omega )}\) from (55), we derive
thus, establishing exponential convergence of n and p in \(L^\infty (\Omega )\). Concerning the convergence of \(n_{tr}\), we recall the following identities from (25) and (22):
We abbreviate \(u {:}{=}n_{tr} - n_{tr,\infty }\) and calculate (pointwise in x) by adding and subtracting \(n_{tr,\infty }\) and \(n_{tr}\) multiple times and by using the relations (58):
Hence, due to \(0\le n_{tr}(t, x) \le 1\) for all \(t \ge 0\) and a.e. \(x \in \Omega\), \(\mu _n = e^{-V_n}\), \(\mu _p = e^{-V_p}\), and \(V = \max ( \Vert V_n \Vert _{L^\infty (\Omega )}, \Vert V_p \Vert _{L^\infty (\Omega )})\), we estimate with (57)
where \(C {:}{=}C_m e^V \left( (\tau _p p_0)^{-1} + (\tau _n n_0)^{-1}\right)\). For the following calculation, we assume w.l.o.g. \(2 \varepsilon _0 K_m \le K\). Hence, using \(\Vert u(0, \cdot )\Vert _{L^\infty (\Omega )} \le 1\), we arrive at
where the last bound is independent of \(\varepsilon\). \(\square\)
7 A limiting entropy method for system (20)
The existence theory of the Shockley–Read–Hall model applies classical methods (see e.g. [18]) and can be carried out analogously to Theorem 1.1. Therefore, we state here the corresponding results without proof.
Theorem 7.1
(Shockley–Read–Hall for \(\varepsilon = 0\)) Under the assumptions of Theorem 1.1, there exists a unique non-negative global weak solution \((n, p) \in \left( C([0, T], L^2(\Omega ))\cap W_2(0,T) \cap L^\infty ((0, T), L^\infty (\Omega ))\right) ^2,\) of system (20) for all \(T \in (0, \infty )\) satisfying the boundary conditions (2).
Moreover, there exist positive constants \(C_n(\Vert n_I\Vert _{L^\infty (\Omega )},V_n)\), \(C_p(\Vert p_I\Vert _{L^\infty (\Omega )},V_p)\) and \(K_n(V_n)\), \(K_p(V_p)\) such that
Finally, there exist positive constants \(\mu\), \(\Gamma\), \(\theta >0\) (depending on \(\tau\), \(C_n\), \(C_p\), \(K_n\), \(K_p\), \(V_n\), \(V_p\)) such that
where \(\mu \tau = \frac{\Gamma }{1+\theta \tau }\) such that the bounds \(\mu t\) and \(\Gamma /(1 + \theta t)\) intersect at time \(\tau\).
The entropy functional (5) extends continuously to the limit \(\varepsilon = 0\):
which is again an entropy (the free energy) functional of the Shockley–Read–Hall model (20). The corresponding entropy production (free energy dissipation) functional reads as
Next, we recall from the introduction \(n^{qssa}_{tr}=n^{qssa}_{tr}(n,p)\) such that \(R_n(n, n_{tr}^{qssa}) = R_p(p, n_{tr}^{qssa})\), i.e.
\(n^{qssa}_{tr}(n,p)\) denotes the pointwise equilibrium value of the trapped states in (1) for fixed n and p, which corresponds to the quasi-steady-state approximation \(\varepsilon = 0\).
Moreover, we observe that the Shockley–Read–Hall entropy production functional (61) can be identified as the entropy production functional \(P(n, p, n^{qssa}_{tr})\) as given in (7) along trajectories of (1) with \(\varepsilon = 0\) when \(n_{tr}\equiv n^{qssa}_{tr}(n,p)\):
where one uses \(R=R_n=R_p\) at \(n_{tr}=n^{qssa}_{tr}\) and that the involved integrals are finite.
Analogously to Theorem 2.1, there exists a unique equilibrium \((n_{\infty ,0}, p_{\infty ,0}) \in X_0\) in the case \(\varepsilon = 0\), where
This equilibrium reads
where \(n_{*,0}, p_{*,0}>0\) are uniquely determined by \(n_{*,0} p_{*,0} = n_0 p_0\) and \(n_{*,0} \overline{\mu _n} - p_{*,0} \overline{\mu _p} = M\).
We are now in the position to formulate the EEP-inequality.
Theorem 7.2
(Entropy–Entropy Production Inequality for \(\varepsilon = 0\)) Let \(\tau _n\), \(\tau _p\), \(n_0\), \(p_0\), \(M_1\) and V be positive constants. Consider \(M\in {\mathbb {R}}\) and the correspondingly unique equilibrium \((n_{\infty ,0}, p_{\infty ,0}) \in X_0\).
Then, the following EEP-inequality holds true for all non-negative functions \((n,p) \in L^1(\Omega )^2\) satisfying the conservation law \(\overline{n} - \overline{p} = M,\) the \(L^1\)-bound \(\overline{n}, \overline{p} < M_1\) as well as the conditions \(E_0(n, p)<\infty\), \(P_0(n, p)<\infty\), \(P(n, p, n^{qssa}_{tr}) < \infty\) for some \(\varepsilon _0 > 0\):
where \(C_{\mathrm {EEP}} > 0\) is the same constant as in Theorem 1.5.
Theorem 7.3
(Exponential convergence for \(\varepsilon = 0\)) Let (n, p) be a global weak solution of system (20) as given in Theorem 7.1 corresponding to the non-negative initial data \((n_I,p_I) \in L^\infty (\Omega )^2\). Then, this solution satisfies the entropy production law
for all \(0< t_0 \le t_1 < \infty\).
Moreover, the following versions of the exponential decay towards the equilibrium \((n_{\infty ,0}, p_{\infty ,0}) \in X_0\) hold true:
and
where \(C := C_{\mathrm {CKP}}^{-1}\) and \(K := C_{\mathrm {EEP}}^{-1}\) are the same constants as in Theorem 1.3. Moreover, \(E_I\) and \(E_\infty\) denote the initial entropy of the system and the entropy in the equilibrium, respectively.
Remark 7.4
We believe that the entropy–entropy production inequality (64) can also be proven by combining estimates of Sect. 5 with previous works on the entropy method for detailed balanced reaction–diffusion models, see e.g. [5, 7, 10, 19]. We emphasise, however, that our goal with Theorem 7.2 is to be able to derive an entropy–entropy production inequality via the fast-reaction parameter \(\varepsilon \rightarrow 0\).
Finally, in the same way as for strictly positive \(\varepsilon > 0\), we can derive uniform-in-time \(L^\infty\)-bounds for n and p also in the case \(\varepsilon = 0\). As before, these bounds further improve the lower bounds on n and p.
Corollary 7.5
There exists a constant \(Z > 0\) such that
And for all \(\tau >0\) there exist sufficiently small constants \(\mu , \Gamma > 0\) such that
for all \(t\ge 0\) and a.e. \(x\in \Omega\), where \(\mu \tau =\Gamma\) such that the bounds \(\mu t\) and \(\Gamma\) intersect at time \(\tau > 0\).
Corollary 7.6
Under the hypotheses of Theorem 7.3, there exist constants \(0< C, K < \infty\) such that
holds true for all \(t \ge 0\).
Proof of Theorem 1.5
Our goal is to derive an estimate of the form
by applying the EEP-inequality from Theorem 1.5 directly to the functions n, p and \(n^{qssa}_{tr}\). However, since we assume that n and p satisfy
the triple \((n, p, n^{qssa}_{tr})\) does not satisfy the conservation law with right hand side M but
In order to resolve this issue, we shall apply the EEP-inequality from Theorem 1.5 to a suitably defined sequence of functions \((n_\varepsilon , p_\varepsilon , n_{tr,\varepsilon }) \in L^1(\Omega )^3\) which fulfil \(\Vert n_{tr,\varepsilon } \Vert _{L^\infty (\Omega )} \le 1\), the \(L^1\)-bound \(\overline{n_\varepsilon }, \overline{p_\varepsilon } \le M_1\) and the conservation law
A convenient choice is \(n_\varepsilon {:}{=}n\), \(p_\varepsilon {:}{=}p + \varepsilon \overline{n_{tr}^{qssa}}\) and \(n_{tr,\varepsilon } {:}{=}n^{qssa}_{tr}\), where \(n^{qssa}_{tr}=n^{qssa}_{tr}(n,p)\) as defined in (62). For this choice, we derive the stated EEP-estimate for the case \(\varepsilon = 0\) via the following steps, which are proven below:
We recall that n and p are assumed to satisfy \(E_0(n, p)<\infty\) and \(P_0(n, p),P(n, p, n^{qssa}_{tr})<\infty\), which implies that \(P_0(n, p)=P(n, p, n^{qssa}_{tr})\) as discussed in the introduction.
Step 1. Proof of (69): We first show, that with \((n_\varepsilon , p_\varepsilon , n_{tr,\varepsilon })=(n, p_\varepsilon , n^{qssa}_{tr})\)
Recalling that
we first notice that \(p_\varepsilon =p + \varepsilon \overline{n_{tr}^{qssa}} \rightarrow p\) monotonically decreasing for \(\varepsilon \rightarrow 0\) for all \(x\in \Omega\). Thus, by using \(\overline{n_{tr}^{qssa}} \le 1\) and the elementary estimate \(p_\varepsilon \ln p_\varepsilon \le 2 p\, (\ln p + \ln 2)\) for \(p\ge \max \{\varepsilon _0,1\}\), the Lebesgue dominated convergence theorem, the \(L^1\)-bounds \(\overline{n},\overline{n_\varepsilon }, \overline{p},\overline{p_\varepsilon } \le M_1\) and \(E_0(n, p)<\infty\) imply the convergence of the \(p_\varepsilon\)-integral in (72). The convergence of the third integral follows directly from
Using analogue arguments, the convergence
follows from observing the monotone convergence \(n_*\rightarrow n_{*,0}\) and \(p_*\rightarrow p_{*,0}\) for \(\varepsilon \rightarrow 0\) due to (27) in the proof of Theorem 2.1, which directly implies the monotone convergence \(n_\infty \rightarrow n_{\infty ,0}\) and \(p_\infty \rightarrow p_{\infty ,0}\) for all \(x\in \Omega\), where \((n_\infty , p_\infty , n_{tr,\infty })\) and \((n_{\infty ,0}, p_{\infty ,0})\) are defined in (22) and (63), respectively.
Step 2. Proof of (70): We observe that the functions \((n_\varepsilon , p_\varepsilon , n_{tr,\varepsilon }) = (n, p + \varepsilon \overline{n_{tr}^{qssa}}, n^{qssa}_{tr}) \in L^1(\Omega )^3\) satisfy \(\Vert n_{tr,\varepsilon } \Vert _{L^\infty (\Omega )} \le 1\), the conservation law
as well as the \(L^1\)-bounds \(\overline{n}_{\varepsilon } \le M_1\) and \(\overline{p_\varepsilon } \le \overline{p} + \varepsilon '\) where \(\varepsilon \in (0, \varepsilon '] \subset (0, \varepsilon _0]\). Because of \(\overline{p} < M_1\), we have \(\overline{p_\varepsilon } \le M_1\) for \(\varepsilon ' > 0\) sufficiently small. As a consequence,
where \(C_{EEP} > 0\) is the same constant as in Theorem 1.5.
Step 3. Proof of (71): As the constant \(C_{EEP} > 0\) is independent of \(\varepsilon \in (0, \varepsilon _0]\), it suffices to show that
To this end, we consider the representation
where we have already taken into account that \(n_\varepsilon =n\), \(\nabla p_\varepsilon = \nabla p\) and \(n_{tr,\varepsilon }=n^{qssa}_{tr}\) for all \(\varepsilon > 0\).
We note first that the convergence of the second, third and forth integral follows from the pointwise convergence of \(p_{\varepsilon }\) for all \(x\in \Omega\) and from the Lebesgue dominated convergence theorem by estimating
where the function on the right hand side is integrable due to the finiteness of \(P(n, p, n^{qssa}_{tr})\).
Secondly, the product
converges pointwise for all \(x\in \Omega\) as \(\varepsilon \rightarrow 0\). In order to conclude the convergence of the corresponding integral via the Lebesgue dominated convergence theorem, we use similar to Step 1 the elementary inequality \(p_\varepsilon \ln p_\varepsilon \le 2 p\, (\ln p + \ln 2)\) for \(p\ge \max \{\varepsilon _0,1\}\) and the finiteness of \(P(n, p, n^{qssa}_{tr})\). This yields
and therefore, \(P(n_\varepsilon , p_\varepsilon , n_{tr,\varepsilon }) \rightarrow P(n, p, n^{qssa}_{tr})\) for \(\varepsilon \rightarrow 0\). \(\square\)
Proof of Theorem 1.3
We only have to check that the assumptions on the finiteness of the entropy \(E_0\) and the entropy production functionals \(P_0\) and P within Theorem 2 are satisfied. The claim of this theorem then follows from the same arguments as in the proof of Theorem 1.3.
Due to the uniform \(L^\infty\)-bounds (59) of n(t) and p(t) for all \(t \ge 0\), we know that \(E_0(n, p) < \infty\) for all \(t \ge 0\). Similarly, we deduce that \(P(n, p, n^{qssa}_{tr})\) and \(P_0(n, p)\) are finite for all strictly positive \(t > 0\) since n, p are bounded away from zero and \(n^{qssa}_{tr}\) is bounded away from zero and one uniformly in \(\Omega\).
Finally, the lower bounds (60) guarantee similar to Theorem 1.5 that solutions satisfy the weak entropy production law (65) for all \(t_0 > 0\). \(\square\)
8 Conclusion
We have investigated the drift–diffusion–recombination system (1) modelling the transport, generation and annihilation of negatively charged electrons and positively charged holes (vacancies of electrons) in certain types of semiconductors. As depicted in Fig. 1, we have considered a two-level system augmented by an additional intermediate energy level, the so-called trap level, which results from the presence of foreign atoms inside the crystal of the semiconductor. We have derived an entropy–entropy production (EEP) inequality (cf. Theorem 1.5) which bounds the entropy functional (5) from above in terms of the entropy production functional (7). This EEP-inequality has then be used to show that the concentrations of electrons and holes converge to their equilibrium distributions at an exponential rate as time tends to infinity (cf. Theorem 1.3).
A novel achievement of our studies is the fact that the entropy method has been applied uniformly in a small time-related parameter. More precisely, the constant \(C_\mathrm {EEP}\) in Theorem 1.5 is independent of the lifetime \(\varepsilon\) of electrons on the trap level (cf. (1)) provided \(\varepsilon \in (0, \varepsilon _0]\) for some \(\varepsilon _0 > 0\). The \(\varepsilon\)-independence of \(C_\mathrm {EEP}\) transfers to the constants appearing in the exponential decay estimate in Theorem 1.3. This proves that the exponential convergence rate is independent of a quasi-steady-state approximation of the electrons on the trap level, which leads to the famous Shockley–Read–Hall recombination model [16, 21].
In particular, we were able to derive an EEP-inequality and the convergence estimate for the limiting Shockley–Read–Hall model. This fact is notable from a conceptual point of view as we transfer the results for \(\varepsilon > 0\) to the case \(\varepsilon = 0\) by performing the limit \(\varepsilon \rightarrow 0\). We believe that our limiting approach to the Shockley–Read–Hall model may serve as an example for possible applications of this technique to fast-reaction limits and quasi-steady-state approximations.
In view of the technicalities of the proofs and the resulting length of the current paper, our results are still limited by not taking into account the self-consistent potential generated by electrons and holes, which is required by a physically more precise model. However, this leads to an additional coupling of (1) to Poisson’s equation and a further increase in complexity of the problem. We expect however to resolve these issues in a future work by combining techniques and results presented in the current paper with ideas in [9], which considered a self-consistent Shockley–Read–Hall model without trapped states.
References
Alikakos, N.D.: ${L}^p$ bounds of solutions of reaction–diffusion equations. Commun. Partial Differ. Equ. 4(8), 827–868 (1979)
Beesak, P.R.: Gronwall Inequalities, vol. 11, Carleton Math. Lecture Notes (1975)
Chipot, M.: Elements of Nonlinear Analysis. Birkhäuser Advanced Texts. Birkhäuser, Basel (2000)
Desvillettes, L., Fellner, K.: Entropy methods for reaction–diffusion equations: degenerate diffusion. DCDS Supplements Special, pp. 304–312 (2007)
Desvillettes, L., Fellner, K.: Entropy methods for reaction–diffusion equations: slowly growing a priori bounds. Rev. Mat. Iberoam. 24, 407–431 (2008)
Desvillettes, L., Fellner, K.: Exponential convergence to equilibrium for a nonlinear reaction–diffusion system arising in reversible chemistry. Syst. Model. Optim. IFIP AICT 443, 96–104 (2014)
Di Francesco, M., Fellner, K., Markowich, P.: The entropy dissipation method for inhomogeneous reaction–diffusion systems. Proc. R. Soc. A 464, 3272–3300 (2008)
Einav, A., Morgan, J., Tang, B.Q.: Indirect diffusion effect in degenerate reaction–diffusion systems. arxiv:2001.00852v1
Fellner, K., Kniely, M.: On the entropy method and exponential convergence to equilibrium for a recombination-drift-diffusion system with self-consistent potential. Appl. Math. Lett. 79, 196–204 (2018)
Fellner, K., Tang, B.Q.: Explicit exponential convergence to equilibrium for nonlinear reaction–diffusion systems with detailed balance condition. Nonlinear Anal. 159, 145–180 (2017)
Fellner, K., Rosenberger, S., Tang, B.Q.: Quasi-steady-state approximation and numerical simulation for a volume-surface reaction–diffusion system. Commun. Math. Sci. 14, 1553–1580 (2016)
Fellner, K., Prager, W., Tang, B.Q.: The entropy method for reaction–diffusion systems without detailed balance: first order chemical reaction networks. Kinet. Relat. Mod. 10, 1055–1087 (2017)
Fellner, K., Latos, E., Tang, B.Q.: Global regularity and convergence to equilibrium of reaction–diffusion systems with nonlinear diffusion. J. Evol. Equ. (2019)
Gilbarg, D., Trudinger, N.S.: Elliptic Partial Differential Equations of Second Order. Springer, Berlin (1977)
Goudon, T., Miljanović, V., Schmeiser, C.: On the Shockley–Read–Hall model: generation-recombination in semiconductors. SIAM J. Appl. Math. 67, 1183–1201 (2007)
Hall, R.N.: Electron-hole recombination in Germanium. Phys. Rev. 87, 387 (1952)
Hsieh, C.-Y., Lin, T.-C.: Exponential decay estimates for the stability of boundary layer solutions to Poisson–Nernst–Planck systems: one spatial dimension case. SIAM J. Math. Anal. 47, 3442–3465 (2015)
Markowich, P.A., Ringhofer, C.A., Schmeiser, C.: Semiconductor Equations. Springer, Wien (1990)
Mielke, A., Haskovec, J., Markowich, P.A.: On uniform decay of the entropy for reaction–diffusion systems. J. Dyn. Differ. Equ. 27, 897–928 (2015)
Mock, M.S.: Analysis of Mathematical Models of Semiconductor Devices. Advances in Numerical Computation Series, vol. 3. Boole Press, Dublin (1983)
Shockley, W., Read, W.T.: Statistics of the recombinations of holes and electrons. Phys. Rev. 87, 835–842 (1952)
Stroock, D.: Logarithmic Sobolev inequalities for Gibbs states. Lect. Notes Math. 1563, 194–228 (1993)
Taylor, M.E.: Partial Differential Equation III—Nonlinear Equations. Springer Series Applied Mathematical Sciences, vol. 117. Springer, New York (1996)
Van Roosbroeck, W.: Theory of the flow of electrons and holes in Germanium and other semiconductors. Bell Syst. Tech. J. 29, 560–607 (1950)
Willett, D.: A linear generalization of Gronwall’s inequality. Proc. Am. Math. Soc. 16, 774–778 (1965)
Wu, H., Markowich, P.A., Zheng, S.: Global existence and asymptotic behavior for a semiconductor drift-diffusion-Poisson model. Math. Models Methods Appl. Sci. 18, 443–487 (2008)
Acknowledgements
Open access funding provided by Austrian Science Fund (FWF).
Funding
The second author has been supported by the International Research Training Group IGDK 1754 “Optimization and Numerical Analysis for Partial Differential Equations with Nonsmooth Structures”, funded by the German Research Council (DFG) and the Austrian Science Fund (FWF) under grant number [W 1244-N18].
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
Author Klemens Fellner declares that he has no conflict of interest. Author Michael Kniely declares that he has no conflict of interest.
Ethical approval
This article does not contain any studies with human participants or animals performed by any of the authors.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix: Proof of the existence theorem
Appendix: Proof of the existence theorem
Proof of Theorem 1.1
In order to simplify the notation, we set the parameter \(\tau _n := \tau _p := 1\) and \(n_0 := p_0 := 1\) throughout the proof. All arguments also apply in the case of arbitrary values for \(\tau _n\), \(\tau _p\), \(n_0\) and \(p_0\). The structure of system (1) can be further simplified via introducing new variables
One obtains
which results in
Analogously, we derive
For convenience, we also introduce the abbreviations
as well as \(\alpha , \beta > 0\) such that the following estimates hold true a.e. in \(\Omega\):
Next, we introduce the new variable
for reasons of symmetry. In fact, we can prove the positivity of \(n_{tr}'\) in the same way as for \(n_{tr}\), which then implies the desired bound \(0 \le n_{tr} \le 1\). A further ingredient for establishing the positivity of the variables u, v, \(n_{tr}\) and \(n_{tr}'\) is to project them onto \([0, \infty )\) and [0, 1], respectively, on the right hand side of the PDE-system. In this context, we use \(X^+\! := \max (X, 0)\) to denote the positive part of an arbitrary function X and \(X^{[0,1]} := \min (\max (X, 0), 1)\) for the projection of X to the interval [0, 1]. The modified system now reads
The no-flux boundary conditions of (1) transfer to similar conditions on u and v. In detail, we have
and, hence,
Therefore, the corresponding boundary conditions for u and v read
Furthermore, we assume that the corresponding initial states satisfy
In this situation, \(\Vert n_{tr,I}\Vert _{L^\infty (\Omega )} + \Vert n_{tr,I}'\Vert _{L^\infty (\Omega )} \ge 1\) and we set
We now aim to apply Banach’s fixed-point theorem to obtain a solution of (74)–(76).
Step 1: Definition of the fixed-point iteration. For any time \(T > 0\) (to be chosen sufficiently small in the course of the fixed-point argument), we introduce the space
and the closed subspace
The fixed-point mapping \({\mathcal {S}}: X_T \rightarrow X_T\) is now defined via
where \((u, v, n_{tr}, n_{tr}')\) is the solution of the following PDE-system subject to the boundary and initial conditions specified above:
We first show that \((u, v, n_{tr}, n_{tr}') = {\mathcal {S}}(\widetilde{u}, \widetilde{v}, \widetilde{n}_{tr}, \widetilde{n}_{tr}') \in X_T\) provided \((\widetilde{u}, \widetilde{v}, \widetilde{n}_{tr}, \widetilde{n}_{tr}') \in X_T\). Due to \(\widetilde{f}_1\), \(\widetilde{f}_2 \in L^2((0, T) \times \Omega )\), it is known from classical PDE-theory (see e.g. [3]) that
And since
for almost all \(t \in [0, T]\), we deduce
Hence, \(n_{tr} \in L^\infty ((0, T), L^2(\Omega ))\). Moreover, we derive for \([0, T] \ni t_k \rightarrow t \in [0, T]\),
This proves \(n_{tr} \in C([0, T], L^2(\Omega ))\). The same arguments can be applied to \(n_{tr}'\).
Step 2: Invariance of \(M_T\). Now, let \((\widetilde{u}, \widetilde{v}, \widetilde{n}_{tr}, \widetilde{n}_{tr}') \in M_T\). Similar to the strategy of e.g. [1, 15, 26], we perform the subsequent calculations for any \(q \in 2 {\mathbb {N}}_+\) and every \(t \in [0, T]\):
Note that the last term in the second and the first term in the third line are both non-positive due to \(q \in 2{\mathbb {N}}\) and assumption (4). Introducing \(\gamma := 2 \alpha I + \beta + 2 \beta I\), we obtain
This inequality already implies a linear bound on the \(L^\infty\)-norm of u as we shall see below (cf. [2]). We define
and note that \(U(0) = 0\). Estimate (78) entails
for all \(t \in [0, T]\), where \(\eta > 0\) is an arbitrary constant, which guarantees that the expression \(X:=\eta + \Vert u(0) \Vert _{L^q(\Omega )}^{q} + U(t)\) is strictly positive. Multiplying both sides with \(X^{(1-q)/q}\) and integrating from 0 to t gives
We now substitute \(\sigma := U(s)\) and deduce
where we have used (78) in the last step. Therefore,
and, taking the limit \(\eta \rightarrow 0\),
As the bound on the right hand side is independent of q, we even obtain
for all \(t \in [0, T]\). This result naturally gives rise to
An analogous estimate is valid for v. As a result, we obtain
for \(T > 0\) chosen sufficiently small.
Employing (79), we also derive
The same argument is applicable to v, which results in
for sufficiently small \(T>0\). The corresponding bounds on \(n_{tr}\) and \(n_{tr}'\) can be deduced from the formula
and from an analogous one for \(n_{tr}'\). In fact,
and, hence,
for \(T>0\) sufficiently small.
Step 3: Contraction property of \({\mathcal {S}}\). We consider \((\widetilde{u}_1, \widetilde{v}_1, \widetilde{n}_{tr,1}, \widetilde{n}_{tr,1}'), (\widetilde{u}_2, \widetilde{v}_2, \widetilde{n}_{tr,2}, \widetilde{n}_{tr,2}') \in M_T\) and the corresponding solutions \((u_1, v_1, n_{tr,1}, n_{tr,1}') = {\mathcal {S}}(\widetilde{u}_1, \widetilde{v}_1, \widetilde{n}_{tr,1}, \widetilde{n}_{tr,1}') \in M_T\) and \((u_2, v_2, n_{tr,2}, n_{tr,2}') = {\mathcal {S}}(\widetilde{u}_2, \widetilde{v}_2, \widetilde{n}_{tr,2}, \widetilde{n}_{tr,2}') \in M_T\). We further introduce the notation
and similarly v, \(n_{tr}\), \(n_{tr}'\), \(\widetilde{v}\), \(\widetilde{n}_{tr}\) and \(\widetilde{n}_{tr}'\). Then, we have to show that
with a constant \(c \in (0, 1)\) on a time interval [0, T] small enough. The norm in \(X_T\) is defined as
We obtain the following system by taking the difference of corresponding equations of the system for the 1- and the 2-variables, respectively:
We mention that u and v are subject to the boundary conditions
and the homogeneous initial conditions
First, one finds
where \(C_1 > 0\) is the constant resulting from the embedding \(W_2(0, T) \hookrightarrow C([0, T], L^2(\Omega ))\). The constant \(C_2 > 0\) originates from well-known parabolic regularity estimates for \(\Vert u \Vert _{W_2(0, T)}\) in terms of the \(L^2\)-norms of \(\widetilde{f}_1\) and \(u(0)=0\). Therefore,
Moreover, every \(f \in C([0, T], L^2(\Omega ))\) fulfils
and we proceed with the previous estimates to derive
In a similar way, we arrive at
Due to \(n_{tr}(0) = 0\), one obtains
for \(t \in [0, T]\) and, using similar techniques as above,
Note that because of \(\widetilde{f}_4 = -\widetilde{f}_3\), the last estimate equally serves as an upper bound for \(\Vert n_{tr}'(t) \Vert _{L^2(\Omega )}\). Taking the sum of the above estimates and choosing \(T > 0\) sufficiently small yields
with some \(c \in (0, 1)\).
Step 4: Solution of (1). Step 2 and Step 3 imply that for \(T > 0\) sufficiently small the mapping \({\mathcal {S}}: M_T \rightarrow M_T\) is a contraction. Banach’s fixed point theorem, thus, guarantees that there exists a unique \((u, v, n_{tr}, n_{tr}') \in M_T\) such that \({\mathcal {S}}(u, v, n_{tr}, n_{tr}') = (u, v, n_{tr}, n_{tr}')\). Moreover, due to standard parabolic regularity for (u, v), the fixed-point \((u, v, n_{tr}, n_{tr}')\) is the unique weak solution of
In order to prove the non-negativity of u, v, \(n_{tr}\) and \(n_{tr}'\), we adapt an argument from [26]. First, we define
on \([0, T] \times \Omega\) and notice that \(h \le 0\) and \(h(t=0) = 0\) a.e. since \(u(0) \ge 0\) a.e. We now multiply the first equation in (81) with h and integrate over \((0, t) \times \Omega\) for \(t \in [0, T]\). This yields
The first term on the right hand side of (82) can be seen to be non-positive using integration by parts and the boundary condition from (75):
due to \(u h \ge 0\), \({\hat{n}} \cdot \nabla V_n \ge 0\), and since \(\nabla h \ne 0\) holds true only in the case \(u < 0\), where we have \(\nabla u = \nabla h\) in \(L^2\), see e.g. [14]. Moreover,
and the third term in (82) is again non-positive as an integral over non-positive quantities:
as a consequence of \(u^+ h = 0\) in \(L^2(\Omega )\). The left hand side of (82) can be reformulated as
For the first step, we have used that the integrand \(\partial _s u \, h\) only contributes to the integral if \(h < 0\). But in this case, \(u = h\) and, hence, \(\partial _s u = \partial _s h\) in \(L^2\), see e.g. [14]. This proves \(\Vert h(t) \Vert _{L^2(\Omega )} \le 0\) for all \(t \in [0, T]\), which establishes \(h(t) = 0\) in \(L^2(\Omega )\) for all \(t \in [0, T]\), and thus \(u(t, x) \ge 0\) for all \(t \in [0, T]\) and a.e. \(x \in \Omega\). In the same way, one can show that \(v(t, x) \ge 0\) for all \(t \in [0, T]\) and a.e. \(x \in \Omega\).
The non-negativity of \(n_{tr}\) follows from a similar idea using
Again, \(h \le 0\) and \(h(t=0) = 0\) due to \(n_{tr}(0) \ge 0\). Multiplying the third equation of (81) with h and integrating over \((0, t) \times \Omega\), \(t \in [0, T]\), we find
As before, all terms under the integral on the right hand side involving \(n_{tr}^{[0, 1]}\) vanish. Consequently,
for all \(t \in [0, T]\). The same result holds true for \(n_{tr}'\). Therefore, we have verified that \(n_{tr}(t, x)\), \(n_{tr}'(t, x) \ge 0\) for all \(t \in [0, T]\) and a.e. \(x \in \Omega\).
The non-negativity of \(n_{tr}\) and \(n_{tr}'\) together with \(n_{tr}' = 1 - n_{tr}\) from (73) now even imply
This allows us to identify the unique weak solution \((u, v, n_{tr}, n_{tr}')\) of (81) to equally solve
which is the transform version of the original problem (1).
Up to now, we have proven that there exists a unique solution \((u, v, n_{tr}) \in C([0, T], L^2(\Omega ))^3\) such that \((u, v, n_{tr}, 1 - n_{tr}) \in M_T\) on a sufficiently small time interval [0, T].
Step 5: Global solution. We now fix \(T^*>0\) in such a way that \([0, T^*)\) is the maximal time interval of existence for the solution \((u, v, n_{tr}) \in C([0, T], L^2(\Omega ))^3\) of (83). Moreover, we choose some arbitrary \(q \in {\mathbb {N}}_{\ge 2}\) and multiply the first equation in (83) with \(u^{q-1}\). Integrating over \(\Omega\) at time \(t \in [0, T^*)\) gives
Integration by parts and the estimates \(|A_n| \le \alpha\), \(\big |e^\frac{V_n}{2} n_{tr} - e^{V_n} u (1-n_{tr})\big | \le \beta (1 + u)\) further yield
Moreover, we derive
where we used \(|\Omega | = 1\). Hence,
after defining \(\gamma := \alpha + 2 \beta\). This results in
which can be integrated over time from 0 to t:
From this generalised Gronwall-type inequality, we deduce (cf. [2])
and
since \(1+\Vert u(0) \Vert _{L^q(\Omega )} \le 1+\Vert u(0) \Vert _{L^\infty (\Omega )} \le I\). As \(I e^{\gamma t}\) is independent of q, we even arrive at
In the same way, we can show that \(\Vert v(t) \Vert _{L^\infty (\Omega )} \le I e^{\gamma t}\) for all \(t \in [0, T^*)\). As a consequence, we obtain that the solution \((u, v, n_{tr}) \in C([0, T], L^2(\Omega ))^3\) can be extended for all times, i.e. \(T^*= \infty\).
Step 6: \(L^\infty\)-bounds for n and p. We now prove the linearly growing \(L^{\infty }\)-bounds (11) for n and p. We only detail the bound for p and sketch how the bound for n follows in a similar fashion. After recalling (with \(\tau _p=1\) w.l.o.g.)
we introduce the variable \(w=p \,e^{V_p}\) and observe that \(\nabla \cdot J_p = \nabla \cdot \left( e^{-V_p} \nabla w \right) =e^{-V_p}\left( \Delta w - \nabla V_p \cdot \nabla w\right)\) and thus,
while the no-flux boundary condition \({\hat{n}} \cdot J_p = 0\) on \(\partial \Omega\) transforms to the homogeneous Neumann condition \({\hat{n}} \cdot \nabla w = 0\) on \(\partial \Omega\).
Next, by testing (85) with the positive part \((w-r-st)_{+}:=\max \{0, w-r-st\}\) for two constant \(r,s>0\) to be chosen, we calculate by integration by parts in the first two terms
since \({\hat{n}} \cdot V_p\ge 0\) by assumption (4). Moreover, since \(n_{tr}\in [0,1]\) and \(w\ge 0\), we have
Thus, by choosing \(s {:}{=}\Vert e^{V_p}\Vert _{L^\infty (\Omega )}\) and \(r {:}{=}\Vert w(\tau ,\cdot )\Vert _{L^\infty (\Omega )}\) for some time \(\tau \ge 0\), we conclude that
and a Gronwall lemma implies
Transforming back, this yields
In order to deduce the analogue bound for n in (11), we consider (with \(\tau _n=1\) w.l.o.g.)
We introduce the variable \(\omega =n \,e^{V_n}\) and obtain in the same way as in (54)
Following the same arguments as above,
Transforming back, this yields
and thus (11).
Step 7: Regularity and bounds for \(n_{tr}\). We still have to verify \(n_{tr} \in C([0, T], L^\infty (\Omega ))\) for all \(T > 0\). Now, let \(T>0\) and recall that
in \(L^2(\Omega )\) for all \(t \in [0,T]\). Considering a sequence \((t_k)_{k \in {\mathbb {N}}} \subset [0, T]\), \(t_k \rightarrow t\), we thus arrive at
for \(k \rightarrow \infty\). This proves the assertion.
The claim \(\partial _t n_{tr} \in C([0, T], L^2(\Omega ))\) for all \(T > 0\) is an immediate consequence of the last equation in (83) together with the \(L^2\)-continuity and \(L^\infty\)-bounds of u, v and \(n_{tr}\).
Next, concerning the bounds (12), we recall system (1) and observe that for all \(\varepsilon \in (0,\varepsilon _0]\)
in the sense of \(L^2(\Omega )\), where \(h(n_{tr}=0)\ge \frac{1}{\tau _p}>0\) and \(h(n_{tr}=1)\le -\frac{1}{\tau _n} < 0\) uniformly for all non-negative n and p. Therefore, wherever \(n_{tr,I}(x)=0\) (or analogously \(n_{tr,I}(x)=1\)), an elementary argument proves that \(n_{tr}(t,x)\) grows (or decreases) linearly in time and decays back to 0 (or 1) at most like \((a+bt)^{-1}\). More precisely, we reuse the transformed variable \(w = p \, e^{V_p}\) and find
for some constants r and s due to the estimate (86). Setting \(\tau _n {:}{=}\tau _p {:}{=}1\) w.l.o.g., we have
with appropriate \(\widetilde{r}, \widetilde{s} > 0\) independent of \(\varepsilon\). By observing that the ODE \(\varepsilon _0{\dot{y}} = 1 - (\widetilde{r} + \widetilde{s} t) y\) features the positive nullcline \(y_0(t) = 1/(\widetilde{r} + \widetilde{s} t)\), which moreover attracts all solution trajectories, standard comparison arguments (pointwise in \(x\in \Omega\)) imply that for all times \(\tau > 0\), there exist positive constants \(\eta = \eta (\varepsilon _0, \tau , \tau _n, \tau _p)\), \(\theta = \theta (C_n,C_p,K_n,K_p)\) and a sufficiently small constant \(\gamma (\tau ,C_n,C_p,K_n,K_p)>0\) such that
where \(\eta \tau = \frac{\gamma }{1+\theta \tau }\) such that the linear and the inverse linear bound intersect at time \(\tau\).
Finally, the upper bounds (12) follow from analogue arguments.
Step 8: Lower bounds for n and p. Finally, we prove the bounds (13). We will only detail the argument for the lower bound on n, as the bound for p follows in an analogue way. Recalling the transformed equation for \(\omega =e^{V_n}n\) (satisfying \({\hat{n}} \cdot \nabla \omega = 0\) on \(\partial \Omega\)), we estimate
where \(\alpha > 0\) and \(c>0\) are positive constants due to the assumptions (4) and \(e^{V_n} \omega n_{tr}\ge 0\).
Next, we use (12), i.e. that for all \(\tau >0\) fixed, there exist constants \(\eta\), \(\theta\) and \(\gamma\) such that \(n_{tr}(t,x) \ge \eta t\) for all \(0\le t \le \tau\) and a.e. \(x\in \Omega\), while \(n_{tr}(t,x) \ge \gamma /(1 + \theta t)\) for all \(t \ge \tau\) and a.e. \(x\in \Omega\). Then, by introducing the negative part \((\omega )_{-}:= \min \{\omega ,0\}\) and testing (90) with \(\bigl (\omega -\frac{\mu t^2}{2}\bigr )_{-}\) for a constant \(\mu >0\) to be chosen below, we estimate
Thus, for \(0\le t \le \tau\) when \(n_{tr}(t,x) \ge \eta t\), we have
If we choose \(\mu \bigl (\alpha \frac{\tau }{2}+1\bigr )\le c \eta\), we obtain
Hence, since \(\int _{\Omega } \bigl (\omega (0,x)\bigr )_{-}^2dx=0\), we deduce from a Gronwall lemma
which yields in particular \(\omega (t,x)\ge \frac{\mu t^2}{2}\) for all \(0 \le t \le \tau\) and a.e. \(x\in \Omega\).
Moreover, for \(t\ge \tau\) when \(n_{tr}(t,x) \ge \frac{\gamma }{1+\theta t}\), we test (90) with \(\bigl (\omega -\frac{\Gamma }{1+\theta t}\bigr )_{-}\) for a constant \(\Gamma >0\) to be chosen below, and estimate similar to above
And as \(n_{tr}(t,x) \ge \frac{\gamma }{1+\theta t}\) for \(t \ge \tau\), we find
Choosing \(\alpha \Gamma \le c \gamma\), we arrive at
By further reducing either \(\Gamma\) or \(\mu\), we are able to satisfy \(\frac{\Gamma }{1+\theta \tau } = \frac{\mu \tau ^2}{2}\). On the one hand, this implies that \(\int _{\Omega } \Bigl (\omega (\tau ,x) - \frac{\Gamma }{1+\theta \tau }\Bigr )_{-}^2 \,dx=0\), which results—by using a Gronwall argument—in
and, hence, \(\omega (t,x)\ge \frac{\Gamma }{1+\theta t}\) for all \(t \ge \tau\) and a.e. \(x\in \Omega\). On the other hand, the increasing and decreasing bounds now again intersect at time \(\tau\) as desired. \(\square\)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Fellner, K., Kniely, M. Uniform convergence to equilibrium for a family of drift–diffusion models with trap-assisted recombination and the limiting Shockley–Read–Hall model. J Elliptic Parabol Equ 6, 529–598 (2020). https://doi.org/10.1007/s41808-020-00068-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41808-020-00068-8
Keywords
- Drift–diffusion–recombination models
- Semiconductors
- Shockley–Read–Hall
- Trapped states
- Entropy method
- Convergence to equilibrium
- Exponential rate of convergence
- Quasi-steady-state approximation
- Fast-reaction limit