Abstract
The modeling and simulation of stochastic reaction–diffusion processes is a topic of steady interest that is approached with a wide range of methods. At the level of particle-resolved descriptions, where chemical reactions are coupled to the spatial diffusion of individual particles, there exist comprehensive numerical simulation schemes, while the corresponding mathematical formalization is relatively underdeveloped. The aim of this paper is to provide a framework to systematically formulate the probabilistic evolution equation, termed chemical diffusion master equation (CDME), that governs particle-based stochastic reaction–diffusion processes. To account for the non-conserved and unbounded particle number of this type of open systems, we employ a classical analogue of the quantum mechanical Fock space that contains the symmetrized probability densities of the many-particle configurations in space. Following field-theoretical ideas of second quantization, we introduce creation and annihilation operators that act on single-particle densities and provide natural representations of symmetrized probability densities as well as of reaction and diffusion operators. These operators allow us to consistently and systematically formulate the CDME for arbitrary reaction schemes. The resulting form of the CDME further serves as the foundation to derive more coarse-grained descriptions of reaction–diffusion dynamics. In this regard, we show that a discretization of the evolution equation by projection onto a Fock subspace generated by a finite set of single-particle densities leads to a generalized form of the well-known reaction–diffusion master equation, which supports non-local reactions between grid cells and which converges properly in the continuum limit.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
A great variety of chemical and biochemical phenomena on all scales hinge on the combination of diffusion and chemical reactions; examples range from classical front propagation [1,2,3], self-organization of excitable media [4,5,6,7] and pattern-forming coatings of animals [8, 9], over the formation of morphogen gradients [10,11,12,13] and MinE protein oscillations [14,15,16] in developmental biology, to spreading of diseases [17,18,19] and innovations [20]. Conventionally, such processes have been modeled by means of deterministic reaction–diffusion equations, which govern the temporal evolution of concentration fields [9, 21]. Such continuum descriptions, however, often break down for small copy numbers of molecules, as is typically the case for biochemical processes at cellular and subcellular scales [22,23,24]. The discreteness of copy numbers gives rise to intrinsic noise, which has been included heuristically in reaction–diffusion models in the form of spatiotemporal Gaussian white noise, e.g., in calcium signaling [25,26,27,28], but also for species extinction [29, 30].
A more systematic, microscopic approach suggests to switch from concentration fields to spatially resolved probability distributions of copy numbers, which leads to the reaction–diffusion master equation (RDME) [31,32,33]: position space is partitioned into a regular mesh of volume elements, the diffusion of molecules is replaced by a jump process on the mesh, and reactions occur only within each volume element under assumed well-mixed conditions. Such conditions are justified in the spatiotemporal chemical master equation (ST-CME) [34, 35], which has been put forward in the context of cellular environments to account for intracellular structures and compartmentalization. It uses an irregular partitioning of position space into comparably few metastable subsets, meaning that the subsets are separated by barriers such that jumps between them occur rarely. Another obstacle toward effective continuum models for biochemical processes is macromolecular crowding, i.e., the dense and heterogeneous packing of cellular spaces by macromolecules that do not participate in the reaction, which has consequences for product formation rates as well as diffusion-influenced reaction kinetics [36,37,38,39,40,41]. Some of the aspects of crowding have been included in recent extensions of numerical reaction–diffusion schemes [42, 43], showing the potential for qualitative changes of the observed phenomenology.
An alternative to the above probabilistic descriptions is stochastic simulations of particle-based reaction–diffusion (PBRD) models, which offer a high resolution down to the scale of molecules combined with great modeling flexibility. The idea is that molecules are represented by point particles undergoing Brownian motion and that bimolecular reactions between close-by particles occur with a given rate depending on the separation distance of the pair; the most common schemes use either a reaction volume (Doi model [44, 45]) or a reaction surface (Smoluchowski model [46]) in terms of this distance. PBRD schemes are constructed in a bottom-up way and based on the extensive theory of diffusion-influenced reactions [47,48,49,50,51]. A number of algorithms for the PBRD scheme exist [52,53,54,55,56,57], differing in their implementation of Brownian motion, whether physical interactions between molecules are supported [58, 59], and in their degree of molecular resolution [60,61,62]. Applications reach from enzyme kinetics under crowding conditions [63,64,65] to nanomaterial-based catalysis [66]. A recent comparison of the two probabilistic models given by RDME and PBRD with a focus on intracellular kinetics can be found in the review by Smith and Grima [24].
A mathematical formalization of PBRD models as an open many-particle system undergoing reactions and diffusion was developed only partially [44, 45, 67,68,69,70]. Yet, such a framework would be a highly desirable starting point to systematically derive numerical schemes for reaction–diffusion processes and to analytically connect PBRD models with coarse-grained descriptions such as the RDME and the ST-CME. For example, these compartmentalized descriptions are obtained by locally integrating out the spatial degrees of freedom, turning the diffusion process into a continuous-time random walk on a mesh of subdomains. For bimolecular reactions, however, the procedure gives rise to effective reaction rates [58, 71, 72] and markedly non-Markovian reaction time distributions [73,74,75]. For the standard RDME, where second-order reactions may occur only between particles of the same subdomain, it was shown that for decreasing mesh size, the dynamics converge to a limit where second-order reactions cease to occur [76]. This problem was addressed in terms of a convergent RDME [77, 78], which allows bimolecular reactions to take place also between particles located in different subdomains.
The aim of this work is to develop a framework for the systematic formulation of the probabilistic evolution equation for particle-based stochastic reaction–diffusion processes, which we refer to as chemical diffusion master equation (CDME). The CDME is a family of Fokker–Planck equations, each of them describing the diffusion processes for a given n-particle probability density. The equations within the family are coupled by the reaction dynamics similar in form to a chemical master equation (CME) [79, 80]. The underlying reaction schemes are not restricted to mass-conserving reactions and may include effective reactions such as insertion and removal of particles, e.g., by considering only a subset of the chemical species. Thus, the CDME is capable of describing the dynamics of an open system characterized by the overall number of particles changing in time, similarly to the standard CME [81]. To systematically formulate the CDME given a general set of chemical reactions, our framework uses creation and annihilation operators. We exemplify the framework for a birth–death process as well as the second-order reaction of mutual annihilation. Our work further provides the basis for putting reaction–diffusion models at different resolutions on the same footing, which clears the way to derive relationships between the various coarse-grained descriptions and numerical schemes for reaction–diffusion dynamics. The latter is fundamental to develop consistent multi-scale simulations. As a first step, we discretize the particle-based dynamics by doing a Galerkin projection [82] of the CDME onto a partition of the position space into subvolumes. This yields a generalized RDME, where bimolecular reactions can occur naturally between particles in different, yet close-by subvolumes. In particular, we find explicit relations between the reaction rate constants of the RDME and of the underlying particle-based model as given by the CDME.
A number of technical challenges need to be solved to achieve these goals. In general, reaction–diffusion systems are open systems in the sense that the number of particles is not fixed, but changes in the course of time due to the reactions. Whereas such situations are well-known in quantum field theory and solid-state physics, where they are addressed in terms of creation and annihilation operators, the analogous formulations for classical systems of indistinguishable particles are comparably underdeveloped; this is particularly true for the stochastic dynamics of particles undergoing Brownian motion in an open system. In a probabilistic description of reaction–diffusion dynamics, the number of particles of each species is no longer determined but obeys a statistical distribution, which evolves in time along with the particle positions. Further, the many-particle distributions do not distinguish individual particles of the same species, and the distributions are thus symmetric with respect to permutations of particle labels. The symmetry must be preserved under time evolution, which requires non-trivial combinatorial factors in the reaction operators. These aspects are addressed by borrowing the concept of the Fock space [83] from quantum mechanics and translating it to the classical setting [44, 67, 84]. A major mathematical obstacle in this endeavor is that probability densities are integrable, but not necessarily square-integrable functions, as opposed to quantum wavefunctions, and that the underlying function space is not a Hilbert space [70]. Some of these issues have been addressed in the literature of probability theory for population dynamics [68,69,70, 85, 86]; however these works do not explore the applications in the context of chemical reactions.
Outline. We start in Sect. 2 by introducing a probabilistic model for particle-based reaction–diffusion dynamics and motivating the structure of the chemical diffusion master equation that we look for. In Sect. 3, the probability functions are interpreted as elements of a Fock space; the creation and annihilation operators are introduced and central algebraic relations are derived. The main results of this work are presented in Sect. 4, where the creation and annihilation operators are used to express the symmetrized densities and the diffusion and reaction operators. This allows us to systematically formulate the CDME on a general level, as well as for two exemplary reaction systems. Finally, Galerkin projection methods are applied in Sect. 5 to derive the generalized RDME as a projection of the underlying CDME.
2 Probabilistic model for particle-based reaction–diffusion dynamics
The object of interest in this work is a collection of molecules that diffuse in space and can undergo chemical reactions. These reactions will cause changes in the number of molecules of a given species. Focusing on a single species as a first step, elementary reactions include the creation and annihilation of molecules, which is then straightforward to extend to, e.g., binary reactions among molecules of different species. The goal is to find a probabilistic description that includes both the spatial movement of molecules and the changes in the population size. In this section, we introduce the structure of the CDME and formulate the equations for an exemplary birth–death process. A systematic approach to formulate the CDME for general reaction schemes will be given in Sect. 4.
2.1 Symmetric probability density functions
Specifically, we consider point particles of only one chemical species in a bounded domain \({\mathbb {X}}\subset {\mathbb {R}}^3\). Due to the reactions, the number of particles \(N(t) \in {\mathbb {N}}_0\) at time t can vary in the course of time. Assuming that there are \(n=N(t)\) particles at time t, the configuration of all particle positions is denoted by the vector \(x^{(n)}=\left( x^{(n)}_1,\dots ,x^{(n)}_n\right) \in {\mathbb {X}}^n\). The statistical law of the configurations is encoded in the conditional probability density function \(p_n(x^{(n)},t|N(t)=n)\) at time t given that \(N(t)=n\). Following the situation of a fixed particle number, we adopt the convention that \(p_n\) is normalized:
for all \(t \geqslant 0\) and each fixed n. The unconditional probability density for \(n\geqslant 1 \) is then given by
where \({\mathbb {P}}\left[ N(t)=n\right] \) is the probability of having n particles in the system at time t; for the empty system, we set \(\rho _0(t) := {\mathbb {P}}\left[ N(t) =0\right] \). It follows that
In particular, the admissible probability densities \(\rho _n\) belong to the space of absolutely integrable functions, \(L^1({\mathbb {X}}^n) = \{ \eta : {\mathbb {X}}^n \rightarrow {\mathbb {R}} \,; \Vert \eta \Vert _n < \infty \}\) with the standard norm
For \(n=0\), there is no dependence on the spatial position, so we have \(\rho _0(t) \in {\mathbb {R}}\) and set \(\Vert \rho _0(t)\Vert _0 := |\rho _0(t)|\).
As the n particles are of the same species and thus chemically indistinguishable, we consider two-particle configurations as equivalent if they differ only by a permutation of particle indices. Restricting to this reduced configuration space, the relevant probability densities are symmetric under interchanging any pair of their arguments. The same symmetry is known for bosonic many-body wave functions in quantum mechanics, albeit of totally different physical origin. The statistical indistinguishability of classical particles was introduced already by Gibbs to resolve the mixing paradox in statistical mechanics [87].
The open system with a variable number of particles is then characterized by the family of symmetrized probability densities,
Note that we will sometimes skip the time argument t in the notation. Recalling that \(\Vert \rho _n(\cdot , t)\Vert _n = {\mathbb {P}}\left[ N(t)=n\right] \), the total probability theorem implies that
which expresses the conservation of probability also within chemical reaction. The underlying space of such a family of distributions has a specific algebraic structure and is called the Fock space; it will be introduced in detail in Sect. 3. A similar probabilistic description, albeit differing in detail, was employed recently in a model of open systems that are coupled explicitly to a spatially separated reservoir [88, 89]. In particular, the many-particle densities therein obey a set of coupled Liouville-type evolution equations that preserve total probability and the particle exchange symmetry.
We emphasize that the family of densities [Eq. (5)] must not be confused with the hierarchy of correlation functions \(f_1, f_2, \dots \) that is used in statistical mechanics and is governed by the BBGKY equations [90], where \(f_n\) is also referred to as the reduced n-particle (phase space) density. Specifically, \(f_1\) is the probability density of a tagged particle’s position and \(f_2\) refers to the correlation between a pair of particles in the presence of all other particles. These functions are obtained by marginalization of the distribution of the overall system that consists of a large, but fixed number of particles. In particular, \(f_1\) can be obtained from \(f_2\) by integration, which is conceptually different from the present approach, where \(\rho _1\) and \(\rho _2\) do not share such a relation.
2.2 Chemical diffusion master equation
The main goal of this work is to develop a framework to formulate the evolution equation for the family of symmetrized n-particle densities in the presence of diffusion and reactions.
In the absence of reactions, the particle number is fixed and the system of interest may be associated with a closed system of n particles that diffuse in space, possibly subject to physical interactions or an imposed flow. In this case, the conditional density \(p_n(x^{(n)},t|N(t)=n)\) of n-particle configurations \(x^{(n)} \in {\mathbb {X}}^n\) obeys the Fokker–Planck equation
where \({\mathcal {D}}_n\) is the corresponding Fokker–Planck operator. In the most general situation, it is a semi-elliptic linear operator and takes the form [91]
where \(A_i=A_i(x^{(n)},t)\) describes the deterministic drift, the \(D_{ij} = D_{ij}(x^{(n)})\) are \(3\times 3\) diffusion matrices composing the \(3n \times 3n\) diffusion tensor, and \(\nabla _i\) denotes differentiation with respect to the position \(x_i^{(n)}\) of particle i. If the drift originates from an interaction potential \(U(x^{(n)})\), then
In the absence of the drift term and assuming that diffusion is isotropic, the diffusion operator reduces to that of standard Brownian motion, \({\mathcal {D}}_n \rho _n = D\Delta \rho _n\), with a scalar diffusion constant \(D > 0\), where \(\Delta \) is the Laplace operator.
In addition to diffusive motion, the particles of the considered species \(\text {A}\) undergo reactions of the form
For each reaction, there is a reaction rate function
where the value \(\lambda (y^{(l)},x^{(k)})\geqslant 0\) corresponds to the rate at which the reaction occurs given that the k reactants are at positions \(x^{(k)}\in {\mathbb {X}}^k\) and the l products are placed at positions \(y^{(l)}\in {\mathbb {X}}^l\). The reaction rate function \(\lambda \) is assumed to be symmetric under particle exchange within the configuration \(y^{(l)}\) and also within \(x^{(k)}\).
In the presence of both reactions and diffusion, the total number of particles and their positions can change simultaneously over time. Given that there are S different chemical reactions, the family of n-particle densities, \(\rho = (\rho _0, \rho _1, \dots )\), should obey a linear evolution equation of the form
which we will refer to as the chemical diffusion master equation (CDME). Here, \({\mathcal {D}}\) is the diffusion operator as above, and the reaction operator \({\mathcal {R}}_r\) encodes the r-th reaction, which is parametrized by a reaction rate function \(\lambda _r\). A central assumption is that diffusion and reactions occur independently, which allows us to split the operator on the right-hand side into the sum \({\mathcal {D}} + {\mathcal {R}}\), where \({\mathcal {R}}= \sum _{r=1}^S {\mathcal {R}}_r\) combines all reaction operators into one to simplify the notation. The reaction operators couple components \(\rho _n\) of \(\rho \) for different n, similarly to the CME. More precisely, treating \(\rho \) as an infinitely long column vector, Eq. (12) reads in matrix notation:
Each entry \({\mathcal {Q}}_{nm}\) is an operator that condenses the effect of all the reactions that act on a given \(\rho _{n'}\) yielding the temporal change of \(\rho _n\), similarly to the CME. To ensure conservation of probability [Eq. (6)] we impose reflective boundary conditions at the boundaries of the domains \({\mathbb {X}}^n\) separately for every n, i.e., the diffusive flux across these boundaries is zero. Under the assumption of well-mixed conditions, the spatial degrees of freedom can be integrated out, and Eq. (13) should yield the conventional CME [80]. On the other hand, if there are no reactions, the equations will uncouple and yield a set of independent Fokker–Planck equations for the symmetrized densities \(\rho _n\), see Eq. (7).
The stochastic system trajectories corresponding to Eq. (13) combine continuous diffusion in the configuration spaces \(({\mathbb {X}}^0, {\mathbb {X}}^1, \dots , {\mathbb {X}}^n, \dots )\) for fixed n and a discrete jump process between \({\mathbb {X}}^m\) and \({\mathbb {X}}^n\) for \(m\ne n\), modeling creation and annihilation of particles due to reactions (Fig. 1). The model thus fits into the category of stochastic hybrid systems, where continuous dynamics and discrete events coexist in the same process. However, a particular challenge for the mathematical formalization is the change of dimensionality that is induced by the discrete jumps.
Example (birth–death process). Consider a chemical species \(\text {A}\) that undergoes degradation and creation reactions: Otherwise, it looks as if Fig. 1 contains the reactions that we refer to
Here, \(\lambda _d(x)\) denotes the rate for reaction \(\mathrm {(I)}\) to occur for a particle located at position \(x\in {\mathbb {X}}\) (i.e., the probability per unit of time for this particle to disappear), while \(\lambda _c(x)\) is the rate for a new particle to be created at position x by reaction \(\mathrm {(II)}\). Explicitly, we assume that the rates depend only on the position in space, but not on the configuration of all particle positions. Both \(\lambda _d\) and \(\lambda _c\) are special cases of the generic reaction rate function \(\lambda \) in (11).
Arguments analogous to the formulation of the CME lead us to propose the following equation for the time evolution of \(\rho _n(x^{(n)},t)\) for \(n\geqslant 1\), taking into account the symmetry of the densities:
where the first line refers to spatial diffusion, the second and third lines are gain and loss terms due to reaction \(\mathrm {(I)}\), and the last two lines relate to reaction \(\mathrm {(II)}\). For \(n=0\), we have
The boundary-value problem induced by Eqs. (15) and (16) is well-posed for an elliptic diffusion operator \({\mathcal {D}}\) and regular reaction rate functions \(\lambda _\mathrm {d}\) and \(\lambda _\mathrm {c}\). Equation (15) corresponds to the n-th row of Eq. (13), and the components of the reaction operator matrix are:
and for \(n=0\), they are
where \(\rho _0\in {\mathbb {R}}\) is a constant.
We verify that the CME related the reactions (I) and (II) is recovered from Eq. (15) by integrating out the spatial degrees of freedom [Eq. (3)]. To this end, we define the marginal distribution
and assume the reaction rate functions to be constants, \(\lambda _d(x)=\gamma _\mathrm {d}\) and \(\lambda _c(x)=\gamma _\mathrm {c}\) for all \(x\in {\mathbb {X}}\). Then, integrating Eq. (15) over the space \({\mathbb {X}}^n\) yields:
where \(|{\mathbb {X}}|<\infty \) is the volume of the domain \({\mathbb {X}}\). Note that the diffusion term vanishes due to the no-flux boundary condition by Gauss’ theorem. Equation (20) is exactly the CME as derived from the classical law of mass action for reactions of order zero and one in spatially well-mixed systems. One checks easily that summation of the right-hand side over \(n \geqslant 0\) yields zero, as required for a continuous-time Markov chain.
In the next two sections, we will develop a systematic way to construct the CDME such as Eq. (15) corresponding to a given set of chemical reactions [e.g., Eq. (14)]. In particular, we will specify the reaction operators [Eqs. (17a)–(17c)] and find their correct combinatorial prefactors, which is often not a straightforward task. We do so by means of the Fock space formalism, which yields one possible explicit representation of the probabilistic evolution described by the CDME.
3 Fock space formalism
The mathematical formalization of the dynamics of open systems requires means for the insertion and deletion of particles. In reaction–diffusion problems, this can occur everywhere in the domain \({\mathbb {X}}\). Further, in the probabilistic description, a particle is described by a probability density rather than by a single point. A similar problem was solved in quantum field theory, where the n-particle densities are represented by symmetric or anti-symmetric wavefunctions and (quasi-)particles can be created in or annihilated from such states. The underlying algebraic construction is called Fock space and relies on the fact that the space of n-particle densities is generated from products of single-particle densities. In the quantum case, the wavefunctions are square-integrable and form a Hilbert space whereas the probability densities of classical particles are absolutely integrable [Eq. (3)]. As an important consequence, the corresponding space \(L^1({\mathbb {X}}^n)\) is merely a Banach space, i.e., it does not possess an inner product. This technical deficiency can be circumvented by resorting to the dual space. In the following, we will introduce the Fock space for the family of symmetrized probability densities. Suitably defined creation and annihilation operators will then serve as efficient instruments to represent the reaction operators in the CDME as well as its solutions. This program will be carried out in Sect. 4.
3.1 Symmetrized n-particle spaces
Single-particle space. We start with the one-particle space \(H:=L^1({\mathbb {X}})\), which contains the probability density functions of a single particle’s position on the space of motion \({\mathbb {X}}\), and re-collect some facts from functional analysis [92]. The dual space of \(L^1({\mathbb {X}})\) is isometrically isomorph to \(L^\infty ({\mathbb {X}})\), so that we can identify the dual space \(H^*\) with \(L^\infty ({\mathbb {X}})\), i.e., the bounded functions on \({\mathbb {X}}\). In the application to reaction–diffusion dynamics below, we will see that, for example, reaction rate functions are elements of \(H^*\). A dual pairing \(\langle \cdot ,\cdot \rangle : H^* \times H\rightarrow {\mathbb {R}}\) is defined by
which acts as a substitute for the missing inner product on H and yields the “overlap” between the functions \(\zeta \in H^*\) and \(\eta \in H\). The next step is to choose a (Schauder) basis \((u_1, u_2, \dots )\) of the space H, which can even be taken to be nonnegative [93] and normalized, \(u_\alpha \geqslant 0\) and \(\Vert u_\alpha \Vert _1 = 1\) for all \(\alpha \in {\mathbb {N}}\). The basis induces the dual set of linear functionals \((u_1^*, u_2^*, \dots )\), here identified with functions \(u^*_\alpha \in L^\infty ({\mathbb {X}})\), such that \( \langle u^*_\alpha , u_\beta \rangle = \delta _{\alpha ,\beta } \) for all \(\alpha , \beta \in {\mathbb {N}}\), using the Kronecker symbol \(\delta _{\alpha ,\beta }\). Thus, a density \(\eta \in H\) of a single particle has the representation
in terms of the one-particle basis \((u_\alpha )_{\alpha \in {\mathbb {N}}}\). We note that the family \((u_{\alpha }^*)_{\alpha \in {\mathbb {N}}}\) is countable and thus cannot span the dual space \(H^* \cong L^\infty (X)\), which is not separable. However, this deficiency of \((u_\alpha ^*)\) is not of practical relevance for the following treatment.
Tensor spaces. A natural extension to the space \(L^1({\mathbb {X}}^n)\) of n-particle densities uses the observation that the pure tensor \(u_{\alpha _1} \otimes \dots \otimes u_{\alpha _n}\), given as products of n one-particle densities,
span the space \(L^1({\mathbb {X}}^n)\). Thus, the tensor space
coincides with \(L^1({\mathbb {X}}^n)\) and we refer to \((u_{\alpha _1} \otimes \dots \otimes u_{\alpha _n})_{\alpha _i \in {\mathbb {N}}}\) as a tensor basis. As convention for \(n=0\), we set \(H^{\otimes 0} := {\mathbb {R}}\). Similarly to the one-particle case, we define a dual pairing for \(\eta \in H^{\otimes n}\), \(\zeta \in (H^{\otimes n})^*\) as
Then, the dual set of the tensor basis consists of the dual pure tensor
which satisfy
for all multi-indices \((\alpha _1, \dots , \alpha _n), (\alpha _1', \dots , \alpha _n') \in {\mathbb {N}}^n\). Finally, the basis representation of an element \(\eta \in H^{\otimes n}\) reads
with coefficients \(c_{\alpha _1,\dots ,\alpha _n} := \langle u_{\alpha _1}^* \otimes \dots \otimes u_{\alpha _n}^*,\eta \rangle \) and each of the summation indices \(\alpha _1 \dots \alpha _n\) running from 1 to \(\infty \).
Symmetrization. The probability density of a system of n identical particles is symmetric in all particle positions (see Sect. 2.1). Thus, the spaces \(H^{\otimes n}\) are too large for our purposes and we need to project onto the symmetrized functions. To this end, we introduce the symmetrization operator \({\mathcal {S}}_n: H^{\otimes n} \rightarrow H^{\otimes n}\),
where \(\Sigma _n\) is the set of permutations on \(\{1,\dots ,n\}\). We note that for \(n=0\) or \(n=1\), the symmetrization is the identity: \({\mathcal {S}}_0\eta = \eta \) for \(\eta \in H^{\otimes 0}\) or \(\eta \in H^{\otimes 1}\). Dividing by the number n! of possible permutations ensures that normalization is preserved, i.e., \(\Vert {\mathcal {S}}_n \eta \Vert _n = \Vert \eta \Vert _n\). The operator \({\mathcal {S}}_n\) is indeed a linear projection, in particular \({\mathcal {S}}_{n}^2 = {\mathcal {S}}_{n}\), and it is also “orthogonal” with respect to the dual pairing (proof in Appendix A.1):
for any \(\eta \in H^{\otimes n}\) and \(\zeta \in (H^{\otimes n})^*\). Throughout this work, we will refer to the symmetrized tensor space \({\mathcal {S}}_nH^{\otimes n}= \{ {\mathcal {S}}_n v : v \in H^{\otimes n} \}\) as the n-particle space.
A basis of \({\mathcal {S}}_nH^{\otimes n}\) is obtained by symmetrization of the n-particle tensor basis:
where the action of \({\mathcal {S}}_n\) on a pure tensor amounts to a linear combination of all permutations of the factors \(u_{\alpha _1}, \dots , u_{\alpha _n}\). The ordering of the indices \(\alpha _1 \leqslant \cdots \leqslant \alpha _n\) is needed to avoid double counting; the equality is included here as, for example, \(u_1 \otimes u_1\) belongs to \({\mathcal {S}}_2 H^{\otimes 2}\). The dual set of the symmetrized tensor basis obeys again a biorthogonality relation [see Eq. (27)]:
with the understanding that in the expression \({\mathcal {S}}_n( \dots )^*\) dualization is carried out after symmetrization. It is evident from Eq. (30) that the two operations do not commute, which motivates us to introduce “normalization” coefficients
so that the symmetrized duals are given in terms of the unsymmetrized duals by
As an example, for \(n=2\), one calculates
so that \(s_{\alpha \beta } = \frac{1}{2}\) for \(\alpha \ne \beta \) and \(s_{\alpha \beta } = 1\) otherwise. With this, a symmetrized function \(\eta \in {\mathcal {S}}_nH^{\otimes n}\) has the basis representation
with coefficients
for \(1\leqslant \alpha _1\leqslant \dots \leqslant \alpha _n < \infty \), using Eqs. (30) and (34) in the second line. For later reference, we again define the subspace spanned by the duals of the symmetrized tensor basis:
Copy number representation. In the basis representation (36), the indices \(\alpha _1,\dots ,\alpha _n\) take values that are not necessarily different from each other. This motivates counting the occurrence of every value \(\alpha =1, 2, \dots \) in a given multi-index \((\alpha _1,\dots ,\alpha _n)\) and switching to a sequence of frequencies \((N_1, N_2, \dots )\), where most of the entries are zero. We refer to these frequencies as copy numbers since \(N_\alpha \) counts how many times the factor \(u_\alpha \) is repeated in an element of the tensor basis, i.e., how many particles have a position distributed according to \(u_\alpha \); in quantum mechanics, the term occupation number is used instead. Given the number of particles \(n\in {\mathbb {N}}\), we define
as the set of possible sequences of copy numbers and use it to enumerate the symmetrized n-particle tensor basis. Then, for \(\eta \in {\mathcal {S}}_nH^{\otimes n}\), we have
with coefficients \(p_{N} = \bigl \langle {\mathcal {S}}_n(u_1^{\otimes N_1}\otimes u_2^{\otimes N_2}\otimes \dots )^*, \eta \bigr \rangle \) . Expanding the symmetrization operator and contracting the dual pairings in Eq. (33), one can show that
so that
where \({\mathcal {S}}_n\) has been omitted in the left factor on the r.h.s. since \(\eta \) is symmetrized. As \(N_\alpha \) is nonzero for at most n values of \(\alpha \) (and recalling that \(0! = 1\) in all other cases), the multi-nomial coefficients \(n!/(N_1!N_2! \dots )\) are well defined. Note these multi-nomial coefficients simply correspond to the “normalization” coefficients defined in Eq. (33).
3.2 The Fock space
The Fock space \(F(H)\) holds the probability densities on the configuration space of the open system, depicted in Fig. 1. The elements of \(F(H)\) are families \(\rho =(\rho _0,\rho _1,\rho _2,\dots )\) of symmetrized n-particle densities, which form the direct sum of symmetrized n-particle spaces (see Sect. 3.1),
It is sometimes convenient to interpret \(\rho _n \in {\mathcal {S}}_n H^{\otimes n}\) as the Fock space element \((0,\dots ,0,\rho _n,0,\dots )\) and to introduce a component-wise addition on \(F(H)\), which allows for a compact notation such as in
Further, we introduce the following generalization of the \(L^1\)-norm,
with \(\Vert \cdot \Vert _n\) as in Eq. (4), and define \(F(H)\) as the set of probability densities of the open system that are integrable and absolutely summable:
In reaction–diffusion problems, we will consider probability densities \(\rho \in F(H)\) which are normalized, \(\Vert \rho \Vert = 1\), and preserve this normalization under time evolution [see Eq. (6)].
In contrast to the \(L^2\)-norm used in quantum mechanics, the \(L^1\)-norm on H and likewise the norm \(\Vert \cdot \Vert \) on \(F(H)\) is not induced by any inner product. Hence, \(F(H)\) cannot be a Hilbert space, but is merely a Banach space. The dual space of \(F(H)\) is given by
with elements \(\nu =(\nu _0, \nu _1,\nu _2,\dots )\) and the supremums norm,
The dual pairing for \(\rho \in F(H)\) and \(\nu \in F(H)^*\) is
with \(\langle \nu _n, \rho _n\rangle \) defined in Eq. (25).
3.3 Creation and annihilation operators
The creation and annihilation operators raise or lower the number of particles in the system by mapping an n-particle density to an \((n+1)\)-particle density or \((n-1)\)-particle density, respectively. The creation operator \(a^+\{w\}\) adds a particle whose position is described by the probability density \(w(x)\in H\), and the annihilation operator \(a^-\{f\}\) removes a particle with reaction rate function \(f(x)\in H^*\). These operators are defined on the symmetrized n-particle spaces and, most importantly, preserve symmetrization:
There action on a symmetrized pure tensor \(v={\mathcal {S}}_n (v_1 \otimes \dots \otimes v_n) \) for \(v_i \in H\) is defined as
where \(v_j\) is omitted in \(v_{\setminus \{j\}}:=v_1\otimes \dots \otimes v_{j-1}\otimes v_{j+1}\otimes \dots \otimes v_n\); for \(n=0\), we set \(a^-\{f\} 1 := 0\). By linearity, these definitions extend naturally to the space \({\mathcal {S}}_nH^{\otimes n}\), since the elements of its tensor basis are of the form v, and to the whole Fock space \(F(H)\) by acting component-wise on \(\rho \).
The sum in the annihilation operator [Eq. (51b)] expresses the fact that there are n different ways to remove a particle. It reduces to a factor n in the well-mixed case without spatial resolution (where all \(v_i\) are equal to the uniform distribution). The above definitions thus differ from the ones used for quantum systems, where \(\sqrt{n+1}\) and \(\sqrt{n}\) are the coefficients of \(a^+\) and \(a^-\), respectively. The present choice is suitable for systems of indistinguishable classical particles [44, 67, 94], and for chemical systems, they avoid cumbersome prefactors in the subsequent results.
The creation and annihilation operators have expansions in the basis \((u_\alpha )\) of the single-particle space H [see Eqs. (B2) and (B4)]:
A basis-free representation of the action of the two operators on a symmetrized n-particle density is given in Eqs. (B3) and (B5). We note that for a given annihilation operator \(a^-\{f\}\) the choice of the reaction rate function f is not unique. It is seen from the second relation that \(a^-\{f\} = a^-\{f + \zeta \}\) for any \(\zeta \in H^*\) such that \(\langle \zeta , u_\alpha \rangle = 0\) for all \(\alpha \). Such \(\zeta \ne 0\) exist since \((u_\alpha ^*)\) does not span \(H^*\). However, this mathematical issue has no further implications for the present work.
Operator algebra. The creation and annihilation operators satisfy a number of basic relations that define an operator algebra, which is useful to derive representations in terms of these operators.
-
1.
Removing a particle from the (normalized) vacuum element \(\rho _\text {vac}:= (1, 0, 0, \dots ) \in F(H)\) yields zero, i.e., \(a^-\{f\} \rho _\text {vac}= 0\) for any \(f \in H^*\).
-
2.
A symmetrized pure tensor \({\mathcal {S}}_n (v_1 \otimes \dots \otimes v_n) \in {\mathcal {S}}_nH^{\otimes n}\) with \(v_i \in H\) is generated from a sequence of creation operators acting on \(\rho _\text {vac}\):
$$\begin{aligned} {\mathcal {S}}_n (v_1 \otimes \dots \otimes v_n) = a^+\{ v_1 \} \dots a^+\{ v_n \} \rho _\text {vac}. \end{aligned}$$(53) -
3.
The creation and annihilation operators satisfy the following commutation relations:
$$\begin{aligned} {[}a^-\{f\},a^+\{w\}]&= \langle f,w\rangle {\mathcal {I}}, \end{aligned}$$(54a)$$\begin{aligned} {[}a^+\{w\},a^+\{\nu \}]&= 0, \end{aligned}$$(54b)$$\begin{aligned} {[}a^-\{f\},a^-\{g\}]&= 0, \end{aligned}$$(54c)for \(w, \nu \in H\) and \(f, g\in H^*\) and using the commutator \([a,b]:=ab-ba\) of operators a, b on \(F(H)\). The proofs are found in Appendix A.3.
-
4.
The particle number operator is defined as
$$\begin{aligned} {\mathcal {N}}:= \sum _{\alpha } a^+\{u_\alpha \} a^-\{u_\alpha ^*\}, \end{aligned}$$(55)where \((u_\alpha )_{\alpha \in {\mathbb {N}}}\) is a basis of the single-particle space H. The name of the operator refers to the fact that a density \(\rho _n\) with fixed particle number n is an eigenfunction to the eigenvalue n, that is
$$\begin{aligned} {\mathcal {N}}\rho _n = n\rho _n , \quad \rho _n \in {\mathcal {S}}_n H^{\otimes n}. \end{aligned}$$(56)We prove this statement in Appendix A.4. For a general, normalized function \(\rho (t) \in F(H)\) with \(\Vert \rho (t) \Vert =1\), the average number of particles is obtained as:
$$\begin{aligned} \Vert {\mathcal {N}} \rho (t) \Vert = \sum _{n\geqslant 0} \Vert {\mathcal {N}} \rho _n(t) \Vert _n = \sum _{n\geqslant 0} n P(n,t) = {\mathbb {E}}\left[ N(t)\right] . \end{aligned}$$(57)Powers of \({\mathcal {N}}\) yield the higher-order factorial moments of N(t), for example,
$$\begin{aligned} \Vert {\mathcal {N}}^2\rho (t) \Vert = {\mathbb {E}}\left[ N(t) (N(t) - 1)\right] , \end{aligned}$$(58)as shown in Appendix A.5. We note that \(\Vert \rho \Vert =1\) does not necessarily imply \({\mathcal {N}} \rho \in F(H)\) and the domain of \({\mathcal {N}}\) is only a subspace of \(F(H)\). In particular, there are reaction networks showing an explosion in finite time \(T<\infty \), i.e., \(\Vert {\mathcal {N}} \rho (t)\Vert = {\mathbb {E}}[N (t)] \rightarrow \infty \) for \(t \rightarrow T\) although \(\Vert \rho (t)\Vert =1\) for all \(0 \leqslant t \leqslant T\).
4 Chemical diffusion master equation in terms of creation and annihilation operators
The chemical diffusion master equation (CDME) is generally composed of diffusion and reaction operators, see Eq. (12). In this section, we will show how to formulate the CDME in a systematic way by expressing these operators in terms of creation and annihilation operators. We distinguish operators that conserve the number of particles and those that do not. The diffusion operator belongs to the first class, whereas the reaction operator of a general single reaction can be decomposed into a conserving and a non-conserving part. We start with particle number conserving operators and then follow up with reaction operators. The scheme is worked out in detail for two exemplary reaction systems at the end of this section.
4.1 Particle number conserving operators
We begin by expanding operators that do not change the number of particles if applied to a function \(\rho _n \in {\mathcal {S}}_n H^{\otimes n}\), i.e., one with a determined number of particles. One can think of their action as moving particles in space or extracting information from the system. These operators will look different depending on the number of particles they act on. As an example, consider a many-particle system where particles diffuse independently. In this case, the diffusion operator is a conserving operator acting on single particles. Conversely, diffusion in the presence of pair interactions can be expressed by a conserving operator acting on two particles at a time.
4.1.1 Conserving single-particle operators
For operators acting on a single particle, let A denote a linear operator on the single-particle space H. We denote by \({\mathcal {A}}_j\) the action of A on the jth argument of a function in the many-particle space \(H^{\otimes n}\). The action of A on every particle of an n-particle system (\(n \geqslant 1\)) is then given by
and focusing on symmetrized spaces, one has \({\mathcal {A}}^n: {\mathcal {S}}_n H^{\otimes n} \rightarrow {\mathcal {S}}_n H^{\otimes n}\). For any operator \(A:H\rightarrow H\) it is shown in Appendix B.2 that the operator \({\mathcal {A}}^n\) can be written as
where we dropped the superindex n as the form of the right-hand side is the same for every n. Note that if A is the identity, we recover the particle number operator \({\mathcal {N}}\), see Eq. (55).
Applying the operator \({\mathcal {A}}\) component-wise extends its action to general elements of the Fock space, \(\rho = (\rho _0,\rho _1,\rho _2,\dots )\in F(H)\), that is \({\mathcal {A}}\rho := ({\mathcal {A}} \rho _0,{\mathcal {A}} \rho _1,{\mathcal {A}} \rho _2,\dots )\), with the convention that \({\mathcal {A}} \rho _0 := 0\).
4.1.2 Conserving two-particle operators
Analogously, let B denote a linear operator on the two-particle space \(H^{\otimes 2}\). The action of B on every possible pair of particles of an n-particle system (\(n \geqslant 2\)) is
where \({\mathcal {B}}_{ij}\) denotes the action of B on the ith and jth components of a symmetrized many-particle function:
where \(v_{\setminus \{i,j\}}:=v_1\otimes \dots \otimes v_{i-1}\otimes v_{i+1}\otimes \dots \otimes \dots \otimes v_{j-1}\otimes v_{j+1}\otimes \dots \otimes v_n\) for \(i < j\). We further assume B to be symmetric, \({\mathcal {B}}_{i,j}={\mathcal {B}}_{j,i}\), so it does not distinguish the labeling of the particles. We prove in Appendix B.3 that \({\mathcal {B}}^n\) has the following expansion:
for \(s_{\alpha \beta }\) defined in Eq. (33). Note that we again dropped the super index n on \({\mathcal {B}}\) since the representations do not depend on n. The factor 1/2 appears in the derivation, it accounts for the fact that removing first particle i and then j is the same as removing particle j first and then i. Expansion (63) is more adequate to easily obtain the explicit coefficients of complex expansions, e.g., three-particle operators (\(n=3\)). However, throughout this work, we will write the expansions in the form of Eq. (64) due to its simpler notation. Also note that due to Eq. (30), the expansion (63) only needs the symmetrization operator in one of the arguments of the dual pairing.
We can again extend the action of the operator \({\mathcal {B}}\) to general functions in the Fock space, \(\rho = (\rho _0,\rho _1,\rho _2,\dots )\in F(H)\), by applying the operator component-wise according to \({\mathcal {B}}\rho := ({\mathcal {B}} \rho _0,{\mathcal {B}} \rho _1,{\mathcal {B}} \rho _2,\dots ). \) Note \({\mathcal {B}} \rho _0 = {\mathcal {B}} \rho _1 = 0\) by construction.
Analogously, we could extend this result to conserving n-particle operators for \(n \geqslant 3\); the coefficient leading the expansion would then be 1/n!. However, in practice, it will be unlikely to encounter these operators for \(n \geqslant 3\), so we do not explicitly work them out here.
4.2 Reaction operators
In the previous section, we focused on conserving operators that yield a result in \({\mathcal {S}}_n H^{\otimes n}\) if acting on \({\mathcal {S}}_n H^{\otimes n}\). However, reactions involve a change in the number of particles, and thus, they cannot be expressed only in terms of particle number conserving operators. For an arbitrary reaction, the reaction operator can naturally be decomposed into two parts:
-
a conserving operator to specify the probability outflow from the current state,
-
a non-conserving operator to specify the probability inflow from another state (with a different number of particles) into the current state.
In order to specify the two operators that form the reaction operator of a general reaction \( k\text {A}\rightarrow l \text {A}\), we first need to define their action on a group of k particles. Using the corresponding reaction rate function \(\lambda \) (see Sect. 2.2), we introduce these operators as \(\Lambda ^{(k)}:H^{\otimes k} \rightarrow H^{\otimes k}\) and \(\Lambda ^{(k,l)}:H^{\otimes k} \rightarrow H^{\otimes l}\) with
Here, \(\Lambda ^{(k)}\) encodes the rate of leaving a given configuration \(x^{(k)}\) with k particles and going to any other configuration with l particles, whereas \(\Lambda ^{(k,l)}\) is the rate of going from any configuration of k particles to the configuration \(y^{(l)}\) with l particles.
From the previous subsection, we already know how to extend a conserving operator to act on all possible combinations of k particles in an n-particle space. For instance, for the reaction \(\text {A}+\text {A}\rightarrow \text {A}\), the conserving part of the operator is [Eq. (64)]:
Analogously, we can construct a similar expression for the non-conserving operator, where two particles are removed and one particle is created:
The proofs of the expansion of non-conserving operators are analogous to those for conserving operators given in Appendices B.2 and B.3. The reaction operator \({\mathcal {R}}\) in the Fock space is composed by the sum of these two operators, acting pointwise according to \(({\mathcal {R}}\rho )_n= {\mathcal {R}}^{(2,1)}\rho _{n+1}-{\mathcal {R}}^{(2)}\rho _n\). Note that the conserving operator will always carry a minus in the CDME, as it refers to the outflow from a given state. The joint action of both operators ensures that probability is conserved in the reaction.
For a general reaction \(k\text {A}\rightarrow l\text {A}\), the reaction operator is given component-wise by \(({\mathcal {R}}\rho )_n= {\mathcal {R}}^{(k,l)}\rho _{n+k-l}-{\mathcal {R}}^{(k)}\rho _n\), where
Note that it holds \({\mathcal {R}}^{(k)}\rho _n = {\mathcal {R}}^{(k,l)}\rho _n = 0\) for \(n<k\), which implies that \(({\mathcal {R}}\rho )_n= -{\mathcal {R}}^{(k)}\rho _n\) for \(k \leqslant n< l\), while \(({\mathcal {R}}\rho )_n= {\mathcal {R}}^{(k,l)}\rho _{n+k-l}\) for \(l \leqslant n< k\). As there are k! ways to choose (i.e., remove) the same k particles, the factor 1/k! is required to avoid double counting. Note these expansions can also be written in terms of the symmetrized basis, see Appendix B.4.
Below we list the expansions of the non-conserving part of frequently used reaction operators involving one species:
These reaction operators are useful to formulate the CDME for a general reaction system.
4.3 Representation of the chemical diffusion master equation
Given a set of reactions indexed by \(r=1,..,S\), each of the form \(k\text{ A } \rightarrow l \text{ A }\) for some \(k,l \in {\mathbb {N}}_0\) and with corresponding reaction operators \({\mathcal {R}}_r\) composed by their conserving part \({\mathcal {R}}_r^{(k)}\) and their non-conserving part \({\mathcal {R}}_r^{(k,l)}\), the CDME (12) may be written as
where k and l depend on r. For each component \(\rho _n\) of \(\rho \in F(H)\) this means
That is, for several reactions, the overall reaction operator is simply the sum of individual reaction operators.
The representation Eq. (72) of the CDME, together with the reaction operator expansions in Eqs. (69) and (70) form the main results of this work. It allows for a structured and systematic formulation of the probabilistic evolution described by the CDME with the physics of the system encoded in the given reaction rate functions and diffusion operators. We note that the expansions of the conserving single and two-particle operators in Eqs. (60) and (64), respectively, are further useful to expand the diffusion operator or higher-order particle-conserving operators.
In (72), the operators do not explicitly depend on the particle number n, which greatly simplifies the expressions. With this, the operators \({\mathcal {Q}}_{nm}\) entering the matrix in Eq. (13) are given as
The \({\mathcal {Q}}\)-matrix has a band-structure since there is only a finite number of reactions, each of them changing the number of particles by a finite amount, i.e., there is some \(\Delta n_\text {max}\) such that \({\mathcal {Q}}_{nm} = 0\) for \(|n-m| > \Delta n_\text {max}\). Another important property is related to the local conservation of probability and generalizes the fact that for a Markov chain on a finite state space column sums of the rate matrix are zero. For the CDME, we show in Appendix C that this is expressed for any fixed \(m\in {\mathbb {N}}_0\) as
in terms of the linear functionals
As an interpretation, \({\mathcal {J}}_{nm}(\cdot )\) yields the probability flux from the space with m particles to one with n particles for \(n \ne m\); the total loss from the m-particle space is summarized in \(-{\mathcal {J}}_{mm}(\cdot )\).
CDME as an evolution equation. The CDME (72), shortly denoted as \(\partial _t \rho = ({\mathcal {D}} + {\mathcal {R}})\rho \) has the form of a linear evolution equation, and the question arises whether it is well-posed in the sense that solutions exist (in a finite time horizon, at least) and are unique. Furthermore, it is a priori not clear that the positivity and normalization of the probability density are preserved, \(\Vert \rho (t)\Vert = 1\) and \(\rho _n(t) \geqslant 0\) for \(t \geqslant 0\) and for each n, and how a permissible initial density \(\rho (0)\) looks like. These questions are brought up already when considering the scenario of well-mixed reaction–diffusion systems characterized by the conventional CME, which is an evolution equation on the space \(\ell ^1\) of discrete probability distributions \(p=(p_0, p_1, \dots )\), where \(p_n(t) = \Vert \rho _n(t)\Vert _n\). The issue was addressed by the pioneers of probability theory for a broad class of Markov jump processes on a countable state space [95,96,97], providing non-trivial criteria to answer these questions [98]. Pathological examples can be found in refs. [98,99,100] and the topic is still a subject of ongoing research [101, 102].
Technically, one employs the Hille–Yosida theory in the setting of an abstract Banach space, which is \(F(H)\) in our case, to specify necessary and sufficient conditions that the operator \({\mathcal {A}} := {\mathcal {D}}+{\mathcal {R}}\) generates a contraction semigroup \(\bigl (\exp (t {\mathcal {A}})\bigr )_{t\geqslant 0}\) on F(H). For a dissipative operator, the Lumer–Phillips theorem [103,104,105] provides more tractable criteria for this to hold. For example, in addition to verifying that \({\mathcal {A}}\) is dissipative on a suitable dense domain, which is proven in Appendix C, one would need to investigate when the operator \(\mu - {\mathcal {A}}\) is surjective for each \(\mu > 0\). A comprehensive analysis of the issue for the CDME exceeds the scope of the present work and is left for future research.
Multiple species. Using the formalism developed above, we can obtain the CDME systematically for an arbitrary reaction–diffusion system with one chemical species. Extensions to multiple species require a straightforward generalization of the notation: In the first place, for a system of L species, the probability densities \(\rho _n \in {\mathcal {S}}_n H^{\otimes n}\) generalize to \(\rho _{n_1,\ldots ,n_L} \in {\mathcal {S}}_{n_1} H^{\otimes n_1} \otimes \dots \otimes {\mathcal {S}}_{n_L} H^{\otimes n_L}\) with an index \(n_l\) for the number of particles of each species; the arguments of \(\rho _{n_1,\ldots ,n_L}\) refer to the particle positions for each species. Note that since any two particles of different species are distinguishable, the \(\rho _{n_1,\ldots ,n_L}\) must not be symmetrized with respect to all particle positions, but only with respect to those of each species. The basis of the space of \(\rho _{n_1,\ldots ,n_L}\) is the tensor product of the bases of the \(n_1\)- to \(n_L\)-particle spaces of the single species case. Secondly, the diffusion part hardly changes except for the fact that the diffusion constant and physical interactions may vary among species, i.e., the diffusion operator will have as many indices as the system has species, indicating its different actions on particles of different species. Thirdly, for the reaction part, we get two indices per species, counting the gains or losses with respect to that species on the reactant as well as on the product side of the reaction. As a consequence, we will have rate functions \(\lambda (x^{(l_1)},\ldots x^{(l_L)};x^{(k_1)},\ldots x^{(k_L)})\) which depend on 2L position tuples for L species. In the many-species generalization, the conserving part of the reaction operator will involve an integration over all product positions of all species. Analogously, the non-conserving part integrates over the reactant positions of all species. Also the creation and annihilation operators have to be specified as to which species it is that is created or destroyed, so that there are creation and annihilation operators separately for each species.
4.4 Exemplary reaction systems
For the examples given in the following, we will reside with the setting of a single species for the sake of clarity. Given the diffusion properties and the reaction rate functions of the system, which encode the physics, we will specify the CDME for some exemplary reaction systems in terms of creation and annihilation operators, and we will show that the equations are consistent with well-known results, e.g., those given in Sect. 2.2. Any chemical reaction can be decomposed into a combination of unimolecular reactions and/or bimolecular reactions, we thus focus on creation and degradation to represent unimolecular reactions and on mutual annihilation as an example for a bimolecular reaction.
4.4.1 Creation and degradation
We consider the birth–death process given already in Eq. (14). It consists of the degradation and creation reactions, \(\text {A}\rightarrow \varnothing \) and \(\varnothing \rightarrow \text {A}\), occurring with rate functions \(\lambda _d(x)\) and \(\lambda _c(x)\), respectively; in addition, each particle of species \(\text {A}\) diffuses freely with diffusion coefficient D. The chemical diffusion master equation can be written in terms of a diffusion operator \({\mathcal {D}}\) and one reaction operator per reaction, \({\mathcal {R}}_d\) and \({\mathcal {R}}_c\), as shown in Sect. 4.2:
with
where \(\rho _n = \rho _n(x^{(n)},t)\), and the reaction rate operators for each reaction are
see Eqs. (65) and (66). Writing the CDME in matrix form as in Eq. (13), the reaction operator
attains a tridiagonal form. Along the diagonal there is the conserving part \({\mathcal {R}}_d^{(1)} + {\mathcal {R}}_c^{(0)}\) with a minus sign, while the non-conserving parts \({\mathcal {R}}_d^{(1,0)}\) and \({\mathcal {R}}_c^{(0,1)}\) are found on the secondary diagonals. Moreover, as the reaction operators given in terms of creation and annihilation operators do not depend on the particle number n, we let them act directly on elements of the Fock space, \(\rho =(\rho _0,\rho _1,\dots ,\rho _n,\dots )\), and rewrite the CDME in compact form:
To write the CDME explicitly as in Sect. 2, we start with the particle number conserving operators. The expansions in terms of creation/annihilation operators corresponds to applying the operators to all the possible particles the operators can act on, thus
where the last one corresponds to a conserving zero-particle operator and thus it acts on no particles. For the non-conserving operators, we need to explicitly apply the creation and annihilation operators. In Appendix B.1, we show how to apply the creation and annihilation operators from Eq. (51) to densities. Using these relations and using \(\rho _n=\sum _{\beta _1\leqslant \dots \leqslant \beta _n}{\tilde{c}}_{\beta _1,\dots ,\beta _n} {\mathcal {S}}_n (u_{\beta _1} \otimes \dots \otimes u_{\beta _n})\), we obtain
and
Gathering all the terms, this matches exactly the proposal in Eq. (15), which serves as a first consistency check of the presented formalism.
4.4.2 Mutual annihilation
Next, we consider the mutual annihilation reaction \(\text {A}+ \text {A}\rightarrow \varnothing \) with rate function \(\lambda (x_1,x_2)\). The CDME in component-wise form is simply
for \(n\geqslant 0\). The diffusion term is the same as before, and the reaction operator can be decomposed into two parts, as shown in Sect. 4.2:
In matrix notation, the full reaction operator acting on \(\rho \in F(H)\) reads
We will now recover the explicit form of the CDME for mutual annihilation. For the conserving part of the reaction operator, we use the relations Eqs. (61), (62) and (64) and apply definition (65) of the reaction rate operator to obtain
For the non-conserving operator, we insert the definitions of the reaction rate operators [Eq. (66)] and of the annihilation operator [Eq. (51b)],
for \(v_{\setminus \{i,j\}}:=v_1\otimes \dots \otimes v_{i-1}\otimes v_{i+1}\otimes \dots \otimes \dots \otimes v_{j-1}\otimes v_{j+1}\otimes \dots \otimes v_{n+2}\). So, the full expression of the CDME of mutual annihilation is given in component-wise form as
By comparing to the CME for mutual annihilation [35], one finds that both the structure of the equation and the coefficients obtained within the present approach are consistent with well-known models at a more coarse-grained level. Note that for many analyses and derivations, this explicit form of the equation is not needed. Instead, we will work on the operator level in terms of creation and annihilation operators in the following.
5 Spatial discretization
In the previous sections, we laid out a theoretical basis for the probabilistic description of particle-based reaction–diffusion systems. Adapting the concept of the Fock space, we developed a systematic method to formulate CDMEs for arbitrary reaction networks. The CDME fully characterizes the stochastic reaction–diffusion dynamics. Solving it analytically or numerically, however, will in general be a demanding issue. It either requires solving directly a huge system of partial differential equations (PDEs) or performing stochastic simulations of the underlying particle-based reaction–diffusion process to obtain costly Monte Carlo estimates of the solution to the CDME. However, there are situations where the physical or structural features of a specific system permit an approximation of the solutions by a (small) finite set of distinguished basis functions. In such cases, a significant complexity reduction is achieved by a projection onto the subspace spanned by this reduced basis set. Our framework is an ideal starting point for this as it already provides a basis representation of the CDME. As an example, we will use a basis of indicator functions completely covering the domain of particle positions, which amounts to a spatial coarse-graining. We thus obtain a generalized RDME—consistent with the convergent RDME [77, 78]—that extends the standard RDME by reactions between particles located in different subdomains.
5.1 Galerkin projection
Let \({\hat{H}} \subset H\) be a finite-dimensional linear subspace of H, and let \(\xi _1,\dots ,\xi _M\) be a normed basis of \({\hat{H}}\), i.e., \(\Vert \xi _i\Vert _1=1\) for \(i=1,\dots ,M\). The dual basis \(\xi _1^*,\dots ,\xi _M^*\) is such that \(\langle \xi _i^*,\xi _j \rangle = \delta _{i,j}\) for all i, j. For example, the \(\xi _i\) could be indicator functions of subsets for a given spatial discretization—a special case which will be analyzed in Sect. 5.4. The set \({\hat{H}}= {{\,\mathrm{span}\,}}( \xi _i )\) induces a subspace \(\hat{F}\subset F(H)\) of the Fock space F(H) defined in Eq. (43). We will now consider a projection \(Q:F(H) \rightarrow \hat{F} \subset F(H)\) onto this subspace.
The Galerkin ansatz for an element \(\rho \in F(H)\) is given by \({\hat{\rho }} =({\hat{\rho }}_0,{\hat{\rho }}_1,\dots )\in \hat{F}\) with
for coefficients
with \(s_{\xi _{i_1},\dots ,\xi _{i_n}}\) defined in Eq. (33). In accordance with the notation in Eq. (44), we write
for the basis representation, or, in terms of creation operators,
where \(\rho _\text {vac}= (1, 0, 0, \dots ) \in F(H)\) is the normalized vacuum element. Equal indices are again included in the sum because, e.g., \(\xi _1 \otimes \xi _1\) is an allowed two-particle density. In the following, we will use the abbreviations
For the diffusion operators \({\mathcal {D}}\) and the reaction operators \({\mathcal {R}}\) we derive the projected operators \(\hat{{\mathcal {D}}}=Q{\mathcal {D}}Q\) and \(\hat{{\mathcal {R}}}=Q{\mathcal {R}}Q\), respectively, by extending the basis of the subspace \(\hat{H}\) by the complement basis \(\chi _1,\chi _2,\dots \), such that \((u_1,u_2,\dots ) = (\xi _1,\dots , \xi _M, \chi _1,\chi _2,\dots )\) is a basis of the full space H. Then, using the operator expansions derived in Sect. 4, we obtain equivalent expressions for the projected operators: After projecting, only the sum over the \(\xi _i\) components remains due biorthogonality, i.e., \(\langle \chi _j^*, \xi _i \rangle = \langle \xi _i^*, \chi _j \rangle = 0\) for any i, j. The expressions for the projected operators can thus be obtained by simply replacing sums over \((u_\alpha )\) by sums over \((\xi _i)\). For example, for the diffusion operator \({\mathcal {D}}\), we obtain
see Eq. (60), while for the non-conserving operator \({\mathcal {R}}^{(0,1)} \) of the reaction \(\varnothing \rightarrow A\), we get
instead of the original expression Eq. (71a).
Copy number representation. The copy number representation introduced in Eq. (40) is particularly useful for densities projected onto a finite Galerkin basis \(\xi _1,\ldots , \xi _M\). We simply need to redefine the index set \({\mathbb {M}}_n\) of Eq. (39) by restricting to the M basis elements, i.e., to multi-indices \(N=(N_1,\dots ,N_M)\). Analogously to Eq. (40), the n-particle density \(\hat{\rho }_n\) as in Eq. (91) can then be written as
with coefficients
In the following sections, we will use the short-hand notation
for the copy number representation. An element of the projected Fock space \(\hat{F}\) then has the copy number representation
replacing the representation (94).
5.2 Normalization and positivity of the projected densities
Given \(\rho \in F(H)\) with \(\rho _n\geqslant 0\) for all n and \(\Vert \rho \Vert =1\), we would like to get a projected function \({\hat{\rho }}\) which fulfills these properties, too. In general, using Eqs. (45) and (4) as well as the standard triangle inequality, we have
From \(\Vert \xi _i\Vert _1=1\) for all i, it follows that \(\Vert \xi _1^{\otimes N_1}\otimes \dots \otimes \xi _M^{\otimes N_M}\Vert _n=1\) as well as \(\Vert {\mathcal {S}}_{n}(\xi _1^{\otimes N_1}\otimes \dots \otimes \xi _M^{\otimes N_M})\Vert _n=1\) for all \(N=(N_1,\dots ,N_M)\in {\mathbb {M}}_n\), such that we obtain
In order to obtain equality, we have to assume both the basis functions \(\xi _i\) and the coefficients \(p_{N_1,\ldots ,N_M}\) to be positive, in which case it holds \({\hat{\rho }}_n\geqslant 0\) for all n and
This will be true for the special case of a full spatial partition with rescaled indicator functions, as we will see below.
However, as soon as the basis functions \(\xi _i\) are positive-valued and overlapping in position space \({\mathbb {X}}\), the dual basis functions \(\xi _i^*\) have negative values and, consequently, also the coefficients \(p_{N_1,\ldots ,N_M}\) as defined by the dual pairing (99) can become negative. In this case, equality (104) is not fulfilled and an interpretation of the coefficients as probabilities becomes pointless.
The positivity of the coefficients is only guaranteed if we assume the dual basis functions \(\xi _i^*\) to be positive, \(\xi _i^*\geqslant 0\) for all i. Demanding in addition that the dual basis functions sum up to one everywhere in position space, \(\sum _i \xi _i^*(x^{(1)}) =1\) for all \( x^{(1)} \in {\mathbb {X}}\) (as given, e.g., for committor functions in the context of transition path theory [106, 107]), this guarantees that the coefficients sum up to one, as well. In this case, the functions \(\xi _i\) themselves are not necessarily positive, and consequently, also the projected functions \(\hat{\rho }_n\) can have negative values, which renders their interpretation as projected probability distribution questionable. These insights motivate us to restrict the following analysis to a full-partition approach.
5.3 Full-partition approach
In order to make sure that both the coefficients and the projected probability distributions are positive-valued, we consider non-overlapping basis functions as indicator functions of a full spatial partition into subsets. That is, we split the position space \({\mathbb {X}}\) into finitely many disjoint subsets \({\mathbb {X}}_i\), \(i=1,\dots ,M\) such that
Given these subsets \({\mathbb {X}}_i\subset {\mathbb {X}}\), we can define the rescaled indicator functions
which fulfill \(\Vert \xi _i\Vert _1=1\). In this special case, the duals are given by
For this approach of a full spatial partition, the norm is naturally retained: Let \(\rho \in F(H)\) be such that \(\rho _n \geqslant 0\) for all n and \(\Vert \rho \Vert =1\) for the norm defined in Eq. (45). This means that it holds
for \({\mathbb {P}}[N=n] := \int \rho _n(x^{(n)}) \, \hbox {d}x^{(n)} \geqslant 0,\) see also Eq. (3). Now we note that for fixed n it holds
for the coefficients \(p_N=p_{N_1,\ldots .,N_M}\) defined in Eq. (99). Moreover, we have
for the given basis functions \(\xi _i\). Inserting into Eq. (108) we obtain
so, due to property (107), we find
with \(p_N \geqslant 0\) for all N. This means that the norm is preserved and the coefficients \(p_N\) may be interpreted as probabilities. Also \(\hat{\rho }_n\geqslant 0\) is fulfilled such that \(\hat{\rho }_n\) is again a probability density function. This discretization can be understood in analogy to conservative finite volume methods, where the conserved quantity is the probability.
Average concentration field. As a simple application of the Galerkin projection and the probabilistic interpretation of Eq. (99), we calculate the expected number of particles in a ball \(B_\varepsilon (y)\) of radius \(\varepsilon \) centered at \(y \in {\mathbb {X}}\). For the Galerkin basis, we choose only \(M=2\) basis functions, namely the indicator functions on \({\mathbb {X}}_1 = B_\varepsilon (y)\) and on its complement \({\mathbb {X}}_2 = {\mathbb {X}} \setminus B_\varepsilon (y)\). Then, the probability that at time t there are n particles in the system and exactly k out of them are located in \(B_\varepsilon (y)\) is obtained from the n-particle component of a given \(\rho (t) \in F(H)\) as the coefficient \(p_{k,n-k}(t)\) in Eq. (99), explicitly:
The average number of particles in \(B_\varepsilon (y)\) is found by taking the expectation of k with respect to the joint distribution of k and n as
Dividing further by the volume of the ball and taking its radius to zero yields the average molecular concentration at point y:
which, after carrying out the limit, reduces to
since in the first line the integral is nonzero only for \(k=1\) as \(\{y\}^{k-1}\) is a null set otherwise. The last result, Eq. (115), can serve as starting point to derive the deterministic reaction–diffusion equation in terms of average concentrations from a given CDME.
5.4 Generalized reaction–diffusion master equation
Given the setting of a full spatial partition with basis functions defined in Eq. (106), we will investigate next the projected diffusion and reaction operators for some basic scenarios and derive the corresponding generalized RDME as an evolution equation for the coefficients \(p_N\). Explicit calculations can be found in Appendix D.2.
5.4.1 Diffusion
Consider the projected diffusion operator given in Eq. (96) and define \(d_{ij} := \left\langle \xi _i^*, {\mathcal {D}}\xi _j \right\rangle \), such that
This means that \(d_{ij}\) refers to the rate to go from basis j to basis i. We apply this operator \(\hat{{\mathcal {D}}}\) to \({\hat{\rho }}\) given in Eq. (101). We find (see Eq. (D8))
Defining the operator \(\mathrm {D}\) acting on the coefficients according to
we can write
In Eq. (118), we used the identity \(d_{ii}= - \sum _{j\ne i} d_{ji}\), which follows for the chosen basis of indicator functions from the equality \(\xi ^*_i =1-\sum _{j\ne i} \xi ^*_j\) after integration by parts.
In the time-dependent setting, where \(\rho =\rho (\cdot ,t)\) is a function of t, we turn the diffusion equation \(\partial _t \rho = {\mathcal {D}} \rho \) into a “diffusion master equation” for the time-dependent coefficients \(p_{N_1,\dots ,N_M}(t)\) of \({\hat{\rho }} (t)\) and get
as the diffusive part of the RDME.
5.4.2 Creation and degradation
Consider the chemical diffusion master equation for creation and degradation, see Eq. (77). Let \(\lambda _d\) denote the rate function of the degradation reaction \(\text {A}\rightarrow \varnothing \), while \(\lambda _c\) is the rate function for the creation reaction \(\varnothing \rightarrow \text {A}\). The corresponding operators are denoted by \({\mathcal {R}}^{(1)}_d\) and \({\mathcal {R}}^{(1,0)}_d\) for degradation and by \({\mathcal {R}}^{(0)}_c\) and \({\mathcal {R}}^{(0,1)}_c\) for creation.
Degradation. The projected conserving one-particle reaction operator \(\hat{{\mathcal {R}}}_d^{(1)}\) acts similarly to the diffusion operator. Defining
for \(\Lambda _d^{(1)}\) given in Eq. (79a), we find [Eq. (D9)]:
Applying the projected non-conserving operator \(\hat{{\mathcal {R}}}_d^{(1,0)}= \sum _i \langle 1,\Lambda _1^{(1,0)}\xi _i \rangle a^-_i\) to \({\hat{\rho }} = \sum _{n \geqslant 0}{\hat{\rho }}_n\) given in Eq. (101), we get [Eq. (D10)]
Combining the results, we define the operator \(\mathrm {R}_{d}\) of the degradation reaction in terms of coefficients by
Creation. As for the creation reaction \(\varnothing \rightarrow \text {A}\), the conserving part \(\hat{{\mathcal {R}}}_c^{(0)}\) acts as a mere constant [Eq. (D11)]:
with the reaction rate constant
For the non-conserving part \(\hat{{\mathcal {R}}}^{(0,1)}\), we set
and obtain [Eq. (D12)]
In terms of coefficients, and observing that \(\Lambda _c^{(0)} =\sum _i \lambda _c^i\), we obtain the operator \(\mathrm {R}_{c}\) as
In combination with the diffusion operator, we obtain the standard RDME for the system of creation and degradation
Scaling by volume. Assuming that the reaction rate function \(\lambda _c\) of creation is constant, \(\lambda _c(x)=\gamma \) for all \(x\in {\mathbb {X}}\), it holds \(\lambda _c^i=\gamma \int \xi ^*_i(x) \hbox {d}x= \gamma \mathrm {vol}({\mathbb {X}}_i)\) and \(\lambda _c= \gamma \mathrm {vol}({\mathbb {X}})\). As for degradation, we analogously obtain \(\lambda _d^i=\gamma \). In both cases, this is the usual volume-scaling of a reaction of order zero or one, respectively.
5.4.3 Mutual annihilation
For the mutual annihilation given by the reaction \(\text {A}+ \text {A}\rightarrow \varnothing \), let \(\lambda (x^{(2)})\) denote the rate for two particles to react when they are located at \(x^{(2)}=(x_1^{(2)},x_2^{(2)})\in {\mathbb {X}}^2\). By virtue of Eq. (66) with \(k=2\) and \(l=0\) we set
and obtain the conserving part \(\hat{{\mathcal {R}}}^{(2)}\) of the reaction as [Eq. (D15)]
On the other hand, under the assumption of non-overlapping basis functions, we find the same \(\lambda ^{ij}\) for the non-conserving part:
and thus [Eq. (D18)]
In terms of the coefficients, we obtain the generalized RDME
where the last two lines define the operator \(\mathrm {R}\). The very last line refers to reactions between particles located in different “boxes.” This makes the equation more general than the standard RDME which would only allow reactions between particles located in the same box.
Volume scaling of the rate constant. Let the reaction rate function \(\lambda \) be of the form \(\lambda (x,y)=\gamma 1_{B_r(x)}(y)\), which corresponds to the standard situation of the Doi reaction model [45]; here, \(\gamma > 0\) is a reaction rate constant, \(r > 0\) is some maximum reaction distance, and \(1_{B_r(x)}\) denotes the indicator function of the ball \(B_r(x)\) of radius r centered at x, i.e., \(1_{B_r(x)}(y) = 1\) if \(\Vert x - y\Vert \leqslant r\) and 0 otherwise. For simplicity, we ignore boundary effects for positions x close to a subdomain boundary such that the reactive volume around x can be assumed to be the same all over a subdomain \({\mathbb {X}}_i\), i.e., \(\int _{{\mathbb {X}}_i} 1_{B_r(x)}(y) \hbox {d}y = V_0\) does not depend on \(x \in {\mathbb {X}}_i\), which is justified if the volume of \({\mathbb {X}}_i\) is large compared to the reactive volume [108]. Then, we obtain
which is the standard scaling of the reaction rate constant for a second-order reaction in a finite volume.
6 Discussion and perspectives
In summary, we developed a probabilistic framework to formalize stochastic, particle-resolved reaction–diffusion dynamics. A main result is the structured formulation of the CDME (73), which governs the temporal evolution of the probability distribution of the many-particle open system, by means of systematically constructed diffusion and reaction operators. As the framework is based on suitable creation and annihilation operators, we translated the concept of Fock space, well-known for Hilbert spaces in quantum mechanics, to the setting of probability densities, which are absolutely integrable rather than square-integrable. We introduced creation and annihilation operators [Eq. (51)] as a natural means to express operations on symmetrized many-particle densities, and we used them as basic building blocks to represent the reaction and diffusion operators, and thus the complete CDME. Diffusion operators are expressed as conserving particle operators, as they do not change the number of particles. However, reaction operators are decomposed into two parts, one that conserves and one that does not conserve the number of particles, representing loss and gain contributions to the probability density, respectively [e.g., Eqs. (78) and (86)]. At the core of the reaction operators are position-dependent reaction rate functions, which encode details of the underlying PBRD model such as where to place reaction products [47, 52, 59, 109]. Domains with open boundary conditions can be simulated by including reactions that insert or remove particles in a boundary layer only. Some of the reaction operators take explicitly the form of integrals over n-particle densities, e.g., [Eq. (89)], so the CDME in general is a family of coupled integro-partial-differential equations. The CDME is proposed as an evolution equation on the Fock space \(F(H)\). Similarly as for the CME—valid for well-mixed systems—there is the question when do solutions to the CDME exist and are unique, which will be addressed elsewhere. We worked out the explicit form of the CDME for typical examples, namely a birth–death process [Eq. (77)] and a mutual annihilation reaction [Eq. (85)]. We point out that the framework extends beyond these specific examples and is applicable to general reaction schemes. We briefly sketched the extension to multiple species (see Sect. 4.3), which is straightforward for few species, but results in a clumsy notation for a larger number of species. Using the Galerkin projection technique to discretize space, we derived a generalized RDME with non-local higher-order reactions as an approximation of the CDME [e.g., Eqs. (130) and (135)]. As a by-product, we found relations between the reaction rate functions of the CDME and the rate constants of the RDME [Eq. (131)].
The (generalized) RDME as a spatial discretization of the CDME may, in principle, appear suitable for a direct numerical treatment due to its form as an evolution equation for the expansion coefficients of the Fock space element \({\hat{\rho }} \in {\hat{F}}\). However, such an attempt would be computationally prohibitive, which follows from a brief estimate of the amount of data to be processed. For a Galerkin basis of length M and limiting the maximum copy number per basis element to \(N_\mathrm {max}\), corresponding to a maximum of \(M \times N_\mathrm {max}\) molecules in the system, the function \({\hat{\rho }}\) has coefficients \(p_{N_1,\dots ,N_M}\) for M indices \(N_1, \dots , N_M\) each ranging from 1 to \(N_\mathrm {max}\), see Eq. (98). The memory requirement to store such a dense coefficient array is \((N_\mathrm {max})^M\) floating-point numbers, which in the setting of the ST-CME with few compartments and an exemplary, conservative choice of \(M=5\) and \(N_\mathrm {max} = 100\) would require already \((N_\mathrm {max})^M=10^{10}\) double words or 75 gigabytes of memory that need to be processed in every time integration step. Conversely, in the RDME-like setting of a regular mesh of \(M=10\times 10\) cells with \(N_\mathrm {max}=5\), one would need the unthinkable amount of \((N_\mathrm {max})^M\approx 10^{70}\) double words. Thus, instead of solving the CDME or RDME directly, the method of choice is to calculate Monte Carlo estimates from simulations of the underlying stochastic process either in continuous space for the CDME [57, 58] or, much less costly, on a lattice for the RDME [110,111,112].
The obtained framework is broadly applicable for formulating and analyzing a wide range of arbitrary reaction–diffusion processes. It provides the theoretical backbone to unify reaction–diffusion models at multiple scales that emerge from the CDME in specific limits or regimes. For instance, macroscopic and mesoscopic descriptions rely on (possibly fluctuating) concentration fields. Testing the assumptions behind the emergence of these descriptions requires analysis on a more refined model such as the CDME. The present formalism also enables the development of consistent hybrid and multi-scale numerical schemes. For example, we have shown that the spatial discretization of the CDME by a Galerkin projection naturally yields a consistent, non-local RDME, which supports higher-order reactions between different subdomains and which converges to the original CDME as the mesh size tends to zero—in contrast to the standard local RDME. Our work thus complements previous numerical studies on convergent RDMEs [77, 78].
A relevant direction of future research would consist on fully integrating out the spatial degrees of freedom from the CDME, which should recover the classical CME under well-mixed conditions. This is straightforward for unimolecular reactions [Eq. (20)], whereas higher-order reactions are influenced by diffusion such that the spatial distribution of reacting molecules enters the rates effectively observed in the CME [47,48,49, 58]; further it gives rise to additional reaction channels [71, 72] and non-Markovian effects [73,74,75]. Moreover, deterministic reaction–diffusion equations emerge from spatially resolved stochastic models in the limit of large copy numbers [108, 113, 114]. It would be of practical value to derive such, in general nonlinear, macroscopic equations from the CDME; see Eq. (115) for how to obtain locally averaged molecular concentrations from the probability densities of the CDME. Future research may also include the extension of Sect. 5 to other types of spatial discretizations by projecting onto basis functions different from indicator functions, e.g., committor functions or locally supported, piecewise-linear functions (hat functions), yielding alternative RDME models that could be numerically advantageous. In conclusion, we have given the probabilistic description of particle-resolved reaction–diffusion systems along with systematic means to formulate the corresponding CDME, which opens rich perspectives for future mathematical and numerical investigations.
Data availability
Data sharing is not applicable to this article as no datasets were generated or analyzed during the current study.
References
Fisher, R.A.: The wave of advance of advantageous genes. Ann. Eugen. 7, 355–369 (1937). https://doi.org/10.1111/j.1469-1809.1937.tb02153.x
Kolmogorov, A., Petrovsky, I., Piscounov, N.: Étude de l’équations de la diffusion avec croissance de la quantité de matière et son application a un problème biologique. Bull. Univ. Moscow Ser. Internat. Sec. A 1 (1937). https://ci.nii.ac.jp/naid/10003528013
van Saarloos, W.: Front propagation into unstable states. Phys. Rep. 386, 29–222 (2003). https://doi.org/10.1016/j.physrep.2003.08.001
Nicolis, G., Prigogine, I.: Self-Organization in Nonequilibrium Systems. Wiley, New York (1977)
Kuramoto, Y.: Chemical Oscillations, Waves, and Turbulence. Springer, Berlin (1984)
Vasquez, D.A.: Chemical instability induced by a shear flow. Phys. Rev. Lett. 93, 104501 (2004). https://doi.org/10.1103/physrevlett.93.104501
Straube, A.V., Pikovsky, A.: Mixing-induced global modes in open active flow. Phys. Rev. Lett. 99, 184503 (2007). https://doi.org/10.1103/physrevlett.99.184503
Cross, M.C., Hohenberg, P.C.: Pattern formation outside of equilibrium. Rev. Mod. Phys. 65, 851–1112 (1993). https://doi.org/10.1103/revmodphys.65.851
Murray, J.D.: Mathematical Biology, vol. 1,2. Springer, Berlin (2003)
Wartlick, O., Kicheva, A., Gonzalez-Gaitan, M.: Morphogen gradient formation. Cold Spring Harb. Perspect. Biol. 1, a001255 (2009). https://doi.org/10.1101/cshperspect.a001255
Fradin, C.: On the importance of protein diffusion in biological systems: the example of the Bicoid morphogen gradient. Biochim. Biophys. Acta Proteins Proteom. 1865, 1676–1686 (2017). https://doi.org/10.1016/j.bbapap.2017.09.002
Huang, A., Saunders, T.E.: A matter of time: formation and interpretation of the bicoid morphogen gradient, chapter 3. In: Small, S., Briscoe, J. (eds.) Gradients and Tissue Patterning, Current Topics in Developmental Biology, vol. 137, pp. 79–117. Academic Press, Cambridge (2020)
Stapornwongkul, K.S., Vincent, J.-P.: Generation of extracellular morphogen gradients: the case for diffusion. Nat. Rev. Genet. 22, 393–411 (2021). https://doi.org/10.1038/s41576-021-00342-y
Halatek, J., Frey, E.: Highly canalized MinD transfer and MinE sequestration explain the origin of robust MinCDE-protein dynamics. Cell Rep. 1, 741–752 (2012). https://doi.org/10.1016/j.celrep.2012.04.005
Amiranashvili, A., Schnellbächer, N.D., Schwarz, U.S.: Stochastic switching between multistable oscillation patterns of the min-system. New J. Phys. 18, 093049 (2016). https://doi.org/10.1088/1367-2630/18/9/093049
Denk, J., Kretschmer, S., Halatek, J., Hartl, C., Schwille, P., Frey, E.: MinE conformational switching confers robustness on self-organized Min protein patterns. Proc. Natl. Acad. Sci. 115, 4553–4558 (2018). https://doi.org/10.1073/pnas.1719801115
Britton, T., Pardoux, E. (eds.): Stochastic Epidemic Models with Inference. Springer, Berlin (2019). https://doi.org/10.1007/978-3-030-30900-8
Sego, T.J., Aponte-Serrano, J.O., Gianlupi, J.F., Glazier, J.A.: Generation of multicellular spatiotemporal models of population dynamics from ordinary differential equations, with applications in viral infection. BMC Biol. 19, 196 (2021). https://doi.org/10.1186/s12915-021-01115-z
Ganyani, T., Faes, C., Hens, N.: Simulation and analysis methods for stochastic compartmental epidemic models. Annu. Rev. Stat. Appl. 8, 69–88 (2021). https://doi.org/10.1146/annurev-statistics-061120-034438
Djurdjevac Conrad, N., Helfmann, L., Zonker, J., Winkelmann, S., Schütte, C.: Human mobility and innovation spreading in ancient times: a stochastic agent-based simulation approach. EPJ Data Sci. 7, 1–22 (2018). https://doi.org/10.1140/epjds/s13688-018-0153-9
Epstein, I.R., Pojman, J.A.: An Introduction to Nonlinear Chemical Dynamics: Oscillations, Waves, Patterns, and Chaos, vol. 1,2. Oxford University Press, Oxford (1998)
Grima, R., Schnell, S.: Modelling reaction kinetics inside cells. Essays Biochem. 45, 41–56 (2008). https://doi.org/10.1042/bse0450041
Wilkinson, D.J.: Stochastic modelling for quantitative description of heterogeneous biological systems. Nat. Rev. Genet. 10, 122–133 (2009). https://doi.org/10.1038/nrg2509
Smith, S., Grima, R.: Spatial stochastic intracellular kinetics: a review of modelling approaches. Bull. Math. Biol. 81, 2960–3009 (2018). https://doi.org/10.1007/s11538-018-0443-1
Falcke, M.: Deterministic and stochastic models of intracellular Ca2 waves. New J. Phys. 5, 96 (2003). https://doi.org/10.1088/1367-2630/5/1/396
Rüdiger, S.: Stochastic models of intracellular calcium signals. Phys. Rep. 534, 39–87 (2014). https://doi.org/10.1016/j.physrep.2013.09.002
Powell, J., Falcke, M., Skupin, A., Bellamy, T.C., Kypraios, T., Thul, R.: A statistical view on calcium oscillations. In: Islam, M.S. (ed.) Calcium Signaling. Advances in Experimental Medicine and Biology, vol. 1131, pp. 799–826. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-12457-1_32
Friedhoff, V.N., Antunes, G., Falcke, M., de Souza, F.M.S.: Stochastic reaction–diffusion modeling of calcium dynamics in 3D dendritic spines of Purkinje cells. Biophys. J. 120, 2112–2123 (2021). https://doi.org/10.1016/j.bpj.2021.03.027
Reichenbach, T., Mobilia, M., Frey, E.: Noise and correlations in a spatial population model with cyclic competition. Phys. Rev. Lett. 99, 238105 (2007). https://doi.org/10.1103/physrevlett.99.238105
Ovaskainen, O., Meerson, B.: Stochastic models of population extinction. Trends Ecol. Evol. 25, 643–652 (2010). https://doi.org/10.1016/j.tree.2010.07.009
Hellander, S., Hellander, A., Petzold, L.: Reaction rates for mesoscopic reaction–diffusion kinetics. Phys. Rev. E 91, 023312 (2015). https://doi.org/10.1103/PhysRevE.91.023312
Isaacson, S.A.: The reaction–diffusion master equation as an asymptotic approximation of diffusion to a small target. SIAM J. Appl. Math. 70, 77–111 (2009). https://doi.org/10.1137/070705039
Isaacson, S.A., Peskin, C.S.: Incorporating diffusion in complex geometries into stochastic chemical kinetics simulations. SIAM J. Sci. Comput. 28, 47–74 (2006). https://doi.org/10.1137/040605060
Winkelmann, S., Schütte, C.: The spatiotemporal master equation: approximation of reaction–diffusion dynamics via Markov state modeling. J. Chem. Phys. 145, 214107 (2016). https://doi.org/10.1063/1.4971163
Winkelmann, S., Schütte, C.: Stochastic Dynamics in Computational Biology. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-62387-6
Zhou, H.-X., Rivas, G., Minton, A.P.: Macromolecular crowding and confinement: biochemical, biophysical, and potential physiological consequences. Ann. Rev. Biophys. 37, 375–397 (2008). https://doi.org/10.1146/annurev.biophys.37.032807.125817
Höfling, F., Franosch, T.: Anomalous transport in the crowded world of biological cells. Rep. Prog. Phys. 76, 046602 (2013). https://doi.org/10.1088/0034-4885/76/4/046602
Weiss, M.: Crowding, diffusion, and biochemical reactions. In: Hancock, R., Jeon, K.W. (eds.) New Models of the Cell Nucleus: Crowding, Entropic Forces, Phase Separation, and Fractals. International Review of Cell and Molecular Biology, chapter 11, vol. 307, pp. 383–417. Academic Press, Cambridge (2014). https://doi.org/10.1016/B978-0-12-800046-5.00011-4
Lanoiselée, Y., Moutal, N., Grebenkov, D.S.: Diffusion-limited reactions in dynamic heterogeneous media. Nat. Commun. 9, 4398 (2018). https://doi.org/10.1038/s41467-018-06610-6
Froemberg, D., Sokolov, I.M.: Stationary fronts in an \(\text{ A }+\text{ B } \rightarrow 0\) reaction under subdiffusion. Phys. Rev. Lett. 100, 108304 (2008). https://doi.org/10.1103/PhysRevLett.100.108304
Sereshki, L.E., Lomholt, M.A., Metzler, R.: A solution to the subdiffusion-efficiency paradox: Inactive states enhance reaction efficiency at subdiffusion conditions in living cells. EPL (Europhys. Lett.) 97, 20008 (2012). https://doi.org/10.1209/0295-5075/97/20008
Engblom, S., Lötstedt, P., Meinecke, L.: Mesoscopic modeling of random walk and reactions in crowded media. Phys. Rev. E 98, 033304 (2018). https://doi.org/10.1103/PhysRevE.98.033304
Sarkar, S.: Concentration dependence of diffusion-limited reaction rates and its consequences. Phys. Rev. X 10, 041032 (2020). https://doi.org/10.1103/physrevx.10.041032
Doi, M.: Second quantization representation for classical many-particle system. J. Phys. A: Math. Gen. 9, 1465 (1976). https://doi.org/10.1088/0305-4470/9/9/008
Doi, M.: Stochastic theory of diffusion-controlled reaction. J. Phys. A: Math. Gen. 9, 1479 (1976). https://doi.org/10.1088/0305-4470/9/9/009
Smoluchowski, M.: Attempt for a mathematical theory of kinetic coagulation of colloid solutions. Z. Phys. Chem. 92, 129–168 (1917)
Agmon, N., Szabo, A.: Theory of reversible diffusion-influenced reactions. J. Chem. Phys. 92, 5270–5284 (1990). https://doi.org/10.1063/1.458533
Hänggi, P., Talkner, P., Borkovec, M.: Reaction-rate theory: fifty years after Kramers. Rev. Mod. Phys. 62, 251 (1990). https://doi.org/10.1103/RevModPhys.62.251
Rice, S.A.: Diffusion-Limited Reactions. Elsevier, Amsterdam (1985)
Szabo, A., Schulten, K., Schulten, Z.: First passage time approach to diffusion controlled reactions. J. Chem. Phys. 72, 4350–4357 (1980). https://doi.org/10.1063/1.439715
Szabo, A., Shoup, D., Northrup, S.H., McCammon, J.A.: Stochastically gated diffusion-influenced reactions. J. Chem. Phys. 77, 4484–4493 (1982). https://doi.org/10.1063/1.444397
Andrews, S.S., Bray, D.: Stochastic simulation of chemical reactions with spatial resolution and single molecule detail. Phys. Biol. 1, 137 (2004). https://doi.org/10.1088/1478-3967/1/3/001
Van Zon, J.S., Ten Wolde, P.R.: Green’s-function reaction dynamics: a particle-based approach for simulating biochemical networks in time and space. J. Chem. Phys. 123, 234910 (2005). https://doi.org/10.1063/1.2137716
Erban, R., Chapman, S.J.: Stochastic modelling of reaction–diffusion processes: algorithms for bimolecular reactions. Phys. Biol. 6, 046001 (2009)
Lipková, J., Zygalakis, K.C., Chapman, S.J., Erban, R.: Analysis of Brownian dynamics simulations of reversible bimolecular reactions. SIAM J. Appl. Math. 71, 714–730 (2011). https://doi.org/10.1137/100794213
Klann, M., Koeppl, H.: Spatial simulations in systems biology: from molecules to cells. Int. J. Mol. Sci. 13, 7798–7827 (2012). https://doi.org/10.3390/ijms13067798
Hoffmann, M., Fröhner, C., Noé, F.: ReaDDy 2: fast and flexible software framework for interacting-particle reaction dynamics. PLoS Comput. Biol. 15, e1006830 (2019). https://doi.org/10.1371/journal.pcbi.1006830
Dibak, M., Fröhner, C., Noé, F., Höfling, F.: Diffusion-influenced reaction rates in the presence of pair interactions. J. Chem. Phys. 151, 164105 (2019). https://doi.org/10.1063/1.5124728
Fröhner, C., Noé, F.: Reversible interacting-particle reaction dynamics. J. Phys. Chem. B 122, 11240–11250 (2018). https://doi.org/10.1021/acs.jpcb.8b06981
del Razo, M.J., Dibak, M., Schütte, C., Noé, F.: Multiscale molecular kinetics by coupling Markov state models and reaction–diffusion dynamics (2021). arXiv:2103.06889 [physics.chem-ph]
Dibak, M., Del Razo, M.J., De Sancho, D., Schütte, C., Noé, F.: MSM/RD: coupling Markov state models of molecular kinetics with reaction–diffusion simulations. J. Chem. Phys. 148, 214107 (2018). https://doi.org/10.1063/1.5020294
del Razo, M.J., Qian, H., Noé, F.: Grand canonical diffusion-influenced reactions: a stochastic theory with applications to multiscale reaction–diffusion simulations. J. Chem. Phys. 149, 044102 (2018). https://doi.org/10.1063/1.5037060
Ridgway, D., Broderick, G., Lopez-Campistrous, A., Ru’aini, M., Winter, P., Hamilton, M., Boulanger, P., Kovalenko, A., Ellison, M.J.: Coarse-grained molecular simulation of diffusion and reaction kinetics in a crowded virtual cytoplasm. Biophys. J. 94, 3748–3759 (2008). https://doi.org/10.1529/biophysj.107.116053
Echeverria, C., Kapral, R.: Enzyme kinetics and transport in a system crowded by mobile macromolecules. Phys. Chem. Chem. Phys. 17, 29243–29250 (2015). https://doi.org/10.1039/C5CP05056A
Weilandt, D.R., Hatzimanikatis, V.: Particle-based simulation reveals macromolecular crowding effects on the Michaelis–Menten mechanism. Biophys. J. 117, 355–368 (2019). https://doi.org/10.1016/j.bpj.2019.06.017
Lin, Y.-C., Kim, W.K., Dzubiella, J.: Coverage fluctuations and correlations in nanoparticle-catalyzed diffusion-influenced bimolecular reactions. J. Phys. Chem. C 124, 24204–24214 (2020). https://doi.org/10.1021/acs.jpcc.0c06898
Grassberger, P., Scheunert, M.: Fock-space methods for identical classical objects. Fortschr. Phys. 28, 547–578 (1980). https://doi.org/10.1002/prop.19800281004
Birch, D.A., Young, W.R.: A master equation for a spatial population model with pair interactions. Theor. Popul. Biol. 70, 26–42 (2006). https://doi.org/10.1016/j.tpb.2005.11.007
Dodd, P.J., Ferguson, N.M.: A many-body field theory approach to stochastic models in population biology. PLoS ONE 4, e6855 (2009). https://doi.org/10.1371/journal.pone.0006855
Kolokoltsov, V.N.: Nonlinear Markov Processes and Kinetic Equations, vol. 182. Cambridge University Press, Cambridge (2010)
Gopich, I.V., Szabo, A.: Diffusion modifies the connectivity of kinetic schemes for multisite binding and catalysis. Proc. Natl. Acad. Sci. 110, 19784–19789 (2013). https://doi.org/10.1073/pnas.1319943110
Gopich, I.V., Szabo, A.: Diffusion-induced competitive two-site binding. J. Chem. Phys. 150, 094104 (2019). https://doi.org/10.1063/1.5079748
Gopich, I.V., Szabo, A.: Theory of diffusion-influenced reaction networks. J. Phys. Chem. B 122, 11338–11354 (2018). https://doi.org/10.1021/acs.jpcb.8b07250
Grebenkov, D.S., Metzler, R., Oshanin, G.: Strong defocusing of molecular reaction times results from an interplay of geometry and reaction control. Commun. Chem. 1, 96 (2018). https://doi.org/10.1038/s42004-018-0096-x
Frömberg, D., Höfling, F.: Generalized master equation for first-passage problems in partitioned spaces. J. Phys. A: Math. Theor. 54, 215601 (2021). https://doi.org/10.1088/1751-8121/abf2ec
Hellander, S., Hellander, A., Petzold, L.: Reaction–diffusion master equation in the microscopic limit. Phys. Rev. E 85, 042901 (2012). https://doi.org/10.1103/PhysRevE.85.042901
Isaacson, S.A.: A convergent reaction–diffusion master equation. J. Chem. Phys. 139, 054101 (2013). https://doi.org/10.1063/1.4816377
Isaacson, S.A., Zhang, Y.: An unstructured mesh convergent reaction–diffusion master equation for reversible reactions. J. Comput. Phys. 374, 954–983 (2018). https://doi.org/10.1016/j.jcp.2018.07.036
Gillespie, D.T.: A general method for numerically simulating the stochastic time evolution of coupled chemical reactions. J. Comput. Phys. 22, 403–434 (1976). https://doi.org/10.1016/0021-9991(76)90041-3
Van Kampen, N.G.: Stochastic Processes in Physics and Chemistry. Elsevier, Amsterdam (1992). https://doi.org/10.1007/3-540-34286-9_5
Qian, H.: Open-system nonequilibrium steady state: statistical thermodynamics, fluctuations, and chemical oscillations. J. Phys. Chem. B 110, 15063–15074 (2006). https://doi.org/10.1021/jp061858z
Deuflhard, P., Huisinga, W., Jahnke, T., Wulkow, M.: Adaptive discrete Galerkin methods applied to the chemical master equation. SIAM J. Sci. Comput. 30, 2990–3011 (2008). https://doi.org/10.1137/070689759
Fock, V.: Konfigurationsraum und zweite quantelung. Z. Phys. 75, 622–647 (1932). https://doi.org/10.1007/BF01344458
Bressloff, P.C.: Construction of stochastic hybrid path integrals using operator methods. J. Phys. A: Math. Theor. 54, 185001 (2021). https://doi.org/10.1088/1751-8121/abf38f
Carinci, G., Giardina, C., Giberti, C., Redig, F.: Dualities in population genetics: a fresh look with new dualities. Stoch. Process. Appl. 125, 941–969 (2015). https://doi.org/10.1016/j.spa.2014.10.009
Jansen, S., Kurt, N.: On the notion(s) of duality for Markov processes. Probab. Surv. 11, 59–120 (2014). https://doi.org/10.1214/12-PS206
Reif, F.: Fundamentals of Statistical and Thermal Physics. McGraw Hill, New York (1965)
Delle Site, L., Klein, R.: Liouville-type equation for the \(n\)-particle distribution function of an open system. J. Math. Phys. 61, 083102 (2020). https://doi.org/10.1063/5.0008262
Klein, R., Ebrahimi Viand, R., Höfling, F., Delle Site, L.: Nonequilibrium induced by reservoirs: physico-mathematical model and numerical tests. Adv. Theory Simul. 4, 2100071 (2021). https://doi.org/10.1002/adts.202100071
Hansen, J.-P., McDonald, I.R.: Theory of Simple Liquids, 3rd edn. Academic Press, Amsterdam (2006)
Dhont, J.K.G.: An Introduction to Dynamics of Colloids, Studies in Interface Science. Elsevier, Amsterdam (1996)
Werner, D.: Funktionalanalysis. Springer, Berlin (2006)
Johnson, W.B., Schechtman, G.: A Schauder basis for \(L_1(0,\infty )\) consisting of non-negative functions (2015). arXiv:1502.07557 [math.FA]
Baez, J., Biamonte, J.D.: Quantum Techniques in Stochastic Mechanics. World Scientific, Singapore (2018). https://doi.org/10.1142/10623 . arXiv:1209.3632 [quant-ph]
Feller, W.: On the integro-differential equations of purely discontinuous Markoff processes. Trans. Am. Math. Soc. 48, 488–515 (1940). https://doi.org/10.1090/s0002-9947-1940-0002697-3
Kolmogorov, A.N.: On some problems concerning the differentiability of the transition probabilities in a temporally homogeneous Markov process having a denumerable set of states. Ucenye Zapiski (Matem.) Moskov. Gos. Univ. 148, 53–59 (1951)
Doob, J.: Stochastic Processes. Wiley, New York (1953)
Anderson, W.J.: Continuous-Time Markov Chains. Springer, New York (2012). https://www.ebook.de/de/product/25435927/william_j_anderson_continuous_time_markov_chains.html
Kendall, D.G.: Some further pathological examples in the theory of denumerable Markov processes. Q. J. Math. 7, 39–56 (1956). https://doi.org/10.1093/qmath/7.1.39
Reuter, G.E.H.: Denumerable Markov processes and the associated contraction semigroups on \(\ell \). Acta Math. 97, 1–46 (1957). https://doi.org/10.1007/bf02392391
Chen, A., Zhang, H., Liu, K., Rennolls, K.: Birth-death processes with disaster and instantaneous resurrection. Adv. Appl. Probab. 36, 267–292 (2004). https://doi.org/10.1017/s0001867800012969
Feinberg, E., Mandava, M., Shiryaev, A.N.: Kolmogorov’s equations for jump Markov processes with unbounded jump rates. Ann. Oper. Res. (2017). https://doi.org/10.1007/s10479-017-2538-8
Engel, K.-J., Nagel, R.: One-Parameter Semigroups for Linear Evolution Equations, vol. 194. Springer, Berlin (2000)
Staffans, O.: Well-Posed Linear Systems, vol. 103. Cambridge University Press, Cambridge (2005)
Schnaubelt, R.: Evolution Equations, Lecture Notes. Karlsruhe Institute of Technology (2020). https://www.math.kit.edu/iana3/~schnaubelt/media/evgl-skript.pdf
Weinan, E., Vanden-Eijnden, E.: Towards a theory of transition paths. J. Stat. Phys. 123, 503–523 (2006). https://doi.org/10.1007/s10955-005-9003-9
Metzner, P., Schütte, C., Vanden-Eijnden, E.: Transition path theory for Markov jump processes. Multiscale Model. Simul. 7, 1192–1219 (2009). https://doi.org/10.1137/070699500
Kostré, M., Schütte, C., Noé, F., del Razo, M.J.: Coupling particle-based reaction-diffusion simulations with reservoirs mediated by reaction-diffusion PDEs (2020). arXiv:2006.00003 [q-bio.QM]
del Razo, M.J., Qian, H.: A discrete stochastic formulation for reversible bimolecular reactions via diffusion encounter. Commun. Math. Sci. 14, 1741–1772 (2016). https://doi.org/10.4310/CMS.2016.v14.n6.a13
Drawert, B., Engblom, S., Hellander, A.: URDME: a modular framework for stochastic simulation of reaction-transport processes in complex geometries. BMC Syst. Biol. 6, 76 (2012). https://doi.org/10.1186/1752-0509-6-76
Roberts, E., Stone, J.E., Luthey-Schulten, Z.: Lattice microbes: high-performance stochastic simulation method for the reaction–diffusion master equation. J. Comput. Chem. 34, 245–255 (2013). https://doi.org/10.1002/jcc.23130
Hallock, M.J., Stone, J.E., Roberts, E., Fry, C., Luthey-Schulten, Z.: Simulation of reaction diffusion processes over biologically relevant size and time scales using multi-GPU workstations. Parallel Comput. 40, 86–99 (2014). https://doi.org/10.1016/j.parco.2014.03.009
Isaacson, S.A., Ma, J., Spiliopoulos, K.: Mean field limits of particle-based stochastic reaction–diffusion models (2020a). arXiv:2003.11868 [math.PR]
Isaacson, S.A., Ma, J., Spiliopoulos, K.: How reaction–diffusion PDEs approximate the large-population limit of stochastic particle models (2020b). arXiv:2009.01392 [math.AP]
Acknowledgements
We thank Rupert Klein for a fruitful exchange on the manuscript. We acknowledge the support of Deutsche Forschungsgemeinschaft (DFG) through the Collaborative Research Center SFB 1114 “Scaling Cascades in Complex Systems” project no. 235221301, subproject C03, and under Germany’s Excellence Strategy—MATH+ : The Berlin Mathematics Research Center (EXC-2046/1)—project no. 390685689 (subproject AA1-1). MJR acknowledges support by the Dutch Institute of Emergent Phenomena (DIEP) cluster at the University of Amsterdam and support from DFG Grant No. RA 3601/1-1 during the final stages of the project.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
On behalf of all authors, the corresponding author states that there is no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article belongs to the themed collection: Mathematical Physics and Numerical Simulation of Many-Particle Systems; V. Bach and L. Delle Site (eds.).
Appendices
Appendix A: Basic relations and properties
Here, we prove some basic relations and properties for creation and annihilation operators that are quoted in the main text or are used for proving some of the subsequent results.
1.1 Relations for the dual pairing
For the contraction of dual pairings [Eq. (21)], it holds for \(\eta \in H\) and \(\zeta \in H^*\) and an operator \(A : H\rightarrow H\) that
where we inserted the basis expansion of \(\eta \) given in Eq. (22). In the second line, the corresponding expansion of \(A\eta \in H\) was used.
Next, we prove Eq. (30). Assuming \(\zeta \in (H^{\otimes n})^*\) and \(\eta \in H^{\otimes n}\), it holds
As the integral is over the whole domain, the labeling of the integration variables does not matter, and consequently
where \(\Sigma _n\) is the set of all permutations \(\sigma \) on \(\{1, \dots , n\}\). Following the same argument in reverse gives \( \langle {\mathcal {S}}_n \zeta , {\mathcal {S}}_n \eta \rangle = \langle \zeta , {\mathcal {S}}_n \eta \rangle \) and thus
1.2 Products of \(a^+\) and \(a^-\) operators
In the following, we prove relations for the action of products of \(a^+\) and \(a^-\) operators on a symmetrized pure tensor \(v={\mathcal {S}}_n (v_1 \otimes \cdots \otimes v_n)\in {\mathcal {S}}_n H^{\otimes n}\) for \(v_i \in H\) and \(i=1, \dots , n\).
Applying \(a^+ a^-\) for elements of the single-particle basis to v yields
where we used the definitions (51), as well as the short-hand notation \(v_{\setminus \{j\}} = v_1\otimes \dots v_{j-1}\otimes v_{j+1} \dots \otimes v_n\). Similarly, we find for the product \(a^+ a^+ a^- a^-\) that
1.3 Commutation relations
We prove the commutator relations (54). Again, it is sufficient to consider a symmetrized pure tensor \(v \in {\mathcal {S}}_n H^{\otimes n}\) as above. Let \(w, \nu \in H\) and \(f, g\in H^*\), and we begin with the first identity:
To expand the first term, we apply Eq. (51a) followed by Eq. (51b). The second term is given in Eq. (A5), and we obtain
The other two commutators follow similarly by inserting the definitions (51):
and
1.4 Particle number operator
Putting \(\alpha = \beta \) in Eq. (A5), summation over \(\alpha \) yields the particle number operator:
exploiting biorthogonality, \(\langle u_\alpha ^*, v_j\rangle = \delta _{\alpha ,j}\), in the last-but-one step. As a general density \(\rho _n \in {\mathcal {S}}_n H^{\otimes n}\) is a linear combination of symmetric pure tensors, the previous relation implies \({\mathcal {N}}\rho _n= n \rho _n\). It follows straightforwardly that the identity operator (resolution of unity) on \({\mathcal {S}}_n H^{\otimes n}\) reads
1.5 Second-order particle number operator
Taking \(\alpha = \gamma \) and \(\beta = \delta \) in Eq. (A6) and summing over \(\alpha \) and \(\beta \), we obtain the second-order particle number operator:
Due to linearity, this implies \({\mathcal {N}}^2\rho _n= n(n-1) \rho _n\). In analogy to the particle number operator it holds
The generalization is straightforward. In general, \({\mathcal {N}}^k\) can be expressed as k sums where each term is composed by k creation operators followed by k annihilation operators. It holds that
which are the factorial moments of N(t).
Appendix B: Expansions in terms of creation and annihilation operators
In the following, we always assume \(v \in {\mathcal {S}}_n H^{\otimes n}\) to be a symmetrized pure tensor, \(v={\mathcal {S}}_n (v_1 \otimes \cdots \otimes v_n)\) for \(v_i \in H\).
1.1 Creation and annihilation operators acting on densities
We rewrite the creation and annihilation operators from Eq. (51) acting explicitly on symmetrized densities. For \(\rho _n \in {\mathcal {S}}_n H^{\otimes n}\) and \(w \in H\) it holds
which evaluates at a point \(x^{(n+1)} \in {\mathbb {X}}^{n+1}\) to
with the notation \(x_{\setminus \{j\}}^{(n+1)} = (x_1^{(n+1)}, \dots , x_{j-1}^{(n+1)}, x_{j+1}^{(n+1)}, \dots x_{n+1}^{(n+1)}) \in {\mathbb {X}}^n\).
Applying the annihilation operator to \(\rho _n=\sum _{\beta _1\leqslant \dots \leqslant \beta _n}{\tilde{c}}_{\beta _1,\dots ,\beta _n} {\mathcal {S}}_n (u_{\beta _1} \otimes \dots \otimes u_{\beta _n})\) yields for \(f \in H^*\):
which evaluates to
1.2 Conserving single-particle operators
We prove that the operator \({\mathcal {A}}={\mathcal {A}}^n\) defined in Eq. (59) fulfills Eq. (60):
1.3 Conserving two-particle operators
Per definition (62), it holds
The representation of \(B (v_i \otimes v_j)\) in terms of the symmetrized 2-particle tensor basis reads [Eq. (36)]:
Together this gives
and after interchanging dualization and symmetrization using Eq. (34):
where the symmetrization \({\mathcal {S}}_{2}\) in the argument of the symmetric operator B may be omitted. Moreover, we note that for \(v \in {\mathcal {S}}_n H^{\otimes n}\) it holds
where Eq. (30) was used in the last line. Inserting this equality into Eq. (B10), the operator fulfills
By means of Eq. (35), and using the symmetry of the summands, we finally obtain
Note that the need of the factor 1/2 becomes clear as a consequence of applying the annihilation operator twice; removing first particle i and then j is the same as removing particle j and then i. If an operator expansion consists of n annihilation operators, it will require a factor 1/n! to account for all the equivalent orderings.
One can verify the equation above directly by expanding the creation and annihilation operators and by contracting the expansions in the non-symmetrized basis:
1.4 Reaction operators
The reaction operators
can also be written in terms of their symmetric basis:
where \(s_{\beta _1,\dots ,\beta _n}\) is defined on Eq. (33). The proofs of these expansions work analogously to the proof in Appendix B.3. The coefficients \(s_{\beta _1,\dots ,\beta _k}\) and the coefficients \(s_{\alpha _1,\dots ,\alpha _k}\) (or \(s_{\alpha _1,\dots ,\alpha _l}\)) that appear when commuting symmetrization with duality compensate for the duplicate terms in the non-symmetrized expansion.
Appendix C: Local conservation of probabilities and dissipativity of the CDME operator
The dissipativity of the reaction operator is closely related to the local conservation of probability, expressed by Eq. (75), which we will prove first. This can be done for each reaction separately, such that we can restrict to the case of a single reaction with fixed stoichiometric coefficients k and l. Then, each “column” m of the reaction operator \({\mathcal {R}}= ({\mathcal {Q}}_{nm})\) has at most two nonzero entries (see Eq. (74)) and the column sum reduces to
for \(m \geqslant k\) and dropping the subscript r on the reaction operators. (For \(m < k\), the reaction does not occur and \({\mathcal {J}}_{nm}(\rho _m) = 0\), irrespective of the value of n.) For the sum in (C1) to be zero it has to hold
for \(\rho _m\in {\mathcal {S}}_m H^{\otimes m}\), which means that the considered reaction induces the probability outflow from the m-particle space given on the l.h.s., which is balanced by the inflow into the space with \(m-k+l\) particles on the r.h.s.
We here give the proof of Eq. (C2) for the case \(k=2\) and \(l=1\), i.e., the reaction \(\text {A}+ \text {A}\rightarrow \text {A}\); the general case for arbitrary k, l follows analogously. The reaction operators read [Eqs. (69) and (71b)]:
with the coefficients given in terms of the same reaction rate function \(\lambda (x^{(1)},x^{(2)})\) as [Eqs. (65) and (66)]:
Integrating the remaining degrees of freedom of each equation and using Fubini’s theorem, we obtain the relation
With this, we conclude Eq. (C2) as follows:
inserting the definition of the creation operator [(51a)],
omitting the symmetrization under the integral,
carrying out the sums over \(\alpha _1, \alpha _2\) upon recognizing the expansion of \(\Lambda ^{(2)}( u_{\beta _1} \otimes u_{\beta _2})\) in the 2-particle basis [Eq. (28)],
interchanging the sums and integrals,
making use of the identity (C7),
and finally expanding \(\Lambda ^{(2,1)}( u_{\beta _1} \otimes u_{\beta _2} )\) in the 1-particle basis [Eq. (28)],
This implies (C2) for the considered reaction. Using equivalent argumentation for other reactions and combining the results delivers Eq. (75).
For the dissipativity of the reaction operator \({\mathcal {R}}\) one has to show that
Here, \({{\,\mathrm{dom}\,}}({\mathcal {R}}) \subset F(H)\) denotes the domain of \({\mathcal {R}}\), which is taken such that \(\rho \in {{\,\mathrm{dom}\,}}({\mathcal {R}})\) satisfies \(\Vert {\mathcal {R}}\rho \Vert < \infty \) and \(\sum _{n=0}^\infty |{\mathcal {J}}_{nm}(\rho _m)| < \infty \) for each \(m \in {\mathbb {N}}_0\). The last condition together with Eq. (75) implies the relation
noting that only finitely many of the \({\mathcal {Q}}_{nm}\) are nonzero for fixed n (see Eq. (74)), which allows us to interchange the m-sum and the integral over \({\mathbb {X}}^n\) and finally the order of summation. With this relation, it is straightforward to infer the dissipativity of \({\mathcal {R}}\):
making use of the triangle inequality in the second line and that both sums are absolutely convergent in the last-but-one step.
Finally, the calculation in (C11) can directly be transferred to the reaction–diffusion operator \({\mathcal {D}}+{\mathcal {R}}\), noticing that due to the reflecting boundary conditions at the boundary of \({\mathbb {X}}\), it holds
for any \(n\in {\mathbb {N}}\). In total, we obtain the dissipativity of the CDME operator \({\mathcal {A}}={\mathcal {R}}+ {\mathcal {D}}\).
Appendix D: Spatial discretization
The crucial step in performing the spatial discretization is to consistently define the respective diffusion and reaction operators, which are expressed in terms of creation and annihilation operators, acting on the basis elements of the copy number representation. We lay a basis for doing so in the following.
1.1 Creation and annihilation operators acting on a discrete space
The action of the creation and annihilation operators on the Galerkin-projected Fock space \({\hat{F}}\) is particularly transparent for basis elements of the copy number representation. Using the short-hand notation (100), it holds
Relation (D1) follows directly from the definition of the operator \(a_k^+\). In order to prove the second relation, Eq. (D2), we make the following observation. From the commutator relation \(\left[ a^- \{f\}, a^+ \{w\}\right] = \left\langle f,w \right\rangle {\mathcal {I}}\), see Eq. (54), we get
where \(\delta _{ij}\) is the Kronecker delta. More generally, for \(n\geqslant 1\), we have
and thus
We can use this formula to calculate the following identities:
which, applied to \(\rho _{vac}\), yields Eq. (D2). By repeated application of Eq. (D2) if follows directly that
for \(k\ne l\), and
which will be used in the next section for deriving the projected reaction operators of mutual annihilation.
1.2 Diffusion and reaction operators
For the calculation of the diffusion and reaction operators we need to apply the properties of the creation and annihilation operators given in Appendix D.1 and subsequently shift indices. In order to simplify notation in this index shift, we define \(p_{N_1,\ldots , N_M}:=0\) for all \((N_1,\dots ,N_M) \notin {\mathbb {N}}_0^M\) (especially for negative indices).
Diffusion. For the projected diffusion operator \(\hat{{\mathcal {D}}}\) given in Eq. (96) and specified in Eq. (116) we find, using the copy number representation (101) as well as the properties (D1) and (D2),
Degradation. Analogously, we obtain for the conserving one-particle reaction operator \(\hat{{\mathcal {R}}}_d^{(1)}=\sum _{i,j} \langle \xi _i^*, \Lambda ^{(1)}_d \xi _j\rangle a_i^+ a_j^-\) of degradation \(\text {A}\rightarrow \varnothing \) by using definition (121):
and for the non-conserving one-particle reaction operator \(\hat{{\mathcal {R}}}_d^{(1,0)}=\sum _i \langle 1, \Lambda ^{(1,0)}_d \xi _i\rangle a_i^-\) of degradation, by means of definition (121):
Creation. For the creation reaction \(\varnothing \rightarrow I \) the conserving part is given by
and the non-conserving one-particle reaction operator acts as:
where we applied definition (127).
Mutual annihilation. For the reaction \(\text {A}+ \text {A}\rightarrow \varnothing \) of mutual annihilation, the projected conserving operator \(\hat{{\mathcal {R}}}^{(2)}\) is given by
see Eq. (86a). For the rescaled indicator functions given in Eq. (106), the factor
is for \(k=i\) and \(l=j\) equal to \(\lambda ^{ij}\) defined in Eq. (131), and zero otherwise.
We thus obtain
where the last equality follows from the fact that the reaction rate function \(\lambda (x^{(2)})\) is symmetric, which implies \(\lambda ^{ij}=\lambda ^{ji}\).
For the non-conserving part, it holds
where
again for \(\lambda ^{ij}\) defined in Eq. (131), such that
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
del Razo, M.J., Frömberg, D., Straube, A.V. et al. A probabilistic framework for particle-based reaction–diffusion dynamics using classical Fock space representations. Lett Math Phys 112, 49 (2022). https://doi.org/10.1007/s11005-022-01539-w
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11005-022-01539-w
Keywords
- Particle-based reaction–diffusion models
- Reaction–diffusion master equation
- Fock space methods
- Classical many-particle systems
- Galerkin projection