Continuous-time Markov-chain models for reaction systems: fast and slow processes

MacDonald, Iain L.; Pienaar, Etienne A. D.

doi:10.1007/s11144-023-02440-w

Open access
Published: 06 July 2023

Volume 136, pages 1757–1773, (2023)
Reaction Kinetics, Mechanisms and Catalysis

Abstract

It is sometimes of interest to identify the fast and slow processes in a reaction system. We present here an approach to this problem which is based on a simple stochastic model, a continuous-time Markov chain on a small number of states. We show how it is possible to use such a stochastic model to find and plot the time-courses of concentrations, and to find simple short-time and long-time approximations to these time-courses; that is, to separate the fast and the slow processes. The most significant computation involved is the exponentiation of many small matrices, which is easily accomplished in the computing environment R.

Introduction

In the modelling of chemical and other reaction systems, there are essentially two alternatives: deterministic models and stochastic (i.e. probabilistic) ones. The former type enables one to draw conclusions about (e.g.) concentrations from a system of ordinary differential equations, the rate equations; the latter typically seeks to answer similar questions by finding transition probabilities in a continuous-time Markov chain representing the reactions.

To many chemists and other physical scientists, deterministic models and their associated differential equations are probably the more familiar approach, but that is not necessarily the simpler route to a useful conclusion. Similarly, those with a statistical background would tend to choose stochastic models. The books of Érdi and Tóth [4] and Érdi and Lente [3] cover, respectively, both deterministic and stochastic models, and stochastic models only.

One sometimes sees authors writing as if the system of rate equations is the system of chemical reactions, as opposed to one very useful representation thereof; we prefer to distinguish the two systems. For example, there are literature references to Robertson [12] which cite that work as the source of the ‘Robertson reaction’. There are no reactions at all in [12], but that work does discuss a system of differential equations (DEs) that could very well arise as the rate equations implied by a system of chemical reactions. Furthermore, it is in our view useful to have at one’s disposal both deterministic and stochastic modelling. There may be insights readily available from one of the techniques that are not as readily available from the other. See, for instance, MacDonald [10, pp. 3–4], in which continuous-time Markov chains are used (instead of the rate equations) as a simple route to clear-cut conclusions about a unimolecular triangle reaction system.

Alvarez-Ramirez et al. [1] use the rate equations, the singular value decomposition (SVD) of a certain matrix arising in those equations, and singular perturbation theory in order to separate the fast and slow processes in a reaction system. We present here an alternative formulation of their worked examples which demonstrates how continuous-time Markov chains (CTMCs) can instead be used to study the time-courses of the concentrations of the species involved, and thereby draw conclusions about the fast and slow processes in the system. It is to some extent possible to derive implied fast or slow manifolds, but examination of these time-courses is also informative, and apparently easier to carry out.

Our approach is essentially that described by MacDonald [10]: states are defined, the generator matrix of the (time-homogeneous) CTMC representing the system is identified, and the generator is then exponentiated in order to find the transition probabilities, from which one can deduce the vector of concentrations at a given time from the initial concentrations. The time-courses of the concentrations can thereby be computed and plotted.

Kinetics of three-lump hydrocracking

The reaction system

The first example of Alvarez-Ramirez et al. [1] relates to the kinetics of three-lump hydrocracking (HDC). The system, consisting of three monomolecular reactions, is as follows:

$$\begin{aligned} \begin{array}{lcr} \mathrm R&{} \overset{k_1}{\longrightarrow }\ &{} \mathrm L\\ \mathrm L&{} \overset{k_3}{\longrightarrow }\ &{} \mathrm G\\ \mathrm R&{} \overset{k_2}{\longrightarrow }\ &{} \mathrm G, \end{array} \end{aligned}$$

and can be depicted as in Fig. 1.

The symbol R denotes heavy oil residue, and L and G are the liquid and the gas products. (In passing: the use of $k_3$, not $k_2$, in the second reaction above is intentional.)

As noted by Alvarez-Ramirez et al., the system of ordinary differential equations (ODEs) representing the reaction system is:

$$\begin{aligned} \frac{\textrm{d}C_R(t)}{\textrm{d}t}= & {} -(k_1+k_2) C_R(t) \\ \frac{\textrm{d}C_L(t)}{\textrm{d}t}= & {} k_1 C_R(t) - k_3 C_L(t) \\ \frac{\textrm{d}C_G(t)}{\textrm{d}t}= & {} k_2 C_R(t) + k_3 C_L(t). \end{aligned} \tag{1}$$

The linearity and triangular structure of Eqs. (1) make them easy to solve explicitly (see Footnote 1), but we take a slightly different route.

In view of the explicit solution, the approximations based on the SVD and singular perturbation may seem redundant, but the implications of such approximations can if necessary be compared to those of the exact solution. In the example presented by Alvarez-Ramirez et al., the values of the rate constants (per hour) are: $k_1= 0.2787$, $k_2=0.0361$, $k_3 = 0.0101$; these values arise in the work of Soto-Azuara et al. [13].
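As a numerical cross-check on the rate equations (1), they can also be integrated directly. The following is a minimal sketch only, assuming a fixed-step classical fourth-order Runge-Kutta scheme. The function names `rate_rhs` and `integrate_rk4`, the step count and the initial condition $(1,0,0)$ are illustrative choices made here, not part of the work cited.

```python
import math

# Rate constants (per hour) quoted in the text for three-lump hydrocracking.
K1, K2, K3 = 0.2787, 0.0361, 0.0101


def rate_rhs(c):
    """Right-hand side of the rate equations (1) for c = (C_R, C_L, C_G)."""
    c_r, c_l, _ = c
    return (-(K1 + K2) * c_r,
            K1 * c_r - K3 * c_l,
            K2 * c_r + K3 * c_l)


def integrate_rk4(c0, t_end, n_steps=1000):
    """Integrate (1) from time 0 to t_end with fixed-step classical RK4."""
    h = t_end / n_steps
    c = tuple(c0)
    for _ in range(n_steps):
        f1 = rate_rhs(c)
        f2 = rate_rhs(tuple(x + 0.5 * h * f for x, f in zip(c, f1)))
        f3 = rate_rhs(tuple(x + 0.5 * h * f for x, f in zip(c, f2)))
        f4 = rate_rhs(tuple(x + h * f for x, f in zip(c, f3)))
        c = tuple(x + h * (a + 2 * b + 2 * d + e) / 6
                  for x, a, b, d, e in zip(c, f1, f2, f3, f4))
    return c


# Assumed initial condition: all material starts as residue R.
c_10h = integrate_rk4((1.0, 0.0, 0.0), t_end=10.0)
print("C_R, C_L, C_G at t = 10 h:", c_10h)
print("exact C_R at t = 10 h:    ", math.exp(-(K1 + K2) * 10.0))
```

The first equation of (1) decouples from the others, so $C_R(t)=C_R(0)e^{-(k_1+k_2)t}$ gives an immediate check on the integrator, and the three concentrations should continue to sum to the initial total.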

The Markov-chain model

With the states being R, L and G, the generator of the associated CTMC is

$$\begin{aligned} \varvec{Q} = \begin{pmatrix} -(k_1+k_2) &{} k_1&{} k_2\\ 0 &{}-k_3 &{} k_3 \\ 0 &{} 0 &{} 0 \end{pmatrix}. \end{aligned}$$

(We use the convention that the row sums of such a matrix are zero.) The last row reflects the fact that G is an absorbing state. Since $\varvec{Q}$ is upper-triangular, its eigenvalues are $-(k_1+k_2)$, $-k_3$ and 0. Apart from certain obvious special cases (which we exclude here), these eigenvalues are distinct and the generator therefore diagonalizable. A matrix of right (i.e. column) eigenvectors, with the same ordering, is

$$\begin{aligned} \varvec{U} = \begin{pmatrix} 1 &{} k_1 &{} 1\\ 0 &{} (k_1+k_2-k_3) &{} 1\\ 0 &{} 0 &{} 1 \end{pmatrix}, \end{aligned}$$

the inverse of which can be found explicitly:

$$\begin{aligned} \varvec{U}^{-1} = \frac{1}{k_1+k_2-k_3} \begin{pmatrix} (k_1+k_2-k_3)\ \ &{} -k_1\ \ &{} -(k_2-k_3)\\ 0 &{} 1 &{} -1\\ 0 &{} 0 &{} (k_1+k_2-k_3) \end{pmatrix}. \end{aligned}$$

One can therefore find the required transition probability matrix (TPM) $\varvec{P}(t) = \exp (t\varvec{Q})$ explicitly as

$$\begin{aligned} \varvec{P}(t) = \varvec{U} \varvec{D}_t \varvec{U}^{-1}, \end{aligned}$$

where $\varvec{D}_t$ is the diagonal matrix with diagonal $(e^{-(k_1+k_2)t}, e^{-k_3 t}, 1 )$. In any case, it follows that each transition probability $p_{ij}(t)$ is of the form

$$\begin{aligned} a e^{-(k_1+k_2)t} + b e^{-k_3 t} +c, \end{aligned}$$

where a, b and c are independent of t. For instance,

$$\begin{aligned} p_{11}(t)=\, & {} e^{-(k_1+k_2)t}, \\ p_{12}(t)=\, & {} k_1 (e^{-k_3t} - e^{-(k_1+k_2)t})/(k_1+k_2-k_3), \\ p_{13}(t)=\, & {} 1 - p_{11}(t) - p_{12}(t), \end{aligned} \tag{2}$$

the second of which peaks at time $t= \frac{1}{k_1+k_2-k_3} \log \left( \frac{k_1+k_2}{k_3} \right)$.
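The expressions (2) and the peak time are simple enough to evaluate directly. The sketch below does so for the rate constants quoted earlier; the function names `transition_probs_from_R` and `peak_time_p12` are hypothetical, and the code assumes $k_1+k_2 \ne k_3$ and $k_3>0$.

```python
import math

# Rate constants (per hour) quoted in the text for three-lump hydrocracking.
K1, K2, K3 = 0.2787, 0.0361, 0.0101


def transition_probs_from_R(t, k1=K1, k2=K2, k3=K3):
    """Return (p11, p12, p13) at time t from Eq. (2); assumes k1 + k2 != k3."""
    d = k1 + k2 - k3
    p11 = math.exp(-(k1 + k2) * t)
    p12 = k1 * (math.exp(-k3 * t) - p11) / d
    return p11, p12, 1.0 - p11 - p12


def peak_time_p12(k1=K1, k2=K2, k3=K3):
    """Time at which p12 is largest; assumes k3 > 0 and k1 + k2 != k3."""
    return math.log((k1 + k2) / k3) / (k1 + k2 - k3)


t_peak = peak_time_p12()
print("peak time of p12 (hours):", t_peak)
print("(p11, p12, p13) at the peak:", transition_probs_from_R(t_peak))
```

With these constants the liquid fraction peaks a little after 11 hours, after which it declines slowly at rate $k_3$.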

The (relative) concentrations at time t, adding to 1 and corresponding to the three states, are supplied by the row vector

$$\begin{aligned} \varvec{u}(t) = \varvec{u}(0)\varvec{P}(t), \end{aligned}$$

where $\varvec{P}(t)$ is given either by the explicit expression derived above, or numerically by $\varvec{P}(t)=\exp (t\varvec{Q}),$ the matrix exponential being evaluated by (e.g.) the excellent R routine expm [5]. (If a matrix is diagonalizable, it is easily exponentiated; expm does not assume diagonalizability.) For ease of exposition we shall assume here that $\varvec{u} (0) = (1,0,0)$, but that assumption is inessential. The concentrations at time t are then

$$\begin{aligned} (u_1(t), u_2(t), u_3(t))= (1,0,0) \varvec{P}(t)= (p_{11}(t), p_{12}(t), p_{13}(t)), \end{aligned}$$

with the transition probabilities $p_{1j}(t)$ as given by (2). Differences or sums of concentrations might well be of interest, and are of course easily deduced.
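The numerical route can be sketched without any external library. The code below is a stand-in for illustration, not the algorithm used by expm in R: it assumes a truncated Taylor series combined with scaling and squaring, which is adequate for small, well-scaled generators such as this one. The names `mat_mul`, `mat_exp` and `concentrations` are hypothetical.

```python
import math

# Rate constants (per hour) quoted in the text for three-lump hydrocracking.
K1, K2, K3 = 0.2787, 0.0361, 0.0101

# Generator with states ordered R, L, G; each row sums to zero.
Q = [[-(K1 + K2), K1, K2],
     [0.0, -K3, K3],
     [0.0, 0.0, 0.0]]


def mat_mul(a, b):
    """Product of two square matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def mat_exp(a, n_terms=20):
    """Matrix exponential by scaling and squaring with a Taylor series."""
    n = len(a)
    norm = max(sum(abs(x) for x in row) for row in a)
    squarings = 0
    while norm > 0.5:          # halve until the series converges quickly
        norm /= 2.0
        squarings += 1
    scaled = [[x / 2.0 ** squarings for x in row] for row in a]
    identity = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    result = [row[:] for row in identity]
    term = [row[:] for row in identity]
    for k in range(1, n_terms + 1):
        term = [[x / k for x in row] for row in mat_mul(term, scaled)]
        result = [[r + s for r, s in zip(r_row, t_row)]
                  for r_row, t_row in zip(result, term)]
    for _ in range(squarings):  # undo the scaling: exp(A) = exp(A/2^s)^(2^s)
        result = mat_mul(result, result)
    return result


def concentrations(u0, t, generator=Q):
    """Row vector u(t) = u(0) P(t), with P(t) = exp(t Q)."""
    p_t = mat_exp([[t * q for q in row] for row in generator])
    return [sum(u * p for u, p in zip(u0, col)) for col in zip(*p_t)]


u_10 = concentrations([1.0, 0.0, 0.0], 10.0)
print("u(10) from the matrix exponential:", u_10)
```

Evaluating `concentrations` on a grid of times gives the time-courses to be plotted, and the result can be compared against (2) whenever the eigenvalues are distinct.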

Fast and slow processes

Since we have simple explicit expressions for the concentrations, we can directly investigate their approximate behaviour for small t and for large t, and thereby possibly find fast and slow ‘manifolds’. Firstly, note that, if $(k_1+k_2)t$ and $k_3 t$ are sufficiently small,

$$\begin{aligned} 1-u_1(t) \approx (k_1+k_2)t \end{aligned}$$

and

$$\begin{aligned} u_2(t) \approx \frac{k_1}{k_1+k_2-k_3} \left( -k_3t + (k_1+k_2)t \right) = k_1 t. \end{aligned}$$

Hence

$$\begin{aligned} u_2(t) / (1-u_1(t)) \approx \frac{k_1}{k_1+k_2} = 0.8853. \end{aligned}$$

This identifies a fast manifold which is almost exactly in agreement with that given by Eq. 21 of Alvarez-Ramirez et al.:

$$\begin{aligned} 0 = 0.8859(C_{R,0} -C_R) -C_L. \end{aligned}$$

It is obviously less in agreement with their Eq. 33:

$$\begin{aligned} 0 = 0.9147(C_{R,0} -C_R) -C_L. \end{aligned}$$

If one makes the assumption (only) that $k_3=0$, then it follows that (exactly)

$$\begin{aligned} u_2(t) / (1-u_1(t)) = \frac{k_1}{k_1+k_2} = 0.8853; \end{aligned}$$

so the fast manifold discussed above is an exact consequence of the assumption that $k_3=0$.
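The small-$t$ approximation and the exact $k_3=0$ result can both be checked numerically. The sketch below evaluates the ratio $u_2(t)/(1-u_1(t))$ from (2) for $\varvec{u}(0)=(1,0,0)$; the function name `manifold_ratio` and the chosen time points are illustrative assumptions. `math.expm1` is used to avoid cancellation at small $t$.

```python
import math

# Rate constants (per hour) quoted in the text for three-lump hydrocracking.
K1, K2, K3 = 0.2787, 0.0361, 0.0101


def manifold_ratio(t, k1=K1, k2=K2, k3=K3):
    """u_2(t) / (1 - u_1(t)) for u(0) = (1, 0, 0); needs t > 0, k1 + k2 != k3."""
    s = k1 + k2
    one_minus_u1 = -math.expm1(-s * t)                  # 1 - exp(-s t)
    # exp(-k3 t) - exp(-s t), written with expm1 for accuracy at small t
    u2 = k1 * (math.expm1(-k3 * t) - math.expm1(-s * t)) / (s - k3)
    return u2 / one_minus_u1


print("k1 / (k1 + k2) =", K1 / (K1 + K2))
for t in (0.01, 0.1, 1.0, 5.0):
    print(t, manifold_ratio(t), manifold_ratio(t, k3=0.0))
```

For the quoted constants the ratio starts close to $k_1/(k_1+k_2)$ and drifts downward as $t$ grows, whereas with $k_3$ set to zero it stays at $k_1/(k_1+k_2)$ for every $t>0$.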

Secondly, note that, if $(k_1+k_2)t$ is sufficiently large, $u_1(t) \approx 0$ and so $u_2(t) + u_3(t) \approx 1$, which identifies a slow manifold. But the search for fast and slow manifolds does not seem to add much information if it is easily possible to plot $\varvec{u}(t)$ for a given generator $\varvec{Q}$ and initial distribution $\varvec{u}(0)$.

If (as in this example) $k_3$ is less than $k_2$ and very much less than $k_1$, it is clear that conversion to gas is slow, as seen in Fig. 2, which can be compared with Fig. 1 of Alvarez-Ramirez et al.

Generator lacking distinct eigenvalues

It has been suggested that we should not confine ourselves here to the case of the generator $\varvec{Q}$ having distinct eigenvalues. As pointed out by Hirsch et al. [6, p. 100], ‘most’ matrices have distinct eigenvalues. Hence ‘most’ (square) matrices are diagonalizable. More precisely, the set of real $n\times n$ matrices that have n distinct eigenvalues is (open and) dense in the set of all real $n\times n$ matrices [6, p. 101]. The case of $\varvec{Q}$ being deficient in that respect is therefore of theoretical rather than practical significance. To proceed with a theoretical analysis, one would need to write the generator in Jordan form or Schur triangular form; for the latter, see Meyer [11, p. 524]. But there would seem to be little to be gained by doing so. If such a deficient generator $\varvec{Q}$ were to turn out to be of interest, it could of course be nondiagonalizable, but that would not prevent one from exponentiating $t\varvec{Q}$ with appropriate software, e.g. expm in R.
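As a brief illustration of such a deficient case, suppose (hypothetically; the symbols $\kappa$ and $\delta$ are introduced here only for this purpose) that in the three-lump generator $k_1+k_2=k_3=\kappa >0$ with $k_1>0$. Then $-\kappa$ is a repeated eigenvalue with only one linearly independent eigenvector, so $\varvec{Q}$ is not diagonalizable. The transition probabilities can nevertheless be obtained by writing $\delta = k_1+k_2-k_3$ and letting $\delta \rightarrow 0$ in (2):

$$\begin{aligned} p_{11}(t)=\, & {} e^{-\kappa t}, \\ p_{12}(t)=\, & {} \lim _{\delta \rightarrow 0} k_1 e^{-k_3 t}\, \frac{1-e^{-\delta t}}{\delta } = k_1 t\, e^{-\kappa t}, \\ p_{13}(t)=\, & {} 1 - p_{11}(t) - p_{12}(t). \end{aligned}$$

The factor $t$ multiplying the exponential is what replaces the difference of two exponentials when the eigenvalues coincide. In this case $p_{12}(t)$ peaks at $t=1/\kappa$, which is also the limit of the peak time given earlier as $\delta \rightarrow 0$.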

Kinetics of four-lump hydrocracking

In addition to three-lump hydrocracking, Alvarez-Ramirez et al. also discuss, more briefly, the four-lump HDC model of Soto-Azuara et al. [13]. Here the four states are (in order) R, HLP (heavy liquid product), LLP (light liquid product) and G, and the generator of our CTMC model is

$$\begin{aligned} \varvec{Q} = \begin{pmatrix} - (k_1+k_2+k_3) &{} k_1&{} k_2 &{} k_3\\ 0 &{}-(k_4+k_5) &{} k_4 &{} k_5\\ 0 &{} 0 &{} -k_6 &{} k_6 \\ 0 &{} 0 &{} 0 &{} 0 \end{pmatrix}. \end{aligned}$$

Its eigenvalues are $-(k_1+k_2+k_3)$, $-(k_4+k_5)$, $-k_6$ and 0. Again, apart from certain special cases the eigenvalues are distinct and the generator therefore diagonalizable. But in the example considered, one of those special cases does indeed arise, since $k_4=k_5=0$. Zero is then a repeated eigenvalue, and states 2 (HLP) and 4 (G) are both absorbing states. Alvarez-Ramirez et al. describe the resulting model as ‘physically meaningless’, but it seems worthwhile to explore the consequences, for the CTMC, of the assumption that $k_4=k_5=0$.

The full set of rate constants considered by Soto-Azuara et al. and Alvarez-Ramirez et al. is: $\varvec{k} =(0.2323, 0.0487, 0.0336, 0, 0, 0.1244)$. The two absorbing states give rise to an infinity of equilibrium distributions (all convex linear combinations of the two row vectors (0, 1, 0, 0) and (0, 0, 0, 1)). But from the structure of the resulting network it seems plausible that, for a given initial distribution $\varvec{u}(0)$, there is a unique limiting distribution.

That conjecture can be proved as follows. In spite of the repeated eigenvalue (zero) the generator $\varvec{Q}$ can be diagonalized: two linearly independent right eigenvectors, both corresponding to the eigenvalue zero, are $(k_1/(k_1+k_2+k_3), 1,0,0)^{\textrm{T}}$ and $(1,1,1,1)^{\textrm{T}}$. More simply, perhaps: two linearly independent left eigenvectors, corresponding to the eigenvalue zero, are (0, 1, 0, 0) and (0, 0, 0, 1). Hence, for some nonsingular matrix $\varvec{U}$,

$$\begin{aligned} \varvec{P}(t)= & {} \varvec{U} \text{ diag }(e^{-(k_1+k_2+k_3)t}, 1, e^{-k_6 t}, 1) \varvec{U}^{-1}\\\overset{t \rightarrow \infty }{\longrightarrow } & {} \varvec{U} \text{ diag }(0, 1, 0, 1) \varvec{U}^{-1}, \end{aligned}$$

provided that $k_1+k_2+k_3$ and $k_6$ are strictly positive. It then follows that

$$\begin{aligned} \varvec{u}(t) = \varvec{u}(0) \varvec{P}(t) \overset{t \rightarrow \infty }{\longrightarrow }\ \varvec{u}(0) \varvec{U} \text{ diag }(0, 1, 0, 1) \varvec{U}^{-1}. \end{aligned}$$

The model may indeed be physically meaningless, in that it precludes conversion of HLP to LLP or G, but its properties are not difficult to establish. Fig. 3 displays the time-courses of the concentrations of the four species corresponding to $\varvec{k}$ as given above and initial concentrations $\varvec{u}(0) = (1,0,0,0)$.
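The limiting matrix can also be written down from absorption probabilities, which gives a quick consistency check. The sketch below is an illustration under stated assumptions rather than the authors' computation: the matrix `P_INF` is a candidate limit obtained here by first-step reasoning (from R the chain is eventually absorbed in HLP with probability $k_1/(k_1+k_2+k_3)$, and otherwise in G), and the names `P_INF`, `mat_mul`, `max_abs` and `limiting_distribution` are hypothetical. The checks confirm that `P_INF` is idempotent and annihilates $\varvec{Q}$ on both sides, as a projection onto the eigenvalue-zero eigenspace should.

```python
# Rate constants k1..k6 (per hour) quoted in the text; here k4 = k5 = 0.
k1, k2, k3, k4, k5, k6 = 0.2323, 0.0487, 0.0336, 0.0, 0.0, 0.1244
s = k1 + k2 + k3

# Generator with states ordered R, HLP, LLP, G.
Q = [[-s, k1, k2, k3],
     [0.0, -(k4 + k5), k4, k5],
     [0.0, 0.0, -k6, k6],
     [0.0, 0.0, 0.0, 0.0]]

# Candidate limit of P(t) as t grows (an assumption checked below):
# row i holds the absorption probabilities starting from state i.
P_INF = [[0.0, k1 / s, 0.0, (k2 + k3) / s],
         [0.0, 1.0, 0.0, 0.0],
         [0.0, 0.0, 0.0, 1.0],
         [0.0, 0.0, 0.0, 1.0]]


def mat_mul(a, b):
    """Product of two square matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def max_abs(a):
    """Largest absolute entry of a matrix."""
    return max(abs(x) for row in a for x in row)


def limiting_distribution(u0):
    """Row vector u(0) P_INF for a given initial distribution u(0)."""
    return [sum(u * p for u, p in zip(u0, col)) for col in zip(*P_INF)]


# Consistency checks: Q P_INF = 0, P_INF Q = 0 and P_INF P_INF = P_INF.
print("max |Q P_INF|:", max_abs(mat_mul(Q, P_INF)))
print("max |P_INF Q|:", max_abs(mat_mul(P_INF, Q)))
idem = mat_mul(P_INF, P_INF)
print("max |P_INF^2 - P_INF|:",
      max_abs([[x - y for x, y in zip(r1, r2)] for r1, r2 in zip(idem, P_INF)]))

limit_from_R = limiting_distribution([1.0, 0.0, 0.0, 0.0])
print("limit of u(t) for u(0) = (1, 0, 0, 0):", limit_from_R)
```

Different initial distributions give different limits (for example, starting entirely in LLP leads entirely to G), which is consistent with the limit being unique for each given $\varvec{u}(0)$ while the set of equilibrium distributions is infinite.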