Statistics of Nascent and Mature RNA Fluctuations in a Stochastic Model of Transcriptional Initiation, Elongation, Pausing, and Termination

Filatova, Tatiana; Popovic, Nikola; Grima, Ramon

doi:10.1007/s11538-020-00827-7

Original Article
Open access
Published: 22 December 2020

Volume 83, article number 3, (2021)

Bulletin of Mathematical Biology

Abstract

Recent advances in fluorescence microscopy have made it possible to measure the fluctuations of nascent (actively transcribed) RNA. These closely reflect transcription kinetics, as opposed to conventional measurements of mature (cellular) RNA, whose kinetics is affected by additional processes downstream of transcription. Here, we formulate a stochastic model which describes promoter switching, initiation, elongation, premature detachment, pausing, and termination while being analytically tractable. We derive exact closed-form expressions for the mean and variance of nascent RNA fluctuations on gene segments, as well as of total nascent RNA on a gene. We also obtain exact expressions for the first two moments of mature RNA fluctuations and approximate distributions for total numbers of nascent and mature RNA. Our results, which are verified by stochastic simulation, uncover the explicit dependence of the statistics of both types of RNA on transcriptional parameters and potentially provide a means to estimate parameter values from experimental data.

1 Introduction

Transcription, the production of RNA from a gene, is an inherently stochastic process. Specifically, the interval of time between two successive transcription events is a random variable whose statistics depend on multiple single-molecule events behind transcription (Sanchez and Golding 2013). When the distribution of this random variable is exponential, we say that expression is constitutive; in that case, the number of transcripts produced in a certain interval of time follows a Poisson distribution. On the other hand, when the distribution of times between two successive transcripts is non-exponential, then the number of transcripts is non-Poissonian. A special case of such non-constitutive behaviour is bursty expression, whereby transcripts are produced in short bursts that are separated by long silent intervals (Suter et al. 2011; Halpern et al. 2015). In yeast, genes whose expression is constitutive include MDN1, KAP104, and DOA1, whereas PDR5 is an example of a gene whose expression is bursty (Zenklusen et al. 2008).

For two decades, mathematical models of gene expression have been developed to predict the distribution of RNA abundance. By matching the theoretical distribution with experimental measurements from microscopy-based methods (Raj et al. 2008), one hopes to obtain insight into the underlying kinetics of transcription and to estimate transcriptional parameters. The standard model of gene expression which has been used for these analyses is the telegraph model (Peccoud and Ycart 1995), whereby a gene can be in two states. Transcription occurs in one of the states, whereupon RNA degrades; first-order kinetics is assumed for all processes. While the distribution obtained from the telegraph model can typically fit cellular RNA abundance data, there are innate difficulties with the interpretation of that fit: fluctuations in cellular RNA numbers and, hence, the shape of the experimental RNA distribution do not only reflect transcription, but also many processes downstream thereof, such as splicing, RNA degradation, and partitioning during cell division.

To counteract these difficulties, in the past few years, mathematical models (Choubey et al. 2015; Choubey 2018; Heng et al. 2016; Cao and Grima 2020) have been developed to predict the statistics of nascent RNA, i.e. of RNA in the process of being synthesised by the RNA polymerase molecule (RNAP), which can be visualised and quantified due to recent advances in fluorescence microscopy (Lenstra et al. 2016; Skinner et al. 2016; Larson et al. 2011; Antoine et al. 2014; Brouwer and Lenstra 2019). In contrast to cellular RNA, the statistics of nascent RNA is a direct reflection of the transcription process; hence, these models can potentially give more insight than the simpler, but cruder telegraph model. Choubey and collaborators (Choubey et al. 2015; Choubey 2018) have developed a stochastic model with the following properties: (i) a gene can be in two states (active or inactive); (ii) from the active state, transcription initiation occurs in two sequential steps: the pre-initiation complex is formed, after which the RNA polymerase escapes the promoter; (iii) once on the gene, the polymerase moves from one base pair to the next (with some probability) until the end of the gene is reached, when transcription is terminated and polymerase detaches. Queuing theory is used to derive analytical expressions for the transient and steady-state means and variances of numbers of RNAP that are attached to the gene in the long-gene limit when the elongation time is practically deterministic. Heng et al. (2016) have considered a coarse-grained version of that model, whereby the movement of RNAP from one base pair to the next is not explicitly modelled, obtaining an analytical expression for the total RNAP distribution in steady-state conditions. 
More recently, Cao and Grima (2020) have studied a model of eukaryotic gene expression that yields approximate time-dependent distributions of both nascent and cellular RNA abundance as a function of the parameters controlling gene switching, DNA duplication, partitioning at cell division, gene dosage compensation, and RNA degradation; in their coarse-grained model, the movement of RNAP is not explicitly modelled, while the elongation time is assumed to be exponentially distributed, which simplifies the requisite analysis.

The complexity of nascent RNA models has thus far not allowed the same detailed level of analysis as has been possible with the much simpler telegraph model. A few shortcomings of current models can be summarised as follows: (i) distributions of nascent RNA have been derived from models that do not explicitly model the movement of RNAP along a gene (Heng et al. 2016; Cao and Grima 2020), resulting in a disconnect between theoretical description and the microscopic processes underlying transcription; (ii) while the analysis of single-cell sequencing data and electron micrograph data yields the positions of individual polymerases along the gene, allowing for the calculation of statistics (means and variances) of the numbers of RNAP on gene segments that are obtained after binning, detailed models of RNAP elongation (Choubey et al. 2015; Choubey 2018) provide analytical results only for total RNAP on a gene and hence cannot be used to understand gene segment data; (iii) analytical calculations of the statistics of nascent RNA ignore important details of the transcription process such as pausing, traffic jams, backtracking, and premature termination, some of which have to date been explored via stochastic simulation (Klumpp and Hwa 2008; Rajala et al. 2010; Choubey et al. 2015; Rodriguez et al. 2019; Md Zulfikar et al. 2020).

In this paper, we overcome some of the aforementioned shortcomings of analytically tractable models for the transcription process. In Sect. 2, we study a stochastic model for promoter switching and the stochastic movement of RNAP along a gene, allowing for premature termination. We derive exact closed-form expressions for the first and second moments (means and variances) of local RNAP fluctuations on gene segments of arbitrary length, which allows us to study how these statistics vary along a gene as a function of transcriptional parameters; we also obtain expressions for the mean and variance of the total RNAP on the gene which generalise previous work by Choubey et al. (2015). In Sect. 3, we investigate approximations for the distributions of total RNAP and mature RNA, showing in particular that Negative Binomial distributions can provide an accurate approximation in certain biologically meaningful limits. In Sect. 4, we illustrate the difference between the statistics of local and total RNAP fluctuations and those of light fluorescence due to tagged nascent RNA. In Sect. 5, we extend our model to include pausing by deriving approximate expressions for the mean, variance, and distribution of observables. We conclude with a discussion of our results in Sect. 6.

2 Detailed Stochastic Model of Transcription: Set-up and Analysis

In this section, we specify the stochastic model studied here; then, we derive closed-form expressions for the moments of mature RNA and of local and total RNAP fluctuations in various parameter regimes.

2.1 Set-up of Model

We consider a stochastic model of transcription that includes the processes of initiation, elongation, and termination, as illustrated in Fig. 1. For simplicity, we divide the gene into L segments; the RNAP on gene segment i is then denoted by $P_i$. The promoter can be either in the inactive state ($G_{\mathrm{off}}$) or the active state ($G_{\mathrm{on}}$), switching from the inactive state to the active one with rate $s_u$ and from the active state to the inactive one with rate $s_b$. When the promoter is active, initiation commences via the binding of an RNAP to the first gene segment with rate r; the bound RNAP is denoted by $P_1$. Subsequently, the RNAP either moves from a gene segment to the neighbouring segment with rate k, or it prematurely detaches with rate d. Note that here we have made two assumptions: (i) the movement of RNAP is unidirectional, away from the promoter site and hence left to right, with no pausing or backtracking allowed; (ii) the detachment and elongation rates are independent of the position of RNAP on the gene. Each RNAP has associated with it a nascent RNA tail that grows longer as the RNAP transcribes more of the gene. When the RNAP reaches the last gene segment, termination occurs, i.e. the RNAP–nascent RNA complex dissociates from the gene, yielding a mature RNA (M) which degrades with rate $d_m$. Note that for simplicity, we have not considered excluded-volume interaction between adjacent RNAPs here; hence, we make the implicit assumption of low ‘traffic’, which is plausible when the initiation rate is sufficiently low. (We test the validity of this assumption through simulations below.)

Note that, while the choice of L is arbitrary, it should be kept in mind that L needs to be sufficiently large for the dynamics to be described at a fine spatial resolution. However, L also has to be small enough for the length of each gene segment to be much larger than the footprint of an RNAP; the latter is needed to ensure the validity of the low-traffic assumption. The elongation time T, i.e. the total time from initiation to termination conditioned on those realisations in which the RNAP does not prematurely detach, is Erlang distributed with mean $L/(k+d)$ and coefficient of variation $1/\sqrt{L}$; see ‘Appendix A’ for a derivation and Fig. 1b, c for verification through stochastic simulation (SSA).
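The Erlang statement can be checked numerically. The following is a minimal sketch, not the simulation used for Fig. 1: on each segment it draws competing exponential waiting times for elongation (rate k) and detachment (rate d), and it keeps only those realisations in which the RNAP reaches the end of the gene. The function name and all parameter values are illustrative assumptions.

```python
import math
import random

def sample_elongation_times(L, k, d, attempts, seed=0):
    """Return elongation times of RNAPs that traverse all L segments."""
    rng = random.Random(seed)
    times = []
    for _ in range(attempts):
        t = 0.0
        for _ in range(L):
            t_move = rng.expovariate(k)                           # waiting time to hop
            t_detach = rng.expovariate(d) if d > 0 else math.inf  # waiting time to detach
            if t_detach < t_move:
                break              # premature detachment: discard this realisation
            t += t_move
        else:
            times.append(t)        # all L hops completed without detachment
    return times

L, k, d = 30, 1.0, 0.02            # hypothetical values, not taken from the paper
times = sample_elongation_times(L, k, d, attempts=40000)
mean = sum(times) / len(times)
var = sum((t - mean) ** 2 for t in times) / (len(times) - 1)
cv = math.sqrt(var) / mean
print(f"mean {mean:.2f} vs L/(k+d) = {L / (k + d):.2f}")
print(f"CV   {cv:.3f} vs 1/sqrt(L) = {1 / math.sqrt(L):.3f}")
```

Conditioning on successful traversal is what turns the per-segment waiting time into an exponential variable with rate $k+d$ rather than k.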

Note that the total number of RNAPs transcribing the gene is equal to the number of nascent RNA molecules present, irrespective of their lengths; to shed light on the fluctuations of nascent RNA, in this section we therefore focus on the calculation of statistics of local and total RNAP fluctuations. We define the vector of molecule numbers $\vec {m}=(n_0,n_1,\dots {},n_L,n)$, and we write $\langle n_0 \rangle $, $\langle n_i \rangle $ ($i=1,2,\dots {},L$), and $\langle n \rangle $ for the average numbers of molecules of active gene, RNAP, and mature RNA, respectively. The above model can then be conveniently described by $L+2$ species interacting via a set of $2L+4$ reactions with the following rate functions:

Species	Molecule numbers	Position (in $\vec {m}$)
$G_\mathrm{on}$	$n_0$	1
$P_i,\quad i\in {}\{1,\dots {},L\}$	$n_i$	$i+1$
M	n	$L+2$

Reaction	Rate function $f_j$
$G_\mathrm{on} {\mathop {\longrightarrow }\limits ^{s_b}}G_\mathrm{off}$	$f_1=s_b \langle n_0 \rangle $
$G_\mathrm{off}{\mathop {\longrightarrow }\limits ^{s_u}}G_\mathrm{on}$	$f_2=s_u (1 - \langle n_0 \rangle )$
$G_\mathrm{on} {\mathop {\longrightarrow }\limits ^{r}}G_\mathrm{on}+P_1$	$f_3=r\langle n_0 \rangle $
$P_i {\mathop {\longrightarrow }\limits ^{k}}P_{i+1}$, $i\in {}\{1,\dots {},L-1\}$	$f_{i+3}=k \langle n_i \rangle $
$P_L {\mathop {\longrightarrow }\limits ^{k}}M$	$f_{L+3}=k \langle n_L \rangle $
$P_i {\mathop {\longrightarrow }\limits ^{d}}\varnothing $, $i\in {}\{1,\dots {},L\}$	$f_{i+L+3}=d \langle n_i \rangle $
$M {\mathop {\longrightarrow }\limits ^{d_m}}\varnothing $	$f_{2L+4}=d_m \langle n \rangle $

Note that $G_\mathrm{off}$ is not an independent species; the reason is that the binary state of the gene implies a conservation law, with the sum of the numbers of $G_\mathrm{on}$ and $G_\mathrm{off}$ equalling 1. Hence, the number of independent species in the model is $L+2$. The rate functions $f_j$ are the averaged propensities from the underlying chemical master equation (CME); note that, because our reaction network is composed of first-order reactions, these rate functions also equal the reaction rates in the corresponding deterministic rate equations. The description of our model is completed by the $(L+2)\times {}(2L+4)$-dimensional stoichiometric matrix $\mathbf {S}$; the element $\mathbf {S}_{ij}$ of $\mathbf {S}$ gives the net change in the number of molecules of the ith species when the jth reaction occurs. Given the ordering of species and reactions as described in the tables above, it follows that the matrix $\mathbf {S}$ has the simple form

$$\begin{aligned} \begin{aligned} \mathbf {S}_{11}&=-1,&\qquad {} \mathbf {S}_{12}&=1,\\ \mathbf {S}_{i,i+1}&=1,&\qquad {} \mathbf {S}_{i,i+2}&=-1,&\qquad {} \mathbf {S}_{i,i+L+2}&=-1, \\ \mathbf {S}_{L+2,L+3}&=1,&\qquad {} \mathbf {S}_{L+2,2L+4}&=-1, \end{aligned} \end{aligned}$$

(1)

where $i=2,\dots {},L+1$.
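As an illustration of the index conventions in Eq. (1), the sketch below assembles $\mathbf {S}$ for a given L. It assumes the species and reaction ordering of the two tables above; the function name is hypothetical, and the only subtlety is the shift from the 1-based indices of Eq. (1) to 0-based Python indices.

```python
def stoichiometric_matrix(L):
    """Return the (L+2) x (2L+4) matrix S of Eq. (1) as a list of rows."""
    S = [[0] * (2 * L + 4) for _ in range(L + 2)]
    S[0][0], S[0][1] = -1, 1            # S_11 and S_12: promoter switching
    for i in range(2, L + 2):           # 1-based rows i = 2, ..., L+1 (P_1, ..., P_L)
        S[i - 1][i] = 1                 # S_{i,i+1}: RNAP arrives on the segment
        S[i - 1][i + 1] = -1            # S_{i,i+2}: RNAP hops on (or terminates)
        S[i - 1][i + L + 1] = -1        # S_{i,i+L+2}: premature detachment
    S[L + 1][L + 2] = 1                 # S_{L+2,L+3}: termination produces M
    S[L + 1][2 * L + 3] = -1            # S_{L+2,2L+4}: decay of M
    return S

for row in stoichiometric_matrix(2):
    print(row)
```

For $L=2$, the rows correspond to $G_\mathrm{on}$, $P_1$, $P_2$, and M, and the eight columns to the reactions in the order of the reaction table.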

2.2 Closed-Form Expressions for Moments of Mature RNA and Local RNAP

In this subsection, we outline the derivation of the steady-state means and variances of local RNAP fluctuations (on each gene segment), as well as of mature RNA. Our results are summarised in the following two propositions.

Proposition 1

Let $\eta = s_u/(s_u + s_b)$ be the fraction of time the gene spends in the active state, let $\rho _k = r/k$ be the mean number of RNAPs binding to the promoter site in the time it takes for a single RNAP to move from one gene segment to the next, let $\rho = r/d_m$ be the mean number of RNAPs binding to the promoter site in the time it takes for a mature RNA to decay, and let $\mu = k/(k+d)$ be the probability that an RNAP molecule moves to the next gene segment rather than detaching prematurely. Then, the steady-state mean numbers of molecules of active gene, RNAP, and mature RNA are given by

$$\begin{aligned} \langle n_0 \rangle&= \eta , \end{aligned}$$

(2a)

$$\begin{aligned} \langle n_{i} \rangle&= \eta \rho _k \mu ^{i} \qquad \text {for }i=1,\dots {},L, \end{aligned}$$

(2b)

$$\begin{aligned} \langle n\rangle&= \eta \rho \mu ^{L}, \end{aligned}$$

(2c)

respectively.

Proposition 1 can be proved in a straightforward fashion, as follows. Using the underlying CME, one can show from the corresponding moment equations (Warren et al. 2006) that the time evolution of the vector $\vec {\langle m \rangle }$ of mean molecule numbers in a system of zeroth-order or first-order reactions, i.e. with propensities that are linear in the number of molecules, is governed by $\hbox {d}\vec {\langle m \rangle }/\hbox {d}t = \mathbf {S}\cdot \vec {f}$. Given the form of the stoichiometric matrix $\mathbf {S}$ and of the rate functions $f_j$, as described in Sect. 2.1, it follows that the mean numbers of all species in steady state can be obtained by solving the following system of $L+2$ algebraic equations:

$$\begin{aligned} \begin{aligned} 0&= s_u (1 - \langle n_0 \rangle ) - s_b \langle n_0 \rangle , \\ 0&= r \langle n_0 \rangle - (k+d) \langle n_1 \rangle , \\ 0&= k \langle n_{i-1} \rangle - (k + d) \langle n_{i} \rangle \qquad \mathrm{for} \ i=2,\dots {},L, \\ 0&= k \langle n_L \rangle - d_m \langle n\rangle . \end{aligned} \end{aligned}$$

(3)

These equations can easily be solved simultaneously to yield the steady-state value of $\vec {\langle m \rangle }$, as given in Eq. (2).
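The forward substitution behind Eq. (2) can be sketched as follows. The code solves Eq. (3) one equation at a time and compares the result with the closed-form means; exact rational arithmetic is used so that the comparison is an identity rather than a floating-point approximation. Function names and parameter values are illustrative assumptions.

```python
from fractions import Fraction

def steady_state_means(L, s_u, s_b, r, k, d, d_m):
    """Solve Eq. (3) by forward substitution; returns [<n_0>, <n_1>, ..., <n_L>, <n>]."""
    n0 = s_u / (s_u + s_b)
    means = [n0, r * n0 / (k + d)]
    for i in range(2, L + 1):
        means.append(k * means[i - 1] / (k + d))
    means.append(k * means[L] / d_m)
    return means

def closed_form_means(L, s_u, s_b, r, k, d, d_m):
    """Closed-form means of Eq. (2)."""
    eta, rho_k, rho, mu = s_u / (s_u + s_b), r / k, r / d_m, k / (k + d)
    return [eta] + [eta * rho_k * mu**i for i in range(1, L + 1)] + [eta * rho * mu**L]

# Hypothetical parameter values, chosen only to exercise the formulas.
params = dict(L=5, s_u=Fraction(1, 2), s_b=Fraction(3), r=Fraction(2),
              k=Fraction(1), d=Fraction(1, 10), d_m=Fraction(1, 4))
print(steady_state_means(**params) == closed_form_means(**params))
```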

Proposition 2

Let $\tau _p = 1/(d+k)$, $\tau _g = 1/(s_u+s_b)$, and $\tau _m = 1/d_m$ be the timescales of fluctuations of RNAP, gene, and mature RNA, respectively, and define the three new parameters

$$\begin{aligned} \alpha =\frac{1}{1+\tau _p/\tau _g}, \quad \gamma =\frac{1}{1+\tau _p/\tau _m}, \quad \text {and} \quad \theta =\frac{1}{1+\tau _m/\tau _g}. \end{aligned}$$

Furthermore, let $\beta =s_b/s_u$ denote the ratio of gene inactivation and activation rates. Then, the variances and covariances of molecule number fluctuations of active gene, RNAP, and mature RNA are given by

$$\begin{aligned} \mathrm{Var}(n_0)&=\langle {}n_0\rangle {}^2\beta {}, \end{aligned}$$

(4a)

$$\begin{aligned} \mathrm{Cov}(n_0,n_i)&= \langle {}n_0\rangle {}\langle {}n_i\rangle {}\alpha \beta {}\cdot {}f_{1i},&\quad {}&\text {where }f_{1i}=\alpha ^{i-1}; \end{aligned}$$

(4b)

$$\begin{aligned} \mathrm{Cov}(n_0,n)&= \langle {}n_0\rangle {}\langle {}n\rangle {}\alpha \beta {}\cdot {}f_{1M},&\quad {}&\text {where }f_{1M}=\theta {}\alpha ^{L-1}, \end{aligned}$$

(4c)

$$\begin{aligned} \mathrm{Cov}(n_i,n_j)&=\delta _{ij}\langle {}n_i\rangle {}+ \langle {}n_i\rangle {}\langle {}n_j\rangle {}\alpha \beta {}\cdot {}f_{ij},&\quad {}&\text {where }f_{ij}=f(i,j)+f(j,i), \end{aligned}$$

(4d)

$$\begin{aligned} \mathrm{Cov}(n_i,n)&= \langle {}n_i\rangle {}\langle {}n\rangle {}\alpha \beta {}\cdot {}f_{iM},&\quad {}&\text {where }f_{iM}=\gamma {}^{i}\theta \alpha ^{L-1}+(1-\gamma {})\sum \limits _{q=1}^{i}\gamma {}^{i-q}{f}_{qL}, \end{aligned}$$

(4e)

$$\begin{aligned} \mathrm{Var}(n)&=\langle {}n\rangle {}+ \langle {}n\rangle {}^2\alpha \beta {}\cdot {}f_{MM},&\quad {}&\text {where }f_{MM}=f_{LM}, \end{aligned}$$

(4f)

and where $i,j=1,\dots {},L$. Here, $\delta _{ij}$ is the Kronecker delta; moreover,

$$\begin{aligned} {f}(i,j) =\dfrac{\alpha ^{i+j-1}}{(2\alpha -1)^{i}} +\dfrac{1}{2^{i+j-1}}\left( {\begin{array}{c}i+j-1\\ i\end{array}}\right) \Big [ 1-\dfrac{2\alpha -1}{2\alpha }{}_2F_1\big (1,i+j;j;\tfrac{1}{2\alpha }\big ) \Big ], \end{aligned}$$

where ${}_2F_1$ denotes the generalised hypergeometric function of the second kind (Digital Library of Mathematical Functions 2020a), which is defined as

$$\begin{aligned} {}_2F_1(a_1,a_2;b_1;z)=\sum _{s=0}^{\infty }\dfrac{(a_1)_s(a_2)_s}{(b_1)_s}\dfrac{z^s}{s!}, \end{aligned}$$

with $(a)_s=\Gamma (a+s)/\Gamma (a)$ the Pochhammer symbol.

Here, we note that an alternative representation of the functions $f_{ij}$ in Eq. (4d), in terms of finite sums, is given in Eq. (B.33) of ‘Appendix B’.

As above, since the underlying propensities are linear in the number of molecules, the CME implies (Warren et al. 2006) that the corresponding second moments in steady state are exactly given by a Lyapunov equation. That equation, which is precisely the same as the one that is obtained from the linear-noise approximation (LNA) (Elf and Ehrenberg 2003), takes the form

$$\begin{aligned} \mathbf {J}\cdot {}\mathbf {C}+\mathbf {C}\cdot {}\mathbf {J}^{T}+\mathbf {D}=\mathbf {0}. \end{aligned}$$

(5)

Here, $\mathbf {C}$, $\mathbf {J}$, and $\mathbf {D}$ are $(L+2)\times {}(L+2)$-dimensional matrices; $\mathbf {C}$ is a variance–covariance matrix that is symmetric ($\mathbf {C}_{ij}=\mathbf {C}_{ji}$), $\mathbf {J}$ is the Jacobian matrix with elements $\mathbf {J}_{ij}=\partial (\mathbf {S}\cdot {}\vec {f})_{i}/\partial {}\langle n_j\rangle $, and $\mathbf {D}=\mathbf {S}\cdot {}\mathbf {Diag}(\vec {f})\cdot {}\mathbf {S}^{T}$ is a diffusion matrix, where $\mathbf {Diag}(\vec {f})$ is a diagonal matrix whose elements are the entries in the rate function vector $\vec {f}$. The nonzero elements of $\mathbf {J}$ are given by

$$\begin{aligned} \begin{aligned} \mathbf {J}_{11}&=-(s_u+s_b),\\ \mathbf {J}_{21}&=r,&\qquad {} \mathbf {J}_{22}&=-(k+d),\\ \mathbf {J}_{i,i-1}&=k,&\qquad {} \mathbf {J}_{ii}&=-(k+d) \qquad \text {for }i=3,\dots {},L+1,\\ \mathbf {J}_{L+2,L+1}&=k,&\qquad {} \mathbf {J}_{L+2,L+2}&=-d_m,\\ \end{aligned} \end{aligned}$$

(6)

while the nonzero elements $\mathbf {D}_{ij}$ read

$$\begin{aligned} \begin{aligned} \mathbf {D}_{11}&=s_b \langle n_0 \rangle +s_u(1-\langle n_0 \rangle ),\\ \mathbf {D}_{22}&=r \langle n_0 \rangle +(k+d)\langle n_1 \rangle ,&\qquad {} \mathbf {D}_{23}&=-k\langle n_1 \rangle ,\\ \mathbf {D}_{i,i-1}&=-k\langle n_{i-2} \rangle ,&\qquad {} \mathbf {D}_{ii}&=k\langle n_{i-2}\rangle +(k+d)\langle n_{i-1} \rangle \qquad \text {for }i=3,\dots {},L+1,\\ \mathbf {D}_{i,i+1}&=-k\langle n_{i-1} \rangle \qquad \text {for }i=3,\dots {},L,\\ \mathbf {D}_{L+1,L+2}&=\mathbf {D}_{L+2,L+1}=-k\langle n_{L} \rangle ,&\qquad {} \mathbf {D}_{L+2,L+2}&=k\langle n_{L} \rangle +d_m\langle n \rangle . \end{aligned} \end{aligned}$$

(7)

Given the structure of the matrices $\mathbf {J}$ and $\mathbf {D}$ above, the Lyapunov Eq. (5) can be solved explicitly for the covariance matrix $\mathbf {C}$ whose elements are given by Eq. (4). The solution by induction is involved and can be found in ‘Appendix B’, which proves Proposition 2.
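A numerical cross-check of Proposition 2 can be sketched as follows. This is not the inductive solution of ‘Appendix B’: the code simply exploits the fact that $\mathbf {J}$ is lower bidiagonal, so that Eq. (5) can be solved entry by entry, and then compares a few entries of $\mathbf {C}$ with Eq. (4). The values $f_{11}=1$ and $f_{12}=(1+\alpha )/2$ used below are special cases of Eq. (4d) that we evaluated by hand; all function names and parameter values are assumptions.

```python
import math

def model_matrices(L, s_u, s_b, r, k, d, d_m):
    """Jacobian J, diffusion matrix D and means for (G_on, P_1..P_L, M); 0-based indices."""
    eta, mu = s_u / (s_u + s_b), k / (k + d)
    means = [eta] + [eta * (r / k) * mu**i for i in range(1, L + 1)]
    means.append(eta * (r / d_m) * mu**L)
    size = L + 2
    J = [[0.0] * size for _ in range(size)]
    D = [[0.0] * size for _ in range(size)]
    J[0][0] = -(s_u + s_b)
    D[0][0] = s_b * means[0] + s_u * (1 - means[0])
    for a in range(1, L + 1):                  # index a holds P_a
        J[a][a - 1] = r if a == 1 else k       # inflow from G_on or from P_{a-1}
        J[a][a] = -(k + d)
        D[a][a] = J[a][a - 1] * means[a - 1] + (k + d) * means[a]
        if a > 1:                              # a hop P_{a-1} -> P_a changes both species
            D[a][a - 1] = D[a - 1][a] = -k * means[a - 1]
    J[L + 1][L], J[L + 1][L + 1] = k, -d_m
    D[L + 1][L + 1] = k * means[L] + d_m * means[L + 1]
    D[L + 1][L] = D[L][L + 1] = -k * means[L]
    return J, D, means

def solve_lyapunov(J, D):
    """Solve J C + C J^T + D = 0 for symmetric C, assuming J is lower bidiagonal."""
    size = len(J)
    C = [[0.0] * size for _ in range(size)]
    for a in range(size):
        for b in range(a + 1):
            rhs = D[a][b]
            if a > 0:
                rhs += J[a][a - 1] * C[a - 1][b]
            if b > 0:
                rhs += J[b][b - 1] * C[a][b - 1]
            C[a][b] = C[b][a] = -rhs / (J[a][a] + J[b][b])
    return C

L, s_u, s_b, r, k, d, d_m = 4, 0.5, 3.0, 2.0, 1.0, 0.1, 0.25   # hypothetical values
J, D, m = model_matrices(L, s_u, s_b, r, k, d, d_m)
C = solve_lyapunov(J, D)
beta = s_b / s_u
alpha = (k + d) / (k + d + s_u + s_b)      # 1 / (1 + tau_p / tau_g)
theta = d_m / (d_m + s_u + s_b)            # 1 / (1 + tau_m / tau_g)
ab = alpha * beta
checks = [(C[0][0], m[0] ** 2 * beta)]                                     # Eq. (4a)
checks += [(C[0][i], m[0] * m[i] * ab * alpha ** (i - 1))
           for i in range(1, L + 1)]                                       # Eq. (4b)
checks.append((C[0][L + 1], m[0] * m[L + 1] * ab * theta * alpha ** (L - 1)))  # Eq. (4c)
checks.append((C[1][1], m[1] + m[1] ** 2 * ab))                            # Eq. (4d), f_11 = 1
checks.append((C[1][2], m[1] * m[2] * ab * (1 + alpha) / 2))               # Eq. (4d), f_12
print(all(math.isclose(x, y) for x, y in checks))
```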

2.2.1 Simplification in Bursty and Constitutive Limits

Bursty limit: We now consider a particular parameter regime, namely the limit of large initiation rate r and large gene inactivation rate $s_b$ such that $b=r/s_b$ is constant. Since the fraction of time spent in the active state, $\eta =s_u/(s_u+s_b)$, tends to zero as $s_b$ grows, the gene is mostly in the inactive state in that limit. During the short periods of time when it transitions to the active state, a burst of initiation events occurs; in particular, a mean number b of RNAPs bind to the promoter during activation. Hence, such genes are often termed bursty, since transcription proceeds via sporadic bursts of activity, and b is called the mean transcriptional burst size. For r and $s_b$ large with b constant, the expressions for the first two moments of RNAP at every gene segment and of mature RNA from Eqs. (2) and (4), respectively, simplify to

$$\begin{aligned} \langle n_i\rangle _b&=b\upsilon _k\mu {}^{i}, \end{aligned}$$

(8a)

$$\begin{aligned} \langle n\rangle _b&=b\upsilon _m\mu {}^{L},&\end{aligned}$$

(8b)

$$\begin{aligned} \mathrm{Cov}(n_i,n_j)_b&=\delta _{ij}\langle {}n_i\rangle {}_b+\langle {}n_i\rangle {}_b\langle {}n_j\rangle {}_b(\upsilon _k\mu )^{-1}\cdot {}{h}_{ij},&\text {where }h_{ij}=\dfrac{1}{2^{i+j-2}}\dfrac{\Gamma (i+j-1)}{\Gamma (i)\Gamma (j)}, \end{aligned}$$

(8c)

$$\begin{aligned} \mathrm{Cov}(n_i,n)_b&=\langle {}n_i\rangle {}_b\langle {}n\rangle {}_b(\upsilon _k\mu )^{-1}\cdot {}{h}_{iM},&\text {where }h_{iM}=(1-\gamma {})\sum _{q=1}^{i}\gamma {}^{i-q}\cdot {}{h}_{qL}, \end{aligned}$$

(8d)

$$\begin{aligned} \mathrm{Var}(n)_b&=\langle {}n\rangle {}_b+\langle {}n\rangle {}^2_b(\upsilon _k\mu )^{-1}\cdot {}{h}_{MM},&\text {where }h_{MM}=h_{LM}; \end{aligned}$$

(8e)

here, the subscript b denotes the moments in the bursty limit. Moreover, $\upsilon _k=s_u/k$, $\upsilon _m=s_u/d_m$, and ${h}_{ij}={f}_{ij}|_{\alpha \rightarrow {}0}$ denotes the simplified function ${f}_{ij}$ in the limit of $\alpha {\longrightarrow {}}0$, which is achieved when $s_b\rightarrow \infty $. We note that the above expressions for the functions $h_{ij}$ are derived from the expressions for $f_{ij}$ that are given in Eq. (B.33), rather than from those in Eq. (4d). The reason is that, in the bursty limit, we have that $\frac{1}{2\alpha }\rightarrow {}\infty $, in which case the identity in Eq. (B.36) does not hold. The bursty limit in Eq. (B.33) is simply taken by collecting terms that are not dependent on $\alpha $, since $\alpha \longrightarrow 0$ in that limit.
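To make Eq. (8) concrete, the following sketch evaluates $h_{ij}$ and the resulting Fano factor of local RNAP along the gene in the bursty limit. It relies on the identity $\Gamma (i+j-1)/[\Gamma (i)\Gamma (j)]=\left( {\begin{array}{c}i+j-2\\ i-1\end{array}}\right) $ for integer arguments; the function names and parameter values are illustrative assumptions rather than fitted values.

```python
import math

def h(i, j):
    """h_ij of Eq. (8c); the Gamma-function ratio equals a binomial coefficient."""
    return math.comb(i + j - 2, i - 1) / 2 ** (i + j - 2)

def bursty_moments(i, j, b, s_u, k, d):
    """Return <n_i>_b, <n_j>_b and Cov(n_i, n_j)_b from Eqs. (8a) and (8c)."""
    ups_k, mu = s_u / k, k / (k + d)
    mean_i, mean_j = b * ups_k * mu**i, b * ups_k * mu**j
    cov = (mean_i if i == j else 0.0) + mean_i * mean_j * h(i, j) / (ups_k * mu)
    return mean_i, mean_j, cov

b, s_u, k, d, L = 10.0, 0.1, 1.0, 0.01, 30     # hypothetical values
fano = []
for i in range(1, L + 1):
    mean_i, _, var_i = bursty_moments(i, i, b, s_u, k, d)
    fano.append(var_i / mean_i)                # Fano factor on segment i
print("FF on segment 1:", round(fano[0], 2), " FF on segment L:", round(fano[-1], 2))
```

In this approximation the Fano factor equals $1+b\mu ^{i-1}h_{ii}$, so it starts at $1+b$ on the first segment and decreases towards 1 with increasing i.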

To test the accuracy of our theory, in Fig. 2 we compare our analytical expressions for the mean of local RNAP numbers, as well as for various measures of local RNAP fluctuations—the coefficient of variation $\mathrm {CV}$, the Fano factor $\mathrm {FF}$, and the Pearson correlation coefficient $\mathrm {CC}$—with those calculated from stochastic simulation using Gillespie’s algorithm (SSA) (Gillespie 1977). Simulations are performed for two different scenarios: (i) without volume exclusion, where the footprint of RNAPs is not taken into account; and (ii) with volume exclusion, where RNAPs are treated as solid objects with a footprint of 35 bp, which is the value reported in Md Zulfikar et al. (2020). For our simulations in Fig. 2, we use parameter values characteristic for the gene PDR5 of length 3070 bp, as reported in Zenklusen et al. (2008). Our choice of $L=30$ implies that the length of each gene segment is about 100 bp and, hence, that at most 3 RNAPs can fit in each segment when volume exclusion is taken into account. In this case, Gillespie’s algorithm is modified such that the initiation and RNAP ‘hopping’ rates are proportional to the available volume in the gene segment which the RNAP is moving to. That is achieved by rescaling the transcription initiation rate as $r\mapsto {}r(1-n_1/3)$ and the RNAP hopping rate from the ith to the $(i+1)$th gene segment as $k\mapsto {}k(1-n_{i+1}/3)$. Since we use parameters measured for a gene that demonstrates bursty expression ($\mathrm {PDR}5$) (Zenklusen et al. 2008), we test the accuracy of both the exact theory from Eqs. (2) and (4) and the approximate expressions given in Eq. (8).
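A minimal sketch of such a simulation is given below. It is not the code used to produce Fig. 2: it implements the direct-method SSA for the reaction scheme of Sect. 2.1, with an optional `capacity` argument (a name we introduce here) that applies the rescalings $r\mapsto {}r(1-n_1/3)$ and $k\mapsto {}k(1-n_{i+1}/3)$ when set to 3. The run starts from an inactive, empty gene, so the time averages include a short transient; parameter values are hypothetical.

```python
import random

def simulate(L, s_u, s_b, r, k, d, d_m, t_end, capacity=None, seed=0):
    """Direct-method SSA; returns time-averaged copy numbers [G_on, P_1..P_L, M]."""
    rng = random.Random(seed)
    n = [0] * (L + 2)      # n[0]: gene state, n[1..L]: RNAP per segment, n[L+1]: mature RNA
    acc = [0.0] * (L + 2)  # time integrals of the copy numbers
    t = 0.0

    def free(i):           # free-volume factor of segment i (1 without volume exclusion)
        return 1.0 if capacity is None else max(0.0, 1.0 - n[i] / capacity)

    while True:
        props = [s_b if n[0] else s_u,                          # promoter switching
                 r * free(1) if n[0] else 0.0]                  # initiation
        props += [k * n[i] * free(i + 1) for i in range(1, L)]  # hopping i -> i+1
        props.append(k * n[L])                                  # termination
        props += [d * n[i] for i in range(1, L + 1)]            # premature detachment
        props.append(d_m * n[L + 1])                            # decay of mature RNA
        dt = rng.expovariate(sum(props))
        step = min(dt, t_end - t)
        for i in range(L + 2):
            acc[i] += n[i] * step
        t += dt
        if t >= t_end:
            return [a / t_end for a in acc]
        j = rng.choices(range(len(props)), weights=props)[0]    # pick the next reaction
        if j == 0:
            n[0] = 1 - n[0]
        elif j == 1:
            n[1] += 1
        elif j <= L:
            n[j - 1] -= 1
            n[j] += 1
        elif j == L + 1:
            n[L] -= 1
            n[L + 1] += 1
        elif j <= 2 * L + 1:
            n[j - L - 1] -= 1
        else:
            n[L + 1] -= 1

avg = simulate(L=3, s_u=1.0, s_b=1.0, r=2.0, k=1.0, d=0.1, d_m=0.5, t_end=10000.0, seed=1)
eta, rho_k, mu = 0.5, 2.0, 1.0 / 1.1
print("simulated <n_1>:", round(avg[1], 3), " Eq. (2b):", round(eta * rho_k * mu, 3))
```

Calling `simulate(..., capacity=3)` with the same remaining arguments gives the volume-exclusion variant, which can then be compared against the unconstrained run.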

The perfect agreement between our exact theory (solid lines) and simulation without volume exclusion (dots) provides a numerical validation of that theory. Our approximate theory (dashed lines) also yields a reasonably good approximation; the mismatch can be decreased if the degree of burstiness is increased, i.e. by increasing the parameters r and $s_b$ relative to the other rates in the model. We also note that the theory is in good agreement with simulation with volume exclusion (open circles), which shows that the ‘low traffic’ assumption upon which our theory is based is valid.

The following observations can be made from these figures: (i) if the rate of premature detachment is greater than zero, then the mean of local RNAP decreases monotonically with the distance i from the promoter, decaying geometrically as $\mu ^{i}$ by Eq. (2b), whereas that mean is constant along the gene if there is no premature detachment, as expected; (ii) the size of RNAP fluctuations, as measured by $\mathrm{CV}$, decreases with i for small premature detachment rates, but increases with i for sufficiently large values of the detachment rate; (iii) the Fano factor approaches 1, the value of $\mathrm{FF}$ for a Poissonian distribution, as i increases, which is due to the dispersal of the burst as stochastic elongation proceeds; (iv) the correlation coefficient between the local RNAP on two neighbouring gene segments decreases monotonically with i, which is exacerbated by premature detachment and is a direct result of the stochasticity inherent in the elongation process.

The observation in (ii) can be explained in detail as follows. When the detachment rate is zero, a burst of RNAPs rapidly binds to the promoter, leading to large fluctuations near that site; thereafter, however, each RNAP moves independently of all others due to stochastic elongation. Hence, the burst is gradually dispersed as elongation proceeds, which implies a decrease in the variance of fluctuations with increasing i. When the detachment rate is nonzero, the same effect is at play; however, the decrease in the variance of fluctuations along the gene is now counteracted by the decrease in mean RNAP numbers, which leads to two types of behaviour: for small i, $\mathrm{CV}$ decreases with i, since the decrease in the variance dominates over that in the mean, while for large i, the opposite occurs and $\mathrm{CV}$ increases with i.
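The two regimes of the coefficient of variation can be reproduced from the bursty-limit expressions alone. The sketch below uses Eqs. (8a) and (8c), i.e. the approximate theory rather than the exact result of Eq. (4), and compares a gene without premature detachment to one with a comparatively large detachment rate; all parameter values are hypothetical.

```python
import math

def cv_bursty(i, b, s_u, k, d):
    """CV of local RNAP on segment i in the bursty limit, from Eqs. (8a) and (8c)."""
    ups_k, mu = s_u / k, k / (k + d)
    mean = b * ups_k * mu**i
    h_ii = math.comb(2 * i - 2, i - 1) / 4 ** (i - 1)
    var = mean + mean**2 * h_ii / (ups_k * mu)
    return math.sqrt(var) / mean

b, s_u, k, L = 10.0, 0.1, 1.0, 30              # hypothetical values
cv_no_detach = [cv_bursty(i, b, s_u, k, 0.0) for i in range(1, L + 1)]
cv_detach = [cv_bursty(i, b, s_u, k, 0.1) for i in range(1, L + 1)]
i_min = 1 + cv_detach.index(min(cv_detach))    # segment with the smallest CV
print("d = 0:   CV falls from", round(cv_no_detach[0], 2), "to", round(cv_no_detach[-1], 2))
print("d = 0.1: CV is smallest at segment", i_min, "and rises again towards i =", L)
```

Here $\mathrm{CV}^2=1/\langle n_i\rangle _b+h_{ii}/(\upsilon _k\mu )$: the second term decreases with i as the burst disperses, while the first grows as $\mu ^{-i}$ once detachment depletes the mean.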

Constitutive limit: The other common parameter regime is that of constitutive gene expression, where the gene spends most of its time in the active state and transcription is continuous, which corresponds to the limit of very small $s_b$. In that limit, the expressions from Eqs. (2) and (4) simplify to

$$\begin{aligned} \begin{aligned} \langle n_i\rangle _c =\mathrm{Var}(n_i)_c=\rho _k\mu {}^{i} \qquad {}{} \text {and} \qquad {}{} \langle n\rangle _c&=\mathrm{Var}(n)_c=\rho \mu {}^{L}, \end{aligned} \end{aligned}$$

(9)

while the covariances $\mathrm{Cov}(n_i,n_j)_c$ and $\mathrm{Cov}(n_i,n)_c$ between the species are zero; here, the subscript c denotes the constitutive limit. This drastic simplification reflects the fact that, in the constitutive limit, the distributions of mature RNA and local RNAP are Poissonian: since the reaction network is effectively given by $\emptyset \rightarrow P_1 \rightarrow P_2 \rightarrow \dots \rightarrow P_L \rightarrow M \rightarrow \emptyset $, the result follows directly from the exact solution provided in Jahnke and Huisinga (2007).
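A brief sketch of how the constitutive limit arises from Eqs. (2) and (4), under the assumption (not stated explicitly above) that the functions $f_{ij}$ remain bounded as $s_b\rightarrow 0$: in that limit,

$$\begin{aligned} \eta =\frac{s_u}{s_u+s_b}\rightarrow 1, \qquad \beta =\frac{s_b}{s_u}\rightarrow 0, \qquad \mathrm{Cov}(n_i,n_j)=\delta _{ij}\langle n_i\rangle +\langle n_i\rangle \langle n_j\rangle \alpha \beta \cdot f_{ij}\rightarrow \delta _{ij}\,\rho _k\mu ^{i}, \end{aligned}$$

so that every term proportional to $\beta $ in Eq. (4) vanishes, while the means in Eq. (2) reduce to $\rho _k\mu ^{i}$ and $\rho \mu ^{L}$; the equality of mean and variance is the Poissonian signature referred to above.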

To further test the accuracy of our theory, in Fig. 3 we compare our analytical expressions for the mean of local RNAP numbers, as well as for various measures of local RNAP fluctuations, with those calculated from stochastic simulation using Gillespie’s algorithm, where we use parameters measured for a gene that demonstrates constitutive expression ($\mathrm {DOA}1$) (Zenklusen et al. 2008). As before, we test the accuracy of both the exact theory given by Eqs. (2) and (4) and the approximate expressions from Eq. (9). Unsurprisingly, we observe agreement between exact theory (solid lines) and simulation (dots); the mismatch between our approximate theory and simulation is due to the fact that the gene does not spend 100% of its time in the active state (the true constitutive limit) but, rather, $s_u/(s_u + s_b)\approx 85\%$. The local mean RNAP number decreases with distance from the promoter, as was the case for bursty expression above, which is to be expected. The various measures which depend on the second moments are, however, considerably different: $\mathrm {CV}$ increases monotonically with i, independently of the rate of premature detachment, while $\mathrm {FF}$ and $\mathrm {CC}$ are very close to 1 and zero, respectively; moreover, the latter two measures show very little variation along the gene. The lack of transcriptional bursting explains all these effects in a straightforward fashion.

Finally, we remark that the accuracy of our expressions for the mean and variance of mature RNA, as given in Eqs. (2) and (4), is verified by simulation (SSA) in Fig. 4a, b for parameters typical of the bursty $\mathrm {PDR}5$ gene. The meaning of the dependence of descriptive statistics on L is discussed in the next section.

2.3 Closed-Form Expressions for Moments of Total RNAP

While local RNAP fluctuations are measurable in experiment, as discussed in the Introduction, measurements of total RNAP on a gene are typically reported. Hence, in this section, we briefly discuss descriptive statistics of total RNAP fluctuations.

Recalling that $n_i$ is the number of RNAP molecules on the ith gene segment, the total number of RNAPs on the gene, arbitrarily divided into L segments, is given by $n_\mathrm{tot}=\sum _{i=1}^{L}n_i$. Given Eqs. (2) and (4), the steady-state mean $\langle n_\mathrm{tot} \rangle =\sum _{i=1}^{L}\langle n_i\rangle $ and the steady-state variance $\mathrm{Var}(n_\mathrm{tot})=\sum _{i,j=1}^{L}\mathrm{Cov}(n_i,n_j)$ of the total RNAP distribution are given by

$$\begin{aligned} \begin{aligned} \langle n_\mathrm{tot}\rangle =\eta {}\rho _k\mu \dfrac{\mu ^L-1}{\mu -1} \quad \text {and}\quad \mathrm{Var}(n_\mathrm{tot}) =\langle {}n_\mathrm{tot}\rangle {}+\alpha \beta (\eta {}\rho {}_k)^2\sum _{i,j=1}^{L}\mu ^{i+j}\cdot {}{f}_{ij}. \end{aligned}\nonumber \\ \end{aligned}$$

(10)

For a detailed derivation of the variance in Eq. (10), we refer to ‘Appendix C’. These expressions for the mean and variance of the total RNAP distribution simplify in the bursty and constitutive limits, as can be seen in ‘Appendix D’. The accuracy of Eq. (10) is tested by comparing against stochastic simulation with SSA in Fig. 4c, d. Both mean and variance are seen to increase monotonically with the number of gene segments L, as we keep the mean elongation time constant; the mean shows very little dependence on L, while the dependence of the variance is more pronounced. We recall that, while the parameter L is arbitrary in principle, it actually determines the size of fluctuations in the elongation time. Since that time is the sum of L independent exponential variables with mean $1/(k+d)$ each, it follows that the distribution of the elongation time T is Erlang with mean $\langle T \rangle =L/(k+d)$ and coefficient of variation squared equal to 1/L. Hence, the larger L is, the narrower is the distribution of T and the more deterministic is elongation itself. Thus, Fig. 4c, d predicts that the mean and variance of total RNAP increase rapidly with decreasing fluctuations in the elongation time T. It hence follows that models in which the elongation time is assumed to be exponentially distributed (Cao and Grima 2020), which correspond to the case where $L = 1$ in our model, underestimate the size of nascent RNA fluctuations.
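The kind of simulation check described above can be sketched as follows. This is a minimal Gillespie-type illustration, not the authors' code: it assumes the reaction scheme that can be read off the master equation in Eq. (21) (promoter switching with rates $s_u$ and $s_b$, initiation with rate r in the active state, hopping with rate k, premature detachment with rate d), it omits mature RNA, and it assumes $\eta =s_u/(s_u+s_b)$ and $\rho _k=r/k$, which are not restated in this section. All function names and parameter values are hypothetical.

```python
import random


def mean_total_rnap_theory(s_u, s_b, r, k, d, L):
    """Mean of total RNAP from Eq. (10); eta and rho_k are assumed definitions."""
    eta = s_u / (s_u + s_b)      # assumed: fraction of time in the active state
    rho_k = r / k                # assumed: initiation rate over hopping rate
    mu = k / (k + d)
    if d == 0:
        return eta * rho_k * L   # limit mu -> 1 of the geometric sum
    return eta * rho_k * mu * (mu**L - 1) / (mu - 1)


def simulate_mean_total_rnap(s_u, s_b, r, k, d, L, t_end, seed=0):
    """Time-averaged total RNAP number from one Gillespie trajectory."""
    rng = random.Random(seed)
    gene_on = True
    n = [0] * L                  # RNAP count on each gene segment
    t, integral = 0.0, 0.0
    while t < t_end:
        n_tot = sum(n)
        a_switch = s_b if gene_on else s_u
        a_init = r if gene_on else 0.0
        a_total = a_switch + a_init + (k + d) * n_tot
        dt = min(rng.expovariate(a_total), t_end - t)
        integral += n_tot * dt   # accumulate the time integral of n_tot
        t += dt
        if t >= t_end:
            break
        u = rng.random() * a_total
        if u < a_switch:
            gene_on = not gene_on
        elif u < a_switch + a_init:
            n[0] += 1            # initiation places an RNAP on segment 1
        else:
            if n_tot == 0:
                continue         # guard against floating-point edge cases
            # Pick one RNAP uniformly at random and locate its segment.
            idx = rng.randrange(n_tot)
            i = 0
            while idx >= n[i]:
                idx -= n[i]
                i += 1
            n[i] -= 1
            # The chosen RNAP hops with probability k/(k+d), else it detaches.
            if rng.random() < k / (k + d) and i + 1 < L:
                n[i + 1] += 1
            # A hop from the last segment is termination: the RNAP leaves.
    return integral / t_end
```

For instance, `simulate_mean_total_rnap(1.0, 1.0, 10.0, 5.0, 0.1, 5, 5000.0)` can be compared with `mean_total_rnap_theory(1.0, 1.0, 10.0, 5.0, 0.1, 5)`; the two should agree up to sampling error, which shrinks as `t_end` grows.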

2.4 Special Case of Deterministic Elongation

Next, we derive expressions for the descriptive statistics of total RNAP and mature RNA in the limit of large L taken at constant mean elongation time, which corresponds to deterministic elongation. As is shown in Fig. 4, these statistics converge quickly to the ones obtained in the large-L limit; hence, the resulting limiting expressions are likely to be useful across a variety of genes.
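To illustrate why large L at constant mean elongation time corresponds to deterministic elongation, the following sketch samples the elongation time as a sum of L exponential waiting times with rate $k+d$ and estimates its squared coefficient of variation, which should be close to 1/L. The function names and the sampling parameters are hypothetical choices made for illustration.

```python
import random
import statistics


def sample_elongation_times(L, mean_T, n_samples, seed=0):
    """Sample Erlang elongation times with L stages and mean mean_T."""
    rng = random.Random(seed)
    rate = L / mean_T  # k + d, fixed by <T> = L / (k + d)
    return [
        sum(rng.expovariate(rate) for _ in range(L))
        for _ in range(n_samples)
    ]


def cv_squared(samples):
    """Squared coefficient of variation (variance over squared mean)."""
    m = statistics.fmean(samples)
    return statistics.pvariance(samples, mu=m) / m**2
```

Calling `cv_squared(sample_elongation_times(L, 2.0, 20000))` for increasing L shows the estimate shrinking roughly as 1/L while the sample mean stays near 2.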

Moments of total RNAP distribution: We define the non-dimensional parameters $\updelta _g=\tau _g/\tau _d$, $T_g=\langle {}T\rangle {}/\tau _g$, and $T_d=\langle {}T\rangle {}/\tau _d$, which correspond to the ratio of the gene timescale and the polymerase detachment timescale, the ratio of the mean elongation time and the gene timescale, and the ratio of the mean elongation time and the polymerase detachment timescale, respectively; here, $\tau _d=1/d$, as before. Substituting $k\mapsto {}L/\langle {}{}T\rangle {}{}-d$ into Eq. (10) and taking the limit of deterministic elongation, i.e. letting $L\rightarrow {}\infty $ at constant $\langle T\rangle $, we obtain the following expressions for the mean, variance, and $\mathrm{CV}^2$ of total RNAP:

$$\begin{aligned} \begin{aligned} \langle {}n_\mathrm{tot}\rangle {}_{\infty }&= \eta \dfrac{r}{d}(1-\hbox {e}^{-T_d}) , \\ \mathrm{Var}(n_\mathrm{tot})_{\infty }&=\langle {}n_\mathrm{tot}\rangle {}_{\infty } +\langle {}n_\mathrm{tot}\rangle {}_{\infty }^2\cdot {} \beta {}\updelta _g \dfrac{(\updelta _g-1)+(\updelta _g+1)\hbox {e}^{-2T_d}-2\updelta _g\hbox {e}^{-T_g}\hbox {e}^{-T_d}}{(\updelta _g-1)(\updelta _g+1)(1-\hbox {e}^{-T_d})^2} ,\\ \mathrm{CV}^2(n_\mathrm{tot})_{\infty }&=\langle {}n_\mathrm{tot}\rangle {}_{\infty }^{-1} + \beta {}\updelta _g \dfrac{(\updelta _g-1)+(\updelta _g+1)\hbox {e}^{-2T_d}-2\updelta _g\hbox {e}^{-T_g}\hbox {e}^{-T_d}}{(\updelta _g-1)(\updelta _g+1)(1-\hbox {e}^{-T_d})^2}. \end{aligned} \nonumber \\ \end{aligned}$$

(11)

Here, the subscript $\infty $ denotes the limit of $L\rightarrow {}\infty $. A detailed derivation of the variance in Eq. (11) can be found in Lemma C.1 of ‘Appendix C’.
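The convergence of the finite-L mean in Eq. (10) to the limiting mean in Eq. (11) can be checked numerically. The sketch below assumes $\rho _k=r/k$ (a definition not restated in this section), takes $\eta $ as a given input, and requires $d>0$ and $L/\langle T\rangle >d$; the function names are hypothetical.

```python
import math


def mean_total_rnap_finite_L(eta, r, d, mean_T, L):
    """Mean of total RNAP from Eq. (10) with k = L/<T> - d."""
    k = L / mean_T - d          # substitution used in the text
    mu = k / (k + d)
    rho_k = r / k               # assumed definition of rho_k
    return eta * rho_k * mu * (mu**L - 1) / (mu - 1)


def mean_total_rnap_limit(eta, r, d, mean_T):
    """Mean of total RNAP for deterministic elongation, Eq. (11)."""
    T_d = d * mean_T            # T_d = <T> / tau_d with tau_d = 1/d
    return eta * (r / d) * (1 - math.exp(-T_d))
```

Evaluating the first function for L = 1, 10, 100, ... at fixed `mean_T` gives a quick impression of how fast the limit is approached for a given parameter set.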

In the special case when RNAP does not prematurely detach from the gene, i.e. for $d=0$, the expressions in Eq. (11) simplify to

$$\begin{aligned} \begin{aligned} \langle {}n_\mathrm{tot}\rangle {}_{(\infty ;0)}&=\eta {}r\langle {}T\rangle {}, \\ \mathrm{Var}(n_\mathrm{tot})_{(\infty ;0)}&=\langle {}n_\mathrm{tot}\rangle {}_{(\infty ;0)}+ \langle {}n_\mathrm{tot}\rangle {}_{(\infty ;0)}^2\cdot {} 2\beta {}T_g^{-1}\big (1-T_g^{-1}+T_g^{-1}\hbox {e}^{-T_g}\big ) ,\\ \mathrm{CV}^2_{(\infty ;0)}&=\langle {}n_\mathrm{tot}\rangle {}_{(\infty ;0)}^{-1}+ 2\beta {}T_g^{-1}\big (1-T_g^{-1}+T_g^{-1}\hbox {e}^{-T_g}\big ), \end{aligned} \end{aligned}$$

(12)

where the subscript $(\infty ;0)$ denotes the limit of $(L,d)\rightarrow {}(\infty ,0)$. The expressions in Eq. (12) have been previously reported in Choubey et al. (2015), where they were derived using queuing theory. Hence, our expressions in Eq. (11) constitute a generalisation of known results, by further taking into account premature detachment of RNAP from the gene.
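As a numerical sanity check of this reduction, the sketch below evaluates the non-Poissonian term of $\mathrm{CV}^2$ in Eq. (11) for a small value of $\updelta _g$ and compares it with the corresponding term in Eq. (12). It uses $T_d=\updelta _g T_g$, which follows from the definitions of the three non-dimensional parameters; the function names are hypothetical, and the point $\updelta _g=1$ is excluded because the expression in Eq. (11) is of the form 0/0 there.

```python
import math


def excess_noise_eq11(beta, delta_g, T_g):
    """Non-Poissonian term of CV^2 in Eq. (11); requires delta_g != 1."""
    T_d = delta_g * T_g  # T_d / T_g = tau_g / tau_d = delta_g
    num = (
        (delta_g - 1)
        + (delta_g + 1) * math.exp(-2 * T_d)
        - 2 * delta_g * math.exp(-T_g - T_d)
    )
    den = (delta_g - 1) * (delta_g + 1) * (1 - math.exp(-T_d)) ** 2
    return beta * delta_g * num / den


def excess_noise_eq12(beta, T_g):
    """Non-Poissonian term of CV^2 in Eq. (12), i.e. for d = 0."""
    return 2 * beta / T_g * (1 - 1 / T_g + math.exp(-T_g) / T_g)
```

With, say, `beta = 0.5` and `T_g = 2.0`, the value of `excess_noise_eq11(0.5, 1e-3, 2.0)` should be very close to `excess_noise_eq12(0.5, 2.0)`.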

Equation (12) shows that the coefficient of variation squared of total RNAP, denoted by $\mathrm{CV}^2_{(\infty ;0)}$, can be written as the sum of two terms: (i) the inverse of the mean which is expected if the distribution of total RNAP is Poissonian, and (ii) a term that increases with increasing $\beta $ and decreasing $T_g$. Hence, the latter term provides a measure for the deviation of the total RNAP distribution from a Poissonian. In particular, it shows that the deviation is significant in genes for which (i) the fraction of time spent in the inactive state is large (large $\beta $), and (ii) the elongation time is much shorter than the switching time between the active and inactive states (small $T_g$).

Moments of mature RNA distribution: Similarly, in the limit of deterministic elongation, it is straightforward to show that the expressions for the mean and variance of the distribution of mature RNA given by Eqs. (2) and (4) reduce to

$$\begin{aligned} \begin{aligned} \langle {} n \rangle {}_{\infty }&=\eta \rho \hbox {e}^{-T_d}&\quad {}\text {and}\quad \mathrm{Var}(n)_{\infty }&=\langle {} n \rangle {}_{\infty }+\langle {} n \rangle {}_{\infty }^2\cdot {}\beta \theta . \end{aligned} \end{aligned}$$

(13)

These expressions can be further simplified in the special case of no premature detachment to read

$$\begin{aligned} \begin{aligned} \langle {} n \rangle {}_{(\infty ;0)}=\eta \rho \quad \text {and}\quad \mathrm{Var}(n)_{(\infty ;0)}&=\langle {} n \rangle {}_{(\infty ;0)}+\langle {} n \rangle {}_{(\infty ;0)}^2\cdot {}\beta \theta . \end{aligned} \end{aligned}$$

(14)

Note that the mean and variance are precisely the same as would be obtained from the telegraph model, for which the corresponding Fano factor in the bursty limit is given by Eq. (16) below. Hence, we anticipate that, in the limit of no premature detachment and deterministic elongation, the distribution of mature RNA from our transcription model is the same as the distribution obtained from the coarser telegraph model. A formal proof of that claim will be given in Sect. 3.

Relationship between Fano factors of total RNAP and mature RNA: Specialising to the case of no premature detachment, it is interesting to note that in the bursty limit, i.e. for $r,s_b \rightarrow \infty $ at constant mean burst size $b=r/s_b$ in Eq. (12), the Fano factor of total RNAP is given by

$$\begin{aligned} {\mathrm{FF}_n}_{(b;\infty ;0)} = 1 + 2b; \end{aligned}$$

(15)

see also Eq. (D.3) in ‘Appendix D’. Here, the subscript n denotes nascent RNA (total RNAP). Eq. (15) is in contrast to the Fano factor of mature RNA in the same bursty limit:

$$\begin{aligned} {\mathrm{FF}_m}_{(b;\infty ;0)} = 1 + b, \end{aligned}$$

(16)

see Eq. (D.8) in ‘Appendix D’, where the subscript m denotes mature RNA. (Note that ${\mathrm{FF}_m}_{(b;\infty ;0)}$ also equals the Fano factor of the telegraph model in the same bursty limit (Raj et al. 2006).) Hence, by comparing Eqs. (15) and (16), we can deduce the following for bursty expression: (i) if the telegraph model is used to estimate the mean transcriptional burst size from total RNAP data where the elongation time is deterministic, then the mean burst size will be overestimated by a factor of two—in other words, the implicit assumption that the elongation time is exponentially distributed is inadequate; (ii) fluctuations in total RNAP (nascent RNA) deviate more from Poisson statistics, for which the Fano factor equals one, than fluctuations in mature RNA.
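The bursty limit in Eq. (15) can be reproduced numerically from Eq. (12). The sketch below assumes $\eta =s_u/(s_u+s_b)$ and $\beta =s_b/s_u$; these definitions are not restated in this section, but they are consistent with Eq. (15). The function name and the parameter values are hypothetical.

```python
import math


def fano_total_rnap_no_detachment(s_u, s_b, r, mean_T):
    """Fano factor of total RNAP from Eq. (12) (deterministic elongation, d = 0)."""
    eta = s_u / (s_u + s_b)   # assumed definition
    beta = s_b / s_u          # assumed definition
    T_g = (s_u + s_b) * mean_T
    mean = eta * r * mean_T
    excess = 2 * beta / T_g * (1 - 1 / T_g + math.exp(-T_g) / T_g)
    return 1 + mean * excess  # Var / mean


# Bursty regime: large r and s_b at fixed mean burst size b = r / s_b.
b = 5.0
s_b = 1.0e4
ff_nascent = fano_total_rnap_no_detachment(1.0, s_b, b * s_b, 1.0)
# Reading this Fano factor with the telegraph relation FF = 1 + b
# gives a burst size estimate close to 2b rather than b.
naive_burst_estimate = ff_nascent - 1
```

The last line makes the factor-of-two overestimate in point (i) explicit: the naive estimate is close to 10 for a true mean burst size of 5.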

More generally, if we do not enforce the bursty limit, then we find the following relationship between the Fano factors of total RNAP and mature RNA, which are calculated from Eqs. (12) and (14), respectively:

$$\begin{aligned} \begin{aligned} \frac{{\mathrm{FF}}_{n(\infty ;0)}}{{\mathrm{FF}}_{m(\infty ;0)}} = 1 + \frac{\hbox {e}^{-T_g} T_r T_{s_b} \Xi }{T_g^2 \big [T_r T_{s_b}+T_g(T_g+T_m)\big ]}. \end{aligned} \end{aligned}$$

(17)

Here,

$$\begin{aligned} \begin{aligned} \Xi = 2 (T_g+T_m)+\hbox {e}^{T_g} [2 (T_g-1) T_m+(T_g-2)T_g], \end{aligned} \end{aligned}$$

(18)

while $T_g = (s_u + s_b)\langle T \rangle $, $T_r = r\langle T \rangle $, $T_m = d_m \langle T \rangle $, and $T_{s_b} = s_b \langle T \rangle $ are non-dimensional parameters representing the ratio of the mean elongation time to the timescales of promoter switching, initiation, decay of mature RNA, and gene deactivation, respectively. From Eq. (17), we deduce that $\mathrm {FF}_{n(\infty ;0)}>\mathrm {FF}_{m(\infty ;0)}$ if and only if $\Xi >0$. From the contour plot of $\Xi $ in Fig. 5, one can deduce that

$$\begin{aligned} \Xi > 0\quad \text {if and only if}\quad T_m \gtrapprox 1 - \frac{5}{8} T_g. \end{aligned}$$

(19)

Hence, the Fano factor of nascent RNA is larger than that of mature RNA if and only if the above (approximate) condition is satisfied. In the bursty limit, $T_g \rightarrow \infty $ due to $s_b \rightarrow \infty $ which, together with $T_m > 0$, implies that Eq. (19) holds; the condition is also satisfied if promoter switching is very fast compared to elongation. By contrast, if $T_m < 1$ and $T_g < 1$, then it is possible to have the opposite scenario where the Fano factor of mature RNA is larger than that of nascent RNA, which occurs, for example, if promoter switching and mature RNA decay are very slow compared to elongation.
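Equations (17) to (19) are straightforward to evaluate numerically. In the sketch below, the ratio in Eq. (17) is computed with the product $\hbox {e}^{-T_g}\Xi $ written out explicitly, which avoids overflow of $\hbox {e}^{T_g}$ for large $T_g$; that rearrangement and all function names are choices made here for illustration.

```python
import math


def xi(T_g, T_m):
    """Xi from Eq. (18); suitable for moderate T_g."""
    return 2 * (T_g + T_m) + math.exp(T_g) * (
        2 * (T_g - 1) * T_m + (T_g - 2) * T_g
    )


def fano_ratio(T_g, T_m, T_r, T_sb):
    """Ratio of nascent to mature Fano factors from Eq. (17)."""
    # exp(-T_g) * Xi, expanded so that exp(T_g) never has to be formed.
    scaled_xi = (
        2 * (T_g + T_m) * math.exp(-T_g)
        + 2 * (T_g - 1) * T_m
        + (T_g - 2) * T_g
    )
    return 1 + T_r * T_sb * scaled_xi / (
        T_g**2 * (T_r * T_sb + T_g * (T_g + T_m))
    )


def approximate_condition(T_g, T_m):
    """Approximate criterion of Eq. (19) for Xi > 0."""
    return T_m > 1 - 5 / 8 * T_g
```

For example, `xi(0.5, 0.2)` is negative and `xi(0.5, 2.0)` is positive, in line with the approximate criterion, and in a bursty parameter regime `fano_ratio` approaches $(1+2b)/(1+b)$, as expected from Eqs. (15) and (16).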

Sensitivity of coefficient of variation of total RNAP and mature RNA: Since we have found explicit expressions for the first two moments of the distributions of total RNAP and of mature RNA, we can now estimate the sensitivity of the noise in each of those to small perturbations in the transcriptional parameters. Specifically, we calculate the logarithmic sensitivity (LS), which is also known as the relative sensitivity, of the coefficient of variation (CV) to a parameter s, which is defined as $\Lambda {}_{s}=(s/{\mathrm{CV}})(\partial {}{\mathrm{CV}}/\partial {}s)$. (That definition implies that a $1\%$ change in the value of the parameter s results in a change of approximately $\Lambda _{s}\%$ in $\mathrm{CV}$.)
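The logarithmic sensitivity defined above can be approximated by a central finite difference. The sketch below is one possible implementation, applied to the coefficient of variation of total RNAP from Eq. (12); it assumes $\eta =s_u/(s_u+s_b)$ and $\beta =s_b/s_u$ (not restated in this section), and the function names, dictionary keys, and step size are hypothetical. It is not the procedure used to produce Table 1.

```python
import math


def cv_total_rnap(p):
    """CV of total RNAP from Eq. (12); p has keys 's_u', 's_b', 'r', 'T'."""
    eta = p["s_u"] / (p["s_u"] + p["s_b"])   # assumed definition
    beta = p["s_b"] / p["s_u"]               # assumed definition
    T_g = (p["s_u"] + p["s_b"]) * p["T"]
    mean = eta * p["r"] * p["T"]
    excess = 2 * beta / T_g * (1 - 1 / T_g + math.exp(-T_g) / T_g)
    return math.sqrt(1 / mean + excess)


def log_sensitivity(cv, params, name, rel_step=1e-6):
    """Central-difference estimate of (s / CV) * dCV/ds for parameter `name`."""
    s = params[name]
    h = s * rel_step
    up = dict(params, **{name: s + h})
    down = dict(params, **{name: s - h})
    return s / cv(params) * (cv(up) - cv(down)) / (2 * h)
```

A useful consistency check is that Eq. (12) depends on the parameters only through $s_u\langle T\rangle $, $s_b\langle T\rangle $, and $r\langle T\rangle $, so the sensitivities with respect to the three rates should sum to the sensitivity with respect to $\langle T\rangle $.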

In Table 1b, we report the logarithmic sensitivity of the coefficient of variation of total RNAP fluctuations, which is obtained from Eq. (12), to perturbations in the parameters $s_u$, $s_b$, r, and $\langle {}T\rangle {}$. Similarly, in Table 1c, we report the logarithmic sensitivity of the coefficient of variation of mature RNA fluctuations from Eq. (14) to perturbations in the parameters $s_u$, $s_b$, r, and $d_m$. In both cases, these sensitivities are calculated for parameter values estimated for five genes in yeast, as reported in Zenklusen et al. (2008); see Table 1a.

The following observations can be made regarding the sensitivity of the noise in total RNAP fluctuations: (i) for the two genes PDR5 and POL1 which spend most of their time in the inactive state due to $s_b \gg s_u$, $\mathrm {CV}$ is most sensitive to changes in the parameters $s_u$ and $\langle {}T\rangle {}$; (ii) for the genes DOA1, MDN1, and KAP104 which spend most of their time in the active state due to $s_u \gg s_b$, $\mathrm {CV}$ is most sensitive to changes in the parameters r and $\langle {}T\rangle {}$; (iii) the size of mature RNA fluctuations is found to be most sensitive to perturbations in $s_u$ and $d_m$ for PDR5 and POL1, and to perturbations in r and $d_m$ for the other three genes. We furthermore note that for both total RNAP and mature RNA, r is the least sensitive parameter for the genes which are mostly inactive, whereas it is among the most sensitive parameters for genes that are mostly active.

Table 1 Logarithmic sensitivity (LS) of the coefficient of variation CV of total RNAP and mature RNA fluctuations for five genes in yeast; see Sect. 2.4 for a discussion. (a) Parameter values from Supplemental Tables 2 and 4 in Zenklusen et al. (2008). The degradation rate $d_m$ of mature mRNA is estimated from the reported mean number of mature RNA, the parameters $s_u$, $s_b$, r, and Eq. (14) for the mean. (b) Logarithmic sensitivity of CV of total RNAP fluctuations. (c) Logarithmic sensitivity of CV of mature mRNA fluctuations. The most sensitive parameter and the next most sensitive one are marked in dark bold and italic, respectively


3 Approximate Distributions of Total RNAP and Mature RNA

Thus far, we have derived expressions for the first two moments of the distributions of total RNAP and mature RNA. Naturally, it would also be useful to derive closed-form expressions for the distributions themselves; such a derivation is, however, analytically intractable in general (Jahnke and Huisinga 2007) due to the presence of the catalytic reaction $G_{\mathrm{on}} \rightarrow G_{\mathrm{on}} + P_1$, which models initiation of the transcription process. Still, there are two special cases where analytical distributions are known: (i) when the elongation time is considered to be fixed, which corresponds to our model with $L \rightarrow \infty $ at constant $\langle T \rangle $ (Heng et al. 2016); (ii) when the elongation time is exponentially distributed, corresponding to our model with $L=1$, in which case the distribution of total RNAP is identical to the one which is derived from the telegraph model (Peccoud and Ycart 1995; Raj et al. 2006). While one may argue that the analytical distribution of RNAP for deterministic elongation times may well approximate the stochastic (finite-L) case, the issue remains that the exact solution is not given in terms of simple functions unless promoter switching is slow compared to initiation, elongation, and termination, in which case the solution reduces to a weighted sum of two Poisson distributions (Heng et al. 2016). Hence, it is generally very difficult to apply in practice, such as to infer parameters from data using a Bayesian approach. Moreover, to our knowledge, no exact solutions are known for the distribution of mature RNA in our model. In this section, we aim to devise a simple approximation for the distribution of total RNAP numbers in terms of the Negative Binomial (NB) distribution; these simple distributions have shown great flexibility in describing complex gene expression models with a large number of parameters (Cao and Grima 2020). Finally, by means of singular perturbation theory, we will obtain the distribution of mature RNA under the assumption that RNA polymerase elongation is faster than degradation of mature RNA.

3.1 Approximation of Total RNAP Distribution

We approximate the distribution of total RNAP transcribing the gene via a Negative Binomial distribution, as follows. The mean and variance of the Negative Binomial distribution NB(q, p) are given by $pq/(1-p)$ and $pq/(1-p)^2$, respectively. By assuming that these are equal to the exact mean and variance, respectively, of the total RNAP distribution, see Eq. (10), we obtain effective values for the parameters p and q:

$$\begin{aligned} n_\mathrm{tot}\sim \mathrm{NB}(q,p)\equiv {}\mathrm{NB}\bigg (\dfrac{\langle {}n_\mathrm{tot}\rangle {}^2}{\mathrm{Var}(n_\mathrm{tot})-\langle {}n_\mathrm{tot}\rangle {}},\dfrac{\mathrm{Var}(n_\mathrm{tot})-\langle {}n_\mathrm{tot}\rangle {}}{\mathrm{Var}(n_\mathrm{tot})} \bigg ). \end{aligned}$$

(20)
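The moment matching in Eq. (20) can be sketched in a few lines. The probability mass function used below, $P(n)=\Gamma (n+q)/[n!\,\Gamma (q)]\,(1-p)^q p^n$, is the standard Negative Binomial form whose mean and variance are $pq/(1-p)$ and $pq/(1-p)^2$; the function names are hypothetical, and the matching requires the variance to exceed the mean.

```python
import math


def nb_parameters(mean, var):
    """Effective NB parameters (q, p) from Eq. (20)."""
    if var <= mean:
        raise ValueError("moment matching needs var > mean (overdispersion)")
    q = mean**2 / (var - mean)
    p = (var - mean) / var
    return q, p


def nb_pmf(n, q, p):
    """NB probability of n, with mean p*q/(1-p) and variance p*q/(1-p)**2."""
    log_prob = (
        math.lgamma(n + q) - math.lgamma(q) - math.lgamma(n + 1)
        + q * math.log(1 - p) + n * math.log(p)
    )
    return math.exp(log_prob)
```

For example, `nb_parameters(4.0, 10.0)` returns parameters whose NB distribution has mean 4 and variance 10; in an application, the two arguments would be the mean and variance from Eq. (10).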

In Fig. 6, we show a comparison between the distributions of total RNAP obtained from SSA (dots) and the Negative Binomial approximation in Eq. (20) (solid lines). Our results are presented for two different values of the number of gene segments: $L=1$ (exponentially distributed elongation time; left column) and $L=50$ (quasi-deterministic elongation time; right column). Additionally, we rescale our gene inactivation rate as $s_b\mapsto {}s_b\epsilon $, and we present results for three different values of the parameter $\epsilon $: $10^{-3}$, the constitutive limit of the gene being mostly in the active state (top row); $10^{-1}$, where the gene spends almost equal amounts of time in the active and inactive states, with $s_b\approx {}s_u$ (middle row); and 1, the bursty limit, where the gene spends most of its time in the inactive state (bottom row).

We can make several observations, as follows. For both $L = 1$ and $L = 50$, the Negative Binomial approximation performs well for bursting and constitutive expression (top and bottom rows), whereas it is appreciably poor when expression is in between those two limits (middle row). Intuitively, this observation can be explained via the following reasoning. In the limits of the gene being mostly in the active state (constitutive expression) or the inactive state (bursty expression), the distribution of total RNAP is necessarily unimodal. However, when the gene spends a considerable amount of time in each state, the distribution is the sum of two conditional distributions which can manifest either as bimodality or as a wide unimodal distribution, neither of which can be captured by a Negative Binomial distribution. Assuming bursty expression, the Negative Binomial distribution is a more accurate approximation to the distribution obtained from SSA for $L = 1$ than it is for $L = 50$; the reason is that $L = 1$ corresponds to the telegraph model (Raj et al. 2006), in which case it can be proven analytically that the distribution reduces to a Negative Binomial in the limit of bursty expression. For constitutive expression, the Negative Binomial approximation is equally good for $L=1$ and $L=50$, as the distribution is necessarily Poissonian then and as it is well known that a Negative Binomial distribution can approximate a Poissonian to a high degree of accuracy. In summary, our results hence indicate that Eq. (20) yields a good approximation for the total RNAP distribution of bursty and constitutively expressed genes.

We also note from Fig. 6 that the comparison between the SSA distributions for $L = 1$ and $L = 50$, with equal mean elongation times, highlights the importance of modelling elongation with the correct distribution of elongation times for genes that are non-constitutive, i.e. for $\epsilon = 10^{-1}$ or $\epsilon =1$. In particular, if the elongation time is quasi-deterministic ($L = 50$), there appears to be a significant increase in the probability of observing zero total RNAP transcribing the gene compared to models with an exponentially distributed elongation time ($L = 1$).

3.2 Approximation of Mature RNA Distribution

Next, we apply singular perturbation theory to formally derive the distribution of mature RNA when the elongation rate is much larger than the degradation rate of mature RNA.

We start by defining $P_j(\vec {n};t)$ ($j=0,1$) as the probability of the state $\vec {n}=(n_1,\dots {},n_L,n)$ at time t while the gene is either active (0) or inactive (1). Note that $n_i$ is the number of RNAPs on gene segment i for $i=1,\dots ,L$, while n is the number of mature RNAs. The time evolution of the probabilities $P_{j}(\vec {n};t)$ can be described by a system of coupled CMEs:

$$\begin{aligned} \partial {}_{t}P{}_{0}= & {} s_uP{}_{1}-s_bP{}_{0} +r(\mathbb {E}_{n_1}^{-1}-1)P_{0} +k\sum _{i=1}^{L-1}\big (\mathbb {E}_{n_i}\mathbb {E}_{n_{i+1}}^{-1}-1\big )n_iP_{0} \\&+k\big (\mathbb {E}_{n_L}\mathbb {E}_{n}^{-1}-1\big )n_LP_{0}+d\sum _{i=1}^{L}(\mathbb {E}_{n_i}-1)n_iP_{0} +d_m(\mathbb {E}_{n}-1)nP{}_{0}, \\ \partial {}_{t}P_{1}= & {} s_bP_{0}-s_uP_{1} +k\sum _{i=1}^{L-1}\big (\mathbb {E}_{n_i}\mathbb {E}_{n_{i+1}}^{-1}-1\big )n_iP_{1} +k\big (\mathbb {E}_{n_L}\mathbb {E}_{n}^{-1}-1\big )n_LP_{1} \\&+d\sum _{i=1}^{L}(\mathbb {E}_{n_i}-1)n_iP_{1} +d_m(\mathbb {E}_{n}-1)nP_{1}, \end{aligned}$$

(21)

where $\mathbb {E}_{n_i}^c[f(\vec {n})]=f(n_1,n_2,\dots {},n_i+c,\dots {},n_{L},n)$, with $c\in \mathbb {Z}$, denotes the standard step operator. We assume that the elongation rate k is faster than the degradation rate $d_m$ of mature RNA, i.e. that $k/d_m\gg {}1$. Since $k=L/\langle T \rangle -d$, it follows that in the limit of deterministic elongation ($k \rightarrow \infty $), i.e. for $L \rightarrow \infty $ at constant mean elongation time $\langle T \rangle $, the condition $k/d_m\gg {}1$ is naturally satisfied.

In order to find an analytical expression for the propagator probabilities $P(\vec {n};t)$ that satisfy the system of CMEs in Eq. (21), we define the probability-generating function as $F=\sum _jF_j$, with $F_{j}(\vec {z};t)=\sum _{\vec {n}=\vec {0}}^{\infty {}}P_{j}({\vec {n}};t)\vec {z}^{\vec {n}}$; here, $\vec {z}=(z_1,\dots {},z_{L},z)$ is a vector of variables corresponding to the state $\vec {n}$. Given the equations for $P_{j}({\vec {n}};t)$ from Eq. (21), we obtain the following system of PDEs for the corresponding generating functions $F_{j}(\vec {z};t)$:

$$\begin{aligned} \begin{aligned} \mathbb {L}[F_{0}]&=s_uF_{1}-s_bF_{0}+r(z_1-1)F_{0}, \\ \mathbb {L}[F_{1}]&={}s_bF_{0}-s_uF_{1}, \end{aligned} \end{aligned}$$

(22)

where

$$\begin{aligned} \mathbb {L}=\partial {}_{t} +k\sum _{i=1}^{L-1}(z_{i}-z_{i+1})\partial {}_{z_{i}} +k(z_{L}-z)\partial {}_{z_{L}} +d\sum _{i=1}^{L}(z_i-1)\partial {}_{z_{i}} +d_m(z-1)\partial {}_{z} \nonumber \\ \end{aligned}$$

(23)

is a differential operator acting on the generating functions $F_0$ and $F_1$. Eq. (22) represents a system of coupled, linear, first-order partial differential equations (PDEs). Now, we introduce the new variables $u_i=z_i-1$ ($i=1,\dots {},L$) and $u=z-1$ to rewrite Eq. (22) as

$$\begin{aligned} \begin{aligned} \mathbb {L}[F_{0}]&=s_uF_{1}-s_bF_{0}+r u_1F_{0}, \\ \mathbb {L}[F_{1}]&=s_bF_{0}-s_uF_{1}; \end{aligned} \end{aligned}$$

(24)

here, the operator in Eq. (23) now takes the form

$$\begin{aligned} \mathbb {L}=\partial {}_{t} +k\sum _{i=1}^{L-1}(u_{i}-u_{i+1})\partial {}_{u_{i}} +k(u_{L}-u)\partial {}_{u_{L}} +d\sum _{i=1}^{L}u_i\partial {}_{u_{i}} +d_mu\partial {}_{u}. \end{aligned}$$

(25)

In order to find an analytical solution to Eq. (24), we rescale all rates and the time variable by the decay rate of mature RNA; then, we apply the method of characteristics, with s being the characteristic variable. The first characteristic equation gives $d_m(\hbox {d}t/\hbox {d}s)=1$, with solution $s\equiv {}t'=d_mt$; hence, we can use the variable $t'$ as the independent variable and thus convert the system of PDEs in Eq. (24) into a characteristic system of ordinary differential equations (ODEs),

$$\begin{aligned} {\dot{u}}_i&=(k/d_m)[u_i-u_{i+1}+(d/k) u_i] \qquad \text {for }i=1,\dots {},L-1, \end{aligned}$$

(26a)

$$\begin{aligned} {\dot{u}}_L&=(k/d_m)[u_L-u+(d/k) u_L], \end{aligned}$$

(26b)

$$\begin{aligned} {\dot{u}}&=u, \end{aligned}$$

(26c)

$$\begin{aligned} {\dot{F}}_{0}&=(s_u/d_m)F_{1}-(s_b/d_m)F_{0}+(r/d_m) u_1F_{0}, \end{aligned}$$

(26d)

$$\begin{aligned} {\dot{F}}_{1}&=(s_b/d_m)F_{0}-(s_u/d_m)F_{1}, \end{aligned}$$

(26e)

where the overdot denotes differentiation with respect to $t'$. The existence of an integral-form solution to Eq. (26) follows from the fact that the reaction scheme in Fig. 1 contains first-order reactions only. Under the assumption that $k\gg {}d_m$, we define $\varepsilon =d_m/k$; then, we apply Geometric Singular Perturbation Theory (GSPT) (Fenichel 1979; Jones 1995), with $0<\varepsilon \ll {}1$ as the (small) singular perturbation parameter. We hence separate the system in Eq. (26) into fast and slow dynamics, which will allow us to find an asymptotic approximation for $F_{0}$ and $F_{1}$ in steady state. A brief introduction to GSPT can be found in ‘Appendix E’. Given the above definition of $\varepsilon $, Eqs. (26a) and (26b), the governing equations for $u_i$ in the ‘slow system’, become

$$\begin{aligned} \begin{aligned} \varepsilon {\dot{u}}_i&=u_i-u_{i+1}+(d/k) u_i \qquad \text {for }i=1,\dots {},L-1,\\ \varepsilon {\dot{u}}_L&=u_L-u+(d/k) u_L, \end{aligned} \end{aligned}$$

(27)

where $u_i$ ($i=1,\dots {},L$) are the fast variables and u, $F_{0}$, and $F_{1}$ are the slow ones. Setting $\varepsilon =0$ in Eq. (27), we can express the variables $u_i$ as $u_i=\mu \cdot {}u_{i+1}$ for $i=1,\dots ,L-1$ and $u_L=\mu \cdot {}u$, with $\mu =k/(k+d)$. Hence, we can write the variable $u_1$ as $u_1=\mu {}^L\cdot {}u$. Next, given Eq. (26c), we apply the chain rule, with $\hbox {d}t'=\hbox {d}u/u$, to rewrite Eqs. (26d) and (26e) as

$$\begin{aligned} F'_0d_mu&=s_uF_1-s_bF_0+r \mu ^L uF_0, \end{aligned}$$

(28a)

$$\begin{aligned} F'_1d_mu&=s_bF_0-s_uF_1, \end{aligned}$$

(28b)

where the prime now denotes differentiation with respect to u. Solving Eq. (28a) for $F_1$ and substituting the result into Eq. (28b), we obtain the second-order ODE

$$\begin{aligned} d_m^2uF''_0+d_m(d_m+s_b+s_u-r \mu ^L u)F'_0-r \mu {}^L(d_m+s_u)F_0=0 \end{aligned}$$

(29)

for $F_0(u)$. Eq. (29) is a confluent hypergeometric differential equation (Kummer’s equation) (Digital Library of Mathematical Functions 2020b) which admits the solution

$$\begin{aligned} F_0(u)=C\cdot {}_1F_1\Big (\dfrac{d_m+s_u}{d_m};\dfrac{d_m+s_b+s_u}{d_m};\dfrac{r}{d_m}\mu {}^L u\Big ), \end{aligned}$$

(30)

where ${}_1F_1$ denotes the confluent hypergeometric function; here, we consider only one of two independent fundamental solutions of Kummer’s differential equation, as we are seeking a solution in steady state where the variable u is bounded. The constant C in Eq. (30) is a constant of integration that is determined from the normalisation condition on the full generating function: $F=F_0+F_1$. From Eq. (28), one finds that F satisfies

$$\begin{aligned} F'=\dfrac{r}{d_m}\mu {}^LF_0. \end{aligned}$$

(31)

Making use of Eq. (31) and applying the normalisation condition $F|_{u=0}=1$, we find that the generating function in steady state reads

$$\begin{aligned} F(z)={}_1F_1\Big (\dfrac{s_u}{d_m};\dfrac{s_b+s_u}{d_m};\dfrac{r}{d_m} \mu {}^L(z-1)\Big ). \end{aligned}$$

(32)

The probability distribution P(n) of mature RNA can be found from the formula

$$\begin{aligned} P(n)=\frac{1}{n!}\frac{d^n}{dz^n}F(z)|_{z=0}, \end{aligned}$$

which yields the analytical expression

$$\begin{aligned} P(n)=\dfrac{1}{n!}\dfrac{(s_u/d_m)_n}{((s_b+s_u)/d_m)_n}\Big (\dfrac{r}{d_m}\Big )^n(\mu ^{L})^n{}_1F_1\big (\tfrac{s_u}{d_m}+n;\tfrac{s_b+s_u}{d_m}+n;-\tfrac{r}{d_m}\mu ^L\big ), \end{aligned}$$

(33)

where $(\cdot )_n$ is the Pochhammer symbol, as before. Note that the mean and variance of mature mRNA, as calculated from the distribution in Eq. (33), agree exactly with Eqs. (2c) and (4f) in the limit of fast elongation ($k\rightarrow {}\infty $). Note also that the solution in Eq. (33) depends on the parameter $\mu {}^L$, which represents the survival probability of an RNAP molecule, i.e. the probability that RNAP will not prematurely detach from the gene. Finally, we take the limit of deterministic elongation, letting $L \rightarrow \infty $ at constant $\langle T \rangle $, which leads to

$$\begin{aligned} P(n)=\dfrac{1}{n!}\dfrac{(s_u/d_m)_n}{((s_b+s_u)/d_m)_n}\Big (\dfrac{r}{d_m}\Big )^n \hbox {e}^{-nd\langle T \rangle }{}_1F_1\big (\tfrac{s_u}{d_m}+n;\tfrac{s_b+s_u}{d_m}+n;-\tfrac{r}{d_m}\hbox {e}^{-d\langle T \rangle }\big ). \end{aligned}$$

(34)

Note that in the limit of no premature detachment ($d = 0$), Eq. (34) is precisely equal to the distribution of mature RNA predicted by the telegraph model, which is in wide use in the literature (Raj et al. 2006). Hence, our perturbative approach can be seen as a means to formally derive the conventional telegraph model of gene expression starting from a more fundamental and microscopic model. In Fig. 7, we verify our analytical solution with stochastic simulation for two different genes in yeast. We also note that, for nonzero premature detachment rates ($d \ne 0$), Eq. (34) is the steady-state solution predicted by the telegraph model, with parameter r renormalised to $r \hbox {e}^{-d\langle T \rangle }$; that is to be expected, as the latter is the rate at which RNAPs undergo termination, leading to mature RNAs.
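The distribution in Eq. (34) can be evaluated with the standard library alone, as in the sketch below. Two choices made here are assumptions of this illustration rather than part of the derivation: the Pochhammer symbols are taken with arguments in units of $d_m$, as implied by differentiating the generating function in Eq. (32), and Kummer's transformation ${}_1F_1(a;b;-x)=\hbox {e}^{-x}{}_1F_1(b-a;b;x)$ is applied so that the power series contains only positive terms. The function names and parameter values are hypothetical.

```python
import math


def hyp1f1_series(a, b, x, tol=1e-15, max_terms=100000):
    """Power series of 1F1(a; b; x); intended for a, b, x > 0."""
    term, total = 1.0, 1.0
    for k in range(max_terms):
        term *= (a + k) / (b + k) * x / (k + 1)
        total += term
        if abs(term) < tol * abs(total):
            break
    return total


def mature_rna_pmf(n, s_u, s_b, r, d_m, d, mean_T):
    """Probability of n mature RNAs from Eq. (34), evaluated in log space."""
    a = s_u / d_m
    b = (s_u + s_b) / d_m
    c = r / d_m * math.exp(-d * mean_T)   # renormalised r / d_m
    log_prefactor = (
        math.lgamma(a + n) - math.lgamma(a)       # log (a)_n
        - math.lgamma(b + n) + math.lgamma(b)     # -log (b)_n
        + n * math.log(c) - math.lgamma(n + 1)
        - c                                       # from Kummer's transformation
    )
    # 1F1(a+n; b+n; -c) = exp(-c) * 1F1(b-a; b+n; c)
    return math.exp(log_prefactor) * hyp1f1_series(b - a, b + n, c)
```

Summing `mature_rna_pmf` over n should give a value close to one, and the resulting mean should match $\eta \rho \hbox {e}^{-T_d}$ from Eq. (13) if $\eta =s_u/(s_u+s_b)$ and $\rho =r/d_m$ are assumed.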

4 Statistics of Fluorescent Nascent RNA Signal

Thus far, we have determined the statistics of the total number of RNAP transcribing the given gene; these are also the statistics of the number of nascent RNA molecules. However, in experiments using single-molecule fluorescence in situ hybridisation [smFISH (Heng et al. 2016)], molecule numbers of nascent RNA cannot be directly determined. Rather, the experimentally measured RNA ‘abundance’ is the fluorescent signal emitted by oligonucleotide probes bound to the RNA. Since the length of the nascent RNA grows as RNAP moves away from the promoter, it follows that we must account for the increase in the fluorescent signal as elongation proceeds.

In this section, we take into account these experimental details to obtain closed-form expressions for the mean and variance of the fluorescent signal of local and total nascent RNA. We assume that the signal from nascent RNA on the ith gene segment is given by $r_i=(\nu {}/L)in_i$ for $i=1,\dots {},L$, where $\nu $ is some experimental constant; the value of the parameter $(\nu {}/L)i$ is increasing with i, which models the fact that the fluorescent signal becomes stronger as RNAP moves along the gene. The formula for the mean fluorescent signal at gene segment i is then given by $\langle {}r_i\rangle {}=(\nu {}/L)i\langle {}n_i\rangle $, where $\langle {}n_i\rangle $ follows from Eq. (2b); the covariance of two fluorescent signals along the gene, $r_i$ and $r_j$ ($i,j=1,\dots ,L$), is given by $\mathrm{Cov}(r_i,r_j)=(\nu /L)^2ij\mathrm{Cov}(n_i,n_j)$, where $\mathrm{Cov}(n_i,n_j)$ is obtained from Eq. (4d). In Fig. 8a, b, we plot the mean and Fano factor of the local signal as a function of the gene segment i; note the contrast between the statistics of the fluorescent signal and the corresponding statistics of local RNAP—which is the statistics of nascent RNA—shown in Fig. 2a, c.

Similarly, denoting by $r_\mathrm{tot}=\sum _{i=1}^{L}r_i$ the total fluorescent signal across the gene, we find the following expressions for the steady-state mean $\langle r_\mathrm{tot}\rangle =\sum _{i=1}^{L}\langle {}r_i\rangle {}$ and the steady-state variance $\mathrm{Var}(r_\mathrm{tot})=\sum _{i,j=1}^{L}\mathrm{Cov}(r_i,r_j)$:

$$\begin{aligned} \begin{aligned} \langle r_\mathrm{tot}\rangle&= \nu \eta \rho _k\mu \dfrac{\mu ^L[L\mu -(L+1)]+1}{L(\mu -1)^2}, \\ \mathrm{Var}(r_\mathrm{tot})&=\Big (\dfrac{\nu }{L}\Big )^2\eta \rho {}_k \sum _{i=1}^{L}i^2\mu ^{i}+ \Big (\dfrac{\nu }{L}\Big )^2\alpha \beta (\eta \rho {}_k)^2\sum _{i,j=1}^{L}ij\cdot {}\mu ^{i+j}\cdot {}{f}_{ij}. \end{aligned} \end{aligned}$$

(35)

For a detailed derivation of the variance in Eq. (35), see Eq. (F.1) in ‘Appendix F’; see also ‘Appendix G’ for the corresponding expressions in the bursty, constitutive, and deterministic elongation limits. In Fig. 8c, d, we show the mean and Fano factor of the total signal as a function of the number of gene segments (L); as above, we note the contrast between the statistics of the fluorescent signal and the corresponding statistics of total RNAP (that is, of total nascent RNA) shown in Fig. 4c, d.
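The closed-form mean in Eq. (35) can be checked against the direct sum $\sum _{i=1}^{L}(\nu /L)\,i\,\langle n_i\rangle $. The sketch below assumes $\langle n_i\rangle =\eta \rho _k\mu ^i$, which is the form consistent with the mean in Eq. (10) but is not restated in this section; the function names and test values are hypothetical.

```python
def mean_signal_direct(nu, eta, rho_k, mu, L):
    """Mean total signal as a direct sum over gene segments."""
    return sum(nu / L * i * eta * rho_k * mu**i for i in range(1, L + 1))


def mean_signal_closed_form(nu, eta, rho_k, mu, L):
    """Mean total signal from Eq. (35); requires mu != 1, i.e. d > 0."""
    return (
        nu * eta * rho_k * mu
        * (mu**L * (L * mu - (L + 1)) + 1)
        / (L * (mu - 1) ** 2)
    )
```

The two functions should agree to within rounding error for any $0<\mu <1$; for $\mu $ very close to 1, the direct sum is numerically safer because the closed form divides by $(\mu -1)^2$.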

Hence, the calculation of the statistics of the number of nascent RNAs from the raw signal intensity presents a challenge and has to be approached carefully. The expressions presented above allow for the inference of transcriptional parameters from the first two moments of the fluorescent signal by means of moment-based inference techniques (Zechner et al. 2012). Quantitative information about nascent RNA can also be obtained from electron micrograph images (El Hage et al. 2010), which avoids the challenges presented by smFISH.

5 Model Extension with Pausing of RNAP

Thus far, we have studied a model where RNAPs do not pause as they move along the gene. A natural extension is provided by a modified model in which RNAPs pause along the gene at random sites and elongation is characterised by three processes: forward hopping, pausing, and unpausing of RNAP. The motivation for studying this extended model, which has recently been considered via stochastic simulation in Ali et al. (2020), is that experiments have revealed that RNAP exhibits pauses of varying duration, typically on the timescale of a few seconds (Forde et al. 2002; Adelman et al. 2002).

5.1 Closed-Form Expressions for Moments of Local RNAP Fluctuations

We extend the model described in Fig. 1 by assuming that the RNAP on gene segment i can switch between a non-paused (actively moving) state $P_i$ and a paused state $\bar{P}_i$. The actively moving state $P_i$ switches to $\bar{P}_i$ with rate $r_p$, while the reverse reaction occurs with rate $r_a$. Premature detachment of RNAP occurs with rate $d_a$ from the actively moving state and with rate $d_p$ from the paused state. The resulting extended model is illustrated in Fig. 9a. In ‘Appendix A’, we derive the mean and variance of the corresponding elongation time, which is no longer Erlang distributed, in contrast to the model without pausing. Furthermore, we find two interesting properties of the squared coefficient of variation ${\mathrm{CV}}_T^2$ of the elongation time: (i) in the limit of large L at constant mean elongation time, ${\mathrm{CV}}_T^2$ does not tend to zero, which implies that elongation is not deterministic; (ii) for small rates of premature detachment, ${\mathrm{CV}}_T^2$ is at its maximum when $r_p \approx r_a$, i.e. when RNAP spends roughly half of its time in the paused state. See ‘Appendix A’ for details and Fig. 9b for a confirmation through stochastic simulation.

Proposition 3

Let the number of RNAP molecules in the active state $P_i$ be denoted by $n^a_i$, let the number of molecules in the paused state $\bar{P}_i$ be $n^p_i$, and let the number of molecules of mature RNA be denoted by n. Let $\sigma =r_p/r_a$ be the ratio of the pausing and activation rates, let $\pi _{r_a}=r_a/(r_a+d_p)$ be the probability of RNAP switching to the actively moving state from the paused state, and let $\pi _{d_p}=d_p/(r_a+d_p)$ be the probability of premature RNAP detachment from the paused state. Furthermore, define the new parameters ${\tilde{\mu }}=k/(k+d_a+r_p\pi _{d_p})$ and ${\lambda }=\sigma \pi _{r_a}$.

Then, it follows that the steady-state mean numbers of RNAP molecules in the active and paused states on gene segment i ($i=1,\dots ,L$) are given by

$$\begin{aligned} \langle {}{n}^a_i\rangle {}= \eta {}\rho {}_k\tilde{\mu {}}^{i} \quad \text {and}\quad \langle {}{n}^p_i\rangle {} = \langle n^a_i\rangle \lambda . \end{aligned}$$

(36)

Hence, the total mean number of RNAP molecules on each gene segment i reads

$$\begin{aligned} \langle {}{n}_i\rangle {}=\langle {}{n}^{a}_i\rangle {}+\langle {}{n}^{p}_i\rangle {}=\langle {}{n}^a_i\rangle {}(1+\lambda ). \end{aligned}$$

(37)

The proof of Proposition 3 can be found in ‘Appendix H’. Note that in the limit of no pausing, i.e. for $r_p = 0$, Eq. (37) reduces to the expression for the mean of RNAP reported in Eq. (2b).
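The quantities in Proposition 3 are simple enough to evaluate directly. The following minimal sketch computes ${\tilde{\mu }}$, $\lambda$, and the local means of Eqs. (36) and (37) from the rates; the function names and the argument `eta_rho_k` (standing for the product $\eta\rho_k$, which is defined outside this section) are hypothetical.

```python
def pausing_parameters(k, r_a, r_p, d_a, d_p):
    """Return (mu_tilde, lam) as defined in Proposition 3."""
    pi_ra = r_a / (r_a + d_p)   # probability of unpausing before detaching
    pi_dp = d_p / (r_a + d_p)   # probability of detaching from the paused state
    sigma = r_p / r_a           # ratio of pausing and activation rates
    mu_tilde = k / (k + d_a + r_p * pi_dp)
    lam = sigma * pi_ra
    return mu_tilde, lam


def local_means(eta_rho_k, k, r_a, r_p, d_a, d_p, L):
    """Mean active, paused, and total RNAP per segment i = 1..L, Eqs. (36)-(37)."""
    mu_tilde, lam = pausing_parameters(k, r_a, r_p, d_a, d_p)
    active = [eta_rho_k * mu_tilde**i for i in range(1, L + 1)]
    paused = [lam * a for a in active]
    total = [a + p for a, p in zip(active, paused)]
    return active, paused, total
```

Setting `r_p = 0` gives `lam = 0` and `mu_tilde = k / (k + d_a)`, so the total mean coincides with the active mean, which is the no-pausing limit mentioned above.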

Proposition 4

Let $\tau _{r_a}=1/r_a$ be the timescale of RNAP activation from the paused state, let $\tau _{d_p}=1/d_p$ be the timescale of premature termination of paused RNAP, let $\tau _p=1/(k+d_a)$ be the typical time that an actively moving RNAP spends on a gene segment, and let $\tau _{pp}=1/(r_a+d_p)$ be the typical time spent in the paused state. Furthermore, define the new parameters $\lambda _{r_p}=\pi _{r_p}/(1-\pi _{r_p})$, where $\pi _{r_p}=r_p/(r_p+k+d_a)$ is the probability of the actively moving RNAP switching to the paused state, as well as

$$\begin{aligned} \begin{aligned} \omega _{r_a}=\dfrac{\pi _{r_a}\tau _g}{\pi _{r_a}\tau _{r_a}+\tau _g},\quad {} {\tilde{\alpha }}{}=\dfrac{\tau _g+\lambda _{r_p}\pi _{d_p}\tau _g}{\tau _g+\tau _p+\lambda _{r_p}\tau _g(1-\omega _{r_a})}, \quad {}\text {and }\quad \omega =\dfrac{\tau _g}{\tau _{pp}+\tau _g}. \end{aligned} \nonumber \\ \end{aligned}$$

(38)

Assume that the elongation rate is faster than the rates of RNAP pausing, activation, and premature termination, i.e. that $k\gg {}r_a,r_p,d_a,d_p$. Then, it follows that to leading order in 1/k, asymptotic expressions for the variances and covariances of molecule number fluctuations of active and paused RNAP are given by

$$\begin{aligned} \begin{aligned}&\mathrm{Cov}({n}^a_i,{n}^a_j)=\delta _{ij} \langle {}{n}^a_i\rangle {}+\langle {}{n}^a_i\rangle {}\langle {}{n}^a_j\rangle {}{\tilde{\alpha }}{}\beta {}\cdot {}{g}^{aa}_{ij},&\qquad {}{} \text {where } {g}^{aa}_{ij}&={g}^{aa}(i,j)+{g}^{aa}(j,i), \\&\mathrm{Cov}({n}^a_i,{n}^p_j)=\langle {}{n}^a_i\rangle {}\langle {}{n}^p_j\rangle {}{\tilde{\alpha }}{}\beta {}\cdot {}{g}^{ap}_{ij},&\qquad {}{} \text {where } {g}^{ap}_{ij}&=\omega {\tilde{\alpha }}{}^{j-1}, \\&\mathrm{Cov}({n}^p_i,{n}^a_j)=\langle {}{n}^p_i\rangle {}\langle {}{n}^a_j\rangle {}{\tilde{\alpha }}{}\beta {}\cdot {}{g}^{pa}_{ij},&\qquad {}{} \text {where } {g}^{pa}_{ij}&=\omega {\tilde{\alpha }}{}^{i-1}, \\&\mathrm{Cov}({n}^p_i,{n}^p_j)=\delta _{ij} \langle {}{n}^p_i\rangle {}+\langle {}{n}^p_i\rangle {}\langle {}{n}^p_j\rangle {}{\tilde{\alpha }}{}\beta {}\cdot {}{g}^{pp}_{ij},&\qquad {}{} \text {where } {g}^{pp}_{ij}&=({g}^{ap}_{ij}+{g}^{pa}_{ij})/2; \end{aligned} \end{aligned}$$

(39)

here, $i,j=1,2,\dots {},L$ and

$$\begin{aligned} {g}^{aa}(i,j) =\dfrac{{\tilde{\alpha }}{}^{i+j-1}}{(2{\tilde{\alpha }}{}-1)^{i}} +\dfrac{1}{2^{i+j-1}}\left( {\begin{array}{c}i+j-1\\ i\end{array}}\right) \Big [ 1-\dfrac{2{\tilde{\alpha }}{}-1}{2{\tilde{\alpha }}{}}{}_2F_1\big (1,i+j;j;\tfrac{1}{2{\tilde{\alpha }}{}}\big ) \Big ]. \end{aligned}$$

These results are proved in full in ‘Appendix H’. From ‘Appendix A’, we also have that the mean elongation time in the pausing model is given by

$$\begin{aligned} \langle T \rangle = L\frac{(r_a+d_p)^2+r_a r_p}{(r_a+d_p) [(k+d_a)(r_a+d_p)+d_p r_p]}. \end{aligned}$$

(40)

Solving Eq. (40) for the elongation rate k, we find that in the limit of $L \rightarrow \infty $ taken at constant mean elongation time, k tends to infinity and hence is much larger than $r_a$, $r_p$, $d_a$, and $d_p$, which implies that the results of Proposition 4 hold naturally in that limit.
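The inversion of Eq. (40) is elementary, and a short sketch makes the scaling of k with L explicit. The function names below are illustrative; the second function simply rearranges Eq. (40) and is only meaningful when the returned rate is positive.

```python
def mean_elongation_time(L, k, r_a, r_p, d_a, d_p):
    """Mean elongation time <T> of the pausing model, Eq. (40)."""
    a = r_a + d_p
    return L * (a**2 + r_a * r_p) / (a * ((k + d_a) * a + d_p * r_p))


def elongation_rate_for_mean_time(L, T_mean, r_a, r_p, d_a, d_p):
    """Solve Eq. (40) for k at a prescribed mean elongation time T_mean."""
    a = r_a + d_p
    # bracket equals (k + d_a) * (r_a + d_p) + d_p * r_p
    bracket = L * (a**2 + r_a * r_p) / (a * T_mean)
    return (bracket - d_p * r_p) / a - d_a
```

Since `bracket` is proportional to L, the returned k grows linearly with L at fixed `T_mean`, in line with the statement that k tends to infinity in the long-gene limit.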

5.2 Approximate Distributions of Total RNAP and Mature RNA

Negative Binomial approximation of total RNAP distribution: We define the total number of RNAP molecules as ${n}_\mathrm{tot}=\sum _{i=1}^{L}{n}_i$. It then immediately follows from Eq. (37) that the mean of the total RNAP distribution in the pausing model is given by

$$\begin{aligned} \begin{aligned} \langle {n}_\mathrm{tot}\rangle =\eta {}\rho _k(1+\lambda ){\tilde{\mu }}\dfrac{{\tilde{\mu }}^L-1}{{\tilde{\mu }}-1}. \end{aligned} \end{aligned}$$

(41)

It can also be shown that the variance of total RNAP fluctuations reads

$$\begin{aligned} \mathrm{Var}(n_\mathrm{tot})=\langle {}n_\mathrm{tot}\rangle {}+(\eta {}\rho _k)^2{\tilde{\alpha }}{}\beta {}\bigg [2\sum _{i,j=1}^{L}{g}^{aa}(i,j)+ \lambda (2+\lambda )\omega {}L\dfrac{{\tilde{\alpha }}{}^L-1}{{\tilde{\alpha }}{}-1}\bigg ]; \end{aligned}$$

(42)

see ‘Appendix H’. Next, we approximate the distribution of total RNAP by a Negative Binomial distribution whose mean and variance match those just derived, i.e. we consider Eq. (20) with the mean and variance of the total RNAP distribution now given by Eqs. (41) and (42), respectively. The resulting approximate Negative Binomial distribution is compared with the distribution obtained from SSA in Fig. 10a, b for two different yeast genes, PDR5 and DOA1. The results verify that our approximation is accurate provided the elongation rate k is significantly larger than the rates $r_a$, $r_p$, $d_a$, and $d_p$, as assumed in Proposition 4.
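Moment matching of a Negative Binomial distribution can be sketched in a few lines. Eq. (20) is not reproduced in this section, so the parametrisation below (number of failures before the $r_\mathrm{nb}$th success, with success probability p) is an assumption and may differ in notation from the one used there; the function names are likewise hypothetical. The sketch requires the variance to exceed the mean.

```python
import math


def nb_parameters(mean, variance):
    """Moment-matched Negative Binomial parameters (r_nb, p); needs variance > mean."""
    if variance <= mean:
        raise ValueError("Negative Binomial matching requires variance > mean")
    p = mean / variance
    r_nb = mean**2 / (variance - mean)
    return r_nb, p


def nb_pmf(n, r_nb, p):
    """P(n) = Gamma(n + r_nb) / (n! Gamma(r_nb)) * p**r_nb * (1 - p)**n."""
    log_p = (math.lgamma(n + r_nb) - math.lgamma(n + 1) - math.lgamma(r_nb)
             + r_nb * math.log(p) + n * math.log(1.0 - p))
    return math.exp(log_p)
```

In use, `mean` and `variance` would be the values of Eqs. (41) and (42) evaluated at the parameters of interest, and `nb_pmf` would then be compared with a histogram of simulated total RNAP numbers.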

Perturbative approximation of mature RNA distribution: We can apply singular perturbation theory to formally derive the distribution of mature RNA, assuming that $k/d_m\gg {}1$ and $r_a/d_m\gg {}1$. Following the derivation in Sect. 3.2, we find the following analytical expression for the steady-state probability distribution of mature RNA:

$$\begin{aligned} P(n)=\dfrac{1}{n!}\dfrac{(s_u/d_m)_n}{((s_b+s_u)/d_m)_n}\Big (\dfrac{r}{d_m}\Big )^n\big ({\tilde{\mu }}^{L}\big )^n{}_1F_1\Big (\dfrac{s_u}{d_m}+n;\dfrac{s_b+s_u}{d_m}+n;-\dfrac{r}{d_m}{\tilde{\mu }}^L\Big ); \end{aligned}$$

(43)

see ‘Appendix I’ for details. Note that the solution in Eq. (43) is dependent on the parameter ${\tilde{\mu }}{}^L$, which gives the probability that an RNAP molecule does not prematurely detach before termination; see ‘Appendix A’. Also, note that in the limit of zero premature termination, i.e. for $d_a=0=d_p$, Eq. (43) is identical to the distribution of mature RNA predicted by the telegraph model. Finally, by solving Eq. (40) for k, then substituting the resulting expression into Eq. (43) and taking the long-gene limit of $L \rightarrow \infty $ at constant $\langle T \rangle $, we obtain that the probability distribution of mature RNA has the same functional form as in Eq. (43), albeit with

$$\begin{aligned} \lim _{L\rightarrow {}\infty }{\tilde{\mu }}^L=\hbox {e}^{-\psi \langle {}T\rangle {}}, \quad {}{} \text {where } \psi =\dfrac{d_a+r_p\pi _{d_p}}{1+\sigma \pi _{r_a}^2}. \end{aligned}$$

(44)

Note that Eqs. (43) and (44) equal the steady-state solution predicted by the telegraph model, with the initiation rate r renormalised to $r{\tilde{\mu }}^L$ or $r \hbox {e}^{-\psi \langle T \rangle }$, respectively. In Fig. 10c, d, we verify the accuracy of our analytical solution using stochastic simulation for two different genes in yeast. Note that a change in the pausing rate $r_p$ has relatively little effect on the distribution of mature RNA, as compared to the effect on the distribution of total RNAP; cf. panels (a) and (b) of Fig. 10 in comparison with panels (c) and (d), respectively.
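The telegraph-type distribution with renormalised initiation rate can be evaluated without special-function libraries. The sketch below is one possible implementation under stated assumptions: the Pochhammer symbols are taken with the dimensionless arguments $s_u/d_m$ and $(s_u+s_b)/d_m$, as in the standard telegraph-model solution, and Kummer's transformation ${}_1F_1(a;b;-z)=\hbox {e}^{-z}{}_1F_1(b-a;b;z)$ is used so that the series has only non-negative terms. The function names and the argument `mu_tilde_L` (the survival probability ${\tilde{\mu }}^L$ or its long-gene limit) are hypothetical.

```python
import math


def hyp1f1_series(a, b, z, tol=1e-15, max_terms=10000):
    """Power series of 1F1(a; b; z), intended for a >= 0, b > 0, z >= 0."""
    term = 1.0
    total = 1.0
    for m in range(max_terms):
        term *= (a + m) / (b + m) * z / (m + 1)
        total += term
        if term <= tol * total:
            break
    return total


def mature_rna_pmf(n, s_u, s_b, r, d_m, mu_tilde_L):
    """Telegraph-type P(n) with the initiation rate r renormalised to r * mu_tilde_L."""
    a = s_u / d_m
    b = (s_u + s_b) / d_m
    rho = (r / d_m) * mu_tilde_L          # must be positive
    log_prefactor = (n * math.log(rho) - math.lgamma(n + 1)
                     + math.lgamma(a + n) - math.lgamma(a)
                     - math.lgamma(b + n) + math.lgamma(b) - rho)
    # Kummer: 1F1(a+n; b+n; -rho) = exp(-rho) * 1F1(b-a; b+n; rho)
    return math.exp(log_prefactor) * hyp1f1_series(b - a, b + n, rho)
```

The probabilities returned by `mature_rna_pmf` should sum to one, and their mean should equal `rho * a / b`, i.e. the renormalised initiation rate over $d_m$ multiplied by $s_u/(s_u+s_b)$.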

Table 2 Summary of main results


Table 3 Definition of parameters and functions


6 Summary and Conclusion

In this paper, we have analysed a detailed stochastic model of transcription. Our model extends previous analytical work (Choubey et al. 2015; Heng et al. 2016) by (i) taking into account salient processes, such as premature detachment and pausing of RNAP, that were previously not considered analytically; (ii) deriving explicit expressions for the mean and variance of RNAP numbers (nascent RNA) on gene segments as well as on the entire gene; (iii) deriving explicit expressions for the mean and variance of the fluorescent nascent RNA signal obtained from smFISH and identifying differences between the statistics thereof and those of direct measurements of nascent RNA; and (iv) finding approximate distributions of total nascent RNA fluctuations on a gene, without assuming slow promoter switching. A number of interesting observations from our work include the following:

(i)
When the premature detachment rate of RNAP is nonzero and gene expression is bursty, the coefficient of variation of local RNAP fluctuations can either decrease or increase with distance from the promoter. By contrast, when expression is constitutive, the coefficient of variation increases monotonically with distance from the promoter. Other statistical measures such as the mean, Fano factor, and correlation coefficient of local RNAP numbers decrease monotonically with distance from the promoter.
(ii)
In the limits of bursty expression, deterministic elongation, and no premature detachment or pausing, the Fano factor of total nascent RNA equals $1 + 2b$, whereas that of mature RNA is $1 + b$, where b denotes the mean burst size. An implication is that the telegraph model will result in an overestimate of the mean burst size from nascent RNA data by a factor of 2. Another implication is that deviations from Poisson fluctuations are more apparent in data for nascent RNA than they are for mature RNA. One can further state the following relationship: the Fano factor of nascent RNA equals twice the Fano factor of mature RNA, minus 1. If expression is non-bursty, then the Fano factor of nascent RNA can be larger or smaller than that of mature RNA, as determined by the condition in Eq. (19).
(iii)
For genes characterised by bursty expression, the sensitivity of the noise in total RNAP fluctuations is highest to perturbations in the gene activation rate and the mean elongation time; for constitutive genes, the most sensitive parameters are the initiation rate and the mean elongation time.
(iv)
A Negative Binomial distribution, parameterised with the expressions for the mean and variance of total nascent RNA derived here, provides a good approximation to the true distribution of total nascent RNA fluctuations on a gene when expression is either bursty or constitutive; the approximation is not accurate when the gene spends roughly equal amounts of time in the active and inactive states. We show that the distribution of nascent RNA is highly sensitive to the distribution of elongation times. In particular, if the elongation time is assumed to be exponentially distributed, as is implicitly assumed by telegraph models of nascent RNA, then the probability of observing zero RNA is much lower than if the elongation time is assumed to be fixed.
(v)
Using geometric singular perturbation theory (GSPT), we have rigorously proven that, in the limit of deterministic (or fast) elongation with neither pausing nor premature detachment, the steady-state distribution of mature RNA in our model is identical to that in the telegraph model (Raj et al. 2006). Consideration of pausing and premature detachment leads to a distribution that can also be obtained from a telegraph model with appropriately renormalised parameters.

A summary of the main theoretical results can be found in Table 2, with all requisite parameters and functions defined in Table 3. The main limiting assumption of our theoretical approach is that the initiation rate is slow enough that RNAP molecules do not frequently collide with each other while moving along the gene. Hence, the expressions we have derived are reasonable for all but the strongest promoters, which are characterised by very fast initiation rates. We anticipate that approximate closed-form expressions for the corresponding moments can also be derived when volume exclusion between RNAPs is taken into account, by a modification of methods previously devised to understand molecular movement and kinetics in crowded conditions (Cianci et al. 2016; Smith et al. 2017). It is also possible to extend our model by including translation of mature RNA to protein; one can then again apply GSPT to derive distributions for protein numbers in the limit of RNA decaying much faster than protein; however, given item (v) above, we anticipate that the resulting protein distribution will be very similar to that derived from models that do not explicitly take into account nascent RNA (Shahrezaei and Swain 2008; Popović et al. 2016). Further research is required to develop simple approximations of the nascent RNA distribution that are accurate independently of the ratio of gene switching rates. Finally, given the strong recent interest in the development of statistical inference techniques in molecular biology (Gorin et al. 2020; Zechner et al. 2012; Öcal et al. 2019), we expect that our closed-form expressions for the moments and distributions of nascent and mature RNA will be useful for developing computationally efficient and accurate methods for estimating transcriptional parameters.

References

Adelman K, La Porta A, Santangelo TJ, Lis JT, Roberts JW, Wang MD (2002) Single molecule analysis of RNA polymerase elongation reveals uniform kinetic behavior. Proc Nat Acad Sci 99(21):13538–13543
Ali MZ, Choubey S, Das D, Brewster RC (2020) Probing mechanisms of transcription elongation through cell-to-cell variability of RNA polymerase. Biophys J 118(7):1769–1781
Brouwer I, Lenstra TL (2019) Visualizing transcription: key to understanding gene expression dynamics. Curr Opin Chem Biol 51:122–129
Cao Z, Grima R (2020) Analytical distributions for detailed models of stochastic gene expression in eukaryotic cells. Proc Nat Acad Sci 117(9):4682–4692
Choubey S (2018) Nascent RNA kinetics: transient and steady state behavior of models of transcription. Phys Rev E 97(2):022402
Choubey S, Kondev J, Sanchez A (2015) Deciphering transcriptional dynamics in vivo by counting nascent RNA molecules. PLoS Comput Biol 11(11):e1004345
Cianci C, Smith S, Grima R (2016) Molecular finite-size effects in stochastic models of equilibrium chemical systems. J Chem Phys 144(8):084101
Coulon A, Ferguson ML, Va de Turris M, Palangat CCC, Larson DR (2014) Kinetic competition during the transcription cycle results in stochastic RNA processing. eLife 3:e03939
Digital Library of Mathematical Functions (2020a) Chapter 15: https://dlmf.nist.gov/15. Accessed 15 May 2020
Digital Library of Mathematical Functions (2020b) Chapter 13: https://dlmf.nist.gov/13. Accessed 15 May 2020
El Hage A, French SL, Beyer AL, Tollervey D (2010) Loss of topoisomerase I leads to R-loop-mediated transcriptional blocks during ribosomal RNA synthesis. Genes Dev 24(14):1546–1558
Elf J, Ehrenberg M (2003) Fast evaluation of fluctuations in biochemical networks with the linear noise approximation. Genome Res 13(11):2475–2484
Fenichel N (1979) Geometric singular perturbation theory for ordinary differential equations. J Differ Equ 31(1):53–98
Forde NR, Izhaky D, Woodcock GR, Wuite GJL, Bustamante C (2002) Using mechanical force to probe the mechanism of pausing and arrest during continuous elongation by escherichia coli RNA polymerase. Proc Nat Acad Sci 99(18):11682–11687
Gillespie DT (1977) Exact stochastic simulation of coupled chemical reactions. J Phys Chem 81(25):2340–2361
Gorin G, Wang M, Golding I, Heng X (2020) Stochastic simulation and statistical inference platform for visualization and estimation of transcriptional kinetics. PLoS ONE 15(3):e0230736
Halpern KB, Tanami S, Landen S, Chapal M, Szlak L, Hutzler A, Nizhberg A, Itzkovitz S (2015) Bursty gene expression in the intact mammalian liver. Mol Cell 58(1):147–156
Heng X, Skinner SO, Sokac AM, Golding I (2016) Stochastic kinetics of nascent RNA. Phys Rev Lett 117(12):128101
Jahnke T, Huisinga W (2007) Solving the chemical master equation for monomolecular reaction systems analytically. J Math Biol 54(1):1–26
Jones CKRT (1995) Geometric singular perturbation theory. In: Dynamical systems. Springer, pp 44–118
Klumpp S, Hwa T (2008) Stochasticity and traffic jams in the transcription of ribosomal RNA: intriguing role of termination and antitermination. Proc Nat Acad Sci 105(47):18159–18164
Larson DR, Zenklusen D, Bin W, Chao JA, Singer RH (2011) Real-time observation of transcription initiation and elongation on an endogenous yeast gene. Science 332(6028):475–478
Lenstra TL, Rodriguez J, Chen H, Larson DR (2016) Transcription dynamics in living cells. Ann Rev Biophys 45:25–47
Öcal K, Grima R, Sanguinetti G (2019) Parameter estimation for biochemical reaction networks using Wasserstein distances. J Phys A Math Theor 53(3):034002
Peccoud J, Ycart B (1995) Markovian modeling of gene-product synthesis. Theor Popul Biol 48(2):222–234
Popović N, Marr C, Swain PS (2016) A geometric analysis of fast-slow models for stochastic gene expression. J Math Biol 72(1–2):87–122
Raj A, Peskin CS, Tranchina D, Vargas DY, Tyagi S (2006) Stochastic mRNA synthesis in mammalian cells. PLoS Biol 4(10):e309
Raj A, Van Den Bogaard P, Rifkin SA, Van Oudenaarden A, Tyagi S (2008) Imaging individual mRNA molecules using multiple singly labeled probes. Nat Methods 5(10):877–879
Rodriguez J, Ren G, Day CR, Zhao K, Chow CC, Larson DR (2019) Intrinsic dynamics of a human gene reveal the basis of expression heterogeneity. Cell 176(1–2):213–226
Sanchez A, Golding I (2013) Genetic determinants and cellular constraints in noisy gene expression. Science 342(6163):1188–1193
Shahrezaei V, Swain PS (2008) Analytical distributions for stochastic gene expression. Proc Nat Acad Sci 105(45):17256–17261
Skinner SO, Xu H, Nagarkar-Jaiswal S, Freire PR, Zwaka TP, Golding I (2016) Single-cell analysis of transcription kinetics across the cell cycle. eLife 5:e12175
Smith S, Cianci C, Grima R (2017) Macromolecular crowding directs the motion of small molecules inside cells. J R Soc Interface 14(131):20170047
Suter DM, Molina N, Gatfield D, Schneider K, Schibler U, Naef F (2011) Mammalian genes are transcribed with widely different bursting kinetics. Science 332(6028):472–474
Tiina R, Antti H, Shannon H, Olli Y-H, AndreS R (2010) Effects of transcriptional pausing on gene expression dynamics. PLoS Comput Biol 6(3):e1000704
Veerman F, Marr C, Popović N (2018) Time-dependent propagators for stochastic models of gene expression: an analytical method. J Math Biol 77(2):261–312
Warren PB, Tănase-Nicola S, ten Wolde PR (2006) Exact results for noise power spectra in linear biochemical reaction networks. J Chem Phys 125(14):144904
Zechner C, Ruess J, Krenn P, Pelet S, Peter M, Lygeros J, Koeppl H (2012) Moment-based inference predicts bimodality in transient gene expression. Proc Nat Acad Sci 109(21):8340–8345
Zenklusen D, Larson DR, Singer RH (2008) Single-RNA counting reveals alternative modes of gene expression in yeast. Nat Struct Mol Biol 15(12):1263

Acknowledgements

R.G. acknowledges useful discussions with Zhixing Cao and Tineke Lenstra. This work was supported by a departmental PhD studentship to T.F.

Author information

Authors and Affiliations

School of Biological Sciences, The University of Edinburgh, Edinburgh, UK
Tatiana Filatova & Ramon Grima
School of Mathematics and Maxwell Institute for Mathematical Sciences, The University of Edinburgh, Edinburgh, UK
Tatiana Filatova & Nikola Popovic

Corresponding author

Correspondence to Ramon Grima.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

A. Distribution of Elongation Time

In this section, we answer the following question: what is the distribution of the elongation time, i.e. the time between initiation and termination? In other words, with reference to Fig. 9 (which includes the non-pausing model in Fig. 1 as a special case), we want to find the distribution of the time at which RNAP leaves gene segment L (termination) if it was in the active state on gene segment 1 at time $t = 0$ (initiation).

Let $z_i(t)$ be the probability that an RNAP is on gene segment i in the active state at time t, let $\tilde{z}_i(t)$ be the probability that the RNAP is on gene segment i in the paused state at time t, and let $z_i^*(t)$ be the probability that the RNAP has moved to gene segment $i+1$ by time t; note that $z_L^*(t)$ is the probability that the RNAP has fallen off the gene and formed a mature RNA, since for $i = L$, gene segment $L+1$ does not exist. Then, it follows from the reaction scheme illustrated in Fig. 9 that the master equations describing the Markovian dynamics on gene segment i are given by

$$\begin{aligned} \partial _t z_i(t)&= -(r_p+k+d_a) z_i(t) + r_a \tilde{z}_i(t), \end{aligned}$$

(A.1a)

$$\begin{aligned} \partial _t \tilde{z}_i(t)&= -(d_p+r_a) \tilde{z}_i(t) + r_p z_{i}(t), \end{aligned}$$

(A.1b)

$$\begin{aligned} \partial _t z_i^*(t)&= k z_{i}(t). \end{aligned}$$

(A.1c)

Now, we use these equations to find the distribution of the time when RNAP jumps to gene segment $i+1$, given that it is on gene segment i in the active state at $t=0$, i.e. that $z_i(0)=1$ and $\tilde{z}_i(0)=0$. Taking the Laplace transform of Eqs. (A.1a) and (A.1b), we find

$$\begin{aligned} s\hat{z}_i(s)-1&= -(r_p+k+d_a) \hat{z}_i(s)+r_a\hat{\tilde{z}}_i(s), \end{aligned}$$

(A.2a)

$$\begin{aligned} s\hat{\tilde{z}}_i(s)&= -(d_p+r_a)\hat{\tilde{z}}_i(s)+r_p \hat{z}_{i}(s), \end{aligned}$$

(A.2b)

where $\hat{f}(s)=\int _{0}^\infty \hbox {e}^{-st}f(t)\,\hbox {d}t$. Solving these equations simultaneously, we obtain

$$\begin{aligned} \hat{z}_i(s) = \frac{s+d_p+r_a}{(s+k+d_a)(s+d_p+r_a)+r_p(s+d_p)} \end{aligned}$$

(A.3)

Let $w(t)\hbox {d}t$ be the probability that the RNAP moves from segment i to $i+1$ in the time interval $(t,t+\hbox {d}t)$. Then, it follows from Eq. (A.1c) that $w(t)=\partial _t z_i^*(t)=k z_i(t)$. Integrating w(t) over all times gives us the probability that the RNAP ultimately moves to the next segment $i+1$,

$$\begin{aligned} \int _{0}^\infty w(t)\,\hbox {d}t = \hat{w}(0) = k \hat{z}_i(0) = \frac{k \left( r_a+d_p\right) }{\left( d_a+k\right) \left( r_a+d_p\right) +d_p r_p}. \end{aligned}$$

(A.4)

Note that $\hat{w}(0)$ is identical to the parameter ${\tilde{\mu }}$, as defined in Proposition 3. Let $y(t) \hbox {d}t$ be the probability that the RNAP moves from gene segment i to segment $i+1$ in the time interval $(t,t+\hbox {d}t)$, conditioned on those realisations that lead to an RNAP moving to the next gene segment $i+1$. (In other words, we exclude those realisations that lead to premature detachment.) Then, it follows by the definition of conditional probabilities that $y(t) = w(t)/\hat{w}(0)$, which implies

$$\begin{aligned} \hat{y}(s) = \frac{\hat{w}(s)}{\hat{w}(0)} = \frac{[(d_a+k)(r_a+d_p)+d_p r_p](r_a+d_p+s)}{(r_a+d_p)[(d_a+k+s)(r_a+d_p+s)+r_p(d_p+s)]}. \end{aligned}$$

(A.5)

It follows that the mean $\langle t \rangle $ and variance ${\mathrm{Var}}(t)$ of the time t it takes RNAP to move to the next gene segment are given by

$$\begin{aligned} \langle t \rangle&= -\frac{d \hat{y}(s)}{\hbox {d}s}\biggr |_{s=0} = \frac{(r_a+d_p)^2+r_a r_p}{(r_a+d_p)[(d_a+k)(r_a+d_p)+d_p r_p]}, \end{aligned}$$

(A.6a)

$$\begin{aligned} {\mathrm{Var}}(t)&= \frac{d^2 \hat{y}(s)}{\hbox {d}s^2}\biggr |_{s=0}-\biggl (\frac{d \hat{y}(s)}{\hbox {d}s}\biggr |_{s=0}\biggr )^2 \nonumber \\&=\frac{2 r_a r_p(r_a+d_p)(d_a+r_a+d_p+k)+(r_a+d_p)^4+r_a r_p^2 (r_a+2d_p)}{(r_a+d_p)^2[(d_a+k)(r_a+d_p)+d_pr_p]^2}, \end{aligned}$$

(A.6b)

respectively. Since RNAP can only move forwards in our model (irreversible motion), it follows that the time it takes an RNAP to move from the ith to the ($i+1$)th gene segment is independent of the time taken to move from another, jth segment to the ($j+1$)th segment. Hence, the time required for an RNAP to move across the entire gene from the first to the Lth segment, i.e. the ‘elongation’ time T from initiation to termination, is a sum of L independent and identically distributed random variables. Thus, we can immediately state that the mean elongation time is $\langle T \rangle = L \langle t \rangle $, whereas the variance of the elongation time is ${\mathrm{Var}}(T) = L {\mathrm{Var}}(t)$. The coefficient of variation squared takes the form

$$\begin{aligned} {\mathrm{CV}}_T^2 = \frac{{\mathrm{Var}}(T)}{\langle T \rangle ^2} = \frac{1}{L}\biggl \{ 1 + \frac{2 r_a r_p [(d_a+k)(r_a+d_p)+d_pr_p]}{[(r_a+d_p){}^2+r_a r_p]{}^2}\biggr \} . \end{aligned}$$

(A.7)

From Eq. (A.7), it can be shown that for small premature detachment rates, the coefficient of variation of the elongation time is maximised when $r_p \approx r_a$. Taking the limit of infinitely many gene segments at constant mean elongation time, i.e. solving $\langle T \rangle = L \langle t \rangle $ for k with $\langle t \rangle $ given by Eq. (A.6a), substituting into Eq. (A.7), and taking the limit of $L \rightarrow \infty $, we obtain

$$\begin{aligned} \lim _{L\rightarrow {}\infty }{\mathrm{CV}}_T^2=\frac{2 r_a r_p}{\langle T \rangle (r_a+d_p) [(r_a+d_p){}^2+r_ar_p]}. \end{aligned}$$

(A.8)

For the non-pausing model shown in Fig. 1, the above results simplify considerably due to $r_p = 0 = d_p$ and $d_a = d$; in that case, the inverse Laplace transform of Eq. (A.5) implies that y(t) is an exponential distribution with parameter $k+d$. Hence, the total time it takes an RNAP to move across the entire gene is the sum of L independent and identically distributed exponential random variables, i.e. it follows an Erlang distribution with shape parameter L and rate $k + d$, which implies that the mean elongation time is $L/(k+d)$, with coefficient of variation $1/\sqrt{L}$. It can be seen from Eq. (A.8) that, in the limit of large L at constant mean elongation time, deterministic elongation can only be observed when there is no pausing, i.e. when $r_p = 0$.
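The moments in Eq. (A.6) can be cross-checked against numerical derivatives of the transform in Eq. (A.5). The sketch below is illustrative only: the function names are hypothetical, and the whole-gene ratio is computed from $\langle T \rangle = L \langle t \rangle $ and ${\mathrm{Var}}(T) = L {\mathrm{Var}}(t)$ as stated above.

```python
def y_hat(s, k, r_a, r_p, d_a, d_p):
    """Laplace transform of the conditional single-segment time, Eq. (A.5)."""
    a = r_a + d_p
    delta = (d_a + k) * a + d_p * r_p
    return delta * (a + s) / (a * ((d_a + k + s) * (a + s) + r_p * (d_p + s)))


def step_moments(k, r_a, r_p, d_a, d_p):
    """Mean and variance of the single-segment time, Eqs. (A.6a) and (A.6b)."""
    a = r_a + d_p
    delta = (d_a + k) * a + d_p * r_p
    mean = (a**2 + r_a * r_p) / (a * delta)
    var = (2 * r_a * r_p * a * (d_a + a + k) + a**4
           + r_a * r_p**2 * (r_a + 2 * d_p)) / (a**2 * delta**2)
    return mean, var


def cv2_elongation(L, k, r_a, r_p, d_a, d_p):
    """Var(T) / <T>^2 for T the sum of L independent single-segment times."""
    mean, var = step_moments(k, r_a, r_p, d_a, d_p)
    return (L * var) / (L * mean) ** 2
```

With `r_p = 0`, `cv2_elongation` returns `1 / L`, which is the Erlang result quoted above; with pausing, it exceeds `1 / L` by the pausing-dependent term.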

B. Solution of Lyapunov Equation

Proof of Proposition 2

We start by defining the symmetric functions $f_{ij}=f_{ji}$ for $i,j=1,\dots {},L$ as

$$\begin{aligned} \begin{aligned} f_{00}&=1 ,&\quad {} f_{0j}&=\alpha ^{j-1},&\quad {} f_{0M}&=\theta \alpha ^{L-1}, \\ f_{ij}&=(f_{i-1,j}+f_{i,j-1})/2,&\quad {} f_{iM}&=\gamma f_{i-1,M}+(1-\gamma ) f_{iL},&\quad {} f_{MM}&=f_{LM}, \end{aligned} \end{aligned}$$

(B.1)

where the non-dimensional parameters $\alpha $, $\gamma $, and $\theta $ are defined in Proposition 2. The elements of the Lyapunov equation given by Eq. (5) can be written explicitly as a set of simultaneous equations:

$$\begin{aligned}&\mathbf {C}_{11}\cdot 2\mathbf {J}_{11} =-\mathbf {D}_{11},&\end{aligned}$$

(B.2a)

$$\begin{aligned}&\mathbf {C}_{12}\cdot (\mathbf {J}_{11}+\mathbf {J}_{22}) =-\mathbf {J}_{21}\mathbf {C}_{11},&\end{aligned}$$

(B.2b)

$$\begin{aligned}&\mathbf {C}_{1j}\cdot (\mathbf {J}_{11}+\mathbf {J}_{jj}) =-\mathbf {J}_{j,j-1}\mathbf {C}_{1,j-1} \quad \text {for }j=3,\dots {},L+1, \end{aligned}$$

(B.2c)

$$\begin{aligned}&\mathbf {C}_{1,L+2}\cdot (\mathbf {J}_{11}+\mathbf {J}_{L+2,L+2}) =-\mathbf {J}_{L+2,L+1}\mathbf {C}_{1,L+1},&\end{aligned}$$

(B.2d)

$$\begin{aligned}&\mathbf {C}_{22}\cdot 2\mathbf {J}_{22}=-2\mathbf {J}_{21}\mathbf {C}_{12}-\mathbf {D}_{22},&\end{aligned}$$

(B.2e)

$$\begin{aligned}&\mathbf {C}_{23} \cdot (\mathbf {J}_{22}+\mathbf {J}_{33})=-\mathbf {J}_{21}\mathbf {C}_{13}-\mathbf {J}_{32}\mathbf {C}_{22}-\mathbf {D}_{23},&\end{aligned}$$

(B.2f)

$$\begin{aligned}&\mathbf {C}_{2j} \cdot (\mathbf {J}_{22}+\mathbf {J}_{jj})=-\mathbf {J}_{21}\mathbf {C}_{1j}-\mathbf {J}_{j,j-1}\mathbf {C}_{2,j-1}&\quad&\text {for } j=4,\dots {},L+1, \end{aligned}$$

(B.2g)

$$\begin{aligned}&\mathbf {C}_{2,L+2} \cdot (\mathbf {J}_{22}+\mathbf {J}_{L+2,L+2})=-\mathbf {J}_{21}\mathbf {C}_{1,L+2}-\mathbf {J}_{L+2,L+1}\mathbf {C}_{2,L+1},&\end{aligned}$$

(B.2h)

$$\begin{aligned}&\mathbf {C}_{ii}\cdot {}2\mathbf {J}_{ii} =-2\mathbf {J}_{i,i-1}\mathbf {C}_{i-1,i}-\mathbf {D}_{ii}&\quad&\text {for }i=3,\dots {},L+1, \end{aligned}$$

(B.2i)

$$\begin{aligned}&\mathbf {C}_{i,i+1} \cdot {}(\mathbf {J}_{ii}+\mathbf {J}_{i+1,i+1})=-\mathbf {J}_{i,i-1}\mathbf {C}_{i-1,i+1}-\mathbf {J}_{i+1,i}\mathbf {C}_{ii}-\mathbf {D}_{i,i+1}&\quad&\text {for }i=3,\dots {},L, \end{aligned}$$

(B.2j)

$$\begin{aligned}&\mathbf {C}_{ij} \cdot {}(\mathbf {J}_{ii}+\mathbf {J}_{jj})=-\mathbf {J}_{i,i-1}\mathbf {C}_{i-1,j}-\mathbf {J}_{j,j-1}\mathbf {C}_{i,j-1}&\text {for }i=3,\dots {},L+1\nonumber {}\\&\text {and }j=i+2,\dots {},L+1, \end{aligned}$$

(B.2k)

$$\begin{aligned}&\mathbf {C}_{i,L+2} \cdot {}(\mathbf {J}_{ii}+\mathbf {J}_{L+2,L+2})=-\mathbf {J}_{i,i-1}\mathbf {C}_{i-1,L+2}-\mathbf {J}_{L+2,L+1}\mathbf {C}_{i,L+1}&\quad&\text {for }i=3,\dots {},L+1, \end{aligned}$$

(B.2l)

$$\begin{aligned}&\mathbf {C}_{L+2,L+2}\cdot {}2\mathbf {J}_{L+2,L+2}=-2\mathbf {J}_{L+2,L+1}\mathbf {C}_{L+1,L+2}-\mathbf {D}_{L+2,L+2}.&\end{aligned}$$

(B.2m)

Now, we substitute the elements of the Jacobian matrix $\mathbf {J}$ and the diffusion matrix $\mathbf {D}$ from Eqs. (6) and (7), respectively, into the above system of algebraic equations, which we then solve to find the elements of the covariance matrix $\mathbf {C}$. Note that, for the following mathematical derivation, we take into account the expressions for the steady-state mean numbers of species given in Eq. (2), as well as the definition of the functions $f_{ij}$ in Eq. (B.1).

From Eq. (B.2a), one easily obtains $\mathbf {C}_{11}=\eta ^2\beta $. Then, it follows from Eq. (B.2b) that

$$\begin{aligned} \begin{aligned} \mathbf {C}_{12}&=\dfrac{r}{s_u+s_b+k+d}\mathbf {C}_{11} =\rho {}_k\mu \alpha (\eta ^2\beta ) =\eta (\eta \rho {}_k\mu {})\alpha \beta =\eta \langle {}n_1\rangle {}\alpha \beta \cdot {}f_{01}. \end{aligned} \end{aligned}$$

(B.3)

Eq. (B.2c) implies that, for $j=3,\dots {},L+1$:

$$\begin{aligned} \begin{aligned} \mathbf {C}_{1j}&=\dfrac{k}{s_u+s_b+k+d}\mathbf {C}_{1,j-1} =\mu {}\alpha \cdot {}\mathbf {C}_{1,j-1} =(\mu {}\alpha )^{j-2}\mathbf {C}_{12}\\&=(\mu {}\alpha )^{j-2}(\eta \langle {}n_1\rangle {}\alpha \beta ) =\eta \langle {}n_{j-1}\rangle {}\alpha \beta \cdot {}f_{0,j-1}. \end{aligned} \end{aligned}$$

(B.4)

From Eq. (B.2d), we have that

$$\begin{aligned} \begin{aligned} \mathbf {C}_{1,L+2}&=\dfrac{k}{s_u+s_b+d_m}\mathbf {C}_{1,L+1} =\dfrac{k}{d_m}\theta (\eta \langle {}n_L\rangle {}\alpha \beta \cdot {}f_{0L})\\&=\eta (\dfrac{k}{d_m}\langle {}n_L\rangle {})(\alpha \beta )(\theta \cdot {}f_{0L}) =\eta \langle {}n\rangle {}\alpha \beta \cdot {}f_{0M}; \end{aligned} \end{aligned}$$

(B.5)

from Eq. (B.2e), we find

$$\begin{aligned} \begin{aligned} \mathbf {C}_{22}&= \dfrac{r\langle {}n_0\rangle {} +(k+d)\langle {}n_1\rangle {} }{2(k+d)}+\dfrac{r}{k+d}\mathbf {C}_{12} = \dfrac{\rho {}_k\mu \eta +\langle {}n_1\rangle {}}{2} \\&\quad +(\rho {}_k\mu {})(\eta \langle {}n_1\rangle {}\alpha \beta \cdot {}f_{01})=\langle {}n_1\rangle {} +\langle {}n_1\rangle {}^2\alpha \beta \cdot {}f_{11}, \end{aligned} \end{aligned}$$

(B.6)

since $f_{11}=(f_{01}+f_{10})/2=f_{01}$ from the definition in Eq. (B.1).

From Eq. (B.2f), we obtain

$$\begin{aligned} \mathbf {C}_{23}= & {} -\dfrac{k}{2(d+k)}\langle {}n_1\rangle {}+ \dfrac{r}{2(k+d)}\mathbf {C}_{13}+\dfrac{k}{2(d+k)}\mathbf {C}_{22} \\= & {} -\dfrac{\langle {}n_2\rangle {}}{2} +\dfrac{1}{2}(\rho {}_k\mu \eta )\langle {}n_2\rangle {}\alpha \beta \cdot {}f_{02} +\dfrac{1}{2}\mu {} [\langle {}n_1\rangle {}+\langle {}n_1\rangle {}^2\alpha \beta \cdot {}f_{11}] \\= & {} -\dfrac{\langle {}n_2\rangle {}}{2} +\dfrac{1}{2}\langle {}n_1\rangle {}\langle {}n_2\rangle {}\alpha \beta \cdot {}f_{02} +\dfrac{\langle {}n_2\rangle {}}{2} +\dfrac{1}{2} (\mu {}\langle {}n_1\rangle {})\langle {}n_1\rangle {}\alpha \beta \cdot {}f_{11} \\= & {} \dfrac{1}{2}\langle {}n_1\rangle {}\langle {}n_2\rangle {}\alpha \beta \cdot {}f_{02} +\dfrac{1}{2} \langle {}n_2\rangle {}\langle {}n_1\rangle {}\alpha \beta \cdot {}f_{11} = \langle {}n_1\rangle {}\langle {}n_2\rangle {}\alpha \beta \dfrac{1}{2}(f_{02}+f_{11})\\= & {} \langle {}n_1\rangle {}\langle {}n_2\rangle {}\alpha \beta \cdot {}f_{12}, \end{aligned}$$

(B.7)

since $f_{12}=(f_{02}+f_{11})/2$ from the definition in Eq. (B.1).

From Eq. (B.2g), we have that, for $j=4,\dots {},L+1$,

$$\begin{aligned} \begin{aligned} \mathbf {C}_{2j}&=\dfrac{r}{2(k+d)}\mathbf {C}_{1j}+\dfrac{k}{2(k+d)}\mathbf {C}_{2,j-1} =\dfrac{\rho {}_k\mu }{2}\mathbf {C}_{1j}+\dfrac{\mu }{2}\mathbf {C}_{2,j-1}\\&=\dfrac{\rho {}_k\mu }{2}\sum _{q=0}^{j-4}\Big (\dfrac{\mu }{2}\Big )^q\mathbf {C}_{1,j-q}+\Big (\dfrac{\mu }{2}\Big )^{j-3}\mathbf {C}_{23}. \end{aligned} \end{aligned}$$

(B.8)

The proof of Eq. (B.8) is given in Lemma B.1. The above expression for $\mathbf {C}_{2j}$ can be further simplified to

$$\begin{aligned} \begin{aligned} \mathbf {C}_{2j}&=\dfrac{\rho {}_k\mu }{2}\sum _{q=0}^{j-4}\Big (\dfrac{\mu }{2}\Big )^q\eta \langle {}n_{j-q-1}\rangle {}\alpha \beta \cdot {}f_{0,j-q-1} +\Big (\dfrac{\mu }{2}\Big )^{j-3}\langle {}n_1\rangle {}\langle {}n_2\rangle {}\alpha \beta \cdot {}f_{12} \\&=\sum _{q=0}^{j-4}\Big (\dfrac{1}{2}\Big )^{q+1}(\rho {}_k\mu \eta )(\mu ^q\langle {}n_{j-q-1}\rangle {})\alpha \beta \cdot {}f_{0,j-q-1} +\Big (\dfrac{1}{2}\Big )^{j-3}\langle {}n_1\rangle {}(\mu ^{j-3}\langle {}n_2\rangle {})\alpha \beta \cdot {}f_{12} \\&=\sum _{q=0}^{j-4}\Big (\dfrac{1}{2}\Big )^{q+1}\langle {}n_1\rangle {}\langle {}n_{j-1}\rangle {}\alpha \beta \cdot {}f_{0,j-q-1} +\Big (\dfrac{1}{2}\Big )^{j-3}\langle {}n_1\rangle {}\langle {}n_{j-1}\rangle {}\alpha \beta \cdot {}f_{12} \\&=\langle {}n_1\rangle {}\langle {}n_{j-1}\rangle {}\alpha \beta \bigg [\sum _{q=0}^{j-4}\Big (\dfrac{1}{2}\Big )^{q+1}f_{0,j-q-1} +\Big (\dfrac{1}{2}\Big )^{j-3}f_{12}\bigg ] =\langle {}n_1\rangle {}\langle {}n_{j-1}\rangle {}\alpha \beta \cdot {}f_{1,j-1}. \end{aligned} \end{aligned}$$

(B.9)

For the proof of the last equality in Eq. (B.9), see Lemma B.2.

From Eq. (B.2h), we have that

$$\begin{aligned} \mathbf {C}_{2,L+2}= & {} \dfrac{r}{k+d+d_m}\mathbf {C}_{1,L+2}+\dfrac{k}{k+d+d_m}\mathbf {C}_{2,L+1} =\rho {}_k\mu \gamma {}\mathbf {C}_{1,L+2}+\mu {}\gamma \mathbf {C}_{2,L+1}\\= & {} (\rho {}_k\mu \gamma {})(\eta \langle {}n\rangle {}\alpha \beta \cdot {}f_{0M})+(\mu \gamma )(\langle {}n_1\rangle {}\langle {}n_L\rangle {}\alpha \beta \cdot {}f_{1L}) \\= & {} (\rho {}_k\eta \mu )\langle {}n\rangle {}\alpha \beta \cdot {}\gamma {}f_{0M}+\mu \dfrac{d_m}{k}\langle {}n_1\rangle {}\dfrac{k}{d_m}\langle {}n_L\rangle {}\alpha \beta \cdot {}\gamma f_{1L}\\= & {} \langle {}n_1\rangle {}\langle {}n\rangle {}\alpha \beta \cdot {}\Big [\gamma {}f_{0M}+\mu \dfrac{d_m}{k}\gamma f_{1L}\Big ] \\= & {} \langle {}n_1\rangle {}\langle {}n\rangle {}\alpha \beta \cdot {}[\gamma {}{}f_{0M}+ (1-\gamma {}{})\cdot {}f_{1L}] = \langle {}n_1\rangle {}\langle {}n\rangle {}\alpha \beta \cdot {}f_{1M}, \end{aligned}$$

(B.10)

where $f_{1M}$ is defined in Eq. (B.1).
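The step from the fourth to the fifth line of Eq. (B.10) relies on the identity $\mu (d_m/k)\gamma =1-\gamma $. The following Python sketch checks it in exact rational arithmetic. It assumes $\mu =k/(k+d)$ and $\gamma =(k+d)/(k+d+d_m)$, which is what the first line of Eq. (B.10) implies; the helper name and the parameter values are hypothetical.

```python
from fractions import Fraction

def mu_gamma(k, d, d_m):
    """Return (mu, gamma), assuming mu = k/(k+d) and gamma = (k+d)/(k+d+d_m)."""
    k, d, d_m = Fraction(k), Fraction(d), Fraction(d_m)
    mu = k / (k + d)
    gamma = (k + d) / (k + d + d_m)
    return mu, gamma

# Arbitrary positive rates, used only for illustration.
k, d, d_m = Fraction(7, 2), Fraction(1, 3), Fraction(5, 4)
mu, gamma = mu_gamma(k, d, d_m)
lhs = mu * (d_m / k) * gamma   # coefficient of f_{1L} before simplification
rhs = 1 - gamma                # coefficient of f_{1L} in the definition of f_{1M}
print(lhs, rhs, lhs == rhs)
```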

Eqs. (B.2i) through (B.2k) yield the system

$$\begin{aligned} \begin{aligned} \mathbf {C}_{ii}&= \dfrac{k\langle {}n_{i-2}\rangle +(k+d)\langle {}n_{i-1} \rangle }{2(k+d)}+\dfrac{k}{k+d}\mathbf {C}_{i-1,i}=\langle {}n_{i-1}\rangle {}+ \mu {}\mathbf {C}_{i-1,i},\\ \mathbf {C}_{i,i+1}&= \dfrac{\mu }{2}\mathbf {C}_{i-1,i+1}+ \dfrac{\mu }{2}\mathbf {C}_{ii}-\dfrac{\mu }{2}\langle {}n_{i-1}\rangle = \dfrac{\mu }{2}(\mathbf {C}_{i-1,i+1}+\mu {}\mathbf {C}_{i-1,i}),\\ \mathbf {C}_{ij}&= \dfrac{\mu }{2}(\mathbf {C}_{i-1,j}+\mathbf {C}_{i,j-1}), \end{aligned} \end{aligned}$$

(B.11)

which can be rewritten more compactly as

$$\begin{aligned} \begin{aligned} \mathbf {C}_{ij}=\delta _{ij}\langle {}n_{i-1}\rangle +\langle {}n_{i-1}\rangle \langle {}n_{j-1}\rangle \alpha \beta {}\cdot {}f_{i-1,j-1}&\qquad \text {for }i,j=3,\dots {},L+1, \end{aligned} \nonumber \\ \end{aligned}$$

(B.12)

where $\delta _{ij}$ is the Kronecker delta. A detailed derivation is given in Lemma B.3.

From Eq. (B.2l), we have that for $i=3,\dots {},L+1$,

$$\begin{aligned} \mathbf {C}_{i,L+2}= & {} \dfrac{k}{k+d+d_m}\mathbf {C}_{i-1,L+2}+\dfrac{k}{k+d+d_m}\mathbf {C}_{i,L+1}\\= & {} \mu {}\gamma {}\mathbf {C}_{i-1,L+2}+(k/d_m)(1-\gamma )\mathbf {C}_{i,L+1} \\= & {} \gamma {}(\mu \langle {}n_{i-2}\rangle {})\langle {}n\rangle {}\alpha \beta \cdot {}f_{i-2,M}+(1-\gamma )\langle {}n_{i-1}\rangle {}(k/d_m\langle {}n_L\rangle {})\alpha \beta \cdot {}f_{i-1,L} \\= & {} \langle {}n_{i-1}\rangle {}\langle {}n\rangle {}\alpha \beta \cdot {}[\gamma {}f_{i-2,M}+(1-\gamma )f_{i-1,L}] =\langle {}n_{i-1}\rangle {}\langle {}n\rangle {}\alpha \beta \cdot {}f_{i-1,M},\\ \end{aligned}$$

(B.13)

where $f_{iM}$ is defined in Eq. (B.1).

Finally, Eq. (B.2m) yields

$$\begin{aligned} \begin{aligned} \mathbf {C}_{L+2,L+2}&= \dfrac{k\langle {}n_L\rangle +d_m\langle {}n\rangle }{2d_m}+\dfrac{k}{d_m}\mathbf {C}_{L+1,L+2}\\&=\langle {}n\rangle +(k/d_m)\langle {}n_L\rangle \langle {}n\rangle \alpha \beta {}\cdot {}f_{LM} =\langle {}n\rangle +\langle {}n\rangle ^2\alpha \beta {}\cdot {}f_{MM}, \end{aligned} \end{aligned}$$

(B.14)

where $f_{MM}=f_{LM}$ is defined in Eq. (B.1).

Summarising the above results, we conclude that the solution for the symmetric covariance matrix $\mathbf {C}$ is given by the system in Eq. (4), where we have that $\mathrm{Cov}(n_i,n_j)=\mathbf {C}_{i+1,j+1}$, $\mathrm{Cov}(n_i,n)=\mathbf {C}_{i+1,L+2}$ for $i,j=0,\dots {},L$, and $\mathrm{Var}(n)=\mathbf {C}_{L+2,L+2}$. Here, the functions $f_{ij}$ are defined as in Eq. (B.1). Now, the recurrence relation $f_{ij}=(f_{i-1,j}+f_{i,j-1})/2$ in Eq. (B.1) can be solved for $i,j=1,2,\dots {},L$ via the method of generating functions, which gives the following analytical expression:

$$\begin{aligned} \begin{aligned} f_{ij}=f(i,j)+f(j,i), \end{aligned} \end{aligned}$$

(B.15)

where

$$\begin{aligned} {f}(i,j) =\dfrac{\alpha ^{i+j-1}}{(2\alpha -1)^{i}} +\dfrac{1}{2^{i+j-1}}\left( {\begin{array}{c}i+j-1\\ i\end{array}}\right) \Big [ 1-\dfrac{2\alpha -1}{2\alpha }{}_2F_1\big (1,i+j;j;\tfrac{1}{2\alpha }\big ) \Big ]; \end{aligned}$$

see Lemma B.5 for a detailed derivation. Additionally, we can easily prove that the function $f_{iM}$ in Eq. (B.1) can be rewritten as

$$\begin{aligned} f_{iM}=\gamma {}^{i}f_{0M}+(1-\gamma {})\sum \limits _{q=1}^{i}\gamma {}^{i-q}{f}_{qL}, \end{aligned}$$

(B.16)

as shown in Lemma B.4. $\square $

Lemma B.1

For $j=4,\dots {},L+1$, we have the identity

$$\begin{aligned} \mathbf {C}_{2j} =\dfrac{\rho {}_k\mu }{2}\mathbf {C}_{1j}+\dfrac{\mu }{2}\mathbf {C}_{2,j-1}=\dfrac{\rho {}_k\mu }{2}\sum _{q=0}^{j-4}\Big (\dfrac{\mu }{2}\Big )^q\mathbf {C}_{1,j-q}+\Big (\dfrac{\mu }{2}\Big )^{j-3}\mathbf {C}_{23}, \end{aligned}$$

(B.17)

as stated in Eq. (B.8).

Proof

The identity in Eq. (B.17) will be proved by induction: one can easily show that it holds for $j=4$. Now, we assume that Eq. (B.17) is true for some $j\ge {}4$; hence, for $j+1$, we have

$$\begin{aligned} \mathbf {C}_{2,j+1}= & {} \sum _{q=0}^{j-3}\Big (\dfrac{\mu }{2}\Big )^q\dfrac{\rho {}_k\mu }{2}\mathbf {C}_{1,j+1-q}+\Big (\dfrac{\mu }{2}\Big )^{j-2}\mathbf {C}_{23} \\= & {} \dfrac{\rho {}_k\mu }{2}\mathbf {C}_{1,j+1} + \sum _{q=1}^{j-3}\Big (\dfrac{\mu }{2}\Big )^{q}\dfrac{\rho {}_k\mu }{2}\mathbf {C}_{1,j+1-q}+\Big (\dfrac{\mu }{2}\Big )^{j-2}\mathbf {C}_{23} \\= & {} \dfrac{\rho {}_k\mu }{2}\mathbf {C}_{1,j+1} +\dfrac{\mu }{2} \bigg [ \sum _{q=1}^{j-3}\Big (\dfrac{\mu }{2}\Big )^{q-1}\dfrac{\rho {}_k\mu }{2}\mathbf {C}_{1,j+1-q}+\Big (\dfrac{\mu }{2}\Big )^{j-3}\mathbf {C}_{23}\bigg ] \\= & {} \dfrac{\rho {}_k\mu }{2}\mathbf {C}_{1,j+1} +\dfrac{\mu }{2} \bigg [\sum _{q=0}^{j-4}\Big (\dfrac{\mu }{2}\Big )^{q}\dfrac{\rho {}_k\mu }{2}\mathbf {C}_{1,j-q} +\Big (\dfrac{\mu }{2}\Big )^{j-3}\mathbf {C}_{23}\bigg ] \\= & {} \dfrac{\rho {}_k\mu }{2}\mathbf {C}_{1,j+1}+\dfrac{\mu }{2}\mathbf {C}_{2j}, \end{aligned}$$

(B.18)

as claimed, which implies that the identity in Eq. (B.17) holds for all $j=4,\dots {},L+1$. $\square $
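Lemma B.1 is an instance of unrolling a first-order linear recurrence. As an informal check that is not part of the proof, the following Python sketch compares the recursive and the unrolled form of Eq. (B.17) in exact arithmetic. The coefficients `a` and `b` stand in for $\rho _k\mu /2$ and $\mu /2$, and the values of $\mathbf {C}_{1j}$ and $\mathbf {C}_{23}$ are arbitrary placeholders rather than model output.

```python
from fractions import Fraction

def c2_recursive(j, a, b, c1, c23):
    """C_{2j} from C_{2j} = a*C_{1j} + b*C_{2,j-1}, started at C_{23}."""
    value = c23
    for m in range(4, j + 1):
        value = a * c1[m] + b * value
    return value

def c2_unrolled(j, a, b, c1, c23):
    """C_{2j} from the explicit sum on the right-hand side of Eq. (B.17)."""
    partial = sum(b ** q * c1[j - q] for q in range(0, j - 3))  # q = 0, ..., j-4
    return a * partial + b ** (j - 3) * c23

# Arbitrary placeholder data (hypothetical, chosen only to exercise the identity).
a, b = Fraction(1, 5), Fraction(2, 9)
c1 = {m: Fraction(1, m) for m in range(4, 12)}
c23 = Fraction(3, 7)
agree = all(
    c2_recursive(j, a, b, c1, c23) == c2_unrolled(j, a, b, c1, c23)
    for j in range(4, 12)
)
print(agree)
```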

Lemma B.2

The function $f_{1j}$, which is defined by the recurrence relation $f_{1j}=(f_{0j}+f_{1,j-1})/2$ in Eq. (B.1), satisfies the identity

$$\begin{aligned} f_{1j} =\sum _{q=0}^{j-3}\Big (\dfrac{1}{2}\Big )^{q+1}f_{0,j-q} +\Big (\dfrac{1}{2}\Big )^{j-2}f_{12}\qquad \text {for }j=3,\dots {},L, \end{aligned}$$

(B.19)

as stated in Eq. (B.9).

Proof

We will again prove Eq. (B.19) by induction. For $j=3$, we have from Eq. (B.19) that $f_{13}=(f_{03}+f_{12})/2$, which is true by the definition of $f_{13}$. We assume that the identity in Eq. (B.19) is correct for some $j\ge {}3$; then, for $j+1$, the definition of $f_{1,j+1}$, in combination with our assumption, implies

$$\begin{aligned} \begin{aligned} f_{1,j+1}&=\dfrac{1}{2}f_{0,j+1}+\dfrac{1}{2}f_{1j}= \dfrac{1}{2}f_{0,j+1}+\dfrac{1}{2}\bigg [ \sum _{q=0}^{j-3}\Big (\dfrac{1}{2}\Big )^{q+1}f_{0,j-q} +\Big (\dfrac{1}{2}\Big )^{j-2}f_{12} \bigg ]\\&=\dfrac{1}{2}f_{0,j+1}+\dfrac{1}{2}\bigg [ \sum _{q=1}^{j-2}\Big (\dfrac{1}{2}\Big )^{q}f_{0,j+1-q} +\Big (\dfrac{1}{2}\Big )^{j-2}f_{12}\bigg ] \\&=\sum _{q=0}^{j-2}\Big (\dfrac{1}{2}\Big )^{q+1}f_{0,j+1-q} +\Big (\dfrac{1}{2}\Big )^{j-1}f_{12}, \end{aligned} \end{aligned}$$

as claimed. Hence, the equality in Eq. (B.19) is true for all $j=3,\dots {},L$. $\square $

Lemma B.3

The system in Eq. (B.11), which is given by

$$\begin{aligned} \mathbf {C}_{ii}= & {} \langle {}n_{i-1}\rangle {}+\mu {}\mathbf {C}_{i-1,i} \quad \quad \quad \text {for }i=3,\dots {},L+1, \\ \mathbf {C}_{i,i+1}= & {} \dfrac{\mu }{2}(\mathbf {C}_{i-1,i+1}+\mu {}\mathbf {C}_{i-1,i}) \quad \text {for }i=3,\dots {},L, \\ \mathbf {C}_{ij}= & {} \dfrac{\mu }{2}(\mathbf {C}_{i-1,j}+\mathbf {C}_{i,j-1}) \quad \text {for }i=3,\dots {},L+1\text { and }j=i+2,\dots {},L+1,\\ \end{aligned}$$

(B.20)

is equivalent to the system

$$\begin{aligned} \begin{aligned} \mathbf {C}_{ij}=\delta _{ij}\langle {}n_{i-1}\rangle +\langle {}n_{i-1}\rangle \langle {}n_{j-1}\rangle \alpha \beta {}\cdot {}f_{i-1,j-1}&\qquad \text {for }i,j=3,\dots {},L+1, \end{aligned}\nonumber \\ \end{aligned}$$

(B.21)

as stated in Eq. (B.12). Here, the functions $f_{ij}$ are defined as in Eq. (B.1).

Proof

We again use the method of induction. For $i=3$, we have

$$\begin{aligned} \begin{aligned} \mathbf {C}_{33}&=\langle {}n_2\rangle {}+\langle {}n_2\rangle {}^2\alpha \beta {}\cdot {}f_{22} =\langle {}n_2\rangle {}+\mu {}\langle {}n_1\rangle {}\langle {}n_2\rangle {}\alpha \beta {}\cdot {}f_{12} =\langle {}n_2\rangle {}+\mu {}\mathbf {C}_{23},\\ \mathbf {C}_{34}&=\langle {}n_2\rangle {}\langle {}n_3\rangle {}(\beta {}\alpha )\cdot {}f_{23} =\langle {}n_2\rangle {}\langle {}n_3\rangle {}\alpha \beta (f_{22}+f_{13})/2\\&=[\langle {}n_2\rangle {}\langle {}n_3\rangle {}\alpha \beta \cdot {}f_{22}+\langle {}n_2\rangle {}\langle {}n_3\rangle {}\alpha \beta \cdot {}f_{13}]/2\\&=[\mu {}\langle {}n_1\rangle {}\langle {}n_3\rangle {}\alpha \beta {}\cdot {}f_{13}+\mu ^2\langle {}n_1\rangle {}\langle {}n_2\rangle {}\alpha \beta {}\cdot {}f_{12}]/2\\&=\dfrac{\mu {}}{2}[\langle {}n_1\rangle {}\langle {}n_3\rangle {}\alpha \beta \cdot {}f_{13}+\mu {}\langle {}n_1\rangle {}\langle {}n_2\rangle {}\alpha \beta \cdot {}f_{12}] =\dfrac{\mu {}}{2}(\mathbf {C}_{24}+\mu {}\mathbf {C}_{23}),\\ \mathbf {C}_{3j}&=\langle {}n_2\rangle {}\langle {}n_{j-1}\rangle {}\alpha \beta \cdot {}f_{2,j-1} =\langle {}n_2\rangle {}\langle {}n_{j-1}\rangle {}\alpha \beta (f_{2,j-2}+f_{1,j-1})/2\\ {}&=[\langle {}n_2\rangle {}\langle {}n_{j-1}\rangle {}\alpha \beta \cdot {}f_{2,j-2}+\langle {}n_2\rangle {}\langle {}n_{j-1}\rangle {}\alpha \beta \cdot {}f_{1,j-1}]/2 \\ {}&=[\langle {}n_2\rangle {}\mu {}\langle {}n_{j-2}\rangle {}\alpha \beta \cdot {}f_{2,j-2}+\mu {}\langle {}n_1\rangle {}\langle {}n_{j-1}\rangle {}\alpha \beta \cdot {}f_{1,j-1}]/2 \\ {}&=\dfrac{\mu {}}{2}[\langle {}n_2\rangle {}\langle {}n_{j-2}\rangle {}\alpha \beta \cdot {}f_{2,j-2}+\langle {}n_1\rangle {}\langle {}n_{j-1}\rangle {}\alpha \beta \cdot {}f_{1,j-1}]\\ {}&=\dfrac{\mu {}}{2}(\mathbf {C}_{3,j-1}+\mathbf {C}_{2j}). \end{aligned} \end{aligned}$$

(B.22)

Now, we assume that the statement is true for some $i\ge {}3$; then, for $i+1$, we have

$$\begin{aligned} \begin{aligned} \mathbf {C}_{i+1,i+1}&=\langle {}n_{i}\rangle {}+\langle {}n_{i}\rangle {}^2\alpha \beta \cdot {}f_{ii} =\langle {}n_{i}\rangle {}+\mu {}\langle {}n_{i-1}\rangle {}\langle {}n_{i}\rangle {}\alpha \beta \cdot {}f_{i-1,i} =\langle {}n_{i}\rangle {}+\mu {}\mathbf {C}_{i,i+1}, \\ \mathbf {C}_{i+1,i+2}&=\langle {}n_{i}\rangle {}\langle {}n_{i+1}\rangle {}\alpha \beta \cdot {}f_{i,i+1}=\langle {}n_{i}\rangle {}\langle {}n_{i+1}\rangle {}\alpha \beta ( f_{i-1,i+1}+f_{ii})/2 \\&=[\langle {}n_{i}\rangle {}\langle {}n_{i+1}\rangle {}\alpha \beta \cdot {}f_{i-1,i+1}+\langle {}n_{i}\rangle {}\langle {}n_{i+1}\rangle {}\alpha \beta \cdot {}f_{ii}]/2\\&=[\mu {}\langle {}n_{i-1}\rangle {}\langle {}n_{i+1}\rangle {}\alpha \beta \cdot {}f_{i-1,i+1}+\mu {}^2\langle {}n_{i-1}\rangle {}\langle {}n_{i}\rangle {}\alpha \beta \cdot {}f_{i-1,i}]/2\\&=\dfrac{\mu {}}{2}[\langle {}n_{i-1}\rangle {}\langle {}n_{i+1}\rangle {}\alpha \beta \cdot {}f_{i-1,i+1}+\mu {}\langle {}n_{i-1}\rangle {}\langle {}n_{i}\rangle {}\alpha \beta \cdot {}f_{i-1,i}] \\ {}&=\dfrac{\mu {}}{2}(\mathbf {C}_{i,i+2}+\mu {}\mathbf {C}_{i,i+1}),\\ \mathbf {C}_{i+1,j}&=\langle {}n_{i}\rangle {}\langle {}n_{j-1}\rangle {}\alpha \beta \cdot {}f_{i,j-1} =\langle {}n_{i}\rangle {}\langle {}n_{j-1}\rangle {}\alpha \beta (f_{i-1,j-1}+f_{i,j-2})/2 \\&=[\langle {}n_{i}\rangle {}\langle {}n_{j-1}\rangle {}\alpha \beta \cdot {}f_{i-1,j-1}+\langle {}n_{i}\rangle {}\langle {}n_{j-1}\rangle {}\alpha \beta \cdot {}f_{i,j-2}]/2 \\&=[\mu {}\langle {}n_{i-1}\rangle {}\langle {}n_{j-1}\rangle {}\alpha \beta \cdot {}f_{i-1,j-1}+\mu {}\langle {}n_{i}\rangle {}\langle {}n_{j-2}\rangle {}\alpha \beta \cdot {}f_{i,j-2}]/2 \\&=\dfrac{\mu {}}{2}[\langle {}n_{i-1}\rangle {}\langle {}n_{j-1}\rangle {}\alpha \beta \cdot {}f_{i-1,j-1}+\langle {}n_{i}\rangle {}\langle {}n_{j-2}\rangle {}\alpha \beta \cdot {}f_{i,j-2}]\\&=\dfrac{\mu {}}{2}(\mathbf {C}_{ij}+\mathbf {C}_{i+1,j-1}),\\ \end{aligned} \end{aligned}$$

(B.23)

which is also correct. Hence, the statement of the lemma is true for all i and j, as stated. $\square $
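The equivalence in Lemma B.3 can also be checked numerically. The Python sketch below is an informal illustration with hypothetical names and parameter values. It builds $f_{ij}$ from Eq. (B.1), forms the closed-form covariances of Eq. (B.21) with $\langle {}n_i\rangle {}=\mu ^{i-1}\langle {}n_1\rangle {}$, and tests the three recurrences on the index ranges of Eqs. (B.2i) to (B.2k), i.e. with $j\ge {}i+2$ in the last relation. The product $\alpha \beta $ enters only as a constant factor and is passed as the single argument `ab`.

```python
from fractions import Fraction

def f_table(L, alpha):
    """f_ij for 0 <= i, j <= L from the boundary values and recurrence in Eq. (B.1)."""
    f = [[Fraction(0)] * (L + 1) for _ in range(L + 1)]
    f[0][0] = Fraction(1)
    for j in range(1, L + 1):
        f[0][j] = f[j][0] = alpha ** (j - 1)
    for i in range(1, L + 1):
        for j in range(1, L + 1):
            f[i][j] = (f[i - 1][j] + f[i][j - 1]) / 2
    return f

def check_system(L, mu, n1, ab, alpha):
    """True if the closed form of Eq. (B.21) satisfies the recurrences of Eq. (B.20)."""
    f = f_table(L, alpha)
    n = [None] + [n1 * mu ** (i - 1) for i in range(1, L + 1)]  # n[i] = <n_i>

    def C(i, j):
        diagonal = n[i - 1] if i == j else 0
        return diagonal + n[i - 1] * n[j - 1] * ab * f[i - 1][j - 1]

    ok = True
    for i in range(3, L + 2):
        ok = ok and C(i, i) == n[i - 1] + mu * C(i - 1, i)
        if i <= L:
            ok = ok and C(i, i + 1) == mu / 2 * (C(i - 1, i + 1) + mu * C(i - 1, i))
        for j in range(i + 2, L + 2):
            ok = ok and C(i, j) == mu / 2 * (C(i - 1, j) + C(i, j - 1))
    return ok

# Arbitrary rational parameter values, used only for illustration.
result = check_system(6, Fraction(3, 4), Fraction(2, 5), Fraction(7, 3), Fraction(5, 6))
print(result)
```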

Lemma B.4

For $i=1,\dots {},L$, the function $f_{iM}$ defined in Eq. (B.1) can be simplified as in Eq. (B.16); specifically, we have the identity

$$\begin{aligned} f_{iM} =\gamma f_{i-1,M}+(1-\gamma ) f_{i,L} =\gamma {}^{i}\cdot {}f_{0M}+(1-\gamma {})\sum _{q=1}^{i}\gamma {}^{i-q}\cdot {}f_{qL}. \end{aligned}$$

(B.24)

Proof

The proof is by induction: for $i=1$, the identity is obvious. We now suppose that Eq. (B.24) is true for some $i\ge {}1$; hence, for $i+1$, we have

$$\begin{aligned} \begin{aligned} f_{i+1,M}&= \gamma {}^{i+1}\cdot {}f_{0M}+(1-\gamma {})\sum _{q=1}^{i+1}\gamma {}^{i+1-q}\cdot {}f_{qL}\\ {}&= \gamma {}\bigg [ \gamma {}^{i}\cdot {}f_{0M}+(1-\gamma {})\sum _{q=1}^{i}\gamma {}^{i-q}\cdot {}f_{qL} \bigg ]+(1-\gamma )f_{i+1,L}\\&=\gamma {}f_{iM}+(1-\gamma )f_{i+1,L}, \end{aligned} \end{aligned}$$

(B.25)

which is correct. Hence, Eq. (B.24) is true for all i, as stated. $\square $
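For concreteness, the quantities $f_{iM}$, and in particular $f_{MM}=f_{LM}$, can be evaluated either from the recurrence in Eq. (B.1) or from the explicit sum in Eq. (B.24). The following Python sketch does both in exact arithmetic; the function names and the values of $\alpha $, $\gamma $, and $\theta $ are hypothetical and serve only as an illustration.

```python
from fractions import Fraction

def f_table(L, alpha):
    """f_ij for 0 <= i, j <= L from the boundary values and recurrence in Eq. (B.1)."""
    f = [[Fraction(0)] * (L + 1) for _ in range(L + 1)]
    f[0][0] = Fraction(1)
    for j in range(1, L + 1):
        f[0][j] = f[j][0] = alpha ** (j - 1)
    for i in range(1, L + 1):
        for j in range(1, L + 1):
            f[i][j] = (f[i - 1][j] + f[i][j - 1]) / 2
    return f

def f_M_recursive(L, alpha, gamma, theta):
    """List [f_{0M}, ..., f_{LM}] from f_{iM} = gamma*f_{i-1,M} + (1-gamma)*f_{iL}."""
    f = f_table(L, alpha)
    f_M = [theta * alpha ** (L - 1)]  # f_{0M}
    for i in range(1, L + 1):
        f_M.append(gamma * f_M[i - 1] + (1 - gamma) * f[i][L])
    return f_M

def f_M_unrolled(i, L, alpha, gamma, theta):
    """f_{iM} from the explicit sum in Eq. (B.24)."""
    f = f_table(L, alpha)
    f_0M = theta * alpha ** (L - 1)
    tail = sum(gamma ** (i - q) * f[q][L] for q in range(1, i + 1))
    return gamma ** i * f_0M + (1 - gamma) * tail

# Arbitrary values in (0, 1), used only for illustration.
L = 5
alpha, gamma, theta = Fraction(5, 6), Fraction(3, 4), Fraction(2, 7)
f_M = f_M_recursive(L, alpha, gamma, theta)
f_MM = f_M[L]  # f_{MM} = f_{LM} by Eq. (B.1)
print(f_MM, f_MM == f_M_unrolled(L, L, alpha, gamma, theta))
```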

Lemma B.5

For $i,j=1,\dots {},L$, the solution of the recurrence relation $f_{ij}=(f_{i,j-1}+f_{i-1,j})/2$ in Eq. (B.1) is given by $f_{ij}=f(i,j)+f(j,i)$, where

$$\begin{aligned} {f}(i,j) =\dfrac{\alpha ^{i+j-1}}{(2\alpha -1)^{i}} +\dfrac{1}{2^{i+j-1}}\left( {\begin{array}{c}i+j-1\\ i\end{array}}\right) \Big [ 1-\dfrac{2\alpha -1}{2\alpha }{}_2F_1\big (1,i+j;j;\tfrac{1}{2\alpha }\big ) \Big ].\nonumber \\ \end{aligned}$$

(B.26)

Proof

In order to solve the recurrence relation for the function $f_{ij}$, we take into account the initial conditions $f_{00}=1$ and $f_{0j}=f_{j0}=\alpha ^{j-1}$. Then, we define a generating function g(x, y) via

$$\begin{aligned} \begin{aligned} g(x,y)&=\sum _{i,j\ge {}0}f_{ij}x^iy^j=f_{00}+\sum _{j\ge {}1}f_{0j}y^j+\sum _{i\ge {}1}f_{i0}x^i+\sum _{i,j\ge {}1}f_{ij}x^iy^j, \end{aligned} \end{aligned}$$

(B.27)

where the last term can be rewritten as

$$\begin{aligned} \begin{aligned} \sum _{i,j\ge {}1}f_{ij}x^iy^j&=\sum _{i,j\ge {}1}\dfrac{1}{2}(f_{i-1,j}+f_{i,j-1})x^iy^j\\&=\dfrac{1}{2}x\sum _{i,j\ge {}1}f_{i-1,j}x^{i-1}y^j+\dfrac{1}{2}y\sum _{i,j\ge {}1}f_{i,j-1}x^iy^{j-1} \\&=\dfrac{1}{2}x\sum _{i\ge {}0}\sum _{j\ge {}1}f_{ij}x^{i}y^j+\dfrac{1}{2}y\sum _{i\ge {}1}\sum _{j\ge {}0}f_{ij}x^iy^{j} \\&=\dfrac{1}{2}x\bigg (\sum _{i,j\ge {}0}f_{ij}x^{i}y^j-\sum _{i\ge {}0}f_{i0}x^i\bigg )+\dfrac{1}{2}y\bigg (\sum _{i,j\ge {}0}f_{ij}x^iy^{j}-\sum _{j\ge {}0}f_{0j}y^j\bigg ) \\&=\dfrac{1}{2}x\bigg (g(x,y)-\sum _{i\ge {}0}f_{i0}x^i\bigg )+\dfrac{1}{2}y\bigg (g(x,y)-\sum _{j\ge {}0}f_{0j}y^j\bigg ). \end{aligned} \end{aligned}$$

(B.28)

Hence, Eq. (B.27) becomes

$$\begin{aligned} g(x,y)&=f_{00}+\sum _{j\ge {}1}f_{0j}y^j+\sum _{i\ge {}1}f_{i0}x^i+\dfrac{1}{2}x\bigg (g(x,y)-\sum _{i\ge {}0}f_{i0}x^i\bigg )\\&\quad +\dfrac{1}{2}y\bigg (g(x,y)-\sum _{j\ge {}0}f_{0j}y^j\bigg ), \end{aligned}$$

which is equivalent to

$$\begin{aligned} g(x,y)\Big (1-\dfrac{1}{2}x-\dfrac{1}{2}y\Big )&=f_{00}\Big (1-\dfrac{1}{2}x-\dfrac{1}{2}y\Big )+\Big (1-\dfrac{1}{2}y\Big )\sum _{j\ge {}1}f_{0j}y^j\\&\quad +\Big (1-\dfrac{1}{2}x\Big )\sum _{i\ge {}1}f_{i0}x^i \end{aligned}$$

or

$$\begin{aligned} g(x,y)&=f_{00}+\Big (1-\dfrac{1}{2}y\Big )\dfrac{1}{1-\dfrac{1}{2}x-\dfrac{1}{2}y}\sum _{j\ge {}1}f_{0j}y^j\nonumber \\&\quad +\Big (1-\dfrac{1}{2}x\Big )\dfrac{1}{1-\dfrac{1}{2}x-\dfrac{1}{2}y}\sum _{i\ge {}1}f_{i0}x^i. \end{aligned}$$

(B.29)

Taking into account the initial conditions, we find that

$$\begin{aligned} \sum _{j\ge {}1}f_{0j}y^j=\sum _{j\ge {}1}\alpha ^{j-1}y^j =\dfrac{1}{\alpha }\sum _{j\ge {}1}(\alpha {}y)^j\quad \text {and}\quad \sum _{i\ge {}1}f_{i0}x^i=\dfrac{1}{\alpha }\sum _{i\ge {}1}(\alpha {}x)^i, \end{aligned}$$

(B.30)

which we substitute into Eq. (B.29) to obtain

$$\begin{aligned} g(x,y)&=1+\Big (1-\dfrac{1}{2}y\Big )\dfrac{1}{1-\frac{1}{2}x-\frac{1}{2}y}\dfrac{1}{\alpha }\sum _{j\ge {}1}(\alpha {}y)^j\nonumber \\&\quad +\Big (1-\dfrac{1}{2}x\Big )\dfrac{1}{1-\frac{1}{2}x-\frac{1}{2}y}\dfrac{1}{\alpha }\sum _{i\ge {}1}(\alpha {}x)^i. \end{aligned}$$

(B.31)

Making use of the well-known symmetric, bivariate generating function of the binomial coefficients

$$\begin{aligned} \dfrac{1}{1-s-t}=\sum _{i,j\ge {}0}\left( {\begin{array}{c}i+j\\ i\end{array}}\right) s^it^j, \end{aligned}$$

(B.32)

we can rewrite Eq. (B.31) as

$$\begin{aligned} \begin{aligned} g(x,y)&=1+\Big (1-\dfrac{1}{2}y\Big )\dfrac{1}{\alpha }\sum _{j\ge {}1}(\alpha {}y)^j\sum _{i,j\ge {}0}\left( {\begin{array}{c}i+j\\ i\end{array}}\right) \dfrac{x^iy^j}{2^{i+j}}\\&\quad +\Big (1-\dfrac{1}{2}x\Big )\dfrac{1}{\alpha }\sum _{i\ge {}1}(\alpha {}x)^i\sum _{i,j\ge {}0}\left( {\begin{array}{c}i+j\\ i\end{array}}\right) \dfrac{x^iy^j}{2^{i+j}} \\&= \Big (1-\dfrac{1}{2}y\Big ) \sum _{i,j\ge {}0} \sum _{q=0}^{j-1}\left( {\begin{array}{c}i+q\\ i\end{array}}\right) \dfrac{\alpha {}^{j-q-1}}{2^{i+q}}x^iy^j\\&\quad + \Big (1-\dfrac{1}{2}x\Big ) \sum _{i,j\ge {}0}\sum _{q=0}^{i-1}\left( {\begin{array}{c}j+q\\ q\end{array}}\right) \dfrac{\alpha {}^{i-q-1}}{2^{j+q}}x^iy^j. \end{aligned} \end{aligned}$$

Rearranging sums in the above expression, we find

$$\begin{aligned} \begin{aligned} g(x,y)&= \sum _{i,j\ge {}0}\bigg [ \sum _{q=0}^{j-1}\left( {\begin{array}{c}i+q\\ i\end{array}}\right) \dfrac{\alpha {}^{j-q-1}}{2^{i+q}} - \sum _{q=0}^{j-2}\left( {\begin{array}{c}i+q\\ i\end{array}}\right) \dfrac{\alpha {}^{j-q-2}}{2^{i+q+1}} \\&\quad +\sum _{q=0}^{i-1}\left( {\begin{array}{c}j+q\\ q\end{array}}\right) \dfrac{\alpha {}^{i-q-1}}{2^{j+q}} - \sum _{q=0}^{i-2}\left( {\begin{array}{c}j+q\\ q\end{array}}\right) \dfrac{\alpha {}^{i-q-2}}{2^{j+q+1}}\bigg ]x^{i}y^j. \end{aligned} \end{aligned}$$

Hence, we obtain the following exact expression for the function $f_{ij}$,

$$\begin{aligned} f_{ij}&=\sum _{q=0}^{j-1}\left( {\begin{array}{c}i+q\\ q\end{array}}\right) \dfrac{\alpha {}^{j-q-1}}{2^{i+q}} - \sum _{q=0}^{j-2}\left( {\begin{array}{c}i+q\\ q\end{array}}\right) \dfrac{\alpha {}^{j-q-2}}{2^{i+q+1}}\nonumber \\&\quad + \sum _{q=0}^{i-1}\left( {\begin{array}{c}j+q\\ q\end{array}}\right) \dfrac{\alpha {}^{i-q-1}}{2^{j+q}} - \sum _{q=0}^{i-2}\left( {\begin{array}{c}j+q\\ q\end{array}}\right) \dfrac{\alpha {}^{i-q-2}}{2^{j+q+1}}. \end{aligned}$$

(B.33)

The expression in Eq. (B.33) can be simplified further due to its symmetry with respect to the indices i and j: we write $f_{ij}=f(i,j)+f(j,i)$, where f(i, j) is defined as

$$\begin{aligned} f(i,j)=\sum _{q=0}^{j-1}\left( {\begin{array}{c}i+q\\ q\end{array}}\right) \dfrac{\alpha {}^{j-q-1}}{2^{i+q}} - \sum _{q=0}^{j-2}\left( {\begin{array}{c}i+q\\ q\end{array}}\right) \dfrac{\alpha {}^{j-q-2}}{2^{i+q+1}}. \end{aligned}$$

(B.34)

The function f(i, j) can be further simplified as

$$\begin{aligned} f(i,j)= & {} \left( {\begin{array}{c}i+j-1\\ j-1\end{array}}\right) \dfrac{1}{2^{i+j-1}}+ 2\alpha {}\sum _{q=0}^{j-2}\left( {\begin{array}{c}i+q\\ q\end{array}}\right) \dfrac{\alpha {}^{j-q-2}}{2^{i+q+1}}- \sum _{q=0}^{j-2}\left( {\begin{array}{c}i+q\\ q\end{array}}\right) \dfrac{\alpha {}^{j-q-2}}{2^{i+q+1}} \\= & {} \left( {\begin{array}{c}i+j-1\\ j-1\end{array}}\right) \dfrac{1}{2^{i+j-1}}+ (2\alpha {}-1)\dfrac{\alpha {}^{j-2}}{2^{i+ 1}}\sum _{q=0}^{j-2}\left( {\begin{array}{c}i+q\\ q\end{array}}\right) \Big (\dfrac{1}{2\alpha }\Big )^q; \\ \end{aligned}$$

(B.35)

next, we use the identity

$$\begin{aligned} \sum _{q=0}^{j}\left( {\begin{array}{c}i+q\\ i\end{array}}\right) x^q =\dfrac{1}{(1-x)^{i+1}}-x^{j+1}\left( {\begin{array}{c}j+1+i\\ j+1\end{array}}\right) {}_2F_1(1,j+i+2;j+2;x), \nonumber \\ \end{aligned}$$

(B.36)

where ${}_2F_1$ is again the generalised hypergeometric function of the second kind (Digital Library of Mathematical Functions 2020a). Note that the above identity can be used only when $|x|< 1$, as the power series defining ${}_2F_1$ does not converge otherwise; for $x=1/(2\alpha )$, which is the value required here, this condition reads $\alpha >1/2$.

Hence, Eq. (B.35) becomes

$$\begin{aligned} f(i,j)= & {} \left( {\begin{array}{c}i+j-1\\ j-1\end{array}}\right) \dfrac{1}{2^{i+j-1}} + (2\alpha {}-1)\dfrac{\alpha {}^{j-2}}{2^{i+1}}\\&\quad \times \bigg [\Big (\dfrac{2\alpha }{2\alpha -1}\Big )^{i+1}-\dfrac{1}{(2\alpha )^{j-1}} \left( {\begin{array}{c}j+i-1\\ j-1\end{array}}\right) {}_2F_1\big (1,j+i;j;\tfrac{1}{2\alpha }\big ) \bigg ] \\= & {} \dfrac{\alpha {}^{i+j-1}}{(2\alpha {}-1)^{i}}+ \dfrac{1}{2^{i+j-1}}\left( {\begin{array}{c}i+j-1\\ i\end{array}}\right) \Big [ 1-\dfrac{2\alpha -1}{2\alpha }{}_2F_1\big (1,j+i;j;\tfrac{1}{2\alpha }\big )\Big ] \\ \end{aligned}$$

(B.37)

Given the expression for f(i, j) in Eq. (B.37), one can find the corresponding expression for f(j, i) by exchanging the indexes $i\leftrightarrow {}j$. $\square $
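The three representations of $f_{ij}$ appearing in Lemma B.5 can be compared numerically. The Python sketch below is an informal check, not part of the proof. It evaluates $f_{ij}$ from the recurrence in Eq. (B.1), from the finite sums in Eq. (B.34), and from the hypergeometric form in Eq. (B.26). The hypergeometric function is summed directly from its power series, which is an implementation choice made here and assumes $\alpha >1/2$; all function names are hypothetical.

```python
import math
from fractions import Fraction

def f_table(L, alpha):
    """f_ij for 0 <= i, j <= L from the boundary values and recurrence in Eq. (B.1)."""
    f = [[Fraction(0)] * (L + 1) for _ in range(L + 1)]
    f[0][0] = Fraction(1)
    for j in range(1, L + 1):
        f[0][j] = f[j][0] = alpha ** (j - 1)
    for i in range(1, L + 1):
        for j in range(1, L + 1):
            f[i][j] = (f[i - 1][j] + f[i][j - 1]) / 2
    return f

def f_half_sum(i, j, alpha):
    """f(i, j) as the difference of the two finite sums in Eq. (B.34)."""
    s1 = sum(math.comb(i + q, q) * alpha ** (j - q - 1) / Fraction(2) ** (i + q)
             for q in range(j))          # q = 0, ..., j-1
    s2 = sum(math.comb(i + q, q) * alpha ** (j - q - 2) / Fraction(2) ** (i + q + 1)
             for q in range(j - 1))      # q = 0, ..., j-2
    return s1 - s2

def hyp2f1_a1(b, c, x, tol=1e-15, max_terms=100000):
    """Power series of 2F1(1, b; c; x) for |x| < 1; term n is (b)_n/(c)_n * x**n."""
    term, total, n = 1.0, 1.0, 0
    while abs(term) > tol * abs(total) and n < max_terms:
        term *= (b + n) / (c + n) * x
        total += term
        n += 1
    return total

def f_half_hyp(i, j, alpha):
    """f(i, j) from the hypergeometric form in Eq. (B.26); assumes alpha > 1/2."""
    a = float(alpha)
    x = 1.0 / (2.0 * a)
    bracket = 1.0 - (2.0 * a - 1.0) / (2.0 * a) * hyp2f1_a1(i + j, j, x)
    return (a ** (i + j - 1) / (2.0 * a - 1.0) ** i
            + math.comb(i + j - 1, i) / 2.0 ** (i + j - 1) * bracket)

alpha = Fraction(9, 10)   # arbitrary value above 1/2, for illustration only
table = f_table(5, alpha)
for i, j in [(1, 1), (2, 3), (5, 4)]:
    exact = f_half_sum(i, j, alpha) + f_half_sum(j, i, alpha)
    approx = f_half_hyp(i, j, alpha) + f_half_hyp(j, i, alpha)
    print(i, j, float(table[i][j]), float(exact), approx)
```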

C. Variance of Total RNAP Distribution

In this section, we derive the exact expression for the variance of the total RNAP distribution, as stated in Eq. (10), which is given by the sum over the covariances $\mathrm{Cov}(n_i,n_j)$ ($i,j=1,\dots {},L$), as defined in Eq. (4d). Hence, we have

$$\begin{aligned} \begin{aligned} \mathrm{Var}(n_\mathrm{tot})&=\sum _{i,j=1}^{L}\mathrm{Cov}(n_i,n_j) =\sum _{i=1}^{L}\mathrm{Var}(n_i)+\sum _{i\ne {}j}\mathrm{Cov}(n_i,n_j) \\ {}&=\sum _{i=1}^{L}\big [\langle {}n_i\rangle {}+\langle {}n_i\rangle {}^2\alpha \beta \cdot {}{f}_{ii}\big ] +\sum _{i\ne {}j}\langle {}n_i\rangle {}\langle {}n_j\rangle {}\alpha \beta \cdot {}{f}_{ij}\\&=\sum _{i=1}^{L}\langle {}n_i\rangle {}+\alpha \beta \bigg (\sum _{i=1}^{L}\langle {}n_i\rangle {}^2\cdot {}{f}_{ii}+\sum _{i\ne {}j}\langle {}n_i\rangle {}\langle {}n_j\rangle {}\cdot {}{f}_{ij}\bigg ) \\ {}&=\sum _{i=1}^{L}\langle {}n_i\rangle {}+\alpha \beta \sum _{i,j=1}^{L}\langle {}n_i\rangle {}\langle {}n_j\rangle {}\cdot {}{f}_{ij}, \end{aligned} \end{aligned}$$

(C.1)

where the function $f_{ij}$ is given in Eq. (10). The first term in Eq. (C.1) equals $\langle {}n_\mathrm{tot}\rangle {}$, the mean of the total RNAP distribution, as stated in Eq. (10); substituting in the expressions for the means $\langle {}n_i\rangle {}$ from Eq. (2b), as well, we obtain

$$\begin{aligned} \mathrm{Var}(n_\mathrm{tot}) =\langle {}n_\mathrm{tot}\rangle {}+\alpha \beta (\eta {}\rho {}_k)^2\sum _{i,j=1}^{L}\mu ^{i+j}\cdot {}{f}_{ij}. \end{aligned}$$

(C.2)

Lemma C.1

In the limit of deterministic elongation, i.e. for $L\rightarrow {}\infty $, the expression for $\mathrm{Var}(n_\mathrm{tot})$ in Eq. (10) simplifies to

$$\begin{aligned} \mathrm{Var}(n_\mathrm{tot})_{\infty }&= \langle {}n_\mathrm{tot}\rangle {}_{\infty }\nonumber \\&\quad + \beta (\eta {}r)^2\dfrac{(s_b+s_u-d)-(s_b+s_u+d)\hbox {e}^{-2d\langle {}T\rangle {}}+2d\hbox {e}^{-(s_b+s_u+d)\langle {}T\rangle {}}}{d(s_b+s_u+d)(s_b+s_u-d)}, \nonumber \\ \end{aligned}$$

(C.3)

which can be further simplified to the expression in Eq. (11).

Proof

In order to find the limit of $L\rightarrow \infty $ in Eq. (10) (or Eq. (C.2)), we have to evaluate the term $\sum _{i,j=1}^{L}\mu ^{i+j}\cdot {}{f}_{ij}$ in that limit. For the following derivation, we consider the function $f_{ij}=f(i,j)+f(j,i)$, where f(i, j) is defined in terms of sums in Eq. (B.34). Hence, we have

$$\begin{aligned} \sum _{i,j=1}^{L}\mu ^{i+j}\cdot {}{f}_{ij}= & {} \sum _{i,j=1}^{L} \mu ^{i+j}{f}(i,j)+\sum _{i,j=1}^{L} \mu ^{i+j}{f}(j,i) =2\sum _{i,j=1}^{L}\mu ^{i+j}{f}(i,j)\nonumber \\= & {} 2\bigg [\sum _{i,j=1}^{L}\sum _{q=0}^{j-1} \mu ^{i+j}\left( {\begin{array}{c}i+q\\ q\end{array}}\right) \dfrac{\alpha {}^{j-q-1}}{2^{i+q}} - \sum _{i,j=1}^{L}\sum _{q=0}^{j-2} \mu ^{i+j}\left( {\begin{array}{c}i+q\\ q\end{array}}\right) \dfrac{\alpha {}^{j-q-2}}{2^{i+q+1}}\bigg ] \nonumber \\= & {} \underbrace{2 \sum _{i,j=1}^{L} \mu ^{i+j}\left( {\begin{array}{c}i+j-1\\ i\end{array}}\right) \dfrac{1}{2^{i+j-1}}}_{G_1} +\underbrace{2(2\alpha -1) \sum _{i,j=1}^{L}\sum _{q=0}^{j-2} \mu ^{i+j}\left( {\begin{array}{c}i+q\\ q\end{array}}\right) \dfrac{\alpha {}^{j-q-2}}{2^{i+q+1}}}_{G_2}.{\nonumber }\\ \end{aligned}$$

(C.4)

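Substituting $k\rightarrow {}L/\langle {}T\rangle {}-d$ in Eq. (C.4) and taking the limit of $L\rightarrow {}\infty $, we find that the contribution of $G_1$ to the variance vanishes, i.e. $\alpha \beta (\eta \rho _k)^2G_1\underset{L\rightarrow \infty }{\longrightarrow }{}{}0$, since $G_1$ grows at most linearly with L whereas $\rho _k^2$ decays as $L^{-2}$; hence, $\mathrm{Var}(n_\mathrm{tot})$ evaluates to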

$$\begin{aligned} \mathrm{Var}(n_\mathrm{tot})_{\infty }=\langle {}n_\mathrm{tot}\rangle {}_{\infty }+ \lim _{L\rightarrow \infty }[\alpha \beta (\eta \rho _k)^2 G_2] \end{aligned}$$

(C.5)

in that limit, which yields the expression in Eq. (C.3), as can easily be verified with the computer algebra package Mathematica. Hence, in the limit of deterministic elongation, the expression for the variance of the RNAP distribution in Eq. (10) reduces to the one in Eq. (11), as claimed. $\square $
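Independently of computer algebra, the limit in Lemma C.1 can be examined numerically. The following Python sketch is an illustration under stated assumptions: it takes $\mu =k/(k+d)$, $\rho _k=r/k$, and $\alpha =(k+d)/(s_u+s_b+k+d)$, as implied by Eqs. (B.3) and (B.8), writes `s` for $s_u+s_b$, and omits the common factor $\beta \eta ^2$. It evaluates $\alpha \rho _k^2\sum _{i,j}\mu ^{i+j}f_{ij}$ with $k=L/\langle {}T\rangle {}-d$ for increasing L and compares the result with $r^2$ times the fraction in Eq. (C.3). The function names and parameter values are hypothetical.

```python
import math

def f_table_float(L, alpha):
    """Floating-point f_ij for 0 <= i, j <= L from the recurrence in Eq. (B.1)."""
    f = [[0.0] * (L + 1) for _ in range(L + 1)]
    f[0][0] = 1.0
    for j in range(1, L + 1):
        f[0][j] = f[j][0] = alpha ** (j - 1)
    for i in range(1, L + 1):
        for j in range(1, L + 1):
            f[i][j] = 0.5 * (f[i - 1][j] + f[i][j - 1])
    return f

def scaled_pair_sum(L, T, d, s, r=1.0):
    """alpha * rho_k**2 * sum_{i,j=1}^{L} mu**(i+j) * f_ij with k = L/T - d."""
    k = L / T - d
    mu = k / (k + d)
    alpha = (k + d) / (s + k + d)
    f = f_table_float(L, alpha)
    mu_pow = [mu ** i for i in range(L + 1)]
    total = 0.0
    for i in range(1, L + 1):
        row = f[i]
        total += mu_pow[i] * sum(mu_pow[j] * row[j] for j in range(1, L + 1))
    return alpha * (r / k) ** 2 * total

def limit_pair_sum(T, d, s, r=1.0):
    """r**2 times the fraction on the right-hand side of Eq. (C.3); needs s != d."""
    num = (s - d) - (s + d) * math.exp(-2 * d * T) + 2 * d * math.exp(-(s + d) * T)
    return r ** 2 * num / (d * (s + d) * (s - d))

# Hypothetical parameter values: mean elongation time T, detachment rate d, s = s_u + s_b.
T, d, s = 1.0, 0.5, 1.0
limit_value = limit_pair_sum(T, d, s)
ratios = {L: scaled_pair_sum(L, T, d, s) / limit_value for L in (50, 100, 200, 400)}
for L, ratio in ratios.items():
    print(L, ratio)
```

The printed ratios are expected to approach one as L grows, with a deviation that shrinks roughly in proportion to $1/L$.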

D. Moments of Total RNAP and Mature RNA in Bursty and Constitutive Limits

Moments of total RNAP in the bursty limit: In the bursty limit, the expressions for the mean and variance of the total RNAP distribution given in Eq. (10) simplify to

$$\begin{aligned} \begin{aligned} \langle {}n_\mathrm{tot}\rangle {}_b =b\dfrac{s_u}{k}\mu \dfrac{\mu ^L-1}{\mu -1}, \quad \text {and}\quad \mathrm{Var}(n_\mathrm{tot})_b =\langle {}n_\mathrm{tot}\rangle {}_b+b^2\dfrac{s_u}{k}\sum _{i,j=1}^{L}\mu ^{i+j}\cdot {}{h}_{ij}. \end{aligned} \nonumber \\ \end{aligned}$$

(D.1)

If, furthermore, we take the limit of deterministic elongation, with $L \rightarrow \infty $ at constant $\langle T \rangle $, Eq. (D.1) simplifies to

$$\begin{aligned} \begin{aligned} \langle {}n_\mathrm{tot}\rangle {}_{(b;\infty )}&=b\dfrac{s_u}{d}(1-\hbox {e}^{-T_d})\quad \text {and}\quad \mathrm{Var}(n_\mathrm{tot})_{(b;\infty )}\\&=\langle {}n_\mathrm{tot}\rangle {}_{(b;\infty )}+\langle {}n_\mathrm{tot}\rangle {}_{(b;\infty )}^2 \dfrac{d}{s_u}\dfrac{1+\hbox {e}^{-T_d}}{1-\hbox {e}^{-T_d}}, \end{aligned} \end{aligned}$$

(D.2)

where the subscript $(b;\infty )$ denotes the bursty limit with infinite L. In the limit of zero RNAP detachment, Eq. (D.2) further simplifies to

$$\begin{aligned} \begin{aligned} \langle {}n_\mathrm{tot}\rangle {}_{(b;\infty ;0)} =b s_u\langle {}T\rangle {}\quad \text {and}\quad \mathrm{Var}(n_\mathrm{tot})_{(b;\infty ;0)}=\langle {}n_\mathrm{tot}\rangle {}_{(b;\infty ;0)}(1+2b), \end{aligned} \end{aligned}$$

(D.3)

where the subscript $(b;\infty ;0)$ denotes the bursty limit, with $L\rightarrow {}\infty $ and $d\rightarrow {}0$.
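The chain of limits from Eq. (D.2) to Eq. (D.3) can be checked numerically. The following is a minimal sketch, assuming that $T_d=d\langle {}T\rangle {}$ (consistent with the limits above); the function names and parameter values are illustrative and do not come from the main text.

```python
import math

def bursty_rnap_moments(b, s_u, d, T):
    """Mean and variance of total RNAP from Eq. (D.2), assuming T_d = d * T."""
    T_d = d * T
    survive = -math.expm1(-T_d)            # 1 - exp(-T_d), stable for small T_d
    mean = b * (s_u / d) * survive
    # (1 + exp(-T_d)) / (1 - exp(-T_d)) written as (2 - survive) / survive
    var = mean + mean**2 * (d / s_u) * (2.0 - survive) / survive
    return mean, var

def bursty_rnap_moments_no_detachment(b, s_u, T):
    """Eq. (D.3): the d -> 0 limit of Eq. (D.2)."""
    mean = b * s_u * T
    return mean, mean * (1.0 + 2.0 * b)

# Illustrative (hypothetical) parameter values.
b, s_u, T = 5.0, 0.2, 10.0
mean_small_d, var_small_d = bursty_rnap_moments(b, s_u, d=1e-6, T=T)
mean_0, var_0 = bursty_rnap_moments_no_detachment(b, s_u, T)
print(mean_small_d, mean_0)
print(var_small_d, var_0)
```

For very small $d$, the two pairs of values should agree closely, which reflects the fact that the factor $(d/s_u)(1+\hbox {e}^{-T_d})/(1-\hbox {e}^{-T_d})$ tends to $2/(s_u\langle {}T\rangle {})$.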

Moments of total RNAP in the constitutive limit: In the constitutive limit, Eq. (10) simplifies to

$$\begin{aligned} \begin{aligned} \langle {}n_\mathrm{tot}\rangle {}_{c} =\dfrac{r}{k}\mu \dfrac{\mu ^L-1}{\mu -1}=\mathrm{Var}(n_\mathrm{tot})_{c}. \end{aligned} \end{aligned}$$

(D.4)

If, furthermore, we take the limit of deterministic elongation, i.e. $L \rightarrow \infty $ at constant $\langle T \rangle $, Eq. (D.4) simplifies to

$$\begin{aligned} \begin{aligned} \langle {}n_\mathrm{tot}\rangle {}_{(c;\infty )}= \dfrac{r}{d}(1-\hbox {e}^{-T_d})=\mathrm{Var}(n_\mathrm{tot})_{(c;\infty )}; \end{aligned} \end{aligned}$$

(D.5)

finally, in the limit of zero RNAP detachment, Eq. (D.5) further simplifies to

$$\begin{aligned} \begin{aligned} \langle {}n_\mathrm{tot}\rangle {}_{(c;\infty ;0)}=r\langle {}T\rangle {}=\mathrm{Var}(n_\mathrm{tot})_{(c;\infty ;0)}. \end{aligned} \end{aligned}$$

(D.6)
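The convergence of Eq. (D.4) to Eq. (D.5) as $L\rightarrow {}\infty $ at constant $\langle {}T\rangle {}$ can be illustrated numerically. The sketch below assumes $\mu =k/(k+d)$, $k=L/\langle {}T\rangle {}-d$, and $T_d=d\langle {}T\rangle {}$; the function names and parameter values are hypothetical.

```python
import math

def constitutive_mean_finite_L(r, d, T, L):
    """Mean (= variance) of total RNAP from Eq. (D.4), with k = L/T - d."""
    k = L / T - d                  # keeps the mean elongation time fixed at T
    mu = k / (k + d)               # assumed definition of mu
    return (r / k) * sum(mu**i for i in range(1, L + 1))

def constitutive_mean_deterministic(r, d, T):
    """L -> infinity limit from Eq. (D.5), assuming T_d = d * T."""
    return (r / d) * (1.0 - math.exp(-d * T))

# Illustrative (hypothetical) parameter values.
r, d, T = 2.0, 0.5, 3.0
limit_value = constitutive_mean_deterministic(r, d, T)
errors = [abs(constitutive_mean_finite_L(r, d, T, L) - limit_value)
          for L in (10, 100, 1000, 10000)]
print(limit_value, errors)
```

The error shrinks roughly in proportion to $1/L$, so a few thousand gene segments already reproduce the deterministic-elongation value to several digits for these parameter values.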

Moments of mature RNA distribution in the bursty limit: In that limit, the closed-form expressions in Eq. (8) are given by

$$\begin{aligned} \begin{aligned} \langle n\rangle _b =b\upsilon _m\mu {}^{L}\quad \text {and}\quad \mathrm{Var}(n)_b =\langle {}n\rangle {}_b+\langle {}n\rangle {}^2_b(\upsilon _k\mu )^{-1}\cdot {} h_{MM}, \end{aligned} \end{aligned}$$

(D.7)

which in the limit of deterministic elongation simplify to

$$\begin{aligned} \begin{aligned} \langle {} n \rangle {}_{(b;\infty )}=b\upsilon _m \hbox {e}^{-T_d}\quad \text {and}\quad \mathrm{Var}(n)_{(b;\infty )}=\langle {} n \rangle {}_{(b;\infty )}+\langle {} n \rangle {}_{(b;\infty )}^2\upsilon _m^{-1}. \end{aligned} \end{aligned}$$

(D.8)

In the limit of zero RNAP detachment, these expressions further simplify to

$$\begin{aligned} \begin{aligned} \langle {} n \rangle {}_{(b;\infty ;0)}=b\upsilon _m\quad \text {and}\quad \mathrm{Var}(n)_{(b;\infty ;0)}=\langle {} n \rangle {}_{(b;\infty ;0)}+\langle {} n \rangle {}_{(b;\infty ;0)}^2\upsilon _m^{-1}. \end{aligned} \end{aligned}$$

(D.9)

E. Introduction to Geometric Singular Perturbation Theory (GSPT)

We consider a system of first-order autonomous ordinary differential equations in the general (‘standard’) form

$$\begin{aligned} \varepsilon \dot{\mathbf {x}}&=\mathbf {f}(\mathbf {x},\mathbf {y},\varepsilon ), \end{aligned}$$

(E.1)

$$\begin{aligned} \dot{\mathbf {y}}&=\mathbf {g}(\mathbf {x},\mathbf {y},\varepsilon ), \end{aligned}$$

(E.2)

where $(\mathbf {x},\mathbf {y})\in {}\mathbb {R}^m\times {}\mathbb {R}^l$, with $m,l\in {}\mathbb {N}$. Here, $0<\varepsilon {}\ll {}1$ is a (real) singular perturbation parameter, and the overdot denotes differentiation with respect to the ‘slow’ time t. (Correspondingly, Eq. (E.1) is referred to as the ‘slow’ system.) The variable $\mathbf {x}$ is referred to as the ‘fast variable’, while $\mathbf {y}$ is the ‘slow variable’. For simplicity, the functions $\mathbf {f}:\mathbb {R}^m\times {}\mathbb {R}^l\times {}\mathbb {R}^+\rightarrow {}\mathbb {R}^m$ and $\mathbf {g}:\mathbb {R}^m\times {}\mathbb {R}^l\times {}\mathbb {R}^+\rightarrow {}\mathbb {R}^l$ are assumed to be $\mathcal {C}^{\infty }$-smooth in all their arguments. In the context of our analysis of the characteristic system in Eq. (26), we have the ‘slow system’

$$\begin{aligned} \varepsilon {}{\dot{u}}_i&=u_i-u_{i+1}+(d/k)u_i \qquad \text {for }i=1,\dots {},L-1, \end{aligned}$$

(E.3a)

$$\begin{aligned} \varepsilon {}{\dot{u}}_L&=u_L-u+(d/k)u_L, \end{aligned}$$

(E.3b)

$$\begin{aligned} {\dot{u}}&=u, \end{aligned}$$

(E.3c)

$$\begin{aligned} {\dot{F}}_{0}&=(s_u/d_m)F_{1}-(s_b/d_m)F_{0}+(r/d_m) u_1F_{0}, \end{aligned}$$

(E.3d)

$$\begin{aligned} {\dot{F}}_{1}&=(s_b/d_m)F_{0}-(s_u/d_m)F_{1}. \end{aligned}$$

(E.3e)

By comparing the system of equations in Eq. (E.3) with the general form in Eq. (E.1), we see that $u_i$ ($i=1,\dots {},L$) are the fast variables, while u, $F_0$, and $F_1$ are slow. Correspondingly, we have $m=L$ and $l=3$ in the above notation, which implies $\mathbf {f}=(f_1,f_2,\dots {},f_L)$, with $f_i=f_i(u_i,u_{i+1})=u_i-u_{i+1}+(d/k)u_i$ for $i=1,\dots {},L-1$, $f_L=f_L(u_L,u)=u_L-u+(d/k) u_L$, and $\mathbf {g}=(g_1,g_2,g_3)(u_1,u,F_{0},F_{1})=(u,(s_u/d_m)F_{1}-(s_b/d_m)F_{0}+(r/d_m) u_1F_{0},(s_b/d_m)F_{0}-(s_u/d_m)F_{1})$.

Now, we introduce a new ‘fast’ time $\tau {}=t/\varepsilon $, which we substitute into Eq. (E.1) to find the ‘fast system’

$$\begin{aligned} \mathbf {x^{\prime }}&=\mathbf {f}(\mathbf {x},\mathbf {y},\varepsilon ), \end{aligned}$$

(E.4a)

$$\begin{aligned} \mathbf {y^{\prime }}&=\varepsilon {}\mathbf {g}(\mathbf {x},\mathbf {y},\varepsilon ) \end{aligned}$$

(E.4b)

corresponding to Eq. (E.1); here, the prime denotes the derivative with respect to $\tau {}$. Hence, rewriting Eq. (E.3) in the fast formulation, we find

$$\begin{aligned} {u}^{\prime }_i&=u_i-u_{i+1}+(d/k) u_i \qquad \text {for }i=1,\dots {},L-1, \end{aligned}$$

(E.5a)

$$\begin{aligned} {u}^{\prime }_L&=u_L-u+(d/k) u_L, \end{aligned}$$

(E.5b)

$$\begin{aligned} {u}^{\prime }&=\varepsilon {}u, \end{aligned}$$

(E.5c)

$$\begin{aligned} {F}^{\prime }_{0}&=\varepsilon {}[(s_u/d_m)F_{1}-(s_b/d_m)F_{0}+(r/d_m) u_1F_{0}], \end{aligned}$$

(E.5d)

$$\begin{aligned} {F}^{\prime }_{1}&=\varepsilon {}[(s_b/d_m)F_{0}-(s_u/d_m)F_{1}]. \end{aligned}$$

(E.5e)

For positive $\varepsilon $, the systems in Eqs. (E.1) and (E.4)—and, correspondingly, the systems in Eqs. (E.3) and (E.5)—are equivalent; however, in the singular limit of $\varepsilon \rightarrow {}0$, we obtain two different systems: setting $\varepsilon =0$ in Eq. (E.1), we have the ‘reduced problem’

$$\begin{aligned} \mathbf {0}&=\mathbf {f}(\mathbf {x},\mathbf {y},0),\end{aligned}$$

(E.6a)

$$\begin{aligned} \dot{\mathbf {y}}&=\mathbf {g}(\mathbf {x},\mathbf {y},0), \end{aligned}$$

(E.6b)

while we obtain the ‘layer problem’

$$\begin{aligned} \mathbf {x'}&=\mathbf {f}(\mathbf {x},\mathbf {y},0), \end{aligned}$$

(E.7a)

$$\begin{aligned} \mathbf {y'}&=\mathbf {0} \end{aligned}$$

(E.7b)

for $\varepsilon =0$ in Eq. (E.4). The ‘reduced problem’ for the system in Eq. (E.3) implies that the flow of $(u,F_0,F_1)$ is constrained to lie on the $(l=3)$-dimensional ‘critical manifold’ $\mathcal {S}_0$ that is defined by $\mathbf {f}=\mathbf {0}$:

$$\begin{aligned} u_i=\mu \cdot {}u_{i+1}=\mu ^{L-i+1}\cdot {}u\qquad \text {for }i=1,\dots ,L, \end{aligned}$$

(E.8)

where $u_{L+1}\equiv {}u$ and $(F_{0},F_{1})$ are assumed to vary in an appropriately chosen subset of $\mathbb {R}^2$.

From the ‘layer problem’ of the system in Eq. (E.3), we conclude that $\mathbf {y}=(u,F_0,F_1)$ is a parameter which parameterises the $(m=L)$-dimensional flow of $u'_i=f_i$ ($i=1,\dots ,L$), the equilibria of which are located on $\mathcal {S}_0$.

The Jacobian matrix $\mathbf {D}_{\mathbf {x}}\mathbf {f}(\mathbf {x},\mathbf {y},0)$ of the ‘layer problem’ corresponding to Eq. (E.5) about $\mathcal {S}_0$ has the eigenvalues

$$\begin{aligned} \lambda _i=k(1+(d/k)-u_{i+1})=(k+d)(1-\mu ^{L-i+1}u)\qquad \text {for }i=1,\dots {},L. \nonumber \\ \end{aligned}$$

(E.9)

Since our definition of the generating function $F(z,\tau )$ in Sect. 3.2 assumed $z\in {}[-1,1]$, we may restrict to $u\in {}[-2,0]$ which, by Eq. (E.9), implies that $\lambda _i>0$. Hence, the critical manifold $\mathcal {S}_0$ is ‘normally hyperbolic’—and, in fact, normally repelling—with an $(m+l=L+3)$-dimensional unstable manifold $\mathcal {W}^\mathrm{u}(\mathcal {S}_0)$.

The geometric singular perturbation theory due to Fenichel (1979) thus implies that $\mathcal {S}_0$ will persist, for $\varepsilon $ positive and sufficiently small, as a ‘slow manifold’ $\mathcal {S}_\varepsilon $ that is (locally) invariant, smooth, and $\mathcal {O}(\varepsilon )$-close to $\mathcal {S}_0$. (As the unstable manifold $\mathcal {W}^\mathrm{u}(\mathcal {S}_0)$ equals the entire phase space of Eq. (E.3), it trivially persists as the unstable manifold $\mathcal {W}^\mathrm{u}(\mathcal {S}_\varepsilon )$ for $\mathcal {S}_\varepsilon $.) In particular, as $\mathcal {S}_0$ is repelling in forward time, it follows that the inverse characteristic transformation corresponding to Eq. (26) is well defined in backward time; details can be found in Veerman et al. (2018) and Popović et al. (2016).
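The algebraic constraint $\mathbf {f}=\mathbf {0}$ that defines $\mathcal {S}_0$ can be solved by back-substitution, starting from $u_{L+1}\equiv {}u$ and applying $u_i=\mu \cdot {}u_{i+1}$. The following minimal sketch does so and checks that the right-hand sides of Eqs. (E.3a) and (E.3b) vanish; it assumes $\mu =k/(k+d)$, and the function names and numerical values are illustrative only.

```python
def critical_manifold(u, k, d, L):
    """Solve f_i = 0 for u_1..u_L by back-substitution from u_{L+1} = u."""
    mu = k / (k + d)                     # assumed definition of mu
    values = [0.0] * (L + 2)             # 1-based storage; values[L + 1] holds u
    values[L + 1] = u
    for i in range(L, 0, -1):
        values[i] = mu * values[i + 1]   # u_i = mu * u_{i+1}
    return values

def fast_residuals(values, k, d, L):
    """Right-hand sides f_i of Eqs. (E.3a) and (E.3b) at the given point."""
    return [values[i] - values[i + 1] + (d / k) * values[i]
            for i in range(1, L + 1)]

# Illustrative (hypothetical) values; u is taken from [-2, 0].
k, d, L, u = 4.0, 0.5, 6, -1.2
vals = critical_manifold(u, k, d, L)
res = fast_residuals(vals, k, d, L)
mu = k / (k + d)
print(max(abs(x) for x in res), vals[1], mu**L * u)
```

In particular, the first segment satisfies $u_1=\mu ^L u$, which is the only fast variable that enters the slow equation for $F_0$.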

F. Variance of Fluctuating Total Fluorescent Signal

By definition, the variance of the total fluorescent signal is given by the sum over all elements $\mathrm{Cov}(r_i,r_j)$ for $i,j=1,\dots {},L$, where $r_i=(\nu {}/L)in_i$; the corresponding definitions can be found in Sect. 4 of the main text. Hence, we have that

$$\begin{aligned} \begin{aligned} \mathrm{Var}(r_\mathrm{tot})&=\sum _{i,j=1}^{L}\mathrm{Cov}(r_i,r_j) =\sum _{i,j=1}^{L}\mathrm{Cov}\Big (\dfrac{\nu }{L}in_i,\dfrac{\nu }{L}jn_j\Big ) =\Big (\dfrac{\nu }{L}\Big )^2\sum _{i,j=1}^{L}ij\cdot {}\mathrm{Cov}(n_i,n_j) \\&=\Big (\dfrac{\nu }{L}\Big )^2\bigg ( \sum _{i=1}^{L}i^2\mathrm{Var}(n_i)+\sum _{i\ne {}j}ij\cdot {}\mathrm{Cov}(n_i,n_j)\bigg ) \\&=\Big (\dfrac{\nu }{L}\Big )^2\bigg ( \sum _{i=1}^{L}i^2[\langle n_i \rangle +\langle n_i \rangle ^2\alpha \beta \cdot {}{f}_{ii}] + \sum _{i\ne {}j}ij\langle n_i \rangle \langle n_j \rangle \alpha \beta \cdot {}{f}_{ij} \bigg )\\ {}&=\Big (\dfrac{\nu }{L}\Big )^2 \sum _{i=1}^{L}i^2\langle n_i \rangle +\Big (\dfrac{\nu }{L}\Big )^2\alpha \beta \bigg ( \sum _{i=1}^{L}i^2\langle n_i \rangle ^2 \cdot {}{f}_{ii}+\sum _{i\ne {}j}ij\langle n_i \rangle \langle n_j \rangle \cdot {}{f}_{ij} \bigg )\\ {}&=\Big (\dfrac{\nu }{L}\Big )^2 \sum _{i=1}^{L}i^2\langle n_i \rangle +\Big (\dfrac{\nu }{L}\Big )^2\alpha \beta \sum _{i,j=1}^{L}ij\langle n_i \rangle \langle n_j \rangle \cdot {}{f}_{ij}. \end{aligned} \end{aligned}$$

(F.1)

Substituting the expressions for the means $\langle {}n_i\rangle {}$ from Eq. (2b) into Eq. (F.1), we obtain

$$\begin{aligned} \mathrm{Var}(r_\mathrm{tot})= \Big (\dfrac{\nu }{L}\Big )^2\eta \rho {}_k \sum _{i=1}^{L}i^2\mu ^{i}+ \Big (\dfrac{\nu }{L}\Big )^2\alpha \beta (\eta \rho {}_k)^2\sum _{i,j=1}^{L}ij\cdot {}\mu ^{i+j}\cdot {}{f}_{ij}, \end{aligned}$$

(F.2)

which is the expression stated in Eq. (35).
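The first line of Eq. (F.1) is a weighted sum over a covariance matrix, with weights $\nu {}i/L$. A minimal sketch of that computation is given below; the covariance matrix used in the example is a purely diagonal (Poisson-like) placeholder with hypothetical parameter values, chosen so that the result can be compared with the first term in Eq. (F.2).

```python
def signal_variance(cov, nu):
    """Var(r_tot) = sum_{i,j} (nu*i/L) * (nu*j/L) * Cov(n_i, n_j)."""
    L = len(cov)
    w = [nu * (i + 1) / L for i in range(L)]      # weight of segment i + 1
    return sum(w[i] * w[j] * cov[i][j] for i in range(L) for j in range(L))

# Placeholder example: uncorrelated segments with Var(n_i) = <n_i> = eta*rho_k*mu^i.
nu, eta, rho_k, mu, L = 1.0, 0.4, 0.8, 0.95, 20   # hypothetical values
means = [eta * rho_k * mu**i for i in range(1, L + 1)]
cov = [[means[i] if i == j else 0.0 for j in range(L)] for i in range(L)]

var_from_cov = signal_variance(cov, nu)
var_first_term = (nu / L)**2 * eta * rho_k * sum(i**2 * mu**i for i in range(1, L + 1))
print(var_from_cov, var_first_term)
```

Replacing the placeholder by the full covariance matrix of the model would add the second term in Eq. (F.2), since the off-diagonal covariances then contribute with weight $ij$.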

G. Moments of Fluctuations in Total Fluorescent Signal in Various Limits

Deterministic elongation: Substituting $k\mapsto {}L/\langle {}T\rangle {}-d$ and taking the long-gene limit of $L\rightarrow {}\infty $ in Eq. (35), we obtain the simplified expressions

$$\begin{aligned} \begin{aligned} \langle r_\mathrm{tot} \rangle _{\infty }&=\dfrac{\nu \eta {}r}{d\,T_d}[1-(1+T_d )\hbox {e}^{-T_d}], \\ \mathrm{Var}(r_\mathrm{tot})_{\infty }&=\langle r_\mathrm{tot} \rangle _{\infty } \cdot {}\mathcal {F}_0 +\langle r_\mathrm{tot} \rangle _{\infty }^2\cdot {}\beta {}\updelta _g \dfrac{\mathcal {F}_1+\mathcal {F}_2+\mathcal {F}_3}{2(\updelta _g-1)^2(\updelta _g+1)^2[1-(1+T_d)\hbox {e}^{-T_d}]^2} , \end{aligned} \end{aligned}$$

(G.1)

where

$$\begin{aligned} \begin{aligned} \mathcal {F}_0&= \nu {}\bigg [\frac{2}{T_d}-\dfrac{T_d\hbox {e}^{-T_d}}{1-(1+T_d)\hbox {e}^{-T_d}}\bigg ], \\ \mathcal {F}_1&=(\updelta _g-1)^2(2\updelta _g+1),\\ \mathcal {F}_2&=(\updelta _g+1)^2[2\updelta _g(1+T_d)(1+T_d-T_g)-1]\hbox {e}^{-2T_d}, \\ \mathcal {F}_3&=-4\updelta _g^3(1+T_d+T_g)\hbox {e}^{-T_g}\hbox {e}^{-T_d}; \end{aligned} \end{aligned}$$

(G.2)

the expression for the variance in Eq. (G.1) is found via the same method as is used in Lemma C.1 of ‘Appendix C’. When there is no detachment of RNAP from the gene, i.e. when $d=0$, Eq. (G.1) simplifies to

$$\begin{aligned} \begin{aligned} \langle r_\mathrm{tot} \rangle _{(\infty ;0)}&=\dfrac{1}{2}\nu {}\eta {}r\langle {}T\rangle {} , \\ \mathrm{Var}(r_\mathrm{tot})_{(\infty ;0)}&= \langle r_\mathrm{tot} \rangle _{(\infty ;0)}\dfrac{2\nu {}}{3} \\&\quad +\langle r_\mathrm{tot} \rangle _{(\infty ;0)}^2\cdot {} 8\beta {}T_g^{-1} \Big [ \dfrac{1}{3}-\dfrac{1}{2}T_g^{-1}+T_g^{-3}-T_g^{-3}\big (1+T_g\big )\hbox {e}^{-T_g} \Big ]. \end{aligned} \end{aligned}$$

(G.3)

Bursty limit: In the limit when the rates $s_b$ and r are large, the expressions for the mean and variance of the total fluorescent signal given in Eq. (35) become

$$\begin{aligned} \begin{aligned} \langle r_\mathrm{tot} \rangle _b&= \nu {}b\dfrac{s_u}{d}\Big (\dfrac{k}{d}\dfrac{(1-\mu ^L)}{\mu {}L}-\mu ^L \Big ), \\ \mathrm{Var}(r_\mathrm{tot})_b&=\Big (\dfrac{\nu }{L}\Big )^2b\dfrac{s_u}{k} \sum _{i=1}^{L}i^2\mu ^{i}+ \Big (\dfrac{\nu }{L}\Big )^2b^2\dfrac{s_u}{k}\sum _{i,j=1}^{L}ij\cdot {}\mu ^{i+j}\cdot {} h_{ij}. \end{aligned} \end{aligned}$$

(G.4)

Constitutive limit: When the gene spends most of its time in the active state, Eq. (35) simplifies to

$$\begin{aligned} \begin{aligned} \langle {} r_\mathrm{tot}\rangle {}_{c}&= \dfrac{\nu }{L}\rho {}_k\mu \dfrac{1+\mu ^L[L(\mu -1)-1]}{(\mu -1)^2}, \\ \mathrm{Var}(r_\mathrm{tot})_c&=\Big (\dfrac{\nu }{L}\Big )^2\rho {}_k\mu \dfrac{1+\mu -\mu ^L[L^2\mu ^2-(2L^2+2L-1)\mu +(1+L)^2]}{(1-\mu )^3}. \end{aligned} \end{aligned}$$

(G.5)

Bursty expression with deterministic elongation: In this case, Eq. (G.4) simplifies to

$$\begin{aligned} \begin{aligned} \langle r_\mathrm{tot} \rangle _{(b;\infty )}&=\dfrac{\nu {}bs_u}{d\,T_d}\big [1-(1+T_d)\hbox {e}^{-T_d}\big ], \\ \mathrm{Var}(r_\mathrm{tot})_{(b;\infty )}&=\langle r_\mathrm{tot} \rangle _{(b;\infty )}\cdot {}\mathcal {F}_0 +\langle r_\mathrm{tot} \rangle _{(b;\infty )}^2\cdot {} \dfrac{d}{2s_u}\dfrac{1-(1+2T_d+2T_d^2)\hbox {e}^{-2T_d}}{\big [1-(1+T_d)\hbox {e}^{-T_d}\big ]^2}, \end{aligned} \end{aligned}$$

(G.6)

where $\mathcal {F}_0$ is given by Eq. (G.2). In the special case of no premature RNAP detachment from the gene ($d\rightarrow {}0$), Eq. (G.6) can be further simplified to

$$\begin{aligned} \begin{aligned} \langle r_\mathrm{tot}\rangle _{(b;\infty ;0)}&=\dfrac{1}{2}\nu {}bs_u\langle {}T\rangle {}, \\ \mathrm{Var}(r_\mathrm{tot})_{(b;\infty ;0)}&= \langle r_\mathrm{tot}\rangle _{(b;\infty ;0)}\cdot {}\dfrac{2\nu {}}{3}+ \langle r_\mathrm{tot}\rangle _{(b;\infty ;0)}^2\cdot {}\dfrac{8}{3s_u\langle {}T\rangle {}} . \end{aligned} \end{aligned}$$

(G.7)
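The reduction of Eq. (G.6) to Eq. (G.7) as $d\rightarrow {}0$ can be verified numerically. The sketch below assumes that $T_d=d\langle {}T\rangle {}$ and that the prefactor of the mean in Eq. (G.6) is $\nu {}bs_u/(d\,T_d)$, with $d$ the detachment rate; function names and parameter values are illustrative.

```python
import math

def bursty_signal_moments(nu, b, s_u, d, T):
    """Mean and variance of the total signal from Eq. (G.6), with F_0 from Eq. (G.2)."""
    T_d = d * T                                        # assumed: T_d = d * <T>
    shape = 1.0 - (1.0 + T_d) * math.exp(-T_d)
    mean = nu * b * s_u / (d * T_d) * shape
    F0 = nu * (2.0 / T_d - T_d * math.exp(-T_d) / shape)
    ratio = (1.0 - (1.0 + 2.0 * T_d + 2.0 * T_d**2) * math.exp(-2.0 * T_d)) / shape**2
    var = mean * F0 + mean**2 * (d / (2.0 * s_u)) * ratio
    return mean, var

def bursty_signal_moments_no_detachment(nu, b, s_u, T):
    """Eq. (G.7): the d -> 0 limit of Eq. (G.6)."""
    mean = 0.5 * nu * b * s_u * T
    var = mean * (2.0 * nu / 3.0) + mean**2 * 8.0 / (3.0 * s_u * T)
    return mean, var

# Illustrative (hypothetical) parameter values; d is small but not tiny,
# because the expressions in Eq. (G.6) involve cancellations as T_d -> 0.
nu, b, s_u, T = 1.0, 5.0, 0.2, 10.0
mean_d, var_d = bursty_signal_moments(nu, b, s_u, d=1e-4, T=T)
mean_0, var_0 = bursty_signal_moments_no_detachment(nu, b, s_u, T)
print(mean_d, mean_0)
print(var_d, var_0)
```

The deviation between the two sets of values is of order $T_d$, as expected from a Taylor expansion of the exponentials in Eq. (G.6).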

Constitutive expression with deterministic elongation: In this case, Eq. (G.5) simplifies to

$$\begin{aligned} \begin{aligned} \langle r_\mathrm{tot} \rangle _{(c;\infty )}&=\dfrac{\nu }{T_d}\dfrac{r}{d}\big [1-(1+T_d )\hbox {e}^{-T_d}\big ] \quad {}\text {and}\\ \mathrm{Var}(r_\mathrm{tot})_{(c;\infty )}&=\dfrac{\nu ^2}{T_d^2}\dfrac{r}{d}\big [2-(2+2T_d+T_d^2)\hbox {e}^{-T_d}\big ], \end{aligned} \end{aligned}$$

(G.8)

which reduces to

$$\begin{aligned} \begin{aligned} \langle r_\mathrm{tot} \rangle _{(c;\infty ;0)} =\dfrac{1}{2}\nu {}r\langle {}T\rangle {}\quad {}\text {and}\quad \mathrm{Var}(r_\mathrm{tot})_{(c;\infty ;0)}&=\dfrac{1}{3}\nu ^2r\langle {}T\rangle {} \end{aligned} \end{aligned}$$

(G.9)

for the special case of zero RNAP detachment from the gene.
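As a numerical illustration of the constitutive case, the finite-$L$ moments can be evaluated as direct sums and compared with Eq. (G.8). The sketch assumes that, in the constitutive limit, only the first term of Eq. (F.2) survives with $\eta =1$, so that the mean and variance are proportional to $\sum _i i\mu ^i$ and $\sum _i i^2\mu ^i$, respectively; it further assumes $\mu =k/(k+d)$, $k=L/\langle {}T\rangle {}-d$, and $T_d=d\langle {}T\rangle {}$. All names and values are hypothetical.

```python
import math

def constitutive_signal_finite_L(nu, r, d, T, L):
    """Finite-L mean and variance of the signal as direct sums over gene segments."""
    k = L / T - d
    mu = k / (k + d)
    rho_k = r / k
    mean = (nu / L) * rho_k * sum(i * mu**i for i in range(1, L + 1))
    var = (nu / L)**2 * rho_k * sum(i * i * mu**i for i in range(1, L + 1))
    return mean, var

def constitutive_signal_deterministic(nu, r, d, T):
    """Deterministic-elongation limit from Eq. (G.8)."""
    T_d = d * T
    e = math.exp(-T_d)
    mean = (nu / T_d) * (r / d) * (1.0 - (1.0 + T_d) * e)
    var = (nu**2 / T_d**2) * (r / d) * (2.0 - (2.0 + 2.0 * T_d + T_d**2) * e)
    return mean, var

# Illustrative (hypothetical) parameter values.
nu, r, d, T = 1.0, 2.0, 0.5, 3.0
mean_L, var_L = constitutive_signal_finite_L(nu, r, d, T, L=5000)
mean_inf, var_inf = constitutive_signal_deterministic(nu, r, d, T)
print(mean_L, mean_inf)
print(var_L, var_inf)
```

Unlike the mean and variance of the total RNAP number, the mean and variance of the signal differ even in the constitutive case, because each segment is weighted by $\nu {}i/L$ in the mean and by $(\nu {}i/L)^2$ in the variance.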

H. Extended Model with RNAP Pausing

Proof of Proposition 3

The new pausing model presented in Fig. 9 can be conveniently described by $2L+2$ species interacting via an effective set of $5L+4$ reactions. The vector $\vec {m}$ of the number of molecules of the respective species is given by $\vec {m}=(n_0,n^a_1,\dots {},n^a_L,n^p_1,\dots {},n^p_L,n)$; in the tables below, we summarise the respective positions of each entry in $\vec {m}$, as well as the definition of the rate functions $f_j$, for $j=1,\dots {},5L+4$.

Species	Molecule numbers	Position (in $\vec {m}$)
$G_\mathrm{on}$	$n_0$	1
$P_i$, $i\in {}\{1,\dots {},L\}$	$n^a_i$	$i+1$
$\bar{P}_i$, $i\in {}\{1,\dots {},L\}$	$n^p_i$	$i+L+1$
M	n	$2L+2$

Reaction	Rate function $f_j$
$G_\mathrm{on} {\mathop {\longrightarrow }\limits ^{s_b}}G_\mathrm{off}$	$f_1=s_b \langle n_0 \rangle $
$G_\mathrm{off}{\mathop {\longrightarrow }\limits ^{s_u}}G_\mathrm{on}$	$f_2=s_u (1 - \langle n_0 \rangle )$
$G_\mathrm{on} {\mathop {\longrightarrow }\limits ^{r}}G_\mathrm{on}+P_1$	$f_3=r\langle n_0 \rangle $
$P_i {\mathop {\longrightarrow }\limits ^{k}}P_{i+1}$, $i\in {}\{1,\dots {},L-1\}$	$f_{i+3}=k \langle n^a_i \rangle $
$P_L {\mathop {\longrightarrow }\limits ^{k}}M$	$f_{L+3}=k \langle n^a_L \rangle $
$P_i {\mathop {\longrightarrow }\limits ^{d_a}}\varnothing $, $i\in {}\{1,\dots {},L\}$	$f_{i+L+3}=d_a \langle n^a_i \rangle $
$P_i {\mathop {\longrightarrow }\limits ^{r_p}}\bar{P}_i$, $i\in {}\{1,\dots {},L\}$	$f_{i+2L+3}=r_p \langle n^a_i \rangle $
$\bar{P}_i {\mathop {\longrightarrow }\limits ^{r_a}}P_i$, $i\in {}\{1,\dots {},L\}$	$f_{i+3L+3}=r_a \langle n^p_i \rangle $
$\bar{P}_i {\mathop {\longrightarrow }\limits ^{d_p}}\varnothing $, $i\in {}\{1,\dots {},L\}$	$f_{i+4L+3}=d_p \langle n^p_i \rangle $
$M {\mathop {\longrightarrow }\limits ^{d_m}}\varnothing $	$f_{5L+4}=d_m \langle n \rangle $

Note that we do not consider $G_\mathrm{off}$ as an independent species, as a conservation law implies $\langle {}G_\mathrm{off}\rangle {}=1-\langle {}n_0\rangle {}$. Given the ordering of species and reactions as described in the above tables, we can define the $(2L+2)\times {}(5L+4)$-dimensional stoichiometry matrix $\mathbf{S} $, with nonzero elements given by

$$\begin{aligned} \begin{aligned} \mathbf{S} _{11}&=-1,&\quad {} \mathbf{S} _{12}&=1,\\ \mathbf{S} _{i,i+1}&=1,&\quad {} \mathbf{S} _{i,i+2}&=-1,&\quad {} \mathbf{S} _{i,i+L+2}&=-1, \\ \mathbf{S} _{i,i+2L+2}&=-1,&\quad {} \mathbf{S} _{i,i+3L+2}&=1,\\ \mathbf{S} _{i+L,i+2L+2}&=1,&\quad {} \mathbf{S} _{i+L,i+3L+2}&=-1,&\quad {} \mathbf{S} _{i+L,i+4L+2}&=-1,\\ \mathbf{S} _{2L+2,L+3}&=1,&\quad {} \mathbf{S} _{2L+2,5L+4}&=-1, \end{aligned} \end{aligned}$$

(H.1)

where $i=2,\dots {},L+1$. From the associated CME, it can be shown via the moment equations that the time evolution of the vector $\vec {\langle m \rangle }$ of mean molecule numbers in a system of reactions with propensities that are linear in the number of molecules is determined by $\hbox {d}\vec {\langle m \rangle }/\hbox {d}t = \mathbf {S}\cdot \vec {f}$. Given the form of the stoichiometry matrix $\mathbf {S}$ and of the rate functions $f_j$, it follows that the mean numbers of molecules of active gene, active and paused RNAP, and mature RNA in steady state can be obtained by solving the following system of $2L+2$ algebraic equations:

$$\begin{aligned} \begin{aligned} 0&=s_u(1-\langle {}n_0\rangle )-s_b\langle {}n_0\rangle ,&\\ 0&=r\langle {}n_0\rangle -(k+d_a+r_p)\langle {}n^a_1\rangle +r_a\langle {}n^p_{1}\rangle , \\ 0&=k\langle {}n^a_{i-1}\rangle -(k+d_a+r_p)\langle {}n^a_i\rangle +r_a\langle {}n^p_{i}\rangle&\qquad \text {for }i=2,\dots {},L, \\ 0&=r_p\langle {}n^a_{i}\rangle -(r_a+d_p)\langle {}n^p_{i}\rangle&\qquad \text {for }i=1,\dots {},L,\\ 0&=k\langle {}n^a_L\rangle -d_m\langle {}n\rangle .&\end{aligned} \end{aligned}$$

(H.2)

Here, we recall the definition of the following parameters from the main text: $\eta =s_u\tau _g$, where $\tau _g=1/(s_u+s_b)$ is the gene switching timescale, $\rho _k=r/k$, and $\rho =r/d_m$. Also, we define several new parameters: $\sigma =r_p/r_a$ as the ratio of the pausing and activation rates; $\pi _{r_a}=r_a/(r_a+d_p)$, which is the probability of RNAP switching to the active state; $\pi _{d_p}=d_p/(r_a+d_p)$, which is the probability of premature termination from the paused RNAP state; ${\tilde{\mu }}=k/(k+d_a+r_p\pi _{d_p})$; and $\lambda =\sigma \pi _{r_a}$. It follows that the solution of Eq. (H.2) can be written as

$$\begin{aligned} \begin{aligned} \langle n_0\rangle =\eta , \quad {} \langle n^a_i\rangle =\eta {}\rho {}_k\tilde{\mu {}}^{i}, \quad {} \langle n^p_i\rangle =\langle n^a_i\rangle \lambda , \quad {}\text {and}\quad \langle n\rangle =\eta {}\rho \tilde{\mu {}}^L. \end{aligned} \end{aligned}$$

(H.3)
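The closed-form solution in Eq. (H.3) can be checked by substituting it into the steady-state equations in Eq. (H.2) and confirming that all residuals vanish. The following is a minimal sketch of that check; the dictionary keys mirror the rate symbols of the pausing model, while the function names and numerical values are hypothetical.

```python
def pausing_steady_state(p, L):
    """Steady-state means from Eq. (H.3): (<n_0>, [<n^a_i>], [<n^p_i>], <n>)."""
    eta = p["s_u"] / (p["s_u"] + p["s_b"])
    rho_k, rho = p["r"] / p["k"], p["r"] / p["d_m"]
    pi_dp = p["d_p"] / (p["r_a"] + p["d_p"])
    lam = (p["r_p"] / p["r_a"]) * (p["r_a"] / (p["r_a"] + p["d_p"]))  # sigma * pi_{r_a}
    mu_t = p["k"] / (p["k"] + p["d_a"] + p["r_p"] * pi_dp)            # mu-tilde
    na = [eta * rho_k * mu_t**i for i in range(1, L + 1)]
    npa = [lam * x for x in na]
    return eta, na, npa, eta * rho * mu_t**L

def max_residual(p, L):
    """Largest absolute residual of the 2L + 2 equations in Eq. (H.2)."""
    n0, na, npa, n = pausing_steady_state(p, L)
    out_a = p["k"] + p["d_a"] + p["r_p"]
    res = [p["s_u"] * (1 - n0) - p["s_b"] * n0,
           p["r"] * n0 - out_a * na[0] + p["r_a"] * npa[0]]
    res += [p["k"] * na[i - 1] - out_a * na[i] + p["r_a"] * npa[i] for i in range(1, L)]
    res += [p["r_p"] * na[i] - (p["r_a"] + p["d_p"]) * npa[i] for i in range(L)]
    res.append(p["k"] * na[L - 1] - p["d_m"] * n)
    return max(abs(x) for x in res)

# Illustrative (hypothetical) rate values.
params = dict(s_u=0.3, s_b=0.7, r=2.0, k=5.0, d_a=0.1,
              r_p=0.4, r_a=0.6, d_p=0.2, d_m=1.0)
worst = max_residual(params, L=6)
print(worst)
```

The check makes explicit why ${\tilde{\mu }}$ contains the factor $r_p\pi _{d_p}$: of the RNAPs that pause on a segment, only the fraction $\pi _{d_p}$ is lost, whereas the remainder returns to the active state.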

Proof of Proposition 4

In order to solve the Lyapunov equation $\mathbf {J}\cdot {}\mathbf {C}+\mathbf {C}\cdot {}\mathbf {J}^{T}+\mathbf {D}=\mathbf {0}$ for the symmetric elements $\mathbf {C}_{ij}=\mathbf {C}_{ji}$ of the $(2L+2)\times {}(2L+2)$-dimensional covariance matrix $\mathbf {C}$, we will follow the same approach as in ‘Appendix B’. First, we define the $(2L+2)\times {}(2L+2)$-dimensional Jacobian and diffusion matrices for our system. The Jacobian matrix $\mathbf {J}$ has the following nonzero elements:

$$\begin{aligned} \begin{aligned} \mathbf {J}_{11}&=-(s_u+s_b),\\ \mathbf {J}_{21}&=r,&\quad {} \mathbf {J}_{22}&=-(k+d_a+r_p),&\quad {} \mathbf {J}_{2,2+L}&=r_a,\\ \mathbf {J}_{i,i-1}&=k,&\quad {} \mathbf {J}_{ii}&=-(k+d_a+r_p),&\quad {} \mathbf {J}_{i,i+L}&=r_a&\quad \text {for }\;i=3,\dots {},L+1,\\ \mathbf {J}_{i+L,i}&=r_p,&\quad {} \mathbf {J}_{i+L,i+L}&=-(r_a+d_p)&\quad \text {for }\;i=2,\dots {},L+1,\\ \mathbf {J}_{2L+2,L+1}&=k,&\quad {} \mathbf {J}_{2L+2,2L+2}&=-d_m, \end{aligned} \nonumber \\ \end{aligned}$$

(H.4)

while the nonzero elements of the symmetric diffusion matrix $\mathbf{D} $ are given by

$$\begin{aligned} \begin{aligned}&\mathbf {D}_{11} = s_u(1-\langle {}n_0\rangle )+s_b\langle {}n_0\rangle ,\\&\mathbf {D}_{22} = r\langle {}n_0\rangle +(k+d_a+r_p)\langle {}n^a_1\rangle +r_a\langle {}n^p_1\rangle ,&\quad {}&\mathbf {D}_{23} =-k\langle {}n^a_{1}\rangle , \qquad {}{} \mathbf {D}_{2,2+L}=-r_p\langle {}n^a_1\rangle -r_a\langle {}n^p_1\rangle ;\\&\text {for }\;i=3,\dots {},L+1:\\&\mathbf {D}_{ii} = k\langle {}n^a_{i-2}\rangle +(k+d_a+r_p)\langle {}n^a_{i-1}\rangle +r_a\langle {}n^p_{i-1}\rangle ,&\qquad {}&\mathbf {D}_{{i,i+1}{[i\le {}L]}} =-k\langle {}n^a_{i-1}\rangle , \quad \mathbf {D}_{L+1,2L+2} =-k\langle {}n^a_{L}\rangle , \\&\mathbf {D}_{i,i+L}=-r_p\langle {}n^a_{i-1}\rangle -r_a\langle {}n^p_{i-1}\rangle ; \\&\text {for }\;i=2,\dots {},L+1:\\&\mathbf {D}_{i+L,i+L}=r_p\langle {}n^a_{i-1}\rangle +(r_a+d_p)\langle {}n^p_{i-1}\rangle ,\\&\mathbf {D}_{2L+2,2L+2}=k\langle {}n^a_L\rangle +d_m\langle {}n\rangle . \end{aligned} \nonumber \\ \end{aligned}$$

(H.5)
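The entries in Eq. (H.5) follow from the standard linear-noise construction $\mathbf {D}=\mathbf {S}\,\mathrm{diag}(\vec {f})\,\mathbf {S}^{T}$, which is assumed here. The sketch below builds the reaction list in the order of the table above and compares a few entries of the resulting matrix with Eq. (H.5); the mean values passed in are arbitrary placeholders rather than steady-state values, and all names are illustrative.

```python
def diffusion_matrix(L, p, n0, na, npa, n):
    """Return D = S diag(f) S^T and the number of reactions.

    Rows are 0-based: 0 = G_on, 1..L = P_i, L+1..2L = paused P_i, 2L+1 = M.
    na[i-1] and npa[i-1] are the mean active and paused RNAP numbers on segment i.
    """
    M = 2 * L + 1
    reactions = [
        (p["s_b"] * n0, {0: -1}),                  # G_on -> G_off
        (p["s_u"] * (1 - n0), {0: +1}),            # G_off -> G_on
        (p["r"] * n0, {1: +1}),                    # initiation
    ]
    for i in range(1, L):                          # elongation P_i -> P_{i+1}
        reactions.append((p["k"] * na[i - 1], {i: -1, i + 1: +1}))
    reactions.append((p["k"] * na[L - 1], {L: -1, M: +1}))   # termination
    for i in range(1, L + 1):                      # detachment of active RNAP
        reactions.append((p["d_a"] * na[i - 1], {i: -1}))
    for i in range(1, L + 1):                      # pausing
        reactions.append((p["r_p"] * na[i - 1], {i: -1, L + i: +1}))
    for i in range(1, L + 1):                      # reactivation
        reactions.append((p["r_a"] * npa[i - 1], {L + i: -1, i: +1}))
    for i in range(1, L + 1):                      # detachment of paused RNAP
        reactions.append((p["d_p"] * npa[i - 1], {L + i: -1}))
    reactions.append((p["d_m"] * n, {M: -1}))      # mature RNA degradation

    size = 2 * L + 2
    D = [[0.0] * size for _ in range(size)]
    for rate, change in reactions:                 # accumulate f_j * S_aj * S_bj
        for a, sa in change.items():
            for b, sb in change.items():
                D[a][b] += rate * sa * sb
    return D, len(reactions)

# Placeholder inputs (hypothetical values).
L = 3
p = dict(s_u=0.3, s_b=0.7, r=2.0, k=5.0, d_a=0.1, r_p=0.4, r_a=0.6, d_p=0.2, d_m=1.0)
n0, na, npa, n = 0.3, [0.4, 0.3, 0.2], [0.1, 0.05, 0.02], 1.5
D, n_reactions = diffusion_matrix(L, p, n0, na, npa, n)
print(n_reactions, D[1][1], D[1][2], D[1][1 + L])
```

In the 1-based notation of Eq. (H.5), the printed entries correspond to $\mathbf {D}_{22}$, $\mathbf {D}_{23}$, and $\mathbf {D}_{2,2+L}$.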

Next, using the definition of $\mathbf{J} $ and $\mathbf{D} $ from Eqs. (H.4) and (H.5), respectively, we solve the Lyapunov equation. Here, we note that we are only interested in expressions for the covariances of fluctuations in active and paused RNAP, but not of mature RNA fluctuations; hence, we require closed-form expressions for the elements $\mathbf {C}_{ij}$ with $i,j\ne 2L+2$, which we derive by following the same procedure as in ‘Appendix B’.

Now, we recall that $\beta =s_b/s_u$ is the ratio of gene deactivation and activation rates, while $\tau _p=1/(k+d_a)$ is the typical time that an actively moving RNAP spends on a gene segment. Additionally, let $\tau _{r_a}=1/r_a$ be the timescale of RNAP activation from the paused state, let $\tau _{d_p}=1/d_p$ be the timescale of premature termination of paused RNAP, and let $\tau _{pp}=1/(r_a+d_p)$ be the typical time spent in the paused state. Finally, we define the following new parameters: $\lambda _{r_p}=\pi _{r_p}/(1-\pi _{r_p})$, where $\pi _{r_p}=r_p/(r_p+k+d_a)$ is the probability of actively moving RNAP switching to the paused state, as well as

$$\begin{aligned} \begin{aligned} \omega _{r_a}=\dfrac{\pi _{r_a}\tau _g}{\pi _{r_a}\tau _{r_a}+\tau _g}, \quad {} {\tilde{\alpha }}{}=\dfrac{\tau _g+\lambda _{r_p}\pi _{d_p}\tau _g}{\tau _g+\tau _p+\lambda _{r_p}\tau _g(1-\omega _{r_a})}, \quad {}\text {and}\quad \omega =\dfrac{\tau _g}{\tau _{pp}+\tau _g}; \end{aligned} \nonumber \\ \end{aligned}$$

(H.6)

then, closed-form expressions for the covariances of the active gene with itself and the remaining species are given by

$$\begin{aligned} \begin{aligned}&\mathrm{Var}(n_0)=\eta ^2\beta \cdot {}g^{aa}_{00},&\qquad {}{}&\text {where }g^{aa}_{00}=1, \\&\mathrm{Cov}(n_0,{n}^a_j)= \eta \langle {}n^a_j\rangle {}{\tilde{\alpha }}{}\beta {}\cdot {}g^{aa}_{0j},&\qquad {}{}&\text {where }g^{aa}_{0j}={\tilde{\alpha }}{}^{j-1}, \\&\mathrm{Cov}(n_0,{n}^p_j)=\eta \langle {}n^p_j\rangle {}{\tilde{\alpha }}{}\beta {}\cdot {}g^{ap}_{0j},&\qquad {}{}&\text {where }g^{ap}_{0j}=\omega {}{\tilde{\alpha }}{}^{j-1}. \end{aligned} \end{aligned}$$

(H.7)

Similarly, closed-form expressions for the covariances between all RNAP species read

$$\begin{aligned} \begin{aligned}&\mathrm{Cov}({n}^a_i,{n}^a_j)=\delta _{ij} \langle {}{n}^a_i\rangle {}+\langle {}{n}^a_i\rangle {}\langle {}{n}^a_j\rangle {}{\tilde{\alpha }}{}\beta {}\cdot {}g^{aa}_{ij},\\&\mathrm{Cov}({n}^a_i,{n}^p_j)=\langle {}{n}^a_i\rangle {}\langle {}{n}^p_j\rangle {}{\tilde{\alpha }}{}\beta {}\cdot {}g^{ap}_{ij},\\&\mathrm{Cov}({n}^p_i,{n}^a_j)=\langle {}{n}^p_i\rangle {}\langle {}{n}^a_j\rangle {}{\tilde{\alpha }}{}\beta {}\cdot {}g^{pa}_{ij},\\&\mathrm{Cov}({n}^p_i,{n}^p_j)=\delta _{ij} \langle {}{n}^p_i\rangle {}+\langle {}{n}^p_i\rangle {}\langle {}{n}^p_j\rangle {}{\tilde{\alpha }}{}\beta {}\cdot {}g^{pp}_{ij},\\ \end{aligned} \end{aligned}$$

(H.8)

where the functions ${g}^{aa}_{ij}={g}^{aa}_{ji}$, ${g}^{ap}_{ij}={g}^{pa}_{ji}$, and ${g}^{pp}_{ij}={g}^{pp}_{ji}$ satisfy the following recurrence relations:

$$\begin{aligned} \begin{aligned}&g^{aa}_{ij}=\dfrac{[(k+d_a)(r_a+d_p)+r_pd_p](g^{aa}_{i-1,j}+g^{aa}_{i,j-1})+r_ar_p(g^{ap}_{ij}+g^{pa}_{ij})}{{2(k+d_a+r_p)(r_a+d_p)}},\\&g^{ap}_{ij}=\dfrac{[(k+d_a)(r_a+d_p)+r_pd_p]g^{ap}_{i-1,j}+(r_a+d_p)^2g^{aa}_{ij}+r_ar_pg^{pp}_{ij}}{(k+d_a+r_a+r_p+d_p)(r_a+d_p)},\\&g^{pp}_{ij}=\dfrac{g^{ap}_{ij}+g^{pa}_{ij}}{2}. \end{aligned} \end{aligned}$$

(H.9)

Now, we assume that the elongation rate is much faster than the rates of RNAP pausing, activation, and premature termination, i.e. that $k\gg {}r_a,r_p,d_a,d_p$ in Eq. (H.9). Taking the limit of $k\rightarrow \infty $, we find that the expressions in Eqs. (H.7) and (H.8) remain unchanged, while Eq. (H.9) simplifies to

$$\begin{aligned} g^{aa}_{ij}&=(g^{aa}_{i-1,j}+g^{aa}_{i,j-1})/2, \end{aligned}$$

(H.10a)

$$\begin{aligned}&g^{ap}_{ij}=g^{ap}_{i-1,j}, \end{aligned}$$

(H.10b)

$$\begin{aligned}&g^{pp}_{ij}=(g^{ap}_{ij}+g^{pa}_{ij})/2; \end{aligned}$$

(H.10c)

in particular, to leading order in 1/k, the functions $g^{aa}_{ij}$, $g^{ap}_{ij}$, $g^{pa}_{ij}$, and $g^{pp}_{ij}$ hence do not depend on k. Eq. (H.10a) defines a recurrence relation for the symmetric function $g^{aa}_{ij}=g^{aa}_{ji}$ with initial conditions $g^{aa}_{00}$ and $g^{aa}_{0j}$ from Eq. (H.7). Using the same mathematical technique as in Lemma B.5, we find that the solution for the function $g^{aa}_{ij}$ is given by $g^{aa}_{ij}=g^{aa}(i,j)+g^{aa}(j,i)$, where

$$\begin{aligned} {g}^{aa}(i,j) =\dfrac{{\tilde{\alpha }}{}^{i+j-1}}{(2{\tilde{\alpha }}{}-1)^{i}} +\dfrac{1}{2^{i+j-1}}\left( {\begin{array}{c}i+j-1\\ i\end{array}}\right) \Big [ 1-\dfrac{2{\tilde{\alpha }}{}-1}{2{\tilde{\alpha }}{}}{}_2F_1\big (1,i+j;j;\tfrac{1}{2{\tilde{\alpha }}{}}\big ) \Big ];\nonumber \\ \end{aligned}$$

(H.11)

Eq. (H.10b) is a recurrence relation for the function $g^{ap}_{ij}$ with initial conditions $g^{ap}_{0j}$ from Eq. (H.7); the corresponding solution is then given by $g^{ap}_{ij}=\omega {}{\tilde{\alpha }}{}^{j-1}$. Finally, the solution of the recurrence relation in Eq. (H.10c) for $g^{pp}_{ij}$ is given by $g^{pp}_{ij}=\omega ({\tilde{\alpha }}{}^{j-1}+{\tilde{\alpha }}{}^{i-1})/2$. In sum, the leading-order asymptotics (in 1/k) of the covariances between the various RNAP species for k large is hence given by Eq. (H.8), with $g^{aa}_{ij}$, $g^{ap}_{ij}=g^{pa}_{ji}$, and $g^{pp}_{ij}$ as stated above. $\square $
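As an alternative to evaluating the closed form in Eq. (H.11), the function $g^{aa}_{ij}$ can be tabulated directly from the recurrence in Eq. (H.10a). The sketch below does so under the assumption that the boundary values are $g^{aa}_{00}=1$ and $g^{aa}_{0j}=g^{aa}_{j0}={\tilde{\alpha }}{}^{j-1}$ for $j\ge 1$, as given by Eq. (H.7) together with the symmetry of $g^{aa}_{ij}$; the function name and the value of ${\tilde{\alpha }}$ are hypothetical.

```python
def g_aa_table(alpha_t, L):
    """Tabulate g^{aa}_{ij} for 0 <= i, j <= L from the recurrence in Eq. (H.10a)."""
    g = [[0.0] * (L + 1) for _ in range(L + 1)]
    g[0][0] = 1.0
    for j in range(1, L + 1):
        g[0][j] = alpha_t**(j - 1)      # boundary values from Eq. (H.7)
        g[j][0] = g[0][j]               # symmetry g_{ij} = g_{ji}
    for i in range(1, L + 1):
        for j in range(1, L + 1):
            g[i][j] = 0.5 * (g[i - 1][j] + g[i][j - 1])
    return g

alpha_t = 0.8                            # illustrative value of alpha-tilde
g = g_aa_table(alpha_t, L=5)
print(g[1][1], g[1][2], g[2][1])
```

Each entry is an average of its two neighbours closer to the boundary, so the table is filled in a single pass at a cost proportional to $L^2$.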

Asymptotics of variance of total RNAP distribution: The variance of the total RNAP distribution for the pausing model is given by

$$\begin{aligned} \mathrm{Var}(n_\mathrm{tot})=\sum _{i,j=1}^{L}\big (\mathrm{Cov}(n^{a}_i,n^{a}_j)+\mathrm{Cov}(n^{a}_i,n^{p}_j)+\mathrm{Cov}(n^{p}_i,n^{a}_j)+\mathrm{Cov}(n^{p}_i,n^{p}_j)\big ), \nonumber \\ \end{aligned}$$

(H.12)

where the expressions for the corresponding covariances are given in Eq. (39). In order to simplify the above expression, we consider each term on the right-hand side in Eq. (H.12) separately, as follows:

$$\begin{aligned} \begin{aligned} \sum _{i,j=1}^{L}\mathrm{Cov}(n^{a}_i,n^{a}_j)&=\sum _{i,j=1}^{L}\delta _{ij}\langle {}n^a_i\rangle {}+(\eta {}\rho _k)^2{\tilde{\alpha }}{}\beta {}\sum _{i,j=1}^{L}g^{aa}_{ij},\\ \sum _{i,j=1}^{L}\mathrm{Cov}(n^{a}_i,n^{p}_j)&=(\eta {}\rho _k)^2{\tilde{\alpha }}{}\beta {}\lambda {}\sum _{i,j=1}^{L}g^{ap}_{ij},\\ \sum _{i,j=1}^{L}\mathrm{Cov}(n^{p}_i,n^{a}_j)&=(\eta {}\rho _k)^2{\tilde{\alpha }}{}\beta {}\lambda {}\sum _{i,j=1}^{L}g^{pa}_{ij},\\ \sum _{i,j=1}^{L}\mathrm{Cov}(n^{p}_i,n^{p}_j)&=\sum _{i,j=1}^{L}\delta _{ij}\langle {}n^p_i\rangle {}+(\eta {}\rho _k)^2{\tilde{\alpha }}{}\beta {}\lambda {}^2\sum _{i,j=1}^{L}g^{pp}_{ij}. \end{aligned} \end{aligned}$$

(H.13)

Since $\sum _{i,j=1}^{L}\big (\delta _{ij}\langle {}n^a_i\rangle {}+\delta _{ij}\langle {}n^p_i\rangle {}\big )=\sum _{i=1}^L\langle {}n_i\rangle {}=\langle {}n_\mathrm{tot}\rangle {}$, Eq. (H.12) becomes

$$\begin{aligned} \mathrm{Var}(n_\mathrm{tot})=\langle {}n_\mathrm{tot}\rangle {}+(\eta {}\rho _k)^2{\tilde{\alpha }}{}\beta {}\sum _{i,j=1}^{L}\big (g^{aa}_{ij}+\lambda {}g^{ap}_{ij}+\lambda {}g^{pa}_{ij}+\lambda {}^2g^{pp}_{ij}\big ). \end{aligned}$$

(H.14)

Using the expressions for the functions $g^{aa}_{ij}$, $g^{ap}_{ij}$, $g^{pa}_{ij}$, and $g^{pp}_{ij}$ from Eq. (39), we conclude that Eq. (H.14) further simplifies to

$$\begin{aligned} \mathrm{Var}(n_\mathrm{tot})=\langle {}n_\mathrm{tot}\rangle {}+(\eta {}\rho _k)^2{\tilde{\alpha }}{}\beta {} \bigg [ 2\sum _{i,j=1}^{L}g^{aa}(i,j) + \lambda (2+\lambda )\omega {}L\dfrac{{\tilde{\alpha }}{}^L-1}{{\tilde{\alpha }}{}-1} \bigg ]. \nonumber \\ \end{aligned}$$

(H.15)
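The step from Eq. (H.14) to Eq. (H.15) sums the pausing-related functions in closed form. A minimal sketch that compares the direct double sum with the closed-form term $\lambda (2+\lambda )\omega {}L({\tilde{\alpha }}{}^L-1)/({\tilde{\alpha }}{}-1)$ is given below; it uses $g^{ap}_{ij}=\omega {}{\tilde{\alpha }}{}^{j-1}$, $g^{pa}_{ij}=\omega {}{\tilde{\alpha }}{}^{i-1}$, and $g^{pp}_{ij}=(g^{ap}_{ij}+g^{pa}_{ij})/2$, and the parameter values are hypothetical.

```python
def pause_contribution_direct(alpha_t, omega, lam, L):
    """Direct sum of lam*g_ap + lam*g_pa + lam^2*g_pp over i, j = 1..L."""
    total = 0.0
    for i in range(1, L + 1):
        for j in range(1, L + 1):
            g_ap = omega * alpha_t**(j - 1)
            g_pa = omega * alpha_t**(i - 1)
            g_pp = 0.5 * (g_ap + g_pa)
            total += lam * g_ap + lam * g_pa + lam**2 * g_pp
    return total

def pause_contribution_closed(alpha_t, omega, lam, L):
    """Closed-form pausing term in Eq. (H.15); requires alpha_t != 1."""
    return lam * (2.0 + lam) * omega * L * (alpha_t**L - 1.0) / (alpha_t - 1.0)

alpha_t, omega, lam, L = 0.8, 0.6, 0.5, 12   # illustrative values
direct = pause_contribution_direct(alpha_t, omega, lam, L)
closed = pause_contribution_closed(alpha_t, omega, lam, L)
print(direct, closed)
```

Each of the three double sums reduces to $\omega {}L$ times a geometric series in ${\tilde{\alpha }}$, which is why the prefactors $2\lambda $ and $\lambda ^2$ combine into $\lambda (2+\lambda )$.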

I. Approximation of Mature RNA Distribution in Extended Model

Similarly to Sect. 3.2, we apply geometric singular perturbation theory (GSPT) to formally derive the distribution of mature RNA for the extended pausing model. As was done there, we define $P_j(\vec {n};t)$ ($j=0,1$) as the probability of the state $\vec {n}=(n^a_1,\dots {},n^a_L,n^p_1,\dots {},n^p_L,n)$ at time t while the gene is either active (0) or inactive (1); then, the time evolution of these probabilities can be described by a system of coupled CMEs:

$$\begin{aligned} \begin{aligned} \partial {}_{t}P{}_{0}&=s_uP{}_{1}-s_bP{}_{0}+r(\mathbb {E}_{n^a_1}^{-1}-1)P_{0} +k\sum _{i=1}^{L-1}(\mathbb {E}_{n^a_i}\mathbb {E}_{n^a_{i+1}}^{-1}-1)n^a_iP_{0}+k(\mathbb {E}_{n^a_L}\mathbb {E}_{n}^{-1}-1)n^a_LP_{0} \\&\quad +d_a\sum _{i=1}^{L}(\mathbb {E}_{n^a_i}-1)n^a_iP_{0} +r_p\sum _{i=1}^{L}(\mathbb {E}_{n^a_i}\mathbb {E}_{n^p_{i}}^{-1}-1)n^a_iP_{0}+r_a\sum _{i=1}^{L}(\mathbb {E}_{n^p_i}\mathbb {E}_{n^a_{i}}^{-1}-1)n^p_iP_{0} \\&\quad +d_p\sum _{i=1}^{L}(\mathbb {E}_{n^p_i}-1)n^p_iP_{0} +d_m(\mathbb {E}_{n}-1)nP{}_{0}, \\ \partial {}_{t}P_{1}&=s_bP_{0}-s_uP_{1} +k\sum _{i=1}^{L-1}(\mathbb {E}_{n^a_i}\mathbb {E}_{n^a_{i+1}}^{-1}-1)n^a_iP_{1} +k(\mathbb {E}_{n^a_L}\mathbb {E}_{n}^{-1}-1)n^a_LP_{1} \\&\quad +d_a\sum _{i=1}^{L}(\mathbb {E}_{n^a_i}-1)n^a_iP_{1} +r_p\sum _{i=1}^{L}(\mathbb {E}_{n^a_i}\mathbb {E}_{n^p_{i}}^{-1}-1)n^a_iP_{1} +r_a\sum _{i=1}^{L}(\mathbb {E}_{n^p_i}\mathbb {E}_{n^a_{i}}^{-1}-1)n^p_iP_{1} \\&\quad +d_p\sum _{i=1}^{L}(\mathbb {E}_{n^p_i}-1)n^p_iP_{1} +d_m(\mathbb {E}_{n}-1)nP{}_{1}. \end{aligned} \end{aligned}$$

(I.1)

In order to find analytical expressions for the propagator probabilities $P(\vec {n};t)$ which satisfy the system of CMEs in Eq. (I.1), we define the probability-generating functions $F_{j}(\vec {z};t)$, where $\vec {z}=(z^a_1,\dots {},z^a_L,z^p_1,\dots {},z^p_L,z)$ is a vector of variables corresponding to the state $\vec {n}$. Given the equations for $P_{j}(\vec {n};t)$ from Eq. (I.1), we obtain the following system of PDEs for the corresponding generating functions $F_{j}(\vec {z};t)$:

$$\begin{aligned} \begin{aligned} \mathbb {L}[F_{0}]&=s_uF_{1}-s_bF_{0}+r(z^a_1-1)F_{0}, \\ \mathbb {L}[F_{1}]&=s_bF_{0}-s_uF_{1}; \end{aligned} \end{aligned}$$

(I.2)

here,

$$\begin{aligned} \begin{aligned} \mathbb {L}=\partial {}_{t}+d_m(z-1)\partial {}_{z} +k\sum _{i=1}^{L-1}(z^a_{i}-z^a_{i+1})\partial {}_{z^a_{i}} +k(z^a_{L}-z)\partial {}_{z^a_{L}} +d_a\sum _{i=1}^{L}(z^a_i-1)\partial {}_{z^a_{i}} \\ +r_p\sum _{i=1}^{L}(z^a_{i}-z^p_{i})\partial {}_{z^a_{i}} +r_a\sum _{i=1}^{L}(z^p_{i}-z^a_{i})\partial {}_{z^p_{i}} +d_p\sum _{i=1}^{L}(z^p_i-1)\partial {}_{z^p_{i}} \end{aligned} \end{aligned}$$

(I.3)

is a differential operator acting on the functions $F_{0}$ and $F_{1}$. Eq. (I.2) represents a system of coupled, linear, first-order PDEs. Now, we introduce new variables $u^a_i=z^a_i-1$, $u^p_i=z^p_i-1$, and $u=z-1$; we also rescale all rates and the time variable with the degradation rate $d_m$ of mature RNA. Next, we apply the method of characteristics, with $s$ being the characteristic variable. The first characteristic equation gives $d_m(\hbox {d}t/\hbox {d}s)=1$, with solution $s\equiv {}d_mt$; hence, we can use the variable $t'=d_mt$ as the independent characteristic variable and thus convert the system of PDEs in Eq. (I.2) into a characteristic system of ODEs:

$$\begin{aligned} {\dot{u}}^a_i&=(k/d_m)[(u^a_i-u^a_{i+1})+(d_a/k)u^a_i+(r_p/k)(u^a_i-u^p_i)]&\qquad&\text {for }i=1,\dots {},L-1, \end{aligned}$$

(I.4a)

$$\begin{aligned} {\dot{u}}^a_L&=(k/d_m)[(u^a_L-u)+(d_a/k)u^a_L+(r_p/k)(u^a_L-u^p_L)], \end{aligned}$$

(I.4b)

$$\begin{aligned} {\dot{u}}^p_i&=(r_a/d_m)[(u^p_i-u^a_i)+(d_p/r_a)u^p_i]&\qquad&\text {for }i=1,\dots {},L, \end{aligned}$$

(I.4c)

$$\begin{aligned} {\dot{u}}&=u, \end{aligned}$$

(I.4d)

$$\begin{aligned} {\dot{F}}_{0}&=(s_u/d_m)F_{1}-(s_b/d_m)F_{0}+(r/d_m) u^a_1F_{0},&\end{aligned}$$

(I.4e)

$$\begin{aligned} {\dot{F}}_{1}&=(s_b/d_m)F_{0}-(s_u/d_m)F_{1},&\end{aligned}$$

(I.4f)

where the overdot denotes differentiation with respect to $t'$. Here, we assume that $k/d_m\gg {}1$ and $r_a/d_m\gg {}1$; hence, we define $\varepsilon = d_m/k$ as the singular perturbation parameter, and we write $d_m/r_a=\varepsilon \delta $, where $\delta =k/r_a=\mathcal {O}(1)$ by assumption. Since $0<\varepsilon \ll {}1$ is small, we can apply GSPT in order to separate the system in Eq. (I.4) into fast and slow dynamics, which will allow us to find an asymptotic approximation for $F_{0}$ and $F_{1}$ in steady state. With the above definitions, the governing equations for $u^a_i$ and $u^p_i$ in the ‘slow system’ in Eqs. (I.4a) through (I.4c) become

$$\begin{aligned} \varepsilon {\dot{u}}^a_i&=(u^a_i-u^a_{i+1})+(d_a/k)u^a_i+(r_p/k)(u^a_i-u^p_i)&\qquad&\text {for }i=1,\dots {},L-1, \end{aligned}$$

(I.5a)

$$\begin{aligned} \varepsilon {\dot{u}}^a_L&=(u^a_L-u)+(d_a/k)u^a_L+(r_p/k)(u^a_L-u^p_L),&\end{aligned}$$

(I.5b)

$$\begin{aligned} \varepsilon {\dot{u}}^p_i&=[(u^p_i-u^a_i)+(d_p/r_a)u^p_i]/\delta&\qquad&\text {for }i=1,\dots {},L. \end{aligned}$$

(I.5c)
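Since $\varepsilon $ multiplies every derivative in Eq. (I.5), the leading-order limit $\varepsilon =0$ is a purely algebraic, linear system that can be solved by back-substitution. As a sketch of that step, with the convention $u^a_{L+1}:=u$ introduced here only for brevity, Eq. (I.5c) gives the first relation below, and inserting it into Eqs. (I.5a) and (I.5b) gives the second:

$$\begin{aligned} u^p_i&=\frac{r_a}{r_a+d_p}\,u^a_i,&\qquad u^a_{i+1}&=\Big [1+\frac{d_a}{k}+\frac{r_p}{k}\,\frac{d_p}{r_a+d_p}\Big ]u^a_i&\qquad&\text {for }i=1,\dots {},L. \end{aligned}$$

Each segment therefore contributes one factor $k/[k+d_a+r_pd_p/(r_a+d_p)]$ when $u^a_i$ is expressed through $u^a_{i+1}$, and iterating from $i=L$ down to $i=1$ produces the $L$-th power of that factor. Reading $d_p/(r_a+d_p)$ as the quantity $\pi _{d_p}$ is an identification inferred from Eq. (I.5c), not a quotation of Proposition 3.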

It follows that $u^a_i$ and $u^p_i$ ($i=1,\dots ,L$) are the fast variables in our system, while $u$, $F_{0}$, and $F_{1}$ are the slow ones; see ‘Appendix E’. Setting $\varepsilon =0$ and solving the system in Eq. (I.5), we find $u^a_1=\tilde{\mu {}}^L\cdot {}u$, where ${\tilde{\mu }}=k/(k+d_a+r_p\pi _{d_p})$ has previously been defined in Proposition 3. Now, given Eq. (I.4d), we apply the chain rule, $\hbox {d}/\hbox {d}t'=u\,\hbox {d}/\hbox {d}u$, to rewrite Eqs. (I.4e) and (I.4f) as:

$$\begin{aligned} F'_0d_mu&=s_uF_1-s_bF_0+r {\tilde{\mu }}^L uF_0, \end{aligned}$$

(I.6a)

$$\begin{aligned} F'_1d_mu&=s_bF_0-s_uF_1, \end{aligned}$$

(I.6b)

where the prime now denotes differentiation with respect to u. The system in Eq. (I.6) is the same as that in Eq. (28), with the substitution $\mu \mapsto {}{\tilde{\mu }}$; hence, following the same derivation as in Sect. 3.2, we conclude that the steady-state analytical expression for the probability distribution of mature RNA is given by

$$\begin{aligned} P(n)=\dfrac{1}{n!}\dfrac{(s_u/d_m)_n}{((s_b+s_u)/d_m)_n}\Big (\dfrac{r}{d_m}\Big )^n({\tilde{\mu }}^{L})^n{}_1F_1\Big (\dfrac{s_u}{d_m}+n;\dfrac{s_b+s_u}{d_m}+n;-\dfrac{r}{d_m}{\tilde{\mu }}^L\Big ). \end{aligned}$$

(I.7)
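Eq. (I.7) can be evaluated numerically with the standard library alone. The sketch below is one possible way to do so and is not taken from the paper: the function names are hypothetical, all rates are measured in units of $d_m$ (so the Pochhammer symbols take the arguments $s_u/d_m$ and $(s_b+s_u)/d_m$), and the identification $\pi _{d_p}=d_p/(r_a+d_p)$ is an assumption inferred from Eq. (I.5c). To avoid cancellation in the alternating series at negative argument, it uses Kummer's transformation ${}_1F_1(a;b;-x)=e^{-x}{}_1F_1(b-a;b;x)$, which leaves a series of non-negative terms.

```python
import math

def mu_tilde(k, d_a, r_p, r_a, d_p):
    """Per-segment factor k / (k + d_a + r_p * pi_dp).

    Assumption: pi_dp = d_p / (r_a + d_p), inferred from Eq. (I.5c) at epsilon = 0.
    """
    pi_dp = d_p / (r_a + d_p)
    return k / (k + d_a + r_p * pi_dp)

def hyp1f1_series(a, b, x, tol=1e-16, max_terms=100_000):
    """Power series of 1F1(a; b; x), intended for a >= 0, b > 0, x >= 0."""
    term = total = 1.0
    for j in range(max_terms):
        term *= (a + j) / (b + j) * x / (j + 1)
        total += term
        if term <= tol * total:
            break
    return total

def mature_rna_pmf(n_max, s_u, s_b, r, d_m, mu_L):
    """Return [P(0), ..., P(n_max)] from Eq. (I.7), with mu_L = mu_tilde**L."""
    alpha, beta, rho = s_u / d_m, s_b / d_m, r * mu_L / d_m
    # weight = exp(-rho) * rho^n / n! * (alpha)_n / (alpha + beta)_n, updated recursively.
    weight = math.exp(-rho)
    pmf = []
    for n in range(n_max + 1):
        # Kummer: 1F1(alpha+n; alpha+beta+n; -rho) = exp(-rho) * 1F1(beta; alpha+beta+n; rho)
        pmf.append(weight * hyp1f1_series(beta, alpha + beta + n, rho))
        weight *= rho / (n + 1) * (alpha + n) / (alpha + beta + n)
    return pmf

if __name__ == "__main__":
    # Illustrative (hypothetical) parameter values with k/d_m >> 1 and r_a/d_m >> 1.
    mu_L = mu_tilde(k=50.0, d_a=0.1, r_p=0.5, r_a=50.0, d_p=0.2) ** 10
    pmf = mature_rna_pmf(200, s_u=0.5, s_b=1.0, r=20.0, d_m=1.0, mu_L=mu_L)
    print(sum(pmf), sum(n * p for n, p in enumerate(pmf)))
```

Two quick sanity checks on the output are that the probabilities sum to one once `n_max` covers the tail, and that the mean equals $(r/d_m)\,{\tilde{\mu }}^L\,s_u/(s_u+s_b)$, as follows from differentiating the generating function ${}_1F_1\big (s_u/d_m;(s_b+s_u)/d_m;(r/d_m){\tilde{\mu }}^Lu\big )$ at $u=0$.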

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Filatova, T., Popovic, N. & Grima, R. Statistics of Nascent and Mature RNA Fluctuations in a Stochastic Model of Transcriptional Initiation, Elongation, Pausing, and Termination. Bull Math Biol 83, 3 (2021). https://doi.org/10.1007/s11538-020-00827-7

Received: 13 May 2020
Accepted: 26 October 2020
Published: 22 December 2020
DOI: https://doi.org/10.1007/s11538-020-00827-7

Species	Molecule numbers	Position (in \(\vec {m}\))
\(G_\mathrm{on}\)	\(n_0\)	1
\(P_i,\quad i\in {}\{1,\dots {},L\}\)	\(n_i\)	\(i+1\)
M	n	\(L+2\)

Reaction	Rate function \(f_j\)
\(G_\mathrm{on} {\mathop {\longrightarrow }\limits ^{s_b}}G_\mathrm{off}\)	\(f_1=s_b \langle n_0 \rangle \)
\(G_\mathrm{off}{\mathop {\longrightarrow }\limits ^{s_u}}G_\mathrm{on}\)	\(f_2=s_u (1 - \langle n_0 \rangle )\)
\(G_\mathrm{on} {\mathop {\longrightarrow }\limits ^{r}}G_\mathrm{on}+P_1\)	\(f_3=r\langle n_0 \rangle \)
\(P_i {\mathop {\longrightarrow }\limits ^{k}}P_{i+1}\), \(i\in {}\{1,\dots {},L-1\}\)	\(f_{i+3}=k \langle n_i \rangle \)
\(P_L {\mathop {\longrightarrow }\limits ^{k}}M\)	\(f_{L+3}=k \langle n_L \rangle \)
\(P_i {\mathop {\longrightarrow }\limits ^{d}}\varnothing \), \(i\in {}\{1,\dots {},L\}\)	\(f_{i+L+3}=d \langle n_i \rangle \)
\(M {\mathop {\longrightarrow }\limits ^{d_m}}\varnothing \)	\(f_{2L+4}=d_m \langle n \rangle \)

Species	Molecule numbers	Position (in \(\vec {m}\))
\(G_\mathrm{on}\)	\(n_0\)	1
\(P_i\), \(i\in {}\{1,\dots {},L\}\)	\(n^a_i\)	\(i+1\)
\(\bar{P}_i\), \(i\in {}\{1,\dots {},L\}\)	\(n^p_i\)	\(i+L+1\)
M	n	\(2L+2\)

Reaction	Rate function \(f_j\)
\(G_\mathrm{on} {\mathop {\longrightarrow }\limits ^{s_b}}G_\mathrm{off}\)	\(f_1=s_b \langle n_0 \rangle \)
\(G_\mathrm{off}{\mathop {\longrightarrow }\limits ^{s_u}}G_\mathrm{on}\)	\(f_2=s_u (1 - \langle n_0 \rangle )\)
\(G_\mathrm{on} {\mathop {\longrightarrow }\limits ^{r}}G_\mathrm{on}+P_1\)	\(f_3=r\langle n_0 \rangle \)
\(P_i {\mathop {\longrightarrow }\limits ^{k}}P_{i+1}\), \(i\in {}\{1,\dots {},L-1\}\)	\(f_{i+3}=k \langle n^a_i \rangle \)
\(P_L {\mathop {\longrightarrow }\limits ^{k}}M\)	\(f_{L+3}=k \langle n^a_L \rangle \)
\(P_i {\mathop {\longrightarrow }\limits ^{d_a}}\varnothing \), \(i\in {}\{1,\dots {},L\}\)	\(f_{i+L+3}=d_a \langle n^a_i \rangle \)
\(P_i {\mathop {\longrightarrow }\limits ^{r_p}}\bar{P}_i\), \(i\in {}\{1,\dots {},L\}\)	\(f_{i+2L+3}=r_p \langle n^a_i \rangle \)
\(\bar{P}_i {\mathop {\longrightarrow }\limits ^{r_a}}P_i\), \(i\in {}\{1,\dots {},L\}\)	\(f_{i+3L+3}=r_a \langle n^p_i \rangle \)
\(\bar{P}_i {\mathop {\longrightarrow }\limits ^{d_p}}\varnothing \), \(i\in {}\{1,\dots {},L\}\)	\(f_{i+4L+3}=d_p \langle n^p_i \rangle \)
\(M {\mathop {\longrightarrow }\limits ^{d_m}}\varnothing \)	\(f_{5L+4}=d_m \langle n \rangle \)
