Abstract
Fluctuations in the initiation rate of transcription, the first step in gene expression, ensue from the stochastic behavior of the molecular process that controls transcription. In steady state, the regulatory process is often assumed to operate reversibly, i.e., in equilibrium. However, reversibility imposes fundamental limits to information processing. For instance, the assumption of equilibrium is difficult to square with the precision with which the regulatory process executes its task in eukaryotes. Here we provide evidence — from microscopic analyses of the transcription dynamics at a single gene copy of yeast — that the regulatory process for transcription is cyclic and irreversible (out of equilibrium). The necessary coupling to reservoirs of free energy occurs via sequence-specific transcriptional activators and the recruitment, in part, of ATP-dependent chromatin remodelers. Our findings may help explain how eukaryotic cells reconcile the dual but opposing requirements for fast regulatory kinetics and high regulatory specificity.
Similar content being viewed by others
Introduction
Eukaryotic transcriptional activators promote the transcription of specific genes by virtue of their ability to bind gene-specific DNA sequences (enhancers) and recruit a host of proteins that either pertain to the transcription machinery or facilitate its assembly at the transcription start site, including ATP-dependent chromatin remodelers1,2. It is generally assumed that the transcriptional output of a gene is determined by the equilibrium occupancy of its enhancers by activators3,4,5.
Activators find their target sequences by trial and error and thus bind DNA also non-specifically. For eukaryotic activators, the energetic differences between specific and non-specific sequence binding are surprisingly small6,7,8, small enough to be easily erased by the abundance of non-specific activators. How the eukaryotic cell solves this specificity problem is unknown.
Specific binding events differ from non-specific ones by a statistically longer residence time of the activator on the DNA8. A potential solution to the specificity problem, therefore, is to respond to activator residence time rather than occupancy, a feat that might be accomplished by kinetically proofreading activator identity9,10. The kinetic proofreading mechanism increases specificity by exploiting the kinetic difference between specific and non-specific DNA binding twice, before and after activator-dependent removal of a transcriptional repressor which delays advance to the transcriptionally active state10. Repressor removal requires energy, which is provided not by the surrounding heat bath but by an external work reservoir, avoiding the entropic cost of converting heat into work10,11,12. The expended free energy is eventually dissipated into the heat bath as the gene returns to its thermodynamically preferred repressed state, closing an irreversible reaction cycle where all microscopic paths in the same direction of rotation, say clockwise, are more probable than their time-reversed, anticlockwise counterparts (cf. Figure 1a). The gene, thus, is driven in and out of states that support transcription. As a consequence, anticlockwise paths, i.e., paths that bypass the proofreading delay step, are discouraged.
The thermodynamics of activator-DNA binding imposes an upper bound to the specificity of transcriptional regulation. In equilibrium, the regulatory process approaches this bound as the activator concentration tends to zero; the time between transcription events, then, tends to infinity; the regulatory process becomes slow and increasingly erratic10. Away from equilibrium, kinetic proofreading permits regulatory specificities close to the upper bound of specificity, including values above that bound, at activator concentrations that afford fast regulatory kinetics and low transcription noise10.
The nucleosome appears to be ideally suited to assume the task of repressor in the kinetic proofreading scheme (nucleosomal proofreading hypothesis)10. The nucleosome is a ubiquitous repressor of nuclear gene transcription13. Promoter nucleosomes evidently partake in the random dynamics of the regulatory mechanism for transcription, for genes assume alternative promoter nucleosome configurations in activating conditions, including the nucleosome-free and fully nucleosomal promoter14. Gene regulation entails the unwinding of promoter nucleosomes15. Nucleosome unwinding, for most DNA sequences, requires a significant amount of work, \({\Delta {G}}^{{\circ} { \,\,\, {\prime}}}\, \approx\, 44\,{k}_{{{{\rm{B}}}}}T\) per nucleosome on the assumption that nucleosomes were unwound reversibly16. Thermodynamics, therefore, favors nucleosome formation over a wide range of histone concentrations. In this event, nucleosome unwinding elevates the system’s free energy, while rewinding dissipates this energy into the surrounding heat bath10,11. There is little doubt that nucleosomes are unwound by ATP-dependent chromatin remodelers that are recruited to promoters by activators17,18,19,20.
It is unknown, however, whether the nucleosome dynamics of transcriptionally induced promoters are driven away from equilibrium in this or any other way. Both the wrapping and unwrapping of nucleosomes are catalyzed by ATP-dependent chromatin remodelers21. ATP may be expended to overcome high energy barriers and thus accelerate the approach to equilibrium rather than maintain a non-equilibrium steady-state3.
Most data appear to conform to the expectations of a regulatory process in equilibrium3. It is possible, however, that this conformity is the result of inevitable, and often also deliberate, coarse-graining. Most microstates of a system remain indistinguishable to the observer who therefore lumps them together into observable mesostates. Observational coarse-graining may easily generate a false impression of reversibility22. For instance, an irreversible (nonequilibrium) steady-state process requires that its reaction network (graph) is cyclic; when coarse-graining reduces the cyclic to an acyclic graph, the steady-state process becomes reversible11,23. (Thus, by virtue of its non-cyclic transition graph, the standard two-state random-telegraph model of gene regulation24 implies equilibrium dynamics.) Furthermore, predicted differences between reversible and irreversible processes are often subtle or otherwise difficult to measure5.
Here we provide evidence that the regulatory process of PHO5 transcription in yeast, a classic paradigm for the analysis of transcriptional regulation by chromatin structure25, breaches a barrier imposed by the detailed balance conditions for equilibrium.
Results
The biological model
Transcriptional activation of PHO5 requires binding of the basic helix-loop-helix factor Pho4 to its enhancer UASp126. Pho4 binding increases the probability of promoter states with fewer nucleosomes14, presumably by recruitment of ATP-dependent chromatin remodelers via its activation domain17. Loss of promoter nucleosomes allows Pho4 to access UASp2, a second, previously occluded enhancer27,28. At high media concentrations of orthophosphate, Pho4 is maintained in the cytoplasm; its target genes, therefore, remain inactive. However, in low phosphate or the absence of Pho80, a repressor of the phosphate signaling pathway, Pho4 accumulates in the nucleus, and PHO5 is expressed29. All strains analyzed for this study were pho80\(\Delta\).
Experimental setup
To observe transcription at single PHO5 copies in vivo at the high temporal resolution, we built a multifocus fluorescence microscope (MFM), which allowed us to simultaneously acquire images at multiple focal planes, removing the spatiotemporal uncertainties associated with successive sampling of z-stacks30. PHO5 transcripts were labeled by the insertion of 14 binding sequences for the RNA-binding coat protein of the PP7 phage into the 5’-untranslated region of PHO5 (Fig. 1b)31. The coat protein was expressed as a fusion with the green fluorescent protein under control of the RPS2 promoter (Fig. 1b). We acquired focal stacks — seven images spaced \(500{nm}\) apart along the optical axis — every \(2.5s\) over a total period of \(1250s\) (Fig. 1c).
To delineate periods of transcriptional activity (ON) and inactivity (OFF), we employed change point detection analysis (Fig. 1b, c)32. This delineation is part of the observational coarse-graining. We call periods whose length is known because they began and ended within the observational time window internal, and periods of unknown length because they either began or ended or began and ended outside the window of observation external. The length \(T\) of ON and OFF periods is a random variable. We record the statistical distribution of \(T\) in terms of its survival curve, which indicates the fraction (probability) of periods that are still alive at time \(t\), \(P(T\, > \,t)\), and the corresponding (probability) density of \(T\), i.e., the time-derivative of \(P(T\, \le \, t)=1-P(T\, > \,t)\).
Dwell-time denisty functions are peaked
On the assumption that the observed dynamics can be modeled as a memory-free stochastic process (Markov process) that transitions between two microstates, ON and OFF (random telegraph model)24, it is expected that survival curves fit an exponential decay function (cf. SI, Corollary 1). However, we found that the survival curve for ON periods was sigmoidal rather than exponential (Fig. 2a). Although the observed dynamics were non-Markovian (i.e., endowed with memory), the survival curve was well fit by a biexponential function, suggesting that the observed dynamics may be modeled by a coarse-grained hidden Markov process with three microstates: two ON microstates, which are not distinguished by the observer and thus are combined into a single ON mesostate, and one OFF microstate (cf. SI, Theorem 1). (The limitation to one OFF microstate does not limit the generality of our conclusions, for the internal microstructure of the OFF state is irrelevant for the calculation of the ON dwell-time distribution, cf. SI). A biexponential survival curve implies a biexponential density function, \(f(t)={c}_{1}{e}^{{\,\lambda }_{1}t}+{c}_{2}{e}^{{\,\lambda }_{2}t}\), where \({\lambda }_{1},{\lambda }_{2} < 0\) and \({c}_{1},{c}_{2}\) are constants — three of which (\({\lambda }_{1},\,{\lambda }_{2}\) and \({c}_{1}\)) may be independently chosen for fitting to the data (cf. “Methods”). Notably, the best biexponential fit suggested \(f(0)={c}_{1}+{c}_{2}=0\) (cf. Fig. 2a). The density function, thus, must exhibit a local maximum, i.e., the density is non-monotonic or peaked (cf. Fig. 2a, insert). For \(f(0)=0\), we say the density is maximally peaked.
The observed process is irreversible
The finding of a peaked dwell-time density has far-reaching topological and thermodynamics implications. First, dwell-time densities for processes on graphs with three states that are strongly connected — i.e., every state is connected to any other by a string of directed edges — are always monotonic (i.e., \({c}_{1},{c}_{2}\ge 0\)), unless the reaction graph is cyclic (cf. SI, Theorem 2). The observed dynamics, therefore, must be modeled on a cyclic graph (cf. Fig. 3a). Second, dwell-time densities are always monotonic unless the process is irreversible — this is Tu’s theorem33 (cf. SI, Theorem 4). The observed process, therefore, was irreversible. Consistently, the only 3-state model that allows for a maximally peaked ON dwell-time density is the random clock model of Fig. 3a with \(\alpha=0\) (cf. SI, Theorem 3): the process strictly enters the ON state via one of its microstates and exits from the other; the reverse sequence is excluded. (This boundary case can be approached only asymptotically, for the definitive exclusion of reverse transitions violates the principle of microscopic reversibility and would require infinite entropy production, cf. Figure 3b.)
That a strongly peaked dwell-time density is indicative of a strongly irreversible process may be understood intuitively: The statistical preference for leaving the (observed) ON state via microstate 2 (cf. Fig. 3a) and not 1 (the microstate of entry into ON) delays exit from the ON state and thus reduces the probability of shorter ON periods.
In our 3-state clock model, a gradual decrease in the entropy production per cycle (increasing \(\alpha\) in Fig. 3b), and thus a decrease in the statistical imbalance between forward and reverse paths, shifts the density at \(t=0\) from zero to larger values until it reaches a maximum in equilibrium (Fig. 3c). The density, thus, is gradually transformed from a maximally peaked function (\(f(0)\, \le \,f(t)\) for all \(t \, > \, 0\)) to a monotonic, non-peaked, function (\(f(0)\, > \, f(t)\) for all \(t\, > \, 0\); Fig. 3c). In equilibrium, when no entropy is produced, and forward and reverse direction of any stochastic path are equally probable34, the dwell-time density is strictly monotonic, in accordance with Tu’s theorem (Fig. 3b, c). The monotonicity of the dwell-time density constitutes a Hopfield barrier, a bound that may be breached only at the expense of free energy, i.e., away from equilibrium35.
The distinction between sigmoidal and non-sigmoidal survival curves was subtle (cf. Supplementary Fig. 1). However, our conclusion of a peaked dwell-time density remained robust under perturbation: \({c}_{1}\) remained negative for all densities generated by bootstrapping (random sampling with replacement); only a small number of densities (< 2%) had peaks close enough to \(t=0\) such that the density appeared monotonic; the remaining densities (> 98%) were manifestly peaked (Supplementary Fig. 2a).
With \(f(0)=0\), the best-fit biexponential density for the internal OFF period length was likewise maximally peaked (Supplementary Fig. 3), suggesting that both entry into and exit from the OFF state were strongly irreversible — as implied by our analysis of ON lengths — and therefore represented not forward and reverse path of the same reaction but the forward paths of two different reactions. This conclusion, evidently, depends on the ability to decompose ON or OFF into multiple microstates. The existence of two rather than one microstate was less evident for the OFF than the ON state: although \({c}_{1}\) was negative for 98% of bootstrap densities, nearly 50% of biexponential fits peaked close to \(t=0\) and thus were difficult to distinguish from the exponential function.
The regulatory process dissipates free energy
The observed process is a complicated convolution of multiple processes: the regulatory process for transcription, subordinate molecular processes, such as polymerization and nuclear export of the RNA, the RNA-binding dynamics of the fluorescent protein, and the physical processes within the instruments of observation — the microscope, CCD camera, laser, etc. It may be asked, therefore, whether the irreversibility of the observed process was due to energy-consuming processes other than the regulatory process mechanism for transcription.
To test this latter hypothesis, we deleted parts of the activation domain of Pho4, the central, gene-specific component of the regulatory mechanism. The deletion reduced PHO5 expression by about 80% and markedly increased promoter nucleosome occupancy relative to wild type (cf. Supplementary Fig. 4)14,36. According to the nucleosomal proofreading hypothesis, the activator mutation disrupts the energy supply line for the regulatory process by weakening the activator’s ability to recruit ATP-dependent chromatin remodelers to the promoter; the process relaxes, therefore, into a new steady state closer to equilibrium, where nucleosome removal consumes, and reformation dissipates, less free energy per cycle, on average. We found that the activator mutation transformed the maximally peaked into a non-peaked distribution (Fig. 2d), as expected for the transformation of a strongly irreversible into a weakly irreversible regulatory process (cf. Fig. 3c and b). Alternative explanations must invoke unexpected pleiotropic effects — e.g., that pho4 mutations somehow affect the energetics of RNA polymerization.
The ON dwell-time densities for the activator mutant and wild type were clearly distinct: While the densities of 90% of bootstrap replicates for the mutant were monotonic or weakly peaked, only 1.3% of bootstrap densities for the wild type were monotonic, the remainder were strongly or maximally peaked (Supplementary Fig. 2a, b).
Atp-Dependent nucleosome removal drives the regulatory process away from equilibrium
Loss of the ATP-dependent chromatin remodelers Chd1 and Isw2 reduced Pho5 activity by about 30% and 50%, respectively, as shown by phosphatase activity measurements. Electron microscopy analysis of PHO5 molecules isolated from chd1\(\Delta\) cells showed a concomitant increase in promoter nucleosome occupancy relative to wild type (Supplementary Fig. 4). For both remodeler mutants, the density for the internal ON period length was again peaked. Fewer than 1% of bootstrap densities were apparently monotonic. However, for both remodeler mutants, we found \({f(0)=c}_{1}+{c}_{2}\, > \, 0\) (for the best-biexponential fit) and not \({c}_{1}+{c}_{2}=0\); the densities were strongly but not maximally peaked, corresponding to a process that dissipated less free energy (produced less entropy) per cycle than the wild type (cf. Fig. 3c and d). That nucleosome occupancy increased with decreasing entropy production was as expected if nucleosome removal, and not reformation, drives the regulatory process away from equilibrium, i.e., the probability current flowed in the correct direction for proofreading (clockwise in Fig. 1a). We note however that values for \({c}_{1}+{c}_{2}\) varied widely among bootstrap replicates (cf. legend to Fig. 2).
Activator and chromatin remodelers control the length of external but not internal off periods
All mutations analyzed had little effect on the average length of internal ON periods and, surprisingly, also internal OFF periods (cf. Figs. 2, 4, and Supplementary Fig. 3): ON periods became shorter and OFF periods longer by about 15% in all mutants compared to wild type. The main effect of activator and remodeler mutations became evident upon analysis of external OFF periods. The length of external OFF periods, \(T\), was exponentially distributed (Fig. 4). \(T\) could be modeled, therefore, as a sojourn of the process in a single, additional microstate (Fig. 3d). Although \(T\) could not be known for individual periods, the lack-of-memory property of the exponential distribution nevertheless allowed us to determine mean period lengths37. On average, external OFF periods were significantly longer than internal OFF periods; the process either left the ON state for a short-lived OFF state or a long-lived OFF state (Fig. 3d). Since the lifetime of long-lived OFF states was also long relative to the length of the observational time window, the process seldom returned to the ON state within the observational time window after leaving the ON state for the long-lived OFF state. Exit from the long-lived OFF microstate was promoted by both activator and chromatin remodelers, for external OFF periods markedly increased in length in all three mutants compared to the wild type in order of decreasing PHO5 expression (cf. Fig. 4).
The regulatory mechanism controls both frequency and length of burst runs
Individual ON periods, in all likelihood, corresponded to bursts of transcription with multiple nascent transcripts. Genes with fewer than three transcripts may not have been observable (Supplementary Fig. 5). Except for the activator mutant, ON periods were more probably succeeded by short-lived rather than long-lived OFF periods. Transcription events, thus, were two-fold clustered: into bursts (ON periods) and bursts of bursts (burst runs). The sequence-specific activator and chromatin remodelers controlled the frequency of burst runs alone, not the frequency of bursting within runs, the frequency of transcription initiation within bursts, or the rate of transcription itself. Furthermore, the probability that an (internal) ON period was followed by a long-lived and not short-lived OFF period increased nearly two-fold in the pho4 mutant, from 0.39 (wild type) to 0.65, indicating that the activator controlled the length of burst runs. The activator, thus, exerted control over the rate of transcription initiation in more than one way. In contrast, the loss of Chd1 and Isw2 had little effect on the length of burst runs. The finding that chromatin remodelers affected a subset of steps affected by the activator and not others is in keeping with the notion that activators recruit chromatin remodelers to genes18,19, but also control subsequent steps toward transcription by recruitment of additional factors, e.g., Mediator38,39. Both assumptions are critical for the theory of nucleosomal proofreading of activator identity10.
Discussion
The main finding of our analysis is that the regulatory process of transcription initiation operates away from equilibrium. Our analysis detected a telltale of irreversibility: peaked dwell-time densities for observational states33. Peaked dwell-time densities have been reported earlier40,41; however, the topological and thermodynamic implications went unnoticed, and the question of whether the regulatory process was irreversible was, therefore, not addressed.
A mutation in the central component of the regulatory mechanism for transcription transformed the observed process from a strongly irreversible into a weakly irreversible process, suggesting that the observed process was irreversible not because RNA synthesis and the instruments of observation dissipate free energy (which they do) but because the regulatory process was irreversible.
The observed ON-OFF dynamics are the result of extensive coarse-graining, which may easily transform an irreversible into a reversible process (cf. Introduction). It may be asked, therefore, whether the reverse is possible too, i.e., whether coarse-graining may convert a reversible into an irreversible process? The answer, arguably, is no, for it can be proved that the entropy production of any coarse-grained version of a stochastic process provides a lower bound on the true entropy production (cf. SI, Theorem 5)22,42.
When fit to the data, the hidden Markov process provides a model for the observed dynamics, not the underlying molecular dynamics. Additional microstates were invoked to imbed the observed non-Markovian process into a Markov process. The identification of model states with particular microstates of the underlying molecular process is, therefore, entirely speculative. The OFF state evidently encompasses numerous microstates, and so does the ON state — e.g., PHO5 transcription may be driven by alternative activator-enhancer configurations. It is not evident how to interpret (in this way) the finding that the process when out of equilibrium, visits ON microstates in a defined order.
Our analysis was undertaken to test a critical prediction of the kinetic proofreading hypothesis for activator identity: that the regulatory process for transcription initiation is irreversible. The hypothesis passed its test. The alternative conclusion — that the regulatory process is reversible — would have falsified the proofreading hypothesis in any form. Instead, our findings (if accepted) refute the equilibrium assumption and suggest that the regulatory process is driven away from equilibrium. The driving, it appears, is at least partially due to ATP-dependent chromatin remodelers, as posited by the nucleosomal proofreading hypothesis.
Our results have no direct bearing on the question of whether the regulatory dissipation of free energy improves the specificity of transcriptional regulation. Other problems of information processing may be solved away from equilibrium3,35,43. Also, the possibility of free energy expenditure for no physiological purpose whatsoever is not excluded. However, if the nucleosome dynamics of transcriptionally induced, but not uninduced, promoters are out of equilibrium, as suggested by our analysis, the conclusion that the structural dynamics of promoter chromatin improve activator specificity follows with little effort7,10.
The problem of regulatory specificity may not be equally acute for all genes. Housekeeping genes, genes that are active at all times, may rely on promiscuous activator control or, perhaps, no activators at all. Such genes have no need for activator proofreading. Their promoters may be, therefore, almost always nucleosome-free. Whether such promoters exist is currently unknown. Constitutive lack of promoter nucleosomes was implicitly invoked as part of an explanation for sub-Poissonian transcription noise43, i.e., noise below the expectation for a random birth-and-death (Poisson) process35. (In contrast, the noise of PHO5 transcription markedly exceeded the expectation of a Poisson process.) We note that sub-Poissonian RNA noise also requires free energy expenditure — arguably to drive promoter state transitions and thus narrow the distribution of interarrival times between transcription events43.
Methods
Strains
All strains for live cell fluorescent microscopy were derived from yM8.14, which contains sequence insertions upstream and downstream of the PHO5 gene for enzymatic release of the PHO5 gene from its chromosomal locus and lacks the PHO80 gene, which was replaced with HIS3. We inserted the PP7 binding sequence clusters into the PHO5 locus via homologous recombination. To this end, the cluster, along with a selection marker (KanMX) flanked by loxP recombination sequences, was PCR amplified from plasmid pTL31 using primers p446 (5’-CTTCATCTCTCATGAGAATAAGAACAACAACAAATAGAGCAAGCAAATTCGAGATTACCAcaaagtgggagcgaggagatcc-3’) and p447 (5’-AATGGTACCTGCATTGGCCAAAGAAGCGGCT AAAATTGAATAAACAACAGATTTAAACATgcataggccactagtggatctg-3’) whose 5’-ends are homologous to the PHO5 5’ UTR. This insertion yielded strain yM249.1. Subsequent removal of the KanMX marker was accomplished by transformation with pSH47, which bears the P recombinase gene, and induction of recombinase expression. This gave strain yM255.3. Finally, we inserted the gene for the PCP (PP7 coat protein)-GFP fusion together with the URA3 gene of Candida albicans into the ura3 locus of yM255.3. To this end, we transformed yM255.3 with pTL205 (a derivative of pSIVURA344) digested with PacI and selected for uracil prototrophy. The strain thus generated, yRS102, is called here wild type for its possession of wild type copies of CHD1, ISW2, and PHO4. Strains yRS99 (chd1\(\Delta :{{{\rm{KanMX}}}}\)) and yRS118 (isw2\(\Delta :{{{\rm{KanMX}}}}\)) were obtained by transformation of yM255.3 with pCM123 digested with XbaI and KpnI and pCM122 digested with XbaI and KpnI, respectively, and subsequent insertion of the PCP-GFP fusion gene into the ura3 locus, as described above. The activator mutant strain, yRS120, was derived from yM255.3 by first replacing PHO4 with URA3 via homologous recombination using pCM4.5 digested with BamHI and SalI. The resulting strain, yRS103, was transformed with pCM64.2 cut with BamHI and SalI to replace URA3 with pho4\(\Delta\)[75–90]. This yielded yRS113 upon selection for resistance to 5-fluoroorotic acid (5-FOA). We finally inserted the gene for the PCP-GFP fusion into the ura3 locus of yRS113, as described above. Cells were transformed using the lithium acetate method45.
Live-cell imaging
Yeast Cells were grown in \(50\) ml of liquid culture to \(3\)-\(5\times {10}^{7}\) cells/ml. Cells were then concentrated and spotted onto an agar patch on a #1.5 coverslip, as described elsewhere46. For coverslip preparation we used the mold described by ref. 47. Prior to imaging, cells were then incubated at \(30^\circ C\) for \(30\) minutes. A stage top incubator (OkoLab) was used to maintain a constant temperature of \(30^\circ C\) during imaging. MFM images were recorded with on an EMCCD camera (Andor Ixon Ultra). Images were taken every \(2.5\) second over \(20\) minutes with an exposure time of \(250{ms}\) and a laser power of 30% (Lambda XL). Excitation and emission paths were filtered using a Semrock GFP-30LP-B Brightline filter set. The MFM was assembled as described30,48 and appended to the light path of an existing inverted widefield microscopy chassis (Nikon Eclipse TI), which maintained a \(60\times\) objective (plano Apo 60X/1.4 oil immersion).
MFM raw data were analyzed in four steps: \((i)\) Identification of transcription sites. \(({ii})\) Assignment of transcription sites to nuclei. \(({iii})\) Tracking of each transcription site over time. \(({iv})\) Quantification of signal intensity at transcription site.
To identify transcription sites, we first calculated a maximum projection of the z-stack for each time point; the maximum projection minimizes fluctuations in puncta intensity due to RNA movement in the z-direction (i.e., along the optical axis) during imaging. We then applied a band-pass filter to maximum projections with bandwidth to match the width of the point spread function of the microscope (approximately \(1.5\) pixels); the band-pass filter reduces the rate of false positives due to background nuclear fluorescence caused by cytoplasmic RNA and unbound coat protein49,50. To find local maxima (i.e., candidate transcription sites), we applied a search window of \(4\) pixels and a threshold of five standard deviations above the mean filtered image signal. We assigned transcription sites to nuclei, using the background nuclear fluorescence to determine nuclear boundaries. We always identified the brightest site within a nuclear boundary as the site of transcription. Time series were generated by following individual transcription sites over time. When no candidate transcription site could be detected within a nucleus at a given time point, the location of the previously identified transcription site was used instead. In the case of no previous transcription site, we used the location of the brightest nuclear pixel instead. Finally, to quantify the signal intensity of transcription sites, we used a Gaussian mask algorithm49,50.
We corrected for photobleaching by detrending time series; a trend was inferred by regression analysis of the average time series49,51. Variation in nuclear intensity creates variations in background intensity. To correct for offsets of time series against each other along the y-axis (signal intensity), we subtracted from each time point the mode of the estimated probability function (kernel density estimate) of signal intensity of each time series. The mode was used because, unlike mean and median, it correctly reflects the background on the assumption that the most likely number of nascent transcripts is zero — an assumption borne out by our observations.
We assumed that sample paths (i.e., fluorescence signal traces) were step functions subject to Gaussian noise. Steps, time intervals of signal that appeared to be drawn from the same Gaussian distribution, were determined by change point detection (CPD) analysis32,49; we used a window-based search method together with a Gaussian cost function in the ruptures python package31. We observed essentially two types of steps: low-noise steps at lower signal (OFF steps) and high-noise steps at higher signal (ON steps). CPD hyperparameters (e.g., window size) were selected to reduce the number of sequential low-noise steps on the assumption that the background signal, the signal during OFF periods, is step-free. In contradistinction, the transcription signal, the signal during ON periods, may encompass multiple steps corresponding to different numbers of the nascent transcript. For wild type we analyzed 218 sample paths (cells); for chd1\(\Delta\), 205 cells; for isw2\(\Delta\), 291 cells, and for pho4\(\Delta\)[75–90], 158 cells.
Model fitting
The survival-function model, \(P\left(T\, > \,t\right)=-a{e}^{-{\lambda }_{1}t}+(1+a){e}^{-{\lambda }_{2}t}\), was fit to the experimental data using the FindFit function in Mathematica (Wolfram) with search parameters \(a,{\lambda }_{1},{\lambda }_{2}\), and variable \(t\)49. The corresponding density function is given by \(f(t)={c}_{1}{e}^{-{\lambda }_{1}t}+{c}_{2}{e}^{-{\lambda }_{2}t}\) with \({c}_{1}=-a{\lambda }_{1}\) and \({c}_{2}=(1+a){\lambda }_{2}\). The search algorithm was left unconstrained, which allowed for negative densities. When FindFit was constrained by \(-a{\lambda }_{1}+(1+a){\lambda }_{2}\, \ge \, 0\) to densities \(0\ge\) at \(t=0\), the parameters satisfied \({c}_{1}+{c}_{2}=0\) exactly for internal ON periods in the wild type. The constrained and unconstrained fits were essentially identical except for \(t=0\) and time points very close to it. Bootstrapping analyses were performed with an unconstrained Mathematica FindFit function49. For some bootstrap replicas, we found \({\lambda }_{1}\, > > \, {\lambda }_{2}\). The corresponding density functions, therefore, assumed either large negative or large positive values at \(t=0\). In these cases, the density function was virtually discontinuous at \(t=0\) and indistinguishable from the exponential density, \(f(t)={{\lambda }_{2}e}^{-{\lambda }_{2}t}\).
Single-molecule fish
Yeast cells were grown in liquid culture (\(50{ml}\)) to mid-log-phase, crosslinked with formaldehyde, lysed with lyticase, and adhered to poly-L-lysine coated coverslips, as previously described52,53. Coverslips were hybridized for 5 hours at 37 °C, with \(2.5{nM}\) of fluorescently labeled antisense-DNA probe against PCP binding sequences. Probes were labeled with Quasar 570 and Quasar 670 (Biosearch Technologies). For microscopy, coverslips were mounted onto glass slides with mounting media containing DAPI (4’, 6-diamidino-2-phenylindole; ProLong Gold, Life Technologies). Cytoplasmic and nascent RNA were identified by using background fluorescence and DAPI staining to estimate cellular and nuclear boundaries, respectively. The number of nascent RNAs was calculated by normalizing the transcription site intensity (nuclear punctum) with the average intensity of cytoplasmic transcripts. All analysis was accomplished using previously published code54.
Pho5 Isolation and Em analysis
Gene isolation and psoralen-EM analysis were performed as previously described55. Molecules were analyzed with custom-made programs written in Python. At least 200 PHO5 molecules were analyzed for each strain14.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
The MFM data of this study are available at https://datadryad.org/stash/share/p-nEcqmpD7aj4RUSfzuVmR2kZ_iBEvh1pwf80qBeO1Y.
Code availability
All computer codes used are available at https://doi.org/10.5281/zenodo.12208276.
References
Cosma, M. P. Ordered recruitment: Gene-specific mechanism of transcription activation. Mol. Cell 10, 227–236 (2002).
Ptashne, M. How eukaryotic transcriptional activators work. Nature 335, 683–689 (1988).
Wong, F. & Gunawardena, J. Gene regulation in and out of equilibrium. Annu. Rev. Biophys. 49, 199–226 (2020).
Lionnet, T. & Wu, C. Single-molecule tracking of transcription protein dynamics in living cells: seeing is believing, but what are we seeing? Curr. Opin. Genet Dev. 67, 94–102 (2021).
Bintu, L. et al. Transcriptional regulation by the numbers: models. Curr. Opin. Genet Dev. 15, 116–124 (2005).
Zoller, B., Gregor, T. & Tkačik, G. Eukaryotic gene regulation at equilibrium, or non? Curr. Opin. Syst. Biol. 31, 100435 (2022).
Boeger, H. The energetics of activator–promoter recognition. Curr. Opin. Syst. Biol. 31, 100434 (2022).
Geertz, M., Shore, D. & Maerkl, S. J. Massively parallel measurements of molecular interaction kinetics on a microfluidic platform. Proc. Natl. Acad. Sci. USA 109, 16540–16545 (2012).
Cepeda-Humerez, S. A., Rieckh, G. & Tkačik, G. Stochastic proofreading mechanism alleviates crosstalk in transcriptional regulation. Phys. Rev. Lett. 115, 248101 (2015).
Shelansky, R. & Boeger, H. Nucleosomal proofreading of activator–promoter interactions. Proc. Natl. Acad. Sci. USA 117, 2456–2461 (2020).
Boeger, H. Kinetic proofreading. Annual Rev. Biochem. 91, 423–447 (2022).
Hopfield, J. J. Kinetic proofreading: a new mechanism for reducing errors in biosynthetic processes requiring high specificity. Proc. Natl. Acad. Sci. USA 71, 4135–4139 (1974).
Kornberg, R. D. & Lorch, Y. Primary role of the nucleosome. Mol. Cell 79, 371–375 (2020).
Brown, C. R., Mao, C., Falkovskaia, E., Jurica, M. S. & Boeger, H. Linking stochastic fluctuations in chromatin structure and gene expression. PLoS Biol. 11, e1001621 (2013).
Boeger, H., Griesenbeck, J., Strattan, J. S. & Kornberg, R. D. Removal of promoter nucleosomes by disassembly rather than sliding in vivo. Mol. Cell 14, 667–673 (2004).
Mack, A. H., Schlingman, D. J., Ilagan, R. P., Regan, L. & Mochrie, S. G. J. Kinetics and thermodynamics of phenotype: unwinding and rewinding the nucleosome. J. Mol. Biol. 423, 687–701 (2012).
Brown, C. R., Mao, C., Falkovskaia, E., Law, J. K. & Boeger, H. In vivo role for the chromatin-remodeling enzyme SWI/SNF in the removal of promoter nucleosomes by disassembly rather than sliding. J. Biol. Chem. 286, 40556–40565 (2011).
Yudkovsky, N., Logie, C., Hahn, S. & Peterson, C. L. Recruitment of the SWI/SNF chromatin remodeling complex by transcriptional activators. Genes Dev. 13, 2369–2374 (1999).
Ferreira, M. E., Prochasson, P., Berndt, K. D., Workman, J. L. & Wright, A. P. Activator-binding domains of the SWI/SNF chromatin remodeling complex characterized in vitro are required for its recruitment to promoters in vivo. FEBS J. 276, 2557–2565 (2009).
Bartholomew, B. Regulating the chromatin landscape: structural and mechanistic perspectives. Annu. Rev. Biochem. 83, 671–696 (2014).
Haushalter, K. A. & Kadonaga, J. T. Chromatin assembly by DNA-translocating motors. Nat. Rev. Mol. Cell Biol. 4, 613–620 (2003).
Skinner, D. J. & Dunkel, J. Improved bounds on entropy production in living systems. Proc. Natl. Acad. Sci. USA 118, e2024300118 (2021).
Kelly, F. P. Reversibility and Stochastic Networks. (John Wiley and Sons Ltd, 1979).
Lammers, N. C., Kim, Y. J., Zhao, J. & Garcia, H. G. A matter of time: Using dynamics and theory to uncover mechanisms of transcriptional bursting. Curr. Opin. Cell Biol. 67, 147–157 (2020).
Korber, P. & Barbaric, S. The yeast PHO5 promoter: from single locus to systems biology of a paradigm for gene regulation through chromatin. Nucleic Acids Res. 42, 10888–10902 (2014).
Vogel, K., Horz, W. & Hinnen, A. The two positively acting regulatory proteins PHO2 and PHO4 physically interact with PHO5 upstream activation regions. Mol. Cell Biol. 9, 2050–2057 (1989).
Mao, C., Brown, C. R., Griesenbeck, J. & Boeger, H. Occlusion of regulatory sequences by promoter nucleosomes In Vivo. PLoS ONE 6, 2–11 (2011).
Venter, U., Svaren, J., Schmitz, J., Schmid, A. & Horz, W. A nucleosome precludes binding of the transcription factor Pho4 in vivo to a critical target site in the PHO5 promoter. Embo J. 13, 4848–4855 (1994).
O’Neill, E. M., Kaffman, A., Jolly, E. R. & O’Shea, E. K. Regulation of PHO4 nuclear localization by the PHO80-PHO85 cyclin-CDK complex. Science 271, 209–212 (1996).
Abrahamsson, S. et al. Fast multicolor 3D imaging using aberration-corrected multifocus microscopy. Nat. Methods 10, 60–63 (2013).
Larson, D. R., Zenklusen, D., Wu, B., Chao, J. A. & Singer, R. H. Real-time observation of transcription initiation and elongation on an endogenous yeast gene. Science 332, 475–478 (2011).
Truong, C., Oudre, L. & Vayatis, N. Selective review of offline change point detection methods. Signal Process. 167, 107299 (2020).
Tu, Y. The nonequilibrium mechanism for ultrasensitivity in a biological switch: Sensing by Maxwell’s demons. Proc. Natl. Acad. Sci. USA 105, 11737–11741 (2008).
Van den Broeck, C. Stochastic thermodynamics: A brief introduction. In Physics of Complex Colloids 155–193 (IOS Press, 2013).
Estrada, J., Wong, F., DePace, A. & Gunawardena, J. Information integration and energy expenditure in gene regulation. Cell 166, 234–244 (2016).
Mao, C. et al. Quantitative analysis of the transcription control mechanism. Mol. Syst. Biol. 6, 1–12 (2010).
Feller, W. An Introduction to Probability Theory and Its Applications. (John Wiley & Sons, 1968).
Sanborn, A. L. et al. Simple biochemical features underlie transcriptional activation domain diversity and dynamic, fuzzy binding to mediator. ELife 10, 1–77 (2021).
Kornberg, R. D. Mediator and the mechanism of transcriptional activation. Trends Biochem. Sci. 30, 235–239 (2005).
Suter, D. M. et al. Mammalian genes are transcribed with widely different bursting kinetics. Science 332, 472–474 (2011).
Tantale, K. et al. A single-molecule view of transcription reveals convoys of RNA polymerases and multi-scale bursting. Nat. Commun. 7, 12248 (2016).
Seifert, U. From stochastic thermodynamics to thermodynamic inference. Annu. Rev. Condens. Matter Phys. 10, 171–192 (2019).
Weidemann, D. E., Holehouse, J., Singh, A., Grima, R. & Hauf, S. The minimal intrinsic stochasticity of constitutively expressed eukaryotic genes is sub-Poissonian. Sci. Adv. 9, eadh5138 (2023).
Wosika, V. et al. New families of single integration vectors and gene tagging plasmids for genetic manipulations in budding yeast. Mol. Genet. Genomics 291, 2231–2240 (2016).
Boeger, H., Griesenbeck, J., Strattan, J. S. & Kornberg, R. D. Nucleosomes unfold completely at a transcriptionally active promoter. Mol. Cell 11, 1587–1598 (2003).
Zawadzki, K. & Broach, J. A rapid technique for the visualization of live immobilized yeast cells. J. Vis. Exp. 84 https://doi.org/10.3791/84 (2006).
Skinner, S. O., Sepúlveda, L. A., Xu, H. & Golding, I. Measuring mRNA copy number in individual Escherichia coli cells using single-molecule fluorescent in situ hybridization. Nat. Protoc. 8, 1100–1113 (2013).
Abrahamsson, S. et al. Multifocus microscopy with precise color multi-phase diffractive optics applied in functional neuronal imaging. Biomed. Opt. Express 7, 855–869 (2016).
Shelansky, R. & Boeger, H. Analysis Tools for MFM Movies of Nascent Transcription (2024). https://doi.org/10.5281/zenodo.12208276.
Thompson, R. E., Larson, D. R. & Webb, W. W. Precise nanometer localization analysis for individual fluorescent probes. Biophys. J. 82, 2775–2783 (2002).
Wu, Z., Huang, N. E., Long, S. R. & Peng, C. K. On the trend, detrending, and variability of nonlinear and nonstationary time series. In Proceedings of the National Academy of Sciences of the United States of America (2007).
Trcek, T. et al. Single-mRNA counting using fluorescent in situ hybridization in budding yeast. Nat. Protoc. 7, 408–419(2012).
Patel, H. P., Brouwer, I. & Lenstra, T. L. Optimized protocol for single-molecule RNA FISH to visualize gene expression in S. cerevisiae. STAR Protoc. 2, 100647 (2021).
Coulon, A. & Larson, D. R. Fluctuation analysis dissecting transcriptional kinetics with signal theory. Methods Enzymol. 572, 159–191 (2016).
Brown, C. R. et al. Chromatin structure analysis of single gene molecules by psoralen cross-linking and electron microscopy. Methods Mol. Biol. 1228, 93–121 (2015).
Schnakenberg, J. Network theory of microscopic and macroscopic behavior of master equation systems. Rev. Mod. Phys. 48, 571–585 (1976).
Acknowledgements
H.B. and S.A. gratefully acknowledge support from the National Science Foundation (awards #2111763 and #1828636, respectively). We thank John Field and Heta Patel for their critical comments on the manuscript. H.B. is indebted to Jeremey Gunawardena for stimulating discussions.
Author information
Authors and Affiliations
Contributions
R.S. generated strains, conducted MFM, single molecule-FISH, analyzed the primary data, and wrote Python programs for data analysis. S.A. built the microscope for MFM together with R.S. C.R.B. and M.D. isolated PHO5 molecules and analyzed single molecules by psoralen-crosslinking and electron microscopy. T.L. and D.R.L. provided software and training to R.S. H.B. conceived the project, analyzed dwell time distributions, and wrote the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Shelansky, R., Abrahamsson, S., Brown, C.R. et al. Single gene analysis in yeast suggests nonequilibrium regulatory dynamics for transcription. Nat Commun 15, 6226 (2024). https://doi.org/10.1038/s41467-024-50419-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-024-50419-5
- Springer Nature Limited