Abstract
In this paper, we study Jarzynski’s equality and fluctuation theorems for diffusion processes. While some of the results considered in the current work are known in the (mainly physics) literature, we review and generalize these nonequilibrium theorems using mathematical arguments, therefore enabling further investigations in the mathematical community. On the numerical side, variance reduction approaches such as importance sampling method are studied in order to compute free energy differences based on Jarzynski’s equality.
This is a preview of subscription content,
to check access.







References
Abrams, C., Bussi, G.: Enhanced sampling in molecular dynamics using metadynamics, replica-exchange, and temperature-acceleration. Entropy 16(1), 163–199 (2014)
Abreu, D., Seifert, U.: Extracting work from a single heat bath through feedback. EPL 94(1), 10001 (2011)
Ambrosio, L., Gigli, N., Savaré, G.: Gradient Flows: In Metric Spaces and in the Space of Probability Measures. Lectures in Mathematics. Birkhäuser, Basel (2005)
Aurell, E., Mejía-Monasterio, C., Muratore-Ginanneschi, P.: Optimal protocols and optimal transport in stochastic thermodynamics. Phys. Rev. Lett. 106, 250601 (2011)
Banyaga, A., Hurtubise, D.: Lectures on Morse Homology. Texts in the Mathematical Sciences. Springer, Berlin (2004)
Bennett, C.H.: Efficient estimation of free energy differences from monte carlo data. J. Comput. Phys. 22(2), 245–268 (1976)
Bishop, C.M.: Pattern Recognition and Machine Learning (Information Science and Statistics). Springer, New York (2006)
Callen, H.: Thermodynamics and an Introduction to Thermostatistics. Wiley, New York (1985)
Chaudhari, P., Oberman, A., Osher, S., Soatto, S., Carlier, G.: Deep relaxation: partial differential equations for optimizing deep neural networks. Submitted (2017)
Chetrite, R., Gawȩdzki, K.: Fluctuation relations for diffusion processes. Commun. Math. Phys. 282(2), 469–518 (2008)
Christ, C.D., Mark, A.E., van Gunsteren, W.F.: Basic ingredients of free energy calculations: a review. J. Comput. Chem. 31(8), 1569–1582 (2010)
Ciccotti, G., Kapral, R., Vanden-Eijnden, E.: Blue moon sampling, vectorial reaction coordinates, and unbiased constrained dynamics. ChemPhysChem 6(9), 1809–1814 (2005)
Ciccotti, G., Lelièvre, T., Vanden-Eijnden, E.: Projection of diffusions on submanifolds: application to mean force computation. Commun. Pure Appl. Math. 61(3), 371–408 (2008)
Crooks, G.E.: Nonequilibrium measurements of free energy differences for microscopically reversible Markovian systems. J. Stat. Phys. 90(5), 1481–1487 (1998)
Crooks, G.E.: Entropy production fluctuation theorem and the nonequilibrium work relation for free energy differences. Phys. Rev. E 60, 2721–2726 (1999)
Crooks, G.E.: Path-ensemble averages in systems driven far from equilibrium. Phys. Rev. E 61, 2361–2366 (2000)
Darbon, J., Osher, S.: Algorithms for overcoming the curse of dimensionality for certain Hamilton–Jacobi equations arising in control theory and elsewhere. Res. Math. Sci. 3(1), 19 (2016)
de Koning, M., Cai, W., Antonelli, A., Yip, S.: Efficient freeenergy calculations by the simulation of nonequilibrium processes. Comput. Sci. Eng. 2(3), 88–96 (2000)
Dellago, C., Hummer, G.: Computing equilibrium free energies using non-equilibrium molecular dynamics. Entropy 16(1), 41 (2014)
Dupuis, P., Spiliopoulos, K., Wang, H.: Importance sampling for multiscale diffusions. Multiscale Model. Simul. 10(1), 1–27 (2012)
Evans, L.C., Gariepy, R.F.: Measure Theory and Fine Properties of Functions. Studies in Advanced Mathematics. CRC Press, Boca Raton (1991)
Fleming, W.H., Soner, H.M.: Controlled Markov Processes and Viscosity Solutions. Springer, Berlin (2006)
Frenkel, D., Smit, B.: Understanding Molecular Simulation: From Algorithms to Applications. Computational Science Series. Elsevier, San Diego (2001)
Ge, H., Jiang, D.-Q.: Generalized Jarzynski’s equality of inhomogeneous multidimensional diffusion processes. J. Stat. Phys. 131(4), 675–689 (2008)
Givon, D., Kupferman, R., Stuart, A.: Extracting macroscopic dynamics: model problems and algorithms. Nonlinearity 17(6), R55–R127 (2004)
Gore, J., Ritort, F., Bustamante, C.: Bias and error in estimates of equilibrium free-energy differences from nonequilibrium measurements. Proc. Natl. Acad. Sci. USA 100(22), 12564–12569 (2003)
Gyöngy, I.: Mimicking the one-dimensional marginal distributions of processes having an Ito differential. Probab. Theory Relat. Fields 71(4), 501–516 (1986)
Hartmann, C., Richter, L., Schütte, C., Zhang, W.: Variational characterization of free energy: theory and algorithms. Entropy 19(11), 626 (2017)
Hartmann, C., Schütte, C., Weber, M., Zhang, W.: Importance sampling in path space for diffusion processes with slow-fast variables. Probab. Theory Relat. Fields 170, 177–228 (2017)
Hartmann, C., Schütte, C., Zhang, W.: Model reduction algorithms for optimal control and importance sampling of diffusions. Nonlinearity 29(8), 2298–2326 (2016)
Haussmann, U.G., Pardoux, E.: Time reversal of diffusions. Ann. Probab. 14(4), 1188–1205 (1986)
Hendrix, D.A., Jarzynski, C.: A “fast growth” method of computing free energy differences. J. Chem. Phys. 114(14), 5974–5981 (2001)
Horowitz, J.M., Vaikuntanathan, S.: Nonequilibrium detailed fluctuation theorem for repeated discrete feedback. Phys. Rev. E 82, 061120 (2010)
Hummer, G., Kevrekidis, I.G.: Coarse molecular dynamics of a peptide fragment: free energy, kinetics, and long-time dynamics computations. J. Chem. Phys. 118(23), 10762–10773 (2003)
Hummer, G., Szabo, A.: Free energy reconstruction from nonequilibrium single-molecule pulling experiments. Proc. Natl. Acad. Sci. USA 98(7), 3658–3661 (2001)
Jarzynski, C.: Equilibrium free-energy differences from nonequilibrium measurements: a master-equation approach. Phys. Rev. E 56, 5018–5035 (1997)
Jarzynski, C.: Nonequilibrium equality for free energy differences. Phys. Rev. Lett. 78, 2690–2693 (1997)
Jarzynski, C.: Rare events and the convergence of exponentially averaged work values. Phys. Rev. E 73, 046105 (2006)
Jarzynski, C.: Nonequilibrium work relations: foundations and applications. Eur. Phys. J. B 64(3), 331–340 (2008)
Krantz, S.G., Parks, H.R.: Geometric Integration Theory. Birkhäuser, Boston (2008)
Kubo, R.: The fluctuation-dissipation theorem. Rep. Prog. Phys. 29(1), 255 (1966)
Legoll, F., Lelièvre, T.: Effective dynamics using conditional expectations. Nonlinearity 23(9), 2131–2163 (2010)
Lelièvre, T., Rousset, M., Stoltz, G.: Computation of free energy differences through nonequilibrium stochastic dynamics: the reaction coordinate case. J. Comput. Phys. 222(2), 624–643 (2007)
Lelièvre, T., Rousset, M., Stoltz, G.: Free Energy Computations: A Mathematical Perspective. N.J. Imperial College Press, London (2010)
Lelièvre, T., Rousset, M., Stoltz, G.: Langevin dynamics with constraints and computation of free eneregy differences. Math. Comput. 81(280), 2071–2125 (2012)
MacKay, D.J.C.: Information Theory, Inference, and Learning Algorithms. Cambridge University Press, New York (2002)
Maragliano, L., Vanden-Eijnden, E.: A temperature accelerated method for sampling free energy and determining reaction pathways in rare events simulations. Chem. Phys. Lett. 426(13), 168–175 (2006)
Marconi, U.M.B., Puglisi, A., Rondoni, L., Vulpiani, A.: Fluctuationdissipation: response theory in statistical physics. Phys. Rep. 461(4), 111–195 (2008)
Minh, D.D.L., Chodera, J.D.: Optimal estimators and asymptotic variances for nonequilibrium path-ensemble averages. J. Chem. Phys. 131(13), 134110 (2009)
Oberhofer, H., Dellago, C.: Optimum bias for fast-switching free energy calculations. Comput. Phys. Commun. 179(13):41–45 (2008). Special Issue Based on the Conference on Computational Physics 2007CCP (2007)
Oberhofer, H., Dellago, C., Geissler, P.L.: Biased sampling of nonequilibrium trajectories: can fast switching simulations outperform conventional free energy calculation methods? J. Phys. Chem. B 109(14), 6902–6915 (2005)
Øksendal, B.: Stochastic Differential Equations: An Introduction with Applications, 5th edn. Springer, Berlin (2000)
Ponmurugan, M.: Generalized detailed fluctuation theorem under nonequilibrium feedback control. Phys. Rev. E 82, 031129 (2010)
Rousset, M., Stoltz, G.: Equilibrium sampling from nonequilibrium dynamics. J. Stat. Phys. 123(6), 1251–1272 (2006)
Rubinstein, R.Y., Kroese, D.P.: The Cross-Entropy Method: A Unified Approach to Combinatorial Optimization, Monte-Carlo Simulation and Machine Learning (Information Science and Statistics), 1st edn. Springer, Berlin (2004)
Sagawa, T., Ueda, M.: Generalized Jarzynski equality under nonequilibrium feedback control. Phys. Rev. Lett. 104, 090602 (2010)
Sagawa, T., Ueda, M.: Fluctuation theorem with information exchange: role of correlations in stochastic thermodynamics. Phys. Rev. Lett. 109, 180602 (2012)
Sagawa, T., Ueda, M.: Nonequilibrium thermodynamics of feedback control. Phys. Rev. E 85, 021104 (2012)
Schmiedl, T., Seifert, U.: Optimal finite-time processes in stochastic thermodynamics. Phys. Rev. Lett. 98, 108301 (2007)
Spiliopoulos, K.: Large deviations and importance sampling for systems of slow-fast motion. Appl. Math. Optim. 67, 123–161 (2013)
Then, H., Engel, A.: Computing the optimal protocol for finite-time processes in stochastic thermodynamics. Phys. Rev. E 77, 041105 (2008)
Vaikuntanathan, S., Jarzynski, C.: Escorted free energy simulations: improving convergence by reducing dissipation. Phys. Rev. Lett. 100, 190601 (2008)
Vaikuntanathan, S., Jarzynski, C.: Escorted free energy simulations. J. Chem. Phys. 134(5), 054107 (2011)
Vanden-Eijnden, E.: Some recent techniques for free energy calculations. J. Comput. Chem. 30(11), 1737–1747 (2009)
Vanden-Eijnden, E., Weare, J.: Rare event simulation of small noise diffusions. Commun. Pure Appl. Math. 65(12), 1770–1803 (2012)
Weinan, E., Han, J., Jentzen, A.: Deep learning-based numerical methods for high-dimensional parabolic partial differential equations and backward stochastic differential equations. Commun. Math. Stat. 5(4), 349–380 (2017)
Ytreberg, F.M., Swendsen, R.H., Zuckerman, D.M.: Comparison of free energy methods for molecular systems. J. Chem. Phys. 125(18), 184114 (2006)
Ytreberg, F.M., Zuckerman, D.M.: Single-ensemble nonequilibrium path-sampling estimates of free energy differences. J. Chem. Phys. 120(23), 10876–10879 (2004)
Zhang, W.: Ergodic SDEs on submanifolds and related numerical sampling schemes. Submitted (2018)
Zhang, W., Hartmann, C., Schütte, C.: Effective dynamics along given reaction coordinates, and reaction rate theory. Faraday Discuss. 195, 365–394 (2016)
Zhang, W., Schütte, C.: Reliable approximation of long relaxation timescales in molecular dynamics. Entropy 19(7), 367 (2017)
Zhang, W., Wang, H., Hartmann, C., Weber, M., Schütte, C.: Applications of the cross-entropy method to importance sampling and optimal control of diffusions. SIAM J. Sci. Comput. 36(6), A2654–A2672 (2014)
Acknowledgements
The authors acknowledge financial support by the Einstein Center of Mathematics (ECMath) through project CH21, and the DFG-CRC 1114 “Scaling Cascades in Complex Systems” through project A05 “Probing scales in equilibrated systems by optimal nonequilibrium forcing” and B05 “Origin of the scaling cascades in protein dynamics”. Part of the work was done while CS and WZ were attending the program “Complex High-Dimensional Energy Landscapes” at IPAM (UCLA), 2017. The authors thank the institute for hospitality and support.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix A: Connections with Thermodynamic Integration and Adiabatic Switching: Alchemical Transition Case
In this appendix, we study two (essentially equivalent) asymptotic regimes of nonequilibrium processes using formal arguments. In particular, we will derive the thermodynamic integration identity from Jarzynski’s identity, therefore bridging these two different free energy calculation methods. Let us point out that such a connection is indeed known in physics community [14], although we are not aware of its mathematical derivation in the literature. For simplicity, we only consider the alchemical transition case studied in Sect. 2 and assume the protocol \(\lambda (\cdot )\) is deterministic with \(\epsilon = 0\).
From Jarzynski’s equality to thermodynamic integration Thermodynamic integration is a well known method and has been widely used to compute free energy differences [23]. From the definition of the normalization constant \(Z(\cdot )\) in (10), we can derive the thermodynamic integration identity by the simple argument
In the following, using a formal argument, we show that the identity (144) corresponds to the Jarzynski’s equality (29) in certain asymptotic limit. For this purpose, we consider the dynamics
on \(s \in [0, T]\), where \(0 < \tau \ll 1\) and \(\lambda (s)\) satisfies the ODE
Clearly, dynamics (145) is related to (1) by rescaling time with the parameter \(0 < \tau \ll 1\), and its infinitesimal generator is \(\frac{1}{\tau } {\mathcal {L}}_1\), where \({\mathcal {L}}_1\) is defined in (6) with \(\lambda (\cdot )\) being time dependent. The main observation is that, repeating the argument from Sect. 2.2, the Jarzynski’s equality (29) holds for (145) and (146) for any \(\tau >0\). As a consequence,
where the function g now satisfies
To show that (147) reduces to the thermodynamic integration identity (144) as \(\tau \rightarrow 0\), it is enough to study the asymptotic limit of (148). To this end, we consider the formal asymptotic expansion
as \(\tau \rightarrow 0\), where \(g_0, g_1, \cdots \) are functions independent of \(\tau \). Substituting this expansion into (148) and comparing terms of different powers of \(\tau \), we can conclude that \(g_0=g_0(\lambda , t)\) is independent of x and satisfies
Taking the expectation with respect to \(\mu _{\lambda }\) on both sides of (149) and noticing that \({\mathbf {E}}_{\mu _\lambda }({\mathcal {L}}_1 g_1) = 0\), we obtain
It is easy to verify that the solution of (150) is given by
where \(\lambda (s)\) satisfies (146) with initial value \(\lambda (t) = \lambda \). Taking the limit \(\tau \rightarrow 0\) in (147) then yields
which is equivalent to the thermodynamic integration identity (144).
Adiabatic switching Now we turn to another (equivalent) asymptotic regime where the protocol \(\lambda (\cdot )\) is switched infinitely slowly. Specifically, given \(\lambda _0, \lambda _1 \in {\mathbb {R}}^m\), the protocol \(\lambda (\cdot )\) satisfying \(\lambda (0) = \lambda _0\) and \(\lambda (T) = \lambda _1\) as \(T\rightarrow +\infty \) is called adiabatic switching. For the nonequilibrium process \(x(\cdot )\) in (1) under adiabatic switching, it is well known that we have
i.e., the free energy difference equals to the average work performed during the switching. In the following we provide a formal mathematical argument to derive the above identity. For this purpose, we define
which, by the Feynman–Kac formula, satisfies
Notice that, as \(T\rightarrow +\infty \), the switching becomes infinitely slow and \({\dot{\lambda }}(t) = f\) goes to zero. Instead, we rescale the time by \({\bar{t}} = \frac{t}{T} \in [0, 1]\) and define \({\bar{\lambda }}({\bar{t}}\,) = \lambda (\frac{{\bar{t}}}{\tau })\), where \(\tau = \frac{1}{T} \rightarrow 0\). \({\bar{\lambda }}(\cdot )\) satisfies \({\bar{\lambda }}(0) = \lambda _0, {\bar{\lambda }}(1) = \lambda _1\) and
where \({\bar{f}}(\cdot , {\bar{t}}\,) = \frac{1}{\tau }f(\cdot , \frac{{\bar{t}}}{\tau })\) is a function of \({\mathcal {O}}(1)\). Under this time scaling, PDE (155) becomes
Consider the expansion \(u = u_0 + \tau u_1 + \tau ^2 u_2 + \cdots \), then the same argument as above yields that the function \(u_0\) is independent of x and satisfies
The solution of (158) can be directly computed:
where \({\bar{\lambda }}(\cdot )\) satisfies (156) on \([{\bar{t}}, 1]\) with \({\bar{\lambda }}({\bar{t}}\,) = \lambda \). In particular, taking \({\bar{t}} = 0\) and applying the thermodynamic integration identity (144), gives
Therefore,
which concludes the proof of (153).
Appendix B: Thermodynamic Integration Identity in the Reaction Coordinate Case
In the reaction coordinate case considered in Sect. 3, connections between the thermodynamic integration identity and the Jarzynski’s equality as well as the adiabatic switching regime can be studied using the same asymptotic argument as in Appendix A. In this section, we omit the derivation and only provide the thermodynamic integration identity. We emphasize that both the identity and its proof can be found in the literature, e.g., [42, 44]. The result is included for readers’ convenience.
Recall the definition of the probability measure \(\mu _z\) in (86), where the normalization constant is given by
and the free energy is defined in (87). Let \(z(s) \in {\mathbb {R}}^d\) satisfy the ODE (101) on [0, T]. Similar to the derivations in (144), and using Lemma 3 below, we can compute
where Einstein’s summation convention has been used.
Lemma 3
Let the function Q be defined in (161). For \(1 \le \gamma \le d\), we have
Proof
Let \(\varphi : {\mathbb {R}}^d \rightarrow {\mathbb {R}}\) be a smooth test function with compact support. For \(1 \le \gamma \le d\), integrating by parts and using (161), we have
On the other hand, from the relation
and the definition of the \(d \times d\) matrix \(\Psi \) in (88), we obtain
Therefore, integrating by parts, (163) simplifies to
from which we can conclude after simplification. \(\square \)
Appendix C: An Alternative Proof of Theorem 2
In this appendix, we provide an alternative proof of Theorem 2. Different from the proof in Sect. 2.3 where only the Feynman–Kac formula has been used, the proof below relies on the combination of both the Feynman–Kac formula and Girsanov’s Theorem. While the idea is inspired by the derivations in [10], the proof below is shorter.
Alternative proof of Theorem 2
First of all, we recall the definition of u in (43) as well as the equations (40), (44), (45) used in the proof of Theorem 2 in Sect. 2.3. In accordance with (45), we define
and consider the function \(\omega (x, \lambda ,t) = u\big ( x, \lambda , T-t\,;x',\lambda ',t'\big )\). From (44) and (45), we know that \(\omega \) satisfies
where \((x,\lambda ) \in {\mathbb {R}}^n \times {\mathbb {R}}^m\) and \(\overline{{\mathcal {L}}}_{(x,\lambda ,t)}\) is the operator (165) evaluated at \((x,\lambda ,t)\). On the other hand, applying the Feynman–Kac formula to (166), we observe that
where \(\overline{{\mathbf {E}}}_{x,\lambda ,t}\) denotes the conditional expectation under the path ensemble of the dynamics
and the control protocol
starting from \({\bar{x}}(t) = x\) and \({\bar{\lambda }}(t) = \lambda \) at time t. Note that the infinitesimal generator of the dynamics (168) and (169) is given by the operator \(\overline{{\mathcal {L}}}\) in (165).
Now we apply Girsanov’s theorem to change the probability measure in (167) from the path ensemble of the dynamics (168), (169) to the path ensemble of the dynamics (15), (3). Specifically, starting from \((x, \lambda )\) at time t, let \({\mathbf {P}}_{x, \lambda }\) and \(\overline{{\mathbf {P}}}_{x,\lambda }\) denote the path measures on the time interval \([t,T-t']\) corresponding to (15), (3) and (168), (169), respectively. Applying Girsanov’s theorem, we obtain after some straightforward calculations
Therefore, changing the probability measure in (167) from \(\overline{{\mathbf {P}}}_{x,\lambda }\) to \({\mathbf {P}}_{x,\lambda }\), using (170), (13), we find
Note that in the last equality above, we have converted Ito integration to Stratonovich integration according to (15). Substituting t by \(T-t\), integrating by parts, and recalling the expression (43), we obtain
where \({\mathcal {W}}\) is defined in (42). \(\square \)
Appendix D: Proof of Theorem 3
In this appendix, we provide the proof of Theorem 3 in Sect. 3.2.
Proof of Theorem 3
We consider the quantities on both sides of the equality (98). For the left hand side of (98), let us fix \((y',t') \in {\mathbb {R}}^n \times [0,T]\) and define the function u by
for \((y,t) \in {\mathbb {R}}^n \times [t',T]\). It is known that u satisfies the PDE
where the operator \({\mathcal {L}}^R\) is defined in (97) and \(\big ({\mathcal {L}}^R\big )^*\) denotes its formal \(L^2\) adjoint. A direct calculation shows that
for a smooth function \(\phi \).
For the right hand side of (98), fixing \((y', t') \in {\mathbb {R}}^n \times [0, T]\), we define the function g for \((y,t) \in {\mathbb {R}}^n \times [t',T]\) as
where \({\mathcal {W}}\) is defined in (99), and the dynamics \(y(\cdot )\) satisfies the SDE (93). Using the same argument as in Lemma 1, we can verify that g satisfies the PDE
where the operator \(\overline{{\mathcal {L}}}\) is defined as
for a smooth function \(\phi \). Now consider the function \(\omega (y,t) = e^{-\beta V(y)} g(y,t)\). A direct calculation shows that
Combining (97), (174), (175), (176), it follows that the function \(\omega \) satisfies the PDE
Comparing this with the equation of function u in (172), we obtain
which is equivalent to (98). \(\square \)
Rights and permissions
About this article
Cite this article
Hartmann, C., Schütte, C. & Zhang, W. Jarzynski’s Equality, Fluctuation Theorems, and Variance Reduction: Mathematical Analysis and Numerical Algorithms. J Stat Phys 175, 1214–1261 (2019). https://doi.org/10.1007/s10955-019-02286-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10955-019-02286-4