Abstract
We present general and rigorous results showing that the microcanonical and canonical ensembles are equivalent at all three levels of description considered in statistical mechanics—namely, thermodynamics, equilibrium macrostates, and microstate measures—whenever the microcanonical entropy is concave as a function of the energy density in the thermodynamic limit. This is proved for any classical many-particle systems for which thermodynamic functions and equilibrium macrostates exist and are defined via large deviation principles, generalizing many previous results obtained for specific classes of systems and observables. Similar results hold for other dual ensembles, such as the canonical and grand-canonical ensembles, in addition to trajectory or path ensembles describing nonequilibrium systems driven in steady states.
Similar content being viewed by others
Notes
Although this has not been proved rigorously, it is thought that the presence of long-range interactions is a necessary but not sufficient condition for having nonconcave entropies. It is known at least that not all long-range systems have nonconcave entropies.
There are strong reasons to believe that this cannot be otherwise: that is, many-body systems should have equilibrium states in the thermodynamic limit only when they are described by large deviation theory or, more precisely, when their distribution follows what is called the large deviation principle; see Sect. 2.
There are many reasons for considering Polish spaces: one is that projections of measurable subsets of a Polish space are measurable; another is that the set of probability measures defined on a Polish space is also Polish; see Appendix D of [38] for more details.
We should define the LDP more precisely for the sequence \(\{P_{N,\beta }\}\) of probability measures associated with the sequence \(\{M_{N}\}\) of random variables. Here, we simplify the presentation by referring directly to macrostates and their probabilities.
In information theory, the sign \(\doteq \) is sometimes used instead of \(\asymp \) [41].
Consider the obvious example of the mean energy \(h_{N},\) which does not fluctuate in the microcanonical ensemble but does in the canonical ensemble.
Convexity is used instead of concavity because I is defined as a rate function rather than an entropy function.
The term ‘superdifferentials’ should be used for concave functions, but we will keep to the more common term ‘subdifferentials’.
References
Boltzmann, L.: Über die eigenschaften monozyklischer und anderer damit verwandter Systeme. In: Hasenöhrl, F. (ed.) Wissenschaftliche Abhandlungen, vol. III. Chelsea, New York (1968)
Gibbs, J.W.: Elementary Principles in Statistical Mechanics with Especial Reference to the Rational Foundation of Thermodynamics (Yale University Press, 1902) reprinted by Dover, New York (1960)
Touchette, H., Ellis, R.S., Turkington, B.: An introduction to the thermodynamic and macrostate levels of nonequivalent ensembles. Physica A 340, 138–146 (2004)
Ellis, R.S., Haven, K., Turkington, B.: Large deviation principles and complete equivalence and nonequivalence results for pure and mixed ensembles. J. Stat. Phys. 101, 999–1064 (2000)
Lewis, J.T., Pfister, C.-E., Sullivan, W.G.: Large deviations and the thermodynamic formalism: a new proof of the equivalence of ensembles. In: Fannes, M., Maes, C., Verbeure, A. (eds.) On Three Levels, pp. 183–192. Plenum Press, New York (1994)
Lewis, J.T., Pfister, C.-E., Sullivan, G.W.: The equivalence of ensembles for lattice systems: some examples and a counterexample. J. Stat. Phys. 77, 397–419 (1994b)
Lewis, J.T., Pfister, C.-E., Sullivan, W.G.: Entropy, concentration of probability and conditional limit theorem. Markov Process. Relat. Fields 1, 319–386 (1995)
Touchette, H.: Ensemble equivalence for general many-body systems. Europhys. Lett. 96, 50010 (2011)
Ruelle, D.: Statistical Mechanics: Rigorous Results. W. A. Benjamin, Amsterdam (1969)
Griffiths, R.B.: Rigorous results and theorems. In: Domb, C., Green, M.S. (eds.) Phase Transitions and Critical Phenomena, vol. 1, pp. 7–109. Academic Press, London (1972)
Lynden-Bell, D., Wood, R.: The gravo-thermal catastrophe in isothermal spheres and the onset of red-giant structure for stellar systems. Mon. Not. R. Astron. Soc. 138, 495–525 (1968)
Lynden-Bell, D., Lynden-Bell, R.M.: On the negative specific heat paradox. Mon. Not. R. Astron. Soc. 181, 405–419 (1977)
Lynden-Bell, D.: Negative specific heat in astronomy, physics and chemistry. Physica A 263, 293–304 (1999)
Thirring, W.: Systems with negative specific heat. Z. Phys. A 235, 339–352 (1970)
Chavanis, P.-H.: Phase transitions in self-gravitating systems. Int. J. Mod. Phys. B 20, 3113–3198 (2006)
Campa, A., Dauxois, T., Ruffo, S.: Statistical mechanics and dynamics of solvable models with long-range interactions. Phys. Rep. 480, 57–159 (2009)
Dauxois, T., Ruffo, S., Cugliandolo, L.F. (eds.): Long-Range Interacting Systems, vol. 90. Lecture Notes of the Les Houches Summer School 2008. Oxford University Press, Oxford (2010)
Kastner, M.: Nonequivalence of ensembles for long-range quantum spin systems in optical lattices. Phys. Rev. Lett. 104, 240403 (2010a)
Kastner, M.: Nonequivalence of ensembles in the Curie–Weiss anisotropic quantum Heisenberg model. J. Stat. Mech. 2010, P07006 (2010)
Olivier, G., Kastner, M.: Microcanonical analysis of the Curie–Weiss anisotropic quantum Heisenberg model in a magnetic field. J. Stat. Phys. 157, 456–473 (2014)
Chalony, M., Barré, J., Marcos, B., Olivetti, A., Wilkowski, D.: Long-range one-dimensional gravitational-like interaction in a neutral atomic cold gas. Phys. Rev. A 87, 013401 (2013)
Gurarie, V.: The equivalence between the canonical and microcanonical ensembles when applied to large systems. Am. J. Phys. 75, 747–751 (2007)
Galgani, L., Scotti, A., Valz Gris, F.: Asymptotic equivalence of classical ensembles by the method of the maximum. Physica 47, 601–609 (1970)
Galgani, L., Manzoni, L., Scotti, A.: Asymptotic equivalence of equilibrium ensembles of classical statistical mechanics. J. Math. Phys. 12, 933–935 (1971)
Lanford III, O.E.: Entropy and equilibrium states in classical statistical mechanics. In: Lenard, A. (ed.) Statistical Mechanics and Mathematical Problems. Lecture Notes in Physics, vol. 20, pp. 1–113. Springer, Berlin (1973)
Deuschel, J.-D., Stroock, D.W., Zessin, H.: Microcanonical distributions for lattice gases. Commun. Math. Phys. 139, 83–101 (1991)
Stroock, D.W., Zeitouni, O.: Microcanonical distributions, Gibbs’ states, and the equivalence of ensembles. In: Durrett, R., Kesten, H. (eds.) Random Walks, Brownian Motion and Interacting Particle Systems: A Festschrift in Honor of Frank Spitzer, pp. 399–424. Birkhäuser, Boston (1991)
Stroock, D.W.: Microcanonical distributions for one dimensional lattice gases. In: Cruzeiro, A.B., Zambrini, J.-C. (eds.) Stochastic Analysis and Applications: Proceedings of the 1989 Lisbon Conference, Progress in Probability, vol. 26, pp. 190–197. Birkhäuser, Boston (1991)
Roelly, S., Zessin, H.: The equivalence of equilibrium principles in statistical mechanics and some applications to large particle systems. Expo. Math. 11, 385–405 (1993)
Georgii, H.-O.: Large deviations and maximum entropy principle for interacting random fields on \({ Z}^{d}\). Ann. Probab. 21, 1845–1875 (1993)
Georgii, H.-O.: Large deviations and the equivalence of ensembles for Gibbsian particle systems with superstable interaction. Probab. Theory Relat. Fields 99, 171–195 (1994)
Georgii, H.-O.: The equivalence of ensembles for classical systems of particles. J. Stat. Phys. 80, 1341–1378 (1995)
Kac, M., Uhlenbeck, G.E., Hemmer, P.C.: On the van der Waals theory of the vapor–liquid equilibrium I: discussion of a one-dimensional model. J. Math. Phys. 4, 216–228 (1963)
Ellis, R.S.: Entropy, Large Deviations, and Statistical Mechanics. Springer, New York (1985)
Ellis, R.S.: An overview of the theory of large deviations and applications to statistical mechanics. Scand. Actuar. J. 1, 97–142 (1995)
Ellis, R.S.: The theory of large deviations: from Boltzmann’s 1877 calculation to equilibrium macrostates in 2D turbulence. Physica D 133, 106–136 (1999)
Touchette, H.: The large deviation approach to statistical mechanics. Phys. Rep. 478, 1–69 (2009)
Dembo, A., Zeitouni, O.: Large Deviations Techniques and Applications, 2nd edn. Springer, New York (1998)
Pathria, R.K.: Statistical Mechanics, 2nd edn. Butterworth Heinemann, Oxford (1996)
Dorlas, T.C.: Statistical Mechanics: Fundamentals and Model Solutions. IOP, Bristol (1999)
Cover, T.M., Thomas, J.A.: Elements of Information Theory. Wiley, New York (1991)
Varadhan, S.R.S.: Asymptotic probabilities and differential equations. Commun. Pure Appl. Math. 19, 261–286 (1966)
Rockafellar, R.T.: Convex Analysis. Princeton University Press, Princeton (1970)
Costeniuc, M., Ellis, R.S., Touchette, H., Turkington, B.: The generalized canonical ensemble and its universal equivalence with the microcanonical ensemble. J. Stat. Phys. 119, 1283–1329 (2005)
Touchette, H.: Equivalence and nonequivalence of the microcanonical and canonical ensembles: a large deviations study. PhD Thesis, Department of Physics, McGill University (2003)
Touchette, H., Ellis, R.S.: Nonequivalent ensembles and metastability. In: Beck, C., Benedek, G., Rapisarda, A., Tsallis, C. (eds.) Complexity, Metastability and Nonextensivity, pp. 81–87. World Scientific, Singapore (2005)
Eyink, G.L., Spohn, H.: Negative-temperature states and large-scale, long-lived vortices in two-dimensional turbulence. J. Stat. Phys. 70, 833–886 (1993)
Yukalov, V.: Fluctuations of composite observables and stability of statistical systems. Phys. Rev. E 72, 066119 (2005)
Csiszar, I.: \({I}\)-divergence geometry of probability distributions and minimization problems. Ann. Probab. 3, 146–158 (1975)
Csiszar, I.: Sanov property, generalized \({I}\)-projection and a conditional limit theorem. Ann. Probab. 12, 768–793 (1984)
Csiszar, I.: The method of types. IEEE Trans. Inf. Theory 44, 2505–2523 (1998)
Shields, P.C.: Two divergence-rate counterexamples. J. Theor. Probab. 6, 521–545 (1993)
Grosskinsky, S., Schütz, G.M., Spohn, H.: Condensation in the zero range process: stationary and dynamical properties. J. Stat. Phys. 113, 389–410 (2003)
Grosskinsky, S., Schütz, G.: Discontinuous condensation transition and nonequivalence of ensembles in a zero-range process. J. Stat. Phys. 132, 77–108 (2008)
Chleboun, P., Grosskinsky, S.: Condensation in stochastic particle systems with stationary product measures. J. Stat. Phys. 154, 432–465 (2014)
Föllmer, H., Orey, S.: Large deviations for the empirical field of a Gibbs measure. Ann. Probab. 16, 961–977 (1988)
Föllmer, H.: Random fields and diffusion processes. In: Hennequin, P.-L. (ed.) École d’Été de Probabilités de Saint-Flour 1985–87. Lecture Notes in Mathematics, vol. 1362, pp. 101–203. Springer, New York (1988)
Georgii, H.-O.: Gibbs Measures and Phase Transitions, vol. 9. Walter de Gruyter and Co., Berlin (1988)
Campa, A., Ruffo, S., Touchette, H.: Negative magnetic susceptibility and nonequivalent ensembles for the mean-field \(\phi ^{4}\) spin model. Physica A 385, 233–248 (2007)
Asmussen, S., Glynn, P.W.: Stochastic Simulation: Algorithms and Analysis. Stochastic Modelling and Applied Probability. Springer, New York (2007)
Squartini, T., de Mol, J., den Hollander, F., Garlaschelli, D.: Breaking of ensemble equivalence in networks. arxiv:1501.00388
Jack, R.L., Sollich, P.: Large deviations and ensembles of trajectories in stochastic models. Prog. Theor. Phys. Suppl. 184, 304–317 (2010)
Chetrite, R., Touchette, H.: Nonequilibrium microcanonical and canonical ensembles and their equivalence. Phys. Rev. Lett. 111, 120601 (2013)
Chetrite, R., Touchette, H.: Nonequilibrium Markov processes conditioned on large deviations. Ann. Inst. Poincaré A (2014)
Evans, R.M.L.: Detailed balance has a counterpart in non-equilibrium steady states. J. Phys. A 38, 293–313 (2005)
Chandler, D., Garrahan, J.P.: Dynamics on the way to forming glass: bubbles in space–time. Ann. Rev. Chem. Phys. 61, 191–217 (2010)
Acknowledgments
I would like to thank many colleagues who have provided useful ideas, comments, and support during the last 12 years that I worked on long-range systems and nonequivalent ensembles: Julien Barré, Freddy Bouchet, Raphael Chetrite, Thierry Dauxois, Rosemary J. Harris, Michael Kastner, Cesare Nardini, and Stefano Ruffo. I especially want to thank Richard S. Ellis for introducing me to many gems and subtleties of large deviations. The present paper owes much to his work.
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix 1: Concavity of the Entropy
Let \(s{\text {:}}\,\mathbb {R}\rightarrow \mathbb {R}\cup \{-\infty \}\) be a real function with domain \({\text {dom}}\,s,\) and consider the inequality
The set of all \(\beta \in \mathbb {R}\) for which this inequality is satisfied is called the subdifferential set or simply the subdifferential of s at u and is denoted by \(\partial s(u).\) Footnote 8 The interpretation of this inequality is shown in Fig. 1: if it is possible to draw a line passing through the graph of \(s(u)\) which is everywhere above s, then \(\partial s(u)\ne \emptyset .\) In this case, we also say that s admits a supporting line at u, which is unique if s is differentiable at u. If \(\partial s(u)=\emptyset ,\) then s admits no supporting line at u.
It is easy to see geometrically that nonconcave points of s do not admit supporting lines, while concave points have supporting lines, except possibly if they lie on the boundary of \({\text {dom}}\,s;\) see Sect. 24 of [43] or Appendix A of [44]. The reason for possibly excluding boundary points arises because \(s(u)\) may have diverging ‘slopes’ where \(\partial s(u)\) is not defined, as in the following example adapted from [43, p. 215]:
In this case, \(s(u)=s^{**}(u)\) for all \(u\in {\text {dom}}\,s=[-1,\,1],\) so that s is a concave function, but it has supporting lines only over \((-1,\,1)=\mathrm{int}\ ({\text {dom}}\,s),\) since \(s^{\prime }(u)\) diverges as \(u\rightarrow \pm 1\) from within its domain. All cases of concave points with no supporting lines are of this type, since it can be proved in \(\mathbb {R}\) that
see again Sect. 24 of [43] or Appendix A of [44].
With this proviso on boundary points, s is often defined to be strictly concave at u if it admits supporting lines at u that do not touch other points of its graph. If s has a supporting line at u touching other points of its graph, then s is said to be non-strictly concave at s. Finally, if s admits no supporting line at u, then s is said to be nonconcave at u. These definitions are also illustrated in Fig. 1. For generalizations of these definitions to \(\mathbb {R}^{d}\) in terms of supporting hyperplanes, see [43] and Appendix A of [44].
Appendix 2: Varadhan’s Theorem and the Laplace Principle
We recall in this section two important results about Laplace approximations of exponential integrals in general spaces. In the following, \(\{a_{n}\}_{n=1}^{\infty }\) is an increasing sequence such that \(a_{n}\nearrow \infty \) when \(n\rightarrow \infty .\) Moreover, \(\{P_{n}\}_{n=1}^{\infty }\) is a sequence of probability measures defined on a (Polish) space \(\mathcal {X}.\) In this paper, N takes the role of \(a_{n}\) and n.
Theorem 13
(Varadhan, 1966 [42]) Assume that \(P_{n}(dx)\) satisfies the LDP with speed \(a_{n}\) and rate function I on \(\mathcal {X}.\) Let F be a continuous function.
-
(a)
(Bounded case) assume that \(\sup _{x} F(x)<\infty .\) Then
$$\begin{aligned} \lim \limits _{n\rightarrow \infty } \frac{1}{a_{n}} \ln \int \nolimits _{\mathcal {X}} e^{a_{n} F(x)} P_{n}(dx) =\sup \limits _{x\in \mathcal {X}} \{F(x)-I(x)\}<\infty . \end{aligned}$$(87) -
(b)
(Unbounded case) assume that F satisfies
$$\begin{aligned} \lim \limits _{L\rightarrow \infty } \lim \limits _{n\rightarrow \infty } \frac{1}{a_{n}}\ln \int \nolimits _{\{F\ge L\}} e^{a_{n} F(x)} P_{n}(dx)=-\infty . \end{aligned}$$(88)Then the result of (a) holds and is finite. In particular, if F is bounded above on the support of \(P_{n},\) then (a) holds.
For a proof of this result, see the Appendix B of [34] or Theorem 4.3.1 in [38]. For historical notes on this result, see Sect. 3.7 of [37].
Consider now the exponentially tilted probability measure
where
This is also known as the exponential family or Esscher transform of \(P_{n}.\)
Theorem 14
(LDP for tilted measures) Assume that \(W_{n,F}<\infty .\) Then \(P_{n,F}\) satisfies the LDP with speed \(a_{n}\) and rate function
where
A proof of this result can be found in Theorem 11.7.2 of [34] or by combining Proposition 3.4 and Theorem 9.1 of [35]. A thermodynamic version of this result also appears in Theorem 4.1 of [7].
Rights and permissions
About this article
Cite this article
Touchette, H. Equivalence and Nonequivalence of Ensembles: Thermodynamic, Macrostate, and Measure Levels. J Stat Phys 159, 987–1016 (2015). https://doi.org/10.1007/s10955-015-1212-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10955-015-1212-2