Skip to main content
Log in

Equivalence and Nonequivalence of Ensembles: Thermodynamic, Macrostate, and Measure Levels

  • Published:
Journal of Statistical Physics Aims and scope Submit manuscript

Abstract

We present general and rigorous results showing that the microcanonical and canonical ensembles are equivalent at all three levels of description considered in statistical mechanics—namely, thermodynamics, equilibrium macrostates, and microstate measures—whenever the microcanonical entropy is concave as a function of the energy density in the thermodynamic limit. This is proved for any classical many-particle systems for which thermodynamic functions and equilibrium macrostates exist and are defined via large deviation principles, generalizing many previous results obtained for specific classes of systems and observables. Similar results hold for other dual ensembles, such as the canonical and grand-canonical ensembles, in addition to trajectory or path ensembles describing nonequilibrium systems driven in steady states.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

Notes

  1. Although this has not been proved rigorously, it is thought that the presence of long-range interactions is a necessary but not sufficient condition for having nonconcave entropies. It is known at least that not all long-range systems have nonconcave entropies.

  2. There are strong reasons to believe that this cannot be otherwise: that is, many-body systems should have equilibrium states in the thermodynamic limit only when they are described by large deviation theory or, more precisely, when their distribution follows what is called the large deviation principle; see Sect. 2.

  3. There are many reasons for considering Polish spaces: one is that projections of measurable subsets of a Polish space are measurable; another is that the set of probability measures defined on a Polish space is also Polish; see Appendix D of [38] for more details.

  4. We should define the LDP more precisely for the sequence \(\{P_{N,\beta }\}\) of probability measures associated with the sequence \(\{M_{N}\}\) of random variables. Here, we simplify the presentation by referring directly to macrostates and their probabilities.

  5. In information theory, the sign \(\doteq \) is sometimes used instead of \(\asymp \) [41].

  6. Consider the obvious example of the mean energy \(h_{N},\) which does not fluctuate in the microcanonical ensemble but does in the canonical ensemble.

  7. Convexity is used instead of concavity because I is defined as a rate function rather than an entropy function.

  8. The term ‘superdifferentials’ should be used for concave functions, but we will keep to the more common term ‘subdifferentials’.

References

  1. Boltzmann, L.: Über die eigenschaften monozyklischer und anderer damit verwandter Systeme. In: Hasenöhrl, F. (ed.) Wissenschaftliche Abhandlungen, vol. III. Chelsea, New York (1968)

    Google Scholar 

  2. Gibbs, J.W.: Elementary Principles in Statistical Mechanics with Especial Reference to the Rational Foundation of Thermodynamics (Yale University Press, 1902) reprinted by Dover, New York (1960)

  3. Touchette, H., Ellis, R.S., Turkington, B.: An introduction to the thermodynamic and macrostate levels of nonequivalent ensembles. Physica A 340, 138–146 (2004)

  4. Ellis, R.S., Haven, K., Turkington, B.: Large deviation principles and complete equivalence and nonequivalence results for pure and mixed ensembles. J. Stat. Phys. 101, 999–1064 (2000)

    Article  MATH  MathSciNet  Google Scholar 

  5. Lewis, J.T., Pfister, C.-E., Sullivan, W.G.: Large deviations and the thermodynamic formalism: a new proof of the equivalence of ensembles. In: Fannes, M., Maes, C., Verbeure, A. (eds.) On Three Levels, pp. 183–192. Plenum Press, New York (1994)

    Chapter  Google Scholar 

  6. Lewis, J.T., Pfister, C.-E., Sullivan, G.W.: The equivalence of ensembles for lattice systems: some examples and a counterexample. J. Stat. Phys. 77, 397–419 (1994b)

    Article  ADS  MATH  MathSciNet  Google Scholar 

  7. Lewis, J.T., Pfister, C.-E., Sullivan, W.G.: Entropy, concentration of probability and conditional limit theorem. Markov Process. Relat. Fields 1, 319–386 (1995)

    MATH  MathSciNet  Google Scholar 

  8. Touchette, H.: Ensemble equivalence for general many-body systems. Europhys. Lett. 96, 50010 (2011)

    Article  ADS  Google Scholar 

  9. Ruelle, D.: Statistical Mechanics: Rigorous Results. W. A. Benjamin, Amsterdam (1969)

    MATH  Google Scholar 

  10. Griffiths, R.B.: Rigorous results and theorems. In: Domb, C., Green, M.S. (eds.) Phase Transitions and Critical Phenomena, vol. 1, pp. 7–109. Academic Press, London (1972)

    Google Scholar 

  11. Lynden-Bell, D., Wood, R.: The gravo-thermal catastrophe in isothermal spheres and the onset of red-giant structure for stellar systems. Mon. Not. R. Astron. Soc. 138, 495–525 (1968)

    Article  ADS  Google Scholar 

  12. Lynden-Bell, D., Lynden-Bell, R.M.: On the negative specific heat paradox. Mon. Not. R. Astron. Soc. 181, 405–419 (1977)

    Article  ADS  Google Scholar 

  13. Lynden-Bell, D.: Negative specific heat in astronomy, physics and chemistry. Physica A 263, 293–304 (1999)

    Article  ADS  Google Scholar 

  14. Thirring, W.: Systems with negative specific heat. Z. Phys. A 235, 339–352 (1970)

    Article  Google Scholar 

  15. Chavanis, P.-H.: Phase transitions in self-gravitating systems. Int. J. Mod. Phys. B 20, 3113–3198 (2006)

    Article  ADS  MATH  Google Scholar 

  16. Campa, A., Dauxois, T., Ruffo, S.: Statistical mechanics and dynamics of solvable models with long-range interactions. Phys. Rep. 480, 57–159 (2009)

    Article  ADS  MathSciNet  Google Scholar 

  17. Dauxois, T., Ruffo, S., Cugliandolo, L.F. (eds.): Long-Range Interacting Systems, vol. 90. Lecture Notes of the Les Houches Summer School 2008. Oxford University Press, Oxford (2010)

  18. Kastner, M.: Nonequivalence of ensembles for long-range quantum spin systems in optical lattices. Phys. Rev. Lett. 104, 240403 (2010a)

    Article  ADS  Google Scholar 

  19. Kastner, M.: Nonequivalence of ensembles in the Curie–Weiss anisotropic quantum Heisenberg model. J. Stat. Mech. 2010, P07006 (2010)

    Article  Google Scholar 

  20. Olivier, G., Kastner, M.: Microcanonical analysis of the Curie–Weiss anisotropic quantum Heisenberg model in a magnetic field. J. Stat. Phys. 157, 456–473 (2014)

    Article  ADS  MATH  Google Scholar 

  21. Chalony, M., Barré, J., Marcos, B., Olivetti, A., Wilkowski, D.: Long-range one-dimensional gravitational-like interaction in a neutral atomic cold gas. Phys. Rev. A 87, 013401 (2013)

    Article  ADS  Google Scholar 

  22. Gurarie, V.: The equivalence between the canonical and microcanonical ensembles when applied to large systems. Am. J. Phys. 75, 747–751 (2007)

    Article  ADS  Google Scholar 

  23. Galgani, L., Scotti, A., Valz Gris, F.: Asymptotic equivalence of classical ensembles by the method of the maximum. Physica 47, 601–609 (1970)

    Article  ADS  MathSciNet  Google Scholar 

  24. Galgani, L., Manzoni, L., Scotti, A.: Asymptotic equivalence of equilibrium ensembles of classical statistical mechanics. J. Math. Phys. 12, 933–935 (1971)

    Article  ADS  Google Scholar 

  25. Lanford III, O.E.: Entropy and equilibrium states in classical statistical mechanics. In: Lenard, A. (ed.) Statistical Mechanics and Mathematical Problems. Lecture Notes in Physics, vol. 20, pp. 1–113. Springer, Berlin (1973)

  26. Deuschel, J.-D., Stroock, D.W., Zessin, H.: Microcanonical distributions for lattice gases. Commun. Math. Phys. 139, 83–101 (1991)

    Article  ADS  MATH  MathSciNet  Google Scholar 

  27. Stroock, D.W., Zeitouni, O.: Microcanonical distributions, Gibbs’ states, and the equivalence of ensembles. In: Durrett, R., Kesten, H. (eds.) Random Walks, Brownian Motion and Interacting Particle Systems: A Festschrift in Honor of Frank Spitzer, pp. 399–424. Birkhäuser, Boston (1991)

    Chapter  Google Scholar 

  28. Stroock, D.W.: Microcanonical distributions for one dimensional lattice gases. In: Cruzeiro, A.B., Zambrini, J.-C. (eds.) Stochastic Analysis and Applications: Proceedings of the 1989 Lisbon Conference, Progress in Probability, vol. 26, pp. 190–197. Birkhäuser, Boston (1991)

    Chapter  Google Scholar 

  29. Roelly, S., Zessin, H.: The equivalence of equilibrium principles in statistical mechanics and some applications to large particle systems. Expo. Math. 11, 385–405 (1993)

    MATH  MathSciNet  Google Scholar 

  30. Georgii, H.-O.: Large deviations and maximum entropy principle for interacting random fields on \({ Z}^{d}\). Ann. Probab. 21, 1845–1875 (1993)

    Article  MATH  MathSciNet  Google Scholar 

  31. Georgii, H.-O.: Large deviations and the equivalence of ensembles for Gibbsian particle systems with superstable interaction. Probab. Theory Relat. Fields 99, 171–195 (1994)

    Article  MATH  MathSciNet  Google Scholar 

  32. Georgii, H.-O.: The equivalence of ensembles for classical systems of particles. J. Stat. Phys. 80, 1341–1378 (1995)

    Article  ADS  MATH  MathSciNet  Google Scholar 

  33. Kac, M., Uhlenbeck, G.E., Hemmer, P.C.: On the van der Waals theory of the vapor–liquid equilibrium I: discussion of a one-dimensional model. J. Math. Phys. 4, 216–228 (1963)

    Article  ADS  MATH  MathSciNet  Google Scholar 

  34. Ellis, R.S.: Entropy, Large Deviations, and Statistical Mechanics. Springer, New York (1985)

    Book  MATH  Google Scholar 

  35. Ellis, R.S.: An overview of the theory of large deviations and applications to statistical mechanics. Scand. Actuar. J. 1, 97–142 (1995)

    Article  Google Scholar 

  36. Ellis, R.S.: The theory of large deviations: from Boltzmann’s 1877 calculation to equilibrium macrostates in 2D turbulence. Physica D 133, 106–136 (1999)

    Article  ADS  MATH  MathSciNet  Google Scholar 

  37. Touchette, H.: The large deviation approach to statistical mechanics. Phys. Rep. 478, 1–69 (2009)

    Article  ADS  MathSciNet  Google Scholar 

  38. Dembo, A., Zeitouni, O.: Large Deviations Techniques and Applications, 2nd edn. Springer, New York (1998)

    Book  MATH  Google Scholar 

  39. Pathria, R.K.: Statistical Mechanics, 2nd edn. Butterworth Heinemann, Oxford (1996)

    MATH  Google Scholar 

  40. Dorlas, T.C.: Statistical Mechanics: Fundamentals and Model Solutions. IOP, Bristol (1999)

    MATH  Google Scholar 

  41. Cover, T.M., Thomas, J.A.: Elements of Information Theory. Wiley, New York (1991)

    Book  MATH  Google Scholar 

  42. Varadhan, S.R.S.: Asymptotic probabilities and differential equations. Commun. Pure Appl. Math. 19, 261–286 (1966)

    Article  MATH  MathSciNet  Google Scholar 

  43. Rockafellar, R.T.: Convex Analysis. Princeton University Press, Princeton (1970)

    MATH  Google Scholar 

  44. Costeniuc, M., Ellis, R.S., Touchette, H., Turkington, B.: The generalized canonical ensemble and its universal equivalence with the microcanonical ensemble. J. Stat. Phys. 119, 1283–1329 (2005)

  45. Touchette, H.: Equivalence and nonequivalence of the microcanonical and canonical ensembles: a large deviations study. PhD Thesis, Department of Physics, McGill University (2003)

  46. Touchette, H., Ellis, R.S.: Nonequivalent ensembles and metastability. In: Beck, C., Benedek, G., Rapisarda, A., Tsallis, C. (eds.) Complexity, Metastability and Nonextensivity, pp. 81–87. World Scientific, Singapore (2005)

  47. Eyink, G.L., Spohn, H.: Negative-temperature states and large-scale, long-lived vortices in two-dimensional turbulence. J. Stat. Phys. 70, 833–886 (1993)

    Article  ADS  MATH  MathSciNet  Google Scholar 

  48. Yukalov, V.: Fluctuations of composite observables and stability of statistical systems. Phys. Rev. E 72, 066119 (2005)

    Article  ADS  Google Scholar 

  49. Csiszar, I.: \({I}\)-divergence geometry of probability distributions and minimization problems. Ann. Probab. 3, 146–158 (1975)

    Article  MATH  MathSciNet  Google Scholar 

  50. Csiszar, I.: Sanov property, generalized \({I}\)-projection and a conditional limit theorem. Ann. Probab. 12, 768–793 (1984)

    Article  MATH  MathSciNet  Google Scholar 

  51. Csiszar, I.: The method of types. IEEE Trans. Inf. Theory 44, 2505–2523 (1998)

    Article  MATH  MathSciNet  Google Scholar 

  52. Shields, P.C.: Two divergence-rate counterexamples. J. Theor. Probab. 6, 521–545 (1993)

    Article  MATH  MathSciNet  Google Scholar 

  53. Grosskinsky, S., Schütz, G.M., Spohn, H.: Condensation in the zero range process: stationary and dynamical properties. J. Stat. Phys. 113, 389–410 (2003)

    Article  MATH  Google Scholar 

  54. Grosskinsky, S., Schütz, G.: Discontinuous condensation transition and nonequivalence of ensembles in a zero-range process. J. Stat. Phys. 132, 77–108 (2008)

    Article  ADS  MATH  MathSciNet  Google Scholar 

  55. Chleboun, P., Grosskinsky, S.: Condensation in stochastic particle systems with stationary product measures. J. Stat. Phys. 154, 432–465 (2014)

    Article  MATH  MathSciNet  Google Scholar 

  56. Föllmer, H., Orey, S.: Large deviations for the empirical field of a Gibbs measure. Ann. Probab. 16, 961–977 (1988)

    Article  MATH  MathSciNet  Google Scholar 

  57. Föllmer, H.: Random fields and diffusion processes. In: Hennequin, P.-L. (ed.) École d’Été de Probabilités de Saint-Flour 1985–87. Lecture Notes in Mathematics, vol. 1362, pp. 101–203. Springer, New York (1988)

  58. Georgii, H.-O.: Gibbs Measures and Phase Transitions, vol. 9. Walter de Gruyter and Co., Berlin (1988)

    Book  MATH  Google Scholar 

  59. Campa, A., Ruffo, S., Touchette, H.: Negative magnetic susceptibility and nonequivalent ensembles for the mean-field \(\phi ^{4}\) spin model. Physica A 385, 233–248 (2007)

    Article  ADS  MathSciNet  Google Scholar 

  60. Asmussen, S., Glynn, P.W.: Stochastic Simulation: Algorithms and Analysis. Stochastic Modelling and Applied Probability. Springer, New York (2007)

  61. Squartini, T., de Mol, J., den Hollander, F., Garlaschelli, D.: Breaking of ensemble equivalence in networks. arxiv:1501.00388

  62. Jack, R.L., Sollich, P.: Large deviations and ensembles of trajectories in stochastic models. Prog. Theor. Phys. Suppl. 184, 304–317 (2010)

    Article  ADS  MATH  Google Scholar 

  63. Chetrite, R., Touchette, H.: Nonequilibrium microcanonical and canonical ensembles and their equivalence. Phys. Rev. Lett. 111, 120601 (2013)

    Article  ADS  Google Scholar 

  64. Chetrite, R., Touchette, H.: Nonequilibrium Markov processes conditioned on large deviations. Ann. Inst. Poincaré A (2014)

  65. Evans, R.M.L.: Detailed balance has a counterpart in non-equilibrium steady states. J. Phys. A 38, 293–313 (2005)

    Article  ADS  MATH  MathSciNet  Google Scholar 

  66. Chandler, D., Garrahan, J.P.: Dynamics on the way to forming glass: bubbles in space–time. Ann. Rev. Chem. Phys. 61, 191–217 (2010)

    Article  Google Scholar 

Download references

Acknowledgments

I would like to thank many colleagues who have provided useful ideas, comments, and support during the last 12 years that I worked on long-range systems and nonequivalent ensembles: Julien Barré, Freddy Bouchet, Raphael Chetrite, Thierry Dauxois, Rosemary J. Harris, Michael Kastner, Cesare Nardini, and Stefano Ruffo. I especially want to thank Richard S. Ellis for introducing me to many gems and subtleties of large deviations. The present paper owes much to his work.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hugo Touchette.

Appendices

Appendix 1: Concavity of the Entropy

Let \(s{\text {:}}\,\mathbb {R}\rightarrow \mathbb {R}\cup \{-\infty \}\) be a real function with domain \({\text {dom}}\,s,\) and consider the inequality

$$\begin{aligned} s(v)\le s(u)+\beta (v-u),\quad v\in \mathbb {R}. \end{aligned}$$
(84)

The set of all \(\beta \in \mathbb {R}\) for which this inequality is satisfied is called the subdifferential set or simply the subdifferential of s at u and is denoted by \(\partial s(u).\) Footnote 8 The interpretation of this inequality is shown in Fig. 1: if it is possible to draw a line passing through the graph of \(s(u)\) which is everywhere above s, then \(\partial s(u)\ne \emptyset .\) In this case, we also say that s admits a supporting line at u, which is unique if s is differentiable at u. If \(\partial s(u)=\emptyset ,\) then s admits no supporting line at u.

It is easy to see geometrically that nonconcave points of s do not admit supporting lines, while concave points have supporting lines, except possibly if they lie on the boundary of \({\text {dom}}\,s;\) see Sect. 24 of [43] or Appendix A of [44]. The reason for possibly excluding boundary points arises because \(s(u)\) may have diverging ‘slopes’ where \(\partial s(u)\) is not defined, as in the following example adapted from [43, p. 215]:

$$\begin{aligned} s(u)=\left\{ \begin{array}{lll} \sqrt{1-x^{2}} &{} &{} |x|\le 1\\ -\infty &{} &{} \mathrm{otherwise}. \end{array} \right. \end{aligned}$$
(85)

In this case, \(s(u)=s^{**}(u)\) for all \(u\in {\text {dom}}\,s=[-1,\,1],\) so that s is a concave function, but it has supporting lines only over \((-1,\,1)=\mathrm{int}\ ({\text {dom}}\,s),\) since \(s^{\prime }(u)\) diverges as \(u\rightarrow \pm 1\) from within its domain. All cases of concave points with no supporting lines are of this type, since it can be proved in \(\mathbb {R}\) that

$$\begin{aligned} \mathrm{int}\ ({\text {dom}}\,s)\subseteq {\text {dom}}\,\partial s\subseteq {\text {dom}}\,s; \end{aligned}$$
(86)

see again Sect. 24 of [43] or Appendix A of [44].

With this proviso on boundary points, s is often defined to be strictly concave at u if it admits supporting lines at u that do not touch other points of its graph. If s has a supporting line at u touching other points of its graph, then s is said to be non-strictly concave at s. Finally, if s admits no supporting line at u, then s is said to be nonconcave at u. These definitions are also illustrated in Fig. 1. For generalizations of these definitions to \(\mathbb {R}^{d}\) in terms of supporting hyperplanes, see [43] and Appendix A of [44].

Appendix 2: Varadhan’s Theorem and the Laplace Principle

We recall in this section two important results about Laplace approximations of exponential integrals in general spaces. In the following, \(\{a_{n}\}_{n=1}^{\infty }\) is an increasing sequence such that \(a_{n}\nearrow \infty \) when \(n\rightarrow \infty .\) Moreover, \(\{P_{n}\}_{n=1}^{\infty }\) is a sequence of probability measures defined on a (Polish) space \(\mathcal {X}.\) In this paper, N takes the role of \(a_{n}\) and n.

Theorem 13

(Varadhan, 1966 [42]) Assume that \(P_{n}(dx)\) satisfies the LDP with speed \(a_{n}\) and rate function I on \(\mathcal {X}.\) Let F be a continuous function.

  1. (a)

    (Bounded case) assume that \(\sup _{x} F(x)<\infty .\) Then

    $$\begin{aligned} \lim \limits _{n\rightarrow \infty } \frac{1}{a_{n}} \ln \int \nolimits _{\mathcal {X}} e^{a_{n} F(x)} P_{n}(dx) =\sup \limits _{x\in \mathcal {X}} \{F(x)-I(x)\}<\infty . \end{aligned}$$
    (87)
  2. (b)

    (Unbounded case) assume that F satisfies

    $$\begin{aligned} \lim \limits _{L\rightarrow \infty } \lim \limits _{n\rightarrow \infty } \frac{1}{a_{n}}\ln \int \nolimits _{\{F\ge L\}} e^{a_{n} F(x)} P_{n}(dx)=-\infty . \end{aligned}$$
    (88)

    Then the result of (a) holds and is finite. In particular, if F is bounded above on the support of \(P_{n},\) then (a) holds.

For a proof of this result, see the Appendix B of [34] or Theorem 4.3.1 in [38]. For historical notes on this result, see Sect. 3.7 of [37].

Consider now the exponentially tilted probability measure

$$\begin{aligned} P_{n,F}(dx) = \frac{e^{a_{n} F(x)} P_{n}(dx)}{W_{n,F}}, \end{aligned}$$
(89)

where

$$\begin{aligned} W_{n,F}=\int \nolimits _{\mathcal {X}} e^{a_{n}F(x)} P_{n}(dx)=E_{P}\left[ e^{a_{n}F(X)}\right] . \end{aligned}$$
(90)

This is also known as the exponential family or Esscher transform of \(P_{n}.\)

Theorem 14

(LDP for tilted measures) Assume that \(W_{n,F}<\infty .\) Then \(P_{n,F}\) satisfies the LDP with speed \(a_{n}\) and rate function

$$\begin{aligned} I_{F} (x)=I(x)-F(x)+\lambda _{F}, \end{aligned}$$
(91)

where

$$\begin{aligned} \lambda _{F}=\lim \limits _{n\rightarrow \infty }\frac{1}{a_{n}}\ln W_{n,F}. \end{aligned}$$
(92)

A proof of this result can be found in Theorem 11.7.2 of [34] or by combining Proposition 3.4 and Theorem 9.1 of [35]. A thermodynamic version of this result also appears in Theorem 4.1 of [7].

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Touchette, H. Equivalence and Nonequivalence of Ensembles: Thermodynamic, Macrostate, and Measure Levels. J Stat Phys 159, 987–1016 (2015). https://doi.org/10.1007/s10955-015-1212-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10955-015-1212-2

Keywords

Navigation