Skip to main content
Log in

Does the PBR Theorem Rule out a Statistical Understanding of QM?

  • Published:
Foundations of Physics Aims and scope Submit manuscript

Abstract

The PBR theorem gives insight into how quantum mechanics describes a physical system. This paper explores PBRs’ general result and shows that it does not disallow the ensemble interpretation of quantum mechanics and maintains, as it must, the fundamentally statistical character of quantum mechanics. This is illustrated by drawing an analogy with an ideal gas. An ensemble interpretation of the Schrödinger cat experiment that does not violate the PBR conclusion is also given. The ramifications, limits, and weaknesses of the PBR assumptions, especially in light of lessons learned from Bell’s theorem, are elucidated. It is shown that, if valid, PBRs’ conclusion specifies what type of ensemble interpretations are possible. The PBR conclusion would require a more direct correspondence between the quantum state (e.g., \( \left| {\psi \rangle } \right. \)) and the reality it describes than might otherwise be expected. A simple terminology is introduced to clarify this greater correspondence.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Notes

  1. This interpretation was brought back to prominence and explored in detail by Leslie Ballentine; see Ballentine [1, 2]

  2. Valentini says: “… I think the word 'seismic' is likely to apply to this paper.” E. Reich, Quantum theorem shakes foundations, Nature: News (Nov. 17, 2011).

  3. This is the rule that |ψ(x)|2 is the probability of finding the particle(s) between x and x + dx.

  4. Or, in more complex systems, the Kochen-Specker theorem. The GHZ theorem is due to Daniel M. Greenberger, Michael A. Horne, and Anton Zeilinger.

  5. To understand the details of the technical argument, I recommend the PBR article itself (see [3]) along with the article on the web by Matt Leifer at: http://mattleifer.info/2011/11/20/can-the-quantum-state-be-interpreted-statistically.

  6. These concepts relevant to ontological models were introduced by Harrigan and Spekkens [15]. They say “a hidden variable model is ψ-ontic if every complete physical state or ontic state in the theory is consistent with only one pure quantum state; we call it ψ-epistemic if there exist ontic states that are consistent with more than one pure quantum state.” This language might be a chief driver in losing the larger understanding of statistical theory, for ψ-epistemic models do not constitute the sum total of all statistical models.

  7. So, we might, for small N, choose to call it β as T connotes the statistical meaning that depends on large N and reflects the temperature of physical bodies. (see Appendix 1 for more discussion).

  8. The temperature characterizes the energy binning that is most likely for the system of fixed E and N. The concept of temperature viewed through the parts of a system, i.e. T, is a statistical one; it does not appear until one considers the most probable states.

  9. If we are willing to wait, we can just allow the finite mass of the box to allow the momenta (and thus the energy) to redistribute. We should also note that one must be careful to not to pick a special initial state that will not equilibrate. The need to do so is well known in such statistical analysis.

  10. Lewis et al. [16]. Title changed when published in Phys. Rev: Lewis et al. [17].

  11. As well as the below quote in the text, PBR says, for example: “..then \( \left| {\psi \rangle } \right. \) can justifiably be regarded as ‘mere’ information.”

  12. Another way to read “quantum state” is as: “the physical states prepared according to ψ”; this gives the same problem but has the additional issue of referring to physical reality as “representing” something.

  13. Again, the important general point that arises out of the analysis in the appendix is that, though this is an approximate result: for a given temperature there is always only a single physical value for the total energy establishing the correspondence between temperature and total energy.

  14. This is the classical continuum limit result calculated in the appendix. Using quantum mechanical energy bins, in one dimension one gets: E = kT; in 3D, one gets: E = (3/2)kT.

  15. As an example where the distributions overlap, consider the case of two isolated ideal gases each at the same temperature and pressure, but in boxes of different volumes V1 and V2. The ensembles labeled by V1 and V2 would overlap. Yet, one clearly has some information (but not all the information, and it is clearly a statistical theory).

  16. Said another way, PBR asks whether each distinct ψ refers to a group of distinct physical states or not.

  17. If the μ’s have overlapping support, the situation is said to give “minimal” information; otherwise we say it gives non-minimal information, i.e., more information. In PBR’s language, one says: it is not “mere” information, not epistemic.

  18. This would also be true for two boxes of ideal gas in thermal contact with finite sized heat reservoirs of different temperatures. While the ideal gases would no longer confined to be a certain energy and thus the argument for distinct physical states given for the isolated box would fail, it would not fail for each box + heat − reservoir system as each of these systems is isolated. So including the full reality gets one back to non-overlapping ensembles.

  19. “Painting one particle red” is just a way of indicating that we can distinguish it from the rest and then measure its energy state. Whatever method of distinguishing our chosen particle that we choose, it should not affect the dynamics of the particle at our level of analysis. For example, considering one of the particles to be in an excited states, under certain conditions, would enable us to distinguish it without changing the nature of the system (at our level of analysis).

  20. Remember because probability is a result of ignorance (i.e., it results from ignoring, leaving out, part of a system), we cannot completely specify all elements without ceasing to have probability distributions. PBR say, in the first arXiv version of their paper (see [4]): “If the quantum state is statistical in nature…, then a full specification of λ need not determine the quantum state uniquely.” In fact, if it is statistical in nature, it cannot be uniquely determined.

  21. The following example is a special case of partial overlap in which one is a subset of the other in which not all of its boundary is inside of the other. One could imagine a case where one is completely surrounded by the other, so it is kind of an island (the smaller set) inside of an ocean (the larger set). Even this would still be considered partial overlap as picking a point in the “ocean” will distinguish the larger set.

  22. They will have different “temperatures,” but here we distinguish the macro-states by total energy.

  23. I say “premises” because they can also be considered as assumptions of the theorem though they are implicit, appearing in the form of definitions. They come before the actual stated assumptions.

  24. Including ones not often understood clearly.

  25. Another such example occurs in the GHZ proof. If one recognizes the statistical nature of QM, then one quickly sees that one has to consider several alternative potential measured values simultaneously that are not, in fact, simultaneously measurable. This leads, for example, to the widely recognized need for QM contextuality. Moving to a general point that we have implicitly alluded to before, it is not that “no-go” proofs are not important, but that they often reveal more than their simple conclusions; in particular, their explicit and hidden assumptions reveal a lot. Bell famously said, I think in trying to bring out this point: “…long may Louis de Broglie continue to inspire those who suspect that what is proved by impossibility proofs is lack of imagination” Bell [18].

  26. Similar caveats, at some level, probably also apply to PBR reasoning.

  27. This caveat can be lumped under the title, “detection loophole,” but notice the title is not appropriate in so far as something that is in the nature of quantum mechanics as this mechanism would be is not a “loophole.” Loophole tends to imply one just has not yet found a good enough (efficient enough) detector and if one did then we would see that quantum mechanics is correct in its predictions. In the hypothetical (that seems not to be the case), quantum mechanics would continue to be verified with better and better detectors, because such experiments would keep giving the same results for the detections observed.

  28. Of course, such superluminal travel would have to be unmeasureable to avoid violating special relativity; said another way, to avoid violating special relativity, one must, in principle, not be able send information with the superluminal interaction.

  29. Assuming that one keeps the real world, upon which, it should be obvious, the whole of physics depends.

  30. The preparation independence assumption is discussed in the following: Schlosshauer and Fine [20] and Mansfield [21].

  31. “But our present QM formalism is not purely epistemological; it is a peculiar mixture describing in part realities of Nature, in part incomplete human information about Nature—all scrambled up by Heisenberg and Bohr into an omelette that nobody has seen how to unscramble. Yet we think that the unscrambling is a prerequisite for any further advance in basic physical theory. For, if we cannot separate the subjective and objective aspects of the formalism, we cannot know what we are talking about; it is just that simple.” E. T. Jaynes, Complexity, Entropy, and the Physics of Information, (ed. by W. H. Zurek) 381 (Addison-Wesley, 1990). This is quoted in Reference [3].

  32. For example, the blossoming field of quantum computing is intimately connected with thought about the nature of entanglement (to which Bell is a primary contributor).

  33. Indeed, we would want to find a formal theory to describe the sub-quantum world.

  34. See footnote [5,6,7] for reference to Physics for Realists: QM in which those insights are joined with others and brought into a coherent whole pregnant with the possibility of more investigation to further deepen our understanding.

  35. Talk titled: Is the end in sight for theoretical physics?, given at his inauguration to the Lucasian Professor of Mathematics at University of Cambridge. Former holders include Newton and Dirac. He said: “I want to discuss the possibility that the goal of theoretical physics might be achieved in the not too distant future, say, by the end of the century. By this I mean that we might have a complete, consistent and unified theory of the physical interactions which would describe all possible observations.”

  36. See “The Last Word in Physics” from Jaki [22] for examples (involving top flight scientists) of this tendency in physics history for scientists to think physics is finished.

  37. PBR also say in their response: “Difficulties with the finite precision of experiments are unavoidable in physics. There will always be an infinite number of theories that predict the same results to within current experimental error. There is always the hope that future experiments will be able to distinguish them, and in the meantime we can use other criteria, such as simplicity, to select a preferred theory. We join with most physicists in preferring quantum theory at present.”

  38. They say: “If there is no collapse, on the other hand, then after a measurement takes place the joint quantum state of the system and measuring apparatus is entangled and contains a component corresponding to each possible macroscopic measurement outcome. This would be unproblematic if the quantum state merely reflected a lack of information about which outcome occurred. However, if the quantum state is a physical property of the system and apparatus, it is hard to avoid the conclusion that each macroscopically different component has a direct counterpart in reality.”

  39. By “collapse,” I mean the general understanding of measurement of a given interpretation. In the ensemble interpretation, there is no collapse in the sense of requiring the reduction of the wave function to an eigenstate of the measured variable. In the context of the ensemble interpretation, the word collapse just refers to the process of discovering what value the variable has for a given measurement.

  40. This is not at all like the hardware store analogy; in that analogy the bolt would be moved somewhere else every time you probed to get information about its location—clearly, this is not a good analogy for QM. A key reason is that the analogy treats the reality of the state as if it were one simple “there or not” there as opposed to many entities that can be in many different states some of which stay the same as the states change. In a complex system, like our ideal gas, changing one element, would force us to say the system had a different λ.

  41. As we expect it would already, before detailed analysis, from our intuition about quantum mechanics.

  42. Of course, |cat+>|atom-e>+|cat->|atom-d> is a different quantum state from both |cat+>|atom-e> and |cat->|atom-d> and so none of these have common physical support, i.e. none of the systems selected from any two of these ensembles will be the same.

  43. Said another way, the physical microstates that correspond to the quantum state (the “macro-state”).

  44. Note that our use of \( \epsilon \), not \( \varepsilon \), here; in the main text of the article, the \( \varepsilon \) was the energy of a given particle in the gas; \( \epsilon \) represents the energy of a given bin when a single particle is in it.

  45. Using, instead, the more accurate (and the one more typically called Sterling’s approximation) \( \ln m! = m\ln m - m \)gives the exact same result as found in the ensuing calculations. This more accurate approximation estimates the factorial within 14 % for ni =10.

  46. So, the probability of finding a particle in the ith cell is: \( p\left({{\epsilon}_{i}} \right)\equiv \frac{{{n}_{i}}}{N}=\frac{{{e}^{-\alpha}}{{e}^{-\beta {{\epsilon}_{i}}}}}{{{e}^{-\alpha}}\sum\limits_{1}^{M}{{{e}^{-{{\epsilon}_{i}}/kT}}}}=\frac{{{e}^{-\beta {{\epsilon}_{i}}}}}{\sum\limits_{1}^{M}{{{e}^{-\beta {{\epsilon}_{i}}}}}} \). This is generalized to continuum case, as: \( p\left(\epsilon \right)\equiv \frac{n\left(\epsilon \right)}{N}=\frac{{{e}^{-\epsilon/kT}}d\epsilon}{\int_{0}^{{{E}_{1}}}{{{e}^{-\epsilon/kT}}d\epsilon}}=\frac{{{e}^{-\epsilon/kT}}d\epsilon}{kT\left(1-{{e}^{-{{E}_{1}}/kT}} \right)} \).

  47. Note, we are characterizing the whole system of N particles, so we focus on T and E1; if we were interested in the individual particle states, we would also note that the average energy of an individual particle is: \( \left\langle E \right\rangle = E_{1}/N \).

References

  1. Ballentine, L.E.: The statistical interpretation of quantum mechanics. Rev. Mod. Phys. 42(4), 358–381 (1970)

    Article  ADS  Google Scholar 

  2. Ballentine, L.E.: Quantum Mechanics: A Modern Development. World Scientific Publishing, Singapore (1998)

    Book  Google Scholar 

  3. Pusey, M.F., Barrett, J., Rudolph, T.: On the reality of the quantum state. Nat. Phys. 8, 475–478 (2012)

    Article  Google Scholar 

  4. Pusey, M.F., Barrett, J., Rudolph, T.: arXiv:1111.3328 [quant-ph], Nov 14, 2011

  5. Rizzi, A.: The Science Before Science, (SBS). IAP Press, Baton Rouge (2004)

    Google Scholar 

  6. Rizzi, A.: The Meaning of Bell’s Theorem, arXiv:quant-ph/0310098v1

  7. Rizzi, A.: Physics for Realists: Quantum Mechanics. IAP Press, Rochester, NY (2018)

    Google Scholar 

  8. Pearle, P.: In: Struppa, Tollaksen (eds.) Collapse Miscellany in Quantum Theory: A Two-Time Success Story, p. 131. Springer, New York, NY (2014)

  9. Penrose, : The Road to Reality. Alfred A. Knopf, New York, NY (2004)

    Google Scholar 

  10. Valentini, A.: Inflationary Cosmology as a Probe of Primordial Quantum Mechanics, p. 1–44, arXiv:hep-th0805.0163v2. Sep 9, 2010

  11. Valentini, A.: Cosmological Data Hint at a Level of Physics Underlying Quantum Mechanics, Scientific American, Nov 2013

  12. Valentini, A.: Signal-locality, uncertainty, and the subquantum H-theorem. I. Phys. Lett. A 156(1), 5–11 (1991)

    Article  ADS  MathSciNet  Google Scholar 

  13. Valentini, A.: Signal-locality, uncertainty, and the subquantum H-theorem. II. Phys. Lett. A 158, 1–8 (1991)

    Article  ADS  MathSciNet  Google Scholar 

  14. Colin, S., Valentini, A.: Mechanism for the suppression of quantum noise at large scales on expanding space. Phys. Rev. D 88, 103515 (2013)

    Article  ADS  Google Scholar 

  15. Harrigan, N., Spekkens Einstein, R.W.: Incompleteness, and the epistemic view of quantum states. Found. Phys. 40, 125 (2010)

    Article  ADS  MathSciNet  Google Scholar 

  16. Lewis, P.G., Jennings, D., Barrett, J., Rudolph, T.: The quantum state can be interpreted statistically. arXiv: 1201.6554v1 [quant-ph], Jan 31, 2012

  17. Lewis, P., Jennings, D., Barrett, J., Rudolph, T.: Distinct quantum states can be compatible with a single state of reality. Phys. Rev. Lett. 109, 150404 (2012)

    Article  ADS  Google Scholar 

  18. Bell, J.S.: On the impossible pilot wave. Found. Phys. 10, 989 (1982)

    Article  ADS  MathSciNet  Google Scholar 

  19. Hensen, B., et al.: Experimental loophole-free violation of a Bell inequality using entangled electron spins separated by 1.3 km. Nature 526, 682–686 (2015)

    Article  ADS  Google Scholar 

  20. Schlosshauer, M., Fine, A.: No-go theorem for the composition of quantum systems. Phys. Rev. Lett. 112, 070407 (2014)

    Article  ADS  Google Scholar 

  21. Mansfield, S.: Reality of the quantum state: towards a stronger ψ-ontology theorem. Phys. Rev. A 94, 042124 (2016)

    Article  ADS  MathSciNet  Google Scholar 

  22. Jaki, S.: Patterns or Principles and Other Essays. Intercollegiate Studies Institute, Delaware (1995)

    Google Scholar 

  23. Halataei, S.M.H.: Testing the reality of the quantum state. Nat. Phys. 10, 174 (2014)

    Article  Google Scholar 

  24. Kastner, R.E.: Why quantum theory isn’t a shell game (PBR Theorem for the layperson). https://transactionalinterpretation.org/2014/06/14/why-quantum-theory-isnt-a-shell-game-pbr-theorem-for-the-layperson/

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Anthony Rizzi.

Appendices

Appendix 1: Derivation of Statistics of Ideal Gas (N Particles, Total Energy = E 1 at Temperature T)

(Note, on a glance, this title and the appendix itself might look like simply a standard derivation. However, it is not. It is written to address the issues raised by the PBR theorem about the nature of the statistical analysis of a physical system. Parts of the calculation are standard; however, it is approached and analyzed from a wholly unique perspective that, as mentioned in the text, affects the understanding, direction and final output of the calculation. Even the standard parts, in addition to being integral to the presentation, are needed to enable reflection on (and reference to) the key aspects of a statistical description of a physical system.)

As is evident from Eq. (1), there are many ways to assign the energy of each particle so as to get the total energy E1, indeed infinitely many ways. To avoid the infinities, start with energy bins of finite size; later, we can take the continuum limit. Divide the energy axis from zero to E1 into M bins, and label the energy of a bin by the energy of the middle of the bin. Thus, the ith bin has energy \( {{\epsilon}_{i}} \).Footnote 44 We take M to be very large and we take \( N \gg M \). Now, taking ni to be the number of particles in the ith bin, the energy and particle number constraint are:

$$ \sum\limits_{i=1}^{M}{{{n}_{i}}{{\epsilon}_{i}}}=E,\quad \sum\limits_{i=1}^{M}{{{n}_{i}}}=N $$
(7)

Any possible way of arranging the particles that satisfies these constraints is a possible physical state for the gas of temperature T. We call each possible state a microstate. However, we divide these states up into classes of intermediate states. These intermediate states are simply defined by how we bin the particles. We specify the binning by listing the number of particles in each energy state. Thus, each intermediate state is defined by a list that looks like: \( \{n_{1},n_{2},n_{3} \ldots \} \equiv \{n_{i} \} \). Clearly, there are many different arrangements of particles that can satisfy any given binning assignment. Indeed, combinatorics shows that the number of microstates, \( \varOmega \), in a given intermediate state, \( \lambda \equiv \{{{n}_{i}}\} \) is given by \( \varOmega \left({\{n_{i} \}} \right) \equiv \frac{N!}{{n_{1} !n_{2} !n_{3} ! \ldots n_{M} !}} \). The differences between the various microstates that make up an intermediate state are ignored. After all, the difference between microstates within one intermediate state just arises by switching the particles with each other. One can change which particles are where without changing the number in each bin. For example, we could switch particle 2400 in \( n_{1} \) with particle 22 which is currently in \( n_{49} \)without affecting the number of particles in each bin. This makes no difference, because, at our level of analysis, a particle with a given energy acts the same as another particle with the same energy. Thus, an ensemble of microstates makes up an intermediate state. We will not attempt to completely specify this ensemble, but we will need the number of states in it, i.e. \( \varOmega \left({\{n_{i} \}} \right) \) .

Now, many intermediate states can be assigned to the same macro-state, i.e. something evident on ordinary human scales. That is, the differences between the various intermediate states are not evident on that ordinary scale. Hence, we can consider an ensemble of intermediate states to represent a given macro-state, such as our gas with energy E. When we look in further detail at a given macro-state, we can find any member of this ensemble. This is what triggered our trek to understand the statistics of an ideal gas. But, we are not yet done.

Which member of the ensemble is most likely to be picked. Answer: the one with the most number of microstates associated with it. Thus, we need to maximize \( \varOmega \left({\{n_{i} \}} \right) \)with respect to \( \{n_{i} \} \). Because an additive quantity is easier to work with than a muliplicative one, we define the entropy, S as:

$$ S \equiv k\ln \varOmega \left({\{n_{i} \}} \right) $$
(8)

Now, using Lagrange multipliers to incorporate the constraints from Eq. (7), our maximization condition (δS = \( \sum\nolimits_{i = 1}^{M} {\frac{\delta S}{{\delta n_{i}}}\delta n_{i}} \)) becomes:

$$ \delta S=k\delta \left(\ln N!-\ln {{n}_{1}}!-\ln {{n}_{2}}! \ldots .-\ln {{n}_{M}}!-\alpha \sum\limits_{i=1}^{M}{{{n}_{i}}}-\beta \sum\limits_{i=1}^{M}{{{n}_{i}}}{{\epsilon}_{i}} \right)=0. $$
(9)

Using Sterling’s approximation (\( \ln m! \approx m\ln m \), for large m)Footnote 45 by taking \( n_{i} \) large and \( \delta N = \delta \sum\limits_{i = 1}^{M} {n_{i}} = \sum\limits_{i = 1}^{M} {\delta n_{i}} = 0 \), \( \delta E=\delta \sum\limits_{i=1}^{M}{{{n}_{i}}}{{\epsilon}_{i}}=\sum\limits_{i=1}^{M}{{{\epsilon}_{i}}\delta {{n}_{i}}}=0 \) gives:

$$ \delta S=k\delta \left(N\ln N-\sum\limits_{i=1}^{M}{{{n}_{i}}\ln {{n}_{i}}}-\alpha \sum\limits_{i=1}^{M}{{{n}_{i}}}-\beta \sum\limits_{i=1}^{M}{{{n}_{i}}}{{\epsilon}_{i}} \right)=0 $$
(10)

so: \( \left(-\sum\limits_{i=1}^{M}{\delta {{n}_{i}}\ln {{n}_{i}}}-\sum\limits_{i=1}^{M}{\delta {{n}_{i}}}-\alpha \sum\limits_{i=1}^{M}{\delta {{n}_{i}}}-\beta \sum\limits_{i=1}^{M}{\delta {{n}_{i}}}{{\epsilon}_{i}} \right)=0 \).

$$ \left(\sum\limits_{i=1}^{M}{\delta {{n}_{i}}\ln {{n}_{i}}}+\sum\limits_{i=1}^{M}{\delta {{n}_{i}}}+\alpha \sum\limits_{i=1}^{M}{\delta {{n}_{i}}}+\beta \sum\limits_{i=1}^{M}{\delta {{n}_{i}}}{{\epsilon}_{i}} \right)=\sum\limits_{i=1}^{M}{\left(\alpha +\beta {{\epsilon}_{i}}+\ln {{n}_{i}} \right)\delta {{n}_{i}}}=0. $$

So, we get: \( \alpha +\beta {{\epsilon}_{i}}+\ln {{n}_{i}}=0 \), or Footnote 46:

$$ {{n}_{i}}={{e}^{-\alpha}}{{e}^{-\beta {{\epsilon}_{i}}}} $$
(11)

(Note: if we took the more accurate version of Sterling’s approximation, i.e. \( \ln m! \approx m\ln m - m + \frac{1}{2}\ln 2\pi m \), we would get: \( {{n}_{i}}{{e}^{\frac{1}{4\pi {{n}_{i}}}}}={{e}^{-\alpha}}{{e}^{-\beta {{\epsilon}_{i}}}} \). This is actually M equations, one for each of the \( n_{i} \)’s. We take \( {{\epsilon}_{i}}\equiv i\,{{E}_{1}}/M \)and we also take M to be fixed. \( a \equiv e^{- \alpha} \) is a normalization factor that is fixed by the number constraint: \( N=\sum\nolimits_{1}^{M}{{{n}_{i}}} \). We also have the energy constraint: \( {{E}_{1}}=\sum\nolimits_{1}^{M}{{{n}_{i}}{{\epsilon}_{i}}} \). So, we have \( M + 2 \) equations and \( M + 2 \)unknowns (M ni’s, a, β and E1). In theory, we could solve the system of equations iteratively as follows. For a given, \( n_{i} \), say \( n_{1} \), we substituting chosen values for M and E1 and, we pick a trial a and a trial β and numerically find ni (say by graphical methods). We, then, do the same to find all the ni’s. These should satisfy the number and energy constraints. If they do not we iterate our choices for a and β until the result comes out correctly.

The important point here is that for fixed E1, we have a single β. That is, there is, even for low N, a one to one correspondence the total system energy and β (note: β characterizes the highest probability energy-binning state and will become 1/kT in the large N limit.)

Now, we’d like to take the continuum limit of the bins, which involves \( M \to \infty \). To make this easier, shift the sums above to sums from \( i = 0 \) to \( i = M - 1 \). This means, applying our system of binning (which could have been anything up till now) that we gave above, we get: \( {{\epsilon}_{i}}=i{{E}_{1}}/M \). We thus have:

$$ n_{i} = {e^{- \alpha}} e^{{- \beta iE_{1}/M}},so \, N = e^{- \alpha} \sum\nolimits_{i = 0}^{M - 1} {e^{{- \beta iE_{1}/M}}}. $$
(12)

where we take \( a={{e}^{-\alpha}} \). Then, noting that the geometric series partial sum is: \( \sum\nolimits_{i = 0}^{M - 1} {ar^{i}} = a\left({\frac{{1 - r^{M}}}{1 - r}} \right) \), taking \( r = e^{{- \beta E_{1}/M}} \), we get: \( N = a\left({\frac{{1 - e^{{- \beta E_{1}}}}}{{1 - e^{{- \beta E_{1}/M}}}}} \right) \); this implies: \( a = N\left({\frac{{1 - e^{{- \beta E_{1}/M}}}}{{1 - e^{{- \beta E_{1}}}}}} \right) \). Note how:

$$ n_{i} = \mathop {{\rm lim}}\limits_{M \to \infty} ae^{{- \beta iE_{1}/M}} \sim \mathop {{\rm lim}}\limits_{M \to \infty} N\left({\frac{{1 - \left({1 - \beta E_{1}/M} \right)}}{{1 - e^{{- \beta E_{1}}}}}} \right)e^{{- \beta iE_{1}/M}} = \mathop {{\rm lim}}\limits_{M \to \infty} \frac{N}{M}\left({\frac{{\beta E_{1}}}{{1 - e^{{- \beta E_{1}}}}}} \right) \to 0. $$

As the bin gets arbitrarily small (i.e., \( M \to \infty \)), this is to be expected as this is the only way that the total number of particles can be finite as the number of bins approaches infinity. But, recalling our earlier requirement, we want \( N \gg M \), so to maintain this (and thus also maintain our approximation that \( n_{i} \)is large), we would have to let N also go to infinity. We want to go to the continuum limit in the energy-binning, but, of course, we do not want to have an infinite number of particles. This leads us to back to the core of the matter; the bin occupation number ni is a way of asking how many particles are in a given energy range (with a fixed number of particles). This fact is implicit in the case of a finite number (M) of bins. We need to be more explicit in the limiting case.

Thus, to take the \( M \to \infty \)limit, we consider, instead of the number of particles in a bin, the number of particles with a given energy in an arbitrarily small region \( \Delta \epsilon \). That is, we define:

$$ \rho \equiv \underset{\Delta \epsilon \to \,0}{\mathop{{\rm lim}}}\,\frac{{{n}_{i}}}{\Delta \epsilon}=\underset{M\to \,\infty}{\mathop{{\rm lim}}}\,\frac{a{{e}^{-\beta i{{E}_{1}}/M}}}{{{E}_{1}}/M}. $$
(13)

Now, calculating the total number of particles, we get, substituting for a and using the Riemann definition of an integral:

$$ \begin{aligned} {\text{Total number of particles }} & = \mathop {\lim }\limits_{M \to \,\infty } \frac{M}{{E_{1} }}\sum\limits_{i = 0}^{M - 1} {ae^{{ - \beta iE_{1} /M}} } \frac{{E_{1} }}{M} \\ & = \mathop {\lim }\limits_{M \to \,\infty } \frac{M}{{E_{1} }}\sum\limits_{i = 0}^{M} {\frac{N}{M}\left( {\frac{{\beta E_{1} }}{{1 - e^{{ - \beta E_{1} }} }}} \right)e^{{ - \beta \left( {iE_{1} /M} \right)}} } \frac{{E_{1} }}{M} \\ & = N\left( {\frac{\beta }{{1 - e^{{ - \beta E_{1} }} }}} \right)\int_{0}^{{E_{1} }} {e^{{ - \beta\epsilon \text{}}} d\epsilon\text{}} = N\left( {\frac{\beta }{{1 - e^{{ - \beta E_{1} }} }}} \right)\left( {\frac{{1 - e^{{ - \beta E_{1} }} }}{\beta }} \right) = N \\ \end{aligned} $$
(14)

which confirms our calculation by giving a result of N.

Now, by comparing \( N=\int_{0}^{{{E}_{1}}}{\rho \left(\epsilon \right)}d\epsilon \) with the last line above, we can write the continuum limit of our binning result:

$$ \rho \left(\epsilon \right)=N\left(\frac{\beta}{1-{{e}^{-\beta {{E}_{1}}}}} \right){{e}^{-\beta \epsilon}} $$
(15)

Connecting this to the empirically verified Maxwell Boltzmann distribution validates our assumptions and gives us \( \beta = 1/kT \), so that we have:

$$ \rho \left(\epsilon \right)=\frac{N}{kT}\frac{1}{\left(1-{{e}^{-{{E}_{1}}/kT}} \right)}{{e}^{-\epsilon/kT}} $$
(16)

Note the total energy is:

$$ {{E}_{1}}=\int_{0}^{{{E}_{1}}}{\epsilon \,\rho \left(\epsilon \right)d\epsilon}=\frac{N}{kT}\frac{1}{\left(1-{{e}^{-{{E}_{1}}/kT}} \right)}\int_{0}^{{{E}_{1}}}{\epsilon \,{{e}^{-\epsilon/kT}}d\epsilon}=\frac{N{{E}_{1}}}{\left(1-{{e}^{{{E}_{1}}/kT}} \right)}+NkT $$
(17)

For fixed T, graphing, \( E_{1} \)and \( \frac{{NE_{1}}}{{\left({1 - e^{{E_{1}/kT}}} \right)}} + NkT \) reveals that there is a solution at \( E_{1} = 0 \), which is physically invalid and there is a solution nearFootnote 47

$$ E_{1} = NkT\, {\text{for large}}\,N. $$
(18)

At E1=NkT, \( E_{1} - \left({\frac{{NE_{1}}}{{\left({1 - e^{{E_{1}/kT}}} \right)}} + NkT} \right) = \frac{{N^{2} kT}}{{e^{N} - 1}} \). So, for small N, this solution is far off but for our case of large N, it is a good approximation; in particular, \( \frac{N^{2} kT}{e^{N}-1} \ll {E_{1}} \) implies, for \( N \gg 1, N^{2} \ll \frac{{e^{N} - 1}}{kT}E_{1} \approx e^{N} \left({\frac{{E_{1}}}{kT}} \right) \) or \( {{N}^{2}}{{e}^{-N}} \ll \left(\frac{{{E}_{1}}}{kT} \right) \). For macroscopic values, i.e. \( N \approx 6.02 \times 10^{23} \approx 10^{24} \), the log of the left hand side is: \( 2\ln N - N \approx - 10^{24} \), so \( N^{2} e^{- N} \approx 10^{{- 10^{24}}} \), an incredibly small number. Hence, only when the temperature is extremely large or the total system energy is extremely small is this solution not approximately valid.

As mentioned in the text of the article, the important general point is that: for an isolated idea gas (as described in text) at given a temperature, there is a single value for the total energy, establishing the correspondence between temperature and total energy.

Appendix 2 (Imposing a Different Binning Requirement)

Let’s see what happens if we let our system of total energy E1 be such that no single particle can have an energy less than \( \varepsilon_{0} \). We proceed as above.

To take the limit of \( M \to \infty \), we first apply this different binning to get: \( {{\epsilon}_{i}}={{\varepsilon}_{0}}+i\left({{E}_{1}}-{{\varepsilon}_{0}} \right)/M \). We thus have, using equation \( {{n}_{i}}={{e}^{-\alpha}}{{e}^{-\beta {{\epsilon}_{i}}}} \):

$$ n_{i} = e^{- \alpha} e^{{- \beta \left({\varepsilon_{0} + i\left({E_{1} - \varepsilon_{0}} \right)/M} \right)}} = e^{{- \left({\alpha + \beta \varepsilon_{0}} \right)}} e^{{- \beta i\left({E_{1} - \varepsilon_{0}} \right)/M}}, $$
(19)

so \( N=a\sum\nolimits_{i=0}^{M-1}{{{e}^{-i\beta \left({{E}_{1}}-{{\varepsilon}_{0}} \right)/M}}} \), where we take \( a = e^{{- \left({\alpha + \beta \varepsilon_{0}} \right)}} \).

Then, noting that the geometric series partial sum is: \( \sum\nolimits_{i=0}^{M-1}{a{{r}^{i}}}=a\left(\frac{1-{{r}^{M}}}{1-r} \right) \), taking \( r = e^{{- \beta \left({E_{1} - \varepsilon_{0}} \right)/M}} \)and, we get: \( N = a\left({\frac{{1 - e^{{- \beta \left({E_{1} - \varepsilon_{0}} \right)}}}}{{1 - e^{{- \beta \left({E_{1} - \varepsilon_{0}} \right)/M}}}}} \right) \); this implies: \( a = N\left({\frac{{1 - e^{{- \beta \left({E_{1} - \varepsilon_{0}} \right)/M}}}}{{1 - e^{{- \beta \left({E_{1} - \varepsilon_{0}} \right)}}}}} \right) \).

Note how:

$$ \begin{aligned} n_{i} & = \mathop {\lim }\limits_{M \to \infty } ae^{{ - \beta iE_{1} /M}} \sim \mathop {\lim }\limits_{M \to \infty } N\left( {\frac{{1 - \left( {1 - \beta E_{1} /M} \right)}}{{1 - e^{{ - \beta E_{1} }} }}} \right)e^{{ - \beta iE_{1} /M}}\\ &\quad =\mathop {\lim }\limits_{M \to \infty } \frac{N}{M}\left( {\frac{{\beta E_{1} }}{{1 - e^{{ - \beta E_{1} }} }}} \right)e^{{ - \beta iE_{1} /M}} \to 0 \\ n_{i} & = \mathop {\lim }\limits_{M \to \infty } ae^{{ - \beta i\left( {E_{1} - \varepsilon_{0} } \right)/M}} \sim \mathop {\lim }\limits_{M \to \infty } N\left( {\frac{{1 - \left( {1 - \beta \left( {E_{1} - \varepsilon_{0} } \right)/M} \right)}}{{1 - e^{{ - \beta \left( {E_{1} - \varepsilon_{0} } \right)}} }}} \right)e^{{ - \beta i\left( {E_{1} - \varepsilon_{0} } \right)/M}} \\ & = \mathop {\lim }\limits_{M \to \infty } \frac{N}{M}\left( {\frac{{\beta \left( {E_{1} - \varepsilon_{0} } \right)}}{{1 - e^{{ - \beta \left( {E_{1} - \varepsilon_{0} } \right)}} }}} \right)e^{{ - \beta i\left( {E_{1} - \varepsilon_{0} } \right)/M}} \to 0 \\ \end{aligned} $$

As the bin gets arbitrarily small (i.e., \( M \to \infty \)), this is to be expected as this is the only way that the total number of particles can be finite as the number of bins approaches infinity. As above, to take the \( M \to \infty \)limit, we thus consider instead of the number of particles in a bin, the number of particles with a given energy in an arbitrarily small region \( \Delta \epsilon \). That is, we define, using \( n_{i} = ae^{{- \beta i\left({E_{1} - \varepsilon_{0}} \right)/M}} \):

$$ \rho \equiv \underset{\Delta \epsilon \to \,0}{\mathop{{\rm lim}}}\,\frac{{{n}_{i}}}{\Delta \epsilon}=\underset{M\to \,\infty}{\mathop{{\rm lim}}}\,\frac{a{{e}^{-\beta i\left({{E}_{1}}-{{\varepsilon}_{0}} \right)/M}}}{\left({{E}_{1}}-{{\varepsilon}_{0}} \right)/M}. $$
(20)

Now, calculating the total number of particles, we get, substituting for a and using the Riemann definition of an integral:

$$ \begin{aligned} {\text{Total number of particles }} & = \mathop {\lim }\limits_{M \to \,\infty } \frac{M}{{\left( {E_{1} - \varepsilon_{0} } \right)}}\sum\limits_{i = 0}^{M - 1} {ae^{{ - \beta i\left( {E_{1} - \varepsilon_{0} } \right)/M}} } \frac{{\left( {E_{1} - \varepsilon_{0} } \right)}}{M} \\ & = \mathop {\lim }\limits_{M \to \,\infty } \frac{M}{{\left( {E_{1} - \varepsilon_{0} } \right)}}\sum\limits_{i = 0}^{M - 1} {\frac{N}{M}\left( {\frac{{\beta \left( {E_{1} - \varepsilon_{0} } \right)}}{{1 - e^{{ - \beta \left( {E_{1} - \varepsilon_{0} } \right)}} }}} \right)}\\&\qquad \times \,{e^{{ - \beta i\left( {E_{1} - \varepsilon_{0} } \right)/M}} } \frac{{\left( {E_{1} - \varepsilon_{0} } \right)}}{M} \\ & = N\left( {\frac{\beta }{{1 - e^{{ - \beta \left( {E_{1} - \varepsilon_{0} } \right)}} }}} \right)\int_{0}^{{E_{1} - \varepsilon_{0} }} {e^{{ - \beta\epsilon \text{}}} d\epsilon\text{}}\\ & = N\left( {\frac{\beta }{{1 - e^{{ - \beta \left( {E_{1} - \varepsilon_{0} } \right)}} }}} \right)\int_{{\varepsilon_{0} }}^{{E_{1} }} {e^{{ - \beta \left( {\text{} \epsilon - \varepsilon_{0} } \right)}} d\epsilon\text{}} \\ & = N\left( {\frac{\beta }{{1 - e^{{ - \beta \left( {E_{1} - \varepsilon_{0} } \right)}} }}} \right)\left( {\frac{{1 - e^{{ - \beta \left( {E_{1} - \varepsilon_{0} } \right)}} }}{\beta }} \right) = N \\ \end{aligned} $$
(21)

which confirms our calculation by giving a result of N.

Now, by comparing \( N=\int_{0}^{{{E}_{1}}}{\rho \left(\epsilon \right)}d\epsilon \)with the last line above, we can write the continuum limit of our binning result, using \( \beta = 1/kT \):

$$ \rho \left(\epsilon \right)=\frac{N}{kT}\left(\frac{1}{1-{{e}^{-\left({{E}_{1}}-{{\varepsilon}_{0}} \right)/kT}}} \right){{e}^{-\left(\epsilon -{{\varepsilon}_{0}} \right)/kT}} $$
(22)

Note the total energy is:

$$ E_{1} = \int_{\varepsilon_{0}}^{{E_{1} }} {\epsilon \,\rho \left( \epsilon \right)d\epsilon } = \frac{N}{{kT}}\frac{1}{{\left( {1 - e^{{ {-(E_{1} - \varepsilon_{0} )} /kT}} } \right)}}\int_{\varepsilon_{0}}^{{E_{1} }} {\epsilon e^{{ - (\epsilon-{\varepsilon_{0}} /kT}} d\epsilon } = \frac{{N\left( {E_{1} - \varepsilon_{0} e^{{\left( {E_{1} - \varepsilon_{0} } \right)/kT}} } \right)}}{{\left( {1 - e^{{\left( {E_{1} - \varepsilon_{0} } \right)/kT}} } \right)}} + NkT $$
(23)

For fixed T, graphing, E1 and \( \frac{{N\left( {E_{1} - \varepsilon_{0} e^{{\left( {E_{1} - \varepsilon_{0} } \right)/kT}} } \right)}}{{\left( {1 - e^{{\left( {E_{1} - \varepsilon_{0} } \right)/kT}} } \right)}} + NkT \) reveals that there is a solution at \( {{E}_{1}}={{\varepsilon}_{0}} \), which is physically invalid and there is, again, a solution near:

$$ E_{1} = NkT + \varepsilon_{0} \, {\text{for large}}\,N. $$
(24)

At \( {{E}_{1}}=NkT+{{\varepsilon}_{0}} \), \( E_{1} - \left({\frac{{N\left({E_{1} - \varepsilon_{0} e^{{\left({E_{1} - \varepsilon_{0}} \right)/kT}}} \right)}}{{\left({1 - e^{{\left({E_{1} - \varepsilon_{0}} \right)/kT}}} \right)}} + NkT} \right) = \frac{{N(N(\varepsilon_{0} + kT) - \varepsilon_{0})}}{{e^{{N\left({\varepsilon_{0}/kT + 1} \right) - \varepsilon_{0}/kT}} - 1}} \). So, for small N, this solution is far off but for our case of large N, it is a good approximation; in particular, \( \frac{{N(N(\varepsilon_{0} + kT) - \varepsilon_{0})}}{{\,e^{{N\left({\varepsilon_{0}/kT + 1} \right) - \varepsilon_{0}/kT}} - 1}} < < E_{1} \) implies, for \( N\gg 1 \), \( {{N}^{2}}\ll {{E}_{1}}\frac{{{e}^{N\left({{\varepsilon}_{0}}/kT+1 \right)}}-1}{({{\varepsilon}_{0}}+kT)}\approx {{E}_{1}}\frac{{{e}^{N\left({{\varepsilon}_{0}}/kT+1 \right)}}}{({{\varepsilon}_{0}}+kT)} \), or \( N^{2} e^{{- N\left({\varepsilon_{0}/kT + 1} \right)}} \ll \frac{{E_{1}}}{{(\varepsilon_{0} + kT)}} \),

Hence, for macroscopic size N, as before, only when the temperature is extremely large or the total system energy is extremely small is this solution not approximately valid. (Note: for small, i.e., \( \varepsilon_{0} \ll kT \), the inequality becomes: \( N^{2} e^{- N} \ll \frac{{E_{1}}}{kT} \), i.e., the same as before. For \( \varepsilon_{0} > > kT \), we have\( N^{2} e^{{- N\varepsilon_{0}/kT}} < < \frac{{E_{1}}}{{\varepsilon_{0}}} \); the smallest the right hand side can become is one since \( E_{1} \ge \varepsilon_{0} \).)

Again, a key general point is that for a given temperature, there is always a single value of E1 corresponding to it for any fixed N.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Rizzi, A. Does the PBR Theorem Rule out a Statistical Understanding of QM?. Found Phys 48, 1770–1793 (2018). https://doi.org/10.1007/s10701-018-0225-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10701-018-0225-5

Keywords

Navigation