1 Introduction

Among the most famous thought experiments are the experiments on the consistency check of Heisenberg’s uncertainty principle (HUP) proposed in the Einstein-Bohr debate. It is the aim of the present note to reformulate Bohr’s response to Einstein’s photon box thought experiment at the Solvay conference 1930 [1] and to discuss other possible responses and related proposals. The focus is on the physical arguments and not on the historical context. The theoretical framework of the Einstein-Bohr-debate is not quantum mechanics but classical statistical physics augmented by the postulate that only statistical ensembles are admitted that satisfy the HUP. We will refer to this modified classical theory as “semi-classical” without using this term in a technical sense of “first order in \(\hbar\)” or the like. The Einstein-Bohr debate (at least its may be un-historical reconstruction considered here) is about the consistency of such a theory: Is it possible to construct a measurement scheme within this theory that disproves the HUP? Einstein’s photon box thought experiment is hence viewed as a proposal to prepare an ensemble of photons such that the spread of the emission times δt and the spread of the energies δE can both be made arbitrarily small. Bohr’s response will be reconstructed as a detailed analysis of the proposed preparation and measurement scheme showing that, on the contrary, the HUP of the form \(\delta t \delta E\gtrsim h\) is nevertheless satisfied. Typically, the analysis need not be too precise; it would suffice to obtain order of magnitude estimates corresponding to the vagueness of the statement \(\delta t \delta E\gtrsim h\), which should be read as “the product of δt and δE is bounded from below by a non-zero quantity of order of magnitude h”.

While Bohr’s general line of defense, including appeal to gravitational time dilation, is well known, it is difficult (for me) to follow the details of the arguments usually put forward in the literature, and therefore a careful reconstruction seems to be appropriate. This also has a pedagogical aspect: I think the treatment of Einstein’s thought experiments lends itself very well to lectures because various elementary physical arguments are interwoven; from there, a consistent presentation might be useful.

The paper is organized as follows. In Section 2 we present the general setting of the photon box experiment and consider particular versions which we find easier to analyze. After the emission of the photon the box will oscillate about its new rest position with the period T. The second weighing can be replaced by a measurement of the amplitude of the oscillation, see Section 3, i. e., by a position measurement either after the time T/4, see Section 3.1, or after the time T = T/2, see subsection 3.2. In both cases, the energy-time HUP can be derived from the fact that transformation between coordinate time and proper time varies according to uncertainty of the exact motion of the box. It has been noted in the literature that Bohr could have refuted Einstein’s photon box objection without recourse to general relativity [3,4,5]. We treat this issue in Section 4, where we discuss the uncertainty relation devised by P. Busch that holds for energy uncertainty and the opening time of the shutter and a similar estimate based on semiclassical quantum theory. However, the shutter is not decisive; another third version of Einstein’s photon box without shutter can also be shown to satisfy the HUP without the need to employ general relativity, see Section 5. But interestingly, one has to argue with gravitational redshift to explain why the photon increases the weight of the box, see Section 6. We conclude with a Summary in Section 7.

2 Generalities

As mentioned in the Introduction the conceptual framework of the Einstein-Bohr discussion of the counterexamples is a modified classical statistical mechanics, where the modification consists in the fact that only statistical ensembles are admitted which fulfill the HUP. (The treatment of photons is a different topic, but can be left aside here since we will only consider the photon box with the exception of Section 6.) This can be understood as a kind of compromise between the two adversaries: If Bohr insisted on using full quantum mechanics for the discussion, Einstein could complain of a petitio principii, since his counterexamples, after all, question the very consistency of quantum mechanics. Conversely, modified statistical mechanics as a limiting case of quantum mechanics should be acceptable to Bohr, even more so if it is already sufficient to invalidate the counterexamples. Statistical mechanics without HUP, on the other hand, would be unacceptable to Bohr, and also uninteresting for the discussion. The interpretation of the uncertainty thus becomes ambiguous, depending on whether one understands it purely classically or quantum mechanically, but this is not a shortcoming, but a feature of the discussion.

Einstein’s thought experiment consists, in short, in the emission of a photon from a photon box at an arbitrarily sharply defined time and a determination of its energy by a weighing of the box before and after the emission, invoking the famous E = mc2. Bohr’s reply rests on an analysis of the weighing process. The box is suspended from a spring in a gravitational field and its weight is determined by reading the vertical position of the box on a pointer, see Fig. 1. As has been noted several times in the literature [3, 7], Bohr’s original arguments are difficult to follow, but see [8] for a counter-position. By analogy with a balance the weighing is performed by attaching smaller and smaller weighing pieces until the position of the pointer on the box returns to the original position, except for a residual uncertainty δz. It is not entirely clear whether one waits for complete damping of the box’ oscillations, which may involve further energy dissipation that would be difficult to be determined. Moreover, the Bohr weighing process seems to amount to very many position measurements, classically harmless, but quantum mechanically questionable, because additional perturbations of the photon box system would have to be taken into account. Finally, the decisive argument is as follows: If the final measurement phase takes time \(\widetilde {T}\), then the clock in the photon box accumulates a time uncertainty \(\delta \widetilde {T}\) due to gravitational time dilation and position uncertainty δz in the gravitational field. But why does \(\delta \widetilde {T}\) coincide with the uncertainty δt of the emission timing?

Fig. 1
figure 1

A schematic drawing of the photon box, from [2]

To cope with these problems, we will modify the weighing procedure by neglecting damping and calculating the mass loss m from the amplitude of the oscillation of the box after the emission of the photon. Thus we measure the position of the box pointer just before the time τ = 0 of emission and then at a fixed time τ = T/4 (or T/2) after emission. (The time values denoted by τ always refer to the clock installed in the box and T is the period of the oscillation of the box). From the difference of the two position measurements we determine the amplitude of the oscillation of the box and from this the mass loss m corresponding to the photon’s energy E = mc2. At the time immediately after the emission the system is described by a probability distribution in the phase space which satisfies the HUP. After that a classical time evolution with a harmonic oscillator Hamiltonian is assumed, where also the classical uncertainties evolve in such a way that according to Liouville’s theorem the HUP holds unchanged. The final uncertainties δE and δt are interpreted as statistical fluctuations (standard deviations) in an ensemble realized by a large number of repetitions of the single experiment. (Admittedly, this is not the only possible interpretation of the HUP, see the Summary.) Thus, the Einstein photon box experiment (in the present modified version) is understood as a preparation procedure for an ensemble of photons in which both time measurements and energy measurements can be made (but not simultaneously in a single experiment) with the results δE and δt for the statistical fluctuations. We do not have to care about the corresponding quantum mechanical observables (generalized observables in the case of time measurements) since we only do (semi-)classical calculations.

We now consider N measurements of the arrival times of the photons on a screen or detector in the single experiments as the readings t(1),t(2),…,t(N) of a stationary “standard clock”. It is clear that these arrival times alone are not sufficient to determine δt. They have to be referred to “time zero points” for each single experiment. Let \(t_{0}^{(1)}, t_{0}^{(2)},\ldots ,t_{0}^{(N)}\) be these time zero points (again defined as readings of a standard clock), then the statistics of arrival times are calculated from the differences \(t^{(i)} -t_{0}^{(i)}, i=1, ...,N\). For example, the zero points could be defined by signals sent to the standard clock from the apparatus in each single experiment. For the analogous results of energy measurements E(i) the same applies in principle; however, for theoretical reasons one can assume that the results do not depend on time and the specification of \(t_{0}^{(i)}\) will be superfluous. Of course, we cannot accurately calculate the standard deviations δE and δt based on the previous data. However, we can give estimates determined by the uncertainty of the measurements at the photon box. For example, in the semi-classical domain it is very plausible that statistical fluctuations δm from the results of the two weighings (multiplied by the factor c2) is smaller than the standard deviation δE of the photon energy measurement.

The analogous analysis must be performed for the estimation of the statistical fluctuation δt of the arrival times of the photons from the single experiments. In the single experiment, the photon is emitted at time τ = 0. The exact time is uncertain due to the finite opening time of the shutter. However, it is asumed in the following that this opening time can be chosen arbitrarily small (but see the discussion in Section 4). As noted above, it is also necessary to define a time zero for the single experiment and to send it to the standard clock. For this, the emitted photon itself cannot be used. To send a second photon at time τ = 0 would be possible in principle, but would unduly complicate the analysis. We decide to define the time zero point by the very signal that is send from the box to an external experimenter in order to trigger the second position measurement at time τ = T/4 (or T/2). The back reaction of this signal (emission of a second photon) on the box is negligible with respect to the second position measurement, because of the arbitrarily small time difference between these two events.

Thus the problem narrows down to the question why the time difference between τ = 0 and τ = T/4 (or T/2) can vary statistically in such a way that this variation does not become arbitrarily small without influencing other quantities. According to Bohr, the reason is the difference between the readings of the stationary standard clock and the readings of the box clock due to gravitational time dilation.

3 Two versions of Einstein’s box

As indicated in the preceding Section the weighing process will be performed as follows: The box with rest mass M + m, of which a section is shown in Figure 1 is suspended in a spring-balance with spring constant k and is furnished with a pointer to read its position on a scale fixed to the balance support. The z-axis, which represents the vertical scale, is chosen to point upwards and its zero point corresponds to the rest length of the spring, so that at the beginning the position of the pointer z0 < 0 satisfies an equilibrium condition of the simple form

$$- k z_{0}= (M+m) g ,$$
(1)

where g is the gravitational acceleration, assumed to be approximately constant. After the emission of the photon with energy E into the direction chosen as the x-axis the rest mass of the box will be reduced by the amount

$$m = \frac{E}{c^{2}} .$$
(2)

This follows from special relativity by expanding the Minkowski length of the energy-momentum 4-vector

$$\left( \begin{array}{c} M c^{2}\\ 0\\ 0\\ 0 \end{array} \right)+ \left( \begin{array}{c} E\\ E\\ 0\\ 0 \end{array} \right)$$
(3)

of the box after the emission into a Taylor series and taking the lowest linear order w. r. t. \(\frac {m}{M}\). Consequently, we will assume

$$m\ll M$$
(4)

throughout this paper. Although it is unrealistic to assume that the photon will be emitted exactly into a horizontal direction, the consideration of a vertical recoil of the box would only give rise to a further uncertainty proportional to \(\frac {m}{M}\) and can be neglected.

The reduced mass M of the box after the emission of the photon corresponds to a new equilibrium position z1 < 0 of the pointer defined by

$$- k z_{1}= M g ,$$
(5)

such that by (1) we have

$$k (z_{1}-z_{0})= m g .$$
(6)

The box will perform vertical damped harmonic oscillations about the new equilibrium position z1 until it reaches this position after long times. A measurement of z1 after long times would then indirectly give the value of m and hence of E. However, it would be necessary to consider the dissipated energy and its mass equivalent to correct the asymptotic equilibrium position z1.

Here we will follow a different strategy based on the observation that the amplitude of the (undamped) oscillation is z1z0, the starting position being z0, and hence the position z1 will be reached after a quarter period \(\frac {T}{4}\) of the oscillation. Alternatively, the amplitude can be calculated from the position z2 = 2z1z0 of the pointer after a half period \(\frac {T}{2}\). We may use the description of the oscillation according to classical mechanics and thus set

$$T=\frac{2 \pi}{\omega},\quad \text{where } \omega =\sqrt{\frac{k}{M}} .$$
(7)

In this way we will realize the second weighing of the box by a position measurement of the pointer at the time \(\frac {T}{4}\) (first version) or \(\frac {T}{2}\) (second version) after the emission of the photon.

3.1 First version of Einstein’s box

Now we consider the reconstruction of Bohr’s answer corresponding to the first version. He may use the condition that all quantities used do not have sharp values but vary according to the HUP, of course except the quantities t and E under discussion. Usually it will not be necessary to consider all possible uncertainties but to concentrate on the relevant ones that are required to obtain \(\delta t \delta E\gtrsim h\). The other quantities can be treated as if they would have a sharp value. For example, the first weighing of the box can be performed by means of a position measurement leading to z0 with a corresponding uncertainty for M but this will be irrelevant for what follows (but see the Sections 4 and 5).

Obvious candidates for relevant uncertainties are the vertical coordinates zi and the corresponding vertical momenta pi of the box, where the index i = 0,1 refers to the time of emission and of the second position measurement, resp. . These uncertainties can be legitimately assumed to satisfy the HUP:

$$\begin{array}{@{}rcl@{}} \delta z_{0} \delta p_{0}&\gtrsim& h , \end{array}$$
(8)
$$\begin{array}{@{}rcl@{}} \delta z_{1} \delta p_{1}&\gtrsim& h . \end{array}$$
(9)
Fig. 2
figure 2

The oscillation of the photon box can be viewed as a rigid rotation in the phase space with scaled coordinates ωz and \(\frac {p}{M}\). Three events corresponding to the emission of the photon at τ = 0,z = z0 and the second position measurement at τ = T/4,z = z1 or τ = T/2,z = z2 are represented as blue dots on the circle with center (ωz = ωz1,p/M = 0) and radius r = ω(z1z0). The uncertainties of these phase space points are indicated by small red ellipses which also rigidly rotate in phase space. The first ellipse on the far left corresponds to the uncertainty δϕ of the phase angle ϕ such that \({\sin \limits } \delta \phi \approx \frac {\delta p_{0}/M}{\omega (z_{1}-z_{0})}\). The figure corresponds to the first and the second version of Einstein’s box considered in this paper

Since the oscillation of the photon box can be viewed as a rigid rotation in the phase space with scaled coordinates ωz and \(\frac {p}{M}\), see Figure 2, the role of the uncertainties is exchanged after a 90 rotation corresponding to the time evolution between the two mentioned instants of time. More precisely, we conclude without further calculation

$$\begin{array}{@{}rcl@{}} \omega \delta z_{1} &=& \frac{1}{M}\delta p_{0} , \end{array}$$
(10)
$$\begin{array}{@{}rcl@{}} \frac{1}{M}\delta p_{1} &=& \omega \delta z_{0} , \end{array}$$
(11)

in accordance with (8) and (9). This yields a first estimate of the uncertainty of m:

$$\begin{array}{@{}rcl@{}} \delta m &\overset{(6)}{\approx}& \frac{k}{g} \delta z_{1}\overset{(7)}{=}\frac{M\omega^{2}}{g} \delta z_{1} \end{array}$$
(12)
$$\begin{array}{@{}rcl@{}} &\overset{(10)}{=}&\frac{M\omega^{2}}{M\omega g} \delta p_{0}= \frac{\omega}{g} \delta p_{0} . \end{array}$$
(13)

Next we come to the uncertainty δt of the quantity t1. The latter means the time of the emission of the photon as it is measured by a stationary clock. This may be different to the time measured by the moving clock inside the box due to gravitational and special-relativistic time dilatation effects. To choose a particular physical model we consider the Schwarzschild metric, see, e. g., [14], and its linear approximation in the vicinity of the photon box. Then the stationary clock mentioned above can be chosen as a clock at a large distance from the center and hence measuring the Schwarzschild coordinate t to an arbitrary precision. This clock should be used to measure the two times of emission of the photon at τ = 0 and of the second position measurement at τ = T/4 and its difference t1. This has to be distinguished from the proper time difference τ1 = T/4 measured by the moving clock installed in the box. One can, of course, calculate the transformation between τ1 and t1 if the exact motion of the box is known. And here the uncertainty δz0 comes into play. Different motions of the clock with an initial position z0 varying with a latitude of δz0 lead to different values of the coordinate time difference t1 between emission and second position measurement, although the proper time between the two events measured by the moving clock has always the same value \(\tau _{1}=\frac {T}{4}\). This yields the desired value of δt that, hopefully, would satisfy the HUP.

Anticipating the result of the following calculation to be

$$\delta t \approx \frac{g}{c^{2} \omega} \delta z_{0} ,$$
(14)

we conclude the desired result

$$\delta t \delta E \overset{(13,14,2)} {\approx} \left( \frac{g}{c^{2} \omega}\delta z_{0}\right) \left( \frac{c^{2}\omega}{g}\delta p_{0} \right)\overset {(8)}{\gtrsim} h .$$
(15)

In order to derive (14) we consider the Schwarzschild metric [14]

$$d\tau^{2}= \varphi(r) dt^{2}- \frac{1}{c^{2} \varphi(r)} dr^{2} ,$$
(16)

where

$$\varphi(r) =1-\frac{2 G M_{g}}{c^{2} r} ,$$
(17)

G denoting the gravitational constant and Mg, e. g., the earth’s mass if the thought experiment is performed in a terrestrial laboratory

Let r0 be the Schwarzschild radial coordinate of the second equilibrium position corresponding to z1 such that

$$g=\frac{M_{g} G}{{r_{0}^{2}}} .$$
(18)

r0 is approximately the earth’s radius. Solving (16) for \(\frac {dt}{d\tau }\) yields

$$\begin{array}{@{}rcl@{}} \frac{dt}{d\tau}&=&\sqrt{\frac{1}{\varphi(\tau)}+ \frac{1}{\varphi(\tau)^{2}}\frac{v^{2}}{c^{2}}}=\varphi(\tau)^{-1/2}+\varphi(\tau)^{-3/2}\frac{v^{2}}{2 c^{2}}\\ && +O\left( \frac{v^{4}}{c^{4}}\right) . \end{array}$$
(19)

Here we have set \(v=\frac {dr(\tau )}{d\tau }\), assuming vc, for the vertical velocity of the box and abbreviated φ(r(τ)) by φ(τ). The series expansion (19) clearly shows that the time dilatation has two components: The pure gravitational part φ(τ)− 1/2 and the special-relativistic part \(\varphi (\tau )^{-3/2}\frac {v^{2}}{2 c^{2}}\). For the calculation of δt the latter leads to a similar result as the former one but with a factor \(\frac {\pi m}{M}\) smaller and hence can be neglected.

Next we will write φ(τ) = φ(r0 + z(τ)) where z(τ) represents the oscillation of the box after the emission of the photon. A convenient approximation to the correct solution of the equation of motion that would result in general relativity is an oscillation proportional to \({\cos \limits } (\omega \tau )\), as follows from classical mechanics, except for the substitution of the time coordinate for the proper time τ. The amplitude of the oscillation will be written as z1z0 + δz0 corresponding to an initial position of the box at z = z0δz0. This leads to

$$r(\tau)= r_{0}+z(\tau)= r_{0}+\left( z_{0}-z_{1}-\delta z_{0} \right) \cos (\omega \tau ) ,$$
(20)

taking into account the conditions r(T/4) = r0 and \(r(0)= r_{0}+\left (z_{0}-z_{1}-\delta z_{0} \right )\). Then the gravitational time dilation factor φ(τ)− 1/2 can be further approximated by

$$\begin{array}{@{}rcl@{}} \varphi(\tau)^{-1/2}&=&\frac{1}{\sqrt{1-\frac{2 g {r_{0}^{2}}}{c^{2} \left( r_{0}+z(\tau)\right)}}} \end{array}$$
(21)
$$\begin{array}{@{}rcl@{}} &\approx& 1+\frac{g r_{0}}{c^{2} \left( 1+z(\tau)/r_{0}\right)} \end{array}$$
(22)
$$\begin{array}{@{}rcl@{}} &\approx& 1+\frac{g}{c^{2}}(r_{0}-z(\tau)) , \end{array}$$
(23)

using that z(τ) ≪ r0 in (23) and \(\frac {2 g r_{0}}{c^{2}}=\frac {2 G M_{g}}{c^{2} r_{0}}\approx 1.4\times 10^{-9}\ll 1\) in (22). The coordinate time difference t1 is obtained by the integration

$$\begin{array}{@{}rcl@{}} t_{1}&=& {\int}_{0}^{\frac{T}{4}}\frac{dt}{d\tau} d\tau\overset{(7,19)}{\approx} {\int}_{0}^{\frac{\pi}{2\omega}}\varphi(\tau)^{-1/2} d\tau \end{array}$$
(24)
$$\begin{array}{@{}rcl@{}} &\overset{(20,23)}{\approx} & {\int}_{0}^{\frac{\pi}{2\omega}} \left( 1+ \frac{g}{c^{2}}\left( r_{0}+(z_{1}-z_{0}+\delta z_{0})\cos \omega \tau \right) \right) d\tau . \end{array}$$
(25)

δt is the part of (25) that is proportional to δz0 and hence

$$\delta t \approx {\int}_{0}^{\frac{\pi}{2\omega}}\delta z_{0} \frac{g}{c^{2}}\cos \omega\tau d\tau=\frac{g}{c^{2} \omega} \delta z_{0} ,$$
(26)

which proves (14).

3.2 Second version of Einstein’s box

It will be instructive to discuss further possible objections to the above reconstruction of Bohr’s response to Einsteins’s box example. The choice of τ1 = T/4 for the proper time difference between the opening of the shutter and the (second) position measurement of the pointer attached to the box is not the only possibility. Another choice would be τ2 = T/2 corresponding to the proper time between two extremal values of z(τ), the minimal value z(0) = z0 and the maximal one z(τ2) = z2 such that z2z0 = 2(z1z0) (ignoring for the moment the uncertainty δz0). This alternative choice has two consequences:

  1. 1.

    After a rotation of the harmonic oscillator’s phase space with the angle 180 the uncertainties δz and δp do not swap but assume their original values, see Fig. 2, i. e., (10-11) has to be replaced by

    $$\begin{array}{@{}rcl@{}} \delta z_{2} &=& \delta z_{0} , \end{array}$$
    (27)
    $$\begin{array}{@{}rcl@{}} \delta p_{2} &=& \delta p_{0} . \end{array}$$
    (28)

    This blocks the arguments leading to (13) and (15).

  2. 2.

    The integral analogous to (26) would vanish if extended over the interval \([0, \frac {\pi }{\omega }]\) and hence only the special-relativistic time dilation remains that is by a factor m/M too small. Thus the analogous derivation of (14) would completely break down.

Fortunately, both problems can be solved at the same time, by “killing two birds with one stone”: The uncertainty of the initial position of the box system in phase space need not be of “horizontal type” only, leading to an uncertainty δz0, but could also be of “vertical type” to be described by an uncertainty δp0 of the initial momentum. This corresponds to an uncertainty δϕ of the initial angle variable ϕ in phase space given by

$$\delta \phi \approx \arcsin \frac{\delta p_{0}}{M \omega (z_{1}-z_{0})} ,$$
(29)

see Fig. 2. Hence (18) has to be replaced by

$$r(\tau)=r_{0}+ z(\tau) \equiv r_{0}+ (z_{0}-z_{1}) \cos\left( \omega \tau - \delta \phi\right) ,$$
(30)

and the integral (25) by

$$\begin{array}{@{}rcl@{}} t_{2}&=&{\int}_{0}^{\frac{\pi}{\omega}}\varphi(\tau)^{-1/2} d\tau \end{array}$$
(31)
$$\begin{array}{@{}rcl@{}} &\approx& {\int}_{0}^{\frac{\pi}{\omega}}\left( 1+\frac{g}{c^{2}}(z_{0}-z_{1})\cos\left( \omega \tau - \delta \phi\right) \right) d\tau \end{array}$$
(32)
$$\begin{array}{@{}rcl@{}} &=&\frac{\pi}{\omega}-\frac{2 g (z_{0}-z_{1})}{\omega c^{2}} \sin\delta \phi \end{array}$$
(33)
$$\begin{array}{@{}rcl@{}} &\overset{(30)}{\approx}& \frac{\pi}{\omega}+\frac{2 g \delta p_{0}}{M \omega^{2} c^{2}} . \end{array}$$
(34)

The latter yields

$$\delta t \approx \frac{2 g}{M \omega^{2} c^{2}}\delta p_{0} .$$
(35)

Next note that instead of (6) we now have

$$k(z_{2}-z_{0})=2 g m ,$$
(36)

and hence

$$\delta E = c^{2} \delta m \approx c^{2} \frac{k}{2 g}\delta z_{2} \overset{(28)}{=} \frac{c^{2} k}{2 g}\delta z_{0} .$$
(37)

Finally,

$$\begin{array}{@{}rcl@{}} \delta E \delta t & \overset{(38,36)}{\approx}& \frac{ c^{2} k}{2 g}\delta z_{0} \frac{2 g}{M \omega^{2} c^{2}}\delta p_{0} \end{array}$$
(38)
$$\begin{array}{@{}rcl@{}} &\overset{(7)}{=}& \delta z_{0} \delta p_{0} \overset{(8)}{\gtrsim} h , \end{array}$$
(39)

which confirms the HUP also for the second version of Einstein’s box.

4 Energy - shutter opening time uncertainty

For the question of whether it is really necessary to invoke general relativity to defend the HUP, it is appropriate to examine other solutions to the problem posed by Einstein’s thought experiment. Treder’s derivation [3, 4] of an energy-time uncertainty based on an alleged analogy between the photon box performing damped oscillations and a metastable quantum state does not appear convincing. Most attractive seems to me the very general proposal of P. Busch [5] to use the Mandelstam-Tamm energy-time uncertainty relation to derive an inequality of the following form

$${\Delta} E T_{0} {\gtrsim} h ,$$
(40)

where T0 is the opening time of the shutter in the Einstein photon box experiment and ΔE is the standard deviation of the box Hamiltonian H w. r. t. the initial (pure) state ϕ, i. e.,

$${\Delta} E := \sigma(H)_{\phi} := \sqrt{\left\langle \phi \left| H^{2} \right| \phi\right\rangle -\left\langle \phi \left| H \right| \phi\right\rangle^{2} }$$
(41)

For the convenience of the reader, we will briefly recapitulate the derivation of (40) for the problem at hand in simplified form. Let T0 be the time required to transform the initial state ϕ to another state \(U_{T_{0}}\phi\) orthogonal to ϕ, i. e.,

$$0=\left\langle \phi |U_{T_{0}}|\phi\right\rangle:=\left\langle \phi | \exp\left( -\mathsf{i} H T_{0}/\hbar \right)|\phi\right\rangle ,$$
(42)

where H is the Hamiltonian of the total quantum system representing the photon box. Let \(\langle A \rangle :=\left \langle \phi |A|\phi \right \rangle\) for each linear operator A and replace H by H −〈H〉 in (42) which yields

$$\begin{array}{@{}rcl@{}} &&\left\langle \exp\left( -\mathsf{i} (H-\langle H \rangle) T_{0}/\hbar \right)\right\rangle\\ &=& \left\langle \exp\left( -\mathsf{i} H T_{0}/\hbar \right)\right\rangle \exp\left( \mathsf{i} \langle H \rangle T_{0}/\hbar \right) \overset{(43)}{=}0 . \end{array}$$
(43)

The real part of (43) reads

$$0=\left\langle \cos\left( (H-\langle H \rangle) T_{0}/\hbar \right)\right\rangle ,$$
(44)

which, by means of \({\cos \limits } x\ge 1-x^{2}/2\) and functional calculus, gives rise to the inequality

$$\begin{array}{@{}rcl@{}} &0&\ge \left\langle\mathbbm{1}-\frac{1}{2}\left( (H-\langle H \rangle) T_{0}/\hbar \right)^{2} \right\rangle \end{array}$$
(45)
$$\begin{array}{@{}rcl@{}} &\Leftrightarrow & \left\langle \left( H-\langle H\rangle \right)^{2}\right\rangle {T_{0}^{2}} \ge 2 \hbar^{2} \end{array}$$
(46)
$$\begin{array}{@{}rcl@{}} &\overset{(42)}{\Leftrightarrow} & {\Delta} E T_{0} \ge \sqrt{2} \hbar=\frac{h}{\sqrt{2} \pi} . \end{array}$$
(47)

It can be argued that if the initial state ϕ represents the box with a closed shutter then the state \(U_{T_{0}} \phi\) representing the box with open shutter should be approximately orthogonal to ϕ. (Hence the opening time would rather be 2T0 but the factor 2 is irrelevant in (40).) Weakening (42) to

$$0\approx \left\langle \phi \left| U_{T_{0}}\right|\phi\right\rangle ,$$
(48)

then yields (40) as a corresponding weak form of (47).

Fig. 3
figure 3

Sketch of a physical realization of a shutter mechanism by a vibrating slab (yellow region) with a circular hole of diameter L that overlaps with the hole in the wall of the box (blue region) for a short moment of duration T0

According to what has been said in Section 2 the implicit rules of the Einstein-Bohr debate require that arguments using pure quantum theory have to be reformulated in the semi-classical theory considered above. This would also hold for the energy - shutter opening time uncertainty relation derived above. Instead of the rather general form of (40), we will use a concrete physical implementation of the shutter mechanism by a small hole of diameter L in the box wall and a vibrating slab with a similar hole, see Figure 3. The vibrating slab will be modelled as a harmonic oscillator with mass μ and total energy ε and position-momentum variables (q,p) subject to the HUP

$$\delta p \delta q {\gtrsim} h .$$
(49)

If the amplitude of the oscillation of the slab is large compared with L the two holes in the wall and in the slab will overlap for approximately the time \(T_{0}\simeq L/v\) allowing the passage of a photon, where v is the maximal velocity of the oscillating slab given by

$$\frac{\mu}{2} v^{2} = \varepsilon ,$$
(50)

and hence satisfying

$$\delta \varepsilon\simeq \mu v \delta v = v \delta p .$$
(51)

Due to the statistical fluctuations of the harmonic oscillator distribution the temporal position of the opening time window will be uncertain by an amount of

$$\delta t \simeq \frac{\delta q}{v}\overset{(50)}{\gtrsim} \frac{h}{v \delta p} \overset{(52)}{\simeq}\frac{h}{\delta \varepsilon} ,$$
(52)

which yields the uncertainty relation

$$\delta \varepsilon \delta t {\gtrsim} h$$
(53)

analogous to (40). There are, however, some subtle differences: Instead of the opening time T0 itself the quantity δt occurs in (53) which represents the fluctuation of the position of the time window, not its size. But for estimating the fluctuation of the arrival times of photons, it is just as good. The second difference from (40) is that δε is the standard deviation not of the total energy of the box, but only of a part, namely the shutter mechanism. This is also sufficient for the present purpose since one can with good reason exclude that the variations of ε are compensated by other parts of the box if the initial distribution is chosen in product form w. r. t. variables of the shutter mechanism and the remaining part of the box.

Concerning the box experiment as modeled in Section 3.1 one could say that validity of the energy-time HUP (53) already limits the uncertainty of the first weighing before τ = 0. The statistical fluctuation of the total energy before τ = 0, or, equivalently of the total mass of the box, means that we cannot longer assume that this first weighing can be performed with arbitrary small uncertainty independent of the uncertainty δt of the emission time of the photon. On the other hand, the relativistic calculations in the Sections 3.1 and 3.2, which aim to reconstruct Bohr’s response, refer to the second weighing process at time τ = T/4 (or T/2).

In summary, we have translated the Busch proposal [5] into the semi-classical language of the Einstein-Bohr debate and confirmed that Bohr could have formulated a refutation of Einstein’s objections without recourse to general relativity.

5 Third version of Einstein’s box

It will be instructive to continue the fictitious discussion between Einstein and Bohr for one more round. An obvious response to the objection described in the last Section would be to dispense with a shutter mechanism altogether. Instead, a hole in the wall of the box could be kept open all the time, through which the photon would eventually escape and be traced in a detector. In this detector, an energy or time-of-arrival measurement could still be taken, as desired.

Fig. 4
figure 4

A schematic drawing of the third version of the photon box, drawn by deleting items from [2]. The box contains only one photon which eventually escapes through the hole in the wall. At the beginning the total rest mass M + m is measured by weighing; after escape of the photon the time t1 is measured at which the oscillating box passes the new equilibrium position

Of course, the question then arises how to determine the time of emission of the photon without controlling a shutter. In terms of classical mechanics, one can argue that this is also the time at which the box begins to oscillate around its new rest position at z = z1. Thus, if we measure the time t1 at which the pointer of the box passes the plane z = z1 and subtract from it the quarter period T/4, we obtain the time t0 = t1T/4 of the emission of the photon. Thus, compared to the version in Section 3.1, we have replaced a position measurement at time t = t1 with an arrival time measurement at the position z = z1. Apparently, the problem then arises that the calculation of both z1 and T requires knowledge of the rest mass M of the box without the emitted photon. We solve this problem by assuming that the box contains only one photon at the beginning, see Fig. 4, and that the rest mass M of the empty box can either be calculated or measured independently of the actual experiment with arbitrary precision. After all, the rest mass of an elementary particle is not an observable in the usual sense, but a physical constant. M could therefore be calculated as the sum of the masses of the elementary particles that make up the box minus the binding energy divided by c2.

The latter consideration is of course not valid for the rest mass M + m of the system box plus photon, which therefore has to be measured at the beginning of the experiment by a weighing (following Einstein) by means of a position measurement of z0 (following Bohr). We then encounter the problem of describing the “state after the first measurement” in semiclassical theory. In full quantum mechanics there is an elaborated theory for this, see, e. g., [15], but it need not be adopted because of the special rules of the Einstein-Bohr debate. Nevertheless, it seems reasonable to conceptualize the first position measurement as a phase-space observable measuring jointly position z0 and momentum p0 with respective uncertainties δz0 and δp0 satisfying \(\delta z_{0} \delta p_{0}{\gtrsim } h\), see [16]. The state after this measurement could then be taken as the ground state of a harmonic oscillator as far as the degree of freedom of the center-of-mass coordinates of the loaded box is concerned. However, these considerations serve only to give the reader a sense of the physical situation; for the Einstein-Bohr debate, it is merely assumed that the state after the first measurement satisfies the HUP.

Thus we can describe the third version of the Einstein box in two different ways: either as a preparation of an ensemble of correlated systems “box plus photon” (3.1) or as a preparation of an ensemble of photons (3.2). In both readings, the composite system “box plus photon” is brought into a particular state at the beginning by an initial position measurement. After the emission of the photon, a time measurement is made at the box, which can be used to define the time zero point for possible arrival time measurements of the photon. The said correlation refers to the energy, or rest mass, of the two subsystems in the sense that the initial rest mass M + m equals the sum of the final values of the rest mass M of the empty box and the energy-mass equivalent m of the photon. It is also assumed that the time t0 of emission of the photon coincides with its arrival time at the detector minus the time required for the path from the box to the detector. In this sense, the situation is somewhat similar to the production of a correlated EPR pair, which has already been noted and discussed by [12] and [13].

We focus on the second way (3.2) and interpret the uncertainties δE and δt as standard deviations of the energy and arrival time measurements, resp. , in the ensemble of photons. These standard deviations are estimated using semi-classical calculations and the correlations mentioned above. For the first energy measurement we have the balance condition (1) and hence

$$\begin{array}{@{}rcl@{}} \delta m &\approx&\frac{k}{g}\delta z_{0} , \end{array}$$
(54)
$$\begin{array}{@{}rcl@{}} \delta E &\approx& \frac{c^{2} k}{g}\delta z_{0}\overset{(7)}{=}\frac{c^{2} M \omega^{2}}{g}\delta z_{0} , \end{array}$$
(55)

using δM ≈ 0, as explained above.

The second measurement of the time t1 at which the pointer of the box passes the plane z = z1 gives the exact emission time t0 = t1T/4 only if the box is at rest at time t = t0. If, however, it has a certain momentum δp0 at time t = t0, it will reach the plane z = z1 slightly earlier (for δp0 > 0) which leads to time uncertainty

$$\delta t \approx \frac{k}{M m g \omega^{2}}\delta p_{0}\overset{(7)}{=}\frac{1}{m g}\delta p_{0} .$$
(56)

This follows from

$$\delta t =\frac{1}{\omega}\delta \varphi ,$$
(57)

and

$$\delta \varphi \approx\frac{\delta p_{0}}{M (z_{1}-z_{0}) \omega}\overset{(6)}{=}\frac{k \delta p_{0}}{M m g \omega} ,$$
(58)

see Fig. 2. Hence we obtain

$$\begin{array}{@{}rcl@{}} \delta E \delta t &\overset{(56,57)}{\approx}& \frac{c^{2} M \omega^{2}}{g}\frac{1}{m g}\delta p_{0} \delta z_{0} \end{array}$$
(59)
$$\begin{array}{@{}rcl@{}} &\overset{(8)}{\gtrsim}& \frac{c^{2} M \omega^{2}}{m g^{2}} h=\left( \frac{c}{v_{1}}\right)^{2}\frac{M}{m} h , \end{array}$$
(60)

where

$$v_{1}:=\frac{g}{\omega}\overset{(7)}{=}\frac{g T}{2\pi}$$
(61)

is of the order of magnitude of the velocity reached in free fall during the period of oscillation T of the box. Thus the HUP for energy and time is not only fulfilled, but is above the lower bound of h by the large factor (c/v1)2(M/m), see (60). Note that the difference between proper time of the moving box and the coordinate time that was crucial in the first two versions of Einstein’s box in the Sections 3.1 and 3.2 has been neglected here. Probably the gravitational time dilation would give a further contribution to δt, which alone would be sufficient to derive the HUP \(\delta E \delta t\gtrsim h\), but is not needed here.

Summarizing, also the third version of Einstein’s photon box does not threaten the consistency of (semi-classical) quantum mechanics and general relativity is not needed to show this.

6 Einstein’s Box Consistency Check

So far, we have always assumed that the debate follows the pattern of Einstein proposing a measurement that appears to violate the HUP and Bohr showing by a more detailed analysis of the measurement that the HUP is nevertheless satisfied. But how about Bohr questioning the consistency of Einstein’s proposal itself? Let’s play this out now for the case that Bohr doubts that one can determine the energy of a single photon by a weighing of the box. He could thus argue that the photon exerts the same radiation pressure on the ceiling and the bottom of the box and therefore does not contribute to the weight of the box. Ironically, in this case Einstein would be forced to use general relativity to show that due to the gravitational red shift, the radiation pressure on the ceiling is less than on the floor and that therefore the photon exerts a downward force upon the box.

We will calculate this force for the simple case of a purely vertical motion of a photon along a null geodesic of the Schwarzschild metric. Alternatively, one could calculate the radiation pressure using general relativistic Maxwell theory with probably the same result.

With respect to an orthonormal frame field in (1 + 1) dimensions the tangent vector of the null geodesic will be written as

$${\mathbf k}= {\Omega \choose c k} ,$$
(62)

such that \(E=\hbar {\Omega }\) will be the energy of the photon and \(p=\hbar k\) its vertical momentum satisfying

$${\Omega} = c k \Leftrightarrow E=c p .$$
(63)

The gravitational red shift assumes the form

$${\Omega}= {\Omega}_{0}\frac{\sqrt{1-\frac{2M_{g} G}{c^{2} r_{0}}}}{\sqrt{1-\frac{2M_{g} G}{c^{2} r}}} ,$$
(64)

compare [14, 6.3.5], where we have identified the coordinate of the vertical direction with the r of the Schwarzschild metric. Let r0 be the coordinate of the center of the box with height B and write r = r0 + ζ. Then, in linear approximation w. r. t. ζ, Ω can be written as

$${\Omega}\simeq\bar{\Omega} -{\Omega}_{0} \sqrt{1-\frac{2M_{g}G}{c^{2} r_{0}}} \frac{g \zeta}{c^{2}} ,$$
(65)

using the gravitational acceleration g according to (18). Analogously,

$$p=\frac{\hbar}{c}{\Omega}\simeq\bar{p} -\hbar {\Omega}_{0} \sqrt{1-\frac{2M_{g}G}{c^{2} r_{0}}} \frac{g \zeta}{c^{3}} .$$
(66)

Assuming perfect reflection, the momentum transfer to the box will be 2p and its difference between ceiling and the bottom (corresponding to ζ = B/2 and ζ = −B/2, resp. ) amounts to

$${\Delta} p \simeq -2 \hbar {\Omega}_{0} \sqrt{1-\frac{2M_{g}G}{c^{2} r_{0}}} \frac{g B}{c^{3}} .$$
(67)

Next we consider the lapse of time between two subsequent reflections of the photon at the ceiling (or the bottom) of the box. It will be sensible to take the proper time w. r. t the center of the box which reads

$${\Delta} \tau = \sqrt{1-\frac{2M_{g}G}{c^{2} r_{0}}} {\Delta} t \simeq \sqrt{1-\frac{2M_{g}G}{c^{2} r_{0}}} \frac{2 B}{c} ,$$
(68)

using that, in lowest order of approximation, B/c is the (coordinate) time the photon need to pass a box of height B. Hence for the mean force \(\bar {F}\) resulting from the difference between the radiation pressure at the ceiling and the bottom of the box we obtain

$$\bar{F}=\frac{\Delta p}{\Delta \tau}\simeq -\frac{2 \hbar {\Omega}_{0} \sqrt{1-\frac{2M_{g}G}{c^{2} r_{0}}} \frac{g B}{c^{3}}} {\sqrt{1-\frac{2M_{g}G}{c^{2} r_{0}}} \frac{2 B}{c}}= -\frac{\hbar {\Omega}_{0}}{c^{2}} g .$$
(69)

This means that the mean force equals the weight of an extra mass of size \(m=\frac {\hbar {\Omega }_{0}}{c^{2}}\) corresponding to the energy \(E_{0}=\hbar {\Omega }_{0}\) of the photon in the center of the box.

We can interpret this result to mean that Einstein’s photon box experiment can be consistently understood only within the framework of general relativity. This is hardly surprising, since the experiment is based on the special-relativistic formula E = mc2 and a weighing in the gravitational field. Thus, when Bohr, for his part, uses general relativity for his answer (which, as we have seen, he was not forced to do), this does not create a paradoxical situation, as has occasionally been speculated [20, pp. 122 - 123]. He does not reveal a mysterious connection between two independent theories, but merely moves within the framework of Einstein’s proposal.

7 Summary

The Einstein-Bohr debate and especially the discussion of the photon box has been evaluated differently over time. Initially Bohr’s account in [1] dominated and promoted the interpretation as a brilliant victory (“Einstein beaten with his own weapons”). In the last decades voices have been raised either questioning Bohr’s reading of the debate or criticizing his argumentation as unclear and/or superfluous, see [3,4,5,6,7,8,9,10,11,12,13]. In this situation, we have tried in the present article, on the one hand, to reconstruct Bohr’s answer in such a way that the original intention is clearly and understandably expressed, and, on the other hand, to discuss the suggestions of a possible answer to Einstein’s objections without recourse to the general theory of relativity. Here the implicit rules of the debate play a certain role, which we understand in such a way that the discussion is conducted in a semiclassical framework which makes a dialogue between originally “incommensurable” positions in the sense of T. S. Kuhn [17] possible at all. In this sense, we could specifically reconstruct Busch’s proposal based on the Mandelstam-Tamm energy-time uncertainty in terms of an analysis of the shutter mechanism. Further, we considered still another version of the Einstein photon box without any shutter which can nevertheless be shown to satisfy the HUP without invoking general relativity. While this is not proof that this will always be the case with similar versions of the photon box, it does support the somewhat sober position that there is no secret connection between quantum theory and general relativity that would be revealed by the Einstein-Bohr debate. Moreover, the discussion in section 6 shows that the photon box, properly understood, is already a general relativistic system.

The meaning of “uncertainty” in the Einstein-Bohr debate allows for a certain ambiguity which, as mentioned in the Introduction, is even constitutive for the debate. The reason is that “uncertainty” interpreted as statistical fluctuation in semi-classical ensembles allows a realistic ignorance interpretation. The values of all observables have sharp values in a single experiment, but one does not know them. This realistic interpretation should be close to Einstein’s position, while Bohr apparently rejected it. Nevertheless, both can usefully discuss whether in sophisticated thought experiments the semi-classical uncertainty can be pushed below the limits given by the HUP. In our reconstruction we have always understood “uncertainty” in the neutral sense of statistical standard deviations w. r. t. an ensemble of photons.

The detailed discussion of the Einstein-Bohr thought experiment sheds particular light on the need for temporal synchronization between preparation and measurement processes, which is usually taken for granted but needs to be examined closely when the exchange of single photons is important. There exist further interpretations of “uncertainty” in the sense of the difference between ideal and actual measurements that have been recently put into a rigorous framework, see [16, 18, 19]. It is likely that these interpretations can also be applied to the Einstein-Bohr debate, but we have refrained from doing so in order not to complicate the discussion.