Abstract
In 2017, Lienert and Tumulka proved Born’s rule on arbitrary Cauchy surfaces in Minkowski space-time assuming Born’s rule and a corresponding collapse rule on horizontal surfaces relative to a fixed Lorentz frame, as well as a given unitary time evolution between any two Cauchy surfaces, satisfying that there is no interaction faster than light and no propagation faster than light. Here, we prove Born’s rule on arbitrary Cauchy surfaces from a different, but equally reasonable, set of assumptions. The conclusion is that if detectors are placed along any Cauchy surface \(\Sigma \), then the observed particle configuration on \(\Sigma \) is a random variable with distribution density \(|\Psi _\Sigma |^2\), suitably understood. The main different assumption is that the Born and collapse rules hold on any spacelike hyperplane, i.e., at any time coordinate in any Lorentz frame. Heuristically, this follows if the dynamics of the detectors is Lorentz invariant.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
In its usual form, Born’s rule asserts that if we measure the positions of all particles of a quantum system at time t, the observed configuration has probability distribution with density \(|\Psi _t|^2\). One would expect that Born’s rule also holds on arbitrary Cauchy surfacesFootnote 1\(\Sigma \) in Minkowski space-time \(\mathbb {M}\) in the following sense: If we place detectors along \(\Sigma \), then the observed particle configuration has probability distribution with density \(|\Psi _\Sigma |^2\), suitably understood. We call the latter statement the curved Born rule; it contains the former statement as a special case in which \(\Sigma \) is a horizontal 3-plane in the chosen Lorentz frame. We prove here the curved Born rule as a theorem; more precisely, we prove that the Born rule holds on arbitrary Cauchy surfaces assuming (i) that the Born rule holds on hyperplanes, i.e., on flat surfaces (flat Born rule), (ii) that the collapse rule holds on hyperplanes (flat collapse rule), (iii) that the unitary time evolution contains no interaction terms between spacelike separated regions (interaction locality), and (iv) that wave functions do not spread faster than light (propagation locality). A similar theorem was proved by Lienert and Tumulka in [24]. As we will discuss in more detail in Sect. 1.2, the central difference is that the detection process was modeled in a different way; our model of the detection process is in a way more natural and leads to a simpler proof of the theorem.
This paper is structured as follows. In the remainder of Sect. 1, we describe our results. In Sect. 2, we provide technical details of the concepts used. In Sect. 3, we derive the Born rule on triangular surfaces. In Sect. 4, we prove our statements about approximating Cauchy surfaces with triangular surfaces. In Sect. 5, we provide the proof of our main theorem.
1.1 Hypersurface Evolution
In order to formulate the curved Born rule, we need to have a mathematical object \(\Psi _\Sigma \) available that represents the quantum state on \(\Sigma \). To this end, we regard as given a hypersurface evolution (precise definition given in Sect. 2 or [24]) that provides a Hilbert space \({\mathscr {H}}_\Sigma \) for every Cauchy surface \(\Sigma \) and a unitary isomorphism \(U_\Sigma ^{\Sigma '}:{\mathscr {H}}_\Sigma \rightarrow {\mathscr {H}}_{\Sigma '}\) representing the evolution between any two Cauchy surfaces, \(\Psi _{\Sigma '}=U_\Sigma ^{\Sigma '} \, \Psi _\Sigma \). The situation is similar in spirit to the Tomonaga–Schwinger approach [33, 34, 36], although Tomonaga and Schwinger used the interaction picture for identifying all \({\mathscr {H}}_\Sigma \) with each other.
We take the detected particle configuration on \(\Sigma \) to be an element of the unordered configuration space of a variable number of particles,
the set of all finite subsets of \(\Sigma \). (If more than one, say \(m\in \mathbb {N}\), species of particles are present, one may either, by straightforward generalization of our results, consider \(\Gamma (\Sigma )^m\) as the configuration space or apply the mapping \(\Gamma (\Sigma )^m \rightarrow \Gamma (\Sigma ): (q_1,\ldots ,q_m) \mapsto q_1 \cup \ldots \cup q_m\) that erases the species labels and still consider probability distributions on \(\Gamma (\Sigma )\), as we will do here.)
It will be convenient to write the \(|\Psi _\Sigma |^2\) distribution (the curved Born distribution) in the form of the measure \(\langle \Psi _\Sigma |P_\Sigma (\cdot )|\Psi _\Sigma \rangle =\Vert P_\Sigma (\cdot )\Psi _\Sigma \Vert ^2\), where \(P_\Sigma \) is the appropriate projection-valued measure (PVM) onFootnote 2\(\Gamma (\Sigma )\) acting on \({\mathscr {H}}_\Sigma \). That is, if \(\Psi _\Sigma \) can be regarded as a function on \(\Gamma (\Sigma )\), then, for any \(S\subseteq \Gamma (\Sigma )\), \(P_\Sigma (S)\) is the multiplication by the characteristic function of S and
with dq the appropriate volume measure on \(\Gamma (\Sigma )\). But we do not have to regard \(\Psi _\Sigma \) as a function, we can treat it abstractly as a vector in the given Hilbert space \({\mathscr {H}}_\Sigma \). The PVM \(P_\Sigma \) is automatically given if the \({\mathscr {H}}_\Sigma \) are Fock spaces or tensor products thereof.
Another way of putting the curved Born rule (although perhaps not fully equivalent with regard to a curved collapse rule, see Remark 3 in Sect. 1.4) is to say that \(P_\Sigma \) is the configuration observable on \(\Sigma \). So, our theorem could be summarized as showing that if \(P_E\) is the configuration observable on every hyperplane E, then \(P_\Sigma \) is the configuration observable on every Cauchy surface \(\Sigma \), provided interaction locality (IL) and propagation locality (PL) hold.
A hypersurface evolution is specified by specifying the \({\mathscr {H}}_\Sigma \)’s, the \(U_\Sigma ^{\Sigma '}\)’s, and the \(P_\Sigma \)’s; we denote it by \(({\mathscr {H}}_\circ ,U_\circ ^\circ ,P_\circ )\) with \(\circ \) a placeholder for Cauchy surfaces. Some examples are described in [24] and in Remark 10 in Sect. 2.2; they arise especially from multi-time wave functions [9, 11, 25, 33]; see [23] for an introduction and overview. While certain ways of implementing an ultraviolet cutoff [7, 26] lead to multi-time wave functions that cannot be evaluated on arbitrary Cauchy surfaces, models without cutoff define a hypersurface evolution, either on the non-rigorous [28, 29] or on the rigorous level [6, 19,20,21,22]. As a consequence, our result proves in particular a Born rule for multi-time wave functions, thereby generalizing a result of Bloch [4] (see also Remark 4 in [24]).
We do not, as one would in quantum electrodynamics or quantum chromodynamics, exclude states of negative energy; it remains for future work to extend our result in this direction.
1.2 Previous Result
A theorem similar to ours has been proved by Lienert and Tumulka [24]; our result differs in what exactly is assumed, and how the detection process is modeled. The fact that the curved Born rule can be obtained through different models of the detection process and from different sets of assumptions suggests that it is a robust consequence of the flat Born rule.
In fact, our result was already conjectured by Lienert and Tumulka, who also suggested the essentials of the model of the detection process we use here, although their theorem concerned a different model. The biggest difference between their theorem and ours is that we assume the Born rule and collapse rule to hold on tilted hyperplanes, whereas Lienert and Tumulka assumed them only on horizontal hyperplanes in a fixed Lorentz frame.
a Our detection process is based on approximating a curved surface \(\Sigma \) by a piecewise flat surface. b The detection process used by Lienert and Tumulka is based on approximating a curved surface \(\Sigma \) by disconnected pieces of horizontal surfaces. We have set the speed of light to \(c=1\). Color online
Our model of the detection process is perhaps more natural than the one at the basis of Lienert and Tumulka’s theorem, as it approximates detectors on tilted surfaces through detectors on tilted hyperplanes, rather than on numerous small pieces of horizontal hyperplanes. On the other hand, the result of Lienert and Tumulka is stronger than ours in that it assumes the Born rule only on horizontal hyperplanes (“horizontal Born rule”) and not on all tilted spacelike hyperplanes (“flat Born rule”). Then again, our model allows for a somewhat simpler proof compared to that of Lienert and Tumulka, and the assumption of the Born and collapse rules on tilted hyperplanes seems natural if the workings of detectors are Lorentz invariant. Yet, our proof does not require the Lorentz invariance of the hypersurface evolution of the observed system (see also Remark 13 in Sect. 2.2); in particular, the hypersurface evolution may involve external fields that break the Lorentz symmetry.
Other works in recent years dealing with a physical analysis of the quantum measurement process include [1, 12,13,14].
1.3 Detection Process
Our definition of the detection process is based on approximating any given Cauchy surface \(\Sigma \) by spacelike surfaces \(\Upsilon \) that are piecewise flat, and whose (countably many) flat pieces are 3d (non-regular) tetrahedra. We call such surfaces triangular surfaces; see Fig. 2. While the precise definition of a triangular surface will be postponed to Sect. 2, it may be useful to formulate already here a basic fact that we will prove in Sect. 4:
Proposition 1
For every Cauchy surface \(\Sigma \) in Minkowski space-time, there is a sequence \((\Upsilon _n)_{n\in \mathbb {N}}\) of triangular Cauchy surfaces that converges increasingly and uniformly to \(\Sigma \).
Here, “increasing” means thatFootnote 3\(\Upsilon _{n+1}\subseteq \mathrm {future}(\Upsilon _n)\) for all n; see Fig. 3. Uniform convergence in a given Lorentz frame means that for every \(\varepsilon >0\), all but finitely many \(\Upsilon _n\) lie in \(\{x+(s,0,0,0):x\in \Sigma ,|s|<\varepsilon \}\); equivalently, since \(\Sigma \) is the graph of a function \(f:\mathbb {R}^3\rightarrow \mathbb {R}\) and \(\Upsilon _n\) the graph of a function \(f_n:\mathbb {R}^3\rightarrow \mathbb {R}\), uniform convergence \(\Upsilon _n\rightarrow \Sigma \) means that \(f_n\) converges uniformly to f. It turns out that this notion is Lorentz invariant:
Proposition 2
If a sequence \((\Sigma _n)_{n\in \mathbb {N}}\) of Cauchy surfaces converges uniformly to a Cauchy surface \(\Sigma \) in one Lorentz frame, then also in every other.
Again, the proof is given in Sect. 4. The following notation will be convenient: for any subset \(A\subseteq \Sigma \), let
be the sets of configurations with no, at least one, or all particles in A (see Fig. 4). Note that \(\exists (A)^c = \emptyset (A) = \forall (A^c)\), where \(A^c\) means the complement of A with respect to \(\Sigma \). We also briefly write \(\forall A\) for \(\forall (A)\), and similarly \(\exists A\) and \(\emptyset A\).
We define the detection distribution on \(\Sigma \) as the limit of the detection distributions on the \(\Upsilon _n\), and we show in Theorem 1 that this limit exists and agrees with \(|\Psi _\Sigma |^2\). But to this end, we first need to talk about detection probabilities on triangular surfaces \(\Upsilon \).
So let \(\Delta _k\) be the open and disjoint tetrahedra such that
(the bar denotes closure, \({\mathscr {K}}\) is a countably infinite index set). We want to consider a detector in a bounded region \(B\subset \Upsilon \) that yields outcome 1 if there is at least one particle in B and outcome 0 if there is no particle in B. To this end, we imagine several smaller detectors, one in each region \(B_k := B\cap \Delta _k\), and set the B-outcome equal to 1 whenever any of the small detectors clicked. Now each region \(B_k\), being a subset of \(\Delta _k\), lies in some hyperplane \(E_k\), and on hyperplanes we assume the Born rule and collapse rule:
Flat Born rule. If on the hyperplane E the state vector is \(\Psi _E\in {\mathscr {H}}_E\) with \(\Vert \Psi _E\Vert =1\), and a detection is attempted in the region \(B\subseteq E\), then the probability of outcome 1 is \(\Vert P_E(\exists (B))\, \Psi _E\Vert ^2\) and that of outcome 0 is \(\Vert P_E(\exists (B)^c)\, \Psi _E\Vert ^2\).
Flat collapse rule. If the outcome is 1, then the collapsed wave function is
otherwise
There are two natural possibilities for defining the detection probabilities on \(\Upsilon \) in terms of those on \(E_k\): the sequential detection process and the parallel detection process. According to the sequential detection process, we choose an arbitrary ordering of the set \({\mathscr {K}}\) indexing the tetrahedra or hyperplanes and carry out, in this order, a quantum measurement in each \(E_k\) representing the detection attempt in \(B_k\) including appropriate collapse and then use the unitary evolution \(U_{E_k}^{E_{k+1}}\) to evolve to the next hyperplane in the chosen order, here written as \(E_{k+1}\). For the parallel detection process, consider the projection operators \(P_{E_k}(\exists (B_k))\) associated with attempted detection in \(B_k\); we show that they, after being transferred to \({\mathscr {H}}_{\Upsilon }\) by means of \(U_{E_k}^\Upsilon \), commute with each other if interaction locality holds, so they can be “measured simultaneously.” The simultaneous quantum measurement of these projections in \({\mathscr {H}}_{\Upsilon }\) provides the parallel detection process for \(B\subset \Upsilon \) with outcome 1 whenever any of the quantum measurements yielded 1. It turns out that the sequential and the parallel process agree with each other and with the Born rule on \(\Upsilon \):
Proposition 3
Fix a hypersurface evolution satisfying interaction locality (IL) (Definition 6), a triangular Cauchy surface \(\Upsilon \), a bounded subset \(B\subset \Upsilon \), and a normalized quantum state \(\Psi \), and assume the flat Born rule and the flat collapse rule. The sequential detection process in any order of the tetrahedra of \(\Upsilon \) yields the same detection probability, called \(\mathbb {P}^\Psi _B\); it agrees with the one given by the curved Born distribution on \(\Upsilon \), which is \(\Vert P_{\Upsilon }(\exists (B))\Psi _\Upsilon \Vert ^2\). Moreover, the parallel detection process also yields the same detection probability.
Proposition 3 will follow as a direct consequence of Proposition 5 in Sect. 3.
Actually, for either a triangular surface \(\Upsilon \) or a general Cauchy surface \(\Sigma \), we want more than just to detect for a subset B whether there is a particle in B. We want to allow the use of several detectors, each covering a region \(P_1,\ldots ,P_r\subset \Sigma \); the outcome of the experiment is \(L=(L_1,\ldots ,L_r)\) with \(L_\ell =1\) if a particle gets detected in \(P_\ell \) and \(L_\ell =0\) otherwise. It seems physically reasonable that the region covered by a detector is bounded and has boundary of measure zero.
Definition 1
An admissible partition \({\mathscr {P}}=(P_1,\ldots ,P_r)\) of \(\Sigma \) is defined by choosing finitely many subsets \(P_\ell \) of \(\Sigma \) that are mutually disjoint, \(P_\ell \cap P_m = \emptyset \) for \(\ell \ne m\), and such that each \(P_\ell \) is bounded and has boundary in \(\Sigma \) of (invariant) 3-volume 0. Here, the term bounded refers to the Euclidean norm on \( \mathbb {R}^4 \). We set \(P_{r+1}=\Sigma \setminus (P_1 \cup \ldots \cup P_r)\) to make \((P_1,\ldots ,P_{r+1})\) a partition of \(\Sigma \).
The idea is that there is no detector in \(P_{r+1}\). Let \(M_{{\mathscr {P}}}(L)\) denote the set of configurations in \(\Gamma (\Sigma )\) such that, for each \(\ell =1,\ldots ,r\), there is no point in \(P_\ell \) if \(L_\ell =0\) and at least one point in \(P_\ell \) if \(L_\ell =1\); that is, \(M_{{\mathscr {P}}}(L)\) is the set of configurations compatible with outcome L.
Now the definition of detection probabilities on a triangular surface \(\Upsilon \) can straightforwardly be generalized from a bounded set \(B\subset \Upsilon \) to an admissible partition \({\mathscr {P}}=(P_1,\ldots ,P_r)\) of \(\Upsilon \) in both the sequential and the parallel sense, and we find:
Proposition 4
Fix a hypersurface evolution satisfying interaction locality, a triangular Cauchy surface \(\Upsilon \), an admissible partition \({\mathscr {P}}=(P_1,\ldots ,P_r)\) of \(\Upsilon \), and a normalized quantum state \(\Psi \), and assume the flat Born rule and the flat collapse rule. The joint distribution \(\mathbb {P}^\Psi _{{\mathscr {P}}}(L)\) of \(L=(L_1,\ldots ,L_r)\) according to the sequential detection process in any order of the tetrahedra of \(\Upsilon \) and according to the parallel detection process agree with each other and with the one given by the curved Born distribution on \(\Upsilon \), which is \(\Vert P_{\Upsilon }(M_{{\mathscr {P}}}(L))\Psi _\Upsilon \Vert ^2\).
Proposition 4 can be regarded as a statement of the Born rule on triangular surfaces. It follows from Proposition 5, which is proven in Sect. 3.
1.4 Main Result
Before we elucidate the result, let us briefly introduce some more terminology.
Definition 2
Let \( \Sigma , \Sigma ' \) be Cauchy surfaces and \( A \subseteq \Sigma \). We then define the grown set of A in \( \Sigma ' \) as (see Fig. 5)
Similarly, we define the shrunk set of A in \( \Sigma ' \) as:
The following aspect of our result requires some explanation: once we have a triangular surface \(\Upsilon \) approximating a given Cauchy surface \(\Sigma \), and once we are given an admissible partition \({\mathscr {P}}=(P_1,\ldots ,P_r)\) on \(\Sigma \), we want to approximate the sets \(P_\ell \subset \Sigma \) by sets \(B_\ell \) in \(\Upsilon \). One may think of two natural possibilities of defining \(B_\ell \): (i) project \(P_\ell \) downwards along the direction of the \(x^0\) axis of a chosen Lorentz frame; or (ii) take \(B_\ell =\mathrm{Sr}(P_\ell ,\Upsilon )\), the smallest set on \(\Upsilon \) that in some sense corresponds to \(P_\ell \). Our result holds in both variants; we formulate it in variant (i) (see Remark 5 in Sect. 5 about (ii)). That is, choose a Lorentz frame and let
be the projection to the space coordinates. It is known [27, p. 417] that the restriction \(\pi _\Sigma \) of the projection \(\pi \) to \(\Sigma \) is a homeomorphism \(\Sigma \rightarrow \mathbb {R}^3\); thus, \(\pi _{\Sigma }^\Upsilon :=\pi ^{-1}_\Upsilon \circ \pi _\Sigma \) is a homeomorphism \(\Sigma \rightarrow \Upsilon \). We set
Of course, since we prove that the limiting probability distribution on \(\Gamma (\Sigma )\) is given by the curved Born distribution, the limiting probability distribution is independent of the choice of Lorentz frame used for defining \(\pi _\Sigma ^\Upsilon \).
We can now state our main result.
Theorem 1
Let \(\Sigma \) be a Cauchy surface in Minkowski space-time \(\mathbb {M}\) and \((\Upsilon _n)_{n\in \mathbb {N}}\) a sequence of triangular Cauchy surfaces that converges increasingly and uniformly to \(\Sigma \). Let \({\mathscr {E}}=({\mathscr {H}}_\circ , P_\circ , U_\circ ^\circ )\) be a hypersurface evolution satisfying propagation locality and \(\Psi _0\in {\mathscr {H}}_{\Sigma _0}\) with \(\Vert \Psi _0\Vert =1\) for some \(\Sigma _0\) in the past of \(\Sigma \). Then for any admissible partition \({\mathscr {P}}\) of \(\Sigma \), \({\mathscr {B}}_n:=\bigl (\pi _\Sigma ^{\Upsilon _n}(P_1),\ldots ,\pi _\Sigma ^{\Upsilon _n}(P_r)\bigr )\) is an admissible partition of \(\Upsilon _n\), and
for all \(L\in \{0,1\}^r\).
Together with Proposition 4, we obtain:
Corollary 1
Assume the hypotheses of Theorem 1 together with the flat Born rule, the flat collapse rule, and interaction locality. Define the detection probabilities for \({\mathscr {P}}\) on \(\Sigma \) as the limit of the detection probabilities for \({\mathscr {B}}_n\) on \(\Upsilon _n\) and the latter through either the sequential or the parallel detection process. Then the detection probabilities for \({\mathscr {P}}\) on \(\Sigma \) are given by the curved Born rule, \(\bigl \Vert P_\Sigma (M_{{\mathscr {P}}}(L)) \, \Psi _\Sigma \bigr \Vert ^2\) for all \(L\in \{0,1\}^r\).
The proof of Theorem 1 (see Sect. 5) makes no special use of dimension \(3+1\) and applies equally in dimension \(d+1\) for any \(d\in \mathbb {N}\); tetrahedra then need to be replaced by d-dimensional simplices.
Remark
-
1.
Shrunk set \(\mathrm{Sr}(A,\Sigma ')\) Definition (8) is equivalent to saying that the shrunk set is the intersection of \(\Sigma '\) and the domain of dependence of \(\Sigma \).
-
2.
Uniqueness of the measure on \(\Gamma (\Sigma )\) It was shown in Proposition 3 in Section 6 of [24] that if two probability measures \(\mu ,\mu '\) on \(\Gamma (\Sigma )\) agree on all detection outcomes, \(\mu (M_{{\mathscr {P}}}(L))=\mu '(M_{{\mathscr {P}}}(L))\) for every \(L\in \{0,1\}^r\) and every admissible partition \({\mathscr {P}}\) of \(\Sigma \), then \(\mu =\mu '\). Thus, the whole \(|\Psi _\Sigma |^2\) distribution is uniquely determined by the detection probabilities. In fact, a probability measure \(\mu \) on \(\Gamma (\Sigma )\) is already uniquely determined by the values \(\mu (\emptyset (A))\), where A runs through those subsets of \(\Sigma \) whose projection \(\pi (A)\) to \(\mathbb {R}^3\) is a union of finitely many open balls (see the proof of Proposition 3 in [24]). This fact might suggest that, in order to prove the curved Born rule, it would have been sufficient to prove the statement of Theorem 1 only for a single detector (i.e., for partitions with \(r=1\) consisting of \(P_1=A\) and \(P_{r+1}=\Sigma \setminus A\)) in a region A of the type described. However, we prove the stronger statement for arbitrary r because it is not obvious that the detection probabilities for arbitrary r fit together to form a measure on \(\Gamma (\Sigma )\) (in other words, that detection probabilities for \(r>1\) will agree with the Born distribution, given that detection probabilities for \(r=1\) do).
-
3.
Curved collapse rule One can also consider a curved collapse rule: Suppose that r detectors are placed along \(\Sigma \), that each detector (say the \(\ell \)-th) only measures whether there is a particle in the region \(P_\ell \),where \({\mathscr {P}}=(P_1,\ldots ,P_r)\) is an admissible partition, and that each detector acts immediately (i.e., is infinitely fast). If the outcome was \(L=(L_1,\ldots ,L_r)\in \{0,1\}^r\),then the wave function immediately after detection is the collapsed wave function
$$\begin{aligned} \Psi '_\Sigma = \frac{P_\Sigma (M_{{\mathscr {P}}}(L)) \, \Psi _\Sigma }{\Vert P_\Sigma (M_{{\mathscr {P}}}(L)) \, \Psi _\Sigma \Vert }\,. \end{aligned}$$(12)There is a sense in which the curved collapse rule also follows from our result and a sense in which it does not. To begin with the latter, our justification of the Born rule on triangular surfaces was based on the idea that on each tetrahedron \(\Delta _k\), we apply a detector to \(B_{k\ell }=\Delta _k\cap B_\ell \) and deduce from the outcomes whether a particle has been detected anywhere in \(B_\ell \). This detection process measures more than whether there is a particle in \(B_\ell \), as it also measures which of the \(B_{k\ell }\) contain particles; as a consequence, this detection process would collapse \(\Psi \) more narrowly than (12). However, if we assume that on triangular surfaces \(\Upsilon \) we can have detectors that only measure whether there is a particle in \(B_\ell \) for an admissible partition \({\mathscr {B}}=(B_1,\ldots ,B_r)\), so that the collapse rule (12) holds upon replacing \(\Sigma \rightarrow \Upsilon \) and \({\mathscr {P}}\rightarrow {\mathscr {B}}\), then sufficient approximation of an arbitrary Cauchy surface \(\Sigma \) by triangular surfaces leads to a collapsed wave function arbitrarily close to (12). Indeed, we have that (see Sect. 5 for the proof)
Corollary 2
Under the hypotheses of Theorem 1,
-
4.
Other observables As the curved Born rule shows, the PVM \(P_\Sigma \) can be regarded as the totality of position observables on \(\Sigma \). What about other observables? In a sense, all other observables are indirectly determined by the position observable. As Bell [3, p. 166] wrote:
[I]n physics, the only observations we must consider are position observation, if only the positions of instrument pointers. [...] If you make axioms, rather than definitions and theorems, about the ‘measurements’ of anything else, then you commit redundancy and risk inconsistency.
A detailed description of how self-adjoint obervables arise from the Hamiltonian of an experiment, the quantum state of the measuring apparatus, and the position observable (of its pointer), can be found in [12, Sect. 2.7]. A conclusion we draw is that specifying a quantum theory’s hypersurface evolution is an informationally complete description. As another conclusion, the PVM \(P_\Sigma \) serves not only for representing detectors. When we want to argue that certain experiments are quantum measurements of certain observables, we may use it to link the quantum state with macro-configurations (say, of pointer positions), and in fact to obtain probabilities for pointer positions. A related but quite different question is how the algebras of local operators common in algebraic QFT (such as smeared field operator algebras or Weyl algebras) are related to \(P_\Sigma \). It would be a topic of interest for future work to make this relation explicit. Coming back to the Bell quote, one may also note that for the same reason, making the curved Born rule an axiom in addition to the flat Born rule means to commit redundancy and to risk inconsistency. That is why we have made the curved Born rule a theorem. Of course, we have still committed a little bit of the redundancy that Bell talked about by assuming the Born and collapse rules on all spacelike hyperplanes while it suffices to assume them on horizontal hyperplanes [24].
-
5.
Objections Some authors [37] have criticized the very idea of evolving states from one Cauchy surface to another on the grounds that such an evolution cannot be unitarily implemented for the free second-quantized scalar Klein–Gordon field. It seems to us that these difficulties do not invalidate the approach but stem from analogous difficulties with 1-particle Klein–Gordon wave functions, which are known to lack a covariantly-defined timelike probability current 4-vector field that could be used for defining a Lorentz-invariant inner product that makes the time evolution unitary (e.g., [33]). In contrast, a hypersurface evolution according to our definition can indeed be defined for the free second-quantized Dirac equation allowing negative energies [5, 8, 10, 24]. Other results ([35, Sect. 1.8], [17, 18]) may raise doubts about propagation locality; on the other hand, these results presuppose positive energy, which we do not require here; moreover, violations of propagation locality would seem to allow for superluminal signaling. Be that as it may, we simply assume here a propagation-local hypersurface evolution as given; further developments of this notion can be of interest for future works.
-
6.
Evolution between hyperplanes Following [24, Sect. 8], we conjecture that a hypersurface evolution \({\mathscr {E}}\) satisfying interaction locality and propagation locality is uniquely determined up to unitary equivalence [24, Sect. 3.2 Remark 14] by its restriction to hyperplanes. We conjecture further that a hypersurface evolution that is in addition Poincaré covariant (see Remark 13 in Sect. 2.2) is uniquely determined by its restriction to horizontal hyperplanes \(\{x^0=\mathrm {const.}\}\). While we do not have a proof of these statements, a related statement follows from our results: Suppose two hypersurface evolutions \( {\mathscr {E}}= ({\mathscr {H}}_\circ , P_\circ , U_\circ ^\circ ) \) and \( {\tilde{{\mathscr {E}}}} = ({\mathscr {H}}_\circ , P_\circ , {\tilde{U}}_\circ ^\circ ) \) use the same Hilbert spaces and PVMs but potentially different evolution operators; suppose further that the evolution operators agree on hyperplanes, \(U_E^{E'}={\tilde{U}}_E^{E'}\) for all spacelike hyperplanes \(E,E'\); finally, suppose that both \({\mathscr {E}}\) and \({\tilde{{\mathscr {E}}}}\) satisfy interaction locality and propagation locality. Then they yield the same Born distribution on every Cauchy surface \(\Sigma \), i.e., for every \(\Psi _0\in {\mathscr {H}}_{E_0}\) on \(E_0=\{x^0=0\}\) and every \(S\subseteq \Gamma (\Sigma )\),
$$\begin{aligned} \Vert P_\Sigma (S)\, U_{E_0}^\Sigma \, \Psi _0\Vert ^2 = \Vert P_\Sigma (S)\, {\tilde{U}}_{E_0}^\Sigma \, \Psi _0\Vert ^2 \,. \end{aligned}$$(14)Indeed, by Remark 2, (14) holds for all \(S\subseteq \Sigma \) if it holds for all \(M_{{\mathscr {P}}}(L)\) for all admissible partitions \({\mathscr {P}}\) of \(\Sigma \). By Theorem 1, both sides can be expressed as the limits of detection probabilities on triangular surfaces. Those in turn can be expressed, using the sequential detection process, in terms of \(U_E^{E'}\) respectively \({\tilde{U}}_E^{E'}\) only for hyperplanes \(E,E'\), so they are equal.
2 Definitions
2.1 Geometric Notions
We now begin the more technical part of this paper. We consider flat Minkowski space-time \( \mathbb {M}\) in \( 3+1 \) dimensions with metric tensor \( \eta _{\mu \nu } = \mathrm{diag}(1,-1,-1,-1) \). space-time points are denoted by \( x = x^\mu = (x^0,{\varvec{x}}) = (x^0,x^1,x^2,x^3) \), the Minkowski square is denoted by \( x^2 = x^\mu x_\mu \), Cauchy surfaces are denoted by \( \Sigma \subset \mathbb {M}\). For piecewise flat Cauchy surfaces, we reserve the notation \( \Upsilon \subset \mathbb {M}\), for flat Cauchy surfaces (spacelike 3-planes), the notation \(E\subset \mathbb {M}\); \(E_0 = \{x^\mu : x^0 = 0\} \cong \mathbb {R}^3 \) is the time-zero hyperplane. For a topological space X, we will denote by \({\mathscr {B}}(X) \) the corresponding Borel \( \sigma \)-algebra. The topology on \( \Sigma \) is that induced by the Euclidean \( \mathbb {R}^4 \)-norm on \( \mathbb {M}\). Restricting the projection \( \pi \) as in (9) to \( \Sigma \), we obtain a homeomorphism \(\pi _\Sigma = \pi |_\Sigma : \Sigma \rightarrow \mathbb {R}^3 \), which can be used to identify \({\mathscr {B}}(\Sigma ) \) with \({\mathscr {B}}(\mathbb {R}^3) \): For \( R \subseteq \Sigma \), we have that \( R \in {\mathscr {B}}(\Sigma ) \; \Leftrightarrow \; \pi (R) \in {\mathscr {B}}(\mathbb {R}^3) \). By Rademacher’s theorem, \( \Sigma \) possesses a tangent plane almost everywhere [24, Sect. 3]. If a tangent plane exists at \( x \in \Sigma \), the pullback of \( \eta _{\mu \nu } \) under the embedding \( \Sigma \hookrightarrow \mathbb {M}\) is either degenerate or a Riemann 3-metric. This metric can be used to define a volume measure \( \mu _\Sigma \) on \( (\Sigma ,{\mathscr {B}}(\Sigma )) \), as well as a volume measureFootnote 4\(\mu _{\Gamma (\Sigma )}\) on \((\Gamma (\Sigma ),{\mathscr {B}}(\Gamma (\Sigma )))\). In the configuration space \(\Gamma (\Sigma )\), we denote the n-particle sector by
Note that for disjoint sets \( A \cap B = \emptyset \), we have
with bijective identification map \( q \mapsto (q \cap A, q \cap B) \).
Definition 3
A triangular surface is a Cauchy surface \(\Upsilon \subset \mathbb {M}\) such that
where \({\mathscr {K}}\) is a countably infinite index set, each \(\Delta _k\) is a 3-open, non-degenerate, spacelike tetrahedron (i.e., the non-empty 3-interior of the convex hull of \(3+1\) points that are mutually spacelike), the \(\Delta _k\) are mutually disjoint (\(\Delta _{k_1} \cap \Delta _{k_2} = \emptyset \) for \(k_1\ne k_2\)), and every bounded region \(B\subset \Upsilon \) intersects only finitely many \(\Delta _k\).
2.2 Hypersurface Evolution
Definition 4
A hypersurface evolution \({\mathscr {E}} = ( {\mathscr {H}}_\circ , P_\circ , U_{\circ }^{\circ })\) is a collection of
-
1.
Hilbert spaces \({\mathscr {H}}_\Sigma \) for every Cauchy surface \(\Sigma \), equipped with
-
2.
A PVM \(P_\Sigma : {\mathscr {B}}(\Gamma (\Sigma )) \rightarrow \mathrm{Proj}({\mathscr {H}}_\Sigma )\), the set of projections in \({\mathscr {H}}_\Sigma \),
-
3.
Unitary isomorphisms \(U_{\Sigma }^{\Sigma '}:{\mathscr {H}}_\Sigma \rightarrow {\mathscr {H}}_{\Sigma '}\) (“evolution”), and
-
4.
A factorization mapping for every \(A\subseteq \Sigma \), i.e., with the abbreviation
$$\begin{aligned} {\mathscr {H}}_{\Sigma ,A} := {{\,\mathrm{Ran}\,}}P_\Sigma (\forall (A)), \end{aligned}$$(18)(where \({{\,\mathrm{Ran}\,}}\) denotes the range), a unitary isomorphism \(T_{\Sigma , A}:{\mathscr {H}}_\Sigma \rightarrow {\mathscr {H}}_{\Sigma ,A} \otimes {\mathscr {H}}_{\Sigma ,\Sigma \setminus A}\) (“translation”)
with the following properties:
-
(0)
\(U_\Sigma ^\Sigma = I_\Sigma \) and \(U_{\Sigma '}^{\Sigma ''} U_{\Sigma }^{\Sigma '} = U_{\Sigma }^{\Sigma ''}\) for all Cauchy surfaces \(\Sigma , \Sigma ', \Sigma ''\).
-
(i)
For every \(S\subset \Gamma (\Sigma )\) with \(\mu _{\Gamma (\Sigma )}(S) = 0\), also \(P_\Sigma (S) = 0\).
-
(ii)
For every \(\Sigma \), \(\dim {{\,\mathrm{Ran}\,}}P_\Sigma (\emptyset (\Sigma ))=1\). That is, up to a phase, there is a unique vacuum state \(| \emptyset (\Sigma ) \rangle \in {{\,\mathrm{Ran}\,}}P_\Sigma (\emptyset (\Sigma ))\) with \(\bigl \Vert |\emptyset (\Sigma ) \rangle \bigr \Vert = 1\).
-
(iii)
\(T_{\Sigma , \Sigma \setminus A} = \Pi T_{\Sigma , A}\) with \(\Pi \) the permutation of two tensor factors
-
(iv)
Factorization of the PVM:Footnote 5 For all \(A,B \subseteq \Sigma \),
$$\begin{aligned} P_\Sigma (\forall (B))= T_{\Sigma , A}^{-1} \bigl [P_\Sigma (\forall (A\cap B)) \otimes P_\Sigma (\forall (A^c\cap B))\bigr ] T_{\Sigma , A} \,. \end{aligned}$$(19)
This definition is equivalent to the one given in [24] but formulated in a more detailed way, as the isomorphisms T were previously not made explicit. We will often follow [24] and not make the isomorphism T explicit; that is, instead of saying “the given unitary isomorphism \(T_{\Sigma , A}\) maps \({\mathscr {H}}_\Sigma \) to \({\mathscr {H}}_{\Sigma ,A} \otimes {\mathscr {H}}_{\Sigma ,\Sigma \setminus A}\),” we simply say “\({\mathscr {H}}_\Sigma ={\mathscr {H}}_{\Sigma ,A} \otimes {\mathscr {H}}_{\Sigma ,\Sigma \setminus A}\).” Likewise, instead of (19), we simply write
where \(P_A\) means the restriction of \(P_\Sigma \) to subsets of \(\forall (A)\) as in Footnote 5.
Remark
-
7.
Uniqueness of the vacuum state Actually, our Propositions and the Theorem do not make use of property (ii), the uniqueness of the vacuum state. The reason we make it part of the definition of \({\mathscr {E}}\) is that it is part of the concept of hypersurface evolution as introduced in [24].
-
8.
\(P_\Sigma \) factorizes From (19) or (20), it follows that \(P_\Sigma \) factorizes not just for all-sets (i.e., sets of the form \(\forall (B)\)) but for all product sets in configuration space: for all \(A\subseteq \Sigma \), \(S_A\subseteq \forall A\), and \(S_{A^c}\subseteq \forall (\Sigma \setminus A)\),
$$\begin{aligned} P_\Sigma (S_A\times S_{A^c})= P_A(S_A) \otimes P_{A^c}(S_{A^c}) \end{aligned}$$(21)with \(S_A \times S_{A^c}\) understood as a subset of \(\Gamma (\Sigma )\). That is because, first, \(\forall B = \forall (A\cap B) \times \forall (A^c \cap B)\), second, the all-sets \(\forall C\) form a \(\cap \)-stable generator of \({\mathscr {B}}(\Gamma (\Sigma ))\), and third, it is a standard theorem in probability theory that measures (and hence also PVMs) agreeing on a \(\cap \)-stable generator of a \(\sigma \)-algebra agree on the whole \(\sigma \)-algebra; so, roughly speaking, relations true for all all-sets are true for all sets. Relation (21) is exactly the definition of the tensor product of two POVMs, so it can equivalently be expressed as
$$\begin{aligned} P_\Sigma = P_A \otimes P_{A^c}\,. \end{aligned}$$(22) -
9.
Splitting into more than two regions The restriction \(T_{\Sigma , B, A}\) of \(T_{\Sigma , A}\) to \({\mathscr {H}}_{\Sigma , B}\) maps \({\mathscr {H}}_{\Sigma , B}\) unitarily to \({\mathscr {H}}_{\Sigma ,A\cap B}\otimes {\mathscr {H}}_{\Sigma ,A^c\cap B}\). Moreover, (19) for \(A\subseteq B\) yields that P factorizes also in B, i.e., for every \(A\subseteq B\subseteq \Sigma \), \(S_A\subseteq \forall A\), and \(S_{B \setminus A}\subseteq \forall (B \setminus A)\),
$$\begin{aligned} P_\Sigma (S_A\times S_{B \setminus A})= T_{\Sigma , B, A}^{-1} \bigl [P_\Sigma (S_A) \otimes P_\Sigma (S_{B \setminus A})\bigr ] T_{\Sigma , B, A} \end{aligned}$$(23)with \(S_A \times S_{B\setminus A}\) understood as a subset of \(\forall B\). Furthermore, it follows that \(T_{\Sigma , B, B\setminus A} = \Pi T_{\Sigma , B, A}\), and that an associative law holds for the \(T_{\Sigma ,B,A}\): For any partition \(A_1,A_2,A_3\) of \(B\subseteq \Sigma \),
$$\begin{aligned} \bigl (I_{\Sigma , A_1} \otimes T_{\Sigma , A_2\cup A_3, A_2} \bigr )T_{\Sigma , B, A_1} = \bigl (T_{\Sigma ,A_1\cup A_2,A_1}\otimes I_{\Sigma , A_3} \bigr ) T_{\Sigma , B,A_1\cup A_2}\,. \end{aligned}$$(24)Hence, the Hilbert spaces and PVMs factorize also for finite partitions. The upshot is that it is OK to identify
$$\begin{aligned} {\mathscr {H}}_{\Sigma }&=\bigotimes _i {\mathscr {H}}_{\Sigma , A_i}~\text {and}\end{aligned}$$(25)$$\begin{aligned} P_\Sigma&=\bigotimes _i P_{A_i} \end{aligned}$$(26)for any finite partition \(\Sigma =\bigcup _i A_i\).
-
10.
Examples for hypersurface evolutions \({\mathscr {E}}\) Some examples for hypersurface evolutions can be found in [24]. As described there in Remark 15 and Section 4.1, the simplest example is provided by the non-interacting Dirac field without a Dirac sea, which also satisfies (IL) and (PL) as defined below. Further examples are provided by Tomonaga–Schwinger equations and multi-time wave functions (whose n-particle sectors are functions of n space-time points, rather than n space points [23]); explicit models include the emission-absorption model of [28] and the rigorous model with contact interaction of [20, 21]. Given an evolution law for multi-time wave functions \(\phi \), \(U_\Sigma ^{\Sigma '}\) can be defined by \(U_\Sigma ^{\Sigma '}:\phi |_\Sigma \mapsto \phi |_{\Sigma '}\); of course, one still has to check that this \(U_\Sigma ^{\Sigma '}\) is indeed unitary. In fact, multi-time wave functions have provided a major motivation for considering the curved Born rule.
2.3 Locality Properties
Definition 5
\({\mathscr {E}}\) is propagation local (PL) if and only if
for all Cauchy surfaces \(\Sigma ,\Sigma '\) and all \(A\subseteq \Sigma \).
Here, \(R\le S\) means that \(S-R\) is a positive operator; if R and S are projections, then \(R\le S\) is equivalent to \({{\,\mathrm{Ran}\,}}R \subseteq {{\,\mathrm{Ran}\,}}S\). In words, (PL) means that if \( \Psi _\Sigma \) is concentrated in \( A \subseteq \Sigma \), i.e., \( \Psi _\Sigma \in {\mathscr {H}}_{\Sigma , A} \), then \( \Psi _{\Sigma '} = U_{\Sigma }^{\Sigma '} \Psi _{\Sigma } \) is concentrated in \(\mathrm{Gr}(A,\Sigma ') \). Also this definition is equivalent to the one given in [24].
Also the definition of interaction locality was already given in [24] but will be formulated here in a more detailed way. We begin with a summary of the condition: First, in a region A where \(\Sigma \) and \(\Sigma '\) overlap (see Fig. 6), \({\mathscr {H}}_{\Sigma , A}\) and \({\mathscr {H}}_{\Sigma ', A}\) can be identified. The identification fits together with P and T. Second, the time evolution from \(\Sigma \setminus A\) to \(\Sigma '\setminus A\) (see Fig. 6) is given by a unitary isomorphism \(V_{\Sigma \setminus A}^{\Sigma '\setminus A}:{\mathscr {H}}_{\Sigma \setminus A} \rightarrow {\mathscr {H}}_{\Sigma ' \setminus A}\), the “local evolution” replacing \(U_\Sigma ^{\Sigma '}\). The fact that one can evolve from \(\Sigma \setminus A\) to \(\Sigma '\setminus A\) means in particular that this evolution does not depend on the state in A, that is, there is no interaction term in the evolution that would couple \(\Sigma \setminus A\) to A. Finally, we require that \(V_{\Sigma \setminus A}^{\Sigma '\setminus A}\) does not change when we deform A while keeping it spacelike from \(\Sigma \setminus A\).
Definition 6
\({\mathscr {E}}\) is interaction local (IL) if it is equipped in addition with, for all Cauchy surfaces \(\Sigma ,\Sigma '\) and \(A\subseteq \Sigma \cap \Sigma '\), a unitary isomorphism \(J_{A, \Sigma }^{\Sigma '}: {\mathscr {H}}_{\Sigma , A} \rightarrow {\mathscr {H}}_{\Sigma ', A}\) (“identification”) such that
with some unitary isomorphism \(V^{\Sigma '}_{\Sigma \setminus A,\Sigma } : {\mathscr {H}}_{\Sigma ,\Sigma \setminus A} \rightarrow {\mathscr {H}}_{\Sigma ',\Sigma '\setminus A}\) such that for all \({{\tilde{\Sigma }}}\supseteq (\Sigma \setminus A)\), setting \({{\tilde{A}}}:= {{\tilde{\Sigma }}}\setminus (\Sigma \setminus A)\) and \({\tilde{\Sigma }}' := {{\tilde{A}}} \cup (\Sigma '\setminus A)\),
Henceforth, we will not mention the J-operators explicitly any more and following [24], we will simply write
Further, we will write \( V_{\Sigma \setminus A}^{\Sigma ' \setminus A} \) in place of \( V_{\Sigma \setminus A, \Sigma }^{\Sigma '} \), which is compatible with the Hilbert space identification.
Remark
-
11
Other notions of locality There are several inequivalent (though not unrelated) concepts of locality; they often play important roles in selecting time evolution laws (e.g., [16, 32]). In the Wightman axioms (e.g., [31, p. 65]), a locality condition appears that is different from both (IL) and (PL), viz., (anti-)commutation of field operators at spacelike separation. It seems clear that Wightman’s locality is closely related to (IL) and (PL), and it would be of interest to study this relation in detail in a future work. Another different locality condition is often called Einstein locality or Bell locality or just locality. It implies (IL) and (PL) but is not implied by (IL) and (PL) together; it asserts that there are no influences between events in spacelike separated regions; that may sound similar to (IL), but it is not. In fact, Bell’s theorem [2, 15] shows that Bell locality is violated, whereas (IL) seems to be valid in our universe.
-
12
Consistency condition It is known that multi-time equations require a consistency condition (e.g., [23, Chap. 2]). We note here that neither (IL) nor (PL) follow from the consistency condition alone. Indeed, examples of (artificial) multi-time equations with an instantaneous interaction (violating (IL)) that leaves the multi-time equations consistent were given in Lemma 2.5 of [6], while the non-interacting multi-time equations with Schrödinger Hamiltonians \(-\Delta _j\) for each particle j provide an example of consistent multi-time equations violating (PL).
-
13
Poincaré covariance While the flat Born rule is inspired by the thought that the full theory should be covariant under Poincaré transformations (i.e., Lorentz transformation and space-time translations), we do not assume covariance of the hypersurface evolution. To make this point, it may be helpful to say explicitly what it would mean for a hypersurface evolution \(({\mathscr {H}}_\circ ,U_\circ ^\circ ,P_\circ )\) to be Poincaré covariant: It would mean that for every properFootnote 6Poincaré transformation g and every Cauchy surface \(\Sigma \) there is a unitary isomorphism \(S_{g,\Sigma }: {\mathscr {H}}_\Sigma \rightarrow {\mathscr {H}}_{g\Sigma }\) (thought of as just Poincaré transforming the wave function without evolving it) such that
$$\begin{aligned}&S_{\mathrm {id},\Sigma }=I_\Sigma \,,~~~S_{h,g\Sigma } \, S_{g,\Sigma } = S_{hg,\Sigma } \end{aligned}$$(34)$$\begin{aligned}&U^{g\Sigma '}_{g\Sigma } = S_{g,\Sigma '} \, U^{\Sigma '}_\Sigma \, S_{g,\Sigma }^{-1} \end{aligned}$$(35)$$\begin{aligned}&P_{g\Sigma }(\forall (gA))=S_{g,\Sigma }\, P_\Sigma (\forall A) \, S^{-1}_{g,\Sigma } \end{aligned}$$(36)$$\begin{aligned}&T_{g\Sigma ,gA} \, S_{g,\Sigma } \, T_{\Sigma , A}^{-1} = S_{g,\Sigma }|_{{\mathscr {H}}_{\Sigma ,A}} \otimes S_{g,\Sigma }|_{{\mathscr {H}}_{\Sigma ,\Sigma \setminus A}} \end{aligned}$$(37)with \(T_{\Sigma , A}\) as in Definition 4item 4. The representation U(g) of the proper Poincaré group on \({\mathscr {H}}_{E_0}\) (\(E_0=\{x^0=0\}\)) that features (e.g.) in the Wightman axioms (e.g., [31, p. 65]) corresponds to
$$\begin{aligned} U(g) = U^{E_0}_{gE_0} \, S_{g,E_0}\,, \end{aligned}$$(38)that is, to using the Poincaré transformation g to shift \(\Psi \) from \(E_0\) and subsequently using the time evolution to bring the state vector back to \(E_0\).
3 Detection Process on Triangular Surfaces
We now give the detailed definitions of the sequential and parallel detection processes and prove Propositions 3 and 4.
To begin with, consider an admissible partition \({\mathscr {P}}=(P_1,\ldots ,P_r)\) of a Cauchy surface \( \Sigma \) and a vector \(L = (L_1,\ldots ,L_r) \in \{0,1\}^r\). Actually, in this section we will not make use of the assumption in Definition 1 that the boundaries \(\partial P_\ell \) are null sets, an assumption we need for Theorem 1.
The set of configurations in \( \Gamma (\Sigma ) \) compatible with the single outcome \(L_\ell \) at an attempted detection in \(P_\ell \) is
The set of configurations compatible with the measurement outcome vector L when detection is attempted in \(P_1,\ldots ,P_r\) is
Now consider a triangular surface \(\Upsilon =\bigcup _{k\in {\mathscr {K}}} \overline{\Delta _k}\) and an admissible partition \({\mathscr {B}}=(B_1,\ldots ,B_r)\) of \(\Upsilon \). For either the sequential or the parallel detection process on \(\Upsilon \), we imagine a small detector checking for particles in each
with outcome \(s_{k\ell }=1\) if a particle was found and \(s_{k\ell }=0\) otherwise.Footnote 7
We say that the outcome matrix s is compatible with L (denoted s : L) whenever
Let \(E_k\) be the hyperplane containing \(\Delta _k\). The configurations in \(E_k\) compatible with outcomes \( s_{k\ell } \) or \( s_k :=(s_{k1},\ldots ,s_{kr})\) are then given by
Likewise,
It follows that, based on the definition (40),
meaning that the symmetric difference between the two sets is a set of measure 0 in \(\Gamma (\Upsilon )\). This is the case because, as described in Footnote 7, the configurations in the symmetric difference have at least one particle in the 2d set \(\partial \Delta _k\) for some k.
3.1 Sequential Detection Process
We now formulate the definition of the sequential detection process and prove agreement with the Born rule. Fix an ordering of \({\mathscr {K}}\), i.e., a bijection \({\mathscr {K}}\rightarrow \mathbb {N}\). For ease of notation, we will simply replace \({\mathscr {K}}\) by \(\mathbb {N}\) using this particular ordering. The detection process is:
-
Set \(E_0=\{x^0=0\}\) and \(\Psi _0=\Psi _{E_0}\).
-
For each k in the specified order, do:
-
Evolve \(\Psi _{k-1}\) to \(E_k\).
-
Carry out detections of \(B_{k\ell }\) for all \(\ell =1,\ldots ,r\), i.e., quantum measurements of \(P_{E_k}(\exists (B_{k\ell }))\), and collapse accordingly, resulting in the (normalized) state vector \(\Psi _k\in {\mathscr {H}}_{E_k}\).
-
Repeat.
-
Note that by Definition 3, each \(B_\ell \) intersects only finitely many \(\Delta _k\). Thus, from some \(K+1\) onwards, all \(B_{k\ell }\) are empty, \(s_{k\ell }=0\), and no quantum measurement needs to be carried out in \(\Delta _k\). Hence, it suffices to consider finitely many repetitions in the above loop, namely those for k up to K.
From the flat Born rule and the flat collapse rule, we can now express the detection probabilities and the collapsed state vectors. Fix some k and \(\ell \); suppose that in the previous tetrahedra \(k'<k\) (i.e., none if \(k=1\)), the measurements have already been carried out with outcomes \(s_{k'\ell '}\); suppose further that in the previous detector regions \(B_{k\ell '}\) with \(\ell '<\ell \) (i.e., none if \(\ell =1\)) in the same tetrahedron \(\Delta _k\), the measurements have already been carried out with outcomes \(s_{k\ell '}\); suppose further that \(\Psi _{k,\ell -1}\) is the collapsed wave function after the previous measurements, which for \(\ell >1\) is given by the previous step, for \(\ell =1\) and \(k>1\) is given by
(with \(\Psi _{k-1,r}=\Psi _{k-1}\) in the notation of the process description above), and for \(\ell =1,k=1\) is given by
Conditional on the previous detection outcomes, the probability distribution of the next detection outcome \(s_{k\ell }\) is, by the flat Born rule,
and the state vector collapses, by the flat collapse rule, to
This completes the definition of the sequential detection process.
Lemma 1
Assume the flat Born rule and the flat collapse rule. Conditional on the measurements in the tetrahedra \(k'<k\), the joint distribution of all outcomes \((s_{k\ell })_{\ell =1..r}=s_k\) in \(\Delta _k\) is
and the collapsed wave function after the kr-measurement, given \(s_k\) with nonzero probability, is
Proof
It is a well known general fact about PVMs P that
and that a quantum measurement of \(P(S_1)\) with outcome \(s_1\) on \(\Psi \), followed by one of \(P(S_2)\) with outcome \(s_2\), have joint Born distribution
and collapsed state vector, given \(s_1=1,s_2=1\),
Iteration with r sets rather than 2 and the definition of \(M_{k E_k}(s_k)\) yield Lemma 1. \(\square \)
Lemma 2
(IL) implies that
Proof
Decompose \( {\mathscr {H}}_{E_k} = {\mathscr {H}}_{\Delta _k} \otimes {\mathscr {H}}_{E_k \setminus \Delta _k} \) and \( {\mathscr {H}}_{\Upsilon } = {\mathscr {H}}_{\Delta _k} \otimes {\mathscr {H}}_{\Upsilon \setminus \Delta _k} \). By (IL), we have that
We know that \(\Gamma (E_k)=\Gamma (\Delta _k) \times \Gamma (E_k\setminus \Delta _k)\). The set \(M_{k E_k}(s_k)\subseteq \Gamma (E_k)\) factorizes in the same way:
That is because whether a configuration q is compatible with the outcome \( s_k \), i.e., \( q \in M_{E_k}(s_k) \), does not depend on the points in q outside of \( \Delta _k \). Here, the set \( N_{k \Delta _k}(s_k) \subseteq \Gamma (\Delta _k) \) is defined in the analogous way to \(M_{k E_k}(s_k)\), i.e.,
where \(\exists _A(B)\) means the set of all configurations in \(\Gamma (A)\) with at least one particle in B. Hence, the projection \( P_{E_k} (M_{k E_k}(s_k)) \) decomposes into a tensor product
and by (57),
for the same reasons as (60). \(\square \)
Proposition 5
Assume the flat Born rule, the flat collapse rule, and (IL). The unconditional joint distribution of all outcomes, i.e., of the matrix s comprising all \(s_{k\ell }\), agrees with the Born distribution on \(\Upsilon \),
with \(\Psi _\Upsilon =U^\Upsilon _{E_0} \Psi _0\) (actually regardless of whether \(\partial B_\ell \) are null sets). In particular, the distribution of \(L=(L_1,\ldots ,L_r)\) is the Born distribution \(\Vert P_\Upsilon (M_{{\mathscr {B}}\Upsilon }(L))\Psi _\Upsilon \Vert ^2\).
Proof
As noted before, all \(s_{k\ell }\) vanish from some \(K+1\) onwards (and formulas below will take for granted they do), and we need consider only \(k\le K\). The fact, used before in (54), that for subsequent measurements the projections multiply, yields from Lemma 1 that
as claimed. \(\square \)
Proposition 4, insofar as it concerns the sequential detection process, follows from Proposition 5 (actually regardless of whether \(\partial B_\ell \) are null sets), and Proposition 3 follows further as the special case in which \(r=1\), \(B_1=B\), and \(B_{r+1}=B^c\).
3.2 Parallel Detection Process
We now formulate the definition of the parallel detection process and prove the Born rule for it. Throughout the whole subsection, (IL) is assumed.
The proof of Lemma 2 also shows that, analogously to (56),
As outlined in Sect. 1.3, the idea is to think of the detection attempt in \(B_{k\ell }\) as a quantum measurement of the observable
which is (65) for \(s_{k\ell }=1\). Since \(B_{k\ell }\) is non-empty only for finitely many k (for \(k=1,\ldots ,K\)), we are considering only finitely many observables. They commute because projections belonging to the same PVM always commute. Their simultaneous measurement is the definition of the parallel detection process.
We now prove the Born rule for the parallel detection process. When considering the simultaneous measurement of the operators (66), we need their joint diagonalization; the joint eigenspace with eigenvalues \((s_{k\ell })_{k\ell }\) is the range of
so the probability of the outcomes \((s_{k\ell })_{k\ell }\) is
and the probability of outcome L is
because the sets \(\bigcap _{k=1}^K M_{k \Upsilon }(s_{k})\) are mutually disjoint and thus their projections are mutually orthogonal, and because of (45) and property (i) in Definition 4. That is, the probability of outcome L agrees with the Born rule. This proves the statement about the parallel detection process in Proposition 4 and thus also in Proposition 3.
Another way of looking at the parallel detection process is based on tensor products: Since \(\Upsilon =\bigcup _{k=1}^K \Delta _k \cup R\) with remainder set \(R=\Upsilon \setminus \bigcup _{k=1}^K \Delta _k\), we have from Remark 9 in Sect. 2.2 that
By (IL), each \({\mathscr {H}}_{\Delta _k}\) can be regarded as a factor in \({\mathscr {H}}_{E_k} = {\mathscr {H}}_{\Delta _k} \otimes {\mathscr {H}}_{E_k \setminus \Delta _k}\). With the flat Born rule in mind, or with the idea that \(P_{E_k}\) is the configuration observable on \(E_k\), the attempted detection in \(B_{k\ell }\) can be regarded as a quantum measurement in \({\mathscr {H}}_{E_k}\) of the observable \(P_{E_k}(\exists (B_{k\ell }))\), which is of the form
Thus, the attempted detection in \(B_{k\ell }\) can also be regarded as a quantum measurement in \({\mathscr {H}}_{\Delta _k}\) of the observable \(P_{\Delta _k}(\exists _{\Delta _k}(B_{k\ell }))\). These observables commute for different \(\ell \) and equal k because they belong to the same PVM \(P_{\Delta _k}\), and they commute for different k in \({\mathscr {H}}_\Upsilon \) because of the tensor product structure (70). It follows that
with \(N_{k \Delta _k}\) as in (59), which agrees again with the Born rule on \(\Upsilon \), as claimed in Proposition 4.
4 Approximation by Triangular Surfaces
In this section, we prove Propositions 1 and 2.
Proof of Proposition 1
Fix an \(n\in \mathbb {N}\) and set \(\varepsilon =3^{-n}\). We construct a \( 3 \varepsilon \)-approximation \( \Upsilon _n \) to \( \Sigma \). First, consider the function \( f_t: \mathbb {M}\rightarrow \mathbb {M}, \; (x^0,{\varvec{x}}) \mapsto (x^0 - t, {\varvec{x}}) \), which “lowers a point by an amount t in time.” We use f to define the sets (see Fig. 7):
So \( \Sigma _{2\varepsilon } \) is a version of \( \Sigma \), lowered by \( 2 \varepsilon \) and \( \Sigma _{\varepsilon ..3\varepsilon } \) is a slice below \( \Sigma \) of thickness \( 2 \varepsilon \), centered at \( \Sigma _{2\varepsilon } \).
We now choose a decomposition of \( \mathbb {R}^3 \) into (non-regular) tetrahedra \( \mathbb {R}^3 = \bigcup _{k \in \mathbb {N}} \overline{{\tilde{\Delta }}^n_k} \) with open \( {\tilde{\Delta }}^n_k \) such that each pair of vertices \( {\varvec{x}}^n_{k,i}, {\varvec{x}}^n_{k,j}, i, j \in \{1,2,3,4\} \) has a distance \( \Vert {\varvec{x}}^n_{k,i} - {\varvec{x}}^n_{k,j} \Vert \le \varepsilon \) and such that every bounded region intersects only finitely many tetrahedra. For example, we may subdivide \(\mathbb {R}^3\) into axiparallel cubes with vertices on \(\frac{\varepsilon }{\sqrt{3}}\mathbb {Z}^3\) and subdivide each cube into 3! tetrahedra with vertices on \(\frac{\varepsilon }{\sqrt{3}}\mathbb {Z}^3\).
The four space-time points \( x^n_{k,i} := \pi |_{\Sigma _{2 \varepsilon }}^{-1} {\varvec{x}}^n_{k,i} \in \mathbb {M}\) (obtained by lifting \( {\varvec{x}}^n_{k,i} \) up to the \( 2 \varepsilon \)-surface, with \(i=1,2,3,4\)) span a spacelike open tetrahedron \( \Delta ^n_k\) in \(\mathbb {M}\). Now set \( \Upsilon _n := \bigcup _{k \in \mathbb {N}} \overline{\Delta ^n_k} \).
Claim. \( \Upsilon _n \) is a uniform \( \varepsilon \)-approximation of \( \Sigma _{2 \varepsilon } \), i.e., \(\Upsilon _n \subset \Sigma _{\varepsilon .. 3\varepsilon }\) (see Fig. 7).
Proof. Regard the surfaces \( \Upsilon _n \) and \( \Sigma _{2 \varepsilon } \) as the graphs of functions \(\mathbb {R}^3\rightarrow \mathbb {R}\), henceforth denoted simply by \( \Upsilon _n(\cdot )\) and \(\Sigma _{2 \varepsilon }(\cdot )\), that is, \((\Upsilon _n({\varvec{x}}),{\varvec{x}})\in \Upsilon _n\) for all \({\varvec{x}}\in \mathbb {R}^3\) and \(x=(\Upsilon _n(\pi (x)),\pi (x))\) for all \(x\in \Upsilon _n\). Both functions are Lipschitz-continuous with Lipschitz constant 1. Further, there is always a vertex of \( {\tilde{\Delta }}^n_k\) (possibly several ones) that maximizes \(\Upsilon _n(\cdot )\) on \(\overline{{\tilde{\Delta }}^n_k}\) (a “highest” vertex), and one (or several) that minimizes \(\Upsilon _n(\cdot )\) (a “lowest” vertex). Now consider the “height difference function” \( h({\varvec{x}}) = \Upsilon _n({\varvec{x}}) - \Sigma _{2 \varepsilon }({\varvec{x}}) \). (It is Lipschitz continuous with Lipschitz constant 2.) For any vertex \( x^n_{k,i} \), we have that \( h(\pi (x^n_{k,i})) = 0 \). And for any other point \( y \in \Delta ^n_k \), we have that \(|\pi (x^n_{k,i}) - \pi (y)|_{\mathbb {R}^3} < \varepsilon \), so by Lipschitz continuity,
If \( x^n_{k,i} \) is a highest vertex, then
(see Fig. 8). The same reasoning with a lowest vertex yields \( h(\pi (y)) > -\varepsilon \), so in total \( |h(\pi (y))| < \varepsilon \), which proves the claim.\(\Box \)
Claim. \( \Upsilon _n \) is a Cauchy surface.
Proof. We need to show that \(\Upsilon _n\) is intersected exactly once by every causal inextendible curve \(\gamma : (-\infty , \infty ) \rightarrow \mathbb {M}\). We regard \(\Upsilon _n\) again as the graph of an equally denoted function \(\Upsilon _n: \mathbb {R}^3 \rightarrow \mathbb {R}\). Now, consider the height difference function \(h(t) = \gamma ^0(t) - \Upsilon _n(\pi (\gamma (t)))\), which tells us “by how much \(\gamma \) is above \(\Upsilon _n\).” Since \(\Upsilon _n\) consists of spacelike tetrahedra, \(\Upsilon _n\) is Lipschitz-continuous with Lipschitz constant \(\le 1\). As \(\gamma \) is timelike-or-lightlike and w.l.o.g. directed toward the future, we have that h is strictly increasing, so there can be at most one t with \(h(t) = 0\). That is, there is at most one intersection of \(\gamma \) with \(\Upsilon _n\).
On the other hand, an intermediate value argument yields that there must be at least one intersection point: Otherwise, either \(h(t)>0\) for all t or \(h(t)<0\) for all t; w.l.o.g., assume the former case. Since \(\Upsilon _n\) is an \(\varepsilon \)-approximation to \(\Sigma _{2\varepsilon }\), we know that \(\gamma ^0(t)>\Upsilon _n(\pi (\gamma (t)))>\Sigma _{2\varepsilon }(\pi (\gamma (t)))-\varepsilon =\Sigma _{3\varepsilon }(\pi (\gamma (t)))\), which implies that \(\gamma \) does not intersect \(\Sigma _{3\varepsilon }\), but that is impossible because \(\Sigma _{3\varepsilon }\) is a Cauchy surface. \(\Box \)
We can now complete the proof of Proposition 1. Since \( \Upsilon _n \) approximates \( \Sigma _{2 \varepsilon } \) up to \(\varepsilon \), it approximates \(\Sigma \) up to \( 3 \varepsilon \). Furthermore, \( \Upsilon _n \subset \Sigma _{\varepsilon .. 3 \varepsilon } \) and \( \Upsilon _{n+1} \subset \Sigma _{\frac{1}{3} \varepsilon .. \varepsilon } \), and since \(\Sigma _{\frac{1}{3} \varepsilon .. \varepsilon }\) lies in the future of \(\Sigma _{\varepsilon .. 3 \varepsilon }\) while being disjoint from it, \(\Upsilon _{n+1}\) lies in the future of \(\Upsilon _n\) (see Fig. 7). This completes the proof of Proposition 1. \(\square \)
Proposition 2 follows from the following statement:
Proposition 6
Let \(\varepsilon >0\), \(\Sigma \) be a Cauchy surface, \(a_\varepsilon :=(\varepsilon ,0,0,0)\) the vertical 4-vector of length \(\varepsilon \), and \( g : \mathbb {M}\rightarrow \mathbb {M},\; g \in {\mathcal {P}}_+^{\uparrow } \) a proper Poincaré transformation. Then
with
with \(\beta \in [0,1)\) the boost velocity of g and \(\gamma :=(1-\beta ^2)^{-1/2}\) (the “Lorentz factor”).
\(\Sigma \) is being translated in the Proof of Proposition 6. Color online
Proof of Proposition 6
A Poincaré transformation g consists of a translation and a Lorentz transformation \(\Lambda \), which in turn consists of a rotation and a subsequent boost \(\Lambda _0\). The rotation leaves \(a_\varepsilon \) invariant. Thus, \(g[\Sigma + a_\varepsilon ]= g\Sigma + \Lambda _0 a_\varepsilon \). Without loss of generality, \(\Lambda _0\) is a boost in the \(x^1\) direction (see Figs. 9 and 10),
Consider any point \( x_a = (x^0_a,{\varvec{x}}_a) \in g\Sigma \). Denote by \( x_b = (x^0_b,{\varvec{x}}_b) \) the point on \( g[\Sigma +a_\varepsilon ] \) right above \( x_a \), \({\varvec{x}}_b={\varvec{x}}_a\). We want to show that \(x_b^0 \le x^0_a + {\tilde{\varepsilon }}\). Set \(x_c := x_a + \Lambda _0 a_\varepsilon \) Since \( g[\Sigma +a_\varepsilon ] \) is a Cauchy surface, any two points on it (such as \(x_b\) and \(x_c\)) must be spacelike separated, so
Now the triangle inequality implies the desired bound
\(\square \)
5 Proof of Theorem 1
Here is a quick outline of the proof. We want to show that
converges, as \(n\rightarrow \infty \), to
The proof is done by a squeeze-theorem argument: We will define two probability distributions \( {\widehat{\mathbb {P}}}_n \) and on \(\{0,1\}^r\) such that
![](http://media.springernature.com/lw410/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ83_HTML.png)
and prove that both converge to \(\mathbb {P}_{{\mathscr {P}}}(L)\) as \(n\rightarrow \infty \).
We go through some preparations for the proof. To begin with, it is easy to see that \({\mathscr {B}}_n=(B_{n1},\ldots ,B_{nr})\) with
is an admissible partition of \(\Upsilon _n\): First, \(B_{n\ell }\cap B_{nm}=\emptyset \) for \(\ell \ne m\) because \(\pi _\Sigma ^{\Upsilon _n}\) is a bijection. Second, \(B_{n\ell }\) is bounded because \(\pi _\Sigma ^{\Upsilon _n}\) maps bounded sets to bounded sets. Third, the boundary \(\partial B_{n\ell }\) of \(B_{n\ell }\) in \(\Upsilon _n\) is \(\pi _\Sigma ^{\Upsilon _n}(\partial P_\ell )\) because \(\pi _\Sigma ^{\Upsilon _n}\) is a homeomorphism. Finally, in order to obtain that \(\mu _{\Upsilon _n}(\partial B_{n\ell })=0\) we note that \(\mu _\Sigma (\partial P_\ell )=0\), that \(\Sigma \) (and \(\Upsilon _n\)) possesses a spacelike tangent plane almost everywhere (relative to Lebesgue measure \(\lambda \) on \(\mathbb {R}^3\)), and that, at points with a spacelike tangent plane, \(\mu _\Sigma \) possesses a nonzero density relative to \(\lambda \circ \pi _\Sigma \), so \(\mu _\Sigma \) and \(\lambda \circ \pi _\Sigma \) have the same null sets.
For the definition of we introduce more notation:
We define
![](http://media.springernature.com/lw276/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ85_HTML.png)
The corresponding sets of compatibility in configuration space \(\Gamma (\Sigma )\) are
![](http://media.springernature.com/lw506/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ86_HTML.png)
![](http://media.springernature.com/lw388/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ87_HTML.png)
The probability distributions that serve for the squeeze-theorem bounds are defined by
![](http://media.springernature.com/lw499/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ88_HTML.png)
Lemma 3
[Squeeze-theorem bound for \( \mathbb {P}_{{\mathscr {P}}} \)]. For all \(L\in \{0,1\}^r\),
![](http://media.springernature.com/lw220/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ89_HTML.png)
![](http://media.springernature.com/lw368/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ90_HTML.png)
![](http://media.springernature.com/lw215/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ91_HTML.png)
Proof
The statement is actually true for any triangular surface \(\Upsilon \), regardless of whether it belongs to a sequence converging to \(\Sigma \). Since we need it for \(\Upsilon _n\), we use here the notation that refers to \(\Upsilon _n\).
The inclusion
![](http://media.springernature.com/lw280/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ92_HTML.png)
is obvious, since \( \pi [{\widehat{C}}_{n\ell }] \) is a shrunk version of \( \pi [P_\ell ]\) (i.e., smaller) and is a grown version of it (i.e., larger).
We “lift” those sets to configuration space, keeping in mind that
By definition (86) we then have:
![](http://media.springernature.com/lw232/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ94_HTML.png)
Inclusions persist under intersections, i.e.,
This yields (89). The transition from sets M to projections P(M) as in (90) is straightforward, and sandwiching between \(\Psi _\Sigma \)’s yields (91). \(\square \)
Lemma 4
[Squeeze-theorem bound for \(\mathbb {P}_{{\mathscr {B}}_n}\)]. Assume (PL). Then, for all \(L\in \{0,1\}^r\),
![](http://media.springernature.com/lw402/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ96_HTML.png)
![](http://media.springernature.com/lw235/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ97_HTML.png)
Proof
Also this statement is actually true for any triangular surface \(\Upsilon \), regardless of whether it belongs to a sequence converging to \(\Sigma \).
By (PL) (27),
Since \((\exists A)^c = \emptyset A = \forall (A^c)\), we have that
and
Thus, inserting \(A\rightarrow B_{n\ell }\), \(\Sigma \rightarrow \Upsilon _n\), and \(\Sigma '\rightarrow \Sigma \),
On the other hand, inserting , \(\Sigma '\rightarrow \Upsilon _n\), and \(\Sigma \rightarrow \Sigma \),
![](http://media.springernature.com/lw300/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ102_HTML.png)
Since for \(A\subseteq \Sigma \) always
and since \(A\subseteq B\) implies \(\exists (A) \subseteq \exists (B)\) and \(\emptyset (A) \supseteq \emptyset (B)\), we have that
![](http://media.springernature.com/lw242/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ104_HTML.png)
Putting together (101), (102), (104),
![](http://media.springernature.com/lw334/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ105_HTML.png)
that is, in another notation,
![](http://media.springernature.com/lw423/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ106_HTML.png)
Now we want to conclude an analogous statement about L instead of \(L_\ell \). Note that \(U_{\Upsilon _n}^{\Sigma } \, P_{\Upsilon _n}(\cdot ) \, U_{\Sigma }^{\Upsilon _n}\) and \(P_\Sigma (\cdot )\) are two different PVMs that will in general not even commute with each other. The argument that we need has the following general form: For two different PVMs \( P_1, P_2 \), the ranges satisfy the relations
Applying this argument to (106) and the finite intersection \( \bigcap _{\ell } \) yields (96). \(\square \)
Lemma 5
Fix \(\ell \in \{1,\ldots ,r\}\); is a decreasing sequence of sets,
, with
![](http://media.springernature.com/lw241/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ108_HTML.png)
\({\widehat{C}}_{n\ell }\) is an increasing sequence of sets, \({\widehat{C}}_{n\ell } \subseteq {\widehat{C}}_{n+1,\ell }\), with
In particular,
![](http://media.springernature.com/lw152/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ110_HTML.png)
Moreover, equality holds in (108), (109), and (110) whenever \(\Upsilon _n\cap \Sigma =\emptyset \).
Convergence of the sets as \(n\rightarrow \infty \) for fixed \(\ell \) as in Lemma 5. Color online
Proof
The decreasing/increasing behavior of the sequence is a direct consequence of \(\Upsilon _{n+1}\subseteq \mathrm{future}(\Upsilon _n)\) and the definition of grown and shrunk set. For demonstrating (108), since \(\pi _\Sigma \) is a homeomorphism \(\Sigma \rightarrow \mathbb {R}^3\), it suffices to show that in \(\mathbb {R}^3\). If \({\varvec{y}}\notin \overline{\pi (P_\ell )}\), then it has positive distance to \(\pi (P_\ell )\) and \(\pi [[\Sigma -(\varepsilon ,0,0,0)] \cap \mathrm{past}(\pi _\Sigma ^{-1}({\varvec{y}}))]\) is disjoint from \(\pi (P_\ell )\) for sufficiently small \(\varepsilon >0\), so
for sufficiently large n. Similar arguments yield (109). Concerning the statement about equality, in that case for every \(x\in B_{n\ell }\), \(\mathrm{future}(x) \cap \mathrm{past}(\Sigma )\) has nonempty interior in \(\mathbb {M}\), so
contains an open neighborhood of \(\pi (P_\ell )\) and thus \(\overline{\pi (P_\ell )}\). Similarly for the interior. \(\square \)
Lemma 6
For every \(L\in \{0,1\}^r\), is a null set w.r.t. \(\mu _{\Gamma (\Sigma )}\).
Proof
We make use here of the requirement \(\mu _\Sigma (\partial P_\ell )=0\) in Definition 1. Consider first and \( {{\widehat{M}}_{n\ell }(L_\ell )} \). In case \( L_\ell = 1 \), we have that
![](http://media.springernature.com/lw292/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ111_HTML.png)
In case \( L_\ell = 0 \), we have that
![](http://media.springernature.com/lw259/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ112_HTML.png)
So either way,
![](http://media.springernature.com/lw391/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ113_HTML.png)
Now we want to consider L instead of \(L_\ell \). It is a general fact about sets that if \(A_\ell \subseteq B_\ell \) for all \(\ell \), then
Thus, for \(A_\ell = {\widehat{M}}_{n\ell }(L_\ell )\) and ,
![](http://media.springernature.com/lw462/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ115_HTML.png)
Now we want to take the intersection over all \(n\in \mathbb {N}\). In this regard, we first note the following extension of (93): if \((A_n)_{n\in \mathbb {N}}\) is a decreasing sequence of sets, then
After all, if q is a finite set that intersects every \(A_n\), then it must contain a point from \(\bigcap _n A_n\); conversely, a finite set q intersecting \(\bigcap _n A_n\) trivially intersects every \(A_n\).
Applying this to , which is decreasing because
is, we obtain that
![](http://media.springernature.com/lw338/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ117_HTML.png)
It is another general fact about sets (not unrelated to (116)) that if for every \(\ell \in \{1,\ldots ,r\}\), \((A_{n\ell })_{n\in \mathbb {N}}\) is a decreasing sequence of sets, then
Thus, for ,
![](http://media.springernature.com/lw435/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ119_HTML.png)
by Lemma 5 and (93). For any set A with \(\mu _\Sigma (A)=0\) it follows that \(\exists A\) is, in every sector of configuration space \(\Gamma (\Sigma )\), a finite union of null sets, so \(\mu _{\Gamma (\Sigma )}(\exists A)=0\). For \(A=\bigcup _{\ell } \partial P_\ell \) we obtain the statement of Lemma 6. \(\square \)
Proof of Theorem 1
By Lemma 3 and 4, it suffices to show that for every \(L\in \{0,1\}^r\),
![](http://media.springernature.com/lw222/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ120_HTML.png)
From Lemma 6 and the requirement (i) of Definition 4, according to which \(P_\Sigma \) must be absolutely continuous with respect to \(\mu _{\Gamma (\Sigma )}\), we have that
![](http://media.springernature.com/lw248/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ121_HTML.png)
The continuity property of measures \(\mu \) says that, for every decreasing sequence \(A_n\) of sets with \(\bigcap _n A_n =: A_\infty \), \(\mu (A_n) \rightarrow \mu (A_\infty )\) as \(n\rightarrow \infty \). For every \(\Psi _\Sigma \in {\mathscr {H}}_\Sigma \), \(\mu (\cdot ) := \langle \Psi _\Sigma | P_\Sigma (\cdot ) |\Psi _\Sigma \rangle \) is a measure. We know from Lemma 3 that .
We show that for every \(L\in \{0,1\}^r\), the sequence is decreasing: It suffices to show that
is decreasing and \( {\widehat{M}}_{n\Sigma }(L)\) is increasing. We know from Lemma 5 that
is decreasing and \({\widehat{C}}_{n\ell }\) is increasing, so by (93), both
and \(\emptyset {\widehat{C}}_{n\ell }\) are decreasing, so
(which is either
or \(\emptyset {\widehat{C}}_{n\ell }\), depending on \(L_\ell \)) is decreasing, and so is
![](http://media.springernature.com/lw173/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ122_HTML.png)
Likewise, \({\widehat{M}}_{n\ell }(L_\ell )\) (which is either \(\exists {\widehat{C}}_{n\ell }\) or , depending on \(L_\ell \)) is increasing, and so is \({\widehat{M}}_{n\Sigma }(L)\). Therefore, \(A_n\) is decreasing, as claimed.
We can conclude that
![](http://media.springernature.com/lw483/springer-static/image/art%3A10.1007%2Fs00023-021-01130-4/MediaObjects/23_2021_1130_Equ123_HTML.png)
This establishes the desired squeeze theorem argument and finishes the proof of Theorem 1. \(\square \)
Proof of Corollary 2
It is well known that for a sequence \(P_n\) of projections, weak convergence to the projection P (i.e., \(\langle \Psi |P_n|\Psi \rangle \rightarrow \langle \Psi |P|\Psi \rangle \) for every \(\Psi \)) implies strong convergence (i.e., \(P_n\Psi \rightarrow P\Psi \) for every \(\Psi \)).Footnote 8 Set \(P_n=U_{\Upsilon _n}^\Sigma P_{\Upsilon _n}(M_{{\mathscr {B}}_n}(L)) U_\Sigma ^{\Upsilon _n}\) and \(P=P_{\Sigma }(M_{{\mathscr {P}}}(L))\). Then Theorem 1 provides the weak convergence, and the strong convergence was what we claimed. \(\square \)
Remark
-
14.
Type of convergence of \((\Upsilon _n)_{n\in \mathbb {N}}\) The proof of Theorem 1 still goes through unchanged if the convergence of the sequence \((\Upsilon _n)_{n\in \mathbb {N}}\) is not uniform but uniform on every bounded set.
-
15.
Alternative definition of \(B_{n\ell }\) In order to avoid the choice of a particular Lorentz frame in the definition of \(B_{n\ell }\) and thus of the detection probabilities, we could replace \(B_{n\ell }\) by
(124)(The use of \(\mathrm{Gr}\) instead of \(\mathrm{Sr}\) would lead to overlap among the \(B_{n\ell }\), so they would no longer form a partition.) With this change, Theorem 1 remains valid. In the proof, we then need to modify the definition of \({\widehat{C}}_{n\ell }\) to
(125)while the definition of
is kept as it is. We would still use a preferred Lorentz frame for the definition of
, but that is a matter of the method of proof, not of the statement of the theorem. The proof goes through as before, except that (109) needs to be checked anew: it is still true because for every x in the 3-interior of \(P_\ell \), \(\mathrm{Gr}(\mathrm{Gr}(x,\Upsilon _n),\Sigma )\subset P_\ell \) for sufficiently large n.
Notes
We use the definition that a Cauchy surface [38] is a subset of space-time intersected by every inextendible causal (i.e., timelike-or-lightlike) curve in exactly one point. Thus, a Cauchy surface can have lightlike tangent vectors but cannot contain a lightlike line segment.
We use the Borel \(\sigma \)-algebra on \(\mathbb {M}\), \(\Sigma \), \(\Gamma (\Sigma )\) [24] etc.; when speaking of subsets, we always mean Borel measurable subsets.
In this paper, the “future” of a set R in space-time means the causal future, often denoted \(J^+(R)\) [27], as opposed to the timelike future \(I^+(R)\); note that \(R\subseteq J^+(R)\); likewise for the “past.”
One of us claimed in [24] that the null sets of \(\mu _\Sigma \), when projected to \(\mathbb {R}^3\) with \(\pi \), are exactly the null sets of the Lebesgue measure in \(\mathbb {R}^3\); this is equivalent to saying that the set of points of \(\Sigma \) with a lightlike tangent, when projected to \(\mathbb {R}^3\), is a null set. While we conjecture that this is true, we do not see how to prove it. The statement is neither used in [24] nor here.
Note that \(P_\Sigma \), restricted to subsets of \(\forall (A)\), maps \({\mathscr {H}}_{\Sigma , A}\) to itself and in fact defines a PVM on \({\mathscr {H}}_{\Sigma , A}\).
A proper Poincaré transformation is one that reflects neither space nor time; the set of proper Poincaré transformations is often denoted by \({\mathcal {P}}_+^{\uparrow }\).
We could also have defined \(B_{k\ell }\) by \(\overline{\Delta _k}\cap B_\ell \) instead of (41), but that would have caused a bit of trouble because these sets would not have been disjoint. Our choice (41), on the other hand, has the consequence, which may at first seem like a drawback, that \(\cup _k B_{k\ell }\ne B_\ell \) because we have removed the points on the 2d triangles where two tetrahedra meet. However, the set removed, being a subset of a countable union of 2d triangles, has measure 0 on \(\Upsilon \), and for any set \(A\subseteq \Sigma \) of measure 0, \(\exists (A)\) has measure 0 in \(\Gamma (\Sigma )\) and, by Definition 4, also \(P_\Sigma (\exists (A))=0\).
For the sake of completeness, here is a proof: First, \(P_n^2=P_n\) and \(P^2=P\) imply that \(\Vert P_n\Psi \Vert ^2=\langle \Psi |P_n^2|\Psi \rangle = \langle \Psi |P_n|\Psi \rangle \rightarrow \langle \Psi |P|\Psi \rangle = \Vert P\Psi \Vert ^2\). Second, since \(\langle \Psi |S|\Phi \rangle \) can be expressed through \(\langle \Psi \pm \Phi |S|\Psi \pm \Phi \rangle \) and \(\langle \Psi \pm i\Phi |S|\Psi \pm i\Phi \rangle \) (polarization identity [30, p. 63]), weak convergence implies \(\langle \Psi |P_n|\Phi \rangle \rightarrow \langle \Psi |P|\Phi \rangle \) for every \(\Psi \) and \(\Phi \). Now \(\Vert P_n\Psi - P\Psi \Vert ^2=\langle \Psi |(P_n-P)^2| \Psi \rangle =\langle \Psi |P_n^2-P_nP-PP_n+P^2 |\Psi \rangle =\Vert P_n\Psi \Vert ^2-\langle \Psi |P_n| P\Psi \rangle - \langle P\Psi |P_n|\Psi \rangle +\Vert P\Psi \Vert ^2 \rightarrow \Vert P\Psi \Vert ^2-\langle \Psi | P|P\Psi \rangle - \langle P\Psi |P|\Psi \rangle +\Vert P\Psi \Vert ^2=0\).
References
Ballesteros, M., Benoist, T., Fraas, M., Fröhlich, J.: The appearance of particle tracks in detectors. Commun. Math. Phys. 385: 429-463 (2021) arXiv:2007.00785
Bell, J.S.: On the Einstein-Podolsky-Rosen paradox. Physics 1, 195–200 (1964)
Bell, J.S.: Speakable and Unspeakable in Quantum Mechanics. Cambridge University Press (1987)
Bloch, F.: Die physikalische Bedeutung mehrerer Zeiten in der Quantenelektrodynamik. Phys. Z. Sowjetunion 5, 301–305 (1934)
Deckert, D.-A., Merkl, F.: External field QED on Cauchy surfaces for varying electromagnetic fields. Commun. Math. Phys. 345, 973–1017 (2016). arxiv: 1505.06039
Deckert, D.-A., Nickel, L.: Consistency of multi-time Dirac equations with general interaction potentials. J. Math. Phys. 57, 072301 (2016). arxiv: 1603.02538
Deckert, D.-A., Nickel, L.: Multi-time dynamics of the Dirac-Fock-Podolsky model of QED. J. Math Phys. 60, 072301 (2019). arxiv: 1903.10362
Dimock, J.: Dirac quantum fields on a manifold. Trans. AMS 269, 133–147 (1982)
Dirac, P.A.M.: Relativistic quantum mechanics. Proc. R. Soc. Lond. A 136, 453–464 (1932)
Dirac, P.A.M.: Lectures on Quantum Mechanics. Yeshiva University, Belfer Graduate School of Science, New York (1964)
Dirac, P.A.M., Fock, V.A., Podolsky, B.: On quantum electrodynamics. Phys. Z. Sowjetunion 2(6), 468–479 (1932). Reprinted in J. Schwinger: Selected papers on quantum electrodynamics, New York: Dover (1958)
Dürr, D., Goldstein, S., Zanghì, N.: Quantum equilibrium and the role of operators as observables in quantum theory. J Stat. Phys. 116, 959–1055 (2004). arXiv:quant-ph/0308038
Fewster, C.J.: A generally covariant measurement scheme for quantum field theory in curved spacetimes. In: Finster, F., Giulini, D., Kleiner, J., Tolksdorf , J. (eds.) Progress and Visions in Quantum Theory in View of Gravity, pp. 253–268. Springer, Berlin (2020) arXiv: 1904.06944
Fewster, C.J., Verch, R.: Quantum fields and local measurements. Commun. Math. Phys. 378, 851–889 (2020). arXiv: 1810.06512
Goldstein, S., Norsen, T., Tausk, D.V., Zanghì, N.: Bell’s theorem. Scholarpedia 6(10), 8378 (2011) http://www.scholarpedia.org/article/Bell%27s_theorem
Haag, R.: Local Quantum Physics. Springer, Berlin (1996)
Halvorson, H., Clifton, R.: No place for particles in relativistic quantum theories? Philos. Sci. 69, 1–28 (2002). arXiv:quant-ph/0103041
Hegerfeldt, G.: Instantaneous spreading and Einstein causality in quantum theory. Ann. Phys. 7, 716–725 (1998)
Kiessling, M., Lienert, M., Tahvildar-Zadeh, S.: A Lorentz-covariant interacting electron-photon system in one space dimension. Lett. Math. Phys. 110, 3153–3195 (2020). arXiv:1906.03632
Lienert, M.: A relativistically interacting exactly solvable multi-time model for two mass-less Dirac particles in 1+1 dimensions. J. Math. Phys. 56, 042301 (2015). arXiv:1411.2833
Lienert, M., Nickel, L.: A simple explicitly solvable interacting relativistic \(N\)-particle model. J. Phys. A Math. Theor. 48, 325301 (2015). arXiv:1502.00917
Lienert, M., Nickel, L.: Multi-time formulation of particle creation and annihilation via interior-boundary conditions. Rev. Math. Phys. 32(2), 2050004 (2020). arXiv:1808.04192
Lienert, M., Petrat, S., Tumulka, R.: Multi-Time Wave Functions: An Introduction. Springer, Heidelberg (2020)
Lienert, M., Tumulka, R.: Born’s rule for arbitrary Cauchy surfaces. Lett. Math. Phys. 110, 753–804 (2020) arXiv:1706.07074
Lienert, M., Tumulka, M.: Multi-time version of the Landau-Peierls formulation of quantum electrodynamics. In preparation (2021)
Lill, S., Nickel, L., Tumulka, L.: Consistency proof for multi-time Schrödinger equations with particle creation and ultraviolet cut-off. Ann. Henri Poincaré. 22: 1887–1936 (2021) arXiv:2001.05920
O’Neill, B.: Semi-Riemannian Geometry. Academic Press, San Diego (1983)
Petrat, S., Tumulka, R.: Multi-time wave functions for quantum field theory. Ann. Phys. 345, 17–54 (2014). arXiv:1309.0802
Petrat, S., Tumulka, R.: Multi-Time Formulation of Pair Creation. Journal of Physics A: Mathematical and Theoretical 47, 112001 (2014). arXiv:1401.6093
Reed, M., Simon, B.: Methods of Modern Mathematical Physics. Functional Analysis. Academic Press, San Diego (1980)
Reed, M., Simon, B.: Methods of Modern Mathematical Physics. Fourier Analysis, Self-adjointness. Academic Press, San Diego (1975)
Scharf, G.: Finite Quantum Electrodynamics, 2nd edn. Springer, Berlin (1995)
Schweber, S.: An Introduction To Relativistic Quantum Field Theory. Harper and Row, New York (1961)
Schwinger, J.: Quantum electrodynamics. I. A covariant formulation. Phys. Rev. 74(10), 1439–1461 (1948)
Thaller, B.: The Dirac Equation. Springer-Verlag, Berlin (1992)
Tomonaga, S.: On a relativistically invariant formulation of the quantum theory of wave fields. Prog. Theor. Phys. 1(2), 27–42 (1946)
Torre, C.G., Varadarajan, M.: Functional evolution of free quantum fields. Class. Quantum Gravity 16, 2651–2668 (1999). arXiv:hep-th/9811222
Wald, R.M.: General Relativity. University of Chicago Press, Chicago (1984)
Acknowledgements
We thank Matthias Lienert and Stefan Teufel for helpful discussions. This work was financially supported by the Wilhelm Schuler-Stiftung Tübingen, by DAAD (Deutscher Akademischer Austauschdienst) and also by the Basque Government through the BERC 2018-2021 program and by the Ministry of Science, Innovation and Universities: BCAM Severo Ochoa accreditation SEV-2017-0718.
Open Access
This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Karl-Henning Rehren.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Lill, S., Tumulka, R. Another Proof of Born’s Rule on Arbitrary Cauchy Surfaces. Ann. Henri Poincaré 23, 1489–1524 (2022). https://doi.org/10.1007/s00023-021-01130-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00023-021-01130-4