Symmetry, Reference Frames, and Relational Quantities in Quantum Mechanics

We propose that observables in quantum theory are properly understood as representatives of symmetry-invariant quantities relating one system to another, the latter to be called a reference system. We provide a rigorous mathematical language to introduce and study quantum reference systems, showing that the orthodox"absolute"quantities are good representatives of observable relative quantities if the reference state is suitably localised. We use this relational formalism to critique the literature on the relationship between reference frames and superselection rules, settling a long-standing debate on the subject.


Introduction
In classical physics, symmetry, reference frames and the relativity of physical quantities are intimately connected. The position of a material object is defined as relative to a given frame, and the relative position of object to frame is a shift-invariant quantity. Galiliean directions/angles, velocities and time of events are all relative, and invariant only once the frame-dependence has been accounted for. The relativity of these quantities is encoded in the Galilei group, and the observable quantities are those which are invariant under its action. Einstein's theory engendered a deeper relativity-the length of material bodies and time between spatially separated events are also frame-dependent quantities-and observables must be sought in accordance with their invariance under the action of the Poincaré group.
In quantum mechanics the analogues of those quantities mentioned above (e.g., position, angle) must also be understood as being relative to a reference frame. As in the normal presentation of the classical theory, the reference frame-dependence is implicit. However, in the quantum case, there arises an ambiguity regarding the definition of a reference frame: if it is classical, this raises the spectre of the lack of universality of quantum mechanics along with technical difficulties surrounding hybrid classical-quantum systems; if quantum, such a frame is subject to difficulties of definition and interpretation arising from indeterminacy, incompatibility, entanglement, and other quantum properties (see, e.g., [1,2,3] for early discussions of some of the important issues).
In previous work [4,5], following classical intuition we have posited that observable quantum quantities are invariant under relevant symmetry transformations, and examined the properties of quantum reference frames (viewed as physical systems) which allow for the usual description, in which the reference frame is implicit, to be recovered. We constructed a map which brings out the relative nature of quantities normally presented in "absolute" form in conventional treatments, which allows for a detailed study of the relativity of states and observables in quantum mechanics and the crucial role played by reference localisation.
The objectives for this paper are: 1) To provide a mathematically rigorous and conceptually clear framework with which to discuss quantum reference frames, making precise existing work on the subject (e.g., [6]) and providing proofs of the main claims in [5]; 2) to construct examples, showing how symmetry dictates that the usual text book formulation of quantum theory describes the relation between a quantum system and an appropriately localised reference system; 3) to provide further conceptual context for the quantitative trade-off relations proven in [4]; 4) to provide explicit and clear explanation of what it means for states/observables to be defined relative to an external reference frame, and show how such an external description is compatible with quantum mechanics as a universal theory; 5) to introduce the concepts of absolute coherence and mutual coherence, showing the latter to be required for good approximation of relative quantities by absolute ones, and demonstrating it to be the crucial property for interference phenomena to manifest in the presence of symmetry; 6) to address the questions of dynamics and measurement under symmetry, offering an interpretation of the Wigner-Araki-Yanase theorem based on relational quantities; 7) to analyse simplified models similar to those appearing in the literature purporting to produce superpositions typically thought "forbidden" due to superselection rules, and provide a critical analysis of large amplitude limits in this context guided by two interpretational principles due to Earman and Butterfield, leading directly to 8) to provide a historical account of two differing views on the nature of superselection rules ( [7,8] "versus" [10,9,6]), their fundamental status in quantum theory and precisely what restrictions arise in the presence of such a rule, showing how our framework brings a unity to the opposing standpoints; 9) to remove ambiguities and inconsistencies appearing in all previous works on the subject of the connection between superselection rules and reference frames; 10) to offer a fresh perspective, based on the concept of mutual coherence, on the nature and reality of quantum optical coherence, settling a long-standing debate on the subject of whether laser light is "truly" coherent. See also [11] for an important contribution on this topic. We provide general arguments and many worked examples to show precisely how the framework presented works in practice, and which simplify a number of models appearing in the literature.
Our paper constitutes further effort in a long line of enquiries (e.g., ([6, 12, 13, 14, 15, 16, 17]) aimed at capturing the relationalism at the heart of the quantum mechanical world view. The fundamental role of symmetry has not impressed itself strongly upon previous consideration of the relative nature of the quantum description, and we view this work (along with [4,5]) as opening new lines of enquiry in this direction. Our work is inspired by [6] and visits similar themes, and is complementary to recent work on resource theories (e.g., [6,18,19,20,21]), which focus primarily on practical questions surrounding, for example, high-precision quantum metrology.
We now provide standard mathematical background material, and will work in units where = 1.  (2) E(X) ≥ 0 for all X ∈ F, (3) E ( X i ) = E(X i ) for disjoint sequences X i ⊂ F (sum converging weakly).

Notation and Some Definitions
(Here ≤, ≥ denote the standard operator ordering.) Normalised poms represent observables (subject to extra constraints in the presence of symmetry, discussed below). Throughout this paper, the pair (Ω, F) will normally correspond to (R n , B(R n )) (or possibly subsets/subalgebras) with B(·) denoting the Borel sets. The operators E(X) are called effects (occasionally also pom elements or effect operators); they satisfy O ≤ E(X) ≤ ½. The unit operator interval [O, ½] comprises the set of all effects E(H). E(H) is convex as a subset of the real linear space of self-adjoint operators in L(H), and the collection of extremal elements is the set of projections, characterised as the idempotent effects. If all elements of a pom E are idempotent, then E is called a projection valued measure (pvm), and if E is defined on R, it defines a unique self-adjoint operator A := E(dλ) with spectral measure E A ≡ E. An observable defined by a self-adjoint operator, or equivalently, a pvm, will be called sharp, and all others unsharp. Normality is equivalent to σ-weak continuity. We will denote the trace class of H by L 1 (H) and the trace functional by tr [·]. Normal states are then obtained by setting A = C in Definition 2; any normal state is of the form A → tr [ρA] ≡ A ρ , where ρ ∈ L 1 (H) is a positive operator and tr [ρ] = 1. The set of normal states, denoted S(H), is (identified with) a σ-convex subset of the real vector space L 1 (H) sa of self-adjoint elements of L 1 (H). Henceforth all states are assumed to be normal, and we freely move between algebraic (linear functional) and spatial (density operator) notions of states. The extreme points of S(H), corresponding to the pure states, are given by the rank one projections, which will be denoted P ϕ ≡ |ϕ ϕ|, where ϕ ∈ H, ϕ = 1. We will usually identify pure normal states with unit vectors in H. States generate expectation-valued functionals L(H) sa → R on L(H) sa -the self-adjoint part of L(H) -and when restricted to E(H) can be viewed as generalised probability measures E(H) → [0, 1]. For a given pom E : F → L(H) and ρ ∈ L 1 (H) we will write X → p E ρ (X) for the probability measure X → tr [E(X)ρ] and if E = E A we use the shorthand X → p A ρ (X) to represent the measure X → tr E A (X)ρ . 2.2. Covariant poms and Localisability. Covariant poms will feature as reference quantities in the sequel, and their localisation properties will play an important role. We review these basic notions here.
Definition 3. Let U denote a unitary representation of a locally compact group G, and let F : F → L(H) be a POM whose outcome space Ω is a G-space. Then (U, F, H) is a system of covariance for G if F(g.X) = U (g)F(X)U (g) * for all g ∈ G, X ∈ F.
F is called a covariant pom for U . The triple (U, F, H) is called a system of imprimitivity if F is projection-valued. We often consider the case Ω = G and G abelian.
Remark 1. Systems of covariance/imprimitivity may also be defined for projective representations.
We give a definition relating to the localisability of poms-the so-called norm-1 property (see, e.g., [22]): The following is an immediate consequence. Lemma 1. If E satisfies the norm-1 property, then for any X for which E(X) = 0, there exists a sequence of unit vectors (ϕ k ) ⊂ H for which lim k→∞ ϕ k | E(X)ϕ k = 1.
This entails that such a pom gives rise to probability distributions which are (approximately) localisable in every set X for which E(X) = 0. In comparison, for a projection valued measure P, for any X with P(X) = 0, there is a unit vector ϕ ∈ H for which ϕ | P(X)ϕ = 1 (any unit vector in the range of P(X) will have this property). Hence poms with the norm-1 property have, in a limiting sense, the localisability properties possessed by all pvms.

Remark 2.
In the case of a covariant pom, we do not need to check all the subsets X to confirm the norm-1 property, as shown in the following lemma.
Lemma 2. Let E be a covariant pom with Ω = G. The following are equivalent.
(i) E has the norm-1 property.
(ii) E(X) = 1 for all X ∈ F with e ∈ X and E(X) = 0.
(iii) For all X ∈ F with e ∈ X and E(X) = 0, there exists a sequence of unit vectors (ϕ k ) such that lim k ϕ k |E(X)ϕ k = 1.
Proof. Assume (ii). Then for an arbitrary Y ∈ F with E(Y ) = 0, there exists g ∈ G such that e ∈ g.Y ∈ F holds. Thus (i) follows. The other relations are trivial. 4 2.2.2. Positions, Momenta, and Covariant Phase Space poms. For the case G = R, the position operator Q with spectral measure E Q : B(R) → L(L 2 (R)) acting by multiplication, and the strongly continuous representation U (x) = e ixP (P momentum) of R gives rise to a system of imprimitivity (U, E Q , L 2 (R)) under the covariance The momentum operator P with spectral measure E P : B(R) → L(L 2 (R)) satisfies, with V (p) = e ipQ , the following covariance relation with respect to boosts: yielding the system of imprimitivity (V, E P , L 2 (R)). Unsharp versions of position (for instance, smeared positions (e.g., [23])) are also covariant; indeed it is such a covariance requirement that defines the class of unsharp positions (analogously for unsharp momenta). Let µ be a probability ("confidence") measure on R. A smeared position observable E µ is defined as E µ (X) := (µ * E)(X) = R E(X + q)dµ(q) (4) where " * " denotes convolution of measures. Under the assumption of absolute continuity we can write µ(X) = X e(x)dx and E µ (X) ≡ E e (X) = (χ X * e)(Q). Such a quantity is called a smeared position observable with confidence function e. It is covariant, and in the limit that e becomes a delta function, or equivalently, the associated µ becomes a point measure, the sharp position is returned. An example of a necessarily unsharp covariant quantity is provided by a covariant phase-space pom M : B(R 2 ) → L(L 2 (R)) which is both shift and boost covariant, i.e., W (q, p)M (Z)W (q, p) * = M (Z + (q, p)), where W (q, p) := e (−i/2)qp e −iqP e ipQ are the Weyl operators. M contains unsharp positions and momenta as marginals.
We now turn to the case of G = S 1 which plays a major role in the rest of the paper. 4 The restriction on X ∈ F for E(X) = 0 is not needed in the case of S 1 and R d . Let E be a covariant pom with a compact Ω = G. Then the following are equivalent.
(ii) E(X) = 1 for all X ∈ F with e ∈ X.
(iii) For all X ∈ F with e ∈ X, there exists a sequence of unit vectors (ϕ k ) such that lim k ϕ k |E(X)ϕ k = 1.
Proof. Assume (ii). Suppose that there is a neighbourhood X of e satisfying E(X) = 0. G is covered by {g.X}, which can be reduced to a finite cover {gn.X}. E(G) ≤ n E(gn.X) = 0 gives a contradiction.

Covariant Phases.
We identify S 1 with [0, 2π] or occasionally [−π, π] (identifying also the endpoints of these intervals); θ → U (θ) = e iN θ is a strongly continuous unitary representation, for self-adjoint N , of S 1 in L 2 (S 1 ) and F : B(S 1 ) → L(L 2 (S 1 )) is called a covariant phase pom if (where ∔ denotes addition modulo 2π). There is a constraint on the spectrum of the unique self-adjoint generator N . Using the spectral representation of N , N = xE N (dx), we have e i2πx E N (dx) = ½ so that the spectrum of N must consist of integers. Recall that the generator N associated with a phase shift group is called a number operator. We consider three typical cases of generators N and covariant phase poms associated with them.
Example 1. Consider the canonical pair of an angular momentum component and the associated angle variable of a particle in three dimensions. In this case, N = L z (say), where L z generates rotations about the z axis, and as a covariant phase pom one can take the spectral measure of the self-adjoint azimuthal angle operator, E = E Φ , where Φψ(r, θ, φ) = φψ(r, θ, φ). Note that the spectrum of N is Z.
Example 2. The second example is motivated by the number operator counting the eigenvalues of the harmonic oscillator Hamiltonian. The associated phase poms are covariant under rotations in phase space. These cannot be pvms due to the fact that N is bounded from below [27,28]; the preceding example then figures naturally as the minimal Naimark extension of the canonical phase (defined presently). Thus, let {e n } be an orthonormal basis in H ≃ ℓ 2 and N := ∞ n=0 nP [e n ] ≡ ∞ n=0 nP n be a number operator. Any covariant phase pom conjugate to N is known to be of the form where (c nm ) is a so-called phase matrix -a positive matrix for which c nn = 1 for all n ∈ N.
The canonical phase F can is singled out by the condition c n,m = 1 for all n, m ∈ N ∪ {0}. F can is characterised by various optimality properties (see [26]), in particular it satisfies the norm-1 property.
Example 3. As the third example we consider covariant phase poms in finite dimensional Hilbert spaces; one such instance is the spin phase (e.g., [23]). The norm-1 property and the associated localisability are lost when we move to the finite dimensional setting, as shown in Lemma 3 below.
Let H ≃ C d and consider the operator N ∈ L(H) defined by N = d−1 n=0 nP n . An example of a covariant phase pom is given by For a set X ∈ B([0, 2π)), we denote its Lebesgue measure by |X|. Proof. The inequality follows immediately from tr [ρF(X)] ≤ tr [F(X)] and the fact that due to the covariance condition (6) the phase distribution is uniform in the number states, i.e., for each θ, n | F(X)|n = n | F(X ∔ θ)|n , therefore n | F(X)|n = |X|/(2π), and so tr [F(X)] = d|X|/(2π).
We note that all covariant phase poms are absolutely continuous with respect to the Lebesgue measure. The localisation lemma puts stringent bounds on the magnitude of the localisation probability if |X| is small. Conversely, in order to get high localisation probability (close to 1) for small intervals X in a finite dimensional system, one needs to choose the dimension d to be large. We will use covariant quantities to construct reference frames in the relativisation model, where large (typically infinite-dimensional) Hilbert spaces are required for the reference system to to be good, in a sense to be discussed.

Symmetry
3.1. Observables as Invariant Quantities. A quantum system S is constrained to behave in accordance with the symmetries of the spacetime it inhabits and to the concomitant conservation laws that arise. An upshot of such a constraint is that certain quantities require two systems for their definition or, more colloquially, require a reference frame. Henceforth, absolute quantities will be understood as those whose formal representation does not explicitly rely on such a reference, which is therefore viewed as external [6]. Such absolute quantities should not be taken to be observable: 5 the absolute position of a system is not meaningful, but both the relative positions of parts of S as a compound system and the position of S (or, e.g., its centre of mass) relative to some other system R is. Relative position is a shift-invariant quantity. We proceed with the hypothesis that what can be measured is invariant with respect to the relevant transformation group, with particular emphasis on the group of phase shifts.
Thus, a (locally compact) symmetry group G acts in the Hilbert space H S of the system S via a (strongly continuous, projective) unitary representation U . In non-relativistic quantum mechanics G (the spacetime symmetry group) is the Galilei group, and the stipulation of symmetry is that any pom E of S to be deemed observable must satisfy U (g)E(X)U (g) * = E(X) for all g ∈ G and X ∈ B(Ω) (Ω is any appropriate G-space). In this paper we simplify the problem, treating only unitary representations, and focus on shifts in one dimension (G = R), and rotations (G = S 1 ). The latter case has a spacetime realisation as rotations about an axis, and an "internal" realisation as shifts in phase of, say, a laser beam.
3.2. Number and Phase. Consider a (possibly unbounded) number operator N = n nP n acting in H S , generating a strongly continuous unitary representation U S of S 1 in H S via the unitary operators U S (θ) := e iN S θ , giving rise by conjugation to an action on L(H S ), i.e., A → e iN S θ Ae −iN S θ , and on states ρ → e −iN S θ ρe iN S θ .
Consider the mapping τ S : L(H S ) → L(H S ) defined by with its predual τ S * : T 1 (H S ) → T 1 (H S ) taking the same form: τ S * is familiar from various contexts. In the quantum theory of measurement, it is the Lüders map arising from a non-selective measurement of N S ; in an optical setting it is called a dephasing channel. It also appears in decoherence theory. τ S * is trace-preserving, and hence bounded and trace-norm continuous. It can be shown that for any pure state P [φ], τ S * (P [φ]) is the mixture of N S -eigenstates which minimises the Hilbert-Schmidt distance from P [φ]; we omit the proof.
Proposition 1. Let µ denote the (normalised) Haar measure on S 1 . For self-adjoint A ∈ L(H S ) the following are equivalent: (1) [A, P n ] = 0 for all n (and thus [A, N S ] = 0 for bounded N S ). 5 To be proper, we should write "absolute" in quotation marks to emphasise that such quantities are not represented in reality, but are mere notational short-hands. We avoid this only for aesthetic reasons.
We omit the proof of (1) which is straightforward, and note that (2) follows from the duality the final equality is not part of the proof, but shows that demanding states or observables to be invariant is equivalent to demanding the invariance of both. The proposition holds also if appropriately rephrased for unsharp E in place of A. This shows that no invariant quantity (i.e., no boda fide observable) of S can distinguish between ρ and its invariant "counterpart", τ S * (ρ). Operationally, then, the stipulation of invariance of observables partitions the state space into equivalence classes of indistinguishable states under the obvious equivalence relation. In the dual picture, keeping to invariant states means absolute and invariant quantities cannot be distinguished. Thus, stipulating that either states or observables must be invariant constitutes a restriction to ordinary quantum theory, and only if both states and observables are unrestricted do we have the usual textbook description.
3.3. Position and Momentum. The shift group on R is unitarily implemented in L 2 (R) by the operators U (x) = e ixP , with P the momentum operator in one space dimension. The spectral measure E Q of position Q is singled out (among spectral measures) by the condition E Q (X − x) = U (x)E Q (X)U (x) * . Unsharp positions also satisfy such a covariance criterion, and thus, as noninvariant quantities, absolute positions (sharp or unsharp) do not represent observable quantities, reflecting the lack of absolute space. However, it may be possible to distinguish separate parts of a given quantum system, S and R, and it may be possible to speak of the position of S relative to R, and therefore to measure the shift-invariant quantity Q S ⊗ ½ − ½ ⊗ Q R or, indeed, any other shift-invariant quantity of S + R.

Similar considerations apply to boosts.
Remark 3. The non-compactness of (the shift group on) R rules out the existence of a normalisable Haar measure playing the role of µ in Proposition 1 (as also pointed out recently by Smith et al. [29]). Therefore, the absolute position Q S should be understood as representing the relative position Q S − Q R , in the situation that the Q R system may be suppressed, or "externalised" [6] from the description. We now turn to a general analysis of the possibility of such an externalisation for arbitrary groups and relative quantities, before turning once more to typical examples.

Relativisation
In this section we introduce a relativisation mapping , and prove various mathematical properties satisfied by it. We discuss the physical interpretation of as the making explicit of a reference system, and in the following section show that under high localisation of the reference system with respect to an appropriate covariant quantity used to define , the description of the system alone in terms of absolute, non-invariant quantities can provide a statistically good account of the relative quantities. Conversely, it is shown that in the case of reference system delocalisation, the description afforded by system quantities is necessarily invariant, giving generally poor representation of relative observables. The map generalises (by considering a pom for the reference and more general groups) and makes mathematically precise (by avoiding improper states, and giving rigorous meaning to the integral) the "$" map of [6]. We also introduce the predual, * , which de-relativises states, replacing the erroneous use of $ also on states in [6].
will be called a relativisation map, various properties of which will be given in proposition 8. also acts on poms by ( • E)(X) := (E(X)).
We must first give the definition of this integral. If H R is finite-dimensional and G is compact and metrisable, there exists a unique positive T such that [30]. Then [4], the integral (12) may be defined by For the case that H S and H R are of infinite dimension, more work is required (see also [31], section 5.4). Let G be a locally compact second countable group. We first construct for a subset A ⊂ L(H S ) on which the action α g is norm continuous, noting that this subset is weakly dense; see the discussion below.
For e ∈ G, for any ǫ > 0 there exists a neighbourhood U such that α g (A) − A ≤ ǫ A . By translating this U we obtain a covering G = ∪ g∈G gU . The Lindelöf property of G allows us to obtain a countable cover G = ∪ i U i out of it. By taking their intersections we obtain a disjoint countable cover (a mesh) G = ∪ n V n such that for any g, g ′ ∈ V n it holds that α g (A) − α g ′ (A) ≤ ǫ A . Let {ǫ N } be a decreasing sequence converging to 0. By employing the above construction we can construct a mesh G = ∪ n V N n for each N so that V N n = ∪ m∈H N M V M m holds for each N ≤ M with some proper H N M ⊂ N. (That is, a mesh of M is strictly finer than that of N .) We choose g N n ∈ V N n for each n (and N ). Now we assume a covariant pom E(·) on the reference side to be projection-valued. This suffices since any pom F can be dilated to a pvm by Naimark extension.
We introduce for each N , the mapping It is easy to see that this is bounded. In fact, for an arbitrary normalised vector |ψ ∈ H S ⊗ H R , it holds that where p N n := ψ|½ ⊗ E(V N n )|ψ and ρ N n is a density operator uniquely determined by tr[ρ N n X] = ψ|X ⊗ E(V N n )|ψ . Now we show that the sequence { N (A)} is a Cauchy sequence. As an arbitrary A can be decomposed into two self-adjoint operators, it suffices to show the property for self-adjoint A. For ǫ > 0, For an arbitrary normalised |ψ ∈ H S ⊗ H R , we have Now we discuss the density of A in L(H S ). We assume that the action is weakly continuousa natural assumption from a physical point of view. In addition the action is assumed to be implemented by a unitary operator U (g). Then one can see that the representation U (g) is strongly continuous. We can define, for a smooth function f whose support is compact in G, A(f ) := µ(dg)U (g)AU (g) * f (g). For such A(f ) the action α g is norm continuous. We may introduce A as a subalgebra generated by such elements. If we take f to be localised around the unit of G, A(f ) gets close to A with respect to the weak topology. Thus A is dense in L(H S ).
As a final remark, we show that this (A) can be defined on the whole L(H S ) for Abelian G implemented by a true unitary representation. As each U (g) commutes with each U (g ′ ), their generators can be diagonalised simultaneously. For simplicity we treat here only G = R and write its generator as K = R kP(dk). Now we introduce P E = |k|≤E P(dk). Then one can see that for any A ∈ L(H S ), P E AP E is a "smooth" element (i.e., α g (P E AP E ) is norm continuous) and (P E AP E ) is defined. In addition its norm is bounded as (P E AP E ) ≤ A . Now for arbitrary vectors |ψ , |φ ∈ H S ⊗ H R , we define a sesquilinear form Q(|φ , |ψ ) by lim E→∞ φ| (P E AP E )|ψ . We first confirm that this is well-defined. For any ǫ > 0, there exists E 0 such that (½ − P E 0 )|ψ , (½ − P E 0 )|φ ≤ ǫ. Then for any E ≥ E ′ ≥ E 0 , we have Thus this sequence is Cauchy. On the other hand, each quantity is bounded by A |ψ |φ . Thus it converges. It is also easy to see that this sesquilinear form is bounded as Q(|ψ , |φ ) ≤ A |ψ |φ . Thus there exists an operator (A) satisfying ψ| (A)|φ = Q(|ψ , |φ ). Moreover, it is easy to see that such defined (A) is bounded as (A) ≤ A .  is positive and hence bounded.
is completely positive.
If U S (g)AU S (g) * = A for all g ∈ G, then (A) = A ⊗ ½. Proof.
(1) These properties follow immediately from the definition. (2) To prove positivity, consider ψ | (A)ψ for ψ ∈ H T and assume that A is positive, Writing γ g (A) = U S (g)B 2 U S (g) * and B 2 = m B|ϕ m ϕ m |B, the expression for ψ | (A)ψ becomes where we have defined ξ m (g) := k,l c kl ϕ m | BU S (g) * ϕ k φ l . The right hand side of the expression (16) is manifestly positive. Any positive (linear) map between C * -algebras with unit is automatically bounded-see [32], Prop. 33.4. Now we note that the right hand side of (16) can be written we introduce an orthonormal basis {η k } ⊂ C n . Then the proof runs along essentially the same lines as (2) above, and one finds that for Ψ = i,j,k c ijk ϕ i ⊗ φ j ⊗ η k , and repeating the argument, letting A = B 2 = p B|ϕ p ϕ p |B, Ψ | (A) ⊗ ½Ψ is given as which may be written as where ζ g (p) := l,m,n ϕ p | BU (g) * ϕ l φ m ⊗ η n . Thus, by the same argument as in (2), is completely positive.
The product measure defined by X × Y → F(X)F(Y ) is zero whenever X ∩ Y = ∅, and hence the right hand side of (20) reduces to which is the expression for ϕ ⊗ φ | (AB)ϕ ′ ⊗ φ ′ . (7) We compute: Therefore to each bounded self-adjoint operator of the system the map assigns a bounded self-adjoint operator (see below) (A) acting in H T which is invariant under the action of U .
Remark 4. From (5) we observe that not only relativises self-adjoint operators, but also their spectral measures, and more generally any pom. An example of the latter, which we will encounter in subsection 4.2.3, is the relativisation of a covariant phase pom, resulting in a relative phase observable.

4.2.
Examples. In this subsection we give examples of familiar relative quantities obtained under to demonstrate that functions as expected. Its main utility, however, lies in the fact that it relativises arbitrary quantities.

Position and Momentum.
Consider the spectral measure E Q S of the position Q S , E Q R of Q R , and unitary shifts U S (x) = e ixP S and U R (x) = e ixP R . Then, which may be written as This is easily recognised as the spectral measure of the relative position Therefore, one may formally write (Q S ) = Q S − Q R . Under the given relativisation, the spectral measure E P S of the momentum P S takes the simple form ( • E P S )(X) = E P S (X) ⊗ ½, and again we write (P S ) = P S ⊗ ½.
Relativisation of momentum under boosts follows an identical argument; we find that (using the same symbol for relativising with respect to boosts) yielding which is the spectral measure of P S − P R . Under this relativisation, Q → Q ⊗ ½.
Remark 5. With respect to the boost part of the Galilei group, the momentum relativisation assumed S and R are of equal mass. For systems with different mass, i.e., m S and m R , must be appropriately redefined.
We note also the possibility of unsharp relativisations, that is, allowing for one or both of the spectral measures E Q S and E Q R to be replaced by unsharp or smeared (covariant) positions. The same applies for unsharp momenta.
As an example, which we revisit in section 6, we may take a smeared position (χ X * e)(Q R ) ≡ E Q R e (X) for the reference, yielding 4.2.2. Angle. Here we consider two systems with Θ S , Θ R being their angle operators conjugate to the z-components of their angular momenta. For the covariant pvm of R we thus choose F = E Θ R . Then we obtain: Exploiting the covariance of the spectral measure E Θ S and performing the substitution θ + θ ′ ≡ θ ′′ we find the above equal to Therefore, (Θ S ) = Θ S − Θ R .

4.2.3.
Phase. We may use to construct a relative phase observable as given in [25,33]. Let F be a covariant phase pom as defined in (7), and denote by F R a covariant phase for R. Then • F is given by: where c n,m,k,l ≡ c n,m c ′ k,l . Writing |n, k ≡ |n ⊗ |k , we have which is a relative phase observable.

Basic Properties.
Consider now a fixed state ω of R and the isometric embedding V ω :  (1) Γ ω is linear, unital, adjoint-preserving.
(4) Γ ω is a (normal) conditional expectation in the sense of von Neumann algebras.
We recall (e.g., [30]) that given L(H) and a von Neumann subalgebra W ⊂ L(H) a normal conditional expectation E : L(H) → W is a positive, adjoint-preserving normal map satisfying the additional properties i) E(X) = X if and only if X ∈ W and ii) [30,Ch. 9]; for the final part we view L(H S ) as a subalgebra of L(H T ) by We extend by linearity to finite sums i,j A i ⊗ B j ∈ L(H T ), and to infinite sums by continuity.
Thus we will sometimes refer to Γ ω as a restriction channel. Γ ω restricts poms of S + R to those of S, and is used to translate back from the relative picture to the absolute one, contingent upon the state ω of R. For a pure product state, for example, for a given self-adjoint R ∈ L(H T ) and fixed unit φ ∈ H R the expression · ⊗ φ | R · ⊗φ determines a bounded, real valued quadratic form H S × H S → C and therefore a unique bounded self-adjoint operator The restriction map is related to the trace as follows. By the duality L(

Further
Properties. The restriction maps Γ ω have further properties of interest, which we collect here. For the purpose of characterising the relationship between the choice of state ω and the quantities thus obtained under Γ ω it is convenient to introduce covariant channels. Definition 6. Let U and V be unitary representations of G in Hilbert spaces H and K respectively, and let Λ : Covariance for maps acting on the trace class takes an obvious analogous form. The next lemma demonstrates that the restriction map Γ ω applied to an invariant quantity is invariant if ω is invariant.
Proof. Writing U (g) = U S (g) ⊗ U R (g), the first part follows immediately from the covariance condition to hold for an arbitrary bounded operator i,j A i ⊗ B j ∈ L(H) (or a limit of such terms) and all Thus, for invariant (observable) R ∈ L(H T ), the only possible way to achieve a non-invariant restriction Γ ω (R) is by choosing a non-invariant ω. An invariant ω therefore yields, in the case of number/phase, restricted quantities satisfying (1)-(4) of Proposition 1.
A simple calculation shows that the partial trace map tr H R : is covariant with respect to U S and U (and analogously for the partial trace over H S ). The following is a trivial consequence: Hence we have the following:

Restrictions after .
The restriction map Γ ω may be composed with the relativisation map to give, for arbitrary G, where the measure µ F ω := ω•F (or, for a density operator ρ corresponding to ω, µ F ρ (X) = tr [F(X)ρ]). As we shall see in the next section, the measure µ F ω dictates the proximity of A and (Γ ω • )(A).

Localisation and Delocalisation
6.1. High Localisation. Recall (Lemma 1) that if a pom F satisfies the norm-1 property then for each X for which F(X) = 0 there exists a sequence of unit vectors (φ n ) ⊂ H R such that lim n→∞ φ n | F(X)φ n = 1. This "localising sequence" (φ n ) allows for the expression of expectation values of relative observables to be given in terms of those of absolute quantities to arbitrary precision: Theorem 1. Let F have the norm-1 property and let G be either S 1 (which we identify with the interval [−π, π)) or R written additively with identity 0. If (φ n ) ⊂ H R is a sequence of unit vectors which becomes well localised at g = 0, then for each Proof. Assuming without loss of generality that ϕ = 1, we write where c n is the "error" for each n which we show goes to zero as n becomes large.
Let ∆ n = (−1/2n, 1/2n); then Now the φ n are chosen as follows: for each n, there is a φ n for which | φ n | F(∆ n )φ n − 1| < 1/n and | φ n | F(G\∆ n )φ n | < 1/n. Therefore the second term is bounded above by 2 A G\∆n φ n | F(dg)φ n which vanishes in the limit. For the first term, writing From the continuity for self-adjoint A of the real function f A ϕ it follows that f A ϕ (g) → 0 as g → 0, and therefore the right hand side of (44) goes to zero in the g → 0 limit. This extends to arbitrary bounded A, as can be seen by decomposing A into real and imaginary parts. Therefore by choosing a localising sequence (φ n ) ⊂ H R we can make ϕ ⊗ φ n | (A)ϕ ⊗ φ n as close to ϕ | Aϕ as we like. This result rests crucially on the assumption that the chosen F satisfies the norm-1 property. The main result may thus be rephrased (for the localising sequence (φ n )) in terms of and Γ as follows: in the weak topology on L(H S ), that is, in the topology of pointwise convergence of expectation values.
Example 4. Qubit algebra. Consider the space L(C 2 ) and a basis of Pauli operators with identity: {½, σ 1 , σ 2 , σ 3 }. Let N S := 1 2 (½ + σ 3 ) (which has spectrum {0, 1} and corresponding eigenvectors denoted |0 , |1 ). We addend an infinite dimensional reference system, H R , with "number basis" {|n : n ∈ N}, thus defining N R := ∞ n=0 n|n n| . Then we may use F ≡ F can on H R (see eq. (7)) to construct , and we find that The possibility of good phase localisation of states with respect to F can allows the entire qubit algebra L(C 2 ) to be recovered in the following way. Let A ∈ L(C 2 ) be an arbitrary self-adjoint element, and ϕ ∈ C 2 an arbitrary unit vector. Define |φ n = 1 √ n+1 n j=0 |j , which represents an approximately localised phase centred at zero. Remark 6. The property that {|φ n } represents an approximate phase eigenstate at phase value θ = 0 for F can means the following: for every δ > 0, the probability of localisation in the interval [−δ/2, +δ/2] approaches 1 as n → ∞. Thus, We sketch a proof of this property. In fact, we will find that the speed of convergence can be specified more precisely: we can allow δ to tend to zero as δ = δ n := (n + 1) (−1+ǫ)/2 for any ǫ ∈ (0, 1). We put ∆ n := [− δn 2 , δn 2 ] and X n = [−π, π] \ ∆ n .
Thus, we show that the probability p n := φ n F can [−π, π] \ ∆ n φ n → 0 as n → ∞. We have we see that as n becomes large, we indeed reproduce for any unit vectors ϕ ∈ H S the expectation values of the basis operators ½, σ 1 , σ 2 , σ 3 , and therefore for all A ∈ L(C 2 ): In conclusion, we see that by making the reference system explicit and taking the limit of a highly phase-localised state, the statistics of any absolute qubit effect offers an accurate representation of the relative qubit effect in H S ⊗ H R .
Example 5. Finite cyclic group. We may construct so that (A) is invariant with respect to a unitary representation U S ⊗ U R of some finite cyclic group G. Hence, let G be a group of cyclic permutations of a finite index set I, which can therefore be identified with G. We consider a Hilbert space H R that allows a direct sum decomposition into subspaces of equal dimension, H R = i∈I H R,i , and define a unitary representation U R : G → L(H R ) such that for any g ∈ G, U R (g) maps a given orthonormal basis of each H R,i to a given orthonormal basis of With so that φ may be chosen to be any unit vector φ ∈ H R,e , where e is the identity element of G: in this case P g φ = δ g,e φ, and (53) collapses to i.e., Γ φ ( (A)) = A. Therefore, by choosing a state localised at the identity of G, φ ∈ H R,e , all expectation values of any self-adjoint A ∈ L(H S ) are precisely those of the relativised (A) ∈ L(H T ).
Example 6. Unsharp Position. The quality of the reference system, understood as the localisability of the covariant quantity, dictates the quality of approximation of relative quantities by absolute ones (see [4] for a detailed investigation of this phenomenon.) An intuitive example of this behaviour is given by fixing a smeared position E Q R e for the reference, yielding, for sharp After restriction, we therefore wish to find the pomẼ Q S defined by and to compare this with E Q S . Moving to the position representation the right hand side of this expression may be written: which we write as After some manipulations we find that and that thereforeẼ The spread of the functionẽ = e * |φ| 2 dictates the (in)accuracy ofẼ Q S as it approximates E Q S . This spread can be quantified in different ways. Using the variance measure, we find that Var(ẽ) = Var(e) + Var |φ| 2 .
Definition 7. For 0 ≤ ǫ < 1, the overall width W ǫ (p) at confidence level 1 − ǫ of a probability measure p is defined by here the infimum of the lengths, |I|, is taken over all intervals I in R.
We may also use the overall width W ǫ (ẽ) applied to the densityẽ and the fact that the overall width of a convolution is bounded below by the width of the function with the greatest width, i.e., Therefore, the quality (localisability of the smeared position) of the reference system dictates the quality of the representing absolute quantity. Here the inaccuracy inherent in the reference system features as a lower bound on the inaccuracy of the absolute quantity. Even with perfect localisation of the reference with respect to the sharp position, there is a residual inaccuracy in the position of the system arising from the unsharpness of the covariant reference position. To get perfect accuracy, we need the preparation to be highly localised at 0, and the smearing distribution to be highly localised around 0.
6.2. Phase Delocalisation. At the other extreme (to that of high localisation) we may also consider very poorly localised reference states, including the worst case scenario of complete delocalisation, possible only for compact groups. For concreteness, we focus on the phase case.
Consider covariant phase pom F and invariant state ω. Such a state is completely delocalised . This is the Haar measure on S 1 under identification of S 1 with [0, 2π]. Thus we may formally write µ F ω (dθ) = dθ/2π. The composition of relativisation and restriction for a delocalised state ω then has the following effect on L(H S ) and L 1 (H S ) respectively: with predual/Schrödinger picture The above equations hold if (for example) we choose for ω the density operator τ R * (ρ R ) for any ρ R of R. Thus, from the perspective of , completely delocalised reference states give rise to restricted quantities/states which are phase-shift invariant/commute with N S . We note that the above relations (eq. (62) and (63)) also generalise, i.e., Proposition 6. Let Λ denote a general relative (invariant) self-adjoint operator acting in H T . If ω is invariant, then Γ ω is covariant. Then there exists a self-adjoint A ∈ L(H S ) for which The proof is a simple corollary of Lemma 4 and Proposition 1. We include it separately to emphasise the general property of invariance of restricted quantities obtained from general invariant quantities and invariant reference states. 6.3. Discussion. We have now seen that in the case of perfect reference phase localisation, absolute quantities of S, along with non-invariant states, provide an adequate theoretical and empirical account of the statistics produced by invariant quantities of S + R. In this case, the reference can be externalised, and excluded from the description. Though such a localised state is certainly a quantum state, there is a sense in which it may be viewed as classical-if the reference were provided by an abelian algebra (say, C 0 (G), embedded in L(L 2 (G))), classical pure states correspond to points in G which, are of course, localised (moreover, as shown in [4], "good" reference frames must be large, pointing to some form of classicality). These observations go some way towards justifying the informal use of external/classical reference frames in working with non-invariant states of S, which has become common-place in the literature.
The τ S mapping (and τ S * ) manifests in (at least) two distinct ways. Initially, we observed that the assumption that observables of S are invariant implies that ρ S and τ S * (ρ S ) cannot be distinguished, yielding an equivalence class of indistinguishable states (indeed, the notion of state could be redefined as this class). Now we see that τ S * produces a state description for S applicable when R is prepared in a completely phase-indefinite state (for example, an eigenstate of N R .) Indeed, τ S * (or (63)) is the "twirling" operation used in, e.g., [6], to yield a state description in which some observer of S "lacks a phase reference". There, it is argued that this is the description one would use if some experimenter wished to describe the state of S, but had no knowledge of the value of the (classical) phase reference that the state of S implicitly refers to.
In our formulation, this averaging arises as part of the physical description of an experiment in which the reference phase is completely phase-indefinite. Number states are of this type, and the phase-indefiniteness is a quantum restriction arising from number-phase preparation uncertainty relations. Therefore, we are able to give an alternative interpretation of to what the "lack of a phase reference" may be understood to refer: it is the situation in which a quantum phase reference is completely phase-indeterminate. There is no requirement of epistemic arguments regarding information possessed by experimenters, nor any need to refer to classical phase references at all.
6.4. The findings of [4]: General Considerations. We briefly review the findings of [4], which presents a study of "intermediate" situations for the reference frame, in between the very high and very low localisation covered in the present paper. [4] provides quantitative and operational size-versus-inaccuracy trade-off relations highlighting the necessity of large apparatus for good agreement between some arbitrary effect A and Γ ω (E) for invariant effect E.
We now state the main results more precisely. In the following, the Hilbert spaces involved are assumed finite dimensional. The operator norm · on L(H S ) induces the metric D(A, B) := A − B = sup σ |σ(A)−σ(B)|, which when restricted to the set of effects E(H S ) gives an operational measure of the discrepancy between the effects A and B.
In the following, the quantity W 0 ǫ (µ F ω R ) refers to the overall width (cf. Definition 7) of the where I(0, w) denotes the closed interval of width w ≤ 2π centred at θ ∈ [−π.π). Of course, . This inf can be replaced by min (see, e.g., [34], chapter 12). In the case in which relative quantities of S + R are obtained through , strong localisation of ω around θ = 0 gives good approximation between absolute and relativised effects: Let be a relativisation map and Γ ω a restriction map. For an arbitrary effect A and 0 ≤ ǫ < 1, it holds that Bad localisation gives bad approximation: And finally, One may go beyond the case of invariant quantities being obtained using , and consider general invariant quantities. The following holds for a general (a finite-dimensional, connected) Lie group G, acting via projective unitary representations U S and U R in H S and H R respectively, with self-adjoint generators N S and N R .
and N R are number operators on H S and H R respectively, and ω R ∈ S(H R ). Then the following inequality holds: Therefore, good approximation between arbitrary absolute effects (of S) and relative effects (of S + R), a large spread in the reference's number operator is required, and sufficient for this is good phase localisation.
6.5. Absolute Coherence. The stipulation that observable quantities are invariant under the given symmetry action bears strongly upon the possibility of operationally discerning between coherent superpositions (of eigenstates of the generator, in our case, number) and incoherent mixtures. It will therefore be useful to have a simple working definition and quantification of the absolute coherence of states with respect to a number operator (see [35] for other measures and observables). For an arbitrary state ϕ = n c n |n , no self-adjoint operator A commuting with N , i.e., no observable quantity, can distinguish between P [ϕ] and τ * (P [ϕ]) = |c n | 2 |n n|, i.e., between a state with absolute coherence and a state without it (Proposition 2).
Since localised states with respect to phase conjugate to N are not invariant, they are necessarily absolutely coherent. We now show that highly localised states have large absolute coherence, for the case of S 1 and finite dimensional Hilbert spaces.
and for an arbitrary ǫ, it holds that Proof. For simplicity we assume The first term in the right-hand side is 1 − ǫ. The second term in the right-hand side is now estimated. For any θ, by the definition of the overall width, tr(ρE(∆ + θ)) ≤ 1 − ǫ holds. For 6.6. Summary and Analysis of a Potential Objection. Returning to the situation in which we identify a system S and a reference R with number observables N S and N R respectively, an apparent circularity arises. Briefly summarising the story so far, we have argued that observable quantities are (defined as) those which are invariant under given symmetries. Given a quantum object, this entails that, in the phase-shift-invariance case, the states ρ and τ T * (ρ) are observationally equivalent and occupy the same equivalence class. From this point of view, absolute coherence is not a necessary feature of any description of the quantum object. However, we have argued that in certain circumstances the given object may be separated into two parts: system S and reference R, and that invariant quantities of S + R can, in the case of R having a phase quantity possessing the norm-1 property, be arbitrarily well approximated by absolute quantities of S, given a highly localised state of R. Absolute quantities are, in particular, sensitive to the difference between an absolutely coherent state ρ and its invariant, absolutely incoherent counterpart τ S * (ρ).
Therefore, a description of S + R in terms of only restricted/absolute quantities of S (not commuting with N S ), along with states with absolute coherence is possible, given states localised with respect to the absolute phase of R, which requires that such states have absolute coherence with respect to N R . This poses a difficulty, since it appears that we claim the description in terms of S alone is a relational one, depending implicitly on R, but have offered no such account for R. Moreover, the appearance of absolute coherence (of states of) S appears to depend on the actuality of absolute coherence of (states of) R.
What we therefore seek to develop in the next section is a "fully relational" picture in which S and R are treated on an equal footing. What emerges is that coherence is a truly relational notion in quantum mechanics, requiring two systems for its definition. From this, through development of the new concept of mutual coherence, we are able to give an analysis of interference experiments in terms of mutual coherence, and provide novel perspectives on the "reality" of optical coherence and the subtle issue of superselection rules and their relationship with quantum reference frames.

Fully Relational Picture
In order to obviate the objection raised in the previous section, we now rephrase our findings in what we describe as a fully relational picture, that is, in presenting our main results without taking recourse to absolute coherence, absolute localisation, absolute quantities, etc. In short, we may present the main theorem of the paper so far-Theorem 1-in a fully invariant manner for S + R. This does not change the mathematical content of the theorem, but highlights that only invariant states of S + R are required for good approximation of relational quantities by absolute ones, from which we may conclude that non-invariant states of S are representative of invariant ones of S + R, in direct analogy to the case of observables. It also motivates the concept of mutual coherence, to be presented in the next section. 7.1. States. We return once again to the situation wherein R has phase quantity satisfying the norm-1 property. First, with (φ i ) ⊂ H R a localising sequence (around 0), we may write equation holding for all ρ ∈ L 1 (H S ) and A ∈ L(H S ). Thus, since (A) is invariant we find that for all ρ ∈ L 1 (H S ) and A ∈ L(H S ). Hence, the limit and the resulting approximation may be carried out using only invariant/absolutely incoherent states of S + R.
Just as absolute quantities of S may be used to represent invariant ones of S + R, with good approximation coming with good localisation, a state ρ of S with absolute coherence may be used to represent invariant/absolutely incoherent states of the form τ T * (ρ ⊗ P [φ]) of S + R, again with good approximation coming with high phase localisation of φ.
However, we recall that there are difficulties with ascribing physical significance to the absolutely localised state φ with respect to the absolute phase pom appearing in the definition of , namely, absolute properties such as localisation and coherence, and absolute quantities such as phase should be understood as relative to a reference, with the reference being only implicit. Therefore, we should seek a consistent formulation in which absolute properties of R are not required-the description should be entirely relational. The above discussion is a step in this direction: the states {τ T * (ρ ⊗ P [φ])} for some localised φ no longer "contain" the localised φ in the sense that the partial trace over system or reference yields invariant/delocalised/absolutely incoherent states, and therefore a localised state cannot be attributed to R.
We now introduce the concept of mutual coherence, which we view as the fully relational version of ordinary coherence.

Coherence Revisited: Mutual Coherence
The need for a relational understanding of coherence has been clearly enunciated in the literature (e.g., [9,6,50,11]). However, little formalism is provided to deal precisely with such a relational notion, and there is no framework capable of making sense of an external classical frame (which appears in [6,50,11]). We will analyse this in more detail in Section 11 ; here we introduce the concept of mutual coherence, which we view as the relational counterpart (and a generalisation of) of absolute coherence (usually referred to as coherence in standard treatments).
We treat the number/phase case, recalling that we view poms which are invariant under relevant symmetry transformations as the truly observable quantities, and use the term "invariant quantity" for such objects.

Lemma 5. It holds that
Proof. We denote eigenvalue decompositions by Since τ S (B) = n P S n BP S n and τ R (C) = m P R m CP R m , a simple calculation shows that Corollary 2. The following two conditions are equivalent.
(i) There exists an invariant quantity E (thus τ T (E(X)) = E(X) for all X) and an X such that (ii) There exists an invariant quantity E (τ T (E(X)) = E(X) for all X) and an X such that Proof. Assume (i) holds. Then for E(X) satisfying the condition (i), Thus (ii) follows and vice versa.
Moreover, one can observe that for condition (i) to hold both τ S * (ρ S ) = ρ S and τ R * (ρ R ) = ρ R must be satisfied.
Therefore, since (τ S ⊗id)•τ T = (id⊗τ R )•τ T , one can conclude that a system state ρ S is coherent relative to the reference state ρ R if and only if ρ R is coherent relative to ρ S . Thus, coherence has a truly relational character. This motivates the following definition: A quantitative measure M(Θ) of mutual coherence of Θ may be provided by the quantity (where the supremum is taken over invariant effects) The above equalities follow easily from Lemma 5. For Θ = ρ S ⊗ ρ R we may write this quantitative measure as M(ρ S , ρ R ).
We note that this measure of mutual coherence is invariant with respect to the unitary representations U S ⊗ id and id ⊗ U R (and therefore also under U S ⊗ U R ). The following propositions show that if either (system or reference) state is invariant (absolutely incoherent), the mutual coherence vanishes, and at the other extreme (high reference localisation), the mutual coherence is well approximated by the absolute coherence (of the system state).
In particular, for an invariant state ρ R , the mutual coherence M(ρ S , ρ R ) vanishes.
Proposition 11. For a highly phase-localised state ρ R of R, M(ρ S , ρ R ) is approximately C(ρ S ) (the absolute coherence of ρ S ).
Proof. (We give a proof for finite dimensional Hilbert spaces.) For highly localised ρ R , we have shown (Proposition 7) that for an arbitrary effect E of S, Γ ρ R ( (E)) well approximates E, as From the definition of M we observe that For a fixed effect E, we have, using (67): Since E is arbitrary, we have We recall Proposition 10, which states that M(ρ S , ρ R ) ≤ min{C(ρ S ), C(ρ R )}. In the high localisation regime for ρ R , the second expression on the right side of (68) becomes small and we may assume that C(ρ S ) ≤ C(ρ R ). Therefore, we have the approximate equality M(ρ S , ρ R ) ≈ C(ρ S ), with the quality of approximation becoming arbitrarily good as ρ R becomes highly localised.
In other words, the mutual coherence takes on the appearance of absolute coherence in the high reference localisation limit.
We shall soon discuss the role of mutual coherence in interference phenomena and superselection rules. First, we note the following observations relating to approximation of relational observables by absolute quantities for some (A) constructed using a phase pom possessing the norm-1 property.
Suppose that we have some non-invariant state ρ S = τ S * (ρ S ). Then for arbitrary A, tr [ρ S A] and tr [ρ S ⊗ ρ R (A)] can be made equal only if (ρ S , ρ R ) is mutually coherent. The reason is clear: Suppose that (ρ S , ρ R ) is not mutually coherent. Then, by Definition 10, for any invariant . This latter quantity can never equal tr [ρ S A] for non-invariant A and non-invariant ρ S .
In fact, this also establishes the more general result that for any invariant R ∈ L(H T ) and non-invariant ρ S , tr [ρ S ⊗ ρ R R] = tr [ρ S A] for arbitrary A only if (ρ S , ρ R ) is mutually coherent. Theorem 2 demonstrates for a specific non-invariant effect A ∈ E(H S ) that for a ρ R with poor localisation (∆ ρ R N R < 1/6), the discrepancy D(A, Γ ρ R ( (A))) > 1/32, and therefore that tr [ρ S A] and tr [ρ S ⊗ ρ R (A)] cannot even be close in this case.

Measurement
The enquiry thus far has been of a kinematical nature. We now consider the important role played by dynamical evolution of states and ensuing measurements, considered in light of the relational perspective presented. The main theorem regarding the role of symmetry in quantum measurements is the Wigner-Araki-Yanase (WAY) theorem [40,41,42], which addresses measurements in the presence of additive conserved quantities of system-plus-reference. After presenting the essentials of the quantum theory of measurement required for our analysis, we present a "strong" form of the WAY theorem, assuming the system on its own has a conserved quantity, followed by two readings of the WAY theorem: the orthodox reading, as presented in [43], and the relational viewpoint. 9.1. Measurement Theory: Brief Overview. We briefly describe the quantum theory of measurement of relevance to this work. For simplicity we present these concepts without the impositions of symmetry.
Let H S be the Hilbert space representing a quantum system S under investigation, H A that representing a measuring apparatus, with the combined system then given by H = H S ⊗ H A . A unitary mapping U : H → H models a measurement interaction, serving to correlate the states of the system to those of the apparatus during an interaction period T . The specification of a selfadjoint "pointer observable" Z on H A , a fixed state φ ∈ H A (which for convenience is assumed to be pure) and the scaling function f (which maps the values of the pointer to those of the measured observable) then fix the measurement scheme M ≡ H A , U, φ, Z, f for observable E of S. With Ψ T = U (ϕ ⊗ φ) ∈ H, M must satisfy the probability reproducibility condition: where E Z f −1 (X) are spectral projections of Z, and (69) holds for all ϕ and X. In words, (69) stipulates that the outcome distribution for E in any state ϕ may be recovered from the pointer statistics in the final state Ψ T . Conversely, given a measurement scheme as described above, this relation determines the measured observable E. A measurement (scheme) is said to be repeatable if, upon immediate repetition of the measurement, the same outcome is achieved with certainty. This may be written: We note that (69) does not entail (70), and therefore the question of repeatability must be treated independently of that of probability reproducibility.
9.2. Conservation Laws: Strong and Weak WAY Theorems. We present here the standard version and interpretation of the theorem of Wigner, Araki and Yanase (WAY), as presented in [43], giving both the no-go part, prohibiting sharp, repeatable measurements of an observable (in the ordinary sense) which does not commute with (the system part of) an additive conserved quantity, and the positive part demonstrating conditions under which good approximation can be achieved. We begin with a version of the WAY theorem subject to a stronger assumption than is typical-that subsystem quantities are conserved-which we therefore refer to as the strong WAY theorem.
9.2.1. Strong Conservation. The stipulation that observability entails invariance follows as a theorem in the quantum theory of measurement from a constraint on M, namely the conservation of some quantity of H S (see also [34], ch. 21). Consider the strongly continuous unitary group described by the operators U S (t) ≡ e itL S , with t ∈ R or t ∈ [0, 2π] and L S a self-adjoint operator acting in H S . Then the following holds. 6 Proposition 12. Suppose that for any measurement scheme M for E, L S is conserved, i.e., for all value sets X ∈ F and all t.
Proof. It is sufficient to consider measurement schemes for which the pointer function f in (69) is the identity map. Equation (69) gives for all object initial states ϕ and all X. Replacing ϕ with U S (t)ϕ does not change the left hand side, and therefore the right hand side is also unchanged, immediately giving (71).

Remark 7.
If S is considered to be an elementary system, the operators U S (t) comprise an irreducible representation, in which case the only effects satisfying (71) are those of the form , the trivial effects. If, however, S is more complex, comprising several elementary systems for instance, S can be separated into an "object" system S O and a "reference" system S R . Absolute quantities of S O then function only as representatives of observables of S as a whole, and depending on the composition of S R may or may not accurately represent observables; as we have seen, S R under certain localisation requirements allows for the absolute quantities of S O to be good representations of observables. In particular, S R may be viewed as a measuring apparatus, as is the case in the WAY theorem, which we initially present in its conventional form, and subsequently reinterpret in a relative vein.

Weak Conservation: Wigner-Araki-Yanase Theorem.
In this instance a conservation law is applied to the system-apparatus combination, but is not assumed to hold 'locally', i.e., for the system under investigation and apparatus separately. We present the traditional reading of the WAY theorem. We refer to [43] for a proof. Following Ozawa [45], we refer to the condition that the pointer observable Z commutes with the apparatus part of the conserved quantity L A as the Yanase condition [46]. In the case that [A, L S ] = 0, there is a positive counterpart to the impossibility result: approximate measurements of A, with approximate repeatability properties, are feasible, with increasingly good approximation properties the larger the variance (∆ φ L 2 ) 2 becomes (see [43], where more general measures of spread are also considered) and indeed that such large "spread" is necessary for good measurements of A.
Thus, in its usual reading, the WAY theorem does not prohibit accurate measurements of unsharp observables which do not commute with L S , thus leaving room for a positive rephrasing of the theorem where a smeared, approximate version of A can be measured accurately. We now address this point, arguing that, just as in the discussion following the strong version of the theorem, one should actually conclude that the measured observable in the WAY theorem must be understood as a representative of a relative observable of system and apparatus combined.
The standard interpretation of the WAY theorem states that any sharp A not commuting with (the object part L 1 of) an additive conserved quantity L for which the Yanase condition is satisfied ([Z, L 2 ] = 0) cannot be measured precisely. Moreover, good approximation can occur if there is large uncertainty with respect to the apparatus part L 2 of the conserved quantity in the initial state φ of the apparatus, i.e., if (∆ φ L 2 ) 2 is large.
In light of the theme of this paper, namely understanding the consequences of the principle that observables are invariant, we may reconsider the message of the WAY theorem. We recall that, for fixed φ ∈ H R , the equation when stipulated to hold for all X, ϕ determines the pom E. In other words, E(X) = Γ φ (U * ½ ⊗ E Z (X)U ). Given the Yanase condition ([Z, L 2 ] = 0) and the conservation law ([U, L] = 0), it follows that [U * ZU, L] = 0. Writing U * ZU ≡ Z(τ ), it is therefore evident that Z(τ ) is invariant under the symmetry generated by L and that, furthermore, in the limit that (∆ φ L 2 ) 2 becomes large, A (which is not necessarily observable) can become a good approximation of the observable Z(τ ).
If L 2 is the shift-generator in a conjugate quantity (e.g., a number operator generating phase shifts), then large L 2 spread in φ corresponds to high localisation with respect to φ in the conjugate quantity, completely in line with the view that for A to be a good representative of an invariant observable, the reference system must be highly localised with respect to a phase-like quantity,à la . This also sheds light on the reason that L 2 must have large spread in the initial state of the apparatus φ. This view of the ordinary WAY theorem then arises when the strong WAY theorem is applied to system-plus-apparatus together, viewed as an isolated system. The relational view just discussed may be exemplified in a position measurement model of Ozawa, introduced in [47] and analysed further in [48], [49]. We consider the momentum-conserving position measurement scheme in which S + R interacts with two apparatus systems A + B. This scheme measures the absolute position Q with the pointer observable P B − P A , a relativised momentum. Contrary to the claim in [47], a WAY-type limitation is exhibited for this model. However, we show that the same scheme may be used to measure the relative position observable, Q ⊗ ½ − ½ ⊗ Q R ≡ Q − Q R ; in this case there is no localisation requirement at all for good measurements, as would be expected since Q − Q R is already shift-invariant. Moreover, we demonstrate that the absolute position Q well represents Q − Q R precisely when Q R is highly position-localised, corresponding to a large momentum spread in the reference system R (cf. Example 6).
The unitary measurement coupling is given by U = e i λ 2 (Q−Q R )(Q A −Q B ) , which commutes with the total momentum P + P R + P A + P B (notice also that P + P R is separately conserved and therefore falls under the remit of Proposition 12). Subsequently the pointer Z, given by the difference of momentum operators P B − P A , is measured. We consider the initial state Ψ 0 (x, y, u, v) = ϕ(x)φ(y)ξ a (u)ξ b (v), where x, y, u, v are spectral values of Q, Q R , P B − P A , P A + P B respectively. The unique measured pom E : B(R) → L H ⊗ H R ≡ L(L 2 (R 2 )) is extracted from the condition required to hold for all ϕ, φ. It then follows that where the right-hand side is the convolution of the set indicator function χ X with the probability distribution e (λ) ( a (s) = √ λξ a (λs). We see that E is a smeared version of E Q−Q R . As such the former can be considered an approximation of the latter, and we may quantify the inaccuracy or error of that approximation by the variance of the distribution function e (λ) (other measures such as overall width can also be used: see [49]). The variance of e (λ) is Var( e (λ) ) = 4 λ 2 Var |ξ a | 2 . Therefore by tuning λ to be large, arbitrarily accurate measurements of Q − Q R can be achieved with no localisation requirement on the reference system R.
The absolute position Q then acts as an approximation of the observable Q−Q R , the approximation becoming good with good Q R localisation. By fixing φ in the initial state Ψ 0 , the measurement scheme can be viewed as "measuring" a pom E for S: This is of the form E(X) = χ X * e (λ) (Q) with e (λ) given by e (λ) (x) = φ 2 * ξ (λ) a 2 (x). The probability distribution for the relative position has thereby been re-expressed in terms of a smeared distribution for the absolute position by considering a fixed reference state φ. The approximation error of E relative to E Q is given by Var(e (λ) ) = Var |φ| 2 + 4 λ 2 Var |ξ a | 2 . The probability distributions corresponding to the relative coordinate in the states ϕ ⊗ φ become indistinguishable from those of the absolute coordinate Q in the limit that the localisation of the state φ with respect to Q R is arbitrarily good (provided also that λ is tuned to be large.) This model, therefore, highlights how relative observables such as Q − Q R may be measured whilst preserving overall symmetry imposed by the conservation of total momentum, with a pointer observable that also respects symmetry. In this case, the apparatus is not required to function as a reference system, which is internal to the measurement device and whose localisation controls the quality of the approximation by the absolute quantity.
We have therefore seen that the picture of observables as relative quantities may be well maintained in the presence of dynamics. It was shown that the WAY theorem has a relative interpretation, and the model of Ozawa provided a measurement scheme for the relative position observable, which could be re-expressed as an accurate measurement of an absolute position precisely when the reference system was well position-localised. Absolute quantities were seen to be good representatives of observables again in the high reference localisation limit, the interpretation therefore not differing from the "static" case. We now consider the impact of this enquiry on the status of superpositions, interference and superselection rules.

Interference Phenomena
We begin this section with a typical analysis of interference phenomena. Absolute coherence is the usual requirement for interference effects to manifest. We show that, from our relational perspective, mutual coherence replaces absolute coherence in regard to interference. We provide several models which serve to illustrate the problem of observing coherence, and then turn to the role played by high phase localisation of the reference.
The stipulation that observables are phase-shift invariant implies that relative phase factors 7 between states in a superposition of number eigenstates (with differing eigenvalue) ought to be unobservable. We have seen, for example, that the state P [ϕ] of S with ϕ = |0 + e iθ |1 / √ 2 cannot be distinguished from τ S * (P [ϕ]) = 1/2(|0 0| + |1 1|) (which has no dependence on θ) by any quantity commuting with N S .
The usual reading of θ-dependent expectation values (appearing in states akin to that discussed above) arising in measurements is that a coherent (in our language, absolutely coherent) superposition has been prepared and measured (to be coherent). Since this is equivalent to measuring an absolute quantity, this conclusion warrants further scrutiny. There is a history of debate and controversy surrounding the meaning of θ-dependent expectation values in superpositions of number states (also understood as charge eigenstates) in the subject of superselection rules [7,9,8], and of 7 It may be of help to note here that there are two distinct uses of the term relative phase: up to this point, we have used the term to designate observables that are defined as relative to a reference system; in the context of interference experiments, one speaks of relative phase factors to indicate that these are actually phase differences between two states appearing in superposition, contrasting these observable quantities with the unobservable overall phase factors one may attach to a state vector. photon number states in the so-called optical coherence controversy [39,11], regarding the reality of coherent states in describing the output field of a laser.
In the forthcoming subsections we motivate the question of relative phase-factor sensitivity more formally and in a dynamical context, by first discussing a generic interference experiment, followed by three model considerations. We finish with a discussion of the role of high phase localisation and the accompanying interpretation of the measurement statistics. We then use our findings to analyse points of agreement and points of friction between our interpretation and those appearing in the literature. 10.1. Interferometry. Ramsey interferometry exemplifies the typical form of interference experiments (see, e.g., [50]). Here, an atom enters a cavity in its ground state |g , interacts with the cavity, and exits in a superposition of ground and excited states. At the level of the atom the following sequence (or similar) of (unitary) state evolutions is often given: where |e represents an excited state of the atom. If the observable P g ≡ |g g| is measured in the final state, we see ψ f | P g ψ f = sin 2 (θ/2). The orthodox reading of such a measurement is that this θ-dependent probability distribution for the observable P g in the state ψ f validates the coherence of the superposition state 1 √ 2 (|g − ie −iθ |e ). However, the Hamiltonian generating such an evolution certainly does not commute with P g , P e (i.e., N S ) and is, itself, therefore not phase-shift invariant (and thus not (an) observable). Equations (75) and (76) must, if applicable at all, therefore be viewed as approximate, reduced descriptions of the true, energy-conserving dynamics of system-plus-cavity.
We will obtain a consistent description of measurements which at first sight appear sensitive to relative phase factors between number superpositions. Keeping in mind that observables are invariant and that states are class representatives, we may obtain statistics which look as if absolute quantities have been measured or, alternatively (and equivalently), that relative phase factors across number eigenspaces have been observed. Again, this is a reduced, approximate description and not a true representation of the state of affairs. The models to be presented have strong formal similarities to the case of observability of phase factors between states of different charge and different baryon number, allowing for comparison to the issue of whether superselection rules may be obviated in practice (cf. [9,6,50]). We show that all such attempts may be phrased purely in terms of measurements of relative quantities (i.e., observables), highlighting the fact the absolute quantities are never measured.
10.2. Model 1: Two-level System. We first consider a model in Hilbert space dimension 4 to show how to dynamically introduce a relative phase factor between number states (of the same total number eigenvalue), whilst respecting symmetry. The restriction to low dimensions highlights the relational nature of the relative phase factor. The generic structure of this model can then be applied to the scenario where the reference system's Hilbert space has infinite dimension, which resembles the situation for which there have been claims purporting to "lift" [6] or evade superselection rules. However, we argue that there is no reason for the interpretation of measurement statistics in the infinite dimensional setting to be different from the model discussed below, except for the observation that with infinite dimensional reference systems, expectation values of absolute quantities (can be made to) agree arbitrarily well with those of the relative ones (contingent on a choice of reference state).
Let N S ∈ L(H S ) ≡ L(C 2 ) be a number operator so that N S |0 = 0, N S |1 = |1 , and let N R ∈ L(H R ) have the same definition. Any self-adjoint operator A ∈ L(H T ) must commute with We introduce two unitary operators U 1 and U 2 which represent two stages of time evolution, defined as and it can be seen that [U 1 , N ] = [U 2 , N ] = 0. Furthermore, it is important to note that U 2 does not depend on θ, which can be seen by the action of U 2 on the initial product states given by ). The purpose of applying U 2 is to allow a measurement of an invariant quantity of S, which gives rise to θ-sensitive measurement statistics.
In other words, U 1 introduces the factor θ, U 2 redistributes the θ-dependence, so that the measurement of an invariant S-quantity depends on θ, which then validates the superposition present after U 1 .
Writing P 0 := |0 0|, ψ = |0 |1 , and noting that τ S (P 0 ) = P 0 , we compute post-U 2 statistics: This yields the probability p P 0 U 2 U 1 ψ (0) = cos 2 θ 2 , which depends explicitly on the phase θ. Applying τ T * at every stage does not alter the probabilities; we have, for example Then tr P 0 ⊗ ½τ T * (P U 2 U 1 ψ ) coincides with the expression in (77). The unitary maps U 1 followed by U 2 mimic what might occur in a realistic interference experiment in which the reference system is confined to a low dimensional Hilbert space. The interference fringes dictated by θ may be observed through the measurement of an invariant system-apparatus quantity. This does not require absolute coherence, i.e., does not imply the coherence of superpositions across N -eigenspaces. It does, however, require mutual coherence. Considering the states arising after application of U 1 , it is immediately clear that the reduced states tr H R [P U 1 |i |j ] and tr H S P U 1 |i |j have no dependence on θ, indicating that θ relates to both S and R. Since the post-U 1 states are entangled, the only means by which we may identify a system and a reference is via the partial trace.
We may define a restriction map Γ ρ R with ρ R = tr H S P U 1 |i |j , which, since ρ R is invariant, yields only invariant restricted quantities for S (if assumed invariant for S + R). The conclusion, then, is that θ-dependent expectation values do not correspond to the observation of states with absolute coherence.
We now analyse a variety of infinite dimensional examples, and argue in subsection 10.6 that we must draw the same conclusion: relative phase factor sensitive measurement statistics can be achieved by measuring observables (and only observables), i.e., the relevant relative phase factors occur within an N -eigenspace. Only in the high reference phase localisation do these appear as though they pertain to the system alone.
10.3. Model 2: Angular Momentum and Angle. We now adapt the previous model, replacing the space C 2 of the reference system with an infinite dimensional space, and construct a new unitary mapping (still calling it U 1 and restricting to the subspace spanned by {|0 , |1 } for the first system): Here the basis vectors are the eigenvectors of n . We observe that the partial trace over system or reference yields reduced states which do not depend on θ. Linearity and continuity entail The initial state Ψ 0 under τ T * takes the form (sums taken for n running from −∞ to ∞) where the P n are the infinite-rank projectors onto the eigenspaces of N = N 1 + N 2 given as We consider what observation may reveal about θ in the state τ T * (P Ψ f ) for Ψ f as given in (81). We have Here it is manifest that τ T * (P ψ f ) is a mixture of states of different N -eigenvalues, and within each eigenspace labelled by n there is a relative phase factor between the states of the same Neigenvalue. There exists an invariant quantity of S +R which is sensitive to θ in the state τ T * (P Ψ f ). For example, we may choose A = |0, n 1, n−1|+h.c. and invoke relation (11) (replacing the spectral projections with the self-adjoint operators they define). We may extend the analysis and, in the spirit of the finite dimensional example, introduce a second unitary U 2 (which is independent of θ), which with U ≡ U 2 U 1 yields on the number basis states In analogy to the 2 × 2 case, we see that for example tr |0 0|tr K [P U |0 |n ] = cos 2 θ 2 , and again, since we have measured an observable (i.e., |0 0| ⊗ ½), applying τ T * at all stages does not alter the result. The purpose of U 2 is then to bring the final states into a form wherein the measured observable is non-trivial only for S and measurement of an invariant quantity for S gives θ-dependent expectation values. This validates the presence of mutual coherence, and does not indicate the existence of absolute coherence at any stage. As shall be shown, this becomes crucial in the question of whether superselection rules can be effectively overcome through a judicious choice of unitary mappings and measurements.
j . A simple N -preserving unitary mapping is given by: Following the now familiar approach, we introduce a second unitary map U 2 , under which U ≡ U 2 U 1 implements |n + cos θ 2 |1 |n − 1 n > 0. Then tr |0 0|tr K P U |0 |n = cos 2 θ 2 and once again we have a θ-dependent probability distribution for an observable in the state U |0 |n . Moreover, this does not differ from the distribution in the state τ (P U |0 |n ). Of course, the θ-dependence only corroborates the "reality" of the relative phase factor, within the eigenspace of N with eigenvalue n in the state on the top line of equation (86).

10.5.
Coherence and Mutual Coherence: Brief Discussion. In each of the models we have discussed, the crucial component for witnessing interference effects, in the form of θ-dependent expectation values, is the presence of non-zero mutual coherence for states of S + R (which is possible even in the absence of absolute coherence for S + R). Mutual coherence allows for (and is necessary for) the appearance of absolute coherence, even without a limit being taken.
For instance, ϕ = α|0 + β|1 gives, for A = |0 1| + |1 0| the expectation value 2Re(αβ). One can define the invariant (entangled) stateφ = α|01 + β|10 andÃ = |01 10| + |10 01| (which does not commute with N T ) so that φÃ |φ = ϕ | Aϕ = 2Re(αβ). This can also be done with an invariantÃ ′ = τ T (Ã). Thus, in this case, asymmetric statistics of S can be given by symmetric ones of S + R without the need for localisation. However, the physical interpretation is unclear due to the non-separability of Ψ. The important observation is that mutual coherence ofφ is required for the possibility of the appearance of absolute coherence of states of S.
Next we examine the role of high reference phase localisation in the interpretation of the measurement statistics.
10.6. High Phase Localisation. We turn now to the behaviour of model 2 in the regime that the initial state of the reference system is highly phase-localised. Let c n = e inθ ′ √ 2j+1 for −j ≤ n ≤ j and 0 otherwise, and let |θ ′ j = j −j c n |n . This state is approximately localised around the value θ ′ , i.e., is an approximate eigenstate of the self-adjoint angle Θ R with eigenvalue θ ′ , with the quality of approximation becoming increasingly good as j becomes large. Indeed, the sequence (|θ ′ j ) is an approximate eigenstate of Θ R , in the sense that θ ′ j Θ R θ ′ j = θ ′ and Var(Θ R ) θ ′ j → 0 as j → ∞. (The sequence also describes a an approximately localised state in terms of concentration of probabilities, as described for the similar sequence (φ n ) of Example 4.) Using the form of U 1 from section 10.3, we find where the state Clearly |error j 2 = (2j + 1) −1 and therefore |error j → 0 as j → ∞. As this error term becomes arbitrarily small, Ψ f is arbitrarily norm-close (modulo an overall phase) to the product state |0 + e i(θ+θ ′ ) |1 |θ ′ j / √ 2. Let R ∈ L(H T ) be invariant and self-adjoint. By continuity, lim j→∞ R|error j = 0. Suppose we fix θ ′ = 0 and R = (A) for some self-adjoint A which does not commute with N S . Then, with ϕ := |0 +e iθ |1 |θ ′ j / √ 2. Therefore, in this model, the expectation of the absolute quantity A in the absolutely coherent state ϕ approximates arbitrarily well the relational (A) in the invariant (absolutely incoherent) state τ T * (P [Ψ f ]).
We may also consider the action of U 2 , leading to an overall evolution U : with lim j→∞ |error j = lim leading to the evolution up to a term of arbitrarily small norm of Therefore, if the error term |error j , which can be made to have arbitrarily small norm by choosing j large enough is ignored, the state of the system alone is given by the first factor in the tensor product, achieved by partial tracing over R. For simplicity we set θ ′ = 0. Then measurement sensitivity of the observable |0 0| to θ (which is still present after operating with τ T * ) seems to validate the existence and measurability of (the relative phase factor θ in) the superposition 1 √ 2 |0 + e iθ |1 , since the latter state is given as the state of the system again by ignoring the error term in equation (88). It looks as though coherence across N 1 eigenspaces has been prepared and confirmed. We now critically analyse this conclusion. 10.7. Interpretation. Analysis of the post-U 1 and post-U states in the above high reference localisation regime highlights several key points, variants of which will reappear throughout the rest of this paper under various guises. We first recapitulate: -Any reasonable measure of entanglement capable of capturing this situation would show that the state Ψ f becomes arbitrarily close to an unentangled state for suitably large j. -Continuity (of R) dictates that the statistics of absolute A in absolutely coherent ϕ can be approximated arbitrarily well, for suitably large j, by (A) in the state τ T * (P [Ψ f ]). In particular, θ-dependent expectation values are present before the limit is taken.
The limit j → ∞ itself must be treated with extreme caution-the rigorous existence of such limits must be questioned, and the meaning of physical conclusions drawn from the limit may not be clear. The main dangers of taking the large amplitude limits in the example we have discussed are summarised below.
-The limit j → ∞ in the state |θ ′ j does not yield a normalisable Hilbert space vector. -N 2 (and thus N ) is not a bounded/continuous operator and therefore N |error j need not vanish even as |error j does in the large j limit. -If the error term is ignored, the dynamics no longer conserve number (this is due to the unboundedness). This is most acutely observed by noting that in (86), θ may take any real value. Choosing θ = π and ignoring the error state, the evolution takes the form U |0 |θ ′ j = |1 |θ ′ j . It appears as though the state |1 has been manufactured from |0 with no energy cost.
-Ignoring the error term leads to a "reduced" unitary U eff = U S ⊗ ½ and it is clear that [U eff , N ] = 0 and [U S , N 1 ] = 0. Therefore, no matter how small |error j may become, in order to properly account for energy/N conservation, it must not be taken to be zero. -The partial trace tr R [τ T * (P [Ψ f ])], for any finite j, yields an invariant/absolutely incoherent state of S (by Proposition 5.) Only in the limit does absolute coherence for S appear. We now discuss the large amplitude limit in more detail. 10.7.1. Meaning of the Limit. In analysing the physical interpretation of the high amplitude limit, we will be guided by two principles, referred to by Landsman [38] as Earman's principle [36] and Butterfield's principle [37]. Earman's principle states that While idealisations are useful and, perhaps, even essential to progress in physics, a sound principle of interpretation would seem to be that no effect can be counted as a genuine physical effect if it disappears when the idealisations are removed. ( [36, p. 191]). Butterfield's principle then addresses the question of idealisations given by infinite limits, and describes in more detail the type of behaviour that must exist prior to a limit being taken: There is a weaker, yet still vivid, novel and robust behaviour that occurs before we get to the limit, i.e. for finite N . And it is this weaker behaviour which is physically real. ([37, p. 1065]). Here, N refers to particle number, but this principle is readily adapted to our situation. Taking this all into account, the following appears to be a consistent interpretation.
The overall (S + R) dynamics are number-conserving, and the observables which may be measured are invariant under phase shifts generated by N (hence, commute with N ). There is a reduced description, applicable to S on its own in which, in direct analogy to the discussion of high localisation in the kinematical case, absolute quantities, absolute coherence, and non-N -conserving dynamical maps approximately capture the observed statistics.
This reduced description is suitable in its convenience and usefulness in certain situations, and provides an adequate tool for computing, to arbitrary approximation, empirically verifiable measurement statistics. For instance, the descriptions afforded by A and (A) may be observed to be arbitrarily close given arbitrarily high reference phase localisation, with the limit then featuring as an idealisation in which A and (A) (or more correctly, (Γ φ • )(A)) may be taken to be equal.
However, in addressing fundamental issues, the use of the idealisation (high localisation limit) betrays the essence of the phenomena under investigation. Guided by Earman's and Butterfield's principles, we may therefore discard as being artefacts of the idealisation those phenomena present in the limit but which disappear prior to the high localisation limit actually being taken. The attribution to S of a state which is a superposition of eigenstates (and which is physically different from its corresponding mixture) of different N S eigenvalue is such an example: taking the partial trace (over R, with θ ′ = 0) (in (88)) with the error term included (finite j) yields the state ρ S = 1 2 (|0 0| + |1 1|), whereas ignoring the error term (infinite j) we find the state tr R P Ψ f = P 1 √ 2 (|0 +e iθ |1 ) , i.e., the projection onto the vector unit vector ϕ S = 1 √ 2 (|0 + e iθ |1 ). 8 Another is the violation of energy conservation. At any finite j, energy is manifestly conserved, whereas in the limit, with the error term ignored, energy conservation is violated. These two instances must therefore be viewed as pertaining to not physically real effects in the sense of Earman. The physically real effects are the statistics arising from the measurement of invariant quantities. Approximating these statistics in a convenient manner by non-invariant quantities is legitimate, but attributing the measurement statistics to such quantities as observables is not. The measurement statistics containing θ-dependent terms, close in approximation to the absolutely coherent superposition, are physically real, but the state description of S as absolutely coherent is not.
Working with the idealised limit is legitimate when it comes to computing certain expectation values which may arise in experiments. For example, using A rather than (A) is unproblematic, provided the reference frame is prepared in a highly localised state. However, given the nature of our enterprise, that is, to understand the fundamental role played by symmetry upon the definability and measurability of quantum mechanical quantities, it is illegitimate to move to an idealisation in which the symmetry is no longer manifest, a fortiori when the symmetry is present at every finite value of j prior to the limit being taken.
In other words, since we are interested in symmetry, we should not have recourse to a theoretical description in which, even though valid insofar as certain calculations are concerned, the symmetry in question is no longer present. Thinking of the description of a ball bouncing against a wall (cf. [6]), there is no problem, as far as the modelling of the ball is concerned, in taking the wall to be of infinite mass. But if one is performing an investigation of the limitations on dynamics imposed by momentum conservation, then taking the large mass limit of the wall-the limit in which momentum conservation is violated-cannot be viewed as fundamentally valid and completely obscures the issue at hand, namely the role played by symmetry and conservation.
We now address controversies surrounding superselection rules and the reality of optical coherence, by critically analysing a number of models in the literature aimed at, in essence, obviating superselection rules. We will observe the use of dynamics and limits very similar to those discussed above, with identical interpretation.

Controversies
The final part of this paper addresses a number of controversies which have appeared in the literature over the last 65 years. The first relates to the fundamental status of superselection rules and the role played by reference frames there, and the second, appearing much later but strongly connected to the superselection rule debate, the question of the reality of optical coherence of laser beams.
We critique two opposing standpoints on the meaning and validity of superselection rules. Wick, Wightman and Wigner's (WWW's) seminal 1952 paper [7] was met with objection from Aharonov 8 We may again analyse the states after the second stage of evolution. Writing U = U2U1 and considering the situation post-U2, ignoring the error term and partial tracing over R yields the state ϕ ′ S = cos θ 2 |0 − i sin θ 2 |1 , and therefore, in contrast to the post-U1 situation, the observable P0 = |0 0| ⊗ ½ gives probabilities which are dependent on θ. There is, however, no reason to believe that such sensitivity to θ here entails anything about observable relative phase factors between states of different number; we have already argued that θ, properly interpreted, pertains to system and reference combined, and only takes on the appearance of a relative phase factor between number states when the error term is not taken into account-an illegitimate manoeuvre as far as the conservation law is concerned. We also note that the effective U2 does not commute with N1.
and Susskind (AS) 15 years later [9], which was then obliquely criticised again by WWW. Subsequent efforts have been devoted on the one hand to rigorous work on superselection rules in quantum field theory (see, e.g., [51]), whilst on the other towards more practical questions on the role of superselection in information and communication theoretic tasks (e.g., [6,52]). After briefly introducing Wick, Wightman and Wigner's original argument, we focus on Aharonov and Susskind's contribution, highlighting points of agreement and disagreement between our perspective and theirs. For instance, the meaning of coherence/superpositions as requiring a relational understanding [9] we view as ground-breaking, and this point of view has inspired much of the work in this paper. However, we do not support their conclusion (e.g., in the abstract of [9]) that "contrary to a widespread belief, interference may be possible between states with different charges"; nor do we agree that this conclusion follows from their argument. The paper suffers from mathematical flaws and a lack of conceptual clarity; what is at stake is nothing more than the appearance of measurability of absolute quantities/coherence in the presence of symmetry, and therefore the explicitly relational framework presented is well-suited to bringing a consistent and clear explanation of the issue of whether superselection rule "forbidden" states can be superposed to give a physically different state from its corresponding mixture.
We also critique more recent contributions [6,50] along similar lines, focussing on the latter. The former [6] suffers from serious mathematical defects, some of which we have already pointed out and some of which are irreparable, which severely limit the conclusions that can be drawn from the work. The scope of [6] is also catered heavily towards the role of reference frames in informationtheoretic tasks and agent-based scenarios, e.g., entanglement theory, quantum key distribution, communication tasks, all when the given agents have no knowledge of each other's reference frame. The ensuing practical limitations gradually morph through the paper into fundamental ones, with far reaching conclusions that we contend are not warranted. We again give points of agreement (e.g., that "all observable quantities ought to be relational") and disagreement ("superselection rules cannot provide any fundamental restrictions on quantum theory"), and again clear up dubious arguments by consistently applying the principle that observable quantities are invariant. This also applies to [50], which shares many mathematical problems with [6]. We find the language vague and occasionally conceptually unclear, and we will critique this work in detail, drawing upon ideas thus far presented. 11.1. Brief Overview. The notion of a superselection rule was introduced by Wick, Wightman and Wigner [7], who proposed that superpositions of states of bosons and fermions should be considered as equivalent to the associated mixture (i.e., that relative phase factors in superpositions of bosons and fermions are unobservable in principle), and a similar position was advocated for states of differing electric charge. Aharonov and Susskind [9] disagreed with the latter claim and offered a concrete experimental arrangement, very similar those we have considered in this paper (for the express purpose of critique), to demonstrate the possibility of preparing and observing coherent superpositions of states of different electric charge, via a formal analogy to the case of angular momentum. WWW then replied [8] with a theorem demonstrating that coherence is required in the initial state of one system in order to observe it in another, pointing to a circularity in Aharonov and Susskind's argument and similar to the objection raised here in subsection 6.6.
Subsequently the issue of the "reality" of quantum optical coherence was raised by Mølmer ([39]), who suggested that if the gain medium of the laser is properly accounted for, the actual laser field is described by a mixture of number states, and that therefore the coherence is merely "convenient fiction".
Bartlett, Spekkens and Rudolph [6] (BRS), also in collaboration with Dowling [50] (DBRS), have shed light on aspects of the superselection rule debate, particularly in clarifying the position of Aharonov and Susskind [9], and on the "optical coherence controversy" [11], highlighting the relative nature of states (and also, therefore, of coherence) and the accompanying role of reference frames.
We now present the form that these controversies take from the perspective of the relational formalism presented here. We believe that the framework we have developed for dealing with relative quantities clarifies the seemingly opposing viewpoints of AS and WWW, and in a certain sense unifies them. We will see that the attempts to overcome or "lift" superselection rules (as they arise through the lack of a reference frame -see [6]) correspond to model considerations that take the same form as the dynamical models already considered (many of which are modelled on [9] and [50]). The framework afforded by observables-as-invariants allows for a circumvention of the "relative" and "global" decompositions of the system-apparatus Hilbert space described in [6,50] (see also [49]) which have mathematical flaws, and allows for a direct assessment of the status of claims to, in essence, obviate superselection rules.  [7] made a simple argument to demonstrate the existence of a dichotomy between the assumption that all self-adjoint operators represent observables on one hand (a working assumption since von Neumann's book [53, p. 313]), and relativistic invariance on the other. Since double time reversal, T 2 : H S → H S (with H S ≡ H S b ⊕ H S f , the decomposition into bosonic and fermionic subspaces defining the projections P b and P f respectively), they argue, cannot be observed, and since T 2 has the effect of leaving bosonic states invariant and introducing a minus sign on fermionic states: it must be that any observable leaves the bosonic and fermionic sectors invariant, with the sign difference then unobservable. This follows since for any self-adjoint A, for the consequences of a double time reversal to be unobservable, it must be that Ψ + | AΨ + = Ψ − | AΨ − , from which it follows that any observable A must commute with P b and P f and thus any observable W with P b and P f as spectral projections. W is then a superselection observable.
We observe that the stipulation that observables of S commute with W leads to the equivalence of states ρ and τ S * (ρ), with τ S * (ρ) := P b ρP b + P f ρP f in this case. WWW also conjectured (subsequently proven in quantum field theory by Strocchi and Wightman [56]) that the relative phase factors in superpositions of states of different electric charge have the same status, namely cannot be determined by experiment, even in principle, and therefore that the states ρ and P n ρP n ≡ τ S * (ρ) are equivalent, with the sum running over all possible values of electric charge. Any observable must commute with charge, and must thus be invariant under shifts in phase/angle conjugate to charge.
The stipulation of such a superselection rule is formally identical to the limitation imposed by the a priori assumption that observables are invariant under symmetry (phase shifts in the charge case). Therefore, it must be understood whether the statement of a (say, electric charge) superselection rule amounts to anything more than the restriction thus far discussed. First, we discuss the reply of AS to the WWW paper, along with another model (due to Dowling et al., [50]) purporting to prepare and measure (absolutely) coherent superpositions of atoms and molecules (against baryon number superselection).
As has been shown, the requirement that observables be phase shift invariant allows for the relative phase of system and reference to be observed, with the absolute phase representing the relative phase given high reference phase localisation. The reply by Aharonov and Susskind to the WWW paper advocating the possibility of measuring relative phase factors in charge superpositions paper makes explicit use of such phase references. We now review their reply and hope to clarify their position by employing the methods and language introduced in this paper.

11.2.2.
Reply of Aharonov and Susskind: Proton-Neutron Superpositions. In favour of observability of relative phase factors between superselection rule "forbidden" superpositions, we sketch two thought experiments; the first is due to Aharonov and Susskind [9], conceived so as to demonstrate a realistic scenario in which coherent superpositions of states of different electric charge can be prepared and measured. The second, due to Dowling et al. [50], is similar in spirit, and purports to prove that atoms and (diatomic) molecules can be (absolutely) coherently superposed. 9 It will be shown that in both of these examples there is an implicit relativisation of the operators to be measured, thereby constructing an invariant operator (observable) not unlike the ones we have discussed. Furthermore, in both cases a crucial role is played by the limit of high localisation of a reference state (in both cases provided by a coherent state) with respect to a covariant phase-like operator conjugate to the symmetry generator, in direct analogy to the models and general results that have been presented. To our knowledge, it has not been explicitly stated anywhere that such localisation is the key property.
The Hilbert space H S ⊗ H R 1 ⊗ H R 2 of Aharonov and Susskind's thought experiment is to correspond to a proton-neutron system S and two cavities (R 1 , R 2 ) capable of containing any integer number of negatively charged mesons. Aharonov and Susskind imagine preparing R 1 and R 2 in charge-coherent states (we include normalisation factors that were omitted in the original treatment) and respectively, where |n denotes a charge eigenstate corresponding to n negatively charged mesons. The parameters q 1 and q 2 represent the respective mean charge values in the coherent states, corresponding to the observables Q 1 , Q 2 , which are structurally identical to the number operators we have encountered thus far, except that n takes (only) non-positive values. The initial state of the nucleon is a proton |P , and we will use |N to represent a neutron. |P and |N are thus eigenstates of the charge observable Q S of S with eigenvalues 1, and 0, respectively. The dynamics, which take place in two stages, are governed by a Jaynes-Cummingstype Hamiltonian (which commutes with charge) H = g(t)(σ + a − + σ − a + ) where σ + = |N P |, σ − = |P N | (sometimes referred to as the isospin operators), and a ± are meson creation and annihilation operators which act on the states of the cavities. The function g(t) describes the interaction strength and fixes the duration of the interaction (given physically by the passage time of the nucleon travelling through the cavity). Explicitly, the dynamics are governed by The unitary U 1 effects the following transitions on charge eigenstates (omitting the second cavity): |P |n −→ cos g √ n + 1 |P |n + i sin T g √ n + 1 |N |n + 1 .
Referring back to equation (87), these are of an almost identical form. Analogous to what we saw there, we find here that we may measure the observable |P P | ⊗ ½ in the state U 1 |P |n to find the proton probability cos 2 T g √ n + 1 .
Starting with the initial state Ψ 0 = |P |q 1 , θ |q 2 , θ ′ , the state after the first cavity is One must then consider the limit of large q 1 , which yields The nucleon is then approximately "separated" from the cavities; it enters the second cavity and exits, this time in the large q 2 limit, as + i e iθ ′ cos (gT √ q 1 ) sin (gT √ q 2 ) + e iθ sin (gT √ q 1 ) cos (gT √ q 2 ) |N |q 1 , θ |q 2 , θ ′ .
As observed, the proton probability (i.e., tr [|0 0|U Ψ 0 ]) now depends on θ − θ ′ , the relative phase between R 1 and R 2 . Therefore, as argued by Aharonov and Susskind, the nucleon is in a coherent superposition of proton and neutron with relative phase (θ − θ ′ ) "when referred to the frame provided by R 2 ". The idea is that the absolutely coherent superposition is created by the first cavity (cf. (100)) and then confirmed by measuring an invariant quantity of S after passage through the second cavity. However, the model presented by AS suffers from the same kind of difficulties as discussed in subsection 10.7.
From the perspective developed in the present paper, we would instead say that in the limit of high reference system localisation, we are faced with the appearance of measuring an absolute quantity (namely a phase-like quantity sensitive to relative phase in nucleon superpositions) in an absolutely coherent state, but this is appropriately understood as pertaining to a relative phase-like observable between the nucleon and the cavities (and a mutually coherent state). The analogy to the angular momentum/angle case, as employed by Aharonov and Susskind to compel one to believe in the observability of proton-neutron superpositions, is indeed a good one. However, we argue for the opposite conclusion: it is not that since absolute coherence of states of different angular momentum is observable, therefore so is the relative phase factor in superpositions of charge states, but rather, absolute coherence for angular momentum is not possible, and nor is it in the charge case.
Indeed, it is stated quite explicitly in [9], that "the coherence of states of different angular momentum is measured relative to a frame of reference". Thus coherence itself is viewed as a relative feature; from this point of view, there is no absolute coherence of states of angular momentum. Once again, we see the importance of the mutual coherence concept. The Aharonov-Susskind paper was understood by many as proving the possibility of coherent superpositions of states of different electric charge-a situation conjectured impossible by Wick, Wightman and Wigner (WWW) [7] 15 years previously. Three years after Aharonov and Susskind's contribution, WWW demonstrated the necessity of using superpositions of states of different charge (i.e., the absolutely coherent cavity states) in order to demonstrate their existence (i.e., for the nucleon); see subsection 11.2.3. Aharonov and Susskind were alert to such a circularity and tried to avoid any possible objection in the final part of their paper, where they attempted to construct a charge eigenstate out of the two charge coherent states, providing a manifestly (phase-shift) invariant state. This takes the form of an integral ( [9], final page) where the initial state |i is then an eigenstate of charge q + q ′ and fairly well-defined phase θ ′ − θ. 10 They claim that the proton probability distribution is unchanged even when the cavities are prepared in a charge eigenstate. The following calculation demonstrates that their proposal is flawed: if the two-cavity system is prepared in a charge eigenstate, |i , under charge-conserving evolution the approximation they give can never be valid. Suppose under evolution U which conserves total charge we have where Q c φ i+1 = (i + 1)φ i+1 . Then for arbitrary ψ = α|P + γ|N , |α| 2 + |γ| 2 = 1, we note the following trivial observation: φ i+1 − ψ ⊗ |j = 0 if and only if γ = 0 and i = j.
By contrast, in the example where Thus the resulting state is a finite (norm) distance from an eigenstate, independent of the "size" of the reference system. This "fix" by Aharonov and Susskind is therefore untenable, and their conclusion that interference effects may be observed between states of different electric charge, given the restriction of not assuming its possibility from the outset, does not follow from their argument.
The approximation based on high amplitude coherent states was mathematically valid and results in states close to a product state containing proton-neutron superpositions in the system Hilbert space. High amplitude coherent states, however, already exhibit absolute coherence if the observables are not restricted to invariants (such a constraint on observables is barely mentioned in AS's paper.) The above result demonstrates that if the coherent states are replaced with a charge eigenstate, no such approximation can occur. WWW responded to the AS paper, also implicitly criticising the error, which we now discuss. 11.2.3. Response of Wick, Wightman, Wigner. Wick, Wightman and Wigner [8] responded to Aharonov and Susskind's challenge to the superselection rule for charge, making three key points which we now summarise. We note that WWW's argument was not to offer a proof of charge superselection, but rather to argue that superpositions of states of different charge cannot arise from (composition of) invariant states, charge-conserving dynamics, and subsystem separation. This therefore take place in the Schrödinger picture.
We assume that charge (Q S for S and Q R for R) may take positive and negative values, and recall that τ T * (ρ) = ∞ −∞ P n ρP n (with appropriate indices for subsystems, S and R). (1) The composition ρ S ⊗ ρ R for ρ S and ρ R invariant yields a state which commutes with total charge (i.e., is invariant) and no interference of states of different charge of S +R is possible.
Equivalently, τ S * (tr R [τ T * (ρ)]) = tr R [τ T * (ρ)]. The three steps outlined above correspond to composition, evolution and separation, respectively. Regarding observing absolutely coherent superpositions of states of different charge of S, WWW showed that it "takes one to know one"; specifically, a coherent superposition of states of different charge for R are required in order to observe them at the level of S, showing that AS's argument, as presented, is circular (in using coherent states for the cavities) or flawed (in using a charge eigenstate for the combined cavities). 10 The numbers q and q ′ pertain to the amplitude of the coherent states |q, θ1 and |q ′ , θ2 and, as such, are continuous. However, for the expression in equation (103) to represent an eigenstate of the total charge, q and q ′ must be restricted to taking integer values.
Since discussions pertaining to the type of convergence thus far encountered here have been somewhat informal in the existing work, we provide a proof in the appendix that, for example, We may write where However, one can also consider the post U 3 state; again, asymptotically and ignoring the error term we have (as given in [50]) The interpretation given in [50] is that since one can apply τ T * at every stage (under the approximation) and still achieve atom/molecule probabilities of sin 2 (φ/2) and cos 2 (φ/2) respectively, a coherent superposition of an atom and a molecule has been observed. 11 In view of the work we have presented, along with the argument of WWW [8], we do not agree with this view. Given the problems with taking the limit (violation of the conservation law, nonexistence of limit for states, unphysical nature of such a limit), we believe that the limit should not be taken in considering the fundamental status of these experiments. As such, the analysis of WWW holds, and absolute coherence cannot be observed for atom-molecule "superpositions". What is instead observed is mutual coherence, and the observability of the interference effects as given by (for example) sin 2 (φ/2) only demonstrates the feasibility of measuring relative phase factors within a sector, and the phase φ/2 should be viewed as precisely this. The large reference system, which provides high reference phase localisation, again provides the appearance of a relative phase factor at the level of the system only.
Therefore, we return once more to the main point: absolute quantities are not measurable, but represent measurable, relative quantities, with good approximation coming with good localisation (suitable, relationally, interpreted). We conclude this section with a final analysis of the two views concerning the observability of "forbidden" superpositions. 11.3.1. Analysis of the Opposing Standpoints. Following, for example, the prescription given in section 10.3, it is possible to follow WWW's three-step sequence to the letter: (1) Compose: |Ψ 0 Ψ 0 | = |0 0| ⊗ |c n | 2 |n n|; (2) Evolve: |Ψ 0 Ψ 0 | evolves according to the charge-conserving unitary defined in (79) and (80) yielding τ T * (|Ψ f Ψ f |) = n |c n | 2 P 1 √ 2 (|0,n +e iθ |1,n−1 ) (eqn.(83)); (3) Separate: tr R [τ (|Ψ f Ψ f |)] = 1/2½ (on the two-dimensional subspace spanned by {|0 , |1 ). 11 E.g., "...it is possible, in principle, to perform a Ramsey-type interference experiment to exhibit a coherent superposition of a single atom and a diatomic molecule". We note, however, that DBRS do acknowledge that in states like that appearing in Eq. (110) with the error term taken to be zero, one would be inclined to again "twirl" (τS * ) the resulting S-state 1 √ 2 |A − ie iθ |M , going "full-circle" and returning from whence we came: to an equivalence between coherent and incoherent descriptions. From this ensues a discussion of alternative tensor product decompositions of the system-reference Hilbert space-relative and global-as a way of interpreting the new state. This procedure cannot work in general (or even in the case they give), and is unnecessary anyway.
On this basis, it is clear that there can never be interference observed between |0 and |1 under the processes outlined by WWW.
On the other hand, as described in subsection 10.6, we may prepare the state |0 0| ⊗ τ R * P [Ψ 0 ], with Ψ 0 = n c n |n , choosing c n = e inθ ′ √ 2j+1 for |n| ≤ j and 0 otherwise. Then, for finite j, there exists invariant A ∈ L(H T ) so that tr [Aτ T * P [Ψ f ]] depends on θ. This Ψ f , as j becomes arbitrarily large, becomes arbitrarily close to the product state Then employing relation (11), the statistics of an invariant quantity in τ T * (P Ψ f ) are identical to the statistics in Ψ f . One finds that, for example, Ψ f | (Θ − Θ R )Ψ f gives rise to statistics which are sensitive to the relative phase e i(θ+θ ′ ) . With θ ′ = 0, one finds that Thus it appears as though one has measured an absolute observable in a superposition state.
In order to attempt to avoid the appearance of measuring an absolute quantity, the second unitary (e.g., that introduced in (87)) allows, on the system level and "once the limit has been taken", for something like this to occur: Then the observable (e.g.) |0 0| can be measured and a θ-dependent probability distribution achieved.
The upshot is that both WWW and AS/DBRS make arguments which bear out (once the errors have been remedied). The former show, quite correctly, that strictly speaking, only (absolute) coherence begets (absolute) coherence, and if you don't have it, you'll never get it, as one would expect. The latter "camp", in their attempt to show the positive possibility of creating absolute coherence from states without it, actually show the possibility of well-approximating absolute quantities and states with absolute coherence by relative quantities and states without absolute coherence. The crucial ingredients for such an approximation are mutual coherence and high localisation. 11.4. Further analysis: Superselection Reconsidered. Bartlett, Spekkens and Rudolph [6] argue a superselection rule may be "lifted", that is (we think), the following holds: a superselection rule applies to some system S. A reference frame R may be included, the superselection rule applied to S + R, whose statistics then exactly give those of S as if there weren't a superselection rule for S. This is taken as proof that "superselection rules cannot provide any fundamental restrictions on quantum theory" since, they argue, a SSR is simply a lack of an appropriate frame, which can always be introduced. 12 We do not endorse this view. First, the analysis preceding the above quote in [6] is mathematically flawed. Second, the reason given for the (e.g.,photon number) superselection rule is a practical one: agents may not share a classical phase reference. Finally, as we have noted, if the analysis is done rigorously, one sees that the "superselection-violating statistics" of S can be achieved only when there is a localised/absolutely coherent state for R, which just shifts the problem of absolute coherence from S to R. Only through the mutual coherence concept can this circularity be avoided. The question, then, is whether mutually coherent states exist in all given situations, i.e., for all phase-like quantities.
In more concrete terms, we have seen that, through the construction, absolute quantities and absolutely coherent states can arbitrarily well approximate the statistics of a relational quantity in an invariant state, contingent on a highly localised reference state. We view this statistical equivalence not as "lifting" in order to show that it can be violated for S, but rather as an expression of the fact that the ordinary usage of quantum mechanics, with its absolute quantities and absolutely coherent states, captures to a very good degree the true, physical situation represented by invariant quantities of system plus reference, in line with fundamental symmetry requirements.
The situation of "lacking a phase reference", in our conception, pertains not to the lack of shared knowledge of physicists, but to the physical scenario in which the physical system being used as a reference is completely delocalised with respect to phase, for instance, if it is a number state. This gives rise to a "reduced" description in which the structure of a superselection rule must be enforced. Whether such a reduced description afforded by absolute quantities and absolutely coherent states does yield what is observed in any given situation is an empirical question. It seems, to us, that there may be situations in which they do not, in which case a "superselection rule" stronger than that mooted for photon number could be in force. For example, there may be physical situations in which it is impossible for mutually coherent states to arise from unitary evolution of absolutely incoherent product states, making the approximation of relative quantities by absolute ones impossible. A "strong" conservation law, as presented in subsection 9.2 for instance, would have this effect.
Finally, superselection rules, as they arise in quantum field theory, correspond to inequivalent representations of the algebra of observables (possible only for systems with infinitely many degrees of freedom-also suspicious according to Earman and Butterfield) and entirely different in nature, it would seem, from the kind of constraint arising from the non-observability of absolute quantities. The connection of these with the superselection rules we have discussed in this manuscript remains a task for the future.
We conclude this section with a note of caution about the possibility of "lifting" a superselection rule arising from the indistinguishability of quantum particles. 11.4.1. A Cautionary Note. In order to urge a degree of circumspection regarding the idea that reference frames can be used to overcome superselection rules in general, we discuss now an example based on the indistinguishable particle superselection rule in which the physical meaning of a reference frame is unclear.
Consider a tensor product space L 2 (R) ⊗ L 2 (R) with the action of Z 2 which exchanges particle numbering, i.e., U (a)Ψ(x 1 , x 2 ) = Ψ(x 2 , x 1 ) (a is the non-identity element). Indistinguishability requires that any observable A satisfies [A, U (a)] = 0 (cf. [55]). Addend another Hilbert space C 2 with projectors P ( 1 0 ) and P ( 0 1 ) with Z 2 action U ′ (a)P ( 1 0 ) = P ( 0 1 ). Then by demanding invariance of observables only at the level of H 1 ⊗ H 2 ⊗ C 2 one can take an arbitrary A ⊗ B ∈ L(H 1 ⊗ H 2 ) and see that defines an invariant quantity (observable). Indeed, this is (A ⊗ B) for this (finite) group. Then, for all ϕ and φ the 'phase-localised' state φ = ( 1 0 ). Therefore one can introduce a reference system in order to "measure" particle labelling. In the BRS language, the corresponding SSR has been "lifted". However, such a "reference frame" provided by the C 2 system appears highly artificial and there is a question of whether it makes any physical sense. 11.5. Reality of Optical Coherence. In [39], Mølmer claimed that the representation of laser light using coherent states, i.e., states of the form |β := e −|β| 2 2 ∞ n=0 β n n! |n , while being legitimate for the purposes of calculation, does not reflect the true state of affairs. Actually, he claimed, that, after analysing the internal workings of laser light production in a physical system, the "actual" state is (in our notation) τ S * (P [|β ]), and (the coherence of) |β is nothing more than a 'convenient fiction'. The ensuing controversy is well described in [11] (see also references therein), where a fictional dialogue is presented between hypothetical physicists representing two groups with contrasting views: those who believe in the "fact" of optical coherence, and those who view it as fictional. Given the nature of the problem (of the reality of laser coherence), we may re-visit the controversy and provide a perspective based on the formal framework developed here (see also [5]).
The issue is whether |β and τ S * (P [|β ]) of some laser system S can be empirically distinguished, given that no invariant quantity of S can tell |β from τ S * (P [|β ]). As we have seen, however, noninvariant quantities of S can be used to represent invariant quantities of S + R, contingent on a suitable state of R. The question then is whether there is a feasible physical experiment in which |β and τ S * (P [|β ]), in their role as representing invariant states of S + R, give rise to differing physical predictions.
An absolute phase observable F S of S (in particular, the canonical phase) is mathematically suitable for separating |β from τ S * (P [|β ]). We may choose also a canonical phase for R, and use to construct the relative phase observable F T = • F S . Fixing a sequence (β R i ) ⊂ H R of coherent states with the property of becoming increasingly well localised at 0 as i becomes large, we then find that for each X ∈ B(S 1 ). From an absolute point of view, absolute coherence (of β R i for large i) is required to witness absolute coherence of |β . From a relational point of view, all that is required (for good approximation of the right hand side by the left) is mutual coherence of the pair (|β , |β R i ). The final line of equation (118) shows that the limit can be taken using only invariant states of S + R, and that an absolute phase with an absolutely coherent (coherent) state captures the statistics to arbitrarily good approximation.
Given that absolute phase observables F S can be reconstructed in homodyne detection experiments (e.g. [54]), with the reference state/local oscillator given as a high-amplitude coherent state, we conclude that laser light is mutually coherent. In the high amplitude limit, the mutual coherence takes on the appearance of absolute coherence for |β . We therefore have a resolution of the puzzle of optical coherence through the application of the 'observables are invariants' principle and the concept of mutual coherence.

Summary and Conclusion
The thesis of this paper is that observable quantities are invariant under symmetry and that, in quantum mechanical laboratory experiments, the measured statistics pertain not to some absolute quantity, but rather to an observable, relative quantity, corresponding to the system and apparatus combined, along with the appropriate high localisation limit on the side of the apparatus. This is quite general, and not specific to any particular absolute quantity, though in this paper special attention has been given to phase, angle and position.
Through our relativisation procedure, we have shown that absolute quantities with absolutely coherent states provide a good account of the observable, relative quantities (with absolutely incoherent states) under high reference localisation. In this sense, the incorporation of a reference frame into the physical description makes it look "as though" symmetry-violating statistics exist for a subsystem. However, since we argue that the description afforded by subsystem quantities is theoretical shorthand for the relative description, we do not believe it is consistent to argue that symmetry may be violated by the introduction of a reference frame. Indeed, it is the introduction of such a frame that makes symmetry explicit; some quantities simply require two systems for definition, and one of these may me called a reference frame.
Therefore, we agree with prominent physicists (Aharonov/Susskind, Bartlett/Spekkens/Rudolph) that quantum states refer not only to systems to which they symbolically refer (i.e., the system under investigation), but also to external physical objects which are not explicitly part of the theoretical description. We have shown that complete reference phase delocalisation gives rise to a reduced description formally identical to one in which a superselection rule is present, giving a new interpretation of the phrase "lack of a phase reference implies a photon number superselection rule". The idea that such a rule may be "lifted" [6], as we understand it, corresponds to the observation that a superselection rule may be applied to system-plus-reference, in which case, under reference localisation, it appears as though a superselection rule is not applicable to the system. We believe that, since the "reduced" description is not a full account of the state of affairs, it is not correct to conclude that superselection-rule-"violating" superpositions can be produced or measured. This would indicate that absolute quantities can be measured.
An important question, however, is whether, in all mooted instances of superselection rules, a reference frame may exist which makes it look like the superselection rule can be lifted or overcome. It is empirically the case that for photon number, such a frame does exist. Mutually coherent pairs of systems exist in this case, making absolute phases and coherent states a suitable shorthand description for the true, relative description, with the associated relative phase observable. On the other hand, a reference frame for lifting a superselection rule corresponding to indistiguishability appears highly suspect. As far as we know, it has yet to be settled in a laboratory whether absolute phases conjugate to charge provide an empirically adequate account.
Exploiting the continuity of cosine, for each k ∈ N define δ k such that | n m − 1| < δ k implies cos n m π 4 − cos π 4 < 1 k (and therefore f m (n) < 1 k 2 ). For each k ∈ N, let M = k 2 δ 2 k , and so for m > M , δ k > k √ m . In (122), we therefore have that n w m (n)f m (n) < n∈I k w m (n) + 3 1 Since k is arbitrary, this proves the result.