An Alternative Foundation of Quantum Theory

A new approach to quantum theory is proposed in this paper. The basis is taken to be theoretical variables, variables that may be accessible or inaccessible, i.e., it may be possible or impossible for an observer to assign arbitrarily sharp numerical values to them. In an epistemic process, the accessible variables are just ideal observations connected to an observer or to some communicating observers. Group actions are defined on these variables, and group representation theory is the basis for developing the Hilbert space formalism here. Operators corresponding to accessible theoretical variables are derived, and in the discrete case, it is proved that the possible physical values are the eigenvalues of these operators. The focus of the paper is some mathematical theorems paving the ground for the proposed foundation of quantum theory. It is shown here that the groups and transformations needed in this approach can be constructed explicitly in the case where the accessible variables are finite-dimensional. This simplifies the theory considerably: To reproduce the Hilbert space formulation, it is enough to assume the existence of two complementary variables. The interpretation inferred from the proposed foundation here may be called a general epistemic interpretation of quantum theory. A special case of this interpretation is QBism; it also has a relationship to several other interpretations.


Introduction
For an outsider, one of the really difficult things to accept about quantum mechanics is its state concept: The state of a physical system is given by a normalized vector in a complex separable Hilbert space.One question that I will raise here, is whether this state concept can be derived, or at least motivated, by some other considerations.The discussions given here, may be seen as a continuation of the discussions in the book [1].
My point of departure will be the notion of theoretical variables, variables attached to some observer or to a group of communicating observers.This will be taken as a very wide concept, but it includes physical variables like position, time, momentum and energy.The term 'theoretical' points to a distinction from real, measured variables, that in a typical connection may be seen as a theoretical variable plus some random error.
The notion of theoretical variables can also be connected to several interpretations of quantum mechanics.To give an example of such a connection, according to Relational Quantum Mechanics, see Rovelli [2] and van Fraassen [3], variables take values only at interactions, and the values that they take are only relative to the other system affected by the interaction.This other system might well be an observer, and I will think of such a situation.
Other examples of theoretical variables are decision variables in a quantum decision context.This will be briefly discussed below, and is discussed in more details in [5].
Variables which can take definite values relative to an observer or group of observers will be called accessible.But relative to the observers, there may also be other theoretical variables which I will call inaccessible.An example may be the vector (position, momentum) connected to the observation of a particle.Another example may be the spin vector of a particle, where I think of a mental model of the spin, where the spin components are seen as quantizised values of the projection of this vector upon some given direction.
These inaccessible variables must be seen as purely mathematical variables; from a physical point of view, they do not take any values, but nevertheless they can be seen as variables.
The distinction between accessible and inaccessible variables is very important to my approach.On the one hand, this distinction can be explained roughly to outsiders.On the other hand, one can give many precise physical examples.I am currently working on a book together with Harish Parthasarathy, where we among other things will illustrate this distinction on several examples from quantum field theory.
In this article, I discuss these notions more closely with a focus on the more mathematical aspects of the situations described above.First, concentrate on the variables and the corresponding group theory.I assume the existence of a concrete (physical) situation, and that there is a space Ω φ on which an inaccessible theoretical variable φ varies, with a group K acting on this space.There is at least one accessible theoretical variable, θ , defined, where it is assumed that it can be seen as a function on Ω φ .This is a crucial assumption.
This θ varies on a space Ω θ , and the group K may or may not induce a transformation group G on Ω θ .In any case, I will focus on such a group G on Ω θ , whether it is induced by K or not.An essential requirement is that G is transitive on Ω θ .It is shown below that the existence of G, together with symmetry assumptions assumed by the author in [4], will be satisfied without further assumptions in the finite-dimensional case.
A special situation is when φ is a spin vector, and θ is a spin component in a given direction.In the simple spin situation, the natural group K for the spin vector does not directly induce groups on the components.But does so if we redefine φ to be the projection of the spin vector upon the plane spanned by two different directions in which spin can be measured and take K to be the corresponding rotation group.
When there are several potential accessible variables, I may denote this by a superscript a: θ a for the variables and G a for the groups, with elements g a .Both here and in [1,4,5] I use the word 'group' as synonymous to 'group action' or transformation group on some set, not as an abstract group.
The article has some overlap with the papers [4,5], but the results there are further discussed and clarified here.
My derivations here will be independent of other Hilbert space reconstructions in the literature, but they will in some sense compete with several rather deep investigations recently on deriving the Hilbert space structure from various assumptions [6,7,8,9,10].By relying on group representation theory, I use at the outset some Hilbert space structure, but this is by construction, not by assumption.The construction is indicated to be realizable in the important finite-dimensional case.It is interesting to see, as stated in [11], that there is a problem connecting the above general derivations to the many different interpretations of quantum theory.By contrast, the derivation presented here seems to lead naturally to a particular interpretation: A general epistemic interpretation.
There is a large literature on different interpretations of quantum mechanics.In my opinion, one should try to clarify the questions around quantum foundations before one goes into a deeper discussion of possible interpretations.Brief discussions of interpretation are given in a later Section of this article.This discussion is in agreement with the view expressed by Robert Spekkens in a recent video: It may be seen as a categorical error to look upon quantum theory as describing the real world; it is a theory of our knowledge of the world.Of course the word knowledge can be interpreted in many ways, and an epistemic view upon quantum mechanics may be discussed, but in my opinion a focus on describing our knowledge of the world leads to more attractive interpretations of the theory.
There has been many attempts in the literature to motivate the Hilbert space formalism.What is special about my approach?First, it is based upon notions that can be communicated to outsiders with a limited mathematical background.Secondly, it is based on very few postulates.One postulate, Postulate 3 in Section 3 below, may be seen by some people as strong, but this postulate is discussed here from several points of view.One view is a possible relation between science and religion; see also [77,80].
Finally, I see it as crucial that my main result, also formulated in Section 3, in addition to the postulates only assumes the existence of just two maximal accessible variables in order to establish essential elements of the Hilbert space apparatus, These two variables may be seen, in the language of Niels Bohr, to be complementary.
One of the last derivations of the Hilbert space formalism is Brezhnev [78], where also other derivations are referred to.[78] is based upon an unlimited validity of the superposition principle.As a contrast, in the present approach the operators associated with theoretical variables are derived first, and a ket vector is only seen as a valid state vector if it is an eigenvector of a physically meaningful operator.This leads to a limitation of the superposition principle, but in my opinion, it facilitates the interpretation of quantum mechanics.
Several applications of the theory given here can be mentioned.One application is to give an explanation [34,61] of the fact that Bell's inequalities may be violated in real experiments, contrary to what could be expected intuitively.Other applications are discussions of the two-slit experiment and the paradoxes connected to Schrödinger's cat and Wigner's friend [47].In [47] I also consider links to field theory and to general relativity theory, themes that are currently under further considerations.
The link to quantum decision theory [5] is also under further considerations.One goal is to find connections to statistical inference theory, where my accessible theoretical variables will be statistical parameters.(I have avoided the term 'parameter' in this article, since in physics this word has other meanings.)Such interdisciplinary investigations would have been impossible with the common quantum theory formalism.
In my proofs I use group representation theory in an essential way.Group representation theory in discussing quantum foundation has also been used in other places; see for instance [12].In quantum field theory and particle physics theory, the use of group representation theory is crucial [13,79].
Sections 2 and 3 give some background.In Sections 4 and 5 I introduce some basic group theory and group representation theory that is needed in the paper.Then, in Sections 6 and 7, I formulate my approach to the foundation of quantum theory.Simple postulates for the relevant situations are assumed.From this, the ordinary quantum formalism is derived, and it is shown how operators attached to accessible physical variables may be defined.A brief comparison with some other approaches towards quantum foundation is included in Section 8.In Section 9 a corresponding interpretation of quantum mechanics is discussed, in Section 10 a slightly different approach based upon category thory is discussed, and Section 11 gives some concluding remarks.
To complete the derivation of quantum theory along these lines, one will also need a derivation of the Born rule under suitable conditions, and a derivation of the Schrödinger equation.A very brief discussion of this is included in the crucial Section 7 for completeness; a more thorough discussion is given in [1].The main focus of the present article is the construction of the Hilbert space apparatus from simple assumptions and the corresponding interpretation.

Convivial Solipsism
In Hervé Zwirn's paper [30] it is shown how a long range of foundational issues in quantum mechanics, including the famous measurement problem, can be enlightened using a new philosophy called Convivial Solipsism.The basic thesis is: Every description of the world must be relative to some observer.But different observers can communicate.Mathematically the philosophy rests upon Everett's relative state formulation of pure wave mechanics [49,53]; see also [71] for philosophical issues.The distinction between absolute states, which can be taken to describe the whole, includ-ing physical system, measuring apparatus and observer, on the one hand, and relative states, describing, say, the state of an observer's brain after a measurement, is crucial.
In concrete terms, assume a system which can be in one out of two states |S1⟩ or |S2⟩; the corresponding states of the measurement apparatus are |E1⟩ and E2⟩, while this induces states of the observer's brain |B1⟩, respectively |B2⟩.All these are relative states.The global state will be of the form Zwirn distinguishes between the states of the brain and the 'states' of the consciousness of the corresponding observer, which can be denoted by B1 or B2.From this, the measurement problem and other problems related to the interpretation of quantum mechanics are discussed.The basic principles behind this discussion are the hanging-up mechanism and the relativity of states assumption; see [30].
All this presupposes a foundation of quantum mechanics based upon ket vectors as representing physical states.Below, I will try to avoid this formal apparatus as a starting point and start with completely different notions.But I will keep Zwirn's basic thesis.

Motivation, first postulates, and some basic results
Can one find a new foundation of quantum theory, a foundation that ultimately leads to the full theory, but at the same time a foundation that can also be explained to persons who never have been exposed to the ordinary Hilbert space machinery?
My answer is yes.I have up to now discussed my approach in two books [1,24] and in several papers [4,5,25,34,47,61,73]. Now I aim to collect all the mathematical arguments in a single article and also give results beyond the above books and articles.
My basic notion is that of a theoretical variable, which is a very wide notion.This variable can be a physical variable, a statistical parameter, a future data variable, a decision variable, or perhaps also other things.In this discussion, the variables can always be seen as mathematical variables.I divide the variables into accessible ones and inaccessible ones, as briefly discussed in the Introduction.From a mathematical point of view, I only require that if θ is accessible and λ is a function of θ , then λ is also accessible.
Here are some examples: 1) Spin of one particle.An observer A can have the choice between measuring the spin component in the x direction or in the z direction.This gives two different accessible variables related to A. An inaccessible variable is the unit spin vector φ , which we think of as a three dimensional vector such that the spin component in a certain direction is a discretized version of the projection of φ in that direction.In the qubit case, the spin component in any direction a can be modeled as a simple function of φ : θ a = sign(cos(a, φ )), taking the values -1 or +1.A correct distribution of each θ a will result if we let φ have a uniform distribution on the unit sphere.By using the Born rule, which requires a separate derivation, a conditional distribution of θ b , given θ a , a ̸ = b, can also be found.This is the first example of quantum mechanics discussed in the book [48].
2) The EPR situation with Alice and Bob.For an independent observer Charlie, the unit spin vectors of both are inaccessible, say φ A and φ B .But it can be shown that the dot product of the two is accessible to Charlie: η = φ A • φ B .Specifically, one can show that Charlie is forced to be associated with an eigenstate for the operator corresponding to the variable η, which is the entangled singlet state corresponding to η = −3.One can show that this implies that for Charlie and for the measured components in some fixed direction a, the component of Alice is opposite to the component of Bob.Note that Charlie can be any person.See Subsection 7.5 below for more details.
3) The Bell experiment situation.Look at the subsample of data where Alice measures her spin component in direction a and gets a response A, either -1 or +1, and where Bob measures in a direction b and gets a similar response B. For an independent observer Charlie, analysing the data after the experiment, both A and B are accessible.This implies by Born's formula -anticipating this formula -a fixed joint distribution of A and B. But Charlie has his limitations, as in 2).In an article [34] in Foundations of Physics, I discuss what this limitation implies for him, using my point of view.This may be used to explain the now well-known violation of Bell's inequality in practice.Unfortunately [34] contains some smaller errors, which were corrected later in the article [61].
4) Consider first a general decision problem with two alternatives.In the simplest case the observer A knows the consequences of both choices, they are accessible.But in more complicated cases, the consequences are inaccessible, and hence the results of his choice are inaccessible.Then an option can be to make a simpler sub-decision, where he knows the consequences.Alternatively, a decision variable can be connected to a decision process, and this decision variable can be said to be accessible if A is able to make a decision.Maximal accessible decision variables seem to be of some interest here.Quantum decision problems are discussed in [5], where further references are given.See also Subsection 7.4 below.
All these examples can be coupled to my approach towards quantum mechanics.I will now sketch the basic elements of this approach.
My point of departure is a statement of Hervé Zwirn's Convivial Solipsism, as noted before: Every description of the world must be relative to some observer A or to a group of communicating observers.I will assume that this observer/ these observers is/ are in some fixed physical or non-physical context.My primitive concept is then theoretical variables related to this situation.
A concrete area where a non-physical context is meaningful and required is quantum decision theory; see Subsection 7.4.

Postulate 1:
If η is a theoretical variable and γ = f (η) for some function f , then γ is also a theoretical variable.
The theoretical variables may be accessible or inaccessible to A (or to the group).Very roughly we can say: If θ is accessible, A will, in principle in some future be able to find an accurate value of θ as he likes.But as a referee remarks, this rough definition raises many questions: What is meant by the future?Is the accuracy limited by the Planck length?-and so on.So I will just take 'accessible' as a primitive notion.From a mathematical point of view, I only assume: Postulate 2: If θ is accessible to A and λ = f (θ ) for some function f , then λ is also accessible to A.
The crucial model assumption is now the following (; see also [1,4,5]): Postulate 3: In the given context there exists an inaccessible variable φ such that all the accessible ones can be seen as functions of φ .

First look at two physical examples:
A) Consider a Stern-Gerlach experiment where the spin component of a particle is measured in an arbitrary direction a.These spin components are accessible variables, and can, as noted in 1) above be seen as functions of an inaccessible variable, the full spin vector, which only exists theoretically relative to the relevant observer.
B) The position of a particle is accessible, and so is its momentum.Both are functions of the variable φ =(position, momentum), which is inaccessible by Heisenberg's inequality.
In any more general mathematical setting, following ideas by Palmer [76], Postulate 3 can be motivated as follows in the finite-dimensional case, say where maximal accessible variables have dimension n. (See Definition 1 below for a definition of 'maximal'): Assume that the variable φ is generated by some chaotic process, and let it be written in base-n.Then we can let the maximal accessible variable θ be determined by the millonth digit in φ , the maximal variable η be determined by digit number 1000 003, and so on.By Postulate 4 below, all accessible variables can be seen as some function of a maximal accessible variable, so Postulate 3 will hold in this setting.Note, however, that with this choice of φ , it seems difficult to define a transitive group K on Ω φ such that the functions θ (•) are permissible with respect to K (see later).
A more satisfying explanation of Postulate 3, is to assume some relation between science and religion.I know that many scientists are skeptical to such relationships, but here it seems to me to clarify the possible background of quantum theory.One can simply assume that φ is known to God, but unknown to us humans.This also explains why one can think of a quantum foundation attached to a human observer.Presumably, quantum mechanics and its generalizations have existed in some form forever, but human beings only for some million of years.
As a background for my theory, I assume that God has existed forever, and so have basic physical laws.A rather common assumption in various religions is that we humans are created in God's image.Thus, in very metaphysical terms, one can simplify a theory of God's mind by a theory of an observer's mind, which is partly done in this article.For more on my views on religion and quantum foundation, see [77,80].
In general, it should only be noted that φ is a variable and is not assumed to take any value known to us.This is also analoguous to the main assumptions of probability theory, which is based upon functions on a probability space (Ω, F , P), with no assumption that ω ∈ Ω is known to us.
It will be shown below that Postulate 3, taken together with some symmetry assumptions, has far-reaching consequences.It will imply the existence of the whole Hilbert space apparatus, in particular, that each accessible variable has a unique symmetric operator connected to it.These symmetry assumptions will be shown in this article to be satisfied when all accessible variables take a finite number of values.
One can consider a concrete context with an observer A or with a set of communicating observers in this context.Let φ be an inaccessible theoretical variable varying in a space Ω φ .It is a basic philosophy of the present paper that I always regard groups as group actions or transformations, acting on some space.
Starting with Ω φ and a group K acting on Ω φ , let θ (•) be an accessible function on Ω φ , and let Ω θ be the range of this function.
As mentioned in the Introduction, I regard 'accessible' and 'inaccessible' as primitive notions.But they have concrete interpretations, at least in the physical case: Roughly, a physical variable θ is called accessible if an observer, by a suitable measurement, can obtain as accurate values of θ as he wants to.More prcisely, from a mathematical point of view, I only assume that if θ is accessible, and λ can be defined as a fixed function of θ , then λ is also accessible.
If Ω θ and Ω φ are equipped with topologies, all functions are assumed to be Borelmeasurable.
Definition 1.The accessible variable θ is called maximal if the following holds: If θ can be written as θ = f (ψ) for a function f that is not surjective, the theoretical variable ψ is not accessible.In other words: θ is maximal under the partial ordering defined by α ≤ β iff α = f (β ) for some function f .Note that this partial ordering is consistent with accessibility: If β is accessible and α = f (β ), then α is accessible.Also, φ from Postulate 3 is an upper bound under this partial ordering.
Postulate 4: There exist maximal accessible variables relative to this partial ordering.For every accessible variable λ there exists a maximal accessible variable θ such that λ is a function of θ .
Two different maximal accessible variables come very close to what Bohr called complementary variables; see Plotnitsky [37] for a thorough discussion.
It is crucial what is meant by 'different' here.If θ = f (η) where f is a bijective function, there is a one-to-one correspondence between θ and η, they contain the same information, and they must be considered 'equal' in this sense.θ and η are said to be 'different' if they are not 'equal' in this meaning.This is consistent with the partial ordering in Definition 1.The word 'different' is used in the same meaning in the Theorems below.
Postulate 4 can be motivated by using Zorn's lemma -if this lemma, which is equivalent to the axiom of choice, is assumed to hold -and Postulate 3, but such a motivation is not necessary if Postulate 4 is accepted.Physical examples of maximal accessible variables are the position or the momentum of some particle, or the spin component in some direction.In a more general situation, the maximal accessible variable may be a vector, whose components are simultaneously measurable.
In example A) the individual spin components can be taken to be maximal.In example B) both position and momentum are maximal as accessible theoretical variables.
A statistical model for position measurement might be that the measured position is equal to the theoretical position plus noise.In this model, the accessible variable 'theoretical position' is a statistical parameter.
These 4 postulates are all that I assume.The first goal of this article is to prove through some mathematical arguments versions of the following theorem: Theorem 0: Assume that there relative to an observer A in some given context among other variables exist two different maximal accessible variables, each taking n values.Assume that these two are not bijective functions of each other.Then there exists an n-dimensional Hilbert space H describing the situation, and every accessible variable in this situation will have a unique self-adjoint operator in H associated with it.
This Theorem can be seen as my first starting point for developing the quantum formalism from simple postulates.In a more general version of this Theorem, see [4], it is assumed that the two basic variables above are related.This may seem to be necessary in the more general case, but can be dispensed with.The property of being related will be defined here: Definition 2 Let θ and η be two maximal accessible variables in some context, and let θ = f (φ ) for some function f .If there is a transformation k of Ω φ such that η(φ ) = f (kφ ), we say that θ and η are related (relative to this φ ).If no such (φ , k) can be found, we say that θ and η are non-related relative to the variable φ .
It is easy to show that the property of being related is an equivalence relation.And if θ is maximal, it follows from the relationship property that η above also is maximal.Finally, if G is a group acting on Ω θ , there can be defined a corresponding group H acting on Ω η by hη(φ ) = g f (kφ ) if η(φ ) = f (kφ ).The mapping from g to h is an isomorphism.
Theorem 0 is identical to Theorem 6 of Section 7, which can be deduced from the more general Theorem 4. Note that the crucial assumption is that we have two -in Niels Bohr's terminology -complementary variables.
Using the above 4 postulates in the finite-dimensional case, further results can be proved, among other things: -The eigenvalues of the operator associated with θ are the possible values of θ .
-The accessible variable θ is maximal if and only if all eigenvalues are simple.
-The eigenspaces of the operator are associated with one of several variables, say θ .are in one-to-one correspondence with questions of the form 'What is θ ?/What will θ be if it is measured?' together with sharp answers 'θ = u' for some u.In the maximal case, this gives a simple interpretation of eigenvectors.
To show all this in detail requires some further conceptual developments.In particular, for the most general version of my approach, I need: Postulate 5 One can define a group K of actions on the space Ω φ associated with φ .For at least one maximal accessible variable θ there is a group G of actions on the associated space Ω θ .I will also need the following: Definition 3. The accessible variable θ is called permissible with respect to K if the following holds: θ (φ 1 ) = θ (φ 2 ) implies θ (tφ 1 ) = θ (tφ 2 ) for all group elements t ∈ K.
With respect to parameters and subparameters along with their estimation, the concept of permissibility is discussed in some detail in Chapter 3 in [24].The main conclusion, which also is valid in this setting, is that under the assumption of permissibility, one can define a group G of actions on Ω θ with elements g defined for any t ∈ K by (gθ )(φ ) := θ (tφ ); t ∈ K. ( Herein I use different notations for the group actions g on Ω θ and the group actions t on Ω φ ; by contrast, the same symbol g was used in [24].The background for that is Lemma 1. Assume that θ is a permissible variable.The function from K to G defined by ( 2) is then a group homomorphism.
Proof.See [25].□ In general, whether the function θ (•) is permissible or not, I will assume that a transitive group G is acting on θ , and this is enough for the general Theorem 4 below, from which Theorem 6/Theorem 0 can be deduced.To prove the above properties of the Hilbert space formulation, I need a further basic result.Theorem 5 of Section 8 is the general result, and Theorem 7 is a simpler version, valid in the finite-dimensional case.
Note that my approach here can be seen as fully epistemic.It has to do with an agent seeking knowledge.In the finite-dimensional case, we may concentrate on state vectors that are eigenvectors of some meaningful operator.If this operator is associated with a maximal accessible variable θ , then in general these state vectors have interpretations as questions-and-answers: First, look at questions of the form: 'What is θ ?' or 'What will θ be if we measure it?'.Then consider sharp answers of the form 'θ = u', where u is an eigenvalue of the operator corresponding to θ .
To show this requires some mathematics, given in this article, where also a further discussion is given.What is lacking in this development, are arguments for the Schrödinger equation and for the Born formula from simple assumptions.These issues will also be briefly discussed in Section 7 below, and are discussed in more detail in [1].
It is crucial for my development that operators corresponding to the accessible theoretical variables are found first.As a consequence of this article I can consider a version of quantum mechanics where a ket vector is only seen as a state vector when it is an eigenvector of some physically meaningful operator.
The development above was limited to a single observer A. Now the same mathematics applies to the following situation: There is a set of communicating observers, and jointly accessible or inaccessible variables in some context are associated with these observers.There may be difficulties, in general, to establish what the basic inaccessible variable φ should be in the latter situation, but at least in the two physical examples above the construction is clear.Through discussions the set of observers can establish their theoretical variables, and find out which of them are accessible.The only difference now is that, in order to secure communication, the variables must be possible to define by words.
Note that in this whole discussion, I have said nothing about the ontology.I am fully convinced that there exists an external world, but the detailed properties of this world may be outside our ability to find out.Quantum mechanics as a model, although it is a very good model, can sometimes only give partial answers.Ontological aspects of my approach are discussed in [73].
I admit that this approach is unusual and that the postulates to some may seem a little farfetched.However, for an outsider, I will claim that it is much easier to understand these postulates than jumping right into the usual Hilbert space formalism.For those of us who have learned formalism, the approach may in some sense require some unlearning first.
Possible relationships between my approach and some other approaches towards the foundation of quantum mechanics will be briefly discussed in Section 8 below.

Group actions and measures
Starting with a point θ 0 ∈ Ω θ , an orbit of a group G acting on Ω θ is the set {gθ 0 : g ∈ G}.It is trivial to see that the orbits are disjoint, and their union is the full space Ω θ .The point θ 0 may be replaced by any point of the same orbit.In the case of one orbit filling the whole space, the group is said to be transitive.
The isotropy group at a point θ ∈ Ω θ is the set of g such that gθ = θ .It is easy to see that this is a group.
It is important to define left and right invariant measures, both on the groups and on the spaces of theoretical variables.In the mathematical literature, see for instance [26,27], Haar measures on the groups are defined (assuming locally compact groups).Right (µ G ) and left (ν G ) Haar measures on the group G satisfy for g ∈ G and D ⊂ G, respectively.
Next, define the corresponding measures on Ω θ .As is commonly done, I assume that the group operations (g 1 , g 2 ) → g 1 g 2 , (g 1 , g 2 ) → g 2 g 1 and g → g −1 are continuous.Furthermore, I will assume that the action (g, θ ) → gθ is continuous.
As discussed in Wijsman [28], an additional condition is that every inverse image of compact sets under the function (g, θ ) → (gθ , θ ) should be compact.A continuous action by a group G on a space Ω θ satisfying this condition is called proper.This technical condition turns out to have useful properties and is assumed throughout this paper.When the group action is proper, the orbits of the group can be proved to be closed sets relative to the topology of Ω θ .
The following result, originally due to Weil, is proved in [26,28]; for more details on the right-invariant case, see also [24].
Theorem 1.The left-invariant measure ν on Ω θ exists if the action of G on Ω θ is proper and the group is locally compact.
The connection between ν G defined on G and the corresponding left invariant measure ν defined on Ω θ is relatively simple: If for some fixed value θ 0 of the theoretical variable the function β on G is defined by β : g → gθ 0 , then ν(E) = ν G (β −1 (E)).This connection between ν G and ν can also be written ν G (dg) = dν(gθ 0 )), so that dν(hgφ Note that ν can be seen as an induced measure on each orbit of G on Ω θ , and it can be arbitrarily normalized on each orbit.ν is finite on a given orbit if and only if the orbit is compact.In particular, ν can be defined as a probability measure on Ω θ if and only if all orbits of Ω θ are compact.Furthermore, ν is unique only if the group action is transitive.Transitivity of G as acting on Ω θ will be assumed throughout this paper. In a corresponding fashion, a right invariant measure can be defined on Ω θ .This measure satisfies dµ(ghφ 0 ) = dµ(gφ 0 ) for all g, h ∈ G.In many cases the left invariant measure and the right invariant measure are equal.

A brief discussion of group representation theory
A group representation of G is a continuous homomorphism from G to the group of invertible linear operators V on some vector space H : ( It is also required that V (e) = I, where I is the identity, and e is the unit element of G.This assures that the inverse exists: . The representation is unitary if the operators are unitary (V (g) † V (g) = I).If the vector space is finite-dimensional, we have a representation D(V ) on the square, invertible matrices.For any representation V and any fixed invertible operator U on the vector space, we can define a new equivalent representation as W (g) = UV (g)U −1 .One can prove that two equivalent unitary representations are unitarily equivalent; thus U can be chosen as a unitary operator.
A subspace The null-space {0} and the whole space H are trivially invariant; other invariant subspaces are called proper.A group representation V of a group G in H is called irreducible if it has no proper invariant subspace.A representation is said to be fully reducible if it can be expressed as a direct sum of irreducible subrepresentations.A finite-dimensional unitary representation of any group is fully reducible.In terms of a matrix representation, this means that we can always find a W (g) = UV (g)U −1 such that D(W ) is of minimal block diagonal form.Each one of these blocks represents an irreducible representation, and they are all one-dimensional if and only if G is Abelian.The blocks may be seen as operators on subspaces of the original vector space, i.e., the irreducible subspaces.The blocks are important in studying the structure of the group.
A useful result is Schur's Lemma; see for instance [27]: Let V 1 and V 2 be two irreducible representations of a group G; V 1 on the space H 1 and V 2 on the space H 2 .Suppose that there exists a linear map T from for all g ∈ G and v ∈ H 1 .Then either T is zero or it is a linear isomorphism.Furthermore, if H 1 = H 2 , then T = λ I for some complex number λ .
Let ν be the left-invariant measure of the space Ω θ induced by the group G, and consider in this connection the Hilbert space ).This representation always exists, and it can be shown to be unitary, see [29].
If V is an arbitrary representation of a compact group G in some Hilbert space H , then there exists in H a new scalar product defining a norm equivalent to the initial one, relative to which V is a unitary representation of G.
For references to some of the vast literature on group representation theory, see Appendix A.2.4 in [24].

The construction of operators for the hypothetical case of an irreducible representation of the basic group
In the quantum-mechanical context defined in [1] and discussed above, θ is an accessible variable, and one should be able to introduce an operator associated with θ .The following discussion, which is partly inspired by [29,31], assumes first an irreducible unitary representation of G on a complex Hilbert space H .In the next Section, the assumption of irreducibility will be removed, by simply assuming that we have two related maximal accessible variables in the given context.

A resolution of the identity
In the following, I assume that the group G has representations that give square-integrable coherent state systems (see page 43 of [29]).For instance, this is the case for all representations of compact semisimple groups, representations of discrete series for real semisimple groups, and some representations of solvable Lie groups.
Let G be an arbitrary such group, and let V (•) be one of its unitary irreducible representations acting on a Hilbert space H . Assume that G is acting transitively on the space Ω θ , and fix θ 0 ∈ Ω θ .Then every θ ∈ Ω θ can be written as θ = gθ 0 for some g ∈ G.I also assume that the isotropy groups of G are trivial.Then this establishes a one-to-one correspondence between G and Ω θ .In particular, this implies that the group action is proper and a left-invariant measure ν on Ω θ exists; see Theorem 1 above.Also, fix a vector |θ 0 ⟩ ∈ H , and define the coherent states |θ ⟩ = |θ (g)⟩ = V (g)|θ 0 ⟩.With ν being the left invariant measure on Ω θ , introduce the operator Note that the measure here is over Ω θ , but the elements are parametrized by G. T is assumed to be a finite operator.
, and since the measure ν is left-invariant, it follows that V (h)T = TV (h).□ From the above and Schur's Lemma, it follows that T = λ I for some λ .Since T by construction only can have positive eigenvalues, we must have λ > 0. Defining the measure dρ(θ ) = λ −1 dν(θ ) we therefore have the important resolution of the identity For a more elaborate similar construction taking into account the isotropy subgroups, see Chapter 2 of [31].In [4] a corresponding resolution of the identity is derived for states defined through representations of the group K acting on Ω φ .

Simple quantum operators
Let now θ be a maximal accessible variable and let G be a group acting on θ , satisfying the requirements of the last subsection.
In general, an operator corresponding to θ may be defined by A θ is defined on a domain D(A θ ) of vectors |v⟩ ∈ H where the integral defining ⟨v|A θ |v⟩ converges.This mapping from an accessible variable θ to an operator A has the following properties: (i) If θ = 1, then A θ = I.
(ii) If θ is real-valued, then A θ is symmetric (for a definition of this concept for operators and its relationship to self-adjointness, see [32].) (iii) The change of basis through a unitary transformation is straightforward.
For further important properties, we need some more theory.First, consider the situation where we regard the group G as generated by a group K defined on the space of an inaccessible variable φ .This represents no problem if the mapping from φ to θ is permissible, a case discussed in [4], and in this case, the operators corresponding to several accessible variables can be defined on the same Hilbert space.In the opposite case, we have the following theorem.
Theorem 2. Let H be the subgroup of K consisting of any transformation h such that θ (hφ ) = gθ (φ ) for some g ∈ G. Then H is the maximal group under which the variable θ is permissible.
Use the left invariance of ρ. □ Further properties of the mapping from θ to A may be developed in a similar way.The mapping corresponds to the usual way that the operators are allocated to observables in the quantum mechanical literature.But note that this mapping comes naturally here from the notions of theoretical variables and accessible variables on which group actions are defined.7 The main theorems and theoretical results.

The general case
Up to now, I have assumed an irreducible representation of the group G.A severe problem with this, however, is that the group G in many applications is Abelian, and Abelian groups have only one-dimensional irreducible representations.Then the above theory is trivial.
In [4] this problem is solved by taking as a point of departure two different related maximal accessible variables θ and η.Recall here the meaning of the word 'different' discussed in Sction 3. The main result is then as follows.
Theorem 4 Consider a context where there are two different related maximal accessible variables θ and η.Assume that both θ and η are real-valued or real vectors, taking at least two values.Make the following additional assumptions: (i) On one of these variables, θ , there can be defined a transitive group of actions G with a trivial isotropy group and with a left-invariant measure ρ on the space Ω θ .
(ii) There exists a unitary multi-dimensional representation U(•) of the group behind the group actions G such that for some fixed |θ 0 ⟩ the coherent states U(g)|θ 0 ⟩ are in one-to-one correspondence with the values of g and hence with the values of θ .
Then there exists a Hilbert space H connected to the situation, and to every (realvalued or vector-valued) accessible variable there can be associated a symmetric operator on H .
For conditions under which a symmetric operator is self-adjoint/ Hermitian, see [32].
The crucial point in the proof of Theorem 4 is to construct a group N acting on the vector ψ = (θ , η), and then a representation W (•) of N which I prove is irreducible.The coherent states |v n ⟩ = W (n)|v 0 ⟩ are then in one-to-one correspondence with n ∈ N.For the details of all this, I refer to Appendix 1 and to [4].
This gives the crucial identity where µ is a left-invariant measure on the group N. One can show that there is a function f θ on N such that θ = f θ (n), and a function f η on N such that η = f η (n).We can now define operators corresponding to θ and η: The properties (i)-(iii) of Subsection 5.2 can now be proved for the operators A θ and A η .All proofs are in Appendix 1.
Note that any pair of related maximal accessible variables may be used as a basis for Theorem 4. Accessible variables that are not maximal, can always be seen as functions of a maximal variable by postulate 4. Hence for these variables, the spectral theorem may be used, and operators constructed as in the last part of Appendix 1.
An essential part of the proof of Theorem 4 is to prove that if U(•) is a representation of G which is not irreducible, then W (•) is an irreducible representation of N. In order to carry out this part of the proof, I need a representation U(•) which is at least three-dimensional, so that it can be reduced to a lower-dimensional space if not irreducible, and similarly the representation of the corresponding group H acting upon η must be at least three-dimensional.(The two-dimensional case, the qubit case, is treated separately in [1].) The transformation k defining η = f (kφ ) from θ = f (φ ) cannot be just the trivial one interchanging θ and η, taken together with the assumption that the group G acting upon θ is just the identity.This is clear, since if such a trivial interchange was allowed in this case, every pair of variables would be related by the above definitions.Note, however, that according to Definition 2, the notion of being related depends on the inaccessible variable φ .If we for instance take φ as the vector (position, momentum), and have a sufficiently large group G connected to θ =position (say, the translation group), then an interchange of position and momentum is permitted relative to this φ .More concretely, this interchange will involve a Fourier transform, as in the ordinary theory.
If there is a transitive group M acting on Ω φ , then one can always, with two different maximal variables θ and ξ , find a bijective function f and a variable η = f (ξ ) such that θ and η are related.
To complete the construction of the usual Hilbert space formalism from the mathematical model of Sections 2 and 5, I need a further main theorem.
Theorem 5 Assume that the functions θ (•) and η(•) are permissible with respect to a group K acting on Ω φ .Assume that K is transitive and has a trivial isotropy group.Let T (•) be a unitary representation of K such that the coherent states T (t)|ψ 0 ⟩ are in one-to-one correspondence with t.For any transformation t ∈ K and any such unitary representation T of K, the operator T (t) † A θ T (t) is the operator corresponding to θ ′ defined by θ ′ (φ ) = θ (tφ ).
In addition, if θ and η are different, but related through a transformation k of Ω φ , there is a unitary operator S(k) such that A η = S(k) † A θ S(k).This is also proved in Appendix 1.One final remark to the developments above: The above theorems have so far been connected to a single observer A and the mathematical model of Section 3.But the same arguments can be used with the following model: Assume a set of communicating observers and assume that these have defined joint variables that may be accessible or inaccessible to the set of observers.Then the same mathematics is valid, and the same mathematical/physical examples of variables may be used.

The case where the maximal accessible variable takes a finite number of values
I will show here first that if θ takes a finite n number of values, then we can choose G and k such that the symmetry assumptions of Theorem 4 are satisfied.This leads to a simplification of the theory.The case n = 2, the qubit case, is discussed separately in [1]; see Subsection 4.5.3 and Section 5.2 there.I will here assume n ≥ 3, which in fact can be shown to be needed in the proof in Appendix 1 of Theorem 4.
In the finite case, it is crucial that the reducibility of the representation U(•) is permitted.Concretely, let G be the cyclic group acting on the distinct values u 1 , ..., u n of θ , that is, the group generated by the element g 0 such that g 0 u i = u i+1 for i = 1, ..., n−1 and g 0 u n = u 1 .This is an Abelian group, which only has one-dimensional irreducible representations.However, we can define U(•) as taking values as diagonal unitary n × n matrices with different complex nth roots of the identity on the diagonal.For the specific matrix U(g 0 ), take these nth roots in their natural order, and then let every It is easy to see then that the coherent states U(g)|θ 0 ⟩ are in one-to-one correspondence with the group elements g ∈ G when |θ 0 ⟩ is a unit vector with one element equal to 1 and the others zero, and this can be generalized to any |θ 0 ⟩.Also, G is transitive in its range and has a trivial isotropy group.
Thus the only assumption of Theorem 4 that is left to verify, is the assumption that η can be found as a related variable to θ , that is, the existence of an inaccessible variable φ and a transformation k in the corresponding space Ω φ such that η(φ ) = θ (kφ ).
To this end, let Ω φ be the three-dimensional unit sphere, plot the values of η along the equator E, and the values of θ along the great circle F containing the south pole and the north pole.See Figure 1.
Without loss of generality, we can let the values of θ and η be equidistant.(If this is not the case, we can define new variables θ and η by taking one-to-one functions.If operators are proved to exist for these functions, operators of the original variables can be constructed from the spectral theorem.)If these values are plotted in a corresponding way, we can transform the values of θ onto the values of η by a 90 o rotation k of According to Postulate 5 of Section 3, there always exists a group K acting on Ω φ .The question is now whether this group can be constructed such that the symmetry assumptions of Theorem 5 also are satisfied.Take as a basis the cyclic group G acting on the values of θ , and let a corresponding group H act on the values of η.Without loss of generality, assume these values to be equidistant.Construct K, acting on two copies of the sphere as in Figure 2 as follows: Make a grid on both spheres by discretizing the longitude angle λ and the latitude angle φ as on the Figure (not to be confused with the basic inaccessible variable φ ).In the first copy, this includes the lines E (representing θ ) and F (representing η).In the second copy, E (θ ) and F (η) are switched.The states are the intersections of meridians and latitude circles.In the first copy, let G act on the latitude circles and H act on the meridians.In the second copy, G and H are switched.This gives a version of G ⊗ H acting on both copies.We let K consist of this and the following elements: From the North pole and the South pole of both copies there is a group action element j going from one copy to an arbitrarily chosen state of the other.This state can be chosen by a uniform distribution.
The representation U(•) of G is taken as above, we construct a similar representation V (•) of H, and we take T (K) = W (K) be the irreducible group used in the proof of Theorem 4, where K is extended as above.
Using this geometry, the following must be proved, but proofs are omitted here: 1) K is transitive on its values and has a trivial isotropy group.
2) θ (•) and η(•) are permissible functions of the state variable φ with respect to K.
3) Taking as |ψ 0 ⟩ one of the points where the great circles E and F intersect, the coherent states T (t)|ψ 0 ⟩ are in one-to-one correspondence with the group elements t ∈ K.
From this, the conditions of the first part of Theorem 5 are satisfied, and we have: Theorem 7 Assume Postulate 1 to Postulate 4 of Section 3, and that there exist two different accessible variables θ and η, each taking n values.Let T (•) be the unitary representation of K defined above.Then for any transformation t ∈ K, the operator T (t) † A θ T (t) is the operator corresponding to θ ′ defined by θ ′ (φ ) = θ (tφ ).
For the special transformation k above, we have A η = S(k) † A θ S(k) for some unitary matrix S(k). .From these two theorems follow a rich class of results, as discussed in detail in [1] (The first part of Theorem 7 is not needed for these results): -Every accessible variable has a self-adjoint operator connected to it.
-The set of eigenvalues of the operator is equal to the set of possible values of the variable.
-An accessible variable is maximal if and only if all eigenvalues are simple.
-The eigenvectors can, in the maximal case, be interpreted in terms of a question together with its answer.Specifically, this means that in a context with several variables, a chosen maximal variable θ may be identified with the question 'What will θ be if we measure it?' and a specific eigenvector of A θ , corresponding to the eigenvalue u may be identified with the answer 'θ = u'.
-In the general case, eigenspaces have the same interpretation.
-The operators of related variables are connected by a unitary similarity transform.
For the proofs of the second and third statements above, see Appendix 2. The third and fourth statements can be taken as basis for my proposal of a version of quantum mechanics: A ket vector is only seen as a valid state vector if it is an eigenvector of a physically meaningful operator.This requires a separate discussion of the superposition principle, In my theory, the only valid state vectors are related to some variable η and the possible values v k of this η.Assume for simplicity that η is maximal.Then the statement η = v k corresponds to an eigenvector |v k ⟩ of A η , with the resolution of the identity This implies a large class of possible new state vectors but it does not follow from this that every superposition of the orthogonal ket vectors |v k ⟩ can be written in this way.Note that |ψ⟩ also has an interpretation in terms of some variable θ and some statement θ = u.The physical situation may be that we know something about θ or know something about η, but we can also be without such knowledge.The most definite statement about knowledge of η is of the form η = v, but we may also know probability statements of the form {π(v k )}, which is formalized by a density matrix As is well known, this leads by Born's formula to probability statements for θ : Going back to the superposition principle: Let |α⟩ and |β ⟩ be two different state vectors.Then |α⟩ can be connected to a statement γ = u for some theoretical variable γ, or equivalently, c(γ) = c(u) for any bijective c, and |β ⟩ can be connected to a statement ξ = v for some theoretical variable ξ , or equivalently, d(ξ ) = d(v) for any bijective d.By Postulate 4, if γ and ξ are not maximal, they are functions, say f 1 and f 2 , of some maximal variables, say θ and η, respectively.
We have two possibilities.Either θ and η are bijective functions of each other.Then every states connected to them can be expressed in the same basis {|v k ⟩}.Or they are different.Then by Theorem 6 they can be taken to construct the Hilbert space and the necessary operators.
In either case we have by ( 9) and ( 10) and the spectral theorem, This is the operator of the theoretical variable λ = aγ + bξ , or more generally λ = ac(γ) + bd(ξ ) for some bijective mappings c and d, and every state |w⟩ associated with λ = w can be expressed as Postulate 6: If {u i }; i ∈ I are the possible values of the accessible variable γ, {v j }; j ∈ J are the possible values of the accessible variable ξ , and λ = f (γ, ξ ) is accessible, then the possible values of λ is contained in { f (u i , v j }; i ∈ I, j ∈ J. Theorem 8 Assume that |α⟩ and |β ⟩ are possible state vectors, and |α⟩ can be associated with an event f 1 (θ ) = u, and |β ⟩ can be associated with an event f 2 (η) = v.Here θ and η are two different meaningful maximal accessible variables.Then a|α⟩ + b|β ⟩, (a ̸ = 0, b ̸ = 0) is a possible state vector if and only if there exist bijective functions c and d such that λ = ac( f 1 (θ )) + bc( f 2 (η)) is a meaningful variable. .Proof.As above, if |α⟩ is interpreted as γ = u, and |β ⟩ is interpreted as ξ = v, this implies an interpretation of a|α⟩ + b|β ⟩ as λ = ac(γ) + bd(ξ ) = w = au + bv for some bijective mappings c and d, and this λ is of the desired form.It is left to prove the 'only if' part.To this end, assume that θ .η and λ have the assumed properties.Without loss of generality, let c and d be identities.By Postulate 4 there exist a maximal variable µ such that λ is a function of µ.Since θ and η are different, and µ must involve η in a non-trivial way, θ and µ must be different.Then by Theorem 6 we can construct a Hilbert space based upon θ and µ, operators A µ and then the operator A λ by the spectral theorem correponding to the meaningful variable λ exist.Let {w k } be the possible values of λ .
Associated with λ = w k there must according to Postulate 6 be an i = i(k) and a j = j(k) such that w k = au i(k) + bv j(k) .Here u i(k) is a possible value of γ, an eigenvalue of A γ with a corresponding eigenvector |α i(k) ⟩, and v j(k) is a possible value of ξ , an eigenvalue of A ξ with a corresponding eigenvector |β j(k) ⟩.This gives that each a|α i(k) ⟩ + b|β j(k] ⟩ is a possible state vector, corresponding to λ = w k .□ Note the assumption in Theorem 8.For instance, in the Schrödinger cat example.let |α 1 ⟩ be the state corresponding to a cat which is known to be dead, and let |β 1 ⟩ corresponding to a cat which is known to be alive.Let |α 2 ⟩ and |β 2 ⟩ correspond to the opposite of these two events.Denote the indicators of the mentioned events γ i and ξ j .
An observer outside the box knows nothing, and is associated with the state ||α 2 ⟩ ⊗ |β 2 ⟩, corresponding to γ 2 = ξ 2 = 1.Any superposition of the states |α i ⟩ and |β j ⟩ is for him meaningless.An observer inside the box, wearing a gas mask, will know the answers, and is associated with the state given by γ 1 = 1 or ξ 1 = 1 Again, superposition is meaningless.The two observers will agree on the status of the cat once the door to the box is opened.
It is crucial now that this full theory seems to follow by -in addition to the simple Postulate1 to Postulate 4 of Section 3 -only assuming that two different maximal accessible variables exist, in Niels Bohr's terminology, the existence of two different complementary variables.Born's formula requires additional postulates, as briefly discussed in Subsection 7.6, and more fully in [1].

The case of position and momentum of a particle
It is of interest also to develop further the basis of quantum theory for the general case where θ and η are continuous theoretical variables, but this is outside the scope of the present paper.But it is fairly straightforward now to complete the theory for an important special case: Let θ be the theoretical position of some particle and let η be its theoretical momentum.I choose the accessible variables to be such theoretical variables and assume that a measurement consists of a theoretical value plus a measurement error.This is similar to how measurements are modeled in statistics.
The simplest approach is the following: Approximate θ with an n-valued variable θ n , find an operator A n corresponding to θ n , and let n tend to infinity.This approach is carried out in Section 5.3 in [1].It is shown that the Hilbert space for θ can be taken to be L 2 (R, dx), and the transformation k, which gives the operator for momentum, is a Fourier transform on this Hilbert space.The operators connected to θ and η are the usual ones.
A more direct approach, using the general theory here, is to take the group G acting on θ to be the translation group, and let the group K acting on φ = (θ , η) be the Heisenberg-Weyl group; see [31].This will not be further discussed here.

Quantum decision theory
There is a large literature on quantum decision theory; see for instance the survey article [22], the book [23] and the series of articles [38][39][40][41][42] by Yukalov and Sornette.The whole field of quantum decisions can be linked to the theory introduced here, as discussed in [5].The clue is to let my variables θ , η, ξ , ... no longer be physical variables, but decision variables.In the simplest case, a decision variable takes a finite number of values.
Let a person A be in a concrete decision situation.He is among other things faced with the choice between taking actions a 1 , ..., a n .Define a decision variable θ as equal to j if he chooses to make a decision a j .If this is linked to my theory, we have to define what is meant by accessible and inaccessible decision variables.Let θ be accessible if A really is able to perform all the actions a 1 , ..., a n , and is able to make a decision here.
If not, we say that θ is inaccessible.
To carry out this connection, we have to give meaning to all the Postulates 1 to 4 of Section 3. Postulate 1 gives no problem; all variables connected to A satisfy this postulate.Postulate 2 must be assumed.Then we consider, corresponding to the concrete decision associated with θ , simpler decisions, with decision variables λ , such that each λ is a function of θ .A way to achieve this is to let these simpler decisions be associated with disjoint subsets of the actions a 1 , ..., a n .It then seems obvious that the simpler decisions are accessible when the decision connected to θ is accessible.
Postulate 3 is a challenge here, but it can be satisfied in the following situation: Assume that A has concrete ideals when making his decisions, and he can imagine that one of these ideals has made similar decisions before, but he does not know this so concretely that he can figure out what the ideal person would have done in his concrete case.Let the inaccessible variable φ correspond to the choices that A's ideal would have done.
Postulate 4 may be justified by appealing to Zorn's lemma for the partial order defined by taking functions of decision variables.The maximal decisions that can be made by A will have a special place in the proposed quantum decision theory.
If all these assumptions are made, we now have the results of Theorem 6 and Theorem 7, which give a Hilbert space apparatus connected to the situation.We then make the assumption that A really at the same time is confronted with two difficult decisions, each involving decision variables which to him are maximal.
To complete the link to quantum decisions, we must find probabilities connected to the decision variables.For this, one can use the Born formula, which is briefly discussed below; a detailed derivation is given in [1].
I hope to discuss all this further and give concrete examples, elsewhere.

On entanglement and EPR
Consider two spin 1/2 particles, originally in the state of total spin 0, then separated, one particle sent to Alice and one particle sent to Bob.This can be described by the entangled singlet state where |1+⟩ means that particle 1 has spin component +1 in some fixed direction, and |1−⟩ means that the component is -1; similarly, for |2+⟩ and |2−⟩ with particle 2.
As in David Bohm's version of the EPR situation, let Alice measure the spin component of her particle in some direction a, and let Bob measure the spin component of his particle in the same direction.As has been described in numerous papers, there seemingly is a strange correlation here: The spin components are always opposite.
I want to couple this with the philosophy of Convivial Solipsism [30]: Every description of the world must be relative to some observer.So let us introduce an observer, Charlie, observing the results of both Alice and Bob.Charlie's observations are all connected to the entangled state (18).
Let us try to describe all this in terms of accessible and inaccessible variables.The unit spin vectors φ 1 and φ 2 of the two particles are certainly inaccessible to Charlie, but it turns out that their dot product η = φ 1 • φ 2 is accessible to him.In fact, Charlie's observations are forced to be related to the state given by η = −3.
Note that Charlie can be any person.So, we conclude: To any observing person, the spin components as measured by Alice and Bob must be opposite.This seems to be a necessary conclusion, implied by the fact that the person, relative to his observations, is related to the state given by (18).
For the further limitations of the observer Charlie and the corresponding explanation of why the CHSH inequality can be violated in Bell-type experiments, see [34,61].

The Born rule
Born's formula is the basis for all probability calculations in quantum mechanics, The version given in most textbooks can be formulated as follows: Given some mixed state ρ the expectation of the theoretical variable θ is given in terms of the associated operator A θ as E(θ ) = trace(ρA θ ).
In textbooks, Born's formula is usually stated as a separate axiom, but it has also been argued for by using various sets of assumptions [54][55][56]; see also Campanella et al. [50] for some references.In fact, the first argument for the Born formula, assuming that there is an affine mapping from a set of density functions to the corresponding probability functions, is due to von Neumann [51].Many modern arguments rely on Gleason's theorem, which is not valid in dimension 2, and also requires some assumptions, in particular the assumption of non-contextuality.In [1] a simple version of Born's formula is derived under reasonable assumptions from a Gleason-type theorem due to Busch [52], a theorem which also holds in dimension 2. It is shown that (19) can be derived from this simple version for the case where θ is maximal.In general, (19) requires an extra assumption: There is a maximal accessible variable λ such that θ is a function of λ .The distribution of λ , given θ , is uniform.

The Hamiltonian and the Schrödinger equation
First: In the approach of this article, I have concentrated on the construction of operators connected to accessible variables.From this, state vectors or ket vectors can in some generality be seen as eigenvectors of some operator; see Section 10 below and also a further discussion in [1].
During a time when no measurement is done on the system, the ket vectors are known in quantum mechanics to develop according to the Schrödinger equation: where H is a self-adjoint operator called the Hamiltonian.(Referring to the general theory above, this is the operator corresponding to the variable θ = total energy, in a context where the relevant observer or set of communicating observers also think of the complementary variable time).
In [1] I gave two sets of arguments for the Schrödinger equation, one rough and general, and then one specific related to position.The last argument also includes a discussion of the wave function.The general argument for the Schrödinger equation will be reproduced here.

The general argument for the Schrödinger equation; unitary transformations and entanglement
Assume that the system at time 0 has a state given by the ket |ψ⟩ 0 and at time t by the ket |ψ⟩ t .Let's assume that the contexts are given as follows: We can ask an epistemic question about a maximal accessible variable θ , and the ket corresponding to a specific value of this variable is |θ ⟩ 0 at time 0 and |θ ⟩ t at time t.We have the choice between making a perfect measurement at time 0 or at time t.Since there is no disturbance through measurement of the system between these two time points, the probability distribution of the answer must be the same whatever choice is made.Hence according to the simple version of Born's formula Now we refer to a general theorem by Wigner [58], proved in detail by Bargmann [59]: If an equation like (21) holds, then there must be a unitary or antiunitary transformation from |ψ⟩ 0 to |ψ⟩ t .(Antiunitary U means U −1 = −U † .)Since by continuity, an antiunitary transformation can be excluded here, so we have for some unitary operator U t .Writing U t = exp( 2πA t ih ) for some selfadjoint operator A t , and assuming that A t is linear in t: A t = Ht, this is equivalent to (20).In fact, assuming that {U t } forms a strongly continuous group of unitary transformations, the form Ht of A t follows from a theorem by Stone; see Holevo [60].
Unitary transformations of states play an important role in quantum mechanics.Both in the continuous case and in the discrete case such a transformation can be used to illuminate the state concept as introduced in the present article.More specifically, a unitary change of an operator can quite generally be coupled to a concrete change of the involved theoretical variable; see Theorem 5 and Theorem 7 above.When an operator is changed in this way, its eigenvectors are changed accordingly, hence there is a change of states.Note that, subject to linearity, a unitary operator U always can be written as U = exp( 2πHt ih ) for some suitable Hamiltonian H, so these transformations can be seen as closely related to time developments of states.
Consider the discrete case.Let the initial state be |a; k⟩ ⊗ |b; j⟩, corresponding to the answers of two focused questions: θ a = u a k and γ b = v b j .Assume that θ a and γ b are maximal.By a unitary transformation, essentially by a time development, this initial state is transformed into a state which cannot be written as a product of states in this way but is a linear combination of such states.This is an entangled state.Thus, in my terminology, entangled states can at least in some cases be given concrete interpretations: Some fixed time ago they were given as answers to two focused questions.By the inverse unitary transformation, the entangled state may be transformed back to the state |a; k⟩ ⊗ |b; j⟩ again.Thus, we have then a concrete interpretation of the entangled state: Subject to a suitable Hamiltonian, the state can be interpreted as the answer to two focused questions posed at some past time.

A brief comparison with some other approaches
As mentioned in the Introduction, there are several rather recent investigations with the purpose of deriving the Hilbert space structure from physical assumptions.Some of the resulting models are called generalized probability models.I will briefly discuss some of these approaches.
The article [6] by Hardy is a pioneering one.This article led to several other investigations, summarized in Hardy [65].All these investigations start with a set of postulates, stated slightly differently.
Both in [6] and in [65] postulates are stated in terms of two basic numbers N and K. Here, N is the maximum number of states for a given system for which there exists some measurement, which can identify which state from a set in a single shot, while K is the number of probabilities that are entries in the state vectors that are constructed.Hardy presents a set of postulates that characterize both classical systems and quantum systems, and one postulate which distinguishes the two.It is proved that K = N for classical systems, and K = N 2 for quantum systems.In modern form, Hardy's postulates are stated as P1, P2, P3, P4' and P5 of [65].These lead to either classical mechanics or quantum mechanics, while a variant, P4, singles out quantum mechanics.In fact, it leads to a very general version of quantum mechanics, which is further discussed in [65].
In contrast, my postulates of Section 3 and Section 4 here can be associated with a version of quantum theory where I limit the pure state concept to ket vectors that are eigenvectors of some physically meaningful operator.
Another difference is that Hardy starts his investigations by introducing pure states and measurements, while I start with the notion of theoretical variables and their associated operators.It is an open problem to find a minimal set of postulates which cover all different approaches towards the formalism.It is crucial that my approach also leads to a particular view of the interpretation of quantum mechanics, discussed below.
Among the various articles that are related to the ones by Hardy, I can mention Goyal [9], who relies on the framework of information geometry, and Masanes and Müller [10], who state 5 axioms based on elementary assumptions regarding preparations, transformations, and measurements, Rovelli's book [64], which is thought-provoking and very informative, is more concerned with interpretation than with foundation.I find very much in this book that I appreciate, and Rovelli's interpretation of quantum mechanics is close to mine.

Interpretation of quantum mechanics
Based on the results above, we can now start to discuss the interpretation of quantum theory.The results were based upon accessible theoretical variables θ , which were assumed to be connected to an observer or jointly to a group of communicating observers.From a scientific point of view, these variables are the ones for which questions of the form 'What is θ ?' or 'What will θ be if we measure it?' can be posed.It is tempting here to cite Rovelli [64]: 'I believe that we need to adapt our philosophy to our science, and not our science to our philosophy.'I fully agree with this.The mathematical discussion above belongs to the domain of science; the philosophical discussion of quantum interpretation should come after this.
My theory seems to be more connected to our knowledge of reality rather than reality itself.I will call this a general epistemic interpretation of quantum mechanics.
There exist several interpretations of quantum mechanics, and the discussions between the supporters of the different interpretations are still going on.In recent years, there have been held a broad range of international conferences on the foundation of quantum mechanics.A great number of interpretations have been proposed; some of them look very peculiar to the layman.The many worlds interpretation assumes that there exist millions or billions of parallel worlds and that a new world appears every time one performs a measurement; there is also a related many mind's interpretation.
On two of these conferences recently there was taken an opinion poll among the participants [14,15].It turned out to be an astonishing disagreement on many fundamental and simple questions.One of these questions was: Is quantum mechanics a description of the objective world, or is it only a description of how we obtain knowledge about reality?The first of these descriptions is called ontological, and the second is called epistemic.Up to now, most physicists have supported some version of an ontological or realistic interpretation of quantum mechanics, but variants of the epistemic interpretation have received a fresh impetus in recent years.
I look upon my book 'Epistemic Processes' [1] and also this article as a contribution to this debate.An epistemic process can denote any process to achieve knowledge.It can be a statistical investigation or a quantum mechanical measurement, but it can also be a simpler process.The book starts with an informal interpretation of quantum states, which in the traditional theory has a very abstract definition.In my opinion, a quantum state can under wide circumstances be connected to a focused question and a sharp answer to this question, see above.
A related interpretation is QBism, or quantum Bayesianism, see Fuchs [16,17,18] and von Baeyer [19].(What started as a variant of Bayesianism, has now developed into a somewhat wider QBism.)The predictions of quantum mechanics involve probabilities, and a QBist interprets these as purely subjective probabilities, attached to a concrete observer.Many elements in QBism represent something completely new in relation to classical physical theory, in relation to many people's conception of science in general and to earlier interpretations of quantum mechanics.
QBism has been discussed by several authors.For instance, Hervé Zwirn's views on QBism, which I largely agree with, are given in [20].
By using group theory, group representation theory, and some simple category theory, I aim to study a general situation involving theoretical variables mathematically, and it seems to appear from this that essential elements of the quantum formulation can be derived under weak conditions.This may be of some scientific relevance.Empirically, it has turned out that the the quantum formalism provides a very extensive description of our world as we know it [21], and in physical situations in microcosmos an all-embracing description.
Focus on the case where the accessible variable θ takes a discrete set of values.In the case where θ takes an infinite discrete set of values, we can still prove that Theorem 6 and Theorem 7 hold; the proof goes by taking a limit of cases where θ takes a finite number of values.
The following simple observation should be noted and is in correspondence with the ordinary textbook interpretation of quantum states: Trivially, every vector |v⟩ is the eigenvector of some operators.Assume that there is one such operator A that is physically meaningful, and for which |v⟩ is also a non-degenerate eigenvector, say with a corresponding eigenvalue u.Let λ be a physical variable associated with A = A λ .Then |v⟩ can be interpreted as the question 'What is the value of λ ?' along with the definite answer 'λ = u'.
More generally, accepting operators with non-degenerate eigenspaces (corresponding to observables that are accessible, but not maximally accessible), each eigenspace can be interpreted as a question along with an answer to this question.
Binding together these two paragraphs, we can also think of the case where λ is a vector, such that each component λ i corresponds to an operator A λ i i , and these operators are mutually commuting.Then A λ = i A λ i i has eigenspaces which can be interpreted as a set of questions 'What are the values of λ i i = 1, 2, ...?' together with sharp answers to these questions.In the special case of systems of qubits, Höhn and Wever [35] have recently proved that there is a one-to-one correspondence between sets of questionand-answer pairs and state vectors.
The following is proved in [1,36] under certain general technical conditions, and also specifically in the case of spin/ angular momentum: Given a vector |v⟩ in a Hilbert space H and a number u, there is at most one pair (a, j) such that |a; j⟩ = |v⟩ modulus a phase factor, and |a; j⟩ is an eigenvector of an operator A a with eigenvalue u.
The main interpretation in [1] is motivated as follows: Suppose the existence of such a vector |v⟩ with |v⟩ = |a; j⟩ for some a and j.Then the fact that the state of the system is |v⟩ means that one has focused on a question ('What is the value of λ a ?') and obtained the definite answer (λ a = u.)The question can be associated with the orthonormal basis {|a; j⟩; j = 1, 2, ..., d}, equivalently with a resolution of the identity I = ∑ j |a; j⟩⟨a; j|.The general technical result of [1] is also valid in the case where λ a and u are real-valued vectors.
After this, we are left with the problem of determining the exact conditions under which all vectors |v⟩ ∈ H in the non-degenerate discrete case and all projection operators in the general case can be interpreted as above.This will require a rich index set A determining the index a.This problem will not be considered further here, but this is stated as a general question to the quantum community [36].But from the evidence above, I will in this paper rely on the assumption that each quantum state/ eigenvector space can be associated in a unique way with a question-and-answer pair.Strictly speaking, this requires a new version of quantum mechanics, where we only permit state vectors that are eigenvectors of some physically meaningful operator.
Superposition of quantum states can be introduced in my setting as follows: Take as a point of departure the states |a; j⟩, each such state interpreted in the way that we know that λ a = u a j for a maximally accessible variable λ a .Then consider another maximal variable λ b and a hypothetical possible value u b i for λ b .Since ∑ j |a; j⟩⟨a; j| = I, we have Here the corresponding operators A a and A b do not commute, and this is a fairly general linear combination of states |a; j⟩.Such linear combinations will then be state vectors.The state |b; i⟩ may be a very hypothetical state, not coupled to the observer's concrete knowledge.Then (22) corresponds to a 'do not know' state.This discussion of superposition may also be generalized to the double-slit experiment and to more general experiments involving multiple paths; see for instance Rovelli [64] for such experiments.The inference pattern in the double-slit experiment can be explained by a momentum variable in the plane of the slits orthogonal to the two slits, a momentum which by de Broglie's theory is connected to a wave.What is not known, is the position variable in the same direction, in particular, the answer to the question 'Which slit?'.The answer to a similar question is also unknown in experiments involving multiple paths.
When λ is a continuous scalar or vector variable, we can still interpret the eigenspaces of the operator A λ as questions 'What is the value of λ ?' together with answers in terms of intervals or more generally sets for λ .This is related to the spectral decomposition of A λ , which gives the resolution of the identity (recall (34)) This resolution of the identity is tightly coupled to the question 'What is the value of λ ?', and it implies projections related to indicators of intervals/sets C for λ , that is, yes-no questions of the type 'Does λ belong to C?', as This is of course just simple quantum logic.But it can be related to an interpretation if we can agree on the basic assumption of Convivial Solipsism: Every description of reality should be relative to an observer or a group of communicating observers.All theoretical variables of this article are assumed to have such a relation.Thus yes-no questions associated with such a variable should be related to an observer or a group of observers.
Then (24) can be interpreted as connected to a general epistemic interpretation of quantum states and projection operators.A special case is the QBist interpretation but this interpretation is more general.It can also be seen as a concrete specification of Relational Quantum Mechanics: Accessible variables of a system are seen as relative to other systems, where one of these other systems may be an observer.In the multipleworld interpretation, only variables connected to one world are accessible to a given observer at some fixed time.In the Bohm interpretation, the position of a particle at time t will be accessible, but the path from time t 1 to time t 2 will be inaccessible.
There is a huge literature on interpretations of quantum theory.Some of the proposed interpretations have relationships to my epistemic interpretation, but I will not go into more details with this discussion here.
In general, λ may be seen as a maximal accessible variable associated with the operator A λ .If θ is another maximal accessible variable, it will be associated with another operator A θ , and A λ and A θ will not be commuting.We can then say that λ and θ are complementary variables in the sense of Bohr.More precisely, it is the questions related to these variables that are complementary, each given by an orthonormal full set of ket vectors.Variables/operators corresponding to the same formal question, but having different sharp answers to this question, are equivalent in this respect.They are given by the same orthonormal basis, and the variables are bijective functions of each other.
In a physical context, Niels Bohr's complementarity concept has been thoroughly discussed by Plotnitsky [37].
Here is Plotnitsky's definition of complementarity: (a) a mutual exclusivity of certain phenomena, entities, or conceptions; and yet (b) the possibility of applying each one of them separately at any given point; and (c) the necessity of using all of them at different moments for a comprehensive account of the totality of phenomena that we consider.
This definition points to the physical situation discussed above and has Niels Bohr's interpretation of quantum mechanics as a point of departure.In [1] I have also tried to couple the complementarity concept to macroscopic situations.
Here is one remark concerning QBism, which can be said to represent a special case of my views: Subjective Bayes probabilities have also been in fashion among groups of statisticians.In my opinion, it can be very fruitful to look for analogies between statistical inference theory and quantum mechanics, but then one must look more broadly at statistics and statistical inference theory, not only focusing on subjective Bayesianism.This is only one of several philosophies that can form a basis for statistics as a science.Studying connections between these philosophies is an active research area today.From such discussions, one might infer that another interesting version of Bayesianism is objective Bayesianism, with a prior based on group actions.
Finally, I need to discuss my epistemic interpretation of quantum mechanics in light of the various no-go theorems in the literature.
For the relationship to Bell's theorem, I refer to my recent article [61].It seems possible to avoid the non-locality assumption if we replace it with an assumption that all observers are limited in some specific sense.
My inaccessible variables are more general than what is usually perceived as hidden variables.Nevertheless, the Kochen-Specker theorem may a priori be of some relevance.However, the Kochen-Specker theorem only excludes noncontextual hidden variable theories.I think of my theory of an observer as connected to a fixed physical context at some fixed time.
A greater challenge is the recent Pusey-Barrett-Rudolph (PBR) theorem.This theorem seemingly excludes models where a pure quantum state represents only knowledge about an underlying physical state of the relevant system.A crucial assumption, however, is the existence, for every system, of such an underlying 'real physical state'.This assumption can be questioned.Also, the arguments against an epistemic interpretation given by Colbeck and Rennes [70] rely on certain assumptions.In particular, it assumes a list of elements of reality Λ which satisfies a certain Markov condition.A detailed discussion will not be given here.
It is important, however, that also arguments against a realistic interpretation of quantum states are given in the literature [74,75].

From inaccessible variables to notions
In the previous Sections, all variables were seen as mathematical variables.In this Section I will limit the variable concept to only physical variables, and these physical variables will be accessible.I will limit the discussion to some concrete physical context, where all physical variables in principle are measurable given some choice by the observer A or by a group of communicating observerss.The space Ω φ will then be replaced by some abstract space Ω, and group theory on φ will be replaced by category theory with object Ω, called a space of notions.It is assumed that these notions exist relative to the observer A or to a group of communicating observers.Functions from Ω onto Ω or onto other objects, called morphisms, can be defined.In this setting Postulate 3 can be replaced by: Postulate 3': Related to a given physical context there exists a space of notions Ω such that for each accessible variable η there exists a morphism from Ω onto Ω η ⊂ Ω.
I repeat my definition of a physical context: All variables can in principle be measurable by A. This can be seen to agree for instance with the context-concept used in the Kochen-Specker theorem.In [72] contextuality is discussed in more detail using the ordinary formal apparatus of quantum mechanics.This apparatus will not be assumed here.Of some interest for the current approach are various inequalities discussed in [72], in particular Bell inequalities.My explanation of why a particular inequality, the CHSH inequality, has now turned out to be violated in loophole-free experiments, is given in [34,61].
In the previous Sections, I built the arguments on group theory and group representation theory.The basic concepts were theoretical variables and groups acting upon these variables.These variables were assumed to exist relative to an observer or jointly to some communicating observers.
The very concept of a variable assumes something which varies, but in the previous discussion I did not explicitly assume that all variables can take definite values, at least not known to us; see Section 3. To some, this distinction may be problematic.It is not problematic at all for what I have called accessible physical variables.The variable 'position' can take values x 0 , and the variable 'momentum' can take values p 0 .But according to Heisenberg's inequality, we cannot in a physical context assume joint values (x 0 , p 0 ).
In general, we cannot in a physical context assume maximal inaccessible quantities φ taking physical values.In this Section, I still want to consider an observer or a set of communicating observers, but relative to these, I will in general consider 'notions', not only 'variables'.A notion is anything that a person can have as a basis for his thinking, or that a communicating set of persons can think jointly of.
To make this precise, mathematically, we need partly to go from group theory to category theory.Category theory was founded by Mac Lane [62], and has been used by several physicists [63,65,69] in the foundation of quantum mechanics.Bob Coecke has in several papers discussed quantum mechanics from the point of view of category theory; see also [65].
Group theory is a specialization of the more general category theory, and my task in this Section will be to generalize the discussion of Section 3 correspondingly.In general terms, a group is a category with one object in which every morphism has a (two-sided) inverse under composition.Just the simplest aspects of category theory will be used in this article.
As said, the space of variables Ω φ must be replaced by a space of notions Ω, and transformations in this space must be replaced by labeled arrows, automorphisms, from Ω onto Ω.The groups G a associated with accessible variables θ a can be kept as before, but the group K acting on φ must be replaced by something new.But first, consider the basic postulates in this setting.
Postulate 1': It is assumed that if θ is a notion and there is a morphism defined on Ω θ ⊆ Ω which maps θ onto some λ , then λ is a notion.
Note that everything that is connected to accessible variables goes as before, but the context of the previous Section is now replaced by a physical context, including all variables that are accessible to A before, during and after a measurement.So, in particular, Postulate 2, Definition 1 and Postulate 4 can be taken over from Section 3. Definition 2 must be modified to: Definition 2' Let θ and η be two maximal accessible variables in some physical context, and let v θ and v η be the morphisms from Ω onto Ω θ and Ω η , respectively.If there is an automorphism k in Ω such that k has an inverse and v η = v θ • k, we say that θ and η are related.If no such k can be found, we say that θ and η are non-related.
This can be seen to be an equivalence relation.Again, it follows from the relationship that η is maximal iff θ is maximal.And if G is a group acting on θ , we can define a group H acting on η by h • v η = (g • v θ ) • k.The mapping from g to h is an isomorphism.
Returning to the two physical examples of Section 3, here is the definition of k in these cases: If θ = θ a and η = θ b are the spin components in directions a and b, we can consider the plane spent by a and b, and let k be the 180 • rotation around the midline between the two vectors.If θ is the theoretical position and η is the theoretical momentum for a particle, we can define k by a Fourier transform.
In Subsection 7.2, it is showed how k can be defined in general in the finitedimensional case for two variables both taking n values.In this case, all such pairs of variables are related.
My main basic result is then that Theorem 0 holds also in this setting.More generally, Theorem 4 and Theorem 6 of Section 7 hold in a physical context with the above four postulates.
Note that the basic requirement is only that we have two different maximal accessible (complementary) variables in the given physical setting.This seems to be a weaker requirement than conditions formulated in other approaches towards the reconstruction of the Hilbert space apparatus.In general, each of the maximal accessible may be a vector, whose components are jointly measurable variables.
Versions of the basic Theorem 5 and Theorem 7 of Section 7 are still valid in this context but will require some more work.
Proofs and further discussions are given in Appendix 3. Again, all the results are valid when the single actor A is replaced by a set of communicating observers, and the notions are shared among these observers.
There may be an obvious objection against the present approach: Everything is connected to an observer or jointly to a communicating group of observers.How can this be connected to a theory of the real world?My answer to this is that I base everything on the main thesis of Convivial Solipsism: Every description of the world must be from the point of view of an observer or some communicating group of observers.In particular, every accessible variable that A may be thinking of, has a parallel real value variable connected to any measurement of experiment that A may make.Quantum mechanics is a model related to A, but in this way, it will also be a model of real experiments and measurements.I say more about the interpretation of quantum mechanics in Section 9.

Concluding remarks
The treatment of this paper is not quite complete.Some remaining problems include: -A further development of the case of continuous theoretical variables.
-Giving concrete conditions under which the Born formula is applicable in practice.This is in particular relevant in connection to cognitive modeling.
-Developing an axiomatic basis in the spirit of quantum logic (see for instance [43]).But note the simple postulates of Section 3 above.
-A treatment of open quantum systems.
-A further discussion of the relationship to other approaches and to other interpretations.
-More concrete examples of how this approach can be used to address the conceptual and technical challenges of quantum theory.Some aspects of this are discussed in [47] and in [61].
-A discussion of the implications of this approach for the experimental and technological aspects of quantum mechanics.
Group theory and quantum mechanics are intimately connected, as discussed in detail in several books and papers.In this article, it is shown that the familiar Hilbert space formulation can be derived mathematically from a simple basis of groups acting on theoretical variables.The consequences of this are further discussed in [1].The discussion there also seems to provide a link to statistical inference.
From the viewpoint of purely statistical inference, the accessible variables θ discussed in this paper are parameters.In many statistical applications it is useful to have a group of actions G defined on the parameter space; see for instance the discussion in [44].In the present paper, the basic group G is assumed to be transitive, hence, tentatively, if we have a group on some parameter that is not transitive, the quantization of quantum mechanics can be derived from the following principle: all model reductions in some given model should be to an orbit of the group.
It is of some interest that the same criterion can be used to derive the statistical model corresponding to the partial least squares algorithm in chemometrics [45], and this connection also motivates an important case of the recently proposed more general envelope model [46].
The main message of the present paper: the reconstruction of quantum mechanics from simple assumptions, is connected to an observer or jointly to a group of communicating observers, by just taking accessible variables as a primitive notion.But some postulates like Postulate 3 and Postulate 4 seem to be necessary.Then under weak assumptions, the main condition needed seems to be that there exist two different complementary variables, where the word 'complementary' is taken to mean different maximal accessible theoretical variables.
In this paper, the first axioms of quantum theory are derived from reasonable assumptions.As briefly stated in [1], one can perhaps expect after this that such a relatively simple theoretical basis for quantum theory may facilitate a further discussion regarding its relationship to relativity theory.One can regard physical variables as theoretical variables, inaccessible inside black holes.These ideas are further developed in [47].
Further aspects of the connection between quantum theory and statistical inference theory, which relies heavily on decisions, decisions that in my view can largely be modeled by using quantum decision theory, are under investigation.
Finally, I want to point out that some of my previous published papers contain certain errors and inaccuracies.In [4] and [5] it is erroneously stated that the basic group representation U should be irreducible.This is now corrected in [4].Sloppy formulations in [34] are now cleared up in [61].Most errors are now corrected in the book [66], but the proof there, p. 33, that the variable θ can be written as a function of the group element n, is incorrect.The correct version is in Lemma A2 below.It is a strong hope that all mathematical arguments of the present article are correct.there exists a transformation k such that η(φ ) = θ (kφ ).Let g ∈ G be a transitive group action on θ , and let h ∈ H be the transitive group action on η defined by hη(φ ) = g 1 θ (kφ ) when η(φ ) = θ (kφ ), where g 1 ∈ G 1 , an independent copy of G.This gives a group isomorphism between G and H.
Consider the representation U(•) of the group corresponding to G, acting on some Hilbert space H , with the property that if we fix some vector |v 0 ⟩ ∈ H , then the vectors U(g)|v 0 ⟩ are in one-to-one correspondence with the group elements g ∈ G and hence with the values gθ 0 of θ for some fixed θ 0 .I choose to use the notation |v 0 ⟩ in this proof instead of |θ 0 ⟩, since several sets of coherent states will be considered.
For each element g ∈ G there is an element h = jg j ∈ H and vice versa.Note that j • j = e, the unit element.Let U( j) = J be some unitary operator on H such that J • J = I.Then for the representation U(•) of the group corresponding to G, there is a representation V (•) of the group corresponding to H given by V ( jg j) = JU(g)J.These are acting on the same Hilbert space H with vectors |v⟩, and they are equivalent in the concrete sense that the groups of operators {U(g)} and {V (h)} are isomorphic.
Note that J must satisfy JU( jg j) = U(g)J.By Schur's Lemma, this demands J to be an isomorphism or the zero operator if the representation U(•) is irreducible.In the reducible case a non-trivial operator J exists, however: In such a case there exists at least one proper invariant subrepresentation U 0 acting on some vector space H 0 , a proper subspace of H , and another proper invariant subrepresentation U ′ 0 acting on an orthogonal vector space H ′ 0 .Fix |v 0 ⟩ ∈ H 0 and |v ′ 0 ⟩ ∈ H ′ 0 , and then define J|v 0 ⟩ = |v ′ 0 ⟩, J|v ′ 0 ⟩ = |v 0 ⟩ and if necessary J|v⟩ = |v⟩ for any |v⟩ ∈ H which is orthogonal to |v 0 ⟩ and |v ′ 0 ⟩.Now we can define a representation W (•) of the full group N acting on ψ = (θ , η) in the natural way: W (g) = U(g) for g ∈ G, W (h) = V (h) for h ∈ H, W ( j) = J, and then on products from this.
If U is irreducible, then also V is an irreducible representation of H, and we can define operators A θ corresponding to θ and A η corresponding to η as in (7).If not, we need to show that the representation W of N constructed above is irreducible.
Lemma A1 W (•) as defined above is irreducible.
Proof.Assume that W (•) is reducible, which implies that both U(•) and V (•) are reducible, i.e., can be defined on a proper sub-space H 0 ⊂ H , and that J = W ( j) also can be defined on this sub-space.Let R(•) be the representation U(•) of G restricted to vectors |u⟩ in H orthogonal to H 0 .Fix some vector |u 0 ⟩ in this orthogonal space; then consider the coherent vectors in this space given by R(g)|u 0 ⟩.Note that the vectors orthogonal to H 0 together with the vectors in H 0 span H , and the vectors U(g)|u 0 ⟩ in H are in one-to-one correspondence with θ .Then the vectors R(g)|u 0 ⟩. are in oneto-one correspondence with a subvariable θ 1 .And define the representation S(•) of H by S( jg j) = R(g) and vectors S(h)|v 0 ⟩, where |v 0 ⟩ is a fixed vector of H , orthogonal to H 0 .These are in one-to-one correspondence with a subparameter η 1 of η.
Fix θ 0 ∈ Ω θ .Given a value θ , there is a unique element g θ ∈ G such that θ = g θ θ 0 .(It is assumed that the isotropy group of G is trivial.) From this look at the fixed vector S( jg θ j)|v 0 ⟩.By what has been said above, this corresponds to a unique value η 1 , which is determined by g θ , and hence by θ .But this means that a specification of θ determines the vector (θ , η 1 ), contrary to the assumption that θ is maximal as an accessible variable.Thus W (•) cannot be reducible.
□ Note that it is crucial for this proof that the space H is multi-dimensional.In particular, the proof does not work for the following case: φ = (θ , η), the transformation k defining relationship exchanges θ and η, and G is just the identity.Then H would be one-dimensional.If this was allowed in the proof and in the corresponding definition of reducibility, all maximal accessible variables would by definition be related.This lemma shows that there are group actions n ∈ N acting on ψ = (θ , η) and an irreducible representation W (•) of N on the Hilbert space H . Hence the identity (6) holds if G is replaced by N, and the coherent states by |v n ⟩ = W (n)|v 0 ⟩: where µ is some left-invariant measure on N, and |v 0 ⟩ is some fixed vector in H .
Lemma A2 There is a function f θ of n such that θ = f θ (n), and a function f η of n such that η = f η (n).

Proof.
Consider a transformation n transforming ψ 0 = (θ 0 , η 0 ) into ψ 1 = (θ 1 , η 1 ).There is then a unique g transforming θ 0 into θ 1 , and a unique h transforming η 0 into η 1 .Since the groups G and H are assumed to be transitive and with a trivial isotropy group, the group elements g and h correspond to unique variable elements θ and η.These are then determined by n. □ We are now ready to define operators corresponding to θ and η: A η = f η (n)|v n ⟩⟨v n |µ(dn).
It is clear that these operators are symmetric when θ and η are real-valued variables.Under some technical assumptions [32] they will be self-adjoint/ Hermitian.Also, if θ = 1, then A θ is the identity.
In addition, if s is any transformation in N, and W (•) is the representation of N used in the above proof, we have, following the proof of Theorem 3 of Subsection 7.2 and using the left-invariance of µ: W (s −1 )A θ W (s) = f θ (sn)|v n ⟩⟨v n |µ(dn), Consider an application of this: Recall that θ = θ (φ ), where φ varies over some space Ω φ , and φ is inaccessible.Let K be some group of transformations of Ω φ .Assume that θ (•) is permissible with respect to K, and also that η(•) is permissible.Let T (•) be a unitary representation of K such that the coherent states T (t)|v 0 ⟩ are in one-to-one correspondence with t.Then for t ∈ K the operator T (t) † A θ T (t) is the operator corresponding to θ ′ (φ ) = θ (tφ ).
Proof Since θ is permissible, θ (tφ ) = g(t)θ (φ ) for some transformation g(t) of Ω θ .Recall that for g ∈ G, the basic group acting on Ω θ , it is assumed that the states U(g)|v 0 ⟩ are in one-to-one correspondence with g.Comparing with the properties of T (•), we must then have g(t) ∈ G, and T (t) = U(g(t)) = W (s(t)) for an induced transformation s(t) in N.
The detailed arguments for this are as follows: η(•) is also assumed to be permissible.Hence there is a h(t) such that η(tφ ) = h(t)η(φ ), so t induces a transformation s = s(t) on ψ = (θ , η).Since (θ , η) is in one-to-one correspondence with (g, h), the transformation s also acts in a unique way on (g, h).By the permissibility of θ (•), g(t) must belong to G. Since T (t) is in one-to-one correspondence with t, and U(g) is in one-to-one correspondence with g, we must have T (t) = U(g(t)).By the same argument T (t) = V (h(t)).Looking at the way N and W (•) are constructed in the above proof, we must have that s(t) ∈ N, and T (t) = W (s(t)).
For these transformations and an arbitrary g ∈ G, we can define θ ′ (g) = g(t)θ (g), and get θ ′ (g) = f θ (s(t)n g ), where n g is the transformation in N which is induced by g as above.Taking s = s(t) and W (s) = W (s(t)) in (28) completes the proof.□ This also completes the proof of Theorem 5 in Section 7. The last statement there follows from the fact that in this case j = j(k) acts on ψ = (θ , η) and induces a transformation s(k) on the group N. Take s = s(k) and S(k) = W (s(k)) in (28).Theorem 7 follows as in the main text.
We also have a converse of the last part of Theorem 6.This can be used to make precise the main result of [34,61].
Lemma A3 Consider two maximal accessible theoretical variables θ and η.If there is a transformation s of Ω ψ such that A η = W (s) † A θ W (s), then θ and η are related.
Proof By (28) we have this way, operators associated with equivalent maximal variables may be defined.This concludes the proof of Theorem 4.
The case with a discrete spectrum is discussed in the main text.In this case, we have where {u j } are the eigenvalues and {P j } the projections upon the eigenspaces of A θ .The equations ( 34) and ( 35) can be written in a similar way.Important special cases of (35) include f (θ ) = I(θ ∈ B) for sets B, or f can be a linear combination of such indicator variables.As in [1], Subsection 5.3, this can be used to find operators of continuous variables by just using the theory associated with finite-valued variables.This makes the approach towards a theory in Subsection 7.2 especially important.
A further important case is connected to statistical inference theory in the way it is advocated in [1].Assume that there are data X and a statistical model for these data of the form P(X ∈ C|θ ) for sets C. Then a positive operator-valued measure (POVM) on the data space can be defined by The density of M at a point x is called the likelihood effect in [1] and is the basis for the focused likelihood principle formulated there.
In [1] the probability measure π was assumed to have one out of three possible interpretations: 1) as a Bayesian prior, 2) as a Bayesian posterior or 3) as a frequentist confidence distribution (see [33]).
The proof of Theorem 5/ Theorem 7 will require some more work.One seemingly needs to replace the representation theory for groups with a more general theory.This can be done by considering the multiplicative group M of square matrices over complex numbers, such that a space K of automorphisms Ω → Ω are mapped into M, and these mappings are homomorphisms.Such a set of homomorphism will be called a representation T of K.If the matrices in the domain of T are unitary, we say that the representation is unitary.
However, such a general theory will not be assumed below.I will limit the discussion to the case where K is a group acting on the elements u ∈ Ω. Recall first Definition 2' from Section 10.
What we do need here, is a generalization of the concept of permissibility.
Definition 4 Consider a space K of automorphisms Ω → Ω.An accessible variable θ is said to be permissible with respect to K if there exists a morphism v from Ω onto Ω θ with the following properties: 1) For any automorphism t of Ω and any k ∈ K, the identity v Lemma 3 If K is a group and θ is permissible with respect to K under the morphism v, then G = v • K is a group.
Proof.Check that under permissibility every g ∈ G is an automorphism on Ω θ with a two-sided inverse.□ Theorem 5' Assume that the related accessible variables θ and η are permissible with respect to a group K of automorphisms in Ω, under the morphisms v θ and v η .Assume that K is transitive and has a trivial isotropy group.Let T be a unitary representation of K such that the coherent states T (t)|ψ 0 ⟩ are in one-to-one correspondence with t ∈ K.For any transformation t ∈ K and any such representation T the operator T (t) † A θ T (t) is the operator of θ ′ , the image from Ω under the morphism v θ • t.
In addition, since θ and η are related, there is a unitary operator W such that A η = W † A θ W .
Proof.For the first part use a modification of the proof of Theorem 5 in Appendix 1. First, by permissibility and Lemma 3, we have that for given t ∈ K there exist group elements g(t) = v θ • t and h(t) = v η • t such that θ (t • u) = g(t)θ (u) and η(t • u) = h(t)η(u) for any u ∈ Ω.This induces a transformation s = s(t) on ψ = (θ , η).The rest of the proof goes as before.
For the second part, use (28) and the fact that the righthand side of ( 28) is equal to (27).(see the proof of Lemma A3.) □ For the finite-dimensional case, we can define k and K acting on ψ = (θ , η) as in Subsection 7.2.Theorem 7 seems then to be true also in this setting as it is formulated in that Subsection with the modifications that Postulate 1' to Postulate 4 of Section 4 are assumed, and θ ′ (t) is defined as v θ (t • u) for u ∈ Ω such that θ = v θ (u).

Figure 1 :
Figure 1: The construction of the transformation k.

Figure 2 :
Figure 2: The construction of the group K, acting on grids on two copies of the sphere.