Abstract
Koopman and von Neumann (KvN) extended the Liouville equation by introducing a phase space function \(S^{(K)}(q,p,t)\) whose physical meaning is unknown. We show that a different S(q, p, t), with well-defined physical meaning, may be introduced without destroying the attractive “quantum-like” mathematical features of the KvN theory. This new S(q, p, t) is the classical action expressed in phase space coordinates. It defines a mapping between observables and operators which preserves the Lie bracket structure. The new evolution equation reduces to Schrödinger’s equation if functions on phase space are reduced to functions on configuration space. This new kind of “quantization” does not only establish a correspondence between observables and operators, but provides in addition a derivation of quantum operators and evolution equations from corresponding classical entities. It is performed by replacing \(\frac{\partial }{\partial p}\) by 0 and p by \(\frac{\hbar }{\imath } \frac{\partial }{\partial q}\), thus providing an explanation for the common quantization rules.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
There are two fundamentally different opinions as regards the interpretation of the formalism of quantum theory (QT). These two opinions may be associated with the names of Albert Einstein and Niels Bohr. The first opinion, favored by Einstein (and in fact also by the early Bohr), says that QT is a theory about statistical ensembles. This means, the predictions of QT must be tested experimentally by performing a large number of experiments on identically prepared individual systems (statistical ensembles). The second opinion, favored by Bohr, says that QT makes predictions about individual systems. In this note, we use as a starting point a theory which can be associated with the first line of reasoning, namely the quantum-like version of the probabilistic description of classical particles due to Koopman and von Neumann [1, 2].
A probabilistic ensemble in 2n-dimensional phase space, with probability density \(\rho (q,p,t)\), is described by the Liouville equation. To make this equation look more “quantum-like” we multiply it with a factor \(\frac{\hbar }{\imath }\) and write it in the form
The linear operator \(\hat{D}_{H}\) is the Lie derivative defined by H. In terms of the usual notation \(\left\{ A,B\right\} \), for the Poisson bracket of two observables \(A,\,B\), it may also be written in the form \(\hat{D}_{H}=\left\{ H,\cdot \right\} \). The extraordinary simple structure of the Liouville equation leads to a enormous manifold of solutions: Given two solutions g, h their product gh, as well as their sum, is again a solution; given a solution \(\rho \), an arbitrary function of \(\rho \) is again a solution. As a consequence (1) may be rewritten in the form
where \(\psi \) is a complex-valued variable, whose amplitude and phase (or real and imaginary parts) both obey the original Eq. (1). The new complex variable may be written in many different ways as a function of \(\rho \); a most useful choice is
Introducing a new dynamic variable \(S^{(K)}(q,p,t)\) makes the classical theory more similar, at least formally, to QT. Continuing along this lines, we define an inner product
It is now easy to see that the operator \(\hat{K}\) is self-adjoint with regard to this inner product. The relations (2)–(5) represent the basic equations of the Koopman–von Neumann (KvN) theory [1, 2], as developed by Gozzi, Mauro, Pagani [3,4,5] and others.
2 Discussion of the Koopman–von Neumann theory
According to its definition the differential equation (3) is equivalent to two decoupled Liouville equations for \(\rho \) (or \(\sqrt{\rho }\)) and \(S^{(K)}\). The physical (probabilistic) predictions of the KvN equation are, therefore, exactly the same as the predictions of the original Liouville equation, as shown explicitly in Ref. [3]. The results obtained for \(S^{(K)}\) are irrelevant (This is, in fact, a consistency requirement because we do not know what \(S^{(K)}\) means). The advantage of (3) is that it offers a variety of mathematical methods, know from QT, for the classical domain; several examples have been reported in the literature [3,4,5].
Let us next ask if the KvN theory is the only possibility to extend the Liouville equation in this purely technical sense (we exclude for the moment questions as for the physical meaning of S). Any “quantum-like”, but still fundamentally classical, extension of (1) must obey the following conditions
-
Linearity of the evolution equation for \(\psi \),
-
Self-adjointness of the relevant differential operator,
-
Decoupled differential equations for \(\sqrt{\rho }\) and S.
The last condition could be called the “classicality condition”. It means that the laws of probability theory do not interfere with the deterministic laws describing the system. This is, simple as it is, a most fundamental characterization of classical probabilistic physics.
Looking for the simplest modifications of (3), one finds that addition of an arbitrary phase space function, say \(-\bar{L}_{H}\), to \(\hat{K}\) is compatible with all three requirements. Thus, we perform the replacement
Here, the arbitrary (and still unknown) function \(\bar{L}(q,p,t)\) has been labeled by an index H, taking the possibility into account that \(\hat{L}_H\) is a generalized Lie derivative with respect to H. The basic Eq. (3) is replaced by
We can easily see that all three conditions are fulfilled. Equation (7) is obviously linear. A short calculation shows that \(\hat{L}_H\) is self-adjoint. Finally, inserting (4) (with \(S^{(K)}\) replaced by S) in (7) and separating real- and imaginary parts we obtain the two decoupled equations
Equations (8) and (9) differ from the corresponding result of the KvN equation (3) by the inhomogeneous term \(\bar{L}_{H}\) in Eq. (9). If \(\sqrt{\rho }\) and S are, however, combined according to (4), the resulting Eq. (7) has no inhomogeneous term anymore and a “quantum-like” unrestricted superposition principle holds true.
Thus, there is an infinite number of classical “quantum-like” versions of the Liouville equation, each one characterized by a particular phase space function \(\bar{L}_{H}\). These theories are all equivalent, as regards their physical content. On the other hand they differ with regard to the details of the “quantum-like” mathematics they offer, because the self-adjoint operators \(\hat{L}_H\) are different. The KvN theory, obtained by setting \(\bar{L}_{H}=0\), is just one of these versions.
3 A new extension of Liouville’s equation
This variety of theories leads of course to the question which \(\bar{L}_{H}\) is “the best”. Are the \(\bar{L}_{H}\) really all equivalent in a physical sense ? This is certainly true, as long as we stay strictly within the realm of classical physics. It is, however, not true if we widen our horizon and take the relation between classical physics and QT into account. A proper choice of \(\bar{L}_{H}\) could possibly close the gap between classical physics and QT or help, at least, to elucidate the still rather mysterious relation between these two fields.
Clearly, a proper choice of \(\bar{L}_{H}\) has to ascribe, first of all, a clear physical meaning to the variable S. Let us consider the canonical equations for a Hamiltonian H(q, p), not depending on time,
The solutions of (10) for initial values \(q_{s},\,p_{s}\) at time \(t_{s}\) are written (omitting indices and summation signs; i.e., \(qp=\sum _{i=1}^{n}q_{i}p_{i}\), etc) as
These relations define, for each value of t, a mapping of phase space onto itself, from initial values \(q_{s},\,p_{s}\) at \(t_{s}\) to the values \(q,\,p\) at some later time t. This mapping is also referred to as the flow defined by H.
As it is well-known, the evolution in time, described by the mapping (11) may be interpreted as a canonical transformation [6, 7]. As a consequence a generating function \(S_{H}(q_{s},p_{s},t)\) must exist. As shown in advanced text books [6, 7], it takes the form
The integrand has the form of a Lagrangian \(L_{H}\), associated with H, and \(S_{H}\) is the corresponding classical action. The latter is written here as a function of initial values. Generally, the action (as a functional evaluated at the stationary points) depends on \(2n+2\) variables, namely 2n integration constants and two boundary points. In the most common representation the integration constants are identified with the coordinates of the initial and final positions, \(q_{s}\) and q, in the present representation they are identified with the initial positions and momenta \(q_{s},\,p_{s}\).
The physical meaning of Eq. (12) is that of an integrability condition for the canonical equations (10). Thus, relation (12), defining \(S_{H}\), must be considered as an integral part of the present theory; this makes it a natural candidate for our field S. Let us investigate if indeed an associated phase space function \(S_{H}(q,p,t)\) exists, which fulfills Eq. (9). As the solutions (11) represent a canonical transformation, they may be inverted to give the initial values \(q_{s},\,p_{s}\) as a function of \(q,\,p,\,t\),
Let us consider an arbitrary function \(G_{s}\) which depends, like our \(S_{H}\), on the initial values \(q_{s},\,p_{s}\) and also explicitly on time t. Using (13) we obtain for each \(G_{s}(q_{s},\,p_{s},\,t)\) an associated phase space function \(G(q,\,p,\,t)\), which is defined by
The function G is, in principle, determined by the form of \(G_{s}\) and by the solutions of the equations of motion. Alternatively, a partial differential equation for G may be derived [8]. It may be found using the fact that the initial values are “constants of motion”. Calculating the derivative of (14) with respect to t, and replacing \(\dot{q}\) and \(\dot{p}\) by phase space functions according to (10), we obtain the differential equation
If G is identified with \(S_{H}\), this equation takes indeed the form of (9), where \(\bar{L}_{H}\) is given by the Lagrangian,
Note that the first of the canonical equations (10) has been used to replace \(\dot{q}\) by the derivative of H with respect to p. We arrived at a new extension of Liouville’s equation which takes the form (7), with a differential operator \(\hat{L}_H\) defined by (6) and (17), and a “classical wave function”
with a well-defined physical meaning; its squared modulus is the classical probability density \(\rho (q,p,t)\) and its phase (we omit the index H) is the classical action S(q, p, t). As mentioned already this is an entirely classical equation; the dependence on \(\hbar \) is spurious.
4 Projection of the new equation to configuration space
As noted already, all possible extensions of Liouville’s equation, following the rule (6), are equivalent in a purely classical sense. The question is, if the new equation is able to contribute to our understanding of the intricate relation between classical physics and QT. If the definition of \(\hat{L}_H\) is inserted, Eq. (7) takes the form
An obvious but very important difference between the classical wave function, denoted here for distinction by \(\psi ^{(c)}\), and the true quantum wave function, denoted by \(\psi \), is the number of independent variables; \(\psi ^{(c)}\) depends, neglecting time for the moment, on 2n variables (typically the standard phase space variables \(q,\,p\) used in this paper) while \(\psi \) depends on n variables (typically the configuration space variables q). A quantum state is a point in Hilbert space, or a function defined on configuration space. There is general agreement with regard to this statement. On the other hand, there is no agreement as regards the question what to consider as classical counterpart of a quantum state. It is widely believed, following Bohr, that the classical counterpart of a quantum state is a point in phase space, describing the “position” of a single mechanical system. Here, we assume, following Einstein, that the classical counterpart of a quantum state is a classical probabilistic ensemble, described by a probability density \(\rho (q,p)\), or a point \(\psi ^{(c)}\) in an associated classical Hilbert space.
Considered from this point of view, which will be further confirmed elsewhere, the transition to QT is basically a kind of projection from phase space to configuration space, with an associated reduction in the number of degrees of freedom. This process may be performed in several ways. The way of proceeding reported in this paper is not the most complete one, but seems attractive because of its extraordinary simplicity and its ability to explain the quantization rules in an intuitive way. The classical law in phase space (19) must be projected to a quantum law in configuration space, with states \(\psi \) depending only on q, and a differential operator containing only derivatives with respect to q. This can obviously be done by implementing the rules
in the differential operator acting on \(\psi ^{(c)}\). The left member of (20) does not (explicitly) belong to the standard quantization rules; it simply expresses the fact that the new states do not depend on the generalized coordinates p. The right member of (20) is the standard quantization rule, expressing the strange fact that a variable must be replaced by an operator. Implementing these rules in Eq. (19) creates the famous differential equation
found by Schrödinger. Of course, the parameter \(\hbar \) can now, as a consequence of the modification of the original equation, no longer be eliminated, but becomes instead a new fundamental constant. This derivation shows that the version (19) of the Liouville equation may be interpreted as classical counterpart of the Schrödinger equation. The common quantization rule, replacing p by a derivative with respect to q, can be understood as part of a projection from phase space to configuration space.
Note that, in contrast to the common treatment, the present “quantization” starts from a classical evolution equation and not from a classical Hamiltonian function. The process of quantization creates in a single step both the new evolution equation and the new Hamiltonian operator.
We should note at this point that the form of \(\bar{L}_{H}(q,p)\) given in (17) is not the most general one. An infinite number of admissible \(\bar{L}_{H}(q,p)\) may be found by adding to (17) a total derivative of an arbitrary function C(q, p); this kind of gauge freedom in the Lagrangian is actually well-known. We restricted ourselves in the present paper to the choice \(C(q,p)=0\) associated with the most common (configuration space) representation of the Schrödinger equation. Other choices lead to the Schrödinger equation in momentum space and to the “phase-space Schrödinger equation” derived by Torres-Vega and Frederick [9], as will be shown in more detail in later work.
5 The correspondence between observables and operators
The above development [see Eqs. (6), (17)] shows that the operator \(\hat{L}_H\) is, in the course of the projection from phase space to configuration space, reduced to the Hamiltonian operator \(\hat{H}\),
The operators \(\hat{L}_H\) and \(\hat{H}\) are completely determined by the analytical form of the classical phase space function H(q, p); we neglect ordering problems, which actually do not play any role for the most important observables. The above rules for the assignment of phase space operators to classical observables and subsequent reduction to configuration space operators, may therefore, be applied to arbitrary observables A(q, p):
The two processes displayed in Eq. (23) will be abbreviated by the statement \(A \rightharpoondown \hat{A}\). Considered from the point of view of probability theory, the fact that observables become operators represents a breakdown of the standard concept of random variables.
From a formal point of view, the above quantization process consists of two consecutive steps. In a first step, abbreviated here as \(\Rightarrow \), phase space operators are defined. In a second step, abbreviated as \(\searrow \), configuration (or momentum) space operators are created. After completion of this work, the author learned that both of these steps occur also in a mathematical discipline called geometric (de)quantization [10,11,12] and are denoted there as “prequantization” and “polarization”. Unfortunately, the physical content of most of these works is not easy to access due to their highly abstract and sometimes speculative character. On the other hand, it is to be expected, that this high-level mathematical work contains many important results. More popular accounts (such as section II D of [11]) would be highly desirable.
5.1 Several observables
It is clear that commutativity under ordinary multiplication of two functions \(A,\,B\) is, in general, not preserved under these transitions: if \(A \rightharpoondown \hat{A}\), \(B \rightharpoondown \hat{B}\), then, in general, \(\hat{A}\hat{B} \ne \hat{B}\hat{A}\).
The fact that the commutativity of classical observables is not recovered in the corresponding operator structure is no surprise in the present approach: A state in QT has a whole ensemble of trajectories as its classical counterpart, and the notion of classical observables, which is closely related to the idea that all properties of a system are determined by a single point in phase space, becomes meaningless. We cannot expect that structural properties of classical observables are mapped in a one-to-one manner to corresponding properties of operators. The impossibility to achieve this, by any reordering of terms, is the content of Groenewold’s theorem [13].
A “quantum-like” non-commuting structure may, in fact, already be found at the classical level, if ensembles of trajectories are taken into consideration. Each observable A creates its own flow with the help of canonical equations like (10). A second observable B is invariant under the flow created by A, if
In this case, the inverse, namely the invariance of A under the flow created by B, is also true. Two Lie derivatives performed in consecutive order do, in general, not commute. It can, however, be shown (see, for example [6]) that
holds true, where the bracket is an abbreviation for the commutator of \(\hat{D}_A\) and \(\hat{D}_B\) (the latter is defined, for general operators \(\hat{U},\,\hat{V}\), by \([\hat{U},\hat{V}]=\hat{U}\hat{V}-\hat{V}\hat{U}\)). We see here a certain analogy between Poisson brackets and commutators. This leads to the question if a mapping from Poisson brackets to quantum commutators does exist. More precisely, does a mapping \(A \rightharpoondown \hat{O}_A\) from phase space observables A to quantum operators \(\hat{O}_A\) exist, such that \(A \rightharpoondown \hat{O}_A\), \(B \rightharpoondown \hat{O}_B\), and \(\{A,B\} \rightharpoondown \hat{O}_{\{A,B\}}\) implies \([\hat{O}_A,\hat{O}_B]=\hat{O}_{\{A,B\}}\) ? A further consistency requirement is the condition \(\hat{O}_1=\hat{1}\). If these conditions are fulfilled, we will say that the Lie bracket structure is preserved.
It is seen at once, that the Lie bracket structure is not preserved under the rule \(A \Rightarrow \hat{D}_A\), because the condition \(\hat{D}_1 = \hat{1}\) does not hold. Also, the correct position operator of QT is not obtained if the projection to configuration space is performed. Thus, the assignment used in the KvN theory, does not preserve the Lie bracket structure.
5.2 The assignment \(A \Rightarrow \hat{L}_A\)
Let us now study the new assignment \(A \Rightarrow \hat{L}_A\), defined by (23). We will first ask if the Lie bracket structure is preserved in the transition from observables to phase space operators. In a second step, the transition to quantum operators, will be considered. The extended operator \(\hat{L}_A = -\frac{\hbar }{\imath } \hat{D}_A -\bar{L}_{A}\) fulfills obviously the condition \(\hat{L}_1=\hat{1}\). Furthermore, \(\hat{L}_A\) obeys the condition
which shows that the Lie bracket structure is preserved, as far as the transition to phase space operators is concerned. To prove (26) it is convenient to write \(\hat{L}_A\) tentatively in the form
where M[A] is a still unknown phase space function determined by A. Inserting this Ansatz into Eqs. (26) and using (25) leads to the condition
for M[A]. It can now be shown by straightforward calculation that \(M[A]=-\bar{L}_{A}\) is a solution of (28). This proves that the differential operator \(\hat{L}_H\), occurring in our new Eq. (19), has been constructed by means of a “reasonable”, Lie-bracket-preserving assignment \(A \Rightarrow \hat{L}_A\). The present result for \(\hat{L}_A\) has been derived before, in the framework of a mathematical process called “prequantization”. It has been shown [14] that \(\hat{L}_A\) as given by (23) is the most general solution of the problem defined by Eqs. (26), (27) and the condition \(\hat{L}_1=\hat{1}\). This result confirms our construction of the classical Schrödinger equation (19).
To perform, in a second step, the transition to QT, Eq. (26) must be projected to configuration space, using the rules (20). This projection will certainly not preserve the Lie bracket structure for all observables. Considering for simplicity a single particle (the case \(n=3\)), the most important observables are position \(q_{k}\), momentum \(p_{k}\), angular momentum \(l_{k}=\epsilon _{kil}q_{i}p_{l}\), and kinetic energy \(t=\frac{p_{j}p_{j}}{2m}\).
The well-known quantum operators \(\hat{q}_{k},\,\hat{p}_{k},\,\hat{l}_{k},\,\hat{t}\) associated with these observables are obtained at once from Eq. (23) (no ordering problem here). Actually, these operators—or their higher-dimensional counterparts—are the only ones which play a role in real-world applications of QT (neglecting here spin, which will be dealt with in forthcoming work). The reason is probably that the corresponding observables describe fundamental properties of space, or may be conserved quantities as a consequence of corresponding symmetries. This defines a context for these operators, within which measurements may be performed in a meaningful way, even if the classical meaning of observables has been lost in QT.
Calculating the Poisson brackets of \(q_{k},\,p_{k},\,l_{k} ,\,t\) one obtains
To transform this Poisson bracket structure to QT we must first assign phase space operators \(\hat{L}_{q_{k}},\,\hat{L}_{p_{k}},\,\hat{L}_{l_{k}} ,\,\hat{L}_t\) to \(q_{k},\,p_{k},\,l_{k} ,\,t\), and then, in a second step, project the corresponding commutator relations (26) to configuration space. Omitting the details of this calculation, the result is given by
This result, obtained here from classical theory, may of course be verified with the help of the explicit definitions of the operators \(\hat{q}_{k},\,\hat{p}_{k},\,\hat{l}_{k},\,\hat{t}\).
The above commutator structure may also be obtained by “translation” from the classical formulas, by means of the following translation rule: If \(A \rightharpoondown \hat{A},\,B \rightharpoondown \hat{B},\,C \rightharpoondown \hat{C}\) and \(C=\{A,B\}\), then
This structural similarity represents one of the strongest links between classical physics and QT. Despite its success, nobody knows where this rule comes from. The present derivation sheds some light on this question: Apparently, quantum operators are related to infinitesimal generators of (canonical) transformations, which describe the flow created by classical observables in phase space. The quantum commutator relations are a vestige of the corresponding phase space relations (26), which itself describe, in a way not yet understood in detail, the commutativity properties of the flow. The flow describes not an individual system but a statistical ensemble.
References
Koopman, B.O.: Hamiltonian systems and transformations in Hilbert space. Proc. Natl. Acad. Sci. USA 17, 315 (1931)
von Neumann, J.: Zur Operatorenmethode in der Klassischen Mechanik. Ann. Math. 33, 587 (1932)
Mauro, D.: On Koopman–von Neumann waves. Int. J. Mod. Phys. A 17, 1301 (2002)
Gozzi, E., Mauro, D.: On Koopman–von Neumann waves II. Int. J. Mod. Phys. A 19, 1475 (2004)
Gozzi, E., Pagani, C.: Universal local symmetries and non-superposition in classical mechanics. Phys. Rev. Lett. 105, 15064 (2010)
Fasano, A., Marmi, S.: Analytical Mechanics—An Introduction. Oxford University Press, New York (2006)
Sudarshan, E.C.G., Mukunda, N.: Classsical Dynamics: A Modern Perspective. Wiley, New York (1974)
Caratheodory, C.: Calculus of Variations and Partial Differential Equations of the First Order, Part I. Holden-Day, Inc, San Francisco (1965)
Torres-Vega, G., Frederick, J.H.: Quantum mechanics in phase space: new approaches to the correspondence principle. J. Chem. Phys. 93, 8862 (1990)
Woodhouse, N.: Geometric Quantization, 2nd edn. Clarendon Press, Oxford (1991)
Aldaya, V., Jaramillo, J.L., Guerrero, J.: Gauge principle revisited: towards a unification of space-time and internal gauge interactions. J. Math. Phys. 44(11), 5166 (2003)
Abrikosov Jr., A.A., Gozzi, E., Mauro, D.: Geometric dequantization. Ann. Phys. 317, 24 (2005)
Groenewold, H.J.: On the principles of elementary quantum mechanics. Physica 12, 405 (1946)
Wollenberg, L.S.: Derivations of the Lie algebra of polynomials under Poisson bracket. Proc. Am. Math. Soc. 20, 315 (1969)
Acknowledgements
Open access funding provided by Johannes Kepler University Linz.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Klein, U. From Koopman–von Neumann theory to quantum theory. Quantum Stud.: Math. Found. 5, 219–227 (2018). https://doi.org/10.1007/s40509-017-0113-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40509-017-0113-2
Keywords
- Quantum theory
- Liouville equation
- Koopman–von Neumann theory
- Derivation of Schrödinger equation
- Quantum–classical relation