Abstract
In order to unravel the origin of the nucleon spin, one has to study in detail the question of orbital angular momentum, and in particular the reference point about which it is defined. With this in mind, we review the concept of relativistic center of mass, generalize the discussion to the case of asymmetric energy-momentum tensors, and establish the link with the light-front formalism. We find that the p-wave in the Dirac plane-wave solutions arises from a relativistic quantum-mechanical effect which forces the canonical reference point to depend on the observer. This explains why longitudinal spin is much simpler to study than transverse spin. It is also the reason behind the observation of induced shifts and distortions in the parton distributions defined within the light-front formalism.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
One usually expects to recover ordinary quantum mechanics from quantum field theory (QFT) by considering the non-relativistic limit. While position is considered to be an observable in quantum mechanics, it is demoted to a mere parameter in QFT. How can one then recover a position operator in the non-relativistic limit?
The absence of a position operator in QFT is usually explained by the fact that a particle with mass m can at best be localized over distances of the order of the associated reduced Compton wavelength \({\bar{\lambda }}_C=\hbar /mc\). If one tries to localize a particle over shorter distances, energies larger than \(mc^2\) have to be involved, with the risk of creating additional particles in the system. Consequently, the concept of position for a single particle has to be abandoned in QFT.
The energy-momentum tensor (EMT) of a system can however always be used to define the concept of center of inertia.Footnote 1 For a non-relativistic particle, it should coincide with the actual position of the particle. In QFT, the center of inertia may be considered as the closest we can get to a definition of position observable. Systems with non-zero spin are however delicate, since in that case it seems impossible to find a Lorentz covariant definition for the center of inertia leading to commuting coordinates [1].
In QFT, the questions involving position are often simply ignored or avoided. Contrary to the electroweak sector of the standard model, the elementary constituents of quantum chromodynamics never show up in the spectrum of asymptotic states. In other words, experiments can only detect bound states of quarks and gluons. Understanding the origin of the nucleon spin becomes therefore a natural fundamental question to investigate. In particular, since orbital angular momentum is defined relative to a reference point, the problem of nucleon localization cannot be avoided. Despite intense efforts invested in the decomposition of the nucleon spin [2], we feel that this problem has so far been insufficiently treated in the literature.
The present work aims at filling this gap. We start with a reminder of the definition of various contributions to the Poincaré generators in Sect. 2. Then we review in Sects. 3 and 4 the concept of center of inertia assuming as usual that the EMT is symmetric. We generalize in Sect. 5 this discussion to the more general case of systems with intrinsic angular momentum naturally characterized by an asymmetric EMT, and establish in Sect. 7 the connection with the light-front formalism widely used to investigate the internal structure of the nucleon. To keep the presentation simple, we work at the level of classical fields in flat spacetime, but the quantum version proceeds analogously,Footnote 2 as explained in detail in [1, 3,4,5]. We will also use for convenience the natural units \(\hbar =c=1\).
2 Poincaré generators
The Noether currents associated with the Poincaré invariance of a relativistic theory consist in \(T^{\mu \nu }(x)\) for spacetime translations, and \(M^{\mu \alpha \beta }(x)=-M^{\mu \beta \alpha }(x)\) for Lorentz transformations. At the classical level, they are mere functionals of the classical fields depending on the space-time coordinates \(x^\mu \), and are understood to be evaluated for a solution of the classical field equations.
As a consequence of Noether’s theorem, these currents are conserved
and transform as rank-2 and 3 Lorentz tensors, respectively. The 10 generators of Poincaré transformations are obtained by integrating the current densities over a spacelike hypersurface
These Noether charges are interpreted as the total four-momentum and generalized angular momentum (AM) of the system. Assuming as usual that surface terms at spatial infinity do not contribute, one can show that these generators are time-independent and transform as rank-1 and 2 Lorentz tensors, respectively.
2.1 External and internal parts
The generalized AM \(J^{\alpha \beta }\) is defined relative to the origin O of the coordinate system. Like in Newtonian mechanics, one can also consider the generalized AM defined relative to another reference point (or pivot). Denoting by \(X^\mu (x^0)\) the function describing the worldline of a generic reference point parametrized by the time coordinate \(x^0\) in the Lorentz frame \(\mathcal S\), the generalized AM
can be decomposed into an external part which depends explicitly on the coordinates of the reference point
and an internal part
which represents the generalized AM about the new reference point. When the reference point coincides with the origin, one naturally gets \(J^{\alpha \beta }=S^{\alpha \beta }_O\). While we keep here X totally general, we will see later that particular choices for the reference point beside the origin, like e.g. the center of inertia, play a particular role.
Setting \(\beta =0\) in Eq. (5) allows us to express the coordinates of the reference point asFootnote 3
where we used the relation \(X^0(x^0)=x^0\) resulting from our choice of parametrization for the reference worldline in terms of the time coordinate \(x^0\).
When we consider the time derivative of this expression, we obtainFootnote 4
where we denoted the derivative of a function of a single variable by a dot. This shows that whenever the velocity of the reference point and the momentum are not parallel \(\dot{X}^\mu (x^0)\not \propto P^\mu \), there is a transfer of generalized AM between external and internal parts \(\dot{S}^{\mu 0}_X(x^0)=-\dot{L}^{\mu 0}_X(x^0)\ne 0\).
If one changes the reference point from \(X^\mu (x^0)\) to \(\widetilde{X}^\mu (x^0)=X^\mu (x^0)+Q^\mu (x^0)\), the corresponding generalized internal AM are related as follows
and the shift \(Q^\mu (x^0)\) can be expressed as
We naturally have \(Q^0(x^0)=0\) since \(X^\mu (x^0)\) and \(\widetilde{X}^\mu (x^0)\) represent two simultaneous positions in \(\mathcal S\).
2.2 Boost and rotation generators
Just like the 6 independent components of the electromagnetic tensor \(F^{\mu \nu }(x)=-F^{\nu \mu }(x)\) can be expressed in some Lorentz frame \(\mathcal S\) in terms of an electric vector \(F^{0i}(x)=-E^i(x)\) and a magnetic vector \(F^{ij}(x)=-\epsilon ^{ijk}B^k(x)\), the 6 independent components of the generalized AM tensor can be expressed in terms of two spatial vectors
which generate boosts and rotations of the system, respectively. Under boosts, the two vectors \(\varvec{K}\) and \(\varvec{J}\) get mixed with each other, indicating that their definition depends on the observer.
A more covariant definition of boost and rotation generators requires the introduction of an object that specifies the observer independently of the frame. Choosing the spatial axes to be parallel in all inertial frames, any observer can be identified by a timelike unit four-vectorFootnote 5 \(u^\mu \) (taken to be constant in flat spacetime) which represents its four-velocity relative to some Lorentz frame \(\mathcal S\). The generalized AM tensor can then be written as
where we used the convention \(\epsilon _{0123}=+1\) and defined the four-vectorsFootnote 6
These two four-vectors are orthogonal to \(u^\mu \) and are hence spacelike. In the observer’s frame \(\mathcal S_u\), the four-velocity reduces to its canonical form \(u^\mu =(1,\varvec{0})\) and we get
This shows that the four-vectors \(K^\mu _u\) and \(J^\mu _u\) represent in a covariant way the boost and rotation generators as defined by an observer with four-velocity \(u^\mu \) relative to the Lorentz frame \(\mathcal S\).
Under a change of observer, the covariant boost and rotation generators get mixed with each other
In particular, we find in the \(\mathcal S_u\) frame
Choosing to parametrize the reference worldline \(X^\mu (\tau _u)\) in terms of the Lorentz-invariant expression for the observer’s time coordinate \(\tau _u=X^\mu (\tau _u)u_\mu \), the covariant boost generators
can be further decomposed into external and internal parts
and we can express the coordinates of the reference point as
Similarly, for the covariant rotation generators we have
with
2.3 Center-of-mass frame
Among all the Lorentz frames, the center-of-mass (CM) frame \(\mathcal S_\star \) defined as the frame where the system is at rest, is special because its four-velocity relative to some Lorentz frame \(\mathcal S\) is expressed directly in terms of the translation generators \(u^\mu _\star =P^\mu /M\) with \(M=\sqrt{P^2}\). Physical quantities defined in \(\mathcal S_\star \) are proper to the system in the sense that they can be expressed in a covariant way using only the Poincaré generators. For example, the Lorentz invariant quantity \(M=P\cdot u_\star \overset{\mathcal S_\star }{=}P^0\) coincides in the CM frame with the energy or inertia of the system. In other words, M represents the proper inertia of the system, explaining why it is usually called the (invariant) mass of the system in the literature.
Using \(u^\mu _\star =P^\mu /M\) in Eq. (23) shows that the proper AM is purely internal \(J^\mu _\star =S^\mu _\star \), and hence does not depend on the choice of the reference point. It is called the spin of the system, and is proportional to the Pauli–Lubański pseudo-vector \(W^\mu =M S^\mu _\star \) defined as
Moreover, by a suitable choice of the reference point, we can make the proper boost generators purely external. This reference point, called the center of mass of the system, is readily obtained from Eq. (21) by setting Eq. (20) to zero
where \(\tau =R^\mu _\star P_\mu /M\) is the proper time of the system. The time-dependent term cancels out in the expression for the external part of the generalized AM (4), and so the center of mass defines a time-independent decomposition of the generalized AM tensor into external and internal parts. In particular, the proper boost generators \(K^\mu _\star =-MR^\mu _\star (0)\) can be interpreted as giving (up to a sign) the mass dipole moment of the system at the initial proper time \(\tau =0\).
2.4 Composite systems
Composite systems may consist of several types of constituents. For example, hadrons are bound states made of quarks and gluons. In this case, one can naturally split the total currents into partial currents associated with the different constituent types
Unlike the total currents, the partial ones are in general not conserved, and so the corresponding charges usually depend on time
The partial generalized AM can naturally be decomposed into external and internal parts
where
These satisfy
and can be interpreted as the contributions from constituent type a to the external and internal parts of the generalized total AM.
The partial generalized AM can also be written in terms of partial covariant generators
where
satisfy
These can then be interpreted as the contribution from constituent type a to the covariant generators of boosts and rotations.
3 Center of inertia
A natural choice for the reference point is the center of inertia, which is the relativistic generalization of the familiar concept of center of mass appearing in Newtonian Mechanics. Since the inertial mass is given by the energy in a relativistic theory, the center of inertia is determined by the EMT.
The energy dipole moment of the system in some Lorentz frame \(\mathcal S\) is given by
The center of inertia is defined as the point where the inertia of the whole system \(P^0=\int \mathrm {d}^3x\,T^{00}(x)\) can be concentrated without changing the energy dipole moment. Its position is then given by [1, 3, 6, 7]
In particular, one finds that \(R^0(x^0)=x^0\) which means that the component \(R^0\) represents the time \(x^0\) at which the center of inertia is determined.
In general, the center of inertia is not at rest in \(\mathcal S\). Assuming as usual that surface terms at spatial infinity do not contribute, the velocity of the center of inertia can be expressed as
where \(\mathcal J^\mu (x^0)=\int \mathrm {d}^3x\,T^{\mu 0}(x)\) is the energy current. Unlike the four-momentum \(P^\mu =\int \mathrm {d}^3x\, T^{0\mu }(x)\), the energy current can in general be time-dependent. This means that the center of inertia does not necessarily move along a straight line with constant velocity.
3.1 Generalized orbital angular momentum
From the EMT, one can also define an orbital (or convective) tensor
whose four-divergence is given by
with \(T^{[\alpha \beta ]}=T^{\alpha \beta }-T^{\beta \alpha }\). The corresponding charge
will be called the generalized orbital angular momentum (OAM) since its purely spatial components represent the total OAM of the system
For the other components, we find
so that the position of the center of inertia can be expressed as
and the velocity as
The same expression can be obtained from Eq. (38) by writing the energy current as \(\mathcal J^\mu (x^0)=P^\mu +\int \mathrm {d}^3x\,T^{[\mu 0]}(x)\) and using Eq. (40). Note that unlike (38), the expression for the velocity in Eq. (45) does not rely on the assumption that surface terms vanish.
3.2 Alternative definition
Let \(X^\mu (x^0)\) denote the position of some reference point at the time \(x^0\). Similarly to Eq. (5), the internal generalized OAM can be defined as
In particular, \(\ell ^{\mu 0}_X\) represents the energy dipole moment about the reference point
and \(\ell ^{\mu 0}_X/P^0\) gives the shift of the center of inertia relative to the reference point
This means that the center of inertia can alternatively be defined by the condition
Indeed, by construction the center of inertia is the reference point about which the energy dipole moment vanishes. In other words, it is the point for which orbital boost generators are purely external, as already expressed by Eq. (43). In practice we will omit the label R whenever the center of inertia is chosen as reference point.
3.3 Centroids
Is is clear from the definition (37) that the position of the center of inertia does not transform as a Lorentz four-vector. This is also clearly illustrated by Møller’s famous example [3, 6], see Fig. 1. Consider an homogeneous sphere rotating about some axis in the CM frame \(\mathcal S_\star \). By symmetry, the center of inertia necessarily lies on the rotation axis. Consider now another frame \(\mathcal S\) moving with constant velocity in a direction orthogonal to the rotation axis. From the perspective of \(\mathcal S\), symmetrical points with respect to the rotation axis do not move with the same speed anymore, and hence are attributed different inertias. As a result, the center of inertia determined by \(\mathcal S\) is shifted in a direction orthogonal to both the rotation axis and the relative velocity between \(\mathcal S_\star \) and \(\mathcal S\). This means that the actual center of inertia, i.e. the representative point and not only its coordinates, depends on the observer. It cannot be in general identified with a physical point inside the body.
A covariant definition of the center of inertia can be obtained if one keeps track of the (inertial) observer with respect to which it is defined. The covariant energy dipole moment defined by some observer with four-velocity \(u^\mu \) relative to \(\mathcal S\) is given by
where \(\mathrm {d}_u^3x\equiv \mathrm {d}^4x\,\delta (x\cdot u-\tau _u)\) and \(\tau _u\) are the Lorentz-invariant expressions for the volume measure and time coordinate defined by the observer. Since by construction \(D^\mu _u(\tau _u)\) is a Lorentz four-vector, it can be used to define the coordinates of a physical point, referred to as centroid in the following
where \(P\cdot u=\int \mathrm {d}_u^3x\,T^{\alpha \beta }(x)u_\alpha u_\beta \) represents the Lorentz-invariant inertia defined by the observer. In particular, one has \(R^\mu _u(\tau _u) u_\mu =\tau _u\) which indicates that the centroid worldline is parametrized by the observer’s time coordinate. In a general Lorentz frame \(\mathcal S\), the centroid determined by Eq. (51) and the center of inertia determined by Eq. (37) are usually different. They coincide however in the observer’s frame \(\mathcal S_u\) where the four-velocity reduces to \(u^\mu =(1,\varvec{0})\). This shows that the centroid represents in a covariant way the center of inertia defined by some observer. It is in this sense the Lorentz-invariant extension [2, 8] of the concept of center of inertia.
Similarly, the covariant generalized OAM tensor is defined as
By contraction with the four-velocity, we find that
The covariant internal OAM defined by the centroid reads
and provides an alternative definition of the centroid
4 Symmetric energy-momentum tensor
In relativistic mechanics, one often assumes that the generalized total AM current has a pure orbital form [9]
Two important consequences can then be derived from the conservation of this tensor:
-
(i)
The EMT must be symmetricFootnote 7
$$\begin{aligned} 0=\partial _\mu M^{\mu \alpha \beta }(x)=T^{[\alpha \beta ]}(x). \end{aligned}$$(57)The energy current coincides therefore with the four-momentum \(\mathcal J^\mu (x^0)=P^\mu \) and transforms as a time-independent Lorentz four-vector.
-
(ii)
The generalized OAM is a time-independent Lorentz tensor
$$\begin{aligned} M^{\alpha \beta }_\text {orb}(x^0)=M^{\alpha \beta }_{\text {orb},u}(\tau _u)=J^{\alpha \beta } \end{aligned}$$(58)identified with the generators of Lorentz transformations.
In the following, we discuss in some detail the implications for the center of inertia.
4.1 Relativistic center-of-mass theorem
As one can see from Eq. (44), the conservation of the generalized OAM tensor forces the center of inertia to move along a straight line with constant velocity
In particular, \(Y^\mu =R^\mu (0)\) represents the position of the center of inertia at the initial time. While the generators of time translations \(H=P^0\), spatial translations \(\varvec{P}\) and rotations \(\varvec{J}\) give the energy, linear and angular momentum of the system, the generators of boosts \(\varvec{K}\) give the initial energy dipole moment of the system
and hence determine the initial position of the center of inertia.
Differentiating Eq. (59) with respect to time shows that
which is nothing but the relativistic version of the CM theorem. Moreover, one obtains the relativistic version of König’s first theorem
when the generalized AM is decomposed into external and internal contributions.
Actually, the conservation of the generalized OAM tensor forces all the centroids to move along a straight line
In order to determine the position of the centroid at some fixed time in \(\mathcal S\), we need to parametrize the corresponding worldline in terms of the time coordinate \(x^0\) instead of the observer’s time coordinate \(\tau _u\). Since by definition \(x^0=R^0_u(\tau _u)\), we find
which shows that the two time coordinates are related by a time dilation factor \(\gamma _u=P^0/(P\cdot u)\) and a constant term \(Y^0_u=J^{0\mu }u_\nu /(P\cdot u)=-\gamma _u(Y\cdot u)\) accounting for the relativity of simultaneity. Eliminating \(\tau _u\) in Eq. (63) in favor of \(x^0\) leads to
Since the centroid worldline is parametrized in terms of the time coordinate \(x^0\), it is not surprising that manifest Lorentz covariance is lost. This equation indicates two things: (i) the dependence of \(X^\mu _u\) on the four-velocity \(u^\mu \) shows explicitly that the definition of a centroid is observer-dependent, and (ii) all the simultaneous centroids move along parallel straight lines with the same constant velocity.
Because of the special role played by the CM frame \(\mathcal S_\star \), let us consider the corresponding centroid. Using \(u^\mu _\star =P^\mu /M\) in Eq. (63), we find that the position in \(\mathcal S\) of the proper centroid is given by
This expression coincides with Eq. (26) since \(Y^\mu _\star =-K^\mu _\star /M\), showing that the CM is nothing but the proper centroid.
4.2 Supplementary spin condition
The inertial motion of the centroids (65) implies that the corresponding external and internal parts of the generalized AM tensor are separately time-independent
When the generalized AM tensor is purely orbital, the internal part reads \(S^{\alpha \beta }_u=\ell ^{\alpha \beta }_u\). Choosing the centroid as the reference point leads then to the equation
which is known in the literature of classical spinning bodies under the name of supplementary spin condition (SSC), see e.g. [10] and references therein. In particular, choosing the CM as the reference point corresponds to imposing the Tulczyjew-Dixon SSC \(S^{\alpha \beta }_\star P_\beta =0\) [10,11,12].
One can think of a SSC as the requirement that covariant boost generators do not contribute to the internal part of the generalized AM. Accordingly, we can write
owing to Eq. (11). In other words, the tensor \(S^{\alpha \beta }_u\) represents the internal AM of the system defined by an observer with four-velocity \(u^\mu \) relative to \(\mathcal S\). Working out explicitly the expression for \(S^{\alpha \beta }_u\) in Eq. (67) with the centroid given by Eq. (63), we find that it is related to the Pauli–Lubański pseudo-vector as follows
The internal AM defined by an observer with four-velocity \(u^\mu \) is then given by
For \(u^\mu _\star =P^\mu /M\), we naturally recover \(J^\mu _\star =S^\mu _\star =W^\mu /M\).
4.3 Transverse shifts
Using Eqs. (9) and (21), one can see that the two conserved quantities \(L^{\mu 0}_u/P^0\) and \(S^{\mu 0}_u/P^0\) respectively represent the initial position of the centroid and the shift of the center of inertia relative to it
Combined with Eq. (70) for the internal AM, we get
which shows that the shift in \(\mathcal S\) is spatial and transverse to both \(\varvec{W}\) and \(\varvec{u}\).
More generally, one can consider the shiftFootnote 8 between any two centroids at any fixed time \(x^0\)
This shift can also be written as
which shows that \(Q^\mu _{uv}\) is spacelike and orthogonal to both the Pauli–Lubański pseudo-vector \(W^\mu \) and the relative velocity four-vector \(V^\mu \equiv \gamma _v v^\mu -\gamma _u u^\mu \).
4.4 Møller’s disk
Instead of looking at the position of the centroids at some fixed time in \(\mathcal S\), we can look at their position at some fixed proper time \(\tau \). A contraction of Eq. (63) with \(u^\mu _\star \) allows us to relate the parameter \(\tau _u\) with the proper time \(\tau \)
leading to
This clearly shows that all the centroids have the same four-velocity \(u^\mu _\star \).
For the proper shift of a centroid relative to the CM \(\widetilde{Q}^\mu _u\equiv {\widetilde{X}}^\mu _u(\tau )-R^\mu _\star (\tau )\), we find that it is given by
In other words, the CM is a center of inertia in all Lorentz frames only when the spin of the system vanishes \(J^\mu _\star =S^\mu _\star =0\). When the latter is nonzero, the proper shift is orthogonal to \(P^\mu \), \(u^\mu \) and \(S^\mu _\star \), and therefore is the same in both the observer’s frame \(\mathcal S_u\) and the CM frame \(\mathcal S_\star \). In the CM frame, our expression (78) reduces to [3, 6]
where \(\varvec{v}=\varvec{u}/u^0\) is the velocity of the observer relative to \(\mathcal S_\star \), and the spin is given by \(S^\mu _\star \overset{\mathcal S_\star }{=}(0,\varvec{S}_0)\). By varying the relative velocity \(\varvec{v}\), we obtain the position of all the centroids relative to the CM. They form the so-called Møller disk [3, 6], which is orthogonal to the spin of the system \(\varvec{S}_0\), has radius
and whose geometric center coincides with the CM. The disk is at rest in \(\mathcal S_\star \) and moves as a rigid body with constant velocity in any other frame \(\mathcal S\). Note that Møller’s disk is an open set since \(|\varvec{v}|<1\) for massive observers.
Møller’s disk plays an important role for extended bodies because it provides a lower bound on the dimensions of a classical system. If one assumes that (i) the energy density in the convex spherical hull of the system is positive in any frame and (ii) the AM is purely orbital, then Møller’s disk lies fully inside the convex spherical hull [3, 6, 10]. In other words, a classical system with spin \(|\varvec{S}_0|\) and mass M necessarily has a typical radial dimension r larger than Møller’s radius
This is a purely relativistic effect. Indeed, reinstating the factor of 1 / c in the above expression shows that the lower bound vanishes in the non-relativistic limit \(c\rightarrow \infty \). The appearance of Møller’s radius in a relativistic theory can also be understood somewhat intuitively. If we imagine that the mass M of an extended body can be concentrated at one of its physical points P, the spin of the system will then be given by the rotation of P about the axis parallel to \(\varvec{S}_0\) passing through the CM. Denoting the distance between P and the rotation axis by r and the angular velocity by \(\omega \), one finds that \(|\varvec{S}_0|=M\omega r^2\). Relativity imposes that the velocity of P cannot exceed the speed of light \(\omega r\le c\). It then follows that r must be larger than Møller’s radius.
5 Asymmetric energy-momentum tensor
In the literature, it is often claimed that the fundamental EMT must be symmetric. This claim is essentially based on two arguments: (i) in General Relativity the gravitational EMT of matter, defined as the source of the gravitational field, appears to be symmetric, and (ii) conservation of the orbital tensor forces the EMT to be symmetric [9]. These arguments are however by no means actual proofs [15].
General Relativity does not require the fundamental EMT to be symmetric, but simply indicates that only a symmetric part of it couples to gravitation. Note also that the symmetry of this gravitational part follows from the assumption that the torsion of spacetime vanishes. When the latter condition is relaxed, the gravitational EMT turns out in general to be asymmetric [16]. Regarding the second argument, the assumption that the orbital tensor is conserved comes from Classical Mechanics, where there exists only one form of AM. As soon as one introduces the intrinsic AM of Quantum Mechanics, which is a new form of AM, there is no reason to maintain this assumption. In conclusion, we do not see any fundamental reason for requiring the EMT to be symmetric.
In field theory, the generic Lorentz transformation law of a multicomponent relativistic field reads in the active viewpoint
where \(M[\varLambda ]\) is a matrix acting on the field components. It then follows from the canonical formalism that the associated conserved current naturally receives two contributions
Beside the orbital (or extrinsic) tensor associated with the transformation of the point, there is an intrinsic tensor \(M^{\mu \alpha \beta }_\text {int}=-M^{\mu \beta \alpha }_\text {int}\) associated with the mixing of the field components.
From the conservation of \(M^{\mu \alpha \beta }(x)\), one concludes that the asymmetry of the EMT is related to the non-conservation of the intrinsic tensor [16,17,18,19]
Focusing on the spatial currents \(\alpha \beta =ij\), this means that orbital and intrinsic AM are not separately conserved in general. They can be converted into each other owing to spin-orbit couplings, leading to an asymmetric EMT. This phenomenon is illustrated by e.g. the Einstein-de Haas effect [20, 21] routinely used to measure the gyromagnetic ratio of atoms and molecules [22].
Because of the non-conservation of the orbital tensor \(\partial _\mu M^{\mu \alpha \beta }_\text {orb}(x)\ne 0\), the results of Sect. 4 should not be expected to hold in general anymore. In particular, the centroids are not forced to move along straight lines with constant velocity, so that there can be an exchange of generalized AM between the external and internal parts. This is annoying since it jeopardizes the usefulness of the concept of center of inertia in field theory, and hence the clear connection with classical mechanics.
5.1 Belinfante–Rosenfeld procedure
There exists a freedom in the definition of the EMT and the generalized AM tensor. Starting from some couple \(T^{\mu \nu }(x),M^{\mu \alpha \beta }(x)\) a whole family of alternative couples can be defined by
where the superpotential is required to satisfy a symmetry property \(G^{[\lambda \mu ]\nu }(x)=0\), so that the new tensors remain conserved and lead to the same Poincaré generators as the original ones, provided that surface terms vanish at spatial infinity
The effect of the superpotential consists therefore in a mere relocalization of the linear and angular momentum distributions [16]. In particular, we observe that the superpotential operates a transfer between orbital and intrinsic tensors
and therefore shifts the position of the centroids by
Belinfante and Rosenfeld [23,24,25] noticed that the new intrinsic part can be set to zero using the particular choice
Since the Belinfante–Rosenfeld generalized AM tensor is by construction purely orbital
we fall back to the case studied in Sect. 4. In particular, it follows from the conservation of the Belinfante–Rosenfeld generalized AM tensor \(\partial _\mu M^{\mu \alpha \beta }_\text {Bel}(x)=0\) that the Belinfante–Rosenfeld EMT is symmetric \(T^{[\alpha \beta ]}_\text {Bel}(x)=0\), and the Belinfante–Rosenfeld centroids move along straight lines with constant velocity as given by Eq. (63)
Because of its symmetry, the Belinfante–Rosenfeld EMT is often considered in the literature to be more fundamental than the canonical one. We disagree with this point of view for several reasons. The first reason is that, as argued above, there does not exist any fundamental reason which requires the EMT to be symmetric. Second, the relocalization of linear and angular momentum distributions in the Belinfante–Rosenfeld procedure is totally ad hoc and does not follow from a canonical approach. The last reason is also the most important one. The Belinfante–Rosenfeld procedure is compatible with General Relativity, because the assumed covariance under diffeomorphisms implies that only the charges (i.e. integrated current densities) can be considered as physical. In other words, the relocalization of the linear and angular momentum distributions is considered to be unphysical and hence harmless. Orbital and intrinsic forms of AM seem therefore indistinguishable in the context of general relativity. This is at odds with quantum mechanics, where orbital and intrinsic forms of AM are fundamentally different and distinguishable, as confirmed by numerous experiments. Linear and angular momentum distributions cannot be relocalized at will. In our view, the Belinfante–Rosenfeld procedure corresponds actually to an effective description of the system, where the intrinsic contribution is mimicked by a modification of the distribution of energy and momentum. For further discussions about canonical and Belinfante–Rosenfeld tensors, see [2].
5.2 Intrinsic energy dipole moment
It follows from Eq. (83) that the generalized AM tensor can be covariantly decomposed as
where the separate orbital and intrinsic contributions are in general time-dependent. The Belinfante–Rosenfeld centroids are therefore shifted from the original ones
by a term depending on the intrinsic partFootnote 9
Accordingly, the quantity
can be interpreted as an intrinsic covariant energy dipole moment. Like spin, it is an intrinsic property of elementary particles. Its time dependence indicates that, just like AM, some energy dipole moment can be exchanged between orbital and intrinsic parts. It is because of this exchange that the relativistic CM theorem does not hold in general. Since the covariant centroid velocity is given by
this amounts to say that the covariant energy current \(\mathcal J^\mu _u(\tau _u)\equiv \int \mathrm {d}^3_ux\,T^{\mu \nu }(x)u_\nu \) is in general time-dependent.
5.3 Intrinsic spin conjecture
Instead of following the Belinfante–Rosenfeld procedure and demanding that the new intrinsic tensor vanishes \(M^{\mu \alpha \beta }_{G,\text {int}}(x)\overset{!}{=}0\), we observe from Eq. (96) that we can impose a weaker condition to recover the validity of the relativistic CM theorem. We simply require that all the intrinsic energy dipole moments must vanish
Combined with the antisymmetry in the last two indices \(M^{\mu [\alpha \beta ]}_\text {int}=0\), this amounts to requiring the intrinsic tensor to be totally antisymmetric, and hence expressible in terms of a sole pseudo-vector \(\mathcal A_\lambda (x)\equiv \frac{1}{3!}\,\epsilon _{\lambda \mu \alpha \beta }\,M^{\mu \alpha \beta }_\text {int}(x)\) as follows
For example, the pseudo-vector is related to the axial-vector bilinear \(\mathcal A^\mu _{1/2}(x)=\frac{1}{2}\,{\overline{\psi }}(x)\gamma ^\mu \gamma _5\psi (x)\) in Dirac theory, and to the Chern–Simons current \(\mathcal A^\mu _1(x)=-\frac{1}{3}\,\epsilon ^{\mu \nu \alpha \beta }A_\nu (x)F_{\alpha \beta }(x)\) in Maxwell theory.Footnote 10 We will refer to the condition in Eq. (101) as the intrinsic spin conjecture (ISC), since it enforces the intrinsic part to contribute to the rotation generators only
with \(s^\mu _u(\tau _u)=\int \mathrm {d}^3_ux\,\mathcal A^\mu (x)\). In other words, elementary particles are characterized by some intrinsic AM but no intrinsic energy dipole momentFootnote 11 [32].
Differentiating Eq. (84) and using the antisymmetry of the intrinsic tensor in its first two indices implies that the EMT satisfies another conservation law
beside the usual one \(\partial _\mu T^{\mu \nu }(x)=0\). Clearly, this condition is fulfilled when the EMT is symmetric, but it is not a necessary requirement. It follows from this new conservation law that the covariant energy current is time-independent \(\dot{\mathcal J}^\mu _u(\tau _u)=0\), confirming that the relativistic CM theorem holds, see Eq. (99).
The validity of the CM theorem is sufficient to recover the results obtained in Sects. 4.2–4.4. The only difference is that now the internal part can be further decomposed as follows
where the orbital and intrinsic contributions are given by
For later convenience, we left open the possibility to use a reference point \(X^\mu (\tau _u)\) different from the centroid \(R^\mu _u(\tau _u)\). Because of the ISC we have the relation \(s^{\alpha \beta }_u(\tau _u)u_\beta =0\), and so the shift of the centroid relative to the reference point is simply given by
Moreover, if \(X^\mu _v(\tau _u)\) is the centroid defined by some observer with four-velocity \(v^\mu \) relative to the Lorentz frame \(\mathcal S\), we can write
since the SSC \(S^{\alpha \beta }_vv_\beta =0\) (68) is satisfied by definition of the centroid.
5.4 Elementary particles
A free elementary particle has by definition no internal structure and is usually pictured as a pointlike object. Where is this point located? It is often thought that the answer is either the center of inertia or the reference point, but the correct one is the CM, because it is the only physical point (i.e. with components transforming as a Lorentz four-vector) unambiguously defined by the system. In the CM frame, there is no ambiguity since the particle is at rest and the CM coincides with the center of inertia. Moreover, the AM is purely internal and hence independent of the reference point. In this frame, we therefore expect that the EMT takes the form \(T^{\mu \nu }(x)\overset{\mathcal S_\star }{=}\eta ^{0\mu }\eta ^{0\nu }\,\delta ^{(4)}(x-R_\star (\tau ))\), where \(\eta _{\mu \nu }=\text {diag}(+1,-1,-1,-1)\) is the Minkowski metric, leading to \(\ell ^{\alpha \beta }_\star (x^0)\overset{\mathcal S_\star }{=}0\) and hence \(J^\mu _\star \overset{\mathcal S_\star }{=}s^\mu _\star (x^0)\).
The situation is more complicated for a moving particle. Indeed, the canonical description of a moving state is obtained from a rotationless boost applied to a state at rest [33]. Since the definition of rotationless boosts depends on the observer, one may expect that the reference point naturally associated with this moving state (hereafter called canonical reference point) will be observer-dependent.Footnote 12 The naive guess would be that the canonical reference point \(\mathcal R^\mu (x^0)\) does coincide with the center of inertia \(R^\mu (x^0)\). A careful calculation shows that it lies in fact somewhere in between the center of inertia and the CM, see Appendix B. As a result, the internal AM will receive in general both orbital and intrinsic contributions \(S^\mu _{\mathcal R}=\ell ^\mu _{\mathcal R}(x^0)+s^\mu (x^0)\), because the canonical reference point appears to be shifted sideways relative to the CM when polarization is not aligned with momentum. This simple phenomenon explains why the longitudinal spin decomposition is frame-independent whereas the transverse spin decomposition is not. It can be illustrated with the example of a Dirac particle.
A free elementary spin-\(\tfrac{1}{2}\) particle with mass m, momentum \(\varvec{p}\), energy \(E=\sqrt{\varvec{p}^2+m^2}\), and canonical polarization \(s=\uparrow ,\downarrow \) can be described by the positive-energy Dirac spinor
In the standard representation with spin quantization axis along the z-direction, the momentum-space spinor reads
It containsFootnote 13 both an s-wave (upper two components) and a p-wave (lower two components). The AM sum rule reads
where \(p_{R,L}=p^1\pm ip^2\). The first term represents the intrinsic contribution and the second term represents the OAM contribution defined relative to the canonical reference point, which is set at the origin of the coordinate system so that the external AM vanishes. In the rest frame of the particle \(\varvec{p}=\varvec{0}\), the p-wave vanishes and the AM is purely intrinsic. When the particle is moving \(\varvec{p}\ne \varvec{0}\), the p-wave sets in and provides the orbital contribution to the AM. As the momentum increases, the contribution from the p-wave becomes more and more important, and ultimately reaches the same weight as the s-wave in the ultrarelativistic limit \(|\varvec{p}|\rightarrow \infty \). As long as the momentum remains parallel to spin, the AM will be purely intrinsic because the orbital contribution of the p-wave vanishes (this corresponds to \(l=1\) but \(l_z=0\)). As soon as the momentum develops a component orthogonal to spin, the intrinsic contribution decreases and is compensated by an increase of the orbital contribution. In the limit of infinitely large transverse momentum, the intrinsic contribution vanishes (or better averages out between s and p-waves) so that the AM becomes purely orbital. In this ultrarelativistic limit, the value \(\tfrac{1}{2}\) for the AM arises from the average of an s-wave and a p-wave with equal weight. This generalizes straightforwardly to spin-j particles with maximal polarization orthogonal to momentum, where the AM in the ultrarelativistic regime arises from the average of 2j partial waves with equal weight \(\frac{1}{2j_\text {max}+1}\sum \nolimits _{l_\text {max}=0}^{2j_\text {max}}l_\text {max}=j_\text {max}\), with \(l_\text {max}\) and \(j_\text {max}\) denoting the maximal projections of orbital and total AM along the quantization axis, respectively.
Why does the canonical reference point depend in general on the observer? The reason is simply because we are considering eigenstates with a fixed (quantized) eigenvalue of the internal AM.
We have seen that \(J^{\alpha \beta }\) behaves as an antisymmetric rank-2 Lorentz tensor. It then follows that, like for the components of the electromagnetic field, the value of \(J^z\) will depend on the frame, see Eq. (17). There is no way to preserve in general the value of \(J^z\) under a canonical boost. Unlike \(J^{\alpha \beta }\), the external and internal parts need not transform individually as antisymmetric rank-2 Lorentz tensors. They will behave as Lorentz tensors only if the coordinates of the reference point \(X^\mu \) transform as a Lorentz four-vector, i.e. only when the reference point is a physical point that does not depend on the observer. If we choose a reference point whose components do not transform as a Lorentz four-vector, i.e. whose definition depends on observer like e.g. the center of inertia, there is a possibility to preserve the value of the internal AM.
The canonical reference point \(\mathcal R^\mu \), derived in Appendix B, is precisely the one for which the internal AM \(S^z_{\mathcal R}\) stays constant under a canonical Lorentz transformation from the rest frame. In the rest frame, the canonical reference point coincides with the CM. When the momentum of the system is increased by a canonical boost, the canonical reference point shifts away from the CM and generate an additional OAM contribution which ensures that the internal AM \(S^z_{\mathcal R}\) remains the same in any frame. In other words, the appearance of a p-wave in the Dirac plane-wave solutions can be understood as a relativistic quantum-mechanical effect.
Usually, one identifies for convenience the origin O of the coordinate system with the canonical reference point \(\mathcal R\) to get rid of the external AM contribution represented by the first term on the right-hand side of Eq. (198). This amounts to identifying the total AM with the internal part \(J^z=S^z_O=S^z_{\mathcal R}\). In the above discussion, it was essential to distinguish total AM from the internal part, since otherwise one would have concluded that the origin gets shifted under Lorentz boosts!
6 Recap of the generalized angular momentum decomposition
As we have seen, the generalized AM can be decomposed in several ways. In order to clarify the global picture and the terminology, we pause for a moment and summarize the general structure of this decomposition.
-
(1)
By choosing an observer, the generalized AM can be decomposed into boost and angular momentum contributions.
-
(2)
By choosing a reference point (or pivot), boosts and AM can further be decomposed into external (or translation-dependent) and internal (or translation-independent) contributions.
-
(3)
Boosts are purely orbital (or extrinsic), whereas AM receives both orbital and intrinsic contributions.
-
(4)
When the system consists of several types of constituents, all the above contributions can further be decomposed according to the constituent types.
We also summarize the terminology
-
(1)
The spin of a system is the AM defined by an observer sitting in the CM frame. It is purely internal and decomposes into orbital and intrinsic contributions when seen from a generic Lorentz frame.
-
(2)
The center of mass is the particular reference point for which boosts defined by an observer sitting in the CM frame become purely external. It coincides with the center of inertia (or energy) in the CM frame.
-
(3)
The canonical reference point is the pivot about which the internal AM remains constant under canonical boosts.
7 Light-front formalism
Dirac showed that there exist several forms in which relativistic dynamics can be formulated [36]. So far, we have considered the so-called instant form, which is naturally associated with massive observers. In the context of high-energy physics, where mass effects can often be neglected, it is the so-called front form that appears to be more useful [37].
The covariant formulation used in this paper allows us to easily transpose the results obtained in instant form to the light-front (LF) formalism. In the covariant formulation of instant form, an observer is characterized by a timelike unit four-vector \(u^\mu \) representing its four-velocity relative to the Lorentz frame \(\mathcal S\). In the covariant formulation of front form we obtain the same expressions, but this time an observer will be characterized by a lightlike four-vector \(n^\mu \) [38]. In some sense, using the front form of dynamics amounts to adopting the point of view of a massless observer, formalizing therefore Einstein’s thought experiment of riding a photon. Note that instant and front forms coincide in the infinite-momentum frame since
7.1 Light-front components
For later convenience, we introduce the dual lightlike four-vector \({\bar{n}}^\mu \) satisfying by definition \({\bar{n}}\cdot n=1\). The two lightlike four-vectors can then be used to perform the Sudakov decomposition of any four-vector
where the LF components are defined as \(a^+\equiv a\cdot n\) and \(a^-\equiv a\cdot {\bar{n}}\), and the transverse components by \(a^\mu _\perp \equiv \eta ^{\mu \nu }_\perp a_\nu \) with the transverse projector \(\eta ^{\mu \nu }_\perp =\eta ^{\mu \nu }-n^\mu {\bar{n}}^\nu -\bar{n}^\mu n^\nu \).
The scalar product of two four-vectors can be written as
In the particular case of four-position and four-momentum, we find \(x\cdot P=x^+ P^-+x^-P^++x_\perp \cdot P_\perp \). If we choose \(x^+\) to the represent the LF time coordinate, then the LF energy component is represented by \(P^-\), whereas the LF (longitudinal) components of position and momentum are represented by \(x^-\) and \(P^+\), respectively. Note that for a massive system, we have \(P^\pm >0\).
In the LF formalism, the generalized AM tensor can be written as
where the covariant LF boost and rotation generators are defined by
In particular, we have \(K^+_n=0\) and \(J^-_n=0\).
The LF operators can naturally be expressed in terms of the standard ones in instant form. We can write in general
with \(\varvec{n}\) a unit vector representing in the Lorentz frame \(\mathcal S\) the direction of motion of the massless observer,Footnote 14 and \(\varLambda \) some nonzero scaling factor. The LF longitudinal momentum and energy are then given by
The transverse momentum components \(P^i_\perp \) are obviously the same in both instant and front forms. The covariant LF Lorentz generators correspond to particular combinations of covariant instant form boost and rotation generators
Note that in the \(\mathcal S_u\) frame, the longitudinal components in both forms are simply proportional to each other
whereas the transverse components get mixed
7.2 Center of light-front momentum
Substituting \(u^\mu \) by \(n^\mu \) in Eq. (50), we see that the role of inertia in the LF formalism is taken over by the LF momentum. The coordinates of the LF centroid (or center of LF momentum) are then defined byFootnote 15
Provided that the ISC holds, the LF centroid moves along a straight line with constant LF velocity
Looking at the \(\mu =0\) component allows us to relate the LF time coordinate with the ordinary time coordinate in \(\mathcal S\)
and to reparametrize the LF centroid worldline as follows
This shows that the LF centroid moves along a line parallel to the instant form centroids with the same constant velocity.
7.3 Møller’s circle
One can naturally also express the LF time coordinate in terms of the proper time \(\tau \)
The LF centroid worldline then reads
and appears to be shifted relative to the CM by
Note that the shift is manifestly independent of the scaling factor \(\varLambda \), since both the numerator and the denominator involve the same number of contractions with \(n^\mu \). In the CM frame, the shift reduces to
Since \(\varvec{n}\) is a unit vector, we see that the set of all LF centroids forms a circle corresponding to the boundary of Møller’s disk. This is in line with the interpretation of the LF formalism as corresponding to the viewpoint of massless observers, i.e. observers moving at the speed of light.
7.4 Impact-parameter distributions
The LF formalism is well suited to study the internal structure of the nucleon, see e.g. [37, 39,40,41]. In particular, non-perturbative correlation functions extracted from deeply virtual Compton scattering experiments, known as generalized parton distributions (GPDs), have attracted a lot of attention in the last two decades [42,43,44]. The reason for this is because GPDs were shown to give access to the EMT and hence to the AM content of the nucleon [45], and to provide tomographic pictures of the internal structure in impact-parameter space [46, 47]. One can even map out the distribution of AM in impact-parameter space [48].
It has been observed that the impact-parameter distributions (IPDs) get distorted whenever the state is transversely polarized [47, 49, 50]. These distortions are understood as originating from a relativistic artifact of the LF formalism associated with the internal OAM of quarks and gluons inside the target. When the target is a charged elementary particle, there is no substructure at leading order in QED. The internal OAM must therefore vanish at that order, leading immediately to an unambiguous definition of the natural values for the electromagnetic moments [51]. In particular, we found that the gyromagnetic ratio of elementary particles is given by \(g=2\) at tree level for any spin.
Using the following general parametrization for the matrix elements of the Belinfante–Rosenfeld EMT in a spin-\(\frac{1}{2}\) state
where \(a^{\{\mu }b^{\nu \}}=a^\mu b^\nu +a^\nu b^\mu \) and with \(P=\frac{p'+p}{2}\) the average momentum, \(\varDelta =p'-p\) the momentum transfer, M the mass, and \({\mathsf {S}}\) (\({\mathsf {S}}'\)) the initial (final) polarization four-vector of the state, Ji derived in the seminal paper [45] a relation between the quark contribution to internal AM and the energy-momentum form factors
Moreover, he showed that these energy-momentum form factors can be expressed in terms of twist-2 quark vector GPDs
where x is the fraction of LF momentum carried by the quark. For a recent review of the GPD phenomenology, see [52].
Burkardt investigated in Ref. [53] the relation between AM and transverse distortions of IPDs. Focusing on the “good” LF component of the Belinfante–Rosenfeld quark EMT, he found for a nucleon at rest with transverse polarization
where \(\epsilon ^{ij}_\perp \equiv \epsilon ^{ij+-}\) and \(\mathcal N\) is a normalization factor depending on the wave packet \(|\psi \rangle \). A similar result was obtained using a tower of twist-2 operators generalizing the Belinfante–Rosenfeld AM density [54] or the Pauli–Lubański pseudo-vector [55]. In his derivation, Burkardt insisted on the fact that \(|\psi \rangle \) must be a delocalized state centered around the origin. Indeed, if one uses the standard transversely polarized LF state localized at the LF centroid, one would find \(\frac{1}{2}B_q(0)\) instead of \(J_q\) on the RHS of Eq. (134). Based on the Melosh-Wigner rotation relating canonical polarizations in instant and front forms [56,57,58], Burkardt argued that the LF wave packet of a transversely polarized spin-\(\frac{1}{2}\) state must be shifted sideways by half a Compton wavelength relative to the instant form one. This shift adds a contribution proportional to \(\frac{1}{2}A_q(0)\), leading then to Eq. (134).
The distortions of the IPDs in the LF formalism and their relation to AM can be more easily understood based on the results obtained in the previous sections. We will work at a fixed LF time \(x^+=0\) and drop all references to it in the following for convenience. Let us write the quark LF boost generators as the following telescoping series
where \(R^\mu _{qn}\), \(R^\mu _n\) and \(X^\mu _\star \) are the positions at \(x^+=0\) of the LF centroid of the quark subsystem, the LF centroid of the whole system, and the CM of the whole system, respectively. The first term of this series can be expressed as
and represents the LF dipole moment of the quark subsystem relative to the LF centroid. Focusing on the transverse LF components \(\mu =i\), we get
where \(\varvec{b}_\perp \) is the usual impact-parameter variable defined as the transverse position relative to the LF centroid.
The “position” of a system is usually identified with that of the canonical reference point,Footnote 16 see Appendix B. In the LF formalism, the canonical reference point coincides with the LF centroid. If we set the origin of the coordinate system at the LF centroid, the last two terms in Eq. (135) cancel each other and we are left with the first one. This term provides the LF dipole moment associated with the distortions of the IPDs, and reads in terms of the energy-momentum form factors [47]
In instant form, the canonical reference point coincides in the rest frame with the CM. If we set the origin of the coordinate system at the CM, only the last term in Eq. (135) vanishes. Beside the contribution (138), we have another one which takes into account the shift between the LF centroid and the CM. Focusing on the transverse LF components, we find
using Eq. (130). For a spin-\(\tfrac{1}{2}\) state we have \(|\varvec{S}_0|=\tfrac{1}{2}\), giving the shift by half a Compton wavelength between instant and front forms advocated by Burkardt. It simply arises from the dependence of the canonical reference point on the observer. We do not need to invoke Melosh-Wigner rotation effectsFootnote 17 and delocalized wave packets. Now since \(\langle P^+_q\rangle =A_q(0)P^+\), we finally get
Gathering all the contributions, we find that in the rest frame with the CM sitting at the origin of the coordinate system
which agrees with Eq. (134). In the RHS, we used the fact that the combination of energy-momentum form factors \(\tfrac{1}{2}\left[ A_q(0)+B_q(0)\right] \) is precisely the one giving the quark contribution to the nucleon spin \(J_q\) (132).
7.5 Interpretation of Ji’s relation
Twenty years ago, Ji looked for a Lorentz-invariant spin sum rule. For a spin-\(\tfrac{1}{2}\) state, he wrote [61]
where the Pauli–Lubański pseudo-vector \(W^\mu =\sum _a \mathcal W^\mu _a(x^0)\) has been decomposed into contributions associated with the various types of constituents. Choosing the z-axis along the momentum of the system and the longitudinal polarization \(\varvec{{\mathsf {S}}}\parallel \varvec{p}\), he obtained the helicity sum rule
In particular, the quark contribution to the nucleon spin is defined as
and can be expressed in terms of the energy-momentum form factors as in Eq. (132). This matrix element is valid for any momentum \(\varvec{p}\) as long as one considers longitudinal polarization.
Reading Eq. (134) backwards suggests another interpretation of Ji’s relation (132) in terms of transverse polarization instead of longitudinal polarization [53, 55]. It can be regarded as the sum of two contributions: a term \(\frac{1}{2}B_q(0)\) arising from the distortion of the IPD in the LF formalism when the state is transversely polarized, supplemented by a term \(\frac{1}{2}A_q(0)\) arising from an overall transverse shift when going from transversely polarized states at rest in instant form to front form. Pushing the interpretation further, it has even been suggested [53,54,55] that the GPD combination \(J_q(x)\equiv \frac{x}{2}\left[ H_q(x,0,0)+E_q(x,0,0)\right] \) should be regarded as the distribution of quark internal AM in x-space for a transversely polarized target. This simple partonic interpretation is however not well founded [2, 62], one of the reasons being that although the quantity \(J_q\) appears on the RHS of Eq. (134), the LHS corresponds actually to the matrix element of the transverse LF boost generators at \(x^+=0\) and not the transverse AM as one would naively expect [63, 64]. Moreover, we note that Eq. (134) provides a relation between boost generators and spin only at the level of matrix elements. Since interpretations made at the level of matrix elements may be misleading [65], one should first determine whether this relation remains valid at the operator level. We doubt this is possible because the dependence on the momentum fraction x, being obtained through a non-local operator, cannot unambiguously be related to Ji’s AM [62, 66, 67].
Ji et al. [68, 69] tried to justify the above alternative interpretation of Ji’s relation starting from the quark contribution to the Pauli–Lubański pseudo-vector
To do so, they discarded by hand an annoying term involving the \({\bar{C}}_q(0)\) energy-momentum form factor, motivated by the fact that \(\sum _a{\bar{C}}_a(\varDelta ^2)=0\) as a consequence of the conservation of the EMT. This is obviously not an acceptable argument when quark and gluon contributions are considered separately. The annoying term has been derived explicitly in both instant and front forms, and shown to depend on the observer [70,71,72].
Once again, we can easily understand the above observations based on the results obtained in the previous sections. The quark contribution to the generalized AM tensor at a fixed LF time can be written as
The quark contribution to the Pauli–Lubański pseudo-vector introduced by Ji, Xiong and Yuan then reads
Because of the first term on the RHS, we see that \(\mathcal W^\mu _q(x^+)\) does not represent in general the quark contribution to the nucleon spin despite the fact that \(\sum _a\mathcal W^\mu _a(x^+)=W^\mu \). When summed over all quark and gluons contributions, this first term does however vanish owing to energy-momentum conservation \(\sum _aP^\mu _a(x^+)=P^\mu \). At the initial LF time \(x^+=0\), we get
Treating with care the matrix elements as explained in Appendix B, we find using the parametrization (131) that the first term on the RHS is precisely the annoying one proportional to \({\bar{C}}_q(0)\). Since this term is external, it depends on the choice of origin. Setting as usual the origin at the canonical reference point explains the observer dependence found in [70,71,72]. Clearly, the only proper ways to get rid of the annoying term is to either restrict ourselves to the longitudinal component like Ji did originally [61], or set the origin at the CM of the system. Note that based on Eq. (146), one can also easily understand the frame dependence of the transverse AM decomposition obtained in instant form by Leader [2, 73].
8 Conclusions
Motivated by the question of orbital angular momentum in hadronic physics, we reviewed the concept of relativistic center of mass in field theory. We extended the discussion to asymmetric energy-momentum tensors and the light-front formalism which constitute the most suitable framework to study the nucleon internal structure.
We found that the canonical reference point with respect to which orbital angular momentum is defined in field theory depends on the observer. This dependence arises because of the quantization of angular momentum in a relativistic theory, and provides a simple explanation for the presence of a p-wave in the plane-wave solutions to Dirac equation. It clarifies the difference between longitudinal and transverse spin sum rules, and the origin of various induced shifts and distortions observed in the distributions defined within the light-front formalism.
The results presented in this work are expected to provide a new perspective on various phenomena like e.g. Thomas precession, Zitterbewegung, and other effects associated with relativistic spin-orbit coupling.
Notes
By inertia we mean inertial mass, i.e. the system’s resistance to being accelerated by a force.
Typically, forthcoming results can be taken as applying to the quantum theory, provided simple technical adjustments are made. For example, Noether currents and charges become operator-valued distributions acting on a Hilbert space, and classical products AB are replaced by symmetric products \(\frac{1}{2}(AB+BA)\) of the corresponding operators.
We restrict ourselves to the case of massive systems for which the evaluation of \(P^0\) leads to a strictly positive number. In the quantum theory, \(1/P^0\) has to be understood as the inverse energy operator \((P^0)^{-1}\) whose eigenvalue when acting on an energy eigenstate coincides with the inverse of the energy eigenvalue.
To get the four-velocity, one has to multiply this expression by the Lorentz boost factor \(\gamma =P^0/M\), where M is the mass of the system.
More explicitly, this unit four-vector is the timelike vierbein whose components are \(u^\mu =\varLambda ^{\mu }_{\phantom {\mu }0}\), where \(\varLambda \) is the Lorentz transformation relating \(\mathcal S_u\) to \(\mathcal S\).
One might be worried about the change in the spatial integration for the generators defined relative to \(\mathcal S_u\). However, it follows from Noether’s theorem that the Poincaré generators transform as ordinary Lorentz tensors, and therefore do not depend on how the spatial integrations are performed.
It is also quite common to find the reverse argument in the literature. Assuming that the EMT is symmetric, one deduces that the corresponding orbital tensor must be conserved. The latter is then identified with the generalized total AM tensor.
For a gauge-invariant formulation, see [29].
In the language of first quantization, the upper and lower components of the Dirac spinor are simultaneous eigenstates of the longitudinal intrinsic AM operator \(\hat{S}_z=\frac{1}{2}\,\gamma ^3\gamma _5\) and orbital AM operator \(\hat{L}_z=\gamma ^0\left( p_R\,\frac{\partial }{\partial p_R}-p_L\,\frac{\partial }{\partial p_L}\right) \). For a state with polarization \(s=\uparrow \) corresponding to the eigenvalue of total AM \(j_z=+\frac{1}{2}\), the upper components have eigenvalues \(s_z=+\frac{1}{2}\) and \(l_z=0\), and the lower components lead to a superposition of \(s_z=+\frac{1}{2}\) and \(l_z=0\) with \(s_z=-\frac{1}{2}\) and \(l_z=+1\).
In practice, one usually makes the canonical choice \(\varvec{n}=-\varvec{e}_z\) corresponding to the viewpoint of a photon moving along the \(-z\) direction.
One could also in principle define the center of LF energy \(R^\mu _{\bar{n}}(x^+)=\frac{1}{P^-}\int \mathrm {d}^3_nx\,x^\mu T^{+-}(x)\), which is less interesting because \(P^-\) does not leave the LF hyperplane \(x^+=0\) invariant, unlike \(P^+\).
We used the word position with quotation marks because its definition depends on the observer, and therefore does not represent the actual position of the system which should better be identified with that of the CM, see Sect. 5.4.
Melosh–Wigner rotation effects and dependence of the canonical reference point on the observer are however related as they both arise from the use of a particular subset of Lorentz transformations in the definition of moving states [33]. This provides a new perspective on the origin of model relations among various parton distributions and OAM [59, 60].
Since at the classical level the generators are functionals of the fields, the Poisson brackets of two generators are defined as \(\{A,B\}_\text {PB}=\int \mathrm {d}^3x\sum _a\left[ \frac{\delta A}{\delta \phi _a(x)}\frac{\delta B}{\delta \pi _a(x)}-\frac{\delta A}{\delta \pi _a(x)}\frac{\delta B}{\delta \phi _a(x)}\right] \), where \(\pi _a(x)\) is the conjugate field of \(\phi _a(x)\). In the quantum theory, Poisson brackets are replaced by the standard commutators \(\{A,B\}_\text {PB}\mapsto \frac{1}{i}[A,B]\).
In the quantum theory, \(p^\mu \) corresponds to the expectation value of the four-momentum operator \(P^\mu \).
References
M.H.L. Pryce, Proc. R. Soc. Lond. A 195, 62 (1948)
E. Leader, C. Lorcé, Phys. Rep. 541(3), 163 (2014)
C. Møller, Theor. Phys. vol. 3. Commun. Dublin Inst. Advanced Studies A 5, 3 (1949)
M. Born, L. Infeld, Proc. R. Soc. Lond. A 150, 141 (1935)
G.N. Fleming, Phys. Rev. 137, 188 (1965)
C. Møller, Ann. Inst. Henri Poincaré 11, 215 (1949)
A.D. Fokker, Relativiteitstheorie (P. Noordhoff, Groningen, 1929)
P. Hoodbhoy, X.D. Ji, W. Lu, Phys. Rev. D 59, 074010 (1999)
C.W. Misner, K.S. Thorne, J.A. Wheeler, “Gravitation,” San Francisco (1973)
L.F.O. Costa, J. Natário, Fundam. Theor. Phys. 179, 215 (2015)
W. Tulczyjew, Acta Phys. Pol. 18, 393 (1959)
W.G. Dixon, Il Nuovo Cimento 34, 317 (1964)
J.Y. Chen, D.T. Son, M.A. Stephanov, H.U. Yee, Y. Yin, Phys. Rev. Lett. 113(18), 182302 (2014)
J.Y. Chen, D.T. Son, M.A. Stephanov, Phys. Rev. Lett. 115(2), 021601 (2015)
C. Lorcé, JHEP 1508, 045 (2015)
F.W. Hehl, Rep. Math. Phys. 9, 55 (1976)
A. Papetrou, Philos. Mag. 40, 937 (1949)
C. Truesdell, The classical field theories, in Principles of Classical Mechnics and Field Theory, Encyclopedia of Physics, vol. 2/3/1, ed. by S. Flügge (Springer, Berlin, 1960)
C. Eringen, Polar and nonlocalfield theories, in Continuum Physics, vol. 4, ed. by C. Eringen (Academic Press, New York, 1976)
A. Einstein, W.J. de Haas, Verh. Dtsch. Phys. Ges. 17, 152 (1915)
V. Ya Frenkel, Sov. Phys. Uspekhi 22(7), 580 (1979)
S.J. Barnett, G.S. Kenny, Phys. Rev. 91, 408 (1953)
F.J. Belinfante, Physica 6, 887 (1939)
F.J. Belinfante, Physica 7, 449 (1940)
L. Rosenfeld, Mém. Acad. R. Belg. 18, 1 (1940)
R. Medina, J. Stephany, arXiv:1404.1590 [physics.class-ph]
R. Medina, J. Stephany, arXiv:1404.3334 [hep-th]
R. Medina, J. Stephany, arXiv:1703.02109 [physics.class-ph]
Z.Q. Guo, I. Schmidt, Phys. Rev. D 87(11), 114016 (2013)
G. Salesi, L. Deleidi, Hadron. J. 38, 249 (2015)
T. Ivezic, J. Phys. Conf. Ser. 437, 012014 (2013)
J.S. Anandan, Phys. Rev. Lett. 85, 1354 (2000)
W.N. Polyzou, W. Glöckle, H. Witala, Few Body Syst. 54, 1667 (2013)
R.A. Muller, Am. J. Phys. 60, 313 (1992)
K. Rebilas, Am. J. Phys. 83, 199 (2015)
P.A.M. Dirac, Rev. Mod. Phys. 21, 392 (1949)
S.J. Brodsky, H.C. Pauli, S.S. Pinsky, Phys. Rep. 301, 299 (1998)
J. Carbonell, B. Desplanques, V.A. Karmanov, J.F. Mathiot, Phys. Rep. 300, 215 (1998)
S. Meissner, A. Metz, M. Schlegel, JHEP 0908, 056 (2009)
C. Lorcé, B. Pasquini, JHEP 1309, 138 (2013)
C. Lorcé, B. Pasquini, M. Vanderhaeghen, JHEP 1105, 041 (2011)
M. Diehl, Phys. Rep. 388, 41 (2003)
A.V. Belitsky, A.V. Radyushkin, Phys. Rep. 418, 1 (2005)
A. Bacchetta, Eur. Phys. J. A 52(6), 163 (2016)
X.D. Ji, Phys. Rev. Lett. 78, 610 (1997)
M. Burkardt, Phys. Rev. D 62, 071503 (2000) Erratum: [Phys. Rev. D 66, 119903 (2002)]
M. Burkardt, Int. J. Mod. Phys. A 18, 173 (2003)
C. Lorcé, L. Mantovani, B. Pasquini, Phys. Lett. B 776, 38 (2018)
G.A. Miller, Ann. Rev. Nucl. Part. Sci. 60, 1 (2010)
C. Lorcé, B. Pasquini, Phys. Rev. D 93(3), 034040 (2016)
C. Lorcé, Phys. Rev. D 79, 113011 (2009)
K. Kumericki, S. Liuti, H. Moutarde, Eur. Phys. J. A 52(6), 157 (2016)
M. Burkardt, Phys. Rev. D 72, 094020 (2005)
P. Hoodbhoy, X.D. Ji, W. Lu, Phys. Rev. D 59, 014013 (1999)
X. Ji, X. Xiong, F. Yuan, Phys. Rev. Lett. 109, 152005 (2012)
E.P. Wigner, Ann. Math. 40, 149 (1939)
H.J. Melosh, Phys. Rev. D 9, 1095 (1974)
P.L. Chung, W.N. Polyzou, F. Coester, B.D. Keister, Phys. Rev. C 37, 2000 (1988)
C. Lorcé, B. Pasquini, Phys. Rev. D 84, 034039 (2011)
C. Lorcé, B. Pasquini, Phys. Lett. B 710, 486 (2012)
X.D. Ji, in *Amsterdam (1996), Spin 96* pp. 68–74
K.F. Liu, C. Lorcé, Eur. Phys. J. A 52(6), 160 (2016)
E. Leader, C. Lorcé, Phys. Rev. Lett. 111, 039101 (2013)
A. Harindranath, R. Kundu, A. Mukherjee, R. Ratabole, Phys. Rev. Lett. 111, 039102 (2013)
C. Lorcé, Eur. Phys. J. C 78(2), 120 (2018)
Y. Hatta, S. Yoshida, JHEP 1210, 080 (2012)
C. Lorcé, Phys. Lett. B 719, 185 (2013)
X. Ji, X. Xiong, F. Yuan, Phys. Lett. B 717, 214 (2012)
X. Ji, X. Xiong, F. Yuan, Phys. Rev. Lett. 111, 039103 (2013)
E. Leader, Phys. Lett. B 720, 120 (2013) Erratum: [Phys. Lett. B 726, 927 (2013)]
Y. Hatta, K. Tanaka, S. Yoshida, JHEP 1302, 003 (2013)
A. Harindranath, R. Kundu, A. Mukherjee, Phys. Lett. B 728, 63 (2014)
E. Leader, Phys. Rev. D 85, 051501 (2012)
W. Greiner, J. Reinhardt, Field Quantization (Springer, Berlin, 1996)
L. Susskind, Phys. Rev. 165, 1535 (1968)
L. Susskind, Phys. Rev. 165, 1547 (1968)
J.B. Kogut, D.E. Soper, Phys. Rev. D 1, 2901 (1970)
B.L.G. Bakker, E. Leader, T.L. Trueman, Phys. Rev. D 70, 114001 (2004)
W.B. Case, Am. J. Phys. 76, 937 (2008)
X.d Ji, Phys. Rev. Lett. 91, 062001 (2003)
A.V. Belitsky, X.d Ji, F. Yuan, Phys. Rev. D 69, 074014 (2004)
M.V. Polyakov, Phys. Lett. B 555, 57 (2003)
C. Lorcé, B. Pasquini, Phys. Rev. D 84, 014015 (2011)
C. Lorcé, B. Pasquini, X. Xiong, F. Yuan, Phys. Rev. D 85, 114006 (2012)
C. Lorcé, Phys. Rev. D 97(1), 016005 (2018)
Acknowledgements
We are thankful to C. Kopper for his careful reading and comments about the present work. This work has been supported by the Agence Nationale de la Recherche under the project ANR-16-CE31-0019.
Author information
Authors and Affiliations
Corresponding author
Appendices
A Poincaré algebra
The generators \(P^\mu \) and \(J^{\alpha \beta }\) satisfy the Poincaré algebra defined by the following set of (equal-time) Poisson bracketsFootnote 18 [74]
The first bracket indicates the four-momentum is invariant under spacetime translations. In particular, it is conserved in the sense that it does not depend on time. The other two brackets enforce the components of \(P^\mu \) to transform as a Lorentz four-vector and the components of \(J^{\alpha \beta }\) to transform an (antisymmetric) rank-2 Lorentz tensor. Contracting Eqs. (149) and (150) with \(P_\mu \) shows that \(P^2\) has vanishing Poisson bracket with all the Poincaré generators, and can therefore be used as a frame-independent label of the system.
The Poincaré algebra becomes more transparent when expressed in terms of the generators defined in some Lorentz frame \(\mathcal S\). Denoting by \(H=P^0\) the Hamiltonian of the system, one finds that
Since the Poincaré generators are time-independent, these relations indicate that only the boost generators involve the time coordinate explicitly owing to
with \(\dot{K}^i=\mathrm {d}K^i/\mathrm {d}t\). At the same time, the relations (152) indicate that the energy of the system is invariant under translations and rotations, but gets mixed up with momentum under boosts.
One gets also from the Poincaré algebra
that \(\varvec{P}\), \(\varvec{J}\) and \(\varvec{K}\) transform as ordinary three-vectors under rotations. Moreover, these relations tell us that total AM depends explicitly on the coordinates of some particular point owing to
and gets mixed up with boosts under boosts. Total AM contains OAM which is defined with respect to a pivot, and so explicitly depends on the coordinates \(\varvec{X}\) of the latter. In practice, the pivot is identified with the origin of the coordinate system \(\varvec{X}=\varvec{0}\), see Eq. (39). One therefore omits to write this dependence and identifies \(\varvec{x}\) with the position relative to the pivot. Under an active translation generated by the Poisson brackets, the system is translated by an infinitesimal amount, but the pivot remains at the origin. This changes the total AM and explains the nonvanishing of \(\{J^i,P^j\}_\text {PB}\). The latter can however be compensated by an infinitesimal translation of the pivot represented by the term \(\nabla ^j J^i\). In other words, Eq. (155) expresses the invariance of total AM when both the system and the pivot are translated.
The last set obtained from the Poincaré algebra reads
These relations indicate that contrary to momentum, boosts do also depend explicitly on the coordinate of some particular point. Moreover, they confirm that boosts mix momentum with energy, and boosts with rotations.
1.1 A.1 Poincaré generators relative to a generic frame
The previous discussion about the Poincaré generators can be done in a more covariant way. Note that the object \(u^\mu \) introduced in Sect. 2.2 is an auxiliary four-vector, in the sense that it transforms as a four-vector under a change of reference frame \(\mathcal S\) (passive transformation), but has vanishing Poisson brackets with all Poincaré generators (active transformation).
Just like the generators of Lorentz transformations (11) can be decomposed into generators of boosts and rotations relative to \(\mathcal S_u\), the four-momentum can be decomposed into Hamiltonian and momentum operators
where \(H_u\equiv P\cdot u\) and \(P^\mu _u\equiv \varDelta ^{\mu \nu }_uP_\nu \) with the projector \(\varDelta ^{\mu \nu }_u\equiv \eta ^{\mu \nu }-u^\mu u^\nu \). In terms of these covariant generators, the Poincaré algebra reads
Among all the possible Lorentz frames \(\mathcal S_u\), a special role is played by the system rest frame \(\mathcal S_\star \). This frame is identified by the four-velocity \(u^\mu _\star =p^\mu /m\), with m the proper mass and \(p^\mu \) the four-momentum obtained from the evaluation of the spacetime translation generators \(P^\mu \) for the particular field configuration describing the system.Footnote 19 In particular, we have
which are naturally reminiscent of the Lorentz transformation laws for the electric and magnetic fields. Note that because of \(\{K^i,P^j\}_\text {PB}=-\delta ^{ij}H\), it is possible to set the field evaluation of the boost generators to zero through a suitable translation. Denoting the field evaluation of the generators by the corresponding lower case letters, we get
We therefore find that \(\varvec{A}\equiv m\varvec{k}_\star \) is nothing but the (relativistic version of the) Laplace–Runge–Lenz vector of Classical Mechanics in absence of external forces.
1.2 A.2 Poincaré generators relative to the instantaneous rest frame
Instead of working with an auxiliary four-vector \(u^\mu \), we can use the generators of spacetime translations \(P^\mu \) to define
where \(W^\mu \) is the standard Pauli–Lubański pseudo-vector, and therefore write
Clearly, \(N^\mu \) and \(W^\mu \) are both orthogonal to \(P^\mu \).
Unlike \(K^\mu _\star =-J^{\mu \nu }p_\nu /m\) and \(J^\mu _\star =\tfrac{1}{2}\,\epsilon ^{\mu \alpha \beta \lambda }J_{\alpha \beta }p_\lambda /m\), the objects \(N^\mu \) and \(W^\mu \) behave as four-vectors under active Lorentz transformations generated by the Poisson brackets
They satisfy
In particular, it is readily seen that \(W^2\) (like \(P^2\)) has vanishing Poisson brackets with all the Poincaré generators. The first two relations are simple to interpret. They indicate that the relativistic spin \(S^\mu =W^\mu /\sqrt{P^2}\) is independent of the choice of origin, and obeys standard commutation relations for AM only in the rest frame.
The difference between the algebras (158) and (164) appears in the Poisson brackets \(\{W^\mu ,N^\nu \}_\text {PB}\not \propto \{J^\mu _\star ,K^\nu _\star \}_\text {PB}\) and \(\{N^\mu ,N^\nu \}_\text {PB}\not \propto \{K^\mu _\star ,K^\nu _\star \}_\text {PB}\). This can be understood by the fact that \(K^\mu _\star \) and \(J^\mu _\star \) represent the boost and rotation generators relative to the rest frame of the system determined prior any active Poincaré transformation, whereas \(N^\mu /\sqrt{P^2}\) and \(W^\mu /\sqrt{P^2}\) represent the boost and rotation generators relative to the rest frame of the system at the moment of their action. Since active boosts change the frame in which the system is at rest, subsequent \(N^\mu \) and \(W^\mu \) do not coincide anymore with \(mK^\mu _\star \) and \(mJ^\mu _\star \).
1.3 A.3 Algebra of the covariant position, orbital and spin angular momentum
In Sect. 3, we found that the position of the center of inertia, and the associated external and internal parts of AM are constructed from the Poincaré generators as follows
They all transform as ordinary three-vectors under rotations
and satisfy familiar Poisson brackets with energy
and momentum
The remaining Poisson brackets [4]
differ from the familiar ones by terms proportional to \(S^i/H^2\). In the nonrelativistic limit, the contribution of these terms vanishes as one can easily see by reinstating the factors of c in the above expressions. They can therefore be understood as relativistic corrections. Note also that in the system rest frame, all the relativistic corrections disappear except for \(\{R^i,R^j\}_\text {PB}\).
For the corresponding covariant quantities defined by an observer in \(\mathcal S_u\), we have
They satisfy the following Poisson brackets with the covariant energy
and the covariant momentum
The remaining Poisson brackets read
For the instantaneous CM operator \(R^\mu (\tau )=\tau P^\mu /M-N^\mu /M^2\), we find
It follows from Eq. (163) that \(R^\mu (\tau )\) transforms as a Lorentz four-vector
1.4 A.4 Light-front form
Using the notations \(\mathcal B_L=J^{+-}\), \(\mathcal B^i_\perp =J^{+i}\), \(\mathcal J_L=\frac{1}{2}\,\epsilon ^{ij}_\perp J^{ij}\) and \(\mathcal J^i_\perp =\epsilon ^{ij}_\perp J^{-j}\) for the LF boost and rotation generators introduced in Sect. 7.1, the Poincaré algebra reads
and
We see that the first set forms a two-dimensional Galilean subgroup where the LF momentum plays the role of a non-relativistic “mass” in the transverse plane [75,76,77].
In terms of the transverse LF position \(\varvec{R}_\perp =(x^+ \varvec{P}_\perp -\varvec{\mathcal B}_\perp )/P^+\), we have
and for the longitudinal LF position \(R^-=(x^+ P^--\mathcal B_L)/P^+\), we have
The other Poisson brackets read
B Canonical reference point
When it comes to evaluating matrix elements of OAM in a quantum theory, one has to be careful with the treatment of the position variable [2, 78]. The problem is that OAM requires the knowledge of both position and momentum, explaining why a standard plane-wave approach fails. One solution is to consider wave packets [78], but the price to pay is that calculations usually become quite lengthy and cumbersome. In particular, one has to identify and remove the part associated with the structure of the wave packet, which we are not interested in. An equivalent and much simpler solution is to use the Wigner-Weyl representation [79].
Consider the matrix element of an operator O in some state \(|\psi \rangle \) normalized as \(\langle \psi |\psi \rangle =1\). It can conveniently be written in the form
with the density operator \(\rho _\psi =|\psi \rangle \langle \psi |\). A closed system with mass M, average position \(\varvec{\mathcal R}(x^0)\) and momentum \(\varvec{P}\) defined in the Wigner sense at some time \(\mathcal R^0(x^0)=x^0\) is represented by the following relativistic phase-space operatorFootnote 20
where the four-vectors \(P=\frac{p'+p}{2}\) and \(\varDelta =p'-p\) satisfy the constraints \(P\cdot \varDelta =0\) and \(P^2+\frac{\varDelta ^2}{4}=M^2\) which arise from the mass shell conditions. A similar operator can be introduced within the LF formalism [83, 84].
The phase-space operator is properly normalized \(\mathrm {Tr}[\rho _{\varvec{\mathcal R},\varvec{P}}]=1\), and plane waves are recovered by averaging over \(\varvec{\mathcal R}\)
Note that the time dependence drops out because the integral over \(\varvec{\mathcal R}\) imposes \(\varDelta =0\). Defining “position” states at some time \(r^0(x^0)=x^0\) as
with normalization \(\langle r'|r\rangle =\delta ^{(3)}(\varvec{r}'-\varvec{r})\), the phase-space operator (193) can alternatively be expressed as
with \(Z^0=0\). If we average over \(\varvec{P}\), we recover the “spatial” density operator
Let us now consider the matrix elements of the OAM operator \(J^i_\text {orb}=\epsilon ^{ijk}\int \mathrm {d}^3x\,x^j\,T^{0k}(x)\). After some standard manipulations [2], we find
where \(E=P^0|_{\varDelta =0}=\sqrt{\varvec{P}^2+M^2}\). These two terms represent the external and internal parts of the OAM defined by the canonical reference point \(\varvec{\mathcal R}(x^0)\). In practice, one usually works at a fixed initial time \(x^0=0\) and sets the origin at the initial average position of the system \(\varvec{\mathcal R}(0)=\varvec{0}\).
In order to determine where this canonical reference point is situated, we consider the matrix elements of the operator \(R^i(x^0)=\frac{1}{P^0}\int \mathrm {d}^3x\,x^i\,T^{00}(x)\) giving the position of the center of inertia. After some algebra, we find
For a closed spin-\(\frac{1}{2}\) system, we can use the parametrization (131) with \(A(0)=1\) and \(B(0)={\bar{C}}(0)=0\) which arise from the conservation of total linear and angular momenta [2]. We then get
where we used the generic expression for the Dirac bilinear derived in [85]. The shift between the center of inertia and the CM being given by Eq. (78)
we conclude that the canonical reference point \(\varvec{\mathcal R}\) is situated on the segment joining the center of inertia to the CM. In the Breit frame \(\varvec{P}=\varvec{0}\), which corresponds to the average CM frame, the canonical reference point coincides with both the center of inertia and the CM. In the infinite-momentum frame \(|\varvec{P}|\rightarrow \infty \), the canonical reference point coincides with the center of inertia and is half a Compton wavelength away from the CM. Between these two limiting frames, all three points differ.
Within the LF formalism, one usually works in the symmetric frame \(\varvec{P}_\perp =\varvec{0}_\perp \). Using again the results of [85], we find that the canonical reference point will always coincide with the center of LF momentum in the transverse plane
in agreement with the discussion in [47] based on the Galilean subgroup (176)–(178).
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
Funded by SCOAP3
About this article
Cite this article
Lorcé, C. The relativistic center of mass in field theory with spin. Eur. Phys. J. C 78, 785 (2018). https://doi.org/10.1140/epjc/s10052-018-6249-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1140/epjc/s10052-018-6249-3