Abstract
We show that the big bang is a coordinate singularity for a large class of \(k = -1\) inflationary FLRW spacetimes which we have dubbed ‘Milne-like.’ By introducing a new set of coordinates, the big bang appears as a past boundary of the universe where the metric is no longer degenerate—a result which has already been investigated in the context of vacuum decay (Coleman and De Luccia in Phys Rev D 21:3305–3315, 1980). We generalize their results and approach the problem from a more mathematical perspective. Similar to how investigating the geometrical properties of the \(r = 2m\) event horizon in Schwarzschild led to a better understanding of black holes, we believe that investigating the geometrical properties of the big bang coordinate singularity for Milne-like spacetimes could lead to a better understanding of cosmology. We show how the mathematics of these spacetimes may help illuminate certain issues associated with dark energy, dark matter, and the universe’s missing antimatter.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
In this paper we show that the big bang is a coordinate singularity for a large class of \(k = -1\) inflationary FLRW spacetimes which we have dubbed ‘Milne-like.’ This may seem surprising and one may justifiably ask: don’t the singularity theorems imply that this cannot happen? In Appendix 1 we show why the singularity theorems don’t apply to inflationary spacetimes. Indeed Milne-like spacetimes can almost always be used as counterexamples [12].
The coordinate singularity appearing in Milne-like spacetimes may offer a new geometrical perspective of our universe. We believe that understanding the geometry of the coordinate singularity for Milne-like spacetimes could lead to a better understanding of cosmology. One of the goals of this paper is to demonstrate how the mathematics of these spacetimes may help illuminate certain issues associated with dark energy, dark matter, and the universe’s missing antimatter. Our solution to the antimatter problem is similar to the solution presented in [6].
We remark a previous time when understanding the geometry of a coordinate singularity led to advances in theoretical physics. The \(r = 2m\) event horizon in Schwarzschild is a coordinate singularity. Understanding the geometry of event horizons led to Hawking’s area theorem for black holes [11, 19, 40] which played a pivotal role in the development of black hole thermodynamics [40].
Throughout this paper we set constants \(G = c = \hbar = 1\). Our signature convention is \((-,+,+,+)\).
1.1 Summary of Results
In this section we give a brief summary of our results. We highlight four main points.
- (1)
The big bang is a coordinate singularity for Milne-like spacetimes.
- (2)
The geometric solution to the horizon problem.
- (3)
The cosmological constant appears as an initial condition.
- (4)
Lorentz invariance and its implications for dark matter and antimatter.
(1) The big bang is a coordinate singularity for Milne-like spacetimes
The Milne universe is the \(k = -1\) FLRW spacetime with manifold \(M = (0,\infty ) \times \mathbb {R}^3\) and metric
and scale factor \(a(\tau ) = \tau\). Here we are using hyperbolic coordinates \((\tau , R, \theta , \phi )\). By introducing coordinates \((t,r,\theta ,\phi )\) via \(t = \tau \cosh (R)\) and \(r = \tau \sinh (R)\), the metric is
which is just the Minkowski metric. Notice that \(\tau = 0\) corresponds to the lightcone \(t = r\) at the origin \(\mathscr {O}\). Hence the big bang is a coordinate singularity for the Milne universe.
A Milne-like spacetime is a \(k = -1\) FLRW spacetime with scale factor assumed to satisfy \(a(\tau ) = \tau + o(\tau ^{1 + \varepsilon })\) for small \(\tau\). Letting \(\varepsilon = 1/2\) shows that this condition is satisfied for any Taylor expansion \(a(\tau ) = \tau + c_2\tau ^2 + c_3\tau ^3 + \cdots\). Since this is a limiting condition, Milne-like spacetimes can include an inflationary era, a radiation-dominated era, a matter-dominated era, and a dark energy-dominated era. Therefore they can model the dynamics of our universe. We introduce coordinates \((t,r,\theta , \phi )\) via \(t = b(\tau ) \cosh (R)\) and \(r = b(\tau )\sinh (R)\) where b satisfies \(b' = b/a\). Putting \(\Omega = 1/b' = a/b\), the metric is
The condition \(a(\tau ) = \tau + o(\tau ^{1 + \varepsilon })\) implies \(0< \Omega (0) < +\infty\) where \(\Omega (0) :=\lim _{\tau \rightarrow 0} \Omega (\tau )\). Therefore there is no degeneracy at \(\tau = 0\) in these coordinates. Thus the big bang is a coordinate singularity for Milne-like spacetimes. See Sect. 3.3 for the full proof (Fig. 1).
Figure 2 shows how the big bang is a coordinate singularity for the Milne universe and Milne-like spacetimes. In both cases the lightcone at the origin \(\mathscr {O}\) acts as a past boundary of the universe. It separates our universe from the extension.
The discussion so far has been informal. In Sect. 2 we give precise mathematical definitions to what we mean by coordinate and curvature singularities. We then apply these definitions in Sect. 3 to show the big bang is a coordinate singularity for Milne-like spacetimes. In Sect. 3.4 we show Milne-like spacetimes admit no curvature singularities provided the second derivative of the scale factor satisfies an asymptotic condition.
(2) The geometric solution to the horizon problem
We briefly recall the horizon problem in cosmology. It is the main motivating reason for inflationary theory [42, p. 208]. The problem comes from the uniform temperature of the CMB radiation. From any direction in the sky, we observe the CMB temperature as 2.7 K. The uniformity of this temperature is puzzling: if we assume the universe exists in a radiation-dominated era all the way down to the big bang (i.e. no inflation), then the points p and q on the surface of last scattering don’t have intersecting past lightcones. So how can the CMB temperature be so uniform if p and q were never in causal contact in the past? See Fig. 3.
An inflationary era before the radiation-dominated era would allow for causal contact in the past. This is depicted in Fig. 4. For a Milne-like spacetime the solution becomes apparent in the conformal Minkowski coordinates. Since the spacetime is conformal to Minkowski space, the lightcones are given by 45° angles. Therefore any two points p and q in a Milne-like spacetime will have past lightcones which intersect at some point above the origin \(\mathscr {O}\).
(3) The cosmological constant appears as an initial condition
Here we show how the cosmological constant \(\Lambda\) appears as an initial condition for Milne-like spacetimes. This may help explain the origin of \(\Lambda\). If dark energy is really modeled by a cosmological constant and not by some other model (e.g. quintessence), then \(\Lambda\) would have been fixed at the big bang.
The Einstein equations with a cosmological constant are
Let \(u = \partial / \partial \tau\) denote the four-velocity of the comoving observers and let e be any unit spacelike orthogonal vector (its choice does not matter by isotropy). We define the energy density \(\rho\) and pressure function p in terms of the Einstein tensor: \(\rho = \frac{1}{8\pi }G_{\mu \nu }u^\mu u^\nu\) and \(p = \frac{1}{8\pi }G_{\mu \nu }e^\mu e^\nu\) where \(G_{\mu \nu } = R_{\mu \nu } - \frac{1}{2}R g_{\mu \nu }\). We define the normal energy density \(\rho _{\mathrm{normal}}(\tau )\) and normal pressure function \(p_{\mathrm{normal}}(\tau )\) in terms of the energy-momentum tensor \(\rho _{\mathrm{normal}} = T_{\mu \nu }u^\mu u^\nu\) and \(p_{\mathrm{normal}} = T_{\mu \nu }e^\mu e^\nu\). Therefore \(\rho = \rho _{\mathrm{normal}} + \Lambda / 8\pi\) and \(p = p_{\mathrm{normal}} - \Lambda / 8\pi\).
If \(\rho _{\mathrm{normal}} = p_{\mathrm{normal}} = 0\) (e.g. de Sitter), then the equation of state for the cosmological constant is fixed for all \(\tau\).
In Sect. 4.2 we show that this equation of state appears as an initial condition for Milne-like spacetimes. Specifically, we show if the scale factor satisfies \(a''(\tau ) = \alpha \tau + o(\tau )\), then
Consequently, if \(\rho _{\mathrm{normal}}(\tau )\) and \(p_{\mathrm{normal}}(\tau ) \rightarrow 0\) as \(\tau \rightarrow 0\) (i.e. if the cosmological constant was the dominant energy source during the Planck era), then we have \(\Lambda = 3\alpha\). Note that \(\alpha = a'''(0)\). Therefore we have
This provides a connection between the cosmological constant \(\Lambda\) and the initial condition of the scale factor.
(4) Lorentz invariance and its implications for dark matter and antimatter
In Sect. 4.3 we show the Lorentz group \(\text {L} = \text {O}(1,3)\) acts by isometries on Milne-like spacetimes at the origin \(\mathscr {O}\). Therefore these spacetimes have a notion of Lorentz invariance. This follows because the Minkowski metric is Lorentz invariant and the conformal factor \(\Omega\) appearing in Eq. (1.3) is Lorentz invariant. Since Lorentz invariance plays a pivotal role in QFT (e.g. the field operators are constructed out of finite dimensional irreducible represenations of the Lorentz group [38, 41]), Milne-like spacetimes are a good background model if one wants to develop a quantum theory of cosmology with Lorentz invariance.
A Possible Dark Matter Particle?
There are two different kinds of symmetries in quantum theory. The local (gauge) symmetry group \(\text {SU}(3) \times \text{ SU }(2) \times \text {U}(1)\) of the standard model and the global spacetime symmetry group.
The irreducible unitary representations of \(\text {SL}(2,\mathbb {C})\) (which is the simply connected double cover of the connected component of \(\text {O}(1,3)\) containing the identity) are characterized by two different types of particles. See Theorem 10.9 in [38]. The first class of particles is the principal series. These particles are characterized by the parameter \(\nu = -iw\) where w is real and spin \(j = 0, 1/2, 1, \dotsc\) The second class of particles is the complementary series. These particles are characterized by the parameter \(-1 \le \nu \le 1\) and spin \(j = 0\). The terminology ‘principal’ and ‘complementary’ comes from the classification of irreducible unitary representations of semi-simple Lie groups [37].
Since the comoving observers emanate from the origin \(\mathscr {O}\) in a Milne-like spacetime (see Fig. 5), a possible physical interpretation of this classification would be that these are the particles created at the big bang. If this interpretation is correct, then the principal series would correspond to the particles of the standard model (in analogy to Wigner’s classification of the Poincaré group [43]). But this leaves the complementary series up to interpretation. Perhaps
But is there any evidence that dark matter is comprised of spin 0 particles? Yes. Scalar field dark matter (SFDM) [17, 18, 26, 36] also known as Bose–Einstein condensate (BEC) dark matter [22, 27, 32,33,34, 39] also known as wave dark matter (WDM) [7, 8, 15, 29] also known as fuzzy dark matter (FDM) [20, 21] all use the Klein–Gordon wave equation (i.e. the wave equation for spin 0 particles) to model dark matter. The difference in name comes from a difference in motivation. One reason for introducing models of dark matter based on the Klein–Gordon equation is to alleviate the cusp problem associated with the weakly interacting massive particle (WIMP) models of dark matter [23]. Furthermore, the models based on the Klein–Gordon equation reproduce the observed spiral pattern density in disk galaxies (see Figs. 1–4 in [8]) which makes these models promising.
If the identification “principal series = normal matter” and “complementary series = dark matter” is true, then the distinguishing feature could be related to the parameter \(\nu\) which is determined by Casimir operators built out of rotations and Lorentz boosts (see Eq. 10.3-1 in [38]). Perhaps this could offer an explanation for dark matter’s lack of interaction with electromagnetism.
What Lies Beyond\(\tau = 0\)?
Recall the Lorentz group \(\text {L} = \text {O}(1,3)\) has four connected components \(\text {L}^\uparrow _{+}\), \(\text {L}^\uparrow _-\), \(\text {L}^\downarrow _{+}\), \(\text {L}^\downarrow _{-}\). The ± corresponds to \(\det \Lambda = \pm 1\), the \(\uparrow\) corresponds to \(\Lambda ^0_{\,\,\,0} \ge 1\), and the \(\downarrow\) corresponds to \(\Lambda ^0_{\,\,\,0} \le -1\). Since Milne-like spacetimes are defined for \(t > 0\), only the subgroup \(\text {L}^\uparrow = \text {L}^\uparrow _+ \cup \text {L}^\uparrow _-\) acts by isometries on Milne-like spacetimes.
What about \(\text {L}^\downarrow = \text {L}^\downarrow _+ \cup \text {L}^\downarrow _-\)? If \(\text {L}^\downarrow\) also acts by isometries, then there would be a universe isometric to ours with the isometry given by the \(\text {PT}\) transformation \((t,x,y,z) \mapsto (-t,-x,-y,-z)\). Given the CPT theorem [35], perhaps the universe’s missing antimatter is contained in the PT symmetric universe. In Sect. 4.5 we show how one can interpret the PT symmetric universe as an antimatter universe via a Lorentz invariant Dirac equation.
We remark this idea is closely related to the same idea in [6]. There the authors consider a \(k = 0\) FLRW spacetime with metric \(g = -d\tau ^2 + a^2(\tau )\big [dx^2 + dy^2 + dz^2\big ]\) in a radiation dominated era \(a(\tau ) \propto \sqrt{\tau }\). By moving to conformal time \(\tilde{\tau }\) given by \(d\tilde{\tau } = d\tau /a(\tau )\), one arrives at the metric \(g = a^2(\tilde{\tau })\big [-d\tilde{\tau }^2 + dx^2 + dy^2 + dz^2 \big ]\) where \(a(\tilde{\tau }) \propto \tilde{\tau }\). They then analytically extend the function \(a(\tilde{\tau })\) from \((0, + \infty )\) to \(\mathbb {R}\) and call the \((-\infty , 0)\) part the ‘CPT-symmetric’ universe. However, at \(\tilde{\tau } = 0\), the metric is \(g = 0\). Hence it’s degenerate. Therefore this is not a spacetime extension (Fig. 6).
1.2 Open Problems
- (1)
Is \(\tau = 0\) a coordinate singularity for \(k =1\) and \(k= 0\) FLRW spacetimes? From [13] it is known that no extension can exist with spherical symmetry.
Open problems (2)–(4) refer to the past boundary \(\partial ^-M\) of a spacetime (Definition 2.4).
- (2)
Is \(\tau = 0\) a coordinate singularity for Milne-like spacetimes with compact \(\tau\)-slices? The null expansion \(\theta\) of the future lightcone in Minkowski space diverges as one approaches the origin \(\mathscr {O}\) along the cone. This suggests that, in the compact case, the past boundary \(\partial ^-M\) cannot be compact (if nonempty).
- (3)
To understand what can lie beyond \(\tau = 0\), it is desired to understand the maximal analytic extension whenever \(\Omega\) is analytic on \(M \cup \partial ^-M\). Minkowski space is the maximal analytic extension of the Milne universe. De Sitter space is the maximal analytic extension of the Milne-like spacetime with scale factor \(a(\tau ) = \sinh (\tau )\). Therefore we suggest
Conjecture: Let (M, g) be a Milne-like spacetime with an analytic \(\Omega\) on \(M \cup \partial ^-M\). If (M, g) is asymptotically flat (i.e. admits a null scri structure), then the maximal analytic extension contains a noncompact Cauchy surface. If (M, g) is asymptotically de Sitter (i.e. admits a spacelike scri structure), then the maximal analytic extension contains a compact Cauchy surface. A discussion of analytic spacetime extensions and when they are unique can be found in Sect. 4.6 of [10].
- (4)
Eventually one would want to study models that are not perfectly homogeneous and isotropic. This poses a natural question: what is the correct initial value problem for the lightcone \(\partial ^-M\) which adequately describes our universe? For example, the null geodesics on \(\partial ^-M\) emanating from \(\mathscr {O}\) never focus and so there are no null conjugate points on \(\partial ^-M\). How does this constrain the initial data? Some other ideas/questions for initial conditions on \(\partial ^-M\) are listed below.
Perhaps a constant scalar curvature on \(\partial ^-M\) should be assumed. This could offer a geometric origin for how the cosmological constant \(\Lambda\) appears as an initial condition.
Perhaps one should assume the Weyl curvature tensor vanishes on \(\partial ^-M\) which would be in accordance with Penrose’s Weyl curvature hypothesis.
What initial conditions would adequately describe Lorentz invariance?
What initial conditions force a rigidity result like the one in Sect. 3.5?
2 Definition of Coordinate and Curvature Singularities
2.1 Spacetimes
Let \(k \ge 0\) be an integer. A \(C^k\)manifold of dimension \(n+1\) is a topological space M endowed with a maximal \(C^k\)-atlas \(\mathcal {A}\) of dimension \(n+1\). A coordinate system is an element \(\phi \in \mathcal {A}\). Specifically, a coordinate system is a \(C^{k}\)-diffeomorphism \(\phi :U \rightarrow \phi (U) \subset \mathbb {R}^{n+1}\) where U is an open subset of M. The coordinate system \(\phi\) introduces coordinates which are \(C^{k}\) maps \(x^\mu :U \rightarrow \mathbb {R}\) via \(x^\mu = \pi ^\mu \circ \phi\) where \(\pi ^\mu :\mathbb {R}^{n+1} \rightarrow \mathbb {R}\) is the canonical projection and \(\mu\) runs over the indices \(0, 1, \dotsc , n\). The coordinate systems allow us to define \(C^{k}\) curves over M. For \(k \ge 1\) we use \(C^1\) curves to generate tangent vectors at a point \(p \in M\). This construction yields the tangent space \(T_pM\) and the corresponding \(C^{k-1}\) tangent bundle TM.
Let \(k \ge 0\). A \(C^{k}\)metric on a \(C^{k+1}\) manifold M is a nondegenerate symmetric tensor \(g:TM \times TM \rightarrow \mathbb {R}\) with constant signature whose components \(g_{\mu \nu }\) in any coordinate system \(\phi \in \mathcal {A}\) are \(C^{k}\) functions. Symmetric means \(g(X,Y) = g(Y,X)\) for all \(X,Y \in TM\). Nondegenerate means \(g(X,Y) = 0\) for all \(Y \in TM\) implies \(X = 0\). With constant signature means there is an integer r such that at each point \(p \in M\), there is a basis \(e_0, \dotsc ,e_r,\dotsc , e_n \in T_pM\) such that \(g(e_\mu , e_\mu ) = -1\) for \(0 \le \mu \le r\) and \(g(e_\mu , e_\mu ) = 1\) for \(r + 1 \le \mu \le n\). If \(g(e_0, e_0) = -1\) and \(g(e_i, e_i) = 1\) for all \(i = 1, \dotsc , n\), then g is called a Lorentzian metric and (M, g) is called a \(C^k\) Lorentzian manifold. If \(g(e_\mu , e_\mu ) = 1\) for all \(\mu = 0, 1, \dotsc , n\), then g is called a Riemannian metric and (M, g) is called a \(C^k\) Riemannian manifold. We will only work with Lorentzian manifolds in this paper. Our convention will be that greek indices \(\mu\) and \(\nu\) will run through indices \(0, 1, \dotsc , n\) and latin indices i and j will run through \(1, \dotsc , n\).
Fix \(k \ge 0\). Let (M, g) be a \(C^k\) Lorentzian manifold. A nonzero vector \(X \in T_pM\) is timelike, null, or spacelike if \(g(X,X) < 0, \, =0, \,\, > 0\), respectively. A nonzero vector is causal if it is either timelike or null. A Lorentzian manifold (M, g) is time-oriented provided there is a \(C^0\) timelike vector field \(X \in TM\). A causal vector \(Y \in T_pM\) is future-directed if \(g(X,Y) < 0\) and past-directed if \(g(X,Y) > 0\). Note that \(-X\) defines an opposite time-orientation, and so a future-directed causal vector Y with respect to X is a past-directed causal vector with respect to \(-X\).
Definition 2.1
(Spacetime). Let \(k \ge 0\). A \(C^k\)spacetime is a pair (M, g) where M is a connected, Hausdorff, and second-countable \(C^{k+1}\) manifold and g is a \(C^k\) Lorentzian metric such that (M, g) is time-oriented.
Remarks
-
The Einstein tensor \(R_{\mu \nu } - \frac{1}{2}R g_{\mu \nu }\) requires two derivatives of the metric. Therefore the relevant differentiability class for general relativity is \(C^2\).
-
We assume M is connected because we can not make any observations of any other connected components.
-
The Hausdorff condition guarantees uniqueness of limits.
-
The second-countable property allows us to construct partitions of unity whenever needed (e.g. to construct a complete Riemannian metric on M).
Fix \(k \ge 0\). Let (M, g) be a \(C^k\) spacetime. A timelike curve\(\gamma\) is a piecewise \(C^1\) map \(\gamma :[a,b] \rightarrow M\) such that \(\gamma '(t)\) is future-directed timelike at all its differentiable points, and in the case \(t_0 \in [a,b]\) is a break point, we have \(\lim _{t \nearrow t_0} \gamma '(t)\) and \(\lim _{t \searrow t_0} \gamma '(t)\) are both future-directed timelike. If \(t_0 = a\) or \(t_0 = b\), then we only require this for the one-sided limit. Note this means that \(\gamma |_{[b-\varepsilon ,\, b)}\) can be extended to a \(C^1\) timelike curve for \(\varepsilon > 0\) small enough. Letting timelike curves be piecewise \(C^1\) allows us to concatenate two timelike curves to form another timelike curve. A unit timelike curve is a timelike curve \(\gamma\) such that \(g(\gamma ', \gamma ') = -1\) at all its differentiable points. Note that ‘future-directed’ is implicit in our definition of a timelike curve. Given a timelike curve \(\gamma :[a,b] \rightarrow M\), we will often write \(\gamma \subset U\) instead of \(\gamma \big ([a,b]\big ) \subset U\). Likewise with \(\gamma \cap U\) (Fig. 7).
Given a point \(p \in M\) and an open set \(U \subset M\), the timelike future of p within U, denoted by \(I^+(p, U)\), is the set of all points \(q \in M\) such that there is a timelike curve \(\gamma :[a,b] \rightarrow U\) from p to q. The timelike past of p within U, denoted by \(I^-(p, U)\), is defined with the opposite time-orientation. If we wish to emphasize the Lorentzian metric g being used, then we will write \(I^+_g(p,U)\). From Proposition 2.6 in [30], we know that these are open sets.
Proposition 2.2
\(I^+(p, U)\)and\(I^-(p,U)\)are open sets.
We reproduce the proof of this proposition in Appendix 2.
2.2 Spacetime Extensions
Coordinate singularities coincide with spacetime extensions. For example, the \(r = 2m\) coordinate singularity in Schwarzschild extends the \(r > 2m\) region to the maximal analytic extension which contains the \(r < 2m\) region. Therefore an understanding of spacetime extensions is needed to understand coordinate singularities.
Fix \(k \ge 0\). Let (M, g) be a \(C^{k}\) spacetime. Let \(0 \le l \le k\). A \(C^{l}\) spacetime \((M_{\mathrm{ext}}, \,g_{\mathrm{ext}})\) with the same dimension as (M, g) is a \(C^{l}\)-extension of (M, g) if there is a proper isometric \(C^{l+1}\)-embedding
We identify M with its image under the embedding. The topological boundary of M within \(M_{\mathrm{ext}}\) is denoted by \(\partial (M, M_{\mathrm{ext}}) = \overline{M} \setminus M\). If \((M_{\mathrm{ext}}, \, g_{\mathrm{ext}})\) is a \(C^l\)-extension for all \(l \ge 0\), then we say \((M_{\mathrm{ext}},\, g_{\mathrm{ext}})\) is a smooth or \(C^\infty\)-extension.
For the rest of this section, we will fix a \(C^l\)-extension \((M_{\mathrm{ext}}, \,g_{\mathrm{ext}})\) of a \(C^k\) spacetime (M, g).
Proposition 2.3
\(\partial (M, M_{\mathrm{ext}}) \,\ne \, \emptyset\).
Proof
If this were not true, then \(\overline{M} = M\), and so \(M_{\mathrm{ext}}\) would be the disjoint union of the nonempty open sets M and \(M_{\mathrm{ext}}\setminus M\). However, this implies \(M_{\mathrm{ext}}\) is not connected which contradicts the definition of a spacetime. \(\square\)
Definition 2.4
Let (M, g) be a \(C^k\) spacetime and \((M_{\mathrm{ext}}, \, g_{\mathrm{ext}})\) a \(C^l\)-extension. A timelike curve \(\gamma :[a,b] \rightarrow M_{\mathrm{ext}}\) is called future terminating for a point \(p \in \partial (M, M_{\mathrm{ext}})\) provided \(\gamma (b) = p\) and \(\gamma \big ([a,b)\big ) \subset M\). It is called past terminating if \(\gamma (a) = p\) and \(\gamma \big ((a,b]\big ) \subset M\). The future and past boundaries of M with respect to \(M_{\mathrm{ext}}\) are
Remark
If \((M_{\mathrm{ext}},\, g_{\mathrm{ext}})\) is clear from context, then we will simply write \(\partial ^+M\) for \(\partial ^+(M,M_{\mathrm{ext}})\). Likewise for \(\partial ^-M\) and \(\partial M\) (Fig. 8).
Lemma 2.5
Let\(\gamma :[a,b] \rightarrow M_{\mathrm{ext}}\)be a timelike curve fromptoq.
- (1)
If\(p \in M\)and\(q \notin M\), then\(\gamma\)intersects\(\partial ^+M\).
- (2)
If\(p \notin M\)and\(q \in M\), then\(\gamma\)intersects\(\partial ^-M\).
Proof
Consider case (1). Define \(t_* = \sup \{ t \in [a,b] \mid \gamma \big ([a,t)\big ) \subset M\}\). Since M is open we have \(t_* > a\). Since \(q \notin M\), we have \(\gamma (t_*) \notin M\). On the other hand \(\gamma (t_*)\) is an accumulation point of M. Hence \(\gamma (t_*) \in \partial M\). The restriction \(\gamma |_{[a, \, t_*]}\) is a future terminating timelike curve for \(\gamma (t_*)\). Hence \(\gamma (t_*) \in \partial ^+M\). Case (2) follows by reversing the time orientation. \(\square\)
Proposition 2.6
\(\partial ^+ M \cup \partial ^-M \, \ne \, \emptyset\).
Proof
Fix \(p \in \partial M\). Let \(U \subset M_{\mathrm{ext}}\) be an open set around p. Fix \(q \in I^-(p, U)\). Let \(\gamma \subset U\) be a timelike curve from q to p. We either have \(q \in M\) or \(q \notin M\). First assume \(q \in M\). Then Lemma 2.5 implies \(\gamma \cap \partial ^+M \ne \emptyset\). Now assume \(q \notin M\). Since \(p \in \partial M\), the open set \(I^+(q, U)\) of p contains a point \(r \in M\). Hence there is a timelike curve \(\lambda \subset U\) from q to r. Then Lemma 2.5 shows \(\lambda \cap \partial ^-M \ne \emptyset\). \(\square\)
Proposition 2.6 can be used to show a spacetime (M, g) is \(C^l\)-inextendible (i.e. it admits no \(C^l\)-extension). Suppose one assumes (M, g) has a \(C^l\)-extension and then proves \(\partial ^+M = \emptyset\) and \(\partial ^-M = \emptyset\), then Proposition 2.6 yields a contradiction. This is how Sbierski proves the \(C^0\)-inextendibility of Minkowski space and the Schwarzschild spacetime [30, 31].
2.3 Definition of Coordinate Singularities
In this section we give a precise definition of what we mean by a ‘coordinate singularity.’ Our goal is to identify when we have made a ‘poor’ choice of coordinates. Before giving the definition, we start with a couple of motivating examples.
Motivating Examples
- (1)
Consider the smooth spacetime (M, g) where \(M = (0,\infty ) \times \mathbb {R}\) with the metric \(g = -\tau ^2 d\tau ^2 + dx^2\). Since the metric becomes degenerate at \(\tau = 0\), we cannot extend (M, g) using the coordinates \((\tau , x)\). However, if we introduce the coordinate \(t = \frac{1}{2}\tau ^2\), then, with respect to these coordinates, the spacetime manifold becomes \((0, \infty ) \times \mathbb {R}\) with metric \(-dt^2 + dx^2\). Since the metric is nondegenerate at \(t = 0\), we have no problem extending the spacetime using the coordinates (t, x). As such, we say \((\tau , x)\) were a ‘poor choice’ of coordinates, and \(\tau = 0\) merely represents a coordinate singularity. This example demonstrates that a coordinate singularity depends on a spacetime being inextendible within one coordinate system while being extendible in another.
- (2)
Consider the smooth spacetime (M, g) where \(M = (0,\infty ) \times \mathbb {R}\) and \(g = -f(t)dt^2 + dx^2\) where \(f :(0,\infty ) \rightarrow \mathbb {R}\) is the smooth function given by \(f(t) = 1 + \sqrt{t}\). In this case (M, g) extends through \(t = 0\) via the spacetime \(M_{\mathrm{ext}}= \mathbb {R}\times \mathbb {R}\) and \(g_{\mathrm{ext}}= -dt^2 + dx^2\) for \(t \le 0\) and \(g_{\mathrm{ext}}= g\) for \(t > 0\). However \((M_{\mathrm{ext}}, g_{\mathrm{ext}})\) is not a smooth extension of (M, g). It is only a \(C^0\)-extension. In this case we would not say that (t, x) are a ‘poor’ choice of coordinates for (M, g), since the coordinates (t, x) can still be used to extend the spacetime just not smoothly.
Fix \(k \ge 0\). Let (M, g) be a \(C^k\) spacetime with dimension \(n+1\). Recall a coordinate system is an element of the maximal \(C^{k+1}\)-atlas for M. Specifically, a coordinate system is a \(C^{k+1}\)-diffeomorphism \(\phi :U \rightarrow \phi (U) \subset \mathbb {R}^{n+1}\) where U is an open subset of M.
Let \(\phi :U \rightarrow \mathbb {R}^{n+1}\) be a coordinate system for a \(C^k\) spacetime (M, g). Let \(\Omega = \phi (U) \subset \mathbb {R}^{n+1}\). Then (U, g) is \(C^{k}\)-isometric to \((\Omega ,\, \phi _* g)\) where \(\phi _*\) is the push forward. Let \(0 \le l \le k\). Suppose there exists an open set \(\Omega ' \subset \mathbb {R}^{n+1}\) which properly contains \(\Omega\) and a \(C^l\)-Lorentzian metric \(g'\) on \(\Omega '\) such that \((\Omega ',\, g')\) is a \(C^l\)-extension of \((\Omega ,\, \phi _*g)\). Then we say \((\Omega ',\, g')\) is a \(C^l\)-coordinate extension of \((\Omega ,\, \phi _* g)\). If such an \((\Omega ',\, g')\) exists, then we say \(\phi\) is not\(C^l\)-maximal. If no such \((\Omega ',\, g')\) exists, then we say \(\phi\) is \(C^l\)-maximal. For example, the coordinates \((\tau , x)\) in the first example above are \(C^0\)-maximal. The coordinates (t, x) in the second example are \(C^1\)-maximal but not \(C^0\)-maximal.
Remark
Intuitively, a coordinate system \(\phi\) is \(C^0\)-maximal if the metric components \(g_{\mu \nu }\) with respect to \(\phi\) are maximally extended. If they were extended any further, then there would be a degeneracy in the metric (e.g. Schwarzschild).
Definition 2.7
(Coordinate singularity). Fix \(k \ge 0\) and \(0 \le l \le k\). Let (M, g) be a \(C^k\) spacetime and let \(\phi :U \rightarrow \mathbb {R}^{n+1}\) be a coordinate system which is \(C^0\)-maximal. We say \(\phi\) admits a \(C^l\)-coordinate singularity for (M, g) if there is a \(C^l\)-extension \((M_{\mathrm{ext}}, \, g_{\mathrm{ext}})\) and a coordinate system \(\psi :V \rightarrow \mathbb {R}^{n+1}\) for \(M_{\mathrm{ext}}\) such that
Remark
In the definition \(\phi\) represents the ‘poor’ choice of coordinates. \(\psi\) represents the ‘better’ choice of coordinates.
For example consider (M, g) from example (1) above. The coordinate system \(\phi = (\tau , x)\) is a \(C^\infty\)-coordinate singularity for (M, g). This follows because
\(\phi\) is \(C^0\)-maximal
\((M_{\mathrm{ext}}, \, g_{\mathrm{ext}})\) is a \(C^\infty\)-extension of (M, g) where \(M_{\mathrm{ext}}= \mathbb {R}^2\) and \(g_{\mathrm{ext}}= -dt^2 + dx^2\).
In this example we simply take \(U = M\) and \(V = M_{\mathrm{ext}}\) and \(\psi = (t,x)\).
2.4 Definition of Curvature Singularities
In this section we give a precise definition of what we mean by a ‘curvature singularity.’ Before doing so, let’s fix notation by recalling the definition of curvature.
Fix \(k \ge 2\). Let (M, g) be a \(C^k\) spacetime and \(\nabla\) its unique compatible affine connection. Then the Riemann curvature tensor is the (3, 1) tensor defined by
where [X, Y] is the Lie derivative of Y with respect to X. Let \(\{\partial _\mu \}\) be a coordinate vector basis with dual one-form basis \(\{dx^\mu \}\). The components of the Riemann curvature tensor with respect to \(\{\partial _\mu \}\) are defined by \(R_{\mu \nu \alpha }^{\,\,\,\,\,\,\,\, \beta } = dx^\beta \big (R(\partial _\mu , \partial _\nu )\partial _\alpha \big )\). Using index notation \(\nabla _X Y = (X^\mu \nabla _\mu Y^\nu ) \partial _\nu\) and the linear and Leibniz properties of the affine connection, we have
Here we see the non-commutativity of the second covariant derivatives of Z expressed in terms of the components of the curvature tensor.
Since gravitation is mathematically described by curvature (e.g. the Einstein equations and tidal acceleration), we can use divergences in the curvature tensor as a way to identify the breakdown of general relativity. Since we want to measure divergences in a coordinate-independent way, we construct curvature invariants out of scalar quantities.
Definition 2.8
Fix \(k \ge 2\). Let (M, g) be a \(C^k\) spacetime. A curvature invariant on (M, g) is a scalar function which is a polynomial in the components of the metric \(g_{\mu \nu }\), its inverse \(g^{\mu \nu }\), and the curvature tensor \(R_{\mu \nu \alpha }^{\,\,\,\,\,\,\,\, \beta }\).
Examples of Curvature Invariants
- (1)
The scalar curvature \(R = g^{\mu \nu }R_{\mu \nu } = g^{\mu \nu }R_{\mu \alpha \nu }^{\,\,\,\,\,\,\,\, \alpha }\).
- (2)
The Kretschmann scalar \(R_{\mu \nu \alpha \beta }R^{\mu \nu \alpha \beta }\).
- (3)
\(R_{\mu \nu }R^{\mu \nu }\).
Fix \(k \ge 0\). Let (M, g) be a \(C^k\) spacetime. A future inextendible timelike curve is a curve \(\gamma :[a, b) \rightarrow M\) such that for any \(a< c < b\), the restriction \(\gamma |_{[a,c]}\) is a timelike curve, and for all \(p \in M\) the extended function \(\gamma _p :[a,b] \rightarrow M\) is not continuous where \(\gamma _p\) is given by \(\gamma _p(t) = \gamma (t)\) for all \(a \le t < b\) and \(\gamma _p(b) = p\). Past inextendible timelike curves are defined time-dually.
Definition 2.9
(Curvature singularity). Fix \(k \ge 2\). Let (M, g) be a \(C^k\) spacetime. We say (M, g) admits a future curvature singularity if there is a future inextendible timelike curve \(\gamma :[a,b) \rightarrow M\) and a curvature invariant C such that \(C \circ \gamma (t)\) diverges as \(t \rightarrow b\). Time-dualizing the definition gives past curvature singularities.
2.5 A Classical Example: The Schwarzschild Spacetime
In this section we apply our definitions of coordinate and curvature singularities to the Schwarzschild spacetime. We will show how the \(r = 2m\) event horizon in Schwarzschild is just a coordinate singularity and how \(r = 0\) is a curvature singularity.
Definition 2.10
Let \(m > 0\). Define two manifolds
and the metric
where \((S^2, d\Omega ^2)\) is the usual round two-sphere. Then \((M_{\mathrm{safe}}, \, g)\) and \((M_{\mathrm{unsafe}}, \, g)\) are the safe and unsafe Schwarzschild spacetimes.
Let \(0< \theta < \pi\) and \(0< \phi < 2\pi\) be the standard coordinates on \(S^2\). Let \(U\subset M_{\mathrm{safe}}\) be the open set \(U = \mathbb {R}\times (2m, \infty ) \times (0,\pi ) \times (0,2\pi )\). Then the metric in the coordinate system \(\xi = (t,r,\theta ,\phi ):U \rightarrow \mathbb {R}^4\) is
Note that the coordinate system \(\xi :U \rightarrow \mathbb {R}^4\) is \(C^0\)-maximal.
Now we introduce other coordinates \((v,r,\theta , \phi )\) given by \(v = t +r^*(r)\) where \(r^*(r) = r + 2m\log (r/2m - 1)\). The metric in these coordinates is
There is no degeneracy at \(r = 2m\) with respect to these coordinates. Therefore we can define a \(C^\infty\)-extension \((M_{\mathrm{ext}},\, g_{\mathrm{ext}})\) where \(M_{\mathrm{ext}}= \mathbb {R}^2 \times S^2\) and
Let \(V = \mathbb {R}\times (0,\infty ) \times (0,\pi ) \times (0,2\pi )\). Then \(\zeta = (v,r,\theta ,\phi ) :V \rightarrow \mathbb {R}^4\) is a coordinate system for \((M_{\mathrm{ext}},\, g_{\mathrm{ext}})\) such that \(V \cap M_{\mathrm{safe}}= U\) and \(V \cap (M_{\mathrm{ext}}\setminus M_{\mathrm{safe}}) \ne \emptyset\). Thus
Proposition 2.11
\(\xi = (t, r, \theta , \phi )\)admits a\(C^\infty\)-coordinate singularity for\((M_{\mathrm{safe}},\,g)\).
It’s not hard to see that \((M_{\mathrm{unsafe}}, \,g)\) is \(C^\infty\)-isometric to the region \(r < 2m\) of \((M_{\mathrm{ext}},\, g_{\mathrm{ext}})\). We end this section by demonstrating that \((M_{\mathrm{unsafe}}, \, g)\) admits a future curvature singularity.
Proposition 2.12
\((M_{\mathrm{unsafe}},\, g)\)admits a future curvature singularity.
Proof
Consider the future inextendible timelike curve \(\gamma :[m, 0) \rightarrow M_{\mathrm{unsafe}}\) given by \(\gamma (r) = (t_0, r, \theta _0, \phi _0)\). Recall that \(\partial /\partial r\) is timelike on \(M_{\mathrm{unsafe}}\). Let \(C = R_{\mu \nu \alpha \beta }R^{\mu \nu \alpha \beta }\) denote the Kretschmann scalar. Then a calculation shows \(C = 48 m^2/r^6\). Therefore \(C \circ \gamma (r) \rightarrow \infty\) as \(r \rightarrow 0\). \(\square\)
3 The Coordinate Singularity for Milne-Like Spacetimes
Let \(I \subset \mathbb {R}\) be an open interval. Let \((\Sigma ,h)\) be a three-dimensional complete Riemannian manifold with constant sectional curvature. We say (M, g) is an FLRW spacetime if \(M = I \times \Sigma\) and \(g = -d\tau ^2 + a^2(\tau )h\) where \(a :I \rightarrow (0,\infty )\) is a continuous function called the scale factor. The integral curves of \(\partial / \partial \tau\) are called the comoving observers. Physically, they model the trajectories of galaxies.
Remark
We don’t assume any differentiability assumption on the scale factor. Therefore the lowest regularity class for FLRW spacetimes is \(C^0\).
Let \((\mathbb {R}^3, h)\) be hyperbolic space with sectional curvature \(k = -1\). Let (M, g) be the corresponding FLRW spacetime. We use the standard coordinates \(\xi = (\tau , R, \theta , \phi )\) for M where \(\xi :U \rightarrow \mathbb {R}^4\) and \(U = I \times (0,\infty ) \times (0,\pi ) \times (0, 2\pi )\). With respect to the coordinate system \(\xi = (\tau , R, \theta , \phi )\), the metric is
We will first demonstrate how \(\xi = (\tau , R, \theta , \phi )\) admits a \(C^\infty\)-coordinate singularity for (M, g) in two familiar cases: (1) when (M, g) is the Milne universe and (2) when (M, g) is the open-slicing coordinate system for de Sitter space. Then we will show how \(\xi = (\tau , R, \theta , \phi )\) admits a \(C^0\)-coordinate singularity for a class of inflationary spacetimes which we have dubbed ‘Milne-like.’ The justification for calling these ‘inflationary’ comes in Sect. 4.1 where we show that these spacetimes solve the horizon problem in cosmology.
In Sect. 3.4 we show that Milne-like spacetimes do not admit curvature singularities provided the scale factor satisfies \(a''(\tau ) = \alpha \tau + C \tau ^3 + o(\tau ^3)\) with \(\alpha , C \in \mathbb {R}\). In Sect. 3.5 we show a rigidity result: if a Milne-like spacetime satisfies both the weak and strong energy conditions, then it must be the Milne universe.
3.1 The Milne Universe
Let \((\mathbb {R}^3, h)\) be hyperbolic space with sectional curvature \(k = -1\). The Milne universe is the corresponding FLRW spacetime (M, g) given by \(M = (0, \infty ) \times \mathbb {R}^3\) and with scale factor \(a(\tau ) = \tau\). With respect to the coordinate system \(\xi = (\tau , R, \theta , \phi )\), the metric is
We introduce a new coordinate system \(\zeta = (t,r,\theta , \phi )\) where \(\theta\) and \(\phi\) are unchanged, but t and r are given by
Then we have \(-dt^2 + dr^2 = -d\tau ^2 + \tau ^2 dR^2\), so that the metric in the coordinate system \(\zeta = (t,r, \theta , \phi )\) is
which is just the usual Minkowski metric. The coordinate system \(\xi = (\tau , R, \theta , \phi )\) is \(C^0\)-maximal, but we can find a \(C^\infty\)-extension via \(\zeta = (t,r, \theta , \phi )\). Therefore
Proposition 3.1
\(\xi = (\tau , R, \theta , \phi )\)admits a\(C^\infty\)-coordinate singularity for (M, g).
The constant \(\tau\) slices are hyperboloids sitting inside the future lightcone of the origin. We take the extension to be \((M_{\mathrm{ext}},\, g_{\mathrm{ext}}) =\) Minkowski space. As \(\tau \rightarrow 0\), these slices approach the lightcone at the origin \(\mathscr {O}\) in Minkowski space where the extended metric \(g_{\mathrm{ext}}\) is nondegenerate (Fig. 9).
3.2 De Sitter Space
The open slicing coordinate system for de Sitter space is a \(k = -1\) FLRW spacetime \(M = (0, \infty ) \times \mathbb {R}^3\) with scale factor \(a(\tau ) = \sinh (\tau )\). With respect to the coordinate system \(\xi = (\tau , R, \theta , \phi )\), the metric is
We introduce a new coordinate system \(\zeta = (t,r, \theta , \phi )\) where \(\theta\) and \(\phi\) are unchanged, but t and r are given by
where \(b(\tau ) = \tanh (\tau /2) = \sinh \tau / (1 + \cosh \tau )\). Then \(b'(\tau ) = b(\tau )/a(\tau )\), and so we have the following relationship between (t, r) and \((\tau , R)\).
Therefore the metric is
which is conformal to the Minkowski metric. Using \(b(\tau ) = \tanh (\tau /2)\) and \(b^2(\tau ) = t^2 - r^2\), we have \(\tau = 2 \tanh ^{-1}(\sqrt{t^2 - r^2})\). Therefore \(1/b'(\tau ) = a(\tau )/b(\tau ) = 2/(1 - t^2 + r^2)\). Thus the metric in the coordinate system \(\zeta = (t, r, \theta , \phi )\) is
The coordinate system \(\xi = (\tau , R, \theta , \phi )\) is \(C^0\)-maximal, but we can define a \(C^\infty\)-extension via \(\zeta = (t,r,\theta , \phi )\). Thus
Proposition 3.2
\(\xi = (\tau , R, \theta , \phi )\)admits a\(C^\infty\)-coordinate singularity for (M, g).
The constant \(\tau\) slices are hyperboloids sitting inside the future lightcone at the origin. We take the extension to be \((M_{\mathrm{ext}},\, g_{\mathrm{ext}}) =\) a smooth spacetime conformal to Minkowski space. As \(\tau \rightarrow 0\), these slices approach the lightcone where the extended metric \(g_{\mathrm{ext}}\) is nondegenerate (Figs. 10 and 11).
3.3 Milne-Like Spacetimes
Now we wish to show that \(\xi = (\tau , R, \theta , \phi )\) is a coordinate singularity for scale factors that can model the dynamics of our universe. That is, we wish to show \(\tau = 0\) is a coordinate singularity for suitably chosen scale factors \(a(\tau )\) which
begin inflationary \(a(\tau ) \sim \sinh (\tau )\)
then transitions to a radiation dominated era \(a(\tau ) \sim \sqrt{\tau }\)
then transitions to a matter dominated era \(a(\tau ) \sim \tau ^{2/3}\)
and ends in a dark energy dominated era \(a(\tau ) \sim e^{\sqrt{\frac{\Lambda }{3}} \tau }\)
If we assume for small \(\tau\), the scale factor satisfies \(a(\tau ) \sim \tau\), then, by curve fitting, we can use \(a(\tau )\) to represent each of the above eras, thus modeling the dynamics of our universe. To make this precise, we assume for small \(\tau\), the scale factor satisfies \(a(\tau ) = \tau + o(\tau ^{1+\varepsilon })\) for some \(\varepsilon > 0\) (i.e. \(\big [a(\tau ) -\tau \big ]/\tau ^{1+\varepsilon } \rightarrow 0\) as \(\tau \rightarrow 0\)). In particular any convergent Taylor expansion \(a(\tau ) = \sum _{n = 1}^\infty c_n\tau ^n\) (with \(c_1 = 1\)) will satisfy this condition.
Definition 3.3
-
(1)
Let (M, g) be an FLRW spacetime. We say (M, g) is inflationary if the scale factor for small \(\tau\) satisfies \(a(\tau ) = \tau + o(\tau ^{1+\varepsilon })\) for some \(\varepsilon > 0\).
-
(2)
We say (M, g) is Milne-like if it is an inflationary FLRW spacetime such that \((\Sigma , h) = (\mathbb {R}^3, h)\) where h is the hyperbolic metric with sectional curvature \(k = -1\). We assume the coordinate system \(\xi = (\tau , R, \theta , \phi )\) is \(C^0\)-maximal.
Remarks
-
The motivation for the word ‘inflationary’ comes in Sect. 4.1 where we show that the particle horizon is infinite for scale factors which obey \(a(\tau ) = \tau + o(\tau ^{1 + \varepsilon })\).
-
A \(C^k\) Milne-like spacetime is one such that the spacetime is \(C^k\) (i.e. the scale factor \(a(\tau )\) is a \(C^k\) function).
-
For inflationary spacetimes we have \(a(0) := \lim _{\tau \rightarrow 0}\, a(\tau ) = 0\).
-
The coordinate system \(\xi = (\tau , R, \theta , \phi )\) is defined for all \(\tau \in I = (0, \tau _{\mathrm{max}})\) where \(\tau _{\mathrm{max}} \in (0, +\infty ]\). For our universe, we expect \(\tau _{\mathrm{max}} = +\infty\) due to dark energy.
The next theorem improves and refines Theorem 3.4 in [13].
Theorem 3.4
\(\xi = (\tau , R, \theta , \phi )\)admits a\(C^0\)-coordinate singularity for Milne-like spacetimes.
Proof
Let (M, g) be a Milne-like spacetime. With respect to the coordinate system \(\xi = (\tau , R, \theta , \phi )\), the metric is
Fix any \(\tau _0 \in I\). The specific choice does not matter; any \(\tau _0\) will do. Define a new coordinate system \(\zeta = (t,r,\theta , \phi )\) by
where \(b :I \rightarrow (0,\infty )\) is given by
Note that \(b(\tau )\) is an increasing \(C^1\) function and hence it’s invertible. Therefore \(\tau\) as a function of t and r is
Note that t and r are defined for all points such that \(t^2 - r^2 < b^2(\tau _{\mathrm{max}})\). With respect to the coordinate system \(\zeta = (t, r, \theta , \phi )\), the metric takes the form
where
Now we prove \(\xi = (\tau , R, \theta , \phi )\) admits a \(C^0\)-coordinate singularity for (M, g). For this it suffices to show \(\Omega (0) := \lim _{\tau \rightarrow 0}\Omega (\tau )\) exists and is a finite positive number. Indeed this will imply the Lorentzian metric given by Eq. (3.14) extends continuously through \(\tau = 0\) which corresponds to the lightcone \(t = r\), i.e. this will imply that (M, g) is \(C^0\)-extendible via \(\zeta = (t,r,\theta ,\phi )\).
To show \(0< \Omega (0) < \infty\), put \(b'(0) := \lim _{\tau \rightarrow 0}b'(\tau ) = \lim _{\tau \rightarrow 0} b(\tau )/a(\tau )\). By our definition of an inflationary spacetime, there is an \(\varepsilon _0 > 0\) such that \(a(\tau ) = \tau + o (\tau ^{1 + \varepsilon _0})\). Therefore \(\lim _{\tau \rightarrow 0}f(\tau )/\tau ^{1 + \varepsilon _0} = 0\) where \(f(\tau )\) is given by \(a(\tau ) = \tau + f(\tau )\). Therefore for any \(\varepsilon > 0\), there exists a \(\delta > 0\) such that for all \(0< \tau < \delta\), we have \(|f(\tau )| < \varepsilon \tau ^{1 + \varepsilon _0}\). Choosing \(\varepsilon = 1\), we have \(\tau - \tau ^{1 + \varepsilon _0}< \tau + f(\tau ) < \tau + \tau ^{1 + \varepsilon _0}\). Thus \(b(\tau )/a(\tau )\) is squeezed between
Evaluating the integrals, we find
Since this holds for all \(0< \tau < \delta\), we have \(\Omega (0) = 1/b'(0) = \tau _0\). \(\square\)
Remark
The past boundary \(\partial ^-M\) acts as a Cauchy horizon for the Milne-like spacetime. As such the white region beyond \(\partial ^-M\) cannot be uniquely determined.
3.4 Milne-Like Spacetimes Without Curvature Singularities
The scalar curvature for a \(k = -1\) FLRW spacetime is given by
For a Milne-like spacetime, we have \(a'(0) = 1\). If the second derivative of the scale factor satisfies \(a''(\tau ) = \alpha \tau + o(\tau )\), then we have \(a'(\tau ) = 1 + \frac{1}{2}\alpha \tau ^2 + o(\tau ^2)\). Plugging these into \(R(\tau )\), we find \(\lim _{\tau \rightarrow 0}R(\tau ) = 12\alpha\). Thus the scalar curvature does not diverge at the big bang.
In this section we generalize the above result and show that Milne-like spacetimes do not admit any curvature singularities provided the second derivative of the scale factor satisfies
(i.e. \([a''(\tau ) - \alpha \tau + C\tau ^3]/\tau ^3 \rightarrow 0\) as \(\tau \rightarrow 0\)). Here \(\alpha ,C \in \mathbb {R}\) are constants. The limiting condition implies \(\alpha = a'''(0)\). This limiting condition applies to any convergent Taylor expansion \(a(\tau ) = \sum _{n = 1}^\infty c_n \tau ^n\) with \(c_1 = 1\) and \(c_2 = 0\) and \(c_4 = 0\). Indeed this limiting condition applies to \(a(\tau ) = \sinh (\tau )\) (i.e. the open-slicing of de Sitter space). This agrees with the fact that de Sitter space has no curvature singularities.
An example of a Milne-like spacetime where we do have a curvature singularity is given by the scale factor \(a(\tau ) = \tau + \tau ^2\). The scalar curvature diverges as \(\tau \rightarrow 0\). Indeed in this case we have \(a''(\tau ) = 2 \ne \alpha \tau + C\tau ^3 + o(\tau ^3)\).
Lemma 3.5
Fix\(k \ge 2\). Let (M, g) be a\(C^k\)Milne-like spacetime. Suppose the second derivative of the scale factor satisfies\(a''(\tau ) = \alpha \tau + C\tau ^3 + o(\tau ^3)\)where\(\alpha ,C \in \mathbb {R}\). Then for any\(p \in \partial ^-M\), the limits of
as\((t,r,\theta ,\phi ) \rightarrow p\)all exist and are finite.
The proof of Lemma 3.5 is in Appendix 2.
Theorem 3.6
Fix\(k \ge 2\). Let (M, g) be a\(C^k\)Milne-like spacetime. Suppose the second derivative of the scale factor satisfies\(a''(\tau ) = \alpha \tau + C\tau ^3 + o(\tau ^3)\)where\(\alpha ,C \in \mathbb {R}\). Then (M, g) admits no past curvature singularities.
Proof
Let \(\gamma :(0, b] \rightarrow M\) be any past-inextendible timelike curve parameterized by \(\tau\) (we can parameterize by \(\tau\) since it’s a time function). Since \(\gamma\) is past inextendible and timelike, Fig. 12 shows that there exists a point \(p \in \partial ^-M\) such that \(p = \lim _{\tau \searrow 0} \gamma (\tau )\). More rigorously, the point p can be determined by writing out \(\gamma\) in the \(\zeta = (t, r, \theta , \phi )\) coordinate system.
The existence of these limits follows from \(\gamma\) being past-inextendible and timelike. Since any curvature invariant is constructed out of first and second derivatives of the metric coefficients (i.e. the first and second derivatives of \(\Omega\) in this case), Lemma 3.5 implies any curvature invariant has a finite-value quantity at p. Thus there are no past curvature singularities for (M, g). \(\square\)
3.5 A Rigidity Result
In this section we will establish the following rigidity result.
Theorem 3.7
Fix\(k \ge 2\). Suppose (M, g) is a\(C^k\)Milne-like spacetime which satisfies both the weak and strong energy conditions. Then (M, g) is the Milne universe.
Before proving Theorem 3.7, we recall the definitions of the weak and strong energy conditions.
Definition 3.8
Fix \(k \ge 2\) and let (M, g) be a \(C^k\) spacetime. The Einstein tensor is \(G_{\mu \nu } = R_{\mu \nu } - \frac{1}{2}R g_{\mu \nu }\). We say (M, g) satisfies
the weak energy condition if \(G_{\mu \nu }X^\mu X^\nu \ge 0\) for all timelike X.
the strong energy condition if \(R_{\mu \nu }X^\mu X^\nu \ge 0\) for all timelike X.
Let (M, g) be an FLRW spacetime. Following [28], we define the energy density\(\rho\) and pressure functionp in terms of the Einstein tensor. If \(u = \partial / \partial \tau\) and e is any unit spacelike vector orthogonal to u (its choice does not matter by isotropy), then
We make use of the following proposition. See also Eqs. (9.2.19) and (9.2.20) in [40].
Proposition 3.9
Let (M, g) be an FLRW spacetime.
- (a)
The weak energy condition is equivalent to\(\rho \ge 0\)and\(\rho + p \ge 0\).
- (b)
The strong energy condition is equivalent to\(\rho + 3p \ge 0\)and\(\rho + p \ge 0\).
The proof of Proposition 3.9 is in Appendix 2.
Proof of Theorem 3.7
Friedmann’s equations are (see Eqs. (5.2.14) and (5.2.15) in [40]):
The weak energy condition implies \(\rho \ge 0\). Therefore \(a'(\tau ) \ge 1\) for all \(\tau\) by Eq. (3.18). The strong energy condition implies \(\rho + 3p \ge 0\). Therefore \(a''(\tau ) \le 0\) for all \(\tau\) by Eq. (3.19). Hence \(a'\) is decreasing. Since \(a'(0) := \lim _{\tau \rightarrow 0} a'(\tau ) = 1\), we have \(a'(\tau ) \le 1\) for all \(\tau\). Therefore \(a'(\tau ) = 1\) identically, and so \(a(\tau ) = \tau\) for all \(\tau\). Thus (M, g) is the Milne universe. \(\square\)
4 Cosmological Properties of Milne-Like Spacetimes
4.1 The Geometric Solution to the Horizon Problem
Our definition for an inflationary FLRW spacetime was one whose scale factor satisfies \(a(\tau ) = \tau + o(\tau ^{1+\varepsilon })\) for some \(\varepsilon > 0\). Our motivation is that these spacetimes solve the horizon problem, and this is true for \(k = +1\), 0, or \(-1\). However, what’s unique about Milne-like spacetimes is that they extend into a larger spacetime because the big bang is just a coordinate singularity. This offers a new geometrical picture of how Milne-like spacetimes solve the horizon problem as we discuss below.
We briefly recall the horizon problem in cosmology. It is the main motivating reason for inflationary theory [42]. The problem comes from the uniform temperature of the CMB radiation. From any direction in the sky, we observe the CMB temperature as 2.7 K. The uniformity of this temperature is puzzling: if we assume the universe exists in a radiation dominated era all the way down to the big bang (i.e. no inflation), then the points p and q on the surface of last scattering don’t have intersecting past lightcones. So how can the CMB temperature be so uniform if p and q were never in causal contact in the past? (Fig. 13)
By using conformal time \(\tilde{\tau }\) given by \(d\tilde{\tau } = d\tau /a(\tau )\), it is an elementary exercise to show that there is no horizon problem provided the particle horizon at the moment of last scattering is infinite:
This condition widens the past lightcones of p and q so that they intersect before \(\tau = 0\). See Fig. 14.
Proposition 4.1
The particle horizon for an inflationary spacetime is infinite.
Proof
From the definition of an inflationary spacetime, we have
Therefore for any \(\varepsilon > 0\) there exists a \(\delta > 0\) such that \(|a(\tau )/\tau - 1| < \varepsilon\) for all \(0< \tau < \delta\). Hence \(1/a(\tau ) > 1/(1+\varepsilon )\tau\) for all \(0< \tau < \delta\). Then the particle horizon at the moment of last scattering is
Thus the particle horizon is infinite. \(\square\)
For Milne-like spacetimes, the origin \(\mathscr {O}\) plays a unique role. The lightcones of any two points must intersect above \(\mathscr {O}\). This follows from the metric being conformal to Minkowski space, \(g_{\mu \nu } = \Omega ^2(\tau )\eta _{\mu \nu }\). As such the lightcones are given by 45° angles; see Fig. 15 which clarifies the situation depicted in Fig. 14.
Also we observe that the comoving observers all emanate from the origin \(\mathscr {O}\). Indeed a comoving observer \(\gamma (\tau )\) is specified by a point \((R_0, \theta _0, \phi _0)\) on the hyperboloid.
In the \((t, r, \theta , \phi )\) coordinates introduced in Eq. (3.11), the comoving observer is given by
where
Thus the relationship between t and r for \(\gamma\) is \(t = \coth (R_0) r\). Therefore for any comoving observer, we have \(t = Cr\) for some \(C > 1\). Thus the comoving observers emanate from the origin.
4.2 The Cosmological Constant Appears as an Initial Condition
In this section we show how the cosmological constant \(\Lambda\) can appear as an initial condition for Milne-like spacetimes. This may help explain the origin of \(\Lambda\). If dark energy is really modeled by a cosmological constant and not by some other model (e.g. quintessence), then \(\Lambda\) would have been fixed at the big bang.
Another interesting result in this direction is [1]. In their paper the authors show how the cosmological constant may arise from a topological quantity via Chern–Simons invariants. The authors make use of exotic 4-manifolds admitting hyperbolic geometry which are ultimately expressed as hyperbolic FLRW spacetimes.
Fix \(k\ge 2\). For this section let (M, g) denote a \(C^k\) Milne-like spacetime. Consider the Einstein equations with a cosmological constant
Let \(u = \partial / \partial \tau\) denote the four-velocity of the comoving observers and let e be any unit spacelike orthogonal vector (its choice does not matter by isotropy). We define the normal energy density \(\rho _{\mathrm{normal}}(\tau )\) and normal pressure function \(p_{\mathrm{normal}}(\tau )\) in terms of the energy-momentum tensor
Then the energy density \(\rho\) and pressure function p in terms of \(\rho _{\mathrm{normal}}\) and \(p_{\mathrm{normal}}\) are given by
If \(\rho _{\mathrm{normal}} = p_{\mathrm{normal}} = 0\) (e.g. de Sitter), then the equation of state for the cosmological constant is fixed for all \(\tau\).
We show that this equation of state appears as an initial condition. For the following theorem, we define \(\rho (0) := \lim _{\tau \rightarrow 0} \rho (\tau )\). Likewise with p(0) and \(\rho _{\mathrm{normal}}(0)\) and \(p_{\mathrm{normal}}(0)\).
Theorem 4.2
Suppose the scale factor satisfies\(a''(\tau ) = \alpha \tau + o(\tau )\). Then
We prove Theorem 4.2 at the end of this section. First we understand its implications. If the cosmological constant \(\Lambda\) is the dominant energy source during the Planck era, then we have the following connection between \(\Lambda\) and the initial condition of the scale factor.
Proposition 4.3
Suppose the scale factor satisfies\(a''(\tau ) = \alpha \tau + o(\tau )\), and we have\(\rho _{\mathrm{normal}}(0) = p_{\mathrm{normal}}(0) = 0\). Then
Proof
This follows from Theorem 4.2 and Eq. (4.10). \(\square\)
Remark
In (3+1)-dimensional de Sitter space we have \(T_{\mu \nu } = 0\) and \(\Lambda = 3\). In the open slicing coordinates of de Sitter, we have \(a(\tau ) = \sinh (\tau )\). Hence \(\alpha = a'''(0) = 1\). Therefore de Sitter space is a special example of Proposition 4.3.
Now we examine how an inflaton scalar field behaves in the limit \(\tau \rightarrow 0\). We will demonstrate that slow-roll inflation follows if the initial condition for the potential is given by the cosmological constant: \(V|_{\tau = 0} = \Lambda /8\pi\). Recall the energy-momentum tensor for a scalar field \(\phi\) is
And its energy density and pressure function are
The following proposition shows that when the initial condition for V is determined by the cosmological constant, then one obtains a slow-roll era.
Proposition 4.4
Suppose the scale factor satisfies\(a''(\tau ) = \alpha \tau + o(\tau )\), and we have
- (1)
If\(V\big (\phi (0)\big ) = 3\alpha /8\pi\), then\(\phi '(0) = 0\).
- (2)
If\(\rho _{{\mathrm{normal}}}(0) = p_{{\mathrm{normal}}}(0) =0\)and\(V\big (\phi (0)\big ) = \Lambda /8\pi\), then\(\phi '(0) = 0\)
Proof
(1) follows from Theorem 4.2 and (2) follows from Proposition 4.3. \(\square\)
Proof of Theorem 4.2
Friedmann’s equations are \((8\pi /3)\rho = H^2 -1/a^2\) and \(8\pi p = -2a''/a - (8\pi /3)\rho\) where \(H = a'/a\) is the Hubble parameter. Using \(a(\tau ) = \tau + f(\tau )\), the Friedmann equations become
and
By definition of an inflationary spacetime, we have \(f'(0) := \lim _{\tau \rightarrow 0}f(\tau )/\tau = 0\). Also, since \(a''(\tau ) = \alpha \tau + o(\tau )\), we have \(0 = a''(0) = f''(0) = \lim _{\tau \rightarrow 0}f'(\tau )/\tau\) and \(\alpha = \lim _{\tau \rightarrow 0}f''(\tau ) /\tau\). Therefore for all \(\varepsilon > 0\), there is a \(\delta > 0\) such that \(|f''(\tau )/\tau - \alpha | < \varepsilon\) for all \(0< \tau < \delta\). Integrating this expression gives \((\alpha - \varepsilon )\tau /2< f'(\tau )/\tau < (\alpha + \varepsilon )\tau /2\). Plugging this into the first Friedmann equation yields \(8\pi \rho (0)/ 3 = \alpha\). Using this for the second Friedmann equation yields \(-8\pi p(0) = 3\alpha\). \(\square\)
4.3 Lorentz Invariance
In this section we show that the isometry group for Milne-like spacetimes contains the Lorentz group. Since Lorentz invariance plays a pivotal role in QFT (e.g. the field operators are constructed out of finite dimensional irreducible represenations of the Lorentz group [38, 41]), Milne-like spacetimes are a good background model if one wants to develop a quantum theory of cosmology.
Remark
In this section \(\Lambda\) will always denote an element of the Lorentz group (i.e. a Lorentz transformation) and not the cosmological constant.
Let \(\eta _{\mu \nu }\) be the Minkowski metric. The Lorentz group is
A Lorentz transformation \(\Lambda\) shifts elements in Minkowski space via \(x^\mu \mapsto \Lambda ^\mu _{\,\,\,\nu }x^\nu\), but it leaves the hyperboloids fixed. More generally this applies to any Milne-like spacetime by the same map (Fig. 17).
For a Milne-like spacetime, we have \(g_{\mu \nu } = \Omega ^2(\tau ) \eta _{\mu \nu }\) where \(\eta _{\mu \nu }\) is the usual Minkowski metric. Since a Lorentz transformation leaves hyperboloids invariant, we have
Recall the Lorentz group \(\text {L} = \text {O}(1,3)\) has four connected components \(\text {L}^\uparrow _{+}\), \(\text {L}^\uparrow _-\), \(\text {L}^\downarrow _{+}\), \(\text {L}^\downarrow _{-}\). The ± corresponds to \(\det \Lambda = \pm 1\), the \(\uparrow\) corresponds to \(\Lambda ^0_{\,\,\,0} \ge 1\), and the \(\downarrow\) corresponds to \(\Lambda ^0_{\,\,\,0} \le -1\).
Lorentz transformations fix the origin (i.e. \(\Lambda \mathscr {O} = \mathscr {O}\)) and are isometries on the spacetime manifold with boundary \((M \cup \partial ^-{M}) \setminus \{\mathscr {O}\}\). We will say that any map which fixes \(\mathscr {O}\) and is an isometry on the spacetime manifold with boundary \((M \cup \partial ^-M) \setminus \{\mathscr {O}\}\) is an isometry on\(M \cup \partial ^-M\). Note that the set of isometries on \(M \cup \partial ^-M\) forms a group via composition. Since Milne-like spacetimes are defined for \(t > 0\), only the subgroup \(\text {L}^\uparrow = \text {L}^\uparrow _+ \cup \text {L}^\uparrow _-\) acts by isometries on Milne-like spacetimes. If (M, g) admits a \(C^2\)-extension, then we obtain an isomorphism.
Theorem 4.5
Let (M, g) be a Milne-like spacetime. Then any\(\Lambda \in \text {L} ^\uparrow\)is an isometry on\(M \cup \partial ^-M\).
Theorem 4.6
If a Milne-like spacetime admits a\(C^2\)-extension, then\(\text {L} ^\uparrow\)is isomorphic to the group of isometries on\(M \cup \partial ^-M\).
Remark
To the best of the author’s knowledge, Theorem 4.6 is a new result. Its proof relies on the existence of \(\partial ^-M\).
Proofs of Theorems 4.5 and 4.6
Let \(\Lambda\) be an element of \(\text {L}^\uparrow\). It produces a unique map, \(x \mapsto \Lambda x\) via \(x^\mu \mapsto \Lambda ^\mu _{\,\,\,\nu }x^\nu\) where \((x^0, x^1, x^2, x^3) = (t, x, y, z)\) are the conformal Minkowski coordinates introduced in the proof of Theorem 3.4. Since (M, g) is only defined for \(t > 0\), we must restrict to Lorentz transformations \(\Lambda \in \text {L}^\uparrow\). Consider a point \(p \in M\) and a tangent vector \(X = X^\mu \partial _\mu\) at p. Then \(\Lambda\) acts on X by \(d\Lambda (X) = \Lambda ^\mu _{\,\,\,\nu } X^\nu \partial _\mu\) and sending it to the point \(\Lambda p \in M\). Since \(\Omega (\tau \circ p) = \Omega (\tau \circ \Lambda p)\), we have
Thus \(\Lambda\) is an isometry. Now consider \(p \in \partial ^-M\). Then \(\Lambda p \in \partial ^-M\) and we have \(\Omega |_p = \Omega |_{\Lambda p} = \Omega (0)\). Therefore the calculation above carries through in this case as well. This proves Theorem 4.5.
Now we prove Theorem 4.6. By Theorem 4.5 we have \(\text {L}^\uparrow\) is a subgroup, so it suffices to show it’s the whole group. Suppose f is an isometry on \(M \cup \partial ^-M\). The differential map \(df_{\mathscr {O}}\) is a linear isometry on the tangent space at \(\mathscr {O}\). Therefore \(df_{\mathscr {O}}\) corresponds to an element of the Lorentz group, say \(\Lambda ^\mu _{\,\,\,\nu }\). It operates on vectors X at \(\mathscr {O}\) via \(df(X) = \Lambda ^\mu _{\,\,\,\nu }X^\nu \partial _\mu\). Now we define the isometry \(\tilde{f}\) by \(\tilde{f}(x) = \Lambda ^\mu _{\,\,\,\nu }x^\nu\). Consider the set
Note that if \(df_p = d\tilde{f}_p\), then \(f(p) = \tilde{f}(p)\). Hence it suffices to show \(A = M \cup \partial ^-M\). A is nonempty since \(\mathscr {O} \in A\), and A is closed because \(df - d\tilde{f}\) is continuous. So since \(M \cup \partial ^-M\) is connected, it suffices to show A is open in the subspace topology. Let \(p \in A\). Since \(\Omega\) is \(C^2\), there is a normal neighborhood U about p. If \(q \in U\), there is a vector X at p such that \(\exp _p(X) = q\). Since isometries map geodesics to geodesics, they satisfy the property \(f \circ \exp _p = \exp _{f(p)} \circ \, df_p\) for all points in U (see p. 91 of [28]). Therefore
Thus \(f(q) = \tilde{f}(q)\) for all \(q \in U\); hence \(df_q = d\tilde{f}_q\) for all \(q \in U\). Therefore A is open. \(\square\)
4.4 A Possible Dark Matter Particle?
The symmetries in quantum theory can be characterized by local symmetries and global spacetime symmetries.
Local symmetries correspond to the gauge symmetry group \(\text {SU}(3) \times \text {SU}(2) \times \text {U}(1)\) of the standard model. In gauge theory the Lagrangian is invariant under position-dependent gauge transformations; hence the world ‘local.’ Local gauge invariance necessitates the existence of gauge fields in the Lagrangian which are then checked experimentally. The \(\text {SU}(3)\) part describes the strong interaction of quantum chromodynamics. The \(\text {SU}(2) \times \text {U}(1)\) part describes the electroweak interaction.
Global spacetime symmetries are the symmetries of the underlying spacetime manifold. Since the standard model is modeled on Minkowski space, the global spacetime symmetry group is the Poincaré group which are the isometries in Minkowski space. Wigner’s classification [43] of the irreducible unitary representations of the Poincaré group described the spin properties of elementary particles which is considered a huge success in mathematical quantum field theory.
If one wants to build a quantum theory on a cosmological background, then Milne-like spacetimes are a preferred model since they are Lorentz invariant. Wigner’s success in the classification of the Poincaré group motivates us to seek the irreducible unitary representations of the Lorentz group. Similar to Wigner’s analysis, we desire all projective unitary representations to lift to unitary representations. Therefore we really seek the irreducible unitary representations of \(\text {SL}(2,\mathbb {C})\) which is the simply connected double cover of \(\text {L}^\uparrow _+\).
Classification of the Irreducible Unitary Representations of \(\text {SL}(2,\mathbb {C})\)
This classification comes from Theorem 10.9 in [38]. There are two classes of irreducible unitary representations of \(\text {SL}(2,\mathbb {C})\). The first class is the principal series. These particles are characterized by the parameter \(\nu = -iw\) where w is real and spin \(j = 0, 1/2, 1, \dotsc\) The second class is the complementary series. These particles are characterized by the parameter \(-1 \le \nu \le 1\) and spin \(j = 0\). The terminology ‘principal’ and ‘complementary’ comes from the classification of irreducible unitary representations of semi-simple Lie groups [37].
Given that the comoving observers in a Milne-like spacetime all emanate from the origin \(\mathscr {O}\) (see Fig. 16), a possible physical interpretation of this classification would be that these are the particles created at the big bang. Then the principal series would correspond to the particles which make up the standard model (in analogy to Wigner’s classification of the Poincaré group), but this leaves the complementary series up to interpretation. Perhaps
But is there any evidence that dark matter is comprised of spin 0 particles? Yes. Scalar field dark matter (SFDM) [17, 18, 26, 36] also known as Bose-Einstein condensate (BEC) dark matter [22, 27, 32,33,34, 39] also known as wave dark matter (WDM) [7, 8, 15, 29] also known as fuzzy dark matter (FDM) [20, 21] all use the Klein–Gordon equation (i.e. the wave equation for spin 0 particles) to model dark matter. The difference in name comes from a difference in motivation. One reason for introducing models of dark matter based on the Klein–Gordon equation is to alleviate the cusp problem associated with the weakly interacting massive particle (WIMP) models of dark matter [23]. Furthermore, the models based on the Klein–Gordon equation reproduce the observed spiral pattern density in disk galaxies (see Figs. 1–4 in [8]) which makes these models promising.
The parameters \(\nu\) and j in the classification of \(\text {SL}(2,\mathbb {C})\) are determined by Casimir operators built out of rotations and Lorentz boosts (see Eq. 10.3-1 in [38]). The spin parameter j comes from the usual rotation generators \([J_i, J_j] = i\varepsilon ^{ijk}J_k\). The parameter \(\nu\) is determined by requiring unitary of the boost generators \([K_i, K_j] = -i\varepsilon ^{ijk}J_k\). See Sect. 10.3.3 and Appendix VII of [38] for the full details.
What’s interesting is that the parameter \(\nu\) takes on very different forms for the principal series and the complementary series. It would be interesting if there is any new physics here. If the identification “principal series = normal matter” and “complementary series = dark matter” is true, then the distinguishing feature could be related to this parameter \(\nu\). Perhaps this could offer an explanation for dark matter’s lack of interaction with electromagnetism.
4.5 What Lies Beyond \(\tau = 0\)?
Since Milne-like spacetimes extend through \(\tau = 0\), it is an interesting question to ask what exists in the extension. Of course this is only speculation, but hints can be found when one considers the maximal analytic extension whenever \(\Omega\) is analytic on \(M \cup \partial ^-M\). For \(a(\tau ) = \tau\) (i.e. the Milne universe), the maximal analytic extension is Minkowksi space. For \(a(\tau ) = \sinh (\tau )\), the maximal analytic extension is de Sitter space.
For Minkowski space and de Sitter space, we have the full Lorentz group \(\text {L} = \text {O}(1,3)\) acting as isometries at the origin \(\mathscr {O}\). When elements in \(\text {L}^\downarrow = \text {L}^\downarrow _+ \cup \text {L}^\downarrow _-\) act at the origin, it produces a PT symmetric spacetime (i.e. one where the map \((t,x,y,z) \mapsto (-t,-x,-y,-z)\) is an isometry).
Lorentz Invariance at \(\mathscr {O}\) Implies an Antimatter Universe
Let (M, g) be a Milne-like spacetime. Requiring \(\mathscr {O}\) to be Lorentz invariant (i.e. the full Lorentz group \(\text {L} = \text {O}(1,3)\) acts at the origin \(\mathscr {O}\)) produces a PT symmetric universe. Given the CPT theorem [35], perhaps the universe’s missing antimatter is contained in the PT symmetric universe.
We remark this idea is closely related to the same idea in [6]. There the authors consider a \(k = 0\) FLRW spacetime with metric \(g = -d\tau ^2 + a^2(\tau )\big [dx^2 + dy^2 + dz^2\big ]\) in a radiation dominated era \(a(\tau ) \propto \sqrt{\tau }\). By moving to conformal time \(\tilde{\tau }\) given by \(d\tilde{\tau } = d\tau /a(\tau )\), one arrives at the metric \(g = a^2(\tilde{\tau })\big [-d\tilde{\tau }^2 + dx^2 + dy^2 + dz^2 \big ]\) where \(a(\tilde{\tau }) \propto \tilde{\tau }\). They then analytically extend the function \(a(\tilde{\tau })\) from \((0, + \infty )\) to \(\mathbb {R}\) and call the \((-\infty , 0)\) part the ‘CPT-symmetric’ universe. However, at \(\tilde{\tau } = 0\), the metric is \(g = 0\). Hence it’s degenerate. Therefore this is not a spacetime extension.
Remark
The antimatter universe in Fig. 18 was speculated by assuming \(\text {L}^{\downarrow }\) acts by isometries. Similarly one can speculate what lies in the white region between the universe and antimatter universe. If the Lorentz group acts by isometries, then the white region will be foliated by three-dimensional de-Sitter spacetimes.
How Can we Interpret the PT Symmetric Universe as an Antimatter Universe?
Consider an experimentalist. If the universe is modeled by a Milne-like spacetime, then the experimentalist will use coordinates that coincide with the comoving observers (except for a small correction due to the Milky Way’s velocity relative to the CMB). Therefore the experimentalist will build rods and clocks which measure distances and times with respect to the metric
With these coordinates, the experimentalist will measure an energy \(p^0\) and momentum \(p^i\) of a particle with mass m such that \(-g_{\mu \nu }p^\mu p^\nu = -\Omega ^2(\tau )\eta _{\mu \nu }p^\mu p^\nu = m^2.\) Assuming the de Broglie relations \(p^\mu \rightarrow i\partial ^\mu\) for a spin 0 field \(\psi\) yields a Lorentz invariant Klein–Gordon equation for Milne-like spacetimes \(\big [\Omega ^2(\tau )\eta ^{\mu \nu }\partial _\mu \partial _\nu \big ] \psi = m^2 \psi .\) Likewise, for a Dirac spinor \(\psi\), we have a Lorentz invariant Dirac equation for Milne-like spacetimes
(Recall we are using the \((-, +, +, +)\) signature convention, so our Dirac equation does not include a factor of i, but our \(\gamma ^\mu\) matrices do.) Lorentz invariance follows because both \(\Omega\) and the original Dirac equation are Lorentz invariant.
In the Weyl representation, the matrices \(\gamma ^\mu\) are
where \(\sigma ^j\) are the usual Pauli spin matrices.
Choose a single comoving observer in the PT extension. Pick coordinates (t, x, y, z) so that they are aligned with this comoving observer (this can always be done by a Lorentz transformation at \(\mathscr {O}\)). Then \(x = y = z =0\) along the observer’s path. From Eq. (3.11), the relationship between \(\tau\) and t is \(b(\tau ) = t\). Therefore \(\partial _t = b'(\tau )\partial _\tau\). Hence \(\partial _\tau = \Omega (\tau )\partial _t\), and so Eq. (4.20) becomes
Remark
There is a certain elegance in the simplicity of Eq. (4.23). In this form we see the Dirac equation is manifestly Lorentz invariant, and it highlights a correspondence between mass and the proper time of comoving observers. This suggests that the global spacetime symmetry group for Milne-like spacetimes should be \(\mathbb {R}\times \text {O}(1,3)\) where the \(\mathbb {R}\) factor corresponds to translations in cosmic time \(\tau\) and yields the physical quantity mass. Compare this with the Poincaré group \(\mathbb {R}^4 \rtimes \text {O}(1,3)\) which is the global spacetime symmetry group for Minkowski space. The mass for the Poincaré group comes from the \(\mathbb {R}^4\) factor in the semi-direct product. See [25] for a mathematical discussion of this comparison.
Experimentalists in our universe \(I^+(\mathscr {O})\) would use coordinates (t, x, y, z) to make observations and measurements. Anti-experimentalists in the anti-universe \(I^-(\mathscr {O})\) would use coordinates \((-t,-x,-y,-z)\). Therefore the Dirac equation that the anti-experimentalists would use is \(\big [\Omega (\tau )\gamma ^\mu (-\partial _\mu )\big ]\psi = m\psi\). Note that this is equivalent to \(\Omega (\tau )\gamma ^\mu \partial _\mu \psi = -m\psi\) (i.e. it’s the Dirac equation with negative mass). This explains the ± ambiguity one arrives at when deriving the Dirac equation.
Whether \(\psi\) solves the Dirac equation for \(I^+(\mathscr {O})\) or \(I^-(\mathscr {O})\), the anticommutation Clifford relations imply
We can introduce electromagnetism in the Dirac equation via an electromagnetic potential \(A_\mu\) with the usual prescription \(i\partial _\mu \rightarrow i\partial _\mu - eA_\mu\), or equivalently, \(\partial _\mu \rightarrow \partial _\mu + ieA_\mu\). Then the corresponding Dirac equations for the experimentalist in \(I^+(\mathscr {O})\) and the anti-experimentalist in \(I^-(\mathscr {O})\) are, respectively,
Define the matrices
where \(\underline{x} = - x^\mu \sigma ^\nu \eta _{\mu \nu }\) and \(\text {P}x = (x^0, -x^1, -x^2, -x^3)\). Let \(\text {PT} \in \text {GL}(4,\mathbb {C})\) be an element which reverses both space and time. There are two choices which differ by a negative sign. We choose
Then \(\gamma (-x) = \text {PT}\, \gamma (x)(\text {PT})^{-1}\). Hence \(\text {PT}\) reverses space and time by acting on \(\gamma (x)\) via conjugation. Note that \(\text {PT}\,\gamma ^\mu = - \gamma ^\mu \;\text {PT}\) and \((\gamma ^\mu )^* = -\gamma ^\mu\). Therefore matrix multiplication and complex conjugation yield the following table.
Spinor field | Equation | An interpretation |
---|---|---|
\(\psi\) | \(\Omega \gamma ^\mu (\partial _\mu + ieA_\mu )\psi = m\psi\) | \(\psi\) in \(I^+\mathscr {O})\) |
\(\psi ^*\) | \(\Omega \gamma ^\mu (-\partial _\mu + ieA_\mu )\psi ^* = m\psi ^*\) | \(\psi\) in \(I^-(\mathscr {O})\) |
\(\text {PT}\,\psi\) | \(\Omega \gamma ^\mu (-\partial _\mu - ieA_\mu )\text {PT}\,\psi = m\text {PT}\,\psi\) | Anti \(\psi\) in \(I^-(\mathscr {O})\) |
\(\text {PT}\,\psi ^*\) | \(\Omega \gamma ^\mu (\partial _\mu - ieA_\mu )\text {PT}\,\psi ^* = m\text {PT}\,\psi ^*\) | Anti \(\psi\) in \(I^+(\mathscr {O})\) |
Given the interpretation, perhaps the big bang at \(\mathscr {O}\) produced equal amounts of matter and antimatter. The matter, represented by \(\psi\), traveled into our universe \(I^+(\mathscr {O})\) while the antimatter, represented by \(\text {PT}\,\psi\), traveled into the antimatter universe \(I^-(\mathscr {O})\). The antimatter that we observe in our universe comes in the form \(\text {PT}\,\psi ^*\) while the anti-antimatter the anti-experimentalists observe comes in the form \(\psi ^*\).
Remark
The Lorentz invariance of the Dirac equation described in this section can be carried over to Lagrangians in QFT by appropriately including factors of \(\Omega\).
References
Asselmeyer-Maluga, T., Krol, J.: How to obtain a cosmological constant from small exotic \(\mathbb{R}^4\). Phys. Dark Univ. 19, 66–77 (2018)
Borde, A.: Open and closed universes, initial singularities, and inflation. Phys. Rev. D 50, 3692–3702 (1994)
Borde, A., Vilenkin, A.: Singularities in inflationary cosmology: a review. Int. J. Mod. Phys. D 05(06), 813–824 (1996)
Borde, A., Vilenkin, A.: Violation of the weak energy condition in inflating spacetimes. Phys. Rev. D 56, 717–723 (1997)
Borde, A., Guth, A.H., Vilenkin, A.: Inflationary spacetimes are incomplete in past directions. Phys. Rev. Lett. 90, 151301 (2003)
Boyle, L., Finn, K., Turok, N.: cpt-symmetric universe. Phys. Rev. Lett. 121, 251301 (2018)
Bray, H.L.: On wave dark matter, shells in elliptical galaxies, and the axioms of general relativity, (2012) arXiv:1212.5745
Bray, H.L.: On dark matter, spiral galaxies, and the axioms of general relativity. Geometric Anal. Math. Relativ. Nonlinear Partial Differ. Equ. 599, 1–64 (2013)
Carroll, S.M., Chen, J.: Spontaneous inflation and the origin of the arrow of time, (2004) arXiv:hep-th/0410270
Chruściel, P.T., Cortier, J.: Maximal analytic extensions of the Emparan-Reall black ring. J. Differ. Geom. 85(3), 425–459 (2010)
Chruściel, P.T., Delay, E., Galloway, G.J., Howard, R.: Regularity of horizons and the area theorem. Ann. Henri Poincaré 2(1), 109–178 (2001)
Coleman, S., De Luccia, F.: Gravitational effects on and of vacuum decay. Phys. Rev. D 21, 3305–3315 (1980)
Galloway, G.J., Ling, E.: Some remarks on the C0-inextendibility of spacetimes. Ann. Henri Poincaré 18(10), 3427–3447 (2017)
Galloway, G.J., Ling, E.: Topology and singularities in cosmological spacetimes obeying the null energy condition. Commun. Math. Phys. (2017)
Goetz, A.S.: The Einstein-Klein-Gordon Equations, Wave Dark Matter, and the Tully-Fisher Relation, ProQuest LLC, Ann Arbor, MI, Thesis (Ph.D.). Duke University (2015)
Guth, A.H.: Inflationary universe: a possible solution to the horizon and flatness problems. Phys. Rev. D 23, 347–356 (1981)
Guzmán, F.S., Matos, T.: Scalar fields as dark matter in spiral galaxies. Classical Quantum Gravity 17(1), L9–L16 (2000)
Guzmán, F.S., Arturo Ureña López, L.: Evolution of the schrödinger-newton system for a self-gravitating scalar field. Phys. Rev. D 69, 124033 (2004)
Hawking, S.W., Ellis, G.F.R.: The Large Scale Structure of Space-Time, Cambridge Monographs on Mathematical Physics, No 1. Cambridge University Press, London (1973)
Hu, W., Barkana, R., Gruzinov, A.: Fuzzy cold dark matter: the wave properties of ultralight particles. Phys. Rev. Lett. 85(6), 1158–1161 (2000)
Hui, L., Ostriker, J.P., Tremaine, S., Witten, E.: Ultralight scalars as cosmological dark matter. Phys. Rev. D 95(4), 043541 (2017)
Ji, S.U., Sin, S.J.: Late-time phase transition and the galactic halo as a bose liquid. ii. The effect of visible matter. Phys. Rev. D 50, 3655–3659 (1994)
Lee, J.-W.: Is dark matter a bec or scalar field? J. Korean Phys. Soc. 54 (2009)
Lesourd, M.: Cosmological singularities from high matter density without global topological assumptions. Gen. Relat. Gravit. 51(9), 113 (2019)
Ling, E.: Milne-like spacetimes and their symmetries, (2018) arXiv:1803.00174
Magaña, J., Matos, T.: A brief review of the scalar field dark matter model. J. Phys. 378(1), 012012 (2012)
Matos, T., Ureña López, L.A.: Flat rotation curves in scalar field galaxy halos. Gen. Relat. Gravit. 39(8), 1279–1286 (2007)
O’Neill, B.: Semi-Riemannian Geometry, Pure and Applied Mathematics, vol. 103. Academic Press Inc. [Harcourt Brace Jovanovich Publishers], New York (1983)
Parry, A.R.: Wave Dark Matter and Dwarf Spheroidal Galaxies, ProQuest LLC, Ann Arbor, MI, Thesis (Ph.D.). Duke University (2013)
Sbierski, J.: The C0-inextendibility of the Schwarzschild spacetime and the spacelike diameter in Lorentzian geometry. J. Differ. Geom. 108(2), 319–378 (2018)
Sbierski, J.: On the proof of the C0-inextendibility of the schwarzschild spacetime. J. Phys. 968(1), 012012 (2018)
Seidel, E., Suen, W.-M.: Dynamical evolution of boson stars: perturbing the ground state. Phys. Rev. D 42, 384–403 (1990)
Sharma, R., Karmakar, S., Mukherjee, S.: Boson star and dark matter (2008.) arXiv:0812.3470
Sin, S.-J.: Late-time phase transition and the galactic halo as a bose liquid. Phys. Rev. D 50, 3650–3654 (1994)
Streater, R.F., Wightman, A.S.: PCT, Spin and Statistics, and All That. W. A. Benjamin Inc., New York (1964)
Suárez, A., Robles, V.H., Matos, T.: A review on the scalar field/bose-einstein condensate dark matter model. In: Claudia González, M., Madriz, A., José, E., Reyes, B., Luz, M. (eds.) Accelerated Cosmic Expansion (Cham), pp. 107–142. Springer, New York (2014)
Tung, A.: Representation Theory of Semisimple Groups: An Overview Based on Examples. Princeton University Press, Princeton (1986)
Tung, W.-K.: Group Theory in Physics. World Scientific Publishing Co., Philadelphia (1985)
Ureña López, L.A., Bernal, A.: Bosonic gas as a galactic dark matter halo. Phys. Rev. D 82, 123535 (2010)
Wald, R.M.: General Relativity. University of Chicago Press, Chicago (1984)
Weinberg, S.: The Quantum Theory of Fields, vol. I. Cambridge University Press, Cambridge (2005). Foundations
Weinberg, S.: Cosmology. Oxford University Press, Oxford (2008)
Wigner, E.: On unitary representations of the inhomogeneous lorentz group. Ann. Math. 40(1), 149–204 (1939)
Acknowledgements
Open access funding provided by Royal Institute of Technology. The author thanks Greg Galloway, Paul Tod, Bob Wald, Hugh Bray, Gary Horowitz, Gerald Folland, Orlando Alvarez, Michel Mizony, Bob Klauber, and Torsten Asselmeyer-Maluga for helpful comments and discussions.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix 1: A Brief Account of Cosmological Singularity Theorems
In this appendix we give a brief account of some of the singularity theorems used in cosmology. The purpose of this appendix is to demonstrate that the singularity theorems don’t always apply to inflationary spacetimes. Indeed Milne-like spacetimes can almost always be used as counterexamples.
The first step in developing a cosmological theory is to assume the Copernican principle. This assumption is supported by the highly uniform CMB radiation. The Copernican principle implies that the spacetimes (M, g) which model cosmology are given by \(M = I \times \Sigma\) and \(g = -d\tau ^2 + a^2(\tau )h\) where \(I\subset \mathbb {R}\) is an interval and \((\Sigma ,h)\) are spaces of constant sectional curvature (i.e. maximally symmetric spaces). These are called FLRW spacetimes. Let \(\tau _{0} = \inf I\). If one assumes the universe is in a radiation-dominated era for all \(\tau _{0}< \tau < \tau _1\) given some \(\tau _1\), then one finds \(a(\tau ) \rightarrow 0\) as \(\tau \rightarrow \tau _{0}\) and \(\tau _{0} > -\infty\). In this case we say \(\tau _{0}\) is the big bang. By shifting the \(\tau\) coordinate, we can assume \(\tau _0 = 0\). Moreover the scalar curvature diverges as \(\tau \rightarrow 0\), so \(\tau = 0\) admits a curvature singularity. These arguments generalize if one replaces the assumption that the universe is in a radiation-dominated era with the assumption that the universe is nonvacuum and obeys the strong energy condition.
The singularity theorems of Hawking and Penrose [19] demonstrated that singularities (in the sense of timelike or null geodesic incompleteness) are a generic feature of physically relevant spacetimes. These theorems don’t assume any symmetry conditions on the spacetime manifold, but they do assume the strong energy condition. Hawking’s cosmological singularity theorems (see Theorems 55A and 55B in [28]) both assume the strong energy condition.
There is a problem with the strong energy condition assumption in the singularity theorems. Assuming this condition in our universe, one finds that the particle horizon is finite. This implies that there are parts of the CMB that never achieved causal contact in the past. But if this is true, then how could the CMB have such a perfectly uniform temperature? This became known as the horizon problem in cosmology [42].
A resolution to the horizon problem is to assume that the universe underwent a brief period of accelerated expansion, \(a''(\tau ) > 0\), immediately after the big bang and right before the radiation-dominated era. This would allow for causal contact between the different points on the CMB. This theory became known as inflationary theory and was first put forth by Alan Guth [16]. It also solved the flatness problem of cosmology and the magnetic monopole problem of certain grand unified theories [42].
Assuming an inflationary era, \(a''(\tau ) > 0\), then Friedmann’s equation implies \(\rho + 3p < 0\). Hence inflationary models do not satisfy the strong energy condition. Therefore the singularity theorems above no longer apply. New singularity theorems were sought that did not require the strong energy condition. This was done by Borde and Vilenkin [2, 3] and others [14, 24]. Borde and Vilenkin found that some models of inflationary theory also violate the weak energy condition [4]. Then Guth, Borde, and Vilenkin produced a singularity theorem [5], which showed that, even if the weak energy condition is violated, then one has past incompleteness. However their theorem only applies to inflating regions of a spacetime. For example, their theorem applies to the Milne universe (because their theorem only requires an averaged Hubble expansion condition), but the Milne universe isometrically embeds into Minkowski space which is geodesically complete. See Fig. 19.
Note that the geodesic displayed in Fig. 19 is not a geodesic of a comoving observer. All the comoving observers emanate from the origin \(\mathscr {O}\) (see Fig. 5). Section 4.1 gives a derivation of this result.
Appendix 2: Miscellaneous Proofs
1.1 Proof of Proposition 2.2
Definition B.1
The Minkowski metric in \(\mathbb {R}^{n+1}\) is \(\eta = \eta _{\mu \nu }dx^\mu dx^\nu = -(dx^0)^2 + \delta _{ij}dx^idx^j\). For \(0< \varepsilon < 1\), we define the narrow and wide Minkowski metrics
Lemma B.2
Fix\(k \ge 0\). Let (M, g) be a\(C^k\)spacetime. Fix\(p \in M\). For any\(0< \varepsilon < 1\)there is a coordinate system\(\phi :U_\varepsilon \rightarrow \mathbb {R}^{n+1}\)with the following properties
- (1)
\(\phi (p) = 0\)
- (2)
\(g_{\mu \nu }(p) = \eta _{\mu \nu }\)
- (3)
\(I^+_{\eta ^{\varepsilon }} (p, U_\varepsilon ) \,\subset \, I^+ (p, U_\varepsilon ) \,\subset \, I^+_{\eta ^{-\varepsilon }} (p, U_\varepsilon )\).
Moreover if\(\gamma :[a,b] \rightarrow M\)is a unit timelike curve with\(\gamma (b) = p\), then we can choose the coordinate system so that\(\phi \circ \gamma (t) = (t-b, 0, \dotsc , 0)\).
Proof
Pick a coordinate system \(\phi :U \rightarrow \mathbb {R}^{n+1}\) with \(\phi (p) = 0\) and apply Gram-Schmidt to obtain (2). By continuity of the metric, given any \(\varepsilon ' > 0\), we can shrink our neighborhood so that \(|g_{\mu \nu }(x) - \eta _{\mu \nu }| < \varepsilon '\). Let \(X = X^\mu \partial _{\mu }\) be a unit tangent vector (i.e. \(|X^0| = 1\)). Then
If X is \(\eta ^{\varepsilon }\)-timelike, then \(|X^i|^2/|X^0|^2 < (1-\varepsilon )/(1+\varepsilon )\). Since \(|X^0| = 1\), we have
By taking \(\varepsilon ' > 0\) small enough, we can ensure \(2\varepsilon /(1+\varepsilon )\) is larger than the bracket term. This proves the first inclusion in (3). The proof of the second is analogous.
Now let \(\gamma :[a,b] \rightarrow M\) be a unit timelike curve with \(\gamma (b) = p\). Let \((y^0, y^1, \dotsc , y^n)\) be the coordinates on U (i.e. \(y^\mu = \pi ^\mu \circ \phi\) where \(\pi ^\mu :\mathbb {R}^n \rightarrow \mathbb {R}\) are the canonical projections). Since \(g_{\mu \nu }(p) = \eta _{\mu \nu }\), we can shrink U so that \(y^0\) is a time function (i.e. \(\nabla y^0\) is past-directed timelike). Since the definition of a timelike curve requires \(\lim _{t \nearrow b}\gamma '(t)\) to be future-directed timelike, the function \((y^0 \circ \gamma )'(t)\) approaches a nonzero number as \(t \nearrow b\). Therefore the inverse function theorem guarantees an interval \((b - \delta , \,b + \delta )\) around b and a diffeomorphism \(f:(b - \delta ,\, b+\delta ) \rightarrow (-\delta ', \delta ')\) such that \(f = y^0 \circ \gamma\) on \((b-\delta ,\,b]\). Let \(U' \subset U\) be the preimage of \((-\delta ', \delta )\) under \(y^0\). We define new coordinates \((x^0, x^1, \dotsc , x^n)\) on \(U'\) by
With these coordinates we have (Fig. 20)
\(\square\)
Proof of Proposition 2.2
Fix \(q \in I^+(p, U)\) and let \(\gamma :[a,b] \rightarrow U\) be a timelike curve with \(\gamma (a) = p\) and \(\gamma (b) = q\). By rescaling we can assume \(\gamma\) is a unit timelike curve. Let \(\phi :U_\varepsilon \rightarrow \mathbb {R}^{n+1}\) be a coordinate system from Lemma B.2 centered around q. Choose \(\varepsilon = 3/5\) so that \(\eta ^\varepsilon\) has lightcones with slope 2. Choose \(t < b\) such that \(\gamma (t) \in U_\varepsilon\). Then \(I^+_{\eta ^\varepsilon }\big (\gamma (t), U_\varepsilon \cap U\big )\) is an open set since it’s just the interior of a cone intersected with an open set. Moreover, it’s contained in \(I^+\big (\gamma (t), U_\varepsilon \cap U\big )\) which is contained in \(I^+(p,U)\). \(\square\)
1.2 Proof of Lemma 3.5
Fix \(k \ge 2\). Let (M, g) be a \(C^k\) Milne-like spacetime with a scale factor whose second derivative satisfies \(a''(\tau ) = \alpha \tau + C \tau ^3 + o(\tau ^3)\) where \(\alpha , C \in \mathbb {R}\). Recall the coordinate system \(\zeta = (t,r,\theta , \phi )\) from the proof of Theorem 3.4. Here t and r are given by
where \(b :I \rightarrow (0,\infty )\) is given by \(b(\tau ) = \exp \left( \int _{\tau _0}^\tau \frac{1}{a(s)}ds \right)\) for some \(\tau _0 > 0\). With respect to the coordinate system \(\zeta = (t, r, \theta , \phi )\), the metric takes the form
where \(\Omega = 1/b' = a/b\). We have to show for any \(t_0 \ge 0\) the limits
exist and are finite. Note that t and r appearing in the limits above are defined on the open set \(U = \big \{(t, r, \theta , \phi ) \mid t^2 - r^2 < b^2(\tau _{\mathrm{max}}) \text { and } t > 0 \text { and } r \ge 0\big \}\) and where \(\tau _{\mathrm{max}} \in (0, +\infty ]\) is given from the interval \(I = (0, \tau _{\mathrm{max}})\) of the scale factor.
Note that b is a strictly increasing \(C^1\) function which is never zero. Therefore it is invertible and the derivative of its inverse is \((b^{-1})'\big (b(\tau )\big ) = 1/b'(\tau )\). Recall \(\tau = b^{-1}\big (\sqrt{t^2 - r^2} \big )\). Therefore \(\partial \tau / \partial t = t/(b'b)\). Since \(\Omega = a/b = 1/b'\), the chain rule gives
Let’s simplify notation by letting \(a(\tau ) = \tau + f(\tau )\). Then we have
Taking another derivative, we get
Plugging \(b'' = (b/a)' = (b'a - a'b)/a^2 = -bf'/a^2\) into the above expression gives
From the proof of Theorem 3.4, for \(\varepsilon = 1\) there exists a \(\delta > 0\) such that for all \(0< \tau < \delta\), we have
where \(\varepsilon _0\) is given by \(a(\tau ) = \tau + o(\tau ^{1 + \varepsilon _0})\). Since \(f''(\tau ) = \alpha \tau + C\tau ^3 + o(\tau ^3)\), we have \(f'(\tau ) = \frac{1}{2}\alpha \tau ^2 + \frac{1}{4}C\tau ^4 + o(\tau ^4)\). Using (B.67) along with \(b' = b/a\), we see that for small \(\tau\), we have \(b = \tau /\tau _0 + o(\tau )\) and \(b'(\tau ) = 1/\tau _0 + o(\tau )\). Therefore the squeeze theorem gives
Note that we needed the asymptotic condition \(a''(\tau ) = \alpha \tau + C\tau ^3 + o(\tau ^3)\) to get the above equality. Similarly, we have both of the limits
exist and are both finite. Plugging in (B.68) and (B.69) into (B.66) and (B.62), we see that for any \(t_0 \ge 0\), we have that the limits
exist and are finite. Similarly, we obtain the same conclusion for \(\frac{\partial \Omega }{\partial r}\) and \(\frac{\partial ^2 \Omega }{\partial r^2}\). This completes the proof of Lemma 3.5.
1.3 Proof of Proposition 3.9
Fix \(k \ge 2\). Let (M, g) be a \(C^k\) spacetime. Recall the Einstein tensor is \(G_{\mu \nu } = R_{\mu \nu } - \frac{1}{2}Rg_{\mu \nu }\). We say (M, g) satisfies the weak energy condition if \(G_{\mu \nu }X^\mu X^\nu \ge 0\) for all timelike X, and the strong energy condition if \(R_{\mu \nu }X^\mu X^\nu \ge 0\) for all timelike X.
Let (M, g) be an FLRW spacetime. Following [28] and [40], we define the energy density \(\rho\) and pressure function p in terms of the Einstein tensor. If \(u = \partial / \partial \tau\) and e is any unit spacelike vector orthogonal to u, then
Also, as a consequence of isotropy, we have \(R_{\mu \nu }u^\mu e^\nu = 0\) (see Corollary 12.10 of [28]). Therefore
The following proof shows
- (a)
The weak energy condition is equivalent to \(\rho \ge 0\) and \(\rho + p \ge 0\).
- (b)
The strong energy condition is equivalent to \(\rho + p \ge 0\) and \(\rho + 3p \ge 0\).
Proof of Proposition 3.9
-
(a)
Suppose (M, g) satisfies the weak energy condition. Then \(\rho = \frac{1}{8\pi } G_{\mu \nu }u^\mu u^\nu \ge 0\). Now let e be a unit spacelike vector orthogonal to u. Fix \(\varepsilon > 0\) and let X be the timelike vector given by \(X^\mu = (1 + \varepsilon )u^\mu + e^\mu\). Then by Eqs. (B.71) and (B.72), we have
$$\begin{aligned} 0 \,\le \, \frac{1}{8\pi }G_{\mu \nu }X^\mu X^\nu = (1 + \varepsilon )^2\rho + p. \end{aligned}$$Since this is true for all \(\varepsilon > 0\), we have \(\rho + P \ge 0\). Conversely, suppose \(\rho \ge 0\) and \(\rho + p \ge 0\). Let X be any timelike vector. Decompose \(X^\mu = a u^\mu + b e^\mu\) where e is a unit spacelike vector orthogonal to u. Then by Eqs. (B.71) and (B.72), we have \(\frac{1}{8\pi }G_{\mu \nu } X^\mu X^\nu = a^2 \rho + b^2 p\). We have two cases: (1) \(p \ge 0\) and (2) \(p < 0\). In the first case we have \(G_{\mu \nu }X^\mu X^\nu = a^2 \rho + b^2 p \ge 0\). The inequality follows from \(\rho \ge 0\). Thus the weak energy condition holds. Now consider the case \(p < 0\). Then, since \(\rho + p \ge 0\), we have
$$\begin{aligned} \frac{1}{8\pi }G_{\mu \nu }X^\mu X^\nu = a^2 \rho + b^2 p \,\ge \, (- a^2 + b^2)p = p g_{\mu \nu } X^\mu X^\nu \,\ge \, 0. \end{aligned}$$The last inequality follows because \(p < 0\) and X is timelike.
-
(b)
Let \(G = g^{\mu \nu }G_{\mu \nu } = 8\pi (-\rho + 3p)\). Then contracting the Einstein equation with \(g^{\mu \nu }\) shows \(G = -R\). Rearranging the Einstein tensor gives
$$\begin{aligned} R_{\mu \nu } = G_{\mu \nu } + \frac{1}{2}Rg_{\mu \nu } = G_{\mu \nu } + 4\pi (\rho - 3p)g_{\mu \nu } \end{aligned}$$Assume the strong energy condition holds. Then using the above equation, we have \(0 \le R_{\mu \nu } u^\mu u^\nu = 8\pi \rho - 4\pi (\rho - 3p) = 4\pi (\rho + 3p)\) which establishes \(\rho + 3p \ge 0\). To establish \(\rho + p \ge 0\), fix \(\varepsilon > 0\) and consider the timelike vector \(X = (1 + \varepsilon ) u + e\). Then using Eqs. (B.71) and (B.72), we have
$$\begin{aligned} 0\le\, & {} R_{\mu \nu }X^\mu X^\nu = \big [ G_{\mu \nu } + 4\pi (\rho - 3p)g_{\mu \nu }\big ]X^\mu X^\nu \\=\, & {} 4\pi \big [(1 + \varepsilon )^2\rho + \rho - p + 3p(1+\varepsilon )^2\big ]. \end{aligned}$$Since \(\varepsilon > 0\) was arbitrary, we have \(\rho + p \ge 0\). Conversely, suppose \(\rho + p \ge 0\) and \(\rho + 3p \ge 0\). Let X be any timelike vector. Decompose \(X^\mu = a u^\mu + b e^\mu\). Then we have \(\frac{1}{8\pi }G_{\mu \nu }X^\mu X^\nu = a^2 \rho + b^2 p\). Using this in our expression for \(R_{\mu \nu }\), we have
$$\begin{aligned} R_{\mu \nu }X^\mu X^\nu=\, & {} G_{\mu \nu }X^\mu X^\nu + 4\pi (\rho - 3p)g_{\mu \nu }X^\mu X^\nu \\=\, & {} 4\pi \big [a^2(\rho + 3p) + b^2(\rho - p) \big ]. \end{aligned}$$There are two cases to consider: (1) \(\rho - p > 0\) and (2) \(\rho - p \le 0\). In case (1), we immediately have \(R_{\mu \nu }X^\mu X^\nu \ge 0\). Now consider case (2). Since X is timelike, we have \(-a^2 + b^2 < 0\). Therefore it suffices to show \((\rho - p)/ (\rho + 3p) < 1\). Hence it suffices to show \(p > 0\). Indeed \(\rho + p \ge 0\) and \(\rho - p \le 0\) together imply \(p \ge 0\).
\(\square\)
Rights and permissions
OpenAccess This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Ling, E. The Big Bang is a Coordinate Singularity for \(k = -1\) Inflationary FLRW Spacetimes. Found Phys 50, 385–428 (2020). https://doi.org/10.1007/s10701-020-00335-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10701-020-00335-7