The Confrontation between General Relativity and Experiment
Later version available View article history
- First Online:
- Accepted:
DOI: 10.12942/lrr-2006-3
- Cite this article as:
- Will, C.M. Living Rev. Relativ. (2006) 9: 3. doi:10.12942/lrr-2006-3
- 417 Citations
- 1.2k Downloads
Abstract
The status of experimental tests of general relativity and of theoretical frameworks for analyzing them is reviewed. Einstein’s equivalence principle (EEP) is well supported by experiments such as the Eötvös experiment, tests of special relativity, and the gravitational redshift experiment. Ongoing tests of EEP and of the inverse square law are searching for new interactions arising from unification or quantum gravity. Tests of general relativity at the post-Newtonian level have reached high precision, including the light deflection, the Shapiro time delay, the perihelion advance of Mercury, and the Nordtvedt effect in lunar motion. Gravitational wave damping has been detected in an amount that agrees with general relativity to better than half a percent using the Hulse-Taylor binary pulsar, and other binary pulsar systems have yielded other tests, especially of strong-field effects. When direct observation of gravitational radiation from astrophysical sources begins, new tests of general relativity will be possible.
1 Introduction
At the time of the birth of general relativity (GR), experimental confirmation was almost a side issue. Einstein did calculate observable effects of general relativity, such as the perihelion advance of Mercury, which he knew to be an unsolved problem, and the deflection of light, which was subsequently verified. But compared to the inner consistency and elegance of the theory, he regarded such empirical questions as almost peripheral. Today, experimental gravitation is a major component of the field, characterized by continuing efforts to test the theory’s predictions, to search for gravitational imprints of high-energy particle interactions, and to detect gravitational waves from astronomical sources.
The modern history of experimental relativity can be divided roughly into four periods: Genesis, Hibernation, a Golden Era, and the Quest for Strong Gravity. The Genesis (1887–1919) comprises the period of the two great experiments which were the foundation of relativistic physics — the Michelson-Morley experiment and the Eötvös experiment — and the two immediate confirmations of GR — the deflection of light and the perihelion advance of Mercury. Following this was a period of Hibernation (1920–1960) during which theoretical work temporarily outstripped technology and experimental possibilities, and, as a consequence, the field stagnated and was relegated to the backwaters of physics and astronomy.
But beginning around 1960, astronomical discoveries (quasars, pulsars, cosmic background radiation) and new experiments pushed GR to the forefront. Experimental gravitation experienced a Golden Era (1960–1980) during which a systematic, world-wide effort took place to understand the observable predictions of GR, to compare and contrast them with the predictions of alternative theories of gravity, and to perform new experiments to test them. The period began with an experiment to confirm the gravitational frequency shift of light (1960) and ended with the reported decrease in the orbital period of the Hulse-Taylor binary pulsar at a rate consistent with the GR prediction of gravity wave energy loss (1979). The results all supported GR, and most alternative theories of gravity fell by the wayside (for a popular review, see [282]).
Since 1980, the field has entered what might be termed a Quest for Strong Gravity. Many of the remaining interesting weak-field predictions of the theory are extremely small and difficult to check, in some cases requiring further technological development to bring them into detectable range. The sense of a systematic assault on the weak-field predictions of GR has been supplanted to some extent by an opportunistic approach in which novel and unexpected (and sometimes inexpensive) tests of gravity have arisen from new theoretical ideas or experimental techniques, often from unlikely sources. Examples include the use of laser-cooled atom and ion traps to perform ultra-precise tests of special relativity; the proposal of a “fifth” force, which led to a host of new tests of the weak equivalence principle; and recent ideas of large extra dimensions, which have motived new tests of the inverse square law of gravity at sub-millimeter scales.
Instead, much of the focus has shifted to experiments which can probe the effects of strong gravitational fields. The principal figure of merit that distinguishes strong from weak gravity is the quantity ϵ ∼ GM / (Rc^{2}), where G is the Newtonian gravitational constant, M is the characteristic mass scale of the phenomenon, R is the characteristic distance scale, and c is the speed of light. Near the event horizon of a non-rotating black hole, or for the expanding observable universe, ϵ ∼ 0.5; for neutron stars, ϵ ∼ 0.2. These are the regimes of strong gravity. For the solar system, ϵ < 10^{−5}; this is the regime of weak gravity. At one extreme are the strong gravitational fields associated with Planck-scale physics. Will unification of the forces, or quantization of gravity at this scale leave observable effects accessible by experiment? Dramatically improved tests of the equivalence principle, of the inverse square law, or of local Lorentz invariance are being mounted, to search for or bound the imprinted effects of Planck-scale phenomena. At the other extreme are the strong fields associated with compact objects such as black holes or neutron stars. Astrophysical observations and gravitational wave detectors are being planned to explore and test GR in the strong-field, highly-dynamical regime associated with the formation and dynamics of these objects.
In this Living Review, we shall survey the theoretical frameworks for studying experimental gravitation, summarize the current status of experiments, and attempt to chart the future of the subject. We shall not provide complete references to early work done in this field but instead will refer the reader to the appropriate review articles and monographs, specifically to Theory and Experiment in Gravitational Physics [281], hereafter referred to as TEGP. Additional recent reviews in this subject are [276, 284, 286, 71, 98, 239]. References to TEGP will be by chapter or section, e.g., “TEGP 8.9 [281]”.
2 Tests of the Foundations of Gravitation Theory
2.1 The Einstein equivalence principle
The principle of equivalence has historically played an important role in the development of gravitation theory. Newton regarded this principle as such a cornerstone of mechanics that he devoted the opening paragraph of the Principia to it. In 1907, Einstein used the principle as a basic element in his development of general relativity. We now regard the principle of equivalence as the foundation, not of Newtonian gravity or of GR, but of the broader idea that spacetime is curved. Much of this viewpoint can be traced back to Robert Dicke, who contributed crucial ideas about the foundations of gravitation theory between 1960 and 1965. These ideas were summarized in his influential Les Houches lectures of 1964 [93], and resulted in what has come to be called the Einstein equivalence principle (EEP).
One elementary equivalence principle is the kind Newton had in mind when he stated that the property of a body called “mass” is proportional to the “weight”, and is known as the weak equivalence principle (WEP). An alternative statement of WEP is that the trajectory of a freely falling “test” body (one not acted upon by such forces as electromagnetism and too small to be affected by tidal gravitational forces) is independent of its internal structure and composition. In the simplest case of dropping two different bodies in a gravitational field, WEP states that the bodies fall with the same acceleration (this is often termed the Universality of Free Fall, or UFF).
- 1.
WEP is valid.
- 2.
The outcome of any local non-gravitational experiment is independent of the velocity of the freely-falling reference frame in which it is performed.
- 3.
The outcome of any local non-gravitational experiment is independent of where and when in the universe it is performed.
For example, a measurement of the electric force between two charged bodies is a local non-gravitational experiment; a measurement of the gravitational force between two bodies (Cavendish experiment) is not.
- 1.
Spacetime is endowed with a symmetric metric.
- 2.
The trajectories of freely falling test bodies are geodesics of that metric.
- 3.
In local freely falling reference frames, the non-gravitational laws of physics are those written in the language of special relativity.
The argument that leads to this conclusion simply notes that, if EEP is valid, then in local freely falling frames, the laws governing experiments must be independent of the velocity of the frame (local Lorentz invariance), with constant values for the various atomic constants (in order to be independent of location). The only laws we know of that fulfill this are those that are compatible with special relativity, such as Maxwell’s equations of electromagnetism. Furthermore, in local freely falling frames, test bodies appear to be unaccelerated, in other words they move on straight lines; but such “locally straight” lines simply correspond to “geodesics” in a curved spacetime (TEGP 2.3 [281]).
General relativity is a metric theory of gravity, but then so are many others, including the Brans-Dicke theory and its generalizations. Theories in which varying non-gravitational constants are associated with dynamical fields that couple to matter directly are not metric theories. Neither, in this narrow sense, is superstring theory (see Section 2.3), which, while based fundamentally on a spacetime metric, introduces additional fields (dilatons, moduli) that can couple to material stress-energy in a way that can lead to violations, say, of WEP. It is important to point out, however, that there is some ambiguity in whether one treats such fields as EEP-violating gravitational fields, or simply as additional matter fields, like those that carry electromagnetism or the weak interactions. Still, the notion of curved spacetime is a very general and fundamental one, and therefore it is important to test the various aspects of the Einstein equivalence principle thoroughly. We first survey the experimental tests, and describe some of the theoretical formalisms that have been developed to interpret them. For other reviews of EEP and its experimental and theoretical significance, see [126, 162].
2.1.1 Tests of the weak equivalence principle
A number of projects are in the development or planning stage to push the bounds on η even lower. The project MICROSCOPE (MICRO-Satellite à Trainée Compensée pour l’Observation du Principe d’Équivalence) is designed to test WEP to 10^{−15}. It is being developed by the French space agency CNES for a possible launch in March, 2008, for a one-year mission [59]. The drag-compensated satellite will be in a Sun-synchronous polar orbit at 700 km altitude, with a payload consisting of two differential accelerometers, one with elements made of the same material (platinum), and another with elements made of different materials (platinum and titanium).
Another, known as Satellite Test of the Equivalence Principle (STEP) [247], is under consideration as a possible joint effort of NASA and the European Space Agency (ESA), with the goal of a 10^{−18} test. STEP would improve upon MICROSCOPE by using cryogenic techniques to reduce thermal noise, among other effects. At present, STEP (along with a number of variants, called MiniSTEP and QuickSTEP) has not been approved by any agency beyond the level of basic design studies or supporting research and development. An alternative concept for a space test of WEP is Galileo-Galilei [261], which uses a rapidly rotating differential accelerometer as its basic element. Its goal is a bound on η at the 10^{−13} level on the ground and 10^{−17} in space.
2.1.2 Tests of local Lorentz invariance
the classic Michelson-Morley experiment and its descendents [186, 237, 141, 46],
the Ives-Stillwell, Rossi-Hall, and other tests of time-dilation [136, 229, 103],
tests of the independence of the speed of light of the velocity of the source, using both binary X-ray stellar sources and high-energy pions [44, 5],
In addition to these direct experiments, there was the Dirac equation of quantum mechanics and its prediction of anti-particles and spin; later would come the stunningly successful relativistic theory of quantum electrodynamics.
In 2005, on the 100th anniversary of the introduction of special relativity, one might ask “what is there to test?”. Special relativity has been so thoroughly integrated into the fabric of modern physics that its validity is rarely challenged, except by cranks and crackpots. It is ironic then, that during the past several years, a vigorous theoretical and experimental effort has been launched, on an international scale, to find violations of special relativity. The motivation for this effort is not a desire to repudiate Einstein, but to look for evidence of new physics “beyond” Einstein, such as apparent violations of Lorentz invariance that might result from certain models of quantum gravity. Quantum gravity asserts that there is a fundamental length scale given by the Planck length, L_{Pl} = (ħG/c^{3})^{1/2} = 1.6 × 10^{−33} cm, but since length is not an invariant quantity (Lorentz-FitzGerald contraction), then there could be a violation of Lorentz invariance at some level in quantum gravity. In brane world scenarios, while physics may be locally Lorentz invariant in the higher dimensional world, the confinement of the interactions of normal physics to our four-dimensional “brane” could induce apparent Lorentz violating effects. And in models such as string theory, the presence of additional scalar, vector, and tensor long-range fields that couple to matter of the standard model could induce effective violations of Lorentz symmetry. These and other ideas have motivated a serious reconsideration of how to test Lorentz invariance with better precision and in new ways.
A simple and useful way of interpreting some of these modern experiments, called the c^{2}-formalism, is to suppose that the electromagnetic interactions suffer a slight violation of Lorentz invariance, through a change in the speed of electromagnetic radiation c relative to the limiting speed of material test particles (c_{0}, made to take the value unity via a choice of units), in other words, c ≠ 1 (see Section 2.2.3). Such a violation necessarily selects a preferred universal rest frame, presumably that of the cosmic background radiation, through which we are moving at about 370 km s^{−1} [167]. Such a Lorentz-non-invariant electromagnetic interaction would cause shifts in the energy levels of atoms and nuclei that depend on the orientation of the quantization axis of the state relative to our universal velocity vector, and on the quantum numbers of the state. The presence or absence of such energy shifts can be examined by measuring the energy of one such state relative to another state that is either unaffected or is affected differently by the supposed violation. One way is to look for a shifting of the energy levels of states that are ordinarily equally spaced, such as the Zeeman-split 2J + 1 ground states of a nucleus of total spin J in a magnetic field; another is to compare the levels of a complex nucleus with the atomic hyperfine levels of a hydrogen maser clock. The magnitude of these “clock anisotropies” would be proportional to δ ≡|c^{−2} − 1|.
Also included for comparison is the corresponding limit obtained from Michelson-Morley type experiments (for a review, see [127]). In those experiments, when viewed from the preferred frame, the speed of light down the two arms of the moving interferometer is c, while it can be shown using the electrodynamics of the c^{2} formalism, that the compensating Lorentz-FitzGerald contraction of the parallel arm is governed by the speed c_{0} = 1. Thus the Michelson-Morley experiment and its descendants also measure the coefficient c^{−2} − 1. One of these is the Brillet-Hall experiment [46], which used a Fabry-Perot laser interferometer. In a recent series of experiments, the frequencies of electromagnetic cavity oscillators in various orientations were compared with each other or with atomic clocks as a function of the orientation of the laboratory [297, 168, 190, 12, 248]. These placed bounds on c^{−2} − 1 at the level of better than a part in 10^{9}. Haugan and Lömmerzahl [125] have considered the bounds that Michelson-Morley type experiments could place on a modified electrodynamics involving a “vector-valued” effective photon mass.
The c^{2} framework focusses exclusively on classical electrodynamics. It has recently been extended to the entire standard model of particle physics by Kostelecký and colleagues [63, 64, 155]. The “Standard Model Extension” (SME) has a large number of Lorentz-violating parameters, opening up many new opportunities for experimental tests (see Section 2.2.4). A variety of clock anisotropy experiments have been carried out to bound the electromagnetic parameters of the SME framework [154]. For example, the cavity experiments described above [297, 168, 190] placed bounds on the coefficients of the tensors \({{\tilde k}_{{\rm{e}} -}}\) and \({{\tilde k}_{{\rm{o +}}}}\) (see Section 2.2.4 for definitions) at the levels of 10^{−14} and 10^{−10}, respectively. Direct comparisons between atomic clocks based on different nuclear species place bounds on SME parameters in the neutron and proton sectors, depending on the nature of the transitions involved. The bounds achieved range from 10^{−27} to 10^{−32} GeV.
Mattingly [182] gives a thorough and up-to-date review of both the theoretical frameworks and the experimental results for tests of LLI.
2.1.3 Tests of local position invariance
After almost 50 years of inconclusive or contradictory measurements, the gravitational redshift of solar spectral lines was finally measured reliably. During the early years of GR, the failure to measure this effect in solar lines was siezed upon by some as reason to doubt the theory. Unfortunately, the measurement is not simple. Solar spectral lines are subject to the “limb effect”, a variation of spectral line wavelengths between the center of the solar disk and its edge or “limb”; this effect is actually a Doppler shift caused by complex convective and turbulent motions in the photosphere and lower chromosphere, and is expected to be minimized by observing at the solar limb, where the motions are predominantly transverse. The secret is to use strong, symmetrical lines, leading to unambiguous wavelength measurements. Successful measurements were finally made in 1962 and 1972 (TEGP 2.4 (c) [281]). In 1991, LoPresto et al. [172] measured the solar shift in agreement with LPI to about 2 percent by observing the oxygen triplet lines both in absorption in the limb and in emission just off the limb.
The most precise standard redshift test to date was the Vessot-Levine rocket experiment that took place in June 1976 [264]. A hydrogen-maser clock was flown on a rocket to an altitude of about 10,000 km and its frequency compared to a similar clock on the ground. The experiment took advantage of the masers’ frequency stability by monitoring the frequency shift as a function of altitude. A sophisticated data acquisition scheme accurately eliminated all effects of the first-order Doppler shift due to the rocket’s motion, while tracking data were used to determine the payload’s location and the velocity (to evaluate the potential difference ΔU, and the special relativistic time dilation). Analysis of the data yielded a limit |α| < 2 × 10^{−4}.
A “null” redshift experiment performed in 1978 tested whether the relative rates of two different clocks depended upon position. Two hydrogen maser clocks and an ensemble of three superconducting-cavity stabilized oscillator (SCSO) clocks were compared over a 10-day period. During the period of the experiment, the solar potential U/c^{2} changed sinusoidally with a 24-hour period by 3 × 10^{−13} because of the Earth’s rotation, and changed linearly at 3 × 10^{−12} per day because the Earth is 90 degrees from perihelion in April. However, analysis of the data revealed no variations of either type within experimental errors, leading to a limit on the LPI violation parameter |α^{H} − α^{SCSO}| < 2 × 10^{−2} [258]. This bound has been improved using more stable frequency standards, such as atomic fountain clocks [120, 216, 23]. The current bound, from comparing a Cesium atomic fountain with a Hydrogen maser for a year, is |α^{H} − α^{Cs}| < 2.1 × 10^{−5} [23].
The varying gravitational redshift of Earth-bound clocks relative to the highly stable millisecond pulsar PSR 1937+21, caused by the Earth’s motion in the solar gravitational field around the Earth-Moon center of mass (amplitude 4000 km), was measured to about 10 percent [251]. Two measurements of the redshift using stable oscillator clocks on spacecraft were made at the one percent level: One used the Voyager spacecraft in Saturn’s gravitational field [158], while another used the Galileo spacecraft in the Sun’s field [160].
The gravitational redshift could be improved to the 10^{−10} level using an array of laser cooled atomic clocks on board a spacecraft which would travel to within four solar radii of the Sun [180].
Modern advances in navigation using Earth-orbiting atomic clocks and accurate time-transfer must routinely take gravitational redshift and time-dilation effects into account. For example, the Global Positioning System (GPS) provides absolute positional accuracies of around 15 m (even better in its military mode), and 50 nanoseconds in time transfer accuracy, anywhere on Earth. Yet the difference in rate between satellite and ground clocks as a result of relativistic effects is a whopping 39 microseconds per day (46 µs from the gravitational redshift, and −7 µs from time dilation). If these effects were not accurately accounted for, GPS would fail to function at its stated accuracy. This represents a welcome practical application of GR! (For the role of GR in GPS, see [15, 16]; for a popular essay, see [287].)
Bounds on cosmological variation of fundamental constants of non-gravitational physics. For an in-depth review, see [262].
Constant k | Limit on \(\dot k/k\) (yr^{−1}) | Redshift | Method |
---|---|---|---|
(yr^{−1}) | |||
Fine structure constant (α_{EM} = e^{2}/ħc) | < 30 × 10^{−16} | 0 | |
< 0.5 × 10^{−16} | 0.15 | ||
< 3.4 × 10^{−16} | 0.45 | ^{187}Re decay in meteorites [205] | |
(6.4 ± 1.4) × 10^{−16} | 0.2–3.7 | ||
< 1.2 × 10^{−16} | 0.4–2.3 | ||
Weak interaction constant \(({\alpha _{\rm{W}}} = {G_{\rm{f}}}m_{\rm{p}}^2c/{\hbar ^3})\) | < 1 × 10^{−11} | 0.15 | Oklo Natural Reactor [72] |
< 5 × 10^{−12} | 10^{9} | ||
e-p mass ratio | < 3 × 10^{−15} | 2.6–3.0 | Spectra in distant quasars [135] |
Experimental bounds on varying constants come in two types: bounds on the present rate of variation, and bounds on the difference between today’s value and a value in the distant past. The main example of the former type is the clock comparison test, in which highly stable atomic clocks of different fundamental type are intercompared over periods ranging from months to years (variants of the null redshift experiment). If the frequencies of the clocks depend differently on the electromagnetic fine structure constant α_{EM}, the electron-proton mass ratio m_{e}/m_{P}, or the gyromagnetic ratio of the proton g_{P}, for example, then a limit on a drift of the fractional frequency difference translates into a limit on a drift of the constant(s). The dependence of the frequencies on the constants may be quite complex, depending on the atomic species involved. The most recent experiments have exploited the techniques of laser cooling and trapping, and of atom fountains, in order to achieve extreme clock stability, and compared the Rubidium-87 hyperfine transition [181], the Mercury-199 ion electric quadrupole transition [31], the atomic Hydrogen 1S–2S transition [111], or an optical transition in Ytterbium-171 [209], against the ground-state hyperfine transition in Cesium-133. These experiments show that, today, \({{\dot \alpha}_{{\rm{EM}}}}/{\alpha _{{\rm{EM}}}} < 3 \times {10^{- 15}}{\rm{y}}{{\rm{r}}^{- 1}}\).
The second type of bound involves measuring the relics of or signal from a process that occurred in the distant past and comparing the inferred value of the constant with the value measured in the laboratory today. One sub-type uses astronomical measurements of spectral lines at large redshift, while the other uses fossils of nuclear processes on Earth to infer values of constants early in geological history.
Earlier comparisons of spectral lines of different atoms or transitions in distant galaxies and quasars produced bounds α_{EM} or g_{p}(m_{e}/m_{P}) on the order of a part in 10 per Hubble time [298]. Dramatic improvements in the precision of astronomical and laboratory spectroscopy, in the ability to model the complex astronomical environments where emission and absorption lines are produced, and in the ability to reach large redshift have made it possible to improve the bounds significantly. In fact, in 1999, Webb et al. [269, 193] announced that measurements of absorption lines in Mg, Al, Si, Cr, Fe, Ni, and Zn in quasars in the redshift range 0.5 < Z < 3.5 indicated a smaller value of α_{EM} in earlier epochs, namely Δα_{EM}/α_{EM} = (−0.72 ± 0.18) × 10^{−5}, corresponding to \({{\dot \alpha}_{{\rm{EM}}}}/{\alpha _{{\rm{EM}}}} = (6.4 \pm 1.4) \times {10^{- 16}}{\rm{y}}{{\rm{r}}^{{\rm{- 1}}}}\) (assuming a linear drift with time). Measurements by other groups have so far failed to confirm this non-zero effect [242, 51, 219]; a recent analysis of Mg absorption systems in quasars at \(0.4 < Z < 2.3\,{\rm{gave}}\,{{\dot \alpha}_{{\rm{EM}}}}/{\alpha _{{\rm{EM}}}} = (- 0.6 \pm 0.6) \times {10^{- 16}}{\rm{y}}{{\rm{r}}^{- 1}}\) [242].
Another important set of bounds arises from studies of the “Oklo” phenomenon, a group of natural, sustained ^{235}U fission reactors that occurred in the Oklo region of Gabon, Africa, around 1.8 billion years ago. Measurements of ore samples yielded an abnormally low value for the ratio of two isotopes of Samarium, ^{149}Sm/^{147}Sm. Neither of these isotopes is a fission product, but ^{149}Sm can be depleted by a flux of neutrons. Estimates of the neutron fluence (integrated dose) during the reactors’ “on” phase, combined with the measured abundance anomaly, yield a value for the neutron cross-section for ^{149}Sm 1.8 billion years ago that agrees with the modern value. However, the capture cross-section is extremely sensitive to the energy of a low-lying level (E ∼ 0.1 eV), so that a variation in the energy of this level of only 20 meV over a billion years would change the capture cross-section from its present value by more than the observed amount. This was first analyzed in 1976 by Shlyakter [241]. Recent reanalyses of the Oklo data [72, 116, 210] lead to a bound on \({{\dot \alpha}_{{\rm{EM}}}}\) at the level of around 5 × 10^{−17} yr^{−1}.
In a similar manner, recent reanalyses of decay rates of ^{187}Re in ancient meteorites (4.5 billion years old) gave the bound \({{\dot \alpha}_{{\rm{EM}}}}/{\alpha _{{\rm{EM}}}} < 3.4 \times {10^{- 16}}{\rm{y}}{{\rm{r}}^{{\rm{- 1}}}}\) [205].
2.2 Theoretical frameworks for analyzing EEP
2.2.1 Schiff’s conjecture
Because the three parts of the Einstein equivalence principle discussed above are so very different in their empirical consequences, it is tempting to regard them as independent theoretical principles. On the other hand, any complete and self-consistent gravitation theory must possess sufficient mathematical machinery to make predictions for the outcomes of experiments that test each principle, and because there are limits to the number of ways that gravitation can be meshed with the special relativistic laws of physics, one might not be surprised if there were theoretical connections between the three sub-principles. For instance, the same mathematical formalism that produces equations describing the free fall of a hydrogen atom must also produce equations that determine the energy levels of hydrogen in a gravitational field, and thereby the ticking rate of a hydrogen maser clock. Hence a violation of EEP in the fundamental machinery of a theory that manifests itself as a violation of WEP might also be expected to show up as a violation of local position invariance. Around 1960, Schiff conjectured that this kind of connection was a necessary feature of any self-consistent theory of gravity. More precisely, Schiff’s conjecture states that any complete, self-consistent theory of gravity that embodies WEP necessarily embodies EEP. In other words, the validity of WEP alone guarantees the validity of local Lorentz and position invariance, and thereby of EEP.
If Schiff’s conjecture is correct, then Eötvös experiments may be seen as the direct empirical foundation for EEP, hence for the interpretation of gravity as a curved-spacetime phenomenon. Of course, a rigorous proof of such a conjecture is impossible (indeed, some special counter-examples are known [204, 194, 62]), yet a number of powerful “plausibility” arguments can be formulated.
2.2.1.1 Box 1. The THϵμ formalism
Coordinate system and conventions:
x^{0} = t: time coordinate associated with the static nature of the static spherically symmetric (SSS) gravitational field; x = (x, y, z): isotropic quasi-Cartesian spatial coordinates; spatial vector and gradient operations as in Cartesian space.
- Matter and field variables:
m_{0a}: rest mass of particle a.
e_{a}: charge of particle a.
\(x_a^\mu (t)\): world line of particle a.
\(\upsilon _a^\mu = dx_a^\mu/dt\): coordinate velocity of particle a.
A_{μ} =: electromagnetic vector potential; E = ∇A_{0} − ∂A/∂t, B = ∇ × A.
Gravitational potential:
U(x).
Arbitrary functions:
$$T(U),H(U),\epsilon (U),\mu (U);\,{\rm{EEP}}\,{\rm{is}}\,{\rm{satisfied}}\,{\rm{if}}\,\epsilon = \mu = {(H/T)^{1/2}}{\rm{for}}\,{\rm{all}}\,U.$$Action:
$$I = - \sum\limits_a {{m_{{0a}}}\int {{{(T - H\upsilon_a^2)}^{1/2}}dt}} + \sum\limits_a {{e_a}\int {{A_\mu}(x_a^\nu)\upsilon_a^\mu dt + {{(8\pi)}^{- 1}}\int {(\epsilon {E^2} - {\mu ^{- 1}}{B^2})} {d^4}x.}}$$Non-metric parameters:
where c_{0} = (T_{0}/H_{0})^{1/2} and subscript “0” refers to a chosen point in space. If EEP is satisfied, Γ_{0} ≡ Λ_{0} ≡ ϒ_{0} ≡ 0.$${\Gamma _0} = - c_0^2{\partial \over {\partial U}}\ln {[\epsilon {(T/H)^{1/2}}]_0},\quad {\Lambda _0} = - c_0^2{\partial \over {\partial U}}\ln {[\mu {(T/H)^{1/2}}]_0},\quad {\Upsilon _0} = 1 - {(T{H^{- 1}}\epsilon \mu)_0},$$
2.2.2 The THϵμ formalism
Lightman and Lee then calculated explicitly the rate of fall of a “test” body made up of interacting charged particles, and found that the rate was independent of the internal electromagnetic structure of the body (WEP) if and only if Equation (9) was satisfied. In other words, WEP ⇒ EEP and Schiff’s conjecture was verified, at least within the restrictions built into the formalism.
The redshift is the standard one (α = 0), independently of the nature of the clock if and only if Γ_{0} ≡ Λ_{0} ≡ 0. Thus the Vessot-Levine rocket redshift experiment sets a limit on the parameter combination |3Γ_{0} − Λ_{0}| (see Figure 3); the null-redshift experiment comparing hydrogen-maser and SCSO clocks sets a limit on \(\vert{\alpha _{\rm{H}}} - {\alpha _{{\rm{SCSO}}}}\vert = {3 \over 2}\vert{\Gamma _0} - {\Lambda _0}\vert\). Alvarez and Mann [7, 6, 8, 9, 10] extended the THϵμ formalism to permit analysis of such effects as the Lamb shift, anomalous magnetic moments and non-baryonic effects, and placed interesting bounds on EEP violations.
2.2.3 The c^{2} formalism
The electrodynamics given by Equation (17) can also be quantized, so that we may treat the interaction of photons with atoms via perturbation theory. The energy of a photon is ħ times its frequency ω, while its momentum is ħω/c. Using this approach, one finds that the difference in round trip travel times of light along the two arms of the interferometer in the Michelson-Morley experiment is given by L_{0}(υ^{2}/c)(c^{−2} − 1). The experimental null result then leads to the bound on (c^{−2} − 1) shown on Figure 2. Similarly the anisotropy in energy levels is clearly illustrated by the tensorial terms in Equations (18, 20); by evaluating \(\tilde E_{\rm{B}}^{{\rm{ES}}ij}\) for each nucleus in the various Hughes-Drever-type experiments and comparing with the experimental limits on energy differences, one obtains the extremely tight bounds also shown on Figure 2.
The behavior of moving atomic clocks can also be analyzed in detail, and bounds on (c^{−2} − 1) can be placed using results from tests of time dilation and of the propagation of light. In some cases, it is advantageous to combine the c^{2} framework with a “kinematical” viewpoint that treats a general class of boost transformations between moving frames. Such kinematical approaches have been discussed by Robertson, Mansouri and Sexl, and Will (see [279]).
2.2.4 The Standard Model Extension (SME)
Kostelecký and collaborators developed a useful and elegant framework for discussing violations of Lorentz symmetry in the context of the standard model of particle physics [63, 64, 155]. Called the Standard Model Extension (SME), it takes the standard SU(3) × SU(2) × U(1) field theory of particle physics, and modifies the terms in the action by inserting a variety of tensorial quantities in the quark, lepton, Higgs, and gauge boson sectors that could explicitly violate LLI. SME extends the earlier classical THϵμ and c^{2} frameworks, and the χ − g framework of Ni [194] to quantum field theory and particle physics. The modified terms split naturally into those that are odd under CPT (i.e. that violate CPT) and terms that are even under CPT. The result is a rich and complex framework, with many parameters to be analyzed and tested by experiment. Such details are beyond the scope of this review; for a review of SME and other frameworks, the reader is referred to the Living Review by Mattingly [182].
In the rest frame of the universe, these tensors have some form that is established by the global nature of the solutions of the overarching theory being used. In a frame that is moving relative to the universe, the tensors will have components that depend on the velocity of the frame, and on the orientation of the frame relative to that velocity.
2.3 EEP, particle physics, and the search for new interactions
Thus far, we have discussed EEP as a principle that strictly divides the world into metric and non-metric theories, and have implied that a failure of EEP might invalidate metric theories (and thus general relativity). On the other hand, there is mounting theoretical evidence to suggest that EEP is likely to be violated at some level, whether by quantum gravity effects, by effects arising from string theory, or by hitherto undetected interactions. Roughly speaking, in addition to the pure Einsteinian gravitational interaction, which respects EEP, theories such as string theory predict other interactions which do not. In string theory, for example, the existence of such EEP-violating fields is assured, but the theory is not yet mature enough to enable a robust calculation of their strength relative to gravity, or a determination of whether they are long range, like gravity, or short range, like the nuclear and weak interactions, and thus too short range to be detectable.
On the other hand, whether one views such effects as a violation of EEP or as effects arising from additional “matter” fields whose interactions, like those of the electromagnetic field, do not fully embody EEP, is to some degree a matter of semantics. Unlike the fields of the standard model of electromagnetic, weak and strong interactions, which couple to properties other than mass-energy and are either short range or are strongly screened, the fields inspired by string theory could be long range (if they remain massless by virtue of a symmetry, or at best, acquire a very small mass), and can couple to mass-energy, and thus can mimic gravitational fields. Still, there appears to be no way to make this precise.
As a result, EEP and related tests are now viewed as ways to discover or place constraints on new physical interactions, or as a branch of “non-accelerator particle physics”, searching for the possible imprints of high-energy particle effects in the low-energy realm of gravity. Whether current or proposed experiments can actually probe these phenomena meaningfully is an open question at the moment, largely because of a dearth of firm theoretical predictions.
2.3.1 The “fifth” force
On the phenomenological side, the idea of using EEP tests in this way may have originated in the middle 1980s, with the search for a “fifth” force. In 1986, as a result of a detailed reanalysis of Eötvös’ original data, Fischbach et al. [108] suggested the existence of a fifth force of nature, with a strength of about a percent that of gravity, but with a range (as defined by the range λ of a Yukawa potential, e^{−r/λ}/r) of a few hundred meters. This proposal dovetailed with earlier hints of a deviation from the inverse-square law of Newtonian gravitation derived from measurements of the gravity profile down deep mines in Australia, and with emerging ideas from particle physics suggesting the possible presence of very low-mass particles with gravitational-strength couplings. During the next four years numerous experiments looked for evidence of the fifth force by searching for composition-dependent differences in acceleration, with variants of the Eötvös experiment or with free-fall Galileo-type experiments. Although two early experiments reported positive evidence, the others all yielded null results. Over the range between one and 10^{4} meters, the null experiments produced upper limits on the strength of a postulated fifth force between 10^{−3} and 10^{−6} of the strength of gravity. Interpreted as tests of WEP (corresponding to the limit of infinite-range forces), the results of two representative experiments from this period, the free-fall Galileo experiment and the early Eöt-Wash experiment, are shown in Figure 1. At the same time, tests of the inverse-square law of gravity were carried out by comparing variations in gravity measurements up tall towers or down mines or boreholes with gravity variations predicted using the inverse square law together with Earth models and surface gravity data mathematically “continued” up the tower or down the hole. Despite early reports of anomalies, independent tower, borehole, and seawater measurements ultimately showed no evidence of a deviation. Analyses of orbital data from planetary range measurements, lunar laser ranging (LLR), and laser tracking of the LAGEOS satellite verified the inverse-square law to parts in 10^{8} over scales of 10^{3} to 10^{5} km, and to parts in 10^{9} over planetary scales of several astronomical units [250]. A consensus emerged that there was no credible experimental evidence for a fifth force of nature, of a type and range proposed by Fischbach et al. For reviews and bibliographies of this episode, see [107, 109, 110, 4, 278].
2.3.2 Short-range modifications of Newtonian gravity
Although the idea of an intermediate-range violation of Newton’s gravitational law was dropped, new ideas emerged to suggest the possibility that the inverse-square law could be violated at very short ranges, below the centimeter range of existing laboratory verifications of the 1/r^{2} behavior. One set of ideas [13, 11, 221, 220] posited that some of the extra spatial dimensions that come with string theory could extend over macroscopic scales, rather than being rolled up at the Planck scale of 10^{−33} cm, which was then the conventional viewpoint. On laboratory distances large compared to the relevant scale of the extra dimension, gravity would fall off as the inverse square, whereas on short scales, gravity would fall off as 1/R^{2+n}, where n is the number of large extra dimensions. Many models favored n = 1 or n = 2. Other possibilities for effective modifications of gravity at short range involved the exchange of light scalar particles.
Following these proposals, many of the high-precision, low-noise methods that were developed for tests of WEP were adapted to carry out laboratory tests of the inverse square law of Newtonian gravitation at millimeter scales and below. The challenge of these experiments has been to distinguish gravitation-like interactions from electromagnetic and quantum mechanical (Casimir) effects. No deviations from the inverse square law have been found to date at distances between 10 µm and 10 mm[171, 130, 129, 52, 170]. For a comprehensive review of both the theory and the experiments, see [3].
3 Tests of Post-Newtonian Gravity
3.1 Metric theories of gravity and the strong equivalence principle
3.1.1 Universal coupling and the metric postulates
- 1.
there exists a symmetric metric,
- 2.
test bodies follow geodesics of the metric, and
- 3.
in local Lorentz frames, the non-gravitational laws of physics are those of special relativity.
The property that all non-gravitational fields should couple in the same manner to a single gravitational field is sometimes called “universal coupling”. Because of it, one can discuss the metric as a property of spacetime itself rather than as a field over spacetime. This is because its properties may be measured and studied using a variety of different experimental devices, composed of different non-gravitational fields and particles, and, because of universal coupling, the results will be independent of the device. Thus, for instance, the proper time between two events is a characteristic of spacetime and of the location of the events, not of the clocks used to measure it.
Consequently, if EEP is valid, the non-gravitational laws of physics may be formulated by taking their special relativistic forms in terms of the Minkowski metric η and simply “going over” to new forms in terms of the curved spacetime metric g, using the mathematics of differential geometry. The details of this “going over” can be found in standard textbooks (see [189, 270], TEGP 3.2. [281]).
3.1.2 The strong equivalence principle
In any metric theory of gravity, matter and non-gravitational fields respond only to the spacetime metric g. In principle, however, there could exist other gravitational fields besides the metric, such as scalar fields, vector fields, and so on. If, by our strict definition of metric theory, matter does not couple to these fields, what can their role in gravitation theory be? Their role must be that of mediating the manner in which matter and non-gravitational fields generate gravitational fields and produce the metric; once determined, however, the metric alone acts back on the matter in the manner prescribed by EEP.
What distinguishes one metric theory from another, therefore, is the number and kind of gravitational fields it contains in addition to the metric, and the equations that determine the structure and evolution of these fields. From this viewpoint, one can divide all metric theories of gravity into two fundamental classes: “purely dynamical” and “prior-geometric”.
By “purely dynamical metric theory” we mean any metric theory whose gravitational fields have their structure and evolution determined by coupled partial differential field equations. In other words, the behavior of each field is influenced to some extent by a coupling to at least one of the other fields in the theory. By “prior geometric” theory, we mean any metric theory that contains “absolute elements”, fields or equations whose structure and evolution are given a priori, and are independent of the structure and evolution of the other fields of the theory. These “absolute elements” typically include flat background metrics η or cosmic time coordinates t.
General relativity is a purely dynamical theory since it contains only one gravitational field, the metric itself, and its structure and evolution are governed by partial differential equations (Einstein’s equations). Brans-Dicke theory and its generalizations are purely dynamical theories; the field equation for the metric involves the scalar field (as well as the matter as source), and that for the scalar field involves the metric. Rosen’s bimetric theory is a prior-geometric theory: It has a non-dynamical, Riemann-flat background metric η, and the field equations for the physical metric g involve η.
By discussing metric theories of gravity from this broad point of view, it is possible to draw some general conclusions about the nature of gravity in different metric theories, conclusions that are reminiscent of the Einstein equivalence principle, but that are subsumed under the name “strong equivalence principle”.
A theory which contains only the metric g yields local gravitational physics which is independent of the location and velocity of the local system. This follows from the fact that the only field coupling the local system to the environment is g, and it is always possible to find a coordinate system in which g takes the Minkowski form at the boundary between the local system and the external environment (neglecting inhomogeneities in the external gravitational field). Thus the asymptotic values of g_{μν} are constants independent of location, and are asymptotically Lorentz invariant, thus independent of velocity. General relativity is an example of such a theory.
A theory which contains the metric g and dynamical scalar fields φ_{A} yields local gravitational physics which may depend on the location of the frame but which is independent of the velocity of the frame. This follows from the asymptotic Lorentz invariance of the Minkowski metric and of the scalar fields, but now the asymptotic values of the scalar fields may depend on the location of the frame. An example is Brans-Dicke theory, where the asymptotic scalar field determines the effective value of the gravitational constant, which can thus vary as φ varies. On the other hand, a form of velocity dependence in local physics can enter indirectly if the asymptotic values of the scalar field vary with time cosmologically. Then the rate of variation of the gravitational constant could depend on the velocity of the frame.
A theory which contains the metric g and additional dynamical vector or tensor fields or prior-geometric fields yields local gravitational physics which may have both location and velocity-dependent effects.
- 1.
WEP is valid for self-gravitating bodies as well as for test bodies.
- 2.
The outcome of any local test experiment is independent of the velocity of the (freely falling) apparatus.
- 3.
The outcome of any local test experiment is independent of where and when in the universe it is performed.
The above discussion of the coupling of auxiliary fields to local gravitating systems indicates that if SEP is strictly valid, there must be one and only one gravitational field in the universe, the metric g. These arguments are only suggestive however, and no rigorous proof of this statement is available at present. Empirically it has been found that almost every metric theory other than GR introduces auxiliary gravitational fields, either dynamical or prior geometric, and thus predicts violations of SEP at some level (here we ignore quantum-theory inspired modifications to GR involving “R^{2}” terms). The one exception is Nordström’s 1913 conformally-flat scalar theory [195], which can be written purely in terms of the metric; the theory satisfies SEP, but unfortunately violates experiment by predicting no deflection of light. General relativity seems to be the only viable metric theory that embodies SEP completely. In Section 3.6, we shall discuss experimental evidence for the validity of SEP.
3.2 The parametrized post-Newtonian formalism
Despite the possible existence of long-range gravitational fields in addition to the metric in various metric theories of gravity, the postulates of those theories demand that matter and non-gravitational fields be completely oblivious to them. The only gravitational field that enters the equations of motion is the metric g. The role of the other fields that a theory may contain can only be that of helping to generate the spacetime curvature associated with the metric. Matter may create these fields, and they plus the matter may generate the metric, but they cannot act back directly on the matter. Matter responds only to the metric.
Thus the metric and the equations of motion for matter become the primary entities for calculating observable effects, and all that distinguishes one metric theory from another is the particular way in which matter and possibly other gravitational fields generate the metric.
The PPN Parameters and their significance (note that α_{3}has been shown twice to indicate that it is a measure of two effects).
Parameter | What it measures relative to GR | Value in GR | Value in semi-conservative theories | Value in fully conservative theories |
---|---|---|---|---|
ϒ | How much space-curvature produced by unit rest mass? | 1 | ϒ | ϒ |
β | How much “nonlinearity” in the superposition law for gravity? | 1 | β | β |
ξ | Preferred-location effects? | 0 | ξ | ξ |
α_{1} | Preferred-frame effects? | 0 | α_{1} | 0 |
α_{2} | 0 | α_{2} | 0 | |
α_{3} | 0 | 0 | 0 | |
α_{3} | Violation of conservation of total momentum? | 0 | 0 | 0 |
ζ_{1} | 0 | 0 | 0 | |
ζ_{2} | 0 | 0 | 0 | |
ζ_{3} | 0 | 0 | 0 | |
ζ_{4} | 0 | 0 | 0 |
- 1.
in GR,
- 2.
in any theory of gravity that possesses conservation laws for total momentum, called “semi-conservative” (any theory that is based on an invariant action principle is semi-conservative), and
- 3.
in any theory that in addition possesses six global conservation laws for angular momentum, called “fully conservative” (such theories automatically predict no post-Newtonian preferred-frame effects).
The PPN formalism was pioneered by Kenneth Nordtvedt [197], who studied the post-Newtonian metric of a system of gravitating point masses, extending earlier work by Eddington, Robertson and Schiff (TEGP 4.2 [281]). Will [274] generalized the framework to perfect fluids. A general and unified version of the PPN formalism was developed by Will and Nordtvedt. The canonical version, with conventions altered to be more in accord with standard textbooks such as [189], is discussed in detail in TEGP 4 [281]. Other versions of the PPN formalism have been developed to deal with point masses with charge, fluid with anisotropic stresses, bodies with strong internal gravity, and post-post-Newtonian effects (TEGP 4.2, 14.2 [281]).
3.2.1 Box 2. The Parametrized Post-Newtonian formalism
Coordinate system:
The framework uses a nearly globally Lorentz coordinate system in which the coordinates are (t, x^{1},x^{2},x^{3}). Three-dimensional, Euclidean vector notation is used throughout. All coordinate arbitrariness (“gauge freedom”) has been removed by specialization of the coordinates to the standard PPN gauge (TEGP 4.2 [281]). Units are chosen so that G = c = 1, where G is the physically measured Newtonian constant far from the solar system.
- Matter variables:
ρ: density of rest mass as measured in a local freely falling frame momentarily comoving with the gravitating matter.
υ^{i}= (dx^{i}/dt): coordinate velocity of the matter.
w^{i}: coordinate velocity of the PPN coordinate system relative to the mean rest-frame of the universe.
p: pressure as measured in a local freely falling frame momentarily comoving with the matter.
Π: internal energy per unit rest mass (it includes all forms of non-rest-mass, non-gravitational energy, e.g., energy of compression and thermal energy).
PPN parameters:
$$\gamma, \,\beta, \,\xi, \,{\alpha _1},\,{\alpha _2},\,{\alpha _3},\,{\varsigma _1},\,{\varsigma _2},\,{\varsigma _3},\,{\varsigma _4}.$$Metric:
$$\begin{array}{*{20}c} {{g_{00}} = - 1 + 2U - 2\beta {U^2} - 2\xi {\Phi _W} + (2\gamma + 2 + {\alpha _3} + {\zeta _1} - 2\xi){\Phi _1} + 2(3\gamma - 2\beta + 1 + {\zeta _2} + \xi){\Phi _2}}\\ {\quad \quad + 2(1 + {\zeta _3}){\Phi _3} + 2(3\gamma + 3{\zeta _4} - 2\xi){\Phi _4} - ({\zeta _1} - 2\xi){\mathcal A} - ({\alpha _1} - {\alpha _2} - {\alpha _3}){w^2}U - {\alpha _2}{w^i}{w^j}{U_{ij}}}\\ {\quad \quad + (2{\alpha _3} - {\alpha _1}){w^i}{V_i} + {\mathcal O}({\epsilon ^3}),}\\ {{g_{0i}} = - {1 \over 2}(4\gamma + 3 + {\alpha _1} - {\alpha _2} + {\zeta _1} - 2\xi){V_i} - {1 \over 2}(1 + {\alpha _2} - {\zeta _1} + 2\xi){W_i} - {1 \over 2}({\alpha _1} - 2{\alpha _2}){w^i}U}\\ {\quad \quad - {\alpha _2}{w^j}{U_{ij}} + {\mathcal O}({\epsilon ^{5/2}}),}\\ {{g_{ij}} = (1 + 2\gamma U){\delta _{ij}} + {\mathcal O}({\epsilon ^2}).}\\ \end{array}$$Metric potentials:
$$\begin{array}{*{20}c}{U = \int {{{\rho \prime} \over {\vert {\rm{x}} - {\rm{x}}\prime \vert}}{d^3}x\prime ,\quad \quad \quad \quad \quad \quad \quad}} \\ {{U_{ij}} = \int {{\rho \prime {{(x - x\prime)}_i}{{(x - x\prime)}_j}} \over {\vert {\rm{x}} - {\rm{x}}\prime {\vert ^3}}}{d^3}x\prime ,\quad \quad \quad} \\ {\quad \quad \quad \quad \quad {\Phi _W} = \int {{\rho \prime \rho \prime \prime ({\rm{x}} - {\rm{x}}\prime)} \over {\vert {\rm{x}} - {\rm{x}}\prime {\vert ^3}}}\cdot\left({{{{\rm{x}}\prime - {\rm{x}}\prime \prime} \over {\vert {\rm{x}} - {\rm{x}}\prime \prime \vert}} - {{{\rm{x}} - {\rm{x}}\prime \prime} \over {\vert {\rm{x}}\prime - {\rm{x}}\prime \prime \vert}}} \right){d^3}x\prime {d^3}x\prime \prime ,} \\ {A = \int {{\rho \prime {{[{\rm{v}}\prime \cdot({\rm{x}} - {\rm{x}}\prime)]}^2}} \over {\vert {\rm{x}} - {\rm{x}}\prime {\vert ^3}}}{d^3}x\prime ,\quad \quad \quad \quad} \\ {{\Phi _1} = \int {{{\rho \prime \upsilon {\prime ^2}} \over {\vert {\rm{x}} - {\rm{x}}\prime \vert}}{d^3}x\prime ,\quad \quad \quad \quad \quad \quad}} \\ {{\Phi _2} = \int {{{\rho \prime U\prime} \over {\vert {\rm{x}} - {\rm{x}}\prime \vert}}{d^3}x\prime ,\quad \quad \quad \quad \quad \quad}} \\ {{\Phi _3} = \int {{{\rho \prime \Pi \prime} \over {\vert {\rm{x}} - {\rm{x}}\prime \vert}}} {d^3}x\prime ,\quad \quad \quad \quad \quad \quad} \\ {{\Phi _4} = \int {{{p\prime} \over {\vert {\rm{x}} - {\rm{x}}\prime \vert}}{d^3}x\prime ,\quad \quad \quad \quad \quad \quad}} \\ {{V_i} = \int {{{\rho \prime \upsilon {\prime _i}} \over {\vert {\rm{x}} - {\rm{x}}\prime \vert}}{d^3}x\prime ,\quad \quad \quad \quad \quad \quad}} \\ {\,\,\,\,{W_i} = \int {{\rho \prime [{\rm{v}}\prime \cdot({\rm{x}} - {\rm{x}}\prime)]{{(x - x\prime)}_i}} \over {\vert {\rm{x}} - {\rm{x}}\prime {\vert ^3}}}{d^3}x\prime .\quad \quad} \\ \end{array}$$Stress-energy tensor (perfect fluid):
$$\begin{array}{*{20}c} {{T^{00}} = \rho (1 + \prod + {v^2} + 2U)}\\ {{T^{0i}} = \rho {v^i}\left({1 + \prod + {v^2} + 2U + {p \over p}} \right),}\\ {{T^{ij}} = \rho {v^i}{v^j}\left({1 + \prod + {v^2} + 2U + {p \over p}} \right) + p{\delta ^{ij}}(1 - 2\gamma U).}\\ \end{array}$$- Equations of motion:
Stressed matter: T^{μν;ν} = 0.
Test bodies: \({{{d^2}{x^\mu}} \over {d{\lambda ^2}}} + {\Gamma ^\mu}\nu \lambda {{d{x^\nu}} \over {d\lambda}}{{d{x^\lambda}} \over {d\lambda}} = 0\).
Maxwell’s equations: F^{μν;ν} = 4πJ^{μ}, F_{μν} = A_{ν;μ} − A_{μ;ν}.
3.3 Competing theories of gravity
One of the important applications of the PPN formalism is the comparison and classification of alternative metric theories of gravity. The population of viable theories has fluctuated over the years as new effects and tests have been discovered, largely through the use of the PPN framework, which eliminated many theories thought previously to be viable. The theory population has also fluctuated as new, potentially viable theories have been invented.
A full compendium of alternative theories circa 1981 is given in TEGP 5 [281].
Many alternative metric theories developed during the 1970s and 1980s could be viewed as “straw-man” theories, invented to prove that such theories exist or to illustrate particular properties. Few of these could be regarded as well-motivated theories from the point of view, say, of field theory or particle physics.
A number of theories fall into the class of “prior-geometric” theories, with absolute elements such as a flat background metric in addition to the physical metric. Most of these theories predict “preferred-frame” effects, that have been tightly constrained by observations (see Section 3.6.2). An example is Rosen’s bimetric theory.
A large number of alternative theories of gravity predict gravitational wave emission substantially different from that of general relativity, in strong disagreement with observations of the binary pulsar (see Section 7).
Scalar-tensor modifications of GR have become very popular in unification schemes such as string theory, and in cosmological model building. Because the scalar fields could be massive, the potentials in the post-Newtonian limit could be modified by Yukawa-like terms.
Vector-tensor theories have attracted recent attention, in the spirit of the SME (see Section 2.2.4), as models for violations of Lorentz invariance in the gravitational sector.
3.3.1 General relativity
The metric g is the sole dynamical field, and the theory contains no arbitrary functions or parameters, apart from the value of the Newtonian coupling constant G, which is measurable in laboratory experiments. Throughout this article, we ignore the cosmological constant Λ_{C}. We do this despite recent evidence, from supernova data, of an accelerating universe, which would indicate either a non-zero cosmological constant or a dynamical “dark energy” contributing about 70 percent of the critical density. Although Λ_{C} has significance for quantum field theory, quantum gravity, and cosmology, on the scale of the solar-system or of stellar systems its effects are negligible, for the values of Ac inferred from supernova observations.
Metric theories and their PPN parameter values (α_{3} = ζ_{i} = 0 for all cases). The parameters γ′, β′, α′_{1}, and α′_{2}denote complicated functions of u and of the arbitrary constants. Here Λ is not the cosmological constant Λ_{C}, but is defined by Equation (37).
Theory | Arbitrary functions or constants | Cosmic matching parameters | PPN parameters | ||||
---|---|---|---|---|---|---|---|
γ | β | ξ | α_{1} | α_{2} | |||
General relativity | none | none | 1 | 1 | 0 | 0 | 0 |
Scalar-tensor | |||||||
Brans-Dicke | ω_{BD} | ϕ_{0} | \({{1 + {\omega _{{\rm{BD}}}}} \over {2 + {\omega _{{\rm{BD}}}}}}\) | 1 | 0 | 0 | 0 |
General | A(φ), V(φ) | ϕ_{0} | \({{1 + \omega} \over {2 + \omega}}\) | 1+Λ | 0 | 0 | 0 |
Vector-tensor | |||||||
Unconstrained | ω, c_{1}, c_{2}, c_{3}, c_{4} | u | γ′ | β′ | 0 | \(\alpha _1\prime\) | \(\alpha _2\prime\) |
Einstein-Æther | c_{1},c_{2},c_{3},c_{4} | none | 1 | 1 | 0 | \(\alpha _1\prime\) | \(\alpha _2\prime\) |
Rosen’s bimetric | none | c_{0}, c_{1} | 1 | 1 | 0 | 0 | \({{{c_0}} \over {{c_1}}} - 1\) |
3.3.2 Scalar-tensor theories
Negative values of β_{0} correspond to a “locally unstable” scalar potential (the overall theory is still stable in the sense of having no tachyons or ghosts). In this case, objects such as neutron stars can experience a “spontaneous scalarization”, whereby the interior values of φ can take on values very different from the exterior values, through non-linear interactions between strong gravity and the scalar field, dramatically affecting the stars’ internal structure and leading to strong violations of SEP. On the other hand, in the case β_{0} < 0, one must confront that fact that, with an unstable φ potential, cosmological evolution would presumably drive the system away from the peak where α_{0} ≈ 0, toward parameter values that could be excluded by solar system experiments.
Scalar fields coupled to gravity or matter are also ubiquitous in particle-physics-inspired models of unification, such as string theory [254, 176, 85, 82, 83]. In some models, the coupling to matter may lead to violations of EEP, which could be tested or bounded by the experiments described in Section 2.1. In many models the scalar field could be massive; if the Compton wavelength is of macroscopic scale, its effects are those of a “fifth force”. Only if the theory can be cast as a metric theory with a scalar field of infinite range or of range long compared to the scale of the system in question (solar system) can the PPN framework be strictly applied. If the mass of the scalar field is sufficiently large that its range is microscopic, then, on solar-system scales, the scalar field is suppressed, and the theory is essentially equivalent to general relativity.
3.3.3 Vector-tensor theories
General vector-tensor theory; ω, τ, ϵ, η (see TEGP 5.4 [281])
The gravitational Lagrangian for this class of theories had the form R+ωu_{μ}u^{μ}R+ηu^{μ}u^{ν}R_{μν} −ϵF_{μν}F^{μν} + τ∇_{μ}u_{ν}∇^{μ}u^{ν}, where F_{μν} = ∇_{μ}u_{ν}∇ − ∇_{ν}u_{μ}, corresponding to the values c_{1} = 2ϵ − τ, c_{2} = −η, c_{1} + c_{2} + c_{3} = −τ, c_{4} = 0. In these theories γ, β, α_{1}, and α_{2} are complicated functions of the parameters and of u^{2} = −u^{μ}u_{μ}, while the rest vanish.
Will-Nordtvedt theory (see [290])
This is the special case c_{1} = −1, c_{2} = c_{3} = c_{4} = 0. In this theory, the PPN parameters are given by γ = β = 1, α_{2} = u^{2}/(1 + u^{2}/2), and zero for the rest.
Hellings-Nordtvedt theory; ω (see [128])
This is the special case c_{1} = 2, c_{2} = 2ω, c_{1} + c_{2} + c_{3} = 0 = c_{4}. Here γ, β, α_{1} and α_{2} are complicated functions of the parameters and of u^{2}, while the rest vanish.
The Einstein-Æther theories were motivated in part by a desire to explore possibilities for violations of Lorentz invariance in gravity, in parallel with similar studies in matter interactions, such as the SME. The general class of theories was analyzed by Jacobson and collaborators [137, 183, 138, 99, 113], motivated in part by [156].
By requiring that gravitational wave modes have real (as opposed to imaginary) frequencies, one can impose the bounds c_{1}/(c_{1} + c_{4}) ≥ 0 and (c_{1} + c_{2} + c_{3})/(c_{1} + c_{4}) ≥ 0. Considerations of positivity of energy impose the constraints c_{1} > 0, c_{1} + c_{4} > 0 and c_{1} + c_{2} + c_{3} > 0.
3.4 Tests of the parameter γ
With the PPN formalism in hand, we are now ready to confront gravitation theories with the results of solar-system experiments. In this section we focus on tests of the parameter γ, consisting of the deflection of light and the time delay of light.
3.4.1 The deflection of light
It is interesting to note that the classic derivations of the deflection of light that use only the corpuscular theory of light (Cavendish 1784, von Soldner 1803 [277]), or the principle of equivalence (Einstein 1911), yield only the “1/2” part of the coefficient in front of the expression in Equation (46). But the result of these calculations is the deflection of light relative to local straight lines, as established for example by rigid rods; however, because of space curvature around the Sun, determined by the PPN parameter γ, local straight lines are bent relative to asymptotic straight lines far from the Sun by just enough to yield the remaining factor “γ/2”. The first factor “1/2” holds in any metric theory, the second “γ/2” varies from theory to theory. Thus, calculations that purport to derive the full deflection using the equivalence principle alone are incorrect.
However, the development of radio-interferometery, and later of very-long-baseline radio interferometry (VLBI), produced greatly improved determinations of the deflection of light. These techniques now have the capability of measuring angular separations and changes in angles to accuracies better than 100 microarcseconds. Early measurements took advantage of a series of heavenly coincidences: Each year, groups of strong quasistellar radio sources pass very close to the Sun (as seen from the Earth), including the group 3C273, 3C279, and 3C48, and the group 0111+02, 0119+11, and 0116+08. As the Earth moves in its orbit, changing the lines of sight of the quasars relative to the Sun, the angular separation δθ between pairs of quasars varies (see Equation (48)). The time variation in the quantities d, d_{r}, χ, and Φ_{r} in Equation (48) is determined using an accurate ephemeris for the Earth and initial directions for the quasars, and the resulting prediction for δθ as a function of time is used as a basis for a least-squares fit of the measured δθ, with one of the fitted parameters being the coefficient \({1 \over 2}(1 + \gamma)\). A number of measurements of this kind over the period 1969–1975 yielded an accurate determination of the coefficient \({1 \over 2}(1 + \gamma)\). A 1995 VLBI measurement using 3C273 and 3C279 yielded (1 + γ)/2 = 0.9996 ± 0.0017 [164].
In recent years, transcontinental and intercontinental VLBI observations of quasars and radio galaxies have been made primarily to monitor the Earth’s rotation (“VLBI” in Figure 5). These measurements are sensitive to the deflection of light over almost the entire celestial sphere (at 90° from the Sun, the deflection is still 4 milliarcseconds). A 2004 analysis of almost 2 million VLBI observations of 541 radio sources, made by 87 VLBI sites yielded (1 + γ)/2 = 0.99992 ± 0.00023, or equivalently, γ − 1 = (−1.7 ± 4.5) × 10^{−4} [240].
Analysis of observations made by the Hipparcos optical astrometry satellite yielded a test at the level of 0.3 percent [115]. A VLBI measurement of the deflection of light by Jupiter was reported; the predicted deflection of about 300 microarcseconds was seen with about 50 percent accuracy [257]. The results of light-deflection measurements are summarized in Figure 5.
3.4.2 The time delay of light
In the two decades following Irwin Shapiro’s 1964 discovery of this effect as a theoretical consequence of GR, several high-precision measurements were made using radar ranging to targets passing through superior conjunction. Since one does not have access to a “Newtonian” signal against which to compare the round-trip travel time of the observed signal, it is necessary to do a differential measurement of the variations in round-trip travel times as the target passes through superior conjunction, and to look for the logarithmic behavior of Equation (50). In order to do this accurately however, one must take into account the variations in round-trip travel time due to the orbital motion of the target relative to the Earth. This is done by using radar-ranging (and possibly other) data on the target taken when it is far from superior conjunction (i.e. when the time-delay term is negligible) to determine an accurate ephemeris for the target, using the ephemeris to predict the PPN coordinate trajectory x_{e}(t) near superior conjunction, then combining that trajectory with the trajectory of the Earth x_{⊕}(t) to determine the Newtonian round-trip time and the logarithmic term in Equation (50). The resulting predicted round-trip travel times in terms of the unknown coefficient \({1 \over 2}(1 + \gamma)\) are then fit to the measured travel times using the method of least-squares, and an estimate obtained for \({1 \over 2}(1 + \gamma)\).
The targets employed included planets, such as Mercury or Venus, used as passive reflectors of the radar signals (“passive radar”), and artificial satellites, such as Mariners 6 and 7, Voyager 2, the Viking Mars landers and orbiters, and the Cassini spacecraft to Saturn, used as active retransmitters of the radar signals (“active radar”).
The results for the coefficient \({1 \over 2}(1 + \gamma)\) of all radar time-delay measurements performed to date (including a measurement of the one-way time delay of signals from the millisecond pulsar PSR 1937+21) are shown in Figure 5 (see TEGP 7.2 [281] for discussion and references). The 1976 Viking experiment resulted in a 0.1 percent measurement [222].
A significant improvement was reported in 2003 from Doppler tracking of the Cassini spacecraft while it was on its way to Saturn [29], with a result γ − 1 = (2.1 ± 2.3) × 10^{−5}. This was made possible by the ability to do Doppler measurements using both X-band (7175 MHz) and Ka-band (34316 MHz) radar, thereby significantly reducing the dispersive effects of the solar corona. In addition, the 2002 superior conjunction of Cassini was particularly favorable: With the spacecraft at 8.43 astronomical units from the Sun, the distance of closest approach of the radar signals to the Sun was only 1.6 R_{⊕}.
From the results of the Cassini experiment, we can conclude that the coefficient \({1 \over 2}(1 + \gamma)\) must be within at most 0.0012 percent of unity. Scalar-tensor theories must have ω > 40000 to be compatible with this constraint.
3.4.3 Shapiro time delay and the speed of gravity
In 2001, Kopeikin [147] suggested that a measurement of the time delay of light from a quasar as the light passed by the planet Jupiter could be used to measure the speed of the gravitational interaction. He argued that, since Jupiter is moving relative to the solar system, and since gravity propagates with a finite speed, the gravitational field experienced by the light ray should be affected by gravity’s speed, since the field experienced at one time depends on the location of the source a short time earlier, depending on how fast gravity propagates. According to his calculations, there should be a post^{1/2}-Newtonian correction to the normal Shapiro time-delay formula (49) which depends on the velocity of Jupiter and on the velocity of gravity. On September 8, 2002, Jupiter passed almost in front of a quasar, and Kopeikin and Fomalont made precise measurements of the Shapiro delay with picosecond timing accuracy, and claimed to have measured the correction term to about 20 percent [112, 153, 148, 149].
Current limits on the PPN parameters. Here η_{N}is a combination of other parameters given by η_{N} = 4β − γ − 3 − 10ξ/3 − α_{1} +2α_{2}/3 − 2ζ_{1}/3 − ζ_{2}/3.
Parameter | Effect | Limit | Remarks |
---|---|---|---|
γ−1 | time delay | 2.3 × 10^{−5} | Cassini tracking |
light deflection | 4 × 10^{−4} | VLBI | |
β−1 | perihelion shift | 3 × 10^{−3} | J_{2} = 10^{−7} from helioseismology |
Nordtvedt effect | 2.3 × 10^{−4} | η_{N} = 4β − γ − 3 assumed | |
ξ | Earth tides | 10^{−3} | gravimeter data |
α_{1} | orbital polarization | 10^{−4} | Lunar laser ranging |
2 × 10^{−4} | PSR J2317+1439 | ||
α_{2} | spin precession | 4 × 10^{−7} | solar alignment with ecliptic |
α_{3} | pulsar acceleration | 4 × 10^{−20} | pulsar Ṗ statistics |
η_{N} | Nordtvedt effect | 9 × 10^{−4} | lunar laser ranging |
ζ_{l} | — | 2 × 10^{−2} | combined PPN bounds |
ζ_{2} | binary acceleration | 4 × 10^{−5} | \({\ddot P_{\rm{P}}}\) for PSR 1913+16 |
ζ_{3} | Newton’s 3rd law | 10^{−}^{8} | lunar acceleration |
ζ_{4} | — | — | not independent (see Equation (58)) |
3.5 The perihelion shift of Mercury
The explanation of the anomalous perihelion shift of Mercury’s orbit was another of the triumphs of GR. This had been an unsolved problem in celestial mechanics for over half a century, since the announcement by Le Verrier in 1859 that, after the perturbing effects of the planets on Mercury’s orbit had been accounted for, and after the effect of the precession of the equinoxes on the astronomical coordinate system had been subtracted, there remained in the data an unexplained advance in the perihelion of Mercury. The modern value for this discrepancy is 43 arcseconds per century. A number of ad hoc proposals were made in an attempt to account for this excess, including, among others, the existence of a new planet Vulcan near the Sun, a ring of planetoids, a solar quadrupole moment and a deviation from the inverse-square law of gravitation, but none was successful. General relativity accounted for the anomalous shift in a natural way without disturbing the agreement with other planetary observations.
The first term in Equation (51) is the classical relativistic perihelion shift, which depends upon the PPN parameters γ and β. The second term depends upon the ratio of the masses of the two bodies; it is zero in any fully conservative theory of gravity (α_{1} ≡ α_{2} ≡ α_{3} ≡ ζ_{2} ≡ 0); it is also negligible for Mercury, since µ/m ≈ m_{Merc}/M_{⊙} ≈ 2 × 10^{−7}. We shall drop this term henceforth.
3.6 Tests of the strong equivalence principle
The next class of solar-system experiments that test relativistic gravitational effects may be called tests of the strong equivalence principle (SEP). In Section 3.1.2 we pointed out that many metric theories of gravity (perhaps all except GR) can be expected to violate one or more aspects of SEP. Among the testable violations of SEP are a violation of the weak equivalence principle for gravitating bodies that leads to perturbations in the Earth-Moon orbit, preferred-location and preferred-frame effects in the locally measured gravitational constant that could produce observable geophysical effects, and possible variations in the gravitational constant over cosmological timescales.
3.6.1 The Nordtvedt effect and the lunar Eötvös experiment
Since August 1969, when the first successful acquisition was made of a laser signal reflected from the Apollo 11 retroreflector on the Moon, the LLR experiment has made regular measurements of the round-trip travel times of laser pulses between a network of observatories and the lunar retroreflectors, with accuracies that are at the 50 ps (1 cm) level, and that may soon approach 5 ps (1 mm). These measurements are fit using the method of least-squares to a theoretical model for the lunar motion that takes into account perturbations due to the Sun and the other planets, tidal interactions, and post-Newtonian gravitational effects. The predicted round-trip travel times between retroreflector and telescope also take into account the librations of the Moon, the orientation of the Earth, the location of the observatories, and atmospheric effects on the signal propagation. The “Nordtvedt” parameter η_{N} along with several other important parameters of the model are then estimated in the least-squares method.
Numerous ongoing analyses of the data find no evidence, within experimental uncertainty, for the Nordtvedt effect [295, 296] (for earlier results see [95, 294, 192]). These results represent a limit on a possible violation of WEP for massive bodies of about 1.4 parts in 10^{13} (compare Figure 1).
However, at this level of precision, one cannot regard the results of LLR as a “clean” test of SEP until one eliminates the possibility of a compensating violation of WEP for the two bodies, because the chemical compositions of the Earth and Moon differ in the relative fractions of iron and silicates. To this end, the Eöt-Wash group carried out an improved test of WEP for laboratory bodies whose chemical compositions mimic that of the Earth and Moon. The resulting bound of 1.4 parts in 10^{13} [19, 2] from composition effects reduces the ambiguity in the LLR bound, and establishes the firm SEP test at the level of about 2 parts in 10^{13}. These results can be summarized by the Nordtvedt parameter bound |η_{N}| = (4.4 ± 4.5) × 10^{−4}.
In the future, the Apache Point Observatory for Lunar Laser ranging Operation (APOLLO) project, a joint effort by researchers from the Universities of Washington, Seattle, and California, San Diego, plans to use enhanced laser and telescope technology, together with a good, high-altitude site in New Mexico, to improve the LLR bound by as much as an order of magnitude [296].
In GR, the Nordtvedt effect vanishes; at the level of several centimeters and below, a number of non-null general relativistic effects should be present [201].
Tests of the Nordtvedt effect for neutron stars have also been carried out using a class of systems known as wide-orbit binary millisecond pulsars (WBMSP), which are pulsar-white-dwarf binary systems with small orbital eccentricities. In the gravitational field of the galaxy, a non-zero Nordtvedt effect can induce an apparent anomalous eccentricity pointed toward the galactic center [86], which can be bounded using statistical methods, given enough WBMSPs (see [243] for a review and references). Using data from 21 WBMSPs, including recently discovered highly circular systems, Stairs et al. [244] obtained the bound Δ < 5.6 × 10^{−3}, where Δ = η_{N}(E_{g}/M)_{NS}. Because (E_{g}/M)_{NS} ∼ 0.1 for typical neutron stars, this bound does not compete with the bound on η_{N} from LLR; on the other hand, it does test SEP in the strong-field regime because of the presence of the neutron stars.
3.6.2 Preferred-frame and preferred-location effects
Some theories of gravity violate SEP by predicting that the outcomes of local gravitational experiments may depend on the velocity of the laboratory relative to the mean rest frame of the universe (preferred-frame effects) or on the location of the laboratory relative to a nearby gravitating body (preferred-location effects). In the post-Newtonian limit, preferred-frame effects are governed by the values of the PPN parameters α_{1}, α_{2}, and α_{3}, and some preferred-location effects are governed by ξ (see Table 2).
The most important such effects are variations and anisotropies in the locally-measured value of the gravitational constant which lead to anomalous Earth tides and variations in the Earth’s rotation rate, anomalous contributions to the orbital dynamics of planets and the Moon, self-accelerations of pulsars, and anomalous torques on the Sun that would cause its spin axis to be randomly oriented relative to the ecliptic (see TEGP 8.2, 8.3, 9.3, and 14.3 (c) [281]). An bound on α_{3} of 4 × 10^{−20} from the period derivatives of 21 millisecond pulsars was reported in [26, 244]; improved bounds on α_{1} were achieved using LLR data [191], and using observations of the circular binary orbit of the pulsar J2317+1439 [25]. Negative searches for these effects have produced strong constraints on the PPN parameters (see Table 4).
3.6.3 Constancy of the Newtonian gravitational constant
Most theories of gravity that violate SEP predict that the locally measured Newtonian gravitational constant may vary with time as the universe evolves. For the scalar-tensor theories listed in Table 3, the predictions for Ġ/G can be written in terms of time derivatives of the asymptotic scalar field. Where G does change with cosmic evolution, its rate of variation should be of the order of the expansion rate of the universe, i.e. Ġ/G ∼ H_{0}, where H_{0} is the Hubble expansion parameter and is given by H_{0} = 100 h km s^{−1} Mpc^{−1} = 1.02 × 10^{−10}h yr^{−1}, where current observations of the expansion of the universe give h ≈ 0.73 ± 0.03.
Constancy of the gravitational constant. For binary pulsar data, the bounds are dependent upon the theory of gravity in the strong-field regime and on neutron star equation of state. Big-bang nucleosynthesis bounds assume specific form for time dependence of G.
The best limits on a current Ġ/G come from LLR measurements (for earlier results see [95, 294, 192]). These have largely supplanted earlier bounds from ranging to the 1976 Viking landers (see TEGP, 14.3 (c) [281]), which were limited by uncertain knowledge of the masses and orbits of asteroids. However, improvements in knowledge of the asteroid belt, combined with continuing radar observations of planets and spacecraft, notably the Mars Global Surveyor (1998–2003) and Mars Odyssey (2002–present), may enable a bound on Ġ/G at the level of a part in 10^{13} per year. For an initial analysis along these lines, see [212]. It has been suggested that radar observations of the planned 2012 Bepi-Colombo Mercury orbiter mission over a two-year integration with 6 cm rms accuracy in range could yield Δ(Ġ/G) < 10^{−13} yr^{−1}; an eight-year mission could improve this by a factor 15 [187, 17].
Although bounds on Ġ/G from solar-system measurements can be correctly obtained in a phenomenological manner through the simple expedient of replacing G by G_{0} + Ġ_{0}(t − t_{0}) in Newton’s equations of motion, the same does not hold true for pulsar and binary pulsar timing measurements. The reason is that, in theories of gravity that violate SEP, such as scalar-tensor theories, the “mass” and moment of inertia of a gravitationally bound body may vary with variation in G. Because neutron stars are highly relativistic, the fractional variation in these quantities can be comparable to ΔG/G, the precise variation depending both on the equation of state of neutron star matter and on the theory of gravity in the strong-field regime. The variation in the moment of inertia affects the spin rate of the pulsar, while the variation in the mass can affect the orbital period in a manner that can subtract from the direct effect of a variation in G, given by Ṗ_{b}/P_{b} = −2Ġ/G [200]. Thus, the bounds quoted in Table 5 for the binary pulsar PSR 1913+16 and others [143] (see also [87]) are theory-dependent and must be treated as merely suggestive.
In a similar manner, bounds from helioseismology and big-bang nucleosynthesis (BBN) assume a model for the evolution of G over the multi-billion year time spans involved. For example, the concordance of predictions for light elements produced around 3 minutes after the big bang with the abundances observed indicate that G then was within 20 percent of G today. Assuming a power-law variation of G ∼ t^{−α} then yields a bound on Ġ/G today shown in Table 5.
3.7 Other tests of post-Newtonian gravity
3.7.1 Search for gravitomagnetism
Gravitomagnetism plays a role in a variety of measured relativistic effects involving moving material sources, such as the Earth-Moon system and binary pulsar systems. Nordtvedt [199, 198] has argued that, if the gravitomagnetic potential (55) were turned off, then there would be anomalous orbital effects in LLR and binary pulsar data.
The Relativity Gyroscope Experiment (Gravity Probe B or GPB) at Stanford University, in collaboration with NASA and Lockheed-Martin Corporation [246], recently completed a space mission to detect this frame-dragging or Lense-Thirring precession, along with the “geodetic” precession (see Section 3.7.2). A set of four superconducting-niobium-coated, spherical quartz gyroscopes were flown in a polar Earth orbit (642 km mean altitude, 0.0014 eccentricity), and the precessions of the gyroscopes relative to a distant guide star (HR 8703, IM Pegasi) were measured. For the given orbit, the predicted secular angular precession of the gyroscopes is in a direction perpendicular to the orbital plane at a rate \({1 \over 2}(1 + \gamma + {1 \over 4}{\alpha _1}) \times 41 \times {10^{- 3}}\) arcsec yr^{−1}. The accuracy goal of the experiment is about 0.5 milliarcseconds per year. The spacecraft was launched on April 20, 2004, and the mission ended in September 2005, as scheduled, when the remaining liquid helium boiled off.
It is too early to know whether the relativistic precessions were measured in the amount predicted by GR, because an important calibration of the instrument exploits the effect of the aberration of starlight on the pointing of the on-board telescope toward the guide star, and completing this calibration required the full mission data set. In addition, part of the measured effect includes the motion of the guide star relative to distant inertial frames. This was measured before, during and after the mission separately by radio astronomers at Harvard/SAO and elsewhere using VLBI, and the results of those measurements were to be strictly embargoed until the GPB team has completed its analysis of the gyro data. Final results from the experiment are expected in 2006.
Another way to look for frame-dragging is to measure the precession of orbital planes of bodies circling a rotating body. One implementation of this idea is to measure the relative precession, at about 31 milliarcseconds per year, of the line of nodes of a pair of laser-ranged geodynamics satellites (LAGEOS), ideally with supplementary inclination angles; the inclinations must be supplementary in order to cancel the dominant (126 degrees per year) nodal precession caused by the Earth’s Newtonian gravitational multipole moments. Unfortunately, the two existing LAGEOS satellites are not in appropriately inclined orbits, and no concrete plans exist at present to launch a third satellite in a supplementary orbit. Nevertheless, Ciufolini and Pavlis [56] combined nodal precession data from LAGEOS I and II with improved models for the Earth’s multipole moments provided by two recent orbiting geodesy satellites, Europe’s CHAMP (Challenging Minisatellite Payload) and NASA’s GRACE (Gravity Recovery and Climate Experiment), and reported a 5–10 percent confirmation of GR. In earlier reports, Ciufolini et al. had reported tests at the the 20–30 percent level, without the benefit of the GRACE/CHAMP data [55, 57, 54]. Some authors stressed the importance of adequately assessing systematic errors in the LAGEOS data [226, 133].
3.7.2 Geodetic precession
For the GPB gyroscopes orbiting the Earth, the precession is 6.6 arcseconds per year. A goal of GPB is to measure this effect to 8 × 10^{−5}; if achieved, this could bound the parameter γ to a part in 10^{4}, not competitive with the Cassini bound.
3.7.3 Tests of post-Newtonian conservation laws
A non-zero value for any of these parameters would result in a violation of conservation of momentum, or of Newton’s third law in gravitating systems. An alternative statement of Newton’s third law for gravitating systems is that the “active gravitational mass”, that is the mass that determines the gravitational potential exhibited by a body, should equal the “passive gravitational mass”, the mass that determines the force on a body in a gravitational field. Such an equality guarantees the equality of action and reaction and of conservation of momentum, at least in the Newtonian limit.
A classic test of Newton’s third law for gravitating systems was carried out in 1968 by Kreuzer, in which the gravitational attraction of fluorine and bromine were compared to a precision of 5 parts in 10^{5}.
3.8 Prospects for improved PPN parameter values
A number of advanced experiments or space missions are under development or have been proposed which could lead to significant improvements in values of the PPN parameters, of J_{2} of the Sun, and of Ġ/G.
LLR at the Apache Point Observatory (APOLLO project) could improve bounds on the Nordvedt parameter to the level 3 × 10^{−5} and on Ġ/G to better than 10^{−13} yr^{−1} [296].
The proposed 2012 ESA Bepi-Columbo Mercury orbiter, in a two-year experiment, with 6 cm range capability, could yield improvements in γ to 3 × 10^{−5}, in β to 3 × 10^{−4}, in α_{1} to 10^{−5}, in Ġ/G to 10^{−13} yr^{−1}, and in J_{2} to 3 × 10^{−8}. An eight-year mission could yield further improvements by factors of 2–5 in β, α_{1}, and J_{2}, and a further factor 15 in Ġ/G [187, 17].
GAIA is a high-precision astrometric orbiting telescope (a successor to Hipparcos), which could measure light-deflection and γ to the 10^{−6} level [101]. It is planned for launch by ESA in the 2011 time frame.
LATOR (Laser Astrometric Test of Relativity) is a concept for a NASA mission in which two microsatellites orbit the Sun on Earth-like orbits near superior conjunction, so that their lines of sight are close to the Sun. Using optical tracking and an optical interferometer on the International Space Station, it may be possible to measure the deflection of light with sufficient accuracy to bound γ to a part in 10^{8} and J_{2} to a part in 10^{8}, and to measure the solar frame-dragging effect to one percent [259, 260].
Nordtvedt [202] has argued that “grand fits” of large solar system ranging data sets, including radar ranging to Mercury, Mars, and satellites, and laser ranging to the Moon, could yield substantially improved measurements of PPN parameters. A recent contribution in that direction is [212].
4 Strong Gravity and Gravitational Waves: A New Testing Ground
4.1 Strong-field systems in general relativity
4.1.1 Defining weak and strong gravity
The system may contain strongly relativistic objects, such as neutron stars or black holes, near and inside which ϵ ∼ 1, and the post-Newtonian approximation breaks down. Nevertheless, under some circumstances, the orbital motion may be such that the interbody potential and orbital velocities still satisfy ϵ ≪ 1 so that a kind of post-Newtonian approximation for the orbital motion might work; however, the strong-field internal gravity of the bodies could (especially in alternative theories of gravity) leave imprints on the orbital motion.
The evolution of the system may be affected by the emission of gravitational radiation. The 1PN approximation does not contain the effects of gravitational radiation back-reaction. In the expression for the metric given in Box 2, radiation back-reaction effects do not occur until \({\mathcal O}({\epsilon ^{7/2}})\) in g_{00}, \({\mathcal O}({\epsilon ^3})\) in \({g_{{0_i}}}\), and \({\mathcal O}({\varepsilon ^{5/2}})\) in g_{ij}. Consequently, in order to describe such systems, one must carry out a solution of the equations substantially beyond 1PN order, sufficient to incorporate the leading radiation damping terms at 2.5PN order. In addition, the PPN metric described in Section 3.2 is valid in the near zone of the system, i.e. within one gravitational wavelength of the system’s center of mass. As such it cannot describe the gravitational waves seen by a detector.
The system may be highly relativistic in its orbital motion, so that U ∼ υ^{2} ∼ 1 even for the interbody field and orbital velocity. Systems like this include the late stage of the inspiral of binary systems of neutron stars or black holes, driven by gravitational radiation damping, prior to a merger and collapse to a final stationary state. Binary inspiral is one of the leading candidate sources for detection by a world-wide network of laser interferometric gravitational wave observatories nearing completion. A proper description of such systems requires not only equations for the motion of the binary carried to extraordinarily high PN orders (at least 3.5PN), but also requires equations for the far-zone gravitational waveform measured at the detector, that are equally accurate to high PN orders beyond the leading “quadrupole” approximation.
Of course, some systems cannot be properly described by any post-Newtonian approximation because their behavior is fundamentally controlled by strong gravity. These include the imploding cores of supernovae, the final merger of two compact objects, the quasinormal-mode vibrations of neutron stars and black holes, the structure of rapidly rotating neutron stars, and so on. Phenomena such as these must be analyzed using different techniques. Chief among these is the full solution of Einstein’s equations via numerical methods. This field of “numerical relativity” is a rapidly growing and maturing branch of gravitational physics, whose description is beyond the scope of this review (see [165, 24] for reviews). Another is black hole perturbation theory (see [188, 146, 235] for reviews).
4.1.2 Compact bodies and the strong equivalence principle
When dealing with the motion and gravitational wave generation by orbiting bodies, one finds a remarkable simplification within GR. As long as the bodies are sufficiently well-separated that one can ignore tidal interactions and other effects that depend upon the finite extent of the bodies (such as their quadrupole and higher multipole moments), then all aspects of their orbital behavior and gravitational wave generation can be characterized by just two parameters: mass and angular momentum. Whether their internal structure is highly relativistic, as in black holes or neutron stars, or non-relativistic as in the Earth and Sun, only the mass and angular momentum are needed. Furthermore, both quantities are measurable in principle by examining the external gravitational field of the bodies, and make no reference whatsoever to their interiors.
Damour [70] calls this the “effacement” of the bodies’ internal structure. It is a consequence of the SEP, described in Section 3.1.2.
General relativity satisfies SEP because it contains one and only one gravitational field, the spacetime metric g_{μν}. Consider the motion of a body in a binary system, whose size is small compared to the binary separation. Surround the body by a region that is large compared to the size of the body, yet small compared to the separation. Because of the general covariance of the theory, one can choose a freely-falling coordinate system which comoves with the body, whose spacetime metric takes the Minkowski form at its outer boundary (ignoring tidal effects generated by the companion). There is thus no evidence of the presence of the companion body, and the structure of the chosen body can be obtained using the field equations of GR in this coordinate system. Far from the chosen body, the metric is characterized by the mass and angular momentum (assuming that one ignores quadrupole and higher multipole moments of the body) as measured far from the body using orbiting test particles and gyroscopes. These asymptotically measured quantities are oblivious to the body’s internal structure. A black hole of mass m and a planet of mass m would produce identical spacetimes in this outer region.
The geometry of this region surrounding the one body must be matched to the geometry provided by the companion body. Einstein’s equations provide consistency conditions for this matching that yield constraints on the motion of the bodies. These are the equations of motion. As a result the motion of two planets of mass and angular momentum m_{1}, m_{2}, J_{1}, and J_{2} is identical to that of two black holes of the same mass and angular momentum (again, ignoring tidal effects).
This effacement does not occur in an alternative gravitional theory like scalar-tensor gravity. There, in addition to the spacetime metric, a scalar field ϕ is generated by the masses of the bodies, and controls the local value of the gravitational coupling constant (i.e. G is a function of ϕ). Now, in the local frame surrounding one of the bodies in our binary system, while the metric can still be made Minkowskian far away, the scalar field will take on a value ϕ_{0} determined by the companion body. This can affect the value of G inside the chosen body, alter its internal structure (specifically its gravitational binding energy) and hence alter its mass. Effectively, each body can be characterized by several mass functions m_{A}(ϕ), which depend on the value of the scalar field at its location, and several distinct masses come into play, such as inertial mass, gravitational mass, “radiation” mass, etc. The precise nature of the functions will depend on the body, specifically on its gravitational binding energy, and as a result, the motion and gravitational radiation may depend on the internal structure of each body. For compact bodies such as neutron stars and black holes these internal structure effects could be large; for example, the gravitational binding energy of a neutron star can be 10–20 percent of its total mass. At 1PN order, the leading manifestation of this phenomenon is the Nordtvedt effect.
This is how the study of orbiting systems containing compact objects provides strong-field tests of GR. Even though the strong-field nature of the bodies is effaced in GR, it is not in other theories, thus any result in agreement with the predictions of GR constitutes a kind of “null” test of strong-field gravity.
4.2 Motion and gravitational radiation in general relativity
The motion of bodies and the generation of gravitational radiation are long-standing problems that date back to the first years following the publication of GR, when Einstein calculated the gravitational radiation emitted by a laboratory-scale object using the linearized version of GR, and de Sitter calculated N-body equations of motion for bodies in the 1PN approximation to GR. It has at times been controversial, with disputes over such issues as whether Einstein’s equations alone imply equations of motion for bodies (Einstein, Infeld, and Hoffman demonstrated explicitly that they do, using a matching procedure similar to the one described above), whether gravitational waves are real or are artifacts of general covariance (Einstein waffled; Bondi and colleagues proved their reality rigorously in the 1950s), and even over algebraic errors (Einstein erred by a factor of 2 in his first radiation calculation; Eddington found the mistake). Shortly after the discovery of the binary pulsar PSR 1913+16 in 1974, questions were raised about the foundations of the “quadrupole formula” for gravitational radiation damping (and in some quarters, even about its quantitative validity). These questions were answered in part by theoretical work designed to shore up the foundations of the quadrupole approximation, and in part (perhaps mostly) by the agreement between the predictions of the quadrupole formula and the observed rate of damping of the pulsar’s orbit (see Section 5.1). Damour [70] gives a thorough historical and technical review of this subject up to 1986.
The problem of motion and radiation in GR has received renewed interest since 1990, with proposals for construction of large-scale laser interferometric gravitational wave observatories, such as the LIGO project in the US, VIRGO and GEO600 in Europe, and TAMA300 in Japan, and the realization that a leading candidate source of detectable waves would be the inspiral, driven by gravitational radiation damping, of a binary system of compact objects (neutron stars or black holes) [1, 256]. The analysis of signals from such systems will require theoretical predictions from GR that are extremely accurate, well beyond the leading-order prediction of Newtonian or even post-Newtonian gravity for the orbits, and well beyond the leading-order formulae for gravitational waves.
This presented a major theoretical challenge: to calculate the motion and radiation of systems of compact objects to very high PN order, a formidable algebraic task, while addressing a number of issues of principle that have historically plagued this subject, sufficiently well to ensure that the results were physically meaningful. This challenge has been largely met, so that we may soon see a remarkable convergence between observational data and accurate predictions of gravitational theory that could provide new, strong-field tests of GR.
Here we give a brief overview of the problem of motion and gravitational radiation in GR.
4.3 Einstein’s equations in “relaxed” form
At the same time, just as in electromagnetism, the formal integral (64) must be handled differently, depending on whether the field point is in the far zone or the near zone. For field points in the far zone or radiation zone, |x| > ƛ > |x′| (ƛ is the gravitational wavelength divided by 2π), the field can be expanded in inverse powers of R = |x| in a multipole expansion, evaluated at the “retarded time” t − R. The leading term in 1/R is the gravitational waveform. For field points in the near zone or induction zone, |x| ∼ |x′| < ƛ, the field is expanded in powers of |x − x′| about the local time t, yielding instantaneous potentials that go into the equations of motion.
However, because the source τ^{αβ} contains h^{αβ} itself, it is not confined to a compact region, but extends over all spacetime. As a result, there is a danger that the integrals involved in the various expansions will diverge or be ill-defined. This consequence of the non-linearity of Einstein’s equations has bedeviled the subject of gravitational radiation for decades. Numerous approaches have been developed to try to handle this difficulty. The “post-Minkowskian” method of Blanchet, Damour, and Iyer [35, 36, 37, 76, 38, 33] solves Einstein’s equations by two different techniques, one in the near zone and one in the far zone, and uses the method of singular asymptotic matching to join the solutions in an overlap region. The method provides a natural “regularization” technique to control potentially divergent integrals (see [34] for a thorough review). The “Direct Integration of the Relaxed Einstein Equations” (DIRE) approach of Will, Wiseman, and Pati [291, 208] retains Equation (64) as the global solution, but splits the integration into one over the near zone and another over the far zone, and uses different integration variables to carry out the explicit integrals over the two zones. In the DIRE method, all integrals are finite and convergent. Itoh and Futamase have used an extension of the Einstein-Infeld-Hoffman matching approach combined with a specific method for taking a point-particle limit [134], while Damour, Jaranowski, and Schäfer have pioneered an ADM Hamiltonian approach that focuses on the equations of motion [139, 140, 77, 79, 78].
These methods assume from the outset that gravity is sufficiently weak that ∥h^{αβ}∥ < 1 and harmonic coordinates exists everywhere, including inside the bodies. Thus, in order to apply the results to cases where the bodies may be neutron stars or black holes, one relies upon the SEP to argue that, if tidal forces are ignored, and equations are expressed in terms of masses and spins, one can simply extrapolate the results unchanged to the situation where the bodies are ultrarelativistic. While no general proof of this exists, it has been shown to be valid in specific circumstances, such as at 2PN order in the equations of motion, and for black holes moving in a Newtonian background field [70].
Methods such as these have resolved most of the issues that led to criticism of the foundations of gravitational radiation theory during the 1970s.
4.4 Equations of motion and gravitational waveform
These formalisms have also been generalized to include the leading effects of spin-orbit and spin-spin coupling between the bodies [145, 144, 289].
Another approach to gravitational radiation is applicable to the special limit in which one mass is much smaller than the other. This is the method of black hole perturbation theory. One begins with an exact background spacetime of a black hole, either the non-rotating Schwarzschild or the rotating Kerr solution, and perturbs it according to \({g_{\mu \nu}} = g_{\mu \nu}^{(0)} + {h_{\mu \nu}}\). The particle moves on a geodesic of the background spacetime, and a suitably defined source stress-energy tensor for the particle acts as a source for the gravitational perturbation and wave field h_{μν}. This method provides numerical results that are exact in υ, as well as analytical results expressed as series in powers of υ, both for non-rotating and for rotating black holes. For non-rotating holes, the analytical expansions have been carried to 5.5PN order, or ϵ^{5.5} beyond the quadrupole approximation. All results of black hole perturbation agree precisely with the m_{1} → 0 limit of the PN results, up to the highest PN order where they can be compared (for reviews see [188, 235]).
4.5 Gravitational wave detection
5 Stellar System Tests of Gravitational Theory
5.1 The binary pulsar and general relativity
The 1974 discovery of the binary pulsar B1913+16 by Joseph Taylor and Russell Hulse during a routine search for new pulsars provided the first possibility of probing new aspects of gravitational theory: the effects of strong relativistic internal gravitational fields on orbital dynamics, and the effects of gravitational radiation reaction. For reviews of the discovery, see the published Nobel Prize lectures by Hulse and Taylor [132, 252]. For reviews of the current status of pulsars, including binary and millisecond pulsars, see [173, 243].
Parameter | Symbol (units) | Value | |
---|---|---|---|
(i) | “Physical” parameters: | ||
Right Ascension | α | 19^{h}15^{m}27.^{s}99999(2) | |
Declination | δ | 16^{°}06′27.″4034(4) | |
Pulsar period | Pp (ms) | 59.0299983444181(5) | |
Derivative of period | Ṗ_{p} | 8.62713(8) × 10^{−18} | |
(ii) | “Keplerian” parameters: | ||
Projected semimajor axis | a_{p} sin i (s) | 2.341774(1) | |
Eccentricity | e | 0.6171338(4) | |
Orbital period | P_{b} (day) | 0.322997462727(5) | |
Longitude of periastron | ω_{0} (°) | 226.57518(4) | |
Julian date of periastron | T_{0} (MJD) | 46443.99588317(3) | |
(iii) | “Post-Keplerian” parameters: | ||
Mean rate of periastron advance | \(\left\langle {\dot \omega} \right\rangle\) (° yr^{−1}) | 4.226595(5) | |
Redshift/time dilation | γ′ (ms) | 4.2919(8) | |
Orbital period derivative | Ṗ_{b} (10−12) | −2.4184(9) |
Three factors make this system an arena where relativistic celestial mechanics must be used: the relatively large size of relativistic effects [υ_{orbit} ≈ (m/r)^{1/2} ≈ 10^{−3}], a factor of 10 larger than the corresponding values for solar-system orbits; the short orbital period, allowing secular effects to build up rapidly; and the cleanliness of the system, allowing accurate determinations of small effects. Because the orbital separation is large compared to the neutron stars’ compact size, tidal effects can be ignored. Just as Newtonian gravity is used as a tool for measuring astrophysical parameters of ordinary binary systems, so GR is used as a tool for measuring astrophysical parameters in the binary pulsar.
- 1.
non-orbital parameters, such as the pulsar period and its rate of change (defined at a given epoch), and the position of the pulsar on the sky;
- 2.
five “Keplerian” parameters, most closely related to those appropriate for standard Newtonian binary systems, such as the eccentricity e, the orbital period P_{b}, and the semi-major axis of the pulsar projected along the line of sight, a_{p} sin i; and
- 3.
five “post-Keplerian” parameters.
Because f_{b} and e are separately measured parameters, the measurement of the three post-Keplerian parameters provide three constraints on the two unknown masses. The periastron shift measures the total mass of the system, Ṗ_{b} measures the chirp mass, and γ′ measures a complicated function of the masses. GR passes the test if it provides a consistent solution to these constraints, within the measurement errors.
The consistency among the constraints provides a test of the assumption that the two bodies behave as “point” masses, without complicated tidal effects, obeying the general relativistic equations of motion including gravitational radiation. It is also a test of strong gravity, in that the highly relativistic internal structure of the neutron stars does not influence their orbital motion, as predicted by the SEP of GR.
Recent observations [157, 271] indicate variations in the pulse profile, which suggests that the pulsar is undergoing geodetic precession on a 300-year timescale as it moves through the curved spacetime generated by its companion (see Section 3.7.2). The amount is consistent with GR, assuming that the pulsar’s spin is suitably misaligned with the orbital angular momentum. Unfortunately, the evidence suggests that the pulsar beam may precess out of our line of sight by 2025.
5.2 A zoo of binary pulsars
B1534+12
This is a binary pulsar system in our galaxy [245, 243, 18]. Its pulses are significantly stronger and narrower than those of B1913+16, so timing measurements are more precise, reaching 3 µs accuracy. The orbital plane appears to be almost edge-on relative to the line of sight (i ≃ 80°); as a result the Shapiro delay is substantial, and separate values of the parameters r and s have been obtained with interesting accuracy. Assuming GR, one infers that the two masses are m_{1} = 1.335 ± 0.002 M_{⊙} and m_{2} = 1.344 ± 0.002 M_{⊙}. The rate of orbit decay Ṗ_{b} agrees with GR to about 15 percent, but the precision is limited by the poorly known distance to the pulsar, which introduces a significant uncertainty into the subtraction of galactic acceleration. Independently of Ṗ_{b}, measurement of the four other post-Keplerian parameters gives two tests of strong-field gravity in the non-radiative regime [253].
B2127+11C
This system appears to be a clone of the Hulse-Taylor binary pulsar, with very similar values for orbital period and eccentricity (see Table 7). The inferred total mass of the system is 2.706 ± 0.011 M_{⊙}. But because the system is in the globular cluster M15 (NGC 7078), it suffers Doppler shifts resulting from local accelerations, either by the mean cluster gravitational field or by nearby stars, that are more difficult to estimate than was the case with the galactic system B1913+16. This makes a separate, precision measurement of the relativistic contribution to Ṗ_{b} essentially impossible.
J0737-3039A, B
This binary pulsar system, discovered in 2003 [48], was already remarkable for its extraordinarily short orbital period (0.1 days) and large periastron advance (16.88° yr^{−1}), but then the companion was also discovered to be a pulsar [175]. Because two projected semi-major axes can now be measured, one can obtain the mass ratio directly from the ratio of the two values of ap sin i, and thereby obtain the two masses by combining that ratio with the periastron advance, assuming GR. The results are m_{A} = 1.337±0.005 M_{⊙} and m_{B} = 1.250±0.005 M_{⊙}, where A denotes the primary (first) pulsar. From these values, one finds that the orbit is nearly edge-on, with sin i = 0.9991, a value which is completely consistent with that inferred from the Shapiro delay parameter (see Table 7). In fact, the five measured post-Keplerian parameters plus the ratio of the projected semi-major axes give six constraints on the masses (assuming GR): All six overlap within their measurement errors. This system provides a unique opportunity for tight tests of strong-field and radiative effects in GR. Furthermore, it is likely that galactic proper motion effects will play a significantly smaller role in the interpretation of Ṗ_{b} measurements than they did in B1913+16.
J1141-6545
This is a case where the companion is probably a white dwarf [20, 18]. The masses of the pulsar and companion are 1.30±0.02 and 0.986±0.02 M_{⊙}, respectively. Ṗ_{b} has been measured to about 25 percent, consistent with the GR prediction. But because of the asymmetry in sensitivities (S_{NS} ∼ 0.2, S_{WD} ∼ 10^{−4}), there is the possibility, absent in the double neutronstar systems, to place a strong bound on scalar-tensor gravity (see Section 5.4).
J1756-2251
Discovered in 2004, this pulsar is in a binary system with a probable neutron star companion, with P_{b} = 7.67 hr, e = 0.18, and \(\dot \omega = 2.585 \pm 0.002\deg {\rm{y}}{{\rm{r}}^{{\rm{- 1}}}}\) [104].
J1906+0746
The discovery of this system was reported in late 2005 [174]. It is a young, 144-ms pulsar in a relativistic orbit with P_{b} = 3.98 hr, e = 0.085, and \(\dot \omega = 7.57 \pm 0.03\deg {\rm{y}}{{\rm{r}}^{{\rm{- 1}}}}\).
Parameters of other binary pulsars. References may be found in the text; for an online catalogue of pulsars with reasonably up-to-date parameters, see [18].
Parameter | B1534+12 | B2127+11C | J1141−6545 | J0737−3039(A, B) | |
---|---|---|---|---|---|
(i) | “Keplerian” parameters: | ||||
a_{p} sin i (s) | 3.7294626(8) | 2.520(3) | 1.85894(1) | 1.41504(2)/1.513(3) | |
e | 0.2736767(1) | 0.68141(2) | 0.171876(2) | 0.087779(5) | |
P_{b} (day) | 0.420737299153(4) | 0.335282052(6) | 0.1976509587(3) | 0.102251563(1) | |
(ii) | “Post-Keplerian” parameters: | ||||
\(\left\langle {\dot \omega} \right\rangle\) (° yr^{−1}) | 1.755805(3) | 4.457(12) | 5.3084(9) | 16.90(1) | |
γ′ (ms) | 2.070(2) | 4.67 | 0.72(3) | 0.382(5) | |
Ṗ_{b} (10^{−}^{12}) | −0.137(3) | −3.94 | −0.43(10) | −1.21(6) | |
r (μs) | 6.7(1.0) | 6.2(5) | |||
s = sin i | 0.975(7) | 0.9995(4) |
5.3 Binary pulsars and alternative theories
Soon after the discovery of the binary pulsar it was widely hailed as a new testing ground for relativistic gravitational effects. As we have seen in the case of GR, in most respects, the system has lived up to, indeed exceeded, the early expectations.
On the other hand, the early observations of PSR 1913+16 already indicated that, in GR, the masses of the two bodies were nearly equal, so that, in theories of gravity that are in some sense “close” to GR, dipole gravitational radiation would not be a strong effect, because of the apparent symmetry of the system. The Rosen theory, and others like it, are not “close” to GR, except in their predictions for the weak-field, slow-motion regime of the solar system. When relativistic neutron stars are present, theories like these can predict strong effects on the motion of the bodies resulting from their internal highly relativistic gravitational structure (violations of SEP). As a consequence, the masses inferred from observations of the periastron shift and γ′ may be significantly different from those inferred using GR, and may be different from each other, leading to strong dipole gravitational radiation damping. By contrast, the Brans-Dicke theory is “close” to GR, roughly speaking within 1/ω_{BD} of the predictions of the latter, for large values of the coupling constant ω_{BD}. Thus, despite the presence of dipole gravitational radiation, the binary pulsar provides at present only a weak test of Brans-Dicke theory, not competitive with solar-system tests.
5.4 Binary pulsars and scalar-tensor gravity
Unfortunately, because of the near equality of the neutron star masses in the binary pulsar, dipole radiation is suppressed, and the bounds obtained are not competitive with the Cassini bound on γ [293], except for those generalized scalar-tensor theories, with β_{0} < 0 [74]. Bounds on the parameters α_{0} and β_{0} from solar system, binary pulsar, and gravitational wave observations (see Sections 5.1 and 6.3) are found in [74].
6 Gravitational Wave Tests of Gravitational Theory
6.1 Gravitational wave observatories
Some time in the next decade, a new opportunity for testing relativistic gravity will be realized, when a worldwide network of kilometer-scale, laser interferometric gravitational wave observatories in the U.S. (LIGO project), Europe (VIRGO and GEO600 projects), and Japan (TAMA300 project) begins regular detection and analysis of gravitational wave signals from astrophysical sources. These broad-band antennas will have the capability of detecting and measuring the gravitational waveforms from astronomical sources in a frequency band between about 10 Hz (the seismic noise cutoff) and 500 Hz (the photon counting noise cutoff), with a maximum sensitivity to strain at around 100 Hz of h ∼ Δl/l ∼ 10^{−22} (rms), for the kilometer-scale LIGO/VIRGO projects. The most promising source for detection and study of the gravitational wave signal is the “inspiralling compact binary” — a binary system of neutron stars or black holes (or one of each) in the final minutes of a death spiral leading to a violent merger. Such is the fate, for example, of the Hulse-Taylor binary pulsar B1913+16 in about 300 Myr, or the “double pulsar” J0737-3039 in about 85 Myr. Given the expected sensitivity of the “advanced LIGO” (around 2010), which could see such sources out to many hundreds of megaparsecs, it has been estimated that from 40 to several hundred annual inspiral events could be detectable. Other sources, such as supernova core collapse events, instabilities in rapidly rotating newborn neutron stars, signals from non-axisymmetric pulsars, and a stochastic background of waves, may be detectable (for reviews, see [1, 256]; for updates on the status of various projects, see [114, 45]).
A similar network of cryogenic resonant-mass gravitational antennas have been in operation for many years, albeit at lower levels of sensitivity (h ∼ 10^{−19}). While modest improvements in sensitivity may be expected in the future, these resonant detectors are not expected to be competitive with the large interferometers, unless new designs involving masses of spherical, or nearly spherical shape come to fruition. These systems are primarily sensitive to waves in relatively narrow bands about frequencies in the hundreds to thousands of Hz range [206, 123, 32, 217], although future improvements in sensitivity and increases in bandwidth may be possible [61].
In addition, plans are being developed for an orbiting laser interferometer space antenna (LISA for short). Such a system, consisting of three spacecraft orbiting the sun in a triangular formation separated from each other by five million kilometers, would be sensitive primarily in the very low frequency band between 10^{−4} and 10^{−1} Hz, with peak strain sensitivity of order h ∼ 10^{−23} [90].
In addition to opening a new astronomical window, the detailed observation of gravitational waves by such observatories may provide the means to test general relativistic predictions for the polarization and speed of the waves, for gravitational radiation damping and for strong-field gravity.
6.2 Polarization of gravitational waves
A suitable array of gravitational antennas could delineate or limit the number of modes present in a given wave. The strategy depends on whether or not the source direction is known. In general there are eight unknowns (six polarizations and two direction cosines), but only six measurables (R_{0i0j}). If the direction can be established by either association of the waves with optical or other observations, or by time-of-flight measurements between separated detectors, then six suitably oriented detectors suffice to determine all six components. If the direction cannot be established, then the system is underdetermined, and no unique solution can be found. However, if one assumes that only transverse waves are present, then there are only three unknowns if the source direction is known, or five unknowns otherwise. Then the corresponding number (three or five) of detectors can determine the polarization. If distinct evidence were found of any mode other than the two transverse quadrupolar modes of GR, the result would be disastrous for GR. On the other hand, the absence of a breathing mode would not necessarily rule out scalar-tensor gravity, because the strength of that mode depends on the nature of the source.
Some of the details of implementing such polarization observations have been worked out for arrays of resonant cylindrical, disk-shaped, spherical, and truncated icosahedral detectors (TEGP 10.2 [281], for recent reviews see [169, 266]); initial work has been done to assess whether the ground-based or space-based laser interferometers (or combinations of the two types) could perform interesting polarization measurements [267, 47, 177, 117, 273]. Unfortunately for this purpose, the two LIGO observatories (in Washington and Louisiana states, respectively) have been constructed to have their respective arms as parallel as possible, apart from the curvature of the Earth; while this maximizes the joint sensitivity of the two detectors to gravitational waves, it minimizes their ability to detect two modes of polarization.
6.3 Gravitational radiation back-reaction
In the binary pulsar, a test of GR was made possible by measuring at least three relativistic effects that depended upon only two unknown masses. The evolution of the orbital phase under the damping effect of gravitational radiation played a crucial role. Another situation in which measurement of orbital phase can lead to tests of GR is that of the inspiralling compact binary system. The key differences are that here gravitational radiation itself is the detected signal, rather than radio pulses, and the phase evolution alone carries all the information. In the binary pulsar, the first derivative of the binary frequency ḟ_{b} was measured; here the full nonlinear variation of f_{b} as a function of time is measured.
Broad-band laser interferometers are especially sensitive to the phase evolution of the gravitational waves, which carry the information about the orbital phase evolution. The analysis of gravitational wave data from such sources will involve some form of matched filtering of the noisy detector output against an ensemble of theoretical “template” waveforms which depend on the intrinsic parameters of the inspiralling binary, such as the component masses, spins, and so on, and on its inspiral evolution. How accurate must a template be in order to “match” the waveform from a given source (where by a match we mean maximizing the cross-correlation or the signal-to-noise ratio)? In the total accumulated phase of the wave detected in the sensitive bandwidth, the template must match the signal to a fraction of a cycle. For two inspiralling neutron stars, around 16,000 cycles should be detected during the final few minutes of inspiral; this implies a phasing accuracy of 10^{−5} or better. Since υ ∼ 1/10 during the late inspiral, this means that correction terms in the phasing at the level of υ^{5} or higher are needed. More formal analyses confirm this intuition [67, 105, 68, 214].
Because it is a slow-motion system (υ ∼ 10^{−3}), the binary pulsar is sensitive only to the lowest-order effects of gravitational radiation as predicted by the quadrupole formula. Nevertheless, the first correction terms of order υ and υ^{2} to the quadrupole formula were calculated as early as 1976 [268] (see TEGP 10.3 [281]).
But for laser interferometric observations of gravitational waves, the bottom line is that, in order to measure the astrophysical parameters of the source and to test the properties of the gravitational waves, it is necessary to derive the gravitational waveform and the resulting radiation back-reaction on the orbit phasing at least to 2PN order beyond the quadrupole approximation, and preferably to 3PN order.
Similar expressions can be derived for the loss of angular momentum and linear momentum. Expressions for non-circular orbits have also been derived [121, 75]. These losses react back on the orbit to circularize it and cause it to inspiral. The result is that the orbital phase (and consequently the gravitational wave phase) evolves non-linearly with time. It is the sensitivity of the broadband laser interferometric detectors to phase that makes the higher-order contributions to df/dt so observationally relevant.
If the coefficients of each of the powers of f in Equation (97) can be measured, then one again obtains more than two constraints on the two unknowns m_{1} and m_{2}, leading to the possibility to test GR. For example, Blanchet and Sathyaprakash [42, 41] have shown that, by observing a source with a sufficiently strong signal, an interesting test of the 4π coefficient of the “tail” term could be performed.
Another possibility involves gravitational waves from a small mass orbiting and inspiralling into a (possibly supermassive) spinning black hole. A general non-circular, non-equatorial orbit will precess around the hole, both in periastron and in orbital plane, leading to a complex gravitational waveform that carries information about the non-spherical, strong-field spacetime around the hole. According to GR, this spacetime must be the Kerr spacetime of a rotating black hole, uniquely specified by its mass and angular momentum, and consequently, observation of the waves could test this fundamental hypothesis of GR [231, 213].
6.4 Speed of gravitational waves
According to GR, in the limit in which the wavelength of gravitational waves is small compared to the radius of curvature of the background spacetime, the waves propagate along null geodesics of the background spacetime, i.e. they have the same speed c as light (in this section, we do not set c = 1). In other theories, the speed could differ from c because of coupling of gravitation to “background” gravitational fields. For example, in the Rosen bimetric theory with a flat background metric η, gravitational waves follow null geodesics of η, while light follows null geodesics of g (TEGP 10.1 [281]).
The simplest attempt to incorporate a massive graviton into general relativity in a ghost-free manner suffers from the so-called van Dam-Veltman-Zakharov (vDVZ) discontinuity [263, 299]. Because of the 3 additional helicity states available to the massive spin-2 graviton, the limit of small graviton mass does not coincide with pure GR, and the predicted perihelion advance, for example, violates experiment. A model theory by Visser [265] attempts to circumvent the vDVZ problem by introducing a non-dynamical flat-background metric. This theory is truly continuous with GR in the limit of vanishing graviton mass; on the other hand, its observational implications have been only partially explored. Braneworld scenarios predict a tower or a continuum of massive gravitons, and may avoid the vDVZ discontinuity, although the full details are still a work in progress [91, 66].
The foregoing discussion assumes that the source emits both gravitational and electromagnetic radiation in detectable amounts, and that the relative time of emission can be established to sufficient accuracy, or can be shown to be sufficiently small.
However, there is a situation in which a bound on the graviton mass can be set using gravitational radiation alone [285]. That is the case of the inspiralling compact binary. Because the frequency of the gravitational radiation sweeps from low frequency at the initial moment of observation to higher frequency at the final moment, the speed of the gravitons emitted will vary, from lower speeds initially to higher speeds (closer to c) at the end. This will cause a distortion of the observed phasing of the waves and result in a shorter than expected overall time Δt_{a} of passage of a given number of cycles. Furthermore, through the technique of matched filtering, the parameters of the compact binary can be measured accurately (assuming that GR is a good approximation to the orbital evolution, even in the presence of a massive graviton), and thereby the emission time Δt_{e} can be determined accurately. Roughly speaking, the “phase interval” fΔt in Equation (101) can be measured to an accuracy 1/ρ, where ρ is the signal-to-noise ratio.
Thus one can estimate the bounds on λ_{g} achievable for various compact inspiral systems, and for various detectors. For stellar-mass inspiral (neutron stars or black holes) observed by the LIGO/VIRGO class of ground-based interferometers, D ≈ 200 Mpc, f ≈ 100 Hz, and f Δt ∼ ρ^{−1} ≈ 1/10. The result is λ_{g} > 10^{13} km. For supermassive binary black holes (10^{4} to 10^{7}M_{⊙}) observed by the proposed laser interferometer space antenna (LISA), D ≈ 3 Gpc, f ≈ 10^{−3} Hz, and f Δt ∼ ρ^{−1} ≈ 1/1000. The result is λ_{g} > 10^{17} km.
A full noise analysis using proposed noise curves for the advanced LIGO and for LISA weakens these crude bounds by factors between two and 10 [285, 292, 27, 28]. For example, for the inspiral of two 10^{6}M_{⊙} black holes with aligned spins at a distance of 3 Gpc observed by LISA, a bound of 2 × 10^{16} km could be placed [27]. Other possibilities include using binary pulsar data to bound modifications of gravitational radiation damping by a massive graviton [106], and using LISA observations of the phasing of waves from compact white-dwarf binaries, eccentric galactic binaries, and eccentric inspiral binaries [69, 142].
6.5 Strong-gravity tests
One of the central difficulties of testing GR in the strong-field regime is the possibility of contamination by uncertain or complex physics. In the solar system, weak-field gravitational effects could in most cases be measured cleanly and separately from non-gravitational effects. The remarkable cleanliness of the binary pulsar permitted precise measurements of gravitational phenomena in a strong-field context.
Unfortunately, nature is rarely so kind. Still, under suitable conditions, qualitative and even quantitative strong-field tests of GR could be carried out.
One example is in cosmology. From a few seconds after the big bang until the present, the underlying physics of the universe is well understood, in terms of a Standard Model of a nearly spatially flat universe, 13.6 Gyr old, dominated by dark matter and dark energy. Some alternative theories of gravity that are qualitatively different from GR fail to produce cosmologies that meet even the minimum requirements of agreeing qualitatively with big-bang nucleosynthesis (BBN) or the properties of the cosmic microwave background (TEGP 13.2 [281]). Others, such as Brans-Dicke theory, are sufficiently close to GR (for large enough ω_{BD}) that they conform to all cosmological observations, given the underlying uncertainties. The generalized scalar-tensor theories, however, could have small ω at early times, while evolving through the attractor mechanism to large ω today. One way to test such theories is through big-bang nucleosynthesis, since the abundances of the light elements produced when the temperature of the universe was about 1 MeV are sensitive to the rate of expansion at that epoch, which in turn depends on the strength of interaction between geometry and the scalar field. Because the universe is radiation-dominated at that epoch, uncertainties in the amount of cold dark matter or of the cosmological constant are unimportant. The nuclear reaction rates are reasonably well understood from laboratory experiments and theory, and the number of light neutrino families (3) conforms to evidence from particle accelerators. Thus, within modest uncertainties, one can assess the quantitative difference between the BBN predictions of GR and scalar-tensor gravity under strong-field conditions and compare with observations. For recent analyses, see [234, 84, 58, 60].
Another example is the exploration of the spacetime near black holes and neutron stars via accreting matter. Studies of certain kinds of accretion known as advection-dominated accretion flow (ADAF) in low-luminosity binary X-ray sources may yield the signature of the black hole event horizon [185]. The spectrum of frequencies of quasi-periodic oscillations (QPO) from galactic black hole binaries may permit measurement of the spins of the black holes [218]. Aspects of strong-field gravity and frame-dragging may be revealed in spectral shapes of iron fluorescence lines from the inner regions of accretion disks [225, 224]. Because of uncertainties in the detailed models, the results to date of studies like these are suggestive at best, but the combination of higher-resolution observations and better modelling could lead to striking tests of strong-field predictions of GR.
7 Conclusions
We find that general relativity has held up under extensive experimental scrutiny. The question then arises, why bother to continue to test it? One reason is that gravity is a fundamental interaction of nature, and as such requires the most solid empirical underpinning we can provide. Another is that all attempts to quantize gravity and to unify it with the other forces suggest that the standard general relativity of Einstein is not likely to be the last word. Furthermore, the predictions of general relativity are fixed; the theory contains no adjustable constants so nothing can be changed. Thus every test of the theory is either a potentially deadly test or a possible probe for new physics. Although it is remarkable that this theory, born 90 years ago out of almost pure thought, has managed to survive every test, the possibility of finding a discrepancy will continue to drive experiments for years to come.
Acknowledgments
This work has been supported since the initial version in part by the National Science Foundation, Grant Numbers PHY 96-00049, 00-96522, and 03-53180, and by the National Aeronautics and Space Administration, Grant Number NAG5-10186. We also gratefully acknowledge the support of the Centre National de la Recherche Scientifique, and the hospitality of the Institut d’Astrophysique de Paris, where this update was completed. Comments from referees were particularly helpful in improving this update.