1 Introduction

The total mass of a nearby disk galaxy is typically obtained from measuring its rotation curve and deducing from it the mass using Newton’s dynamics. The rationale for this non-relativistic treatment is the small velocity of stars: \(v/c \ll 1\) sufficiently far from the central galactic black hole. However, there are other relativistic corrections beside \((v/c)^n\) terms. Inspecting the perturbative post-Newtonian [1] Lagrangian for two masses \(M_1\) and \(M_2\) separated by r, reveals terms such as \(V_{1pn}=G^2M_{1}M_{2}(M_{1}+M_{2})/2r^2\) (G is the gravitational constant) that are not suppressed at small v. These terms can be non-negligible if \(M_1\) and \(M_2\) are large enough, but for galaxies, they happen to be generally small. However, terms such as \(V_{1pn}\) are perturbative corrections, i.e. they omit non-perturbative dynamics, and we will show here that they indeed fail to provide the full relativistic dynamics associated with large masses. Regardless of the size of their contribution, terms like \(V_{1pn}\) exemplify the non-linear nature of General Relativity (GR), which arises from its field self-interaction: the gravitational field has an energy and hence gravitates too.

Field self-interactions are well-known in particle physics: quantum chromodynamics (QCD, the gauge theory of the strong force between quarks) features color-charged fields that self-interact. In fact, GR and QCD have similar Lagrangians, including self-interacting terms, as the polynomial form of the Einstein-Hilbert Lagrangian shows [2, 3]. Field self-interaction in QCD, which causes quark confinement, occurs even for static sources: numerous bound states of heavy quarks (in which quarks have \(v\approx 0\)) exist [4]. This, and the correspondence of terms in the GR and QCD Lagrangians, shows that for bodies massive enough a relativistic treatment is required regardless of their velocity. Finally, the measured speeds at the rotation curve plateaus are of several hundreds of km/s, e.g. 300 km/s (or \(v/c=0.1\%\)) for NGC 2841. They are similar to that of stars which orbit the Galactic central black hole and which clearly display a relativistic dynamics [5].

These arguments call for an investigation of the importance of relativistic dynamics in galaxies. The exact resolution of the self-gravitating disk problem within GR being unknown, an approximate method is required. From experience with QCD, a non-perturbative approach is required to fully account for field self-interaction. In fact, perturbative QCD underestimates the large distance effect of field self-interaction by two orders of magnitude [6]. The identical Lagrangian structure of GR and QCD suggests similar non-linear issues for galactic dynamics. Furthermore, post-Newtonian calculations show that the perturbative series – which may already underestimate field self-interaction – diverges for masses similar to those of the heaviest known galaxies. These facts make a perturbative expansion based on terms like \(V_{1pn}\) suspicious, and warranty a non-perturbative investigation. In Refs. [2, 3], a non-perturbative numerical lattice method was used. Here, we propose to approach the problem with a mean-field technique combined with gravitational lensing. The advantages compared to the lattice method [2, 3] are: (1) it is an entirely independent method, thereby providing a thorough check of the lattice result; (2) it is not restricted to the static limit of the lattice method and can be applied to systems with complex geometries; (3) it is less CPU-intensive than a lattice calculation, and hence faster; (4) it clarifies that the effect calculated in Refs. [2, 3] is classical. The lattice approach – an inherently quantum field theory (QFT) technique – may suggest that quantum phenomena are involved. However, the lattice calculations [2, 3] were performed in the high-temperature, i.e. classic, limit [3]; (5) the lensing formalism is more familiar to astronomers, in contrast to lattice techniques with its QFT underpinning and terminology.

2 Mean-field lensing model

2.1 Method

The mean-field technique has been widely employed. In the context of gravitation, it has been used e.g. to calculate the propagation of gravitational waves near large masses [8]; to solve, by using the gravity/gauge correspondence, the non-perturbative bound-state problem of QCD [9]; or to derive the background field method introduced to quantize gravity [10,11,12]. Hence the mean-field approach is a common and diverse technique. We propose to use it together with the gravitational lensing formalism to compute the self-interaction of the gravitational fields that generates GR’s characteristic non-linearities. Here, the mean field represents the overall effect of the massive components of the galaxy and is treated as a spacetime curvature. The gravitational interaction between a test particle at a distance r and the mass inclosed within r, is treated as a traditional force. Once this part of the gravitational field is treated as a force rather than geometrically, it can be characterized by field lines. The effect of curvature on the traditional force presents in the curved spacetime accounts for the gravitational field self-interaction. To see this, one can imagine that the traditional force acting on the test particle is the electric force rather than gravity. Spacetime curvature does affect the electric force [13] and it represents the gravitational and electromagnetic fields interacting. Thus, for a gravitational force acting on test particle, the effect addressed by the present method is indeed gravity’s self-interaction. This picture of the method is shown in Fig. 1.

We will compute here how gravity’s field lines are distorted by a mass distribution in the same way as light is lensed by such distribution [14]. Then, the field line flux is computed to obtain the gravitational force including its self-interaction effects. The usual lensing formalism can be employed because electromagnetic and gravitational field lines are affected identically by curved spacetime, that is, in particle language, photons and gravitons follow the same null geodesics [15, 16].

Fig. 1
figure 1

Illustration of the background field method. The left sketch shows field lines between two masses or charges in flat spacetime. The right panel shows the field lines deformation in spacetime curved by a background field. In this article, the mass deforming spacetime is the total galactic mass and the effect on the field lines connecting two parts of the galaxy is determined by using the standard gravitational lensing formalism

2.2 Calculations and model of disk galaxy

The disk galaxy producing the background field is modeled as an axisymmetric homogeneous disk of surface brightness I(Rz) decreasing exponentially with projected radius R and altitude z according to the characteristic radial scale \(h_R\) and scale height \(h_z\), respectively. While the field lines originate from the whole mass distribution, considering only those that stem from the galaxy center happens to yield representative results, as it can be intuited from the large baryonic matter density around the galactic center and its fast decrease with radius. Thus, for simplicity we will first consider only field lines emerging from the galaxy center and refer the reader to Appendix A for the full calculation accounting for the fluxes originating from the whole mass distribution. We note that although the simplified and full calculations happen to yield similar results, the full calculation reveals that the central galactic nucleus plays a critical role.

We will compute numerically the distortion of the field lines. These originate from the galaxy center at angle \(\phi \) with respect to the \(z=0\) plane. A small angular deviation \(\delta \phi \) of a field line passing near a point mass \({\mathcal {M}}\) is approximately given by

$$\begin{aligned} \delta \phi = { {4G{\mathcal {M}}} \over {hc^2} }, \end{aligned}$$
(1)

where h is the impact parameter and c the speed of light. For our calculation, we model the galaxy disk as made of concentric rings of radius r, thickness \(\Delta r\) and height 2h. The ring masses \({\mathcal {M}}\) bend the field lines according to Eq. (1). Figure 2 shows a sketch of the rings and the bending of a field line. In addition, rings at altitudes \(- (2j+1)h\) (\(j \in {\mathbb {N}} \)) contribute to deflecting more the field lines toward the \(z=0\) plane, while rings at \(+ (2j+1)h\) deflect the field lines away from the \(z=0\) plane. Nevertheless, the dominant bending comes from the rings with mid-planes at \(z=0\), henceforth referred to as “central rings”.

Fig. 2
figure 2

Deflection \(\delta \phi \) of a ray (black plain line) by massive rings (blue dashed lines and red dotted lines) of thickness \(\Delta r\) and height 2h, symmetric around the \({\varvec{z}}\) axis and the horizontal \(z=0\) plane. One ring is represented, as well as the cross-sections of the others (rectangles). In addition to the central rings (i.e. of mid-plane at \(z=0\), represented with blue dashed lines), other rings above \(+h\) and below \(-h\) (their rectangular cross-sections are shown with red dotted lines) also contribute to the deflection, although their total effect largely cancels. The step \(\Delta r\) is small compared to the galactic characteristic radial scale \(h_R\). The height h is determined at each step by the bent ray

For disk galaxies, a good general model for the baryonic matter density is that it exponentially decreases with radius r and altitude z according the characteristic scales \(h_R\) and \(h_z\), respectively [17]. Thus, the mass \({\mathcal {M}}^j_k\) of a ring of radius \(r \equiv k\Delta r\) (\(k \in {\mathbb {N}} \)) and of horizontal mid-height plane at altitude \(z=jh(r)\) is [18]

$$\begin{aligned} {\mathcal {M}}^j_k&= { M^*_\mathrm {tot} \over {h_R^2 h_z}} \int _{(2j-1)h}^{(2j+1)h} \int _{k\Delta r/2}^{(k+1)\Delta r/2} r \mathrm{{e}}^{-r/h_R}\mathrm{{e}}^{-|z|/h_z} \mathrm{{d}}z\mathrm{{d}}r, \end{aligned}$$
(2)

with \(M^*_\mathrm {tot}\) the total mass of the galaxy (the label \(*\) signals that only baryonic mass is considered). Therefore, the mass of a central ring (\(j=0\)) of radius \( k\Delta r\) is, after integrating over \(-h \le z \le h\):

$$\begin{aligned} {\mathcal {M}}_k = {2M^*_\mathrm {tot} \over {h_R^2 }} ( 1-\mathrm{{e}}^{-|z|/h_z} ) \int _{k\Delta r/2}^{(k+1)\Delta r/2} r \mathrm{{e}}^{-r/h_R} \mathrm{{d}}r , \end{aligned}$$
(3)

and similarly upon integrating over \(k\Delta r/2 \le r \le (k+1)\Delta r/2\), we obtain:

$$\begin{aligned} {\mathcal {M}}_k = 2M^*_\mathrm {tot} ( 1-\mathrm{{e}}^{-|z|/h_z} ) ( (a_{-}+1 )\mathrm{{e}}^{-a_{-}} - (a_{+}+1 )\mathrm{{e}}^{-a_{+}} ) , \end{aligned}$$
(4)

with \(a_{\pm } \equiv (2k\pm 1)\Delta r/(2h_R)\). For non-central rings \(j\ne 0\) (which contribute less to the bending than \(j=0\) rings), the dz integration yields \(( \mathrm{{e}}^{-(2j-1)|z|/h_z} - \mathrm{{e}}^{-(2j+1)|z|/h_z} )\) rather than \(( 1-\mathrm{{e}}^{-|z|/h_z})\). Therefore, their mass is

$$\begin{aligned} {\mathcal {M}}^j_k = { {\mathcal {M}}_k \over {2(1-\mathrm{{e}}^{-|z|/h_z})}} ( \mathrm{{e}}^{-(2j-1)|z|/h_z} - \mathrm{{e}}^{-(2j+1)|z|/h_z} ). \end{aligned}$$
(5)

The angular bending from a ring is then given by:

$$\begin{aligned} \delta \phi (r,z) = { {G{\mathcal {M}}^j_k} \over {\pi c^2}}E(r,z) , \end{aligned}$$
(6)

where is the complete elliptical integral of the first kind. The case described in this article is that of pure disk galaxies i.e. Hubble types 5 or 6. Using the method for earlier Hubble types, with prominent bulges, would require modifying the mass density distribution but does not fundamentally change the method. An example of a calculation is visualized with raytracing in Fig. 3. For this example, we used densities larger than those typical of galaxies to make the bending of the field lines conspicuous. The bending for actual galaxies densities is small but, as explained next, the ensuing effect is magnified at large distances, making its consequence on galactic rotation sizable at large distances.

Fig. 3
figure 3

Field lines for a disk centered at \(x=y=z=0\), symmetric about \({\varvec{z}}\), and of density decreasing exponentially with radius r. Left: without field self-interaction. Right: with self-interaction. For clarity, only field lines emerging with \(| \phi | < 0.005^\circ \) from the center of the disk are drawn, and the vertical scale on the bottom right panel is zoomed in. Since densities typical of galaxies cause a small bending, larger densities were used here to make the effect clearly visible. The figure showing an academic case rather than a physical one, the axis units are therefore unspecified. The top panels show the lines in the radial plane (\(x,y=0,z\)). The dashed lines on the right top plot are straight lines with \(| \phi | = 0.005^\circ \), i.e. the envelope of the generated lines when self-interaction is not accounted for. The yellow band shows the area where the lines in the (\(x,y=0,z\)) plane are approximately parallel, yielding a 1/r behavior of the force

Fig. 3 elucidates two important facts:

  1. (1)

    Although the mass density, and thus the force, is larger at small radii, the difference between the field lines in the right panel and those in the left panel increases with r. Hence, even if the force is strongest at small r, the consequence of self-interaction is more evident at larger r. This explains why the missing mass discrepancy is largest at larger r where fields are weak and, seemingly paradoxically, local self-interaction is not important (\(\delta \phi \approx 0\)). For example, the field lines for \(r > rapprox \) 4 are straight, i.e. \(\delta \phi \approx 0\), but are also roughly parallel to each other in a radial plane and for small z. This implies a force with a r-dependence differing from \(1/r^2\). This difference would generate a large missing/dark mass in a Newtonian analysis. In contrast to this straightforward consequence and to empirical observations, dark matter halo models tend to predict large densities of dark matter at small r, which is known as the core-cusp problem [19]. Furthermore, this explains why the gravity modifications of MOND [20] are enabled below a small characteristic acceleration.

  2. (2)

    The field lines at small r display an approximately isotropic distribution, leading to the familiar \(1/r^2\)-dependence of gravity. At larger r and small z, indicated in Fig. 3 by a yellow band, the field lines, while still axisymmetric around \( {\varvec{z}}\), tend to become parallel within radial planes. This leads to a \(\sim 1/r\) behavior of the force and a logarithmic potential \(\ln (r)\) within a disk of height about 0.01 in the example of Fig. 3. For this thin disk, the rotation curves are flat. The field lines outside the yellow region show that gravity still acts outside the galaxy, albeit with depleted strength. This contrasts with the calculations of Refs. [2, 3] in which the field needed to be set to zero outside the disk.

The gravitational force in the radial direction, \(F_r(r,z)\), is computed as the flux \({\mathcal {F}}(r,z)\) of the gravity field through a small surface at radius r and altitude z. In practice in the numerical calculation, \({\mathcal {F}}(r,z)\) is given by the number of generated field lines passing through the small surface. The total number of generated field lines is chosen so that the small fraction of field lines passing through the surface is large enough to make discretization effects, viz the fact that \({\mathcal {F}}(r,z)\) is an integer (number of lines), negligible; see Appendix B. We verified that in the case without self-interaction (\(\delta \phi \equiv 0\)), the expected \(1/r^2\) dependence of the force is recovered. For an idealized galaxy with homogeneous mass distribution, the two-dimensional flux in a horizontal (z=constant) plane, \({\mathcal {F}}_\mathrm {h}\), must be unaffected by lensing due to the \( {\varvec{z}}\)-axisymmetry: in a z=constant plane, the bending from masses on one side of the field line is canceled by the bending from the masses on the other side. After verifying numerically that indeed \({\mathcal {F}}_\mathrm {h} \propto 1/r\), we greatly accelerated and simplified the numerical calculation by generating field lines only in the vertical plane (\(x,y=0,z\)) and recording the flux \({\mathcal {F}}_\mathrm {v}\) through a small vertical segment. Then, the radial component of the force is \(F_r(r,z) \propto {\mathcal {F}}_\mathrm {v}{\mathcal {F}}_\mathrm {h} \propto {\mathcal {F}}_\mathrm {v}/r\).

In this article, we are chiefly interested in the r-dependence of the force and how it affects galactic rotation curves. For the absolute determination of the force, and thus of the rotation speed, the proportionality constant between \(F_r(r,z)\) and \({\mathcal {F}}(r,z)\) can be determined by equating at small r the relative \(F_r(r,z)\) to the absolute Newtonian expectation, since for \(r \ll h_R\) the difference between \(F_r(r,z)\) and the Newtonian expectation is negligible, see Fig. 4. It is worth repeating however that while this difference is negligible for \(r \ll h_R\) because the effect of the field line distortions has not propagated yet, this region is at the origin of the significant difference at larger r, see discussion (1) at the beginning of this section.

2.3 Galaxy model and systematic studies

The physical parameters of the galaxy model and their chosen nominal values are: (1) the galaxy total mass \(M^*_\mathrm {tot}=3 \times 10^{11}M_{\odot }\). This is a typical baryonic mass of a large spiral galaxy; (2) the radial scale \(h_R\). The present radial scale for spiral galaxies is typically \(h_R \approx \) (0.5–5) kpc. However, \(h_R\) increases with time: in the Milky Way, for instance, \(h_R\) for old stars is about twice smaller than that of young stars [21]. The initial (smaller) radial scale must be chosen so we used \(h_R=1.5\) kpc; (3) the scale height \(h_z\). The stellar scale height of disk galaxies typically follows \(h_z^* \approx 0.1 h_R\), while the gas scale height \(h_z^\mathrm {gas}\) is smaller: \(h_z^\mathrm {gas} \approx (0.02\)\(0.04)h_R\). We used a typical value \(h_z \approx 0.03 h_R\) for two reasons: firstly, gas friction and attraction between the stars at same r but different z tend to make the the denser layers at small z to drag the less dense higher-z layers. Secondly, high-density gas produces the stars. Hence, the initial star distribution has a smaller \(h_z^*\) than the one observed in mature galaxies, as demonstrated by the smaller \(h_z^*\) of young blue stars compared to the \(h_z^*\) of older stars. Although \(h_z^*\) increases with time due to dispersion from two-body interactions, angular momentum conservation and the smallness of the vertical speed \(v_z\) acquired from two-body interaction, \(v_z \ll v_\theta \), impose that the rotation speed of the stars remains nearly constant. The rotation speed is thus largely determined by the force and density well within a small-z disk before star dispersion occurs. We varied the parameters within the ranges \(10^{11}M_{\odot }\) \(\le M^*_\mathrm {tot} \le 10^{12}M_{\odot }\), 0.5 kpc \(\le h_R \le \) 5 kpc and 0.01 kpc \(\le h_z \le \) 0.5 kpc, and found that the self-interaction effect mostly depends on \(M^*_\mathrm {tot}\) and \(h_z\), and less on \(h_R\).

In actual galaxies, the smooth decrease of the density with r and h is an average dependence. Measurements of HI gas density distribution show that it varies by as much as several times the average. It can be assumed that the stellar density fluctuates similarly. Since averaging is a linear operation, average quantities may not be adequate inputs for non-linear systems. Thus, to investigate the effect of density fluctuations, we randomly varied each \({\mathcal {M}}^j_k\) following a Gaussian distribution centered on the value that \({\mathcal {M}}^j_k\) would have assuming an exponential decrease, and of various widths, with tails truncated symmetrically so that \({\mathcal {M}}^j_k >0\). We found that the effect of the fluctuations is negligible, including the dependence on the width. Beside random density variations, systematic ones are also possible, e.g. a warp of the galactic disk. We show in Appendix C that such warping does not influence significantly the flux.

3 Results

The calculation result for the nominal galactic parameters is shown in Fig. 4 along with fits to the fluxes using the form \({\mathcal {F}}_\mathrm {v} = 1/r+\alpha r\), with \([\alpha ]=\text{ kpc}^{-2}\), and \({\mathcal {F}} \equiv {\mathcal {F}}_\mathrm {v}{\mathcal {F}}_\mathrm {h}= {\mathcal {F}}_\mathrm {v}/r\). Since the force depends on z, we averaged it over \(h_z\). (For our parameter values, averaging is equivalent to a calculation done with \(M^*_\mathrm {tot}\) reduced by 19%.)

Fig. 4
figure 4

Top: two-dimensional fluxes vs. radial distance r. Green line: horizontal flux \({\mathcal {F}}_\mathrm {h}(r)\). Red line: vertical flux \({\mathcal {F}}_\mathrm {v}(r)\). The modeled galaxy has a total mass \(M^*_\mathrm {tot} = 3 \times 10^{11}M_{\odot }\), a radial scale length \(h_R=1.5\) kpc (deprojection corrections are included), and a scale height \(h_z=0.03h_R\). The dashed black line is a 1/r curve to guide the eye. The dotted blue line shows the fit \({\mathcal {F}}_\mathrm {v} = 1/r+0.0042r\), for r in kpc. Bottom: total flux \({\mathcal {F}}(r)\) (red line) with fit \({\mathcal {F}}= 1/r^2+0.0042\) for r in kpc (dotted blue line). The dashed black line is the Newtonian \(1/r^2\) expectation

3.1 Rotation curves

Fig. 5
figure 5

Top: rotation curves generated using the results in Fig. 4. The red line is the rotation curve accounting for General Relativity’s field self-interaction. The black line is without self-interaction (Newtonian case). The dotted line is the quadratic difference of the two. This represents the missing/dark mass that one would need to introduce in an analysis employing Newton’s law, rather than General Relativity, in order to recover the red curve. Bottom: apparent mass contained within r. Each curve corresponds to one of the top panel rotation curves analyzed using Newton’s law. The red and dotted lines are the effective masses obtained from expressing the field self-interaction as an equivalent mass. The black line shows the actual mass

Once field self-interaction is effectively included in the calculation of \({\varvec{F}}(r)\), the Newtonian kinematical formalism can be used to obtain the rotation curves. The radial force is parameterized as

$$\begin{aligned} F_r(r) = (Gm_1m_2/r^2)[1+ \alpha r^2], \end{aligned}$$
(7)

with \(\alpha \) determined from the fit to the flux, see Fig. 4. This and the radial equilibrium condition for a body in circular orbit at distance r yield the rotation speed:

$$\begin{aligned} v_\theta (r) = \sqrt{GM(r)[1/r+ \alpha r]}. \end{aligned}$$
(8)

Here, M(r) is the mass , enclosed within deprojected radius r and determined by using the Abel equation applied to \(I(R)\propto \mathrm{{e}}^{-R/h_R}\), with R the projected radius. The resulting \(v_\theta (r)\) is shown in the top panel of Fig. 5 and displays the plateau that is typically observed at large r.

We remark that to derive \(v_\theta (r)\), the force is approximated as if M(r) were concentrated at \(r=0\). While this is exactly true only for spherically symmetric matter distributions and \(F \propto 1/r^2\) (Newton’s shell theorem), the sharp density peaking near the galaxy center makes this approximation acceptable in the present case. To check this, we performed a N-body simulation using both the Newtonian and the perturbative post-Newtonian formalisms and compared the resulting rotation curves to that obtained using the shell theorem. They are comparable. For the non-perturbative case, the field lines tend to become confined within the disk [2, 3] and in that limit, the resulting 1/r force makes the (now in two dimensions) shell theorem exactly valid again.

3.2 Effective dark mass profile

The effective dark mass profile can be obtained straightforwardly. In a dark matter framework \(F = GMm/r^2\) and , where \(M^*(r)\) and \(M_\mathrm {dark}(r)\) are the baryonic and dark masses enclosed within r, respectively. The equivalent dark mass profile is then:

$$\begin{aligned} M_\mathrm {dark}(r)=rv^2_\theta (r)/G -M^*(r). \end{aligned}$$
(9)

This is shown by the dotted line in the bottom panel of Fig. 5. Its rise with r illustrates the earlier discussion that the consequence of self-interaction is magnified at larger r despite negligible local self-interactions there.

4 Predictions and verifications

The model shows that the effect of field self-interaction depends mainly on the total baryonic mass \(M^*_\mathrm {tot}\), specifically on the galactic density at small r, and the scale height \(h_z\). The latest suggests that the inferred dark mass \(M_\mathrm {dark}\) of a galaxy correlates with its \(h_z\). We check this prediction by using two different sets of data [22, 23] that provide \(M_{dark}\) and disk characteristic scales. The first set, from Sofue [22], provides only the radial scale \(h_R\), but the proportionality relation, \(h_z \simeq \epsilon h_R\), allows us to perform the analysis (a correlation analysis is independent of the value of \(\epsilon \)). The second set, from Martinson et al. [23], provides both \(h_z\) and \(M_{dark}\). Figure 6 shows \(M_\mathrm {dark}/M^*_\mathrm {disk}\) vs. \(h_z\) for both sets (set [23] was recombined in 200 pc bins of \(h_z\) for clarity). We normalized \(M_\mathrm {dark}\) by the disk baryonic mass \(M^*_\mathrm {disk}\) to cancel the expected dependence of GR’s non-linearity with the baryonic mass, as well as other known relation between \(h_R\) (viz \(h_z\)) and luminosity or \(M^*_\mathrm {disk}\). The predicted correlation is clearly visible: fitting with \( M_\mathrm {dark}/M^*_\mathrm {disk} = ah_z^b\) yields \(b\ne 0\); \(b=-1.48 \pm 0.11\) for set [22] and \(-1.25 \pm 0.14\) for [23]. Although the galaxies in the two sets are distinct, the two values of b agree. We can also quantify the degree of correlation by calculating the Pearson linear correlation coefficient between \(\ln (M_\mathrm {dark}/M^*_\mathrm {disk})\) and \(\ln h_z\). It is \(-0.70\) and \(-0.69\) for sets [22, 23] respectively. Normalizing \(M_\mathrm {dark}\) to the total baryonic mass \(M^*_\mathrm {disk+bulge}\) yields similar results but with lower values of b: \(-\,1.10 \pm 0.13\) and \(-\,1.5 \pm 0.30\) for [22, 23], respectively. This is because no causal connection is expected between \(h_z\) and \(M^*_\mathrm {bulge}\). Also shown in Fig. 6 are the results of our model calculated for \(M^*_\mathrm {tot}=0.9\times 10^{11}\) M\(_\odot \) (the approximate average of \(M^*\) for set [22]) and \(h_R=1.5\) kpc. In Refs. [22] and [23], \(M_\mathrm {dark}\) is estimated at \(R_{200}\) which ranges from tens to hundreds of kpc. Hence, we calculated our effective dark mass up to \(r=100\) kpc. The result (not adjusted to the data) is shown by the plain line in Fig. 6. It agrees with the observations, with a \(\chi ^2\) similar to that of the \(ah_z^b\) fit.

Fig. 6
figure 6

Galactic dark mass \( M_\mathrm {dark}\) normalized to baryonic disk mass vs. vertical scale length \(h_z\). The squares show the data from Ref. [22] and triangles from [23]. \( M_\mathrm {dark}/M^*_\mathrm {disk}\) correlates with \(h_z\). The dashed and dotted lines give the best fit to Refs. [22, 23], respectively, assuming a \(ah_z^b\) form. The plain line is the lensing calculation described in this article

5 Summary

The values of galactic masses and characteristic distances suggest that field self-interaction, a feature of general relativity (GR), needs to be included in studies of galaxy dynamics [2, 24, 25]. Field self-interaction increases gravity’s strength compared to the Newtonian expectation. The effect will become noticeable in systems of large enough mass. In a Newtonian analysis of such systems, the effect would be misinterpreted as a missing mass (dark matter). Furthermore, the similarities between, on the one hand the GR’s Lagrangian and observations linked to dark matter and energy, and on the other the QCD Lagrangian and hadron structure phenomenology, offer another compelling reason to investigate GR self-interaction as an explanation for the universe’s dark content. Finally, the exclusion by direct searches of most of the natural phase space for WIMP and axion candidates, and the emergence of dark energy from GR self-interactions [26], make this explanation of the missing mass problem more plausible.

In this article, we presented a new approach to compute the effects of GR’s self-interaction based on a mean-field technique and the formalism of gravitational lensing. We find that self-interaction effects are important for galaxy dynamics and tend to flatten rotation curves. This agrees with Refs. [2, 3] in which a different method (numerical lattice calculation of path integrals) was used to account for field self-interaction. The present method is faster and applicable to any mass distribution. However, it is less directly based on GR’s equations than the path integral approach. Beside flattening the galactic rotation curves, the method explains straightforwardly why the missing mass discrepancy worsens at large galactic radii. In contrast, dark matter halo models must be tuned to the specific density profile of a given galaxy to flatten its rotation curve (disk-halo conspiracy), and they tend to predict larger dark matter density at small radii (core-cusp problem). The present approach predicts a correlation between the missing mass inferred in galaxies and their vertical scale length. We verified this correlation with two separate data sets.