1 Introduction

The quest for understanding the true nature of the dark components of our Universe is expected to enter a new exciting era of deep insights due to the unprecedented quality data which missions like the James Webb Space Telescope, Euclid and the Square Kilometer Array will provide us in the very next future. In the wait for such new data, the present observational state-of-the-art [1] teaches us a lot about both Dark Matter (DM) and Dark Energy (DE), but yet not enough to boast unequivocal claims about them.

In this work we are going to focus more specifically on DM. DM has a long history, starting from the very first claim for its existence [2], passing through collecting decisive evidences [3], and arriving to confirmation(s) by the most varied cosmological and astrophysical probes [4,5,6]. Although nowadays many dedicated experiments have been realized and are led to detect DM (see, e.g. [7,8,9,10,11]), we do not have any proof of any theorized DM candidate particles [12]. Moreover, we also know that most (if not all) the theoretical scenarios we have to explain its nature, origin and dynamics, have many problems [13].

In order to shed some light onto it, we decided to follow a very specific and (until not long ago) less-conventional approach: the DM problem (as well as the DE one) might arise due to the assumption of General Relativity (GR) as the ultimate theory of gravity in our standard cosmological picture. Thus, a possible solution could be to extend GR into a more general gravity theory. This is the approach of the so-called Extended Theories of gravity (ETGs). The ways in which GR might be extended are almost uncountable [14], in theory, but practically a gravity modification can not be done arbitrarily. Indeed, each and every proposed ETG must reduce to GR at Solar System scales, where GR has been proven to work perfectly. Moreover, in order to preserve the Equivalence Principle, there must be a screening mechanism [15,16,17], that suppresses at small scales any large-scale gravity modifications.

In [18], we started to investigate a specific class [19, 20] of ETGs called Degenerate Higher-Order Scalar Tensor (DHOST) theories [19,20,21,22,23], which represent a generalization of the Beyond Horndeski theories [24] (which in turn generalize the well known Horndeski’s theory of gravity [25]). The main characteristic of such a class which drove our attention, is that it can be characterized by a partial breaking of the corresponding screening mechanism, the so-called Vainshtein screening [26]. This means that the gravity modifications which are introduced at large cosmological scales to specifically mimic GR-based DE models, might leak onto smaller astrophysical scales, and thus might play some role as “effective” DM.

In [18] we tested the chosen model with a sample of 16 high-mass galaxy clusters belonging to the Cluster Lensing and Supernova survey with Hubble (CLASH) program [27] for which we had data available from two complementary probes, namely X-ray and strong-and-weak gravitational lensing observations. We considered two scenarios. In the first one, we assumed the DHOST model only as an alternative to DE, so that the internal dynamics of the clusters would be ruled by standard DM plus some possible “external” influence of the DHOST from cosmological scales. In the second one, instead, we assumed that the DHOST was playing both the role of DE and of DM in its entirety. In the former case, the DHOST model showed a mild Bayesian evidence over GR, although it alleviated the discrepancy present in GR between X-ray hydrostatic and lensing mass estimates. In the latter case, GR still seemed to be statistically mildly favored with respect to the DHOST model.

Here we continue to explore the same DHOST model at galactic scales. In particular, we analyze the internal kinematics of a class of low surface brightness galaxies, called Ultra-diffuse galaxies (UDGs) [28, 29]. Quite recently one of them, NGC1052-DF2, has been under deep scrutiny because it seems to be characterized by an abnormally low amount of DM [30, 31]. It has become (in)famously known as the “lacking DM galaxy”, although such strong statement might be deemed as too much overhasty, as suggested by the debate which has followed. Nevertheless, such deficiency in DM content must be explained somehow, and it has even been considered as a crucial deathblow to ETGs, because “\(\ldots \)a dark matter signature should always be detected, as it is an unavoidable consequence of the presence of ordinary matter” [30]. Thus, we think it is quite interesting to analyse it and verify if such claims are correct.

This lack of DM is indeed strange since DM plays a crucial role in the galaxy formation process, as galaxies are supposed to form due to the cooling and condensation of gas in potential wells of DM halos. In particular, UDGs have been detected primarily in highly dense environment [28, 32, 33], but also in galaxy groups and even in voids [29, 34,35,36]. Their presence in different environments demonstrates that their low DM content could be a consequence of the interaction with the surrounding environment [37,38,39,40,41], or due to specific internal processes [42,43,44]. Indeed, hydrodynamical simulations [40, 45, 46] have shown that it is even possible to create a galaxy lacking DM in the standard cosmological scenario, and in [47,48,49] it has been suggested that NGC1052-DF2 may have formed as a consequence of high velocity collisions of gas-rich galaxies. In a most recent work [50], it was added that it might represent a sample of a possibly larger family, as a total of eleven low surface galaxies with the same properties have been found.

Further explanations regarding NGC1052-DF2 have been put forward. In [51] it is claimed that NGC1052-DF2 might be located at a distance of 13 Mpc, lower than the \(\sim 19\) Mpc from [30], allowing enough room for enough DM to be in full concordance with the standard picture and thus discarding any peculiarity. Other measurements [52] seem to confirm instead a larger distance, pushing it up to 22.1 Mpc. In [53] it is claimed that the uncertainties on the mass estimates have been underestimated and they are actually much larger, so that only a weak inference can be performed. In [54] it is shown how the choice of the tracer densities might have an influence. And in [55] the point is raised about how to analyze in the proper statistical way the observational data, as it might be that they are not really Gaussian distributed.

It is thus clear that the measurement of the dynamical mass of these objects from kinematical and dynamical data is crucial to discriminate between several formation processes and even to confute (or not) ETGs. Due to their low surface brightness, which implies a low signal-to-noise ratio, a measurement of the dynamical mass of UDGs through stellar velocity dispersion is not affordable. An alternative method, particularly well suited for UDGs, is based on the dynamics of the globular clusters (GCs) within them. The poor-gas galaxy NGC1052-DF2 was observed with the Dragonfly Telescope Array [56]Footnote 1 and its dynamics was studied in detail in [30, 58, 59]) exactly using GCs. From such observations it was inferred for the first time the possibility that it could host a very low amount of DM, pointing toward the possibility of a galactic dynamics mainly dominated by stars.

As anticipated above, the very low amount of DM shown by NGC1052-DF2 has been seen since the beginning as a critical problem for ETGs. Some attempts have been made to explain its dynamics within the Modified Newtonian Dynamics (MOND) scenario [60]. However, as specified in [30], within the MOND framework the expected velocity dispersion of NGC1052-DF2, derived using its population of GCs, appears to be a factor of two higher than the \(90\%\) of the upper limit on the observed velocity dispersion.

In [61, 62], the dynamics of NGC1052-DF2 has been explored in many different ETGs scenarios. In addition to MOND, the authors considered MOND with External Field Effect (EFE) [63], Weyl conformal gravity [64, 65] and Moffat’s Modified Gravity (MOG) [66]. In [62], it has been shown that the MOG and the MOND with (EFE) [67, 68] provide a good fit to the data, while the Weyl conformal gravity fits acceptably the data concluding that NGC1052-DF2 does not imply a dead-end for ETGs.

For all the above reasons, we are interested here to study if the dynamics of NGC1052-DF2 is a trouble for the DHOST model we have chosen. As done in [18], we will test the reliability of two different scenarios. In the first one, we will study the internal kinematics of NGC1052-DF2 with the DHOST model as an alternative to DE only. In the second scenario, we will test a unified scenario of DE and DM, where both have influence due to the partial breaking of the Vainshtein screening mechanism, with no need of adding any ad hoc DM component.

The paper is organized as follows: in Sect. 2 we introduce the DHOST model and the theoretical basis for the analysis of NGC1052-DF2 dynamics; in Sect. 3 we present how we model the galaxy and each of its mass components; in Sect. 4 we explain how we performed our statistical analysis; in Sect. 5 we present the results and the implications of our analysis; in Sect. 6 we draw our conclusions.

2 Theoretical model

The DHOST theory we consider here [19, 23, 69] exhibits a Vainshtein screening mechanism which is partially broken, leading to the gravitational (\(\varPhi \)) and metric (\(\varPsi \)) potentials:

$$\begin{aligned} \frac{d\varPhi }{dr}&= \frac{G_NM(r)}{r^2} + \varXi _1 G_N M''(r), \end{aligned}$$
(1)
$$\begin{aligned} \frac{d\varPsi }{dr}&= \frac{G_NM(r)}{r^2} + \varXi _2 \frac{G_N M'(r)}{r} + \varXi _3 G_N M''(r), \end{aligned}$$
(2)

where: \(G_N\) is the measured gravitational constant that might be different from the bare one G defined through the Planck Mass \(M_{Pl} = (8\pi G)^{-1}\); M(r) is the spherical mass enclosed in the radius r; \(M'(r)\) and \(M''(r)\) are respectively the first and second order derivative of the mass with respect to the radius r; and \(\varXi _{1,2,3}\) (using the notation of [70]) are the three coupling parameters that characterize the model (they all go to zero in the GR limit). Here we are not interested in a cosmological analysis of the DHOST model and through this work we assume that it reproduces fairly well a \(\varLambda \)CDM background cosmology. Qualitative considerations supporting this statement can be found in [71]; a more quantitative comparison to data (although using only Planck 2018 power spectra) is in [72, 73]. When needed, we will consider a \(\varLambda \)CDM background with \(H_0 =67.74\) km s\(^{-1}\) Mpc\(^{-1}\), \(\varOmega _m =0.3089\) and \(\varOmega _\varLambda = 0.6911\) [4].

The parameters \(\varXi _{1,2,3}\) can be written in terms of more fundamental Effective Field Theory (EFT) constants [20]. Taking into account the constraints which can be derived from multi-messenger observation of GW170817 [74], such relations can be expressed as [71, 75,76,77,78]:

$$\begin{aligned} \varXi _1= & {} -\frac{1}{2}\frac{(\alpha _H+\beta _1)^2}{\alpha _H+2\beta _1}, \end{aligned}$$
(3)
$$\begin{aligned} \varXi _2= & {} \alpha _H,\end{aligned}$$
(4)
$$\begin{aligned} \varXi _3= & {} -\frac{\beta _1}{2}\frac{(\alpha _H+\beta _1)}{\alpha _H+2\beta _1}, \end{aligned}$$
(5)
$$\begin{aligned} \gamma _0= & {} - \alpha _H - 3 \beta _1, \end{aligned}$$
(6)

where \(\gamma _0\) is the fractional difference between G and \(G_N\),

$$\begin{aligned} \gamma _0 = \left( 8 \pi M_{PL}^{2} G_N \right) ^{-1}-1. \end{aligned}$$
(7)

The EFT parameters are: \(\alpha _H\), which is connected to the kinetic mixing between matter and the scalar field introduced by the DHOST theory [20]; and \(\beta _1\), which parameterizes the presence of higher-order operators in the lagrangian [20]. Note also that we assume \(G_N\) fixed at its measured value, so that \(\gamma _0\) is fully determined by \(\alpha _H\) and \(\beta _1\), while G should be derived from them. The \(\varLambda \)CDM limits for these two parameters are \(\alpha _H,\beta _1 \rightarrow 0\).

2.1 Galactic dynamics theory

The internal kinematics of NGC1052-DF2 can be described using the Jeans equation (with the assumptions of spherical symmetry and collisionless tracers population):

$$\begin{aligned} \frac{d(l(r)\sigma ^2_r(r))}{dr} + \frac{\beta (r)}{r}(l(r)\sigma ^2_r(r)) = l(r)\frac{d\varPhi (r)}{dr}, \end{aligned}$$
(8)

where l(r) is the luminosity density of the galaxy and \(\beta (r)\) is the anisotropy parameter [79],

$$\begin{aligned} \beta (r) = 1 - \frac{\sigma ^2_t(r)}{\sigma ^2_r(r)}, \end{aligned}$$
(9)

where \(\sigma _t\) is the tangential velocity dispersion (defined as a combination of the two angular components of the velocity dispersion tensor \(\sigma ^2_t = (\sigma ^2_\theta + \sigma ^2_\phi )/2\)), and \(\sigma _r\) is the radial component of the velocity dispersion tensor. If \(\beta = 0\) the system is fully isotropic; if \(\beta = 1\) the system is defined as purely radial; if \(\beta \rightarrow -\infty \) the system is purely tangential.

The information about the gravity theory is fully encoded in the gravitational potential \(\varPhi (r)\) on the right-hand side of Eq. (8). Consequently, it is possible to study the internal kinematics of a galaxy using in our modified gravity scenario including Eq. (1) in Eq. (8).

In GR, integrating once equation Eq. (8) allows us to solve the Jeans equation finding [80]

$$\begin{aligned} l(r)\sigma ^2_r(r) = \frac{1}{f(r)}\int ^\infty _rds\, f(s)l(s)\frac{M(s)}{s^2}, \end{aligned}$$
(10)

with

$$\begin{aligned} \frac{d\log f(r)}{d\log r} = 2\beta (r), \end{aligned}$$
(11)

where f(r) depends on the specific parametrization used for the anisotropy parameter \(\beta (r)\). Projecting Eq. (10) along the line of sight, we define the line-of-sight (los) velocity dispersion as

$$\begin{aligned} \sigma ^2_{los}(R)&= \frac{2}{I(R)}\int ^\infty _R dr\,r\frac{l(r)\sigma ^2_r(r)}{\sqrt{r^2 - R^2}} \nonumber \\&\quad -R^2\int ^\infty _R dr\,\beta (r)\frac{l(r)\sigma ^2_r(r)}{r\sqrt{r^2 - R^2}}, \end{aligned}$$
(12)

where R is the projected radius, and I(R) is the stellar surface brightness.

Finally, inserting Eq. (10) into Eq. (12), one can obtain the velocity dispersion along the line as

$$\begin{aligned} \sigma ^2_{los}(R) = \frac{2G_N}{I(R)}\int ^\infty _Rdr\, K\biggl (\frac{r}{R}\biggr )l(r)M(r)\frac{dr}{r} \end{aligned}$$
(13)

where K(r/R) is the kernel function whose expression depends on the specific parametrization of the anisotropy parameter \(\beta (r)\) (see [80] for more details).

In order to generalize Eq. (13) to our DHOST scenario, we notice that Eq. (1) can be written as

$$\begin{aligned} \frac{d\varPhi }{dr} = \frac{G_NM_{eff}(r)}{r^2}, \end{aligned}$$
(14)

where we define the effective mass as \(M_{eff}(r) = M(r) + \varXi _1 r^2 M''(r)\). In such a way, Eq. (13) simply becomes

$$\begin{aligned} \sigma ^2_{los}(R) =\frac{2G_N}{I(R)}\int ^\infty _Rdr\, K\biggl (\frac{r}{R}\biggr )l(r)M_{eff}(r)\frac{dr}{r}. \end{aligned}$$
(15)

Note also that by using Eq. (15) we can put direct constraints on \(\varXi _1\) only, which in turn will result on indirect correlated constraints on \(\{\alpha _{H}, \beta _1\}\).

3 Data and galaxy model

The UDG NGC1052-DF2 was initially identified using the Dragonfly Telescope Array in [81]. Structural photometric parameters, like the size, surface brightness, magnitude and color, have been measured in combination with the Hubble and reported in [30]. Here we use such data.Footnote 2

The spectroscopy of the compact objects associated with the galaxy was realized with the W. M. Keck Observatory in two different runs. The first one was carried out using the Deep Imaging Multi-Object Spectrograph (DEIMOS) [83] on Keck II, and the last one with the Low-Resolution Imaging Spectrometer (LRIS) [84] Keck I. Ten compact objects similar to globular clusters have been observed, for which both radial velocities and velocity dispersion could be measured [30].

The dynamics of NGC1052-DF2 has been studied in [52, 58, 59, 85]. In the following sections we describe our galaxy modelling and the differences between our analysis and the literature.

3.1 Stellar component

The stellar component of the galaxy is modeled as a single Sérsic profile

$$\begin{aligned} I(R) = I_{0} \exp \left[ -\left( \frac{R}{a_{s}}\right) ^{1/n}\right] , \end{aligned}$$
(16)

with: \(I_0\), the central surface brightness; \(a_s\), the Sérsic scale parameter; n, the Sérsic index. The Sérsic scale parameter \(a_s\) is generally expressed in terms of the effective (half-light) radius \(R_{eff}\) by the relation

$$\begin{aligned} a_s = \frac{R_{eff}}{(b_n)^n} \end{aligned}$$
(17)

with \(b_n = 2n - 0.33\) [86]. The photometric analysis presented in [30] fixes the Sérsic index \(n = 0.6\), the effective radius \(R_{eff} = 22.6\) arcsec and \(I_0 = 24.4\) mag arcsec\(^{-2}\) in the \(V_{606}\) band. At the fiducial distance \(D = 20\) Mpc of [30], this corresponds to an absolute magnitude \(M_V= -15.4\) and luminosity (in solar units) \(L_{tot} = 1.12\cdot 10^8 L_\odot \).

The luminosity density could be derived by Abel inversion of Eq. (16), but no analytical expression is known for a Sérsic with free n. In [87] it is shown that it can be well approximated by a function

$$\begin{aligned} \ell (r) =\ell _1\widetilde{\ell }(r/a_s), \end{aligned}$$
(18)

with:

$$\begin{aligned}&\widetilde{\ell }(x) \simeq x^{-p_n} \, \exp (-x^{1/n}), \end{aligned}$$
(19)
$$\begin{aligned}&\ell _{1} = \frac{L_{\mathrm {tot}}}{4 \pi \, n \, \varGamma [(3-p_n)n] a_{s}^{3}}. \end{aligned}$$
(20)

The function \(p_n\) is defined in [88] as:

$$\begin{aligned} p_n \simeq 1.0 - 0.6097/n + 0.05463/n^2, \end{aligned}$$
(21)

and the total galaxy luminosity is

$$\begin{aligned} L_{tot} = 10^{-0.4(m_{V_{606}} - \mu (D) - M_{\odot ,V_{606}})} \end{aligned}$$
(22)

where \(m_{V_{606}}\) is the apparent magnitude of the galaxy; \(\mu (D) = 5\log _{10}D + 25\) is the distance modulus, with the distance to the galaxy, D, expressed in Mpc; and \(M_{\odot ,V_{606}}\) is the total magnitude of the Sun in the \(V_{606}\) band [89].

To get the mass density we have to multiply Eq. (18) by the light-to-mass ratio \(\varUpsilon \),

$$\begin{aligned} \rho _*(r) = \varUpsilon \ell \biggl (\frac{r}{R_e}\biggr ). \end{aligned}$$
(23)

Finally, the total mass enclosed in the radius r can be derived by integrating Eq. (23), and can be expressed analytically as

$$\begin{aligned}&M_*(<r) = 2\pi n \varUpsilon I_0 \left( \frac{R_{eff}}{b^n_n}\right) ^2\frac{\varGamma (2n)}{\varGamma [(3 - p_n)n]} \end{aligned}$$
(24)
$$\begin{aligned}&\quad \times \left\{ \varGamma [(3 - p_n)n] - \gamma \left[ (3 - p_n)n,b_n\left( \frac{r}{R_{eff}}\right) ^{1/n}\right] \right\} , \end{aligned}$$

where \(\varGamma \) and \(\gamma \) are the Euler and the incomplete gamma functions respectively.

As it is well known, Eq. (8) is characterized by a degeneracy between the radial dispersion profile and the velocity anisotropy, which is practically not measurable. This requires assumptions on the anisotropy parameter \(\beta (r)\) in order to correctly recover the mass profile of the galaxy. We consider two different possibilities. The first case is a constant anisotropy profile, \(\beta (r)\equiv \beta _c\). Then, we also analyze a model with a radial anisotropy profile, first proposed in [90] and then found specifically appropriate for UDGs in [91], given by

$$\begin{aligned} \beta (r) = \beta _0 + (\beta _\infty - \beta _0)\frac{r}{r + r_a}, \end{aligned}$$
(25)

where \(\beta _0\) is the inner anisotropy (at \(r = 0\)); \(\beta _\infty \) is the outer anisotropy (at \(r = \infty \)); and \(r_a\) is the scale radius of profile. This radial anisotropy functional form is a monotonic increasing function that reduces to a constant anisotropy \(\beta \equiv \beta _c = \beta _0\) if and only if the inner and the outer anisotropy contributes coincide. Each form of \(\beta (r)\) implies in Eq. (13) a specific Kernel function, as tabulated in [80].

Thus, in our analysis the free parameters for the stellar component will be \(\{\varUpsilon , D, \beta _c\}\) or \(\{\varUpsilon , D, \beta _0, \beta _{\infty }, r_a\}\) depending on the considered velocity dispersion model. It is worth to stress here that a change in the distance D introduces a change in the conversion factor between arcsec and kpc, thus affecting both the values of the Sérsic parameters \(R_{eff}\) and \(I_{0}\), when expressed in kpc and solar units, respectively, and of the data of the dispersion curve, which in [30] is provided in kpc (at their fiducial distance).

3.2 Dark matter component

When we assume a DM component, we consider a generalized Navarro–Frenk–White density profile (gNFW) [92,93,94]

$$\begin{aligned} \rho _{gNFW}(r) = \rho _s\biggl (\frac{r}{r_s}\biggr )^{-\gamma }\biggl (1 + \frac{r}{r_s}\biggr )^{\gamma - 3} \end{aligned}$$
(26)

where \(\rho _s\) and \(r_s\) are the characteristics NFW density and radius parameters and \(\gamma \) represents the inner log-slope. For \(\gamma = 1\) we recover the typical NFW density profile of DM halos [95, 96], but Eq. (26) is preferred because it has more freedom and (if needed) can recover a larger variety of inner profiles than the standard NFW profile.

From Eq. (26) we can derive the mass profile, as

$$\begin{aligned} M_{DM}(<r)&= \frac{4 \pi \rho _{s} r^{3}_{s}}{3-\gamma } \left( \frac{r}{r_{s}} \right) ^{3-\gamma }\, \nonumber \\&\quad \times \, _2F_1\left[ 3 - \gamma ,3- \gamma ,4 - \gamma ,- \frac{r}{r_s}\right] , \end{aligned}$$
(27)

with \(_2F_1\) being the hypergeometric function.

When working with DM, it is common to introduce the concentration parameter, \(c_{\varDelta }\), defined as

$$\begin{aligned} c_{\varDelta } = \frac{r_{\varDelta }}{r_s}, \end{aligned}$$
(28)

where the \(\varDelta \) means that all quantities are calculated at the radius \(r_{\varDelta }\), where the density of the system is \(\varDelta \) times the critical density of the Universe, \(\rho _c(z)\), at the same redshift of the object. In our case, we consider \(\varDelta =200\), the so-called virial value, and the critical density of the Universe reads,

$$\begin{aligned} \rho _{c}(z) = \frac{3 H^{2}(z)}{8\pi G_{N}}, \end{aligned}$$
(29)

where H(z) is from the fiducial \(\varLambda \)CDM cosmology we have defined in Sect. 2, and \(z= 0.004963\) is the redshift of NGC1052 derived from the NED database.Footnote 3

Thus, we can introduce the virial mass, \(M_{200}\)

$$\begin{aligned} M_{200} = \frac{4\pi }{3}200\rho _cr^3_{200}, \end{aligned}$$
(30)

from which we can express the scale density \(\rho _s\) as

$$\begin{aligned} \rho _s = \frac{200}{3}\rho _{c}(z) \frac{(3-\gamma )(c_{200})^\gamma }{_2F_1[3 - \gamma ,3- \gamma ,4 - \gamma ,- c_{200}]}. \end{aligned}$$
(31)

For convenience, the free parameters for the gNFW component in our statistical analysis will be \(\{c_{200},M_{200},\gamma \}\).

4 Statistical analysis

Assuming a Gaussian likelihood, the \(\chi ^2\) for the data we have is defined as

$$\begin{aligned} \chi ^2({{\varvec{\theta }}}) = \sum _{i}^{{\mathcal {N}}_{data}} \frac{(v_i - v_{sys})^2}{\sigma _{i}^2} + \ln \left( 2\pi \sigma _{i}^2\right) \end{aligned}$$
(32)

where: \({\mathcal {N}}_{data}=10\) is the number of observed GCs within NGC1052-DF2; \(v_{sys}\) is the systemic velocity of NGC1052-DF2 (and thus of the ten GCs); \(v_{i}\) are the observed velocities of the ten GCs (at given distances from the center of the galaxy, \(R_{i}\)); \(\sigma _{i}^2 = \sigma ^2_{los,i}({\varvec{\theta }}) + \sigma ^2_{v_i}\) is the total error budget on the velocities \(v_{i}\), with \(\sigma _{v_{i}}\) the measurement uncertainty and \(\sigma ^2_{los,i}({\varvec{\theta }})\) the velocity dispersion, which explicitly depends on the model parameters (see Eqs. (13) and (15)); and \({\varvec{\theta }}\) is the vector of the model parameters. When we work with GR and with a constant anisotropy profile, \({\varvec{\theta }} = \{v_{sys},D,\varUpsilon ,\beta _c,c_{200_c},M_{200},\gamma \}\); instead with a radial anisotropy, given by Eq. (25), \({\varvec{\theta }} = \{v_{sys},D,\varUpsilon ,\beta _0,\beta _\infty ,r_a,c_{200_c},M_{200},\gamma \}\). When the DHOST model is considered as DE only we have \({\varvec{\theta }} = \{v_{sys},D,\varUpsilon ,\beta _c,c_{200_c},M_{200},\gamma ,\varXi _1\}\) taking a constant anisotropy parameter and \({\varvec{\theta }} = \{v_{sys},D,\varUpsilon ,\beta _0,\beta _{\infty },r_a,c_{200_c},M_{200},\gamma ,\varXi _1\}\) with a radial one. When the DHOST model is both DE and DM, we have \({\varvec{\theta }} = \{v_{sys},D,\varUpsilon ,\beta _c,\varXi _1\}\) and \({\varvec{\theta }} = \{v_{sys},D,\varUpsilon ,\beta _0,\beta _{\infty },r_a,\varXi _1\}\).

Additionally, we also apply one control and a series of priors. The control is of physical nature: \(\sigma _{los}>0\) at any distance R from the center which is sampled by the data. This control is designed to check if the MCMC explores a region of the parameters which, even if satisfying the priors which we define in the following, may still return an unphysical value for the velocity dispersion.

Moreover, we apply: on the systemic velocity, a Gaussian prior \(v_{sys} = 1801.6\pm 5\) km s\(^{-1}\) [58]; on the mass-to-light ratio, another Gaussian prior \(\varUpsilon = 1.7 \pm 0.5\), derived from stellar population studies [30, 97]; while on the dispersion velocity parameters, we have a lognormal prior \(\log \left( 1-\beta _{i}\right) = 0 \pm 0.5\) km s\(^{-2}\) (where \(\beta _{i} = \{\beta _c,\beta _0,\beta _{\infty }\}\)) on the range \(\beta _{i}\in [-10,1]\) [58].

4.1 Prior on the distance

The distance of the galaxy NGC1052-DF2 is a sensitive element in the analysis of this galaxy, as it has been envisaged [51] that a different (closer) location might resolve at all the strangeness of a galaxy lacking dark matter.

In [30] the distance is estimated by the Surface Brightness Fluctuations (SBF) method [98, 99] and is \(D= 19.0 \pm 1.7\) Mpc. In [58] a compatible result always derived from SBF is \(D = 19.0 \pm 1\) Mpc. In [52], the distance is estimated using the method of the tip of the red-giant branch (TRGB) [100]. Measurements with the Advance Camera for Surveys (ACS) of Hubble give a result of \(D = 22.1 \pm 1.2 \) Mpc. This measurement completely rules out the one from [51], of \(D = 13\) Mpc, and even states that the ten GCs objects are more luminous than previously measured. This is, eventually, the prior we apply on the distance D.

4.2 Priors on \(c_{200}\) and \(M_{200}\)

Given the limited extension (in distance from the center of NGC1052-DF2) of the GCs data, to leave the DM parameters \(c_{200}\) and \(M_{200}\) totally free, with only some uninformative flat prior on them, results in having such parameters totally unconstrained. For such a reason, the fit is performed applying a lognormal prior on \(c_{200}\), taking advantage of the many \(c-M\) relations which can be found in literature.

One of the most used is from [101], but it does not really cover the mass range of the UDGs in which we are more interested. In [58] the authors use the \(c-M\) relation from [102], which requires the calculation of too many cosmologically-related quantities, and we want to avoid this as it would imply to assume too many things both at the background and at the perturbative level, when we do not really have a fully detailed literature about DHOST in the context of matter power spectrum and growth of perturbations. We must point out also that all the \(c-M\) relations available in literature are based on GR simulations. Thus, any usage of them in an ETG context is somehow extrapolated.

In this work we have chosen the relation provided by [103], which separately considers different redshifts ranges and is updated to a Planck 2015 cosmology (see their Appendix B1). The standard deviation we apply is \(\sigma _{\log c_{200}} = 0.16\) dex.

We additionally consider two different scenarios for the prior on the halo mass \(\log M_{200}\): one lognormal prior based on the stellar-to-halo mass relation (SHMR) from [104], which constrains \(M_{200}\) from the total stellar mass \(M_{*} = \varUpsilon L_{tot}\), with dispersion 0.3 dex; and one with no SHMR, in which we have an uniformative flat prior on \(\log M_{200}\in [2,15]\). For the gNFW slope parameter \(\gamma \), we consider a uniform flat prior on the range [0, 2].

Finally, it is important to stress that when we work in the scenario in which the DHOST might play the role of an effective DM component, we do not apply any prior at all on such possible effective-DM behaviour, and we leave the parameter \(\varXi _1\) totally free.

4.3 Bayesian analysis

The total \(\chi ^2\), taking into account all the priors, is minimized using our own Monte Carlo Markov Chain (MCMC) code, whose convergence is checked using the method developed in [105].

After having performed the fit, the next most important step is to compare the GR and the DHOST scenario in a reliable way. For this, we compute a series of quantities which are recognized nowadays as the most accurate for this goal [1]. Taking advantage of the outputs of our MCMCs, which directly provide the posteriors for each parameters, we use our own implementation of the nested sampling algorithm described in [106] to calculate them with the appropriate modifications.

First of all, we calculate the evidence \({\mathcal {E}}_{i} \equiv {\mathcal {E}}({\mathcal {M}}_{i})\), defined as the probability of the data \({\mathbf {d}}\) given the model \({\mathcal {M}}_{i}\) with a set of parameters \({\varvec{\theta }}\),

$$\begin{aligned} {\mathcal {E}}_{i} = \int d{\varvec{\theta }}\, {\mathcal {L}}_{i}({\varvec{\theta }}) \pi _{i}({\varvec{\theta }}) , \end{aligned}$$
(33)

where \({\mathcal {L}}_{i} \propto \exp ^{-\chi _{i}^2}\) and \(\pi _{i}\) are respectively the likelihood and the prior function of the model \({\mathcal {M}}_{i}\), with the posterior being \(\mathcal {P}_{i} = {\mathcal {L}}_{i} \pi _{i}/{\mathcal {E}}_{i}\).

Table 1 Results from the statistical analysis of NGC1052-DF2. For each parameter we provide the median and the \(1\sigma \) constraints; unconstrained parameters are in italic font. The parameters are, from left to right: distance D; mass-to-light ratio \(\varUpsilon \); systemic velocity \(v_{sys}\); anisotropy function parameters, depending on the model assumed, constant \((\beta _{c})\) or radial from [91] \((\beta _{0},\beta _{infty},r_{a})\); gNFW concentration \(c_{200}\), mass \(M_{200}\), and inner log-slope \(\gamma \); DHOST characteristic scaling \(\varXi _1\); logarithm of the Bayes factor \(\log {\mathcal {B}}_{i}^{j}\); and the suspiciousness \(\log {\mathcal {S}}_{i}^{j}\)
Fig. 1
figure 1

Velocity offset profiles of NGC1052-DF2 with GR and no DM component, but only the stellar contribution. Black dots and bars are observational data, \(v_{i}-v_{sys}\), with uncertainties, \(\sigma _{v_{i}}\). Colored dashed lines and shaded regions are respectively the median and the \(1\sigma \) confidence region of the \(\sigma _{los}\) profile derived from Eq. (13). Left panel: constant velocity anisotropy profile. Right panel: radial velocity profile

From it, we determine the Bayes Factor \({\mathcal {B}}^{i}_{j} \equiv {\mathcal {E}}_{i} / {\mathcal {E}}_{j}\), where our reference model \({\mathcal {M}}_j\) is the GR scenario with only stellar component. We interpret the Bayes Factor using the Jeffrey’s scale [107]: if \(\ln {\mathcal {B}}^{i}_{j} < 1\) the evidence in favour of the model \({\mathcal {M}}_i\) is not significant respect to the one of \({\mathcal {M}}_j\); if \(1< \ln {\mathcal {B}}^{i}_{j} < 2.5\) the evidence of \({\mathcal {M}}_i\) is mild; if \(2.5< \ln {\mathcal {B}}^{i}_{j} < 5\) the evidence is strong; when \(\ln {\mathcal {B}}^{i}_{j} > 5\) the evidence is decisive.

However, since the Bayes factor can be prior-dependent [108], in order to perform a proper comparison between models, prior-independent quantities should be used. In [109,110,111] a new statistical quantity called suspiciousness \({\mathcal {S}}^{i}_{j}\) is introduced and defined as

$$\begin{aligned} \log {\mathcal {S}}^{i}_{j} = \log {\mathcal {B}}^{i}_{j} + {\mathcal {D}}_{KL,i} - {\mathcal {D}}_{KL,j}\, \end{aligned}$$
(34)

to quantify the mismatch between the model \({\mathcal {M}}_i\) and \({\mathcal {M}}_j\) in a prior-independent way. The suspiciousness may be considered as the value of the Bayes ratio which corresponds to the narrowest possible priors that do not significantly alter the shape of the posteriors.

In Eq. (34), \({\mathcal {D}}_{KL,i}\) is the Kullback–Leibler (KL) divergence [112] which quantifies the information gain between prior and posterior,

$$\begin{aligned} {\mathcal {D}}_{KL,i} = \int d{\varvec{\theta }}\, \frac{{\mathcal {L}}_{i}({\varvec{\theta }})}{{\mathcal {E}}_{i}} \log \frac{{\mathcal {L}}_{i}({\varvec{\theta }})}{{\mathcal {E}}_{i}}. \end{aligned}$$
(35)

From the above definition, we can see that the KL divergence \({\mathcal {D}}_{KL,i}\) is prior-dependent too. But in [110] it is shown how the Bayes ratio \({\mathcal {B}}^{i}_{j}\) and the difference between the KL divergences for \({\mathcal {M}}_i\) and \({\mathcal {M}}_j\) transform similarly by varying the prior volume, and therefore how in the suspiciousness \({\mathcal {S}}^{i}_{j}\) the prior dependency is removed.

The interpretation of the suspiciousness for model selection, in analogy to the Jeffrey scale, is described in Fig. 4 of [110]: a consistency between two models \({\mathcal {M}}_i\) and \({\mathcal {M}}_j\) is achieved for positive values of \(\log {\mathcal {S}}^{i}_{j}\), whilst a negative value of \(\log {\mathcal {S}}^{i}_{j}\) shows a tension between the models. More specifically, “an overly negative value of \(\log S^{i}_{j}\) indicates discordance, and an overly positive value suspicious concordance” [109, 110].

5 Results

All the scenarios which we have considered, depending on the mass components involved and on the priors applied, are shown in Table 1.

The first step has been to analyze the GR scenario, which is our reference model against which we can compare and assess the statistical validity of the DHOST model described by Eq. (1). More specifically, our reference scenario is the GR case with only stellar contribution and a constant anisotropy profile. Note that in [58] there is always a DM component, so the scenario with only stars is described here for the first time. In the left panel of Fig. 1 we can easily see, even just by visual inspection, how we can fit the data with only the stars, with a spatially averaged total dispersion \(\sigma \sim 10.53\) km s\(^{-1}\). From a statistical point of view, i.e. from the Bayes Factors and Suspiciousness, this scenario corresponds to the best match to observations. From Table 1 we see that no deviation from the applied priors is found, and the constant anisotropy profile shows a preference for highly tangential modes.

Fig. 2
figure 2

Velocity offset profiles of NGC1052-DF2 with GR and a gNFW DM component. Black dots and bars are observational data, \(v_{i}-v_{sys}\), with uncertainties, \(\sigma _{v_{i}}\). Colored dashed line and shaded regions are respectively the median and the \(1\sigma \) confidence region of the \(\sigma _{los}\) profile derived from Eq. (13). Left panels: constant velocity anisotropy profile. Right panels: Radial velocity profile. Top panels: With SHMR prior. Bottom panels: Without SHMR prior

In the right panel of Fig. 1 we have the same scenario, but we allow for a radial variation of the anisotropy function. From Table 1 we can see how there is no statistically significant difference with respect to the previous case. We detect a rise in the velocity offset at small scales, while at large scales we have the same descending profile, although the spatially average dispersion velocity is still consistent with the data, being \(\sigma \sim 11.10\) km s\(^{-1}\). The \(\beta \) parameters are quite well constrained, still pointing to a highly tangential anisotropy profile, but we also note how the radius parameter \(r_a\) is basically unconstrained. Moreover, we see that from the Bayesian point of view, this scenario is disfavoured with respect to our chosen reference one: not in a significant way for the Jeffrey’s scale, but already in a “statistical tension” regime, although very mild, when looking at the suspiciousness.

When we include a gNFW DM component in the GR scenario, things change drastically and are strongly related to the presence of the SHMR prior. All cases are shown in Fig. 2. The first main differences among the SHMR and the no-SHMR case are a slight shift toward a smaller distance, a smaller mass-to-light ratio and a lower systemic velocity in the former case, although they are still statistically consistent with each other. Moreover, the anisotropy parameters are less negative, but still fully tangential. In the SHMR case, the spatially averaged dispersion is \(\sigma \sim 17\) km s\(^{-1}\), thus in tension with most of the data; while in the no-SHMR case we have \(\sigma \sim 11\) km s\(^{-1}\), much more in agreement with observations.

When looking in more details to the DM parameters, we see the most net differences. The SHMR case has perfect gaussian constraints on both \(c_{200}\) and \(M_{200}\), while for \(\gamma \) we can only set an upper limit, being it basically consistent with zero. We must point out that the median value we get for \(c_{200}\sim 8\), does not exactly correspond to the median value we would expect from the \(c-M\) relation from [103] using our \(M_{200}\) MCMC outputs, which would be \(c_{200}\sim 11\). Although, given the uncertainties, they are still statistically consistent with each other. Nevertheless, the SHMR case is both strongly disfavoured and in strong tension with the reference scenario.

Things are different when we relax the SHMR assumption, and we do not impose such a prior. All parameters are perfectly consistent with the reference case, but now the DM has quite different properties: a very high concentration, \(c_{200}\sim 24\); a very low mass content, \(\log M_{200}<7\) \(M_{\odot }\); and \(\gamma \) totally unconstrained, with a uniform distribution all over the allowed range. Note that the limit on \(M_{200}\) is only an upper one.

Fig. 3
figure 3

Velocity offset profiles of NGC1052-DF2 with DHOST. Black dots and bars are observational data, \(v_{i}-v_{sys}\), with uncertainties, \(\sigma _{v_{i}}\). Colored dashed line and shaded regions are respectively the median and the \(1\sigma \) confidence region of the \(\sigma _{los}\) profile derived from Eq. (15). Top panels: DHOST as DE component only. Bottom panels: DHOST replacing both DE and DM

This is the main clue on which is based the claim of NGC1052-DF2 being a “lacking dark matter” galaxy and is described in [58]. We also show in our Table 1 that such scenario is only in mild tension with the reference star-only case. Which is quite the same as to say that the absence of DM at such scales in NGC1052-DF2 is a more than highly statistically valid hypothesis. We would like to point out also that we have considered the possibility that the gNFW might somehow fail to describe properly the DM halo of the galaxy (if any). We have thus relied also on a different DM model, the Einasto profile [113],

$$\begin{aligned} \rho _{Ein}(r) = \rho _s\exp \left\{ -\frac{2}{\gamma }\left[ \left( \frac{r}{r_s}\right) ^\gamma - 1\right] \right\} , \end{aligned}$$
(36)

which is another three-parameters model which successfully applies to galactic scales, but absolutely no qualitative difference has come out.

Fig. 4
figure 4

Comparison of ETF parameter constraints from our DHOST analysis with the results from [18, 20, 114]. Left panel: Results under the assumption that DHOST mimicks dark energy. Dashed black lines are \(1\sigma \) constraints from this work for the case with SHMR prior; solid black lines are \(1\sigma \) constraints from this work for the case without the SHMR prior. Right panel: Results when DHOST is assumed to play the role of both dark energy and dark matter. Solid black lines are \(1\sigma \) constraints from this work. In all cases we assume a constant stellar anisotropy profile. In both panels blue regions are derived from stellar physics considerations [20]; the red region is derived from \(2\sigma \) limits on \(\gamma _0\) from the Hulse–Taylor pulsar; helioseismology \(2\sigma \) constraints [114] are shown as green regions. Single constraints from CLASH clusters as obtained by [18] are shown as grey points/crosses

We now turn our attention to the DHOST model, as our main goal is to test, and possibly show, its viability in describing the kinematical data of NGC1052-DF2. We first consider the scenario in which the DHOST theory plays the role of DE, whose effects might be felt at galactic scale by the breaking of the screening mechanism. In this case, we still need to include in the matter budget a DM component, which is always parametrized by a gNFW model.

As it can be seen from Table 1 and from the top panels of Fig. 3, the presence of the DHOST model does not really change the results we get in the GR case, which are actually statistically equivalent. Indeed, the value of the DHOST parameter \(\varXi _1\) is consistent with zero: as it might be expected, DE does not play a relevant role on galactic scales. On the other hand, we see that the addition of the DHOST model slightly raises both the Bayes factor and the suspiciousness with respect to the case GR+DM: even if they remain mildly disfavoured, the tension is somehow alleviated, and this effect is most noticeable when the SHMR prior is applied.

The same conclusions can be driven in the case in which the DHOST plays both the role of DE at cosmological scales and entirely mimic DM at galactic ones. Results are shown both in Table 1 and in the bottom panels of Fig. 3. The most interesting thing to note is that, although the parameters are more or less statistically equivalent to the GR star-only case, and although the characteristic DHOST parameter has a non-zero median value but is consistent with the GR limit at \(1\sigma \) confidence level, we have a raise in both the Bayes factor and in the suspiciousness. In the case of constant anisotropy the Bayes factor is even slightly positive, although by a negligible amount, for which we cannot really conclude that it should be preferred with respect to the reference GR-based case. The suspiciousness also becomes positive (only case among all the ones we have considered), meaning that it is fully consistent and not in tension with our reference model.

That is the main conclusion and goal of our work: we have shown that even in a galaxy with a very low content of DM, or even lacking DM at all, DHOST theories cannot be discarded, but can be as much successful as GR in explaining observational data. Thus, NGC1052-DF2 has not dealt any deathblow to DHOST theories, which can be still be investigated as reliable ETGs candidates.

One further interesting and important point to address, is that in the case in which the DHOST plays also the role of DM, the chains, given only a limited number of physically reasonable priors, automatically set a sharp upper limit on the possible values of the characteristic DHOST parameters, namely \(\varXi _{1}\lesssim 0.5\) at \(2\sigma \) confidence level.

Finally, in Fig. 4 we compare our constraints on \(\varXi _1\), as function of the ETG parameters \(\alpha _H\) and \(\beta _1\), with others which are in literature. In blue, we have the constraints obtained from stellar physics arguments as conditions for dynamical equilibrium and for a minimal mass of red dwarf stars; in red, the \(2\sigma \) limits on \(\gamma _0\) from the Hulse–Taylor pulsar [20]; in green, the constraints provided by helioseismology arguments [114]; grey points with error bars represent the results we got in [18] from the analysis of the CLASH galaxy clusters. The new constraints on \(\alpha _H\) and \(\beta _1\) which can be derived from our estimations for \(\varXi _1\) in this work are represented by dashed black lines for the case with the SHMR prior, and solid black ones for the case without the SHMR prior, both with constant anisotropy profile. We can conclude that, independently of the considered scenario (i.e. DHOST as DE only or as DM too) our constraints are consistent with GR limits and with literature. In particular with stellar scales limits (blue and green regions), and “contain” the limits from pulsars (red line), which are known to be the most stringent. There is instead a larger tension with estimations from clusters of galaxies (grey points).

6 Conclusions

In this work we proceeded with our personal analysis of the DHOST model introduced in [19, 23, 69]. This model has one interesting feature: a breaking of the corresponding screening mechanism which might help to unify DE and DM under one single theoretical scenario. In [18] we started our journey at clusters of galaxies’ scales; here we tried to shed light on the dynamics of the Ultra-diffuse galaxy NGC1052-DF2 which has been claimed to be a “lacking DM galaxy” [30] and, as such, might be a potential big problem for ETGs.

We tested the DHOST model described by Eq. (1) in two different scenarios: one more conservative, in which the selected ETG only plays the role of DE at cosmological scales, but could alleviate the weight of DM by its broken screening mechanism; and one more ambitious, in which the DHOST model substitute entirely DM, thus playing the role of an “effective” mass. In the latter case we thus assume that the mass of NGC1052-DF2 is composed only by the baryonic (stellar) component.

We infer the mass of NGC1052-DF2 using the Jeans equation, Eq. (8), and we model the galaxy’s total mass as the sum of DM (when DM is assumed) and a stellar contribution. We describe DM using a generalization of the classical Navarro–Frenk–White profile. In addition, we consider two different models for the anisotropy parameter \(\beta \): a case in which it is constant, and one with a radial profile described by Eq. (25).

In agreement with results from [58], we find that when GR is assumed, the best match with the data is obtained when no DM is included at all. While in [58] the authors always include a DM component, and conclude that it should be present in a very low amount, in this work we also explicitly consider the case with no DM, thus having a purely baryonic galaxy. We can infer that, at least at the scales tested by the observations, the hypothesis of a total DM absence might be considered a totally satisfying option, from the statistical point of view. Indeed, the inclusion of DM only makes worse all the Bayesian indexes we have considered (Bayes factor and suspiciousness). Such results are also quite independent on the priors which can be adopted.

When the DHOST model is assumed to act only as DE, we have a substantial equivalence between this scenario and the corresponding GR cases, which is somehow expected because if we have a DM component, the large scale effects of DE might be expected to be negligible at galactic scales. But we need to notice that all the Bayesian indicators are improved with respect to the GR+DM cases. Even so, the reference case of GR with only stars is still the most favoured, statistically speaking.

Finally, we assume the DHOST model as fully mimicking DM: when considering a constant anisotropy profile, we even get both a positive Bayes Factor and a positive suspiciousness. Although they are only slightly greater than zero (but different from it at least at \(1\sigma \)), they clearly point to the fact that NGC1052-DF2 can be quite satisfactory described by our DHOST model, as much successful as by GR.

Of course, any further conclusion cannot be definitely driven here, because the sample of analyzed objects is too small. But we are planning to extend it (and we have already started to work on that), so to include more UDGs with resolved kinematics, i.e. with data accurate enough to perform a kinematical analysis, such as those described in [115,116,117,118]. Even more interestingly, UDGs are the perfect test arena for ETGs because they exhibit a wide range of behaviours, such that in the same family we can enlist also objects which seem to be highly DM dominated [119, 120]. Analysis of these case are left to forthcoming papers.