1 Introduction

Precise measurements of the Hubble Constant (\(H_0 = 100h_0\) \(\mathrm {{km \, s^{-1} \, Mpc^{-1}}}\)) are of great interest not only to improve our understanding of the current cosmic evolution but also to provide new insights into some fundamental questions of the physics of the early Universe (see e.g., [1] for a broad discussion). Currently, there is a \(\sim 4.4\sigma \) tension between measurements of \(h_0\) performed in the local universe using type Ia supernovae (SN) observations calibrated with Cepheid distances to SN host galaxies, \(H_0 = 74.03 \pm 1.42 \, \mathrm {{km \, s^{-1} \, Mpc^{-1}}}\) (hereafter R19 [2]; see also [3, 4]), and the estimates of \(H_0\) obtained from the Cosmic Microwave Background (CMB) data in the context of the standard \(\Lambda \)CDM model, \(H_0 = 67.36 \pm 0.54 \, \mathrm {{km \, s^{-1} \, Mpc^{-1}}}\) (hereafter P18 [5]; see also [6]). Many previous works attempted to address this problem using different approaches, e.g., in light of the cosmic variance due to nearby inhomogeneities [7,8,9,10,11,12,13,14,15,16,17,18,19], re-calibrating the distance ladder [20,21,22,23,24,25,26,27,28,29], or looking at extensions to the standard model of Cosmology [30,31,32,33,34,35,36,37,38,39,40,41,42,43]. Meta-studies of the \(H_0\) measurements and tension also confirmed a \(H_0\) tension above \(3\sigma \) [44]. New Hubble Constant measurements from strongly lensed quasars time delays [45], and using the tip of the red giant branch [46, 47], could not solve this issue [48].

Our goal in this paper is to quantify how the incompleteness of current SN data can affect the \(H_0\) variance, i.e., a shot noise variance estimator. Rather than reassessing these measurements, we focus on quantifying how does the tension between the \(H_0\) measurements change due to it. To do this, we map the anisotropy of the Hubble Constant across the sky using low-z SN (\(0.023< z < 0.150\)) from simulations based on the latest compilation available, namely, the Pantheon data [49]. We produce sets of Monte Carlo (MC) realizations assuming different sky coverage configurations which coincide with isotropic and the real data (non-uniform) sky distributions. This allows to assess how the SN sampling due to the uneven redshift distribution, in addition to their distance uncertainties and incomplete celestial distribution of data points, impacts the \(H_0\) measurements. We also discuss how much does the \(H_0\) cosmic variance decrease when larger, more homogeneous SN data sets are considered. If the cosmic variance becomes much lower than the current values, we can rule it out as a possible explanation for this result, thereby strengthening the hypothesis of physics beyond the concordance model.

2 Data analysis

In our analysis, we use the Pantheon SN Ia compilation, which consists of the currently largest and most complete SN data-set, i.e., 1049 objects lying in the interval \(0.01< z < 2.30\) compiled from the PanSTARRS1 Medium Deep Survey, SDSS, SNLS, in addition to many low-z and HST data points. As we will focus on a model-independent analysis, as explained further on with more details, we will only select objects in the range \(0.023< z < 0.150\), hence reducing our sample to 237 data points. We exclude the SNe at \(z<0.023\) to reduce the cosmic variance impact in very low redshift ranges (see [36]). We do so by assuming that the FLRW metric holds true, thence one can expand the scale factor around the present time, and then measure distances regardless of the Universe dynamics. This is the well-known cosmographic approach largely discussed e.g. in [52, 53]. The luminosity distance reads

$$\begin{aligned} D_\mathrm{L}(y) = 3000h^{-1}_0 \left[ y + \frac{(3 - q_0)}{2}{y^2} + O(y^3)\right] \;, \end{aligned}$$
(1)

where \(y \equiv z/(1+z)\) [54] is the redshift observed in the comoving rest frame with respect to the expansion of the Universe, \(h_0\) and \(q_0\) are the dimensionless Hubble constant, defined as \(h_0 \equiv H_0/100\), and decelerating parameter at present time, respectively, for \(D_\mathrm{L}(y)\) given in Mpc. From \(D_\mathrm{L}(y)\), we can obtain the distance modulus of the SN such as

$$\begin{aligned} \mu (y) = 5\log _{10}\left( {D_\mathrm{L}(y)/\mathrm {Mpc}}\right) + 25 \;. \end{aligned}$$
(2)

As shown in Eq. (1), the luminosity distance depends only on \(h_0\) and \(q_0\) up to the second order in redshift. Therefore, we restrict our analysis up to that order. As shown in [50], this truncation does not bias \(\mu \) in the interval \(z \lesssim 0.20\).

Fig. 1
figure 1

Left panel: the sky distribution of SN at \(0.023< z < 0.150\) following the MC-iso1 prescription. Central panel: same as the left panel, but for the MC-iso2 instead. Right panel: a realization assuming the original SN celestial distribution, corresponding to the MC-pantheon case

We probe the \(h_0\) cosmic variance by mapping the directional dependence of \(h_0\) in the interval \(0.023< z < 0.150\), accordingly to the analysis of [11]. We define hemispheres whose symmetry axes are given by HEALPix [55] pixel centers at \(N_\mathrm{side}=16\) grid resolution, and then we obtain \(h_0\) best-fits for all SN enclosed in 3072 distinct hemispheres. To do so, we minimize the following quantity

$$\begin{aligned} \chi ^2 = \sum _\mathrm{i} \left[ ( \mu ^\mathrm{th}(\mathbf {p},y_\mathrm{i})-\mu ^\mathrm{obs}_\mathrm{i} ) \over \sigma _\mathrm{i} \right] ^2 \,; \end{aligned}$$
(3)

where \(\mathbf {p}\) represents the set of parameters \(\{h_0,q_0\}\). The latter is fixed at \(q_0 = -\,0.574\), thus fully consistent with the Pantheon best fit for a flat \(\Lambda \)CDM model with \(\Omega _\mathrm{m}=0.274\). A similar procedure was adopted in [56].Footnote 1 In addition, \(\mathrm{{i}}\) represents the \(\mathrm{{i}}\)-th data point belonging to each hemisphere, \(\mu ^\mathrm{obs}_\mathrm{i}\) and \(\sigma ^2_\mathrm{i}\) correspond to its distance modulus and respective uncertainty, and \(\mu ^\mathrm{th}(\mathbf {p},y_\mathrm{i})\) represents the theoretically expected distance modulus calculated according to Eq. (2). We note that only the statistical errors for \(\mu ^\mathrm{obs}_\mathrm{i}\) are used in our analyses, as the full covariance matrix would significantly degrade the constraints at such low-z range - specially because we are selecting hemispherical sub-samples of the SN data-set. Because our results are weakly sensitive to the cosmological model, we expect it to not be strongly affected by the SN light-curve nuisance parameters, whose values were also fixed at the flat \(\Lambda \)CDM model best-fits.

The cosmic variance of the Hubble Constant is estimated from three sets of 1000 Monte Carlo (MC) realizations according to the following prescriptions:

  • MC-iso1: The SN original positions in the sky are changed according to an isotropic distribution, but with the galactic plane (\(|\mathrm {b}|<10^{\circ }\)), and the highest declinations (\(\mathrm {DEC}>30^{\circ }\)) excised;

  • MC-iso2: The SN original positions in the sky are changed according to an isotropic distribution, but with the galactic plane (\(|\mathrm {b}|<10^{\circ }\)) excised only;

  • MC-pantheon: The SN original positions are maintained as the original Pantheon sample.

In all these realizations (see Fig. 1), the original distance moduli are changed according to a value drawn from a normal distribution, i.e.,

$$\begin{aligned} \mu ^\mathrm{MC}_\mathrm{th}(y_\mathrm{i}) = \mathcal {N}(\mu _\mathrm{fid}(\mathbf {p_\mathrm{fid}},y_\mathrm{i}),\sigma _i) \,, \end{aligned}$$
(4)

which corresponds to a distribution centered at \(\mu _\mathrm{fid}\), that is, the distance modulus fixed at the fiducial Cosmology in \(\mathbf {p_\mathrm{fid}} = \{h_{0;\mathrm {fid}},q_{0;\mathrm {fid}}\} = \{0.6736,-\,0.574\)}, and whose standard deviation is given by the original distance modulus uncertainty, \(\sigma \), for each data point i. This way, we circumvent the problem of \(h_0\) being implicitly assumed in the SN light-curve calibration procedure.

We quantify the cosmic variance of the MCs by means of

$$\begin{aligned} \Delta h \equiv h^\mathrm{max}_0 - h^\mathrm{min}_0 \;, \end{aligned}$$
(5)

in a similar fashion to [11], where \(h^\mathrm{max}_0\) and \(h^\mathrm{min}_0\) correspond to the maximum and minimum \(h_0\) values fitted across the whole celestial sphere in each realization following Eq. (3), where \(\mu ^\mathrm{MC}_\mathrm{th}(y_\mathrm{i}) \rightarrow \mu ^\mathrm{obs}_\mathrm{i}\). For the sake of comparison, the tension can be quantified by

$$\begin{aligned} T_{h_0} \equiv \frac{h^\mathrm{R19}_0 - h^\mathrm{P18}_0}{\sqrt{\sigma _{R19}^2 + \sigma _{P18}^2}} \simeq 4.4\sigma \;, \end{aligned}$$
(6)

where \(\Delta h^\mathrm{tension} \equiv h^\mathrm{R19}_0 - h^\mathrm{P18}_0 = 0.0667\) and \(\sigma _{\mathrm {R19}}^2 + \sigma _{\mathrm {P18}}^2 = 2.3 \times 10^{-4}\). We redefine the tension between the R19 and P18 measurements due to the cosmic variance similarly to [36], that is,

$$\begin{aligned} T_{h_0} = \frac{h^\mathrm{R19}_0 - h^\mathrm{P18}_0}{\sqrt{\sigma _{\mathrm {R19}}^2 + \sigma _{\mathrm {P18}}^2 + \Delta h_\mathrm{med}^2}} \,, \end{aligned}$$
(7)

with \(\Delta h_\mathrm{med}^2\) representing the median \(\Delta h\) value for each MC case. Since we have a different \(\Delta h\) value for each MC realization, we define the maximum and minimum value for \(T_{h_0}\) by means of the upper (\(\Delta h_{+}\)) and lower (\(\Delta h_{-}\)) bounds of \(\Delta h\) at 99.7% CL such as

$$\begin{aligned} T^{+}_{h_0}\equiv & {} \frac{h^\mathrm{R19}_0 - h^\mathrm{P18}_0}{\sqrt{\sigma _{\mathrm {R19}}^2 + \sigma _{\mathrm {P18}}^2 + \Delta h_{+}^2}} \;, \end{aligned}$$
(8a)
$$\begin{aligned} T^{-}_{h_0}\equiv & {} \frac{h^\mathrm{R19}_0 - h^\mathrm{P18}_0}{\sqrt{\sigma _{\mathrm {R19}}^2 + \sigma _{\mathrm {P18}}^2 + \Delta h_{-}^2}} \,. \end{aligned}$$
(8b)
Fig. 2
figure 2

The \(\Delta h\) of all MC realizations compared to the value obtained from the real data (pink vertical line), as well as the \(\Delta h^\mathrm{tension}\) at 1 and \(2\sigma \) C.L. displayed in light gray and blue shades, respectively

Table 1 Respectively: The MC prescription, the median \(\Delta h\), the 99.7% CL limits for \(\Delta h\) obtained from each case, and the \(h_0\) tension as defined in Eqs. (7)–(8a)

Finally, we repeat these analyses for simulated (isotropic) data-sets 4 and 10 times larger than the current one assuming the same redshift distribution, and the same relative distance uncertainties as the real data. Hence, we estimate how well future data can constrain the \(h_0\) cosmic variance and discuss its cosmological implications.

Fig. 3
figure 3

Left panel: the \(\Delta h\) of 200 MC-iso1 realizations assuming the original number of data points, as well as for those assuming samples 4 and 10 times larger. Right panel: same as before, but for MC-iso2 instead. The light gray and blue shades are the same as Fig. 2, but \(\Delta h\) was clipped at 0.05 to ease visualization

Table 2 Respectively: Same as Table 1, but for 200 MCs assuming 4 and 10 times more data points than the current sample instead

3 Results

In Fig. 2, we show the results obtained from 1000 MCs with 237 data points for each sky coverage configuration. The blue histogram gives the \(\Delta h\) distribution for the MC-iso1 case, whereas the red and black ones provide, respectively, the MC-iso2 and MC-pantheon results. The light blue and gray shades correspond to the \(\Delta h^\mathrm{tension}\) quantity for 2 and \(3\sigma \) CL, and the pink vertical line is the \(\Delta h\) obtained from the actual Pantheon data as reported in [51]. We note that the \(\Delta h\) obtained from all the MCs overlap with this \(3\sigma \) shade, but the MC-pantheon gives the largest values among all of them, illustrating that the non-uniform sky coverage increases the \(h_0\) cosmic variance

In Table 1, we provide the median, upper and lower values (at 99.7% CL) for \(\Delta h\), in addition to \(T_{h_0}\) and their respective upper and lower limits. We readily note that the MC-pantheon realizations perform the best, as \(T_{h_0}\) decreases from \(\simeq 4.4\sigma \) to \(2.7\sigma \) for the median \(\Delta h\), and \(1.9\sigma \, (3.4\sigma )\) for \(T^{+}_{h_0}\) (\(T^{-}_{h_0}\)). Both isotropic sets of MCs can alleviate the tension to \(3.4\sigma \) for the median case, and \(2.5\sigma \) for the most optimistic (\(T^{+}_{h_0}\)) one. We also note that the median \(\Delta h\) from both isotropic MCs are consistent with the estimates from [7, 36], i.e., \(\Delta h \simeq 0.009\), but in our case we can naturally incorporate other observational features of incomplete celestial coverage and SN distance measurement uncertainties, which contributes to reduce the tension even further. We also tested whether the individual uncertainties of the best-fitted \(h_0\) values affect our analyses. We found that their average \(1\sigma \) uncertainties for isotropic MCs lead to an additional (average) spread of 0.004 in \(\Delta h\) for the isotropic MCs, and 0.005 for the MC-pantheon case. Since these values are smaller than the \(\Delta h\) values presented in Table 1, we stress that they do not impact our analysis.

Moreover, we estimate the \(h_0\) variance of future SN compilations. We do this by producing 200 MC-iso1 and MC-iso2 realizations according to the same redshift distribution of SN as the Pantheon sample for two cases: one assuming data-sets 4 times larger than the Pantheon sample (nearly 950 objects), and another assuming 10 times its size (nearly 2400 objects). If \(\Delta h\) reduces in these cases, this would imply that cosmic variance cannot alleviate the \(\Delta h^\mathrm{tension}\) - and hence increase the evidence for physics beyond \(\Lambda \)CDM as its explanation.

We present the results of these analyses in Fig. 3. The \(\Delta h\) values and \(T_{h_0}\) are provided in Table 2. We can clearly note that these \(\Delta h\) are much smaller compared to the previous sets of MCs, so that \(T_{h_0} > 3.2\sigma \) for the realizations with 950 data points, and \(T_{h_0} > 3.8\sigma \) for those with 2400 objects. Given the advent of forthcoming, nearly all-sky distance data-sets as expected from LSST [58] and WALLABY [59], along with future \(H_0\) measurements from standard sirens [60, 61], and new assessments of the cosmic distance ladder [48], we should be able to determine whether the \(H_0\) tension arises due to its cosmic variance from uneven sky coverage and limited observational data, or whether it actually indicates physics beyond the concordance model.

4 Conclusions

The tension between the R19 and P18 estimates of \(H_0\) are one of the greatest puzzles of the concordance model of Cosmology at present. Because the Hubble constant is fundamental to our understanding of the cosmic evolution, it is necessary to seek explanations for this tension. The main ones correspond to an additional error budget to the \(H_0\) measurement due to local inhomogeneities and sample incompleteness, and to extensions of \(\Lambda \)CDM model such as time-evolving dark energy, interacting dark energy or new contributions to the effective number of relativistic species \(N_\mathrm{eff}\), among others. None of them could fully account for the \(H_0\) tension, but only alleviate it to \(\simeq 3\sigma \).

We focused on the former as a possible explanation for such a tension. This time around, we estimate the \(H_0\) cosmic variance from a hemispherical comparison estimator, so we can quantify how does the incomplete data sampling affect the \(H_0\) tension. We produced realizations of the Pantheon SN in the interval \(0.023< z < 0.150\) uniformly redistributed across the sky (MC-iso1 and MC-iso2), besides those assuming the actual distribution of objects (MC-pantheon). We also replaced the actual SN distance moduli to values drawn from a normal distribution assuming the \(\Lambda \)CDM model, but the original distance errors. Hence, we quantified the \(H_0\) variance due to the sample incompleteness within the concordance model framework.

We found that the \(H_0\) tension could be alleviated from nearly \(4.4\sigma \) to \(2.5\sigma \), at best, for the isotropic simulations, but the MCs assuming the actual SN distribution gave a median value of \(\simeq 2.7\sigma \) for a lower (upper) bound of \(1.9\sigma \, (3.4\sigma )\). These results significantly improves previous analyses that pointed out \(T_{h_0} \simeq 3\sigma \) due to the \(h_0\) cosmic variance, as we could incorporate further SN sample limitations, such as uneven sky coverage and distance measurement uncertainties. Furthermore, we verified that the \(\Delta h\) will be significantly smaller when larger samples becomes available, and so we can rule out cosmic variance as an explanation for this tension.

Our conclusion is that the cosmic variance can alleviate the \(H_0\) tension - specially if the non-uniform celestial coverage is taken into account. It is worth mentioning that there might also be unidentified systematics in the observations that lead to such tension, as discussed in [57]. As mentioned earlier, the forthcoming, nearly all-sky distance data make the prospects very good for solving this issue in the near future.