1 Introduction

Constraining the local primordial non-Gaussianity (PNG) parameter \(f_{\textrm{NL}}\) presents an important challenge for next-generation large-scale structure surveys. Local-type PNG affects the power spectrum of dark matter tracers by inducing a scale-dependent correction to the tracer bias [1, 2]. This correction is strongly suppressed on small to medium scales, but on very large scales, it is \(\propto f_{\textrm{NL}}(H_0^{2}/k^{2})\). This means that ultra-large survey volumes are required for high-precision constraints.

The Planck survey of the cosmic microwave background (CMB) gives the current state-of-the-art constraint \(\sigma (f_{\textrm{NL}}) =5.1\) from the bispectrum [3]. Future CMB surveys will lead to an improvement on the Planck precision, but not by much and not sufficient to access \(\sigma (f_{\textrm{NL}}) \lesssim 1\). This level of precision would enable us to rule out many single-field and multi-field inflationary scenarios [4,5,6]. The hope is that this goal can be achieved by using the extra modes in 3-dimensional surveys of the large-scale structure (see e.g. [7,8,9,10,11,12]). However, extracting \(f_{\textrm{NL}}\) from these surveys faces formidable difficulties:

  • Observational systematics on very large scales [13,14,15,16,17,18,19]. For our simplified Fisher analysis, we neglect these systematics, except for the foreground contamination of 21 cm intensity mapping (see below).

  • Astrophysical complications entailed in the modelling of tracer clustering, in particular, the uncertainties in modelling the scale-dependent bias [20, 21]. We use a simplified model since a full treatment requires simulations, beyond the scope of this paper.

  • A theoretical systematic arising from the neglect of lensing magnification and other relativistic lightcone effects on the power spectrum, which can bias the best-fit \(f_{\textrm{NL}}\) [22,23,24,25,26,27,28,29,30,31,32,33,34,35]. Although the bias on best-fit can be large, the error \(\sigma (f_{\textrm{NL}})\) is not significantly affected and we therefore omit these lightcone effects.

  • Cosmic (or sample) variance that dominates ultra-large scale measurements. We deal with this problem via a multi-tracer approach – which also helps to mitigate uncorrelated systematics.

Recent constraints on \(f_{\textrm{NL}}\) have used the BOSS galaxy [36] and eBOSS quasar [37] samples, with the tightest constraint to date of \(\sigma (f_{\textrm{NL}}) = 21\) from eBOSS [38]. The much larger volumes of upcoming surveys should facilitate significant improvement on this precision. But if we rely on individual surveys using the power spectrum of single tracers, it turns out that \(\sigma (f_{\textrm{NL}}) \lesssim 1\) is not achievable – even if we neglect systematics and complexities in scale-dependent bias [11]. The fundamental problem is cosmic variance.

A way to evade cosmic variance is the multi-tracer technique [39,40,41,42,43,44,45,46,47,48,49]. Forecasts indicate that multi-tracing next-generation surveys can surpass the CMB precision, and in some cases can achieve \(\sigma (f_{\textrm{NL}}) \lesssim 1\) [48,49,50,51,52,53,54,55,56,57,58,59,60,61]. The performance of the multi-tracer improves when the difference in tracer properties, especially the Gaussian clustering biases, is significant. It also helps to suppress uncorrelated systematics.

In this paper, our aim is not to identify survey combinations that can achieve \(\sigma (f_{\textrm{NL}}) \lesssim 1\) – for this purpose, one typically requires the very high number densities of photometric surveys [48, 52]. Instead, our aim is to investigate the improvements over single-tracer constraints when using a new type of spectroscopic large-scale structure survey – neutral hydrogen (HI) intensity mapping of the 21 cm emission line – in combination with spectroscopic galaxy surveys. Such a pair of spectroscopic samples has very different clustering biases and systematics, which accentuates the advantages of a multi-tracer analysis.

We use a simple Fisher forecast on mock surveys, which are based on next-generation surveys that are starting to observe or close to starting. The two surveys we have in mind are: the Dark Energy Spectroscopic Instrument (DESI), with the Bright Galaxy Sample (BGS) and Emission Line Galaxy (ELG) samples [62, 63], and the Square Kilometre Array Observatory (SKAO), with intensity mapping surveys in Bands 1 and 2 [57, 64]. Galaxy number count surveys are well established, but a cosmological HI intensity mapping survey has not yet been implemented. However, pilot surveys on the SKAO precursor telescope MeerKAT have already been used to:

  1. (a)

    measure the cross-power between MeerKAT and WiggleZ galaxy surveys [65], and

  2. (b)

    start developing a pipeline for the planned SKAO surveys [66, 67].

We combine pairs of these surveys at low and high redshifts, using the Fisher single-tracer constraints in the non-overlap volume and the multi-tracer constraints in the overlap volume. The results show a reasonable improvement for the low-redshift pair, and a significant improvement for the high-redshift pair, compared to the standard single-tracer forecasts for \(\sigma (f_{\textrm{NL}})\). The combination of all low- and high-redshift surveys improve on Planck, with \(\sigma (f_{\textrm{NL}}) \sim 4\). This constraint is based on avoiding foreground contamination of HI intensity mapping on very large scales. Foreground contamination of HI intensity mapping has a strong effect on the precision: if we neglect this foreground contamination, we find that \(\sigma (f_{\textrm{NL}})\sim 1\).

2 Multi-tracer power spectra

At first order in perturbations, the density (or temperature) contrast of a tracer A is related to the matter density contrast \(\delta \) by

$$\begin{aligned} \Delta _A(z, \varvec{k}) = \Big [b_A(z)+f(z)\mu ^{2}\Big ]\delta (z, \varvec{k}), \end{aligned}$$
(1)

where \(b_A\) is the (Gaussian) clustering bias, \(\mu = \varvec{\hat{k}} \cdot \varvec{{n}}\) is the projection along the line-of-sight direction \(\varvec{n}\), and f is the linear growth rate. We assume \(b_A\) has a known redshift evolution \(\beta _A\), following [68], so that

$$\begin{aligned} b_{A}(z) = b_{A0}\,\beta _A(z), \end{aligned}$$
(2)

where the amplitude \(b_{A0}\) is a free parameter. We highlight the point that a multi-tracer approach reduces the impact of this assumption on \(\sigma (f_{\textrm{NL}})\) [35].

The Fourier power spectra at tree-level are given by

$$\begin{aligned} \big \langle \Delta _A(z,\varvec{k})\Delta _{B}(z,\varvec{k}') \big \rangle = (2\pi )^{3}P_{AB}(z,\varvec{k})\delta ^{\textrm{D}}(\varvec{k}+\varvec{k}'), \end{aligned}$$
(3)

where the linear matter power spectrum (computed using CLASS [69]) is

$$\begin{aligned} P_{AB}=\big (b_A+f\mu ^2 \big )\big (b_B+f\mu ^2 \big )P. \end{aligned}$$
(4)

In the presence of local PNG, the bias acquires a scale-dependent correction [1, 2, 21]:

$$\begin{aligned} {\hat{b}}_A(z,k)= & {} b_{A}(z) + b_{A\phi }(z)\frac{f_\textrm{NL}}{{\mathcal {M}}(z,k)}, \nonumber \\ {\mathcal {M}}(z,k)= & {} \frac{2}{3\Omega _{m0}H_{0}^{2}}\frac{D(z)}{g_\textrm{in}}T(k)\,k^{2}. \end{aligned}$$
(5)

Here T(k) is the matter transfer function (normalized to 1 on very large scales), D is the growth factor (normalized to 1 today) and \(g_\textrm{in}\) is the initial growth suppression function, defined deep in the matter era. For \(\Lambda \)CDM [70]

$$\begin{aligned} g_{\textrm{in}} =\frac{3}{5}\big (1+z\big )D(z)\bigg [1+\frac{2f(z)}{3\Omega _{m}(z)} \bigg ]. \end{aligned}$$
(6)

The non-Gaussian bias factor \(b_{A\phi }\) is halo-dependent and should be determined with the aid of simulations [20, 71, 72]. A simplified model reduces it to

Table 1 Sky area and redshift range of surveys
$$\begin{aligned} b_{A\phi }(z) = 2 \delta _{\textrm{c}}\big [b_{A}(z)-p_A\big ], \end{aligned}$$
(7)

where the critical collapse density is given by \(\delta _{\textrm{c}} =1.686\) and \(p_A\) are halo-dependent constants. In the simplest (universal) halo model, \(p_A=1\) for all tracers A. But this model is not consistent with simulations [20, 71, 72]. An improved (but still over-simplified) model allows the constant \(p_A\) to vary with tracer. For galaxies chosen by stellar mass, simulations indicate that the rough approximation [71]

$$\begin{aligned} p_{g} \approx 0.55, \end{aligned}$$
(8)

is an improvement, and we use this for the DESI-like samples. For HI intensity mapping, we use the approximation [72]

$$\begin{aligned} p_{H} \approx 1.25. \end{aligned}$$
(9)

The \(f_{\textrm{NL}}\) terms in the power spectra are of order \(H_0^2/k^2\) on ultra-large scales, \(k\lesssim k_\textrm{eq}\), where \(T\approx 1\). Local PNG therefore dominates the power on ultra-large scales – but these are also the scales where cosmic variance is largest. We deal with the cosmic variance via a multi-tracer analysis, which includes the information from all auto- and cross-power spectra of the two (or more) tracers (see Sect. 4).

3 Modelling the surveys

We consider two mock spectroscopic surveys:

\(A=g\): galaxy survey, similar to DESI surveys [62, 63];

\(A=H\): 21 cm HI intensity mapping (IM) survey, similar to SKAO surveys [57, 64, 73].

In both cases, we have a low- and a high-redshift survey. The sky and redshift coverage of the individual and overlapping surveys are given in Table 1.

Fig. 1
figure 1

Background comoving galaxy number densities (red, left-hand y-axis) and HI IM temperature (blue, right-hand y-axis)

The one-parameter model (2) for the Gaussian clustering bias of the galaxy surveys follows [29]

$$\begin{aligned} b_{g}(z) = \frac{b_{g0}}{D(z)} . \end{aligned}$$
(10)

Fiducial values are \(b_{g0}=1.34\) for the Bright Galaxy Sample (BGS) and \(b_{g0}=0.84\) for the Emission Line Galaxy (ELG) sample. For the HI IM surveys we use a fit based on [74, 75]:

$$\begin{aligned}{} & {} b_{H}(z)= {b_{H0}} \big (1 + 0.823\,z- 0.0546\,z^{2}\big )\nonumber \\{} & {} \quad \qquad \text{ with } \text{ fiducial } \text{ value }\ b_{H0}=0.842. \end{aligned}$$
(11)

The background comoving number density \({\bar{n}}_g\) for the BGS and ELG samples is modelled following [29], which leads to the smoothed curves shown in Fig. 1.

For HI IM, the background brightness temperature is modelled via the fit given in [66, 76]:

$$\begin{aligned} \bar{T}_{H}(z) = 0.0559 +0.2324\,z -0.0241\, z^{2}~{\textrm{mK}}, \end{aligned}$$
(12)

which is also shown in Fig. 1.

3.1 Noise

In galaxy surveys, the shot noise (assumed to be Poissonian) is

$$\begin{aligned} P^{\textrm{shot}}_{gg}(z) = \frac{1}{{\bar{n}}_{ g}(z)}. \end{aligned}$$
(13)

Then the total signal that we measure for the galaxy auto-power spectrum is

Fig. 2
figure 2

Shot noise for galaxy surveys (left) and thermal noise for IM surveys (right)

$$\begin{aligned} \tilde{P}_{gg}(z,k,\mu ) = P_{gg}(z,k,\mu ) + P^{\textrm{shot}}_{gg}(z), \end{aligned}$$
(14)

where \(P_{gg}\) is given by (4).

In IM surveys, there is shot noise, but also thermal (or instrumental) noise. The Poissonian shot noise (see [77] for non-Poissonian corrections) is derived in a halo-model framework and given by [74, 78],

$$\begin{aligned} P_{H\!H}^{\textrm{shot}}(z)=\frac{1}{{\bar{n}}_{H}(z)} = \frac{1}{\bar{\rho }_H^{\,2}(z)}\int {\textrm{d}} M\,{\mathcal {N}}_h(M,z)\,M_H^2(M,z), \end{aligned}$$
(15)

where \({\bar{n}}_{H}\) is the effective average comoving number density of HI, \(\bar{\rho }_H\) is the average comoving HI density, \({{\mathcal {N}}}_h\) is the halo mass function (average comoving halo number density per mass) and \(M_H\) is the average HI mass in a halo of mass M. However, on the linear scales that we consider, the shot noise is much smaller than the thermal noise and can be safely neglected [74, 78].

The thermal noise in HI IM depends on the sky temperature in the radio band, the survey specifications and the array configuration (single-dish or interferometer). For the single-dish mode of SKAO-like surveys, the thermal noise power spectrum is [79,80,81]:

$$\begin{aligned} P_{H\!H}^{\textrm{therm}}(z) = \frac{\Omega _{\textrm{sky}}}{2\nu _{21} t_{\textrm{tot}}}\,\frac{(1+z) r(z)^{2} }{{\mathcal {H}}(z) }\,\left[ \frac{T_{\textrm{sys}}(z)}{\bar{T}_{H}(z)}\right] ^2\, \frac{1}{N_{\textrm{d}}}, \end{aligned}$$
(16)

where \(\nu _{21}=1420\,{\textrm{MHz}}\) is the rest-frame frequency of the 21 cm emission, \(t_{\textrm{tot}}\) is the total observing time, and the number of dishes is \(N_{\textrm{d}}=197\) (with dish diameter \(D_{\textrm{d}}=15\) m). The system temperature is modelled as [82]:

$$\begin{aligned} T_{\textrm{sys}}(z)= & {} T_{\textrm{d}}(z)+T_\textrm{sky}(z) \nonumber \\= & {} T_{\textrm{d}}(z) + 2.7 + 25\bigg [\frac{400\,{\textrm{MHz}}}{\nu _{21}} (1+z)\bigg ]^{2.75}~{\textrm{K}}, \end{aligned}$$
(17)

where \(T_{\textrm{d}}\) is the dish receiver temperature (see [73]). The total signal is then

$$\begin{aligned} \tilde{P}_{H\!H}(z,k,\mu )= & {} P_{H\!H}(z,k,\mu ) + P^{\textrm{shot}}_{H\!H}(z) \nonumber \\{} & {} + P_{H\!H}^{\textrm{therm}}(z)\approx P_{H\!H}(z,k,\mu ) + P_{H\!H}^{\textrm{therm}}(z).\nonumber \\ \end{aligned}$$
(18)

The noise for all surveys is shown in Fig. 2.

In the case of the cross-power spectrum \(P_{gH}\), cross-shot noise arises if the halos that host galaxies and HI overlap. Following the arguments given in [73, 83], we assume that the cross-shot noise may be neglected. The total cross-power signal is then

$$\begin{aligned} {\tilde{P}}_{gH}=P_{gH}\quad \text{ with }\ P_{gH}^{\textrm{shot}}\approx 0. \end{aligned}$$
(19)

3.2 Intensity mapping beam and foregrounds

HI IM surveys like the SKAO surveys have poor angular resolution, which results in power loss on small transverse scales, i.e. for large \(k_{\perp }=(1-\mu ^2)^{1/2}k\). This effect is typically modelled by a Gaussian beam factor [79]:

$$\begin{aligned} {{\mathcal {D}}}_{\textrm{b}}(z,k,\mu )= & {} \exp \left[ -{(1-\mu ^2)k^2 r(z)^2\theta _{\textrm{b}}(z)^2\over 16\ln 2} \right] \quad \text{ with }\quad \nonumber \\ \theta _{\textrm{b}}(z)= & {} 1.22\,{\lambda _{21}(1+z)\over D_{\textrm{d}}}. \end{aligned}$$
(20)
Fig. 3
figure 3

The monopole power spectra at \(z=0.3\) (left) and \(z=1.0\) (right)

HI IM surveys are also contaminated by foregrounds that are much larger than the HI signal. Since these foregrounds are spectrally smooth, they can be separated from the non-smooth signal on small to medium scales. However, on very large radial scales, i.e. for small \(k_\Vert =\mu k\), the signal becomes smoother and therefore the separation fails. A comprehensive treatment includes simulations of foreground cleaning of the HI signal (e.g. [15, 18]). For a simplified Fisher forecast we can instead use a foreground avoidance approach, by excising the regions of Fourier space where the foregrounds are significant. This means removing large radial scales, which can be modelled via an exponential suppression factor [75, 84]:

$$\begin{aligned} {{\mathcal {D}}}_\textrm{fg}(k,\mu ) = {1-\exp \bigg [-\Big (\frac{\mu k}{k_\textrm{fg}} \Big )^2 \bigg ].} \end{aligned}$$
(21)

We choose a typically used value:

$$\begin{aligned} k_{\textrm{fg}} = 0.01\,{h} \,{\textrm{Mpc}}^{-1} . \end{aligned}$$
(22)

Figure 3 illustrates the consequences of foreground avoidance for the monopoles of the HI power spectra,

$$\begin{aligned} P_{AB,0}\,(z,k) = \frac{1}{2}\int _{-1}^{+1}{\textrm{d}}\mu \,P_{AB}\,(z,k,\mu ). \end{aligned}$$
(23)

Note the consequence of foreground avoidance for the monopole of the HI power spectra \(P_{AH,0}\), which are suppressed on large scales. It is clear that large-scale (small k) power in \(P_{AH,0}\) is lost and that the effect of local PNG on these large scales is suppressed by foreground avoidance. Note that \(f_{\textrm{NL}}\) reduces the HI power on very large scales at \(z=0.3\), since \(b_H<1.25\), which leads to \(b_{H\phi }<0\) in (7). Also note that the beam effect on \(P_{AH,0}\), which tends to suppress small scale (large k) modes, is negligible at the low redshift, but becomes apparent at higher redshift.

In summary, the effects of the radio telescope beam and radio foreground avoidance lead to the following modifications of the power spectra \(P_{AH}\):

$$\begin{aligned} P_{H\!H}(z,{k}, \mu )\rightarrow & {} {{\mathcal {D}}}_{\textrm{fg}}(k,\mu )\,{{{\mathcal {D}}}_{\textrm{b}}(z,k,\mu )}^{2}\,P_{H\!H}(z,{k}, \mu ),\nonumber \\ \end{aligned}$$
(24)
$$\begin{aligned} P_{gH}(z,{k}, \mu )\rightarrow & {} {{\mathcal {D}}}_{\textrm{fg}}(k,\mu ) \,{{\mathcal {D}}}_{\textrm{b}}(z,k,\mu )\,P_{gH}(z,{k}, \mu ).\nonumber \\ \end{aligned}$$
(25)

We note that these equations are consistent with the recent paper [85] on foreground cleaning via a transfer function, which shows that the effect of foregrounds is the same in the HI auto-power as in the cross-correlation power spectra with galaxies. However, we should also point out that the modelling of foreground removal is an ongoing project which has not yet reached maturity.

4 Fisher forecast

For a multi-tracer combination of two dark matter tracers g and H, the data vector of power spectra is

$$\begin{aligned} \varvec{P} = \big ( P_{gg},\, P_{gH},\, P_{H\!H}\big ). \end{aligned}$$
(26)

We exclude the noise from the data vector, since the noise is independent of the cosmological and nuisance parameters. (The noise appears in the covariance, as given below.) The standard cosmological parameters are measured on medium to small scales and are effectively independent of the ultra-large scale parameter \(f_{\textrm{NL}}\). Fixing their values could bias the best-fit value of \(f_{\textrm{NL}}\) but is unlikely to have any significant impact on \(\sigma (f_{\textrm{NL}})\). We include in the Fisher analysis the cosmological parameters that directly affect the large-scale amplitude \((\sigma _{8,0})\) and shape \((n_s)\) of the power spectrum. We therefore consider the following cosmological and nuisance parameters:

$$\begin{aligned} \vartheta _{\alpha } = \big (\sigma _{8,0},\,n_{s},\,f_{\textrm{NL}};\,b_{g0},\,b_{H0}\big ). \end{aligned}$$
(27)

Here we assume that the degeneracies between \(b_A\) and \(\sigma _{8,0}\), and between \({\bar{T}}_H\) and \(\sigma _{8,0}\), have been broken by other surveys that focus on medium to small scales.

Fig. 4
figure 4

1\(\sigma \) contours, including foreground avoidance in HI intensity mapping and marginalising over bias parameters

The covariance for the multi-tracer power spectra is given by [86, 87]:

$$\begin{aligned} {\textrm{Cov}}(\varvec{P},\varvec{P})&= \frac{k_{\textrm{f}}^3}{2\pi k^{2} \Delta k}\,\frac{2}{\Delta \mu }\, \nonumber \\&\quad \times \!\! \begin{pmatrix} {\tilde{P}}_{gg}^2 &{} &{} {\tilde{P}}_{gg}{\tilde{P}}_{gH} &{} &{} {\tilde{P}}_{gH}^2 \\ &{} &{} &{} &{} \\ {\tilde{P}}_{gg}{\tilde{P}}_{gH} &{} &{} \frac{1}{2}\big [{\tilde{P}}_{gg}{\tilde{P}}_{H\!H}{+} {\tilde{P}}_{gH}^2 \big ] &{} &{} {\tilde{P}}_{H\!H}{\tilde{P}}_{gH} \\ &{} &{} &{} &{} \\ {\tilde{P}}_{gH}^2 &{} &{} {\tilde{P}}_{H\!H}{\tilde{P}}_{gH} &{} &{} {\tilde{P}}_{H\!H}^2 \end{pmatrix}\!. \end{aligned}$$
(28)

Here \(\Delta k\) and \(\Delta \mu \) are the bin-widths for k and \(\mu \) and \(k_{\textrm{f}}\) is the fundamental mode, corresponding to the longest wavelength, which is determined by the comoving survey volume of the redshift bin centred at z:

$$\begin{aligned} V(z) = \frac{\Omega _{\textrm{sky}}}{3} \bigg [r\Big (z+\frac{\Delta z}{2}\Big )^{3} - r\Big (z-\frac{\Delta z}{2}\Big )^{3} \bigg ]= \bigg [\frac{2\pi }{k_{\textrm{f}}(z)} \bigg ]^3 . \end{aligned}$$
(29)

Then the multi-tracer Fisher matrix in a redshift bin is

$$\begin{aligned} F_{\alpha \beta }^{\varvec{P}} = \sum _{\mu =-1}^{+1}\,\sum _{k=k_{\textrm{min}}}^{k_{\textrm{max}}}\,\partial _{\alpha }\,\varvec{P} \cdot \textrm{Cov}(\varvec{P},\varvec{P})^{-1} \cdot \partial _{\beta }\,\varvec{P}^{\textrm{T}}, \end{aligned}$$
(30)

where \(\partial _{\alpha } = \partial \,/\,\partial \vartheta _{\alpha }\). We choose the bin-widths and \(k_{\textrm{min}}\) following [88,89,90]:

$$\begin{aligned} \Delta \mu = 0.04, \quad \Delta k = k_{\textrm{f}},\quad k_{\textrm{min}} = k_{\textrm{f}}. \end{aligned}$$
(31)

The smallest scale (largest k) is chosen to ensure that linear perturbations remain accurate, since we do not require information from nonlinear scales [91]:

$$\begin{aligned} k_{\textrm{max}} = 0.08(1+z)^{2/(2+n_{s})}\,h\,\text {Mpc}^{-1}. \end{aligned}$$
(32)

The galaxy and HI IM surveys do not have the same sky area and redshift ranges (see Table 1). The overlap redshift ranges are obvious; for the sky areas, we assume a nominal overlap area of 10,000 \(\hbox {deg}^2\). The multi-tracer applies in the overlap redshift range and overlap sky area. However, we can add the independent Fisher matrices obtained in the non-overlapping regions of the two individual surveys to the multi-tracer Fisher matrix in the overlapping region [73]. In detail, the non-overlap region is

  • the non-overlap sky area of each survey, across the full redshift range of each survey;

  • the overlap sky area, across the non-overlap parts of the redshift ranges of each survey.

Then the full Fisher matrix (denoted by \(g\otimes H\)) is

$$\begin{aligned} F_{\alpha \beta }^{g \otimes H}&= F_{\alpha \beta }^{\varvec{P}}(\text {overlap})+ F_{\alpha \beta }^{g}(\text {non-overlap})\nonumber \\&\quad + F_{\alpha \beta }^{H}(\text {non-overlap}) . \end{aligned}$$
(33)

Figure 4 shows the 1\(\sigma \) error contours computed from this Fisher matrix after marginalising over the bias parameters in (27). We use the fiducial values \(\sigma _{8,0}=0.8102\), \(n_{s}=0.9665\) and \(f_{\textrm{NL}}=0\), keeping other cosmological parameters fixed to their Planck 2018 best-fit values [92]. Table 2 lists the marginalised \(\sigma (f_{\textrm{NL}})\) constraints, including the improvements (in parentheses) that follow when we ignore the HI IM foregrounds, i.e. when \(k_{\textrm{fg}}=0\).

The results for BGS, Band 1 and Band 2 are consistent with [35], which uses the angular power spectrum rather than the Fourier power spectrum used here.

Table 2 Cumulative marginalised error \(\sigma (f_{\textrm{NL}})\) for: the single-tracer surveys; the low-z and high-z multi-tracer pairs – including single-tracer information from the non-overlap volumes as in (33); and the sum of the multi-tracer pairs. Values in parenthesis correspond to ignoring foregrounds, i.e., with \(k_{\textrm{fg}}=0\)

5 Conclusion

We applied a simplified Fisher forecast analysis in order to gain insights into the improvements on \(\sigma (f_{\textrm{NL}})\) from a new multi-tracer combination of large-scale structure surveys that will be made possible by the upcoming HI intensity mapping surveys with the SKAO. For the spectroscopic galaxy samples, we used mock surveys based on DESI. This allowed for multi-tracer pairs at low and high redshifts, in order to see the level of improved precision from the higher volume at high redshifts.

We included all available Fisher information from these mock surveys, using the multi-tracer Fisher matrix on the overlap volume (sky area and redshift range) and the single-tracer Fisher matrices on the non-overlap volumes, as in (33). Our results for \(\sigma (f_{\textrm{NL}})\) are summarised in Table 2, which gives the single-tracer results for the 4 samples, the full multi-tracer results from (33) for the low- and high-redshift pairs, and finally the results from the Fisher sum of these pairs. The impact of foreground avoidance can be seen from the results (in parenthesis) when foregrounds are ignored, corresponding to \(k_\textrm{fg}=0\) in (21).

Figure 4 shows the \(1\sigma \) contours for the 3 cosmological parameters (\(\sigma _{8,\,0}\), \(n_{s}\), \(f_{\textrm{NL}}\)), marginalising over the 2 bias nuisance parameters \((b_{g0}, b_{H0}).\)

As expected, the foreground avoidance severely degrades the constraining power of the HI intensity surveys. Without foregrounds, the high-redshift Band 1 survey would perform as well as the multi-tracer combination with the ELG survey: \(\sigma (f_{\textrm{NL}})=1.8\). The reduced precision on \(f_\textrm{NL}\) for single-tracer HI intensity constraints is consistent with the findings of [15], where foreground cleaning is simulated. However, using the multi-tracer with a spectroscopic galaxy survey helps to reduce the effect of foregrounds on \(\sigma (f_{\textrm{NL}})\), as shown by Table 2.

The multi-tracer analysis is also expected to mitigate other systematics in the galaxy and HI intensity samples. For example, HI intensity mapping is also affected by radio frequency interference, receiver noise, calibration errors and polarisation leakage. Incorporating these and the galaxy systematics requires extensive simulations, beyond the scope of our paper.