1 Introduction

Precision measurements of the cosmic microwave background (CMB) anisotropies and the large-scale matter distribution have in the past two decades yielded some of the tightest constraints on particle physics. The most widely known amongst these are upper limits on the neutrino mass sum, which at \(\sum m_\nu \lesssim 0.12\) eV for a one-parameter extension of the standard \(\Lambda \)CDM model [1],Footnote 1 supersedes at face value current laboratory \(\beta \)-decay end-point spectrum measurements by a factor of 30 [2]. Of lesser prominence albeit equal significance are CMB constraints on the neutrino lifetime.

Relativistic invisible neutrino decay occurring on time scales of CMB formation leave an interesting signature on the CMB anisotropies. In the cosmological context, Standard-Model neutrinos decouple at temperatures of \(T \sim 1\) MeV and free-stream thereafter. Free-streaming could be inhibited, however, if non-standard collisional processes such as neutrino decay and especially its concurrent inverse decay—the latter of which is kinematically feasible only in the limit of an ultra-relativistic neutrino ensemble—should become efficient at \(T \lesssim 1\) MeV. At CMB formation times, the absence of neutrino free-streaming prevents the transfer of power from the monopole (density) and dipole (velocity divergence) fluctuations of the neutrino fluid to higher multipole moments (e.g., anisotropic stress) [3]. Such a suppression of higher-order anisotropies manifests itself predominantly in the CMB primary anisotropies as an enhancement of power at multipoles \(\ell > rsim 200\) in the CMB temperature power spectrum, which can be exploited to place constraints on the non-standard interaction(s) responsible for the phenomenon [4,5,6,7,8,9,10,11,12,13,14,15,16].

Following this line of argument, measurements of the Planck mission (2018 data release) currently constrain the rest-frame neutrino lifetime to \(\tau _0 > rsim (4 \times 10^5 \rightarrow 4 \times 10^6)\ (m_\nu /50\, \mathrm{meV})^5\) s, assuming invisible decay of a parent neutrino not exceeding \(m_{\nu } \simeq 0.2\) eV in mass into a massless particles [9].Footnote 2 This lower bound is to be compared with laboratory limits obtained from analyses of the neutrino disappearance rate in JUNO and KamLAND+JUNO, \(\tau _0/m_\nu > 7.5 \times 10^{-11}\) (s/eV) and \(\tau _0/m_\nu >1.1 \times 10^{-9}\) (s/eV) for a normal and an inverted mass ordering respectively [19], as well as astrophysical limits from solar neutrinos \(\tau _0/m_\nu > rsim 10^{-5} \rightarrow 10^{-3}\) (s/eV) [20, 21], SN1987A \(\tau _0 > rsim (1 \rightarrow 10)\) s [22], and IceCube \(\tau _0/m_\nu > rsim 2.4 \times 10^3\) (s/eV) [23].

Central to the extraction of \(\tau _0\) from cosmological data is the neutrino anisotropy loss rate or transport rate \(\varGamma _\mathrm{T}\), particularly how this rate relates to \(\tau _0\) itself and other fundamental properties of the particles participating in the interaction. Some of us have recently considered this issue via a rigorous computation of the transport rate from the full Boltzmann hierarchies incorporating the two-body decay \(\nu _H \leftrightarrow \nu _l + \phi \) and its inverse process induced by a Yukawa interaction [9]. For decay into massless daughter particles, Ref. [9] finds a transport rate \(\varGamma _\mathrm{T} \sim (m_\nu /E_\nu )^5 (1/\tau _0)\) under the assumption of linear response. This result is in contrast to the rate \(\varGamma _\mathrm{T} \sim (m_\nu /E_\nu )^3 (1/\tau _0)\) employed in previous analyses [4, 5, 7, 8], which is based upon a heuristic but (as we shall show) incorrect random walk argument.

While representing a considerable step forward, the result of Ref. [9] is nonetheless incomplete in that the mass difference between the parent neutrino \(\nu _H\) and daughter neutrino \(\nu _l\) must respect the neutrino mass spectra allowed by neutrino oscillations experiments. Global three-flavour analyses of oscillation data currently find the squared mass differences to be [24,25,26]

$$\begin{aligned} \varDelta m^2_{21}= & {} 7.50^{+0.22}_{-0.20} \times 10^{-5}~\mathrm{eV}^2, \end{aligned}$$
(1)
$$\begin{aligned} |\varDelta m^2_{31}|_\mathrm{N}= & {} 2.55^{+0.02}_{-0.03} \times 10^{-3}~\mathrm{eV}^2, \end{aligned}$$
(2)
$$\begin{aligned} |\varDelta m^2_{31}|_\mathrm{I}= & {} 2.45^{+0.02}_{-0.03} \times 10^{-3}~\mathrm{eV}^2, \end{aligned}$$
(3)

where \(|\varDelta m^2_{31}|_\mathrm{N}\) applies to a normal mass ordering (NO; \(m_3> m_2 > m_1\)), and \(|\varDelta m^2_{31}|_\mathrm{I}\) to an inverted mass ordering (IO; \(m_2> m_1 > m_3\)). Relative to the benchmark parent neutrino mass of 0.2 eV, these squared mass differences are clearly small enough that the assumption of a massless daughter neutrino may become untenable and consequently alter the relationship between the transport rate \(\varGamma _\mathrm{T}\) and the rest-frame decay rate \(\tau _0\) away from \(\varGamma _\mathrm{T} \sim (m_\nu /E_\nu )^5 (1/\tau _0)\).

In this work, we improve upon the original calculation of Ref. [9] to explicitly allow for a massive daughter neutrino \(\nu _l\). We furthermore extend the computation of the anisotropic stress (i.e., multipole \(\ell =2\)) damping rate to \(\ell > 2\) multipole moments—for compatibility with the neutrino Boltzmann hierarchy [27] solved in the CMB code class [28]—and put the generalisation of the calculation to other interaction structures (besides Yukawa) on firmer grounds. We find that, in the presence of a massive \(\nu _l\), a consistent transport rate \(\varGamma _\mathrm{T}\) is suppressed by an extra phase space factor proportional to \((\varDelta m_\nu ^2/m_\nu ^2)^2\) in the limit of a small squared mass difference between the parent and daughter neutrinos, i.e., \(\varGamma _\mathrm{T} \sim (\varDelta m_\nu ^2/m_\nu ^2)^2 (m_\nu /E_\nu )^5 (1/\tau _0)\), and this factor is in addition to the phase space suppression (\(\sim \varDelta m_\nu ^2/m_\nu ^2\)) already inherent in the lifetime \(\tau _0\).

Using a general version of this result valid for all \(\varDelta m_\nu ^2\), we fit the Planck 2018 CMB data to derive a new set of cosmological constraints on the neutrino lifetime as functions of the lightest neutrino mass (i.e., \(m_1\) in NO and \(m_3\) in IO) that are consistent with mass difference measurements. Readers interested in a quick number can jump directly to Fig. 3 where the new constraints on \(\tau _0\) are presented. Here, we note that for a parent neutrino of \(m_\nu \lesssim 0.1\) eV, the new phase space suppression factor weakens the CMB constraint on its lifetime by as much as a factor of 50 if \(\varDelta m_\nu ^2\) corresponds to the atmospheric mass gap and by five orders of magnitude if the solar mass gap, compared with naïve limits that do not account for the daughter neutrino mass.

The paper is organised as follows. We begin with a description of the physical system in Sect. 2. We then introduce the basic equations of motion for the individual \(\nu _H\), \(\nu _l\) and \(\phi \) component in Sect. 3, before condensing them into a single set of effective equations of motion for the combined neutrino+\(\phi \) system in Sect. 4. Using standard statistical inference techniques presented in Sect. 5, we derive in Sect. 6 new CMB constraints on the neutrino lifetime that are consistent with the experimentally determined neutrino mass splittings. We conclude in Sect. 7. Four appendices contain respectively full expressions of the collision integrals (Appendix 1), computational details of the \(\ell \ge 2\) effective damping rates (Appendix 1), a revised random walk picture explaining the \(\propto (m_\nu /E_\nu )^5\) dependence of the transport rate and why the old \(\propto (m_\nu /E_\nu )^3\) rate is incorrect (Appendix 1), and summary statistics of our parameter estimation analyses (Appendix 1).

2 The physical system

We consider the two-body decay of a parent neutrino \(\nu _H\) into a massive daughter neutrino \(\nu _l\) and a massless particle \(\phi \), whose transition probability is described by a Lorentz-invariant squared matrix element \(|{{{\mathcal {M}}}}|^2\). Previously in Ref. [9] we had assumed \(\phi \) to be a scalar and the decay described by a Yukawa interaction, motivated by Majoron models [29,30,31] in which a new Goldstone boson of a spontaneously broken global \(U(1)_{B-L}\) symmetry couples to neutrinos. Another possibility arises within gauged \(L_i -L_j\) models, where a new vector boson plays the role of the massless state [32,33,34,35]. Models of neutrinos decaying on cosmological timescales can be found in Refs. [36,37,38,39]. Here, in order to keep our analysis as general as possible, apart from the quantum statistical requirement that \(\phi \) must be boson, we shall leave its nature and the nature of the interaction unspecified.

Because cosmological observables do not measure neutrino spins, it is not possible to define a reference direction in the decaying neutrino’s rest-frame. Thus, for an ensemble of neutrinos with randomly aligned spins, we can effectively consider the decay to be isotropic in the decaying neutrino’s rest-frame, with a decay rate \(\varGamma _\mathrm{dec}^0 \equiv 1/\tau _0\) related to the squared matrix element via

$$\begin{aligned} \varGamma ^0_\mathrm{dec} = \frac{ \varDelta (m_{\nu H},m_{\nu l},m_\phi )}{16 \pi m_{\nu H}^3} |{{{\mathcal {M}}}}|^2, \end{aligned}$$
(4)

where

$$\begin{aligned} \varDelta (x,y,z) \equiv \sqrt{\left[ x^2-\left( y+z\right) ^2 \right] \left[ x^2-\left( y-z\right) ^2 \right] } \end{aligned}$$
(5)

is the Källén function, and it is understood that \(|{{{\mathcal {M}}}}|^2\) needs to be evaluated at particle energies and momenta consistent with conservation laws.

In the cosmological context, the decay process can be expected to come into effect around scale factors \(a \sim a_\mathrm{dec}\), where \(a_\mathrm{dec}\) is defined via \(\varGamma _\mathrm{dec} (a_\mathrm{dec})= H(a_\mathrm{dec})\), with \(\varGamma _\mathrm{dec}=\left\langle m_{\nu H}/E_{\nu H} \right\rangle \varGamma _\mathrm{dec}^0\) the ensemble-averaged Lorentz-boosted decay rate. We are interested in the decay of \(\nu _H\) occurring (i) up to the time of recombination but substantially after neutrino decoupling, i.e., \(10^{-10} \ll a_\mathrm{dec} \lesssim 10^{-3}\), and (ii) while the bulk of the \(\nu _H\) ensemble is ultra-relativistic. Two observations are in order:

  1. 1.

    The ultra-relativistic requirement guarantees that the inverse decay process \(\nu _l + \phi \rightarrow \nu _H\) is kinematically allowed and kicks in on roughly the same time scale as \(\nu _H\) decay, enabling \(\nu _H\), \(\nu _l\), and \( \phi \) to form a coupled system whose combined anisotropic stress can be suppressed through collisional damping. The same requirement also effectively limits the parent neutrino mass to the ballpark \(m_{\nu H} \lesssim 3 T_{\nu } (a_\mathrm{rec})\sim 3 \times (4/11)^{1/3} T_{\gamma } (a_\mathrm{rec}) \ \sim 0.2\) eV, if the phenomenology of anisotropic stress loss is to remain unambiguously applicable up to the recombination era when the temperature of the photon bath drops to \(T_\gamma (a_\mathrm{rec}) \sim 0.1\) eV.

  2. 2.

    Since decay and inverse decay occur by construction only after neutrino decoupling, our scenario does not generate an excess of radiation. Rather, the \((\nu _H,\nu _l,\phi )\)-system merely shares the energy that was in the original \(\nu _H\) and \(\nu _l\) populations. Consequently, in the time frame of interest cosmology is unaffected at the background level, and the phenomenology of the \((\nu _H,\nu _l,\phi )\)-system vis-à-vis the CMB primary anisotropies is at leading order limited to anisotropic stress loss.

These observations tell us that, as far as the CMB primary anisotropies are concerned, the \((\nu _H,\nu _l,\phi )\)-system can be modelled at leading order as a single massless fluid whose energy density is equivalent to that of the original \(\nu _H\) and \(\nu _l\) populations at \(a <a_\mathrm{dec}\). What remains to be specified is the rate at which the fluid’s anisotropic stress and higher-order multipole moments are lost via decay and inverse decay; this is the subject of Sects. 3 and 4.

3 Basic equations of motion

A complete set of evolution equations for the individual \(\nu _H\), \(\nu _l\), and \(\phi \) phase space density under the influence of gravity and decay/inverse decay has been presented in Ref. [9]. We briefly describe the relevant ones below.

We work in the synchronous gauge defined by the invariant \(\mathrm{d}s^2 = a^2(\tau ) \left[ - \mathrm{d} \tau ^2 + (\delta _{ij} + h_{ij}) \mathrm{d} x^i \mathrm{d} x^j \right] \), and split the Fourier-transformed phase space density of species i into a homogeneous and isotropic component and a perturbed part, i.e., \(f_i(\mathbf {k},\mathbf {q},\tau ) = {\bar{f}}_i(|\mathbf {q}|,\tau ) + F_i (\mathbf {k},\mathbf {q},\tau )\), where \(\tau \) is conformal time, \(\mathbf {q}\equiv |\mathbf {q}| {\hat{q}}\) the comoving 3-momentum, and \(\mathbf {k} \equiv |\mathbf {k}| {\hat{k}}\) is the Fourier wave vector conjugate to the comoving spatial coordinates \(\mathbf {x}\) of the line element.

Because for the scenario at hand the \((\nu _H,\nu _l,\phi )\)-system behaves at leading order essentially as a single massless fluid whose energy density remains constant in the time frame of interest, the evolution of the individual homogeneous phase space density \({\bar{f}}_i(|\mathbf {q}|,\tau )\) is not a main concern. In fact, in Sect. 4 we shall approximate all \({\bar{f}}_i\)’s with equilibrium distributions in order to derive closed-form expressions for the effective damping rates.

Of greater interest to us is the perturbed component \(F_i (\mathbf {k},\mathbf {q},\tau )\). Following standard practice [27], we decompose \(F_i\) in terms of a Legendre series,

$$\begin{aligned} \begin{aligned} F_{i}(|\mathbf {k}|,|\mathbf {q}|,x)&\equiv \sum _{\ell =0}^{\infty }(-\mathrm{i})^{\ell } (2\ell +\!1) F_{i,\ell }(|\mathbf {k}|,|\mathbf {q}|) P_{\ell }(x), \\ F_{i,\ell }(|\mathbf {k}|,|\mathbf {q}|)&\equiv \frac{\mathrm{i}^{\ell }}{2} \int _{-1}^{1} \mathrm {d}x \, F_{i}(|\mathbf {k}|,|\mathbf {q}|,x) P_{\ell }(x), \end{aligned} \end{aligned}$$
(6)

where \(x\equiv {\hat{k}}\cdot {\hat{q}}\), and \(P_\ell (x)\) is a Legendre polynomial of degree \(\ell \). Then, the equations of motion for the Legendre moments \({F}_{i,\ell }\) can be written as

$$\begin{aligned} \begin{aligned} {\dot{F}}_{i,0}&= -\frac{|\mathbf {q}| |\mathbf {k}|}{\epsilon _i} F_{i,1}+ \frac{1}{6} \frac{\partial {\bar{f}}_{i}}{\partial \ln |\mathbf {q}|}{\dot{h}} + \left( \frac{\mathrm{d} f_i}{\mathrm{d} \tau }\right) _{C,0}^{(1)} , \\ {\dot{F}}_{i,1}&= \frac{|\mathbf {q}| |\mathbf {k}|}{\epsilon _i} \left( -\frac{2}{3} F_{i,2}+\frac{1}{3} F_{i,0}\right) + \left( \frac{\mathrm{d} f_i}{\mathrm{d} \tau }\right) _{C,1}^{(1)},\\ {\dot{F}}_{i,2}&= \frac{|\mathbf {q}| |\mathbf {k}|}{\epsilon _i} \left( -\frac{3}{5} F_{i,3} + \frac{2}{5} F_{i,1} \right) \\&\quad - \frac{\partial {\bar{f}}_{i}}{\partial \ln |\mathbf {q}|} \left( \frac{2}{5} \dot{\eta } + \frac{1}{15} {\dot{h}} \right) + \left( \frac{\mathrm{d} f_i}{\mathrm{d} \tau }\right) _{C,2}^{(1)} ,\\ {\dot{F}}_{i,\ell >2}&=\, \frac{|\mathbf {k}|}{2\ell +1} \frac{|\mathbf {q}|}{\epsilon _i} \left[ \ell F_{i,\ell -1} - (\ell + 1)F_{i,\ell +1} \right] \\&\quad + \left( \frac{\mathrm{d} f_i}{\mathrm{d} \tau }\right) _{C,\ell }^{(1)} . \end{aligned} \nonumber \\ \end{aligned}$$
(7)

Here, \(\cdot \equiv \partial /\partial \tau \), \(\epsilon _i (|\mathbf {q}|)\equiv (|\mathbf {q}|^2 + a^2 m_i^2)^{1/2}\) is the comoving energy of the particle species i of mass \(m_i\), \(h \equiv \delta ^{ij} h_{ij}(\mathbf {k},\tau )\) and \(\eta = \eta (\mathbf {k},\tau ) \equiv - k^i k^j h_{ij}/(4 k^2) + h/12\) are the trace and traceless components of \(h_{ij}\) in Fourier space respectively, and \(\left( \mathrm{d} f_i/\mathrm{d} \tau \right) _{C,\ell }^{(1)}\) represents the \(\ell \)th Legendre moment of the linear-order Boltzmann collision integral.

The linear-order decay/inverse decay collision integrals have been derived in Ref. [9] assuming a Yukawa interaction. However, following the arguments of Sect. 2 and using the fact that the squared matrix element is Lorentz-invariant, it is straightforward to generalise the integrals of Ref. [9] to any interaction structure. The same argument also allows us to recast the collision integrals in favour of the rest-frame decay rate \(\varGamma _\mathrm{dec}^0\) as a fundamental parameter (over the squared matrix element \(|{{{\mathcal {M}}}}|^2\)). The interested reader can find the full expressions in Eqs. (A.1)–(A.3) in Appendix 1.

The equations of motion (7) together with the collision integrals (A.1)–(A.3) can be programmed directly into and solved by a CMB code such as class [28]. In practice, however, numerical solutions particularly in the ultra-relativistic limit can be highly non-trivial, because of the vastly different time scales at play: the basic decay time scale \(1/\varGamma _\mathrm{dec}\) and the emergent transport time scale \(1/\varGamma _\mathrm{T} \sim (E_\nu /m_\nu )^4 (1/\varGamma _\mathrm{dec})\). In Sect. 4 we show how these equations of motion can be condensed in the ultra-relativistic limit into a single set of equations for the combined \((\nu _H,\nu _l,\phi )\)-system under certain reasonable simplifying assumptions.

4 Effective equations of motion for the neutrino+\(\phi \) system

We are interested in the evolution of perturbations in the combined \((\nu _H,\nu _l,\phi )\)-system in the ultra-relativistic limit. To this end we define the integrated Legendre moment,

$$\begin{aligned} \begin{aligned} {{{\mathcal {F}}}}_\ell (|\mathbf {k}|)&\equiv \, \frac{(2 \pi ^2 a^4)^{-1}}{\bar{\rho }_{\nu \phi }} \sum _{i=\nu _H,\nu _l,\phi } g_i \\&\quad \times \int \mathrm{d} |\mathbf {q}|\; |\mathbf {q}|^2 \epsilon _i \left( \frac{|\mathbf {q}|}{\epsilon _i} \right) ^\ell F_{i,\ell } (|\mathbf {q}|), \end{aligned} \end{aligned}$$
(8)

where \(g_i\) is the internal degree of freedom of the ith particle species, and \(\bar{\rho }_{\nu \phi }\) represents the total background energy density of the \((\nu _H,\nu _l,\phi )\)-system which we take to be ultra-relativistic, i.e., \(\bar{\rho }_{\nu \phi } \propto a^{-4}\), and equal to the energy density in the pre-decay \(\nu _H\) and \(\nu _l\) populations.

Summing and integrating the equations of motion, Eq. (7), as per the definition (8) and taking the ultra-relativistic limit everywhere except the collision terms, we obtain

$$\begin{aligned} \begin{aligned} \dot{{{\mathcal {F}}}}_0&= - |\mathbf {k}| {{{\mathcal {F}}}}_1 - \frac{2}{3} {\dot{h}}, \\ \dot{{{\mathcal {F}}}}_1&= \frac{|\mathbf {k}|}{3} \left( {{{\mathcal {F}}}}_0 - 2{{{\mathcal {F}}}}_2 \right) ,\\ \dot{{{\mathcal {F}}}}_{2}&= \frac{|\mathbf {k}|}{5} \left( 2 {{{\mathcal {F}}}}_1 - 3 {{{\mathcal {F}}}}_3 \right) + \frac{4}{15} {\dot{h}} + \frac{8}{5} \dot{\eta } + \left( \frac{\mathrm{d} {{{\mathcal {F}}}}}{\mathrm{d} \tau }\right) _{C,2} , \\ \dot{{{\mathcal {F}}}}_{\ell >2}&=\, \frac{|\mathbf {k}|}{2\ell +1}\left[ \ell {{{\mathcal {F}}}}_{\ell -1} - (\ell +1){{{\mathcal {F}}}}_{\ell +1} \right] + \left( \frac{\mathrm{d} {{{\mathcal {F}}}}}{\mathrm{d} \tau }\right) _{C,\ell } , \end{aligned} \nonumber \\ \end{aligned}$$
(9)

where

$$\begin{aligned} \begin{aligned} \left( \frac{\mathrm {d} {{{\mathcal {F}}}}}{\mathrm {d}\tau }\right) _{C,\ell }&\equiv \, \frac{(2 \pi ^2 a^4)^{-1}}{\bar{\rho }_{\nu \phi }} \sum _{i=\nu _H,\nu _l,\phi } g_i \\&\quad \times \int \mathrm{d} |\mathbf {q}|\; |\mathbf {q}|^2 \epsilon _i \left( \frac{|\mathbf {q}|}{\epsilon _i} \right) ^\ell \left( \frac{\mathrm {d} f_i}{\mathrm {d}\tau }\right) _{C, \ell }^{(1)} (|\mathbf {q}|) \end{aligned} \end{aligned}$$
(10)

are the effective collision integrals constructed from the individual collision integrals (A.1)–(A.3). Note that the effective collision integrals for \(\ell =0,1\) integrated moments must vanish exactly because of energy and momentum conservation in the \((\nu _H,\nu _l,\phi )\)-system.

4.1 Effective collision integrals

Equations (9) and (10) are not yet in a closed form. To put them into a closed form, several simplifying approximations to the effective collision integrals are in order [9]:

  1. 1.

    We assume equilibrium Maxwell-Boltzmann statistics, meaning that the background phase space density of each particle species can be described at any instant by \({\bar{f}}_i(|\mathbf {q}|) = e^{-(\epsilon _i-\mu _i)/T_0}\), with a common comoving temperature \(T_0\) and chemical potentials satisfying \(\mu _{\nu H} = \mu _{\nu l} + \mu _\phi \). Physically, this means we take the background equilibration rate \(\varGamma _\mathrm{dec}\) to be much larger than the loss rates of the anisotropic stress and higher-order anisotropies we are interested to compute. As demonstrated in Ref. [9], relativistic \(\nu _H\) decay and inverse decay attain a steady-state/quasi-equilibrium regime in which the above equilibrium conditions are reasonably well satisfied.

  2. 2.

    We take the individual species’ Legendre moments \(F_{i,\ell }(|\mathbf {k}|,|\mathbf {q}|)\) to be separable functions of \(|\mathbf {k}|\) and \(|\mathbf {q}|\), given by [12]

    $$\begin{aligned} F_{i,\ell } (|\mathbf {k}|,|\mathbf {q}|) \simeq - \frac{1}{4} \frac{\mathrm{d} {\bar{f}}_i}{\mathrm{d} \ln |\mathbf {q}|}\, {{{\mathcal {F}}}}_{i,\ell } (|\mathbf {k}|), \end{aligned}$$
    (11)

    where \({{{\mathcal {F}}}}_{i,\ell }\) is the analogue of the integrated Legendre moment \({{{\mathcal {F}}}}_{\ell }\) of Eq. (8) but for the ith species alone. In the absence of collisions, this so-called separable ansatz (11) is exact in the ultra-relativistic limit and reflects the fact that, for massless particles, gravity is achromatic and all \(|\mathbf {q}|\)-modes evolve in the same way. For collisional systems near equilibrium, i.e., where the momentum-dependence of the background phase space densities is largely constant in time, the ansatz also appears to capture the evolution of \( {{{\mathcal {F}}}}_{i,\ell \le 2} (|\mathbf {k}|)\) well in the ultra-relativistic limit, across a range of \(|\mathbf {k}|\)-modes and redshifts [12].

  3. 3.

    In standard cosmology, all \(\ell \ge 2\) anisotropies are induced by gravity after a Fourier \(\mathbf {k}\)-mode enters the horizon. Gravity induces the same perturbation contrast \({{{\mathcal {F}}}}_{i,\ell }\) in all particle species at the same point in space, which immediately leads us to our third approximation

    $$\begin{aligned} {{{\mathcal {F}}}}_{\nu H,\ell } (|\mathbf {k}|) \simeq {{{\mathcal {F}}}}_{ \nu l,\ell } (|\mathbf {k}|) \simeq {{{\mathcal {F}}}}_{\phi ,\ell } (|\mathbf {k}|) \simeq {{{\mathcal {F}}}}_{\ell } (|\mathbf {k}|), \end{aligned}$$
    (12)

    where \({{{\mathcal {F}}}}_{\ell } (|\mathbf {k}|)\) is identically the \(\ell \)th integrated Legendre moment of Eq. (8).

Note that by making these reasonable assumptions, we have essentially recast the problem of establishing the effective transport rate into a relaxation time calculation in terms of the system’s linear response to a small external perturbation.

Under the above approximations and assuming further that \(m_\phi =0\), explicit evaluation of the effective collision integral (10) yields

$$\begin{aligned} \left( \frac{\mathrm{d} {{{\mathcal {F}}}}}{\mathrm{d} \tau } \right) _{C,\ell }= & {} - \alpha _\ell \, a \, \tilde{\varGamma }_\mathrm{dec} \left( \frac{ a m_{\nu H}}{T_0} \right) ^4 \nonumber \\&\times {\mathscr {F}} \left( \frac{a m_{\nu H}}{T_0}\right) \, \Phi \left( \frac{m_{\nu l}}{m_{\nu H}} \right) \, {{{\mathcal {F}}}}_\ell . \end{aligned}$$
(13)

See Appendix 1 for details of the derivation. Here, the rate \(\tilde{\varGamma }_\mathrm{{dec}}\) is approximately the boosted decay rate and is defined as in Ref. [9] via

$$\begin{aligned} \tilde{\varGamma }_\mathrm{dec}\equiv & {} \, \frac{(4 \pi ^2 a^4)^{-1}}{\bar{\rho }_{\nu \phi }} \, a m_{\nu H} T_0^3 \, g_{\nu H} \, e^{\, \mu _{\nu H}/T_0} \, \varGamma ^0_\mathrm{dec} \nonumber \\= & {} \, \frac{1}{12} \, \frac{(a^4 \bar{\rho }_{\nu H})}{(a^4 \bar{\rho }_{\nu \phi })} \, \frac{a m_{\nu H}}{T_0} \, \varGamma ^0_\mathrm{dec}, \end{aligned}$$
(14)

where \(\bar{\rho }_{\nu H} \equiv (3 g_{\nu H} T_0^4/a^4 \pi ^2) \, e^{\, \mu _{\nu H} / T_0}\) is the homogeneous, background energy density of \(\nu _H\) in the ultra-relativistic limit.

The dimensionless function \({\mathscr {F}}(x)\) in Eq. (13) is a decreasing function of x given by

$$\begin{aligned} {\mathscr {F}}(x) = \frac{1}{2} e^{-x} \Big [ - 1 + x - e^x (x^2-2) \varGamma (0,x) \Big ], \end{aligned}$$
(15)

where \(\varGamma (0,x)\) denotes the incomplete gamma function. For \( x \sim 10^{-10} \rightarrow 0.1\), \({\mathscr {F}}(x)\) evaluates to \(\sim 20 \rightarrow 1\). Beyond \(x \sim 1\), \({\mathscr {F}}(x)\) quickly drops to zero, reflecting the fact that after \(\nu _H\) becomes non-relativistic, inverse decay must also shut down so that the combined neutrino+\(\phi \) fluid reverts to a free-streaming system. In order to bypass the incomplete gamma function in our numerical solution, we find it convenient to use the small-x expansion

$$\begin{aligned} \begin{aligned} {\mathscr {F}}_\mathrm{sm}(x)&\simeq -\frac{1}{2} + 2 x - x^2 -\frac{x^3}{9} + \frac{x^4}{96} - \frac{x^5}{900} \\&\quad + \frac{x^6}{8640} +\left( \gamma + \log x\right) \left( \frac{x^2}{2} -1 \right) + {{{\mathcal {O}}}}(x^7), \end{aligned}\nonumber \\ \end{aligned}$$
(16)

where \(\gamma \simeq 0.57721\) is Euler-Mascheroni constant. The expansion is accurate to \(<7\)% for \(x \le 2\). For \(x > 2\), we switch to the function \({\mathscr {F}}_\mathrm{lg}(x) \simeq 2.4 \, e^{-x}/x^{1.5}\), which approximates Eq. (15) to \(<10\)% at \(2< x < 17\).

Relative to the result of Ref. [9], the collision integral (13) also contains two new elements. Firstly, the phase space factor,

$$\begin{aligned} \Phi (y) = \frac{1}{1 - y^2} \left( 1 - y^4 + 4\, y^2 \ln y \right) , \end{aligned}$$
(17)

allows for the possibility of a nonzero \(m_{\nu l}\) and takes on values \(1 \rightarrow 0\) in the region \(0 \le y < 1\) (or, equivalently, \(0 \le m_{\nu l} < m_{\nu H}\)). In the limit of a small squared mass gap, i.e., \(\varDelta m_\nu ^2 \equiv m_{\nu H}^2-m_{\nu l}^2 \ll m_{\nu H}^2\), it reduces to \(\Phi (m_{\nu l}/m_{\nu H}) \rightarrow (1/3) (\varDelta m_\nu ^2/m_{\nu H}^2)^2\) and traces its origin to the fact that the momentum carried by each decay product in the rest frame of \(\nu _H\) is in fact \(\varDelta m_\nu ^2/(2 m_{\nu H})\), not \(m_{\nu H}/2\) as in the case of massless decay products. Secondly, the \(\ell \)-dependent coefficients \(\alpha _\ell \) are given by the expression

$$\begin{aligned} \alpha _\ell = \frac{1}{32} \left( 3 \ell ^4 + 2 \ell ^3 - 11 \ell ^2 + 6 \ell \right) \end{aligned}$$
(18)

for all \(\ell \ge 0\). These coefficients increase with \(\ell \), reflecting the fact that, for a fixed decay opening angle, the decay and inverse decay processes become more efficient at wiping out anisotropies on progressively smaller angular scales.

Lastly, we highlight again that the effective collision rate in Eq. (13) has in total five powers of \(a m_{\nu H}/T_0\), one of which is the Lorentz boost appearing in the boosted decay rate (14). This result is in drastic contrast with the much larger \(\propto (a m_{\nu H}/T_0)^3\) rate used in previous works [4, 5, 7, 8], which we stress is not an ab initio result, but rather one based upon a heuristic random walk argument. Appendix 1 shows that this heuristic argument is in fact incomplete, and that a consistent random walk picture accounting for all same-order effects must yield the much smaller \(\propto (a m_{\nu H}/T_0)^5\) rate derived from first principles in this work.

4.2 Three-state to two-state approximation

The decay scenario presented thus far concerns only two of the three Standard-Model neutrino mass states. Without model-specific motivations to exclude one particular mass state from the decay interaction, one might generally expect all three mass states to couple similarly at the Lagrangian level and participate in the decay/inverse decay process as parent particles and/or decay products. Then, kinematics largely determine the decay rates, and we can deduce from the measured neutrino squared mass differences that the rest-frame decay rates \(\varGamma ^0_{i \rightarrow j}\) for the processes \(\nu _i \rightarrow \nu _j + \phi \) follow the patterns

$$\begin{aligned} \varGamma _{3 \rightarrow 1 }^0 \simeq \varGamma ^0_{3 \rightarrow 2} \gg \varGamma ^0_{2 \rightarrow 1} \end{aligned}$$
(19)

assuming a normal mass ordering, and

$$\begin{aligned} \varGamma _{2 \rightarrow 3 }^0 \simeq \varGamma ^0_{1 \rightarrow 3} \gg \varGamma ^0_{2 \rightarrow 1} \end{aligned}$$
(20)

assuming an inverted mass ordering. All IO mass spectra and those NO mass spectra consistent with \(m_1 > rsim 0.03\) eV satisfy the condition (19) or (20) at better than 10%. For \(m_1 \lesssim 0.03\) eV in NO, the ratio \(\varGamma ^0_{2 \rightarrow 1}/\varGamma ^0_{3\rightarrow 2}\) rises to \(\sim 0.25\) while \(\varGamma ^0_{3 \rightarrow 1}/\varGamma ^0_{3\rightarrow 2}\) approaches 0.85 [9]. Thus, Eq. (19) is still a reasonably well satisfied.

If the two larger (boosted) decay rates satisfy the equilibrium condition \(\varGamma _{i \rightarrow j} > rsim H\), then, irrespective of the mass ordering, we can expect the phenomenology of the three-flavour system to be similar to the two-flavour case discussed in Sect. 2, save for the understanding that it is now the \(\nu _1\), \(\nu _2\), \(\nu _3\), and \(\phi \) that form a single massless fluid system. It remains to determine the effective collision integral and hence the \(\ell \ge 2\) anisotropy damping rates, which we approximate as follows.

Normal mass ordering Here, we assume the two lightest mass states with masses \(m_1\) and \(m_2\) to be effectively degenerate and set \(\varGamma ^0_{2 \rightarrow 1} = 0\). Then, following the recipe laid down in Ref. [9], the equations of motion can be condensed by multiplying by a factor of 2 (i) the \(\nu _H\) and \(\phi \) collision integrals (A.1) and (A.3), and (ii) all momentum-integrated quantities pertaining to \(\nu _l\). This immediately leads to an effective collision integral

$$\begin{aligned} \left( \frac{\mathrm {d} {{{\mathcal {F}}}}}{\mathrm {d}\tau }\right) _{C,\ell }= & {} \, 2 \, \frac{(2 \pi ^2 a^4)^{-1}}{\bar{\rho }_{\nu \phi }} \sum _{i=\nu _H,\nu _l,\phi } g_i \nonumber \\&\times \int \mathrm{d} |\mathbf {q}|\; |\mathbf {q}|^2 \epsilon _i \left( \frac{|\mathbf {q}|}{\epsilon _i} \right) ^\ell \, \left( \frac{\mathrm {d} f_i}{\mathrm {d}\tau }\right) _{C, \ell }^{(1)} (|\mathbf {q}|) , \end{aligned}$$
(21)

that differs from the original definition (10) and hence Eq. (13) by an overall factor of 2.

Inverted mass ordering Here, we treat the two heaviest mass states as effectively degenerate and again set \(\varGamma ^0_{2 \rightarrow 1}=0\). Then, multiplying by a factor of 2 (i) the \(\nu _l\) and \(\phi \) collision integrals (A.2) and (A.3), and (ii) all momentum-integrated quantities of \(\nu _H\), it is straightforward to show the same overall factor of 2 appears again in front of the effective collision integral as in the normal mass ordering case in Eq. (21).

4.3 Effective temperature and density ratio

We close this section with a brief discussion of the effective comoving temperature \(T_0\) and the \(\nu _H\) background energy density \(\bar{\rho }_{\nu H}\) after equilibration, both of which appear in the effective collision integral (13).

The \(\nu _H\) background energy density appears in the energy density ratio \((a^4 \bar{\rho }_{\nu H})/(a^4 \bar{\rho }_{\nu \phi })\), where we remind the reader that \(\bar{\rho }_{\nu \phi }\) stands for the total energy density in the combined \((\nu _H, \nu _l, \phi )\)-system and is, leaving aside dilution due to expansion, equal to the pre-decay energy density in \(\nu _H\) and \(\nu _l\) . This energy density ratio generally evolves with time, but at a rate much smaller than the Hubble expansion H in the steady-state/quasi-equilibrium regime of our assumptions; Ref. [9] finds numerically a fairly constant

$$\begin{aligned} \frac{(a^4 \bar{\rho }_{\nu H})}{(a^4 \bar{\rho }_{\nu \phi })} \sim \frac{1}{3} \rightarrow \frac{1}{2} \end{aligned}$$
(22)

in the time frame of interest (i.e., where \(\varGamma _\mathrm{dec} \gg H\) and the ultra-relativistic approximation is well satisfied). In our analysis, we take this ratio to be 1/3.

The temperature \(T_0\) is the effective comoving temperature of the combined \((\nu _H, \nu _l, \phi )\)-system, and should be lower than the pre-decay neutrino comoving temperature \(T_{\nu ,0} \simeq 1.95~\mathrm{K}\simeq 1.68 \times 10^{-4}\) eV. To estimate \(T_0\), we note that energy conservation imposes the relation \(a_\mathrm{pre}^4 \left( \bar{\rho }_{\nu H, \mathrm{pre}} + \bar{\rho }_{\nu l, \mathrm{pre}} \right) = a_\mathrm{post}^4 \left( \bar{\rho }_{\nu H, \mathrm{post}} + \bar{\rho }_{\nu l, \mathrm{post}} + \bar{\rho }_{\phi }\right) \) between the pre-decay and post-decay energy densities. Assuming Maxwell-Boltzmann statistics (i.e., \(\bar{\rho } =3 T_0 {\bar{n}}/a\)), this leads to

$$\begin{aligned} \frac{T_0}{T_{\nu ,0}} = \frac{a_\mathrm{pre}^3 \left( {\bar{n}}_{\nu H, \mathrm{pre}} + {\bar{n}}_{\nu l, \mathrm{pre}}\right) }{a_\mathrm{post}^3 \left( {\bar{n}}_{\nu H, \mathrm{post}} + {\bar{n}}_{\nu l, \mathrm{post}} + {\bar{n}}_{\phi } \right) }. \end{aligned}$$
(23)

The two-body decay also stipulates number conservation laws, namely, \(a_\mathrm{pre}^3 {\bar{n}}_{\nu H,\mathrm{pre}} = a_\mathrm{post}^3 \left( {\bar{n}}_{\nu H,\mathrm{post}} + {\bar{n}}_\phi \right) \) and \(a_\mathrm{pre}^3 {\bar{n}}_{\nu l,\mathrm{pre}} = a_\mathrm{post}^3 \left( {\bar{n}}_{\nu l,\mathrm{post}} - {\bar{n}}_\phi \right) \), which, upon substitution into Eq. (23), yields

$$\begin{aligned} \frac{T_0}{T_{\nu ,0} }= & {} 1 - \frac{ {\bar{n}}_{\phi }}{{\bar{n}}_{\nu H, \mathrm{post}} + {\bar{n}}_{\nu l, \mathrm{post}} + {\bar{n}}_{\phi } } \nonumber \\\simeq & {} 1 - \frac{ \bar{\rho }_{\phi }}{\bar{\rho }_{\nu \phi }}. \end{aligned}$$
(24)

Numerically, Ref. [9] finds the energy density ratio to be at most \(\bar{\rho }_{\phi }/\bar{\rho }_{\nu \phi } \sim 0.1\) in the steady-state regime of interest. Therefore, in this analysis, we take \(T_0\) to be the same as the pre-decay neutrino comoving temperature \(T_{\nu ,0}\).

5 Statistical inference

Having derived the effective equations of motion in a way consistent with neutrino oscillation data, we wish now to put constraints on the neutrino rest-frame lifetime \(\tau _0\) using cosmological data.

Before we describe our data analysis, observe that the effective collision integral (13) can be broken down in terms of its dependence on the scale factor a into

$$\begin{aligned} \left( \frac{\mathrm{d} {{{\mathcal {F}}}}}{\mathrm{d} \tau } \right) _{C,\ell } = - \alpha _\ell \, a^6 \, Y\, {\mathscr {F}} \left( a X \right) {{{\mathcal {F}}}}_\ell . \end{aligned}$$
(25)

Here,

$$\begin{aligned} X \equiv \frac{m_{\nu H}}{T_0} \simeq 298 \, \left( \frac{m_{\nu H}}{0.05~\mathrm{eV}}\right) \, \left( \frac{T_{\nu ,0}}{T_0}\right) \end{aligned}$$
(26)

is an effective mass parameter, while

$$\begin{aligned} \begin{aligned} Y&\equiv \, \frac{C}{12} \, \frac{(a^4 \bar{\rho }_{\nu H})}{(a^4 \bar{\rho }_{\nu \phi })} \, \Phi \left( \frac{m_{\nu l}}{m_{\nu H}}\right) \left( \frac{m_{\nu H}}{T_0}\right) ^5 \varGamma ^0_\mathrm{dec} \\&\simeq 6.55\, C \times 10^{10} \, \left( \frac{(a^4 \bar{\rho }_{\nu H})}{(a^4 \bar{\rho }_{\nu \phi })} \Big / \frac{1}{3}\right) \left( \frac{T_{\nu ,0}}{T_0}\right) ^5 \\&\quad \times \Phi \left( \frac{m_{\nu l}}{m_{\nu H}}\right) \, \left( \frac{m_{\nu H}}{0.05~\mathrm{eV}}\right) ^5\, \varGamma ^0_\mathrm{dec} \end{aligned} \end{aligned}$$
(27)

is an effective transport rate, with \(C=1\) for the one parent/one daughter scenario (“one decay channel”), and \(C=2\) if we wish to use the three-state to two-state approximation to describe the case of two parents/one daughter or one parent/two daughters (“two decay channels”). Both X and Y contain only quantities that are taken to be constant in time in our analysis (see Sect. 4.3), and together these two effective parameters determine completely the phenomenology of the effective collision integral (13).

5.1 Model parameter space

To derive credible limits on the rest-frame lifetime \(\tau _0\), we consider two one-variable extensions to the standard six-variable \(\Lambda \text {CDM}\) fit, scenarios A and B, whose parameter spaces are spanned respectively by

$$\begin{aligned} \begin{aligned}&{\varvec{\theta }}_A = \{\omega _{b}, \omega _{c}, \theta _s, \tau _\mathrm{reio}, n_s, \ln (10^{10} A_s), Y \}, \\&{\varvec{\theta }_A}^\mathrm{fixed} = \{N_\mathrm{fs} =1, N_\mathrm{int} = 2.0440, X=X_\mathrm{scan}\}, \end{aligned}\nonumber \\ \end{aligned}$$
(28)

and

$$\begin{aligned} \begin{aligned}&\varvec{\theta }_B = \{\omega _{b}, \omega _{c}, \theta _s, \tau _\mathrm{reio}, n_s, \ln (10^{10} A_s), Y \}, \\&{\varvec{\theta }_B}^\mathrm{fixed} = \{N_\mathrm{fs} =0, N_\mathrm{int} = 3.0440,X=X_\mathrm{scan}\}. \end{aligned}\nonumber \\ \end{aligned}$$
(29)

In both scenarios, the variables are the physical baryon density parameter \(\omega _{b}\), the physical cold dark matter density parameter \(\omega _{c}\), the angular sound horizon \(\theta _s\), the optical depth to reionisation \(\tau _\mathrm{reio}\), the spectral index of the primordial curvature power spectrum \(n_{s}\) and its amplitude \(A_s\) at the pivot scale \(k_\mathrm{piv} = 0.05~\mathrm{Mpc}^{-1}\), and the effective transport rate Y. The scenarios differ in the choices of fixed parameters in the neutrino sector.

Scenario A Here, the numbers of free-streaming and interacting neutrinos are always fixed at \(N_\mathrm{fs}=1\) and \(N_\mathrm{int}=2.0440\) respectively (such that \(N_\mathrm{fs}\) and \(N_\mathrm{int}\) add up to the standard \(N_\mathrm{eff}^\mathrm{SM} = 3.0440\) [40, 41]), while the effective mass parameter X is fixed at a value \(X_\mathrm{scan}\) to be discussed below. We model all neutrinos as massless states in class, in keeping with the limits of validity of our effective equations of motion (13). Planck 2018 CMB data alone currently constrain the neutrino mass sum to \(\sum m_\nu \lesssim 0.3\) eV (95%C.I.) in a \(\Lambda \)CDM+\(m_\nu \) 7-variable fit [1]; in combination with non-CMB data, the bound tightens to \(\sum m_\nu \lesssim 0.12\) eV (95%C.I.) [1]. Therefore, as long as we fit CMB data only and cap the parent neutrino mass at \(m_{\nu H}\simeq 0.1\) eV in the interpretation of our fit outcomes, our modelling is internally consistent. Note that this is a lower cap than the limit \(m_{\nu H}\lesssim 0.2\) eV established in Sect. 2, which, we remind the reader, stems from requiring that the neutrinos stay ultra-relativistic up to recombination.Footnote 3

The mass cap also sets an upper limit on the value of \(X_\mathrm{scan}\) that can be unambiguously explored with our modelling. Specifically, for the choice of \(T_0 = T_{\nu ,0}\), we are limited to the parameter range

$$\begin{aligned} X_\mathrm{scan} \in [52,595], \end{aligned}$$
(30)

where the lower end corresponds to the smallest possible mass gap consistent with oscillation measurements, i.e., \(\sqrt{\varDelta m_{21}^2}=0.0087\) eV, and the upper limit is the mass cap discussed immediately above. Then, with appropriate post-processing, scenario A can be interpreted to represent six physically distinct cases, summarised in Table 1.

Table 1 All decay and free-streaming (FS) scenarios considered in this work, and their corresponding mass ordering (NO or IO), decay mass gap, and minimum allowed parent neutrino mass \(m_{\nu H}\). We use the convention \(m_3> m_2 > m_1\) for normal mass ordering and \(m_2> m_1 > m_3\) for inverted mass ordering, and a common value \(\varDelta m_{21}^2 = 7.50 \times 10^{-5}\). Note that numerically we do not distinguish between \(|\varDelta m_{31}^2|_\mathrm{N}\), \(|\varDelta m_{31}^2|_\mathrm{I}\), \(\varDelta m_{32}^2|_\mathrm{N} \equiv |\varDelta m_{31}^2|_\mathrm{N} - \varDelta m_{21}^2\), and \(|\varDelta m_{23}^2|_\mathrm{I} \equiv |\varDelta m_{31}^2|_\mathrm{I} + \varDelta m_{21}^2\): the difference between them is less than 5% [24,25,26]. This allows us to lump four physically distinct cases into one single scenario A1 and to use a single representative mass splitting \(|\varDelta m_{31}^2| = 2.5 \times 10^{-3}~\mathrm{eV}^2\). The same is in principle also true for two physical cases of scenario B. Nonetheless, we distinguish between B1 and B2, because the former comprises two decay modes, while the latter has only one: this makes the definitions of the particle lifetime differ by a factor of two between the two cases

Scenario B This scenario has \(N_\mathrm{int}=3.0440\) interacting neutrinos in the three-state to two-state approximation and no free-streaming neutrinos. It represents two physically distinct cases, as summarised in Table 1. Again, as with scenario A, to consistently interpret the fit outcomes requires that we limit the parent neutrino masses to \(m_{\nu H}\lesssim 0.1\) eV. This in turn limits the range of effective mass X to

$$\begin{aligned} X_\mathrm{scan} \in [298,595] \end{aligned}$$
(31)

for the choice of \(T_0 =T_{\nu ,0}\), where the lower bound corresponds to the mass gap \(\sqrt{|\varDelta m_{31}^2|} = 0.05\) eV.

Fig. 1
figure 1

CMB TT power spectra of ultra-relativistic invisible neutrino decay scenarios relative to the Planck 2018 best-fit \(\Lambda \)CDM cosmology. In each case, we vary only the effective mass parameter X (Eq. (26)) and effective transport rate Y (Eq. (27)), while keeping all other cosmological parameters fixed at their best-fit \(\Lambda \)CDM values. Top: Variations between scenarios A (solid line) and B (dashed line) for a range of Y values, with X fixed at \(X = 298\) or, equivalently, \(m_{\nu H}=0.05\) eV assuming \(T_0 = T_{\nu ,0}\). Bottom: Variations in scenario B with respect to the decaying neutrino mass \(m_{\nu H}\), with \(T_0=T_{\nu ,0}\) and at a fixed the effective transport rate \(Y=10^4\) s\(^{-1}\)

Figure 1 shows a selection of CMB TT power spectra computed using our implementation of the effective equations of motion (25) in class [28],Footnote 4 relative to the prediction for a reference \(\Lambda \)CDM cosmology. For a fixed effective mass parameter \(X = 298\) or, equivalently, \(m_{\nu H}=0.05\) eV assuming \(T_0 = T_{\nu ,0}\), the top panel contrasts the signatures of ultra-relativistic neutrino decay for a range of effective transport rates Y in both scenarios A and B. Observe that for the same value of Y, the signature of decay in scenario A is approximately a factor of 2/3 times that of scenario B. This is consistent with expectation, as only two out of three neutrinos interact in the former scenario, while all three neutrinos participate in the interaction in the latter case. We can therefore expect constraints on Y to be stronger in scenario B than in scenario A.

The bottom panel of Fig. 1 shows variations in the decay signature with respect to the effective mass parameter X, or \(m_{\nu H}\) at a fixed \(T_0\), for a fixed effective transport rate \(Y = 10^4~\mathrm{s}^{-1}\) in scenario B. Here, we see that the signature is weaker for larger values of X. This is because X appears only in the prefactor \({\mathscr {F}}(a X)\) in the effective equation of motion (25). Since \({\mathscr {F}}(a X)\) is a decreasing function of its argument, for a fixed evolution history a(t), larger values of X will generally lead to smaller prefactors and hence weaker signatures in the CMB anisotropies. This also means that we can expect constraints on Y to weaken with increasing X.

5.2 Data and analysis

We compute the CMB temperature and polarisation anisotro-pies for a large sample of parameter values in the parameter spaces defined in Eqs. (28) and (29) using our modified version of class. We test these outputs against the Planck 2018 data [1] using

  1. 1.

    The TTTEEE likelihood at \(\ell \ge 30\),

  2. 2.

    The Planck low-\(\ell \) temperature+polarisation likelihood, and

  3. 3.

    The Planck lensing likelihood,

a combination labelled “TTTEEE+lowE+lensing” in Ref. [1]. The parameter spaces of Eqs. (28) and (29) are sampled as Monte Carlo Markov Chains (MCMC) generated with the package MontePython-3 [42]. We construct credible intervals using GetDist [43].

The prior probability densities employed in the analysis are always flat and linear in each parameter directions, with prior boundaries as follows:

  • For the variables \(\omega _{b}, \omega _{c}, \theta _s, n_s\), and \(\ln (10^{10}A_s)\), the priors are unbounded at both ends.

  • For the optical depth to reionisation, we impose \(\tau _\mathrm{reio} \in [0.01, \infty )\).

  • For the effective transport rate, we use \(Y \in [0,\infty )\) in scenario A and \(Y \in [0,4000]\) in scenario B.

We generate in excess of 250, 000 samples in each case (and over a million samples in some cases). Convergence of the chains is defined by the Gelman-Rubin convergence criterion \(R-1 < 0.015\).

6 Results and discussions

Fig. 2
figure 2

1D marginalised posteriors for the effective transport rate Y (Eq. (27)) obtained from our MCMC analysis for a range of fixed effective mass values \(X=X_\mathrm{scan}\) in scenarios A (top panel) and B (bottom panel), whose parameter spaces are defined in Eqs. (28) and (29) respectively. The data used are the TTTEEE+lowE+lensing combination from the Planck 2018 data release [1]

Figure 2 shows the 1D marginal posteriors for the effective transport rate Y for a range of fixed effective mass parameter values \(X=X_\mathrm{scan}\) in both scenarios A (top panel) and B (bottom panel). The corresponding one-sided 95% credible limits on Y and the equivalent free-streaming redshift \(z_\mathrm{fs}\) in each case are summarised in Table 2. This redshift is defined via the condition

$$\begin{aligned} \varGamma _\mathrm{T}(z_\mathrm{fs})=H(z_\mathrm{fs}), \end{aligned}$$
(32)

where we have identified the transport rate \(\varGamma _\mathrm{T}\) with the \(\ell =2\) damping rate, i.e.,

$$\begin{aligned} \varGamma _\mathrm{T} \equiv a^5 Y {\mathscr {F}}(a X). \end{aligned}$$
(33)

Physically, \(z_\mathrm{fs}\) characterises the time at which the combined \((\nu _H,\nu _l,\phi )\)-system switches from a free-streaming gas to a tightly-coupled fluid [5].

Table 2 Marginalised one-sided 95% credible limits on the effective transport rate Y and the equivalent free-streaming redshift \(z_\mathrm{fs}\) (Eq. (32)) for fixed values of the effective mass parameter X (and their equivalent parent neutrino mass \(m_{\nu H}\) assuming \(T_0=T_{\nu ,0}\)) in scenarios A and B

Firstly, we observe that within each scenario, the constraint on Y weakens with increasing X and that for the same X, the scenario B constraint is typically an order of magnitude tighter than its scenario A counterpart. Both trends are consistent with expectations (see Sect. 5.1). Likewise, the constraints on the free-streaming redshift \(z_\mathrm{fs}\) are independent of the choice of X within each scenario—to better than 1% in scenario A and about 10% in scenario B—as one would expect from the definition (32).

Having established these \(z_\mathrm{fs}\) bounds also allows us to roughly map our bounds on Y in Table 2 to a simple expression

$$\begin{aligned} Y \lesssim 5.3 \times 10^4 /{\mathscr {F}}(X/2460)~\mathrm{s}^{-1} \end{aligned}$$
(34)

in scenario A, and

$$\begin{aligned} Y \lesssim 2 \times 10^3 /{\mathscr {F}}(X/1500)~\mathrm{s}^{-1} \end{aligned}$$
(35)

in scenario B. In the case of the latter, our new constraint \(z_\mathrm{fs} \lesssim 1500\) is clearly tighter than the previous limit \(z_\mathrm{fs} \lesssim 1965\) [7]. The difference is likely attributable to improved precision of the Planck 2018 CMB data over the 2013 data used in Ref. [7], although of course the different time dependences of the transport rates assumed in the analyses—\(\propto a^3\) in [7] vs our \(\propto a^5 {\mathscr {F}}(a X)\) in Eq. (33)—as well as the assumption of instantaneous recoupling in Ref. [7] could also have played a role.

Secondly, while we have quoted in Table 2 one-sided 95% credible limits on Y and \(z_\mathrm{fs}\), it is evident in Fig. 2 that in almost all cases the 1D posterior for Y peaks at a nonzero value. Indeed, Ref. [8] also found a similar feature, despite using a transport rate with a \(\propto a^3\) time dependence [as opposed to our \(\propto a^5 {\mathscr {F}}(a X)\) rate]. Reference [7] likewise found a shifted peak in their analysis of the Planck 2013 data for \(z_\mathrm{fs}>0\), notwithstanding the assumption of instantaneous recoupling.

However, in all cases the peak shift of either Y or \(z_\mathrm{fs}\) from zero is statistically insignificant (i.e., \(<2\sigma \)) and appears to be driven by CMB polarisation data [8]. We therefore dwell no further on the subject, except to note that the shifted Y peak is accompanied by a marginal increase in the inferred scalar spectral index \(n_s\)—a trend also observed in Ref. [8]—and consequently the small-scale RMS fluctuation \(\sigma _8\), in comparison with their \(Y=0\) counterparts derived from the same data combination. There are otherwise no strong degeneracies between Y and other base \(\Lambda \)CDM parameters. The interested reader may consult Appendix 1 for more summary statistics of our parameter estimation analyses. Here, we conclude in view of the degeneracy directions that adding baryon acoustics oscillations measurements to the analysis is unlikely to improve the bound on Y, while a low-redshift small-scale \(\sigma _8\) measurement could conceivably tighten the bound.Footnote 5

6.1 Constraints on the neutrino lifetime

Fig. 3
figure 3

Lower bounds on the neutrino lifetime converted from the 95% credible limits on the effective transport rate Y (Table 2) for the three single-decay-channel cases of scenario A and the two two-decay-channel cases of scenario B (see Table 1), as functions of the lightest neutrino mass \(m_1\) (NO) or \(m_3\) (IO). Orange (black) solid lines denote the revised bounds of this work for scenarios A (B), while the dotted lines represent the naïve constraints we would have obtained had we assumed a massless daughter neutrino \(m_{\nu l}=0\) in the phase space factor \(\Phi (m_{\nu l}/m_{\nu H})\) (Eq. (17)). The interpretation of these bounds in conjunction with Table 1 is unambiguous in scenario A. In scenario B1 the bounds apply to \(\nu _3\), while in B2 they apply to both \(\nu _1\) and \(\nu _2\) assuming equal lifetimes. We additionally show currents bounds from SN1987A [22] and from the IceCube 2015 data analysed in Ref. [23] (“IC 2015”) assuming the flavour composition of a full pion decay scenario, i.e., \((f_e,f_{\mu },f_{\tau })=(1,2,0)\), at the source, as well as the projected reach of future IceCube runs (“IceCube 8 yr”) and a combination of future neutrino telescopes (including Baikal-GVD, KM3NeT, P-ONE, TAMBO, and IceCube-Gen2), the latter of which also assumes improved oscillations parameter measurements from JUNO, DUNE, and Hyper-Kamiokande. See Ref. [23] for details. Also displayed in red in the left panel is the invisible decay scenario proposed in Ref. [44] that would solve the tension between cascade and track data sets of IceCube

To reinterpret our marginalised one-sided 95% credible limits on the effective transport rate Y (Table 2) as limits on the rest-frame neutrino lifetime \(\tau _0\), we first interpolate the constraints on Y as a function of the effective mass parameter X (or, equivalently, the mass of the decaying neutrino \(m_{\nu H}\)) within each scenario. Then, using Eq. (27) these upper limits on Y constraints can be converted into lower bounds on \(\tau _0 = 1/\varGamma _\mathrm{dec}^0\) for any neutrino mass spectrum desired. Alternatively, we can use Eq. (27) to directly translate the approximate limits of Eqs. (34) and (35) into

$$\begin{aligned} \begin{aligned} \tau _0& > rsim 1.2 \times 10^6 \, {\mathscr {F}} \left[ 0.12\left( \frac{m_{\nu H}}{0.05 \mathrm{eV}} \right) \right] \\&\qquad \quad \times \Phi \left( \frac{m_{\nu l}}{m_{\nu H}}\right) \, \left( \frac{m_{\nu H}}{0.05~\mathrm{eV}}\right) ^5~\mathrm{s} \end{aligned} \end{aligned}$$
(36)

for scenario A, and

$$\begin{aligned} \begin{aligned} \tau _0& > rsim 6.6 \times 10^7 \, {\mathscr {F}} \left[ 0.2 \left( \frac{m_{\nu H}}{0.05 \mathrm{eV}} \right) \right] \\&\qquad \quad \times \Phi \left( \frac{m_{\nu l}}{m_{\nu H}}\right) \, \left( \frac{m_{\nu H}}{0.05~\mathrm{eV}}\right) ^5~\mathrm{s} \end{aligned} \end{aligned}$$
(37)

for scenario B2.

Note that in the case of B1, the lifetime of \(\nu _3\) needs to be computed from \(\tau _0 = 1/(2 \varGamma _\mathrm{dec}^0)\), because of its two distinct decay modes. See Table 1.

Figure 3 shows the limits on \(\tau _0\) constructed from interpolation for the physical cases listed in Table 1, as functions of the lightest neutrino masses \(m_1\) (NO) or \(m_3\) (IO). Also displayed in the same plots are the naïve constraints on \(\tau _0\) we would have obtained had we set the mass of the daughter neutrino to zero in the phase space factor \(\Phi \). For reference we also show current constraints from SN1987A [22] and IceCube [23], as well as projected constraints from future measurements by IceCube and other neutrino telescopes [23].Footnote 6

Relative to our previous estimate of \(\tau _0 > rsim (4 \times 10^5 \rightarrow 4 \times 10^6 (m_{\nu H}/0.05 \mathrm{eV})^5\) s [9], the new naïve (i.e., \(m_{\nu l}=0\)) constraints are comparable to the high end at \(m_{\nu H} \simeq 0.05\) eV in scenario A (one decay channel) and a factor of 50 tighter than the high end in scenario B (two decay channels) at the same mass point. These differences can be attributed to (i) different prefactors in the relation between the effective transport rate and the rest-frame decay rate (Eq. (27)), and (ii) the fact that the old estimate had been obtained from a rough conversion of the \(z_\mathrm{fs}\) bound of Ref. [7]—itself derived from older CMB data—using the \(\varGamma _\mathrm{T} (z_\mathrm{fs})=H(z_\mathrm{fs})\) condition. Furthermore, because the constraints on Y weaken with \(m_{\nu }\) [due to \({\mathscr {F}}(a m_\nu /T_0)\) in Eq. (25)], the scaling of the naïve \(\tau _0\) bounds with \(m_{\nu H}\) in fact follows more closely \(\propto m_{\nu H}^4\) than \(\propto m_{\nu H}^5\) in the mass range of interest (as can be seen most clearly in the middle panel of Fig. 3).

However, once a finite \(m_{\nu l}\) has been accounted for via the new phase space factor \(\Phi \) in accordance with oscillation measurements, the lifetime bounds on the decaying neutrino relax significantly in all physical cases. Specifically, if the parent-daughter neutrino mass gap corresponds to the atmospheric mass splitting (i.e., scenarios A1 and B), then irrespective of the neutrino mass ordering we see that the bound on \(\tau _0\) at any one mass point weakens by up to a factor of 50 relative to the naïve bound for a lightest neutrino mass \(m_1\) (NO) or \(m_3\) (IO) not exceeding 0.1 eV.

Even more dramatic relaxations can be seen in those cases in which the mass gap is determined by the solar mass splitting, i.e., scenarios A2 and A3. Here, the \(\tau _0\) constraints weaken by four to five orders relative to the naïve bounds in the case of IO (i.e., scenario A3) across the whole \(m_3\) range of interest. In the case of NO (i.e., scenario A2), the relaxation also reaches five orders of magnitude, although for a smaller range of \(m_1\).

Interestingly, inclusion of the new phase space factor \(\Phi \) also translates into revised lifetime constraints that are remarkably independent of the neutrino mass. We have seen previously in Sect. 4.1 that \(\Phi \) asymptotes to \((1/3) (\varDelta m_\nu ^2/m_{\nu H}^2)^2\) in the limit \(\varDelta m_\nu ^2 \equiv m_{\nu H}^2 - m_{\nu l}^2 \ll m_{\nu H}^2\). Since the naïve bounds on \(\tau _0\) scale effectively as \(m_{\nu H}^4\) in the mass range of interest, we see immediately that including \(\Phi \) must reduce the dependence of the final \(\tau _0\) bounds to \(\propto (\varDelta m_\nu ^2)^2\), which, depending on the neutrino pair participating in the decay, is always fixed by either the solar or the atmospheric squared neutrino mass splitting. Indeed, we find in scenarios A2 and A3 a fairly mass-independent revised constraint of \(\tau _0 > rsim (400 \rightarrow 500)\) s, while in scenario A1 the bound is some \((|\varDelta m_{31}^2|/\varDelta m_{21}^2)^2 \sim 10^3\) times tighter at \(\tau _0 > rsim (6 \rightarrow 10)\times 10^5\) s. The two scenario B bounds are tighter still at \(\tau ^0 > rsim (2 \rightarrow 6) \times 10^7\) s, but apply to two decay channels with a near-common atmospheric mass gap.

In the case of A2 and A3, we note while bearing in mind the caveats of Footnote 6 that our revised CMB constraints on \(\tau _0\) are merely a factor of a few more stringent than current bounds derived in Ref. [23] from the 2015 IceCube measurements of the flavour composition of TeV–PeV-energy astrophysical neutrinos [45], assuming the flavour ratio of a full pion decay scenario, i.e., \((f_e,f_{\mu },f_{\tau })=(1,2,0)\), at the source. Indeed, according to the projections of Ref. [23], the sensitivity to the neutrino lifetime of IceCube alone based on a combination 8 years of starting events and through-going track will likely already supersede our CMB constraints entirely in scenario A3 and partially in scenario A2 in the mass range presented. Together with improved measurements of the neutrino mixing parameters by JUNO [46], DUNE [47], and Hyper-Kamiokande [48], even the A2 CMB constraints could become be entirely overtaken in the next two decades [23] by data collected at an array of future neutrino telescopes such as Baikal-GVD [49], KM3NeT [50], P-ONE [51], TAMBO [52], and IceCube-Gen2 [53]. In light of this possibility, it would be interesting to see if the neutrino telescope constraints on and projected sensitivities to \(\tau _0\) would vary significantly if derived under model assumptions more consistent with our scenarios A2 and A3. Likewise, it remains to be determined if future CMB measurements on a comparable timeline such as CMB-S4 [54] will be competitive in this space.

Lastly, one may be tempted to use Eqs. (36) and (37) to extrapolate our lifetime limits to parent neutrino mass values beyond our scan range, i.e., \(m_{\nu H} > rsim 0.1\) eV. Leaving aside that such large masses might run into problems with CMB bounds, it is interesting to note the function \({\mathscr {F}}(x)\) naturally shuts down the anisotropy loss when the condition of ultra-relativistic decay cannot be satisfied, which translates in turn into a rapid deterioration of the lifetime bounds at large values of \(m_{\nu H}\). In fact, for \(m_{\nu H} > rsim 1\) eV in scenario A and \(m_{\nu H} > rsim 0.5\) eV in scenario B, we expect the lifetime bounds to deteriorate as \(\sim \exp (-x)/x^{0.5}\), where \(x=x_A \simeq 0.12 \times (m_{\nu H}/0.05~\mathrm{eV})\) and \(x=x_B \simeq 0.2 \times (m_{\nu H}/0.05~\mathrm{eV})\) for scenarios A and B respectively. In other words, at such large \(m_{\nu H}\) values, there are no neutrino lifetime constraints from free-streaming requirements.

7 Conclusions

We have considered in this work invisible neutrino decay \(\nu _H \rightarrow \nu _l+\phi \) and its inverse process in the ultra-relativistic limit, and derived the effective Boltzmann hierarchy and associated \(\ell \)th anisotropy loss rate (or, equivalently, the transport rate \(\varGamma _\mathrm{T}\)) for the coupled \((\nu _H,\nu _l,\phi )\)-system relevant for CMB anisotropy computations. Relative to our previous work [9] which assumed both decay products to be massless, we have in this work allowed the daughter neutrino \(\nu _l\) to remain massive.

We find that a nonzero \(m_{\nu l}\) introduces a new phase space suppression factor \(\Phi (m_{\nu l}/m_{\nu H})\) (Eq. (17)) in the transport rate, which in the limit of a small squared mass gap, \(\varDelta m_\nu ^2 \equiv m_{\nu H}^2-m_{\nu l}^2 \ll m_{\nu H}^2\), asymptotes to \((1/3) (\varDelta m_\nu ^2/m_{\nu H}^2)^2\). Thus, the transport rate \(\varGamma _\mathrm{T}\) is related to the rest-frame lifetime \(\tau _0\) of the parent neutrino \(\nu _H\) roughly as \(\varGamma _\mathrm{T} \sim (\varDelta m_\nu ^2/m_{\nu H}^2)^2 (m_{\nu H}/E_\nu )^5(1/\tau _0)\), where the factor \((m_{\nu H}/E_\nu )^5\) was established previously in Ref. [9]. Note that while we have derived our transport rates rigorously from first principles under a reasonable set of assumptions, it is possible to attribute the various components that make up the expressions to physical quantities such as the opening angle and the momenta of the decay products. We have dedicated Appendix 1 to interpreting our transport rate in terms of a random walk using these physical quantities, as well as explaining why the old random walk argument of Ref. [4] is incomplete and hence yields a transport rate that is too large. We emphasise that the new phase space factor \(\Phi \) is in addition to the phase space suppression already inherent in the lifetime \(\tau _0\), and traces its origin to the momenta carried by the decay products in the decay rest frame.

Implementing our effective Boltzmann hierarchy (9) and transport rates (25) into the CMB code class [28], we have performed a series of MCMC fits to the CMB temperature and polarisation anisotropy measurements by the Planck mission (2018 data release) and determined a new set of constraints on the neutrino lifetime \(\tau _0\) in a manner consistent with the neutrino mass splittings established by oscillation experiments. These new constraints are presented in Fig. 3 as functions of the lightest neutrino mass, i.e., \(m_1\) in the normal mass ordering (\(m_3> m_2>m_1\)) and \(m_3\) in the inverted mass ordering \((m_2> m_1>m_3)\). For a parent neutrino mass up to about \(m_{\nu H} \simeq 0.1\) eV, we find that the presence of the new phase space factor weakens the CMB constraint on its lifetime by as much as a factor of 50 if the daughter neutrino mass \(m_{\nu l}\) is separated from \(m_{\nu H}\) by the atmospheric mass gap, and by up to five orders of magnitude if separated by the solar mass gap, in comparison with naïve limits that assume \(m_{\nu l}=0\).

The revised constraints are (i) \(\tau ^0 > rsim (6 \rightarrow 10) \times 10^5\) s and \(\tau ^0 > rsim (400 \rightarrow 500)\) s if only one neutrino decays to a daughter neutrino separated by, respectively, the atmospheric and the solar mass gap, and (ii) \(\tau ^0 > rsim (2 \rightarrow 6) \times 10^7\) s in the case of two decay channels with one near-common atmospheric mass gap. Relative to existing constraints derived from other probes, these CMB limits are, despite significant revision, still globally the most stringent. However, we note that the relaxation of the bounds has also opened up a swath of parameter space likely within the projected reach of IceCube and other telescopes in the next two decades [23]. This may in turn call for closer scrutiny of the neutrino anisotropy loss modelling upon which the derivation of the CMB constraints of this work is based. For example, where cosmological and astrophysical constraints meet, a more consistent treatment of neutrinos masses and quantum statistics may be required to accurately delineate the viable decay parameter space. We defer this investigation to a future work.

Finally, we remind the reader that while we have consistently accounted for the daughter neutrino mass according to experimentally measured mass splittings, we have assumed \(m_{\phi }=0\) in the determination of the lifetime constraints. This assumption need not be true, and a choice of \(m_\phi \) satisfying \(m_{\nu H} > m_\phi \gg m_{\nu l}\) in the regions (i) \(m_1, m_3 \lesssim 10^{-2}\) eV in scenarios A1 and B and (ii) \(m_1 \lesssim 2 \times 10^{-3}\) eV in A2 may in fact also relax the cosmological constraints on \(\tau _0\) presented in Fig. 3. (Phase space considerations limit the range of permissible \(m_\phi \) values to \(m_\phi < m_{\nu H}- m_{\nu l}\) and hence their effects on the \(\tau _0\) constraints where a finite \(m_{\nu l}\) already produces a significant suppression, except for a few finely-tuned values in the vicinity of \(m_\phi = \varDelta m_\nu \).) Such a relaxation may be similarly modelled using the phase space factor \(\Phi \) (Eq. (17)) with \(m_{\nu l} \rightarrow m_{\phi }\), although the resulting limits on \(\tau _0\) would depend strongly on the choice of mass for the hypothetical \(\phi \) particle. We leave the exploration of \(m_\phi \)-dependent neutrino lifetime constraints as an exercise to the interested reader.