1 Introduction

It is a viable possibility that isospin-singlet neutrinos mix with active flavors \(\nu _e,\nu _\mu ,\nu _\tau \) after electroweak symmetry breaking. Notably, such a mixing is inherent in the Type-I seesaw model [1,2,3,4], and although the mixing angle is undetectably small in most of the parameter space, it can be sizable when there is a cancellation in the seesaw mass. To see this, suppose, for example, there are three isospin-singlet neutrinos with a Majorana mass \(M_N\), having a Dirac mass term \(m_D\) with active neutrinos. Since the tiny neutrino mass is derived as \(-m_D M_N^{-1} m_D^T\), one can express \(m_D\) as [5]

$$\begin{aligned} m_D&= U_{PMNS} \, i\left( \! \begin{array}{c@{\quad }c@{\quad }c} \sqrt{m_1} &{} 0 &{} 0 \\ 0 &{} \sqrt{m_2} &{} 0 \\ 0 &{} 0 &{} \sqrt{m_3} \end{array} \!\right) \, R_3(\alpha ,\beta ,\gamma ) \, \sqrt{M_N}, \end{aligned}$$
(1)

where \(U_{PMNS}\) denotes the Pontecorvo–Maki–Nakagawa–Sakata (PMNS) mixing matrix [6,7,8], \(m_1,m_2,m_3\) are the tiny neutrino mass, and \(R_3\) is a 3-dimensional rotation matrix with complex-valued rotation angles \(\alpha ,\beta ,\gamma \). Unless \(m_1,m_2,m_3\) are degenerate, one can arbitrarily enhance \(m_D\) and the mixing angles between isospin-singlet and active neutrinos by taking large imaginary values for \(\alpha ,\beta ,\gamma \). In this case, a cancellation is taking place in the components of \(-m_D^T M_N^{-1} m_D\). Large mixing between isospin-singlet and active neutrinos is also realized in extensions of the Type-I seesaw model, such as the inverse seesaw model [9, 10].

In the presence of isospin-singlet neutrino mixing, transitions among active flavors require for their consistent description extra parameters beyond the standard oscillation parameters, the three mixing angles \(\theta _{12},\theta _{23},\theta _{13}\), one CP-violating phase \(\delta _{CP}\) and two mass differences \(\Delta m_{12}^2,\Delta m_{13}^2\). Tension in fitting neutrino oscillation data with the standard parameters would thus be evidence for isospin-singlet neutrino mixing. If isospin-singlet neutrinos are “heavy”, by which we mean that their mass is much above the neutrino beam energy, they do not contribute to oscillations and simply alter the mixing matrix \(U_{PMNS}\) into a non-unitary matrix in the following fashion:

$$\begin{aligned} U_{PMNS}&\rightarrow \left( \begin{array}{c@{\quad }c@{\quad }c} \alpha _{11} &{} 0 &{} 0 \\ \alpha _{21} &{} \alpha _{22} &{} 0 \\ \alpha _{31} &{} \alpha _{32} &{} \alpha _{33} \end{array} \right) U_{PMNS} \end{aligned}$$
(2)

where \(\alpha _{ii}\)’s are real and \(\alpha _{ij}\)’s \((i\ne j)\) are complex numbers.

One purpose of this paper is to estimate the largest possible significance of heavy isospin-singlet neutrino mixing in the Tokai-to-Hyper-Kamiokande (T2HK) [11] and the Tokai-to-Hyper-Kamiokande-to-Korea (T2HKK) [12] (for early proposals, see Refs. [13,14,15,16,17,18,19,20,21,22,23]) experiments, as well as an experiment in which a new detector at Oki Islands is added to the T2HK (for early proposals on a detector at Oki, see Refs. [22,23,24]). Emphasis is on how the significance varies with CP-violating phases in the PMNS and the non-unitary mixing matrices and to what extent the sensitivity deteriorates with these phases. The secondary goal of this paper is to pursue the possibility that the neutrino mass hierarchy and the standard CP-violating phase in the PMNS matrix \(\delta _{CP}\) are measured in the presence of heavy neutrino mixing by fitting neutrino data with the standard oscillation parameters only, namely, without assuming heavy neutrino mixing. This is achievable if data allow us to discern contributions from the standard oscillation and those from the non-unitary mixing matrix. Such measurement is of practical importance when a hint of heavy neutrino mixing were discovered but were not conclusive. The mass hierarchy and \(\delta _{CP}\) measurement in the presence of heavy neutrino mixing has been investigated in a general context [25] and for the T2K, T2HK, NO\(\nu \)A and DUNE experiments [26,27,28,29].

The T2HK and the T2HKK are proposed extensions of the on-going T2K experiment [31], utilizing an off-axis \(\nu _\mu +\bar{\nu }_{\mu }\) beam delivered from J-PARC. In the T2HK, there are two water Cerenkov detectors, each with 187 kton fiducial volume, located solely at Kamioka, where the baseline length is \(L=295\) km and the off-axis-angle is \(2.5^\circ \). In one plan of the T2HKK, one water Cerenkov detector with 187 kton fiducial volume is placed at Kamioka and the same detector is also at a site in Korea where the baseline length is \(L\simeq 1000\) km and the off-axis-angle is \(1^\circ \). We also consider a plan of placing, in addition to the T2HK, a small 10 kton water Cerenkov detector at Oki Islands, where the baseline length is \(L=653\) km and the off-axis-angle is \(1.0^\circ \).

The merits of the above long baseline experiments for the search for heavy neutrino mixing are as follows: (i) In disappearance experiments, the effects of heavy neutrino mixing as parametrized by Eq. (2) cancel between measurements at the near and far detectors, while the T2K and its extensions are primarily appearance experiments. (ii) \(\nu _\mu \rightarrow \nu _e\) (\(\bar{\nu }_\mu \rightarrow \bar{\nu }_e\)) transition amplitude arising from the non-unitary mixing matrix Eq. (2) interferes with the standard oscillation amplitude, enhancing the impact of non-unitary mixing on the transition probability. The merit (ii), however, bears two subtleties:

  • CP-violating phases in the PMNS and the non-unitary mixing matrices possibly lead to suppression of the interference.

  • Non-unitary mixing \(\alpha _{21}\) in Eq. (2) can mimic matter effects and the effect of the standard phase \(\delta _{CP}\), which distorts the measurement of the mass hierarchy and \(\delta _{CP}\).

These issues become transparent by expressing the transition amplitude in the leading order of \(\sin \theta _{13}\), \(\Delta m_{21}^2\), \(\alpha _{21}\) and matter effects as

$$\begin{aligned}&i\langle \nu _e \vert e^{-i\int _0^L \mathrm{d}x \, H(x)} \vert \nu _\mu \rangle = 2 \, e^{-i\Delta m_{31}^2 L/(4E)} \nonumber \\&\quad \times \sin \left( \frac{\Delta m_{31}^2 L}{4E} \right) e^{-i\delta _{CP}}\sin \theta _{13}\sin \theta _{23} \nonumber \\&\quad +\frac{\Delta m_{21}^2 L}{4E}\sin (2\theta _{12})\cos \theta _{23} + i\alpha _{21}^*\nonumber \\&\quad -iV_{cc} \, L \, \left\{ \, \frac{\Delta m_{31}^2 L}{4E}e^{-i\delta _{CP}}\sin \theta _{13}\sin \theta _{23} \right. \nonumber \\&\quad \left. + \frac{1}{2}\frac{\Delta m_{21}^2 L}{4E}\sin (2\theta _{12})\cos \theta _{23} \, \right\} , \end{aligned}$$
(3)
$$\begin{aligned}&\quad i\langle \bar{\nu }_e \vert e^{-i\int _0^L \mathrm{d}x \, H(x)} \vert \bar{\nu }_\mu \rangle = 2 \, e^{-i\Delta m_{31}^2 L/(4E)}\nonumber \\&\quad \times \sin \left( \frac{\Delta m_{31}^2 L}{4E} \right) e^{i\delta _{CP}}\sin \theta _{13}\sin \theta _{23} \nonumber \\&\quad + \frac{\Delta m_{21}^2 L}{4E}\sin (2\theta _{12})\cos \theta _{23} + i\alpha _{21} \nonumber \\&\quad +iV_{cc} \, L \, \left\{ \, \frac{\Delta m_{31}^2 L}{4E}e^{i\delta _{CP}}\sin \theta _{13}\sin \theta _{23} \right. \nonumber \\&\quad \left. + \frac{1}{2}\frac{\Delta m_{21}^2 L}{4E}\sin (2\theta _{12})\cos \theta _{23} \, \right\} , \end{aligned}$$
(4)

where E denotes the neutrino energy and L the baseline length, and \(V_{cc}\) is the potential due to charged current interaction inside matter.

We notice that the following interference terms:

$$\begin{aligned}&\vert \langle \nu _e \vert \, e^{-i\int _0^L \mathrm{d}x \, H(x)} \, \vert \nu _\mu \rangle \vert ^2\nonumber \\&\quad \supset -4\sin \left( \frac{\Delta m_{31}^2 L}{4E} \right) \sin \theta _{13}\sin \theta _{23}\nonumber \\&\qquad \times \;{\mathrm{Im}}\left( \alpha _{21} e^{-i\Delta m_{31}^2 L/(4E)-i\delta _{CP}}\right) \nonumber \\&\qquad + 2\frac{\Delta m_{21}^2 L}{4E}\sin (2\theta _{12})\cos \theta _{23} \, {\mathrm{Im}}(\alpha _{21}), \end{aligned}$$
(5)
$$\begin{aligned}&\vert \langle \bar{\nu }_e \vert \, e^{-i\int _0^L \mathrm{d}x \, H(x)} \, \vert \bar{\nu }_\mu \rangle \vert ^2\nonumber \\&\quad \supset 4\sin \left( \frac{\Delta m_{31}^2 L}{4E} \right) \sin \theta _{13}\sin \theta _{23} \nonumber \\&\qquad \times \; {\mathrm{Im}}\left( \alpha _{21} e^{i\Delta m_{31}^2 L/(4E)-i\delta _{CP}}\right) \nonumber \\&\qquad - 2\frac{\Delta m_{21}^2 L}{4E}\sin (2\theta _{12})\cos \theta _{23} \, {\mathrm{Im}}(\alpha _{21}), \end{aligned}$$
(6)

are simultaneously suppressed at the first oscillation peak, \(\Delta m_{31}^2 L/(4E)=\pi /2\), if Arg(\(\alpha _{21})\simeq \delta _{CP}\pm \pi /2\) holds accidentally (note the relation \(\Delta m_{21}^2 L/(4E) \simeq 0.1\cdot 2\sin \theta _{13}\) at the first oscillation peak). This suppression is mitigated if the energy distribution is not concentrated at the first oscillation peak. We thus infer that when Arg(\(\alpha _{21})\simeq \delta _{CP}\pm \pi /2\) holds, the T2HKK with a Korea detector placed at \(1^\circ \) off-axis angle exhibits better sensitivity to \(\alpha _{21}\) than the T2HK, and that the extension of the T2HK with an Oki detector gives qualitative improvement. Of course, these anticipations depend on statistics, namely, competition with the loss of statistics in the T2HKK, or the size of an Oki detector.

We observe in Eqs. (3) and (4) that when \(\delta _{CP}\simeq 0,\pi \), the real part of non-unitary mixing Re(\(\alpha _{21}\)) plays exactly the same role as matter effects, which afflicts the measurement of the mass hierarchy because it is probed through interference with matter effects. However, when measurements with different baselines and hence different \(V_{cc}L\)’s are combined, one resolves degeneracy between the non-unitary mixing and matter effects, thereby resurrecting sensitivity to the mass hierarchy. We thus expect that the mass hierarchy measurement in the T2HK is subject to strong influence from heavy neutrino mixing, whereas the T2HKK maintains sensitivity to the mass hierarchy even with heavy neutrino mixing. Also, the addition of a small detector at Oki to the T2HK leads to qualitative improvement in the mass hierarchy measurement.

Equations (3), (4) tell us that when \(\delta _{CP}\simeq \)Arg\((\alpha _{21})\), non-unitary mixing \(\alpha _{21}\) and the first term in the first line of Eqs. (3) and (4), which is proportional to \(e^{i\delta _{CP}}\), interfere maximally. In such a case, the presence of \(\alpha _{21}\) affects the measurement of \(\delta _{CP}\) and causes a discrepancy between the true \(\delta _{CP}\) and the value measured by fitting data with the standard oscillation parameters only. This effect is mitigated if the baseline is longer, because more information as regards the energy dependence of the standard oscillation amplitude helps distinguish it from non-unitary mixing \(\alpha _{21}\). We thus expect that the discrepancy is smaller in the T2HKK than in the T2HK.

In the main body of this paper, we demonstrate, through a simulation, the following:

  1. (a)

    The sensitivity to heavy neutrino mixing is suppressed for Arg(\(\alpha _{21})\simeq \delta _{CP}\pm \pi /2\). Contrary to our anticipations, the T2HK shows better sensitivity than the T2HKK for any values of CP-violating phases, simply because of higher statistics. Also, adding a small detector at Oki to the T2HK does not make qualitative difference.

  2. (b)

    In the presence of heavy neutrino mixing, the wrong mass hierarchy can be favored in the T2HK. The T2HKK maintains sensitivity to the mass hierarchy and measures it correctly despite heavy neutrino mixing. The extension of the T2HK with a small detector at Oki exhibits revived sensitivity to the mass hierarchy, even if statistics at Oki is quite subdominant compared to that at Kamioka.

  3. (c)

    In the presence of heavy neutrino mixing, the true \(\delta _{CP}\) and the value obtained by fitting data with the standard oscillation parameters can have a discrepancy which is of the same order as \(1\sigma \) resolution of the \(\delta _{CP}\) measurement. The discrepancy tends to be larger in the T2HK than in the T2HKK. Adding a small Oki detector to the T2HK does not make any difference.

Our most prominent finding is related to (b), which is that adding a small 10 kton detector at Oki to the T2HK leads to revived sensitivity to the mass hierarchy.

This paper is organized as follows: In Sect. 2, we present the formalism for incorporating the effects of heavy isospin-singlet neutrino mixing with a non-unitary mixing matrix. In Sect. 3, we estimate the largest possible significance of heavy neutrino mixing in the T2HK, the T2HKK and the extension of the T2HK with a small detector at OKi, and we further study the mass hierarchy and \(\delta _{CP}\) measurement in the presence of heavy neutrino mixing. Here, we describe our benchmark model, procedures for simulating the long baseline experiments, and analysis with a \(\chi ^2\) fitting. Sect. 4 summarizes the paper.

2 Formalism for incorporating heavy isospin-singlet neutrino mixing

Suppose we have \(N-3\) isospin-singlet neutrinos \(\nu _{s_f} \, (f=4,5,\ldots ,N)\) in addition to the three active flavors \(\nu _e,\nu _\mu ,\nu _\tau \). Accordingly, there are N mass eigenstates \(\nu _i \, (i=1,2,\ldots ,N)\) belonging to mass eigenvalues \(m_i^2\) and the neutrino mixing is expressed as

$$\begin{aligned} \vert \nu _\alpha \rangle&= \sum _{i=1}^N \, U_{\alpha i} \vert \nu _i \rangle , \quad \alpha =e,\mu ,\tau ,s_f, \end{aligned}$$
(7)

where \(U_{\alpha i}\) is a \(N\times N\) unitary matrix. In this paper, the \(N-3\) isospin-singlet neutrinos are assumed to be heavier than about 10 GeV.

In long baseline neutrino oscillation experiments, only the light mass eigenstates \(\nu _1,\nu _2,\nu _3\) are produced and propagate. Therefore, the probability for a neutrino of active flavor \(\alpha \) with energy E to transition to active flavor \(\beta \) after traveling a distance L inside matter is

$$\begin{aligned} P(\nu _\alpha \rightarrow \nu _\beta )&= \left| \langle \nu _\beta \vert U_{3\times 3} \, \exp \left( -i\int _0^L \mathrm{d}x \, H(x) \right) \, (U^\dagger )_{3\times 3} \vert \nu _\alpha \rangle \right| ^2 \nonumber \\&\quad \times \; (\alpha ,\beta =e,\mu ,\tau ), \end{aligned}$$
(8)
$$\begin{aligned} H(x)&= \mathrm{diag}\left( \ -\sqrt{E^2-m_1^2}, \, -\sqrt{E^2-m_2^2}, \, -\sqrt{E^2-m_3^2} \ \right) \end{aligned}$$
(9)
$$\begin{aligned}&\quad + \left[ \, U^\dagger \, \mathrm{diag}\left( \ V_{nc}(x)+V_{cc}(x), \, V_{nc}(x), \, \right. \right. \nonumber \\&\quad \times \;\left. \left. V_{nc}(x), \, 0,\ldots , \, 0 \ \right) \, U \, \right] _{3\times 3}, \end{aligned}$$
(10)

where \(U_{3\times 3}\) denotes the \(\alpha =e,\mu ,\tau \) and \(i=1,2,3\) part of the mixing matrix U in Eq. (7), and \((U^\dagger )_{3\times 3}\) denotes the same part for \(U^\dagger \). Note that \(U_{3\times 3}\ne (U^\dagger )_{3\times 3}\) in general. The factor \(-\sqrt{E^2-m_i^2}\) \((i=1,2,3)\) is derived from the fact that a phase shift due to time evolution, \(E\mathrm{d}t\), is equivalent to \(-\sqrt{E^2-m_i^2}\mathrm{d}x\) for the mass eigenstate with mass \(m_i\). \([...]_{3\times 3}\) in Eq. (10) signifies that one extracts the \(i,j=1,2,3\) part of the \(N\times N\) matrix. \(V_{nc}(x)\) is the potential of the neutral current interaction inside matter divided by the neutrino velocity, and \(V_{cc}(x)\) is that of the charged current interaction, which are evaluated to be

$$\begin{aligned} V_{nc}(x) \, L&= -\frac{1}{\sqrt{2}}G_F \, n_n \, \nonumber \\ L&= -Y_n \, 0.193 \left( \frac{\rho (x)}{\mathrm{g/cm}^3}\right) \left( \frac{L}{1000~\mathrm{km}}\right) ,\nonumber \\ V_{cc}(x) \, L&= \sqrt{2}G_F \, n_e \, \nonumber \\ L&= Y_e \, 0.385 \left( \frac{\rho (x)}{{\mathrm{g/cm}}^3}\right) \left( \frac{L}{1000~{\mathrm{km}}}\right) , \end{aligned}$$
(11)

where \(n_n\) and \(n_e\), respectively, denote the neutron and electron number density, \(Y_n\) and \(Y_e\) are, respectively, the number of neutrons and electrons per one nucleon, which are typically 1 / 2, and \(\rho (x)\) is the matter density.

It is convenient to parametrize U in terms of mixing angles, \(\theta _{ij}\), and CP-violating phases, \(\phi _{ij}\), \((i<j, \, i,j=1,2,\ldots ,N)\) Footnote 1 as [30, 33] (for the case with \(N=6\), see also Refs. [34, 35])

$$\begin{aligned} U&= \Omega _{N-1,N} \, \Omega _{N-2,N} \, \cdots \, \Omega _{1,N} \, \Omega _{N-2,N-1} \, \Omega _{N-3,N-1} \nonumber \\&\quad \cdots \,\Omega _{1,N-1} \,\Omega _{N-3,N-2} \, \cdots \, \Omega _{1,2},\nonumber \\ (\Omega _{n,m})_{kk}&= 1 \ \ {\mathrm{for}} \ k\ne n,m, \ \ \ \nonumber \\ (\Omega _{n,m})_{nn}&= (\Omega _{n,m})_{mm}=\cos \theta _{nm}, \ \ \ \nonumber \\ (\Omega _{n,m})_{nm}&= \sin \theta _{nm} \, e^{-i\phi _{nm}}, \ \ \ \nonumber \\ (\Omega _{n,m})_{mn}&= -\sin \theta _{nm} \, e^{i\phi _{nm}}, \quad (\Omega _{n,m})_{kl}=0 \ \ {\mathrm{otherwise}}. \end{aligned}$$
(12)

Note that the PMNS mixing matrix \(U_{PMNS}\) corresponds to \((\Omega _{2,3}\Omega _{1,3}\Omega _{1,2})\) with phases \(\phi _{12}\), \(\phi _{23}\) set to 0 by the phase redefinition of \(\vert \nu _1 \rangle , \ \vert \nu _2 \rangle \). We can thus express U in terms of the PMNS mixing matrix as

$$\begin{aligned} U&= \left( \begin{array}{c@{\quad }c} N_{NU} U_{PMNS} &{} S_{3\times N-3} \\ W_{N-3\times 3} &{} T_{N-3\times N-3} \end{array} \right) , \nonumber \\ N_{NU}&= \left( \begin{array}{c@{\quad }c@{\quad }c} \alpha _{11} &{} 0 &{} 0 \\ \alpha _{21} &{} \alpha _{22} &{} 0 \\ \alpha _{31} &{} \alpha _{32} &{} \alpha _{33} \end{array} \right) , \end{aligned}$$
(13)

where \(S_{3\times N-3}\), \(W_{N-3\times 3}\) and \(T_{N-3\times N-3}\) denote \(3\times (N-3)\), \((N-3)\times 3\) and \((N-3)\times (N-3)\) matrices, respectively, \(\alpha _{11}\), \(\alpha _{22}\) and \(\alpha _{33}\) are real numbers and \(\alpha _{21}\), \(\alpha _{31}\) and \(\alpha _{32}\) are complex ones.

Since the mixing between isospin-singlet and active neutrinos is already constrained to be tiny, we make the approximation that we ignore terms in the second order of the mixing angles between active and isospin-singlet neutrinos. This gives \((U^\dagger )_{3\times 3}=(U_{3\times 3})^\dagger \) and

$$\begin{aligned}&\left[ \, U^\dagger \, {\mathrm{diag}}\left( \ V_{nc}(x)+V_{cc}(x), \, V_{nc}(x), \, V_{nc}(x), \, 0,\ldots , \, 0 \ \right) \, U \, \right] _{3\times 3}\\&\quad = (U_{3\times 3})^\dagger \, {\mathrm{diag}} \left( \ V_{nc}(x)+V_{cc}(x), \, V_{nc}(x), \, V_{nc}(x) \ \right) \, U_{3\times 3} \ . \end{aligned}$$

We then arrive at the following formula for the transition probabilities:

$$\begin{aligned} P(\nu _\alpha \rightarrow \nu _\beta )&= \left| \langle \nu _\beta \vert \, N_{NU}U_{PMNS} \,\exp \left( -i\int _0^L \mathrm{d}x \, H(x) \right) \right. \nonumber \\&\quad \left. \times \, U_{PMNS}^\dagger N_{NU}^\dagger \, \vert \nu _\alpha \rangle \phantom {\frac{1}{2}}\right| ^2 (\alpha ,\beta =e,\mu ,\tau ),\nonumber \\ H(x)&= \frac{1}{2E}{\mathrm{diag}}\left( 0, \, \Delta m_{21}^2, \, \Delta m_{31}^2 \right) +U_{PMNS}^\dagger N_{NU}^\dagger \nonumber \\&\quad \times \;\left( \begin{array}{c@{\quad }c@{\quad }c} V_{nc}(x)+V_{cc}(x) &{} 0&{} 0 \\ 0 &{} V_{nc}(x)&{} 0 \\ 0 &{} 0&{} V_{nc}(x) \end{array} \right) \nonumber \\&\quad \times \;N_{NU}U_{PMNS}, \end{aligned}$$
(14)

where we have made a further approximation with \(E^2 \gg m_1^2,m_2^2,m_3^2\) and discarded terms proportional to the unit matrix in H(x). The transition probabilities for antineutrinos, \(P(\bar{\nu }_\alpha \rightarrow \bar{\nu }_\beta )\), are obtained by changing \(\delta _{CP} \rightarrow -\delta _{CP}\) in \(U_{PMNS}\), \(\alpha _{ij} \rightarrow \alpha _{ij}^*\) in \(N_{NU}\), and \(V_{cc}(x) \rightarrow -V_{cc}(x), \ V_{nc}(x) \rightarrow -V_{nc}(x)\) in Eq. (14).

3 Significance of heavy neutrino mixing and its effects on the mass hierarchy and \(\delta _{CP}\) measurement

We estimate the largest possible significance of heavy neutrino mixing and its effects on the mass hierarchy and \(\delta _{CP}\) measurement, through the following simulation: first we introduce a benchmark model where \(\alpha _{11},\alpha _{22},\vert \alpha _{21}\vert \) in Eq. (2) are set at the current experimental bounds, in order to evaluate maximal impact of heavy neutrino mixing. Neutrino detection in the T2HK, the T2HKK, and the extension of the T2HK with a small detector at Oki is simulated based on the above benchmark, with Particle Data Group values employed for the standard parameters \(\theta _{12},\theta _{23},\theta _{13},\Delta m^2_{21},\vert \Delta m^2_{31}\vert \). Since no conclusive data on the mass hierarchy and \(\delta _{CP}\) exist, and the phase of \(\alpha _{21}\) is undetermined in the benchmark, we repeat the simulation for both mass hierarchies and for various values of \(\delta _{CP}\) and Arg\((\alpha _{21})\). To assess merits of the extension of the T2HK with an Oki detector, we additionally consider, for comparative study, an experiment where the detector at Oki is instead placed at Kamioka. Finally, we perform a \(\chi ^2\) fit of the simulation results under the assumption that no heavy neutrino mixing is present, namely, \(N_{NU}=I_3\) in Eq. (13). The minimum of \(\chi ^2\) is the square of the significance of heavy neutrino mixing. The mass hierarchy and the value of \(\delta _{CP}\) with which \(\chi ^2\) is minimized correspond to their measured values obtained without assuming heavy neutrino mixing.

3.1 Benchmark model

Our analysis is based on the following benchmark model: We employ the parametrization of Eq. (13). Since \(\nu _\mu \rightarrow \nu _e\) process is most affected by \(\alpha _{11}, \, \alpha _{22}\) and \(\alpha _{21}\), we further set

$$\begin{aligned} N_{NU}&= \left( \begin{array}{c@{\quad }c@{\quad }c} \alpha _{11} &{} 0 &{} 0 \\ \alpha _{21} &{} \alpha _{22} &{} 0 \\ 0 &{} 0 &{} 1 \end{array} \right) . \end{aligned}$$
(15)

At present, the most severe constraint on \(\alpha _{11}, \, \alpha _{22}\) and \(\alpha _{21}\) derives from experimental tests of lepton flavor universality and a mathematical inequality among \(\alpha _{11}, \, \alpha _{22}\) and \(\alpha _{21}\). These are enumerated below:

  • One test of lepton flavor universality has been done in \(\pi ^\pm \) decays. When the non-unitary matrix Eq. (15) enters into the neutrino mixing Eq. (7), the ratio of the branching fractions becomes

    $$\begin{aligned} \frac{\Gamma (\pi ^\pm \rightarrow e^\pm \nu )}{\Gamma (\pi ^\pm \rightarrow \mu ^\pm \nu )}&= \frac{m_e^2}{m_\mu ^2}\frac{\left( 1-m_e^2/m_{\pi ^\pm }^2\right) ^2}{\left( 1-m_\mu ^2/m_{\pi ^\pm }^2\right) ^2} \nonumber \\&\quad \times \;(1+\epsilon _{\mathrm{SM}}) \, \frac{\alpha _{11}^2}{\alpha _{22}^2+\vert \alpha _{21}\vert ^2}, \end{aligned}$$
    (16)

    where \(\epsilon _{{\mathrm{SM}}}\) indicates a Standard Model correction taking into account final state radiation. The experimental value and the Standard Model prediction are, respectively, given by [36,37,38, 42, 43]

    $$\begin{aligned}&\frac{\Gamma (\pi ^\pm \rightarrow e^\pm \nu )}{\Gamma (\pi ^\pm \rightarrow \mu ^\pm \nu )}\vert _{\mathrm{experimental}} \nonumber \\&\quad = (1.2354\pm 0.0002)\times 10^{-4},\nonumber \\&\frac{m_e^2}{m_\mu ^2}\frac{\left( 1-m_e^2/m_{\pi ^\pm }^2\right) ^2}{\left( 1-m_\mu ^2/m_{\pi ^\pm }^2\right) ^2} (1+\epsilon _{\mathrm{SM}})\nonumber \\&\quad = (1.230\pm 0.004)\times 10^{-4}, \end{aligned}$$
    (17)

    from which we obtain the following bound:

    $$\begin{aligned} \frac{\alpha _{11}^2}{\alpha _{22}^2+\vert \alpha _{21}\vert ^2}&=0.9956\pm 0.0032. \end{aligned}$$
    (18)
  • Another test of lepton flavor universality comes from unitarity test of the Cabibbo–Kobayashi–Maskawa (CKM) matrix. In the presence of the non-unitary matrix Eq. (15), the Fermi constant measured in the muon decay becomes \(G_\mu = G_F \, \alpha _{11}\sqrt{\alpha _{22}^2+\vert \alpha _{21}\vert ^2}\), whereas the Fermi constant measured in decays of hadrons involving \(e\nu \) becomes \(G_\beta = G_F \, \alpha _{11}\). Deviation of \(G_\beta /G_\mu \) from 1 mimics non-unitarity in the first row of the CKM matrix if \(\vert V_{ud}\vert \) is measured in \(\beta \)-decays of nuclei and \(\vert V_{us}\vert \) is measured in \(K \rightarrow \pi e \nu \) decays, since \(G_\mu \) is regarded as the true Fermi constant in these measurements while it is \(G_\beta \) that is involved in the processes, and \(\vert V_{ub}\vert ^2\) is known to be smaller than experimental error of \(\vert V_{us}\vert ^2\). We therefore have

    $$\begin{aligned}&\frac{1}{\alpha _{22}^2+\vert \alpha _{21}\vert ^2} = \left( \frac{G_\beta }{G_\mu }\right) ^2 \simeq \vert V_{ud}\vert ^2 + \vert V_{us}\vert ^2 \nonumber \\&\quad (\text {measured in }K \rightarrow \pi e \nu \text { decays}) + \vert V_{ub}\vert ^2. \end{aligned}$$
    (19)

    Measurements of superallowed \(\beta \)-decays of nuclei yield the following experimental value [39]:

    $$\begin{aligned} \vert V_{ud} \vert&= 0.97417\pm 0.00021. \end{aligned}$$
    (20)

    Combining measurements of the branching ratios of only three processes \(K^\pm \rightarrow \pi ^0 e \nu \), \(K_L \rightarrow \pi ^\pm e \nu \) and \(K_S \rightarrow \pi ^\pm e \nu \) [39], we obtain the following experimental value for \(\vert V_{us}\vert \) times the kaon form factor, \(f_+(0)\):

    $$\begin{aligned} \vert V_{us} \vert \ f_+(0)&= 0.21633\pm 0.00055. \end{aligned}$$
    (21)

    The average of lattice calculations of the kaon form factor \(f_+(0)\) in Refs. [40, 41] is given by

    $$\begin{aligned} f_+(0)&= 0.9677\pm 0.0037. \end{aligned}$$
    (22)

    From Eqs. (20), (21) and (22), we obtain the following bound:

    $$\begin{aligned} \frac{1}{\alpha _{22}^2+\vert \alpha _{21}\vert ^2}&=0.99898\pm 0.00061. \end{aligned}$$
    (23)
  • The mathematical inequality among \(\alpha _{11}, \, \alpha _{22}\) and \(\alpha _{21}\) stems from the fact that the non-unitary mixing matrix \(N_{NU}\) is derived from the product of mixing matrices as in Eq. (12). The inequality reads [42,43,44]

    $$\begin{aligned} 4(1-\alpha _{11})(1-\alpha _{22})&> \vert \alpha _{21}\vert ^2. \end{aligned}$$
    (24)

We combine the 3\(\sigma \) experimental bounds in Eqs. (18), (23) and the mathematical inequality Eq. (24), and we obtain the following 3\(\sigma \) boundFootnote 2:

$$\begin{aligned} \alpha _{11}&>0.992573, \quad \alpha _{22} > 0.999589, \nonumber \\&\vert \alpha _{21}\vert < 0.003494 \quad {\mathrm{at }} \; 3\sigma . \end{aligned}$$
(25)

To estimate the maximum significance of heavy neutrino mixing, we set the values of \(\alpha _{11}, \, \alpha _{22}\) and \(\vert \alpha _{21}\vert \) at the above 3\(\sigma \) bounds and therefore employ a benchmark model with the following non-unitary mixing matrix \(N_{NU}\):

$$\begin{aligned} N_{NU}&= \left( \begin{array}{c@{\quad }cc@{\quad }c} \alpha _{11} &{} 0 &{} 0 \\ \alpha _{21} &{} \alpha _{22} &{} 0 \\ 0 &{} 0 &{} 1 \end{array} \right) , \nonumber \\ \alpha _{11}&= 0.992573, \quad \alpha _{22}=0.999589, \nonumber \\ \vert \alpha _{21}\vert&=0.003494. \end{aligned}$$
(26)

Since no experimental bound is reported on the phase of \(\alpha _{21}\), we leave it as a free parameter.

We comment in passing on constraints from past neutrino oscillation experiments. Among those experiments, the NOMAD experiment [45], a short baseline experiment searching for \(\nu _\mu \rightarrow \nu _e\) and \(\bar{\nu }_\mu \rightarrow \bar{\nu }_e\) transitions, gives the most severe bound, which is translated into the following:

$$\begin{aligned} 2\vert \alpha _{21}\vert ^2 \alpha _{11}^2&\le 1.4\times 10^{-3} \quad \text { at } \; 90\% \; \text { confidence level}. \end{aligned}$$
(27)

Our benchmark Eq. (26) satisfies the above bound.

Finally, we mention constraints from lepton flavor violating processes. These constraints can always be evaded by tuning heavy neutrino mass \(m_4,m_5,m_6,\ldots ,m_N\) around the W boson mass, because the amplitude of a lepton flavor violating process involving charged leptons \(\ell _\alpha \) and \(\ell _\beta \), \(A_{\alpha \beta }\), depends on \(N_{NU}\) through the following combination:

$$\begin{aligned} A_{\alpha \beta }&\propto \sum _{i=1}^3 \, (N_{NU}U_{PMNS})_{\beta i} \, F\left( \frac{m_i}{M_W}\right) \, \left( U_{PMNS}^\dagger N_{NU}^\dagger \right) _{i\alpha } \nonumber \\&\quad +\;\sum _{j=4}^N \, (S_{3\times N-3})_{\beta j} \, F\left( \frac{m_j}{M_W}\right) \, \left( S_{3\times N-3}^\dagger \right) _{j\alpha }\nonumber \\&\simeq \left( N_{NU}N_{NU}^\dagger \right) _{\beta \alpha } \, F(0)\nonumber \\&\quad +\;\sum _{j=4}^N \, (S_{3\times N-3})_{\beta j} \, F\left( \frac{m_j}{M_W}\right) \, \left( S_{3\times N-3}^\dagger \right) _{j\alpha }, \end{aligned}$$
(28)

where \(m_i,m_j\) are neutrino mass eigenvalues, \(M_W\) is the W boson mass, and F(x) is a function depending on the process, and in the second line, we make the approximation with \(m_1,m_2,m_3 \ll M_W\) as \(m_1,m_2,m_3\) correspond to the tiny neutrino mass. It is evident that one can negate the term \((N_{NU}N_{NU}^\dagger )_{\beta \alpha }F(0)\) by taking appropriate O(1) values for \(m_j/M_W\) \((j=4,5,6,\ldots ,)\). We therefore neglect any constraints from lepton flavor violating processes.

In Appendix C, we consider another benchmark that satisfies the experimental bounds of Eqs. (18) and (23) at \(2\sigma \) level as well as the mathematical inequality Eq. (24).

3.2 Simulation

We simulate the neutrino propagation and detection based on the benchmark model Eq. (26). The oscillation probability is calculated numerically with the formula Eq. (14) without making any approximation. For the standard oscillation parameters, we employ the central values of \(\theta _{12}, \, \theta _{13}, \, \theta _{23}, \, \Delta m^{2}_{21}, \, \vert \Delta m^{2}_{32} \vert \) found in the Particle Data Group [39]. As the CP-violating phase \(\delta _{CP}\) and the sign of \(\Delta m_{32}^2\) have not been measured conclusively, and the phase of \(\alpha _{21}\), Arg\((\alpha _{21})\), is undetermined in the benchmark, we repeat the simulation for various values of \(\delta _{CP}\) and Arg\((\alpha _{21})\) and for both cases with \(\Delta m_{32}^2>0\) and \(\Delta m_{32}^2<0\). The values of the parameters used in the simulation are shown in Table 1.

Table 1 Parameters used in our simulation

We assume the baseline length L, fiducial volume, matter density along the baseline and neutrino beam off-axis angle [22, 23, 32] for the water Cerenkov detectors used in the T2HK, the T2HKK, the extension of the T2HK with a detector at Oki (T2HK+Oki), and the extension with the same detector at Kamioka (T2HK+Kami) as Table 2. The matter density is approximated to be uniform [32, 46].

Table 2 Parameters assumed for the detectors used in the experiments

We assume that J-PARC operates with 1.3 MW beam power for 10 years in the neutrino mode and for another 10 years in the antineutrino mode, delivering \(27\times 10^{21}\) proton-on-target (POT) flux of neutrino-focusing beam and \(27\times 10^{21}\) POT flux of antineutrino-focusing beam, as Table 3.

The energy distributions of \(\nu _\mu \) and \(\bar{\nu }_\mu \) in neutrino-focusing and antineutrino-focusing beams at each site are calculated based on Ref. [47]. \(\nu _e\) and \(\bar{\nu }_e\) components in the beam are ignored in this analysis. In Appendix A, we present the energy distributions.

The cross sections for charged current quasi-elastic scattering between a neutrino and a proton, \(\nu _\ell \, n \rightarrow \ell ^- \, p\), and that between an antineutrino and a neutron, \(\bar{\nu }_\ell \, p \rightarrow \ell ^+ \, n\), (p and n denote proton and neutron, respectively, and \(\ell \) denotes e or \(\mu \)) are quoted from Ref. [48]. In Appendix B, we present the cross sections.

We assume the ideal measurement of \(\nu _e\) and \(\bar{\nu }_e\) and do not take into account acceptance and efficiency factors and finite energy resolution. Signals coming from processes other than charged current quasi-elastic scattering, as well as backgrounds due to neutral current quasi-elastic scatterings and other processes are not considered in the analysis.

Table 3 Flux of neutrino-focusing beam and antineutrino-focusing beam from J-PARC assumed

The total numbers of neutrino events at the detectors at Kamioka, Oki and in Korea in the T2HK, T2HKK, T2HK+Oki, and T2HK+Kami with the detector size of Table 2 and the beam flux of Table 3 are tabulated in Table 4.

Table 4 The total number of neutrino events at each detector, with the detector size of Table 2 and the beam flux of Table 3. The numbers for the neutrino-focusing beam and for the antineutrino-focusing beam are shown separately
Fig. 1
figure 1

The minimum of \(\chi ^2(\Pi )\) Eq. (32), \(\min \chi ^2\), on the plane of the standard CP-violating phase \(\delta _{CP}\) and the new phase in the non-unitary mixing matrix Arg(\(\alpha _{21}\)). The benchmark with Eq. (26) is assumed, and the true mass hierarchy is normal. The upper-left, upper-right and lower-right subplots correspond to the T2HK, the T2HKK, and the plan of the T2HK plus a 10 kton water Cerenkov detector at Oki, respectively. For comparative study, we show in the lower-left a subplot for a plan of the T2HK plus a 10 kton water Cerenkov detector at Kamioka. \(\min \chi ^2=1,\,4,\,9\) on the red solid, blue dashed, and purple dot-dashed contours, respectively

Fig. 2
figure 2

The difference between the minima (with respect to \(\delta _{CP}\) only) of \(\chi ^2(\Pi )\) for the wrong and true mass hierarchies, \(\{\min \chi ^2(\text {wrong H})-\min \chi ^2(\text {true H})\}\). The benchmark with Eq. (26) is assumed, and the true mass hierarchy is normal. The upper-left, upper-right and lower-right subplots correspond to the T2HK, the T2HKK, and the plan of the T2HK plus a 10 kton water Cerenkov detector at Oki, respectively. For comparative study, we show in the lower-left a subplot for a plan of the T2HK plus a 10 kton water Cerenkov detector at Kamioka. \(\{\min \chi ^2(\text {wrong H})-\min \chi ^2(\text {true H})\}=4,\,16,\,36\) on the red solid, blue dashed, and purple dot-dashed contours, respectively, and \(\{\min \chi ^2(\text {wrong H})-\min \chi ^2(\text {true H})\}=18^2,\,19^2,\,20^2\) on the green, brown, and orange dot-dashed contours, respectively. The black-filled regions are where the wrong mass hierarchy is favored over the true mass hierarchy, i.e., \(\min \chi ^2(\text {wrong H})-\min \chi ^2(\text {true H})<0\)

We calculate the numbers of \(\nu _e\) and \(\bar{\nu }_e\) events in 0.05 GeV bins of the neutrino energy E in the range \(0.4~\hbox {GeV}\le E \le 3~\hbox {GeV}\) for various values of \(\delta _{CP}\) and Arg\((\alpha _{21})\) and for both cases with \(\Delta m_{31}^2>0\) and \(\Delta m_{31}^2<0\). We recall that in real experiments, the neutrino flux is determined by the measurement of \(\nu _\mu \) and \(\bar{\nu }_\mu \) at the near detector, for which the standard oscillation is negligible but the non-unitary mixing reduces the \(\nu _\mu \) and \(\bar{\nu }_\mu \) flux by the following amount:

$$\begin{aligned} P(\nu _\mu \rightarrow \nu _\mu )_{\text {near detector}}&= P(\bar{\nu }_\mu \rightarrow \bar{\nu }_\mu )_{{\mathrm{near \ detector}}} \nonumber \\&= (N_{NU}N_{NU}^\dagger )_{\mu \mu } = \vert \alpha _{21}\vert ^2+\alpha _{22}^2. \end{aligned}$$
(29)

In the analysis, we consider a virtual neutrino flux that is larger than the one found in Ref. [47] by \(1/(\vert \alpha _{21}\vert ^2+\alpha _{22}^2)\), to compensate the reduction of Eq. (29). We therefore compute the number of events with a neutrino-focusing beam, \(N_{e,i}\), and that with an antineutrino-focusing beam, \(\tilde{N}_{e,i}\), as

$$\begin{aligned} {\mathrm{for}} \ i&=1,2,3,\ldots ,52, \nonumber \\ N_{e,i}&= \left( \, {\mathrm{Number \ of \ }} \nu _e \text { and } \bar{\nu }_e \ {\mathrm{with}} \ (0.4+i \cdot 0.05)\,{\mathrm{GeV}}\right. \nonumber \\&\left. \ge E \ge (0.4+(i-1) \cdot 0.05)\,{\mathrm{GeV}} \, \right) \nonumber \\&= \int _{0.4+(i-1) \cdot 0.05\,{\mathrm{GeV}}}^{0.4+i \cdot 0.05\,{\mathrm{GeV}}} \mathrm{d}E\nonumber \\&\quad \times \left\{ \, \frac{\Phi _{\nu _\mu }(E)}{\vert \alpha _{21}\vert ^2+\alpha _{22}^2} \ P(\nu _\mu \rightarrow \nu _e; \, E) \ N_n \ \sigma (\nu _e n \rightarrow e^- p; \, E)\right. \nonumber \\&\quad \left. + \frac{\Phi _{\bar{\nu }_\mu }(E)}{\vert \alpha _{21}\vert ^2 +\alpha _{22}^2} \ P(\bar{\nu }_\mu \rightarrow \bar{\nu }_e; \, E) \ N_p \ \sigma (\bar{\nu }_e p \rightarrow e^+ n; \, E) \, \right\} ,\nonumber \\ \tilde{N}_{e,i}&= ( \, \Phi _{\nu _\mu }\rightarrow \tilde{\Phi }_{\nu _\mu } \ {\mathrm{and}} \ \Phi _{\bar{\nu }_\mu }\rightarrow \tilde{\Phi }_{\bar{\nu }_\mu } \ {\mathrm{in \ above}} \, ), \end{aligned}$$
(30)

where \(\Phi _{\nu _\mu }(E)\) and \(\Phi _{\bar{\nu }_\mu }(E)\), respectively, denote the neutrino and antineutrino flux per energy in a neutrino-focusing beam, and \(\tilde{\Phi }_{\nu _\mu }(E)\) and \(\tilde{\Phi }_{\bar{\nu }_\mu }(E)\) are those in an antineutrino-focusing beam. P denotes the transition probability Eq. (14), with E dependence made explicit. \(N_n\) and \(N_p\) are, respectively, the number of neutrons and protons in a water Cerenkov detector. \(\sigma \) denotes the cross sections for \(\nu _\ell n \rightarrow \ell ^- p\) and \(\bar{\nu }_\ell p \rightarrow \ell ^+ n\) processes.

Fig. 3
figure 3

The difference between the true value of \(\delta _{CP}\), and the value that minimizes \(\chi ^2(\Pi )\) of Eq. (32) with true mass hierarchy. The benchmark with Eq. (26) is assumed, and the true mass hierarchy is normal. The upper-left, upper-right and lower-right subplots correspond to the T2HK, the T2HKK, and the plan of the T2HK plus a 10 kton water Cerenkov detector at Oki, respectively. For comparative study, we show in the lower-left a subplot for a plan of the T2HK plus a 10 kton water Cerenkov detector at Kamioka. \(\vert \Delta \delta _{CP}\vert =0, \ 0.05{\mathrm{rad}}, \ 0.1{\mathrm{rad}}\), and \(0.15{\mathrm{rad}}\) on the black solid, red sold, blue dashed, and purple dot-dashed contours, respectively

Fig. 4
figure 4

The minimum of \(\chi ^2(\Pi )\) Eq. (32), \(\min \chi ^2\). The benchmark with Eq. (26) is assumed, and the true mass hierarchy is inverted. The upper-left, upper-right and lower-right subplots correspond to the T2HK, the T2HKK, and the plan of the T2HK plus a 10 kton water Cerenkov detector at Oki, respectively. For comparative study, we show in the lower-left a subplot for a plan of the T2HK plus a 10 kton water Cerenkov detector at Kamioka. \(\min \chi ^2=1,\,4,\,9\) on the red solid, blue dashed, and purple dot-dashed contours, respectively

Fig. 5
figure 5

The difference between the minima (with respect to \(\delta _{CP}\) only) of \(\chi ^2(\Pi )\) for the wrong and true mass hierarchy, \(\{\min \chi ^2(\text {wrong H})-\min \chi ^2(\text {true H})\}\), which is the square of the significance of rejecting the wrong hierarchy. The benchmark with Eq. (26) is assumed, and the true mass hierarchy is inverted. The upper-left, upper-right and lower-right subplots correspond to the T2HK, the T2HKK, and the plan of the T2HK plus a 10 kton water Cerenkov detector at Oki, respectively. For comparative study, we show in the lower-left a subplot for a plan of the T2HK plus a 10 kton water Cerenkov detector at Kamioka. \(\{\min \chi ^2(\text {wrong H})-\min \chi ^2(\text {true H})\}=4,\,16,\,36\) on the red solid, blue dashed, and purple dot-dashed contours, respectively, and \(\{\min \chi ^2(\text {wrong H})-\min \chi ^2(\text {true H})\}=18^2,\,19^2,\,20^2\) on the green, brown, and orange dot-dashed contours, respectively. The black-filled regions are where the wrong mass hierarchy is favored over the true mass hierarchy, i.e., \(\min \chi ^2(\text {wrong H})-\min \chi ^2(\text {true H})<0\)

Fig. 6
figure 6

The difference between the true value of \(\delta _{CP}\), and the value that minimizes \(\chi ^2(\Pi )\) of Eq. (32) with the true mass hierarchy. The benchmark with Eq. (26) is assumed, and the true mass hierarchy is inverted. The upper-left, upper-right and lower-right subplots correspond to the T2HK, the T2HKK, and the plan of the T2HK plus a 10 kton water Cerenkov detector at Oki, respectively. For comparative study, we show in the lower-left a subplot for a plan of the T2HK plus a 10 kton water Cerenkov detector at Kamioka. \(\vert \Delta \delta _{CP}\vert =0, \ 0.05{\mathrm{rad}}, \ 0.1{\mathrm{rad}}\), and \(0.15{\mathrm{rad}}\) on the black solid, red sold, blue dashed, and purple dot-dashed contours, respectively

3.3 \(\chi ^2\) analysis

We fit the numbers of events in the bins with neutrino-focusing and antineutrino-focusing beams, detected at Kamioka/Korea/Oki, \(N_{e,i,{\mathrm{site}}}\) and \(\tilde{N}_{e,i,{\mathrm{site}}}\) \((i=1,2,\ldots ,52)\) (site\(=\)Kamioka, Korea, Oki), under the assumption that no heavy neutrino mixing is present, that is, \(N_{NU}=I_3\). This is performed by minimizing \(\chi ^2(\Pi )\),

$$\begin{aligned} \chi ^2(\Pi )&= \chi ^2_{\mathrm{statistical}}(\Pi ) + \chi ^2_{\mathrm{parametrical}}(\Pi ) \nonumber \\&\quad +\;\chi ^2_{\mathrm{systematic}}, \nonumber \\ \chi ^2_{\mathrm{statistical}}(\Pi )&= \sum _{\mathrm{site}} \, \sum _{i=1}^{52} \, \left\{ \ \frac{\left( N_{e,i,\mathrm{site}} - N_{e,i,\mathrm{site}}^{\mathrm{unitary}}(\Pi ) \right) ^2}{N_{e,i,\mathrm{site}}} \right. \nonumber \\&\quad \left. +\;\frac{\left( \tilde{N}_{e,i,\mathrm{site}} - \tilde{N}_{e,i,\mathrm{site}}^{\mathrm{unitary}}(\Pi ) \right) ^2}{\tilde{N}_{e,i,\mathrm{site}}} \ \right\} , \end{aligned}$$
(31)

with respect to the standard oscillation parameters \(\Pi =(\theta _{12},\theta _{13},\theta _{23},\Delta m^2_{21},\Delta m^2_{32},\delta _{CP})\). Here, \(\chi ^2_{\mathrm{statistical}}\) represents statistical uncertainty, \(N_{e,i,\mathrm{site}}^{\mathrm{unitary}}(\Pi )\) is the number of \(\nu _e\) and \(\bar{\nu }_e\) events in a bin with a neutrino-focusing beam when \(N_{NU}=I_3\), calculated as a function of the standard oscillation parameters \(\Pi \), and \(\tilde{N}_{e,i,\mathrm{site}}^{\mathrm{unitary}}(\Pi )\) is the corresponding number with an antineutrino-focusing beam.

The following simplification is made for \(\chi ^2_{\mathrm{parametrical}}(\Pi )\): we note that \(\theta _{12},\theta _{13},\theta _{23},\Delta m^2_{21},\vert \Delta m^2_{32}\vert \) can be accurately measured in disappearance experiments with \(\bar{\nu }_e \rightarrow \bar{\nu }_e\), \(\nu _\mu \rightarrow \nu _\mu \) and \(\bar{\nu }_\mu \rightarrow \bar{\nu }_\mu \), which are not influenced by heavy neutrino mixing as parametrized by \(N_{NU}\), because \(N_{NU}\) equally affects the neutrino flux of the same flavor at the near and far detectors. (\(\cos \delta _{CP}\) and the sign of \(\Delta m^2_{32}\) are also possibly measured in disappearance experiments, but the sensitivity is limited.) We therefore assume that the true values of \(\theta _{12},\theta _{13},\theta _{23},\Delta m^2_{21},\vert \Delta m^2_{32}\vert \) are known prior to the experiments considered in this paper, and accordingly we fix \(\theta _{12},\theta _{13},\theta _{23},\Delta m^2_{21},\vert \Delta m^2_{32}\vert \) in \(\chi ^2(\Pi )\) at the values used in the simulation and set \(\chi ^2_{\mathrm{parametrical}}(\Pi )=0\).

Finally, we have \(\chi ^2_{\mathrm{systematic}}=0\) in the current analysis, because no acceptance, efficiency, energy resolution and background events are considered.

To summarize, \(\chi ^2(\Pi )\) is approximated by

$$\begin{aligned}&\chi ^2(\Pi ) = \chi ^2_{\mathrm{statistical}}\left( \, \delta _{CP}, \, \mathrm{sgn}\left( \Delta m^2_{32}\right) \, \right) \nonumber \\&\quad \text { with } \; \theta _{12},\theta _{13},\theta _{23},\Delta m^2_{21},\vert \Delta m^2_{32}\vert {\mathrm{\ given \ in \ Table}}~1. \end{aligned}$$
(32)

We numerically evaluate the minimum of \(\chi ^2(\Pi )\) Eq. (32), which corresponds to the square of the significance for heavy neutrino mixing. In Fig. 1, we show contour plots of the minimum of \(\chi ^2(\Pi )\),

$$\begin{aligned} \min \chi ^2&\equiv \min \left\{ \ \min _{\delta _{CP}}\chi ^2(\delta _{CP}, \ \Delta m^2_{32}>0), \ \right. \nonumber \\&\qquad \left. \min _{\delta _{CP}}\chi ^2\left( \delta _{CP}, \ \Delta m^2_{32}<0\right) \ \right\} , \end{aligned}$$
(33)

on the plane spanned by the CP-violating phase in the PMNS matrix \(\delta _{CP}\) and the phase in the non-unitary mixing matrix Arg(\(\alpha _{21}\)), when the true mass hierarchy is normal. The four subplots correspond to the T2HK, the T2HKK, the extension of the T2HK with a detector at OKi, and that with the same detector at Kamioka.

We study the impact of heavy neutrino mixing on the mass hierarchy measurement that is performed by fitting data without incorporating heavy neutrino mixing. For this purpose, we evaluate the difference between the minimum (with respect to \(\delta _{CP}\) only) of \(\chi ^2(\Pi )\) for the wrong hierarchy and that for the true mass hierarchy, which corresponds to the square of the significance of rejecting the wrong mass hierarchy. We show in Fig. 2 contour plots of the difference,

$$\begin{aligned}&\min \chi ^2(\text {wrong H})-\min \chi ^2(\text {true H})\nonumber \\&\quad \equiv \min _{\delta _{CP}}\chi ^2\left( \delta _{CP}, \ {\mathrm{wrong}} \ \mathrm{sgn}\left( \Delta m^2_{32}\right) \right) \nonumber \\&\qquad -\;\min _{\delta _{CP}}\chi ^2\left( \delta _{CP}, \ {\mathrm{true}} \ {\mathrm{sgn}}\left( \Delta m^2_{32}\right) \right) , \end{aligned}$$
(34)

on the same plane when the true mass hierarchy is normal. The black-filled regions in the left subplots are where the wrong mass hierarchy is favored over the true mass hierarchy, i.e., \(\min \chi ^2(\text {wrong H})-\min \chi ^2(\text {true H})<0\).

We study the impact of heavy neutrino mixing on the value of \(\delta _{CP}\) that minimizes \(\chi ^2(\Pi )\). To assess the impact, we compute the difference between the true value of \(\delta _{CP}\), and the value that minimizes \(\chi ^2(\Pi )\) of Eq. (32) with the true mass hierarchy. We show in Fig. 3 contour plots of the difference,

$$\begin{aligned} \Delta \delta _{CP}&\equiv \left( \delta _{CP} \ {\mathrm{that \ minimizes}} \ \chi ^2\left( \delta _{CP}, \ {\mathrm{true}} \ \mathrm{sgn}\left( \Delta m^2_{32}\right) \right) \right. \nonumber \\&\quad -\;({\mathrm{true}} \ \delta _{CP}) \end{aligned}$$
(35)

on the same plane when the true mass hierarchy is normal.

Figures 456 are the corresponding figures when the true mass hierarchy is inverted.

The following observations are made:

  1. (A)

    In Figs. 1 and 4, we see that the significance of heavy neutrino mixing can be above \(3\sigma \) for Arg(\(\alpha _{21})\sim \delta _{CP}\), but decreases considerably for Arg(\(\alpha _{21})\simeq \delta _{CP}\pm \pi /2\) in all the experiments, confirming that the sensitivity deteriorates due to suppression of interference between the standard oscillation amplitude and non-unitary mixing \(\alpha _{21}\) at the first oscillation peak, as can be read from Eqs. (5) and (6). The regions of depleted sensitivity is slightly off the lines Arg(\(\alpha _{21})=\delta _{CP}\pm \pi /2\), because of the influence of the second, subleading interference term in Eqs. (5) and (6). We also find that the significance does not ameliorate in the T2HKK and in the extension of the T2HK with a 10 kton detector at Oki. Rather, the T2HKK shows inferior performance due to the loss of statistics, which is seen in Table 4.

  2. (B)

    In Figs. 2 and 5, we observe that in the T2HK, the wrong mass hierarchy is favored when \((\delta _{CP}, \, {\mathrm{Arg}}(\alpha _{21}))\sim (0, \, 0), \, (\pi , \, \pi )\) holds in the normal hierarchy case, and when \((\delta _{CP}, \, \mathrm{Arg}(\alpha _{21}))\sim (\pi , \, 0)\) holds in the inverted hierarchy case. To estimate the impact of non-unitary mixing on the mass hierarchy measurement in the T2HK, we show in Fig. 7 the values of \(\{\min \chi ^2(\text {wrong H})-\min \chi ^2(\text {true H})\}\) when non-unitarity is absent, i.e. \(N_{NU}=I\), calculated with the same procedure as Figs. 2 and 5.

Fig. 7
figure 7

The difference between the minima (with respect to \(\delta _{CP}\) only) of \(\chi ^2(\Pi )\) for the wrong and true mass hierarchy, \(\{\min \chi ^2(\text {wrong H})-\min \chi ^2(\text {true H})\}\), when the neutrino mixing is unitary (no heavy neutrino mixing is present), in the T2HK. The results are presented as functions of the true value of \(\delta _{CP}\). The black solid line corresponds to the case when the true hierarchy is normal, and the red dashed line to the case when the true hierarchy is inverted

Comparison of Figs. 2 and 5 with Fig. 7 indicates that, for \(\delta _{CP}\sim 0, \pi \), the T2HK has limited sensitivity to the mass hierarchy in the unitary case, and the presence of non-unitary mixing may aggravate it to the extent that the wrong mass hierarchy is favored. Such impact of non-unitary mixing is because \(\alpha _{21}\) cancels matter effects in the neutrino transition probability Eq. (3) when \((\delta _{CP}, \, {\mathrm{Arg}}(\alpha _{21}))\sim (0, \, 0), \, (\pi , \, \pi )\) holds in the normal hierarchy case, and when \((\delta _{CP}, \, {\mathrm{Arg}}(\alpha _{21}))\sim (\pi , \, 0)\) holds in the inverted hierarchy case. Although \(\alpha _{21}\) and matter effects add positively in the antineutrino transition probability Eq. (4) in the same cases, it does not change the situation because neutrino flux is larger than antineutrino flux in our setup (see Table 3 and Fig. 9).

The T2HKK maintains strong sensitivity to the mass hierarchy in the presence of heavy neutrino mixing. Also, adding a 10 kton detector at Oki to the T2HK drastically ameliorates sensitivity to the mass hierarchy, in spite of the small size of the Oki detector. The advantage of the T2HKK and T2HK+Oki is because measurements with different matter effects resolve degeneracy between non-unitary mixing \(\alpha _{21}\) and matter effects.

We stress that even a small 10 kton detector at Oki negates the impact of heavy neutrino mixing on the mass hierarchy measurement and allows for correct measurement.

  1. (C)

    In Figs. 3 and 6, we find that the value of \(\delta _{CP}\) that minimizes \(\chi ^2\) under the assumption of standard unitary oscillation can deviate from the true \(\delta _{CP}\) by more than 0.15 radian in the T2HK, and by more than 0.1 radian in the T2HKK. The result should be compared to the resolution of \(\delta _{CP}\). To this end, we present in Fig. 8 the difference between \(\delta _{CP}\) that gives \(\chi ^2=1\) and the true \(\delta _{CP}\) when non-unitarity is absent (\(N_{NU}=I\)),

    $$\begin{aligned} (\Delta \delta _{CP} \ {\mathrm{at}} \ 1\sigma )&\equiv (\delta _{CP} \ \mathrm{that \ gives} \ \chi ^2=1) - ({\mathrm{true}} \ \delta _{CP}), \end{aligned}$$
    (36)

    which quantifies \(1\sigma \) resolution of the \(\delta _{CP}\) measurement in the standard oscillation case. Equation (36) is calculated through a simulation performed with the same procedure.

\(1\sigma \) resolution for \(\delta _{CP}\) is better than 0.1 radian in the T2HK and than 0.08 radian in the T2HKKFootnote 3, which is of the same order as the deviation of \(\delta _{CP}\) due to non-unitarity. This indicates that non-unitarity can have a non-negligible effect on the \(\delta _{CP}\) measurement, and hence, if a hint of non-unitarity has been discovered, it must be incorporated in the data fitting in order to measure \(\delta _{CP}\) correctly.

The deviation due to non-unitarity is enhanced when \(({\mathrm{true}} \ \delta _{CP})\sim {\mathrm{Arg}}(\alpha _{21})\) holds. This is because interference between the part of the standard oscillation amplitude proportional to \(e^{i\delta _{CP}}\) and the amplitude involving non-unitary mixing \(\alpha _{21}\) is maximized, and so is the influence of \(\alpha _{21}\) on the value of \(\delta _{CP}\) that minimizes \(\chi ^2\).

Fig. 8
figure 8

The resolution of the \(\delta _{CP}\) measurement when non-unitarity is absent, quantified as the difference between \(\delta _{CP}\) that gives \(\chi ^2=1\) and the true \(\delta _{CP}\) Eq. (36). The results are presented as functions of the true \(\delta _{CP}\). The left and right plots correspond to the cases when the true mass hierarchy is normal and inverted, respectively. In each plot, the solid line indicates the sensitivity in the T2HK, and the dashed line the sensitivity in the T2HKK

The deviation is smaller in the T2HKK than in the T2HK. This is due to the fact that the energy dependence of the part of the standard oscillation amplitude proportional to \(e^{i\delta _{CP}}\), which helps distinguish the standard and non-unitary amplitudes, is more accurately measured with a longer baseline. Hence, the \(\delta _{CP}\) measurement is less affected by non-unitarity in the T2HKK.

4 Summary

We have studied the discovery potential for the mixing of heavy isospin-singlet neutrinos in the Tokai-to-Hyper-Kamiokande (T2HK), the Tokai-to-Hyper-Kamiokande-to-Korea (T2HKK), and a plan of adding a small detector at Oki Islands to the T2HK, and further examined the feasibility of measuring the mass hierarchy and the standard CP-violating phase \(\delta _{CP}\) in the presence of heavy neutrino mixing by fitting data without assuming heavy neutrino mixing. The mixing of heavy neutrinos is parametrized with a non-unitary mixing matrix for active flavors. A benchmark that maximizes the non-unitary mixing and is consistent with the current experimental and theoretical bounds is employed to estimate the largest possible significance of heavy neutrino mixing and its impact on the mass hierarchy and \(\delta _{CP}\) measurement.

Through a simulation, we have revealed that the significance of heavy neutrino mixing can be above \(3\sigma \) in all the experiments when the standard phase \(\delta _{CP}\) and a new CP-violating phase originating from heavy neutrino mixing, Arg(\(\alpha _{21}\)), satisfy Arg(\(\alpha _{21})\simeq \delta _{CP}\). The significance decreases considerably for Arg(\(\alpha _{21})\simeq \delta _{CP}\pm \pi /2\), due to suppression of interference between the standard oscillation amplitude and a non-unitary mixing term.

In the T2HK, the mass hierarchy measurement is so affected by heavy neutrino mixing that the wrong mass hierarchy is favored for \((\delta _{CP}, \, {\mathrm{Arg}}(\alpha _{21}))\sim (0, \, 0), \, (\pi , \, \pi )\) in the normal hierarchy case, and for \((\delta _{CP}, \, \mathrm{Arg}(\alpha _{21}))\sim (\pi , \, 0)\) in the inverted hierarchy case. This is because a non-unitary mixing term possibly cancels matter effects. In contrast, the T2HKK and the extension of the T2HK with a 10 kton detector at Oki show strong sensitivity to the mass hierarchy even with heavy neutrino mixing, because measurements with different baseline lengths at Kamioka and in Korea or at Oki resolve degeneracy between a non-unitary mixing term and matter effects. We thus conclude that (i) the mass hierarchy measurement in the T2HKK is highly stable against any effects of heavy neutrino mixing, and that (ii) although the mass hierarchy measurement in the T2HK is easily affected, the addition of a small detector at Oki negates effects of heavy neutrino mixing.

In the presence of non-unitarity, the value of \(\delta _{CP}\) measured without incorporating non-unitarity deviates from the true \(\delta _{CP}\). The deviation is enhanced when \(\mathrm{Arg}(\alpha _{21})\) is close to the true \(\delta _{CP}\), because in such a case, the part of the standard oscillation amplitude proportional to \(e^{i\delta _{CP}}\) interferes maximally with a non-unitary mixing term at oscillation peaks. The deviation can be of the same order as \(1\sigma \) resolution of the \(\delta _{CP}\) measurement, which suggests that if a hint of non-unitarity has been discovered, one must employ an appropriate ansatz including the non-unitary mixing terms to fit neutrino oscillation data and measure \(\delta _{CP}\).