1 Introduction

The discovery of neutrino oscillations and at least two non-vanishing neutrino masses, provides clear evidence for Physics Beyond the Standard Model (SM). The simplest extension of the SM accommodating two non-vanishing neutrino masses involves the addition of at least two right-handed neutrinos. The most general gauge invariant Lagrangian includes a right-handed bare Majorana mass matrix M. As a result, the scale of M can be much larger than the electroweak scale, which leads to an elegant explanation for the smallness of neutrino masses, through the seesaw mechanism [1,2,3,4,5]. The seesaw mechanism necessarily implies violations from \(3 \times 3\) unitarity of the Pontecorvo–Maki–Nakagawa–Sakata (PMNS) matrix, as well as Z-mediated lepton flavour violating couplings. The introduction of these heavy right-handed neutrinos can also have profound cosmological implications since they are a crucial component of the Leptogenesis mechanism to create the observed Baryon Asymmetry of the Universe (BAU) [6]. Leptogenesis is a very appealing scenario, for reviews see for example [7,8,9,10,11,12,13] but for heavy neutrinos with masses many orders of magnitude higher than the electroweak scale it is difficult or impossible to test it at low energies [14,15,16]. Furthermore, without a flavour model, CP violation at high energies, relevant for Leptogenesis, cannot be related to CP violation at low energies [17, 18]. This may be possible in the context of a flavour model, involving symmetries which allow to establish a connection between high and low energies [19,20,21,22,23,24,25,26].

At this stage, it should be emphasised that within the seesaw type I framework, the observed pattern of neutrino masses and mixing does not require that all the heavy neutrino masses be much larger than the electroweak scale. In this paper, we carefully examine the question of whether it is possible, within the seesaw type I mechanism, to have experimentally detectable violations of \(3 \times 3\) unitarity, taking into account the present experimental constraints. In particular, we address the following questions:

  1. (i)

    In the seesaw type I mechanism, is it possible to have significant deviations from \(3\times 3\) unitarity of the leptonic mixing matrix? By significant, we mean deviations which are sufficiently small to conform to all present stringent experimental constraints on these deviations, but are sufficiently large to be detectable in the next round of experiments. These experimental constraints arise from bounds on rare processes.

  2. (ii)

    In the case the scenario described in (i) can indeed be realised within the framework of seesaw type I, what are the requirements on the pattern of heavy neutrino masses?

For definiteness, we will work in a framework where three right-handed neutrinos are added to the spectrum of the SM. Our analysis starts with the introduction of the unitary \(6 \times 6\) mixing matrix \(\mathcal{V}\), characterising all the leptonic mixing. We write this \(6 \times 6\) mixing matrix in terms of four blocks of \(3 \times 3\) matrices. Using unitarity of \(\mathcal{V}\), we show that the full matrix \(\mathcal{V}\) can be expressed in terms of only three blocks of \(3 \times 3\) matrices. Then we apply these results to the diagonalisation of the \(6 \times 6\) neutrino mass matrix, including both Dirac and Majorana mass terms. Through the use of a specially convenient exact parametrisation of the \(6 \times 6\) leptonic unitary mixing matrix, we evaluate deviations from \(3 \times 3\) unitarity and derive the maximum value of the lightest heavy neutrino mass which is required in order to generate significant deviations from unitarity in the framework of the seesaw type I mechanism.

The paper is organised as follows. In the next section, we review the seesaw mechanism, define our notation and introduce a specially convenient exact parametrisation of the \(6 \times 6\) leptonic mixing matrix \(\mathcal{V}\). We evaluate the size of the deviations of \(3 \times 3\) unitarity in the present framework and derive a constraint on the magnitude of the mass of the heavy Majorana neutrinos, in order to have significant deviations of unitarity. Numerical examples are given in Sect. 3 and some of the derivations of our results are put in an Appendix. Finally we present our conclusions in the last section.

2 Deviations from unitarity in the leptonic sector

2.1 Type I seesaw mechanism

In the context of the Type I seesaw mechanism, with only three right-handed neutrinos added to the Lagrangian of the SM, the leptonic mass terms are given by:

$$\begin{aligned} \mathcal{L}_m= & {} -\Bigg [\overline{{\nu }_{L}^0} m \nu _{R}^0 + \frac{1}{2} \nu _{R}^{0T} C \ M \nu _{R}^0+ \overline{l_L^0} m_l l_R^0\Bigg ] + h. c. \nonumber \\= & {} - \Bigg [\frac{1}{2}n_{L}^{T} C \ \mathcal{M}^* n_L + \overline{l_L^0} m_l l_R^0\Bigg ] + h. c., \end{aligned}$$
(1)

There is no loss of generality in choosing a weak basis where \(m_l\) is already real and diagonal. The analysis that follows is performed in this basis. The neutrino mass matrix \(\mathcal M \) is a \(6 \times 6\) matrix and has the form:

$$\begin{aligned} \mathcal{M}= \left( \begin{array}{c@{\quad }c} 0 &{} m \\ m^T &{} M \end{array}\right) . \end{aligned}$$
(2)

This matrix is diagonalised by the unitary transformation

$$\begin{aligned} \mathcal{V}^T \mathcal{M}^* \mathcal{V} = \mathcal{D} \qquad \text{ i.e. } \qquad \mathcal{V}^\dagger \mathcal{M} = \mathcal{D} \mathcal{V}^T \end{aligned}$$
(3)

where

$$\begin{aligned} \mathcal{D}=\left( \begin{array}{c@{\quad }c} d &{} 0 \\ 0 &{} D \end{array}\right) \end{aligned}$$
(4)

with \(d = \text{ diag }.(m_1, m_2, m_3)\) and \(D = \text{ diag }.(M_1, M_2, M_3)\) denoting respectively, the light and the heavy Majorana neutrino masses. The unitary \(6 \times 6\) matrix \(\mathcal{V} \) is often denoted in the literature as:

$$\begin{aligned} \mathcal{V} =\left( \begin{array}{c@{\quad }c} K &{} R \\ S &{} Z \end{array} \right) ,\ \end{aligned}$$
(5)

where K, R, S and Z are \(3\times 3\) matrices. For K and Z non singular, we may write

$$\begin{aligned} \mathcal{V} =\left( \begin{array}{c@{\quad }c} K &{} 0 \\ 0 &{} Z \end{array} \right) \left( \begin{array}{c@{\quad }c} {1\>\!\!\!\mathrm {I}}\, &{} Y \\ -X &{} {1\>\!\!\!\mathrm {I}}\, \end{array} \right) ;\quad -X=Z^{-1}S;\quad Y=K^{-1}R \end{aligned}$$
(6)

From the unitary relation \(\mathcal{V} \ \mathcal{V} ^{\dagger }={1\>\!\!\! \mathrm {I}}_{(6\times 6)}\), we promptly conclude that

$$\begin{aligned} Y=X^{\dagger }. \end{aligned}$$
(7)

The matrix \(\mathcal{V} \) can thus be written:

$$\begin{aligned} \mathcal{V} =\left( \begin{array}{c@{\quad }c} K &{} KX^\dagger \\ -ZX &{} Z \end{array}\right) . \end{aligned}$$
(8)

We have thus made clear that the unitary \(6 \times 6\) matrix \(\mathcal{V}\) can be expressed in terms of three independent \(3 \times 3\) matrices. From the unitarity of \(\mathcal{V} \), we obtain:

$$\begin{aligned} \begin{array}{l} K\ \left( {1\>\!\!\!\mathrm {I}}\,+X^{\dagger }X\right) \ K^{\dagger } ={1\>\!\!\!\mathrm {I}} \\ \\ Z\ \left( {1\>\!\!\!\mathrm {I}}\,+X\ X^{\dagger }\right) \ Z^{\dagger } ={1\>\!\!\!\mathrm {I}}, \end{array} \end{aligned}$$
(9)

showing that the matrix X parametrizes the deviations from unitary of the matrices K and Z. More explicitly:

$$\begin{aligned} \begin{array}{l} K\ K^{\dagger } = {1\>\!\!\!\mathrm {I}} - K\ X^{\dagger }X \ K^{\dagger } \\ Z\ Z^{\dagger } = {1\>\!\!\!\mathrm {I}} - Z\ X X^{\dagger } \ Z^{\dagger }. \end{array} \end{aligned}$$
(10)

From Eq. (3) we derive:

$$\begin{aligned}&-X^\dagger Z^\dagger m^T = d K^T \end{aligned}$$
(11)
$$\begin{aligned}&K^\dagger m -X^\dagger Z^\dagger M = - d X^T Z^T \end{aligned}$$
(12)
$$\begin{aligned}&Z^\dagger m^T = D X^* K^T \end{aligned}$$
(13)
$$\begin{aligned}&XK^\dagger m + Z^\dagger M = D Z^T, \end{aligned}$$
(14)

replacing \(Z^\dagger m^T\) from Eq. (13) into Eq. (11) we get

$$\begin{aligned} d = - X^T \ D \ X , \end{aligned}$$
(15)

which implies that:

$$\begin{aligned} X = \pm i \sqrt{D^{-1}} O_c \sqrt{d}, \end{aligned}$$
(16)

where \(O_c\) is a complex orthogonal matrix, i.e., \(O_c^T O_c = 1\>\!\!\!\mathrm {I}\), or explicitly:

$$\begin{aligned} |X_{ij}| = \left| (O_c)_{ij} \sqrt{\frac{m_j}{M_i}} \right| . \end{aligned}$$
(17)

Note that Eq. (15) is equivalent to Eq. (2.30) derived in Ref. [17], which is an exact relation resulting from the existence of the zero \(3 \times 3\) block in the matrix \(\mathcal{M}\) of Eq. (2).

It should be stressed that the parametrisation of the \(6 \times 6\) unitary matrix \(\mathcal {V}\) given by Eq. (8) has the especial property of allowing to connect in a straightforward and simple way the masses of the light and the heavy neutrinos through an orthogonal complex matrix \(O_c\), as can be seen from Eqs. (15) and (16). This is an important new result which plays a crucial rôle in our analysis.

Previously, in a variety of contexts, there have been various attempts at describing the possibility of having non-unitary lepton mixing [27, 28]. The key point of our analysis is its generality. The parametrisation of Eq. (8) allows one to write the unitary \(6 \times 6\) mixing matrix \(\mathcal{V}\) in terms of two quasi-unitary matrices K and Z and a matrix X which controls the deviations from unitarity of both K and Z. This is explicitly shown in Eq. (10). The analysis we present here has the advantage of leading to a clearcut quantification of the phenomenological non-unitarity bounds in terms of the related \(6 \times 6\) lepton mixing matrix parameters. Another important point is the fact that, unlike in the case of Ref. [29], we do not assume conservation of lepton number. In the sequel, we shall also present a clear and straightforward relation between the non-unitary deviations and the scale of the neutrino Dirac mass matrix.

Since \(O_c\) is an orthogonal complex matrix, not all of its elements need to be small; furthermore, not all the \(M_i\) need to be much larger than the electroweak scale, in order for the seesaw mechanism to lead to naturally suppressed neutrino masses. These observations about the size of the elements of X are specially relevant in view of the fact that some of the important physical implications of the seesaw model depend crucially on X. In particular, the deviations from \(3 \times 3\) unitarity are controlled by X, as shown in Eq. (10).

Given the importance of the matrix X, one may ask whether it is possible to write the \(6 \times 6\) unitary matrix \(\mathcal{V}\) in terms of \(3 \times 3\) blocks, where only \(3 \times 3\) unitary matrices enter, together with the matrix X. In the Appendix, we show that this is indeed possible, and that the matrix \(\mathcal{V}\) can be written:

$$\begin{aligned}&\mathcal{V}=\left( \begin{array}{c@{\quad }c} K &{} R \\ S &{} Z \end{array} \right) \nonumber \\&\quad \!=\left( \begin{array}{c@{\quad }c} \Omega \left( \sqrt{{1\>\!\!\!\mathrm {I}}\,+X^{\dagger }X}\ \right) ^{-1} &{} \Omega \left( \sqrt{{1\>\!\!\!\mathrm {I}} \,+X^{\dagger }X}\ \right) ^{-1}X^{\dagger } \\ - \Sigma \left( \sqrt{{1\>\!\!\!\mathrm {I}}\,+X\ X^{\dagger }}\ \right) ^{-1}X\ &{} \Sigma \left( \sqrt{{1\>\!\!\!\mathrm {I}} \,+X\ X^{\dagger }}\ \right) ^{-1} \end{array} \right) , \nonumber \\ \end{aligned}$$
(18)

where \(\Omega \) and \(\Sigma \) are \(3 \times 3\) unitary matrices given by:

$$\begin{aligned} \Omega = U_{K}U^{\dagger } \qquad \Sigma = W_{Z}\ W^{\dagger }, \end{aligned}$$
(19)

and U, W are the unitary matrices that diagonalise respectively \(X^\dagger X\) and \(X X^\dagger \):

$$\begin{aligned} U^\dagger \ X^{\dagger }X \ U = d_{X}^{2}; \qquad W^\dagger X X^\dagger \ W = d_{X}^{2}. \end{aligned}$$
(20)

It is also shown in the Appendix, that \(U_K\) and \(W_Z\) defined by:

$$\begin{aligned} U_{K} \equiv K\ U\ \sqrt{\left( {1\>\!\!\!\mathrm {I}} \,+d_{X}^{2}\right) } \qquad W_Z \equiv Z\ W\ \sqrt{\left( {1\>\!\!\!\mathrm {I}} \,+d_{X}^{2}\right) }, \end{aligned}$$
(21)

are in fact unitary matrices.

As will be explained in the next section, this will allow us, in our analysis, to trade the matrix K by the combination \(U_{K}U^{\dagger }\) which we identify as the best fit for \(U_{PMNS}\) derived under the assumption of unitarity, multiplied by the remaining factor that parametrises the deviations from unitarity.

2.2 On the size of deviations from unitarity

In the framework of the type I seesaw, it is the block K of the matrix \(\mathcal{V}\) that takes the rôle played by \(U_{PMNS}\) matrix at low energies in models with only Dirac-type neutrino masses. Clearly, in this framework, K is no longer a unitary matrix. However, present neutrino experiments are putting stringent constraints on the deviations from unitarity. In our search for significant deviations from unitarity of K, we must make appropriate choices for the matrix X in order to comply with the experimental bounds, while at the same time obtain deviations that are sizeable enough to be detected experimentally in the near future. Our aim is to show how one can achieve this result with at least one of the heavy neutrinos with a mass at the TeV scale, without requiring unnaturally small Yukawa couplings and still have light neutrino masses not exceeding one eV.

Deviations from unitarity [30,31,32,33,34,35] of K have been parametrised as the product of an Hermitian matrix by a unitary matrix [34]:

$$\begin{aligned} K = ({1\>\!\!\!\mathrm {I}} - \eta ) V, \end{aligned}$$
(22)

where \(\eta \) is an Hermitian matrix with small entries. In order to identify the different components of our matrix K, given in Eq. (18), with the parametrisation of Eq. (22) we rewrite K as:

$$\begin{aligned} K= & {} U_{K}U^{\dagger }\left( \sqrt{{1\>\!\!\!\mathrm {I}}\,+X^{\dagger }X}\ \right) ^{-1}\nonumber \\= & {} \left[ U_{K}U^{\dagger }\left( \sqrt{{1\>\!\!\!\mathrm {I}}\,+X^{\dagger }X}\ \right) ^{-1} U U_{K}^{\dagger } \right] U_{K}U^{\dagger }, \end{aligned}$$
(23)

inside the square brackets we wrote the Hermitian matrix that we identify with \(({1\>\!\!\!\mathrm {I}} - \eta )\), and which will parametrise the deviations from unitarity. The matrix \(V \equiv U_{K}U^{\dagger }\) is a unitarity matrix which is identified with \(U_{PMNS}\) obtained from the standard parametrisation [36] for a unitary matrix. One can also write:

$$\begin{aligned}&\left[ U_{K}U^{\dagger }\left( \sqrt{{1\>\!\!\!\mathrm {I}}\,+X^{\dagger }X}\ \right) ^{-1} U U_{K}^{\dagger } \right] \nonumber \\&\quad \equiv \left[ U_{K} \left( \sqrt{{1\>\!\!\!\mathrm {I}}\,+ d^2_X }\ \right) ^{-1} U_{K}^{\dagger } \right] , \end{aligned}$$
(24)

where \(d^2_X\) is a \(3 \times 3\) diagonal matrix, introduced in Eq. (20). Identifying the second expression of Eq. (24) to \(({1\>\!\!\!\mathrm {I}} - \eta )\) we derive:

$$\begin{aligned} \eta = {1\>\!\!\!\mathrm {I}} - U_{K} \left( \sqrt{{1\>\!\!\!\mathrm {I}}\,+ d^2_X }\ \right) ^{-1} U_{K}^{\dagger } \approx \frac{1}{2} U_K \ d^2_X \ U_{K}^{\dagger }, \end{aligned}$$
(25)

for small \(d^2_X\) . The matrix \(U_{PMNS}\) is then fixed making use of the present best fit values obtained from a global analysis based on the assumption of unitarity. As pointed out in [34], from the phenomenological point of view it is very useful to parametrise K with the unitary matrix on the right, due to the fact that experimentally it is not possible to determine which physical light neutrino is produced, and therefore, one must sum over the neutrino indices. As a result, most observables depend on \(K K^\dagger \) which depends on the following combination:

$$\begin{aligned} \left( K K^\dagger \right) _{\alpha \beta } = \delta _{\alpha \beta } - 2 \eta _{\alpha \beta } + \mathcal {O}\left( \eta ^2_{\alpha \beta }\right) . \end{aligned}$$
(26)

The standard parametrisation for \(U_{PMNS}\) is given by [36]:

$$\begin{aligned}&U_{ PMNS}\nonumber \\&\quad \! =\left( \!\begin{array}{c@{\quad }c@{\quad }c} c_{12} c_{13} &{} s_{12} c_{13} &{} s_{13} e^{-i \delta } \\ -s_{12} c_{23} - c_{12} s_{23} s_{13} e^{i \delta } &{} \quad c_{12} c_{23} - s_{12} s_{23} s_{13} e^{i \delta } \quad &{} s_{23} c_{13} \\ s_{12} s_{23} - c_{12} c_{23} s_{13} e^{i \delta } &{} -c_{12} s_{23} - s_{12} c_{23} s_{13} e^{i \delta } &{} c_{23} c_{13} \end{array}\!\right) \cdot P\!, \nonumber \\ \end{aligned}$$
(27)

with P given by

$$\begin{aligned} P=\mathrm {diag} \ (1,e^{i\alpha _{21}}, e^{i\alpha _{31}}) \end{aligned}$$
(28)

where \(c_{ij} = \cos \theta _{ij}\), \(s_{ij} = \sin \theta _{ij}\) and \(\delta \) is a Dirac-type CP violating phase, while \(\alpha _{21}\), \(\alpha _{31}\) denote Majorana phases. Neutrino oscillation experiments are not sensitive to these factorisable phases.

There are several groups performing global phenomenological fits on \(\theta _{12}\), \(\theta _{23}\), \(\theta _{13}\) and \(\delta \), as well as on neutrino mass differences [37,38,39].

The specific bounds vary slightly from group to group. For definiteness we present in Table 1 the present bounds on neutrino masses and leptonic mixing from [37]. The quantities \(\Delta m^2_{ij}\) are defined by \((m^2_i - m^2_j)\).

Table 1 Neutrino oscillation parameter summary from [37]. For \(\Delta m^2_{31}\), \(\sin ^2 \theta _{23}\) , \(\sin ^2 \theta _{13}\), and \(\delta \) the upper (lower) row corresponds to normal (inverted) neutrino mass hierarchy. \({}^a\)There is a local minimum in the second octant, at \(\sin ^2 \theta _{23}= 0.596 \) with \(\Delta \chi ^2 = 2.08\) with respect to the global minimum. \({}^b\) There is a local minimum in the first octant, at \(\sin ^2 \theta _{23}= 0.426 \) with \(\Delta \chi ^2 = 1.68\) with respect to the global minimum for IO

In Ref. [34] global constraints are derived on the matrix \(\eta \) through a fit of twenty eight observables including the W boson mass, the effective mixing weak angle \(\theta _W\), several ratios of Z fermionic decays, the invisible width of the Z, several ratios of weak decays constraining EW universality, weak decays constraining CKM unitarity and some radiative lepton flavour violating (LFV) processes. The final result is translated into:

$$\begin{aligned} \left| 2 \eta _{\alpha \beta } \right| \le \left( \begin{array}{c@{\quad }c@{\quad }c} 2.5 \times 10^{-3} &{} 2.4 \times 10^{-5} &{} 2.7 \times 10^{-3} \\ 2.4 \times 10^{-5} &{} 4.0 \times 10^{-4} &{} 1.2 \times 10^{-3} \\ 2.7 \times 10^{-3} &{} 1.2 \times 10^{-3} &{} 5.6 \times 10^{-3} \end{array} \right) . \end{aligned}$$
(29)

Ref. [34] also compares these bounds with those of previous studies [33, 40], pointing out that in general there is good agreement. Variations in the scale of the masses of the heavy neutrinos lead to small effects and therefore, do not significantly change our analysis. In view of the smallness of the entries of this matrix one can infer from Eq. (26) that the deviations from unitarity of the matrix K are very much constrained.

2.3 The elements of the neutrino Dirac mass matrix m and deviations from unitarity

In this subsection, we show that there is a correlation among:

  • The size of deviations from unitarity of the \(3 \times 3\) leptonic mixing matrix.

  • The mass of the lightest heavy neutrino.

From Eq. (13) we get

$$\begin{aligned} m= K X^\dagger D {Z^*}^{-1}, \end{aligned}$$
(30)

the experimental fact that K is almost unitary implies that Z is also almost unitary. Therefore the Dirac mass matrix m is of the same order as X times D. Notice that the scale of D may be of the order of the top quark mass, so that indeed the Yukawa couplings need not be extremely small.

The elements of the neutrino Dirac mass matrix m are connected to the deviations from unitarity of the \(3 \times 3\) leptonic mixing matrix. From Eq. (30) together with Eqs. (21), (23) and (24), we obtain:

$$\begin{aligned} m = U_{K}\left( \sqrt{\left( {1\>\!\!\!\mathrm {I}}\,+d_{X}^{2}\right) }\ \right) ^{-1}d_{X}\ W^{\dagger }\ D\ W^{*}\left( \sqrt{\left( {1\>\!\!\!\mathrm {I }}\,+d_{X}^{2}\right) }\ \right) W_{Z}^{T}, \nonumber \\ \end{aligned}$$
(31)

where we have used \(d_{X}=\) \(W^{\dagger }\ X\ U\) from Eq. (20). Thus, we find

$$\begin{aligned} Tr\left[ mm^{\dagger }\right]= & {} Tr\left[ \left( \sqrt{\left( {1\>\!\!\!\mathrm {I}}\,+d_{X}^{2}\right) }\ \right) ^{-1}d_{X}\ W^{\dagger }\ D\ W^{*}\right. \nonumber \\&\left. \times \left( {1\>\!\!\!\mathrm {I}} \,+d_{X}^{2}\right) \ W^{T}\ D\ W\ d_{X}\ \left( \sqrt{\left( {1\>\!\!\! \mathrm {I}}\,+d_{X}^{2}\right) }\ \right) ^{-1}\right] \nonumber \\ \end{aligned}$$
(32)

As previously emphasised, deviations from \(3 \times 3\) unitarity in \(U_{PMNS}\) are controlled by the matrix X, as it is clear from Eq. (10). For \(X=0\), there are no deviations from unitarity. Small deviations from unitarity correspond to \(d_X\) small and, in that case, one has to a very good approximation,

$$\begin{aligned} Tr\left[ mm^{\dagger }\right] = Tr\left[ d_{X}\ W^{\dagger }D^{2}W\ d_{X}\right] , \end{aligned}$$
(33)

which can be written as:

$$\begin{aligned}&Tr\left[ mm^{\dagger }\right] \nonumber \\&\quad = d_{X_{1}}^{2}\left( M_{1}^{2}\left| W_{11}\right| ^{2}+M_{2}^{2}\left| W_{21}\right| ^{2}+M_{3}^{2}\left| W_{31}\right| ^{2}\right) \nonumber \\&\qquad +\,d_{X_{2}}^{2}\left( M_{1}^{2}\left| W_{12}\right| ^{2}+M_{2}^{2}\left| W_{22}\right| ^{2}+M_{3}^{2}\left| W_{32}\right| ^{2}\right) \nonumber \\&\qquad +\,d_{X_{3}}^{2}\left( M_{1}^{2}\left| W_{13}\right| ^{2}+M_{2}^{2}\left| W_{23}\right| ^{2}+M_{3}^{2}\left| W_{33}\right| ^{2}\right) . \end{aligned}$$
(34)

It can be shown that, using the properties of orthogonal complex matrices, only one of the \(d_{X_i}\), which we take for definiteness as \(d_{X_3}\), can have a significant value (e.g. \(d_{X_3} \approx 10^{-2}\)), while the other two are negligible. In fact from the eigenvalue equation of X we find:

$$\begin{aligned} \begin{array}{l} d_{X_{1}}^2d_{X_{2}}^2d_{X_{3}}^2=\frac{m_{1}m_{2}m_{3}}{M_{1}M_{2}M_{3}} \\ \\ \frac{d_{X_{1}}^{2}d_{X_{2}}^{2}+d_{X_{1}}^{2}d_{X_{3}}^{2}+d_{X_{2}}^{2}d_{X_{3}}^{2} }{d_{X_{1}}^{2}+d_{X_{2}}^{2}+d_{X_{3}}^{2}}\lessapprox O(\frac{m_{3}}{M_{1}} ) \end{array} \end{aligned}$$
(35)

choosing \(d_{X_{3}}^2\) large enough to be experimentally relevant forces the other two eigenvalues to be extremelly small.

Thus, we find in good approximation

$$\begin{aligned} Tr\left[ mm^{\dagger }\right] = d_{X_{3}}^{2}\left( M_{1}^{2}\left| W_{13}\right| ^{2}+M_{2}^{2}\left| W_{23}\right| ^{2}+M_{3}^{2}\left| W_{33}\right| ^{2}\right) \end{aligned}$$
(36)

or using the unitary of W

$$\begin{aligned} Tr\left[ mm^{\dagger }\right]= & {} d_{X_{3}}^{2}M_{1}^{2}\left( 1+\left( \frac{M_{2}^{2}}{M_{1}^{2}}-1\right) \left| W_{23}\right| ^{2}\right. \nonumber \\&\left. +\,\left( \frac{M_{3}^{2}}{M_{1}^{2}} -1\right) \left| W_{33}\right| ^{2}\right) \end{aligned}$$
(37)

which, with the choice \(M_{3}\ge M_{2}\ge M_{1}\), leads to

$$\begin{aligned} d_{X_{3}}^{2}M_{1}^{2}\le Tr\left[ mm^{\dagger }\right] =\sum _{i,j} \left| m_{ij}\right| ^{2} \end{aligned}$$
(38)

From Eq. (38), it is clear that for significant values of \(d_{X_3}\), \(M_1\) cannot be too large in order to avoid a too large value of \(Tr\left[ mm^{\dagger }\right] \), which in turn would imply that at least one of the \(\left| m_{ij}\right| ^{2}\) is too large. This can be seen in both Fig. 1 and Fig. 3 where we plot \(\frac{1}{2}d_{X_{3}}^{2}\) versus \(M_1\). Significant values of \(d_{X_{3}}^{2}\) can only be obtained for \(M_1 \le 1 - 2\) TeV.

Fig. 1
figure 1

Maximum deviations from unitarity as a function of \(M_1\), generated under the condition that Tr\(( mm^\dagger )\le m_t^2\) and \(|\eta _{12}|\le 10^{-4}\)

In all our plots, we require \(Tr\left[ mm^{\dagger }\right] \le m_{t}^{2}\). We consider the case of normal ordering and vary over the values of light neutrinos masses \(m_{i}\), up to \(m_3 = 0.5\) eV. Concerning the heavy Majorana masses \(M_i\), we allow \(M_3\) to reach values of the order of \(10^4 m_t\) and allow for all possible forms of \(O_{c}\). In Figs. 12 we impose the condition that \(\left| \eta _{12}\right| \le 10^{-4}\), while in Figs. 34, we chose \(\left| \eta _{12}\right| \le 2\times 10^{-5}\).

In Figs. 24, we also plot the \(|\eta _{11}|\) deviations from unitarity.

Fig. 2
figure 2

\(|\eta _{11}|\) deviations from unitarity as a function of \(M_1\), generated under the condition that Tr\((mm^\dagger )\le m_t^2\) and \(|\eta _{12}|\le 10^{-4}\)

Fig. 3
figure 3

Maximum deviations from unitarity as a function of \(M_1\), generated under the condition that Tr\((mm^\dagger )\le m_t^2\) and \(|\eta _{12}|\le 2\times 10^{-5}\)

Fig. 4
figure 4

\(|\eta _{11}|\) deviations from unitarity as a function of \(M_1\), generated under the condition that Tr\((mm^\dagger )\le m_t^2\) and \(|\eta _{12}|\le 2\times 10^{-5}\)

3 Numerical examples

In this section, we present some illustrative results of our numerical analysis showing that it is possible to obtain the observed pattern of neutrino masses and mixing without requiring that all masses of the heavy Majorana neutrinos, \(M_i\), be much larger than the electroweak scale. We have realistic examples even when all the three heavy neutrinos have masses below 2 TeV. One might expect that lowering the scale of the heavy neutrino masses would result in the need for extremely small Yukawa couplings for the Dirac mass terms, thus defeating the rationale for the seesaw mechanism. However, this is not the case and, to illustrate, we include for each example the corresponding moduli of the entries of the neutrino Dirac mass matrices m and the trace of the product \(mm^\dagger \).

Our strategy is to fix the matrix X by choosing the matrices d, \(O_c\) and D, with d and D denoting the masses of the physical neutrinos. This immediately determines \(d^2_X\), U and W and the deviations from unitarity of K and Z. Next we chose \(U_K\) in such a way that the block K falls within the experimental limits. The form of Z still depends on \(W_Z\) which can still be freely chosen. Since we have no experimental evidence constraining this block of \(\mathcal {V}\) we choose, for simplicity, \(W_Z\) equal to the identity, in some cases involving permutations, as explained in what follows.

Table 2 Examples of deviations from unitarity, for the case of NO, with light neutrino masses \(m_1=0.005050 ~ eV\), \(m_2=0.01005 ~ eV\), \(m_3=0.05075 ~ eV\). The deviations from unitarity are expressed by the Hermitian matrix \(\eta \). We consider three different hierarchies for the heavy Majorana neutrino masses \(M_i\). For each hierarchy we give two examples, where we vary the choice of the matrix X. The third column contains the Dirac-type neutrino mass matrix, expressed in GeV units, and the fifth one contains the Majorana-type neutrino mass matrix, expressed in units of the top quark mass. The fourth column contains the trace of the product \(mm^\dagger \) in units of the squared top quark mass

In Table 2, we include examples with normal ordering of light neutrino masses, for a particular fixed value of these three masses, common to all examples. We consider three different hierarchies for the heavy Majorana neutrino masses \(M_i\). For each choice of heavy neutrino mass hierarchy we give two examples, where we vary the matrix \(O_c\). The other set of free parameters in our analysis are the Majorana-type phases entering in the choice of matrix \(\Omega \).

In the first four examples given in Table 2 the orthogonal complex matrix is of the form:

$$\begin{aligned} O_{c}=\left( \begin{array}{c@{\quad }c@{\quad }c} 0 &{} \sqrt{x^{2}+1} &{} i\ x \\ 0 &{} i\ x &{} -\sqrt{x^{2}+1} \\ 1 &{} 0 &{} 0 \end{array} \right) \ \left( \begin{array}{c@{\quad }c@{\quad }c} \cos \tau &{} 0 &{} \sin \tau \\ 0 &{} 1 &{} 0 \\ -\sin \tau &{} 0 &{} \cos \tau \end{array} \right) , \end{aligned}$$
(39)

While in the last two examples given in Table 2 the orthogonal complex matrix is of the form:

$$\begin{aligned} O_{c}= & {} \left( \begin{array}{c@{\quad }c@{\quad }c} 0 &{} 0 &{} 1 \\ 1 &{} 0 &{} 0 \\ 0 &{} 1 &{} 0 \end{array} \right) \ \left( \begin{array}{c@{\quad }c@{\quad }c} 0 &{} \sqrt{x^{2}+1} &{} i\ x \\ 0 &{} i\ x &{} -\sqrt{x^{2}+1} \\ 1 &{} 0 &{} 0 \end{array} \right) \nonumber \\&\times \left( \begin{array}{c@{\quad }c@{\quad }c} \cos \tau &{} 0 &{} \sin \tau \\ 0 &{} 1 &{} 0 \\ -\sin \tau &{} 0 &{} \cos \tau \end{array} \right) \end{aligned}$$
(40)
Table 3 Examples of deviations from unitarity, for the case of IO, with light neutrino masses \(m_1=0.05064 ~ eV\), \(m_2=0.05138 ~ eV\), \(m_3=0.00864 ~ eV\). The deviations from unitarity are expressed by the Hermitian matrix \(\eta \). We consider three different hierarchies for the heavy Majorana neutrino masses \(M_i\). For each hierarchy we give two examples, where we vary the choice of the matrix X. The third column contains the Dirac-type neutrino mass matrix, expressed in GeV units, and the fifth one contains the Majorana-type neutrino mass matrix, expressed in units of the top quark mass. The fourth column contains the trace of the product \(mm^\dagger \) in units of the squared top quark mass

with different choices of x and \(\tau \) for the different cases, respectively:

$$\begin{aligned} \begin{array}{c@{\quad }c} x= 2.40 \times 10^{5} &{} \ \tau = \frac{\pi }{2.9} \\ x= 2.12 \times 10^{5} &{} \ \tau = \frac{\pi }{2.8} \\ x= 2.40 \times 10^{5} &{} \ \tau = \frac{\pi }{2.9} \\ x= 2.12 \times 10^{5} &{} \ \tau = \frac{\pi }{2.8} \\ x= 4.16 \times 10^{5} &{} \ \tau = \frac{\pi }{2.9} \\ x= 3.60 \times 10^{5} &{} \ \tau = \frac{\pi }{2.8} \end{array} \end{aligned}$$
(41)

The modulus of m is obtained from Eq. (30) and requires a choice for the \(W_Z\) matrix. This choice has implications for the entries of the blocks S and Z of the matrix \(\mathcal {V}\). Since, at the moment, there are no direct experimental constraints on these entries, providing guidance for this choice, we made the simplest one, for both orderings, by fixing \(W_Z\) to be equal to the identity, in the first four examples. In the last two examples, it was used \(W_Z= \left( \begin{array}{c@{\quad }c@{\quad }c} 1 &{} 0 &{} 0 \\ 0 &{} 0 &{} 1 \\ 0 &{} 1 &{} 0 \end{array} \right) \), such that the Majorana-type mass matrix, M, would be of the same form as the ones from the previous examples. With different choices of \(W_Z\) we could in principle homogenise the orders of magnitude of the entries of m so that all of the Yukawa couplings would be of the same order of magnitude or close.

In Table 3 we include examples with inverted ordering of light neutrino masses, for a particular fixed value of these three masses, common to all examples. We consider the same three different hierarchies for the heavy Majorana neutrino masses \(M_i\), as in the cases of Table 2. For each choice of heavy neutrino mass hierarchy we give two examples, where we vary the matrix \(O_c\). In the second and fourth examples of Table 3, \(O_c\) is of the form given by Eq. (39). In the sixth example of Table 3, \(O_c\) is of the form given by Eq. (40). In the first and third examples \(O_c\) is of the form:

$$\begin{aligned} O_{c}=\left( \begin{array}{c@{\quad }c@{\quad }c} \sqrt{x^{2}+1}&{} 0 &{} i\ x \\ i\ x &{} 0 &{} -\sqrt{x^{2}+1} \\ 0 &{} 1 &{} 0 \end{array} \right) \ \left( \begin{array}{c@{\quad }c@{\quad }c} \cos \tau &{} \sin \tau &{} 0 \\ -\sin \tau &{} \cos \tau &{} 0 \\ 0 &{} 0 &{} 1 \end{array} \right) , \end{aligned}$$
(42)

While in the fifth example given in Table 3 the orthogonal complex matrix is of the form:

$$\begin{aligned} O_{c}= & {} \left( \begin{array}{c@{\quad }c@{\quad }c} 0 &{} 0 &{} 1 \\ 1 &{} 0 &{} 0 \\ 0 &{} 1 &{} 0 \end{array} \right) \ \left( \begin{array}{c@{\quad }c@{\quad }c} \sqrt{x^{2}+1}&{} 0 &{} i\ x \\ i\ x &{} 0 &{} -\sqrt{x^{2}+1} \\ 0 &{} 1 &{} 0 \end{array} \right) \nonumber \\&\times \left( \begin{array}{c@{\quad }c@{\quad }c} \cos \tau &{} \sin \tau &{} 0 \\ -\sin \tau &{} \cos \tau &{} 0 \\ 0 &{} 0 &{} 1 \end{array} \right) \end{aligned}$$
(43)

The choices of x and \(\tau \) are respectively:

$$\begin{aligned} \begin{array}{c@{\quad }c} x= 1.45 \times 10^{5} &{} \ \tau = \frac{\pi }{10} \\ x= 8.38 \times 10^{4} &{} \ \tau = \frac{\pi }{3.9} \\ x= 1.48 \times 10^{5} &{} \ \tau = \frac{\pi }{10} \\ x= 8.38 \times 10^{4} &{} \ \tau = \frac{\pi }{3.9} \\ x=2.45 \times 10^{5} &{} \ \tau = \frac{\pi }{10} \\ x= 1.45 \times 10^{5} &{} \ \tau = \frac{\pi }{3.9} \end{array} \end{aligned}$$
(44)

In all our examples the matrix X will have several entries of order at most \(10^{-2}\).

4 Conclusions

We have studied the possibility of having significant deviations from \(3\times 3\) unitarity of the leptonic mixing matrix in the framework of type-I seesaw mechanism. The analysis was done in the framework of an extension of the Standard Model where three right-handed neutrinos are added to the spectrum. We have shown that the \(6 \times 6\) unitary leptonic mixing matrix \(\mathcal{V}\) can be written in terms of two unitary \(3\times 3\) matrices and a matrix denoted X, which controls the deviations from unitarity of the \(3\times 3\) PMNS matrix. This parametrisation of the matrix \(\mathcal{V}\), played a crucial role in showing that one may have significant deviations from \(3\times 3\) unitarity while conforming to all present data on neutrino masses and mixing, as well as respecting all stringent bounds on deviations from \(3\times 3\) unitarity of the PMNS matrix.

We have presented specific examples where the above deviations from unitarity are sufficiently large to have the potential for being observed at the next round of experiments. An important feature of our analysis is the fact that the mass of the lightest heavy sterile neutrino is, in principle, within experimental reach. This can be achieved without unnaturally small neutrino Yukawa couplings.