1 Introduction

In the last decade compactified large extra dimensions (LED) attracted a lot of attention [1, 2], by providing an attractive possibility to solve the hierarchy problem. This can be achieved by allowing Standard Model (SM) singlets, e.g. gravitons, to propagate in spatial extra dimensions leading to a suppression of the Planck scale by a volume factor of the extra dimensions.

Since a right handed neutrino is a SM singlet it could also be allowed to propagate in the extra dimension, resulting in a suppression of the Yukawa coupling to the left handed neutrino, and thereby, suppressing the neutrino mass [3]. Additionally, if a right handed neutrino feels the extra dimensions, an infinite tower of Kaluza–Klein excitations with masses \(\sim R^{-1}\) appears when integrating out the extra dimensions, resulting in an additional suppression of the neutrino mass by an extra dimensional variant of the type I seesaw mechanism, which was investigated e.g. in [4,5,6]. In this paper, we explain the observed active neutrino mixing within a minimal extra dimensional extension of the SM, where only one right handed neutrino field is introduced, which can propagate in one extra dimension while gravity is allowed to feel a larger number of extra dimensions. Furthermore, the brane where the SM particles and interactions are located is shifted away from the fix points of the \(S_1 / \mathcal {Z}_2\) orbifold.

Without this brane shift the model is only capable of generating one neutrino mass difference. Consequently, the brane shift is necessary to generate a realistic result. A similar setup was discussed with only one generation of neutrinos and a focus on neutrinoless double beta decay [5] or on leptogenesis [7] while in [8] lower limits on the fundamental scale of gravity were derived. A systematic study of right handed neutrinos with a bulk mass term propagating within one flat extra dimension is given in [9].

An important consequence of the active neutrino mixing with sterile neutrinos is that the resulting effective three by three mixing matrix of the active neutrinos is not unitary anymore. This leads to some phenomenological consequences, e.g. in rare lepton decays [8, 10, 11].

The paper is structured as follows: In Sect. 2 the general setup is introduced and the complete mass matrix for the active neutrinos and the Kaluza Klein excitations is derived. In Sect. 3 this mass matrix is analyzed. By employing some approximations, the mixing matrix for the neutrinos in a non-unitarity violating limit is derived and used to constrain the parameter space of the model. Finally, in Sect. 4, the unitarity violation of the system is investigated in more detail and the resulting effects on lepton flavor violating decays are studied.

2 Setup

In this section, we introduce the field content and general properties of the model. A right handed neutrino is added to the SM particle content. Since it is not charged under the SM gauge groups it is allowed to propagate in the extra dimension while all SM particles are confined to a (\(3+1\)) dimensional subspace, called brane. The analysis assumes that the neutrino experiences only one extra dimension, which is not necessarily the case for gravity.Footnote 1

The 5-dimensional bulk neutrino and the SM lepton fields are described by:

$$\begin{aligned} N \left( x^{\mu } , y \right) = \left( \begin{array}{l} \Psi _1 \left( x^{\mu } , y \right) \\ \bar{\Psi }_2 \left( x^{\mu } , y \right) \end{array} \right) ,\quad L \left( x \right) = \begin{pmatrix} \nu _l \left( x^{\mu } \right) \\ l\left( x^{\mu } \right) \end{pmatrix}, \quad l_R \left( x^{\mu } \right) . \end{aligned}$$
(2.1)

\(L\left( x \right) \) and \(l_R \left( x \right) \) are the SM lepton fields with \(l=e, \mu , \tau \). The \(x^{\mu }\), with \(\mu \) running from 0 to 3, are the usual coordinates, y is the extra dimensional coordinate and \(\Psi _1\) and \(\Psi _2\) are 5-dimensional two component spinors.Footnote 2 The extra dimension is compactified on a \(S_1 / \mathbb {Z}_2\) orbifold. \(\Psi _1\) is chosen to be even and \(\Psi _2\) to be odd under a \(y \rightarrow -y\) transformation.

The SM fields, including the left handed neutrinos, are restricted to a brane at \(y=a\). In order to secure \(\mathbb {Z}_2\) invariance, it is necessary to introduce another brane at \(y=2 \pi R - a\), which is not relevant for the problem and therefore is not mentioned further in the following. For previous discussions of extra dimensional models with branes shifted away from the orbifold fixed points compare [4, 5, 7, 8], while in [15] the first string realization of low scale gravity and braneworlds was given.

The Lagrangian of the model is given by [4, 7]:

$$\begin{aligned} \mathcal {L}&= \int \limits _0^{2 \pi R} dy \left\{ \bar{N} \left( i \gamma ^{\mu } \partial _{\mu } + \gamma ^5 \partial _y \right) N \right. \nonumber \\&\quad \left. -\,\frac{M}{2} \left( N^T C^{(5)-1}N + h.c. \right) \right. \nonumber \\&\quad \left. +\, \delta \left( y-a \right) \left[ \frac{h_1^l}{M_F^{1 /2}} L \tilde{\Phi }^* \Psi _1 + \frac{h_2^l}{M_F^{1 /2}} L \tilde{\Phi }^* \Psi _2 \right] \right. \nonumber \\&\quad \left. +\, \delta \left( y-a \right) \mathcal {L}_{SM} \right\} . \end{aligned}$$
(2.2)

Here \(\tilde{\Phi } = i \sigma _2 \Phi ^{*}\) is the hypercharge conjugate of the SM Higgs doublet \(\Phi \) and \(\mathcal {L}_{SM}\) is the SM Lagrangian. The 5D \(\gamma \) matrices and the charge conjugation operator are defined as [7]:

$$\begin{aligned} \gamma ^{\mu }&=\begin{pmatrix} 0 &{}\sigma ^{\mu } \\ \bar{\sigma }^{\mu } &{}0 \end{pmatrix}\quad \gamma ^5 = \begin{pmatrix} -1_2 &{}0 \\ 0 &{}1_2 \end{pmatrix}\\ C^{5}&=- \gamma _1 \gamma _3 = \begin{pmatrix} -i \sigma _2 &{}0 \\ 0 &{}-i \sigma _2 \end{pmatrix}, \end{aligned}$$

where \(\sigma ^{\mu }=(1_2,\sigma )\) and \(\bar{\sigma }^{\mu }=(1_2,-\sigma ) \) with \(\sigma \) being the usual 4D Pauli matrices and \(C^{5}\) is the 5-dimensional analog to charge conjugation in 4 dimensions while, as discussed in [7], the gauge invariant mass term \(N^T C^{(5)-1}N\) is not a true Majorana mass term. However, after integrating out the extra dimension a Majorana mass term in the effective 4-dimensional theory is obtained. The fundamental dimensionless 5D Yukawa couplings are defined as \(h_{1/2}^l\) and \(M_F\) is the fundamental higher dimensional scale of gravity.

In a further step, it is necessary to perform the y-integration in the Lagrangian (2.2). The fields \(\Psi _1\) and \(\Psi _2\) are symmetric and antisymmetric under the y to \(-y\) transformation. Consequently, they can be expanded in a Fourier series:

$$\begin{aligned} \Psi _1 \left( x^{\mu }, y \right)= & {} \frac{1}{\sqrt{2 \pi R}} S_0 \left( x^{\mu } \right) + \frac{1}{\sqrt{\pi R}} \sum \limits _{k=1}^{\infty } S_k \left( x^{\mu } \right) \cos \left( \frac{ky}{R} \right) \end{aligned}$$
(2.3)
$$\begin{aligned} \Psi _2 \left( x^{\mu }, y \right)= & {} \frac{1}{\sqrt{\pi R}} \sum \limits _{k=1}^{\infty } A_k \left( x^{\mu } \right) \sin \left( \frac{ky}{R} \right) . \end{aligned}$$
(2.4)

In the next step, the series expansion for \(\Psi _1\) and \(\Psi _2\) is substituted into the Lagrangian (2.2) and the y-Integration is performed, yielding the following effective Lagrangian:

$$\begin{aligned} \mathcal {L}_{eff}&= \mathcal {L}_{SM} + \bar{S}_0(i \bar{\sigma }_{\mu } \partial ^{\mu })S_0\nonumber \\&\quad + \left( h_1^{l(0)} L \tilde{\Phi }^* S_0 - \frac{M}{2} S_0 S_0 + h.c. \right) \nonumber \\&\quad + \sum \limits _{k=1}^{\infty } \left[ \bar{S}_k(i \bar{\sigma }_{\mu } \partial ^{\mu })S_k + \bar{A}_k(i \bar{\sigma }_{\mu } \partial ^{\mu })A_k \right. \nonumber \\&\quad \left. +\, \frac{k}{R} \left( \bar{A}_k \bar{S}_k+ S_k A_k \right) - \frac{M}{2} \left( S_k S_k+ \bar{A}_k \bar{A}_k + h.c. \right) \right. \nonumber \\&\quad \left. +\, \sqrt{2} \left( h_1^{l(k)} L \tilde{\Phi }^* S_k + h_2^{l(k)} L \tilde{\Phi }^* A_k + h.c. \right) \right] . \end{aligned}$$
(2.5)

The \(\delta \) function in the Yukawa coupling terms leads to 4D Yukawa couplings \( h_1^{l(k)}\) and \( h_2^{l(k)}\), depending on the brane shift a away from the fixed points:

$$\begin{aligned} h_1^{l(k)}&= \frac{h_1^l}{\left( 2 \pi M_F R \right) ^{\frac{1}{2}}} \cos \left( \frac{ka}{R} \right) ,\nonumber \\ h_2^{l(k)}&= \frac{h_2^l}{\left( 2 \pi M_F R \right) ^{\frac{1}{2}}} \sin \left( \frac{ka}{R} \right) . \end{aligned}$$
(2.6)

Note that \(h_2^{l(k)}\) vanishes for \(a=0\) due to the \(\mathbb {Z}_2\) invariance and the fact that \(\Psi _2\) is odd under \(y \rightarrow -y\). Using the relation between the fundamental scale of gravity \(M_F\) and the Planck scale \(M_P\) in dependence of the number of extra dimension n assuming extra dimensions with an equal radius R, \(M_P = \left( 2 \pi M_F R \right) ^{\frac{n}{2}} M_F\), it is obtained:

$$\begin{aligned} h_1^{l(k)}&= \left( \frac{M_F}{M_P} \right) ^{\frac{1}{n}} h_1^l \cos \left( \frac{ka}{R} \right) ,\nonumber \\ h_2^{l(k)}&= \left( \frac{M_F}{M_P} \right) ^{\frac{1}{n}} h_2^l \sin \left( \frac{ka}{R} \right) . \end{aligned}$$
(2.7)

Thus, the 5D Yukawa couplings, expected to be of \(\mathcal {O} \left( 1 \right) \), are suppressed by \(\bar{h}_i^l = \left( \frac{M_F}{M_P} \right) ^{\frac{1}{n}} h_i^l\).

Rewriting the fields S and A into a new basis, the so called weak basis for Kaluza Klein Weyl Spinors, yields:

$$\begin{aligned} \chi _{\pm k} = \frac{1}{\sqrt{2}} \left( S_k \pm A_k \right) . \end{aligned}$$
(2.8)

This leads to the following kinetic term in the Lagrangian (for a more detailed calculation see [4, 5, 7] that use the same setup):

$$\begin{aligned} \mathcal {L} = \bar{\chi } i \bar{\sigma }^{\mu } \partial _{\mu } \chi - \left( \frac{1}{2} \chi ^{T} \mathcal {M} \chi + h.c. \right) , \end{aligned}$$
(2.9)

where

$$\begin{aligned} \mathcal {M}&= \begin{pmatrix} 0 &{} 0 &{} 0 &{} m_0^e &{} m_{+1}^{e} &{} m_{-1}^{e} &{} \cdots \\ 0 &{} 0 &{} 0 &{} m_0^{\mu } &{} m_{+1}^{\mu } &{} m_{-1}^{\mu } &{} \cdots \\ 0 &{} 0 &{} 0 &{} m_0^{\tau } &{} m_{+1}^{\tau } &{} m_{-1}^{\tau } &{} \cdots \\ m_0^{e} &{} m_0^{\mu } &{} m_0^{\tau } &{} M &{} 0 &{} 0 &{} \cdots \\ m_{+1}^{e} &{} m_{+1}^{\mu } &{} m_{+1}^{\tau } &{} 0 &{} M+\frac{1}{R} &{} 0 &{} \cdots \\ m_{-1}^{e} &{} m_{-1}^{\mu } &{} m_{-1}^{\tau } &{} 0 &{} 0 &{} M-\frac{1}{R} &{} \cdots \\ \vdots &{} \vdots &{} \vdots &{} \vdots &{} \vdots &{} \vdots &{} \ddots \end{pmatrix} \nonumber \\&= \begin{pmatrix} 0 &{} Y^T \\ Y &{} \mathcal {M}_{\text {KK}} \end{pmatrix} \end{aligned}$$
(2.10)

and \(\chi ^{T} = (\nu _l^{e},\nu _l^{\mu },\nu _l^{\tau }, \chi _0, \chi _{+1}, \chi _{-1}, \ldots )\) hold. The \(m_k\) are a combination of the Yukawas \(h_1^{(k)}\) and \(h_2^{(k)}\):

$$\begin{aligned} m_k^l&= \frac{v}{\sqrt{2}} \left[ \bar{h}_1^l \cos \left( \frac{ka}{R} \right) + \bar{h}_2^l \sin \left( \frac{ka}{R} \right) \right] \nonumber \\&= A^l \cos \left( \frac{ka}{R} + \Phi ^l \right) . \end{aligned}$$
(2.11)

where \(A^l = \frac{v}{\sqrt{2}} \sqrt{ \left( \bar{h}_1^l \right) ^2 + \left( \bar{h}_2^l \right) ^2}\), \(\Phi ^l = -\arctan \left( \frac{h_2^l}{h_1^l} \right) \) and v is the VEV of the Higgs.

Now the \(\chi _k\) are rearranged in a way, that \(\chi _0\) corresponds to the smallest diagonal entry \( |M_0| = \text {min}|M \pm \frac{k}{R} |\) in the mass matrix [4], with \(|M_0|<\frac{1}{2R}\). Therefore, the mass scale M is irrelevant for the neutrino masses and replaced by \(R^{-1}\). Assuming the minimum to lie at \(k=k_0\) the phases in the \(m_k^l\) need to be changed:

$$\begin{aligned} \Phi ^l = -\arctan \left( \frac{h_2^l}{h_1^l} \right) - \frac{k_0 a}{R}. \end{aligned}$$
(2.12)

Then, the four component spinor vector \(\Psi _{\nu }\) is defined:

$$\begin{aligned} \Psi _{\nu }^T = \left( \begin{pmatrix} \nu _l \\ \bar{\nu }_l \end{pmatrix} , \begin{pmatrix} \chi _{k_0} \\ \bar{\chi }_{k_0} \end{pmatrix}, \begin{pmatrix} \chi _{k_0+1} \\ \bar{\chi }_{k_0+1} \end{pmatrix}, \begin{pmatrix} \chi _{k_0-1} \\ \bar{\chi }_{k_0-1} \end{pmatrix} , \ldots \right) . \end{aligned}$$
(2.13)

Hence, the kinetic term can be written as:

(2.14)

At this point, the dimensionless product \(A^l R\) has to be analyzed since only for the case that \( A^l R \ll 1\) holds a seesaw kind of behavior is possible. By using Eq. (2.6), the relation between the new scale of gravity \(M_F\) and the inverse Radius \(R^{-1}\)

$$\begin{aligned} R^{-1} = 2 \pi \left( \frac{M_F}{M_P} \right) ^{\frac{2}{n}} M_F, \end{aligned}$$
(2.15)

and assuming \(h^l_i\) to be of \(\mathcal {O} \left( 1 \right) \), we obtain:

$$\begin{aligned} A^l R = \left( \frac{M_P}{M_F} \right) ^{\frac{n+1}{n}} \frac{v}{M_P} \frac{1}{2 \pi }. \end{aligned}$$
(2.16)

The value of \(\log _{10} \left( A^l R \right) \) is shown in Fig. 1. For a small \(\frac{M_F}{M_P}\), which is necessary to solve the hierarchy problem, a larger number of extra dimensions is needed to obtain a seesaw kind of behavior. Note that in principle even in the red regions of the plot it is possible to obtain a seesaw kind of behavior by choosing a small value for the 5D Yukawa coupling.

Fig. 1
figure 1

\(\log _{10} \left( A^l R \right) \) in dependence of the number of extra dimensions n and the new fundamental scale of gravity \(M_F\) with \(h^l_i=1\). The black line corresponds to the values, where \(A^l R = 1\) is obtained. Above this line a seesaw like scenario will take place, whereas below this line \(A^l R \gg 1\) could hold and thus generate a scenario similar to pseudo Dirac neutrinos

3 Neutrino masses and mixing

To obtain the neutrino masses the eigenvalues of (2.10) have to be found. Calculating the characteristic polynomial results in:

$$\begin{aligned} 0=P \left[ K_3 \lambda ^3 +K_2 \lambda ^2 + K_1 \lambda + K_0 \right] , \end{aligned}$$
(3.1)

where

$$\begin{aligned} K_3= & {} 1\end{aligned}$$
(3.2)
$$\begin{aligned} K_2= & {} \sum \limits _{k=-\infty }^{\infty } \sum \limits _{F} \frac{\left( m_k^F \right) ^2}{M_0+\frac{k}{R}-\lambda } \end{aligned}$$
(3.3)
$$\begin{aligned} K_1= & {} \sum \limits _{k,j=-\infty }^{\infty } \sum \limits _{F_1>F_2} \frac{\left( m_k^{F_1} \right) ^2 \left( m_j^{F_2} \right) ^2 - m_k^{F_1} m_j^{F_1} m_k^{F_2} m_j^{F_2} }{\left( M_0+\frac{k}{R}-\lambda \right) \left( M_0+\frac{j}{R}-\lambda \right) } \end{aligned}$$
(3.4)
$$\begin{aligned} K_0= & {} \sum \limits _{k,j,l=-\infty }^{\infty } \sum \limits _{F_1,F_2,F_3} \frac{- m_k^e m_j^{\mu } m_l^{\tau } \varepsilon _{F_1 F_2 F_3} m_k^{F_1} m_j^{F_2} m_l^{F_3}}{\left( M_0+\frac{k}{R}-\lambda \right) \left( M_0+\frac{j}{R}-\lambda \right) \left( M_0+\frac{l}{R}-\lambda \right) }\end{aligned}$$
(3.5)
$$\begin{aligned} P= & {} \prod \limits _{k=-\infty }^{\infty } M_0 + \frac{k}{R} - \lambda . \end{aligned}$$
(3.6)

The sums over F run over the flavors and \(F_1>F_2\) has to be understood according to the mass ordering of the charged SU(2) partners. Firstly, we find that \(M_0 \pm \frac{k}{R}\) is not a solution of the equation, since the term in the brackets of Eq. (3.1) is divergent for \(\lambda \rightarrow M_0 \pm \frac{k}{R}\).

Secondly, if the \(m_k^F\) (2.6) factorize into a k and into a F dependent part, \(m_k^F=m_k m^F\), the factors \(K_0\) and \(K_1\) are vanishing, resulting in two mass eigenvalues equal to zero. Since for \(A^l R \ll 1\) the three lightest eigenvalues should correspond to the three active neutrinos, this would lead to only one mass difference.

The \(m_k^F\) are factorizable if \(\Phi ^e = \Phi ^{\mu } = \Phi ^{\tau }\) and/or \(a=0,\frac{\pi R}{2},\pi R\). Consequently, it is not possible to generate two mass differences without a brane shift away from the orbifold fixed points, which means \(a \ne 0, \pi R\). Additionally, \(a=\frac{\pi R}{2}\) is also forbidden, since this localization of the brane leads to a vanishing contribution of \(\Psi _1\) instead of a vanishing contribution of \(\Psi _2\) as for \(a=0,\pi R\), resulting in a factorizable \(m_k^F\).

Next, the infinite sums in the \(K_i\) are solved. This is done explicitly in Appendix A and the following result is obtained:

$$\begin{aligned} \frac{S \left( F_1 , F_2, \lambda \right) }{\pi R A^{F_1} A^{F_2}}&= \left[ \cot \left( \pi R \left[ M_0 - \lambda \right] \right) \cos \left( \Phi ^{F_1} - a \left[ M_0 - \lambda \right] \right) \right. \nonumber \\&\quad \times \left. \cos \left( \Phi ^{F_2} - a \left[ M_0 - \lambda \right] \right) \right. \nonumber \\&\quad \left. - \frac{1}{2} \sin \left( \Phi ^{F_1} + \Phi ^{F_2} - 2a \left[ M_0 - \lambda \right] \right) \right] . \end{aligned}$$
(3.7)

Therewith, the coefficients take the form:

$$\begin{aligned} K_2&= \sum \limits _{F} S\left( F , F, \lambda \right) \\ K_1&= \sum \limits _{F_1>F_2} S \left( F_1 , F_1, \lambda \right) S \left( F_2 , F_2, \lambda \right) - S \left( F_1 , F_2, \lambda \right) ^2\\ K_3&= 1 \\ K_0&= -\sum \limits _{F_1,F_2,F_3} \varepsilon _{F_1 F_2 F_3} S \left( \text {e} , F_1, \lambda \right) S \left( \mathrm {\mu } , F_2, \lambda \right) S \left( \mathrm {\tau } , F_3, \lambda \right) =0. \end{aligned}$$

Since \(K_0=0\), one eigenvalue is always zero, meaning the lightest neutrino mass eigenvalue vanishes.

3.1 Neutrino masses for \(A^F R \ll 1\)

In the following, the neutrino mass generation for \(A^l R \ll 1\) is discussed in more detail.Footnote 3 We are mostly interested in the masses of the active neutrinos and to obtain analytic expressions for them. For that reason, it is assumed that the three lowest eigenvalues correspond to the three active neutrino masses. Consequently, they should be found by performing a series expansion for \(\lambda \) around zero up to third order in Eq. (3.1). The expansion results in \(\sum \nolimits _{i=0}^3 C_i \lambda ^i = 0\) with:

$$\begin{aligned} C_1&= \frac{\pi ^2 R^2}{8} \left[ \sum \limits _{F_1 > F_2} \left( A^{F_1} \right) ^2 \left( A^{F_2} \right) ^2 \right. \nonumber \\&\quad \times \left. \left( \cos \left( 2 \left[ \Phi ^{F_1} - \Phi ^{F_2} \right] \right) - 1 \right) \right] \nonumber \\ C_2&= \frac{\pi R}{2 \sin \left( M_0 \pi R \right) } \left[ \sum \limits _F \left( A^{F} \right) ^2 \right. \nonumber \\&\quad \times \left. \left( \cos \left( M_0 \pi R \right) + \cos \left( M_0 \left[ 2a - \pi R \right] - 2 \Phi ^F \right) \right) \right] \nonumber \\ C_3&= \frac{1}{2 \sin \left( M_0 \pi R \right) ^2} \left[ 1 - \cos \left( 2 M_0 \pi R \right) + \sum \limits _F \left( A^F \right) ^2\right. \nonumber \\&\quad \times \left. \left( \pi ^2 R^2 + \left( \pi R -a \right) \pi R \cos \left( 2 M_0 a - 2 \Phi ^F \right) \right. \right. \nonumber \\&\quad \left. \left. +\, a \pi R \cos \left( 2 M_0 \pi R - 2 \Phi ^F - 2 M_0 a \right) \right) \right] . \end{aligned}$$
(3.8)

In a first approach, it is assumed that the \(\Phi ^F\) are all equal. As shown before, equal \(\Phi ^F\) are leading to two zero mass eigenvalues. The remaining nonzero eigenvalue is calculated to show the behavior of the neutrino mass for different regions of the dimensionless parameter \(M_0 \pi R\). Later, the second mass difference is generated by small differences in the \(\Phi ^F\), \(\delta \Phi \).

By setting \(\Phi ^e=\Phi ^{\mu }=\Phi ^{\tau }= \Phi \), the nonzero eigenvalue results in \(\lambda _3 = - \frac{C_2}{C_3}\):

  • \(A^F R \ll 1\) The eigenvalue results in:

    $$\begin{aligned} \lambda _3&\approx \pi R \sum \limits _F \left( A^F \right) ^2\nonumber \\&\times \frac{\cos \left( a M_0 - \Phi \right) \cos \left( a M_0 - M_0 \pi R - \Phi \right) }{\sin \left( M_0 \pi R \right) } \nonumber \\&= \pi R \sum \limits _F \left( A^F \right) ^2 f \left( a, M_0, R, \Phi \right) . \end{aligned}$$
    (3.9)

    The result splits into two products. The first one \(\pi R \sum \nolimits _F \left( A^F \right) ^2\) is similar to the well known seesaw mass term (\(\frac{m^2}{M}\)). The mass of the heavy right handed neutrino is replaced by \(R^{-1}\). The mass of the introduced right handed bulk neutrino no longer has to be very large, instead a small extra dimension in comparison to the \(A^F\) is required. The second factor is a function \(f \left( a, M_0, R, \Phi \right) \) of the ‘form’ of the extra dimension described by the placement of the brane in the extra dimension a, the lowest diagonal entry in the mass matrix for the KK states \(M_0\), the radius of the extra dimension R and the phase \(\Phi \). This allows to lower the neutrino mass by the function f. Furthermore, if we assume the 5D Yukawa couplings to be of \(\mathcal {O} \left( 1 \right) \) and substitute Eqs. (2.15) and (2.6) for R and \(A_F\), respectively, the product \(\pi R \sum _F \left( A^F \right) ^2\) yields:

    $$\begin{aligned} \pi R \sum _F \left( A^F \right) ^2 \sim \frac{v^2}{M_F}. \end{aligned}$$
    (3.10)

    Here, v is the Higgs VEV and \(M_F\) is the new fundamental scale of gravity. Thus, the first factor in \(\lambda _3\) can be interpret as the typical type I seesaw formula with \(M_F\) playing the roll of the heavy right handed neutrino mass. However, if \(M_F\) is of \(\mathcal {O} \left( 10 \, \mathrm {TeV} \right) \), the first factor in \(\lambda _3\) is of \(\mathcal {O} \left( 1 \, \mathrm {GeV} \right) \). Consequently, the second factor \(f \left( a , M_0 , R, \Phi \right) \) is required to be small in order to achieve a neutrino mass of \(\mathcal {O}\left( 10^{-2} \, \mathrm {eV} \right) \). Another possibility to realize \(m_\nu \sim 10^{-2} \, \mathrm {eV}\) is to allow for larger scales \(M_F\). Within this setup the correct neutrino mass could also be obtained with a larger \(f \left( a , M_0 , R, \Phi \right) \). Moreover, for \(M_F \ge 10^{11} \, \mathrm {GeV}\) a seesaw like scenario (compare with Fig. 1) can be realized within a symmetric setup, i.e. gravity is propagating in same number of extra dimensions as the right handed neutrino does. It should be noticed that expression (3.9) in the limit of \(a \rightarrow 0\) does not coincide with the result obtained for \(A^F R \ll 1\) with a vanishing brane shift,

    $$\begin{aligned} \lambda _3 = -\pi R \cot \left( \pi M_0 R \right) \sum \limits _F \left( \bar{h}_1^F \right) ^2. \end{aligned}$$
    (3.11)

    This issue can be resolved by assuming that new physics enters above the scale \(M_F\), leading to an exponential suppression of KK-excitations with masses greater than \(M_F\). For a more detailed discussion see chapter 4 of [5]. However, the presented formula for the eigenvalue is valid as long as \(a \gg M_F^{-1}\) holds. If in the following a small a is considered, it is important to keep in mind that \(a \gg M_F^{-1}\) still holds.

  • \(M_0 \pi R \rightarrow 0 \quad \text {and} \quad M_0 \gg A^F \Rightarrow \lambda _3=-\left( A^F \right) ^2 M_0^{-1} \cos \left( \Phi \right) ^2\) Here the assumption \(M_0 \ll R^{-1}\) is added. Thus, the important scale for the seesaw mechanism is \(M_0\) instead of \(R^{-1}\). Within this limit, only the lightest KK excitation is relevant for the neutrino mass generation.

Another advantage of the limit \(A^F R \ll 1\) is that the KK excitation can be integrated out. As a consequence, it is possible to obtain an effective three by three mass matrix for the active neutrinos by calculating the diagrams presented in Fig. 2.

Fig. 2
figure 2

Tree level diagram to generate the entries of a effective \(3 \times 3\) mass matrix

$$\begin{aligned}&\mathcal {M}_{F_1,F_2}^{\text {eff}}=\sum \limits _{k=-\infty }^{\infty } \frac{m_k^{F_1} m_k^{F_2}}{M_0 + \frac{k}{R}} = Y^T \mathcal {M}_{\text {KK}}^{-1} Y= S \left( F_1, F_2 , 0 \right) , \end{aligned}$$
(3.12)

where S is the solution of the infinite sum (3.7A.3). Calculating the eigenvalues of \(\mathcal {M}_{F_1,F_2}\) with equal \(\Phi ^F\) yields the same eigenvalue as presented in the approximation \(A^F R \ll 1\) (3.9).

The next step is to analyze the influence of slightly different \(\Phi ^F\). For that it is defined:

$$\begin{aligned} \Phi ^e=\Phi \quad \Phi ^{\mu }=\Phi + \delta \Phi \quad \Phi ^{\tau }=\Phi + r \delta \Phi \end{aligned}$$
(3.13)

To simplify the expressions for the neutrino masses, a series expansion in \(\delta \Phi \) up to leading order is performed. The expansion results in (3.9) for \(\lambda _3\) and in

$$\begin{aligned} \lambda _2 = \frac{\pi R \sin \left( M_0 \pi R \right) \left[ \left( A^e A^{\mu } \right) ^2 + \left( A^e A^{\tau } \right) ^2 r^2 + \left( A^{\mu } A^{\tau } \right) ^2 \left( r-1 \right) ^2 \right] }{4 \left[ \cos \left( a M_0 - \Phi \right) \cos \left( a M_0 - M_0 \pi R - \Phi \right) \right] \left[ \left( A^e \right) ^2 + \left( A^{\mu } \right) ^2 + \left( A^{\tau } \right) ^2 \right] } \delta \Phi ^2. \end{aligned}$$
(3.14)

Moreover, we define: \(A^F=c_F Y\), with \(c_e=1\), and

$$\begin{aligned} s \left( c_{\mu }, c_{\tau } \right)&= 1 + c_{\mu }^2 + c_{\tau }^2 \end{aligned}$$
(3.15)
$$\begin{aligned} w \left( c_{\mu }, c_{\tau } , r \right)&= c_{\mu }^2 + c_{\tau }^2 r^2 + c_{\mu }^2 c_{\tau }^2 \left( r-1 \right) ^2. \end{aligned}$$
(3.16)

With these definitions, the eigenvalues of \(\mathcal {M}_{F_1,F_2}^{\text {eff}}\) are given by:

$$\begin{aligned} \lambda _1= & {} 0 \end{aligned}$$
(3.17)
$$\begin{aligned} \lambda _2= & {} -\frac{\pi R Y^2}{4} \frac{w \left( c_{\mu }, c_{\tau } , r \right) }{s \left( c_{\mu }, c_{\tau } \right) f \left( a , M_0 , R, \Phi \right) } \delta \Phi ^2 \end{aligned}$$
(3.18)
$$\begin{aligned} \lambda _3= & {} \pi R Y^2 s \left( c_{\mu }, c_{\tau } \right) f \left( a , M_0 , R, \Phi \right) . \end{aligned}$$
(3.19)

Eventually, we want to comment on current collider bounds on large extra dimensions [16]. The ATLAS collaboration found an lower limit on the fundamental scale of gravity \(M_F\) of \( \frac{M_F}{TeV} \ge \left( 5.25 , 4.11 , 3.57, 3.27, 3.06 \right) \) for \(n=\left( 2, 3, 4, 5, 6 \right) \) extra dimensions. These limits can be translated into upper bounds on the radius of the extra dimension by applying formula (2.15). The limits are compatible with the observed neutrino masses within the presented framework. The correct neutrino mass scale can be achieved by either choosing a small R (corresponding to a larger \(M_F\)) or a small \(\delta \phi \) since \(\lambda _2 \lambda _3 \sim \pi R Y^2 \delta \phi ^2\), while the correct ratio for the eigenvalues can be accommodated for by choosing a suitable \(f \left( a , M_0 , R, \Phi \right) \) since \(\frac{\lambda _2}{\lambda _3} \sim \left( \frac{\delta \phi }{f \left( a , M_0 , R, \Phi \right) } \right) ^2\).

3.2 Neutrino mixing in leading order in \(\delta \Phi \) for \(A^l R \ll 1\)

In the following considerations only the case \(A^l R \ll 1\) is considered, which was capable of generating a seesaw like scenario. Furthermore, it is also possible to obtain a good approximation for the mixing matrix by diagonalizing \(\mathcal {M}_{F_1,F_2}^{\text {eff}}\) (3.12), by \(U^T \mathcal {M}_{F_1,F_2}^{\text {eff}} U = \mathcal {M}_{\text {diag}}\). The obtained U will be unitary, while the exact three by three PMNS matrix is not. The deviation from unitary, \(\sim Y^T \mathcal {M}_{\text {KK}}^{-2} Y \), is analyzed in more detail in Sect. 4.

Calculating the entries of the mixing matrix U up to leading order in \(\delta \Phi \) with the assumption of a normal mass hierarchy \(|\lambda _3|>|\lambda _2|\), yields:

$$\begin{aligned} U =\begin{pmatrix} \frac{c_{\mu } c_{\tau } \left( r -1 \right) }{\sqrt{w \left( c_{\mu } , c_{\tau }, r \right) }} &{} \frac{c_{\mu }^2 + r c_{\tau }^2}{\sqrt{s \left( c_{\mu } , c_{\tau } \right) w \left( c_{\mu }, c_{\tau } , r \right) }} &{} \frac{1}{\sqrt{s \left( c_{\mu } , c_{\tau } \right) }} \\ -\frac{c_{\tau } r}{\sqrt{w \left( c_{\mu } , c_{\tau }, r \right) }} &{} \frac{c_{\mu } c_{\tau }^2 \left( r - 1 \right) - c_{\mu }^2}{\sqrt{s \left( c_{\mu } , c_{\tau } \right) w \left( c_{\mu }, c_{\tau } , r \right) }} &{} \frac{c_{\mu }}{\sqrt{s \left( c_{\mu } , c_{\tau } \right) }} \\ \frac{c_{\mu }}{\sqrt{w \left( c_{\mu } , c_{\tau }, r \right) }} &{} -\frac{c_{\tau } \left( c_{\mu }^2 \left( r - 1 \right) + r \right) }{\sqrt{s \left( c_{\mu } , c_{\tau } \right) w \left( c_{\mu }, c_{\tau } , r \right) }} &{} \frac{c_{\tau }}{\sqrt{s \left( c_{\mu } , c_{\tau } \right) }} \\ \end{pmatrix}. \end{aligned}$$
(3.20)

Every entry contains a zeroth order contribution in \(\delta \Phi \). Remarkably, this approximated result only depends on three parameters of the model: \(c_{\mu }\), \(c_{\tau }\) and r. Thus, comparing this form of U with the standard parametrization of the neutrino mixing matrix excluding the Majorana phases, which are irrelevant for neutrino oscillations,

$$\begin{aligned} \begin{pmatrix} c_{12} c_{13} &{} s_{12} c_{13} &{} s_{13} e^{-i \delta } \\ -c_{23} s_{12} - s_{23} s_{13} c_{12} e^{i \delta } &{} c_{23} c_{12} - s_{23} s_{13} s_{12} e^{i \delta } &{} s_{23} c_{13} \\ s_{23} s_{12} - c_{23} s_{13} c_{12} e^{i \delta } &{} -s_{23} c_{12} - c_{23} s_{13} s_{12} e^{i \delta } &{} c_{23} c_{13} \end{pmatrix}, \end{aligned}$$
(3.21)

allows to identify these three parameters with the mixing angles and results in a predictive framework. The CP violating phase \(\delta \) is zero in our scenario,Footnote 4 since real Yukawa couplings were assumed. Consequently, \(c_{\mu }\), \(c_{\tau }\) and r are given by:

$$\begin{aligned} c_{\mu }^2= & {} \cot ^2 \left( \Theta _{13} \right) \sin ^2 \left( \Theta _{23} \right) \quad c_{\tau }^2 = \cot ^2 \left( \Theta _{13} \right) \cos ^2 \left( \Theta _{23} \right) \end{aligned}$$
(3.22)
$$\begin{aligned} r= & {} \frac{\frac{\tan \left( \Theta _{12} \right) }{\sin \left( \Theta _{13} \right) } + \tan \left( \Theta _{23} \right) }{\frac{\tan \left( \Theta _{12} \right) }{\sin \left( \Theta _{13} \right) } - \cot \left( \Theta _{23} \right) }, \quad \text {for r>1} \end{aligned}$$
(3.23)
$$\begin{aligned} r= & {} \frac{\frac{\tan \left( \Theta _{12} \right) }{\sin \left( \Theta _{13} \right) } - \tan \left( \Theta _{23} \right) }{\frac{\tan \left( \Theta _{12} \right) }{\sin \left( \Theta _{13} \right) } + \cot \left( \Theta _{23} \right) }, \quad \text {for r<1} \end{aligned}$$
(3.24)

Present neutrino oscillation data for the mixing angles (see Table 1) [17] is used to obtain regions for the parameters \(c_{\mu }\), \(c_{\tau }\) and r.

Table 1 Three-flavor oscillation parameters from [17]
Table 2 Allowed Parameter regions which reproduce the observed neutrino mixing for the different possible orderings of the mass eigenvalues. The first to cases correspond to the NO and the last two to the IO. The values are obtained by using the Best Fit values (BF) for the mixing angles and the \(3 \sigma \) regions, respectively

The possible values for r are obtained from the Eqs. (3.23) and (3.24) while the values for \(c_{\mu }\) and \(c_{\tau }\) are obtained from the Eq. (3.22).

The ordering \(m_1=|\lambda _1|<m_2=|\lambda _2|<m_3=|\lambda _3|\) is not the only possible ordering for the mass eigenvalues \(\lambda _i\). There are three other cases left to discuss (two additional cases are already excluded since \(\lambda _1=0\) and \(m_2^2>m_1^2\) has to be satisfied). The other three cases are:

  • Case II: \(m_1=|\lambda _1|<m_2=|\lambda _3|<m_3=|\lambda _2|\) (NO)

  • Case III: \(m_1=|\lambda _2|<m_2=|\lambda _3|>m_3=|\lambda _1|\) (IO)

  • Case IV: \(m_1=|\lambda _3|<m_2=|\lambda _2|>m_3=|\lambda _1|\) (IO).

The procedure to obtain expressions for the parameters is the same as presented for case I and is repeated for the other cases. The possible \(3 \sigma \) regions for the parameters are presented in Table 2. Note that for case II it is not possible to find an analytic expression for the parameters in dependence of the mixing angles.

4 Unitarity violation and lepton flavor violation

In the previous section, an approximated non unitary violating mixing matrix for the SM neutrinos was calculated. In this section the unitarity violation of the system is analyzed. The deviation of unitarity is given by (calculated in Appendix B):

$$\begin{aligned} \left( \mathcal {E} \right) _{F_1 F_2}&=- \left( Y^T \mathcal {M}_{\text {KK}}^{-2} Y \right) _{F_1 F_2} \nonumber \\&= -\sum \limits _{k=-\infty }^{\infty } \frac{m_k^{F_1} m_k^{F_2}}{\left( M_0 + \frac{k}{R} \right) ^2} =: - S_2 \left( F_1, F_2 \right) \nonumber \\&=\frac{d}{d M_0} S_1 \left( F_1,F_2,0 \right) . \end{aligned}$$
(4.1)

Noteworthy is the influence of the unitarity violation on e.g. rare lepton decays or lepton flavor violating Z decays. These influences are discussed in the following. Note that unitarity violation also has an influence on neutrino oscillation. These influences are not further discussed here but e.g. the effect of large extra dimensions on the DUNE experiment is discussed in [18].

As has been pointed out in [11, 19], the decay width of rare lepton decays \(l_{\alpha } \rightarrow l_{\beta } \gamma \), mediated at one loop level as shown in Fig. 3, strongly depends on the unitarity violation. Furthermore, the ratio of its decay width to the decay width of \(l_{\alpha } \rightarrow v_{\alpha } \bar{\nu }_\beta l_{\beta }\) is given by: [11, 19]

Fig. 3
figure 3

Lepton flavor violating decay at one loop

$$\begin{aligned} \frac{\Gamma \left( l_{\alpha } \rightarrow l_{\beta } \gamma \right) }{\Gamma \left( l_{\alpha } \rightarrow l_{\beta } \bar{\nu }_{\beta } \nu _{\alpha } \right) } = \frac{3 \alpha }{32 \pi } \frac{|\sum _{k=1}^{\infty } U_{\alpha k} U_{k \beta }^{\dagger } F\left( x_k \right) |^2}{\left( U U^{\dagger } \right) _{\alpha \alpha } \left( U U^{\dagger } \right) _{\beta \beta } } . \end{aligned}$$
(4.2)

The matrix U is the mixing matrix as defined in Appendix B. In the sum over k, \(k=1,2,3\) correspond to the mass eigenvalues of the active neutrinos. The ones corresponding to \(k>3\) are the ones close to the masses of the KK excitations. The function \(F\left( x_k \right) \) is a loop function with \(x_k= \frac{m_{\nu _k}^2}{m_W^2}\), where \(m_{\nu _k}\) is the mass of the kth neutrino mass eigenstate and the \(F\left( x_k \right) \) is given by:

$$\begin{aligned} F\left( x \right) = \frac{10 - 43 x+ 78 x^2 -49 x^3 +4 x^4 + 18 x^3 \ln \left( x \right) }{3\left( x-1 \right) ^4} \end{aligned}$$
(4.3)

If the sum \(\sum \nolimits _{k=1}^{\infty } U_{\alpha k} U_{k \beta }^{\dagger } F\left( x_k \right) \) is split into \(\sum \nolimits _{k=1}^{3} U_{\alpha k} U_{k \beta }^{\dagger } F\left( x_k \right) + \sum \nolimits _{k=4}^{\infty } U_{\alpha k} U_{k \beta }^{\dagger } F\left( x_k \right) \) it is reasonable to assume \(F \left( x_k \right) \approx \frac{10}{3}\) for \(k=1,2,3\), since then \(m_{\nu _k} \ll m_W\) holds, what allows to simplify the first sum:

$$\begin{aligned} \sum \limits _{k=1}^{3} U_{\alpha k} U_{k \beta }^{\dagger } F\left( x_k \right) \approx \frac{10}{3} \left( U_P U_P^T \right) _{\alpha \beta } \approx \frac{10}{3} \left( \mathcal {E} \right) _{\alpha \beta }. \end{aligned}$$
(4.4)

Since the complete mixing matrix is unitary, \(\sum \nolimits _{k=4}^{\infty } U_{\alpha k} U_{k \beta }^{\dagger }=-\left( \mathcal {E} \right) _{\alpha \beta } \) is valid. For \(x \ge 0\) the function \(F \left( x \right) \) is always decreasing starting from the value \(F\left( 0 \right) =\frac{10}{3}\) and reaching its minimal value for \(F \left( \infty \right) = \frac{4}{3}\). Assuming \(\mathcal {E}_{\alpha \beta } \ll 1\) and \(F\left( x_k \right) = \frac{4}{3}\) for \(k \ge 4\), what is equivalent to assuming \(M_0 \gg m_W\), allows to find an upper bound for the decay rate or a good approximation for the case \(M_0 \gg m_W\), respectively.

$$\begin{aligned} \frac{\Gamma \left( l_{\alpha } \rightarrow l_{\beta } \gamma \right) }{\Gamma \left( l_{\alpha } \rightarrow l_{\beta } \bar{\nu }_{\beta } \nu _{\alpha } \right) }&\approx \frac{3 \alpha }{32 \pi } |\sum \limits _{k=1}^{3} U_{\alpha k} U_{k \beta }^{\dagger } F\left( x_k \right) \nonumber \\&\quad + \sum \limits _{k=4}^{\infty } U_{\alpha k} U_{k \beta }^{\dagger } F\left( x_k \right) |^2 \le \frac{3 \alpha }{8 \pi } \left( \mathcal {E} \right) _{\alpha \beta }^2 \end{aligned}$$
(4.5)

Next, we derive lower bounds on the decay rate for \(M_0 \approx m_W\) and \(M_0 \ll m_W\). To this end we assume \(F \left( x_k \right) =F \left( M_0 \right) \) for \(k \ge 4\). This is justified since the Loop function \(F \left( \frac{m_i^2}{m_W^2} \right) \) is decreasing with an increasing \(m_i\) and the decay rate is proportional to \(\sum \nolimits _{i>3} \left( A- F \left( \frac{m_i^2}{m_W^2} \right) \right) \). Thus, by choosing \(m_i = M_0\), which is the lowest KK mass, for all i a lower bound on the decay rate is obtained. We discuss the following cases:

  • \(M_0 \approx m_W\) With \(F \left( 1 \right) =\frac{17}{6}\), the lower bound results in:

    $$\begin{aligned} \frac{\Gamma \left( l_{\alpha } \rightarrow l_{\beta } \gamma \right) }{\Gamma \left( l_{\alpha } \rightarrow l_{\beta } \bar{\nu }_{\beta } \nu _{\alpha } \right) } \ge \frac{3 \alpha }{128 \pi } \left( \mathcal {E} \right) _{\alpha \beta }^2 \end{aligned}$$
    (4.6)

    In comparison with the upper bound, a factor \(\frac{1}{16}\) is multiplied to the upper bound.

  • \(M_0 \ll m_W\) A series expansion for small arguments of \(F \left( x \right) \) up to first order yields \(F \left( \frac{M_0^2}{m_W^2} \right) \approx \frac{10}{3} - \frac{M_0^2}{m_W^2}\). Thus, the lower bound results in:

    $$\begin{aligned} \frac{\Gamma \left( l_{\alpha } \rightarrow l_{\beta } \gamma \right) }{\Gamma \left( l_{\alpha } \rightarrow l_{\beta } \bar{\nu }_{\beta } \nu _{\alpha } \right) } \ge \frac{3 \alpha }{32 \pi } \frac{M_0^2}{m_W^2} \left( \mathcal {E} \right) _{\alpha \beta }^2 \end{aligned}$$
    (4.7)

    In this case, the lower bound is additionally suppressed by the small factor \(\frac{M_0^2}{m_W^2}\).

Table 3 Analytic expressions (LO in \(\delta \Phi \)) for the ratios of the branching ratios of the rare lepton decays in terms of the mixing angles and their \(3 \sigma \) regions for the different cases. Case I and II correspond to NO and Case III and IV to IO
Fig. 4
figure 4

Upper bound (upper line), lower bound (lower line) and numerically obtained value (intermediate line) for \(Br_{\mu e}\) are plotted against \(M_0\) for two different configurations of \((q,y,\Phi )\): the left panel shows the branching ratio for \((0.1 , \frac{\pi }{3}, \frac{\pi }{6} )\) and the right panel for \((\epsilon , \frac{\pi }{3}, \frac{\pi }{6})\) with \(\epsilon =10^{-9}\). For the configuration in the right panel the branching ratio is close to the experimental bound (dotted line). The numerically estimated value lies always between the two bounds, has a maximum near \(M_0 = m_W\) and seems to approach a constant value for \(M_0 < m_W\)

Using the experimental values for the branching ratios of the processes \(l_{\alpha } \rightarrow l_{\beta } \bar{\nu }_{\beta } \nu _{\beta }\), see e.g. [20], it is found an expression for the branching ratios of the three processes:

$$\begin{aligned} Br_{\mu e}&\le \frac{3 \alpha }{8 \pi } \left( \mathcal {E} \right) _{\mu e}^2, \quad \quad Br_{\tau e} \le \frac{1}{5.6} \frac{3 \alpha }{8 \pi } \left( \mathcal {E} \right) _{\tau e}^2 \quad \text {and} \nonumber \\ Br_{\tau \mu }&\le \frac{1}{5.9} \frac{3 \alpha }{8 \pi } \left( \mathcal {E} \right) _{\tau \mu }^2 \end{aligned}$$
(4.8)

In the limit of a small \(\delta \Phi \), \(\mathcal {E}_{\alpha \beta }\) is given in leading order in \(\delta \Phi \) by:

$$\begin{aligned} \frac{\mathcal {E}_{\alpha \beta }}{c_{\alpha } c_{\beta }}&= \frac{\left( \pi R Y \right) ^2}{2 \sin \left( y \right) ^2} \left[ 1 - (q-1) \cos \left( 2 y q - 2 \Phi \right) \right. \nonumber \\&\quad \left. +\, q \cos \left( 2 \left[ y - qy + \Phi \right] \right) \right] \end{aligned}$$
(4.9)
$$\begin{aligned}&= \left( \pi R Y \right) ^2 h \left( q,y,\Phi \right) , \end{aligned}$$
(4.10)

where \(q=\frac{a}{\pi R}\) and \(y=\pi M_0 R\). Remarkably, the only dependence on the flavor is given by the factors \(c_{\alpha }\), \(c_{\beta }\). Thus, the ratio of two different branching ratios of rare lepton decays is to leading order in \(\delta \Phi \) given by \(\frac{Br_{\alpha \beta }}{Br_{\gamma \delta }} = \frac{c_{\alpha }^2 c_{\beta }^2}{c_{\gamma }^2 c_{\delta }^2}\).

Note that the ratios of the LFV decays in leading order \(\delta \phi \) are independent of any simplifications of the loop function, e.g. \(F(x_k)=F(M_0)\) for \(k \ge 4\). This is the case since, as shown in Appendix C, \(U_{\alpha k} U_{k \beta }^{\dagger } = K c_{\alpha } c_{\beta } + \mathcal {O} \left( \delta \phi \right) \) holds. Consequently, the only flavor dependent terms, the factors \(c_{\alpha } c_{\beta }\), can be pulled out of the sum in Eq. (4.2) and therefore the ratios of the decay rates in leading order \(\delta \phi \) are independent of the approximations adopted in the loop functions. The results for all four cases for the ratios \(\frac{Br_{\tau \mu }}{Br_{\mu e}}=\frac{c_{\tau }^2}{5.9}\) and \(\frac{Br_{\tau e}}{Br_{\mu e}} = \frac{c_{\tau }^2}{5.6 c_{\mu }^2}\) are presented in Table 3. According to this, there is no reason to expect larger rates for the LFV \(\tau \) decays than for the LFV \(\mu \) decays. For the IO \(Br_{\tau \mu }\) and \(Br_{\tau e}\) can even be expected to be one to two orders of magnitude smaller than \(Br_{\mu e}\).

Furthermore, the same ratios can be expected for lepton flavor violating Z decays, as well, since any one loop diagram contributing to \(Z \rightarrow l_{\alpha } l_{\beta }\) includes a factor of \(|\sum _{k=1}^{\infty } U_{\alpha k} U_{k \beta }^{\dagger } F_Z\left( x_k \right) |^2\), where \(F_Z\left( x_k \right) \) is the loop function of the respective diagram. As for the rare lepton decays, the only dependence on the flavor originates from the mixing matrix elements, which are in leading order in \(\delta \Phi \) proportional to \(c_{\alpha } c_{\beta }\).

If Eq. (4.9) is combined with Eq. (3.183.19) and \(\lambda _3 \approx \sqrt{\Delta m_{\text {atm}}^2} \equiv m_{\nu }\) is assumed, for the branching ratio one obtains:

$$\begin{aligned}&\frac{3 \alpha \left( m_{\nu } \pi R \right) ^2 c_{\mu }^2}{32 \pi \, s \left( c_{\mu }, c_{\tau } \right) ^2} \left[ \frac{10}{3} - F \left( \frac{M_0^2}{m_W^2} \right) \right] ^2 \frac{h \left( q, y, \Phi \right) ^2}{f \left( q, y, \Phi \right) ^2} \nonumber \\&\quad \le Br_{\mu e} \le \frac{3 \alpha \left( m_{\nu } \pi R \right) ^2 c_{\mu }^2}{8 \pi \, s \left( c_{\mu }, c_{\tau } \right) ^2} \frac{h \left( q, y, \Phi \right) ^2}{f \left( q, y, \Phi \right) ^2}. \end{aligned}$$
(4.11)

In Fig. 4, the upper and the lower bound are presented for two configurations of the parameters q, y and \(\Phi \) as well as a numerically obtained value for \(Br_{\mu e}\). For \(M_0 \gtrsim m_W\) the lower bound approaches the upper bound and the numerical value is almost exact. The numerical value for \(Br_{\mu e}\) differs significantly from the bounds for \(M_0 \lesssim m_W\). It reaches its maximum at roughly \(M_0 = m_W\) and decreases slowly afterwards. The maximum value can be estimated by evaluating the lower bound at \(M_0=m_W\):

$$\begin{aligned} Br_{\mu e}&\approx \frac{3 \alpha }{128 \pi } \frac{c_{\mu }^2}{s \left( c_{\mu }, c_{\tau } \right) ^2} \left( \frac{m_{\nu }}{m_W} \right) ^2 y^2 \frac{h \left( q, y, \Phi \right) ^2}{f \left( q, y, \Phi \right) ^2} \nonumber \\&\approx 2 \times 10^{-31} y^2 \frac{h \left( q, y, \Phi \right) ^2}{f \left( q, y, \Phi \right) ^2} . \end{aligned}$$
(4.12)

In the last step, we adopted the best fit values for scenario I for \(c_{\mu }\) and \(c_{\tau }\) (see Table 2). If the branching ratio \(Br_{\mu e}\) is analyzed for different values of q, y and \(\Phi \), it is found that \(Br_{\mu e}\) lies far below the experimental bounds for most of these values since the ratio \(\frac{h \left( q, y, \Phi \right) ^2}{f \left( q, y, \Phi \right) ^2}\) is not much larger than one. This case is illustrated on the left panel of Fig. 4. However, there are configurations of q, y and \(\Phi \) where the factor \(h^2 f^{-2}\) can enhance the branching ratio for this process significantly.

In order to generate a \(Br_{\mu e}\) close to the experimental bounds, the factor \(h^2 f^{-2}\) is required to be roughly \(10^{17}\).

The factor is divergent for three different configurations of the parameters q, y and \(\Phi \). Concerning a large \(Br_{\mu e}\), a small deviation from these divergent configurations \(\epsilon \) is needed and the dependence on \(\epsilon \) is shown in Table 4. Additionally, the influence of these configurations on the value for the phase shift \(\delta \Phi \) is presented, which was required to be small. It can be estimated by the ratio of the non-vanishing eigenvalues \(\lambda _2\) (3.183.19) and \(\lambda _3\) (3.183.19), which has to be \(\sim 1\) for the IO and \(\sim 5\) or \(\sim 0.2\) for the NO, respectively.

To conclude, the extra dimensional setup allows for \(Br_{\mu e}\) close to experimental limits even with 5D Yukawa couplings of \(\mathcal {O} \left( 1 \right) \), if the phases of the Yukawa couplings are close to \(\Phi ^F \approx \frac{2n+1}{2} \pi + qy\) or \(\Phi = \frac{2n+1}{2} \pi + qy -y\).

Table 4 Dependence of \(h^2 f^{-2}\) and \(\delta \Phi \) on the deviation \(\epsilon \) of the presented configurations. Since \(h^2 f^{-2}\) is always \(\sim \epsilon ^{-2}\), \(\epsilon \approx 10^{-7}\) is required to generate \(Br_{\mu e} \approx 10^{-13} - 10^{-14}\). Consequently, this would lead to a large \(\delta \Phi \) for the first case, which is not desirable since \(\delta \Phi \ll 1\) was assumed before. Note that it could be rescued by \(\Phi \approx \frac{\pi }{2}\). The remaining two cases have \(\delta \Phi \sim \epsilon \), thus leading to a very small \(\delta \Phi \) for a large \(Br_{\mu e}\). The \(\epsilon \) dependence of \(\delta \Phi \) is calculated by taking the ratio of \(\lambda _2\) and \(\lambda _3\), leading to \(\delta \Phi \sim f\)

Furthermore, it is possible to extract some information about the fundamental scale of gravity \(M_F\). As we figured out in Sect. 3, for 5D Yukawa couplings of \(\mathcal {O} \left( 1 \right) \) the neutrino mass scale is given by (3.9), (3.10):

$$\begin{aligned} m_{\nu } = \frac{s \left( c_{\mu }, c_{\tau } \right) v^2}{4 M_F} f \left( q,y,\Phi \right) . \end{aligned}$$
(4.13)

In case of a large \(Br_{\mu e}\), it is \(f \left( q,y,\Phi \right) \approx \epsilon \) and \(\frac{h \left( q, y, \Phi \right) ^2}{f \left( q, y, \Phi \right) ^2} \approx \epsilon ^{-2}\), with \(\epsilon \ll 1\). Assuming \(M_0 > m_W\) and combining the expressions for the neutrino mass scale with the expression for \(Br_{\mu e}\) (4.11) yields:

$$\begin{aligned} m_{\nu } = \sqrt{\frac{3 \alpha }{8 \pi }} \frac{c_{\mu }}{s \left( c_{\mu } , c_{\tau } \right) } \frac{m_{\nu } \pi R}{\sqrt{Br_{\mu e}}} \frac{s \left( c_{\mu } , c_{\tau } \right) }{4} \frac{v^2}{M_F}. \end{aligned}$$
(4.14)

Rewriting the radius R in terms of the fundamental scale of gravity \(M_F\) and the number of extra dimensions n, that are experienced by gravity, and using Eq. (2.15) one finds a lower bound on \(M_F\) in terms of n:

$$\begin{aligned} M_F&= \left( \sqrt{\frac{3 \alpha c_{\mu }^2}{512 \pi Br_{\mu e}}} v^2 M_P^{\frac{2}{n}} \right) ^{\frac{n}{2 \left( n + 1 \right) }}\nonumber \\&\ge \left( 1.33 \times 10^9 \mathrm {GeV}^2 M_P^{\frac{2}{n}} \right) ^{\frac{n}{2 \left( n + 1 \right) }} \end{aligned}$$
(4.15)

For some values of n, the lower limit of \(M_F\) is presented in Table 5. Note that for \(n \rightarrow \infty \) the limit approaches \(M_F \ge 36.5 \, \mathrm {TeV}\).

Likewise, one finds a lower limit on the inverse radius \(R^{-1}\) in terms of the number of extra dimensions, which results in \(R^{-1} \ge 5.6 \, \mathrm {GeV}\) for \(n=2\). Since the KK neutrino mass is to a good approximation given by \(M_k = y \left( \pi R \right) ^{-1} + k R^{1} \) with \(|y| \le \frac{\pi }{2}\) the heavier mass eigenstates corresponding to the KK neutrinos, in most cases, cannot be produced in kaon or muon decays. Note that at least the lightest one could be produced for \(y \ll 1\).

Table 5 Lower Limits for the fundamental scale of gravity \(M_F\) for different number of extra dimensions n

5 Summary and conclusions

In this paper, we have studied an extra dimensional seesaw mechanism with a single right handed bulk neutrino. The SM particles are confined to a 4D brane. Shifting the brane away from the orbifold fixed points allows to generate two non-vanishing mass-squared differences as required by neutrino oscillation experiments.

In particular, we have worked out the flavor structure without adopting a non-unitarity approximation of the \(3 \times 3\) submatrix. This allows us to study the phenomenological consequences of the right handed bulk neutrino.

In a first step, we studied the neutrino mass generation and mixing. We further simplified the analysis by assuming CP conversation and that the ratios of the Yukawa coupling of the \(Z_2\) even component and \(Z_2\) odd component of the right handed neutrino to the SM neutrinos \(\frac{h_2^l}{h_1^l}\) are almost the same for all three generations. The allowed parameter space is presented in Table 2. Additionally, the model predicts one massless neutrino which can be probed in large scale structure surveys in cosmology.

It is pointed out that the model is capable of generating \(Br_{\mu e}\) close to the experimental bounds even with Yukawa couplings close to one. As discussed in Sect. 4, the contribution to \(l_i \rightarrow l_j \gamma \) is maximized if the lightest KK excitation has roughly the W-Boson mass. Due to the suppression of the Yukawa coupling by the extra dimension it is still possible to generate the observed neutrino mass with a Yukawa coupling of order one in this case. However, this effect is not strong enough to produce \(Br_{\mu e}\) close to \(10^{-13}\). Therefore, some fine tuning of the brane shift, the ratio of the lowest KK mass to \(R^{-1}\) and \(\frac{h_2^l}{h_1^l}\) is necessary. Note that this behavior is not an exclusive feature of the brane shifted model and is also possible without a brane shift. In this case, \(M_0\) close to \(\frac{1}{2} R^{-1}\) is required to generate a sizable \(Br_{\mu e}\). However, the brane shift is necessary to generate two neutrino mass squared differences.

A strong prediction of the model within the approximations mentioned above are the ratios of flavor violating charged lepton decay and Z decay branching ratios which are correlated with the neutrino mixing angles and the neutrino mass hierarchy. Thus, the model could be tested by the next generation of experiments looking for charged lepton flavor violation. Furthermore, it could allow for a distinction of the neutrino mass hierarchies by the measurement of lepton flavor violating processes.

Finally, note that the model might also be probed in neutrino oscillation experiments due to effects of non-unitarity [21], although these effects are not further investigated within this work.