1 Introduction

The discovery of neutrino oscillations and the first information on lepton mixing angles has led to an intense research in this field. The data could shed light on the flavour mystery and help to establish some organising principle relevant for fermion masses and mixing parameters. Before the determination of the reactor mixing angle \(\theta _{13}\) several flavour symmetries, usually broken in a particular way, have been proposed, see [1, 2], which give rise to mixing patterns with vanishing \(\theta _{13}\). A prime example is tri-bimaximal (TB) mixing [3, 4] which was well compatible with the experimental data and can be derived from flavour groups such as \(A_4\) [58] and \(S_4\) [912]. The discovery of a relatively large reactor mixing angle [1318], however, requires most likely additional ingredients beyond the presence of a small discrete flavour symmetry. The simplest assumption which can be made is to invoke sufficiently large corrections to the leading order (LO) pattern such that current data [19] can be reproduced, see e.g. [20]. A disadvantage of this Ansatz is that the predictability of such models is in general reduced, since several corrections are present. Discrete groups larger than \(A_4\) or \(S_4\) have also been considered in order to accommodate a non-vanishing \(\theta _{13}\) [21, 22]. Another possibility is to apply less stringent requirements on the residual symmetries in the charged lepton and neutrino sectors [2326]. In this case \(\theta _{13}\) becomes essentially a free parameter. However, correlations among the other mixing angles and \(\theta _{13}\) could still provide a testable prediction.

The possible role of CP and its ability to explain a non-vanishing reactor mixing angle and to constrain, at the same time, CP phases has been explored to some extent in the past. In such a case CP acts non-trivially on flavour space [27, 28] and is thus dubbed generalised CP transformation. A well-known example is given by the \(\mu \tau \) reflection symmetry [2932], which exchanges a muon (tau) neutrino with a tau (muon) antineutrino in the charged lepton mass basis. If this symmetry is imposed, the atmospheric mixing angle is predicted to be maximal, while \(\theta _{13}\) is in general non-vanishing for a maximal Dirac phase \(\delta \). This symmetry is preserved by the neutrino mass matrices of two recently proposed models [33, 34] which combine the flavour symmetries \(S_4\) and \(A_4\), respectively, with CP. In [33] a non-supersymmetric extension of the standard model (SM) is considered in which neutrino masses arise from the type I seesaw mechanism. The flavour symmetry, which is augmented by a direct product of \(Z_2\) symmetries, is broken spontaneously through the vacuum expectation values (VEVs) of gauge singlets. The neutrino mass spectrum allows for normal as well as inverted hierarchy. However, crucial aspects, such as the vacuum alignment, are only sketched and restrictions on couplings being real or imaginary seem to be imposed by hand. The model in [34] is also non-supersymmetric and neutrino masses are generated through the type II seesaw mechanism. Neutrino masses are normally ordered. The VEVs of multiple Higgs doublets and triplets break the flavour symmetry spontaneously. However, their potential is not discussed explicitly. Furthermore, the \(\mu \tau \) reflection symmetry of the neutrino mass matrix seems to be accidental and the value of the solar mixing angle is not protected by a symmetry.Footnote 1

Taking these examples as starting point, we have considered an approach in which a discrete flavour group \(G_f\) and a CP symmetry are broken in a particular way such that the Pontecorvo–Maki–Nakagawa–Sakata (PMNS) mixing matrix depends only on one continuous parameter. In the presence of a flavour symmetry \(G_f\) CP transformations cannot be defined arbitrarily, but they are subject to certain consistency conditions [4446]. If these are fulfilled, we can consider a theory with \(G_f\) and CP and can study the scenario in which residual symmetries \(G_e\) and \(G_\nu \) are preserved in the charged lepton and neutrino sectors, respectively. \(G_e\) is assumed to be a subgroup of \(G_f\), while, generalising the idea of the \(\mu \tau \) reflection symmetry, \(G_\nu \) is taken to be the direct product of CP and a \(Z_2\) symmetry contained in \(G_f\). In [45] we have shown, in a model-independent way, that such a breaking leads to a mixing pattern with only one free parameter and thus mixing angles as well as CP phases are strongly correlated. We have furthermore shown that for \(G_f=S_4\) several new mixing patterns, well compatible with current experimental data [19], can be derived.

In the present paper we discuss a concrete model for leptons in which the above idea of the breaking of a flavour and a CP symmetry is implemented. The model is SUSY and adopts the gauge symmetry, lepton superfields and Higgs multiplets of the Minimal SUSY SM (MSSM).Footnote 2 \(G_f=S_4\times Z_3\) plays the role of the flavour symmetry. The \(Z_3\) symmetry is employed in order to naturally generate the charged lepton mass hierarchy. We introduce additional degrees of freedom, flavons and driving fields, which are singlets under the gauge group and are necessary for the correct spontaneous breaking of \(G_f\) and CP. Two distinct sets of flavons are present whose VEVs contribute either to the charged lepton or to the neutrino mass matrix at LO. This separation is achieved by the \(Z_3\) symmetry as well as with the help of an additional \(Z_{16}\) group that is, in general, responsible for forbidding unwanted operators. The VEVs of the former set of flavons leave \(G_e\) invariant, while those of the latter \(G_\nu =Z_2 \times \)CP (\(\times Z_3\)). The \(Z_3\) symmetry preserved in the neutrino sector is irrelevant as regards lepton mixing and masses and thus will be omitted in the following. The vacuum alignment is achieved via the requirement of vanishing F-terms of the driving fields. Neutrino masses arise in our model from the Weinberg operator. The lepton mixing matrix is of the same form as in one of the cases exemplified in [45] and the following predictions are obtained: \(\sin ^2 \theta _{23}=1/2\), \(\sin ^2 \theta _{12} \gtrsim 1/3\) and \(\delta \) maximal. The free parameter which is directly related to the reactor mixing angle is naturally of the correct size due to its origin from a subleading contribution to the neutrino mass matrix. All mixing angles agree within \(3 \, \sigma \) with the latest global fit results [19]. Both Majorana phases are trivial at LO. The maximality of the atmospheric mixing angle and of the Dirac phase are consequences of the invariance of the neutrino mass matrix under the \(\mu \tau \) reflection symmetry in the charged lepton mass basis. Since some of the contributions to the neutrino mass matrix that are compatible with the preservation of \(G_\nu \) are suppressed in our model, the neutrino mass spectrum can only be normally ordered. Indeed, the three neutrino masses effectively depend on only two different parameters so that, after fitting the two known mass squared differences, the absolute mass scale and the sum of the neutrino masses are strongly constrained.

The paper is organised as follows: in Sect. 2 we briefly recall the framework of [45] and we apply it to the case of our interest. We assign the transformation properties of lepton and Higgs supermultiplets under \(G_f=S_4\times Z_3\) and CP and we determine the most general VEVs of the flavon fields, compatible with the desired residual symmetries in the charged lepton and neutrino sectors. In Sect. 3 we specify the superpotential involving lepton superfields, we analyze charged lepton and neutrino mass matrices, both at LO and at next-to-leading order (NLO), and we present the results for lepton masses and mixing parameters. In Sect. 4 we discuss the vacuum alignment and show that at LO the desired pattern of VEVs is a minimum of the scalar potential in the SUSY limit. Furthermore, we examine NLO contributions to the flavon superpotential which induce shifts in the flavon VEVs. In Sect. 5 we summarise the main results of our paper. Some details of the group theory of \(S_4\) are found in Appendix A.

2 Approach

We elaborate on a proposal which we have recently presented in [45]. We assume a finite, discrete flavour group, here \(G_f=S_4 \times Z_3\), the direct product of the permutation group of four distinct objects and the cyclic symmetry \(Z_3\). This group is combined with a generalised CP transformation \(X\) which also acts on flavour space. If these symmetries are broken in a particular way, all lepton mixing angles and CP phases can be predicted in terms of only one free parameter.

The group \(S_4\) can be conveniently represented in terms of three generators \(S\), \(T\) and \(U\) with \(S\) and \(U\) having order two, while \(T\) is of order three [50]. For details as regards the group theory of \(S_4\) see Appendix A. A generic multiplet of fields is denoted by \(\phi \) and transforms in a certain representation \((\mathbf{r}, q)\) of the flavour group \(G_f\) (\(\mathbf{r}\) is one of the \(S_4\) representations and \(q\) stands for the \(Z_3\) charge of the multiplet: 1, \(\omega \) or \(\omega ^2\) with \(\omega =e^{2 \pi i/3}\)). The CP transformation \(X\) acts on \(\phi \) as

$$\begin{aligned} \phi ' (x) = X \phi ^* (x_\mathrm{CP}) \end{aligned}$$
(1)

with \(x_\mathrm{CP}=(x^0,-\vec {x})\). The choice of \(X\) is not arbitrary, but \(X\) has to fulfill certain conditions: first, we require \(X\) to be a unitary and symmetric matrix,

$$\begin{aligned} X X^\dagger = X X^* = {\small 1}\!\!1, \end{aligned}$$
(2)

so that \(CP^2=1\) holds. Secondly, \(X\) also has to fulfill a consistency condition involving transformations which belong to the group \(G_f\). For all elements \(g\) of the group \(G_f\) the following statement has to hold: let \(A\) be the (unitary) matrix which represents \(g\) in \((\mathbf{r}, q)\), then an element \(g'\in G_f\) should exist such that

$$\begin{aligned} (X^{-1}A X)^*=A' \end{aligned}$$
(3)

with \(A'\) denoting the (unitary) matrix associated with \(g'\) in \((\mathbf{r}, q)\). In general, \(g\) and \(g'\) are distinct elements. The mathematical structure of the group \(G_\mathrm{CP}\) comprising \(G_f\) and CP turns out to be a semi-direct product and can be written as \(G_\mathrm{CP}= G_f \rtimes H_\mathrm{CP}\) with \(H_\mathrm{CP}\) being the parity group generated by CP [4446].

Closely following the analysis in [45], we show that the lepton mixing parameters can be predicted from the breaking of the group \(G_\mathrm{CP}\) to \(G_\nu = Z_2 \times \)CP in the neutrino and to \(G_e=Z_3^{(D)}\) in the charged lepton sector. The group \(Z_3^{(D)}\) is the diagonal subgroup of the group \(Z_3 \subset S_4\), which is generated by \(T\) and the additional \(Z_3\) symmetry.Footnote 3 The requirement that \(G_\nu \) should be a direct product of \(Z_2 \subset G_f\) and CP leads to a further constraint on \(X\)

$$\begin{aligned} XZ^*-ZX=0 \end{aligned}$$
(4)

with \(Z\) being the generator of the \(Z_2\) symmetry in the representation \((\mathbf{r}, q)\).

In [45] we have performed a comprehensive study of all admissible \(X\) for the possible choices of \(Z\) in the case of \(G_f=S_4\). As can be checked these transformations \(X\) are also viable for \(G_f=S_4 \times Z_3\), since the cyclic symmetry \(Z_3\) in \(G_f\) does not impose any further constraints on \(X\). In order to simplify our considerations regarding the diagonal subgroup \(Z_3^{(D)}\), we choose a basis in which the generator \(T\) is diagonal in all representations of \(S_4\). Its explicit realisations as well as those of the other two generators, \(S\) and \(U\), can be found in Appendix A.

If we take

$$\begin{aligned} Z=S, \end{aligned}$$
(5)

a consistent choice for \(X\) satisfying Eqs. (24) is

$$\begin{aligned}&X_{\mathbf{1}} =1 \; , \; X_{\mathbf{1'}} =-1 \; , \; X_{\mathbf{2}}=\left( \begin{array}{c@{\quad }c} 0&{}1 \\ 1&{}0 \end{array} \right) ,\nonumber \\&X_{\mathbf{3}}= - \left( \begin{array}{c@{\quad }c@{\quad }c} 1 &{} 0 &{} 0 \\ 0 &{} 0 &{} 1 \\ 0 &{} 1 &{} 0 \end{array}\right) ,\nonumber \\&X_{\mathbf{3'}} = \left( \begin{array}{c@{\quad }c@{\quad }c} 1 &{} 0 &{} 0\\ 0 &{} 0 &{} 1\\ 0 &{} 1 &{} 0 \end{array} \right) \;. \end{aligned}$$
(6)

Here we denote by \(X_{\mathbf{r}}\) the realisation of the CP transformation \(X\) in the representation \(\mathbf{r}\) of \(S_4\). This choice together with Eq. (1) specifies the action of CP on flavour space. Note that the transformations in Eq. (6) correspond to the choice \(X_{\mathbf{3'}}={\small 1}\!\!1\) in [45] (this can be checked by applying the basis transformation mentioned in footnote 7 in [45]). The generalised CP transformation \(X_{\mathbf{3'}}\) in Eq. (6) is known as \(\mu \tau \) reflection symmetry in the literature [2932].

Let us consider that left-handed leptons \(l\) transform as \((\mathbf{3'},1)\) under \(S_4\times Z_3\). For \(G_e=Z_3^{(D)}\), the combination \(m_l^\dagger m_l\) (\(m_l\) is the charged lepton mass matrix in the right-left basis, \(l^c \, m_l \, l\)) is diagonal and thus does not contribute to the lepton mixing, up to possible relabelling of the generations. For neutrinos we assume that they are Majorana particles and that their masses arise from the Weinberg operator. Preserving \(G_\nu =Z_2 \times \)CP in the neutrino sector requires that the neutrino mass matrix \(m_\nu \) fulfills the invariance conditions

$$\begin{aligned} Z^Tm_\nu Z=m_\nu ,~~~~~~~X_{\mathbf{3'}} m_\nu X_{\mathbf{3'}}= m_\nu ^*~~~ \end{aligned}$$
(7)

from which follows

$$\begin{aligned} U_{PMNS}^T~m_\nu ~ U_{PMNS}=m_\nu ^\mathrm{diag}~~~ \end{aligned}$$
(8)

with

$$\begin{aligned}&U_{{ PMNS}}\nonumber \\&\quad = \frac{1}{\sqrt{6}} \, \left( \begin{array}{c@{\quad }c@{\quad }c} 2 \cos \theta &{} \sqrt{2} &{} 2 \sin \theta \\ -\cos \theta + i \sqrt{3} \sin \theta &{} \sqrt{2} &{} -\sin \theta - i \sqrt{3} \cos \theta \\ -\cos \theta - i \sqrt{3} \sin \theta &{} \sqrt{2} &{} -\sin \theta + i \sqrt{3} \cos \theta \\ \end{array} \right) K\nonumber \\ \end{aligned}$$
(9)

and \(m_\nu ^\mathrm{diag}\) containing the neutrino masses \(m_i\), \(i=1,2,3\). The unitary matrix \(K\) is diagonal with entries \(\pm 1\) and \(\pm i\), which encode the CP parity of the neutrino states. Notice that, without a particular model, the mixing matrix is only determined up to permutations of rows and columns, since neither charged lepton nor neutrino masses are constrained by the above conditions. All mixing parameters, angles and Dirac as well as Majorana phases, are predicted in terms of the parameter \(\theta \), which depends on the entries of the neutrino mass matrix and that can take in general values between 0 and \(\pi \). From Eq. (9) the mixing angles can be read offFootnote 4

$$\begin{aligned}&\sin ^2 \theta _{13} =\frac{2}{3} \sin ^2 \theta , \;\; \sin ^2 \theta _{12}=\frac{1}{2+\cos 2 \theta }, \sin ^2 \theta _{23}=\frac{1}{2}.\nonumber \\ \end{aligned}$$
(10)

Thus, for \(\theta \approx 0.185\) the best fit value of the reactor mixing angle can be accommodated well, \(\sin ^2 \theta _{13} \approx 0.023\), and also the solar and atmospheric mixing angles are within their \(3 \, \sigma \) ranges [19], \(\sin ^2 \theta _{12} \approx 0.341\). For a \(\chi ^2\) analysis we refer to [45]. Since the second column of the mixing matrix in Eq. (9) is tri-maximal, the solar mixing angle is constrained by \(\sin ^2 \theta _{12} \ge 1/3\) [5357]. The Majorana phases \(\alpha \) and \(\beta \) are trivial,

$$\begin{aligned} \sin \alpha =0, \;\; \sin \beta =0, \end{aligned}$$
(11)

while the Dirac phase \(\delta \) is maximal

$$\begin{aligned} |\sin \delta |=1 \end{aligned}$$
(12)

and thus \(|J_\mathrm{CP}|=|\sin 2 \theta |/(6 \sqrt{3})\), which results in \(|J_\mathrm{CP}| \approx 0.0348\) for \(\theta \approx 0.185\). A characteristic feature of the matrix in Eq. (9), which has already been noticed in the literature [2932], is that the absolute values of the entries of the second and third rows are equal, i.e. \(|U_{\mu i}|=|U_{\tau i}|\) for \(i=1,2,3\). This directly leads to maximal atmospheric mixing, since \(\sin \theta _{23} = \cos \theta _{23}\) and for non-vanishing \(\theta _{13}\) also to a maximal Dirac phase, \(\cos \delta =0\).

In the subsequent section we want to construct an explicit model based on the above choices for \(Z\) and \(X\), see Eqs. (5, 6), since this choice leads to, at least, one non-trivial CP phase and, simultaneously, all mixing angles are in good accordance with the experimental data for a certain value of \(\theta \). Footnote 5 Relevant aspects of the model are outlined in the following. The flavour symmetry is broken spontaneously by the VEVs of gauge singlets. While we have chosen left-handed leptons to be in \((\mathbf{3'}, 1)\) under \(S_4 \times Z_3\), the right-handed charged leptons transform as singlets: \(e^c \sim (\mathbf{1'}, 1)\), \(\mu ^c \sim (\mathbf{1}, \omega )\) and \(\tau ^c \sim (\mathbf{1}, \omega ^2)\). This choice turns out to be useful for accommodating the hierarchy among charged lepton masses correctly and at the same time does not affect the results for lepton mixing. As we argue in Sect. 3, the mass hierarchy can be naturally achieved with a judicious choice of flavons. The auxiliary symmetry \(Z_{16}\) is invoked to sufficiently disentangle the two flavour symmetry breaking sectors and to forbid, in general, unwanted operators. The transformation properties of the matter superfields and the MSSM Higgs doublets under \(S_4 \times Z_3\) and \(Z_{16}\) can be found in Table 1.

Table 1 Matter superfields and MSSM Higgs doublets \(h_{u,d}\) of our model

Since we wish to break the group \(G_\mathrm{CP}\) spontaneously to \(G_e=Z_3^{(D)}\) and \(G_\nu =Z_2 \times \)CP, it is convenient to list the most general form of the VEVs that flavon fields in the different representations of \(S_4\) can take and which leave \(G_e\) and \(G_\nu \) invariant, respectively. In order to preserve the group \(G_e=Z_3^{(D)}\), flavons \(\xi \) and \(\xi '\) in one-dimensional representations \(\mathbf{1}\) and \(\mathbf{1'}\) of \(S_4\), which have a non-vanishing VEV, should transform trivially under \(Z_3\). In the case of a flavon \(\chi \) being a doublet under \(S_4\) the group \(G_e\) can be preserved only if \(\chi \) carries a non-trivial charge under \(Z_3\). The allowed VEVs are

$$\begin{aligned}&\chi \sim (\mathbf{2}, \omega ) \; : \;\; \langle \chi \rangle \propto \left( \begin{array}{c} 0 \\ 1 \end{array} \right) \;\;\; \text{ and } \;\;\; \chi \sim (\mathbf{2}, \omega ^2):\nonumber \\&\quad \langle \chi \rangle \propto \left( \begin{array}{c} 1 \\ 0 \end{array} \right) . \end{aligned}$$
(13)

Flavons \(\psi \sim \mathbf{3}\) and \(\varphi \sim \mathbf{3'}\) can always acquire a non-zero VEV, independent of their charge under \(Z_3\). The latter charge, however, determines which of the components actually acquires a VEV:

$$\begin{aligned}&\psi \sim (\mathbf{3}, 1) \, : \, \langle \psi \rangle \propto \left( \begin{array}{c} 1 \\ 0 \\ 0 \end{array} \right) \, , \, \psi \sim (\mathbf{3}, \omega )\, : \, \langle \psi \rangle \propto \left( \begin{array}{c} 0 \\ 1 \\ 0 \end{array} \right) ,\nonumber \\&\psi \sim (\mathbf{3}, \omega ^2) \, : \, \langle \psi \rangle \propto \left( \begin{array}{c} 0 \\ 0 \\ 1 \end{array} \right) \, . \end{aligned}$$
(14)

The same is true for the structure of the VEVs of \(\varphi \sim \mathbf{3'}\).

The VEVs that preserve \(G_\nu \) are of the form

$$\begin{aligned}&\langle \xi \rangle =v_\xi , \;\; \langle \xi ' \rangle = v_{\xi '}, \;\; \langle \chi \rangle = \left( \begin{array}{c} v_{\chi } \\ v^*_{\chi } \end{array} \right) ,\nonumber \\&\langle \psi \rangle = v_{\psi } \, \left( \begin{array}{c} 1 \\ 1 \\ 1 \end{array} \right) ,\\&\langle \varphi \rangle = v_{\varphi } \, \left( \begin{array}{c} 1 \\ 1 \\ 1 \end{array} \right) ,\nonumber \end{aligned}$$
(15)

with the parameters \(v_\xi \) and \(v_\varphi \) being real, \(v_{\xi '}\) and \(v_{\psi }\) being imaginary, while \(v_\chi \) is in general complex. Because of \(G_\nu =Z_2 \times \)CP it is irrelevant for the vacuum structure preserving \(G_\nu \) how these fields transform under \(Z_3\). Since left-handed leptons \(l\) are in the triplet \(\mathbf{3'}\) of \(S_4\), only the flavons \(\xi \), \(\chi \) and \(\varphi \) can couple to the Weinberg operator at LO, see Appendix A. Upon breaking to \(G_\nu \) the most general form that the neutrino mass matrix can take is

$$\begin{aligned} m_\nu&= \left( \begin{array}{c@{\quad }c@{\quad }c} y_\xi v_\xi + 2 \, y_\varphi v_\varphi &{} y_\chi v_\chi - y_\varphi v_\varphi &{} y_\chi v^*_\chi - y_\varphi v_\varphi \\ y_\chi v_\chi - y_\varphi v_\varphi &{} y_\chi v^*_\chi + 2 \, y_\varphi v_\varphi &{} y_\xi v_\xi - y_\varphi v_\varphi \\ y_\chi v^*_\chi - y_\varphi v_\varphi &{} y_\xi v_\xi - y_\varphi v_\varphi &{} y_\chi v_\chi + 2 \, y_\varphi v_\varphi \end{array} \right) \nonumber \\&\times \,\,\frac{\langle h_u \rangle ^2}{\Lambda ^2} \end{aligned}$$
(16)

with \(y_\xi \), \(y_\chi \) and \(y_\varphi \) indicating the Yukawa couplings to the flavons \(\xi \), \(\chi \) and \(\varphi \), respectively. These couplings are real because of the invariance of the original theory under a CP symmetry generated by \(X\). \(\Lambda \) is the generic high-energy cutoff scale and it suppresses all higher-dimensional operators appropriately. In our model it is inversely proportional to the neutrino mass scale. As expected, the matrix in Eq. (16) is diagonalised by the PMNS matrix in Eq. (9) (remember that the charged lepton mass matrix is diagonal). If the imaginary part of \(v_\chi \) vanishes, \(m_\nu \) in Eq. (16) becomes \(\mu \tau \) symmetric and \(G_\nu \) is promoted to \(Z_2 \times Z_2 \times \)CP.Footnote 6 Thus, its relative size with respect to the other VEVs is directly related to the size of the reactor mixing angle \(\theta _{13}\). In our model this imaginary part (and consequently \(\theta _{13}\)) is appropriately suppressed, since it arises effectively from an operator with two flavons, whereas flavons transforming as \(\mathbf{1}\) and \(\mathbf{3'}\) contribute through operators with one flavon to the matrix \(m_\nu \). In addition, the contribution corresponding to the real part of \(v_\chi \) is doubly suppressed so that neutrino masses depend effectively on only two parameters at LO in our model (which are determined by the measured mass squared differences). As a consequence, they have to follow a normal ordering.

3 Model with \(S_4\) and CP

As anticipated, we adopt the MSSM framework with left-handed lepton and right-handed charged lepton superfields and neglect all SUSY breaking effects in the following. The flavour symmetry is \(G_f=S_4\times Z_3\) and the CP transformation \(X\) is defined in Eq. (6) for each representation \((\mathbf{r}, q)\) of \(G_f\). The transformation properties under \(G_f\times Z_{16}\) of leptons and MSSM Higgs doublets are collected in Table 1, while those of flavons and driving fields are presented in Tables 2 and 3, respectively. We discuss the results for lepton masses and mixing at LO in Sect. 3.1 and at NLO in Sect. 3.2. All operators directly contributing to the lepton mass matrices are non-renormalisable in our model and are suppressed by appropriate powers of the cutoff scale \(\Lambda \). We consider two different sets of flavons, one responsible for the breaking to \(G_e\) and one for the breaking to \(G_\nu \). At LO the fields whose VEVs leave \(G_e\) invariant only contribute to charged lepton masses, while those with VEVs leaving \(G_\nu \) intact generate neutrino masses. At the subleading level this changes and all flavons contribute to charged lepton as well as neutrino mass matrices.

Table 2 Flavons and their transformation properties under \(G_f\times Z_{16}\)
Table 3 Driving fields and their transformation properties under \(G_f\times Z_{16}\)

3.1 Leading order results

As outlined in Sect. 2, left-handed leptons form a triplet \(\mathbf{3'}\), while right-handed charged leptons are singlets under \(S_4 \times Z_3\). With this assignment the different charged lepton masses can arise at the one-, two- and higher-flavon level in our model and thus are naturally hierarchical. As flavon fields \(\varphi _E \sim (\mathbf{3'}, \omega )\) and \(\chi _E \sim (\mathbf{2}, \omega )\) under \(S_4\times Z_3\) are included.Footnote 7 The most relevant operators for charged lepton masses are thenFootnote 8

$$\begin{aligned} w_e&= y _\tau (l \varphi _E) \tau ^c h_d/\Lambda + y_{\mu , 1} (l \varphi _E^2) \mu ^c h_d/\Lambda ^2\nonumber \\&+\,\, y_{\mu , 2} (l \chi _E \varphi _E) \mu ^c h_d/\Lambda ^2. \end{aligned}$$
(17)

Since we impose CP as symmetry on the theory in the unbroken phase, all couplings in \(w_e\) and in the following superpotentials are real. According to the results of Sect. 4, the flavons \(\chi _E\) and \(\varphi _E\) acquire at LO the VEVs

$$\begin{aligned} \langle \chi _E \rangle = v_{\chi _E} \, \left( \begin{array}{c} 0 \\ 1 \end{array} \right) \;\;\; \text{ and } \;\;\; \langle \varphi _E \rangle = v_{\varphi _E} \, \left( \begin{array}{c} 0 \\ 1 \\ 0 \end{array} \right) \end{aligned}$$
(18)

with \(v_{\chi _E}\) and \(v_{\varphi _E}\) being complex, see Eq. (47). If we compare the form of the VEVs in Eq. (18) with the VEVs given in Eqs. (13, 14), we see that \(\langle \chi _E \rangle \) and \(\langle \varphi _E \rangle \) indeed leave the subgroup \(G_e=Z_3^{(D)}\) of \(S_4 \times Z_3\) invariant. Plugging them into the operators in Eq. (17), we find \(m_l\) to be diagonal in flavour space and to only have non-zero entries in the (22) and (33) elements that correspond to the muon and tau lepton masses

$$\begin{aligned} m_\mu&= \left| (2~y_{\mu ,1} v_{\varphi _E} + y_{\mu ,2} v_{\chi _E}) \, \frac{v_{\varphi _E}}{\Lambda ^2} \right| \, \langle h_d \rangle \;\;\; \text{ and }\nonumber \\&m_\tau = \left| y_\tau \frac{v_{\varphi _E}}{\Lambda } \right| \, \langle h_d \rangle , \end{aligned}$$
(19)

respectively. Their relative hierarchy is correctly reproduced for

$$\begin{aligned} |v_{\chi _E}|, |v_{\varphi _E}| \sim \lambda ^2 \, \Lambda \;\;\; \text{ with } \;\;\; \lambda \approx 0.2. \end{aligned}$$
(20)

Since \(|v_{\varphi _E}|/\Lambda \approx \lambda ^2 \approx 0.04\), \(\tan \beta = \langle h_u \rangle / \langle h_d \rangle \) is expected to lie in the interval \(2 \lesssim \tan \beta \lesssim 12\) (for \(0.5 \lesssim y_\tau \lesssim 3\)). The electron remains massless at this level and its mass is generated via higher-dimensional operators, as discussed in Sect. 3.2.

As mentioned, neutrino masses arise in our model from the Weinberg operator. The flavon multiplets \(\xi _N\), \(\tilde{\xi }_N\) and \(\varphi _{N}\) are in \((\mathbf{1},1)\) and in \((\mathbf{3'},1)\) under \(S_4 \times Z_3\), respectively, and each of them can be coupled independently to neutrinos, while the fields \(\xi '_N \sim (\mathbf{1'}, 1)\) and \(\chi _{N} \sim (\mathbf{2},1)\) couple to the latter in the combination \(\chi _N \xi '_N\). This situation can be arranged by an appropriate choice of the charges under the auxiliary symmetry \(Z_{16}\), see Table 2. Note that the two fields \(\xi _N\) and \(\tilde{\xi }_N\) have the same quantum numbers and both are necessary in order to achieve the correct vacuum alignment with the potential discussed in Sect. 4. However, only one of them acquires a non-vanishing VEV, see Eq. (22). The lowest dimensional operators responsible for neutrino masses are

$$\begin{aligned} w_\nu&= y_{\nu ,1} (l l) \xi _N h_u^2/\Lambda ^2 + \tilde{y}_{\nu ,1} (l l) \tilde{\xi }_N h_u^2/\Lambda ^2 \nonumber \\&+\,\,y_{\nu ,2} (l l \varphi _N) h_u^2/\Lambda ^2+y_{\nu ,3} (l l \chi _N \xi '_N) h_u^2/\Lambda ^3. \end{aligned}$$
(21)

As shown in Sect. 4, the flavons \(\xi _N\), \(\tilde{\xi }_N\), \(\xi '_N\), \(\chi _N\) and \(\varphi _{N}\) develop the following VEVs at LO

$$\begin{aligned}&\langle \tilde{\xi }_N \rangle =0, \;\;\; \langle \xi _N \rangle = v_{\xi _N}, \;\;\; \langle \xi '_N \rangle = v_{\xi '_N}, \nonumber \\&\langle \chi _N \rangle = v_{\chi _N} \, \left( \begin{array}{c} 1 \\ 1 \end{array} \right) ,\langle \varphi _N \rangle = v_{\varphi _N} \, \left( \begin{array}{c} 1 \\ 1 \\ 1 \end{array} \right) \end{aligned}$$
(22)

with \(v_{\xi _N}\), \(v_{\chi _N}\) and \(v_{\varphi _N}\) being complex, while \(v_{\xi '_N}\) is imaginary. The VEVs \(v_{\xi _N}\), \(v_{\chi _N}\) and \(v_{\varphi _N}\) are related via the parameters of the flavon superpotential, see Eq. (50), and thus, under very mild assumptions on these parameters, they carry the same phase (up to \(\pi \)). The neutrino mass matrix \(m_\nu \) takes the form

$$\begin{aligned} m_\nu = \left( \begin{array}{c@{\quad }c@{\quad }c} t_\nu + 2 \, u_\nu &{} - u_\nu - i x_\nu &{} -u_\nu + i x_\nu \\ - u_\nu - i x_\nu &{} 2 \, u_\nu + i x_\nu &{} t_\nu - u_\nu \\ - u_\nu + i x_\nu &{} t_\nu - u_\nu &{} 2 \, u_\nu - i x_\nu \end{array} \right) \, \frac{\langle h_u \rangle ^2}{\Lambda } \end{aligned}$$
(23)

with

$$\begin{aligned} t_\nu = y_{\nu ,1} \, \frac{v_{\xi _N}}{\Lambda }, \;\; u_\nu = y_{\nu ,2} \, \frac{v_{\varphi _N}}{\Lambda }, \;\; x_\nu = i \, y_{\nu , 3} \, \frac{v_{\xi '_N} v_{\chi _N}}{\Lambda ^2}.\nonumber \\ \end{aligned}$$
(24)

The common phase of \(v_{\xi _N}\), \(v_{\chi _N}\) and \(v_{\varphi _N}\) is unphysical as regards lepton masses and mixing at this level, since it can be factored out in \(m_\nu \) in Eq. (23). Thus, the parameters \(t_\nu \), \(u_\nu \) and \(x_\nu \) can be considered as real. Among the leading contributions coming from the operators in Eq. (21) the one comprising the fields \(\xi '_N\) and \(\chi _N\) renders \(m_\nu \) invariant only under \(G_\nu =Z_2 \times \)CP and not \(Z_2 \times Z_2 \times \)CP. Note that the matrix in Eq. (23) slightly differs from the one in Eq. (16), since, after extracting the overall phase, the combination \(\chi _N \xi '_N\) only develops an imaginary VEV. The latter is responsible for a non-vanishing value of \(\theta _{13}\) and is naturally smaller than the other contributions to \(m_\nu \), because it arises from an operator with two flavons. The correct size of \(\theta _{13}\) is achieved, if the relative strength of this contribution is suppressed by a factor \(\lambda \) with respect to the ones from the first three operators in Eq. (21). This is the case, if we chooseFootnote 9

$$\begin{aligned} |v_{\xi _N}|, |v_{\xi '_N}|, |v_{\chi _N}|, |v_{\varphi _N}| \sim \lambda \, \Lambda . \end{aligned}$$
(25)

As mentioned in Sect. 2, the neutrino mass matrix \(m_\nu \) is diagonalised by a mixing matrix of the form given in Eq. (9). At LO this matrix gives the PMNS mixing matrix, since the charged lepton mass matrix is diagonal at this level. Thus, maximal atmospheric mixing and a maximal Dirac phase are obtained. The angle \(\theta \) is directly related to \(\theta _{13}\), see Eq. (10), and it depends on the two parameters \(u_\nu \) and \(x_\nu \) in our model

$$\begin{aligned} \tan 2 \theta = \frac{x_\nu }{\sqrt{3} \, u_\nu }. \end{aligned}$$
(26)

Our predictions for the mixing parameters, especially maximal atmospheric mixing and the maximal Dirac phase, can be tested in various ways: indirectly, with global fits of all neutrino data, e.g. [19], as well as directly, with the experiments T2K and NO\(\nu \)A which could determine after five to six years of data taking the octant of the atmospheric mixing angle at the \((2 \div 3) \, \sigma \) level [58], depending on the actual value of \(\theta _{23}\). The same experiments can possibly exclude certain ranges of the Dirac phase within the next ten years, depending on \(\theta _{23}\) and the neutrino mass hierarchy [59]. Indeed, T2K has recently reported a slight preference for \(\delta \) close to \(3 \, \pi /2\) [60], when their data are combined with the measurements of the reactor experiments. This preference is compatible with the \(1 \, \sigma \) preferred range for \(\delta \), \(\pi \lesssim \delta \lesssim 1.9 \, \pi \), found in global fits [19, 61]. Dedicated experiments like LBNE [62] and Hyper-Kamiokande [63, 64] will allow for a measurement of the Dirac phase with a certain precision and thus have the potential to rule out our model. The neutrino masses read

$$\begin{aligned}&m_1 = \left| t_\nu + \tilde{u}_\nu \right| \, \frac{\langle h_u \rangle ^2}{\Lambda } \; , \;\; m_2 = \left| t_\nu \right| \, \frac{\langle h_u \rangle ^2}{\Lambda },\nonumber \\&\quad m_3 = \left| t_\nu - \tilde{u}_\nu \right| \, \frac{\langle h_u \rangle ^2}{\Lambda } \end{aligned}$$
(27)

and we have defined

$$\begin{aligned} \tilde{u}_\nu = \sqrt{3} \, \text{ sign }(u_\nu \cos 2 \theta ) \, \sqrt{3 u_\nu ^2 + x_\nu ^2}. \end{aligned}$$
(28)

Neutrino masses as in Eq. (27) imply the normal mass hierarchy: the requirement \(m_2 > m_1\) entails \(t_\nu \tilde{u}_\nu <0\) and \(2 \, |t_\nu | \gtrsim |\tilde{u}_\nu |\) and, consequently, the atmospheric mass squared difference

$$\begin{aligned} \Delta m_{\mathrm {atm}}^2 = m_3^2 - m_1^2 = - 4 \, t_\nu \tilde{u}_\nu \, \left( \frac{\langle h_u \rangle ^2}{\Lambda } \right) ^2 \end{aligned}$$
(29)

is always positive. The smallness of the ratio \(\Delta m_{\mathrm {sol}}^2/\Delta m_{\mathrm {atm}}^2\) is achieved by an appropriate tuning of \(2 \, t_\nu + \tilde{u}_\nu \). The normal mass hierarchy is an important consequence of the fact that \(\chi _N \xi '_N\) only develops an imaginary VEV, which implies that \(m_\nu \) contains only three real parameters at LO, see Eq. (23).

Since the neutrino masses only depend on the two real parameters \(t_\nu \) and \(\tilde{u}_\nu \), both of them can be fixed by the measured values of the mass squared differences and the absolute scale of neutrino masses and also their sum are predicted. Taking the best fit values for \(\Delta m_\mathrm{sol}^2\) and \(\Delta m_\mathrm{atm}^2\) quoted in [19] we get

$$\begin{aligned} m_1 \approx 0.016 \, \mathrm {eV}, \;\;\; m_2 \approx 0.018 \, \mathrm {eV}, \;\;\; m_3 \approx 0.052 \, \mathrm {eV}\nonumber \\ \end{aligned}$$
(30)

and

$$\begin{aligned} \Sigma \, m_\nu \approx 0.086 \, \mathrm {eV}. \end{aligned}$$
(31)

This value of the sum of the neutrino masses is well compatible with the latest results from the Planck satellite [65]. For \(\tan \beta \approx 10\) and \(\Lambda \approx 3 \times 10^{14} \, \mathrm {GeV}\), \(\langle h_u \rangle ^2/\Lambda = 0.1 \, \mathrm {eV}\) follows and the parameters \(t_\nu \), \(u_\nu \) and \(x_\nu \) are of order \(\lambda \), \(\lambda \) and \(\lambda ^2\), respectively. The quantity \(m_\beta \), which is measured in \(\beta \) decay experiments, is \(m_\beta \approx 0.018 \, \mathrm {eV}\), which is an order of magnitude below the expected sensitivity of the KATRIN experiment [66]. Thus, a positive signal of KATRIN or of the proposed experiments Project 8, ECHo, MARE and PTOLEMY could rule out our model, see e.g. [67]. Also an experiment like EUCLID [68] which could reach a \(1 \, \sigma \) precision of 0.01 eV for the sum of the neutrino masses [69] could disfavour our model. An exclusion of the wrong mass ordering is expected at the \(3 \, \sigma \) level with experiments like PINGU, ORCA, JUNO and RENO50 (and LBNE), within the next 10 to 15 years, for recent studies see [70, 71]. At any rate the information on the mass hierarchy will be useful for improving the reach of measuring e.g. CP violation.

Both Majorana phases \(\alpha \) and \(\beta \) are determined by the form of the neutrino masses in Eq. (27) and by the requirement \(t_\nu \tilde{u}_\nu < 0\)

$$\begin{aligned} \alpha = \pi , \,\, \beta =\pi \end{aligned}$$
(32)

and the quantity \(m_{ee}\), which encodes neutrinoless double \(\beta \) decay [72, 73] is \(m_{ee} \approx 0.003 \, \mathrm {eV}\). Such a low value is experimentally very challenging [7276]. On the other hand, if a positive signal is found, our model would be disfavoured.

3.2 Next-to-leading order results

In this subsection we address relevant contributions arising from shifts in the LO vacuum and from operators with more flavons than those discussed above. It is important to keep in mind in the following that we have chosen different expansion parameters in the charged lepton and neutrino sectors, see Eqs. (20, 25).

We first discuss the most important corrections to the neutrino mass matrix. These arise from the shift in the VEV of the flavon \(\chi _N\), see Eq. (54), if plugged into the fourth operator of Eq. (21), and they lead to a contribution to the neutrino mass matrix in Eq. (23) which is suppressed by \(\lambda ^2\) with respect to those coming from the first three terms in Eq. (21). This new contribution cannot be absorbed via a re-definition of the three parameters \(t_\nu \), \(u_\nu \) and \(x_\nu \) in \(m_\nu \), but instead it requires a fourth parameter \(p_\nu \). This parameter is real, since the corrections to the VEV of \(\chi _N\) as well as \(v_{\xi '_N}\) are both imaginary and all couplings are real. Furthermore, the contribution is \(\mu \tau \)-symmetric in flavour space. The corrected neutrino mass matrix thus reads

$$\begin{aligned}&m_\nu ^{\mathrm {NLO}}\nonumber \\&\quad =\left( \begin{array}{c@{\quad }c@{\quad }c} t_\nu + 2 u_\nu &{} - u_\nu - i x_\nu + p_\nu &{} -u_\nu + i x_\nu + p_\nu \\ - u_\nu - i x_\nu + p_\nu &{} 2 u_\nu + i x_\nu + p_\nu &{} t_\nu - u_\nu \\ - u_\nu + i x_\nu + p_\nu &{} t_\nu - u_\nu &{} 2 u_\nu - i x_\nu + p_\nu \end{array} \right) \nonumber \\&\qquad \times \,\,\frac{\langle h_u \rangle ^2}{\Lambda } \end{aligned}$$
(33)

with

$$\begin{aligned} p_\nu = i \, y_{\nu , 3} \, \alpha \, \frac{v_{\xi '_N} v_{\chi _N}}{\Lambda ^2} \, \lambda \, \end{aligned}$$
(34)

being real, since \(i \, v_{\xi '_N}\) is real. This matrix has the most general form which is compatible with the preservation of the group \(G_\nu \) in the neutrino sector (modulo the overall phase which also shows up in the parameter \(p_\nu \) and thus can be factored out). Consequently, the results for the mixing parameters remain unchanged, whereas the neutrino masses, see Eq. (27), acquire corrections proportional to \(p_\nu \). Since \(p_\nu \) is suppressed by \(\lambda ^2\) with respect to \(t_\nu \) and \(u_\nu \) (and thus \(\tilde{u}_\nu \)), the correction to the LO results for the neutrino masses is of relative order \(\lambda ^2\).

The largest correction to the mass matrix \(m_\nu \) which alters its structure in such a way that it is no longer compatible with the preservation of the residual symmetry \(G_\nu \) arises at order \(\lambda ^6\) in units of \(\langle h_u \rangle ^2/\Lambda \).Footnote 10 It originates from shifting the vacuum of the field \(\xi '_N\) so that \(\langle \xi '_N \rangle \) gets a small real part, being suppressed by \(\lambda ^4\) with respect to \(v_{\xi '_N}\) (which is imaginary). This shift contributes to \(m_\nu \) through the fourth operator in Eq. (21) and leads to unequal real parts of the (12) and (13) and (22) and (33) elements of \(m_\nu \), respectively, whose difference is of relative order \(\lambda ^5\) compared to the LO term in these elements. If we include operators with more flavons and compute their contribution using the LO vacuum of these fields, corrections to \(m_\nu \) that break the residual symmetry \(G_\nu \) of \(m_\nu ^{\mathrm {NLO}}\) in Eq. (33) are generated at order \(\lambda ^7\) (and smaller) in units of \(\langle h_u \rangle ^2/\Lambda \). The most relevant ones contain three flavons belonging to the set \(\{ \chi _E, \varphi _E \}\) and one being either \(\xi _N\), \(\tilde{\xi }_N\) or \(\varphi _N\). Both types of corrections will affect our results for the mixing parameters. However, they are suppressed by at least \(\lambda ^4\) with respect to the parameters in \(m_\nu \) in Eq. (23) and so can be safely neglected.

In the same fashion we can analyze the subleading contributions to the charged lepton mass matrix which is diagonal at LO and does not allow for a non-zero electron mass. The latter instead is generated in two ways: first through operators with five flavons with an index \(N\), if we take into account the shift in the VEV of the flavon \(\chi _N\) (in the following we only mention operators that actually give a non-zero contribution to the electron mass, omit all (real) Yukawa couplings and do not specify whether these operators arise from only one independent \(S_4\) contraction or whether there are several independent ones):

$$\begin{aligned}&l e^c \xi _N^3 \chi _N \varphi _N h_d/\Lambda ^5 + l e^c \xi _N \chi _N \varphi _N^3 h_d/\Lambda ^5\nonumber \\&\quad +\,\, l e^c \xi _N \chi _N^3 \varphi _N h_d/\Lambda ^5, \end{aligned}$$
(35)

and secondly through operators with six flavons with an index \(N\) (one of them being \(\xi '_N\), an even number of fields \(\chi _N\) and the rest either \(\xi _N\), \(\tilde{\xi }_N\) or \(\varphi _N\)), if we use the LO vacuum of these fields,

$$\begin{aligned}&l e^c \xi _N^4 \xi '_N \varphi _N h_d/\Lambda ^6 + l e^c \xi _N^2 \xi '_N \varphi _N^3 h_d/\Lambda ^6 + l e^c \xi '_N \varphi _N^5 h_d/\Lambda ^6\nonumber \\&\quad +\,\, l e^c \xi _N^2 \xi '_N \chi _N^2 \varphi _N h_d/\Lambda ^6 + l e^c \xi '_N \chi _N^2 \varphi _N^3 h_d/\Lambda ^6 \nonumber \\&\quad +\,\, l e^c \xi '_N \chi _N^4 \varphi _N h_d/\Lambda ^6. \end{aligned}$$
(36)

All these operators induce contributions to the \((1i)\) elements, \(i=1,2,3\) of the charged lepton mass matrix that are of the order \(\lambda ^6 \, \langle h_d \rangle \) and equal. They lead to an electron mass of order \(\lambda ^6 \langle h_d \rangle \), which is in agreement with the observed mass hierarchy among the charged leptons. The reason for these operators to give rise to a contribution that is equal for all three elements of the first row of \(m_l\) is the following: \(l e^c h_d \sim (\mathbf{3}, 1)\) under \(S_4 \times Z_3\) and the most general form of the VEV of a field in the representation \(\mathbf{3}\) of \(S_4\), which leaves the group \(G_\nu \) invariant is a vector with three equal entries, see Eq. (15), so that the contribution to the charged lepton mass matrix involving only fields with an index \(N\) leads to equal \((1i)\) elements.

Also the other elements of the charged lepton mass matrix are corrected at a subleading level. One type of corrections originates from the shifts in the VEVs of the fields \(\chi _E\) and \(\varphi _E\), see Eq. (56), and it results in \((31)\) and \((32)\) elements of order \(\lambda ^6\) and \((21)\) and \((23)\) elements of order \(\lambda ^8\) in units of \(\langle h_d \rangle \). A second source of corrections are operators with more flavons which turn out to have the following structure: an operator with \(\mu ^c\) comprises two fields from the set \(\{ \chi _E, \varphi _E \}\) in order to saturate the \(Z_3\) charge and either one field \(\chi _N\) together with three flavons of the type \(\{ \xi _N, \tilde{\xi }_N, \varphi _N \}\) or three fields \(\chi _N\) and one field \(\xi _N\), \(\tilde{\xi }_N\) or \(\varphi _N\), whereas an operator with \(\tau ^c\) only requires one field from the set \(\{ \chi _E, \varphi _E\}\), while otherwise maintaining the same form as the operator with \(\mu ^c\), e.g.

$$\begin{aligned} l \mu ^c \chi _E^2 \xi _N^2 \varphi _N \chi _N h_d/\Lambda ^6 + l \tau ^c \varphi _E \xi _N^3 \chi _N h_d/\Lambda ^5. \end{aligned}$$
(37)

Taking into account the LO form of the flavon VEVs, we see that such operators contribute at the same level to the charged lepton mass matrix as the leading operators, if the latter are computed with the shifted vacuum of \(\chi _E\) and \(\varphi _E\). Eventually we find the order of magnitudes of the different entries of the charged lepton mass matrix, including subleading corrections, to be

$$\begin{aligned} m_l^{\mathrm {NLO}} \sim \left( \begin{array}{c@{\quad }c@{\quad }c} \lambda ^6 &{} \lambda ^6 &{} \lambda ^6\\ \lambda ^8 &{} \lambda ^4 &{} \lambda ^8\\ \lambda ^6 &{} \lambda ^6 &{} \lambda ^2 \end{array} \right) \; \langle h_d \rangle . \end{aligned}$$
(38)

The charged lepton masses thus follow the correct hierarchy

$$\begin{aligned} m_e : m_\mu : m_\tau \approx \lambda ^4 : \lambda ^2 : 1 \end{aligned}$$
(39)

with \(m_\tau \approx \lambda ^2 \, \langle h_d \rangle \) and we can estimate the size of the contribution to the lepton mixing angles coming from the charged lepton sector as

$$\begin{aligned} \theta _{ij}^l \sim \lambda ^4 \;\;\; \text{ with } \;\;\; ij=12,13,23. \end{aligned}$$
(40)

Such corrections have a small impact on our results for the lepton mixing parameters and can be neglected.

4 Flavon superpotential

In order to construct the superpotential responsible for the alignment of the flavon VEVs we assume the existence of an \(R\)-symmetry \(U(1)_R\) [5] under which matter superfields carry charge +1, fields acquiring a non-vanishing vacuum (i.e. \(h_{u,d}\) and all flavons) are uncharged and so-called driving fields, indicated with the superscript “0” carry charge +2. In this way all terms in the superpotential either contain two matter superfields (like the terms leading to lepton masses) or one driving field, like those to be discussed in this section. In the limit of unbroken SUSY, the F-terms of these fields have to vanish and in this way the vacuum of the flavons gets aligned.Footnote 11 The driving fields necessary for our construction of the flavon superpotential are given in Table 3. Their transformation properties under the flavour and auxiliary symmetries are chosen in such a way that, at the renormalisable level, the vacua of the sets of fields \(\{ \chi _E , \varphi _E \}\), \(\{ \xi _N, \tilde{\xi }_N, \chi _N, \varphi _N \}\) and \(\xi '_N\) are aligned separately: the vacuum of the first set of fields spontaneously breaks \(G_\mathrm{CP}\) to \(Z_3^{(D)}\), while the one of the second set breaks \(G_\mathrm{CP}\) to \(Z_2 \times G_\nu \) (up to the overall phase of the VEVs, see Eq. (50), which is irrelevant for lepton masses and mixing at LO), which is broken to \(G_\nu \) if the vacuum of \(\xi '_N\) is also considered.

4.1 Leading order results

We can divide the flavon superpotential \(w_\mathrm{fl}\) in three parts, \(w_{\mathrm{fl},e}\), \(w_{\mathrm{fl},\nu }\) and \(w_{\mathrm{fl},\xi }\),

$$\begin{aligned} w_\mathrm{fl}= w_{\mathrm{fl},e} + w_{\mathrm{fl},\nu } + w_{\mathrm{fl},\xi } \end{aligned}$$
(41)

with

$$\begin{aligned} w_{\mathrm{fl},e}&= a_e \, \xi ^0_E (\chi _E \chi _E) + \tilde{a}_e \, \tilde{\xi }^0_E (\varphi _E \varphi _E) + b_e \, (\chi ^0_E \chi _E \chi _E)\nonumber \\&+\,\, c_e \, (\chi ^0_E \varphi _E \varphi _E), \end{aligned}$$
(42)
$$\begin{aligned} w_{\mathrm{fl}, \nu }&= a_\nu \, \xi ^0_N \xi ^2_N + \tilde{a}_\nu \, \xi ^0_N \tilde{\xi }^2_N + \bar{a}_\nu \, \xi ^0_N \xi _N \tilde{\xi }_N \!+\! b_\nu \, \xi ^0_N (\chi _N \chi _N)\nonumber \\&+\,\, c_\nu \, \xi ^0_N (\varphi _N \varphi _N) +\,\, d_\nu \, (\chi ^0_N \chi _N \chi _N) \!+\! e_\nu \, (\chi ^0_N \varphi _N \varphi _N)\nonumber \\&+\,\, f_\nu \, \tilde{\xi }_N (\varphi _N^0 \varphi _N) + g_\nu \, (\varphi _N^0 \varphi _N \varphi _N) \end{aligned}$$
(43)

and

$$\begin{aligned} w_{\mathrm{fl},\xi } = \tilde{\xi }^0_N M^2 + a_\xi \tilde{\xi }^0_N (\xi '_N \xi '_N). \end{aligned}$$
(44)

A few things are noteworthy: all couplings are real, since the theory is invariant under CP in the unbroken phase; the driving fields \(\xi ^0_E\) and \(\tilde{\xi }^0_E\) are defined in such a way that the former only couples to \((\chi _E \chi _E)\) and the latter only to \((\varphi _E \varphi _E)\); similarly, we use the freedom to define the fields \(\xi _N\) and \(\tilde{\xi }_N\) in such a way that only \(\tilde{\xi }_N\) couples to \(\varphi _N\) (at the renormalisable level); moreover, we neglect all terms containing the MSSM Higgs doublets \(h_{u,d}\), since these are completely irrelevant in the discussion of the alignment of the flavon VEVs, which are typically of order \(10^{13}\) GeV.

The F-terms of the driving fields with an index \(E\) are of the form

$$\begin{aligned}&\frac{\partial w_\mathrm{fl}}{\partial \xi ^0_E} = 2 a_e \, \chi _{E,1} \chi _{E,2},\end{aligned}$$
(45a)
$$\begin{aligned}&\frac{\partial w_\mathrm{fl}}{\partial \tilde{\xi }^0_E} = \tilde{a}_e \, (\varphi ^2_{E,1} + 2 \varphi _{E,2} \varphi _{E,3}),\end{aligned}$$
(45b)
$$\begin{aligned}&\frac{\partial w_\mathrm{fl}}{\partial \chi ^0_{E,1}} = b_e \, \chi ^2_{E,1} + c_e \, (\varphi ^2_{E,3} + 2 \varphi _{E,1} \varphi _{E,2}),\end{aligned}$$
(45c)
$$\begin{aligned}&\frac{\partial w_\mathrm{fl}}{\partial \chi ^0_{E,2}} = b_e \, \chi ^2_{E,2} + c_e \, (\varphi ^2_{E,2} + 2 \varphi _{E,1} \varphi _{E,3}). \end{aligned}$$
(45d)

Equating these to zero and solving for the VEVs of the flavons \(\chi _E\) and \(\varphi _E\) we get

$$\begin{aligned} \langle \chi _E \rangle = v_{\chi _E} \, \left( \begin{array}{c} 0 \\ 1 \end{array} \right) \;\;\; \text{ and } \;\;\; \langle \varphi _E \rangle = v_{\varphi _E} \, \left( \begin{array}{c} 0 \\ 1 \\ 0 \end{array} \right) \end{aligned}$$
(46)

up to symmetry transformations belonging to \(S_4\). Obviously, also the trivial vacuum is a solution which, however, can easily be excluded by requiring that at least one of the flavons \(\chi _E\) and \(\varphi _E\) acquires a non-vanishing VEV. The two VEVs \(v_{\chi _E}\) and \(v_{\varphi _E}\) are related through

$$\begin{aligned} v_{\varphi _E}^2 = - \frac{b_e}{c_e} \, v_{\chi _E}^2 \end{aligned}$$
(47)

with \(v_{\chi _E}\) being a free parameter which is in general complex. The presence of this parameter indicates a flat direction which exists in the flavon potential in the SUSY limit. Since \(v_{\chi _E}\) and \(v_{\varphi _E}\) are directly related, it is natural to expect them to be of the same order of magnitude, as we did in the preceding discussion. Since their size is related to a flat direction, we, however, cannot fix it, but we choose it by hand in order to correctly accommodate the hierarchy among the tau and the muon mass, see Eqs. (19,20).

Analogously, the F-terms of the driving fields \(\xi ^0_N\), \(\chi ^0_N\) and \(\varphi ^0_N\) are given by

$$\begin{aligned}&\frac{\partial w_\mathrm{fl}}{\partial \xi ^0_N} = a_\nu \, \xi ^2_N + \tilde{a}_\nu \, \tilde{\xi }^2_N + \bar{a}_\nu \, \xi _N \tilde{\xi }_N + 2~ b_\nu \, \chi _{N,1} \chi _{N,2} \nonumber \\&\qquad \qquad +\,\,c_\nu \, (\varphi _{N,1}^2 + 2 \varphi _{N,2} \varphi _{N,3}),\end{aligned}$$
(48a)
$$\begin{aligned}&\frac{\partial w_\mathrm{fl}}{\partial \chi ^0_{N,1}} = d_\nu \, \chi _{N,1}^2 + e_\nu \, (\varphi _{N,3}^2 + 2 \varphi _{N,1} \varphi _{N,2}),\end{aligned}$$
(48b)
$$\begin{aligned}&\frac{\partial w_\mathrm{fl}}{\partial \chi ^0_{N,2}} = d_\nu \, \chi _{N,2}^2 + e_\nu \, (\varphi _{N,2}^2 + 2 \varphi _{N,1} \varphi _{N,3}),\end{aligned}$$
(48c)
$$\begin{aligned}&\frac{\partial w_\mathrm{fl}}{\partial \varphi ^0_{N,1}} = f_\nu \, \tilde{\xi }_N \varphi _{N,1} + 2 \, g_\nu \, (\varphi _{N,1}^2 - \varphi _{N,2} \varphi _{N,3}),\end{aligned}$$
(48d)
$$\begin{aligned}&\frac{\partial w_\mathrm{fl}}{\partial \varphi ^0_{N,2}} = f_\nu \, \tilde{\xi }_N \varphi _{N,3} + 2 \, g_\nu \, (\varphi _{N,2}^2 - \varphi _{N,1} \varphi _{N,3}),\end{aligned}$$
(48e)
$$\begin{aligned}&\frac{\partial w_\mathrm{fl}}{\partial \varphi ^0_{N,3}} = f_\nu \, \tilde{\xi }_N \varphi _{N,2} + 2 \, g_\nu \, (\varphi _{N,3}^2 - \varphi _{N,1} \varphi _{N,2}). \end{aligned}$$
(48f)

Equating these to zero, we find several possible solutions for the vacuum. These can be classified and have different features: one class is formed by the trivial vacuum, one class by solutions with vanishing VEVs of the fields \(\chi _N\) and \(\varphi _N\), a third class comprises the vacua in which the VEV of \(\chi _N\) vanishes, while the other flavons acquire non-zero VEVs, and the last class requires \(\langle \tilde{\xi }_N \rangle =0\) and all other VEVs to be different from zero. We are indeed interested in this last class and, as one sees, it can be selected by requiring that (at least) the flavon \(\chi _N\) being in the two-dimensional representation of \(S_4\) acquires a non-zero VEV. Then all solutions belonging to this class are related by \(S_4\) transformations and/or by a change of the relative sign of the VEVs of the two components of the flavon \(\chi _N\). The latter feature can be traced back to the fact that the expressions in Eqs. (48b, 48c) only depend quadratically on \(\chi _{N,i}\). Given these different possibilities, we choose as vacuum

$$\begin{aligned} \langle \xi _N \rangle =v_{\xi _N}, \;\;\; \langle \chi _N \rangle = v_{\chi _N} \, \left( \begin{array}{c} 1 \\ 1 \end{array} \right) \; , \;\;\; \langle \varphi _N \rangle = v_{\varphi _N} \, \left( \begin{array}{c} 1 \\ 1 \\ 1 \end{array} \right) \nonumber \\ \end{aligned}$$
(49)

and \(\langle \tilde{\xi }_N \rangle =0\).Footnote 12 As can be derived from Eqs. (48a48c), the VEVs \(v_{\xi _N}\), \(v_{\chi _N}\) and \(v_{\varphi _N}\) are related through

$$\begin{aligned} v^2_{\xi _N} = \frac{3}{a_\nu } \, \left( 2~ \frac{b_\nu e_\nu }{d_\nu } - c_\nu \right) \, v_{\varphi _N}^2 \;\;\; \text{ and } \;\;\; v_{\chi _N}^2= - 3 \, \frac{e_\nu }{d_\nu } \, v_{\varphi _N}^2\nonumber \\ \end{aligned}$$
(50)

with \(v_{\varphi _N}\) parametrising the second flat direction of our flavon potential in the SUSY limit. This parameter is in general complex so that also the VEVs of \(\xi _N\), \(\chi _N\) and \(\varphi _N\) are in general complex. Apparently, CP is broken in this vacuum and thus in the neutrino sector. However, as explained in Sect. 3.1, such violation of CP is irrelevant for (the residual symmetry of) the neutrino mass matrix \(m_\nu \) and thus the lepton mixing (at LO), as long as the VEVs \(v_{\xi _N}\), \(v_{\chi _N}\) and \(v_{\varphi _N}\) have the same phase up to \(\pi \), because the latter becomes then an overall phase which can be factored out in \(m_\nu \), see Eqs. (23,24). From Eq. (50) we see that this is achieved for \(e_\nu /d_\nu <0\) and \((2 b_\nu e_\nu /d_\nu - c_\nu )/a_\nu >0\). Another way to show that the common phase of the VEVs \(v_{\xi _N}\), \(v_{\chi _N}\) and \(v_{\varphi _N}\) is unphysical is to re-define the fields of the model so that this phase becomes an overall phase of the LO superpotential. The VEVs are expected to have the same order of magnitude, which we choose to be \(\lambda \, \Lambda \), since they are directly related through the parameters of \(w_{\mathrm{fl},\nu }\), see Eq. (50). Eventually, requiring the F-term of \(\tilde{\xi }^0_N\) to vanish leads to

$$\begin{aligned} v^2_{\xi '_N} = - \frac{M^2}{a_\xi }. \end{aligned}$$
(51)

For \(M^2 /a_\xi >0\) we immediately see that the VEV of \(\xi '_N\) is imaginary, as it should in order to be compatible with \(G_\nu \). Notice for \(M \sim \lambda \, \Lambda \) also \(\langle \xi '_N \rangle \) is of this order of magnitude. Such type of alignment has been proposed in [77].

4.2 Next-to-leading order results

We now turn to the discussion of the non-renormalisable contributions to the flavon superpotential. These induce shifts in the vacuum obtained at LO. Taking into consideration the sizes of the LO VEVs (fields with an index \(E\) have a VEV of order \(\lambda ^2 \, \Lambda \), whereas those with an index \(N\) have VEVs of order \(\lambda \, \Lambda \)), we find the most relevant subleading corrections to arise from the following three operators

$$\begin{aligned}&s_{\nu , 1} \, \xi _N \, (\chi _N^0 \xi '_N \chi _N)/\Lambda + \tilde{s}_{\nu , 1} \, \tilde{\xi }_N \, (\chi _N^0 \xi '_N \chi _N)/\Lambda \nonumber \\&\quad +\,\, s_{\nu , 2} \, (\varphi _N^0 \xi '_N \chi _N \varphi _N)/\Lambda . \end{aligned}$$
(52)

Adding these to \(w_\mathrm{fl}\), re-computing the F-terms in a linear expansion in the shifts of the vacuum and equating them to zero, we find that they give rise to a correction of the VEV of \(\chi _N\)

$$\begin{aligned} \delta v_{\chi _{N,1}} = -\delta v_{\chi _{N,2}} = \frac{s_{\nu ,1}}{2 \, d_\nu } \, \frac{v_{\xi _N}}{\Lambda } \, v_{\xi '_N} \end{aligned}$$
(53)

with \(\delta v_{\chi _{N,i}}\) denoting the shifts of the vacuum of the \(i\mathrm{th}\) component of \(\chi _N\), \(\langle \chi _{N,i} \rangle = v_{\chi _N} + \delta v_{\chi _{N,i}}\). Since \(\delta v_{\chi _{N,i}}\) are proportional to \(v_{\xi _N}\), they carry the phase that is associated with the second flat direction of the flavon potential and that is also carried by the LO VEVs of the fields \(\xi _N\), \(\chi _N\) and \(\varphi _N\). At the same time, \(\delta v_{\chi _{N,i}}\) are proportional to \(v_{\xi '_N}\), which is imaginary. Their relative suppression with respect to the LO VEVs of the flavons with index \(N\) is \(\lambda \). Thus, the shifted vacuum of \(\chi _N\) can be parameterised as

$$\begin{aligned} \langle \chi _N \rangle = v_{\chi _N} \, \left( \begin{array}{c} 1 + i \, \alpha \, \lambda \\ 1 - i \, \alpha \, \lambda \end{array} \right) \end{aligned}$$
(54)

with \(\alpha \) real. Its form leaves the group \(G_\nu \) invariant (up to the above-mentioned phase of \(v_{\chi _N}\)), see Eq. (15), because the operators in Eq. (52) that induce the shift only contain fields with index \(N\) (especially, exactly one of them is \(\xi '_N\)). The other VEVs do not acquire non-trivial shifts at this order. Indeed, all terms, which lead to contributions of order \(\lambda ^5 \, \Lambda ^2\) or larger to the F-terms of the driving fields \(\xi ^0_N\), \(\chi ^0_N\), \(\varphi ^0_N\) and \(\tilde{\xi }^0_N\), if the LO flavon VEVs are plugged in, only comprise flavons with an index \(N\) and thus cannot lead further non-trivial shifts in the vacuum.Footnote 13 The most relevant terms including the flavons \(\chi _E\) and \(\varphi _E\) which do change the alignment are

$$\begin{aligned} \tilde{\xi }^0_N (\varphi _E^3)/\Lambda + \tilde{\xi }^0_N (\varphi _E^2 \chi _E)/\Lambda + \tilde{\xi }^0_N (\chi _E^3)/\Lambda . \end{aligned}$$
(55)

Their contribution to the F-term of \(\tilde{\xi }^0_N\) is of order \(\lambda ^6 \, \Lambda ^2\) and hence they are relatively suppressed by \(\lambda ^4\) compared to the contribution from the LO terms in Eq. (44) [see also Eq. (51) and below]. Since the VEVs of \(\chi _E\) and \(\varphi _E\) carry an undetermined phase, see Eq. (47), such contributions lead to a shift in the phase of the VEV of \(\xi '_N\) so that it acquires a small real part which is suppressed by \(\lambda ^4\) with respect to \(v_{\xi '_N}\).

Similarly, we can discuss the subleading contributions to \(w_{\mathrm{fl},e}\). The ones leading to shifts in the vacuum of \(\chi _E\) and \(\varphi _E\) that cannot be absorbed into its LO form, given in Eq. (46), comprise six flavons. Two of them belong to the set \(\{ \chi _E, \varphi _E \}\) and are necessary in order to saturate the \(Z_3\) charge, while the other four ones carry an index \(N\) (there are either three fields of the set \(\{\xi _N, \tilde{\xi }_N, \varphi _N \}\) and one power of \(\chi _N\) or three fields \(\chi _N\) and one of the set \(\{\xi _N, \tilde{\xi }_N, \varphi _N \}\)). Assuming the sizes of the VEVs, as discussed above, these contributions to the F-terms of the driving fields \(\xi ^0_E\), \(\tilde{\xi }^0_E\) and \(\chi ^0_{E}\) are of the order \(\lambda ^8 \, \Lambda ^2\) and are thus suppressed by \(\lambda ^4\) with respect to the contributions from the renormalisable terms in \(w_{\mathrm{fl},e}\), see Eqs. (42, 45). Consequently, they induce shifts in the VEVs of \(\chi _E\) and \(\varphi _E\) at relative order \(\lambda ^4\) such that the latter take the form

$$\begin{aligned} \langle \chi _E \rangle \!=\! v_{\chi _E} \, \left( \! \begin{array}{c} \beta \, \lambda ^4 \\ 1 \end{array} \!\right) \quad \text{ and } \quad \langle \varphi _E \rangle \!=\! v_{\varphi _E} \, ( 1 \!+\! \gamma \, \lambda ^2) \, \left( \! \begin{array}{c} \epsilon \, \lambda ^4 \\ 1 \\ \eta \, \lambda ^4 \end{array} \!\right) \nonumber \\ \end{aligned}$$
(56)

with \(\beta \), \(\gamma \), \(\epsilon \) and \(\eta \) having absolute values of order one and being in general complex.

5 Summary

In this paper we have constructed a SUSY model for leptons with the flavour group \(G_f=S_4 \times Z_3\) and a generalised CP symmetry. The spontaneous breaking of these symmetries to a \(Z_3\) subgroup in the charged lepton and to \(Z_2 \times \)CP (\(\times Z_3\)) in the neutrino sector leads to a mixing matrix with one free parameter. The latter is determined by the elements of the neutrino mass matrix and is directly related to the reactor mixing angle. This angle is naturally of the correct order \(\lambda \) in our model due to its origin from a subleading operator. At the same time, the other mixing parameters are also predicted, especially the atmospheric mixing angle is maximal as well as the Dirac phase. The two Majorana phases are trivial. The solar mixing angle fulfills \(\sin ^2 \theta _{12} \gtrsim 1/3\). We predict the three neutrino masses to effectively depend on only two parameters at LO and thus the mass spectrum can only be normally ordered with the absolute neutrino mass scale being determined. We have discussed in detail the vacuum alignment of the flavour and CP symmetry breaking fields and have shown that this alignment is stable against the inclusion of higher-order terms in the flavon superpotential. We have computed leading as well as subleading contributions to the charged lepton and neutrino mass matrices and have shown that all our predictions are only slightly corrected by the latter contributions. The most relevant effect of the subleading corrections is the generation of the electron mass. Indeed, the charged lepton mass hierarchy is correctly reproduced without any tuning in our model, because the different lepton masses arise from operators with one, two and several flavons, respectively.

It would be interesting to extend our predictive and successful model from the lepton to the quark sector, for example in the framework of a SUSY grand unified theory.

Note Added: During the completion of this manuscript another paper [78] has been submitted to the arXiv that contains similar SUSY models in which the flavour group \(S_4\) and a generalised CP symmetry are broken to \(Z_2 \times \)CP in the neutrino sector and the charged lepton mass matrix is diagonal.