1 Introduction

In the standard model, the number of fermion generations appears as an arbitrary parameter, meaning that a mathematically consistent theory can be built up using any number of fermion generations. The same is true for many extensions of the standard model, including grand unification models based on the gauge groups SU(5) and SO(10). An interesting question related to the extension of the standard model is whether the number of fermion generations can in any way be explained through the internal consistency of the model. In the literature, there is some discussion of grand unification models based on large orthogonal groups like SO(18), where one spinor multiplet contains all known fermion fields of all generations, and much more [1]. It was shown that [2, 3], with a suitable symmetry breaking scheme, only three generations can remain light, whereas the others obtain masses at much above the electroweak scale.

In a quite different line of development, it was shown that if one extends the electroweak group to \(\mathrm{{SU}}(3) \times \mathrm{U}(1)\) and tries to accommodate the standard fermions into multiplets of this gauge group which must include some new fermions, cancelation of gauge anomalies can restrict the number of generations and one obtains consistent models with the number of generations equal to three or any multiple of it [48].

These models are described briefly in Sect. 2. Then, in Sects. 3 and 4, we try to see whether these models can be embedded into a simple SU(N) group. We conclude that if one uses only completely antisymmetric tensor representations, such an embedding cannot be found. Then, in Sect. 5, we start looking for general conditions that will specify the number of fermion generations for arbitrary SU(N) groups. In Sect. 6, we analyze one simple model, based on the group SU(9), which gives three generations. The renormalization group analysis of various scales in these models is performed in Sect. 7. We end with some concluding remarks in Sect. 8.

2 The 3-3-1 models

The 3-3-1 models are based on the gauge group \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\). The first factor in the gauge group is the group of QCD, whereas the other two factors pertain to electroweak interactions. There are two versions of such models, and we discuss them one by one.

In one version, proposed by Pisano and Pleitez [5] and by Frampton [6] (to be denoted as the PPF model), the left-chiral fermions and antifermions belong to the following representations of the gauge group:

$$\begin{aligned}&f_a = \left( \begin{array}{l} \hat{\ell }\\ \nu _\ell \\ \ell \end{array} \right) _a \sim \left[ 1,3,0\right] \end{aligned}$$
(1a)
$$\begin{aligned}&Q_1 = \left( \begin{array}{l} T_1 \\ u_1 \\ d_1 \end{array} \right) \sim \left[ 3,3,{2\over 3}\right] \end{aligned}$$
(1b)
$$\begin{aligned}&Q_i = \left( \begin{array}{l} B \\ d \\ u \end{array} \right) _i \sim \left[ 3,3^*,-{1\over 3}\right] \end{aligned}$$
(1c)
$$\begin{aligned}&\hat{u}_a \sim \left[ 3^*,1,-{2\over 3}\right] _a \end{aligned}$$
(1d)
$$\begin{aligned}&\hat{d}_a \sim \left[ 3^*,1,{1\over 3} \right] _a \end{aligned}$$
(1e)
$$\begin{aligned}&\hat{T}_1 \sim \left[ 3^*,1,-{5\over 3}\right] \end{aligned}$$
(1f)
$$\begin{aligned}&\hat{B}_i \sim \left[ 3^*,1,{4\over 3}\right] _i. \end{aligned}$$
(1g)

Note that there are two kinds of generation indices: \(a\) goes from 1 to 3, whereas \(i\) takes only the values 2 and 3. An antifermion has been denoted by a hat. We emphasize that all representations given above pertain to the left-chiral components only. The representation of a right-chiral fermion would be the complex conjugate of that of the left-chiral antifermion, and vice versa. Note that there are extra quark fields, i.e., fields which are triplets of \(\mathrm{{SU}}(3)_\mathrm{c}\), but there is no neutrino field that is sterile under the standard model. Different generations of fermions are not copies of one another. Gauge anomalies cancel between the three generations [5, 6], but not within a single generation. Thus, the consistency of the model requires three generations of fermions. Of course, this pattern of three generations can be repeated, so one would obtain a number of generations that is a multiple of three.

The other version of 3-3-1 models was proposed by Singer et al. [4] (and hence will be referred to as the SVS model) and was examined by other authors later [7, 8]. In this version, there are sterile neutrinos, the left-chiral component of which has been denoted by \(\hat{\nu }\) in the list below:

$$\begin{aligned}&f_a = \left( \begin{array}{l} \nu _\ell \\ \ell \\ \widehat{\nu }_\ell \end{array} \right) _a \sim \left[ 1,3,-{1\over 3}\right] \end{aligned}$$
(2a)
$$\begin{aligned}&\hat{\ell }_a \sim \left[ 1,1,1\right] \end{aligned}$$
(2b)
$$\begin{aligned}&Q_1 = \left( \begin{array}{l} u_1 \\ d_1 \\ u'_1 \end{array} \right) \sim \left[ 3,3,{1\over 3}\right] \end{aligned}$$
(2c)
$$\begin{aligned}&Q_i = \left( \begin{array}{l} d \\ u \\ d' \end{array} \right) _i \sim \left[ 3,3^*,0\right] \end{aligned}$$
(2d)
$$\begin{aligned}&\hat{u}_a, \hat{u}'_1 \sim \left[ 3^*,1,-{2\over 3}\right] \end{aligned}$$
(2e)
$$\begin{aligned}&\hat{d}_a, \hat{d}'_i \sim \left[ 3^*,1,{1\over 3}\right] . \end{aligned}$$
(2f)

The notation for the generation indices is as before. The primed fields are extra quark fields, which are not present in the standard model. Like the previous model, gauge anomalies cancel between various generations.

The difference between the two models may be summarized in the following way. The standard model gauge group is not a maximal subgroup of the group \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\). In particular, the electroweak \(\mathrm{{SU}}(3)_\mathrm{L}\) has two neutral generators, and some combination of these two, along with the generator of \(\mathrm{U}(1)_\mathrm{X}\), form \(Y\) and \(I_{3L}\), the two neutral generators of the standard electroweak model. Using the standard normalization of the SU(3) generators,

$$\begin{aligned} \mathrm{{tr}} (T_A T_B) = \frac{1}{2} \delta _{AB}, \end{aligned}$$
(3)

these combinations are given by

$$\begin{aligned}&I_{3L} = - \frac{1}{2} T_3 + \frac{\surd 3}{2} T_8, \end{aligned}$$
(4a)
$$\begin{aligned}&Y = \frac{3}{2} T_3 + \frac{\surd 3}{2} T_8 + X \end{aligned}$$
(4b)

in the first model, whereas in the second one, they are given by

$$\begin{aligned}&I_{3L} = T_3, \end{aligned}$$
(5a)
$$\begin{aligned}&Y = - \frac{1}{\surd 3} T_8 + X. \end{aligned}$$
(5b)

The SU(3) generators are given by

$$\begin{aligned} T_i =\! \left\{ \begin{array}{l@{\quad }l} \frac{1}{2} \lambda _i &{} \hbox {for the fundamental representation}, \\ -\frac{1}{2} \lambda ^*_i &{} \hbox {for the anti-fundamental representation}, \end{array}\right. \end{aligned}$$
(6)

where the \(\lambda \) are the well-known Gell-Mann matrices.

3 Seeking embeddings into SU(6)

Since the group \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\) is of rank 5, the smallest unitary group that contains it as a subgroup is SU(6). Therefore, in this section, we analyze whether the models discussed in Sect. 2 can be embedded into an SU(6) grand unified model.

For the sake of convenience, let us announce here the notation that will be used for denoting representations of various groups. For the grand unified group, the notations will be denoted by boldface. The \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\) representations will be denoted by three numbers in parentheses, e.g., \([a,b,c]\), as has already been done in Eqs. (1) and (2). The representation of the standard model group, when required, will be denoted by curly brackets, e.g., \(\{a,b,c\}\). Thus, with this notation, the fundamental representation of SU(6) has the decomposition

$$\begin{aligned} \mathbf {6}&\rightarrow \left[ 3,1,-\frac{1}{3}\right] + \left[ 1,3,\frac{1}{3}\right] \nonumber \\&\rightarrow \left\{ 3,1,-\frac{1}{3}\right\} + \left\{ 1,2,\frac{1}{2}\right\} + \left\{ 1,1,0\right\} . \end{aligned}$$
(7)

It is now easy to see that neither the PPF nor the SVS model can be embedded into SU(6). For the PPF model [5, 6], note the representation of leptons given in Eq. (1a). Certainly, it is not contained in the fundamental representation of SU(6) or its complex conjugate, \(\mathbf {6}^*\). Higher representations can be obtained by taking Kronecker products of the \(\mathbf {6}\) and \(\mathbf {6}^*\) representations, and they will be of the generic form \((\mathbf {6})^m(\mathbf {6}^*)^n\). Denoting the two different \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\) representations that appear in Eq. (7) by \(A\) and \(B\), we can write

$$\begin{aligned} (\mathbf {6})^m(\mathbf {6}^*)^n \!\rightarrow \! \sum _{m',n'} {m \atopwithdelims ()m'} {n \atopwithdelims ()n'} (A)^{m-m'} \!(B)^{m'} (A^*)^{n-n'} (B^*)^{n'}.\nonumber \\ \end{aligned}$$
(8)

Take any term in the sum. Contributions to the \(\mathrm{U}(1)_\mathrm{X}\) quantum number come from all four factors, and it is given by

$$\begin{aligned} X = \frac{1}{3} ( -m + 2m' + n - 2n'). \end{aligned}$$
(9)

Thus, for the lepton representation such as in Eq. (1a), we need

$$\begin{aligned} m-n = 2(m'-n'). \end{aligned}$$
(10)

in order to obtain \(X=0\). Only the \(B\) and \(B^*\) contributes to non-trivial \(\mathrm{{SU}}(3)_\mathrm{L}\) representations. In order to obtain a triplet, we need

$$\begin{aligned} m' - n' = 1 \mod 3, \end{aligned}$$
(11)

considering the triality of the representations. Moreover, the lepton must be a color singlet, which means that we should have

$$\begin{aligned} (m-m') - (n-n') = 0 \mod 3. \end{aligned}$$
(12)

These three conditions cannot be satisfied with integers, and hence it is impossible to obtain a \((1,3,0)\) representation of \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\) in any representation of SU(6).

For the SVS model, the same kind of analysis can be performed keeping an eye towards the antilepton in Eq. (2a). In order to produce a singlet of both SU(3) factors, one needs the Kronecker product of equal numbers of \(6\) and \(6^*\). However, such products will give \(X=0\).

We have thus proved that neither the PPF, nor the SVS model can be embedded into SU(6). This result should not be understood to imply that an SU(6) grand unified model is impossible. We can take a different embedding of the standard model generators \(I_{3L}\) and \(Y\), viz.,

$$\begin{aligned}&I_{3L} = T_3, \end{aligned}$$
(13a)
$$\begin{aligned}&Y = \frac{1}{\surd 3} \lambda _8 + X. \end{aligned}$$
(13b)

The SM reduction of the \(6^*\) representation of SU(6) can easily be read from Eq. (7):

$$\begin{aligned} \mathbf {6}^* \rightarrow \underbrace{\left\{ {3^*,1,\frac{1}{3}}\right\} }_{\hat{d}} + \underbrace{\left\{ { 1,2,-\frac{1}{2}}\right\} }_{L} + \left\{ 1,1,0 \right\} . \end{aligned}$$
(14)

Also, the antisymmetric rank-2 representation has the following decomposition under the SM gauge group:

$$\begin{aligned} \mathbf {15}&\rightarrow \left[ 3^*,1, -{\frac{2}{3}}\right] + \left[ 1,3^*,{\frac{2}{3}}\right] + \left[ 3,3,0\right] \nonumber \\&\rightarrow \underbrace{ \left\{ 3^*,1,-\frac{2}{3}\right\} }_{\hat{u}} + \left\{ 1,2,\frac{1}{2}\right\} + \underbrace{\left\{ 1,1,1\right\} }_{\hat{\ell }} + \underbrace{\left\{ 3,2,\frac{1}{6}\right\} }_{Q}\nonumber \\&+ \left\{ 3,1,-\frac{1}{3}\right\} . \end{aligned}$$
(15)

In both Eqs. (14) and (15), we have marked the known fermions which correspond to the SM representations. We observe that all known fermions of a single generation belong to these two representations. This is therefore like the minimal SU(5) grand unified model where the corresponding representations contain all known fermion fields of a single generation and nothing more. In this case, there are some extra fermions to complete the SU(6) representations.

There is one big difference between the SU(5) and this SU(6) model. This can be seen from the anomaly coefficients of different representations. For the completely antisymmetric tensorial representations of SU(N), the anomaly coefficients are as follows [9]:

(16)

Here, the completely antisymmetric representation of rank \(n\) has been denoted by \({[\![} \, n \, {]\!]}\). The anomaly coefficient of any representation and its complex conjugate will be the negatives of each other. We see that, for the SU(5) group, anomalies cancel between the \(\mathbf {5}^*\) and \(\mathbf {10}\) representations. For the SU(6) gauge group, the \(\mathbf {6}^*\) and the \(\mathbf {15}\) representations have anomaly coefficients of \(-1\) and 2, respectively. Thus, along with three copies of the \(\mathbf {15}\) representation, which is necessary in order to have three quark doublets, we need six copies of the \(\mathbf {6}^*\) representation in order to cancel anomalies. There will thus be three extra copies of the lepton doublet \(L\). But these will form gauge invariant masses with the three copies of the \(\left\{ 1,2,\frac{1}{2}\right\} \) representation that appear in the three \(\mathbf {15}\)-plets, and these masses can be much heavier than the electroweak scale. Similarly, among the six copies of the \(\hat{d}\) representation that appear in the \(\mathbf {6}^*\) multiplets, three will form gauge invariant masses with the \(\left\{ 3,1,-\frac{1}{3}\right\} \) representations present in the \(\mathbf {15}\)-plets, leaving only three \(\hat{d}\)’s for the electroweak scale. The sterile neutrino fields, i.e., the \(\left\{ 1,1,0\right\} \) representations shown in Eq. (14), can also have mass terms that are invariant under the SM gauge group. If all these masses are large, the only fields that are left over at the electroweak scale are not different from the fields that are obtained in the three generations of \(\mathbf {5}^*\) and \(\mathbf {10}\) multiplets of SU(5). Moreover, the model has no explanation for the number of generations. Any number of \(\mathbf {15}\)-plets, along with twice the number of \(\mathbf {6}^*\)-plets, would be anomaly-free. Hence this model is not interesting for our discussion.

4 Generalities of SU(N) models with \(N>6\)

We now consider models based on the gauge groups SU(N), with \(N>6\). Henceforth we will use completely antisymmetric tensor representations only. Such a representation of rank \(n\) will be denoted by \({[\![} \, n \, {]\!]}\), as was done in Eq. (16). The fundamental representation will thus be denoted by \({[\![} \, 1 \, {]\!]}\) in this notation, whereas its complex conjugate will be \({[\![} \, N-1 \, {]\!]}\). We will show that, with antisymmetric representations only, neither the PPF nor the SVS model can be embedded into an SU(N) grand unified group.

The crucial aspect of both the PPF and the SVS model is that, among the left-chiral fields, the quarks, i.e., color triplets, appear in triplet or antitriplet of \(\mathrm{{SU}}(3)_\mathrm{L}\), whereas the antiquarks, i.e., color antitriplets, are all singlets of \(\mathrm{{SU}}(3)_\mathrm{L}\). In particular, then, there is no multiplet that transforms like \(\left[ 3^*,3,\star \right] \) or \(\left[ 3^*,3^*,\star \right] \). On the other hand, among the quark fields, some should be in the \(\big [3,3,\star \big ]\) and some in the \(\left[ 3,3^*,\star \right] \) representation, thereby ensuring anomaly cancelation among different generations. These features are not available in the decomposition of any \({[\![} \, m \, {]\!]}\) representation of an SU(N) group, as we show now.

Since \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\) is not a maximal subgroup of the groups under consideration, it should be possible to embed the \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\) group into the SU(N) grand unified group in more than one way. First, we assume that the decomposition of the fundamental representation of SU(N) into \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\) is given by

$$\begin{aligned} {}{[\![} \, 1 \, {]\!]} \equiv \mathbf {N} \rightarrow \Big [3,1,\star \Big ] + \Big [1,3,\star \Big ] + \sum _{k=7}^N \Big [1,1,\star \Big ], \end{aligned}$$
(17)

where the \(\mathrm{U}(1)_\mathrm{X}\) charges have been left unspecified, denoted by the star symbol. There are various ways of assigning the \(\mathrm{U}(1)_\mathrm{X}\) quantum numbers, and the specifics are irrelevant for our discussion. The representation \({[\![} \, m \, {]\!]}\) can contain a \(\left[ 3^*,3,\star \right] \) submultiplet if, among the \(m\) tensor indices, two come from the color part and one from the \(\mathrm{{SU}}(3)_\mathrm{L}\) part, and the remaining \(m-3\) indices should belong to the \(\mathrm{U}(1)_\mathrm{X}\) subgroup. However, the count of \(\left[ 3,3^*,\star \right] \) is exactly the same. Thus, we obtain equal numbers of \(\left[ 3^*,3,\star \right] \) and \(\left[ 3,3^*,\star \right] \) submultiplets. The only way to get rid of the \(\left[ 3^*,3,\star \right] \) submultiplets is for them to form \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\) invariant masses with \(\left[ 3,3^*,\star \right] \) submultiplets and become superheavy. But then there is no remaining \(\left[ 3,3^*,\star \right] \) submultiplet to contribute to the fermion content of the PPF or the SVS model. Hence the impasse.

The situation is not different if we assume that the decomposition of the fundamental representation follows the rule

$$\begin{aligned} {}{[\![} \, 1 \, {]\!]} \equiv \mathbf {N} \rightarrow \left[ 3,1,\star \right] + \left[ 1,3^*,\star \right] + \sum _{k=7}^N \left[ 1,1,\star \right] . \end{aligned}$$
(18)

In this case, following the same argument, we would conclude that the number of \(\big [3,3,\star \big ]\) and \(\left[ 3^*,3^*,\star \right] \) are equal. Now, the \(\left[ 3^*,3^*,\star \right] \) submultiplets can be got rid of by forming superheavy masses with \(\left[ 3,3,\star \right] \) submultiplets. But then there will be no \(\big [3,3,\star \big ]\) left at the \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\) level, which is unacceptable for both the PPF and the SVS model.

5 Other embeddings into SU(N)

At this point, we ignore the intermediate symmetry \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\) and try to see whether it is possible to embed the standard model fermions into an SU(N) grand unified group that would provide an explanation for the number of fermion generations. To this end, we first list the decomposition of the lowest rank antisymmetric tensor representations of SU(N) into the SM gauge group. The fundamental representation decomposes as follows:

$$\begin{aligned} {[\![} \, 1 \, {]\!]} \!&= \! \left\{ 3,1,-\frac{1}{3}\right\} \!+\! \left\{ 1,2,\frac{1}{2}\right\} \!+\! (N-5) \cdot \Big \{1,1,0\Big \},\qquad \end{aligned}$$
(19)

where the number within parentheses indicates the number of copies of the SM gauge singlet in the last term. Similarly, we obtain

$$\begin{aligned} {{[\![} \, 2 \, {]\!]}} \!&= \! \left\{ 3^*,1,-\frac{2}{3}\right\} + \left\{ 3,2,\frac{1}{6}\right\} + (N-5) \cdot \left\{ 3,1,-\frac{1}{3}\right\} \nonumber \\&\!+\! \Big \{1,1,1\Big \} \!+\! (N\!-\!5) \cdot \left\{ 1,2,\frac{1}{2}\right\} \!+\! \left( \begin{array}{c} N-5 \\ 2\end{array}\right) \cdot \Big \{1,1,0\Big \}, \nonumber \\ {[\![} \, 3 \, {]\!]} \!&= \! \Big \{1,1,-1\Big \} + \left\{ 3^*,2,-\frac{1}{6}\right\} + (N-5) \cdot \left\{ 3^*,1,-\frac{2}{3}\right\} \nonumber \\&+ (N-5) \cdot \left\{ 3,2,\frac{1}{6}\right\} + \left\{ 3,1,\frac{2}{3}\right\} \nonumber \\&+\left( \begin{array}{c} N-5 \\ 2 \end{array}\right) \left\{ 3,1,-\frac{1}{3}\right\} + (N-5) \cdot \Big \{1,1,1\Big \} \nonumber \\&+ \left( \begin{array}{c} N-5\\ 2 \end{array}\right) \left\{ 1,2, \frac{1}{2}\right\} + \left( \begin{array}{c}N-5 \\ 3\end{array}\right) \Big \{1,1,0\Big \}. \end{aligned}$$
(20)

Suppose now we consider a model with \(n_1\) copies of \({[\![} \, 1 \, {]\!]}\) representation, \(n_2\) copies of \({[\![} \, 2 \, {]\!]}\), and \(n_3\) copies of \({[\![} \, 3 \, {]\!]}\). In counting the ‘copies’, we will denote a complex conjugate representation by a negative number. We count the number of different SM multiplets in the antisymmetric representations of SU(N) and summarize the result in Table 1. We see that, in order to obtain \(n_g\) generations of fermions, we need

$$\begin{aligned} n_2 + (N-6) \,n_3 = n_g \end{aligned}$$
(21a)

in order to ensure the correct number of quark doublets, as well as of \(\hat{u}\) and \(\hat{\ell }\). In addition, we need

$$\begin{aligned} n_1 + (N-5) n_2 + {N-5 \atopwithdelims ()2} \,n_3 = - n_g, \end{aligned}$$
(21b)

so that the correct numbers of lepton doublets and \(\hat{d}\) are obtained. Anomaly cancelation between the representations will be ensured if we have

$$\begin{aligned} n_1 + (N-4) n_2 + \frac{1}{2} (N-3)(N-6) \,n_3 = 0, \end{aligned}$$
(21c)

making use of the anomaly coefficients given in Eq. (16). However, only two of the three relations in Eq. (21) are independent. We find it simple to work with Eqs. (21a) and (21c). The general solution of these equations is given by

$$\begin{aligned} n_1&= -(N-4) n_g + \frac{1}{2} (N-5)(N-6) \,n_3, \nonumber \\ n_2&= n_g - (N-6) \,n_3. \end{aligned}$$
(22)
Table 1 Number of different SM multiplets occurring in completely antisymmetric representations of SU(N)

If we take \(n_3=0\), we obtain \(n_g=n_2\), and hence no explanation of generations. This is what was done for the SU(6) grand unified model discussed in Sect. 3, a model which was found to be uninteresting precisely because it could not predict the number of generations. However, we can consider other kinds of solutions of Eq. (22). For example, if we take \(n_2=0\), we obtain

$$\begin{aligned} n_g = (N-6) \,n_3. \end{aligned}$$
(23)

In this case, for the grand unified group SU(9) we find that the number of generations must be three or its multiple.

It should be noted that it is just as easy to obtain solutions of Eq. (22) with \(n_1, n_2\), and \(n_3\) all non-zero. One such model with an SU(8) gauge group was the subject matter of Ref. [10], where the authors took \(n_1=-9, n_2=1, n_3=1\) and obtained three generations of fermions. From our analysis, it seems that they could have obtained any other number of generations by adjusting the number of copies of various representations. For example, \(n_1=-13, n_2=2, n_3=1\) would give four generations. However, the merit of the \(n_2=0\) solutions is that the number of generations cannot be arbitrary: it must be a multiple of three.

6 Anatomy of an SU(9) model

As seen from our earlier analysis, an SU(9) model, in the absence of rank-2 antisymmetric multiplets, automatically gives three fermion generations, provided we take

$$\begin{aligned} n_3=1,\quad n_1=-9, \end{aligned}$$
(24)

which means that there should be nine copies of the anti-fundamental representation, and one copy of the rank-3 antisymmetric multiplet.

To see in more detail how different fermion representations of the standard model are obtained from these representations of the grand unified group SU(9), we first discuss the decomposition of these multiplets under the group \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\). The decomposition of the fundamental of SU(9) can be taken to be as given in Eq. (17). We can choose the \(\mathrm{U}(1)_\mathrm{X}\) quantum number in a way that it vanishes for all singlets of \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{{SU}}(3)_\mathrm{L}\). Then, choosing the normalization of the \(\mathrm{U}(1)_\mathrm{X}\) arbitrarily, we can write

$$\begin{aligned} {[\![} \, \bar{1} \, {]\!]} \equiv {\bar{\mathbf {9}}} \rightarrow \left[ 3^*,1,\frac{1}{3}\right] + \left[ 1,3^*,-\frac{1}{3}\right] + 3 \cdot \Big [1,1,0\Big ].\nonumber \\ \end{aligned}$$
(25)

This gives, for the rank-3 representation of SU(9), the following decomposition into \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\) multiplets:

$$\begin{aligned}&{[\![} \, 3 \, {]\!]} \equiv \mathbf {84} = \Big [1,1,0\Big ] + \Big [1,1,1\Big ] + \Big [1,1,-1\Big ] \nonumber \\&\quad +\, 3 \cdot \Big [3,3,0\Big ] + \left[ 3^*,3,-\frac{1}{3}\right] + \left[ 3,3^*,\frac{1}{3}\right] \nonumber \\&\quad +\, 3 \cdot \left[ 3^*,1,-\frac{2}{3}\right] + 3 \cdot \left[ 1,3^*,\frac{2}{3}\right] + 3 \cdot \left[ 1,3,\frac{1}{3}\right] \nonumber \\&\quad +\, 3 \cdot \left[ 3,1,-\frac{1}{3}\right] . \end{aligned}$$
(26)

Looking at these decompositions, we find that the multiplets specified by Eq. (24) contain the following vector-like combinations of \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\) submultiplets:

$$\begin{aligned} \begin{array}{l@{\quad }l@{\quad }l} 28 &{} \hbox {copies of}&{} \big [1,1,0\big ], \\ 3 &{} \hbox {copies of}&{} \left[ 3^*,1,\frac{1}{3}\right] + \left[ 3,1,-\frac{1}{3}\right] , \\ 3 &{} \hbox {copies of} &{}\left[ 1,3^*,-\frac{1}{3}\right] + \left[ 1,3,\frac{1}{3}\right] , \\ 1 &{} \hbox {copy of}&{} \left[ 3^*,3,-\frac{1}{3}\right] + \left[ 3,3^*,\frac{1}{3}\right] , \\ 1 &{} \hbox {copy of} &{}\left[ 1,1,1\right] + \left[ 1,1,-1\right] . \end{array} \end{aligned}$$
(27)

Such things can have \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\) invariant mass terms at the level where the said symmetry is intact, and they do not affect the SM reduction of the model. These singlets and vector-like particles do not contribute to the \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\) anomalies.

In addition, we find that the decomposition consists of several chiral multiplets of \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\). Under the group \(\mathrm{{SU}}(3)_\mathrm{L}\), these multiplets transform either like triplets or antitriplets, or like singlets. We want the triplets and antitriplets to contain the doublets of the standard electroweak gauge group. For that, we first have to discuss how the SM is embedded into \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\). Since the members of an \(\mathrm{{SU}}(2)_\mathrm{L}\) doublet have the same value of hypercharge \(Y\), we need to ensure that the diagonal \(Y\) generator has two equal entries in the \(3\times 3\) representation of \(\mathrm{{SU}}(3)_\mathrm{L}\). We notice that the PPF model was constructed in such a way that the second and the third elements of a triplet end up with the same value of \(Y\) through Eq. (4), whereas in the SVS model, the first two elements are equal. In order to obtain something different, we can now try a solution in which the first and the third elements of \(Y\) should have the same diagonal elements. This solution is given by

$$\begin{aligned}&I_{3L} = \frac{1}{2} T_3 + \frac{\surd 3}{2} T_8, \end{aligned}$$
(28a)
$$\begin{aligned}&Y = \frac{1}{2} T_3 - \frac{1}{2\surd 3} T_8 + X. \end{aligned}$$
(28b)

With these assignments, we now present the chiral multiplets of \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\), which are present in the choice of representations in Eq. (24), as well as their SM decompositions:

$$\begin{aligned}&3 \cdot \Big [3,3,0\Big ] \rightarrow 3 \cdot \left\{ 3,2,\frac{1}{6}\right\} + 3 \cdot \left\{ 3,1,-\frac{1}{3}\right\} ,\end{aligned}$$
(29a)
$$\begin{aligned}&6 \cdot \left[ 3^*,1,\frac{1}{3}\right] \rightarrow 6 \cdot \left\{ 3^*,1,\frac{1}{3}\right\} , \end{aligned}$$
(29b)
$$\begin{aligned}&6 \cdot \left[ 1,3^*,-\frac{1}{3}\right] \rightarrow 6 \cdot \left\{ 1,2,-\frac{1}{2}\right\} + 6 \cdot \Big \{1,1,0\Big \}, \end{aligned}$$
(29c)
$$\begin{aligned}&3 \cdot \left[ 3^*,1,-\frac{2}{3}\right] \rightarrow 3 \cdot \left\{ 3^*,1,-\frac{2}{3}\right\} , \end{aligned}$$
(29d)
$$\begin{aligned}&3 \cdot \left[ 1,3^*,\frac{2}{3}\right] \rightarrow 3 \cdot \left\{ 1,2, \frac{1}{2}\right\} + 3 \cdot \Big \{1,1,1\Big \}. \end{aligned}$$
(29e)

We see that, at the SM level, there are three quark doublets, three multiplets that transform like \(\hat{u}_L\), and also three that transform like the \(\hat{\ell }_L\): precisely the numbers necessary for obtaining the three fermion generations in the SM. We find six SM multiplets that transform like the \(\hat{d}_L\) in Eq. (29b), but that is not a problem, because three of them can pair up with an equal number of \(\left\{ 3,1,-\frac{1}{3}\right\} \) multiplets as appear in Eq. (29a), and one can obtain a bare mass term that is invariant under the SM gauge group. Similarly, the three \(\left\{ 1,2, \frac{1}{2}\right\} \) multiplets pair up with three of the six \(\left\{ 1,2, -\frac{1}{2}\right\} \) multiplets, leaving three lepton doublets at the SM level. Thus, we are left with exactly the three chiral generations of SM fermions which are necessary for building up the standard model. In addition, there are vector-like combinations of the SM gauge group which come from Eqs. (27) and (29). We write these combinations as follows:

(30)

This model is therefore very different from the existing models based on the gauge group \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\), the ones outlined in Sect. 2. In the existing models, cancelation of gauge anomalies is obtained by making one fermion generation different from the others. In the present model, this is not the case. At the level of the \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\) group, all fermion generations have the same transformation properties. However, there are extra fermions that help cancel the anomalies, fermions which become vector-like at the SM level. Thus, even apart from the grand unification prospects via SU(9), the multiplets presented in Eq. (29) can be taken to represent a new \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\) model, which can be studied in its own right. However, if we consider it purely as a \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\) model, there is no explanation of the number of generations. One can obtain any number of generations by changing the number of copies of each multiplet, keeping the ratios intact, and obtain any number of generation one wants. Thus, it is fair to say that the explanation of the number of generations comes directly from the grand unification group SU(9), and not from its subgroup \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\).

7 Gauge coupling unification and the intermediate scales

We now address the unification picture of the SU(9) model defined by the condition given in Eq. (24). For the sake of simplicity, we shall assume in our analysis that the SU(9) GUT model is spontaneously broken, through the scheme given by Eq. (19), directly to the SM gauge group at a unique unification scale \(\Lambda \). Thus, the unification condition for the SM gauge couplings \(\alpha _{1,2,3}\) at the scale \(\Lambda \) reads

$$\begin{aligned} \alpha _U = k_1\alpha _1(\Lambda )=k_2\alpha _2(\Lambda )=k_3 \alpha _3(\Lambda ). \end{aligned}$$
(31)

The normalization constants \(k_i\) are defined by

$$\begin{aligned} k_i\equiv \frac{\mathrm{{tr}}T^2_i}{\mathrm{{tr}}T^2}, \end{aligned}$$
(32)

where \(T\) and \(T_i\) are the same unbroken generator properly normalized to the GUT group and its subgroup \(G_i\), respectively, for a given representation. Taking into account the decomposition of the fundamental representation given in Eq. (19) and the GUT normalization in Eq. (3), one easily derives that the normalization constants \(k_i\) take the canonical values

$$\begin{aligned} k_1=\frac{5}{3},\quad k_2=k_3=1, \end{aligned}$$
(33)

like in the SU(5) GUT models. The relation given in Eq. (31) is only valid at the unification scale and one has to relate these gauge couplings with measurable quantities at the electroweak scale.

The evolution of the gauge couplings in the one-loop approximation is ruled by the solutions of the Renormalization Group Equations (RGEs), which depend on the masses of the particles in the model. We take the SM particles and \(n_\mathrm{H}\) Higgs doublets at the electroweak scale. In addition, the vector-like combinations given in Eq. (30) can also have masses between \(M_Z\) and the unification scale \(M_U\). We will assume, for the sake of simplicity, that all extra vector-like fermions with the same quantum number share their mass scales, and we denote these scales collectively as \(M_I\), where \(I\) can take the ‘values’ \(Q,U,D,L,E\) as shown in Eq. (30). The solutions of the RGEs can then be written as

$$\begin{aligned} \alpha ^{-1}_1(\mu )&= \alpha ^{-1}_1(M_Z)\!-\!\frac{b_1}{2\pi }\ln \left( \frac{\mu }{M_Z}\right) \!-\!\sum _I\frac{b^I_1}{2\pi }\ln \left( \frac{\mu }{M_I}\right) \!,\nonumber \\ \end{aligned}$$
(34a)
$$\begin{aligned} \alpha ^{-1}_2(\mu )&= \alpha ^{-1}_2(M_Z)\!-\!\frac{b_2}{2\pi }\ln \left( \frac{\mu }{M_Z}\right) \!-\!\sum _I\frac{b^I_2}{2\pi }\ln \left( \frac{\mu }{M_I}\right) \!,\nonumber \\ \end{aligned}$$
(34b)
$$\begin{aligned} \alpha ^{-1}_3(\mu )&= \alpha ^{-1}_3(M_Z)\!-\!\frac{b_3}{2\pi }\ln \left( \frac{\mu }{M_Z}\right) \!-\!\sum _I\frac{b^I_3}{2\pi }\ln \left( \frac{\mu }{M_I}\right) \!,\nonumber \\ \end{aligned}$$
(34c)

where the \(b_i\) are the one-loop beta coefficients that take into account the quantum numbers of the SM fermions, the gauge bosons, and \(n_\mathrm{H}\) Higgs doublets:

$$\begin{aligned}&b_1=\frac{20}{9}n_g+\frac{n_\mathrm{H}}{6},\quad b_2=\frac{4}{3}n_g+\frac{n_\mathrm{H}}{6}-\frac{22}{3},\nonumber \\&b_3=\frac{4}{3}n_g-11. \end{aligned}$$
(35)

The \(b^I_i\) are the one-loop beta coefficients for the intermediate extra vector-like fermions. The values of these \(b^I_i\) are given in Table 2.

Table 2 Beta coefficients for the extra vector-like fermions

In order to get some insight into the unification in the one-loop approximation, i.e. to understand the intermediate scales which lead to successful unification, let us define the effective beta coefficients \(B_i\) [11, 12],

$$\begin{aligned} B_i\equiv \frac{1}{k_i}\left( b_i+\sum _I b_i^Ir_I\right) \!, \end{aligned}$$
(36)

where the ratio \(r_{I}\) is given by

$$\begin{aligned} r_I=\frac{\ln \left( \Lambda /M_I\right) }{\ln \left( \Lambda /M_Z\right) }. \end{aligned}$$
(37)

It is also convenient to introduce the differences \(B_{ij}\equiv B_i-B_j\), such that

$$\begin{aligned} B_{ij}= B^{\text {SM}}_{ij}+\sum _I\Delta ^I_{ij}r_I, \end{aligned}$$
(38)

where \(B^{\text {SM}}_{ij}\) corresponds to the SM particle contribution and

$$\begin{aligned} \Delta ^I_{ij}= \frac{b^I_i}{k_i}-\frac{b^I_j}{k_j}. \end{aligned}$$
(39)

We then find that

$$\begin{aligned}&B\equiv \frac{B_{23}}{B_{12}} = \frac{\sin ^2\theta _W-\dfrac{k_2}{k_3}\dfrac{\alpha }{\alpha _s}}{\dfrac{k_2}{k_1}-\left( 1+\dfrac{k_2}{k_1}\right) \sin ^2\theta _W}, \end{aligned}$$
(40a)
$$\begin{aligned}&B_{12}\ln \left( \frac{\Lambda }{M_Z}\right) = \frac{2\pi }{\alpha } \left[ \frac{ 1}{k_1}-\left( \frac{1}{k_1}+\frac{1}{k_2} \right) \sin ^2\theta _W\right] \!. \end{aligned}$$
(40b)

Notice that the right-hand sides of Eq. (40) depend only on low-energy electroweak data and the group factors \(k_i\). Adopting the following experimental values at \(M_Z\) [13]:

$$\begin{aligned}&\alpha ^{-1} = 127.916\pm 0.015, \end{aligned}$$
(41a)
$$\begin{aligned}&\sin ^2\theta _W = 0.23116\pm 0.00012, \end{aligned}$$
(41b)
$$\begin{aligned}&\alpha _s = 0.1184\pm 0.0007, \end{aligned}$$
(41c)

the above relations give

$$\begin{aligned}&B = 0.718\pm 0.003, \end{aligned}$$
(42)
$$\begin{aligned}&B_{12}\ln \left( \frac{\Lambda }{M_Z}\right) = 185.0\pm 0.2, \end{aligned}$$
(43)

in the canonical GUT models with \(k_i=(5/3,1,1)\). The coefficients \(B_{ij}\), which appear in the left-hand sides of Eq. (40), strongly depend on the particle content of the theory. For instance, considering the SM-like particles, with \(n_g\) generations, together with \(n_\mathrm{H}\) light Higgs doublets, one has for the coefficients \(B_{12}\) and \(B_{23}\):

$$\begin{aligned} B_{12}=\frac{22}{3}-\frac{n_\mathrm{H}}{15},\quad B_{23}=\frac{11}{3}+ \frac{n_\mathrm{H}}{6}. \end{aligned}$$
(44)

In the case of SM, i.e. \(n_g=3\) and \(n_\mathrm{H}=1\) one has

$$\begin{aligned} B=115/218\approx 0.53, \end{aligned}$$
(45)

which is not compatible with the calculated value in Eq. (42) and, clearly, the B-test fails badly in the SM. In Table 3, we have summarized the contributions of the vector-like fermions to \(B_{12}\) and \(B_{23}\). Using these values, we have taken random values of the intermediate scales of the extra vector-like fermions and found combinations of these intermediate scales which are compatible with successful unification. In our calculations we have assumed only one Higgs doublet, i.e. \(n_\mathrm{H}=1\), and we have taken a rough lower bound on the unification scale, \(M_U > 6 \times 10^{15}\) GeV, coming from the unobservability of proton decay [13] into \(e^+\pi ^0\). A set of a million of such random combinations were taken. A few of the allowed ones are presented in Fig. 1. From the entire set of combinations used in our calculations, we find that the allowed range for the unification scale obtained is

$$\begin{aligned} 6\times 10^{15}~\text {GeV}\le \Lambda \le 2.2\times 10^{16}~\text {GeV}, \end{aligned}$$
(46)

which is also roughly what the limited data of Fig. 1 indicates. For the scales of the vector-like extra fermions we have

$$\begin{aligned}&5.5\times 10^{13}~\text {GeV}\le M_Q\le 2.4\times 10^{14}~\text {GeV},\end{aligned}$$
(47a)
$$\begin{aligned}&1.2\times 10^{15}~\text {GeV}\le M_U \le 2.2\times 10^{16}~\text {GeV},\end{aligned}$$
(47b)
$$\begin{aligned}&6.6\times 10^{13}~\text {GeV}\le M_D \le 1.2\times 10^{16}~\text {GeV},\end{aligned}$$
(47c)
$$\begin{aligned}&7.4\times 10^{13}~\text {GeV}\le M_L\le 2.0\times 10^{16}~\text {GeV},\end{aligned}$$
(47d)
$$\begin{aligned}&1.7\times 10^{15}~\text {GeV}\le M_E\le 2.1\times 10^{16}~\text {GeV}. \end{aligned}$$
(47e)

We see that the range of mass scales of the vector-like fermions are high, but they are indeed necessary for driving the evolution of the gauge couplings to perfect unification. If one varies the number of Higgs doublets, \(n_\mathrm{H}=2,3\), the ranges of the intermediate scales above do not change significantly.

Fig. 1
figure 1

The intermediate scales of the extra vector-like fermions in function of the unification scale \(\Lambda \)

Table 3 Beta coefficients for the extra vector-like fermions

8 Conclusions

To summarize, we have succeeded in building a grand unified model based on the group SU(9). The model uses fermions in antisymmetric representations only, and the consistency of the model demands that the number of fermion generations is three. As mentioned in the introduction, earlier models based on the gauge group \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\) and SO(18) also had this property of predicting the number of generations. It is interesting to note that our group SU(9) contains \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\), and that it is contained in SO(18). However, in our model, the transformation properties of the known fermions in the \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\) subgroup of SU(9) are not the same as those used in earlier models based on the \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\) group. This is different where the known fermions of all generations transform in the same way under \(\mathrm{{SU}}(3)_\mathrm{c} \times \mathrm{SU}(3)_\mathrm{L} \times \mathrm{U}(1)_\mathrm{X}\). On the other hand, a comparison with the SO(18) models also reveals an interesting connection. In SO(18) models, the fermion generations are contained in the spinor representation. The spinor representation of SO(18), which is \(\mathbf {256}\)-dimensional, decomposes under the SU(9) subgroup as follows:

$$\begin{aligned} \mathbf {256} = {[\![} \, 1 \, {]\!]} + {[\![} \, 3 \, {]\!]} + {[\![} \, 5 \, {]\!]} + {[\![} \, 7 \, {]\!]} + {[\![} \, 9 \, {]\!]}. \end{aligned}$$
(48)

Our SU(9) model uses only the first two kinds of these submultiplets to predict the number of fermion generations.

We have also shown that our model may be consistent with unification requirements. In this part of the analysis, we have assumed a direct breaking of SU(9) into the SM gauge group. A more detailed analysis, including possibilities of intermediate symmetry breaking scales, will be taken up in a future work.