Abstract
We study the minimization of the potentials of multi-Higgs models with symmetries that are softly broken. The powerful method of geometric minimization enables analytic minimization of the potentials of multi-Higgs models invariant under large symmetries. When these symmetries are softly broken, the method needs to be adapted. We propose a useful generalization that considers the effect of the soft-breaking terms to the quadratic part of the potential, by applying the procedure to restricted orbit spaces. We exemplify our novel methodology by finding and classifying the minima for an \(S_4\) multi-Higgs model that is softly broken with specific terms.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
The Standard Model (SM) is unable to account for the dark matter (DM) abundance observed in the Universe. This inability is a strong motivation to study theories Beyond the SM. Among these, a very simple class of model considers extending the field content with additional Higgs doublets—Multi-Higgs Doublet Models (MHDMs), which can provide dark matter candidates as well as the possibility of spontaneous CP violation.
Within MHDMs, the simplest generalization is the 2HDM [1], introduced to provide CP violation. The 2HDM is very well studied (see, e.g. the reviews [2, 3]). The 3HDM [4], introduced also with CP violation in mind, has several features worth considering. With 3HDMs, there is the possibility to have DM candidates and spontaneous CP violation simultaneously. Furthermore, an important motivation to consider 3HDMs is the attempt to understand the Yukawa sector in a way that can explain the patterns of fermion masses and mixings. Due to the large number of parameters in the most general 3HDM, symmetries are often considered [5,6,7,8]. A pioneering example was the use of \(S_3\) [9,10,11].
The minimization of the potential to find the alignment of the vacuum expectation values (vevs) is important but non-trivial and usually becomes impossible to perform analytically. A very powerful method, of Geometric Minimization [12], has been proposed to address this problem in the presence of MHDM potentials with large symmetries.Footnote 1 With the great phenomenological interest of softly broken potentials [7, 8], we extend here the method of geometric minimization to the study of the vevs of softly broken potentials. We present the general considerations and exemplify the method in the potentials of softly broken \(S_4\) 3HDM. Although we have found cases in the softly broken \(\Sigma (36)\) potential [7] where analytical expressions can be obtained with our method, we have analysed here the \(S_4\) potential as an illustrative example. The \(S_4\) symmetry stands out as one of the smallest symmetries with three dimensional irreducible representations, and because the potential invariant under \(S_4\) is a simple modification of the potential invariant under \(A_4\)—two of the most frequently used symmetries used to explain the patterns of fermion masses and mixings. The main advantage of this method is that we are able to obtain analytical results, which we then verified numerically.
We note also that in the analysis performed here we do not include radiative corrections, which are relevant given they can modify the vacuum structure.
In Sect. 2 we review the geometric minimization method and generalize the formalism to apply to potentials with soft-symmetry breaking. A detailed example of the application of this method to the \(S_4\) invariant potential follows in Sect. 3. The minima are presented in Sect. 4 and we present our conclusions in Sect. 5.
2 Geometric minimization
2.1 Fully symmetric models
We begin by reviewing the geometric method described in [12] in the context of 3HDM, in preparation of what is to follow.
First, one needs to introduce the set of appropriate variables for the problem. In any given 3HDM the scalar potential is written as a function of 3 Higgs doublets, \(\varPhi _i\), which for the purposes of this work can be taken as doublets of complex numbers,
We consider then as our variables the following 9 combinations constructed from bilinear contractions of the \(\varPhi _i\),
which satisfy
One important aspect of the last expression is that equality is reached, that is,
if and only if one works in a neutral vacuum, with all \(c_i = 0\) up to an SU(2) rotation. We also define the variables
An important property of these is that they are sensitive only to the overall vev alignment. More precisely, given a set of vevs \((v_1,v_2,v_3)\), the same value of \(y_i\) would be obtained for any rescaled alignment of the form \(\beta (v_1,v_2,v_3), \, \beta > 0\), since both \(r_0\) and the \(r_i\) are homogeneous functions of degree two in \(\beta\).
In this formalism, the most general 3HDM potential is written as
which has 64 free parameters. However, in the context of 3HDMs (and MHDMs in general), in order to control this proliferation of the number of degrees of freedom it is usual to make V symmetric under some symmetry group G. From now on let us consider only symmetries that impose that all \(M_i = 0\), so that only \(M_0\) remains. This is the case, for example, for \(G = \{A_4, S_4, \Delta (54), \Sigma (36)\}\) [14]. As intended, these groups also drastically reduce the number of quartic parameters of the potential, by enforcing certain equalities between themselves, or by setting some of them to be zero. Let us denote the final number of free quartic parameters by \(k+1\), where for the aforementioned groups k does not exceed four. Thus, in these cases we can recast (6) in a more convenient form,
with
Here, the parameter \(M_0\) in assumed to be positive. Within \(V_4\) the \(x_i\) are the combinations of the \(y^2\) that, due to the symmetry group, end up coupling to the same parameter \(\varLambda _i\).Footnote 2 Note that since the \(x_i\) are invariant under rescaling by \(\beta\), then so is \(V_4\), and as such this recast has led to a separation of the dependence of V on the total magnitude \(v^2 = v_1^2 + v_2^2 + v_3^2\) and on the specific alignments compatible with this magnitude, akin to a separation of radial and angular variables in spherical coordinates.
Explicitly, this means that for any continuous set of vevs \(\beta (v_1,v_2,v_3), \; \; \beta > 0\) with \(v_i\) fixed we will always obtain the same value of \(V_4\), and only the value of \(r_0\) will change. Thus, from this point of view, for the set \((v_1,v_2,v_3)\), V is a function of \(r_0\) only, with minimum
occurring at the magnitude
Then, the minimization of the potential reduces to determining which alignment leads to the lowest value of V. Looking at (9), we conclude that the desired alignment is the one that leads to the smallest positive value of \(V_4\), since \(V_2\) is simply a constant. The positivity of \(V_4\) ensures that the model is bounded from below.
In order to determine the minimum alignment we take advantage of the linear nature of \(V_4\) on the \(x_i\), which reduces the minimization to the determination of the point that minimizes \(\varLambda _i x_i\) for a given set of \(\varLambda _i\), or, conversely, maximizes the dot product \(\vec {n} \cdot \vec {x}\), with \(n_i = - \varLambda _i\). In order to do so, we construct the geometric shape in \({\mathbb {R}}^k\) of all the points on the space of the \(x_i\) that can be reached by all the different vev alignments, which we denote by \(\varGamma\). Then, if \(k \le 3\) one can visualize this shape and by direct observation determine the coordinates in this shape that protrude the farthest in the direction of the vector \(\vec {n}\) for a given set of \(\varLambda _i\), and from there works backwards to determine which alignments produce those \(x_i\) coordinates. An example of this procedure is shown in Fig. 1.
2.2 Soft-symmetry breaking
Now, in order to introduce soft-symmetry breaking, we consider potentials where \(V_2\) is of the general form
We note that in this expression we continue to assume that \(M_0> 0\), as well as all the \(M_i\), for the sake of simplicity. However, this need not be the case, and one can consider models where some of these parameters are negative, as long as \(V_2 > 0\) in some region of the \(y_i\) domain.
One approach to this problem was suggested in the work where the geometric minimization was introduced [12], where to proceed one would first apply a non-unitary transformation T on the Higgs doublet space with the purpose of diagonalizing \(V_2\), in order to bring it to the canonical form of (8) and then follow the method as usual. This, however, proves to be technically challenging to accomplish since such a diagonalization inevitably introduces a dependence on the specific values of \(M_i\) in the shape of the orbit space, making its analysis a more involved task.
We can circumvent this by relaxing the requirement that \(V_2\) must be in the canonical form. If we take a look at (9), we see that if \(V_2\) has any \(M_i \ne 0\) then the conclusion that the minimum of the potential must lie at the border of \(\varGamma\) ceases to be true, since while it is still where \(V_4\) attains its minimum value, one now also needs to consider the effect of \(V_2\) to value of the potential. Indeed we will now find the true minimum at some other point in \(\varGamma\), which can in general lie inside it.
However, if we fix the value of \(V_2\) by fixing the values of \(y_i=\alpha _i\), with some constant set of \(\alpha _i\),Footnote 3 we obtain the region of \(\varGamma\) compatible with that given value of \(V_2\), which we denote by \(\varXi (\vec {\alpha })\) (the region depends on the set of \(\alpha _i\)). Then, the minimum of this restrained problem is in the border of \(\varXi (\vec {\alpha })\), whose description now becomes a function of the \(\alpha _i\). Within this region, the values of \(\varLambda _i\) select the direction of steepest descent, exactly as in the case of \(\varGamma\). An example is of this procedure is shown in Fig. 2.
With this, we have recast the minimization problem into the minimization of the function
From this point on, one can proceed with the usual method of minimization by derivatives, with the added detail that the domain over which the \(\alpha _i\) range, henceforth denoted by \(\Omega\), is bounded as one can infer from (3). The critical points are obtained by solving the system of equations
where \(\partial _{i} = \frac{\partial }{\partial {\alpha _i}}\). We can cancel the prefactor of \(V_2\) by noticing that for any phenomenologically acceptable solution we must have that \(V_2 > 0\) in order for the minimum to occur away from the origin leading to massive scalars. Eliminating it from (13), we obtain
As a technical remark, when one considers more than one of the \(M_i \ne 0\) it is usually easier to begin by solving
due to the fact that (14) still admits as a solution \(V_2 = V_4 = 0\), which we know to be unphysical and is removed by (15). This step of the process may be performed numerically.
3 An example: \(S_4\) symmetric 3HDM
We now detail the application of this method to the \(S_4\) symmetric 3HDM with nonzero \((M_3, M_8)\). This symmetry group was chosen due to the fact that it has received extensive treatment with the geometric method, and thus has a large set of results we can build upon. Regarding the choice of \(M_i\), it was motivated by the specific form of the potential \(V_{S_4}\), leading to a simpler situation in which to illustrate the procedure.
3.1 Fully symmetric summary
In the notation of (2), the \(S_4\) symmetric potential is given by [12]
The complete orbit space for \(S_4\) has been obtained in [12]. Here, however, we are interested only in its neutral part. This corresponds to the dashed region that will appear later in Fig. 4.
The set of bounded from below (BFB) conditions for \(S_4\) have also been obtained, even in the case of soft-symmetry breaking. In a compact form, they read [15]
where for every condition one needs to add its counterpart obtained by the substitution \(\varLambda _1 \rightarrow \varLambda _3\). Note that (17) implies that all of the conditions must be met simultaneously. These will be of use in excluding phenomenologically forbidden parameter regions from the analysis to come.
3.2 Soft-symmetry breaking
As mentioned, we will focus on the case where we add nonzero \(M_3\) and \(M_8\), meaning that
From now on, we focus only on the neutral subspace. This simplification lets us obtain all of the possible minimum alignments as well as necessary conditions on the parameters for them to be the global minimum. If one wishes to go further and obtain necessary and sufficient conditions, two approaches can be taken: (1) either prove that \(\varXi (\alpha _3, \alpha _8)\) is convex even for non-neutral vacuum configurations, at which point requiring that the squared masses of all the physical particles be positive at the neutral minimum yields the full set of sufficient and necessary conditions, or (2) in case where it is not convex conduct a full analysis of its shape.
With this remark, we proceed with the doublets parameterized as
3.2.1 \(\Omega (y_3, y_8)\)
As discussed, \(y_3\) and \(y_8\) can only assume values within a certain domain \(\Omega (y_3, y_8)\), which we now determine. From their definitions and (19) we obtain that
where we have set \(v_1 ^ 2 + v_2 ^ 2 + v_3 ^ 2 = 1\) without loss of generality. Some straightforward manipulations then lead to
which when evaluated at the minimum and maximum of \(v_1^2\) yields
These inequalities describe an equilateral triangle with sides of length \(\sqrt{3}\) and vertices at \(A=(\sqrt{3}/2,1/2)\), \(B=(-\sqrt{3}/2,1/2)\) and \(C=(0,-1)\). We display this in Fig. 3a, as well as a numerical scan of the values of \(y_3\) and \(y_8\) obtained via a random scan over general neutral doublet configurations. We emphasize this is merely encoding the possible directions of \((v_1,v_2,v_3)\) in terms of \(y_3\), \(y_8\).
We start by restricting the domain of \((M_3, M_8)\). We know a priori that this is possible, due to the fact that the relative ordering of the magnitude of the doublet vevs is unphysical, and as such any specific choice is equally valid.
To see this explicitly, let us consider that \(M_3 < 0\). Then, under the interchange \(\varPhi _1 \leftrightarrow \varPhi _2\) we have that \(y_3 \rightarrow -y_3\) which can be absorbed by \(M_3\), making it a positive quantity. Similarly, the remaining interchanges reflect \((M_3, M_8)\) along the symmetry axes of the triangle of Fig. 3a. With this, we conclude that we can always perform an unphysical transformation on the doublets that brings any pair of \((M_3, M_8)\) to the region defined by
It is useful to introduce the dimensionless parameters
Furthermore, we can combine this result with the \(S_3\) symmetry of the triangle in order to reduce the domain of \((y_3, y_8)\). Since the interchanges \(\varPhi _i \leftrightarrow \varPhi _j\) are symmetries of the potential then \(v_4\) is invariant under them, by definition. Thus, if we consider the specific form of (9) the conclusion is that for all the points connected by reflections along the axis of symmetry of Fig. 3a the one that leads to the lowest value of V is the one that maximizes \(M_i y_i\). Given the specific choice of (24) then the \((y_3, y_8)\) are also constrained by
This particular choice of parameters selects \(v_1> v_2 > v_3\).
As a final step, it is convenient to introduce a new set of independent variables related to \((y_3, y_8)\) by a rotation of \(\pi /6\) radians,
such that the axis \(y_3^{\prime }\) lies along the symmetry axis \(y_3 = \sqrt{3} y_8\). We accompany this with the same rotation on the \((M_3, M_8)\) plane.
As such, the description of the relevant variables and parameters is summarized as
with
This domain is shown in Fig. 3b.
With these parameters, we have that
To avoid confusion, we note in advance that throughout Sect. 4 we will drop the prime superscripts for the \(\alpha _3^{\prime }\), \(\alpha _8^{\prime }\) \(\varepsilon _3^{\prime }\) and \(\varepsilon _3^{\prime }\).
3.2.2 \(\varXi (\alpha _3^{\prime },\alpha _8^{\prime })\)
We now proceed to the characterization of \(\varXi (\alpha _3^{\prime },\alpha _8^{\prime })\). Given the specific form of (16), it is convenient to eliminate \(x_2\) by using (4), which leaves \(x_1\) and \(x_3\) as the quartic variables. Then,
where \(\varLambda _{ij}^{\pm } \equiv \varLambda _i \pm \varLambda _j\). It is important to note how this substitution reduces the number of relevant quartic free parameters from four to three, by “absorbing” \(\varLambda _2\). With both \(\alpha _3^{\prime }\) and \(\alpha _8^{\prime }\) fixed, it is straightforward to verify that
since the primed and unprimed variables are related by a rotation. We thus conclude that \(\varXi (\alpha _3^{\prime },\alpha _8^{\prime })\) will be a horizontal line in the \((x_1, x_3)\) plane. Therefore, the points of \(\varXi\) that minimize \(v_4\) and thus minimize the potential are the extremities of this straight line, which we denote by L and R for the left and right points, and the model will have two different classes of alignments.
Given the form of (31) we can also deduce under which conditions each point will correspond to the minimum: if \(\varLambda _{12}^- > 0\) then the minimum of \(v_4\) lies at L; otherwise, if \(\varLambda _{12}^- < 0\) the minimum lies at R. This finding is in agreement with the definitions of \(\varLambda _1\) and \(\varLambda _2\) in (16), where these parameters couple to the real and imaginary parts of the mixed products of the doublets, respectively. Therefore, if for example \(\varLambda _{12}^- > 0\), equivalent to \(\varLambda _1 > \varLambda _2\), the configurations of doublets with nonzero relative phases lead to an overall smaller contribution to \(v_4\) than their phaseless counterparts at a given magnitude, and as such lead to a smaller value of V.
Thus, we must determine the analytical expressions for R and L. We have that, according to our parameterization in (19),
where we defined \(\delta _{ij}=\omega _i - \omega _j\) and \(q_1 =\frac{v_1}{v_3}\), \(q_2 = \frac{v_2}{v_3}\). The introduction of the ratios \(q_i\) simply reflects the fact that \(x_1\) is a scale-invariant quantity and as such we are always able to eliminate one degree of freedom in the magnitude parameters \(v_i\).
Note that in general one may have alignments in which \(v_3 = 0\), in which case both \(q_1\) and \(q_2\) will diverge. Thus, in the course of our analysis, when we encounter these divergences we need to interpret them as an alignment with null \(v_3\), \((v_1, v_2, 0)\). Thus, following the point just made regarding the magnitude degree of freedom, in this case we turn our attention to the ratio \(q_1/q_2 = v_1/v_2\). If we also find that this quantity diverges, we immediately know that the alignment is of the form \((v_1,0,0)\).
We start by obtaining what conditions on the \(q_i\) are enforced by fixing \((y_3^\prime , y_8^\prime )=(\alpha _3^\prime , \alpha _8^\prime )\). Equating degrees of freedom, we have two \(\alpha\)’s and two q’s, and as such the latter must be fully determined by the former. Explicitly, we have that
whose solution is
Combining these expression with the domain defined in (28) (which is depicted in Fig. 3b) and the above discussion regarding the divergences of these ratios, we can obtain an intuitive understanding of what this domain entails in terms of vev alignments:
-
Inside of it these quantities are finite, and the alignment has three nonzero components \((v_1, v_2, v_3)\)
-
Along the edge \(\alpha _8^{\prime } = (1 - \alpha _3^{\prime }) / \sqrt{3}\) both ratios diverge, and the alignment is of the form \((v_1, v_2, 0)\)
-
At the vertex \((\alpha _3^{\prime },\alpha _8^{\prime }) = (1, 0)\) the ratio \(q_1/q_2\) also diverges, meaning that the vev alignment is of the form \((v_1, 0, 0)\).
Given that fixing \(\alpha _3^{\prime }\) and \(\alpha _8^{\prime }\) fully determines \(q_1\) and \(q_2\), the parameters that determine the extrema of \(x_1\) are the relative phases \(\delta _{ij}\) of the doublets. The maximum value can be inferred by direct observation: if all \(\omega _i=0\) then the cosines achieve their maximum values simultaneously and we must have the maximum of \(x_1\). By substituting \(q_1\), \(q_2\) and null phases, we obtain that
This result is to be expected, since in the case of the fully symmetric model the line \(x_1 + x_3 = 1\) in the \(x_i\) space corresponds to the case where all the doublets have real vevs.
Regarding L, the determination of its analytical expression for \(x_1\) is more involved and as such is deferred to Appendix A. The final result is the piece-wise function
with relative phases defined by
These results are shown in Fig. 4 in both regimes of \(\alpha _3^{\prime }\). We note here that some of these results are obtained numerically.
At this point several remarks are in order. To start, after this analysis we already have a general picture on how the different vev alignments will be like, and under what conditions on the parameters of the potential, without having fully solved the minimization problem. This insight is a direct consequence of the intuitive nature of the geometric method, which carries over from the fully symmetrical case.
Nevertheless, some of the shortcomings of this method must also be recognized, in order for its application to be successful. The first point to be addressed is what parameters \(M_i\) one should choose to study. In our analysis we chose \((M_3, M_8)\) as the symmetry breaking parameters. However, let us suppose that we chose to consider only \(M_8\) as our parameter. In that case, the value of \(x_3\) would not be fixed, and one obtains for \(\varXi (\alpha _8^{\prime })\) a more complex two-dimensional shape. The interested reader can obtain this shape by algebraic manipulations akin to the ones shown in Appendix A. This is in stark contrast with the simple straight line of \(\varXi (\alpha _3^{\prime },\alpha _8^{\prime })\), afforded by the specific expression of the \(S_4\) potential (16), whose analysis then becomes a more straightforward task. Thus, in this case, if one wants to obtain the results for \(M_8\) as the only nonzero parameter it is simpler to start by considering both \(M_3, M_8 \ne 0\) and then set \(M_3 = 0\) in the final results.
This is not to say, however, that one should adopt the strategy of adding as many parameters as possible to restrict \(\varXi (\vec {\alpha })\) to a point and then set all the undesired parameters to zero at the end of the computations. In fact, by doing so we are immediately met with algebraic complications which can stem from a number of sources. The first source is connected with the fact that we have 8 possible \(y_i\) and only 4 independent parameters \(q_1, q_2, \delta _{13}, \delta _{23}\), meaning that it is impossible to fix all \(y_i = \alpha _i\) independently. In fact, what happens is that for more than four nonzero \(M_i\) the values of \(\alpha _i\) become functions of a chosen subset of 4 \(\alpha _j\), whose dependence \(\alpha _i(\alpha _j)\) can be non-trivial. Furthermore, even if one circumvents this, the analytical expressions obtained for the boundary of \(\varXi (\vec {\alpha })\) may themselves be non-trivial, which then makes finding the solutions of (15) a highly complex task. We highlight here that the evident simplicity of the expressions for \(x_1^R\) and \(x_1^L\) is more of a coincidence than a general feature of this method. The method can be applied to other cases, and we have found additional cases where the method leads to analytical expressions—for softly broken \(S_4\) [7] with other SBPs (namely, off-diagonal contributions such as \(r_1\)), and also for softly broken \(\Sigma (36)\) [8]. Nonetheless, in order for the method to be useful, the number of free parameters should be small. In our featured example, the potential invariant under \(S_4\) is softly broken by few additional terms and we are able to visualize the relevant domains in two dimensions. Lastly, we also mention that all of the difficulties from the fully symmetric method [12] are evidently inherited.
The upshot from this discussion is that when applying this method one needs to keep in mind the specific structure of the potential as well as the expressions of \(y_i\), and that finding solvable cases will in general be based on a trial and error approach.
4 Minimization of V
In this section we conduct the detailed minimization of the general form (12). We note that, as mentioned when they were introduced in Sect. 3, in this Section we refer to the rotated \(\alpha _3^{\prime }\), \(\alpha _8^{\prime }\) \(\varepsilon _3^{\prime }\) and \(\varepsilon _3^{\prime }\) instead as \(\alpha _3\), \(\alpha _8\), \(\varepsilon _3\), \(\varepsilon _8\).
We have \(v_2\) given by (30). Regarding \(v_4\), the results of the previous section show that two different cases must be considered, namely
and
The general minimization procedure is as follows.
We start by the identification of the critical points of \(V(\alpha _3, \alpha _8)\), by solving (15). Let us suppose that we have found the set of such solutions, S, possibly with more than one element. Then, in order for these critical points to be able to correspond to the true minimum of the potential V they must simultaneously meet the validity requirements:
-
(a) A positive-definite Hessian evaluated at that point
-
(b) The point must lie inside of the domain defined by (28)
In general, these requirements will place constraints in both the \(\varLambda _i\) and the \(\varepsilon _i\) to lie within a certain region of their respective spaces.
Now, if S has only one element and for a given set of parameters of the potential it passes both (a) and (b) it is automatically the minimum of the potential. If, however, multiple solutions to (15) satisfying (a) and (b) exist one needs to further compare them in order to determine the solution that yields the lowest value of the V, which will in general impose further conditions on the parameters of the potential.
However, if no element of S meets both (a) and (b) we turn to the analysis of the critical points along the edges of the domain, and proceed in exactly the same manner as for the “regular” critical points, forming the set of solutions to (15), now denoted by E, to which (a) and (b) are again imposed. Note that in this case (a) reduces to a positive second derivative, since along the edge V becomes a single variable function.
When none of the elements of E are found to comply with the above conditions then we turn to the vertices of the domain. Here, we simply need to determine which vertex leads to a lower value of the potential, and under what conditions.
4.1 \(\varLambda _{12}^- < 0\)
Let us start with the simpler case of \(\varLambda _{12}^- < 0\). As stated,
It is important to highlight that from the already reduced three parameter dependence in (31), this expression further reduces the number of free parameters to two, completely eliminating the dependence on \(\varLambda _2\). This means that in this case the analytical results for the minimum will depend only on two parameters, and the BFB conditions further demand that \(\varLambda _{01}^+ > 0\).
Applying the generic form of equation (15) leads to the single solution \(S_1\)
where we have defined
It turns out that this parameter solely determines the concavity of \(V(\alpha _3, \alpha _8)\), so its detailed analysis is in order. As mentioned, the BFB conditions force \(\varLambda _{01}^+ > 0\). Regarding \(\varLambda _{31}^-\), up until now we have found no explicit requirements for it. Therefore, any possible restrictions must be imposed by the BFB conditions. In order to determine if any such restrictions exist, we write \(\varLambda _3\) as
Substituting into (17) we find the particularly interesting condition
Since \(\varLambda _{01}^+ > 0\), we must have that \(\overline{\varLambda } > -1\), meaning that the BFB conditions imply that this parameter is also (coincidentally) bounded from below. Regarding the remaining BFBs, a simple graphical analysis on the \((\varLambda _0, \varLambda _1)\) plane shows that whatever the value of \(\overline{\varLambda } > -1\) that one chooses, it is always possible to find values of \(\varLambda _0\) and \(\varLambda _1\) such that all BFB are met at the same time. As such, we conclude that there exists this single lower limit on \(\overline{\varLambda }\). Following along the general procedure, imposing (a) and (b) simultaneously results in
This implies that once again we are met with two different regimes, separated by the sign of \(\overline{\varLambda }\).
4.1.1 \(\overline{\varLambda } > 0\)
In this case, \(S_1\) (from (45)) corresponds to the minimum, provided that
These conditions ensure that \(S_1\) lies inside the allowed domain for \(\alpha _3\) and \(\alpha _8\). If they are not met we turn to the edges of the domain. However, in this specific case we can forego the full analysis of its three sides by noticing that when we break these conditions the critical point of an unrestricted version of \(V(\alpha _3,\alpha _8)\) lies the closest to the edge defined by \(\alpha _8 = (1 - \alpha _3)/\sqrt{3}\), and therefore this edge must lead to the lowest value of V at its critical point. Denoting this point by \(E_1\), we obtain by solving (15) that its explicit expression is
For this point to be inside the allowed domain we need \(\frac{1}{4} \le \alpha _3 \le 1\). From its expression and the constraints in (28) it is clear that the left inequality is always satisfied. Regarding the right inequality, it is met when
If the conditions on \(\varepsilon _8\) are not satisfied, then the minimum lies at the point \((\alpha _3,\alpha _8)=(1,0)\).
4.1.2 \(-1< \overline{\varLambda } < 0\)
Contrary to the previous case, \(S_1\) is discarded. A simple analysis of the three edges of the domain leads to the same conclusion. As such, it follows that the minimum is at a vertex. Comparing the values of V at the three vertices it is straightforward to verify that the point \((\alpha _3, \alpha _8)=(1,0)\) always leads to the smallest value of the potential, and is therefore the minimum. Thus, in this case we are presented with a rigid structure for the vev alignment, regardless of the specific values of the parameters.
4.1.3 The role of \(\overline{\varLambda }\)
A remarkable feature of the results just shown is their exclusive dependence on \(\overline{\varLambda }\), in the sense that this is the only combination of the quartic parameters that appears in the expressions and inequalities we have derived. In this sense, this parameter fully encapsulates the influence of the quartic coupling parameters in the selection of the minimum alignment, effectively mapping the initial two-dimensional dependence (in \(\varLambda _{01}^+\) and \(\varLambda _{31}^+\)) to a single dimension.Footnote 4 Furthermore, it greatly facilitates one’s understanding of what the BFB conditions entail in terms of allowed parameter ranges, by allowing a factorization of the four quartic parameters into \(1+3\), leading to a more tractable geometric treatment. Lastly, it also acts as a sort of discriminant for the model, since its sign separates two different alignment regimes.
4.2 \(\varLambda _{12}^- > 0\)
Here, our starting point is
where \(\varLambda _{01}^+> 0\), \(\varLambda _{12}^- > 0\) and \(\varLambda _{31}^-\) can assume positive or negative values. Computing the solutions of (15), we arrive at the single solution \(S_1\)
for \(0 \le \alpha _3 \le \frac{1}{4}\), and
for \(\frac{1}{4} < \alpha _3 \le 1\). In these expressions we have defined
As we will see these parameters play the same role as \(\overline{\varLambda }\) on the previous analysis. This particular choice’s motivation was that it allows for a simple identification of the valid ranges of \((\varepsilon _3, \varepsilon _8)\). For example, from (55) it is evident that \(\varepsilon _3 \le \overline{\varLambda }_1\) for the critical point to lie inside the domain. With the adequate parameters defined, the next step is the determination of the restrictions imposed on them both by \(\varLambda _{12}^- > 0\) and by the BFB conditions in (17). Inverting the relations (58) we obtain that
Requiring positivity of \(\varLambda _{12}^-\) and taking into account that the BFB conditions imply that \(\varLambda _{01}^+ > 0\) we encounter two possibilities:
Now, when we substitute \(\varLambda _2\) and \(\varLambda _3\) as functions of \(\overline{\varLambda }_1\) and \(\overline{\varLambda }_2\) into the full set of BFB conditions, three of them involve solely \(\overline{\varLambda }_1\), \(\overline{\varLambda }_2\) and \(\varLambda _{01}^+\), namely
Combining (61) and (62) we can immediately conclude that \(\overline{\varLambda }_1 > -1\). Note here the identical lower bound to \(\overline{\varLambda }\) of the previous section, despite their different definitions. Regarding the two other conditions we must separate their analysis according to the two possible cases identified for \(\varLambda _{12}^- > 0\)
-
I.
In this case, from (61) comes that \(\overline{\varLambda }_2 > - 4\), since its denominator is positive by hypothesis. Furthermore, this also implies that (63) reduces to \((1 + \overline{\varLambda }_1)(1 + \overline{\varLambda }_2) > 0\), imposing \(\overline{\varLambda }_2 > -1\) which is a stronger condition than \(\overline{\varLambda }_2 > -4\), and as such takes precedence over it.
-
II.
Here, the negative denominator of (61) implies that \(\overline{\varLambda }_2 < -4\). Regarding (63), it now reads \((1 + \overline{\varLambda }_1)(1 + \overline{\varLambda }_2) < 0\), and thus is satisfied when \(\overline{\varLambda }_2 < -1\). Contrary to I this condition is now the least restraining, meaning that we take \(\overline{\varLambda }_2 < -4\).
.Summarizing the results one is then led to the two disjoint regions depicted in Fig. 5. Regarding the remaining 7 BFB conditions, a similar graphical analysis to the case of \(\varLambda _{12}^- < 0\) in the \((\varLambda _0, \varLambda _1)\) plane shows that they do not further constrain the \(\overline{\varLambda }\)—with their values already obeying the BFB as outlined above, one is always able to find values of \(\varLambda _0\) and \(\varLambda _1\) such that all BFB are simultaneously met. Now, imposing (a) and (b) on top of the results for \(\overline{\varLambda }_1\) and \(\overline{\varLambda }_2\), we obtain the single restriction for \(S_1\) in (55) and (56), (57)
on the quartic parameters.
At this point, we could proceed as in the previous section, considering two different cases according to the sign of \(\overline{\varLambda }_1\). However, the exposition of results becomes clearer if we instead further subdivide the \((\overline{\varLambda }_1,\overline{\varLambda }_2)\) plane according to the validity of the critical points at the edges. A simple analysis leads to the regions shown in Fig. 5. Within these, the following critical points are valid:
- \(\mathbf {R_A}\)::
-
This region is defined by \(\overline{\varLambda }_2 > 0, \; 0 < \overline{\varLambda }_1 \le \frac{\overline{\varLambda }_2}{4}\). Here, \(S_1\) is valid, for both (55) and (56), (56). Furthermore, all critical points along the edges are valid as well.
- \(\mathbf {R_B}\)::
-
This region is defined by \(\overline{\varLambda }_2 > 0, \; -1 < \overline{\varLambda }_1 \le 0\). Only the critical points along the edges \(\alpha _8 = (1- \alpha _3)/\sqrt{3}\) and \(\alpha _8 = 0\),\(\alpha _3>1/4\) are valid.
- \(\mathbf {R_C}\)::
-
This region is defined by \(\overline{\varLambda }_2< 0, \; -1 < \overline{\varLambda }_1 \le \frac{\overline{\varLambda }_2}{4}\). No critical point is valid.
- \(\mathbf {R_D}\)::
-
This region is defined by \(\overline{\varLambda }_2< -4, \; 0 < \overline{\varLambda }_1\), and has the same critical points as \(R_A\).
- \(\mathbf {R_E}\)::
-
This region is defined by \(\overline{\varLambda }_2< -4, \; -1< \overline{\varLambda }_1 < 0\), and has the same critical points as \(R_B\).
With this partition of the \((\overline{\varLambda }_1,\overline{\varLambda }_2)\) parameter space it is clear that we now have five different cases to analyse. We simply present the results without proof, since obtaining them is a simple matter of imposing conditions (a) and (b) and the manipulation of inequalities, which would clutter the exposition of results. We do however note that in the treatment of \(S_1\) (of (56) and (57)) it is easier to analyse the properties of the Hessian if one studies its trace and determinant rather than the eigenvalues directly, since the former are significantly simpler than the latter. Before we proceed, it is convenient to introduce the analytic expression for the critical point along the edge defined by \(\alpha _8 = ( 1 - \alpha _3 ) / \sqrt{3}\):
4.2.1 \(\mathbf {R_A}\)
The critical point \(S_1\) in both forms (55) and (56), (57) is valid if
If the SBPs are such that these conditions are not met, then the minimum occurs at one of the edges. Comparing the values of the potential at each of these one obtains that the deepest one occurs at the one defined by \(\alpha _8 = ( 1 - \alpha _3 ) / \sqrt{3}\), valid if
Otherwise, the vertex \((\alpha _3, \alpha _8)=(1, 0)\) is the minimum of the potential.
4.2.2 \(\mathbf {R_B}\)
Within this region, both forms of \(S_1\) (55) and (56), (57) cease to be valid, and we turn to the critical points E. Of those, the edge \(\alpha _8 = (1 - \alpha _3)/\sqrt{3}\) always leads to the lowest value of the potential. It is valid if, as before,
Otherwise, the minimum occurs at the vertex \((\alpha _3, \alpha _8)=(1, 0)\).
4.2.3 \(\mathbf {R_C}\)
In this case, since neither S nor E have valid elements we conclude that the minimum must lie at a vertex. Comparing the value of the potential at the three points, the conclusion is that \((\alpha _3, \alpha _8)=(1,0)\) produces the minimum alignment.
4.2.4 \(\mathbf {R_D}\)
Here, \(S_1\) of (55) and (56),(57) is valid, under the conditions
Note that these are almost identical to the situation with positive \(\overline{\varLambda }_2\), differing only in the missing upper bound on \(\varepsilon _3\). If these conditions are not met, then the minimum is at the point given by (65), (66).
4.2.5 \(\mathbf {R_E}\)
Finally, within this region we have that only E has valid critical points, and of those the one of (65), (66) always yields the global minimum alignment, without further restrictions on \(\varepsilon _3\) and \(\varepsilon _8\).
4.2.6 Summary of results
Having determined all the possible minima and the conditions under which they occur, in this section we compile all the above inequalities on the SBPs into graphical representations on the \((\varepsilon _3,\varepsilon _8)\) plane, in which we also overlay the results of a numerical minimization of (12), with the common parameters \(M_0 = 5\), \(\varLambda _{01}^+ = 4\) and \(0 \le \varepsilon _3 < 0.15\), \(0 \le \varepsilon _8 < 0.1\). These ranges were simply chosen in order to obtain representations in which all features are clearly visible. Regarding the colouring, blue corresponds to \(S_1\); in the case \(\varLambda _{12}^-> 0\) where \(S_1\) has two separate analytical expressions we differentiate between them by assigning the darker tone to \(\alpha _3 \le 1/4\), where it is given by (55), and the lighter to \(1/4< \alpha _3 < 1\), where its expression is (56), (57); red corresponds to \(E_1\) and orange to the vertex (1, 0).
We depict the case of \(\varLambda _{12}^- < 0\) in the diagrams of Fig. 6 and the case of \(\varLambda _{12}^-> 0\) in Fig. 7.
From these results we highlight the similarities between both signs of \(\varLambda _{12}^-\), afforded by the specific definitions of the auxiliary parameters \(\overline{\varLambda }\), \(\overline{\varLambda }_1\) and \(\overline{\varLambda }_2\).
Although we do not consider here in further detail the phenomenological implications of these results, it is straightforward to perform an analysis on the observables, such as the masses of the physical states, their couplings and decays (see, e.g. [8]).
5 Conclusion
We have extended the method of Geometric Minimization to softly broken Multi-Higgs Doublet Model potentials. This powerful method is very appropriate to potentials invariant under large symmetries, which contain few parameters, but it works well with the soft-breaking parameters and allows the classification of the possible vacuum expectation values.
The new methodology contrasts to the existing proposal as it does not consist in employing a unitary transformation that would bring back the softly broken quadratic terms to canonical form, but in doing so, introduce complications in the quartic terms. Instead, we propose to perform a constrained geometric minimization by keeping the symmetric quartic terms while considering the effect that the softly broken quadratic terms have in the alignment of the minimum.
The advantages of these geometric methods are that they enable obtaining analytical expressions. We exemplify the new method and this advantage by considering a softly broken \(S_4\) invariant potential with specific soft-breaking parameters. In the following analysis, we show the relevance of characterizing the allowed domains, and systematically present the minimum for each relevant region of parameter space, as divided by analytical inequalities that apply to the quartic couplings. As we note, the conditions obtained from requiring the potential to be bounded from below powerfully restrict the quartics. We stress that, in our examples, a careful definition of the relevant combinations of quartics allows to greatly simplify the respective inequalities into two main regions of parameter space, one which splits into two classes and the other into five classes of behaviour in terms of the possible minima.
The method can be applied to other cases (indeed, we were able to obtain analytical expressions for cases of the softly broken \(\Sigma (36)\) potential and for more cases in the softly broken \(S_4\) potential). Nonetheless, as was the case with the geometric minimization of highly symmetric potentials, the usefulness of the method is somewhat limited by the number of free parameters.
Data availability statement
No data associated in the manuscript.
Notes
Another method has been proposed in [13].
The \(\varLambda _{i}\) are now functions of the various \(\varLambda _{ij}\) in (6), whose specific form is dictated by the specific symmetry one works with. We have that \(x_0 = 1\) by convention meaning that \(\varLambda _{0}=\varLambda _{00}\).
We introduce the \(\alpha _i\) notation to emphasize that the \(y_i\) are fixed. However, they are functionally equivalent to simply using \(y_i\).
Note that the dependence on \(\varLambda _{01}^+\) has not been eliminated. In fact, it will influence the magnitude of the alignment, \(r_0\).
References
T.D. Lee, Phys. Rev. D 8, 1226–1239 (1973). https://doi.org/10.1103/PhysRevD.8.1226
G.C. Branco, P.M. Ferreira, L. Lavoura, M.N. Rebelo, M. Sher, J.P. Silva, Phys. Rept. 516, 1–102 (2012). https://doi.org/10.1016/j.physrep.2012.02.002. arXiv:1106.0034 [hep-ph]
I.P. Ivanov, Prog. Part. Nucl. Phys. 95, 160–208 (2017). https://doi.org/10.1016/j.ppnp.2017.03.001. arXiv:1702.03776 [hep-ph]]
S. Weinberg, Phys. Rev. Lett. 37, 657 (1976). https://doi.org/10.1103/PhysRevLett.37.657
I.P. Ivanov, E. Vdovin, Phys. Rev. D 86, 095030 (2012). https://doi.org/10.1103/PhysRevD.86.095030. arXiv:1206.7108 [hep-ph]
N. Darvishi, M.R. Masouminia, A. Pilaftsis, Phys. Rev. D 104(11), 115017 (2021). https://doi.org/10.1103/PhysRevD.104.115017. arXiv:2106.03159 [hep-ph]
I. de Medeiros Varzielas, I.P. Ivanov, M. Levy, Eur. Phys. J. C 81(10), 918 (2021). https://doi.org/10.1140/epjc/s10052-021-09681-w. arXiv:2107.08227 [hep-ph]
I. de Medeiros Varzielas, D. Ivo, Eur. Phys. J. C 82(5), 415 (2022). https://doi.org/10.1140/epjc/s10052-022-10331-y. arXiv:2202.00681 [hep-ph]
E. Derman, D.R.T. Jones, Phys. Lett. B 70, 449–452 (1977). https://doi.org/10.1016/0370-2693(77)90411-7
E. Derman, Phys. Lett. B 78, 497–500 (1978). https://doi.org/10.1016/0370-2693(78)90496-3
E. Derman, Phys. Rev. D 19, 317–329 (1979). https://doi.org/10.1103/PhysRevD.19.317
A. Degee, I.P. Ivanov, V. Keus, JHEP 02, 125 (2013). https://doi.org/10.1007/JHEP02(2013)125. arXiv:1211.4989 [hep-ph]
I. de Medeiros Varzielas, S.F. King, C. Luhn, T. Neder, Phys. Lett. B 775, 303–310 (2017). https://doi.org/10.1016/j.physletb.2017.11.005. arXiv:1704.06322 [hep-ph]
I.P. Ivanov, C.C. Nishi, JHEP 01, 021 (2015). https://doi.org/10.1007/JHEP01(2015)021. arXiv:1410.6139 [hep-ph]
I.P. Ivanov, F. Vazão, JHEP 11, 104 (2020). https://doi.org/10.1007/JHEP11(2020)104. arXiv:2006.00036 [hep-ph]
Acknowledgements
IdMV acknowledges funding from Fundação para a Ciência e a Tecnologia (FCT) through the contract UID/FIS/00777/2020 and was supported in part by FCT through projects CFTP-FCT Unit 777 (UID/FIS/00777/2019), PTDC/FIS-PAR/29436/2017, CERN/FIS-PAR/0004/2019 and CERN/FIS-PAR/0008/2019 which are partially funded through POCTI (FEDER), COMPETE, QREN and EU. DI acknowledges funding from Fundação para a Ciência e a Tecnologia (FCT) through project CERN/FIS-PAR/0008/2019 and NuPhys - CERN/FIS-PAR/0004/2019.
Funding
Open access funding provided by FCT|FCCN (b-on).
Author information
Authors and Affiliations
Corresponding author
Appendix: Minimization of \(x_1\)
Appendix: Minimization of \(x_1\)
In this Appendix we detail the determination of the left extremity of \(x_1\). The first step is the linearization of (33) by means of the cosine double angle identity, leading to
where
Differentiating \(x_1^\delta\) w.r.t \(\delta _{13}\) and \(\delta _{23}\) and equating to zero, one obtains
where we know from the main text that \(q_1 \ne 0\). Subtracting (77) from (78) leads to
Taking (77), expanding the sine of the difference and using (79) results in
if one chooses to keep \(\sin 2\delta _{13}\) by using (79), and otherwise to
where we used \(\cos 2\delta _{13} = \pm \sqrt{1 - \sin ^2 \delta _{23}}\). Let us start by considering the second factor equal to zero. Some simple manipulations lead to the single solution
where \(\cos 2\delta _{13}\) was obtained by equating the second factor of (80) to zero. However, the existence of this solution is dependent on its boundedness. Demanding that both cosines be bounded leads to the simple constraint
We highlight here that the simplicity of this result reinforces the usefulness of the rotation introduced in (27).
Substituting this solution into (73) and making use of (79), we obtain that
This means that this solution must be the minimum of \(x_1\), given the fact that within \(\varGamma\) in this range of \(\alpha _3^\prime\) we have obtained the smallest value of \(x_1\) for a given \(x_3\). If, however, \(\alpha _3^\prime > \frac{1}{4}\) we turn to the first term in (81), which results in
forcing
with both k and l integers. Given the periodicity of \(\pi\) of \(x_1\) and our knowledge that \(k = l = 0\) leads to the maximum of \(x_1\), we are left with three possibilities,
The last possibility is immediately excluded due to \(\alpha _8^\prime > 0\). Regarding the first two, some simple algebraic manipulations show that \((k,l) = (1, 0)\) is always smaller within the domain, for \(\alpha _3^\prime > \frac{1}{4}\). Thus, the final result for the minimum of \(x_1\) is the one shown in (40).
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
de Medeiros Varzielas, I., Ivo, D. Geometric minimization of softly broken potentials. Eur. Phys. J. Plus 138, 81 (2023). https://doi.org/10.1140/epjp/s13360-023-03670-6
Received:
Accepted:
Published:
DOI: https://doi.org/10.1140/epjp/s13360-023-03670-6