Abstract
It seems that the literature suggests to go in two opposing directions simultaneously. On the one hand, many papers construct basis-independent quantities, since exactly these quantities appear in the expressions for observables. This means that the mixing angles such as \(\tan \beta \) in the generic Two Higgs Doublet Model must drop out when calculating anything physical. On the other hand, there are many attempts to renormalize such mixing angles – this is in the opposite direction to basis-independence. This basis-dependent approach seems to bring gauge-dependence and singular behaviour, both of which are required to be absent in mixing renormalization. Most importantly, mixing angle counterterms single out a preferred basis and further basis rotations lead to inconsistencies. In contrast, we argue that the bare mixing angles should be identified with the renormalized ones – this is the basis-independent approach – such that all the mixing renormalization requirements are fulfilled in a trivial and consistent manner.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Nowadays, the renormalization procedure is mostly well-established and is no longer considered to just “sweep infinities under the rug”, however, this establishment is not complete. For example, it does not seem that there is an agreed-upon recipe for the renormalization of mixing angles and the literature suggests a myriad of renormalization schemes [1,2,3,4,5,6,7,8,9,10,11] to name a few. Even more so, there appears to exist two different philosophies regarding the renormalization of mixing angles, sometimes even used simultaneously [9] or proposed as alternatives [10]. This is a rather unpleasant situation since particle mixing is present already in the quark sector of the Standard Model (SM) as well as in nearly all models with extended scalar sectors as compared to the SM.
In slightly more detail, the two renormalization approaches differ in whether the mixing angles receive counterterms or not. The more common treatment is to introduce mixing angle counterterms, which are rather inevitably related to the field renormalization (e.g. [1]). In turn, this causes these mixing counterterms to be gauge-dependent – an unwanted feature – such that additional effort must be put in to separate the gauge-independent part (e.g. [7]). The less common approach is to trade the mixing matrix counterterms for the off-diagonal mass matrix counterterms such that the bare mixing matrix is already renormalized (e.g. [9]). It seems that the latter, although not as popular, does not introduce downsides such as unwanted gauge-dependence.
The fact that there are two rather different philosophies, one of them in general leading to gauge-dependent mixing angle counterterms, seems to be an expression of the fact that mixing angles are basis-dependent and, therefore, not physical quantities. For example, this has been rather explicitly noted in [12, 13] at tree-level when considering basis-independent methods for the Two Higgs Doublet Model (THDM). An analogous statement on the redundancy of the renormalization of mixing angles was also made in [10] in the context of the THDM. Seeing that mixing angles are basis-dependent is simple, for example, the flavour basis of the SM has no mixing matrices, but rotation to the quark mass-eigenstate basis produces the quark mixing matrix. Of course, many other bases where the quarks are not in their mass-eigenstates also contain some mixing matrix. The not so simple point, which seems to cause a lot of confusion, is whether and how to renormalize these basis-dependent quantities.
In this work we do not intend to propose a particular renormalization scheme, instead we want to establish a conceptually consistent philosophy for the renormalization of mixing angles such that renormalization schemes can later be constructed. All of our upcoming arguments are geared to highlight renormalization scheme independent structures. For example, in Sect. 3.2 we argue that the mixing matrix counterterms are naturally associated with gauge-dependent structures independently of a renormalization scheme, while a specific scheme may make use of this structure or not. Further, given the scheme-independent arguments, we conclude that schemes in fact should make use of these structures by not associating counterterms to mixing angles as is done in, e.g. [14], where we propose a renormalization scheme for fermions. The absence of mixing angle counterterms seems to offer all of the required properties for mixing renormalization [5, 11, 15] and is a step towards basis-independence. Therefore, we consider this approach to be the consistent one and the one that should be used in practice over the more common approach with counterterms for mixing angles.
The paper is structured as follows: Sect. 2 introduces nearly all the needed notation and relations, Sect. 3 is then dedicated to providing arguments for having the mixing angle counterterms set to 0. In particular, Sect. 3.1 is based on basis-independence arguments, Sect. 3.2 discusses the gauge-dependence and Sect. 3.3 considers the degenerate mass limit. In Sect. 4 we give our conclusions.
2 Basis rotations and renormalization
In this section we set up the discussion of mixing, mass, and field renormalization by generalizing the discussion found in [10], while more specific arguments will be given in further sections.
For simplicity, let us consider a system of real scalar fields
where the 0 (sub)superscripts indicate that the fields are bare. Now, one may relate the fields \(\varvec{\phi }_0\) in the initial basis to some other basis of the fields \(\varvec{h}_0\) via an orthogonal rotation matrix \(\varvec{R}_0\)
Considering the kinetic term in the Lagrangian in momentum space we may write this relation as
where T in the superscript stands for transposition, \(p^2\) is the squared momentum, \(\varvec{M}^2_0\) (\(\varvec{{\widetilde{M}}}^2_0\)) is the bare mass-squared matrix in the \(\varvec{\phi }_0\) (\(\varvec{h}_0\)) basis, which is in general not diagonal. We have used
in the momentum term and defined
Apart from performing basis rotations, the fields may be renormalized
Here \(\varvec{Z}\) is the field renormalization constant, \(\delta \varvec{Z}\) is the corresponding counterterm that can be considered to be of 1-loop order, and \(\varvec{\phi }\) stands for the vector of renormalized fields. Analogously, the fields \(\varvec{h}_0\) may also be renormalized
The renormalization procedure also requires counterterms for the mass matrices
where \(\varvec{M}^2\left( \varvec{{\widetilde{M}}^2}\right) \) is the renormalized mass matrix and the \(\delta \varvec{M}^2\left( \delta \varvec{{\widetilde{M}}^2}\right) \) is the mass matrix counterterm in the \(\varvec{\phi }\left( \varvec{h}\right) \) basis. For the sake of the argument we also introduce mixing matrix counterterms
such that both the bare and the renormalized mixing matrices are orthogonal. The following property stems from orthogonality at 1-loop
Now, we should be able to apply the renormalization procedure to the kinetic term, Eq. (3), in any basis. For example, taking Eqs. (3a) and (3c) we get
where we dropped all the terms non-linear in the counterterms. Alternatively, taking Eq. (3b), where the mixing matrix \(\varvec{R}_0\) is present, leads to the following
where we have
Splitting the field counterterms into the symmetric and anti-symmetric parts
with
and by using Eq. (10) we may rewrite the kinetic term as
where \(\left[ \dots , \dots \right] \) is the commutator. The commutator term shows that the counterterms \(\delta \varvec{R}\) and \(\delta \widetilde{\varvec{Z}}^A\) cannot be determined separately and only the combination \(\varvec{R}^T \delta \varvec{R}+ \delta \varvec{{\widetilde{Z}}}^A\) can be fixed, i.e. the mixing matrix counterterms are degenerate with the anti-symmetric part of the field renormalization, which is a slightly more general version of the statement made in [10]. This degeneracy implies that the mixing may be renormalized through the (anti-symmetric part of the) field renormalization, which is what enables, for example, the scheme in [9]. However, we attempt to make the statement stronger – the mixing angle/matrix counterterms should always be included in the field renormalization. In the following sections we give arguments for why one should set \(\delta \varvec{R} = 0\) by comparing Eqs. (11b), (11d), and (12) in terms of basis-dependence and by discussing gauge-dependence and the degenerate mass limit.
3 Arguments for having \(\delta \varvec{R}=0\)
3.1 Basis independence
Basis-independent methods are often sought after since observables must be expressed in terms of basis-independent quantities, for example, see [12, 13, 16,17,18]. In a similar manner it is desirable for the renormalization procedure to also show some basis-independent features. For example, the form of the renormalized kinetic term in Eqs. (11b) and (11d) is the same although the bases are different – this is welcome. In contrast, the form of Eq. (12) is already different due to additional mixing/rotation matrix counterterms, even though all three equations (should) correspond to the same bare kinetic term.
It is rather simple to see that Eq. (12) can be brought to the form of Eq. (11d), by simply setting \(\varvec{R}_0 = \varvec{R} \Leftrightarrow \delta \varvec{R}=0\) or, equivalently, by redefining the anti-symmetric part of the field renormalization to include \(\varvec{R}^T\delta \varvec{R}\). Once \(\delta \varvec{R}\) no longer appears we may easily equate Eqs. (11d) and (12) and get
Further, Eqs. (11b) and (11d) correspond to the same bare kinetic term if
and
In more detail, with \(\delta \varvec{R}\ne 0\) one is, or at least should be, free to perform a rotation by \(\varvec{R}^T\) on the renormalized fields \(\varvec{h}\) in Eq. (16)
Here \(\varvec{h} ^\prime =\varvec{R} \varvec{h}\),Footnote 1 we have used Eqs. (13) and (18) for the symmetric part of the field renormalization. Evidently, all the terms except for the one with \(\delta \varvec{R}\) contain quantities in the basis of \(\varvec{\phi }\) even though the fields are labeled as \(\varvec{h}^\prime \). This means that one computes identical amplitudes in both the \(\varvec{\phi }\) and \(\varvec{h}^\prime \) bases, except that they are renormalized with different sets of counterterms. The presence of the \(\delta \varvec{R}\) counterterm is the source of inconsistency.
For one thing, because of the \(\delta \varvec{R}\) counterterm the basis rotations of the anti-symmetric part of field renormalization do not seem to follow the same law as the other counterterms. For the symmetric part we could use Eq. (18), while the anti-symmetric part gives
To preserve the same law of basis transformations, Eq. (18), one must have \(\delta \varvec{R}=0\).
For another view at the inconsistency, one easily notices that the \(\delta \varvec{R}\) counterterm in the basis \(\varvec{h}^\prime \) does not have an associated renormalized parameter. This means that it is impossible to form the bare mixing matrix \(\varvec{R}_0\) in the \(\varvec{h}^\prime _0\) basis, i.e. the bare kinetic term no longer follows the form of Eq. (3) and instead becomes
Here we have used the inverse of \(\varvec{h}^\prime _0= \varvec{Z} \varvec{h}^\prime \). The only way to preserve the bare kinetic term and more generally the bare Lagrangian, which defines the theory, is for the commutator term to vanish. However, this gets us back to Eq. (21) and so, setting \(\delta \varvec{R}=0\) preserves not only the form of basis transformations, but also the form of the bare Lagrangian.
The third and final view of the inconsistency may be seen by considering why in Eq. (22) we have \({\mathcal {K}}^\prime \ne {\mathcal {K}}\). We started with the bare kinetic term in Eq. (3a), rotated it by \(\varvec{R}_0\) to Eq. (3b), renormalized it to get Eq. (12), and tried to rotate back into the \(\varvec{\phi }\) basis by \(\varvec{R}^T\). However, instead of Eq. (11b) the rotation took us into Eq. (20) and \({\mathcal {K}}^\prime \) in Eq. (22)! In other words, we see that basis rotations and the renormalization procedure do not commute, i.e. there is a difference if one renormalizes the theory before or after basis rotations. This is a rather awkward feature since there is nothing special about basis rotations or renormalization and we should be working with the same theory in whichever basis we choose to renormalize the theory. In turn, we formulate a consistency condition, which we also imposed in [14], that basis rotations should commute with the renormalization procedure. This condition automatically requires the bare rotations to be identified with the renormalized ones, i.e. \(\varvec{R}_0=\varvec{R}\) and \(\delta \varvec{R}=0\).
The upshot is that having the bare rotation matrix set to the renormalized one, \(\varvec{R}_0=\varvec{R}\), allows to freely change the basis at any point, be it for the bare fields as in Eq. (2) or the renormalized ones in Eq. (19) while keeping the same form of the Lagrangian. Alternatively, this may be rephrased as having a basis-invariant set of counterterms, i.e. upon basis rotations
but not
where \(\delta \lambda \) and \(\delta \widetilde{\lambda }\) stand for the counterterms of other parameters in the theory in the two respective bases.
There is also a formulation in slightly more philosophical terms. One of the main points of the renormalization procedure is that it takes some measurement (observable) as a reference point in order to make the theory predictive. The standard book-keeping device of these measurements are the counterterms. Since the observables must be basis-independent it also makes sense to have a basis-independent set of counterterms – this means \(\delta \varvec{R} = 0\). Of course, one may argue that things such as the Cabbibo–Kobayashi–Maskawa (CKM) matrix [19, 20] elements can be measured and, hence, should receive counterterms. However, the CKM matrix itself can in principle be expressed in terms of the initial (renormalized) mass matrices of the up- and down-type quarks. It is the renormalization of these mass matrices that provides a set of basis-independent counterterms and also ensures cancellations of UV divergences. Put in another way, in a diagonal mass basis, measurement of the mixing angles and masses is the same as measuring the non-diagonal mass matrix in some initial basis. In turn, mixing matrices may still be used as they are a nice way of parameterizing the mixing, but it should not be forgotten that they are derived and basis-dependent quantities and, hence, should not have counterterms.
In the two following sections we show that setting \(\delta \varvec{R}\) to 0 is not only conceptually consistent, but also of practical importance.
3.2 Gauge dependence
Let us consider the case with \(\delta \varvec{R} \ne 0\) and see how it leads to difficulties. One of the requirements for the mixing renormalization is that it should be gauge-invariant [5, 11, 15]. However, this is a rather complicated task because of Eq. (16) and the degeneracy between \(\delta \varvec{Z}^A\) and \(\varvec{R}^T\delta \varvec{R}\). A way to investigate gauge dependence is via the Nielsen Identities [21, 22], which allow to take gauge derivatives of the self-energies.
For concreteness, let us proceed in the basis of the fields \(\varvec{h}\) and consider the 1-loop case, for which the derivative w.r.t. the gauge parameter \(\xi \) of the bare self-energy \(\varvec{\Pi }^0\left( p^2\right) \) is [22]Footnote 2
where \(\varvec{\Lambda }\) is a correlation function involving BRST sources, describes the gauge-dependence of \(\varvec{\Pi }^0\left( p^2\right) \), and is a matrix in flavour space. Just as for the field renormalization in Eq. (14), we may split \(\varvec{\Lambda }\) in its symmetric and anti-symmetric parts, then the Nielsen Identity becomes
Let us also consider the self-energy \(\varvec{\Pi }\left( p^2\right) \) renormalized as in Eq. (16)
Now, we may take the gauge derivative of the renormalized self-energy and arrive at
Here we assumed \(\varvec{{\widetilde{M}}}^2\) and \(\varvec{R}\) to be gauge-independent. It is evident that the field counterterms as well as \(\delta \varvec{R}\) are naturally associated with gauge-dependent structures. In turn, it is rather hard to fix \(\delta \varvec{R}\) in a gauge-independent way since that immediately requires an additional renormalization condition to break the degeneracy between the field and mixing matrix counterterms. Once again, the easiest way around this is to simply set \(\delta \varvec{R}=0\).
In contrast, the mass counterterm \(\varvec{R}^{T}\delta \varvec{M}^{2}\varvec{R}\) is not associated with any gauge-dependent structure and so it can be defined in a naturally gauge-independent way, only non-physical renormalization conditions can induce gauge-dependence in the mass counterterm.
3.3 Non-singular degenerate mass limit
If one keeps \(\delta \varvec{R}\ne 0\) and manages to renormalize it in a gauge-independent way, the counterterm will still be problematic. To see this, let us for simplicity explicitly choose a basis where the mass matrix is diagonal
and take Eq. (27)
Here \(i,\, j\) are flavour indices, the non-bold notation (where appropriate) indicates matrix elements, and the counterterm \(\left( \varvec{R}^{T}\delta \varvec{M}^{2}\varvec{R}\right) _{ij}\) is in general not diagonal even if \(\varvec{{\widetilde{M}}}^2\) is.
Further, the counterterms must cancel the UV divergences in the bare self-energy independently of the chosen scheme, hence, we only take the UV parts, although the arguments carry over to the finite parts without difficulty. In addition, the UV divergences in the bare-self energy must be accompanied by the same structures as the counterterms, since otherwise one could not use the counterterms to cancel the UV divergences, or at least could not do so for every momentum \(p^2\). As the structures with \(p^2-m^2_{i}\) and \(p^2-m^2_{j}\) multiply the symmetric part of the field renormalization, which is not related with \(\delta \varvec{R}\), we simply drop such terms (as indicated by ) for simplicity. With these considerations and as well taking only off-diagonal terms with \(i\ne j\) we have
Here lies part of the problem: in the literature there are many schemes (e.g. [1, 25,26,27,28]) where the off-diagonal mass counterterm \(\left( \varvec{R}^{T}\delta \varvec{M}^{2}\varvec{R}\right) _{ij}\) is set to 0. Another part is that in the degenerate mass limit, i.e. \(m_i\rightarrow m_j\), the bare self-energy in Eq. (31) does not vanish in general, but the mixing and field counterterms are multiplied by \({m_i^2-m_j^2}\), which does vanish in this limit. In such schemes the UV divergences in Eq. (31) must be canceled with the counterterms \(\delta \varvec{R}\) or \(\delta \varvec{{\widetilde{Z}}}^A\), but this is only possible if these counterterms are proportional to \(\left( m_i^2-m_j^2\right) ^{-1}\). In other words, the counterterms \(\delta \varvec{R}\) or \(\delta \varvec{{\widetilde{Z}}}^A\) must be singular in the degenerate mass limit for the cancellation to work out. In turn, these singularities can cause numerical problems, which are required to be absent for the mixing renormalization [11].
Alternatively, the non-diagonal mass counterterm can naturally cancel the non-vanishing terms without being singular. Also note that according to Sect. 3.2 (and with the diagonal mass matrix) the gauge-dependent parts vanish in the degenerate mass limit [11, 29] so that the mass counterterms can be defined in a gauge-independent way. Even when the renormalization is performed in a basis where the (renormalized) mass matrix is diagonal the corresponding counterterm has to be a matrix with possible non-trivial off-diagonal elements depending on the particular model – this avoids singularities in the degenerate mass limit. Out of \(\Pi _{ij}^0\) only terms which are gauge-independent and proportional to \({m_i^2-m_j^2}\) could possibly be included in \(\delta \varvec{R}\) such that it is non-singular and gauge-invariant. However, this is a step towards basis-dependence and it is best to keep \(\delta \varvec{R} = 0\) and to avoid inconsistencies altogether.
Finally, even without considering the degenerate mass limit, the non-diagonal mass counterterms are essential to ensure that all the relevant UV divergences cancel out without the need for a non-trivial \(\delta \varvec{R}\) as is explicitly done in [9, 14]. These explicit schemes with \(\delta \varvec{R} = 0 \), where the UV divergences are properly taken care of, reinforce the more general and scheme-independent arguments laid out in this paper.
4 Conclusions
In this paper we have considered the interplay between basis rotations of the fields and the renormalization procedure. In particular, we have found that adding counterterms to mixing angles is a step towards basis-dependence and introduces various problems. For one thing, counterterms of mixing angles are naturally associated with gauge-dependent structures, while at the same time a gauge-independent definition of them is likely to be singular in the degenerate mass limit. Neither of these two properties are welcome, since the former makes physical amplitudes gauge-dependent and the latter can cause numerical instabilities. More importantly, mixing angle counterterms obstruct the form of basis transformations such that the renormalization procedure does not commute with basis rotations – we see this as an inconsistency and a step towards basis-dependence. In contrast, stepping in the direction of basis-independence by setting mixing angle counterterms to 0 completely avoids inconsistencies together with all the gauge-dependence and singular behaviour problems. We conclude that the basis-independent approach is practically far more simple, consistent and should be taken.
Data Availability Statement
This manuscript has no associated data or the data will not be deposited. [Authors’ comment: No data was generated during this research.]
Notes
For \(\varvec{R}_0=\varvec{R}\) one trivially has \(\varvec{h}^\prime = \varvec{\phi }\).
Note that achieving this form requires the inclusion of tadpole diagrams (or at least the gauge-dependent ones) in the self-energy. This can be understood in terms of the Fleischer-Jegerlehner scheme [23, 24], which ensures that the theory is at the true minimum such that modification of the Nielsen identity with \(\delta _t\) in Eq. (6) in [22] does not arise.
The usual notation of the scale is \(\mu \), but then the derivative might be confused with a Lorentz one so we have opted for the letter s.
The mass for the Goldstone comes from the gauge-fixing term, which is not renormalized in the linear \(R_\xi \) gauges [38].
References
A. Denner, T. Sack, Renormalization of the quark mixing matrix. Nucl. Phys. B 347(1–2), 203–216 (1990). https://doi.org/10.1016/0550-3213(90)90557-T
B.A. Kniehl, A. Pilaftsis, Mixing renormalization in Majorana neutrino theories. Nucl. Phys. B 474(2), 286–308 (1996). https://doi.org/10.1016/0550-3213(96)00280-5
A. Barroso, L. Brücher, R. Santos, Renormalization of the Cabibbo–Kobayashi–Maskawa matrix. Phys. Rev. D 62(9), 096003 (2000). https://doi.org/10.1103/PhysRevD.62.096003
A.O. Bouzas, Mixing-matrix renormalization revisited. Eur. Phys. J. C 20(2), 239–252 (2001). https://doi.org/10.1007/s100520100663
K.-P.O. Diener, B.A. Kniehl, On-mass-shell renormalization of fermion mixing matrices. Nucl. Phys. B 617(1–3), 291–307 (2001). https://doi.org/10.1016/S0550-3213(01)00453-9. arXiv:hep-ph/0109110
A. Denner, E. Kraus, M. Roth, Physical renormalization condition for the quark-mixing matrix. Phys. Rev. D 70(3), 033002 (2004). https://doi.org/10.1103/PhysRevD.70.033002. arXiv:hep-ph/0402130
B.A. Kniehl, A. Sirlin, Simple on-shell renormalization framework for the Cabibbo–Kobayashi–Maskawa matrix. Phys. Rev. D 74(11), 116003 (2006). https://doi.org/10.1103/PhysRevD.74.116003
B.A. Kniehl, A. Sirlin, M.L. Marshak, Novel formulations of CKM matrix renormalization. In: AIP Conference Proceedings, vol. 1182, pp. 327–330 (2009). https://doi.org/10.1063/1.3293809
N. Baro, F. Boudjema, Automatized full one-loop renormalization of the MSSM. II. The chargino–neutralino sector, the sfermion sector, and some applications. Phys. Rev. D 80(7), 076010 (2009). https://doi.org/10.1103/PhysRevD.80.076010. arxiv:0906.1665
L. Altenkamp, S. Dittmaier, H. Rzehak, Renormalization schemes for the Two-Higgs-Doublet Model and applications to h \(\rightarrow \) WW/ZZ \(\rightarrow \) 4 fermions. J. High Energy Phys. 2017(9), 134 (2017). https://doi.org/10.1007/JHEP09(2017)134. arXiv:1704.02645
A. Denner, S. Dittmaier, J.-N. Lang, Renormalization of mixing angles. J. High Energy Phys. 104(11), 61 (2018). https://doi.org/10.1007/JHEP11(2018)104. arXiv:1808.03466
S. Davidson, H.E. Haber, Basis-independent methods for the two-Higgs-doublet model. Phys. Rev. D 72(9), 099902 (2005). https://doi.org/10.1103/PhysRevD.72.099902. arXiv:hep-ph/0504050
H.E. Haber, D. O’Neil, Basis-independent methods for the two-Higgs-doublet model. II. The significance of tan \(\beta \). Phys. Rev. D 74(1), 015018 (2006). https://doi.org/10.1103/PhysRevD.74.015018
S. Draukšas, On-shell renormalization of fermion masses, fields, and mixing matrices at 1-loop. Eur. Phys. J. Plus 138(7), 671 (2023). https://doi.org/10.1140/epjp/s13360-023-04226-4
A. Freitas, D. Stöckinger, Gauge dependence and renormalization of \(\tan \beta \) in the minimal supersymmetric standard model. Phys. Rev. D 66(9), 095014 (2002). https://doi.org/10.1103/PhysRevD.66.095014
S. Davidson, J. Ellis, Basis-independent measures of R-parity violation. Phys. Lett. B 390(1–4), 210–220 (1997). https://doi.org/10.1016/S0370-2693(96)01398-6
S. Davidson, Basis independent parametrisations of R parity violation in the soft SUSY breaking sector. Phys. Lett. B 439(1–2), 63–70 (1998). https://doi.org/10.1016/S0370-2693(98)01006-5
S. Davidson, M. Losada, Basis independent neutrino masses in the R p violating minimal supersymmetric standard model. Phys. Rev. D 65(7), 075025 (2002). https://doi.org/10.1103/PhysRevD.65.075025
N. Cabibbo, Unitary symmetry and leptonic decays. Phys. Rev. Lett. 10(12), 531–533 (1963). https://doi.org/10.1103/PhysRevLett.10.531
M. Kobayashi, T. Maskawa, \(CP\)-violation in the renormalizable theory of weak interaction. Prog. Theor. Phys. 49(2), 652–657 (1973). https://doi.org/10.1143/PTP.49.652
N.K. Nielsen, On the gauge dependence of spontaneous symmetry breaking in gauge theories. Nucl. Phys. B 101(1), 173–188 (1975). https://doi.org/10.1016/0550-3213(75)90301-6
P. Gambino, P.A. Grassi, Nielsen identities of the SM and the definition of mass. Phys. Rev. D 62(7), 076002 (2000). https://doi.org/10.1103/PhysRevD.62.076002. arXiv:hep-ph/9907254
J. Fleischer, F. Jegerlehner, Radiative corrections to Higgs-boson decays in the Weinberg–Salam model. Phys. Rev. D 23(9), 2001–2026 (1981). https://doi.org/10.1103/PhysRevD.23.2001
A. Denner, L. Jenniches, J.N. Lang, C. Sturm, Gauge-independent \( \overline{\rm MS}\) renormalization in the 2HDM. J. High Energy Phys. 2016(9), 115 (2016). https://doi.org/10.1007/JHEP09(2016)115. arXiv:1607.07352
B.A. Kniehl, All-order renormalization of propagator matrix for unstable Dirac fermions. Phys. Rev. D 89(9), 096005 (2014). https://doi.org/10.1103/PhysRevD.89.096005
B.A. Kniehl, All-order renormalization of the propagator matrix for fermionic systems with flavor mixing. Phys. Rev. Lett. 112(7), 071603 (2014). https://doi.org/10.1103/PhysRevLett.112.071603
M. Krause, R. Lorenz, M. Mühlleitner, R. Santos, H. Ziesche, Gauge-independent renormalization of the 2-Higgs-doublet model. J. High Energy Phys. 2016(9), 143 (2016). https://doi.org/10.1007/JHEP09(2016)143. arXiv:1502.07716
A. Denner, S. Dittmaier, Electroweak radiative corrections for collider physics. Phys. Rep. 864, 1–163 (2020). https://doi.org/10.1016/j.physrep.2020.04.001. arXiv:1912.06823
Y. Yamada, Gauge dependence of the on-shell renormalized mixing matrices. Phys. Rev. D (2001). https://doi.org/10.1103/PhysRevD.64.036008. arXiv:hep-ph/0103046
C. Balzereit, T. Hansmann, T. Mannel, B. Plümper, Renormalization group evolution of the CKM matrix. Eur. Phys. J. C 9(2), 197–211 (1999). https://doi.org/10.1007/s100529900040
A. Pilaftsis, Gauge and scheme dependence of mixing matrix renormalization. Phys. Rev. D 65(11), 115013 (2002). https://doi.org/10.1103/PhysRevD.65.115013. arXiv:hep-ph/0203210
A.S. Cornell, L.-X. Liu, Evolution of the CKM matrix in the universal extra dimension model. Phys. Rev. D 83(3), 033005 (2011). https://doi.org/10.1103/PhysRevD.83.033005
J.R. Espinosa, Y. Yamada, Scale- and gauge-independent mixing angles for scalar particles. Phys. Rev. D 67(3), 036003 (2003). https://doi.org/10.1103/PhysRevD.67.036003
P.H. Chankowski, S. Pokorski, J. Rosiek, Complete on-shell renormalization scheme for the minimal supersymmetric Higgs sector. Nucl. Phys. B 423(2–3), 437–496 (1994). https://doi.org/10.1016/0550-3213(94)90141-4
A. Dabelstein, The one-loop renormalization of the MSSM Higgs sector and its application to the neutral scalar Higgs masses. Zeitschrift für Physik C Part. Fields 67(3), 495–511 (1995). https://doi.org/10.1007/BF01624592
S. Kanemura, M. Kikuchi, K. Yagyu, Fingerprinting the extended Higgs sector using one-loop corrected Higgs boson couplings and future precision measurements. Nucl. Phys. B 896, 80–137 (2015). https://doi.org/10.1016/j.nuclphysb.2015.04.015
S. Dittmaier, H. Rzehak, Electroweak renormalization based on gauge-invariant vacuum expectation values of non-linear Higgs representations. Part II. Extended Higgs sectors. J. High Energy Phys. 2022(8), 245 (2022). https://doi.org/10.1007/JHEP08(2022)245
L. Baulieu, Perturbative gauge theories. Phys. Rep. 129(1), 1–74 (1985). https://doi.org/10.1016/0370-1573(85)90091-2
Acknowledgements
The author would like to thank his supervisor T. Gajdosik for reading of the manuscript as well as for helpful comments and discussions. The author also thanks V. Dūdėnas for discussions regarding Appendix A.
Funding
The authors have no relevant financial or non-financial interests to disclose.
Author information
Authors and Affiliations
Contributions
Not applicable.
Corresponding author
Ethics declarations
Conflict of interest
The authors have no relevant financial or non-financial interests to disclose.
Ethics approval
Not applicable.
Consent to participate
Not applicable.
Consent for publication
Not applicable.
Code availability
Not applicable.
Appendices
Appendix A
A note on the renormalization group with \(\delta \varvec{R} = 0\)
The renormalization group running of various parameters can be determined from the counterterms (given dimensional regularization is used), however, our proposal is to set \(\delta \varvec{R}\) to 0. Therefore, in this section we want to clarify what happens with the running of mixing angles when the corresponding counterterm is trivial.
We begin by considering the bare mass matrix in some arbitrary basis, which is renormalization scale (s) independent
Henceforth we will abbreviate the derivative asFootnote 3
Returning to the mass, we may renormalize it and simply get
Changing to some other basis via Eqs. (13) and (17) we have
Next we take the \(\ln s\) derivatives
To simplify a little we multiply by \(\varvec{R}^T\) on the left and by \(\varvec{R}\) on the right, and then use relations stemming from orthogonality, i.e. \(\varvec{R}^T\varvec{R}=\varvec{1}\Rightarrow \partial _s\varvec{R}^T\varvec{R}=-\varvec{R}^T\partial _s\varvec{R}\), such that we arrive at
Here we may restore the bare mass matrix
Here we get that the bare mass matrix depends on the scale s as long as \(\partial _s \varvec{R}\ne 0\), which is in contradiction with general principles. In turn, it must be the case that the rotation matrix does not depend on the scale s
Let us consider Eq. (A6) in more detail in a basis where \(\varvec{{\widetilde{M}}}^2=\mathrm {diag.}\left( m_1^2,\dots , m_n^2\right) \) as in Eq. (29). In addition, let us keep only 1-loop contributions, which means that the terms like \(\sim \delta \varvec{{\widetilde{M}}}^2\varvec{R}^T\partial _s\varvec{R}\) are automatically of higher order and can be neglected. Even more so, as we have not specified a scheme, the renormalized masses \(m_i\) are not necessarily the physical (pole) masses \({\mathfrak {m}}_i\), but can receive corrections. However, when taken non-trivial, \(\partial _s \varvec{R}\) is at least of 1-loop order so that we can think of the renormalized masses as the physical ones and the corrections contribute only at higher orders. With these considerations Eq. (A6) becomes
The consistent option, which ensures that the bare mass matrix does not run with scale, is to take Eq. (A8) such that Eq. (A9) becomes
This means that once the renormalized mass is diagonalized at some scale, it will not remain diagonal at some other scale, i.e. \(\varvec{{\widetilde{M}}}_{ij}(s_0)=\mathrm {diag.}\) and \(\varvec{{\widetilde{M}}}_{ij}(s_1)\ne \mathrm {diag.}\) Of course, at the new scale \(s_1\) one can perform another basis change such that the renormalized mass is again diagonal.
The inconsistent option is to keep the running of \(\varvec{R}\), which, however, allows to keep the renormalized mass diagonal at all scales so that we have for the diagonal components ( \({i=j}\))
and for the off-diagonal terms ( \({i\ne j}\)) we have
With this option one can in principle define the running of rotation (mixing) matrices, but this immediately brings back the problematic factor of \(\left( \mathfrak {m}_i^2-\mathfrak {m}_j^2\right) ^{-1}\), which is associated with gauge-dependence and the singular mass limit as per Sects. 3.2 and 3.3. This factor is also found in works by other authors, for example, in [30,31,32] in the context of fermions and for scalars in [33]. If Eq. (A12) is used, in a roundabout way it also defines a mixing matrix counterterm by relating it to the mass counterterm, which is similar to the approach of [7]. However, as per discussion in Sect. 3, mixing matrix counterterms should not be used.
In conclusion, in a fully consistent treatment the mixing angles should not run with the scale, although that means the running of all elements of the mass matrix. It may seem that the diagonalization of the renormalized mass matrix at every scale implies the running of the rotation matrix, but this implication is rather put in by hand. In principle, at every scale one is free to choose any basis so that between scales the rotation matrices are not related and, therefore, there is no natural running. One can only impose this running, by selecting a class of rotation matrices that diagonalizes the renormalized matrix at every scale, but this is non-physical and brings the singular factor of \(\left( \mathfrak {m}_i^2-\mathfrak {m}_j^2\right) ^{-1}\) in Eq. (A12) (after expressing \(\partial _s \varvec{R}\)).
Appendix B
A note on \(\tan \beta \) in the THDM
Having mentioned the THDM mixing angle \(\beta \) in the abstract we see it fitting to make a few comments regarding this angle and our proposal to set the corresponding counterterm to 0. For simplicity, let us assume CP-conservation such that there is no mixing between the CP-odd and CP-even states. Then, given the bare Higgs doublets (\(i=1,2\))
we have
Here the angle \(\beta \) rotates to the mass eigenstates of the charged and CP-odd bosons, while the angle \(\alpha \) rotates to the CP-even mass eigenstates, \(G^{0\,\pm }\) and \(G^0\) are the charged and neutral bare Goldstone bosons, h, H are the CP-even mass eigenstates, \(A^0\) is the CP-odd one, and \(H^{0\pm }\) is the charged one. \(v_i\) are the respective vacuum expectation values.
The literature on the renormalization of angles \(\alpha \) and \(\beta \) is rather vast, e.g. [10, 11, 15, 24, 27, 34,35,36,37]. Both angles are used to rotate to the mass eigenstates, but the literature seems to treat them slightly differently. For example, one can find approaches, where the angle \(\alpha \) has a trivial counterterm like in the appendix of [10]. On the other hand, the angle \(\beta \) seems to always get a counterterm, perhaps, because one always wants to have a clear handle on the physical and non-physical degrees of freedom and one usually starts from a basis where the Goldstone bosons together with the angle \(\beta \) are evident. In such a basis one usually renormalizes the angle \(\beta \). On yet another hand, one could choose the so-called Higgs-basis, where the angle \(\beta \) simply does not appear and in turn does not require a counterterm.
Given the discussion in this paper, it should be possible to set both counterterms \(\delta \alpha \) and \(\delta \beta \) to 0 while still cancelling all the relevant UV divergences. The remedy seems to be the introduction of non-diagonal mass counterterms even if the renormalized mass is a diagonal one. We are aware of two explicit schemes, where a non-diagonal mass matrix takes care of all the UV divergences without introducing a counterterm for mixing angles: for squarks in a Minimal Supersymmetric Standard Model [9] and for fermions in [14] in general. We propose that the same could be done also for the renormalization angles \(\alpha \) and \(\beta \). For example, Eqs. (4.17) and (4.30) in [10] already imply that this can be done for the angle \(\alpha \) since the UV divergences can be taken care of by the mass counterterm. Analogously, Eq. (31) in our paper can lead to the same conclusion in the context of the THDM. The same approach can in principle be taken for the angle \(\beta \). However, this approach for \(\beta \) may be cumbersome in practice, since the mass structure \({m_i^2-m_j^2}\) no longer appears. This happens because one of the masses always corresponds to a would-be Goldstone boson and is 0.Footnote 4 On the other hand, there seems to be no other natural condition for the angle \(\beta \) counterterm, for example, the authors in [36] simply introduce a condition by hand and require \(\delta \beta \) to be the gauge-independent part of a relevant self-energy, but could have come up with some other renormalization condition.
Another subtle point regarding the angle \(\beta \) is that it can be promoted to a physical parameter if, for example, a \(Z_2\) symmetry is imposed on the THDM. However, the takeaway is that this angle is a physical parameter only with respect to a certain basis [13] and it still should not receive a counterterm. One can see this by writing down a generic THDM potential in terms of basis-invariants as in [13] or in the Higgs basis such that the angle \(\beta \) does not appear. The \(Z_2\) symmetry then can be imposed on this form of the Lagrangian even if it may be realized in more complicated ways than in some preferred basis. Imposing this symmetry does not introduce new parameters, but can only restrict the already available ones. Further, one can renormalize in this basis – this does not introduce a counterterm for the angle \(\beta \) – and afterwards rotate to some preferred basis, where the \(Z_2\) (or some other) symmetry is more evident. Then, one can still measure the angle \(\beta \) but it does not need a counterterm.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Funded by SCOAP3. SCOAP3 supports the goals of the International Year of Basic Sciences for Sustainable Development.
About this article
Cite this article
Draukšas, S. Relations between basis sets of fields in the renormalization procedure. Eur. Phys. J. C 83, 962 (2023). https://doi.org/10.1140/epjc/s10052-023-12121-6
Received:
Accepted:
Published:
DOI: https://doi.org/10.1140/epjc/s10052-023-12121-6