The idea that the sector responsible for the breaking of the electroweak symmetry may be composite [1,2,3] is as old as the Standard Model (SM) itself [4]. The first theories were inspired by quantum chromodynamics (QCD), and as such they did not feature a light degree of freedom associated to the Higgs boson [5]. However, it was soon realised that enlarging the symmetry breaking pattern can generate a light Higgs boson as long as the unbroken subgroup is large enough to contain the electroweak symmetry \(SU(2)_L \times U(1)_Y\). In this new version, it is the misalignment of the vacuum of the theory with respect to the weak gauging that breaks the symmetry [6]. The light Higgs boson, therefore, emerges as a pseudo-Nambu-Goldstone boson (pNGB).

The most minimal model is based on the coset \(SO(5)/SO(4)\) [7] with its foundations in the idea of holography [8], i.e. a conjectured duality between warped extra dimensional theories and four-dimensional strongly-interacting conformal field theories. Minimality here refers to the number of light composite scalars, which amount to the Higgs boson and the longitudinal polarisations of the \(W^\pm \) and Z bosons. However, for models based on an underlying gauge-fermion dynamics (like QCD), minimality, now defined in terms of the number of elementary constituents, always requires at least the presence of an additional light pNGB [9, 10]. The new minimal model, therefore, features the symmetry breaking pattern SU(4)/Sp(4) [11, 12] and it has 2 massive pNGBs: the composite Higgs boson h and a pseudo-scalar \(\eta \). Note that the same theory can be studied in the “Technicolor” limit, where the Higgs boson may emerge as a light (non-pNGB) resonance and the two light pNGBs act as Dark Matter candidates [13]. In the composite Higgs limit, the singlet \(\eta \) cannot be a Dark Matter candidate because it couples to the electroweak gauge bosons via the Wess-Zumino-Witten topological term [14, 15]: the absence of a coupling to photons at leading order [12] makes the \(\eta \) very difficult to detect at colliders [9, 16, 17]. The most promising channels involve decays of heavy top partners [18,19,20] or the Higgs boson itself [21]. The presence of the singlet pNGB has also been used to study strong order phase transitions at the electroweak scale [22,23,24] and new mechanisms to generate an asymmetric Dark Matter in extended models [25].

In this letter, we want to focus on anomalous couplings of photons to the composite pNGBs, which may reveal the internal structure of the scalar fields. Couplings to photons have long been considered as typical signatures for compositeness in the electroweak sector. Here we focus on new couplings involving one photon and two massive pNGBs, i.e. one Higgs and one singlet \(\eta \). The new coupling reads [26]:

$$\begin{aligned} \mathcal {L} \supset e \frac{d_{B}}{\Lambda ^{2}} \sin \theta \left( \eta \overleftrightarrow {\partial }_{\mu } h\right) \partial _{\nu } F^{\mu \nu }\,, \end{aligned}$$
(1)

and it is generated by the fact that the components of the neutral pNGBs h and \(\eta \) carry electromagnetic charges. Thus, following Ref. [26], we will refer to it as the charge-radius coupling. The coefficient \(d_B\) can be associated to the masses of the spin-1 resonances once the underlying model is specified [26]. The angle \(\theta \) describes the misalignment of the vacuum, and can be expressed in terms of the electroweak scale \(v_\textrm{SM} = 246\) GeV and the pNGB decay constant f as

$$\begin{aligned} \sin \theta =\frac{v_{\textrm{SM}}}{f} \,. \end{aligned}$$
(2)

This relation is universal, in the sense that it does not depend on the specific coset of the underlying model [27, 28]. Note that the couplings in Eq. (1) can be seen as stemming from a dimension-7 operator in the SM effective field theory, with two Higgs field insertions, hence it is usually not considered in the literature (see, e.g., [29, 30]).

To better understand the physics behind the charge-radius coupling, we will focus on a specific realisation, based on a confining hyper-colour gauge group \(Sp(2 N_c)\) with two Dirac hyper-fermions U and D, transforming as the pseudo-real fundamental representation. As in the traditional Technicolor model, the left-handed components form a doublet of the SM \(SU(2)_L\), while the right-handed ones form a doublet of the global \(SU(2)_R\) (where the hypercharge is identified with the diagonal generator). In this way, custodial symmetry is automatically respected by the model [31]. This set-up was first discussed in Ref. [11], and later reconsidered in the Technicolor [13] and in the composite Higgs [12] limits. As already mentioned, the two are related by the misalignment angle \(\theta \), so that \(\theta = \pi /2\) corresponds to the Technicolor limit and \(\theta \approx 0\) to the composite Higgs one [9]. To connect to the lattice results in Ref. [26], we recall that in the Technicolor limit a global \(U(1)_X\) remains unbroken, under which h and \(\eta \) form a charged scalar \(\phi = (h + i \eta )/\sqrt{2}\). The charge-radius coupling in Eq. (1) can be written as

$$\begin{aligned} \mathcal {L} \supset i e \frac{d_{B}}{\Lambda ^{2}}\left( \phi ^{*}{\partial }_{\mu } \phi \right) \partial _{\nu } F^{\mu \nu } + \text{ h.c. } \,, \end{aligned}$$
(3)

where this coupling matches Eq. (1) in the limit \(\sin \theta = 1\). The coefficient is related to the masses of the spin-1 resonances \(\rho \) as follows [26]:

$$\begin{aligned} \frac{d_{B}}{\Lambda ^{2}}=\frac{m_{\rho _{U}}^{2}-m_{\rho _{D}}^{2}}{2 m_{\rho _{U}}^{2} m_{\rho _{D}}^{2}}\,. \end{aligned}$$
(4)

The mass difference is generated by isospin violating effects in the theory, and is typically small, thus allowing us to approximate the coupling as

$$\begin{aligned} \frac{d_{B}}{\Lambda ^{2}} \approx \frac{\delta m_{\rho }}{m_{\rho }^{3}}\,. \end{aligned}$$
(5)

We remark that a similar coupling will also exist with the Z boson replacing the photon, but with a different coupling strength.

In the following, we will focus on the phenomenology consequences of this coupling. Firstly, we observe that the amplitude generated by the coupling (1) vanishes for on-shell photons (or Z’s), thus it will not generate any two-body decays of h or \(\eta \). Three-body decays via an off-shell photon can be generated, such as \(h\rightarrow \eta \gamma ^*\rightarrow \eta e^+ e^-\) for \(m_\eta < m_h\), however leading to very small branching ratios. Hence, the major effect will be in a novel production mechanism for the pNGBs, i.e. production of \(\eta \) in association with a Higgs boson:

$$\begin{aligned} e^+ e^-,\; pp \rightarrow h\ \eta \,, \end{aligned}$$
(6)

via an off-shell photon or Z boson. As in this letter we aim at testing the feasibility of this search, we will only include the photon coupling, knowing that the Z can give additional contributions of the same order. A physically relevant case when the Z contribution is negligible occurs in models where, in the underlying model, the operator in Eq. (3) is generated for the hypercharge gauge boson. The mass of the pseudo-scalar \(\eta \) can acquire very different values, depending on the details of the underlying models. In principle, the \(\eta \) can be much lighter then the compositeness or the electroweak scales as its mass comes from explicit breaking of the \(U(1)_X\) symmetry mentioned above. For masses \(m_\eta < m_Z\), the dominant constraint on this new state come from Z decays via \(Z\rightarrow \eta \gamma \), which will be tested with high accuracy at the Tera-Z run of future \(e^+ e^-\) colliders [32]. Henceforth, we will consider \(m_\eta > m_Z\) in the following: for \(m_Z< m_\eta < 2 m_W\), the dominant decay is \(\eta \rightarrow Z \gamma \), with branching ratio close to 100%; while for \(m_\eta > 2 m_W\) the dominant decays will be \(\eta \rightarrow W^+ W^-,\ ZZ\) [16]. In this exploratory work we focus on the mass range \(m_Z< m_\eta < 2 m_W\) because it has a unique final state with 100% branching ratio, \(\eta \rightarrow Z \gamma \), and because the presence of a monochromatic photon in the final state makes it much easier to extract from the background, as we will see later.

1 Production cross-sections

We first present, in Fig. 1, the cross-sections for the process of concern at various colliders. First, the solid lines show the total cross-sections at hadron colliders, starting with the LHC run at 14 TeV in blue. The values correspond to the benchmark coupling

$$\begin{aligned} \frac{d_B}{\Lambda ^2} \sin \theta = 0.011~\text{ TeV}^{-2}\,, \end{aligned}$$
(7)

corresponding, e.g., to \(d_B \sin \theta = 0.1\) and \(\Lambda = 3\) TeV. We recall that the cross-sections scale with the coupling squared and \(\sin ^2 \theta \). Note that the cross-sections are not very sensitive to the mass of the singlet \(\eta \), thanks to the growth in energy of the amplitude, while increasing the centre-of-mass energy of the collider gives substantial gains: at 27 TeV, a factor of \(\sim 6\) is gained, while it grows to roughly \(\sim 100\) at 100 TeV. In dashed lines, we also show the expected cross-sections at a future \(e^+ e^-\) collider.

Fig. 1
figure 1

Cross sections for the production of \(h\eta \) via a virtual photon at various colliders. In solid colours, we show the LHC at 14 TeV, and other pp future options. In dashed lines, future plans for \(e^+ e^-\) colliders (ILC)

All in all, the cross-sections are extremely feeble, thus high integrated luminosities are necessary in order to constrain this signal. Moreover, as we will see, the backgrounds can be easily suppressed. In the rest of this letter we will focus on estimating the potential reach of the High-Luminosity run of the LHC (HL-LHC), which is expected to collect 3 ab\(^{-1}\) of integrated luminosity, and use these results to project the potential reach of high centre-of-mass energy .

Fig. 2
figure 2

Sample of kinematic distributions used in the cut flow. The top row shows characteristics of the Higgs-like fat-jet, the bottom row properties of the photon

2 Signal and background simulation

We want to focus on the following process:

$$\begin{aligned} pp \rightarrow \gamma ^*\rightarrow h\eta \rightarrow h_{bb} Z_{ll} \gamma \,, \end{aligned}$$
(8)

at a centre-of-mass energy of 14 TeV, where we consider the Higgs boson decaying into b-jets and the Z into leptons (electrons and muons). As an illustrative benchmark, we have chosen \(m_{\eta }=150\) GeV in the simulations. A similar analysis can be performed in the whole mass range \([m_Z, 2 m_W]\), with the remark that smaller values suffer from larger background due to the softer photon in the final state. Henceforth, the benchmark mass was chosen to increase the sensitivity of our simple phenomenological analysis. The model files for the \(\eta \) couplings have been generated using FEYNRULES [33]. The parton-level amplitudes for the signal and backgrounds were generated using MADGRAPH [34] for proton collisions at a centre-of-mass energy of 14 TeV, using the parton distribution function NNLO1 [35]. Showering and hadronisation were done in PYTHIA 8 [36]. All cross sections and distributions we considered in this exploratory study are at leading-order. For the signal, we consider the benchmark coupling in (7).

The SM provides an irreducible background \(pp \rightarrow h Z \gamma \), which we will refer to as “hza” in the following. Additional backgrounds depend on the choice for the optimal decay channel for the Higgs and the Z bosons. To maximise the signal yield, we choose the dominant Higgs boson decay, \(h \rightarrow b\bar{b}\). For the Z, the cleanest channel is into two charged leptons (electrons and muons). While the branching ratio is small, this channel allows to reduce the pure QCD background and fully reconstruct the invariant mass of the \(\eta \), which will be a crucial discriminant for the signal. Hence, we will also include the irreducible background \(pp \rightarrow l^+ l^- b \bar{b} \gamma \) (henceforth referred to as “bblla”). To include the possibility of mistag of a light jet as a b-jet, we also simulate the reducible backgrounds \(pp \rightarrow l^+ l^- c \bar{c} \gamma \) (henceforth referred to as “cclla”) and \( l^+ l^- j j \gamma \) (henceforth referred to as “jjlla”), where j represents any light quark. Another potentially relevant background is \(t\bar{t} + \gamma \), with leptonic decays of the top quarks. However, it becomes negligible when we consider a boosted topology for the Higgs boson which decays to a pair of b quarks which are more collimated, as compared to the b’s from the tops which are almost back-to-back. In addition, we require two leptons from the top pair to reconstruct Z mass within a narrow window, and the resultant Z to be closer to the photon. Hence, we will not include this background in the simulation.

2.1 Event generation

All events are generated with the minimal transverse momenta, \(p_T\), of 40 GeV for jets and 30 GeV for leptons. Moreover, we define a signal region by selecting the following final state phase space: the invariant mass of the jet pair ranges from \(30< m_{b\bar{b}}< 200\) GeV; for the charged lepton pair \(30< m_{ll} <150\) GeV; the maximum radial distance between two jets is 2; the photon has a minimum transverse momentum of \(p_T^\gamma >80\) GeV. The resulting cross-sections are reported in the first row of Table 1, where the fiducial cross-sections in the signal region includes the branching ratios for the Higgs and Z bosons. For the backgrounds, we generated \(10^6\) events (\(10^5\) for jjlla), while for the signal \(10^4\) events.

2.2 Higgs boson reconstruction

The final state particles, excluding photons and leptons, are clustered into fat-jets, i.e. a jet with large jet radius parameter, using FASTJET [37, 38] by employing the Cambridge-Aachen algorithm [39, 40] for clustering, where we set the jet radius parameter \(R = 1.2\). We ensure that each event has two opposite sign leptons and a photon along with, at least, one fat-jet. The events which have a fat-jet with \(p_T>50\) GeV are retained at the pre-selection stage.

We first define the criteria that identify the fat-jet as a Higgs boson candidate. We firstly ensure that it has two sub-jets, the radial distance between the sub-jets is less than 0.5 (see Fig. 2a) and both sub-jets have a transverse momentum of 50 GeV at least. The mass of the fat-jet is further required to be in a wide range around the Higgs mass, defined as \(100 \le M_\mathrm{fat-jet} \le 170\) GeV. To better discriminate the signal from the background, we also require that the transverse momentum is greater than 380 GeV, as illustrated in Fig. 2b. Note that the requirement on the angle between the two sub-jets is correlated to the high \(p_T\) of the signal Higgs boson candidate with respect to the expected backgrounds.

To control the dominant QCD background with light jets, we also impose that both sub-jets are tagged as b-jets. To simplify the analysis, we impose a flat \(p_T\)-independent probability on all the events. This probability is imposed at the end of the cut flow, described below, to retain the maximum number of events in the analysis. The working point we chose corresponds to the following probabilities: \(70\%\) efficiency in tagging bottom quarks, with \(20\%\) probability of mistagging a c-quark and \(1\%\) for light jets [41].

Table 1 Cut-flow for signal and backgrounds for the benchmark \(\eta \) mass. We report the number of events in our simulation surviving after each requirement or cut. In the bottom row, we show the expected number of events at the HL-LHC with an integrated luminosity of 3 ab\(^{-1}\). For the cuts, the units are in GeV

2.3 Cut-flow analysis

For our signal, the leptons and the photon originate from the decay of the \(\eta \). As the Higgs boson has a substantial transverse momentum, the \(\eta \) is also expected to be boosted, thus we could expect the angle between the photon and the Z to be small. As shown in Fig. 2c, imposing a maximum cut on the radial distance between photon and Z allows to reduce the backgrounds, where the photon is dominantly radiated by the initial state partons. Hence, we impose a cut \(p_T^\gamma > 85\) GeV and \(\Delta R_{\gamma Z} < 1.2\). The cut on the photon transverse momentum allows to remove most of the background where the photon comes from soft radiation, at the price of reducing significantly the signal. The Z is reconstructed by selecting a mass window for the lepton invariant mass, which we chose to be \(86 \le m_{ll} \le 96\) GeV. Finally, we reconstruct the \(\eta \) mass by the invariant mass of the photon and Z, and impose a cut in a narrow mass window around the mass of the pseudo-scalar (see Fig. 2d): \(145 \le m_{Z\gamma } \le 156\) GeV. The cut-flow is summarised in Table 1, with the number of events surviving each subsequent cut. As already mentioned, the b-tagging efficiencies are imposed a flat rates, which we apply at the end in order to keep the full set of generated events. In the bottom row, we show the expected number of events at the HL-LHC, for an integrated luminosity of 3 ab\(^{-1}\) and for the benchmark value of the coupling.

2.4 Results

The results of our proposed cut-flow demonstrate that the background can be significantly reduced compared to the signal. However, for the benchmark value of the coupling, no signal events survive at the HL-LHC due to the small production cross-section. Hence, we determine the sensitivity of the search by combining signal (S) and background (B) events as follows:

$$\begin{aligned} Z (\mathcal {L}, \Lambda ) = \frac{S}{\sqrt{S+B}}\,, \end{aligned}$$
(9)

which depends on the coupling (in terms of \(\Lambda \) for \(d_B = 0.1\)) and the integrated luminosity \(\mathcal {L}\). We then use this formula to estimate the \(3\sigma \) and \(5\sigma \) reach in \(\Lambda \) as a function of the luminosity, shown by the solid and dashed lines respectively in Fig. 3. For the HL-LHC, only scales up to the TeV can be probed.

Fig. 3
figure 3

Sensitivity to the scale \(\Lambda \) (for \(d_B \sin \theta = 0.1\)) as a function of the integrated luminosity at various hadron collider energies. The solid lines indicate the \(3\sigma \) reach, while the dashed ones correspond to \(5\sigma \)

However, as we have seen in Fig. 1, the signal is highly enhanced for increasing energy of the proton collisions. We estimate the reach of the HE-LHC option at 27 TeV and of a future hadron collider at 100 TeV by rescaling signal and backgrounds by the ratio of the fiducial cross-sections, defined in the same fiducial region we used for the LHC. For the HE-LHC, the backgrounds are enhanced by a factor between 2.3 and 3, while the signal increases by a factor of 4.7, thus allowing to probe scales \(\Lambda \) up to 1.5 TeV. For the 100 TeV collider, the enhancements amount to factors in the range \(10\div 20\) for the background and 80 for the signal. This allows to probe \(\Lambda \) up to 3 TeV, thus including our benchmark coupling. These projections are shown in purple and green, respectively, in Fig. 3. We remark that these estimates are conservative as they do not take into account boost factors, which could improve the reconstruction, for instance, of the Higgs boson and hence ameliorate the signal over background discrimination.

3 Concluding remarks

We have presented a preliminary study of the collider relevance of the charge-radius coupling of a composite Goldstone Higgs boson, involving a photon and a singlet pseudo-scalar. The peculiarity of this coupling is that it entails the electromagnetic and isospin properties of the constituents of the composite Higgs boson, thus offering a direct probe of its nature. In particular, we focused on a mass range for the new pseudo-scalar where it decays dominantly to a photon and a Z boson.

This coupling manifests itself in the pair production of the two spin-0 states, the Higgs and the pseudo-scalar bosons, via an off-shell photon. The rates, however, are very small, thus requiring high luminosities. We employed a simple cut-and-count strategy to reduce the reducible and irreducible backgrounds, proving that they can be suppressed enormously with respect to the signal, making this search a background-free one, effectively. However, the number of surviving signal events is very small. We show that the LHC in its high-luminosity phase can only access scales up to one TeV in the couplings, while a future 100 TeV collider can access up to 3 TeV. This is due to the enhancement with energy of the signal, compared to the backgrounds.

This preliminary analysis demonstrates the feasibility of searching for this kind of signal from a deep probe of the composite nature of the Higgs boson. The sensitivity could be improved by designing a more effective procedure that allows more signal events to survive the selection. We believe that employing machine learning or deep learning techniques can help to achieve the desired improvement, as the signal has several kinematical features that help to distinguish it from the backgrounds. This will be the main objective of a future investigation.