Diphotons from diaxions

We study models in which the 750 GeV diphoton excess is due to a scalar decaying to two pseudo-Nambu-Goldstone bosons that subsequently decay into two pairs of highly boosted photons, misidentified as individual photons. Performing a model independent analysis we find that, with axion mass around 200 MeV, this class of theories can naturally explain the observed signal with a large width, without violating monojet constraints. At the same time the requirement of a prompt axion decay can be satisfied only with a relatively large axion-photon coupling, leading to many new charged particles at the TeV scale. However, the required multiplicities of such fields is still moderately reduced relative to models with direct diphoton production.


Introduction
Recently both the ATLAS and CMS collaborations observed an excess in diphoton events with an invariant mass in the region of 750 GeV [1,2], and the particularly interesting feature that it appears to have a large width Γ ∼ 40 GeV.
The excess has attracted an enormous amount of attention from the theoretical community . A signal at 13 TeV can be compatible with the previous 8 TeV run if the production of a new scalar resonance is assumed to occur through gluon fusion. This mechanism has a ∼ 4.5 difference in parton luminosities for 8 and 13 TeV collisions, allowing both measurements to be reconciled with a tension of 2σ. The simplest way to parameterise such a resonance is where the couplings are normalised so that c g = c F = 1 corresponds to the one loop effects of a heavy analogue of the top quark obtaining mass from symmetry breaking at a scale Λ. Fitting the signal we obtain the well known constraints, on c F and c g in our case, shown in figure 1. New colour charged states are expected to appear close to Λ, and LHC constraints require that these have a mass TeV, so the number of observed events can be fit with O(few) multiplicities of new coloured and charged fields. Furthermore, the current measurement hints that the signal might be better described by a resonance with width ∼ 40 GeV. Even though the significance of the currently observed width is very small, for the purpose of our present work we will assume that this feature is required. In the effective model of eq. (1.1) the partial decay width into photons and the gluons is given by Γ(s → γγ, gg) ∼ 1.6 × 10 −3 c g TeV Λ 2 + 10 −5 c F TeV Λ 2 GeV , (1.2) which means c g (TeV/Λ) 160 or c f (TeV/Λ) 2000 is needed to have Γ(s → γγ, gg) ∼ 40 GeV. However such large values of the couplings are incompatible with current observations: due to either the additional photon production [182][183][184][185] or constraints from dijet searches [183]. Even allowing invisible decay channels, obtaining Γ ∼ 40 GeV is problematic: 8 TeV monojet searches [186] constrain the production cross section to be less than ∼ 0.5pb (this bound is obtained by recasting at the parton level the current experimental limits from [187,188]). As a result we are forced into the region with large values of c F (TeV/Λ) 80 meaning very large multiplicities of the new charged particles at the TeV scale. This problem is ameliorated if one instead considers the following process that could mimic a diphoton signal: gg → s → aa → 4γ, where the field a is a very light scalar so that two photons from its decays are highly collimated and can be misidentified as a single photon. 1 In this construction the interaction between s and a can occur at tree, rather 1 The possibility that a pair of photons from a highly boosted particle could be identified as a single photon has previously been studied in [189][190][191][192][193][194][195], and considered in the context of the 750 GeV signal [196][197][198][199][200].

JHEP05(2016)077
than loop, level so that effectively the model has large values of "c F ". It is natural for the field a to be very light if it is a pseudo Nambu Goldstone boson (PNGB) of a global symmetry, and the PNGB structure does not forbid interactions of the form aFF .
In the rest of this paper we study this topology in detail, under the assumption that a is the PNGB (i.e. the axion) associated with the breaking of a global U(1) Peccei-Quinn (PQ) symmetry. Unlike the simple direct production model this allows a large width without problems from monojet or dijet limits. If the scale of PQ symmetry breaking is low, 400 GeV, the entire signal can be generated with a relatively modest number of additional fields charged under the Standard Model (SM) gauge group, but the theory is very close to strong coupling. At larger f models can remain perturbative up to higher scales ∼ 10 7 GeV, with the width coming from invisible decays. One natural possibility we consider is that the invisible decays are to other PNGBs in a more complex hidden sector.
The structure of the paper is the following: we first present a model independent study of the mechanism in section 2; in section 3 we construct explicit models fitting the diphoton excess; in section 4 we compare the theories obtained to models of direct production; and in section 5 we discuss some additional possibilities and conclude.

Model independent analysis
A beneficial feature of the decay topology of interest is that many of the phenomenological features can be studied in an effective field theory (EFT) approach below the scale of the global symmetry breaking. Assuming that the axion arises from the breaking of a U(1) PQ symmetry by a field φ we can expand it as Then the generic interactions important for the diphoton resonance are In particular, all the relevant interactions can be parameterised in terms of three variables c g , c γ , f . To obtain phenomenologically viable models we also give the axion a small mass m a , which is generated by a (technically natural) small explicit breaking of the PQ symmetry.

Production
The scalar s is produced through gluon fusion and, to a good approximation, the cross section for this is 3)

JHEP05(2016)077
where F are the usual fermion loop functions defined by eq. (A.1) in appendix A. The partial decays width can be straightforwardly obtained (2.4) Hence for s to have a width to axions of ∼ 40 GeV we need f 320 GeV. Alternatively, higher values of f can give a significant width if s has additional decay channels and an invisible width Γ inv . Assuming that the efficiency of photon identification is ∼ 100% [2], and ∼ 85% of axions successfully fake a photon, we obtain the number of events as a function of the PQ scale, both if the total width is raised with invisible decays, and also if there are no invisible decays (in which case the width is small unless f is small) (2.5) Since it is known that, at 2σ, N ev ∈ [4,22] (for the excess reported by ATLAS) this can be translated into a constraint on the parameters of the theory In figure 2 we plot the values of c g and f compatible with the ATLAS observation. From this we see that in the regime where f is small (that is, the width comes from the decay of s to a) we need quite a small coupling to gluons c g ∼ 0.12. In section 3 we show how this may arise from a model with additional heavy vector like quarks.
In the large f case the value of c g needed is fairly large, which can be naturally generated from models with new chiral quarks. Even when the width comes from invisible decays, monojet searches are much less constraining than for the direct production model. In the parameter regions of interest σ pp→invisible 0.1 pb, since typically Γ s→aa 1 GeV.

Axion faking a photon
To reproduce the observed excess the two photons from the decay of each axion must be identified as a single photon. The ATLAS collaboration reports that the photons in the analysis are required to pass the "tight photon" tag which means meeting the selection criteria given in [210]. Although the photon event variables used by ATLAS are sophisticated, we can estimate that this roughly corresponds to requiring both photons hit the same plate of the first layer of ECAL, which has a very fine granularity in the η direction ∆η 0.0031 .
In appendix B we show how this can be translated into a bound on the axion mass, and also show that our simple analysis gives a good approximation to the result obtained from a more complete consideration of the actual cuts and detector properties. At the same time we need the axion to decay before reaching the calorimeter (otherwise it would lead to displaced vertices, or escape the detector entirely). At ATLAS this is located 1.3 m from the interaction point. Therefore we obtain a constraint on c γ , m a , and f , which are related to the decay length of the axion in the lab frame by where the decay length of 0.6 m corresponds to an approximately 80% probability that both axions decay before the ECAL. This gives (2.10) The viable parameter space, calculated following the procedure in appendix B, is presented in figure 3, as a function of c γ and m a for different values of f . Demanding that 60% of axion pairs fake two individual photons constrains the axion mass to a relatively narrow window of 100 ÷ 200 MeV, and c γ can be smallest when f is small.

Other constraints
There are also other experimental constraints on light axion like particles. The couplings to photons in the parameter ranges of interest are close to being probed by beam dump experiments (other limits come from astrophysics and e + + e − → γ + invisible searches, but these are too weak to have any effect) [201]. However, as argued in the previous section, the axion must have a decay length l 1 m so cannot be constrained by the current NuCal [203] and future experiments NA62 [204] and SHiP [205,206] QCD axions with decay constants ∼ TeV are strongly excluded by the decay K + → π + a that looks like the rare SM process K + → π + νν [207]. In our models this does not automatically apply since, by construction, the axion typically decays inside the detector, resembling the much more common process K + → π + π 0 albeit with a different pion mass. However, for simplicity and to be sure limits from this and similar processes are avoided, we mostly consider models where the extra SM charged states are such that a has no anomalous coupling to gluons.

Constructing explicit models
Having discussed the generic features arising from the topology of interest we now consider models that can generate the required EFTs for f in different ranges.

Small f models
If f is fairly small, 500 GeV, production and decay of s requires that the parameters of the effective Lagrangian are roughly in the range The large value of c γ relative to c g immediately points towards the states generating the axion coupling to photons being separate from those responsible for the coupling of s to gluons. Further, stringent collider constraints on new coloured states mean that any coloured states that gain a mass dominantly from the vacuum expectation value (VEV) of φ would require a non-perturbatively large Yukawa coupling ∼ 1.5 TeV/ (250 GeV). Therefore we introduce two pairs of quarks that are vector like under the SM gauge groups and the PQ symmetry, but which couple to φ from cross terms. For example,

JHEP05(2016)077
with charges under the SM gauge group and U(1) P Q of where the electric charge is normalised as These fermions generate a coupling of s to gluons where L (L c ) have charge −1 (0) under the PQ symmetry, leading to a coupling In the case of N copies of L, L c we need To obtain models without a very large number of new states we need extra leptons with charges of at least +2. As they can be produced by the electroweak Drell-Yan processes the LHC has set fairly stringent limits on the masses of such states. In particular, if the new leptons are stable on collider scales and have charge +2 the current bound is roughly 700 GeV [211]. Since we require a multiplicity of leptons N ∼ 4 this translates into M 850 GeV (by considering PDFs and keeping the number of signal events fixed). In the case of Q L = 4, N = 1 and we need M l 800 GeV.
Alternatively the leptons could decay promptly inside the collider. For charge +2 leptons this can occur through a dimension six operator, for example which couples W + to the right handed current (Lγ µ e R ). This leads to a same sign dilepton final state, which is constrained by searches for doubly charged Higgs bosons [212]. Taking

JHEP05(2016)077
into account the leptonic branching fraction of W we see that doubly charged leptons with mass m L 600-700 GeV will be consistent with the current limits. Allowing only couplings to τ s will further reduce the bound. We can estimate it by noting that the τ decays leptonically ∼ 40% of the time, so that the constraint on the lepton mass becomes ∼ 600 GeV. Searches for charge 5/3 top partners also lead to similar limits when translated to our models.
Models with small f are typically close to being strongly coupled. First, the new leptons will make the U(1) Y gauge coupling g Y run faster than in the SM. Also, these leptons need a large Yukawa coupling to get a sufficiently large mass. Finally, the small value of f relative to 750 GeV means that the quartic coupling of s is big. For the model with N pairs of singlet leptons with charge Q L , the one loop renormalisation group equations for these couplings are where M 2 s = λf 2 , and m l = y L f / √ 2. In figure 4 we show the scale of strong coupling (defined as any coupling = 4π, or the quartic running negative) as a function of f from a numerical solution of these equations. The charge of the leptons is fixed to be equal to 2 or 4 and the number of species N is taken to be minimal satisfying the inequality eq. (2.10) for a sufficiently prompt axion decay We see that models with small PQ breaking generically have a very low cut off scale, and, in the case when the partial decay width into two axion is 40 GeV, our model is already strongly coupled at the TeV scale. However, as mentioned in section 4, in this case we can easily explain the observed width with N = 4, Q L = 2.
For f slightly larger, ∼ 500 GeV, perturbativity can be maintained up to 10 4 GeV, while keeping a width Γ s→aa of 15 GeV. Given the current uncertainties on the signal this may be regarded as sufficient. Alternatively, the width can be increased to 40 GeV by including hidden sector fermions, uncharged under the SM, with PQ preserving couplings y ψ φψ h ψ h that give the fermions masses. These give an invisible width 12) where N is the number of extra vector-like fermions. With 4 pairs of vector-like hidden sector fermions a total width of 40 GeV can be obtained for f = 510 GeV. The invisible decays mean the production cross-section must be correspondingly increased, however considering eq. (2.6) and eq. (3.5) it can be seen that this is not problematic.

Large f models
In the opposite regime when f is large, taking for definiteness, f ∼ 1.2 TeV and Γ inv = 40, we require c g ∈ [1.1, 2.6] , c γ 11 , (3.13) so much larger charges/multiplicities are needed to have the correct value of c γ . Since larger values of c g are required for production it is possible for a single set of chiral PQ charged fermions to give both c γ and c g simultaneously. The larger VEV also allows the coloured chiral fermions to be safely outside LHC limits without non-perturbatively large Yukawa couplings. Consider, for example, the model where the quantum numbers of the fields under the SM gauge group and the U(1) P Q are These generate couplings (3.16)

JHEP05(2016)077
Consequently we need fields with the electric charge larger than 4, or we can again introduce chiral leptons to increase c γ . The decay width to axions is too narrow to account for the experimental observations, therefore we need a large invisible width, which requires additional light states coupled strongly to s. Simply introducing extra scalars or fermions to the theory does not automatically help, since fields with large couplings to φ are expected to gain a mass ∼ f and decays of s to these are kinematically forbidden.
One option is to tune a state light (in the next section we discuss an alternative of a model with multiple PNGBs). For example adding a new scalar η with a bare mass and a coupling to φ of λ |φ| 2 η 2 , gives an invisible width Γ(s → ηη) = 1 8π Similarly a fermion can be made light by introducing two fermions N 1,2 with opposite PQ charges. Then the vector-like mass term M 12 iN T 2 σ 2 N 1 + h.c. can be tuned against the mass induced by a coupling to φ of y 1 φ iN T 1 σ 2 N 1 + h.c. + (N 1 ↔ N 2 ). This gives an invisible width of Despite needing more new SM charged matter than if f is small, the large f case can remain perturbative to a higher scale. The quartic coupling of s is small since M s < f , and the Yukawa couplings of the new quarks do not need to be especially large. Instead strong coupling first occurs due to the U(1) hypercharge gauge coupling. Since the change in the beta function is proportional to the sum of the hypercharges of the states Y 2 1 + Y 2 2 , while c γ is proportional to Y 2 1 − Y 2 2 , the best chance to obtain a weakly coupled theory is to have Y 2 = 0. Then the change in the U(1) Y beta function can be related to the anomaly coefficient by ∆β Y = 8c γ . For f ∼ TeV, models can remain weakly coupled until ∼ 10 6 GeV as shown in figure 4.

Invisible width from a larger coset
A large invisible width can occur without tuning in models with many PNGBs. Suppose that the hidden sector contains n pairs of quarks, coupled to n 2 complex scalar fields, φ ij , by a term Then the theory has a U(n) × U(n) = SU(n) × SU(n) × U(1) A × U(1) V global symmetry, with the axial U(1) A anomalous. We assume the potential of the theory is such that (after rotating in field space) the diagonal scalars φ ii all get approximately equal VEVs. This spontaneously breaks the global symmetry to the SU(n) V × U(1) V subgroup, and there are n 2 PNGBs. The scalar sector of the theory can be expanded aŝ

JHEP05(2016)077
HereΠ and a are the PNGB associated with SU(n) and U(1) breaking, s is the corresponding Higgs field along the diagonal direction, which will be the 750 GeV state, andρ are the remaining n 2 − 1 massive scalars. Out of the PNGBs only a will have an anomalous coupling to photons, since Tr T a Y 2 = 0 , (3.21) where T a are the generators of SU(n). Similarly the couplings of the scalarsρ to gluons and photons will also vanish. The n 2 PNGBs couple equally to s, so its total width, and branching ratio to axions, is The couplings of s and a to gluons and photons respectively are given by (analogously to eq. (3.15)) Therefore, using eq. (2.5), we find that the observed number of events can be explained if F ∼ 4 TeV. Fitting the width we find n ∼ 12, and the hypercharge must be Y 1 2.5 if Y 2 = 0. Since the other scalars couple only weakly to the Standard Model, they do not violate any existing observations.
Similarly we can extend the small f models (see section 3.1) to the U(n) × U(n) construction with a Lagrangian The couplings to gluons and photons are given by Requiring the width is Γ = 40 GeV, we obtain a constraint F/n = 320 GeV. Further, for prompt axion decay Q L 4 is needed, independently of F and n, and the appropriate values of c g can be obtained for y y ∼ 1. Reasonable points in parameter space are possible, for example n = 3 and F, M 1,2 ∼ TeV. Although in our conventions F can be large, the running of the quartic scalar coupling is enhanced relative to eq. (3.10) by the number of PNGBs. The exact form of the beta function depends on the scalar potential. For example, if the quartic dominantly comes from a term V ∼ λTr φ † φφ † φ , it has a dependence on n of β λ ∼ (4n + 6) λ 2 . To compare with the n = 1 small f case, we redefine the quartic λ = (4n + 6) λ, so that the running of λ is independent of n at one loop. The mass of s is related to λ by M 2 s = F 2 λ/n, therefore the value of λ is related to the width by We see that increasing n decreases the required value of λ by an order one amount. As a result strong coupling can occur at a higher scale, and for n ∼ 3 the theory could remain weakly coupled up to scales ∼ 10 6 GeV.

Comparison with the direct diphoton production
It is interesting to compare the requirements on UV completions in direct diphoton production and diaxion models. In particular, this allows us to determine if anything has been gained from the extra structure introduced. Let us start by looking at the axion construction in the small f regime. In the f 400 GeV limit the setup provides a nice explanation of the width of the diphoton signal without introduction of new invisible decays. In contrast this seems to be almost impossible in the models parametrized by the Lagrangian eq. (1.1). Unfortunately these nice features are accompanied by still moderately large multiplicities of new particles √ N Q L 4.6, and strong coupling at the TeV scale.
Both the diaxion in the large f regime and the diphoton construction, require invisible decay channels, as well as large multiplicities of SM charged particles, although for different reasons. In the direct diphoton construction they are needed to fit the observed signal as well as avoid bounds from invisible decay searches, while for diaxions they are needed for prompt axion decays. We can compare the needed multiplicities of new fields for the two setups by assuming the interactions with photons are both generated by N species of charge Q chiral fermions. Then the numerical values of the effective couplings are related by where we have set the cutoff scale of eq. (1.1) to be Λ ≡ f . From eq. (2.10), for the axion model to work we need a theory that generates couplings In contrast, for the direct production model to match the signal while satisfying the monojet constraints c F (TeV/f ) 80 is needed. Comparing this estimate with eq. (4.3) we see that the axion allows the multiplicity of new particles to be reduced by roughly a factor of ∼ 2 (m a /0.2 GeV) 2 . The improvement seems to be almost marginal, however note that this value depends strongly on the collimated photon condition m a 0.25 (see eq. (2.8)). If our estimate turns out to be over-constraining and e.g. m a 0.4 GeV can be tolerated, a reduction of ∼ 10 in the multiplicity of new particles is possible. If a large width is not confirmed models with f 700 GeV (see figure 2) can still provide a viable description of the signal, since the experimental resolution is around 6 GeV. However, unlike models with a direct decay to diphotons where the signal can be fitted with moderate couplings to photons (see figure 1), the prompt axion decay condition still requires a large coupling JHEP05(2016)077 to photons (see figure 3), reducing the appeal of the diaxion construction.

Conclusion
In conclusion let us summarise the main results of our paper. We have considered the possibility that the recent ATLAS results are due to the decay of a 750 GeV resonance into two axions which subsequently decay to two pairs of highly collimated photons. The conditions and constraints on such a scenario have been discussed in a model independent EFT framework, and we have shown that the tentative signal width can be explained. However, for the axions to be sufficiently collimated while also decaying before reaching the calorimeter, we need the axion mass to be in a narrow window 150 ÷ 200 MeV. A drawback of our analysis, which is crucial for identifying the viable parameter space, is the lack of a full detector simulation of the conditions for axion decays to fake individual photons. Although we have made conservative estimates, and reproduce the results of more sophisticated simulations well [192], our results should therefore be taken with a pinch of salt.
There are also some model building questions that we have not considered but need to be addressed in a complete theory. For example, the new chiral matter needs a mechanism to decay. This is not a major obstacle, and can occur via a PQ preserving coupling to SM matter. An explicit source of PQ breaking must also be introduced so that the axion gets an appropriate mass. It is also interesting to consider UV completions, especially since strong coupling is often not far from the TeV scale. It may also be possible to connect the model to the SM flavour structure, suppressing the Yukawa couplings of the light SM fields with the PQ symmetry. Although we have not considered it in the present work, models where the axions have some invisible decay channels, e.g. into light hidden sector gauge bosons, might be possible, and could relax the prompt decay requirement (see figure 5).
A phenomenological feature of our model is that unlike many other explanations of the diphoton anomaly it does not predict any excess in the ZZ and WW final states. Therefore if such a signal is observed the topologies we consider will be strongly disfavoured. On the other hand, if no excesses are seen in these channels (and the diphoton anomaly persists) the motivation for the theories we have considered will improve significantly. Similarly if the diphoton signal remains and the width is indeed large, this will also provide reason to further consider these type of models. Also, the presence of two photons instead of one increases the chance of production of an e + e − pair in the detector tracking system, which would be identified as a converted photon. It is an interesting question for future work whether this generic prediction allows collimated photons to be efficiently distinguished from real diphoton signals.

Acknowledgments
We are grateful to Giovanni Villadoro for very useful discussions. The work of A.R. is supported by the ERC Advanced Grant no. 267985 "DaMESyFla".

A Loop functions
For completeness the loop functions for the production of s used in eq. (2.3) are [208,209] F (m f , M s ) = 3

B Mistaking axions for photons
Considering the decay of an axion with velocity v into two photons, the total angle between the two photons in the lab frame θ B is related to that in the axion rest frame θ by cos θ = ± 2v 2 − 1 − cos θ B v 2 (1 − cos θ B ) . (B.1)

JHEP05(2016)077
Then the probability that the angle between two photons will be less than θ < θ M will be given by the integral π/2 θ X sin θ dθ = cos θ X , 3) The distance between the collision point and the calorimeter is of the order of L D = O(1 m) and we need the axion to decay before reaching the calorimeter plates. The plates of the liquid argon calorimeter have a minimum angular separation δη ATLAS = 0.0031 and much larger granularity in the ∆φ direction. The reference [192] studied a probability of faking a photon by an axion for the Higggs decay. They found that the complete ATLAS analysis used for the tight photon selection, which utilizes more sophisticated variables, effectively means that the cut on the δη separation between the photons to be δη γγ < 1 2 δη ATLAS , δη ATLAS = 0.0031 .

(B.4)
Instead of a putting a cut only on the δη direction between the photons we impose the conservative requirement that total angular separation between the photons be less then To validate our approximations we have compared the results obtained using our technique for the SM Higgs decay to the results of [192] who performed a more complete simulation accounting for the varying granularity over the detector. We find our results match theirs fairly accurately, and we are slightly more conservative on the maximum mass possible to resemble a photon.
Now we proceed to the decay of the 750 resonance. Since the boost of the state s is negligible, the axion boost is 375 GeV/m a . The results are presented in figure 6 where we show the probability to mistag the four photons from two axions as two photons as a function of the axion mass. In order for a reasonable proportion to be misidentified the decay length in the lab frame should be roughly less than 1 meter and the axion should be in the region of ∼ O(100) MeV. Open Access. This article is distributed under the terms of the Creative Commons Attribution License (CC-BY 4.0), which permits any use, distribution and reproduction in any medium, provided the original author(s) and source are credited.