1 Introduction

High energy particles of cosmic origin interact with our atmosphere and create cascades of secondary particles known as extensive air showers (EASs). These EASs can be observed by recording the fluorescence, Cherenkov, and radio emission produced while propagating through the atmosphere and/or detecting the secondary particles that reach ground level. In order to predict and understand the EAS observations the interactions within the atmosphere must be compared to Monte Carlo simulations commonly performed with software packages like CORSIKA [1]. This simulation package bundles electromagnetic and several selectable hadronic interaction packages to calculate the development of the particle cascades. The hadronic interaction event generators are separated in low and high energy models and in the simulation package, one or the other is selected depending on the energy of the interaction being considered. Per default, the energy at which the switch between high and low energy models occurs is 80 GeV.

In a recent study [2], it was found that for cosmic-ray protons with an energy just above the typical switching energy, the properties of the simulated EASs have a strong dependency on the selection of the hadronic interaction model. With increasing energy of the incident proton (up to 100 TeV), the average differences in air shower properties between the models seemed to reduce. In this follow-up study, we focus on the switching energy domain, where the differences between the models are most prominently exposed (up to \(60\%\) difference in the ground level observables). We compare in detail the nature of the early shower development and its relation to ground-level observables for different hadronic interaction models.

The differences in ground-level observables at 100 GeV are driven by the early shower development and can therefore expose intrinsic differences between the hadronic interaction models. Therefore, low energy showers serve as a laboratory for comparison of differences in the first interactions from the ground level outcome. Another domain where the first interaction will play an important role is in the estimation of background rates for gamma ray astronomy. A fraction of hadronic first interactions will generate an energetic \(\pi ^0\), which will subsequently generate an electromagnetic cascade, mimicking a gamma-ray induced air shower. The rates of these events generated by different hadronic interaction models has a significant impact on sensitivity studies for gamma-ray observatories [3]. Ideally, the different models should be directly compared to measurements made at dedicated experiments at particle accelerators (for example [4]).

2 Simulation methodology

All simulations for this study were performed with the Monte Carlo air shower event generator CORSIKA v7.64. The hadronic models tested were EPOS-LHC [5], QGSJetII-04 [6], SIBYLL 2.3c [7, 8] and UrQMD [9].

The transition energy is not an a priori defined parameter and can flexibly be set up to a few hundred GeV. Nevertheless, it was set to 80 GeV for all high energy models (default value in CORSIKA). UrQMD in contrast is a low energy model and rules the hadronic interactions in air showers below the transition energy.

The main goal of this research is to understand the hadronic interaction models’ deviations in the sub-TeV regime, where they have been seen to significantly disagree [2]. We chose an initial proton energy of 100 GeV (including the rest mass) to perform all simulations. Since the energy range studied is within the validity region of the transition energy, agreement was expected between using only the low energy model and using the combination of high- and low-energy models. UrQMD’s full shower simulations were carried out by setting the transition energy to values larger than the initial proton energy.

In order to remove the effect of model intrinsic cross-section differences (given in Appendix A) and track the particles produced in the first interaction, all simulations were performed with a fixed first interaction altitude. We set the altitude of the first interaction to 17.55 km which corresponds to the average cross-section of the interaction models. The atmospheric model chosen was GDAS/May, corresponding to an atmospheric depth of 85 g cm\(^{-2}\) at the initial altitude. Moreover, the collision setup consisted of a fixed target nitrogen nucleus and a zero degree zenith angle incoming proton. The nucleon-nucleon centre-of-mass energy is \(\sqrt{s_{NN}} \approx 14\) GeV.

The most important feature in the simulations was the possibility of observing event-by-event the final state immediately after the first interaction and at ground level. It allowed us to compare ground-level observables from different models for fixed classes of first interactions. Two observational points were defined; the first, 1 cm below the initial interaction point to register the initial particle production; and the second at the ground level of the HAWC gamma-ray observatory at an altitude of 4100 m [10]. We used the lateral distribution functions (LDF) of the muon and electromagnetic (EM) components as the main observables at ground level.

3 First interaction contributions to the lateral distribution functions

3.1 First interaction classification

To relate the initial collisions with ground level observables, the different first interaction scenarios will be characterised. As a huge diversity of particles can be produced in the hadronic interaction, we group particles into four different “families” (see Table 1). The classification is motivated by the family’s impact on the shower development and the preservation of a causal correlation between first interaction products and ground level observables, so for example the muonic family contains muons and particles which typically decay to produce muons. The other hadrons family was defined to contain particles which have decay channels that simultaneously contribute to multiple components. It is important to separate these hadrons in order to avoid mistaken links between the post-first interaction particle spectrum and ground level observable particle type.

Table 1 Definition of particle families

A common parameter used to characterise the energy distributions in an interaction is the inelasticity,

$$\begin{aligned} \kappa =1-\frac{E_{\text {LP}}}{E_{\text {FI}}} \approx 1 - x^{\text {LP}}_F, \end{aligned}$$
(1)

where \(E_{\text {FI}}\) is the total shower energy (sum of all particle energies) after the first interaction and \(E_{\text {LP}}\) the leading particle energy. The inelasticity \(\kappa \) is closely related to Feynman’s scaling variable \(x^{\text {LP}}_F\) and is a measure for how much energy is available for the production of secondary particles. The leading particle jointly with the variable \(\kappa \) allows one to identify similar types of interactions within the different models.

Note that the total shower energy after the first interaction is used in the definition of the inelasticity to ensure that \(\kappa \) is in the range [0, 1]. Normalisation to the energy of the initial system (proton plus target) would violate this constraint, since in all four models some violation of energy-momentum conservation was observed. This was additionally checked at generator level using the CRMC software package [11] for EPOS-LHC, QGSJetII-04 and SIBYLL 2.3c. Small differences in the amount of energy violation between the two software packages are present, caused by CORSIKA’s energy cutoff in tracked particles. QGSJetII-04, SIBYLL 2.3c and UrQMD show a similar behaviour after the first interaction, violating approximately \(\pm 5\) GeV from the initial proton energy. In EPOS-LHC’s case, events violating up to \(\pm 15\) GeV were registered. Currently, these anomalies are under investigation in collaboration with the models’ authors.

To further constrain the initial interaction scenarios we define three inelasticity bands: elastic and diffractive (\(\kappa _1\approx [0,0.2]\)), intermediate-inelastic (\(\kappa _2 \approx [0.2,0.4]\)) and highly inelastic collisions (\(\kappa _3 \approx [0.4,1]\)).

  • The \(\kappa _1\) regime encompasses events with leading particles carrying energies above the threshold defined by the transition from high to low energy hadronic model (\(E^{\text {LP}}_{\kappa _{1}} =[80 - E_{p,\text {FI}}]\) GeV). These showers will undergo at least one additional interaction ruled by the high energy hadronic model, potentially leading to a larger impact in the ground level outcome. Furthermore, as most energy is concentrated in a single particle, the branching of the shower is very weak. We will refer as highly diffractive events to those in which \(\kappa _{\text {Hel}}<0.05\subset \kappa _1\). This sub-regime represents events in which less than \(5\%\) of the available energy is available for particle production in the first interaction. Besides, these showers are expected to replicate a scaled version of the first interaction spectrum in the upcoming interaction.

  • \(\kappa _2\) showers have leading particles in the energy range \(E^{\text {LP}}_{\kappa _{2}} =[60{-}80]\) GeV. The next interaction these showers undergo is governed by the low energy model, therefore the ground level observables will reflect the difference in the spectrum of particles that accompany the leading particle and their normalisation.

  • Finally, in the \(\kappa _3\) region leading particles have energies in the range \(E^{\text {LP}}_{\kappa _{3}} =[0{-}60]\) GeV. These events correspond to highly inelastic first interactions in which the model’s initial particle production spectrum plays a determinant role.

Fig. 1
figure 1

EPOS-LHC’s inelasticity distributions for each leading particle family. The two shaded regions indicate the systematic uncertainties in the definition of \(\kappa \) due to event-by-event fluctuations in the amount of energy violation

In Fig. 1, EPOS-LHC’s leading particle inelasticities are shown for each family. In most cases the leading particle is a nucleon, only at very large inelasticities particles from the muonic family dominate. Showers with leading EM particles or “other hadrons” contribute less than \(7.5\%\) in all models.

Fig. 2
figure 2

Breakdown of the muon (left panels) and EM component (right panels) LDFs into the contributions from the three inelasticity regimes in nucleon led events. To have leading particles with similar properties, here energy ranges are taken as a proxy for the respective \(\kappa \)-regions, the boundaries of which are smeared due to event-by-event fluctuations in the energy violation of the models

3.2 Lateral distribution functions

To investigate the source of the disagreements between the models we study the ground level outcome arising from events with similar first interactions, focusing on the two leading contributions as explained in Appendix B. In Fig. 2 the four models’ muon and EM LDFs in the three classes are presented for events with a leading nucleon. As each \(\kappa \)-regime encloses broadly the same varieties of first interactions, we expect to obtain similar LDFs in each contribution.

The largest differences between the muon LDFs appear in the largest contributors, the \(\kappa _1\) and \(\kappa _3\) regimes. Initial collisions with large \(\kappa \) produce abundant particle content (mostly muonic family particles) along with the leading particle, which later decay and contribute to the number of muons at ground level. In events where the initial interaction is diffractive, higher inelasticity interactions occur deeper in the atmosphere, where the production of secondary particles from the muonic family will more likely affect the outcome at ground level. For the three high energy models, the largest ground level muon source originates from \(\kappa _3\) events (highly inelastic interactions) while for UrQMD it is from \(\kappa _1\) (diffractive interactions).

In the \(\kappa _3\) regime, the LDFs provide interesting information on the particle spectrum in the first interaction. Considering the integral over the LDF, SIBYLL 2.3c produces most muons (1.85 per this type of shower) followed by EPOS-LHC (1.79), QGSJetII-04 (1.70) and lastly UrQMD (1.45). The relative difference between SIBYLL 2.3c’s and EPOS-LHC’s average muon number is clear from the LDFs shape. As also seen in [2], QGSJetII-04 concentrates its smaller average number of muons at low core distances due to a lower muon transverse momentum (to be discussed in the next section).

In the LDF of the EM component, the largest contribution to the energy flow is present in the \(\kappa _1\) nucleon led events. As in the muon LDF, a strong contribution from UrQMD’s diffractive peak can be seen. A large contribution from \(\kappa _2\) showers is not expected as the number of events in this regime is significantly lower \(\left( \left. \frac{\text {N}_{\kappa _{2}}}{\text {N}_{\kappa _{3}}} \right| _{\text {EPOS-LHC}}\approx 0.29\right) \), but also in highly inelastic events the ground level contribution to the EM component is small. This leads to the conclusion that the ground level EM component is driven by diffractive interactions. Physically this effect is understood, as at the energy of this study an EM component produced high in the atmosphere is very likely to die out before reaching ground level, while lower \(\kappa \) interactions produce showers that develop deeper in the atmosphere and hence produce EM showers more likely to reach the ground.

Fig. 3
figure 3

Muon LDF contribution from events lead by muonic family particles

Separately, in Fig. 3 the muon LDF from events with a leading particle from the muonic family is presented integrated over all \(\kappa \) ranges. In such events, a large fraction of the shower energy is assigned to multiple particles from the muonic family in the first interactions, therefore the large differences are expected to be caused by dissimilarities in the production of this type of secondary particles. For QGSJetII-04 the muon production at short distances (r \(\lesssim \) 200 m) is approximately three times larger than for the other models. Similarly to the case of the \(\kappa _3\) nucleon led muon LDF, the integral over the LDF does not indicate a sizeable difference in the absolute number of muons reaching ground level. QGSJetII-04’s absolute number of muons from this type of scenario is only \(4\%\) larger than UrQMD’s, and \(20\%\) more than SIBYLL 2.3c’s and EPOS-LHC’s. The observed excess at distances relevant for experimental purposes is therefore not expected to come from a larger number of events but from a different character of the muonic family particles.

Concluding this section, we have seen that a breakdown of the ground level particle distributions into contributions from different types of initial interactions reveals striking differences between the models considered in this study. As the inelasticity ranges represent roughly similar physical events, a better agreement was expected.

4 First interaction analysis and sources of disagreement unravelled

4.1 First interaction rates

In order to find the cause of the observed ground level differences we continue by studying the phenomenology of the first interaction. To understand the source of dissimilarities in the different inelasticity bands we begin by analysing the various leading particle inelasticity distributions for the two most contributing families, shown on Fig. 4.

Fig. 4
figure 4

Inelasticity distributions for the four studied models, for events initiated by the two dominant families; nucleons and muonic particles

Within the diffractive regime, UrQMD’s number of events falling into the diffractive peak (\(\kappa <0.05\)) composes over \(21\%\) of the total number of showers (\(67\%\) of the \(\kappa _1\) events and over five times EPOS-LHC’s diffractive number). This abundance is responsible for the observed excess in both \(\kappa _1\) LDFs (Fig. 2). As the shower development is spatially delayed, lower energy muonic family particles produced in secondary interactions will be more likely to contribute to the ground level products. Additionally, the EM component is directly enhanced as the first interaction is only weakly inelastic and therefore, the EM shower maximum will be reached deeper in the atmosphere. Compared to the other models, UrQMD compensates for the high probability of low inelasticity events (Fig. 4) with a larger cross-section (Table 2 in Appendix A), i.e. more interactions take place in UrQMD showers but they produce fewer particles on average.

Approximately \(5\%\) of the total number of EPOS-LHC’s and SIBYLL 2.3c’s nucleon led events fall in the highly diffractive peak (\(\kappa < 0.05\)). However, it is necessary to recall the consequences of energy violation in the first interaction. In EPOS-LHC’s case, the number of highly diffractive events significantly decreases if the first interaction is not considered to violate energy conservation. It implies that highly diffractive events lose considerable amounts of the shower energy and therefore, the leading particle energy is lower.

For QGSJetII-04 the \(\kappa _1\) regime shows a difference in behaviour when compared to the other models. Its diffractive peak is localised and composed of only a small fraction of the total number of events. While the rest of the models show a minimum around \(\kappa \sim 0.1\), QGSJetII-04 maintains a constant level as soon as energy for particle production is available. This means that QGSJetII-04 has a higher rate of events with particle production than EPOS-LHC and SIBYLL 2.3c in the \(\kappa _1\)-regime. Such behaviour accounts for the absolute number of muons and EM component energy domination shown on Fig. 2 (top panels). Furthermore, the dissimilar fractions of events at \(\kappa \sim 0.1{-}0.2\) suggest an important source of disagreement for the event rate and particle production in primaries below the TeV scale. At higher energies a better agreement and smoother transition in this regime is reached, significantly reducing the relative differences in the ground level \(\kappa _1\) contributions.

In the \(\kappa _2\) regime, the distribution shapes broadly agree between models and only differ by a normalisation factor. Correspondingly, it is also shown in the middle panels of Fig. 2, where the larger number of events results in a larger number of ground level particles rather than in a change in the shape of the LDF. These events exhibit the best average ground level agreement as the low energy model governs most of the shower behaviour. The accompanying particle spectra are determined by the available energy, the leading particles of the different models place the shower development all under the same low energy model description. With increasing inelasticity more of the particle production happens in the first interaction within the high energy model, thereby again reducing the importance of the low-energy model for the description of the ground-level observables.

The \(\kappa _3\) nucleon led distributions present large differences in the normalisation of event rates. For example, SIBYLL 2.3c strongly enhances large \(\kappa \) collisions led by nucleons over muonic family particles, presenting no inelasticity regimes where the most likely leading particle is a muonic particle. The large variations seen in the \(\kappa _3\) distributions show how the models’ relative rates correlate with the observed differences in the muon number density in the \(\kappa _3\) LDFs. It follows that even in inelastic events, the ground level differences arise from incompatibilities in same first interaction scenarios.

To close up this section, we have seen that the number of events falling in each inelasticity regime correlates with the hierarchy of the interaction products at ground level observed in the LDFs of Sect. 3.2. This correspondence shows that a large part of the ground level differences originates from the models’ dissimilar event rates. In other words, the normalisation differences emerge from a dissimilar nucleon-nucleon cross-section. Although certain compensation between scenarios is reasonable (e.g. \(\kappa _3\) nucleon led and muonic family particle led), the strong disagreement around \(\kappa \sim 0.1\) reflects intrinsically large model incompatibilities.

4.2 Accompanying particle production

In addition to the differences in the inelasticity distributions and consequently in the event rates, other characteristics have been found to directly affect the ground level muon LDF. Figure 5 shows the average ground level muon number per event as a function of the first interaction inelasticity for all types and nucleon led events. Regardless of the inelasticity distributions, some of the models generate a noticeably different number of muons at ground level per event. For example, QGSJetII-04’s events produce the largest average number of ground level muons for all events, but also on the selection of nucleon led events. The greater average number of ground level muons combined with the enhancement of the rate of events results in an overall larger muon number. Comparing both curves, it can be observed that although EPOS-LHC’s and SIBYLL 2.3c’s muon production in nucleon led events show slight differences, they vanish when all events are considered. This illustrates how these two models have a similar ground level impact from a compensation between both leading families contributions. Furthermore, the larger number of \(\kappa _1\) muons exhibits two interesting points; a stronger influence of the high energy model results in a larger number of muons, and secondly, a disagreement between the high and low energy models.

Fig. 5
figure 5

Average number of ground level muons per event as a function of the first interaction inelasticity for nucleon led and all types of events

Due to the transition of the high energy model to the low energy model, there is a reduction of muon number at \(\kappa \sim 0.2\), above which all secondaries from the first interaction are handled by the low-energy model. The drop is most pronounced for QGSJetII-04. This illuminates the general difference between high and low energy models and the impact of the secondary interactions. As we are plotting a ground level quantity in terms of an first interaction parameter, the distributions shown are sensitive to the late shower development. At these inelasticities, the influence of the low energy model on the shower is at its largest as the transition energy was set to 80 GeV and the leading particle’s next interaction will be determined by the UrQMD model. QGSJetII-04’s sudden change in slope shows how events in which the leading particle is ruled by the low energy model produce a lower number of muons at ground level (as UrQMD overall does). Such convergence towards UrQMD’s nature shows how showers are built from dissimilar model approaches and how sensitive the ground level number of muons is to the model switch.

In order to explain the general muon excess from QGSJetII-04 (compared to the other models), it is not enough to only look at the leading particle in the first interaction, but also the accompanying particles need to be studied. The early production of particles from the muonic family will directly impact the ground level muons. The minimum energy for 50% of the muons to travel approximately 17 km without decaying is \(\sim 5\) GeV. We will use this energy as a threshold to select the first interaction muonic family particles that contribute directly to the ground level muon number.

Fig. 6
figure 6

Muonic family particle production ratios with respect to EPOS-LHC considering all particle energies (top panel) and above an energy threshold of 5 GeV (bottom panel)

In Fig. 6, the average numbers of muonic family particles produced in nucleon led events are studied. On the top panel, the ratio between the average number of all accompanying muonic particles is presented and in the lower panel, the cut of 5 GeV is applied. Considering all muonic family particles produced, QGSJetII-04’s average larger ground level muon number is not evident as it does not show an increased abundance over the other models. However, when considering high energy muons, QGSJetII-04 exhibits an average effective muonic particle surplus over the other models, accounting for the average shower muon excess at ground level.

The investigation in [2] of the muonic family particle production and ground level muon number at higher primary energies shows that a better agreement between the models is not found by increasing the primary energy. This means that the discontinuity in the description of hadronic physics when switching from the high-energy to the low-energy model is not reduced. A consistent description of particle production would require a well defined transition energy where the physics implemented in the models agrees.

4.3 Transverse momenta of first interaction muonic family particles and distribution of ground level muons

As discussed in Sect. 3, the number of muons at small distances (r \(<200\) m) from the shower core produced in events with large inelasticity (\(\kappa _3\)) and with a leading nucleon or particle from the muonic family does not reflect the total number of produced muons. The difference seen can instead be understood from the larger number of events falling in this regime and the average muonic family particle production. An explanation for the muon concentration (QGSJetII-04’s case) and a spread (UrQMD’s case) is important for ground-level observations. A parameter directly related to this spread is the transverse momentum of the ground level particles; lower \(p_{\text {t}}\) muons concentrate at short distances while large \(p_{\text {t}}\) are spread away from the shower axis. In Fig. 7 the mean transverses momentum from energetic muonic family particles in the first interaction and ground level muons is shown. From the top panel it can be concluded that as inelasticity increases and more energy is being assigned to muonic particles, the transverse momenta significantly diverge between models. This disagreement in the first interaction is directly correlated to the muon \(p_{\text {t}}\) at ground level, resulting in the discussed differences from Figs. 2 (bottom left panel) and 3. The large differences between models in the transverse momentum spectra of muonic family particles (mostly produced in large \(\kappa \) events) and their extrapolation to ground level muons was already pointed out in [2], where the higher energies muonic particles were shown to diverge between models.

Fig. 7
figure 7

Ratios with respect to EPOS-LHC of the (top panel) first interaction muonic family particles (energy above 5 GeV) and (bottom panel) ground level muons average transverse momenta as a function of the first interaction inelasticity

Worthy of note is the slight reduction of transverse momentum of ground level muons in low inelasticity collisions and the abrupt change at \(\kappa =0.2\). As was commented in Fig. 5, this is caused by the switching from high to low energy models. This is most clearly seen between models with most different muonic family particles production and properties, e.g. QGSJetII-04 and UrQMD.

5 Model summary

In this section, the main characteristics and differences of the models are summarised:

  • EPOS-LHC ground level components present a relative deficit in the contribution from \(\kappa _1\) events, as shown in the muon and the EM component LDFs. This deficit is a consequence of the weak impact of EPOS-LHC’s diffractive collisions as their leading particles are not as energetic as in the other models. Moreover, this model is more affected by the energy violation in the first interaction. Over 10 GeV variations were registered which influenced the computation of the first interaction inelasticity. The diffractive peak at \(\kappa <0.05\) would be completely smeared out, as the leading particles from highly diffractive events have on average lower energies in EPOS-LHC than in the other models. Moreover, EPOS-LHC’s production of accompanying high energy muonic family particles is slightly lower than in the other models but compensated by a larger production of secondary high energy nucleons.

  • QGSJetII-04 has been shown to present a significantly larger number of muons at ground level. Moreover, the excess is critical at short core distances where simulations are most relevant for experimental purposes. The overproduction arises from a different energy spectrum of secondary particles that does not cause an overall larger number of muonic family particles but an increase in the amount of those carrying energies over 5 GeV. Additionally, highly diffractive events are suppressed while at larger inelasticities, where more energy is available for particle production, the rates are enhanced. QGSJetII-04’s muon concentration at short core distances is caused by lower transverse momenta in the muonic family particles produced. It is clearest at large \(\kappa \) values where muonic particles carry great fractions of the shower energy. The muonic family particles’ \(p_{\text {t}}\) is transferred in their decay to the muons which reach ground level closer to the shower axis.

  • SIBYLL 2.3c’s ground level spectrum is similar to EPOS-LHC’s. Despite this agreement, both models have been shown to be very different; the event rate at large \(\kappa \) values is significantly different as well as the accompanying particle production. In the \(\kappa >0.4\) regime, events led by nucleons with a strong accompaniment of particles from the muonic family are enhanced over events that are led directly by muonic particles. In contrast, EPOS-LHC’s \(\kappa _3\)-nucleon led events have a larger fraction of accompanying nucleons but result in a subtle difference at ground level.

  • UrQMD’s proton cross-section is shown to be significantly larger than that of the other high energy models in Appendix A. This has a severe impact on the development of showers initiated by 100 GeV primaries if the height of the first interaction is not fixed. Furthermore, we have shown that over 20% of the events are highly diffractive, showing a great difference with the other models. Although a compensation between the two effects could be expected in non-fixed first interactions, it has been shown in [2] that this does not occur and causes an overall deficit of the ground level components. In comparison to the high energy models and specifically to QGSJetII-04, UrQMD’s high energy muonic family particles show large transverse momenta. Consequently, its muon LDF is spread over a larger surface, resulting in the opposite behaviour to QGSJetII-04. UrQMD’s observed disagreement in the production of muonic particles (\(>5\) GeV) and their transverse momentum, questions the setting of the transition energy. It has been shown that in the \(\kappa _2\) regime all high energy model predictions blend together due to the large influence of UrQMD’s physics in secondary interactions. In contrast, all models seem to diverge in regimes where the low energy model does not dominate; this can be seen in \(\kappa <0.2\) and \(\kappa >0.5\) events.

6 Impact of model choice and transition energy setting on air shower development

So far, the main focus has been on the interaction of 100 GeV protons with atmospheric nitrogen. An air shower can be viewed as a superposition of cascades initiated by particles with a fraction of the initial particle energy. Therefore, it is expected that the model differences observed for interactions with 100 GeV primaries, might also manifest themselves throughout the development of the air showers that were initiated by primaries with energies well above 100 GeV. Moreover, as the event generators select the hadronic interactions by the centre of mass energy and the participants involved, the correspondence between the studied first interactions and the mid-shower development is ensured. It is not practical to track the impact of each individual sub-shower generated by a hadronic interaction on the air shower as whole. However, to get an idea of the impact of the hadronic interaction model choice and the transition energy between low and high energy models, the CORSIKA data access tools (COAST) package [1] was used to track the rate of hadronic interactions throughout the whole shower development. This analysis contextualises the studied differences at higher energies and more commonly studied primaries. Shifting the transition energy therefore reveals whether the studied differences enhance the disagreement or bring models into accordance.

Fig. 8
figure 8

On the left y-axis, the ratio between the hadronic interaction distributions in PeV proton showers is shown for each high energy model with respect to EPOS-LHC. Labelled in green and on the right y-axis, EPOS-LHC’s hadronic interactions distribution is shown for a PeV proton initiated event

In Fig. 8 we display the hadronic interaction ratios in PeV proton initiated showers (with default transition energy). Firstly, EPOS-LHC’s events undergo a considerably larger number of interactions below the 10 TeV scale, over 10% more than QGSJetII-04 and SIBYLL 2.3c at energies at which muon production peaks. This reflects how, for the same initial conditions, models generate different shower scenarios which agree in their ground level product [2]. Consequently, the differences in particle production and shower dynamics between models can also be inferred here. Worthy of note is QGSJetII-04’s and SIBYLL 2.3c’s high energy interaction rate. Although the likelihood of these collisions is well suppressed, the whole shower is affected by a larger number of interactions carrying large fractions of the initial energy.

We show in the Appendix C the effect of modifying the values of the transition energy on the hadronic interaction profiles. There it is shown how the increase or decrease of the transition energy results in opposite effects for EPOS-LHC and the other models. Strengthening the role of the low energy model causes a decrease in the number of interactions under 10 TeV in EPOS-LHC and an increase in QGSJetII-04’s and SIBYLL 2.3c’s. This can be understood as UrQMD’s hadronic interaction profile lying in between EPOS-LHC’s and SIBYLL 2.3c’s.

In the previous sections we discussed how the models, high or low energy, do not agree for the default transition energy value. While a better compatibility in the inelasticity distributions can be found at higher energy, increasing the transition energy could mute the properties of the individual high energy models and threaten the consistency of events at higher energy. This inconsistency and abrupt change in the secondary interaction physics demands for a better agreement between models and/or a redefinition of the transition scales.

7 Conclusions

Through a detailed investigation of events initiated by 100 GeV protons we cast light on the sources of disagreement between hadronic interaction models [2]. By a simultaneous analysis of the first interaction and ground level particles, we can conclude that models present an intrinsically different behaviour in the low energy regime. An outline on each model’s arguments is presented in Sect. 5.

Phenomenological variables were used to correlate differences in ground level observables to properties of the first interaction for some commonly employed high energy models. Discrepancies in the ground level particle number emerged from event rate deviations that were drawn from the study of first interaction inelasticity distributions. Within same events types (same leading particle and inelasticity), the production of accompanying particles (e.g. muonic family particles) disagrees causing differences in the ground-level muon component. The transverse momentum spectra of muonic particles that are produced in the first interaction has also been shown to differ between models. This disagreement becomes worse as their energy increases.

The focus in this study was on the differences between models in the early shower development, however, the model differences are of course not restrained to the early development stage. When increasing the primary energy, the number of interactions around the switching energy will increase, however, the differences on the average shower parameters are washed away by the dominance of particle production by the low energy model. When selecting air showers with special properties, like proton induced air showers that mimic gamma-ray induced air showers, differences between the modelling of hadronic interactions might be exposed again [3], also at higher primary energies (TeV domain).

Additionally, we have shown how the event generators diverge by studying the number of hadronic interactions in showers initiated at high energies. Large differences have been spotted remarking how the models produce very dissimilar showers from the same initial conditions. In this line, we have shown how sensitive the shower development is to modifications of the transition energies. The low energy model shows a clear domination of the shower (ruling over 85% of the interactions in a PeV shower with default transition energy and increasing percentage for higher primaries) while the high energy models set the initial conditions. This points to low energy model investigations as a possible important contribution to the solution of the hadronic interaction model puzzles [12,13,14].

As the focused energy regime falls within the validity range of all models, a reasonable agreement would be expected between all high energy models and UrQMD. The large differences found demand for a convergent model tuning to existing and/or future accelerator data or a redefinition of the transition regime from high to low energy hadronic interaction models that ensures consistency with the high energy partners.