1 Introduction

The description of inclusive hadron production in high energy hadron-hadron collisions remains subject to significant theoretical uncertainties. At TeV energies the dominant source of secondary particle production is the fragmentation of quarks and gluons in semihard scattering [1], referred to as minijet production. However, various processes that cannot be directly calculated from first principles in quantum chromodynamics (QCD) also contribute to particle production, i.e. multiparton interactions (MPIs), and fragmentation of the remnants. Together with initial- and final-state radiation these additional particle production mechanisms are typically referred to as the underlying event and are modelled phenomenologically in Monte Carlo (MC) event generators with parameters tuned using data [2,3,4]. In addition, especially in the forward phase space, diffractive processes play an important role [5]. Furthermore, final-state parton rescattering effects, a possible hydrodynamical phase transition, or other collective phenomena can impact and modify particle production in hadron-hadron collisions at high energies [6].

The energy carried by particles emitted into the very forward region (\(-6.6<\eta <-5.2\)) covered by the CASTOR calorimeter [7] of the CMS experiment was shown to be a powerful probe of the activity of the underlying event [8, 9]. For the first time measurements presented in this paper correlate the hadronic energy at very forward rapidities to the central region in proton-proton collisions, offering a new approach to the study of hadron production at the CERN LHC. Such measurements over a very large rapidity interval provide additional information on the underlying event compared to those based only on the central region, e.g. Refs. [10, 11].

The very forward region covered by the data contains the highest energy densities, \(\mathrm{d}{}E\)/\(\mathrm{d}\eta \) [12, 13], so far observed in proton-proton collisions at the LHC. Therefore, the present results can improve event generators used in simulations of extensive air showers induced by cosmic rays at ultra-high energies [14]. Specifically, current air shower simulations are known to significantly underestimate muon production (see Ref. [15] and references therein). The fraction of the energy going into the production of electrons or photons rather than long-lived hadrons has a crucial impact on the muon production rate in extensive air showers, see Ref. [16]. Since CASTOR consists of separate electromagnetic and hadron calorimeters, the data presented here provide new information that may improve understanding of muon production in air showers.

2 Experimental setup and Monte Carlo simulation

The main feature of the CMS apparatus is a superconducting solenoid of 6\(\text {\,m}\) internal diameter that can provide a nominal magnetic field of 3.8\(\text {\,T}\). Within the solenoid volume in the central region are a silicon pixel and strip tracker, a lead tungstate crystal electromagnetic calorimeter, and a brass and scintillator hadron calorimeter. Muons are measured in gas-ionisation detectors embedded in the steel return yoke. The central detectors of CMS are complemented by calorimeters in the forward direction, which all rely on the detection of Cherenkov photons produced when charged particles pass through their active quartz components. The “hadron forward” (HF) calorimeters cover the pseudorapidity interval \(3.0<|\eta |<5.2\) and use quartz fibres embedded in a steel absorber. The CASTOR calorimeter is a sampling calorimeter composed of layers of fused silica quartz plates and tungsten absorbers. It is located on only one side of CMS and covers the region \(-6.6<\eta <-5.2\). CASTOR is segmented into 16 azimuthal towers, each with 14 longitudinal channels. The two front channels have a combined depth of 20 radiation lengths and form the electromagnetic section of each tower. The remaining 12 channels constitute the hadronic section. The full depth of a tower amounts to 10 hadronic interaction lengths. A more detailed description of the CMS detector, together with a definition of the coordinate system used and all relevant kinematic variables, can be found in Ref. [17]. A detailed description of the CASTOR calorimeter is given in Refs. [7, 9, 18]. For triggering purposes, the Beam Pickup Timing for the eXperiment (BPTX) devices were used [19].

The data are compared to a broad range of model predictions covering different parameter tunes as well as entirely different physics approaches. The models considered are pythia 8 [20] (version 8.212) with tune CUETP8M1 [21], and tune 4C [3], combined with the MBR [22] model to describe diffractive processes. The data are also compared to the predictions of epos lhc [23] and sibyll 2.1 [24]. For these models, a detailed Monte Carlo simulation of the CMS detector response is performed with the Geant4  [25] toolkit. The simulated events are processed and reconstructed in the same way as the collision data. Furthermore, predictions by QGSJetII.04 [26], sibyll 2.3c [27], pythia 8 tune CP5 [28], and herwig  7.1 [29, 30] with the default tune for soft interactions [31] are also compared to the data. These simulations are produced only at the generator level. A forward folding method is developed to compare generator-level simulations to the data. This technique can be used to compare any model or theoretical prediction to the data and will be described in detail.

3 Data analysis and systematic uncertainties

This analysis is based on data recorded during the low-luminosity startup operation of the LHC in June 2015, at a proton-proton centre-of-mass energy of 13\(\,\text {TeV}\). In this period the CMS solenoid was turned off. The data correspond to an integrated luminosity of 0.22\(\,\text {nb}^{-1}\), with an average proton-proton interaction probability of about 30% per bunch crossing.

The event selection criteria are optimised to select inelastic collision events with minimal bias. The residual contribution of electronic noise and beam background in these events is well below 1%. Events were selected online with an unbiased trigger requiring only the presence of two colliding bunches. The offline event selection requires activity in the HF calorimeters: at least one tower with reconstructed energy larger than 5\(\,\text {GeV}\) in either the positive or negative HF calorimeter. In addition, at least one reconstructed track with \(|\eta |<2\) is required in the CMS pixel detector. A modified tracking algorithm from Ref. [32] is used in the absence of a magnetic field. Information from the pixel detector is used to reconstruct straight tracks. Signals in all three layers of the pixel detector are required to lie within a cone of radius \(R = \sqrt{\smash [b]{(\Delta \phi )^2+(\Delta \eta )^2}} = 0.02\) (where \(\phi \) is the azimuthal angle in radians) around the reconstructed track. The efficiency to find more than two hits in the pixel detector drops quickly for \(|\eta |>2\); the search for tracks is therefore limited to \(|\eta |<2\). Tracks are retained if they originate from the expected interaction region and are linked to at least one interaction vertex. This pixel track reconstruction has an efficiency of about 76% and a probability of \(\approx \)5% of spurious tracks for charged particles with a transverse momentum \(p_{\mathrm {T}}\) larger than 200\(\,\text {MeV}\).

To reject events with more than one simultaneous proton-proton interaction (pileup), an additional constraint on the reconstructed interaction vertices is applied. Events with two reconstructed vertices are rejected if the vertices are separated by more than 0.5\(\,\text {cm}\) along the z axis. This minimises the rejection of events with high particle multiplicity, where the reconstruction may create multiple spurious vertices. The probabilities for events to have additional collisions is evaluated in both data and simulation to be 1.5% (visible vertex) and 2.3% (invisible vertex). The correction of these background events is not straightforward, since the correction depends on the track multiplicity in the central region as well as on the model used in simulation. Therefore, the contribution from pileup events to the forward energy is considered part of the systematic uncertainty of the measurement.

The total energy deposited in CASTOR is obtained by summing the energy measured in each calorimeter tower above the noise threshold, which is determined independently for each tower and varies between 2 and 2.5\(\,\text {GeV}\). On average, 76% of the showers due to single electrons or photons are contained within the electromagnetic section of CASTOR, and single hadrons are 71% contained in the hadronic section. Moreover, for a given particle energy, the energies deposited by hadron-induced showers are smaller than electron-induced showers, which is known as noncompensation. These properties were precisely measured with a test beam and are implemented in the detector simulation. It was previously shown that the energy deposited in the corresponding sections of CASTOR can serve as good estimators for the particle-level energy of electrons/photons and hadrons [9]. The electromagnetic and hadronic energies of a given event are defined as the energies deposited in the corresponding detector sections of CASTOR, and the total energy as the sum of both.

The events are classified according to the number of reconstructed charged tracks from the vertex. The average total, electromagnetic, and hadronic energy per event is calculated for each track multiplicity bin. The present data make it possible to study track multiplicities up to 150. The statistical uncertainties of the energy measurement are below 2%, much smaller than the systematic uncertainties. The most important sources of systematic uncertainties are described in the following and are summarised in Table 1:

CASTOR energy scale The energy scale uncertainty of CASTOR is 17% [9]. The energy scale is determined using a calibration procedure based on SPS test-beam data, LHC beam halo muon events, a cross-calibration to the HF calorimeters, and LED test pulses, in combination with a precise detector alignment. The precision is currently limited by systematic effects related to the modelling and understanding of particle shower cascades in the calorimeter ranging from GeV to TeV energies.

CASTOR intercalibration The relative intercalibration is performed using the measured response of each channel to single LHC beam halo muon events, which were recorded with a dedicated trigger during LHC interfill periods. This procedure is limited by the available muon statistics. For a measurement of the total energy, the uncertainty caused by intercalibration is averaged over the whole calorimeter and is 2–3%. For the determination of the electromagnetic and hadronic energy fractions, on the other hand, the effect of relative calibration becomes more significant. Dedicated studies based on full detector simulations of collision events demonstrate that the observed average shape of the longitudinal shower absorption in the calorimeter is consistent with only a slight overestimation of electromagnetic energies, and a corresponding underestimation of hadronic energies. We determine a maximum decrease of the electromagnetic energy by 8% and a corresponding increase of the hadronic energy by 15%, which are included as systematic uncertainties.

Pileup rejection The uncertainty arising from the pileup contribution is estimated by considering alternative vertex multiplicity selections; events with exactly one reconstructed vertex, as well as events with two vertices separated by less than 0.7\(\,\text {cm}\), are selected. These changes mainly affect the high-multiplicity region and lead to a systematic energy uncertainty of up to 10% for multiplicity >140. Collisions that do not create visible vertices in the detector introduce an additional uncertainty that is below 0.8%.

HF energy scale The uncertainty in the reconstructed HF energies is 10% [33]. Varying the threshold for the event selection from 5.0\(\,\text {GeV}\) per HF calorimeter tower to 4.5 and 5.5\(\,\text {GeV}\) changes the average energy observed in CASTOR by less than 0.5%.

Tracking The track reconstruction uncertainty has been previously determined from studies comparing data and simulation [32]. The uncertainties in the tracking and vertexing efficiencies affect the number of reconstructed tracks by 1.8 and 2–3%, respectively. These are combined linearly, yielding a 5% systematic uncertainty in the number of reconstructed tracks. The effect in the average energy is below 5%.

Most of the uncertainties described here are uncorrelated and are therefore added in quadrature. Moreover, in the measured ratios between electromagnetic and hadronic energies the absolute energy scale uncertainty cancels, while the intercalibration uncertainty introduces a particular anticorrelated effect since a systematic decrease of the electromagnetic energy causes an increase of the hadronic energy and vice versa.

Table 1 Uncertainties in the average energies measured with the CASTOR calorimeter at the detector level. Ranges indicate the variation as a function of the track multiplicity

4 Forward folding of model predictions

The measured track multiplicity is distorted with respect to the true charged particle multiplicity by the effects of acceptance and efficiency of the CMS pixel tracker. Likewise, the energies observed in CASTOR are affected by the energy resolution and the response of the calorimeter. In the present paper, the data are not corrected for these effects, and should thus be compared to the results of a full Monte Carlo detector simulation to compare with other experimental data and to future model predictions. For this purpose, a “forward folding” approach is used here, in which all known detector effects are applied to a given model prediction or theoretical calculation. The forward folding approach is chosen since it yields better systematic uncertainties compared to an unfolding of these data.

At the generator level, events are selected that match the detector-level event selection. At least one charged particle with \(p_{\mathrm {T}} >200\,\text {MeV} \) is required within \(|\eta |< 2\). Furthermore, a fractional momentum loss of the scattered proton of \(\xi > 10^{-6}\) is required. To determine \(\xi \) all stable (\(c\tau >1\,\text {cm} \)) final-state particles are divided into two systems, X and Y, based on their position with respect to the largest rapidity gap in the event. All particles on the negative side of the largest gap are assigned to system X, while the particles on the positive side are assigned to system Y. Based on this, we determine \(\xi =\max \left( M_{\mathrm {X}}^2/s,\,M_{\mathrm {Y}}^2/s\right) \), where \(M_{\mathrm {X}}\) and \(M_{\mathrm {Y}}\) are the invariant masses of the two systems. The selection based on \(\xi \) is relevant at very low particle multiplicities, and leads to an optimal agreement with the event selection as implemented at the detector level. It is also consistent with previous CMS publications, e.g. Ref. [9, 34].

Four-dimensional migration matrices k describing the probability to reconstruct an event with central multiplicity \(N_{\mathrm {tracks}}\) and forward energy \(E_{\mathrm {reco}}\) for given values \(N_{\mathrm {ch}}\) and \(E_{\mathrm {true}}\) are calculated based on all available Monte Carlo samples with full detector simulation. At the generator level, the central multiplicity \(N_{\mathrm {ch}}\) is defined as the number of stable charged final-state particles with \(p_{\mathrm {T}} >200\) \(\,\text {MeV}\) and \(|\eta |< 2\), and the forward energy \(E_{\mathrm {true}}\) is defined as the sum of the energies of all particles within \(-6.6<\eta <-5.2\) except for muons and neutrinos. At the detector level, the number of reconstructed tracks with \(|\eta |<2\) is \(N_{\mathrm {tracks}}\) and the reconstructed energy in CASTOR is \(E_{\mathrm {reco}}\). The four-dimensional matrices \(k_{ij}^{lm}\) are constructed with 20 bins in \(N_{\mathrm {ch}}\) and \(N_{\mathrm {tracks}}\) ranging from 1 to 200 (dimensions i and l) , as well as 46 bins in \(E_{\mathrm {true}}\) and \(E_{\mathrm {reco}}\) ranging from 0 to 10\(\,\text {TeV}\) (dimensions j and m). The bin intervals used at detector and generator level are identical. The range of k is larger than that used for the final results in order to allow for the effects of bin migration. Final results are presented for \(N_{\mathrm {tracks}}\) between 1 and 150.

All four components of k have one extra underflow bin to handle the event selection efficiency. If an event does not pass the event selection criteria at the generator level (\(N_{\mathrm {ch}}\ge 1\) and \(\xi >10^{-6}\)), it is recorded in the underflow region with \(N_{\mathrm {ch}}=0\) and \(E_{\mathrm {true}}=-1\,\text {GeV} \). If an event is not selected at the detector level (one HF tower above 5\(\,\text {GeV}\) and \(N_{\mathrm {tracks}}\ge 1\)), it is recorded in the underflow region with \(N_{\mathrm {tracks}}=0\) and \(E_{\mathrm {reco}}=-1\,\text {GeV} \). In this way, the effects of inefficiencies and migrations from outside the visible phase space are included in k. For example, the selection efficiency for events having a specific \(N_{\text {ch}}\) and \(E_{\text {true}}\) is the ratio of the number of events without the underflow bin to the number of events with the underflow bin.

Two-dimensional distributions, \(N_{\text {reco}}^{ij}\), describing the event yields in bins (ij) of \(N_{\mathrm {tracks}}\) and \(E_{\mathrm {reco}}\) can then be obtained for any given event generator or theoretical prediction by means of the following matrix multiplication:

$$\begin{aligned} N_{\text {reco}}^{ij} = \sum _{l,m} k_{ij}^{lm} N_{\text {true}}^{lm}, \end{aligned}$$
(1)

where \(N_{\text {true}}^{lm}\) is the distribution of generator-level events in bins (lm) of \(N_{\text {ch}}\) and \(E_{\text {true}}\). The average energy in each track multiplicity bin is calculated from \(N_{\text {reco}}^{ij}\) excluding the underflow bins, and is compared to the data directly at the detector level. The results obtained by using the forward folding method coincide with those obtained with the full detector simulation to better than 1%.

The matrix k has a slight dependence on the \(\eta \), \(p_{\mathrm {T}}\) and multiplicity distributions of the final-state particles in the event generator used in the full detector simulation. To quantify this dependence, four matrices are provided based on pythia 8 tune CUETP8M1, pythia 8 tune 4C+MBR, epos lhc, and sibyll 2.1. A fifth matrix is obtained by averaging the matrices of these models and serves as the central value for all forward-folded results. The spread of the results obtained with the individual matrices is an estimate of the systematic uncertainty related to the model dependence; it is mostly well below 5%, but reaches 15% in a few bins. All five variations of k are available in a rivet [35] plugin. This way, the forward folding can be applied to any other model prediction. Moreover, the full point-to-point correlation of the model-related uncertainty can be studied.

5 Results

Fig. 1
figure 1

Top panel: Average total energy reconstructed in the CASTOR calorimeter as a function of the number of reconstructed tracks for \(|\eta |<2\). Bottom panel: Average total energy reconstructed in the CASTOR calorimeter normalised to that in the first bin (\(N_{\mathrm {ch}}<10\)) as a function of the number of reconstructed tracks for \(|\eta |<2\). In all figures, the data are shown as black circles and the corresponding systematic uncertainties with a gray band; horizontal bars are used to indicate the bin width. The predictions of various event generators are compared to the data, which are the same in both panels. The bands associated with the model predictions illustrate the model uncertainty

Fig. 2
figure 2

Top panel: Average electromagnetic energy reconstructed in the CASTOR calorimeter as a function of the number of reconstructed tracks for \(|\eta |<2\). Bottom panel: Average hadronic energy reconstructed in the CASTOR calorimeter as a function of the number of reconstructed tracks for \(|\eta |<2\). In all figures, the data are shown with black circles and the corresponding systematic uncertainties with a gray band; horizontal bars are used to indicate the bin width. The predictions of various event generators are compared to the data, which are the same in both panels. The bands associated with the model predictions illustrate the model uncertainty

Fig. 3
figure 3

Ratio of average electromagnetic and hadronic energies reconstructed in the CASTOR calorimeter as a function of the number of reconstructed tracks for \(|\eta |<2\). The data are shown with black circles and the corresponding systematic uncertainties with a gray band; horizontal bars are used to indicate the bin width. Predictions of various event generators are compared to the data, which are the same in both panels. The bands associated with the model predictions illustrate the model uncertainty

Various measurements of the average energy reconstructed in the region \(-6.6<\eta <-5.2\) are presented as a function of the track multiplicity for \(|\eta |<2\) in Figs. 1, 2, and 3. The statistical uncertainties of the data are small and therefore not visible. The systematic uncertainties are shown with a gray band. The data are not corrected for detector effects and are compared to the predictions of models commonly used to describe hadron interactions at the LHC and in high energy cosmic ray air showers. These models are grouped into two sets:

The first contains pythia 8 tune CUETP8M1 and tune 4C+MBR, epos lhc and sibyll 2.1. All these have a full detector simulation. The error bands shown for these models reflect only the Monte Carlo statistical uncertainties. These become visible especially in the last bin.

The second set of models consists of sibyll 2.3c, QGSJetII.04, pythia 8 tune CP5, and herwig  7.1. Predictions from these models are obtained using the forward-folding method. The uncertainty bands shown for these models also include the systematic uncertainties from the forward-folding procedure discussed in the previous section.

The average total energy in CASTOR, shown in Fig. 1 (upper), increases with the track multiplicity. This feature is consistent with the general behaviour of the underlying event measured at central rapidities (see for example Refs. [10, 11]) and is reproduced by all models. The rise can be associated to an initial correlation of central and forward event activity, which is damped by energy conservation in the most violent collisions. All models describe these data with at most minor discrepancies. This implies that the model parameters for the underlying event determined at central rapidities are valid also for the very forward data. In detail, the energies predicted by pythia4C+MBR and sibyll 2.3c are slightly too low at small multiplicity. Conversely, at intermediate multiplicities, pythia 8 CP5 predicts average energies larger than those observed.

The systematic uncertainty in the data is dominated by the energy scale uncertainty contribution, which is fully correlated between the multiplicity bins. Therefore, the distributions can be normalised to the first bin, so that, when comparing their shapes, the systematic uncertainty is significantly smaller (cf. Fig. 1, lower). The rise is steep at low multiplicities and becomes more gradual at higher multiplicities. All pythia 8 tunes have very similar shapes, inconsistent with that observed in the data. The disagreement is strongest for pythia 8 CP5, a tune optimised on underlying event data at central rapidity. This tune uses parton distribution functions at next-to-next-to-leading order and features a softer MPI cutoff compared to pythia 8 CUETP8M1 (see Ref. [28] for details). The data therefore provide relevant information for future generator improvements and tunes. The epos lhc, QGSJetII.04, and herwig 7.1 models predict saturation at multiplicities above 80, which is not seen in the data. Both versions of sibyll provide predictions in agreement with the data.

The individual electromagnetic and hadronic energy distributions are shown in Figs. 2 (upper) and 2 (lower). All models, with the exception of sibyll 2.3c, describe the electromagnetic component well. pythia4C+MBR slightly underestimates the electromagnetic energy at low multiplicities. Conversely, the other models tend to overestimate the hadronic component. Specifically these data can be very relevant for improving the simulation of cosmic ray induced extensive air showers, and specifically the modelling of the production of neutral versus charged pions or other hadrons with longer lifetimes, since the energies in the region \(-6.6<\eta <-5.2\) are close to those in the peak of the forward energy flow.

The data are also used to determine the ratio of the average electromagnetic and hadronic energies (Fig. 3). Here, the relative calibration of the electromagnetic and hadronic sections is the main source of uncertainty and results in a very asymmetric uncertainty band. The measured ratio is approximately constant over the whole multiplicity range. The ratio is sensitive to the details of hadronisation, and discrepancies between models and data may reflect an inadequate description of the hadron production mechanisms. String fragmentation, remnant fragmentation, initial- or final-state radiation, the effects of a possible very dense hydrodynamical phase, or the decay of short-lived resonances may be relevant to the understanding of the data. The observed independence of the measured ratio of track multiplicity indicates that no dramatic change of the particle production mechanism is observed at this very forward pseudorapidity. All model predictions are lower than the data, specifically those of the modern tunes pythia 8 CP5 and sibyll 2.3c, whereas QGSJetII.04, sibyll 2.1, and herwig 7.1 provide the best description of the ratio.

6 Summary and discussion

The average energy per event in the pseudorapidity region \(-6.6<\eta <-5.2\) was measured as a function of the observed central track multiplicity (\(|\eta |<2\)) in proton-proton collisions at a centre-of-mass energy of 13\(\,\text {TeV}\). The data are recorded during the first days of 13\(\,\text {TeV}\) running with low beam intensities. The measurement is presented in terms of the total energy as well as its electromagnetic and hadronic components. The very forward region covered by the data contains the highest energy densities studied in proton-proton collisions at the LHC so far. This makes the present data relevant for improving the modelling of multiparticle production in event generators of ultra-high energy cosmic ray air showers.

The measured average total energy as a function of the track multiplicity is described by all models reasonably well. This demonstrates that the underlying event parameter tunes determined at central rapidity can be safely extrapolated to the very forward region within experimental uncertainties. A shape analysis indicates, however, that there are significant differences among the models and large deviations from the data. The generator sibyll 2.1 gives the best description of the measured multiplicity dependence of the average total energy.

The data are also presented in terms of the average electromagnetic and hadronic energies per event as a function of the central track multiplicity. This is useful in the study of different particle production mechanisms, since the former is primarily due to the decay of neutral pions and the latter to the production of hadrons with longer lifetimes, mostly charged pions. All models give a good description of the electromagnetic energy dependence on the multiplicity, with the exception of sibyll 2.3c. Conversely, the predictions for the hadronic energy have a significantly larger spread compared to the electromagnetic case.

The ratio between the electromagnetic and hadronic energies is also presented. The data exhibit a larger fraction of electromagnetic energy than the models, and disagree with the two most recent model tunes, i.e. sibyll 2.3c and pythia 8 CP5. Therefore, these models cannot explain the muon deficit in ultra-high energy air shower simulations since the data indicate that even more energy must be channelled into the electromagnetic part of the cascade and is thus lost for the generation of further hadrons [16].