1 Introduction

Measurements of the yields and kinematic distributions of particles produced in proton–proton (pp) collisions at the Large Hadron Collider (LHC) can provide a better understanding of the mechanisms of hadron production in high-energy hadronic interactions. Two types of processes contribute to the production of most of the final-state particles at LHC energies. Semi-hard (multi)parton scattering, with exchanged momenta of a few \(~\text {GeV}\), is the dominant contribution. Diffractive scattering in more peripheral pp interactions, where one or both protons survive the interaction and/or are excited into a low-mass state, accounts for 15–40 % [1, 2] of the pp inelastic cross section. As the particle multiplicity produced in these processes is modelled phenomenologically in the existing Monte Carlo (MC) event generators of hadronic interactions, experimental results provide an important input for tuning of the models.

The results presented here focus on the charged-particle multiplicity density (\(\mathrm {d}{}N_\mathrm{{ch}}/\mathrm {d}{}\eta \), also referred to as the pseudorapidity distribution) corrected down to zero transverse momentum (\(p_{\mathrm {T}}\)), in the pseudorapidity ranges \(|\eta | < 2.2\) and \(5.3<|\eta |<6.4\), where \(\eta \) is defined as \(-\ln [\tan (\theta /2)]\), with \(\theta \) being the polar angle of the particle trajectory with respect to the anticlockwise-beam direction. Inclusive measurements of the \(\eta \) and \(p_{\mathrm {T}}\) distributions of charged particles have previously been performed in pp and \(\mathrm {p}\mathrm {\overline{p}}\) collisions for different centre-of-mass energies and phase space regions [314].

In this paper, the data samples were collected with a minimum bias trigger generated by at least one arm of the TOTEM T2 telescopes, which also triggered the readout of the Compact Muon Solenoid (CMS). Three event samples with different final-state topologies are selected offline: a sample of inclusive inelastic pp events, a sample dominated by non-single diffractive dissociation (NSD) events and a sample enriched in single diffractive dissociation (SD) events. The measured data are compared to the predictions of MC event generators that model pp collider data and high-energy cosmic ray hadronic interactions.

2 Experimental setup

The CMS and TOTEM experiments use a right-handed coordinate system, with the origin at the nominal interaction point (IP), the \(x\)-axis pointing to the centre of the LHC ring, the \(y\)-axis pointing upwards and the \(z\)-axis pointing along the anticlockwise-beam direction. The azimuthal angle, \(\phi \), is measured in the \((x,y)\) plane, where \(\phi =0\) is the \(+x\) and \(\phi =\pi /2\) is the \(+y\) direction.

A complete description of the CMS detector can be found in Ref. [15]. The central feature of the CMS apparatus is a superconducting solenoid of 6\(\text {~m}\) internal diameter, providing a uniform magnetic field of 3.8\(\text {~T}\) parallel to the beam axis. Inside the magnetic field are the pixel tracker, the silicon-strip tracker, the lead tungstate electromagnetic calorimeter and the brass/scintillator hadron calorimeter. Muons are measured in gas-ionisation detectors embedded in the steel return yoke outside the solenoid. In addition to the barrel and endcap detectors, which extend up to \(|\eta |=3.0\), the steel/quartz-fibre hadron forward calorimeters (HF) cover the region \(2.9<|\eta |<5.2\). The tracking detector consists of 1440 silicon pixel and 15 148 silicon strip detector modules. The barrel is composed of 3 pixel and 10 strip layers around the interaction point at radii from 4.4 to 110\(\text {~cm}\). The forward and backward endcaps each consist of 2 pixel disks and 12 strip disks in up to 9 rings. Three of the strip rings and four of the barrel strip layers contain an additional plane, with a stereo angle of 100\(\text {~mrad}\), to provide a measurement of the \(r\)-coordinate and \(z\)-coordinate, respectively. The tracker is designed to provide a longitudinal and transverse impact parameter resolution of about 100\(~\upmu \text {m}\) and a \(p_{\mathrm {T}}\) resolution of about 0.7 % for 1\({~\text {GeV/}c}\) charged particles at \(\eta =0\) [16].

The TOTEM experiment [17, 18] is composed of three subdetectors: the Roman pots and the T1 and T2 telescopes. Minimum bias events are triggered by the two T2 telescopes, which are placed symmetrically on each side of the IP at about \(|z|=\)14\(\text {~m}\). They detect charged particles produced in the polar angular range of \(\approx \)3–10\(\text {~mrad}\) (\(5.3<|\eta |<6.5\)), with full azimuthal acceptance. Each telescope consists of two half-arms, with each half-arm composed of 10 semicircular planes of triple-GEM (gas electron multiplier) chambers, arranged within a 40\(\text {~cm}\) space along the \(z\)-axis. Each chamber provides two-dimensional information on the track position covering 192\(^{\circ }\) in azimuthal angle with a small vertical overlap region between chambers of two neighbouring half-arms. Every chamber has a double-layered readout board containing two columns of 256 concentric strips (400\(~\upmu \text {m}\) pitch, 80\(~\upmu \text {m}\) width) to measure the radial coordinate and a matrix of 1560 pads, each covering \(\varDelta \eta \times \varDelta \phi \approx 0.06\times 0.018\text {~rad} \), to measure the azimuthal coordinate, and for triggering. The radial and azimuthal coordinate resolutions are about 110\(~\upmu \text {m}\) and 1\(^{\circ }\), respectively [19]. Local angles of single tracks are reconstructed with an average resolution of 0.5\(\text {~mrad}\), and the track pseudorapidity resolution for charged particles is better than 0.05 [20], once the track is identified as coming from the vertex.

The detailed MC simulations of the CMS and TOTEM detectors are based on Geant4  [21]. Simulated events are processed and reconstructed in the same manner as collision data.

3 Monte Carlo models

Various MC event generators for hadronic collisions are used for data corrections and for comparison with the final, fully corrected results. The pythia 6 (version 6.426) [22] generator is used with tune Z2* and pythia 8 (version 8.153) [23] with tune 4C [24]. These programs provide different descriptions of the diffractive component and they both use a model [25] in which multiple partonic interactions are interleaved with parton showering. The Z2* tune [26] uses the CTEQ6L1 [27] parton distribution function (PDF) set. Tune 4C of pythia 8 is based on early LHC data [24]. Parton showers in pythia are modelled according to the Dokshitzer–Gribov–Lipatov–Altarelli–Parisi (DGLAP) [2830] prescription and hadronisation is based on the Lund string fragmentation model [31]. Diffractive cross sections are described by the Schuler–Sjöstrand model [32]. In pythia 6, particle production from a low-mass state, \(X\), with \(M_X < 1\) \({~\text {GeV/}c^\text {2}}\) is treated as an isotropic two-body decay, while for high-mass states it is based on the string model. In pythia 8, the same model is used to generate the cross section and the diffractive mass, but particle production is modelled differently. For low-mass states, the string model is used, but for higher masses (\(M_X > 10\) \({~\text {GeV/}c^\text {2}}\)) a perturbative description of pomeron–proton scattering is introduced, based on diffractive PDFs [3335], which represent probability distributions for partons in the proton under the constraint that the proton emerges intact from the collision. The non-perturbative string model introduces a mass dependence on the relative probability for a pomeron to couple to a quark or a gluon [36]. The perturbative treatment of pomeron–proton scattering results in harder \(p_{\mathrm {T}}\) spectra and higher multiplicity for diffractive events generated with pythia 8 than for those obtained with pythia 6.

The herwig++ (version 2.5.0) [37] MC event generator, with a recent tune to LHC data (UE-EE-3 with CTEQ6L1 PDFs [27, 38]), is also used for comparison with the data. This generator is based on matrix element calculations similar to those used in pythia. However, herwig++ features DGLAP-based parton showers ordered in angle and uses cluster fragmentation for the hadronisation [39]. The description of hard diffractive processes also makes use of diffractive PDFs; however, soft diffraction is not implemented.

The data are also compared to predictions from two MC event generators used in cosmic ray physics [40]: epos [41] with the LHC tune (based on epos 1.99) [42] and qgsjetII-04 [43]. Both models include contributions from soft- and hard-parton dynamics. The soft component is described in terms of the exchange of virtual quasi-particle states, as in Gribov’s Reggeon field theory [44], with multi-pomeron exchanges accounting for underlying-event effects. At higher energies, the interaction is described in terms of the same degrees of freedom, but generalised to include hard processes via hard-pomeron scattering diagrams, which are equivalent to a leading-order perturbative Quantum Chromodynamics (QCD) approach with DGLAP evolution. These models are retuned to LHC data [45], including cross section measurements by TOTEM, and charged-particle multiplicity measurements in the central region, by ALICE and CMS, at \(\sqrt{s}=7\) \(~\text {TeV}\).

4 Datasets

The data were collected in July 2012 during a dedicated run with low probability (\(\sim \)4 %) of overlapping pp interactions in the same bunch crossing (pileup) and a non-standard \(\beta ^*\) = 90\(\text {~m}\) optics configuration, where \(\beta ^*\) is the amplitude function of the beam at the interaction point. These data correspond to an integrated luminosity of \(\mathcal {L} = 45~\upmu {\mathrm {b}}^{-1} \). A minimum bias trigger was provided by the TOTEM T2 telescopes and contributed to the CMS global trigger decision, which initiated simultaneous readout of both the CMS and the TOTEM detectors. The CMS orbit-counter reset signal delivered to the TOTEM electronics at the start of the run ensures the time synchronisation of the two experiments. Events are combined offline by requiring that both the CMS and the TOTEM reconstructed events have the same LHC orbit and bunch numbers. The minimum bias trigger required at least one track candidate (trigger track) in the T2 detector, in either \(z\)-direction [46]. With this selection the visible cross section seen by T2 has been estimated to be 91–96 % of the total pp inelastic cross section at \(\sqrt{s}=8\) \(~\text {TeV}\)  [47]. Zero bias data, obtained by triggering on random bunch crossings, are used to measure the trigger efficiency and to cross-check the pileup probability estimate.

MC samples were used to determine the event selection efficiency and the tracking performance. The efficiency corrections and related uncertainties for the CMS tracker are based on the pythia 6, pythia 8, and epos samples. The MC-based corrections for the TOTEM T2 detector and the corresponding uncertainties were determined by using pythia 8 and epos, which were found to bracket the measured \(\mathrm {d}{}N_\mathrm{{ch}}/\mathrm {d}{}\eta \) distributions in the forward region.

5 Event selection and track reconstruction

The T2 track reconstruction is based on a Kalman filter-like algorithm, simplified thanks to the small amount of material in the GEM planes and the weak magnetic field in the T2 region [20]. In these conditions, the particle trajectory can be successfully reconstructed with a straight-line fit. Single tracks are reconstructed with almost 100 % efficiency for \(p_{\mathrm {T}} >20{~\text {MeV/}c} \), but because of multiple scattering and the magnetic field, tracks can be identified as coming from the IP with an efficiency that increases as a function of \(p_{\mathrm {T}}\) and is greater than 80 % for \(p_{\mathrm {T}} >40{~\text {MeV/}c} \) [18]. The pseudorapidity of a track in T2 is defined as the average pseudorapidity of all T2 track hits, calculated from the angle between the \(z\)-axis and the line joining the hit and the nominal IP. This definition is adopted on the basis of MC simulation studies and gives an optimal estimation of the pseudorapidity of the selected primary (i.e. produced at the IP) particle. Because of the small scattering angle of the particles reconstructed in T2, the position of the vertex does not affect significantly the reconstruction of the track pseudorapidity. Due to the limited number of primary particles in T2 per event, no vertex reconstruction based on T2 information is used for the analysis. The pseudorapidity region \(5.4<|\eta |<5.6\) is not included in the analysis because of the large effect that the beam pipe cone at \(|\eta |\approx 5.5\) has on the propagation of the primary particles.

About 80 % of the reconstructed tracks in T2 are due to non-primary particles, hereafter referred to as secondary particles, that are mainly electrons and positrons generated by photon conversions at the edge of the HF calorimeter of CMS and in the conical section of the beam pipe at \(|\eta |\approx 5.5\). It is therefore important to discriminate these particles from the primary charged particles. The most effective primary/secondary-particle separation is achieved by using the \(z_\mathrm{{impact}}\) track parameter (see Fig. 1), which is defined as the \(z\) coordinate of the intersection point between the track and the plane “\(\pi \)2”. This is the plane which contains the \(z\)-axis and is orthogonal to the plane “\(\pi \)1” defined by the \(z\)-axis and containing the track entry point in T2 [9]. This parameter is found to be stable against residual misalignment biases. Simulation studies demonstrate that the \(z_\mathrm{{impact}}\) distribution in the primary-particle region can be described by the sum of two Gaussian distributions plus an exponential distribution. The Gaussians, hereafter referred to as a “double-Gaussian” distribution, are mainly due to primary particles, whereas the exponential distribution accounts for most secondary particles. Simulations predict a contamination of the double-Gaussian distribution by secondary particles of about 20 %, primarily given by photons converted in the material between the IP and T2, with a contribution from decay products of strange particles; the \(z_\mathrm{{impact}}\) distribution of these particles is a Gaussian centred around \(z_\mathrm{{impact}}=0\).

Fig. 1
figure 1

Definition of the \(z_\mathrm{{impact}}\) parameter

Figure 2 shows the \(z_\mathrm{{impact}}\) distribution at the median \(\eta \) of the inclusive forward pseudorapidity distribution. A combined fit is performed for each \(\eta \) bin of the \(\mathrm {d}{}N_\mathrm{{ch}}/\mathrm {d}\eta \) distribution with the sum of a double-Gaussian and an exponential function, yielding standard deviations (amplitudes) of both Gaussian functions that increase (decrease) with \(\eta \). The mean, required to be the same for both Gaussian distributions, the standard deviations and the amplitudes of the two Gaussian functions, as well as the mean and the amplitude of the exponential, are left free in the fit. The widths of the double-Gaussian distributions are consistent with the observed angular resolution of about 0.5\(\text {~mrad}\) for the T2 track reconstruction. The relative abundance of secondary particles is found to be smaller for higher \(|\eta |\). The fit of the \(z_\mathrm{{impact}}\) distribution is also repeated by using a second degree polynomial for the description of the background. The results are found to be stable with respect to the choice of the background function. The integral of the fitting function approximates the area of the \(z_\mathrm{{impact}}\) distribution to within 1 %.

Fig. 2
figure 2

The \(z_\mathrm{{impact}}\) parameter distribution measured in the data, for tracks reconstructed in one T2 half-arm in the range \(-5.8<\eta <-5.75\). A global (double-Gaussian + exponential function) fit, performed in the range from \(-8\) to 15\(\text {~m}\) is shown by the solid curve. The dotted curve represents the exponential component from secondary particles, while the dashed curve is the double-Gaussian component, mainly due to primary tracks

The T2 tracks are considered “primary candidates” if they are in the \(z_\mathrm{{impact}}\) range corresponding to 96 % of the area of the double-Gaussian, taken symmetrically around the mean.

The standard CMS track reconstruction algorithm is based on a combinatorial track finder (CTF) [16]. The collection of reconstructed tracks is produced by multiple iterations of the CTF track reconstruction sequence, in a process called iterative tracking. The reconstruction of the interaction vertices in the event uses the available reconstructed track collection. Prompt tracks are selected based on given quality criteria, and the selected tracks are then clustered in \(z\) using a “deterministic annealing” (DA) algorithm [48]. After identifying candidate vertices based on the DA clustering, the candidates containing at least two tracks are fitted by means of an adaptive vertex fit, where tracks in the vertex are assigned a weight between 0 and 1 based on their compatibility with the common vertex. In the central region, covered by the CMS tracker, high-purity primary tracks [49] are selected with \(p_{\mathrm {T}} >0.1\) \({~\text {GeV/}c}\) and relative \(p_{\mathrm {T}}\) uncertainty less than 10 %. To maximise the track-vertex association efficiency, a selection is applied on the track impact parameter, both along the \(z\)-axis and in the transverse plane. The impact parameter with respect to the beam spot in the transverse plane, \(d_{xy}\), is required to be \(|d_{xy}/\sigma _{xy} | < 3\), while for the point of closest approach to the primary vertex along the \(z\)-direction, \(d_{z}\), the requirement \(|d_z/\sigma _z | < 3\) is imposed. Here \(\sigma _{xy}\) and \(\sigma _z\) denote the uncertainties in \(d_{xy}\) and \(d_z\), respectively. The analysis is restricted to \(|\eta |<2.2\), to avoid effects from tracks close to the geometric edge of the tracker. Events with more than one reconstructed vertex are discarded, thus reducing the effect of pileup to a negligible level (\(<\)1 %).

The pseudorapidity distributions of the charged particles are measured in the central and forward regions for three different event samples, with topologies corresponding to three different event selection criteria. An inclusive sample of events is selected by requiring at least one primary-track candidate in T2. Event samples enhanced in non-single diffractive dissociation (NSD) and single diffractive dissociation (SD) events are also selected, the former defined by requiring a least one primary candidate in each of the two T2 telescopes and the latter by selecting events with at least one primary candidate in one T2 telescope and none in the other. Therefore, the intersection of the NSD-enhanced and SD-enhanced samples is empty, while the union is the inclusive sample.

The inclusive sample includes \(\sim \)99 % of non-diffractive events. The reconstruction efficiency for diffractive events is 50 % for a diffractive mass \(M\sim 3.6\) \({~\text {GeV/}c^\text {2}}\) and increases rapidly to 99 % for \(M>10\) \({~\text {GeV/}c^\text {2}}\). Most of the non-diffractive and double diffractive events, as well as the single diffractive events with masses larger than 1.3\({~\text {TeV/}c^\text {2}}\), produce particles in both T2 telescopes and are therefore included in the NSD-enhanced sample. Simulation studies based on pythia 8 and epos show that the fraction of NSD events in the SD-enhanced sample amounts to 45–65 %.

6 Data analysis

The pseudorapidity density measurements presented here refer to “stable” primary charged particles, with an average lifetime longer than \(3\times 10^{-11}\) s, either directly produced in pp collisions or from decays of particles with shorter lifetimes. Such a definition, consistent with that of previous studies [37, 9], considers as secondary particles the decay products of \(\mathrm {K^0_S}\) and \(\mathrm {\varLambda }\) hadrons and all of the charged particles generated by interactions with the material in front and around the detectors.

6.1 Trigger efficiency correction

The trigger efficiency is determined with a zero bias event sample separately for events with primary-track candidates reconstructed offline in both arms of T2 and in only one arm. All zero bias data taken were used to determine the trigger inefficiency. The inefficiency of the trigger is mainly due to non-operating and noisy channels. For each event category, the trigger efficiency is calculated as a function of the total number of tracks reconstructed offline in T2, \(n_{\mathrm{T}2}\), as \(\epsilon _\mathrm{{trig}}=N(n_{\mathrm{T}2})_\mathrm{{trig}}/N(n_{\mathrm{T}2})_\mathrm{{zb}}\), where \(N(n_{\mathrm{T}2})_\mathrm{{trig}}\) is the number of events with the total T2 track multiplicity \(n_{\mathrm{T}2}\) passing the trigger selection and \(N(n_{\mathrm{T}2})_\mathrm{{zb}}\) is the number of events with \(n_{\mathrm{T}2}\) tracks selected with the zero bias trigger. The measured trigger efficiency is shown in Fig. 3.

Fig. 3
figure 3

Trigger efficiency as a function of the total track multiplicity in the T2 telescopes for single-sided events with primary candidates in only the \(z>0\) (\(+\)) or \(z<0\) (\(-\)) telescope and for double-sided events with primary candidates in both telescopes

The trigger efficiency correction, \(1/\epsilon _\mathrm{{trig}}\), is applied separately for the three event categories and is significant for events with \(n_{\mathrm{T}2}=1\), while it approaches unity for \(n_{\mathrm{T}2}\ge 3\). The overall trigger inefficiency results in a 0.1–0.2 % relative correction to the total \(\mathrm {d}{}N_\mathrm{{ch}}/\mathrm {d}\eta \) distributions for the three event categories. The value of the associated systematic uncertainty is conservatively assumed to be equal to this relative correction.

6.2 Event selection correction

In order to take into account the differences between the sample defined at the MC-particle level (“gen selected”) and the one selected based on the reconstructed tracks (“reco selected”), a correction factor needs to be introduced. This correction is calculated for each \(\eta \) bin and event category from the ratio

$$\begin{aligned} C_\mathrm{{sel}}(\eta )=\frac{\mathrm {d}{}N_\mathrm{{ch}}/\mathrm {d}\eta _\mathrm{{gen}}|_\mathrm{{gen~selected}}}{\mathrm {d}{}N_\mathrm{{ch}}/ \mathrm {d}{}\eta _\mathrm{{gen}}|_\mathrm{{reco~selected}}}, \end{aligned}$$
(1)

where the numerator is the pseudorapidity density obtained from the MC simulation for events selected based on the charged particles within the T2 acceptance, while the denominator is the density obtained by selecting the simulated events according to the topology of the primary candidates in T2, as explained in Sect. 5. In general, \(C_\mathrm{{sel}}\) is different from unity because of migrations between the different event categories. Because of misidentification of secondary particles as primaries and of track reconstruction inefficiencies, an event can be classified (according to the configuration of its reconstructed tracks) in a category that does not reflect its true charged-particle content.

For the inclusive and NSD-enhanced analysis, \(C_\mathrm{{sel}}\) is evaluated with pythia 8 and epos. Moreover, to quantify possible biases related to this correction, the analyses are repeated with the same selection method defined in Sect. 5 but without the primary candidate \(z_\mathrm{{impact}}\) requirement.

As the SD-enhanced multiplicity is smaller than the NSD-enhanced multiplicity, a larger selection inefficiency is expected for the former class of events. Moreover, as the SD-enhanced sample represents only 26 % of the inclusive sample, the NSD events that are wrongly identified as SD can introduce a large bias in the measured SD-enhanced \(\mathrm {d}{}N_\mathrm{{ch}}/\mathrm {d}{}\eta \) distributions. Additional studies of the event selection strategy for the SD-enhanced analysis are therefore performed. The analysis is repeated with different event selection strategies and \(C_\mathrm{{sel}}\) is reevaluated for each as a function of \(\eta \) and of the track multiplicity in T2. The selection methods differ in the maximum number of tracks and of primary candidates reconstructed in T2 on each side of the IP. Simulation studies show that, depending on the method and the MC generator used, the selection efficiency is in the range of 70–90 %. The purity of the SD-enhanced sample, defined as the fraction of the selected events with primary charged particles in only one arm of T2, varies between 66 and 81 %. The dependence of the SD-enhanced \(\mathrm {d}{}N_\mathrm{{ch}}/\mathrm {d}{}\eta \) distributions on the selection methods is used to evaluate the systematic uncertainty related to the event selection. More details on the numerical values of \(C_\mathrm{{sel}}\) are given in Sects. 6.3 and 6.4.

6.3 Measurement of \(\mathrm {d}{}N_\mathrm{{ch}}/\mathrm {d}{}\eta \) in the central region

The charged-particle pseudorapidity distributions in the central region are obtained from the raw distributions of charged tracks after applying a number of corrections according to the formula:

$$\begin{aligned}&\frac{\mathrm {d}{} N_\mathrm{{ch}}}{\mathrm {d}{} \eta } \nonumber \\&\quad =\frac{C_\mathrm{{sel}}(\eta )\sum _\mathrm{{evt}} \omega _\mathrm{{evt}}(n_\mathrm{{CMS}}, n_{\mathrm{T}2}) \sum _{\text {trk} \in S}\omega _\mathrm{{trk}}(n_\mathrm{{CMS}}, p_{\mathrm {T}},\eta ) }{\varDelta \eta \sum _{n_\mathrm{{CMS}}} N_\mathrm{{evt}} (n_\mathrm{{CMS}},n_{\mathrm{T}2})\omega _\mathrm{{evt}}(n_\mathrm{{CMS}}, n_{\mathrm{T}2})},\nonumber \\ \end{aligned}$$
(2)

where \(S\) is the sample of tracks that pass the selection criteria for a given \(\eta \) bin, \(n_\mathrm{{CMS}}\) and \(n_{\mathrm{T}2}\) is the total number of reconstructed tracks in the CMS tracker and T2, respectively, \(N_\mathrm{{evt}}\) is the number of triggered events in the corresponding track multiplicity bins, \(\omega _\mathrm{{evt}}\) corrects for the trigger and the vertex reconstruction efficiencies, \(\omega _\mathrm{{trk}}\) corrects for the tracking efficiency and the effect of non-primary tracks and \(C_\mathrm{{sel}}\) corrects for the effect of the event and primary-track selection with T2. The bin width in \(\eta \) is \(\varDelta \eta =0.2\).

The event correction, \(\omega _\mathrm{{evt}}\), depends on the track multiplicity in T2, \(n_{\mathrm{T}2}\), as well as on the multiplicity in the CMS tracker because of the minimum number of tracks required in the vertex reconstruction. It is given by

$$\begin{aligned} \omega _\mathrm{{evt}}(n_\mathrm{{CMS}},n_{\mathrm{T}2}) = \frac{1}{\epsilon _\mathrm{{trig}}(n_{\mathrm{T}2})\epsilon _\mathrm{{vtx}}(n_\mathrm{{CMS}})}, \end{aligned}$$
(3)

where \(\epsilon _\mathrm{{trig}}\) is the trigger efficiency (Fig. 3) and \(\epsilon _\mathrm{{vtx}}\) is the primary vertex reconstruction and selection efficiency, calculated with pythia 6 as the ratio of the number of reconstructed events satisfying the primary vertex selection to the total number of generated events.

Fig. 4
figure 4

Top Tracking efficiency, \(\epsilon _\mathrm{{trk}}\), as a function of \(p_{\mathrm {T}}\) and \(\eta \) and averaged over all multiplicity bins (\(n_\mathrm{{CMS}}\)), for tracks with \(p_{\mathrm {T}} >0.1\) \({~\text {GeV/}c}\) and \(|\eta |<2.2\). Bottom Correction factor, \(f_{\mathrm {np}}\), for non-primary tracks as a function of \(p_{\mathrm {T}}\) and \(\eta \) and averaged over all multiplicity bins (\(n_\mathrm{{CMS}}\)), for tracks with \(p_{\mathrm {T}} >0.1\) \({~\text {GeV/}c}\) and \(|\eta |<2.2\)

Table 1 Systematic and statistical uncertainties of the \(\mathrm {d}{}N_\mathrm{{ch}}/\mathrm {d}{}\eta \) measurement in the central region. The given ranges indicate the \(\eta \) dependence of the uncertainties

The correction for the tracking efficiency and non-primary tracks, \(\omega _\mathrm{{trk}}(n_\mathrm{{CMS}}, p_{\mathrm {T}},\eta )\), is defined as:

$$\begin{aligned}&\omega _\mathrm{{trk}}(n_\mathrm{{CMS}}, p_{\mathrm {T}},\eta ) \nonumber \\&\quad =\frac{1-f_\mathrm{{np}}(n_\mathrm{{CMS}},p_{\mathrm {T}},\eta )}{\epsilon _\mathrm{{trk}}(n_\mathrm{{CMS}},p_{\mathrm {T}},\eta ) \left( 1+f_{\mathrm {m}}(n_\mathrm{{CMS}},p_{\mathrm {T}},\eta )\right) }. \end{aligned}$$
(4)

Here, \(\epsilon _\mathrm{{trk}}\) corrects for the geometric detector acceptance and the reconstruction tracking efficiency; the correction factor \(f_\mathrm{{np}}\) accounts for the fraction of non-primary tracks, i.e. secondary and misidentified tracks, while \(f_{\mathrm {m}}\) corrects for the effect of single charged particles that are reconstructed multiple times. These quantities are obtained from a detector simulation in bins of \(n_\mathrm{{CMS}}\), \(p_{\mathrm {T}}\) and \(\eta \). The effect of bin migrations is found to be negligible. Reconstructed events are required to pass the event selection and the generated particles are matched to the reconstructed tracks by using spatial and momentum information.

The tracking efficiency, \(\epsilon _\mathrm{{trk}}\), is determined as the ratio of the number of all reconstructed tracks that are matched to generated particles and satisfy the track selection criteria in an (\(n_\mathrm{{CMS}}, p_{\mathrm {T}},\eta \)) bin to the number of generated primary charged particles in that bin. As shown in Fig. 4 (top), \(\epsilon _\mathrm{{trk}}\) approaches unity for tracks with \(p_{\mathrm {T}} >0.5\) \({~\text {GeV/}c}\) and \(|\eta |<1.5\).

The correction for non-primary tracks, \(f_{\mathrm {np}}\), is estimated as the ratio of the number of reconstructed tracks not matched to a generated primary particle in a \(n_\mathrm{{CMS}}, p_{\mathrm {T}}, \eta \) bin to all reconstructed tracks in that bin. The correction varies as a function of \(\eta \) and \(p_{\mathrm {T}}\) of the tracks, as shown in Fig. 4 (bottom), and reaches its lowest values of about 2 % for \(|\eta |<1.5\) and \(p_{\mathrm {T}} >0.5\) \({~\text {GeV/}c}\). It becomes as large as 20 % at very low transverse momentum (\(p_{\mathrm {T}} <0.2\) \({~\text {GeV/}c}\)) and large pseudorapidity (\(|\eta |>1.8\)).

The correction factor for multiply reconstructed particles, \(f_{\mathrm {m}}\), is estimated as the ratio of the number of generated primary charged particles that are associated to multiply reconstructed tracks in a given \(n_\mathrm{{CMS}}\), \(p_{\mathrm {T}}\), \(\eta \) bin to the number of generated charged particles in that bin. It is found to be below 1 %.

The model dependence of the corrections is determined by using pythia 6, epos and pythia 8. The corrected data, based on correction factors derived from each generator independently, are found to differ by 1–4 % depending on the pseudorapidity bin. This amount is taken as a systematic uncertainty.

The average correction factor for the event selection, \(C_\mathrm{{sel}}(\eta )\), defined in Sect. 6.2, is found to be 1.01, 1.025, and 0.94 for the inclusive, NSD-enhanced and SD-enhanced samples, respectively, independent of pseudorapidity. The correction factor is obtained from epos and pythia 8. The average value of the correction factors from the two generators is applied to the data, while the relative difference between the two generators is taken as a systematic uncertainty. In addition, the event selection bias and the corresponding systematic uncertainty is estimated for each \(\eta \) bin as described in Sect. 6.2 by comparing the pseudorapidity distributions obtained with different methods. For the inclusive and NSD-enhanced samples, the systematic uncertainty in the event selection is found to be 3–5 and 4–6 %, respectively, while for the SD-enhanced sample it is 9–16 %.

Corrections of 4–6 % are applied to the final results to extrapolate to \(p_{\mathrm {T}}\)  = 0. The corrections are determined from the \(\mathrm {d}{}N/\mathrm {d}{}p_T\) spectrum of primary charged particles predicted by pythia 6 and pythia 8. The corrections obtained from the two MC generators differ by about 3 %, resulting in a systematic uncertainty in the \(\mathrm {d}{}N_\mathrm{{ch}}/\mathrm {d}{}\eta \) distributions of about 0.2 %. The same corrections are also estimated from Tsallis fits [50] to the \(p_{\mathrm {T}}\) distributions for each \(\eta \) bin, giving consistent results.

A summary of the systematic uncertainties is given in Table 1. The uncertainties associated with the tracking efficiency are treated as uncorrelated between the \(\eta \) bins. For the inclusive and the NSD-enhanced samples, the most significant systematic uncertainties are those due to the uncertainty in the tracking efficiency and the event selection. The model dependence of \(C_\mathrm{{sel}}\) and the uncertainty in the event selection are dominant for the SD-enhanced sample. The total uncertainty in the tracking efficiency is estimated to be 3.9 % from a comparison of two-body and four-body \(\mathrm {D^0}\) decays in data and simulated samples [51]. The uncertainties related to the primary vertex selection, the trigger efficiency and pileup events are found to be around 0.1 % and are neglected.

6.4 Measurement of \(\mathrm {d}{}N_\mathrm{{ch}}/\mathrm {d}{}\eta \) in the forward region

The pseudorapidity density in the forward region is measured for each T2 half-arm independently, thus providing a consistency check, as each half-arm differs in its alignment and track reconstruction efficiency. The number of primary-track candidates passing the \(z_\mathrm{{impact}}\) parameter selection criteria is calculated for each \(\eta \) bin as a function of the \(z_\mathrm{{impact}}\)-value with the double-Gaussian and exponential fits described in Sect. 5. The fraction of tracks associated to the double-Gaussian distribution core ranges from about 74 % (lower \(|\eta |\) bins) to about 92 % (higher \(|\eta |\) bins), and is used to weight each track by the probability for the track to be primary.

Each track is also weighted by the primary-track efficiency, which depends on \(\eta \) and on the average pad cluster multiplicity (APM) per plane in the corresponding half-arm. This efficiency, evaluated from MC generators, is defined as the probability to successfully reconstruct, with a \(z_\mathrm{{impact}}\) parameter within the allowed region, a generated primary particle that falls within the acceptance of the detector. As shown in Fig. 5 for one of the T2 half-arms, the tracking efficiency decreases with increasing APM, since the reconstruction of tracks with sufficient number of hits becomes more difficult with larger occupancy. The average primary track efficiency ranges from about 73 % (lower \(|\eta |\) bins) to about 87 % (higher \(|\eta |\) bins), as shown in Fig. 5 for one of the T2 half-arms. The APM probability is a rapidly decreasing distribution, with \(\text {average}\sim 24\) and \(\textsc {rms} \sim 29\), for the inclusive selection. The rate of multiple associations of reconstructed tracks to the primary particle is negligible (\(<\)0.4 %) once the \(z_\mathrm{{impact}}\) requirement is imposed.

Fig. 5
figure 5

Primary-track efficiency as a function of \(\eta \) and average pad cluster multiplicity (APM) in one T2 half-arm for the inclusive pp sample. The efficiency includes the track primary-candidate condition. Only particles with \(p_{\mathrm {T}} >40\) \({~\text {MeV/}c}\) are considered

Conversion of photons from \(\mathrm {\pi ^0}\) decays in the material between the IP and T2, as well as decay products of strange particles, also contribute to the double-Gaussian peak. The overall non-primary contribution, to be subtracted from the double-Gaussian peak, was estimated as a function of \(\eta \) with pythia 8, epos and sibyll 2.1 [52]. The first two generators bracket the data in the forward region, while sibyll is introduced to also enclose the measurements from the LHCf experiment of the photon \(\mathrm {d}{}N/\mathrm {d}{}E\) distribution [53]. The value of this correction is about 18 % and is obtained as the average of the three MC predictions.

The correction factors for the event selection bias (\(C_\mathrm{{sel}}(\eta )\)) are about 1.05, 1.06 and 1.00 for the inclusive, NSD-enhanced and SD-enhanced event samples, respectively.

Bin migration effects in \(\eta \) are corrected for with pythia 8, which gives the best description of the slope of the measured \(\mathrm {d}{}N_\mathrm{{ch}}/\mathrm {d}{}\eta \) distribution. The effects are typically at the few percent level.

Table 2 Systematic and statistical uncertainties of the \(\mathrm {d}{}N_\mathrm{{ch}}/\mathrm {d}{}\eta \) measurement with the T2 detector in the forward region. The first two contributions are half-arm dependent and partly \(\eta \)-uncorrelated, while the remaining, excluding the statistical one, are half-arm independent and correlated across bins in \(\eta \). The given ranges indicate the \(\eta \) dependence of the uncertainties

Events characterised by a high T2 hit multiplicity, typically due to showers generated by particles interacting with the material before T2, are not included in the analysis. These events, for which track reconstruction capability is limited, are characterised by an APM value larger than 60 and constitute 13.5, 16.5 and 6.3 % of the inclusive, NSD-enhanced and SD-enhanced samples, respectively. The effect of removing these events is evaluated in a MC study, which yields overall correction factors of about 1.05, 1.04 and 1.06 for the inclusive, NSD-enhanced and SD-enhanced samples, respectively. To verify the stability of this correction, the analysis is also repeated after excluding events with APM values larger than 45 and re-evaluating the corresponding MC corrections. The results of the two analyses agree within 1 %. In addition, this correction is also estimated by extrapolating the measured average multiplicity obtained as a function of the maximum APM included in the sample and without correcting for the missing fraction of the sample. The extrapolation, performed with a second degree polynomial, gives a correction that is within the MC uncertainty.

The fully corrected \(\mathrm {d}{}N_\mathrm{{ch}}/\mathrm {d}{}\eta \) distributions in each \(\eta \) bin are determined from:

$$\begin{aligned}&\frac{\mathrm {d}{}N_\mathrm{{ch}}}{\mathrm {d}{} \eta }(\eta _i) =\frac{2\pi }{\varDelta \phi }\nonumber \\&\quad \times \frac{C_\mathrm{{sel}}(\eta _i) \sum _\mathrm{{evt}} \omega _\mathrm{{evt}}(n_{\mathrm {T2}})\sum _{\text {trk} \in S_j}B_{ij}\,\omega _\mathrm{{trk}}(\text {APM},\eta _j,z_\mathrm{{impact}})}{\varDelta \eta \sum _{n_{\mathrm{T}2}}N_\mathrm{{evt}}(n_{\mathrm {T2}})\omega _\mathrm{{evt}} (n_{\mathrm {T2}})},\nonumber \\ \end{aligned}$$
(5)

where \(S_j\) is the sample of tracks with \(\eta _j-\varDelta \eta /2 < \eta _j < \eta _j + \varDelta \eta /2\) satisfying the selection criteria above, \(\varDelta \eta =0.05\) is the bin width, \(C_\mathrm{{sel}}\) is the correction factor related to the event selection defined in Sect. 6.2, \(B_{ij}\) is the bin migration correction associated with the \(j\)th bin in \(\eta \). In addition, \(\varDelta \phi /2\pi =192^{\circ }/360^{\circ }\) is the azimuthal acceptance of each T2 half-arm, \(\omega _\mathrm{{evt}}(n_{\mathrm {T2}})\equiv 1/\epsilon _\mathrm{{trig}}(n_{\mathrm {T2}})\) is the trigger efficiency correction, \(N_\mathrm{{evt}}(n_{\mathrm {T2}})\) is the number of selected events with track multiplicity \(n_{\mathrm {T2}}\), and \(\omega _\mathrm{{trk}}\) is defined as:

$$\begin{aligned} \omega _\mathrm{{trk}}(\text {APM},\eta ,z_\mathrm{{impact}})= \frac{P_\mathrm{{prim}}(\eta ,z_\mathrm{{impact}})\,S_\mathrm{{np}}(\eta ) \,C_\mathrm{{mult}}(\eta )}{\epsilon (\eta , {\mathrm {APM}})},\nonumber \\ \end{aligned}$$
(6)

where \(P_\mathrm{{prim}}\) is the probability for a track to be primary. Here, \(\epsilon \) is the primary-track efficiency, \(S_\mathrm{{np}}\) is the correction factor for the non-primary contribution to the double-Gaussian peak and \(C_\mathrm{{mult}}\) is the correction factor for the exclusion of events with APM values above 60.

Fig. 6
figure 6

Charged-particle pseudorapidity distributions from an inclusive sample (top left), a NSD-enhanced sample (top right), and a SD-enhanced sample (bottom). The error bars represent the statistical \(+\) uncorrelated systematics between neighbouring bins and the bands show the combined systematic and statistical uncertainties. The measurements are compared to results from pythia 6, tune Z2*, pythia 8, tune 4C, herwig++, tune UE-EE-3 with CTEQ6L1 PDFs, epos, tune LHC and qgsjetII-04

The \(\mathrm {d}{}N_\mathrm{{ch}}/\mathrm {d}{}\eta \) distribution thus obtained refers to charged particles with \(p_{\mathrm {T}} >40\) \({~\text {MeV/}c}\), corresponding to the nominal \(p_\mathrm{{T}}\) acceptance of T2. A MC-based estimation obtained with epos LHC and pythia 8 4C is used to correct the measurement down to \(p_{\mathrm {T}} =0\). This correction, taken from the average of the two MC predictions, is about 2 %.

The evaluation of the systematic uncertainties for the \(\mathrm {d}{}N_\mathrm{{ch}}/\mathrm {d}{}\eta \) distributions is performed similarly to that discussed in [9]. Details are given in the following only for the most significant contributions.

The systematic uncertainty in the \(P_\mathrm{{prim}}\) function, which is of order 4–5 %, is evaluated by including four effects: (a) the sensitivity to the misalignment corrections, quantified by varying the corrections within their uncertainties, (b) the sensitivity to the \(z_\mathrm{{impact}}\) parameter fitting range, which was changed by either one or two meters depending on the \(\eta \) bin, (c) the sensitivity to the background parametrisation, obtained by replacing the exponential function with a second degree polynomial and (d) the difference between the area estimated by the fitting function and the integral of the \(z_\mathrm{{impact}}\) distribution.

The systematic uncertainty in the primary-track efficiency is evaluated in studies where tracks are reconstructed with a set of five detector planes (out of the total of ten) in a single T2 half-arm. The track reconstruction efficiency is determined with the other set of detector planes in the same half-arm. The 5–6 % difference between the results obtained from simulation and from data is taken as an estimate of the systematic uncertainty.

The systematic uncertainty due to non-primary tracks included in the double-Gaussian once the exponential contribution has been removed, \(S_\mathrm{{np}}\), is evaluated by considering the range of the predictions of the epos, pythia 8 and sibyll MC generators, and is about 5 %.

The uncertainty in the correction for the exclusion of events with high secondary-particle multiplicity (\(C_\mathrm{{mult}}\)) is taken as the difference between the epos and pythia 8 predictions, which is about 3 %. The uncertainty on the correction for the event selection (\(C_\mathrm{{sel}}\)) is evaluated by taking into account both the dependence of the correction from the MCs mentioned above and the dependence of the \(\mathrm {d}{}N_\mathrm{{ch}}/\mathrm {d}{}\eta \) results on the different event selection strategies discussed earlier. This uncertainty is 13–15 % for the SD-enhanced sample and 2–3 % for the inclusive and NSD-enhanced samples.

The possible bias due to the material uncertainty and therefore on the production of secondary tracks is evaluated as a function of \(\eta \) from the MC vs. data discrepancy of the ratio between the number of tracks contained in the 96 % double-Gaussian area and all the tracks in the same range. The average discrepancy is in the range of 2–6 %. Simulation studies are also performed by varying the thickness of the material in front of T2 by 40 %. This part of the material is the main source of secondary tracks that contribute to the double-Gaussian. The effect of the change of the material results in a possible bias of less than 3 %.

Table 3 Event selection criteria applied at the stable-particle level in the MC simulation

Table 2 shows the uncertainties due to the corrections. To compute the total systematic uncertainty the errors are first separated into half-arm-correlated and uncorrelated parts and a weighted average between the four half-arms is taken. The \(\mathrm {d}{}N_\mathrm{{ch}}/\mathrm {d}{}\eta \) measurements obtained for the different T2 half-arms are found to be compatible.

The first two systematic uncertainties in Table 2 vary as a function of \(\eta \) and contribute to the uncorrelated bin-by-bin uncertainties. Conversely, the remaining systematic uncertainties affect all \(\eta \) bins in the same direction. The effect that systematic uncertainties might introduce in the difference of the \(\mathrm {d}{}N_\mathrm{{ch}}/\mathrm {d}{}\eta \) values at the beginning and at the end of the T2 acceptance is estimated to be at most 7 %.

The total uncertainty is obtained by adding in quadrature the \(\eta \)-correlated uncertainty and the \(\eta \)-uncorrelated one and the (negligible) statistical uncertainty.

7 Results

The combined CMS-TOTEM charged-particle pseudorapidity distributions are presented in Fig. 6 for the three event selection samples shown in Table 3. The results are derived in the central region by averaging the data points in the corresponding \(\pm \eta \) bins and in the forward region by averaging over the four T2 half-arms. The uncertainty band represents the total uncertainty, while the bars show the statistical and uncorrelated systematics between neighbouring bins.

In the central region, the pseudorapidity density at \(\eta =0\) is \(5.35 \pm 0.36\) for the inclusive sample, \(6.20 \pm 0.46\) for the NSD-enhanced sample and \(1.94^{\,+\,0.26}_{\,-\,0.23}\) for the SD-enhanced sample, with negligible statistical uncertainties. The predictions from various MC event generators differ from the data by up to 20 % for the inclusive and NSD-enhanced samples, with even larger discrepancies for the SD-enhanced sample. The data are well described by pythia 6 and qgsjetII-04 for the inclusive selection. For the NSD-enhanced sample, the predictions obtained from pythia 6 and qgsjetII-04 agree with the data for most \(\eta \) bins. A good description of the measurement for the SD-enhanced sample is provided by both epos and pythia 6.

The forward pseudorapidity density decreases with \(|\eta |\). In the inclusive sample, \(\mathrm {d}{}N_\mathrm{{ch}}/\mathrm {d}{}\eta \) is \(3.85 \pm 0.49\) at \(\eta =5.375\) and \(2.61 \pm 0.28\) at \(\eta =6.350\), with negligible statistical uncertainty. The pseudorapidity density of the NSD-enhanced sample ranges between \(4.80 \pm 0.62\) and \(3.17 \pm 0.35\), while for the SD-enhanced sample it is in the range of \(1.49 \pm 0.27\) to \(1.20 \pm 0.20\). The MC predictions for the three samples differ from the data by up to about \(\pm \)30 %. For the inclusive and NSD-enhanced samples, the data in the forward region are in agreement with the prediction from qgsjetII-04 and are between the epos and pythia 8 results. For the SD-enhanced selection, the TOTEM data points are close to the pythia 8 and herwig++ predictions, while qgsjetII-04 underestimates the data. The change in the slope of the MC curves close to \(\eta =5.3\), more visible for the NSD- and SD-enhanced distributions, is due to the event selection requirement of at least one charged particle in the pseudorapidity region of T2.

The centre-of-mass energy dependence of the pseudorapidity distribution at \(\eta \approx 0\) is shown in Fig. 7, which includes data from various other experiments for NSD events in pp and \(\mathrm{p}\bar{\mathrm{p}}\) collisions. Although the different experiments do not use identical event selection criteria, they all include a large fraction of NSD events. Particle production at \(\eta \approx 0\) is expected to follow a power-law dependence, \(\mathrm {d}{}N_\mathrm{{ch}}/\mathrm {d}{}\eta \left| _{\eta =0}\right. \propto s^\epsilon \), with \(\epsilon \) in the range 0.14–0.24 [40]. The result of fitting the high-energy pp and \(\mathrm {p}\mathrm {\overline{p}}\) central-pseudorapidity particle densities with this function is shown in Fig. 7. A value of \(\epsilon = 0.23 \pm 0.01\) is obtained.

Fig. 7
figure 7

Value of \(\mathrm {d}{}N_\mathrm{{ch}}/\mathrm {d}{}\eta \) at \(\eta \approx 0\) as a function of the centre-of-mass energy in pp and \(\mathrm {p}\) \(\mathrm {\overline{p}}\) collisions. Shown are measurements performed with different NSD event selections from UA1 [12], UA5 [14], CDF [10, 11], ALICE [6] and CMS [4]. The dashed line is a power-law fit to the data

8 Summary

Measurements of charged-particle densities over a large pseudorapidity range are presented for proton–proton (pp) collisions at a centre-of-mass energy of 8\(~\text {TeV}\). The data were collected concurrently with the CMS and TOTEM detectors during a dedicated run with low probability for overlapping pp interactions in the same bunch crossing and correspond to an integrated luminosity of \(\mathcal {L} = 45~\upmu {\mathrm {b}}^{-1} \). Pseudorapidity distributions of charged particles within \(|\eta | < 2.2\) and \(5.3 < |\eta | < 6.4\) have been measured for three event samples with different final state topologies: a sample of inclusive inelastic pp events, a sample dominated by non-single diffractive dissociation (NSD) events and a sample enriched in single diffractive dissociation (SD) events. The data are compared to theoretical predictions obtained from five different MC event generators and tunes (pythia 6 Z2*, pythia 8 4C, herwig++ UE-EE-3, epos LHC tune, and qgsjetII-04).

In the central region, the inclusive and NSD-enhanced samples are well described by pythia 6 and qgsjetII. For the SD-enhanced sample a good description of the data is provided by both pythia 6 and epos. In the forward region, the pseudorapidity distributions for the inclusive and NSD-enhanced samples are between the pythia 8 and epos predictions. The qgsjetII predictions are compatible with the data. The pseudorapidity distribution in the SD-enhanced sample, affected by a larger systematic uncertainty, is best described by pythia 8 and herwig++.

The charged-particle densities obtained in this paper span the largest pseudorapidity interval ever measured at the LHC and have the unique potential to probe the correlation between particle production in the central region and that in the forward region. With the tunes used, none of the MC event generators are able to consistently describe the data over the whole \(\eta \) region and for all event samples.