1 Introduction

Currently there are number of Monte Carlo (MC) generators which successfully describe many features of the inelastic pp collisions at the LHC [1,2,3,4,5]. Since all MC models assume some physics approximations, it is inevitable that they have a number of free parameters which must be fixed by experimental data during the procedure called tuning [6,7,8,9,10]. It often happens that the description of experimental data by different MC models is similar, despite the fact that the underlying dynamics in the models differs significantly – with hard collisions giving a major contribution in some models and significant soft contribution in the other. The aim of this paper is to look for the observables which would be especially sensitive to some of the important ingredients of the models. Specifically we will propose observables which are sensitive to two important characteristics of the models: taming of minijet production at small \(p_\mathrm{T}\) and the transverse distribution of partons in the colliding protons.

Obviously, the rate of parton-parton scattering has to be tamed at small momentum transfer to avoid an unphysical singular behaviour. The divergence is usually regulated by including a suppression factor, that is quite different in different models. Also, in most of the models the suppression for fixed \(p_\mathrm{T}\) becomes stronger with increase of collision energy. The relevant details of models used in this study are described in Sect. 4. The number of parton interactions depends not only on the suppression factor, but also on the set of parton distribution functions (PDF) and the overlap of the matter distribution of colliding protons. It often happens that for some observables models with very different PDF and model parameters are quite close to the data (and to each other). For instance, Fig. 1 shows \(p_\mathrm{T}\) distributions of gluons coming from primary and MPI interactions for successful tunes of pythia and herwig. One can see that they are very different in the low \(p_\mathrm{T}\) region, which eventually produces most of the final-state particles in the collision. Nevertheless, the models describe underlying event (UE) [11,12,13] and Minimum Bias [9] observables satisfactory, as the difference in other mechanisms compensates this discrepancy. This motivates us to propose observables which are sensitive to the underlying dynamics of minijet production and, thus, allows to discriminate models and learn more about underlying dynamics of pp interactions. The correlation between mechanisms and their impact on minijet production will be discussed in Sect.  4.

Fig. 1
figure 1

\(p_\mathrm{T}\) distribution of gluons coming from primary and MPI interactions. The pseudorapidity range is \(|\eta |<\) 5. We show two different settings (tunes) for the Pythia 8 [7, 9] and Herwig++ 2.7 [10] MC generators

As mentioned above, the transverse distribution of partons in nucleons is another fitted parameter which is relevant for minijet production. Basically, the rate of the double parton interactions (DPS) is inversely proportional to the transverse area occupied by partons. This parameter of the models can be conveniently coded via so-called sigma effective, \(\sigma _{\mathrm {eff}}\), defined through: \(\sigma _{\mathrm {ij}}=\sigma _i \sigma _j/ \sigma _{\mathrm {eff}}\), where \(\sigma _i\), \(\sigma _j\) and \(\sigma _{\mathrm {ij}}\), are cross sections for single- and double-parton scatters of types i and j. Practically in all models it is assumed that transverse distribution of partons does not depend on x of the parton.Footnote 1 In the approximation where the correlations between partons are neglected, the inclusive cross section of N binary collisions is \(\propto \sigma _{\mathrm {eff}}^{1-N}\). Hence the sensitivity to this parameter should grow with the hadron multiplicity (usually characterized by charged-particle multiplicity in the experimental measurements).

It is interesting to note, that the transverse area in which partons are localized, as determined by the fits to data, are at least a factor of two smaller than indicated by the HERA data on hard exclusive processes. This suggest that one may need to include pQCD effects which lead to decrease of \(\sigma _{\mathrm {eff}}\) with increase of the virtuality of the collision, see a review in [15]. This pattern was implemented for example in [16, 17].

To extend studies of the low-\(p_\mathrm{T}\) suppression mechanism we propose observables which minimize soft physics effects and still preserve sensitivity to the presence of the semi-hard collisions. We use here an observation that parton showers lead to a short-range correlation in rapidity, while a correlation of binary semi-hard collision extends to noticeably larger rapidity intervals. So we suggest to measure how the transverse momenta of hadrons produced in association with a trigger object are balanced as a function of rapidity. The exact definitions of the proposed observable is given in Sect. 2. One of the advantages of such observable is that the contribution of the events where the trigger and the balancing particles belong to different parton-parton interactions should cancel, as long as the parton-parton interactions are independent. This is in difference from the observables maximizing effects of MPI such as correlation of multiplicities at different rapidity intervals first considered by UA5 collaboration, see a review in [18]. These data were one of the first indications of the role of MPI in hadron-hadron collisions at collider energies and the enhancement of MPI in the high multiplicity events.

Our numerical studies described below demonstrate sensitivity of the proposed variable to the assumed dynamics. A study of the same observable as a function of the multiplicity of final-state particles (which in the discussed models originate from fluctuation of the number of hard collisions or a combination of the soft and hard collisions) provides an additional discriminating tool which is a natural combination of the UA5-like and the inclusive transverse momentum balance observables. For high multiplicities the discussed observable is sensitive to effects such as screening or a formation of quark gluon plasma in collisions of protons. For these reasons we shall also study the observable as a function of the charged particle multiplicity. Finally, we will investigate the impact of the so-called color-reconnection (CR) mechanism which is in continuous development by many Monte Carlo authors [19,20,21,22,23,24,25,26,27]. The most of mechanisms discussed above are assumed to be dependent on the collision energy, therefore we shall perform our tests at two center-of-mass (CM) collision energies, \(\sqrt{s}=\) 7 TeV and \(\sqrt{s}=\) 13 TeV.

The paper is organized as follows. In Sect. 2 we define the observables in a more formal way, while the justification of kinematic cuts is given in Sect. 3. In Sect. 4 a summary of the discussed models is presented. The results of calculations using these models are presented in Sect. 5. Our conclusions are presented in Sect. 6.

Fig. 2
figure 2

a Single collision on the \(y-z\) plane with partons produced due to hard process and initial and final state radiation. The thick lines represent the incoming partons while the red arrow represents the selected trigger parton. The total transverse momentum of all partons sums up to zero. b An event with two hard collisions. For each hard collision the momentum is conserved independently, if no correlations are present

2 Observables

As mentioned, we will be interested in a mechanisms of particle production in hadron-hadron collisions, in particular in finding experimental observables that are sensitive to a particular models. As is known, the particle production is driven by the minijets, i.e. semi-hard partons (quark and gluons) produced in a collision of incoming partons (one or many), or in a bremsstrahlung process.

Partons produced in different mechanisms are, in general, correlated in a different way. For example, if we concentrate on rapidity of produced partons, we may expect that bremsstrahlung partons will have short-range correlations, while the partons produced in a hard collision will have a long range tails.

One way to study the correlations is to investigate how the transverse momentum is balanced as a function of rapidity. The practical observable may be constructed as follows (see Fig. 2). For a given event with n final state particles, we pick up a particle k within a fixed rapidity interval and a certain (small) \(p_{T}\). Let us call this a trigger particle. Then, we define the total transverse momentum of the all remaining final state particles along the trigger particle, contained in a rapidity bin \(\Delta \eta \):

$$\begin{aligned} p_{T}^{\mathrm {rec}\,\left( k\right) }\left( \eta \right)= & {} \sum _{i=1,\ldots n,\, i\ne k} |\vec {p}_{Ti}|\cos \phi _i\,\, \Theta \left( \left( \eta -\frac{\Delta \eta }{2}\right) \right. \nonumber \\< & {} \left. \eta _{i}<\left( \eta +\frac{\Delta \eta }{2}\right) \right) , \end{aligned}$$
(1)

where \(\Theta \) is the step function and \(\phi _i\) is the azimuthal angle of the ith particle, in the coordinate system where the y axis is defined by the trigger particle k; in that system we simply add up the y components of the recoil particles. \(p_{T}^{\mathrm {rec}\,\left( k\right) }\left( \eta \right) \) can be calculated on the event-by-event basis so that we can define the average \(\left\langle p_{T}^{\mathrm {rec}}\right\rangle \left( \eta \right) \) as

$$\begin{aligned} \left\langle p_{T}^{\mathrm {rec}}\right\rangle \left( \eta \right) =\frac{\sum _{k=1}^{N}p_{T}^{\mathrm {rec}\,\left( k\right) }\left( \eta \right) }{N}\,, \end{aligned}$$
(2)

where N is the total number of events with the required trigger particle present. We can also define similar quantity for the trigger particle, \(\left\langle p_{T}^{\mathrm {trig}}\right\rangle \left( \eta \right) \), by simply counting only the trigger particles. The total momentum conservation requirement gives, obviously,

$$\begin{aligned} \int d\eta \,\left\langle p_{T}^{\mathrm {rec}}\right\rangle \left( \eta \right) =\int d\eta \,\left\langle p_{T}^{\mathrm {trig}}\right\rangle \left( \eta \right) \,. \end{aligned}$$
(3)
Fig. 3
figure 3

Gluon \(p_\mathrm{T}\) distribution for various final-state triggers as obtained using Pythia 8 CUETP8M1 model

3 Choice of kinematic cuts

This section justifies the choice of final state objects used to study the mechanisms of the minijet production. We are mostly guided by a performance of the LHC general-purpose detectors, ATLAS and CMS. Therefore, the usage of charged particles is the only option to study minijet production with upper \(p_\mathrm{T}\) limit of a few GeV. The tracking system of the experiments allows to reliably reconstruct charged particles with \(\eta<\) 2.5 (2.4) for ATLAS (CMS) starting from \(p_\mathrm{T} \approx \) 250 MeV. Therefore, we chose \(2.0<\eta <2.4\) for a trigger object in order to maximize the possible \(\eta \) distance for recoil particles.

There are two options for choosing the trigger object: (i) a single charged particle, or (ii) a charged-particle jet. Both approaches have their advantages. The single charged particle is a very simple and stable trigger, which is, in the contrast to the jet trigger, not contaminated by an additional activity from the UE. The second option is expected to be better connected to the initial parton (mainly a gluon). This is illustrated in Fig. 3 where we investigate (with the help of Pythia) to what \(p_\mathrm{T}\) of initial gluon the final state trigger corresponds to. These distributions are plotted under the assumption that the initial gluon, originating in the primary scattering or in MPI, can be matched with the final state trigger by a requirement of the maximum distance \(R = \sqrt{(\phi _\mathrm{p}- \phi _\mathrm{t})^2+(\eta _\mathrm{p}- \eta _\mathrm{t})^2}\). Here \(\phi _\mathrm{p}\) (\(\eta _\mathrm{p}\)) and \(\phi _\mathrm{t}\) (\(\eta _\mathrm{t}\)) are azimuthal angles (pseudorapidities) of the initial gluon and the final state trigger object, respectively. We found that in Pythia  8 model there is a strong spatial correlation between the trigger objects and the parent gluons for the \(p_\mathrm{T}\) range of interest. For \(R<\) 0.25 it is possible to match 80% of them, thus that value is used to obtain the distributions shown in Fig. 3. The \(p_\mathrm{T}\) windows of the trigger are chosen to be sensitive to the suppression of the minijet production. One can see that the distributions are expectedly narrower for charged-particle jets than for single charged particle, even if they correspond to the same gluon \(\langle p_\mathrm{T} \rangle \). The distribution for single-particle trigger has long tail that is quite noticeable for \(p_\mathrm{T} > \) 10 GeV. The main disadvantage of using the charged-particle jet is a contamination by UE. In order to reduce the UE contamination which grows with the jet area as \(R^2\) [28] we use small distance parameter of \(R=0.4\) in the anti-\(k_\mathrm{T}\) jet clustering algorithm [29]. In this case the UE contribution to the jet is \(\sim \) 0.5 GeV on average.

4 Monte Carlo models

The general purpose Monte Carlo event generators used in our study have been reviewed several times, see for example [30, 31]. Our intention here is not to review them again, but just to provide enough background to set our discussion of the modelling of minijets.

Before we discuss the event generators, let us however start by recalling briefly of the perturbative QCD mechanism of particle production based on the collinear factorization. In fact, it constitutes the skeleton for all MC event generators. We shall also discuss the modification one has to make in the collinear formula to be able to incorporate it into event generators.

4.1 Minijets in perturbative QCD

As mentioned, the particle production mechanism is driven by \(2\rightarrow 2\) perturbative parton production. In the leading order (LO) the cross section for a production of two jets reads (only \(gg\rightarrow gg\) channel is included here for simplicity):

$$\begin{aligned}&\frac{d\sigma _{2\mathrm {jet}}}{dp_{T}^{2}dz_{1}dz_{2}}=\frac{1}{16\pi }\,\frac{1}{p_{T}^{4}}\,\frac{z_{1}\, z_{2}}{(z_{1}+z_{2})^{4}}\,\nonumber \\&f_{g/H}\left( z_{1}+z_{2},\mu ^{2}\right) f_{g/H} \left( \frac{p_{T}^{2}}{s}\,\frac{z_{1}+z_{2}}{z_{1}z_{2}},\mu ^{2}\right) \nonumber \\&\quad \times \frac{1}{2}\left| \overline{\mathcal {M}}\right| _{gg\rightarrow gg}^{2}\left( z_{1},z_{2}\right) , \end{aligned}$$
(4)

where

$$\begin{aligned} \left| \overline{\mathcal {M}}\right| _{gg\rightarrow gg}^{2}\left( z_{1},z_{2}\right) =g^{4}\,\frac{9}{2}\,\frac{\left( z_{1}^{2}+z_{1}z_{2}+z_{2}^{2}\right) ^{3}}{z_{1}^{2}z_{2}^{2}\left( z_{1}+z_{2}\right) ^{2}}, \end{aligned}$$
(5)

is the LO matrix element squared and

$$\begin{aligned} z_{1,2}=\frac{\left| \vec {p}_{T\,1,2}\right| }{\sqrt{s}}\, e^{\, y_{1,2}}. \end{aligned}$$
(6)

Above, \(f_{g/H}\) are the gluon distributions in a hadron, \(\mu ^{2}\) is the hard scale \(\sim p_\mathrm{T}^2\) , s is the square of the CM energy, \(\vec {p}_{T\, 1,2}\) are the transverse momenta of the outgoing partons while \(y_{1,2}\) are their rapidities. Due to the momentum conservation we have at LO \(\left| \vec {p}_{T1}\right| =\left| \vec {p}_{T2}\right| \equiv p_{T}\).

There are two related aspects of this mechanism which are relevant at small transverse momenta [32]. First, the dijet cross section is divergent for jet \(p_{T}\rightarrow 0\):

$$\begin{aligned} \frac{d\sigma _{2\mathrm {jet}}}{dp_{T}^{2}}\sim \frac{\alpha _{s}^{2}\left( p_{T}^{2}\right) }{p_{T}^{4}}. \end{aligned}$$
(7)

It is however expected that the growth of the spectrum is tamed by some mechanism already in the perturbative domain for \(p_{T}\sim \) 2 to 3 GeV. In phenomenological model of [32] the suppression factor was introduced as follows:

$$\begin{aligned} \frac{d\sigma '_{2\mathrm {jet}}}{dp_{T}^{2}}=\frac{d\sigma _{2\mathrm {jet}}}{dp_{T}^{2}}\,\frac{p_{T}^{4}}{\left( p_{T}^{2}+p_{T0}^{2}\left( s\right) \right) ^{2}}\,\frac{\alpha _{s}^{2}\left( p_{T}^{2}+p_{T0}^{2}\left( s\right) \right) }{\alpha _{s}^{2}\left( p_{T}^{2}\right) },\nonumber \\ \end{aligned}$$
(8)

where \(p_{T0}\left( s\right) \) is a cutoff parameter which depends on the total CM energy of the collision s

$$\begin{aligned} p_{T0}(s) = p_{T0}^{\mathrm {ref}} \left( \frac{s}{s_0}\right) ^{\lambda }, \end{aligned}$$
(9)

where \(p_{T0}^{\mathrm {ref}}\), \(s_0\) and \(\lambda \) are parameters to be determined from the data.

Second, even with the cutoff, the inclusive dijet cross section can exceed the total inelastic cross section, implying presence of events with multiple hard parton-parton collisions. The average number of the parton collisions is defined to be \(\left\langle n\right\rangle \sim \sigma _{2\mathrm {jet}}/\sigma _{\mathrm {ND}}\), where \(\sigma _{\mathrm {ND}}\) is the nondiffractive total cross section, since the production of jets in diffraction is strongly suppressed.

In the above simple model we can simply obtain the expressions for our main observable, i.e. the average transverse momenta of the trigger and the recoil system: \(\left\langle p_{T}^{\mathrm {trig}}\right\rangle \left( y\right) \) and \(\left\langle p_{T}^{\mathrm {rec}}\right\rangle \left( y\right) \), respectively. Using the kinematic cuts discussed in Sect. 3, we show the sample result in Fig. 4. In this case, the trigger has \(2.0<p_T<3.0\,\mathrm {GeV}\) and rapidity \(2.0<y<2.4\). The result shows a typical pattern of the rapidity correlations encoded in the hard matrix element and will be a useful reference point when discussing realistic mechanisms.

Fig. 4
figure 4

Left: rapidity correlations \(\left\langle p_{T}\right\rangle _{\mathrm {trig}}\) and \(\left\langle p_{T}\right\rangle _{\mathrm {rec}}\) from the basic QCD perturbative model with the \(p_T\) cutoff. The trigger has \(2.0<p_T<3.0\,\mathrm {GeV}\) and rapidity \(2.0<y<2.4\). Right: zoom of the recoil system curves

4.2 Pythia model

A detailed description of the model is contained in the series of papers [32,33,34,35]. Here we give only a very brief summary.

The essential point is that the hard collisions are not completely independent. This is true in several respects. First, the hard collisions contributing to a single event are ordered according to a scale \(\sim \) \(p_{T}\) and interleaved with the shower mechanisms. Second, the correlations are introduced by the proper treatment of the beam remnants. That is, the removal of a parton from the beam affects the remaining multi-parton distribution function in the longitudinal and flavour space. Finally, there are correlations in the transverse momentum space introduced by the so-called primordial \(k_{T}\). Also, the colour reconnection introduces correlations.

The event generation goes as follows. After generating a hard interaction with certain \(p_{T\,\mathrm {max}}\), the following step, i.e. an emission with transverse momentum \(p_{T}<p_{T\,\mathrm {max}}\), is described by the probability distribution

$$\begin{aligned} \frac{dP}{dp_{T}}= & {} \left( \frac{dP_{\,\mathrm {MPI}}}{dp_{T}}+\sum \frac{dP_{\,\mathrm {IS}}}{dp_{T}}+\sum \frac{dP_{\,\mathrm {FS}}}{dp_{T}}\right) \nonumber \\&\times \exp \left\{ -\int _{p_{T}}^{p_{T\,\mathrm {max}}}dp'_{T}\left( \frac{dP_{\,\mathrm {MPI}}}{dp'_{T}}\right. \right. \nonumber \\&\left. \left. +\sum \frac{dP_{\,\mathrm {IS}}}{dp'_{T}}+\sum \frac{dP_{\,\mathrm {FS}}}{dp'_{T}}\right) \right\} , \end{aligned}$$
(10)

where the subsequent probabilities in brackets correspond, respectively, to the probability distribution of another hard collision, the emission from the initial state, and the final state emission. The exponential ‘Sudakov form factor’ originates from the requirement that no emission took place between \(p_{T}\) and \(p_{T\,\mathrm {max}}\). The initial and final state showers are based on the DGLAP evolution and we do not discuss them here. The MPI probability distribution is impact parameter dependent. For the hardest event it reads

$$\begin{aligned} \frac{dP_{\mathrm {MPI}}}{dp_{T}d^{2}b}= & {} \frac{\mathcal {O}\left( b\right) }{\left\langle \mathcal {O}\right\rangle }\,\frac{1}{\sigma _{\mathrm {nd}}}\,\frac{d\sigma }{dp_{T}}\,\nonumber \\&\times \exp \left\{ -\int _{p_{T}}^{p_{T\,\mathrm {max}}}dp'_{T}\,\frac{\mathcal {O}\left( b\right) }{\left\langle \mathcal {O}\right\rangle }\,\frac{1}{\sigma _{\mathrm {nd}}}\,\frac{d\sigma }{dp'_{T}}\right\} \,, \end{aligned}$$
(11)

where the cross section \(d\sigma /dp_{T}\) is given by the basic minijet model, Eq. (8). The matter overlap function \(\mathcal {O}\left( b\right) \) is

$$\begin{aligned} \mathcal {O}\left( b\right) \propto \int dt\,\int d^{3}r\,\rho \left( x,y,z\right) \rho \left( x+b,y,z+t\right) , \end{aligned}$$
(12)

were \(\rho \) is the matter distribution in a single hadron. In the recent Pythia version the default setting is that the overlap function \(\mathcal {O}\left( b\right) \) is of the form \(\exp \left( -b^{\mathrm {Pow}}\right) \), where \(\mathrm {Pow}=1.85\). By default, there is no x dependence in the matter distribution. However, it is possible to choose a non-standard setting where the width of the Gaussian matter distribution depends on x as \(1 + a_1 \log (1 / x)\). The average \(\left\langle \mathcal {O}\right\rangle \) is defined in a special way taking into account that every event has to have at least one collision, see the original papers for details.

The actual number n of binary collisions is determined by the truncation of the iterative procedure when no further emissions can be resolved from Eq. (10).

The effect of the interleaved evolution (10) is most important for the initial state shower and MPI. This is because they compete for the beam energy. The actual correlations are incorporated by modifying the beam remnant in regards to the remaining longitudinal momentum and flavour.

Fig. 5
figure 5

Left panel: the weighted \(\chi ^2/N\!.d.f.\) as a function of \(\mu ^2\) and \(p_{\perp ,0}^{\text {min}} \) from [10]. The best fit point is shown with a white dot (tune Var2) and a black dot (tune Var1) represents a good fit with higher \(p_{\perp ,0}^{\text {min}} \) which is preferred by the CDF \(\sigma _{\mathrm {eff}}\) data [39]. Right panel: an extension of \(p_{\perp }\) into non-perturbative regime with “Gaussian” transverse momentum distribution for two parameters sets denoted by circles on the plot in the left panel

We have studied the observables proposed in Sect. 2 in the context of the dependence on various parameters, notably on the \(p_\mathrm{{T}0}\). These exercises aim mostly at understanding some important aspects of the event generator, and are not meant to be compared with any discriminating data. Therefore, we include these studies in Appendix A as a useful reference to various correlation effects. In fact changing only for example \(p_\mathrm{{T}0}\) modifies the tune and the description of certain data will be spoiled, therefore our main results are provided for unchanged tunes of Pythia 8 that are meant to describe UE, minimum bias (MB), and DPS data. Namely, we choose tunes constructed with leading and next-to-next-to-leading order (NNLO) PDF sets: CUETP8M1 [7], and very recent CMS tunes: CP2, CP4, CP5 [9]. Selected tunes describe MB and UE data at a similar level [7, 9], however they have a significantly different values of the parameters. The first and second tunes are based on LO PDF sets, the third and forth were NNLO ones. The difference between PDF sets is reflected in the \(p_\mathrm{T}\) distributions of gluons coming from primary and MPI interactions (see curves for CUETP8M1 and CP5 tunes in Fig. 1). The other key feature is a choice of the impact-parameter distribution. For CUETP8M1 tune an exponential overlap function is used, while for the new tunes (CP2, CP4, CP5) a double-Gaussian matter distribution function [see Eq. (12)]. For given choice of PDF sets and impact-parameter profiles a number of parameters is tuned, including parameters of the b-profile, smooth cutoff parameter \(p_\mathrm{T0}\), colour reconnection ones, and some others. It is worth noting that the tunes differ in simulation of the initial state radiation (ISR). Pythia 8 CUETP8M1 and CP5 have rapidity ordering for ISR, while it is switched off for CP2 and CP4 tunes. It is shown in Sect. 5 that the mechanism has an impact on the studied rapidity correlation.

4.3 Herwig model

The MPI model used in Herwig has been reviewed several times [1, 36,37,38]. Here we aim just to describe briefly the most important building blocks and the parameters of the model.

The model is formulated in the impact parameter space. At a fixed impact parameter, multiple parton scatterings are assumed to be independent, however, later they are correlated, for example, by imposing energy-momentum conservation or through the colour reconnection mechanism. There are two types of parton-parton scatterings in the model, soft and semi-hard, the both are separated by a transverse momentum scale \(p_{\perp }^\mathrm{min}\), which is one of the main tuning parameters in the model. The value of \(p_{\perp }^\mathrm{min}\) is allowed to vary with energy and the evolution is govern by a power law, see Eq. (9). In fact, it is \(p_{\perp ,0}^{\text {min}} =p_{\perp }^\mathrm{min} (7~\text {TeV})\) and power \(\lambda \) that is fitted to data. Below \(p_{\perp }^\mathrm{min}\), scatters are assumed to be non-perturbative, with valence-like longitudinal momentum distribution and “Gaussian” transverse momentum distribution, see right panel of Fig. 5 for two examples how the extrapolation to non-perturbative region can be realized in the model. Above \(p_{\perp }^\mathrm{min}\), scatters are assumed to be perturbative, and take place according to leading order QCD matrix elements convoluted with inclusive PDFs and an overlap function A(b):

$$\begin{aligned} A(b) = \int \mathrm {d}^2b_1 \, G(b_1) \int \mathrm {d}^2b_2 \, G(b_2) \, \delta ^2(\mathbf {b}-\mathbf {b}_1+\mathbf {b}_2) \, ,\nonumber \\ \end{aligned}$$
(13)

where

$$\begin{aligned} G(b) = \frac{\mu ^2}{4\pi }(\mu b)K_1(\mu b) \end{aligned}$$
(14)

is Fourier transform of dipole form factor \(\frac{1}{(1-t/\mu ^2)^2}\) which leads to overlap function

$$\begin{aligned} A(b) = \frac{\mu ^2}{96\pi }(\mu b)^3K_3(\mu b), \end{aligned}$$
(15)

both G(b), and A(b) are normalised to unity. \(K_i(x)\) is the modified Bessel function of the i-th kind and \(\mu \) is the another important parameter of the model which governs the transverse distribution of partons in the proton and can be interpreted as an effective inverse proton radius. A lower values of \(\mu \) lead to the broader matter distribution, therefore higher probability of peripheral collisions. Soft scatters might see a different matter distribution, therefore the model allows them to have a different inverse radius \(\mu _\mathrm{{soft}}\) but it keeps the functional form of the overlap function from Eq. 15. The two soft MPI parameters \(\mu _\mathrm{{soft}}\) and \(\sigma _\mathrm{{soft}}\), the non-perturbative cross section below \(p_{\perp }^\mathrm{min}\), are fixed by the inelastic hadron-hadron cross section and the b-inelastic slope parameter, therefore they are not free parameters of the model.

Fig. 6
figure 6

Rapidity correlation of recoiled system with respect to single-charged particle with \(1.5 \le p_\mathrm{T}< 2.0\) GeV (a) and with respect to charged-particle jet with \(3.0 \le p_\mathrm{T}< 3.5\) GeV (b) at \(\sqrt{s}=13\) TeV

The probability distribution of number of scatters is Poissonian at a given impact parameter, but the distribution over impact parameter is a considerably broader than Poissonian. The number of soft and hard scatters is chosen according to this distribution and generated according to their respective distributions. Each hard scatter is evolved back to the incoming hadron according to the standard parton shower algorithm, therefore the evolution is not interleaved like in Pythia model and additional scatterings are not ordered in any kinematic variable. Energy-momentum conservation is imposed by rejecting any scatters that take the total energy extracted from the hadron above its total energy. As mentioned before the individual scatters might be colour correlated using a colour reconnection model, described in detail in Ref. [25], in that model a reconnection probability \(p_\mathrm{reco}\) is applied. To summarize there are four main parameters of the model: \(p_{\perp ,0}^{\text {min}}\) \(\Lambda \), \(\mu ^2\) and \(p_\mathrm{reco}\), which are fitted to the experimental data. Unlike Pythia, Herwig does not have a large family of tunes, usually no more than one tune is released with a new version of the program. Therefore, in order to study in a meaningful way effect of the parameters variation in Herwig we decided to use the two tunes prepared for the same version of the program [10]. The both tunes, which we label by Var1 (the default tune of Herwig ++ 2.7) and Var2, provide good description of the UE data over the collision energy range from 300 GeV to 7 TeV. This is visualized in the left panel of Fig. 5 (see [10] for the details), where we show the \(\chi ^2/N\!.d.f.\) value of the fit as a function of \(p_{\perp ,0}^{\text {min}} \) and \(\mu ^2\), the both tunes are marked by black (Var1) and white (Var2) dots. A visible strong correlation between \(p_{\perp ,0}^{\text {min}} \) and \(\mu ^2\) (a long thin blue valley in Fig. 5) reflects the fact that a smaller hadron radius means more likely central collisions and as a consequence more multiple scattering, which can be compensated to give a similar amount of underlying-event activity by having fewer perturbative MPIs, i.e. a larger value of \(p_{\perp }^\mathrm{min} \). The best fit value is for \(p_{\perp ,0}^{\text {min}} ={2.80}\,\mathrm {GeV}\) and \(\mu ^2={1.65}\,\mathrm {GeV}^2\) (tune Var2), but one can obtain good fits for higher value of \(p_{\perp ,0}^{\text {min}} ={3.91}\,\mathrm {GeV}\), together with \(\mu ^2={2.3}\,\mathrm {GeV}^2\) (tune Var1). As one can see from Fig. 5 (right-panel) the \(p_{\perp }\) spectra looks significantly different for these two tunes,Footnote 2 therefore they are well suited for our studies. It is worth to mention the default Herwig ++ 2.7 tune gives the value of \(\sigma _{\mathrm {eff}}=14.8\) mb which is close to \(\sigma _{\mathrm {eff}}\) obtained from the combination of the two most precise experimental results for this observable from CDF and D0 measurements \(\sigma _{\mathrm {eff}} = (13.9\pm 1.5)~{\hbox { mb}}\).Footnote 3

Finally, we will also show results of Herwig 7 which has new model for soft interactions including diffractive final states and multiple particle production in multiperipheral kinematics, see [41] for the details.

4.4 Sherpa model

Minimum bias events in Sherpa [5] are simulated using the Shrimps package [42, 43] which is based on Khoze–Martin–Ryskin (KMR) model [44]. The KMR model is a multi-channel eikonal model in which the incoming hadrons are described as a superposition of Good-Walker states, which are diffractive eigenstates that diagonalize the T-matrix. Each combination of colliding Good–Walker states gives rise to a single-channel eikonal. The final eikonal is the superposition of the single-channel eikonals. The number of Good–Walker states is two in Shrimps (the original KMR model includes three states). Each single-channel eikonal can be seen as the product of two parton densities, one from each of the colliding Good–Walker states. The evolution of the parton densities in rapidity due to extra emissions and absorption on either of the two hadrons is described by a set of coupled differential equations. The parameter \(\Delta \), which can be interpreted as the Pomeron intercept, is the probability for emitting an extra parton per unit of rapidity. The strength of absorptive corrections is quantified by the parameter \(\Lambda \), which can be related to the triple-Pomeron coupling. A small region of size \(\Delta Y\) around the beams is excluded from the evolution due to the finite longitudinal size of the parton densities. The boundary conditions for the parton densities are form factors, which have a dipole form characterized by the parameters \(\Lambda ^{2}\), \(\beta _0^2\), \(\kappa \) and \(\xi \). In this framework the eikonals and the cross sections for the different modes (elastic, inelastic, single- and double-diffractive) are calculated.

Inelastic events are generated by explicitly simulating the exchange and re-scattering of gluon ladders. The number of primary ladders is given by a Poisson distribution whose parameter is the single-channel eikonal. The decomposition of the incoming hadrons into partons proceeds via suitably infrared continued PDFs.

The emissions from the ladders are then generated in a Markov chain. The pseudo-Sudakov form factor contains several factors: an ordinary gluon emission term, a factor accounting for the Reggeisation of the gluons and a recombination weight taking absorptive corrections into account. The emission term has the perturbative form \(\alpha _s(k_T^2)/k_T^2\), that, as we have already seen in Pythia and Herwig models needs to be continued into the infrared region. In Sherpa in the case of \(\alpha _s\) the transition into the infrared region happens at \(Q_\mathrm{as}^2\) while in the case of \(1/k_T^2\) the transition scale is generated dynamically and depends on the parton densities and is scaled by \(Q_0^2\).

The propagators of the filled ladder can be either in a colour singlet or octet state, the probabilities are again given through the parton densities. The probability for a singlet can also be regulated by hand through the parameter \(\chi _S\). A singlet propagator is the result of an implicit re-scattering.

After all emissions have been generated and the colours assigned, further radiation is generated by the parton shower. The strength of radiation from the parton shower can be regulated with \(K_\mathrm{T}^{2}\_Factor\), which multiplies the shower starting scale. After parton showering partons emitted from the ladder or the parton shower are subject to explicit re-scattering, i.e. they can exchange secondary ladders. The probability for the exchange of a re-scattering ladder is characterised by RescProb. The probability for re-scattering over a singlet propagator receives an extra factor RescProb1. After all ladder exchanges and re-scatterings, the colour can be rearranged in the event in a similar fashion to the colour reconnection models in Pythia and Herwig . Finally, the event is hadronized using the standard Sherpa cluster hadronization. In our studies we used the default settings of Multiple Interaction Models in Sherpa 2.2.2, which is the only existing tune of the Shrimps model, for completeness we list its parameters in the Appendix C.

5 Results

In this section we present the calculation of the observables defined in Sect. 2 using a few recent versions and tunes of Pythia, Herwig, and Sherpa. Since some model parameters depend on \(\sqrt{s}\), including the suppression of jet cross-section at very low \(p_\mathrm{T}\), the results are presented for the CM energy 7 and 13 TeV. For the analysis we use non-diffractive inelastic events. The two approaches to the trigger object are studied, as discussed in Sect. 3, first using a single charged particle as a trigger, second using a charged-particle jet as a trigger. The former is more model dependent, but is less affected by the UE contribution, while the latter provides a better connection to the parent parton.

Figure 6 shows \(p_\mathrm{T}^\mathrm{{rec}}\) as a function of pseudorapidity at \(\sqrt{s}=13\) TeV for the two trigger approaches. In the trigger region (near \(2.0< \eta <2.4\)), in the case of charged particle trigger (left panel), we see a peak that is mostly caused by particles strongly correlated to a triggered particle (which would be clustered to the same jet). In the case of the jet trigger technique, the particles clustered into the trigger jet are excluded from the calculation of \(p_\mathrm{T}^\mathrm{rec}\), see Eq. (1), therefore we see a dale. In the region distant from the trigger object (\(\eta<\) 0), one can see that the difference between the trigger approaches is rather modest in the Pythia model. This is expected, since the \(p_\mathrm{T}\) ranges for the two trigger objects correspond, on average, to the same \(p_\mathrm{T}\) of the parent gluon (see Fig. 1). The difference between the trigger techniques is stronger for the other models, i.e. Herwig and Sherpa. This is probably due to different fragmentation and hadronization models. A peculiar feature of Herwig 7 is that it generates a recoil peak that lies at opposite \(\eta \) region with respect to the trigger object, this seems to be a feature of new Soft MPI model [41], which we plan to investigate more in the future. Finally, it is worth noticing that Sherpa produces the least amount of long-range correlations.

Fig. 7
figure 7

Comparison of rapidity correlation of recoiled system with respect to charged-particle jet with \(3 \le p_\mathrm{T}< 3.5\) GeV at \(\sqrt{s}=~7\) TeV and \(\sqrt{s}=13\) TeV for different models

Fig. 8
figure 8

Comparison of rapidity correlation of recoiled system with respect to single charged particle with \(1.5 \le p_\mathrm{T}< 2\) GeV at \(\sqrt{s}=~13\) TeV in different \(N_\mathrm{ch}\) domains for various MC models

Let us turn to the discussion of the dependence of the rapidity correlations on the collision energy. Figure 7 shows a comparison of the observable at \(\sqrt{s}=7\) TeV and \(\sqrt{s}=13\) TeV. All presented models, except Sherpa, show significant increase yield of particles associated with the trigger jet with the increase of the collision energy. However, for all the models the \(p_\mathrm{T}^\mathrm{rec}\) distributions converge in the distant \(\eta \) regions. In both Herwig and Pythia the only explicitly energy-dependent parameter is \(p_{T0}(s)\), see Eq. (9). However, indirectly, the energy evolution is also encoded for example in the x dependence of the PDFs. In the top two panels of Fig. 7, we show results for the two Pythia tunes. From the plots it is clear that the evolution with \(\sqrt{s}\) is stronger for the CP5 tune, which has a larger value of \(p_{T0}(13~\text {TeV})=2.8\) GeV, compare to tune CUETP8M1 which has \(p_{T0}(13~\text {TeV})=1.44\) GeV. The \(p_{T0}(s)\) dependence of Pythia prediction seems to be consistent with trends which we studied in more details in Appendix A. It is also interesting to notice that in CP5 tune the power \(\lambda \) governing the \(p_{T0}(s)\) evolution is equal to 0.03344 meaning the parameter is almost energy independent. Therefore, the energy dependence of CP5 tune is mainly governed by the PDF. On the other had in Herwig, we observe the opposite trend, tune Var2 which has smaller \(p_{T0}(13~\text {TeV})\) then tune Var1 (see Appendix B), shows stronger energy dependence, see middle panels of Fig. 7. Finally, Sherpa model predicts, to good approximation, no energy dependence for the observable.

Fig. 9
figure 9

Effect of color reconnection in Pythia 8 and Herwig ++ 2.7 models

Fig. 10
figure 10

Rapidity correlation of recoiled system with respect to single-charged particle with 1.5 \( \le p_\mathrm{T}<\) 2 GeV (a) and with respect to charged-particle jet with 3 \( \le p_\mathrm{T}<\) 3.5 GeV (b) at \(\sqrt{s}=\) 13 TeV

The study of correlations for different intervals of \(N_\mathrm{ch}\) reflects the transverse structure of colliding protons. The \(N_\mathrm{ch}\) is defined here as a number of stable charged particles with \(p_\mathrm{T}>\) 250 MeV and \(|\eta |<2.4\). We choose a single charged particle as the trigger for the present study. This is motivated by an increase of the UE contribution into a jet cone with increasing \(N_\mathrm{ch}\). For instance, the average \(p_\mathrm{T}\) density of charged particles is roughly 3 GeV per square unit at \(N_\mathrm{ch}=100\). That can contribute as much as half of the trigger jet momentum. In Fig. 8 we show the comparison of the \(p_\mathrm{T}^\mathrm{rec}\) distribution in different \(N_\mathrm{ch}\) domains at \(\sqrt{s}=\) 13 TeV for various MC models. Pythia and Herwig models exhibit fast increase of the amplitude of the rapidity correlation, up to \(N_\mathrm{ch} \approx 60\), and then it saturates. Such behaviour mimics the transverse \(p_\mathrm{T}\) density in the UE analyses [11,12,13]. At the same time Sherpa model shows continuous increase of the peak along the trigger direction. The difference between the two Herwig ++ 2.7 tunes could be explained by the difference in the transverse proton structure, which is conventionally characterized by \(\sigma _\mathrm{eff}\). The peak along the trigger particle is expected to be higher for lower \(\sigma _\mathrm{eff}\) and indeed this is confirmed by the Herwig ++ results in Fig. 8, where the tune Var1 has smaller \(\sigma _\mathrm{eff}\) (14.8 mb) compared to the tune Var2 (20.6 mb). However, interestingly, there is an opposite trend for the Pythia tunes CUETP8M1 and CP5. The \(\sigma _\mathrm{eff}\) for CUETP8M1 is 27.9 mb [7], while for CP5 \(\sigma _\mathrm{eff}\) is 25.3 mb [9], as obtained in the inclusive 4-jet production. Therefore, the observed difference is probably related to the change of PDF in the both Pythia tunes.

The color reconnection mechanism is one of the least understood elements of MPI models. Therefore, it is natural to test whether the proposed observable is sensitive to the CR. Figure 9 shows results of switching the CR on and off in Pythia  8 and Herwig models. The former shows most significant differences close the trigger region, while they almost converge at \(\eta = -1\). For the Herwig ++ 2.7 the effect is qualitatively similar, however the versions with CR and CR-off do not converge within the studied \(\eta \)-range. Such behaviour can be qualitatively explained by the fact that the trigger jet “absorbs” softer jets during the CR procedure.

It is also instructive to compare the expectations of several models within a single event generator for the proposed observable. For this purpose we choose Pythia 8 tunes discussed above and others presented in the [9], namely Pythia 8 CUETP8M1, CP2, CP4, CP5. The key features of these tunes are discussed in Sect. 4.2. Here, let us only remind that the first and second tunes use LO PDF sets, while latter are based on NNLO PDF sets. The main difference between CUETP8M1 and CP2 is that for the latter the rapidity ordering for the ISR is switched off. Similar difference is for CP4 and CP5, respectively. In order to study a sensitivity of the proposed observable to various physics mechanisms to the maximum extent, we use both discussed approaches to a trigger object. Figure 10a clearly shows that the usage of a single charged particle as a trigger makes the proposed observable sensitive to the choice of PDF sets mainly. The MC tunes in this case can be grouped according to PDF sets, i.e. LO and NNLO ones. The difference within a single group is almost within the line width in Fig. 10. The difference between the models look very different if a charged-particle jet is used as a trigger. However, Fig. 10b shows that MC models can be still grouped by a choice of PDF sets. The depth of the minimum is very close within the single PDF set. The particles that have \(\eta <1\) give a higher recoil in models with rapidity ordering during development of initial state shower.

6 Summary and conclusions

We have introduced a new observable which probes interplay between the soft and hard physics at moderate \(p_\mathrm{T}\) via probing long and short range rapidity correlations of transverse momenta of charged particles/minijets. The basic idea is to study how the transverse momenta of hadrons produced in association with a trigger object are balanced as a function of rapidity (the precise definition is given in Sect. 2). It is shown that the observable is sensitive to basic mechanisms and components used in the present MC models, such as a suppression of low-\(p_\mathrm{T}\) jet production, parton distribution functions, a transverse geometry of proton, a color reconnection mechanism, and their evolution with collision energy. We demonstrated that predictions of different MC models which describe well many characteristics of the hadron production at LHC differ significantly for suggested observable. The most prominent discrepancy between models appears when the correlation is studied as a function of charged-particle multiplicity. It is important to stress that changing the parameters within a single model results in the expected changes for the measured distribution. Therefore, the proposed measurements can help to disentangle various mechanisms relevant for minijet production. It is worth also mentioning that our tests have revealed quite peculiar features of Herwig 7 and Sherpa 2.2.2 models.

We performed our tests taking into account performance of general purpose detectors at LHC such as ATLAS and CMS. Hence, one may hope that prompt experimental studies of the quantities we calculated will be possible. The data necessary for proposed study are available from low pileup LHC runs. Standard amounts of minimum bias data, that are usually few ten million events, are enough for the measurement, however a special trigger is desired.

Obviously, the discussed correlations are sensitive to the various collective effects. Hence, it would be also interesting to study such correlations also in pA and AA scatterings. The proposed measurements can be also extended by using as trigger particles two hadrons with azimuthal angle difference \(\phi _1 - \phi _2\sim \pi /2\). One would measure \(\langle k(\phi _1,\Delta y_1) \rangle \) and \(\langle k(\phi _2,\Delta y_2) \rangle \) and compare the results with the measurements of the same quantities with one trigger particle which we studied in this paper. We expect that such an observable would have an enhanced sensitivity to the contribution of the multiparton interactions and collective effects.