Multiplicity dependence of two-particle azimuthal correlations in pp collisions at the LHC

We present the measurements of particle pair yields per trigger particle obtained from di-hadron azimuthal correlations in pp collisions at $\sqrt{s} = 0.9$, $2.76$, and $7$ TeV recorded with the ALICE detector. The yields are studied as a function of the charged particle multiplicity. Taken together with the single particle yields the pair yields provide information about parton fragmentation at low transverse momenta, as well as on the contribution of multiple parton interactions to particle production. Data are compared to calculations using the PYTHIA6, PYTHIA8, and PHOJET event generators.


Introduction
The multiplicity distribution of particles produced in proton-proton (pp) collisions and the multiplicity dependence of other global event characteristics represent fundamental observables reflecting the properties of the underlying particle production mechanisms.In the Feynman picture, the strongly interacting hadrons can be seen as bunches of point-like partons producing particles in interactions with small (soft) and large (hard) momentum transfer.As expected from Feynman scaling [1], at low centre-of-mass energies ( √ s), where particle production is dominated by soft interactions, the mean number of particles M was found to rise logarithmically with √ s.Moreover, the evolution of the charged particle multiplicity distribution P(M) as a function of √ s follows the Koba-Nielsen-Oleson (KNO) scaling [2] with scaling variable z = M/ M and P(M) M = ψ(z), where ψ(z) is an energy independent function.Experimentally one finds that KNO scaling is violated for √ s > 200 GeV [3].This scaling violation which increases with √ s has been interpreted as a consequence of particle production through multiple parton-parton interactions (MPI) [4,5].Further, at the LHC, already at a transverse momentum transfer of a few GeV/c the cross section for leading order (LO) parton-parton scatterings exceeds the total pp inelastic cross section.This apparent inconsistency can be resolved by aggregating several quasi independent scatterings in the same pp collision [6,7].If multiple semi-hard scatterings play a dominant role in the production of high multiplicity events, this should lead to distinct experimentally observable effects.The search for these is the aim of the present analysis of pp collisions recorded with the ALICE detector at the LHC.
Each parton-parton scattering produces partons almost back-to-back in azimuth, ϕ.They fragment producing two correlated bundles of particles.With increasing multiplicity we expect that both the number of sources of correlated particles and the number of correlated particles per source increase.Thus, we have designed our analysis methods in a way that the two effects can be separated as much as possible.Since many of the bundles of particles (low transverse-momentum jets) overlap in the same event, they can not be identified and separated event-by-event.An alternative method, pursued in this analysis, is to study two-particle angular correlations as a function of the event multiplicity [8].
Such studies involve measuring the distributions of the relative angle ∆ϕ between particle pairs consisting of a "trigger" particle in a certain transverse momentum p T,trig interval and an "associated" particle in a p T,assoc interval, where ∆ϕ is the difference in azimuth ϕ between the two particles.The p T ranges chosen for the analysis (p T,trig > 0.7 GeV/c and p T,assoc > 0.4(0.7)GeV/c) are a compromise between being high enough to decrease the sensitivity to low energy phenomena such as the breaking of individual strings (p min T ≫ Λ QCD ) and sufficiently low such that the correlations are sensitive to the bulk of the particle production.These cuts have been also used by the CDF collaboration to define so-called trackclusters: a track with p T > 0.7 GeV/c with at least one other track with p T > 0.4 GeV/c in a cone of radius ∆ϕ 2 + ∆η 2 < 0.7, where ∆η is the pseudo-rapidity difference [9].In the CDF analysis, the presence of a track-cluster has been used for an event-by-event identification of hard events.In the present correlation analysis the ∆ϕ distributions are averaged over all events of a given sample.This has the advantage that random correlations, which become dominant at high multiplicities, can be subtracted.The mean number of trigger particles per event and the correlated pair-yield per trigger are measured and combined in a way that they can provide information about the number of semi-hard scatterings in the event of a given charged particle multiplicity as well as the fragmentation properties of low-p T partons biased by the multiplicity selection.
Although the full final state of pp collisions cannot be calculated in perturbative QCD, pQCD-inspired models based on multiple parton interactions provide a consistent way to describe high multiplicity pp collisions, and have been implemented in recent Monte Carlo (MC) generators like PYTHIA6 [6,10], PYTHIA8 [11], PHOJET [12] and HERWIG [13].Using the QCD factorisation theorem [14] cross sections are calculated from a convolution of the short-distance parton-parton cross section and the long-distance parton distribution function (pdf) of the proton.Approaching zero momentum transfer, the leading order short distance cross sections diverge and the models have to implement regularisa-tion mechanisms to control this divergence.Moreover, parton distribution functions are only known for single parton scatterings and, hence, extensions for multiple interactions are needed.Furthermore, each partonic interaction produces coloured strings between the final state partons which overlap in the case of many interactions.It is possible that in this case partons do not hadronise independently and phenomenological models have been developed to account for so-called colour connections and reconnections.Measurements that can provide information on multiple parton interactions and fragmentation properties are important to constrain such models.Consequently, we compare our results for pp collisions at √ s = 0.9, 2.76 and 7 TeV among each other and to the outcome of Monte Carlo simulations on generator level with different PYTHIA6 tunes (Perugia-0 and Perugia-2011 [15]), PYTHIA8 and PHOJET.We have chosen this set of generators and tunes since they have been already compared to previous ALICE measurements based on azimuthal correlations: the underlying event [16] and transverse sphericity [17]; Ref. [16] contains a short description of them.
The paper is organised in the following way: the ALICE sub-systems used in the analysis are described in section 2 and the data samples, event and track selection in section 3. Section 4 introduces the analysis strategy.In sections 5 and 6 we focus on the data correction procedure and systematic uncertainties, respectively.Final results are presented in section 7 and in section 8 we draw conclusions.

Experimental setup
The pp collision data used for this analysis were recorded by the ALICE detector at the LHC.The detector is described in detail in Ref. [18].In the following, only the sub-detectors used in this analysis are described in detail.These are the VZERO detector, the Inner Tracking System (ITS) including the Silicon Pixel Detector (SPD), the Silicon Drift Detector (SDD), and the Silicon Strip Detector (SSD), as well as the Time Projection Chamber (TPC).The VZERO detector and the SPD are used to trigger on minimum bias events.The track reconstruction of charged particles is performed with the combined information from the ITS and the TPC.
The VZERO scintillator hodoscope is divided into two arrays of counters, VZERO-A and VZERO-C located at 3.4 m and -0.9 m from the nominal interaction point along the beam axis, respectively.VZERO-A covers the pseudorapidity range of 2.8 < η < 5.1 and VZERO-C −3.7 < η < −1.7.
The Inner Tracking System (ITS) comprises 6 cylindrical layers of silicon detectors of three different detector types, each contributing with two layers.The Silicon Pixel Detector constitutes the first two layers of the ITS.The sensitive part of the detector is made of high granularity 250 µm-thick hybrid silicon pixels consisting of a 2-dimensional matrix of reversed-biased silicon detector diodes with 10 7 read-out channels.The pseudorapidity coverage is |η| < 1.98 for the first layer and |η| < 1.4 for the second layer.The SPD contributes to the minimum bias trigger as well as to the reconstruction of tracks left by charged particles, and the vertex reconstruction.The Silicon Drift Detector comprises the two intermediate layers of the ITS.The sensitive part consists of homogeneous high-resistivity 300 µm-thick n-type silicon wafers with 133000 read-out channels.The SDD contributes to the reconstruction of tracks of charged particles as well as to the particle identification using energy loss information.The Silicon Strip Detector composes the two outermost layers of the ITS.The double-sided SSD has 2.6 million read-out channels and contributes like the SDD to the track reconstruction and the particle identification.Furthermore, it is optimised for track matching between the ITS and the Time Projection Chamber.The total material budget of the ITS traversed by straight tracks perpendicular to the detector surface amounts to 7.2% X 0 .
The main tracking detector of the ALICE central barrel is the Time Projection Chamber.It is a cylindrical detector filled with 90 m 3 of gaseous Ne/CO 2 /N 2 at a mixing ratio of (85.7/9.5/4.8).High-voltage is applied to the central membrane, resulting in an electric field between the central electrode and the end caps, which are each equipped with multi-wire proportional chambers.The TPC provides full az-

Trigger and offline event selection
Minimum bias events were selected using the following trigger requirements: at least one charged particle needs to be detected in either the SPD or in one of the two VZERO detectors in coincidence with signals from the two BPTX beam pick-up counters indicating the presence of two intersecting proton bunches [23].In addition to the online trigger selection, the trigger decision is reprocessed offline using the same selection criteria; however, the reconstructed information are used instead of the online signals.
Only events having exactly one good quality reconstructed primary collision vertex are used in the analysis.Collision vertices are reconstructed using either reconstructed tracks or so-called tracklets [23] based on correlated hits measured in the two SPD layers.A vertex passes the quality selection if it is located within |z vertex | < 10 cm with respect to the nominal interaction point in beam direction and if at least one track contributes to the reconstruction of the vertex.Pile-up events with more than one reconstructed collision vertex are rejected from the analysis.Furthermore, we require at least one reconstructed highquality track (see section 3.2) in the combined ITS-TPC acceptance of p T > 0.2 GeV/c and |η| < 0.9.The discussed event selection cuts efficiently suppress events from beam-gas and beam-halo interactions as well as from cosmic rays.Table 1 shows the number of recorded minimum bias events that pass the event selection cuts.The vertex-cut efficiency is dominated by the vertex quality requirements.The single vertex requirement after vertex quality cuts removes up to 0.5 % additional events.

Track cuts
In the analysis, we consider only charged primary particles which are defined as prompt particles produced in the collision and their decay products except products of weak decays of strange particles.The data analysis is performed using track selection cuts optimised for a uniform azimuth (ϕ) acceptance and for a minimal contamination of tracks by particles originating from secondary vertices (secondary particles) [16].The track selection comprises the following cuts: tracks are required to have at least three associated hits in the ITS, one of which has to be located in the first three ITS layers.Furthermore, each track needs to have at least 70 associated TPC clusters measured in the 159 TPC pad rows.The quality of the track parameter fitting is measured by the χ 2 per TPC cluster and tracks passing our se- lection have χ 2 per cluster < 4. No tracks with a kink topology indicating a particle decay are accepted.A p T -dependent DCA xy -cut corresponding to 7 times the σ of the expected primary track distribution (DCA xy, max ≈ 0.2 cm) assures that the tracks passing the selection criteria are predominantly those from the primary vertex.In addition, a cut on the distance of closest approach in the z-direction of maximal DCA z = 2 cm improves the selection of primary particles and rejects particles from secondary vertices originating from, for example, the decay of long-lived particles or hadronic interaction in the detector material.Moreover, this cut removes tracks originating from displaced pile-up vertices.Out of the selected high quality tracks, the data analysis accepts tracks within the ITS-TPC acceptance |η| < 0.9 and with p T > 0.2 GeV/c.The track selection cuts are summarised in table 2.

Definitions
We are analysing the sample-averaged probability distribution of the azimuthal difference ∆ϕ = ϕ trig − ϕ assoc between trigger particles (p T,trig > p min T,trig , |η| < 0.9) and associated particles (p T,assoc > p min T,assoc , |η| < 0.9).The pair-yield per trigger as a function of ∆ϕ is defined as where N trig is the number of trigger particles and N assoc is the number of associated particles.We study the pair-yield per trigger as a function of the charged particle multiplicity N charged,|η|< 0.9, p T > 0.2 GeV/c , as well as for different transverse momentum thresholds p min T,trig and p min T,assoc .
The left panel of figure 1 shows an example of the measured per-trigger pair yield as a function of ∆ϕ for p T,trig > 0.7 GeV/c and p T,assoc > 0.4 GeV/c and N charged,|η|< 0.9, p T > 0.2 GeV/c = 30.The two structures at the near-side (∆ϕ ≈ 0) and away-side (∆ϕ ≈ π) of the trigger particle are dominantly induced by the fragmentation of back-to-back parton pairs.In order to extract the per-trigger pair-yields for all multiplicity and p T -cut classes, a fit function is introduced which allows us to decompose the azimuthal correlation into its main components.Whereas the awayside peak can be fitted using a single Gaussian, the near-side peak shows an enhanced tail-region and needs the superposition of two Gaussians with different widths.Including a constant C to describe the combinatorial background, we obtained the fitting function To increase the stability of the fit, the first near-side Gaussian and the away-side Gaussian are restricted to −π/2 < ∆ϕ < π/2 and π/2 < ∆ϕ < 3π/2, respectively.The second near-side Gaussian is fitted in the region −π/5 < ∆ϕ < π/5.
The right panel of figure 1 shows the measured azimuthal correlation, the parametrisation of the correlation based on the fit function, and the sub-components of the fit function.The χ 2 per degree of freedom for this fit is 1.63.

Pair yield for asymmetric and symmetric p T -bins
In the case of non-overlapping p T intervals for the trigger and associated particles, the pair yield per trigger measures the conditional yield of associated particles under the trigger condition.Beside non-overlapping p T intervals (asymmetric bins), we are using symmetric bins for which the two intervals are identical.In this case, for n trigger particles, n(n− 1)/2 unique pairs with p T,trig > p T,assoc can be formed and, hence, the pair yield per trigger particle measures: In general, without the knowledge of the second moment n 2 of the number distribution function P n , the mean number of correlated particles n cannot be determined.However, for small n and monotonically falling P n , the expression has a well defined limit: 1 2 Since n /(1−P 0 ) is the mean value of the distribution P n under the condition that at least one particle has been produced, the right-hand side represents the number of particles associated with a trigger particle.Note that for jet-like self-similar particle emission (geometric series) the approximation is exact.
Pair yield extraction Based on the fit function of equation 2, five observables can be derived.Three of the observables are directly related to the decomposed pair yield per trigger: -Per-trigger pair yield in the combinatorial background -Per-trigger pair yield in the near-side peak -Per-trigger pair yield in the away-side peak The yields in the near-side and away-side peaks measure fragmentation properties of low-p T partons.In addition, the average number of trigger particles N trigger is determined: The average number of trigger particles depends on the number of semi-hard scatterings per event and the fragmentation properties of partons.With the aim to reduce the fragmentation dependence and to increase the sensitivity to the number of scatterings per event we define for symmetric p T -bins a new observable, average number of uncorrelated seeds, by combining the average number of trigger particles with the near-side and away-side yield of trigger particles (p T > p T, trig ).where N assoc, near+away, p T >p T, trig = N assoc,near−side + N assoc,away−side (10) and also the associated particles have p T > p T, trig .
Model studies (see section 4.2) show that the ratio effectively corrects for the multiplicative effect of fragmentation so that the obtained quantity provides information about the number of uncorrelated sources of particle production.

Relation between experimental observables and the PYTHIA MPI model
The PYTHIA MC for pp collisions includes a model for multiple parton interactions.Within the PYTHIA model, the dependence between the number of uncorrelated seeds N uncorrelated seeds and the average number of multiple parton interactions N MPI can be studied.Here, the number of multiple parton interactions N MPI is defined as the number of hard or semi-hard scatterings that occurred in a single pp collision [10].The number of multiple parton interactions N MPI is shown in figure 2 for the PYTHIA6 tunes Perugia-0 and Pro-Q2O [15].Both MC tunes predate LHC data and give a good description of Tevatron (pp at √ s = 2 TeV) results.However, they have very different probability distributions for N MPI .Whereas Pro-Q2O features a wide plateau, that of Perugia-0 is much narrower.
The dependence between the number of uncorrelated seeds and the number of multiple parton interactions in PYTHIA6 tune Perugia-0 simulations on generator level is shown in figure 3 for different |η|-ranges and p T,trig -thresholds.For all cases, we see a linear dependence.The same is observed for the tune Pro-Q2O (not shown).However, the difference in width of the MPI distributions has direct consequences for the experimental observables defined in the previous subsection 4.1 demonstrating their sensitivity to MPI and fragmentation properties.Figure 4 (left panel) shows the near-side pair-yield per trigger as a function of multiplicity.In the case of tune Pro-Q2O the yield reaches a plateau at N ch > 15 after which it rises only very slowly.In contrast, tune Perugia-0 shows a rather steep rise with a change to an even steeper slope at N ch ≈ 50.The reason is the limited N MPI in this tune.In order to reach high multiplicities the number of fragments per parton has to increase together with N MPI .This can also be observed in figure 4 (right panel) where the number of uncorrelated seeds as a function of charged multiplicity is shown.For the tune Pro-Q2O an almost linear rise as a function of charged multiplicity is observed up to the highest multiplicities, whereas for the tune Perugia-0, it starts to level off at about N ch ≈ 50.

Correction procedure
We corrected for the relevant inefficiencies such as detector acceptance, reconstruction, two-track and vertex reconstruction efficiency.In addition, the contamination of the sample of primary tracks by secondary particles was also corrected for.The trigger inefficiency is not part of these corrections as it is negligible for events with at least one track in the considered acceptance |η| < 0.9.In the following paragraphs, the correction steps are discussed in detail.Table 3 shows a breakdown of the main correction steps and corresponding efficiencies or contamination for the different collision energies.They have been estimated from full transport and detector response simulations of PHOJET and PYTHIA6 tune Perugia-0 events using GEANT3 and a data driven correction procedure.We show the efficiencies for the lowest p T -cut used in the analysis (p T > 0.2 GeV/c for the charged particle multiplicity), because it corresponds to the largest inefficiency and contamination.

Tracking efficiency
The tracking efficiency is given by the ratio of the number of reconstructed tracks from primary particles after track quality cuts to the number of primary particles.depends on the kinematic properties of the particle (p T , η, ϕ) and is influenced by the detector geometry, the probability of particle absorption in the detector material and particle decays.Figure 5 shows the tracking efficiency for the different centre-of-mass energies obtained by projecting 2-dimensional η − p T -correction maps and integrating over ϕ.For the analysed data sets, the integrated tracking efficiency lies in the range 76 % to 77 %.

Secondary particle contamination
The standard Monte Carlo based contamination correction is given by the ratio of the number of reconstructed tracks after track quality cuts to the number of reconstructed tracks of primary particles.The contamination of the reconstructed tracks passing the quality cuts is mainly due to decay products from strange particles, photon conversions, and hadronic interactions with the detector material.Figure 6 shows the contamination correction as a function of the transverse momentum and the pseudorapidity.For the analysed data sets, the integrated contamination correction amounts to approximately 5 %.
In addition to the Monte Carlo based contamination correction, a data driven correction has been applied.This correction is based on the results of Ref. [24,25] which show that the generators PHOJET and PYTHIA6 tune Perugia-0 used in the correction procedure strongly underestimate strange particle yields.This underestimation leads to an incomplete correction of the contamination in ALICE data when using Monte Carlo based correction maps only.Based on the measured yields of strange particles, an additional correction factor of approximately 1 % has been added to the 5 % obtained from the standard MC contamination correction.

Two-track and detector effects
Effects such as track splitting, track merging, decay of long-lived particles, hadronic interactions with the detector material, gamma conversions as well as a non-uniform ϕ-acceptance induce modulations of the ∆ϕ-distributions that have to be taken into account.These modifications can not be corrected in single-track-corrections only.Figure 7 shows the ratio dN ∆ϕ (pair corrected tracks )/ dN ∆ϕ (pair MC particles ).
The ratio is presented for all tracks, for tracks from primary particles only, and for tracks of mixed events each after single track correction.An enhanced number of particle pairs peaked around ∆ϕ = 0 is found after single track correction for the three cases.The ratio of corrected pairs to Monte Carlo particle pairs including secondary particles also shows a small enhancement around ∆ϕ = π.To correct for this effect, a two-track post-correction is performed after the single track correction, using Monte Carlo based correction factors which depend on ∆ϕ, p min T,trig , and p min T,assoc .The correction decreases with increasing transverse momentum thresholds.For the analysed data sets, the maximum effect from this correction (5 %) is observed for the lowest values of p min T,trig = 0.7 GeV/c and p min T,assoc = 0.4 GeV/c and at the highest N charged , where the ratio near-side yield over combinatorial background is lowest.

Vertex reconstruction efficiency
The vertex reconstruction efficiency is the ratio between the number of triggered events with a reconstructed accepted vertex of good quality and the number of triggered events.The vertex reconstruction efficiency has not only an impact on the number of events but also on the total number of particles entering the data sample.The effect of the vertex reconstruction efficiency contributes with 1.2 % to 2.5 % to the multiplicity integrated track-to-particle correction for the analysed data sets.The impact of the vertex reconstruction efficiency depends strongly on the charged particle multiplicity effecting only the low N charged bins.For N charged > 10, the vertex reconstruction efficiency is consistent with unity.

Trigger efficiency
The correction of the trigger efficiency takes into account the fact that the number of triggered events is only a subset of the produced events of a given event class.However, the trigger is fully efficient for events with at least one charged track in the considered ITS-TPC acceptance.Hence, no correction for the trigger efficiency is applied.

Charged particle multiplicity correction
The present analysis studies the evolution of the integrated yields of the azimuthal correlation as a function of the true charged particle multiplicity N charged in the range p T > 0.2 GeV/c and |η| < 0.9.Our approach to a full correction of detector effects on the multiplicity is a two-step procedure: first, the correction of the raw two-particle correlation observables O unc from O unc (N rec,charged ) to its corrected value O corr (N rec,charged ) is performed as a function of the reconstructed uncorrected multiplicty N rec,charged .Then, the correction of the charged particle multiplicity from O corr (N rec,charged ) to O corr (N charged ) is carried out to obtain the corresponding observable at the corrected charged particle multiplicity N charged .The same procedure has also been used for the measurement of the mean transverse momentum and the transverse sphericity as a function of the true multiplicity as described in Refs.[26,17].The correction employs the correlation matrices R(N charged , N rec,charged ) which are proportional to the probability of reconstructing N rec,charged particles under the condition that N charged particles have been produced.They are obtained from full detector simulations quantifying the relation between the number of charged primary particles and the number of reconstructed tracks both in p T > 0.2 GeV/c and |η| < 0.9 as shown in the left panel of figure 8.
The columns of the correlation matrix have to be normalised to one The normalised correlation matrix represents the conditional probability for measuring an event of a given true multiplicity, N charged , for a given reconstructed track multiplicity of N rec,charged .In a second step, the correlation matrix is extrapolated to the highest multiplicities not covered in the detector simulation due to the limited number of simulated events.To this end, the distribution of each matrix column at low multiplicities is fitted with a Gaussian function.As expected, the width of the Gaussian functions  Table 4: Systematic uncertainties for the per-trigger near-side pair yield measured using p T, trig > 0.7 GeV/c, p T, assoc > 0.4 GeV/c, and |η| < 0.9 exemplary for all final analysis results for two charged particle multiplicity bins.The full charged particle multiplicity dependence can be found in Ref. [27].
grows approximately as σ ∝ N rec,charged and the mean grows as N charged ∝ N rec,charged .This scaling is used to extrapolate the correlation matrix to higher multiplicities.An extrapolated correlation matrix with normalised columns is shown in the right panel of figure 8. Based on the normalised and extended correlation matrix, the observable O corr (N rec,charged ) can be converted to O corr (N charged ) using

Systematic uncertainties
A comprehensive study of the systematic uncertainties of the final analysis results has been performed.In the following, the sources of systematic uncertainties and their impact on the analysis results are described.Representative for all final analysis results, the systematic uncertainties of the per-trigger near-side pair yield measured using p T,trig > 0.7 GeV/c and p T,assoc > 0.4 GeV/c as a function of the charged particle multiplicity are discussed in the text and summarised in table 4.

Per-trigger pair yield measurement based on a fit function
The per-trigger pair yield of the azimuthal correlation is extracted utilizing the fit function of equation 2. A good agreement between the data distribution and the fit function has been found using residuals as well as the χ 2 /NDF test.The stability of the fit results has been verified based on various tests.For example, it has been checked that a modification of the combinatorial background of the azimuthal correlation does not change the extracted yields of the near and the away-side peaks.Moreover, it has been verified that the combination of events results in the expected modification of the per-trigger pair yield components.In addition, the minimum number of events needed for a stable fit result as well as the optimised resolution of the ∆ϕ-distribution in terms of the bin-size have been determined.

Correction procedure
In section 5, a full correction procedure of detector effects has been introduced.When correcting event generator data after full detector simulations with correction maps obtained with the same event generator, it is expected to recover the Monte Carlo input.A remaining disagreement between the corrected results and the input Monte Carlo results represents the systematic uncertainty of the correction procedure.As an example, the per-trigger near-side pair yield obtained from the MC input and the corrected results differ from each other by up to 12.8 % for the first charged particle multiplicity bin and by less than 3.0 % for higher charged particle multiplicity bins.
Correction maps can be estimated with different Monte Carlo generators.When using correction maps of one Monte Carlo generator for the correction of data of a second Monte Carlo generator, further discrepancies can emerge.The per-trigger near-side pair yields of corrected data obtained using PYTHIA6 tune Perugia-0 correction maps and using PHOJET correction maps differ by less than 2 % for all charged particle multiplicities.
We have estimated the impact of the transport Monte Carlo choice on the final analysis results.For this purpose, in addition to the default GEANT3 [20] detector simulations, a sample of pp events has been simulated using GEANT4 [21,22].The results obtained with the GEANT3 and the GEANT4 based correction maps are in very good agreement.The results differ from each other by a maximum of 0.3 % for all charged particle multiplicities.

Track and vertex selection
The systematic uncertainty related to the choice of the track selection cuts introduced in section 3.2 is estimated by performing a full correction and analysis chain using varying track selection cuts.For this purpose, the default ITS-TPC track cuts have been loosened and tightened within reasonable limits.In addition, tracks measured exclusively with the TPC have been analysed.The per-trigger near-side pair yield shows a sizable difference when using the different track cuts of up to 16 % for the first charged particle multiplicity bin, however, the impact decreases to less than 2.5 % for higher charged particle multiplicities.
The impact of the vertex selection choice is tested by varying the vertex quality cuts.Instead of requiring at least one track associated to the collision vertex, two tracks are required.The impact of this modification on the per-trigger near-side pair yield is 2 % for the lowest charged particle multiplicity bin and compatible with zero for charged particle multiplicities above N charged > 10.

Tracking efficiency
The ITS-TPC tracking efficiency uncertainty has been estimated by comparing the track matching efficiency between ITS and TPC and vice versa for simulated data and real data [26,28].The disagreement between the matching efficiencies is then converted into a transverse momentum dependent reconstruction efficiency uncertainty.By varying the reconstruction efficiency accordingly, the systematic uncertainty on the final analysis results can be estimated.The impact of this uncertainty on the per-trigger near-side pair yield is about 4 % for all charged particle multiplicities.
The material budget of ALICE has been measured with the help of photon conversions in the detector material.The remaining uncertainty in the knowledge of the material budget can be converted into a transverse momentum dependent uncertainty of the tracking efficiency.The effect of this uncertainty results in a small variation of analysis results.For example, the per-trigger near-side pair yield is modified by below 0.4 % for all charged particle multiplicities.
The ITS-TPC tracking efficiency estimated in full detector simulations depends to some extent on the composition of the particle yields.This is due to the fact that the particle decay length and the probability to be absorbed in the detector material depends on the particle type.The systematic uncertainty related to the particle composition has already been studied in Ref. [16].Motivated by a disagreement of measured particle yields to predictions of PYTHIA6 and PHOJET [24,25], the yields of pions, kaons, and protons used in the calculation of correction maps of detector effects have been modified by ±30 % [16].The effect of this modification accounts for a variation of the final results of at most 2.0 %.

Pile-up events
The impact of pile-up events on the analysis results has been tested by analysing high pile-up data sets.A quantitative estimation of the systematic uncertainty related to pile-up events in the data analysis has been performed by splitting the default data sets into sub-sets of relatively low and relatively high pileup-probability.The difference between the analysis results of the two sub-sets The ALICE collaboration accounts for about 5 % for the lowest charged particle multiplicity bin and below 1 % for all higher charged particle multiplicities.

Extrapolation of strangeness correction
As part of the contamination correction procedure described in section 5, a data driven contamination correction has been performed accounting for the underestimated strangeness yield in the Monte Carlo generators.This correction is based on ALICE measurements at √ s = 0.9 TeV [24,25], however, these corrections were also used to correct collision data measured at √ s = 2.76 and 7 TeV.The uncertainty related to the extrapolation of this correction to higher centreof-mass energies can be estimated using measurements of strange particle yields performed by the CMS experiment at √ s = 0.9 and 7 TeV [29].When performing the same data driven contamination correction based on the CMS measurements, small modification of the final results can be observed.The systematic uncertainty of the per-trigger near-side pair yield related to the extrapolation of the strangeness correction is below 2 % for the first charged particle multiplicity and compatible with zero for charged particle multiplicities above N charged > 8.

Results
The two-particle correlation analysis are now presented, after having included the corresponding corrections described in the previous sections.Results are discussed for the three different centre-of-mass energies and two sets of p T -cuts: p T,trig > 0.7 GeV/c, p T,assoc > 0.4 GeV/c and p T -cuts: p T,trig > 0.7 GeV/c, p T,assoc > 0.7 GeV/c.The second, symmetric, bin is used to analyse the number of uncorrelated seeds.ALICE data are presented as black points and the results of Monte Carlo calculations as coloured symbols.The error bars represent the statistical errors and the boxes the systematic uncertainties.The horizontal error bars correspond to the bin-width.For measurements as a function of the charged particle multiplicity, the upper part of the figures shows the analysis results and the lower part shows the ratio between data and the Monte Carlo calculations.
Before discussing in detail the multiplicity and centre-of-mass energy dependence and their implications for multiple parton interactions, we present in figure 9 an example of a measured azimuthal correlation function.In the figure, the data are compared to various MC simulations on generator level for the charged particle multiplicity bin N charged = 10 at √ s = 7 TeV.The part of the systematic uncertainty that has the same relative contribution for all ∆ϕ-bins is presented as a box on the left side of the data points.The height of the box corresponds to the value of the leftmost data point (at ∆ϕ = −π/2) and must be scaled for all other data points according to their absolute values.
Within the systematic uncertainties, the constant combinatorial background is of the same height for data and all PYTHIA tunes.PHOJET shows a lower combinatorial background.The near-side peak centred around ∆ϕ = 0 is overestimated by all Monte Carlo generators in terms of its height and its integral above the combinatorial background.Here, PYTHIA6 tune Perugia-2011 shows the best agreement with data.The width of the near-side peak is roughly reproduced by the Monte Carlo generators.PHOJET and PYTHIA8 tune 4C produce an away-side peak (∆ϕ = π) with a higher absolute height than in data.The PYTHIA6 tunes Perugia-0 and Perugia-2011 both agree with data in terms of the height of the away-side peak.PYTHIA8 4C and PHOJET overestimate the integral of the away-side peak above the constant combinatorial background.Here, PYTHIA6 Perugia-0 agrees with data, and PYTHIA6 tune Perugia-2011 underestimates the data slightly.The width of the away-side peak is much narrower in PHOJET than in data while the PYTHIA tunes give only a slightly narrower away-side peak.

Yields
First, the analysis results for the highest analysed collision energy √ s = 7 TeV are presented.Next, we discuss the collision energies √ s = 2.76 TeV and 0.9 TeV.

Near-side
The per-trigger near-side pair-yield which provides information on the fragmentation of partons is presented in the top left panel of figure 10 for p T,trig > 0.7 GeV/c and p T,assoc > 0.4 GeV/c.The measured near-side pair yield grows as a function of the charged particle multiplicity indicating a fragmentation bias as characteristic for a MPI distribution with a narrow plateau (tune Perugia-0, see section 4.2).This general trend is reproduced by the MC generators.As expected PYTHIA6 tune Perugia-2011 and PYTHIA8 tune 4C, which already include LHC data, are closest to the data.For N charged > 20 (Perugia-2011) and N charged > 30 (4C) the agreement is within the systematic errors, while in this region, all other models overestimate the data by up to 50 %.For all MCs, the agreement becomes worse moving to lower multiplicities.Here, Perugia-2011 also overestimates the data by up to 30 %.The largest deviations (up to 120 %) are found in the comparison with PHOJET.
For the higher p T,assoc -cut (> 0.7 GeV) the agreement is with the exception of PYTHIA6 tune Perugia-2011 and PYTHIA8 at high N ch worse (figure 10 (top right)).In particular, for low multiplicities the deviation is between 40 % and 150 %.

Away-side
The per-trigger away-side pair yield which provides information about the fragments produced back-to-back within the detector acceptance is presented for p T,trig > 0.7 GeV/c and p T,assoc > 0.4 GeV/c in the left panel of the second row of figure 10.As with the near-side yield, the measured away-side pair yield grows as a function of the charged particle multiplicity.Above N charged = 10, the growth is significantly stronger on the away-side.Surprisingly, tune Perugia-0 now agrees with the data within uncertainties over the whole multiplicity range, whereas Perugia-2011 and PYTHIA8, which have the best agreement for the near-side yield, significantly underestimates the away-side yield.The deviations of PHOJET is similar to the ones observed for the near-side.When increasing the p T,assoc -threshold to 0.7 GeV/c (right panel of the second row of figure 10), also PYTHIA6 tune Perugia-0 overestimates the away-side pair yield by about 30 %, whereas tune Perugia-2011 and PYTHIA8 show the best agreement at high N charged .

Combinatorial background
The per-trigger pair yield in the constant combinatorial background of the correlation grows linearly as a function of the charged particle multiplicity as shown in the third row of figure 10.The data are well described by all models within the systematic uncertainties for all charged particle multiplicities for p T,assoc > 0.4 GeV/c (left panel).When increasing the p T,assoc -threshold to 0.7 GeV/c (right panel), PHOJET underestimates the combinatorial background by approximately 20 %.

Trigger particles per event
The average number of trigger particles with p T,trig > 0.7 GeV/c as a function of the charged particle multiplicity is presented in the bottom-left panel of figure 10.The average number of trigger particles grows stronger than linearly as a function of the charged particle multiplicity.This can be understood from the pair yield results.As the multiplicity increases, both the number of semi-hard scatterings per event and the number of fragments per scattering increase, leading to a greater than linear increase in the number of particles above a given p T -threshold.This observation is also consistent with the observed increase of the mean transverse momentum with multiplicity [26].
The PYTHIA6 tunes slightly overestimate the ALICE results while PHOJET underestimates the data.
The agreement with PYTHIA8 is excellent for N charged > 15.

Number of uncorrelated seeds
The average number of uncorrelated seeds (c.f.equation 9) is presented in the bottom right panel of figure 10.At low multiplicities, the number of uncorrelated seeds grows almost linearly.At high multiplicities, the growth decreases.All models reproduce the qualitative development of the number of correlated seeds as a function of the charged particle multiplicity.While the data are significantly underestimated by PHOJET, PYTHIA6 and PYTHIA8 reproduce the results reasonably well.

Centre-of-mass energy dependence
Figures 11 and 12 show the observables discussed above measured at the two lower centre-of-mass energies √ s = 2.76 and 0.9 TeV.On average, the agreement between the model calculations and the ALICE results improves with decreasing collision energy.However, qualitatively the behaviour of the different models is similar.Tune Perugia-2011 agrees best with the measured near-side yield and underpredicts the away-side yield, for which Perugia-0 has the best agreement.PHOJET generally shows the worst agreement.However, the agreement between PHOJET and the ALICE results in terms of the nearand away-side yields is good for √ s = 900 GeV at high multiplicity, whereas PYTHIA8 has the largest disagreement in this region.To allow for a more direct comparison of the trends as a function of centre-of-mass energy, figures [13][14][15][16][17] show in the same plots the multiplicity dependence for the three energies for data (top left) and for the various MC generators.We note that the colors now indicate the different beam energies.In data, the near-side pair yield in a fixed charged particle multiplicity bin (figure 13) grows as a function of √ s.While all event generators reproduce this increase qualitatively, PHOJET shows a significantly stronger energy dependence than the data and the PYTHIA results.The away-side pair yield in a fixed charged particle multiplicity bin measured by ALICE decreases as a function of the centre-of-mass energy as shown in figure 14.This decrease is explained by the limited η-acceptance.Due to the longitudinal momentum distribution of partons in the colliding protons, the scattered partons have a wide relative ∆η distribution that increases with increasing √ s.While all PYTHIA tunes reproduce the away-side yield decrease, PHOJET does not show a clear energy dependence of the yield in the studied centre-of-mass energy range.The combinatorial background in a fixed charged particle multiplicity bin does not show any centre-ofmass energy dependence (figure 15).This behaviour is well reproduced by all Monte Carlo generators.The average number of trigger particle shown in figure 16 grows slowly as a function of the centre-ofmass energy.The average number of uncorrelated seeds (figure 17) also grows slowly as a function of the centre-of-mass energy.This increase is smallest for PHOJET.The qualitative centre-of-mass energy dependence of the average number of trigger particle and the average number of uncorrelated seeds is well reproduced by the Monte Carlo generators.

Multiple parton interactions
Interpreted in the context of the PYTHIA model, the number of uncorrelated seeds (c.f.equation 9) provides information about the number of semi-hard parton-parton interactions per event as discussed in section 4. In the top left panel of figure 17, the average number of uncorrelated seeds as a function of the charged particle multiplicity is presented for the centre-of-mass energies √ s = 0.9, 2.76, and 7 TeV. Figure 18 shows the residuals between the data points and linear fit functions ((data-fit)/data).It can be observed that the charged particle multiplicity increases approximately linearly with the number of uncorrelated seeds.However, it deviates from the linear dependence at large charged particle multiplicities.Here, the rise of the number of uncorrelated seeds levels off.This observation is consistent with the assumption that at highest multiplicities a further increase of the number of multiple parton interactions becomes extremely improbable.In this scenario, high charged particle multiplicities can only be reached by selecting events with many high-multiplicity jets.

Conclusions
We have studied the pair-yields per trigger in two-particle azimuthal correlations between charged trigger and associated particles in pp collisions at √ s = 0.9, 2.76, and 7 TeV.The correlations have been measured for charged particles recorded with the ALICE central barrel detectors ITS and TPC covering the full azimuth and a pseudorapidity range of |η| < 0.9.The analysis has been performed as a function of the charged particle multiplicity and for the transverse momentum thresholds for trigger particles of p T,trig > 0.7 GeV/c and for associated particles of p T,assoc > 0.4 and 0.7 GeV/c.The azimuthal correlations have been decomposed into the pair yield in the combinatorial background, the pair yield in the near-side peak (∆ϕ ≈ 0), and the pair yield in the away-side peak (∆ϕ ≈ π).Furthermore, the average number of trigger particles per event have been measured.While the per-trigger near-side and away-side pair yield provide information about fragmentation properties of low-p T partons, the average number of trigger particles includes information from both the number of sources of particle production and the fragmentation.In order to increase the sensitivity to the number of sources of particle production, we have defined an observable, number of uncorrelated seeds, in which the impact of the fragmentation is reduced.Using PYTHIA simulations on generator level, we have shown that the number of uncorrelated seeds is proportional to the number of semi-hard parton-parton interactions in pp collision.However, the factor of proportionality depends on the tune and, hence, no absolute number of interactions can be derived from this procedure.
The per-trigger near-and away-side pair-yields as a function of the charged particle multiplicity increase with multiplicity.This increase can be explained by the fact that the correlations and the multiplicity are measured in the same pseudo-rapidity region and that the probability distribution of the number of multi-parton interactions is steeply falling.Under these conditions, high multiplicities are reached through a high number of multi-parton interactions and a higher than average number of fragments per parton.This is also consistent with our observation that the number of trigger particles above a p T threshold (0.7 GeV considered here) increases stronger than linearly with multiplicity.The symmetric bin p T,trig , p T,assoc > 0.7 GeV/c has been used to reduce the multiplicative effect of fragmentation and to determine the number of uncorrelated trigger particles.The latter increases linearly with multiplicity up to the highest multiplicities where it starts to level off.This effect is observed for all centre-ofmass energies.Interpreted within the PYTHIA model of multi-parton interactions this is evidence for a limitation of the number of MPIs above a certain threshold.Independent of its physical interpretation the observed systematics are important for any study performed as a function of multiplicity.
We have compared our results to the event generators PYTHIA6, PYTHIA8, and PHOJET.While the constant, combinatorial background of the correlation is described fairly well by all models, the models have difficulties to describe the per-trigger pair-yields in the near-side peak and the away-side peaks.The PYTHIA tunes reproduce the centre-of-mass dependence of the near and the away-side pair yield.PHOJET overestimates the increase of the near-side yield with the centre-of-mass energy, while it does not show any centre-of-mass dependence of the away-side yield.The development of the number of uncorrelated seeds with charged particle multiplicity is described well by all models.These findings are expected to provide important input for future Monte Carlo tunes and will help to constrain the models used in these generators.

Fig. 1 :
Fig. 1: Left panel: illustration of the contributions to the per-trigger pair yield as a function of ∆ϕ.Right panel: the per-trigger pair yield as a function of ∆ϕ described by the fit function and its sub-components (see text).

Fig. 3 :Fig. 4 :
Fig. 3: Linear dependence between N uncorrelated seeds and N MPI in PYTHIA6 Perugia-0 simulations on generator level.Left panel: N uncorrelated seeds versus N MPI for different |η|-ranges.Right panel: N uncorrelated seeds versus N MPI for different p T, trig -thresholds.

Fig. 8 :
Fig. 8: Left panel: simulated correlation matrix.Right panel: normalized and extended correlation matrix.Input for the extension are Gaussian distributions with extrapolated N charged and σ N charged .

Fig. 10 :
Fig. 10: Per-trigger near-side pair yield (top row), per-trigger away-side pair yield (second row), per-trigger pair yield in the combinatorial background (third row), average number of trigger particles and average number of uncorrelated seeds (bottom row) measured at √ s = 7 TeV.

Fig. 11 :
Fig. 11: Per-trigger near-side pair yield (top row), per-trigger away-side pair yield (second row), per-trigger pair yield in the combinatorial background (third row), average number of trigger particles and average number of uncorrelated seeds (bottom row) measured at √ s = 2.76 TeV.

Fig. 12 :Fig. 13 :
Fig.12: Per-trigger near-side pair yield (top row), per-trigger away-side pair yield (second row), per-trigger pair yield in the combinatorial background (third row), average number of trigger particles and average number of uncorrelated seeds (bottom row) measured at √ s = 0.9 TeV.

Fig. 14 :
Fig.14: Per-trigger away-side pair yield as a function of the charged particle multiplicity measured for √ s = 0.9, 2.76, and 7 TeV.

Fig. 15 :
Fig.15: Per-trigger pair yield in the combinatorial background as a function of the charged particle multiplicity measured for √ s = 0.9, 2.76, and 7 TeV.

Fig. 16 :
Fig.16: Average number of trigger particles per event as a function of the charged particle multiplicity measured for √ s = 0.9, 2.76, and 7 TeV.

Fig. 17 :
Fig. 17:Average number of uncorrelated seeds per event as a function of the charged particle multiplicity measured for √ s = 0.9, 2.76, and 7 TeV.

Fig. 18 :
Fig. 18: Residual between the number of uncorrelated seeds and linear fit functions.

Table 1 :
Number of pp minimum bias events after event selection for the data sets at √ s = 0.9, 2.76, and 7 TeV.The track selection used in the last event selection step is described in section 3.2.

Table 3 :
Main contributions to the track-to-particle correction averaged over p T > 0.2 GeV/c, |η| < 0.9, and charged particle multiplicities N charged .