Measurement of the differential cross-sections of prompt and non-prompt production of J /ψ and ψ( 2S ) in pp collisions at √ s = 7 and 8 TeV with the ATLAS detector

The production rates of prompt and non-prompt J /ψ and ψ( 2S ) mesons in their dimuon decay modes are measured using 2.1 and 11.4 fb − 1 of data collected with the ATLAS experiment at the Large Hadron Collider, in proton– proton collisions at √ s = 7 and 8 respectively. Production cross-sections for prompt as well as non-prompt sources, ratios of ψ( 2S ) to J /ψ production, and the fractions of non-prompt production for J /ψ and ψ( 2S ) are measured as a function of meson transverse momentum and rapidity. The measurements are compared to theoretical predictions.


Introduction
Measurements of heavy quark-antiquark bound states (quarkonia) production processes provide an insight into the nature of quantum chromodynamics (QCD) close to the boundary between the perturbative and non-perturbative regimes. More than forty years since the discovery of the J/ψ, the investigation of hidden heavy-flavour production in hadronic collisions still presents significant challenges to both theory and experiment.
In high-energy hadronic collisions, charmonium states can be produced either directly by short-lived QCD sources ("prompt" production), or by long-lived sources in the decay chains of beauty hadrons ("non-prompt" production). These can be separated experimentally using the distance between the proton-proton primary interaction and the decay vertex of the quarkonium state. While Fixed-Order with Next-to-Leading-Log (FONLL) calculations [1,2], made within the framework of perturbative QCD, have been quite successful in describing non-prompt production of various quarkonium states, a satisfactory understanding of the prompt production mechanisms is still to be achieved.
Early attempts to describe the formation of charmonium [25-32] using leading-order perturbative QCD gave rise to a variety of models, none of which could explain the large pro-duction cross-sections measured at the Tevatron [3,13,[21][22][23]. Within the colour-singlet model (CSM) [33], next-tonext-to-leading-order (NNLO) contributions to the hadronic production of S-wave quarkonia were calculated without introducing any new phenomenological parameters. However, technical difficulties have so far made it impossible to perform the full NNLO calculation, or to extend those calculations to the P-wave states. So it is not entirely surprising that the predictions of the model underestimate the experimental data for inclusive production of J/ψ and ϒ states, where the feed-down is significant, but offer a better description for ψ(2S) production [18,34].
Non-relativistic QCD (NRQCD) calculations that include colour-octet (CO) contributions [35] introduce a number of phenomenological parameters -long-distance matrix elements (LDMEs) -which are determined from fits to the experimental data, and can hence describe the crosssections and differential spectra satisfactorily [36]. However, the attempts to describe the polarization of S-wave quarkonium states using this approach have not been so successful [37], prompting a suggestion [38] that a more coherent approach is needed for the treatment of polarization within the QCD-motivated models of quarkonium production.
Neither the CSM nor the NRQCD model gives a satisfactory explanation for the measurement of prompt J/ψ production in association with the W [39] and Z [40] bosons: in both cases, the measured differential cross-section is larger than theoretical expectations [41][42][43][44]. It is therefore important to broaden the scope of comparisons between theory and experiment by providing a variety of experimental information about quarkonium production across a wider kinematic range. In this context, ATLAS has measured the inclusive differential cross-section of J/ψ production, with 2.3 pb −1 of integrated luminosity [18], at √ s = 7 TeV using the data collected in 2010, as well as the differential cross-sections of the production of χ c states (4.5 fb −1 ) [14], and of the ψ(2S) in its J/ψπ π decay mode (2.1 fb −1 ) [9], at √ s = 7 TeV with data collected in 2011. The cross-section and polarization measurements from CDF [4], CMS [6,7,45,46], LHCb [8,10,12,47-49] and ALICE [5,50,51], cover a considerable variety of charmonium production characteristics in a wide kinematic range (transverse momentum p T ≤ 100 GeV and rapidities |y| < 5), thus providing a wealth of information for a new generation of theoretical models. This paper presents a precise measurement of J/ψ and ψ(2S) production in the dimuon decay mode, both at √ s = 7 TeV and at √ s = 8 TeV. It is presented as a doubledifferential measurement in transverse momentum and rapidity of the quarkonium state, separated into prompt and non-prompt contributions, covering a range of transverse momenta 8 < p T ≤ 110 GeV and rapidities |y| < 2.0. The ratios of ψ(2S) to J/ψ cross-sections for prompt and non-prompt processes are also reported, as well as the nonprompt fractions of J/ψ and ψ(2S).

The ATLAS detector
The ATLAS experiment [52] is a general-purpose detector consisting of an inner tracker, a calorimeter and a muon spectrometer. The inner detector (ID) directly surrounds the interaction point; it consists of a silicon pixel detector, a semiconductor tracker and a transition radiation tracker, and is embedded in an axial 2 T magnetic field. The ID covers the pseudorapidity 1 range |η| = 2.5 and is enclosed by a calorimeter system containing electromagnetic and hadronic sections. The calorimeter is surrounded by a large muon spectrometer (MS) in a toroidal magnet system. The MS consists of monitored drift tubes and cathode strip chambers, designed to provide precise position measurements in the bending plane in the range |η| < 2.7. Momentum measurements in the muon spectrometer are based on track segments formed in at least two of the three precision chamber planes.
The ATLAS trigger system [53] is separated into three levels: the hardware-based Level-1 trigger and the two-stage High Level Trigger (HLT), comprising the Level-2 trigger and Event Filter, which reduce the 20 MHz proton-proton collision rate to several-hundred Hz of events of interest for data recording to mass storage. At Level-1, the muon trigger searches for patterns of hits satisfying different transverse momentum thresholds with a coarse position resolution but a fast response time using resistive-plate chambers and thingap chambers in the ranges |η| < 1.05 and 1.05 < |η| < 2.4, respectively. Around these Level-1 hit patterns "Regions-of-Interest" (RoI) are defined that serve as seeds for the HLT muon reconstruction. The HLT uses dedicated algorithms to incorporate information from both the MS and the ID, achieving position and momentum resolution close to that provided by the offline muon reconstruction.

Candidate selection
The analysis is based on data recorded at the LHC in 2011 and 2012 during proton-proton collisions at centre-of-mass 1 ATLAS uses a right-handed coordinate system with its origin at the nominal interaction point (IP) in the centre of the detector and the z-axis along the beam pipe. The x-axis points from the IP to the centre of the LHC ring, and the y-axis points upward. Cylindrical coordinates (r, φ) are used in the transverse plane, φ being the azimuthal angle around the beam pipe. The pseudorapidity η is defined in terms of the polar angle θ as η = − ln tan(θ/2) and the transverse momentum p T is defined as p T = p sin θ. The rapidity is defined as y = 0.5 ln (E + p z ) / (E − p z ) , where E and p z refer to energy and longitudinal momentum, respectively. The η-φ distance between two particles is defined as R = ( η) 2 + ( φ) 2 . energies of 7 and 8 TeV, respectively. This data sample corresponds to a total integrated luminosity of 2.1 and 11.4 fb −1 for 7 and 8 TeV data, respectively.
Events were selected using a trigger requiring two oppositely charged muon candidates, each passing the requirement p T > 4 GeV. The muons are constrained to originate from a common vertex, which is fitted with the track parameter uncertainties taken into account. The fit is required to satisfy χ 2 < 20 for the one degree of freedom.
For 7 TeV data, the Level-1 trigger required only spatial coincidences in the MS [54]. For 8 TeV data, a 4 GeV muon p T threshold was also applied at Level-1, which reduced the trigger efficiency for lowp T muons.
The offline analysis requires events to have at least two muons, identified by the muon spectrometer and with matching tracks reconstructed in the ID [55]. Due to the ID acceptance, muon reconstruction is possible only for |η| < 2.5. The selected muons are further restricted to |η| < 2.3 to ensure high-quality tracking and triggering, and to reduce the contribution from misidentified muons. For the momenta of interest in this analysis (corresponding to muons with a transverse momentum of at most O(100) GeV), measurements of the muons are degraded by multiple scattering within the MS and so only the ID tracking information is considered. To ensure accurate ID measurements, each muon track must fulfil muon reconstruction and selection requirements [55]. The pairs of muon candidates satisfying these quality criteria are required to have opposite charges.
In order to allow an accurate correction for trigger inefficiencies, each reconstructed muon candidate is required to match a trigger-identified muon candidate within a cone of R = ( η) 2 + ( φ) 2 = 0.01. Dimuon candidates are obtained from muon pairs, constrained to originate from a common vertex using ID track parameters and uncertainties, with a requirement of χ 2 < 20 of the vertex fit for the one degree of freedom. All dimuon candidates with an invariant mass within 2.6 < m(μμ) < 4.0 GeV and within the kinematic range p T (μμ) > 8 GeV, |y(μμ)| < 2.0 are retained for the analysis. If multiple candidates are found in an event (occurring in approximately 10 −6 of selected events), all candidates are retained. The properties of the dimuon system, such as invariant mass m(μμ), transverse momentum p T (μμ), and rapidity |y(μμ)| are determined from the result of the vertex fit.

Methodology
The measurements are performed in intervals of dimuon p T and absolute value of the rapidity (|y|). The term "prompt" refers to the J/ψ or ψ(2S) states -hereafter called ψ to refer to either -are produced from short-lived QCD decays, including feed-down from other charmonium states as long as they are also produced from short-lived sources. If the decay chain producing a ψ state includes long-lived particles such as b-hadrons, then such ψ mesons are labelled as "non-prompt". Using a simultaneous fit to the invariant mass of the dimuon and its "pseudo-proper decay time" (described below), prompt and non-prompt signal and background contributions can be extracted from the data.
The probability for the decay of a particle as a function of proper decay time t follows an exponential distribution, p(t) = 1/τ B ·e −t/τ B where τ B is the mean lifetime of the particle. For each decay, the proper decay time can be calculated as t = Lm/ p, where L is the distance between the particle production and decay vertices, p is the momentum of the particle, and m is its invariant mass. As the reconstruction of nonprompt ψ mesons, such as b-hadrons, does not fully describe the properties of the parent, the transverse momentum of the dimuon system and the reconstructed dimuon invariant mass are used to construct the "pseudo-proper decay time", is the signed projection of the distance of the dimuon decay vertex from the primary vertex, L, onto its transverse momentum, p T (μμ). This is a good approximation of using the parent b-hadron information when the ψ and parent momenta are closely aligned, which is the case for the values of ψ transverse momenta considered here, and τ therefore can be used to distinguish statistically between the non-prompt and prompt processes (in which the latter are assumed to decay with vanishingly small lifetime). If the event contains multiple primary vertices [52], the primary vertex closest in z to the dimuon decay vertex is selected. The effect of selecting an incorrect vertex has been shown [56] to have a negligible impact on the extraction of prompt and non-prompt contributions. If any of the muons in the dimuon candidate contributes to the construction of the primary vertex, the corresponding tracks are removed and the vertex is refitted.

Double differential cross-section determination
The double differential dimuon prompt and non-prompt production cross-sections times branching ratio are measured separately for J/ψ and ψ(2S) mesons according to the equations: where Ldt is the integrated luminosity, p T and y are the interval sizes in terms of dimuon transverse momentum and rapidity, respectively, and N p(np) ψ is the number of observed prompt (non-prompt) ψ mesons in the slice under study, corrected for acceptance, trigger and reconstruction efficiencies. The intervals in y combine the data from negative and positive rapidities.
The determination of the cross-sections proceeds in several steps. First, a weight is determined for each selected dimuon candidate equal to the inverse of the total efficiency for each candidate. The total weight, w tot , for each dimuon candidate includes three factors: the fraction of produced ψ → μ + μ − decays with both muons in the fiducial region p T (μ) > 4 GeV and |η(μ)| < 2.3 (defined as acceptance, A), the probability that a candidate within the acceptance satisfies the offline reconstruction selection ( reco ), and the probability that a reconstructed event satisfies the trigger selection ( trig ). The weight assigned to a given candidate when calculating the cross-sections is therefore given by: After the weight determination, an unbinned maximumlikelihood fit is performed to these weighted events in each ( p T (μμ), |y(μμ)|) interval using the dimuon invariant mass, m(μμ), and pseudo-proper decay time, τ (μμ), observables. The fitted yields of J/ψ → μ + μ − and ψ(2S) → μ + μ − are determined separately for prompt and non-prompt processes. Finally, the differential cross-section times the ψ → μ + μ − branching fraction is calculated for each state by including the integrated luminosity and the p T and rapidity interval widths as shown in Eqs. (1) and (2).

Non-prompt fraction
The non-prompt fraction f ψ b is defined as the number of nonprompt ψ (produced via the decay of a b-hadron) divided by the number of inclusively produced ψ decaying to muon pairs after applying weighting corrections: where this fraction is determined separately for J/ψ and ψ(2S). Determining the fraction from this ratio is advantageous since acceptance and efficiencies largely cancel and the systematic uncertainty is reduced.

Ratio of ψ(2S) to J/ψ production
The ratio of ψ(2S) to J/ψ production, in their dimuon decay modes, is defined as: is the number of prompt (non-prompt) J/ψ or ψ(2S) mesons decaying into a muon pair in an interval of p T and y, corrected for selection efficiencies and acceptance.
For the ratio measurements, similarly to the non-prompt fraction, the acceptance and efficiency corrections largely cancel, thus allowing a more precise measurement. The theoretical uncertainties on such ratios are also smaller, as several dependencies, such as parton distribution functions and b-hadron production spectra, largely cancel in the ratio.

Acceptance
The kinematic acceptance A for a ψ → μ + μ − decay with p T and y is given by the probability that both muons pass the fiducial selection ( p T (μ) > 4 GeV and |η(μ)| < 2.3). This is calculated using generator-level "accept-reject" simulations, based on the analytic formula described below. Detectorlevel corrections, such as bin migration effects due to detector resolution, are found to be small. They are applied to the results and are also considered as part of the systematic uncertainties.
The acceptance A depends on five independent variables (the two muon momenta are constrained by the m(μμ) mass condition), chosen as the p T , |y| and azimuthal angle φ of the ψ meson in the laboratory frame, and two angles characterizing the ψ → μ + μ − decay, θ and φ , described in detail in Ref. [57]. The angle θ is the angle between the direction of the positive-muon momentum in the ψ rest frame and the momentum of the ψ in the laboratory frame, while φ is defined as the angle between the dimuon production and decay planes in the laboratory frame. The ψ production plane is defined by the momentum of the ψ in the laboratory frame and the positive z-axis direction. The distributions in θ and φ differ for various possible spin-alignment scenarios of the dimuon system.
The spin-alignment of the ψ may vary depending on the production mechanism, which in turn affects the angular distribution of the dimuon decay. Predictions of various theoretical models are quite contradictory, while the recent experimental measurements [7] indicate that the angular dependence of J/ψ and ψ(2S) decays is consistent with being isotropic.
The coefficients λ θ , λ φ and λ θφ in are related to the spin-density matrix elements of the dimuon spin wave function.
Since the polarization of the ψ state may affect acceptance, seven extreme cases that lead to the largest possible variations of acceptance within the phase space of this measurement are identified. These cases, described in Table 1, are used to define a range in which the results may vary under any physically allowed spin-alignment assumptions. The same technique has also been used in other measurements [9,14,34]. This analysis adopts the isotropic distribution in both cos θ and φ as nominal, and the variation of the results for a number of extreme spin-alignment scenarios is studied and presented as sets of correction factors, detailed further in "Appendix".
For each of the two mass-points (corresponding to the J/ψ and ψ(2S) masses), two-dimensional maps are produced as a function of dimuon p T (μμ) and |y(μμ)| for the set of spinalignment hypotheses. Each point on the map is determined from a uniform sampling over φ and cos θ , accepting those trials that pass the fiducial selections. To account for various spin-alignment scenarios, all trials are weighted according to Eq. 3. Acceptance maps are defined within the range 8 < p T (μμ) < 110 GeVand |y(μμ)| < 2.0, corresponding to the data considered in the analysis. The map is defined by 100 slices in |y(μμ)| and 4400 in p T (μμ), using 200k trials for each point, resulting in sufficiently high precision that the statistical uncertainty can be neglected. Due to the contributions of background, and the detector resolution of the signal, the acceptance for each candidate is determined from a linear interpolation of the two maps, which are generated for the J/ψ and ψ(2S) known masses, as a function of the reconstructed mass m(μμ). Figure 1 shows the acceptance, projected in p T for all the spin-alignment hypotheses for the J/ψ meson. The differences between the acceptance of the ψ(2S) and J/ψ meson, are independent of rapidity, except near |y| ≈ 2 at low p T . Similarly, the only dependence on p T is found below p T ≈ 9 GeV. The correction factors (as given in "Appendix") vary most at low p T , ranging from −35 % under longitudinal, to +100 % for transverse-positive scenarios. At high  Fig. 1 Projections of the acceptance as a function of p T for the J/ψ meson for various spin-alignment hypotheses p T , the range is between −14 % for longitudinal, and +9 % for transverse-positive scenarios. For the fraction and ratio measurements, the correction factor is determined from the appropriate ratio of the individual correction factors.

Muon reconstruction and trigger efficiency determination
The technique for correcting the 7 TeV data for trigger and reconstruction inefficiencies is described in detail in Refs. [9,34]. For the 8 TeV data, a similar technique is used, however different efficiency maps are required for each set of data, and the 8 TeV corrections are detailed briefly below. The single-muon reconstruction efficiency is determined from a tag-and-probe study in dimuon decays [40]. The efficiency map is calculated as a function of p T (μ) and q ×η(μ), where q = ±1 is the electrical charge of the muon, expressed in units of e.
The trigger efficiency correction consists of two components. The first part represents the trigger efficiency for a single muon in intervals of p T (μ) and q × η(μ). For the dimuon system there is a second correction to account for reductions in efficiency due to closely spaced muons firing only a single RoI, vertex-quality cuts, and opposite-sign requirements. This correction is performed in three rapidity intervals: 0-1.0, 1.0-1.2 and 1.2-2.3. The correction is a function of R(μμ) in the first two rapidity intervals and a function of R(μμ) and |y(μμ)| in the last interval.
The combination of the two components (single-muon efficiency map and dimuon corrections) is illustrated in Fig. 2 by plotting the average trigger-weight correction for the events in this analysis in terms of p T (μμ) and |y(μμ)|. The increased weight at low p T and |y| ≈ 1.25 is caused by the geometrical acceptance of the muon trigger system and the turn-on threshold behaviour of the muon trigger. At high =8 TeV, 11.4 fb s Fig. 2 Average dimuon trigger-weight in the intervals of p T (μμ) and |y(μμ)| studied in this set of measurements p T the weight is increased due to the reduced opening angle between the two muons.

Fitting technique
To extract the corrected yields of prompt and non-prompt J/ψ and ψ(2S) mesons, two-dimensional weighted unbinned maximum-likelihood fits are performed on the dimuon invariant mass, m(μμ), and pseudo-proper decay time, τ (μμ), in intervals of p T (μμ) and |y(μμ)|. Each interval is fitted independently from all the others. In m(μμ), signal processes of ψ meson decays are statistically distinguished as narrow peaks convolved with the detector resolution, at their respective mass positions, on top of background continuum. In τ (μμ), decays originating with zero pseudoproper decay time and those following an exponential decay distribution (both convolved with a detector resolution function) statistically distinguish prompt and non-prompt signal processes, respectively. Various sources of background processes include Drell-Yan processes, mis-reconstructed muon pairs from prompt and non-prompt sources, and semileptonic decays from separate b-hadrons.
The probability density function (PDF) for each fit is defined as a normalized sum, where each term represents a specific signal or background contribution, with a physically motivated mass and τ dependence. The PDF can be written in a compact form as where κ i represents the relative normalization of the i th term of the seven considered signal and background contributions (such that i κ i = 1), f i (m) is the mass-dependent term, and ⊗ represents the convolution of the τ -dependent function h i (τ ) with the τ resolution term, R(τ ). The latter is modelled Table 2 Description of the fit model PDF in Eq. 4. Components of the probability density function used to extract the prompt (P) and nonprompt (NP) contributions for J/ψ and ψ(2S) signal and the P, NP, and incoherent or mis-reconstructed background (Bkg) contributions by a double Gaussian distribution with both means fixed to zero and widths determined from the fit. Table 2 lists the contributions to the overall PDF with the corresponding f i and h i functions. Here G 1 and G 2 are Gaussian functions, B 1 and B 2 are Crystal Ball 2 distributions [58], while F is a uniform distribution and C 1 a first-order Chebyshev polynomial. The exponential functions E 1 , E 2 , E 3 , E 4 and E 5 have different decay constants, where E 5 (|τ |) is a double-sided exponential with the same decay constant on either side of τ = 0. The parameter ω represents the fractional contribution of the B and G mass signal functions, while the Dirac delta function, δ(τ ), is used to represent the pseudo-proper decay time distribution of the prompt candidates.
In order to make the fitting procedure more robust and to reduce the number of free parameters, a number of component terms share common parameters, which led to 22 free parameters per interval. In detail, the signal mass models are described by the sum of a Crystal Ball shape (B) and a Gaussian shape (G). For each of J/ψ and ψ(2S), the B and G share a common mean, and freely determined widths, with the ratio of the B and G widths common to J/ψ and ψ(2S). The B parameters α, and n, describing the transition point of the low-edge from a Gaussian to a power-law shape, and the shape of the tail, respectively, are fixed, and variations are considered as part of the fit model systematic uncertainties. The width of G for ψ(2S) is set to the width for J/ψ multiplied by a free parameter scaling term. The relative fraction of B and G is left floating, but common to J/ψ and ψ(2S).
The non-prompt signal decay shapes (E 1 ,E 2 ) are described by an exponential function (for positive τ only) convolved with a double Gaussian function, R(τ ) describing 2 The Crystal Ball function is given by: The background contributions are described by a prompt and non-prompt component, as well as a double-sided exponential function convolved with a double Gaussian function describing mis-reconstructed or non-coherent muon pairs. The same resolution function as in signal is used to describe the background. For the non-resonant mass parameterizations, the non-prompt contribution is modelled by a firstorder Chebyshev polynomial. The prompt mass contribution follows a flat distribution and the double-sided background uses an exponential function. Variations of this fit model are considered as systematic uncertainties.
The following quantities are extracted directly from the fit in each interval: the fraction of events that are signal (prompt or non-prompt J/ψ or ψ(2S)); the fraction of signal events that are prompt; the fraction of prompt signal that is ψ(2S); and the fraction of non-prompt signal that is ψ(2S). From these parameters, and the weighted sum of events, all measured values are calculated.
For 7 TeV data, 168 fits are performed across the range of 8 < p T < 100 GeV (8 < p T < 60 GeV) for J/ψ (ψ(2S)) and 0 < |y| < 2. For 8 TeV data, 172 fits are performed across the range of 8 < p T < 110 GeV and 0 < |y| < 2, excluding the area where p T is less than 10 GeV and simultaneously |y| is greater than 0.75. This region is excluded due to a steeply changing low trigger efficiency causing large systematic uncertainties in the measured crosssection. Figure 3 shows the fit results for one of the intervals considered in the analysis, projected onto the invariant mass and pseudo-proper decay time distributions, for 7 TeV data, weighted according to the acceptance and efficiency corrections. The fit projections are shown for the total prompt and total non-prompt contributions (shown as curves), and also for the individual contributions of the J/ψ and ψ(2S) prompt and non-prompt signal yields (shown as hashed areas of various types).
In Fig. 4 the fit results are shown for one highp T interval of 8 TeV data.

Bin migration corrections
To account for bin migration effects due to the detector resolution, which results in decays of ψ in one bin, being identified and accounted for in another, the numbers of acceptanceand efficiency-corrected dimuon decays extracted from the fits in each interval of p T (μμ) and rapidity are corrected for the differences between the true and reconstructed values of the dimuon p T . These corrections are derived from data by comparing analytic functions that are fitted to the p T (μμ) spectra of dimuon events with and without convolution by the experimental resolution in p T (μμ) (as determined from the fitted mass resolution and measured muon angular resolutions), as described in Ref. [34].
The correction factors applied to the fitted yields deviate from unity by no more than 1.5 %, and for the majority of slices are smaller than 1 %. The ratio measurement and nonprompt fractions are corrected by the corresponding ratios of bin migration correction factors. Using a similar technique, bin migration corrections as a function of |y| are found to differ from unity by negligible amounts.

Systematic uncertainties
The sources of systematic uncertainties that are applied to the ψ double differential cross-section measurements are from uncertainties in: the luminosity determination; muon and trigger efficiency corrections; inner detector tracking efficiencies; the fit model parametrization; and due to bin migration corrections. For the non-prompt fraction and ratio measurements the systematic uncertainties are assessed in the same manner as for the uncertainties on the cross-section, except that in these ratios some systematic uncertainties, such as the luminosity uncertainty, cancel out. The sources of systematic uncertainty evaluated for the prompt and nonprompt ψ cross-section measurements, along with the minimum, maximum and median values, are listed in Table 3. The largest contributions, which originate from the trigger and fit model uncertainties, are typically for the high p T intervals and are due to the limited statistics of the efficiency maps (for the trigger), and the data sample (for the fit model). Figures 5 and 6 show, for a representative interval, the impact of the considered uncertainties on the production cross-section, as well as the non-prompt fraction and ratios for 7 TeV data. The impact is very similar at 8 TeV.
Luminosity. The uncertainty on the integrated luminosity is 1.8 % (2.8 %) for the 7 TeV (8 TeV) data-taking period. The methodology used to determine these uncertainties is described in Ref. [59]. The luminosity uncertainty is only applied to the J/ψ and ψ(2S) cross-section results.
Muon reconstruction and trigger efficiencies. To determine the systematic uncertainty on the muon reconstruction and trigger efficiency maps, each of the maps is reproduced in 100 pseudo-experiments. The dominant uncertainty in each bin is statistical and hence any bin-to-bin correlations are neglected. For each pseudo-experiment a new map is created by varying independently each bin content according to a Gaussian distribution about its estimated value, determined from the original map. In each pseudo-experiment, the total weight is recalculated for each dimuon p T and |y| interval of the analysis. The RMS of the total weight pseudo-experiment distributions for each efficiency type is used as the systematic uncertainty, where any correlation effects between the muon and trigger efficiencies can be neglected.
The ID tracking efficiency is in excess of 99.5 % [34], and an uncertainty of 1 % is applied to account for the ID dimuon reconstruction inefficiency (0.5 % per muon, added coherently). This uncertainty is applied to the differential cross-sections and is assumed to cancel in the fraction of non-prompt to inclusive production for J/ψ and ψ(2S) and in the ratios of ψ(2S) to J/ψ production.   For the trigger efficiency trig , in addition to the trigger efficiency map, there is an additional correction term that accounts for inefficiencies due to correlations between the two trigger muons, such as the dimuon opening angle. This correction is varied by its uncertainty, and the shift in the resultant total weight relative to its central value is added in quadrature to the uncertainty from the map. The choice of triggers is known [60] to introduce a small lifetimedependent efficiency loss but it is determined to have a negligible effect on the prompt and non-prompt yields and no correction is applied in this analysis. Similarly, the muon reconstruction efficiency corrections of prompt and non-prompt signals are found to be consistent within the statistical uncertainties of the efficiency measurements, and no additional uncertainty is applied.  Fig. 6 Breakdown of the contributions to the fractional uncertainty on the non-prompt fractions for J/ψ (top left) and ψ(2S) (top right), and the prompt (bottom left) and non-prompt (bottom right) ratios for 7 TeV, shown for the region 0.75 < |y| < 1.00 lifetime model); and variation of the mixing terms for the two Gaussian components of this term.
Of the variations considered, it is typically the parametrizations of the signal mass model and pseudo-proper decay time resolution model that dominate the contribution to the fit model uncertainty.
Bin migrations. As the corrections to the results due to bin migration effects are factors close to unity in all regions, the difference between the correction factor and unity is applied as the uncertainty.
The variation of the acceptance corrections with spinalignment is treated separately, and scaling factors supplied in "Appendix".

Production cross-sections
Figures 7 and 8 show respectively the prompt and non-prompt differential cross-sections of J/ψ and ψ(2S) as functions of p T and |y|, together with the relevant theoretical predictions, which are described below.

Non-prompt production fractions
The results for the fractions of non-prompt production relative to the inclusive production of J/ψ and ψ(2S) are pre- [nb GeV  [nb GeV  [nb GeV [nb GeV  sented as a function of p T for slices of rapidity in Fig. 9. In each rapidity slice, the non-prompt fraction is seen to increase as a function of p T and has no strong dependence on either rapidity or centre-of-mass energy.
Production ratios of ψ(2S) to J/ψ Figure 10 shows the ratios of ψ(2S) to J/ψ decaying to a muon pair in prompt and non-prompt processes, presented as a function of p T for slices of rapidity. The non-prompt ratio is shown to be relatively flat across the considered range of p T , for each slice of rapidity. For the prompt ratio, a slight increase as a function of p T is observed, with no strong dependence on rapidity or centre-of-mass energy.

Comparison with theory
For prompt production, as shown in Fig. 11, the ratio of the NLO NRQCD theory calculations [61] to data, as a function of p T and in slices of rapidity, is provided for J/ψ and ψ(2S) at both the 7 and 8 TeV centre-of-mass energies. The theory predictions are based on the long-distance matrix elements (LDMEs) from Refs. [61,62], with uncertainties originating from the choice of scale, charm quark mass and LDMEs [nb GeV  [nb GeV  [nb GeV [nb GeV   [61,62] for more details). Figure 11 shows fair agreement between the theoretical calculation and the data points for the whole p T range. The ratio of theory to data does not depend on rapidity.
For non-prompt ψ production, comparisons are made to FONLL theoretical predictions [1,2], which describe the production of b-hadrons followed by their decay into ψ + X . Figure 12 shows the ratios of J/ψ and ψ(2S) FONLL predictions to data, as a function of p T and in slices of rapidity, for centre-of-mass energies of 7 and 8 TeV. For J/ψ, agreement is generally good, but the theory predicts slightly harder p T spectra than observed in the data. For ψ(2S), the shapes of data and theory appear to be in satisfactory agreement, but the theory predicts higher yields than in the data. There is no observed dependence on rapidity in the comparisons between theory and data for non-prompt J/ψ and ψ(2S) production.

Comparison of cross-sections 8 TeV with 7 TeV
It is interesting to compare the cross-section results between the two centre-of-mass energies, both for data and the theoretical predictions.   Figure 13 shows the 8-7 TeV cross-section ratios of prompt and non-prompt J/ψ and ψ(2S) for both data sets. For the theoretical ratios the uncertainties are neglected here, since the high correlation between them results in large cancellations.
Due to a finer granularity in p T for the 8 TeV data, a weighted average of the 8 TeV results is taken across equivalent intervals of the 7 TeV data to enable direct comparisons. Both data and theoretical predictions agree that the ratios become larger with increasing p T , however at the lower edge of the p T range the data tends to be slightly below theory.

Summary and conclusions
The prompt and non-prompt production cross-sections, the non-prompt production fraction of the J/ψ and ψ(2S) decaying into two muons, the ratio of prompt ψ(2S) to prompt J/ψ production, and the ratio of non-prompt ψ(2S) to non-prompt J/ψ production were measured in the rapidity range |y| < 2.0 for transverse momenta between 8 and 110 GeV. This measurement was carried out using 2.1fb −1 (11.4fb −1 ) of pp collision data at a centre-of-mass energy of 7 TeV (8 TeV) recorded by the ATLAS experiment at the LHC. It is the latest in a series of related mea-  surements of the production of charmonium states made by ATLAS. In line with previous measurements, the central values were obtained assuming isotropic ψ → μμ decays. Correction factors for these cross-sections, computed for a number of extreme spin-alignment scenarios, are between −35 and +100 % at the lowest transverse momenta studied, and between −14 and +9 % at the highest transverse momenta, depending on the specific scenario.
The ATLAS measurements presented here extend the range of existing measurements to higher transverse momenta, and to a higher collision energy of √ s = 8 TeV, and, in over-lapping phase-space regions, are consistent with previous measurements made by ATLAS and other LHC experiments. For the prompt production mechanism, the predictions from the NRQCD model, which includes colour-octet contributions with various matrix elements tuned to earlier collider data, are found to be in good agreement with the observed data points. For the non-prompt production, the fixed-order next-to-leading-logarithm calculations reproduce the data reasonably well, with a slight overestimation of the differential cross-sections at the highest transverse momenta reached in this analysis.   Fig. 13 The ratio of the 8 and 7 TeV differential cross-sections are presented for prompt (top) and non-prompt (bottom) J/ψ (left) and ψ(2S) (right) for both data (red points with error bars) and theoretical predictions (green points). The theoretical predictions used are NRQCD for prompt and FONLL for non-prompt production. The uncertainty on the data ratio does not account for possible correlations between 7 and 8 TeV data, and no uncertainty is shown for the ratio of theory predictions Acknowledgments We thank CERN for the very successful operation of the LHC, as well as the support staff from our institutions without whom ATLAS could not be operated efficiently. We acknowledge the support of ANPCyT, Argentina; YerPhI, Armenia; ARC, Australia; Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecomm ons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. Funded by SCOAP 3 .

Appendix: Spin-alignment correction factors
The measurement presented here assumes an unpolarized spin-alignment hypothesis for determining the correction factor. In principle, the polarization may be non-zero and may vary with p T . In order to correct these measurements when well-measured J/ψ and ψ(2S) polarizations are determined, a set of correction factors are provided in Tables 4,  5 Table 20 Mean weight correction factor for J/ψ under the "off-(λ θ -λ φ )-plane positive" spin-alignment hypothesis for 8 TeV. Those intervals not measured in the analysis at low p T , high rapidity are also excluded here