Since it was first observed, the production of heavy quarkonium states in high energy hadronic collisions remains a subject of considerable theoretical and experimental interest [1, 2]. These processes are sensitive to the interaction dynamics both at small and large distances: the production of heavy (c or b) quarks with a high transverse momentum is followed by a bound states formation with a low relative quark momentum. Accordingly, a theoretical description of these processes involves both perturbative and non-perturbative methods, as it was proposed in the non-relativistic QCD (NRQCD) [3,4,5,6,7]. However, it is known that the NRQCD at the next-to-leading order (NLO) accuracy meets difficulties in a simultaneous description of all the collider data in there entirety (see also discussions [8,9,10,11,12,13]). In particular, it has a long-standing challenge in the \(J/\psi \) and \(\psi (2S)\) polarization and provides an inadequate description [14,15,16,17,18,19] of the recent \(\eta _c\) production data taken by the LHCb Collaboration at the LHC [20]. One of possible solutions of the problems mentioned above, which implies a certain modification of the NRQCD rules, has been proposed recently [21]. As it was shown, the approach [21] allows one to describe well the recent data on the production and polarization of the entire charmonia family. The bottomonium production, namely \(\Upsilon (nS)\) and \(\chi _b(mP)\) mesons, provides an alternative laboratory for understanding the physics of the hadronization of heavy quark pairs. Due to heavier masses and a smaller quark relative velocity v (in a produced quarkonium rest frame), these processes could be even a more suitable case to apply the double NRQCD expansion in QCD coupling \(\alpha _s\) and v. The NLO NRQCD predictions for the \(\Upsilon (nS)\) production at the LHC were presented [22,23,24]. Of course, it is important to apply also the approach [21] to the bottomonium family.

Table 1 The NMEs for the \(\Upsilon (1S)\) and \(\chi _b(1P)\) mesons as determined from our fit at \(p_T^{\mathrm{cut}} = 10\) GeV (the “fit A” scenario). The NMEs obtained in the NLO NRQCD [22] are shown for comparison
Table 2 The NMEs for the \(\Upsilon (1S)\) and \(\chi _b(1P)\) mesons as determined from our fit at \(p_T^{\mathrm{cut}} = 10\) GeV (the “fit B” scenario) and only for the KMR density at \(p_T^{\mathrm{cut}} = 0\) GeV (the “fit C” scenario)
Table 3 The dependence of \(\chi ^2/d.o.f.\) achieved in the fit procedure (the “fit A” scenario) on the choice of \(p_T^{\mathrm{cut}}\) at only \(\sqrt{s} = 7\) TeV and at 7 and 13 TeV combined
Table 4 The dependence of \(\chi ^2/d.o.f.\) achieved in the fit procedure (using NMEs from the “fit A” and “fit C” scenarios) on the choice of \(p_T^{\mathrm{cut}}\) for only the ATLAS data and the ATLAS and CMS data combined
Fig. 1
figure 1

Transverse momentum distribution of the inclusive \(\Upsilon (1S)\) production calculated at \(\sqrt{s} = 7\) TeV in the different rapidity regions. The red, green and blue histograms correspond to the predictions obtained with the A0, KMR and JH’2013 set 1 gluon densities. Shaded bands represent the total uncertainties of our calculations, as it is described in the text. The experimental data are from ATLAS [30]

Fig. 2
figure 2

Transverse momentum distribution of the inclusive \(\Upsilon (1S)\) production calculated at \(\sqrt{s} = 7\) TeV (upper histograms) and \(\sqrt{s} = 13\) TeV (lower histograms, divided by 100) in the different rapidity regions. The notation of all histograms is the same as in Fig. 1. The experimental data are from CMS [27, 28]

Fig. 3
figure 3

The ratio \(R^{\chi _b(mP)}_{\Upsilon (1S)}\) calculated as a function of the \(\Upsilon (1S)\) transverse momentum calculated at \(\sqrt{s} = 7\) and 8 TeV using the NMEs from Tables 1 and 2 (for the A0 gluon density). The notation of all histograms is the same as in Fig. 1. The experimental data are from LHCb [33]

Fig. 4
figure 4

The ratio \(R^{\chi _{b2}(1P)}_{\chi _{b1}(1P)}\) calculated as a function of the \(\Upsilon (1S)\) transverse momentum calculated at \(\sqrt{s} = 8\) TeV using the NMEs from Tables 1 and 2 (for the A0 gluon density). The notation of all histograms is the same as in Fig. 1. The experimental data are from CMS [29] and LHCb [34]

Fig. 5
figure 5

Transverse momentum distribution of the inclusive \(\Upsilon (1S)\) production calculated at \(\sqrt{s} = 1.8\), 7, 8 and 13 TeV in the different rapidity regions. The notation of all histograms is the same as in Fig. 1. The experimental data are from CDF [67] and LHCb [31, 32]

Fig. 6
figure 6

Transverse momentum distribution of the inclusive \(\Upsilon (1S)\) production calculated at \(\sqrt{s} = 7\) TeV in the different rapidity regions. The green and purple histograms correspond to the “fit A” and “fit C” predictions obtained with the KMR gluon density. Shaded bands represent the NME uncertainties of our calculations, as it is described in the text. The experimental data are from ATLAS [30]

Fig. 7
figure 7

Transverse momentum distribution of the inclusive \(\Upsilon (1S)\) production calculated at \(\sqrt{s} = 7\) TeV (upper histograms) and \(\sqrt{s} = 13\) TeV (lower histograms, divided by 100) in the different rapidity regions. The notation of all histograms is the same as in Fig. 6. The experimental data are from CMS [27, 28]

Fig. 8
figure 8

The ratio \(R^{\chi _b(mP)}_{\Upsilon (1S)}\) calculated as a function of the \(\Upsilon (1S)\) transverse momentum calculated at \(\sqrt{s} = 7\) and 8 TeV. The notation of all histograms is the same as in Fig. 6. The experimental data are from LHCb [33]

Fig. 9
figure 9

The ratio \(R^{\chi _{b2}(1P)}_{\chi _{b1}(1P)}\) calculated as a function of the \(\Upsilon (1S)\) transverse momentum calculated at \(\sqrt{s} = 8\) TeV. The notation of all histograms is the same as in Fig. 6. The experimental data are from CMS [29] and LHCb [34]

Fig. 10
figure 10

Transverse momentum distribution of the inclusive \(\Upsilon (1S)\) production calculated at \(\sqrt{s} = 1.8\), 7, 8 and 13 TeV in the different rapidity regions. The notation of all histograms is the same as in Fig. 6. The experimental data are from CDF [67] and LHCb [31, 32]

Fig. 11
figure 11

The polarization parameters \(\lambda _\theta \), \(\lambda _\phi \), \(\lambda _{\theta \phi }\) and \({{\tilde{\lambda }}}\) of the \(\Upsilon (1S)\) mesons calculated in the CS frame as functions of its transverse momentum at \(\sqrt{s} = 7\) TeV. The A0 gluon density is used. The blue and red histograms correspond to the predictions obtained at \(|y|<0.6\) and \(0.6<|y|<1.2\), respectively. The experimental data are from CMS [35]

Fig. 12
figure 12

The polarization parameters \(\lambda _\theta \), \(\lambda _\phi \), \(\lambda _{\theta \phi }\) and \({{\tilde{\lambda }}}\) of the \(\Upsilon (1S)\) mesons calculated in the helicity frame as functions of its transverse momentum at \(\sqrt{s} = 7\) TeV. The notation of all histograms is the same as in Fig. 11. The experimental data are from CMS [35]

Fig. 13
figure 13

The polarization parameters \(\lambda _\theta \), \(\lambda _\phi \), \(\lambda _{\theta \phi }\) and \({{\tilde{\lambda }}}\) of the \(\Upsilon (1S)\) mesons calculated in the perpendicular helicity frame as functions of its transverse momentum at \(\sqrt{s} = 7\) TeV. The notation of all histograms is the same as in Fig. 11. The experimental data are from CMS [35]

Fig. 14
figure 14

The polarization parameters \(\lambda _\theta \) and \({{\tilde{\lambda }}}\) of the \(\Upsilon (1S)\) mesons calculated in the helicity frame as functions of its transverse momentum at \(\sqrt{s} = 1.96\) TeV. The notation of all histograms is the same as in Fig. 11. The experimental data are from CDF [36]

Our present work continues the line started in the previous studies [25, 26]. We have considered there the inclusive production of \(\Upsilon (3S)\), \(\Upsilon (2S)\), \(\chi _b(3P)\) and \(\chi _b(2P)\) mesons and now come to \(\Upsilon (1S)\) and \(\chi _b(1P)\) mesons. The motivation for the whole business has been already given [25, 26]. Below we present a systematic analysis of the CMS [27,28,29], ATLAS [30] and LHCb [31,32,33,34] data on the \(\Upsilon (1S)\) and \(\chi _b(1P)\) production collected at \(\sqrt{s} = 7\), 8 and 13 TeV (including the different relative production rates) and we extract from these data non-perturbative matrix elements (NMEs) for the \(\Upsilon (1S)\) and \(\chi _b(1P)\) mesons. Then we make predictions for polarization parameters \(\lambda _\theta \), \(\lambda _\phi \), \(\lambda _{\theta \phi }\) (and a frame-independent parameter \({{\tilde{\lambda }}}\)), which determine the \(\Upsilon (1S)\) spin density matrix and compare them to the currently available data [35, 36]. As it is known, the feed-down contributions from \(\chi _b(2P)\), \(\chi _b(3P)\), \(\Upsilon (2S)\) and \(\Upsilon (3S)\) decays give a significant impact on the \(\Upsilon (1S)\) production and polarization, so studies [25, 26] are important and necessary for our present consideration. Another important issue concerns the relative production rate \(\sigma (\chi _{b2})/\sigma (\chi _{b1})\) recently measured by the CMS [29] and LHCb [34] Collaborations. This ratio is sensitive to the color singlet (CS) and color octet (CO) production mechanisms and provides information complementary to the study of the S-wave bottomonium states.

In the present note we follow mostly the same steps as in [25, 26]. So, to describe the perturbative production of the \(b{{\bar{b}}}\) pair in the hard scattering subprocesses we apply the \(k_T\)-factorization approach [37,38,39,40], which is mainly based on the Balitsky–Fadin–Kuraev–Lipatov (BFKL) [41,42,43] or Ciafaloni–Catani–Fiorani–Marchesini (CCFM) [44,45,46,47] gluon evolution equations. A detailed description and discussion of the different aspects of the \(k_T\)-factorization can be found in the reviews [48]. As usual, we see certain advantages in the ease of including into the calculations a large piece of higher order pQCD corrections taking them into account in the form of transverse momentum dependent (TMD), or unintegrated, gluon densities in a proton. Our consideration is based on the off-shell gluon-gluon fusion subprocesses representing the true leading order (LO) in QCD:

$$\begin{aligned}&g^*(k_1) + g^*(k_2) \rightarrow \Upsilon [{}^3S_1^{(1)}](p) + g(k), \end{aligned}$$
(1)
$$\begin{aligned}&g^*(k_1) + g^*(k_2) \rightarrow \Upsilon [{}^1S_0^{(8)},{}^3S_1^{(8)},{}^3P_J^{(8)}](p). \end{aligned}$$
(2)
$$\begin{aligned}&g^*(k_1) + g^*(k_2) \rightarrow \chi _{bJ}(p)[{}^3P_J^{(1)},{}^3S_1^{(8)}] \rightarrow \Upsilon (p_1) + \gamma (p_2),\nonumber \\ \end{aligned}$$
(3)

where \(J = 0, 1\) or 2 and the four-momenta of all particles are given in the parentheses. The color states taken into account are directly indicated. Both initial gluons are off mass shell, that means that they have non-zero transverse four-momenta \(k_1^2 = - {{\mathbf {k}}}_{1T}^2 \ne 0\), \(k_2^2 = - {{\mathbf {k}}}_{2T}^2 \ne 0\) and an admixture of longitudinal component in the polarization four-vectors (see [37,38,39,40] for more information). The corresponding off-shell (\(k_T\)-dependent) production amplitudes contain projection operators [49,50,51,52] for spin and color, that guarantee the proper quantum numbers of the final state bottomonium. Following the ideas [21], to describe the nonperturbative transformations of the color-octet \(b{{\bar{b}}}\) pairs produced in hard subprocesses into observed final state mesons we employ the classical multipole radiation theory (where the electric dipole E1 transition dominates [53, 54]) under the key physical assumption that the lifetime of intermediate color-octet states is rather long. According to [21], only a single E1 transition is needed to transform a P-wave state into an S-wave state,Footnote 1 whereas the transformation of the color-octet S-wave state into the color-singlet S-wave state is treated as two successive E1 transitions \({}^3S_1^{(8)} \rightarrow {}^3P_J^{(8)} + g\), \({}^3P_J^{(8)} \rightarrow {}^3S_1^{(1)} + g\) proceeding via either of three intermediate \({}^3P_J^{(8)}\) states with \(J = 0,1,2\). An essential consequence of the idea above is the nonconservation of the spin momentum \(S_z\) during the transformation of color octet \(^3S_1^{(8)}\) state into the color-singlet \(^3S_1^{(1)}\) one. In fact, the intermediate P-wave state is a state with definite total momentum J and its projection \(J_z\) rather than a state with definite \(L_z\) and \(S_z\). To describe the formation of the intermediate state, we have to contract the electric dipole transition amplitude [53, 54] (which by its own conserves \(S_z\)) with Clebsch–Gordan coefficients \(|L,\,S,\,J,\,J_z\rangle \langle L,\,L_z,\,S,\,S_z|\) which are symmetric with respect to \(L_z\) and \(S_z\). Then the resulting expression comprises both: the terms containing \(\epsilon ^*(S_z[^3S_1])\cdot \epsilon (S_z[^3P_J])\) and the terms containing \(\epsilon ^*(S_z[^3S_1])\cdot \epsilon (L_z[^3P_J])\). Consequently, there is no direct transfer from the initial spin polarization to the final spin polarization (see [21] for more information).

Below we apply the gauge invariant expressions for quarkonia production and decay amplitudes implemented into the Monte-Carlo event generator pegasus [55]. The derivation steps are explained in [25, 26] in detail.

According to the \(k_T\)-factorization prescription, to calculate the cross sections of a considered process one has to convolute the partonic cross section \({{\hat{\sigma }}}^*\) (related with an off-shell production amplitude) and TMD gluon densities in a proton \(f_g(x, {{\mathbf {k}}}_{T}^2, \mu ^2)\):

$$\begin{aligned} \sigma&= \int dx_1 dx_2 d{{\mathbf {k}}}^2_{1T} d{{\mathbf {k}}}^2_{2T} \, {{\hat{\sigma }}}^*(x_1, x_2, {{\mathbf {k}}}_{1T}^2, {{\mathbf {k}}}_{2T}^2, \mu ^2) \nonumber \\&\quad \times f_g (x_1, {{\mathbf {k}}}_{1T}^2, \mu ^2) f_g(x_2, {{\mathbf {k}}}_{2T}^2, \mu ^2){{d\phi _1}\over {2\pi }}{{d\phi _2}\over {2\pi }}, \end{aligned}$$
(4)

where \(x_1\) and \(x_2\) are the longitudinal momentum fractions of initial off-shell gluons, \(\phi _1\) and \(\phi _2\) are their azimuthal angles and \(\mu \) is the hard interaction scale. Following [25, 26], we have tested several sets of TMD gluon densities in a proton. Two of them (A0 [56] and JH’2013 set 1 [57]) were obtained from the CCFM equation where all input parameters were fitted to the proton structure function \(F_2(x,Q^2)\). We have applied the TMD gluon densities obtained within the Kimber–Martin–Ryskin (KMR) prescription [58,59,60], which provides a method to construct the TMD quark and gluon distributions from the conventional (collinear) ones. For the input, we have applied the recent LO NNPDF3.1 set [61]. The parton level calculations according to (4) were performed using the Monte-Carlo generator pegasus. Of course, we take into account the feed-down contributions from \(\chi _b(3P)\), \(\chi _b(2P)\), \(\chi _b(1P)\), \(\Upsilon (3S)\) and \(\Upsilon (2S)\) decays.

Numerically, everywhere we set the masses \(m_{\Upsilon (1S)} = 9.4603\) GeV, \(m_{\Upsilon (2S)}\,{=}\,10.02326\) GeV, \(m_{\Upsilon (3S)}\,{=}\,10.3552\) GeV, \(m_{\chi _{b1}(3P)} = 10.512\) GeV, \(m_{\chi _{b2}(3P)} = 10.522\) GeV, \(m_{\chi _{b0}(2P)} = 10.232\) GeV, \(m_{\chi _{b1}(2P)} = 10.255\) GeV, \(m_{\chi _{b2}(2P)} = 10.268\) GeV, \(m_{\chi _{b0}(1P)} = 9.8594\) GeV, \(m_{\chi _{b1}(1P)} = 9.8928\) GeV, \(m_{\chi _{b2}(1P)} = 9.9122\) GeV [62] and adopt the usual non-relativistic approximation \(m_b = m_{{{\mathcal {Q}}}}/2\) for the beauty quark mass, where \(m_{{{\mathcal {Q}}}}\) is the mass of bottomonium \({{\mathcal {Q}}}\). We set the necessary branching ratios as they are given in [62]. Note that there is no experimental data for the branching ratios of \(\chi _b(3P)\), so we use the results of assumption [23] that the total decay widths of \(\chi _b(mP)\) are approximately independent on m. So, we have \(B(\chi _{b1}(3P)\rightarrow \Upsilon (1S) + \gamma ) = 0.0381\) and \(B(\chi _{b2}(3P) \rightarrow \Upsilon (1S) + \gamma ) = 0.0192\) [23]. We use the one-loop formula for the QCD coupling \(\alpha _s\) with \(n_f = 4(5)\) quark flavours at \(\Lambda _{\mathrm{QCD}} = 250(167)\) MeV for A0 (KMR) gluon density and two-loop expression for \(\alpha _s\) with \(n_f = 4\) and \(\Lambda _{\mathrm{QCD}} = 200\) MeV for JH’2013 set 1. We set the color-singlet NMEs \(\langle {{\mathcal {O}}}^{\Upsilon (1S)}[{}^{3}S_1^{(1)}]\rangle = 8.39\) GeV\(^3\) and \(\langle {{\mathcal {O}}}^{\chi _{b0}(1P)}[{}^{3}P_0^{(1)}]\rangle = 2.30\) GeV\(^5\) as obtained from the potential model calculations [63]. All the NMEs for \(\Upsilon (2S)\), \(\Upsilon (3S)\), \(\chi _b(2P)\) and \(\chi _b(3P)\) mesons were derived in [25, 26].

To determine the NMEs for both \(\Upsilon (1S)\) and \(\chi _b(1P)\) mesons we have performed a global fit to the \(\Upsilon (1S)\) production data at the LHC. We have included in the fitting procedure the \(\Upsilon (1S)\) transverse momentum distributions measured by the CMS [27, 28] and ATLAS [30] Collaborations at \(\sqrt{s} = 7\) and 13 TeV. Similar to the NRQCD analyses [22,23,24], we have excluded from our fit the low \(p_T\) region and considered only data at \(p_T > p_T^{\mathrm{cut}} = 10\) GeV. We note that at low transverse momenta a more accurate treatment of large logarithms \(\sim \ln m^2_{\Upsilon }/p^2_T\) and other nonperturbative effects become necessary. To determine NMEs for \(\chi _b(1P)\) mesons, we also included into the fit the recent LHCb data [33] on the radiative \(\chi _b(1P) \rightarrow \Upsilon (1S) + \gamma \) decays collected at \(\sqrt{s} = 7\) and 8 TeV and the recent CMS [29] and LHCb data [34] on the ratio \(\sigma (\chi _{b2}(1P))/\sigma (\chi _{b1}(1P))\) collected at \(\sqrt{s} = 8\) TeV.

Our analysis strategy is the following. First, we found that the \(p_T\) shape of the direct \(\Upsilon [^3S_1^{(8)}]\) and feed-down \(\chi _b[^3S_1^{(8)}]\) contributions to the \(\Upsilon (1S)\) production is almost the same in all kinematical regions probed at the LHC. Thus, the ratio

$$\begin{aligned} r{=}{\sum _{J = 0}^{2} (2J{+}1) \, B(\chi _{bJ}(1P) \rightarrow \Upsilon (1S){+}\gamma ) d\sigma [\chi _{bJ}(1P), {}^3S_1^{(8)}]/dp_T \over d\sigma [\Upsilon (1S), {}^3S_1^{(8)}]/dp_T}\nonumber \\ \end{aligned}$$
(5)

can be well approximated by a constant for a wide \(\Upsilon (1S)\) transverse momentum \(p_T\) and rapidity y range at different energies. For example, we estimate the mean-square average \(r = 1.743 \pm 0.010\) for the A0 set, which is practically the same for all other TMD gluon densities in a proton. So, we construct a linear combination

$$\begin{aligned} M_r = \langle {{\mathcal {O}}}^{\Upsilon (1S)}[{}^{3}S_1^{(8)}]\rangle + r \langle {{\mathcal {O}}}^{\chi _{b0}(1P)}[{}^{3}S_1^{(8)}]\rangle , \end{aligned}$$
(6)

which can be only extracted from the measured \(\Upsilon (1S)\) transverse momentum distributions. Note that here we considered the color singlet wave functions of \(\chi _b(1P)\) mesons as independent (not necessarily identical) free parameters, as it was proposed [64] to describe the LHC data on relative \(\sigma (\chi _{c2})/\sigma (\chi _{c1})\) production rate. Of course, we understand that doing so is at odds with the Heavy Quark Effective Theory (HQET) and Heavy Quark Spin Symmetry (HQSS). However, it was argued [64] that the HQSS predictions must not be taken for granted.Footnote 2 Thus, here we try two alternative scenarios for the mesons. We assume HQSS violation either solely for the color singlet states (“fit A”) or for both color singlet and color octet states (“fit B”). In the latter case, the color octet NMEs for \(\chi _b(1P)\) mesons are also treated as independent parameters not related to each other through the \((2J+1)\) factor. Thus, we introduce the ratios:

$$\begin{aligned} r^{CO}_0= & {} { B(\chi _{b0}(1P) \rightarrow \Upsilon (1S) + \gamma ) d\sigma [\chi _{b0}(1P), {}^3S_1^{(8)}]/dp_T \over d\sigma [\Upsilon (1S), {}^3S_1^{(8)}]/dp_T }, \nonumber \\ \end{aligned}$$
(7)
$$\begin{aligned} r^{CO}_1= & {} { B(\chi _{b1}(1P) \rightarrow \Upsilon (1S) + \gamma ) d\sigma [\chi _{b1}(1P), {}^3S_1^{(8)}]/dp_T \over d\sigma [\Upsilon (1S), {}^3S_1^{(8)}]/dp_T },\nonumber \\ \end{aligned}$$
(8)
$$\begin{aligned} r^{CO}_2= & {} { B(\chi _{b2}(1P) \rightarrow \Upsilon (1S) + \gamma ) d\sigma [\chi _{b2}(1P), {}^3S_1^{(8)}]/dp_T \over d\sigma [\Upsilon (1S), {}^3S_1^{(8)}]/dp_T },\nonumber \\ \end{aligned}$$
(9)

and obtain the mean-square average values \(r^{CO}_0 = 0.01737 \pm 0.00006\), \(r^{CO}_1 = 0.304 \pm 0.002\) and \(r^{CO}_2 = 0.1626 \pm 0.0010\) (for the A0 gluon density). Then, instead of (6), we have a modified linear combination for the color octet NMEs:

$$\begin{aligned} M_{r} = \langle {{\mathcal {O}}}^{\Upsilon (1S)}[{}^{3}S_1^{(8)}]\rangle + \sum \limits _{J = 0}^{2} r^{CO}_J \langle {{\mathcal {O}}}^{\chi _{bJ}(1P)}[{}^{3}S_1^{(8)}]\rangle .\nonumber \\ \end{aligned}$$
(10)

Next, we found that the \(p_T\) shapes of the direct \(\Upsilon [^3P_J^{(8)}]\), feed-down \(\chi _b[^3P_1^{(1)}]\) and \(\chi _b[^3P_2^{(1)}]\) contributions to the \(\Upsilon (1S)\) production are also the same in all kinematical regions. So, the ratios

$$\begin{aligned} r_1= & {} { B(\chi _{b2}(1P) \rightarrow \Upsilon (1S) + \gamma ) d\sigma [\chi _{b2}(1P), {}^3P_2^{(1)}]/dp_T \over B(\chi _{b1}(1P) \rightarrow \Upsilon (1S) + \gamma ) d\sigma [\chi _{b1}(1P), {}^3P_1^{(1)}]/dp_T },\nonumber \\ \end{aligned}$$
(11)
$$\begin{aligned} r_2= & {} { \sum \limits _{J = 0}^{2} (2J+1) \, d\sigma [\Upsilon (1S), {}^3P_J^{(8)}]/dp_T \over B(\chi _{b1}(1P) \rightarrow \Upsilon (1S) + \gamma ) d\sigma [\chi _{b1}(1P), {}^3P_1^{(1)}]/dp_T }\nonumber \\ \end{aligned}$$
(12)

can be approximated by constants for a wide \(\Upsilon (1S)\) transverse momentum \(p_T\) and rapidity y range at different energies. For example, we estimate the mean-square average \(r_1 = 0.91 \pm 0.02\) and \(r_2 = 104 \pm 2\) for the A0 set. Then we construct a linear combination

$$\begin{aligned} M_{r_1r_2}= & {} \langle {{\mathcal {O}}}^{\chi _{b1}(1P)}[{}^{3}P_1^{(1)}]\rangle + r_1 \langle {{\mathcal {O}}}^{\chi _{b2}(1P)}[{}^{3}P_2^{(1)}]\rangle \nonumber \\&+ r_2 \langle {{\mathcal {O}}}^{\Upsilon (1S)}[{}^{3}P_0^{(8)}]\rangle , \end{aligned}$$
(13)

which can be extracted from the measured \(\Upsilon (1S)\) transverse momentum distributions. As the next step, we use the recent LHCb data [33] on the ratio of \(\Upsilon (1S)\) mesons originating from the \(\chi _b(1P)\) radiative decays measured at \(\sqrt{s} = 7\) and 8 TeV:

$$\begin{aligned} R^{\chi _b(1P)}_{\Upsilon (1S)}&= \sum \limits _{J = 1}^{2} {\sigma (pp\rightarrow \chi _{bJ}(1P) + X) \over \sigma (pp \rightarrow \Upsilon (1S) + X)} \nonumber \\&\quad \times B(\chi _{bJ} \rightarrow \Upsilon (1S) + \gamma ). \end{aligned}$$
(14)

In the “fit A” scenario, from the known \(M_r\), \(M_{r_1r_2}\) and \(R^{\chi _b(1P)}_{\Upsilon (1S)}\) values one can separately determine the \(\langle {{\mathcal {O}}}^{\Upsilon (1S)}[{}^{3}S_1^{(8)}]\rangle \), \(\langle {{\mathcal {O}}}^{\chi _{b0}(1P)}[{}^{3}S_1^{(8)}]\rangle \), \(\langle {{\mathcal {O}}}^{\Upsilon (1S)}[{}^{3}P_0^{(8)}]\rangle \) and the linear combination \(M_{CS} = \langle {{\mathcal {O}}}^{\chi _{b1}(1P)}[{}^{3}P_1^{(1)}]\rangle + r_1 \langle {{\mathcal {O}}}^{\chi _{b2}(1P)}[{}^{3}P_2^{(1)}]\rangle \). In the case of “fit B”, we can determine the \(\langle {{\mathcal {O}}}^{\Upsilon (1S)}[{}^{3}S_1^{(8)}]\rangle \), \(\langle {{\mathcal {O}}}^{\chi _{b0}(1P)}[{}^{3}S_1^{(8)}]\rangle \), \(\langle {{\mathcal {O}}}^{\Upsilon (1S)}[{}^{3}P_0^{(8)}]\rangle \) and two linear combinations \(M_{CS} = \langle {{\mathcal {O}}}^{\chi _{b1}(1P)}[{}^{3}P_1^{(1)}]\rangle + r_1 \langle {{\mathcal {O}}}^{\chi _{b2}(1P)}[{}^{3}P_2^{(1)}]\rangle \) and \(M_{CO} = \langle {{\mathcal {O}}}^{\chi _{b1}(1P)}[{}^{3}S_1^{(8)}]\rangle + r^{CO}_2/r^{CO}_1 \langle {{\mathcal {O}}}^{\chi _{b2}(1P)}[{}^{3}S_1^{(8)}]\rangle \). Finally, we use recent CMS [29] and LHCb data [34] measured at \(\sqrt{s} = 8\) TeV on the ratio

$$\begin{aligned} R^{\chi _{b2}(1P)}_{\chi _{b1}(1P)} = {\sigma (\chi _{b2}(1P)) \over \sigma (\chi _{b1}(1P))}. \end{aligned}$$
(15)

From the known \(M_{CS}\), \(\langle {{\mathcal {O}}}^{\chi _{b0}(1P)}[{}^{3}S_1^{(8)}]\rangle \) and \(R^{\chi _{b2}(1P)}_{\chi _{b1}(1P)}\) values one can separately determine the \(\langle {{\mathcal {O}}}^{\chi _{b1}(1P)}[{}^{3}P_1^{(1)}]\rangle \) and \(\langle {{\mathcal {O}}}^{\chi _{b2}(1P)}[{}^{3}P_2^{(1)}]\rangle \) values for the first fit. For the second one we use only the CMS data [29], because the LHCb data [34] are very few and only increase the total error of the fitted quantities. So, from the known \(M_{CS}\), \(M_{CO}\) and \(R^{\chi _{b2}(1P)}_{\chi _{b1}(1P)}\) we determine the \(\langle {{\mathcal {O}}}^{\chi _{b1}(1P)}[{}^{3}P_1^{(1)}]\rangle \), \(\langle {{\mathcal {O}}}^{\chi _{b2}(1P)}[{}^{3}P_2^{(1)}]\rangle \), \(\langle {{\mathcal {O}}}^{\chi _{b1}(1P)}[{}^{3}S_1^{(8)}]\rangle \), \(\langle {{\mathcal {O}}}^{\chi _{b2}(1P)}[{}^{3}S_1^{(8)}]\rangle \) values. Therefore, we have reconstructed the full map of the NMEs for both \(\Upsilon (1S)\) and \(\chi _b(1P)\) mesons.

The fitting procedure described above was separately done in each of the rapidity subdivisions (using the fitting algorithm as implemented in the commonly used gnuplot package [66]) under the requirement that all the NMEs are strictly positive. Then, the mean-square average of the fitted values was taken. The corresponding uncertainties are estimated in the conventional way using Student’s t-distribution at the confidence level \(P = 80\)%. The results of our fits are collected in Tables 1 and 2. For comparison, we also presented there the NMEs obtained in the conventional NLO NRQCD by other authors [22]. Note that the results [22] were obtained from the fit on the same data set as in our analysis. The corresponding \(\chi ^2/d.o.f.\) are listed in Table 3, where we additionally show their dependence on the minimal \(\Upsilon (1S)\) transverse momenta involved into the fit \(p_T^{\mathrm{cut}}\). As one can see, the \(\chi ^2/d.o.f.\) tends to stay the same or slightly increase when \(p_T^{\mathrm{cut}}\) grows up and the best fit of the LHC data is achieved with the A0 and KMR gluon, although other gluon densities also return reliable \(\chi ^2/d.o.f.\) values. We note that including into the fit the latest CMS data [28] taken at \(\sqrt{s} = 13\) TeV leads to 2–3 times higher values of \(\chi ^2/d.o.f.\), as it was with the data on \(\Upsilon (2S)\) [26]. We have checked that this is true for both the \(k_T\)-factorization and collinear approachesFootnote 3 and, therefore, it could be a sign of some inconsistency between these CMS data and other measurements.

Both fit scenarios result in unequal values for \(\chi _{b1}(1P)\) and \(\chi _{b2}(1P)\) color singlet wave functions. So, for “fit A” we achieved the ratio \(\langle {{\mathcal {O}}}^{\chi _{b2}(1P)}[{}^{3}P_2^{(1)}]\rangle : \langle {{\mathcal {O}}}^{\chi _{b1}(1P)}[{}^{3}P_1^{(1)}]\rangle : \langle {{\mathcal {O}}}^{\chi _{b0}(1P)}[{}^{3}P_0^{(1)}]\rangle \sim 2.6 : 4.8 : 1\) for the JH’2013 set 1, \(\sim 2.6 : 3.9 : 1\) for the KMR and \(\sim 1 : 3 : 1\) for the A0 gluon densities, respectively. This is an obvious contradiction with naive expectations based on the number of spin degrees of freedom, \(\sim 5 : 3 : 1\). The difference between the predictions for this ratio obtained with the considered TMD gluon densities could be a sign of a sensitivity of the relative production rate \(R^{\chi _{b2}(1P)}_{\chi _{b1}(1P)}\) to the gluon distributions and/or due to lack of the experimental data. If we assume the HQSS violation in the color octet sector as well (the “fit B” scenario), the fitted values of the color singlet NMEs of the \(\chi _{b1}(1P)\) and \(\chi _{b2}(1P)\) mesons also differ from each other (see Table 2). The latter qualitatively agrees with the observations [64, 65] done in the case of the \(\chi _c\) mesons.

All the data used in the fits above are compared with our predictions in Figs. 1, 2, 3 and 4. The shaded areas represent the theoretical uncertainties of our calculations, which include the uncertainties coming from the NME fitting procedure and the scale uncertainties. To estimate the latter, the standard variations in default renormalization scale (which is set to be equal to \(\mu _R^2 = m_{{{\mathcal {Q}}}}^2 + {{\mathbf {p}}}_T^2\)), namely, \(\mu _R\rightarrow 2\mu _R\) or \(\mu _R\rightarrow \mu _R/2\) were introduced with replacing the A0 and JH’2013 set 1 gluon densities by the A0\(+\) and JH’2013 set 1\(+\), or by the A0− and JH’2013 set 1− ones. This was done to preserve the intrinsic correspondence between the TMD gluon set and the factorization scale taken as \(\mu _F^2 = {{\hat{s}}} + {{\mathbf {Q}}}_T^2\) (where \({{\mathbf {Q}}}_T\) is the net transverse momentum of incoming off-shell gluon pair) according to the TMD gluon fits (see [56, 57] for more information). Of course, in the case of KMR gluons both factorization and renormalization scales have been varied to estimate the scale uncertainties. One can see that we have achieved a reasonably good description of the CMS [27, 28] and ATLAS [30] data for the \(\Upsilon (1S)\) transverse momentum distributions in the whole \(p_T\) range within the experimental and theoretical uncertainties. The relative production rates \(R^{\chi _{b2}(1P)}_{\chi _{b1}(1P)}\) measured by the CMS [29] and LHCb [34] Collaborations and the \(R^{\chi _b(1P)}_{\Upsilon (1S)}\) ratios measured by the LHCb Collaboration [33] at \(\sqrt{s} = 7\) and 8 TeV are also reproduced well. However, our predictions for the \(R^{\chi _b(2P)}_{\Upsilon (1S)}\) and \(R^{\chi _b(3P)}_{\Upsilon (1S)}\) rates tend to overestimate a bit the LHCb data [33], although they are rather close to the measurements within the uncertainties bands (see Fig. 3). The same situation is observed in the conventional NRQCD scenario, where the NLO NRQCD calculations [22,23,24] also overestimate the experimental data for the \(R^{\chi _b(2P)}_{\Upsilon (1S)}\) and \(R^{\chi _b(3P)}_{\Upsilon (1S)}\) rates. Evaluation of these observables involves the NMEs for \(\Upsilon (2S)\), \(\Upsilon (3S)\), \(\chi _b(2P)\) and \(\chi _b(3P)\) mesons determined previously [25, 26]. Of course, both scenarios, “fit A” and “fit B”, lead to exactly the same results for the Y(1S) transverse momentum distributions due to full correspondence between (6) and (10). The corresponding predictions differ to each other for the relative production rates \(R^{\chi _b(mP)}_{\Upsilon (nS)}\) and/or \(R^{\chi _{b2}(1P)}_{\chi _{b1}(1P)}\) only (see Figs. 3 and 4).

In addition, we have checked our results with the data, not included into the fit procedure: namely, the rather old CDF data [67] taken at \(\sqrt{s} = 1.8\) TeV and the LHCb data [31, 32] taken in the forward rapidity region \(2< y < 4.5\) at \(\sqrt{s} = 7\), 8 and 13 TeV (see Fig. 5). As one can see, we acceptably describe all the data above. Moreover, we find that the KMR gluon density does a much better job here than the JH’2013 set 1 or A0 distributions. Remarkably, the KMR gluon is only one TMD gluon density which is able to reproduce well the measurements in the low \(p_T\) region.

Based on that we have investigated the sensitivity of our fit to the low \(p_T\) region using the KMR gluon density function. To do this we include the low \(p_T\) region into our fit (“fit C”). Our results can be find in Table 2 and Figs. 6, 7, 8, 9 and 10, where we used the KMR gluon density using the NMEs from the “fit A” and “fit C”. One can see that both fit scenarios give overall almost the same results, and only for the ratios \(R^{\chi _b(2P)}_{\Upsilon (1S)}\) and \(R^{\chi _b(3P)}_{\Upsilon (1S)}\) the “fit A” gives slightly better results. However, the uncertainties for the “fit C” are larger than for the “fit A” scenario. Additionally, we present the corresponding \(\chi ^2/d.o.f.\) using the NMEs from the “fit A” for A0, JH’2013 set 1, KMR distributions and from the “fit C” for KMR only, that are listed in Table 4. We should note that using the ATLAS [30] and CMS [27, 28] data give us lower values of \(\chi ^2/d.o.f.\) than using only the CMS data. This is due to large uncertainties of the ATLAS data in the low \(p_T\) region. One can see that \(\chi ^2/d.o.f.\) for the “fit C” are larger not only compared with the results using the NMEs from the “fit A” scenario in the low \(p_T\) region, but also compared with NMEs from Table 1 in the \(p_T>10\) GeV region. These results justify our exclusion of the low \(p_T\) region from our fit.

Now we turn to the polarization of \(\Upsilon (1S)\) mesons at the LHC conditions. It is well known that the polarization of any vector meson can be described with three parameters \(\lambda _\theta \), \(\lambda _\phi \) and \(\lambda _{\theta \phi }\), which determine the spin density matrix of a meson decaying into a lepton pair and can be measured experimentally. The double differential angular distribution of the decay leptons can be written as [68]:

$$\begin{aligned} {{d\sigma }\over {d\cos \theta ^*d\phi ^*}}&\sim {{1}\over {3+\lambda _\theta }}(1 + \lambda _\theta \cos ^2\theta ^* + \lambda _\phi \sin ^2\theta ^*\cos 2\phi ^* \nonumber \\&\quad + \lambda _{\theta \phi }\sin 2\theta ^*\cos \phi ^*), \end{aligned}$$
(16)

where \(\theta ^*\) and \(\phi ^*\) are the polar and azimuthal angles of the decay lepton measured in the meson rest frame. The case of \((\lambda _\theta \), \(\lambda _\phi \), \(\lambda _{\theta \phi }) = (0,0,0)\) corresponds to an unpolarized state, while \((\lambda _\theta \), \(\lambda _\phi \), \(\lambda _{\theta \phi }) = (1,0,0)\) and \((\lambda _\theta \), \(\lambda _\phi \), \(\lambda _{\theta \phi }) = (-1,0,0)\) refer to fully transverse and fully longitudinal polarizations. The CMS Collaboration has measured all of these polarization parameters for the \(\Upsilon (1S)\) mesons as functions of their transverse momentum in three complementary frames: the Collins-Soper, helicity and perpendicular helicity ones at \(\sqrt{s} = 7\) TeV [35]. The CDF Collaboration has measured the polarization parameters in the helicity frame at \(\sqrt{s} = 1.96\) TeV [36]. The frame-independent parameter \({{\tilde{\lambda }}} = (\lambda _\theta + 3\lambda _\phi )/(1 - \lambda _\phi )\) has been additionally studied. As it was done previously [25, 26], to estimate \(\lambda _\theta \), \(\lambda _\phi \), \(\lambda _{\theta \phi }\) and \({{\tilde{\lambda }}}\) we generally follow the experimental procedure. We collect the simulated events in the kinematical region defined by the experimental setup, generate the decay lepton angular distributions according to the production and decay matrix elements and then apply a three-parametric fit based on (16).

Our predictions are shown in Figs. 11, 12, 13 and 14. The calculations were performed using the A0 gluon density which provides the best description of the measured \(\Upsilon (1S)\) transverse momenta distributions at the LHC conditions. The NMEs from Table 1 (the “fit A” scenario) were applied. As one can see, we find only a weak or zero polarization in the all kinematic regions, that perfectly agrees with the CMS and CDF measurements. This agreement shows no fundamental problems in describing the \(\Upsilon (1S)\) polarization data. Moreover, the calculated polarization parameters \(\lambda _\theta \), \(\lambda _\phi \), \(\lambda _{\theta \phi }\) and \({{\tilde{\lambda }}}\) are stable with respect to variations in the model parameters. In fact, there is no dependence on the strong coupling constant and/or TMD gluon densities in a proton. As it was already pointed out above, our results for \(\lambda _\theta \), \(\lambda _\phi \), \(\lambda _{\theta \phi }\) and \({{\tilde{\lambda }}}\) are based on the key assumption [21] that the intermediate color octet states are states with a definite total angular momentum J and its projection \(J_z\), rather than states with definite projections of a spin \(S_z\) and orbital angular momentum \(L_z\). Given that, the transition amplitudes only involve the polarization vector associated with \(J_z\) and not with \(L_z\). As a result, we have no conservation of \(S_z\) in the electric dipole transitions. Under this assumption, we have achieved a reasonable simultaneous description for all of the available data for the \(\Upsilon (1S)\) and \(\chi _b(1P)\) mesons (the transverse momentum distributions, relative production rates and polarization observables). We have obtained earlier similar results for charmonia (\(J/\psi \), \(\psi ^\prime \), \(\chi _c\)), \(\Upsilon (2S)\) and \(\Upsilon (3S)\) polarizations [25, 26, 65, 69, 70]. Thus, keeping in mind the remarkable absence of tension with the \(\eta _c\) production data (see [69, 70]), one can conclude that the approach [21] results in the self-consistent and simultaneous description of charmonium and bottomonium data and therefore can be considered as providing an easy and natural solution to the long-standing quarkonia production and polarization puzzle.