Abstract
The production cross section of a W boson in association with two b jets is measured using a sample of proton–proton collisions at \(\sqrt{s} = 8{\,\mathrm{{TeV}}} \) collected by the CMS experiment at the CERN LHC. The data sample corresponds to an integrated luminosity of 19.8\(\,\text {fb}^\text {-1}\). The W bosons are reconstructed via their leptonic decays, \(\mathrm {W}\rightarrow \ell \nu \), where \(\ell =\mu \) or \(\mathrm {e}\). The fiducial region studied contains exactly one lepton with transverse momentum \(p_{\mathrm {T}} ^{\ell }>30\,\mathrm{{GeV}} \) and pseudorapidity \(|\eta ^{\ell } |<2.1\), with exactly two b jets with \(p_{\mathrm {T}} >25\,\mathrm{{GeV}} \) and \(|\eta |<2.4\) and no other jets with \(p_{\mathrm {T}} >25\,\mathrm{{GeV}} \) and \(|\eta |<4.7\). The cross section is measured to be \(\sigma ( {{\mathrm {p}\mathrm {p}}} \rightarrow {\mathrm {W}} (\ell \nu )\)+\({\mathrm{{b}}\mathrm{{\overline{b}}}})= 0.64 \pm 0.03{\,\mathrm{{(stat)}}} \pm 0.10{\,\mathrm{{(syst)}}} \pm 0.06{\,\mathrm{{(theo)}}} \pm 0.02{\,\mathrm{{(lumi)}}}\,{\text {pb}}\), in agreement with standard model predictions.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
The measurement of \(\mathrm {W}\) or \(\mathrm {Z} \) boson production in association with b quarks in proton–proton collisions provides important input for refinement of calculations in perturbative quantum chromodynamics and is also relevant for searches and measurements. In particular, these processes constitute a background to the experimental measurement of a standard model (SM) Higgs boson in which the Higgs boson decays into a \({\mathrm{b} \mathrm{{\overline{b}}}}\) pair in association with a vector boson. The discovery by the ATLAS and CMS Collaborations at the CERN LHC of a neutral boson with a mass of about \(125\,\mathrm{{GeV}} \) [1,2,3,4] motivates further studies to establish the nature of the boson and determine its coupling to bottom quarks. Furthermore, different models based on extensions of the Higgs sector are being compared with LHC data using final states composed of leptons and b jets. In this context, a better understanding of the b hadron production mechanism and the kinematic properties of associated jets is required to refine the background predictions and increase the sensitivity to new physics. Throughout this paper, hadronic showers originating from bottom or anti-bottom quarks are referred to as b jets, and b-tagged jets are the reconstructed objects either in simulation or data that have been identified as such.
The production of \(\mathrm {W}\) [5, 6] or \(\mathrm {Z} \) [7,8,9,10,11] bosons in association with b jets has been measured at the LHC using pp collisions at \(\sqrt{s}=7{\,\mathrm{{TeV}}} \) using data samples corresponding to an integrated luminosity of up to 5\(\,\text {fb}^\text {-1}\), and at the Fermilab Tevatron [12, 13] using proton–antiproton collisions at \(\sqrt{s}=1.96{\,\mathrm{{TeV}}} \). This analysis extends previous measurements of the \({\mathrm {W}}+{\mathrm{{b}}\mathrm{{\overline{b}}}}\) production cross section [5] and uses data at \(\sqrt{s}=8{\,\mathrm{{TeV}}} \) collected with the CMS detector, corresponding to an integrated luminosity of 19.8\(\,\text {fb}^\text {-1}\) [14]. Whereas the previous CMS analysis used only the muon decay channel, this analysis uses both muon and electron decay modes.
2 CMS detector
The central feature of the CMS apparatus is a superconducting solenoid of 6 m internal diameter, providing a magnetic field of 3.8 T. Within the solenoid volume are a silicon pixel and strip tracker, a lead tungstate crystal electromagnetic calorimeter (ECAL), and a brass and scintillator hadron calorimeter (HCAL), each composed of a barrel and two endcap sections. Muons are measured in gas-ionization detectors embedded in the steel flux-return yoke outside the solenoid. Extensive forward calorimetry complements the coverage provided by the barrel and endcap detectors. A more detailed description of the CMS detector, together with a definition of the coordinate system used and the relevant kinematic variables, can be found in Ref. [15].
3 Event selection and reconstruction
The \(\mathrm {W}\rightarrow \mathrm {\mu }\nu _\mathrm {\mu }\) (\(\mathrm {W}\rightarrow \mathrm {e}\nu _\mathrm {e}\)) events are selected using single-muon (single-electron) triggers that require a loosely isolated muon (electron) with transverse momentum \(p_{\mathrm {T}} >24\,(27)\,\mathrm{{GeV}} \) and pseudorapidity \(|\eta |<2.1\,(2.5)\).
Individual particles emerging from each collision are reconstructed with the particle-flow (PF) technique [16, 17]. This approach uses the information from all subdetectors to identify and reconstruct individual particle candidates in the event, classifying them into mutually exclusive categories: charged and neutral hadrons, photons, electrons, and muons.
Muons are reconstructed by combining the information from the tracker and the muon spectrometer [18, 19]. Electrons are reconstructed by combining the information from the tracker and the calorimeter [20]. Both the muon and the electron candidates are required to have \(p_{\mathrm {T}} >30\,\mathrm{{GeV}} \) and \(|\eta |<2.1\) to ensure that the triggers are fully efficient. They are also required to originate from the primary vertex of the event, chosen as the vertex with the highest \(\sum p_{\mathrm {T}} ^2\) of the charged particles associated with it. Furthermore, the leptons must be isolated, where the isolation variable is defined as
with the sums running over PF candidates in a cone of size \(\varDelta R < 0.4\,(0.3)\) around the muon (electron) direction, where \(\varDelta R = \sqrt{\smash [b]{ (\varDelta \eta )^2 + (\varDelta \phi )^2}}\), and \(\phi \) is the azimuthal angle in radians. The first three sums are over charged hadron candidates associated with the primary vertex, photon candidates, and neutral hadron candidates respectively. The definition of the isolation includes a correction for additional pp interactions, referred to as pileup, which is proportional to the scalar \(p_{\mathrm {T}}\) sum of charged particles not associated with the primary vertex in the isolation cone (\(\sum p_{\mathrm {T}} ^{\mathrm {PU}}\)). The selected muons (electrons) are required to have \(I < 0.12\,(0.10)\).
Missing transverse momentum in the event, \({\vec p}_{\mathrm {T}}^\mathrm{{miss}} \), is defined as the negative vector sum of the \(\vec {p_{\mathrm {T}}}\) of all PF candidates in the event. It is combined with the \(\vec {p_{\mathrm {T}}}\) of a muon or electron passing the identification and isolation requirements to compute the transverse mass, \(M_\mathrm {T}\), of the \(\mathrm {W}\) boson candidate. The \(M_\mathrm {T}\) variable is a natural discriminator against non-\(\mathrm {W}\) final states, such as quantum chromodynamics (QCD) multijet events, that have a lepton candidate and \({\vec p}_{\mathrm {T}}^\mathrm{{miss}} \), but a relatively low value of \(M_\mathrm {T} \). The result for \({\vec p}_{\mathrm {T}}^\mathrm{{miss}} \) is corrected for noise in the ECAL and HCAL using the method described in Ref. [21]. Corrections to minimize the effect of the pileup are also included [22].
Jets are constructed using the anti-\(k_{\mathrm {T}}\) clustering algorithm [23] with a radius parameter of 0.5, as implemented in the FastJet package [24, 25]. Jet clustering is performed using individual particle candidates reconstructed with the PF technique. Jets are required to pass identification criteria that eliminate jets originating from noisy channels in the HCAL [26]. Jets from pileup interactions are rejected by requiring that the jets originate at the primary interaction vertex. Small corrections to the relative and absolute jet energy calibrations of the detector are applied as a function of the \(p_{\mathrm {T}} \) and \(\eta \) of the jet [27].
The combined secondary vertex (CSV) b tagging algorithm [28, 29] exploits the long lifetime and relatively large mass of b hadrons to provide b jet identification. The CSV algorithm combines information about impact parameter significance, secondary vertex kinematic properties, and jet kinematic properties in a likelihood-ratio discriminator. The identification of b jets (b tagging) is made by imposing a minimum threshold on the CSV discriminator value. In this analysis, b-tagged jets are required to pass a threshold with an efficiency of 40% in the signal phase space and a misidentification probability of 0.1% (1%) for light (charm) jets. Jets are corrected for the difference in efficiency between data and simulation using scale factors dependent on the \(p_{\mathrm {T}}\) of the jet.
4 Simulated samples
After all selection requirements detailed in Sect. 5 are applied, the contributing background processes to the overall yield are the associated production of a massive vector boson and jets (\(\mathrm {V}\)+jets where V = \(\mathrm {W}\) or \(\mathrm {Z}\)), as well as production of diboson (\(\mathrm {W^+}{}\mathrm {W^-} \), \(\mathrm {W}{}\mathrm {Z} \), \(\mathrm {Z}{}\mathrm {Z} \)), \(\mathrm{{t}}\mathrm{{\overline{t}}}\), single top quark, \(\gamma \)+jets, and QCD multijet events. These background contributions are estimated from simulation, except for the QCD background, which is estimated from data as described in Sect. 5.
Simulated samples of \(\mathrm {V}\)+jets, \(\gamma \)+jets and \(\mathrm{{t}\mathrm{{\overline{t}}}}+\mathrm{{jets}}\) are generated at tree-level with MadGraph 5.1 [30, 31] using the CTEQ6L [32] parton distribution function (PDF) set. These samples are interfaced with pythia 6.4 [33] for hadronization using the Z2* tune for the underlying event. The most recent pythia Z2* tune is derived from the Z1 tune [34], which uses the CTEQ5L PDF set, whereas Z2* adopts CTEQ6L [32]. The \(k_{\mathrm {T}}\)-MLM [35, 36] matching scheme is used. For the signal distributions, the shapes are taken from a dedicated high-statistics generated sample of exclusive \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\). The normalization is obtained from the \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\) component of an inclusive \(\mathrm {W}\)+jets sample by separating the \(\mathrm {W}\)+jets simulated sample into three subsamples labeled as \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\), \(\mathrm {W}\)+\(\mathrm {c} \mathrm {\overline{c}} \) , and \(\mathrm {W}\)+\(\mathrm {u}\) \(\mathrm {d}\) \(\mathrm {s}\) \(\mathrm {c}\) \(\mathrm {g}\) , which are defined below. If an event contains a bottom jet from the matrix element or parton shower, it is categorized as \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\). A bottom quark at generator level requires the presence of a bottom hadron within a cone of radius \(\varDelta R=0.4\) with respect to the jet axis. The jets are constructed using generator-level information using all stable particles in the event, excluding neutrinos. Jets with a distance smaller than \(\varDelta R = 0.5\) with respect to a lepton are removed from the event. If an event does not contain any b jet, but an even, nonzero number of charm jets, again from the matrix element or parton shower, it is categorized as \(\mathrm {W}\)+\(\mathrm {c} \mathrm {\overline{c}} \) . The remaining events are categorized as \(\mathrm {W}\)+\(\mathrm {u}\) \(\mathrm {d}\) \(\mathrm {s}\) \(\mathrm {c}\) \(\mathrm {g}\) . The energy of the selected leptons at the generator level is corrected for final-state radiation by summing the four-momenta of all the photons generated within a cone of radius \(\varDelta R = 0.1\) around the lepton. Leptons that do not originate from the primary vertex are not considered for selection.
Single top quark event samples are generated at next-to-leading order (NLO) with powheg 2.0 [37,38,39,40] using the CTEQ6M PDF set. Hadronization is performed using pythia 6.4 with the Z2* tune. Diboson samples are generated and hadronized with pythia 6.4 at leading order (LO) using the CTEQ6L PDF set and the Z2* tune.
The cross sections for the \(\mathrm {V}\)+jets processes are normalized using the predictions for inclusive \(\mathrm {W}\) and \(\mathrm {Z}\) boson production from fewz 3.1 [41, 42] evaluated at next-to-next-to-leading order (NNLO). The cross section for \(\gamma \)+jets is evaluated at LO using MadGraph with the CTEQ6L PDF set. Single top quark and diboson production cross sections are normalized to the NLO cross section predictions from \(\textsc {mcfm} ~7.0\) [43, 44] using the MSTW2008 NLO PDF set [45]. The \(\mathrm{{t}}\mathrm{{\overline{t}}}\) cross section used is \({241.5\pm 8.5\,{\text {pb}}}\), and was determined from data collected by the ATLAS and CMS experiments [46,47,48] at the LHC at \(\sqrt{s}=8{\,\mathrm{{TeV}}} \).
For all simulated processes, the detector response is simulated using a detailed description of the CMS detector based on Geant4 [49]. The reconstruction of simulated events is performed with the same algorithms used for the data.
Events induced by additional simultaneous pp interactions are simulated using events generated with pythia 6. During the 2012 data taking, the average pileup rate was 21 interactions per bunch crossing; the simulated number of pileup interactions has been reweighted to match this distribution in the data.
5 Analysis strategy
The \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\) yield is estimated using a binned maximum-likelihood fit to the \(M_\mathrm {T} \) distribution in the signal event sample. With the exception of the multijet processes, the distributions and normalizations of all background contributions in the fit are taken from simulation. Consequently, it is important to verify that the simulation describes the data.
The dominant background in the signal event sample arises from the \(\mathrm{{t}}\mathrm{{\overline{t}}}\) process. Therefore, the data and simulation are compared in two \(\mathrm{{t}}\mathrm{{\overline{t}}}\)-dominated control samples: one characterized by a pair of opposite flavor leptons (\(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multilepton), and the other by the presence of three or more jets (\(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multijet). The simulation is reweighted to describe the data in the control regions and then is used to predict the \(M_\mathrm {T} \) distributions in the signal region.
The signal region contains a muon (electron) with \(p_{\mathrm {T}} > 30\,\mathrm{{GeV}} \), \(|\eta | < 2.1\), and satisfying \(I < 0.12\,(0.10)\). Exactly two b-tagged jets with \(p_{\mathrm {T}} > 25\,\mathrm{{GeV}} \) and \(|\eta | < 2.4\) are also required. Events with additional leptons with \(p_{\mathrm {T}} > 10\,\mathrm{{GeV}} \) and \(|\eta | < 2.4\) or a third jet with \(p_{\mathrm {T}} > 25\,\mathrm{{GeV}} \) and \(|\eta | < 4.7\) are rejected. The \(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multijet sample is obtained using the same selection criteria as for the signal event sample, but requiring at least three jets in the event with \(p_{\mathrm {T}} >25\,\mathrm{{GeV}} \) and \(|\eta |<2.4\) instead of vetoing events that have more than two jets. The \(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multilepton sample uses similar selection criteria as the signal event sample; however, the lepton requirement is modified. The event must contain two isolated leptons of different flavor, each with \(p_{\mathrm {T}} >30\,\mathrm{{GeV}} \) and \(|\eta |<2.1\). In the \(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multilepton sample, the \(M_\mathrm {T} \) variable is calculated with respect to the electron in the electron channel and the muon in the muon channel.
The QCD background distributions in the \(M_\mathrm {T}\) variable are estimated from data using event samples that pass all signal requirements, but requiring the muon (electron) is not isolated, \(I > 0.20\,(0.15)\). The resulting distributions are corrected for the presence of all other backgrounds, as estimated from simulation. Their contribution is less than 1% of the QCD background rate. The QCD background normalization is adjusted in order to describe the number of data events at \(M_\mathrm {T} <20\,\mathrm{{GeV}} \), after subtracting the non-QCD backgrounds obtained from simulation.
In the fiducial regions used in this analysis, no correlation is observed between I and \(M_\mathrm {T} \) in multijet events simulated with pythia 6, so the use of an inverted isolation requirement to obtain the QCD background distribution is possible. However, this is not the case for the \(\varDelta R\) distance between the two b-tagged jets, \(\varDelta R({\mathrm{{b}},\mathrm{{\overline{b}}}})\), or the lepton \(p_{\mathrm {T}} \). The shape of the QCD distribution for these variables is therefore taken from an \(M_\mathrm {T} <30\) \(\,\mathrm{{GeV}}\) sideband and validated against QCD multijet simulation. The normalization of the QCD background in these variables is set to the final normalization resulting from the fit to the \(M_\mathrm {T}\) variable, which was derived using the inverted isolation requirement.
The normalizations and distributions of the simulated backgrounds are allowed to vary in the fit within the uncertainties listed in Table 1 as described in Sect. 6. The uncorrelated normalization uncertainties are uncertainties in the cross section of the given sample.
Two major parameters in the simulations significantly affect the normalization of the simulated distributions: the b tagging efficiency and the jet energy scale (JES). The control samples as well as the signal event samples show similar sensitivity to the b tagging efficiency, and its adjustment affects all the regions in a correlated manner. Because \(\mathrm{{t}}\mathrm{{\overline{t}}}\) production may have more than two jets in the final state, the rejection of events with a third jet makes it sensitive to JES. The effect on the leading jets is moderate, but JES variations lead to significant migration of jets into and out of the veto region. The \(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multijet sample, since it has no veto on a third jet, is less sensitive to JES variations than the \(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multilepton sample. The variation in the JES changes the \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\) yield in the signal region by less than 1%.
The fit procedure consists of three consecutive steps in which the simulated distributions in two control samples and the event sample are fit to data using the \(M_\mathrm {T} \) variable, which is chosen because it has a well-known shape for \(\mathrm {W}\)+jets production that allows for reliable signal extraction. First, the fit is performed using the \(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multijet sample. It results in a correction of the b tagging efficiency, measured separately in the muon and electron channels and then combined. The simulation is corrected using this result and the corrected simulated samples are fit to the data in the \(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multilepton sample. The result of the second step is used to adjust JES and as a result of this procedure, the simulation is expected to better describe the \(\mathrm{{t}}\mathrm{{\overline{t}}}\) contribution. The final step is to extract the number of \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\) events from the fit to \(M_\mathrm {T}\) in the signal event sample.
Similar results can be obtained by performing a simultaneous fit of the signal and the two control regions. We find that the b tagging efficiency correction and JES correction have opposite effects on the distributions and thus compensate for each other in a simultaneous fit, reducing its precision. Separating these effects in steps provides better understanding of underlying uncertainties and therefore more precise results.
6 Systematic uncertainties
The main sources of the systematic uncertainties are listed in Table 1. The size of the variation is shown for each source, together with its effect on the measured cross section. These are included in the fit. Some of the uncertainties affect only the normalization in the respective contributions. These include the uncertainties in the theoretical cross section for a given sample, which are uncorrelated between samples and are included as log-normal constraints on the rate. The uncertainty due to the b tagging efficiency and the uncertainty due to the JES are observed to only affect the normalizations of the samples in the \(M_\mathrm {T}\) variable. The uncertainties that affect both the normalization and the shape of the \(M_\mathrm {T} \) distributions are listed in the table under “Shape” and are incorporated into the fit via binned distributions, which are obtained by varying the source of the given uncertainty and reprocessing the simulated sample. Such uncertainties in the template are interpolated quadratically.
As a conservative estimate of the uncertainty in QCD multijet background, a 50% uncertainty has been considered. This results in an uncertainty of 2–3% in the measured cross section. The b tagging efficiency and JES rescaling uncertainties are taken from their respective fits. The renormalization and factorization scales respectively are set at \(\mu _{\mathrm {R}}=\mu _{\mathrm {F}}=m_\mathrm {W}\), and the uncertainties on this choice are estimated from the change in acceptance found by varying \(\mu _{\mathrm {R}}\) and \(\mu _{\mathrm {F}}\) up and down by a factor of two. The PDF uncertainties are estimated from the change in acceptance found by varying the PDF set following the LHAPDF/PDF4LHC prescription [50,51,52,53], considering PDF sets from the CTEQ, MSTW, NNPDF, and HERA Collaborations.
7 Results
The fit in the \(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multijet sample is used to obtain b tagging efficiency rescaling factors separately for the muon and electron channels in order to better describe the b tagging efficiency in the simulation as described in Sect. 5. The results of the fit are presented in the two plots at the top of Fig. 1. The central values of the b tagging efficiency rescaling factors, \(1.12 \pm 0.08\) (muon channel) and \(1.16 \pm 0.08\) (electron channel), are averaged to \(1.14 \pm 0.08\) with the combined uncertainty, dominated by systematics, taken as the maximum of the uncertainties for the individual lepton channels. The simulation is reweighted accordingly for the next fit, and the uncertainty in this fit sets the one standard deviation bound on the b tagging efficiency rescaling factor in subsequent fits.
A fit to the \(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multilepton sample adjusts the JES, as described in Sect. 5. As a result, the simulated \(M_\mathrm {T} \) distributions change normalization. The best fit results in changing the normalization by approximately 3.4% from its central value, which corresponds to 1.3 standard deviations in JES. The middle plots in Fig. 1 show the results of the fits in the \(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multilepton sample for the muon (left) and the electron (right) channels. The JES is therefore shifted by 1.3 standard deviations in the simulation with the uncertainty taken from the fit. Thus the simulation is tuned to describe the \(\mathrm{{t}}\mathrm{{\overline{t}}}\) control samples and is used to extract the signal yield in the signal region.
The results of the fit in the \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\) signal region are shown in the bottom of Fig. 1. All background contributions are allowed to vary in the fit within their uncertainties, while the \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\) normalization remains a free parameter of the fit. The correlations across all simulated samples are taken into account as shown in Table 1. Based on the fits the number of events of each type in the signal event sample is given in Table 2. Events coming from the production of a Higgs boson in association with a vector boson constitute a negligible fraction of the overall event yield and are not considered.
Distributions for variables other than those being directly fit are also produced by applying the results from the three fits to the simulated samples. Distributions of \(\varDelta R({\mathrm{{b}},\mathrm{{\overline{b}}}})\) and \(p_{\mathrm {T}} ^\ell \) combining both lepton flavors are presented in Fig. 2. The angular separation between the b jets is seen to be well modeled, and the \(p_{\mathrm {T}} ^\ell \) distribution shows an agreement within \(10\%\) for \(p_{\mathrm {T}} ^\ell < 100\) GeV, with a slightly falling trend in the ratio of data and simulation.
The transverse mass distributions (upper) in the \(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multijet phase space after fitting to obtain the b tagging efficiency rescale factors, (middle) in the \(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multilepton sample after fitting to find the appropriate JES, and (lower) in the \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\) signal sample after fitting simultaneously muon and electron decay channels. The lepton channels are shown separately with the muon sample on the left and the electron sample on the right. The last bin contains overflow events. The shaded area represents the total uncertainty in the simulation after the fit
Distributions of \(\varDelta R({\mathrm{{b}},\mathrm{{\overline{b}}}})\) and \(p_{\mathrm {T}} ^\ell \) after applying the results from the fits to the simulation. The QCD background shape is taken from an \(M_\mathrm {T} <30\,\mathrm{{GeV}} \) sideband and the muon and electron channels have been combined in these distributions. The last bin contains overflow events and the shaded area represents the total uncertainty in the simulation after the fit
The cross section is calculated as
where \(\mathcal {L}\) is the integrated luminosity, \(N^{\text {data}}_{\text {reconstructed}}\) is the number of observed signal events, \(N^{\mathrm {MC}}_{\text {reconstructed}}\) is the number of expected signal events from simulation reconstructed in the fiducial region, \(N^{\mathrm {MC}}_{\text {generated}}\) is the number of generated events in the fiducial region, A and \(\epsilon \) are the acceptance and efficiency, \(\alpha \) is the measured signal strength in the given lepton channel, and \(\sigma _{\text {gen}}\) is the simulated fiducial cross section of the signal sample. The signal strength is the scale factor in the \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\) cross section predicted by the fit, after factoring out contributions to the overall change in normalization due to systematic effects which are correlated across samples. In this analysis, the fiducial cross section is calculated as follows: MadGraph is used to compute the \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\) cross section with fiducial selections applied. Then a k-factor for inclusive W production is applied that is obtained from the ratio of the inclusive W cross section calculated with fewz 3.1 (at NNLO using the five-flavour CTEQ6M PDF set) and to that with MadGraph. The product \(A\,\epsilon \) is 10 to 15% and results from the combined effects of the efficiency of the lepton identification requirements (80%) and b tagging efficiency (40% per jet) and has an uncertainty of 10%, arising from scale and PDF choices as indicated in the bottom of Table 1.
The \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\) cross section is measured within a fiducial volume, which is defined by requiring leptons with \(p_{\mathrm {T}} > 30\,\mathrm{{GeV}} \) and \(|\eta |<2.1\) and exactly two b-tagged jets of \(p_{\mathrm {T}} > 25\,\mathrm{{GeV}} \) and \(|\eta |<2.4\). The measured cross sections are presented in Table 3. The combination of the muon and electron measurements is done using a simultaneous fit to both channels, taking into account correlations across samples.
The measured cross sections are compared to theoretical predictions from mcfm 7.0 [43, 44] with the MSTW2008 PDF set, as well as from MadGraph 5 interfaced with pythia 6 in the four- and five-flavour schemes and MadGraph 5 with pythia 8 [54] in the four-flavour scheme. In the four- and five-flavour approaches, the four and five lightest quark flavours are used in the proton PDF sets. In the five-flavour scheme, the PDF set CTEQ6L is used and interfaced with pythia 6 using the Z2* tune. The two four-flavour samples are produced using an NNLO PDF set interfaced with pythia version 6 using the CTEQ6L tune in one sample, and version 8 using the CUETP8M1 tune [55] in the other.
Comparisons between the results of calculations performed under different assumptions provide important feedback on the validity of the techniques employed. Differences in predictions arising from the modelling of b quarks as massive or massless are possible, as are variations in predictions arising from the use of different showering packages (pythia 6 vs. pythia 8) or matrix element generators (MadGraph vs. mcfm 7.0). In the phase space explored here, these predictions are all very close in their central value and agree with each other well within their respective uncertainties.
The mcfm 7.0 cross section calculation is performed at the level of parton jets and thus requires a hadronization correction. The multiplicative hadronization correction factor \(0.81\pm 0.07\) is calculated using the MadGraph + pythia 6 sample and agrees well with the factor \(0.84\pm 0.03\) calculated in the 7 TeV Z+b analysis [8]. The correction factor is obtained for jets computed excluding neutrinos from the particle list because such jets are closer in kinematics to particle jets at the detector level. The uncertainty reflects both the limited statistics of the MadGraph + pythia 6 sample as well as a comparison with the MadGraph + pythia 8 sample.
The mcfm 7.0 and four-flavour MadGraph predictions do not take into account \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\) production where the \({\mathrm{{b}}\mathrm{{\overline{b}}}}\) system is produced in a different partonic level interaction than the one which produced the \(\mathrm {W}\) boson, albeit in the same collision. Simulations of MadGraph + pythia events that include double parton interactions (DPI) reproduce the \(\mathrm {W}\)+jets data [56]. Therefore a MadGraph + pythia 8 sample of a \(\mathrm {W}\) boson produced in association with a \({\mathrm{{b}}\mathrm{{\overline{b}}}}\) pair coming from DPI is generated to study the effect on the fiducial cross section. Using this dedicated sample, an additive correction \(\sigma _{\mathrm {DPI}}\) is estimated to be \(0.06\pm 0.06\,{\text {pb}}\), where the uncertainty is conservatively assigned to be 100% of the value.
The resulting cross section predictions in the fiducial phase space at the hadron level, including the estimated hadronization and DPI corrections as needed, are compared in Fig. 3 with the measured value. Within one standard deviation the predictions agree with the measured cross section.
Comparison between the measured \({{\mathrm {W}}(\ell \nu )}+{\mathrm{{b}}\mathrm{{\overline{b}}}}\) cross section and various QCD predictions. The orange band indicates the uncertainty in the given sample associated with PDF choice and the yellow band represents the uncertainty associated with DPI. The labels 4F and 5F refer to the four- and five-flavour PDF schemes. In the case of the MadGraph + pythia 6 (5F) sample, the effects of DPI are already included in the generated samples so the DPI correction is not needed. The measured cross section is also shown with the total uncertainty in black and the luminosity, statistical, theoretical, and systematic uncertainties indicated
8 Summary
The cross section for the production of a W boson in association with two b jets was measured using a sample of proton–proton collisions at \(\sqrt{s} = 8{\,\mathrm{{TeV}}} \) collected by the CMS experiment. The data sample corresponds to an integrated luminosity of 19.8\(\,\text {fb}^\text {-1}\). The W bosons were reconstructed via their leptonic decays, \(\mathrm {W}\rightarrow \ell \nu \), where \(\ell =\mu \) or \(\mathrm {e}\). The fiducial region studied contains exactly one lepton with transverse momentum \(p_{\mathrm {T}} ^{\ell }>30\,\mathrm{{GeV}} \) and pseudorapidity \(|\eta ^{\ell } |<2.1\), with exactly two b jets with \(p_{\mathrm {T}} >25\,\mathrm{{GeV}} \) and \(|\eta |<2.4\) and no other jets with \(p_{\mathrm {T}} >25\,\mathrm{{GeV}} \) and \(|\eta |<4.7\). The cross section is \(\sigma ( {{\mathrm {p}\mathrm {p}}} \rightarrow {\mathrm {W}} (\ell \nu )+{\mathrm{{b}}\mathrm{{\overline{b}}}})= 0.64 \pm 0.03{\,\mathrm{{(stat)}}}\pm 0.10{\,\mathrm{{(syst)}}}\pm 0.06{\,\mathrm{{(theo)}}}\pm 0.02{\,\mathrm{{(lumi)}}}\,{\text {pb}}\), in agreement with standard model predictions.
References
ATLAS and CMS Collaborations, Combined measurement of the Higgs boson mass in pp collisions at \(\sqrt{s}=7\) and 8 TeV with the ATLAS and CMS experiments. Phys. Rev. Lett. 114, 191803 (2015). doi:10.1103/PhysRevLett.114.191803. arXiv:1503.07589
C.M.S. Collaboration, Precise determination of the mass of the Higgs boson and tests of compatibility of its couplings with the standard model predictions using proton collisions at 7 and 8 TeV. Eur. Phys. J. C 75, 212 (2015). doi:10.1140/epjc/s10052-015-3351-7. arXiv:1412.8662
C.M.S. Collaboration, Observation of a new boson at a mass of 125 GeV with the CMS experiment at the LHC. Phys. Lett. B 716, 30 (2012). doi:10.1016/j.physletb.2012.08.021. arXiv:1207.7235
ATLAS Collaboration, Observation of a new particle in the search for the Standard Model Higgs boson with the ATLAS detector at the LHC. Phys. Lett. B 716, 1 (2012). doi:10.1016/j.physletb.2012.08.020. arXiv:1207.7214
C.M.S. Collaboration, Measurement of the production cross section for a W boson and two b jets in pp collisions at \(\sqrt{s}\) = 7 TeV. Phys. Lett. B 735, 204 (2014). doi:10.1016/j.physletb.2014.06.041. arXiv:1312.6608
ATLAS Collaboration, Measurement of the cross-section for W boson production in association with b jets in pp collisions at \(\sqrt{s}\) = 7 TeV with the ATLAS detector. JHEP 06, 084 (2013). doi:10.1007/JHEP06(2013)084. arXiv:1302.2929
C.M.S. Collaboration, Measurement of the Z/\(\gamma \)*+b-jet cross section in pp collisions at 7 TeV. JHEP 06, 126 (2012). doi:10.1007/JHEP06(2012)126. arXiv:1204.1643
C.M.S. Collaboration, Measurement of the production cross sections for a Z boson and one or more b jets in pp collisions at \(\sqrt{s}\) = 7 TeV. JHEP 06, 120 (2014). doi:10.1007/JHEP06(2014)120. arXiv:1402.1521
C.M.S. Collaboration, Measurement of the cross section and angular correlations for associated production of a Z boson with b hadrons in pp collisions at \(\sqrt{s}\) = 7 TeV. JHEP 12, 39 (2013). doi:10.1007/JHEP12(2013)039. arXiv:1310.1349
ATLAS Collaboration, Measurement of the cross section for b jets produced in association with a Z boson at \(\sqrt{s}\) = 7 TeV with the ATLAS detector. Phys. Lett. B 706, 295 (2012). doi:10.1016/j.physletb.2011.11.059. arXiv:1109.1403
ATLAS Collaboration, Measurement of differential production cross sections for a Z boson in association with b jets in 7 TeV proton–proton collisions with the ATLAS detector. JHEP 10, 141 (2014). doi:10.1007/JHEP10(2014)141. arXiv:1407.3643
D0 Collaboration, Measurement of the p\(\rm \overline{p} \rightarrow \) W+b+X production cross section at \(\sqrt{s} = 1.96{\,{\rm {TeV}}}\). Phys. Lett. B 718, 1314 (2013). doi:10.1016/j.physletb.2012.12.044. arXiv:1210.0627
C.D.F. Collaboration, First measurement of the b jet cross section in events with a W boson in p\(\rm \overline{p}\) collisions at \(\sqrt{s} = 1.96{\,{\rm {TeV}}}\). Phys. Rev. Lett. 104, 131801 (2010). doi:10.1103/PhysRevLett.104.131801. arXiv:0909.1505
CMS Collaboration, CMS luminosity based on pixel cluster counting—summer 2013 update. CMS physics analysis summary CMS-PAS-LUM-13-001 (2013)
C.M.S. Collaboration, The CMS experiment at the CERN LHC. JINST 3, S08004 (2008). doi:10.1088/1748-0221/3/08/S08004
CMS Collaboration, Commissioning of the particle-flow event reconstruction with the first LHC collisions recorded in the CMS detector. CMS physics analysis summary CMS-PAS-PFT-10-001 (2010)
CMS Collaboration, Particle-flow event reconstruction in CMS and performance for jets, taus, and \(E_{\rm T}^{\text{miss}}\). CMS physics analysis summary CMS-PAS-PFT-09-001 (2009)
C.M.S. Collaboration, The performance of the CMS muon detector in proton-proton collisions at \(\sqrt{s}\) = 7 TeV at the LHC. JINST 8, P11002 (2013). doi:10.1088/1748-0221/8/11/P11002. arXiv:1306.6905
C.M.S. Collaboration, Performance of CMS muon reconstruction in pp collision events at \(\sqrt{s}\) = 7 TeV. JINST 7, P10002 (2012). doi:10.1088/1748-0221/7/10/P10002. arXiv:1206.4071
C.M.S. Collaboration, Performance of electron reconstruction and selection with the CMS detector in proton–proton collisions at \(\sqrt{s}\) = 8 TeV. JINST 10, 120 (2015). doi:10.1088/1748-0221/10/06/P06005. arXiv:1502.02701
C.M.S. Collaboration, Measurements of inclusive W and Z cross sections in pp collisions at \(\sqrt{s}\) = 7 TeV. JHEP 01, 080 (2011). doi:10.1007/JHEP01(2011)080. arXiv:1012.2466
C.M.S. Collaboration, Performance of the CMS missing transverse momentum reconstruction in pp data at \(\sqrt{s}=8\) TeV. J. Instrum. 10, P02006 (2015). doi:10.1088/1748-0221/10/02/P02006. arXiv:1411.0511
M. Cacciari, G.P. Salam, G. Soyez, The anti-\(k_{\rm {t}}\) jet clustering algorithm. JHEP 04, 63 (2008). doi:10.1088/1126-6708/2008/04/063. arXiv:0802.1189
M. Cacciari, G.P. Salam, Pileup subtraction using jet areas. Phys. Lett. B 659, 119 (2008). doi:10.1016/j.physletb.2007.09.077. arXiv:0707.1378
M. Cacciari, G.P. Salam, G. Soyez, The catchment area of jets. JHEP 04, 005 (2008). doi:10.1088/1126-6708/2008/04/005. arXiv:0802.1188
C.M.S. Collaboration, Identification and filtering of uncharacteristic noise in the CMS hadron calorimeter. JINST 5, T03014 (2010). doi:10.1088/1748-0221/5/03/T03014. arXiv:0911.4881
C.M.S. Collaboration, Determination of jet energy calibration and transverse momentum resolution in CMS. JINST 6, P11002 (2011). doi:10.1088/1748-0221/6/11/P11002. arXiv:1107.4277
CMS Collaboration, Performance of b tagging at \(\sqrt{s}\) = 8 TeV in multijet, ttbar and boosted topology events. CMS physics analysis summary CMS-PAS-BTV-13-001 (2013)
C.M.S. Collaboration, Identification of b-quark jets with the CMS experiment. J. Instrum. 8, 04013 (2012). doi:10.1088/1748-0221/8/04/P04013. arXiv:1211.4462
J. Alwall et al., MadGraph 5: going beyond. JHEP 06, 128 (2011). doi:10.1007/JHEP06(2011)128. arXiv:1106.0522
F. Maltoni, T. Stelzer, MadEvent: automatic event generation with MadGraph. JHEP 02, 027 (2003). doi:10.1088/1126-6708/2003/02/027. arXiv:hep-ph/0208156
H.-L. Lai et al., Uncertainty induced by QCD coupling in the CTEQ global analysis of parton distributions. Phys. Rev. D 82, 054021 (2010). doi:10.1103/PhysRevD.82.054021. arXiv:1004.4624
T. Sjöstrand, S. Mrenna, P.Z. Skands, PYTHIA 6.4 physics and manual. JHEP 05, 026 (2006). doi:10.1088/1126-6708/2006/05/026. arXiv:hep-ph/0603175
R. Field, Early LHC underlying event data—findings and surprises. In: Hadron Collider Physics. Proceedings, 22nd Conference. 2010. arXiv:1010.3558
J. Alwall et al., Comparative study of various algorithms for the merging of parton showers and matrix elements in hadronic collisions. Eur. Phys. J. C 53, 473 (2008). doi:10.1140/epjc/s10052-007-0490-5. arXiv:0706.2569
J. Alwall, S. de Visscher, F. Maltoni, QCD radiation in the production of heavy colored particles at the LHC. JHEP 02, 017 (2009). doi:10.1088/1126-6708/2009/02/017. arXiv:0810.5350
P. Nason, A new method for combining NLO QCD with shower Monte Carlo algorithms. JHEP 11, 040 (2004). doi:10.1088/1126-6708/2004/11/040. arXiv:hep-ph/0409146
S. Frixione, P. Nason, C. Oleari, Matching NLO QCD computations with parton shower simulations: the POWHEG method. JHEP 11, 070 (2007). doi:10.1088/1126-6708/2007/11/070. arXiv:0709.2092
S. Alioli, P. Nason, C. Oleari, E. Re, NLO single-top production matched with shower in POWHEG: \(s\)- and \(t\)-channel contributions. JHEP 09, 111 (2009). doi:10.1088/1126-6708/2009/09/111. arXiv:0907.4076. (Erratum: doi:10.1007/JHEP02(2010)011)
E. Re, Single-top \(Wt\)-channel production matched with parton showers using the POWHEG method. Eur. Phys. J. C 71, 1547 (2011). doi:10.1140/epjc/s10052-011-1547-z. arXiv:1009.2450
R. Gavin, Y. Li, F. Petriello, S. Quackenbush, W Physics at the LHC with FEWZ 2.1. Comput. Phys. Commun. 184, 208 (2013). doi:10.1016/j.cpc.2012.09.005. arXiv:1201.5896
Y. Li, F. Petriello, Combining QCD and electroweak corrections to dilepton production in FEWZ. Phys. Rev. D 86, 094034 (2012). doi:10.1103/PhysRevD.86.094034. arXiv:1208.5967
J.M. Campbell, R.K. Ellis, MCFM for the tevatron and the LHC. Loops and legs in quantum field theory—proceedings of the 10th DESY workshop on elementary particle theory. Nucl. Phys. B Proc. Suppl. 205, 10 (2010) . doi:10.1016/j.nuclphysbps.2010.08.011. arXiv:1007.3492
S. Badger, J.M. Campbell, R.K. Ellis, QCD corrections to the hadronic production of a heavy quark pair and a W boson including decay correlations. JHEP 03, 027 (2011). doi:10.1007/JHEP03(2011)027. arXiv:1011.6647
A.D. Martin, W.J. Stirling, R.S. Thorne, G. Watt, Parton distributions for the LHC. Eur. Phys. J. C 63, 189 (2009). doi:10.1140/epjc/s10052-009-1072-5. arXiv:0901.0002
CMS Collaboration, Combination of ATLAS and CMS top quark pair cross section measurements in the e\(\mu \) final state using proton–proton collisions at 8 TeV. CMS physics analysis summary CMS-PAS-TOP-14-016 (2014)
ATLAS Collaboration, Measurement of the t\(\rm \overline{t}\) production cross-section using e\(\mu \) events with b-tagged jets in pp collisions at \(\sqrt{s}=7\) and 8 TeV with the ATLAS detector. Eur. Phys. J. C 74, 3109 (2014). doi:10.1140/epjc/s10052-014-3109-7. arXiv:1406.5375
C.M.S. Collaboration, Measurement of the t\(\rm \overline{t}\) production cross section in the dilepton channel in pp collisions at \(\sqrt{s} = 8{\,{\rm {TeV}}}\). JHEP 02, 24 (2014). doi:10.1007/JHEP02(2014)024. arXiv:1312.7582. (Erratum: doi:10.1007/JHEP02(2014)102)
GEANT4 Collaboration, Nucl. Instrum. Meth. A GEANT4—a simulation toolkit. 506, 250 (2003). doi:10.1016/S0168-9002(03)01368-8
A. Buckley et al., LHAPDF6: parton density access in the LHC precision era. Eur. Phys. J. C 75, 132 (2015). doi:10.1140/epjc/s10052-015-3318-8. arXiv:1412.7420
M. Botje et al., The PDF4LHC working group interim recommendations (2011). arXiv:1101.0538
S. Alekhin et al., The PDF4LHC working group interim report (2011). arXiv:1101.0536
NNPDF Collaboration, Parton distributions with LHC data. Nucl. Phys. B867, 244 (2013). doi:10.1016/j.nuclphysb.2012.10.003. arXiv:1207.1303
T. Sjöstrand, S. Mrenna, P.Z. Skands, A brief introduction to PYTHIA 8.1. Comput. Phys. Commun. 178, 852 (2008). doi:10.1016/j.cpc.2008.01.036. arXiv:0710.3820
C.M.S. Collaboration, Event generator tunes obtained from underlying event and multiparton scattering measurements. Eur. Phys. J. C 76, 155 (2016). doi:10.1140/epjc/s10052-016-3988-x. arXiv:1512.00815
C.M.S. Collaboration, Study of double parton scattering using W + 2-jet events in proton-proton collisions at \(\sqrt{s}\) = 7 TeV. JHEP 03, 032 (2014). doi:10.1007/JHEP03(2014)032. arXiv:1312.5729
Acknowledgements
We congratulate our colleagues in the CERN accelerator departments for the excellent performance of the LHC and thank the technical and administrative staffs at CERN and at other CMS institutes for their contributions to the success of the CMS effort. In addition, we gratefully acknowledge the computing centers and personnel of the Worldwide LHC Computing Grid for delivering so effectively the computing infrastructure essential to our analyses. Finally, we acknowledge the enduring support for the construction and operation of the LHC and the CMS detector provided by the following funding agencies: BMWFW and FWF (Austria); FNRS and FWO (Belgium); CNPq, CAPES, FAPERJ, and FAPESP (Brazil); MES (Bulgaria); CERN; CAS, MoST, and NSFC (China); COLCIENCIAS (Colombia); MSES and CSF (Croatia); RPF (Cyprus); SENESCYT (Ecuador); MoER, ERC IUT and ERDF (Estonia); Academy of Finland, MEC, and HIP (Finland); CEA and CNRS/IN2P3 (France); BMBF, DFG, and HGF (Germany); GSRT (Greece); OTKA and NIH (Hungary); DAE and DST (India); IPM (Iran); SFI (Ireland); INFN (Italy); MSIP and NRF (Republic of Korea); LAS (Lithuania); MOE and UM (Malaysia); BUAP, CINVESTAV, CONACYT, LNS, SEP, and UASLP-FAI (Mexico); MBIE (New Zealand); PAEC (Pakistan); MSHE and NSC (Poland); FCT (Portugal); JINR (Dubna); MON, RosAtom, RAS and RFBR (Russia); MESTD (Serbia); SEIDI and CPAN (Spain); Swiss Funding Agencies (Switzerland); MST (Taipei); ThEPCenter, IPST, STAR and NSTDA (Thailand); TUBITAK and TAEK (Turkey); NASU and SFFR (Ukraine); STFC (United Kingdom); DOE and NSF (USA). Individuals have received support from the Marie-Curie program and the European Research Council and EPLANET (European Union); the Leventis Foundation; the A. P. Sloan Foundation; the Alexander von Humboldt Foundation; the Belgian Federal Science Policy Office; the Fonds pour la Formation à la Recherche dans l’Industrie et dans l’Agriculture (FRIA-Belgium); the Agentschap voor Innovatie door Wetenschap en Technologie (IWT-Belgium); the Ministry of Education, Youth and Sports (MEYS) of the Czech Republic; the Council of Science and Industrial Research, India; the HOMING PLUS program of the Foundation for Polish Science, cofinanced from European Union, Regional Development Fund, the Mobility Plus program of the Ministry of Science and Higher Education, the National Science Center (Poland), contracts Harmonia 2014/14/M/ST2/00428, Opus 2013/11/B/ST2/04202, 2014/13/B/ST2/02543 and 2014/15/B/ST2/03998, Sonata-bis 2012/07/E/ST2/01406; the Thalis and Aristeia programs cofinanced by EU-ESF and the Greek NSRF; the National Priorities Research Program by Qatar National Research Fund; the Programa Clarín-COFUND del Principado de Asturias; the Rachadapisek Sompot Fund for Postdoctoral Fellowship, Chulalongkorn University and the Chulalongkorn Academic into Its 2nd Century Project Advancement Project (Thailand); and the Welch Foundation, contract C-1845.
Author information
Authors and Affiliations
Consortia
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
Funded by SCOAP3
About this article
Cite this article
Khachatryan, V., Sirunyan, A.M., Tumasyan, A. et al. Measurement of the production cross section of a W boson in association with two b jets in pp collisions at \(\sqrt{s} = 8{\,\mathrm{{TeV}}} \) . Eur. Phys. J. C 77, 92 (2017). https://doi.org/10.1140/epjc/s10052-016-4573-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1140/epjc/s10052-016-4573-z