1 Introduction

The measurement of \(\mathrm {W}\) or \(\mathrm {Z} \) boson production in association with b quarks in proton–proton collisions provides important input for refinement of calculations in perturbative quantum chromodynamics and is also relevant for searches and measurements. In particular, these processes constitute a background to the experimental measurement of a standard model (SM) Higgs boson in which the Higgs boson decays into a \({\mathrm{b} \mathrm{{\overline{b}}}}\) pair in association with a vector boson. The discovery by the ATLAS and CMS Collaborations at the CERN LHC of a neutral boson with a mass of about \(125\,\mathrm{{GeV}} \) [1,2,3,4] motivates further studies to establish the nature of the boson and determine its coupling to bottom quarks. Furthermore, different models based on extensions of the Higgs sector are being compared with LHC data using final states composed of leptons and b jets. In this context, a better understanding of the b hadron production mechanism and the kinematic properties of associated jets is required to refine the background predictions and increase the sensitivity to new physics. Throughout this paper, hadronic showers originating from bottom or anti-bottom quarks are referred to as b jets, and b-tagged jets are the reconstructed objects either in simulation or data that have been identified as such.

The production of \(\mathrm {W}\) [5, 6] or \(\mathrm {Z} \) [7,8,9,10,11] bosons in association with b jets has been measured at the LHC using pp collisions at \(\sqrt{s}=7{\,\mathrm{{TeV}}} \) using data samples corresponding to an integrated luminosity of up to 5\(\,\text {fb}^\text {-1}\), and at the Fermilab Tevatron [12, 13] using proton–antiproton collisions at \(\sqrt{s}=1.96{\,\mathrm{{TeV}}} \). This analysis extends previous measurements of the \({\mathrm {W}}+{\mathrm{{b}}\mathrm{{\overline{b}}}}\) production cross section [5] and uses data at \(\sqrt{s}=8{\,\mathrm{{TeV}}} \) collected with the CMS detector, corresponding to an integrated luminosity of 19.8\(\,\text {fb}^\text {-1}\)  [14]. Whereas the previous CMS analysis used only the muon decay channel, this analysis uses both muon and electron decay modes.

2 CMS detector

The central feature of the CMS apparatus is a superconducting solenoid of 6 m internal diameter, providing a magnetic field of 3.8 T. Within the solenoid volume are a silicon pixel and strip tracker, a lead tungstate crystal electromagnetic calorimeter (ECAL), and a brass and scintillator hadron calorimeter (HCAL), each composed of a barrel and two endcap sections. Muons are measured in gas-ionization detectors embedded in the steel flux-return yoke outside the solenoid. Extensive forward calorimetry complements the coverage provided by the barrel and endcap detectors. A more detailed description of the CMS detector, together with a definition of the coordinate system used and the relevant kinematic variables, can be found in Ref. [15].

3 Event selection and reconstruction

The \(\mathrm {W}\rightarrow \mathrm {\mu }\nu _\mathrm {\mu }\) (\(\mathrm {W}\rightarrow \mathrm {e}\nu _\mathrm {e}\)) events are selected using single-muon (single-electron) triggers that require a loosely isolated muon (electron) with transverse momentum \(p_{\mathrm {T}} >24\,(27)\,\mathrm{{GeV}} \) and pseudorapidity \(|\eta |<2.1\,(2.5)\).

Individual particles emerging from each collision are reconstructed with the particle-flow (PF) technique [16, 17]. This approach uses the information from all subdetectors to identify and reconstruct individual particle candidates in the event, classifying them into mutually exclusive categories: charged and neutral hadrons, photons, electrons, and muons.

Muons are reconstructed by combining the information from the tracker and the muon spectrometer [18, 19]. Electrons are reconstructed by combining the information from the tracker and the calorimeter [20]. Both the muon and the electron candidates are required to have \(p_{\mathrm {T}} >30\,\mathrm{{GeV}} \) and \(|\eta |<2.1\) to ensure that the triggers are fully efficient. They are also required to originate from the primary vertex of the event, chosen as the vertex with the highest \(\sum p_{\mathrm {T}} ^2\) of the charged particles associated with it. Furthermore, the leptons must be isolated, where the isolation variable is defined as

$$\begin{aligned} I= & {} \frac{1}{p_{\mathrm {T}} ^\ell }\left[ \sum p_{\mathrm {T}} ^{\text {charged}}\right. \nonumber \\&\left. +{\max }\Bigl (0,\textstyle {\sum }p_{\mathrm {T}} ^{\gamma }+\textstyle {\sum }E_{\mathrm {T}} ^{\text {neutral}}-0.5\,\textstyle {\sum } p_{\mathrm {T}} ^{\mathrm {PU}}\Bigr )\right] , \end{aligned}$$
(1)

with the sums running over PF candidates in a cone of size \(\varDelta R < 0.4\,(0.3)\) around the muon (electron) direction, where \(\varDelta R = \sqrt{\smash [b]{ (\varDelta \eta )^2 + (\varDelta \phi )^2}}\), and \(\phi \) is the azimuthal angle in radians. The first three sums are over charged hadron candidates associated with the primary vertex, photon candidates, and neutral hadron candidates respectively. The definition of the isolation includes a correction for additional pp interactions, referred to as pileup, which is proportional to the scalar \(p_{\mathrm {T}}\) sum of charged particles not associated with the primary vertex in the isolation cone (\(\sum p_{\mathrm {T}} ^{\mathrm {PU}}\)). The selected muons (electrons) are required to have \(I < 0.12\,(0.10)\).

Missing transverse momentum in the event, \({\vec p}_{\mathrm {T}}^\mathrm{{miss}} \), is defined as the negative vector sum of the \(\vec {p_{\mathrm {T}}}\) of all PF candidates in the event. It is combined with the \(\vec {p_{\mathrm {T}}}\) of a muon or electron passing the identification and isolation requirements to compute the transverse mass, \(M_\mathrm {T}\), of the \(\mathrm {W}\) boson candidate. The \(M_\mathrm {T}\) variable is a natural discriminator against non-\(\mathrm {W}\) final states, such as quantum chromodynamics (QCD) multijet events, that have a lepton candidate and \({\vec p}_{\mathrm {T}}^\mathrm{{miss}} \), but a relatively low value of \(M_\mathrm {T} \). The result for \({\vec p}_{\mathrm {T}}^\mathrm{{miss}} \) is corrected for noise in the ECAL and HCAL using the method described in Ref. [21]. Corrections to minimize the effect of the pileup are also included [22].

Jets are constructed using the anti-\(k_{\mathrm {T}}\) clustering algorithm [23] with a radius parameter of 0.5, as implemented in the FastJet package [24, 25]. Jet clustering is performed using individual particle candidates reconstructed with the PF technique. Jets are required to pass identification criteria that eliminate jets originating from noisy channels in the HCAL [26]. Jets from pileup interactions are rejected by requiring that the jets originate at the primary interaction vertex. Small corrections to the relative and absolute jet energy calibrations of the detector are applied as a function of the \(p_{\mathrm {T}} \) and \(\eta \) of the jet [27].

The combined secondary vertex (CSV) b tagging algorithm [28, 29] exploits the long lifetime and relatively large mass of b hadrons to provide b jet identification. The CSV algorithm combines information about impact parameter significance, secondary vertex kinematic properties, and jet kinematic properties in a likelihood-ratio discriminator. The identification of b jets (b tagging) is made by imposing a minimum threshold on the CSV discriminator value. In this analysis, b-tagged jets are required to pass a threshold with an efficiency of 40% in the signal phase space and a misidentification probability of 0.1% (1%) for light (charm) jets. Jets are corrected for the difference in efficiency between data and simulation using scale factors dependent on the \(p_{\mathrm {T}}\) of the jet.

4 Simulated samples

After all selection requirements detailed in Sect. 5 are applied, the contributing background processes to the overall yield are the associated production of a massive vector boson and jets (\(\mathrm {V}\)+jets where V = \(\mathrm {W}\) or \(\mathrm {Z}\)), as well as production of diboson (\(\mathrm {W^+}{}\mathrm {W^-} \), \(\mathrm {W}{}\mathrm {Z} \), \(\mathrm {Z}{}\mathrm {Z} \)), \(\mathrm{{t}}\mathrm{{\overline{t}}}\), single top quark, \(\gamma \)+jets, and QCD multijet events. These background contributions are estimated from simulation, except for the QCD background, which is estimated from data as described in Sect. 5.

Simulated samples of \(\mathrm {V}\)+jets, \(\gamma \)+jets and \(\mathrm{{t}\mathrm{{\overline{t}}}}+\mathrm{{jets}}\) are generated at tree-level with MadGraph  5.1 [30, 31] using the CTEQ6L [32] parton distribution function (PDF) set. These samples are interfaced with pythia  6.4 [33] for hadronization using the Z2* tune for the underlying event. The most recent pythia Z2* tune is derived from the Z1 tune [34], which uses the CTEQ5L PDF set, whereas Z2* adopts CTEQ6L [32]. The \(k_{\mathrm {T}}\)-MLM [35, 36] matching scheme is used. For the signal distributions, the shapes are taken from a dedicated high-statistics generated sample of exclusive \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\). The normalization is obtained from the \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\) component of an inclusive \(\mathrm {W}\)+jets sample by separating the \(\mathrm {W}\)+jets simulated sample into three subsamples labeled as \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\), \(\mathrm {W}\)+\(\mathrm {c} \mathrm {\overline{c}} \) , and \(\mathrm {W}\)+\(\mathrm {u}\) \(\mathrm {d}\) \(\mathrm {s}\) \(\mathrm {c}\) \(\mathrm {g}\) , which are defined below. If an event contains a bottom jet from the matrix element or parton shower, it is categorized as \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\). A bottom quark at generator level requires the presence of a bottom hadron within a cone of radius \(\varDelta R=0.4\) with respect to the jet axis. The jets are constructed using generator-level information using all stable particles in the event, excluding neutrinos. Jets with a distance smaller than \(\varDelta R = 0.5\) with respect to a lepton are removed from the event. If an event does not contain any b jet, but an even, nonzero number of charm jets, again from the matrix element or parton shower, it is categorized as \(\mathrm {W}\)+\(\mathrm {c} \mathrm {\overline{c}} \) . The remaining events are categorized as \(\mathrm {W}\)+\(\mathrm {u}\) \(\mathrm {d}\) \(\mathrm {s}\) \(\mathrm {c}\) \(\mathrm {g}\) . The energy of the selected leptons at the generator level is corrected for final-state radiation by summing the four-momenta of all the photons generated within a cone of radius \(\varDelta R = 0.1\) around the lepton. Leptons that do not originate from the primary vertex are not considered for selection.

Single top quark event samples are generated at next-to-leading order (NLO) with powheg  2.0 [37,38,39,40] using the CTEQ6M PDF set. Hadronization is performed using pythia  6.4 with the Z2* tune. Diboson samples are generated and hadronized with pythia  6.4 at leading order (LO) using the CTEQ6L PDF set and the Z2* tune.

The cross sections for the \(\mathrm {V}\)+jets processes are normalized using the predictions for inclusive \(\mathrm {W}\) and \(\mathrm {Z}\)  boson production from fewz  3.1 [41, 42] evaluated at next-to-next-to-leading order (NNLO). The cross section for \(\gamma \)+jets is evaluated at LO using MadGraph with the CTEQ6L PDF set. Single top quark and diboson production cross sections are normalized to the NLO cross section predictions from \(\textsc {mcfm} ~7.0\) [43, 44] using the MSTW2008 NLO PDF set [45]. The \(\mathrm{{t}}\mathrm{{\overline{t}}}\) cross section used is \({241.5\pm 8.5\,{\text {pb}}}\), and was determined from data collected by the ATLAS and CMS experiments [46,47,48] at the LHC at \(\sqrt{s}=8{\,\mathrm{{TeV}}} \).

For all simulated processes, the detector response is simulated using a detailed description of the CMS detector based on Geant4  [49]. The reconstruction of simulated events is performed with the same algorithms used for the data.

Events induced by additional simultaneous pp interactions are simulated using events generated with pythia  6. During the 2012 data taking, the average pileup rate was 21 interactions per bunch crossing; the simulated number of pileup interactions has been reweighted to match this distribution in the data.

5 Analysis strategy

The \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\) yield is estimated using a binned maximum-likelihood fit to the \(M_\mathrm {T} \) distribution in the signal event sample. With the exception of the multijet processes, the distributions and normalizations of all background contributions in the fit are taken from simulation. Consequently, it is important to verify that the simulation describes the data.

The dominant background in the signal event sample arises from the \(\mathrm{{t}}\mathrm{{\overline{t}}}\) process. Therefore, the data and simulation are compared in two \(\mathrm{{t}}\mathrm{{\overline{t}}}\)-dominated control samples: one characterized by a pair of opposite flavor leptons (\(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multilepton), and the other by the presence of three or more jets (\(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multijet). The simulation is reweighted to describe the data in the control regions and then is used to predict the \(M_\mathrm {T} \) distributions in the signal region.

The signal region contains a muon (electron) with \(p_{\mathrm {T}} > 30\,\mathrm{{GeV}} \), \(|\eta | < 2.1\), and satisfying \(I < 0.12\,(0.10)\). Exactly two b-tagged jets with \(p_{\mathrm {T}} > 25\,\mathrm{{GeV}} \) and \(|\eta | < 2.4\) are also required. Events with additional leptons with \(p_{\mathrm {T}} > 10\,\mathrm{{GeV}} \) and \(|\eta | < 2.4\) or a third jet with \(p_{\mathrm {T}} > 25\,\mathrm{{GeV}} \) and \(|\eta | < 4.7\) are rejected. The \(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multijet sample is obtained using the same selection criteria as for the signal event sample, but requiring at least three jets in the event with \(p_{\mathrm {T}} >25\,\mathrm{{GeV}} \) and \(|\eta |<2.4\) instead of vetoing events that have more than two jets. The \(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multilepton sample uses similar selection criteria as the signal event sample; however, the lepton requirement is modified. The event must contain two isolated leptons of different flavor, each with \(p_{\mathrm {T}} >30\,\mathrm{{GeV}} \) and \(|\eta |<2.1\). In the \(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multilepton sample, the \(M_\mathrm {T} \) variable is calculated with respect to the electron in the electron channel and the muon in the muon channel.

The QCD background distributions in the \(M_\mathrm {T}\) variable are estimated from data using event samples that pass all signal requirements, but requiring the muon (electron) is not isolated, \(I > 0.20\,(0.15)\). The resulting distributions are corrected for the presence of all other backgrounds, as estimated from simulation. Their contribution is less than 1% of the QCD background rate. The QCD background normalization is adjusted in order to describe the number of data events at \(M_\mathrm {T} <20\,\mathrm{{GeV}} \), after subtracting the non-QCD backgrounds obtained from simulation.

In the fiducial regions used in this analysis, no correlation is observed between I and \(M_\mathrm {T} \) in multijet events simulated with pythia 6, so the use of an inverted isolation requirement to obtain the QCD background distribution is possible. However, this is not the case for the \(\varDelta R\) distance between the two b-tagged jets, \(\varDelta R({\mathrm{{b}},\mathrm{{\overline{b}}}})\), or the lepton \(p_{\mathrm {T}} \). The shape of the QCD distribution for these variables is therefore taken from an \(M_\mathrm {T} <30\) \(\,\mathrm{{GeV}}\) sideband and validated against QCD multijet simulation. The normalization of the QCD background in these variables is set to the final normalization resulting from the fit to the \(M_\mathrm {T}\) variable, which was derived using the inverted isolation requirement.

The normalizations and distributions of the simulated backgrounds are allowed to vary in the fit within the uncertainties listed in Table 1 as described in Sect. 6. The uncorrelated normalization uncertainties are uncertainties in the cross section of the given sample.

Two major parameters in the simulations significantly affect the normalization of the simulated distributions: the b tagging efficiency and the jet energy scale (JES). The control samples as well as the signal event samples show similar sensitivity to the b tagging efficiency, and its adjustment affects all the regions in a correlated manner. Because \(\mathrm{{t}}\mathrm{{\overline{t}}}\) production may have more than two jets in the final state, the rejection of events with a third jet makes it sensitive to JES. The effect on the leading jets is moderate, but JES variations lead to significant migration of jets into and out of the veto region. The \(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multijet sample, since it has no veto on a third jet, is less sensitive to JES variations than the \(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multilepton sample. The variation in the JES changes the \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\) yield in the signal region by less than 1%.

The fit procedure consists of three consecutive steps in which the simulated distributions in two control samples and the event sample are fit to data using the \(M_\mathrm {T} \) variable, which is chosen because it has a well-known shape for \(\mathrm {W}\)+jets production that allows for reliable signal extraction. First, the fit is performed using the \(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multijet sample. It results in a correction of the b tagging efficiency, measured separately in the muon and electron channels and then combined. The simulation is corrected using this result and the corrected simulated samples are fit to the data in the \(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multilepton sample. The result of the second step is used to adjust JES and as a result of this procedure, the simulation is expected to better describe the \(\mathrm{{t}}\mathrm{{\overline{t}}}\) contribution. The final step is to extract the number of \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\) events from the fit to \(M_\mathrm {T}\) in the signal event sample.

Similar results can be obtained by performing a simultaneous fit of the signal and the two control regions. We find that the b tagging efficiency correction and JES correction have opposite effects on the distributions and thus compensate for each other in a simultaneous fit, reducing its precision. Separating these effects in steps provides better understanding of underlying uncertainties and therefore more precise results.

6 Systematic uncertainties

The main sources of the systematic uncertainties are listed in Table 1. The size of the variation is shown for each source, together with its effect on the measured cross section. These are included in the fit. Some of the uncertainties affect only the normalization in the respective contributions. These include the uncertainties in the theoretical cross section for a given sample, which are uncorrelated between samples and are included as log-normal constraints on the rate. The uncertainty due to the b tagging efficiency and the uncertainty due to the JES are observed to only affect the normalizations of the samples in the \(M_\mathrm {T}\) variable. The uncertainties that affect both the normalization and the shape of the \(M_\mathrm {T} \) distributions are listed in the table under “Shape” and are incorporated into the fit via binned distributions, which are obtained by varying the source of the given uncertainty and reprocessing the simulated sample. Such uncertainties in the template are interpolated quadratically.

As a conservative estimate of the uncertainty in QCD multijet background, a 50% uncertainty has been considered. This results in an uncertainty of 2–3% in the measured cross section. The b tagging efficiency and JES rescaling uncertainties are taken from their respective fits. The renormalization and factorization scales respectively are set at \(\mu _{\mathrm {R}}=\mu _{\mathrm {F}}=m_\mathrm {W}\), and the uncertainties on this choice are estimated from the change in acceptance found by varying \(\mu _{\mathrm {R}}\) and \(\mu _{\mathrm {F}}\) up and down by a factor of two. The PDF uncertainties are estimated from the change in acceptance found by varying the PDF set following the LHAPDF/PDF4LHC prescription [50,51,52,53], considering PDF sets from the CTEQ, MSTW, NNPDF, and HERA Collaborations.

Table 1 The main sources of systematic uncertainty in the \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\) signal event sample. The column labeled “Variation” indicates the bounds on the normalization change of a given sample due to a variation of the uncertainty by one standard deviation. The last column indicates the contribution of the given systematic to the overall uncertainty in the measured cross section. The uncertainty labeled “b tag eff rescaling” is the uncertainty associated with the rescaling of the b tagging efficiency. UES refers to the scale of energy deposits not clustered into jets, and MES and EES refer to the muon and electron energy scales. The uncertainty labeled as “Id/Iso/Trg” is the uncertainty associated with the efficiency of the lepton identification, isolation, and trigger. The uncertainties in the integrated luminosity [14] and in the acceptance due to PDF uncertainties and scale choices are not included in the fit, and are treated separately

7 Results

The fit in the \(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multijet sample is used to obtain b tagging efficiency rescaling factors separately for the muon and electron channels in order to better describe the b tagging efficiency in the simulation as described in Sect. 5. The results of the fit are presented in the two plots at the top of Fig. 1. The central values of the b tagging efficiency rescaling factors, \(1.12 \pm 0.08\) (muon channel) and \(1.16 \pm 0.08\) (electron channel), are averaged to \(1.14 \pm 0.08\) with the combined uncertainty, dominated by systematics, taken as the maximum of the uncertainties for the individual lepton channels. The simulation is reweighted accordingly for the next fit, and the uncertainty in this fit sets the one standard deviation bound on the b tagging efficiency rescaling factor in subsequent fits.

A fit to the \(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multilepton sample adjusts the JES, as described in Sect. 5. As a result, the simulated \(M_\mathrm {T} \) distributions change normalization. The best fit results in changing the normalization by approximately 3.4% from its central value, which corresponds to 1.3 standard deviations in JES. The middle plots in Fig. 1 show the results of the fits in the \(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multilepton sample for the muon (left) and the electron (right) channels. The JES is therefore shifted by 1.3 standard deviations in the simulation with the uncertainty taken from the fit. Thus the simulation is tuned to describe the \(\mathrm{{t}}\mathrm{{\overline{t}}}\) control samples and is used to extract the signal yield in the signal region.

The results of the fit in the \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\) signal region are shown in the bottom of Fig. 1. All background contributions are allowed to vary in the fit within their uncertainties, while the \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\) normalization remains a free parameter of the fit. The correlations across all simulated samples are taken into account as shown in Table 1. Based on the fits the number of events of each type in the signal event sample is given in Table 2. Events coming from the production of a Higgs boson in association with a vector boson constitute a negligible fraction of the overall event yield and are not considered.

Distributions for variables other than those being directly fit are also produced by applying the results from the three fits to the simulated samples. Distributions of \(\varDelta R({\mathrm{{b}},\mathrm{{\overline{b}}}})\) and \(p_{\mathrm {T}} ^\ell \) combining both lepton flavors are presented in Fig. 2. The angular separation between the b jets is seen to be well modeled, and the \(p_{\mathrm {T}} ^\ell \) distribution shows an agreement within \(10\%\) for \(p_{\mathrm {T}} ^\ell < 100\) GeV, with a slightly falling trend in the ratio of data and simulation.

Fig. 1
figure 1

The transverse mass distributions (upper) in the \(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multijet phase space after fitting to obtain the b tagging efficiency rescale factors, (middle) in the \(\mathrm{{t}}\mathrm{{\overline{t}}}\)-multilepton sample after fitting to find the appropriate JES, and (lower) in the \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\) signal sample after fitting simultaneously muon and electron decay channels. The lepton channels are shown separately with the muon sample on the left and the electron sample on the right. The last bin contains overflow events. The shaded area represents the total uncertainty in the simulation after the fit

Table 2 Initial and final yields obtained in the \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\) signal region. The uncertainties in the signal strength represent the total uncertainty of the fit
Fig. 2
figure 2

Distributions of \(\varDelta R({\mathrm{{b}},\mathrm{{\overline{b}}}})\) and \(p_{\mathrm {T}} ^\ell \) after applying the results from the fits to the simulation. The QCD background shape is taken from an \(M_\mathrm {T} <30\,\mathrm{{GeV}} \) sideband and the muon and electron channels have been combined in these distributions. The last bin contains overflow events and the shaded area represents the total uncertainty in the simulation after the fit

The cross section is calculated as

$$\begin{aligned} \sigma (\mathrm {p}\mathrm {p}\rightarrow \mathrm {W}(\ell \nu )+{\mathrm{{b}}\mathrm{{\overline{b}}}}) =&\frac{N^{\text {data}}_{\text {reconstructed}}}{A\,\epsilon \, \mathcal {L}} \\ =&\frac{N^{\text {data}}_{\text {reconstructed}}}{(N^{\mathrm {MC}}_{\text {reconstructed}}/N^{\mathrm {MC}}_{\text {generated}})\, \mathcal {L}} = \alpha \sigma _{\text {gen}} \end{aligned}$$

where \(\mathcal {L}\) is the integrated luminosity, \(N^{\text {data}}_{\text {reconstructed}}\) is the number of observed signal events, \(N^{\mathrm {MC}}_{\text {reconstructed}}\) is the number of expected signal events from simulation reconstructed in the fiducial region, \(N^{\mathrm {MC}}_{\text {generated}}\) is the number of generated events in the fiducial region, A and \(\epsilon \) are the acceptance and efficiency, \(\alpha \) is the measured signal strength in the given lepton channel, and \(\sigma _{\text {gen}}\) is the simulated fiducial cross section of the signal sample. The signal strength is the scale factor in the \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\) cross section predicted by the fit, after factoring out contributions to the overall change in normalization due to systematic effects which are correlated across samples. In this analysis, the fiducial cross section is calculated as follows: MadGraph is used to compute the \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\) cross section with fiducial selections applied. Then a k-factor for inclusive W production is applied that is obtained from the ratio of the inclusive W cross section calculated with fewz  3.1 (at NNLO using the five-flavour CTEQ6M PDF set) and to that with MadGraph. The product \(A\,\epsilon \) is 10 to 15% and results from the combined effects of the efficiency of the lepton identification requirements (80%) and b tagging efficiency (40% per jet) and has an uncertainty of 10%, arising from scale and PDF choices as indicated in the bottom of Table 1.

The \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\) cross section is measured within a fiducial volume, which is defined by requiring leptons with \(p_{\mathrm {T}} > 30\,\mathrm{{GeV}} \) and \(|\eta |<2.1\) and exactly two b-tagged jets of \(p_{\mathrm {T}} > 25\,\mathrm{{GeV}} \) and \(|\eta |<2.4\). The measured cross sections are presented in Table 3. The combination of the muon and electron measurements is done using a simultaneous fit to both channels, taking into account correlations across samples.

Table 3 Measured cross sections in the muon, electron, and combined lepton channels. The systematic uncertainty (syst) includes the contributions from all rows in Table 1 that have an entry in the “Variation” column, and the theoretical uncertainty (theo) includes the combination of the uncertainties associated with the choice of \(\mu _{\mathrm {R}}\), \(\mu _{\mathrm {F}}\), and PDF

The measured cross sections are compared to theoretical predictions from mcfm  7.0 [43, 44] with the MSTW2008 PDF set, as well as from MadGraph  5 interfaced with pythia  6 in the four- and five-flavour schemes and MadGraph  5 with pythia  8 [54] in the four-flavour scheme. In the four- and five-flavour approaches, the four and five lightest quark flavours are used in the proton PDF sets. In the five-flavour scheme, the PDF set CTEQ6L is used and interfaced with pythia 6 using the Z2* tune. The two four-flavour samples are produced using an NNLO PDF set interfaced with pythia version 6 using the CTEQ6L tune in one sample, and version 8 using the CUETP8M1 tune [55] in the other.

Comparisons between the results of calculations performed under different assumptions provide important feedback on the validity of the techniques employed. Differences in predictions arising from the modelling of b quarks as massive or massless are possible, as are variations in predictions arising from the use of different showering packages (pythia 6 vs. pythia 8) or matrix element generators (MadGraph vs. mcfm  7.0). In the phase space explored here, these predictions are all very close in their central value and agree with each other well within their respective uncertainties.

The mcfm  7.0 cross section calculation is performed at the level of parton jets and thus requires a hadronization correction. The multiplicative hadronization correction factor \(0.81\pm 0.07\) is calculated using the MadGraph + pythia  6 sample and agrees well with the factor \(0.84\pm 0.03\) calculated in the 7 TeV Z+b analysis [8]. The correction factor is obtained for jets computed excluding neutrinos from the particle list because such jets are closer in kinematics to particle jets at the detector level. The uncertainty reflects both the limited statistics of the MadGraph + pythia  6 sample as well as a comparison with the MadGraph + pythia  8 sample.

The mcfm  7.0 and four-flavour MadGraph predictions do not take into account \({\mathrm {W} + \mathrm{{b}}\mathrm{{\overline{b}}}}\) production where the \({\mathrm{{b}}\mathrm{{\overline{b}}}}\) system is produced in a different partonic level interaction than the one which produced the \(\mathrm {W}\) boson, albeit in the same collision. Simulations of MadGraph + pythia events that include double parton interactions (DPI) reproduce the \(\mathrm {W}\)+jets data [56]. Therefore a MadGraph + pythia  8 sample of a \(\mathrm {W}\) boson produced in association with a \({\mathrm{{b}}\mathrm{{\overline{b}}}}\) pair coming from DPI is generated to study the effect on the fiducial cross section. Using this dedicated sample, an additive correction \(\sigma _{\mathrm {DPI}}\) is estimated to be \(0.06\pm 0.06\,{\text {pb}}\), where the uncertainty is conservatively assigned to be 100% of the value.

The resulting cross section predictions in the fiducial phase space at the hadron level, including the estimated hadronization and DPI corrections as needed, are compared in Fig. 3 with the measured value. Within one standard deviation the predictions agree with the measured cross section.

Fig. 3
figure 3

Comparison between the measured \({{\mathrm {W}}(\ell \nu )}+{\mathrm{{b}}\mathrm{{\overline{b}}}}\) cross section and various QCD predictions. The orange band indicates the uncertainty in the given sample associated with PDF choice and the yellow band represents the uncertainty associated with DPI. The labels 4F and 5F refer to the four- and five-flavour PDF schemes. In the case of the MadGraph + pythia 6 (5F) sample, the effects of DPI are already included in the generated samples so the DPI correction is not needed. The measured cross section is also shown with the total uncertainty in black and the luminosity, statistical, theoretical, and systematic uncertainties indicated

8 Summary

The cross section for the production of a W boson in association with two b jets was measured using a sample of proton–proton collisions at \(\sqrt{s} = 8{\,\mathrm{{TeV}}} \) collected by the CMS experiment. The data sample corresponds to an integrated luminosity of 19.8\(\,\text {fb}^\text {-1}\). The W bosons were reconstructed via their leptonic decays, \(\mathrm {W}\rightarrow \ell \nu \), where \(\ell =\mu \) or \(\mathrm {e}\). The fiducial region studied contains exactly one lepton with transverse momentum \(p_{\mathrm {T}} ^{\ell }>30\,\mathrm{{GeV}} \) and pseudorapidity \(|\eta ^{\ell } |<2.1\), with exactly two b jets with \(p_{\mathrm {T}} >25\,\mathrm{{GeV}} \) and \(|\eta |<2.4\) and no other jets with \(p_{\mathrm {T}} >25\,\mathrm{{GeV}} \) and \(|\eta |<4.7\). The cross section is \(\sigma ( {{\mathrm {p}\mathrm {p}}} \rightarrow {\mathrm {W}} (\ell \nu )+{\mathrm{{b}}\mathrm{{\overline{b}}}})= 0.64 \pm 0.03{\,\mathrm{{(stat)}}}\pm 0.10{\,\mathrm{{(syst)}}}\pm 0.06{\,\mathrm{{(theo)}}}\pm 0.02{\,\mathrm{{(lumi)}}}\,{\text {pb}}\), in agreement with standard model predictions.