1 Introduction

One of the most important objectives of the physics programme at the CERN LHC is to understand the mechanism behind electroweak symmetry breaking (EWSB). In the standard model (SM) [13] EWSB is achieved by a complex scalar doublet field that leads to the prediction of one physical Higgs boson (\(\mathrm{H}\)) [49]. Through Yukawa interactions, the Higgs scalar field can also account for fermion masses [1012].

In 2012 the ATLAS and CMS Collaborations at the LHC reported the observation of a new boson with mass near 125\(\,\text {GeV}\)  [1315], a value confirmed in later measurements [1618]. Subsequent studies of the production and decay rates [16, 1838] and of the spin-parity quantum numbers [16, 22, 3941] of the new boson show that its properties are compatible with those expected for the SM Higgs boson. The CDF and D0 experiments have also reported an excess of events consistent with the LHC observations [42, 43].

Standard model predictions have improved with time, and the results presented in this paper make use of a large number of theory tools and calculations [44168], summarized in Refs. [169171]. In proton-proton (pp) collisions at \(\sqrt{s}=\text {7--8}\,\text {TeV} \), the gluon-gluon fusion Higgs boson production mode (\(\mathrm{g} \mathrm{g} \mathrm{H} \)) has the largest cross section. It is followed by vector boson fusion (\(\mathrm{VBF}\)), associated \(\mathrm{W}\mathrm{H} \) and \(\mathrm{Z}\mathrm{H} \) production (\(\mathrm{V}\mathrm{H} \)), and production in association with a top quark pair (\(\mathrm{t}\mathrm{t}\mathrm{H} \)). The cross section values for the Higgs boson production modes and the values for the decay branching fractions, together with their uncertainties, are tabulated in Ref. [171] and regular online updates. For a Higgs boson mass of 125\(\,\text {GeV}\), the total production cross section is expected to be 17.5\(\text {\,pb}\) at \(\sqrt{s}=7\,\text {TeV} \) and 22.3\(\text {\,pb}\) at 8\(\,\text {TeV}\), and varies with the mass at a rate of about \(-1.6\,\%\) per \(\text {GeV}\).

This paper presents results from a comprehensive analysis combining the CMS measurements of the properties of the Higgs boson targeting its decay to \(\mathrm{b} \mathrm{b} \)  [21], \(\mathrm{W}\mathrm{W}\)  [22], \(\mathrm{Z}\mathrm{Z}\)  [16], \(\tau \tau \)  [23], \(\gamma \gamma \)  [18], and \(\mu \mu \)  [30] as well as measurements of the \(\mathrm{t}\mathrm{t}\mathrm{H} \) production mode [29] and searches for invisible decays of the Higgs boson [28]. For simplicity, \(\mathrm{b} \mathrm{b} \) is used to denote \(\mathrm{b}\overline{\mathrm{b}}\), \(\tau \tau \) to denote \(\mathrm{\tau }^+\mathrm{\tau }^-\), etc. Similarly, \(\mathrm{Z}\mathrm{Z}\) is used to denote \(\mathrm{Z}\mathrm{Z} ^{(*)}\) and \(\mathrm{W}\mathrm{W}\) to denote \(\mathrm{W}\mathrm{W} ^{(*)}\). The broad complementarity of measurements targeting different production and decay modes enables a variety of studies of the couplings of the new boson to be performed.

The different analyses have different sensitivities to the presence of the SM Higgs boson. The \(\mathrm{H} \rightarrow \gamma \gamma \) and \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \rightarrow 4\ell \) (where \(\ell = \mathrm{e},\mu \)) channels play a special role because of their high sensitivity and excellent mass resolution of the reconstructed diphoton and four-lepton final states, respectively. The \(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \rightarrow \ell \nu \ell \nu \) measurement has a high sensitivity due to large expected yields but relatively poor mass resolution because of the presence of neutrinos in the final state. The \(\mathrm{b} \mathrm{b} \) and \(\tau \tau \) decay modes are beset by large background contributions and have relatively poor mass resolution, resulting in lower sensitivity compared to the other channels; combining the results from \(\mathrm{b} \mathrm{b} \) and \(\tau \tau \), the CMS Collaboration has published evidence for the decay of the Higgs boson to fermions [172]. In the SM the \(\mathrm{g} \mathrm{g} \mathrm{H} \) process is dominated by a virtual top quark loop. However, the direct coupling of top quarks to the Higgs boson can be probed through the study of events tagged as having been produced via the \(\mathrm{t}\mathrm{t}\mathrm{H} \) process.

The mass of the Higgs boson is determined by combining the measurements performed in the \(\mathrm{H} \rightarrow \gamma \gamma \) and \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \rightarrow 4\ell \) channels [16, 18]. The SM Higgs boson is predicted to have even parity, zero electric charge, and zero spin. All its other properties can be derived if the boson’s mass is specified. To investigate the couplings of the Higgs boson to SM particles, we perform a combined analysis of all measurements to extract ratios between the observed coupling strengths and those predicted by the SM.

The couplings of the Higgs boson are probed for deviations in magnitude using the formalism recommended by the LHC Higgs Cross Section Working Group in Ref. [171]. This formalism assumes, among other things, that the observed state has quantum numbers \(J^{PC} =0^{++}\) and that the narrow-width approximation holds, leading to a factorization of the couplings in the production and decay of the boson.

The data sets were processed with updated alignment and calibrations of the CMS detector and correspond to integrated luminosities of up to 5.1\(\,\text {fb}^\text {-1}\) at \(\sqrt{s}=7\,\text {TeV} \) and 19.7\(\,\text {fb}^\text {-1}\) at 8\(\,\text {TeV}\) for pp collisions collected in 2011 and 2012. The central feature of the CMS detector is a 13\(\text {\,m}\) long superconducting solenoid of 6\(\text {\,m}\) internal diameter that generates a uniform 3.8\(\text {\,T}\) magnetic field parallel to the direction of the LHC beams. Within the solenoid volume are a silicon pixel and strip tracker, a lead tungstate crystal electromagnetic calorimeter, and a brass and scintillator hadron calorimeter. Muons are identified and measured in gas-ionization detectors embedded in the steel magnetic flux-return yoke of the solenoid. The detector is subdivided into a cylindrical barrel and two endcap disks. Calorimeters on either side of the detector complement the coverage provided by the barrel and endcap detectors. A more detailed description of the CMS detector, together with a definition of the coordinate system used and the relevant kinematic variables, can be found in Ref. [173].

This paper is structured as follows: Sect. 2 summarizes the analyses contributing to the combined measurements. Section 3 describes the statistical method used to extract the properties of the boson; some expected differences between the results of the combined analysis and those of the individual analyses are also explained. The results of the combined analysis are reported in the following four sections. A precise determination of the mass of the boson and direct limits on its width are presented in Sect. 4. We then discuss the significance of the observed excesses of events in Sect. 5. Finally, Sects. 6 and 7 present multiple evaluations of the compatibility of the data with the SM expectations for the magnitude of the Higgs boson’s couplings.

2 Inputs to the combined analysis

Table 1 provides an overview of all inputs used in this combined analysis, including the following information: the final states selected, the production and decay modes targeted in the analyses, the integrated luminosity used, the expected mass resolution, and the number of event categories in each channel.

Both Table 1 and the descriptions of the different inputs make use of the following notation. The expected relative mass resolution, \(\sigma _{m_\mathrm{{H}}}/m_\mathrm{{H}} \), is estimated using different \(\sigma _{m_\mathrm{{H}}}\) calculations: the \(\mathrm{H} \rightarrow \gamma \gamma \), \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \rightarrow 4\ell \), \(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \rightarrow \ell \nu \ell \nu \), and \(\mathrm{H} \rightarrow \mu \mu \) analyses quote \(\sigma _{m_\mathrm{{H}}}\) as half of the width of the shortest interval containing \(68.3\,\%\) of the signal events, the \(\mathrm{H} \rightarrow \tau \tau \) analysis quotes the RMS of the signal distribution, and the analysis of \(\mathrm{V}\mathrm{H} \) with \(\mathrm{H} \rightarrow \mathrm{b} \mathrm{b} \) quotes the standard deviation of the Gaussian core of a function that also describes non-Gaussian tails. Regarding leptons, \(\ell \) denotes an electron or a muon, \(\mathrm{\tau }_{\mathrm{h}}\) denotes a \(\mathrm{\tau }\) lepton identified via its decay into hadrons, and \(L\) denotes any charged lepton. Regarding lepton pairs, SF (DF) denotes same-flavour (different-flavour) pairs and SS (OS) denotes same-sign (opposite-sign) pairs. Concerning reconstructed jets, CJV denotes a central jet veto, \(p_{\mathrm{T}}\) is the magnitude of the transverse momentum vector, \(E_{\mathrm{T}}^{\text {miss}}\) refers to the magnitude of the missing transverse momentum vector, \({\mathrm{j}}\) stands for a reconstructed jet, and \(\mathrm{b} \) denotes a jet tagged as originating from the hadronization of a bottom quark.

Table 1 Summary of the channels in the analyses included in this combination. The first and second columns indicate which decay mode and production mechanism is targeted by an analysis. Notes on the expected composition of the signal are given in the third column. Where available, the fourth column specifies the expected relative mass resolution for the SM Higgs boson. Finally, the last columns provide the number of event categories and the integrated luminosity for the 7 and 8\(\,\text {TeV}\) data sets. The notation is explained in the text

2.1 \(\mathrm{H} \rightarrow \gamma \gamma \)

The \(\mathrm{H} \rightarrow \gamma \gamma \) analysis [18, 174] measures a narrow signal mass peak situated on a smoothly falling background due to events originating from prompt nonresonant diphoton production or due to events with at least one jet misidentified as an isolated photon.

The sample of selected events containing a photon pair is split into mutually exclusive event categories targeting the different Higgs boson production processes, as listed in Table 1. Requiring the presence of two jets with a large rapidity gap favours events produced by the \(\mathrm{VBF}\) mechanism, while event categories designed to preferentially select \(\mathrm{V}\mathrm{H} \) or \(\mathrm{t}\mathrm{t}\mathrm{H} \) production require the presence of muons, electrons, \(E_{\mathrm{T}}^{\text {miss}}\), a pair of jets compatible with the decay of a vector boson, or jets arising from the hadronization of bottom quarks. For 7\(\,\text {TeV}\) data, only one \(\mathrm{t}\mathrm{t}\mathrm{H} \)-tagged event category is used, combining the events selected by the leptonic \(\mathrm{t}\mathrm{t}\mathrm{H} \) and multijet \(\mathrm{t}\mathrm{t}\mathrm{H} \) selections. The 2-jet \(\mathrm{VBF}\)-tagged categories are further split according to a multivariate (MVA) classifier that is trained to discriminate \(\mathrm{VBF}\) events from both background and \(\mathrm{g} \mathrm{g} \mathrm{H} \) events.

Fewer than 1 % of the selected events are tagged according to production mode. The remaining “untagged” events are subdivided into different categories based on the output of an MVA classifier that assigns a high score to signal-like events and to events with a good mass resolution, based on a combination of (i) an event-by-event estimate of the diphoton mass resolution, (ii) a photon identification score for each photon, and (iii) kinematic information about the photons and the diphoton system. The photon identification score is obtained from a separate MVA classifier that uses shower shape information and variables characterizing how isolated the photon candidate is to discriminate prompt photons from those arising in jets.

The same event categories and observables are used for the mass measurement and to search for deviations in the magnitudes of the scalar couplings of the Higgs boson.

In each event category, the background in the signal region is estimated from a fit to the observed diphoton mass distribution in data. The uncertainty due to the choice of function used to describe the background is incorporated into the statistical procedure: the likelihood maximization is also performed for a discrete variable that selects which of the functional forms is evaluated. This procedure is found to have correct coverage probability and negligible bias in extensive tests using pseudo-data extracted from fits of multiple families of functional forms to the data. By construction, this “discrete profiling” of the background functional form leads to confidence intervals for any estimated parameter that are at least as large as those obtained when considering any single functional form. Uncertainty in the parameters of the background functional forms contributes to the statistical uncertainty of the measurements.

2.2 \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \)

In the \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \rightarrow 4\ell \) analysis [16, 175], we measure a four-lepton mass peak over a small continuum background. To further separate signal and background, we build a discriminant, \(\mathcal {D}^{\text {kin}}_\text {bkg}\), using the leading-order matrix elements for signal and background. The value of \(\mathcal {D}^{\text {kin}}_\text {bkg}\) is calculated from the observed kinematic variables, namely the masses of the two dilepton pairs and five angles, which uniquely define a four-lepton configuration in its centre-of-mass frame.

Given the different mass resolutions and different background rates arising from jets misidentified as leptons, the \(4\mathrm{\mu } \), \(2\mathrm{e}2\mathrm{\mu }/2\mathrm{\mu }2\mathrm{e} \), and \(4\mathrm{e} \) event categories are analysed separately. A stricter dilepton mass selection is performed for the lepton pair with invariant mass closest to the nominal \(\mathrm{Z}\) boson mass.

The dominant irreducible background in this channel is due to nonresonant \(\mathrm{Z}\mathrm{Z}\) production with both \(\mathrm{Z}\) bosons decaying to a pair of charged leptons and is estimated from simulation. The smaller reducible backgrounds with misidentified leptons, mainly from the production of \(\mathrm{Z}+\text {jets}\), top quark pairs, and \(\mathrm{W}\mathrm{Z}+\text {jets}\), are estimated from data.

For the mass measurement an event-by-event estimator of the mass resolution is built from the single-lepton momentum resolutions evaluated from the study of a large number of \(\mathrm{J}/\psi \rightarrow \mu \mu \) and \(\mathrm{Z}\rightarrow \ell \ell \) data events. The relative mass resolution, \(\sigma _{m_{4\ell }}/m_{4\ell }\), is then used together with \(m_{4\ell }\) and \(\mathcal {D}^{\text {kin}}_\text {bkg}\) to measure the mass of the boson.

To increase the sensitivity to the different production mechanisms, the event sample is split into two categories based on jet multiplicity: (i) events with fewer than two jets and (ii) events with at least two jets. In the first category, the four-lepton transverse momentum is used to discriminate \(\mathrm{VBF}\) and \(\mathrm{V}\mathrm{H} \) production from \(\mathrm{g} \mathrm{g} \mathrm{H} \) production. In the second category, a linear discriminant, built from the values of the invariant mass of the two leading jets and their pseudorapidity difference, is used to separate the \(\mathrm{VBF}\) and \(\mathrm{g} \mathrm{g} \mathrm{H} \) processes.

2.3 \(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \)

In the \(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \) analysis [22], we measure an excess of events with two OS leptons or three charged leptons with a total charge of \(\pm 1\), moderate \(E_{\mathrm{T}}^{\text {miss}}\), and up to two jets.

The two-lepton events are divided into eight categories, with different background compositions and signal-to-background ratios. The events are split into SF and DF dilepton event categories, since the background from Drell–Yan production (\(\mathrm{q} \mathrm{q} \rightarrow \gamma ^{*}/\mathrm{Z}^{(*)} \rightarrow \ell \ell \)) is much larger for SF dilepton events. For events with no jets, the main background is due to nonresonant \(\mathrm{W}\mathrm{W}\) production. For events with one jet, the dominant backgrounds are nonresonant \(\mathrm{W}\mathrm{W}\) production and top quark production. The 2-jet \(\mathrm{VBF}\) tag is optimized to take advantage of the \(\mathrm{VBF}\) production signature and the main background is due to top quark production. The 2-jet \(\mathrm{V}\mathrm{H} \) tag targets the decay of the vector boson into two jets, \({\mathrm{V}}\rightarrow \text {jj}\). The selection requires two centrally-produced jets with invariant mass in the range \(65<m_{\text {jj}}<105\,\text {GeV} \). To reduce the top quark, Drell–Yan, and \(\mathrm{W}\mathrm{W}\) backgrounds in all previous categories, a selection is performed on the dilepton mass and on the angular separation between the leptons. All background rates, except for very small contributions from \(\mathrm{W}\mathrm{Z}\), \(\mathrm{Z}\mathrm{Z}\), and \(\mathrm{W}\mathrm{\gamma }\) production, are evaluated from data. The two-dimensional distribution of events in the \((m_{\ell \ell },m_{\text {T}})\) plane is used for the measurements in the DF dilepton categories with zero and one jets; \(m_{\ell \ell }\) is the invariant mass of the dilepton and \(m_{\text {T}} \) is the transverse mass reconstructed from the dilepton transverse momentum and the \(E_{\mathrm{T}}^{\text {miss}}\) vector. For the DF 2-jet \(\mathrm{VBF}\) tag the binned distribution of \(m_{\ell \ell }\) is used. For the SF dilepton categories and for the 2-jet \(\mathrm{V}\mathrm{H} \) tag channel, only the total event counts are used.

In the \(3\ell 3\nu \) channel targeting the \(\mathrm{W}\mathrm{H} \rightarrow \mathrm{W}\mathrm{W}\mathrm{W}\) process, we search for an excess of events with three leptons, electrons or muons, large \(E_{\mathrm{T}}^{\text {miss}}\), and low hadronic activity. The dominant background is due to \(\mathrm{W}\mathrm{Z} \rightarrow 3\ell \nu \) production, which is largely reduced by requiring that all SF and OS lepton pairs have invariant masses away from the \(\mathrm{Z}\) boson mass. The smallest angular distance between OS reconstructed lepton tracks is the observable chosen to perform the measurement. The background processes with jets misidentified as leptons, e.g. \(\mathrm{Z}+\text {jets}\) and top quark production, as well as the \(\mathrm{W}\mathrm{Z} \rightarrow 3\ell \nu \) background, are estimated from data. The small contribution from the \(\mathrm{Z}\mathrm{Z} \rightarrow 4\ell \) process with one of the leptons escaping detection is estimated using simulated samples. In the \(3\ell 3\nu \) channel, up to 20 % of the signal events are expected to be due to \(\mathrm{H} \rightarrow \tau \tau \) decays.

In the \(3\ell \nu \text {jj}\) channel, targeting the \(\mathrm{Z}\mathrm{H} \rightarrow \mathrm{Z}+\mathrm{W}\mathrm{W}\rightarrow \ell \ell +\ell ^{\prime }\nu \text {jj}\) process, we first identify the leptonic decay of the \(\mathrm{Z}\) boson and then require the dijet system to satisfy \(|m_{\text {jj}}-m_{\mathrm{W}}|\le 60\,\text {GeV} \). The transverse mass of the \(\ell \nu \text {jj}\) system is the observable chosen to perform the measurement. The main backgrounds are due to the production of \(\mathrm{W}\mathrm{Z}\), \(\mathrm{Z}\mathrm{Z}\), and tribosons, as well as processes involving nonprompt leptons. The first three are estimated from simulated samples, while the last one is evaluated from data.

Finally, a dedicated analysis for the measurement of the boson mass is performed in the 0-jet and 1-jet categories in the \(\mathrm{e}\mu \) channel, employing observables that are extensively used in searches for supersymmetric particles. A resolution of 16–17 % for \(m_\mathrm{{H}} =125\,\text {GeV} \) has been achieved.

2.4 \(\mathrm{H} \rightarrow \tau \tau \)

The \(\mathrm{H} \rightarrow \tau \tau \) analysis [23] measures an excess of events over the SM background expectation using multiple final-state signatures. For the \(\mathrm{e}\mathrm{\mu }\), \(\mathrm{e}\mathrm{\tau }_{\mathrm{h}} \), \(\mathrm{\mu }\mathrm{\tau }_{\mathrm{h}} \), and \(\mathrm{\tau }_{\mathrm{h}} \mathrm{\tau }_{\mathrm{h}} \) final states, where electrons and muons arise from leptonic \(\mathrm{\tau }\) decays, the event samples are further divided into categories based on the number of reconstructed jets in the event: 0 jets, 1 jet, or 2 jets. The 0-jet and 1-jet categories are further subdivided according to the reconstructed \(p_{\mathrm{T}} \) of the leptons. The 2-jet categories require a \(\mathrm{VBF}\)-like topology and are subdivided according to selection criteria applied to the dijet kinematic properties. In each of these categories, we search for a broad excess in the reconstructed \(\tau \tau \) mass distribution. The 0-jet category is used to constrain background normalizations, identification efficiencies, and energy scales. Various control samples in data are used to evaluate the main irreducible background from \(\mathrm{Z}\rightarrow \tau \tau \) production and the largest reducible backgrounds from \(\mathrm{W}+\text {jets}\) and multijet production. The \(\mathrm{e}\mathrm{e}\) and \(\mathrm{\mu }\mathrm{\mu }\) final states are similarly subdivided into jet categories as above, but the search is performed on the combination of two MVA discriminants. The first is trained to distinguish \(\mathrm{Z}\rightarrow \ell \ell \) events from \(\mathrm{Z}\rightarrow \tau \tau \) events while the second is trained to separate \(\mathrm{Z}\rightarrow \tau \tau \) events from \(\mathrm{H} \rightarrow \tau \tau \) events. The expected SM Higgs boson signal in the \(\mathrm{e}\mathrm{\mu }\), \(\mathrm{e}\mathrm{e}\), and \(\mathrm{\mu }\mathrm{\mu }\) categories has a sizeable contribution from \(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \) decays: 17–24 % in the \(\mathrm{e}\mathrm{e}\) and \(\mathrm{\mu }\mathrm{\mu }\) event categories, and 23–45 % in the \(\mathrm{e}\mathrm{\mu }\) categories, as shown in Table 1.

The search for \(\tau \tau \) decays of Higgs bosons produced in association with a \(\mathrm{W}\) or \(\mathrm{Z}\) boson is conducted in events where the vector bosons are identified through the \(\mathrm{W}\rightarrow \ell \nu \) or \(\mathrm{Z}\rightarrow \ell \ell \) decay modes. The analysis targeting \(\mathrm{W}\mathrm{H} \) production selects events that have electrons or muons and one or two hadronically decaying tau leptons: \(\mathrm{\mu }+\mathrm{\mu }\mathrm{\tau }_{\mathrm{h}} \), \(\mathrm{e}+\mathrm{\mu }\mathrm{\tau }_{\mathrm{h}} \) or \(\mathrm{\mu }+\mathrm{e}\mathrm{\tau }_{\mathrm{h}} \), \(\mathrm{\mu }+\mathrm{\tau }_{\mathrm{h}} \mathrm{\tau }_{\mathrm{h}} \), and \(\mathrm{e}+\mathrm{\tau }_{\mathrm{h}} \mathrm{\tau }_{\mathrm{h}} \). The analysis targeting \(\mathrm{Z}\mathrm{H} \) production selects events with an identified \(\mathrm{Z}\rightarrow \ell \ell \) decay and a Higgs boson candidate decaying to \(\mathrm{e}\mathrm{\mu }\), \(\mathrm{e}\mathrm{\tau }_{\mathrm{h}} \), \(\mathrm{\mu }\mathrm{\tau }_{\mathrm{h}} \), or \(\mathrm{\tau }_{\mathrm{h}} \mathrm{\tau }_{\mathrm{h}} \). The main irreducible backgrounds to the \(\mathrm{W}\mathrm{H} \) and \(\mathrm{Z}\mathrm{H} \) searches are \(\mathrm{W}\mathrm{Z}\) and \(\mathrm{Z}\mathrm{Z}\) diboson events, respectively. The irreducible backgrounds are estimated using simulated event samples corrected by measurements from control samples in data. The reducible backgrounds in both analyses are due to the production of \(\mathrm{W}\) bosons, \(\mathrm{Z}\) bosons, or top quark pairs with at least one jet misidentified as an isolated \(\mathrm{e}\), \(\mathrm{\mu }\), or \(\mathrm{\tau }_{\mathrm{h}} \). These backgrounds are estimated exclusively from data by measuring the probability for jets to be misidentified as isolated leptons in background-enriched control regions, and weighting the selected events that fail the lepton requirements with the misidentification probability. For the SM Higgs boson, the expected fraction of \(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \) events in the \(\mathrm{Z}\mathrm{H} \) analysis is 10–15 % for the \(\mathrm{Z}\mathrm{H} \rightarrow \mathrm{Z}+\ell \mathrm{\tau }_{\mathrm{h}} \) channel and 70 % for the \(\mathrm{Z}\mathrm{H} \rightarrow \mathrm{Z}+\mathrm{e}\mathrm{\mu } \) channel, as shown in Table 1.

2.5 \(\mathrm{V}\mathrm{H} \) with \(\mathrm{H} \rightarrow \mathrm{b} \mathrm{b} \)

Exploiting the large expected \(\mathrm{H} \rightarrow \mathrm{b} \mathrm{b} \) branching fraction, the analysis of \(\mathrm{V}\mathrm{H} \) production and \(\mathrm{H} \rightarrow \mathrm{b} \mathrm{b} \) decay examines the \(\mathrm{W}(\ell \nu )\mathrm{H} (\mathrm{b} \mathrm{b} )\), \(\mathrm{W}(\mathrm{\tau }_{\mathrm{h}} \nu )\mathrm{H} (\mathrm{b} \mathrm{b} )\), \(\mathrm{Z}(\ell \ell )\mathrm{H} (\mathrm{b} \mathrm{b} )\), and \(\mathrm{Z}(\nu \nu )\mathrm{H} (\mathrm{b} \mathrm{b} )\) topologies [21].

The Higgs boson candidate is reconstructed by requiring two b-tagged jets. The event sample is divided into categories defined by the transverse momentum of the vector boson, \(p_{\mathrm{T}} ({\mathrm{V}})\). An MVA regression is used to estimate the true energy of the bottom quark after being trained on reconstructed b jets in simulated \(\mathrm{H} \rightarrow \mathrm{b} \mathrm{b} \) events. This regression algorithm achieves a dijet mass resolution of about 10 % for \(m_\mathrm{{H}} = 125\,\text {GeV} \). The performance of the regression algorithm is checked with data, where it is observed to improve the top quark mass scale and resolution in top quark pair events and to improve the \(p_{\mathrm{T}}\) balance between a \(\mathrm{Z}\) boson and b jets in \(\mathrm{Z}(\rightarrow \ell \ell )+\mathrm{b} \mathrm{b} \) events. Events with higher \(p_{\mathrm{T}} ({\mathrm{V}})\) have smaller backgrounds and better dijet mass resolution. A cascade of MVA classifiers, trained to distinguish the signal from top quark pairs, \(\text {V}+\text {jets}\), and diboson events, is used to improve the sensitivity in the \(\mathrm{W}(\ell \nu )\mathrm{H} (\mathrm{b} \mathrm{b} )\), \(\mathrm{W}(\mathrm{\tau }_{\mathrm{h}} \nu )\mathrm{H} (\mathrm{b} \mathrm{b} )\), and \(\mathrm{Z}(\nu \nu )\mathrm{H} (\mathrm{b} \mathrm{b} )\) channels. The rates of the main backgrounds, consisting of \(\text {V}+\text {jets}\) and top quark pair events, are derived from signal-depleted data control samples. The \(\mathrm{W}\mathrm{Z}\) and \(\mathrm{Z}\mathrm{Z}\) backgrounds where \(\mathrm{Z}\rightarrow \mathrm{b} \mathrm{b} \), as well as the single top quark background, are estimated from simulated samples. The MVA classifier output distribution is used as the final discriminant in performing measurements.

At the time of publication of Ref. [21], the simulation of the \(\mathrm{Z}\mathrm{H} \) signal process included only \(\mathrm{q}\mathrm{\overline{q}}\)-initiated diagrams. Since then, a more accurate prediction of the \(p_{\mathrm{T}} (\mathrm{Z})\) distribution has become available, taking into account the contribution of the gluon-gluon initiated associated production process \(\mathrm{g} \mathrm{g} \rightarrow \mathrm{Z}\mathrm{H} \), which is included in the results presented in this paper. The calculation of the \(\mathrm{g} \mathrm{g} \rightarrow \mathrm{Z}\mathrm{H} \) contribution includes next-to-leading order (NLO) effects [176179] and is particularly important given that the \(\mathrm{g} \mathrm{g} \rightarrow \mathrm{Z}\mathrm{H} \) process contributes to the most sensitive categories of the analysis. This treatment represents a significant improvement with respect to Ref. [21], as discussed in Sect. 3.4.

2.6 \(\mathrm{t}\mathrm{t}\mathrm{H} \) production

Given its distinctive signature, the \(\mathrm{t}\mathrm{t}\mathrm{H} \) production process can be tagged using the decay products of the top quark pair. The search for \(\mathrm{t}\mathrm{t}\mathrm{H} \) production is performed in four main channels: \(\mathrm{H} \rightarrow \gamma \gamma \), \(\mathrm{H} \rightarrow \mathrm{b} \mathrm{b} \), \(\mathrm{H} \rightarrow \mathrm{\tau }_{\mathrm{h}} \mathrm{\tau }_{\mathrm{h}} \), and \(\mathrm{H} \rightarrow \text {leptons}\)  [19, 29]. The \(\mathrm{t}\mathrm{t}\mathrm{H} \) search in \(\mathrm{H} \rightarrow \gamma \gamma \) events is described in Sect. 2.1; the following focuses on the other three topologies.

In the analysis of \(\mathrm{t}\mathrm{t}\mathrm{H} \) production with \(\mathrm{H} \rightarrow \mathrm{b} \mathrm{b} \), two signatures for the top quark pair decay are considered: lepton+jets (\(\mathrm{t}\overline{\mathrm{t}} \rightarrow \ell \nu \text {jj}\text {bb}\)) and dilepton (\(\mathrm{t}\overline{\mathrm{t}} \rightarrow \ell \nu \ell \nu \text {bb}\)). In the analysis of \(\mathrm{t}\mathrm{t}\mathrm{H} \) production with \(\mathrm{H} \rightarrow \mathrm{\tau }_{\mathrm{h}} \mathrm{\tau }_{\mathrm{h}} \), the \(\mathrm{t}\overline{\mathrm{t}} \) lepton+jets decay signature is required. In both channels, the events are further classified according to the numbers of identified jets and b-tagged jets. The major background is from top-quark pair production accompanied by extra jets. An MVA is trained to discriminate between background and signal events using information related to reconstructed object kinematic properties, event shape, and the discriminant output from the b-tagging algorithm. The rates of background processes are estimated from simulated samples and are constrained through a simultaneous fit to background-enriched control samples.

The analysis of \(\mathrm{t}\mathrm{t}\mathrm{H} \) production with \(\mathrm{H} \rightarrow \text {leptons}\) is mainly sensitive to Higgs boson decays to \(\mathrm{W}\mathrm{W}\), \(\tau \tau \), and \(\mathrm{Z}\mathrm{Z}\), with subsequent decay to electrons and/or muons. The selection starts by requiring the presence of at least two central jets and at least one b jet. It then proceeds to categorize the events according to the number, charge, and flavour of the reconstructed leptons: \(2\ell \) SS, \(3\ell \) with a total charge of \(\pm 1\), and \(4\ell \). A dedicated MVA lepton selection is used to suppress the reducible background from nonprompt leptons, usually from the decay of b hadrons. After the final selection, the two main sources of background are nonprompt leptons, which is evaluated from data, and associated production of top quark pairs and vector bosons, which is estimated from simulated samples. Measurements in the \(4\ell \) event category are performed using the number of reconstructed jets, \(N_\text {j}\). In the \(2\ell \) SS and \(3\ell \) categories, an MVA classifier is employed, which makes use of \(N_\text {j}\) as well as other kinematic and event shape variables to discriminate between signal and background.

2.7 Searches for Higgs boson decays into invisible particles

The search for a Higgs boson decaying into particles that escape direct detection, denoted as \(\mathrm{H} ({\mathrm{inv}})\) in what follows, is performed using \(\mathrm{VBF}\)-tagged events and \(\mathrm{Z}\mathrm{H} \)-tagged events [28]. The \(\mathrm{Z}\mathrm{H} \) production mode is tagged via the \(\mathrm{Z}\rightarrow \ell \ell \) or \(\mathrm{Z}\rightarrow \mathrm{b} \mathrm{b} \) decays. For this combined analysis, only the \(\mathrm{VBF}\)-tagged and \(\mathrm{Z}\rightarrow \ell \ell \) channels are used; the event sample of the less sensitive \(\mathrm{Z}\rightarrow \mathrm{b} \mathrm{b} \) analysis overlaps with that used in the analysis of \(\mathrm{V}\mathrm{H} \) with \(\mathrm{H} \rightarrow \mathrm{b} \mathrm{b} \) decay described in Sect. 2.5 and is not used in this combined analysis.

The \(\mathrm{VBF}\)-tagged event selection is performed only on the 8\(\,\text {TeV}\) data and requires a dijet mass above 1100\(\,\text {GeV}\) as well as a large separation of the jets in pseudorapidity, \(\eta \). The \(E_{\mathrm{T}}^{\text {miss}}\) is required to be above 130\(\,\text {GeV}\) and events with additional jets with \(p_{\mathrm{T}} >30\,\text {GeV} \) and a value of \(\eta \) between those of the tagging jets are rejected. The single largest background is due to the production of \(\mathrm{Z}(\nu \nu )+\text {jets}\) and is estimated from data using a sample of events with visible \(\mathrm{Z}\rightarrow \mathrm{\mu }\mathrm{\mu }\) decays that also satisfy the dijet selection requirements above. To extract the results, a one bin counting experiment is performed in a region where the expected signal-to-background ratio is 0.7, calculated assuming the Higgs boson is produced with the SM cross section but decays only into invisible particles.

The event selection for \(\mathrm{Z}\mathrm{H} \) with \(\mathrm{Z}\rightarrow \ell \ell \) rejects events with two or more jets with \(p_{\mathrm{T}} >30\,\text {GeV} \). The remaining events are categorized according to the \(\mathrm{Z}\) boson decay into \(\mathrm{e}\mathrm{e}\) or \(\mu \mu \) and the number of identified jets, zero or one. For the 8\(\,\text {TeV}\) data, the results are extracted from a two-dimensional fit to the azimuthal angular difference between the leptons and the transverse mass of the system composed of the dilepton and the missing transverse energy in the event. Because of the smaller amount of data in the control samples used for modelling the backgrounds in the signal region, the results for the 7\(\,\text {TeV}\) data set are based on a fit to the aforementioned transverse mass variable only. For the 0-jet categories the signal-to-background ratio varies between 0.24 and 0.28, while for the 1-jet categories it varies between 0.15 and 0.18, depending on the \(\mathrm{Z}\) boson decay channel and the data set (7 or 8\(\,\text {TeV}\)). The signal-to-background ratio increases as a function of the transverse mass variable.

The data from these searches are used for results in Sects. 7.5 and 7.8, where the partial widths for invisible and/or undetected decays of the Higgs boson are probed.

2.8 \(\mathrm{H} \rightarrow \mu \mu \)

The \(\mathrm{H} \rightarrow \mu \mu \) analysis [30] is a search in the distribution of the dimuon invariant mass, \(m_{\mu \mu }\), for a narrow signal peak over a smoothly falling background dominated by Drell–Yan and top quark pair production. A sample of events with a pair of OS muons is split into mutually exclusive categories of differing expected signal-to-background ratios, based on the event topology and kinematic properties. Events with two or more jets are assigned to 2-jet categories, while the remaining events are assigned to untagged categories. The 2-jet events are divided into three categories using selection criteria based on the properties of the dimuon and the dijet systems: a VBF-tagged category, a boosted dimuon category, and a category with the remaining 2-jet events. The untagged events are distributed among twelve categories based on the dimuon \(p_{\mathrm{T}}\) and the pseudorapidity of the two muons, which are directly related to the \(m_{\mu \mu }\) experimental resolution.

The \(m_{\mu \mu }\) spectrum in each event category is fitted with parameterized signal and background shapes to estimate the number of signal events, in a procedure similar to that of the \(\mathrm{H} \rightarrow \gamma \gamma \) analysis, described in Sect. 2.1. The uncertainty due to the choice of the functional form used to model the background is incorporated in a different manner than in the \(\mathrm{H} \rightarrow \gamma \gamma \) analysis, namely by introducing an additive systematic uncertainty in the number of expected signal events. This uncertainty is estimated by evaluating the bias of the signal function plus nominal background function when fitted to pseudo-data generated from alternative background functions. The largest absolute value of this difference for all the alternative background functions considered and Higgs boson mass hypotheses between 120 and 150\(\,\text {GeV}\) is taken as the systematic uncertainty and applied uniformly for all Higgs boson mass hypotheses. The effect of these systematic uncertainties on the final result is sizeable, about 75 % of the overall statistical uncertainty.

The data from this analysis are used for the results in Sect. 7.4, where the scaling of the couplings with the mass of the involved particles is explored.

3 Combination methodology

The combination of Higgs boson measurements requires the simultaneous analysis of the data selected by all individual analyses, accounting for all statistical uncertainties, systematic uncertainties, and their correlations.

The overall statistical methodology used in this combination was developed by the ATLAS and CMS Collaborations in the context of the LHC Higgs Combination Group and is described in Refs. [15, 180, 181]. The chosen test statistic, \(q\), is based on the profile likelihood ratio and is used to determine how signal-like or background-like the data are. Systematic uncertainties are incorporated in the analysis via nuisance parameters that are treated according to the frequentist paradigm. Below we give concise definitions of statistical quantities that we use for characterizing the outcome of the measurements. Results presented herein are obtained using asymptotic formulae [182], including routines available in the RooStats package [183].

3.1 Characterizing an excess of events: \(p\text {-value}\) and significance

To quantify the presence of an excess of events over the expected background we use the test statistic where the likelihood appearing in the numerator corresponds to the background-only hypothesis:

$$\begin{aligned} q_{0} = - 2 \ln \frac{\mathcal {L}(\text {data} \, | \, b, \, \hat{\theta }_{0} ) }{\mathcal {L}(\text {data} \, | \, \hat{\mu }\, s + b, \, \hat{\theta }) },\quad \text {with }\hat{\mu }>0, \end{aligned}$$
(1)

where \(s\) stands for the signal expected for the SM Higgs boson, \(\mu \) is a signal strength modifier introduced to accommodate deviations from the SM Higgs boson predictions, \(b\) stands for backgrounds, and \(\theta \) represents nuisance parameters describing systematic uncertainties. The value \(\hat{\theta }_{0}\) maximizes the likelihood in the numerator under the background-only hypothesis, \(\mu =0\), while \(\hat{\mu }\) and \(\hat{\theta }\) define the point at which the likelihood reaches its global maximum.

The quantity \(p_0\), henceforth referred to as the local \(p\text {-value}\), is defined as the probability, under the background-only hypothesis, to obtain a value of \(q_0\) at least as large as that observed in data, \(q_0^\text {data}\):

$$\begin{aligned} p_0 = {\mathrm{P}}\left( q_0 \ge q_0^\text {data} \, \Big | \, b\right) . \end{aligned}$$
(2)

The local significance \(z\) of a signal-like excess is then computed according to the one-sided Gaussian tail convention:

$$\begin{aligned} p_0 = \int _{z}^{+\infty } \frac{1}{\sqrt{2\pi }} \exp (-x^2/2) \, \mathrm{d}x. \end{aligned}$$
(3)

It is important to note that very small \(p\text {-values}\) should be interpreted with caution, since systematic biases and uncertainties in the underlying model are only known to a given precision.

3.2 Extracting signal model parameters

Signal model parameters \(a\), such as the signal strength modifier \(\mu \), are evaluated from scans of the profile likelihood ratio \(q(a)\):

$$\begin{aligned} q(a) = -2 \Delta \ln \mathcal {L} = - 2 \ln \frac{\mathcal {L}(\text {data} \, | \, s(a) + b, \, \hat{\theta }_{a} ) }{\mathcal {L}(\text {data} \, | \, s(\hat{a}) + b, \, \hat{\theta }) } . \end{aligned}$$
(4)

The parameter values \(\hat{a}\) and \(\hat{\theta }\) correspond to the global maximum likelihood and are called the best-fit set. The post-fit model, obtained using the best-fit set, is used when deriving expected quantities. The post-fit model corresponds to the parametric bootstrap described in the statistics literature and includes information gained in the fit regarding the values of all parameters [184, 185].

The 68 and 95 % confidence level (CL) confidence intervals for a given parameter of interest, \(a_i\), are evaluated from \(q(a_i)=1.00\) and \(q(a_i)=3.84\), respectively, with all other unconstrained model parameters treated in the same way as the nuisance parameters. The two-dimensional (2D) 68 and 95 % CL confidence regions for pairs of parameters are derived from \(q(a_i, a_j) = 2.30\) and \(q(a_i, a_j) = 5.99\), respectively. This implies that boundaries of 2D confidence regions projected on either parameter axis are not identical to the one-dimensional (1D) confidence interval for that parameter. All results are given using the chosen test statistic, leading to approximate CL confidence intervals when there are no large non-Gaussian uncertainties [186188], as is the case here. If the best-fit value is on a physical boundary, the theoretical basis for computing intervals in this manner is lacking. However, we have found that for the results in this paper, the intervals in those conditions are numerically similar to those obtained by the method of Ref. [189].

3.3 Grouping of channels by decay and production tags

The event samples selected by each of the different analyses are mutually exclusive. The selection criteria can, in many cases, define high-purity selections of the targeted decay or production modes, as shown in Table 1. For example, the \(\mathrm{t}\mathrm{t}\mathrm{H} \)-tagged event categories of the \(\mathrm{H} \rightarrow \gamma \gamma \) analysis are pure in terms of \(\gamma \gamma \) decays and are expected to contain less than 10 % of non-\(\mathrm{t}\mathrm{t}\mathrm{H} \) events. However, in some cases such purities cannot be achieved for both production and decay modes.

Mixed production mode composition is common in \(\mathrm{VBF}\)-tagged event categories where the \(\mathrm{g} \mathrm{g} \mathrm{H} \) contribution can be as high as 50 %, and in \(\mathrm{V}\mathrm{H} \) tags where \(\mathrm{W}\mathrm{H} \) and \(\mathrm{Z}\mathrm{H} \) mixtures are common.

For decay modes, mixed composition is more marked for signatures involving light leptons and \(E_{\mathrm{T}}^{\text {miss}}\), where both the \(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \) and \(\mathrm{H} \rightarrow \tau \tau \) decays may contribute. This can be seen in Table 1, where some \(\mathrm{V}\mathrm{H} \)-tag analyses targeting \(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \) decays have a significant contribution from \(\mathrm{H} \rightarrow \tau \tau \) decays and vice versa. This is also the case in the \(\mathrm{e}\mathrm{\mu }\) channel in the \(\mathrm{H} \rightarrow \tau \tau \) analysis, in particular in the 2-jet \(\mathrm{VBF}\) tag categories, where the contribution from \(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \) decays is sizeable and concentrated at low values of \(m_{\tau \tau }\), entailing a genuine sensitivity of these categories to \(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \) decays. On the other hand, in the \(\mathrm{e}\mathrm{e}\) and \(\mathrm{\mu }\mathrm{\mu }\) channels of the \(\mathrm{H} \rightarrow \tau \tau \) analysis, the contribution from \(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \) is large when integrated over the full range of the MVA observable used, but given that the analysis is optimized for \(\tau \tau \) decays the contribution from \(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \) is not concentrated in the regions with largest signal-to-background ratio, and provides little added sensitivity.

Another case of mixed decay mode composition is present in the analyses targeting \(\mathrm{t}\mathrm{t}\mathrm{H} \) production, where the \(\mathrm{H} \rightarrow \text {leptons}\) decay selection includes sizeable contributions from \(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \) and \(\mathrm{H} \rightarrow \tau \tau \) decays, and to a lesser extent also from \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \) decays. The mixed composition is a consequence of designing the analysis to have the highest possible sensitivity to the \(\mathrm{t}\mathrm{t}\mathrm{H} \) production mode. The analysis of \(\mathrm{t}\mathrm{t}\mathrm{H} \) with \(\mathrm{H} \rightarrow \mathrm{\tau }_{\mathrm{h}} \mathrm{\tau }_{\mathrm{h}} \) decay has an expected signal composition that is dominated by \(\mathrm{H} \rightarrow \tau \tau \) decays, followed by \(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \) decays, and a smaller contribution of \(\mathrm{H} \rightarrow \mathrm{b} \mathrm{b} \) decays. Finally, in the analysis of \(\mathrm{t}\mathrm{t}\mathrm{H} \) with \(\mathrm{H} \rightarrow \mathrm{b} \mathrm{b} \), there is an event category of the \(\text {lepton}+\text {jets}\) channel that requires six or more jets and two b-tagged jets where the signal composition is expected to be 58 % from \(\mathrm{H} \rightarrow \mathrm{b} \mathrm{b} \) decays, 24 % from \(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \) decays, and the remaining 18 % from other SM decay modes; in the dilepton channel, the signal composition in the event category requiring four or more jets and two b-tagged jets is expected to be 45 % from \(\mathrm{H} \rightarrow \mathrm{b} \mathrm{b} \) decays, 35 % from \(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \) decays, and 14 % from \(\mathrm{H} \rightarrow \tau \tau \) decays.

When results are grouped according to the decay tag, each individual category is assigned to the decay mode group that, in the SM, is expected to dominate the sensitivity in that channel. In particular,

  • \(\mathrm{H} \rightarrow \gamma \gamma \) tagged includes only categories from the \(\mathrm{H} \rightarrow \gamma \gamma \) analysis of Ref. [18].

  • \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \) tagged includes only categories from the \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \) analysis of Ref. [16].

  • \(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \) tagged includes all the channels from the \(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \) analysis of Ref. [22] and the channels from the analysis of \(\mathrm{t}\mathrm{t}\mathrm{H} \) with \(\mathrm{H} \rightarrow \text {leptons}\) of Ref. [29].

  • \(\mathrm{H} \rightarrow \tau \tau \) tagged includes all the channels from the \(\mathrm{H} \rightarrow \tau \tau \) analysis of Ref. [23] and the channels from the analysis of \(\mathrm{t}\mathrm{t}\mathrm{H} \) targeting \(\mathrm{H} \rightarrow \mathrm{\tau }_{\mathrm{h}} \mathrm{\tau }_{\mathrm{h}} \) of Ref. [29].

  • \(\mathrm{H} \rightarrow \mathrm{b} \mathrm{b} \) tagged includes all the channels of the analysis of \(\mathrm{V}\mathrm{H} \) with \(\mathrm{H} \rightarrow \mathrm{b} \mathrm{b} \) of Ref. [21] and the channels from the analysis of \(\mathrm{t}\mathrm{t}\mathrm{H} \) targeting \(\mathrm{H} \rightarrow \mathrm{b} \mathrm{b} \) of Ref. [29].

  • \(\mathrm{H} \rightarrow \mu \mu \) tagged includes only categories from the \(\mathrm{H} \rightarrow \mu \mu \) analysis of Ref. [30].

When results are grouped by the production tag, the same reasoning of assignment by preponderance of composition is followed, using the information in Table 1.

In the combined analyses, all contributions in a given production tag or decay mode group are considered as signal and scaled accordingly.

3.4 Expected differences with respect to the results of input analyses

The grouping of channels described in Sect. 3.3 is among the reasons why the results of the combination may seem to differ from those of the individual published analyses. In addition, the combined analysis takes into account correlations among several sources of systematic uncertainty. Care is taken to understand the post-fit behaviour of the parameters that are correlated between analyses, both in terms of the post-fit parameter values and uncertainties. Finally, the combination is evaluated at a value of \(m_\mathrm{{H}}\) that is not the value that was used in some of the individual published analyses, entailing changes to the expected production cross sections and branching fractions of the SM Higgs boson. Changes are sizeable in some cases:

  • In Refs. [16, 22] the results for \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \rightarrow 4\ell \) and \(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \rightarrow \ell \nu \ell \nu \) are evaluated for \(m_\mathrm{{H}} =125.6\,\text {GeV} \), the mass measured in the \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \rightarrow 4\ell \) analysis. In the present combination, the results are evaluated for \(m_\mathrm{{H}} =125.0\,\text {GeV} \), the mass measured from the combined analysis of the \(\mathrm{H} \rightarrow \gamma \gamma \) and \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \rightarrow 4\ell \) measurements, presented in Sect. 4.1. For values of \(m_\mathrm{{H}}\) in this region, the branching fractions for \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \) and \(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \) vary rapidly with \(m_\mathrm{{H}}\). For the change of \(m_\mathrm{{H}}\) in question, \(\mathcal {B}(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} ,m_\mathrm{{H}} =125.0\,\text {GeV} )/\mathcal {B}(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} ,m_\mathrm{{H}} =125.6\,\text {GeV})=0.95\) and \(\mathcal {B}(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} ,m_\mathrm{{H}} =125.0\,\text {GeV} )/\mathcal {B}(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} ,m_\mathrm{{H}} =125.6\,\text {GeV})=0.96\) [171].

  • The expected production cross sections for the SM Higgs boson depend on \(m_\mathrm{{H}}\). For the change in \(m_\mathrm{{H}}\) discussed above, the total production cross sections for 7 and 8\(\,\text {TeV}\) collisions vary similarly: \(\sigma _\text {tot}(m_\mathrm{{H}} =125.0\,\text {GeV} )/\sigma _\text {tot}(m_\mathrm{{H}} =125.6\,\text {GeV})\sim 1.01\). While the variation of the total production cross section is dominated by the \(\mathrm{g} \mathrm{g} \mathrm{H} \) production process, the variation is about 1.005 for \(\mathrm{VBF}\), around 1.016 for \(\mathrm{V}\mathrm{H} \), and around 1.014 for \(\mathrm{t}\mathrm{t}\mathrm{H} \)  [171].

  • The \(\mathrm{H} \rightarrow \tau \tau \) analysis of Ref. [23] focused on exploring the coupling of the Higgs boson to the tau lepton. For this reason nearly all results in Ref. [23] were obtained by treating the \(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \) contribution as a background, set to the SM expectation. In the present combined analysis, both the \(\mathrm{H} \rightarrow \tau \tau \) and \(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \) contributions are considered as signal in the \(\tau \tau \) decay tag analysis. This treatment leads to an increased sensitivity to the presence of a Higgs boson that decays into both \(\tau \tau \) and \(\mathrm{W}\mathrm{W}\).

  • The search for invisible Higgs decays of Ref. [28] includes a modest contribution to the sensitivity from the analysis targeting \(\mathrm{Z}\mathrm{H} \) production with \(\mathrm{Z}\rightarrow \mathrm{b} \mathrm{b} \) decays. The events selected by that analysis overlap with those of the analysis of \(\mathrm{V}\mathrm{H} \) production with \(\mathrm{H} \rightarrow \mathrm{b} \mathrm{b} \) decays, and are therefore not considered in this combination. Given the limited sensitivity of that search, the overall sensitivity to invisible decays is not significantly impacted.

  • The contribution from the \(\mathrm{g} \mathrm{g} \rightarrow \mathrm{Z}\mathrm{H} \) process was not included in Ref. [21] as calculations for the cross section as a function of \(p_{\mathrm{T}} (\mathrm{Z})\) were not available. Since then, the search for \(\mathrm{V}\mathrm{H} \) production with \(\mathrm{H} \rightarrow \mathrm{b} \mathrm{b} \) has been augmented by the use of recent NLO calculations for the \(\mathrm{g} \mathrm{g} \rightarrow \mathrm{Z}\mathrm{H} \) contribution [176179]. In the \(\mathrm{Z}(\nu \nu )\mathrm{H} (\mathrm{b} \mathrm{b} )\) and \(\mathrm{Z}(\ell \ell )\mathrm{H} (\mathrm{b} \mathrm{b} )\) channels, the addition of this process leads to an increase of the expected signal yields by 10 % to 30 % for \(p_{\mathrm{T}} (\mathrm{Z})\) around and above 150\(\,\text {GeV}\). When combined with the unchanged \(\mathrm{W}\mathrm{H} \) channels, the overall expected sensitivity for \(\mathrm{V}\mathrm{H} \) production with \(\mathrm{H} \rightarrow \mathrm{b} \mathrm{b} \) increases by about 10 %.

In all analyses used, the contribution from associated production of a Higgs boson with a bottom quark pair, \(\mathrm{b}\mathrm{b}\mathrm{H} \), is neglected; in inclusive selections this contribution is much smaller than the uncertainties in the gluon fusion production process, whereas in exclusive categories it has been found that the jets associated with the bottom quarks are so soft that the efficiency to select such events is low enough and no sensitivity is lost. In the future, with more data, it may be possible to devise experimental selections that permit the study of the \(\mathrm{b}\mathrm{b}\mathrm{H} \) production mode as predicted by the SM.

4 Mass measurement and direct limits on the natural width

In this section we first present a measurement of the mass of the new boson from the combined analysis of the high-resolution \(\mathrm{H} \rightarrow \gamma \gamma \) and \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \rightarrow 4\ell \) channels. We then proceed to set direct limits on its natural width.

4.1 Mass of the observed state

Figure 1 shows the 68 % CL confidence regions for two parameters of interest, the signal strength relative to the SM expectation, \(\mu =\sigma /\sigma _\text {SM}\), and the mass, \(m_{\mathrm{{H}}} \), obtained from the \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \rightarrow 4\ell \) and \(\gamma \gamma \) channels, which have excellent mass resolution. The combined 68 % CL confidence region, bounded by a black curve in Fig. 1, is calculated assuming the relative event yield between the two channels as predicted by the SM, while the overall signal strength is left as a free parameter.

Fig. 1
figure 1

The 68 % CL confidence regions for the signal strength \(\sigma / \sigma _{\text {SM}}\) versus the mass of the boson \({m}_\mathrm{{H}}\) for the \(\mathrm{H} \rightarrow \gamma \gamma \) and \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \rightarrow 4\ell \) final states, and their combination. The symbol \(\sigma / \sigma _{\text {SM}}\) denotes the production cross section times the relevant branching fractions, relative to the SM expectation. In this combination, the relative signal strength for the two decay modes is set to the expectation for the SM Higgs boson

Fig. 2
figure 2

(Left) Scan of the test statistic \(q(m_{\mathrm{{H}}})=-2 \Delta \ln \mathcal {L} \) versus the mass of the boson \(m_{\mathrm{{H}}} \) for the \(\mathrm{H} \rightarrow \gamma \gamma \) and \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \rightarrow 4\ell \) final states separately and for their combination. Three independent signal strengths, \((\mathrm{g} \mathrm{g} \mathrm{H} ,\mathrm{t}\mathrm{t}\mathrm{H} )\rightarrow \gamma \gamma \), \((\mathrm{VBF},\mathrm{V}\mathrm{H} )\rightarrow \gamma \gamma \), and \(\text {pp}\rightarrow \mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \rightarrow 4\ell \), are profiled together with all other nuisance parameters. (Right) Scan of the test statistic \(q(m_\mathrm{{H}}^{\gamma \gamma } - m_\mathrm{{H}}^{4\ell })\) versus the difference between two individual mass measurements for the same model of signal strengths used in the left panel

To extract the value of \(m_{\mathrm{{H}}}\) in a way that is not completely dependent on the SM prediction for the production and decay ratios, the signal strength modifiers for the \((\mathrm{g} \mathrm{g} \mathrm{H} ,\mathrm{t}\mathrm{t}\mathrm{H} )\rightarrow \gamma \gamma \), \((\mathrm{VBF},\mathrm{V}\mathrm{H} )\rightarrow \gamma \gamma \), and \(\text {pp}\rightarrow \mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \rightarrow 4\ell \) processes are taken as independent, unconstrained, parameters. The signal in all channels is assumed to be due to a single state with mass \(m_{\mathrm{{H}}} \). The best-fit value of \(m_{\mathrm{{H}}} \) and its uncertainty are extracted from a scan of the combined test statistic \(q(m_{\mathrm{{H}}})\) with the three signal strength modifiers profiled together with all other nuisance parameters; i.e. the signal strength modifiers float freely in the fits performed to scan \(q(m_{\mathrm{{H}}})\). Figure 2 (left) shows the scan of the test statistic as a function of the mass \(m_{\mathrm{{H}}} \) separately for the \(\mathrm{H} \rightarrow \gamma \gamma \) and \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \rightarrow 4\ell \) channels, and for their combination. The intersections of the \(q(m_{\mathrm{{H}}})\) curves with the thick horizontal line at 1.00 and thin line at 3.84 define the 68 % and 95 % CL confidence intervals for the mass of the observed particle, respectively. These intervals include both the statistical and systematic uncertainties. The mass is measured to be \(m_{\mathrm{{H}}} = 125.02 ^{+0.29}_{-0.31} \,\text {GeV} \). The less precise evaluations from the \(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \) analysis [22], \(m_{\mathrm{{H}}} =128^{+7}_{-5}\,\text {GeV} \), and from the \(\mathrm{H} \rightarrow \tau \tau \) analysis [23], \(m_{\mathrm{{H}}} =122\pm 7\,\text {GeV} \), are compatible with this result.

To evaluate the statistical component of the overall uncertainty, we also perform a scan of \(q(m_{\mathrm{{H}}})\) fixing all nuisance parameters to their best-fit values, except those related to the \(\mathrm{H} \rightarrow \gamma \gamma \) background models; given that the \(\mathrm{H} \rightarrow \gamma \gamma \) background distributions are modelled from fits to data, their degrees of freedom encode fluctuations which are statistical in nature. The result is shown by the dashed curve in Fig. 2 (left). The crossings of the dashed curve with the thick horizontal line define the 68 % CL confidence interval for the statistical uncertainty in the mass measurement: \(^{+0.26}_{-0.27}\) \(\,\text {GeV}\). We derive the systematic uncertainty assuming that the total uncertainty is the sum in quadrature of the statistical and systematic components; the full result is \(m_{\mathrm{{H}}} = 125.02\,^{+0.26}_{-0.27} \,\text {(stat)} \,^{+0.14}_{-0.15} \,\text {(syst)} \,\text {GeV} \). The median expected uncertainty is evaluated using an Asimov pseudo-data sample [182] constructed from the best-fit values obtained when testing for the compatibility of the mass measurement in the \(\mathrm{H} \rightarrow \gamma \gamma \) and \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \rightarrow 4\ell \) channels. The expected uncertainty thus derived is \(\,^{+0.26}_{-0.25}\,\text {(stat)} \,\pm 0.14\,\text {(syst)} \,\text {GeV} \), in good agreement with the observation in data. As a comparison, the median expected uncertainty is also derived by constructing an Asimov pseudo-data sample as above except that the signal strength modifiers are set to unity (as expected in the SM) and \(m_\mathrm{{H}}^{\gamma \gamma }=m_\mathrm{{H}}^{4\ell }=125\,\text {GeV} \), leading to an expected uncertainty of \(\,\pm 0.28\,\text {(stat)} \,\pm 0.13\,\text {(syst)} \,\text {GeV} \). As could be anticipated, the statistical uncertainty is slightly larger given that the observed signal strength in the \(\mathrm{H} \rightarrow \gamma \gamma \) channel is larger than unity, and the systematic uncertainty is slightly smaller given the small mass difference between the two channels that is observed in data.

To quantify the compatibility of the \(\mathrm{H} \rightarrow \gamma \gamma \) and \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \) mass measurements with each other, we perform a scan of the test statistic \(q(m_\mathrm{{H}}^{\gamma \gamma } - m_\mathrm{{H}}^{4\ell })\), as a function of the difference between the two mass measurements. Besides the three signal strength modifiers, there are two additional parameters in this test: the mass difference and \(m_\mathrm{{H}}^{\gamma \gamma }\). In the scan, the three signal strengths and \(m_\mathrm{{H}}^{\gamma \gamma }\) are profiled together with all nuisance parameters. The result from the scan shown in Fig. 2 (right) is \(m_\mathrm{{H}}^{\gamma \gamma } - m_\mathrm{{H}}^{4\ell }=-0.89^{+0.56}_{-0.57}\,\text {GeV} \). From evaluating \(q(m_\mathrm{{H}}^{\gamma \gamma } - m_\mathrm{{H}}^{4\ell }=0)\) it can be concluded that the mass measurements in \(\mathrm{H} \rightarrow \gamma \gamma \) and \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \rightarrow 4\ell \) agree at the \(1.6\sigma \) level.

To assess the dependency of the result on the SM Higgs boson hypothesis, the measurement of the mass is repeated using the same channels, but with the following two sets of assumptions: (i) allowing a common signal strength modifier to float, which corresponds to the result in Fig. 1, and (ii) constraining the relative production cross sections and branching fractions to the SM predictions, i.e. \(\mu =1\). The results from these two alternative measurements differ by less than 0.1\(\,\text {GeV}\) from the main result, both in terms of the best-fit value and the uncertainties.

4.2 Direct limits on the width of the observed state

For \(m_\mathrm{{H}} \sim 125\,\text {GeV} \) the SM Higgs boson is predicted to be narrow, with a total width \(\varGamma _{\mathrm{SM}} \sim 4\,\text {MeV} \). From the study of off-shell Higgs boson production, CMS has previously set an indirect limit on the total width, \(\varGamma _{\text {tot}}/\varGamma _{\mathrm{SM}} < 5.4\) \((8.0)\) observed (expected) at the 95 % CL [27]. While that result is about two orders of magnitude better than the experimental mass resolution, it relies on assumptions on the underlying theory, such as the absence of contributions to Higgs boson off-shell production from particles beyond the standard model. In contrast, a direct limit does not rely on such assumptions and is only limited by the experimental resolution.

The best experimental mass resolution, achieved in the \(\mathrm{H} \rightarrow \gamma \gamma \) and \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \rightarrow 4\ell \) analyses, is typically between 1\(\,\text {GeV}\) and 3\(\,\text {GeV}\), as shown in Table 1. The resolution depends on the energy, rapidity, and azimuthal angle of the decay products, and on the flavour of the leptons in the case of the \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \rightarrow 4\ell \) decay. If found inconsistent with the expected detector resolution, the total width measured in data could suggest the production of a resonance with a greater intrinsic width or the production of two quasi-degenerate states.

To perform this measurement the signal models in the \(\mathrm{H} \rightarrow \gamma \gamma \) and \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \rightarrow 4\ell \) analyses allow for a natural width using the relativistic Breit–Wigner distribution, as described in Refs. [16, 18]. Figure 3 shows the likelihood scan as a function of the assumed natural width. The mass of the boson and a common signal strength are profiled along with all other nuisance parameters. The dashed lines show the expected results for the SM Higgs boson. For the \(\mathrm{H} \rightarrow \gamma \gamma \) channel the observed (expected) upper limit at the 95 % CL is 2.4 (3.1)\(\,\text {GeV}\). For the \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \rightarrow 4\ell \) channel the observed (expected) upper limit at the 95 % CL is 3.4 (2.8)\(\,\text {GeV}\). For the combination of the two analyses, the observed (expected) upper limit at the 95 % CL is 1.7 (2.3)\(\,\text {GeV}\).

Fig. 3
figure 3

Likelihood scan as a function of the width of the boson. The continuous (dashed) lines show the observed (expected) results for the \(\mathrm{H} \rightarrow \gamma \gamma \) analysis, the \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \rightarrow 4\ell \) analysis, and their combination. The data are consistent with \(\varGamma _{\mathrm{SM}} \sim 4\,\text {MeV} \) and for the combination of the two channels the observed (expected) upper limit on the width at the 95 % CL is 1.7 (2.3)\(\,\text {GeV}\)

5 Significance of the observations in data

This section provides an assessment of the significance of the observed excesses at the best-fit mass value, \(m_\mathrm{{H}} =125.0\,\text {GeV} \).

Table 2 summarizes the median expected and observed local significance for a SM Higgs boson mass of \(125.0\,\text {GeV} \) from the different decay mode tags, grouped as described in Sect. 3.3. The value of \(m_{\mathrm{{H}}}\) is fixed to the best-fit combined measurement presented in Sect. 4.1. The values of the expected significance are evaluated using the post-fit expected background rates and the signal rates expected from the SM. In the three diboson decay mode tags, the significance is close to, or above, \(5\sigma \). In the \(\tau \tau \) decay mode tag the significance is above \(3\sigma \).

Table 2 The observed and median expected significances of the excesses for each decay mode group, assuming \(m_{\mathrm{{H}}} =125.0\,\text {GeV} \). The channels are grouped by decay mode tag as described in Sect. 3.3; when there is a difference in the channels included with respect to the published results for the individual channels, the result for the grouping used in those publications is also given

Differences between the results in Table 2 and the individual publications are understood in terms of the discussion in Sects. 3.3 and 3.4, namely the grouping of channels by decay mode tag, the change of the \(m_\mathrm{{H}}\) value at which the significance of the \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \rightarrow 4\ell \) and \(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \) analyses is evaluated, and the treatment of \(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \) as part of the signal, instead of background, in the \(\mathrm{H} \rightarrow \tau \tau \) analysis.

Finally, the observation of the \(\mathrm{H} \rightarrow \gamma \gamma \) and \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \rightarrow 4\ell \) decay modes indicates that the new particle is a boson, and the diphoton decay implies that its spin is different from unity [190, 191]. Other observations, beyond the scope of this paper, disfavour spin-1 and spin-2 hypotheses and, assuming that the boson has zero spin, are consistent with the pure scalar hypothesis, while disfavouring the pure pseudoscalar hypothesis [16, 22, 41].

6 Compatibility of the observed yields with the SM Higgs boson hypothesis

The results presented in this section focus on the Higgs boson production and decay modes, which can be factorized under the narrow-width approximation, leading to \(N_{ij}\sim \sigma _{i}\;\mathcal {B}_{j}\), where \(N_{ij}\) represents the event yield for the combination of production mode \(i\) and decay mode \(j\), \(\sigma _{i}\) is the production cross section for production process \(i\), and \(\mathcal {B}_{j}\) is the branching fraction into decay mode \(j\). Studies where the production and decay modes are interpreted in terms of underlying couplings of particles to the Higgs boson are presented in Sect. 7.

The size of the current data set permits many compatibility tests between the observed excesses and the expected SM Higgs boson signal. These compatibility tests do not constitute measurements of any physics parameters per se, but rather allow one to probe for deviations of the various observations from the SM expectations. The tests evaluate the compatibility of the data observed in the different channels with the expectations for the SM Higgs boson with a mass equal to the best-fit value found in Sect. 4.1, \(m_{\mathrm{{H}}} =125.0\,\text {GeV} \).

This section is organized by increasing degree of complexity of the deviations being probed. In Sect. 6.1 we assess the compatibility of the overall signal strength for all channels combined with the SM Higgs hypothesis. In Sect. 6.2 the compatibility is assessed by production tag group, decay tag group, and production and decay tag group. We then turn to the study of production modes. Using the detailed information on the expected SM Higgs production contributions, Sect. 6.3 discusses, for each decay tag group, the results of considering two signal strengths, one scaling the \(\mathrm{g} \mathrm{g} \mathrm{H} \) and \(\mathrm{t}\mathrm{t}\mathrm{H} \) contributions, and the other scaling the \(\mathrm{VBF}\) and \(\mathrm{V}\mathrm{H} \) contributions. Then, assuming the expected relative SM Higgs branching fractions, Sect. 6.4 provides a combined analysis for signal strengths scaling the \(\mathrm{g} \mathrm{g} \mathrm{H} \), \(\mathrm{VBF}\), \(\mathrm{V}\mathrm{H} \), and \(\mathrm{t}\mathrm{t}\mathrm{H} \) contributions individually. Turning to the decay modes, Sect. 6.5 performs combined analyses of signal strength ratios between different decay modes, where some uncertainties from theory and some experimental uncertainties cancel out. Finally, using the structure of the matrix of production and decay mode signal strengths, Sect. 6.6 tests for the possibility that the observations are due to the presence of more than one state degenerate in mass.

6.1 Overall signal strength

The best-fit value for the common signal strength modifier \(\hat{\mu }= \hat{\sigma }/ \sigma _{\text {SM}}\), obtained from the combined analysis of all channels, provides the simplest compatibility test. In the formal fit, \(\hat{\mu }\) is allowed to become negative if the observed number of events is smaller than the expected yield for the background-only hypothesis. The observed \(\hat{\mu }\), assuming \(m_\mathrm{{H}} =125.0\,\text {GeV} \), is \(1.00 ^{+0.14}_{-0.13}\), consistent with unity, the expectation for the SM Higgs boson. This value is shown as the vertical bands in the three panels of Fig. 4.

The total uncertainty can be broken down into a statistical component (stat); a component associated with the uncertainties related to renormalization and factorization scale variations, parton distribution functions, branching fractions, and underlying event description (theo); and any other systematic uncertainties (syst). The result is \(1.00\,\pm 0.09\,\text {(stat)} \,^{+0.08}_{-0.07}\,\text {(theo)} \,\pm 0.07\,\text {(syst)} \). Evolution of the SM predictions may not only reduce the associated uncertainties from theory, but also change the central value given above.

6.2 Grouping by predominant decay mode and/or production tag

One step in going beyond a single signal strength modifier is to evaluate the signal strength in groups of channels from different analyses. The groups chosen reflect the different production tags, predominant decay modes, or both. Once the fits for each group are performed, a simultaneous fit to all groups is also performed to assess the compatibility of the results with the SM Higgs boson hypothesis.

Figure 4 shows the \(\hat{\mu }\) values obtained in different independent combinations of channels for \(m_\mathrm{{H}} = 125.0\,\text {GeV} \), grouped by additional tags targeting events from particular production mechanisms, by predominant decay mode, or both. As discussed in Sect. 3.3, the expected purities of the different tagged samples vary substantially. Therefore, these plots cannot be interpreted as compatibility tests for pure production mechanisms or decay modes, which are studied in Sect. 6.4.

For each type of grouping, the level of compatibility with the SM Higgs boson cross section can be quantified by the value of the test statistic function of the signal strength parameters simultaneously fitted for the \(N\) channels considered in the group, \(\mu _1, \mu _2,\ldots ,\mu _N\),

$$\begin{aligned} q_{\mu } = -2 \Delta \ln \mathcal {L} = -2 \ln \frac{ \mathcal {L}(\text {data} \, | \, \mu _{i}, \hat{\theta }_{\mu _{i}}) }{\mathcal {L}(\text {data} \, | \, \hat{\mu }_{i}, \hat{\theta })} \end{aligned}$$
(5)

evaluated for \(\mu _1 = \mu _2 =\cdots =\mu _N = 1\). For each type of grouping, the corresponding \(q_{\mu }(\mu _1=\mu _2=\cdots =\mu _N=1)\) from the simultaneous fit of \(N\) signal strength parameters is expected to behave asymptotically as a \(\chi ^{2}\) distribution with \(N\) degrees of freedom (dof).

The results for the four independent combinations grouped by production mode tag are depicted in Fig. 4 (top left). An excess can be seen for the \(\mathrm{t}\mathrm{t}\mathrm{H} \)-tagged combination, due to the observations in the \(\mathrm{t}\mathrm{t}\mathrm{H} \)-tagged \(\mathrm{H} \rightarrow \gamma \gamma \) and \(\mathrm{H} \rightarrow \text {leptons}\) analyses that can be appreciated from the bottom panel. The simultaneous fit of the signal strengths for each group of production process tags results in \(\chi ^{2}/\text {dof} = 5.5 / 4 \) and an asymptotic \(p\text {-value}\) of \(0.24\), driven by the excess observed in the group of analyses tagging the \(\mathrm{t}\mathrm{t}\mathrm{H} \) production process.

The results for the five independent combinations grouped by predominant decay mode are shown in Fig. 4 (top right). The simultaneous fit of the corresponding five signal strengths yields \(\chi ^{2}/\text {dof} = 1.0 / 5 \) and an asymptotic \(p\text {-value}\) of \(0.96\).

The results for sixteen individual combinations grouped by production tag and predominant decay mode are shown in Fig. 4 (bottom). The simultaneous fit of the corresponding signal strengths gives a \(\chi ^{2}/\text {dof} = 10.5 / 16 \), which corresponds to an asymptotic \(p\text {-value}\) of \(0.84\).

The \(p\text {-values}\) above indicate that these different ways of splitting the overall signal strength into groups related to the production mode tag, decay mode tag, or both, all yield results compatible with the SM prediction for the Higgs boson, \(\mu =\mu _i=1\). The result of the \(\mathrm{t}\mathrm{t}\mathrm{H} \)-tagged combination is compatible with the SM hypothesis at the \(2.0\sigma \) level.

Fig. 4
figure 4

Values of the best-fit \(\sigma / \sigma _\text {SM}\) for the overall combined analysis (solid vertical line) and separate combinations grouped by production mode tag, predominant decay mode, or both. The \(\sigma / \sigma _{\text {SM}}\) ratio denotes the production cross section times the relevant branching fractions, relative to the SM expectation. The vertical band shows the overall \(\sigma / \sigma _\text {SM}\) uncertainty. The horizontal bars indicate the \(\pm 1 \) standard deviation uncertainties in the best-fit \(\sigma / \sigma _\text {SM}\) values for the individual combinations; these bars include both statistical and systematic uncertainties. (Top left) Combinations grouped by analysis tags targeting individual production mechanisms; the excess in the \(\mathrm{t}\mathrm{t}\mathrm{H} \)-tagged combination is largely driven by the \(\mathrm{t}\mathrm{t}\mathrm{H} \)-tagged \(\mathrm{H} \rightarrow \gamma \gamma \) and \(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \) channels as can be seen in the bottom panel. (Top right) Combinations grouped by predominant decay mode. (Bottom) Combinations grouped by predominant decay mode and additional tags targeting a particular production mechanism

6.3 Fermion- and boson-mediated production processes and their ratio

The four main Higgs boson production mechanisms can be associated with either couplings of the Higgs boson to fermions (\(\mathrm{g} \mathrm{g} \mathrm{H} \) and \(\mathrm{t}\mathrm{t}\mathrm{H} \)) or vector bosons (\(\mathrm{VBF}\) and \(\mathrm{V}\mathrm{H} \)). Therefore, a combination of channels associated with a particular decay mode tag, but explicitly targeting different production mechanisms, can be used to test the relative strengths of the couplings to the vector bosons and fermions, mainly the top quark, given its importance in \(\mathrm{g} \mathrm{g} \mathrm{H} \) production. The categorization of the different channels into production mode tags is not pure. Contributions from the different signal processes, evaluated from Monte Carlo simulation and shown in Table 1, are taken into account in the fits, including theory and experimental uncertainties; the factors used to scale the expected contributions from the different production modes are shown in Table 3 and do not depend on the decay mode. For a given decay mode, identical deviations of \(\mu _{\mathrm{VBF},\mathrm{V}\mathrm{H} }\) and \(\mu _\mathrm{{g} \mathrm{g} \mathrm{H} ,\mathrm{t}\mathrm{t}\mathrm{H} }\) from unity may also be due to a departure of the decay partial width from the SM expectation.

Table 3 Parameterization used to scale the expected SM Higgs boson yields from the different production modes when obtaining the results presented in Table 5 and Fig. 5 (left). The signal strength modifiers \(\mu _\mathrm{{g} \mathrm{g} \mathrm{H} ,\mathrm{t}\mathrm{t}\mathrm{H} }\) and \(\mu _{\mathrm{VBF},\mathrm{V}\mathrm{H} }\), common to all decay modes, are associated with the \(\mathrm{g} \mathrm{g} \mathrm{H} \) and \(\mathrm{t}\mathrm{t}\mathrm{H} \) and with the \(\mathrm{VBF}\) and \(\mathrm{V}\mathrm{H} \) production mechanisms, respectively
Fig. 5
figure 5

(Left) The 68 % CL confidence regions (bounded by the solid curves) for the signal strength of the \(\mathrm{g} \mathrm{g} \mathrm{H} \) and \(\mathrm{t}\mathrm{t}\mathrm{H} \) and of the \(\mathrm{VBF}\) and \(\mathrm{V}\mathrm{H} \) production mechanisms, \(\mu _\mathrm{{g} \mathrm{g} \mathrm{H} ,\mathrm{t}\mathrm{t}\mathrm{H} }\) and \(\mu _{\mathrm{VBF},\mathrm{V}\mathrm{H} }\), respectively. The crosses indicate the best-fit values obtained in each group of predominant decay modes: \(\gamma \gamma \), \(\mathrm{Z}\mathrm{Z}\), \(\mathrm{W}\mathrm{W}\), \(\tau \tau \), and \(\mathrm{b} \mathrm{b} \). The diamond at \((1,1)\) indicates the expected values for the SM Higgs boson. (Right) Likelihood scan versus the ratio \(\mu _{\mathrm{VBF},\mathrm{V}\mathrm{H} }/\mu _\mathrm{{g} \mathrm{g} \mathrm{H} ,\mathrm{t}\mathrm{t}\mathrm{H} } \), combined for all channels. The fit for \(\mu _{\mathrm{VBF},\mathrm{V}\mathrm{H} }/\mu _\mathrm{{g} \mathrm{g} \mathrm{H} ,\mathrm{t}\mathrm{t}\mathrm{H} } \) is performed while profiling the five \(\mu _\mathrm{{g} \mathrm{g} \mathrm{H} ,\mathrm{t}\mathrm{t}\mathrm{H} }\) parameters, one per visible decay mode, as shown in Table 4. The solid curve represents the observed result in data while the dashed curve indicates the expected median result in the presence of the SM Higgs boson. Crossings with the horizontal thick and thin lines denote the 68 % CL and 95 % CL confidence intervals, respectively

Figure 5 (left) shows the 68 % CL confidence regions for the signal strength modifiers associated with the \(\mathrm{g} \mathrm{g} \mathrm{H} \) and \(\mathrm{t}\mathrm{t}\mathrm{H} \) and with the \(\mathrm{VBF}\) and \(\mathrm{V}\mathrm{H} \) production mechanisms, \(\mu _\mathrm{{g} \mathrm{g} \mathrm{H} ,\mathrm{t}\mathrm{t}\mathrm{H} }\) and \(\mu _{\mathrm{VBF},\mathrm{V}\mathrm{H} }\), respectively. The five sets of contours correspond to the five predominant decay mode groups, introduced in Sect. 3.3. It can be seen in Fig. 5 (left) how the analyses in the \(\mathrm{H} \rightarrow \mathrm{b} \mathrm{b} \) decay group constrain \(\mu _{\mathrm{VBF},\mathrm{V}\mathrm{H} }\) more than \(\mu _\mathrm{{g} \mathrm{g} \mathrm{H} ,\mathrm{t}\mathrm{t}\mathrm{H} }\), reflecting the larger sensitivity of the analysis of \(\mathrm{V}\mathrm{H} \) production with \(\mathrm{H} \rightarrow \mathrm{b} \mathrm{b} \) with respect to the analysis of \(\mathrm{t}\mathrm{t}\mathrm{H} \) production with \(\mathrm{H} \rightarrow \mathrm{b} \mathrm{b} \). An almost complementary situation can be found for the \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \) analysis, where the data constrain \(\mu _\mathrm{{g} \mathrm{g} \mathrm{H} ,\mathrm{t}\mathrm{t}\mathrm{H} }\) better than \(\mu _{\mathrm{VBF},\mathrm{V}\mathrm{H} }\), reflecting the fact that the analysis is more sensitive to \(\mathrm{g} \mathrm{g} \mathrm{H} \), the most abundant production mode. The SM Higgs boson expectation of \((1,1)\) is within the 68 % CL confidence regions for all predominant decay groups. The best-fit values for each decay tag group are given in Table 5.

The ratio of \(\mu _{\mathrm{VBF},\mathrm{V}\mathrm{H} }\) and \(\mu _\mathrm{{g} \mathrm{g} \mathrm{H} ,\mathrm{t}\mathrm{t}\mathrm{H} }\) provides a compatibility check with the SM Higgs boson expectation that can be combined across all decay modes. To perform the measurement of \(\mu _{\mathrm{VBF},\mathrm{V}\mathrm{H} }/\mu _\mathrm{{g} \mathrm{g} \mathrm{H} ,\mathrm{t}\mathrm{t}\mathrm{H} } \), the SM Higgs boson signal yields in the different production processes and decay modes are parameterized according to the scaling factors presented in Table 4. The fit is performed simultaneously in all channels of all analyses and takes into account, within each channel, the full detail of the expected SM Higgs contributions from the different production processes and decay modes.

Table 4 Parameterization used to scale the expected SM Higgs boson yields for the different production processes and decay modes when obtaining the \(\mu _{\mathrm{VBF},\mathrm{V}\mathrm{H} }/\mu _\mathrm{{g} \mathrm{g} \mathrm{H} ,\mathrm{t}\mathrm{t}\mathrm{H} } \) results presented in Table 5 and Fig. 5 (right)

Figure 5 (right) shows the likelihood scan of the data for \(\mu _{\mathrm{VBF},\mathrm{V}\mathrm{H} }/\mu _\mathrm{{g} \mathrm{g} \mathrm{H} ,\mathrm{t}\mathrm{t}\mathrm{H} } \), while the bottom part of Table 5 shows the corresponding values; the best-fit \(\mu _{\mathrm{VBF},\mathrm{V}\mathrm{H} }/\mu _\mathrm{{g} \mathrm{g} \mathrm{H} ,\mathrm{t}\mathrm{t}\mathrm{H} } \) is observed to be \(1.25 ^{+0.62}_{-0.44}\), compatible with the expectation for the SM Higgs boson, \(\mu _{\mathrm{VBF},\mathrm{V}\mathrm{H} }/\mu _\mathrm{{g} \mathrm{g} \mathrm{H} ,\mathrm{t}\mathrm{t}\mathrm{H} } =1\).

Table 5 The best-fit values for the signal strength of the \(\mathrm{VBF}\) and \(\mathrm{V}\mathrm{H} \) and of the \(\mathrm{g} \mathrm{g} \mathrm{H} \) and \(\mathrm{t}\mathrm{t}\mathrm{H} \) production mechanisms, \(\mu _{\mathrm{VBF},\mathrm{V}\mathrm{H} } \) and \(\mu _\mathrm{{g} \mathrm{g} \mathrm{H} ,\mathrm{t}\mathrm{t}\mathrm{H} } \), respectively, for \(m_{\mathrm{{H}}} =125.0\,\text {GeV} \). The channels are grouped by decay mode tag as described in Sect. 3.3. The observed and median expected results for the ratio of \(\mu _{\mathrm{VBF},\mathrm{V}\mathrm{H} }\) to \(\mu _\mathrm{{g} \mathrm{g} \mathrm{H} ,\mathrm{t}\mathrm{t}\mathrm{H} }\) together with their uncertainties are also given for the full combination. In the full combination, \(\mu _{\mathrm{VBF},\mathrm{V}\mathrm{H} }/\mu _\mathrm{{g} \mathrm{g} \mathrm{H} ,\mathrm{t}\mathrm{t}\mathrm{H} } \) is determined while profiling the five \(\mu _\mathrm{{g} \mathrm{g} \mathrm{H} ,\mathrm{t}\mathrm{t}\mathrm{H} }\) parameters, one per decay mode, as shown in Table 4

6.4 Individual production modes

While the production modes can be grouped by the type of interaction involved in the production of the SM Higgs boson, as done in Sect. 6.3, the data set and analyses available allow us to explore signal strength modifiers for different production modes, \(\mu _\mathrm{{g} \mathrm{g} \mathrm{H} }\), \(\mu _{\mathrm{VBF}}\), \(\mu _{\mathrm{V}\mathrm{H} }\), and \(\mu _\mathrm{{t}\mathrm{t}\mathrm{H} }\). These scaling factors are applied to the expected signal contributions from the SM Higgs boson according to their production mode, as shown in Table 6. It is assumed that the relative values of the branching fractions are those expected for the SM Higgs boson. This assumption is relaxed, in different ways, in Sects. 6.5 and 6.6.

Table 6 Parameterization used to scale the expected SM Higgs boson yields of the different production and decay modes when obtaining the results presented in Fig. 6

Figure 6 summarizes the results of likelihood scans for the four parameters of interest described in Table 6 in terms of the 68 % CL (inner) and 95 % CL (outer) confidence intervals. When scanning the likelihood of the data as a function of one parameter, the other parameters are profiled.

Fig. 6
figure 6

Likelihood scan results for \(\mu _\mathrm{{g} \mathrm{g} \mathrm{H} }\), \(\mu _{\mathrm{VBF}}\), \(\mu _{\mathrm{V}\mathrm{H} }\), and \(\mu _\mathrm{{t}\mathrm{t}\mathrm{H} }\). The inner bars represent the 68 % CL confidence intervals while the outer bars represent the 95 % CL confidence intervals. When scanning each individual parameter, the three other parameters are profiled. The SM values of the relative branching fractions are assumed for the different decay modes

Table 7 shows the best-fit results for the 7\(\,\text {TeV}\) and 8\(\,\text {TeV}\) data sets separately, as well as for the full combined analysis. Based on the combined likelihood ratio values for each parameter, Table 7 also shows the observed significance, the expected significance, and the pull of the results with respect to the SM hypothesis. The observed significance is derived from the observed likelihood ratio for the background-only hypothesis, \(\mu _{i}=0\), in data. The expected significance is derived from the likelihood ratio for \(\mu _{i}=0\) obtained using the median expected result for the SM Higgs boson. The pull with respect to the SM hypothesis is derived from the observed likelihood ratio for \(\mu _{i}=1\); by definition, the expected pull with respect to the SM hypothesis is zero.

Table 7 The best-fit results for independent signal strengths scaling the \(\mathrm{g} \mathrm{g} \mathrm{H} \), \(\mathrm{VBF}\), \(\mathrm{V}\mathrm{H} \), and \(\mathrm{t}\mathrm{t}\mathrm{H} \) production processes; the expected and observed significances with respect to the background-only hypothesis, \(\mu _{i}=0\); and the pull of the observation with respect to the SM hypothesis, \(\mu _{i}=1\). The best-fit results are also provided separately for the 7\(\,\text {TeV}\) and 8\(\,\text {TeV}\) data sets, for which the predicted cross sections differ. These results assume that the relative values of the branching fractions are those predicted for the SM Higgs boson
Table 8 Parameterization used to scale the expected SM Higgs boson yields of the different production and decay modes when obtaining the results presented in Table 9. The \(\mu _\mathrm{{g} \mathrm{g} \mathrm{H} ,\mathrm{t}\mathrm{t}\mathrm{H} }\) and \(\mu _{\mathrm{VBF},\mathrm{V}\mathrm{H} }\) parameters are introduced to reduce the dependency of the results on the SM expectation

The \(\mu _\mathrm{{g} \mathrm{g} \mathrm{H} }\) best-fit value is found to be \(0.85 ^{+0.19}_{-0.16}\). After calculating the component of the uncertainty that is statistical in nature (stat) and the component related to the theory inputs (theo), one can subtract them in quadrature from the total uncertainty and assign the remainder as the systematic uncertainty (syst), yielding \(0.85\,^{+0.11}_{-0.09}\,\text {(stat)} \,^{+0.11}_{-0.08}\,\text {(theo)} \,^{+0.10}_{-0.09}\,\text {(syst)} \). Advances in the calculation of the \(\mathrm{g} \mathrm{g} \mathrm{H} \) cross section, e.g. when considering higher-order effects, may not only reduce the uncertainty above, but also shift the central value. The signal strengths for the \(\mathrm{VBF}\) and \(\mathrm{V}\mathrm{H} \) production modes are assessed independently. Individual likelihood scans are performed as a function of \(\mu _{\mathrm{VBF}}\) (or \(\mu _{\mathrm{V}\mathrm{H} }\)), allowing the modifiers associated with the other production processes to float in the fit together with the nuisance parameters. In data, the best-fit result for \(\mu _{\mathrm{VBF}}\) is \(1.16 ^{+0.37}_{-0.34}\), while for \(\mu _{\mathrm{V}\mathrm{H} }\) it is \(0.92 ^{+0.38}_{-0.36}\). For the \(\mathrm{t}\mathrm{t}\mathrm{H} \) production mode, the best-fit value for \(\mu _\mathrm{{t}\mathrm{t}\mathrm{H} }\) is found to be \(2.90 ^{+1.08}_{-0.94}\). The results for \(\mathrm{VBF}\), \(\mathrm{V}\mathrm{H} \), and \(\mathrm{t}\mathrm{t}\mathrm{H} \) are driven by the corresponding tagged categories, while the contribution from \(\mathrm{g} \mathrm{g} \mathrm{H} \) is constrained by the 0-jet and untagged categories.

The results in Table 7 show a clear observation of Higgs bosons produced through gluon fusion, and evidence for the production of Higgs bosons through vector boson fusion, for which both the expected and observed significances are above the \(3\sigma \) level. For \(\mathrm{V}\mathrm{H} \) production, the expected significance is \(2.9 \sigma \) and the observed significance is \(2.7 \sigma \). The large best-fit value for \(\mu _\mathrm{{t}\mathrm{t}\mathrm{H} }\) is compatible with the results presented and discussed in Sect. 6.2; the data are compatible with the \(\mu _\mathrm{{t}\mathrm{t}\mathrm{H} } =1\) hypothesis at the \(2.2\sigma \) level. Because of the different parameterizations used, this significance is not exactly the same as that found in Sect. 6.2 when considering the combination of \(\mathrm{t}\mathrm{t}\mathrm{H} \)-tagged categories.

Table 9 The best-fit results and 68 % CL confidence intervals for signal strength ratios of the decay mode in each column and the decay mode in each row, as modelled by the parameterization in Table 8. When the likelihood of the data is scanned as a function of each individual parameter, the three other parameters in the same row, as well the production cross sections modifiers \(\mu _\mathrm{{g} \mathrm{g} \mathrm{H} ,\mathrm{t}\mathrm{t}\mathrm{H} }\) and \(\mu _{\mathrm{VBF},\mathrm{V}\mathrm{H} }\), are profiled. Since each row corresponds to an independent fit to data, the relation \(\lambda _{yy,xx}=1/\lambda _{xx,yy}\) is only approximately satisfied
Table 10 A completely general signal parameterization used to scale the expected yields of the \(5\times 4\) different production and decay modes. The particular choice of parameters is such that the single-particle parameterization shown in Table 11 is a nested model, i.e. it can be obtained by assuming \(\lambda _{i}^{j}=\lambda _{i}\), where \(i\) runs through the production processes except \(\mathrm{g} \mathrm{g} \mathrm{H} \) and \(j\) runs through the decay modes. The expectation for the SM Higgs boson is \(\lambda _{i}^{j}=\mu _{j}=1\). This parameterization is used in the denominator of the test statistic defined in Eq. (6)

6.5 Ratios between decay modes

Some of the largest uncertainties in SM Higgs predictions are related to the production cross sections. In an attempt to evade those uncertainties, it has been proposed [192, 193] to perform measurements of ratios of the signal strengths in different decay modes, \(\lambda _{yy,xx} = \beta _{yy}/\beta _{xx}\), where \(\beta _{xx}=\mathcal {B}(\mathrm{H} \rightarrow xx)/\mathcal {B}(\mathrm{H} \rightarrow xx)_\text {SM}\) and \(\mathcal {B}\) denotes a branching fraction. In such \(\beta _{xx}\) ratios, uncertainties related to the production and decay predictions for the Higgs boson, as well as some experimental uncertainties, may cancel out. On the other hand, the uncertainty in a given ratio will reflect the combined statistical uncertainties of both the \(yy\) and \(xx\) decay modes.

To probe the different \(\lambda _{yy,xx}\), the expected signal yields for the different production and decay modes are scaled by the factors shown in Table 8. To reduce the dependency of the results on the expected structure of the SM Higgs production cross section, the \(\mu _\mathrm{{g} \mathrm{g} \mathrm{H} ,\mathrm{t}\mathrm{t}\mathrm{H} }\) and \(\mu _{\mathrm{VBF},\mathrm{V}\mathrm{H} }\) parameters are introduced and allowed to float independently. Therefore, these measurements only assume the SM ratio of \(\mathrm{g} \mathrm{g} \mathrm{H} \) and \(\mathrm{t}\mathrm{t}\mathrm{H} \) cross sections and the ratio of \(\mathrm{VBF}\) and \(\mathrm{V}\mathrm{H} \) cross sections.

Given the five decay modes that are currently accessible, four ratios can be probed at a time. For example, the choice of the \(\mathrm{H} \rightarrow \gamma \gamma \) decay as denominator, \(xx=\gamma \gamma \), fixes the four ratio parameters to be \(\lambda _\mathrm{{Z}\mathrm{Z},\gamma \gamma }\), \(\lambda _\mathrm{{b}\mathrm{b},\gamma \gamma }\), \(\lambda _{\mathrm{W}\mathrm{W},\gamma \gamma }\), and \(\lambda _{\tau \tau ,\gamma \gamma }\). When scanning the likelihood for the data as a function of a given \(\lambda _{yy,xx}\) ratio, the production cross section modifiers \(\mu _\mathrm{{g} \mathrm{g} \mathrm{H} ,\mathrm{t}\mathrm{t}\mathrm{H} }\) and \(\mu _{\mathrm{VBF},\mathrm{V}\mathrm{H} }\), as well as the other three ratios, are profiled. The best-fit results for each choice of denominator are presented as the different rows in Table 9. While correlated uncertainties from theory and correlated experimental uncertainties may cancel out to some extent in these ratios, each ratio includes the statistical uncertainties from the two decay modes involved. For the available data set and analyses, the resulting statistical uncertainty dominates the total uncertainty. It can be seen that the SM expectation, \(\lambda _{yy,xx}=1\), is inside the 68 % CL confidence interval for all measurements.

Table 11 A general single-state parameterization used to scale the expected yields of the different production and decay modes. For this parameterization the matrix has \({\text {rank}}(\mathcal {M}) =1\) by definition. It can be seen that this parameterization is nested in the general one presented in Table 10, and can be obtained by setting \(\lambda _{i}^{j}=\lambda _{i}\), where \(i\) runs through the production processes except \(\mathrm{g} \mathrm{g} \mathrm{H} \) and \(j\) runs through the decay modes. The expectation for the SM Higgs boson is \(\lambda _{i}=\mu _{j}=1\). This parameterization is used in the numerator of the test statistic defined in Eq. (6)

6.6 Search for mass-degenerate states with different coupling structures

One assumption that is made in Sect. 7 when studying the couplings of the Higgs boson is that the observations are due to the manifestation of a single particle. Alternatively, a superposition of states with indistinguishable mass values is expected in models or theories beyond the SM [194197]. In this section we explore the validity of this assumption.

Taking advantage of the very good mass resolution in the \(\mathrm{H} \rightarrow \gamma \gamma \) analysis, the presence of near mass-degenerate states has been previously probed down to mass differences between 2.5\(\,\text {GeV}\) and 4\(\,\text {GeV}\) without evidence for the presence of a second state [18]. Given the finite mass resolution, such searches are not sensitive to a mixture of states with mass values closer than the resolution itself, such that other reported measurements would integrate the contributions from both states.

In the case of two or more states with masses closer to each other than the experimental resolution, it becomes impossible to discern them using the mass observables. However, the distinction between states can still be made, provided that the states have different coupling structures, i.e. different coupling strengths to the SM particles. Using the measurements of the different production and decay tags, as well as the detailed knowledge of their expected composition in terms of production processes and decay modes, it is possible to test the compatibility of the observations with the expectations from a single state. Several authors discussed this possibility, proposing methods to look for deviations assuming that, in the presence of more than one state, the individual states would couple differently to the SM particles [198, 199].

A general parameterization of the \(5\times 4\) matrix, \(\mathcal {M}\), of signal strengths for the different production processes and decay modes is shown in Table 10. This parameterization has as many degrees of freedom as there are elements in the matrix and is completely general. Depending on whether there is one particle or more particles responsible for the observations in data, the algebraic properties of \(\mathcal {M}\), namely its rank, \({\text {rank}}(\mathcal {M})\), will vary.

If there is only one state it follows that \({\text {rank}}(\mathcal {M}) = 1\), i.e. there should be one common multiplier per row and one common multiplier per column. A general matrix with \({\text {rank}}(\mathcal {M}) =1\) can be parameterized as shown in Table 11. This parameterization can also be obtained by taking the most general \(5\times 4\) parameterization in Table 10 and assuming \(\lambda _{i}^{j}=\lambda _{i}\), where \(i\) runs through the production processes except \(\mathrm{g} \mathrm{g} \mathrm{H} \) and \(j\) runs through the decay modes. Given this relationship, the model for a general matrix with \({\text {rank}}(\mathcal {M}) =1\) presented in Table 11 is nested, in the statistics sense, in the general parameterization of the \(5\times 4\) matrix presented in Table 10.

The expectation for the SM Higgs boson is a particular case of a rank 1 matrix, namely that for which \(\lambda _{i}=\mu _{j}=1\), where \(i\) runs through the production processes except \(\mathrm{g} \mathrm{g} \mathrm{H} \) and \(j\) runs through the decay modes.

If there is more than one particle contributing to the observations, the structure of \(\mathcal {M}\) may be such that \({\text {rank}}(\mathcal {M}) > 1\) as a consequence of the different interaction strengths of the individual, yet mass-degenerate, states.

The procedure to test for the presence of mass-degenerate states proposed in Ref. [200] takes into account both the fact that there may be missing matrix elements and the fact that there are uncertainties in the measurements, including their correlations. A profile likelihood ratio test statistic, \(q_\lambda \), is built using two different models for the structure of \(\mathcal {M}\), namely those presented in Tables 10 and 11,

$$\begin{aligned} q_{\lambda }=-2 \ln \frac{ \mathcal {L}(\text {data}\,|\,\lambda ^{j}_{i}=\hat{\lambda }_{i},\hat{\mu _{j}}) }{\mathcal {L}(\text {data}\,|\,\hat{\lambda }^{j}_{i},\hat{\mu _{j}}^\prime )}. \end{aligned}$$
(6)

The test statistic \(q_{\lambda }\) is a function of the 20 variables defined in Table 10: \(\lambda _i^j\) and \(\mu _j\), where the index \(i\) runs through the \(\mathrm{VBF}\), \(\mathrm{V}\mathrm{H} \), and \(\mathrm{t}\mathrm{t}\mathrm{H} \) production processes and the index \(j\) runs through the decay modes. In this likelihood ratio, the model in Table 10 is taken as the alternative hypothesis and corresponds to the so-called “saturated model” in statistics, as it contains as many degrees of freedom as there are elements in \(\mathcal {M}\). The null hypothesis model is the one presented in Table 11, which parameterizes \(\mathcal {M}\) as a general rank 1 matrix, where all rows are multiples of each other, as expected for a single particle. If the observations are due to a single particle, the \(\lambda _i\) do not depend on the decay mode and the value of the \(q_{\lambda }\) is not very large, since both hypotheses fit the data equally well. However, for a matrix with \({\text {rank}}(\mathcal {M}) \ne 1\), the most general \(5\times 4\) matrix model will fit the data better than the general rank 1 matrix model and the value of \(q_{\lambda }\) is expected to be large.

Fig. 7
figure 7

Distribution of the profile likelihood ratio \(q_\lambda \) between different assumptions for the structure of the matrix of signal strengths for the production processes and decay modes both for pseudo-data samples generated under the SM hypothesis and the value observed in data. The likelihood in the numerator is that for the data under a model of a general rank 1 matrix, expected if the observations are due to a single particle and of which the SM is a particular case. The likelihood in the denominator is that for the data under a “saturated model” with as many parameters as there are matrix elements. The arrow represents the observed value in data, \(q_\lambda ^\text {obs}\). Under the SM hypothesis, the probability to find a value of \(q_\lambda \ge q_\lambda ^\text {obs}\) is \((7.9\pm 0.3)\,\%\), where the uncertainty reflects only the finite number of pseudo-data samples generated

The compatibility of the value of the test statistic observed in data, \(q_\lambda ^\text {obs}\), with the expectation from the SM is evaluated using pseudo-data samples randomly generated under the SM hypothesis. Figure 7 shows the distribution of \(q_\lambda \) for the SM pseudo-data samples as well as the value observed in data, \(q_\lambda ^\text {obs}=12.2 \). Under the SM hypothesis, we find that the probability of observing a value of \(q_\lambda \ge q_\lambda ^\text {obs}\) is \((7.9\pm 0.3)\,\%\), where the uncertainty reflects only the finite number of pseudo-data samples generated. Such a \(p\text {-value}\) corresponds to a deviation from the SM expectation of about \(1.4\sigma \). This small tension, not present in previous tests performed in this section, is due to the observed data in the dijet-tagged channel of the \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \) analysis; performing a fit to a model where the \(\mathrm{VBF}\) and \(\mathrm{V}\mathrm{H} \) production modes are floated separately shows that the data prefer a very large \(\mathrm{V}\mathrm{H} \) contribution and a very small \(\mathrm{VBF}\) contribution. When \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \) analysis inputs are not considered, the \(p\text {-value}\) is found to be about 33 %.

7 Compatibility of the observed data with the SM Higgs boson couplings

Whereas in Sect. 6 the production and decay of the boson were explored separately, the studies presented in this section simultaneously investigate the couplings of the boson to SM particles in the production and decay processes. In this way, correlations are handled consistently between the production modes and the decay modes. For example, the coupling of the SM Higgs boson to the \(\mathrm{Z}\) boson is involved both in the \(\mathrm{Z}\mathrm{H} \) production mode and the \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \) decay mode, such that more information can be extracted from a simultaneous modelling of the production and decay modes in terms of the couplings involved.

Following the framework laid out in Ref. [171], we assume the signal arises from a single particle with \(J^{PC} =0^{++}\) and a width such that the narrow-width approximation holds, permitting its production and decay to be considered independently. These assumptions are supported by the results of Sect. 6.6 on the presence of more particles at the same mass, those of Refs. [40, 41] regarding alternative \(J^{P}\) assignments and mixtures, and those of Ref. [27] concerning the width of the particle.

Under the assumptions above, the event yield in a given (production)\(\times \)(decay) mode is related to the production cross section and the partial and total Higgs boson decay widths via

$$\begin{aligned} \left( \sigma \;\mathcal {B}\right) ( x \rightarrow \mathrm{H} \rightarrow yy) = \frac{\sigma _{ x}\;\varGamma _{ yy}}{\varGamma _{\text {tot}}}, \end{aligned}$$
(7)

where \(\sigma _{ x }\) is the production cross section through process \( x \), which includes \(\mathrm{g} \mathrm{g} \mathrm{H} \), \(\mathrm{VBF}\), \(\mathrm{W}\mathrm{H} \), \(\mathrm{Z}\mathrm{H} \), and \(\mathrm{t}\mathrm{t}\mathrm{H} \); \(\varGamma _{ yy }\) is the partial decay width into the final state \( yy \), such as \(\mathrm{W}\mathrm{W}\), \(\mathrm{Z}\mathrm{Z}\), \(\mathrm{b} \mathrm{b} \), \(\tau \tau \), \(\mathrm{g} \mathrm{g} \), or \(\gamma \gamma \); and \(\varGamma _{\text {tot}}\) is the total width of the boson.

Some quantities, such as \(\sigma _\mathrm{{g} \mathrm{g} \mathrm{H} }\), \(\varGamma _\mathrm{{g} \mathrm{g} }\), and \(\varGamma _{\gamma \gamma }\), are generated by loop diagrams and, therefore, are sensitive to the presence of certain particles beyond the standard model (BSM). The possibility of Higgs boson decays to BSM particles, with a partial width \(\varGamma _{\mathrm{BSM}}\), can also be accommodated by considering \(\varGamma _{\text {tot}}\) as a dependent parameter so that \(\varGamma _{\text {tot}} = \sum \varGamma _{ yy } + \varGamma _{\mathrm{BSM}}\), where \(\sum \varGamma _{ yy }\) stands for the sum over partial widths for all decays to SM particles. With the data from the \(\mathrm{H} ({\mathrm{inv}})\) searches, \(\varGamma _{\mathrm{BSM}}\) can be further broken down as \(\varGamma _{\mathrm{BSM}} = \varGamma _{\mathrm{inv}} + \varGamma _{\mathrm{undet}}\), where \(\varGamma _{\mathrm{inv}}\) can be constrained by searches for invisible decays of the Higgs boson and \(\varGamma _{\mathrm{undet}}\) corresponds to Higgs boson decays not fitting into the previous definitions. The definition of \(\varGamma _{\mathrm{undet}}\) is such that two classes of decays can give rise to \(\varGamma _{\mathrm{undet}}>0\): (i) BSM decays not studied in the analyses used in this paper, such as hypothetical lepton flavour violating decays, e.g. \(\mathrm{H} \rightarrow \mathrm{\mu }\mathrm{\tau }\), and (ii) decays that might not be detectable with the existing experimental setup because of the trigger conditions of the experiment, such as hypothetical decays resulting in a large multiplicity of low-\(p_{\mathrm{T}}\) particles.

To test the observed data for possible deviations from the rates expected for the SM Higgs boson in the different channels, we introduce coupling modifiers, denoted by the scale factors \(\kappa _{i}\) [171]. The scale factors are defined for production processes by \(\kappa _{i}^2=\sigma _{i}/\sigma _{i}^\text {SM}\), for decay processes by \(\kappa _{i}^2=\varGamma _{ii}/\varGamma _{ii}^\text {SM}\), and for the total width by \(\kappa _\mathrm{{H}}^2=\varGamma _\text {tot}/\varGamma _{\mathrm{SM}} \), where the SM values are tabulated in Ref. [171]. When considering the different \(\kappa _{i}\), the index \(i\) can represent many ways to test for deviations:

  • For SM particles with tree-level couplings to the Higgs boson: \(\kappa _{{\mathrm{W}}}\) (\(\mathrm{W}\) bosons), \(\kappa _{\mathrm{{Z}}}\) (\(\mathrm{Z}\) bosons), \(\kappa _\mathrm{{b}}\) (bottom quarks), \(\kappa _{\tau }\) (tau leptons), \(\kappa _\mathrm{{t}}\) (top quarks), and \(\kappa _{\mathrm{\mu }}\) (muons). Unless otherwise noted, the scaling factors for other fermions are tied to those that can be constrained by data.

  • Particular symmetries of the SM make it interesting to test for deviations in whole classes of particles, leading to \(\kappa _{\mathrm{V}}\) (massive vector bosons), \(\kappa _{\mathrm{f}}\) (fermions), \(\kappa _{\ell }\) (leptons), \(\kappa _\mathrm{{q}}\) (quarks), \(\kappa _\mathrm{{u}}\) (up-type fermions), and \(\kappa _\mathrm{{d}}\) (down-type fermions).

  • For SM particles with loop-induced couplings, the scaling factors can be expressed in terms of the tree-level coupling modifiers, assuming the SM loop structure, but can also be taken as effective coupling modifiers: \(\kappa _{\mathrm{{g}}}\) (gluons) and \(\kappa _{\gamma }\) (photons).

  • The scaling factors for couplings to second generation fermions are equal to those for the third generation: \(\kappa _\mathrm{{s}} =\kappa _\mathrm{{b}} \), \(\kappa _{\mathrm{\mu }} =\kappa _{\tau } \), and \(\kappa _\mathrm{{c}} =\kappa _\mathrm{{t}} \), except in Sect. 7.4, where \(\kappa _{\mathrm{\mu }}\) is constrained from the analysis of \(\mathrm{H} \rightarrow \mu \mu \) decays.

Given their small expected contributions, the couplings to electrons, up quarks, and down quarks, are neglected.

In addition to the \(\kappa _{i}\) parameters, the existence of BSM decays, invisible decays, and undetectable decays of the Higgs boson is considered; the corresponding branching fractions are denoted \(\mathrm{BR}_\mathrm{BSM}\), \(\mathrm{BR}_\mathrm{inv}\), and \(\mathrm{BR}_\text {undet}\), as in Ref. [171].

Significant deviations of any \(\kappa \) parameter from unity or of any \({\mathrm{BR}}\) parameter from zero would imply new physics beyond the SM Higgs boson hypothesis. The size of the current data set is insufficient to precisely quantify all phenomenological parameters defining the Higgs boson production and decay rates. Therefore, we present a set of combined analyses of different numbers of parameters, where the remaining parameters are either set to the SM expectations or profiled in the likelihood scans together with all other nuisance parameters. The value of \(m_\mathrm{{H}}\) is fixed to the measured value of \(125.0\,\text {GeV} \), as determined in Sect. 4.1. Since results for the individual channels are based on different assumed values of the mass, differences should be expected when comparing the previously published results from the individual channels with those in this combined analysis.

This section is organized as follows. In Sect. 7.1 we explore whether \(\kappa _{{\mathrm{W}}}\) and \(\kappa _{\mathrm{{Z}}}\) are compatible with each other and can be meaningfully used together as \(\kappa _{\mathrm{V}}\). In Sect. 7.2 we test for deviations that would affect the couplings of massive vector bosons and fermions differently. The scaling factors among different types of fermions, leptons versus quarks and up-type versus down-type, are investigated in Sect. 7.3. In Sect. 7.4, we consider the results of a fit for the tree-level coupling scaling factors and the relation between the observations and the corresponding particle masses. We then turn to the study of models where BSM physics could manifest itself in loops (\(\kappa _{\mathrm{{g}}}\), \(\kappa _{\gamma }\)) or decays (\(\mathrm{BR}_\mathrm{BSM}\), \(\mathrm{BR}_\mathrm{inv}\), \(\mathrm{BR}_\text {undet}\)). In Sect. 7.5 the tree-level couplings are constrained to those expected in the SM, and the searches for \(\mathrm{H} ({\mathrm{inv}})\) are included. This restriction is lifted in Sect. 7.6, where a coupling scaling factor for the massive vector bosons and individual fermion coupling scaling factors are allowed to float, while in Sect. 7.7 the total width scaling factor is also left free to float. In Sect. 7.8, the results from the searches for invisible decays are included, and from the combination of the visible and invisible decays, limits on \(\mathrm{BR}_\text {undet}\) are set. Table 12 summarizes the results of the tests performed.

Table 12 Tests of the compatibility of the data with the SM Higgs boson couplings. The best-fit values and 68 % and 95 % CL confidence intervals are given for the evaluated scaling factors \(\kappa _{i}\) or ratios \(\lambda _{ij}=\kappa _{i}/\kappa _{j}\). The different compatibility tests discussed in the text are separated by horizontal lines. When one of the parameters in a group is evaluated, others are treated as nuisance parameters

7.1 Relation between the coupling to the \(\mathrm{W}\) and \(\mathrm{Z}\) bosons

In the SM, the Higgs sector possesses an approximate \({\mathrm{SU(2)_L \times SU(2)_R}}\) global symmetry, which is broken by the Higgs vacuum expectation value to the diagonal subgroup \({\mathrm{SU(2)_{L+R}}}\). As a result, the tree-level ratios of the \(\mathrm{W}\) and \(\mathrm{Z}\) boson masses, \(m_{\mathrm{W}} / m_\mathrm{{Z}}\), and the ratio of their couplings to the Higgs boson, \(g_{\mathrm{W}} / g_\mathrm{{Z}}\), are protected against large radiative corrections, a property known as “custodial symmetry” [201, 202]. However, large violations of custodial symmetry are possible in new physics models. We focus on the two scaling factors \(\kappa _{{\mathrm{W}}} \) and \(\kappa _{\mathrm{{Z}}} \) that modify the couplings of the SM Higgs boson to the \(\mathrm{W}\) and \(\mathrm{Z}\) bosons and perform two different combined analyses to assess the consistency of the ratio \(\lambda _{{\mathrm{W}\mathrm{Z}}} = \kappa _{{\mathrm{W}}}/ \kappa _{\mathrm{{Z}}} \) with unity.

The dominant production mechanism populating the 0-jet and 1-jet channels of the \(\mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \rightarrow \ell \nu \ell \nu \) analysis and the untagged channels of the \(\mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \rightarrow 4\ell \) analysis is \(\mathrm{g} \mathrm{g} \mathrm{H} \). Therefore, the ratio of event yields in these channels provides a nearly model-independent measurement of \(\lambda _{{\mathrm{W}\mathrm{Z}}} \). We perform a combined analysis of these two channels with two free parameters, \(\kappa _{\mathrm{{Z}}} \) and \(\lambda _{{\mathrm{W}\mathrm{Z}}} \). The likelihood scan versus \(\lambda _{{\mathrm{W}\mathrm{Z}}} \) is shown in Fig. 8 (left). The scale factor \(\kappa _{\mathrm{{Z}}} \) is treated as a nuisance parameter. The result is \(\lambda _{{\mathrm{W}\mathrm{Z}}} =0.94 ^{+0.22}_{-0.18} \), i.e. the data are consistent with the SM expectation (\(\lambda _{{\mathrm{W}\mathrm{Z}}} =1\)).

Fig. 8
figure 8

Likelihood scans versus \(\lambda _{{\mathrm{W}\mathrm{Z}}} \), the ratio of the coupling scaling factors to \(\mathrm{W}\) and \(\mathrm{Z}\) bosons: (left) from untagged \(\text {pp} \rightarrow \mathrm{H} \rightarrow \mathrm{W}\mathrm{W} \) and \(\text {pp} \rightarrow \mathrm{H} \rightarrow \mathrm{Z}\mathrm{Z} \) searches, assuming the SM couplings to fermions, \(\kappa _{\mathrm{f}} =1\); (right) from the combination of all channels, profiling the coupling to fermions. The solid curve represents the observation in data. The dashed curve indicates the expected median result in the presence of the SM Higgs boson. Crossings with the horizontal thick and thin lines denote the 68 % CL and 95 % CL confidence intervals, respectively

Fig. 9
figure 9

Results of 2D likelihood scans for the \(\kappa _{\mathrm{V}} \) and \(\kappa _{\mathrm{f}} \) parameters. The cross indicates the best-fit values. The solid, dashed, and dotted contours show the 68 %, 95 %, and 99.7 % CL confidence regions, respectively. The diamond shows the SM point \((\kappa _{\mathrm{V}}, \kappa _{\mathrm{f}})=(1,1)\). The left plot shows the likelihood scan in two quadrants, \((+,+)\) and \((+,-)\). The right plot shows the likelihood scan constrained to the \((+,+)\) quadrant

We also extract \(\lambda _{{\mathrm{W}\mathrm{Z}}} \) from the combined analysis of all channels. In this approach, we introduce three parameters: \(\lambda _{{\mathrm{W}\mathrm{Z}}} \), \(\kappa _{\mathrm{{Z}}}\), and \(\kappa _{\mathrm{f}}\). The BSM Higgs boson width \(\varGamma _{\mathrm{BSM}}\) is set to zero. The partial width \(\varGamma _\text {gg}\), induced by top and bottom quark loops, scales as \(\kappa _{\mathrm{f}} ^2\). The partial width \(\varGamma _{\gamma \gamma }\) is induced via loop diagrams, with the \(\mathrm{W}\) boson and top quark being the dominant contributors, and is scaled with \(\kappa _{\gamma } ^2(\kappa _\mathrm{{b}},\kappa _{\tau },\kappa _\mathrm{{t}},\kappa _{{\mathrm{W}}})\), a function defined in Eq. (113) of Ref. [171]. In the likelihood scan as a function of \(\lambda _{{\mathrm{W}\mathrm{Z}}}\), both \(\kappa _{\mathrm{{Z}}} \) and \(\kappa _{\mathrm{f}} \) are profiled together with all other nuisance parameters. The introduction of \(\kappa _{\mathrm{f}}\) carries with it the assumption that the coupling to all fermions is common, but possibly different from the SM expectation. The likelihood scan is shown in Fig. 8 (right) with a solid curve. The dashed curve indicates the median expected result for the SM Higgs boson, given the current data set. The measured value from the combined analysis of all channels is \(\lambda _{{\mathrm{W}\mathrm{Z}}} = 0.92 ^{+0.14}_{-0.12} \) and is consistent with the expectation from the SM.

Given these results, and unless otherwise noted, in all subsequent measurements we assume \(\lambda _{{\mathrm{W}\mathrm{Z}}} =1\) and use a common factor \(\kappa _{\mathrm{V}} \) to modify the couplings to \(\mathrm{W}\) and \(\mathrm{Z}\) bosons, while preserving their ratio.

7.2 Test of the couplings to massive vector bosons and fermions

In the SM, the nature of the coupling of the Higgs boson to fermions, through a Yukawa interaction, is different from the nature of the Higgs boson coupling to the massive vector bosons, a result of electroweak symmetry breaking. Some BSM models predict couplings to fermions and massive vector bosons different from those in the SM.

We compare the observations in data with the expectation for the SM Higgs boson by fitting two parameters, \(\kappa _{\mathrm{V}}\) and \(\kappa _{\mathrm{f}}\), where \(\kappa _{\mathrm{V}} =\kappa _{{\mathrm{W}}} =\kappa _{\mathrm{{Z}}} \) is a common scaling factor for massive vector bosons, and \(\kappa _{\mathrm{f}} =\kappa _\mathrm{{b}} =\kappa _\mathrm{{t}} =\kappa _{\tau } \) is a common scaling factor for fermions. We assume that \(\varGamma _{\mathrm{BSM}}=0\). At leading order, all partial widths scale either as \(\kappa _{\mathrm{V}} ^2\) or \(\kappa _{\mathrm{f}} ^2\), except for \(\varGamma _{\gamma \gamma }\). As discussed in Sect. 7.1, the partial width \(\varGamma _{\gamma \gamma }\) is induced via loops with virtual \(\mathrm{W}\) bosons or top quarks and scales as a function of both \(\kappa _{\mathrm{V}} \) and \(\kappa _{\mathrm{f}} \). For that reason, the \(\mathrm{H} \rightarrow \gamma \gamma \) channel is the only channel being combined that is sensitive to the relative sign of \(\kappa _{\mathrm{V}} \) and \(\kappa _{\mathrm{f}} \).

Figure 9 shows the 2D likelihood scan over the \((\kappa _{\mathrm{V}},\kappa _{\mathrm{f}})\) parameter space. While Fig. 9 (left) allows for different signs of \(\kappa _{\mathrm{V}} \) and \(\kappa _{\mathrm{f}} \), Fig. 9 (right) constrains the scan to the \((+,+)\) quadrant that contains the SM expectation \((1,1)\). The \((-,-)\) and \((-,+)\) quadrants are not shown since they are degenerate with respect to the ones studied, with the implication that with the available analyses we can only probe whether \(\kappa _{\mathrm{V}}\) and \(\kappa _{\mathrm{f}}\) have the same sign or different signs. Studies of the production of a Higgs boson associated with a single top quark can, in principle, lift that degeneracy.

Fig. 10
figure 10

The 68 % CL confidence regions for individual channels (coloured swaths) and for the overall combination (thick curve) for the \(\kappa _{\mathrm{V}}\) and \(\kappa _{\mathrm{f}}\) parameters. The cross indicates the global best-fit values. The dashed contour bounds the 95 % CL confidence region for the combination. The diamond represents the SM expectation, \((\kappa _{\mathrm{V}}, \kappa _{\mathrm{f}})=(1,1)\). The left plot shows the likelihood scan in two quadrants \((+,+)\) and \((+,-)\), the right plot shows the positive quadrant only

Fig. 11
figure 11

(Left) Likelihood scan versus ratio of couplings to down/up fermions, \(\lambda _{\mathrm{{d} \mathrm{u}}}\), with the two other free coupling modifiers, \(\kappa _{\mathrm{V}}\) and \(\kappa _\mathrm{{u}}\), profiled together with all other nuisance parameters. (Right) Likelihood scan versus ratio of couplings to leptons and quarks, \(\lambda _{\mathrm{\ell \mathrm{q}}}\), with the two other free coupling modifiers, \(\kappa _{\mathrm{V}}\) and \(\kappa _\mathrm{{q}}\), profiled together with all other nuisance parameters

In Fig. 9 the 68 %, 95 %, and 99.7 % CL confidence regions for \(\kappa _{\mathrm{V}} \) and \(\kappa _{\mathrm{f}} \) are shown with solid, dashed, and dotted curves, respectively. The data are compatible with the expectation for the standard model Higgs boson: the point \((\kappa _{\mathrm{V}},\kappa _{\mathrm{f}})=(1,1)\) is within the 68 % CL confidence region defined by the data. Because of the way these compatibility tests are constructed, any significant deviations from \((1,1)\) would not have a straightforward interpretation within the SM and would imply BSM physics; the scale and sign of the best-fit values in the case of significant deviations would guide us in identifying the most plausible BSM scenarios.

Figure 10 shows the results of this combined analysis in the different decay mode groups. The role and interplay of different channels is important. For example, Fig. 9 (left) shows a region in the \((+,-)\) quadrant, where \(\kappa _{\mathrm{V}}\) and \(\kappa _{\mathrm{f}}\) have opposite signs, which is excluded at the 95 % CL but not at the 99.7 % CL; it can be seen in Fig. 10 (left) how the combined exclusion in the \((+,-)\) quadrant is foremost due to the ability of the \(\mathrm{H} \rightarrow \gamma \gamma \) decay to discern the relative sign between \(\kappa _{\mathrm{V}}\) and \(\kappa _{\mathrm{f}}\). This is due to the destructive interference between the amplitudes of the \(\mathrm{W}\) loops and top quark loops in the \(\mathrm{H} \rightarrow \gamma \gamma \) decay: \(\kappa _{\gamma } ^2 \sim 1.59\;\kappa _{\mathrm{V}} ^2 - 0.66\;\kappa _{\mathrm{V}} \kappa _{\mathrm{f}} +0.07\;\kappa _{\mathrm{f}} ^2\); if \(\kappa _{\mathrm{V}}\) and \(\kappa _{\mathrm{f}}\) have opposite signs, the interference becomes constructive, leading to a larger \(\mathrm{H} \rightarrow \gamma \gamma \) branching fraction. The shapes of the confidence regions for other decay channels are also interesting: the analyses of decays to massive vector bosons constrain \(\kappa _{\mathrm{V}}\) better than \(\kappa _{\mathrm{f}}\), whereas the analyses of decays to fermions constrain \(\kappa _{\mathrm{f}}\) better than \(\kappa _{\mathrm{V}}\). In the model used for this analysis, the total width scales as \(\kappa _\mathrm{{H}} ^2 \sim 0.75\;\kappa _{\mathrm{f}} ^2 + 0.25\;\kappa _{\mathrm{V}} ^2\), reflecting the large expected contributions from the bottom quark and \(\mathrm{W}\) boson.

The 95 % CL confidence intervals for \(\kappa _{\mathrm{V}} \) and \(\kappa _{\mathrm{f}} \), obtained from a scan where the other parameter is floated, are \([0.87,1.14]\) and \([0.63,1.15]\), respectively.

7.3 Test for asymmetries in the couplings to fermions

In models with two Higgs doublets (2HDM) [203], the couplings of the neutral Higgs bosons to fermions can be substantially modified with respect to the couplings predicted for the SM Higgs boson. For example, in the minimal supersymmetric standard model [204], the couplings of neutral Higgs bosons to up-type and down-type fermions are modified, with the modification being the same for all three generations and for quarks and leptons. In more general 2HDMs, leptons can be made to virtually decouple from one Higgs boson that otherwise behaves in a SM-like way with respect to the \(\mathrm{W}\) bosons, \(\mathrm{Z}\) bosons, and quarks. Inspired by the possibility of such modifications to the fermion couplings, we perform two combinations in which we allow for different ratios of the couplings to down-type fermions and up-type fermions (\(\lambda _{\mathrm{{d} \mathrm{u}}} = \kappa _\mathrm{{d}}/ \kappa _\mathrm{{u}} \)) or different ratios of the couplings to leptons and quarks (\(\lambda _{\mathrm{\ell \mathrm{q}}} = \kappa _{\ell }/ \kappa _\mathrm{{q}} \)).

Fig. 12
figure 12

(Left) Results of likelihood scans for a model where the gluon and photon loop-induced interactions with the Higgs boson are resolved in terms of the couplings of other SM particles. The inner bars represent the 68 % CL confidence intervals while the outer bars represent the 95 % CL confidence intervals. When performing the scan for one parameter, the other parameters in the model are profiled. (Right) The 2D likelihood scan for the \(M\) and \(\epsilon \) parameters of the model detailed in the text. The cross indicates the best-fit values. The solid, dashed, and dotted contours show the 68 %, 95 %, and 99.7 % CL confidence regions, respectively. The diamond represents the SM expectation, \((M, \epsilon )=(v,0)\), where \(v\) is the SM Higgs vacuum expectation value, \(v=246.22\,\text {GeV} \)

Figure 11 (left) shows the likelihood scan versus \(\lambda _{\mathrm{{d} \mathrm{u}}} \), with \(\kappa _{\mathrm{V}} \) and \(\kappa _\mathrm{{u}} \) profiled together with all other nuisance parameters. Figure 11 (right) shows the likelihood scan versus \(\lambda _{\mathrm{\ell \mathrm{q}}} \), with \(\kappa _{\mathrm{V}} \) and \(\kappa _\mathrm{{q}} \) profiled. Assuming that both \(\lambda _{\mathrm{{d} \mathrm{u}}}\) and \(\lambda _{\mathrm{\ell \mathrm{q}}}\) are positive, the 95 % CL confidence intervals are found to be \([0.65,1.39]\) and \([0.62,1.50]\), respectively. There is no evidence that different classes of fermions have different scaling factors.

7.4 Test of the scaling of couplings with the masses of SM particles

Under the assumption that there are no interactions of the Higgs boson other than to the massive SM particles, the data allow a fit for deviations in \(\kappa _{{\mathrm{W}}}\), \(\kappa _{\mathrm{{Z}}}\), \(\kappa _\mathrm{{b}}\), \(\kappa _{\tau }\), \(\kappa _\mathrm{{t}}\), and \(\kappa _{\mathrm{\mu }}\). In this fit, the loop-induced processes (\(\sigma _\mathrm{{g} \mathrm{g} \mathrm{H} }\), \(\varGamma _\mathrm{{g} \mathrm{g} }\), and \(\varGamma _{\gamma \gamma }\)) are expressed in terms of the above tree-level \(\kappa \) parameters and are scaled according to their SM loop structure. The result for this fit is displayed in Fig. 12 (left) and shows no significant deviations from the SM expectation. The small uncertainty in the \(\kappa _\mathrm{{t}}\) parameter directly reflects the fact that in this model, the \(\mathrm{g} \mathrm{g} \mathrm{H} \) production mode is being described in terms of \(\kappa _\mathrm{{t}}\) and \(\kappa _\mathrm{{b}}\), \(\kappa _{\mathrm{{g}}} ^2 \sim 1.11\;\kappa _\mathrm{{t}} ^2 +0.01\;\kappa _\mathrm{{t}} \kappa _\mathrm{{b}}-0.12\;\kappa _\mathrm{{b}} ^2\), such that \(\kappa _\mathrm{{b}}\) has a small contribution.

In the SM, the Yukawa coupling between the Higgs boson and the fermions, \(\lambda _{\mathrm{f}}\), is proportional to the mass of the fermion, \(m_{\mathrm{f}}\). This is in contrast with the coupling to weak bosons, \(g_{\mathrm{V}}\), which involves the square of the mass of the weak boson, \(m_{\mathrm{V}}\). With these differences in mind, it is possible to motivate a phenomenological parameterization relating the masses of the fermions and weak bosons to the corresponding \(\kappa \) modifiers using two parameters, \(M\) and \(\epsilon \) [205, 206]. In such a model one has for each fermion \(\kappa _{\mathrm{f}} = v \; m_{\mathrm{f}}^{\epsilon } / M^{1+\epsilon }\) and for each weak boson \(\kappa _{\mathrm{V}} = v \; m_{\mathrm{V}}^{2\epsilon } / M^{1+2\epsilon }\), where \(v\) is the SM Higgs boson vacuum expectation value, \(v=246.22\,\text {GeV} \) [207]. The SM expectation, \(\kappa _{i}=1\), is recovered when \((M,\epsilon )=(v,0)\). The parameter \(\epsilon \) changes the power with which the coupling scales with the particle mass; if the couplings were independent of the masses of the particles, one would expect to find \(\epsilon =-1\). To perform a fit to data, the particle mass values need to be specified. For leptons and weak bosons we have taken the values from Ref. [207]. For consistency with theoretical calculations used in setting the SM expectations, the top quark mass is taken to be \(172.5\,\text {GeV} \). The bottom quark is evaluated at the scale of the Higgs boson mass, \(m_{\mathrm{b}}(m_\mathrm{{H}} =125.0\,\text {GeV} )=2.76\,\text {GeV} \). In the fit, the mass parameters are treated as constants. The likelihood scan for \((M,\epsilon )\) is shown in Fig. 12 (right). It can be seen that the data do not significantly deviate from the SM expectation. The 95 % CL confidence intervals for the \(M\) and \(\epsilon \) parameters are \([217,279]\) \(\,\text {GeV}\) and \([-0.054,0.100]\), respectively.

Fig. 13
figure 13

Graphical representation of the results obtained for the models considered in Fig. 12. The dashed line corresponds to the SM expectation. The points from the fit in Fig. 12 (left) are placed at particle mass values chosen as explained in the text. The ordinates are different for fermions and massive vector bosons to take into account the expected SM scaling of the coupling with mass, depending on the type of particle. The result of the \((M,\epsilon )\) fit from Fig. 12 (right) is shown as the continuous line while the inner and outer bands represent the 68 % and 95 % CL confidence regions

The results of the two fits above are plotted versus the particle masses in Fig. 13. While the choice of the mass values for the abscissas is discussed above, to be able to show both Yukawa and weak boson couplings in the same plot requires a transformation of the results of the \(\kappa \) fit. Since \(g_V \sim \kappa _{\mathrm{V}} 2m_{\mathrm{V}}^2/v\) and \(\lambda _{\mathrm{f}} \sim \kappa _{\mathrm{f}} m_{\mathrm{f}}/v\), we have chosen to plot a “reduced” weak boson coupling, \(\sqrt{g_{\mathrm{V}}/(2v)}=\kappa _{\mathrm{V}} ^{1/2} m_{\mathrm{V}}/v\). This choice allows fermion and weak boson results to be plotted together, as shown in Fig. 13, but implies that the uncertainties for \(\kappa _{{\mathrm{W}}}\) and \(\kappa _{\mathrm{{Z}}}\) will seem to be reduced. This simply reflects the square root in the change of variables and not any gain of information with respect to the \(\kappa \) fit shown Fig. 12 (left). The result of the \((M,\epsilon )\) fit is shown in Fig. 13 as the band around the dashed line that represents the SM expectation. While the existing measurement of the scaling factor for the coupling of the boson with muons is clearly imprecise, the picture that arises from covering more than three orders of magnitude in particle mass is that the boson couples differently to the different particles and that those couplings are related to the mass of each particle. This is further supported by upper limits set in searches for \(\mathrm{H} \rightarrow \mathrm{e}\mathrm{e} \) decays: when assuming the production cross sections predicted in the SM, the branching fraction is limited to be \(\mathcal {B}(\mathrm{H} \rightarrow \mathrm{e}\mathrm{e} )<1.9\times 10^{-3}\) at the 95 % CL [30].

7.5 Test for the presence of BSM particles in loops

The manifestation of BSM physics can considerably modify the Higgs boson phenomenology even if the underlying Higgs boson sector in the model remains unaltered. Processes that are loop-induced at leading order, such as the \(\mathrm{H} \rightarrow \gamma \gamma \) decay and \(\mathrm{g} \mathrm{g} \mathrm{H} \) production, can be particularly sensitive to the presence of new particles. Therefore, we combine and fit the data for the scale factors for these two processes, \(\kappa _{\gamma } \) and \(\kappa _{\mathrm{{g}}} \). The partial widths associated with the tree-level production processes and decay modes are assumed to be those expected in the SM, and the total width scales as \(\kappa _\mathrm{{H}} ^2\sim 0.0857 \;\kappa _{\mathrm{{g}}} ^2 +0.0023\;\kappa _{\gamma } ^2 + 0.912\).

Fig. 14
figure 14

The 2D likelihood scan for the \(\kappa _{\mathrm{{g}}} \) and \(\kappa _{\gamma } \) parameters, assuming that \(\varGamma _{\mathrm{BSM}}=0\). The cross indicates the best-fit values. The solid, dashed, and dotted contours show the 68 %, 95 %, and 99.7 % CL confidence regions, respectively. The diamond represents the SM expectation, \((\kappa _{\gamma }, \kappa _{\mathrm{{g}}})=(1,1)\). The partial widths associated with the tree-level production processes and decay modes are assumed to be unaltered (\(\kappa = 1\))

Fig. 15
figure 15

(Left) The likelihood scan versus \(\mathrm{BR}_\mathrm{BSM} =\varGamma _{\mathrm{BSM}}/\varGamma _{\text {tot}}\). The solid curve represents the observation and the dashed curve indicates the expected median result in the presence of the SM Higgs boson. The partial widths associated with the tree-level production processes and decay modes are assumed to be as expected in the SM. (Middle) Result when also combining with data from the \(\mathrm{H} ({\mathrm{inv}})\) searches, thus assuming that \(\mathrm{BR}_\mathrm{BSM} =\mathrm{BR}_\mathrm{inv} \), i.e. that there are no undetected decays, \(\mathrm{BR}_\text {undet} =0\). (Right) Result when further assuming that \(\kappa _{\mathrm{{g}}} =\kappa _{\gamma } =1\) and combining with the data from the \(\mathrm{H} ({\mathrm{inv}})\) searches

Figure 14 shows the 2D likelihood scan for the \(\kappa _{\mathrm{{g}}} \) and \(\kappa _{\gamma } \) parameters, assuming that \(\varGamma _{\mathrm{BSM}}=0\). The results are compatible with the expectation for the SM Higgs boson, with the point \((\kappa _{\gamma }, \kappa _{\mathrm{{g}}})=(1,1)\) within the 68 % CL confidence region defined by the data. The best-fit point is \((\kappa _{\gamma }, \kappa _{\mathrm{{g}}})=(1.14, 0.89)\). The 95 % CL confidence interval for \(\kappa _{\gamma } \), when profiling \(\kappa _{\mathrm{{g}}} \) and all nuisance parameters, is \([0.89,1.40]\). For \(\kappa _{\mathrm{{g}}} \), the 95 % CL confidence interval is \([0.69,1.11]\), when profiling \(\kappa _{\gamma } \) and all other nuisance parameters.

Another way in which BSM physics may manifest itself is through the decay of the boson into BSM particles. To explore this possibility, we consider a further parameter that allows for a partial decay width into BSM particles, \(\mathrm{BR}_\mathrm{BSM} =\varGamma _{\mathrm{BSM}}/\varGamma _{\text {tot}}\). In this case, the total width scales as \(\kappa _\mathrm{{H}} ^2\sim (0.0857\;\kappa _{\mathrm{{g}}} ^2 + 0.0023\;\kappa _{\gamma } ^2 + 0.912)/(1-\mathrm{BR}_\mathrm{BSM})\).

Figure 15 (left) shows the likelihood scan versus \(\mathrm{BR}_\mathrm{BSM}\), with \(\kappa _{\mathrm{{g}}} \) and \(\kappa _{\gamma } \) constrained to be positive and profiled together with all other nuisance parameters. While under the SM hypothesis the expected 95 % CL confidence interval for \(\mathrm{BR}_\mathrm{BSM}\) is \([0.00,0.42]\), the data are such that the 95 % CL confidence interval for \(\mathrm{BR}_\mathrm{BSM}\) is \([0.00,0.32]\), narrower than the expectation. The best fit in data also takes into account variations in \(\kappa _{\mathrm{{g}}}\) and \(\kappa _{\gamma }\), particularly the preference for \(\kappa _{\mathrm{{g}}}\) smaller than unity in data, which influences the observed limit on \(\mathrm{BR}_\mathrm{BSM}\).

A further step can be taken by also including the data from the searches for \(\mathrm{H} ({\mathrm{inv}})\). The \(\mathrm{H} ({\mathrm{inv}})\) searches reported an observed (expected) upper limit on \(\mathrm{BR}_\mathrm{inv}\) of 0.58 (0.44) at the 95 % CL [28]. When including the \(\mathrm{H} ({\mathrm{inv}})\) search results in the combined analysis, one can only obtain bounds assuming that there are no undetected decay modes, \(\mathrm{BR}_\text {undet} =0\), i.e. that \(\mathrm{BR}_\mathrm{BSM} =\mathrm{BR}_\mathrm{inv} \). The results for the likelihood scan as a function of \(\mathrm{BR}_\mathrm{inv} (\mathrm{BR}_\text {undet} =0)\) when including the data from the \(\mathrm{H} ({\mathrm{inv}})\) searches is shown in Fig. 15 (middle). The expected 95 % CL confidence interval for \(\mathrm{BR}_\mathrm{inv} (\mathrm{BR}_\text {undet} =0)\) under the SM hypothesis is \([0.00,0.29]\), 31 % narrower than in the above case studied without the \(\mathrm{H} ({\mathrm{inv}})\) data, a reflection of the added power of the \(\mathrm{H} ({\mathrm{inv}})\) analysis. On the other hand, the 95 % CL confidence interval for \(\mathrm{BR}_\mathrm{inv} (\mathrm{BR}_\text {undet} =0)\) in data is \([0.00,0.32]\), similar to the result obtained without including the \(\mathrm{H} ({\mathrm{inv}})\) data, because the observed upper limit on \(\mathrm{BR}_\mathrm{inv} \) was found to be larger than expected in those searches. It should be noted that the shape of the observed curve changes substantially and the inclusion of the \(\mathrm{H} ({\mathrm{inv}})\) data leads to a very shallow minimum of the likelihood when \(\mathrm{BR}_\mathrm{inv} (\mathrm{BR}_\text {undet} =0)=0.03 \).

Finally, one may further set \(\kappa _{\mathrm{{g}}} =\kappa _{\gamma } =1\), which effectively implies \(\kappa _i=1\), i.e. assumes that the couplings to all SM particles with mass are as expected from the SM. From the combined analysis including the data from the \(\mathrm{H} ({\mathrm{inv}})\) searches, we can thus obtain bounds on \(\mathrm{BR}_\mathrm{inv} (\mathrm{BR}_\text {undet} =0,\kappa _i=1)\). The likelihood scan results are shown in Fig. 15 (right). The expected 95 % CL confidence interval for \(\mathrm{BR}_\mathrm{inv} (\mathrm{BR}_\text {undet} =0,\kappa _i=1)\) under the SM hypothesis is \([0.00,0.21]\), which is 28 % narrower than in the previous paragraph, a reflection of the total width now being fixed to the SM expectation. The 95 % CL confidence interval for \(\mathrm{BR}_\mathrm{inv} (\mathrm{BR}_\text {undet} =0,\kappa _i=1)\) in data is \([0.00,0.27]\), showing again a shallow minimum of the likelihood when \(\mathrm{BR}_\mathrm{inv} (\mathrm{BR}_\text {undet} =0,\kappa _i=1)=0.06 \).

The results obtained from the different combined analyses presented in Fig. 15 show the added value from combining the \(\mathrm{H} ({\mathrm{inv}})\) searches with the visible decay measurements, with the expected 95 % CL combined upper limit on \(\mathrm{BR}_\mathrm{inv}\) being up to a factor of two smaller than either, depending on the assumptions made.

Fig. 16
figure 16

Likelihood scans for parameters in a model with coupling scaling factors for the SM particles, one coupling at a time while profiling the remaining five together with all other nuisance parameters; from top to bottom: \(\kappa _{\mathrm{V}}\) (\(\mathrm{W}\) and \(\mathrm{Z}\) bosons), \(\kappa _\mathrm{{b}}\) (bottom quarks), \(\kappa _{\tau }\) (tau leptons), \(\kappa _\mathrm{{t}}\) (top quarks), \(\kappa _{\mathrm{{g}}}\) (gluons; effective coupling), and \(\kappa _{\gamma }\) (photons; effective coupling). The inner bars represent the 68 % CL confidence intervals while the outer bars represent the 95 % CL confidence intervals

Fig. 17
figure 17

Likelihood scans for parameters in a model without assumptions on the total width and with six coupling modifier ratios, one parameter at a time while profiling the remaining six together with all other nuisance parameters; from top to bottom: \(\kappa _\mathrm{{g} \mathrm{Z}}\) (\(=\kappa _{\mathrm{{g}}} \kappa _{\mathrm{{Z}}}/\kappa _\mathrm{{H}} \)), \(\lambda _{{\mathrm{W}\mathrm{Z}}}\) (\(=\kappa _{{\mathrm{W}}}/\kappa _{\mathrm{{Z}}} \)), \(\lambda _\mathrm{{Z}\mathrm{g}}\) (\(=\kappa _{\mathrm{{Z}}}/\kappa _{\mathrm{{g}}} \)), \(\lambda _\mathrm{{b}\mathrm{Z}}\) (\(=\kappa _\mathrm{{b}}/\kappa _{\mathrm{{Z}}} \)), \(\lambda _{\gamma \mathrm{Z}}\) (\(=\kappa _{\gamma }/\kappa _{\mathrm{{Z}}} \)), \(\lambda _{\tau \mathrm{Z}}\) (\(=\kappa _{\tau }/\kappa _{\mathrm{{Z}}} \)), and \(\lambda _\mathrm{{t}\mathrm{g}}\) (\(=\kappa _\mathrm{{t}}/\kappa _{\mathrm{{g}}} \)). The inner bars represent the 68 % CL confidence intervals while the outer bars represent the 95 % CL confidence intervals

7.6 Test of a model with scaling factors for SM particles

After having examined the possibility for BSM physics to manifest itself in loop-induced couplings while fixing all the other scaling factors, we now release the latter assumption. For that, we explore a model with six independent coupling modifiers and make the following assumptions:

  • The couplings to \(\mathrm{W}\) and \(\mathrm{Z}\) bosons scale with a common parameter \(\kappa _{\mathrm{V}} =\kappa _{{\mathrm{W}}} =\kappa _{\mathrm{{Z}}} \).

  • The couplings to third generation fermions, i.e. the bottom quark, tau lepton, and top quark, scale independently with \(\kappa _\mathrm{{b}}\), \(\kappa _{\tau }\), and \(\kappa _\mathrm{{t}}\), respectively.

  • The effective couplings to gluons and photons, induced by loop diagrams, scale with free parameters \(\kappa _{\mathrm{{g}}} \) and \(\kappa _{\gamma } \), respectively.

  • The partial width \(\varGamma _{\mathrm{BSM}}\) is zero.

A likelihood scan for each of the six coupling modifiers is performed while profiling the other five, together with all other nuisance parameters; the results are shown in Fig. 16. With this set of parameters, the \(\mathrm{g} \mathrm{g} \mathrm{H} \)-production measurements will constrain \(\kappa _{\mathrm{{g}}}\), leaving the measurements of \(\mathrm{t}\mathrm{t}\mathrm{H} \) production to constrain \(\kappa _\mathrm{{t}}\), which explains the best-fit value, \(\kappa _\mathrm{{t}} =1.60 ^{+0.34}_{-0.32} \). The current data do not show any statistically significant deviation with respect to the SM Higgs boson hypothesis. For every \(\kappa _i\) probed, the measured 95 % CL confidence interval contains the SM expectation, \(\kappa _i=1\). A goodness-of-fit test between the parameters measured in this model and the SM prediction yields a \(\chi ^{2}/\text {dof} = 7.5/6\), which corresponds to an asymptotic \(p\text {-value}\) of 0.28.

7.7 Test of a general model without assumptions on the total width

Given the comprehensiveness of the set of analyses being combined, we can explore the most general model proposed in Ref. [171], which makes no assumptions on the scaling of the total width. In this model, the total width is not rescaled according to the different \(\kappa _{i}\) values as a dependent parameter, but is rather left as a free parameter, embedded in \(\kappa _\mathrm{{g} \mathrm{Z}} =\kappa _{\mathrm{{g}}} \kappa _{\mathrm{{Z}}}/\kappa _\mathrm{{H}} \). All other parameters of interest are expressed as ratios between coupling scaling factors, \(\lambda _{ij}=\kappa _{i}/\kappa _{j}\).

Fig. 18
figure 18

(Left) Likelihood scan versus \(\mathrm{BR}_\mathrm{BSM} =\varGamma _{\mathrm{BSM}}/\varGamma _{\text {tot}}\). The solid curve represents the observation in data and the dashed curve indicates the expected median result in the presence of the SM Higgs boson. The modifiers for both the tree-level and loop-induced couplings are profiled, but the couplings to the electroweak bosons are assumed to be bounded by the SM expectation (\(\kappa _{\mathrm{V}} \le 1\)). (Right) Result when also combining with data from the \(\mathrm{H} ({\mathrm{inv}})\) searches, thus assuming that \(\mathrm{BR}_\mathrm{BSM} =\mathrm{BR}_\mathrm{inv} \), i.e. \(\mathrm{BR}_\text {undet} =0\)

A likelihood scan for each of the parameters \(\kappa _\mathrm{{g} \mathrm{Z}}\), \(\lambda _{{\mathrm{W}\mathrm{Z}}}\), \(\lambda _\mathrm{{Z}\mathrm{g}}\), \(\lambda _\mathrm{{b}\mathrm{Z}}\), \(\lambda _{\gamma \mathrm{Z}}\), \(\lambda _{\tau \mathrm{Z}}\), and \(\lambda _\mathrm{{t}\mathrm{g}}\) is performed while profiling the other six, together with all other nuisance parameters. The results are shown in Fig. 17 and are in line with those found in Sect. 7.6.

Fig. 19
figure 19

(Left) The 2D likelihood scan for the \(\mathrm{BR}_\mathrm{inv}\) and \(\mathrm{BR}_\text {undet}\) parameters for a combined analysis of the \(\mathrm{H} ({\mathrm{inv}})\) search data and visible decay channels. The cross indicates the best-fit values. The solid, dashed, and dotted contours show the 68 %, 95 %, and 99.7 % CL confidence regions, respectively. The diamond represents the SM expectation, \((\mathrm{BR}_\mathrm{inv}, \mathrm{BR}_\text {undet})=(0,0)\). (Right) The likelihood scan versus \(\mathrm{BR}_\text {undet}\). The solid curve represents the observation in data and the dashed curve indicates the expected median result in the presence of the SM Higgs boson. \(\mathrm{BR}_\mathrm{inv}\) is constrained by the data from the \(\mathrm{H} ({\mathrm{inv}})\) searches and modifiers for both the tree-level and loop-induced couplings are profiled, but the couplings to the electroweak bosons are assumed to be bounded by the SM expectation (\(\kappa _{\mathrm{V}} \le 1\))

7.8 Constraints on \(\mathrm{BR}_\mathrm{BSM}\) in a scenario with free couplings

An alternative and similarly general scenario can be built by allowing for \(\varGamma _{\mathrm{BSM}}>0\). In order to avoid the degeneracy through which the total width and the coupling scaling factors can compensate each other, we constrain \(\kappa _{\mathrm{V}} \le 1\), a requirement that holds in a wide class of models, namely in any model with an arbitrary number of Higgs doublets, with and without additional Higgs singlets [171]. The model has the following parameters: \(\kappa _{\mathrm{V}}\), \(\kappa _\mathrm{{b}}\), \(\kappa _{\tau }\), \(\kappa _\mathrm{{t}}\), \(\kappa _{\mathrm{{g}}}\), \(\kappa _{\gamma }\), and \(\mathrm{BR}_\mathrm{BSM}\). This is a much more general treatment than that performed in Sect. 7.5, where only the loop-induced couplings to photons and gluons were allowed to deviate from the SM expectation. As in Sect. 7.5, this model also allows for a combined analysis with the data from the \(\mathrm{H} ({\mathrm{inv}})\) searches.

Figure 18 (left) shows the likelihood scan versus \(\mathrm{BR}_\mathrm{BSM}\) derived in this scenario, while profiling all the other coupling modifiers and nuisance parameters. Within these assumptions, the 95 % CL confidence interval for \(\mathrm{BR}_\mathrm{BSM}\) in data is \([0.00,0.57]\), while the expected interval for the SM hypothesis is \([0.00,0.52]\).

Assuming that there are no undetected decay modes, \(\mathrm{BR}_\text {undet} =0\), it follows that \(\mathrm{BR}_\mathrm{BSM} =\mathrm{BR}_\mathrm{inv} \) and the data from the searches for \(\mathrm{H} ({\mathrm{inv}})\) can be combined with the data from the other channels to set bounds on \(\mathrm{BR}_\mathrm{inv}\). The likelihood scan for such a model and combination is shown in Fig. 18 (right). The 95 % CL confidence interval for \(\mathrm{BR}_\mathrm{inv}\) in data is \([0.00,0.49]\), while the expected interval for the SM hypothesis is \([0.00,0.32]\). The difference between the expected and observed confidence intervals reflects the results of the \(\mathrm{H} ({\mathrm{inv}})\) analysis that reported an observed (expected) upper limit on \(\mathrm{BR}_\mathrm{inv}\) of 0.58 (0.44) at the 95 % CL [28].

Finally, instead of simply assuming \(\mathrm{BR}_\text {undet} =0\), a simultaneous fit for \(\mathrm{BR}_\mathrm{inv}\) and \(\mathrm{BR}_\text {undet}\) is performed. In this case, the data from the \(\mathrm{H} ({\mathrm{inv}})\) searches constrains \(\mathrm{BR}_\mathrm{inv}\), while the visible decays constrain \(\mathrm{BR}_\mathrm{BSM} =\mathrm{BR}_\mathrm{inv} +\mathrm{BR}_\text {undet} \). The 2D likelihood scan for \((\mathrm{BR}_\mathrm{inv},\mathrm{BR}_\text {undet})\) is shown in Fig. 19 (left), while Fig. 19 (right) shows the likelihood scan for \(\mathrm{BR}_\text {undet}\) when profiling all other parameters, \(\mathrm{BR}_\mathrm{inv}\) included. The 95 % CL confidence interval for \(\mathrm{BR}_\text {undet}\) in data is \([0.00,0.52]\), while the expected interval for the SM hypothesis is \([0.00,0.51]\).

7.9 Summary of tests of the compatibility of the data with the SM Higgs boson couplings

Figure 20 summarizes the results for the benchmark scenarios of Ref. [171] with fewest parameters and shows that, in those benchmarks, all results are compatible with the SM expectations.

Fig. 20
figure 20

Summary plot of likelihood scan results for the different parameters of interest in benchmark models from Ref. [171] separated by dotted lines. The \(\mathrm{BR}_\mathrm{BSM}\) value at the bottom is obtained for the model with three parameters \((\kappa _{\mathrm{{g}}},\kappa _{\gamma },\mathrm{BR}_\mathrm{BSM})\). The inner bars represent the 68 % CL confidence intervals while the outer bars represent the 95 % CL confidence intervals

A much more comprehensive overview of the searches performed for deviations from the SM Higgs boson expectation is provided in Table 12, where all results obtained in this section are summarized.

No statistically significant deviations are observed with respect to the SM Higgs boson expectation.

8 Summary

Properties of the Higgs boson with mass near 125\(\,\text {GeV}\) are measured in proton-proton collisions with the CMS experiment at the LHC. Comprehensive sets of production and decay measurements are combined. The decay channels include \(\gamma \gamma \), \(\mathrm{Z}\mathrm{Z}\), \(\mathrm{W}\mathrm{W}\), \(\tau \tau \), \(\mathrm{b} \mathrm{b} \), and \(\mu \mu \) pairs. The data samples were collected in 2011 and 2012 and correspond to integrated luminosities of up to 5.1\(\,\text {fb}^\text {-1}\) at 7\(\,\text {TeV}\) and up to 19.7\(\,\text {fb}^\text {-1}\) at 8\(\,\text {TeV}\). From the high-resolution \(\gamma \gamma \) and \(\mathrm{Z}\mathrm{Z}\) channels, the mass of the Higgs boson is determined to be \(125.02\,^{+0.26}_{-0.27} \,\text {(stat)} \,^{+0.14}_{-0.15} \,\text {(syst)} \,\text {GeV} \). For this mass value, the event yields obtained in the different analyses tagging specific decay channels and production mechanisms are consistent with those expected for the standard model Higgs boson. The combined best-fit signal relative to the standard model expectation is \(1.00\,\pm 0.09\,\text {(stat)} \,^{+0.08}_{-0.07}\,\text {(theo)} \,\pm 0.07\,\text {(syst)} \) at the measured mass. The couplings of the Higgs boson are probed for deviations in magnitude from the standard model predictions in multiple ways, including searches for invisible and undetected decays. No significant deviations are found.