1 Introduction

In the Standard Model (SM), the \({{B} ^0_{s}} \) and \({{\hspace{0.0pt}\overline{\hspace{0.0pt}B}} {}^0_{s}} \) flavour eigenstates can be expressed as a linear combination of the heavy (H) and light (L) mass eigenstates with decay widths \(\Gamma _\textrm{H}\) and \(\Gamma _\textrm{L}\), respectively. A sizeable difference between these decay widths is predicted [1]. The effective lifetime of a \({{B} ^0_{s}} \) meson in a specific decay mode is measured by fitting the decay time distribution with a single exponential function [2, 3]. The experimental data on the weak mixing phase, \(\phi _s\), are consistent with the SM prediction that \(C\!P\) violation in \({{B} ^0_{s}}- {{\hspace{0.0pt}\overline{\hspace{0.0pt}B}} {}^0_{s}} \) mixing is small [1]. Consequently, the mass eigenstates are also \(C\!P\) eigenstates to better than per mille level and the effective lifetime measured in \(B_s^0 \rightarrow J/\psi \eta \) decays is equal, to good approximation, to \(\tau _\textrm{L} = 1/\Gamma _\textrm{L}\). Using the SM prediction for \(\Delta \Gamma _s\) from Ref. [1], and the value of \(\tau _{{{B} ^0_{s}}}/\tau _{{{B} ^0}}\) given in Ref. [4], together with the measured lifetime of the \({B} ^0\) meson [5] gives \(\tau _{L}\) =1.422 ± 0.013 ps. Measurements of \(\tau _\textrm{L}\) have been reported by the LHCb collaboration using the tree-level decay modes \({{B} ^0_{s}} \rightarrow {{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} \eta \) [6], \({{B} ^0_{s}} \rightarrow {{D} ^+_{s}} {{D} ^-_{s}} \) [7] and the loop dominated \({{B} ^0_{s}} \rightarrow K^+ K^-\) [8] where hadronic uncertainties are more important. These measurements use data collected during Run 1 of the LHC at centre-of-mass energies, \(\sqrt{s} = 7 \,\) and \(8 \,\)TeV, corresponding to an integrated luminosity of \(3\,\text {fb} ^{-1} \). The lifetime of the heavy mass eigenstate, \(\tau _\textrm{H}\), has been measured using the \({{B} ^0_{s}} \rightarrow {{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} f_0(980)\) mode [9,10,11]. Improving the precision of \(\tau _\textrm{L}\) and \(\tau _\textrm{H}\) enables more stringent tests of the consistency between direct measurements of the decay-width difference, \(\Delta \Gamma _s = \Gamma _\textrm{L} - \Gamma _\textrm{H}\), in \({{B} ^0_{s}} \rightarrow {{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} \phi \) decays and those inferred from effective lifetimes.

In this paper, \(\tau _\textrm{L}\) is measured in the \({{B} ^0_{s}} \rightarrow {{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} \eta \) decay mode using a data sample, corresponding to an integrated luminosity of \(5.7 \,\text {fb} ^{-1} \), collected in proton–proton (pp) collisions at \(\sqrt{s} = 13 \, \text {TeV}\) during Run 2 of the LHC (2015–2018). The analysis builds on the Run 1 study described in Ref. [6] which used around \(3 \,\text {fb} ^{-1} \) collected at \(\sqrt{s} = 7,8 \, \text {TeV}\). The \({{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} \) meson is reconstructed via the dimuon decay mode and the \(\eta \) meson through its decay to a pair of photons.

2 Detector and simulation

The LHCb detector [12, 13] is a single-arm forward spectrometer covering the pseudorapidity range \(2<\eta <5\), designed for the study of particles containing \(b \) or \(c \) quarks. It includes a high-precision tracking system consisting of a silicon-strip vertex detector (VELO) surrounding the pp interaction region, a large-area silicon-strip detector (TT) located upstream of a dipole magnet with a bending power of approximately \(4{\mathrm {\,Tm}}\), and three stations of silicon-strip detectors and straw drift tubes placed downstream of the magnet. The tracking system provides a measurement of the momentum, \(p\), of charged particles with a relative uncertainty that varies from 0.5 % at low momentum to 1.0 % at 200\(\,\text {Ge} \,\text {V}/\)c. Large samples of \({{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} \rightarrow {\mu ^+} {\mu ^-} \) and \(B^+ \rightarrow {{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} K^+\) decays, collected concurrently with the data set used here, were used to calibrate the momentum scale of the spectrometer using the procedure discussed in Ref. [14]. The relative accuracy of this procedure is determined using samples of other fully reconstructed b-hadrons and \(\Upsilon \) mesons, and is estimated to be \(3 \times 10^{-4}\). For \({b} \)-hadron decay modes such \(B^+ \rightarrow {{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} K^+\) the mass resolution agrees between data and simulation to better than \(10 \%\).

Various charged hadrons are distinguished using information from two ring-imaging Cherenkov detectors. In addition, photons, electrons, and hadrons are identified by a calorimeter system consisting of scintillating-pad and preshower detectors, an electromagnetic and a hadronic calorimeter. The calorimeter response is calibrated using samples of \(\pi ^0 \rightarrow \gamma \gamma \) decays [15]. For this analysis, a further calibration is made using an inclusive sample of \(\eta \rightarrow \gamma \gamma \) decays, which results in a precision of \(0.1\,\%\) on the neutral energy scale. The mass resolution on the \(\eta \) meson found in simulation (data) is \(22.9 {\,\text {Me} \,\text {V}/c^2} \) (\(23.9 {\,\text {Me} \,\text {V}/c^2} \)). Muons are identified by a system composed of alternating layers of iron and multiwire proportional chambers.

The online event selection is performed by a trigger, which consists of a hardware stage followed by a two-level software stage [16]. An alignment and calibration of the detector is performed in near real-time with the results used in the software trigger [17]. The same alignment and calibration information is propagated to the offline reconstruction, ensuring consistent information between the trigger and offline software. In this analysis, candidate events are required to pass the hardware trigger, which selects muon and dimuon candidates with high transverse momentum, \(p_{\textrm{T}} \), using information from the muon system. The first stage of the software trigger performs a partial event reconstruction and requires events to have two well-identified oppositely charged muons with an invariant mass larger than \(2.7 {\,\text {Ge} \,\text {V}/c^2} \). The second stage performs a full event reconstruction. Events are retained for further processing if they contain a displaced \({{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} \rightarrow \mu ^+ \mu ^-\) candidate. The decay vertex is required to be well separated from each reconstructed primary vertex (PV) of the proton–proton interaction by requiring the distance between the PV and the \({{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} \) decay vertex divided by its uncertainty (referred to as the decay-length significance or DLS) to be greater than three. This introduces a non-uniform efficiency for \({b} \)-hadron candidates that have a decay time less than \(\sim 0.4 \,\text {ps} \).

Simulated pp collisions are generated using Pythia [18, 19] with a specific LHCb configuration [20]. Decays of hadronic particles are described by EvtGen [21], in which final-state radiation is generated using Photos [22]. The interaction of the generated particles with the detector, and its response, are implemented using the Geant4 toolkit [23, 24] as described in Ref. [25]. Other sources of background, such as those from inclusive \(b \rightarrow \chi _c\) transitions, where the \({\chi _{c}} \) decays radiatively to a \({{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} \) meson, are studied using the RapidSim fast simulation package [26].

3 Selection

As in the LHCb Run 1 analysis of this mode [6], a two-step procedure is used to optimize the selection of \({{B} ^0_{s}} \rightarrow {{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} \eta \) decay candidates. These studies use simulated signal samples together with the high-mass sideband of the data (\(5650< m({{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} \eta ) < 5850 {\,\text {Me} \,\text {V}/c^2} \)), which is not used in the subsequent determination of \(\tau _\textrm{L}\). In the first step, loose selection criteria are applied that reduce background significantly whilst retaining high signal efficiency. Subsequently, a multivariate analysis (MVA) is used to reduce further the combinatorial background. This is optimised using pseudoexperiments to obtain the best precision on the measured value of \(\tau _\textrm{L}\). Compared to the Run 1 analysis, the modifications to both steps improve the signal efficiency and background rejection.

The selection starts with a pair of oppositely charged particles, identified as muons, that form a common decay vertex. To ensure a high efficiency the muon candidates are required to have a pseudorapidity between 2.0 and 4.6. The invariant mass of the dimuon candidate must be within \(\pm 50 {\,\text {Me} \,\text {V}/c^2} \) of the known \({{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} \) mass [5].

Photons are selected from well-identified neutral clusters reconstructed in the electromagnetic calorimeter [13] that have a transverse energy in excess of \(300 \,\text {Me} \,\text {V} \). Candidate \(\eta \rightarrow \gamma \gamma \) decays are selected from diphoton combinations with an invariant mass within \(70{\,\text {Me} \,\text {V}/c^2} \) of the known \(\eta \) mass [5] and with a transverse momentum larger than \(2.1 {\,\text {Ge} \,\text {V}/c} \).

The \({{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} \) and \(\eta \) candidates are combined to form candidate \({{B} ^0_{s}} \) mesons, which are required to have \(p_{\textrm{T}} \) larger than \(2.5 {\,\text {Ge} \,\text {V}/c} \). The \({{B} ^0_{s}} \) candidate is assigned to the PV with the smallest \(\chi ^2_{\text {IP}} \), where \(\chi ^2_{\text {IP}}\) is defined as the difference in the vertex-fit \(\chi ^2\) to a given PV reconstructed with and without the candidate being considered. Though lifetime biasing, a loose requirement of \(\chi ^2_{\text {IP}} <25\) is applied since it is effective in removing combinatorial background. In addition, to ensure the \({{B} ^0_{s}} \) candidate is matched to the correct PV, if there is another PV for which \(\chi ^2_{\text {IP}} <50\) in the event, the candidate is rejected. A kinematic fit is performed to improve the invariant mass resolution [27], where the intermediate resonance masses are constrained to their known values. The \(\chi ^2\) per degree of freedom of this fit is required to be less than five. The measured decay time of the candidate must be between \(0.4 \,\text {ps} \) and \(10 \,\text {ps} \). These requirements define a time range where the acceptance is reasonably uniform.

The second step of the selection process is based on a Multilayer Perceptron neural network [28], which is trained using the simulated signal sample and the high-mass sideband of the data for background. Fourteen variables, chosen as they give good separation between signal and background, are used as input to the neural network. It has been checked that these variables are well modelled by the simulation using the \({{B} ^0_{s}} \rightarrow {{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} \eta \) signal after the preselection and subtracting the background with the sPlot [29] technique and also with the \({{B} ^+} \rightarrow {{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} K^{+}\) control channel.

Fig. 1
figure 1

Distributions of (left) invariant mass and (right) decay time by year: (top row) 2015, (second row) 2016, (third row) 2017 and (bottom row) 2018

These are chosen as they give good separation between signal and background, and are known to be well modelled by the simulation. The variables include information on the candidate kinematics, particle identification, vertexing and track quality. An important change compared to the Run 1 analysis is the addition of information related to the isolation of the b-hadron candidate. This is found to be highly effective at suppressing combinatorial background and compensates for the higher multiplicity in the Run 2 environment.

The requirement on the MVA output was chosen to minimize the expected statistical uncertainty on the fitted value of \(\tau _{\text {L}}\) using pseudoexperiments. The chosen value removes over \(99 \, \%\) of background candidates whilst retaining over \(80 \, \%\) of simulated signal decays. After applying these requirements \(5 \, \%\) of events contain multiple candidates from which only one, chosen at random, is kept.

4 Fit model

Figure 1 shows the \({{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} \eta \) invariant mass and decay time distributions along with the fit projections for the four years of running (2015, 2016, 2017 and 2018). The value of \(\tau _\textrm{L}\) is determined from a two-dimensional unbinned maximum likelihood fit to the distributions of the \({B} ^0_{(s)}\) candidate invariant mass, m and decay time. To allow for variations in running conditions, the dataset is divided into the four years of running which are fitted simultaneously. The fit is performed for candidates with \( 5050<m({{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} \eta ) < 5650 {\,\text {Me} \,\text {V}/c^2} \) and \(0.4< t < 10 \,\text {ps} \). The fit model has five components: the \({{B} ^0_{s}} \rightarrow {{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} \eta \) signal, the \({{B} ^0} \rightarrow {{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} \eta \) decay, partially reconstructed \({{B} ^0_{s}} \rightarrow \chi _{c1,c2} \eta \) decays with the subsequent decay \(\chi _c \rightarrow {{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} \gamma \), partially reconstructed \({{B} ^0_{s}} \rightarrow {{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} \phi \) with \(\phi \rightarrow \eta \gamma \) and combinatorial background.

To account for the detector resolution, the decay time distribution of each component is convolved with a Gaussian function with a width of \(52 \,\text {fs}\) determined using the simulation. The variation of the detector efficiency as a function of decay time for each year of running is accounted for by an acceptance function, \(A_{\text {tot}}\). This is expressed as the product of four components. Firstly, the reconstruction efficiency of the vertex detector is known to decrease as the distance of closest approach of the decay products to the pp beam-line increases [6, 30]. This effect is parameterised with a second order polynomial. Improvements in the track reconstruction used in Run 2 led to a reduction for this effect by a factor of three compared to the Run 1 analysis [16]. It is further reduced by the choice of the pseudorapidity range for the muons and cross-checked with the \({{B} ^+} \rightarrow {{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} K^+\) decay mode. Secondly, the requirement on the DLS applied in the trigger leads to inefficiency at small proper times. Its effect is minimised by the requirement of \(t > 0.4 \,\text {ps} \). The residual effect of this requirement, \(A_\textrm{DLS}\), is modelled using the simulation and calibrated in data using a large sample of \({{B} ^+} \rightarrow {{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} K^+\) decays. Furthermore, the requirement on the \({{B} ^0_{s}} \) candidate \(\chi ^2_{\text {IP}} \) leads to an efficiency, \(A_{\text {IP} \chi ^2}\), that decreases linearly at higher time. This component is parameterised using simulated signal decays. Finally, the inclusion of the \({{B} ^0_{s}} \) candidate \(\chi ^2_{\text {IP}} \) in the MVA leads to a linear acceptance correction (\(A_\textrm{MVA}\)) which is determined using the simulation.

Figure 2 shows the overall acceptance curve, \(A_\textrm{tot} = A_{\text {VELO}}\cdot A_{\text {DLS}} \cdot A_{\text {IP} \chi ^2} \cdot A_{\text {MVA}} \) obtained for the 2016 dataset. If the data were not corrected for the acceptance, then the bias on \(\tau _\textrm{L}\), evaluated by fitting the simulated data with and without the correction, would be \(18 \,\text {fs}\). Different, but qualitatively similar, decay time acceptance curves are evaluated for each year and used in the decay time fit.

The invariant mass distribution for the \({{B} ^0_{s}} \rightarrow {{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} \eta \) signal is parameterised by a Double Sided Crystal Ball (DSCB) function. This is a generalization of the Crystal Ball function with power law tails on both sides of the mass peak [31]. Alternative parametrisations including the Bukin function [32] and a skewed Student’s t-distribution [33, 34] are considered as systematic variations. In the fit to the data, the tail parameters of the DSCB distribution are fixed to the values obtained from simulation, while the mean and width parameter are freely varied in the fit. The decay time distribution for the signal component is modelled with an exponential function convolved with the detector resolution and multiplied by the detector acceptance, as discussed above.

The second component in the fit accounts for the \({{B} ^0} \rightarrow {{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} \eta \) decay. Since the invariant mass resolution is approximately \(36 {\,\text {Me} \,\text {V}/c^2} \), this component partially overlaps with the \({{B} ^0_{s}} \) signal mode. The same DSCB mass shape as for the \({{B} ^0_{s}} \) signal decay is used to model this contribution, with a constraint on the mass resolution between the modes, as obtained from simulation. In the fit, the difference between the positions of the \({{B} ^0_{s}} \) and \({{B} ^0} \) mass peaks is Gaussian-constrained to the known value \(M({{B} ^0_{s}}) - M({{B} ^0}) = 87.22 \pm 0.16 {\,\text {Me} \,\text {V}/c^2} \) [5]. The decay time of the \({{B} ^0} \) component is modelled with an exponential convolved with the same acceptance function and detector resolution as for the \({{B} ^0_{s}} \) component. The lifetime is Gaussian-constrained to the known value, \(\tau ({{B} ^0}) = 1.519 \pm 0.005 \,\text {ps} \) [5]. Finally, the yield of this component is parameterised as the product of the \({{B} ^0_{s}} \) yield for each year multiplied by a common factor between the years, \(f_r\), which is left free. The shape of the combinatorial background in mass and time is studied using the high-mass sideband and mixing samples of \({{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} \) and \(\eta \) mesons selected in data. Based upon these studies this component is modelled by a second-order Chebyshev polynomial in mass and the sum of two exponentials in decay time.

The RapidSim [26] package is used to study backgrounds from partially reconstructed b-hadron decays and two sources are identified. The first is the decay \({{B} ^0_{s}} \rightarrow {{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} \phi \) with the subsequent decay \(\phi \rightarrow \eta \gamma \). Due to the missing photon, and the mass difference between the \(\phi \) and \(\eta \) mesons, this background has a maximum invariant mass of \(5200 \, {\,\text {Me} \,\text {V}/c^2} \). The shape of this component in mass is modelled with a bifurcated Gaussian function. Its lifetime parameter is Gaussian-constrained to the measured value of the lifetime in the \({{B} ^0_{s}} \rightarrow {{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} \phi \) channel, \(\tau = 1.480 \pm 0.007 \,\text {ps} \) [35] multiplied by a factor of 0.99, determined from the simulation, that accounts for the missing photon energy. The relative yield of this component to the signal is Gaussian-constrained to be \(f_\phi = (0.6 \pm 0.1) \%\) using the known branching fractions [5] and the relative efficiency from the simulation. The second source of partially reconstructed background is due to \({{B} ^0_{s}} \rightarrow \chi _{c1,c2} \eta \) decays. The branching fractions for these decay modes are not measured but the rate for the decay to the \({\chi _{{c} 1}} \) meson is most likely higher than that of the \({\chi _{{c} 2}} \) decay mode from spin suppression arguments. Consequently, the yield of the \({\chi _{{c} 1}} \) component, \(N_{{\chi _{{c} 1}}}\), is left to freely vary in the fit. The yield of the \({\chi _{{c} 2}} \) component is Gaussian-constrained to be \(f_{{\chi _{{c} 2}}} N_{{\chi _{{c} 1}}}\) where \(f_{{\chi _{{c} 2}}} = 0.1 \pm 0.1\) based on the values seen for other decay modes [5]. The shape of these components in mass is modelled by an error function, based on simulation. The unknown lifetime parameter of this component is left free in the fit. The validity of these assumptions is tested as part of the study of systematic uncertainties.

Fig. 2
figure 2

Total acceptance function, \(A_{\text {tot}}\) for the 2016 data taking period. The insert shows the four individual components of the acceptance that are multiplied to give \(A_{\text {tot}}\). The acceptance functions for the other years are similar

In total the simultaneous fit to the four years of running has 40 free parameters. The correctness of the fit procedure is validated using the full simulation and pseudoexperiments. No significant bias compared to the uncertainty of the measurement is found, and the uncertainties estimated by the fit are found to be accurate.

5 Results and systematic uncertainties

The fit gives a measured lifetime of \(\tau _\textrm{L} = 1.445 \pm 0.016\,\text {ps} \). The values of other fitted parameters, such as the mass resolution are in good agreement with the expectation from the simulation. As discussed in Sect. 4, uncertainties on physics parameters are propagated to the statistical uncertainty via Gaussian constraints. The dominant uncertainty arises from the size of the simulation sample used to model the acceptance function. The uncertainty due to each of the individual acceptance corrections is estimated by resampling the covariance matrix of the determined PDF and by changing the functional form used for each component. The check of acceptance shape due to the VELO tracking efficiency using the \({{B} ^+} \rightarrow {{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} K^+\) data is limited by the knowledge of the \({{B} ^+} \) lifetime giving a further \(4 \,\text {fs}\) uncertainty.

Possible biases from misalignment of the VELO detector halves are evaluated by dividing the data into three categories according to which side of the VELO the tracks traverse and re-running the fit. This gives a systematic uncertainty of \(3.8 \,\text {fs}\). In the fit to the decay time the correlation between the time resolution and its uncertainty is ignored. The possible bias from this assumption is evaluated to be less than \(0.3 \,\text {fs}\). Varying the time resolution, in the range \(40 - 60 \,\text {fs}\), does not change the result and no uncertainty is assigned.

Uncertainties arising from the modelling of the signal and background mass distributions are evaluated using the discrete profiling method described in Ref. [36]. In the default fit the yield of the \({{B} ^0} \rightarrow {{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} \eta \) component is left free. Constraining it using the knowledge of the branching fractions in Ref. [5] and the fragmentation fractions given in Ref. [37] changes \(\tau _\textrm{L}\) by \(0.4 \,\text {fs}\) which is assigned as a systematic.

Further small uncertainties arise from the limited knowledge of the length scale of the detector along the beam axis (z-scale), the momentum scale for charged particles and the neutral particle energy scale. An additional \(1 \,\text {fs}\) systematic uncertainty arises from the effect of \({{B} ^0_{s}} \) mesons produced via \({{B} _{c} ^+} \) decays [38].

The stability of the result is tested by dividing the data according to the year of running (Fig. 3, left), the \(p_{\textrm{T}} \) of the \({{B} ^0_{s}} \) candidate (Fig. 3, right) and by varying the requirements on the MVA, the \(\chi ^2_{\text {IP}} \) and the value of the minimum decay time requirement. No significant change in the final result is found and hence no further systematic uncertainty is assigned.

Fig. 3
figure 3

Values of \(\tau _\textrm{L}\) obtained by dividing the data according to (left) the four years of data-taking and (right) regions in \({B} ^0_{s} \) \(p_{\textrm{T}}\). In both plots the statistical uncertainty for each point is shown. The \(1 \sigma \) error bands of the statistical uncertainty on the combined results are shown in yellow. The \(\chi ^2\) probability of the measurements is \(51 \, \%\)(\(49 \, \%\)) for fits to the data divided by year (\({B} ^0_{s} \) \(p_{\textrm{T}}\))

The systematic uncertainties are summarized in Table 1. Adding them in quadrature leads to a total systematic uncertainty of \(8.0 \,\text {fs}\).

6 Summary

Using the dataset collected by LHCb during Run 2, the effective lifetime in the \(B^0_s \rightarrow {{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} \eta \) decay mode is measured to be

$$\begin{aligned} \tau _\textrm{L} = 1.445 \pm 0.016~\text {(stat)} \pm 0.008 ~\text {(syst)} \,\text {ps}. \end{aligned}$$

This result agrees with the Run 1 measurement [6] within \(1 \sigma \)

$$\begin{aligned} \tau _\textrm{L} = 1.479 \pm 0.034~\text {(stat)} \pm 0.011 ~\text {(syst)} \,\text {ps}, \end{aligned}$$

and is a factor of two more precise. The two values are combined assuming the uncertainties due to the momentum and length scales, \({{B} ^0} \) background, partially reconstructed background, mass model, time resolution and possible \({{B} _{c} ^+} \) component are fully correlated. The remaining systematic uncertainties, dominated by the simulated samples sizes, are taken to be uncorrelated. The combination gives

$$\begin{aligned} \tau _\textrm{L} = 1.452 \pm 0.014 \pm 0.007 \pm 0.002 \,\text {ps}, \end{aligned}$$

where the first uncertainty is statistical, the second is the uncorrelated systematic uncertainty and the third is the correlated systematic uncertainty. This combination supersedes the previous result in the \({{{B} ^0_{s}} \rightarrow {{J \hspace{-1.66656pt}/\hspace{-1.111pt}\psi }} \eta }\) mode. It is in agreement at 2\(\sigma \), but more precise than, the measurement made using the \({{B} ^0_{s}} \rightarrow {{D} ^+_{s}} {{D} ^-_{s}} \) decay mode \(\tau _\textrm{L} = 1.379 \pm 0.026 \,\text {(stat)} \pm 0.017 \,\text {(syst)} \,\text {ps} \) [7]. A weighted average of the tree-level measurements gives \(\tau _{\text {L}} = 1.437 \pm 0.014 \,\text {ps} \). This agrees at the 1\(\sigma \) level with the Standard Model expectation of \(\tau _\textrm{L} = 1.422 \pm 0.013 \,\text {ps} \) [1, 4]. The result also agrees at the 2\(\sigma \) level with the value quoted by HFLAV [35] based upon measurements of \(\Gamma _s\) and \(\Delta \Gamma _s\) in the \(B^0_s \rightarrow J/\psi \phi \) decay mode, \(\tau _\textrm{L} = 1.426 \pm 0.008 \,\text {ps} \). Finally, it is also consistent with the value of \(1.407 \pm 0.016 \,\text {(stat)} \pm 0.007 \,\text {(syst)} \,\text {ps} \) found in the penguin-dominated decay \({{B} ^0_{s}} \rightarrow K^+ K^-\) [8]. Figure 4 summarizes the measurements of \(\tau _\textrm{L}\) in all these modes.

Table 1 Systematic uncertainties on the lifetime measurement in \(\,\text {fs}\). Uncertainties less than \(0.1\,\text {fs}\) are indicated with a dash

Further improvements in precision in \(\tau _\textrm{L}\) are expected both by considering other \(C\!P\)-even \({{B} ^0_{s}} \) decays to final states containing \(\eta \) or \(\eta '\) mesons, the \({{B} ^0_{s}} \rightarrow {{D} ^+_{s}} {{D} ^-_{s}} \) dataset collected during Run 2 and from the larger dataset that will be collected by the upgraded LHCb detector.

Fig. 4
figure 4

Summary of measurements of \(\tau _\textrm{L}\) from LHCb [7, 8] along with the HFLAV average [35] determined using the measurements of \(\Gamma _s\) and \(\Delta \Gamma _s\) made using the \({{B} ^0_{s}} \rightarrow J/\psi \phi \) decay mode. The SM prediction, calculated using the values in Refs. [1, 4] is shown by the grey-green band