1 Introduction

To explain the matter–antimatter asymmetry in the currently observed Universe, C and CP symmetry violation is one of the three necessary conditions listed by Sakharov [1]. In the electroweak baryogenesis models, the CP violation phase from the CKM matrix is too small to accommodate the observed baryon–antibaryon imbalance. Hence the standard model has been extended to introduce extra new CP violating sources, such as in the 2HDM, left–right symmetric and SUSY models.

On the other hand, the recently discovered Higgs [2,3,4] has opened a new window toward the new physics. The precision measurement of the Higgs properties will be one of the most important targets of the next generation collider including high-luminosity upgrade of LHC [5] and \(e^+e^-\) colliders [6,7,8,9]. Besides the determination of the signal strengths of the Higgs for different channels [10,11,12], the large data that will be accumulated in future colliders would allow us to measure the CP structure of some certain interaction with extraordinary precision.

Using 8 TeV LHC data, ATLAS and CMS have already measured the CP property of the Higgs coupling with the Z boson using the four-lepton decay channel (\(h\rightarrow ZZ\rightarrow (\ell \ell )(\ell '\ell ')\)). A pure CP odd assumption is excluded at 99% confidence level [13,14,15]. On the other hand, a CP violating Higgs sector can also be searched in the fermionic decays of Higgs, using an effective Lagrangian of the following form:

$$\begin{aligned} \mathcal {L} = -\frac{y_\tau }{\sqrt{2}} ( \cos \phi \bar{\tau }\tau + i \sin \phi \bar{\tau }\gamma _5\tau ) h , \end{aligned}$$
(1)

where \(\phi \) is the mixing angle between the CP even and odd terms. The advantage of this decay is that, unlike the bosonic decays of Higgs where the pseudoscalar interaction may arise from a dimension-6 operator such as \(h Z_{\mu \nu }\tilde{Z}^{\mu \nu }\), the pseudoscalar term in Eq. (1) is dimension-4 and the coefficient can be large, so that a large CP violating effect is possible when the Higgs is a CP mixture. Although difficult to reconstruct, the tau decays are complex enough to yield information of the tau spin correlation [16, 17]. Hence \(h\rightarrow \tau ^+\tau ^-\) is a golden channel for the measurement of the CP nature of the Higgs which has led to an enormous amount of studies on the prospects of the measurement of the CP mixing angle(\(\phi \)) at LHC using either the ggF/ZH or the VBF production mode [18,19,20,21,22,23,24,25,26,27] as well as at \(e^+e^-\) colliders [28,29,30].

The tau decay will always produce at least one neutrino which can hardly be reconstructed at hadron colliders.Footnote 1 Hence, the proposed methods at LHC utilize either only the visible products (such as the four pions in the \(\rho \rho \) mode) or the impact parameter of the tau [23], all of these can be reconstructed without knowing the full information of the neutrino. However, the resolution of the hadron collider will significantly affect the reconstruction of the CP sensitive observable [26]. Furthermore, besides the clean background, the lepton collider can also provide the possibility to fully reconstruct the neutrinos. Thus the lepton collider will be a better facility for the study of the CP properties of the Higgs.

In this paper, we propose a new method that can be used to reconstruct the momentum of the neutrinos from tau decay at future \(e^+e^-\) machine. In this method we utilize all the information including mass peaks of all involving particles as well as the impact parameters of each tau decay which will be measured with higher precision at lepton collider than at hadron collider. We find that using the new proposed method we can precisely determine the momentum of the missing neutrinos. With these fully reconstructed neutrinos as well as the visible decay products we construct a CP sensitive observable based on event by event matrix element to test the CP structure of the coupling between Higgs and \(\tau \) leptons. After performing a detailed Monte Carlo (MC) simulation of the signal and corresponding backgrounds at \(e^+e^-\) collider, we have checked the capability of the matrix element-based observable and also compared with the usually used CP observable defined as the angle between some certain planes.

The rest of the paper is organized as follows. In Sect. 2 we present in detail the method that we used to reconstruct the neutrino. We first refine the Higgs momentum from recoiling Z boson, and then determine the momentum of neutrinos by a global fitting utilizing impact parameter and on-shell conditions. A detailed MC simulation study of this method is given in Sect. 3. We use a matrix element-based CP observable to give the prospect of the measurement of the CP mixing angle. A comparison with the previously used method is also given in Sect. 3. We summarize our discussion in Sect. 4.

2 The reconstruction of neutrinos at \(e^+e^-\) collider

The leading Higgs production channel at \(e^+e^-\) collider is

$$\begin{aligned} e^+ e^- \rightarrow Z h. \end{aligned}$$
(2)

In order to study the CP property of the \(h\tau \tau \) coupling, after the Higgs decays into \(\tau \) pair [including the CP violation effect as stated in Eq. (1)], and the Z boson decays either leptonically(\(Z\rightarrow \ell \ell \), \(\ell =e/\mu \)) or hadronically (\(Z \rightarrow jj\)), the following one-prong decay channels for \(\tau \)-leptons will be used:

$$\begin{aligned} \tau ^\mp\rightarrow & {} \nu _\tau \ \ell ^\mp \ \nu _\ell , \nonumber \\ \tau ^\mp\rightarrow & {} \nu _\tau \ \pi ^\mp , \nonumber \\ \tau ^\mp\rightarrow & {} \nu _\tau \ \rho ^\mp \rightarrow \nu _\tau \ \pi ^\mp \ \pi ^0. \end{aligned}$$
(3)

For clarity, before introducing the reconstructed method, we will first introduce details of the simulation and basic reconstruction for signal and background processes.

Table 1 The combined efficiencies of selecting the leptons, taus and jets for the different Z and di-tau decay modes (neutrinos are omitted in the notation)

The charged particle in each process leaves a track inside the detector which is used to construct the impact parameters. The \(\tau \) mass will be used in the reconstruction of the neutrino, as well as the \(\rho \) mass in the \(\rho \) decay channel. Further, the mass of the Higgs will also be used to provide more information. Note that one can construct the Higgs peak from either the momentum of \(\tau \) pair or the recoil mass constructed by the momentum of the Z boson, and how well the Higgs momentum can be reconstructed will significantly affect the further determination of the neutrino momentum. Thus the Higgs momentum reconstruction from the recoil system will also be investigated.

2.1 Signal and background simulation and reconstruction

The signal and background events are generated with MadGraph5 [31] and passed to Pythia8 [32] for the resonance decay (CP mixing Higgs, tau and rho mesons) and parton shower. The spin correlation between two taus is retained during the whole procedure. The signal process as demonstrated above is \(e^+e^-\rightarrow Zh\) (\(\sigma =212\) fb), at a center of mass energy \(E_{\text {CM}}=250\) GeV, with \(h\rightarrow \tau \tau \) and \(Z\rightarrow \ell \ell \) or \(Z\rightarrow jj\)(2 jets). The dominant backgrounds for this signal are the following:

  • \(e^+e^-\rightarrow ZZ,\ Z\rightarrow \tau \tau ,\ Z\rightarrow \ell \ell (jj)\);

  • \(e^+e^-\rightarrow Zh,\ Z\rightarrow \tau \tau ,\ h\rightarrow bb\);

  • \(e^+e^-\rightarrow Zh,\ Z\rightarrow \tau \tau ,\ h\rightarrow l\nu l\nu (l=e/\mu /\tau )\).

The events are afterwards passed through the DELPHES [33] simulation using the ILD card based on the TDR of ILC [7], in which the tracking efficiency for charged tracks is \(99\%\) with a fiducial pseudorapidity range up to \(|\eta |=2.4\); the identification efficiency for electrons and muons is \(95\%\), and the calorimeter towers are simulated with a particle flow algorithm with a coverage up to \(|\eta |=3.0\). The track momentum is smeared with a resolution of \(\sqrt{0.01^2 + (10^{-4}p_T)^2}\) (\(p_T\) in GeV) in a magnetic field of 3.5 T, and a resolution of 0.001 in \(\eta \) and \(\phi \) (azimuthal angle) for its direction. The calorimeter energy is smeared with a resolution of \(\sqrt{A^2E^2 + B^2E}\), where the coefficients for the constant and stochastic terms are \(A=1.0\%\) (\(1.5\%\)) and \(B=15\%\) (\(50\%\)), respectively, for the EM (hadronic) calorimeter. To reject non-prompt leptons from jet fragmentation and heavy flavor meson decay, a lepton isolation cut of \(\sum p_T^{\text {PF}}/p_T^{\ell }<0.7\) is applied, where \(\sum p_T^{\text {PF}}\) is the sum over particle flow objects (tracks and calorimeter clusters) around the lepton with \(p_T>0.5\) GeV and \(\varDelta R=\sqrt{\varDelta \eta ^2+\varDelta \phi ^2}<0.4\) with respect to the lepton. This efficiency of this cut ranges from \(91\%\) in the low \(p_T^{\ell }\) (5 GeV \(<p_T^{\ell }<10\) GeV) to \(99\%\) in the high \(p_T^{\ell }\) (\(>30\) GeV) region, and it mostly affects the soft leptons from tau decays.

To have the best impact parameter resolutions which will be used to reconstruct the neutrinos, a minimum \(p_T\) of 5 GeV is applied on the leptons and charged pions from taus. The resolution of the impact parameter in the transverse plane (in the z-axis) is set to \(\sigma _d=5\) \(\upmu \)m (\(\sigma _z=10\) \(\upmu \)m) as in Eq. (7), which can be achieved in the next generation \(e^+e^-\) colliders [7, 8].

The hadronic taus (\(\tau _h\)) are clustered by the Anti-\(k_{t}\) jet algorithm [34] with a cone parameter of 0.4. A simple flat \(\tau \)-tagging efficiency of \(60\%\) (\(0.5\%\)) is applied on the real (fake) taus, taking into account the current \(\tau \)-tagging performance at the LHC experiments [35, 36]. With this fake tau rate, the \(W^+W^-\rightarrow 4j\) background with two jets faking two taus can be neglected. When the Z boson decays hadronically, the particle flow objects associated with the selected electrons, muons and hadronic taus are firstly removed from the event, and then a coneless exclusive (\(k_t\)) algorithm [37] is applied on the remaining objects to cluster up to two jets, which are assumed to originate from \(Z\rightarrow jj\). The four-momentum of Z can be best reconstructed in this way, which further helps to calculate the Higgs mass recoiling against Z with the given CM energy. The efficiency of finding two jets with \(p_T>5\) GeV and \(|\eta |<3\) is about \(95\%\).

Fig. 1
figure 1

The resolutions of the Higgs recoil momentum in x and y (a) and in z (b) in the \(Z\rightarrow jj\) channels, and in z in the \(Z\rightarrow \ell \ell \) channels (c). The red dashed (blue solid) histogram corresponds to the calculation based on the original uncorrected momenta of jets or leptons from Z (improved momenta of jets from Z and the consideration of Eq. (4))

The net efficiencies after the above object selections in different di-tau decay modes and the requirement that leptons from the h/Z decay should have opposite charge are listed in Table 1. To measure the CP mixing angle, it is essential to identify the different tau decay modes efficiently. The development of tau substructure algorithms in the ATLAS and CMS experiments has recently made this possible based on the particle flow method [38, 39]. The neutral pion energy can also be measured with a precision of \(15\%\) with the substructure. In this work, we assume that different tau decay channels can be efficiently identified in the future \(e^+e^-\) colliders, with no crosstalk for simplicity, and a precision of \(10\%\) on the neutral pion energy measurement can be achieved.

Fig. 2
figure 2

Plots demonstrating the tau flight vector, the track trajectory and the point of closest approach for three cases: a the collision point O is inside the track curvature, b O is outside the track curvature with two intersection points, c O is outside the track curvature with no intersection

2.2 Refined Higgs four-momentum reconstruction

In general, the reconstructed Higgs peak will always have a long tail as we ignored the initial state radiation (ISR) photons. To best reconstruct the Higgs four-momentum from the recoiling Z boson, taking into account the known Higgs mass at 125 GeV, and assuming that the ISR photons are mostly collinear with the beam, a quantity x can be solved which represents the fraction of momentum carried away by the ISR photon from the positron beam traveling in the positive z-direction,

$$\begin{aligned} x = \frac{E_{\text {CM}}^2-2E_{\text {CM}}E+m^2-m_h^2}{\pm E_{\text {CM}}^2 \mp E_{\text {CM}}E+E_{\text {CM}}p_z}, \end{aligned}$$
(4)

where E, m and \(p_z\) are the energy, mass and z-component of the momentum of the recoiling Z boson. The value of x can also be negative, which means that the ISR photon carries momentum from the electron beam along the negative z-axis.Footnote 2 With the x solved, the four-momentum of the Zh system can be expressed as \(p_{\text {tot}} = (E, p_x, p_y, p_z) = (250-125|x|, 0, 0, -125x)\) GeV, from which the Higgs recoil four-momentum can be calculated as \(p_h^{\text {RC}} = p_{\text {tot}} - p_Z\). Because the tau mass is much smaller than the Higgs mass, the tau from Higgs decay is highly boosted and the neutrinos from tau decay are almost collinear with its visible decay products. With this assumption, the Higgs four-momentum can also be expressed as \(p_h\) = \(p_{\text {vis1}}/x_1\) + \(p_{\text {vis2}}/x_2\), where \(p_{\text {vis1},2}\) are the four-momenta of visible decay products of the two taus, and \(x_{1,2}\) are the fractions of momentum carried by the visible products. With these quantities defined, a \(\chi ^2\) can be minimized per event to get the best momentum resolution:

$$\begin{aligned} \begin{array}{ll} \chi ^2 = &{} \mathop {\sum }\nolimits _{i=0}^{3}\left( \frac{p_{h,i}-p_{h,i}^{\text {RC}}}{0.5}\right) ^2 + \left( \frac{m_Z-91.2}{2.5}\right) ^2\\ &{} + \left( \frac{f_{j1}-1}{0.06}\right) ^2 + \left( \frac{f_{j2}-1}{0.06}\right) ^2, \end{array} \end{aligned}$$
(5)

where \(m_Z\) is the Z boson mass constructed from the decay products of the Z boson (two leptons or two jets), \(f_{j1,2}\) are the jet energy correction factors on the two jets from \(Z\rightarrow jj\) decay, and variables with energy dimensionality are in GeV. The jet energy resolution used here is about 6% and this is a conservative choice and can be further improved in future electron colliders, such as ILC [6, 7], CEPC [8] and CLIC [9] (4%). By minimizing the \(\chi ^2\) in Eq. (5) with \(x_{1,2}\) and \(f_{j1,2}\) freely floating, not only the two-fold ambiguity of x in Eq. (4) is resolved (the solution with a smaller \(\chi ^2\) is chosen), but also the jet resolution, and hence the resolutions of \(p_h^{\text {RC}}\) are improved. The improvement is shown in Fig. 1. The denominator values in Eq. (5) are chosen such that the best resolutions as in Fig. 1 are achieved. In the \(Z\rightarrow \ell \ell \) channels, the last two terms in Eq. (5) are not present since the leptons’ momenta are precisely measured when compared to the jets. Equation (5) fulfills two purposes in this case. One is to resolve the x ambiguity, and the other is to resolve the two-fold ambiguity of the \(e^+e^-e^+\tau _h\) and \(\mu ^+\mu ^-\mu ^+\tau _h\) final states. There are two ways to assign the two same-sign leptons to the Z and h decays. With the current setup, the fraction of wrong assignments is negligible.

2.3 Impact parameter

Ideally, except for the lepton channels, using the missing momentum, the Higgs four-momentum and the tau mass, one already has enough constraints to reconstruct the two neutrinos. However, in practice, the reconstructed momentum of the neutrino still has large uncertainty. Thus the usage of the further information from the impact parameters [40, 41] will be absolutely helpful in the reconstruction of the neutrinos especially in lepton channel. In order to make use of the impact parameters, a \(\chi _{\text {IP}}^2\) is reconstructed for each tau which will be minimized in further global fittings; this will be explained in the next section.

The method to calculate \(\chi _{\text {IP}}^2\) for one tau is as follows: given the transverse impact parameter d and the direction of tau momentum in the transverse plane, the intersection point between the tau flight direction and the track trajectory in the transverse plan can be found. The transverse impact parameter d is defined in the transverse plane as the minimum distance from the interaction point to the charged track. As demonstrated in Fig. 2, the tau is produced at the collision point O, flight length from O to the point of decay P in the transverse plane is L, the point of closest approach to O with a backward extrapolation of the trajectory is D, and the extrapolated arc length in transverse plane is S. There is just one intersection point when O is inside the circle of the trajectory (Fig. 2a), but when O is outside, the number of intersection points can be 2, 1 or 0 (Fig. 2b, c). The distance between O and D is the impact parameter \(d_0\). The impact parameter in the z-axis can be calculated from P:

$$\begin{aligned} z_0 = L \sinh \eta _\tau - S \sinh \eta _{\text {track}}. \end{aligned}$$
(6)

The impact parameters will get updated values (\(d_0^{\text {fit}}\) and \(z_0^{\text {fit}}\)) when \(\chi _{\text {IP}}^2\) is minimized. When just one intersection point exists, the relevant contribution reads

$$\begin{aligned} \chi ^2 = \left( \frac{d_0^{\text {fit}}-d_0}{\sigma _d}\right) ^2 + \left( \frac{z_0^{\text {fit}}-z_0}{\sigma _z}\right) ^2. \end{aligned}$$
(7)

When two intersection points exist, the \(\chi ^2\) for each possibility is calculated according to Eq. (7) and the smaller one is taken. When no intersection point exists, as demonstrated by the dashed arrow in Fig. 2c, it is assumed to be due to the uncertainty in \(d_0\) measurement, and the best-fit \(d_0^{\text {fit}}\) will be around the distance between \(O'\) and D (denoted \(d_0^{\text {C}}\)). The \(O'\) is determined by translating the dashed vector to become tangential to the trajectory (the blue vector), and \(z_0^{\text {fit}}\) is calculated from the tangential point P. The relevant contribution then reads

$$\begin{aligned} \chi ^2 = \left( \frac{d_0^{\text {fit}}+d_0-2d_0^{\text {C}}}{\sigma _d}\right) ^2 + \left( \frac{z_0^{\text {fit}}-z_0}{\sigma _z}\right) ^2. \end{aligned}$$
(8)

When the dashed vector in Fig. 2c points away from the track curvature, no tangential point can be found, and \(O'\) coincides with D. Although the tracks in Fig. 2 all travel anti-clockwise, but depending on the particle charge, they may also travel clockwise, i.e., from P to D. Situations with tracks traveling clockwise can be obtained by taking a mirror image of the plots in Fig. 2, with essentially no change to Eqs. (7) and (8). The cases where the angle between the tau flight direction and the track trajectory is an obtuse angle are also considered in our method. However, since the tau is boosted and its decay products are highly collinear, this rarely happens.

Fig. 3
figure 3

The \(\varDelta R\) between the reconstructed and true neutrino momenta (a, b), and the ratio of them (c, d), for the \(\tau \rightarrow \pi \nu \) (a, c) and \(\tau \rightarrow \rho \nu \) (b, d) decays in the \(\pi +\rho \) channel

2.4 Reconstruction of the neutrino momenta

In order to best estimate the neutrino momenta, with the benifit of accurate particle momentum reconstruction and clean background from a \(e^+e^-\) collider, the direction (in \(\eta \), \(\phi \)) and the magnitude of each of the two neutrinos from tau decays are scanned globally for each event. The available information, such as the tau mass, the Higgs four-momentum from Z recoil, and the impact parameters of the charged tracks from tau decays, is all used to achieved this goal. With the four-momenta of all final state particles reconstructed, a matrix element-based method is used to fully extract the information of CP, which can result in an improved sensitivity with respect to the usually used method [23].

The global \(\chi ^2\) of the likelihood that is minimized for every event is expressed as

$$\begin{aligned} \chi ^2= & {} \mathop {\sum }\nolimits _{i=0}^{3}\left( \frac{p_{h,i}-p_{h,i}^{\text {RC}}}{\sigma _{\text {RC}}}\right) ^2 + \left( \frac{m_{\tau 1}-1.777}{\sigma _{\tau 1}}\right) ^2\nonumber \\&+ \left( \frac{m_{\tau 2}-1.777}{\sigma _{\tau 2}}\right) ^2 + \chi _{\text {IP}}^2, \end{aligned}$$
(9)

where \(p_h = p_{\text {vis1}}+p_{\text {mis1}}+p_{\text {vis2}}+p_{\text {mis2}}\) is calculated from the four-momenta of visible and invisible decay products of the two taus (without any collinear assumptions), \(p_{h}^{\text {RC}}\) is obtained by minimizing Eq. (5) in the previous step, \(m_{\tau 1,2}\) are the masses of the taus, and \(\chi _{\text {IP}}^2\) is the term accounting for the contribution from the impact parameters of the tracks that can help the fit to find the correct neutrino directions as stated in Sect. 2.3. For \(Z\rightarrow \ell \ell \) (jj) channels, \(\sigma _{RC}=0.5\) (4.0) GeV and \(\sigma _\tau =0.1\) (0.2) GeV. The resolution parameters are set so as to achieve the best reconstructed neutrino momenta (magnitude and direction close to the true values). For the states with an intermediate \(\rho \) meson, extra terms of \((m_\rho - 0.775)^2/\sigma _\rho ^2 + (f_\rho - 1)^2/0.10^2\) are added to Eq. (9), where \(f_\rho \) is the factor multiplied to the \(\rho \) meson’s energy for a better resolution. Correspondingly, \(\sigma _\tau \) is doubled for this case, and \(\sigma _\rho =0.15\) (0.30) for \(Z\rightarrow \ell \ell \) (jj).

In the per-event \(\chi ^2\) minimization, the (\(\eta \), \(\phi \)) of one neutrino is firstly scanned, from which the magnitudes of the neutrinos’ momenta and the direction of the other neutrino can be obtained from the tau mass and recoil constraints in Eq. (9). Conversely, the scan is repeated from the (\(\eta \), \(\phi \)) of the other neutrino. Finally, a fit using MINUIT [42] is performed around the minimal point found by the scans for a better estimation. The \(\varDelta R\) and momentum ratios between the reconstructed and true neutrinos in the \(\pi +\rho \) channel are shown in Fig. 3.

In the channels with a lepton (\(e/\mu \)) from tau, the momenta of the two neutrinos from the same tau cannot be fully reconstructed. Instead, the combined four-momenta of them is reconstructed, with the di-neutrino mass as an extra degree of freedom (the relative angles between the two neutrinos are “integrated” out). Although there are eight constraints in Eq. (9), it is still difficult to achieve. In this case, another extra term, \(-2 \ln \mathcal {P}(\varDelta R, m_{\text {mis}})\), is added. It is the joint probability distribution of \(\varDelta R\) between the lepton and di-neutrino momenta, and \(m_{\text {mis}}\) the di-neutrino invariant mass. This function is obtained depending on the tau momentum from the Monte Carlo simulation, which has been used in the di-tau MMC mass by ATLAS [43]. The quality of the neutrino momentum reconstruction in the \(\ell +\pi /\rho \) and \(Z\rightarrow \ell \ell \) channels are shown in Fig. 4. Due to \(m_{\text {mis}}\) and worse Higgs recoil momentum resolutions, the \(\ell +\pi /\rho \) channels with \(Z\rightarrow jj\) have limited reconstruction precision, and are thus excluded from the CP sensitivity test.

Fig. 4
figure 4

The \(\varDelta R\) between the reconstructed and true neutrino (or di-neutrino) momenta for the \(\tau \rightarrow \ell \nu \nu \) (a) and \(\tau \rightarrow \pi \nu +\rho \nu \) (b) decays, the ratio of the reconstructed to true di-neutrino momentum for the \(\tau \rightarrow \ell \nu \nu \) decay (c), and the 2-D distribution of the true versus reconstructed di-neutrino mass (d), in the \(\ell +\pi /\rho \) and \(Z\rightarrow \ell \ell \) channels

3 Measurement of the CP mixing angle of \(h\tau \tau \) at \(e^+e^-\) collider

The simulation details and the basic reconstruction are presented in Sect. 2.1. To suppress the background, the cuts listed in Table 2 are further applied to achieve the best signal significance. The \(m_h^{\text {RC}}\) is calculated from the Z recoil, \(m_{h,\text {fit}}^{\text {RC}}\) and \(m_Z^{\text {fit}}\) are obtained from per-event fit, and \(m_h\), \(m_\tau \) and \(m_\rho \) are calculated from the di-tau side. Figure 5 shows the distributions of \(m_Z\), \(m_h^{\text {RC}}\) and \(m_h\) for the \(Z\rightarrow \ell \ell \) and \(Z\rightarrow jj\) decays before the cuts are applied. With 5 \(ab^{-1}\) of \(e^+e^-\) collision data at \(E_{\text {CM}}=250\) GeV, the expected numbers of events entering the CP sensitivity test are listed in Table 3. Note that the whole per-event fitting procedure not only helps to determine the momentum of neutrino and also provides better resolutions for these mass observables and thus helps to further suppress the backgrounds.

Fig. 5
figure 5

The distributions of \(m_Z\) (a1, a2), \(m_h^{\text {RC}}\) (b1, b2) and \(m_h\) (c1, c2) for the \(Z\rightarrow \ell \ell \) (a1, b1, c1) and \(Z\rightarrow jj\) (a2, b2, c2) decays with the three tau decay modes considered and assuming 5 \(ab^{-1}\) of data. Note that the \(\ell +\pi /\rho \) and \(Z\rightarrow jj\) channels are excluded

3.1 Matrix element-based measurement of CP

With the fully reconstructed momentum of final states from the Higgs decay, a method based on the matrix elements could be used to probe the CP mixing angle. From Eq. (1), the matrix element squared with \(\phi \) being the CP mixing angle can be expressed as

$$\begin{aligned} |\mathcal {M}|^2\propto & {} A + B \cos (2\phi ) + C \sin (2\phi ), \nonumber \\\propto & {} I_1\cos ^2(\phi ) + I_2\sin (\phi )\cos (\phi ) + I_3\sin ^2(\phi ), \ \end{aligned}$$
(10)

where \(I_1 = A+B\), \(I_2 = 2C\) and \(I_3 = A-B\). For example, in the \(\pi +\pi \) channel, the coefficients have relatively simple expressions:

$$\begin{aligned} A_{\pi \pi }= & {} \frac{m_{\tau }^4}{8}((m_h^2 - 2m_{\tau }^2)(m_{\tau }^2 - m_{\pi }^2)^2 - 8m_{\tau }^4(p_{\pi ^{-}} \cdot p_{\pi ^{+}})) ,\nonumber \\ B_{\pi \pi }= & {} -\frac{m_{\tau }^6}{4}((m_{\pi }^2 - m_{\tau }^2)^2 - 2(m_h^2 - 2m_{\tau }^2)(p_{\pi ^{-}} \cdot p_{\pi ^{+}}) \nonumber \\&+\, 4(p_{\pi ^{-}} \cdot p_{\tau ^{+}})(p_{\pi ^{+}} \cdot p_{\tau ^{-}})) ,\nonumber \\ C_{\pi \pi }= & {} m_{\tau }^6\epsilon ^{p_{\pi ^{-}} p_{\pi ^{+}} p_{\tau ^{-}} p_{\tau ^{+}} }, \end{aligned}$$
(11)

where \(p_{\tau ^-}\) and \(p_{\tau ^+}\) are the momenta of the two taus, which include the information of the reconstructed neutrinos and \(\epsilon ^{p_1p_2p_3p_4}\) is short for \(\epsilon _{\mu \nu \rho \sigma }p_1^\mu p_2^\nu p_3^\rho p_4^\sigma \).

Table 2 Further kinematic cuts applied to enhance the signal significance
Table 3 The expected numbers of signal and background events with 5 \(ab^{-1}\) of data in each channel after the cuts in Table 2 with \(Z\rightarrow \ell \ell ,jj\) combined. Note that the \(\ell +\pi /\rho \) and \(Z\rightarrow jj\) channels are excluded

The full expression of the coefficients for the other decay mode combinations can be found in Appendix A, where we also list the effective Lagrangian used for tau decay. For channels involving leptonic decays of tau, a phase space integral is performed on the internal degrees of freedom between the two neutrinos, at the cost of a bit loss of sensitivity to CP. In the \(\rho +\rho \) and \(\pi +\rho \) channels, the neutral pion(s) is assigned to the corresponding decay chain without any ambiguity, since the neutral pion from different decay chain is highly collimated with the original tau. An optimal observable [44, 45], defined as \(OO=I_2/I_1\) for each event, is used to distinguish signals with different CP mixing angles. For signals with a positive (negative) \(\phi \), the mean of the OO distribution will be shifted to negative (positive) values, as shown in Fig. 6a.

Template probability density functions (PDF) for different CP angles are first obtained from the simulations. A binned likelihood function (\(\mathcal {L}\)) is then calculated from the PDF as a function of the CP mixing angle \(\phi \). The best-fit CP angle is found with the least negative logarithm of the likelihood (NLL), and the \(1\sigma \) confidence interval is obtained by finding the angles whose NLLs are 0.5 higher than the minimum NLL. Figure 6b shows the expected \(\varDelta \)NLL as a function of the CP angle \(\phi \), with all five channels in Table 3 combined. With 5 \(ab^{-1}\) of \(e^+e^-\) collision data (can be achieved by the CEPC [8]), a precision of 0.05 radians can be reached. On the other hand, if the integrated luminosity is 2 \(ab^{-1}\) (can be achieved by the ILC [46]), a precision of about 0.09 can be reached.

Actually, one can also built an angle \(\varDelta \phi _{\mathrm{ME}}\) (\(-\pi \le \varDelta \phi _{\mathrm{ME}}\le \pi \)) to probe the CP mixing angle which satisfies

$$\begin{aligned}&\cos (\varDelta \phi _{\mathrm{ME}}) = \frac{B}{\sqrt{B^2+C^2}}, \nonumber \\&\sin (\varDelta \phi _{\mathrm{ME}}) = \frac{C}{\sqrt{B^2+C^2}}. \end{aligned}$$
(12)

With this definition, the matrix element (which also determines the distribution of \(\varDelta \phi _{\mathrm{ME}}\)) becomes

$$\begin{aligned} |\mathcal {M}|^2 \propto A + \sqrt{B^2+C^2}\cos (\varDelta \phi _{\mathrm{ME}}-2\phi ). \end{aligned}$$
(13)

The distribution of \(\varDelta \phi _{\mathrm{ME}}\) is presented in Fig. 7a with two different choice of CP mixing angle \(\phi \). \(\varDelta \phi _{\mathrm{ME}}\) makes full use of the information inside the matrix element and performs better when \(\phi \) is large. Figure 7b shows the NLL comparison between OO and \(\varDelta \phi _{\mathrm{ME}}\), we find that when \(\phi \) is large, \(\varDelta \phi _{\mathrm{ME}}\) is better than OO (as in the OO construction, the term \(I_3\) is omitted). But in the small \(\phi \) region, the two methods give quite similar results.

Fig. 6
figure 6

a The expected OO distributions in the \(\pi +\rho \) channel with 5 \(ab^{-1}\) of data for two different CP mixing angles. b The \(\varDelta \)NLL as a function of the CP mixing angle \(\phi \) calculated from expected zero-CP events at 5 \(ab^{-1}\), with all five channels in Table 3 combined. The \(1\sigma \) interval bounds are the intersections between the \(\varDelta \)NLL curve and the blue dashed line

Fig. 7
figure 7

a The expected \(\varDelta \phi _{\mathrm{ME}}\) distributions in the \(\rho +\rho \) channel with 5 \(ab^{-1}\) of data for two different CP mixing angles. b The \(\varDelta \)NLL as a function of the CP mixing angle \(\phi \) calculated from expected zero-CP events at 5 \(ab^{-1}\) for \(\varDelta \phi _{\mathrm{ME}}\) and OO in \(\rho \rho \) channel

Further, the \(\rho +\rho \) channel is used to compare the sensitivities of the \(\varDelta \phi _{\text {IP}}\) method (using the impact parameters to approximately reconstruct the tau decay planes), the \(\varDelta \phi _{\text {CP}}\) method (using the visible \(\rho \) meson decay products to reconstruct the CP sensitive angle \(\varDelta \phi _{\text {CP}}\)) and the OO method, the details of the definitions for \(\varDelta \phi _{\text {IP}}\) and \(\varDelta \phi _{\text {CP}}\) are given in Appendix B. Figure 8a, b shows the distributions of \(\varDelta \phi _{\text {IP}}\), \(\varDelta \phi _{\text {CP}}\) in this channel. It is seen that the IP method have much worse separation power between CP even and CP mixed states than the \(\varDelta \phi _{\text {CP}}\) method. Figure 8c shows the \(\varDelta \)NLL variations with the \(\varDelta \phi _{\text {CP}}\) and the OO, from which one can read off that the latter is about \(35\%\) better than the former.

Fig. 8
figure 8

The distributions of \(\varDelta \phi _{\text {IP}}\) (a) and \(\varDelta \phi _{\text {CP}}\) (b) in the \(\rho +\rho \) channel with 5 \(ab^{-1}\) of data, using the impact parameter and \(\varDelta \phi _{\text {CP}}\) method, respectively. The \(\varDelta \)NLL variations with the \(\varDelta \phi _{\text {CP}}\) and the OO method are compared in (c) with the same channel

4 Conclusion

Understanding the CP property of the Higgs is one of the primary goals in future \(e^+e^-\) colliders or Higgs factory. If the Higgs is found to be a CP mixture, a door to matter–antimatter imbalance and New Physics would be opened. The \(H\rightarrow \tau \tau \) decay has a unique position in the Higgs CP search, as the CP odd contribution can enter the Lagrangian at the tree level.

In this paper, we proposed a new method utilizing the impact parameters and the on-shell conditions to fully reconstruct the momentum of the neutrinos from the tau decay (in the leptonic decay channel, the sum of the two neutrinos). With fully reconstructed the momenta of final state particles from the tau decays using a global likelihood scan and fit, the Higgs CP information can be best estimated based on a matrix element method. In the reconstruction, the resolution degradation due to ISR photons can be reduced by a pre-fit of the Higgs’ momentum from the Z recoil, which takes into account the Higgs mass and the special \(H\rightarrow \tau \tau \) decay topology.

We have performed a complete MC simulation taking into account the performance of current LHC detectors for the signal process \(e^+e^-\rightarrow Zh\rightarrow (\ell \ell /jj)(\tau ^+\tau ^-)\) followed by three main one-prong decay channels of tau and also the corresponding backgrounds. The refined Higgs momentum can further help to reduce the background leaving enough signal rates. By virtue of fully reconstructed neutrinos, the matrix element is calculated event by event for all di-tau decay modes, and matrix element based observables are built (OO and \(\varDelta \phi _{\text {ME}}\)) to probe the CP mixing angle \(\phi \) of the \(h\tau \tau \) coupling. At larger value of \(\phi \), \(\varDelta \phi _{\text {ME}}\) is definitely better than OO, but for small values of \(\phi \), the performances of them are found to be similar.

Template PDFs are obtained from the simulation, and the NLL analysis based on the PDF shows that with 5 \(ab^{-1}\) (2 \(ab^{-1}\)) of data at \(E_{\text {CM}}=250\) GeV center of mass energy from \(e^+e^-\) collisions, a CP mixing angle can be measured to a precision of 0.05 (0.09) radians, or \(2.9^\circ \) (\(5.2^\circ \)). The comparison with other CP observables (\(\varDelta \phi _{\text {IP}}\) and \(\varDelta \phi _{\text {CP}}\)) has also been presented, and we find that the CP mixing angle sensitivity reach based on our method is about at least 35% better than the other methods. The method can also be extended to other collider analyses that are sensitive to the neutrino momentum reconstruction.