Abstract
Within a covariant perturbative field-theoretical approach, the wave-packet modified neutrino propagator is expressed as an asymptotic expansion in powers of dimensionless Lorentz- and rotation-invariant variables. The expansion is valid at high energies and short but macroscopic space-time distances between the vertices of the proper Feynman macrodiagram. In terms of duality between the propagator and the effective neutrino wave packet, at short times and distances, neutrinos are deeply virtual and move quasi-classically. In the lowest-order approximation, this leads to the classical inverse-square dependence of the modulus squared flavor transition amplitude and related neutrino-induced event rate from distance L between the source and detector, and the above-mentioned asymptotics results in the corrections to the classical behavior represented by powers of \(L^2\). This is very different from the long-baseline regime, where similar corrections are given by an asymptotic expansion in inverse powers of \(L^2\). However, in both short- and long-baseline regimes, the main corrections lead to a decrease in number of neutrino events.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
A considerable amount of work has been devoted to the quantum-field theoretical (QFT) analysis of spatial evolution of mixed massive neutrinos [1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40]. These studies were, in large part, initiated by a harsh critique of the quantum-mechanical (QM) theory of plane-wave neutrino oscillations, burdened with numerous inconsistencies and paradoxes. A comprehensive discussion of this and related issues, as well as further references, can be found in books [41, 42]. The QFT-based computations consistently reproduce, as an approximation, the standard QM formula for the neutrino flavor transition probability, and usually predict deviations from it (often different for different authors and nonequivalent approaches), thereby providing proper interpretation of the QM result, determining the scope of its applicability, and indicating possible directions of the experimental search for new related phenomena, such as decoherence and dispersion effects [43,44,45,46,47,48,49] and inverse-square law violation [50,51,52,53].
The above-cited works are mainly concerned with studying the neutrino oscillations at long baselines, where the transition probability is large enough to be detected experimentally. The short baseline oscillation regime was discussed in Ref. [9] within a solvable QFT model, and later in Ref. [19] in a more general approach. In this work, we study in detail just this case, i.e., the neutrino propagation at short distances in the QFT framework. Here, there is an immediate need for clarification: By short and long baselines (SBL and LBL) we will mean the distances L between the neutrino production and detection points that satisfy the conditions \(L \ll E_\nu /\Sigma _{\text {SBL}}^2\) and \(L \gg E_\nu /\Sigma _{\text {LBL}}^2\) respectively, where \(E_\nu \) is the mean neutrino energy, and \(\Sigma _{\text {SBL}}\) and \(\Sigma _{\text {LBL}}\) are the representative scales of the neutrino energy uncertainty related to the SBL and LBL regions. Later in the paper we explain where these conditions come from and what might be interesting about the SBL region (depending on the values of L, \(E_\nu \), and \(\Sigma _{\text {SBL}}\)) which may or may not be related to SBL experiments with accelerator or reactor (anti)neutrinos, or neutrinos from artificial radioactive sources.
Throughout the paper, we use the natural units (\(\hbar =c=1\)) and the Feynman metric. The Greek letters \(\mu , \nu , \ldots \) are used for Lorentzian indices, while the Latin indices \(k, l, n, \ldots \) enumerate the spatial components of vectors and tensors. The letters \(\alpha \) and \(\beta \) are reserved for lepton flavors. The formalism under consideration does not depend on the number of flavors or number of the neutrino mass eigenfields, but, unless otherwise noted, one may have in mind three generations of the Standard Model.
2 Essentials of formalism
The subsequent discussion is based on the covariant (“diagrammatic”) QFT formalism developed in Refs. [27, 28]. In this section, we shortly discuss the features of the formalism most essential for the aims of the present study, for more details, see Ref. [37]. The basic idea of the diagrammatic approach is that the flavor transition of neutrinos is the result of interference of “macroscopic” Feynman diagrams, such as the one shown in Fig. 1, perturbatively describing a lepton number violating process in which the massive neutrinos are internal lines connecting the vertices of their production and absorption. In other words, the neutrino mass eigenfields, \(\nu _i\) (\(i=1,2,3, \ldots \)), are treated as virtual intermediate states. The neutrinos with definite flavors, \(\nu _\alpha \) (\(\alpha =e,\mu ,\tau \)), do not participate in the formalism at all, although can be used to compare the predictions with those from other approaches, in particular, with the standard QM approach based on the concept of flavor mixing.
The external lines of the macrodiagrams correspond to asymptotically free incoming (“in”) and outgoing (“out”) wave packets \(\vert \varvec{p}_a,s_a,x_a\rangle \) (\(a{\in }I_{s,d}\)) and \(\vert \varvec{p}_b,s_b,x_b\rangle \) (\(b{\in }F_{s,d}\)) of the following generic form:
representing the superposition of the one-particle Fock’s states \(\vert \varvec{k}, s\rangle =\sqrt{2E_{\varvec{k}}}a_{\varvec{k}s}^\dag \vert 0\rangle \) with 3-momenta \(\varvec{k}\) and energy \(E_{\varvec{k}}=\sqrt{\varvec{k}^2+m^2}\), which may also depend on discrete variables s, such as the spin projection; here \(a_{\varvec{k}s}^\dag \) is the creation operator, m is the mass of the particle, \(d\varvec{k} \equiv d^3k\). Wave-packet state (1) is characterized by the most probable 3-momentum \(\varvec{p}\) and the space-time coordinate \(x=(x_0,\varvec{x})\) of its barycenter or, simply, center of symmetry, which is always implicitly assumed. The function \(\phi (\varvec{k},\varvec{p})\) is a model-dependent Lorentz scalar (“form factor”), which determines the shape of the packet and generally depends on a set of parameters \(\varvec{\sigma }\) that characterize the uncertainty of the momentum \(\varvec{p}\); \(k=(k_0,\varvec{k})\) and \(p=(p_0,\varvec{p})\) are the on-shell 4-momenta, i.e., \(k^2=p^2=m^2\). In the plane-wave limit (PWL) equivalent to the definite momenta approximation, the form factor becomes proportional to the 3d delta function,
and, as a result, the coordinate dependence in Eq. (1) disappears and the wave packet turns into the state with definite momentum:
From this point we will consider only very narrow in the momentum space external wave packets, for which the form-factor function \(\phi (\varvec{k},\varvec{p})\) is strongly peaked at the point \(\varvec{k}=\varvec{p}\). Such packets are very close to their plane-wave limit and in the configuration space are described by simple scalar function
The blocks (vertices) \(X_s\) and \(X_d\) (hereinafter referred to as “source” and “detector”) symbolically depict the microscopic regions of the in and out particle interactions. The spatial interval between \(X_s\) and \(X_d\) can be arbitrarily large. Without risk of confusion, we will denote by the same symbols \(X_s\) and \(X_d\) the 4-vectors that determine the effective space-time coordinates of the interaction regions; the precise definition of these coordinates (called “impact points”) depends on the form factor model and will be done later. Macroscopic separation between the source and detector vertices means that the values of \(\vert {X_d^0-X_s^0}\vert \) and \(\vert {\varvec{X}_d-\varvec{X}_s}\vert \) are large compared to the microscopic space-time scales of the interaction regions (determined by the dynamics) and to the micro- or even mesoscopic effective dimensions of the in and out wave packets; this clarifies the meaning of the term “macroscopic diagram”.
It is further assumed that the particle interactions are described by the Standard Model (SM) Lagrangian, phenomenologically augmented by a Dirac or Majorana mass term with the number of the neutrino mass eigenfields \(N\ge 3\) and by corresponding kinetic terms. So the set of initial or final states (or both sets) at each vertex must contain a charged lepton or (anti)neutrino. To avoid Irrelevant complications, we rule out the possibility that the external states include gauge or Higgs bosons.
Wave packets as external lines of macrodiagrams lead to a modification of Feynman’s rules. For the narrow in and out packets, they are modified in a rather simple way [27, 28]: for each external line, the standard (plain-wave) factor must be multiplied by
and
where each function \(\psi _{\varkappa }\left( \varvec{p}_{\varkappa },x\right) \) defined by Eq. (2) is generally specified by its own form factor \(\phi _{\varkappa }(\varvec{k},\varvec{p}_{\varkappa })\), most probable 3-momentum \(\varvec{p}_{\varkappa }\), space-time coordinate \(x_{\varkappa }\), and mass \(m_{\varkappa }\) (\(\varkappa =a,b\)). The internal lines (including loops) and vertex factors remain unchanged. The additional factors (3) provide the following two common multipliers in the integrand of the scattering amplitude [27, 28]:
These functions are called overlap integrals, because they characterize the space-time integrated degree of overlap between the in and out wave-packet states in the source and detector vertices. Obviously, in the plane-wave limit \(\psi (\varvec{p},x) \mapsto \exp (ipx)\) and integrals (4) turn into the ordinary 4d delta-functions to within a multiplier:
where
are the 4-momentum transfers in the source and detector vertices, respectively.
Our main interest in this paper is to study the space-time dependence of the count rate of neutrino-induced events. The expression for the count rate or number of events is obtained after various averages of the modulus squared amplitude of the process under study and is primarily determined by behavior of the causal neutrino propagator
“dressed” (through the overlap integrals) by the external wave packets. Here \(m_i\) is the neutrino mass,
\({{\widetilde{\delta }}}_{s,d}\left( q \mp q_{s,d}\right) \) are the “smeared” \(\delta \)-functions which provide approximate energy–momentum conservation at the vertices of the macrodiagram, and which in the plane-wave limit become the delta-functions,
in this case, the energy and momentum are exactly conserved at the microscopic level, while the uncertainty of coordinates of in and out particles tends to infinity and the space-time description of the process shown in Fig. 1 loses its physical meaning. The smeared \(\delta \)-functions are defined by the external wave packets and therefore their explicit form depends on the form factors \(\phi (\varvec{k},\varvec{p})\), but the limit (6) is model independent.
2.1 Relativistic Gaussian packets
Below we will use the so-called relativistic Gaussian packets (RGP) proposed in Ref. [27]. The RGP model corresponds to a very simple form factor \(\phi (\varvec{k},\varvec{p})\propto \exp \left[ -(kp)/2\sigma ^2\right] \) and the function (2) describing RGP in the coordinate representation is
Here \(\varvec{p}\) and m are the most probable 3-momentum and mass, respectively, \(x=(x_0,\varvec{x})\) is the space-time coordinate of the packet’s center,Footnote 1\(\sigma \) is the momentum spread (just a constant assumed small compared to m), \(K_1(z)\) is the modified Bessel function of the third kind (or Macdonald function) of order 1, and
For our aims, it will suffice to restrict ourselves to the simplest approximate form of Eq. (7)
(called contracted RGP, or CRGP), which describes a non-difluent mode characterized by the condition that in the intrinsic frame the function \(\vert \psi (\varvec{p},x)\vert =\vert \psi (\varvec{0},x_\star )\vert \) is independent of time \(x^0_\star \), namely, \( \vert \psi (\varvec{0},x_\star )\vert = \exp \left( -\sigma ^2\varvec{x}_\star ^2\right) . \) An analysis of the asymptotic expansion of \(\ln \left[ \psi _G(\varvec{0},x_\star )\right] \) in powers of the small parameter \(\sigma ^2/(m^2\zeta )\), taking into account the inequalities \(\vert \zeta \vert \ge 1\) and \(\vert \arg \zeta \vert <\pi /2\), provides the necessary and sufficient conditions of the non-difluent behavior:
As is seen, the area of applicability of the CRGP approximation expands with decreasing \(\sigma \).
2.2 Process under study
Many physical assumptions, limitations, and simplifications are the same for the SBL and LBL cases; these will be mentioned here only briefly. As in the LBL case [27, 28, 37], we will limit ourselves with studying the following process:
where \(\oplus \) sign indicates the macroscopic remoteness of the corresponding subsets of particles. This process is interesting because it includes virtually all terrestrial experiments with short baselines. The macroscopic Feynman diagram corresponding to process (10) is shown in Fig. 2; it is a particular case of diagram Fig. 1. Initial and final states \(I_{s,d}\) and \(F_{s,d}'\) in Fig. 2 consist of hadronic states, while \(\ell _\alpha ^+\) and \(\ell _\beta ^-\) represent the final-state leptons with the 4-momenta \(p_\alpha =\left( p_\alpha ^0,\varvec{p}_\alpha \right) \) and \(p_\beta =\left( p_\beta ^0,\varvec{p}_\beta \right) \), respectively (\(\alpha ,\beta =e,\mu ,\tau \)). Recall: it is assumed for definiteness that the subgraphs of the diagram containing lepton and quark lines, W-propagators, the elementary vertices proportional to the elements \(V_{{\alpha }j}^*\) and \(V_{{\beta }j}\) of the neutrino mixing matrix, as well as of the quark mixing matrix (not indicated in Fig. 1), are described by the Standard Model, but this assumption is not at all a critical part of the formalism under consideration. As above, \(X_s\) and \(X_d\) symbolically denote the neutrino source and detector blocks and, at the same time, the impact points macroscopically separated in space and time.
According to Ref. [27], the amplitude of the process (10) can be written in the following form:
This result is based on a theorem on factorization of hadronic blocks of the macrodiagram Fig. 2 proved in Ref. [37]. Let us concisely describe the ingredient of Eq. (11), necessary for a further presentation of the formalism. Here g is the SU(2) electroweak gauge coupling, \({\mathcal {N}}\) is the normalization factor, \({\mathcal {J}}_d^{\nu }\) and \({\mathcal {J}}_s^{\mu }\) are the standard QFT (c-number) hadronic currents (see Ref. [37] for more details), \({\overline{u}}(\varvec{p}_\beta )\) and \(v(\varvec{p}_\alpha )\) are the usual Dirac bispinors, describing the final-state charged leptons \(\ell _\beta \) and \(\ell _\alpha \), respectively, and \(O^{\nu }=\gamma ^\nu (1-\gamma _5)\); \(\varvec{p}_{\varkappa }\) and \(x_{\varkappa }\) are, respectively, the most probable 3-momentum of the packet \(\varkappa {\in }S{\oplus }D\) and space-time position of the packet’s center.
The tensor function \({\mathbb {G}}^j_{\nu \nu '\mu '\mu }\) accumulating the effects of all external wave packets has the form
The so-called overlap tensors \(\Re _s\) and \(\Re _d\) are given by
respectively, where \(S=I_s{\oplus }F_s\), \(D=I_d{\oplus }F_d\),
\(u_{\varkappa }=p_{\varkappa }/m_{\varkappa }=\Gamma _{\varkappa }(1,\varvec{v}_{\varkappa })\) is the 4-velocity and \(\sigma _{\varkappa }\) is the momentum spread of the packet \(\varkappa \). The tensors \({\widetilde{\Re }}_s\) and \({\widetilde{\Re }}_d\) are inverse to the overlap tensors, i.e.
It is important that both \(\Re _{s,d}\) and \({\widetilde{\Re }}_{s,d}\) are symmetric and positive definite. Therefore, the determinants \(\vert \Re _{s,d}\vert \) and \(\vert {\widetilde{\Re }}_{s,d}\vert =1/\vert \Re _{s,d}\vert \) are positive and the same is also true for the principal minors of \(\Re _{s,d}\) and \({\widetilde{\Re }}_{s,d}\).
The functions \(\Delta _{\nu \nu '}(q-p_\beta )\) and \(\Delta _{\mu \mu '}(q+p_\alpha )\) are the W-boson propagators, whose explicit form we will not need. The functions \(\varPhi (q)\), \({\mathfrak {S}}\), and \(\Theta \) are defined as
where
and
From Eqs. (13) and (16b) follow the identities
from which we see that the impact points represent the weighted-mean space-time coordinates of the external wave packets. Considering that \({\mathfrak {S}}_{s,d}\ge 0\), the functions \(\exp (-{\mathfrak {S}}_s)\) and \(\exp (-{\mathfrak {S}}_d)\) are the geometric suppression factors conditioned by the overlapping degree of the in and out wave packets in space and time: they suppress the amplitude when the packets do not intersect or intersect only partially near the points \(X_s\) and \(X_d\) in the source and detector, respectively. At the impact points \({\mathfrak {S}}_s={\mathfrak {S}}_d=0\); the closer are the world lines of the packets’ centers to the impact points, the more probable is the interaction between the packets. The condition that the interaction regions in the source and detector vertices are macroscopically separated from each other is therefore equivalent to the macroscopic separation of the points \(X_s\) and \(X_d\). The world line configurations have no concern with the specific dynamics governed by the interaction Lagrangian, being uniquely defined by the initial (final) coordinates, group velocities and effective dimensions of the asymptotically free in (out) wave packets.
3 Long baselines in short
An important tool for studying LBL asymptotics is the Grimus–Stockinger theorem [4] used in most abovecited analyses of neutrino oscillations based on QFT approaches with wave packets. The theorem predicts that the spatial part of the “dressed” neutrino propagator (5) behaves as 1/L at \(L\equiv \vert \varvec{X}\vert \rightarrow \infty \), which leads to the classical inverse-square law (ISL) for the detected number of neutrino events. In Refs. [50, 54] it was shown that the classical ISL is violated at finite L. This is a consequence of an extension of the Grimus–Stockinger theorem, which allows one to compute corrections to the long-distance asymptotics of the propagator (5). The theorem as formulated in Ref. [50] states that for any Schwartz function \(\varPhi (\varvec{q})\)Footnote 2 the integral
may be represented by the asymptotic series
where
\(\varvec{l} = \varvec{X}/\vert \varvec{X}\vert \), and \(c_{abc}\) are recursively defined positive numbers (given in Ref. [50] for \(a\le 6\)). Equivalent but recursive-free formulation of the theorem is given in Ref. [54]. The applicability of the leading (Grimus–Stockinger) approximation is defined by explicit form of \(\varPhi (\varvec{q})\), but in general case it can be written as
where \(\Sigma _{\text {LBL}}\) is an effective parameter dependent on the momenta, masses, and momentum spreads (\(\sigma _\varkappa \) in the CRGP model) of the external (in and out) wave packets. It is also shown in Ref. [50] that \(\left| {\mathfrak {J}}(\varvec{X},\upkappa )\right| ^2\) is given by a series in powers of \(1/L^2\) and the lowest (\(\propto 1/L^4\)) ISL violating (ISLV) correction is negative in the physical region \(\kappa ^2\ge 0\). The explicit formula for \(\Sigma _{\text {LBL}}\) obtained within the CRGP model in the third order saddle-point asymptotic expansion is given in Ref. [53]. It is far beyond the scope of this article to analyze the contribution of the non-physical domain.
Another important consequence of the theorem is that the virtual neutrinos are on-mass-shell, and both energies and momenta of different eigenfields \(\nu _i\) do not coincide. This strongly contradicts the standard assumption of the plane-wave QM theory about the equality of the momenta of the neutrino states with different masses, but, surprisingly, in the ultrarelativistic limit leads to the same oscillation phases.
Omitting the rather lengthy calculations, let us write down the final expression for the number of events in a detector located at a large distance from the source, obtained within the framework of the formalism described above:
The details of the calculations, conditions of applicability, and interpretation of this somewhat abstract approximate formula can be found in Refs. [28, 37]. Here we will only briefly explain its constituents.
The indices \(\alpha \) and \(\beta \) indicate that the final states must necessarily include the leptons \(\ell _\alpha \) and \(\ell _\beta \). Parameter \(\tau _d\) represents the detector exposure time, \(V_{{\mathcal {S}}}\) and \(V_{{\mathcal {D}}}\) are the fiducial volumes of, respectively, the source (\({\mathcal {S}}\)) and detector (\({\mathcal {D}}\)) “devices”; the spacial integrations are performed over these volumes. The differential form \(d\sigma _{{\nu }{\mathcal {D}}}\) is defined such that the expression
represents the differential cross section of neutrino scattering at the detector \({\mathcal {D}}\) as a whole. In the practically important case of neutrino scattering on single particles and under the usual assumption that the momentum distributions of scatterers are very narrow (such as the Maxwellian distribution), the differential form \(d\sigma _{{\nu }{\mathcal {D}}}\) becomes the elementary differential cross section for the given reaction multiplied by the number of scatterers in the volume \(V_{{\mathcal {D}}}\). The function \({\mathcal {P}}_{\alpha \beta }(E_\nu ,L)\) in Eq. (18) is the QFT analogue of the quantum-mechanical (QM) flavor transition probability; it is a function of neutrino energy \(E_\nu \) and distance \(L=\vert \varvec{y}-\varvec{x}\vert \) between the points \(\varvec{x}\in {\mathcal {S}}\) and \(\varvec{y}\in {\mathcal {D}}\); for any pair \((\varvec{x},\varvec{y})\), L is assumed to be large compared to the dimensions of \({\mathcal {S}}\) and \({\mathcal {D}}\). In the conventional ultrarelativistic approximation, the “probability” includes the standard QM oscillation phase factor and corrections for decoherence effects of different kinds. These corrections will be discussed in Sect. 6, when comparing the expressions for the number of events in the LBL and SBL modes.
Finally, the differential form \(d{\mathfrak {F}}_\nu \) is the differential neutrino flux in \({\mathcal {D}}\) defined such that the quantity
is the number of neutrinos appearing per unit time and unit neutrino energy in the elementary 3d volume \(d\varvec{x}=d^3x\) around the point \(\varvec{x}\), moving within a unit solid angle in the direction from point \(\varvec{x}\) to point \(\varvec{y}\), and crossing the unit area placed around \(\varvec{y}\) and perpendicular to \(\varvec{y}-\varvec{x}\). According to the extended Grimus–Stockinger theorem (17),
As a consequence, ISL is broken. The coefficients of the series (19) are discussed in Ref. [50]. Recall that \({\mathfrak {C}}_1 < 0\), so the lowest-order ISLV correction reduces the number of neutrino events:
Here L is the distance between the source and detector (assumed large compared to their sizes) and \({\mathfrak {L}}_0\) is an energy-dependent parameter of dimension of length:
Recent analysis [53] shows that current reactor antineutrino data timidly hints that the ISLV effect is already manifesting itself, although the accuracy of the data is still too low and the uncertainty of the calculated reactor antineutrino energy spectra is too large to draw confident conclusions.
4 Short baseline asymptotics
We are now ready to study the short-baseline (SBL) behavior of the amplitude (11). It will be shown that the SBL formula for the number of neutrino events leads, like Eq. (18), to an ISL behavior, which is also violated by higher-order corrections.
To simplify things a bit, we restrict ourselves to the conditions
Then the W propagator \(\Delta _{\mu \nu }\) can be approximated by \(-ig_{\mu \nu }/m_W^2\). Next, in a good approximation, we can replace \({\hat{q}}\) with \({\hat{{\mathcal {P}}}}\), where the 4-vector \({\mathcal {P}}\) is the stationary point of \(\varPhi (q)\). From Eqs. (14) and (15) it immediately follows that
where \({\mathcal {R}}\) is the tensor inverse of \({\widetilde{\Re }}\)Footnote 3:
In the exact energy–momentum conservation limit \({\mathcal {P}}=q_s=-q_d\) and, due to the suppression factor (14) in the amplitude, \({\mathcal {P}}\simeq q_s \simeq -q_d\) even otherwise. So, within the saddle-point approximation, it is natural to interpret \({\mathcal {P}}\) as the 4-momentum of the virtual neutrinos. It should be remarked that \({\mathcal {P}}\) is independent of the neutrino masses and is determined only by the momenta, masses and momentum spreads of the external wave packets (from both vertices of the macrodiagram), but it will be shown later that although neutrino virtuality, \({\mathcal {P}}^2-m_j^2\), can be rather large compared to \(m_j^2\), zero virtuality is the most probable for each (j-th) contribution to the full amplitude (11), which, of course, is a trivial consequence of the pole \(q^2=m_j^2\) in the neutrino propagator (5). Another simple but nonetheless important observation: At no configuration of external momenta can all the neutrinos \(\nu _j\) (\(j=1,2,3,\ldots \)) together be on their mass shell. Thus, at a substantial neutrino mass hierarchy (which, as the experiments suggest, is the case in the real world), neutrinos at short baselines are deeply virtual. This is basically different from the propagation regime occurring at long baselines, when all the neutrinos end up on their own mass shell and have different energies, 3-momenta, and velocities.
Since the Grimus–Stockinger theorem and its extension (17) are inapplicable at short distances, further analysis is based on another method, based on an asymptotic expansion of the full 4d neutrino propagator (A1). The expansion is derived in Appendix A in terms of the dimensionless variables \(\delta _j\), \(\eta _j\), and \(\Delta _j\) defined byFootnote 4
which are all assumed to be small (see Appendix A.3), and under the important additional constraint
Here we defined
The 4-vector \(p_j\) is a convenient intermediate notation for now, but, as will soon become clear, it can be given the meaning of the most probable 4-momentum of \(\nu _j\) in the sense that the j-th contribution to the amplitude (11) is maximal at \({\mathcal {P}}_0=E_j\). The Lorentz invariant positive-definite function
(where \(\vert {\mathcal {R}}\vert \equiv \det ({\mathcal {R}}) = \vert {\widetilde{\Re }}\vert ^{-1}\)), characterizes the scale of the neutrino energy–momentum uncertainty and generally depends on the momenta, masses, and momentum dispersions (spreads) of the external (in and out) wave packets. The dimensionless tensor \(\rho \) is defined by
It integrally describe the shapes and asymmetries of the space-time regions of overlap of the external wave packets in the vertices of the macrodiagram 2. The conditions of smallness of Lorentz scalars (23b) and (23c) are briefly summarized in Appendix A.3, and restriction (24) is discussed in Appendix A.6. All this will be covered in more detail below.
In this article, we will mainly study the leading-order approximation by putting \(\delta _j=\eta _j=\Delta _j=0\) in Eq. (A22), which gives
The following notations are used here:
The tensor function (12) now takes the form
where \({\mathbb {V}}_{s,d}\) are the overlap integrals defined by Eq. (4), which, in the framework of the CRGP model, have exactly the same form as in the LBL case:
where
The function \(\varOmega _j\) (not to be confused with the function \(\Omega _j\) used above) is given by
It is convenient to enter the intermediate notation
with which and considering that \(O^{\mu }O^{\nu }=0\) the amplitude (11) can be rewritten as
Let us now investigate the real part of the function \(\varOmega _j\) in more detail. As is seen from Eq. (29), it splits into two parts with very different physical content:
4.1 Off-shell control
To clarify the physical meaning of \(R_{1j}\), we rewrite this function as follows:
where
\(\varvec{v}_j=\varvec{{\mathcal {P}}}/E_j\) is the most probable velocity of the j-th neutrino (see below), of course different from the “virtual velocity” \(\varvec{v}=\varvec{{\mathcal {P}}}/{\mathcal {P}}_0\); the Lorentz scalar \(\Sigma \) and tensor \(\rho \) are defined by Eqs. (25) and (26), respectively.
Let us digress for a while to clarify the meaning of the function \(\Sigma \) by considering a toy model (hereinafter referred to as \(\pi \)-\(\mu \) model) in which neutrinos are produced in two-particle decays (\(\pi _{\mu _2}\) for certainty) and, for one reason or another, contribution to the tensor \({\mathcal {R}}\) from the detector vertex can be ignored, so that \({\mathcal {R}}=\Re _s=T_\pi +T_\mu \). Then, neglecting for further simplification the neutrino mass and assuming energy–momentum conservation, we find:
where \(m_{\pi ,\mu }\) (\(\sigma _{\pi ,\mu }\)) are the masses (momentum spreads) of the pion and muon wave packets, respectively. If, e.g., \(\sigma _{\mu }=\sigma _{\pi }\) then \(\Sigma ^{(\pi \text {-}\mu )} \approx 0.87\sigma _{\pi }\). The extreme simplicity of this example should not, however, be misleading, because for a process involving three packets already, \(\Sigma \) (as well as \({\mathcal {G}}_j\), see below) is an invariant function of external momenta, not a constant.
Now, returning to the general case, we will restrict ourselves to the ultra-relativistic approximation:
In fact, this condition is not quite trivial for deeply virtual neutrinos, since the components of the 4-vector \({\mathcal {P}}\) are formally mutually independent. However, it is a direct consequence, provided by the natural conditions for the energy and momentum transfers in the vertices of the macrodiagram 2,
and by the presence of the multipliers \({{\widetilde{\delta }}}_s(q-q_s)\) and \({{\widetilde{\delta }}}_d(q+q_d)\) (smeared \(\delta \)-functions) in the amplitude.Footnote 5 It is in this sense that we will call virtual neutrinos ultrarelativistic. In this approximation Eq. (31) can be rewritten as
From this relationship follows that \(0 \le R_{1j} \lesssim 1\) when
Thus, the factor \(\exp \left( -R_{1j}\right) \) controls the magnitude of the neutrino virtuality: it allows a significant escape of the virtual neutrinos from their mass shell, but suppresses the contributions to the amplitude from external 4-momentum configurations that give too much virtuality \(\vert {{\mathcal {P}}^2}-m_j^2\vert \). For a numerical illustration, consider again the \(\pi \)-\(\mu \) model. In the exact energy–momentum conservation limit, we find:
It is apparent that the neutrino mass is completely negligible here and \({\mathcal {G}}_j^{(\pi \text {-}\mu )} \ggg m_j^4\). The dependence of the maximum virtuality on \(\sigma _\pi \) and \(\sigma _\mu \) is shown in Fig. 3.
It can be seen that although the on-shell neutrinos make the maximum contribution to the amplitude, the neutrino virtuality can be potentially very large if the dispersions of the momenta of the external wave packets (\(\sigma _\pi \) and \(\sigma _\mu \) in this example) are large enough. It is noteworthy that constraints (32) (as well as (33) below) are actually independent of neutrino mass, provided that \(\varvec{{\mathcal {P}}}^2 \sim {\mathcal {P}}_0^2 \gg m_j^2\).
Although the virtual neutrinos can behave as a bradyonic, luxonic, or tachyonic particle, the virtual velocity value \(\vert \varvec{v}\vert =\vert \varvec{{\mathcal {P}}}/{\mathcal {P}}_0\vert \) always remains close to the speed of light,
and the most probable velocity of \(\nu _j\) (determined by the condition \(R_{1j}=0\)) is, as expected,
Figure 4 shows the maximum value of \(\vert 1-\vert \varvec{v}\vert \vert \) calculated in the \(\pi \)-\(\mu \) model.
For comparison: the velocity of the real \(\nu _j\) from \(\pi _{\mu _2}\) decay at rest is equal to
4.2 Quasi-classical motion
Consider now the second term in Eq. (30). The factor \(\exp \left( -R_{2j}\right) \) suppresses the corresponding (j-th) term in the amplitude when neutrino deviates from its classical world line. To prove this, let us rewrite \(R_{2j}\) as follows:
It is easy to prove that \(R_{2j}>0\). In the neutrino rest frame (checkmarked) Eq. (35a) converts to
By rotating the coordinate system to align the z-axis along \(\varvec{{\check{X}}}\), we find
This function is positive because the expression in parentheses and the denominator are the principal minors of the symmetric positive-definite matrix \(\Vert {\mathcal {R}}_{\mu \nu }\Vert \).
From Eq. (35b) follows that \(R_{2j}\) vanishes along the classical trajectory \(X^\mu =u_j^\mu \tau \), where \(\tau \) is the proper time, \(u_j=p_j/m_j=\left( E_j/m_j,\varvec{{\mathcal {P}}}/m_j\right) \) is the neutrino 4-velocity, and it is assumed that \(m_j>0\). Below we will need another useful representation of the function \(R_{2j}\), which also allows us to extend the above conclusion to massless neutrinos. After some manipulations, expression (35b) can be rewritten as
where
Although the relativistic invariance and positivity of expression (36) are not seen explicitly, they certainly take place, since it is an identical rewrite of Eq. (35b). Evidently \(R_{2j}=0\) on the classical trajectory \(\varvec{X}=\varvec{v}_jX_0\), where \(\varvec{v}_j=\varvec{{\mathcal {P}}}/E_j\). It is easy to verify that the function \(R_{2j}\) is invariant under the group of uniform rectilinear motions (displacements along the classical word lines)
This means that the virtual neutrino behaves like a quasiclassical on-shell particle or, more precisely, like the asymmetric (“true Gaussian”) wave packet studied in Ref. [55]. In other words, there is a duality between the description of the virtual neutrino motion by the causal propagator and by an effective wave packet (see also Ref. [50] and Sect. 5.4 below). One more important fact should be pointed out: as proved in Ref. [55], the true Gaussian wave packet does not coincide in any approximation with CRGP, which describes the neutrino state in the LBL regime [28, 37]. Thus, the neutrino propagation at short distances is profoundly different from that at long distances.
5 Microscopic probability
The following important identity
valid for any 4-vector q, was proved in Ref. [37]. Here
and \(\psi _{\varkappa }\left( \varvec{p}_{\varkappa },x_{\varkappa }\right) \) is the coordinate-dependent part of the wave function of the packet \(\varkappa \) in the coordinate representation (8) with \(p=p_{\varkappa }\), \(x=x_{\varkappa }\) and \(m=m_{\varkappa }\). The functions \(\delta _{s,d}(K)\), of course, do not coincide with the functions \({\widetilde{\delta }}_{s,d}(K)\) used so far, but have the same plane-wave limit (6) as \({\widetilde{\delta }}_{s,d}(K)\) and similar properties. The physical meaning of the functions \({\mathrm {V}}_s\) and \({\mathrm {V}}_d\) is obvious from the integral representation (40); it implies that they should be treated as 4d (space-time) overlap volumes of incoming and outgoing wave packets in the source and detector, respectively. It can be proved that they take maximum values when the classical world lines of the packets intersect at the impact points.
Using identity (38) and taking into account the explicit form of the normalization factor \({\mathcal {N}}\) [28, 37],
where
is the effective spatial volume of the wave packet \(\varkappa \), one can derive the following formula for the modulus squared transition amplitude (neutrino detection probability):
At first glance, this expression does not show the ISL-like behavior (that is \(\vert {\mathcal {A}}_{\beta \alpha }\vert ^2\propto 1/\vert \varvec{X}\vert ^2\)). However, since the exponential factor in the right-hand side of Eq. (41) suppresses the configurations of the impact points \(X_{s,d}\) that deviate significantly from the classical trajectories \(\varvec{X}_d-\varvec{X}_s =\varvec{v}_j(X_d^0-X_s^0)\), we can expect ISL to hold at least approximately. In order to see this explicitly, let us first recast Eq. (41) to a form that is the Lorentz-invariant 4d analogue of the 1d integral representation proposed by Cardall [14]. For this purpose, consider the identity
in which f(q) is an arbitrary smooth function of q. By integrating this identity with respect to q and using the saddle-point approximation, we obtain
Then, substituting
and using definition (39) we arrive at the approximate relation
valid with the same accuracy with which the amplitude itself was derived (that is, with the accuracy of the saddle point approximation). Using this result, we can rewrite Eq. (41) in the desired form:
where
and \(v_j=(1,\varvec{q}/E_j)\). Of course, it is implicitly assumed that we replaced \({\mathcal {P}}_0 \longmapsto q_0\) and \(\varvec{{\mathcal {P}}} \longmapsto \varvec{q}\) in the functions \(R_{1i}\), \(R_{1j}\), \(R_{2i}\), \(R_{2j}\), \(\omega _i\), and \(\omega _j\). Representation (43) provides the basis for further study. Its main difference from the corresponding LBL asymptotics is in the last exponential multiplier, which takes into account the non-zero virtuality and deviation from the classical trajectories. Below we will convert it to a 1d integral in order to obtain an expression for the event count rate as close as possible to the corresponding LBL result obtained in Ref. [28] for the on-shell regime and, in particular, to reproduce the inverse-square law.
5.1 Account for virtuality
To simplify expression (43), we take advantage of the fact that the function \(\exp {\left( -R_{1i}-R_{1j}\right) }\) has a sharp peak at its saddle point. In the ultrarelativistic limit it has the obvious meaning of the half-sum of the energies of \(\nu _i\) and \(\nu _j\) and in general case it is equal to
Quite apparently, the latter approximation (not used further) is valid only in the region \(\vert \varvec{q}\vert ^2\sim \vert \varvec{q}_s\vert ^2 \sim \vert \varvec{q}_d\vert ^2 \gg m_{i,j}^2\), which, however, gives the main contribution to the integral (43). It is notable that the difference with the usual on-shell neutrino energy appears here only in the fourth order with respect to \(m_{i,j}/\vert \varvec{q}\vert \). Given Eq. (44), we can replace \(q_0 \longmapsto q_{ij}^0\) everywhere except the exponent of \(-\left( R_{1i}+R_{1j}\right) \). Integrating the latter in \(q_0\), we obtain
This is the exact formula. Formal expansion of the right-hand side of Eq. (45) in powers of \(m_{i,j}^2\) yields:
Of course, this expansion is also valid only in the region \(\vert \varvec{q}\vert ^2\sim \vert \varvec{q}_s\vert ^2\sim \vert \varvec{q}_d\vert ^2 \gg m_{i,j}^2\). The last term in square brackets can compete with the second one if
but both these terms are negligible under experimentally interesting conditions. The common multiplier \(\propto \Sigma \) in Eq. (46) is however crucial: it shows that the non-zero neutrino virtuality, along with other factors to be discussed below, is responsible for the flavor transition and survival probabilities. This is an important distinguishing feature of the neutrino propagation at short distances between a source and a detector.
5.2 Account for quasi-classical motion
Next, we can integrate over the variables \(q_1\) and \(q_2\), given that the virtual neutrinos move almost quasi-classically and so the main contribution to the integral comes from the region where \(\vert {q}_1\vert \ll \vert {q}_3\vert \) and \(\vert {q}_2\vert \ll \vert {q}_3\vert \) in the coordinate frame where \(X_1=X_2=0\) (and hence \(\varvec{l}=(0,0,1)\)). Since all the factors in the integrand, except the exponent \(\exp \left( -R_{2j}-R_{2i}\right) \), slowly evolve in the region \(\vert {q}_{1,2}\vert \ll \vert {q}_3\vert \), we can safely put \(q_1=q_2=0\) in these factors. The remaining integral can be evaluated by the formal expansion of \(R_{2i}\) and \(R_{2i}\), represented according to Eq. (36), in powers of \(q_1/q_3\) and \(q_2/q_3\) to the second order. As a result of simple but time-consuming calculations, we obtainFootnote 6:
In this calculation, we neglected the neutrino masses in the pre-exponential multiplier (49), because taking them into account would exceed the accuracy.Footnote 7 The same is obviously true for the term proportional to \(\left( v_{i3}-v_{j3}\right) ^2\) in the exponent; it could be significant only at very large times satisfying the condition
which, as will soon be shown, strongly contradicts the restrictions determining the applicability of the formalism under discussion. Therefore, this term should certainly be neglected in further calculations. The leading term in the exponent expectedly shows that the main contributions to the modulus squared amplitude (41) give the classical trajectories \(X_3=v_{j3}X_0\) of the effective wave packets of virtual neutrinos \(\nu _j\). The deviations from the classical trajectories is suppressed by the universal Lorentz-invariant multiplier \(1/{\widetilde{\Re }}^{\mu \nu }l_{\mu }l_{\nu }=O\left( \Sigma ^2\right) \) – perfectly the same as in the LBL mode [28].
5.3 ISL violation at supershort baselines
The main result of this exercise is, of course, a mere reproduction in QFT of the classical ISL at short times, \(1/\Sigma _{\text {SBL}}\ll T_0 \ll \vert \varvec{q}\vert /\Sigma _{\text {SBL}}^2\), which is the SBL analogue to the Grimus–Stockinger theorem, applicable to asymptotically long distances, \(L \gg \vert \varvec{q}\vert /\Sigma _{\text {LBL}}^2\), and also deviations from ISL at \(T_0 \approx L \lesssim 1/\Sigma _{\text {SBL}}\), defined by the series (49). The dimensionless coefficients \({\mathfrak {T}}_n\) of this series can, in principle, be calculated for any finite n, but they are too cumbersome to reproduce here, and, furthermore, they do not seem to be of practical interest since measurements at the supershort distances \(L\lesssim 1/\Sigma \approx 0.2~\mu \text {m}\,\left( 1\,\text {eV}/\Sigma \right) \) are not feasible.
As a propaedeutic example, we write out only the coefficient for the leading correctionFootnote 8:
were
and \({\bar{s}}=3-s\). It follows from dimensional considerations that all the functions \({\mathfrak {T}}_n\) do not contain large or small dimensional parameters, and the smallness of the ISLV corrections at supershort baselines is determined by the common exponential factor in Eq. (47). Thus, the corrections
are small under condition (24). As explained in Appendix A.6, under this condition we must all the more neglect the exponentially small contribution \(J_v\) defined by Eq. (A26) in Appendix A.5
5.4 Evolution of neutrino wave packet
Let us summarize at a qualitative level the behavior of the effective neutrino wave packet (ENWP) over a wide range of distances between the source and the detector. In the vicinity of the impact point \(X_s\), up to the “mesoscopic” distances \(L \sim 1/\Sigma \) there is still no the duality “propagator \(\longleftrightarrow \) wave packet” and the neutrino propagator has the form \(J_g(X)+J_v(X)\) (see Appendix A). At short but macroscopic distances, \(1/\Sigma _{\text {SBL}}\ll L \ll L_{\text {tr}} = \vert \varvec{q}\vert /\Sigma _{\text {SBL}}^2\), the contribution \(J_v(X)\) disappears and a “virtual” packet with off-mass-shell 4-momentum is formed, which moves almost along the classical world line with a velocity satisfying the condition (33); the most probable 4-momentum at this stage is however on shell and the most probable velocity is given by Eq. (34). The packet evolves with time and distance and at long distances, \(L \gg L_{\text {tr}}' = \vert \varvec{q}\vert /\Sigma _{\text {LBL}}^2\), turns into the on-shell ENWP. The “transition lengths” \(L_{\text {tr}}\) and \(L_{\text {tr}}'\) have the same order of magnitude, but may considerably differ from each other. In any case, there is a certain transition region where the virtual ENWP converts into the real (on-shell) one. Needless to say this region is described by neither SBL nor LBL asymptotics.
5.5 Modulus squared amplitude
Combining Eqs. (45) and (47) together, we obtain the explicitly rotation invariant expression:
\(l=(1,\varvec{l})\), \(\varvec{l}=\varvec{X}/\vert \varvec{X}\vert \), \(q_{ij}=(q_{ij}^0,\vert \varvec{q}\vert \varvec{l})\), where \(q_{ij}^0\) is defined by Eq. (44). The integration limits in the integral over \(\vert \varvec{q}\vert \) are not specified, since the main contribution is determined by a narrow region where \(\vert \varvec{q}\vert \sim \vert \varvec{q}_s\vert \simeq \vert \varvec{q}_d\vert \) allowed by the factor
It is useful to identically rewrite this factor in the form
where \(q=\left( \vert \varvec{q}\vert ,\varvec{q}\right) \) (\(\varvec{q}=\vert \varvec{q}\vert \varvec{l}\)) represents the 4-momentum of the (fictitious) massless neutrino. This expedient allows us to factor the functions \(\delta _{s,d}\) out from the sums in i and j. Using definition (39) we find
In the ultrarelativistic approximation, we arrive at the following result:
Although the expression in the first square brackets in Eq. (53) can formally be of arbitrary value, the factor \(\delta _s(q-q_s)\delta _d(q+q_d)\) in Eq. (51) guarantees that in the main approximation,
\(\vert \varTheta _{ij}\vert =O\left( m_{i,j}^2/\Sigma \vert \varvec{q}\vert \right) \) inside the integral over \(\vert \varvec{q}\vert \). The function \(\varTheta _{ij}\) does not have a constant sign and does not disappear in the diagonal terms (with \(i=j\)) of the modulus squared amplitude, i.e., the factors \(\exp \left( -\varTheta _{ij}\right) \) could potentially play a role in both transition and survival probabilities, but only for heavy (beyond SM) neutrinos and extremely small \(\Sigma \). Otherwise, they can be ignored. It should be recorded that the corresponding function in the LBL mode has (in the same approximation) a very different form:
where \(q'=\left( q_0,q_0\varvec{l}\right) =q_0l\). But of course, the functions (54) and (55) are of the same order of magnitude.
It is easy to show [28] that \(\vert {{\mathcal {M}}}\vert =\vert {M_sM_d}\vert \), where
\(G_F\) is the Fermi coupling constant), and \(u_-(\varvec{q})\) is the Dirac bispinor for the left-handed massless neutrino; no need to justify the reasons for this trick. It should be readily apparent that \(M_s\) and \(M_d^*\) represent the matrix elements for the neutrino production and absorption in the reactions \(I_s \rightarrow F_s'\ell _\alpha ^+\nu \) and \(\nu I_d \rightarrow F_d'\ell _\beta ^-\), respectively. As a consequence, we can rewrite Eq. (50) in the form
Here and in what follows, \(q=(\vert \varvec{q}\vert ,\varvec{q})\) is the 4-momentum of the massless neutrino.
6 Count rate
In order to obtain observable quantities, the generic result (56) must be averaged over the unmeasurable and/or unused variables on which the in and out wave-packet states depend. The averaging procedure can be implemented only by taking into account the physical conditions of a real experimental environment. This makes further analysis model-dependent.
Here we will use the same model as in Refs. [28, 37]. The model supposes that the statistical distributions of the incoming wave packets \(a{\in }I_{s,d}\) on the mean momenta, spin projections and space-time coordinates in the source and detector devices can be described by single-particle distribution functions \(f_a({\varvec{p}}_a,s_a,x_a)\), \(x_a=(x_a^0,\varvec{x}_a)\). Each function \(f_a\) is normalized to the total number, \(N_a(x_a^0)\), of the packets a at time \(x_a^0\):
To proceed further, we redefine the terms “source” and “detector”, which so far have been used to refer to the vertices of the macrodiagram shown in Fig. 2. We will use these terms and notations \({\mathcal {S}}\) and \({\mathcal {D}}\) both for the corresponding devices and for the supports of the product of the distribution functions \(f_a\) in space and time, namely,
which are assumed finite and mutually non-intersecting in space. We also assume that the spatial dimensions of \({\mathcal {S}}\) and \({\mathcal {D}}\) are very large compared with the effective dimensions (\(\sim \sigma _{\varkappa }^{-1}\)) of the wave packets \(\varkappa \) moving inside \({\mathcal {S}}\) and \({\mathcal {D}}\). But we will not use the generally accepted assumption that the dimensions \({\mathcal {S}}\) and \({\mathcal {D}}\) are small compared to the effective distance between them, because this assumption is not suitable for such experiments as GALLEX [56], SAGE [57, 58], and BEST [59, 60], where the source is surrounded by a detector environment. We neglect the background events caused by outgoing particles produced in \({\mathcal {S}}\) and falling in \({\mathcal {D}}\); this issue would not deserve special mention in the case of long baseline experiments. For certainty, we consider only such experiments in which the momenta of all or some types of particles arising in \({\mathcal {D}}\) are measured. To avoid non-principal complications we suppose a detection efficiency of 100%. Under all these assumptions, the macroscopically averaged probability (56) (representing in this instance the number of events registered in \({\mathcal {D}}\)) reads
The symbol \(\sum _{\text {spins}}\) denotes the averaging over the spin projections \(s_a\) of the packets \(a {\in } I_{s,d}\) and summation over the spin projections \(s_b\) of the packets \(b {\in } F_{s,d}\), supposing that the projections \(s_b\) are not measured. The symbol \([d\varvec{p}_b]\) indicates no integration in \(\varvec{p}_b\) (i.e. \(\int [d\varvec{p}_b]{\equiv }d\varvec{p}_b\)), and hence that these momenta are measured. If, however, the momenta of particles of certain types \(b' {\in } F_d\) are not measured, the brackets must be removed for \(b=b'\). Therefore, it should be clearly apparent that the expression (58) means the number (\(N_{\beta \alpha }\)) of detected particles b with 3-momenta between \(\varvec{p}_b\) and \(\varvec{p}_b+d\varvec{p}_b\) (or any momenta) in \({\mathcal {D}}\). As in the LBL case, the indices \(\alpha \) and \(\beta \) indicate that neutrino exchange generally leads to the lepton number violation (when \(\alpha \ne \beta \)). The final state leptons \(\ell _\alpha \) and \(\ell _\beta \) provide the experimental signature of such a violation.
6.1 Averaging over spatial variables
An approximate multidimensional integration over the spatial variables \(\varvec{x}_{\varkappa }\) in Eq. (58) can be performed explicitly using the integral representation (40) for the 4d overlap volumes
and naturally assuming that the distribution functions \(f_a\) and the factor \(\exp \left( -\Xi _{ij}\right) \) change significantly only on macroscopic scales (comparable with the dimensions of \({\mathcal {S}}\) and \({\mathcal {D}}\)); most often, in real experiments, spatial variations of the distribution functions can be neglected altogether. On the other hand, the integrand
in Eq. (40) varies on a very short scale; in fact it differs from zero only when the classical world lines of all wave packets \(\varkappa \) pass through a narrow (microscopic or at least mesoscopic) neighborhood of the integration variable (let the latter be x for \({\mathrm {V}}_s\) and y for \({\mathrm {V}}_d\)). Hence, neglecting the possible (microscopic) edge effects in \({\mathcal {S}}\) and \({\mathcal {D}}\), it is safe to substitute \(x_{\varkappa }\) by x (y) for \(\varkappa {\in }S\) (\(\varkappa {\in }D\)) in the mentioned slowly-varying factors. Then, as is seen from Eq. (16) defining the impact points, \(X_s=x\) and \(X_d=y\). The remaining integrals over the variables \(\varvec{x}_{\varkappa }\) yield the factor \(\prod _{\varkappa {\in }S{\oplus }D}{\mathrm {V}}_{\varkappa }\), which cancels the product of the corresponding (inverse) factor in the integrand.
To formalize the procedure described above, let us prove that for an arbitrary 4-vector x there is always a configuration of spatially separated points \(\{x_{\varkappa }\}\) for which \({\mathscr {P}}_{s,d }=1\). Let, for certainty, \(\varkappa {\in }S\). Because of the invariance of \(\vert \psi _{\varkappa }(\varvec{p}_{\varkappa },x_{\varkappa }-x)\vert \) with respect to shifts along the world lines of the centers of the wave packet \(\varkappa \), we can replace the space-time coordinates \(x_{\varkappa }\) by \({\tilde{x}}_{\varkappa }\), where
It is apparent that \({\mathscr {P}}_s=1\) as \({\tilde{x}}_{\varkappa }=x\), i.e., at the same moment in time, but at non-matching spatial coordinates \(\varvec{x}_{\varkappa } = \varvec{x}+\left( x_{\varkappa }^0-x^0\right) \varvec{v}_{\varkappa }\), which are (by construction) remote from each other. Of course, \(x=X_s\) at \({\tilde{x}}_{\varkappa }=x\) (recall that \(X_s\) does not change when \(x_{\varkappa }\) is substituted by \({\tilde{x}}_{\varkappa }\)). It is implicitly assumed here that no more than one packet is at rest, but it is clear that the contribution to the integral from the configurations in which two or more wave packets have zero velocities is insignificant – such configurations form a set of measures zero.
In consequence of this we can rewrite Eq. (58) as follows:
Here we made the substitution
allowed by condition (24), and introduced the differential forms
6.2 Averaging over time intervals
The next step is to average the number of events (59) over the \({\mathcal {S}}\) and \({\mathcal {D}}\) running time intervals. This is not the most general setting of the problem and, moreover, it very much depends on the specific conditions of an experiment. But it typical for the current SBL experiments and is useful to clarify the physical meaning of the obtained results and compare them with the results derived for on-shell regime (long baselines, or LBL) in the same terms. For these aims, we will use the simple model for the particle distribution functions (59) proposed in Refs. [27, 28], in which they do not depend on time (or very slowly evolve with time) during the source operating period \(\tau _s=x^0_2-x^0_1\) and detector exposure period \(\tau _d=y^0_2-y^0_1\), that is
In the case of a detector, the step functions in Eq. (61) can be considered as “hardware” or “software” trigger conditions. In the context of present study (short distances between \({\mathcal {S}}\) and \({\mathcal {D}}\)), the steady-state operation periods for \({\mathcal {S}}\) and \({\mathcal {D}}\) can be long, as in the SBL reactor antineutrino experiments and experiments with artificial radioactive neutrino sources, or very short, as in accelerator-based SBL neutrino-tagging experiments (which, however, requires more specific analysis). In either case it is assumed that the time intervals required to turn the devices \({\mathcal {S}}\) and \({\mathcal {D}}\) on and off are negligible compared to \(\tau _s\) and \(\tau _d\), respectively. In the general case, we do not impose any synchronization conditions on the operation periods of \({\mathcal {S}}\) and \({\mathcal {D}}\).
As a result of this simplification, the integration over the source and detector operation times reduces to calculation of the following integral:
Using the general formula for the double Gaussian integral
where
the massless neutrino approximation in the pre-exponential factors, and leading order on \({\Delta }m_{ij}^2=m_i^2-m_j^2\) in the exponent, we can rewrite Eq. (59) in the following relatively compact form (see Appendix B for a more precise formula):
and \({\mathfrak {D}} = 1/\sqrt{2{\widetilde{\Re }}^{\mu \nu }l_{\mu }l_{\nu }}\). The unit vector \(\varvec{l}\) is directed along the vector \(\varvec{y}-\varvec{x}\). But, given that all structures (tensor convolutions) in the above formula are rotation invariants, one can orient \(\varvec{l}\) in any fixed direction, say, along the geometric centers of \({\mathcal {S}}\) and \({\mathcal {D}}\), provided they have appropriate symmetries. The differential forms \({\mathfrak {P}}_{s,d}\) in Eq. (62) are defined by Eq. (60), in which the distribution functions \(f_a\) should be substituted by \({\overline{f}}_a\), see Eq. (61). Therefore, according to Eq. (57), integration in \(\varvec{x}\) and \(\varvec{y}\) is carried out over the volumes of \({\mathcal {S}}\) and \({\mathcal {D}}\).
With the same arguments as in the LBL regime [28], the function (64) can be treated as a QFT generalization of the QM neutrino flavor transition probability. The ingredients involved in Eq. (64) are gathered in Table 1, which also contains the corresponding structures arising in the LBL asymptotics.
As is seen, the functions \({\mathcal {A}}_{ij}\) and \({\mathcal {B}}_{ij}\), as well as the function \(\varTheta _{ij}\) we discussed above (cf. Eqs. (54) and (55)) relating to the SBL and LBL modes have a certain similarities, but not identical, and do not coincide with each other in any approximation. The positivity of \({\mathcal {A}}_{ij}^2\) in the SBL mode is obvious, given that \({\mathcal {Q}^{4}} = {\mathcal {R}}_{00}{\mathcal {R}}_{33}-\left( {\mathcal {R}}_{03}\right) ^2\) in the coordinate system, where \(\varvec{l}=(0,0,1)\), and this expression is a principal minor of the positive-definite matrix \(\Vert {\mathcal {R}}_{\mu \nu }\Vert \) (see footnote 8). The suppression factor \(\exp \left( -{\mathcal {A}}_{ij}^2\right) \) is potentially important for very long L, but this does not apply to the SBL mode. The function \({\mathcal {C}}_{ij}^2\) is absent in the LBL mode, but has the same order of magnitude as \({\mathcal {B}}_{ij}^2\) and is also negligibly small for the SM neutrinos. Although the suppression factors in Eq. (64) are probably not important from the experimental viewpoint, they are of interest as a reflection of the process (obscured by the averaging of various kinds) responsible for the creation and evolution of ENWP, and shortly discussed in Sect. 5.4.
Remarkably the oscillation phase \(\varphi _{ij}\) is the same in both modes and, moreover, the same as in the QM approach, although it is clear that at short distances this phase is also small for the standard \(3\nu \) oscillations. The higher-order corrections to \(\varphi _{ij}\) are presented in Appendix B, but they are of potential interest only if there is mixing of standard neutrinos with hypothetical heavy sterile neutrinos.
6.3 Instrumental factor
Within the approximations made, the function \(S_{ij}\) (hereinafter referred to as decoherence function or instrumental factor) is the same as in the LBL mode, except for the difference in the explicit form of the function \({\mathcal {B}}_{ij}\) (see Table 1), on which \(S_{ij}\) depends. The general properties and several examples of the decoherence function have been studied in detail in Ref. [37]. Here we briefly discuss an additional simple example most relevant to SBL-experiments. Namely, we consider the case when the periods \(\tau _s\) and \(\tau _d\) are large compared to \(\max \vert \varvec{y}-\varvec{x}\vert \). Then
and, as shown in Ref. [37], the decoherence function is expressed through the universal (independent of indices i, j) real-valued function
of three independent dimensionless real variables. In these terms,
Here are the simplest properties of function (65):
-
\(0 < S \le 1\) for all values of the arguments and decreases with increasing b;
-
at \(t \ge t' \gg 1\) the function S weakly depends on t and \(t'\) and approaches its asymptotic value \(\exp (-b^2)\) with increasing t, so that \(S_{ij} \approx 1\) for \(i,j=1,2,3\) (SM neutrinos) and \(S_{ii} = 1\) for any i (since \({\mathcal {B}}_{ii}=0\));
-
S decreases rapidly as the ratio \(t/t'\) increases.
For illustration, Fig. 5 displays three particular cases of the function \(S(t,t',b)\) for \(t'=0.97t\), \(t'=t\), and \(t'=1.30t\). The interval of values of b used in the figure is, of course, unrealistically wide and is chosen only for the convenience of visualization. The figure shows that S depends very weakly on b at \(\vert b \vert \ll 1\) (which is the case in the real experimental circumstances), but the dependence on the parameters t and \(t'\) is very significant and should be taken into account when processing the data and deriving the count rate from the measured number of events.
6.4 Flavor transition probability
The function (64) matches the QM flavor transition probability only if \(S_{ij}=1\) and \({\mathcal {A}}_{ij}={\mathcal {B}}_{ij}={\mathcal {C}}_{ij}=\varTheta _{ij}=0\), which is not the case. So it really can be interpreted as a QFT refinement of the standard QM result. However, as in the LBL mode, the probabilistic interpretation of the function \({\mathcal {P}}_{\alpha \beta }\) can be true only in attenuated form, since the instrumental factor and the functions \({\mathcal {A}}_{ij}\), \({\mathcal {B}}_{ij}\), \({\mathcal {C}}_{ij}\), and \(\varTheta _{ij}\) involve the tensor components \({\widetilde{\Re }}^{\mu \nu }\) and \({\mathcal {R}}^{\mu \nu }\) that depend on the momenta, masses, and momentum spreads of all external wave packets and are in general different for the reactions (10) with different leptonic pairs \((\ell _\alpha ,\ell _\beta )\) in the final states. The distinctions are determined by kinematics of the subprocesses in \({\mathcal {S}}\) and \({\mathcal {D}}\) (in particular, by the reaction thresholds) and specific sets of participating in and out particles. It is also appropriate to recall that the function \(\varTheta _{ii}\) is not sign-definite and \(\varTheta _{ii} \ne 0\). As a result, the unitarity relations
are generally do not hold, or hold only approximately even within the kinematic boundaries for the corresponding subprocesses.
By applying the aforementioned asymptotics \(S=\exp (-b^2)\) (presumably valid for a wide class of possible SBL experiments), and using the same arguments as in Refs. [27, 28, 37], we arrive at formally the same expression as in the LBL case, represented by Eq. (18), where
with the exponent very close to unit for the SM neutrinos. In this extremely simple but quite realistic case, it follows from Eq. (67) and unitarity of the mixing matrix that
Thus, as was to be expected, the flavor transitions are strongly forbidden at short distances and/or high energies. The same conclusion follows from the simple QM formula for the flavor transition probability. It would seem that this leaves no room for experimental verification of the theory presented here. However, there is a subtle point: the commonplace result (68) is only valid when \(\Sigma \) is large enough, or better to say not too small; in fact, we have explicitly used this assumption in deriving the results discussed, as well as in some approximations and simplifications.
On the other hand, in the plane-wave limit, \(\Sigma \rightarrow 0\) and thus \({\mathcal {B}}_{ij}^2\rightarrow +\infty \) and \({\mathcal {C}}_{ij}^2\rightarrow +\infty \) at \(i \ne j\), which (together with the ultrarelativistic approximation) leads to the following resultFootnote 9
which is completely different from Eq. (68). As is well known, the oscillationless flavor transitions are ruled out by a huge number of (terrestrial) neutrino experiments, which within our formalism indicates that \(\Sigma >0\). At nonzero (and not very small!) \(\Sigma \) we have to take into account the power corrections for the neutrino propagator, derived in Appendix A.3, which could lead to a potentially observable consequence not predicted by the QM formalism and basically distinguishable from the ISLV effect [50] predicted for the LBL regime. The absence of additional masking factors (decoherence, damping, and oscillations themselves) would have helped to detect the presumably small effect.
7 ISL violation at short baselines
The previous discussion was based on using the leading order of the asymptotic expansion of the neutrino propagator in small parameters (23). Taking into account the next-to-leading order terms in the expression for number of events (18) would require getting rid of several approximations made in deriving this formula. In particular, the degree of approximation used in Eqs. (42) and (47) should be improved. We will postpone this issue for the future, and here we will confine our consideration to the potential effect from the first sub-leading term \(\propto \delta ^2\) in series (A23).
Within the approximations made so far, we must neglect in the series (A23) the terms of order \(O(\eta )\), \(O(\Delta )\) and higher, as well as the neutrino mass. Moreover, we can safely put in Eq. (A23) \(p \simeq q \simeq {\vert \varvec{q}\vert }l\) and \(X \simeq X_0l \simeq {\vert \varvec{X}\vert }l\). Then the series (A23) in the leading order becomes extremely simple:
Here
and the functions \(\theta _2\), \(\omega _1\), and \(\omega _2\) are defined by Eq. (A21) in which we must put \(\varvec{v}=\varvec{l}\). As a result, the inverse-square law for the differential neutrino flux \(d{\mathfrak {F}}_\nu \), which appears in the SBL version of Eq. (18), is violated under appropriate conditions (cf. the corresponding results of Refs. [9] and [19]):
where
In Appendix A.4 it is proved (without neglecting the neutrino mass) that \(F_{200}>0\). Therefore \(\Sigma _{\text {eff}}^4>0\). This means that the number of neutrino events should decrease compared to what would be expected in the absence of the ISL violation. This effect is ultimately associated with incomplete overlap in space and time of the noncollinearly interacting incoming and outgoing wave packets involved in the processes of neutrino production and absorption, which must lead to a decrease of the probability of their interactions. Therefore, the positivity of \(\Sigma _{\text {eff}}^4\) is natural.
A more intuitive, though more speculative, interpretation of the ISLV effect relates to the aforementioned duality “propagator \(\longleftrightarrow \) wave packet”. The effective neutrino wave packet moves quasi-classically, i.e., its center only approximately follows a straight line connecting the neutrino production and absorption points. According to Eq. (A24)
in the coordinate system, where \(\varvec{l}=(0,0,1)\), i.e. (within our approximations) \(\Sigma _{\text {eff}}\) is determined only by the spatial components of the tensor \({\mathcal {R}}\), transverse to the vector \(\varvec{l}\). Thus, the motion of the neutrino wave packet can be roughly thought as classical motion along \(\varvec{l}\) superimposed on the quantum jitter perpendicular to \(\varvec{l}\). The resulting path length, \({\bar{L}}\), is longer than the length of the classical rectilinear trajectory. Therefore the number of events (\(\propto 1/{\bar{L}}^2\)) must be smaller than for the classical motion. The formalism automatically accounts for this effect and gives the expected reduction of the number of events.
The neutrino virtuality also plays some part in the ISLV effect, and series (A22) accounts for it with the terms \(\propto \Delta ^c\). But we neglected these terms (as well as the terms \(\propto \eta ^b\)) because taking them into account would definitely exceed the accuracy of our estimate.
Regardless of the interpretation, the function \(\Sigma _{\text {eff}}\) in Eqs. (70) depicts a cumulative effect from overlap of the external wave-packet states \(\varkappa \) in the source and detector, depending (through the tensor \({\mathcal {R}}\)) on their momenta \(\varvec{p}_{\varkappa }\), masses \(m_{\varkappa }\), and momentum spreads \(\sigma _{\varkappa }\). With suitable values of these quantities, the ISLV correction can be measurable at macroscopic distances satisfying the conditions of applicability of the approximations used in the formalism. The basic constraints can be formulated as the inequalities
satisfied at sufficiently high neutrino energies and at short distances, exceeding the “mesoscopic scale” \(\sim 1/\Sigma \). They are virtually independent of the neutrino masses, provided that the latter are negligible compared to the neutrino energy. The constraints (71) are illustrated in Fig. 6 for representative intervals of neutrino energy and values of the scale parameter \(\Sigma \). The energies of the neutrinos from \(\pi _{\mu _2}\) and \(K_{\mu _2}\) decays in the meson rest frame are also shown (by vertical planes) to epitomize acceptable baselines in possible dedicated experiments.
Last but not least, the QFT approach predicts the exact opposite dependence of the ISLV effect on the distance \(L=\vert \varvec{y}-\varvec{x}\vert \) for the SBL and LBL asymptotics: the corresponding corrections are given by the series in powers of and \(1/L^2\), respectively, and the lowest order corrections are inversely proportional to each other to within a dimensionless positive factor. This feature makes it possible, at least in principle, to distinguish experimentally these two asymptotics from each other, as well as from the predictions of other approaches, in particular from the result of Ref. [19], which predicts the absence of even an approximate fulfillment of ISL in the entire region \(L\lesssim \vert \varvec{q}\vert /\Sigma _{\text {SBL}}^2\). Eventually, it is hoped that the transition region, which has not yet been explored within the formalism under discussion, will also become the subject of experimental study. A promising source of (anti)neutrinos could be, e.g., stopped-kaon decays, which provide a relatively wide range of applicability of the formalism (see Fig. 6).
8 Summary
In the framework of the covariant perturbative QFT approach with wave packets as asymptotically free initial and final states, the wave-packet modified neutrino propagator is expressed as an initial segment of the triple Taylor series in powers of dimensionless rotation-invariant variables (23). This asymptotic expansion is applicable at high energies and short but macroscopic space-time distances between the vertices of the corresponding Feynman macrodiagram of a fairly general form (see Fig. 2). It is also worth noting that the obtained asymptotic expansion applies not exclusively to the neutrino propagator.
Using this result, it is shown that in the short-baseline regime, high-energy neutrinos are deeply virtual and behave like quasi-classical particles, that is, propagate approximately along classical trajectories connecting the points of their production and absorption. In the leading-order approximation, this results in the classical inverse-square dependence of the modulus squared amplitude. But there are two kinds of corrections to the classical behavior: at supershort distances and times, \(\vert \varvec{X}\vert \sim X_0\), which, however, exceed the characteristic scale \(\sim 1/\Sigma \), relative corrections are described by a series in powers of \(\left( \vert \varvec{X}\vert -X_0\right) /X_0\sim 1/({\Sigma }X_0)\), while at \(1/\Sigma \ll \vert \varvec{X}\vert \sim X_0\ll \vert \varvec{q}\vert /\Sigma ^2\) – by an expansion in powers of \(\vert \varvec{X}\vert ^2\), which follows from the above expansion of the neutrino propagator. As in the LBL regime, the main ISL violating correction is negative, i.e., it leads to a decrease in the neutrino event rate. But the functional dependence of the ISLV correction is radically different from that in the LBL mode, where the correction is given by an expansion in inverse powers of \(\vert \varvec{X}\vert ^2\). This makes it potentially possible to distinguish between the SBL and LBL asymptotics and thereby test the theory in dedicated experiments with variable baseline or with several identical detectors at different baselines.
An expression for the time-averaged number of neutrino-induced events is derived and compared with the corresponding result for the long-baseline mode. It is, in particular, shown that although the corresponding formulas have the same structure, the decoherence factors are different and do not reduce to each other even in the lowest-order approximation. This results from the fact that the space-time behavior of the neutrino propagator and, hence, the effective neutrino wave packet in the two modes is strongly different, and there is a transition region, \(\vert \varvec{X}\vert \sim (0.1-10)\vert \varvec{q}\vert /\Sigma ^2\), in which the neutrino wave packet transformed from deeply virtual (with the 4-momentum independent of the neutrino mass) into the more habitual on-mass-shell packet. Negligible decoherence and flavor transition probabilities in the SBL regime should simplify the experimental study of the ISLV effect, and the specific dependence of the predicted count rate on distance and energy could potentially help distinguish its reduction due to ISLV from that caused by possible mixing of active and sterile neutrinos.
Data Statement
This manuscript has no associated data or the data will not be deposited. [Authors’ comment: This is a theoretical study and no experimental data.]
Notes
RGP is spherically symmetric in its own frame of reference (denoted by \(\star \)), which is defined by the condition \(\varvec{p}_\star =0\).
That is, the function \(\varPhi (\varvec{q}) \in C^{\infty }({\mathbb {R}}^3)\) which goes to zero faster than any inverse power of \(\vert \varvec{q}\vert \), as do all its derivatives. The (positive) function in the integrand of the neutrino propagator (5) belongs to this class; \(\upkappa ^2=q_0^2-m_i^2\).
It may be expedient to recall that the tensors \({\widetilde{\Re }}_{s,d}\) are inverse of \(\Re _{s,d}\); their explicit form can be found in Ref. [50]. Since these tensors are symmetric and positive definite, the same is true for the tensor \({\mathcal {R}}\).
In the Appendix, the index j is omitted to shorten the formulas. Notice that \(\delta _j\) and \(\eta _j\) weakly depend on \(m_j\) if, as will be eventually assumed, \(\varvec{{\mathcal {P}}}^2 \sim {\mathcal {P}}_0^2 \gg m_j^2\); in this approximation, the index j can be ignored, and the approximations for \(\vert \delta _j\vert \), \(\eta _j\), and \(\vert \Delta _j\vert \) in Eqs. (23) refer just to this case.
They also ensure that \(\vert \varvec{q}_s\vert \sim \vert \varvec{q}_d\vert \) and \(q_s^0 \sim -q_d^0\).
Recall that all ingredients in Eq. (47) are written in the coordinate system, where \(\varvec{l}=(0,0,1)\). For the convenience of further calculations, all tensor convolutions, except those contained in the coefficients \({\mathfrak {T}}_n\), are written in a form invariant to rotations of the coordinate axes.
It is, however, of interest that series (49) contains all degrees of the skew symmetric corrections \((X_3-X_0)/X_0\), since they are invariant with respect to the PT transformation.
Recall that the function \({\mathfrak {T}}_1\) is not an explicit rotation invariant. Here and in similar cases below, it is implicitly assumed that the tensors are properly transformed at the transition to the rotated coordinate system, but we do not rename or label them so as not to complicate already cumbersome formulas. We hope that this will not lead to misunderstandings, because it is always clear from the context which coordinate system we are talking about.
The derivation of this limit is not entirely trivial because of the factor \(\exp \left( -\varTheta _{ij}\right) \) in Eq. (67), but becomes more transparent if we rewrite Eq. (69) in the more accurate form
$$\begin{aligned} \delta _s(q-q_s)\delta _d(q+q_d){\mathcal {P}}_{\alpha \beta } \, {\mathop {\longrightarrow }\limits ^{\text {PWL}}} \,\delta (q-q_s)\delta (q+q_d)\sum _i\vert {V_{{\alpha }i}}\vert ^2\vert {V_{{\beta }i}}\vert ^2. \end{aligned}$$For simplicity, hereafter we omit the neutrino mass eigenfield index.
Admissibility of the latter simplification is discussed in the main text and, for this estimate, reduces to the approximate equalities \({\mathcal {P}}_0 \simeq E_p \simeq \vert \varvec{{\mathcal {P}}}\vert \).
Note that the main contribution is made by the interference term \(2\left[ \text {Re}(J_g)\text {Im}(J_v)+\text {Re}(J_v)\text {Im}(J_g)\right] /\vert {J_g}\vert ^2\).
References
I.Yu. Kobzarev, B.V. Martemyanov, L.B. Okun, M.G. Shchepkin, Sov. J. Nucl. Phys. 35, 708 (1982) [Yad. Fiz. 63, 1210–1219 (1982)]
C. Giunti, C.W. Kim, J.A. Lee, U.W. Lee, Phys. Rev. D 48, 4310 (1993). arXiv:hep-ph/9305276
M. Blasone, G. Vitiello, Ann. Phys. 244, 283 (1995). [Erratum ibid. 249, 363 (1996). arXiv:hep-ph/9501263]
W. Grimus, P. Stockinger, Phys. Rev. D 54, 3414 (1996). arXiv:hep-ph/9603430
C. Giunti, C.W. Kim, U.W. Lee, Phys. Lett. B 421, 237 (1998). arXiv:hep-ph/9709494
J.E. Campagne, Phys. Lett. B 400, 135 (1997)
K. Kiers, N. Weiss, Phys. Rev. D 57, 3091 (1998). arXiv:hep-ph/9710289
M. Zralek, Acta Phys. Pol. B 29, 3925 (1998). arXiv:hep-ph/9810543
A. Ioannisian, A. Pilaftsis, Phys. Rev. D 59, 053003 (1999). arXiv:hep-ph/9809503v2
M. Blasone, P.A. Henning, G. Vitiello, Phys. Lett. B 451, 140 (1999). arXiv:hep-th/9803157v2
W. Grimus, P. Stockinger, S. Mohanty, Phys. Rev. D 59, 013011 (1999). arXiv:hep-ph/9807442v2
W. Grimus, S. Mohanty, P. Stockinger, Phys. Rev. G 61, 033001 (2000). arXiv:hep-ph/9904285
C.Y. Cardall, D.J.H. Chung, Phys. Rev. D 60, 073012 (1999). arXiv:hep-ph/9904291
C.Y. Cardall, Phys. Rev. D 61, 073006 (2000). arXiv:hep-ph/9909332
M. Beuthe (2000). arXiv:hep-ph/0010054
P. Stockinger, Pramana 54, 203 (2000)
M. Beuthe, Phys. Rev. D 66, 013003 (2002). arXiv:hep-ph/0202068v2
C. Giunti, JHEP 11, 017 (2002). arXiv:hep-ph/0205014v2
M. Beuthe, Phys. Rep. 375, 105 (2003). arXiv:hep-ph/0109119v2
M. Garbutt, B.H.J. McKellar (2003). arXiv:hep-ph/0308111
A. Asahara, K. Ishikawa, T. Shimomura, T. Yabuki, Prog. Theor. Phys. 113, 385 (2005). arXiv:hep-ph/0406141v3
A.D. Dolgov, O.V. Lychkovskiy, A.A. Mamonov, L.B. Okun, M.V. Rotaev, M.G. Schepkin, Nucl. Phys. B 729, 79 (2005). arXiv:hep-ph/0505251
C.C. Nishi, Phys. Rev. D 73, 053013 (2006). arXiv:hep-ph/0506109v3
J. Kopp, JHEP 06, 049 (2009). arXiv:0904.4346v2 [hep-ph]
E.Kh. Akhmedov, A.Yu. Smirnov, Phys. Atom. Nucl. 72, 1363 (2009) [Yad. Fiz. 72, 1417–1435 (2009). arXiv:0905.1903v2 [hep-ph]]
B.D. Keister, W.N. Polyzou, Phys. Scr. 81, 055102 (2010). arXiv:0908.1404v2 [hep-ph]
V.A. Naumov, D.V. Naumov, Russ. Phys. J. 53, 549 (2010) [Izv. VUZ Fiz. 53, 5–27 (2010)]
D.V. Naumov, V.A. Naumov, J. Phys. G 37, 105014 (2010). arXiv:1008.0306v2 [hep-ph]
E.Kh. Akhmedov, J. Kopp, JHEP 04, 008 (2010). arXiv:1001.4815v3 [hep-ph]
E.Kh. Akhmedov, AYu. Smirnov, Found. Phys. 41, 1279 (2011). arXiv:1008.2077v3 [hep-ph]
A.E. Bernardini, M.M. Guzzo, C.C. Nishi, Fortschr. Phys. 59, 372 (2011). arXiv:1004.0734v4 [hep-ph]
EKh. Akhmedov, D. Hernandez, AYu. Smirnov, JHEP 04, 052 (2012). arXiv:1201.4128v2 [hep-ph]
EKh. Akhmedov, A. Wilhelm, JHEP 01, 165 (2013). arXiv:1205.6231v2 [hep-ph]
A. Kobach, A.V. Manohar, J. McGreevy, Phys. Lett. B 783, 59 (2018). arXiv:1711.07491 [hep-ph]
V.O. Egorov, I.P. Volobuev, Phys. Rev. D 100(3), 033004 (2019). arXiv:1902.03602v3 [hep-ph]
W. Grimus, J. Phys. G 47(8), 085004 (2020). arXiv:1910.13776v4 [hep-ph]
D.V. Naumov, V.A. Naumov, Phys. Part. Nucl. 51(1), 1 (2020) [Fiz. Elem. Chast. Atom. Yadra 51, 5–209 (2020)]
E. Ciuffoli, J. Evslin, H. Mohammed, Eur. Phys. J. C 81(4), 325 (2021). arXiv:2001.03287 [hep-ph]
L. Smaldone, G. Vitiello, Universe 7(12), 504 (2021). arXiv:2111.11809 [hep-th]
T. Cheng, M. Lindner, W. Rodejohann, JHEP 08, 111 (2022). arXiv:2204.10696v2 [hep-ph]
C. Giunti, C.W. Kim, Fundamentals of Neutrino Physics and Astrophysics (Oxford University Press, Oxford, 2007). ISBN:9780198508717
Z.-Z. Xing, S. Zhou, Neutrinos in Particle Physics, Astronomy and Cosmology (Zhejiang University Press, Hangzhou & Springer, Berlin, 2011). ISBN:9783642175596, 9787308080248
Y.L. Chan, M.C. Chu, K.M. Tsui, C.F. Wong, J. Xu, Eur. Phys. J. C 76(6), 310 (2016). arXiv:1507.06421v2 [hep-ph]
F.P. An et al. (Daya Bay Collaboration), Eur. Phys. J. C 77(9), 606 (2017). arXiv:1608.01661v2 [hep-ex]
C.F. Wong, M.C. Chu, M. Dolgareva, M. Gonchar, D. Naumov, D. Taichenachev, K. Treskov, K.M. Tsui, W. Wang, J. Phys. Conf. Ser. 888(1), 012253 (2017)
Z. Cheng, W. Wang, C.F. Wong, J. Zhang, Nucl. Phys. B 964, 115304 (2021). arXiv:2009.06450 [hep-ph]
A. de Gouvêa, V. De Romeri, C.A. Ternes, JHEP 08, 018 (2020). arXiv:2005.03022v2 [hep-ph]
A. de Gouvêa, V. De Romeri, C.A. Ternes, JHEP 06, 042 (2021). arXiv:2104.05806 [hep-ph]
J. Wang et al. (JUNO Collaboration), JHEP 06, 062 (2021). arXiv:2112.14450 [hep-ex]
V.A. Naumov, D.S. Shkirmanov, Eur. Phys. J. C 73, 2627 (2013). arXiv:1309.1011v2 [hep-ph]
D.V. Naumov, V.A. Naumov, D.S. Shkirmanov, Phys. Part. Nucl. 48(1), 12 (2017) [Pisma Fiz. Elem. Chast. Atom. Yadra 47, 1884–1897 (2016). arXiv:1507.04573 [hep-ph]]
D.V. Naumov, V.A. Naumov, D.S. Shkirmanov, Phys. Part. Nucl. 48(6), 1007 (2017)
V.A. Naumov, D.S. Shkirmanov, Universe 7(7), 246 (2021)
S.E. Korenblit, D.V. Taychenachev, Mod. Phys. Lett. A 30, 1550074 (2015). arXiv:1401.4031v4 [math-ph]
V.A. Naumov, D.S. Shkirmanov, Mod. Phys. Lett. A 30, 1550110 (2015). arXiv:1409.4669v2 [hep-ph]
W. Hampel et al. (GALLEX Collaboration), Phys. Lett. B 420, 114 (1998)
J.N. Abdurashitov et al. (SAGE Collaboration), Phys. Rev. C 59, 2246 (1999). arXiv:hep-ph/9803418v4
J.N. Abdurashitov et al. (SAGE Collaboration), Phys. Rev. C 73, 045805 (2006). arXiv:nucl-ex/0512041
V.V. Barinov et al., Phys. Rev. Lett. 128(23), 232501 (2022). arXiv:2109.11482v2 [nucl-ex]
V.V. Barinov et al., Phys. Rev. C 105, 065502 (2022). arXiv:2201.07364v3 [nucl-ex]
B.V. Fedosov, Deformation Quantization and Index Theory (Akademie Verlag, Berlin, 1996). ISBN:3-05-501716-1
F.W.J. Olver, D.W. Lozier, R.F. Boisvert, Ch.W. Clark (eds.), NIST Handbook of Mathematical Functions (National Institute of Standards and Technology and Cambridge University Press, Cambridge, 2010), ISBN:978-0-521-19225-5, 978-0-521-14063-8
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix A: SBL asymptotics of neutrino propagator
The aim of this Appendix is to represent the integralFootnote 10
where \(\varPhi (q)\) is defined by Eq. (14), through certain asymptotic expansions over several Lorentz-invariant and rotation-invariant dimensionless quantities, guaranteed to be small in absolute value. As will be shown, the resulting representation is applicable at relatively short macroscopic distances \(\vert \varvec{X}\vert \) and times \(X_0\), sufficiently high (not necessary ultrarelativistic) neutrino energies \({\mathcal {P}}_0\), and not too high neutrino virtualities \({\mathcal {P}}^2-m^2\), where \({\mathcal {P}}=({\mathcal {P}}_0,\varvec{{\mathcal {P}}})\) is the saddle point of \(\varPhi (q)\). The exact meaning of these restrictions will become clear from what follows. The results obtained below are quite general, in the sense that they do not apply only to neutrinos.
1.1 A.1 Transformation of integral J(X)
Performing the 4d Fourier transform
and substituting Eq. (A2) to integral (A1) we get
Considering the two possibilities \(x_0 \gtrless X_0\) and using the residue theorem, we obtain
where \(E_{\varvec{q}}=\sqrt{\varvec{q}^2+m^2}\). By substituting this result into Eq. (A3) and changing the order of integration, we can rewrite integral (A3) as
Using Eq. (A2) it can be shown that
Applying this identity and shifting the integration variable from \(x_0\) to \(X_0-x_0\), we rewrite Eq. (A4) as
It is convenient to split this integral into two parts,
where
Now, representing \(\varPhi (q)\) as a 1d Fourier integral
we have
It is readily seen that
where \(\theta \) is the Heaviside step function. Therefore
In exactly the same way, we get
To move on, we must use the explicit form of \(\varPhi (q)\) given by Eqs. (14) and (15). Let us rewrite expression (15) as follows:
Now, applying Eqs. (A6) and (A9), we obtain an explicit expression for the function \(\varphi (\varvec{q},y_0)\):
Combining Eq. (A10) with Eqs. (A7) and (A8) we arrive at
where
Given Eqs. (14), (A5), and (A11), we obtain:
where
and
Here the following notation is used:
Although the functions \(J_g(X)\) and \(J_v(X)\) individually are not exact relativistic invariants, the partitioning (A12) is convenient because, as will be shown in Sect. 1, under some simple condition valid in the given (laboratory) frame of reference, the second term in Eq. (A12) can be neglected. This is sufficient for our purposes.
1.2 A.2 Fedosov’s lemma
Below we will make extensive use of the important lemma proved by Fedosov [61, p. 79], which states:
Let \(\varphi (x) = xAx+ax+a_0\) be a quadratic function on \({\mathbb {R}}^n\), where A is a complex symmetric matrix with a positive definite real part, a and \(a_0\) are a complex vector and a complex constant, respectively. Then for any (smooth) function f(x) the integral
may be represented in the form
where \(T_N(\tau _0,\tau )\) is the N-th order Taylor polynomial of f(x) at \(x=\tau _0=\text {Re}\,z_0\), and \(z_0\in {\mathbb {C}}^n\) is a stationary point of \(\varphi (z)\).
Note that the stationary point \(z_0\) (solution of the linear system \(\partial \varphi /\partial x_r=0\)) is given by \(z_0 = -\frac{1}{2}A^{-1}a\) and thus \(\varphi (z_0) = -\frac{1}{4}aA^{-1}a+a_0\). If f(x) is analytic at \(z_0\) then \(f(\tau +\tau _0)-T_N(\tau _0,\tau )\rightarrow 0\) as \(N\rightarrow \infty \) and
We will also need a corollary of this lemma: Let
where A, a, and \(a_0\) have the same meaning as above, \(b\in {\mathbb {R}}^n\), and \(t\in {\mathbb {R}}^1\). It has been proven [55] that
The extension of Fedosov’s lemma and its corollaries to Minkowski signature is trivial: as long as the integration variables are independent, it is only necessary to take care of the correct position of the Lorentz indices in the tensor and vector convolutions.
1.3 A.3 Integral \(J_g(X)\)
The integral (A13) can be identically represented as
where
The 4-vector \({\mathcal {P}}\) is the stationary point and \({\mathcal {R}}\) is the tensor such that \({\mathcal {R}}_{\mu \lambda }{\widetilde{\Re }}^{\lambda \nu }=\delta _\mu ^\nu \). Here it is also appropriate to recall that \(p^0 \equiv E_{\varvec{p}} \ne {\mathcal {P}}^0 \equiv {\mathcal {R}}^{0\mu }Y_\mu \) and \(q^0 \ne E_{\varvec{q}}\).
The seeming complication (representing \(J_g\) as a 5D integral) allows us to straightforwardly apply Eq. (A17), which, after some transformations, gives
Here
and other notations are explained in the main text. Note that the differential operator \(D_{\varvec{\tau }}\) is a \(3\times 3\) matrix, since the function \(f(q;y_0)\) does not depend on \(q_0\). A direct application of the operator \(e^{D_{\varvec{\tau }}}=\sum _n D_{\varvec{\tau }}^n/n!\) in Eq. (A18) shows that the function \(J_g(X)\) can be represented by the initial segment of the triple asymptotic series
in powers of three dimensionless real variables
Here \(\Sigma \) is the energy-dimension scalar defined by Eq. (25) and assumed small compared to the masses of in and out wave packets at the source and detector vertices and \(\rho \) is the tensor with dimensionless space-time components defined by Eq. (26). We emphasize that \(\delta \) and \(\Delta \) are Lorentz scalars, while \(\eta \) is a rotation invariant. Therefore, the dimensionless coefficient functions \(f_{abc}\) must also be invariant to rotation. Under the natural conditions that follow from the formulation of the problem and from the formalism discussed in the main text, we see that
as \(\vert \varvec{X}\vert \simeq X_0\) (quasi-classical trajectories + ultrarelativism) and
as \(\vert \varvec{{\mathcal {P}}}\vert \simeq {\mathcal {P}}_0\) (suppression of too high virtualities + ultrarelativism + approximate energy–momentum conservation). From definition (26) and dimensional considerations, it follows that \(\rho _{\mu \nu } = O(1)\). Taking all this into account, we can conclude that
provided that the conditions
are both fulfilled. These conditions determine the range of applicability of expansion (A19) and, consequently, of the entire formalism under discussion. While condition (A20b) is trivial and is fulfilled with a margin in the modern neutrino experiments, the condition (A20a) means that the formalism is only applicable at sufficiently short distances and/or high neutrino energies. The area of applicability increases with decreasing \(\Sigma \). Several lower-order coefficients \(f_{abc}\) are:
Here
Obviously, the approximate equality \(\varrho \simeq \rho ^{\mu \nu }l_{\mu }l_{\nu }\) is valid only for ultrarelativistic neutrinos.
The presented results were derived using the computer algebra systems Maple and Maxima for a finite number of terms of the asymptotic series (A19). More specifically, the calculations were performed using the K-th order Taylor polynomial of the operator exponent \(e^{D_{\varvec{\tau }}}\). Up to the order \(K=8\) it is proved that \(f_{abc}=0\) at \(b>K-a\) and \(c>K-b\) for \(a \le K\). Hence
in Eq. (A19). The method for deriving the coefficients \(f_{abc}\) is reduced to the transformation of sums of products of tensor and vector components resulting from differentiation of \(\exp \left( \Omega _{\varvec{\tau }}\right) \) to the identical combinations of the rotation invariants \(\theta _s\), \(\omega _s\), and \(\varrho \). The calculations are simplified by dimensional and symmetry considerations, but are still intricate. Obviously, the required computer processing time grows exponentially as K increases, so that the calculation of high-order coefficients becomes rather time-consuming. But for our purposes, it would be enough to make sure that the structure of series (A19) is correctly “guessed” even in the lowest (second) subleading order in \(\delta \), yielding the ISL violation.
It is convenient to rewrite Eq. (A19) in the equivalent exponential form
This representation is a bit more suitable since it facilitates calculation of the modulus squared transition amplitude. The corresponding coefficients \(F_{abc}\) can be obtained by re-expanding the power series (A19):
It is also instructive to represent the triple power series in the exponent in Eq. (A22) as
The explicit form of the real coefficient functions \(g_a\) and \(g'_a\) trivially follows from the expressions for the coefficients \(F_{abc}\). A simple analysis shows that \(\text {Im}\,{\mathcal {I}}\) is negligibly small under all experimental circumstances, but \(\text {Re}\,{\mathcal {I}}\) provide corrections of the order of \(O(\eta )\) and \(O(\Delta \)) to the decoherence factors and, more importantly, potentially measurable ISL-violating correction given by even powers of \(\delta \) (see main text).
1.4 A.4 Positivity of \(F_{200}\)
Let us prove that the coefficient function \(F_{200}\) is positive (we will need this fact in Sect. 7). The proof can be fulfilled by an orthogonal diagonalization of the symmetric positive definite \(3\times 3\) matrix \(\varvec{\rho }\):
Here \({\varvec{u}}=(u_1,u_2,u_3)^T\), \(r_k\) are the positive eigenvalues of \(\varvec{\rho }\), \(\varvec{O}\) is an orthogonal matrix of rotation, and \(\varvec{v}\) is defined in Eq. (A21). In this representation, using definitions (A21) we find:
where \(\eta _k^{ln}=1\) for a cyclic permutation of (1, 2, 3) and \(\eta _k^{ln}=0\) otherwise. It is pertinent to note here that the coefficient function \(f_{200}\) in Eq. (A19) is also positive, since
A less formal way to prove the same is to rewrite \(F_{200}\) in the coordinate system whose z axis is directed along the vector \(\varvec{v}\). In this system
where \(\text {v}\equiv \vert \varvec{v}\vert =\vert \varvec{v}'\vert \), \(\text {v}\le 1\). For massless neutrino \(F_{200} = \rho _{11}^2+2\rho _{12}^2+\rho _{22}^2\), i.e. \(F_{200}\) is determined only by the transverse spatial components of the tensor \(\rho \).
1.5 A.5 Integral \(J_v(X)\)
Applying Fedosov’s lemma to Eq. (A14), we obtain:
We mention in passing that the 3d stationary point \(\varvec{{\mathcal {P}}}\) and operator \(D_{\varvec{\tau }}\) are exactly the same as above. From the known asymptotic expansion of the complementary error function of complex variable (see, e.g., Ref. [62, p. 164]) it follows that
Combining this and Fedosov’s expansions, the expression (A25) can be represented by the formal series
in powers of the real variable
and inverse powers of two complex variables
The specific structure of this series was checked only to the second order of the Taylor polynomials of \(w_\pm (\varvec{\tau })\) and \(e^{D_{\varvec{\tau }}}\), which is enough for our purposes. The lowest-order real dimensionless coefficients \({\mathcal {A}}^{\pm }_{ab}\) and \({\mathcal {B}}^{\pm }_{ab}\) are found to be
where
and the functions \(\omega _{1,2}\) and \(\theta _{1,2}\) are defined by Eq. (A21). The coefficients \({\mathcal {A}}^{\pm }_{ab}\) and \({\mathcal {B}}^{\pm }_{ab}\) were derived by the same method as described at the end of Sect. A.3. Obviously, \(\kappa \sim \Sigma /{\mathcal {P}}_0 \ll 1\), but the series in powers \(1/\kappa _{\pm }\) can be considered asymptotic expansions only at \(\text {Re}\,\kappa _\pm \gg 1\), which, under the obvious presumption \({\widetilde{\rho }}_{00}=O(1)\), is equivalent to the condition (24). Let us show that the relative contribution of \(J_v(X)\) is completely negligible under this condition.
1.6 A.6 Comparison of \(J_g(X)\) and \(J_v(X)\)
The patterns of the asymptotic expansions (A19) and (A26) allows us to estimate the relative contribution of \(J_g(X)\) and \(J_v(X)\) to the modulus square of the amplitude (11), using the leading-order terms of these expansions (that is the condition \({\Sigma }X_0 \gg 1\)) and neglecting the neutrino virtuality.Footnote 11 Then after simple calculations we get
where
and, according to conditions (A20),
Given the condition (24), the relative contribution of \(J_v(X)\) to the modulus squared amplitudeFootnote 12 is exponentially small, being proportional to
that is, to a good approximation,
where \(J_g(X)\) is given by Eq. (A22). In the main text, we restrict ourselves to this approximation, since for realistic values of the scale parameter \(\Sigma \), the condition (24) is satisfied in all potentially interesting applications of the formalism. It is, however, a straightforward though tedious task to get rid of this approximation in exotic circumstances when \({\Sigma }X_0 \gtrsim 1\), provided that a small number of terms in expansion (A26) is sufficient. In the context of the ISLV effect discussed in Sect. 7, it is interesting to note that accounting for the contribution of \(J_v\), as a NLO correction would lead to an additional reduction in the expected number of neutrino-induced events, i.e., \(J_v\) does not compete with the main ISLV correction.
Appendix B: Enhanced expression for \(N_{\beta \alpha }\)
Here we give an expression for the number of events obtained without using the ultrarelativistic approximation, but still neglecting the (LO) ISLV corrections \(\propto \vert \delta _i+\delta _j\vert ^2 \propto \Sigma ^4\vert \varvec{y}-\varvec{x}\vert ^2/\vert \varvec{q}\vert ^2\) and corrections to the oscillation phase \(\propto \vert \delta _i-\delta _j\vert \propto \vert {\Delta }m_{ij}^2\vert \Sigma ^2\vert \varvec{y}-\varvec{x}\vert /\vert \varvec{q}\vert ^3\), which follow from the expansion (A23) and Eq. (23b). So, with the reservations made, the number of events can be written in the form
where the following notations are used:
All notations not listed here are explained in the main text. This is a formal generalization of Eqs. (62) and (63) (see also Table 1), which can be of utility, for example, when analyzing possible experimental implications of models with “heavy” neutrinos. In practical terms, this result allows us to quantify the accuracy of the ultrarelativistic approximation.
We deliberately do not distinguish here the factors that could be treated as “flavor transition probability” and “instrumental function”. Recall again that since all structures in the formula are rotation invariants, the unit vector \(\varvec{l}\) can be oriented in any fixed direction independent of the vector \(\varvec{y}-\varvec{x}\), for example, along the z axis of a given coordinate system; then all tensors and vectors must be written in this very coordinate system.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Funded by SCOAP3. SCOAP3 supports the goals of the International Year of Basic Sciences for Sustainable Development.
About this article
Cite this article
Naumov, V.A., Shkirmanov, D.S. Virtual neutrino propagation at short baselines. Eur. Phys. J. C 82, 736 (2022). https://doi.org/10.1140/epjc/s10052-022-10670-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1140/epjc/s10052-022-10670-w