1 Introduction

The diagonalization of a Hamiltonian, H, is a ubiquitous, non-linear problem that appears frequently in many sub-fields of physics. In particle physics, it appears in a variety of topics, including the extension of the Higgs sector, axion physics and neutrino oscillations physics, to name just a few, see for example Refs. [1,2,3]. Diagonalization of a Hamiltonian requires the calculation of the eigenvalues and the eigenvectors of the Hamiltonian. The calculation of the eigenvalues, known widely in the hep community, is performed by finding the solutions to the characteristic equation, \(\textbf{Det}[\lambda I-H] =0\), where I is the identity matrix. This is either solved exactly for Hamiltonians of rank \(\le \) 4 or by some approximate method. The solutions to this non-linear equation are the eigenvalues, labelled \((\lambda _i, \ldots , \lambda _n)\). Obtaining the associated eigenvector, \(v_i\), for a given eigenvalue, \(\lambda _i\), requires solving the simultaneous set of linear equations given by \(Hv_i =\lambda _i v_i\), where \(v_i\) is a column vector. A frequently used method to algebraically obtain the eigenvectors is to use some variant of Gaussian elimination on the set of linear equations given by \((H-\lambda _i I)v_i=0\). This method, while simple enough for Hamiltonians of rank 2, becomes quite cumbersome for algebraic calculations for higher rank Hamiltonians. Another, though less frequently used, technique is the Lagrange method which relates products of the components of the i-th eigenvector to the matrix \(\Pi _{j \ne i} (H-\lambda _j I)\). In the Lagrange method, one needs all the eigenvalues of H to obtain a single eigenvector. A third method that does not seem to appear in the particle physics literature, but appears in the mathematics literature, relates products of the components of the ith eigenvector to the matrix \(\text { \textbf{Adjugate}}[\lambda _i I-H]\), where the adjugate is the transpose of the co-factor matrix. This adjugate matrix is a rearrangement of the Lagrange matrix as is shown in Appendix D. Furthermore, there is an exceptionally simple algorithm called the LeVerrier–Faddeev algorithm which gives you both the determinant and the adjugate of the matrix \([\lambda I-H]\) as a power series in \(\lambda \), quantities which are needed in the calculation of both the eigenvalues and the eigenvectors respectively.

If a purely numerically solution is required, there exist very efficient numerical algorithms for finding the eigenvalues and eigenvectors of a Hamiltonian that have been optimized for large-rank (\(\gg \) 10) Hamiltonians. However, purely numerical solutions usually limit the understand of the physics contained in the Hamiltonian. Therefore, for Hamiltonians of smaller rank, say \(\le \) 6, there is significant utility in understanding the physics by understanding the analytic structure of the eigenvectors. In addition, for low-rank matrices the unique features and algebraic form of the Hamiltonian under study means that one can beat the highly efficient general algorithms for purely numerical evaluations.

The Adjugate method is ideally suited to neutrino oscillations in matter given the form of the neutrino Hamiltonian and the majority of this paper uses examples from neutrino oscillation physics. However, the techniques applied here for diagonalizing a Hamiltonian have much greater general applicability in particle physics. If the reader is more interested in these general techniques than neutrino oscillation physics, we suggest going straight to Sect. 3 and associated appendices for a comprehensive review of this diagonalization method, as the rest of this introduction focuses on neutrino oscillation physics in matter, a topic that is of central importance for both current and future long baseline neutrino oscillation experiments.

The Wolfenstein matter effect [4] has been and continues to be very important for exploring neutrino mixing and masses. It is the solution to the solar neutrino anomaly [5, 6], discovered by Davis et al. [7] and in particular the SNO experiment [8] determined \(|U_{e2}|^2\) (\(\sin ^2 \theta _{12} \approx 0.3\)) and that the signFootnote 1 of \(\Delta m^2_{21} \) was positive, i.e. that the state with most \(\nu _e\), \(\nu _1\), is lighter than state with the next most \(\nu _e\), \(\nu _2\). The current long baseline (LBL) experiments, T2K [10] and NOvA [11], as well as the future experiments, Hyper-Kamiokande (HK) [12], Korean Neutrino observatory (KNO) [12] and DUNE [13], will determine the parameters \(|U_{\mu 3}|^2\) (\(\approx \sin ^2 \theta _{23}\)) and \(\Delta m^2_{32}\) including its sign. If \(|U_{\mu 3}|^2 >1/2\) (\(|U_{\mu 3}|^2 < 1/2\)) then \(\nu _\mu \) (\(\nu _\tau \)) dominates the neutrino mass eigenstate with the least \(\nu _e\), \(\nu _3\). The sign of \(\Delta m^2_{32}\) determines the atmospheric mass ordering, whether \(\nu _3\) is the heaviest or lightest mass state in the neutrino spectrum. Most importantly, these experiments will determine whether there is significant CP violation in the neutrino sector via the determination of the CP violating Jarlskog invariant [14] (or the phase \(\delta _{CP}\)).

None of these LBL experiments are performed in vacuum as the neutrino beams passes through 295–1300 km of the earth’s crust. Therefore the Wolfenstein matter effect must be included in the calculation of the oscillation probability and can have significant effects on the oscillation probability and the determination of the neutrino parameters for all of the LBL experiments. This is especially true for DUNE that will use the matter effects to determine the atmospheric mass ordering but it is also true for all the experiments for the determination of the size of CP violation, [15]. Although the density of the earth’s crust varies somewhat over the baseline of all the above LBL experiments it has been shown in [16, 17] that a constant density approximation gives accurate enough oscillation probabilities given the statistics of these experiments. For a review of CP violation in neutrino oscillations see [18].

The neutrino oscillation probabilities for three neutrino flavors in constant medium have been exactly calculatedFootnote 2 by many authors: Barger et al. [19], Zaglauer and Schwarzer [20], Harrison and Scott [21], Ohlsson and Snellman [22, 23], Kimura et al. [24, 25] as well as many approximation schemes reviewed in [26]. All of the exact methods first determine the squared masses of the neutrinos in the medium by solving the characteristic equation, \(\textbf{Det}(\lambda I -H)=0\), where \(\lambda \)’s are the eigenvalues of the Hamiltonian, H, and I is the identity matrix. Where these exact methods differ, is how one then determine the eigenvectors associated with the eigenvalues. Using the methods implored in the above papers the calculations of the eigenvectors are far from transparent. The eigenvectors make up the elements of the mixing matrix for the neutrino states in the medium and with these one can easily calculate the oscillation probabilities. In this paper, we present a different, efficient and simple way to determine the eigenvectors which is known in the mathematics community but so far has not appeared in the neutrino physics literature. We call this method the “adjugate method” as it requires the calculation of the adjugate, the transpose of the co-factor matrix, of the Hamiltonian, \(\textbf{Adjg}(H)\). Because of the form of the neutrino Hamiltonian in the flavor basis this adjugate method is particularly suited to neutrino oscillation calculations as we will show in Sect. 3. We use this method to identify a number of quantities that are invariant, that is, independent of the matter potential, in the various scenarios addressed. For the standard three flavor case these identities are known whereas for the other scenarios these identities are new such as the generalization of the Naumov-Harrison-Scott identity for four or more flavors of neutrinos.

The outline of this paper is as follows: in Sect. 2 we revisit what is required to calculate the oscillation probabilities in matter with particular attention to the intrinsic CP violating term. Section 3 we discuss the various methods for diagonalizing a Hamiltonian and present the “adjugate method” and why it is particularly suited to neutrino oscillation calculations. This section is comprehensive and technical. Section 4.1 is revisiting the well known three flavor case to demonstrate how simple and straight forward this method is. Section 5 is application of this method to four neutrino flavors, three active and one sterile, with particularly attention to the matter potential invariants such as the generalization of various identities, including the Naumov-Harrison-Scott identity. Section 6 addresses the case of three flavors plus non-standard interactions (NSI). Next is the conclusions, Sect. 7, followed by a number of very useful appendices.

2 Calculation of the oscillation probabilities

For neutrinos propagating through a constant medium, the oscillation probabilities can be easily calculated if you know the eigenvalues, \(\lambda _i\), of the Hamiltonian in the flavor bases and \( V_{\alpha i} V^*_{\beta i}\) of the normalized eigenvectors, \( V_{\alpha i} \), no matter how complicated the Hamiltonian. The \(\lambda _i\)’s and the \( V_{\alpha i} \)’s are the medium equivalents of \(m^2_i\) and elements of the PMNS matrix \(U_{\alpha i}\) in vacuum. In general the Hamiltonian is not the same for neutrinos and anti-neutrinos, so the \(\lambda _i\) and \( V_{\alpha i} V^*_{\beta i}\) need to be calculated for both. With these qualities the neutrino oscillation probabilities are given by

$$\begin{aligned} P(\nu _\alpha \rightarrow \nu _\beta )= & {} \left| ~\sum _j V^*_{\alpha j} ~V_{\beta j} \text {e}^{-i\lambda _j L/(2E)}~\right| ^2\nonumber \\= & {} \delta _{\alpha \beta } -4 \sum _{i>j} {\Re }( V_{\alpha i} V^*_{\beta i} \, V^*_{\alpha j} V_{\beta j} ) \sin ^2\left( \Delta _{ij} \right) \nonumber \\{} & {} -8 \sum _{i>j} {\Im }( V_{\alpha i} V^*_{\beta i} \, V^*_{\alpha j} V_{\beta j}) \, \nonumber \\ {}{} & {} \times \sin \Delta _{ij} \sin \Delta _{ik} \sin \Delta _{jk} \,, \end{aligned}$$
(1)

with \(\Delta _{ij} \equiv (\lambda _i -\lambda _j)L/(4E)\). There is no sum over k and it is fixed for the calculation, typically k = 1 is chosen. The last term contains the intrinsic CP violating piece and is only non-zero when \(\alpha \ne \beta \). The usual way of writing this term, as in the PDG,

$$\begin{aligned} 2\sum _{i>j} {\Im }( V_{\alpha i} V^*_{\beta i} V^*_{\alpha j} V_{\beta j} ) \sin \left( 2\Delta _{ij}\right) \end{aligned}$$

appears to have a linear dependence in (L/E), but this is illusionary, as this term must be of order \((L/E)^3\) in the small (L/E) limit as can be seen directly from the first line of this expression there can be no terms linear in (L/E). Therefore, it is more informative to rewrite the CP violating term as in Eq. (1), since

$$\begin{aligned}&2\sum _{i>j} {\Im }( V_{\alpha i} V^*_{\beta i} V^*_{\alpha j} V_{\beta j} ) \sin \left( 2\Delta _{ij}\right) \nonumber \\&\quad = 8 \sum _{i>j} {\Im }( V_{\alpha i} V^*_{\beta i} \, V^*_{\alpha j} V_{\beta j}) \, \sin \Delta _{ij} \sin \Delta _{ik} \sin \Delta _{jk} \end{aligned}$$
(2)

for any fixed k. So for n-flavors there are n ways to chose k, they are all equivalent. Typically k = 1 is chosen. There are only (n − 1)(n − 2)/2 non-zero terms on the RHS, as the terms when i = k or j = k are zero. Therefore the number of such terms is the same as the number of Dirac type phases in the n-flavor PMNS matrix. A derivation of Eq. (2) is given in Appendix A. Details for the case when n = 4 are presented in Sect. 5.

In the next section we will show how to calculate both \(\lambda _i\) and \( V_{\alpha i} V^*_{\beta i}\) in a systematic, straightforward and simple way, independent of the number of neutrino flavors and independent of the complexity of the Hamiltonian. Although the method for calculating \(\lambda _i\) is well known, the closely associated method for calculating \( V_{\alpha i} V^*_{\beta i}\) has not appeared in the neutrino physics literature before and to our surprise is not well known in the particle physics community. Once the \(\lambda _i\)’s are known, there is a simple matrix expression using the adjugate of a matrix that relates the \(\lambda _i\)’s and the Hamiltonian in flavor basis to the \( V_{\alpha i} V^*_{\beta i}\)’s.

3 Diagonalizing a Hermitian Hamiltonian

Let H be an n \(\times \) n Hermitian matrix that we want to diagonalize as follows:

$$\begin{aligned} H=V \Lambda V^\dagger \end{aligned}$$
(3)

where \(\Lambda =\text {Diag}(\lambda _1, \lambda _2, \ldots , \lambda _n)\) and \(V_{\alpha i}\) is a unitary matrix such that ith column of V is a normalized eigenvector of H with eigenvalue \(\lambda _i\), since \(HV=V\Lambda \). For simplicity we assume all eigenvalues are distinct.

It is well known that the eigenvalues are obtained be solving the characteristic equation

$$\begin{aligned} \textbf{Det}[\lambda _i I-H] =0. \end{aligned}$$
(4)

For \(n \le 4\) this can be exactly done analytically, but for n > 4 this cannot be done analytically in general so that approximate analytic methods must used or by numerical computation.

It is well known in the mathematics community, but rarely known in the particle physics community, that the components of the normalized, ith eigenvector are easily calculated using

$$\begin{aligned} V_{\alpha i}V^*_{\beta i} =\frac{\textbf{Adjg}[\lambda _i I-H]_{\alpha \beta }}{\mathbf{Tr[} \, \textbf{Adjg}[\lambda _i I-H] \, ]} \,, \end{aligned}$$
(5)

where Adjg is the adjugate of the matrix.Footnote 3 (the transpose of the co-factor matrix), see [27], and references therein.Footnote 4 We use the abbreviation \(\textbf{Adjg}\) here for Adjugate so as not to cause confusion with the adjoint (conjugate transpose). The denominator of this equation, which normalizes the eigenvector, can be rewritten in the many ways:

$$\begin{aligned}&\textbf{Tr} [ \, \textbf{Adjg}[\lambda _i I-H] \, ] =\textbf{Tr}[ \,\mathbf{\Pi }_{j\ne i} (H-\lambda _j I) \,] \nonumber \\&\quad = \frac{\partial }{\partial \lambda } \, \textbf{Det}[\lambda I -H]\, \biggr |_{\lambda =\lambda _i} =\mathbf{\Pi }_{k\ne i} (\lambda _i-\lambda _k) \,. \end{aligned}$$
(6)

For neutrino oscillations one needs exactly the quantities \( V_{\alpha i} V^*_{\beta i}\) which are given directly and simply from Eq. (5). If needed, this equation can also be used to calculate the eigenvectors in a simple, straight forward manner, up to an overall phase which is arbitrary. The quantity \( V_{\alpha i} V^*_{\beta i}\) given by Eq. (5) is independent of this arbitrary choice but can be used to calculate the magnitude, \( | V_{\alpha i} |\), and relative phase of each component of an eigenvector, since this phase is given by \( V_{\alpha i} V^*_{\beta i}/(| V_{\alpha i} | | V_{\beta i}|) \). A physicist’s style sketch of why Eq. (5) is correct is given in Appendix B.

With \(\lambda _i\)’s and the \(V_{\alpha i}V^*_{\beta i} \)’s constructed this way, Eq. (3) is satisfied and,Footnote 5

$$\begin{aligned} \sum _i V_{\alpha i}V^*_{\beta i} = \delta _{\alpha \beta } \quad \text {and} \quad \sum _\alpha V_{\alpha i}V^*_{\alpha j} = \delta _{ij}\,. \end{aligned}$$
(7)

Therefore the eigenvectors constructed this way are normalized and orthogonal.

Even less well known until recently, even in the mathematics community, is that the magnitudes of the elements of V can be simple, obtained using the eigenvector-eigenvalue identity: [3, 27],

$$\begin{aligned} |V_{\alpha i}|^2 =\frac{\Pi _{l} (\lambda _i-\lambda _l(h_\alpha ))}{\Pi _{k\ne i} (\lambda _i-\lambda _k)} \end{aligned}$$
(8)

where \(h_\alpha \) is the principal minor of H with the \(\alpha \)-th row and column deleted. \(\lambda _l(h_\alpha )\) are the eigenvalues of \(h_\alpha \), i.e. the solutions to \( \text {Det}[\lambda _lI-h_\alpha ]=0\). Constructed this way \(\sum _i |V_{\alpha i}|^2 = 1= \sum _\alpha |V_{\alpha i}|^2\), i.e. they are normalized appropriately. Equation (8) is mathematically equivalent to Eq. (5) when \(\alpha =\beta \), however the square of the diagonal elements are more simply obtained using Eq. (8). For standard three flavor oscillations in matter, Eq. (8) is sufficient, see [27], however for more complicated scenarios like 3 + 1 or NSIs then Eq. (5) is required. Equation (5) gives the most straight forward and systematic way to evaluate the magnitude and relative phase of all components of the eigenvectors and gives directly the quantities,\( V_{\alpha i} V^*_{\beta i}\), for neutrino oscillation calculations.

Therefore, for the evaluation of the eigenvalues we need, \(\textbf{Det}(\lambda I-H)\), and for the eigenvectors, \(\textbf{Adjg}(\lambda I-H)\) is also needed, see Eqs. (4) and 5. We first note that both are polynomials in \(\lambda \) of degree n and (n − 1) respectively, i.e.

$$\begin{aligned} \textbf{Det}[\lambda I-H]&\equiv \lambda ^{n} +\lambda ^{n-1} d_1+\cdots +d_n \end{aligned}$$
(9)
$$\begin{aligned} \textbf{Adjg}[\lambda I-H]&\equiv \lambda ^{n-1} A_1+\lambda ^{n-2} A_2+\cdots +A_n \,. \end{aligned}$$
(10)

where the \(d_i\)’s are scalars and the \(A_i\)’s are n \(\times \) n Hermitian matrices.

There are at least five of different algebraic methods that can be used to calculate \(\textbf{Det}[\lambda I-H]\) and \(\textbf{Adjg}[\lambda I-H]\), we review them here:

  1. 1.

    Using the Leibniz formulaeFootnote 6 to calculate the sub-determinants and hence the elements of the adjugate of \((\lambda I-H)\). However this standard method does not, in general, give the simplest results without significant additional manipulations.

  2. 2.

    Le Verrier–Faddeev algorithm: a more general method that is the very simple and recursive Le Verrier–Faddeev algorithm,Footnote 7 see also [28], given by

    $$\begin{aligned}&A_1 \equiv I_n \quad \quad&d_1=-\text {Tr}(H A_1) \nonumber \\{} & {} \, \quad = -\text {Tr}(H)=-\sum _j \lambda _j \nonumber \\&A_2= HA_1+d_1I_n. \quad&\quad d_2=-(1/2) \text {Tr}(H A_2) \nonumber \\&~\quad = H-\text {Tr}(H)I_n \quad&~\quad =(1/2)( \text {Tr}^2(H)\nonumber \\ {}{} & {} \quad -\text {Tr}(H^2))=\sum _{j1>j2} \lambda _{j1} \lambda _{j2} \nonumber \\ [-4mm]&\vdots \quad&\quad \quad \vdots \nonumber \\&A_n=HA_{n-1} +d_{n-1} I_n \quad&\quad d_n=-(1/n) \text {Tr}(H A_n)\nonumber \\&\, =(-1)^{n-1}{} \textbf{Adjg}[H]&\, \quad \quad =(-1)^n \textbf{Det}(H)\nonumber \\ {}{} & {} \qquad =(-1)^n\Pi ^n_{j=1} \lambda _j . \end{aligned}$$
    (11)

    The next iteration gives

    $$\begin{aligned} A_{n+1}=(-1)^{n-1}[H\textbf{Adjg} (H)- \textbf{Det}(H)I_n]=0, \end{aligned}$$

    which is related to the Caley–Hamiltonian identity. Then \(d_{n+1}=0\) follows, terminating the series. As expected, from Eq. (9), the d’s are the combination of traces of \(H^m\) (m=1,..., n) such that \(d_m = (-1)^m \sum _{ j1> \cdots >jm } \lambda _{j1} \cdots \lambda _{jm}\). In Appendix C the d’s and A’s are explicitly given for n\(\le \)6. It is worth noting here the following relationship between the trace of the A’s and the d’s: \(\textbf{Tr}[A_{m+1}] =(n-m)d_m\), valid for m=0, \(\ldots \), n. (using \(d_0 \equiv 1\)), and therefore giving

    $$\begin{aligned} \textbf{Tr}[ \textbf{Adjg}[\lambda I -H] ]=\frac{\partial }{\partial \lambda } \, \textbf{Det}[\lambda I -H] \end{aligned}$$
    (12)

    for any \(\lambda \).

  3. 3.

    Lagrange method: the results of the Le Verrier–Faddeev algorithm can be re-arranged, see Appendix D, as

    $$\begin{aligned} \textbf{Adjg}(\lambda _i I-H)&= \mathbf{\Pi }_{j\ne i} (H-\lambda _j I) \,. \end{aligned}$$
    (13)

    This is the Lagrange method that has been used in many papers: [29, 30]. The LHS of this equation only depends on the eigenvalue \(\lambda _i\) whereas the RHS depends on all the other eigenvalues, \(\lambda _j\)’s with \( j \ne i\). The LHS and RHS of Eq. (13) are the two extremes of the many ways to write this expression given the many relationships between the traces of the powers of the Hamiltonian and products of the eigenvalues, e.g. \(\sum _i \lambda _i=\text {tr}(H)\), \(\sum _{i>j} \lambda _i \lambda _j =\frac{1}{2}[\text {tr}^2(H)- \text {tr}(H^2)]\), etc. This implies that the denominator of Eq. (5), given in Eq. (6), can also be written as \( \textbf{Tr}[ \mathbf{\Pi }_{j\ne i} (H-\lambda _j I) ] \). For numerical calculations, the Le Verrier–Faddeev algorithm and using the Lagrange expression require a similar numbers of complex matrix operations. However, the Le Verrier–Faddeev algorithm gives coefficients of \(\lambda ^k\) for both the determinant and the adjugate of \((\lambda I -H)\) in one very simple iterative procedure. For purely numerical calculations, especially if the matrices are large, there exist many very efficient numerical packages. However for analytic neutrino oscillation calculations where the number of neutrino flavors is \(\le \)6, we will now argue that using the adjugate of \((\lambda I -H)\) is simpler, due to the form of the Hamiltonian, as we will see in the next item.

  4. 4.

    Adjugate method: for neutrino oscillation calculations one can use the special form of the Hamiltonian in the flavor basis as follows:

    $$\begin{aligned} H&=H_0 + H_1 \end{aligned}$$
    (14)

    where \(H_0\) is the vacuum Hamiltonian in the flavor basis, so that,

    $$\begin{aligned} H_0&=U X U^\dagger \nonumber \\ \text {with} \quad X&=\textbf{diag}(x_1, x_2, \ldots , x_n) \end{aligned}$$
    (15)

    where \(x_i=m^2_i/(2E)\) and U is unitrary matrix. The adjugate of \(H_0\) is simply given by

    $$\begin{aligned} \textbf{Adjg}[H_0]_{\alpha \beta }&= \sum _i U_{\alpha i} U^*_{\beta i} \, \Pi _{j\ne i} x_j \,, \end{aligned}$$
    (16)

    which follows from \(\textbf{Adjg}[U X U^\dagger ] =U\textbf{Adjg}[X] U^\dagger \). \(H_1\) contains the matter potential which for most cases is a sparse matrix in the flavor basis. For the standard 3 flavor case only one diagonal element of \(H_1\) is non-zero and for 3+1 flavors only 2 diagonal elements are non-zero. If \(H_1\) is sparse, then calculation of the \(\textbf{Adjg}[H]\) is simple and straight forward. A similar argument applies to \(\textbf{Det}[H]\). However, what is actually needed is \(\textbf{Det}[\lambda I-H]\) and \(\textbf{Adjg}[\lambda I-H]\), but these can be trivially calculated from \(\textbf{Det}[H]\) and \(\textbf{Adjg}[H]\), since

    $$\begin{aligned} \lambda I-H= U(\lambda I-X) U^\dagger - H_1 \, . \end{aligned}$$
    (17)

    Therefore, replacing \(x_i \rightarrow (x_i -\lambda )\) in \(\textbf{Det}[H]\) and \(\textbf{Adjg}[H]\) one obtains the required results, since

    $$\begin{aligned} \textbf{Det}[\lambda I-H(x_i)]&=(-1)^n ~ \textbf{Det}[H(x_i-\lambda )] \,, \nonumber \\ \textbf{Adjg}[\lambda I- H(x_i)]&=(-1)^{n-1} ~ \textbf{Adjg}[H(x_i-\lambda )] \,. \end{aligned}$$
    (18)

    Therefore one only needs to calculate \(\textbf{Det}[H]\) and \(\textbf{Adjg}[H]\), a considerable simplification. We call the method given here, the “adjugate method,” and it is the most straight forward and systematic for analytic neutrino oscillation calculations. This happens because the full Hamiltonian can be split into an easily diagonalized part, \(H_0\), and an additional piece, \(H_1=H-H_0\), that is a sparse matrix.

All of these methods, as well as combinations, can be used to find the required results, however, the last method given above, adjugate method, requires the least algebraic computation and gives simple expressions for neutrino oscillations because of the form of the Hamiltonian. This is the method that will be used in the rest of this paper for the numerator of Eq. (5).

A comment about the denominator of adjugate equation, Eq. (5), is in order as well. To evaluate this denominator it appears that you need all the eigenvalues of H, not just the eigenvalue of the eigenvector one is calculating. This is illusionary, as one can easily show that

$$\begin{aligned}&\varvec{\Pi }_{j\ne i} (\lambda _i-\lambda _j ) = \frac{\partial }{\partial \lambda } \, \textbf{Det}[\lambda I -H]\, \biggr |_{\lambda =\lambda _i} \nonumber \\&\quad = n\lambda ^{n-1}_i +(n-1)\lambda ^{n-2}_i d_1+\cdots +d_{n-1} \,, \end{aligned}$$
(19)

see for example Ref. [27]. \(\textbf{Det}[\lambda I -H]\) is just the characteristic polynomial and the coefficients are given by the \(d_i\)’s of Eq. (9). Thus, if you know the position of a zero of the characteristic polynomial and the slope of this polynomial at that zero, one has enough information using Eq. (5) to determine that eigenvector i.e. in physicist’s language the determination of the eigenvector is local in \(\lambda \), one does not need to know all the other eigenvalues. That is, the adjugate equation, Eq. (5), could have been written as

$$\begin{aligned} V_{\alpha i}V^*_{\beta i} = \biggr ( \textbf{Adjg}[\lambda I-H]_{\alpha \beta }\biggr / \frac{\partial }{\partial \lambda } \, \textbf{Det}[\lambda I -H] \biggr ) \, \biggr |_{\lambda =\lambda _i} \,, \end{aligned}$$
(20)

which manifestly only depends on the single eigenvalue of H, \(\lambda _i\), associated with the eigenvector given by \(V_{\alpha i}V^*_{\beta i}\). Of course frequently one may need all eigenvalues, if so, then RHS of Eq. (19) maybe the simplest route but it is not necessary for the determination of a single eigenvector. In addition Eq. (19) can also be used as a cross check.

Alternatively, since we require \(\sum _{\alpha } |V_{\alpha i}|^2=1\), then the denominator of Eq. (5) is clearly equal to the \(\textbf{Tr}[ \textbf{Adjg}[\lambda _i I-H]]\), see Appendix B, as all of these are equivalent since

$$\begin{aligned}{} & {} \mathbf{\Pi }_{j\ne i} (\lambda _i-\lambda _j ) =\frac{\partial }{\partial \lambda } \, \textbf{Det}[\lambda I -H]\, \biggr |_{\lambda =\lambda _i} \nonumber \\{} & {} \quad = \textbf{Tr}[ \, \textbf{Adjg}[\lambda _i I-H] \,] =\textbf{Tr}[ \,\mathbf{\Pi }_{j\ne i} (H-\lambda _j I) \,]. \end{aligned}$$

4 Relationship to previous works in neutrino physics

The method used in the papers of Kimura et al. (KTY), [24, 25], specifically for three neutrinos, is closely related but not exactly the calculation of \(\textbf{Adjg}(\lambda _i I-H)\) for a 3 \(\times \) 3 matrix. In the KTY papers the adjugate equation, Eq. (5), never appears, nor do the words “adjugate” or “classical adjoint” or “adjunct” appear, even though some of the results of their papers are closely related to what we find for three neutrinos in the next section. In fact, the results given in the KTY papers are midwayFootnote 8 between the calculations using \(\textbf{Adjg}(\lambda _i I-H)\) given in the next section and \( \mathbf{\Pi }_{j\ne i} (H-\lambda _j I)\) of Ref. [29]. There are multiple paths to extending the KTY method to more than three neutrinos, whereas \(\textbf{Adjg}(\lambda _i I-H)\) and \( \mathbf{\Pi }_{j\ne i} (H-\lambda _j I)\) are uniquely determined. The methods based on the Caley–Hamilton Theorem, \( \mathbf{\Pi }^n_{j=1} (H-\lambda _j I)=0\), such as papers [22, 23, 31], are more distantly related and are more complicated to implement.

Our method is very simple, general and systematic, and can be applied to more than 3 neutrinos or to more complicated matter potentials than the method in KTY. In some sense it can be considered a modification of the above methods that allows a straight forward generalization to more than 3 neutrinos or more complicated matter potentials.

4.1 3 flavors in matter: revisited

In this section we have used the adjugate method to calculate of the eigenvectors, i.e \(V_{\alpha i}V^*_{\beta i}\), and to reproduces the known results for 3 flavors. This not only demonstrates the simplicity of this method but also by reproducing known results confirms its validity for this important and non-trivial example.

For three flavors in matter the Hamiltonian is simply

$$\begin{aligned} (2E) H_{3x3}&=U M^2 U^\dagger + \text {Diag}(a,0,0) \,, \end{aligned}$$
(21)

where U is the PMNS matrix, \(M^2\) the mass matrix squared, \(M^2=\text {Diag}(m^2_1,m^2_2,m^2_3)\) and \(a=2\sqrt{2} G_F N_e E\) is the Wolfenstein matter potential, [4], where E is the energy of the neutrino in the rest frame of the matter. The term proportional to the number density of neutrons, has been subtract from all diagonal elements of the Hamiltonian, Eq. (21), as terms proportional to the identity matrix in the Hamiltonian do not effect oscillations. Now

$$\begin{aligned} \text {Det}[(2E) H_{3x3}]&=\Pi _i\, m^2_i+a \sum _{i} |U_{ei}|^2 \, \Pi _{j\ne i}m^2_j. \end{aligned}$$
(22)

Useful identities for this calculation can be found in Appendix E. Therefore by replacing \(m^2_i \rightarrow (m^2_i-\lambda )\) we obtain the characteristic equationFootnote 9

$$\begin{aligned}&\textbf{Det}[\lambda I-(2E) H_{3x3}]= \lambda ^3-\lambda ^2 \biggr ( \sum _i m^2_i +a \biggr ) \nonumber \\&\quad +\lambda \biggr ( \sum _{i>j} m^2_i m^2_j +a \sum _i m^2_i (1-|U_{ei}|^2) \biggr ) \nonumber \\&\quad -\biggr (\Pi _i m^2_i+ a \sum _{i} |U_{ei}|^2 \, \Pi _{j\ne i}m^2_j \biggr ) = 0 \, . \end{aligned}$$
(23)

Note that if one sets \((m^2_1,m^2_2,m^2_3)=(0, \Delta m^2_{21}, \Delta m^2_{31})\), Eq. (23) reduces to the results of Zaglauer–Schwarzer [20], and there are significant simplifications as any term with \(m^2_1\) is zero. In this paper we prefer not to do this so as to maintain the symmetry between the \(m^2_i\). The exact solutions to this cubic equation are given in Appendix F.

To obtain the eigenvectors, we need both the numerator and the denominator of the adjugate Eq. (5). The denominator can be simply evaluated using \(\textbf{Det}^\prime [\lambda I-(2E) H_{3x3}]\), see Eq. (19), with \(\lambda \) being the eigenvalue of the eigenvector. Note it is linear in the matter potential if one uses only the eigenvalue of the eigenvector being calculated.

The numerator is easily evaluated from \(\textbf{Adjg}[(2E) H_{3x3}]\), which is super simple to calculate using Eq. (16) to obtain the matter potential independent part whereas the matter dependent part is only linear in the matter potential and is given by

$$\begin{aligned}&\textbf{Adjg}[(2E) H_{3x3}]_{\alpha \beta } \nonumber \\&\quad = \sum _{i} U_{\alpha i}U^*_{\beta i} \, \Pi _{j\ne i} \, m^2_j \nonumber \\&\quad +a(1-\delta _{\alpha e})(1-\delta _{\beta e}) \left\{ \begin{array}{ll} \sum _i m^2_i |U_{\tau i}|^2 &{} \alpha =\beta = \mu \\ \sum _i m^2_i |U_{\mu i}|^2 &{} \alpha = \beta = \tau \\ - \sum _i m^2_i U_{\alpha i} U^*_{\beta i} \quad &{} \alpha \ne \beta \end{array} \right. \, . \end{aligned}$$
(24)

The matter potential independent part here is most easily calculated using the identity given in Eq. (16) whereas the matter dependent part is trivial.Footnote 10 Note, \(\textbf{Adjg}[(2E) H_{3x3}]\) is only linear in the matter potential like the determinant.

The eigenvectors are then calculated as follows:

$$\begin{aligned} V_{\alpha i}V^*_{\beta i}&\equiv \frac{\lambda ^2_i \delta _{\alpha \beta } - \lambda _i S_{\alpha \beta } + T_{\alpha \beta }}{\Pi _{l\ne i} (\lambda _i-\lambda _l)} \, , \end{aligned}$$
(25)

where the numerator, \(\textbf{Adjg}[\lambda I-(2E) H_{3x3}]\), can be easily obtained from \(\textbf{Adjg}[(2E) H_{3x3}]\) by replacing \(m^2_i \rightarrow (m^2_i-\lambda )\). Thus \(S_{\alpha \beta }\) and \(T_{\alpha \beta }\) are simply,

$$\begin{aligned} S_{\alpha \beta }&= \sum _i m^2_i (\delta _{\alpha \beta } - U_{\alpha i}U^*_{\beta i} ) +a \delta _{\alpha \beta }(1-\delta _{\alpha e}) \quad \text {and} \nonumber \\&\quad T_{\alpha \beta } =\text {Adjg}[(2E)H_{3x3}]_{\alpha \beta } \, . \end{aligned}$$
(26)

An alternative way to calculate the matrix S is via \(S=(2E)(Tr[H_{3x3}]I-H_{3x3})\). The non-zero terms linear in a, for matrix T are also given by \(Tr[X]I-X\) where X equals (2E) times the \(\mu -\tau \) sub-block of H.

Additional simplifications occur if one now choses the usual convention that \(m^2_1=0\), then the characteristic equation as well as \(S_{\alpha \beta }\)’s and \(T_{\alpha \beta }\)’s are all particularly simple,

$$\begin{aligned} \lambda ^3&-\lambda ^2 ( \Delta m^2_{21}+\Delta m^2_{31} +a ) +\lambda ( \Delta m^2_{21}\Delta m^2_{31} +a S_{ee} ) \nonumber \\ {}&\quad - T_{ee} = 0 \, , \nonumber \\ S_{\alpha \beta }&= \Delta m^2_{21}(\delta _{\alpha \beta } - U_{\alpha 2}U^*_{\beta 2} ) + \Delta m^2_{31}(\delta _{\alpha \beta } - U_{\alpha 3}U^*_{\beta 3} ) \nonumber \\ {}&\quad +a \delta _{\alpha \beta }(1-\delta _{\alpha e}) \, , \\ T_{\alpha \beta }&= \Delta m^2_{21} \Delta m^2_{31}U_{\alpha 1}U^*_{\beta 1} \nonumber \\&\quad +a(1-\delta _{\alpha e})(1-\delta _{\beta e}) \nonumber \\ {}&\quad \left\{ \begin{array}{ll} \Delta m^2_{21} |U_{\tau 2}|^2 +\Delta m^2_{31} |U_{\tau 3}|^2&{} \alpha =\beta = \mu \\ \Delta m^2_{21} |U_{\mu 2}|^2 +\Delta m^2_{31} |U_{\mu 3}|^2 &{} \alpha = \beta = \tau \\ -\Delta m^2_{21} U_{\alpha 2} U^*_{\beta 2} -\Delta m^2_{31} U_{\alpha 3} U^*_{\beta 3} \quad &{} \alpha \ne \beta \end{array} \right. \, .\nonumber \end{aligned}$$
(27)

These three equations summarize the three active neutrino cases since with the solutions to this characteristic equation, one can straightforwardly calculate the neutrino oscillation probabilities in matter using the remaining two equations together with Eqn. (1) and (25). The anti-neutrinos oscillation probabilities are obtained using negative energy and the mass ordering is determined by the sign \(\Delta m^2_{31}\). Note, this set of equations comes only from the simple evaluation of \( \textbf{Det}[(2E) H_{3x3}]\) and \( \textbf{Adjg}[(2E) H_{3x3}]\) which are related via the LeVerrier–Faddeev algorithm.

Furthermore, in the adjugate method where we only use the eigenvalue of the eigenvector we are calculating, then both the numerator and denominator, see Eq. (6), are quadratic polynomials in the eigenvalue with coefficients that are linear in the matter potential. Other formulations such as the Lagrange method do not have such a simple dependence, but of course, they give the same numerical values.

4.2 Matter potential invariant quantities for three flavors

The elements of the Hamiltonian given by Eq. (21) are nearly all independent of the matter potential, except for the \(H_{ee}\) component. In this section we will explain what the physics implications of this independence has on the oscillation probabilities. First we note that because we can re-phase the flavor states we must also take this into account when identify these physics implications.

First, the diagonal elements of the Hamiltonian give the “mass” of the flavor states:

$$\begin{aligned} m^2_\alpha = \sum _i |U_{\alpha i}|^2 m^2_i + a \delta _{\alpha e}= \sum _i |V_{\alpha i}|^2 \lambda _i \,. \end{aligned}$$
(28)

Note the “mass” of \(\nu _e\) is the exception here and depends linearly on the matter potential whereas the “masses” of \(\nu _\mu \) and \(\nu _\tau \) are independent of the matter potential.

Second the square of the off-diagonal elements of the Hamiltonian (\(|H_{\alpha \beta }|^2\), \(\alpha \ne \beta \)) are related to the appearance oscillation probabilities in the small L/E limit. Therefore in this limit the coefficients of the \((L/E)^2\) terms in the oscillation probabilities, i.e. CP or T conserving parts, must be invariant for \(\alpha \ne \beta \), [29]:

$$\begin{aligned}&\sum _{j>k} \mathcal{R}(V_{\alpha j}V^*_{\beta j} V^*_{\alpha k}V_{\beta k} )~ (\Delta \lambda _{jk})^2 \nonumber \\&\quad = \sum _{j>k} \mathcal{R}(U_{\alpha j}U^*_{\beta j} U^*_{\alpha k}U_{\beta k} )~ (\Delta m^2_{jk})^2\,. \end{aligned}$$
(29)

This identity identity is easily checked using the expressions given in the previous subsection and can be used as cross check.

Another identity, which is not independent of the above, is associated with the disappearance probabilities:

$$\begin{aligned} \sum _{j>k} |V_{\alpha j}|^2 |V_{\alpha k}|^2 (\Delta \lambda _{jk})^2&= \sum _{j>k} |U_{\alpha j}|^2 |U_{\alpha k}|^2 (\Delta m^2_{jk})^2 \end{aligned}$$
(30)

This identity guarantees that in \(L/E \rightarrow 0\) limit, the disappearance oscillation probabilities is independent of matter effects i.e. the the coefficients of the terms of order \((L/E )^2\) are equal in matter and vacuum. This identity follows from the matter potential independence of \(\sum _{\beta \ne \alpha }| H_{\alpha \beta }|^2\).

What about the intrinsic CP violating (CPV) or T violating part of the appearance probabilities? The CP or T violating part of the oscillation probabilities appears first an \((L/E)^3\) term given by

$$\begin{aligned} \Im (V_{\alpha i}V^*_{\beta i} V^*_{\alpha j}V_{\beta j} ) \Delta \lambda _{ij} \Delta \lambda _{ik} \Delta \lambda _{jk} \end{aligned}$$
(31)

and is also invariant. This is the Naumov–Harrison–Scott (NHS) invariant, [21, 32] which follows from the invariance of \(\Im ( H_{e \mu } H_{\mu \tau } H_{\tau e})\). Note, rephasing of the flavor states does not change this invariant. \(\Re ( H_{e \mu } H_{\mu \tau } H_{\tau e})\) gives no new invariant. The NHS identity can be easily derived from our expressions as follows: for \(\alpha \ne \beta \) and \(i \ne j\), we obtain

$$\begin{aligned}&V_{\alpha i}V^*_{\beta i} V^*_{\alpha j}V_{\beta j}\nonumber \\ {}&\quad = \frac{ \lambda _i \lambda _j |S_{\alpha \beta }|^2 -\lambda _i S_{\alpha \beta } T^*_{\alpha \beta } -\lambda _j S^*_{\alpha \beta } T_{\alpha \beta } +|T_{\alpha \beta }|^2 }{\Pi _{l\ne i} (\lambda _i-\lambda _l)~ \Pi _{k\ne j} (\lambda _j-\lambda _k)} \end{aligned}$$
(32)

and therefore the Jarlskog invariant, [14], is given by

$$\begin{aligned} \Im (V_{\alpha i}V^*_{\beta i} V^*_{\alpha j}V_{\beta j} )&= \frac{(\lambda _j -\lambda _i)~ {\Im }(S_{\alpha \beta } T^*_{\alpha \beta })}{\Pi _{l\ne i} (\lambda _i-\lambda _l)~ \Pi _{k\ne j} (\lambda _j-\lambda _k)} \nonumber \\&= \frac{ \Delta m^2_{ij} \Delta m^2_{ik} \Delta m^2_{jk}}{ \Delta \lambda _{ij} \Delta \lambda _{ik} \Delta \lambda _{jk}} ~~ {\Im }(U_{\alpha i}U^*_{\beta i} U^*_{\alpha j}U_{\beta j} ) \end{aligned}$$
(33)

where (i,j,k) are all different. This important identity as be used in many places to understand the effects of matter on CP and T violating oscillation probabilities, see for example [18, 33,34,35,36].

This next identity follows from the fact that matter potential is invariant under rotations in the \(\mu \) and \(\tau \) sectors. It is simple algebra to show that Kimura–Takamura–Yokomakura (KTY) identity, [25],

$$\begin{aligned} |V_{e1}|^2 |V_{e2}|^2 |V_{e3}|^2 (\Delta \lambda _{21} \Delta \lambda _{31} \Delta \lambda _{32})^2 \end{aligned}$$
(34)

is an invariant, i.e. independent of the matter potential. One only has to use that the \(\lambda _i\)’s satisfy the 3-flavor characteristic equation, Eq. 4 to prove this identity. When taking the square root of this identity care must be taken with the sign (phase) which depends on conventions, see [37].

The Toshev invariant, [38], is obtained by dividing the NHS identity by the square root of the KTY identity, i.e.

$$\begin{aligned} \left( \frac{\Delta \lambda _{ij} \Delta \lambda _{ik} \Delta \lambda _{jk}}{|\Delta \lambda _{ij} \Delta \lambda _{ik} \Delta \lambda _{jk}|}\right) \frac{ \Im (V_{\alpha i} V^*_{\beta i} V^*_{\alpha j} V_{\beta j})}{ |V_{e1} V_{e2} V_{e3}| } \, , \end{aligned}$$
(35)

where (i,j,k) are all different. Note, the piece of this expression in \(\left( \cdots \right) \) is either \(\pm 1\). Some care is needed with the phases of \(V_{ei}\) and signs of \(\Delta \lambda _{ij}\) when taking the square root KTY identity, as noted in reference [37]. Using the PDG PMNS matrix with all \(\Delta \lambda _{ij}>0\) one obtains the usual Toshev invariant (the invariance of \(\sin 2 \theta _{23} \sin \delta \)), however departures from the PDG conventions can introduce an additional sign. Equation (35) is the PMNS convention independent Toshev invariant which will be useful if a different PMNS convention than that of the PDG is used [39].

5 4 Flavors in matter

For four flavors in matter the Hamiltonian is simply

$$\begin{aligned} (2E) H_{4x4}&=U M^2 U^\dagger + \text {Diag}(a,0,0,b) \end{aligned}$$
(36)

where U is the 4 flavor PMNS matrix, \(M^2\) the mass matrix squared, \(M^2=\text {Diag}(m^2_1,m^2_2,m^2_3,m^2_4)\), “a” is the \(N_e\) matter potential with \(a=2\sqrt{2} G_F N_e E_\nu \) and “b” is the \(N_n\) matter potential with \(b=\sqrt{2} G_F N_n E_\nu \) where \(N_e\) and \(N_n\) are the number density for electrons and neutrons respectively.Footnote 11

The determinant is easily calculated as

$$\begin{aligned} \text {Det}[(2E) \, H_{4x4}]&= \Pi _i m^2_i + a \sum _i |U_{ei}|^2 \, \Pi _{j\ne i}m^2_j \nonumber \\ {}&\quad + b\sum _i |U_{si}|^2 \, \Pi _{j\ne i}m^2_j \nonumber \\&\quad +ab \sum _{i>j} |U_{e i} U_{s j}-U_{e j} U_{s i}|^2 \,\nonumber \\&\quad \times \Pi _{k \ne i, \,k \ne j} m^2_k \,. \end{aligned}$$
(37)

Here we use \(|U_{\mu i} U_{\tau j}-U_{\mu j} U_{\tau i}|^2 = |U_{e k} U_{s l}-U_{e l} U_{s k}|^2 \) with \((k,l)\ne (i,j)\). Then the characteristic equation

$$\begin{aligned} \text {Det}[\lambda I_4-(2E)H_{4x4}]&=\lambda ^4-f_1 \lambda _3 +f_2 \lambda ^2 -f_3 \lambda +f_4\nonumber \\&=0 \end{aligned}$$
(38)

is easily obtained by making the \(m^2_i \rightarrow (m^2_i -\lambda )\) substitution in 37, which gives

$$\begin{aligned} f_1&= \sum _i m^2_i +a +b \end{aligned}$$
(39)
$$\begin{aligned} f_2&= \sum _{i>j} m^2_i m^2_j +a \sum _i m^2_i (1-|U_{ei}|^2) \nonumber \\&\quad +b\sum _i m^2_i(1- |U_{si}|^2) +ab \end{aligned}$$
(40)
$$\begin{aligned} f_3&= \sum _i \, \Pi _{j\ne i} m^2_j\nonumber \\&\quad +a \sum _{i>j} m^2_i m^2_j(1-|U_{ei}|^2 -|U_{ej}|^2) \nonumber \\&\quad +b \sum _{i>j}m^2_i m^2_j(1-|U_{si}|^2 -|U_{sj}|^2) \nonumber \\&\quad +ab\sum _{i} m^2_i (1-|U_{ei}|^2 -|U_{si}|^2 ) \end{aligned}$$
(41)
$$\begin{aligned} f_4&=\text {Det}[(2E)H_{4x4}] =\Pi _i m^2_i \nonumber \\&\quad + a \sum _i |U_{ei}|^2 \, \Pi _{j\ne i}m^2_j + b\sum _i |U_{si}|^2 \, \Pi _{j\ne i}m^2_j \nonumber \\&\quad +ab \sum _{i>j} |U_{e i} U_{s j}-U_{e j} U_{s i}|^2 \, \Pi _{k \ne i, \,k \ne j} m^2_k \, . \end{aligned}$$
(42)

See Appendix E for some useful identities. This is in agreement with Ref. [40],

The solution to the 4 flavor characteristic equation is as follows:

$$\begin{aligned} 0&=\lambda ^4-f_1 \lambda ^3 +f_2 \lambda ^2 -f_3 \lambda +f_4\end{aligned}$$
(43)
$$\begin{aligned}&=(\lambda ^2 -.\frac{1}{2}(f_1-\sqrt{f_1^2+4(y-f_2)}~)\lambda \nonumber \\ {}&\quad + \frac{1}{2}(y+\sqrt{y^2-4f_4})) \end{aligned}$$
(44)
$$\begin{aligned}&\quad \times (\lambda ^2-\frac{1}{2}(f_1+\sqrt{f_1^2+4(y-f_2)} ~)\lambda \nonumber \\ {}&\quad +\frac{1}{2}(y-\sqrt{y^2-4f_4})) \end{aligned}$$
(45)

where

$$\begin{aligned} y^3 -f_2 y^2 +(f_3f_1-4f_4) y + f_4(4f_2-f^2_1)-f^2_3=0\, . \end{aligned}$$
(46)

The solution to this cubic equation can be found in the Appendix F. Therefore the solutions are then simply given by the solutions of the quadratics

$$\begin{aligned}&\lambda ^2 -.\frac{1}{2}(f_1-\sqrt{f_1^2+4(y-f_2)}~)\lambda + \frac{1}{2}(y+\sqrt{y^2-4f_4}) \nonumber \\ {}&\quad =0 \end{aligned}$$
(47)
$$\begin{aligned}&\lambda ^2-\frac{1}{2}(f_1+\sqrt{f_1^2+4(y-f_2)} ~)\lambda +\frac{1}{2}(y-\sqrt{y^2-4f_4})\nonumber \\ {}&\quad =0 \,. \end{aligned}$$
(48)

The solutions to four flavor cubic, Eq. (46), is the challenging part to solving the four flavor characteristic equation.

To obtain the eigenvectors, we first calculate the \(\textbf{Adjg}[(2E) H_{4x4}]\), which is given by

(49)

where \(\mathcal{W}_{\alpha \beta }\) (\(\mathcal{W}_{\alpha \beta \gamma }\)) is the matrix formed by the \(\alpha \beta \) (\(\alpha \beta \gamma \)) rows and columns of (2E)H with a=b=0, i.e. only the vacuum components. Using \(W_{\alpha \beta } \equiv \sum _i U_{\alpha i} U^*_{\beta i} m^2_i\),

$$\begin{aligned}&\mathcal{W}_{\mu ,\tau } = \begin{pmatrix} W_{\mu \mu } &{} W_{\mu \tau } \\ W_{\tau \mu } &{} W_{\tau \tau } \end{pmatrix} \quad \text {and} \quad \nonumber \\ {}&\mathcal{W}_{\mu ,\tau ,s} = \begin{pmatrix} W_{\mu \mu } &{} W_{\mu \tau } &{} W_{\mu s} \\ W_{\tau \mu } &{} W_{\tau \tau } &{} W_{\tau s} \\ W_{s \mu } &{} W_{ s \tau } &{}W_{ss} \end{pmatrix}. \end{aligned}$$
(50)

Where \(\textbf{Adjg}[\mathcal{W}_{\mu ,\tau }]=\text {tr}(\mathcal{W}_{\mu ,\tau })I - \mathcal{W}_{\mu ,\tau }\) and the elements of \(\textbf{Adjg}[\mathcal{W}_{\mu ,\tau ,s}] \) are bilinear combinations of the W’s and the identity

$$\begin{aligned} W_{\alpha \beta }W_{\gamma \delta } -W_{\alpha \delta }W_{\gamma \beta }&= \sum _{i>j} m^2_i m^2_j (U_{\alpha i} U_{\gamma j}-U_{\alpha j}U_{\gamma i} ) \nonumber \\&\quad \times (U_{\beta i} U_{\delta j}-U_{\beta j} U_{\delta i})^* \end{aligned}$$
(51)

is useful. Note, if \(\alpha = \gamma \) or \(\beta = \delta \) this bilinear sum is zero, so this combination never appears. Again see Appendix E.

The eigenvectors are then calculated as follows:

$$\begin{aligned} V_{\alpha i}V^*_{\beta i}&=\frac{\textbf{Adjg}[\lambda _i I_4 -(2E)H_{4x4}]_{\alpha \beta }}{\Pi _{j\ne i} (\lambda _i-\lambda _j)} \end{aligned}$$
(52)

with the numerator obtained from Eq. (49) by making the \(m^2_i \rightarrow (m^2_i-\lambda )\) replacement, which also implies a \(W_{\alpha \beta } \rightarrow W_{\alpha \beta }-\lambda \delta _{\alpha \beta }\) replacement. Defining

$$\begin{aligned} \eta _{\alpha \beta } \equiv 1- \delta _{\alpha \beta }, \end{aligned}$$

the opposite of the kronecker delta, we have

$$\begin{aligned}&\textbf{Adjg}[\lambda I_4-H_{4x4}]_{\alpha \beta } = \lambda ^3 \delta _{\alpha \beta } \nonumber \\&\quad -\lambda ^2 \biggr ( \sum _i m^2_i( \delta _{\alpha \beta } -U_{\alpha i}U^*_{\beta i}) +a \, \eta _{e\alpha } \, \delta _{\alpha \beta } +b \,\eta _{s\alpha } \,\delta _{\alpha \beta } \biggr ) \nonumber \\&\quad +\lambda \biggr ( \sum _{i>j} m^2_i m^2_j( \delta _{\alpha \beta } -U_{\alpha i}U^*_{\beta i}-U_{\alpha j}U^*_{\beta j}) \nonumber \\&\quad +a \, \eta _{e\alpha } \eta _{e\beta } \, (\mathcal{W}_{\mu ,\tau ,s}-\text {Tr}[\mathcal{W}_{\mu ,\tau ,s}] I_3)_{\alpha \beta } \nonumber \\&\quad + b \, \eta _{s\alpha }\eta _{s\beta } \, (\mathcal{W}_{e,\mu ,\tau }-\text {Tr}[\mathcal{W}_{e,\mu ,\tau }] I_3)_{\alpha \beta } + ab \, \eta _{e\alpha } \,\eta _{s\alpha } \, \delta _{\alpha \beta } \biggr ) \nonumber \\&\quad - \biggr ( \sum _i U_{\alpha i}U^*_{\beta i} ~ \Pi _{j \ne i}m^2_j +a \, \eta _{e\alpha } \eta _{e\beta } \, \textbf{Adjg}[\mathcal{W}_{\mu ,\tau ,s}]_{\alpha \beta } \nonumber \\&\quad +b \,\eta _{s\alpha }\eta _{s\beta } \,\textbf{Adjg}[\mathcal{W}_{e,\mu ,\tau }]_{\alpha \beta } \nonumber \\&\quad -ab\, \eta _{e\alpha } \eta _{e\beta } \, \eta _{s\alpha }\eta _{s\beta } \, (\mathcal{W}_{\mu ,\tau }-\text {Tr}[\mathcal{W}_{\mu ,\tau }] I_2)_{\alpha \beta } \biggr ) \, , \end{aligned}$$
(53)

where

$$\begin{aligned}&(\text {Tr}[\mathcal{W}_{\mu ,\tau ,s}] I_3-\mathcal{W}_{\mu ,\tau ,s})_{\alpha \beta } \\ {}&\quad = \sum m^2_i (\delta _{\alpha \beta } (1-|U_{ei}|^2)-U_{\alpha i}U^*_{\beta i} ) \,, \nonumber \\&\quad (\text {Tr}[\mathcal{W}_{e,\mu ,\tau }] I_3-\mathcal{W}_{e,\mu ,\tau })_{\alpha \beta } \\ {}&\quad = \sum m^2_i (\delta _{\alpha \beta } (1-|U_{si}|^2)-U_{\alpha i}U^*_{\beta i} ), \nonumber \\&\quad (\text {Tr}[\mathcal{W}_{\mu ,\tau }] I_2-\mathcal{W}_{\mu ,\tau })_{\alpha \beta } \\ {}&\quad =\sum m^2_i (\delta _{\alpha \beta } (1-|U_{ei}|^2-|U_{si}|^2)-U_{\alpha i}U^*_{\beta i} ). \end{aligned}$$

Everything is now available for calculating the eigenvectors, \(V_{\alpha i} V^*_{\beta i}\), as well as the oscillation probability using Eq. (1).

5.1 Matter potential invariant quantities for four flavors and beyond

As in the the \(3 \times 3\) case, the CP conserving identities are given by

$$\begin{aligned}&\sum _{j>k} \mathcal{R}(V_{\alpha j}V^*_{\beta j} V^*_{\alpha k}V_{\beta k} )~ (\Delta \lambda _{jk})^2 \nonumber \\&\quad = \sum _{j>k} \mathcal{R}(U_{\alpha j}U^*_{\beta j} U^*_{\alpha k}U_{\beta k} )~ (\Delta m^2_{jk})^2 \quad \alpha \ne \beta \,, \nonumber \\&\qquad \sum _{j>k} |V_{\alpha j}|^2 |V_{\alpha k}|^2 (\Delta \lambda _{jk})^2 = \sum _{j>k} |U_{\alpha j}|^2 |U_{\alpha k}|^2 (\Delta m^2_{jk})^2 \end{aligned}$$
(54)

These identities are easily derived from the matter potential independence of \(|H_{\alpha \beta }|^2\), \(\alpha \ne \beta \) in the flavor basis, and are valid when there is more than one sterile neutrino. In general these identities are valid whatever the matter potential is only along the diagonal in the flavor basis. As we will see in the next section, non-diagonal entries in the flavor basis of matter potential to the Hamiltonian break these identities.

The CP violating identities can be derived using the matter potential independence of

$$\begin{aligned} \sum _{\gamma \ne \alpha , \beta } \Im (H_{\alpha \beta } H_{\beta \gamma } H_{\gamma \alpha } ) \end{aligned}$$
(55)

\(\alpha \ne \beta \) and \(\alpha , \beta , \gamma = e, \mu , \tau , s \). This is the generalization of what was done for 3 flavors to 4 (or more flavors). This leads to a set of matter potential invariants that can be written as

$$\begin{aligned}&\sum _{i> j} \Im (V_{\alpha i} V^*_{\beta i} V^*_{\alpha j} V_{\beta j}) ~ \Delta \lambda _{ij} \Delta \lambda _{ik} \Delta \lambda _{jk} \nonumber \\&\quad =\sum _{i> j} \Im (U_{\alpha i} U^*_{\beta i} U^*_{\alpha j} U_{\beta j}) ~ \Delta m^2_{ij} \Delta m^2_{ik} \Delta m^2_{jk} \end{aligned}$$
(56)

for any fixed k. Note, when i = k or j = k, that term vanishes.

Using the notationFootnote 12\(J^{\alpha \beta }_{ij}(U) \equiv \Im (U_{\alpha i} U^*_{\beta i} U^*_{\alpha j} U_{\beta j}) \), then for 4 flavors the CP violating piece of the appearance oscillation probability was given in Sect. 2 as

$$\begin{aligned} \sum _{i>j} J^{\alpha \beta }_{ij}(V) \sin 2 \Delta _{ij}&= 4 J^{\alpha \beta }_{32}(V) \sin \Delta _{32} \sin \Delta _{21} \sin \Delta _{13} \nonumber \\&\quad + 4 J^{\alpha \beta }_{42}(V) \sin \Delta _{42} \sin \Delta _{21} \sin \Delta _{14} \nonumber \\&\quad + 4 J^{\alpha \beta }_{43}(V) \sin \Delta _{43} \sin \Delta _{31} \sin \Delta _{14} \end{aligned}$$
(57)

using k = 1. In L/E \(\rightarrow 0 \) limit, the coefficient of the \((L/E)^3\) terms is given by

$$\begin{aligned}&J^{\alpha \beta }_{32} (V) \, \Delta \lambda _{21} \Delta \lambda _{31} \Delta \lambda _{32} + J^{\alpha \beta }_{42}(V) \, \Delta \lambda _{21} \Delta \lambda _{41} \Delta \lambda _{42} \nonumber \\&\quad + J^{\alpha \beta }_{43} (V) \, \Delta \lambda _{41} \Delta \lambda _{31} \Delta \lambda _{43} \end{aligned}$$
(58)

and is therefore independent of matter potentials, i.e. is invariant. Such identities are the 4-flavor equivalents to the NHS ID. There are 6 of them, but there only 3 independent ones: if one chooses the independent ones to be \( (\alpha , \beta ) = (e,\mu ), (e,s) ~ \& ~ (\mu ,s)\) then the others \( (e, \tau ), ( \mu , \tau ) ~ \& ~ (\tau , s)\) are linear combinations as follows: since

$$\begin{aligned}&J^{e \tau }_{ij} =- J^{e \mu }_{ij}-J^{e s}_{ij}, \quad J^{\mu \tau }_{ij} =+J^{e \mu }_{ij}-J^{\mu s}_{ij}, \quad \nonumber \\&J^{\tau s}_{ij} =- J^{e s}_{ij}-J^{ \mu s}_{ij} \, , \end{aligned}$$
(59)

for \(i\ne j\). This is confirmed by the fact that there are 3 independent Dirac CP phases for 4 neutrino flavors. All of these identities have been confirmed using the expressions from the previous sub-section.

For n-flavor of neutrinos, with n-3 sterile neutrinos, the equivalent NHS invariant follows from the matter potential independence of

$$\begin{aligned} \sum _{\gamma \ne \alpha , \beta } \Im (H_{\alpha \beta } H_{\beta \gamma } H_{\gamma \alpha } ). \end{aligned}$$

The identities are

$$\begin{aligned}&\sum _{i>j} \Delta m^2_{ij} \Delta m^2_{jk} \Delta m^2_{ki} ~ J^{\alpha \beta }_{ij}(U)\nonumber \\ {}&\quad = \sum _{i>j} \Delta \lambda _{ij} \Delta \lambda _{jk} \Delta \lambda _{ki} \, J^{\alpha \beta }_{ij} (V) \end{aligned}$$
(60)

for k = (1, 2, \(\ldots \), n) and \(\alpha , \beta = (e, \mu , \tau , s_1, \ldots ) \). Although there are a large number of identities here only (n − 1)(n − 2)/2 are independent. A proof of this identity is given in Appendix G. Even for one sterile neutrino Eq. (60) is more general than Eq. (58), since k can take any value from 1 to 4.

6 3 flavors with off-diagonal non-standard interactions (NSI)

Here we sketch how the systematic methods presented earlier can be used for the case when there are non-standard interactions in the off-diagonal components of the Hamiltonian in the flavor basis.Footnote 13

6.1 NSI e\(\mu \)

The Hamiltonian is

$$\begin{aligned} (2E) H&=U M^2 U^\dagger + \left( \begin{array}{ccc} a&{} b&{} 0\\ b^*&{}0&{}0\\ 0 &{} 0 &{}0 \end{array} \right) \, , \end{aligned}$$
(61)

where in the usual notation \(b \equiv a \, \epsilon _{e \mu }\) in this subsection. The determinant is given by

$$\begin{aligned}&\textbf{Det}[(2E)H] -\textbf{Det}[(2E)H_{3x3}] \nonumber \\&= -b(W_{\mu e}W_{\tau \tau }-W_{\mu \tau }W_{\tau e})\nonumber \\ {}&\quad -b^*(W_{e \mu }W_{\tau \tau }-W_{\tau \mu }W_{e \tau })-bb^*W_{\tau \tau } \,, \nonumber \\&= \sum _{P} m^2_i m^2_j (b \,U^*_{ek}U_{\mu k} +b^*\, U_{ek}U^*_{\mu k}) -bb^* \sum _i m^2_i |U_{\tau i}|^2 \end{aligned}$$
(62)

where \(\textbf{Det}[(2E)H_{3x3}]\) is given by Eq. (22). Then the characteristic equation can be calculated using the replacement \(m^2_i \rightarrow (m^2_i -\lambda )\), giving

$$\begin{aligned}&\textbf{Det}[\lambda I-(2E)H]= \textbf{Det}[\lambda I-(2E)H_{3x3}]\nonumber \\&\quad - \lambda (bW_{\mu e} +b^* W_{e \mu }) +bb^* W_{\tau \tau } =0 \,, \end{aligned}$$
(63)

where \(\textbf{Det}[\lambda I-(2E)H_{3x3}]\) is given by Eq. (23).

The eigenvectors can be determined from \(\textbf{Adjg}[(2E)H] \) in a similar fashion. First,

$$\begin{aligned}&\textbf{Adjg}[(2E)H] - \textbf{Adjg}[(2E)H_{3x3}] \\&\quad = - b \begin{pmatrix} 0 &{} W_{\tau \tau } &{} -W_{\mu \tau } \\ 0 &{} 0 &{} 0 \\ 0&{} -W_{\tau e } &{} W_{\mu e } \end{pmatrix} -b^* \begin{pmatrix} 0 &{} 0 &{} 0\\ W_{\tau \tau } &{} 0&{} -W_{e \tau } \\ -W_{\tau \mu } &{} 0 &{} W_{e \mu } \end{pmatrix}\\&\qquad - b b^* \begin{pmatrix} 0 &{} 0 &{} 0\\ 0 &{} 0 &{} 0 \\ 0 &{} 0 &{} 1 \end{pmatrix} \end{aligned}$$

where \(\textbf{Adjg}[(2E)H_{3x3}]\) is given by Eq. (24). Then by using the \(m^2_i \rightarrow (m^2_i -\lambda )\) replacement, we have

$$\begin{aligned}&\text { \textbf{Adjg}}[\lambda I-(2E)H] \nonumber \\&. \quad =\textbf{Adjg}[\lambda I- (2E)H_{3x3}]+ \lambda \begin{pmatrix} 0 &{} b &{} 0 \\ b^* &{} 0 &{} 0 \\ 0&{} 0 &{} 0 \end{pmatrix} \nonumber \\&\quad +( \textbf{Adjg}[(2E)H] - \textbf{Adjg}[(2E)H_{3x3}] ) \end{aligned}$$
(64)

where \(\textbf{Adjg}[(2E)H_{3x3}]\) and \(\textbf{Adjg}[\lambda I-(2E)H_{3x3}]\) are given by Eqs. (24) and (25), respectively.

Since, in the flavor basis, \(H_{e \tau }\) and \(H_{\mu \tau }\) are independent of the matter potential, therefore we have

$$\begin{aligned}&\sum _{j>k} \mathcal{R}(V_{\alpha j}V^*_{\beta j} V^*_{\alpha k}V_{\beta k} )~ (\Delta \lambda _{jk})^2\nonumber \\ {}&\quad = \sum _{j>k} \mathcal{R}(U_{\alpha j}U^*_{\beta j} U^*_{\alpha k}U_{\beta k} )~ (\Delta m^2_{jk})^2 \end{aligned}$$
(65)

is invariant for \((\alpha , \beta ) = (e, \tau ), (\tau , e), (\mu , \tau ) ~ \text {and} (\mu , \tau )\) but not \((e, \mu )\) or \((\mu , e)\). Adding \((\tau , e)\) and \((\tau , \mu )\) one also obtains

$$\begin{aligned} \sum _{j>k} |V_{\tau j}|^2 |V_{\tau k}|^2 (\Delta \lambda _{jk})^2&= \sum _{j>k} |U_{\tau j}|^2 |U_{\tau k}|^2 (\Delta m^2_{jk})^2 \end{aligned}$$
(66)

as an invariant. So in the small L/E, the \((L/E)^2\) coefficients of the CP conserving part of the oscillation probabilities are independent of the matter potential for \(\nu _e \leftrightarrow \nu _\tau \), \(\nu _\mu \leftrightarrow \nu _\tau \) and \(\nu _\tau \) disappearance but not for \(\nu _e \leftrightarrow \nu _\mu \) or \(\nu _e\) and \(\nu _\mu \) disappearance.

For the equivalent of the NHS identity, or CPV invariant, the dependence of \(H_{e \mu }\) on the matter potential means that \(\Im (H_{e \mu } H_{\mu \tau } H_{\tau e})\) also depends on the matter potential and therefore in the small L/E limit, the coefficient \((L/E)^3\) term of the CP violating part of the appearance oscillation probabilities depends on the matter potential for all channels, see also [30]. This statement is true if the off-diagonal matter effect appears in \(H_{e \tau }\) or \(H_{\mu \tau }\). So any off-diagonal contribution to the matter potential in the flavor basis renders the NHS expression matter potential dependent.

Similar results can be obtained for an NSI in the e\(\tau \) sector.

6.2 NSI \(\mu \tau \)

For an NSI component in the \(\mu \tau \) sector, the Hamiltonian is given by

$$\begin{aligned} (2E) H&=U M^2 U^\dagger + \left( \begin{array}{ccc} a&{} 0&{}0\\ 0&{}0&{}b\\ 0 &{} b^* &{}0 \end{array} \right) \,, \end{aligned}$$
(67)

where in the usual notation \(b \equiv a\epsilon _{\mu \tau }\) in this subsection. The determinant and adjugate of the Hamiltonian are slightly different as product terms between “a” and “b” appear:

$$\begin{aligned}&\textbf{Det}[(2E)H]\nonumber \\&\quad = \textbf{Det}[(2E)H_{3x3}] \nonumber \\ {}&\qquad -b(W_{ee} W_{\tau \mu }-W_{e \mu }W_{\tau e})-b^*(W_{ee}W_{\mu \tau }-W_{\mu e}W_{e \tau }) \nonumber \\&\qquad - a(bW_{\tau \mu }+b^*W_{\mu \tau }) -bb^*(a+W_{ee}) \end{aligned}$$
(68)

where \(\textbf{Det}[(2E)H_{3x3}]\) is given by Eq. (22). Similarly

$$\begin{aligned}&\textbf{Adjg}[(2E)H] = \textbf{Adjg}[(2E)H_{3x3}] \nonumber \\&\quad -b \begin{pmatrix} W_{\tau \mu } &{} 0 &{} -W_{e \mu } \\ -W_{\tau e } &{} 0 &{} W_{e e} \\ 0 &{} 0 &{} 0 \end{pmatrix} -b^* \begin{pmatrix} W_{\mu \tau } &{} -W_{e \tau }&{}0 \\ 0 &{} 0 &{} 0\\ -W_{\mu e} &{} W_{e e} &{}0 \end{pmatrix} \nonumber \\&\quad -a \begin{pmatrix} 0 &{} 0 &{} 0\\ 0 &{} 0 &{} b \\ 0 &{} b^* &{} 0 \end{pmatrix} -b b^* \begin{pmatrix} 1 &{} 0 &{} 0\\ 0 &{} 0 &{} 0 \\ 0 &{} 0 &{} 0 \end{pmatrix} \,. \end{aligned}$$
(69)

Both \(\textbf{Det}[\lambda I-(2E)H]\) and \(\textbf{Adjg}[\lambda I-(2E)H]\) can be calculated using the \(m^2_i \rightarrow (m^2_i -\lambda )\) procedure as before.

Using similar arguments as in \((e, \mu )\) case, in the small L/E, the \((L/E)^2\) coefficients of the CP conserving part of the oscillation probabilities are independent of the matter potential for \(\nu _e \leftrightarrow \nu _\mu \), \(\nu _e \leftrightarrow \nu _\tau \) and \(\nu _e\) disappearance but not for \(\nu _\mu \leftrightarrow \nu _\tau \) or \(\nu _\mu \) and \(\nu _\tau \) disappearance.

If the NSI components appear in more than one off-diagonal component of the Hamiltonian the calculation of \(\textbf{Det}[\lambda I-(2E)H]\) and \(\textbf{Adjg}[\lambda I-(2E)H]\) follows in a similar fashion but of course the resulting equations are more complicated. For this case it maybe simpler instead of using \(\textbf{Adjg}[\lambda _i I-(2E)H]\) in the numerator of Eq. (5) but to use \(\mathbf{\Pi }_{j\ne i} (H-\lambda _j I)\) as for three neutrinos only H and \(H^2\) appear.

7 Conclusion

Neutrino oscillations in matter is important for measuring the remaining unknown parameters in the neutrino sector as well as for the discovery of beyond the Standard Model physics in this sector. The LBL experiments in particular will discover the final piece of the neutrino mass ordering, \(\Delta m^2_{32} >\text {or}< 0\), determine whether the neutrino mass state with least \(\nu _e\) is dominated by \(\nu _\mu \) or \(\nu _\tau \), octant of \(\theta _{23}\), and determine the size and sign of CP violation in the neutrino sector, i.e. determine the Jarlskog invariant. All of the current and future LBL experiments, T2K, NOvA, HK, DUNE and KNO, have significant matter effects which must be included in their analysis. Understanding these matter effects for the Standard Model three flavor paradigm as well as for beyond the Standard Model scenarios is important for extracting the most physics out of these expensive experiments.

In this paper we have provided a new simple way to determine the mixing matrix in constant density matter once the mass eigenvalues have been determined. This method uses a known result in linear algebra that has not been used in the neutrino literature before and has general applicability in particle physics. We call this method the adjugate method as it uses the adjugate of the Hamiltonian to determine the mixing matrix in matter. This method is particularly well suited to neutrino oscillations due to the form of the Hamiltonian but is also very general and can used for any Hamiltonian. Once the mixing matrix in matter has been determined it is then very simple to determine the oscillation probabilities for these LBL experiments. We have also rewritten the CP violating part of the neutrino oscillation probability in a way that better reflects the physics of this important part of the oscillation probability, see Sect. 2, and is especially useful in our discussion of four or more flavor neutrino oscillations.

In Sect. 4.1, we have revisited matter effects for three flavor oscillations that has previously been addressed by many authors. The derivation here is particularly simple and straight forward such that we do not need the form of the PMNS matrix to obtain results. We have also derived all of the useful matter invariant quantities including the NHS identity Eq. (31) and the KTY identity Eq. (34) as well as identities related to the disappearance probabilities Eq. (30) and the CPC part of the appearance probabilities, Eq. (29). The scenario with three active and one sterile is addressed in Sect. 5. Again we derive simple expressions for the mixing matrix in matter and give a number of invariant quantities especially the generalization of the NHS identity to four or more neutrinos. This is a sum of three neutrino factors which reduces to three flavor result as the fourth neutrino is decoupled. In Sect. 6, we have given a sketch of how to apply this method to Hamiltonians with NSI and discuss what happens to the invariant quantities with different non-zero off-diagonal elements in the Hamiltonian in the flavor basis.

The adjugate method for determining the mixing matrix, i.e. the eigenvectors of the Hamiltonian, is of general applicability but is especially well suited to determining neutrino oscillation probabilities in constant density matter due to the form of the Hamiltonian. We have illuminated its use in a variety of scenarios from the three flavor Standard Model as well as additional sterile neutrinos and also with non-standard interactions. We have used this method to illuminate a variety of quantities that are independent of the matter potential in the various scenarios and discussed the physics of these invariants. Extension of the adjugate method to non-normal Hamiltonians will be the subject of a follow up paper.