In this section, we will analyze how to derive a differential equation that has the form of the Schödinger equation plus relativistic corrections up to the second order in the reciprocal of the speed of light \(\epsilon \) from the two differential equations (73) and (74). For the following discussion, we use this expansion for the wave functions \(\phi _\pm ^\epsilon \) [9, 10]
$$\begin{aligned} \phi _\pm ^\epsilon&:= \sum _{n=0}^\infty \epsilon ^n \phi _\pm ^n. \end{aligned}$$
(75)
The upper index of the wave functions \(\phi _\pm ^n\) is not a power but it is related to the n-th expansion order of \(\phi _\pm ^\epsilon \) in \(\epsilon \). The wave functions \(\phi _\pm ^\epsilon \) and \(\phi _\pm ^n\) can be distinguished by the fact that for the wave function \(\phi _\pm ^\epsilon \) the upper index has the dimension of a reciprocal velocity, but for the wave function \(\phi _\pm ^n\), the upper index is an integer greater than or equal to zero. However, for the special case \(\{\epsilon =0\), \(n=0\)}, the wave functions \(\phi _\pm ^\epsilon \) and \(\phi _\pm ^n\) cannot be distinguished—but this is not a problem, since it follows from Eq. (75) that for this case, the wave functions \(\phi _\pm ^\epsilon \) and \(\phi _\pm ^n\) are equal.
For other wave functions discussed below, where either an upper index \(\epsilon \) or n is used, an analogous argumentation is valid.
Now, the next step of our derivations of the differential equation mentioned above is to calculate the positronic wave function \({\phi _{-}^{\epsilon }}\) up to third order in \(\epsilon \).
Positronic wave function
\({\varvec{\phi }}_{\varvec{-}}^{\varvec{\epsilon }}\)
up to third order in
\({\varvec{\epsilon }}\)
Using Eqs. (51) and (63), we obtain from the differential equation for \(\phi _-^\epsilon \) (74):
$$\begin{aligned} \phi _-^\epsilon= & {} \frac{\epsilon }{2m_e}\,\pi _-^\epsilon \left( e{\mathcal {A}}_k\alpha ^k\phi _+^\epsilon \right) +\frac{\epsilon }{2m_e}\,\pi _-^\epsilon \left( e{\mathcal {A}}_k\alpha ^k\phi _-^\epsilon \right) \nonumber \\&-\frac{\epsilon ^2}{2m_e}\,\pi _-^\epsilon \left( eV\phi _+^\epsilon \right) -\frac{\epsilon ^2}{2m_e}\,\pi _-^\epsilon \left( eV\phi _-^\epsilon \right) \nonumber \\&-\frac{\epsilon ^2}{2m_e}\frac{{\hat{\mathbf{p}}}^2}{2m_e}\,\phi _-^\epsilon -\frac{\mathrm {i}\hbar \epsilon ^2}{2m_e}\,\partial _t\phi _-^\epsilon +{\mathcal {O}}(\epsilon ^5). \end{aligned}$$
(76)
Now, we realize that only the lowest order function \(\phi _-^0\) vanishes, that is
$$\begin{aligned} \phi _-^0= & {} 0, \end{aligned}$$
(77)
$$\begin{aligned} \Longrightarrow \phi _-^\epsilon= & {} \epsilon \phi _-^1+\epsilon ^2\phi _-^2+\epsilon ^3\phi _-^3+{\mathcal {O}}(\epsilon ^4). \end{aligned}$$
(78)
For the following transformations, Eqs. (48), (49), (68), and (69) for the projectors \(\pi _\pm ^\epsilon \) and \(\pi _\pm ^0\) will be useful. In addition, we need the following equations [13]:Footnote 2
$$\begin{aligned} \alpha ^k \beta&= - \beta \alpha ^k. \end{aligned}$$
(82)
Here,
is the 4\(\times \)4 unit matrix, \(\Sigma ^k\) is the following 4\(\times \)4 matrix
$$\begin{aligned} \Sigma ^k&= \left( \begin{array}{cc} \sigma ^k &{} 0 \\ 0 &{} \sigma ^k \\ \end{array} \right) \end{aligned}$$
(83)
being an extension of the 2\(\times \)2 Pauli matrices \(\sigma ^k\). In addition, \(\delta _{kl}\) is the Kronecker delta, where
$$\begin{aligned} \delta _{kl}&= \left\{ \begin{array}{c} 1 \; \; \text {if} \; \; k \; = \; l \\ 0 \; \; \text {if} \; \; k \; \ne \; l \end{array} \right. , \end{aligned}$$
(84)
and \(\varepsilon _{klm}\) is the Levi–Cevita symbol given by
$$\begin{aligned} \varepsilon _{klm}&= \left\{ \begin{array}{cl} 1 &{} \text {if} \; \; (k,l,m) \; \; \text {is an even permutation of (1,2,3)} \\ -1 &{} \text {if} \; \; (k,l,m) \; \; \text {is an odd permutation of (1,2,3)} \\ 0 &{}\text {if at least two indices are equal} \end{array} \right. . \end{aligned}$$
(85)
Moreover, for the operator \(\Sigma ^k\), the following calculation rule applies:
$$\begin{aligned} \pi _\pm ^0 \Sigma ^k&= \Sigma ^k \pi _\pm ^0. \end{aligned}$$
(86)
As the next step, the six summands in Eq. (76) have to be evaluated and then the sum of these summands has to be simplified to find the final result for \(\phi _-^\epsilon \) up to third order in \(\epsilon \). Since these calculations are quite long and cumbersome, we execute them in the Appendix A and state here just the final result:
$$\begin{aligned} \phi _-^\epsilon= & {} \frac{\epsilon e{\mathcal {A}}_k}{2m_e}\left( \alpha ^k-\epsilon \,\frac{{\hat{p}}_k}{m_e} -\epsilon ^2\alpha ^k\,\frac{2\hat{\varvec{\Pi }}^2-e^2\varvec{{\mathcal {A}}}^2}{4m_e^2}\right) \phi _+^\epsilon \nonumber \\&-\frac{\epsilon ^3 e\hbar }{4m_e^2}\left( \varvec{\Sigma }{\mathbf{B}} -\mathrm {i}E_k\alpha ^k\right) \phi _+^\epsilon +{\mathcal {O}}(\epsilon ^4). \end{aligned}$$
(87)
The quantity \(\varvec{\Sigma }\) appearing in Eq. (A.27) is just \(\varvec{\Sigma } \; = \; \Sigma ^k \mathbf{e}_k\). As an analogous quantity, we use below \(\varvec{\sigma } \; = \; \sigma ^k \mathbf{e}_k\), too.
Differential equation for
\({\varvec{\phi }}_{\varvec{+}}^{\varvec{\epsilon }}\)
up to second order in
\({\varvec{\epsilon }}\)
As the next step, we now derive the differential equation for \(\phi _+^\epsilon \) up to second order in \(\epsilon \). Therefore, we use Eqs. (51), (63), and (73) to get
$$\begin{aligned} \mathrm {i}\hbar \,\partial _t\phi _+^\epsilon= & {} -\frac{\hbar ^2}{2m_e}\,\Delta \phi _+^\epsilon -\frac{\epsilon ^2\hbar ^4}{8m_e^3}\,\Delta ^2\phi _+^\epsilon +\pi _+^\epsilon \left( \frac{1}{\epsilon }\,e{\mathcal {A}}_k\alpha ^k\phi ^\epsilon -eV\phi ^\epsilon \right) +{\mathcal {O}}(\epsilon ^4) \nonumber \\= & {} -\frac{\hbar ^2}{2m_e}\,\Delta \phi _+^\epsilon -\frac{\epsilon ^2\hbar ^4}{8m_e^3}\,\Delta ^2\phi _+^\epsilon +\frac{1}{\epsilon }\,\pi _+^\epsilon \left( e{\mathcal {A}}_k\alpha ^k\phi _+^\epsilon \right) +\frac{1}{\epsilon }\,\pi _+^\epsilon \left( e{\mathcal {A}}_k\alpha ^k\phi _-^\epsilon \right) \nonumber \\&-\pi _+^\epsilon \left( eV\phi _+^\epsilon \right) -\pi _+^\epsilon \left( eV\phi _-^\epsilon \right) +{\mathcal {O}}(\epsilon ^4). \end{aligned}$$
(88)
On the right side of Eq. (88), six summands appear in front of the \({\mathcal {O}}(\epsilon ^4)\) term. The first two of these six summands do not need to be simplified. Moreover, in the Appendix B, we explain how the last four of these six summands can be simplified. Then, we sum up in this appendix these six summands, simplify the resulting sum, and find the following final differential equation for \(\phi _+^\epsilon \) up to second order in \(\epsilon \):
$$\begin{aligned} \mathrm {i}\hbar \,\partial _t\phi _+^\epsilon= & {} \left( \frac{1}{2m_e}\,\hat{\varvec{\Pi }}^2-eV\right) \phi _+^\epsilon + \frac{\epsilon e \hbar }{2m_e} \varvec{\Sigma }{\mathbf{B}} \phi _+^\epsilon + \frac{\mathrm {i} \epsilon e\hbar }{2m_e} \alpha ^k\left( \partial _kV\right) \phi _+^\epsilon \nonumber \\&-\frac{\epsilon ^2}{8m_e^3}\,\hat{\varvec{\Pi }}^4\phi _+^\epsilon +\frac{\epsilon ^2e}{2m_e^2}\,\alpha ^k\left[ \left( \frac{1}{\epsilon }\,{\hat{p}}_k{\mathcal {A}}_l\right) e{\mathcal {A}}_l +\beta \left( \frac{1}{\epsilon }\,{\hat{p}}_l{\mathcal {A}}_k\right) {\hat{p}}_l\right] \phi _+^\epsilon \nonumber \\&-\frac{\epsilon ^2e^2\hbar }{4m_e^2}\left\{ \varvec{\Sigma }\left[ \varvec{{\mathcal {A}}}\times \left( \mathbf{E}-\left( \varvec{\nabla } V\right) \right) \right] +\mathrm {i}\varvec{{\mathcal {A}}}\left( \partial _t \varvec{{\mathcal {A}}}\right) \right\} \phi _+^\epsilon \nonumber \\&-\frac{\epsilon ^2e\hbar ^2}{4m_e^2}\,\beta \left( \Delta V-V\Delta \right) \phi _+^\epsilon +{\mathcal {O}}(\epsilon ^3). \end{aligned}$$
(89)
We mention here that Mauser also gives a differential equation for \(\phi _+^\epsilon \) up to second order in \(\epsilon \) in [9, 10]. However, Mauser’s differential equation deviates from our differential equation, because he uses in his calculation the vector potential \(\mathbf{A} \) as a quantity which is independent of \(\epsilon \), while we use the scaled vector potential \(\varvec{{\mathcal {A}}} \;= \; \epsilon \mathbf{A} \) instead as a quantity which is in leading order independent from \(\epsilon \) (see Eq. (14)). As an additional contrast to [9, 10], we regard in our calculations that spatial derivatives increase the leading order for an expansion in \(\epsilon \) of the scaled vector potential \(\varvec{{\mathcal {A}}}\) and the magnetic field \({\mathbf{B}} \) (see Eqs. (16) and (22)).
The differential equation for \(\phi _+^\epsilon \) (Eq. (89)) has already the form of a Schrödinger equation plus relativistic corrections up to second order in \(\epsilon \). Within this equation, there appear two relativistic correction terms which have already a form that one could expect from literature [12,13,14,15]:
These two terms are first the coupling term \( \frac{\epsilon e\hbar }{2m_e} \varvec{\Sigma }{\mathbf{B}} \, \phi _+^\epsilon \) of the magnetic field \({\mathbf{B}} \) with the operator \(\varvec{\Sigma }\), and second a relativistic correction to the kinetic energy which is \(-\frac{\epsilon ^2}{8m_e^3}\,\hat{\varvec{\Pi }}^4\phi _+^\epsilon \).Footnote 3
However, two other relativistic corrections to the Schrödinger equation in second order in the reciprocal of the speed of light \(\epsilon \), namely, these are the spin–orbit interaction and the Darwin term, cannot be found in Eq. (89) in the form we would expect them from the references [12,13,14,15]. However, for two reasons, these deviations are not too surprising:
The first reason is that the differential equation (89) is only a differential equation for the electronic wave function \(\phi _+^\epsilon \), but not for the full wave function \(\phi ^\epsilon \) being the sum of the electronic wave function \(\phi _+^\epsilon \) and the positronic wave function \(\phi _-^\epsilon \) (see Eq. (63)).
The second reason is that the above mentioned spin–orbit interaction and the Darwin term do not appear in a differential equation for a four-dimensional spinor like Eq. (89) but a two-dimensional one—so we have to search for a differential equation, where a two-dimensional spinor appears. As the first step for that, we define below several two-dimensional wave functions.
Introduction of different two-spinors
We begin with the introduction of the two-dimensional spinors \(\varphi _\pm ^\epsilon \) and \(\chi _\pm ^\epsilon \)—using these spinors we can write the four-dimensional spinors \(\phi _\pm ^\epsilon \) as
$$\begin{aligned} \phi _\pm ^\epsilon&= \left( \begin{array}{c} \varphi _\pm ^\epsilon \\ \chi _\pm ^\epsilon \\ \end{array} \right) , \end{aligned}$$
(90)
and then, we define the four-dimensional spinors \({\tilde{\varphi }}_\pm ^\epsilon \) and \({\tilde{\chi }}_\pm ^\epsilon \) as
$$\begin{aligned} {\tilde{\varphi }}_\pm ^\epsilon&:= \left( \begin{array}{c} \varphi _\pm ^\epsilon \\ 0 \\ \end{array} \right) \end{aligned}$$
(91)
and
$$\begin{aligned} {\tilde{\chi }}_\pm ^\epsilon&:= \left( \begin{array}{c} 0 \\ \chi _\pm ^\epsilon \\ \end{array} \right) . \end{aligned}$$
(92)
Thus, we can calculate these spinors \({\tilde{\varphi }}_\pm ^\epsilon \) and \({\tilde{\chi }}_\pm ^\epsilon \) using the projectors \(\pi _\pm ^0\) and Eqs. (27) and (42) as
$$\begin{aligned} {\tilde{\varphi }}_\pm ^\epsilon&= \pi ^0_+ \phi _ \pm ^\epsilon , \end{aligned}$$
(93)
$$\begin{aligned} {\tilde{\chi }}_\pm ^\epsilon&= \pi ^0_- \phi _\pm ^\epsilon . \end{aligned}$$
(94)
Analogously to Eq. (75), we expand the functions \({\tilde{\varphi }}_\pm ^\epsilon \) and \({\tilde{\chi }}_\pm ^\epsilon \) in this way:
$$\begin{aligned} {\tilde{\varphi }}_\pm ^\epsilon&:= \sum _{n=0}^\infty \epsilon ^n {\tilde{\varphi }}_\pm ^n, \end{aligned}$$
(95)
$$\begin{aligned} {\tilde{\chi }}_\pm ^\epsilon&:= \sum _{n=0}^\infty \epsilon ^n {\tilde{\chi }}_\pm ^n. \end{aligned}$$
(96)
Of course, such two-dimensional spinors can not only be defined like in Eq. (90) for the four-dimensional electronic spinor \(\phi _+^\epsilon \) or the according positronic spinor \(\phi _-^\epsilon \), but for the total four-dimensional wave function \(\phi ^\epsilon = \phi _+^\epsilon + \phi _-^\epsilon \) (see also Eq. (63)), too. Therefore, within the following equation, we introduce the two-dimensional spinors \(\varphi ^\epsilon \) and \(\chi ^\epsilon \):
$$\begin{aligned} \phi ^\epsilon&= \left( \begin{array}{c} \varphi ^\epsilon \\ \chi ^\epsilon \\ \end{array} \right) , \end{aligned}$$
(97)
and also these four-spinors:
$$\begin{aligned} {\tilde{\varphi }}^\epsilon&= \left( \begin{array}{c} \varphi ^\epsilon \\ 0 \\ \end{array} \right) , \end{aligned}$$
(98)
$$\begin{aligned} {\tilde{\chi }}^\epsilon&:= \left( \begin{array}{c} 0 \\ \chi ^\epsilon \\ \end{array} \right) . \end{aligned}$$
(99)
In analogy to Eqs. (93)–(96), for the four-spinors \({\tilde{\varphi }}^\epsilon \) and \({\tilde{\chi }}^\epsilon \), these equations hold:
$$\begin{aligned} {\tilde{\varphi }}^\epsilon&= \pi _+^0 \phi ^\epsilon , \end{aligned}$$
(100)
$$\begin{aligned} {\tilde{\chi }}^\epsilon&= \pi _-^0 \phi ^\epsilon . \end{aligned}$$
(101)
Moreover, the four-spinors \({\tilde{\varphi }}^\epsilon \) and \({\tilde{\chi }}^\epsilon \) can be expanded as
$$\begin{aligned} {\tilde{\varphi }}^\epsilon&:= \sum _{n=0}^\infty \epsilon ^n {\tilde{\varphi }}^n, \end{aligned}$$
(102)
$$\begin{aligned} {\tilde{\chi }}^\epsilon&:= \sum _{n=0}^\infty \epsilon ^n {\tilde{\chi }}^n. \end{aligned}$$
(103)
As the next step, we derive a differential equation for \(\varphi _+^\epsilon \) up to second order in \(\epsilon \).
Differential equation for
\({\varvec{\varphi }}_{\varvec{+}}^{\varvec{\epsilon }}\)
up to second order in
\({\varvec{\epsilon }}\)
To obtain the differential equation for \(\varphi _+^\epsilon \) up to second order in \(\epsilon \), we use Eqs. (43), (44), (86), and apply the projector \(\pi _+^0\) on the differential equation for the electronic wave function \(\phi _+^\epsilon \) up to second order in \(\epsilon \) for interaction with laser fields (89) according to
$$\begin{aligned} \mathrm {i}\hbar \,\partial _t\pi _+^0\phi _+^\epsilon= & {} \left( \frac{1}{2m_e}\,\hat{\varvec{\Pi }}^2-eV\right) \pi _+^0\phi _+^\epsilon +\frac{\epsilon e\hbar }{2m_e}\,\varvec{\Sigma }{\mathbf{B}} \pi _+^0\phi _+^\epsilon +\frac{\mathrm {i}\epsilon e\hbar }{2m_e}\,\alpha ^k\left( \partial _kV\right) \pi _-^0\phi _+^\epsilon \nonumber \\&-\frac{\epsilon ^2}{8m_e^3}\,\hat{\varvec{\Pi }}^4\pi _+^0\phi _+^\epsilon +\frac{\epsilon ^2e}{2m_e^2}\,\alpha ^k\left[ \left( \frac{1}{\epsilon }\,{\hat{p}}_k{\mathcal {A}}_l\right) e{\mathcal {A}}_l -\left( \frac{1}{\epsilon }\,{\hat{p}}_l{\mathcal {A}}_k\right) {\hat{p}}_l\right] \pi _-^0\phi _+^\epsilon \nonumber \\&-\frac{\epsilon ^2e^2\hbar }{4m_e^2}\left\{ \varvec{\Sigma }\left[ \varvec{{\mathcal {A}}}\times \left( \mathbf{E}-\left( \varvec{\nabla } V\right) \right) \right] +\mathrm {i}\varvec{{\mathcal {A}}}\left( \partial _t \varvec{{\mathcal {A}}}\right) \right\} \pi _+^0\phi _+^\epsilon \nonumber \\&-\frac{\epsilon ^2e\hbar ^2}{4m_e^2}\left( \Delta V-V\Delta \right) \pi _+^0\phi _+^\epsilon +{\mathcal {O}}(\epsilon ^3). \end{aligned}$$
(104)
Now, we derive first a differential equation for \({{\tilde{\varphi }}}_+^\epsilon \) up to second order in \(\epsilon \)—since these calculations are a bit cumbersome, we give them in the Appendix C and here just state their result:
$$\begin{aligned} \mathrm {i}\hbar \,\partial _t{{\tilde{\varphi }}}_+^\epsilon= & {} \left( \frac{1}{2m_e}\,\hat{\varvec{\Pi }}^2-eV\right) {{\tilde{\varphi }}}_+^\epsilon +\frac{\epsilon e\hbar }{2m_e}\,\varvec{\Sigma }{\mathbf{B}} {{\tilde{\varphi }}}_+^\epsilon -\frac{\epsilon ^2}{8m_e^3}\,\hat{\varvec{\Pi }}^4{{\tilde{\varphi }}}_+^\epsilon \nonumber \\&-\frac{\epsilon ^2e\hbar }{4m_e^2}\,\varvec{\Sigma }\left[ \left( \varvec{\nabla } V\right) \times \hat{\varvec{\Pi }}+e\varvec{{\mathcal {A}}}\times \mathbf{E}\right] {{\tilde{\varphi }}}_+^\epsilon \nonumber \\&-\frac{\epsilon ^2e\hbar }{4m_e^2}\left[ \hbar \varvec{\nabla }\left( \varvec{\nabla } V\right) +\mathrm {i}e\varvec{{\mathcal {A}}}\left( \partial _t \varvec{{\mathcal {A}}}\right) \right] {{\tilde{\varphi }}}_+^\epsilon +{\mathcal {O}}(\epsilon ^3). \end{aligned}$$
(105)
In this equation, only terms appear, in which the upper two components of the wave function \({\tilde{\varphi }}^\epsilon _+\) are not coupled to its lower two components. Moreover, the lower two components of the wave function \({\tilde{\varphi }}^\epsilon _+\) are zero (see Eq. (91)). Therefore, we can substitute in Eq. (105) all the four-dimensional spinors \({\tilde{\varphi }}^\epsilon _+\) by the two-dimensional spinors \(\varphi ^\epsilon _+\) and the operator \(\varvec{\Sigma }\)—being a 4\(\times \)4 extension of the spinor operator \(\varvec{\sigma }\), which is a 2\(\times \)2 matrix—by the spinor operator \(\varvec{\sigma }\) itself. Taking this into account and using Eq. (31), we find
$$\begin{aligned} \mathrm {i}\hbar \,\partial _t\varphi _+^\epsilon= & {} \left( \frac{1}{2m_e}\,\hat{\varvec{\Pi }}^2-eV\right) \varphi _+^\epsilon +\frac{\epsilon e\hbar }{2m_e}\,\varvec{\sigma }{\mathbf{B}} \varphi _+^\epsilon -\frac{\epsilon ^2}{8m_e^3}\,\hat{\varvec{\Pi }}^4\varphi _+^\epsilon \nonumber \\&-\frac{\epsilon ^2e\hbar }{4m_e^2}\,\varvec{\sigma }\left[ \left( \varvec{\nabla } V\right) \times \hat{\varvec{\Pi }}+e\varvec{{\mathcal {A}}}\times \mathbf{E}\right] \varphi _+^\epsilon \nonumber \\&-\frac{\mathrm {i}\epsilon ^2e\hbar }{4m_e^2}\left[ \hat{ \mathbf{p}}\left( \varvec{\nabla } V\right) +e\varvec{{\mathcal {A}}}\left( \partial _t \varvec{{\mathcal {A}}}\right) \right] \varphi _+^\epsilon +{\mathcal {O}}(\epsilon ^3) \end{aligned}$$
(106)
as the differential equation for the wave function \(\varphi _+^\epsilon \) up to second order in \(\epsilon \).
Differential equation for
\({\varvec{\varphi }}^{\varvec{\epsilon }}\)
up to second order in
\({\varvec{\epsilon }}\)
We derived the differential equation (106) for \(\varphi _+^\epsilon \) up to second order in \(\epsilon \). However, the wave function \(\varphi _+^\epsilon \) contains just the electronic part of the total upper two-spinor \(\varphi ^\epsilon \), and now we derive an according differential equation for this upper two-spinor \(\varphi ^\epsilon \).
In the following, we apply \(\pi _+^0 \, \mathrm {i} \hbar \, \partial _t\) on Eq. (63), i.e., \(\phi ^\epsilon = \phi ^\epsilon _+ + \phi ^\epsilon _- \). In addition, we use Eq. (93) and that the limit \(\epsilon \rightarrow 0\) of the \(+\)-case of Eq. (58) is \([\pi _+^0, \partial _t] = 0\). Then, we find
$$\begin{aligned}&\pi _+^0 \, \mathrm {i} \hbar \, \partial _t \phi ^\epsilon&= \pi _+^0 \, \mathrm {i} \hbar \, \partial _t \left( \phi ^\epsilon _+ + \phi ^\epsilon _- \right) \nonumber \\ \ \Longrightarrow&\pi _+^0 \, \mathrm {i} \hbar \, \partial _t \phi ^\epsilon&= \pi _+^0 \, \mathrm {i} \hbar \, \partial _t \phi ^\epsilon _+ + \pi _+^0 \, \mathrm {i} \hbar \, \partial _t \phi ^\epsilon _- \nonumber \\ \Longrightarrow&\mathrm {i} \hbar \, \partial _t \pi _+^0 \phi ^\epsilon&= \mathrm {i} \hbar \, \partial _t \pi _+^0 \phi ^\epsilon _+ + \mathrm {i} \hbar \, \partial _t \pi _+^0 \phi ^\epsilon _- \nonumber \\ \Longrightarrow&\mathrm {i} \hbar \, \partial _t {{{\tilde{\varphi }}}}^\epsilon&= \mathrm {i} \hbar \, \partial _t {{{\tilde{\varphi }}}}^\epsilon _+ + \mathrm {i} \hbar \, \partial _t {{{\tilde{\varphi }}}}^\epsilon _-. \end{aligned}$$
(107)
Now, we have to bring the right side of Eq. (107) into the form of an operator acting on \({{{\tilde{\varphi }}}}^\epsilon \) (where we will neglect terms of higher order than \({\mathcal {O}}(\epsilon ^2)\)). This right side of Eq. (107) contains two terms, namely, \(\mathrm {i} \hbar \, \partial _t {\tilde{\varphi }}^\epsilon _+\) and \(\mathrm {i} \hbar \, \partial _t {\tilde{\varphi }}^\epsilon _-\). We will focus first on the term \(\mathrm {i} \hbar \, \partial _t {{{\tilde{\varphi }}}}^\epsilon _+\) in this equation, for which we can use Eq. (105). However, on the right side of Eq. (105), the wave function \({{{\tilde{\varphi }}}}_+^\epsilon \) appears, but not \({{{\tilde{\varphi }}}}^\epsilon \). Therefore, we need a formula to rewrite the wave function \({\tilde{\varphi }}_+^\epsilon \) as an expression, where \({\tilde{\varphi }}^\epsilon \) appears. In the Appendix D, we show first how to derive this equation. It is
$$\begin{aligned} {\tilde{\varphi }}_+^\epsilon&= {\tilde{\varphi }}^\epsilon + \frac{\epsilon ^2 e}{4 m_e^2} \left[ \varvec{ {\mathcal {A}}} \, \hat{ \mathbf{p}} - \mathrm {i} \, \varvec{\Sigma }\left( \varvec{{\mathcal {A}}} \times \hat{\mathbf{p}} \right) \right] {\tilde{\varphi }}^\epsilon + {\mathcal {O}}(\epsilon ^3). \end{aligned}$$
(108)
Using Eq. (108), we rewrite then in Appendix D the term \(\mathrm {i} \hbar \, \partial _t {{{\tilde{\varphi }}}}^\epsilon _+\) and find:
$$\begin{aligned} \mathrm {i} \hbar \, \partial _t {{{\tilde{\varphi }}}}^\epsilon _+ \;= & {} \left( \frac{1}{2m_e}\,\hat{\varvec{\Pi }}^2-eV\right) {{\tilde{\varphi }}}^\epsilon +\frac{\epsilon e\hbar }{2m_e}\,\varvec{\Sigma }{\mathbf{B}} {{\tilde{\varphi }}}^\epsilon -\frac{\epsilon ^2}{8m_e^3}\,\hat{\varvec{\Pi }}^4{{\tilde{\varphi }}}^\epsilon \nonumber \\&+ \; \frac{\epsilon ^2 e}{4 m_e^2} \left( \frac{1}{2m_e}\,\hat{\varvec{\Pi }}^2-eV\right) \left[ \varvec{ {\mathcal {A}}} \, \hat{ \mathbf{p}} - \mathrm {i} \, \varvec{\Sigma }\left( \varvec{{\mathcal {A}}} \times \hat{\mathbf{p}} \right) \right] {{\tilde{\varphi }}}^\epsilon \nonumber \\&- \; \frac{\epsilon ^2e\hbar }{4m_e^2}\,\varvec{\Sigma }\left[ \left( \varvec{\nabla } V\right) \times \hat{\varvec{\Pi }}+e\varvec{{\mathcal {A}}}\times \mathbf{E}\right] {{\tilde{\varphi }}}^\epsilon \nonumber \\&- \; \frac{\epsilon ^2e\hbar }{4m_e^2}\left[ \hbar \varvec{\nabla }\left( \varvec{\nabla } V\right) +\mathrm {i} \, e\varvec{{\mathcal {A}}}\left( \partial _t \varvec{{\mathcal {A}}}\right) \right] {{\tilde{\varphi }}}^\epsilon + \; {\mathcal {O}}(\epsilon ^3). \end{aligned}$$
(109)
Having found the result (109) for the first term \(\mathrm {i} \hbar \, \partial _t {\tilde{\varphi }}^\epsilon _+\) on the right side of Eq. (107), we turn now to the second term \(\mathrm {i} \hbar \, \partial _t {\tilde{\varphi }}^\epsilon _-\).
Since the calculus of this term is long, we discuss it in Appendix E and state here just the result of this calculation:
$$\begin{aligned} \mathrm {i} \hbar \, \partial _t {\tilde{\varphi }}_-^\epsilon&= - \frac{\mathrm {i} \epsilon ^2 e \hbar }{4 m_e^2} \left[ \left( \partial _t \varvec{{\mathcal {A}}} \right) \hat{\mathbf{p}} + \varvec{{\mathcal {A}}} (\varvec{\nabla } V) \right] {\tilde{\varphi }}^\epsilon \nonumber \\&\quad - \frac{\epsilon ^2 e \hbar }{4 m_e^2} \varvec{\Sigma }\left[ \left( \partial _t \varvec{ {\mathcal {A}}} \, \right) \times \hat{\mathbf{p}} + e \varvec{{\mathcal {A}}} \times \left( \varvec{\nabla } V \right) \right] {\tilde{\varphi }}^\epsilon \nonumber \\&\quad - \frac{\epsilon ^2 e}{4 m_e^2} \left( \frac{1}{2m_e}\,\hat{\varvec{\Pi }}^2-eV\right) \left[ \varvec{ {\mathcal {A}}} \, \hat{ \mathbf{p}} - \mathrm {i} \, \varvec{\Sigma }\left( \varvec{{\mathcal {A}}} \times \hat{\mathbf{p}} \right) \right] {{\tilde{\varphi }}}^\epsilon + {\mathcal {O}}(\epsilon ^3). \end{aligned}$$
(110)
As the next step, we insert into Eq. (107) the result (109) for the term \(\mathrm {i} \hbar \, \partial _t {\tilde{\varphi }}_+^\epsilon \) and the result (110) for the term \(\mathrm {i} \hbar \, \partial _t {\tilde{\varphi }}_-^\epsilon \). Then a differential equation for the wave function \({\tilde{\varphi }}^\epsilon \) can be calculated. The corresponding calculations are given in detail in Appendix F—here we give just the result:
$$\begin{aligned} \mathrm {i} \hbar \, \partial _t {\tilde{\varphi }}^\epsilon&= \frac{1}{2m_e}\,\hat{\varvec{\Pi }}^2 {{\tilde{\varphi }}}^\epsilon -eV {{\tilde{\varphi }}}^\epsilon +\frac{\epsilon e\hbar }{2m_e}\,\varvec{\Sigma }{\mathbf{B}} {{\tilde{\varphi }}}^\epsilon -\frac{\epsilon ^2}{8m_e^3}\,\hat{\varvec{\Pi }}^4{{\tilde{\varphi }}}^\epsilon \nonumber \\&\quad + \frac{\epsilon ^2 e \hbar }{4 m_e^2} \, \hat{\varvec{\Pi }} \left( \varvec{\Sigma }\times \mathbf{E} \right) {{\tilde{\varphi }}}^\epsilon + \frac{\mathrm {i} \epsilon ^2 e \hbar }{4 m_e^2} \, \hat{\varvec{\Pi }} \mathbf{E} {{\tilde{\varphi }}}^\epsilon + {\mathcal {O}}(\epsilon ^3). \end{aligned}$$
(111)
Since all of the operators that act in Eq. (111) on the wave function \({\tilde{\varphi }}^\epsilon \) do not couple its upper two components with its lower two components, and the lower two components of the wave function \({\tilde{\varphi }}^\epsilon \) are due to Eq. (98) just zero, we can substitute in (111) the four-spinor \({{\tilde{\varphi }}}^\epsilon \) everywhere it appears by its upper two-spinor \(\varphi ^\epsilon \), and the 4\(\times \)4 matrix \(\varvec{\Sigma }\), which is an extension of the 2\(\times \)2 spinor operator \(\varvec{\sigma }\), by the spinor operator \(\varvec{\sigma }\) itself. Then, we find:
$$\begin{aligned} \mathrm {i} \hbar \, \partial _t \varphi ^\epsilon&= \frac{1}{2m_e}\,\hat{\varvec{\Pi }}^2 \varphi ^\epsilon - eV \varphi ^\epsilon +\frac{\epsilon e\hbar }{2m_e}\,\varvec{\sigma }{\mathbf{B}} \varphi ^\epsilon -\frac{\epsilon ^2}{8m_e^3}\,\hat{\varvec{\Pi }}^4 \varphi ^\epsilon \nonumber \\&\quad + \frac{\epsilon ^2 e \hbar }{4 m_e^2} \, \hat{\varvec{\Pi }} \left( \varvec{\sigma }\times \mathbf{E} \right) \varphi ^\epsilon + \frac{\mathrm {i} \epsilon ^2 e \hbar }{4 m_e^2} \, \hat{\varvec{\Pi }} \mathbf{E} \varphi ^\epsilon + {\mathcal {O}}(\epsilon ^3). \end{aligned}$$
(112)
Now, we have derived the differential equation for the wave function \(\varphi ^\epsilon \) up to second order in \(\epsilon \).
However, one problem remains: while the operators
$$\begin{aligned} {\hat{H}}_1&:= \frac{1}{2m_e} \, \hat{\varvec{\Pi }}^2, \end{aligned}$$
(113)
$$\begin{aligned} {\hat{H}}_2&:= - eV, \end{aligned}$$
(114)
$$\begin{aligned} {\hat{H}}_3&:= \frac{\epsilon e\hbar }{2m_e} \, \varvec{\sigma }{\mathbf{B}} , \end{aligned}$$
(115)
$$\begin{aligned} {\hat{H}}_4&:= -\frac{\epsilon ^2}{8m_e^3}\,\hat{\varvec{\Pi }}^4, \end{aligned}$$
(116)
which act in the first line of Eq. (112) on the wave function \(\varphi ^\epsilon \), are Hermitian, the operators
$$\begin{aligned} {\hat{H}}_5&:= \frac{\epsilon ^2 e \hbar }{4 m_e^2} \hat{\varvec{\Pi }} \left( \varvec{\sigma }\times \mathbf{E} \right) , \end{aligned}$$
(117)
$$\begin{aligned} {\hat{H}}_6&:= \frac{\mathrm {i} \epsilon ^2 e \hbar }{4 m_e^2} \,\hat{\varvec{\Pi }} \mathbf{E} , \end{aligned}$$
(118)
which act in the second line of Eq. (112) on the wave function \(\varphi ^\epsilon \), are non-Hermitian. Because of these non-Hermitian operators \({\hat{H}}_5\) and \({\hat{H}}_6\), in general, the norm of the wave function \(\varphi ^\epsilon \) is not conserved during its propagation in time.
The reason why these norm deviations occur during the propagation of the wave function \(\varphi ^\epsilon \) is:
According to Eq. (97), the four-dimensional spinor \(\phi ^\epsilon \) can be represented with a two-dimensional upper spinor \(\varphi ^\epsilon \) and a two-dimensional lower spinor \(\chi ^\epsilon \). As given in Eq. (66), the four-dimensional spinor \(\phi ^\epsilon \) is normalized but it still allows the two-dimensional spinors \(\varphi ^\epsilon \) and \(\chi ^\epsilon \) to exchange population between each other during their propagation in time. Because of this exchange of population, the norms of the two-dimensional spinors \(\varphi ^\epsilon \) and \(\chi ^\epsilon \) are not conserved.
Differential equation for the normalized upper wave function
\({\varvec{\varphi }}^{\varvec{\epsilon }}_{\varvec{n}}\)
In the following, we will discuss how to bring the diffential equation (112) for the non-normalized wave function \(\varphi ^\epsilon \) in a form with a normalized wave-function \(\varphi ^\epsilon _n\). To do that, we have to make the operators \({\hat{H}}_5\) and \({\hat{H}}_6\) on the right side of (112) Hermitian.
Therefore, we regard that the Hermitian part \({\hat{O}}_h\) of any operator \({\hat{O}}\) is
$$\begin{aligned} {\hat{O}}_h&= \frac{1}{2} \left( {\hat{O}} + {\hat{O}}^\dagger \right) . \end{aligned}$$
(119)
The adjoint operators to the operators \({\hat{H}}_5\) and \({\hat{H}}_6\) are
$$\begin{aligned} {\hat{H}}_5^\dagger&= \frac{\epsilon ^2 e \hbar }{4 m_e^2} \left( \varvec{\sigma }\times \mathbf{E} \right) \hat{\varvec{\Pi }}, \end{aligned}$$
(120)
$$\begin{aligned} {\hat{H}}_6^\dagger&= -\frac{\mathrm {i} \epsilon ^2 e \hbar }{4 m_e^2} \, \mathbf{E} \hat{\varvec{\Pi }}, \end{aligned}$$
(121)
hence
$$\begin{aligned} {\hat{H}}_{5h}&= \frac{\epsilon ^2 e \hbar }{8 m_e^2} \left[ \hat{\varvec{\Pi }} \left( \varvec{\sigma }\times \mathbf{E} \right) + \left( \varvec{\sigma }\times \mathbf{E} \right) \hat{\varvec{\Pi }} \right] , \end{aligned}$$
(122)
$$\begin{aligned} {\hat{H}}_{6h}&= \frac{\mathrm {i} \epsilon ^2 e \hbar }{8 m_e^2} \left( \hat{\varvec{\Pi }} \, \mathbf{E} - \mathbf{E} \, \hat{\varvec{\Pi }} \right) \nonumber \\&= \frac{\mathrm {i} \epsilon ^2 e \hbar }{8 m_e^2} \left( - \mathrm {i} \hbar \, \varvec{\nabla } \mathbf{E} + e \varvec{{\mathcal {A}}} \, \mathbf{E} + \mathrm {i} \hbar \, \mathbf{E} \, \varvec{\nabla } - e \mathbf{E} \varvec{{\mathcal {A}}} \right) \nonumber \\&= \frac{\epsilon ^2 e \hbar ^2}{8 m_e^2} \left( \varvec{\nabla } \mathbf{E} - \mathbf{E} \varvec{\nabla } \right) \nonumber \\&= \frac{\epsilon ^2 e \hbar ^2}{8 m_e^2} \left[ \left( \varvec{\nabla } \mathbf{E} \right) + \mathbf{E} \varvec{\nabla } - \mathbf{E} \varvec{\nabla } \right] \nonumber \\&= \frac{\epsilon ^2 e \hbar ^2}{8 m_e^2} \left( \varvec{\nabla } \mathbf{E} \right) . \end{aligned}$$
(123)
As an additional detail, we mention that the term \({\hat{H}}_{5h}\) given in (122) can be rewritten using Eqs. (31), (32) in the following form:
$$\begin{aligned} {\hat{H}}_{5h}&= \frac{\epsilon ^2 e \hbar }{8 m_e^2} \varepsilon _{klm} \left( {{\hat{\Pi }}}_k \sigma _l E_m + \sigma _l E_m {{\hat{\Pi }}}_k \right) \nonumber \\&= \frac{\epsilon ^2 e \hbar }{8 m_e^2} \varepsilon _{klm} \left[ 2 {{\hat{\Pi }}}_k \sigma _l E_m + \mathrm {i} \hbar \, \sigma _l (\partial _k E_m) \right] \nonumber \\&= \frac{\epsilon ^2 e \hbar }{4 m_e^2} \, \hat{\varvec{\Pi }} \left( \varvec{\sigma }\times \mathbf{E} \right) - \frac{\mathrm {i} \epsilon ^2 e \hbar ^2}{8 m_e^2} \, \varvec{\sigma }\left( \varvec{\nabla } \times \mathbf{E} \right) \nonumber \\&= {\hat{H}}_{5} - \frac{\mathrm {i} \epsilon ^2 e \hbar }{8 m_e^2} \, \varvec{\sigma }\left( \varvec{\nabla } \times \mathbf{E} \right) . \end{aligned}$$
(124)
Now, we regard Eq. (24) for the rotation of the electric field \(\varvec{\nabla } \times \mathbf{E} \) and find:
$$\begin{aligned} {\hat{H}}_{5h}&= {\hat{H}}_{5} + {\mathcal {O}}(\epsilon ^3). \end{aligned}$$
(125)
If the rotation of the electric fields vanishes, from Eq. (124) follows that we have even \({\hat{H}}_{5h} = {\hat{H}}_{5}\) then.
Nevertheless, for electric fields with a non-vanishing rotation, it is still advisable to use the term \({\hat{H}}_{5h}\) instead of the term \({\hat{H}}_{5}\) as a summand in the Hamiltonian, because the Hermitian term \({\hat{H}}_{5h}\) does not cause norm violations for all kind of electric fields \(\mathbf{E} \), while this applies for the term \({\hat{H}}_{5}\) only for electric fields with a vanishing rotation.
Having discussed this detail related to the term \({\hat{H}}_{5h}\), using the results (122) and (123), we finally obtain the differential equation for the normalized upper wave function \(\varphi ^\epsilon _n\):
$$\begin{aligned} \mathrm {i} \hbar \, \partial _t \varphi ^\epsilon _n&= \frac{1}{2m_e}\,\hat{\varvec{\Pi }}^2 \varphi ^\epsilon _n- eV \varphi ^\epsilon _n +\frac{\epsilon e\hbar }{2m_e}\,\varvec{\sigma }{\mathbf{B}} \varphi ^\epsilon _n -\frac{\epsilon ^2}{8m_e^3}\,\hat{\varvec{\Pi }}^4 \varphi ^\epsilon _n \nonumber \\&\quad + \frac{\epsilon ^2 e \hbar }{8 m_e^2} \left[ \hat{\varvec{\Pi }} \left( \varvec{\sigma }\times \mathbf{E} \right) + \left( \varvec{\sigma }\times \mathbf{E} \right) \hat{\varvec{\Pi }} \right] \varphi ^\epsilon _n + \frac{\epsilon ^2 e \hbar ^2}{8 m_e^2} \left( \varvec{\nabla } \mathbf{E} \right) \varphi ^\epsilon _n + {\mathcal {O}}(\epsilon ^3). \end{aligned}$$
(126)
The result (126) is the Schrödinger equation with the relativistic corrections we searched for, and it coincides with the result in [12]. In detail, these relativistic corrections can be split into four different terms; one term scaling with \(\epsilon \) in first order and three terms scaling with \(\epsilon \) in second order. Due to [12], these terms can be described in the following way:
The single first-order term is \({\hat{H}}_3\) (see Eq. (115)), which can be related to the Zeeman splitting in the magnetic field.
The first second-order term is \({\hat{H}}_4\) (see Eq. (116)), which can be related to a relativistic correction of the kinetic energy; the following second-order term is \({\hat{H}}_{5h}\) (see Eq. (122)), which is proportional for the special case of static electromagnetic central fields with
$$\begin{aligned} \varvec{{\mathcal {A}}}&= {\mathbf{B}} = {\varvec{0}}, \end{aligned}$$
(127)
$$\begin{aligned} \mathbf{E}&= - \frac{\mathbf{r}}{r} \, \frac{\partial V}{\partial r} \end{aligned}$$
(128)
to an operator \(\hat{ \mathbf{S}} \cdot \hat{\mathbf{L}}\), where \(\hat{\mathbf{S}} = \frac{\hbar }{2} \varvec{\sigma }\) is the so-called spin operator and \(\hat{\mathbf{L}} = \mathbf{r} \times \hat{\mathbf{p}} \) is the angular momentum. Therefore, this operator \({\hat{H}}_{5h}\) is called the spin–orbit interaction.
Finally, the last second-order term is \({\hat{H}}_{6h}\), which is the Darwin term: a relativistic correction that can be related to the charge density \(\rho = \varvec{\nabla } \mathbf{E} /(4 \pi )\) in the system.
As an additional comment to Eq. (126), we mention that it also corresponds to the according results in [13,14,15]—whereas in these references, the spin–orbit interaction is given in the above-mentioned form for the special case of static electromagnetic fields, where \({\hat{H}}_{5h}\) is proportional to the operator \(\hat{\mathbf{S}} \cdot \hat{\mathbf{L}}\).Footnote 4
Thus, using the Mauser method, we could derive the differential equation (126) in the Schrödinger form containing relativistic corrections up to the second order in \(\epsilon \) that coincides with other literature results [12,13,14,15] derived with the Foldy–Wouthuysen scheme [16].
Relation of the discussed differential equations to the Pauli equation
As a final part of Sect. 5, we discuss how the Pauli equation [13,14,15, 17] is related to the differential equations (106), (112), and (126) derived in Sects. , , and , because Mauser and his coworkers derived the Pauli equation from a differential equation for the wave function \(\varphi _+^\epsilon \) in [8,9,10] already.
At first sight, one might wonder why they found the Pauli equation from an equation related to the electronic part \(\varphi _+^\epsilon \) of the upper two-spinor \(\varphi ^\epsilon \) and not from an equation related to the wave function \(\varphi ^\epsilon _n\), since we needed in this work the differential equation (126) related to the wave function \(\varphi ^\epsilon _n\) to derive a differential equation in the Schrödinger form containing relativistic corrections up to the second order in \(\epsilon \) that coincides with other literature results. As an addition to the works of Mauser [8,9,10], we will clarify this point with the discussion below about Eqs. (106), (112), and (126):
If we regard on the left side of the differential equation (126) for \(\varphi ^\epsilon _n\) only all the expansion terms up to the first expansion order \({\mathcal {O}}(\epsilon )\), Eq. (126) is approximated as a Pauli equation that takes this form (where we note the approximative wave function solving this equation as \(\varphi ^\epsilon _ {\text {p}}\)):
$$\begin{aligned} \mathrm {i} \hbar \, \partial _t \varphi ^\epsilon _\text {p}&= \frac{1}{2m_e}\,\hat{\varvec{\Pi }}^2 \varphi ^\epsilon _\text {p}- eV \varphi ^\epsilon _\text {p} +\frac{\epsilon e\hbar }{2m_e}\,\varvec{\sigma }{\mathbf{B}} \, \varphi ^\epsilon _\text {p}. \end{aligned}$$
(129)
Moreover, if we take into account on the right side of the differential equation (112) for the wave function \(\varphi ^\epsilon \) just all the expansion terms up to the first expansion order \({\mathcal {O}}(\epsilon )\) and approximate the wave function \(\varphi ^\epsilon \) as \(\varphi ^\epsilon _\text {p}\), we find again the Pauli equation (129). This result can be explained in this way:
As discussed in Sect. , on the right side of Eq. (112) the operators \({\hat{H}}_a, a \in \{1, 2, \ldots , 6\}\), act on the wave function \(\varphi ^\epsilon \) and on the right side of Eq. (126) their Hermitian parts \({\hat{H}}_{ah}\) act on the wave function \(\varphi ^\epsilon _n\). Among these six operators \({\hat{H}}_a\), three operators are \({\mathcal {O}}(\epsilon ^0)\) or \({\mathcal {O}}(\epsilon )\)—these three operators are \({\hat{H}}_1\), \({\hat{H}}_2\), \({\hat{H}}_3\), and all three of them are Hermitian. Thus, they appear in the same form in Eqs. (112) and (126). In addition, the operators \({\hat{H}}_4\), \({\hat{H}}_5\), \({\hat{H}}_6\) and their respective Hermitian parts are \(O(\epsilon ^2)\)—so, these operators are neglected if we take into account just operators up to the order \(O(\epsilon )\).
Therefore, it is comprehensible that Eq. (112) becomes the Pauli equation (129), too, if we approximate it analogously to the derivation of the Pauli equation starting from Eq. (126), where we took into account only all the expansion terms up to the first expansion order \({\mathcal {O}}(\epsilon )\).
Furthermore, if we regard on the right side of Eq. (106) only the expansion terms up to the expansion order \({\mathcal {O}}(\epsilon )\) and substitute the electronic part \(\varphi _+^\epsilon \) of the upper two-spinor \(\varphi ^\epsilon \) by the wave function \(\varphi ^\epsilon _\text {p}\), then we find the Pauli equation (129) again. This finding can be cleared up in the following way: using Eqs. (91), (98), and (108), we realize that
$$\begin{aligned} {\varphi }_+^\epsilon&= {\varphi }^\epsilon + \frac{\epsilon ^2 e}{4 m_e^2} \left[ \varvec{ {\mathcal {A}}} \, \hat{ \mathbf{p}} - \mathrm {i} \, \varvec{\sigma }\left( \varvec{{\mathcal {A}}} \times \hat{\mathbf{p}} \right) \right] {\varphi }^\epsilon + {\mathcal {O}}(\epsilon ^3), \nonumber \\ \Longrightarrow \; {\varphi }_+^\epsilon&= {\varphi }^\epsilon + {\mathcal {O}}(\epsilon ^2). \end{aligned}$$
(130)
Thus, we find this consequence of the transformations from the differential equation (106) for the wave function \(\varphi _+^\epsilon \) to the differential equation (112) for the wave function \(\varphi ^\epsilon \):
Because of Eq. (130), these transformations modify only the kind of operators on the right side of these equations, which are at least \({\mathcal {O}}(\epsilon ^2)\). Therefore, both on the right side of Eq. (106) and on the right side of Eq. (112), up to the expansion order \({\mathcal {O}}(\epsilon )\) just the same operators \({\hat{H}}_1\), \({\hat{H}}_2\) and \({\hat{H}}_3\) act on the wave functions \(\varphi ^\epsilon _+\) and \(\varphi ^\epsilon \), respectively. Therefore, both Eqs. (106) and (112) can be transformed into the Pauli equation (129) within the approximations for these equations described above.
Thus, the discussion above clarifies why we can derive the Pauli equation from all three differential equations (106) for \(\varphi ^\epsilon _+\), (112) for \(\varphi ^\epsilon \), and (126) for \(\varphi ^\epsilon _n\)—and so, we could verify the result of Mauser and his coworkers given in [8,9,10], where they derived the Pauli equation from a differential equation for the wave function \(\varphi ^\epsilon _+\).
However, if we are interested to find a differential equation in the Schrödinger form containing relativistic corrections up to the second order in \(\epsilon \) that coincides with the literature results derived with the Foldy–Wouthuysen scheme [16], it is not sufficient to take into account just the electronic part \(\varphi _+^\epsilon \) for the wave function in this differential equation. Instead, we have to take into account both the electronic part \(\varphi _+^\epsilon \) and the positronic part \(\varphi _-^\epsilon \) of the upper two-spinor \(\varphi ^\epsilon \). This is done in this work within the derivation from Eqs. (106) to (112) in Sect. . Furthermore, we have to make an another transformation to obtain a normalized wave function \(\varphi _n^\epsilon \), what can be found in this work within the derivation from Eqs. (112) to (126) in Sect. .