Time-independent perturbation theory with Lagrange multipliers

Yu, Chaehyun; Jung, Dong-Won; Kim, U-Rae; Lee, Jungil

doi:10.1007/s40042-021-00328-3


Original Paper - General, Mathematical and Statistical Physics
Open access
Published: 15 December 2021

Volume 79, pages 1104–1113 (2021)


Journal of the Korean Physical Society


Abstract

We derive the formulas for the energy and wavefunction of the time-independent Schrödinger equation with perturbation in a compact form. Unlike the conventional approaches based on Rayleigh–Schrödinger or Brillouin–Wigner perturbation theories, we employ a recently developed approach of matrix-valued Lagrange multipliers that regularizes an eigenproblem. The Lagrange-multiplier regularization makes the characteristic matrix for an eigenproblem invertible. After applying the constraint equation to recover the original equation, we find solutions for the energy and wavefunction that are consistent with the conventional approaches. This formalism does not rely on an iterative procedure, and the order-by-order corrections are easily obtained by Taylor expansion. The Lagrange-multiplier regularization formalism for perturbation theory presented in this paper is completely new and can be extended to the degenerate perturbation theory in a straightforward manner. We expect that this new formalism is also pedagogically useful for giving insight into perturbation theory in quantum mechanics.


1 Introduction

Stationary states of a quantum-mechanical system are described by the time-independent Schrödinger equation, and there are many systems whose exact analytic solutions are known, as can be seen in textbooks such as Refs. [1, 2]. However, the exact solution of the time-independent Schrödinger equation for an arbitrary quantum-mechanical system is usually unknown or difficult to obtain in practice. If the Hamiltonian of a given system deviates by a small amount from that of a system whose exact solutions are well known, then one may find an approximate solution of the given system in terms of the solutions of the well-known system. The small deviation from a well-known system is called a perturbation.

The standard time-independent perturbation theory [3] found in most quantum-mechanics textbooks, such as Refs. [1, 2], is the Rayleigh–Schrödinger perturbation theory, which was first introduced by Schrödinger. This approach computes the eigenvalue and the corresponding eigenvector of a system directly as explicit power series, so the formulas are very simple at low orders in the small perturbative parameter. Complications appear when computing higher-order corrections in the Rayleigh–Schrödinger perturbation theory.

The Brillouin–Wigner perturbation theory has an advantage over its Rayleigh–Schrödinger counterpart at higher orders because its formulas are much simpler, while the first-order corrections of the two theories are the same. However, the weak point of the Brillouin–Wigner theory is that the expansion formula depends on the energy eigenvalue of the perturbed Hamiltonian, which is unknown, rather than on that of the unperturbed Hamiltonian. A comparison between Rayleigh–Schrödinger and Brillouin–Wigner perturbation theories can be found, for example, in Ref. [4].

Apart from these two well-known perturbation theories, several other methods to find approximate solutions of a system with a small perturbation have been suggested so far. For example, Niblack and Nigam introduced an operator, which turns out to be the operator projecting out the components orthogonal to the unperturbed eigenstate [5]. With the aid of the operator, they obtained the solution for the energy and wavefunction of the system in terms of the perturbative potential and the unperturbed wavefunction. Lain and Torre found a simple algorithm to derive the formulas of both Rayleigh–Schrödinger and Brillouin–Wigner perturbation theories by considering the total corrections to the energy and wavefunction, which are expanded on the eigenfunctions of the unperturbed Hamiltonian [6].

In this paper, we present a new alternative method to find the solution of the perturbed system by employing the Lagrange-multiplier regularization formalism for the eigenproblem, which was suggested in Ref. [7] and successfully applied to several eigenvalue problems in classical and quantum mechanics. For example, the formalism has been applied to the normal mode of a loaded string [8] and the inertia tensor of a three-body system [9]. Conventionally, the eigenproblem is solved by finding the eigenvalue, which is a solution of the secular equation, and then its corresponding eigenvector by Gaussian elimination. The requirement that the secular determinant vanish implies that the original eigenvalue equation is indeterminate. However, by adding matrix-valued Lagrange undetermined multipliers to the original eigenvalue equation, we can regularize the indeterminate equation and easily find the eigenvector for a given eigenvalue; this eigenvector depends on a regularization parameter. The dependence on the regularization parameter can be removed by imposing a constraint equation. This procedure is similar to adding a gauge-fixing term to the gauge-field Lagrangian density in gauge-field theories and has turned out to be very powerful in solving several eigenvalue problems in physics [8, 9].

We provide a new approach to find the energy and wavefunction of the time-independent Schrödinger equation with perturbation by regularizing the eigenvalue equation of the Hamiltonian with matrix-valued Lagrange undetermined multipliers. The regularization makes the characteristic matrix invertible by increasing the number of linearly independent equations. By multiplying the regularized eigenvalue equation by the inverse of the regularized characteristic matrix and imposing the constraint equation that restores the original equation, we find compact formulas for the energy and wavefunction of the perturbed system. The results agree with those obtained from the conventional method. We expect that this new formalism is pedagogically useful for giving insight into perturbation theory in quantum mechanics.

This paper is organized as follows: In Sect. 2, we define the notation for the Rayleigh–Schrödinger time-independent perturbation theory and review the conventional approach to it. In Sect. 3, we provide a new Lagrange-multiplier-regularization formalism for determining the energy and wavefunction of the time-independent Schrödinger equation with perturbation, and we conclude in Sect. 4. A rigorous proof of the property of the adjugate matrix, which is a crucial part of the formulation, is presented in the appendix.

2 Conventional time-independent perturbation theory

In this section, we first list definitions of the notations that are used in the remainder of this paper involving the Rayleigh–Schrödinger time-independent perturbation theory. Then we review the conventional approach to deal with the Rayleigh–Schrödinger time-independent perturbation theory [3] mainly following the convention given in Ref. [2].

2.1 Definitions

The energy eigenket $|n^{(0)}\rangle$ with the energy eigenvalue $E_n^{(0)}$ for the Hamiltonian $H_0$ satisfies the eigenvalue equation:

$$\begin{aligned} H_0|n^{(0)}\rangle =E_n^{(0)}|n^{(0)}\rangle , \end{aligned}$$

(1)

which is called the time-independent Schrödinger equation. The integer $n=1$, 2, $\cdots$ stands for the quantum number of the energy eigenket $|n^{(0)}\rangle$. We assume that the energy eigenstates are nondegenerate. Because $H_0$ is Hermitian, $H_0^\dagger =H_0$, the energy eigenvalue $E_n^{(0)}$ is real, and the eigenkets are orthogonal. The eigenket $|n^{(0)}\rangle$, which is also called the state ket, is chosen to have the unit normalization:

$$\begin{aligned} \langle m^{(0)}|n^{(0)}\rangle =\delta _{mn}, \end{aligned}$$

(2)

where $\langle m^{(0)}|$, which is called a bra, is the Hermitian conjugate of the corresponding ket $|m^{(0)}\rangle$. If we choose $\{\,|n^{(0)}\rangle \,|\,n=1,\,2,\,\ldots \}$ as the basis set of the Hamiltonian $H_0$, then the matrix representation of the Hamiltonian can be computed as

$$\begin{aligned} (H_0)_{mn}=\langle m^{(0)}|H_0|n^{(0)}\rangle . \end{aligned}$$

(3)

According to the Schrödinger equation (1) and the orthogonality relation (2), the Hamiltonian $H_0$ of the unperturbed system is a diagonal matrix:

$$\begin{aligned} H_0={\text{diag}}[ E_1^{(0)}\,E_2^{(0)}\ldots ]= \left(\begin{array}{lll} E_1^{(0)}& & \\ & E_2^{(0)}& \\ & & \ddots \\ \end{array}\right). \end{aligned}$$

(4)

Consider the time-independent Hamiltonian H which is the sum of the unperturbed Hamiltonian $H_0$ and a perturbative additional contribution V:

$$\begin{aligned} H=H_0+\lambda V, \end{aligned}$$

(5)

where $\lambda$ is a small real parameter and V is Hermitian so that $V^\dagger =V$ and $H^\dagger =H$. Then the energy eigenket $|n\rangle$ satisfies the eigenvalue equation:

$$\begin{aligned} H|n\rangle =(H_0+\lambda V)|n\rangle =E_n|n\rangle , \end{aligned}$$

(6)

where $E_n$ is the energy eigenvalue for the full Hamiltonian H that includes the perturbation. We find the solutions of $E_n$ and $|n\rangle$ satisfying Eq. (6). If $\lambda = 0$, then $E_n=E_n^{(0)}$ and $|n\rangle =|n^{(0)}\rangle$. The perturbation theory finds the Taylor-series expansions of $E_n$ and $|n\rangle$ in powers of $\lambda$ about $\lambda =0$.

From the Schrödinger equation (1), it is manifest that $(H_0-E_n^{(0)})|n^{(0)}\rangle =\mathbb {0}$, where $\mathbb {0}$ is the null ket. Subtracting $E_n^{(0)}|n\rangle$ from both sides of the eigenvalue equation (6), we find that

$$\begin{aligned} (H_0-E_n^{(0)}+\lambda V-\Delta _n\mathbbm {1})| n\rangle =\mathbb {0}, \end{aligned}$$

(7)

where $\mathbbm {1}$ is the identity matrix and the energy shift $\Delta _n$ is defined by

$$\begin{aligned} \Delta _n=E_n-E_n^{(0)}. \end{aligned}$$

(8)

Note that $\Delta _n$ is of order $\lambda ^1$ or higher because it vanishes as $\lambda \rightarrow 0$.

2.2 Conventional strategy

We review the conventional strategy to compute perturbative corrections to the state kets and energy shifts order by order. The coefficients of $\lambda ^k$ for the energy eigenvalue $E_n$ and state ket $|n\rangle$ in the full Schrödinger equation (6) are to be expressed in terms of the corresponding unperturbed values for $E_k^{(0)}$’s and $|k^{(0)}\rangle$’s.

Orthogonality of $(\lambda V-\Delta _n\mathbbm {1})| n\rangle$: We first investigate the orthogonality of $(\lambda V-\Delta _n\mathbbm {1})| n\rangle$ with respect to the unperturbed ket $|n^{(0)}\rangle$. By applying the unperturbed bra $\langle n^{(0)}|$ to the left of the full Schrödinger equation (7) and taking the Hermitian conjugate of the unperturbed eigenvalue equation (1), we find that $(\lambda V-\Delta _n)| n\rangle$ is orthogonal to $|n^{(0)}\rangle$:

$$\begin{aligned} \langle n^{(0)}|(\lambda V-\Delta _n\mathbbm {1})| n\rangle =0. \end{aligned}$$

(9)

This immediately yields the identity for the energy shift $\Delta _n$:

$$\begin{aligned} \Delta _n=\frac{\lambda \langle n^{(0)}|V| n\rangle }{\langle n^{(0)}|n\rangle }= \lambda \langle n^{(0)}|V| n\rangle , \end{aligned}$$

(10)

where we have set the normalization for $|n\rangle$ such that

$$\begin{aligned} \langle n^{(0)}|n\rangle =\langle n^{(0)}|n^{(0)}\rangle =1,\quad \langle n^{(0)}|m^{(0)}\rangle =\delta _{nm}. \end{aligned}$$

(11)

2.2.1 Requirement of renormalization

We emphasize that the normalization condition $\langle n^{(0)}|n\rangle =1$ is not required but is chosen for convenience. This choice nevertheless simplifies the intermediate steps significantly because it disallows any overlap between the unperturbed state $|n^{(0)}\rangle$ and the perturbative corrections $\lambda ^k|n^{(k)}\rangle$ for every $k\ge 1$. The price for this convenience is an additional step at the end of the calculation, which is called the renormalization. The reason is that the choice of the normalization $\langle n^{(0)}|n\rangle =\langle n^{(0)}|n^{(0)}\rangle =1$ results in $\langle n |n\rangle \ne 1$ because $| n\rangle$ acquires nonvanishing components orthogonal to $| n^{(0)}\rangle$ due to the perturbation. Therefore, $|n\rangle$ must be renormalized, not now but at the end of the calculation, by the replacement

$$\begin{aligned} |n\rangle \rightarrow \frac{|n\rangle }{\sqrt{\langle n|n\rangle }}. \end{aligned}$$

(12)

The renormalization procedure must be postponed until we complete the calculation of the perturbative corrections $|n^{(k)}\rangle$ up to the highest order of interest.

2.2.2 Perturbative expansion of $| n \rangle$

The eigenket can be expanded in powers of $\lambda$ as

$$\begin{aligned} |n \rangle = |n^{(0)} \rangle +\sum _{k=1}^\infty \lambda ^k|n^{(k)}\rangle . \end{aligned}$$

(13)

The coefficient kets $|n^{(k)}\rangle$ are independent of $\lambda$. This expansion is consistent with the normalization in Eq. (11) and

$$\begin{aligned} \langle n^{(0)}|n^{(k)}\rangle =0,\qquad k\ne 0. \end{aligned}$$

(14)

Thus we confirm that the choice of the normalization for $\langle n^{(0)}|n\rangle =1$ in Eq. (11) greatly simplifies the intermediate computation because of the orthogonality relation in Eq. (14).
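As a short worked step, assuming only the expansion (13), the orthogonality relation (14), and a real $\lambda$, the norm that enters the renormalization in Eq. (12) can be seen to differ from unity only at second order:

$$\begin{aligned} \langle n|n\rangle &=1+\sum _{j,k\ge 1}\lambda ^{j+k}\langle n^{(j)}|n^{(k)}\rangle =1+\lambda ^2\langle n^{(1)}|n^{(1)}\rangle +O(\lambda ^3),\\ \frac{1}{\sqrt{\langle n|n\rangle }}&=1-\frac{\lambda ^2}{2}\langle n^{(1)}|n^{(1)}\rangle +O(\lambda ^3). \end{aligned}$$

The cross terms between $|n^{(0)}\rangle$ and $|n^{(k)}\rangle$ drop out by Eq. (14), so the renormalization leaves the ket unchanged through first order in $\lambda$.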

2.2.3 Perturbative expansion of $\Delta _n^{(k)}$

Note that $\Delta _n$ can be expanded in a power series of $\lambda$ as

$$\begin{aligned} \Delta _n=\lambda \Delta _n^{(1)}+\lambda ^2\Delta _n^{(2)}+\cdots =\sum _{k=1}^\infty \lambda ^k\Delta _n^{(k)}, \end{aligned}$$

(15)

which starts from the order of $\lambda ^1$ due to the consistency in the limit of $\lambda \rightarrow 0$. Substituting Eqs. (13) and (15) into the second formula of Eq. (10) with the condition $\langle n^{(0)}|n\rangle =1$, we find that the coefficient of $\lambda ^k$ in the energy shift $\Delta _n$ is

$$\begin{aligned} \Delta _n^{(k)}=\langle n^{(0)}|V|n^{(k-1)}\rangle , \qquad k=1,\,2,\ldots , \end{aligned}$$

(16)

which implies that the kth correction to the energy shift is determined by the $(k-1)$th-order correction $|n^{(k-1)}\rangle$ to the eigenket.
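The matching of powers that leads to Eq. (16) can be written out explicitly. Inserting Eqs. (13) and (15) into $\Delta _n=\lambda \langle n^{(0)}|V|n\rangle$ gives

$$\begin{aligned} \sum _{k=1}^\infty \lambda ^k\Delta _n^{(k)} =\lambda \langle n^{(0)}|V\left( |n^{(0)}\rangle +\sum _{j=1}^\infty \lambda ^j|n^{(j)}\rangle \right) =\sum _{k=1}^\infty \lambda ^k\langle n^{(0)}|V|n^{(k-1)}\rangle , \end{aligned}$$

where the last step relabels $j=k-1$. Comparing the coefficients of $\lambda ^k$ on both sides reproduces Eq. (16).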

Operator $\phi _n/(E_n^{(0)}-H_0)$: According to Eq. (9), $(\lambda V-\Delta _n\mathbbm {1})|n\rangle$ is always orthogonal to $|n^{(0)}\rangle$. Therefore, $(E_n^{(0)}-H_0)^{-1}(\lambda V-\Delta _n\mathbbm {1})$ is well defined when acting on $|n\rangle$: the component along $|n^{(0)}\rangle$, for which the denominator would vanish, is absent. Thus we define

$$\begin{aligned} \frac{\phi _n}{E_n^{(0)}-H_0}(\lambda V-\Delta _n\mathbbm {1}) &\equiv (E_n^{(0)}-H_0)^{-1}\phi _n(\lambda V-\Delta _n\mathbbm {1})\nonumber \\&= (E_n^{(0)}-H_0)^{-1}(\lambda V-\Delta _n\mathbbm {1}) , \end{aligned}$$

(17)

where $\phi _n$ is the projection operator onto the subspace orthogonal to $|n^{(0)}\rangle$:

$$\begin{aligned} \phi _n=\mathbbm {1}-|n^{(0)}\rangle \langle n^{(0)}|=\sum _{k\ne n} |k^{(0)}\rangle \langle k^{(0)}|. \end{aligned}$$

(18)

Trivial properties of this projection operator are

$$\begin{aligned} \phi _n|n^{(0)}\rangle =\mathbb {0},\quad \phi _n^k=\phi _n,\quad k=1,\,2,\ldots , \end{aligned}$$

(19)

which denote the orthogonality and idempotent property, respectively.
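For orientation, combining the definition (18) with the unperturbed eigenvalue equation (1) gives the spectral form of the operator $\phi _n/(E_n^{(0)}-H_0)$ in the unperturbed basis:

$$\begin{aligned} \frac{\phi _n}{E_n^{(0)}-H_0} =\sum _{k\ne n}\frac{|k^{(0)}\rangle \langle k^{(0)}|}{E_n^{(0)}-E_k^{(0)}}. \end{aligned}$$

Every denominator is nonzero under the nondegeneracy assumption of Sect. 2.1, which is why the projection $\phi _n$ makes the inverse well defined.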

2.3 Result for the eigenket

The eigenket $|n\rangle$ can be expanded by respecting the normalization (11) and the eigenvalue equation (7), and by applying the identity (17) as

$$\begin{aligned} |n\rangle =|n^{(0)}\rangle +\frac{\phi _n}{E_n^{(0)}-H_0}(\lambda V-\Delta _n\mathbbm {1})|n\rangle , \end{aligned}$$

(20)

where each term on the right side represents the component proportional and orthogonal to the unperturbed eigenket $|n^{(0)}\rangle$, respectively. Moving the last term on the right side to the left and solving $|n\rangle$, we find that

$$\begin{aligned} |n\rangle =\left[ \mathbbm {1}-\frac{\phi _n}{E_n^{(0)}-H_0}(\lambda V-\Delta _n\mathbbm {1})\right] ^{-1}|n^{(0)}\rangle . \end{aligned}$$

(21)

A note should be added regarding the expression in Eq. (21). The conventional approaches that can be seen, for example, in Refs. [1, 2] rely on a cumbersome iterative procedure to find the order-by-order expression for $|n^{(k)}\rangle$. This procedure is simplified into a one-step operation in Eq. (21). The contribution of the second term in the brackets is of order $\lambda ^1$ or higher according to Eqs. (10) and (13). If we assume that the contribution is small by taking $\lambda \ll 1$, then we can make a Taylor-series expansion of the inverse operator as

$$\begin{aligned} |n\rangle =\left\{ \sum _{k=0}^\infty \left[ \frac{\phi _n}{E_n^{(0)}-H_0}(\lambda V-\Delta _n\mathbbm {1})\right] ^k \right\} |n^{(0)}\rangle . \end{aligned}$$

(22)

The expansion of the operator from Eqs. (21)–(22) is similar to the expansion of the geometric series: $1/(1-r)=1+r+r^2+\cdots =\sum _{k=0}^\infty r^k$ for $|r|<1$.

One should take special care with the order-by-order computation of the perturbative contribution. While $\lambda V$ contributes at order $\lambda$ only, $\Delta _n$ has contributions of order $\lambda ^k$ for all $k=1$, 2, $\cdots$. Substituting Eq. (16) into Eq. (15) and substituting this $\Delta _n$ into Eq. (22), we find the resultant power-series expression for $|n\rangle$ to all orders in $\lambda$:

$$\begin{aligned} |n\rangle =\left\{ \sum _{k=0}^\infty \left[ \frac{\phi _n}{E_n^{(0)}-H_0}\left( \lambda V- \mathbbm {1} \sum _{\ell =1}^\infty \lambda ^\ell \langle n^{(0)}|V|n^{(\ell -1)}\rangle \right) \right] ^k \right\} |n^{(0)}\rangle . \end{aligned}$$

(23)

Substituting Eq. (13) on the left side and comparing each term of order $\lambda ^k$ on both sides, we find the eigenket $|n\rangle$ in a power series of $\lambda$. In practice the series expansion is truncated at a certain order in $\lambda$. The result given in Eq. (23) is the consequence of the normalization choice in Eq. (11). Therefore, we must renormalize the state ket $|n\rangle$ following Eq. (12) as we have stated in the previous subsection.

Here, we have assumed that the interaction potential $\lambda V$ is small. In general, $\lambda V$ is not necessarily small. It is worthwhile pointing out that the formula (21) still holds even for a fairly large potential. Then, one can find the eigenket $|n\rangle$ from Eq. (21) if the convergence of the expansion is reasonably good.
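As a rough numerical illustration of Eqs. (10), (12), and (20), the sketch below evaluates the two relations self-consistently for a two-level system. The numbers in `E0`, `V`, and `lam` and the helper name `solve_ket` are illustrative assumptions that do not come from the paper, and the fixed-point loop is only one simple way to evaluate the inverse in Eq. (21) when the correction term is small.

```python
import math

def solve_ket(E0, V, lam, n, iterations=200):
    """Iterate Eq. (20) with Delta_n from Eq. (10) (hypothetical helper).

    E0  : list of nondegenerate unperturbed energies (diagonal of H0)
    V   : real symmetric perturbation, given as a list of rows
    lam : perturbation parameter lambda
    n   : index of the unperturbed state
    Returns (Delta_n, ket) in the normalization <n0|n> = 1 of Eq. (11).
    """
    dim = len(E0)
    ket = [1.0 if k == n else 0.0 for k in range(dim)]  # start from |n0>
    delta = 0.0
    for _ in range(iterations):
        # Eq. (10): Delta_n = lambda <n0|V|n>
        delta = lam * sum(V[n][j] * ket[j] for j in range(dim))
        # w = (lambda V - Delta_n 1)|n>
        w = [lam * sum(V[k][j] * ket[j] for j in range(dim)) - delta * ket[k]
             for k in range(dim)]
        # Eq. (20): phi_n drops the k = n component of w and
        # 1/(E_n - E_k) acts on the remaining components
        ket = [1.0 if k == n else w[k] / (E0[n] - E0[k]) for k in range(dim)]
    return delta, ket

E0 = [0.0, 1.0]                    # assumed unperturbed energies
V = [[0.3, 0.5], [0.5, -0.2]]      # assumed perturbation matrix
lam = 0.1

delta, ket = solve_ket(E0, V, lam, n=0)

# Closed-form lower eigenvalue of the 2x2 matrix H0 + lam*V for comparison
a = E0[0] + lam * V[0][0]
d = E0[1] + lam * V[1][1]
b = lam * V[0][1]
exact = 0.5 * (a + d) - math.sqrt(0.25 * (a - d) ** 2 + b * b)

# Renormalization of Eq. (12), applied only at the very end
norm = math.sqrt(sum(c * c for c in ket))
unit_ket = [c / norm for c in ket]
print(delta, exact - E0[0], unit_ket)
```

For these inputs the iterated energy shift should agree with the closed-form two-level result. For a strong perturbation the loop may fail to converge, which mirrors the convergence caveat stated above.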

2.4 Order-by-order formulas

We list the resultant formulas for the perturbative corrections to the kets and energy shifts to order $\lambda ^3$. The zeroth-order contribution is the identity $|n^{(0)}\rangle =|n^{(0)}\rangle$ because the first term in the sum on the right side of Eq. (23) is just $\mathbbm {1}|n^{(0)}\rangle =|n^{(0)}\rangle$.

2.4.1 State kets

The first-order contribution can be read off as

$$\begin{aligned} |n^{(1)}\rangle = \frac{\phi _n}{E_n^{(0)}-H_0}\left( V- \mathbbm {1} \langle n^{(0)}|V|n^{(0)}\rangle \right) |n^{(0)}\rangle = \frac{\phi _n}{E_n^{(0)}-H_0} V |n^{(0)}\rangle , \end{aligned}$$

(24)

where we have made use of the identity $\phi _n|n^{(0)}\rangle =0.$ The second-order contribution can be extracted as

$$\begin{aligned}|n^{(2)}\rangle = &\left( \frac{\phi _n}{E_n^{(0)}-H_0} V\frac{\phi _n}{E_n^{(0)}-H_0} V \right. \nonumber \\&\quad\left. - \frac{\phi _n}{E_n^{(0)}-H_0} \langle n^{(0)}|V|n^{(0)}\rangle \mathbbm {1} \frac{\phi _n}{E_n^{(0)}-H_0}V \right) |n^{(0)}\rangle . \end{aligned}$$

(25)

Note that the contributions of $k=0$ and 1 of the summation in Eq. (23) do not contribute. In these contributions the second-order ($\lambda ^2$) contribution is at $\ell =2$ in the summation over $\ell$ for $k=1$. However, the contribution is proportional to $\phi _n$ and, therefore, vanishes after acting on the ket $|n^{(0)}\rangle$. The $k=2$ contribution is expressed as the square of the operator in the brackets in Eq. (23), where $\ell$ and $\ell ^\prime$ are used for the two summation indices in the parentheses in Eq. (23) in operating order. Among the $k=2$ contributions, the $\ell ^\prime =1$ contribution vanishes because it is proportional to $[\phi _n/(E_n^{(0)}-H_0)]\mathbbm {1}$ with some prefactors. The first term in the parentheses in Eq. (25) is for $k=2$ and $\ell =\ell ^\prime =0$, while the second term is for $k=2$, $\ell =1$, and $\ell ^\prime =0$.

In a similar manner, we can read off the third-order contribution in a straightforward way:

$$\begin{aligned}|n^{(3)}\rangle = &\Bigg ( \frac{\phi _n}{E_n^{(0)}-H_0} V\frac{\phi _n}{E_n^{(0)}-H_0} V\frac{\phi _n}{E_n^{(0)}-H_0} V \nonumber \\&- \frac{\phi _n}{E_n^{(0)}-H_0} \langle n^{(0)}|V|n^{(0)}\rangle \mathbbm {1}\frac{\phi _n}{E_n^{(0)}-H_0}V \frac{\phi _n}{E_n^{(0)}-H_0}V \nonumber \\&- \frac{\phi _n}{E_n^{(0)}-H_0}V\frac{\phi _n}{E_n^{(0)}-H_0} \langle n^{(0)}|V|n^{(0)}\rangle \mathbbm {1}\frac{\phi _n}{E_n^{(0)}-H_0}V \nonumber \\&+ \frac{\phi _n}{E_n^{(0)}-H_0}\langle n^{(0)}|V|n^{(0)}\rangle \mathbbm {1}\frac{\phi _n}{E_n^{(0)}-H_0} \langle n^{(0)}|V|n^{(0)}\rangle \mathbbm {1}\frac{\phi _n}{E_n^{(0)}-H_0}V \nonumber \\&- \frac{\phi _n}{E_n^{(0)}-H_0} \langle n^{(0)}|V \frac{\phi _n}{E_n^{(0)}-H_0} V |n^{(0)}\rangle \mathbbm {1}\frac{\phi _n}{E_n^{(0)}-H_0}V \Bigg )|n^{(0)}\rangle , \end{aligned}$$

(26)

where we have made use of Eq. (24). The state kets given by these formulas follow the normalization in Eq. (11) and are therefore not normalized to unity. Thus they must be renormalized as shown in Eq. (12) once the perturbative series is truncated at a certain finite order.

2.4.2 Energy shifts

The energy shift can be computed to order $\lambda ^3$ by substituting $|n^{(k)}\rangle$ in Eqs. (24) and (25) into Eq. (16) as

$$\begin{aligned}&\Delta _n^{(1)}=\langle n^{(0)}|V| n^{(0)}\rangle , \end{aligned}$$

(27)

$$\begin{aligned}\Delta _n^{(2)}&=\langle n^{(0)}|V| n^{(1)}\rangle = \langle n^{(0)}|V\frac{\phi _n}{E_n^{(0)}-H_0} V| n^{(0)}\rangle \nonumber \\&= \sum _{k\ne n} \frac{|V_{nk}|^2}{E_n^{(0)}-E_k^{(0)}} , \end{aligned}$$

(28)

$$\begin{aligned}\Delta _n^{(3)}&=\langle n^{(0)}|V| n^{(2)}\rangle = \langle n^{(0)}|V\frac{\phi _n}{E_n^{(0)}-H_0} V\frac{\phi _n}{E_n^{(0)}-H_0} V| n^{(0)}\rangle \nonumber \\ {}&\quad-\langle n^{(0)}|V\frac{\phi _n}{E_n^{(0)}-H_0} \langle n^{(0)}|V| n^{(0)}\rangle \frac{\phi _n}{E_n^{(0)}-H_0} V| n^{(0)}\rangle \nonumber \\&= \sum _{k\ne n}\sum _{\ell \ne n} \frac{V_{nk}V_{k\ell }V_{\ell n}}{(E_n^{(0)}-E_k^{(0)})(E_n^{(0)}-E_\ell ^{(0)})} -\sum _{k\ne n}\frac{V_{nn}|V_{nk}|^2}{(E_n^{(0)}-E_k^{(0)})^2}, \end{aligned}$$

(29)

where we have made use of the fact that V is a Hermitian operator $V^\dagger =V$. The matrix elements for V are defined in terms of the unperturbed state kets as

$$\begin{aligned} V_{nm}=\langle n^{(0)}|V|m^{(0)}\rangle . \end{aligned}$$

(30)

Thus $V_{ij}V_{ji}=V_{ij}V_{ij}^\star =|V_{ij}|^2$ for any i and j.
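The sums in Eqs. (27)–(29) are straightforward to evaluate numerically. The following sketch does so for a real symmetric perturbation and compares the truncated series with the closed-form eigenvalue of a two-level system. The function name `energy_shifts` and the numerical values of `E0`, `V`, and `lam` are assumptions chosen for illustration only.

```python
import math

def energy_shifts(E0, V, n):
    """Coefficients Delta_n^(1), Delta_n^(2), Delta_n^(3) of Eqs. (27)-(29).

    E0 : list of nondegenerate unperturbed energies
    V  : real symmetric matrix elements V[m][k] in the unperturbed basis, Eq. (30)
    n  : index of the state of interest
    """
    others = [k for k in range(len(E0)) if k != n]
    d1 = V[n][n]                                              # Eq. (27)
    d2 = sum(V[n][k] * V[k][n] / (E0[n] - E0[k])              # Eq. (28)
             for k in others)
    d3 = sum(V[n][k] * V[k][l] * V[l][n]                      # Eq. (29), first sum
             / ((E0[n] - E0[k]) * (E0[n] - E0[l]))
             for k in others for l in others)
    d3 -= V[n][n] * sum(V[n][k] * V[k][n] / (E0[n] - E0[k]) ** 2
                        for k in others)                      # Eq. (29), second sum
    return d1, d2, d3

E0 = [0.0, 1.0]                    # assumed unperturbed energies
V = [[0.3, 0.5], [0.5, -0.2]]      # assumed perturbation matrix
lam = 0.1

d1, d2, d3 = energy_shifts(E0, V, n=0)
series = E0[0] + lam * d1 + lam ** 2 * d2 + lam ** 3 * d3

# Closed-form lower eigenvalue of the 2x2 matrix H0 + lam*V
a = E0[0] + lam * V[0][0]
d = E0[1] + lam * V[1][1]
b = lam * V[0][1]
exact = 0.5 * (a + d) - math.sqrt(0.25 * (a - d) ** 2 + b * b)
print(d1, d2, d3, series, exact)
```

For this small value of `lam`, the third-order series and the closed-form eigenvalue differ only by terms of higher order in $\lambda$.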

3 Lagrange-multiplier approach

In this section, we present an alternative derivation of the perturbative-series expansions displayed in Eqs. (21) and (22) involving the full Hamiltonian $H=H_0+\lambda V$. Our derivation is completely independent of the conventional approach described in Sect. 2. Instead, we regularize the eigenvalue equation with matrix-valued Lagrange undetermined multipliers. In Ref. [9], the Lagrange-multiplier-regularization formalism was first introduced to solve a system of indeterminate linear equations. A generalized version of the formalism applicable to the eigenproblem was developed in Ref. [7]. Very recently, the Lagrange-multiplier-regularization formalism was further developed to include a convenient adjugate representation in Ref. [8].

3.1 Lagrange-multiplier regularization

The original unperturbed system satisfies the Schrödinger equation (1). In perturbation theory, one assumes that the exact solution for the unperturbed eigenvalue equation is known. However, in this work we begin with solving the original unperturbed eigenvalue equation by applying the Lagrange-undetermined-multiplier regularization. This is a good example of demonstrating how the Lagrange-multiplier regularization works. Firstly, we observe that the eigenvalue equation (1) is indeterminate. In general, a Lagrange undetermined multiplier introduces a new degree of freedom that the given system is lacking due to a constraint. Intrinsically, the orthogonality between the characteristic matrix and the eigenvector corresponds to such a constraint. Thus the characteristic matrix is lacking the information along the direction parallel to the eigenvector. The Lagrange-multiplier regularization resurrects the lacking degree of freedom of the characteristic equation in a similar spirit as a usual Lagrange multiplier in Lagrangian mechanics does. Once a valid regularization is achieved, one can find the inverse transformation of the equation to find the regularized solution. The right side of the eigenvalue equation must be regularized simultaneously because the sole regularization of the left side leads to a trivial null vector. A valid regularization has the boundary condition that the regularized equation reproduces the original equation if we turn off the regularization parameter at any stage. Thus the solution to the original indeterminate equation is restored if we impose the constraint equation into the regularized solution.

The Schrödinger equation (1) for the unperturbed system is manifestly indeterminate because the determinant of the characteristic matrix vanishes: ${\mathscr{D}}et[H_0-E_n^{(0)}\mathbbm {1}]=0$. The reason is that every row of the characteristic matrix $H_0-E_n^{(0)}\mathbbm {1}$ is orthogonal to the eigenket $|n^{(0)}\rangle$, so the rows cannot be linearly independent. To make the linear equation solvable, we regularize the characteristic matrix by adding a projection operator $|n^{(0)}\rangle \langle n^{(0)}|$ multiplied by the regularization parameter $\alpha$, which is in general a complex number and is set to zero when we restore the original equation. The following regularization does not modify Eq. (1) at all for any complex number $\alpha$:

$$\begin{aligned} (H_0-E_n^{(0)}+\alpha |n^{(0)}\rangle \langle n^{(0)}|)|n^{(0)}\rangle =\alpha |n^{(0)}\rangle ,\qquad \alpha \in \mathbbm {C}, \end{aligned}$$

(31)

where $\mathbbm {C}$ is the set of complex numbers. In practice, however, the eigenket $|n^{(0)}\rangle$ is not yet known and must be found as a solution of Eq. (1). Hence, the regularized equation (31) is practically useless. We make a further modification that replaces $|n^{(0)}\rangle$ with an arbitrary vector, which must have a component along $|n^{(0)}\rangle$. This is equivalent to the replacement of $|n^{(0)}\rangle \langle n^{(0)}|$ with $\mathbbm {1}$, which is the simplest choice. Due to the modification of the left side, $|n^{(0)}\rangle$ on the right side must be replaced with an arbitrary ket $|c\rangle$. However, it turns out that only the component parallel to $|n^{(0)}\rangle$ survives after applying the constraint equation, as we will show later. This modification is valid because the identity matrix preserves both the longitudinal and the transverse projections with respect to $|n^{(0)}\rangle$, whatever $|n^{(0)}\rangle$ is: $\mathbbm {1}|n^{(0)}\rangle \langle n^{(0)}|=|n^{(0)}\rangle \langle n^{(0)}|$ and $\mathbbm {1}(\mathbbm {1}-|n^{(0)}\rangle \langle n^{(0)}|)=\mathbbm {1}-|n^{(0)}\rangle \langle n^{(0)}|$. Then we arrive at the regularized eigenvalue equation:

$$\begin{aligned} (H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1})|n^{(0)}(\alpha )\rangle =\alpha |c\rangle , \end{aligned}$$

(32)

where the matrix-valued Lagrange undetermined multipliers are $\mathbbm {1}$ in $\alpha \mathbbm {1}$ on the left side and $|c\rangle$ on the right side. Note that the regularized ket $|n^{(0)}(\alpha )\rangle$ does acquire the dependence on the regularization parameter $\alpha$ and the original equation (1) is restored as $\alpha \rightarrow 0$. Having resurrected the lacking degree of freedom in the characteristic matrix $H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1}$, we are able to solve $|n^{(0)}(\alpha )\rangle$ by making an ordinary inverse linear transformation of Eq. (32). In principle, one can use any other matrix-valued multiplier $\mathbbm {B}$ instead of $\mathbbm {1}$ as long as $H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {B}$ is invertible. Thus the explicit form of $|n^{(0)}(\alpha )\rangle$ depends on the multiplier. One might worry about a possible scheme dependence of the regularization coming from the choice in the undetermined multiplier of either $\mathbbm {1}$ or $\mathbbm {B}$. However, the boundary condition that reproduces the original equation prohibits the regularization scheme dependence as $\alpha \rightarrow 0$.

The eigenvalue $E_n^{(0)}$ is the solution for the secular equation

$$\begin{aligned} \mathscr {D}={\mathscr{D}}et[H_0-E_n^{(0)}\mathbbm {1}]=0, \end{aligned}$$

(33)

where $\mathscr {D}$ is called the secular determinant. The corresponding regularized secular determinant is not vanishing any more if $\alpha \ne 0$:

$$\begin{aligned} \mathscr {D}(\alpha )={\mathscr{D}}et[H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1}]\propto \alpha \ne 0. \end{aligned}$$

(34)

If every eigenvalue is distinct from all the others, then we say the system is non-degenerate. If the system is non-degenerate, then $\mathscr {D}(\alpha )$ has the asymptotic behavior proportional to $\alpha$ as $\alpha \rightarrow 0$. Thus, the parameter $\alpha$ makes the matrix $(H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1})$ invertible: $(H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1})^{-1}$ exists. Once $E_n^{(0)}$ is known from the secular equation, one can solve for $|n^{(0)}(\alpha )\rangle$ by multiplying the regularized eigenvalue equation (32) from the left by the inverse $(H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1})^{-1}$. The result is

$$\begin{aligned} |n^{(0)}(\alpha )\rangle =\alpha (H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1})^{-1}|c\rangle . \end{aligned}$$

(35)

It is remarkable that $\alpha (H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1})^{-1}$ is analytic at $\alpha =0$ although $(H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1})^{-1}$ has a singularity proportional to $1/\alpha$ as $\alpha \rightarrow 0$. The reason is that $\mathscr {D}(\alpha )\propto \alpha$ as $\alpha \rightarrow 0$. Since $|c\rangle$ is an arbitrary ket, one must rescale $|c\rangle$ by a constant factor after finishing the calculation to require a consistent normalization. To gain a more concrete insight on the $\alpha$-dependence of the eigenket after the regularization, we separate $|n^{(0)}(\alpha )\rangle$ into two components with and without the $\alpha$ dependence. Note that we have the initial condition:

$$\begin{aligned} |n^{(0)}(0)\rangle =|n^{(0)}\rangle . \end{aligned}$$

(36)

The analyticity on the right side of Eq. (35) guarantees that $|n^{(0)}(\alpha )\rangle$ can be expanded about $\alpha =0$ as

$$\begin{aligned} |n^{(0)}(\alpha )\rangle = |n^{(0)}\rangle + \alpha \Delta |n^{(0)}(\alpha )\rangle , \end{aligned}$$

(37)

where the second term $\alpha \Delta |n^{(0)}(\alpha )\rangle$ vanishes as $\alpha \rightarrow 0$. Here, the operator symbol $\Delta$ in Eq. (37) should be distinguished from the energy shift $\Delta _{n}$ defined in Eq. (8). Substituting Eq. (37) into (35), we find that

$$\begin{aligned} |n^{(0)}\rangle =\alpha (H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1})^{-1}|c\rangle - \alpha \Delta |n^{(0)}(\alpha )\rangle . \end{aligned}$$

(38)

Since the left side is free of $\alpha$, it is manifest that the $\alpha$ dependence on the right side cancels completely. Consequently, Eq. (38) holds for any value of $\alpha$ at which the inverse exists. The simplest choice is to take the limit $\alpha \rightarrow 0$, in which the second term on the right side vanishes.
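To make this cancellation concrete, Eq. (35) can be written out in the unperturbed eigenbasis. The following is only a sketch: it assumes a non-degenerate spectrum, the completeness of the eigenkets $|m^{(0)}\rangle$ of $H_0$, and the normalization choice $\langle n^{(0)}|c\rangle =1$ so that Eq. (36) holds. The explicit form of $\alpha \Delta |n^{(0)}(\alpha )\rangle$ is inferred here from Eqs. (35) and (37) and is not quoted from the original derivation.

$$\begin{aligned}\alpha (H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1})^{-1}|c\rangle =&\sum _{m}\frac{\alpha }{E_m^{(0)}-E_n^{(0)}+\alpha }|m^{(0)}\rangle \langle m^{(0)}|c\rangle \nonumber \\=&|n^{(0)}\rangle \langle n^{(0)}|c\rangle +\alpha \sum _{k\ne n}\frac{|k^{(0)}\rangle \langle k^{(0)}|c\rangle }{E_k^{(0)}-E_n^{(0)}+\alpha }. \end{aligned}$$

Under these assumptions the first term is the $\alpha$-independent part $|n^{(0)}\rangle$, and the second term plays the role of $\alpha \Delta |n^{(0)}(\alpha )\rangle$: it is analytic at $\alpha =0$ because every denominator $E_k^{(0)}-E_n^{(0)}+\alpha$ with $k\ne n$ stays nonzero there, and it vanishes linearly in $\alpha$.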

According to Ref. [8], the matrix on the right side of Eq. (38) is actually the adjugate matrix for the original characteristic matrix in the limit of $\alpha \rightarrow 0$ if $H_0$ is not degenerate:

$$\begin{aligned} |n^{(0)}\rangle = \lim _{\alpha \rightarrow 0} \frac{\alpha }{\mathscr {D}(\alpha )}\times \mathscr {D}(\alpha ) (H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1})^{-1}|c\rangle =\text {adj}(H_0-E_n^{(0)}\mathbbm {1})|c\rangle , \end{aligned}$$

(39)

where we have absorbed the finite constant factor $\displaystyle \lim _{\alpha \rightarrow 0}\alpha /\mathscr {D}(\alpha )$ into $|c\rangle$ by rescaling the normalization. Here, $\mathscr {D}(\alpha )$ is defined in Eq. (34). The adjugate $\text {adj}(\mathbbm {A})$ of a matrix $\mathbbm {A}$ is the transpose of the cofactor matrix $\mathbbm {C}$ whose ij element is the ij minor $\mathscr {M}_{ij}$ of the matrix $\mathbbm {A}$ multiplied by $(-1)^{i+j}$. The ij minor $\mathscr {M}_{ij}$ of the matrix $\mathbbm {A}$ is the determinant of the submatrix of $\mathbbm {A}$ in which the ith row and the jth column are eliminated from $\mathbbm {A}$. The matrix representation for the adjugate matrix $\text {adj}(H_0-E_n^{(0)}\mathbbm {1})$ is quite simple in the basis set $\{\,|n^{(0)}\rangle \,|\,n=1,\,2,\,\ldots \}$ for the original Hamiltonian $H_0$. The reason is that $H_0$ is diagonal with the eigenkets: all of the matrix elements are vanishing except for the nn element whose value is given by

$$\begin{aligned} \langle n^{(0)}|\text {adj}(H_0-E_n^{(0)}\mathbbm {1})|n^{(0)}\rangle =\prod _{k\ne n}(E_k^{(0)}-E_n^{(0)}). \end{aligned}$$

(40)

As a result, the limiting value of the operator $\mathscr {D}(\alpha )(H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1})^{-1}$ as $\alpha \rightarrow 0$ is actually the projection operator that selects the component parallel to $|n^{(0)}\rangle$ up to an overall constant factor of $\prod _{k\ne n}(E_k^{(0)}-E_n^{(0)})$:

$$\begin{aligned}\lim _{\alpha \rightarrow 0}\mathscr {D}(\alpha )(H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1})^{-1} =&\text {adj}(H_0-E_n^{(0)}\mathbbm {1})\nonumber \\=&|n^{(0)}\rangle \langle n^{(0)}|\prod _{k\ne n}(E_k^{(0)}-E_n^{(0)}). \end{aligned}$$

(41)

This is a manifestation of the primitive regularized Eq. (31). In Appendix A, we present a rigorous proof of the identity (41). By making use of the completeness,

$$\begin{aligned} \mathbbm {1}=\sum _{m=1}^\infty |m^{(0)}\rangle \langle m^{(0)}|, \end{aligned}$$

(42)

of the Hilbert space spanned by the eigenkets, we can express $|n^{(0)}\rangle$ in Eq. (39) in the form

$$\begin{aligned}|n^{(0)}\rangle =&\sum _{m=1}^{\infty }\text {adj}(H_0-E_n^{(0)}\mathbbm {1})|m^{(0)}\rangle \langle m^{(0)}|c\rangle \nonumber \\=&|n^{(0)}\rangle \langle n^{(0)}|c\rangle \prod _{k\ne n}(E_k^{(0)}-E_n^{(0)}), \end{aligned}$$

(43)

where we have made use of the identity (41) and the orthonormality relation for the unperturbed state kets in Eq. (11). The normalization can be fixed consistently by choosing $\langle n^{(0)}|c\rangle =1/\prod _{k\ne n}(E_k^{(0)}-E_n^{(0)})$. This leads to the trivial identity $|n^{(0)}\rangle =|n^{(0)}\rangle .$
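The identity (41) can be checked on a small example with exact rational arithmetic. The sketch below is illustrative only: the three-level spectrum `energies`, the rational orthogonal matrix `Q` whose columns stand in for the eigenkets, and all helper names are hypothetical choices, not taken from the paper. It builds a non-diagonal $H_0$, computes the adjugate from cofactors, and compares it with the projector times $\prod _{k\ne n}(E_k^{(0)}-E_n^{(0)})$.

```python
from fractions import Fraction as F

def minor(a, i, j):
    """Submatrix of a with row i and column j removed."""
    return [[a[r][c] for c in range(len(a)) if c != j]
            for r in range(len(a)) if r != i]

def det(a):
    """Determinant by cofactor expansion along the first row."""
    if len(a) == 1:
        return a[0][0]
    return sum((-1) ** j * a[0][j] * det(minor(a, 0, j)) for j in range(len(a)))

def adjugate(a):
    """adj(A)_{ij} = (-1)^{i+j} M_{ji}, the transpose of the cofactor matrix."""
    size = len(a)
    return [[(-1) ** (i + j) * det(minor(a, j, i)) for j in range(size)]
            for i in range(size)]

def matmul(a, b):
    size = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(size)) for j in range(size)]
            for i in range(size)]

# Hypothetical non-degenerate spectrum and eigenkets (columns of Q).
energies = [F(1), F(2), F(4)]
Q = [[F(3, 5), F(-4, 5), F(0)],
     [F(4, 5), F(3, 5), F(0)],
     [F(0), F(0), F(1)]]
dim = 3
n = 1  # index of the level E_n = 2 singled out in Eq. (41)

# H0 = sum_k E_k |k><k|, written in a basis where it is not diagonal.
H0 = [[sum(Q[i][k] * energies[k] * Q[j][k] for k in range(dim))
       for j in range(dim)] for i in range(dim)]

def shifted(alpha):
    """Regularized characteristic matrix H0 - E_n 1 + alpha 1."""
    return [[H0[i][j] - (energies[n] - alpha if i == j else F(0))
             for j in range(dim)] for i in range(dim)]

# Right side of Eq. (41): |n><n| times the product over k != n.
prod = F(1)
for k in range(dim):
    if k != n:
        prod *= energies[k] - energies[n]
expected = [[prod * Q[i][n] * Q[j][n] for j in range(dim)] for i in range(dim)]

adj0 = adjugate(shifted(F(0)))

# For alpha != 0, adj(A + alpha 1) equals D(alpha) (A + alpha 1)^{-1}.
alpha = F(1, 10**6)
adj_alpha = adjugate(shifted(alpha))
d_alpha = det(shifted(alpha))
identity_check = matmul(shifted(alpha), adj_alpha)
max_dev = max(abs(adj_alpha[i][j] - expected[i][j])
              for i in range(dim) for j in range(dim))

print("adj(H0 - E_n 1) matches the projector form:", adj0 == expected)
print("largest deviation at alpha = 1e-6:", float(max_dev))
```

Using `Fraction` keeps the comparison at $\alpha =0$ exact, so the agreement does not depend on a floating-point tolerance; the small-$\alpha$ deviation only illustrates how the limit in Eq. (41) is approached.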

3.2 Finding $|n\rangle$ with Lagrange multipliers

3.2.1 Regularized equation

The eigenvalue equation (7) is an indeterminate equation: the characteristic determinant ${\mathscr{D}}et[H_0-E_n^{(0)}+\lambda V-\Delta _n]$ for the perturbed system is vanishing and, therefore, the inverse $(H_0-E_n^{(0)}+\lambda V-\Delta _n)^{-1}$ does not exist. Following the approaches of Refs. [7, 8] and making use of the results in the previous subsection, we again carry out the Lagrange-multiplier regularization of the eigenproblem in Eq. (7) for the perturbed system with the regularization parameter $\alpha$ as

$$\begin{aligned} (H_0-E_n^{(0)}\mathbbm {1}+\lambda V-\Delta _n\mathbbm {1}+\alpha \mathbbm {1})| n(\alpha )\rangle =\alpha |c\rangle , \end{aligned}$$

(44)

where $\mathbbm {1}$ in the term $\alpha \mathbbm {1}$ and the arbitrary constant ket $|c\rangle$ are Lagrange multipliers. Note that the $\alpha$ dependence in the ket $|n(\alpha )\rangle$ disappears as we take the limit $\alpha \rightarrow 0$.
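As a numerical illustration of Eq. (44), the sketch below solves the regularized equation for a two-level system and watches the solution approach the exact eigenket direction as $\alpha$ decreases. The model (`E0`, `V`, `lam`), the ket `c`, and the helper `solve2` are hypothetical choices made for this example, and the exact perturbed energy is taken from the closed-form solution of the $2\times 2$ problem rather than from perturbation theory.

```python
import math

def solve2(m, b):
    """Solve the 2x2 linear system m x = b by Cramer's rule."""
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    return [(b[0] * m[1][1] - m[0][1] * b[1]) / det,
            (m[0][0] * b[1] - m[1][0] * b[0]) / det]

# Hypothetical two-level model: H = H0 + lam * V with H0 = diag(E0).
E0 = [1.0, 3.0]
V = [[0.0, 1.0], [1.0, 0.0]]
lam = 0.5
H = [[(E0[i] if i == j else 0.0) + lam * V[i][j] for j in range(2)]
     for i in range(2)]

# Exact lower eigenvalue of the symmetric 2x2 matrix H (closed form).
E_n = (H[0][0] + H[1][1]) / 2 - math.hypot((H[0][0] - H[1][1]) / 2, H[0][1])

# Exact eigenket direction from the first row of (H - E_n 1) v = 0.
exact_ratio = -(H[0][0] - E_n) / H[0][1]

c = [1.0, 1.0]  # arbitrary constant ket |c>

def regularized_ket(alpha):
    """Solve (H - E_n 1 + alpha 1) |n(alpha)> = alpha |c>, cf. Eq. (44)."""
    m = [[H[i][j] - ((E_n - alpha) if i == j else 0.0) for j in range(2)]
         for i in range(2)]
    return solve2(m, [alpha * c[0], alpha * c[1]])

alphas = [1e-2, 1e-4, 1e-6]
errors = []
for alpha in alphas:
    ket = regularized_ket(alpha)
    errors.append(abs(ket[1] / ket[0] - exact_ratio))
    print(f"alpha={alpha:g}  component ratio={ket[1] / ket[0]:.8f}  "
          f"error={errors[-1]:.2e}")
```

The regularized matrix is invertible for each nonzero `alpha` used here, and the component ratio of the solution tends to that of the exact eigenket, which is the sense in which the $\alpha$ dependence of $|n(\alpha )\rangle$ disappears in the limit. The overall normalization still depends on `c`, as in the text.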

3.2.2 Factorization of the operator

We can in principle carry out the regularization procedure for the full Hamiltonian in a straightforward way. However, it is more efficient to reuse the results of Sect. 3.1 for the unperturbed case, which leads to a systematic reduction. We therefore express the full operator as the product of the regularized unperturbed operator and a remainder. This factorization is a notable advantage of the Lagrange-multiplier-regularization formalism because it shortens the intermediate steps significantly. Such a factorization is not allowed in the original equation because the inverse does not exist. Thus we pull out the matrix $(H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1})$ from Eq. (44) as

$$\begin{aligned} (&H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1})\bigg [\mathbbm {1}+(H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1})^{-1} (\lambda V-\Delta _n\mathbbm {1})\bigg ]| n(\alpha )\rangle \nonumber \\&=\alpha |c\rangle . \end{aligned}$$

(45)

Without the Lagrange-multiplier-regularization procedure, we would be unable to find the inverse transformation. Following the strategy in the previous subsection, we separate the $\alpha$-dependent part of $|n(\alpha )\rangle$ as

$$\begin{aligned} |n(\alpha )\rangle = |n\rangle + \alpha \Delta |n(\alpha )\rangle , \end{aligned}$$

(46)

where $\alpha \Delta |n(\alpha )\rangle$ vanishes as $\alpha \rightarrow 0$. Substituting Eq. (46) into Eq. (45), we find that

$$\begin{aligned}&(H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1}) \bigg [\mathbbm {1}+(H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1})^{-1} (\lambda V-\Delta _n\mathbbm {1})\bigg ]| n\rangle \nonumber \\&=\alpha |c\rangle -\alpha M(\alpha ) \Delta |n(\alpha )\rangle , \end{aligned}$$

(47)

where $M(\alpha )$ is an analytic matrix at $\alpha =0$ which is defined by

$$\begin{aligned}M(\alpha )\equiv& (H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1})\nonumber \\&\times\bigg [\mathbbm {1}+(H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1})^{-1} (\lambda V-\Delta _n\mathbbm {1})\bigg ]. \end{aligned}$$

(48)

The factorized form in Eq. (47) reveals that the singularity is regularized by the $\alpha$-dependent term $\alpha \mathbbm {1}$ and that the remaining operator is free of singularity as $\alpha \rightarrow 0$ for the following reason. As we have discussed during the derivation from Eqs. (20) to (21), we can make use of the fact that $(\lambda V-\Delta _n\mathbbm {1})| n\rangle$ is orthogonal to $|n^{(0)}\rangle$. Thus the result is invariant under the insertion of the projection operator $\phi _n$ defined in Eq. (18) in front of $(\lambda V-\Delta _n\mathbbm {1})| n\rangle$:

$$\begin{aligned} (\lambda V-\Delta _n\mathbbm {1})| n\rangle =\phi _n(\lambda V-\Delta _n\mathbbm {1})| n\rangle . \end{aligned}$$

(49)

The insertion, however, makes it clear that the inverse $(H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1})^{-1}$, acting after the projection $\phi _n$, remains finite even in the limit $\alpha \rightarrow 0$. Then we find that

$$\begin{aligned}&(H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1}) \bigg [\mathbbm {1}+(H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1})^{-1}\phi _n (\lambda V-\Delta _n\mathbbm {1})\bigg ]| n\rangle \nonumber \\&=\alpha \big [\,|c\rangle -M(\alpha )\Delta |n(\alpha )\rangle \,\big ]. \end{aligned}$$

(50)

3.2.3 Finding inverse

We next multiply the inverse $(H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1})^{-1}$, which is the regularized matrix for the unperturbed eigenvalue equation, to the left on both sides of Eq. (50) to find that

$$\begin{aligned}&\bigg [\mathbbm {1}+(H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1})^{-1}\phi _n (\lambda V-\Delta _n\mathbbm {1})\bigg ]| n\rangle \nonumber \\&=\alpha (H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1})^{-1}\big [\,|c\rangle -M(\alpha )\Delta |n(\alpha )\rangle \,\big ]. \end{aligned}$$

(51)

Now the regularized matrix $(H_0-E_n^{(0)}+\lambda V-\Delta _n\mathbbm {1}+\alpha \mathbbm {1})$ has the inverse and we can solve the regularized eigenvalue equation (44) as

$$\begin{aligned}| n\rangle =&\bigg [\mathbbm {1}+(H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1})^{-1} \phi _n(\lambda V-\Delta _n\mathbbm {1})\bigg ]^{-1}\nonumber \\&\times\bigg [\alpha (H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1})^{-1} \bigg ]\big [\,|c\rangle -M(\alpha )\Delta |n(\alpha )\rangle \,\big ]. \end{aligned}$$

(52)

Because the operator product in front of $\big [\,|c\rangle -M(\alpha )\Delta |n(\alpha )\rangle \,\big ]$ in Eq. (52) is identical to $\alpha M^{-1}(\alpha )$, the second term becomes proportional to $\alpha \Delta |n(\alpha )\rangle$, which vanishes as $\alpha \rightarrow 0$.

At this stage, we can replace $\alpha (H_0-E_n^{(0)}\mathbbm {1}+\alpha \mathbbm {1})^{-1}|c\rangle$ with $\text {adj}(H_0-E_n^{(0)} )|c\rangle$ by rescaling $|c\rangle$ as is done in Eq. (39). Every matrix element of $\text {adj}(H_0-E_n^{(0)} )$ is vanishing except for the nn element. In the limit $\alpha \rightarrow 0$, we have

$$\begin{aligned}| n \rangle =& \bigg [\mathbbm {1}+(H_0-E_n^{(0)}\mathbbm {1} )^{-1} \phi _n(\lambda V-\Delta _n\mathbbm {1})\bigg ]^{-1} |n^{(0)}\rangle \langle n^{(0)}|c\rangle \nonumber \\&\times\prod _{k\ne n}(E_k^{(0)}-E_n^{(0)}), \end{aligned}$$

(53)

where we have used the identity in Eq. (40). We can absorb the factor $\prod _{k\ne n}(E_k^{(0)}-E_n^{(0)})$ into $|c\rangle$ to find that

$$\begin{aligned} | n\rangle = \left[ \mathbbm {1}-\frac{\phi _n}{E_n^{(0)}-H_0}(\lambda V-\Delta _n\mathbbm {1})\right] ^{-1} |n^{(0)}\rangle \langle n^{(0)}|c\rangle , \end{aligned}$$

(54)

where $\phi _n$ is the projection operator defined in Eq. (18). The result is consistent with Eq. (21) if we set $\langle n^{(0)}|c\rangle =1$. In the same manner as we have done with the derivation of Eq. (22) from Eq. (21), we find that

$$\begin{aligned} |n\rangle =\left\{ \sum _{k=0}^\infty \left[ \frac{\phi _n}{E_n^{(0)}-H_0}(\lambda V-\Delta _n\mathbbm {1})\right] ^k \right\} |n^{(0)}\rangle \langle n^{(0)}|c\rangle . \end{aligned}$$

(55)

Again, the result is consistent with Eq. (22) if we set $\langle n^{(0)}|c\rangle =1$. From this stage on, we may apply the same procedure to carry out the order-by-order calculation that is given in Sect. 2.4.
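The closed form (54) and the series (55) can be compared numerically on a two-level system. The sketch below is a hedged illustration, not the paper's procedure: the model (`E0`, `V`, `lam`) and all helper names are hypothetical, the normalization $\langle n^{(0)}|c\rangle =1$ is assumed, and the energy shift is taken as $\Delta _n=E_n-E_n^{(0)}$ with $E_n$ supplied by the exact $2\times 2$ eigenvalue, whereas in the text it follows from the order-by-order formulas of Sect. 2.4. The final check uses the standard intermediate-normalization relation $\Delta _n=\lambda \langle n^{(0)}|V|n\rangle$, which is assumed here to agree with the conventions of Sect. 2.

```python
import math

def matvec(m, v):
    """Product of a 2x2 matrix with a 2-component vector."""
    return [m[0][0] * v[0] + m[0][1] * v[1], m[1][0] * v[0] + m[1][1] * v[1]]

def solve2(m, b):
    """Solve the 2x2 linear system m x = b by Cramer's rule."""
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    return [(b[0] * m[1][1] - m[0][1] * b[1]) / det,
            (m[0][0] * b[1] - m[1][0] * b[0]) / det]

# Hypothetical two-level model in the eigenbasis of H0 = diag(E0).
E0 = [1.0, 3.0]
V = [[0.0, 1.0], [1.0, 0.0]]
lam = 0.5
n = 0  # state being perturbed
H = [[(E0[i] if i == j else 0.0) + lam * V[i][j] for j in range(2)]
     for i in range(2)]

# Exact lower eigenvalue of H and the resulting energy shift Delta_n.
E_exact = (H[0][0] + H[1][1]) / 2 - math.hypot((H[0][0] - H[1][1]) / 2, H[0][1])
delta = E_exact - E0[n]

# K = [phi_n / (E_n^(0) - H0)] (lam V - Delta_n 1); the first factor is
# diagonal in this basis, with a zero entry for the state n itself.
R = [0.0 if k == n else 1.0 / (E0[n] - E0[k]) for k in range(2)]
K = [[R[i] * (lam * V[i][j] - (delta if i == j else 0.0)) for j in range(2)]
     for i in range(2)]

ket0 = [1.0 if k == n else 0.0 for k in range(2)]  # |n^(0)>

# Eq. (54): |n> = (1 - K)^{-1} |n^(0)>.
one_minus_K = [[(1.0 if i == j else 0.0) - K[i][j] for j in range(2)]
               for i in range(2)]
ket_closed = solve2(one_minus_K, ket0)

# Eq. (55): |n> = sum_k K^k |n^(0)>, truncated after 30 terms.
ket_series = [0.0, 0.0]
term = ket0[:]
for _ in range(30):
    ket_series = [ket_series[i] + term[i] for i in range(2)]
    term = matvec(K, term)

# Consistency checks: eigenvalue equation and the energy-shift relation.
Hk = matvec(H, ket_closed)
residual = max(abs(Hk[i] - E_exact * ket_closed[i]) for i in range(2))
delta_from_ket = lam * matvec(V, ket_closed)[n]

print("closed form:", ket_closed)
print("series     :", ket_series)
print("residual of H|n> = E|n>:", residual)
```

In this example the geometric series converges quickly because the operator inside the brackets of Eq. (55) is small for the chosen coupling; for stronger perturbations the truncated sum may need more terms or may fail to converge, while the closed form (54) remains usable as long as the inverse exists.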

4 Conclusion

A conventional approach to the time-independent Schrödinger equation with a small perturbation is the Rayleigh–Schrödinger perturbation theory, which is based on the direct calculation of the eigenvalue and wavefunction of a system by explicit series expansions. At lower orders of the series, the formulas are simple, but difficulties arise at higher orders. Another well-known approach is the Brillouin–Wigner perturbation theory, which has a much simpler form at higher orders. However, it has the disadvantage that the wavefunction is expressed in terms of the perturbed energy, which must itself be determined. Besides these two approaches, several other methods have been developed. We have derived the energy and wavefunction of the perturbed system by making use of a conventional approach for comparison.

We have presented a new formalism for the time-independent Schrödinger equation with a small perturbation. This new formalism regularizes the original Schrödinger equation with matrix-valued Lagrange undetermined multipliers. The regularization allows one to carry out the inverse transformation, which replaces the time-consuming procedure of Gaussian elimination [7, 8] to obtain the eigenvector. The key mechanism of the new Lagrange-multiplier regularization formalism is to regularize the secular determinant, which vanishes in the eigenvalue problem, by adding new degrees of freedom to the characteristic matrix. This regularization makes the characteristic matrix invertible, and the eigenvector is easily obtained from the inverse of the characteristic matrix, followed by applying the constraint equation. We note that this procedure is similar to adding gauge-fixing terms to the Lagrangian density of gauge fields in gauge-field theories.

We have applied the Lagrange-multiplier regularization formalism to the unperturbed system to demonstrate how it works in solving an eigenproblem. This demonstration is intended to help readers understand how the formalism solves an eigenvalue equation and how the secular determinant is regularized with a regularization parameter. With an appropriate normalization factor, we have found that the result is a trivial identity that reproduces the unperturbed eigenket. Another purpose of this demonstration is that some of the formulas appearing in it are actually used in finding the energy and wavefunction of the perturbed system.

Then, we have applied the Lagrange-multiplier regularization formalism to the perturbed system. By regularizing the perturbed Schrödinger equation with matrix-valued Lagrange multipliers and a regularization parameter, we have found the inverse of the corresponding characteristic matrix in the regularized equation. By taking the constraint equation, the eigenvector reduces to the adjugate of the characteristic matrix multiplied by an arbitrary vector. It is remarkable that the regularized characteristic matrix factorizes, with one factor being the characteristic matrix of the unperturbed system, and that this factor contains all the singular structure of the characteristic matrix of the perturbed system. Finally, we have obtained the energy and wavefunction with all-order corrections in a compact form, which are consistent with those in the conventional approach. This shows that the Lagrange-multiplier method works well in time-independent perturbation theory and may be a powerful tool for solving eigenproblems in physics. In finding the solution, we have never relied on an iterative method. Instead, we have computed the all-order corrections in a compact form in a straightforward manner.

We have assumed that the Hamiltonian of the unperturbed system is not degenerate. However, the Lagrange-multiplier regularization formalism for perturbation theory that we have developed in this paper can be extended systematically to make it applicable to a degenerate case. One subtle point is that the inverse of the regularized matrix is then not expressed in terms of its adjugate matrix. This problem can be resolved by taking into account the diagonal form of the unperturbed Hamiltonian, as in the proof in Appendix A. The degeneracy of the eigenstates is expected to be lifted in the perturbed system, depending on the form of the perturbing potential.

Regularization of a set of indeterminate linear equations by adding new degrees of freedom is a key point of the Lagrange-multiplier regularization formalism, which is powerful and can be applied to a variety of fields in physics and science. The time-independent perturbation theory has now been shown to be another concrete example of the applications of the Lagrange-multiplier regularization formalism. Since the Lagrange-multiplier regularization formalism is more general than the use of Lagrange multipliers in classical mechanics, we believe that the new formalism in this paper will give more insight into not-yet-known applications of Lagrange undetermined multipliers as well as into the time-independent perturbation theory in quantum mechanics.

References

[1] D. J. Griffiths, Introduction to Quantum Mechanics, 2nd ed., Pearson Prentice Hall, ISBN-13: 978-0131118928 (2004)
[2] J. J. Sakurai, Modern Quantum Mechanics, Benjamin/Cummings, Inc. (1985)
[3] E. Schrödinger, Ann. Physik 80, 437 (1926)
[4] W. Silvert, Am. J. Phys. 40, 557 (1972)
[5] W.K. Niblack, B.P. Nigam, Am. J. Phys. 38, 101 (1970)
[6] L. Lain, A. Torre, Eur. J. Phys. 8, 178 (1987)
[7] W. Han, D.-W. Jung, J. Lee, C. Yu, J. Korean Phys. Soc. 78, 1018 (2021)
[8] D.-W. Jung, W. Han, U-R. Kim, J. Lee, C. Yu, Finding normal modes of loaded string with Lagrange multipliers, J. Korean Phys. Soc. https://doi.org/10.1007/s40042-021-00314-9
[9] J.-H. Ee, D.-W. Jung, U.-R. Kim, D. Kim, J. Lee, Eur. J. Phys. 42, 055016 (2021)

Acknowledgements

As members of the Korea Pragmatist Organization for Physics Education (KPOP$\mathscr {E}$), the authors thank the remaining members of KPOP$\mathscr {E}$ for useful discussions. The work of JL and URK is supported in part by Grants funded by the Korea government (MSIT) under Contract No. NRF-2020R1A2C3009918. The work of DWJ and CY is supported in part by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education 2018R1D1A1B07047812 (DWJ) and 2020R1I1A1A01073770 (CY), respectively. The work is also supported in part by the National Research Foundation of Korea (NRF) under the BK21 FOUR program at Korea University, Initiative for science frontiers on upcoming challenges.

Author information

Authors and Affiliations

Department of Physics, Korea University, Seoul, 02841, Korea
Chaehyun Yu, Dong-Won Jung, U-Rae Kim & Jungil Lee


Contributions

All authors contributed equally to this work.

Corresponding author

Correspondence to Jungil Lee.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Director of Korea Pragmatist Organization for Physics Education (KPOP$\mathscr {E}$) Collaboration.

Appendix A: Proof of Eq. (41)

In this appendix, we present a rigorous proof of Eq. (41). The proof of the first identity is given in Ref. [8] and we illustrate the proof of the second identity:

$$\begin{aligned} \text {adj}(\mathbbm {A})= \text {adj}(H_0-E_n^{(0)}\mathbbm {1})=|n^{(0)}\rangle \langle n^{(0)}|\prod _{k\ne n}(E_k^{(0)}-E_n^{(0)}). \end{aligned}$$

(A.1)

We define

$$\begin{aligned} \mathbbm {A}\equiv H_0-E_n^{(0)}\mathbbm {1}. \end{aligned}$$

(A.2)

Because $|n^{(0)}\rangle$ is an eigenket of $H_0$ with the eigenvalue $E_n^{(0)}$, $H_0$ is a diagonal matrix in the basis set $\{\,|n^{(0)}\rangle \,|\,n=1,\,2,\,\cdots \}$. In this basis, the matrix representation of $\mathbbm {A}= H_0-E_n^{(0)}\mathbbm {1}$ is

$$\begin{aligned} \mathbbm {A}=(A_{ij})= (\langle i^{(0)}|\mathbbm {A}|j^{(0)}\rangle )=\text {diag}( E_1^{(0)}-E_n^{(0)}\,\cdots \, E_{n-1}^{(0)}-E_n^{(0)}~~ 0~~ E_{n+1}^{(0)}-E_n^{(0)}\, \cdots ). \end{aligned}$$

(A.3)

Therefore, the matrix $\mathbbm {A}$ can be expressed as

$$\begin{aligned} \mathbbm {A} =\sum _{k\ne n}|k^{(0)}\rangle \langle k^{(0)}|(E_k^{(0)}-E_n^{(0)}). \end{aligned}$$

(A.4)

This is a diagonal matrix with the nn element vanishing. Thus the nth column and nth row are both completely vanishing.

The adjugate of a matrix $\mathbbm {A}$ is the transpose of the corresponding cofactor matrix $\mathbbm {C}$:

$$\begin{aligned} \text {adj}(\mathbbm {A})=\mathbbm {C}^T=(C_{ji})=[(-1)^{i+j}\mathscr {M}_{ji}]. \end{aligned}$$

(A.5)

The ij element $C_{ij}$ of the cofactor matrix $\mathbbm {C}$ is the ij minor $\mathscr {M}_{ij}$ of the matrix $\mathbbm {A}$ multiplied by $(-1)^{i+j}$. The ij minor $\mathscr {M}_{ij}$ of the matrix $\mathbbm {A}$ is the determinant of the submatrix of $\mathbbm {A}$ in which the ith row and the jth column are eliminated from $\mathbbm {A}$. Except for the single case $i=j=n$, the ij minor $\mathscr {M}_{ij}$ of the matrix $\mathbbm {A}$ vanishes because the submatrix retains at least one null column or null row:

$$\begin{aligned} \mathscr {M}_{ij}=0,\quad (i,j)\ne (n,n). \end{aligned}$$

(A.6)

The only nonvanishing ij minor is for $i=j=n$ whose value is

$$\begin{aligned}\mathscr {M}_{nn}&={\mathscr{D}}et [\text {diag}( E_{1}^{(0)}-E_n^{(0)}~~ E_{2}^{(0)}-E_n^{(0)} \ldots ~ E_{n-1}^{(0)}-E_n^{(0)}~~ E_{n+1}^{(0)}-E_n^{(0)}\ldots )]\nonumber \\&=\prod _{k\ne n}(E_k^{(0)}-E_n^{(0)}). \end{aligned}$$

(A.7)

As a result, the cofactor matrix is

$$\begin{aligned} \mathbbm {C}=|n^{(0)}\rangle \langle n^{(0)}|\prod _{k\ne n}(E_k^{(0)}-E_n^{(0)}). \end{aligned}$$

(A.8)

This is a symmetric matrix whose only nonvanishing element is $C_{nn}$. As a result,

$$\begin{aligned} \text {adj}(\mathbbm {A})=\mathbbm {C}^T=|n^{(0)}\rangle \langle n^{(0)}|\prod _{k\ne n}(E_k^{(0)}-E_n^{(0)}). \end{aligned}$$

(A.9)

This completes the proof of Eq. (41).

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.


About this article

Cite this article

Yu, C., Jung, DW., Kim, UR. et al. Time-independent perturbation theory with Lagrange multipliers. J. Korean Phys. Soc. 79, 1104–1113 (2021). https://doi.org/10.1007/s40042-021-00328-3


Received: 24 September 2021
Accepted: 25 October 2021
Published: 15 December 2021
Issue Date: December 2021
DOI: https://doi.org/10.1007/s40042-021-00328-3
