Abstract
In this paper we compare three different orthogonal systems in \(\textrm{L}_2({\mathbb {R}})\) which can be used in the construction of a spectral method for solving the semi-classically scaled time dependent Schrödinger equation on the real line, specifically, stretched Fourier functions, Hermite functions and Malmquist–Takenaka functions. All three have banded skew-Hermitian differentiation matrices, which greatly simplifies their implementation in a spectral method, while ensuring that the numerical solution is unitary—this is essential in order to respect the Born interpretation in quantum mechanics and, as a byproduct, ensures numerical stability with respect to the \(\textrm{L}_2({\mathbb {R}})\) norm. We derive asymptotic approximations of the coefficients for a wave packet in each of these bases, which are extremely accurate in the high frequency regime. We show that the Malmquist–Takenaka basis is superior, in a practical sense, to the more commonly used Hermite functions and stretched Fourier expansions for approximating wave packets.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Highly oscillatory wave packets play an important role in quantum mechanics. A case in point is the semi-classically scaled time dependent Schrödinger equation (TDSE), which describes the quantum dynamics of the nuclei in a molecule [17]. It has the form
where \(\psi (x,t) \in {\mathbb {C}}\), \(x \in {\mathbb {R}}^{d}\) and \(t \in {\mathbb {R}}\). The Laplacian is \(\Delta = \sum _{k=1}^d \partial ^2/\partial x_k^2\), while V is a smooth potential, \(V(x):{\mathbb {R}}^{d} \rightarrow {\mathbb {R}}\).
This equation is important in a wide range of fields such as physical chemistry. However, solving TDSE comes with a number of computational challenges. A major reason is the small parameter \(\varepsilon \ll 1\), where \(\varepsilon ^{2}\) describes the mass ratio of an electron and the nuclei. Consider Eq. (1.1) without V(x) (the so-called free Schrödinger equation), then it can be shown that the solution to this problem generates oscillations of frequency \({\mathcal {O}}\left( \varepsilon ^{-1} \right) \) in space and time [16]. Moreover, the current techniques used to solve Eq. (1.1) are mainly defined on a finite interval. In that instance Fourier spectral and pseudo-spectral methods work well in practice [6]. However, these approaches require the imposition of periodic boundary conditions, which may lead to non-physical behaviour over longer time frames, because the solution inevitably reaches the non-physical periodic boundary of the domain. Modern applications like quantum control also require much longer time frames. One solution to this problem, is to solve TDSE on the whole real line using spectral methods and this underlies the setting of this paper.
In designing a spectral method, we first need to pick basis functions that approximate the solution to Eq. (1.1). In this paper, we restrict our focus to systems that are orthonormal in \(\textrm{L}_{2}({\mathbb {R}})\) and have banded skew-Hermitian differentiation matrices. When the basis has a skew-Hermitian differentiation matrix, the solution operator is naturally unitary, so the Born interpretation of quantum mechanics is respected. As a result, these methods are also stable by design [9, 11].
Heller in 1976 proposed that the solutions to the semi-classical TDSE can be approximated by Gaussian type functions i.e. wave packets [10]. Hagedorn proved error bounds for time dependent wave packets in 1980 [8]. Lubich in 2008, proved that for travelling wave packets with fixed width \(\sim \varepsilon ^{1/2}\) that a representation of the solution by wave packets has an \(\textrm{L}_{2}\) error of \({\mathcal {O}}\left( \varepsilon ^{1/2}\right) \) for time intervals of size \(t \sim 1\) [18]. Thus, wave packets are a good first approximation to the solution to semi-classical TDSE. In fact, for a quadratic potential with Gaussian initial data, they yield an exact solution to equation (1.1) [18]. Thus, a major yardstick for the suitability of basis functions for Eq. (1.1) is how well they approximate wave packets.
A univariate wave packet function has the form \(f(z)=\textrm{e}^{-\alpha (z-x_{0})^{2}}\cos (\omega z)\), where \(\alpha >0\), \(x_{0} \in {\mathbb {R}}\) and \(\omega \in {\mathbb {R}}\). The width of the wave packet is controlled by \(\alpha \), \(x_{0}\) is the position where the wave packet is centred and \(\omega \) is the frequency of oscillations. For simplicity much of our analysis is restricted to \(x_{0}, \omega >0\) and this should be assumed unless stated otherwise. The expansion coefficients of a wave packet function, f(x), in an orthonormal set of \(\textrm{L}_{2}({\mathbb {R}})\) basis functions, \(\{\varphi _n\}_{n \in I} \) where I is a countable index set such as \({\mathbb {Z}}\), is given by the following inner product,
where
Much of our analysis concerns the asymptotics of \(b_n(\omega )\) in the large-\(\omega \) regime, from which results about \(a_n(\omega )\) follow. Note we use the notation \(f(x) \sim g(x)\) for \(x \rightarrow \infty \) to denote that functions f(x) and g(x) are asymptotically equivalent as \(x \rightarrow \infty \). This can be rewritten as \(f(x) = g(x)[ 1 + o(1)]\) [4].
The three systems under consideration in this paper are
-
Stretched Fourier functions, which are simply the Fourier functions scaled to an interval \([-\lambda , \lambda ]\),
$$\begin{aligned} \varphi _{n} (x):= {\left\{ \begin{array}{ll} (2\lambda )^{-1/2}\textrm{e}^{\textrm{i}\pi n x/\lambda }&{}\text{ if } x \in [-\lambda ,\lambda ], \\ 0 &{} \text{ for } x \in {\mathbb {R}}\setminus [-\lambda ,\lambda ]. \end{array}\right. } \end{aligned}$$ -
Hermite functions [22, Table 18.3.1]
$$\begin{aligned} \varphi _{n}(x):= \frac{1}{(2^{n}n!\pi ^{1/2})^{1/2}} \textrm{H}_n(x)\textrm{e}^{-x^2/2}, \qquad n \in {\mathbb {Z}}_+. \end{aligned}$$ -
Malmquist–Takenaka (MT) functions
$$\begin{aligned} \varphi _{n}(x):= \textrm{i}^{n}\sqrt{\frac{2}{\pi }}\frac{(1 + 2\textrm{i}x)^{n}}{(1 - 2\textrm{i}x )^{n+1}}, \qquad \qquad \qquad n \in {\mathbb {Z}}. \end{aligned}$$
In this paper, we derive an asymptotic estimate of the coefficients in each basis, which will allow us to compare and evaluate how well they approximate wave packets on the whole real line. For simplicity and without loss of generality we restrict our analysis to \(\alpha , x_{0}, \omega , n >0\).
For stretched Fourier functions, the asymptotic estimate of coefficients is described in Theorem 2.1. In its more streamlined version, for any given \(\varepsilon > 0\) it is possible to choose the stretching constant \(\lambda \) such that
A specific choice that satisfies this bound is
From this estimate we expect the logarithm of the magnitude of the expansion coefficients to resemble a hump, very similar to a sombrero, and we can observe these characteristics in Fig. 1. The sombrero-like hump tells us that the coefficients decay spectrally until the small constant \(\varepsilon \) eventually dominates the estimate. This drop off in decay rate is real, and for \(\alpha = 1, x_{0} =0\) and \(\omega =50\) in Fig. 1 occurs at about \(\varepsilon =10^{-20}\), in line with Eq. (1.3). We have a pessimistic certainty that for all n satisfying,
we have \(|a_{n,\lambda }(\omega )| \ge 2\varepsilon \). The value of the stretching constant \(\lambda \) is discussed further in Sect. 2. One clear advantage of using the stretched Fourier basis is that we can take advantage of the structure seen in Fig. 1, where we can confine our calculation to approximating N coefficients, the complement of which is guaranteed to satisfy \(|a_{n}(\omega )| \le \varepsilon \), by a single Fast Fourier Transform (FFT) in \({\mathcal {O}}(N\log _2 N)\) operations.
In summary, stretched Fourier functions hold a great deal of promise; we have proved that the coefficients exhibit spectral decay and can be approximated using FFT. However, in a more practical setting, our method of determining the optimal cut-off of coefficients requires information about our wave packet which is typically unavailable. Moreover, a typical solution resembles an unknown linear combination of wave packets, whereas the nice performance e.g. in Fig. 1 is optimized for a single wave packet. This is discussed in more detail in Sect. 2.
The next contender is the basis of Hermite functions [22, Table 18.3.1]. Deferring the full asymptotic estimate to Theorem 3.1, a simplified version for \(n > 0\) and \(\alpha > 1/2\) is
where \({\mathcal {C}}_{\textrm{osc}}\) is a bounded oscillatory component. It is difficult to deduce the ‘shape’ of these coefficients from this expression. However, to help convince the reader, let us set \(c_{n,\omega }= n/\omega ^{2}\) to be constant—we see in Sect. 3.2.3 that this is a reasonable assumption and note that this relationship keeps n and \(\omega \) large. Then we have
-
A factor of \(1/\sqrt{\omega }\) which gently drives the size of \(a_{n}(\omega )\) down.
-
A factor, greater than one, to the power n drives the size up geometrically.
-
The argument of the exponential is essentially dominated by the competing \(\omega \) and n term. If we replace \(n = c_{n,\omega }\omega ^{2}\) and note that for \(\alpha > 1/2\) this term is negative, we obtain rapid decay to zero.
Thus, overall we get geometric decay. The above expression matches the envelope of the \(a_{n}\)’s exceedingly well. In Fig. 2, we can see that for large \(\omega \) and small \(\alpha \) and \(x_{0}\) we can clearly observe spectral decay.
To understand how the number of coefficients grows with respect to \(\omega \), consider the following crude inequality, the number of coefficients which are greater than \(\varepsilon \) in absolute value are all the n’s that such that
where \(c_{n,\omega }\) is a constant, and \(\alpha \) and \(x_{0}\) are fixed. Note that number of coefficients increases linearly with n but quadratically with \(\omega \) (as \(n=c_{n,\omega }\omega ^{2}\)). Figure 3 displays the number of coefficients required for varying \(\varepsilon \) and fixed \(\omega \).
Although it is natural to think of Hermite functions in a quantum mechanical setting, since they are the eigenfunctions of the quantum harmonic oscillator, we show in Sect. 3, that the convergence properties leave much to be desired. The decay of Hermite function coefficients for highly oscillatory wave packets with small \(\alpha \) and \(x_{0}\) is fast indeed, but as \(\alpha \) and \(x_0\) increase, this soon ceases to be true in practical terms.
Out of our three bases, Malmquist–Takenaka functions are the strongest contender when approximating wave packets on the real line for two reasons: firstly, the asymptotic estimate in Theorem 4.2 demonstrates spectral decay and secondly, the coefficients can be approximated using FFT. To give a brief insight into our findings, the asymptotic estimate of coefficients is
for \(\omega \ge 0 \). This holds for large \(|\omega |\) regime and uniformly for n satisfying \(1/4 \le \left| n/\omega \right| \le c\) for any given constant c (this is made more precise in Sect. 4). We can immediately deduce exponential decay with respect to n within this regime relative to \(\omega \). This is be confirmed experimentally in Fig. 4.
This bound highlights how the presence of high oscillations can be an advantage. In the case of no oscillations, \(\omega =0\), Weideman found in [26] that the coefficients decay sub-exponentially. However, counterintuitively, as \(\omega >0\) grows, the estimate in (1.4) exhibits exponential convergence for \(n \le c \cdot \omega \) for some c. In other words, the presence of oscillations means that we see exponential convergence within a domain and the range of the domain increases for larger values of \(\omega \). In the context of practical computation, for very highly oscillatory wave packets the coefficients will essentially exhibit exponential decay within the practical range of accuracy.
Our asymptotic estimate shows that we should expect our coefficients to decay spectrally. From (1.4), we can readily deduce the location of the peak which is given by
Interestingly, the location of the peak does not depend on how much we stretch the wave packet. The number of coefficients \(n>0\) that are greater than \(\varepsilon \) in absolute value is asymptotically equal to
That is the number of coefficients required to resolve a wave packet in MT basis grows linearly with \(\omega \), which beats Hermite functions where the number of coefficients grow quadratically in \(\omega \)!
Overall, MT functions seem to provide the most practical approach in approximating wave packets. The coefficients can be approximated rapidly using the FFT and we have proved that they exhibit asymptotic exponential decay. A more in-depth discussion about these bases can be found in Sect. 4.
Detailed derivation of the above asymptotic estimates is discussed in the sequel.
2 Stretched Fourier Basis
Stretched Fourier functions are the familiar Fourier functions scaled to the interval \([-\lambda , \lambda ]\), where \(\lambda >0\) can be chosen based on the function f to be approximated and a desired error tolerance \(\varepsilon \). Fourier functions are the natural basis in the presence of periodic boundary conditions in a compact interval, but in our setting we are truncating a function which is defined on the whole real line to the interval \([-\lambda ,\lambda ]\), and must contend with an error introduced by the Gibbs effect at the endpoints. The basis functions, for a subspace of \(\textrm{L}_2({\mathbb {R}})\), have the form
For a wave packet, \(f(x) = \textrm{e}^{-\alpha (x-x_{0})^{2}}\cos (\omega x)\), where \(\alpha > 0\), \(x_{0},\omega \in {\mathbb {R}}\), an integral expression for the expansion coefficient is found by substituting (2.1) into (1.2) to get
It is clear that a set of N of these coefficients can be approximated by a single FFT in \({\mathcal {O}}(N\log _2 N)\) operations.
Dietert and Iserles in their unpublished report [5] showed that stretched Fourier expansions for functions of the form \(f(x) = g(x) \exp (-\alpha x^2)\) (where g(x) is an entire function which grows at most exponentially) converge at spectral speed down to a specified error tolerance, provided that care is taken when choosing \(\lambda \). An appealing result shown therein is the ‘sombrero phenomenon’, whereby the logarithm of the absolute value of the expansion coefficients is shaped like a sombrero. For the special case of the wave packets considered in this paper, we apply their method to arrive at the result in Theorem 2.1, and give an example to demonstrate the sombrero effect in Fig. 5.
Theorem 2.1
(Coefficients for Stretched Fourier) Suppose that \(f(x) = \textrm{e}^{-\alpha ( x-x_{0})^{2}}\cos (\omega x)\), where \(\alpha >0\), \(x_{0}, \omega \in {\mathbb {R}}\). The coefficients of f in the stretched Fourier basis (as in Eq. (2.1)) with stretching parameter \(\lambda \ge |x_0|\) satisfy
for all \(n \in {\mathbb {Z}}\).
Before proving the theorem, let us discuss this sombrero phenomenon in more detail. We do not expect the reader to be familiar with millinery terminology, so allow us to explain that the main part of a hat in which the head sits is called the crown and the projecting edge is called the brim. Drawing the analogy with a sombrero seen in Fig. 5, the crown of the sombrero exhibits super-exponential decay from the central peak and the brim of the sombrero is where the decay rate is very slow—specifically, \({\mathcal {O}}(n^{-1})\), as predicted by standard Fourier theory for a function of bounded variation. Note however, that if \(\lambda \) is sufficiently large, as it turns out to be the case in Fig. 5, it is also a region where the magnitude of the coefficients is smaller than the accuracy we need for practical computation. Thus we can truncate the series for an approximation scheme and attain effective spectral convergence down to our desired accuracy.
Proof of Theorem 2.1
Essentially, we evaluate the integrals in Lemma 3 and 4 from [5] exactly. Consider the modified coefficient given by an integral over the whole real line, which is much more amenable to direct calculation:
From this we have the bound,
A bound for the original coefficient can then be found as follows.
In the last step we have used the known inequality,
which holds for \(t \ge 0\). Substituting the bound for \(|{\check{a}}_{n,\lambda }(\omega )|\) obtained above yields the result of the theorem. \(\square \)
2.1 The Choice of Stretching Parameter \(\lambda \)
It was found in [5] that the position of the sombrero brim depends on our choice of \(\lambda \). In order to choose an appropriate \(\lambda \), we need to understand the characteristics of our bound in (2.3). It requires us to choose \(\lambda \ge |x_{0}|\), if we instead choose \(\lambda \) less than \(|x_0|\) then we obtain a poor approximation to the original wave packet. This can be seen in Fig. 6, where the wave packet has much of its ‘mass’ outside the interval of approximation. The bound also suggests that we are limited by the magnitude of \(\exp \left( -\alpha (\lambda -|x_0|)^2\right) \). If this quantity is not smaller than some desired threshold \(\varepsilon \) then the coefficients are not guaranteed to decay spectrally below \({\mathcal {O}}(\varepsilon )\), but instead we expect them to decay like \({\mathcal {O}}\left( n^{-1}\right) \), until they reach such a threshold. This can be seen in Fig. 6, for \(\lambda = 1\), the \(\exp \left( -\alpha (\lambda -|x_0|)^2\right) \) term for \(\alpha = 1\) and \(x_{0}=2\) becomes \(\exp (-1) \approx 0.368\), resulting in slow decay. Whereas, for \(\lambda =9\), \(\exp (-49) \approx 5.24\times 10^{-22}\) so we see spectral decay past \(10^{-20}\).
There are many effective ways to define \(\lambda \); for a given \(\epsilon > 0\), a simple choice is,
in order to guarantee
Figure 7 demonstrates the dependence of the wave packet coefficients on the choice of \(\lambda \). If we choose \(\lambda \) to be smaller than \(\lambda _{\textrm{optimal}}\) we would need to choose \(n={{\mathcal {O}}}(\varepsilon ^{-1})={\mathcal {O}}(10^{20})\) in order to reach our desired accuracy, which is unrealistically large. For \(\lambda \) larger than our optimal value the brim of the sombrero lies lower but the thickness of the crown is greater and we need to choose large (at least, only linearly) n to reach \(10^{-20}\) accuracy. Thus even though we get spectral decay with stretched Fourier bases, this approach is extremely sensitive to how well we can define the optimal \(\lambda \). This effect can also be seen theoretically in Theorem 2.1.
In a more practical setting, a linear combination of wave packets is a more realistic approximate solution to the semi-classical TDSE and due to our current method of determining \(\lambda _{\textrm{optimal}}\) we need more knowledge about the exact solution before we can approximate it. Our analysis indicates that it is safer to overestimate \(\lambda \) than to underestimate it, providing a clue to a good strategy for a linear combination of wave packets. The current practical limitations of stretched Fourier basis notwithstanding, the functions still hold significant potential due to their simplicity, fast approximation of coefficients using FFT and spectral rate of decay.
2.2 Number of Coefficients Required for a given Tolerance
Given a tolerance \(\varepsilon > 0\), once we choose \(\lambda \) as in Eq. (2.4) we can guarantee that the coefficients satisfy (2.5). To gain a rough idea on the number of coefficients required for a given \(\varepsilon \), we do the following overestimation by considering \(|a_{n,\lambda }(\omega )| \ge 2\varepsilon \) and finding all the n’s that satisfy this bound. That is all n satisfying
Neglecting the complicated term inside the logarithm (i.e. bounding this inequality from above), we have
and, substituting \(\lambda =\lambda _{\textrm{optimal}}\) from (2.4), we conclude with
which emphasises the role different parameters play in the bound.
The important factor to note here is the linear dependence on both \(\omega \) and \(|x_0|\), which is extremely favourable compared to the other two bases considered in this paper. The downside, though, is that we require prior knowledge of \(x_0\), \(\alpha \) and \(\omega \) to choose \(\lambda \) and that the approximation of a linear combination of wave packets is likely to result in less favourable behaviour.
3 Hermite Functions
It is natural to think of Hermite functions in the context of quantum mechanics since they are solutions of the quantum harmonic oscillator. Hermite functions have a skew-symmetric and tridiagonal differentiation matrix, which implies that their associated spectral methods are stable and preserve unitarity [9]. However, computation of coefficients in an expansion in Hermite functions is associated with a number of practical challenges. An obvious method of computing expansion coefficients in Hermite functions with quadrature is by explicit integration of each coefficient at the overall cost of \({\mathcal {O}}(N^{2})\) operations. Nonetheless they are still a contender when approximating wave packets and, less intuitively, it is possible to compute the expansion coefficients in \({\mathcal {O}}(N(\log N)^2)\) operations using the fast multipole algorithm [2].
We consider normalised Hermite functions
The corresponding expansion integral is
where the \(\textrm{H}_n\)s are Hermite polynomials,
Theorem 3.1
(Coefficients for Hermite functions) Suppose that \(f(z) = \textrm{e}^{-\alpha ( z-x_{0})^{2}}\cos (\omega z)\), where \(\alpha > 0\), \(x_{0}\) and \(\omega \) are real. The asymptotic behaviour of the coefficients, \(a_{n}\), with Hermite basis functions in Eq. (3.1) is given by
The asymptotic estimate for \(\alpha >1/2\) holds uniformly for \(n > c_{n,\omega }\omega ^{2}\) for any given constant \(c_{n,\omega }\). For \(\alpha <1/2\), this estimate holds uniformly for \(n < c_{n,\omega }\omega ^{2}\), \(c_{n,\omega }\) is some constant, and n outside of any given neighbourhood of \(\omega ^{2}/(2(1-4\alpha ^{2}))\). These bounds hold uniformly as \(\omega \rightarrow \infty \).
The behaviour of the coefficients is hard to deduce by a direct examination of this expansion. It is helpful to break down the result into three regimes. Note that in a more practical setting we would expect \(\alpha >1/2\), so the following analysis considers the asymptotics for this case:
-
1.
When \(\omega \) is large,
$$\begin{aligned} a_{n}(\omega ) \sim \frac{{\mathcal {C}}_{\textrm{osc}}}{\sqrt{\omega }} \exp \left( -\frac{\omega ^{2}}{2(2\alpha + 1)}\right) , \end{aligned}$$where \({\mathcal {C}}_{\textrm{osc}}\) is a bounded oscillatory term. The asymptotic expression only depends on \(\omega \) and we expect exponential decay.
-
2.
When n is large,
$$\begin{aligned} a_{n}(\omega ) \sim {\mathcal {C}}\left| \frac{1 + \sqrt{1 +2\frac{n}{\omega ^{2}}\left( 4\alpha ^{2}-1\right) }}{\sqrt{2\frac{n}{\omega ^{2}}}(2\alpha +1)}\right| ^{n}\exp \left( \frac{n}{1+\sqrt{1+2\frac{n}{\omega ^{2}}\left( 4\alpha ^{2}-1\right) }} \right) . \end{aligned}$$The expression is still fairly opaque but in Fig. 9 we can see that the envelope of the logarithm of the absolute value of coefficients is approaching a straight line.
-
3.
When both n and \(\omega \) are large subject to a power law, \(n=\omega ^\lambda \) for some \(\lambda >0\), then
$$\begin{aligned} a_{n}(\omega )\sim & {} \frac{{\mathcal {C}}}{\sqrt{n^{1/\lambda }(2\alpha +1)}} \left| \frac{1 + \sqrt{1 +2n^{1-2/\lambda }\left( 4\alpha ^{2}-1\right) }}{\sqrt{2n^{1-2/\lambda }}(2\alpha +1)}\right| ^{n} \\{} & {} \times \exp \left( -\frac{n^{2/\lambda }}{2(2\alpha + 1)}+ \frac{n}{1+\sqrt{1+2n^{1-2/\lambda }\left( 4\alpha ^{2}-1\right) }}\right) , \end{aligned}$$
Comparing Fig. 9 to Fig. 2 we observe that increasing \(\alpha \) renders the rate of decay of coefficients worse. The same happens when we increase \(x_{0}\) in Fig. 10.
3.1 Fourier-type integral of Hermite Functions
The main idea behind the proof is to consider a Fourier type integral of Hermite functions,
Lemma 3.2
It is true that
for \({\textrm{Re}}\, c>0\), \(c\ne \frac{1}{2}\) and
The branch choice in Eq. (3.3) for \(\sqrt{2 c - 1}\), \(0< c < \frac{1}{2}\), is unimportant because only the even powers of \(\sqrt{2 c - 1 }\) occur on the right-hand side due to symmetry of the Hermite polynomials.
Proof
Consider Hermite functions in a Cauchy integral form,
where the contour \(\gamma \) encircles the origin. Substituting this into (3.3) and changing the order of integration,
For \(c = 1/2\), the proof of (3.4) is identical (and easier!) to the steps above: we deduce that
\(\square \)
We proceed by comparing (3.3) with (3.2) to deduce that
and
After some algebraic manipulation (see A.1) and setting
where for simplicity we restrict our analysis to \(\alpha , x_{0}, \omega >0\) so that X and Y are positive real numbers, we note that our expressions are different for different values of \(\alpha \),
Thereafter we focus on the asymptotics for \(\alpha \ne \frac{1}{2}\), the remaining case can be also obtained by a limiting process. Note that (3.7) are the exact coefficients for \(\alpha = 1/2\) and the plot of (3.7) can be seen in Fig. 11.
3.2 An asymptotic Estimate
3.2.1 The Method of Steepest Descent
The method of steepest descent (MSD), also known as the saddle point approach, is a key tool used in our Hermite and Malmquist–Takenaka estimates. We give a brief overview here; for a more in-depth review we refer to [4, 7, 21, 29]. It is an approximation method that can be applied to integrals of the form
where t is a large parameter. Let us assume that F(z) is a complex-valued meromorphic function and G(z) is an entire function.
The main idea is to deform the path of integration over a complex contour \(\Gamma \) to pass through (or near) certain saddle points of F(z), that is, points \(z^{*}\) such that \(F'(z^{*})=0\). As well as passing through a subset of saddle points, the path of integration is also designed so that \({\textrm{Im}}\,F(z)\) is constant, i.e. \({\textrm{Im}}\, F(z) = {\textrm{Im}}\, F(z^{*})\), which ensures that \(\exp (-t F(z))\) is non-oscillatory along the contour. If \(F''(z^{*}) > 0\) then \(\exp (-t F(z))\) decays rapidly away from \(z^{*}\) along this contour—the path of steepest descent. We assume at the outset that G(z) can be expanded as convergent power series about the neighbourhood of \(z^{*}\).
Skipping many technical details, suppose that there exists an ordered set of contours \(\Gamma _1, \Gamma _2,\ldots ,\Gamma _s\) such that each contour contains a single saddle point (\(z^{*}_{k} \in \Gamma _k\)), within each contour F satisfies \({\textrm{Im}}\, F(z) = {\textrm{Im}}\, F(z^{*}_{k})\), each saddle point in question satisfies \(F''(z^{*}_k) > 0\), G(z) can be expanded as a convergent power series about each \(z^{*}_{k}\), and such that \(\int _a^b \textrm{e}^{-t F(z)} \textrm{d}z = \sum _{k=1}^s \int _{\Gamma _k} G(z) \textrm{e}^{-t F(z)} \textrm{d}z\). Then as \(t \rightarrow \infty \) and \(G(z_{k}^{*}) \ne 0\), to first order approximation our integral can be expressed as
([4, Eq. 5.7.2]).
Remark 3.3
The power of this approach is that the first approximation depends only on the values of F, \(F''\) and G at a certain subset of the saddle points, and nowhere else. Note that the terms hidden in the \({\mathcal {O}}\) error term in (3.9) is a polynomial in \(G^{'}/G, G^{''}/G, F^{'''}, F^{(IV)}\) and \(1/F^{''},\) see [23, Eq. 2.4.16], evaluated at a subset of saddle points. These terms depend on z and \(\kappa \), where \(\kappa \) is a carefully chosen parameter with some dependence on t—more importantly they must remain controlled as \(t \rightarrow \infty \) thus imposing constraints on \(\kappa \). This ensures that (3.9) holds uniformly as \(t \rightarrow \infty \).
3.2.2 Steepest Descent: Hermite Functions
To find the asymptotic behaviour of (3.5), we consider \(\textrm{H}_{n}(z)\) for z given in (3.6) and (3.8), where n is a large parameter and Y is large in the complex argument, z. We start by considering Hermite functions in a Cauchy integral form,
where \(\gamma \) is a contour encircling the origin. Let \(\zeta =z/\nu \), so that
where \(\phi (w)=w^2-2\zeta w+\frac{1}{2}\log w\), \(\nu =\sqrt{2n+1}\). The saddle points are the zeros of \(\phi '\), namely \(w_\pm =\frac{1}{2}(\zeta \pm \textrm{i}\sqrt{1-\zeta ^2})\).
Deforming our path of integration to pass through the dominant saddle point, \(w_{*}\), then applying (3.9), gives us
– note that \(\phi ''(w_*)=2-1/(2 w_{*}^{2})\). This asymptotic approximation is valid for large values of n and \(\omega \). For small values of \(\omega \) a similar contribution from the remaining saddle point should also be included.
By remark 3.3, in order for the estimate in (3.10) to hold uniformly with respect to \(\zeta \) for \(\omega \rightarrow \infty \), we require the existence of \(c > 0\) such that \(c< |w_{*}|<c^{-1}\), which is guaranteed as long as \(\zeta \) is bounded. We also require \(|\zeta - 1| > \delta \), \(\delta \) being a small positive number, to avoid the coalescing of saddle points, in which case we would need to use uniform asymptotic methods, see [25, §23.4]. This leads to the combined conditions \(\delta<|\zeta - 1| < \delta ^{-1}\).
We have the key ingredient to find the asymptotic expansion of the Hermite coefficients. For \(\alpha > 1/2\), we want to apply (3.10) to \(\textrm{H}_{n}(z)\) in (3.6), the dominant root is \(w_{-}\) and \(w_-\phi ''(w_-)=-2\textrm{i}(1-\zeta ^2)^{1/2}\). Recall that for this case, \(\zeta = (X + \textrm{i}Y)/\nu \), where
\(\omega \) is a large parameter, while \(\alpha \) and \(x_{0}\) are fixed. For \(\alpha < 1/2\), we want to do the same, apply (3.10) to \(\textrm{H}_{n}(z)\) in (3.8), \(w_+\) root accounts for the main contribution to the asymptotic estimate and \(w_+\phi ''(w_+)=2\textrm{i}(1-\zeta ^2)^{1/2}\). In this case, \(\zeta = (Y - \textrm{i}X)/\nu \).
As highlighted in remark 3.3, care must be taken to constrain \(\zeta \) so that our approximation in (3.10) holds uniformly as \(\omega \rightarrow \infty .\) For \(\alpha \ne 1/2\) and large \(\omega \),
We have two competing large parameters n and \(\omega \), and we want to find a substitution which preserves the relationship between the two terms,
This satisfies the condition in remark 3.3 for \(\zeta \) to remain bounded for large n and \(\omega \). The additional condition in the remark for non-coalescing saddle points, \(\delta < |\zeta - 1|\) is equivalent to
3.2.3 \(\alpha > 1/2\): Deriving an Asymptotic Estimate
In a more practical setting, we would expect \(\alpha > 1/2\) and we commence by finding the asymptotics for this case. Equation (3.10) is instrumental in deriving first-order asymptotics when large degree coexists with large argument: higher order estimates can be obtained from [20], but we do not pursue this theme further.
We observe that for reasonable values of \(x_{0}\) we can simplify things by requiring \(Y\gg X\) so \(\zeta \sim \textrm{i}Y/\nu \). However, for more accurate asymptotics we want to retain \(x_{0}\) explicitly in our expansions. As discussed in the last section, to satisfy remark 3.3, (3.12) must hold for \(\zeta \) to remain bounded for large n and \(\omega \). Moreover, as \(\alpha > 1/2\) we have \(2(1-4\alpha ^{2}) <0\), so condition (3.13) is automatically satisfied. Thus the MSD approximation in (3.10) holds uniformly for \(n > c_{n,\omega }\omega ^{2}\) for any given constant \(c_{n,\omega }\). The relationship is demonstrated in our numerical experiments, shown in Fig. 12.
Applying (3.10) to (3.6) (with identical notation), we have
where
A is slowly varying for large n, Y and \(\nu = \sqrt{2 n + 1}\), while B is the dominant contribution, the fast component.
At this point, the reader can refer to Sect. A.2 for the full derivation of the asymptotics. To summarise, we substitute terms \(w_{-}\phi ''(w_{-})\), \(\phi (w_{-})\) and (3.11) into (3.14a) and (3.14b).
Setting \(n = c_{n,\omega }\omega ^{2}\) where \(c_{n,\omega }\) is some constant, then expanding about \(\omega \) we arrive at
Substituting A and B back into \(a_{n}\) and restoring \(c_{n,\omega } = n/\omega ^{2}\), we obtain the asymptotic estimate of coefficients,
Fig. 13 demonstrates how remarkably well our estimate models the size of the coefficients.
The dominant term in the Hermite estimate with the bounded oscillations (the terms in the curly brackets in (3.15)) removed has the form
3.2.4 \(\alpha < 1/2\) Asymptotics
We have used the same method as for \(\alpha > 1/2\) so we have omitted some technical details and the bulk of the algebra has been moved to the appendix.
To apply the MSD, we first need to satisfy the conditions in remark 3.3 which, in our case, translate to (3.12) and (3.13). For \(\alpha < 1/2\), (3.13) requires \(\omega ^{2}/n\) to be kept away from \(2(1-4\alpha ^{2})\). Combining the two constraints, the MSD estimates will hold uniformly for \(n > c_{n,\omega }\omega ^{2}\) and values of n outside any given neighbourhood of \(\omega ^{2}/(2(1-4\alpha ^{2}))\).
We proceed by applying, (3.10). For \(\alpha < 1/2\)
Note that for this case we apply the MSD on \(\textrm{H}_{n}(\zeta )\) for \(\zeta = (Y-\textrm{i}X)/\nu \) where our contour of integration passes from the saddle point \(w_{+}\). Like before, we can rewrite \(a_{n}\) in terms of A and B, the slow and fast components respectively,
Here, we have substituted \(w_{+}\phi ''(w_{+})\) and \(\phi (w_{+})\) in terms of X and Y, into A and B (see Sect. A.3). In (3.18) we have used Stirling’s formula.
From equations (3.17), (3.19) and (3.20), the key term we need to approximate is
Here we have substituted \(n=c_{n,\omega }\omega ^{2}\), like before, and we want to expand around \(\omega \). First thing to note is that the large \(\omega \) term is given by
Observe that there is a change in behaviour at
This tells us that for a small neighbourhood around \(c_{n,\omega }^{*}\), we would expect this region to have different asymptotics to the the region away from it. This agrees nicely with our constraints discussed at the start of this section, where we have identified that the non-coalescing saddle point regime in remark 3.3 is equivalent to considering \(c_{n,\omega }\) away from \(c_{n,\omega }^{*}\).
Going through the same steps as in the previous section and sparing the reader more algebra (see Sect. A.3.1 for full derivation), for \(c_{n,\omega }\ne c_{n,\omega }^{*}\) we obtain
The asymptotics are displayed in Fig. 15, which confirms the remarkable fit of our formulæ with computed values of the coefficients.
4 Malmquist–Takenaka Functions
MT functions have a long history. Discovered simultaneously by Malmquist [19] and Takenaka [24] in 1925, they have since been studied in fields ranging from signal processing (Wiener in 1949 [28]) to spectral methods (Christov in 1982 [3]). In [12], Iserles and Webb proved that this is essentially the only orthonormal and complete system in \(\textrm{L}_{2}({\mathbb {R}})\) with a banded tridiagonal skew-Hermitian differentiation matrix, and the coefficients can be computed using the Fourier transform. This is part of Iserles and Webb’s ongoing work classifying bases with banded skew-Hermitian differentiation matrices [13,14,15]. It was the work of Boyd in [1] and Weideman in [26, 27] that highlighted the interesting approximation properties of MT functions. The approximating characteristics of MT functions do not obey simple rules. As an example, for \(f(x)= 1/(1 + x^{4})\) the coefficients decay at a spectral rate of \(a_{n} \sim {\mathcal {O}}\left( (1 + \sqrt{2})^{-|n|}\right) \), however if we include an oscillatory factor so that our function is in the form \(f(x) = \sin (x)/(1 + x^{4})\), our approximation collapses from spectral to algebraic convergence, \(a_{n}\sim {\mathcal {O}}\left( |n|^{-9/4} \right) \), see Fig. 16. Why does multiplying by \(\sin (x)\), an entire function, have such a large effect on the rate of convergence? How do we get exponential convergence?
In [1] and [27] (respectively), Boyd and Weideman proved that MT expansion coefficients decay exponentially if and only if f(z) decays at least linearly as \(|z| \rightarrow \infty \) in the complex plane, and f is analytic in a region which may exclude \(\pm \textrm{i}/2\) but which must include the point at infinity. Analyticity at infinity is a strong condition to impose, since \(\sin z\), for example, blows up as \(z\rightarrow \infty \) along the imaginary axis. This is not unexpected because the proof of exponential convergence in a compact interval (e.g. with orthogonal polynomials) depends on functions being analytic within a Bernstein ellipse surrounding this interval.
Interestingly, Weideman (in [26]) found that the coefficients, \(a_{n}\), of a Gaussian function asymptotically decay close to exponential convergence, \(a_{n} \sim {\mathcal {O}}\!\left( \textrm{e}^{-\frac{3}{2}|n|^{2/3}}\right) \). Note, we do not have exact spectral decay as the Gaussian function has an essential singularity at \(\infty \). What does this mean for wave packet coefficients? We proved that for wave packet functions the coefficients exhibit asymptotic exponential convergence. In other words as \(\omega \) gets larger, we get closer to exponential convergence up to some threshold. For example, if \(f(x) = \textrm{e}^{-x^{2}}\cos (\omega x)\) the coefficients decay like \(a_{n} \sim {\mathcal {O}}\left( \textrm{e}^{-n/\omega }\right) \).This can be seen in Fig. 17, where even though the \(\cos (50 x)\) factor decreases the rate of convergence compared to just the Gaussian function, we still see asymptotic spectral decay, at least in the regime \(|a_{n}(\omega )| \ge 10^{-10}\).
To see how we have proved this, let us start by defining our system. From [12] the Malmquist–Takenaka system is defined as
Substituting (4.1) into (1.2) yields the expansion coefficients
where
where \(\alpha = \beta \omega , n = c_{n,\omega }\omega \) and we take the principal value of \(\tan ^{-1}\). We initially restrict ourselves to the case \(n \ge 0, \ \omega \ge 0\). The remaining cases will be considered in Sect. 4.3.
4.1 The Case \(\omega \ge 0\), \(n \ge 0\): the Method of Steepest Descent
This section draws inspiration from Weideman in [26], who found the rate of decay of MT coefficients for the Gaussian function, \(\textrm{e}^{-x^{2}}\) by applying MSD to the coefficient integral. Our analysis will be considerably more involved than that of Weideman’s for the Gaussian function, because we have two large parameters, n and \(\omega \). A brief explanation of the MSD can be found in Sect. 3.2.1.
Since \(\beta =\alpha /\omega \), it is bounded for large \(\omega \). We stipulate that n and \(\omega \) vary so that \(c_{n,\omega }\) is bounded: details will be discussed further in the next section.The main step of the MSD is to deform our integral in (4.3) over a complex contour that passes through (or near) a saddle point of (4.4). This represents the path of steepest descent. We can find the saddle points by solving
with some manipulation i.e. multiplying through by \(1+ 4z^{2}\) then collecting z terms, this reduces to the cubic
Our formulation in (4.3) has a nice symmetry aspect that comes from the \(\tan ^{-1}\). For example, when \(\beta =0\) there are two saddle points
As \(\alpha \) is not an asymptotic parameter, \(\beta \) will be small for large \(\omega \), and the two shown values \(z_\pm \) in (4.6) will be close to two exact saddle points, while the third one approaches infinity when \(\beta \rightarrow 0\).
We have for two saddle points \(z_2\) and \(z_3\) expansions
where \(c_{n,\omega }\) should be bounded away from \(\frac{1}{4}\) and
We obtain this from substituting (4.7) into (4.5), then collecting \(\beta \) terms to find the \(c_{k}^{(*)}\)s, the coefficients \(c_{k}\)s at saddle point \(z_{*} \in \{z_{1},z_{2},z_{3}\}\). For the third saddle point, \(z_1\), we use a property of the cubic equation. Writing the equation as \(a_3z^3+a_2z^2+a_1z+a_0=0\),
and \(z_3\) has the expansion
The two saddle points \(z_2\) and \(z_3\) become nearly equal when \(c_{n,\omega }\rightarrow \frac{1}{4}\). That is, there is a special case, when \(\omega \sim 4n\). In this case, we cannot use the MSD and will have to use uniform asymptotic methods—this method can be found in [21, 25, 29]. We do not consider this case in the paper and assume that \(c_{n,\omega }\ge \frac{1}{4} + \delta \) where \(\delta > 0\) is small. This is a necessary constraint to satisfy remark 3.3 and we discuss this further in the next section.
4.2 Large \(\omega \) Asymptotics
Our main goal is to find out how many MT functions are required to approximate a wave packet with a frequency of \(\omega \) to prescribed accuracy. We want to estimate the number of \(\{a_{n}: |a_{n}| = |b_{n}(\omega ) + b_{n}(-\omega )| \ge \varepsilon \}\) for any given \(\varepsilon > 0\). Our asymptotic expansion requires a subtle approach as both n and \(\omega \) are large parameters.
We have learnt from numerical experiments that it is a poor idea to fix one parameter and let the other become large i.e. fix \(\omega \) and let \(n \gg 1\), as the asymptotics from this regime miss on the spectral decay behaviour that we wish to capture. A simple explanation for this alludes to the relationship between n and \(\omega \) in the square-root term in Eq. (4.6).
We found that there was an intermediate range where n and \(\omega \) followed a relationship of the form
for any given constant c. This range satisfies remark 3.3 and ensures that the MSD approximation holds uniformly for \(\omega \rightarrow \infty \).
Writing (4.7) and (4.8) in orders of \(\omega \), i.e. substituting \(\beta = \alpha / \omega \), the roots have the form
We require \(c_{n,\omega }> \frac{1}{4}\) so \(\sqrt{c_{n,\omega }- \frac{1}{4}}\) is always positive. For \(\alpha , x_{0}, n, \omega \ge 0\), \(z_{3}\) stays in the third quadrant, \(z_{2}\) can be located in the first or fourth quadrant and \(z_{1}\) is located in first quadrant of the complex plane. Moreover, if \(x_{0} =0\), then \(z_{2}\) would only be in the fourth quadrant. From \(z_{1}\), we see that the root is shifted from the pure imaginary axis by \(x_{0}\). By requiring \({\textrm{Re}}\,z_{2} > x_{0}\), we get \(c_{n,\omega }>\frac{1}{4} + x_{0}^{2}\) and past this point we see that \({\textrm{Im}}\,z_{2} < 0\), so in this regime \(z_{2}\) is confined to the fourth quadrant.
The plot below suggests that \(z_2\) and \(z_3\) may be appropriate saddle points through which to deform the contour.
4.2.1 Paths of Steepest Descent
The path of steepest descent is chosen so that \({\textrm{Im}}\,g(z)\) is constant and passes through the saddle point \(z^{*}\) of g(z). Recall from equation (4.4) that
Substituting \(z = x+ \textrm{i}y\), and using the identity
the imaginary part of g(z) has the form
We choose the path of steepest descent to be the path where \({\textrm{Im}}\,g(z)={\textrm{constant}}\), the constant being chosen so that the path passes through the saddle point \(z^{*}\) i.e. path \(= \{z: {\textrm{Im}}\,g(z) = {\textrm{Im}}\,g(z^{*}) \}\). Figure 19, shows a plot of steepest descent paths, equation (4.12), for certain parameter values.
To construct our steepest descent contour \(\Gamma \), we split \(\Gamma = \Gamma ^{(3)} \cup \Gamma ^{(2)}\), where \(\Gamma ^{(2)}\) is the contour that passes through \(z_{2}\), i.e. the integral that goes from 0 to \(\infty \). Note that for \(n>0\) (4.2) has a pole at \(\textrm{i}/2\) and a zero at \(-\textrm{i}/2\) and it is sensible to avoid the pole in the upper half plane. From our discussion on the location of our roots in the last section, \(z_{2}\) can be in the first or fourth quadrant. For \(c_{n,\omega }> \frac{1}{4} + x_{0}^{2}\), \(z_{2}\) is in the fourth quadrant. For simplicity we will consider the root to be located in the fourth quadrant, but the same principle can be applied when the root is in the first quadrant. We start by defining contour \(\Gamma ^{(2)}\) at \(-\textrm{i}/2\):
-
This was chosen simply because we want to avoid the pole at \(\textrm{i}/2\). We can show that \(-\textrm{i}/2\) is always on our steepest descent path by substituting \(-\textrm{i}/2\) into (4.12) which gives \({\textrm{Im}}\,g(-\textrm{i}/2) = \beta x_{0}= \alpha x_{0}/\omega \) where \(\alpha , x_{0}, \omega > 0\) with no dependence on n. For a visual representation see the bottom plot of Fig. 19, where the steepest descent contour through \(z_{2}\) passes through both the pole and the zero. We can simply choose to construct our contour from \(-\textrm{i}/2\) as there is nothing to look for along the path from 0 to \(-\textrm{i}/2\). Moreover, the point \(z = -\textrm{i}/2\) is the central point of a valley which is why the path of steepest descent may visit this point.
-
By starting our contour from \(-\textrm{i}/2\) means our curve looks nice and smooth. The imaginary part converges to \(\omega /2\alpha \) whilst the real part tends to infinity. As the real part of our contour tends to infinity, the integrand tends to zero very fast. Although this step is not necessary, we can bring the tail back to the real line by travelling straight down, only varying the imaginary part of our path, to reach the real line and then carrying on towards infinity.
The same idea can be applied to the remaining integral, \(\Gamma ^{(3)}\), from \(-\infty \) to 0. For a schematic representation see Fig. 20.
The key argument used in constructing \(\Gamma ^{(2)}\) is that the imaginary part of the path of steepest descent converges to \(\omega / 2 \alpha \) as \({\textrm{Re}}\,z \rightarrow \infty .\) Note, this also eliminates the choice of \(z_{1}\) as \( {\textrm{Im}}(z_{1}) > \omega /2 \alpha \) which can be seen from Eq. (4.9). The same idea holds for \({\textrm{Re}}\,z \rightarrow -\infty \). We can observe this in Fig. 19 for a given \(\alpha \), \(\omega \) \(x_{0}\) and n.
Lemma 4.1
For sufficiently large x there exists a function \(y = \gamma (x)\) such that \(x + \textrm{i}y \in \Gamma \). Furthermore, \(\lim _{x \rightarrow \pm \infty } \gamma (x) = \omega /2\alpha .\)
Proof
Our steepest descent contour satisfies
where for some fixed n and \(\omega \) the RHS is constant. Let \(z_{*} = c + \textrm{i}d\), where c and d are constants. We have
We retrieve the \(\gamma (x)\) by the implicit function theorem. Dividing through by x and letting \(x \rightarrow \pm \infty \),
Taking the limit we are left with \(y = 1/(2\beta )= \omega /(2 \alpha )\), (where \(\beta = \alpha /\omega \)) as required. \(\square \)
Once the desired path is chosen, we can apply the MSD as discussed in Sect. 3.2.1. The integral in (4.3) can be rewritten as
where the path of integration \(\Gamma ^{(3)}\) starts at \(-\infty \), passes through the saddle point \(z_3\) and terminates at \(-\frac{1}{2}\textrm{i}\), and \(\Gamma ^{(2)}\) starts at that point, runs through the saddle point \(z_2\) and terminates at \(+\infty \).
The transformations
result in
whereFootnote 1
In the transformations in (4.13) we assume that z on the infinite part of the steepest descent path from \(-\infty \) to \(z_3\) corresponds to a nonpositive t-value, and that z on the finite part from \(z_3\) to \(-\frac{1}{2}\textrm{i}\) with a nonnegative t-value. Similarly for the integral through \(z_2\): the points z on the part from \(-\frac{1}{2}\textrm{i}\) to \(z_2\) correspond to \(t\le 0\), the points z on the part from \(z_2\) to \(+\infty \) correspond to \(t\ge 0\). In this way there is a one-to-one relation of points \(z\in \Gamma ^{(*)}\) and \(t\in {\mathbb {R}}\). Observe that on the paths of steepest decent the quantities \(g(z)-g(z^{*})\) are real and non-negative and their graphs are convex, like the graph of \(\frac{1}{2}t^2\) on \({\mathbb {R}}\).
Substituting the Taylor expansion \(\displaystyle {f^{(*)}(t)=\sum _{k=0}^\infty a^{(*)}_k t^k}\), the asymptotic expansion becomes
where the odd k terms disappear as we are integrating an odd integrand over the real line. The integrals now look like Gamma integrals of the form
Let \(y = \omega t^{2}/2\), \(\left( y^{-1/2}/\sqrt{2 \omega } \right) \textrm{d}y = \textrm{d}t \) and \(t^{2k} = \left( 2y / \omega \right) ^{k}\), then
where \(\left( \frac{1}{2} \right) _{k}\) is the Pochhammer symbol, \(\left( x\right) _{n} = \Gamma (x+n)/ \Gamma (x)\). By substituting (4.16) into (4.15) we obtain
as \(\omega \rightarrow \infty \), and large \(n=c_{n,\omega }\omega \) such that \(c_{n,\omega }\ge \frac{1}{4}+\delta \), with \(\delta \) a small positive number. The next step is to determine the \(a_{2k}\)’s, the full derivation can be found in Sect. B.1. The first coefficients have the form
Substituting into (4.17) we obtain the first-order approximation
4.2.2 Asymptotic Expansion
To complete the asymptotic expansion, we substitute (4.10), (4.11) and \(\beta = \alpha /\omega \) into the integrand in (4.18) and expand for large \(\omega \). This yields
It follows from the first term in (4.19) that, as \(x_{0}\) increases, \(\textrm{e}^{-\omega g(z_{3})}\) gets smaller than \(\textrm{e}^{-\omega g(z_{2})}\). Substituting the terms inside (4.18), the following asymptotic expansion holds for \( c_{n,\omega }= n/\omega > \frac{1}{4}\), \(n \ge 0\), \(\alpha ,x_{0},\omega \ge 0\),
The final line of the above equation vividly demonstrates that our asymptotics exhibit exponential decay. This can also be seen in Appendix B.2 where we find a bound for \(|b_{n}(\omega )|\) which leads to the result in (4.21). All that remains is to consider the \(n<0\) and \(\omega <0\) cases.
4.3 Reduction to the Case \(n \ge 0 \) and \(\omega > 0 \)
At the outset we have assumed \(n,\omega >0\), in this subsection we demonstrate that this can be extended to \(n\in {\mathbb {R}}\), \(\omega \in {\mathbb {R}}\).
4.3.1 A symmetry Argument
Noting that \(\varphi _{-n-1} (x)= (-\textrm{i})^{2 n + 1} \varphi _{n}(- x)\) for all \(n \in {\mathbb {Z}}\), consider
Let \(z' = -z\), \(\textrm{d}z' = - \textrm{d}z\). Relabelling, we obtain
Using this symmetry we deduce the results for \(n \le -1, \omega \le 0\) from \(n \ge 0, \omega \ge 0\) for all \( \ n \in {\mathbb {Z}}\).
4.3.2 Laguerre Coefficients Argument
We can further reduce our range by considering the case where \(n \le -1, \omega > 0\) by using the relationship of MT functions to Laguerre polynomials, considered in [12]. Taking an inverse Fourier transform of our wave packet
It is proved in [12] that for \(n\le -1\) the MT functions are
Therefore, substituting into the expression for the coefficients in terms of \(\textrm{e}^{\textrm{i}\omega x -\alpha ( z-x_{0})^2}\) and \(\varphi _n\), we get
By Parseval’s identity, we deduce that
As both \(\textrm{L}_{n}(\xi ) \textrm{e}^{-\xi /2}\) and \(\textrm{e}^{\textrm{i}x_{0}(\xi - \omega )} \) have \(\textrm{L}_2({\mathbb {R}})\) unit norm, by Cauchy–Schwartz,
Now, if \(\omega \ge 0\) then \(\textrm{e}^{-\xi \omega } < 1\). Finally, \(\int _{0}^{\infty } \textrm{e}^{-\xi ^{2}/(2\alpha )}\textrm{d}\xi = \sqrt{\pi \alpha /2}\) leads to
Thus by the symmetry argument in (4.20), this result holds for \(\omega \le 0, n \ge 0\).
As we have shown in our crude bound above, the coefficients for \(b_{n}(\omega )\) are small for \(\omega \ge 0\) and \(n \le -1\). Moreover, we can use the above symmetry argument to evaluate \(b_{n}(-\omega )\).
Finally, tying everything together, we formulate our estimate for \(a_{n}(\omega )\) in Eq. (4.2) throughout the entire range of parameters.
Theorem 4.2
(Malmquist–Takenaka function) Suppose that \(f(z) = \textrm{e}^{-\alpha ( z-x_{0})^{2}}\cos (\omega z)\), where \(\alpha >0\), while \(x_{0}\) and \(\omega \) are real. The asymptotic behaviour of the coefficients, \(a_{n}\), with Malmquist–Takenaka basis functions
is subject to the estimate
where for \(\omega \ge 0\)
The bound in (4.21) is valid uniformly for n satisfying \(1/4 \le \left| \frac{n}{\omega }\right| \le c\) for any given constant c. The remaining case, \(\omega <0\), can be found by applying the symmetry argument in (4.20). These bounds hold as \(|\omega |\rightarrow \infty \).
Remark 4.3
For practical purposes, the values of n satisfying \(1/4 \le \left| \frac{n}{\omega }\right| \le c\) are the only values of n we care about because, with reasonable assumptions on the size of c, we can ensure that \(|a_{n}| < \varepsilon \) for n outside this range.
Figs. 21 and 22 depict the asymptotic bound in (4.21). We see that they capture exceedingly well the characteristics of the exact coefficients and can observe asymptotic spectral decay.
Notes
\(f^{(*)}(t)\) denotes \(f^{(2)}(t)\) or \(f^{(3)}(t)\), \(z^{*}\) denotes \(z_2\) or \(z_3\). Similar notation will be used throughout.
References
Boyd, J.P.: Spectral methods using rational basis functions on an infinite Interval. J. Comput. Phys. 69(1), 112–142 (1987)
Boyd, J.P.: Chebyshev and Fourier Spectral Methods, 2nd edn. Dover Publications Inc, Mineola, NY (2001)
Christov, C.I.: A Complete Orthonormal System of Functions in \(\text{ L}^{2}(-\infty,\infty )\) space. SIAM J. Appl. Math. 42(6), 1337–1344 (1982)
de Bruijn, N.G.: Asymptotic Methods in Analysis. Bibliotheca mathematica. Dover Publications, Mineola (1981)
Dietert, H., Iserles, A.: Fast approximation on the real line. Technical Report NA02, DAMTP, University of Cambridge (2017)
Feit, M.D., Fleck, J.A., Jr., Steiger, A.: Solution of the Schrödinger equation by a spectral method. J. Comput. Phys. 47(3), 412–433 (1982)
Flajolet, P., Sedgewick, R.: Analytic Combinatorics. Cambridge University Press, Cambridge (2009)
Hagedorn, G.A.: Semiclassical quantum mechanics. I. the \(\hbar \rightarrow 0\) limit for coherent states. Commun. Math. Phys. 71(1), 77–93 (1980)
Hairer, E., Iserles, A.: Numerical stability in the presence of variable coefficients. Found. Comput. Math. 16(3), 751–777 (2016)
Heller, E.J.: Time dependent variational approach to semiclassical dynamics. J. Chem. Phys. 64(1), 63–73 (1976)
Iserles, A.: On skew-symmetric differentiation matrices. IMA J. Numer. Anal. 34(2), 435–451 (2014)
Iserles, A., Webb, M.: A family of orthogonal rational functions and other orthogonal systems with a skew-Hermitian differentiation matrix. J. Fourier Anal. Appl. (2019)
Iserles, A., Webb, M.: Orthogonal systems with a skew-symmetric differentiation matrix. Found. Comput. Math. 19(6), 1191–1221 (2019)
Iserles, A., Webb, M.: A differential analogue of Favard’s theorem. In: From Operator Theory to Orthogonal Polynomials, Combinatorics, and Number Theory, pp. 239–263. Springer (2021)
Iserles, A., Webb, M.: Fast computation of orthogonal systems with a skew-symmetric differentiation matrix. Commun. Pure Appl. Math. 74(3), 478–506 (2021)
Jin, S., Markowich, P., Sparber, C.: Mathematical and computational methods for semiclassical Schrödinger equations. Acta Numer 20, 121–209 (2011)
Lasser, C., Lubich, C.: Computing quantum dynamics in the semiclassical regime. Acta Numer 29, 229–401 (2020)
Lubich, C.: From Quantum to Classical Molecular Dynamics: Reduced Models and Numerical Analysis. European Mathematical Society (2008)
Malmquist, F.: Sur la détermination d’une classe de fonctions analytiques par leurs valeurs dans un ensemble donné de points. Comptes Rendus du Sixieme Congres des mathematiciens scandinaves (Kopenhagen), pp. 253–259 (1925)
Olver, F.W.J.: Uniform asymptotic expansions for Weber parabolic cylinder functions of large orders. J. Res. Nat. Bur. Standards Sect. B, 63 (1959)
Olver, F.W.J.: Asymptotics and Special Functions. AK Peters/CRC Press, London (1997)
Olver, F.W.J., Lozier, D.W., Boisvert, R.F., Clark, C.W. (eds.) NIST Handbook of Mathematical Functions. U.S. Department of Commerce, National Institute of Standards and Technology, Cambridge University Press, Cambridge, Washington, DC, 2010. With 1 CD-ROM (Windows, Macintosh and UNIX)
Olver, F.W.J., Lozier, D.W., Boisvert, R.F., Clark, C.W.: NIST Handbook of Mathematical Functions Hhardback and CD-ROM. Cambridge University Press, Cambridge (2010)
Takenaka, S.: On the orthogonal functions and a new formula of interpolation. In: Japanese Journal of Mathematics: Transactions and Abstracts, volume 2, pp. 129–145. The Mathematical Society of Japan (1925)
Temme, N.M.: Asymptotic Methods for Integrals, vol. 6. World Scientific, Singapore (2014)
Weideman, J.A.C.: Computation of the complex error function. SIAM J. Numer. Anal. 31(5), 1497–1518 (1994)
Weideman, J.A.C.: Computing the Hilbert transform on the real line. Math. Comput. 64(210), 745–762 (1995)
Wiener, N.: Extrapolation, interpolation, and smoothing of stationary time series: with engineering applications. Principles of electrical engineering series. Technology Press, Cambridge, MA (1949)
Wong, R.: Asymptotic Approximations of Integrals, vol. 34. SIAM (2001)
Acknowledgements
KL wishes to acknowledge UK Engineering and Physical Sciences Research Council (EPSRC) for the grant EP/L016516/1 for the University of Cambridge Centre for Doctoral Training, the Cambridge Centre for Analysis, which supported the research and writing of this paper. KL is also grateful to the Department of Computer Science, KU Leuven for their hospitality during a visit where some of the research for this paper was done. MW thanks FWO Research Foundation Flanders for the postdoctoral research fellowship he enjoyed while some of the research for this paper was done. MW also thanks the Polish National Science Centre (SONATA-BIS-9), project no. 2019/34/E/ST1/00390, for the funding that supported the research. This work is partially supported by the Simons Foundation Award No 663281 granted to the Institute of Mathematics of the Polish Academy of Sciences for the years 2021–2023.
The authors would like to thank Bruno Salvy and André Weideman for helpful correspondence on the steepest descent method, and are especially grateful to Nico Temme for the helpful discussions on Hermite functions and the method of steepest descent which helped simplify the section on the Malmquist–Takenaka functions.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Karlheinz Groechenig.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Hermite
1.1 Different values of \(\alpha \)
Starting from Eq. (3.5), it is helpful to define
which satisfies
and distinguish between the cases \(\alpha = \tfrac{1}{2}\) and \(\alpha \ne \tfrac{1}{2}\).
Case I \(\alpha =\frac{1}{2}\). We need to use (3.4), namely
Substituting this into Eq. (A.2) yields
Case II \(\alpha >\frac{1}{2}\). Using Eqs. (3.3) and (A.1), we obtain,
where
Note that \(Y\gg X\) in the high frequency regime.
Case III \(\alpha <\frac{1}{2}\). We obtain,
1.2 \(\alpha > 1/2\): Asymptotics
The terms \(w_{-}\phi ''(w_{-})\) and \(\phi (w_{-})\) in term of X and Y are
where
Substituting these terms into (3.14a) and (3.14b),
We need to be careful when choosing which large parameter we wish to expand around. Recalling that
while \(\alpha \) and \(x_{0}\) are fixed variables, we want to preserve the relationship between our large parameters n and \(\omega \) so
This agrees with our numerical experiment seen in Fig. 12.
We start by expanding (A.3a), substituting \(n =c_{n,\omega } \omega ^{2} \):
Substituting terms back into (A.3a), we obtain
Evaluating B in (A.3b), by the Sterling formula,
Thus, B is estimated by
Setting \(n = c_{n,\omega } \omega ^{2}\) and expanding about \(\omega \),
Assembling all this,
This results in
Restoring \(c_{n,\omega } = n/\omega ^{2}\) and recalling that \(a_{n}(\omega ) \sim {\textrm{Re}}\,{\tilde{a}}_{n}(\omega ),\) we obtain
1.3 \(\alpha < 1/2\): Asymptotics
Take \(\zeta = (Y - \textrm{i}X)/\nu \) and let our contour of integration pass through the root \(w_{+}\),
where
Just like in \(\alpha > 1/2\), numerical experiments show that the relationship between n and \(\omega \) follows
See Fig. 23.
Here we return to Sect. 3.2.4 where we observe that for \(\alpha < 1/2\), our asymptotics require us to consider \(c_{n,\omega }\ne c_{n,\omega }^{*}\), where
1.3.1 \(c_{n,\omega }\ne c_{n,\omega }^{*}\): Full Derivation
Commencing from Eq. (3.17), substituting \(n = c_{n,\omega }\omega ^{2}\) and expanding about \(\omega \), we get
We substitute this back into (3.17). Now, moving on to the asymptotics of B, expanding about \(\omega \) having substituted \(n=c_{n,\omega }\omega ^{2}\),
Combining the two results,
which we substitute back into B. Bringing all the terms together,
Malmquist–Takenaka
1.1 Determining \(a_{2k}\)’s
The final step is to determine the coefficients \(a^{(*)}_{2k}\). To do this, we first need to find the coefficients of the expansion near the saddle points,
They can be obtained from the transformations in (4.13), where we rewrite the transformations as values of z near the saddle points
or, taking the square root,
This equation can be inverted by using standard inversion methods. Substituting the expansions given in (B.1) and collecting equal powers of t, we can find the coefficients \(b_k^{(*)}\) of (B.1), the first being
Here and in (B.2) we have taken the positive square root, this comes from the expansion in (B.1), in fact \(z-z^{*}\sim b_1^{(*)} t\), and we have assumed conditions as explained below (4.14).
More coefficients can be obtained by using computer algebra. The next ones are
To obtain \(a_{2k}\)’s we use (4.14) and (B.1)
Equating powers of t we obtain the first coefficients of the expansion in (4.17), which are
1.2 Bounding \(|b_{n}(\omega )|\) for \(\omega \ge 0\) and \(n\ge 0\)
The result obtained in (4.21) is not obvious from the asymptotic expansions in Sect. 4.2.2 as they require further manipulation. In this section, we walk through the steps taken to bound (4.18).
Applying the triangle inequality to (4.18) results in
Following the same expansion steps as described in Sect. 4.2.2, we obtain asymptotics that would help us bound \(|b_{n}(\omega )|\),
Substituting the above terms into (B.3), separating each term into real and imaginary parts, and using the definition of the \(\textrm{L}_{2}\) norm \(|x+ \textrm{i}y| = \sqrt{x^{2} + y^{2}}\) we obtain the following bounds
From this point we can see that we obtain the bound in (4.21).
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Iserles, A., Luong, K. & Webb, M. Approximation of Wave Packets on the Real Line. Constr Approx 58, 199–250 (2023). https://doi.org/10.1007/s00365-022-09605-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00365-022-09605-1
Keywords
- Orthogonal systems
- Orthogonal rational functions
- Fast Fourier transform
- Malmquist–Takenaka system
- Steepest descent
- Asymptotic approximation
- Wave packet