Abstract
We investigate the approximation formulas that were proposed by Tanaka & Sugihara (IMA J. Numer. Anal. 39(4):1957–1984, 2019), in weighted Hardy spaces, which are analytic function spaces with certain asymptotic decay. Under the criterion of minimum worst error of n-point approximation formulas, we demonstrate that the formulas are nearly optimal. We also obtain the upper bounds of the approximation errors that coincide with the existing heuristic bounds in asymptotic order by a duality theorem for the minimization problem of potential energy.
1 Introduction
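Building on the arguments of [23], Tanaka & Sugihara [24] proposed an algorithm to design accurate approximation formulas in function spaces called weighted Hardy spaces, defined by
$$\begin{aligned} {\mathbb {H}}^\infty ({\mathcal {D}}_d, w):=\left\{ f:{\mathcal {D}}_d\rightarrow {\mathbb {C}}\ \Big |\ f \text { is analytic on } {\mathcal {D}}_d \text { and } \sup _{z\in {\mathcal {D}}_d}\left| \frac{f(z)}{w(z)}\right| <\infty \right\} , \qquad (1) \end{aligned}$$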
where \(d > 0\), \({\mathcal {D}}_d:=\{z\in {\mathbb {C}}\mid |\mathop {\textrm{Im}}z|<d\}\), and w is a weight function characterized later in Sect. 2.1. The spaces \({\mathbb {H}}^\infty ({\mathcal {D}}_d, w)\) are often regarded as spaces of transformed functions for the widely used sinc approximation formulas shown later in (2). The objective of [23] and [24] was to provide formulas that outperform the sinc formulas. However, although their methods were observed to be superior to the sinc approximation formulas, those studies provided only heuristic analyses of the proposed formulas, without theoretical guarantees. In this study, we mathematically
(1) prove near optimality of the formulas, and
(2) provide a general upper bound of the errors of the proposed formulas and show that the bound coincides in asymptotic order with the heuristic bound derived by [23].
Below we describe the background of this study more precisely. The spaces \({\mathbb {H}}^\infty ({\mathcal {D}}_d, w)\) appear in the literature as spaces of variable-transformed functions [18, 19, 21, 25]. For example, the double exponential (DE) transform, which is widely used in numerical analysis [22], has the form
and shows a double-exponential decay. The TANH transform \(g(\tanh (x/2))\) is also commonly used [2, 15]. These variable transformations are employed for the accurate approximation of functions: they yield functions with rapid decay on \({\mathcal {D}}_d\), which enables us to neglect the values of the functions for large |x|. This motivates us to analyze how well functions can be approximated over weighted Hardy spaces with general weight functions w. After Sugihara [21] demonstrated near optimality of sinc approximation formulas
for several weight functions w, attempts to construct an optimal formula for general weight functions began to appear in the literature.
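The effect of these variable transformations on decay can be illustrated numerically. The following is a minimal sketch, not taken from the paper: the test function `g` is hypothetical, and the DE transform is assumed to have the common form \(g(\tanh ((\pi /2)\sinh x))\) of Takahasi & Mori [22], since its display is not reproduced above. For this particular `g`, both transformed functions have closed forms, which avoids cancellation for large |x|.

```python
import math

def g(t):
    # Hypothetical test function on (-1, 1) that vanishes at both endpoints.
    return 1.0 - t * t

def f_tanh(x):
    # TANH transform g(tanh(x/2)); for this g it equals sech(x/2)^2.
    return 1.0 / math.cosh(x / 2.0) ** 2

def f_de(x):
    # DE transform g(tanh((pi/2) sinh x)) (assumed form); for this g it
    # equals sech((pi/2) sinh x)^2.
    return 1.0 / math.cosh(0.5 * math.pi * math.sinh(x)) ** 2

for x in (0.0, 1.0, 2.0, 3.0, 4.0):
    print(f"x = {x:3.1f}   TANH: {f_tanh(x):.3e}   DE: {f_de(x):.3e}")
```

The TANH-transformed function decays like a single exponential, while the DE-transformed one decays double-exponentially, so far fewer sampling points at large |x| carry non-negligible values.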
For this purpose, Tanaka et al. [23] employed potential theoretical arguments to generate sampling points for the approximation of functions. Furthermore, Tanaka & Sugihara [24] simplified the arguments and proposed accurate formulas \(L_n[a^{*};f](x)\) given later by (6) with special sets \(a^{*}\) of sampling points. The formulas \(L_n[a^{*};f](x)\) outperform the sinc methods for functions \(f \in {\mathbb {H}}^\infty ({\mathcal {D}}_d, w)\). The authors showed that
where \(\Vert f \Vert \) is a norm of \(f \in {\mathbb {H}}^\infty ({\mathcal {D}}_d, w)\) and \(F^\textrm{D}_{K, Q}(n)\) is determined later in (13) by a “discrete” energy minimization problem. Furthermore, they considered the minimum worst error \(E_n^{\min }({\mathbb {H}}^\infty ({\mathcal {D}}_d, w))\) in (5) of n-point approximation formulas in \({\mathbb {H}}^\infty ({\mathcal {D}}_d, w)\) and evaluated it as
where \(F^\textrm{C}_{K, Q}(n)\) is determined later in (11) by a “continuous counterpart” of the above energy minimization problem. The following problems about the formula \(L_n[a^*;f](x)\) were left unsolved in [24].
(i) Since (the RHS of (4))\(\, \le \,\)(the LHS of (3)), the formula \(L_n[a^*;f](x)\) is assured of “near optimality” if \(F^\textrm{C}_{K, Q}(n)\) and \(F^\textrm{D}_{K, Q}(n)\) are close. However, their difference was not estimated.
(ii) To estimate the convergence rate of the error in the LHS of (3), we need to know how \(F^\textrm{D}_{K, Q}(n)\) depends on n. However, it was not known.
In this paper, we provide solutions to these problems. Our contributions (1) and (2) mentioned in the first paragraph of this section correspond to the solutions to problems (i) and (ii), respectively. More precisely, we show the following statements.
(1) We show an evaluation like
$$\begin{aligned} F_{K, Q}^\textrm{D}(n)\lesssim F_{K, Q}^\textrm{C}(n)\lesssim 2 F_{K, Q}^\textrm{D}(n). \end{aligned}$$
Its rigorous version is given by Theorem 23 in Sect. 2.3. The quantities \(F^\textrm{D}_{K, Q}(n)\) and \(F^\textrm{C}_{K, Q}(n)\) were obtained from the optimal solutions of the “discrete” energy minimization problem and its “continuous counterpart”, respectively. Therefore we construct a feasible solution for the latter using the optimal solution of the former to show this theorem.
(2) We show an inequality
$$\begin{aligned} \frac{F_{K, Q}^\textrm{C}(n)}{n}\ge \frac{Q(\alpha _n)}{2}, \end{aligned}$$
where \(Q(x) = - \log w(x)\) and \(\alpha _{n}\) is determined by a tractable inequality. Its details are given by Theorem 24 in Sect. 2.3. By combining this inequality, the above statement (1), and Inequality (3), we obtain explicit convergence rates of the proposed formulas. To show this theorem, we consider the dual problem of the “continuous” energy minimization problem and provide its feasible solution. For preparation, we present a primal-dual theory of the energy minimization problem in Sect. 4.
As a result, we explicitly obtain lower bounds of \(F_{K, Q}^\textrm{C}(n)\) and demonstrate that the rates of lower bounds coincide with those of heuristic bounds in [23].
The rest of this paper is organized as follows. In Sect. 2, we present a mathematical overview of the existing studies and describe our main results as mathematical statements. Section 3 describes the proof of the first result, i.e., Theorem 23. Section 4 contains general arguments, which introduce the concept of “positive semi-definite in measure”. Then, we show that the problem of interest is a special case of that concept and derive the duality theorem. The evaluations for the second result, described by Theorem 24, are given in Sect. 5. We compare the bounds with those in [23] in Sect. 6. Finally, we give concluding remarks in Sect. 7.
2 Mathematical preliminaries and main results
2.1 General settings
We first give some definitions and formulate the problem mathematically. Let \(d>0\) and define the strip region \({\mathcal {D}}_d:=\{z\in {\mathbb {C}}\mid |\mathop {\textrm{Im}}z|<d\}\). Throughout this paper, a weight function \(w:{\mathcal {D}}_d\rightarrow {\mathbb {C}}\) is supposed to satisfy the following conditions:
1. w is analytic and does not vanish over the domain \({\mathcal {D}}_d\) and takes values in (0, 1] on \({\mathbb {R}}\);
2. w satisfies \(\lim _{x\rightarrow \pm \infty }\int _{-d}^d|w(x+iy)|\,\textrm{d}y=0\) and \(\lim _{y\nearrow d}\int _{-\infty }^\infty (|w(x+iy)|+|w(x-iy)|)\,\textrm{d}x<\infty \);
3. \(\log w\) is strictly concave on \({\mathbb {R}}\).
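The real-line parts of these conditions are easy to check numerically for a candidate weight. The sketch below uses a hypothetical example, \(w(x)=\textrm{sech}^2(x/2)\), which is not taken from the paper; it tests that w takes values in (0, 1] on a grid (part of condition 1) and that \(\log w\) has a negative second difference (condition 3). Analyticity and the integral conditions on the strip are not tested here.

```python
import math

def w(x):
    # Hypothetical example weight: w(x) = sech(x/2)^2.
    return 1.0 / math.cosh(x / 2.0) ** 2

def log_w(x):
    # log w(x) = -2 log cosh(x/2), written directly for numerical stability.
    return -2.0 * math.log(math.cosh(x / 2.0))

def second_difference(func, x, h=1e-3):
    # Central second difference, approximating func''(x).
    return (func(x + h) - 2.0 * func(x) + func(x - h)) / (h * h)

xs = [0.1 * k for k in range(-100, 101)]  # grid on [-10, 10]

# Condition 1 (real-line part): values in (0, 1].
values_in_range = all(0.0 < w(x) <= 1.0 for x in xs)

# Condition 3: strict concavity of log w, checked via second differences.
strictly_concave = all(second_difference(log_w, x) < 0.0 for x in xs)

print(values_in_range, strictly_concave)
```

For this weight, \((\log w)''(x)=-\tfrac{1}{2}\textrm{sech}^2(x/2)\), so the second difference at the origin should be close to \(-1/2\).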
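For a weight function with the above conditions, we define the weighted Hardy space \({\mathbb {H}}^\infty ({\mathcal {D}}_d, w)\) on \({\mathcal {D}}_d\) as in (1). We define
$$\begin{aligned} \Vert f\Vert :=\sup _{z\in {\mathcal {D}}_d}\left| \frac{f(z)}{w(z)}\right| \end{aligned}$$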
for \(f\in {\mathbb {H}}^\infty ({\mathcal {D}}_d,w)\), and the expression \(\Vert f\Vert <\infty \) shall also imply \(f\in {\mathbb {H}}^\infty (\mathcal {D}_{d}, w)\) in the following.
For an approximation formula over \({\mathbb {H}}^\infty ({\mathcal {D}}_d, w)\), an evaluation criterion needs to be defined. Based on [21] and [24], we adopt the minimum worst-case error
as the optimal performance over all possible n-point interpolation formulas on \({\mathbb {R}}\), which is applicable to any \(f\in {\mathbb {H}}^\infty ({\mathcal {D}}_d, w)\).
2.2 Properties of approximation formulas to be analyzed
Let us introduce some functions dependent on an n-sequence \(a=\{a_j\}_{j=1}^n\subset {\mathbb {R}}\) as follows.
Using these functions, we can give an n-point interpolation formula
which is known to characterize the value \(E_n^{\min }({\mathbb {H}}^\infty ({\mathcal {D}}_d, w))\) as follows.
Proposition 21
[21, 24] We have an upper bound of the error of (6) as
for any fixed sequence \(a=\{a_j\}_{j=1}^n\subset {\mathbb {R}}\) (of distinct points). Moreover, by taking infimum of the above expression over all n-sequences, it holds that
By this assertion, it is enough to consider interpolation formulas of the form (6). Additionally, this motivates us to analyze the value \(\sup _{x\in {\mathbb {R}}}|B_n(x;a,{\mathcal {D}}_d)w(x)|\), which is simpler than the worst-case error of (6). In [23] and [24],
is treated as an optimal value of an optimization problem (justifiable by the addition rule of \(\tanh \))
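where K and Q are defined by
$$\begin{aligned} K(x)&:=-\log \left| \tanh \left( \frac{\pi }{4d}x\right) \right| , \qquad (7)\\ Q(x)&:=-\log w(x). \qquad (8) \end{aligned}$$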
They considered a continuous relaxation of (DC) as
where we define \({\mathcal {M}}({\mathbb {R}}, n)\) as the set of all (positive) Borel measures \(\mu \) over \({\mathbb {R}}\) with \(\mu ({\mathbb {R}})=n\) and
Because each feasible solution of (DC) can be interpreted as a combination of \(\delta \)-measures being a feasible solution of (CT),
Potential theoretical arguments [5, 14, 24] lead to the following proposition.
Proposition 22
[24, Theorem 2.4, 2.5] The energy of \(\mu \in {\mathcal {M}}({\mathbb {R}}, n)\) is defined as
Then, there exists a unique minimizer \(\mu _n^*\) over \({\mathcal {M}}({\mathbb {R}}, n)\) of \(I_n^\textrm{C}(\mu )\) with a compact support and \(\mu _n^*\) is also an optimal solution of (CT). Furthermore, if we define
the optimal value of (CT) coincides with \(\displaystyle \frac{F_{K,Q}^\textrm{C}(n)}{n}\).
Following this proposition, Tanaka & Sugihara [24] considered a discrete counterpart of \(I^\textrm{C}_n(\mu )\) and \(F_{K,Q}^\textrm{C}\), which are defined for \(a=\{a_i\}_{i=1}^n\) (\(a_1<\cdots <a_n\)) as
where \(a^*=\{a_i^*\}_{i=1}^n\) is the unique minimizer of \(I_{K, Q}^\textrm{D}(a)\), which certainly exists according to Theorem 3.3 in [24]. We can easily obtain \(a^*\) numerically because it is a solution of a convex programming problem, and it is known to satisfy [24, Theorem 4.1]
Then \(E_n^{\min }({\mathbb {H}}^\infty ({\mathcal {D}}_d, w))\) is evaluated as [24, Remark 4.2]
Indeed, the left inequality holds true by (9) and Proposition 22 and the right inequality follows from (14). By this evaluation, we can consider \(L_n[a^*;f](x)\) as a nearly optimal approximation formula if \(F^\textrm{C}_{K, Q}(n)/n\) and \(F^\textrm{D}_{K, Q}(n)/(n-1)\) are sufficiently close.
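The link between \(\sup _{x}|B_n(x;a,{\mathcal {D}}_d)w(x)|\) and the potential \(\sum _j K(x-a_j)+Q(x)\) used throughout this subsection can be checked numerically. The following is a minimal sketch under stated assumptions: \(B_n\) is taken to be the product \(\prod _j\tanh (\frac{\pi }{4d}(x-a_j))\) (its display is not reproduced above), and the strip half-width `D`, the weight `w`, and the sampling points `a` are hypothetical choices.

```python
import math

D = 1.0  # hypothetical strip half-width d

def K(x):
    # K(x) = -log|tanh(pi x / (4d))|
    return -math.log(abs(math.tanh(math.pi * x / (4.0 * D))))

def w(x):
    # Hypothetical weight w(x) = sech(x/2)^2.
    return 1.0 / math.cosh(x / 2.0) ** 2

def Q(x):
    # Q(x) = -log w(x)
    return -math.log(w(x))

def B(x, a):
    # Assumed form of B_n(x; a, D_d): product of tanh(pi (x - a_j) / (4d)).
    prod = 1.0
    for aj in a:
        prod *= math.tanh(math.pi * (x - aj) / (4.0 * D))
    return prod

def potential(x, a):
    # Sum of pairwise terms K(x - a_j) plus the external field Q(x).
    return sum(K(x - aj) for aj in a) + Q(x)

a = [-1.5, -0.4, 0.3, 1.2]                         # hypothetical sampling points
xs = [-2.99 + 0.05 * k for k in range(120)]        # grid that avoids the a_j

# -log|B_n(x) w(x)| should coincide with the potential at every grid point.
max_gap = max(abs(-math.log(abs(B(x, a) * w(x))) - potential(x, a)) for x in xs)

# Hence maximizing |B_n w| is the same as minimizing the potential.
sup_val = max(abs(B(x, a) * w(x)) for x in xs)
min_pot = min(potential(x, a) for x in xs)
print(max_gap, sup_val, math.exp(-min_pot))
```

Taking \(-\log \) turns the product into a sum, which is why minimizing the supremum of \(|B_n w|\) over the points a can be studied as a potential-theoretic problem with kernel K and external field Q.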
2.3 Main results
In this paper, we demonstrate the following two theorems. The first and second theorems, respectively, correspond to (1) and (2) in Sect. 1.
Theorem 23
For \(n\ge 2\), the following holds true:
Theorem 24
Suppose w is even on \({\mathbb {R}}\). For \(\alpha _n>0\) that satisfies
we have
Theorem 23 shows the near optimality of the approximation formula \(L_n[a^*;f](x)\). In addition, Theorem 24 (combined with Theorem 23) gives an explicit upper bound of \(E_n^{\min }({\mathbb {H}}^\infty ({\mathcal {D}}_d, w))\). We describe these results by the following theorem.
Theorem 25
Let w be a weight function and let K and Q be given by (7) and (8), respectively. In addition, let \(a^*=\{a_i^*\}_{i=1}^n\) be the unique minimizer of \(I_{K, Q}^\textrm{D}(a)\) and let \(L_n[a^*;f](x)\) be the formula given by (6) with \(a = a^{*}\). Then, for arbitrary \(\varepsilon >0\), we have
for each sufficiently large n. In addition, we have
2.4 Basic ideas to show the main results
The left inequality of Theorem 23 is from Theorems 3.4 and 3.5 in [24]. To prove the right inequality of Theorem 23, we consider the optimization problem
whose solution provides \(F^\textrm{C}_{K, Q}(n)\) as shown in Proposition 22. The quantity \(F^\textrm{D}_{K, Q}(n)\) is obtained from the optimal solution of a discrete counterpart of (P) given by (12). Then, we construct a feasible solution of (P) given later by (16) from the optimal solution of the discrete counterpart. By using the feasible solution, we bound \(F^\textrm{C}_{K, Q}(n)\) from above by using \(F^\textrm{D}_{K, Q}(n)\).
To prove Theorem 24, we need a lower bound of the optimal value of (P). However, because (P) is a minimization problem, a concrete feasible solution only yields an upper bound and therefore does not help us. Therefore, we prove that (P) can be regarded as an infinite-dimensional convex quadratic programming problem, as K is positive semi-definite in measure (Definition 41), and take the dual problem [1, 6]. We also show that the dual problem
satisfies the weak and strong duality (Theorem 43), i.e., the optimal value of (D) coincides with that of (P). By this, we can obtain a lower bound for the optimal value of (P) by taking concrete \(\nu \) and s. The practical advantage of taking (D) is that \(\nu \) can be a signed measure (though we indeed deal with a little wider class in Sect. 4), which means that we can define \(\nu \) as some Fourier transform of a symmetric function without verifying its non-negativity. This resolves one of the non-rigorous points of the evaluation in [23].
Remark 1
Problem (D) in (15) needs to be more rigorously stated to realize a primal-dual theory for (P) and (D). In Sect. 4, we provide a rigorous form of (D) by introducing a set \({\mathcal {S}}_{K}\) for \(\nu \).
3 Proof of Theorem 23
To prove Theorem 23, we prepare the following lemmas.
Lemma 31
For arbitrary \(t>0\), the following holds true.
Proof
Consider the function \(g(x):=K(x)+\log \left( \frac{\pi }{4d}x\right) \) defined for \(x>0\). We first prove that g(x) is strictly increasing and satisfies \(\lim _{x\searrow 0}g(x)=0\). Let \(h(x):=\exp \left( g\left( \frac{2d}{\pi }x\right) \right) \). Then, we have
$$\begin{aligned} h(x)=\frac{x}{2\tanh (x/2)}=\frac{x(e^x+1)}{2(e^x-1)} \end{aligned}$$
and
$$\begin{aligned} h'(x)=\frac{e^{2x}-2xe^x-1}{2(e^x-1)^2}. \end{aligned}$$
Because \(e^{2x}-2xe^x-1\) vanishes at \(x=0\) and \((e^{2x}-2xe^x-1)'=2(e^{2x}-e^x-xe^x)=2e^x(e^x-1-x)>0\) holds for \(x>0\), we have \(h'(x)>0\) for \(x>0\). Evidently, we also have \(\lim _{x\searrow 0}h(x)=1\). Thus, g satisfies the above properties.
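Because g is positive and increasing, \(\int _0^1g(tx)\,\textrm{d}x\le g(t)\) is valid. Therefore, we have
$$\begin{aligned} \int _0^1K(tx)\,\textrm{d}x=\int _0^1g(tx)\,\textrm{d}x-\int _0^1\log \left( \frac{\pi }{4d}tx\right) \,\textrm{d}x\le g(t)-\log \left( \frac{\pi }{4d}t\right) +1=K(t)+1 \end{aligned}$$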
as desired. \(\square \)
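The two ingredients of this proof can be checked numerically. The sketch below is an illustration only: it assumes \(d=1\) (the constant `D` is a hypothetical choice) and tests the inequality \(\int _0^1K(tx)\,\textrm{d}x\le K(t)+1\), which is our reading of what the last step of the proof yields, with a simple midpoint rule.

```python
import math

D = 1.0                      # hypothetical strip half-width d
C = math.pi / (4.0 * D)

def K(x):
    # K(x) = -log tanh(pi x / (4d)) for x > 0
    return -math.log(math.tanh(C * x))

def g(x):
    # g(x) = K(x) + log(pi x / (4d)) = log(u / tanh(u)) with u = pi x / (4d)
    u = C * x
    return math.log(u / math.tanh(u))

def mean_K(t, n=20000):
    # Midpoint rule for int_0^1 K(t x) dx; the log singularity at 0 is integrable.
    return sum(K(t * (k + 0.5) / n) for k in range(n)) / n

# g should be strictly increasing on (0, infinity) and tend to 0 at the origin.
grid = [0.01 * k for k in range(1, 2001)]
g_increasing = all(g(grid[i]) < g(grid[i + 1]) for i in range(len(grid) - 1))

# Slack K(t) + 1 - int_0^1 K(t x) dx, expected to be positive for every t > 0.
slack = {t: K(t) + 1.0 - mean_K(t) for t in (0.1, 0.5, 1.0, 5.0, 20.0)}

print(g_increasing, g(1e-4), slack)
```

The slack is small for small t, where g is close to zero, and approaches 1 for large t, where K itself is negligible.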
Lemma 32
For arbitrary \(x>0\), the following holds true.
Proof
By the definition of K, the assertion follows from the fact that \(\tanh x \le 2\tanh \frac{x}{2}\). \(\square \)
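The elementary inequality behind this lemma is easy to confirm on a grid. The sketch below assumes \(d=1\) (a hypothetical choice) and also checks the consequence \(K(x/2)\le K(x)+\log 2\), which follows from \(\tanh x\le 2\tanh \frac{x}{2}\) by taking \(-\log \); this form is our reading, since the display of the lemma is not reproduced above.

```python
import math

D = 1.0  # hypothetical strip half-width d

def K(x):
    # K(x) = -log tanh(pi x / (4d)) for x > 0
    return -math.log(math.tanh(math.pi * x / (4.0 * D)))

xs = [0.001 * k for k in range(1, 40001)]  # grid on (0, 40]

# tanh(x) <= 2 tanh(x/2) for x > 0
tanh_ok = all(math.tanh(x) <= 2.0 * math.tanh(x / 2.0) for x in xs)

# Equivalent statement for the kernel: K(x/2) <= K(x) + log 2
K_ok = all(K(x / 2.0) <= K(x) + math.log(2.0) for x in xs)

print(tanh_ok, K_ok)
```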
We can now prove the first theorem.
Proof of Theorem 23
The left inequality is from Theorems 3.4 and 3.5 in [24].
Let us prove the right inequality. Let \(a=(a_1, \ldots , a_n)\) (with \(a_1<\cdots <a_n\)) be the minimizer of the discrete energy, satisfying
Let \(\mu \) be a measure with the density function p defined by
Then, we have
In the following, we obtain an upper bound of \(I_n^\textrm{C}(\mu )\). First, we evaluate \(\int _{\mathbb {R}}\int _{\mathbb {R}}K(x-y)\,\textrm{d}\mu (x)\,\textrm{d}\mu (y)\). For \(1\le k\le n-1\) and \(y\in [a_k, a_{k+1})\), we have
Here, because \(y\in [a_k, a_{k+1})\), for \(i\not \in \{k-1, k, k+1\}\), the convexity and monotonicity of K over \((-\infty , 0)\) or \((0, \infty )\) show that
Therefore, by considering that K is non-negative, we have
Here, the terms that include an index of a outside the domain \(\{1,\ldots ,n\}\) are void. Next, we consider the cases \(i=k\pm 1\). If \(k-1\ge 1\) is valid, we have
Similarly, if \(k+2\le n\) is valid, we have, by Lemma 31,
Finally, we deal with the case \(i=k\). We show that the integral
is maximized at \(y=\frac{a_k+a_{k+1}}{2}\) (over \(y\in [a_k, a_{k+1})\)). If we define \(t:=\frac{y-a_k}{a_{k+1}-a_k}\) (\(t\in [0, 1)\)), the following holds true.
For \(t<\frac{1}{2}\), we have
By symmetry, \(L_k(y)< L_k\left( \frac{a_k+a_{k+1}}{2}\right) \) is valid for \(t>\frac{1}{2}\). Therefore, by Lemmas 31 and 32,
Considering the sum of the right-hand side with respect to \(k=1,\ldots ,n-1\), the coefficient of each \(K(a_i-a_j)\) with \(|i-j|\ge 2\) is at most 1, and that of \(K(a_i-a_j)\) with \(|i-j|=1\) is at most 2 (\(=\frac{1}{2}+\frac{3}{2}\)), where we have distinguished \(K(a_i-a_j)\) from \(K(a_j-a_i)\). Therefore, we have
Let us now evaluate the second term of \(I_n^\textrm{C}(\mu )\), i.e., \(\int _{\mathbb {R}}Q(x)\,\textrm{d}\mu (x)\). By the convexity of Q, we have
To estimate the sum, we consider the following two cases:
1. Q is not monotone in \([a_{1}, a_{n}]\),
2. Q is monotone in \([a_{1}, a_{n}]\).
In the former case, the unique minimizer \(q^{*}\) of Q on \({\mathbb {R}}\) exists in \([a_{1}, a_{n}]\) because of the strict convexity. Then, by the strict convexity of Q, we have
where
Therefore
holds for some \(k \in \{1, \ldots , n\}\). In the latter case (case 2 above), we have equality (23) for \(k = 1\) or \(k = n\). Therefore, in both cases, the following holds true:
Combining (22) and (24), we obtain
Now, using (17), we reach the conclusion. \(\square \)
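The construction of a feasible measure from discrete points can be illustrated as follows. This is a sketch under explicit assumptions, not the paper's definition: the density (16) is not reproduced above, so we assume a piecewise-constant density that puts mass \(n/(n-1)\) uniformly on each interval \([a_k, a_{k+1})\), which at least gives \(\mu ({\mathbb {R}})=n\) as feasibility requires. The points `a` and the field `Q` are hypothetical.

```python
def Q(x):
    # Hypothetical convex external field.
    return x * x

a = [-2.0, -0.7, 0.1, 0.9, 2.3]   # hypothetical increasing points a_1 < ... < a_n
n = len(a)
MASS = n / (n - 1)                # assumed mass per gap, so the total mass is n

def density(x):
    # Assumed piecewise-constant density: uniform on each [a_k, a_{k+1}).
    for k in range(n - 1):
        if a[k] <= x < a[k + 1]:
            return MASS / (a[k + 1] - a[k])
    return 0.0

def integrate(func, lo, hi, m=2000):
    # Composite midpoint rule on [lo, hi].
    h = (hi - lo) / m
    return h * sum(func(lo + (j + 0.5) * h) for j in range(m))

# Total mass of the measure, expected to equal n.
total_mass = sum(integrate(density, a[k], a[k + 1]) for k in range(n - 1))

# Convexity of Q bounds the mean of Q on each gap by the mean of its endpoint values.
q_integral = sum(integrate(lambda x: Q(x) * density(x), a[k], a[k + 1])
                 for k in range(n - 1))
q_bound = MASS * sum(0.5 * (Q(a[k]) + Q(a[k + 1])) for k in range(n - 1))

print(total_mass, q_integral, q_bound)
```

Under this assumed density, the external-field term of the continuous energy is controlled by endpoint values of Q, which is the role convexity plays in the estimate of \(\int _{\mathbb {R}}Q(x)\,\textrm{d}\mu (x)\).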
4 Duality theorem for convex programming of measures
The following definition is a variant of the existing definitions of positive definite kernel [4, 17, 20].
Definition 41
Let X be a topological space. A non-negative measurable function \(k:X\times X\rightarrow {\mathbb {R}}_{\ge 0}\cup \{\infty \}\) is called positive semi-definite in measure if it satisfies
for arbitrary (positive) \(\sigma \)-finite Borel measures \(\mu , \nu \) on X.
Remark 2
Let k be positive semi-definite in measure. Considering the Hahn-Jordan decomposition of a signed measure, we have
for an arbitrary signed Borel measure \(\mu \) on X with \(|\mu |\) being \(\sigma \)-finite, where \(|\mu |\) denotes the total variation of \(\mu \). This is a generalization of the ordinary positive semi-definiteness. Notice that this non-negativity holds for a wider class of “measures”. Indeed, if we define
and for each \(\nu =(\nu _+,\nu _-)\in {\mathcal {S}}_k\) define
then this integral is well-defined and generalizes quadratic forms for ordinary signed measures. We formally write \(\nu =\nu _+-\nu _-\) in such a situation, and also call it the Hahn-Jordan decomposition of \(\nu \).
Lemma 42
Let \(K:{\mathbb {R}}\rightarrow {\mathbb {R}}_{\ge 0}\cup \{+\infty \}\) be an even function. If \(K\in L^1({\mathbb {R}})\) and K is convex on \([0, \infty )\), and satisfies \(\lim _{x\searrow 0}K(x)=K(0)\), then \(K(x-y)\) is positive semi-definite in measure.
Proof
Because K is integrable and convex, K is continuous over \((0, \infty )\) and \(\lim _{x\rightarrow \infty }K(x)=0\) holds true. If \(K(0)<\infty \), K becomes continuous and this type of function is called Pólya-type. A Pólya-type function is known to be the characteristic function of a positive bounded Borel measure, i.e., there exists a positive bounded measure \(\alpha \) on \({\mathbb {R}}\) such that
is valid [4, 12]. Let \(\mu \) be a signed Borel measure with \(\int _{\mathbb {R}}\int _{\mathbb {R}}K(x-y)\,\textrm{d}\mu (x)\,\textrm{d}\mu (y)\) being finite and \(|\mu |\) being \(\sigma \)-finite. Then, we can take a sequence of increasing Borel sets \(A_1\subset A_2\subset \cdots \rightarrow {\mathbb {R}}\) satisfying \(|\mu |(A_k)<\infty \) for all k. Let \(\mu =\mu _+-\mu _-\) be the Hahn-Jordan decomposition and \(\mu _+^k:=\mu _+(A_k\cap \cdot )\), \(\mu _-^k:=\mu _-(A_k\cap \cdot )\). For each k, by Fubini’s theorem and (26), we have
This can be rewritten as
The integrals in (27) are given by integrands that are monotone increasing with respect to k. Indeed, the first term of the left-hand side is written in the form
and its integrand \( 1_{A_k\times A_k}(x, y)K(x-y) \) is monotone increasing with respect to k because \(A_{1} \subset A_{2} \subset \cdots \). Similar arguments can be applied to the other terms. Therefore we get the desired inequality by letting \(k \rightarrow \infty \) and using the monotone convergence theorem in (27).
Let us consider the case \(K(0)=\infty \). In this case, K is continuous on \((0, \infty )\) and has a limit \(\lim _{x\searrow 0}K(x)\). For any \(\varepsilon >0\), define
Then, by \(K\in L^1({\mathbb {R}})\), \(K_\varepsilon \) is bounded everywhere by \(\varepsilon ^{-1}\Vert K\Vert _{L^1}\). Moreover, \(K_\varepsilon \) is still convex, so that \(K_\varepsilon (x-y)\) is positive semi-definite in measure. Now, the continuity of K leads to
by the monotone convergence theorem. Applying the monotone convergence theorem to both sides of (25) with \(K=K_\varepsilon \), we obtain the conclusion. \(\square \)
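For discrete signed measures, the property established in this lemma reduces to non-negativity of an ordinary quadratic form, as noted in Remark 2. The sketch below checks this for a hypothetical Pólya-type kernel, \(K(x)=e^{-|x|}\) (even, integrable, convex on \([0,\infty )\), finite at the origin); it is an illustration with random data, not part of the proof.

```python
import math
import random

def K(x):
    # Hypothetical Pólya-type kernel: even, integrable, convex on [0, inf).
    return math.exp(-abs(x))

def quadratic_form(points, weights):
    # sum_{i,j} c_i c_j K(x_i - x_j) for the signed measure sum_i c_i delta_{x_i}
    return sum(ci * cj * K(xi - xj)
               for xi, ci in zip(points, weights)
               for xj, cj in zip(points, weights))

rng = random.Random(0)  # fixed seed for reproducibility
forms = []
for _ in range(200):
    m = rng.randint(2, 12)
    pts = [rng.uniform(-5.0, 5.0) for _ in range(m)]
    wts = [rng.uniform(-1.0, 1.0) for _ in range(m)]   # signed weights
    forms.append(quadratic_form(pts, wts))

min_form = min(forms)
print(min_form)
```

Every sampled quadratic form is non-negative up to rounding, in line with the fact that this kernel is the Fourier transform of a positive measure.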
The function \(K=-\log \left| \tanh \left( \frac{\pi }{4d}\cdot \right) \right| \) satisfies the condition of Lemma 42. Thus, we can observe the optimization problem
as convex quadratic programming. By analogy with the finite-dimensional case in [1], we can formulate the dual problem as
Note that this is a rigorous version of problem (D) in (15). It should be noted here that we have not justified (D) as a formal (topologically) dual problem. Existing arguments are limited to the optimization of Radon measures over compact spaces [10, 11, 27]. While they are on quadratic programming problems, there exist more general theories on duality, such as [3], von Neumann’s minimax theorem [8, 16] and Fenchel-Rockafellar duality theorem [13, 26]. However, as it is essential that our duality can treat infinite measure \(\nu \) with unbounded support (we indeed later use such a measure as a dual feasible solution), it is difficult to simply apply existing studies and check all the conditions for (D) to be a topologically dual problem. Therefore, we do not go deeper into this aspect here, but just prove the assertion of Theorem 43. This assertion is sufficient to derive a lower bound of the optimal value of (P), which is our objective.
In the following, we demonstrate that the weak duality and strong duality are still valid in this infinite-dimensional primal-dual pair. It should be noted that \(s=0\), \(\nu \equiv 0\) is a trivial feasible solution of (D), so that the optimal value of (D) exists.
Theorem 43
The optimal value of (D) is equal to the optimal value of (P).
Proof
First, we present the weak duality. Let \(\mu \) and \((\nu , s)\) be feasible solutions of (P) and (D), respectively, and \(\nu =\nu _+-\nu _-\) be the Hahn-Jordan decomposition. If we write \(\langle \alpha , \beta \rangle _K:=\int _{\mathbb {R}}\int _{\mathbb {R}}K(x-y)\,\textrm{d}\alpha (x)\,\textrm{d}\beta (y)\) for measures \(\alpha \) and \(\beta \),
holds true. Because \(\langle \mu , \mu \rangle _K, \langle \nu _+, \nu _+ \rangle _K, \langle \nu _-, \nu _- \rangle _K<\infty \), we have \(\langle \mu , \nu _+ \rangle _K, \langle \mu , \nu _- \rangle _K, \langle \nu _+, \nu _- \rangle _K<\infty \) by K’s positive semi-definiteness in measure. Therefore, we have
by the positive semi-definiteness in measure. This indicates the weak duality. Note that we have the last inequality above by replacing \(\mu \) and \(\nu \) in (25) in Definition 41 with \(\mu + \nu _{+}\) and \(\nu _{+}\), respectively.
To prove the strong duality, we construct the optimal solution of (D) using that of (P). By Theorem 2.4 in [24], \(\mu ^*\), the optimal solution of (P), satisfies
for all \(x\in {\mathbb {R}}\). Now, the pair of \(\mu ^*\) and \(n^{-1}F^\textrm{C}_{K, Q}(n)\) is a feasible solution of (D). Moreover, the equation that we obtain by replacing the inequality of (28) with an equality is valid on the support of \(\mu ^*\). Therefore we have
This shows the strong duality. \(\square \)
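The weak-duality argument has a direct finite-dimensional analogue in the style of Dorn [1], which can be tested numerically. The sketch below is an assumption-laden toy: since (P) and (D) are not reproduced above, we assume the primal "minimize \(p^\top Gp+2q^\top p\) over \(p\ge 0\), \(\sum _ip_i=n\)" and the dual "maximize \(-v^\top Gv+2ns\) subject to \(Gv+q\ge s\)", with G a Gram matrix of the hypothetical kernel \(e^{-|x|}\). The normalization of the objective is our choice.

```python
import math
import random

def gram(points):
    # Gram matrix of the hypothetical positive semi-definite kernel exp(-|x - y|).
    return [[math.exp(-abs(x - y)) for y in points] for x in points]

def quad(G, u, v):
    # Bilinear form u^T G v.
    return sum(u[i] * G[i][j] * v[j] for i in range(len(u)) for j in range(len(v)))

rng = random.Random(1)  # fixed seed for reproducibility
pts = [rng.uniform(-3.0, 3.0) for _ in range(8)]
G = gram(pts)
q = [x * x for x in pts]   # hypothetical external field values Q(x_i)
n_mass = 4.0               # total mass constraint, playing the role of n

gaps = []
for _ in range(300):
    # Random primal feasible point: nonnegative weights with total mass n_mass.
    raw = [rng.random() for _ in pts]
    scale = n_mass / sum(raw)
    p = [scale * r for r in raw]
    primal = quad(G, p, p) + 2.0 * sum(qi * pi for qi, pi in zip(q, p))

    # Random dual feasible pair: signed v, and the largest s with G v + q >= s.
    v = [rng.uniform(-1.0, 1.0) for _ in pts]
    Gv = [sum(G[i][j] * v[j] for j in range(len(v))) for i in range(len(v))]
    s = min(gv + qi for gv, qi in zip(Gv, q))
    dual = -quad(G, v, v) + 2.0 * n_mass * s

    gaps.append(primal - dual)

min_gap = min(gaps)
print(min_gap)
```

In this toy setting the gap is bounded below by \((p-v)^\top G(p-v)\ge 0\), which mirrors the role of positive semi-definiteness in measure in the proof above; note that v is allowed to be signed, as \(\nu \) is in (D).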
5 Proof of Theorem 24
We can now give a lower bound of \(F_{K, Q}^\textrm{C}(n)\) by using the dual problem (D) and prove Theorem 24. Let \(\alpha >0\) be a constant and f be the inverse Fourier transform of
Along with this, f is \(L^2\)-integrable by Theorem 4.4 in [23]. Here, the Fourier transform of a function \(g\in L^1({\mathbb {R}})\cap L^2({\mathbb {R}})\) is defined by
and for the whole space \(L^2({\mathbb {R}})\), \({\mathcal {F}}[\cdot ]\) is defined as the continuous extension of \({\mathcal {F}}[\cdot ]|_{L^1\cap L^2}\). Because Q(x) is even by the assumption, f is an inverse Fourier transform of an even real function, so that f itself is an even real function. Then, the formula [9, p.43, 7.112]
leads to the (almost everywhere) equation
where \(K\in L^1({\mathbb {R}})\cap L^2({\mathbb {R}})\) and \(f\in L^2({\mathbb {R}})\) are used for the justification of the first equality. The former statement \(K\in L^1({\mathbb {R}})\cap L^2({\mathbb {R}})\) follows from
The integrability of \(K(x-\cdot )f(\cdot )\) comes from \(K, f\in L^2({\mathbb {R}})\). Indeed, we have
where the Cauchy-Schwarz inequality is used in the second inequality. Therefore the integrability is shown as follows:
where the Fubini theorem is used in the first equality. Considering the inverse Fourier transform of (29), we also have
It should be noted that \(f(x)\,\textrm{d}x \in {\mathcal {S}}_K\) follows from the inequality
These two relations imply that \((f(x)\,\textrm{d}x, Q(\alpha ))\) is a feasible solution of (D). We can now evaluate the value of the objective function of (D). Let us define
Because the first term can be considered as the inner product of \(K*f\) and f in \(L^2({\mathbb {R}})\), it can be computed through the Fourier transform as
Let \(G(\alpha )\) be the value of the right-hand side. \(G(\alpha )\) can be decomposed into two parts, which are defined as
and
We first evaluate \(G_1\). Because the function \(\omega /\tanh (d\omega )\) is monotonically increasing in \([0, \infty )\) (see the proof of Lemma 31), we have
Next, we similarly evaluate \(G_2\). By integration by parts, we get
Thus, we have
Finally, we reach the evaluation
By letting \(\alpha _n\) satisfy
we get \(nQ(\alpha _n)\) as a lower bound for the optimal value of (P). For such \(\alpha _n\), we finally have
and this is equivalent to the assertion of Theorem 24.
6 Examples of convergence rates for several Q(x)’s
Although the asymptotic rates given in [23, Section 4.3] are derived through mathematically informal arguments, we here demonstrate that those rates roughly coincide with the bound in Theorem 24.
Example 61
(The case w is a single exponential) Consider the case
for \(\beta >0\) and \(\rho \ge 1\). In this case, for a sufficiently large \(\alpha \) (satisfying \(\alpha \ge \rho \)), we have
and \(\alpha _n\) can be taken as
for sufficiently large n. This rate roughly coincides with (4.37) in [23].
Example 62
(The case w is a double exponential) Consider the case
for \(\beta ,\gamma >0\). In this case,
is valid. Let \(\alpha _n>0\) be such that the right-hand side is equal to n. Then, we have
where W is Lambert’s W function, i.e., the inverse of \(x\mapsto xe^x\). Using this, we get
This rate roughly coincides with the asymptotic order (4.44) in [23] for each fixed constant \(\gamma \).
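Lambert's W function appearing in this example has no elementary closed form, but it is straightforward to evaluate. The following is a minimal sketch of the principal branch for \(x\ge 0\) using Newton's method; the function name `lambert_w` and its parameters are our own, and the formula for \(\alpha _n\) itself is not reproduced here.

```python
import math

def lambert_w(x, tol=1e-14, max_iter=100):
    # Solve w * exp(w) = x for w >= 0 by Newton's method (sketch for x >= 0 only).
    if x < 0.0:
        raise ValueError("this sketch only covers x >= 0")
    # Starting value log(1 + x) satisfies w0 * exp(w0) >= x, so the Newton
    # iterates decrease monotonically toward the root of the convex function.
    w = math.log1p(x)
    for _ in range(max_iter):
        ew = math.exp(w)
        step = (w * ew - x) / (ew * (w + 1.0))
        w -= step
        if abs(step) <= tol * (1.0 + abs(w)):
            break
    return w

for x in (0.5, 1.0, math.e, 10.0, 1.0e6):
    wx = lambert_w(x)
    print(f"x = {x:.6g}   W(x) = {wx:.12f}   W e^W = {wx * math.exp(wx):.6g}")
```

Since \(W(x)\sim \log x-\log \log x\) as \(x\rightarrow \infty \), expressions involving W in this example grow only logarithmically in their argument.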
Remark 3
We choose the weight functions in Examples 61 and 62 for simplicity although they are not (necessarily) analytic in the strip region \({\mathcal {D}}_{d}\) for any \(d > 0\). This is because we just need their asymptotic properties for finding \(\alpha _{n}\).
7 Conclusion
In this study, we analyzed the approximation method proposed by [24] over weighted Hardy spaces \({\mathbb {H}}^\infty ({\mathcal {D}}_d, w)\). We provided (1) proof of the fact that the approximation formulas are nearly optimal from the viewpoint of minimum worst-case error \(E_n^{\min }({\mathbb {H}}^\infty ({\mathcal {D}}_d, w))\); and (2) upper bounds of \(E_n^{\min }({\mathbb {H}}^\infty ({\mathcal {D}}_d, w))\) to evaluate the convergence rates of approximation errors with \(n\rightarrow \infty \). To obtain (2), we introduced the concept “positive semi-definite in measure” and by using this, provided a lower bound for \(F_{K, Q}^\textrm{C}(n)\). We also compared the given bounds with those mentioned in the study by [23], and demonstrated that they have the same convergence rate with \(n\rightarrow \infty \).
The new bounds do not indicate that the approximation formulas in [24] are optimal. Another method to bound the error was recently considered by [7], although their bound does not show the optimality, either. We need tighter bounds to show the optimality, which may require more sophisticated analysis. We leave such analysis to future work.
References
1. Dorn, W.S.: Duality in quadratic programming. Q. Appl. Math. 18(2), 155–162 (1960)
2. Haber, S.: The tanh rule for numerical integration. SIAM J. Numer. Anal. 14(4), 668–685 (1977)
3. Isii, K.: Inequalities of the types of Chebyshev and Cramér-Rao and mathematical programming. Ann. Inst. Stat. Math. 16(1), 277–293 (1964)
4. Jaming, P., Matolcsi, M., Révész, S.G.: On the extremal rays of the cone of positive, positive definite functions. J. Fourier Anal. Appl. 15(4), 561–582 (2009)
5. Levin, A., Lubinsky, D.: Green equilibrium measures and representations of an external field. J. Approx. Theory 113(2), 298–323 (2001)
6. Luenberger, D.G.: Optimization by vector space methods. Wiley, New York (1997)
7. van Meurs, P., Tanaka, K.: Convergence rates for energies of interacting particles whose distribution spreads out as their number increases. ESAIM: COCV 29, 4 (2023). https://doi.org/10.1051/cocv/2022083
8. von Neumann, J.: Zur Theorie der Gesellschaftsspiele. Math. Ann. 100(1), 295–320 (1928)
9. Oberhettinger, F.: Tables of Fourier transforms and Fourier transforms of distributions. Springer, New York (1990)
10. Ohtsuka, M.: A generalization of duality theorem in the theory of linear programming. J. Sci. Hiroshima Univ. Ser. A-I Math. 30(1), 31–39 (1966)
11. Ohtsuka, M.: Generalized capacity and duality theorem in linear programming. J. Sci. Hiroshima Univ. Ser. A-I Math. 30(1), 45–56 (1966)
12. Pólya, G.: Remarks on characteristic functions. In: Proceedings of First Berkeley Conference on Mathematical Statistics and Probability, pp. 115–123. University of California Press, Berkeley (1949)
13. Rockafellar, R.T.: Extension of Fenchel’s duality theorem for convex functions. Duke Math. J. 33(1), 81–89 (1966)
14. Saff, E.B., Totik, V.: Logarithmic potentials with external fields, vol. 316. Springer, New York (1997)
15. Schwartz, C.: Numerical integration of analytic functions. J. Comput. Phys. 4(1), 19–29 (1969)
16. Sion, M.: On general minimax theorems. Pac. J. Math. 8(1), 171–176 (1958)
17. Sriperumbudur, B.K., Gretton, A., Fukumizu, K., Schölkopf, B., Lanckriet, G.R.: Hilbert space embeddings and metrics on probability measures. J. Mach. Learn. Res. 11, 1517–1561 (2010)
18. Stenger, F.: Numerical methods based on sinc and analytic functions, vol. 20. Springer, New York (1993)
19. Stenger, F.: Handbook of Sinc numerical methods. CRC Press, Boca Raton (2011)
20. Stewart, J.: Positive definite functions and generalizations, an historical survey. Rocky Mountain J. Math. 6(3), 409–434 (1976). https://doi.org/10.1216/RMJ-1976-6-3-409
21. Sugihara, M.: Near optimality of the sinc approximation. Math. Comput. 72(242), 767–786 (2003). https://doi.org/10.1090/S0025-5718-02-01451-5
22. Takahasi, H., Mori, M.: Double exponential formulas for numerical integration. Publ. Res. Inst. Math. Sci. 9(3), 721–741 (1974)
23. Tanaka, K., Okayama, T., Sugihara, M.: Potential theoretic approach to design of accurate formulas for function approximation in symmetric weighted Hardy spaces. IMA J. Numer. Anal. 37(2), 861–904 (2017)
24. Tanaka, K., Sugihara, M.: Design of accurate formulas for approximating functions in weighted Hardy spaces by discrete energy minimization. IMA J. Numer. Anal. 39(4), 1957–1984 (2019)
25. Tanaka, K., Sugihara, M., Murota, K.: Function classes for successful DE-Sinc approximations. Math. Comput. 78(267), 1553–1571 (2009)
26. Villani, C.: Topics in optimal transportation, vol. 58. American Mathematical Society, Providence, Rhode Island (2003)
27. Wu, S.: A cutting plane approach to solving quadratic infinite programs on measure spaces. J. Global Optim. 21(1), 67–87 (2001)
Acknowledgements
The authors are grateful to Ryunosuke Oshiro for his comment on signed measures. They also thank the anonymous reviewers for their valuable comments about this paper.
Funding
Open access funding provided by The University of Tokyo.
This study was supported by the Japan Society for the Promotion of Science with KAKENHI (17K14241 to K.T.).
Cite this article
Hayakawa, S., Tanaka, K. Convergence analysis of approximation formulas for analytic functions via duality for potential energy minimization. Japan J. Indust. Appl. Math. 41, 105–127 (2024). https://doi.org/10.1007/s13160-023-00588-5
DOI: https://doi.org/10.1007/s13160-023-00588-5