On the condition number of Vandermonde matrices with pairs of nearly-colliding nodes

Kunis, Stefan; Nagel, Dominik

doi:10.1007/s11075-020-00974-x

On the condition number of Vandermonde matrices with pairs of nearly-colliding nodes

Original Paper
Open access
Published: 21 July 2020

Volume 87, pages 473–496, (2021)
Cite this article

Download PDF

You have full access to this open access article

Numerical Algorithms Aims and scope Submit manuscript

On the condition number of Vandermonde matrices with pairs of nearly-colliding nodes

Download PDF

Stefan Kunis¹ &
Dominik Nagel¹

8 Citations
Explore all metrics

A Correction to this article was published on 03 October 2020

This article has been updated

Abstract

We prove upper and lower bounds for the spectral condition number of rectangular Vandermonde matrices with nodes on the complex unit circle. The nodes are “off the grid,” pairs of nodes nearly collide, and the studied condition number grows linearly with the inverse separation distance. Such growth rates are known in greater generality if all nodes collide or for groups of colliding nodes. For pairs of nodes, we provide reasonable sharp constants that are independent of the number of nodes as long as non-colliding nodes are well-separated.

On the Spectral Gap of a Square Distance Matrix

Article 09 December 2016

Hilbert-Schmidt Numerical Radius of a Pair of Operators

Article 04 December 2023

Rectangles of positive eigenvalues with positive eigenfunctions of nonlinear multiparameter coupled systems

Article 11 October 2021

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Vandermonde matrices with complex nodes appear in polynomial interpolation problems and many other fields of mathematics (see, e.g., the introduction of [2] and its references). In this paper, we are interested in rectangular Vandermonde matrices with nodes on the complex unit circle and with a large polynomial degree. These matrices generalize the classical discrete Fourier matrices to non-equispaced nodes and the involved polynomial degree is also called bandwidth. The condition number of those matrices has recently become important in the context of stability analysis of super-resolution algorithms like Prony’s method [6, 15], the matrix pencil method [12, 18], the ESPRIT algorithm [20, 21], and the MUSIC algorithm [17, 22]. If the nodes of such a Vandermonde matrix are all well-separated, with minimal separation distance greater than the inverse bandwidth, bounds on the condition number are established for example in [2, 5, 14, 18].

If nodes are nearly colliding, i.e., their distance is smaller than the inverse bandwidth, the behavior of the condition number is not yet fully understood. The seminal paper [9] coined the term (inverse) super-resolution factor for the product of the bandwidth and the separation distance of the nodes. For M nodes on a grid, the results in [7, 9] imply that the condition number grows like the super-resolution factor raised to the power of M − 1 if all nodes nearly collide. More recently, the practically relevant situation of groups of nearly colliding nodes was studied in [1, 4, 16, 19]. In different setups and oversimplifying a bit, all of these refinements are able to replace the exponent M − 1 by the smaller number m − 1, where m denotes the number of nodes that are in the largest group of nearly colliding nodes. The authors of [1, 19] focus on quite specific quantities in an optimization approach and in the so-called Prony mapping, respectively. In contrast, the condition number or the relevant smallest singular value of Vandermonde matrices with “off the grid” nodes on the unit circle is studied in [4, 16]. While [4] provided the exponent m − 1 for the first time, the proof technique leads to quite pessimistic constants and more restrictively asks all nodes (including the well-separated ones) to be within a tiny arc of the unit circle. More recently, the second version of [16] provided a quite general framework and reasonable sharp constants, but involves a technical condition which prevents the separation distance from going to zero for a fixed number of nodes and a fixed bandwidth.

Here, we present upper and lower bounds for the condition number of Vandermonde matrices with pairs of nearly colliding nodes, i.e., the special case m = 2. We achieve the expected linear order and all constants are reasonably sharp and absolute. In contrast to the more general quoted results [4, 16], the nodes can be placed on the full unit circle and the separation distance is allowed to approach zero. Our mild technical conditions, which seem to be artifacts of our proof technique, are:

(i)
A logarithmic growth in the separation distance of the well-separated nodes (which can be dropped at a price of a larger constant for the condition number estimate),
(ii)
A uniformity condition that colliding nodes behave similarly (they have the same separation distance up to a predefined constant), and
(iii)
An a priori upper bound on the separation distance of the colliding nodes.

The outline of this paper is as follows: Section 2 fixes the notation, recalls results for the case of well-separated nodes, and provides lower bounds for the condition number. In Section 3, we establish upper bounds for nodes that are well-separated from each other except for one pair of nodes that is nearly colliding. Section 4 goes one step further and studies the more general case where an arbitrary number of pairs of nodes nearly collide. Theoretical and numerical comparisons with [3, 4, 8, 16] can be found at the end of Section 4 and in Section 5.

2 Preliminaries

Let ${\mathbb {T}}:=\left \{ z\in {\mathbb {C}}\colon \left | z \right |=1 \right \}$ be the complex torus and nodes $\left \{ {z_{1},\dots ,z_{M}} \right \} \subset {\mathbb {T}}$ be parametrized by $z_{j}= \textnormal {e}^{-2\pi \textnormal {i}{{t_{j}}}}, j=1\dots ,M$, such that $t_{1}< \cdots < t_{M} \in \left [ 0,1 \right )$. We fix a degree $n \in {\mathbb {N}}$ so that N := 2n + 1 > M and set up the rectangular Vandermonde matrix:

$$ \begin{array}{@{}rcl@{}} A:= \begin{pmatrix} {z_{j}^{k}} \end{pmatrix}_{ \begin{array}{c}j{=}1,{\dots},M \\ \left| k \right|{\le} n \end{array}} = \begin{pmatrix} z_{1}^{-n}& {\cdots} & z_{1}^{-1} & 1 & {z_{1}^{1}} & {\cdots} & {z_{1}^{n}}\\ {\vdots} & & {\vdots} & & {\vdots} \\ z_{M}^{-n}& {\cdots} & z_{M}^{-1} & 1 & {z_{M}^{1}} & {\cdots} & {z_{M}^{n}} \end{pmatrix} \in {\mathbb{C}}^{M\times N}. \end{array} $$

(2.1)

The Dirichlet kernel $D_{n} \colon {\mathbb {R}}\rightarrow {\mathbb {R}}$ is given by:

$$ D_{n}(t):= \sum\limits_{k=-n}^{n}\textnormal{e}^{2\pi\textnormal{i}{kt}} = \begin{cases} N, & t\in {\mathbb{Z}},\\ \frac{\sin(N\pi t)}{\sin(\pi t)}, & \text{otherwise}, \end{cases} $$

(2.2)

so that

$$ K := AA^{*}= \begin{pmatrix} {D_n\left( t_{i}-t_{j}\right)} \end{pmatrix}_{i,j=1}^{M} \in {\mathbb{R}}^{M\times M}. $$

The matrix K is symmetric positive definite and the spectral condition number

$$ \text{cond}(A) := \frac{\sigma_{\max}(A)}{\sigma_{\min}(A)}=\sqrt{\left\Vert K\right\Vert\left\Vert K^{-1}\right\Vert} $$

is finite since all nodes are distinct (here and throughout the paper $\|K\|:=\sup \{\|Kx\|: \|x\|=1\}$ with $\|x\|^{2}:={\sum }_{k}|x_{k}|^{2}$). On the other hand, if two nodes are equal, then two rows of A are the same and by continuity the condition number diverges if two nodes collide. The (wrap around) distance of two nodes is given by:

$$ \left| t_{j}-t_{\ell} \right|_{{\mathbb{T}}}:= \underset{r\in {\mathbb{Z}}}{\min}\left| t_{j}-t_{\ell}+r \right|. $$

and we introduce the normalized separation distance of the node set as:

$$ \tau := N \underset{j\ne\ell}{\min}\left| t_{j}-t_{\ell} \right|_{{\mathbb{T}}}. $$

We call the case τ = 1 critical separation, i.e., $\min \limits _{j\ne \ell }\left | {t_{j}-t_{\ell }} \right | _{{\mathbb {T}}} = \frac {1}{N}$, and the cases τ ≤ 1 and τ > 1 nearly colliding and well-separated, respectively. Figure 1 illustrates the situation for 4 nodes on the unit circle. The parameter ρ_min describes a minimum separation distance of involved non-colliding nodes assumed in the theorems.

A reasonable result for well-separated nodes is as follows.

Theorem 2.1

[2, 18] Let A be a Vandermonde matrix as in (2.1) with τ > 1, then

$$ N\left( 1-\frac{1}{\tau}\right) \le \sigma_{\min}^{2}(A)\le N \le \sigma_{\max}^{2}(A)\le N\left( 1+\frac{1}{\tau}\right). $$

In particular, we have

$$ \textnormal{cond}(A)^{2} \le 1+ \frac{2}{\tau-1} $$

and this implies $\left \Vert K\right \Vert \le N + N/\tau $ and $\left \Vert K^{-1}\right \Vert =\left \Vert A^{\dagger }\right \Vert ^{2}\le (N - N/\tau )^{-1}$, where A^‡ := A^∗(AA^∗)^− 1 denotes the Moore-Penrose pseudo inverse of A.

We note in passing that the above lower bound on the smallest singular value is an improvement of [18] by [2] and that [18] and [8] allow to replace $\frac {1}{\tau }$ in the upper and the lower bounds by $\frac {1}{\tau }-\frac {1}{N}$, respectively. Moreover, we have the following lower bound on the condition number. This already shows that the upper bound for well-separated nodes is quite sharp and provides the benchmark for nearly colliding nodes.

Theorem 2.2 (Lower bound)

Let A be a Vandermonde matrix as in (2.1), then:

$$ \sigma_{\min}^{2}(A) \le N-\left| D_{n}(\tau/N) \right| \le N \le N+\left| D_{n}(\tau/N) \right|\le\sigma_{\max}^{2}(A). $$

In particular, we have:

$$ \textnormal{cond}(A)^{2} \ge 1+\frac{2}{\pi\tau-1} $$

for $\tau \in {\mathbb {N}}+\frac {1}{2}$, uniformly in N and almost matching the above upper bound.

For nearly colliding nodes, we have:

$$ \textnormal{cond}(A)^{2} \ge \frac{12}{\pi^{2}\tau^{2}}-1 \ge \frac{1}{\tau^{2}} $$

for $\tau \le \sqrt {12/\pi ^{2}-1}\approx 0.46$ and $\textnormal {cond}(A) \ge \sqrt {6}/\pi \tau \approx 0.77/\tau $ for all τ ≤ 1.

Proof

Without loss of generality, let t₂ − t₁ = τ/N and consider the upper left 2 × 2 block in:

$$ K= \begin{pmatrix} C & * \\ * & * \end{pmatrix}, \quad C:=\begin{pmatrix} {D_n\left( 0\right)} & {D_n\left( \tau/N\right)} \\ {D_n\left( \tau/N\right)} & {D_n\left( 0\right)} \end{pmatrix}. $$

We apply Lemma A.5, and get:

$$ \text{cond}(A)^{2}=\frac{\lambda_{\max}(K)}{\lambda_{\min}(K)}\ge \frac{\lambda_{\max}(C)}{\lambda_{\min}(C)} = \frac{D_n(0)+\left| D_n\left( \tau/N\right)\right|}{D_n(0)-\left| D_n\left( \tau/N\right)\right|} = 1+\frac{2\left| {D_n\left( \tau/N\right)} \right|}{N-\left| {D_n\left( \tau/N\right)} \right|}, $$

and Lemma A.1 yields the assertion. □

3 Nodes with one nearly colliding pair

Definition 3.1

Let M ≥ 2 and $0=t_{1}<\cdots <t_{M} \in \left [ 0,1 \right )$ such that:

$$ \begin{array}{@{}rcl@{}} \left| t_{1}-t_{2} \right|_{{\mathbb{T}}} &=& \frac{\tau}{N},\qquad \qquad\qquad\qquad0<\tau\le 1,\\ \left| t_{j}-t_{\ell} \right|_{{\mathbb{T}}} &\ge& \frac{\rho}{N}, j\ne \ell, \ell\ge 3, \qquad~1<\rho<\infty, \end{array} $$

then {t₁,…,t_M} is called a set of nodes with one nearly colliding pair; see Fig. 2 for an illustration. Due to periodicity, the choice t₁ = 0 and $\left | t_{1}-t_{2} \right |_{{\mathbb {T}}} = \frac {\tau }{N}$ is without loss of generality.

Now, we estimate an upper bound on the condition number of the Hermitian matrix K by bounding $\left \Vert {K}\right \Vert $ directly and applying Lemma A.4 to K^− 1 before bounding $\left \Vert K^{-1}\right \Vert $. For that, we introduce some notation for abbreviation.

Definition 3.2

We define $a_{1} := \begin {pmatrix} {z_{1}^{k}} \end {pmatrix}_{\left | k \right |\le n}\in {\mathbb {C}}^{1\times N}$ and $A_{2}:= \begin {pmatrix} {z_{j}^{k}} \end {pmatrix}_{ \begin {array}{c}j{=}2,{\dots },M\\ \left | k \right |{\le } n \end {array}}\in {\mathbb {C}}^{(M-1)\times N}$ so that with:

$$ \begin{array}{@{}rcl@{}} a_{1}a_{1}^{*}=N, \quad K_{2}:= A_{2}A_{2}^{*} \quad\text{and}\quad b:=A_{2} a_{1}^{*}=\begin{pmatrix} D_{n}(\tau/N)\\D_{n}(t_{3})\\ \vdots\\ D_{n}(t_{M}) \end{pmatrix}, \end{array} $$

(3.1)

we have the partitioning:

$$ \begin{array}{@{}rcl@{}} A=\begin{pmatrix} a_{1}\\ A_{2} \end{pmatrix} \quad \text{and} \quad K =\begin{pmatrix} N & b^{*} \\ b & K_{2} \end{pmatrix}, \end{array} $$

(3.2)

where A₂ is a Vandermonde matrix with nodes that are at least $\frac {\rho }{N}$ separated.

Lemma 3.3

Under the conditions of Definition 3.1 and for ρ ≥ 6, we have:

$$ \left\Vert K\right\Vert\le 2.3 N. $$

Proof

The key idea is to see the set of nodes as a union of two well-separated subsets and use the existing bounds for these. In contrast to the next chapter, here, one of the sets only consists of a single node. We start by noting that Theorem 2.1 and (3.1) yield $\left \Vert {b}\right \Vert ^{2}\le \left \Vert {a_{1}}\right \Vert ^{2} \left \Vert {A_{2}}\right \Vert ^{2}=N\left \Vert {K_{2}}\right \Vert $. Together with the decomposition (3.2), the triangle inequality, Lemma A.6, and Theorem 2.1, we obtain:

$$ \left\Vert K\right\Vert \le \left\Vert \begin{pmatrix} N&0\\0&K_{2} \end{pmatrix}\right\Vert +\left\Vert {\begin{pmatrix} 0&b^{*}\\b&0 \end{pmatrix}}\right\Vert \le\left\Vert {K_{2}}\right\Vert +\left\Vert {b}\right\Vert \le N\left( \frac{\rho+1}{\rho}+\sqrt{\frac{\rho+1}{\rho}}\right). $$

□

Lemma 3.4

Under the conditions of Definition 3.1 and with b as in (3.1), we have:

$$ b=K_{2} e_{1} +r, $$

where $e_{1}\in {\mathbb {R}}^{(M-1)}$ denotes the first unit vector and:

$$ \left\Vert r\right\Vert^{2} \le \left( N- {D_n\left( \tau/N\right)} \right)^{2} +N^{2}\tau^{2} \left( \frac{\pi^{4}}{12\rho^{2}}+\frac{1.21\pi}{\rho^{3}}+ \frac{\pi^{4}}{180\rho^{4}} \right). $$

Proof

The vector b can be approximated by the first column of K₂ in the sense that:

$$ b=\begin{pmatrix} {D_n\left( \tau/N\right)} \\ {D_n\left( t_{3}\right)} \\ \vdots\\ {D_n\left( t_{M}\right)} \end{pmatrix}=\begin{pmatrix} {D_n\left( 0\right)} \\ {D_n\left( t_{3}-\tau/N\right)} \\ \vdots\\ {D_n\left( t_{M}-\tau/N\right)} \end{pmatrix}+ \begin{pmatrix} r_{1} \\ \vdots\\ r_{M-1} \end{pmatrix}. $$

We have $\left | r_{1} \right | = N- {D_n\left (\tau /N\right )} $ and for j = 2,…,M − 1 the mean value theorem yields:

$$ \left| r_{j} \right| = \left| {D_n\left( t_{j+1}\right)} - {D_n\left( t_{j+1} - \tau/N\right)} \right| = \left| {D_n^{\prime}\left( \xi_{j}\right)} \right|\frac{\tau}{N},\! \!\quad \xi_{j}\!\in\! \left( \left| {t_{j+1} - \frac{\tau}{N}} \right| _{{\mathbb{T}}},\left| {t_{j+1}} \right| _{{\mathbb{T}}}\right). $$

Note that, in the worst case, half of the nodes can be as close as possible (under the assumed separation condition) to t₂ not only on its right but also on its left. Hence, for $j=2,\dots ,\left \lceil \frac {M}{2}\right \rceil $, $\xi _{j} \ge \frac {(j-1)\rho }{N}$ and Lemma A.1 lead to:

$$ \left| r_{j} \right|\le N\left( \frac{\pi}{2N\left| {\xi_{j}} \right| }+\frac{1}{2N^{2}\left| {\xi_{j}} \right| ^{2}}\right)\tau \le N\left( \frac{\pi}{2(j-1)\rho}+\frac{1}{2(j-1)^{2}\rho^{2}}\right)\tau. $$

Thus, for all nodes, we get:

$$ \sum\limits_{j=2}^{M-1}|r_{j}|^{2} \le 2 \sum\limits_{j=2}^{\lceil M/2\rceil}|r_{j}|^{2} \le N^{2}\tau^{2} \left( \frac{\pi^{2}}{2\rho^{2}} \underbrace{\sum\limits_{j=1}^{\infty}\frac{1}{j^{2}}}_{=\frac{\pi^{2}}{6}} + \frac{\pi}{\rho^{3}} \underbrace{\sum\limits_{j=1}^{\infty} \frac{1}{j^{3}}}_{\le 1.21} + \frac{1}{2\rho^{4}}\underbrace{\sum\limits_{j=1}^{\infty} \frac{1}{j^{4}}}_{=\frac{\pi^{4}}{90}} \right). $$

□

Lemma 3.5

Under the conditions of Definition 3.1 and for ρ ≥ 5, we have:

$$ \left\Vert K^{-1}\right\Vert\le \frac{C(\rho)}{N\tau^{2}}, $$

where

$$ C(\rho) = \left( \frac{2\rho-1}{\rho-1}+\sqrt{\frac{\rho}{\rho-1}} \right) \left[2- \frac{\rho}{\rho-1} \left( 1+\frac{\pi^{4}}{12\rho^{2}}+\frac{1.21\pi}{\rho^{3}}+\frac{\pi^{4}}{180\rho^{4}} \right)\right]^{-1}. $$

Proof

We consider K decomposed as in (3.2) and apply Lemma A.4 with respect to K₂ to obtain:

$$ K^{-1} = \begin{pmatrix} I&0\\-K_{2}^{-1}b&I \end{pmatrix} \begin{pmatrix} (N-b^{*}K_{2}^{-1}b)^{-1}& 0\\0&K_{2}^{-1} \end{pmatrix} \begin{pmatrix} I & -b^{*}K_{2}^{-1} \\ 0 & I \end{pmatrix} $$

and thus,

$$ \left\Vert K^{-1}\right\Vert \le \left\Vert \begin{pmatrix} I&0\\-K_{2}^{-1}b&I \end{pmatrix}\right\Vert^{2} \max\left\{\left\Vert {K_{2}^{-1}}\right\Vert,\left\Vert {\left( N-b^{*}K_{2}^{-1}b\right)^{-1}}\right\Vert \right\}. $$

First of all, we establish an upper bound for the norm of the triangular matrix. Equation (3.1) and Theorem 2.1 imply:

$$ \left\Vert K_{2}^{-1}b\right\Vert=\left\Vert (A_{2} A_{2}^{*})^{-1} A_{2} a_{1}^{*}\right\Vert\le \left\Vert A_{2}^{\dagger}\right\Vert\left\Vert a_{1}\right\Vert\le\sqrt{\frac{\rho}{\rho-1}}. $$

Together with Lemma A.6, we obtain:

$$ \begin{array}{@{}rcl@{}} \left\Vert \begin{pmatrix} I&0\\-K_{2}^{-1}b&I \end{pmatrix}\right\Vert^{2} \le 1+\left\Vert K_{2}^{-1}b\right\Vert+\left\Vert K_{2}^{-1}b\right\Vert^{2} \le \frac{2\rho-1}{\rho-1}+\sqrt{\frac{\rho}{\rho-1}}. \end{array} $$

(3.3)

The next step is to bound $(N-b^{*}K_{2}^{-1}b)^{-1}$. Lemma 3.4 yields:

$$ b^{*}K_{2}^{-1}b = (K_{2} e_{1}+ r)^{*} K_{2}^{-1} (K_{2} e_{1}+ r) = 2 D_{n}(\tau/N)-D_{n}(0)+r^{*}K_{2}^{-1}r. $$

Applying the second part of Lemma 3.4, Lemma A.1, and Theorem 2.1 yields:

$$ \begin{array}{@{}rcl@{}} N-b^{*}K_{2}^{-1}b &\ge& 2\left( N-D_{n}(\tau/N)\right) - \left\Vert r\right\Vert^{2}\left\Vert K_{2}^{-1}\right\Vert\\ &\ge& \!\left( N - D_{n}(\tau/N)\right) \!\left( 2 - \left( N-D_{n}(\tau/N)\right)\left\Vert {K_{2}^{-1}}\right\Vert \right)- \!\left\Vert {K_{2}^{-1}}\right\Vert \sum\limits_{j=2}^{M-1}\left| {r_{j}} \right| ^{2} \\ &\ge& N\tau^{2} \left( 2-N\left\Vert K_{2}^{-1}\right\Vert\right) -\|K_{2}^{-1}\| N^{2}\tau^{2} \left( \frac{\pi^{4}}{12\rho^{2}}+\frac{1.21\pi}{\rho^{3}}+\frac{\pi^{4}}{180\rho^{4}}\right) \\ &\ge& N\tau^{2} \left[2- \frac{\rho}{\rho-1} \left( 1+\frac{\pi^{4}}{12\rho^{2}}+\frac{1.21\pi}{\rho^{3}}+\frac{\pi^{4}}{180\rho^{4}} \right)\right]. \end{array} $$

For ρ ≥ 5, the most inner bracketed term takes values in (1,1.4) such that the square bracketed term is positive. Forming the reciprocal gives the result, since Theorem 2.1 also implies:

$$ \begin{array}{@{}rcl@{}} N\left\Vert K_{2}^{-1}\right\Vert\le \frac{\rho}{\rho-1}\le \frac{\rho-1}{\rho-2}\le \left[2- \frac{\rho}{\rho-1} \left( 1+\hdots\right)\right]^{-1}. \end{array} $$

(3.4)

□

Theorem 3.6 (Upper bound)

Under the conditions of Definition 3.1 with $\rho \ge \rho _{\min \limits }=6$, we have:

$$ \textnormal{cond}(A)\le \frac{4}{\tau}. $$

Proof

The bound follows from Lemmata 3.3 and 3.5 with C(ρ) ≤ C(6) ≤ 6.5. □

Lower and upper bounds in Theorems 2.2 and 3.6 yield:

$$ \frac{1}{\tau}\le\text{cond}(A)\le \frac{4}{\tau} $$

for τ ≤ 0.46 and 6 ≤ ρ. The condition on ρ implies that for specific configurations of M nodes, our result becomes effective as early as N ≈ 6M—this is in contrast to the results [4, 16], where N has to be much larger.

Remark 3.7 (Constants)

Some comments regarding what is lost during our proof:

(i)
The constant in Lemma 3.3 is a numerical value for all ρ ≥ 6, indeed the proof is valid for all values ρ > 1. The case M = 2 shows that Lemmata 3.3 and 3.4 are reasonably sharp since in this case $\left \Vert {K}\right \Vert =N+D_{n}(\tau /N)\ge N(2-\pi ^{2}\tau ^{2}/6)$ and $\left \Vert r\right \Vert =N-D_{n}(\tau /N)\ge N(2-\tau ^{2})$; see Lemma A.1 for the two inequalities.
(ii)
In Lemma 3.5, the constant C(ρ) is monotone decreasing in ρ; see also Fig. 3. It is bounded below by 3 which is due to the relatively crude norm estimate on the block triangular factors in the Schur complement decomposition. Note that the left-hand side in (3.3) is bounded from below by $1+\left \Vert K_{2}^{-1}b\right \Vert ^{2}$. An additional minor improvement on C(ρ) and on the range of admissible values for ρ can be achieved when applying Lemma A.1 to two factors simultaneously.

Remark 3.8 (Generalizations and limitations)

In principle, the suggested Schur complement technique can be generalized to more than two nodes colliding and also to the multivariate case:

(i)
Let M ≥ 3 and $0=t_{1}<\cdots <t_{M} \in \left [ 0,1 \right )$ be such that {t₁,t₂,t₃} nearly collide and decompose:
$$ \begin{array}{@{}rcl@{}} K &=&\begin{pmatrix} K_{1} & B^{*} \\ B & K_{2} \end{pmatrix},\quad K_{1}=\begin{pmatrix} N & {D_n\left( t_{1}-t_{2}\right)} \\ {D_n\left( t_{1}-t_{2}\right)} & N \end{pmatrix},\\ K_{2}&=&\begin{pmatrix} {D_n\left( t_{i}-t_{j}\right)} \end{pmatrix}_{i,j=3}^{M}. \end{array} $$

While it is clear that the Schur complement $K_{1}-B^{*} K_{2}^{-1} B$ is strictly positive definite, establishing a lower bound on its smallest singular value similar to the proofs of Lemmata 3.4 and 3.5 seems considerably harder. Already, the linear approximation in Lemma 3.4 then needs to be replaced by a higher order approximation for the matrix B.
(ii)
Consider the bivariate case and the Vandermonde matrix:
$$ A= \begin{pmatrix} z_{j}^{\gamma} \end{pmatrix}_{ \begin{array}{c}j{=}1,{\hdots},M \\ \left\Vert \gamma\right\Vert_{\infty} {\le} n \end{array}} \in {\mathbb{C}}^{M\times N^{2}}, $$
where $z_{j}=(x_{j}, y_{j})=(\textnormal {e}^{-2\pi \textnormal {i}{u_{j}}},\textnormal {e}^{-2\pi \textnormal {i}{v_{j}}}) \in {\mathbb {T}}^{2}$, $\gamma =(\alpha ,\beta )\in \mathbb {Z}^{2}$ is a multi-index, and $z_{j}^{\gamma } := x_{j}^{\alpha } \cdot y_{j}^{\beta }$. The distance of the nodes t_j = (u_j,v_j) ∈ [0,1)² is measured by $\left | {t_{j}-t_{\ell }} \right | _{{\mathbb {T}}}:= \min \limits _{r\in \mathbb {Z}^{2}}\left \Vert {t_{j}-t_{\ell }+r}\right \Vert _{\infty }$ and we consider the situation as in Definitions 3.1 and 3.2 with K = AA^∗. Lemma 3.4 can be proven using the bivariate mean value theorem to get $|r_{j}|\le N\tau \pi / \left | \xi _{j} \right |_{{\mathbb {T}}}$, j = 2,3,…,M, and the packing argument [14, Lem. 4.5] to get:
$$ \left\Vert r\right\Vert^{2} \le (N^{2}- D_{n}(u_{2})D_{n}(v_{2}))^{2} + \frac{12\pi^{2} N^{4}\tau^{2}}{(\rho-1)^{2}} \left( 1+\log\left\lceil{\sqrt{M/6}}\right\rceil\right). $$
We need additional assumptions for Lemma 3.5 to work since results for general well-separated nodes, cf. [15], seem to be too weak. If the nodes t₂,…,t_M are a subset of equispaced nodes in ${\mathbb {T}}^{2}$, then [14, Cor. 4.11] yields $\left \Vert {K_{2}^{-1}}\right \Vert \le (N-N/\rho )^{-2}$. Together with M ≥ 4 and $\rho \ge 4+2\log M$, this yields $\left \Vert K^{-1}\right \Vert \le 20 / N^{2}\tau ^{2}$.

4 Pairs of nearly colliding nodes

We now study the situation in which the Vandermonde matrix comes from pairs of nearly colliding nodes.

Definition 4.1

Let $n\in {\mathbb {N}}$, N = 2n + 1, c ≥ 1 and let $t_{1}< {\cdots } < t_{\frac {M}{2}}\in \left [ 0,1 \right )$ and $t_{\frac {M}{2}+1}<\cdots <t_{M} \in \left [ 0,1 \right )$ for M ≥ 4 even such that:

$$ \begin{array}{@{}rcl@{}} \frac{\tau}{N} &\le& \left| t_{j}-t_{j+\frac{M}{2}} \right|_{{\mathbb{T}}} \le \frac{c\tau}{N},\quad j=1,\dots,\frac{M}{2}, \qquad 0<c\tau\le 1,\\ \frac{\rho}{N} &\le&\left| t_{j}-t_{\ell} \right|_{{\mathbb{T}}}, \quad j < \ell, \ell \ne j+\frac{M}{2}, \qquad\qquad1<\rho < \infty, \end{array} $$

then {t₁,…,t_M} is called a set of nodes with pairs of nearly colliding nodes (see Fig. 4 for an illustration). The constant c measures the uniformity of the colliding nodes. For subsequent use, we additionally introduce the following wrap around distance of indices $\left | {j-\ell } \right |^{\prime }:=\min \limits _{r\in \mathbb {Z}} \left | j-\ell +r\frac {M}{2} \right |$ with respect to $\frac {M}{2}$.

Definition 4.2

We define:

$$ A_{1}\!:=\! \begin{pmatrix} {z_{j}^{k}} \end{pmatrix}_{ \begin{array}{c}j{=}1,{\dots},M/2\\ \left| k \right|{\le }n \end{array}}\in {\mathbb{C}}^{(M/2)\times N} \quad \!\text{and}\! \quad A_{2}\!:=\! \begin{pmatrix} {{z_{j}^{k}}} \end{pmatrix} _{\begin{array}{c}\!j{=}M/2{+}1,{\dots},M\!\\ \left| {k} \right| {\le} n \end{array}}\!\in {\mathbb{\!C}}^{(M/2)\times N} $$

so that with $K_{1}:=A_{1}A_{1}^{*}$, $K_{2}:=A_{2}A_{2}^{*}$, and $B:=A_{2}A_{1}^{*}$ we have the partitioning:

$$ \begin{array}{@{}rcl@{}} A=\begin{pmatrix} A_{1}\\A_{2} \end{pmatrix}, \quad K=\begin{pmatrix} K_{1} & B^{*} \\ B & K_{2} \end{pmatrix}. \end{array} $$

(4.1)

Note that under the assumptions in Definition 4.1 the Vandermonde matrices A₁ and A₂ are each corresponding to nodes that are at least ρ/N-separated.

The proof technique we use is analogous to the one we used in the case of two nearly colliding nodes. The difference is that we have a matrix K₁ instead of a scalar and the block B is a matrix instead of a vector. Subsequently, Lemma 4.3 establishes an upper bound on $\left \Vert {K}\right \Vert $ and Lemmata 4.4, 4.5, and 4.6 establish an upper bound on $\left \Vert K^{-1}\right \Vert $.

Lemma 4.3

Under the conditions of Definition 4.1, we have:

$$ \left\Vert K\right\Vert\le 2N \cdot\frac{\rho+1}{\rho}. $$

Proof

Similar to Lemma 3.3, we start by noting that $\left \Vert B\right \Vert ^{2}\le \left \Vert K_{1}\right \Vert \left \Vert K_{2}\right \Vert $. Together with the decomposition (4.1), the triangle inequality, Lemma A.6, and Theorem 2.1, this leads to:

$$ \begin{array}{@{}rcl@{}} \left\Vert K\right\Vert &\le& \left\Vert \begin{pmatrix} K_{1}&0\\0&K_{2} \end{pmatrix}\right\Vert +\left\Vert \begin{pmatrix} 0&B^{*}\\{{B}}&0 \end{pmatrix}\right\Vert\\ &\le& \max\left\{ \left\Vert K_{1}\right\Vert,\left\Vert K_{2}\right\Vert \right\} + \sqrt{\left\Vert K_{1}\right\Vert\left\Vert K_{2}\right\Vert} \le 2N\cdot\frac{\rho+1}{\rho}. \end{array} $$

□

Lemma 4.4

Under the conditions of Definition 4.1, R₁ := B − K₁ fulfills:

$$ \left\Vert R_{1}\right\Vert \le N- {D_n\left( c\tau/N\right)} + Nc\tau\left( \frac{\pi(\log\left\lfloor\frac{M}{4}\right\rfloor+1)}{\rho}+\frac{\pi^{2}}{6\rho^{2}}\right). $$

Proof

The Dirichlet kernel D_n is monotone decreasing on $\left [ 0,1/N \right ]$. Hence, for the diagonal entries, we obtain:

$$ \left| (R_{1})_{jj} \right| = \left| D_n\left( t_{j}-t_{j+\frac{M}{2}}\right) -N \right| = N -{D_n}\left( t_{j}-t_{j+\frac{M}{2}}\right) \le N -{D_n}({c\tau/N}). $$

The off-diagonal entries are bounded by the mean value theorem and Lemma A.1 as:

$$ \begin{array}{@{}rcl@{}} \left| (R_{1})_{j\ell} \right| &=&\left| {D_n\left( t_{j}-t_{\ell}\right)} - {D_n\left( t_{j+\frac{M}{2}}-t_{\ell}\right)} \right|\\ &\le& \left| {D_n^{\prime}\left( \xi_{j\ell}\right)} \right| \frac{c\tau}{N} \le Nc\tau\left( \frac{\pi}{2 N\xi_{j\ell}} + \frac{1}{2N^{2}\xi_{j\ell}^{2}} \right), \end{array} $$

where $\left ( \left | t_{j+\frac {M}{2}}-t_{\ell } \right |_{{\mathbb {T}}},\left | t_{j}-t_{\ell } \right |_{{\mathbb {T}}} \right )\ni \xi _{j\ell } \ge \left | {j-\ell } \right |^{\prime }\rho /N$ implies:

$$ \left| (R_{1})_{j\ell} \right| \le Nc\tau \left( \frac{\pi}{2\rho \left| j-\ell \right|^{\prime}} + \frac{1}{2\rho^{2}(\left| {j-\ell} \right| ^{\prime})^{2}} \right)=:(\widetilde R_{1})_{j\ell} $$

for $j,\ell =1,\dots ,\frac {M}{2}$, j≠ℓ. Additionally, we set $(\widetilde R_{1})_{j j}:=N -D_{n}({c\tau /N})$. We bound the spectral norm of R₁ by the one of the real symmetric matrix $\widetilde R_{1}$ using Lemma A.2 and proceed by:

$$ \left\Vert R_{1}\right\Vert\le\left\Vert \widetilde R_{1}\right\Vert\le\left\Vert \widetilde R_{1}\right\Vert_{\infty} \le N-D_{n}({c\tau/N}) + 2Nc\tau \sum\limits_{j=1}^{\lfloor\frac{M}{4}\rfloor} \left( \frac{\pi}{2j\rho}+\frac{1}{2j^{2}\rho^{2}}\right), $$

from which the assertion follows. □

Lemma 4.5

Under the conditions of Definition 4.1, R₁ = B − K₁ and R₂ := B − K₂ fulfill:

$$ \left\Vert 2NI+R_{1}^{*}+R_{2}\right\Vert \le 2 {D_n\left( \tau/N\right)} + c^{2}\tau^{2} N\left( \frac{\pi^{2}(\log\lfloor\frac{M}{4}\rfloor+1)}{\rho} + \frac{\pi^{3}}{3\rho^{2}} + \frac{2.42}{\rho^{3}}\right). $$

Proof

First, note that:

$$ (R_{1}^{*}+R_{2})_{j\ell} = {D_n\left( t_{j+\frac{M}{2}}-t_{\ell}\right)} + {D_n\left( t_{j} - t_{\ell+\frac{M}{2}}\right)} - {D_n\left( t_{j+\frac{M}{2}}-t_{\ell+\frac{M}{2}}\right)} - {D_n\left( t_{j}\!-t_{\ell}\right)} . $$

Monotonicity of the Dirichlet kernel D_n on $t\in \left [ 0,1/N \right ]$ gives:

$$ \left| (2NI+R_{1}^{*}+R_{2})_{jj} \right|=2\left| {D_n\left( t_{j+\frac{M}{2}}-t_{j}\right)} \right| \le 2{D_n}({\tau/N}) $$

for j = ℓ. For each fixed off-diagonal entry j≠ℓ, the matrix 2NI has no contribution. We write the node t_j+M/2 as a perturbation of t_j by h_j := t_j+M/2 − t_j and expand the Dirichlet kernel by its Taylor polynomial of degree 2 in the point $\hat h :=t_{j} - t_{\ell } + \frac {h_{j} - h_{\ell }}{2}$. Using:

$$ D_{n}(h)=D_{n}(\hat h)+D_{n}^{\prime}(\hat h)(h-\hat h)+\frac{D_{n}^{\prime\prime}(\xi)}{2}(h-\hat h)^{2} $$

for some $\xi \in [\hat h,h]\cup [h,\hat h]$, the constant term, as well as the linear term, cancels out and we get:

$$ \begin{array}{@{}rcl@{}} &&D_{n}(t_{j}+h_{j}-t_{\ell}) + D_{n}(t_{j}-t_{\ell}-h_{\ell}) - D_{n}(t_{j}+h_{j} - t_{\ell}-h_{\ell}) - D_{n}(t_{j}-t_{\ell}) \\ &=&\frac{1}{8} \left( D_{n}^{\prime\prime}(\xi_{1}) (h_{j}+h_{\ell})^{2} + D_{n}^{\prime\prime}(\xi_{2}) (h_{j}+h_{\ell})^{2} + D_{n}^{\prime\prime}(\xi_{3}) (h_{j}-h_{\ell})^{2} + D_{n}^{\prime\prime}(\xi_{4}) (h_{j}-h_{\ell})^{2} \right). \end{array} $$

Lemma A.1 and $\xi _{1},\dots ,\xi _{4} \ge \left | j-\ell \right |^{\prime }\rho /N$ imply:

$$ \begin{array}{@{}rcl@{}} \left| (R_{1}^{*}+R_{2})_{j\ell} \right| &\le &\frac{N^{3}}{4} \left( \frac{\pi^{2}}{2\left| {j-\ell} \right| ^{\prime}\rho}+\frac{\pi}{(\left| {j-\ell} \right| ^{\prime})^{2}\rho^{2}} + \frac{1}{ (\left|{j-\ell} \right| ^{\prime})^{3}\rho^{3}} \right)\\ &&\cdot \left( (h_{j}+h_{\ell})^{2}+(h_{j}-h_{\ell})^{2}\right) \end{array} $$

and hence by h_j,h_ℓ ≤ cτ/N

$$ \left| (2NI+R_{1}^{*}+R_{2})_{j\ell} \right| \le N c^{2}\tau^{2}\left( \frac{\pi^{2}}{2\left| {j-\ell} \right| ^{\prime}\rho}+\frac{\pi}{(\left| {j-\ell} \right| ^{\prime})^{2}\rho^{2}} + \frac{1}{ (\left| j-\ell \right|^{\prime})^{3}\rho^{3}} \right). $$

The matrix $2NI+R_{1}^{*}+R_{2}$ is real symmetric so that:

$$ \begin{array}{@{}rcl@{}} \left\Vert 2NI+R_{1}^{*}+R_{2}\right\Vert &\le& \left\Vert 2NI+R_{1}^{*}+R_{2}\right\Vert_{\infty} \\ &\le& 2 {D_n\left( \tau/N\right)} + 2\sum\limits_{j=1}^{\lfloor\frac{M}{4}\rfloor} N c^{2}\tau^{2}\left( \frac{\pi^{2}}{2 j\rho}+\frac{\pi}{j^{2}\rho^{2}} + \frac{1}{j^{3}\rho^{3}} \right)\\ &\le& 2 {D_n\left( \tau/N\right)} + 2c^{2}\tau^{2} N\left( \frac{\pi^{2}(\log\lfloor\frac{M}{4}\rfloor+1)}{2\rho} + \frac{\pi^{3}}{6\rho^{2}} + \frac{1.21}{\rho^{3}}\right) \end{array} $$

and therefore the result holds. □

Lemma 4.6

Under the conditions of Definition 4.1 with τ ≤ 1/2 and ρ ≥ 2, such that:

$$ \begin{array}{@{}rcl@{}} \tilde C(\tau,\rho,c,M) &:=& 2 - \frac{c^{2}\pi^{2}(\log\lfloor\frac{M}{4}\rfloor+1)}{\rho}-\frac{c^{2}\pi^{3}}{3\rho^{2}}-\frac{2.42c^{2} }{\rho^{3}} \\ &&-\frac{\rho}{(\rho-1)} \left( \frac{c^{2}\pi^{2}}{6}\tau + \frac{c\pi(\log\lfloor\frac{M}{4}\rfloor+1)}{\rho}+\frac{c\pi^{2}}{6\rho^{2}} \right)^{2} \end{array} $$

is positive, we have:

$$ \left\Vert K^{-1}\right\Vert\le \frac{C(\tau,\rho,c,M)}{N\tau^{2}}, $$

where

$$ C(\tau,\rho,c,M):= \left( \frac{2\rho}{\rho-1}+\sqrt{\frac{\rho+1}{\rho-1}} \right)/\tilde C(\tau,\rho,c,M). $$

Figure 5 visualizes the values of the constant $\tilde {C}(\tau ,\rho ,c,M)$ with respect to ρ and τ. Please note that (i) increasing the constant c by a factor $\sqrt {2}$ has to be compensated approximately by halving τ and doubling ρ and (ii) increasing the number of nodes M from 4 to 64 has to be compensated approximately by tripling ρ.

Proof

We proceed analogously to Lemma 3.5 and apply Lemma A.4 to the matrix K decomposed as in (4.1) and obtain:

$$ \begin{array}{@{}rcl@{}} \|K^{-1}\| \le \max\{\|K_{1}^{-1}\|,\|(K_{2} - BK_{1}^{-1}B^{*})^{-1}\|\} \left\Vert \begin{pmatrix} I & 0 \\ -BK_{1}^{-1} & I \end{pmatrix}\right\Vert^{2}. \end{array} $$

(4.2)

Definition 4.2 and Theorem 2.1 yield:

$$ \left\Vert BK_{1}^{-1}\right\Vert\le \left\Vert A_{2}\right\Vert\left\Vert A_{1}^{\dagger}\right\Vert\le\sqrt{\frac{\rho+1}{\rho-1}}, $$

together with Lemma A.6, we obtain:

$$ \left\Vert \begin{pmatrix} I & 0 \\ -BK_{1}^{-1} & I \end{pmatrix}\right\Vert^{2} \le 1+\left\Vert {BK_{1}^{-1}}\right\Vert +\left\Vert {BK_{1}^{-1}}\right\Vert ^{2} \le \frac{2\rho}{\rho-1}+\sqrt{\frac{\rho+1}{\rho-1}}. $$

Now, we estimate $\left \Vert (K_{2} - BK_{1}^{-1}B^{*})^{-1}\right \Vert $, which is done by the following steps:

(i)
First, note that $I-A_{1}^{\dagger } A_{1}$ is an orthogonal projector and thus Theorem 2.1 implies:
$$ \left\Vert K_{2}-BK_{1}^{-1}B^{*}\right\Vert \le \left\Vert A_{2}\right\Vert\left\Vert I-A_{1}^{\dagger} A_{1}\right\Vert\left\Vert A_{2}^{*}\right\Vert \le \left\Vert A_{2}\right\Vert^{2}<2N. $$
We apply Lemma A.3 with η = 2N, use the identities R₁ = B − K₁ and R₂ = B − K₂, apply the triangular inequality, and the sub-multiplicativity of the matrix norm to get:
$$ \begin{array}{@{}rcl@{}} \left\Vert (K_{2} - BK_{1}^{-1}B^{*})^{-1}\right\Vert \!&=&\! \frac{1}{2N - \left\Vert 2NI - K_{2}+BK_{1}^{-1}B^{*}\right\Vert}\\ \!&\le&\! \frac{1}{2N - \left\Vert 2NI + R_{1}^{*} + R_{2}\right\Vert - \left\Vert R_{1}\right\Vert^{2}\left\Vert K_{1}^{-1}\right\Vert}. \end{array} $$
(4.3)
(ii)
Lemma 4.5 leads to:
$$ \begin{array}{@{}rcl@{}} 2N-\left\Vert 2NI + R_{1}^{*}+R_{2}\right\Vert &\ge& 2(N- {D_n\left( \tau/N\right)} )\\ &&- c^{2}\tau^{2} N\left( \frac{\pi^{2}(\log\lfloor\frac{M}{4}\rfloor+1)}{\rho} + \frac{\pi^{3}}{3\rho^{2}} + \frac{2.42}{\rho^{3}}\right). \end{array} $$
(iii)
We apply Theorem 2.1 and Lemma 4.4 to get:

$$ \begin{array}{@{}rcl@{}} \left\Vert R_{1}\right\Vert^{2} \left\Vert K_{1}^{-1}\right\Vert \le \frac{\rho}{N(\rho-1)} \left[ N- {D_n\left( c\tau/N\right)} + Nc\tau\left( \frac{\pi(\log\left\lfloor\frac{M}{4}\right\rfloor+1)}{\rho}+\frac{\pi^{2}}{6\rho^{2}}\right) \right]^{2}. \end{array} $$
(iv)
We use the estimates for the Dirichlet kernel $N- {D_n\left (\tau /N\right )} \ge N\tau ^{2}$ in ii) and $N-D_n({c\tau /N}) \le N\frac {\pi ^{2}}{6}c^{2}\tau ^{2}$ in iii) (see Lemma A.1), and insert this in (4.3) to get finally:

$$ \begin{array}{@{}rcl@{}} \left\Vert (K_{2} - BK_{1}^{-1}B^{*})^{-1}\right\Vert &\le& \frac{1}{N\tau^{2}} \left[ 2 - \frac{c^{2}\pi^{2}(\log\lfloor\frac{M}{4}\rfloor+1)}{\rho}-\frac{c^{2}\pi^{3}}{3\rho^{2}}-\frac{2.42c^{2} }{\rho^{3}} \right.\\ &&-\!\left.\frac{\rho}{(\rho-1)} \left( \frac{\!c^{2}\pi^{2}}{6}\tau + \frac{c\pi(\log\lfloor\frac{M}{4}\rfloor+1)}{\rho}+\frac{c\pi^{2}}{6\rho^{2}} \!\right)^{2} \right]^{-1}. \end{array} $$

This upper bound also bounds the maximum in (4.2) since for all τ ≤ 1/2 and ρ ≥ 2 together with Theorem 2.1

$$ \left\Vert K_{1}^{-1}\right\Vert\le \frac{2}{N} \le \frac{1}{2N \tau^{2}} \le \frac{1}{N\tau^{2}} [2- \cdots]^{-1}. $$

□

Theorem 4.7 (Upper bound)

Under the conditions of Definition 4.1 with M ≥ 4, $\tau \le \tau _{max} =\frac {1}{4c^{2}}$ and $\rho \ge \rho _{\min \limits } =10c^{2}(\log \lfloor \frac {M}{4} \rfloor +1)$, we have:

$$ \textnormal{cond}(A) \le \frac{5}{\tau}. $$

Proof

In Lemma 4.6, the constant C(τ,ρ,c,M) is monotone increasing in τ and monotone decreasing in ρ. Hence, after plugging in the bounds for τ and ρ in our assumptions, it is easy to see that the constant $C(\frac {1}{4c^{2}},10c^{2}(\log \lfloor \frac {M}{4} \rfloor +1),c,M)$ is monotone decreasing in c and M, respectively. Therefore, we get C(τ,ρ,c,M) ≤ C(1/4,10,1,4) ≤ 11.3, so that $\left \Vert {K^{-1}}\right \Vert \le 11.3N^{-1}\tau ^{-2}$. Together with the bound $\left \Vert {K}\right \Vert \le 22N/10=2.2N$ from Lemma 4.3, we obtain the result. □

If each pair of nearly colliding nodes has the same separation distance, i.e., c = 1, we can improve the upper bound in the sense that restrictions on τ except for τ ≤ 1 can be dropped. In order to obtain the same constant, we have to increase the restrictions on ρ slightly.

Lemma 4.8

Under the conditions of Definition 4.1 with c = 1, such that:

$$ \begin{array}{@{}rcl@{}} \tilde C(\rho,M) &:=& 2 - \frac{\pi^{2}(\log\lfloor\frac{M}{4}\rfloor + 1)}{\rho} - \frac{\pi^{3}}{3\rho^{2}}- \frac{2.42}{\rho^{3}}\\ && -\frac{\rho}{\rho-1} - \frac{2\pi(\log\lfloor\frac{M}{4}\rfloor+1)}{(\rho-1)}-\frac{\pi^{2}}{3\rho(\rho-1)}\\ &&- \frac{\pi^{2}(\log\lfloor\frac{M}{4}\rfloor+1)^{2}}{\rho(\rho-1)} -\frac{\pi^{3}(\log\lfloor\frac{M}{4}\rfloor+1)}{3\rho^{2}(\rho-1)}- \frac{\pi^{4}}{36\rho^{3}(\rho-1)} \end{array} $$

is positive, we have

$$ \left\Vert K^{-1}\right\Vert\le \frac{C(\rho,M)}{N\tau^{2}}, $$

where $C(\rho ,M):= \left ( \frac {2\rho }{\rho -1}+\sqrt {\frac {\rho +1}{\rho -1}} \right ) / \tilde C(\rho ,M)$.

Proof

The proof is analogous to that of Lemma 4.6, the only difference is in step (iv). Setting c = 1 in (ii) and (iii), expanding the squared bracket in (iii) and inserting this into (4.3) leads to:

$$ \begin{array}{@{}rcl@{}} &&\left\Vert (K_{2} - BK_{1}^{-1}B^{*})^{-1}\right\Vert \le \left[ 2\left( N- {D_n\left( \tau/N\right)} \right)\right.\\ &&-N\tau^{2} \left( \frac{\pi^{2}(\log\lfloor\frac{M}{4}\rfloor+1)}{\rho}+\frac{\pi^{3}}{3\rho^{2}}+\frac{2.42}{\rho^{3}}\right) - \frac{\rho}{N(\rho-1)} \left( N- {D_n\left( \tau/N\right)} \right)^{2} \\ &&- \frac{\rho}{\rho-1} 2\tau\left( N- {D_n\left( \tau/N\right)} \right) \left( \frac{\pi(\log\lfloor\frac{M}{4}\rfloor+1)}{\rho}+\frac{\pi^{2}}{6\rho^{2}}\right)\\ &&\left.- N\tau^{2}\frac{\rho}{\rho-1} \left( \frac{\pi^{2}(\log\lfloor\frac{M}{4}\rfloor+1)^{2}}{\rho^{2}} +\frac{\pi^{3}(\log\lfloor\frac{M}{4}\rfloor+1)}{3\rho^{3}}+ \frac{\pi^{4}}{36\rho^{4}}\right) \right]^{-1}. \end{array} $$

In three summands, we can factor out $N- {D_n\left (\tau /N\right )} $ and use the estimate N − D_n(τ/N) ≥ Nτ², leading to a larger bound after inverting the expression in the end. Afterwards, in the third summand $N- {D_n\left (\tau /N\right )} $ is left, for which we use the rough bound N − D_n(τ/N) ≤ N. In the fourth summand, we use τ ≤ 1 for the single τ. The same argument as in (3.4) shows that this also bounds the maximum in (4.2) and we get the result. □

Theorem 4.9 (Upper bound)

Under the conditions of Definition 4.1 with c = 1, $\rho \ge \rho _{\min \limits } = 25 (\log \lfloor \frac {M}{4} \rfloor +1)$, we have:

$$ \textnormal{cond}(A) < \frac{5}{\tau}. $$

Proof

Direct inspection gives monotonicity of C(ρ,M) with respect to ρ and also the estimate $C(25 (\log \lfloor {M}/{4} \rfloor +1),M)\le C(25,4) \le 12$. Hence, $\left \Vert {K^{-1}}\right \Vert \le {12} N^{-1} \tau ^{-2}$ and together with the bound $\left \Vert K\right \Vert \le 52N/25$ from Lemma 4.3 we obtain the result. □

Remark 4.10

Due to Lemma A.5, the upper bound from Theorem 4.7 remains valid if nodes are removed. We note in passing that $\sigma _{\min \limits }$ and $\sigma _{\max \limits }$ are monotone increasing with N and thus, condition number estimates for an even number N follow. Lower and upper bounds in Theorems 2.2 and 4.7 finally yield:

$$ \frac{1}{\tau}\le\text{cond}(A)\le \frac{5}{\tau} . $$

The lower bound is tight and the numerical value 5 in the upper bound follows from our proof technique and can be improved (see Fig. 6). The uniformity condition τ ≤ 1/(4c²) is artificial and, except for the special cases in Theorems 3.6 and 4.9, prevents letting τ → 1.

Moreover, the technical condition $\rho \ge \rho _{\min \limits } = 25 (\log \lfloor \frac {M}{4} \rfloor +1)$ in Theorem 4.7 is due to the slow decay of the Dirichlet kernel and can be weakened by a preconditioning technique which however leads to a somewhat larger constant in the final result.^{Footnote 1} The diagonal matrix $D=\text {diag}(1-{\left | k \right |}/{(n+1)})_{\left | k \right |\le n} \in {\mathbb {C}}^{N\times N}$ is positive definite with ∥D∥≤ 1 and thus the Rayleigh-Riesz characterization of the smallest eigenvalue for Hermitian matrices leads to:

$$ \begin{array}{@{}rcl@{}} \lambda_{\min}(ADA^{*}) &=&\min_{x\in{\mathbb{C}}^{M}, \left\Vert x\right\Vert=1} \left\Vert D^{1/2}A^{*} x\right\Vert^{2} \le \min_{x\in\mathbb{C}^{M}, \left\Vert {x}\right\Vert =1} \left\Vert {A^{*} x}\right\Vert ^{2} = \lambda_{\min}(K). \end{array} $$

The entries of the matrix ADA^∗ consist of Fejér kernel evaluations and analogously to Lemmata 4.4, 4.5, and 4.6 this yields (independently of M):

$$ \begin{array}{@{}rcl@{}} \left\Vert K^{-1}\right\Vert \!\!&\le&\!\! \frac{C(\tau,\rho,c)}{N\tau^{2}},\quad C(\tau,\rho,c)\!:=\! \left( \frac{6\rho^{2}}{3\rho^{2}-\pi^{2}}+\sqrt{\frac{3\rho^{2}+\pi^{2}}{3\rho^{2}-\pi^{2}}} \right)/\tilde C(\tau,\rho,c),\\ \tilde C(\tau,\rho,c) \!&:=&\! \frac{1}{8} \left( 2 - \frac{4c^{2}\pi^{4}}{3\rho^{2}} - \frac{39 c^{2}\pi }{\rho^{3}} - \frac{16c^{2}\pi^{4}+2c^{2}\pi^{6}}{45\rho^{4}} \right.\\ &&\left.\qquad-\frac{3\rho^{2}}{3\rho^{2}-\pi^{2}} \left( \frac{2c^{2}\pi^{2}}{9} \tau + \frac{2c\pi^{3}}{3\rho^{2}} + \frac{9.68 c}{\rho^{3}}\right)^{2} \right). \end{array} $$

Under the conditions of Definition 4.1 with n ≥ 1, $\tau \le \tau _{max} =\frac {1}{4c^{2}}$ and $\rho \ge \rho _{\min \limits } =11c^{2}$, we finally have cond(A) ≤ 14/τ. Note that this approach also allows to drop the logarithmic factor in Remark 3.8 (ii).

The absolute constant 5 in the upper bound of the condition number (or $\tau \sqrt {N}\|A^{\dagger }\| \le \sqrt {11.3}\approx 3.4$) follows from our proof technique and we give a numerical comparison to the approaches [3, 4, 8, 16] in Fig. 7. A short theoretical comparison including different assumptions on N, M, τ, and ρ is given below.

Remark 4.11 (Comparison to 4)

This approach is more general and allows more than two nodes in a group nearly colliding. The upper bound on the condition number grows quite strongly like CM^M with the total number of nodes M, cf. [4, Cor. 3.6]. Moreover, the a priori conditions N ≥ 4M³ and ρ ≥ 2M (see [4, Cor. 3.6, left ineq. (3.4)]) are much stronger than ours. The uniformity condition τ ≤ M/(2c) is slightly weaker than ours but nonetheless artificial.

Remark 4.12 (Comparison to 16)

Again, this approach is more general and allows more than two nodes in a group nearly colliding. The upper bound on the condition number grows like $C\sqrt {M}$ with the total number of nodes M, cf. [16, Thm. 1 ineq. (2.3), Thm. 2 ineq. (2.7), and ineq. (2.8)]. With minor simplifications, the a priori conditions N ≥ M² and:

$$ \begin{array}{@{}rcl@{}} \rho\ge \begin{cases} C_{1} \left( \frac{M}{\tau}\right)^{1/4}, & \text{with}~ C_{1}\approx 42,\\ C_{2} \left( \frac{M}{\tau}\right)^{1/2}, & \text{with}~ C_{2}\approx 63, \end{cases} \end{array} $$

(4.4)

are imposed; see [16, Thm. 1, ineq. (2.2), Thm 2, ineq. (2.5)]. We note that [16, Thm. 1, Thm. 2] places no upper bound except τ ≤ 1 but that condition (4.4) is in fact an a priori lower bound on τ which prevents τ → 0 already for moderate fixed M ≥ 3. Recently, we refined this approach in [13], dropped the mentioned dependencies on M and could weaken the condition (4.4) considerably.

Remark 4.13 (Comparison to 8)

This approach deals with pairs of nearly colliding nodes but differs completely from ours and the ones in [3, 4, 16], and rather generalizes the construction of certain extremal functions in [18] to pairs of nearly colliding nodes and subsets of them. The proven constant in the upper bound given in [8, Cor. 4.2] is $\tau \sqrt {N}\|A^{\dagger }\|\le 9\sqrt {6}/\pi \approx 7.0$ and thus is slightly larger than ours ($\sqrt {11.3}\approx 3.4$). Using the stronger assumption on τ from our setting in the proof of [8, Thm. 3.6] and improving estimates in [8, Eq. (8)] provides the best result (≈ 1.7) for pairs of nearly colliding nodes. The conditions τ ≤ 1 and 3 ≤ ρ are quasi-optimal. Provided all technical results prove right, this approach is superior.

Remark 4.14 (Comparison to 3)

This approach uses a QR-decomposition technique to establish bounds on all singular values of Vandermonde matrices with nearly colliding nodes. Adapted to the case of pairs of nearly colliding nodes, we obtain the following: Let M ≥ 4 even and A as in Definition 4.1. With respect to the nearly colliding pairs, partition $A^{*}=(A_{1}^{*} A_{2}^{*} \hdots A^{*}_{M/2})$ with QR decompositions $A_{j}^{*}=Q_{j} R_{j}$ and set Q := (Q₁Q₂…Q_M/2). Tracing back all constants in lemmata and proofs for the case of pairwise nearly colliding nodes, we obtain the uniform off-diagonal estimate:

$$ \begin{array}{@{}rcl@{}} \left| (Q^{*} Q)_{j,k} \right|&\le {150}/{\rho}+1079\tau, \qquad j\ne k, \end{array} $$

which yields a constant “multiplicative perturbation” in [3, Lem. 5.1] and thus a condition number estimate like Theorem 4.7 or [8] only if τ ≤ C₁/M and C₂M ≤ ρ, for some constants C₁,C₂.

However, note that for two nearly colliding pairs u₁ < u₂ ≪ v₁ < v₂, a direct computation (avoiding a so-called limit basis used in [3]) yields the off-diagonal estimate:

$$ \|Q_{1}^{*} Q_{2}\|_{\mathrm{F}}\le \frac{116}{N(v_{1}-u_{2})}, \qquad Q_{1}=Q_{1}(u_{1},u_{2}), Q_{2}=Q_{2}(v_{1},v_{2}). $$

Together with $\rho \ge \frac {27}{23}\cdot 232(\log \left \lfloor \frac {M}{4}\right \rfloor +1)$ and Lemma A.6, this gives:

$$ \begin{array}{@{}rcl@{}} \left|1-\lambda_{r}(Q^{*}Q)\right| &\le& \underset{j}{\max} \sum\limits_{\ell=1}^{M/2} \|Q_{j}^{*} Q_{\ell}\|_{\mathrm{F}} \le 2 \sum\limits_{\ell=1}^{\left\lfloor{M/4}\right\rfloor} \frac{116}{\ell\rho}\\& \le& \frac{232(\log\left\lfloor\frac{M}{4}\right\rfloor+1)}{\rho} \le \frac{23}{27},\quad r=1,\dots,M. \end{array} $$

The Courant-Fisher min-max theorem [11, Thm. 4.2.6] and Weyl’s perturbation theorem [11, Thm. 4.3.1] finally yield:

$$ \text{cond}(A)\le \text{cond}(Q) \cdot \underset{j}{\max}\text{cond}(A_{j}) \le \frac{5}{\tau}. $$

Altogether, the improved variant of this technique can be used for nearly colliding pairs, but leads to a stronger assumption on ρ for all moderate uniformity constants c.

5 Numerical examples

All computations were carried out using MATLAB R2019b. As a test for the bounds in the case of one pair of nearly colliding nodes, we use the following configuration. Let the number of nodes M = 20 and M = 200 be fixed, respectively. Moreover, we choose N = 1 + 12(M − 1) which ensures that all nodes fit on the unit interval. We choose $\tau \in \left [ {10^{-11},1} \right ]$ logarithmically uniformly at random and $\rho _{3},\dots ,\rho _{M} \in \left [ 6,12 \right ]$ uniformly at random. Then, we set the nodes $t_{1}<\cdots <t_{M} \in \left [ {0,1} \right )$ such that t₁ = 0, t₂ = τ/N and for $j=3,\dots ,M$, $\left | t_{j}-t_{j-1} \right |=\rho _{j}/N$. Afterwards, the condition number of the corresponding Vandermonde matrix is computed. This procedure is repeated 100 times and the results are presented in Fig. 6 (left).

For pairs of nearly colliding nodes, we use the following configuration. Let the number of nodes M = 20 and M = 200 be fixed, respectively. Moreover, we choose the parameter c = 2 and τ_max and $\rho _{\min \limits }$ as in Theorem 4.7. To ensure that all nodes fit on the unit interval, we choose N as the smallest odd integer bigger than $(c\tau _{max}+2\rho _{\min \limits })M/2$. Then, we choose $\tau \in \left [ 10^{-11},1 \right ]$ logarithmically uniformly at random and set the nodes $t_{1}< {\cdots } < t_{M} \in \left [ {0,1} \right )$ such that t₁ = 0, t₂ = τ/N and for $j=3,\dots ,M$, $\left | t_{j}-t_{j-1} \right |=\rho _{j}/N$ if j is odd or $\left | {t_{j}-t_{j-1}} \right | =\tau _{j}/N$ if j is even, where $\tau _{j}\in \left [ \tau ,c\tau \right ]$ and $\rho _{j} \in \left [ \rho _{\min \limits },2\rho _{\min \limits } \right ]$ are picked uniformly at random, respectively. Afterwards, the condition number of the corresponding Vandermonde matrix is computed. This procedure is repeated 100 times and the results are presented in Fig. 6 (right). Note that Theorem 4.7 makes the restriction $\tau \le \tau _{max}=\frac {1}{4}$, which seems to be an artifact of our proof technique.

In order to compare Theorem 4.7 with the results from [4, Cor. 3.6], we need to satisfy the assumptions of both results. We take M = 3 nodes with two nodes nearly colliding, i.e., t₁ = 0, t₁ = τ/N and t₂ = t₁ + ρ/N. The assumptions in [4, Cor. 3.6] make it necessary that the nodes lie on an interval of length $\frac {1}{2M^{2}}=\frac {1}{18}$. We choose the parameter c = 1, $\rho _{\min \limits } =12$, and N = 1001. Then, we pick $\tau \in \left [ {10^{-11},1} \right ] $ logarithmically uniformly at random and $\rho \in \left [ \rho _{\min \limits },\frac {N}{2M^{2}}-\tau \right ]$ uniformly at random. Afterwards, the inverse of the smallest singular value (norm of Moore-Penrose pseudo inverse) of the corresponding Vandermonde matrix is computed. This procedure is repeated 100 times and the results normalized by $\tau \sqrt {N}$ are presented in Fig. 7 (left). From [4, Cor. 3.6], we get:

$$ \left\Vert A^{\dagger}\right\Vert \le \frac{2(2\pi)^{M-1} M^{2M-1}}{\pi}\cdot \frac{N\sqrt{N}}{(N-1)\sqrt{N-1}}\cdot\frac{1}{\tau\sqrt{N}} \approx 6116 \cdot \frac{1}{\tau\sqrt{N}} $$

for τ ≤ 1, whereas Theorem 4.7 provides $\left \Vert A^{\dagger }\right \Vert \le \sqrt {11.3}\cdot \frac {1}{\tau \sqrt {N}} \approx 3.4 \cdot \frac {1}{\tau \sqrt {N}}$ for $\tau \le \frac {1}{4}$.

In order to compare our results with the ones from the second version of [16, Thm. 1, Thm. 2], we set the parameter N = 2¹⁵ + 1, c = 1 and M = 4 and M = 20, respectively. All pairs of nodes are placed uniformly, such that $t_{j}=\frac {2j-2}{M}$ and $t_{j+M/2}=t_{j}+\frac {\tau }{N}$ for $j=1,\hdots ,\frac {M}{2}$, where τ is picked logarithmically uniformly at random from [10^− 11,1]. Afterwards, the inverse of the smallest singular value (norm of Moore-Penrose pseudo inverse) of the corresponding Vandermonde matrix is computed. This procedure is repeated 100 times and the results normalized by $\tau \sqrt {N}$ are presented in Fig. 7 (right). Note that [16, Thm. 1, ineq. (2.2), Thm. 2, ineq. (2.5)] restricts:

$$ \begin{array}{@{}rcl@{}} \tau \!\ge\! \frac{20^{2} M 2^{5} N^{3}}{\rho^{2} (N-1)^{3}} \!\approx\! \left\{\begin{array}{cc} 1.9\cdot 10^{-4},\\ 2.4 \cdot 10^{-2}, \end{array}\right.\qquad \tau \!\ge\! \frac{10^{4} 2^{10} M N^{5}}{\rho^{4} \pi(N-1)^{5}} \!\approx\! \left\{\begin{array}{cc} 1.8\cdot 10^{-10},\quad &M=4,\\ 5.6 \cdot 10^{-7},\quad &M=20, \end{array}\right. \end{array} $$

respectively, where we used the uniform bound $\rho < \frac {2N}{M}$. The results are shown in Fig. 7 (right) by proper lines [16, Thm. 2, ineq. (2.5)] and by broken lines [16, Thm. 1, ineq. (2.2)]. In both cases and with minor corrections, the resulting estimate is:

$$ \begin{array}{@{}rcl@{}} \left\Vert A^{\dagger}\right\Vert &\le& \frac{20\sqrt{2}}{19} \left( 1-\frac{\pi^{2}}{12}\right)^{-1/2} \frac{N-1}{2}\left\lfloor{\frac{N-1}{2}}\right\rfloor ^{-1}\frac{4}{\pi} \sqrt{M} \frac{\sqrt{N}}{\sqrt{N-1}}\cdot \frac{1}{\tau\sqrt{N}}\\ &\approx& \begin{cases} 9 \cdot \frac{1}{\tau\sqrt{N}},\quad & M=4,\\ 20.1 \cdot \frac{1}{\tau\sqrt{N}},\quad &M=20, \end{cases} \end{array} $$

whereas Theorem 4.7 provides again $\left \Vert A^{\dagger }\right \Vert \le 3.4 \cdot \frac {1}{\tau \sqrt {N}}$ for $\tau \le \frac {1}{4}$. We note that our bound remains valid for c > 1 but the restriction on τ becomes more severe.

6 Summary

We proved upper and lower bounds for the spectral condition number of rectangular Vandermonde matrices with nodes on the complex unit circle. If pairs of nodes nearly collide, the studied condition number grows linearly with the inverse separation distance. In contrast to the more general results [4, 16], we provide reasonable sharp and absolute constants but have to admit that our technique most likely will not generalize to more than two nodes nearly colliding. Note that our easy to achieve lower bound seems to capture the situation more accurately than the upper bound. We posed mild technical conditions in our proofs, which cannot be confirmed to be necessary numerically. While [4] provided the right growth order for the first time, some of the imposed conditions are very restrictive and the involved constants are quite pessimistic. The second version of [16] provided a quite general framework and presented decent results with only a mild artificial growth of the condition number with respect to the number of nodes. Moreover, a technical condition there prevents the separation distance from going to zero for a fixed number of nodes and a fixed bandwidth. We believe that both problems can be fixed at least partially and thus [16] seems to be a good framework for understanding node configurations with nearly colliding nodes. Recently, the manuscript [8] came to our attention—it considers pairs of nearly colliding nodes and weakens the assumptions considerably and gives, after modifications, stronger bounds on the smallest singular value. The taken approach differs completely from ours and the ones in [4, 16], but rather generalizes the construction of [18] to pairs of nearly colliding nodes.

Change history

03 October 2020
A Correction to this paper has been published: https://doi.org/10.1007/s11075-020-01015-3

Notes

We thank one of the peer reviewers for this clever hint.

References

Akinshin, A., Goldman, G., Yomdin, Y.: Geometry of error amplification in solving Prony system with near-colliding nodes. arXiv:1701.04058 (2017)
Aubel, C., Bölcskei, H.: Vandermonde matrices with nodes in the unit disk and the large sieve. Appl. Comput. Harmon. Anal. 47(1), 53–86 (2019)
Article MathSciNet Google Scholar
Batenkov, D., Diederichs, B., Goldman, G., Yomdin, Y.: The spectral properties of Vandermonde matrices with clustered nodes (2019)
Batenkov, D., Demanet, L., Goldman, G., Yomdin, Y.: Conditioning of partial nonuniform Fourier matrices with clustered nodes. SIAM J. Matrix Anal. Appl. 41(1), 199–220 (2020). arXiv:1909.01927
Article MathSciNet Google Scholar
Bazán, F.S.V.: Conditioning of rectangular Vandermonde matrices with nodes in the unit disk. SIAM J. Matrix Anal. Appl. 21(2), 679–693 (1999)
Article MathSciNet Google Scholar
de Prony, B.G.R.: Essai éxperimental et analytique: sur les lois de la dilatabilité de fluides élastique et sur celles de la force expansive de la vapeur de l’alkool, a différentes températures. J. l’École Polytech. 1(22), 24–76 (1795)
Google Scholar
Demanet, L., Nguyen, N.: The recoverability limit for superresolution via sparsity ArXiv e-prints. arXiv:1502.01385 (2015)
Diederichs, B.: Well-posedness of sparse frequency estimation. arXiv:1905.08005 (2019)
Donoho, D.L.: Superresolution via sparsity constraints. SIAM J. Math. Anal. 23(5), 1309–1331 (1992)
Article MathSciNet Google Scholar
Feingold, D.G., Varga, R.S.: Block diagonally dominant matrices and generalizations of the gerschgorin circle theorem. Pac. J. Math. 12(4), 1241–1250 (1962)
Article MathSciNet Google Scholar
Horn, R.A., Johnson, C.R.: Matrix Analysis, 2nd edn. Cambridge University Press, New York (2013)
Google Scholar
Hua, Y., Sarkar, T.K.: Matrix pencil method for estimating parameters of exponentially damped/undamped sinusoids in noise. IEEE Trans. Acoust. Speech Signal Process. 38(5), 814–824 (1990)
Article MathSciNet Google Scholar
Kunis, S., Nagel, D.: On the smallest singular value of multivariate vandermonde matrices with clustered nodes. Linear Algebra and its Applications 604, 1–20 (2020). ArXiv e-prints
Article MathSciNet Google Scholar
Kunis, S., Potts, D.: Stability results for scattered data interpolation by trigonometric polynomials. SIAM J. Sci. Comput. 29, 1403–1419 (2007)
Article MathSciNet Google Scholar
Kunis, S., Möller, H.M., Peter, T., von der Ohe, U.: Prony’s method under an almost sharp multivariate Ingham inequality. J. Fourier Anal. Appl. 24(5), 1306–1318 (2018)
Article MathSciNet Google Scholar
Li, W., Liao, W.: Stable super-resolution limit and smallest singular value of restricted Fourier matrices. arXiv:1709.03146 (2017)
Liao, W., Fannjiang, A.: MUSIC for single-snapshot spectral estimation: stability and super-resolution. Appl. Comput. Harmon. Anal. 40(1), 33–67 (2016)
Article MathSciNet Google Scholar
Moitra, A.: Super-resolution, extremal functions and the condition number of Vandermonde matrices. In: STOC’15—Proceedings of the 2015 ACM Symposium on Theory of Computing, pp 821–830. ACM, New York (2015)
Google Scholar
Morgenshtern, V.I., Candès, E.J.: Super-resolution of positive sources: the discrete setup. SIAM J. Imaging Sci. 9(1), 412–444 (2016)
Article MathSciNet Google Scholar
Potts, D., Tasche, M.: Error estimates for the ESPRIT algorithm. In: Large Truncated Toeplitz Matrices, Toeplitz Operators, and Related Topics, volume 259 of Oper. Theory Adv. Appl., p Cham. Birkhäuser/Springer (2017)
Roy, R., Kailath, T.: ESPRIT—estimation of signal parameters via rotational invariance techniques. IEEE Trans. Acoust. Speech Signal Process. 37 (7), 984–995 (1989)
Article Google Scholar
Schmidt, R.O.: Multiple emitter location and signal parameter estimation. IEEE Trans. Antennas Propag. 34(3), 276–280 (1986)
Article Google Scholar

Download references

Acknowledgments

Open Access funding provided by Projekt DEAL. The authors thank both referees for their valuable suggestions.

Funding

The authors received support from the projects DFG-GK1916 and DFG-SFB944.

Author information

Authors and Affiliations

Institute of Mathematics and Research Center of Cellular Nanoanalytics, Osnabrück University, Osnabrueck, Germany
Stefan Kunis & Dominik Nagel

Authors

Stefan Kunis
View author publications
You can also search for this author in PubMed Google Scholar
Dominik Nagel
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Stefan Kunis.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

The original article was revised: Figure 2 image is correctly presented in this paper.

Appendix: A

The following technical results are used within the proofs of our main results.

Lemma A.1

Let $n\in {\mathbb {N}}$, N = 2n + 1, then the Dirichlet kernel (2.2) is bounded by

$$ N- \frac{\pi^{2}}{6}N^{3}t^{2}\le D_{n}(t)\le N - N^{3} t^{2},\qquad 0\le|t|\le\frac{1}{N}. $$

Furthermore, the Dirichlet kernel and its first two derivatives are bounded by

$$ \begin{array}{@{}rcl@{}} \left| {D_n\left( t\right)} \right| &\le& \frac{1}{2\left| t \right|},\\ \left| {D_n^{\prime}\left( t\right)} \right| &\le& N^{2} \left( \frac{\pi}{2N|t|}+\frac{1}{2N^{2}|t|^{2}}\right),\\ \left| {D_n^{\prime\prime}} {\left( t\right)} \right| &\le& N^{3}\left( \frac{\pi^{2}}{2N|t|}+\frac{\pi}{N^{2}|t|^{2}}+\frac{1}{N^{3}|t|^{3}}\right) \end{array} $$

for 0 < |t|≤ 1/2.

Proof

Due to symmetry, it suffices to prove all bounds for t > 0 and we use the explicit expression of the Dirichlet kernel in (2.2). The lower bound on t can be derived from the inequalities $x-x^{3}/6\le \sin \limits (x) \le x$, that hold for all x ∈ [0,π]. The left inequality with x = Nπt and the right inequality with x = πt lead to:

$$ \sin(N\pi t) \ge \left( N- \frac{\pi^{2}}{6} N^{3} t^{2}\right) \pi t \ge \left( N- \frac{\pi^{2}}{6} N^{3} t^{2}\right) \sin(\pi t). $$

The upper bound on $ {D_n\left (t\right )} $ can be derived from the inequality $\cos \limits (\alpha x) \le \cos \limits (x)$ that holds for all x ∈ [0,π/2] and α > 1 such that αx ∈ [0,π/2]. Integrating this inequality, choosing α = N/2 and x = πt, and applying the double angle formula yields:

$$ \frac{\sin(N\pi t)}{2\cos(\frac{N}{2}\pi t)}=\sin\left( \frac{N}{2}\pi t\right) \le \frac{N}{2}\sin(\pi t). $$

Reordering the inequality and applying that $\cos \limits (x)\le 1-4 x^{2} /\pi ^{2}$ for all x ∈ [0,π/2] yields:

$$ \frac{\sin(N\pi t)}{\sin(\pi t)} \le N\cos\left( \frac{N}{2} \pi t\right) \le N(1- N^{2} t^{2}). $$

Finally, the remaining bounds on the absolute values can be proven by calculating the first and second derivatives and using $\sin \limits (x)\ge 2x/\pi $ and $\cot (x)\le 1/x$ that hold for all x ∈ (0,π/2]. □

Lemma A.2

Let $M,\widetilde M\in {\mathbb {C}}^{m\times n}$ with $\left | M_{k\ell } \right |\le \widetilde M_{k\ell }$ for all k = 1,…,m, ℓ = 1,…,n, then:

$$ \left\Vert M\right\Vert \le \left\Vert \widetilde M\right\Vert. $$

Proof

We directly show the result by:

$$ \begin{array}{@{}rcl@{}} \left\Vert M\right\Vert^{2}&=& \max_{\left\Vert x\right\Vert=1} \left\Vert Mx\right\Vert^{2} = \max_{\left\Vert x\right\Vert=1} \sum\limits_{k=1}^{m} \left| \sum\limits_{\ell=1}^{n} M_{k\ell}x_{\ell} \right|^{2} \le \max_{\left\Vert x\right\Vert=1} \sum\limits_{k=1}^{m} \left( \sum\limits_{\ell=1}^{n} \left| M_{k\ell} \right|\left| x_{\ell} \right|\right)^{2}\\ &\le& \max_{\left\Vert x\right\Vert=1} \sum\limits_{k=1}^{m} \left( \sum\limits_{\ell=1}^{n} \widetilde M_{k\ell}\left| {x_{\ell}} \right| \right)^{2} = \max_{\left\Vert {x}\right\Vert =1} \sum\limits_{k=1}^{m} \left( \sum\limits_{\ell=1}^{n} \widetilde M_{k\ell}x_{\ell}\right)^{2} = \left\Vert \widetilde M\right\Vert^{2}. \end{array} $$

Note that similar estimates can be found for the Frobenius norm in [11, p. 520]. □

Lemma A.3 (Norm of matrix inverse)

Let $M \in {\mathbb {C}}^{n\times n}$ Hermitian and positive definite and $I \in \mathbb {C}^{n\times n}$ the identity matrix. Let $\eta \in \mathbb {R}$ be a parameter satisfying $\eta > \left \Vert M\right \Vert $, then:

$$ \left\Vert M^{-1}\right\Vert = \frac{1}{\eta - \left\Vert \eta I -M\right\Vert}. $$

Proof

Since M is positive definite, let its real, positive eigenvalues be given by $\lambda _{1}(M) \ge \dotsb \ge \lambda _{n}(M) >0$. By assumption $\eta >\left \Vert {M}\right \Vert = \lambda _{\max \limits }(M)$ and therefore, ηI − M is positive definite as well with largest eigenvalue $\lambda _{\max \limits }(\eta I - M)=\eta -\lambda _{\min \limits }(M)$. This finally leads to:

$$ \left\Vert M^{-1}\right\Vert = \frac{1}{\lambda_{\min}(M)} = \frac{1}{\eta - (\eta-\lambda_{\min}(M))} = \frac{1}{\eta - \lambda_{\max}(\eta I - M)} = \frac{1}{\eta - \left\Vert {\eta I - M}\right\Vert}. $$

□

Lemma A.4 (Schur complement, cf. [11, eq. (0.8.5.3)])

Let $n_{1},n_{2}\in {\mathbb {N}}$ and the matrix M $\in {\mathbb {C}}^{(n_{1}+n_{2})\times (n_{1}+n_{2})}$ be a 2 × 2 block matrix of the form:

$$ M=\begin{pmatrix} M_{1} & M_{2} \\ M_{3} & M_{4} \end{pmatrix}, M_{1}\in {\mathbb{C}}^{n_{1}\times n_{1}}, M_{4} \in {\mathbb{C}}^{n_{2}\times n_{2}}, $$

with M₁ being invertible. Then, the Schur complement decomposition is given by:

$$ M= \begin{pmatrix} I_{n_{1}} & 0 \\ -M_{3}M_{1}^{-1} & I_{n_{2}} \end{pmatrix}^{-1} \begin{pmatrix} M_{1} & 0 \\ 0 & M_{4}-M_{3}M_{1}^{-1}M_{2} \end{pmatrix} \begin{pmatrix} I_{n_{1}} & -M_{1}^{-1}M_{2} \\ 0 & I_{n_{2}} \end{pmatrix}^{-1}. $$

The block $[M/M_{1}]:=M_{4}-M_{3}M_{1}^{-1}M_{2}$ is called Schur complement of M₁ in M.

Lemma A.5 (Cauchy interlacing theorem for eigenvalues, cf. [11, Thm. (4.3.28)])

Let $M\in {\mathbb {C}}^{n\times n}$ be a Hermitian complex matrix, such that:

$$ M = \begin{pmatrix} M_{1} & M_{2} \\ M_{2}^{*} & M_{3} \end{pmatrix},\quad M_{1}\in{\mathbb{C}}^{m\times m}, M_{2}\in {\mathbb{C}}^{m\times (n-m)}, M_{3}\in {\mathbb{C}}^{(n-m) \times (n-m)}. $$

Let the eigenvalues of M and M₁ be ordered in non-decreasing order, then:

$$ \lambda_{i}(M)\le \lambda_{i}(M_{1}) \le \lambda_{i+n-m}(M),\quad i=1,\dots,m. $$

Lemma A.6 (Block Gerschgorin theorem, cf. [11, 6.1.P17] or [10, Thm. 5])

Let $M \in {\mathbb {C}}^{nm\times nm}$ be an m × m block matrix with blocks $M_{ik}\in {\mathbb {C}}^{n\times n}$. Let the diagonal blocks M_ii be normal and denote $\lambda _{1}^{(i)},\dots , \lambda _{n}^{(i)}$ their eigenvalues, respectively. Then, the eigenvalues of M are included in the set:

$$ \bigcup_{i=1}^{n}\bigcup_{j=1}^{m} \left\{z\in {\mathbb{C}} : |z-\lambda_{j}^{(i)}| \le \sum\limits_{k\ne i}\|M_{ik}\|\right\}. $$

In particular, we have for $M\in {\mathbb {C}}^{m\times n}$ the inequalities:

$$ \begin{array}{@{}rcl@{}} \left\Vert \begin{pmatrix} 0 & M^{*} \\ M & 0 \end{pmatrix}\right\Vert &\le \left\Vert M\right\Vert \text{and}\qquad \left\Vert \begin{pmatrix} I & 0 \\ M & I \end{pmatrix}\right\Vert^{2} \le 1+\left\Vert M\right\Vert+\left\Vert M\right\Vert^{2}. \end{array} $$

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kunis, S., Nagel, D. On the condition number of Vandermonde matrices with pairs of nearly-colliding nodes. Numer Algor 87, 473–496 (2021). https://doi.org/10.1007/s11075-020-00974-x

Download citation

Received: 28 October 2019
Accepted: 24 June 2020
Published: 21 July 2020
Issue Date: May 2021
DOI: https://doi.org/10.1007/s11075-020-00974-x

Keywords

Mathematics Subject Classification (2010)

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

On the condition number of Vandermonde matrices with pairs of nearly-colliding nodes

Abstract

Similar content being viewed by others

On the Spectral Gap of a Square Distance Matrix

Hilbert-Schmidt Numerical Radius of a Pair of Operators

Rectangles of positive eigenvalues with positive eigenfunctions of nonlinear multiparameter coupled systems

1 Introduction

2 Preliminaries

Theorem 2.1

Theorem 2.2 (Lower bound)

Proof

3 Nodes with one nearly colliding pair

Definition 3.1

Definition 3.2

Lemma 3.3

Proof

Lemma 3.4

Proof

Lemma 3.5

Proof

Theorem 3.6 (Upper bound)

Proof

Remark 3.7 (Constants)

Remark 3.8 (Generalizations and limitations)

4 Pairs of nearly colliding nodes

Definition 4.1

Definition 4.2

Lemma 4.3

Proof

Lemma 4.4

Proof

Lemma 4.5

Proof

Lemma 4.6

Proof

Theorem 4.7 (Upper bound)

Proof

Lemma 4.8

Proof

Theorem 4.9 (Upper bound)

Proof

Remark 4.10

Remark 4.11 (Comparison to 4)

Remark 4.12 (Comparison to 16)

Remark 4.13 (Comparison to 8)

Remark 4.14 (Comparison to 3)

5 Numerical examples

6 Summary

Change history

03 October 2020

Notes

References

Acknowledgments

Funding

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Appendix: A

Appendix: A

Lemma A.1

Proof

Lemma A.2

Proof

Lemma A.3 (Norm of matrix inverse)

Proof

Lemma A.4 (Schur complement, cf. [11, eq. (0.8.5.3)])

Lemma A.5 (Cauchy interlacing theorem for eigenvalues, cf. [11, Thm. (4.3.28)])

Lemma A.6 (Block Gerschgorin theorem, cf. [11, 6.1.P17] or [10, Thm. 5])

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification (2010)

Search

Navigation