Eigenvalue perturbation theory of symplectic, orthogonal, and unitary matrices under generic structured rank one perturbations

Mehl, Christian; Mehrmann, Volker; Ran, André C. M.; Rodman, Leiba

doi:10.1007/s10543-013-0451-3

Published: 15 October 2013

Volume 54, pages 219–255, (2014)
BIT Numerical Mathematics

Christian Mehl¹, Volker Mehrmann¹, André C. M. Ran²,³ & Leiba Rodman⁴

Abstract

We study the perturbation theory of structured matrices under structured rank one perturbations, with emphasis on matrices that are unitary, orthogonal, or symplectic with respect to an indefinite inner product. The rank one perturbations are not necessarily of arbitrarily small size (in the sense of norm). In the case of sesquilinear forms, results on selfadjoint matrices can be applied to unitary matrices by using the Cayley transformation, but in the case of real or complex symmetric or skew-symmetric bilinear forms additional considerations are necessary. For complex symplectic matrices, it turns out that generically (with respect to the perturbations) the behavior of the Jordan form of the perturbed matrix follows the pattern established earlier for unstructured matrices and their unstructured perturbations, provided the specific properties of the Jordan form of complex symplectic matrices are accounted for. For instance, the number of Jordan blocks of fixed odd size corresponding to the eigenvalue 1 or −1 has to be even. For complex orthogonal matrices, it is shown that the behavior of the Jordan structures corresponding to the original eigenvalues that are not moved by perturbations again follows the pattern established earlier for unstructured matrices, taking into account the specifics of Jordan forms of complex orthogonal matrices. The proofs are based on general results developed in the paper concerning Jordan forms of structured matrices (which include in particular the classes of orthogonal and symplectic matrices) under structured rank one perturbations. These results are presented and proved in the framework of real as well as of complex matrices.

References

[1] Barnett, S.: Greatest common divisor of several polynomials. Proc. Camb. Philos. Soc. 70, 263–268 (1971)
[2] Beitia, M.A., de Hoyos, I., Zaballa, I.: The change of the Jordan structure under one row perturbations. Linear Algebra Appl. 401, 119–134 (2005)
[3] Brunovsky, P.: A classification of linear controllable systems. Kybernetika 6, 173–188 (1970)
[4] Chen, B.M.: Robust and H∞ Control. Springer, London (2000)
[5] De Terán, F., Dopico, F.: Low rank perturbation of Kronecker structures without full rank. SIAM J. Matrix Anal. Appl. 29, 496–529 (2007)
[6] De Terán, F., Dopico, F.: Low rank perturbation of regular matrix polynomials. Linear Algebra Appl. 430, 579–586 (2009)
[7] De Terán, F., Dopico, F., Moro, J.: Low rank perturbation of Weierstrass structure. SIAM J. Matrix Anal. Appl. 30, 538–547 (2008)
[8] Fuhrmann, P.A.: Linear Systems and Operators in Hilbert Space. McGraw-Hill, New York (1981)
[9] Gantmacher, F.R.: Theory of Matrices, vol. 1. Chelsea, New York (1959)
[10] Godunov, S.K., Sadkane, M.: Spectral analysis and symplectic matrices with application to the theory of parametric resonance. SIAM J. Matrix Anal. Appl. 28, 1045–1069 (2006)
[11] Gohberg, I., Heinig, G.: The resultant matrix and its generalizations. I. The resultant operator of matrix polynomials. Acta Sci. Math. 37, 41–61 (1975) (Russian)
[12] Gohberg, I., Lancaster, P., Rodman, L.: Indefinite Linear Algebra and Applications. Birkhäuser, Basel (2005)
[13] Hörmander, L., Melin, A.: A remark on perturbations of compact operators. Math. Scand. 75, 255–262 (1994)
[14] Janse van Rensburg, D.: Structured Matrices in Indefinite Inner Product Spaces: Simple Forms, Invariant Subspaces, and Rank-one Perturbations. Ph.D. thesis, North-West University, Potchefstroom, South Africa (2012)
[15] Krupnik, M.: Changing the spectrum of an operator by perturbation. Linear Algebra Appl. 167, 113–118 (1992)
[16] Mackey, D.S., Mackey, N., Mehl, C., Mehrmann, V.: Smith forms of palindromic matrix polynomials. Electron. J. Linear Algebra 22, 53–91 (2011)
[17] Mehl, C.: On classification of normal matrices in indefinite inner product spaces. Electron. J. Linear Algebra 15, 50–83 (2006)
[18] Mehl, C., Mehrmann, V., Ran, A.C.M., Rodman, L.: Eigenvalue perturbation theory of classes of structured matrices under generic structured rank one perturbations. Linear Algebra Appl. 435, 687–716 (2011)
[19] Mehl, C., Mehrmann, V., Ran, A.C.M., Rodman, L.: Perturbation theory of selfadjoint matrices and sign characteristics under generic structured rank one perturbations. Linear Algebra Appl. 436, 4027–4042 (2012)
[20] Mehl, C., Mehrmann, V., Ran, A.C.M., Rodman, L.: Jordan forms of real and complex matrices under rank one perturbations. Oper. Matrices 7, 381–398 (2013)
[21] Mehrmann, V.: The Autonomous Linear Quadratic Optimal Control Problem: Theory and Numerical Solution. Lecture Notes in Control and Information Sciences, vol. 163. Springer, Heidelberg (1991)
[22] Mimura, M., Toda, H.: Topology of Lie Groups, I and II. Am. Math. Soc., Providence (1991)
[23] Moro, J., Dopico, F.: Low rank perturbation of Jordan structure. SIAM J. Matrix Anal. Appl. 25, 495–506 (2003)
[24] Ran, A.C.M., Wojtylak, M.: Eigenvalues of rank one perturbations of unstructured matrices. Linear Algebra Appl. 437, 589–600 (2012)
[25] Savchenko, S.V.: Typical changes in spectral properties under perturbations by a rank-one operator. Mat. Zametki 74, 590–602 (2003) (Russian). Translation in Math. Notes 74, 557–568 (2003)
[26] Savchenko, S.: On the change in the spectral properties of a matrix under a perturbation of a sufficiently low rank. Funkc. Anal. Prilozh. 38, 85–88 (2004) (Russian). Translation in Funct. Anal. Appl. 38, 69–71 (2004)
[27] Stewart, G.W., Sun, J.-G.: Matrix Perturbation Theory. Academic Press, Boston (1990)
[28] Thompson, R.C.: Invariant factors under rank one perturbations. Can. J. Math. 32, 240–245 (1980)
[29] Zhou, K., Doyle, J.C., Glover, K.: Robust and Optimal Control. Prentice Hall, New York (1995)

Author information

Authors and Affiliations

Institut für Mathematik, MA 4-5, TU Berlin, Straße des 17. Juni 136, 10623, Berlin, Germany
Christian Mehl & Volker Mehrmann
Afdeling Wiskunde, Faculteit der Exacte Wetenschappen, VU Amsterdam, De Boelelaan 1081a, 1081 HV, Amsterdam, The Netherlands
André C. M. Ran
Unit for BMI, North-West University, Potchefstroom, South Africa
André C. M. Ran
Department of Mathematics, College of William and Mary, P.O. Box 8795, Williamsburg, VA, 23187-8795, USA
Leiba Rodman

Corresponding author

Correspondence to Christian Mehl.

Additional information

Communicated by Miloud Sadkane.

This research was supported by Deutsche Forschungsgemeinschaft, through the DFG Research Center Matheon "Mathematics for key technologies" in Berlin.

Appendix: Proof of Theorem 5.3

In this section we prove Theorem 5.3. The proof follows the same lines as the proof of Theorem 4.2 in [18], but is more general and extends the result that was obtained there. Before we prove Theorem 5.3, we quote two results from [18]. The first one follows from the Brunovsky canonical form, see [3], and also [4, 8], of general multi-input control systems $\dot{x}=Ax+Bu$ under transformations

$$(A,B) \quad\mapsto\quad\bigl(C^{-1}(A+BR)C, C^{-1}BD\bigr), $$

with invertible matrices $C$, $D$ and an arbitrary matrix $R$ of suitable sizes.

Theorem 10.1

Let $A\in{\mathbb{C}}^{n\times n}$ be a matrix in Jordan canonical form

$$ A={\mathrm{J}}_{n_1}(\lambda_1)\oplus\cdots \oplus{\mathrm{J}}_{n_g}(\lambda_g) \oplus{\mathrm{J}}_{n_{g+1}}(\lambda_{g+1})\oplus\cdots\oplus{ \mathrm{J}}_{n_\nu}(\lambda_\nu), $$

(10.1)

where $\lambda_{1}=\dots=\lambda_{g}=:\widehat{\lambda}\in\mathbb{C}$, $\lambda_{g+1},\dots,\lambda_{\nu}\in\mathbb{C}\setminus\{\widehat{\lambda}\}$, and $n_1\geq\cdots\geq n_g$. Moreover, let $B=uv^T$, where

$$u=\left [ \begin{array}{c}u_1\\ \vdots\\ u_\nu \end{array} \right ],\quad v=\left [ \begin{array}{c}v_1\\ \vdots\\ v_\nu \end{array} \right ], \quad u_i,v_i \in\mathbb{C}^{n_i},\ i=1,\dots,\nu. $$

Assume that the first component of each vector $v_i$, $i=1,\dots,\nu$, is nonzero. Then the matrix $\operatorname{Toep} (v_{1})\oplus\cdots\oplus\operatorname{Toep} (v_{\nu})$ is invertible, and if we denote its inverse by $S$, then $S^{-1}AS=A$ and

$$ S^{-1}BS= \bigl[we_{1,n_1}^T, \dots,we_{1,n_\nu}^T \bigr], $$

(10.2)

where $w=S^{-1}u$. Moreover, the matrix $S^{-1}(A+B)S$ has at least $g-1$ Jordan chains associated with $\widehat{\lambda}$ of lengths at least $n_2,\dots,n_g$ given by

$$ \begin{array}{l@{\quad}l@{\quad}l} e_1-e_{n_1+1},&\dots,&e_{n_2}-e_{n_1+n_2};\\ e_1-e_{n_1+n_2+1},&\dots,&e_{n_3}-e_{n_1+n_2+n_3};\\ \vdots&\ddots&\vdots\\ e_1-e_{n_1+\cdots+n_{g-1}+1},&\dots,&e_{n_g}-e_{n_1+\cdots +n_{g-1}+n_g}.\\ \end{array} $$

(10.3)
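The chain structure in (10.3) can be checked on a small instance. The following Python sketch is an illustration only and not part of the original argument. It assumes $\widehat{\lambda}=0$, assumes upper triangular Jordan blocks, and works directly with the transformed matrix $A+w[e_{1,n_1}^T,\dots,e_{1,n_\nu}^T]$ from (10.2) instead of constructing $S$. The helper names `jordan_nilpotent`, `perturbed`, `chain_from_theorem`, and `is_jordan_chain_at_zero` are hypothetical.

```python
def jordan_nilpotent(sizes):
    """Direct sum of nilpotent upper triangular Jordan blocks J_n(0)."""
    n = sum(sizes)
    A = [[0] * n for _ in range(n)]
    start = 0
    for size in sizes:
        for k in range(size - 1):
            A[start + k][start + k + 1] = 1
        start += size
    return A


def perturbed(sizes, w):
    """A + w * r^T, where r has a 1 at the first position of every block."""
    M = jordan_nilpotent(sizes)
    start = 0
    for size in sizes:
        for i in range(len(w)):
            M[i][start] += w[i]
        start += size
    return M


def matvec(M, x):
    """Matrix-vector product for lists of lists."""
    return [sum(m * xi for m, xi in zip(row, x)) for row in M]


def chain_from_theorem(sizes, block):
    """Vectors e_k - e_{offset+k}, k = 1..n_block, for a 0-based block index >= 1."""
    n = sum(sizes)
    offset = sum(sizes[:block])
    chain = []
    for k in range(sizes[block]):
        x = [0] * n
        x[k] = 1
        x[offset + k] = -1
        chain.append(x)
    return chain


def is_jordan_chain_at_zero(M, chain):
    """Check M x_1 = 0, M x_k = x_{k-1} for k >= 2, and x_1 != 0."""
    zero = [0] * len(M)
    previous = zero
    for x in chain:
        if matvec(M, x) != previous:
            return False
        previous = x
    return chain[0] != zero
```

For block sizes `[3, 2]` and an arbitrary integer vector `w`, the call `is_jordan_chain_at_zero(perturbed([3, 2], w), chain_from_theorem([3, 2], 1))` should hold regardless of `w`, because the rank one part annihilates every difference $e_k-e_{n_1+k}$.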

Theorem 10.2

(Partial Brunovsky form)

Let

$$A= \bigl({\mathrm{J}}_{n_1}(\widehat{\lambda})^{\oplus \ell _1} \bigr) \oplus\cdots\oplus \bigl({\mathrm{J}}_{n_m}(\widehat{\lambda})^{\oplus \ell_m} \bigr)\oplus\widetilde{A} \in{\mathbb{C}}^{n\times n}, $$

where $n_1>\dots>n_m$ and $\sigma(\widetilde{A})\subseteq\mathbb{C}\setminus\{\widehat{\lambda}\}$. Moreover, let $a=\ell_1n_1+\cdots+\ell_mn_m$ denote the algebraic multiplicity of $\widehat{\lambda}$ and let $B=uv^T$, where $u,v\in\mathbb{C}^{n}$ and

$$v=\left [ \begin{array}{c} v^{(1)}\\ \vdots\\ v^{(m)}\\ \widetilde{v} \end{array} \right ],\quad v^{(i)}= \left [ \begin{array}{c} v^{(i,1)}\\ \vdots\\ v^{(i,\ell_i)} \end{array} \right ],\quad v^{(i,j)}\in \mathbb{C}^{n_i},\quad j=1,\dots,\ell_i,\ i=1,\dots,m. $$

Assume that the first component of each vector $v^{(i,j)}$, $j=1,\dots,\ell_i$, $i=1,\dots,m$, is nonzero. Then the following statements hold:

(1) The matrix $S:= (\bigoplus_{j=1}^{\ell _{1}}\operatorname{Toep}(v^{(1,j)})\oplus\cdots\oplus \bigoplus_{j=1}^{\ell_{m}}\operatorname{Toep}(v^{(m,j)}) )^{-1} \oplus I_{n-a}$ exists and satisfies
$$S^{-1}AS=A,\quad S^{-1}BS=w \bigl[\underbrace{e_{1,n_1}^T, \dots ,e_{1,n_1}^T}_{\ell_1~\mathit{times}},\dots, \underbrace{e_{1,n_m}^T,\dots,e_{1,n_m}^T}_{\ell_m~\mathit{times}},z^T \bigr], $$
where $w=S^{-1}u$ and $z\in\mathbb{C}^{n-a}$ is some appropriate vector.
(2) The matrix $S^{-1}(A+B)S$ has at least $\ell_1+\dots+\ell_m-1$ Jordan chains associated with $\widehat{\lambda}$ given as follows:
(a) $\ell_1-1$ Jordan chains of length at least $n_1$:
  $$\begin{array}{l@{\quad}l@{\quad}l} e_1-e_{n_1+1},&\dots,&e_{n_1}-e_{2n_1};\\ \vdots&\ddots&\vdots\\ e_1-e_{(\ell_1-1)n_1+1},&\dots,&e_{n_1}-e_{\ell_1n_1};\\ \end{array} $$
(b) $\ell_i$ Jordan chains of length at least $n_i$ for $i=2,\dots,m$:
  $$\begin{array}{l@{\quad}l@{\quad}l} e_1-e_{\ell_1n_1+\dots+\ell_{i-1}n_{i-1}+1},&\dots,&e_{n_i}-e_{\ell _1n_1+\dots+\ell_{i-1}n_{i-1}+n_i};\\ e_1-e_{\ell_1n_1+\dots+\ell_{i-1}n_{i-1}+n_i+1},&\dots ,&e_{n_i}-e_{\ell_1n_1+\dots+\ell_{i-1}n_{i-1}+2n_i};\\ \vdots&\ddots&\vdots\\ e_1-e_{\ell_1n_1+\dots+\ell_{i-1}n_{i-1}+(\ell_i-1)n_i+1},&\dots,&e_{n_i}- e_{\ell_1n_1+\dots+\ell_{i-1}n_{i-1}+\ell_in_i}.\\ \end{array} $$
(3) Partition $w=S^{-1}u$ as
$$w=\left [ \begin{array}{c} w^{(1)}\\ \vdots\\ w^{(m)}\\ \widetilde{w} \end{array} \right ],\qquad w^{(i)}= \left [ \begin{array}{c} w^{(i,1)}\\ \vdots\\ w^{(i,\ell_i)} \end{array} \right ],\qquad w^{(i,j)}= \left [ \begin{array}{c} w^{(i,j)}_1\\ \vdots\\ w^{(i,j)}_{n_i} \end{array} \right ]\in\mathbb{C}^{n_i}, $$
and let $\lambda_1,\dots,\lambda_q$ be the pairwise distinct eigenvalues of $A$ different from $\widehat{\lambda}$ having the algebraic multiplicities $r_1,\dots,r_q$, respectively. Set $\mu_{i}=\lambda_{i}-\widehat{\lambda}$, $i=1,2,\dots,q$.

Then the characteristic polynomial $p_{\widehat{\lambda}}$ of $A+B-\widehat{\lambda}I$ is given by
$$\begin{aligned} p_{\widehat{\lambda}}(\lambda) =&(-\lambda)^aq(\lambda)+ \Biggl(\prod _{i=1}^q(\mu_{i}-\lambda )^{r_i} \Biggr) \\ &{}\cdot \Biggl((-\lambda)^a+(-1)^{a-1} \sum _{i=1}^m\sum_{j=1}^{\ell_i} \sum_{k=1}^{n_i}w^{(i,j)}_{k} \lambda ^{a-k} \Biggr), \end{aligned}$$
where $q(\lambda)$ is some polynomial.
(4) Write $p_{\widehat{\lambda}}(\lambda)=c_{n}\lambda^{n}+\cdots +c_{a-n_{1}+1}\lambda^{a-n_{1}+1}+c_{a-n_{1}}\lambda^{a-n_{1}}$. Then
$$c_{a-n_1}=(-1)^{a-1} \Biggl(\prod_{i=1}^q \mu_{i}^{r_i} \Biggr) \Biggl(\sum _{j=1}^{\ell_1}w^{(1,j)}_{n_1} \Biggr); $$
and in the case $n_1>1$ we have in addition that
$$\begin{aligned} c_{a-n_1+1} =&(-1)^a \Biggl(\sum_{\nu=1}^qr_{\nu} \mu_\nu^{r_\nu-1} \prod_{\genfrac{}{}{0pt}{2}{i=1}{i\neq\nu}}^q \mu_{i}^{r_i} \Biggr) \Biggl(\sum _{j=1}^{\ell_1}w^{(1,j)}_{n_1} \Biggr) \\ &{}+(-1)^{a-1} \Biggl(\prod_{i=1}^q \mu_{i}^{r_i} \Biggr) \Biggl(\sum _{j=1}^{\ell_1}w^{(1,j)}_{n_1-1} \Biggr), \end{aligned}$$
if $n_1-1>n_2$ or, if $n_1-1=n_2$, then
$$\begin{aligned} c_{a-n_1+1} =&(-1)^a \Biggl(\sum_{\nu=1}^qr_{\nu} \mu_\nu^{r_\nu-1} \prod_{\genfrac{}{}{0pt}{2}{i=1}{i\neq\nu}}^q \mu_{i}^{r_i} \Biggr) \Biggl(\sum _{j=1}^{\ell_1}w^{(1,j)}_{n_1} \Biggr) \\ &{}+(-1)^{a-1} \Biggl(\prod_{i=1}^q \mu_{i}^{r_i} \Biggr) \Biggl(\sum _{j=1}^{\ell_1}w^{(1,j)}_{n_1-1}+\sum _{j=1}^{\ell _2}w^{(2,j)}_{n_2} \Biggr). \end{aligned}$$
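The formula in statement (3) can be sanity-checked numerically in a special case. The sketch below is a hedged illustration, not the authors' method. It assumes $\widehat{\lambda}=0$ and that $A$ has no other eigenvalues, so $n=a$ and the product over the $\mu_i$ is empty. It compares $\det(M-tI)$ for the already transformed matrix $M=A+w[e_1^T,\dots,e_1^T]$ with the bracketed factor $(-t)^a+(-1)^{a-1}\sum w^{(i,j)}_k t^{a-k}$ alone, and the agreement in this case is an observation of the sketch rather than a claim from the source. The function names are hypothetical.

```python
def jordan_plus_rank_one(sizes, w):
    """Nilpotent Jordan blocks of the given sizes plus w * [e_1^T, ..., e_1^T]."""
    n = sum(sizes)
    M = [[0] * n for _ in range(n)]
    start = 0
    for size in sizes:
        for k in range(size - 1):
            M[start + k][start + k + 1] = 1   # superdiagonal of the block
        for i in range(n):
            M[i][start] += w[i]               # rank one part hits column `start`
        start += size
    return M


def det(M):
    """Determinant by cofactor expansion along the first row (small sizes only)."""
    if not M:
        return 1
    total = 0
    for j, entry in enumerate(M[0]):
        if entry != 0:
            minor = [row[:j] + row[j + 1:] for row in M[1:]]
            total += (-1) ** j * entry * det(minor)
    return total


def char_poly_at(M, t):
    """Value of det(M - t I)."""
    n = len(M)
    shifted = [[M[i][j] - (t if i == j else 0) for j in range(n)] for i in range(n)]
    return det(shifted)


def formula_at(sizes, w, t):
    """(-t)^a + (-1)^(a-1) * sum over blocks and k of w_k t^(a-k)."""
    a = sum(sizes)
    total = 0
    start = 0
    for size in sizes:
        for k in range(1, size + 1):
            total += w[start + k - 1] * t ** (a - k)
        start += size
    return (-t) ** a + (-1) ** (a - 1) * total
```

Since both sides are polynomials of degree $a$ in $t$, agreement at more than $a$ integer points confirms that they coincide for the chosen `sizes` and `w`.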

The following notion of linear combinations of Jordan chains will be necessary.

Definition 10.1

Let $A\in{\mathbb{C}}^{n\times n}$ and let $X=(x_1,\dots,x_p)$ and $Y=(y_1,\dots,y_q)$ be two Jordan chains of $A$ associated with the same eigenvalue $\widehat{\lambda}$ of (possibly different) lengths $p$ and $q$. Then the sum $X+Y$ of $X$ and $Y$ is defined to be the chain $Z=(z_1,\dots,z_{\max(p,q)})$, where

$$z_j=\left \{ \begin{array}{l@{\quad}l} x_j&\mbox{if $p\geq q$},\\ y_j&\mbox{if $p<q$}, \end{array} \right .\quad j=1,\dots,|p-q| $$

and

$$z_j=\left \{ \begin{array}{l@{\quad}l} x_j+y_{j-p+q}&\mbox{if $p\geq q$},\\ y_j+x_{j-q+p}&\mbox{if $p<q$}, \end{array} \right .\quad j=|p-q|+1,\dots,\max(p,q). $$

To illustrate this construction, consider e.g. $X=(x_1,x_2,x_3,x_4)$ and $Y=(y_1,y_2)$; then $X+Y=(x_1,x_2,x_3+y_1,x_4+y_2)$.

It is straightforward to check that the sum Z=X+Y of two Jordan chains associated with an eigenvalue $\widehat{\lambda}$ is again a Jordan chain associated with $\widehat{\lambda}$ of the given matrix A, but it should be noted that this sum is not commutative.
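Definition 10.1 translates directly into code. The following Python sketch is an illustration under stated assumptions: chains are represented as lists of vectors, the eigenvalue is taken to be $0$, and the names `chain_sum`, `unit`, `matvec`, and `is_jordan_chain_at_zero` are hypothetical. It reproduces the example $X+Y=(x_1,x_2,x_3+y_1,x_4+y_2)$ for $A=\mathrm{J}_4(0)\oplus\mathrm{J}_2(0)$.

```python
def chain_sum(X, Y):
    """Sum of two Jordan chains in the sense of Definition 10.1.

    The shorter chain is aligned with the end of the longer one.
    """
    p, q = len(X), len(Y)
    longer, shorter = (X, Y) if p >= q else (Y, X)
    shift = abs(p - q)
    Z = [list(v) for v in longer[:shift]]
    for j in range(shift, max(p, q)):
        Z.append([a + b for a, b in zip(longer[j], shorter[j - shift])])
    return Z


def unit(n, k):
    """k-th standard basis vector of length n (0-based index)."""
    v = [0] * n
    v[k] = 1
    return v


def matvec(M, x):
    """Matrix-vector product for lists of lists."""
    return [sum(m * xi for m, xi in zip(row, x)) for row in M]


def is_jordan_chain_at_zero(A, chain):
    """Check A z_1 = 0, A z_k = z_{k-1} for k >= 2, and z_1 != 0."""
    zero = [0] * len(A)
    previous = zero
    for z in chain:
        if matvec(A, z) != previous:
            return False
        previous = z
    return chain[0] != zero


# A = J_4(0) (+) J_2(0): ones on the superdiagonal inside each block.
A = [[0] * 6 for _ in range(6)]
for i in (0, 1, 2, 4):
    A[i][i + 1] = 1

X = [unit(6, k) for k in range(4)]       # Jordan chain e_1, ..., e_4
Y = [unit(6, k) for k in range(4, 6)]    # Jordan chain e_5, e_6
Z = chain_sum(X, Y)                      # (x_1, x_2, x_3 + y_1, x_4 + y_2)
```

Aligning the shorter chain with the end of the longer one is what keeps the relation $(A-\widehat{\lambda}I)z_j=z_{j-1}$ intact at the position where the second chain starts.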

Proof of Theorem 5.3

Let $\tau\in\mathbb{F}\setminus\{0\}$ be arbitrary. We may assume without loss of generality that A and G are already in the forms (5.5) and (5.6). Furthermore, we may assume $\widehat{\lambda}=0$, otherwise consider the matrix $A-\widehat{\lambda}I$ instead of A. Then the algebraic and geometric multiplicity a and γ of the eigenvalue zero of A are given by

$$a=\sum_{s=1}^m\ell_sn_s, \qquad\gamma=\sum_{s=1}^m \ell_s, $$

respectively. Let us partition u conformably with the forms (5.5) and (5.6), i.e., we let

$$u=\left [ \begin{array}{c} u^{(1)}\\ \vdots\\ u^{(m)}\\ \widetilde{u} \end{array} \right ],\qquad u^{(i)}= \left [ \begin{array}{c} u^{(i,1)}\\ \vdots\\ u^{(i,\ell_i)} \end{array} \right ],\qquad u^{(i,j)}= \left [ \begin{array}{c} u^{(i,j)}_1\\ \vdots\\ u^{(i,j)}_{n_i} \end{array} \right ]\in\mathbb{C}^{n_i}, $$

for $j=1,\dots,\ell_i$; $i=1,\dots,m$. Thus, $\widetilde{u}\in \mathbb{C}^{n-a}$. Then the vector $v^T=u^TG$ has the following structure:

$$v\,{=}\,\bigl(u^TG\bigr)^T\,{=}\,G^Tu\,{=}\,\left [ \begin{array}{c} v^{(1)}\\ \vdots\\ v^{(m)}\\ \widetilde{v} \end{array} \right ],\qquad v^{(i)}\,{=}\,\left [ \begin{array}{c} v^{(i,1)}\\ \vdots\\ v^{(i,\ell_i)} \end{array} \right ], \qquad v^{(i,j)}\,{=}\,\left [ \begin{array}{c} v^{(i,j)}_1\\ \vdots\\ v^{(i,j)}_{n_i} \end{array} \right ] \in\mathbb{C}^{n_i}, $$

for $j=1,\dots,\ell_i$ and $i=1,\dots,m$, where

$$v^{(1,2s-1)}=\bigl(G^{(1,2s)}\bigr)^Tu^{(1,2s)}= \left [ \begin{array}{c} g_{n_1,1}^{(1,2s)}u^{(1,2s)}_{n_1}\\ g_{n_1-1,2}^{(1,2s)}u^{(1,2s)}_{n_1-1}+g_{n_1,2}^{(1,2s)}u^{(1,2s)}_{n_1}\\ \ast\\ \vdots\\ \ast \end{array} \right ] $$

and

$$v^{(1,2s)}=\bigl(G^{(1,2s-1)}\bigr)^Tu^{(1,2s-1)}= \left [ \begin{array}{c} g_{n_1,1}^{(1,2s-1)}u^{(1,2s-1)}_{n_1}\\ g_{n_1-1,2}^{(1,2s-1)}u^{(1,2s-1)}_{n_1-1}+g_{n_1,2}^{(1,2s-1)}u^{(1,2s-1)}_{n_1}\\ \ast\\ \vdots\\ \ast \end{array} \right ] $$

for $s=1,\dots,\ell_1/2$. Generically, the hypothesis of Theorem 10.2 is satisfied, i.e., the first entries of the vectors $v^{(i,j)}$ are nonzero. Thus, generically the matrix $S$ as in Theorem 10.2 exists so that $S^{-1}(A+\tau B)S$ is in partial Brunovsky form. In fact, $S^{-1}$ takes the form

$$S^{-1}= \Biggl(\bigoplus_{j=1}^{\ell_1} \operatorname{Toep} \bigl(v^{(1,j)}\bigr) \Biggr)\oplus\cdots\oplus \Biggl( \bigoplus_{j=1}^{\ell_m}\operatorname{Toep} \bigl(v^{(m,j)}\bigr) \Biggr)\oplus I_{n-a}, $$

and it follows that

$$ S^{-1}\tau BS=w\bigl(\underbrace{e_{1,n_1}^T, \dots,e_{1,n_1}^T}_{{\ell_1~\mathrm{times}}},\dots, \underbrace{e_{1,n_m}^T,\dots,e_{1,n_m}^T}_{{\ell_m~\mathrm{times}}},z^T \bigr) $$

(10.4)

for some $z\in\mathbb{C}^{n-a}$, where $w=\tau S^{-1}u$. Thus,

$$ w=\tau S^{-1}u=\left [ \begin{array}{c} w^{(1)}\\ \vdots\\ w^{(m)}\\ \widetilde{w} \end{array} \right ],\qquad w^{(i)}=\left [ \begin{array}{c}w^{(i,1)}\\ \vdots\\ w^{(i,\ell_i)} \end{array} \right ],\qquad w^{(i,j)}=\left [ \begin{array}{c}w^{(i,j)}_1\\ \vdots\\ w^{(i,j)}_{n_i} \end{array} \right ]\in\mathbb{C}^{n_i}, $$

(10.5)

for $j=1,\dots,\ell_i$ and $i=1,\dots,m$, where

$$ \begin{aligned} w^{(1,2s-1)}_{n_1}&=\tau g_{n_1,1}^{(1,2s)}u^{(1,2s)}_{n_1}u^{(1,2s-1)}_{n_1}, \\ w^{(1,2s)}_{n_1}&=\tau g_{n_1,1}^{(1,2s-1)}u^{(1,2s-1)}_{n_1}u^{(1,2s)}_{n_1}. \end{aligned} $$

(10.6)

Thus, using hypothesis (2a) we obtain $w^{(1,2s)}_{n_{1}}=-w^{(1,2s-1)}_{n_{1}}$. Furthermore,

$$\begin{aligned} w^{(1,2s-1)}_{n_1-1} =&\tau g_{n_1,1}^{(1,2s)}u^{(1,2s)}_{n_1}u^{(1,2s-1)}_{n_1-1} +\tau g_{n_1-1,2}^{(1,2s)}u^{(1,2s)}_{n_1-1}u^{(1,2s-1)}_{n_1} \\ &{}+\tau g_{n_1,2}^{(1,2s)}u^{(1,2s)}_{n_1}u^{(1,2s-1)}_{n_1}, \\ w^{(1,2s)}_{n_1-1} =&\tau g_{n_1,1}^{(1,2s-1)}u^{(1,2s-1)}_{n_1}u^{(1,2s)}_{n_1-1} +\tau g_{n_1-1,2}^{(1,2s-1)}u^{(1,2s-1)}_{n_1-1}u^{(1,2s)}_{n_1} \\ &{}+\tau g_{n_1,2}^{(1,2s-1)}u^{(1,2s-1)}_{n_1}u^{(1,2s)}_{n_1} \end{aligned}$$

for $s=1,\dots,\ell_1/2$, provided that $n_1>1$. This implies that

$$\begin{aligned} w^{(1,2s-1)}_{n_1-1}+w^{(1,2s)}_{n_1-1} =&\tau \bigl(g_{n_1,1}^{(1,2s)}+g_{n_1-1,2}^{(1,2s-1)} \bigr)u^{(1,2s)}_{n_1}u^{(1,2s-1)}_{n_1-1} \\ &{}+\tau\bigl(g_{n_1-1,2}^{(1,2s)}+g_{n_1,1}^{(1,2s-1)} \bigr)u^{(1,2s)}_{n_1-1}u^{(1,2s-1)}_{n_1} \\ &{}+\tau\bigl(g_{n_1,2}^{(1,2s)}+g_{n_1,2}^{(1,2s-1)} \bigr)u^{(1,2s-1)}_{n_1}u^{(1,2s)}_{n_1} \end{aligned}$$

(10.7)

which, by the hypothesis (2b), is generically nonzero.

We will now show in two steps that generically $A+\tau B$ has the Jordan canonical form (5.7). By Theorem 10.2 we know that generically $A+\tau B$ has $\ell_1-1$ Jordan chains of length $n_1$ and $\ell_j$ Jordan chains of length $n_j$, $j=2,\dots,m$, associated with the eigenvalue zero. (These chains are linearly independent but need not yet form a basis of the corresponding root subspace of $A+\tau B$, as it may be possible to extend some of the chains.) In the first step, we will show that generically there exists a Jordan chain of length $n_1+1$. In the second step, we will show that the algebraic multiplicity of the eigenvalue zero of $A+\tau B$ generically is $\widetilde{a}=(\sum_{s=1}^{m}\ell _{s}n_{s})-n_{1}+1=a-n_{1}+1$. Both steps together imply that (5.7) represents the only possible Jordan canonical form for $A+\tau B$.

Step 1: Existence of a Jordan chain of length $n_1+1$.

Consider the following Jordan chains of $S^{-1}(A+\tau B)S$ associated with the eigenvalue zero and denoted by $C_{1,s}$ and $C_{i,j}$, respectively:

$$\begin{aligned} &\mbox{length } n_1:\quad C_{1,s}:\quad e_{2(s-1)n_1+1}-e_{(2s-1)n_1+1},\dots,e_{(2s-1)n_1}-e_{2sn_1}, \\ &\quad{} s=1,\dots,\frac{\ell_1}{2} \\ &\mbox{length } n_i:\quad C_{i,j}:\quad -e_{1}+e_{\varSigma_{k=1}^{i-1}\ell_kn_k+(j-1)n_i+1},\dots,-e_{n_i}+e_{\varSigma_{k=1}^{i-1} \ell_kn_k+jn_i}, \\ &\quad{} j=1,\dots,\ell_i, \end{aligned}$$

where $i=2,\dots,m$. Observe that $C_{i,j}$, $i\neq1$, are just the Jordan chains from Theorem 10.2 multiplied by $-1$, while the chains $C_{1,s}$ are linear combinations of the Jordan chains from Theorem 10.2. Namely, in the notation of (10.3), and numbering the chains in (10.3) first, second, etc., from the top to the bottom, we see that the chains $C_{1,1}, \ldots, C_{1, \ell_{1}/2}$ are the negative of the second chain plus the first chain, the negative of the fourth chain plus the third chain, …, the negative of the $(\ell_1-1)$-th chain plus the $(\ell_1)$-th chain, respectively. Now consider the Jordan chain

$$C:= \Biggl(\sum_{s=1}^{\ell_1/2} \alpha_{1,s}C_{1,s} \Biggr)+\sum_{i=2}^m \sum_{j=1}^{\ell_i}\alpha_{i,j}C_{i,j} $$

of length $n_1$ (see Definition 10.1), and let $y$ denote the $n_1$-th (and thus last) vector of this chain. We next show that the Jordan chain $C$ can be extended by a certain vector to a Jordan chain of length $n_1+1$ associated with the eigenvalue zero, for some particular choice of the parameters $\alpha_{i,s}$ (depending on $u$) such that generically at least one of $\alpha_{1,1}, \ldots, \alpha_{1,\ell_{1}/2}$ is nonzero. To see this, we have to show that $y$ is in the range of $S^{-1}(A+\tau B)S$. First, partition

$$y=\left [ \begin{array}{c} y^{(1)}\\ \vdots\\ y^{(m)}\\ \widetilde{y} \end{array} \right ],\qquad y^{(i)}= \left [ \begin{array}{c} y^{(i,1)}\\ \vdots\\ y^{(i,\ell_i)} \end{array} \right ],\qquad y^{(i,j)}= \left [ \begin{array}{c} y^{(i,j)}_1\\ \vdots\\ y^{(i,j)}_{n_i} \end{array} \right ]\in\mathbb{C}^{n_i}, $$

for $j=1,\dots,\ell_i$; $i=1,\dots,m$. Then by the definition of $y$, we have $\widetilde{y}=0\in\mathbb{C}^{n-a}$,

$$\begin{aligned} &y^{(1,2s-1)}_{n_1}=\alpha_{1,s},\quad y^{(1,2s)}_{n_1}= -\alpha_{1,s},\quad s=1,\dots, \ell_1/2, \\ &y^{(i,j)}_{n_i}=\alpha_{i,j},\quad j=1,\dots, \ell_i; \ i=2,\dots,m. \end{aligned}$$

We have to solve the linear system

$$ S^{-1}(A+\tau B)Sx=y. $$

(10.8)

Partitioning

$$x=\left [ \begin{array}{c} x^{(1)}\\ \vdots\\ x^{(m)}\\ \widetilde{x} \end{array} \right ],\qquad x^{(i)}= \left [ \begin{array}{c} x^{(i,1)}\\ \vdots\\ x^{(i,\ell_i)} \end{array} \right ],\qquad x^{(i,j)}= \left [ \begin{array}{c}x^{(i,j)}_1\\ \vdots\\ x^{(i,j)}_{n_i} \end{array} \right ]\in\mathbb{C}^{n_i}, $$

and making the ansatz $\widetilde{x}=0$, the system (10.8) becomes (here we use (10.4) and (10.5)):

$$\begin{aligned} &w^{(i,j)}_{k} \Biggl(\sum_{\nu=1}^m \sum_{\mu=1}^{\ell_\nu}x^{(\nu ,\mu)}_{1} \Biggr)+ x^{(i,j)}_{k+1}=y^{(i,j)}_{k}, \\ &\quad{}k=1,\ldots,n_i-1; j=1,\ldots,\ell_i; i=1, \ldots,m, \end{aligned}$$

(10.9)

$$\begin{aligned} &w^{(i,j)}_{n_i} \Biggl(\sum_{\nu=1}^m \sum_{\mu=1}^{\ell_\nu }x^{(\nu,\mu)}_{1} \Biggr)=\alpha_{i,j},\quad j=1,\ldots,\ell_i; i=2, \ldots,m, \end{aligned}$$

(10.10)

$$\begin{aligned} &w^{(1,2s-1)}_{n_1} \Biggl(\sum_{\nu=1}^m \sum_{\mu=1}^{\ell_\nu}x^{(\nu,\mu)}_{1} \Biggr)=\alpha_{1,s},\quad s=1,\ldots,\ell_1/2, \end{aligned}$$

(10.11)

$$\begin{aligned} &w^{(1,2s)}_{n_1} \Biggl(\sum_{\nu=1}^m \sum_{\mu=1}^{\ell_\nu}x^{(\nu,\mu)}_{1} \Biggr)=-\alpha_{1,s},\quad s=1,\ldots,\ell_1/2. \end{aligned}$$

(10.12)

Set $x^{(1,1)}_{1}=1$ and $x^{(\nu,\mu)}_{1}=0$ for $\mu=1,\dots,\ell_\nu$; $\nu=1,\dots,m$; $(\nu,\mu)\neq(1,1)$, as well as $\alpha_{i,j}=w^{(i,j)}_{n_{i}}$ for $j=1,\dots,\ell_i$; $i=2,\dots,m$ and $\alpha_{1,s}=w^{(1,2s-1)}_{n_{1}}$ for $s=1,\dots,\ell_1/2$. Then (10.10) and (10.11) are satisfied and so is (10.12), because $w^{(1,2s)}_{n_{1}}=-w^{(1,2s-1)}_{n_{1}}$ by (10.6). Finally, (10.9) can be solved by choosing $x^{(i,j)}_{k+1}=y^{(i,j)}_{k}-w^{(i,j)}_{k}$ for $k=1,\dots,n_i-1$; $j=1,\dots,\ell_i$; $i=1,\dots,m$.
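The construction of Step 1 can be traced on a small example. The Python sketch below is a simplified illustration, not the general proof: it assumes $m=1$ (a single block size $n_1$ with an even number $\ell_1$ of blocks) and no eigenvalues other than zero, and it takes the vector $w$ as given with $w^{(1,2s)}_{n_1}=-w^{(1,2s-1)}_{n_1}$ imposed by hand instead of deriving it from $G$ and $u$. The names `transformed_matrix`, `matvec`, and `step1_chain` are hypothetical.

```python
def transformed_matrix(n1, ell1, w):
    """ell1 nilpotent Jordan blocks of size n1 plus w * [e_1^T, ..., e_1^T]."""
    n = n1 * ell1
    M = [[0] * n for _ in range(n)]
    for j in range(ell1):
        start = j * n1
        for k in range(n1 - 1):
            M[start + k][start + k + 1] = 1
        for i in range(n):
            M[i][start] += w[i]
    return M


def matvec(M, x):
    """Matrix-vector product for lists of lists."""
    return [sum(m * xi for m, xi in zip(row, x)) for row in M]


def step1_chain(n1, ell1, w):
    """Chain C = sum_s alpha_{1,s} C_{1,s}, extended by the vector x of Step 1."""
    n = n1 * ell1
    # alpha_{1,s} = last entry of the odd-numbered block 2s-1 (1-based s).
    alphas = [w[2 * s * n1 + n1 - 1] for s in range(ell1 // 2)]
    chain = []
    for k in range(n1):
        c = [0] * n
        for s, alpha in enumerate(alphas):
            c[2 * s * n1 + k] += alpha          # entry k of block 2s-1 (1-based s)
            c[(2 * s + 1) * n1 + k] -= alpha    # entry k of block 2s (1-based s)
        chain.append(c)
    y = chain[-1]
    x = [0] * n
    x[0] = 1                                    # x^{(1,1)}_1 = 1, other first entries 0
    for j in range(ell1):
        for k in range(n1 - 1):
            # x^{(1,j)}_{k+1} = y^{(1,j)}_k - w^{(1,j)}_k
            x[j * n1 + k + 1] = y[j * n1 + k] - w[j * n1 + k]
    return chain + [x]
```

For instance, with `n1 = 3`, `ell1 = 2`, and `w = [2, 5, 7, -3, 4, -7]`, the returned list has `n1 + 1` vectors, the first is mapped to zero by `transformed_matrix(3, 2, w)`, and each later vector is mapped to its predecessor.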

Step 2: We show that the algebraic multiplicity of the eigenvalue zero of A+τB generically is $\widetilde{a}=(\sum_{s=1}^{m}\ell _{s}n_{s})-n_{1}+1=a-n_{1}+1$.

Let $\mu_1,\dots,\mu_q$ be the pairwise distinct nonzero eigenvalues of $A$ and let $r_1,\dots,r_q$ be their algebraic multiplicities. Denote by $p_0(\lambda)$ the characteristic polynomial of $A+\tau B$. By Theorem 10.2, the lowest possible power of $\lambda$ associated with a nonzero coefficient in $p_0(\lambda)$ is $a-n_1$ and the corresponding coefficient $c_{a-n_{1}}$ is

$$c_{a-n_1}=(-1)^{a-1} \Biggl(\prod_{i=1}^q \mu_{i}^{r_i} \Biggr) \Biggl(\sum _{j=1}^{\ell_1}w^{(1,j)}_{n_1} \Biggr)=0, $$

because of (10.6). If $n_1=1$ then $\widetilde{a}=a$ and there is nothing to show as the algebraic multiplicity of the eigenvalue zero cannot increase when a generic perturbation is applied. Otherwise, we distinguish the two cases $n_2<n_1-1$ and $n_2=n_1-1$. If $n_2<n_1-1$, then by Theorem 10.2 the coefficient $c_{a-n_{1}+1}$ of $\lambda^{a-n_{1}+1}$ in $p_0(\lambda)$ is

$$\begin{aligned} c_{a-n_1+1} =&(-1)^a \Biggl(\sum _{\nu=1}^qr_{\nu}\mu_\nu^{r_\nu-1} \prod_{\genfrac{}{}{0pt}{1}{i=1}{i\neq\nu}}^q\mu_{i}^{r_i} \Biggr) \Biggl(\sum_{j=1}^{\ell_1}w^{(1,j)}_{n_1} \Biggr) \\ &{}+(-1)^{a-1} \Biggl(\prod_{i=1}^q \mu_{i}^{r_i} \Biggr) \Biggl(\sum _{j=1}^{\ell_1}w^{(1,j)}_{n_1-1} \Biggr) \\ =&(-1)^{a-1} \Biggl(\prod_{i=1}^q \mu_{i}^{r_i} \Biggr) \Biggl(\sum _{j=1}^{\ell_1}w^{(1,j)}_{n_1-1} \Biggr) \\ =&(-1)^{a-1} \Biggl(\prod_{i=1}^q \mu_{i}^{r_i} \Biggr) \Biggl(\sum _{s=1}^{\ell_1/2} \bigl(\tau\bigl(g_{n_1,1}^{(1,2s)}+g_{n_1-1,2}^{(1,2s-1)} \bigr)u^{(1,2s)}_{n_1}u^{(1,2s-1)}_{n_1-1} \\ &{}+\tau\bigl(g_{n_1-1,2}^{(1,2s)}+g_{n_1,1}^{(1,2s-1)} \bigr)u^{(1,2s)}_{n_1-1}u^{(1,2s-1)}_{n_1} \\ &{}+\tau\bigl(g_{n_1,2}^{(1,2s)}+g_{n_1,2}^{(1,2s-1)} \bigr)u^{(1,2s-1)}_{n_1}u^{(1,2s)}_{n_1} \bigr) \Biggr) \end{aligned}$$

by (10.7), where we have used (10.6) to conclude that $\sum_{j=1}^{\ell_{1}}w^{(1,j)}_{n_{1}}=0$ in the second equation. By the hypothesis (2b), it follows that $c_{a-n_{1}+1}$ generically is nonzero. If, on the other hand, $n_2=n_1-1$, then again by Theorem 10.2 (and using (10.6)) the coefficient $c_{a-n_{1}+1}$ of $\lambda^{a-n_{1}+1}$ in $p_0(\lambda)$ is

$$c_{a-n_1+1}=(-1)^{a-1} \Biggl(\prod_{i=1}^q \mu_{i}^{r_i} \Biggr) \Biggl(\sum _{s=1}^{\ell_1}w^{(1,s)}_{n_1-1} +\sum _{j=1}^{\ell_2}w^{(2,j)}_{n_2} \Biggr), $$

so in comparison to the case $n_2<n_1-1$, there is an extra term in $c_{a-n_{1}+1}$ depending on $w^{(2,j)}_{n_{2}}$, $j=1,\dots,\ell_2$. However, each entry $w^{(2,j)}_{n_{2}}$ only depends on the entries of the vectors $u^{(2,s)}$, $s=1,\dots,\ell_2$, so still $c_{a-n_{1}+1}$ is nonzero generically. In all cases, we have shown that zero is a root of $p_0(\lambda)$ with multiplicity $a-n_1+1$. Thus, the algebraic multiplicity of the eigenvalue zero of $A+\tau B$ is $a-n_1+1$. Together with Step 1, we obtain that (5.7) generically is the only possible Jordan canonical form of $A+\tau B$. □
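The multiplicity count of Step 2 can also be observed numerically. The following Python sketch is an illustration under simplifying assumptions, not the argument of the paper: it takes $m=1$, no eigenvalues other than zero, and a transformed perturbation vector $w$ chosen by hand. It computes the coefficients of $\det(tI-M)$ exactly with the Faddeev-LeVerrier recursion (a standard technique swapped in here for convenience) and reads off the multiplicity of the root zero. The names `transformed_matrix`, `matmul`, `char_poly_coeffs`, and `zero_multiplicity` are hypothetical.

```python
from fractions import Fraction


def transformed_matrix(sizes, w):
    """Nilpotent Jordan blocks of the given sizes plus w * [e_1^T, ..., e_1^T]."""
    n = sum(sizes)
    M = [[0] * n for _ in range(n)]
    start = 0
    for size in sizes:
        for k in range(size - 1):
            M[start + k][start + k + 1] = 1
        for i in range(n):
            M[i][start] += w[i]
        start += size
    return M


def matmul(P, Q):
    """Product of two square matrices given as lists of lists."""
    n = len(P)
    return [[sum(P[i][k] * Q[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def char_poly_coeffs(M):
    """Coefficients c_0, ..., c_n of det(t I - M) via Faddeev-LeVerrier."""
    n = len(M)
    coeffs = [Fraction(0)] * (n + 1)
    coeffs[n] = Fraction(1)
    N = [[Fraction(0)] * n for _ in range(n)]   # auxiliary matrix M_0 = 0
    for k in range(1, n + 1):
        # M_k = M * M_{k-1} + c_{n-k+1} * I
        N = matmul(M, N)
        for i in range(n):
            N[i][i] += coeffs[n - k + 1]
        # c_{n-k} = -trace(M * M_k) / k
        MN = matmul(M, N)
        coeffs[n - k] = -sum(MN[i][i] for i in range(n)) / k
    return coeffs


def zero_multiplicity(M):
    """Multiplicity of 0 as a root of the characteristic polynomial of M."""
    coeffs = char_poly_coeffs(M)
    return next(k for k, c in enumerate(coeffs) if c != 0)
```

With `sizes = [2, 2]` (so $a=4$, $n_1=2$), a vector such as `w = [3, -4, 5, 4]` satisfies $w^{(1,2)}_{n_1}=-w^{(1,1)}_{n_1}$ and has $w^{(1,1)}_{n_1-1}+w^{(1,2)}_{n_1-1}\neq0$, so the multiplicity should come out as $a-n_1+1$. Changing the last entry so that the sign relation fails should lower it to $a-n_1$, which is the unstructured generic behavior.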

About this article

Cite this article

Mehl, C., Mehrmann, V., Ran, A.C.M. et al. Eigenvalue perturbation theory of symplectic, orthogonal, and unitary matrices under generic structured rank one perturbations. Bit Numer Math 54, 219–255 (2014). https://doi.org/10.1007/s10543-013-0451-3

Received: 27 February 2013
Accepted: 25 September 2013
Published: 15 October 2013
Issue Date: March 2014
DOI: https://doi.org/10.1007/s10543-013-0451-3
