1 Introduction

Our interest is in solving symmetric saddle point systems of equations of the form

$$ \mathcal{A} \left( \begin{array}{cc} u \\ v \end{array}\right) = \left( \begin{array}{cc} H & B^{T} \\ B & -C \end{array}\right) \left( \begin{array}{cc} u \\ v \end{array}\right) = \left( \begin{array}{cc} f \\ g \end{array}\right), $$
(1)

in which \(H\in \mathbb {R}^{n \times n}\) is large, sparse and symmetric positive semidefinite (SPSD), \(B\in \mathbb {R}^{k \times n}\) (n > k) and \(C\in \mathbb {R}^{k \times k}\) is SPSD. Our focus is on k ≪ n (the saddle point matrix may then be referred to as a “bordered” matrix) and the “wide” matrix B may contain one or more dense rows. Such systems arise in a range of scientific applications, including the finite element approximation of PDEs in which a constraint is enforced on a subset of unknowns via a global scalar Lagrange multiplier, numerical continuation methods for large nonlinear systems of equations, electronic circuit simulation, constrained optimization, and linear least squares problems (see the survey of Benzi, Golub and Liesen [1], which describes a wide range of real-life problems that depend on the solution of saddle point systems; see also [2, 3]). We are particularly concerned with large-scale least squares problems that have a few dense rows. They can be expressed in the form (1) and provide an important motivation for the null-space approach proposed in this paper. In this application, the (1, 1) block can be ill conditioned or rank deficient.

There has been considerable work over many years and in different research areas into null-space methods for saddle point problems with a zero (2, 2) block. The basic idea is to characterize the null space of the constraint (off-diagonal) blocks and use that characterization to reduce the system to two smaller linear systems of order (n − k) × (n − k) and k × k that have nice properties and are thus straightforward to solve. Consequently, applications of null-space methods are usually geared towards problems for which n − k is small. An attractive feature of null-space methods that widens their applicability is that the (1, 1) block H need not be invertible. However, a key problem is the need to construct a basis for the null space \(\mathcal {N}(B)\) of the matrix B. In general, this is challenging, particularly if it is desirable that the matrix whose columns form a basis for \(\mathcal {N}(B)\) has additional properties, such as bandedness, sparsity or orthogonality. Moreover, the classical null-space method is restricted to C = 0.

The objectives of this paper are twofold. Firstly, we present a null-space based approach for symmetric saddle point systems in which the (1, 1) block is SPSD and the small k × k block C is non zero. The new approach preserves symmetry and results in a transformed symmetric saddle point system of the same order but with favourable numerical properties, namely a sparse symmetric positive definite (SPD) (1, 1) block and a non zero (2, 2) block of size k + r, where r ≤ k is the rank of B. Having an SPD (1, 1) block allows block (possibly incomplete) factorization methods to be used to solve the system. Secondly, we propose techniques for constructing a basis for \(\mathcal {N}(B)\) when k is small, with an emphasis on balancing sparsity in the transformed problem with stability through the use of QR factorizations with threshold pivoting. Novel contributions are the underlying theory that we present in Section 2 (Theorem 1 and the transformation of the sparse-dense least squares problem described by Lemma 2) as well as the main algorithmic approach based on the threshold QR factorization of the off-diagonal block of the transformed saddle point system. While the composite approaches we propose for constructing the null-space bases are a straightforward generalization of the idea of Howell [3], the pivoted algorithmic approach based on right oblique conjugation is new, building on work done in the 1990s by Benzi, Meyer and Tůma [4,5,6].

This paper is organized as follows. In Section 2, we first recall the standard null-space approach for saddle point systems with a zero (2, 2) block and the recent method of Howell [3] that allows for a non zero C but sacrifices symmetry. We then propose a new symmetry preserving null-space approach for the case of a non zero C and discuss how it can be used to solve large-scale linear least squares problems that have a small number of dense rows. Null basis construction is considered in Section 3. A brief review of methods for building null-space bases for sparse matrices is given before we focus on the case of interest to us: null-space bases for wide matrices that may include some dense rows. A number of approaches are proposed, with an emphasis on ensuring sparsity within the transformed saddle point system while also guaranteeing numerical stability through the use of pivoting. In Section 4, numerical examples from practical applications demonstrate the effectiveness of the approaches and illustrate their potential strengths and limitations. Finally, some concluding remarks and pointers to future research directions are given in Section 5.

1.1 Test environment

The numerical experiments reported in this paper are performed on an Intel(R) Core(TM) i5-4590 CPU running at 3.30 GHz with 12 GB of internal memory. The software for the null-space computations is written in Fortran and compiled using GNU Fortran 6.3.0. The algorithms we present are complex to implement and the software developed for our experiments is not optimized for efficiency: it is written for robustness at the expense of speed. Having obtained a null-space basis, we employ the sparse direct solver HSL_MA87 [7] from the HSL mathematical software library (available at http://www.hsl.rl.ac.uk) to compute the sparse Cholesky factorization within the block matrix factorization. This package is also written in Fortran. The experimental results are processed using MATLAB R2020a. In particular, the presented condition numbers are computed using the MATLAB 1-norm condition number estimator condest.

1.2 Notation

We end this section by introducing notation that we use throughout. Let \(x \in \mathbb {R}^{n}\) be a vector. We denote the i-th entry of x by (x)i and (x)j:l denotes entries j to l of x (1 ≤ j < l ≤ n). \(x^{\bot }\) denotes the (n − 1)-dimensional subspace of all vectors \(w \in \mathbb {R}^{n}\) such that xTw = 0. ei denotes the i-th unit vector. Let \(B \in \mathbb {R}^{k \times n}\) be a matrix. The (i, j)-th entry of B is given by (B)i, j. (B):, l denotes column l of B and (B):, 1:l denotes a matrix comprising columns 1 to l of B. (B)1:l, 1:l denotes the principal leading submatrix of B of order l. The rows of B are \({b^{T}_{1}}, \ldots , {b^{T}_{k}}\). \(\mathcal {N}(B)\) and \(\mathcal {R}(B)\) denote the null space and range space of B, respectively. Z is used to denote a null-space basis matrix (that is, its columns form a null-space basis). P (with or without a subscript and/or superscript) is used for a permutation matrix. Ik denotes the k × k identity matrix and 0n, k denotes the n × k null matrix.

2 Null-space approach for solving saddle point systems

2.1 Null-space approach for zero C

Consider the case that B is of full rank and C = 0k, k, that is,

$$ \left( \begin{array}{cc} H & B^{T} \\ B & 0_{k,k} \end{array}\right) \left( \begin{array}{cc} u \\ v \end{array}\right) = \left( \begin{array}{cc} f \\ g \end{array}\right). $$
(2)

Such systems frequently arise in practical applications, particularly when solving constrained optimization problems; a null-space approach (in the optimization context usually referred to as a reduced Hessian method) is one possible method of solution (see, for example, [1, Section 6]). Assume we have a matrix \(Z \in \mathbb {R}^{n \times (n-k)}\) whose columns form a basis for \(\mathcal {N}(B)\), i.e., BZ = 0k, n, and that we have a particular solution for the second equation in (2), that is, a vector \(\hat {u}\) such that

$$ B \hat u = g.$$

Then solving (2) is equivalent to solving

$$ \left( \begin{array}{cc} H & B^{T} \\ B & 0_{k,k} \end{array}\right) \left( \begin{array}{cc} \bar u \\ v \end{array}\right) = \left( \begin{array}{cc} f - H \hat u \\ 0_{k, 1} \end{array}\right), $$

where \(\bar u = u - \hat {u}\). The second equation in this system is equivalent to finding a vector \(z \in \mathbb {R}^{n-k}\) such that \(\bar {u} = Zz.\) Substituting this into the first equation gives

$$ \begin{array}{@{}rcl@{}} HZz + B^{T} v & = & f - H \hat{u} \\ \iff \quad Z^{T}H Z z & = & Z^{T} (f - H \hat{u}) \end{array} $$
(3)

Therefore, by solving the reduced system (3), it is possible to straightforwardly recover \(u = \hat {u} + Zz\). Finally, v can be obtained by solving the reduced k × k symmetric positive definite system

$$B B^{T} v = B(f-Hu).$$

The null-space method for solving (2) is summarized as Algorithm 1.

[Algorithm 1: the null-space method for solving (2)]
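To make the steps concrete, the following is a minimal dense sketch of Algorithm 1 in Python/NumPy (the software used for our experiments is written in Fortran; this sketch is illustrative only). The SVD-based orthonormal null-space basis and the least-norm particular solution are convenient choices here, not the sparse constructions of Section 3.

```python
import numpy as np

def nullspace_method(H, B, f, g):
    """Minimal dense sketch of Algorithm 1 for (2) (C = 0).

    Assumes B has full row rank and Z^T H Z is SPD. An SVD gives an
    orthonormal (but dense) null-space basis; the sparse constructions
    of Section 3 are not used here.
    """
    k, n = B.shape
    # Step 1: particular solution u_hat with B u_hat = g.
    u_hat = np.linalg.lstsq(B, g, rcond=None)[0]
    # Step 2: columns of Z form a basis of N(B), i.e., B Z = 0.
    Z = np.linalg.svd(B)[2][k:, :].T              # n x (n-k)
    # Step 3: reduced (n-k) x (n-k) SPD system (3).
    z = np.linalg.solve(Z.T @ H @ Z, Z.T @ (f - H @ u_hat))
    # Step 4: recover u.
    u = u_hat + Z @ z
    # Step 5: k x k SPD system B B^T v = B (f - H u).
    v = np.linalg.solve(B @ B.T, B @ (f - H @ u))
    return u, v
```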

A recent study of null-space factorizations for saddle point systems of the form (2) has been given by Rees and Scott [8] and the advantages and disadvantages of null-space methods are summarized in [9]. They can be very efficient for solving a series of problems with the same block B but different H because a basis for \(\mathcal {N}(B)\) needs to be computed only once. A further key advantage is that \(H^{-1}\) is not required. In fact, the method is applicable if H is singular, provided \(\mathcal {N}(H) \cap \mathcal {N}(B) = \{0\}\). Furthermore, the Schur complement is not needed. The null-space approach can also be useful when additional rows (dense or sparse) are added to the system matrix a posteriori [10]. However, a potential difficulty is the need to construct a null-space basis matrix Z and, even when Z can be computed efficiently, some columns of Z may contain a large number of entries so that ZTHZ is not sparse.

The null-space method can be generalized to unsymmetric saddle point systems with a zero (2, 2) block in which H is unsymmetric and the full rank (1, 2) and (2, 1) blocks are \({B_{1}^{T}}\) and B2 with B1 ≠ B2. In this case, \(\hat {u}\) is required such that \(B_{2} \hat {u} =g\) and matrices Z1 and \(Z_{2} \in \mathbb {R}^{n \times (n-k)}\) whose columns form a basis for the null spaces \(\mathcal {N}({B_{1}})\) and \(\mathcal {N}({B_{2}})\), respectively, must be constructed.

2.2 Symmetry-preserving null-space approach for non zero C

As Benzi et al. observe [1, Section 6], the null-space method cannot be applied to solve saddle point systems with C ≠ 0k, k. In a recent paper, Howell [3] proposes what he terms a one-sided application of the null-space method to solve nonsingular generalized saddle point systems of the form

$$ \mathcal{A} \left( \begin{array}{cc} u \\ v \end{array}\right) = \left( \begin{array}{cc} H & {B_{1}^{T}} \\ B_{2} & -C \end{array}\right) \left( \begin{array}{cc} u \\ v \end{array}\right) = \left( \begin{array}{cc} f \\ g \end{array}\right), $$
(4)

in which H and C ≠ 0k, k are possibly unsymmetric, \(B_{1} \in \mathbb {R}^{k \times n}\) and \(B_{2} \in \mathbb {R}^{k \times n}\) are of full row rank, and at least one of B1 and B2 is dense. Howell introduces an auxiliary variable w that is used to replace the (n + k) × (n + k) system (4) by a larger (n + 2k) × (n + 2k) system

$$ \hat{\mathcal{A}} \left( \begin{array}{cc} u \\ v \\ w \end{array}\right) = \left( \begin{array}{cccc} H & {B_{1}^{T}} & 0_{n,k} \\ B_{2} & -C & 0_{k,k} \\ 0_{k,n} & 0_{k,k} & I_{k} \end{array}\right) \left( \begin{array}{cc} u \\ v \\ w \end{array}\right) = \left( \begin{array}{cc} f \\ g \\ 0_{k, 1} \end{array}\right). $$
(5)

Interchanging the second and third equations leads to another unsymmetric saddle point system

$$ \left( \begin{array}{cc} \hat H & \hat{B}_{1}^{T} \\ \hat B_{2} & 0_{k,k} \end{array}\right) \left( \begin{array}{cc} \hat u \\ v \end{array}\right) = \left( \begin{array}{cc} \hat f \\ g \end{array}\right), $$
(6)

with

$$ \hat u = \left( \begin{array}{cc} u \\ w \end{array}\right), \hat f = \left( \begin{array}{cc} f \\ 0_{k, 1} \end{array}\right), \hat H = \left( \begin{array}{cc} H & {B_{1}^{T}} \\ 0_{k,n} & 0_{k,k} \end{array}\right), \hat{B}_{1}^{T} = \left( \begin{array}{cc} 0_{n,k} \\ I_{k} \end{array}\right), \hat B_{2} = \left( \begin{array}{cc} B_{2} & -C \end{array}\right). $$

Because the coefficient matrix in (6) is non singular with a zero (2, 2) block, the null-space method can be applied. Howell provides further details and presents results for a number of practical applications that have a small number of dense rows and columns. However, a major problem with this approach is that it fails to preserve symmetry. If B1 = B2 and H and C are symmetric, the symmetry of the original problem is lost because of the swapping of the second and third equations in (5); it leads to the need to solve an n × n unsymmetric system of the form

$$ \hat{Z}_{1}^{T} \hat H \hat Z_{2} \hat v = \hat g ,$$

where \(\hat Z_{j}\) is a null-space basis matrix for \(\hat B_{j}\) (j = 1, 2). Howell does not comment on this aspect and the use of MATLAB backslash for solving linear systems within his numerical experiments obscures the fact that a symmetric problem has been traded for a larger unsymmetric one.

Our interest is in developing a null-space approach for (1), for the case C ≠ 0k, k with k small, that retains symmetry and leads to a transformed saddle point system of the same dimension n + k with favourable numerical properties, specifically, a symmetric positive definite (1, 1) block and a (2, 2) block of dimension at most 2k. We consider the general case that both H and C are only SPSD and B may not have full row rank. Lemma 1 gives sufficient conditions for \(\mathcal {A}\) to be nonsingular. This extends results from [1]; see also [11].

Lemma 1.

Let

$$ \mathcal{A} = \left( \begin{array}{cc} H & B^{T} \\ B & -C \end{array}\right), $$

where \(H\in \mathbb {R}^{n \times n}\) and \(C\in \mathbb {R}^{k \times k}\) are SPSD matrices, and the rectangular matrix \(B\in \mathbb {R}^{k \times n}\) is not necessarily of full row rank. If \(\mathcal {N}(H) \cap \mathcal {N}(B) = \{0\}\) and \(\mathcal {N}(C) \cap \mathcal {N}(B^{T}) = \{0\}\) then \(\mathcal {A}\) is nonsingular.

Proof 1.

Consider a vector \(w = \left (\begin {array}{cc}u^{T} & v^{T} \end {array}\right )^{T}\) such that \(\mathcal {A}w=0\). This means that Hu + BTv = 0 and Bu − Cv = 0. Premultiplying the first equation by uT and the second by vT and subtracting, the mixed terms cancel (uTBTv = vTBu) and we obtain uTHu + vTCv = 0. Because H and C are SPSD, uTHu = 0 and vTCv = 0 and it follows that Hu = 0 and Cv = 0 [12]. Consequently, BTv = 0 and Bu = 0. The null-space assumptions then imply that w = 0 and hence \(\mathcal {A}\) is nonsingular. □

The following theorem presents our proposed transformation without B needing to be of full row rank.

Theorem 1.

Consider the symmetric saddle point problem (1) of order n + k with rank(B) = r ≤ k and H and C SPSD. Assume furthermore that \(\mathcal {N}(H) \cap \mathcal {N}(B) = \{0\}\) and \(\mathcal {N}(C) \cap \mathcal {N}(B^{T}) = \{0\}\). The solution of (1) can be obtained by solving a transformed saddle point problem of order n + k with a symmetric positive definite (SPD) principal leading submatrix of order n − r.

Proof 2.

Because the saddle-point problem satisfies the assumptions of Lemma 1, the system matrix \(\mathcal {A}\) is nonsingular. Let E be the n × n matrix of full rank given by

$$ E = \left( \begin{array}{cc} Z & Y \end{array}\right), $$
(7)

where \(Z \in \mathbb {R}^{n \times (n-r)}\) has full column rank and is such that

$$ BE = \left( \begin{array}{cc} 0_{k,n-r} & B_{Y} \end{array}\right), \qquad B_{Y} = BY \in \mathbb{R}^{k \times r}, \qquad rank(B_{Y}) = r. $$
(8)

Set \(\mathcal {E}\) to be the (n + k) × (n + k) matrix

$$ \mathcal{E} = \left[\begin{array}{cc} E & 0_{n,k} \\ 0_{k,n} & I_{k} \end{array}\right]. $$

Then the transformed saddle point matrix

$$ \widetilde{\mathcal{A}} = \mathcal{E}^{T} \mathcal{A} \mathcal{E} $$
(9)

is symmetric and nonsingular. We can rewrite this transformation as

$$ \begin{array}{@{}rcl@{}} \widetilde{\mathcal{A}} &=& \left( \begin{array}{cc} E^{T} & 0_{k,n} \\ 0_{n,k} & I_{k} \end{array}\right) \left( \begin{array}{cc} H & B^{T} \\ B & -C \end{array}\right) \left( \begin{array}{cc} E & 0_{n,k} \\ 0_{k,n} & I_{k} \end{array}\right) \\ &=& \left( \begin{array}{cc} E^{T}HE & (BE)^{T} \\ BE & -C \end{array}\right)\\ &=& \left( \begin{array}{ccccc} Z^{T}HZ &~~~ Z^{T}HY &~~~ 0_{n-r,k} \\ Y^{T}HZ &~~~ Y^{T}HY &~~~ {B_{Y}^{T}} \\ 0_{k,n-r} &~~~ B_{Y} &~~~ -C \end{array}\right) \\ &=& \left( \begin{array}{cc} \widehat H & \widehat B^{T} \\ \widehat B & - \widehat C \end{array}\right), \end{array} $$

where

$$ \widehat H = (E^{T}HE)_{1:n-r, 1:n-r} = Z^{T} H Z $$
(10)

is the SPD leading principal submatrix of ETHE of order n − r, and

$$ \widehat B = \left( \begin{array}{cc} Y^{T}HZ \\ 0_{k,n-r} \end{array}\right), \qquad \widehat C = \left( \begin{array}{cc} -Y^{T}HY & -{B_{Y}^{T}} \\ -B_{Y} & C \end{array}\right). $$

The transformed system becomes

$$ \widetilde{\mathcal{A}} \left( \begin{array}{cc} \tilde u \\ \tilde v \end{array}\right) = \left( \begin{array}{cc} \widehat H & \widehat B^{T} \\ \widehat B & - \widehat C \end{array}\right) \left( \begin{array}{cc} \tilde u \\ \tilde v \end{array}\right) = \left( \begin{array}{cc} \tilde f \\ \tilde g \end{array}\right) = \mathcal{E}^{T} \left( \begin{array}{cc} f \\ g \end{array}\right), $$
(11)

where \(\tilde u \in \mathbb {R}^{n-r}\), \(\tilde v \in \mathbb {R}^{r+k}\) and

$$ \tilde f = (E^{T}f)_{1:n-r} \qquad \tilde g = \left( \begin{array}{cc} (E^{T}f)_{n-r+1:n} \\ g \end{array}\right). $$

Once \(\tilde u\) and \(\tilde v\) have been computed, the solution of the original problem (1) is recovered from the conformal partitioning \(\left (\begin {array}{cc} u^{T} & v^{T}\end {array}\right )^{T} = \mathcal {E} \left (\begin {array}{cc} \tilde u^{T} & \tilde v^{T}\end {array}\right )^{T}\), that is,

$$ u = E \left( \begin{array}{c} \tilde u \\ (\tilde v)_{1:r} \end{array}\right) = Z \tilde u + Y (\tilde v)_{1:r}, \qquad v= (\tilde v)_{r+1:r+k} . $$

□
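The transformation and the recovery step can be checked with a few lines of dense Python/NumPy. This is an illustrative sketch only: Z and Y are assumed to be given such that (7)–(8) hold.

```python
import numpy as np

def solve_by_transformation(H, B, C, f, g, Z, Y):
    """Dense sketch of Theorem 1: form (11), solve it, recover (u, v).

    Assumes B Z = 0, E = (Z Y) nonsingular and rank(B Y) = rank(B).
    """
    n, k = H.shape[0], C.shape[0]
    E = np.hstack([Z, Y])
    calE = np.block([[E, np.zeros((n, k))],
                     [np.zeros((k, n)), np.eye(k)]])
    A = np.block([[H, B.T], [B, -C]])
    A_tilde = calE.T @ A @ calE                    # transformed matrix (9)
    t = np.linalg.solve(A_tilde, calE.T @ np.concatenate([f, g]))
    u = E @ t[:n]             # u = Z u_tilde + Y (v_tilde)_{1:r}
    v = t[n:]                 # v = (v_tilde)_{r+1:r+k}
    return u, v
```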

Remark 1.

The claim in Theorem 1 that the transformed saddle point matrix \(\tilde {\mathcal {A}}\) given by (9) is symmetric and nonsingular cannot be strengthened to it being quasi definite, even when B has full row rank. The problem is that the submatrix (ETHE)n−r+1:n, n−r+1:n = YTHY is then only symmetric positive semidefinite.

The (1, 1) block \(\widehat H\) in the transformed system (11) is SPD. This allows block factorizations to be employed, which are useful not only for direct methods but also in the construction of preconditioners for use with iterative methods. For example, using the Cholesky factorization \(\widehat H = L_{1} {L_{1}^{T}}\) yields

$$ \left( \begin{array}{cc} \widehat H & \widehat B^{T} \\ \widehat B & - \widehat C \end{array}\right) = \left( \begin{array}{cc} L_{1} \\ L_{2} & L_{S} \end{array}\right) \left( \begin{array}{cc} I & \\ & -D_{S} \end{array}\right) \left( \begin{array}{cc} {L_{1}^{T}} & {L_{2}^{T}} \\ & {L_{S}^{T}} \end{array}\right), $$
(12)

where

$$ L_{2} = \left[\begin{array}{cc} \tilde{L}_{2} \\ 0_{k, n-r} \end{array}\right] \in \mathbb{R}^{(r+k)\times (n-r)} \quad \text{with} \quad L_{2} {L_{1}^{T}} = \widehat B, $$

and

$$ \begin{array}{@{}rcl@{}} \widehat C + L_{2} {L_{2}^{T}} = \left[\begin{array}{cc} -Y^{T}HY+\tilde{L}_{2}\tilde{L}_{2}^{T} &~~~ -{B_{Y}^{T}} \\ -B_{Y} &~~ C \end{array}\right] = L_{S} D_{S} {L_{S}^{T}}, \end{array} $$

where LS is unit lower triangular and DS is block diagonal with 1 × 1 and 2 × 2 blocks.
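As an illustration, a dense sketch of a solve of (11) built on the Cholesky factorization \(\widehat H = L_{1} {L_{1}^{T}}\) underlying (12) follows; it eliminates the (1, 1) block and solves with the Schur complement \(\widehat C + L_{2}{L_{2}^{T}}\) directly. In our experiments the sparse Cholesky factorization is performed by HSL_MA87 and the Schur complement is factorized as \(L_{S}D_{S}{L_{S}^{T}}\); the sketch below is not that implementation.

```python
import numpy as np
from scipy.linalg import cho_factor, cho_solve

def solve_transformed(H_hat, B_hat, C_hat, f_tilde, g_tilde):
    """Dense block solve of (11); H_hat SPD, Schur complement nonsingular."""
    L = cho_factor(H_hat)                       # H_hat = L1 L1^T
    w = cho_solve(L, f_tilde)                   # w = H_hat^{-1} f_tilde
    # Schur complement C_hat + B_hat H_hat^{-1} B_hat^T (= C_hat + L2 L2^T).
    S = C_hat + B_hat @ cho_solve(L, B_hat.T)
    v_tilde = np.linalg.solve(S, B_hat @ w - g_tilde)
    u_tilde = cho_solve(L, f_tilde - B_hat.T @ v_tilde)
    return u_tilde, v_tilde
```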

2.3 An application: sparse-dense least squares problems

Consider the linear least-squares (LS) problem

$$ \min_{x} \| Ax - b \|_{2}, $$

where the system matrix \(A \in \mathbb {R}^{m \times n}\) (m ≥ n) and the right-hand side vector \(b \in \mathbb {R}^{m}\) are given. The solution x satisfies the n × n normal equations

$$ C x = A^{T}b, \qquad C = A^{T}A, $$

where, provided A has full column rank, the normal matrix C is symmetric positive definite. Our interest lies in the case where A is large and mainly sparse and includes a relatively small number of rows that are regarded as dense. These rows may be fully dense or have significantly more entries compared to the other rows of A or may contain far fewer entries than n but nevertheless lead to large amounts of fill in C. The presence of such rows has long been recognized as a fundamental difficulty in the solution of large LS problems; see, for example, [13] for a discussion and references to past work on this problem.

Assuming the rows of A that are to be treated as dense have been permuted to the end and assuming a conformal partitioning of the vector b (and omitting the row permutation matrix for simplicity), we have

$$ A = \left( \begin{array}{cc} A_{s} \\ A_{d} \end{array}\right), \ A_{s} \in \mathbb{R}^{m_{s}\times n}, \ A_{d} \in \mathbb{R}^{m_{d}\times n}, \ b = \left( \begin{array}{cc} b_{s} \\ b_{d} \end{array}\right), \ b_{s} \in \mathbb{R}^{m_{s}}, \ b_{d} \in \mathbb{R}^{m_{d}}, $$

where ms denotes the number of sparse rows of A, and md is the number of dense rows, with m = ms + md, ms ≥ n > md ≥ 1 and md ≪ ms. The normal equations become

$$ Cx = (C_{s} + {A_{d}^{T}} A_{d})x = c, \qquad c = {A_{s}^{T}}b_{s} + {A_{d}^{T}} b_{d}, $$
(13)

where \(C_{s} = {A_{s}^{T}}A_{s}\) is the reduced normal matrix. The solution of (13) can be obtained from the equivalent (n + md) × (n + md) system

$$ \left( \begin{array}{cc} C_{s} & {A_{d}^{T}} \\ A_{d} & -I_{m_{d}} \end{array}\right) \left( \begin{array}{cc} x \\ A_{d} x \end{array}\right) = \left( \begin{array}{cc} c \\ 0_{m_{d}, 1} \end{array}\right). $$
(14)

This is of the form (1) with k = md, H = Cs, B = Ad and \(C=I_{m_{d}}\). If As has full column rank, Cs is SPD. But in practice As may contain one or more null columns, and Cs is then singular with a corresponding number of null rows and columns. Even if Cs has no null columns, it can be singular or highly ill conditioned. Recent studies have proposed ways of circumventing the problem of null columns in As. In [13, 14], it was shown how they can be dealt with either by perturbing diagonal entries of Cs before applying a direct solver or by solving a number of related sparse LS problems and combining their solutions to give the solution of the original problem. Unfortunately, both methods incur overheads, with the former requiring that the sparse direct solver be combined with an iterative solver and the latter needing the solution of a number of LS problems. These disadvantages led us to consider employing matrix stretching [15, 16]. The main aim of matrix stretching is to split each of the dense rows (that is, each row of Ad) into a number of sparser rows and to formulate a (larger) modified problem from which the solution to the original problem can be derived. This has the advantage that, provided A is of full rank, the issue of null columns does not arise. However, when As is very sparse, the dimensions of the stretched system can grow rapidly with md so that the stretched LS problem can be considerably larger than the original problem. It may also be highly ill conditioned. Thus, we remain interested in developing alternative strategies and this was a key motivation behind the current work on null-space approaches.
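Returning to (14), its construction and equivalence to the normal equations (13) are easy to verify; the following dense Python/NumPy sketch is for illustration only (in practice Cs is kept sparse and the system is solved via the transformation of Section 2.2).

```python
import numpy as np

def ls_via_saddle_point(A_s, A_d, b_s, b_d):
    """Dense sketch: solve min ||Ax - b||_2 with A = [A_s; A_d] via (14)."""
    n, md = A_s.shape[1], A_d.shape[0]
    c = A_s.T @ b_s + A_d.T @ b_d
    K = np.block([[A_s.T @ A_s, A_d.T],
                  [A_d, -np.eye(md)]])           # system matrix of (14)
    sol = np.linalg.solve(K, np.concatenate([c, np.zeros(md)]))
    return sol[:n]                               # x; sol[n:] equals A_d x

# Quick consistency check against a standard least squares solve.
rng = np.random.default_rng(0)
A_s, A_d = rng.standard_normal((50, 8)), rng.standard_normal((2, 8))
b_s, b_d = rng.standard_normal(50), rng.standard_normal(2)
x = ls_via_saddle_point(A_s, A_d, b_s, b_d)
x_ref = np.linalg.lstsq(np.vstack([A_s, A_d]),
                        np.concatenate([b_s, b_d]), rcond=None)[0]
print(np.linalg.norm(x - x_ref))                 # small (~1e-12)
```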

The following result shows that, in the case of the full rank LS problem (14), the assumptions of Theorem 1 are satisfied, with r ≤ md the rank of Ad.

Lemma 2.

Consider \(A = \left (\begin {array}{cc} A_{s} \\ A_{d} \end {array}\right ), \ A_{s} \in \mathbb {R}^{m_{s}\times n}, \ A_{d} \in \mathbb {R}^{m_{d}\times n}\). If A is of full rank, then \(C_{s} = {A_{s}^{T}} A_{s}\) is positive definite on the null space of Ad.

Proof 3.

Let \(v \in \mathcal {N}(A_{d})\) and consider

$$v^{T}C_{s}v = (A_{s}v)^{T}A_{s}v \ge 0.$$

If Asv = 0 then because Adv = 0 it follows that Av = 0. Because A is of full rank, this implies v = 0. Hence vTCsv > 0 and Cs is positive definite on \(\mathcal {N}(A_{d})\). □

Thus, provided a basis for \(\mathcal {N}(A_{d})\) can be constructed, the proposed null-space approach offers an alternative way of solving sparse-dense LS problems, even in the case that Cs is SPSD.

3 Null-space basis construction

Solving large-scale saddle point systems using null-space methods requires finding null-space bases that preserve sparsity and lead to a stable transformed system. Before looking at null-space basis construction for wide matrices, we give a brief overview of the historical development of techniques for constructing null-space bases of sparse matrices.

3.1 Construction of null-space bases of sparse matrices

A null-space basis matrix Z for \(B \in \mathbb {R}^{k \times n}\) of rank r > 0 can sometimes be obtained directly by analysing the problem [17] but more advanced methods are normally necessary. In structural analysis, early methods were based on factorizations of B and focused on the sparsity of Z, with emphasis on Z having a banded or skyline sparsity pattern. Computation of Z based on an initial LU factorization of B was proposed by Topçu [18] (see also [19, 20]). A specific strategy of this kind, called the turnback method [18, 19], attracted interest in the late 1970s/early 1980s (see also an early parallel implementation [21] and its recent use in practice [22]). The turnback (backward looking) method finds null vectors by expressing nr chosen start columns of B as linear combinations of a small set of previously numbered columns. The initial LU factorization serves to determine these start columns. Linear independence of the columns of Z is guaranteed by not using the leftmost columns in these linear combinations from any set computed for the remaining start columns. An overview and modifications to the basic approach (such as replacing LU by a QR factorization) are described in [23]; see also [24] for further refinements. More recently, factorization approaches for the computation of null-space bases of sparse matrices are discussed in [25, 26].

Theoretical and algorithmic breakthroughs based on bipartite graph matchings and matroid theory came with the 1984 thesis of Pothen [27] and subsequent papers. The contributions covered not only new algorithmic approaches but also complexity concepts for the related problem of finding the sparsest null-space basis of B, that is, for finding Z with the smallest possible number of non zero entries (see also [28]). It has been shown that such a basis can be constructed by a greedy algorithm that assembles the basis using a sequential choice of the sparsest vectors belonging to \(\mathcal {N}(B)\). While this can provide extremely sparse Z, there are two important practical limitations. Firstly, finding the sparsest vectors of a null-space basis for a general constraint matrix is NP-hard. Secondly, sparsity of Z may not be enough and its numerical properties must also be considered. Sophisticated proposals given in [27, 29, 30] were designed to compute sparse null-space bases of sparse matrices and they led to efficient algorithms for computing so-called fundamental and triangular null-space bases.

Origins of another line of research appear in the 1969 seminal paper of Henderson and Maunder [31] (see also [32]). Subsequently, a number of related algorithms for specific applications have been developed independently. These exploit the graph of B and, in particular, cycles in the graph. The idea is most transparent if B is the vertex-edge incidence matrix of some underlying graph. In structural mechanics, cycles of the graph that describes the interconnection of separate substructures of a skeletal structure can be considered. Using the graph cycles, a set of independent null-space vectors that form columns of Z can be computed [31, 33]. This construction was developed and discussed in the context of other approaches in [34]. In some partial differential equation applications the cycle approach is relevant because a simple constraint structure may be implied by discretization schemes [35,36,37,38,39]. Many publications discuss using cycles in the graph: pointers to relevant literature can be found in [40, 41]. There are heuristic ways of finding the cycle basis, a task that is straightforward if the underlying graph is planar [39]. More generally, construction may be based on a spanning tree of the graph [27]. Procedures to find sparse cycle bases in this way are given in [42] (see also [43, 44]). However, discretizations of three-dimensional grids may not allow sparse cycle bases to be found [38]. As with sparse factorization approaches, those based solely on local computations of Z may suffer from ill conditioning.

An interesting motivation is to construct Z using structure, for example, by allowing a permutation to block angular form. Such an approach was introduced in [45]. Closely related proposals to compute Z while explicitly exploiting substructuring, motivated in part by parallel computing, were given in [46, 47] (see also [48]). The block form can be obtained algebraically, as in the nested dissection ordering discussed in [44]; see also [45].

3.2 Null-space basis for wide matrices

In contrast to much of the previous work on constructing null-space bases, our focus is on wide matrices \(B \in \mathbb {R}^{k\times n}\) with k ≪ n. B may be sparse but because k is small, we may want to treat B as dense (although the null-space basis construction may still involve sparse operations). As can happen in practical applications, we allow B to be rank deficient (rank(B) = r < k). We want the nearly-square null-space basis matrix \(Z \in \mathbb {R}^{n \times (n-r)}\) to be sparse but we are also concerned about conditioning. Our experiments consider the quality of Z and, as we employ a sparse direct solver to factorize the (1, 1) block ZTHZ given by (10) of the transformed system, we need Z to have no dense rows and be such that ZTHZ is sparse. Orthogonal transformations inevitably lead to dense Z and so, although useful in other situations (see, for instance, [49]), are not appropriate here. Because B is wide, each column of Z can be computed independently and such that each has small support (the entries can be computed as the coefficients of linear combinations of at most k other columns). Small support means that there is always some orthogonality but Z may still be ill conditioned. Indeed, if B is an adjacency matrix of a domain discretized by mixed hybrid finite elements, then the condition number of a small-support null-space basis of the constraints that express the hybridization can grow like \(h^{-2}\), where h is the discretization parameter [38]. Thus, we propose employing a QR factorization that incorporates pivoting for stability and, because B has only a small number of rows, this is not prohibitively expensive.

3.2.1 Null-space bases with local support

This section discusses constructing (a permutation of) Z by finding linearly dependent sets of columns with close column indices. The coefficients of the linear combination of these columns that sum to zero are the values of the entries in Z. These bases are similar to those belonging to the category of triangular null-space bases [29], but employment of QR factorizations with pivoting leads to Z having a more general sparsity pattern. Our focus is on the numerical qualities of the basis combined with limiting the number of entries through the employment of a threshold pivoting strategy (thus, as is common with sparse matrix factorizations, there is a trade off between stability and sparsity).

We begin with the simple case in which B comprises a single fully dense row (k = 1) and consider two extreme cases for the structure of a sparse null-space basis \(Z \in \mathbb {R}^{n \times (n-1)}\). First, Z can be constructed to have non zeros only in its first row and on the diagonal of Z2:n, 1:n− 1. The entries in row 1 are obtained by dividing entries 2 to n of B by the entry in its (1, 1) position. Column pivoting is needed to ensure that this entry is large. The resulting Z is sparse but ZTHZ is dense and so is not of interest here.

The second extreme case constructs Z to be a permutation of a matrix with a banded structure; it is presented as Algorithm 2 and illustrated in Fig. 1 (a code sketch is given below the figure). When \(B \in \mathbb {R}^{1 \times n}\) we have Q = I1, but we include the Q factor in the algorithm description because it will be needed in subsequent more general algorithms. The R factor is a single row with its first entry β1 ≠ 0. The computed Y (recall (7)) is sparse, has full rank, is well conditioned because of the column pivoting, and complements the null-space basis Z [8, 50]. In particular, BY = β1. We remark that a similar approach was recently proposed by Howell [3, Algorithm 3].

[Algorithm 2: banded null-space basis for a single dense row]

Fig. 1: Simple example with n = 5 and r = 1 showing \(B\in \mathbb {R}^{1 \times 5}\), \(P\in \mathbb {R}^{5 \times 5}\), the permuted matrix BP and \(\widetilde Z\) computed using Algorithm 2. E satisfies (8)
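A minimal sketch of the banded construction for k = 1 follows, under the simplifying assumption that every entry of b is nonzero; Algorithm 2 additionally applies the pivoting permutation P, which the sketch omits.

```python
import numpy as np

def banded_nullspace_single_row(b):
    """Bidiagonal null-space basis for a single dense row b (k = 1).

    Column j couples only entries j and j+1, so b @ Z = 0 columnwise:
    b[j]*b[j+1] + b[j+1]*(-b[j]) = 0. Assumes all entries of b nonzero.
    """
    n = b.shape[0]
    Z = np.zeros((n, n - 1))
    for j in range(n - 1):
        Z[j, j] = b[j + 1]
        Z[j + 1, j] = -b[j]
    return Z

b = np.array([1.0, 2.0, -1.0, 3.0, 1.0])
Z = banded_nullspace_single_row(b)
print(np.abs(b @ Z).max())    # ~0: columns of Z lie in N(b)
```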

A dominant part of the saddle-point matrix (11) is its (1, 1) block ZTHZ. Its condition number cond(ZTHZ) is bounded by

$$ cond(Z^{T} H Z) \le cond({Z_{0}^{T}} H Z_{0}) cond(Z^{T}Z), $$
(15)

where Z0 is an orthogonal null-space matrix (see Lemma 10 in [51]). Consequently, it is desirable for cond(ZTZ) to be small. Constructing Z so that PTZ has a narrow band may not be sufficient to guarantee this, as can be deduced from the following result for the idealized example of B comprising a single row of all ones.

Lemma 3.

Let \(B \in \mathbb {R}^{1 \times n}\) be a row vector of all ones and construct Z using Algorithm 2. Then the condition number of \(Z^{T}Z \in \mathbb {R}^{(n-1) \times (n-1)}\) is asymptotically of order \(n^{2}\).

Proof 4.

The permutation P is the identity and the computed Z is bidiagonal with diagonal entries equal to − 1 and off diagonal nonzeros equal to 1. ZTZ is a well-known Laplacian matrix with eigenvalues given by

$$ 2 \left( 1 - \cos \frac{j \pi}{n} \right), \qquad 1 \le j \le n-1. $$

Consequently, the largest eigenvalue goes asymptotically to 4 and the smallest one is

$$4 \sin^{2} \frac{\pi}{2 n} \approx \frac{\pi^{2}} {n^{2}}.$$

The result follows. □
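Lemma 3 is easy to check numerically: for b a row of ones, Algorithm 2 gives the bidiagonal Z of the proof, and cond(ZTZ)/n² approaches 4/π² ≈ 0.405. The snippet below is a quick check, not one of the experiments of Section 4.

```python
import numpy as np

# Z[j,j] = -1 and Z[j+1,j] = 1, so Z^T Z is the (n-1) x (n-1) Laplacian.
for n in (50, 100, 200, 400):
    Z = -np.eye(n, n - 1) + np.eye(n, n - 1, k=-1)
    print(n, np.linalg.cond(Z.T @ Z) / n**2)   # ratio tends to 4/pi^2
```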

Algorithm 3 considers general B with 1 ≤ k < n rows and rank r such that 0 < r ≤ k. The basic idea is to express columns of Z as linear combinations of previous columns with close indices. r is computed using a QR factorization with column pivoting. The columns of B corresponding to the first r columns of the R factor are permuted to the front and the remaining n − r columns are marked as dependent; they correspond to the columns of Z. They are computed independently while aiming to balance sparsity and numerical stability. As in other sparse numerical linear algebra algorithms that need to combine locality with ensuring stability, this is achieved using threshold column pivoting. The principle behind threshold pivoting is as follows (see also the sketch following Algorithm 3). Assume a threshold parameter θ is given with 0 < θ ≤ 1. At a step of the QR factorization, assume that Δ is the maximum norm of all the columns eligible to be chosen as the next pivot column. While full column pivoting selects the pivot column to be an eligible column with norm Δ, threshold pivoting chooses the pivot column to be the eligible column with norm at least θΔ and such that its index is closest to the index of the first (starting) column of the factorized matrix. In Algorithm 3, we perform QR factorizations repeatedly in the loop commencing at Line 3 for a set of n − r starting columns. The initial permutation P guarantees to find r and determine the n − r starting columns, that is, the columns that are expressed in the loop as linear combinations of previous columns in the order determined by P. The local permutation PL reverses the column order so that threshold column pivoting chooses columns closest to the starting column first. The coefficients are then obtained by performing QR factorizations (with threshold column pivoting) of submatrices of BP. While Algorithm 3 is somewhat technical, it essentially combines two steps: it first determines BP and then solves n − r subproblems for the entries of the n − r columns of Z. Permutations ensure the subproblems always find these entries. The matrix E computed by Algorithm 3 satisfies (8).

[Algorithm 3: null-space basis with local support via QR factorizations with threshold column pivoting]
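The threshold pivot-selection rule used inside Algorithm 3 is simple to state in code. The helper below is hypothetical (it is not taken from our Fortran code) and assumes the columns have already been placed in the local order induced by P and PL, so that "closest to the starting column" corresponds to the smallest eligible index.

```python
def threshold_pivot(col_norms, eligible, theta):
    """Return the pivot: the eligible column of smallest index whose norm
    is at least theta times the largest eligible norm (0 < theta <= 1)."""
    delta = max(col_norms[j] for j in eligible)
    return min(j for j in eligible if col_norms[j] >= theta * delta)
```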

Incorporating threshold column pivoting in the QR factorizations in Algorithm 3 implies that, provided the parameter θ is not too small, the matrix BY should be reasonably well conditioned. Moreover, it guarantees that any dependent column \((\widetilde B)_{:,l}\) of \(\tilde B\) can be stably expressed as a linear combination of previous columns. Note that the coefficients of this linear combination are expressed using the inverse of \(\tilde R\). Because each such column is used in the construction of just one column of Z, the columns of Z must be linearly independent. Observe that a null column \((\tilde B)_{:,l}\) implies a unit column in Z. Note also that Line 10 is well defined because the permutation P guarantees not only that there is a submatrix of rank r in each Xl, l = r + 1, … , n, but also that \(\widehat Q^{T} (\widetilde B)_{:,l}\) cannot have non zeros in rows r + 1, … , k.

Figure 2 shows the basis matrix Z constructed using Algorithm 3 for \(B \in \mathbb {R}^{2 \times 6}\) of full rank; θ is set to 0.1. With this threshold choice, it is not necessary to permute column 6 of B (the one with largest norm) to the front and hence P = I and \(B=\tilde B\). The loop starting in Line 3 considers the last n − r = 4 columns of \(\tilde B\) and expresses each as a linear combination of the closest previous columns, successively accessing the columns in reverse order; the initial permutation P guarantees this is possible and each column provides a column of \(Z =\tilde Z\). For example, B:,4 = 2B:,3 − B:,2, giving entries − 1, 2, − 1 in rows 2, 3 and 4 of the second column of Z.

Fig. 2: Simple example with n = 6 and r = k = 2. Z and Y are constructed using Algorithm 3; E = (Z Y) satisfies (8)

3.2.2 Fundamental null-space basis

We next discuss constructing the so-called fundamental null-space basis, which was first mentioned in an unpublished 1962 paper by Wolfe [52] (see [29]).

Let \(B \in \mathbb {R}^{k \times n}\) have full row rank and assume the columns have been permuted so that G = (B)1:k, 1:k is nonsingular. Then the fundamental null-space basis of \(B =\left (\begin {array}{cc} G & N \end {array}\right )\) is defined to be

$$ Z = \left( \begin{array}{cc} - G^{-1} N \\ I_{n-k} \end{array}\right). $$
(16)

For rank(B) = r < k, let

$$ BP = Q \left( \begin{array}{cc}R_{1} & R_{2} \\ 0_{k-r,r} & 0_{k-r,n-r} \end{array}\right) \quad \text{and} \quad P_{1} B P_{2} = \left( \begin{array}{cc}L_{1} \\ L_{2} & I_{k-r} \end{array}\right)\left( \begin{array}{cc}U_{1} & U_{2} \\ 0_{k-r,r} & 0_{k-r,n-r} \end{array}\right) $$

be the pivoted QR and LU factorizations of B, respectively. The fundamental null-space basis of B may be expressed as either

$$ Z = P \left( \begin{array}{cc} - R_{1}^{-1} R_{2} \\ I_{n-r} \end{array}\right) $$
(17)

or

$$ Z = P_{2} \left( \begin{array}{cc} - U_{1}^{-1} U_{2} \\ I_{n-r} \end{array}\right). $$
(18)

For the LU factorization, our experience is that complete pivoting should be used for stability.
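A dense sketch of (17) using SciPy's column-pivoted QR is given below. The rank-detection tolerance is an illustrative assumption, not the criterion used in our code.

```python
import numpy as np
from scipy.linalg import qr, solve_triangular

def fundamental_basis(B, rtol=1e-12):
    """Fundamental null-space basis (17) via column-pivoted QR (dense sketch)."""
    k, n = B.shape
    Q, R, piv = qr(B, pivoting=True)            # B[:, piv] = Q R
    d = np.abs(np.diag(R))
    r = int(np.sum(d > rtol * d[0]))            # numerical rank (assumes B != 0)
    T = solve_triangular(R[:r, :r], R[:r, r:])  # R1^{-1} R2
    Z = np.vstack([-T, np.eye(n - r)])
    return np.eye(n)[:, piv] @ Z                # apply the permutation P
```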

The biconjugation process of Hestenes [53] applied to a square matrix \(A \in \mathbb {R}^{n \times n}\) yields a pair of biconjugate matrices \((\tilde V , V)\) such that \(\tilde V A V\) is diagonal. Biconjugate factorizations were discussed in [54] and some practical extensions were given by Benzi [4]; see also the direct projection method [5, 6]. Applying biconjugation approximately leads to so-called approximate inverse preconditioners [55]. We define a partial biconjugation factorization applied to \(B \in \mathbb {R}^{k \times n}\) with rank(B) = r, 0 < rkn, to be a process that yields \(V \in \mathbb {R}^{n \times n}\) and a lower trapezoidal \(L \in \mathbb {R}^{k \times n}\) satisfying

$$ BV = L. $$
(19)

We term this right oblique conjugation. Assuming r steps of an LDU factorization of B can be performed without pivoting, Algorithm 4 computes a null-space basis matrix \(Z \in \mathbb {R}^{n \times (n-r)}\) by right oblique conjugation.

[Algorithm 4: null-space basis by right oblique conjugation]

It initializes V to be the identity. The columns of V are then transformed so that, from Line 4, \({b_{i}^{T}} v_{j}^{(i)} = 0.\) At the end of the algorithm, BV is lower trapezoidal. This means, in particular, that the last n − r columns of V form a null-space basis because they are linearly independent. To see the mutual relationship between oblique conjugation and LDU factorization, first consider the ‘non-wide’ square case (r = k = n). In this instance, \(V = U^{-1}\), where U is the U factor of the LDU factorization of B without pivoting. This result follows from the fact that V is unit upper triangular by construction, that the LDU factorization without pivoting is unique, and that (19) is satisfied (see [56]). Now consider r ≤ k < n. Because the construction commences with V = In, and all the updates at Line 4 are well defined, the computed V is upper triangular and its last n − r columns have the shape of (16), where G represents the principal leading submatrix of B of dimension r. In the algorithm outline, for simplicity, we omit pivoting. Even with column pivoting, an assumption is needed to guarantee the factorization exists. It is sufficient to assume linear independence of the first r rows of B, which is weaker than requiring that the first r steps of the LDU factorization of B can be performed without pivoting. We have the following straightforward result.
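In code, the projection loop of Algorithm 4 reads as follows. This is a sketch without the pivoting discussed below, so it assumes all pivots \({b_{i}^{T}}v_{i}^{(i-1)}\) are nonzero.

```python
import numpy as np

def right_oblique_conjugation(B, r):
    """Sketch of Algorithm 4: compute V with B V lower trapezoidal and
    return its last n-r columns (a fundamental null-space basis).
    Assumes the first r rows of B are linearly independent and that no
    pivoting is required."""
    k, n = B.shape
    V = np.eye(n)
    for i in range(r):
        bi = B[i, :]
        for j in range(i + 1, n):
            # Line 4: oblique projection of v_j along v_i onto b_i^perp.
            V[:, j] -= (bi @ V[:, j]) / (bi @ V[:, i]) * V[:, i]
    return V[:, r:]
```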

Lemma 4.

Let the first r rows of \(B \in \mathbb {R}^{k \times n}\) be linearly independent and let rank(B) = r. Apply right oblique conjugation to B. Then there exists a permutation \(P \in \mathbb {R}^{n \times n}\) and a block partitioning of \(BP = \left (\begin {array}{cc} G & N \\ D & C \end {array}\right )\) with \(G \in \mathbb {R}^{r \times r}\) such that

$$ \left( \begin{array}{cc} G & N \end{array}\right) \left( \begin{array}{cc} W & S \\ 0_{n-r,r} & I_{n-r} \end{array}\right) = \left( \begin{array}{cc} I_{r} & 0_{r,n-r} \end{array}\right), $$
(20)

for some \(W \in \mathbb {R}^{r \times r}\) and \(S \in \mathbb {R}^{r \times (n-r)}\). In particular,

$$ Z = \left( \begin{array}{cc} S \\ I_{n-r} \end{array}\right)$$

is the fundamental null-space basis corresponding to \(\left (\begin {array}{cc} G & N \end {array}\right )\).

Proof 5.

Setting P to be the permutation matrix corresponding to column pivoting within the LU factorization and using the uniqueness of the first r steps of the LU factorization of BP, we have \(W = G^{-1}\). Moreover, GS + N = 0r, n−r, from which it follows that

$$ \left( \begin{array}{cc} S \\ I_{n-r} \end{array}\right) \equiv \left( \begin{array}{cc} -G^{-1}N \\ I_{n-r} \end{array}\right)$$

is the fundamental null-space basis of the matrix B with permuted columns. □

One reason why fundamental null-space bases computed using right oblique conjugation are of potential interest relates to the practical implementation of Algorithm 4: the one-sided factorization does not split the inverse of G into two factors. In practice, pivoting needs to be incorporated to maximize the magnitude of the quantity \({b_{i}^{T}}v_{i}^{(i-1)}\) used in Line 4. While the rows of B can be stored in static data structures, the columns of V that are eligible to be pivot columns (those with indices j = i + 1, … , n) are updated and so need to be stored in a (single) dynamic data structure. By contrast, if the basis is constructed using a standard LU factorization, two sets of vectors (those that form the L and U factors) change dynamically and so must be stored using two dynamic data structures. This storage difference may be important when k is large (which is not the case in this paper). Note that the one-sided factorization is a useful way to explain relations among closely related computational approaches [57]. In exact arithmetic, the diagonal entries \({b_{i}^{T}}v_{i}^{(i-1)}\) are inverses of the diagonal entries of the LDU factorization (see [4]). In finite precision arithmetic, there are other, and possibly more stable, ways of computing the pivots that we do not discuss here; theoretical properties of some biconjugation variants (with additional assumptions) are discussed in [58]. In the case k ≪ n, complete pivoting is not prohibitively expensive and may be needed for stability.

A further attraction of right oblique conjugation is its flexibility. It was introduced as a way to obtain the fundamental null-space basis, but it can be modified to give other null-space bases, including banded ones. To see this, consider the inner loop of Algorithm 4. The oblique projection in Line 4 projects the vectors \(v_{j}^{(i-1)}\) along \(v_{i}^{(i-1)}\) onto the space \(b_{i}^{\bot }\). The oblique projection can be rewritten using the projector

$$ \left( \begin{array}{cc}I - \frac{v_{i}^{(i-1)}{b_{i}^{T}}}{{b_{i}^{T}}v_{i}^{(i-1)}} \end{array}\right), $$

where the vectors \(v_{i}^{(i-1)}, \ldots , v_{n}^{(i-1)}\) belong to the space \(b_{1}^{\bot } \cap {\ldots } \cap b_{i-1}^{\bot }\) (see, for example, [5]). But \(v_{i}^{(i-1)}, \ldots , v_{n}^{(i-1)}\) can be projected along any other set of linearly independent vectors that are not equal to the \(v_{i}^{(i-1)}\). Algorithmically, the projection can be replaced by

$$ {\displaystyle v_{j}^{(i)} = \left( \begin{array}{cc}I - \frac{x_{j} {b_{i}^{T}}}{{b_{i}^{T}}x_{j}} \end{array}\right) v_{j}^{(i-1)}, j=i+1, \ldots, n,} $$

where x2, … , xn are linearly independent. The real power of constructing null-space bases via oblique projections is apparent from the following result. It shows that Algorithm 2, which obtains the null-space basis Z by permuting the upper bidiagonal matrix \(\tilde Z\), can be cast in the form of projections. We state the result without proof because it can be verified by direct checking.

Lemma 5.

Algorithm 2 can be rephrased in terms of the general scheme of Algorithm 4 by using in Line 4 the oblique projections

$$ {z_{j}^{(i)} = \left( \begin{array}{cc}I - \frac{x_{j} {b_{i}^{T}}}{{b_{i}^{T}}x_{j}} \end{array}\right) z_{j}^{(i-1)}, j=i+1, \ldots, n,} $$
(21)

with \(x_{j} = e_{j-1}\), j = 2, … , n.

3.2.3 Composite null-space basis for wide dense matrices

There are situations in which the null-space basis can be constructed from partial null-space bases in several steps, instead of the single pass of Algorithm 3. This may simplify the computation, allow it to be more parallel, or be useful in specific cases, for example, when B has a particular block structure. The following is a straightforward extension of Theorem 6.4.1 of [59] to rank deficient blocks and non-orthonormal bases (a slightly less general version was described in [3]).

Theorem 2.

Consider the null-space basis \(Z_{F} \in \mathbb {R}^{n \times (n-r_{1})}\) of the matrix \(F \in \mathbb {R}^{k_{1} \times n}\) of rank r1 and the null-space basis \(Z_{G} \in \mathbb {R}^{(n-r_{1}) \times (n-r_{1}-r_{2})}\) of the matrix \(GZ_{F} \in \mathbb {R}^{k_{2} \times (n-r_{1})}\) of rank r2, where \(G \in \mathbb {R}^{k_{2} \times n}\). Then the columns of the matrix \(Z_{F}Z_{G} \in \mathbb {R}^{n \times (n-r_{1}-r_{2})}\) form a basis of \(\mathcal {N}(F) \cap \mathcal {N}(G)\).

This result allows Algorithms 2 and 3 to be generalized by adding rows (or blocks of rows) sequentially to B; a sketch of the mechanism follows Algorithm 5 below. An important application for such a construction is when a sequence of problems is generated by successively modifying B through the addition of further rows. A special case of a procedure of this type was proposed by Howell [3]. To demonstrate the mechanism, we introduce Algorithm 5, which applies Algorithm 3 repeatedly to τ ≥ 1 row blocks of B; the null-space basis construction exploits Theorem 2. For simplicity, we assume that the rows of B are dense and rank(B) = k (so that all the row blocks are also of full rank). In practice, B may contain zeros that fill in as the algorithm proceeds.

[Algorithm 5: composite null-space basis construction over row blocks of B]
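The composite mechanism is compact to express: process the row blocks in turn, each time constructing a null-space basis of the current block restricted to the space spanned so far, and accumulate the product (Theorem 2). A sketch, where nullbasis is any routine returning a null-space basis matrix (for example, the fundamental_basis sketch of Section 3.2.2):

```python
import numpy as np

def composite_nullspace(blocks, nullbasis):
    """Sketch of the composite construction: blocks is a list of row
    blocks of B; nullbasis(M) returns a basis matrix for N(M)."""
    Z = np.eye(blocks[0].shape[1])
    for F in blocks:
        Z = Z @ nullbasis(F @ Z)   # by Theorem 2, spans the intersection
    return Z
```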

An important practical application of Algorithm 5 is B having a non-trivial block angular form as this can be exploited to reduce the work. Assume that B (of full row rank) can be ordered to the block angular form

$$ B = \left( \begin{array}{ccccc} \bar B_{1} &&&& \bar D_{1} \\ & \bar B_{2} &&& \bar D_{2} \\ && {\ddots} && {\vdots} \\ &&& \bar B_{s} & \bar D_{s} \end{array}\right), $$
(22)

where \(\bar B_{i} \in \mathbb {R}^{k_{i} \times n_{i}}\) and \(\bar D_{i} \in \mathbb {R}^{k_{i} \times n_{d}}\). If \(\mathcal {N}(B)\) can be composed from the null-space bases of the \(\bar B_{i}\) extended by zeros outside their domains, then the null-space basis matrix Z is of block angular form; see also the framework of substructuring in [47].

In our experiments, we consider the special case in which each row block contains a single row but B is not assumed to have full row rank. The approach is given in Algorithm 6 and illustrated in Fig. 3. It constructs the null-space basis Z as a product of bases Zi corresponding to the rows of B. Let \((\widetilde B)_{1:i,:}\) be the i × n matrix with rows \(\tilde {b^{T}_{j}}, j=1, \ldots , i\). Then at Line 4, li = l − 1 if \(rank((\widetilde B)_{1:i,:})> rank((\widetilde B)_{1:i-1,:})\) and li = l otherwise.

[Algorithm 6: composite null-space basis construction adding one row at a time]

Fig. 3: Simple 3 × 6 example illustrating Algorithm 6

4 Numerical experiments

In this section, we present numerical results to illustrate the potential of the proposed approaches for computing null-space bases for solving symmetric saddle point problems with a small non zero (2, 2) block. We start by presenting detailed results for a least squares problem that has 7 dense rows. In particular, we examine the effects of varying the threshold parameter θ on the construction of the null-space basis, reporting on its sparsity and orthogonality. Then, for a wider set of least squares problems that have a few dense rows and using a fixed threshold, we compare the performance of Algorithms 3, 4 and 6 in terms of the sparsity of ZTCsZ and its condition number. Finally, we consider a problem coming from quadratic programming.

With the exception of PDE1 (which comes from the CUTEst linear program test set [60]), the test problems given in Table 1 are from the SuiteSparse Matrix Collection. If necessary, a test matrix is transposed to give an overdetermined system (m > n). The problems are a subset of those used in the least squares study of Gould and Scott [61]. They were selected because they are real rectangular matrices of full rank that contain a small number of dense rows. Problem PDE1 was included because it is a large problem that illustrates the effects of a single dense row. The row block Ad is identified using the variant of the approach of Meszaros [62] described in [16]. All reported norms are Euclidean norms. In our experiments, for any matrix H, nnz(H) denotes the number of non zero entries, with the number set to those in the lower triangular part when H is symmetric.

Table 1: Least squares test problems. The matrix A is of order m × n and md is the number of rows in Ad

4.1 Results for problem scagr7-2b

Our initial experiments are for problem scagr7-2b. Figure 4 demonstrates the effect of varying the threshold parameter θ ∈ [0.15, 1] in Algorithm 3 on the sparsity of the null-space basis matrix Z and on the transformed normal matrix ZTCsZ. It confirms the significant advantage of using a small θ. Once θ is sufficiently small, there are sufficient degrees of freedom to find a suitable null-space basis Z and the sparsity of the matrices Z and ZTCsZ does not decrease. The sparsity patterns of Z and ZTCsZ for θ = 0.15, 0.5 and 1 are given in Figs. 5 and 6.

Fig. 4: Dependence of the number of entries in Z (left) and in the triangular part of ZTCsZ (right) on the threshold parameter θ for the problem scagr7-2b. Z is computed using Algorithm 3

Fig. 5: Sparsity pattern of Z for θ = 0.15 (left), θ = 0.5 (centre), and θ = 1 (right) for the problem scagr7-2b. Z is computed using Algorithm 3

Fig. 6: Sparsity pattern of ZTCsZ for θ = 0.15 (left), θ = 0.5 (centre), and θ = 1 (right) for the problem scagr7-2b. The number of entries in ZTCsZ is 11,328, 1,103,582 and 1,674,118, respectively. Z is computed using Algorithm 3

The effects of varying θ on the orthogonality of the null-space basis (here and elsewhere this is measured as the norm of BZ) and on the condition number cond(ZTCsZ) are illustrated in Fig. 7. We see that, except for very small θ, ∥BZ∥ is small and there is little variation in cond(ZTCsZ).

Fig. 7: Dependence of the orthogonality of the null-space basis (left) and of cond(ZTCsZ) (right) on the threshold parameter θ for the problem scagr7-2b. Z is computed using Algorithm 3

We also tested the fundamental null-space basis approaches discussed in Section 3.2.2. The standard approach based on the pivoted QR factorization (17) finds a well-conditioned submatrix and nnz(ZTCsZ) \(\approx 7 \times 10^{6}\) for a range of values of θ. For right oblique conjugation (Algorithm 4), cond(ZTCsZ) \(= 2.8 \times 10^{9}\) (independently of θ) and nnz(ZTCsZ) \(\approx 1.4 \times 10^{6}\). The greater density for the fundamental null-space approach is potentially an important disadvantage if it is used, as here, in combination with a sparse direct solver. However, we anticipate that the approach may be more attractive if ZTCsZ is applied implicitly, such as in the employment of an iterative solver; we plan to investigate this further in the future.

In Fig. 8, we plot cond(ZTZ) and ratio = ∥ATres∥/∥res∥ for Algorithms 3, 4 and 6 for the threshold parameter θ ∈ [0.15, 1] (here res = b − Ax is the least squares residual). We see that Algorithm 4 leads to the smallest condition numbers but the variation in ratio for the different approaches is much smaller.

Fig. 8: Dependence of the condition number of ZTZ (left) and the ratio ∥ATres∥/∥res∥ (right) on the threshold parameter for the problem scagr7-2b. Results for Algorithm 3 are represented by the blue solid line, Algorithm 4 by the red dotted line, and Algorithm 6 by the black dashed line

4.2 Experiments on other matrices

Table 2 presents results for other least squares test examples. In these experiments, the threshold parameter is θ = 0.25. The emphasis here is on comparing the proposed approaches for computing a null-space basis for wide \(B=A_{d} \in \mathbb {R}^{m_{d} \times n}\). In particular, we compare the standard pivoted QR computation of the fundamental null-space basis given by (17) with Algorithm 3 and the right conjugation approach of Algorithm 4.

Table 2: Results for least squares examples with the fixed threshold parameter θ = 0.25

The results demonstrate that, as expected, the different approaches lead to null-space bases with complementary properties, with no single approach being uniformly advantageous.

In particular, we see that for the chosen threshold parameter, Algorithms 3 and 6 result in sparse transformed matrices but can lead to a large condition number (as illustrated by problem lp_agg). Furthermore, comparing the pivoted QR and Algorithm 4, which both construct fundamental bases, we see that the condition number estimates are similar but the latter produces a sparser transformed normal matrix.

4.3 Results for problem hues_mod

The experiments presented so far targeted the saddle point formulation of sparse-dense least squares problems. The next example, hues_mod, comes from a convex quadratic programming problem and is taken from the CUTEst test set [60]. While the (1, 1) Hessian block of size n = 10000 is well conditioned, the k = 2 rows that form the off-diagonal constraint block B are dense with entries that differ by 6 orders of magnitude. This is potentially challenging when constructing a sparse null-space basis matrix Z. The (2, 2) block is \(C = 10^{-6} I_{2}\). In Fig. 9, we report results for Algorithm 3 for different values of the threshold pivoting parameter θ. We see that, even for small thresholds (θ ≈ 0.1), the orthogonality of Z is very good, and both the number of entries nnz(ZTHZ) in the transformed (1, 1) block and the number in its factor increase steadily with θ. Note that, for this problem, there is little fill in the factor of ZTHZ.

Fig. 9: Dependence of the number of entries in the transformed (1, 1) block (top left) and in its Cholesky factor (top right), the orthogonality of Z measured as ∥BZ∥ (bottom left), and the residual norm (bottom right) on the threshold parameter for the problem hues_mod. Z is computed using Algorithm 3

In Fig. 10, we illustrate how the total solution time (in seconds) is influenced by the threshold parameter. We see that using a large threshold is expensive.

Fig. 10: Dependence of the solution time in seconds on the threshold parameter for the problem hues_mod. Z is computed using Algorithm 3

5 Concluding remarks and future directions

In this paper, we have proposed a new null-space approach for solving general symmetric saddle point systems with a small non zero (2, 2) block and a (2, 1) block that may be rank deficient. An important motivation was solving large-scale linear least squares problems in which the system matrix has a small number of rows that are considered to be dense. Because the success of null-space approaches depends on being able to construct appropriate null-space bases, we have looked at how this can be done stably for our applications. In particular, our emphasis has been on null-space bases for k × n matrices that are wide (k ≪ n) and possibly dense.

The standard QR-based fundamental null-space basis computation leads to a transformed matrix that is relatively dense but has the advantage of being generally well conditioned. If a sparse direct solver is employed, the blocks of the transformed matrix must be constructed explicitly and the factors will further fill in. In this case, the QR approach is not ideal; indeed, memory requirements limit the size of systems that can be tackled. Null-space bases computed using right conjugation are also well conditioned and offer the possibility of sparser transformed matrices.

Fundamental null-space bases are potentially attractive for iterative solvers if the basis can be efficiently applied implicitly and provided an effective preconditioner is available. In the future, we plan to develop preconditioners for use with an iterative solver for the solution of large-scale saddle-point systems with a small non zero (2, 2) block via our proposed null-space transformation. Possible lines of research are the left inverses proposed by Nash and Sofer [51] and the factorization behind the conjugation process. Preconditioning of the transformed system based on constructing Z using Algorithm 3 with a small threshold parameter may be more straightforward but possible ill conditioning must be taken into account.

A further goal will be to satisfy linear constraints with a small residual, that is, using the least squares notation of Section 2.3, if we require Adx = bd then we need to ensure resd = bd − Adx is small. Applying Algorithm 3 to the test example scagr7-2b gives \(\|res_{d}\|_{\infty }/\|res\|_{\infty } \approx 5.7 \times 10^{-2}\), with little variation for different values of the threshold parameter θ. Thus the constraints are not tightly satisfied. In the future, we will explore how we can use the null-space approach presented here in combination with other techniques to reduce \(\|res_{d}\|_{\infty }\).