1 Introduction

The robust estimation of camera geometry, one of the most important tasks in computer vision, is usually based on solving so-called minimal problems [26, 27, 43], i.e., problems that are solved from minimal samples of input data, inside a RANSAC framework [13, 18, 45]. Since the camera geometry estimation has to be performed many times inside RANSAC [18], fast solvers to minimal problems are of high importance. Minimal problems often result in polynomial systems with the following characteristics:

  1.

    They are usually non-square, with more polynomials than the number of variables.

  2.

    The coefficients of the constituent polynomials are usually non-generic, i.e., algebraically constrained [15, p. 109].

  3.

    The constituent polynomials have the same structure, i.e. the same monomials, but the coefficient values vary with the input measurements. This property has enabled an offline + online strategy, which moves as many computations as possible from the “online” stage of solving equations to an earlier pre-processing “offline” stage.

Most of the state-of-the-art specific minimal solvers are based on Gröbner bases and the action matrix method [15, 47]. The Gröbner basis method was popularized in computer vision by Stewenius [50]. The first efficient Gröbner basis-based solvers were mostly hand-crafted [48, 49] and sometimes very unstable [51]. However, in the last 15 years much effort has been put into making the process of constructing the solvers more automatic [27, 32, 33], more stable [9, 10], and more efficient [5, 31,32,33, 35, 38]. Powerful tools are now available for the automatic generation of efficient solvers based on Gröbner bases [27, 32].

Note that while all these methods are known in the computer vision community as Gröbner basis methods, most of them do not generate solvers that directly compute Gröbner bases. They usually compute polynomials from a border basis [40] or go “beyond” Gröbner and border bases [35]. However, all these methods are based on computing an action matrix (Footnote 1). Therefore, for the sake of simplicity, in this paper we will use “action matrix method” and “Gröbner basis method” interchangeably to refer to the same class of methods [5, 27, 31,32,33, 35, 38, 47].

The first step in such action matrix methods is to compute a linear basis B of the quotient ring \(A = \mathbb {C}[X]/I\), where I denotes the ideal generated by the input polynomial system. An action matrix method based on Gröbner bases computes the basis of A by defining the division w.r.t. some Gröbner basis of the ideal I. In general, computing a Gröbner basis requires fixing a monomial ordering. Alternatively, a heuristic basis sampling method [35] that goes beyond Gröbner bases and monomial orderings can be used to compute B. The basis sampling method is related to the border basis methods [39, 40], which generalize the concept of monomial ordering and even propose a non-monomial basis of A. The second step in these action matrix methods is to compute a linear map \(T_f\), representing multiplication in A by some polynomial f. The representative matrix \(\texttt{M}_f\) of the map \(T_f\) is what we call the action matrix. Recently, [54, 55] generalized the method used for generating the basis B. The proposed methods, known as normal form methods, were developed for general systems of polynomial equations and are not tailored to the problems that appear in computer vision.

While the action matrix method for generating efficient minimal solvers has been thoroughly studied in computer vision, and all recently generated action-matrix-based solvers are highly optimized in terms of efficiency and stability, little attention has been paid to an alternative algebraic method for solving systems of polynomial equations, namely the resultant method [15, Chapter 3]. The resultant method was manually applied to several computer vision problems [20, 23, 26, 28]. However, in contrast to the action matrix method, there is no method for automatically generating sparse resultant-based minimal solvers. The most relevant works in this direction are the methods using the subdivision of the Minkowski sum of all the Newton polytopes [11, 12, 16] and the iterative method based on the Minkowski sum of a subset of Newton polytopes [21]. However, the subdivision methods are not applicable to polynomial systems with non-generic coefficients, whereas the method in [21] leads (due to linearizations) to larger and less efficient solvers than the action matrix solvers.

Our first objective in this paper is to study the theory of sparse resultants and to propose an approach for generating efficient solvers for camera geometry problems by using an extra polynomial with a special form. Our approach, based on the Newton polytopes and their Minkowski sums, generates sparse resultant matrices, followed by a Schur complement computation and a conversion of the resultant constraint into an eigenvalue problem.

Our approach differs from the subdivision methods [11, 16] in how the Newton polytopes are used for generating minimal solvers. The subdivision methods compute the Minkowski sum of the Newton polytopes of all the polynomials and divide its lattice interior into cells, which are used to construct the sparse resultant matrices. In this paper, by contrast, we make this process iterative: we compute the Minkowski sum for each subset of Newton polytopes (of the input polynomials) and avoid computing a subdivision, using the lattice interior directly to generate a candidate sparse resultant matrix. For systems with non-generic coefficients, such an iterative approach has been crucial for generating sparse resultant matrices for the minimal problems in computer vision. The sparse resultant matrix and the eigenvalue problem together represent our minimal solver. The solvers based on our proposed approach achieve significant improvements over the state-of-the-art sparse resultant-based solvers [11, 16, 21] and comparable or better efficiency and/or accuracy than state-of-the-art Gröbner basis-based solvers [32, 35, 38]. Moreover, our proposed approach has been fully automated [2] and can be incorporated in the existing tools for the automatic generation of efficient minimal solvers [27, 32, 35].

There is a similarity between the solvers obtained using our sparse resultant-based approach and the solvers based on the action matrix methods [8, 27, 32, 35, 37, 38, 47]. Therefore, the second objective of this paper is to investigate this similarity. In a step-by-step manner, we demonstrate that for a minimal solver generated using the action matrix method, we can change the steps performed by the sparse resultant method such that it leads to an equivalent solver. Similarly, we demonstrate that for a given minimal solver generated using the sparse resultant method, we can change the steps performed by the action matrix method such that it leads to an equivalent solver. Specifically, our contributions are:

  1.

    A novel approach (Sect. 3) of adding an extra polynomial with a special form, for generating a sparse resultant matrix whose resultant constraint can be decomposed (Sect. 3.3) into an eigenvalue problem.

    • A scheme (Sect. 3.2) to iteratively test the Minkowski sum of the Newton polytopes of each subset of polynomials, searching for the most efficient minimal solver, in the presence of algebraically constrained coefficients.

    • Two procedures (Sect. 3.4) to reduce the sparse resultant matrix size, leading to comparable or better solver speeds than those generated by many state-of-the-art Gröbner basis-based methods.

    • A general method for automatic generation of efficient minimal solvers. The automatic generator is publicly available at [2].

  2.

    A study of the constraints (Sect. 4) to be satisfied so that the solvers based on our sparse resultant method as well as the action matrix method are equivalent, i.e. they involve eigenvalue decomposition of exactly the same matrix, and the steps performed for constructing that matrix can be interchanged.

This paper is an extension of our work [3], where we proposed an extra polynomial sparse resultant method for generating efficient minimal solvers.

2 Theoretical Background and Related Work

In this section, we summarize the basic concepts and notation used in the paper. This notation is based on the book [15], to which we refer the reader for more details.

Our goal is to compute the solutions to a system of m polynomial equations,

$$\begin{aligned} f_1(x_1,\dots ,x_n) = 0,\dots , f_m(x_1,\dots ,x_n)=0, \end{aligned}$$
(1)

in n variables, \(X = \lbrace x_1,\dots ,x_n \rbrace \). Let us denote the set of polynomials from (1) as \(\mathcal {F}=\{f_1,\dots ,f_m\}\). The variables in X can be ordered and formed into a vector. W.l.o.g. let us assume that this vector has the form, \(\textbf{x} = \begin{bmatrix} x_1&\dots&x_n \end{bmatrix}^\top \). Then for a vector \(\mathbf {\alpha } =\begin{bmatrix} \alpha _1&\dots&\alpha _n \end{bmatrix}^\top \in \mathbb {N}^{n}\), a monomial \(\textbf{x}^\mathbf {\alpha }\) is a product \(\textbf{x}^\mathbf {\alpha } = \prod _{i=1}^{n} x_i^{\alpha _i}\). Each polynomial \(f_i \in \mathcal {F}\) is expressed as a linear combination of a finite set of monomials

$$\begin{aligned} f_{i}(\textbf{x}) = \underset{\mathbf {\alpha } \in \mathbb {N}^{n}}{\Sigma } c_{i,\mathbf {\alpha }} \textbf{x}^{\mathbf {\alpha }}. \end{aligned}$$
(2)

We collect all monomials present in \(\mathcal {F}\) and denote the set as \(\text {mon}(\mathcal {F})\). Let \(\mathbb {C}[X]\) denote the set of all polynomials in unknowns X with coefficients in \(\mathbb {C}\). The ideal \(I = \langle f_1,\dots ,f_m \rangle \subset \mathbb {C}[X]\) is the set of all polynomial combinations of the generators \(f_1,\dots ,f_m\). The set V of all solutions of the system (1) is called an affine variety. Each polynomial \(f \in I\) vanishes on the solutions of (1). Here we assume that the ideal I generates a zero-dimensional variety, i.e., the system (1) has a finite number of solutions (say r). Using the ideal I, we can define the quotient ring \(A = \mathbb {C}[X]/I\) as the set of equivalence classes in \(\mathbb {C}[X]\), where the equivalence is defined by the relation \(a \sim b \iff (a - b) \in I\). If I has a zero-dimensional variety, then the quotient ring A is a finite-dimensional vector space over \(\mathbb {C}\) [15]. For an ideal I, there exist special sets of generators called Gröbner bases, which have the nice property that the remainder after division is unique. Using a Gröbner basis we can define a linear basis for the quotient ring A.
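As a simple illustration, consider the single polynomial \(f = x^2 - 5x + 6\): the ideal \(I = \langle f \rangle \subset \mathbb {C}[x]\) has the variety \(V = \lbrace 2, 3 \rbrace \), and the quotient ring \(A = \mathbb {C}[x]/I\) is a two-dimensional vector space with the monomial basis \(\lbrace 1, x \rbrace \). In general, the dimension of A equals the number of solutions r, counted with multiplicity.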

Non-generic polynomial system In geometric computer vision, each coefficient, say \( c_{i,\mathbf {\alpha }}\) in (2), can be expressed as a parametric form, \( c_{i,\mathbf {\alpha }} = \phi (\mathcal {D})\) of the data measurements \(\mathcal {D}\). One of the important properties of the set of the coefficients of such a polynomial system is that of non-genericity:

Definition 1

The coefficients \(\lbrace c_{i,\mathbf {\alpha }} \rbrace \) of \(\mathcal {F}\) in (2), are defined to be non-generic, if and only if we can obtain a polynomial constraint on them by eliminating the parameters \(\mathcal {D}\).

A rigorous definition of the genericity of a property of a system is given in [15, p. 109].

Vector representation of a set Given a set B, one of the steps in this paper is to express it as a column vector by arranging its elements w.r.t. a given ordering. Let us denote this operation as

$$\begin{aligned} \text {vec}(B). \end{aligned}$$
(3)

Matrix form In this paper, we will need to express a given set of polynomials \(\mathcal {F}\) via matrix multiplication. To achieve this, we fix an order for the polynomials in \(\mathcal {F}\), and also do the same for the monomials in \(B = \text {mon}(\mathcal {F})\). The matrix representation of \(\mathcal {F}\) has the form

$$\begin{aligned} \texttt{C}([c_{i, \mathbf {\alpha }}]) \ \textbf{b}, \end{aligned}$$
(4)

where \(\textbf{b}\) is a column vector of the monomials in B ordered w.r.t. the given monomial ordering and \(\texttt{C}([c_{i, \mathbf {\alpha }}])\) is the matrix of coefficients \(c_{i, \mathbf {\alpha }}\) of \(\mathcal {F}\). The entries of \( \texttt{C}([c_{i, \mathbf {\alpha }}])\) are row-indexed by the ordered set of polynomials in \(\mathcal {F}\) and column-indexed by the monomials in \(\textbf{b}\).

Monomial multiples of a polynomial set We begin with a polynomial system \(\mathcal {F}\) and the set of monomials in \(\mathcal {F}\), i.e., \(B = \text {mon}(\mathcal {F})\). Let \(B^\prime \supset B\) be another set of monomials.

Then an important step in our proposed method is to extend \(\mathcal {F}\) to the largest possible set, say \(\mathcal {F}^\prime \), such that \(\text {mon}(\mathcal {F}^\prime ) \subseteq B^\prime \). This extension is done by multiplying each \(f_i \in \mathcal {F}\) with monomials, i.e., we generate polynomials from \(I = \langle f_1,\dots , f_m \rangle \). For this, we compute the set \(T_i\) of all possible monomials \(T_i = \lbrace \textbf{x}^{\mathbf {\alpha }} \rbrace \) for each \(f_i \in \mathcal {F}\), such that \(\text {mon}(\textbf{x}^{\mathbf {\alpha }} f_i) \subset B^\prime , \ \textbf{x}^{\mathbf {\alpha }} \in T_i \). We will use the following shorthand notation to express this operation

$$\begin{aligned} \mathcal {F}\overset{B^\prime }{\rightarrow }\ (\mathcal {F}^\prime , T), \end{aligned}$$
(5)

where \(T = \lbrace T_1,\dots ,T_m \rbrace \). Subsequently, in this operation we assume to have removed all monomials in \(B^\prime \) which are not present in the extended set \(\mathcal {F}^\prime \) and denote the modified set as \(B^\prime \) by abuse of notation. In other words, we will assume that \(B^\prime = \text {mon}(\mathcal {F}^\prime )\).

2.1 The Action Matrix Method

One of the well-known SOTA methods for polynomial solving is the Gröbner basis-based action matrix method [15, 47, 52]. It has recently been used to efficiently solve many minimal problems in computer vision [8, 9, 26, 27, 32, 35, 47]. It transforms the problem of finding the solutions to (1) into a problem of eigendecomposition of a special matrix. We list the steps performed by an action matrix method in “Appendix A”, and note here that the algorithm can essentially be distilled into a sequence of three important steps: construct the set \(T_j\) of monomial multiples for each of the input polynomials \(f_j \in \mathcal {F}\), extend \(\mathcal {F}\) via monomial multiplication to \(\mathcal {F}^{\prime }\), and linearize the resulting system as a matrix product \(\texttt{C} \textbf{b}\). A G–J elimination of \(\texttt{C}\) is then used to extract the required action matrix, whose eigenvalues give us the roots of \(\mathcal {F}\).
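The simplest instance of this pipeline is the univariate case, where the action matrix of multiplication by x in \(\mathbb {C}[x]/\langle f \rangle \) is the classical companion matrix and its eigenvalues are the roots of f. The following sketch (Python with numpy, an illustration of the principle rather than of any generated solver) makes this concrete.

```python
import numpy as np

# f(x) = x^3 - 6x^2 + 11x - 6 = (x - 1)(x - 2)(x - 3); basis of C[x]/<f> is {1, x, x^2}.
c0, c1, c2 = -6.0, 11.0, -6.0          # f = x^3 + c2*x^2 + c1*x + c0

# Action (companion) matrix of multiplication by x w.r.t. the basis {1, x, x^2}:
# x*1 = x,  x*x = x^2,  x*x^2 = x^3 = -c2*x^2 - c1*x - c0  (mod f).
M = np.array([[0.0, 0.0, -c0],
              [1.0, 0.0, -c1],
              [0.0, 1.0, -c2]])

print(np.sort(np.linalg.eigvals(M).real))   # -> [1. 2. 3.], the roots of f
```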

2.2 Resultants

Alternative to the action matrix method, we have the method of resultants for polynomial solving. Originally, resultants were used to determine whether a system of \(n+1\) polynomial equations in n unknowns has a common root. Let us have a system of polynomial equations as defined in (1) and (2), and assume that \(m=n+1\).

Definition 2

The resultant \(Res([\lbrace c_{i,\mathbf {\alpha }} \rbrace ])\) of \(\mathcal {F}\) is defined to be an irreducible polynomial in the coefficients \(\lbrace c_{i,\mathbf {\alpha }} \rbrace \) which vanishes only if \(\lbrace c_{i,\mathbf {\alpha }} \rbrace \) are such that \(\mathcal {F}=0\) has non-trivial roots.

A more formal treatment of the theory of resultants can be obtained from [15, Chapter 3].

2.2.1 Polynomial Solving Using Resultants

A step in a sparse resultant method is to expand the given system of polynomials \(\mathcal {F}\) to a set of linearly independent polynomials. This is usually done by adding some monomial multiples of the original polynomials, i.e., using the operation (5). The expanded set of polynomials, say \(\mathcal {F}^\prime \), can be expressed in a matrix form as

$$\begin{aligned} \texttt{C}([c_{i, \mathbf {\alpha }}]) \ \textbf{b}. \end{aligned}$$
(6)

Usually, \(\texttt{C}([c_{i, \mathbf {\alpha }}])\) in (6) is a tall matrix.

The resultant-based method here requires the coefficient matrix \(\texttt{C}([c_{i, \mathbf {\alpha }}])\) to be a square invertible matrix for randomly assigned values of the coefficients \(c_{i, \mathbf {\alpha }} \in \mathbb {C}_{\ne 0}\), i.e., \(\det \texttt{C}([c_{i, \mathbf {\alpha }}]) \ne 0\). A matrix with such properties is called the resultant matrix, and in this paper we will denote it as \(\texttt{M}([c_{i, \mathbf {\alpha }}])\) (Footnote 2). If \(\texttt{C}([c_{i, \mathbf {\alpha }}])\) is a square invertible matrix, we can rewrite (6) as

$$\begin{aligned} \texttt{M}([c_{i, \mathbf {\alpha }}]) \ \textbf{b}. \end{aligned}$$
(7)

Now, \(\mathcal {F}= 0\) implies \(\mathcal {F}^\prime = 0\), which leads to

$$\begin{aligned} \texttt{M}([c_{i, \mathbf {\alpha }}]) \ \textbf{b} = \textbf{0}. \end{aligned}$$
(8)

Thus, the requirement for \(\mathcal {F}=0\) to have common solutions is the following condition on the coefficients of \(\mathcal {F}\),

$$\begin{aligned} \det \texttt{M}([c_{i, \mathbf {\alpha }}]) = 0. \end{aligned}$$
(9)

We call this the resultant constraint. It is a polynomial with \(c_{i, \mathbf {\alpha }}\) as variables. From the definition of a resultant \(Res([c_{i, \mathbf {\alpha }}])\) [15, Theorem 2.3], we have that \(Res([c_{i, \mathbf {\alpha }}])\) is a polynomial with \(c_{i, \mathbf {\alpha }}\) as variables and that it vanishes iff the system of equations \(\mathcal {F}=0\) has common roots. Since \(\det \texttt{M}([c_{i, \mathbf {\alpha }}]) = 0\) is a necessary condition for the existence of roots of the system \(\mathcal {F}=0\), the determinant must vanish whenever the resultant vanishes, i.e.,

$$\begin{aligned} Res([c_{i, \mathbf {\alpha }}]) = 0 \implies \det \texttt{M}([c_{i, \mathbf {\alpha }}]) = 0. \end{aligned}$$
(10)

This implies that given a polynomial system \(\mathcal {F}\), the resultant constraint (9) is a non-trivial multiple of its resultant \(Res([c_{i, \mathbf {\alpha }}])\).
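A classical example of a resultant matrix is the Sylvester matrix of two univariate polynomials (the case \(n=1\), \(m=2\)); its determinant is the resultant and vanishes exactly when the two polynomials share a root. The sketch below (Python/numpy, with example polynomials chosen by us) illustrates the resultant constraint (9) in this simplest setting.

```python
import numpy as np

def sylvester_resultant(f, g):
    """Resultant of two univariate quadratics f = f2*x^2 + f1*x + f0 and
    g = g2*x^2 + g1*x + g0 (coefficients given highest degree first),
    computed as the determinant of the Sylvester matrix."""
    f2, f1, f0 = f
    g2, g1, g0 = g
    M = np.array([[f2, f1, f0, 0.0],
                  [0.0, f2, f1, f0],
                  [g2, g1, g0, 0.0],
                  [0.0, g2, g1, g0]])
    return np.linalg.det(M)

f = [1.0, -3.0,  2.0]    # (x - 1)(x - 2)
g = [1.0,  4.0, -5.0]    # (x - 1)(x + 5): shares the root x = 1 with f
h = [1.0,  4.0, -4.0]    # no common root with f
print(sylvester_resultant(f, g))   # ~ 0  -> a common root exists
print(sylvester_resultant(f, h))   # != 0 -> no common root
```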

While resultants are defined for a system of one more polynomial than the number of variables, we can employ them for solving a system of n polynomials in n variables. One way to do this is to hide one of the variables to the coefficient field (in other words, to consider it a constant); another way is to add an extra polynomial by introducing a new variable, and then hide this variable to the coefficient field. In both approaches, we end up with a system that has one more polynomial than the number of variables.

2.2.2 Hiding a Variable

By hiding one variable, say \(x_{n}\), to the coefficient field, we are left with n polynomials \(\mathcal {F}\) in \(n - 1\) variables. This gives us a way to use the concept of resultants and compute \(Res([c_{i, \mathbf {\alpha }}], x_{n})\) which now becomes a function of \(c_{i, \mathbf {\alpha }}\) and \(x_{n}\). In this case, (7) becomes

$$\begin{aligned} \texttt{M}([c_{i, \mathbf {\alpha }}],x_{n}) \textbf{b}, \end{aligned}$$
(11)

where the symbols have their usual meaning. For simplicity, we will denote \(\texttt{M}([c_{i, \mathbf {\alpha }}],x_{n})\) as \(\texttt{M}(x_{n})\) in the rest of this paper. Its determinant \(\det \texttt{M}(x_{n})\) is a multiple of the resultant \(Res(x_n)\). This is known as a hidden variable resultant and it is a polynomial in \(x_n\) whose roots are the \(x_{n}\)-coordinates of the solutions of the system of polynomial equations. For theoretical details and proofs we refer to [15, Chapter 7]. Such a hidden variable approach has been used in the past to solve various minimal problems [20, 23, 26, 28].

This approach leads to computing the roots of the polynomial equation \(\det \texttt{M}(x_{n}) = 0 \). However, computing the determinant of a polynomial matrix \(\texttt{M}(x_n)\) and then its roots may be slow and unstable. Therefore, the most common way to solve the original system of polynomial equations is to first transform the following matrix equation

$$\begin{aligned} \texttt{M}(x_{n}) \textbf{b} = \textbf{0}, \end{aligned}$$
(12)

to a polynomial eigenvalue problem (PEP) [14], which is then expressed as,

$$\begin{aligned} (\texttt{M}_{0} + \texttt{M}_{1} \ x_{n}+\cdots + \texttt{M}_{l} \ x^{l}_{n})\textbf{b} = \textbf{0}, \end{aligned}$$
(13)

where l is the degree of the matrix \(\texttt{M}(x_{n})\) in the hidden variable \(x_n\) and \(\texttt{M}_{0},\dots ,\texttt{M}_{l}\) are matrices that depend only on the coefficients \(c_{i, \mathbf {\alpha }}\) of the original system of polynomials. The PEP (13) can be easily converted to a generalized eigenvalue problem (GEP), written as,

$$\begin{aligned} \texttt{A} \textbf{y} = x_{n} \texttt{B} \textbf{y}, \end{aligned}$$
(14)

and solved using standard efficient eigenvalue algorithms [28]. Basically, the eigenvalues give us the solutions to \(x_{n}\), and the rest of the variables can be extracted from the corresponding eigenvectors \(\textbf{y}\). Such a transformation to a GEP relaxes the original problem of finding the solutions to the input polynomial system, since the eigenvectors in general do not satisfy the monomial dependencies induced by the monomial vector \(\textbf{b}\) as well as the monomial vector \(\textbf{y}\). Moreover, this relaxation may also introduce extra parasitic (zero) eigenvalues, leading to slower polynomial solvers.
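For concreteness, the sketch below (Python with numpy/scipy, using random coefficient matrices rather than a real minimal problem) performs the standard companion-style linearization of a degree-2 PEP (13) into a GEP (14) with \(\textbf{y} = \begin{bmatrix} \textbf{b}^\top&x_n \textbf{b}^\top \end{bmatrix}^\top \), and checks that every generalized eigenvalue indeed makes the matrix polynomial singular.

```python
import numpy as np
from scipy.linalg import eig

rng = np.random.default_rng(0)
k = 4                                   # size of the monomial vector b
M0, M1, M2 = (rng.standard_normal((k, k)) for _ in range(3))

# Linearize (M0 + M1*x + M2*x^2) b = 0 using y = [b; x*b]:
#   [ 0    I ] y  =  x [ I   0  ] y
#   [-M0  -M1]         [ 0   M2 ]
I, Z = np.eye(k), np.zeros((k, k))
A = np.block([[Z, I], [-M0, -M1]])
B = np.block([[I, Z], [Z, M2]])

x_vals, Y = eig(A, B)                   # GEP:  A y = x B y
for x in x_vals[np.isfinite(x_vals)]:
    s = np.linalg.svd(M0 + M1 * x + M2 * x**2, compute_uv=False)
    print(complex(x), s[-1])            # the smallest singular value is ~ 0 for every eigenvalue
```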

2.2.3 Adding an Extra Polynomial

Alternatively, we can add a new polynomial

$$\begin{aligned} f_{m+1} = u_0 + u_1 x_1 + \dots + u_n x_n \end{aligned}$$
(15)

to \(\mathcal {F}\). We thus have the augmented polynomial system

$$\begin{aligned} \mathcal {F}_a = \mathcal {F}\cup \lbrace f_{m+1} \rbrace . \end{aligned}$$
(16)

We then compute the so-called u-resultant (see [56] and [15, Chapter 3]) by hiding \(u_0,\dots ,u_n\). In general, random values are assigned to \(u_1,\dots ,u_n\) (and \(u_0\) is a new unknown). Just like in the hidden variable method, this method hides the variable \(u_0\) to the coefficient field and generates the resultant matrix \(\texttt{M}(u_0)\). In this case, the extended system of polynomials (7) can be expressed as a matrix product as

$$\begin{aligned} \texttt{M}([c_{i, \mathbf {\alpha }}], u_0) \ \textbf{b}, \end{aligned}$$
(17)

where \(u_0\) is a new unknown variable and \(\textbf{b}\) is a vector of all monomials in this extended system. We will denote \(\texttt{M}([c_{i, \mathbf {\alpha }}], u_0)\) as \(\texttt{M}(u_0)\) in order to simplify the notation. Again, it holds that \(\text {det} \ \texttt{M}(u_0)\) is a multiple of the resultant \(Res(u_0)\). Thus, in order to compute the common zeros of \(\mathcal {F}=0\), we need to solve \(\text {det} \texttt{M}(u_0)=0\), similar to the case of the hidden variable resultant. Instead of computing the determinant of a polynomial matrix, here we also solve it as a GEP, as described in Sect. 2.2.2. However, as for the hidden variable method, such an approach introduces spurious solutions, arising from the linearization of the original system in (17), i.e., from not considering the monomial dependencies in \(\textbf{b}\), as well as from transforming the PEP to a GEP and not considering the monomial dependencies in \(\textbf{y}\) in (14). Some of these spurious solutions (eigenvalues) can be removed by using the Schur complement, described in the section below, or by applying a method for removing zero eigenvalues, similar to [4, 26].

2.2.4 Schur Complement

One way to remove the spurious solutions introduced by the linearization of a hidden variable resultant matrix \(\texttt{M}(u_0)\) is to use the Schur complement of one of its submatrices. Here, we briefly review this method. Let us first consider some partition of the set of monomials B as

$$\begin{aligned} B = B_1 \cup B_2. \end{aligned}$$
(18)

Note that \(\textbf{b} = \text {vec}(B) = \begin{bmatrix} \textbf{b}_1&\textbf{b}_2 \end{bmatrix}^\top \), where \(\textbf{b}_1=\text {vec}(B_1)\) and \(\textbf{b}_2=\text {vec}(B_2)\). The vectorization operation is defined in Eq. (3). This imposes a column partition on \(\texttt{M}(u_0)\) in (17). Moreover, we can order the rows of \(\texttt{M}(u_0)\) such that its upper block is independent of \(u_0\). Together, we obtain the following block partition of \(\texttt{M}(u_0)\):

$$\begin{aligned} \texttt{M}(u_0)= \begin{bmatrix} \texttt{M}_{11} &{} \texttt{M}_{12} \\ \texttt{M}_{21}(u_0) &{} \texttt{M}_{22}(u_0) \end{bmatrix}. \end{aligned}$$
(19)

Here, the upper block \(\begin{bmatrix} \texttt{M}_{11}&\texttt{M}_{12} \end{bmatrix}\) is independent of \(u_0\). Thus we can write (17) as

$$\begin{aligned} \begin{bmatrix} \texttt{M}_{11} &{} \texttt{M}_{12} \\ \texttt{M}_{21}(u_0) &{} \texttt{M}_{22}(u_0) \end{bmatrix} \ \begin{bmatrix} \textbf{b}_1 \\ \textbf{b}_2 \end{bmatrix}. \end{aligned}$$
(20)

The requirement for existence of solutions of \(\mathcal {F}= 0\) is that the vector in (20) should vanish. We thus obtain the following two vector equations

$$\begin{aligned}{} & {} \texttt{M}_{11} \textbf{b}_{1} + \texttt{M}_{12} \textbf{b}_{2} = \textbf{0} \end{aligned}$$
(21)
$$\begin{aligned}{} & {} \texttt{M}_{21}(u_0) \textbf{b}_{1} + \texttt{M}_{22}(u_0) \textbf{b}_{2} = \textbf{0}. \end{aligned}$$
(22)

If we can partition \(\texttt{M}(u_0)\) such that \(\texttt{M}_{12}\) is a square invertible matrix, we can eliminate \(\textbf{b}_2\) from these two equations to obtain

$$\begin{aligned} (\underbrace{\texttt{M}_{21}(u_0) - \texttt{M}_{22}(u_0) \texttt{M}^{-1}_{12} \texttt{M}_{11}}_{\texttt{X}(u_0)}) \textbf{b}_{1} = \textbf{0}. \end{aligned}$$
(23)

The matrix \(\texttt{X}(u_0)\) is the Schur complement of the block \(\texttt{M}_{12}\) of \(\texttt{M}(u_0)\), which has the following property:

$$\begin{aligned} \text {det} (\texttt{M}(u_0)) = \text {det}(\texttt{M}_{12}) \ \text {det}(\texttt{X}), \end{aligned}$$
(24)

where \(\text {det}(\texttt{M}_{12}) \ne 0\) by our assumption. Therefore, for a generic value of \(u_0\), \(\text {det}(\texttt{M}(u_0)) \ne 0 \Leftrightarrow \text {det}(\texttt{X}(u_0)) \ne 0\), which means that \(\texttt{X}(u_0)\) is a square invertible matrix and its determinant is also a multiple of the resultant. However, \(\texttt{X}(u_0)\) corresponds to a smaller eigenvalue problem than the one obtained from the sparse resultant matrix \(\texttt{M}(u_0)\).

Both the resultant-based methods described in Sects. 2.2.2 and 2.2.3 were proposed for square generic systems [16], via a mixed subdivision of polytopes. However, the polynomial systems studied here are usually sparse, overdetermined, and consist of polynomials with non-generic coefficients (see Definition 1). The sparsity of the systems can be exploited to obtain more compact resultants using specialized algorithms. Such resultants are commonly referred to as sparse resultants.

Fig. 1 An example of the Newton polytopes of two polynomials as well as their Minkowski sum: (a) \(P_1 = \text {NP}(f_1)\), (b) \(P_2 = \text {NP}(f_2)\), and (c) \(Q = P_1 + P_2\)

2.2.5 Polyhedral Geometry

Sparse resultants are studied via the theory of polytopes [15, Chapter 7]. Therefore, we define the important terms and notations related to polytopes, which we will later use in the text.

The Newton polytope \(\text {NP}(f)\) of a polynomial f is defined as the convex hull of the exponent vectors of all the monomials occurring in the polynomial (this set of exponent vectors is known as the support of the polynomial). Therefore, we have \(\text {NP}(f_{i}) = \text {Conv}(A_{i})\), where \(A_{i}\) is the set of all integer vectors that are exponents of monomials with nonzero coefficients in \(f_{i}\). The Minkowski sum of two convex polytopes \(P_1, P_2\) is defined as \(P_1+P_2 = \lbrace p_1 + p_2 \mid p_1 \in P_1, p_2 \in P_2 \rbrace \). We demonstrate the concept of a Newton polytope using a simple example.

Example 1

Let us consider a system of two polynomials \(\mathcal {F}= \lbrace f_1(\textbf{x}), f_2(\textbf{x}) \rbrace \), in two variables \(X = \lbrace x_1, x_2 \rbrace \)

$$\begin{aligned} f_1 = {c_{1,1}} {x_1}^{3} {x_{2}}^{3} + {c_{1,2}} {x_{1}}^{2} {x_{2}}^{3} + {c_{1,3}} {x_{1}}^{3} {x_{2}}^{2} + {c_{1,4}} {x_{1}}^{2} {x_{2}}^{2} + {c_{1,5}} {x_{2}}^{3} + {c_{1,6}} {x_{1}}^{2} {x_{2}} + {c_{1,7}} {x_{2}}^{2} + {c_{1,8}} {x_{1}} {x_{2}} + {c_{1,9}} {x_{1}}^{2} + {c_{1,10}} {x_{2}} \end{aligned}$$
(25)
$$\begin{aligned} f_2 = {c_{2,1}} {x_{1}}^{2} + {c_{2,2}} {x_{2}} + {c_{2,3}} {x_{1}} + {c_{2,4}}. \end{aligned}$$
(26)

Here, \(\lbrace c_{1,1},\dots , c_{1,10},c_{2,1},\dots ,c_{2,4} \rbrace \subset \mathbb {C} \) is the set of coefficients of \(\mathcal {F}\). The supports of \(\mathcal {F}\) are

$$\begin{aligned} A_{1} = \lbrace [3,3],[2,3],[3,2],[2,2],[0,3],[2,1],[0,2],[1,1],[2,0],[0,1] \rbrace \subset \mathbb {Z}^2 \end{aligned}$$
(27)
$$\begin{aligned} A_{2} = \lbrace [2,0],[0,1],[1,0],[0,0] \rbrace \subset \mathbb {Z}^2. \end{aligned}$$
(28)

The Newton polytopes \(P_1 = \text {NP}(f_1)\), \(P_2 = \text {NP}(f_2)\) as well as the Minkowski sum \(Q = P_1 + P_2\) are depicted in Fig. 1.

2.2.6 Sparse Resultants

Consider the SOTA methods [11, 16, 17] for computing the sparse resultant of a well-determined generic system \(\mathcal {F}\). The main idea in all such methods is to compute the Minkowski sum of the Newton polytopes of all the polynomials \(f_i \in \mathcal {F}\) and divide its interior into cells. Each cell determines a multiple of one of the input polynomials \(f_i \in \mathcal {F}\) with a monomial. All such monomial multiples collectively lead to an extended polynomial system \(\mathcal {F}^\prime \). The extended polynomial system, via linearization into a matrix product, then leads to a sparse resultant matrix \(\texttt{M}(x_n)\), if using the hidden-variable technique in Sect. 2.2.2, or \(\texttt{M}(u_0)\), if using the u-resultant of the general form in Sect. 2.2.3. As a result, the generated solvers are usually quite large and not very efficient.

However, if \(\mathcal {F}\) has non-generic coefficients (see Definition 1), these methods may fail, as the algebraic constraints on the coefficients lead to algebraic constraints on the elements of the sparse resultant matrices \(\texttt{M}(x_n)\) or \(\texttt{M}(u_0)\), and hence they may not satisfy the necessary rank conditions. For such systems, [21] recently proposed an iterative approach based on the hidden variable resultant to test and extract \(\texttt{M}(x_n)\) in (7). Thereafter, it transforms (7) to a GEP (14) and solves for eigenvalues and eigenvectors to compute the roots. Our proposed approach extends this iterative scheme for generating a sparse resultant matrix \(\texttt{M}(u_0)\), specifically for the extra polynomial approach in Sect. 2.2.3.

3 Proposed Extra Polynomial Resultant Method

In this paper, we propose an iterative approach based on [21] and apply it to a modified version of the u-resultant method described in Sect. 2.2.3. Specifically, we propose the following modifications to the u-resultant method.

  1.

    Instead of the general form (15) of the extra polynomial in the u-resultant-based method, in this paper, we propose to use the following special form of the extra polynomial

    $$\begin{aligned} f_{m+1} = x_{k} - u_0, \end{aligned}$$
    (29)

    where \(u_0\) is a new variable and \(x_k \in X\) is one of the input variables. In general, we can select any \(x_k \in X\). However, since in practice, selecting different \(x_k\)’s leads to solvers of different sizes and with different numerical stability, we test all \(x_k \in X\) when generating the final solver.

  2.

    For non-generic and overdetermined polynomial systems \(\mathcal {F}\), we avoid computing the Minkowski sum of all the Newton polytopes \(NP(f), \forall f \in \mathcal {F}_a\), as proposed in the methods in [11, 16, 17]. Here, \(\mathcal {F}_a\) denotes the augmented polynomial system (see Eq. (16)). Instead, we iterate through each subset, \(\mathcal {F}_{\text {sub}} \subset \mathcal {F}_a\), and compute the Minkowski sum of \(NP(f), \forall f \in \mathcal {F}_{\text {sub}}\). Instead of dividing the Minkowski sum into cells, we simply use its lattice interior to determine the monomials B in the extended polynomial system \(\mathcal {F}^\prime \), i.e. \(B=\text {mon}(\mathcal {F}^\prime )\).

  3.

    We exploit the form of the extra polynomial (29) and propose a block partition of \(\texttt{M}(u_0)\) (17), which facilitates its decomposition using Schur complement directly into a regular eigenvalue problem. This regular eigenvalue problem, compared to GEP that arises for general u-resultant polynomial (15), leads to fewer spurious eigenvalues and hence a faster solver.

Note that our method of iteratively testing various polynomial subsets has been quite effective in generating efficient solvers, as demonstrated on many minimal problems (see Sect. 6). The generated solvers are comparable in efficiency and speed to those generated by the SOTA action matrix-based methods [32, 35, 38].

3.1 Method Outline

In the following, we first go through the important steps performed by our method.

  1.

    Let us consider a system of m polynomial equations, \(\mathcal {F}=0\) (1), in n variables, \(X = \lbrace x_1,\dots ,x_n \rbrace \). For all variables \(x_k \in X\), we perform the steps \(2-5\) in the offline phase.

  2.

    [Offline] We fix the form of the extra polynomial to \(f_{m+1} = x_k - u_0\) (29) and augment \(\mathcal {F}\) with \(f_{m+1}\) to obtain the augmented system (see Eq. (16)). We hide the new variable \(u_0\) to the coefficient field, which means that \(\mathcal {F}_a \subset \mathbb {C}[X \cup \lbrace u_0 \rbrace ]\).

  3.

    [Offline] We execute steps 3(a)-3(c), for each subset of polynomials \(\mathcal {F}_{\text {sub}} \subset \mathcal {F}_a\) and for every variable \(x_k \in X\).

    (a)

      From the set \(\mathcal {F}_{\text {sub}}\), we attempt to construct a favourable monomial set B, using a polytope-based method, described in Sect. 3.2.

    (b)

      We extend the polynomial system, \(\mathcal {F}_a\), using the computed monomial set B, represented as

      $$\begin{aligned} \mathcal {F}_a \overset{B}{\rightarrow }\ (\mathcal {F}_a^\prime , T), \end{aligned}$$
      (30)

      where \(T=\lbrace T_1,\dots ,T_{m+1} \rbrace \) (5).

    (c)

      The set of equations \(\mathcal {F}_a^\prime = 0\) can be expressed in the form of a matrix equation \(\texttt{C}(u_0) \textbf{b} = \textbf{0}\), where \(\textbf{b} = \text {vec}(B) \) and the vectorization operation is defined in Eq. (3).

    The output of this step, described in detail in Sect. 3.2, is all favourable monomial sets B and the corresponding coefficient matrices \(\texttt{C}(u_0)\).

  4.

    [Offline] For each favourable monomial set B and the corresponding coefficient matrix \(\texttt{C}(u_0)\) we perform the following:

    (a)

      We partition \(B=B_1 \cup B_2\) in two different ways,

      $$\begin{aligned} B_{1} = B \cap T_{m+1} \text { or} \end{aligned}$$
      (31)
      $$\begin{aligned} B_{1} = \lbrace \textbf{x}^{\mathbf {\alpha }} \in B \mid \dfrac{\textbf{x}^{\mathbf {\alpha }}}{x_k} \in T_{m+1} \rbrace . \end{aligned}$$
      (32)

      Note that \(B_2 = B \setminus B_1\).

    (b)

      Based on the two set-partitions of B, we attempt to partition the matrix \(\texttt{C}(u_0)\) in the following two ways,

      $$\begin{aligned} \texttt{C}(u_0) \textbf{b} = \begin{bmatrix} \texttt{A}_{11} &{} \texttt{A}_{12} \\ \texttt{A}_{21}-u_0 \texttt{I} &{} \texttt{A}_{22} \end{bmatrix} \begin{bmatrix} \textbf{b}_1 \\ \textbf{b}_2 \end{bmatrix} = \textbf{0}, \end{aligned}$$
      (33)
      $$\begin{aligned} \text {or} \quad \texttt{C}(u_0) \textbf{b} = \begin{bmatrix} \texttt{A}_{11} &{} \texttt{A}_{12} \\ \texttt{I} + u_0 \texttt{B}_{21} &{} u_0 \texttt{B}_{22} \end{bmatrix} \begin{bmatrix} \textbf{b}_1 \\ \textbf{b}_2 \end{bmatrix} = \textbf{0}, \end{aligned}$$
      (34)

      such that the matrix \(\texttt{A}_{12}\) has full column rank.

    The output of this step is the coefficient matrix \(\texttt{C}(u_0)\) for which a partitioning as in (33) or (34), with a full column rank matrix \(\texttt{A}_{12}\), is possible, and which corresponds to the smallest set \(B_1\). If we have more than one such choice of \(\texttt{C}(u_0)\), we select the smallest matrix \(\texttt{C}(u_0)\). This step is described in more detail in Sect. 3.3.

  5.

    [Offline] In this step, we aim to reduce the size of the matrix \(\texttt{C}(u_0)\), selected in the previous step.

    (a)

      We first try to remove a combination of rows and columns from \(\texttt{C}(u_0)\) and the corresponding monomials from the favourable monomial set B, such that the resulting monomial set is still a favourable monomial set (Sect. 3.2) and that the coefficient matrix \(\texttt{C}(u_0)\) can still be partitioned as in (33) or in (34).

    (b)

      We next remove the extra rows from \(\texttt{C}(u_0)\) to obtain a sparse resultant matrix \(\texttt{M}(u_0)\) (7), while still respecting its block partition, as in (33) or (34).

    This step is described in Sect. 3.4.

  6.

    [Online] In the final online solver, we fill the precomputed sparse resultant matrix \(\texttt{M}(u_0)\) with the coefficients coming from the input data/measurements. Then we compute the Schur complement of a block of \(\texttt{M}(u_0)\), as described in Sect. 2.2.4, which is then formulated as an eigenvalue problem of the matrix \(\texttt{X}\) (50) (or (52)). Finally, the eigendecomposition of the matrix \(\texttt{X}\) gives us the solutions to the input system of polynomial equations \(\mathcal {F}\), i.e. to the given instance of the minimal problem.
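The online stage itself is only a few lines of linear algebra. The sketch below (Python/numpy; the function and argument names are ours, and for simplicity it assumes the partition (33) and that \(B_1\) contains the constant monomial and each variable, which need not hold for every minimal problem) shows how the solutions are read off once the blocks of \(\texttt{M}(u_0)\) have been filled with the coefficients of a given instance.

```python
import numpy as np

def online_stage(A11_hat, A12_hat, A21, A22, B1, variables):
    """Sketch of the online stage (step 6 above) for the partition (33).
    The blocks come from the precomputed sparse resultant matrix M(u0), already
    filled with the coefficients of the current instance; B1 is the list of
    exponent tuples indexing b1 and `variables` are the unit exponent tuples of
    x_1, ..., x_n (all assumed, for this sketch, to be contained in B1)."""
    n = len(B1[0])
    X = A21 - A22 @ np.linalg.solve(A12_hat, A11_hat)   # Schur complement, Eq. (50)
    eigvals, V = np.linalg.eig(X)                       # eigenvalues are the values of x_k
    solutions = []
    for j in range(V.shape[1]):
        b1 = V[:, j] / V[B1.index((0,) * n), j]         # scale so that the entry of monomial 1 is 1
        solutions.append([b1[B1.index(v)] for v in variables])
    return eigvals, solutions
```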

3.2 Computing a Favourable Monomial Set

Let the extra polynomial \(f_{m+1}\) have the form \(f_{m+1} = x_k - u_0\) (29) for \(x_k \in X\), and let us assume a subset \(\mathcal {F}_{\text {sub}} \subset \mathcal {F}_a\) of the augmented system \(\mathcal {F}_a = \mathcal {F}\cup \lbrace f_{m+1} \rbrace \) (16).

In step 3(a) of our algorithm (Sect. 3.1), we attempt to generate so-called favourable monomial sets. Our method for constructing these sets is based on the polytope-based algorithm proposed in [21], where such sets were generated for the original system \(\mathcal {F}\) and its subsets. Here we describe our method that constructs such favourable sets for subsets \(\mathcal {F}_{\text {sub}}\) of the augmented system \(\mathcal {F}_a\) (16). In this subsection, we refer to the basic terminology and notation described in Sect. 2.2.5.

In our method, we begin by computing the support \(A_{j}=\text {supp}(f_{j})\) and the Newton polytope \(\text {NP}(f_{j}) = \text {conv}(A_{j})\) for each polynomial \(f_{j} \in \mathcal {F}_a\). We also compute the unit simplex \(\text {NP}_{0} \subset \mathbb {Z}^{n}\) and the Minkowski sum, \(Q = \text {NP}_{0} + \Sigma _{f \in \mathcal {F}_{\text {sub}}} \text {NP}(f)\). Let us construct a small displacement vector \(\mathbf {\delta } \in \mathbb {R}^n\), such that each of its elements is assigned one of the three values, \(\lbrace -10^{-1}, 0, 10^{-1} \rbrace \). In other words,

$$\begin{aligned} \mathbf {\delta } \in \lbrace \begin{bmatrix} \delta _{1}&\dots&\delta _{n} \end{bmatrix} \mid \delta _{i} \in \lbrace -10^{-1}, 0, 10^{-1} \rbrace ; i = 1,\dots ,n \rbrace . \end{aligned}$$
(35)

Then using this value of \(\mathbf {\delta }\), we compute the set of integer points inside \(Q+\mathbf {\delta }\), from which we compute a set of monomials, \(B = \lbrace \textbf{x}^{\mathbf {\alpha }} \mid \mathbf {\alpha } \in \mathbb {Z}^n \cap (Q+\mathbf {\delta }) \rbrace \), i.e. a potential candidate for a favourable monomial set. We demonstrate this step on a system of two polynomials in Example 2.

Example 2

Let us continue with Example 1, where we computed the Newton polytopes \(P_1 = \text {NP}(f_1)\) and \(P_2 = \text {NP}(f_2)\) as well as their Minkowski sum \(Q = P_1 + P_2\). Let us assume the displacement vector to be \(\mathbf {\delta } = [-0.1,-0.1]\). In Fig. 2, we depict the integer points in the interior of \(Q+\mathbf {\delta }\), i.e. the points in the set \(\mathbb {Z}^2 \cap (Q+\mathbf {\delta })\). This set of points gives us a favourable set of monomials B

$$\begin{aligned} B = \lbrace x_2, x_2^{2}, x_2^{3}, x_1^{2}, x_1^{3}, x_1 x_2, x_1 x_2^{2}, x_1 x_2^{3}, x_1^{2} x_2, x_1^{2} x_2^{2}, x_1^{2} x_2^{3}, x_1^{3} x_2, x_1^{3} x_2^{2}, x_1^{3} x_2^{3}, x_1^{4} x_2, x_1^{4} x_2^{2}, x_1^{4} x_2^{3} \rbrace . \end{aligned}$$
(36)
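The construction of Example 2 can be reproduced with a few lines of code. The sketch below (Python with numpy/scipy, an implementation choice of ours) computes \(Q = P_1 + P_2\) as the convex hull of all pairwise sums of the supports (27)-(28), shifts it by \(\mathbf {\delta } = [-0.1,-0.1]\), and collects the interior integer points, recovering the 17 monomials of (36).

```python
import itertools
import numpy as np
from scipy.spatial import ConvexHull

# Supports A1, A2 of f1 and f2 from Eqs. (27)-(28)
A1 = np.array([[3,3],[2,3],[3,2],[2,2],[0,3],[2,1],[0,2],[1,1],[2,0],[0,1]])
A2 = np.array([[2,0],[0,1],[1,0],[0,0]])

# Minkowski sum Q = P1 + P2 = convex hull of all pairwise sums of the support points
sums = np.array([p + q for p, q in itertools.product(A1, A2)])
Q = ConvexHull(sums)
F, g = Q.equations[:, :-1], Q.equations[:, -1]       # facets of Q:  F x + g <= 0

# Integer points inside the shifted polytope Q + delta
delta = np.array([-0.1, -0.1])
lo, hi = sums.min(axis=0), sums.max(axis=0)
grid = np.array([[i, j] for i in range(lo[0], hi[0] + 1)
                        for j in range(lo[1], hi[1] + 1)])
inside = np.all(F @ (grid - delta).T + g[:, None] <= 1e-9, axis=0)
print(grid[inside])    # 17 exponent vectors: the favourable monomial set B of Eq. (36)
```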

Selecting the displacement vector \(\delta \)

In Eq. (35), our choice of the displacement \(10^{-1}\) in each direction is not completely random, although to the best of our knowledge there is no theory which gives us a \(\delta \) leading to the smallest solver. All we know is that the displacement in each direction should be in the range \((-1.0, 1.0)\), and in this paper we have chosen a reasonably small value, i.e. \(10^{-1}\). A different choice will lead to a different favourable monomial set. To this end, we have observed that as the number of vertices of the Minkowski sum Q increases, the number of different solvers we can generate by choosing different displacements also increases.

Fig. 2 The integer points (shown in blue) in the interior of the Minkowski sum of the two Newton polytopes, \(Q = P_{1} + P_{2}\), after shifting it by \(\mathbf {\delta } = [-0.1, -0.1]\)

In the next step (see step 3(b) in Sect. 3.1), we extend the input polynomial system, \(\mathcal {F}_a\), using the set of monomials B as

$$\begin{aligned} \mathcal {F}_a \overset{B}{\rightarrow }\ (\mathcal {F}_a^{\prime }, T), \end{aligned}$$
(37)

where \(T=\lbrace T_1,\dots ,T_{m+1} \rbrace \) (see (5)). Each \(T_i\) denotes the set of monomials to be multiplied with the polynomial \(f_i \in \mathcal {F}_a\). The extended set of polynomials \(\mathcal {F}_a^{\prime }\) can be written in a matrix form as

$$\begin{aligned} \texttt{C}([c_{i,\alpha }],u_0) \ \textbf{b}, \end{aligned}$$
(38)

where \(\textbf{b}\) is a vector form of B w.r.t. some ordering of monomials. In the rest of the paper, we will denote the coefficient matrix \(\texttt{C}([c_{i,\alpha }], u_0)\) as \(\texttt{C}(u_0)\) for the sake of simplicity.

Our approach then evaluates the following three conditions:

$$\begin{aligned}{} & {} \Sigma _{j=1}^{m+1}|T_{j}| \ge |B|, \end{aligned}$$
(39)
$$\begin{aligned}{} & {} \min \limits _{j}|T_{j}| > 0, \end{aligned}$$
(40)
$$\begin{aligned}{} & {} \text {column rank of } \texttt{C}(u_0) = |B|, \end{aligned}$$
(41)

for a random value of \(u_0 \in \mathbb {C}\). The first condition (39) ensures that we have at least as many rows as the columns in the matrix \(\texttt{C}(u_0)\). The second condition (40) ensures that there is at least one row corresponding to every polynomial \(f_i \in \mathcal {F}_a\). The third condition (41) ensures that \(\texttt{C}(u_0)\) has full column rank and that we can extract a sparse resultant matrix from it. If these conditions are satisfied, we consider the set of monomials B to be a favourable set of monomials.
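The three tests are straightforward to implement; a possible sketch (Python/numpy, with hypothetical argument names and a numerical rank test standing in for a symbolic one) is given below.

```python
import numpy as np

def is_favourable(T, C_u0, B_size, tol=1e-9):
    """Check conditions (39)-(41) for a candidate monomial set B.  T is the list
    of monomial-multiple sets T_1, ..., T_{m+1}; C_u0 is the coefficient matrix
    C(u0) already evaluated at a random u0 (a dense numpy array); B_size = |B|."""
    if sum(len(Tj) for Tj in T) < B_size:                    # condition (39)
        return False
    if min(len(Tj) for Tj in T) == 0:                        # condition (40)
        return False
    return np.linalg.matrix_rank(C_u0, tol=tol) == B_size    # condition (41)
```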

Note that we repeat this procedure for all variables \(x_k \in X\), for each subset \(\mathcal {F}_{\text {sub}} \subset \mathcal {F}_a\), and for each value of the displacement vector \(\mathbf {\delta }\) (35). The output of this step are all generated favourable monomial sets B and the corresponding coefficient matrices \(\texttt{C}(u_0)\).

3.3 Block Partition of \(\texttt{C}(u_0)\)

In step 4 of our method (Sect. 3.1), we iterate through all favourable monomial sets B and the corresponding coefficient matrices \(\texttt{C}(u_0)\), which were the output of the previous step.

For each favourable monomial set B, we know the corresponding sets of monomial multiples \(T_i\) for each polynomial \(f_i \in \mathcal {F}_a\) and the set \(T = \lbrace T_{1},\dots ,T_{m},T_{m+1} \rbrace \). We have also computed the corresponding coefficient matrix \(\texttt{C}(u_0)\). Assume the extra polynomial (29) to be fixed as \(f_{m+1} = x_k - u_0\), for \(x_k \in X\), while computing this favourable monomial set.

Since B was constructed such that it satisfies (41), \(\texttt{C}(u_0)\) has full column rank. Let us denote \(\varepsilon = \mid B \mid \) and \(p = \Sigma _{j=1}^{m+1}|T_{j}|\). As \(\texttt{C}(u_0)\) satisfies the condition (39), we have \(p \ge \varepsilon \). Therefore, we can remove the extra \(p-\varepsilon \) rows from \(\texttt{C}(u_0)\), leading to a square invertible matrix. The algorithm for this row removal is described in Appendix B.3. The square matrix so obtained is a sparse resultant matrix \(\texttt{M}(u_0)\), and it satisfies the resultant matrix constraint (8) if the set of equations \(\mathcal {F}_a = 0\) has solutions.

Instead of directly solving the resultant constraint (9) or converting (8) to a GEP, we exploit the structure of the extra polynomial \(f_{m+1}\) (29) and propose a special ordering of the rows and columns of \(\texttt{M}(u_0)\). This facilitates a block partition of \(\texttt{M}(u_0)\), such that the Schur complement of one of its block submatrices then helps us to reduce its matrix constraint (8) to a regular eigenvalue problem (Footnote 3).

Rather than block partitioning the sparse resultant matrix \(\texttt{M}(u_0)\), we first fix a block partition of the coefficient matrix \(\texttt{C}(u_0)\) and then remove its rows, while respecting this partition, to obtain the sparse resultant matrix \(\texttt{M}(u_0)\). Note that we obtain two different block partitions on \(\textbf{b}\) and \(\texttt{C}(u_0)\), in (38), based on our selected set-partition of the favourable monomial set \(B = B_{1} \cup B_{2}\):

$$\begin{aligned} B_{1} = B \cap T_{m+1} \text { or} \end{aligned}$$
(42)
$$\begin{aligned} B_{1} = \left\{ \textbf{x}^{\mathbf {\alpha }} \in B \mid \dfrac{\textbf{x}^{\mathbf {\alpha }}}{x_k} \in T_{m+1} \right\} \end{aligned}$$
(43)

Note that \(B_{2} = B \setminus B_{1}\). We consider both these partitions in Proposition 1. Thereafter, we will describe our method of removing the rows from \(\texttt{C}(u_0)\) such that we can compute a Schur complement of a block of \(\texttt{M}(u_0)\).

Proposition 1

Consider a block partition of \(\texttt{C}(u_0)\) and \(\textbf{b}\) in (38) as

$$\begin{aligned} \texttt{C}(u_0)\ \textbf{b} = \begin{bmatrix} \texttt{C}_{11} &{} \texttt{C}_{12} \\ \texttt{C}_{21} &{} \texttt{C}_{22} \end{bmatrix} \begin{bmatrix} \textbf{b}_{1} \\ \textbf{b}_{2} \end{bmatrix}. \end{aligned}$$
(44)

We can achieve this in two different ways, based on our choice of the set-partition of B in (42) or (43). In either case, if \(\texttt{C}_{12}\) has full column rank, then the resultant matrix constraint (8) can be converted to an eigenvalue problem, of the form \(\texttt{X} \ \textbf{b}_{1} = u_0\textbf{b}_{1}\) if we set-partition B as in (42), and of the form \(\texttt{X} \ \textbf{b}_{1} = -\dfrac{1}{u_0} \textbf{b}_{1}\) if we set-partition B as in (43). The matrix \(\texttt{X}\) is square and its entries are functions of the coefficients of \(\mathcal {F}\).

Proof

Let us first consider the set-partition of B as in (42). Then, the special form of the extra polynomial (29) implies that

$$\begin{aligned} B_{1} \! =\! \lbrace \textbf{x}^{\mathbf {\alpha }} \in B | \ x_k \textbf{x}^{\mathbf {\alpha }} \in B \rbrace . \end{aligned}$$
(45)

We next order the monomials in B such that \(\textbf{b}\) can be partitioned as \(\textbf{b}\! = \begin{bmatrix} \text {vec}(B_{1})\; \text {vec}(B_{2}) \end{bmatrix}^{T}\! =\! \begin{bmatrix} \textbf{b}_1\; \textbf{b}_2 \end{bmatrix}^{T}\). This induces a column partition of \(\texttt{C}(u_0)\). Moreover, we can row partition \(\texttt{C}(u_0)\) by indexing the rows in its lower block by monomial multiples of \(f_{m+1}\). All such multiples are linear in \(u_0\), i.e., these multiples have the form \(\textbf{x}^{\mathbf {\alpha _j}} (x_{k} - u_0), \forall \textbf{x}^{\mathbf {\alpha _j}}\in T_{m+1}\). The upper block is row-indexed by monomial multiples of \(f_{1},\dots ,f_{m}\). Such a row and column partition of \(\texttt{C}(u_0)\) gives us a block partition as in (44). As \(\begin{bmatrix} \texttt{C}_{11}\!&\! \texttt{C}_{12} \end{bmatrix}\) contains polynomials independent of \(u_0\) and \(\begin{bmatrix} \texttt{C}_{21}\!&\! \texttt{C}_{22}\! \end{bmatrix}\) contains polynomials of the form \(\textbf{x}^{\mathbf {\alpha _j}} ( x_{k} - u_0)\), we obtain

$$\begin{aligned} \texttt{C}_{11} = \texttt{A}_{11}, \quad \texttt{C}_{12} = \texttt{A}_{12}, \quad \texttt{C}_{21} = \texttt{A}_{21} + u_0 \texttt{B}_{21}, \quad \texttt{C}_{22} = \texttt{A}_{22} + u_0 \texttt{B}_{22}, \end{aligned}$$
(46)

where \(\texttt{A}_{11}, \texttt{A}_{12}, \texttt{A}_{21}\), \(\texttt{A}_{22}\), \(\texttt{B}_{21}\) and \(\texttt{B}_{22}\) are matrices dependent only on the coefficients of input polynomials in \(\mathcal {F}_a\) (16). Based on our assumption in the statement of this proposition, \(\texttt{C}_{12} \) and hence \(\texttt{A}_{12}\), have full column rank. Substituting (46) in (44) gives

$$\begin{aligned} \texttt{C}(u_0) = \underbrace{\begin{bmatrix} \texttt{A}_{11} &{} \texttt{A}_{12} \\ \texttt{A}_{21} &{} \texttt{A}_{22} \end{bmatrix}}_{\texttt{C}_0} + u_0 \underbrace{\begin{bmatrix} \texttt{0} &{} \texttt{0} \\ \mathtt {B_{21}} &{} \mathtt {B_{22}} \end{bmatrix}}_{\texttt{C}_1}. \end{aligned}$$
(47)

We can order monomials so that \(\textbf{T}_{m+1} = \textbf{b}_1\). Now, the block partition of \(\texttt{C}(u_0)\) implies that \(\texttt{C}_{21}\) is column-indexed by \(\textbf{b}_1\) and row-indexed by \(\textbf{T}_{m+1}\). As \(\begin{bmatrix} \texttt{C}_{21}\!&\! \texttt{C}_{22} \end{bmatrix}\) has rows of form \(\textbf{x}^{\mathbf {\alpha _j}}(x_{k}\! -\! u_0)\), \(\textbf{x}^{\mathbf {\alpha _j}}\! \in \! T_{m+1}\! \implies \! \textbf{x}^{\mathbf {\alpha _j}}\! \in \! B_{1}\). This gives us \(\texttt{B}_{21} = \mathtt {-I}\), where \(\texttt{I}\) is an identity matrix of size \(\mid B_{1} \mid \times \mid B_{1} \mid \) and \(\texttt{B}_{22}\) is a zero matrix of size \(\mid B_{1}\mid \times \mid B_{2} \mid \). This also means that \(\texttt{A}_{21}\) is a square matrix of the same size as \(\texttt{B}_{21}\). Thus, we have

$$\begin{aligned} \texttt{C}(u_0) = \begin{bmatrix} \texttt{A}_{11} &{} \texttt{A}_{12} \\ \texttt{A}_{21} &{} \texttt{A}_{22} \\ \end{bmatrix} + u_0 \begin{bmatrix} \texttt{0} &{} \texttt{0} \\ - \texttt{I} &{} \texttt{0} \end{bmatrix}, \end{aligned}$$
(48)

where \(\texttt{C}(u_0)\) is a \(p \times \varepsilon \) matrix. If \(\texttt{C}(u_0)\) is a tall matrix, so is \(\texttt{A}_{12}\). Therefore, we can eliminate extra rows from the upper block \(\begin{bmatrix} \texttt{A}_{11}&\texttt{A}_{12} \end{bmatrix}\) so that we obtain a square invertible matrix \(\hat{\texttt{A}}_{12}\) while preserving the above-mentioned structure. Such a row-removal will also give us the sparse resultant matrix \(\texttt{M}(u_0)\), which satisfies the resultant matrix constraint (8). We will describe our method for removing the extra rows in Sect. 3.4. Let us now use the Schur complement technique of Sect. 2.2.4 to reduce the size of the eigenvalue problem. From (48), we have

$$\begin{aligned} \underbrace{\begin{bmatrix} \hat{\texttt{A}}_{11} &{} \hat{\texttt{A}}_{12} \\ \texttt{A}_{21} &{} \texttt{A}_{22} \end{bmatrix}}_{\hat{\texttt{C}}_{0}} \begin{bmatrix} \textbf{b}_1 \\ \textbf{b}_2 \end{bmatrix} + u_0 \begin{bmatrix} \texttt{0} &{} \texttt{0} \\ - \texttt{I} &{} \texttt{0} \end{bmatrix} \begin{bmatrix} \textbf{b}_1 \\ \textbf{b}_2 \end{bmatrix} = \textbf{0} \implies \hat{\texttt{A}}_{11} \textbf{b}_1 + \hat{\texttt{A}}_{12} \textbf{b}_2 = \textbf{0}, \quad \texttt{A}_{21} \textbf{b}_1 + \texttt{A}_{22} \textbf{b}_2 - u_0 \textbf{b}_1 = \textbf{0}. \end{aligned}$$
(49)

Eliminating \(\textbf{b}_2\) from the above pair of equations, we obtain

$$\begin{aligned} \underbrace{(\texttt{A}_{21} - \texttt{A}_{22} \hat{\texttt{A}}_{12}^{-1} \hat{\texttt{A}}_{11})}_{\texttt{X}} \textbf{b}_1 = u_0 \textbf{b}_1. \end{aligned}$$
(50)

The matrix \(\texttt{X}\) is the Schur complement of \(\hat{\texttt{A}}_{12}\) w.r.t. \(\hat{\texttt{C}}_{0}\).

If we select the alternative set-partition of B, as in (43), we obtain a different block partition of \(\textbf{b}\) and \(\texttt{C}(u_0)\). Specifically, in (47), we have \(\texttt{A}_{21} = \texttt{I}\) and \(\texttt{A}_{22} = \texttt{0}\). By assumption, \(\texttt{A}_{12}\) has full column rank. Therefore, we can remove the extra rows from \(\texttt{C}(u_0)\) in (48) and obtain

$$\begin{aligned} \texttt{M}(u_0)= \begin{bmatrix} \hat{\texttt{A}}_{11} &{} \hat{\texttt{A}}_{12} \\ \texttt{I} &{} \texttt{0} \\ \end{bmatrix} + u_0 \begin{bmatrix} \texttt{0} &{} \texttt{0} \\ \texttt{B}_{21} &{} \texttt{B}_{22} \end{bmatrix}. \end{aligned}$$
(51)

Substituting this form of \(\texttt{M}(u_0)\) into (8), we get \(\hat{\texttt{A}}_{11} \textbf{b}_1 + \hat{\texttt{A}}_{12} \textbf{b}_2 = \textbf{0}\) and \(u_0(\texttt{B}_{21} \textbf{b}_1 + \texttt{B}_{22} \textbf{b}_2) + \textbf{b}_1 = \textbf{0}\). Eliminating \(\textbf{b}_2\) from these equations, we obtain an alternative eigenvalue formulation

$$\begin{aligned} \underbrace{(\texttt{B}_{21} - \texttt{B}_{22} \hat{\texttt{A}}_{12}^{-1} \hat{\texttt{A}}_{11})}_{\texttt{X}} \textbf{b}_1 = -(1/u_0) \textbf{b}_1. \;\; \end{aligned}$$
(52)

\(\square \)

Here, the matrix \(\texttt{X}\) represents an alternative form of the Schur complement to the one in (50).

Note that this proposition allows us to test the existence of the Schur complement \(\texttt{X}\) (50) (or (52)) for a given favourable monomial set B and the corresponding coefficient matrix \(\texttt{C}(u_0)\). This condition is tested for each B and corresponding \(\texttt{C}(u_0)\), for both choices of the set-partition, (42) or (43). Out of those that succeed, we select the coefficient matrix \(\texttt{C}(u_0)\) that leads to the smallest possible Schur complement \(\texttt{X}\). If we have more than one such choice, we choose the smallest coefficient matrix \(\texttt{C}(u_0)\). Note that our iterative approach is crucial in increasing the chances of obtaining a minimal solver, even when the polynomials in \(\mathcal {F}\) have non-generic coefficients.
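The decomposition in Proposition 1 is easy to verify numerically. The sketch below (Python/numpy, with random blocks standing in for an actual minimal problem) builds a matrix of the form (48) and checks that every eigenvalue \(u_0\) of the Schur complement \(\texttt{X}\) in (50) makes \(\texttt{M}(u_0)\) singular, i.e., satisfies the resultant matrix constraint (8).

```python
import numpy as np

rng = np.random.default_rng(1)
n1, n2 = 3, 5                                    # |B1| and |B2|
A11 = rng.standard_normal((n2, n1))              # hat A_11
A12 = rng.standard_normal((n2, n2))              # hat A_12, square and invertible
A21 = rng.standard_normal((n1, n1))
A22 = rng.standard_normal((n1, n2))

X = A21 - A22 @ np.linalg.solve(A12, A11)        # Schur complement, Eq. (50)
eigvals, V = np.linalg.eig(X)

for u0, b1 in zip(eigvals, V.T):
    b2 = -np.linalg.solve(A12, A11 @ b1)         # from  A11 b1 + A12 b2 = 0
    M = np.block([[A11, A12],
                  [A21 - u0 * np.eye(n1), A22]])
    b = np.concatenate([b1, b2])
    print(np.linalg.norm(M @ b),                             # ~ 0: b is a null vector of M(u0)
          np.linalg.svd(M, compute_uv=False)[-1])            # ~ 0: M(u0) is singular
```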

3.4 Row-Column Removal

The next step in our method is to attempt to reduce the size of the matrix \(\texttt{C}(u_0)\) selected in the previous step. For this, we employ a method inspired by [26]. Here we select columns of \(\texttt{C}(u_0)\), one by one in a random order, to test for their removal. For each such column, we select the rows (say \(r_{1},\dots ,r_{s}\)) that contain nonzero entries in the column and also consider all columns (say \(c_{1},\dots ,c_{l}\)) that have nonzero entries in \(r_{1},\dots ,r_{s}\). Then, we can remove these s rows and l columns from \(\texttt{C}(u_0)\) only if the following conditions hold true for the resulting reduced matrix \(\texttt{C}_{\text {red}}(u_0)\).

  1.

    After eliminating the monomials from T, we require that there is at least one monomial left in each \(T_{i}\).

  2.

    If \(\texttt{C}(u_0)\) is of size \(p \times \varepsilon \), the reduced matrix \(\texttt{C}_{\text {red}}(u_0)\) would be of size \((p-s) \times (\varepsilon -l)\). Then, we require \(p-s \ge \varepsilon -l\) and \(\text {rank}(\texttt{C}_{\text {red}}(u_0)) = \varepsilon -l\).

  3.

    \(\texttt{C}_{\text {red}}(u_0)\) must be block partitioned and decomposed as in Proposition 1.

We repeat the above process until there are no more columns that can be removed. Note that the last condition is important as it ensures that at each stage, the reduced matrix can still be partitioned and decomposed into an eigenvalue formulation (50) (or alternately (52)). Now by abuse of notation, let us denote \(\texttt{C}(u_0)\) to be the reduced matrix and denote B and T to be the reduced set of monomials and the set of monomial multiples, respectively.

If \(\texttt{C}(u_0)\) still has more rows than columns, we transform it into a square matrix by removing extra rows (say \(q_{1},\dots ,q_{j}\)) and the monomials from T indexing these rows, chosen in a way such that the three conditions mentioned above are still satisfied. Note that it is always possible to choose rows such that these three conditions are satisfied. Moreover, our proposed approach first tries to remove as many rows as possible from the lower block, indexed by \(T_{m+1}\). This is to reduce \(|T_{m+1}|\) (\(=|B_{1}|\)) as much as possible and to ensure that the matrix \(\texttt{A}_{21}\), and hence the matrix \(\texttt{X}\) (50) (or (52)) of the eigenvalue problem, is as small as possible. Then, if there are more rows still to be removed, the rest are randomly chosen from the upper block indexed by \(\lbrace T_{1},\dots ,T_{m} \rbrace \). Algorithms for this step of matrix reduction are provided in “Appendix B”.

The sparse resultant matrix \(\texttt{M}(u_0)\) is constructed offline through these three steps. In the online stage, we fill in the coefficients of \(\texttt{M}(u_0)\) from the input measurements and compute the Schur complement \(\texttt{X}\) (50) (or (52)). Then, we extract the solutions to \(x_{1},\dots ,x_{n}\) from the eigenvectors of \(\texttt{X}\). The speed of the solver depends on the size of \(\textbf{b}_1\) (\(=|B_{1}|\)) as well as on the size of \(\hat{\texttt{A}}_{12}\), while the accuracy of the solver largely depends on the condition number and the size of the matrix to be inverted, i.e. \(\hat{\texttt{A}}_{12}\).
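The online stage can be summarized by the following sketch, assuming the filled-in matrix \(\texttt{M}(u_0)\) is available as the four numeric blocks of (55) (the names A11h, A12h, A21, A22 are ours), that the Schur complement has the standard form \(\texttt{X} = \texttt{A}_{21} - \texttt{A}_{22}\hat{\texttt{A}}_{12}^{-1}\hat{\texttt{A}}_{11}\) as in (50), and that \(B_1\) contains the monomials \(1, x_1, \dots, x_n\) in that order; this last assumption is made only for the read-off step and does not hold for every problem.

```python
import numpy as np

def online_stage(A11h, A12h, A21, A22, n):
    """Sketch of the online stage: Schur complement followed by eigendecomposition.

    A11h, A12h : numeric upper blocks of M(u_0), with A12h square and invertible
    A21, A22   : numeric lower blocks coming from the extra polynomial f_{m+1}
    n          : number of variables x_1, ..., x_n
    """
    # Schur complement of A12h in M(u_0):  X = A21 - A22 * A12h^{-1} * A11h
    X = A21 - A22 @ np.linalg.solve(A12h, A11h)

    # The eigenvectors of X are, up to scale, the vectors b_1 evaluated at the roots.
    _, V = np.linalg.eig(X)

    solutions = []
    for j in range(V.shape[1]):
        b1 = V[:, j]
        b1 = b1 / b1[0]  # assumes the first monomial of B_1 is the constant 1
        # assumes B_1 starts with [1, x_1, ..., x_n]; otherwise the solutions are
        # read off from ratios of suitable monomial entries of b_1
        solutions.append(np.real_if_close(b1[1:n + 1]))
    return X, solutions
```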

4 Action Matrix vs. Sparse Resultant

In this section, we study the connection between a solver generated using an action matrix method [8, 9, 26, 27, 32, 35, 47] and a solver based on the sparse resultant method proposed in Sect. 3. Observe that our sparse resultant method exploits only the structure of the input polynomials, via the geometry of their Newton polytopes, whereas the SOTA action matrix methods investigate the input polynomial system algebraically, via a Gröbner basis. Although the two approaches seem different, the generated solvers exhibit the same properties, and hence our objective in this section is to shed some light on the similarities. Some of the symbols used in the two methods coincide; to distinguish them, in this section we use the prefixes \({}^{a}\) and \({}^{r}\) to denote, respectively, the symbols used in the action matrix method in Sect. 2.1 and in the sparse resultant method in Sect. 3.1. Let us assume that both methods are used to generate a solver for a system \(\mathcal {F}=0\) (1) of m polynomial equations in n variables with r solutions. An action matrix method results in a solver that performs G–J elimination (or, alternatively, LU/QR decomposition) of an elimination template \({}^{a}\!\texttt{C}\) (A8) and the eigendecomposition of an action matrix \(\texttt{M}_f\) (A3) of size \(r \times r\).Footnote 4 The proposed sparse resultant method, on the other hand, leads to a solver that computes a Schur complement \(\texttt{X}\) (50) (or (52)) of size \(N \times N\) from the sparse resultant matrix \(\texttt{M}(u_0)\), followed by its eigendecomposition. In general, \(N \ge r\).

While the final solvers generated by these two methods are in general different, i.e., they perform different operations, they nevertheless demonstrate some similar properties: both extract solutions from eigenvectors of some matrix, and both involve computing a matrix inverse (G–J elimination can be replaced by a matrix inverse). This motivates us to study the similarities of these solvers. Let us first define the equivalence of a solver generated using an action matrix method (AM) and a solver generated using the sparse resultant method from Sect. 3 (SRes).

Definition 3

For a given system of polynomial equations \(\mathcal {F}=0\) (1), an action matrix-based solver and a sparse resultant-based solver are defined to be equivalent iff the following two conditions are satisfied:

  1. 1.

    The action matrix \(\texttt{M}_f\) (A3) is the same as the Schur complement \(\texttt{X}\) (50) (or (52)), i.e. \(\texttt{M}_f = \texttt{X}\).

  2. 2.

    The size of the matrix \({}^{a}\!\hat{\texttt{C}}\) is the same as the size of the upper block \(\begin{bmatrix} \hat{\texttt{A}}_{11}&\hat{\texttt{A}}_{12}\end{bmatrix}\) of the sparse resultant matrix \(\texttt{M}(u_0)\) (49) (or (51)) and the step of G–J elimination of the template matrix \({}^{a}\!\hat{\texttt{C}}\) in the online stage (step 7) of the action matrix method (Sect. 2.1) can be replaced with the matrix product \(\hat{\texttt{A}}_{12}^{-1} \hat{\texttt{A}}_{11}\), used to compute the Schur complement in the online stage (step 6) of our sparse resultant method (Sect. 3.1), or vice-versa.

Now, our goal is to define conditions under which the final online solvers generated by these two methods are equivalent.
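For one numeric instance, the two conditions of Definition 3 could be checked along the following lines; this is only a sketch, with all inputs assumed to be the matrices produced by the two solvers, and the comparison of the eliminated template with \(\hat{\texttt{A}}_{12}^{-1}\hat{\texttt{A}}_{11}\) left to the caller, since it depends on the solver-specific column ordering.

```python
import numpy as np

def check_definition_3(M_f, X, C_hat, A11h, A12h, tol=1e-8):
    """Numerically check Definition 3 on one instance (a sketch).

    M_f        : action matrix of the AM solver
    X          : Schur complement of the SRes solver
    C_hat      : elimination template of the AM solver
    A11h, A12h : upper blocks of M(u_0), i.e. [A11h | A12h] is its upper block
    """
    # Condition 1: the action matrix equals the Schur complement.
    cond1 = (M_f.shape == X.shape) and np.allclose(M_f, X, atol=tol)

    # Condition 2 (size part): the template and the upper block have equal size.
    cond2 = C_hat.shape == (A12h.shape[0], A11h.shape[1] + A12h.shape[1])

    # Condition 2 (replaceability part): the product A12h^{-1} A11h that the SRes
    # solver uses in place of G-J elimination; comparing it against the eliminated
    # part of C_hat is left to the caller.
    gj_replacement = np.linalg.solve(A12h, A11h)
    return cond1 and cond2, gj_replacement
```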

We next demonstrate that if we have an action matrix-based solver (AM) for a given system \(\mathcal {F}=0\) (1), we can modify the steps performed by the sparse resultant method proposed in Sect. 3.1 such that the two solvers are equivalent according to Definition 3.

4.1 \(\texttt{X}\) (SRes) from \(\texttt{M}_f\) (AM)

Let us assume that we have generated a solver for a system of polynomial equations \(\mathcal {F}=0\) (1), using the action matrix method described in Sect. 2.1 and detailed in “Appendix A”. The action polynomial is assumed to be \(f=x_1\), where \( x_1 \in X\).

We first propose the following changes to the steps performed in the offline stage of the sparse resultant method in Sect. 3.1. In what follows, step i refers to step i of Sect. 3.1.

C1R:

Step 1 is skipped, as there is no need to test each variable \(x_k \in X\).

C2R:

In step 2, the extra polynomial is assumed to be \(f_{m+1} = x_1 - u_0\).

C3R:

In step 3(a), the favourable set of monomials \({}^{r}\!B\) is constructed directly from \({}^{a}\!B\), as \({}^{r}\!B = {}^{a}\!B\). Moreover, the sets of monomial multiples are constructed as \({}^{r}T_i = {}^{a}T_i, \ i=1,\dots ,m\), and \({}^{r}T_{m+1} = B_a\) (see Eq. (A5)). Step 3(b) is skipped, as the coefficient matrix \({}^{r}\texttt{C}(u_0)\) will be obtained directly from the action matrix-based solver in subsequent steps. The output of this step is then the favourable monomial set \({}^{r}\! B\).

C4R:

Replace step 4(a) by directly determining the set partition of \({}^{r}\!B=B_1\cup B_2\) (31)–(32), as \(B_1 = B_a \) and \( B_2 = \hat{B}_e \cup B_r\), where \(B_a, B_r\) and \(\hat{B}_e\) are defined in “Appendix A”. Moreover, the monomial ordering in \({}^{a}\textbf{b}\) determines the vectors \(\textbf{b}_1 = \textbf{b}_a\) and \(\textbf{b}_2 = \begin{bmatrix} {\hat{\textbf{b}}}_e \\ \textbf{b}_r \end{bmatrix} \). The action matrix-based solver corresponds to the first version of the set partition of \({}^{r}\!B\) considered in our sparse resultant method, i.e. the set partition in (31).

C5R:

In step 4(b), the upper block \(\begin{bmatrix} \texttt{A}_{11}&\texttt{A}_{12} \end{bmatrix}\) of \({}^{r}\texttt{C}(u_0)\) (33) is obtained directly from \({}^{a}\hat{\texttt{C}}\), as

$$\begin{aligned} \texttt{A}_{11} = \begin{bmatrix} \texttt{C}_{13} \\ \texttt{C}_{23} \end{bmatrix}, \quad \texttt{A}_{12} = \begin{bmatrix} \hat{\texttt{C}}_{11} & \texttt{C}_{12} \\ \hat{\texttt{C}}_{21} & \texttt{C}_{22} \end{bmatrix}. \end{aligned}$$
(53)

Multiplying \(f_{m+1}\) by the monomials in \({}^{r}T_{m+1}\), we obtain the expanded set of polynomials \(\lbrace \textbf{x}^{\mathbf {\alpha }} \left( x_1 - u_0 \right) \mid \textbf{x}^{\mathbf {\alpha }} \in {}^{r}T_{m+1} \rbrace \). It is possible to order the polynomials in this set such that its matrix form is

$$\begin{aligned} \begin{bmatrix} \texttt{A}_{21} - u_0 \texttt{I}&\texttt{A}_{22} \end{bmatrix} \begin{bmatrix} \textbf{b}_1 \\ \textbf{b}_2 \end{bmatrix}. \end{aligned}$$
(54)

This determines the lower block of \({}^{r}\texttt{C}(u_0)\). Here, \(\texttt{A}_{21}\) and \(\texttt{A}_{22}\) are binary matrices, with entries either 0 or 1. The upper and the lower blocks together give us

$$\begin{aligned} {}^{r}\texttt{C}(u_0) \ {}^{r}\textbf{b} = \begin{bmatrix} \texttt{A}_{11} & \texttt{A}_{12} \\ \texttt{A}_{21} - u_0 \texttt{I} & \texttt{A}_{22} \end{bmatrix} \begin{bmatrix} \textbf{b}_1 \\ \textbf{b}_2 \end{bmatrix}. \end{aligned}$$
(55)

The output of the modified step 4 is the coefficient matrix \({}^{r}\texttt{C}(u_0)\); a sketch of the construction of its lower block is given after this list.

C6R:

By construction, \({}^{r}\texttt{C}(u_0)\) is already a square invertible matrix and has no extra rows to be removed. As no row-column removal needs to be attempted on \({}^{r}\texttt{C}(u_0)\), step 5 is skipped, and the sparse resultant matrix is constructed directly as \(\texttt{M}(u_0)= {}^{r}\texttt{C}(u_0)\). In this case, the submatrices \(\hat{\texttt{A}}_{11}\) and \(\hat{\texttt{A}}_{12}\) of \(\texttt{M}(u_0)\) are equal to the submatrices \( \texttt{A}_{11}\) and \( \texttt{A}_{12}\) of \({}^{r}\texttt{C}(u_0)\), respectively.

Applying these changes to steps 1–5 in Sect. 3.1, i.e. the offline stage, we obtain the sparse resultant matrix \(\texttt{M}(u_0)\). After that, the online solver computes the Schur complement \(\texttt{X}\) of the block submatrix \(\hat{\texttt{A}}_{12}\) of \(\texttt{M}(u_0)\), followed by an eigenvalue decomposition (50) of \(\texttt{X}\), as described in step 6 in Sect. 3.1. In Proposition C.1, we prove that the sparse resultant-based solver so obtained is equivalent to the action matrix-based solver.
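The lower block of \({}^{r}\texttt{C}(u_0)\) described in C5R admits a simple mechanical construction. The following sketch assumes monomials are represented as exponent tuples and that, as noted above via (45), \(x_1\textbf{x}^{\alpha}\) always lies in \(B_1 \cup B_2\); the function name mul_by_x1 is ours.

```python
import numpy as np

def lower_block(B1, B2, mul_by_x1, u0):
    """Sketch: build the lower block [A21 - u0*I | A22] of rC(u_0), cf. (54)-(55).

    B1, B2    : lists of monomials (exponent tuples) in the order of b_1 and b_2
    mul_by_x1 : maps a monomial to x_1 times that monomial
    u0        : numeric value of the hidden constant u_0
    """
    n1, n2 = len(B1), len(B2)
    col1 = {m: j for j, m in enumerate(B1)}
    col2 = {m: j for j, m in enumerate(B2)}

    A21 = np.zeros((n1, n1))
    A22 = np.zeros((n1, n2))
    for i, mono in enumerate(B1):      # rows indexed by T_{m+1} = B_1
        shifted = mul_by_x1(mono)      # x_1 * x^alpha, lies in B_1 or B_2
        if shifted in col1:
            A21[i, col1[shifted]] = 1.0
        else:
            A22[i, col2[shifted]] = 1.0
    return np.hstack([A21 - u0 * np.eye(n1), A22])
```

For exponent tuples in n variables, mul_by_x1 can be taken, for instance, as `lambda a: (a[0] + 1,) + tuple(a[1:])`.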

4.2 \(\texttt{M}_f\) (AM) from \(\texttt{X}\) (SRes)

Let us assume that for a system of polynomial equations \(\mathcal {F}=0\) (1), we have a sparse resultant-based solver generated by following the steps in Sect. 3.1. In this solver, the coefficient matrix \({}^{r}\texttt{C}(u_0)\) is assumed to be partitioned as in (33); the alternative partition (34) is considered in Sect. 4.3. Moreover, the Schur complement \(\texttt{X}\) is assumed to have as many eigenvalues as the number of solutions to \(\mathcal {F}=0\), i.e., \(N = r\).Footnote 5 The extra polynomial is assumed to be of the form \(f_{m+1} = x_1 - u_0\), for \(x_1 \in X\).

We first propose the following changes to the steps performed in the offline stage of the action matrix method in “Appendix A”. In the following list, step i refers to step i in “Appendix A”.

C1A:

There is no change to step 1.

C2A:

In step 2, set the basis of the quotient ring A to be \(\mathcal {B}_A = \lbrace [\textbf{x}^{\mathbf {\alpha }}] \mid \textbf{x}^{\mathbf {\alpha }} \in B_1\rbrace \subset A\). By assumption, A is an r-dimensional vector space over \(\mathbb {C}\) spanned by the elements of \(\mathcal {B}_A\).Footnote 6 Construct the monomial set \(B_a = B_1\).

C3A:

In step 3, the action polynomial is assumed to be \(f = x_1\).

C4A:

The monomial multiples required in step 4 are obtained as \({}^{a}T_i = {}^{r}T_i, \ i=1,\dots ,m\). This also gives us the extended polynomial set \(\mathcal {F}^\prime \). Note that \({}^{a}\!B = {}^{r}\!B = \text {mon}(\mathcal {F}^\prime )\).

C5A:

In step 5, the required sets of reducible and excess monomials are determined as

$$\begin{aligned} B_r&= \lbrace x_1 \textbf{x}^{\mathbf {\alpha }} \mid \textbf{x}^{\mathbf {\alpha }} \in B_1 \rbrace \setminus B_1, \\ B_e&= B_2 \setminus B_r. \end{aligned}$$
(56)

Note that by (45), \( \textbf{x}^{\mathbf {\alpha }} \in B_1 \implies x_1 \textbf{x}^{\mathbf {\alpha }} \in {}^{r}\!B \); a small sketch of this construction is given after this list.

C6A:

In step 6, the vector of monomials is determined as \({}^{a}\hat{\textbf{b}}={}^{r}\textbf{b}\). This also determines the vectors \(\textbf{b}_a, \textbf{b}_r\) and \(\hat{\textbf{b}}_e\). Thus, the monomials in the action matrix method are ordered in the same way as the monomials in the sparse resultant method. Moreover, in step 6, the elimination template \({}^{a}\hat{\texttt{C}}\) and its block partition are determined as

$$\begin{aligned} \begin{bmatrix} \hat{\texttt{C}}_{11} & \texttt{C}_{12} \\ \hat{\texttt{C}}_{21} & \texttt{C}_{22} \end{bmatrix} = \hat{\texttt{A}}_{12}, \quad \begin{bmatrix} \texttt{C}_{13} \\ \texttt{C}_{23} \end{bmatrix} = \hat{\texttt{A}}_{11}. \end{aligned}$$
(57)

The column partition of \({}^{a}\hat{\texttt{C}}\) is determined by the partition of the vector \({}^{a}\textbf{b}\). By the construction of \(\texttt{M}(u_0)\), \(\hat{\texttt{A}}_{12}\) is a square invertible matrix. Therefore, we can always find a row partition such that \(\texttt{C}_{22}\) is a square invertible matrix. As \(\texttt{C}_{22}\) and \(\hat{\texttt{A}}_{12}\) are square matrices, \(\hat{\texttt{C}}_{11}\) is also a square matrix. Therefore, there are no extra columns to be removed, and the elimination template \({}^{a}\hat{\texttt{C}}\) is such that its G–J elimination leads to the required form, as in (A11).

Applying these changes to steps 1–6 in Sect. 2.1, i.e. the offline stage, we obtain the elimination template \({}^{a}\hat{\texttt{C}}\). Subsequently, the online action matrix-based solver (step 7 in Sect. 2.1) computes the G–J elimination of \({}^{a}\hat{\texttt{C}}\), from which the entries of the action matrix \(\texttt{M}_f\) are read off. This implies that the action matrix-based solver is equivalent to the sparse resultant-based solver, which is proved in Proposition C.2.
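The monomial bookkeeping of C5A, i.e. (56), amounts to simple set operations on exponent tuples, as in the following sketch; the example sets are hypothetical.

```python
def reducible_and_excess(B1, B2, mul_by_x1):
    """Sketch of (56): B_r = {x_1 * m | m in B_1} \\ B_1 and B_e = B_2 \\ B_r."""
    B1, B2 = set(B1), set(B2)
    Br = {mul_by_x1(m) for m in B1} - B1
    Be = B2 - Br
    return Br, Be

# Hypothetical example with exponent tuples in two variables (x_1, x_2):
mul_by_x1 = lambda a: (a[0] + 1, a[1])
Br, Be = reducible_and_excess({(0, 0), (1, 0), (0, 1)},   # B_1 = {1, x_1, x_2}
                              {(2, 0), (1, 1), (0, 2)},   # B_2 = {x_1^2, x_1 x_2, x_2^2}
                              mul_by_x1)
# Br = {(2, 0), (1, 1)}, i.e. {x_1^2, x_1 x_2};  Be = {(0, 2)}, i.e. {x_2^2}
```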

4.3 \(\texttt{M}_f\) (AM) from \(\texttt{X}\) (SRes) with Alternative Form

Let us assume that for a system of polynomial equations \(\mathcal {F}=0\) (1), we have used the alternative partitions of \({}^{r}\!B\) (32) and \({}^{r}\texttt{C}(u_0)\) (34), and generated a sparse resultant-based solver by following the steps in Sect. 3.1. This means that we need to assume that no solution of \(\mathcal {F}=0\) has \(x_1=0\). The other assumptions remain the same as in Sect. 4.2.

The alternative partition of the favourable monomial set \({}^{r}\!B\) (32) implies that the Schur complement \(\texttt{X}\) (52) gives us a representation of each coset \(\left[ \dfrac{\textbf{x}^{\mathbf {\alpha _j}}}{x_1} \right], \textbf{x}^{\mathbf {\alpha _j}} \in B_1\), as a linear combination of the cosets \([\textbf{x}^{\mathbf {\alpha _j}}], \forall \textbf{x}^{\mathbf {\alpha _j}} \in B_1\). However, the approach from Sect. 4.2 does not work here, because in this case we would need to set the action polynomial f to be \(1/x_1\), which is not a polynomial.

Therefore, we use the so-called Rabinowitsch trick [15] and propose the following changes to the offline stage of the action matrix method in Sect. 2.1. In the following list, step i refers to step i in Sect. 2.1.

C1A\(^\prime \):

In step 1, the polynomial system \(\mathcal {F}\) is augmented with an extra polynomial \({}^{a}\!f_{m+1}=x_1 \lambda - 1\), where \(\lambda \) is a new variable. The augmented polynomial system is denoted as \({}^{a}\!\mathcal {F}_a\), and the augmented set of variables as \({}^{a}\!X_a = X \cup \lbrace \lambda \rbrace \). Consider the ideal \(I_a = \langle {}^{a}\!\mathcal {F}_a \rangle \) and the quotient ring \(A_a = \mathbb {C}[{}^{a}\!X_a]/I_a\).

C2A\(^\prime \):

Note that the number of solutions to \(\mathcal {F}=0\) is the same as that of \({}^{a}\!\mathcal {F}_a=0\), because we have assumed \(x_1 \ne 0\). Therefore, \(A_a\) is an r-dimensional vector space over \(\mathbb {C}\). Its basis is constructed as \(\mathcal {B}_A = \lbrace [\textbf{x}^{\mathbf {\alpha _j}}] \mid \textbf{x}^{\mathbf {\alpha _j}} \in B_1\rbrace \). Construct the monomial set \(B_a = B_1\).

C3A\(^\prime \):

In step 3, the action polynomial is assumed to be \(f = \lambda \).

C4A\(^\prime \):

The sets of monomial multiples required in step 4 are constructed as \({}^{a}T_i = {}^{r}T_i, \ i=1,\dots ,m+1\). From (32), \(\textbf{x}^{\mathbf {\alpha _j}} \in T_{m+1}\) implies \(x_1 \textbf{x}^{\mathbf {\alpha _j}} \in B_1\), which in turn implies that \(\lambda x_1 \textbf{x}^{\mathbf {\alpha _j}} \in B_r\). The extended polynomial set \({}^{a}\!\mathcal {F}_a^\prime \) is obtained by multiplying each \(f_i \in {}^{a}\!\mathcal {F}_a\) by the monomials in \({}^{a}\!T_i\). In step 4, the set of monomials is obtained as \({}^{a}\!B = \text {mon}({}^{a}\!\mathcal {F}_a^\prime )\).

C5A\(^\prime \):

In step 5, the required monomial sets are

$$\begin{aligned} B_r&= \lbrace \lambda \textbf{x}^{\mathbf {\alpha }} \mid \textbf{x}^{\mathbf {\alpha }} \in B_1 \rbrace , \\ B_e&= B_2. \end{aligned}$$
(58)

Note that the set of monomials \({}^{a}\!B = B_e \cup B_r \cup B_a\), and \(B_a \cap B_r = \emptyset \).

C6A\(^\prime \):

In step 6, the monomial vectors are set as \(\textbf{b}_a=\textbf{b}_1\), \(\textbf{b}_r=\text {vec}(B_r)\) and \(\textbf{b}_e=\textbf{b}_2\). This also fixes the vector of monomials \({}^{a}\textbf{b} = \begin{bmatrix} \textbf{b}_e \\ \textbf{b}_r \\ \textbf{b}_a \end{bmatrix}\). The monomials in the vector \({}^{a}\textbf{b}\) in the action matrix method are thus ordered in the same way as the monomials in the sparse resultant method in Sect. 3.1. Moreover, as the monomials in \(B_r\) are in one-to-one correspondence with the monomials in \(B_1\), the monomial ordering in \(\textbf{b}_1\) also fixes the order of the monomials in \(\textbf{b}_r\).

Applying these changes to steps 1–6 in Sect. 2.1, i.e. the offline stage, we obtain the elimination template \({}^{a}\hat{\texttt{C}}\). Then, the action matrix-based solver (step 7 in Sect. 2.1) computes the G–J elimination of \({}^{a}\hat{\texttt{C}}\), from which the entries of the action matrix \(\texttt{M}_f\) are read off. This implies that the action matrix-based solver is equivalent to the sparse resultant-based solver, which is proved in Proposition C.3.

5 Comparison w.r.t. SOTA Sparse Resultant Methods

Our extra polynomial algorithm is based on the hidden variable method [21]. It also extends the method in [11] for square systems by dispensing with the computation of a mixed subdivision of the Minkowski sum of all the Newton polytopes. We consider these two methods to be the current SOTA for generating sparse resultant-based minimal solvers.

We select a subset of interesting minimal problems and demonstrate the importance of our algorithm in improving solver speed compared to the solvers generated by the SOTA resultant methods. Moreover, our algorithm has two important properties, viz. the special form of the extra polynomial and the row-column removal procedure. In this section, we demonstrate via an ablation study how these two properties are critical for achieving substantially improved solver sizes. Refer to Table 1, where we list the sizes of the solvers generated using the SOTA sparse resultant methods [11, 21] (cols 3 & 4), the sizes of the solvers generated with an extra polynomial of a general form (see Sect. 2.2.3) (col 5), and the sizes of the solvers generated without the row-column removal step (see Sect. B.2) (col 6). Finally, we list the sizes of the solvers generated by our approach (col 7).

Note that the size of a solver based on the hidden variable method [21] is reported as the size of the underlying generalized eigenvalue problem (GEP) (14). If the matrix \(\texttt{A}\) (and \(\texttt{B}\)) in the GEP is of dimensions \(r \times r\), the solver size can be considered equivalent to inverting a matrix of dimensions \(r \times r\) and computing the eigenvalues of a matrix of size \(r \times r\). On the other hand, the size of a solver based on any one of the other methods is reported as a matrix of size \(s \times t\), which is equivalent to inverting a matrix of dimensions \(s \times s\) and computing the eigenvalues of a matrix of size \( (t-s) \times (t-s)\). The difference in the time taken to invert matrices of comparable sizes is minute, as the sparsity of the matrices enables highly optimized implementations of the matrix inverse. Therefore, the speed of the solver is primarily governed by the size of the eigenvalue computation. With this discussion in mind, we observe from Table 1 that the solvers based on the GEP (col 3) have worse or comparable speed to that of the solvers based on our approach (col 7). Observe that the approach of [11] failed to generate any solver for the five problems whose systems are non-square, as the subdivision methods cannot be applied to such systems. For all the problems, the u-resultant method using an extra polynomial of a general form leads to larger elimination templates, whereas using an extra polynomial with a special form improves the template size. Additionally, the row-column removal step ultimately leads to the best solver sizes among all sparse resultant-based methods.
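Under the rough assumption that both a matrix inversion and an eigendecomposition of a \(k \times k\) matrix cost on the order of \(k^3\) with comparable constants, and ignoring sparsity, the sizes in Table 1 can be turned into a coarse cost estimate as in the sketch below; this is only an illustrative model, not the timing methodology used in the paper, and the example sizes are hypothetical.

```python
def approx_cost(size, kind="template"):
    """Very rough cubic cost model for comparing the solver sizes in Table 1.

    kind == "gep":      size = r, an r x r generalized eigenvalue problem, modeled
                        as one r x r inversion plus one r x r eigendecomposition
    kind == "template": size = (s, t), an s x s inversion plus a
                        (t - s) x (t - s) eigendecomposition
    Both operations are modeled as k**3 with unit constants, so this only orders
    solvers very coarsely.
    """
    if kind == "gep":
        r = size
        return r**3 + r**3
    s, t = size
    return s**3 + (t - s)**3

# Hypothetical comparison of an 11 x 20 template solver against a 14 x 14 GEP solver:
print(approx_cost((11, 20)), approx_cost(14, kind="gep"))
```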

Table 1 A comparison of the sizes of the solvers for some minimal problems, generated by the iterative algorithm [21] (col 3), the polytope subdivision approach for square systems in [11] (col 4), the u-resultant method using an extra polynomial with a generic form (col 5), our proposed algorithm (see Sect. 3) without (col 6) and with row-column removal (col 7)

6 Experiments

In this section, we evaluate the performance of the minimal solvers generated using our sparse resultant method proposed in Sect. 3. We compare the stability as well as the computational complexity of these solvers with the state-of-the-art Gröbner basis-based solvers for many interesting minimal problems. The minimal problems selected for comparison represent a wide variety of relative and absolute pose problems and correspond to those studied in [35]. Note that for the two minimal problems Rel. pose f+E+f 6pt and Abs. pose refractive P5P, we generated solvers from a simplified polynomial system as described in [38], instead of the original formulation.

Table 2 Size comparison of solvers based on [32] (col 6), Gröbner fan [35] (col 7), heuristic sampling [35] (col 9) and greedy parameter search [38] (col 10)

6.1 Computational Complexity

The biggest contributor to the computational complexity of a minimal solver is the size of the matrices that undergo the crucial numerical operations. The solvers based on our sparse resultant method in Sect. 3, the state-of-the-art Gröbner basis solvers, as well as the original solvers proposed by the respective authors (see column 4), involve two crucial operations: a matrix inverseFootnote 7 and an eigenvalue decomposition. This is indicated by the size of the solver in the table; e.g. a sparse resultant-based solver of size \(11 \times 20\) means computing a matrix inverse \(\hat{\texttt{A}}_{12}^{-1}\) of size \(11 \times 11\), ultimately leading to a Schur complement \(\texttt{X}\) of size \(20-11 =9\), and an eigenvalue decomposition of this matrix \(\texttt{X}\). Similarly, an action matrix-based solver of size \(11 \times 20\) means performing a G–J elimination of an \(11 \times 20\) matrix \(\texttt{C}\) and then an eigenvalue decomposition of an action matrix \(\texttt{M}_f\) of size \(20-11=9\). So in Table 2, we compare the size of the upper block \(\begin{bmatrix} \hat{\texttt{A}}_{11}&\hat{\texttt{A}}_{12} \end{bmatrix}\) of the sparse resultant matrix \(\texttt{M}(u_0)\) in our extra polynomial sparse resultant-based solvers with the size of the elimination template \(\texttt{C}\) used in the state-of-the-art Gröbner basis solvers as well as in the original solvers proposed by the respective authors (column 5).

The Gröbner basis-based solvers used for comparison include the solvers generated using the approach in [32] (column 6), the Gröbner fan-based and the heuristic-based approaches in [35] (columns 7 and 9, respectively), and the template optimization approach using a greedy parameter search in [38] (column 10). Out of the two methods for generating minimal solvers, proposed in [38], we consider the smallest solver for each minimal problem.

As we can see from Table 2, for most minimal problems our sparse resultant method leads to smaller solvers than the Gröbner basis solvers based on [32, 35] and to solvers of exactly the same size as those based on [38]. The smallest solvers, where the size of the eigenvalue decomposition problem equals the number of solutions, are written in bold in Table 2.

Moreover, for some minimal problems, our sparse resultant method leads to a larger eigenvalue problem than the number of solutions; these are written in italics in Table 2. For three such minimal problems, i.e. Rel. pose E+\(f\lambda \) 7pt, Unsynch. Rel. pose, and Optimal pose 2pt v2, our sparse resultant method leads to smaller solvers than those based on the Gröbner basis methods [32, 35], whereas for the problem Abs. pose quivers, our sparse resultant method leads to a smaller solver than those based on [32, 35] and of comparable size to the solver based on [38].

Note that for the two minimal problems Optimal pose 2pt v2 and Rel. pose E angle+4pt, the elimination template \(\texttt{C}\) in the solvers based on the approach in [38] underwent an extra Schur complement-based step in the offline stage. Therefore, the template sizes reported for these two solvers are smaller than those of the solvers based on the other Gröbner basis methods in [32, 35]. For the problem Rel. pose E angle+4pt, our sparse resultant method failed to generate a solver. This implies that none of the subsets \(\mathcal {F}_{\text {sub}}\) for this problem leads to a favourable monomial set (see Sect. 3.2). The primary reason for this failure is the non-genericity of the coefficients (see Definition 1). This points to a future research direction for such minimal problems: we need to look beyond the Newton polytopes of the subsets \(\mathcal {F}_{\text {sub}}\) and test other convex objects.

6.2 Stability Comparison

We evaluate and compare the stability of our solvers from Table 2 with that of the Gröbner basis-based solvers. As it is not feasible to generate scene setups for all considered problems, we instead evaluated the stability of the minimal solvers using 5K instances of random data points. The stability measures include the mean and median of the \(Log_{10}\) of the normalized equation residuals for the computed solutions, as well as the solver failures. The solver failures are an important measure of stability for real-world applications and are computed as the percentage of the 5K instances for which at least one solution has a normalized equation residual \(>10^{-3}\). These measures on randomly generated inputs have been shown to be sufficiently good indicators of solver stability [32]. Table 3 shows the stability of the solvers for all the minimal problems from Table 2. We observe that for most of the minimal problems, our proposed sparse resultant-based solvers have no failures. Among those with some failures, for all except two minimal problems, our sparse resultant-based solvers have fewer failures than the Gröbner basis-based solvers.
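The stability measures described above can be computed as in the following sketch, assuming the normalized equation residuals of all computed solutions are already available for each random instance.

```python
import numpy as np

def stability_measures(residuals_per_instance, fail_thresh=1e-3):
    """Sketch of the stability measures reported in Table 3.

    residuals_per_instance : one entry per random instance, each an array of the
                             normalized equation residuals of all computed solutions
    Returns the mean and median of log10 residuals over all solutions, and the
    failure rate in percent (instances with at least one residual > fail_thresh).
    """
    all_res = np.concatenate([np.ravel(r) for r in residuals_per_instance])
    log_res = np.log10(np.maximum(all_res, np.finfo(float).tiny))  # guard against 0
    failures = sum(np.any(np.asarray(r) > fail_thresh) for r in residuals_per_instance)
    fail_pct = 100.0 * failures / len(residuals_per_instance)
    return log_res.mean(), np.median(log_res), fail_pct
```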

In general, our new method generates solvers that are stable, with only very few failures. For a selected subset of nine minimal problems from Table 2, we computed the \(Log_{10}\) of the normalized equation residuals and depict their histograms in Fig. 4. The histograms agree with our observation from Table 3 that our proposed sparse resultant method leads to solvers with only a few failures.

Table 3 Stability comparison for solvers for the minimal problems from Table 2, generated by our new method, solvers generated using [32] (col 4), heuristic-based solvers [35] (col 5) and greedy parameter search [38] (col 6)

Note that since our new solvers solve the same formulations of the problems as the existing state-of-the-art solvers, their performance on noisy measurements and real data is the same as the performance of the state-of-the-art solvers. The only difference in performance comes from numerical instabilities that already appear in the noise-less case and are detailed in Table 3 (fail\(\%\)). For the performance of the solvers in real applications, we refer the reader to the papers where the original formulations of the studied problems were presented (see Table 2, column 2). Here we select two interesting problems, i.e., one relative and one absolute pose problem, and perform experiments on synthetically generated scenes and on real images, respectively.

E+\(f\lambda \) solver on synthetic scenes We study the numerical stability of our sparse resultant-based solver for the problem of estimating the relative pose of one calibrated camera and one camera with unknown focal length and radial distortion from 7-point correspondences, i.e. the Rel. pose E+\(f\lambda \) 7pt problem from Table 2. We consider the formulation “elim. \(\lambda \)” proposed in [35], which leads to the smallest solvers. We study the performance on noise-free data and compare it to the results of the Gröbner basis solvers from Table 2.

We generated 10K scenes with 3D points drawn uniformly from a \(\left[ -10,10\right] ^3\) cube. Each 3D point was projected by two cameras with random feasible orientations and positions. The focal length of the first camera was randomly drawn from the interval \(f_{gt} \in \left[ 0.5, 2.5\right] \), and the focal length of the second camera was set to 1, i.e. the second camera was calibrated. The image points in the first camera were corrupted by radial distortion following the one-parameter division model. The radial distortion parameter \(\lambda _{gt}\) was drawn at random from the interval \([-0.7, 0]\), representing distortions ranging from cameras with small distortion up to slightly more than GoPro-style cameras.
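For reference, under the one-parameter division model a distorted, centered image point \((x, y)\) corresponds to the undistorted point \((x, y)/(1 + \lambda(x^2 + y^2))\); the following sketch applies this undistortion, while the exact centering and normalization conventions used in our experiments are not spelled out here and are therefore an assumption.

```python
import numpy as np

def undistort_division_model(pts, lam):
    """Undistort centered 2D image points under the one-parameter division model.

    pts : (N, 2) array of distorted, centered image points
    lam : radial distortion parameter lambda, e.g. drawn from [-0.7, 0]
    """
    r2 = np.sum(pts**2, axis=1, keepdims=True)   # squared radius of each point
    return pts / (1.0 + lam * r2)                # (x, y) -> (x, y) / (1 + lam * r^2)
```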

Figure 3 shows the \(Log_{10}\) of the relative error of the distortion parameter \(\lambda \) (left) and the focal length f (right), obtained by selecting the real root closest to the ground truth. All tested solvers provide stable results with only a small number of runs with larger errors. The solver based on our sparse resultant method (green) is not only smaller, but also slightly more stable than the heuristic-based solver [35] (red) and the solver based on [32] (black). For most minimal problems, the solver based on our proposed sparse resultant method has fewer failures than the solver based on the greedy parameter search approach in [38] (blue) (Fig. 4).

P4Pfr solver on real images

We evaluated our sparse resultant-based solver for the practical problem of estimating the absolute pose of a camera with unknown focal length and radial distortion from four 2D-to-3D point correspondences, i.e. the P4Pfr solver, on real data. We consider the Rotunda dataset, which was proposed in [29] and used in [34] for evaluating P4Pfr solvers. This dataset consists of 62 images captured by a GoPro Hero4 camera. An example of an input image from this dataset (left), as well as the undistorted (middle) and registered (right) images obtained using our new solver, is shown in Fig. 5 (top). The Reality Capture software [44] was used to build a 3D reconstruction of this scene. We used the 3D model to estimate the pose of each image using the new P4Pfr resultant-based solver (\(28\times 40\)) inside a RANSAC framework. Similarly to [34], we used the camera and distortion parameters obtained from [44] as the ground truth for the experiment. Figure 5 (bottom) shows the errors in the focal length, the radial distortion, and the camera pose. We have summarized these results in Table 4, where we present the errors obtained using our proposed solver and, for comparison, the errors reported in [34], where the P4Pfr (\(40\times 50\)) solver was tested on the same dataset. Overall, the errors are quite small, e.g. most of the focal lengths are within \(0.1\%\) of the ground truth and almost all rotation errors are less than 0.1 degrees, which shows that our new solver, as well as the original solver, works well for real data. The results of both solvers are very similar; the slightly different numbers reported in [34] are due to RANSAC’s random nature and the slightly different P4Pfr formulation (\(40\times 50\)) used in [34].
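A RANSAC loop of the kind used here can be sketched as follows; solve_p4pfr and reproj_error are hypothetical callables standing in for the generated minimal solver and for a reprojection-error routine, and the threshold and iteration count are placeholders.

```python
import numpy as np

def ransac_p4pfr(pts_2d, pts_3d, solve_p4pfr, reproj_error,
                 thresh=2.0, iters=1000, rng=None):
    """Sketch of a RANSAC loop around a P4Pfr minimal solver.

    solve_p4pfr  : hypothetical solver; takes four 2D-3D correspondences and returns
                   a list of candidate models (pose, focal length, distortion)
    reproj_error : hypothetical function returning per-point reprojection errors
                   of a candidate model on all correspondences
    """
    rng = rng or np.random.default_rng()
    best_model, best_inliers = None, 0
    n = pts_2d.shape[0]
    for _ in range(iters):
        idx = rng.choice(n, size=4, replace=False)       # minimal 4-point sample
        for model in solve_p4pfr(pts_2d[idx], pts_3d[idx]):
            inliers = int(np.sum(reproj_error(model, pts_2d, pts_3d) < thresh))
            if inliers > best_inliers:
                best_model, best_inliers = model, inliers
    return best_model, best_inliers
```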

Fig. 3

Histograms of \(Log_{10}\) relative error in radial distortion (left) and focal length (right) for Rel. pose E+\(f\lambda \) 7pt (elim \(\lambda \)) problem for 10K randomly generated synthetic scenes. These scenes represent cameras with different radial distortions, poses and focal lengths. For comparison, we generated solvers using our sparse resultant method (green), Gröbner basis method [32] (black), heuristic method [35] (red) and greedy parameter search method [38] (blue) (Color figure online)

Fig. 4

Histograms of \(Log_{10}\) of the normalized equation residual error for nine selected minimal problems. The methods tested for comparison are based on the Gröbner basis method [32], the heuristic method [35], the greedy parameter search [38] and our sparse resultant method

7 Conclusion

In this paper, we have proposed a sparse resultant method for generating efficient solvers for minimal problems in computer vision. It uses a polynomial with a special form to augment the initial polynomial system \(\mathcal {F}\) in (1) and constructs a sparse resultant matrix \(\texttt{M}(u_0)\) (7) using the theory of polytopes [15]. The special form enables us to decompose the resultant matrix constraint (8) into a regular eigenvalue problem (50) (or (52)) using the Schur complement. As demonstrated in Sect. 6, our sparse resultant method leads to minimal solvers with comparable speed and stability w.r.t. the SOTA solvers based on the action matrix method [26, 32, 35, 38]. Note that, by design, for some minimal problems our sparse resultant method leads to solvers with larger eigenvalue problems but a smaller matrix inverse, and with comparable or better stability than the Gröbner basis-based solvers.

While the action matrix method and the sparse resultant method are based on different mathematical theories, we have observed that the resulting solvers involve similar numerical operations, such as eigenvalue computation. This raises the question: under what conditions, for a given minimal problem (a given system of polynomial equations), are the two solvers the same? In Sect. 4, we have attempted to answer this question. Specifically, if we begin with an action matrix-based solver for a given minimal problem, we propose a list of changes to the steps of our sparse resultant method so that it leads to an equivalent sparse resultant-based solver. In the opposite direction, we also study the case when we begin with a sparse resultant-based solver and establish a list of changes to the steps performed by an action matrix method such that it leads to an equivalent solver. In other words, if we begin with an action matrix-based solver, it determines the extra polynomial \(f_{m+1}\) and the favourable monomial set to be used in the sparse resultant method; if we begin with a sparse resultant-based solver, it determines the monomial ordering, the extended polynomial set \(\mathcal {F}^{\prime }\), and the basis of the quotient ring to be used for computing an action matrix-based solver.

We hope that this discussion paves a path towards unifying the two approaches for generating minimal solvers, i.e., the action matrix methods and the sparse resultant method based on an extra polynomial. Both approaches are examples of normal form methods for a given polynomial system \(\mathcal {F}\), recently studied in [40, 54, 55]. To the best of our knowledge, normal forms have not yet been incorporated into the automatic generators of minimal solvers in computer vision. This may be an interesting topic for future investigation.