Abstract
We present a new probabilistic symbolic algorithm that, given a variety defined in an n-dimensional affine space by a generic sparse system with fixed supports, computes the Zariski closure of its projection to an ℓ-dimensional coordinate affine space with ℓ<n. The complexity of the algorithm depends polynomially on some combinatorial invariants associated to the supports.
Similar content being viewed by others
1 Introduction
Let \(\mathcal{A}=(\mathcal{A}_{1},\ldots, \mathcal{A}_{r})\) be a family of finite subsets of \((\mathbb{Z}_{\ge0})^{n}\) and f=(f 1,…,f r ) a system of polynomials in \(\mathbb {Q}[X_{1},\ldots, X_{n}]\) supported on \(\mathcal{A}\). If \(V(\mathbf{f}) \subset\mathbb{C}^{n}\) denotes the affine variety of the common zeros of the polynomials in f and, for a given ℓ<n, \(\pi: \mathbb{C}^{n} \to\mathbb{C}^{\ell}\) is the projection π(x 1,…,x n )=(x 1,…,x ℓ ), we consider the problem of computing algorithmically a description of the Zariski closure \(\overline{\pi (V(\mathbf{f}))}\subset\mathbb{C}^{\ell}\) within a complexity depending on combinatorial invariants associated to the input supports.
The computation of (Zariski closures of) linear projections of varieties is the basic task in elimination theory. A more general formulation of this problem is algorithmic quantifier elimination over algebraically closed fields (see, for instance, [9, 18, 23, 36, 44] for algorithms with complexities depending on the number and degrees of the polynomials and the number of variables involved). Particular instances of the computation of Zariski closures of projections are the computation of Chow forms (see, for instance, [7, 21, 29]), classical resultants (see [13] and the references therein) and sparse resultants (see, for example, [8, 12, 28, 43]).
The foundations of the study of sparse polynomial systems can be traced back to [4, 31] and [32], which prove that the number of isolated roots in \((\mathbb{C}^{*})^{n}\) of a system of n polynomial equations in n unknowns is bounded by the mixed volume of the family of their supports.
This result led to the construction of algorithms for the computation of these isolated solutions, particularly designed to deal with sparse systems (see, for instance, [26, 30, 38, 46]), which run faster than the known general procedures solving this task. Later on, upper bounds for the number of isolated affine solutions of sparse systems were obtained and new efficient algorithms to compute them were designed (see, for instance, [17, 20, 24, 27, 34, 37, 39]). More recently, positive dimensional components of affine varieties defined by sparse systems were also considered: in [45], certificates for the existence of 1-dimensional components were given and, in [1] and [2], under certain assumptions on the equations, algorithmic methods to describe Puiseux series expansions of curves and arbitrary positive dimensional components, respectively, were presented. Also, in [25], an upper bound in terms of mixed volumes for the degree of the affine variety defined by a sparse polynomial system of n equations in n unknowns was proved and algorithms for characterizing the equidimensional decomposition of affine varieties defined by sparse systems were designed. The good performance of the most efficient algorithms dealing with sparse systems relies on the use of polyhedral deformations introduced in [26], because these deformations essentially preserve the monomial structure of the system involved.
In this paper, we present a probabilistic symbolic algorithm which computes the Zariski closure \(\overline{\pi(V(\mathbf{f}))}\subset \mathbb{C}^{\ell}\) for a sparse system f with fixed supports and generic coefficients. We use the decomposition of the variety V(f) into equidimensional subvarieties contained in coordinate subspaces proved in [25] to reduce the problem to the case of a variety such that each of its components intersects the torus. In this case we compute a geometric resolution of the variety with respect to a suitably chosen set of free variables and, from this resolution, we show how to obtain a geometric resolution of the Zariski closure of the required projection. The complexity of our algorithm is polynomial in some combinatorial invariants associated to the supports of the input polynomials. Our main result is the following.
Theorem 1
Let f=(f 1,…,f r ) be a system of polynomials in \(\mathbb{Q}[X_{1},\ldots, X_{n}]\) with generic coefficients supported on a family \(\mathcal{A}= (\mathcal{A}_{1},\ldots, \mathcal{A}_{r})\) of finite subsets of \((\mathbb{Z}_{\ge0})^{n}\) such that \(\dim(\sum_{j\in J} \mathcal{A}_{j} )\ge\# J\) for every J⊂{1,…,r}, and let \(V^{*}(\mathbf{f})\subset\mathbb{C}^{n}\) be the Zariski closure of
There is a probabilistic algorithm that computes a geometric resolution of \(\overline{\pi(V^{*}(\mathbf{f}))}\), where \(\pi: \mathbb{C}^{n} \to\mathbb{C}^{\ell}\) is the projection to the first ℓ coordinates, within
operations in \(\mathbb{Q}\) up to logarithmic factors, where
Here, MV denotes the mixed volume, Δ is the vertex set of the standard simplex in \(\mathbb{R}^{n}\) and the superscript indicates the number of repetitions.
The stated complexity is due to a polyhedral deformation-based algorithm to solve generic sparse zero-dimensional systems [30], an algorithmic Newton–Hensel lifting [40] and codification of multivariate polynomials as straight-line programs [6]. As in [30], we do not take into account the cost of mixed-volume related computations (see Sect. 2.2 for more details on this point). Furthermore, in the complexity of Theorem 1 we have ignored terms depending on the size of certain combinatorial objects associated to polyhedral deformations involved; for a more precise complexity estimate, see Theorem 10.
The paper is organized as follows. In Sect. 2, the notation and the basic theoretical and algorithmic notions used throughout are introduced. Section 3 is devoted to proving the main theoretical results on which our algorithms rely. Finally, Sect. 4 contains the descriptions, proof of correctness, and complexity estimates of our algorithms, and examples illustrating how they work.
2 Preliminaries
2.1 Basic Definitions and Notation
Let k be a field of characteristic zero and \(\overline{k}\) be an algebraic closure of k. Given polynomials f 1,…,f r ∈k[X 1,…,X n ], we write V(f)=V(f 1,…,f r ) for the affine variety which is the set of the common zeros of f=(f 1,…,f r ) in \(\overline{k}^{n}\), and
for the union of the irreducible components of V(f 1,…,f r ) containing points with all their coordinates in \(\overline{k}^{*}:= \overline{k} - \{0\}\).
For a variety \(V\subset\overline{k}^{n}\) definable over k, we denote k[V]=k[X 1,…,X n ]/I(V) its coordinate ring (where I(V)⊂k[X 1,…,X n ] is the ideal of the polynomials vanishing identically on V). If V is irreducible, we write k(V) for the fraction field of k[V].
To describe zero-dimensional affine varieties we use the notion of a geometric resolution (see, for instance, [22] and the references therein): Let \(V= \{\xi^{(1)},\ldots, \xi^{(D)}\}\subset\overline{k}^{n}\) be a zero-dimensional variety defined by polynomials in k[X 1,…,X n ]. Given a linear form λ=λ 1 X 1+⋯+λ n X n in k[X 1,…,X n ] such that λ(ξ (i))≠λ(ξ (j)) if i≠j, the following polynomials completely characterize V:
-
the minimal polynomial q λ =∏1≤i≤D (Y−λ(ξ (i)))∈k[Y] of λ over the variety V (where Y is a new variable),
-
polynomials v 1,…,v n ∈k[Y] with deg(v j )<D for every 1≤j≤n satisfying \(V = \{(v_{1} (\eta),\ldots, v_{n} (\eta)) \in\mathbb{C}^{n} \mid\eta \in\overline{k}, q_{\lambda}(\eta)=0\}\).
The family of univariate polynomials (q λ ,v 1,…,v n )∈k[Y]n+1 is called the geometric resolution of V (or a geometric resolution of k[V]) associated with the linear form λ.
The notion of geometric resolution can be extended to any equidimensional variety: Let \(V\subset\overline{k}^{n}\) be an equidimensional variety of dimension t defined by polynomials in k[X 1,…,X n ]. Assume that, for each irreducible component W of V, the identity I(W)∩k[X 1,…,X t ]={0} holds. By considering k(X 1,…,X t )⊗k[V], we are in a zero-dimensional situation, and we call a geometric resolution of V with free variables X 1,…,X t to a geometric resolution (q λ ,v t+1,…,v n )∈k(X 1,…,X t )[Y]n−t+1 of k(X 1,…,X t )⊗k[V] associated to a linear form λ∈k[X t+1,…,X n ]. If \(\hat{q}_{\lambda}\in k[X_{1},\ldots, X_{t},Y]\) is obtained from q λ by clearing denominators, a geometric resolution of V gives a birational map between the hypersurface \(\{(x_{1},\ldots, x_{t}, y) \in \overline{k}^{n-t+1} \mid\hat{q}_{\lambda}(x_{1},\ldots, x_{t}, y) =0\}\) and V.
When dealing with varieties defined by sparse polynomial systems, an important combinatorial invariant associated to the system is the mixed volume of their supports. For a family \(\mathcal{A}=(\mathcal {A}_{1},\ldots, \mathcal{A}_{n})\) of n finite subsets of \((\mathbb {Z}_{\ge0})^{n}\), \(\mathit{MV}_{n}(\mathcal{A})\) (or \(\mathit{MV}(\mathcal{A})\) if n is clear from the context) denotes the n-dimensional mixed volume of the convex hulls in \(\mathbb{R}^{n}\) of \(\mathcal{A}_{1},\ldots, \mathcal {A}_{n}\) (see, for instance, [10, Chap. 7] for the definition and basic properties). In this context, we write Δ for the vertex set {0,e 1,…,e n } of the standard simplex of \(\mathbb{R}^{n}\), which is the support of an affine linear form with non-zero coefficients, and Δ (t) for the family of t copies of Δ.
2.2 Algorithms and Codification
Although we work with polynomials, our algorithms only deal with elements in a base field k. The notion of complexity of an algorithm we consider is the number of operations and comparisons in k it performs. We will encode multivariate polynomials in different ways:
-
in sparse form, that is, by means of the list of pairs (a,c a ) where a runs over a fixed set including the exponents of the monomials appearing in the polynomial with non-zero coefficients and c a is the corresponding coefficient,
-
in the standard dense form, which encodes a polynomial of degree bounded by d as the vector of the coefficients of all the monomials of degree at most d including zeros (we use this encoding only for univariate polynomials),
-
in the straight-line program (slp for short) encoding. A straight-line program is an algorithm without branchings which allows the evaluation of the polynomial at a generic value (for a precise definition and properties of slp’s, see [6]).
In our complexity estimates, we use the usual O notation: for \(f,g\colon\mathbb{Z}_{\ge0} \to\mathbb{R}\), f(d)=O(g(d)) if |f(d)|≤c|g(d)| for a positive constant c. We also use the notation M(d)=dlog2(d)log(log(d)), where log denotes logarithm to base 2. We recall that multipoint evaluation and interpolation of univariate polynomials of degree d with coefficients in a commutative ring R of characteristic 0 can be performed with O(M(d)) operations and that multiplication and division with remainder of such polynomials can be done with O(M(d)/log(d)) arithmetic operations in R. We write Ω for the exponent (Ω<4) in the complexity O(d Ω) of operations (determinant and adjoint computations) on d×d matrices with entries in a commutative ring R [3]. We use Padé approximation in order to compute the dense representation of the numerator and denominator of a rational function \(f = \frac {p}{q}\in k(Y)\) with max{degp,degq}≤d from its Taylor series expansion up to order 2d, which we do by means of subresultant computations within O(d Ω+1) arithmetic operations in k (see [48, Corollaries 5.21 and 6.48]).
Our algorithms are probabilistic in the sense that they make random choices of points (which we consider cost-free in our complexity estimates) which lead to a correct computation provided the points lie outside certain proper Zariski closed sets of suitable affine spaces. Using the Schwartz–Zippel Lemma [41, 49], the error probability of our algorithms can be controlled by making these random choices within sufficiently large sets of integer numbers whose size depend on the degrees of the polynomials defining the previously mentioned Zariski closed sets.
We use two previous algorithms as subroutines:
-
a probabilistic algorithm that, given n generic sparse polynomials in n variables with coefficients in k, computes a geometric resolution of the set of their common zeros in \(\overline{k}^{n}\) (see [30, Sect. 5]),
-
a Newton–Hensel-based procedure that, given a system f of n polynomials in n variables and t parameters, a specialization point \(\xi\in\mathbb{Q}^{t}\) for the parameters and a geometric resolution of the set of simple common zeros of f(ξ,⋅), computes an approximation up to a given precision of the geometric resolution of the components of V(f) where the Jacobian determinant of the system with respect to its n variables does not vanish identically (see [40, Sect. 4.2]).
The algorithm in [30] assumes that the mixed cells in a fine mixed subdivision of the family of the input supports are given (see [26] for the definition of a fine mixed subdivision), that is to say, the computation of mixed cells is considered as a pre-processing. In this paper, our algorithms also require deciding whether a mixed volume is zero or not, and computing mixed volumes.
While the non-vanishing of a mixed volume can be decided algorithmically in polynomial time, the problem of computing mixed volumes is known to be #P-hard (see [14, 16]). The mixed cells in a fine mixed subdivision of a family of finite sets of \(\mathbb{Z}^{n}\) (and, therefore, the mixed volume) can be obtained algorithmically by means of lifting (see [26]) and linear programming-based procedures. In [16], an algorithm following this approach is presented with a worst-case complexity single exponential in n. Successive algorithms that run faster according to numerical results can be found in [33] and [19]. A dynamic approach which does not use a random lifting function is given in [47]. The dynamic enumeration procedure from [35], which proved to be efficient even for large systems, seems to be the fastest known up until now. However, there are no explicit complexity upper bounds for these more efficient procedures, neither depending on the input nor the output size (namely, the mixed volume).
As in [30], we consider the computation of mixed cells as a pre-processing and do not include its cost in our complexity estimates. Note that if our algorithm needs to be applied to several systems with the same supports, this pre-processing only has to be carried out once.
3 Theoretical Results
Let n,r be positive integers and \(\mathcal{A}= (\mathcal {A}_{1},\ldots, \mathcal{A}_{r})\) a family of r finite subsets of \((\mathbb{Z}_{\ge0})^{n}\). Consider r polynomials in n variables X=(X 1,…,X n ) supported on \(\mathcal{A}\) with indeterminate coefficients: for every 1≤j≤r, let
where \(C_{j}:=(C_{j,a})_{a\in\mathcal{A}_{j}}\) is a set of \(N_{j}:= \# \mathcal {A}_{j}\) new indeterminates. Let f:=(f 1,…,f r ) and \(K:= \mathbb{Q}(C_{1},\ldots, C_{r})\).
3.1 Reduction to the Toric Case
Consider the family f=(f 1,…,f r ) of polynomials in K[X 1,…,X n ] supported on \(\mathcal{A}\) with indeterminate coefficients introduced in (1). Since f is a generic system supported on \(\mathcal{A}\), the equidimensional decomposition of the affine variety \(V(\mathbf{f})\subset\overline{K}^{n}\) depends only on the combinatorial structure of \(\mathcal{A}\) (see [25]). Moreover, its equidimensional components can be defined from certain smaller polynomial families f I in fewer variables associated to f. This decomposition, which we explain below, enables us to reduce the problem of computing a projection of the affine variety V(f) to the computation of finitely many projections of varieties having a non-empty intersection with the torus.
For I⊂{1,…,n}, given a polynomial f∈K[X 1,…,X n ], we write f I to denote the polynomial in K[(X i ) i∉I ] obtained from f by specializing X i =0 for every i∈I. We define \(J_{I} = \{ j\in\{1,\ldots, r\} \mid\exists a\in\mathcal{A}_{j} : a_{i} =0 \ \forall\, i\in I \}\) and \(\mathbf{f}_{I}= ((f_{j})_{I})_{j\in J_{I}}\) (note that J I is the set of indices of those polynomials f j that do not vanish identically when specializing X i =0 for every i∈I). Let
where, for every 1≤j≤r, \(\mathcal{A}_{j}^{I}\subset(\mathbb {Z}_{\ge0})^{n-\#I}\) denotes the support of (f j ) I . Finally, let \(\varphi_{I}: \overline{K}^{n-\#I} \to\overline{K}^{n}\) be the map that inserts zeros in the coordinates indexed by I. Using the previous notation, according to [25, Theorem 7], we have
Let \(\pi: \overline{K}^{n} \to\overline{K}^{\ell}\) be the projection π(x 1,…,x n )=(x 1,…,x ℓ ). For I⊂{1,…,n}, let ℓ I =ℓ−#(I∩{1,…,ℓ}), \(\pi_{\ell_{I}} : \overline{K}^{n-\# I}\to\overline{K}^{\ell_{I}}\) the projection to the first coordinates and \(\varphi_{I\cap\{1,\ldots,\ell\}}: \overline{K}^{\ell_{I}} \to\overline{K}^{\ell}\) the map that inserts zeros in the coordinates indexed by I∩{1,…,ℓ}. As \(\pi(\varphi_{I}(V^{*}(\mathbf{f}_{I}))) = \varphi_{I\cap\{1,\ldots,\ell \}}(\pi_{\ell_{I}}(V^{*}(\mathbf{f}_{I})))\) holds for every I, we conclude that
Therefore, the computation of \(\overline{\pi(V(\mathbf{f}))}\) amounts to obtaining the Zariski closures of the projections by \(\pi _{\ell_{I}}\) of the affine varieties \(V^{*}(\mathbf{f}_{I})= \overline{\{ x\in(\overline{K}^{*})^{n-\# I} \mid\mathbf{f}_{I}(x)=0 \}}\). For this reason, in the sequel we will deal with this problem.
3.2 The Vanishing Ideal of the Projection
We consider a generic polynomial system f:=(f 1,…,f r ) supported on \(\mathcal{A}:=(\mathcal{A}_{1},\ldots, \mathcal {A}_{r})\) with indeterminate coefficients \(C_{j}:=(C_{j,a})_{a\in\mathcal {A}_{j}}\) for every 1≤j≤r as in Eq. (1). We assume that r≤n and that, for every J⊂{1,…,r}, \(\dim(\sum_{j\in J} \mathcal{A}_{j})\ge\# J\) (or, equivalently, \(\mathit{MV}(\mathcal{A}, \varDelta^{(n-r)})>0\)) so that V ∗(f) is not empty.
Our aim is to obtain a geometric resolution of the Zariski closure \(\overline{\pi(V^{*}(\mathbf{f}))}\), where \(\pi: \overline{K}^{n}\to \overline{K}^{\ell}\) (ℓ≤n) is the projection defined by π(x 1,…,x n )=(x 1,…,x ℓ ). We begin by proving some basic results on the vanishing ideals of the varieties V ∗(f) and \(\overline{\pi(V^{*}(\mathbf{f}))}\).
Lemma 2
Under the previous assumptions, (f 1,…,f r ):(X 1⋯X n )∞ is a prime ideal in K[X 1,…,X n ] of dimension n−r. Moreover, I(V ∗(f))=(f 1,…,f r ):(X 1⋯X n )∞.
Proof
For every 1≤j≤r, fix \(a_{j0}\in\mathcal{A}_{j}\) and let \(\mathcal{A}_{j}':= \mathcal{A}_{j}-\{a_{j0}\}\) and \(C'_{j}:=C_{j} -\{ C_{j, a_{j0}}\}\). We denote C:=(C 1,…,C r ) and \(C':=(C_{1}',\ldots, C_{r}')\). Consider the ring morphism \(\psi: \mathbb{Q}[C,X] \to \mathbb{Q}[C',X]_{X_{1}\cdots X_{n}}\) (where \(\mathbb{Q}[C',X]_{X_{1}\cdots X_{n}}\) is the localization of \(\mathbb{Q}[C',X]\) at the multiplicative set \(\{(X_{1}\cdots X_{n})^{m} \mid m\in\mathbb{Z}_{\ge0}\}\)) defined by ψ(X i )=X i for 1≤i≤n, ψ(C j,a )=C j,a for 1≤j≤r, \(a\in\mathcal{A}'_{j}\), and \(\psi(C_{j,a_{j0}}) = -X^{-a_{j0}}(\sum_{a\in\mathcal{A}_{j}'} C_{j,a}X^{a})\) for 1≤j≤r.
We claim that ker(ψ)=(f 1,…,f r ):(X 1⋯X n )∞. It is clear that for a polynomial g∈(f 1,…,f r ):(X 1⋯X n )∞, ψ(g)=0. Assume now that for a polynomial \(g\in \mathbb{Q}[C,X]\) we have ψ(g)=0. Let \(C_{0}:=(C_{1,a_{10}},\ldots, C_{r,a_{r0}})\) and let \(\widehat{C}_{0}:=(\widehat{C}_{10},\ldots, \widehat{C}_{r0})\) be new variables. By Taylor expansion, \(g(\widehat{C}_{0},C',X) = g(C_{0}, C',X) + \sum_{1\le j \le r} (\widehat{C}_{j0}-C_{j,a_{j0}}) \cdot G_{j} \) for certain polynomials \(G_{j}\in\mathbb{Q}[\widehat{C}_{0}, C, X]\). Specializing \(\widehat{C}_{j0}= -X^{-a_{j0}}(\sum_{a\in \mathcal{A}_{j}'} C_{j,a}X^{a})\) for every 1≤j≤r, it follows that \(\psi(g) = g(C_{0},C', X)- \sum_{1\le j \le r}X^{-a_{j0}} f_{j} \cdot \widetilde{G}_{j}\) with \(\widetilde{G}_{j}\in\mathbb{Q}[C, X]_{X_{1}\cdots X_{n}}\); therefore, \(g(C,X) = \sum_{1\le j \le r}X^{-a_{j0}} f_{j} \cdot \widetilde{G}_{j}\). Multiplying by (X 1⋯X n )m for a sufficiently large m, we conclude that (X 1⋯X n )m g(C,X)∈(f 1,…,f r ).
Then, (f 1,…,f r ):(X 1⋯X n )∞ is a prime ideal of \(\mathbb{Q}[C,X]\). The first statement of the lemma follows by localizing at \(\mathbb{Q}[C]-\{0\}\).
In order to prove the second part of the lemma, consider first a polynomial g∈(f 1,…,f r ):(X 1⋯X n )∞, and let \(m\in\mathbb{Z}_{\ge0}\) be such that (X 1⋯X n )m g∈(f 1,…,f r ). Then, (X 1⋯X n )m g vanishes over V(f) and so, g vanishes over \(V(\mathbf{f})\cap (\overline{K}^{*})^{n}\); therefore, g∈I(V ∗(f)). Conversely, if g∈I(V ∗(f)), then it vanishes over each irreducible component of V(f) intersecting \((\overline{K}^{*})^{n}\) properly. Then, (X 1⋯X n )g vanishes over V(f 1,…,f r ), which implies that there exists \(m\in\mathbb{Z}_{\ge0}\) with (X 1⋯X n )m g m∈(f 1,…,f r ). Therefore, g m∈(f 1,…,f r ):(X 1⋯X n )∞ and, since this is a prime ideal, it follows that g∈(f 1,…,f r ):(X 1⋯X n )∞. □
Corollary 3
The affine variety \(V^{*}(\mathbf{f})\subset\overline{K}^{n}\) is an irreducible K-variety of dimension n−r.
Taking into account that for any K-variety \(V\subset\overline{K}^{n}\), the identity \(I(\overline{\pi(V)}) = I(V) \cap K[X_{1},\ldots, X_{\ell}]\) holds, Lemma 2 also enables us to characterize the vanishing ideal of the projection we want to compute:
Corollary 4
With the previous assumptions and notation,
Let \(t:= \dim(\overline{\pi(V^{*}(\mathbf{f}))})\). Without loss of generality, by renaming variables, we may assume that {X 1,…,X t }⊂{X 1,…,X ℓ } is a transcendence basis of \(K(\overline{\pi(V^{*}(\mathbf{f}))})\) over K and {X t+r+1,…,X n } are such that {X 1,…,X t ,X t+r+1,…,X n } is a transcendence basis of K(V ∗(f)) over K. The following proposition generically allows us to deal with projections with 0-dimensional generic fibers.
Proposition 5
There is a Zariski dense open set \(\mathcal{O}\subseteq\overline{K}^{n-t-r}\) such that, for every \(b\in K^{n-t-r} \cap\mathcal{O}\), the identity \(I(\overline{\pi(V^{*}(\mathbf{f}))}) = ((f_{1}(X_{1},\ldots,X_{t+r}, b), \ldots, f_{r}(X_{1},\ldots, X_{t+r},b)): (X_{1}\cdots X_{t+r})^{\infty})\cap K[X_{1},\ldots, X_{\ell}]\) holds.
Proof
Let \(\widehat{X}:=(X_{1},\ldots, X_{t+r})\). From Corollary 4, it is clear that \(I(\overline{\pi(V^{*}(\mathbf{f})}) ) \subset ((f_{1}(\widehat{X}, b),\ldots, f_{r}(\widehat{X},b)): (X_{1}\cdots X_{t+r})^{\infty})\cap K[X_{1},\ldots, X_{\ell}]\) for every b=(b t+r+1,…,b n ) such that b i ≠0 for every t+r+1≤i≤n.
For the converse inclusion, first note that, for every 1≤j≤r,
As in the first part of the proof of Lemma 2, it follows that (f 1,…,f j ):(X 1⋯X n )∞ is a prime ideal of dimension n−j for every 1≤j≤r; therefore, YX 1⋯X n −1, f 1,…,f r is a reduced regular sequence in K[Y,X 1,…,X n ]. Moreover, the set {X 1,…,X t ,X t+r+1,…,X n } is algebraically independent modulo (YX 1⋯X n −1,f 1,…,f r ). Then, by [11, Corollary 17 and Theorem 19], there exists a K-definable Zariski dense open set \(\mathcal{O}\subset\overline{K}^{n-t-r}\) containing {x i ≠0 ∀ 1≤i≤t+r} such that for every \(b\in\mathcal{O}\cap K^{n-t-r}\), \(c_{b}\, Y X_{1}\cdots X_{t+r}-1,f_{1}(\widehat{X}, b) ,\ldots, f_{r}(\widehat{X}, b)\) (where c b :=b t+r+1⋯b n ) is a reduced regular sequence in \(K[\widehat{X}]\) and {X 1,…,X t } is algebraically independent modulo each of the associated primes of the ideal this regular sequence generates. By noticing that
we conclude that \((f_{1}(\widehat{X}, b) ,\ldots, f_{r}(\widehat{X}, b)): (X_{1}\cdots X_{t+r})^{\infty}\) is a radical equidimensional ideal of dimension t and {X 1,…,X t } is algebraically independent modulo each of its associated primes. Then, the same holds for the ideal \(((f_{1}(\widehat{X}, b) ,\ldots, f_{r}(\widehat{X}, b)): (X_{1}\cdots X_{t+r})^{\infty})\cap K[X_{1},\ldots, X_{\ell}]\). As this ideal includes the t-dimensional prime ideal \(I(\overline{\pi(V^{*}(\mathbf{f}))})\), the equality in the statement of the proposition holds. □
3.3 Free Variables
The following result gives a combinatorial condition for the algebraic independence of a subset of variables modulo the vanishing ideal of V ∗(f) that we will use in our algorithm to compute a suitable transcendence basis of the rational fraction field of this variety.
Lemma 6
Let f 1,…,f r ∈K[X 1,…,X n ] be sparse polynomials supported on a family \(\mathcal{A}= (\mathcal{A}_{1},\ldots, \mathcal{A}_{r})\) of finite subsets of \((\mathbb{Z}_{\ge0})^{n}\) as introduced in (1), and let \(\mathcal{I}= (f_{1},\ldots, f_{r}): (X_{1}\cdots X_{n})^{\infty}\subset K[X_{1},\ldots, X_{n}]\). Let {e 1,…,e n } be the canonical basis of \(\mathbb{Q}^{n}\).
Then, the set \(\{ X_{i_{1}},\ldots, X_{i_{k}}\}\subset\{X_{1},\ldots, X_{n}\}\) for 1≤i 1<⋯<i k ≤n is algebraically independent modulo \(\mathcal{I}\) if and only if \(\mathit{MV}(\mathcal{A}_{1},\ldots, \mathcal {A}_{r}, \{0,e_{i_{1}}\},\ldots, \{0,e_{i_{k}}\}, \varDelta^{(n-r-k)})>0\).
Proof
First, assume that \(\mathit{MV}(\mathcal{A}_{1},\ldots, \mathcal {A}_{r}, \{0,e_{i_{1}}\},\ldots, \{0,e_{i_{k}}\},\varDelta^{(n-r-k)})>0\). Let Q be a non-zero polynomial in the coefficients of a system supported on \((\mathcal{A}, \{0,e_{i_{1}}\}, \ldots, \{0,e_{i_{k}}\}, \varDelta^{(n-r-k)})\) such that for each coefficient vector (c,η)=(c 1,…,c r ,η r+1,…,η n ) with coordinates in \(\mathbb{Q}^{*}\) and Q(c,η)≠0, the corresponding sparse system has as many isolated solutions in \((\mathbb{C}^{*})^{n}\) as the mixed volume.
Let \(p\in\mathcal{I} \cap K[X_{i_{1}},\ldots, X_{i_{k}}]\). Without loss of generality, we may assume that \(p\in\mathbb{Q}[C,X_{i_{1}},\ldots, X_{i_{k}}]\). Then there exist \(m\in\mathbb{Z}_{\ge0}\) and a non-zero polynomial \(p_{0}\in\mathbb{Q}[C]\) such that
with \(g_{1},\ldots, g_{r}\in\mathbb{Q}[C,X_{1},\ldots, X_{n}]\).
For each (c,η) with coordinates in \(\mathbb{Q}^{*}\) such that p 0(c)Q(c,η)≠0, considering a solution \(\xi\in(\mathbb {C}^{*})^{n}\) of the system
and specializing identity (2) in (c,η,ξ) we obtain p(c,−η r+1,1/η r+1,2,…,−η r+k,1/η r+k,2)=0. We conclude that p≡0.
Assume now that \(\{X_{i_{1}},\ldots, X_{i_{k}}\}\) is algebraically independent modulo \(\mathcal{I}= I(V^{*}(\mathbf{f}))\). Let l 1,…,l n−r−k be linear forms in the variables X 1,…,X n with coefficients in K ∗ such that \(\{X_{i_{1}},\ldots, X_{i_{k}}, l_{1},\ldots, l_{n-r-k}\}\) is a transcendence basis of K(V ∗(f)). Since \(V^{*}(\mathbf {f})\cap(\overline{K}^{*})^{n}\) is a dense open subset of V ∗(f), for a generic \((\zeta_{1},\ldots, \zeta_{n-r})\in(\overline{K}^{*})^{n-r}\), the set
is not empty and consists of finitely many points in \((\overline{K}^{*})^{n}\). These points are the common solutions in \((\overline{K}^{*})^{n}\) of the system
which is supported on \((\mathcal{A}_{1},\ldots, \mathcal{A}_{r}, \{0, e_{i_{1}}\},\ldots, \{ 0, e_{i_{k}}\}, \varDelta^{(n-r-k)})\). By Bernstein’s Theorem, we conclude that \(\mathit{MV}(\mathcal{A}_{1},\ldots, \mathcal{A}_{r}, \{0, e_{i_{1}}\},\ldots, \{ 0, e_{i_{k}}\}, \varDelta^{(n-r-k)})>0\). □
Remark 7
For 1≤i 1<⋯<i k ≤n, we have
(see [42, Lemma 6]), where \(\varpi: \mathbb{R}^{n}\to\mathbb{R}^{n-k}\) is the projection to the coordinates indexed by {1,…,n}−{i 1,…,i k }. This implies that, in order to determine whether a set of k variables is algebraically independent or not, it suffices to determine whether an (n−k)-dimensional mixed volume is zero or not.
4 Algorithms
In this section we will present an algorithm to compute the Zariski closure of the projection of V ∗(f) to the first ℓ coordinates, where f is a generic polynomial system with given supports. First, we describe some subroutines.
4.1 Subroutines
The first subroutine we will use, which follows from Lemma 6, computes a transcendence basis of K(V ∗(f)) containing a transcendence basis of \(K(\overline{\pi(V^{*}(\mathbf{f}))})\).
Note that the above algorithm requires to decide whether the mixed volume of a family of finite sets is non-zero for at most n families. Following [14, Theorem 8], there is a polynomial time algorithm to achieve this task based on the matroid intersection algorithm from [15]. Therefore, Algorithm TransBasis runs in polynomial time.
Without loss of generality, by renaming variables, we may assume that the transcendence basis of K(V ∗(f)) obtained by applying algorithm TransBasis is {X 1,…,X t ,X t+r+1,…,X n } with t≤ℓ and ℓ≤t+r. Then, {X 1,…,X t } is a transcendence basis of \(K(\overline{\pi (V^{*}(\mathbf{f}))})\). Let f b be the polynomial system obtained by evaluating X t+r+1,…,X n in a generic point b. By Proposition 5, we may obtain \(\overline{\pi(V^{*}(\mathbf{f}))}\) using the system f b . In order to do this, we will compute first a geometric resolution of V ∗(f b ) by means of the subroutine we introduce below.
Let k be a field of characteristic 0. Algorithm ParametricToricGeomRes computes a geometric resolution of the variety V ∗(g) defined from a generic sparse system g:=(g 1,…,g r ) in k[X 1,…,X t+r ] with given supports \(\mathcal {S}:=(\mathcal{S}_{1},\ldots,\mathcal{S}_{r})\), provided that {X 1,…,X t } is a set of independent variables for all its components. This subroutine is obtained by following the parametric geometric resolution algorithm from [40, Theorem 2] taking X 1,…,X t as the parameters.
Before estimating the complexity of the previous algorithm, we present a simple example to illustrate how the algorithm works.
Example
Let g be the following sparse system supported on \(\mathcal {S}=(\mathcal{S}_{1}, \mathcal{S}_{2})\), where \(\mathcal{S}_{1}=\{(0,0,0), (1,1,0),(0,1,1)\}\) and \(\mathcal{S}_{2}=\{(0,0,0), (2,1,1),(0,2,0),(1,1,1)\}\):
Here, {X 1} is algebraically independent modulo (g 1,g 2):(X 1 X 2 X 3)∞.
In step 1, the algorithm chooses a value \(\xi_{1}\in\mathbb{Z}-\{0\}\) at random and specializes X 1=ξ 1. Setting ξ 1=1, we obtain the system
Now, the algorithm computes a geometric resolution of the common zeros of g 1 in \((\mathbb{C}^{*})^{2}\). This is a generic system supported on ({(0,0),(1,0),(1,1)},{(0,0),(1,1),(2,0)}). Then, in order to solve it, we can apply the algorithm from [30, Sect. 5]. Choosing X 3 as separating linear form, we obtain
-
\(q_{X_{3}} (Y)= Y^{2}-\frac{12}{5}Y-\frac{1}{5}\)
-
\(w_{2}(Y)= -\frac{5}{4} Y - \frac{3}{4}\)
-
w 3(Y)=Y
From this geometric resolution, the algorithm computes a sufficiently good approximation of a geometric resolution of V ∗(g).
A geometric resolution of V ∗(g) consists of univariate polynomials with coefficients that are rational functions in the parameter X 1 with numerators and denominators of degrees bounded by degV ∗(g) (see [40, Theorem 1]). These rational functions can be regarded as power series in the variable X 1 centered at ξ 1=1, and therefore, they can be recovered by means of Padé approximation from sufficiently many terms of their expansions. The precision required to do this equals \(2 \deg V^{*}(\mathbf{g}) = 2 \mathit{MV}(\mathcal {S}, \varDelta)\). Since \(\mathit{MV}(\mathcal{S}, \varDelta)=6\), in step 4, the algorithm applies a Newton–Hensel lifting (in the parameter X 1), as explained in [40, Sect. 4.2], to the geometric resolution \((q_{X_{3}}, w_{2}, w_{3})\) with precision 12, obtaining
-
\(\widehat{q}_{X_{3}} (Y)= Y^{2} + q_{1} Y+ q_{0} \)
-
\(\widehat{w}_{2}(Y)= w_{21} Y + w_{20} \)
-
\(\widehat{w}_{3}(Y) = w_{31} Y +w_{30}\)
where
-
\(q_{1}= -\frac{12}{5}-\frac{18}{5}(X_{1}-1)+\frac {18}{25}(X_{1}-1)^{2}-\frac{24}{25}(X_{1}-1)^{3}+\frac{168}{125}(X_{1}-1)^{4} -\frac{48}{25}(X_{1}- 1)^{5}+\frac{1728}{625}(X_{1}-1)^{6}-\frac {2496}{625}(X_{1}-1)^{7}+\frac{18048}{3125}(X_{1}-1)^{8} -\frac{26112}{3125}(X_{1}- 1)^{9}+\frac{188928}{15625}(X_{1}-1)^{10}-\frac {273408}{15625}(X_{1}-1)^{11}+\frac{1978368}{78125}(X_{1}-1)^{12}\)
-
\(q_{0}= -\frac{1}{5}-\frac{16}{5}(X_{1}-1)+\frac {119}{25}(X_{1}-1)^{2}-\frac{174}{25}(X_{1}-1)^{3}+\frac{1264}{125}(X_{1}-1)^{4}- \frac{1832}{125}(X_{1}- 1)^{5}+\frac{13264}{625}(X_{1}-1)^{6}-\frac {768}{25}(X_{1}-1)^{7}+\frac{138944}{3125}(X_{1}-1)^{8} - \frac{201088}{3125}(X_{1}- 1)^{9}+\frac{1455104}{15625}(X_{1}-1)^{10}-\frac {2105856}{15625}(X_{1}-1)^{11}+ \frac{15238144}{78125}(X_{1}-1)^{12}\)
-
\(w_{21}= -\frac{5}{4}-\frac{5}{2}(X_{1}-1)-(X_{1}-1)^{2}\)
-
\(w_{20}= -\frac{3}{4}-\frac{3}{4}(X_{1}-1) \)
-
w 31=1
-
w 30=0
Finally, Padé approximation is applied, following [48, Corollaries 5.21 and 6.48], to each of the coefficients previously computed in order to obtain the numerators and denominators of the coefficients in the geometric resolution of V ∗(g) associated to the linear form X 3:
-
\(Q_{X_{3}}(Y) = Y^{2} + \frac{-12X_{1}^{3}-6X_{1}^{2}+6X_{1}}{4X_{1}^{2}+2X_{1}-1} Y+ \frac{-9X_{1}^{2}+8}{4X_{1}^{2}+2X_{1}-1}\)
-
\(W_{2}(Y)= (-X_{1}^{2}-\frac{1}{2}X_{1}+\frac{1}{4}) Y -\frac{3}{4}X_{1} \)
-
W 3(Y)=Y
Now we explain how the different steps of the algorithm are carried out in general and estimate the number of operations in k it performs. We will use the notation \(N=\sum_{1\le j \le r}\# \mathcal{S}_{j}\) and d=max1≤j≤r deg(g j ).
In Step 2, the algorithm computes the sparse encoding of g(ξ,X t+1,…,X t+r ) within O(r(t+r)Nlog(d)) operations. Then, this system is solved using the procedure from [30, Sect. 5] within complexity \(O(r^{3} N \log(d) M(\mathcal {M}) M(\mathit{MV}(\varpi(\mathcal{S}))) (M(\mathit{MV}(\varpi(\mathcal{S})))+ M(\omega_{\max} \sum_{1\le h\le r} \mathit{MV}(\varpi(\mathcal{S}_{j})_{j\ne h}, \varDelta))))\), where ϖ is the projection to the last r coordinates, ω max is the maximum of the values taken by a generic lifting function ω for \(\varpi(\mathcal{S}):=(\varpi(\mathcal{S}_{1}),\ldots, \varpi(\mathcal{S}_{r} ))\), and \(\mathcal{M}:= \max\{ \| \mu\| \}\), the maximum ranging over all primitive normal vectors to the mixed cells in the fine mixed subdivision of \(\varpi(\mathcal{S})\) induced by ω.
In Step 3, an slp of length O((t+r)Nlog(d)) encoding the polynomials g is obtained from their sparse representation.
The next step is performed modifying the procedure in [40, Sect. 4.2] in order to use straight-line programs for computations with truncated multivariate power series. We represent each of these series as the vector of its homogeneous components and these components by means of straight-line programs. The required precision is \(2 \deg(V^{*}(\mathbf{g}))= 2 \mathit{MV}(\mathcal{S}, \varDelta^{(t)})\) (see [40, Theorem 1] and [25, Lemma 1]). Therefore, this step is done within complexity \(O((r(t+r)N\log(d)+ r^{4}) M (\mathit{MV}(\varpi(\mathcal{S}))) \mathit{MV}(\mathcal {S}, \varDelta^{(t)})^{2})\) and produces an slp of the same order encoding the homogeneous components of the coefficients of the output.
Finally, the Padé approximation to obtain the coefficients of the geometric resolution is done by reducing it to a univariate problem and solving it by means of subresultant computations following [48, Corollaries 5.21 and 6.48]. This step adds \(O(rMV(\mathcal{S}, \varDelta^{(t)})^{\varOmega+2})\) operations to the previous complexity and \(O(rMV(\mathcal{S}, \varDelta ^{(t)})^{\varOmega+1})\) to the slp length.
Therefore, we have the following complexity result:
Lemma 8
Let g:=(g 1,…,g r ) in k[X 1,…,X t+r ] be a generic sparse system with supports \(\mathcal{S}:=(\mathcal {S}_{1},\ldots,\mathcal{S}_{r})\) such that {X 1,…,X t } is algebraically independent modulo each associated prime of (g 1,…,g r ):(X 1⋯X t+r )∞. Algorithm ParametricToricGeomRes computes a geometric resolution of V ∗(g). With the previous notation, the total number of operations in k performed by the algorithm is of order
The algorithm produces an slp of length
for the coefficients of the output.
The last step of our main algorithm consists in describing the projection to a coordinate subspace of an equidimensional variety of dimension t given by a geometric resolution in the case that the projection has the same dimension t. To do this, we apply the subroutine described below.
Let \(V \subset\overline{k}^{t+r}\) be an equidimensional variety of dimension t definable over k and such that for each irreducible component W of V, I(W)∩k[X 1,…,X t ]={0} holds. Consider the projection \(\pi: \overline{k}^{t+r} \to\overline{k}^{\ell}\) where ℓ>t, π(x 1,…,x t+r )=(x 1,…,x ℓ ). Note that {X 1,…,X t } are free variables with respect to each irreducible component of \(\overline{\pi(V)}\). Let \(\mathbb{K}:= k(X_{1},\ldots, X_{t})\). Suppose λ∈k[X t+1,…,X t+r ] is a primitive element for \(\mathbb{K}\otimes k[V]\) and let \((q_{\lambda}, w_{t+1},\ldots, w_{t+r})\in\mathbb{K}[Y]^{r+1}\) be the associated geometric resolution. Let D be the dimension of \(\mathbb {K}\otimes k[V]\) as \(\mathbb{K}\)-vector space.
Let μ=μ t+1 X t+1+⋯+μ ℓ X ℓ be a primitive element for \(\mathbb{K}\otimes k[\overline{\pi(V)}]\). As \(I(\overline{\pi(V)}) = I(V) \cap k[X_{1},\ldots, X_{\ell}]\), to find the minimal polynomial of μ with respect to \(\overline{\pi(V)}\), it suffices to find a polynomial \(q_{\mu} \in\mathbb{K}[Y]\) of minimal degree such that \(q_{\mu}(\mu)\in\mathbb{K}\otimes I(V)\). Then, δ:=deg Y (q μ ) is the dimension of \(\mathbb {K}\otimes k[\overline{\pi(V)}]\) as a \(\mathbb{K}\)-vector space and so, for each t+1≤j≤ℓ, in order to obtain a polynomial v j such that X j =v j (μ) in \(\mathbb{K}\otimes k[\overline{\pi(V)}]\) it suffices to find a linear combination \(X_{j} = \sum_{i=0}^{\delta- 1} v_{ji} \mu^{i}\) of {1,μ,…,μ δ−1} in \(\mathbb{K}\otimes k[V]\). To do this, we use the basis B λ :={1,λ,…,λ D−1} of \(\mathbb {K}\otimes k[V]\).
In order to compute the geometric resolution of \(\overline{\pi(V)}\) associated with μ, we first look for the minimal power μ δ which is a \(\mathbb{K}\)-linear combination of {1,μ,…,μ δ−1} in \(\mathbb{K}\otimes k[V]\). Since X j =w j (λ) for every t+1≤j≤t+r, we have \(\mu= \sum_{j=t+1}^{\ell} \mu_{j} w_{j}(\lambda)= p_{\mu}(\lambda )\), where \(p_{\mu}(Y):=\sum_{j=t+1}^{\ell} \mu_{j} w_{j}(Y)\) and, for every \(i\in\mathbb{N}\), μ i=p μ (λ)i=(p μ (Y)imodq λ (Y))| Y=λ . Therefore, δ equals the rank of the D×D matrix whose columns are the coefficients of the polynomials (p μ (Y)imodq λ (Y)) for i=0,…,D−1.
Then, we obtain the minimal polynomial \(q_{\mu}(Y):=Y^{\delta}+ \sum_{i=0}^{\delta-1} q_{\mu,i} Y^{i}\) of μ and the polynomials \(v_{j}(Y)=\sum_{i=0}^{\delta-1} v_{ji} Y^{i}\), t+1≤j≤ℓ, which form the geometric resolution of \(\overline{\pi(V)}\), by solving the linear systems obtained by equating the coefficients of the different powers of λ in the identities
Summarizing, with the previous notation and hypothesis, we have the following.
The correctness of the procedure follows from our previous arguments. Now we estimate its complexity. Assume that the input polynomials are encoded in dense form as degree D univariate polynomials in k(X 1,…,X t )[Y] and their coefficients are encoded by an slp over k of length L.
In Step 1, the algorithm computes an slp encoding the coefficients of p μ of length bounded by L+2D(ℓ−t).
In order to fulfill Step 2, we first compute recursively the powers Y h for h=D,…,2D−2 modulo q λ (Y) from the coefficients of q λ and then, we obtain an slp of length O(L+D(ℓ−t)+D 3) for the coefficients of \(p_{\mu^{i}}\) for i=2,…,D by expanding the product \(p_{\mu}(Y)\cdot p_{\mu^{i-1}}(Y)\) and substituting the powers of Y by their previously computed expressions.
Step 3 is done probabilistically by choosing a point x=(x 1,…,x t ) at random, evaluating the involved rational functions at this point within O(L+D(ℓ−t)+D 3) operations in k and finally computing the rank δ of \((\mathbf{p}_{\mu^{0}}(x),\mathbf {p}_{\mu}(x),\ldots, \mathbf{p}_{\mu^{D-1}}(x))\) within O(D ω) operations in k, with ω<3 (see [5, Chap. 2, Sect. 2]).
To solve the linear systems involved in Step 5, the algorithm computes the invertible matrix P t P within O(Dδ 2) operations in k, its adjoint matrix and determinant within O(δ Ω) additional operations, and the products \(\operatorname{adj}(\mathbf{P}^{t}\mathbf{P}) \textbf{p}_{\mu^{\delta}}\) and \(\operatorname{adj}(\mathbf{P}^{t}\mathbf{P}) \textbf{w}_{j}\) for t+1≤j≤ℓ with O(δ 2(ℓ−t)).
Adding the previous estimates, we conclude that the algorithm produces an slp of length O(L+D 3+Dδ(ℓ−t)+δ Ω) over k within a complexity of the same order. Taking into account that δ≤D, we have the following.
Lemma 9
Let \(V \subset\overline{k}^{t+r}\) be an equidimensional variety of dimension t definable over k and such that for each irreducible component W of V, I(W)∩k[X 1,…,X t ]={0} holds. With the previous notation, given a geometric resolution of V with free variables X 1,…,X t , and a linear form μ which is a primitive element for \(\mathbb{K}\otimes k[\overline{\pi(V)}]\), Algorithm GeomResProj computes the geometric resolution of the projection \(\overline{\pi(V)} \subset\overline{k}^{\ell}\) with free variables X 1,…,X t associated to the linear form μ within O(L+D Ω+D 2(ℓ−t)) operations in k. The output of the algorithm is encoded by an slp of length of the same order.
4.2 An Algorithm to Find the Projection
Here we present a probabilistic algorithm that, from a fixed family \(\mathcal{A}=(\mathcal{A}_{1},\ldots, \mathcal{A}_{r})\) of finite sets \((\mathbb{Z}_{\ge0})^{n}\) and a fixed family of variables X 1,…,X ℓ , obtains a geometric resolution of \(\overline{\pi(V^{*}(\mathbf{f}))} \subset\overline{K}^{\ell}\), where f=(f 1,…,f r ) is defined in Eq. (1) and \(\pi:\overline{K}^{n} \to\overline{K}^{\ell}\) is the projection π(x 1,…,x n )=(x 1,…,x ℓ ).
Theorem 10
Given a family \(\mathcal{A}= (\mathcal{A}_{1},\ldots, \mathcal{A}_{r})\) of finite subsets of \((\mathbb{Z}_{\ge0})^{n}\) such that \(\dim(\sum_{j\in J} \mathcal{A}_{j} )\ge\# J\) for every J⊂{1,…,r} and the projection \(\pi: \overline{K}^{n}\to\overline{K}^{\ell}\) to the first coordinates, Algorithm K-Projection is a probabilistic procedure that computes a geometric resolution of \(\overline{\pi(V^{*}(\mathbf{f}))}\) for the sparse system f supported on \(\mathcal{A}\) with indeterminate coefficients within
operations in K, where \(N= \sum_{1\le j \le r} \# \mathcal{A}_{j}\), d:=max1≤j≤r {deg X (f j )}, \(\mathcal{D}= \mathit{MV}(\mathcal {A}, \varDelta^{(n-r)})\), \(\mathcal{E}= \sum_{1\le h \le r} \mathit{MV}( ((\mathcal{A}_{j})_{j\ne h}), \varDelta^{(n-r+1)})\), and Ξ is a constant measuring the size of certain combinatorial objects involved at intermediate computations and associated to the family of supports \(\mathcal{A}\).
Proof
Since in our complexity estimates we only take into account the number of operations in K (and not mixed volume or mixed subdivision computations), to obtain the overall complexity of the algorithm it suffices to add the complexities of steps 3, 4 and 6.
Step 3 can be done within O(n 2 Nlog(d)) operations in K.
The complexity of Step 4 is the already stated for Algorithm ParametricToricGeomRes in Lemma 8. Note that for generic b, the system f b is a generic polynomial system supported on \(\mathcal{S}:=(\mathcal{S}_{1},\ldots, \mathcal{S}_{r})\), where \(\mathcal{S}_{j}\subset(\mathbb{Z}_{\ge0})^{t+r}\) is the projection of \(\mathcal{A}_{j}\) to the first t+r coordinates for every 1≤j≤r. Moreover, by Bernstein’s Theorem, \(\mathit{MV}(\mathcal{S}, \varDelta^{(t)}) \le \mathit{MV}(\mathcal{A}, \varDelta^{(n-r)})\), \(\mathit{MV}(\varpi (\mathcal{S}))\le \mathit{MV}(\mathcal{A}, \varDelta^{(n-r)})\) and, for every 1≤h≤r, \(\mathit{MV}(\varpi((\mathcal{S}_{j})_{j\ne h}), \varDelta) \le \mathit{MV}(((\mathcal{A}_{j})_{j\ne h}), \varDelta^{(n-r+1)})\). We take Ξ such that \(M(\mathcal{M}) M(\omega_{\max}) \le \varXi\).
Finally, the complexity of Step 6 follows from Lemma 9. Note that, here, \(D\le \mathit{MV}(\mathcal{A}, \varDelta^{(n-r)})\) and L is the length of the slp computed in Step 4 according to Lemma 8. □
4.3 Example
Consider a sparse system of two polynomials in five variables supported on \(\mathcal{A}=(\mathcal{A}_{1}, \mathcal{A}_{2})\), where \(\mathcal {A}_{1}=\{(0,0,0,0,0), (1,1,1,0,0),(2,0,0,4,2),(0,0,0,8,4)\}\) and \(\mathcal{A}_{2}=\{(1,0,1,1,2),(0,1,2,5,4),(1,3,0,5,4)\}\), with indeterminate coefficients:
and the projection \(\pi: \mathbb{A}^{5} \to\mathbb{A}^{3}\), π(x 1,x 2,x 3,x 4,x 5)=(x 1,x 2,x 3). We are going to show a geometric resolution of \(\overline{\pi(V^{*}(\mathbf{f}))}\) following Algorithm K-Projection.
First, we apply Algorithm TransBasis and we find that {X 1,X 2,X 4} is a transcendence basis of K(V ∗(f)), and so, {X 1,X 2} is a transcendence basis of \(K(\overline{\pi(V^{*}(\mathbf{f}))})\).
Now, the algorithm chooses a value b at random and specializes X 4=b. Set b=1. The specialized system is
The next step is to apply Algorithm ParametricToricGeomRes with free variables X 1,X 2. We choose λ=X 5 as the primitive element to obtain the geometric resolution:
-
\(q_{X_{5}}(Y)= Y^{10}+\frac{2C_{13}X_{1}^{2}}{C_{14}}Y^{8}+\frac {C_{13}^{2}X_{1}^{4}+2C_{11}C_{14}}{C_{14}^{2}}Y^{6} +\frac {(-C_{12}C_{21}C_{14}+2C_{11}C_{22}C_{13})X_{1}^{2}}{C_{22}C_{14}^{2}}Y^{4} + \frac {-C_{12}C_{21}C_{13}X_{1}^{4}+C_{11}^{2}C_{22}+C_{12}^{2}C_{23}X_{1}^{3}X_{2}^{4}}{C_{22}C_{14}^{2}}Y^{2}-\frac {C_{12}C_{21}C_{11}X_{1}^{2}}{C_{22}C_{14}^{2}}\)
-
\(w_{3}(Y)= -\frac{C_{14}}{C_{12}X_{1}X_{2}} Y^{4} - \frac {C_{13}X_{1}}{C_{12}X_{2}} Y^{2} - \frac{C_{11}}{C_{12} X_{1} X_{2}}\)
-
w 5(Y)=Y
Finally, Algorithm GeomResProj is applied to \(q_{X_{5}}, w_{3}, w_{5}\) and a primitive element μ for \(K(\overline{\pi(V^{*}(\mathbf {f}))})\). In this case, we take μ=X 3 and obtain
-
\(q_{X_{3}}(Y)= Y^{5}+\frac{C_{11}}{C_{12}X_{1}X_{2}}Y^{4}+\frac {2C_{12}C_{23}X_{1}X_{2}^{4}-C_{13}C_{21}X_{1}^{2}}{C_{12}C_{22}X_{2}^{2}}Y^{3}+ \frac {2C_{23}C_{22}C_{11}X_{2}^{4}+C_{21}^{2}C_{14}X_{1}}{C_{12}C_{22}^{2}X_{2}^{3}}Y^{2} + \frac {C_{12}C_{23}^{2}X_{1}^{2}X_{2}^{4}-C_{13}C_{21}C_{23}X_{1}^{3}}{C_{12}C_{22}^{2}}Y+\frac {C_{23}^{2}C_{11}X_{1}X_{2}^{3}}{C_{22}^{2}C_{12}} \)
-
v 3(Y)=Y
4.4 Computation for Generic Rational Coefficients
We are now going to show that, for a system with generic rational coefficients, the same steps as the ones in Algorithm K-Projection can be performed using these rational coefficients to obtain a geometric resolution of the Zariski closure of the projection of the associated variety.
Let {X 1,…,X t ,X t+r+1,…,X n } be a transcendence basis of K(V ∗(f)) such that {X 1,…,X t } is a maximal algebraically independent subset of {X 1,…,X ℓ }.
As shown in [11, Appendix A], there exists a non-empty Zariski open set \(\mathcal{U}_{1}\subset\mathbb{A}^{N}\) such that if \(c=(c_{1},\ldots, c_{r})\in\mathcal{U}_{1}\cap\mathbb{Q}^{N}\), then:
-
the ideal I c :=(f 1(c,X),…,f r (c,X)):(X 1⋯X n )∞ is radical equidimensional of dimension n−r,
-
{X 1,…,X t ,X t+r+1,…,X n } is algebraically independent modulo each of the associated primes of I c ,
-
for every 1≤k≤ℓ−t, {X 1,…,X t ,X t+k } is algebraically dependent modulo I c .
Assume \(c\in\mathcal{U}_{1}\cap\mathbb{Q}^{N}\) and consider the variety \(V^{*}(\mathbf{f}(c)) \subset\mathbb{A}^{n}\). Let W be an irreducible component of V ∗(f(c)). We have dim(W)=n−r and {X 1,…,X t ,X t+r+1,…,X n } is a transcendence basis of \(\mathbb{Q}(W)\). Then, the projection of W over the last n−t−r coordinates is a dominant map. Therefore, there is a Zariski open subset \(\mathcal{O}_{W}\subset\mathbb{A}^{n-t-r}\) such that, for every \(b\in\mathcal{O}_{W} \cap\mathbb{Q}^{n-t-r}\):
-
W b :=W∩{x t+r+1=b t+r+1,…,x n =b n } is an equidimensional variety of dimension t,
-
{X 1,…,X t } is algebraically independent modulo I(W b ).
Then, for every \(b\in\mathcal{O}_{W}\cap\mathbb{Q}^{n-t-r}\), the identity \(\overline{\pi(W)} = \overline{\pi(W_{b})}\) holds.
As the dimension of the set \(\partial(\mathbf{f}(c)) := V^{*}(\mathbf {f}(c)) - \{ x \in(\mathbb{C}^{*})^{n} : \mathbf{f}(c,x)=0\}\) is less than n−r, for every {i 1,…,i t }⊂{1,…,t+r}, there exists a non-zero polynomial \(p_{i_{1},\ldots, i_{t}}(X_{i_{1}},\ldots, X_{i_{t}}, X_{t+r+1},\ldots, X_{n})\) vanishing identically on this set. Then, there is a non-empty Zariski open set \(\mathcal{O}_{1}\subset \mathbb{A}^{n-t-r}\) such that for every \(b\in\mathcal{O}_{1}\), the dimension of ∂(f(c))∩{x t+r+1=b t+r+1,…,x n =b n } is less than t.
Then, for every \(b\in\mathcal{O}_{1} \cap\bigcap_{W} \mathcal{O}_{W}\cap (\mathbb{Q}^{*})^{n-t-r}\), we have
and π(V ∗(f(c)))=⋃ W π(W)=⋃ W π(W b )=π(V ∗(f(c))∩{x t+r+1=b t+r+1,…,x n =b n }); therefore,
For j=1,…,r, let \(\mathcal{S}_{j}\subset(\mathbb{Z}_{\ge 0})^{t+r}\) be the projection of \(\mathcal{A}_{j}\) to the first t+r coordinates. Then \(f_{j}= \sum_{\hat{a} \in\mathcal{S}_{j}} (\sum_{(\hat{a},\tilde{a})\in\mathcal{A}_{j}} C_{j,(\hat{a},\tilde{a})} \tilde{X}^{\tilde{a}}) \hat{X}^{\hat{a}} \). Algorithm ParametricToricGeomRes works for generic sparse polynomial systems g supported on \(\mathcal{S} = (\mathcal {S}_{1},\ldots, \mathcal{S}_{r})\), that is, there is a polynomial \(p_{\mathcal{S}}\) in the coefficients of the system such that it computes a geometric resolution of \(V^{*}(\mathbf{g}(\hat{c}))\) for every vector of coefficients \(\hat{c}\) with \(p_{\mathcal{S}}(\hat{c})\ne0\). Let \(\mathcal{U}_{2}\subset\mathbb{A}^{N}\) be a non-empty Zariski open set such that, for every c:=(c 1,…,c r ), the polynomial \(p_{\mathcal{S}}((\sum_{(\hat{a},\tilde{a})\in\mathcal{A}_{j}} c_{j,(\hat{a},\tilde{a})} \tilde{X}^{\tilde{a}})_{1\le j \le r, \hat{a}\in \mathcal{S}_{j}})\) does not vanish identically.
For \(c\in\mathcal{U}_{2}\cap\mathbb{Q}^{N}\), there exists a non-empty Zariski open set \(\mathcal{O}_{2}\subset\mathbb{A}^{n-t-r}\) such that for every \(b\in\mathcal{O}_{2} \cap\mathbb{Q}^{n-t-r}\), the algorithm ParametricToricGeomRes can be applied to the system \(\mathbf {f}(c,\hat{x}, b)\).
We conclude that, for coefficient vectors \(c\in\mathcal{U}_{1} \cap \mathcal{U}_{2} \cap(\mathbb{Q}^{*})^{N}\), a probabilistic algorithm \(\mathbb{Q}\)-Projection which follows the same steps as Algorithm K-Projection can be applied in order to compute a geometric resolution of \(\overline{\pi(V^{*}(\mathbf{f}(c)))}\). Taking into account the complexity estimates in Theorem 10, this proves Theorem 1.
Finally, we show an example where, following the steps of the algorithm \(\mathbb{Q}\)-Projection, we obtain a geometric resolution of \(\overline{\pi(V^{*}(\mathbf {f}))}\) for a sparse system f with rational coefficients.
Example
Let f be the following sparse system with the same support family \(\mathcal{A}=(\mathcal{A}_{1}, \mathcal{A}_{2})\) as in the example of Sect. 4.3:
and let \(\pi: \mathbb{C}^{5} \to\mathbb{C}^{3}\) be the projection π(x 1,x 2,x 3,x 4,x 5)=(x 1,x 2,x 3).
We use the same choices b=1, λ=X 5 and μ=X 3 as in Sect. 4.3 and look at steps 4 and 6 of the algorithm.
In Step 4, the algorithm ParametricToricGeomRes computes the geometric resolution of V ∗(f 1) with free variables X 1,X 2 associated to the linear form λ:
-
\(\widehat{q}_{X_{5}}(Y)= Y^{10}-\frac{2X_{1}^{2}}{5}Y^{8}+\frac{X_{1}^{4}+30}{ 25}Y^{6}+\frac{2X_{1}^{2}}{75}Y^{4}-\frac{4X_{1}^{4} -27+28X_{1}^{3}X_{2}^{4}}{ 75}Y^{2}+\frac{4X_{1}^{2}}{25}\),
-
\(\widehat{w}_{3}(Y)= \frac{-5}{2X_{1}X_{2}} Y^{4} + \frac{X_{1}}{2X_{2}} Y^{2} - \frac{3}{2 X_{1} X_{2}}\),
-
\(\widehat{w}_{5}(Y)=Y\).
In Step 6, if Algorithm GeomResProj is applied to the geometric resolution obtained in Step 4 and the linear form μ, the geometric resolution \((\widehat{q}_{X_{3}},\widehat{v}_{3})\) is obtained, where
-
\(\widehat{q}_{X_{3}}(Y)= Y^{5}+\frac{3}{2X_{1}X_{2}}Y^{4}-\frac{14X_{1}X_{2}^{4}+ X_{1}^{2}}{3X_{2}^{2}}Y^{3}+\frac {-63X_{2}^{4}+ 10X_{1}}{9X_{2}^{3}}Y^{2} +\frac{49X_{1}^{2}X_{2}^{4}+7X_{1}^{3}}{9}Y+ \frac{49X_{1}X_{2}^{3}}{6}\),
-
\(\widehat{v}_{3}(Y)=Y\).
This is, in fact, the geometric resolution of \(\overline{\pi (V^{*}(\mathbf{f}))} \) with free variables X 1,X 2 associated to the linear form μ=X 3, as can be checked, for instance, by applying a Groebner basis elimination-based procedure.
References
Adrovic, D., Verschelde, J.: Computing Puiseux series for algebraic surfaces. In: Proceedings of the 37th International Symposium on Symbolic and Algebraic Computation (ISSAC 2012), Grenoble, France, July 22–25, 2012, pp. 20–27 (2012)
Adrovic, D., Verschelde, J.: Polyhedral methods for space curves exploiting symmetry applied to the cyclic n-roots problem. In: Gerdt, V.P., Koepf, W., Mayr, E.W., Vorozhtsov, E.V. (eds.) Computer Algebra in Scientific Computing, 15th International Workshop, CASC 2013, Berlin, Germany. Lecture Notes in Computer Science, vol. 8136, pp. 10–29 (2013)
Berkowitz, S.J.: On computing the determinant in small parallel time using a small number of processors. Inf. Process. Lett. 18(3), 147–150 (1984)
Bernstein, D.N.: The number of roots of a system of equations. Funct. Anal. Appl. 9, 183–185 (1975)
Bini, D., Pan, V.Y.: Polynomial and Matrix Computations. Vol. 1. Fundamental Algorithms. Progress in Theoretical Computer Science. Birkhäuser Boston, Boston (1994)
Bürgisser, P., Clausen, M., Shokrollahi, M.A.: Algebraic Complexity Theory. Grundlehren Math. Wiss., vol. 315. Springer, Berlin (1997)
Caniglia, L.: How to compute the Chow form of an unmixed polynomial ideal in single exponential time. Appl. Algebra Eng. Commun. Comput. 1(1), 25–41 (1990)
Canny, J., Emiris, I.Z.: A subdivision-based algorithm for the sparse resultant. J. ACM 47(3), 417–451 (2000)
Chistov, A.L., Grigor’ev, D.Yu.: Complexity of quantifier elimination in the theory of algebraically closed fields. In: Mathematical Foundations of Computer Science, 1984, Prague, 1984. Lecture Notes in Computer Science, vol. 176, pp. 17–31. Springer, Berlin (1984)
Cox, D., Little, J., O’Shea, D.: Using Algebraic Geometry. Grad. Texts Math., vol. 185. Springer, New York (1998)
D’Alfonso, L., Jeronimo, G., Ollivier, F., Sedoglavic, A., Solernó, P.: A geometric index reduction method for implicit systems of differential algebraic equations. J. Symb. Comput. 46(10), 1114–1138 (2011)
D’Andrea, C.: Macaulay style formulas for sparse resultants. Trans. Am. Math. Soc. 354(7), 2595–2629 (2002)
D’Andrea, C., Dickenstein, A.: Explicit formulas for the multivariate resultant. J. Pure Appl. Algebra 164(1–2), 59–86 (2001). Effective methods in algebraic geometry (Bath, 2000)
Dyer, M., Gritzmann, P., Hufnagel, A.: On the complexity of computing mixed volumes. SIAM J. Comput. 27(2), 356–400 (1998)
Edmonds, J.: Submodular functions, matroids, and certain polyhedra. In: Combinatorial Structures and Their Applications, Proc. Calgary Internat. Conf., Calgary, Alta., 1969, pp. 69–87. Gordon & Breach, New York (1970)
Emiris, I.Z., Canny, J.F.: Efficient incremental algorithms for the sparse resultant and the mixed volume. J. Symb. Comput. 20(2), 117–149 (1995)
Emiris, I.Z., Verschelde, J.: How to count efficiently all affine roots of a polynomial system. Discrete Appl. Math. 93(1), 21–32 (1999). 13th European Workshop on Computational Geometry CG’97 (Würzburg, 1997)
Fitchas, N., Galligo, A., Morgenstern, J.: Precise sequential and parallel complexity bounds for quantifier elimination over algebraically closed fields. J. Pure Appl. Algebra 67(1), 1–14 (1990)
Gao, T., Li, T.Y.: Mixed volume computation for semi-mixed systems. Discrete Comput. Geom. 29(2), 257–277 (2003)
Gao, T., Li, T.Y., Wang, X.: Finding all isolated zeroes of polynomial systems in \(\mathbb {C}^{n}\) via stable mixed volumes. Polynomial elimination—algorithms and applications. J. Symb. Comput. 28(1–2), 187–211 (1999)
Giusti, M., Heintz, J.: Algorithmes—disons rapides—pour la décomposition d’une variété algébrique en composantes irréductibles et équidimensionelles. In: Proc. Effective Methods in Algebraic Geometry, Castiglioncello, 1990. Progr. Math., vol. 94, pp. 169–194. Birkhäuser Boston, Boston (1991)
Giusti, M., Lecerf, G., Salvy, B.: A Gröbner free alternative for polynomial system solving. J. Complex. 17(1), 154–211 (2001)
Heintz, J.: Definability and fast quantifier elimination in algebraically closed fields. Theor. Comput. Sci. 24(3), 239–277 (1983)
Herrero, M.I., Jeronimo, G., Sabia, J.: Computing isolated roots of sparse polynomial systems in affine space. Theor. Comput. Sci. 411(44–46), 3894–3904 (2010)
Herrero, M.I., Jeronimo, G., Sabia, J.: Affine solution sets of sparse polynomial systems. J. Symb. Comput. 51, 34–54 (2013)
Huber, B., Sturmfels, B.: A polyhedral method for solving sparse polynomial systems. Math. Comput. 64(212), 1541–1555 (1995)
Huber, B., Sturmfels, B.: Bernstein’s theorem in affine space. Discrete Comput. Geom. 17(2), 137–141 (1997)
Jeronimo, G., Sabia, J.: Computing multihomogeneous resultants using straight-line programs. J. Symb. Comput. 42(1–2), 218–235 (2007)
Jeronimo, G., Krick, T., Sabia, J., Sombra, M.: The computational complexity of the Chow form. Found. Comput. Math. 4(1), 41–117 (2004)
Jeronimo, G., Matera, G., Solernó, P., Waissbein, A.: Deformation techniques for sparse systems. Found. Comput. Math. 9(1), 1–50 (2009)
Khovanskii, A.G.: Newton polyhedra and toroidal varieties. Funct. Anal. Appl. 11, 289–296 (1978)
Kushnirenko, A.G.: Newton polytopes and the Bézout theorem. Funct. Anal. Appl. 10, 233–235 (1976)
Li, T.Y., Li, X.: Finding mixed cells in the mixed volume computation. Found. Comput. Math. 1(2), 161–181 (2001)
Li, T.Y., Wang, X.: The BKK root count in \(\mathbb{C}^{n}\). Math. Comput. 65(216), 1477–1484 (1996)
Mizutani, T., Takeda, A., Kojima, M.: Dynamic enumeration of all mixed cells. Discrete Comput. Geom. 37(3), 351–367 (2007)
Puddu, S., Sabia, J.: An effective algorithm for quantifier elimination over algebraically closed fields using straight line programs. J. Pure Appl. Algebra 129(2), 173–200 (1998)
Rojas, J.M.: A convex geometrical approach to counting the roots of a polynomial system. Theor. Comput. Sci. 133(1), 105–140 (1994)
Rojas, J.M.: Why polyhedra matter in non-linear equation solving. In: Topics in Algebraic Geometry and Geometric Modeling. Contemp. Math., vol. 334, pp. 293–320. Am. Math. Soc., Providence (2003)
Rojas, J.M., Wang, X.: Counting affine roots of polynomial systems via pointed Newton polytopes. J. Complex. 12(2), 116–133 (1996)
Schost, É.: Computing parametric geometric resolutions. Appl. Algebra Eng. Commun. Comput. 13(5), 349–393 (2003)
Schwartz, J.: Fast probabilistic algorithms for verification of polynomial identities. J. ACM 27, 701–717 (1980)
Steffens, R., Theobald, T.: Mixed volume techniques for embeddings of Laman graphs. J. Comput. Geom. 43(2), 84–93 (2010)
Sturmfels, B.: On the Newton polytope of the resultant. J. Algebr. Comb. 3(2), 207–236 (1994)
Tarski, A.: A Decision Method for Elementary Algebra and Geometry, 2nd edn. University of California Press, Berkeley/Los Angeles (1951)
Verschelde, J.: Polyhedral methods in numerical algebraic geometry. In: Interactions of Classical and Numerical Algebraic Geometry. Contemp. Math., vol. 496, pp. 243–263. Am. Math. Soc., Providence (2009)
Verschelde, J., Verlinden, P., Cools, R.: Homotopies exploiting Newton polytopes for solving sparse polynomial systems. SIAM J. Numer. Anal. 31(3), 915–930 (1994)
Verschelde, J., Gatermann, K., Cools, R.: Mixed-volume computation by dynamic lifting applied to polynomial system solving. Discrete Comput. Geom. 16(1), 69–112 (1996)
von zur Gathen, J., Gerhard, J.: Modern Computer Algebra. Cambridge University Press, Cambridge (1999)
Zippel, R.: Effective Polynomial Computation. Kluwer Int. Ser. Eng. Comput. Sci., vol. 241. Kluwer, Dordrecht (1993)
Acknowledgements
The authors thank the referees for their helpful comments and suggestions. This work was partially supported by the Argentinian research grants CONICET PIP 0099/11, UBACYT 20020090100069 (2010–2013) and UBACYT 20020120100133 (2013–2016).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Herrero, M.I., Jeronimo, G. & Sabia, J. Elimination for Generic Sparse Polynomial Systems. Discrete Comput Geom 51, 578–599 (2014). https://doi.org/10.1007/s00454-014-9571-z
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00454-014-9571-z