Ideals, Varieties, and Algorithms pp 175-232 | Cite as
The Algebra–Geometry Dictionary
Abstract
In this chapter, we will explore the correspondence between ideals and varieties. In §§1 and 2, we will prove the Nullstellensatz, a celebrated theorem which identifies exactly which ideals correspond to varieties. This will allow us to construct a “dictionary” between geometry and algebra, whereby any statement about varieties can be translated into a statement about ideals (and conversely).
In this chapter, we will explore the correspondence between ideals and varieties. In §§1 and 2, we will prove the Nullstellensatz, a celebrated theorem which identifies exactly which ideals correspond to varieties. This will allow us to construct a “dictionary” between geometry and algebra, whereby any statement about varieties can be translated into a statement about ideals (and conversely). We will pursue this theme in §§3 and 4, where we will define a number of natural algebraic operations on ideals and study their geometric analogues. In keeping with the computational emphasis of the book, we will develop algorithms to carry out the algebraic operations. In §§5 and 6, we will study the more important algebraic and geometric concepts arising out of the Hilbert Basis Theorem: notably the possibility of decomposing a variety into a union of simpler varieties and the corresponding algebraic notion of writing an ideal as an intersection of simpler ideals. In §7, we will prove the Closure Theorem from Chapter 3 using the tools developed in this chapter.
1 Hilbert’s Nullstellensatz
Does this problem of having different ideals represent the empty variety go away if the field k is algebraically closed? It does in the one-variable case when the ring is k[x]. To see this, recall from §5 of Chapter 1 that any ideal I in k[x] can be generated by a single polynomial because k[x] is a principal ideal domain. So we can write \(I =\langle \, f\rangle\) for some polynomial f ∈ k[x]. Then V(I) is the set of roots of f; i.e., the set of a ∈ k such that f(a) = 0. But since k is algebraically closed, every nonconstant polynomial in k[x] has a root. Hence, the only way that we could have \(\mathbf{V}(I) = \varnothing\) would be to have f be a nonzero constant. In this case, 1∕f ∈ k. Thus, \(1 = (1/f) \cdot f \in I\), which means that \(g = g \cdot 1 \in I\) for all g ∈ k[x]. This shows that I = k[x] is the only ideal of k[x] that represents the empty variety when k is algebraically closed.
A wonderful thing now happens: the same property holds when there is more than one variable. In any polynomial ring, algebraic closure is enough to guarantee that the only ideal which represents the empty variety is the entire polynomial ring itself. This is the Weak Nullstellensatz, which is the basis of (and is equivalent to) one of the most celebrated mathematical results of the late nineteenth century, Hilbert’s Nullstellensatz. Such is its impact that, even today, one customarily uses the original German name Nullstellensatz: a word formed, in typical German fashion, from three simpler words: Null ( = Zero), Stellen ( = Places), Satz ( = Theorem).
Theorem 1 (The Weak Nullstellensatz).
Let k be an algebraically closed field and let \(I \subseteq k[x_{1},\ldots,x_{n}]\) be an ideal satisfying \(\mathbf{V}(I) = \varnothing\) . Then \(I = k[x_{1},\ldots,x_{n}]\) .
Proof.
Claim.
If k is algebraically closed and \(I \subsetneq k[x_{1},\ldots,x_{n}]\) is a proper ideal, then there is a ∈ k such that \(I_{x_{n}=a} \subsetneq k[x_{1},\ldots,x_{n-1}]\).
Once we prove the claim, an easy induction gives elements \(a_{1},\ldots,a_{n} \in k\) such that \(I_{x_{n}=a_{n},\ldots,x_{1}=a_{1}} \subsetneq k\). But the only ideals of k are {0} and k (Exercise 3), so that \(I_{x_{n}=a_{n},\ldots,x_{1}=a_{1}} =\{ 0\}\). This implies \((a_{1},\ldots,a_{n}) \in \mathbf{V}(I)\). We conclude that \(\mathbf{V}(I)\neq \varnothing\), and the theorem will follow.
To prove the claim, there are two cases, depending on the size of \(I \cap k[x_{n}]\).
Case 1. \(I \cap k[x_{n}]\neq \{0\}\). Let \(f \in I \cap k[x_{n}]\) be nonzero, and note that f is nonconstant, since otherwise \(1 \in I \cap k[x_{n}] \subseteq I\), contradicting \(I\neq k[x_{1},\ldots,x_{n}]\).
We claim that the \(\bar{g}_{i}\) form a Gröbner basis of \(I_{x_{n}=a}\). Assuming the claim, it follows that \(1\notin I_{x_{n}=a}\) since no \(\mbox{ LT}(\bar{g}_{i})\) can divide 1. Thus \(I_{x_{n}=a}\neq k[x_{1},\ldots,x_{n-1}]\), which is what we want to show.
In the special case when \(k = \mathbb{C}\), the Weak Nullstellensatz may be thought of as the “Fundamental Theorem of Algebra for multivariable polynomials”—every system of polynomials that generates an ideal strictly smaller than \(\mathbb{C}[x_{1},\ldots,x_{n}]\) has a common zero in \(\mathbb{C}^{n}\).
To see this, let \(\{g_{1},\ldots,g_{t}\}\) be a Gröbner basis of \(I =\langle 1\rangle\). Thus, \(1 \in \langle \mbox{ LT}(I)\rangle =\langle \mbox{ LT}(g_{1}),\ldots,\mbox{ LT}(g_{t})\rangle\), and then Lemma 2 of Chapter 2, §4 implies that 1 is divisible by some LT(g i ), say LT(g 1). This forces LT(g 1) to be constant. Then every other LT(g i ) is a multiple of that constant, so that \(g_{2},\ldots,g_{t}\) can be removed from the Gröbner basis by Lemma 3 of Chapter 2, §7. Finally, since LT(g 1) is constant, g 1 itself is constant since every nonconstant monomial is > 1 (see Corollary 6 of Chapter 2, §4). We can multiply by an appropriate constant to make g 1 = 1. Our reduced Gröbner basis is thus {1}.
To summarize, we have the following consistency algorithm: if we have polynomials \(f_{1},\ldots,f_{s} \in \mathbb{C}[x_{1},\ldots,x_{n}]\), we compute a reduced Gröbner basis of the ideal they generate with respect to any ordering. If this basis is {1}, the polynomials have no common zero in \(\mathbb{C}^{n}\); if the basis is not {1}, they must have a common zero. Note that this algorithm works over any algebraically closed field.
If we are working over a field k which is not algebraically closed, then the consistency algorithm still works in one direction: if {1} is a reduced Gröbner basis of \(\langle \,f_{1},\ldots,f_{s}\rangle\), then the equations \(f_{1} = \cdots = f_{s} = 0\) have no common solution. The converse is not true, as shown by the examples preceding the statement of the Weak Nullstellensatz.
Inspired by the Weak Nullstellensatz, one might hope that the correspondence between ideals and varieties is one-to-one provided only that one restricts to algebraically closed fields. Unfortunately, our earlier example V(x) = V(x 2) = { 0} works over any field. Similarly, the ideals \(\langle x^{2},y\rangle\) and \(\langle x,y\rangle\) (and, for that matter, \(\langle x^{n},y^{m}\rangle\) where n and m are integers greater than one) are different but define the same variety: namely, the single point \(\{(0,0)\} \subseteq k^{2}\). These examples illustrate a basic reason why different ideals can define the same variety (equivalently, that the map V can fail to be one-to-one): namely, a power of a polynomial vanishes on the same set as the original polynomial. The Hilbert Nullstellensatz states that over an algebraically closed field, this is the only reason that different ideals can give the same variety: if a polynomial f vanishes at all points of some variety V(I), then some power of f must belong to I.
Theorem 2 (Hilbert’s Nullstellensatz).
Proof.
-
(a 1, …, a n ) is a common zero of f 1, …, f s , or
-
(a 1, …, a n ) is not a common zero of f 1, …, f s .
In the first case f(a 1, …, a n ) = 0 since f vanishes at any common zero of f 1, …, f s . Thus, the polynomial 1 − yf takes the value 1 − a n+1 f(a 1, …, a n ) = 1 ≠ 0 at the point (a 1, …, a n , a n+1). In particular, \((a_{1},\ldots,a_{n},a_{n+1})\notin \mathbf{V}(\tilde{I })\). In the second case, for some i, 1 ≤ i ≤ s, we must have f i (a 1, …, a n ) = 0. Thinking of f i as a function of n + 1 variables which does not depend on the last variable, we have f i (a 1, …, a n , a n+1) ≠ 0. In particular, we again conclude that \((a_{1},\ldots,a_{n},a_{n+1})\notin \mathbf{V}(\tilde{I })\). Since (a 1, …, a n , a n+1) ∈ k n+1 was arbitrary, we obtain \(\mathbf{V}(\tilde{I }) = \varnothing\), as claimed.
Exercises for §1
- 1.Recall that V(y − x 2, z − x 3) is the twisted cubic in \(\mathbb{R}^{3}\).
- a.
Show that V((y − x 2)2 + (z − x 3)2) is also the twisted cubic.
- b.
Show that any variety \(\mathbf{V}(I) \subseteq \mathbb{R}^{n}\), \(I \subseteq \mathbb{R}[x_{1},\ldots,x_{n}]\), can be defined by a single equation (and hence by a principal ideal).
- a.
- 2.
Let \(J =\langle x^{2} + y^{2} - 1,y - 1\rangle\). Find f ∈ I(V(J)) such that \(f\notin J\).
- 3.
Prove that {0} and k are the only ideals of a field k.
- 4.
Prove that an algebraically closed field k must be infinite. Hint: Given n elements a 1, …, a n of a field k, can you write down a nonconstant polynomial f ∈ k[x] with the property that f(a i ) = 1 for all i?
- 5.
In the proof of Theorem 1, prove that \(I_{x_{n}=a} =\langle \bar{ g}_{1},\ldots,\bar{g}_{t}\rangle\).
- 6.
In the proof of Theorem 1, let x δ be a monomial in \(x_{1},\ldots,x_{n-1}\) satisfying x δ > LT( f) for some \(f \in k[x_{1},\ldots,x_{n}]\). Prove that \(x^{\delta }> \mbox{ LT}(\,\bar{f})\), where \(\bar{f} = f(x_{1},\ldots,x_{n-1},a)\).
- 7.
In deducing Hilbert’s Nullstellensatz from the Weak Nullstellensatz, we made the substitution y = 1∕f(x 1, …, x n ) to deduce relations (3) and (4) from (2). Justify this rigorously. Hint: In what set is 1∕f contained?
- 8.The purpose of this exercise is to show that if k is any field that is not algebraically closed, then any variety \(V \subseteq k^{n}\) can be defined by a single equation.
- a.
If \(g = a_{0}x^{n} + a_{1}x^{n-1} + \cdots + a_{n-1}x + a_{n}\) is a polynomial of degree n in x, define the homogenization g h of g with respect to some variable y to be the polynomial \(g^{h} = a_{0}x^{n} + a_{1}x^{n-1}y + \cdots + a_{n-1}xy^{n-1} + a_{n}y^{n}\). Show that g has a root in k if and only if there is (a, b) ∈ k 2 such that (a, b) ≠ (0, 0) and g h (a, b) = 0. Hint: Show that g h (a, b) = b n g h (a∕b, 1) when b ≠ 0.
- b.
If k is not algebraically closed, show that there exists f ∈ k[x, y] such that the variety defined by f = 0 consists of just the origin (0, 0) ∈ k 2. Hint: Choose a polynomial in k[x] with no root in k and consider its homogenization.
- c.
If k is not algebraically closed, show that for each integer l > 0 there exists f ∈ k[x 1, …, x l ] such that the only solution of f = 0 is the origin (0, …, 0) ∈ k l . Hint: Use induction on l and part (b) above.
- d.
If W = V(g 1, …, g s ) is any variety in k n , where k is not algebraically closed, then show that W can be defined by a single equation. Hint: Consider the polynomial f(g 1, …, g s ) where f is as in part (c).
- a.
- 9.
Let k be an arbitrary field and let S be the subset of all polynomials in k[x 1, …, x n ] that have no zeros in k n . If I is any ideal in k[x 1, …, x n ] such that \(I \cap S = \varnothing\), show that \(\mathbf{V}(I)\neq \varnothing\). Hint: When k is not algebraically closed, use the previous exercise.
- 10.
In Exercise 1, we encountered two ideals in \(\mathbb{R}[x,y]\) that give the same nonempty variety. Show that one of these ideals is contained in the other. Can you find two ideals in \(\mathbb{R}[x,y]\), neither contained in the other, which give the same nonempty variety? Can you do the same for \(\mathbb{R}[x]\)?
2 Radical Ideals and the Ideal–Variety Correspondence
To further explore the relation between ideals and varieties, it is natural to recast Hilbert’s Nullstellensatz in terms of ideals. Can we characterize the kinds of ideals that appear as the ideal of a variety? In other words, can we identify those ideals that consist of all polynomials which vanish on some variety V? The key observation is contained in the following simple lemma.
Lemma 1.
Let V be a variety. If f m ∈ I (V ), then f ∈ I (V ).
Proof.
Let a ∈ V. If f m ∈ I(V), then ( f(a)) m = 0. But this can happen only if f(a) = 0. Since a ∈ V was arbitrary, we must have f ∈ I(V). □
Thus, an ideal consisting of all polynomials which vanish on a variety V has the property that if some power of a polynomial belongs to the ideal, then the polynomial itself must belong to the ideal. This leads to the following definition.
Definition 2.
An ideal I is radical if f m ∈ I for some integer m ≥ 1 implies that f ∈ I.
Rephrasing Lemma 1 in terms of radical ideals gives the following statement.
Corollary 3.
I (V ) is a radical ideal.
On the other hand, Hilbert’s Nullstellensatz tells us that the only way that an arbitrary ideal I can fail to be the ideal of all polynomials vanishing on V(I) is for I to contain powers f m of polynomials f which are not in I—in other words, for I to fail to be a radical ideal. This suggests that there is a one-to-one correspondence between affine varieties and radical ideals. To clarify this and get a sharp statement, it is useful to introduce the operation of taking the radical of an ideal.
Definition 4.
Lemma 5.
If I is an ideal in k[x 1 ,…,x n ], then \(\sqrt{I}\) is an ideal in k[x 1 ,…,x n ] containing I. Furthermore, \(\sqrt{I}\) is a radical ideal.
Proof.
We have already shown that \(I \subseteq \sqrt{I}\). To show \(\sqrt{I}\) is an ideal, suppose \(f,g \in \sqrt{I}\). Then there are positive integers m and l such that f m , g l ∈ I. In the binomial expansion of ( f + g) m+l−1 every term has a factor f i g j with i + j = m + l − 1. Since either i ≥ m or j ≥ l, either f i or g j is in I, whence f i g j ∈ I and every term in the binomial expansion is in I. Hence, ( f + g) m+l−1 ∈ I and, therefore, \(f + g \in \sqrt{I}\). Finally, suppose \(f \in \sqrt{I}\) and h ∈ k[x 1, …, x n ]. Then f m ∈ I for some integer m ≥ 1. Since I is an ideal, we have (h ⋅ f) m = h m f m ∈ I. Hence, \(hf \in \sqrt{I}\). This shows that \(\sqrt{I}\) is an ideal. In Exercise 4, you will show that \(\sqrt{I}\) is a radical ideal. □
We are now ready to state the ideal-theoretic form of the Nullstellensatz.
Theorem 6 (The Strong Nullstellensatz).
Proof.
We certainly have \(\sqrt{I} \subseteq \mathbf{I}(\mathbf{V}(I))\) because \(f \in \sqrt{I}\) implies that f m ∈ I for some m. Hence, f m vanishes on V(I), which implies that f vanishes on V(I). Thus, f ∈ I(V(I)).
Conversely, take f ∈ I(V(I)). Then, by definition, f vanishes on V(I). By Hilbert’s Nullstellensatz, there exists an integer m ≥ 1 such that f m ∈ I. But this means that \(f \in \sqrt{I}\). Since f was arbitrary, \(\mathbf{I}(\mathbf{V}(I)) \subseteq \sqrt{I}\), and we are done. □
It has become a custom, to which we shall adhere, to refer to Theorem 6 as the Nullstellensatz with no further qualification. The most important consequence of the Nullstellensatz is that it allows us to set up a “dictionary” between geometry and algebra. The basis of the dictionary is contained in the following theorem.
Theorem 7 (The Ideal–Variety Correspondence).
- (i)The mapsand$$\displaystyle{\mbox{ affine varieties}\ \mathop{\longrightarrow }^{\mathbf{I}}\ \mbox{ ideals}}$$are inclusion-reversing, i.e., if \(I_{1} \subseteq I_{2}\) are ideals, then \(\mathbf{V}(I_{1}) \supseteq \mathbf{V}(I_{2})\) and, similarly, if \(V _{1} \subseteq V _{2}\) are varieties, then \(\mathbf{I}(V _{1}) \supseteq \mathbf{I}(V _{2})\) .$$\displaystyle{\mbox{ ideals}\ \mathop{\longrightarrow }^{\mathbf{V}}\ \mbox{ affine varieties}}$$
- (ii)For any variety V,so that I is always one-to-one. On the other hand, any ideal I satisfies$$\displaystyle{\mathbf{V}(\mathbf{I}(V )) = V,}$$$$\displaystyle{\mathbf{V}(\sqrt{I}) = \mathbf{V}(I).}$$
- (iii)If k is algebraically closed, and if we restrict to radical ideals, then the mapsand$$\displaystyle{\mbox{ affine varieties}\ \mathop{\longrightarrow }^{\mathbf{I}}\ \mbox{ radical ideals}}$$are inclusion-reversing bijections which are inverses of each other.$$\displaystyle{\mbox{ radical ideals}\ \mathop{\longrightarrow }^{\mathbf{V}}\ \mbox{ affine varieties}}$$
Proof.
(i) The proof will be covered in the exercises.
(ii) Let V = V( f 1, …, f s ) be an affine variety in k n . Since every f ∈ I(V) vanishes on V, the inclusion \(V \subseteq \mathbf{V}(\mathbf{I}(V ))\) follows directly from the definition of V. Going the other way, note that f 1, …, f s ∈ I(V) by the definition of I, and, thus, \(\langle \,f_{1},\ldots,f_{s}\rangle \subseteq \mathbf{I}(V )\). Since V is inclusion-reversing, it follows that \(\mathbf{V}(\mathbf{I}(V )) \subseteq \mathbf{V}(\langle \,f_{1},\ldots,f_{s}\rangle ) = V\). This proves that V(I(V)) = V, and, consequently, I is one-to-one since it has a left inverse. The final assertion of part (ii) is left as an exercise.
(iii) Since I(V) is radical by Corollary 3, we can think of I as a function which takes varieties to radical ideals. Furthermore, we already know V(I(V)) = V for any variety V. It remains to prove I(V(I)) = I whenever I is a radical ideal. This is easy: the Nullstellensatz tells us \(\mathbf{I}(\mathbf{V}(I)) = \sqrt{I}\), and I being radical implies \(\sqrt{I} = I\) (see Exercise 4). This gives the desired equality. Hence, V and I are inverses of each other and, thus, define bijections between the set of radical ideals and affine varieties. The theorem is proved. □
As a consequence of this theorem, any question about varieties can be rephrased as an algebraic question about radical ideals (and conversely), provided that we are working over an algebraically closed field. This ability to pass between algebra and geometry will give us considerable power.
-
(Radical Generators) Is there an algorithm which produces a set {g 1, …, g m } of polynomials such that \(\sqrt{I} =\langle g_{1},\ldots,g_{m}\rangle\)?
-
(Radical Ideal) Is there an algorithm which will determine whether I is radical?
-
(Radical Membership) Given f ∈ k[x 1, …, x n ], is there an algorithm which will determine whether \(f \in \sqrt{I}\)?
The existence of these algorithms follows from the work of HERMANN (1926) [see also MINES, RICHMAN, and RUITENBERG (1988) and SEIDENBERG (1974, 1984) for more modern expositions]. More practical algorithms for finding radicals follow from the work of GIANNI, TRAGER and ZACHARIAS (1988), KRICK and LOGAR (1991), and EISENBUD, HUNEKE and VASCONCELOS (1992). These algorithms have been implemented in CoCoA, Singular, and Macaulay2, among others. See, for example, Section 4.5 of GREUEL and PFISTER (2008).
For now, we will settle for solving the more modest radical membership problem. To test whether \(f \in \sqrt{I}\), we could use the ideal membership algorithm to check whether f m ∈ I for all integers m > 0. This is not satisfactory because we might have to go to very large powers of m, and it will never tell us if \(f\notin \sqrt{I}\) (at least, not until we work out a priori bounds on m). Fortunately, we can adapt the proof of Hilbert’s Nullstellensatz to give an algorithm for determining whether \(f \in \sqrt{\langle \,f_{1 },\ldots, f_{s}\rangle }\).
Proposition 8 (Radical Membership).
Let k be an arbitrary field and let \(I =\langle \, f_{1},\ldots,f_{s}\rangle \subseteq k[x_{1},\ldots,x_{n}]\) be an ideal. Then \(f \in \sqrt{I}\) if and only if the constant polynomial 1 belongs to the ideal \(\tilde{I }=\langle \, f_{1},\ldots,f_{s},1 - yf\rangle \subseteq k[x_{1},\ldots,x_{n},y]\) , in which case \(\tilde{I }= k[x_{1},\ldots,x_{n},y]\) .
Proof.
Proposition 8, together with our earlier remarks on determining whether 1 belongs to an ideal (see the discussion of the consistency problem in §1), immediately leads to the following radical membership algorithm: to determine if \(f \in \sqrt{\langle \,f_{1 },\ldots, f_{s}\rangle } \subseteq k[x_{1},\ldots,x_{n}]\), we compute a reduced Gröbner basis of the ideal \(\langle \,f_{1},\ldots,f_{s},1 - yf\rangle \subseteq k[x_{1},\ldots,x_{n},y]\) with respect to some ordering. If the result is {1}, then \(f \in \sqrt{I}\). Otherwise, \(f\notin \sqrt{I}\).
If we have f expressed as a product of irreducible polynomials, then it is easy to write down the radical of the principal ideal generated by f.
Proposition 9.
Proof.
Conversely, suppose that \(g \in \sqrt{I}\). Then there exists a positive integer M such that \(g^{M} \in I =\langle \, f\rangle\), so that g M is a multiple of f and hence a multiple of each irreducible factor f i of f. Thus, f i is an irreducible factor of g M . However, the unique factorization of g M into distinct irreducible polynomials is the Mth power of the factorization of g. It follows that each f i is an irreducible factor of g. This implies that g is a polynomial multiple of \(f_{1}f_{2}\cdots f_{r}\) and, therefore, g is contained in the ideal \(\langle \,f_{1}f_{2}\cdots f_{r}\rangle\). The proposition is proved. □
In view of Proposition 9, we make the following definition:
Definition 10.
If f ∈ k[x 1, …, x n ] is a polynomial, we define the reduction of f, denoted f red, to be the polynomial such that \(\langle \,f_{\mathrm{red}}\rangle = \sqrt{\langle \,f\rangle }\). A polynomial is said to be reduced (or square-free) if f = f red.
Thus, f red is the polynomial f with repeated factors “stripped away.” So, for example, if f = (x + y 2)3(x − y), then f red = (x + y 2)(x − y). Note that f red is only unique up to a constant factor in k.
The usefulness of Proposition 9 is mitigated by the requirement that f be factored into irreducible factors. We might ask if there is an algorithm to compute f red from f without factoring f first. It turns out that such an algorithm exists.
To state the algorithm, we will need the notion of a greatest common divisor of two polynomials.
Definition 11.
- (i)
h divides f and g.
- (ii)
If p is any polynomial that divides both f and g, then p divides h.
It is easy to show that \(\mathrm{gcd}(\,f,g)\) exists and is unique up to multiplication by a nonzero constant in k (see Exercise 9). Unfortunately, the one-variable algorithm for finding the gcd (i.e., the Euclidean Algorithm) does not work in the case of several variables. To see this, consider the polynomials xy and xz in k[x, y, z]. Clearly, \(\mathrm{gcd}(xy,xz) = x\). However, no matter what term ordering we use, dividing xy by xz gives 0 plus remainder xy and dividing xz by xy gives 0 plus remainder xz. As a result, neither polynomial “reduces” with respect to the other and there is no next step to which to apply the analogue of the Euclidean Algorithm.
Nevertheless, there is an algorithm for calculating the gcd of two polynomials in several variables. We defer a discussion of it until the next section after we have studied intersections of ideals. For the purposes of our discussion here, let us assume that we have such an algorithm. We also remark that given polynomials f 1, …, f s ∈ k[x 1, …, x n ], one can define \(\mathrm{gcd}(\,f_{1},f_{2},\ldots,f_{s})\) exactly as in the one-variable case. There is also an algorithm for computing \(\mathrm{gcd}(\,f_{1},f_{2},\ldots,f_{s})\).
Using this notion of gcd, we can now give a formula for computing the radical of a principal ideal.
Proposition 12.
Proof.
It is worth remarking that for fields which do not contain \(\mathbb{Q}\), the above formula for f red may fail (see Exercise 13).
Exercises for §2
- 1.
Given a field k (not necessarily algebraically closed), show that \(\sqrt{\langle x^{2 },y^{2}\rangle } =\langle x,y\rangle\) and, more generally, show that \(\sqrt{\langle x^{n },y^{m}\rangle } =\langle x,y\rangle\) for any positive integers n and m.
- 2.
Let f and g be distinct nonconstant polynomials in k[x, y] and let \(I =\langle \, f^{2},g^{3}\rangle\). Is it necessarily true that \(\sqrt{I} =\langle \, f,g\rangle\)? Explain.
- 3.
Show that \(\langle x^{2} + 1\rangle \subseteq \mathbb{R}[x]\) is a radical ideal, but that V(x 2 + 1) is the empty variety.
- 4.
Let I be an ideal in k[x 1, …, x n ], where k is an arbitrary field.
- a.
Show that \(\sqrt{I}\) is a radical ideal.
- b.
Show that I is radical if and only if \(I = \sqrt{I}\).
- c.
Show that \(\sqrt{\sqrt{I}} = \sqrt{I}\).
- a.
- 5.
Prove that I and V are inclusion-reversing and that \(\mathbf{V}(\sqrt{I}) = \mathbf{V}(I)\) for any ideal I.
- 6.
Let I be an ideal in k[x 1, …, x n ].
- a.
In the special case when \(\sqrt{I} =\langle \, f_{1},f_{2}\rangle\), with \(f_{i}^{m_{i}} \in I\), prove that \(f^{m_{1}+m_{2}-1} \in I\) for all \(f \in \sqrt{I}\).
- b.
Now prove that for any I, there exists a single integer m such that f m ∈ I for all \(f \in \sqrt{I}\). Hint: Write \(\sqrt{I} =\langle \, f_{1},\ldots,f_{s}\rangle\).
- a.
- 7.
Determine whether the following polynomials lie in the following radicals. If the answer is yes, what is the smallest power of the polynomial that lies in the ideal?
- a.
Is \(x + y \in \sqrt{\langle x^{3 },y^{3 },xy (x + y)\rangle }\)?
- b.
Is \(x^{2} + 3xz \in \sqrt{\langle x + z,x^{2 } y,x - z^{2}\rangle }\)?
- a.
- 8.
Let f 1 = y 2 + 2xy − 1 and f 2 = x 2 + 1. Prove that \(\langle \,f_{1},f_{2}\rangle\) is not a radical ideal. Hint: What is f 1 + f 2?
- 9.
Given f, g ∈ k[x 1, …, x n ], use unique factorization to prove that \(\mathrm{gcd}(\,f,g)\) exists. Also prove that \(\mathrm{gcd}(\,f,g)\) is unique up to multiplication by a nonzero constant of k.
- 10.
Prove the following ideal-theoretic characterization of \(\mathrm{gcd}(\,f,g)\): given polynomials f, g, h in \(k[x_{1},\ldots,x_{n}]\), then \(h = \mathrm{gcd}(\,f,g)\) if and only if h is a generator of the smallest principal ideal containing \(\langle \,f,g\rangle\) (i.e., if \(\langle h\rangle \subseteq J\), whenever J is a principal ideal such that \(J \supseteq \langle \, f,g\rangle )\).
- 11.
- 12.
Let \(f = x^{5} + 3x^{4}y + 3x^{3}y^{2} - 2x^{4}y^{2} + x^{2}y^{3} - 6x^{3}y^{3} - 6x^{2}y^{4} + x^{3}y^{4} - 2xy^{5} + 3x^{2}y^{5} + 3xy^{6} + y^{7} \in \mathbb{Q}[x,y]\). Compute \(\sqrt{\langle \,f\rangle }\).
- 13.
A field k has characteristic zero if it contains the rational numbers \(\mathbb{Q}\); otherwise, k has positive characteristic.
- a.
Let k be the field \(\mathbb{F}_{2}\) from Exercise 1 of Chapter 1, §1. If \(f = x_{1}^{2} + \cdots + x_{n}^{2} \in \mathbb{F}_{2}[x_{1},\ldots,x_{n}]\), then show that \(\frac{\partial f} {\partial x_{i}} = 0\) for all i. Conclude that the formula given in Proposition 12 may fail when the field is \(\mathbb{F}_{2}\).
- b.
Let k be a field of characteristic zero and let f ∈ k[x 1, …, x n ] be nonconstant. If the variable x j appears in f, then prove that \(\frac{\partial f} {\partial x_{j}}\neq 0\). Also explain why \(\frac{\partial f} {\partial x_{j}}\) has smaller total degree than f.
- a.
- 14.
Let \(J =\langle xy, (x - y)x\rangle\). Describe V(J) and show that \(\sqrt{J} =\langle x\rangle\).
- 15.
Prove that \(I =\langle xy,xz,yz\rangle\) is a radical ideal. Hint: If you divide f ∈ k[x, y, z] by xy, xz, yz, what does the remainder look like? What does f m look like?
- 16.Let \(I \subseteq k[x_{1},\ldots,x_{n}]\) be an ideal. Assume that I has a Gröbner basis \(G =\{ g_{1},\ldots,g_{t}\}\) such that for all i, LT(g i ) is square-free in the sense of Definition 10.
- a.
If \(f \in \sqrt{I}\), prove that LT( f) is divisible by LT(g i ) for some i. Hint: f m ∈ I.
- b.
Prove that I is radical. Hint: Use part (a) to show that G is a Gröbner basis of \(\sqrt{I}\).
- a.
- 17.This exercise continues the line of thought begun in Exercise 16.
- a.
Prove that a monomial ideal in \(k[x_{1},\ldots,x_{n}]\) is radical if and only if its minimal generators are square-free.
- b.
Given an ideal \(I \subseteq k[x_{1},\ldots,x_{n}]\), prove that if \(\langle \mbox{ LT}(I)\rangle\) is radical, then I is radical.
- c.
Give an example to show that the converse of part (b) can fail.
- a.
3 Sums, Products, and Intersections of Ideals
Ideals are algebraic objects and, as a result, there are natural algebraic operations we can define on them. In this section, we consider three such operations: sum, intersection, and product. These are binary operations: to each pair of ideals, they associate a new ideal. We shall be particularly interested in two general questions which arise in connection with each of these operations. The first asks how, given generators of a pair of ideals, one can compute generators of the new ideals which result on applying these operations. The second asks for the geometric significance of these algebraic operations. Thus, the first question fits the general computational theme of this book; the second, the general thrust of this chapter. We consider each of the operations in turn.
Sums of Ideals
Proposition 2.
If I and J are ideals in k[x 1 ,…,x n ], then I + J is also an ideal in k[x 1 ,…,x n ]. In fact, I + J is the smallest ideal containing I and J. Furthermore, if \(I =\langle \, f_{1},\ldots,f_{r}\rangle\) and \(J =\langle g_{1},\ldots,g_{s}\rangle\) , then \(I + J =\langle \, f_{1},\ldots,f_{r},g_{1},\ldots,g_{s}\rangle\) .
Proof.
Note first that 0 = 0 + 0 ∈ I + J. Suppose h 1, h 2 ∈ I + J. By the definition of I + J, there exist f 1, f 2 ∈ I and g 1, g 2 ∈ J such that h 1 = f 1 + g 1, h 2 = f 2 + g 2. Then, after rearranging terms slightly, h 1 + h 2 = ( f 1 + f 2) + (g 1 + g 2). But f 1 + f 2 ∈ I because I is an ideal and, similarly, g 1 + g 2 ∈ J, whence h 1 + h 2 ∈ I + J. To check closure under multiplication, let h ∈ I + J and p ∈ k[x 1, …, x n ] be any polynomial. Then, as above, there exist f ∈ I and g ∈ J such that h = f + g. But then p ⋅ h = p ⋅ ( f + g) = p ⋅ f + p ⋅ g. Now p ⋅ f ∈ I and p ⋅ g ∈ J because I and J are ideals. Consequently, p ⋅ h ∈ I + J. This shows that I + J is an ideal.
If H is an ideal which contains I and J, then H must contain all elements f ∈ I and g ∈ J. Since H is an ideal, H must contain all f + g, where f ∈ I, g ∈ J. In particular, \(H \supseteq I + J\). Therefore, every ideal containing I and J contains I + J and, thus, I + J must be the smallest such ideal.
Finally, if \(I =\langle \, f_{1},\ldots,f_{r}\rangle\) and \(J =\langle g_{1},\ldots,g_{s}\rangle\), then \(\langle \,f_{1},\ldots,f_{r},g_{1},\ldots,g_{s}\rangle\) is an ideal containing I and J, so that \(I + J \subseteq \langle \, f_{1},\ldots,f_{r},g_{1},\ldots,g_{s}\rangle\). The reverse inclusion is obvious, so that \(I + J =\langle \, f_{1},\ldots,f_{r},g_{1},\ldots,g_{s}\rangle\). □
The following corollary is an immediate consequence of Proposition 2.
Corollary 3.
To see what happens geometrically, let \(I =\langle x^{2} + y\rangle\) and \(J =\langle z\rangle\) be ideals in \(\mathbb{R}[x,y,z]\). We have sketched V(I) and V(J) on the next page. Then \(I + J =\langle x^{2} + y,z\rangle\) contains both x 2 + y and z. Thus, the variety V(I + J) must consist of those points where both x 2 + y and z vanish, i.e., it must be the intersection of V(I) and V(J).
The same line of reasoning generalizes to show that addition of ideals corresponds geometrically to taking intersections of varieties.
Theorem 4.
If I and J are ideals in k[x 1 ,…,x n ], then \(\mathbf{V}(I + J) = \mathbf{V}(I) \cap \mathbf{V}(J)\) .
Proof.
If a ∈ V(I + J), then a ∈ V(I) because \(I \subseteq I + J\); similarly, a ∈ V(J). Thus, \(a \in \mathbf{V}(I) \cap \mathbf{V}(J)\) and we conclude that \(\mathbf{V}(I + J) \subseteq \mathbf{V}(I) \cap \mathbf{V}(J)\).
To get the opposite inclusion, suppose \(a \in \mathbf{V}(I) \cap \mathbf{V}(J)\). Let h be any polynomial in I + J. Then there exist f ∈ I and g ∈ J such that h = f + g. We have f(a) = 0 because a ∈ V(I) and g(a) = 0 because a ∈ V(J). Thus, h(a) = f(a) + g(a) = 0 + 0 = 0. Since h was arbitrary, we conclude that a ∈ V(I + J). Hence, \(\mathbf{V}(I + J) \supseteq \mathbf{V}(I) \cap \mathbf{V}(J)\). □
An analogue of Theorem 4 stated in terms of generators was given in Lemma 2 of Chapter 1, §2.
Products of Ideals
Definition 5.
If I and J are two ideals in k[x 1, …, x n ], then their product, denoted I ⋅ J, is defined to be the ideal generated by all polynomials f ⋅ g where f ∈ I and g ∈ J.
Proposition 6.
Proof.
The following proposition guarantees that the product of ideals does indeed correspond geometrically to the operation of taking the union of varieties.
Theorem 7.
If I and J are ideals in k[x 1 ,…,x n ], then \(\mathbf{V}(I \cdot J) = \mathbf{V}(I) \cup \mathbf{V}(J)\) .
Proof.
Let a ∈ V(I ⋅ J). Then g(a)h(a) = 0 for all g ∈ I and all h ∈ J. If g(a) = 0 for all g ∈ I, then a ∈ V(I). If g(a) ≠ 0 for some g ∈ I, then we must have h(a) = 0 for all h ∈ J. In either event, \(a \in \mathbf{V}(I) \cup \mathbf{V}(J)\).
Conversely, suppose \(a \in \mathbf{V}(I) \cup \mathbf{V}(J)\). Either g(a) = 0 for all g ∈ I or h(a) = 0 for all h ∈ J. Thus, g(a)h(a) = 0 for all g ∈ I and h ∈ J. Thus, f(a) = 0 for all \(f \in I \cdot J\) and, hence, a ∈ V(I ⋅ J). □
In what follows, we will often write the product of ideals as IJ rather than I ⋅ J.
Intersections of Ideals
The operation of forming the intersection of two ideals is, in some ways, even more primitive than the operations of addition and multiplication.
Definition 8.
The intersection \(I \cap J\) of two ideals I and J in k[x 1, …, x n ] is the set of polynomials which belong to both I and J.
As in the case of sums, the set of ideals is closed under intersections.
Proposition 9.
If I and J are ideals in k[x 1 ,…,x n ], then \(I \cap J\) is also an ideal.
Proof.
Note that \(0 \in I \cap J\) since 0 ∈ I and 0 ∈ J. If \(f,g \in I \cap J\), then f + g ∈ I because f, g ∈ I. Similarly, f + g ∈ J and, hence, \(f + g \in I \cap J\). Finally, to check closure under multiplication, let \(f \in I \cap J\) and h be any polynomial in k[x 1, …, x n ]. Since f ∈ I and I is an ideal, we have h ⋅ f ∈ I. Similarly, h ⋅ f ∈ J and, hence, \(h \cdot f \in I \cap J\). □
Note that we always have \(IJ \subseteq I \cap J\) since elements of IJ are sums of polynomials of the form fg with f ∈ I and g ∈ J. But the latter belongs to both I (since f ∈ I) and J (since g ∈ J). However, IJ can be strictly contained in \(I \cap J\). For example, if \(I = J =\langle x,y\rangle\), then \(IJ =\langle x^{2},xy,y^{2}\rangle\) is strictly contained in \(I \cap J = I =\langle x,y\rangle\) (\(x \in I \cap J\), but \(x\notin IJ)\).
This computation is easy precisely because we were given factorizations of f and g into irreducible polynomials. In general, such factorizations may not be available. So any algorithm which allows one to compute intersections will have to be powerful enough to circumvent this difficulty.
Nevertheless, there is a nice trick that reduces the computation of intersections to computing the intersection of an ideal with a subring (i.e., eliminating variables), a problem which we have already solved. To state the theorem, we need a little notation: if I is an ideal in k[x 1, …, x n ] and f(t) ∈ k[t] a polynomial in the single variable t, then f(t)I denotes the ideal in k[x 1, …, x n , t] generated by the set of polynomials \(\{f(t) \cdot h\mid h \in I\}\). This is a little different from our usual notion of product in that the ideal I and the ideal generated by f(t) in k[t] lie in different rings: in fact, the ideal \(I \subseteq k[x_{1},\,\ldots,\,x_{n}]\) is not an ideal in k[x 1, …, x n , t] because it is not closed under multiplication by t. When we want to stress that a polynomial h ∈ k[x 1, …, x n ] involves only the variables x 1, …, x n , we write h = h(x). Along the same lines, if we are considering a polynomial g in k[x 1, …, x n , t] and we want to emphasize that it can involve the variables x 1, …, x n as well as t, we will write g = g(x, t). In terms of this notation, \(f(t)I =\langle \, f(t)h(x)\mid h(x) \in I\rangle\). So, for example, if f(t) = t 2 − t and \(I =\langle x,y\rangle\), then the ideal f(t)I in k[x, y, t] contains (t 2 − t)x and (t 2 − t)y. In fact, it is not difficult to see that f(t)I is generated as an ideal by (t 2 − t)x and (t 2 − t)y. This is a special case of the following assertion.
Lemma 10.
- (i)
If I is generated as an ideal in k[x 1 ,…,x n ] by p 1 (x),…,p r (x), then f(t)I is generated as an ideal in k[x 1 ,…,x n ,t] by f(t) ⋅ p 1 (x),…,f(t) ⋅ p r (x).
- (ii)
If g(x,t) ∈ f(t)I and a is any element of the field k, then g(x,a) ∈ I.
Proof.
Theorem 11.
Proof.
Note that tI + (1 − t)J is an ideal in k[x 1, …, x n , t]. To establish the desired equality, we use the usual strategy of proving containment in both directions.
Suppose \(f \in I \cap J\). Since f ∈ I, we have t ⋅ f ∈ tI. Similarly, f ∈ J implies (1 − t) ⋅ f ∈ (1 − t)J. Thus, f = t ⋅ f + (1 − t) ⋅ f ∈ tI + (1 − t)J. Since \(I,J \subseteq k[x_{1},\ldots,x_{n}]\), we have f ∈ (tI + (1 − t)J) ∩ k[x 1, …, x n ]. This shows that \(I \cap J \subseteq (tI + (1 - t)J) \cap k[x_{1},\ldots,x_{n}]\).
To establish the opposite containment, take \(f \in (tI + (1 - t)J) \cap k[x_{1},\ldots,x_{n}]\). Then f(x) = g(x, t) + h(x, t), where g(x, t) ∈ tI and h(x, t) ∈ (1 − t)J. First set t = 0. Since every element of tI is a multiple of t, we have g(x, 0) = 0. Thus, f(x) = h(x, 0) and hence, f(x) ∈ J by Lemma 10. On the other hand, set t = 1 in the relation f(x) = g(x, t) + h(x, t). Since every element of (1 − t)J is a multiple of 1 − t, we have h(x, 1) = 0. Thus, f(x) = g(x, 1) and, hence, f(x) ∈ I by Lemma 10. Since f belongs to both I and J, we have \(f \in I \cap J\). Thus, \(I \cap J \supseteq (tI + (1 - t)J) \cap k[x_{1},\ldots,x_{n}]\) and this completes the proof. □
We can generalize both of the examples above by introducing the following definition.
Definition 12.
- (i)
f divides h and g divides h.
- (ii)
If f and g both divide a polynomial p, then h divides p.
Proposition 13.
- (i)
The intersection \(I \cap J\) of two principal ideals \(I,J \subseteq k[x_{1},\ldots,x_{n}]\) is a principal ideal.
- (ii)If \(I =\langle \, f\rangle\) , \(J =\langle g\rangle\) and \(I \cap J =\langle h\rangle\) , then$$\displaystyle{h =\mathrm{ lcm}(\,f,g).}$$
Proof.
The proof will be left as an exercise. □
This result, together with our algorithm for computing the intersection of two ideals immediately gives an algorithm for computing the least common multiple of two polynomials: to compute the least common multiple of two polynomials f, g ∈ k[x 1, …, x n ], we compute the intersection \(\langle \,f\rangle \cap \langle g\rangle\) using our algorithm for computing the intersection of ideals. Proposition 13 assures us that this intersection is a principal ideal (in the exercises, we ask you to prove that the intersection of principal ideals is principal) and that any generator of it is a least common multiple of f and g.
This algorithm for computing least common multiples allows us to clear up a point which we left unfinished in §2: namely, the computation of the greatest common divisor of two polynomials f and g. The crucial observation is the following.
Proposition 14.
Proof.
This follows by expressing f and g as products of distinct irreducibles and then using the remarks preceding Proposition 13, especially equation (1). You will provide the details in Exercise 5. □
We should point out that the gcd algorithm just described is rather cumbersome. In practice, more efficient algorithms are used [see DAVENPORT, SIRET and TOURNIER (1993)].
Having dealt with the computation of intersections, we now ask what operation on varieties corresponds to the operation of intersection on ideals. The following result answers this question.
Theorem 15.
If I and J are ideals in k[x 1 ,…,x n ], then \(\mathbf{V}(I \cap J) = \mathbf{V}(I) \cup \mathbf{V}(J)\) .
Proof.
Let \(a \in \mathbf{V}(I) \cup \mathbf{V}(J)\). Then a ∈ V(I) or a ∈ V(J). This means that either f(a) = 0 for all f ∈ I or f(a) = 0 for all f ∈ J. Thus, certainly, f(a) = 0 for all \(f \in I \cap J\). Hence, \(a \in \mathbf{V}(I \cap J)\). Hence, \(\mathbf{V}(I) \cup \mathbf{V}(J) \subseteq \mathbf{V}(I \cap J)\).
On the other hand, note that since \(IJ \subseteq I \cap J\), we have \(\mathbf{V}(I \cap J) \subseteq \mathbf{V}(IJ)\). But \(\mathbf{V}(IJ) = \mathbf{V}(I) \cup \mathbf{V}(J)\) by Theorem 7, and we immediately obtain the reverse inequality. □
Thus, the intersection of two ideals corresponds to the same variety as the product. In view of this and the fact that the intersection is much more difficult to compute than the product, one might legitimately question the wisdom of bothering with the intersection at all. The reason is that intersection behaves much better with respect to the operation of taking radicals: the product of radical ideals need not be a radical ideal (consider IJ where I = J), but the intersection of radical ideals is always a radical ideal. The latter fact is a consequence of the next proposition.
Proposition 16.
Proof.
If \(f \in \sqrt{I \cap J}\), then \(f^{m} \in I \cap J\) for some integer m > 0. Since f m ∈ I, we have \(f \in \sqrt{I}\). Similarly, \(f \in \sqrt{J}\). Thus, \(\sqrt{I \cap J} \subseteq \sqrt{I} \cap \sqrt{J}\).
For the reverse inclusion, take \(f \in \sqrt{I} \cap \sqrt{J}\). Then, there exist integers m, p > 0 such that f m ∈ I and f p ∈ J. Thus \(f^{m+p} = f^{m}f^{p} \in I \cap J\), so \(f \in \sqrt{I \cap J}\). □
Exercises for §3
- 1.Show that in \(\mathbb{Q}[x,y]\), we have$$\displaystyle{\langle (x + y)^{4}(x^{2} + y)^{2}(x - 5y)\rangle \cap \langle (x + y)(x^{2} + y)^{3}(x\ + 3y)\rangle =\langle (x + y)^{4}(x^{2} + y)^{3}(x\ - 5y)(x + 3y)\rangle.}$$
- 2.
Prove formula (1) for the least common multiple of two polynomials f and g.
- 3.
Prove assertion (i) of Proposition 13. In other words, show that the intersection of two principal ideals is principal.
- 4.
Prove assertion (ii) of Proposition 13. In other words, show that the least common multiple of two polynomials f and g in k[x 1, …, x n ] is the generator of the ideal \(\langle \,f\rangle \cap \langle g\rangle\).
- 5.
Prove Proposition 14. In other words, show that the least common multiple of two polynomials times the greatest common divisor of the same two polynomials is the product of the polynomials. Hint: Use the remarks following the statement of Proposition 14.
- 6.
Let I 1, …, I r and J be ideals in k[x 1, …, x n ]. Show the following:
- a.
(I 1 + I 2)J = I 1 J + I 2 J.
- b.
\((I_{1}\cdots I_{r})^{m} = I_{1}^{m}\cdots I_{r}^{m}\).
- a.
- 7.
Let I and J be ideals in k[x 1, …, x n ], where k is an arbitrary field. Prove the following:
- a.
If \(I^{\ell} \subseteq J\) for some integer ℓ > 0, then \(\sqrt{I} \subseteq \sqrt{J}\).
- b.
\(\sqrt{I + J} = \sqrt{\sqrt{I} + \sqrt{J}}\).
- a.
- 8.Letand$$\displaystyle{f = x^{4} + x^{3}y + x^{3}z^{2} - x^{2}y^{2} + x^{2}yz^{2} - xy^{3} - xy^{2}z^{2} - y^{3}z^{2}}$$$$\displaystyle{g = x^{4} + 2x^{3}z^{2} - x^{2}y^{2} + x^{2}z^{4} - 2xy^{2}z^{2} - y^{2}z^{4}.}$$
- a.
Use a computer algebra program to compute generators for \(\langle \,f\rangle \cap \langle g\rangle\) and \(\sqrt{\langle \,f\rangle \langle g\rangle }\).
- b.
Use a computer algebra program to compute \(\mathrm{gcd}(\,f,g)\).
- c.
Let p = x 2 + xy + xz + yz and q = x 2 − xy − xz + yz. Use a computer algebra program to calculate \(\langle \,f,g\rangle \cap \langle p,q\rangle\).
- a.
- 9.
For an arbitrary field, show that \(\sqrt{IJ} = \sqrt{I \cap J}\). Give an example to show that the product of radical ideals need not be radical. Also give an example to show that \(\sqrt{IJ}\) can differ from \(\sqrt{I}\sqrt{J}\).
- 10.
If I is an ideal in k[x 1, …, x n ] and \(\langle \,f(t)\rangle\) is an ideal in k[t], show that the ideal f(t)I defined in the text is the product of the ideal \(\tilde{I }\) generated by all elements of I in k[x 1, …, x n , t] and the ideal \(\langle \,f(t)\rangle\) generated by f(t) in k[x 1, …, x n , t].
- 11.
Two ideals I and J of k[x 1, …, x n ] are said to be comaximal if and only if I + J = k[x 1, …, x n ].
- a.
Show that if \(k = \mathbb{C}\), then I and J are comaximal if and only if \(\mathbf{V}(I) \cap \mathbf{V}(J) = \varnothing\). Give an example to show that this is false in general.
- b.
Show that if I and J are comaximal, then \(IJ = I \cap J\).
- c.
Is the converse to part (b) true? That is, if \(IJ = I \cap J\), does it necessarily follow that I and J are comaximal? Proof or counterexample?
- d.
If I and J are comaximal, show that I and J 2 are comaximal. In fact, show that I r and J s are comaximal for all positive integers r and s.
- e.Let I 1, …, I r be ideals in k[x 1, …, x n ] and suppose that I i and \(J_{i} =\bigcap _{j\neq i}I_{j}\) are comaximal for all i. Show thatfor all positive integers m.$$\displaystyle{I_{1}^{m} \cap \cdots \cap I_{r}^{m} = (I_{ 1}\cdots I_{r})^{m} = (I_{ 1} \cap \cdots \cap I_{r})^{m}}$$
- a.
- 12.
Let I, J be ideals in k[x 1, …, x n ] and suppose that \(I \subseteq \sqrt{J}\). Show that \(I^{m} \subseteq J\) for some integer m > 0. Hint: You will need to use the Hilbert Basis Theorem.
- 13.Let A be an m × n constant matrix and suppose that x = Ay, where we are thinking of x ∈ k m and y ∈ k n as column vectors of variables. Define a mapby sending f ∈ k[x 1, …, x m ] to α A ( f) ∈ k[y 1, …, y n ], where α A ( f) is the polynomial defined by α A ( f)(y) = f(Ay).$$\displaystyle{\alpha _{A}: k[x_{1},\ldots,x_{m}]\longrightarrow k[y_{1},\ldots,y_{n}]}$$
- a.
Show that α A is k-linear, i.e., show that α A (rf + sg) = r α A ( f) + s α A (g) for all r, s ∈ k and all f, g ∈ k[x 1, …, x n ].
- b.
Show that \(\alpha _{A}(\,f \cdot g) =\alpha _{A}(\,f) \cdot \alpha _{A}(g)\) for all f, g ∈ k[x 1, …, x n ]. (As we will see in Definition 8 of Chapter 5, §2, a map between rings which preserves addition and multiplication and also preserves the multiplicative identity is called a ring homomorphism. Since it is clear that α A (1) = 1, this shows that α A is a ring homomorphism.)
- c.
Show that the set \(\{f \in k[x_{1},\ldots,x_{m}]\mid \alpha _{A}(\,f) = 0\}\) is an ideal in k[x 1, …, x m ]. [This set is called the kernel of α A and denoted ker(α A ).]
- d.
If I is an ideal in k[x 1, …, x n ], show that the set \(\alpha _{A}(I) =\{\alpha _{A}(\,f)\mid f \in I\}\) need not be an ideal in k[y 1, …, y n ]. [We will often write \(\langle \alpha _{A}(I)\rangle\) to denote the ideal in k[y 1, …, y n ] generated by the elements of α A (I)—it is called the extension of I to k[y 1, …, y n ].]
- e.
If I ′ is an ideal in k[y 1, …, y n ], set \(\alpha _{A}^{-1}(I^{{\prime}}) =\{ f \in k[x_{1},\ldots,x_{m}]\mid \alpha _{A}(\,f) \in I^{{\prime}}\}\). Show that \(\alpha _{A}^{-1}(I^{{\prime}})\) is an ideal in k[x 1, …, x m ] (often called the contraction of I ′ ).
- a.
- 14.
Let A and α A be as above and let K = ker(α A ). Let I and J be ideals in k[x 1, …, x m ]. Show that:
- a.
\(I \subseteq J\) implies \(\langle \alpha _{A}(I)\rangle \subseteq \langle \alpha _{A}(J)\rangle\).
- b.
\(\langle \alpha _{A}(I + J)\rangle =\langle \alpha _{A}(I)\rangle +\langle \alpha _{A}(J)\rangle\).
- c.
\(\langle \alpha _{A}\ (IJ)\rangle =\langle \alpha _{A}(I)\rangle \langle \alpha _{A}(J)\rangle\).
- d.
\(\langle \alpha _{A}(I \cap J)\rangle \subseteq \langle \alpha _{A}(I)\rangle \cap \langle \alpha _{A}(J)\rangle\), with equality if \(I \supseteq K\) or \(J \supseteq K\) and α A is onto.
- e.
\(\langle \alpha _{A}(\sqrt{I})\rangle \subseteq \sqrt{\langle \alpha _{A } (I)\rangle }\) with equality if \(I \supseteq K\) and α A is onto.
- a.
- 15.
Let A, α A , and K = ker(α A ) be as above. Let I ′ and J ′ be ideals in k[y 1, …, y n ]. Show that:
- a.
\(I^{{\prime}}\subseteq J^{{\prime}}\) implies \(\alpha _{A}^{-1}(I^{{\prime}}) \subseteq \alpha _{A}^{-1}(J^{{\prime}})\).
- b.
\(\alpha _{A}^{-1}(I^{{\prime}} + J^{{\prime}}) \supseteq \alpha _{A}^{-1}(I^{{\prime}}) +\alpha _{ A}^{-1}(J^{{\prime}})\), with equality if α A is onto.
- c.
\(\alpha _{A}^{-1}(I^{{\prime}}J^{{\prime}}) \supseteq (\alpha _{A}^{-1}(I^{{\prime}}))(\alpha _{A}^{-1}(J^{{\prime}}))\), with equality if α A is onto and the right-hand side contains K.
- d.
\(\alpha _{A}^{-1}(I^{{\prime}}\cap J^{{\prime}}) =\alpha _{ A}^{-1}(I^{{\prime}}) \cap \alpha _{A}^{-1}(J^{{\prime}})\).
- e.
\(\alpha _{A}^{-1}(\sqrt{I^{{\prime}}}) = \sqrt{\alpha _{A }^{-1 }(I^{{\prime} } )}\).
- a.
4 Zariski Closures, Ideal Quotients, and Saturations
We have already encountered a number of examples of sets which are not varieties. Such sets arose very naturally in Chapter 3, where we saw that the projection of a variety need not be a variety, and in the exercises in Chapter 1, where we saw that the (set-theoretic) difference of varieties can fail to be a variety.
Proposition 1.
If \(S \subseteq k^{n}\) , the affine variety V ( I (S)) is the smallest variety that contains S [in the sense that if \(W \subseteq k^{n}\) is any affine variety containing S, then \(\mathbf{V}(\mathbf{I}(S)) \subseteq W\) ].
Proof.
If \(W \supseteq S\), then \(\mathbf{I}(W) \subseteq \mathbf{I}(S)\) because I is inclusion-reversing. But then \(\mathbf{V}(\mathbf{I}(W)) \supseteq \mathbf{V}(\mathbf{I}(S))\) because V also reverses inclusions. Since W is an affine variety, V(I(W)) = W by Theorem 7 from §2, and the result follows. □
This proposition leads to the following definition.
Definition 2.
The Zariski closure of a subset of affine space is the smallest affine algebraic variety containing the set. If \(S \subseteq k^{n}\), the Zariski closure of S is denoted \(\overline{S}\) and is equal to V(I(S)).
We note the following properties of Zariski closure.
Lemma 3.
- (i)
\(\mathbf{I}(\overline{S}) = \mathbf{I}(S)\) .
- (ii)
If \(S \subseteq T\) , then \(\overline{S} \subseteq \overline{T}\) .
- (iii)
\(\overline{S \cup T} = \overline{S} \cup \overline{T}\) .
Proof.
For (i), the inclusion \(\mathbf{I}(\overline{S}) \subseteq \mathbf{I}(S)\) follows from \(S \subseteq \overline{S}\). Going the other way, f ∈ I(S) implies \(S \subseteq \mathbf{V}(\,f)\). Then \(S \subseteq \overline{S} \subseteq \mathbf{V}(\,f)\) by Definition 2, so that \(f \in \mathbf{I}(\overline{S})\).
The proofs of (ii) and (iii) will be covered in the exercises. □
A natural example of Zariski closure is given by elimination ideals. We can now prove the first assertion of the Closure Theorem (Theorem 3 of Chapter 3, §2).
Theorem 4 (The Closure Theorem, first part).
Assume k is algebraically closed. Let \(V = \mathbf{V}(\,f_{1},\ldots,f_{s}) \subseteq k^{n}\) , and let \(\pi _{l}: k^{n} \rightarrow k^{n-l}\) be projection onto the last n − l coordinates. If I l is the l-th elimination ideal \(I_{l} =\langle \, f_{1},\ldots,f_{s}\rangle \cap k[x_{l+1},\ldots,x_{n}]\) , then V (I l ) is the Zariski closure of π l (V ).
Proof.
In view of Proposition 1, we must show that V(I l ) = V(I(π l (V))). By Lemma 1 of Chapter 3, §2, we have \(\pi _{l}(V ) \subseteq \mathbf{V}(I_{l})\). Since V(I(π l (V))) is the smallest variety containing π l (V), it follows immediately that \(\mathbf{V}(\mathbf{I}(\pi _{l}(V ))) \subseteq \mathbf{V}(I_{l})\).
To get the opposite inclusion, suppose f ∈ I(π l (V)), i.e., f(a l+1, …, a n ) = 0 for all (a l+1, …, a n ) ∈ π l (V). Then, considered as an element of k[x 1, x 2, …, x n ], we certainly have f(a 1, a 2, …, a n ) = 0 for all (a 1, …, a n ) ∈ V. By Hilbert’s Nullstellensatz, \(f^{N} \in \langle \, f_{1},\ldots,f_{s}\rangle\) for some integer N. Since f does not depend on x 1, …, x l , neither does f N , and we have \(f^{N} \in \langle \, f_{1},\ldots,f_{s}\rangle \cap k[x_{l+1},\ldots,x_{n}] = I_{l}\). Thus, \(f \in \sqrt{I}_{l}\), which implies \(\mathbf{I}(\pi _{l}(V )) \subseteq \sqrt{I}_{l}\). It follows that \(\mathbf{V}(I_{l}) = \mathbf{V}(\sqrt{I}_{l}) \subseteq \mathbf{V}(\mathbf{I}(\pi _{l}(V )))\), and the theorem is proved. □
The conclusion of Theorem 4 can be stated as \(\mathbf{V}(I_{l}) = \overline{\pi _{l}(V )}\). In general, if V is a variety, then we say that a subset \(S \subseteq V\) is Zariski dense in V if \(V = \overline{S}\), i.e., V is the Zariski closure of S. is Thus Theorem 4 tells us that π l (V) is Zariski dense in V(I l ) when the field is algebraically closed.
One context in which we encountered sets that were not varieties was in taking the difference of varieties. For example, let V = V(I) where \(I \subseteq k[x,y,z]\) is the ideal \(\langle xz,yz\rangle\) and W = V(J) where \(J =\langle z\rangle\). Then we have already seen that V is the union of the (x, y)-plane and the z-axis. Since W is the (x, y)-plane, \(V \setminus W\) is the z-axis with the origin removed [because the origin also belongs to the (x, y)-plane]. We have seen in Chapter 1 that this is not a variety. The z-axis [i.e., V(x, y)] is the Zariski closure of \(V \setminus W\).
We could ask if there is a general way to compute the ideal corresponding to the Zariski closure \(\overline{V \setminus W}\) of the difference of two varieties V and W. The answer is affirmative, but it involves two new algebraic constructions on ideals called ideal quotients and saturations.
We begin with the first construction.
Definition 5.
Proposition 6.
If I, J are ideals in k[x 1 ,…,x n ], then the ideal quotient \(I:J\) is an ideal in k[x 1 ,…,x n ] and \(I:J\) contains I.
Proof.
To show \(I:J\) contains I, note that because I is an ideal, if f ∈ I, then fg ∈ I for all g ∈ k[x 1, …, x n ] and, hence, certainly fg ∈ I for all g ∈ J. To show that \(I:J\) is an ideal, first note that \(0 \in I:J\) because 0 ∈ I. Let \(f_{1},f_{2} \in I:J\). Then f 1 g and f 2 g are in I for all g ∈ J. Since I is an ideal ( f 1 + f 2)g = f 1 g + f 2 g ∈ I for all g ∈ J. Thus, \(f_{1} + f_{2} \in I:J\). To check closure under multiplication is equally straightforward: if \(f \in I:J\) and h ∈ k[x 1, …, x n ], then fg ∈ I and, since I is an ideal, hfg ∈ I for all g ∈ J, which means that \(hf \in I:J\). □
The algebraic properties of ideal quotients and methods for computing them will be discussed later in the section. For now, we want to explore the relation between ideal quotients and the Zariski closure of a difference of varieties.
Proposition 7.
- (i)If I and J are ideals in k[x 1 ,…,x n ], then$$\displaystyle{\mathbf{V}(I) = \mathbf{V}(I + J) \cup \mathbf{V}(I:J).}$$
- (ii)If V and W are varieties k n , then$$\displaystyle{V = (V \cap W) \cup (\overline{V \setminus W}).}$$
- (iii)In the situation of (i) , we have$$\displaystyle{\overline{\mathbf{V}(I)\setminus \mathbf{V}(J)} \subseteq \mathbf{V}(I:J).}$$
Proof.
We begin with (ii). Since V contains \(V \setminus W\) and V is a variety, the smallest variety containing \(V \setminus W\) must be contained in V. Hence, \(\overline{V \setminus W} \subseteq V\). Since \(V \cap W \subseteq V\), we have \((V \cap W) \cup (\overline{V \setminus W}) \subseteq V\).
To get the reverse containment, note that \(V = (V \cap W) \cup (V \setminus W)\). Since \(V \setminus W \subseteq \overline{V \setminus W}\), the desired inclusion \(V \subseteq (V \cap W) \cup \overline{V \setminus W}\) follows immediately.
For (iii), we first claim that \(I:J \subseteq \mathbf{I}(\mathbf{V}(I)\setminus \mathbf{V}(J))\). For suppose that \(f \in I:J\) and \(a \in \mathbf{V}(I)\setminus \mathbf{V}(J)\). Then fg ∈ I for all g ∈ J. Since a ∈ V(I), we have f(a)g(a) = 0 for all g ∈ J. Since \(a\notin \mathbf{V}(J)\), there is some g ∈ J such that g(a) ≠ 0. Hence, f(a) = 0 for all \(a \in \mathbf{V}(I)\setminus \mathbf{V}(J)\). Thus, \(f \in \mathbf{I}(\mathbf{V}(I)\setminus \mathbf{V}(J))\), which proves the claim. Since V reverses inclusions, we have \(\mathbf{V}(I:J) \supseteq \mathbf{V}(\mathbf{I}(\mathbf{V}(I)\setminus \mathbf{V}(J))) = \overline{\mathbf{V}(I)\setminus \mathbf{V}(J)}\).
In Proposition 7, note that V(I + J) from part (i) matches up with \(V \cap W\) in part (ii) since \(\mathbf{V}(I + J) = \mathbf{V}(I) \cap \mathbf{V}(J)\). So it is natural to ask if \(\mathbf{V}(I:J)\) in part (i) matches up with \(\overline{V \setminus W}\) in part (ii). This is equivalent to asking if the inclusion \(\overline{\mathbf{V}(I)\setminus \mathbf{V}(J)} \subseteq \mathbf{V}(I:J)\) in part (iii) is an equality.
However, if we replace J with J 2, then a computation similar to the above gives \(I:J^{2} =\langle y - 1\rangle\), so that \(\mathbf{V}(I:J^{2}) = \overline{\mathbf{V}(I)\setminus \mathbf{V}(J)}\). In general, higher powers may be required, which leads to our second algebraic construction on ideals.
Definition 8.
Proposition 9.
- (i)
\(I \subseteq I:J \subseteq I:J^{\infty }\) .
- (ii)
\(I:J^{\infty } = I:J^{N}\) for all sufficiently large N.
- (iii)
\(\sqrt{I: J^{\infty }} = \sqrt{I}:J\) .
Proof.
Part (ii) follows from the claim just proved, and \(I:J^{\infty } = I:J^{N}\) implies that \(I:J^{\infty }\) is an ideal by Proposition 6. Note also that part (i) follows from (1) and part (ii).
For part (iii), we first show \(\sqrt{I: J^{\infty }}\subseteq \sqrt{I}:J\). This is easy, for \(f \in \sqrt{I: J^{\infty }}\) implies \(f^{m} \in I:J^{\infty }\) for some m. Given g ∈ J, it follows that f m g N ∈ I for some N. Then (fg) M ∈ I for \(M =\mathop{ \mathrm{max}}\limits (m,N)\), so that \(fg \in \sqrt{I}\). Since this holds for all g ∈ J, we conclude that \(f \in \sqrt{I}:J\).
Later in the section we will discuss further algebraic properties of saturations and how to compute them. For now, we focus on their relation to geometry.
Theorem 10.
- (i)
\(\mathbf{V}(I) = \mathbf{V}(I + J) \cup \mathbf{V}(I:J^{\infty })\) .
- (ii)
\(\overline{\mathbf{V}(I)\setminus \mathbf{V}(J)} \subseteq \mathbf{V}(I:J^{\infty })\) .
- (iii)
If k is algebraically closed, then \(\mathbf{V}(I:J^{\infty }) = \overline{\mathbf{V}(I)\setminus \mathbf{V}(J)}\) .
Proof.
In the exercises, you will show that (i) and (ii) follow by easy modifications of the proofs of parts (i) and (iii) of Proposition 7.
In some situations, saturations can be replaced with ideal quotients. For example, the proof of Theorem 10 yields the following corollary when the ideal I is radical.
Corollary 11.
The following proposition takes care of some simple properties of ideal quotients and saturations.
Proposition 12.
- (i)
\(I:k[x_{1},\ldots,x_{n}] = I:k[x_{1},\ldots,x_{n}]^{\infty } = I\) .
- (ii)
\(J \subseteq I\) if and only if \(I:J = k[x_{1},\ldots,x_{n}]\) .
- (iii)
\(J \subseteq \sqrt{I}\) if and only if \(I:J^{\infty } = k[x_{1},\ldots,x_{n}]\) .
Proof.
The proof is left as an exercise. □
When the field is algebraically closed, the reader is urged to translate parts (i) and (iii) of the proposition into terms of varieties (upon which they become clear).
The following proposition will help us compute ideal quotients and saturations.
Proposition 13.
Proof.
We again leave the (straightforward) proofs to the reader. □
We now turn to the question of how to compute generators of the ideal quotient \(I:J\) and saturation \(I:J^{\infty }\), given generators of I and J. Inspired by (5), we begin with the case when J is generated by a single polynomial.
Theorem 14.
- (i)
If {h 1 ,…,h p } is a basis of the ideal \(I \cap \langle g\rangle\) , then {h 1 ∕g,…,h p ∕g} is a basis of \(I:g\) .
- (ii)If \(\{f_{1},\ldots,f_{s}\}\) is a basis of I and \(\tilde{I }=\langle \, f_{1},\ldots,f_{s},1 - yg\rangle \subseteq k[x_{1},\ldots,x_{n},y]\) , where y is a new variable, thenFurthermore, if G is a lex Gröbner basis of \(\tilde{I }\) for y > x 1 > ⋯ > x n , then \(G \cap k[x_{1},\ldots,x_{n}]\) is a basis of \(I:g^{\infty }\) .$$\displaystyle{I:g^{\infty } =\tilde{ I }\cap k[x_{ 1},\ldots,x_{n}].}$$
Proof.
The first assertion of (ii) is left as an exercise. Then the Elimination Theorem from Chapter 3, §1 implies that \(G \cap k[x_{1},\ldots,x_{n}]\) is a Gröbner basis of \(I:g^{\infty }\). □
This theorem, together with our procedure for computing intersections of ideals and equation (5), immediately leads to an algorithm for computing a basis of an ideal quotient: given \(I =\langle \, f_{1},\ldots,f_{r}\rangle\) and \(J =\langle g_{1},\ldots,g_{s}\rangle\), to compute a basis of \(I:J\), we first compute a basis for \(I:g_{i}\) for each i. In view of Theorem 14, this means computing a basis \(\{h_{1},\ldots,h_{p}\}\) of \(\langle \,f_{1},\ldots,f_{r}\rangle \cap \langle g_{i}\rangle\). Recall that we do this via the algorithm for computing intersections of ideals from §3. Using the division algorithm, we divide each of basis element h j by g i to get a basis for \(I:g_{i}\) by part (i) of Theorem 14. Finally, we compute a basis for \(I:J\) by applying the intersection algorithm s − 1 times, computing first a basis for \(I:\langle g_{1},g_{2}\rangle = (I:g_{1}) \cap (I:g_{2})\), then a basis for \(I:\langle g_{1},g_{2},g_{3}\rangle = (I:\langle g_{1},g_{2}\rangle ) \cap (I:g_{3})\), and so on.
Similarly, we have an algorithm for computing a basis of a saturation: given \(I =\langle \, f_{1},\ldots,f_{r}\rangle\) and \(J =\langle g_{1},\ldots,g_{s}\rangle\), to compute a basis of \(I:J^{\infty }\), we first compute a basis for \(I:g_{i}^{\infty }\) for each i using the method described in part (ii) of Theorem 14. Then by (5), we need to intersect the ideals \(I:g_{i}^{\infty }\), which we do as above by applying the intersection algorithm s − 1 times.
Exercises for §4
- 1.Find the Zariski closure of the following sets:
- a.
The projection of the hyperbola V(xy − 1) in \(\mathbb{R}^{2}\) onto the x-axis.
- b.
The boundary of the first quadrant in \(\mathbb{R}^{2}\).
- c.
The set \(\{(x,y) \in \mathbb{R}^{2}\mid x^{2} + y^{2} \leq 4\}\).
- a.
- 2.
- 3.
Let f = (x + y)2(x − y)(x + z 2) and g = (x + z 2)3(x − y)(z + y). Compute generators for \(\langle \,f\rangle:\langle g\rangle\).
- 4.
Let I and J be ideals in \(k[x_{1},\ldots,x_{n}]\). Show that if I is radical, then I: J is radical and \(I: J = I: \sqrt{J} = I: J^{\infty }\).
- 5.
As in the proof of Proposition 9, assume \(J =\langle g_{1},\ldots,g_{s}\rangle\). Prove that \(J^{sM}\! \subseteq \!\langle g_{1}^{M},\ldots,g_{s}^{M}\rangle\). Hint: See the proof of Lemma 5 of §2.
- 6.
Prove parts (i) and (ii) of Theorem 10. Hint: Adapt the proofs of parts (i) and (iii) of Proposition 7.
- 7.
Prove Corollary 11. Hint: Combine Theorem 10 and the Exercise 4. Another approach would be look closely at the proof of Theorem 10 when I is radical.
- 8.
Let \(V,W \subseteq k^{n}\) be varieties. Prove that \(\mathbf{I}(V ): \mathbf{I}(W) = \mathbf{I}(V \setminus W)\).
- 9.
Prove Proposition 12 and find geometric interpretations of parts (i) and (iii)
- 10.
- 11.
Prove \(I: g^{\infty } =\tilde{ I }\cap k[x_{1},\ldots,x_{n}]\) from part (ii) of Theorem 14. Hint: See the proof of Proposition 8 of §2.
- 12.
- 13.
An example mentioned in the text used \(I =\langle x^{2}(y - 1)\rangle\) and \(J =\langle x\rangle\). Compute \(I: J^{\infty }\) and explain how your answer relates to the discussion in the text.
- 14.
Let \(I,J \subseteq k[x_{1},\ldots,x_{n}]\) be ideals. Prove that \(I: J^{\infty } = I: J^{N}\) if and only if \(I: J^{N} = I: J^{N+1}\). Then use this to describe an algorithm for computing the saturation \(I: J^{\infty }\) based on the algorithm for computing ideal quotients.
- 15.
Show that N can be arbitrarily large in \(I: J^{\infty } = I: J^{N}\). Hint: Look at \(I =\langle x^{N}(y - 1)\rangle\).
- 16.Let \(I,J,K \subseteq k[x_{1},\ldots,x_{n}]\) be ideals. Prove the following:
- a.
\(IJ \subseteq K\) if and only if \(I \subseteq K: J\).
- b.
\((I: J): K = I: JK\).
- a.
- 17.
Given ideals \(I_{1},\ldots,I_{r},J \subseteq k[x_{1},\ldots,x_{n}]\), prove that \(\big(\bigcap _{i=1}^{r}I_{i}\big): J =\bigcap _{ i=1}^{r}(I_{i}: J)\). Then prove a similar result for saturations and give a geometric interpretation.
- 18.Let A be an m × n constant matrix and suppose that x = Ay. where we are thinking of x ∈ k m and y ∈ k n as column vectors of variables. As in Exercise 13 of §3, define a mapby sending f ∈ k[x 1, …, x m ] to α A ( f) ∈ k[y 1, …, y n ], where α A ( f) is the polynomial defined by α A ( f)(y) = f(Ay).$$\displaystyle{\alpha _{A}: k[x_{1},\ldots,x_{m}]\longrightarrow k[y_{1},\ldots,y_{n}]}$$
- a.
Show that \(\alpha _{A}(I: J) \subseteq \alpha _{A}(I): \alpha _{A}(J)\) with equality if \(I \supseteq \mathrm{ ker}(\alpha _{A})\) and α A is onto.
- b.
Show that \(\alpha _{A}^{-1}(I^{{\prime}}: J^{{\prime}}) =\alpha _{ A}^{-1}(I^{{\prime}}): \alpha _{A}^{-1}(J^{{\prime}})\) when α A is onto.
- a.
5 Irreducible Varieties and Prime Ideals
We have already seen that the union of two varieties is a variety. For example, in Chapter 1 and in the last section, we considered V(xz, yz), which is the union of a line and a plane. Intuitively, it is natural to think of the line and the plane as “more fundamental” than V(xz, yz). Intuition also tells us that a line or a plane are “irreducible” or “indecomposable” in some sense: they do not obviously seem to be a union of finitely many simpler varieties. We formalize this notion as follows.
Definition 1.
An affine variety \(V \subseteq k^{n}\) is irreducible if whenever V is written in the form \(V = V _{1} \cup V _{2}\), where V 1 and V 2 are affine varieties, then either V 1 = V or V 2 = V.
Thus, V(xz, yz) is not an irreducible variety. On the other hand, it is not completely clear when a variety is irreducible. If this definition is to correspond to our geometric intuition, it is clear that a point, a line, and a plane ought to be irreducible. For that matter, the twisted cubic V(y − x 2, z − x 3) in \(\mathbb{R}^{3}\) appears to be irreducible. But how do we prove this? The key is to capture this notion algebraically: if we can characterize ideals which correspond to irreducible varieties, then perhaps we stand a chance of establishing whether a variety is irreducible.
The following notion turns out to be the right one.
Definition 2.
An ideal \(I \subseteq k[x_{1},\ldots,x_{n}]\) is prime if whenever f, g ∈ k[x 1, …, x n ] and fg ∈ I, then either f ∈ I or g ∈ I.
If we have set things up right, an irreducible variety will correspond to a prime ideal and conversely. The following theorem assures us that this is indeed the case.
Proposition 3.
Let \(V \subseteq k^{n}\) be an affine variety. Then V is irreducible if and only if I (V ) is a prime ideal.
Proof.
First, assume that V is irreducible and let fg ∈ I(V). Set \(V _{1} = V \cap \mathbf{V}(\,f)\) and \(V _{2} = V \cap \mathbf{V}(g)\); these are affine varieties because an intersection of affine varieties is a variety. Then fg ∈ I(V) easily implies that \(V = V _{1} \cup V _{2}\). Since V is irreducible, we have either V = V 1 or V = V 2. Say the former holds, so that \(V = V _{1} = V \cap \mathbf{V}(\,f)\). This implies that f vanishes on V, so that f ∈ I(V). Thus, I(V) is prime.
Next, assume that I(V) is prime and let \(V = V _{1} \cup V _{2}\). Suppose that V ≠ V 1. We claim that I(V) = I(V 2). To prove this, note that \(\mathbf{I}(V ) \subseteq \mathbf{I}(V _{2})\) since \(V _{2} \subseteq V\). For the opposite inclusion, first note that \(\mathbf{I}(V ) \subsetneq \mathbf{I}(V _{1})\) since \(V _{1} \subsetneq V\). Thus, we can pick \(f \in \mathbf{I}(V _{1})\setminus \mathbf{I}(V )\). Now take any g ∈ I(V 2). Since \(V = V _{1} \cup V _{2}\), it follows that fg vanishes on V, and, hence, fg ∈ I(V). But I(V) is prime, so that f or g lies in I(V). We know that \(f\notin \mathbf{I}(V )\) and, thus, g ∈ I(V). This proves I(V) = I(V 2), whence V = V 2 because I is one-to-one. Thus, V is an irreducible variety. □
It is an easy exercise to show that every prime ideal is radical. Then, using the ideal-variety correspondence between radical ideals and varieties, we get the following corollary of Proposition 3.
Corollary 4.
When k is algebraically closed, the functions I and V induce a one-to-one correspondence between irreducible varieties in k n and prime ideals in k[x 1 ,…,x n ].
In fact, the above argument holds much more generally.
Proposition 5.
Proof.
With a little care, the above argument extends still further to show that any variety defined by a rational parametrization is irreducible.
Proposition 6.
Proof.
Now, we continue our proof that I(V) is prime. Suppose p, q ∈ k[x 1, …, x n ] satisfy p ⋅ q ∈ I(V). If the total degrees of p and q are M and N, respectively, then the total degree of \(p \cdot q\) is M + N. Thus, \((g_{1}g_{2}\cdots g_{n})^{M+N}(p \circ F) \cdot (q \circ F) = 0\). But the former is a product of the polynomials \((g_{1}g_{2}\cdots g_{n})^{M}(p \circ F)\) and \((g_{1}g_{2}\cdots g_{n})^{N}(q \circ F)\) in k[t 1, …, t m ]. Hence one of them must be the zero polynomial. In particular, either p ∈ I(V) or q ∈ I(V). This shows that I(V) is a prime ideal and, therefore, that V is an irreducible variety. □
The simplest variety in k n given by a parametrization consists of a single point, {(a 1, …, a n )}. In the notation of Proposition 5, it is given by the parametrization in which each f i is the constant polynomial \(f_{i}(t_{1},\ldots,t_{m}) = a_{i},1 \leq i \leq n\). It is clearly irreducible and it is easy to check that \(\mathbf{I}(\{(a_{1},\ldots,a_{n})\}) =\langle x_{1} - a_{1},\ldots,x_{n} - a_{n}\rangle\) (see Exercise 7), which implies that the latter is prime. The ideal \(\langle x_{1} - a_{1},\ldots,x_{n} - a_{n}\rangle\) has another distinctive property: it is maximal in the sense that the only ideal which strictly contains it is the whole ring k[x 1, …, x n ]. Such ideals are important enough to merit special attention.
Definition 7.
An ideal \(I \subseteq k[x_{1},\ldots,x_{n}]\) is said to be maximal if I ≠ k[x 1, …, x n ] and any ideal J containing I is such that either J = I or J = k[x 1, …, x n ].
In order to streamline statements, we make the following definition.
Definition 8.
An ideal \(I \subseteq k[x_{1},\ldots,x_{n}]\) is said to be proper if I is not equal to k[x 1, …, x n ].
Thus, an ideal is maximal if it is proper and no other proper ideal strictly contains it. We now show that any ideal of the form \(\langle x_{1} - a_{1},\ldots,x_{n} - a_{n}\rangle\) is maximal.
Proposition 9.
Proof.
Proposition 10.
If k is any field, a maximal ideal in k[x 1 ,…,x n ] is prime.
Proof.
Suppose that I is a proper ideal which is not prime and let fg ∈ I, where \(f\notin I\) and \(g\notin I\). Consider the ideal \(\langle \,f\rangle + I\). This ideal strictly contains I because \(f\notin I\). Moreover, if we were to have \(\langle \,f\rangle + I = k[x_{1},\ldots,x_{n}]\), then 1 = cf + h for some polynomial c and some h ∈ I. Multiplying through by g would give g = cfg + hg ∈ I which would contradict our choice of g. Thus, \(I +\langle \, f\rangle\) is a proper ideal containing I, so that I is not maximal. □
Note that Propositions 9 and 10 together imply that \(\langle x_{1} - a_{1},\ldots,x_{n} - a_{n}\rangle\) is prime in k[x 1, …, x n ] even if k is not infinite. Over an algebraically closed field, it turns out that every maximal ideal corresponds to some point of k n .
Theorem 11.
If k is an algebraically closed field, then every maximal ideal of k[x 1 ,…,x n ] is of the form \(\langle x_{1} - a_{1},\ldots,x_{n} - a_{n}\rangle\) for some a 1 ,…,a n ∈ k.
Proof.
Note the proof of Theorem 11 uses the Weak Nullstellensatz. It is not difficult to see that it is, in fact, equivalent to the Weak Nullstellensatz.
We have the following easy corollary of Theorem 11.
Corollary 12.
If k is an algebraically closed field, then there is a one-to-one correspondence between points of k n and maximal ideals of k[x 1 ,…,x n ].
Thus, we have extended our algebra–geometry dictionary. Over an algebraically closed field, every nonempty irreducible variety corresponds to a proper prime ideal, and conversely. Every point corresponds to a maximal ideal, and conversely.
We can use Zariski closure to characterize when a variety is irreducible.
Proposition 13.
A variety V is irreducible if and only if for every variety \(W \subsetneq V\) , the difference \(V \setminus W\) is Zariski dense in V.
Proof.
First assume that V is irreducible and take \(W \subsetneq V\). Then Proposition 7 of §4 gives the decomposition \(V = W \cup \overline{V \setminus W}\). Since V is irreducible and V ≠ W, this forces \(V = \overline{V \setminus W}\).
For the converse, suppose that \(V = V _{1} \cup V _{2}\). If \(V _{1} \subsetneq V\), then \(\overline{V \setminus V _{1}} = V\). But \(V \setminus V _{1} \subseteq V _{2}\), so that \(\overline{V \setminus V _{1}} \subseteq V _{2}\). This implies \(V \subseteq V _{2}\), and V = V 2 follows. □
Let us make a final comment about terminology. Some references, such as HARTSHORNE (1977), use the term “variety” for what we call an irreducible variety and say “algebraic set” instead of variety. When reading other books on algebraic geometry, be sure to check the definitions!
Exercises for §5
- 1.
If h ∈ k[x 1, …, x n ] has total degree N and if F is as in Proposition 6, show that \((g_{1}g_{2}\ldots g_{n})^{N}(h \circ F)\) is a polynomial in k[t 1, …, t m ].
- 2.
Show that a prime ideal is radical.
- 3.
Show that an ideal I is prime if and only if for any ideals J and K such that \(JK \subseteq I\), either \(J \subseteq I\) or \(K \subseteq I\).
- 4.
Let I 1, …, I n be ideals and P a prime ideal containing \(\bigcap _{i=1}^{n}I_{i}\). Then prove that \(P \supseteq I_{i}\) for some i. Further, if \(P =\bigcap _{ i=1}^{n}I_{i}\), show that P = I i for some i.
- 5.
Express f = x 2 z − 6y 4 + 2xy 3 z in the form f = f 1(x, y, z)(x + 3) + f 2(x, y, z)(y − 1) + f 3(x, y, z)(z − 2) for some f 1, f 2, f 3 ∈ k[x, y, z].
- 6.Let k be an infinite field.
- a.
Show that any straight line in k n is irreducible.
- b.
Prove that any linear subspace of k n is irreducible. Hint: Parametrize and use Proposition 5.
- a.
- 7.Show that$$\displaystyle{\mathbf{I}(\{(a_{1},\ldots,a_{n})\}) =\langle x_{1} - a_{1},\ldots,x_{n} - a_{n}\rangle.}$$
- 8.Show the following:
- a.
\(\langle x^{2} + 1\rangle\) is maximal in \(\mathbb{R}[x]\).
- b.
If \(I \subseteq \mathbb{R}[x_{1},\ldots,x_{n}]\) is maximal, show that V(I) is either empty or a point in \(\mathbb{R}^{n}\). Hint: Examine the proof of Theorem 11.
- c.
Give an example of a maximal ideal I in \(\mathbb{R}[x_{1},\ldots,x_{n}]\) for which \(\mathbf{V}(I) = \varnothing\). Hint: Consider the ideal \(\langle x_{1}^{2} + 1,x_{2},\ldots,x_{n}\rangle\).
- a.
- 9.Suppose that k is a field which is not algebraically closed.
- a.
Show that if \(I \subseteq k[x_{1},\ldots,x_{n}]\) is maximal, then V(I) is either empty or a point in k n . Hint: Examine the proof of Theorem 11.
- b.
Show that there exists a maximal ideal I in k[x 1, …, x n ] for which \(\mathbf{V}(I) = \varnothing\). Hint: See the previous exercise.
- c.
Conclude that if k is not algebraically closed, there is always a maximal ideal of k[x 1, …, x n ] which is not of the form \(\langle x_{1} - a_{1},\ldots,x_{n} - a_{n}\rangle\).
- a.
- 10.
Prove that Theorem 11 implies the Weak Nullstellensatz.
- 11.
If \(f \in \mathbb{C}[x_{1},\ldots,x_{n}]\) is irreducible, then V( f) is irreducible. Hint: Show that \(\langle \,f\rangle\) is prime.
- 12.
Prove that if I is any proper ideal in \(\mathbb{C}[x_{1},\ldots,x_{n}]\), then \(\sqrt{I}\) is the intersection of all maximal ideals containing I. Hint: Use Theorem 11.
- 13.Let \(f_{1},\ldots,f_{n} \in k[x_{1}]\) be polynomials of one variable and consider the idealWe also assume that \(\mathrm{deg}(\,f_{1}) = m> 0\).$$\displaystyle{I =\langle \, f_{1}(x_{1}),x_{2} - f_{2}(x_{1}),\ldots,x_{n} - f_{n}(x_{1})\rangle \subseteq k[x_{1},\ldots,x_{n}].}$$
- a.
Show that every \(f \in k[x_{1},\ldots,x_{n}]\) can be written uniquely as f = q + r where q ∈ I and r ∈ k[x 1] with either r = 0 or \(\mathrm{deg}(r) <m\). Hint: Use lex order with x 1 last.
- b.
Let f ∈ k[x 1]. Use part (a) to show that f ∈ I if and only if f is divisible by f 1 in k[x 1].
- c.
Prove that I is prime if and only if f 1 ∈ k[x 1] is irreducible.
- d.
Prove that I is radical if and only if f 1 ∈ k[x 1] is square-free.
- e.
Prove that \(\sqrt{I} =\langle (\,f_{1})_{\mathrm{red}}\rangle + I\), where (f 1)red is defined in §2.
- a.
6 Decomposition of a Variety into Irreducibles
In the last section, we saw that irreducible varieties arise naturally in many contexts. It is natural to ask whether an arbitrary variety can be built up out of irreducibles. In this section, we explore this and related questions.
We begin by translating the ascending chain condition (ACC) for ideals (see §5 of Chapter 2) into the language of varieties.
Proposition 1 (The Descending Chain Condition).
Proof.
We can use Proposition 1 to prove the following basic result about the structure of affine varieties.
Theorem 2.
Proof.
Both in the above example and the case of V(xz, yz), it appears that the decomposition of a variety into irreducible pieces is unique. It is natural to ask whether this is true in general. It is clear that, to avoid trivialities, we must rule out decompositions in which the same irreducible piece appears more than once, or in which one irreducible piece contains another. This is the aim of the following definition.
Definition 3.
With this definition, we can now prove the following uniqueness result.
Theorem 4.
Proof.
By Theorem 2, V can be written in the form \(V = V _{1} \cup \cdots \cup V _{m}\), where each V i is irreducible. Further, if a V i lies in some V j for i ≠ j, we can drop V i , and V will be the union of the remaining V j ’s for j ≠ i. Repeating this process leads to a minimal decomposition of V.
The uniqueness part of Theorem 4 guarantees that the irreducible components of V are well-defined. We remark that the uniqueness is false if one does not insist that the union be finite. (A plane P is the union of all the points on it. It is also the union of some line in P and all the points not on the line—there are infinitely many lines in P with which one could start.) This should alert the reader to the fact that although the proof of Theorem 4 is easy, it is far from vacuous: one makes subtle use of finiteness (which follows, in turn, from the Hilbert Basis Theorem).
Here is a result that relates irreducible components to Zariski closure.
Proposition 5.
Let V,W be varieties with \(W \subsetneq V\) . Then \(V \setminus W\) is Zariski dense in V if and only if W contains no irreducible component of V.
Proof.
Theorems 2 and 4 can also be expressed purely algebraically using the one-to-one correspondence between radical ideals and varieties.
Theorem 6.
If k is algebraically closed, then every radical ideal in k[x 1 ,…,x n ] can be written uniquely as a finite intersection of prime ideals \(P_{1} \cap \cdots \cap P_{r}\) , where \(P_{i}\not\subseteq P_{j}\) for i ≠ j. (As in the case of varieties, we often call such a presentation of a radical ideal a minimal decomposition or an irredundant intersection ).
We can also use ideal quotients from §4 to describe the prime ideals that appear in the minimal representation of a radical ideal.
Theorem 7.
Proof.
First, note that since I is proper, each P i is also a proper ideal (this follows from minimality).
Now suppose that \(I: f\) is a proper prime ideal. By Exercise 4 of §5, the above formula for \(I: f\) implies that \(I: f = P_{i}: f\) for some i. Since \(P_{i}:f = P_{i}\) or \(\langle 1\rangle\) by the above observation, it follows that \(I: f = P_{i}\).
To see that every P i can arise in this way, fix i and pick \(f \in \big (\bigcap _{j\neq i}^{r}P_{j}\big)\setminus P_{i}\); such an f exists because \(\bigcap _{i=1}^{r}P_{i}\) is minimal. Then \(P_{i}: f = P_{i}\) and \(P_{j}: f =\langle 1\rangle\) for j ≠ i. If we combine this with the above formula for \(I: f\), then it follows that \(I: f = P_{i}\). □
We should mention that Theorems 6 and 7 hold for any field k, although the proofs in the general case are different (see Corollary 10 of §8).
For an example of these theorems, consider the ideal \(I =\langle xz - y^{2},x^{3} - yz\rangle\). Recall that the variety V = V(I) was discussed earlier in this section. For the time being, let us assume that I is radical (eventually we will see that this is true). Can we write I as an intersection of prime ideals?
-
(Primality) Is there an algorithm for deciding if a given ideal is prime?
-
(Irreducibility) Is there an algorithm for deciding if a given affine variety is irreducible?
-
(Decomposition) Is there an algorithm for finding the minimal decomposition of a given variety or radical ideal?
The answer to all three questions is yes, and descriptions of the algorithms can be found in the works of HERMANN (1926), MINES, RICHMAN, and RUITENBERG (1988), and SEIDENBERG (1974, 1984). As in §2, the algorithms in these articles are very inefficient. However, the work of GIANNI, TRAGER and ZACHARIAS (1988) and EISENBUD, HUNEKE and VASCONCELOS (1992) has led to more efficient algorithms. See also Chapter 8 of BECKER and WEISPFENNING (1993) and §4.4 of ADAMS and LOUSTAUNAU (1994).
Exercises for §6
- 1.
Show that the intersection of any collection of prime ideals is radical.
- 2.
Show that an irredundant intersection of at least two prime ideals is never prime.
- 3.
If \(P \subseteq k[x_{1},\ldots,x_{n}]\) is a prime ideal, then prove that \(P: f = P\) if \(f\notin P\) and \(P: f =\langle 1\rangle\) if f ∈ P.
- 4.Let \(I =\langle xz\ - y^{2},x^{3} - yz\rangle\).
- a.
Show that \(I: (x^{2}y - z^{2}) =\langle x,y\rangle\).
- b.
Show that \(I: (x^{2}y - z^{2})\) is prime.
- c.
Show that \(I =\langle x,y\rangle \cap \langle xz\ - y^{2},x^{3} - yz,z^{2} - x^{2}y\rangle\).
- a.
- 5.Let \(J =\langle xz\ - y^{2},x^{3} - yz,z^{2} - x^{2}y\rangle \subseteq k[x,y,z]\), where k is infinite.
- a.
Show that every point of W = V(J) is of the form (t 3, t 4, t 5) for some t ∈ k.
- b.Show that J = I(W). Hint: Compute a Gröbner basis for J using lex order with z > y > x and show that every f ∈ k[x, y, z] can be written in the formwhere g ∈ J, a, b ∈ k and A, B, C ∈ k[x].$$\displaystyle{f = g + a + bz + xA(x)\ + yB(x)\ + y^{2}C(x),}$$
- a.
- 6.
Complete the proof of Proposition 5. Hint: \(V _{i} \subseteq W\) implies \(V \setminus W \subseteq V \setminus V _{i}\).
- 7.
Translate Theorem 7 and its proof into geometry.
- 8.Let \(I =\langle xz\ - y^{2},z^{3} - x^{5}\rangle \subseteq \mathbb{Q}[x,y,z]\).
- a.
Express V(I) as a finite union of irreducible varieties. Hint: The parametrizations (t 3, t 4, t 5) and (t 3, −t 4, t 5) will be useful.
- b.
Express I as an intersection of prime ideals which are ideal quotients of I and conclude that I is radical.
- a.
- 9.
Let V, W be varieties in k n with \(V \subseteq W\). Show that each irreducible component of V is contained in some irreducible component of W.
- 10.
Let \(f \in \mathbb{C}[x_{1},\ldots,x_{n}]\) and let \(f = f_{1}^{a_{1}}f_{2}^{a_{2}}\cdots f_{r}^{a_{r}}\) be the decomposition of f into irreducible factors. Show that \(\mathbf{V}(\,f) = \mathbf{V}(\,f_{1}) \cup \cdots \cup \mathbf{V}(\,f_{r})\) is the decomposition of V( f) into irreducible components and \(\mathbf{I}(\mathbf{V}(\,f)) =\langle \, f_{1}f_{2}\cdots f_{r}\rangle\). Hint: See Exercise 11 of §5.
7 Proof of the Closure Theorem
This section will complete the proof of the Closure Theorem from Chapter 3, §2. We will use many of the tools introduced in this chapter, including the Nullstellensatz, Zariski closures, saturations, and irreducible components.
We begin by recalling the basic situation. Let k be an algebraically closed field, and let π l : k n → k n−l is projection onto the last n − l components. If V = V(I) is an affine variety in k n , then we get the l-th elimination ideal \(I_{l} = I \cap k[x_{l+1},\ldots,x_{n}]\). The first part of the Closure Theorem, which asserts that V(I l ) is the Zariski closure of π l (V) in k n−l , was proved earlier in Theorem 4 of §4.
The remaining part of the Closure Theorem tells us that π l (V) fills up “most” of V(I l ). Here is the precise statement.
Theorem 1 (The Closure Theorem, second part).
This is slightly different from the Closure Theorem stated in §2 of Chapter 3. There, we assumed \(V \neq \varnothing\) and asserted that \(\mathbf{V}(I_{l})\setminus W \subseteq \pi _{l}(V )\) for some \(W \subsetneq \mathbf{V}(I_{l})\). In Exercise 1 you will prove that Theorem 1 implies the version in Chapter 3.
The proof of Theorem 1 will use the following notation. Rename \(x_{l+1},\ldots,x_{n}\) as \(y_{l+1},\ldots,y_{n}\) and write \(k[x_{1},\ldots,x_{l},y_{l+1},\ldots,y_{n}]\) as \(k[\mathbf{x},\mathbf{y}]\) for \(\mathbf{x} = (x_{1},\ldots,x_{l})\) and \(\mathbf{y} = (y_{l+1},\ldots,y_{n})\). Also fix a monomial order > on k[x, y] with the property that x α > x β implies x α > x β y γ for all γ. The product order described in Exercise 9 of Chapter 2, §4 is an example of such a monomial order. Another example is given by lex order with x 1 > ⋯ > x l > y l+1 > ⋯ > y n .
An important tool in proving Theorem 1 is the following result.
Theorem 2.
- (i)The setis a Gröbner basis of the ideal \(\{f(\mathbf{x},\mathbf{b})\mid f \in I\}\) .$$\displaystyle{\overline{G} =\{ g_{i}(\mathbf{x},\mathbf{b})\mid g_{i}\notin k[\mathbf{y}]\} \subseteq k[\mathbf{x}]}$$
- (ii)
If k is algebraically closed, then there exists \(\mathbf{a} = (a_{1},\ldots,a_{l}) \in k^{l}\) such that ( a , b ) ∈ V = V (I).
Proof.
For part (ii), note that by construction, every element of \(\overline{G}\) has positive total degree in the x variables, so that \(\bar{g}_{i}\) is nonconstant for every i. It follows that \(1\notin \bar{I}\) since \(\overline{G}\) is a Gröbner basis of \(\bar{I}\). Hence \(\bar{I} \subsetneq k[\mathbf{x}]\), so that by the Nullstellensatz, there exists a ∈ k l such that \(\bar{g}_{i}(\mathbf{a}) = 0\) for all \(\bar{g}_{i} \in \overline{G}\), i.e., g i (a, b) = 0 for all \(g_{i} \in G\setminus k[\mathbf{y}]\). Since \(\bar{g}_{i} = 0\) when g i ∈ k[y], it follows that g i (a, b) = 0 for all g i ∈ G. Hence (a, b) ∈ V = V(I). □
Part (ii) of Theorem 2 is related to the Extension Theorem from Chapter 3. Compared to the Extension theorem, part (ii) is simultaneously stronger (the Extension Theorem assumes l = 1, i.e., just one variable is eliminated) and weaker [part (ii) requires the nonvanishing of all relevant leading coefficients, while the Extension Theorem requires just one].
For our purposes, Theorem 2 has the following important corollary.
Corollary 3.
Proof.
Take \(\mathbf{b} \in \mathbf{V}(I_{l})\setminus \mathbf{V}\big(\prod _{g_{i}\in G\setminus k[\mathbf{y}]}\,c_{i}\big)\). Then b ∈ V(I l ) and c i (b) ≠ 0 for all \(g_{i} \in G\setminus k[\mathbf{y}]\). By Theorem 2, there is a ∈ k l such that (a, b) ∈ V = V(I). In other words, b ∈ π l (V), and the corollary follows. □
Hence, to complete the proof of the Closure Theorem, we need to explore what happens when the difference \(\mathbf{V}(I_{l})\setminus \mathbf{V}\big(\prod _{g_{i}\in G\setminus k[\mathbf{y}]}\,c_{i}\big)\) is not Zariski dense in V(I l ). The following proposition shows that in this case, the original variety V = V(I) decomposes into varieties coming from strictly bigger ideals.
Proposition 4.
- (i)
\(V = \mathbf{V}(I +\langle c_{i}\rangle ) \cup \mathbf{V}(I:c_{i}^{\infty })\) .
- (ii)
\(I \subsetneq I +\langle c_{i}\rangle\) and \(I \subsetneq I:c_{i}^{\infty }\) .
Proof.
For (i), we have \(V = \mathbf{V}(I) = \mathbf{V}(I +\langle c_{i}\rangle ) \cup \mathbf{V}(I:c_{i}^{\infty })\) by Theorem 10 of §4.
For (ii), we first show that \(I \subsetneq I +\langle c_{i}\rangle\) for all \(g_{i} \in G\setminus k[\mathbf{y}]\). To see why, suppose that c i ∈ I for some i. Since G is a Gröbner basis of I, LT(c i ) is divisible by some LT(g j ), and then g j ∈ k[y] since the monomial order eliminates the x variables. Hence g j ≠ g i . But then (1) implies that LT(g j ) divides \(\mbox{ LT}(g_{i}) = \mbox{ LT}(c_{i})\mathbf{x}^{\alpha _{i}}\), which contradicts our assumption that G is reduced. Hence \(c_{i}\notin I\), and \(I \subsetneq I +\langle c_{i}\rangle\) follows.
In the situation of Proposition 4, we have a decomposition of V into two pieces. The next step is to show that if we can find a W that works for each piece, then we can find a W what works for V. Here is the precise result.
Proposition 5.
Proof.
The final ingredient we need for the proof of the Closure Theorem is the following maximum principle for ideals.
Proposition 6 (Maximum Principle for Ideals).
Proof.
This is an easy consequence of the ascending chain condition (Theorem 7 of Chapter 2, §5). The proof will be left as an exercise. □
We are now ready to prove the second part of the Closure Theorem.
Among all ideals for which the theorem fails, the maximum principle of Proposition 6 guarantees that there is a maximal such ideal, i.e., there is an ideal I such that the theorem fails for I but holds for every strictly larger ideal \(I \subsetneq J\).
The proof of the Closure Theorem just given is nonconstructive. Fortunately, in practice it is straightforward to find \(W \subseteq \mathbf{V}(I_{l})\) with the required properties. We will give two examples and then describe a general procedure.
We say that FindW is recursive since it calls itself. We leave it as an exercise to show that the maximum principle from Proposition 6 implies that FindW always terminates in finitely many steps. When it does, correctness follows from the above discussion.
We end this section by using the Closure Theorem to give a precise description of the projection \(\pi _{l}(V ) \subseteq k^{n-l}\) of an affine variety \(V \subseteq k^{n}\).
Theorem 7.
Proof.
In general, a set of the form described in Theorem 7 is called constructible.
Exercises for §7
- 1.
Prove that Theorem 3 of Chapter 3, §2 follows from Theorem 1 of this section. Hint: Show that the W from Theorem 1 satisfies \(W \subsetneq \mathbf{V}(I_{l})\) when \(V \neq \varnothing\).
- 2.
In the notation of Theorem 2, prove that \(\bar{I} =\langle \overline{G}\rangle\) for \(\bar{I} =\{\bar{ f}\mid f \in I\}\).
- 3.
Given sets A and B, prove that \(A\setminus B = A\setminus (A \cap B)\).
- 4.
In the proof of Proposition 4, prove that \(I = I: c_{i}^{\infty }\) implies that \(I_{l} = I_{l}: c_{i}^{\infty }\).
- 5.This exercise will explore some properties of irreducible components needed in the proofs of Propositions 4 and 5.
- a.
Let \(W_{1},\ldots,W_{r}\) be affine varieties contained in a variety V and assume that for each 1 ≤ i ≤ r, no irreducible component of V is contained in W i . Prove that the same is true for \(\bigcup _{i=1}^{r}W_{i}\). (This fact is used in the proof of Proposition 4.)
- b.
Let \(W_{i} \subseteq V _{i}\) be affine varieties for i = 1, 2 such that W i contains no irreducible component of V i . Prove that \(W = W_{1} \cup W_{2}\) contains no irreducible component of \(V = V _{1} \cup V _{2}\). (This fact is used in the proof of Proposition 5.)
- a.
- 6.
Prove Proposition 6. Hint: Assume that the proposition is false for some nonempty collection of ideals \(\{I_{\alpha }\}_{\alpha \in \mathcal{A}}\) and show that this leads to a contradiction of the ascending chain condition.
- 7.In this exercise we will see why it is reasonable to make the convention that the empty product is 1. Let R be a commutative ring with 1 and let \(\mathcal{A}\) be a finite set such that for every \(\alpha \in \mathcal{A}\), we have r α ∈ R. Then we get the productAlthough \(\mathcal{A}\) is unordered, the product is well-defined since R is commutative.$$\displaystyle{\prod _{\alpha \in \mathcal{A}}r_{\alpha }.}$$
- a.Assume \(\mathcal{B}\) is finite and disjoint from \(\mathcal{A}\) such that for every \(\beta \in \mathcal{B}\), we have r β ∈ R. Prove that$$\displaystyle{\prod _{\gamma \in \mathcal{A}\cup \mathcal{B}}r_{\gamma } =\big (\prod _{\alpha \in \mathcal{A}}r_{\alpha }\big)\big(\prod _{\beta \in \mathcal{B}}r_{\beta }\big).}$$
- b.
It is likely that the proof you gave in part (a) assumed that \(\mathcal{A}\) and \(\mathcal{B}\) are nonempty. Explain why \(\prod _{\alpha \in \varnothing }r_{\alpha } = 1\) makes the above formula work in all cases.
- c.
In a similar way, define \(\sum _{\alpha \in \mathcal{A}}r_{\alpha }\) and explain why \(\sum _{\alpha \in \varnothing }r_{\alpha } = 0\) is needed to make the analog of part (a) true for sums.
- a.
- 8.The goal of this exercise is to describe an algorithm for deciding whether \(\overline{\mathbf{V}(I)\setminus \mathbf{V}(g)} = \mathbf{V}(I)\) when the field k is algebraically closed.
- a.
Prove that \(\overline{\mathbf{V}(I)\setminus \mathbf{V}(g)} = \mathbf{V}(I)\) is equivalent to \(I: g^{\infty }\subseteq \sqrt{I}\). Hint: Use the Nullstellensatz and Theorem 10 of §4. Also remember that \(I \subseteq I: g^{\infty }\).
- b.
Use Theorem 14 of §4 and the Radical Membership Algorithm from §2 to describe an algorithm for deciding whether \(I: g^{\infty }\subseteq \sqrt{I}\).
- a.
- 9.
Give a proof of the termination of FindW that uses the maximum principle stated in Proposition 6. Hint: Consider the set of all ideals in k[x, y] for which FindW does not terminate.
- 10.This exercise is concerned with the proof of Theorem 7.
- a.
Verify that \(V _{1} = V \cap \{ (a_{1},\ldots,a_{n}) \in k^{n}\mid (a_{l+1},\ldots,a_{n}) \in Z_{1}\}\) is an affine variety.
- b.
Verify that \(\pi _{l}(V ) = (W_{1}\setminus Z_{1}) \cup \pi _{l}(V _{1})\).
- a.
- 11.As in the text, let V = V(I) for \(I =\langle xy + z - 1,y^{2}z^{2}\rangle \subseteq \mathbb{C}[x,y,z]\). Show thatand that W = V(y, z) = { (0, 0)} satisfies \(\mathbf{V}(I_{1})\setminus W \subseteq \pi _{1}(V )\).$$\displaystyle{\mathbf{V}(I_{1}) = \mathbf{V}(z) \cup \mathbf{V}(y,z - 1) = \mathbf{V}(z) \cup \{ (0, 1)\}}$$
- 12.
Let \(V = \mathbf{V}(y - xz) \subseteq \mathbb{C}^{3}\). Theorem 7 tells us that \(\pi _{1}(V ) \subseteq \mathbb{C}^{2}\) is a constructible set. Find an explicit decomposition of π 1(V) of the form given by Theorem 7. Hint: Your answer will involve W 1, Z 1 and W 2.
- 13.
Proposition 6 is the maximum principle for ideals. The geometric analog is the minimum principle for varieties, which states that among any nonempty collection of varieties in k n , there is a variety in the collection which is minimal with respect to inclusion. More precisely, this means that if we are given varieties V α , \(\alpha \in \mathcal{A}\), where \(\mathcal{A}\) is a nonempty set, then there is some \(\alpha _{0} \in \mathcal{A}\) with the property that for all \(\beta \in \mathcal{A}\), \(V _{\beta } \subseteq V _{\alpha _{0}}\) implies \(V _{\beta } = V _{\alpha _{0}}\). Prove the minimum principle. Hint: Use Proposition 1 of §6.
- 14.
Apply the minimum principle of Exercise 13 to give a different proof of Theorem 7. Hint: Consider the collection of all varieties \(V \subseteq k^{n}\) for which π l (V) is not constructible. By the minimum principle, there is a variety V such that π l (V) is not constructible but π l (W) is constructible for every variety \(W \subsetneq V\). Show how the proof of Theorem 7 up to (2) can be used to obtain a contradiction and thereby prove the theorem.
8 Primary Decomposition of Ideals
In view of the decomposition theorem proved in §6 for radical ideals, it is natural to ask whether an arbitrary ideal I (not necessarily radical) can be represented as an intersection of simpler ideals. In this section, we will prove the Lasker-Noether decomposition theorem, which describes the structure of I in detail.
There is no hope of writing an arbitrary ideal I as an intersection of prime ideals (since an intersection of prime ideals is always radical). The next thing that suggests itself is to write I as an intersection of powers of prime ideals. This does not quite work either: consider the ideal \(I =\langle x,y^{2}\rangle\) in \(\mathbb{C}[x,y]\). Any prime ideal containing I must contain x and y and, hence, must equal \(\langle x,y\rangle\) (since \(\langle x,y\rangle\) is maximal). Thus, if I were to be an intersection of powers of prime ideals, it would have to be a power of \(\langle x,y\rangle\) (see Exercise 1 for the details).
The concept we need is a bit more subtle.
Definition 1.
An ideal I in k[x 1, …, x n ] is primary see ideal if fg ∈ I implies either f ∈ I or some power g m ∈ I for some m > 0.
It is easy to see that prime ideals are primary. Also, you can check that the ideal \(I =\langle x,y^{2}\rangle\) discussed above is primary (see Exercise 1).
Lemma 2.
If I is a primary ideal, then \(\sqrt{I}\) is prime and is the smallest prime ideal containing I.
Proof.
See Exercise 2. □
In view of this lemma, we make the following definition.
Definition 3.
If I is primary and \(\sqrt{I} = P\), then we say that I is P-primary.
We can now prove that every ideal is an intersection of primary ideals.
Theorem 4.
Every ideal \(I \subseteq k[x_{1},\ldots,x_{n}]\) can be written as a finite intersection of primary ideals.
Proof.
We first define an ideal I to be irreducible if \(I = I_{1} \cap I_{2}\) implies that I = I 1 or I = I 2. We claim that every ideal is an intersection of finitely many irreducible ideals. The argument is an “ideal” version of the proof of Theorem 2 from §6. One uses the ACC rather than the DCC—we leave the details as an exercise.
Next we claim that an irreducible ideal is primary. Note that this will prove the theorem. To see why the claim is true, suppose that I is irreducible and that fg ∈ I with \(f\notin I\). We need to prove that some power of g lies in I. Consider the saturation \(I:g^{\infty }\). By Proposition 9 of §4, we know that \(I:g^{\infty } = I:g^{N}\) once N is sufficiently large. We will leave it as an exercise to show that \((I +\langle g^{N}\rangle ) \cap (I +\langle \, f\rangle ) = I\). Since I is irreducible, it follows that \(I = I +\langle g^{N}\rangle\) or \(I = I +\langle \, f\rangle\). The latter cannot occur since \(f\notin I\), so that \(I = I +\langle g^{N}\rangle\). This proves that \(g^{N} \in I\). □
As in the case of varieties, we can define what it means for a decomposition to be minimal.
Definition 5.
A primary decomposition of an ideal I is an expression of I as an intersection of primary ideals: \(I =\bigcap _{ i=1}^{r}Q_{i}\). It is called minimal or irredundant if the \(\sqrt{Q_{i}}\) are all distinct and \(Q_{i}\not\supseteq \bigcap _{j\neq i}Q_{j}\).
To prove the existence of a minimal decomposition, we will need the following lemma, the proof of which we leave as an exercise.
Lemma 6.
If I, J are primary and \(\sqrt{I} = \sqrt{J}\) , then \(I \cap J\) is primary.
We can now prove the first part of the Lasker-Noether decomposition theorem.
Theorem 7 (Lasker-Noether).
Every ideal \(I \subseteq k[x_{1},\ldots,x_{n}]\) has a minimal primary decomposition.
Proof.
By Theorem 4, we know that there is a primary decomposition \(I =\bigcap _{ i=1}^{r}Q_{i}\). Suppose that Q i and Q j have the same radical for some \(i\neq j\). Then, by Lemma 6, \(Q = Q_{i} \cap Q_{j}\) is primary, so that in the decomposition of I, we can replace Q i and Q j by the single ideal Q. Continuing in this way, eventually all of the Q i ’s will have distinct radicals.
Next, suppose that some Q i contains \(\bigcap _{j\neq i}Q_{j}\). Then we can omit Q i , and I will be the intersection of the remaining Q j ’s for j ≠ i. Continuing in this way, we can reduce to the case where \(Q_{i}\not\supseteq \bigcap _{j\neq i}Q_{j}\) for all i. □
Lemma 8.
- (i)
If f ∈ I, then \(I: f =\langle 1\rangle\) .
- (ii)
If \(f\notin I\) , then \(I: f\) is P-primary.
- (iii)
If \(f\notin P\) , then \(I: f = I\) .
Proof.
See Exercise 7. □
The second part of the Lasker-Noether theorem tells us that the radicals of the ideals in a minimal decomposition are uniquely determined.
Theorem 9 (Lasker-Noether).
Let \(I =\bigcap _{ i=1}^{r}Q_{i}\) be a minimal primary decomposition of a proper ideal \(I \subseteq k[x_{1},\ldots,x_{n}]\) and let \(P_{i} = \sqrt{Q_{i}}\) . Then the P i are precisely the proper prime ideals occurring in the set \(\{\sqrt{I: f}\mid f \in k[x_{1},\ldots,x_{n}]\}\) .
Remark.
In particular, the P i are independent of the primary decomposition of I. We say that the P i belong to I.
Proof.
The proof is very similar to the proof of Theorem 7 from §6. The details are covered in Exercises 8–10. □
In §6, we proved a decomposition theorem for radical ideals over an algebraically closed field. Using the Lasker–Noether theorems, we can now show that these results hold over an arbitrary field k.
Corollary 10.
Let \(I =\bigcap _{ i=1}^{r}Q_{i}\) be a minimal primary decomposition of a proper radical ideal \(I \subseteq k[x_{1},\ldots,x_{n}]\) . Then the Q i are prime and are precisely the proper prime ideals occurring in the set \(\{I: f\mid f \in k[x_{1},\ldots,x_{n}]\}\) .
Proof.
See Exercise 12. □
The two Lasker–Noether theorems do not tell the full story of a minimal primary decomposition \(I =\bigcap _{ i=1}^{r}Q_{i}\). For example, if P i is minimal in the sense that no P j is strictly contained in P i , then one can show that Q i is uniquely determined. Thus there is a uniqueness theorem for some of the Q i ’s [see Chapter 4 of ATIYAH and MACDONALD (1969) for the details]. We should also mention that the conclusion of Theorem 9 can be strengthened: one can show that the P i ’s are precisely the proper prime ideals in the set \(\{I: f\mid f \in k[x_{1},\ldots,x_{n}]\}\) [see Chapter 7 of ATIYAH and MACDONALD (1969)].
-
(Primary Decomposition) Is there an algorithm for finding bases for the primary ideals Q i in a minimal primary decomposition of I?
-
(Associated Primes) Can we find bases for the associated primes \(P_{i} = \sqrt{Q_{i}}\)?
If you look in the references given at the end of §6, you will see that the answer to these questions is yes. Primary decomposition has been implemented in CoCoA, Macaulay2, Singular, and Maple.
Exercises for § 9
- 1.Consider the ideal \(I =\langle x,y^{2}\rangle \subseteq \mathbb{C}[x,y]\).
- a.
Prove that \(\langle x,y\rangle ^{2} \subsetneq I \subsetneq \langle x,y\rangle\), and conclude that I is not a prime power.
- b.
Prove that I is primary.
- a.
- 2.
Prove Lemma 2.
- 3.This exercise is concerned with the proof of Theorem 4. Let \(I \subseteq k[x_{1},\ldots,x_{n}]\) be an ideal.
- a.
Using the hints given in the text, prove that I is a finite intersection of irreducible ideals.
- b.
Suppose that fg ∈ I and \(I: g^{\infty } = I: g^{N}\). Then prove that \((I +\langle g^{N}\rangle ) \cap (I +\langle \, f\rangle ) = I\). Hint: Elements of \((I +\langle g^{N}\rangle ) \cap (I +\langle \, f\rangle )\) can be written as a + bg N = c + df, where a, c ∈ I and b, d ∈ k[x 1, …, x n ]. Now multiply through by g and use \(I: g^{N} = I: g^{N+1}\).
- a.
- 4.In the proof of Theorem 4, we showed that every irreducible ideal is primary. Surprisingly, the converse is false. Let I be the ideal \(\langle x^{2},xy,y^{2}\rangle \subseteq k[x,y]\).
- a.
Show that I is primary.
- b.
Show that \(I =\langle x^{2},y\rangle \cap \langle x,y^{2}\rangle\) and conclude that I is not irreducible.
- a.
- 5.
- 6.Let I be the ideal \(\langle x^{2},xy\rangle \subseteq \mathbb{Q}[x,y]\).
- a.Prove thatare two distinct minimal primary decompositions of I.$$\displaystyle{I =\langle x\rangle \ \cap \langle x^{2},xy,y^{2}\rangle =\langle x\rangle \ \cap \langle x^{2},y\rangle }$$
- b.Prove that for any \(a \in \mathbb{Q}\),is a minimal primary decomposition of I. Thus I has infinitely many distinct minimal primary decompositions.$$\displaystyle{I =\langle x\rangle \ \cap \langle x^{2},y - ax\rangle }$$
- a.
- 7.
Prove Lemma 8.
- 8.
Prove that an ideal is proper if and only if its radical is.
- 9.
Use Exercise 8 to show that the primes belonging to a proper ideal are also proper.
- 10.
Prove Theorem 9. Hint: Adapt the proof of Theorem 7 from §6. The extra ingredient is that you will need to take radicals. Proposition 16 from §3 will be useful. You will also need to use Exercise 9 and Lemma 8.
- 11.
- 12.
9 Summary
The table on the next page summarizes the results of this chapter. In the table, it is supposed that all ideals are radical and that the field is algebraically closed.
ALGEBRA | GEOMETRY | ||
|---|---|---|---|
radical ideals | varieties | ||
I | \(\longrightarrow\) | V(I) | |
I(V) | \(\longleftarrow\) | V | |
addition of ideals | intersection of varieties | ||
I + J | \(\longrightarrow\) | \(\mathbf{V}(I) \cap \mathbf{V}(J)\) | |
\(\sqrt{\mathbf{I} (V ) + \mathbf{I} (W)}\) | \(\longleftarrow\) | \(V \cap W\) | |
product of ideals | union of varieties | ||
IJ | \(\longrightarrow\) | \(\mathbf{V}(I) \cup \mathbf{V}(J)\) | |
\(\sqrt{\mathbf{I} (V )\mathbf{I} (W)}\) | \(\longleftarrow\) | \(V \cup W\) | |
intersection of ideals | union of varieties | ||
\(I \cap J\) | \(\longrightarrow\) | \(\mathbf{V}(I) \cup \mathbf{V}(J)\) | |
\(\mathbf{I}(V ) \cap \mathbf{I}(W)\) | \(\longleftarrow\) | \(V \cup W\) | |
ideal quotients | difference of varieties | ||
\(I:J\) | \(\longrightarrow\) | \(\overline{\mathbf{V}(I)\setminus \mathbf{V}(J)}\) | |
\(\mathbf{I}(V ):\mathbf{I}(W)\) | \(\longleftarrow\) | \(\overline{V \setminus W}\) | |
elimination of variables | projection of varieties | ||
\(I \cap k[x_{l+1},\ldots,x_{n}]\) | \(\longleftrightarrow\) | \(\overline{\pi _{l}(\mathbf{V}(I))}\) | |
prime ideal | \(\longleftrightarrow\) | irreducible variety | |
minimal decomposition | minimal decomposition | ||
\(I = P_{1} \cap \cdots \cap P_{m}\) | \(\longrightarrow\) | \(\mathbf{V}(I) = \mathbf{V}(P_{1}) \cup \cdots \cup \mathbf{V}(P_{m})\) | |
\(\mathbf{I}(V ) = \mathbf{I}(V _{1}) \cap \cdots \cap \mathbf{I}(V _{m})\) | \(\longleftarrow\) | \(V = V _{1} \cup \cdots \cup V _{m}\) | |
maximal ideal | \(\longleftrightarrow\) | point of affine space | |
ascending chain condition | \(\longleftrightarrow\) | descending chain condition |
References
- W. Adams, P. Loustaunau, An Introduction to Gröbner Bases. Graduate Studies in Mathematics, vol. 3 (AMS, Providence, 1994) Google Scholar
- M.F. Atiyah, I.G. MacDonald, Introduction to Commutative Algebra (Addison-Wesley, Reading, MA, 1969) Google Scholar
- T. Becker, V. Weispfenning, Gröbner Bases (Springer, New York-Berlin-Heidelberg, 1993) Google Scholar
- J.H. Davenport, Y. Siret, E. Tournier, Computer Algebra, 2nd edn. (Academic, New York, 1993) Google Scholar
- D. Eisenbud, C. Huneke, W. Vasconcelos, Direct methods for primary decomposition. Invent. Math. 110, 207–235 (1992) Google Scholar
- P. Gianni, B. Trager, G. Zacharias, Gröbner bases and primary decomposition of polynomial ideals, in Computational Aspects of Commutative Algebra, ed. by L. Robbiano (Academic Press, New York, 1988), pp. 15–33 Google Scholar
- L. Glebsky, A proof of Hilbert’s Nullstellensatz Based on Groebner bases (2012). arXiv:1204.3128 [math.AC] Google Scholar
- G.-M. Greuel, G. Pfister, A Singular Introduction to Commutative Algebra, 2nd edn. (Springer, New York, 2008) Google Scholar
- R. Hartshorne, Algebraic Geometry (Springer, New York, 1977) Google Scholar
- G. Hermann, Die Frage der endlich vielen schritte in der theorie der polynomideale, Math. Ann. 95, 736–788 (1926) Google Scholar
- T. Krick, A. Logar, An Algorithm for the Computation of the Radical of an Ideal in the Ring of Polynomials, in Applied Algebra, Algebraic Algorithms and Error-Correcting Codes, ed. by H.F. Mattson, T. Mora, T.R.N. Rao. Lecture Notes in Computer Science, vol. 539 (Springer, Berlin, 1991), pp. 195–205 Google Scholar
- R. Mines, F. Richman, W. Ruitenburg, A Course in Constructive Algebra (Springer, New York-Berlin-Heidelberg, 1988) Google Scholar
- P. Schauenburg, A Gröbner-based treatment of elimination theory for affine varieties. J. Symb. Comput. 42, 859–870 (2007) Google Scholar
- A. Seidenberg, Constructions in algebra. Trans. Am. Math. Soc. 197, 273–313 (1974) Google Scholar
- A. Seidenberg, On the Lasker–Noether decomposition theorem. Am. J. Math. 106, 611–638 (1984) Google Scholar
- P. Ullrich, Closed-form formulas for projecting constructible sets in the theory of algebraically closed fields. ACM Commun. Comput. Algebra 40, 45–48 (2006)Google Scholar

