1 Introduction

Throughout this paper, we will write x for an n-tuple \((x_1,x_2,\ldots ,x_n)\) of variables, where n is a positive integer. We write \({\mathcal J}F\) for the Jacobian matrix of a polynomial map \(F = (F_1,F_2,\ldots ,F_m)\) with respect to x, where m is another positive integer, i.e.

$$\begin{aligned} {\mathcal J}F = \left( \begin{array}{cccc} \frac{\partial }{ \partial x_1 } F_1 &{}\quad \frac{\partial }{ \partial x_2 } F_1 &{}\quad \cdots &{}\quad \frac{\partial }{ \partial x_n } F_1 \\ \frac{\partial }{ \partial x_1 } F_2 &{}\quad \frac{\partial }{ \partial x_2 } F_2 &{}\quad \cdots &{}\quad \frac{\partial }{ \partial x_n } F_2 \\ \vdots &{}\quad \vdots &{}\quad &{}\quad \vdots \\ \frac{\partial }{ \partial x_1 } F_m &{}\quad \frac{\partial }{ \partial x_2 } F_m &{}\quad \cdots &{}\quad \frac{\partial }{ \partial x_n } F_m \end{array}\right) \end{aligned}$$

We write \({\mathcal H}f\) for the Hessian matrix of a polynomial f with respect to x, i.e.

$$\begin{aligned} {\mathcal H}f = \left( \begin{array}{cccc} \frac{\partial ^{2}}{ \partial x_1 ^{2}} f &{}\quad \frac{\partial }{ \partial x_2 } \frac{\partial }{ \partial x_1 } f &{}\quad \cdots &{}\quad \frac{\partial }{ \partial x_n } \frac{\partial }{ \partial x_1 } f \\ \frac{\partial }{ \partial x_1 } \frac{\partial }{ \partial x_2 } f &{}\quad \frac{\partial ^{2}}{ \partial x_2 ^{2}} f &{}\quad \cdots &{}\quad \frac{\partial }{ \partial x_n } \frac{\partial }{ \partial x_2 } f \\ \vdots &{}\quad \vdots &{}\quad \ddots &{}\quad \vdots \\ \frac{\partial }{ \partial x_1 } \frac{\partial }{ \partial x_n } f &{}\quad \frac{\partial }{ \partial x_2 } \frac{\partial }{ \partial x_n } f &{}\quad \cdots &{}\quad \frac{\partial ^{2}}{ \partial x_n ^{2}} f \end{array}\right) \end{aligned}$$

We see a polynomial f as a polynomial with only one component, so

$$\begin{aligned} {\mathcal J}f = \left( \frac{\partial }{ \partial x_1 } f ~ \frac{\partial }{ \partial x_2 } f ~ \cdots ~ \frac{\partial }{ \partial x_n } \right) \end{aligned}$$

and write \(\nabla f = ({\mathcal J}f)^\mathrm{t}\). Here, and in the rest of the article, \((\cdots )^\mathrm{t}\) stands for the transpose matrix. So

$$\begin{aligned} {\mathcal H}f = {\mathcal J}(\nabla f) \end{aligned}$$

Just like with x, we will write y for another n-tuple \((y_1,y_2,\ldots ,y_n)\) of variables. But unlike x and y, t will be just a single variable.

Definition 1.1

Let \(F = x + H\) be a polynomial map from \({\mathbb C}^n\) to \({\mathbb C}^n\). Then we call F a quasi-translation if \(2x - F = x - H\) is the inverse of \(F = x + H\).

The condition that \(x - H\) is the inverse of \(x + H\) is automatically fulfilled if \(\deg H = 0\), in which case \(x + H\) is a regular translation. So a quasi-translation is a polynomial map which is characterized by a property of a regular translation.

Below are some examples of quasi-translations in dimension \(n = 4\):

$$\begin{aligned} x&+ (x_2^2 x_3 - 3x_3^3 x_4 - 5,0,0,0) \\ x&+ (1,x_4,x_4^2,0) \\ x&+ (x_3^2 -3x_3^3 x_4 - 5, x_3 + 7 x_4^7, 0, 0) \\ x&+ \left( b(a x_1 - b x_2),a(a x_1 - b x_2),\right. \\&\quad \quad \!\!\left. b(a x_3 - b x_4),a(a x_3 - b x_4) \right) \quad \text{ with } a,b \in {\mathbb C}\end{aligned}$$

In the next section, we will show that \(x + H\) is a quasi-translation, if and only if \({\mathcal J}H \cdot H = 0\). This is equivalent to that for the derivation \(D = H_1 \frac{\partial }{ \partial x_1 } + H_2 \frac{\partial }{ \partial x_2 } + \cdots + H_n \frac{\partial }{ \partial x_n }\), \(D^2 x_i = 0\) for all i, because \(D^2 x_i = D H_i = {\mathcal J}H_i \cdot H\).

Hence quasi-translations correspond to a special kind of locally nilpotent derivations. Furthermore, invariants of the quasi-translation \(x + H\) are just kernel elements of D. Paul Gordan and Max Nöther call these kernel elements ‘Functionen \(\Phi \)’ in Gordan and Nöther (1876).

In addition, we can write \(\exp (D)\) and \(\exp (tD)\) for the automorphisms corresponding to the maps \(x + H\) and \(x + tH\) respectively. But in order to make the article more readable for readers that are not familiar with derivations, we will omit the terminology of derivations further in this article.

Gordan and Nöther (1876) studied (homogeneous) quasi-translations to obtain results about (homogeneous) polynomials h with \(\det {\mathcal H}h = 0\). One such a result is the classification of homogeneous polynomials in 5 indeterminates for which the Hessian determinant is zero. This classification has been reproved in Franchetta (1954) and Garbagnati and Repetto (2009), but only for the case where h is an irreducible polynomial. In Russo (2016, Ch. 7), the proof of Garbagnati and Repetto (2009) is extended to the case where h is a square-free polynomial. With an easy argument, which the reader may find, one can extend these results to the case where h is a power of such a polynomial. But then, you still do not have all polynomials h.

However, Russo (2016), told me that by way of de Bondt (2008, Th. 2.2) one can reduce the general case to the case where h is square-free. This is indeed true, because of the following.

Proposition 1.2

Let \(h \in {\mathbb C}[x]\) and let \(\tilde{h}\) be the square-free part of h.

  1. (i)

    If \(\det {\mathcal H}h = 0\), then \(\det {\mathcal H}\tilde{h} = 0\).

  2. (ii)

    Suppose that \(a_1,a_2,\ldots ,a_{n-2} \in {\mathbb C}[x_1,x_2]\) are relatively prime. Let

    $$\begin{aligned} A := {\mathbb C}[x_1,x_2,a_1(x_1,x_2)x_3+a_2(x_1,x_2)x_4+\cdots +a_{n-2}(x_1,x_2)x_n] \end{aligned}$$

    If \(\tilde{h} \in A\), then \(h \in A\).

Proof

  1. (i)

    This is a special case of de Bondt (2008, Th. 2.2).

  2. (ii)

    Suppose that \(\tilde{h} \in A\), and let f be an arbitrary factor of h over \({\mathbb C}[x]\). It suffices to show that \(f \in A\).

    Over \({\mathbb C}(x_1,x_2)\), h is a polynomial in the linear form \(a_1(x_1,x_2)x_3+a_2(x_1,x_2)\cdot x_4+\cdots +a_{n-2}(x_1,x_2)x_n\). Just like \({\mathbb C}(x_1,x_2)[x_3]\),

    $$\begin{aligned} {\mathbb C}(x_1,x_2)[a_1(x_1,x_2)x_3+a_2(x_1,x_2)x_4+\cdots +a_{n-2}(x_1,x_2)x_n] \end{aligned}$$

    is factorially closed in \({\mathbb C}(x_1,x_2)[x_3,x_4,\ldots ,x_n]\). Consequently, f is a polynomial over \({\mathbb C}(x_1,x_2)\) in the linear form \(a_1(x_1,x_2)x_3+a_2(x_1,x_2)x_4+\cdots +a_{n-2}(x_1,x_2)x_n\) as well.

    Take \(d \ge 0\) arbitrary, and let \(\tilde{f}\) be the part of f, which has degree d with respect to \(x_3,x_4,\ldots ,x_n\). Then \(\tilde{f} \in {\mathbb C}[x]\), and over \({\mathbb C}(x_1,x_2)\), \(\tilde{f}\) is a monomial in the linear form \(a_1(x_1,x_2)x_3+a_2(x_1,x_2)x_4+\cdots +a_{n-2}(x_1,x_2)x_n\). From Gauss’s Lemma, it follows that \(\tilde{f} \in A\), As d was arbitrary, we can conclude that \(f \in A\). \(\square \)

The connection between quasi-translations and polynomial Hessians with determinant zero, which comes from Gordan and Nöther (1876), is given at the beginning of Sect. 4. This connection is used in Garbagnati and Repetto (2009) and Russo (2016, Ch. 7) as well, and appears as Garbagnati and Repetto (2009, p. 33) and Russo (2016, Lem. 7.3.7) respectively. Garbagnati and Repetto (2009) and Russo (2016, Ch. 7) contain classifications in dimensions less than 5 as well, but with the same limitations as above on the factorization of h. These limitations are not present in Lossen (2004), which follows the approach of Gordan and Nöther (1876) in proving the classifications in dimensions less than 5.

In Watanabe (2014), it is claimed that \({\text {rk}}{\mathcal J}H \ne 3\) if \(x + H\) is a quasi-translation in dimension \(n = 5\), but this is not true. Hence the proof in Watanabe (2014) of the classification of homogeneous polynomials in 5 indeterminates, for which the Hessian determinant is zero, has a gap. The paper (Franchetta 1954) has an error and hence a gap on the same point. This gap can be fixed by proving that \({\text {rk}}{\mathcal J}H \ne 3\) indeed, if \(x + H\) is associated to a polynomial for which the Hessian determinant is zero, which can be done by way of the results on linear invariants of quasi-translations, as given in Gordan and Nöther (1876) and this paper: see Remark 4.7 at the end of Sect. 4.

Garbagnati and Repetto (2009) and Russo (2016, Ch. 7) on one hand, and de Bondt (2009, Th. 5.3.7) on the other hand, treat the case where \({\text {rk}}{\mathcal J}H = 3\) incorrectly as well. But both incorrect treatments are only on subcases which do not overlap, so Russo (2016, Ch. 7) and de Bondt (2009, Th. 5.3.7) fix each other’s errors. The error in Garbagnati and Repetto (2009) and Russo (2016, Ch. 7) can be repaired by way of Theorem 4.6, which comes from Gordan and Nöther (1876). The error in de Bondt (2009, Th. 5.3.7) can be repaired by way of Lemma 4.4, which gives a simpler argument than that in Gordan and Nöther (1876).

It is easy to show that for any homogeneous polynomial map H such that \({\text {rk}}{\mathcal J}H = 1\), \(x + H\) has \(n-1\) independent linear invariants. Gordan and Nöther (1876) proved that any homogeneous quasi-translation \(x + H\) such that \({\text {rk}}{\mathcal J}H = 2\) has at least 2 independent linear invariants. In their study of homogeneous quasi-translations \(x + H\) in dimension \(n = 5\) with \({\text {rk}}{\mathcal J}H = 3\), Gordan and Nöther (1876) distinguished two cases, namely “Fall a)” and “Fall b)”, of which “Fall a)” had two subcases, which we indicate by (a1) and (a2).

The quasi-translations of subcase (a1) in Gordan and Nöther (1876) are the homogeneous quasi-translations \(x + H\) in dimension 5 with Jacobian rank three, for which the Zariski closure of the image of H is a 3-dimensional linear subspace of \({\mathbb C}^5\). The quasi-translations of case (b) in Gordan and Nöther (1876) are the homogeneous quasi-translations in dimension 5 with Jacobian rank three, which are linearly conjugate to another such quasi-translation \(x + H\), for which \(H_5\) is algebraically independent over \({\mathbb C}\) of \(H_1, H_2, H_3, H_4\), but for which the Zariski closure of the image of H is not a 3-dimensional linear subspace of \({\mathbb C}^5\).

The quasi-translations of subcase (a2) in Gordan and Nöther (1876) are categorized by a somewhat technical property, which is the existence of \(p^{(1)}\) and \(p^{(2)}\) as in (iii) of Theorem 3.8. Let us just say for now that they are the homogeneous quasi-translations in dimension 5 with Jacobian rank three, which do not belong to case (b) or subcase (a1) in Gordan and Nöther (1876). As a consequence of Theorem 3.8, we deduce in Corollary 3.10 that quasi-translations of case (a2) in Gordan and Nöther (1876) have at least one linear invariant, by showing that the linear span of the image of H is 4-dimensional. Having reasoned about these three cases, one can wonder whether they actually exist.

Example 1.3

The following three H’s are chosen in such a way, that \(x + H\) with \(n = 5\) is a quasi-translation which belongs to the above-described case (a1), (a2), and (b), respectively.

  1. (a1)

    \(H = (x_4^2,x_4x_5,x_1x_5-x_2x_4,0,0)\),

  2. (a2)

    \(H = (x_5^2(ax_1-x_5^2x_2),a(ax_1-x_5^2x_2),x_5^2(ax_3-x_5^2x_4), a(ax_3-x_5^2x_4),0)\) with \(a = x_1x_4-x_2x_3\),

  3. (b)

    \(H = (x_5^5,bx_5^3,b^2x_5,-b^2x_1+2bx_2x_5^2-x_3x_5^4,0)\) with \(b = x_1x_3-x_2^2+x_4x_5\).

The quasi-translations for (a1) and (a2) were found by using techniques of de Bondt (2006, §2). The quasi-translations for (b) was found by applying Propositions 2.4 and 2.5, on the quasi-translation \(x + H\) with \(n = 4\) and \(H = (1,x_4,x_4^2,0)\).

An unsolved question is whether a homogeneous quasi-translation in dimension 5 always has a linear invariant or not. We reprove the following results obtained in Gordan and Nöther (1876) in modern language: a homogeneous quasi-translation in dimension 5 without a linear invariant can only belong to case (b) in Gordan and Nöther (1876). Furthermore, we give a somewhat less computational proof of the result in Liu (2011) that a homogeneous quasi-translation in dimension 5 without a linear invariant must have degree 15 at least.

In dimension 6 and up, homogeneous quasi-translations do not need to have linear invariants, see (de Bondt 2006 Th. 2.1). If we substitute \(x_5 = 1\) in the quasi-translations of cases (a2) and (b) in Example 1.3 and remove the last component, we get non-homogeneous quasi-translations in dimension 4 without linear invariants.

The rest of the paper is organized as follows. In the next section, we show some basic concepts about quasi-translations.

In Sect. 3, we prove some geometric results about homogeneous quasi-translations \(x + H\) for which \({\text {rk}}{\mathcal J}H \le (n+1) / 2\). As a consequence, we deduce that a homogeneous quasi-translation in dimension 5 without linear invariants can only belong to case (b) in Gordan and Nöther (1876).

In Sect. 4, we apply the result that a homogeneous quasi-translation in dimension 5 without a linear invariant can only belong to case (b) in Gordan and Nöther (1876), to classify all homogeneous polynomials in 5 indeterminates for which the Hessian determinant vanishes.

In Sect. 5, we study homogeneous quasi-translations in dimension 5 that belong to case (b) in Gordan and Nöther (1876), with the purpose of getting properties of possible homogeneous quasi-translations in dimension 5 without linear invariants. One of these properties is that the degree of such a quasi-translation is at least 15.

In Sect. 6, we prove some geometric results about quasi-translations which gives us the following result about quasi-translations which belong to case (b) in Gordan and Nöther (1876): the Zariski closure of the image of H is an irreducible component of V(H), which contains a linear 1-dimensional subspace L of \({\mathbb C}^5\), such that every other irreducible component of V(H) is a 3-dimensional linear subspace of \({\mathbb C}^5\) which contains L. Here, V(H) is the set of common zeroes of \(H_1, H_2, \ldots , H_n\).

2 Some basics about quasi-translations

In Proposition 2.2 below, we will show that quasi-translations are also characterized by \(H(x+tH) = H\) and by that \({\mathcal J}H \cdot H\) is the zero vector. We need the following lemma to prove Proposition 2.2.

Lemma 2.1

Assume that \(x + H\) is a polynomial map and \(f \in {\mathbb C}[x]\). Then

$$\begin{aligned} f(x+tH) = f(x) \end{aligned}$$
(2.1)

in case one of the following assumptions is satisfied.

  1. (1)

    \(x + H\) is a quasi-translation and \(f(x + H) = f(x)\),

  2. (2)

    \({\mathcal J}H \cdot H = (0^1,0^2,\ldots ,0^n)\) and \({\mathcal J}f \cdot H = 0\).

Proof

  1. (1)

    Since \((x - H) \circ (x + H) = x\), we see that

    $$\begin{aligned} (x + mH) \circ (x + H)&= \left( (m+1)x - m(x - H) \right) \circ (x + H) \\&= (m+1)(x + H) - mx = x + (m+1)H \end{aligned}$$

    By induction on m, \(x + mH\) is equal to the composition of m copies of \(x + H\) for all \(m \in {\mathbb N}\). Using \(f(x + H) = f(x)\) m times, we obtain

    $$\begin{aligned} f(x + mH) = f \left( (x+H)^{\circ m} \right) = f \left( (x+H)^{\circ (m-1)} \right) = \cdots = f(x) \end{aligned}$$

    for all \(m \in {\mathbb N}\). This is only possible if (2.1) holds.

  2. (2)

    By the chain rule and \({\mathcal J}H\cdot H = (0^1,0^2,\ldots ,0^n)\), we get

    $$\begin{aligned} {\mathcal J}f(x + tH) \cdot H&= ({\mathcal J}f)|_{x=x+tH} \cdot (I_n + t {\mathcal J}H) \cdot H \nonumber \\&= ({\mathcal J}f)|_{x=x+tH} \cdot H = \frac{\partial }{ \partial t } f(x + tH) \end{aligned}$$

    where \(I_n\) is the unit matrix of size n. Since \({\mathcal J}f \cdot H = 0\), it follows from the above that

    $$\begin{aligned} {\mathcal J}\left( f(x + tH) - f(x) \right) \cdot H = \frac{\partial }{ \partial t } f(x + tH) \end{aligned}$$
    (2.2)

    Suppose that t divides the right hand side of (2.2) exactly \(r < \infty \) times. Then t divides \(f(x+tH) - f(x)\) more than r times. Hence t divides the left hand side of (2.2) more than r times as well, which is a contradiction. So both sides of (2.2) are zero. Since the right hand side of (2.2) is zero, we get (2.1). \(\square \)

Proposition 2.2

Let \(H: {\mathbb C}^n \rightarrow {\mathbb C}^n\) be a polynomial map. Then the following properties are equivalent:

  1. (1)

    \(x + H\) is a quasi-translation,

  2. (2)

    \(H(x + tH) = H\) (where t is a variable),

  3. (3)

    \({\mathcal J}H \cdot H = (0^1,0^2,\ldots ,0^n)\).

Furthermore, if any of (1), (2) and (3) is satisfied, then

$$\begin{aligned} f(x + H) = f(x) \Longleftrightarrow f(x + tH) = f(x) \Longleftrightarrow {\mathcal J}f \cdot H = 0 \end{aligned}$$
(2.3)

for all \(f \in {\mathbb C}[x]\), and

$$\begin{aligned} ({\mathcal J}H)|_{x=x-t{\mathcal J}H} = ({\mathcal J}H) + t ({\mathcal J}H)^2 + t^2 ({\mathcal J}H)^3 + \cdots \end{aligned}$$
(2.4)

Proof

The middle hand side of (2.3) gives the left hand side by substituting \(t = 1\) and the right hand side by taking the coefficient of \(t^1\). Lemma 2.1 gives the converse implications by way of (1) and (3). Hence (2.3) follows as soon as we have the equivalence of (1), (2) and (3).

By taking the Jacobian of (2), we get \(({\mathcal J}H)|_{x=x+tH} \cdot (I_n + t {\mathcal J}H) = {\mathcal J}H\), which gives (2.4) after substituting \(t = -t\). Therefore, it remains to show that (1), (2) and (3) are equivalent.

(1) \({{\Rightarrow }}\) (2):

Assume (1). Since \(x = (x - H) \circ (x + H) = x + H - H(x+H)\), we see that \(H(x + H) = H\), and (2) follows by taking \(f = H_i\) for each i in (1) of Lemma 2.1.

(2) \({{\Rightarrow }}\) (1):

Assume (2). Then

$$\begin{aligned} (x - tH) \circ (x + tH)&= (x + tH) - tH(x + tH) \nonumber \\&= x + tH - tH = x \end{aligned}$$
(2.5)

which gives (1) after substituting \(t = 1\).

(2) \({{\Rightarrow }}\) (3):

Assume (2). By taking the coefficient of \(t^1\) of (2), we get (3).

(3) \({{\Rightarrow }}\) (2):

Assume (3). By taking \(f = H_i\) in (2) of Lemma 2.1, we get (2). \(\square \)

Proposition 2.3 below gives a tool to obtain quasi-translations \(x + H\) over \({\mathbb C}\) for which \(\gcd \{H_1,H_2,\ldots ,H_n\} = 1\) from arbitrary quasi-translations \(x + H\) over \({\mathbb C}\).

Proposition 2.3

Assume \(x + gH\) is a quasi-translation over \({\mathbb C}\), where \(g \in {\mathbb C}[x]\) is nonzero. Then \(x + H\) is a quasi-translation over \({\mathbb C}\) as well. Furthermore, the invariants of \(x + H\) and \(x + gH\) are the same. If additionally H is homogeneous of positive degree, then \({\text {rk}}{\mathcal J}gH = {\text {rk}}{\mathcal J}H\).

Proof

By (1) \(\Rightarrow \) (2) of Proposition 2.2, we see that \(g(x+tgH) \cdot H_i(x+tgH) = g \cdot H_i\). We can substitute \(t = g^{-1}t\) in it, to obtain that

$$\begin{aligned} \deg _t H_i(x+tH)&\le \deg _t g(x+tH) + \deg _t H_i(x+tH) \\&= \deg _t (gH_i)(x+tH) \le 0 \end{aligned}$$

for each i, which is exactly \(H(x+tH) = H\). Hence \(x + H\) is a quasi-translation on account of (2) \(\Rightarrow \) (1) of Proposition 2.2.

Assume f is an invariant of \(x + H\). Then \(f(x + tH) = f(x)\) on account of (2.3), and by substituting \(t = g\) we see that f is an invariant of \(x + gH\). The converse follows in a similar manner by substituting \(t = g^{-1}\).

Suppose that H is homogeneous of positive degree. From Proposition 1.2.9 of either van den Essen (2000) or de Bondt (2009), we deduce that in order to prove that \({\text {rk}}{\mathcal J}gH = {\text {rk}}{\mathcal J}H\), it suffices to show that \({\text {trdeg}}_{{\mathbb C}} {\mathbb C}(gH) = {\text {trdeg}}_{{\mathbb C}} {\mathbb C}(H)\). For that purpose, we show that for any \(R \in {\mathbb C}[y]\), both R(gH) and R(H) are zero if one of them is.

Suppose that either \(R(gH) = 0\) or \(R(H) = 0\) for some \(R \in {\mathbb C}[y]\), say of degree r. Let \(\bar{R}\) be the leading homogeneous part of R. If \(R(H) = 0\), then \(\bar{R}(H) = 0\) because H is homogeneous of positive degree. If \(R(gH) =\!\! 0\), then \(\deg \bar{R}(gH) < r \deg gH = \deg g^r + r \deg H\), so \(\deg \bar{R}(H) < r \deg H\), which is only possible if \(\bar{R}(H) = 0\). So \(\bar{R}(gH) = \bar{R}(H) = 0\) in any case. Hence either \((R-\bar{R})(gH) = 0\) or \((R-\bar{R})(gH) = 0\). By induction to the number of homogeneous parts of R, it follows that \(R(gH) = R(H) = 0\) indeed. \(\square \)

Proposition 2.4 gives a criterion about preservation of the quasi-translation property with respect to conjugation with an invertible polynomial map.

Proposition 2.4

Assume \(x + H\) is a quasi-translation in dimension n over \({\mathbb C}\), and F is an invertible polynomial map in dimension n over \({\mathbb C}\) with inverse G. Then

$$\begin{aligned} G \circ (x + H) \circ F \end{aligned}$$

is a quasi-translation as well, if and only if \(\deg _t G_i(x+tH) \le 1\) for all i. In particular, if T is an invertible matrix of size n over \({\mathbb C}\), we have that

$$\begin{aligned} x + T^{-1} H(Tx) = T^{-1} (Tx + H(Tx)) = T^{-1}x \circ (x + H) \circ Tx \end{aligned}$$

is a quasi-translation as well.

Proof

Assume first that \(\deg _t G_i(x+tH) \le 1\) for all i. Then we can write

$$\begin{aligned} G(x+tH) = G^{(0)} + tG^{(1)} \end{aligned}$$

Notice that \(G^{(0)} = G(x+tH)|_{t=0} = G\). Hence

$$\begin{aligned} G \circ (x+tH) \circ F = G^{(0)}(F) + t G^{(1)}(F) = G(F) + t G^{(1)}(F) = x + t G^{(1)}(F) \end{aligned}$$

By substituting \(t = 1\) on both sides, we obtain that \(G \circ (x + H) \circ F = x + G^{(1)}(F)\) and substituting \(t = -1\) tells us that its inverse \(G \circ (x - H) \circ F\) is equal to \(x - G^{(1)}(F)\). Thus \(G \circ (x + H) \circ F\) is a quasi-translation indeed.

Assume next that \(G \circ (x + H) \circ F\) is a quasi-translation \(x + \tilde{H}\). Then \(x - \tilde{H}\) is the inverse of \(G \circ (x + H) \circ F\), which is \(G \circ (x - H) \circ F\). Hence

$$\begin{aligned} \tilde{H} = \left( G \circ (x + H) \circ F \right) - x = x - \left( G \circ (x - H) \circ F \right) \end{aligned}$$

Substituting \(x = G(x + mH)\) in the above gives

$$\begin{aligned}&G \left( x + mH + H(x + mH) \right) - G(x + mH) = G(x + mH)\\&\quad -\, G \left( x + mH - H(x + mH) \right) \end{aligned}$$

Since \(H(x + mH) = H\) on account of (1) \(\Rightarrow \) (2) of Proposition 2.2, we obtain

$$\begin{aligned} G(x + (m+1)H) - G(x + mH) = G(x + mH) - G(x + (m-1)H) \end{aligned}$$

By induction on m, we get \(G(x + (m+1)H) - G(x + mH) = G(x + H) - G(x)\) for all \(m \in {\mathbb N}\), whence

$$\begin{aligned} G(x + \tilde{m}H) - G(x)= & {} \sum _{m=0}^{\tilde{m}-1} G(x + (m+1)H) - G(x + mH) \\= & {} \tilde{m} (G(x + H) - G(x)) \end{aligned}$$

for all \(\tilde{m} \in {\mathbb N}\). This is only possible if \(G(x + t H) - G(x) = t (G(x + H) - G(x))\). Hence \(\deg _t G(x + t H) \le 1\), as desired.\(\square \)

Proposition 2.5 gives a tool to obtain homogeneous quasi-translations over \({\mathbb C}\) from arbitrary quasi-translations \(x + H\) over \({\mathbb C}\). Hence we can obtain results about arbitrary quasi-translations by studying homogeneous ones.

Proposition 2.5

Assume \(x + H\) is a quasi-translation over \({\mathbb C}\) in dimension n, and

$$\begin{aligned} d \ge \deg H := \max \{\deg H_1, \deg H_2, \ldots , \deg H_n\} \end{aligned}$$

Then

$$\begin{aligned} (x,x_{n+1}) + x_{n+1}^d \left( H(x_{n+1}^{-1}x), 0 \right) \end{aligned}$$

is a homogeneous quasi-translation over \({\mathbb C}\) in dimension \(n + 1\).

Proof

Denote

$$\begin{aligned} (x,x_{n+1}) =: \tilde{x} \quad \text{ and } \quad x_{n+1}^d \left( H(x_{n+1}^{-1}x), 0 \right) =: \tilde{H} \end{aligned}$$

We must show that \(\tilde{x} + \tilde{H}\) is a quasi-translation in dimension \(n+1\) over \({\mathbb C}\). On account of (3) \(\Rightarrow \) (1) of Proposition 2.2, it suffices to show that \({\mathcal J}_{\tilde{x}} \tilde{H} \cdot \tilde{H} = (0^1,0^2,\ldots ,0^{n+1})\). Since \(\tilde{H}_{n+1} = 0\), this is equivalent to

$$\begin{aligned} {\mathcal J}\tilde{H} \cdot x_{n+1}^d H(x_{n+1}^{-1}x) = (0^1,0^2,\ldots ,0^{n+1}) \end{aligned}$$

Using that \({\mathcal J}\tilde{H}_{n+1}\) is the zero row, we see that it suffices to show that

$$\begin{aligned} {\mathcal J}\left( x_{n+1}^d H(x_{n+1}^{-1}x) \right) \cdot x_{n+1}^d H(x_{n+1}^{-1}x) = (0^1,0^2,\ldots ,0^n) \end{aligned}$$

This is indeed the case, because the chain rule tells us that

$$\begin{aligned} (0^1,0^2,\ldots ,0^n)&= x_{n+1}^{2d-1} \cdot (0^1,0^2,\ldots ,0^n) \\&= x_{n+1}^{2d-1} \cdot ({\mathcal J}H \cdot H)_{x = x_{n+1}^{-1}x} \\&= x_{n+1}^{2d-1} \cdot ({\mathcal J}H \cdot x_{n+1}^{-1} \cdot x_{n+1} H)_{x = x_{n+1}^{-1}x} \\&= x_{n+1}^{2d-1} \cdot {\mathcal J}\left( H(x_{n+1}^{-1}x) \right) \cdot x_{n+1} H(x_{n+1}^{-1}x) \\&= {\mathcal J}\left( x_{n+1}^d H(x_{n+1}^{-1}x) \right) \cdot x_{n+1}^d H(x_{n+1}^{-1}x) \end{aligned}$$

\(\square \)

Proposition 2.6 below connects quasi-translations with homogeneity.

Proposition 2.6

Assume H is a homogeneous polynomial map over \({\mathbb C}\). Then the assertions

  1. (1)

    \({\mathcal J}H^2\) is the zero matrix,

  2. (2)

    \(x + H\) is a quasi-translation,

  3. (3)

    \(H(H) = (0^1,0^2,\ldots ,0^n)\) and \({\text {rk}}{\mathcal J}H \le \max \{n-2,1\}\),

satisfy (1) \(\Rightarrow \) (2) \(\Rightarrow \) (3).

Proof

Suppose that H is homogeneous of degree d. Let \(E: {\mathbb C}[x]^n \rightarrow {\mathbb C}[x]^n\) be the map which multiplies each term in any of the n components by its own degree. Then one can verify that \(E(H) = {\mathcal J}H \cdot x\). So \({\mathcal J}H \cdot H = d^{-1} {\mathcal J}H \cdot d H = d^{-1} {\mathcal J}H \cdot E(H) = d^{-1} {\mathcal J}H^2 \cdot x\). Hence (1) \(\Rightarrow \) (2) follows from (3) \(\Rightarrow \) (1) of Proposition 2.2.

In order to prove (2) \(\Rightarrow \) (3), assume that (2) holds. By looking at the coefficient of \(t^d\) of \(H(x+tH) - H(x)\), we deduce that \(H(H) = (0^1,0^2,\ldots ,0^n)\), which is the first claim of (3).

To show the second claim of (3), assume that \({\text {rk}}{\mathcal J}H > 1\). Write \(H = g \tilde{H}\), where \(g \in {\mathbb C}[x]\), such that \(\gcd \{\tilde{H}_1,\tilde{H}_2,\ldots ,\tilde{H}_n\} = 1\). Since \({\text {rk}}{\mathcal J}H > 1\), we have \(\deg \tilde{H} \ge 1\). Furthermore, \(V(\tilde{H})\) cannot be written as a zero set of a single polynomial. Since \({\mathbb C}[x]\) is a unique factorization domain, we see that \(\dim V(\tilde{H}) \le n-2\).

Using Proposition 2.3, Proposition 1.2.9 of either van den Essen (2000) or de Bondt (2009), and the above obtained \(\tilde{H}(\tilde{H}) = 0\) and \(\dim V(\tilde{H}) \le n-2\), in that order, we deduce that

$$\begin{aligned} {\text {rk}}{\mathcal J}H = {\text {rk}}{\mathcal J}\tilde{H} = {\text {trdeg}}_{{\mathbb C}} {\mathbb C}(\tilde{H}) \le \dim V(\tilde{H}) \le n-2 \end{aligned}$$

which gives the second claim of (3). \(\square \)

3 The image of the map H of quasi-translations \(x + H\)

We prove several results about quasi-translations with geometrical arguments. Some of these results have been claimed by Gordan and Nöther (1876). For the last two sections, we need several parts of Corollary 3.10 in this section.

Since the results may essentially be useful for non-homogeneous quasi-translations as well, it does not seem to be a good idea to work with projective varieties. But we will need the completeness of complex projective space in some manner. The lemma below gives us an affine version of that.

Lemma 3.1

Let \(\tilde{Z} \subseteq {\mathbb C}^{m+kn}\) be closed with respect to the Euclidian topology. Assume that for every point of \(\tilde{Z}\), the projection onto its last kn coordinates gives a point of \({\mathbb C}^{kn}\) with complex norm \(\sqrt{k}\). Let \(\tilde{X}\) be the image of the projection of \(\tilde{Z}\) onto its first m coordinates.

Suppose that there is an irreducible variety \(X \subseteq {\mathbb C}^{m}\) and a Zariksi open set U of X, such that \(U \subseteq \tilde{X} \subseteq X\). Then \(\tilde{X} = X\).

Proof

Since the set of points in \({\mathbb C}^{kn}\) whose complex norm is \(\sqrt{k}\) form a compact space, the projection of \(\tilde{Z}\) onto \(\tilde{X}\) is closed with respect to the Euclidean topology. Hence \(\tilde{X}\) is closed in the Euclidean topology. So \(\tilde{X}\) contains the Euclidean closure of U in X. On account of O’Meara et al. (2011, Th. 7.5.1), the Euclidean closure of U in X is the same as the Zariksi closure of U in X, which is X. Hence \(X \subseteq \tilde{X}\). So \(\tilde{X} = X\) indeed.\(\square \)

Notice that reverting to Euclidean topology is not only because the complex inner product cannot be expressed as a polynomial, but also because the Zariski topology of a product is not the corresponding product topology.

We also need a weak form of the projective fiber dimension theorem in some manner. Lemma 3.3 below is an affine version of that. But first, we need another lemma.

Lemma 3.2

Suppose that \(H \in {\mathbb C}[x]^n\). Then the Zariski closure W of the image of H is irreducible and has dimension \({\text {rk}}{\mathcal J}H\).

Furthermore, V(H) has dimension at least \(n - {\text {rk}}{\mathcal J}H\) if H has no constant part.

Proof

From Proposition 1.2.9 of either van den Essen (2000) or de Bondt (2009), it follows that \({\text {rk}}{\mathcal J}H = {\text {trdeg}}_{{\mathbb C}} {\mathbb C}(H)\). Hence \(\dim W = {\text {rk}}{\mathcal J}H\) indeed.

Let Z be a component of W and let Y be the union of the other components of W. By definition of Z, \(U := H^{-1}(W \setminus Y) \ne \varnothing \). By continuity of H, U is open and \(H^{-1}(Z) \supseteq U\) is closed, so \(H^{-1}(Z) = {\mathbb C}^n\) and \(W = Z\) is irreducible.

To prove the last claim, suppose that H has no constant part. Then \(0 \in V(H)\). From a weak version of the affine fiber dimension theorem (or from Lemma 3.3 below, applied on the map \((H,x_{n+1})\)), it follows that \(\dim V(H) = \dim H^{-1}(0) \ge n - {\text {rk}}{\mathcal J}H\) indeed.\(\square \)

Lemma 3.3

Suppose that \(H: {\mathbb C}^n \rightarrow {\mathbb C}^n\) is a polynomial map and \(p \in {\mathbb C}^n\), such that the linear span \({\mathbb C}p\) of p contains infinitely many points of the image of H. Then there exists an irreducible component X of \(H^{-1}({\mathbb C}p)\) such that H(X) has infinitely many points, and the dimension of any such X is larger than \(n - {\text {rk}}{\mathcal J}H\).

Proof

Let W be the Zariski closure of the image of H. On account of Lemma 3.2, \(\dim W = {\text {rk}}{\mathcal J}H\). Take a generic linear subspace \(L \ni p\) of dimension \(n + 1 - {\text {rk}}{\mathcal J}H\) of \({\mathbb C}^n\), so that \(\dim (L \cap W) = 1\). The set \(Y := \{c \in {\mathbb C}^n \mid H(c) \in L\}\) is the zero set of \({\text {rk}}{\mathcal J}H - 1\) \({\mathbb C}\)-linear forms in the components of H. By applying (Hartshorne 1977 Ch. I, Prop. 7.1) \({\text {rk}}{\mathcal J}H - 2\) times, it follows that every irreducible component of Y have dimension greater than \(n - {\text {rk}}{\mathcal J}H\). Furthermore, \(\dim H(Y) = 1\) because \(H(Y) = L \cap W\).

Since \({\mathbb C}p \cap H(Y)\) contains infinitely many points and Y has finitely many irreducible components, there is an irreducible component X of Y such that H(X) has infinitely many points of \({\mathbb C}p\). Furthermore, \(\dim X > n - {\text {rk}}{\mathcal J}H\), because all irreducible components of Y have dimension greater than \(n - {\text {rk}}{\mathcal J}H\). So it remains to show that \(X \subseteq H^{-1}({\mathbb C}p)\).

Since H(X) has infinitely many points of \({\mathbb C}p\), it follows that \({\mathbb C}p\) is contained in the Zariski closure of H(X). As \(\dim H(X) \le \dim H(Y) = 1 = \dim {\mathbb C}p\), \({\mathbb C}p\) is a component of the Zariski closure of H(X). Now \(X \subseteq H^{-1} ({\mathbb C}p)\) follows in a similar manner as \({\mathbb C}^n \subseteq H^{-1}(Z)\) in the proof of Lemma 3.2.   \(\square \)

Lemma 3.4

Assume \(x + H\) is a homogeneous quasi-translation over \({\mathbb C}\). Suppose that p and q are independent and contained in the image of H. Then there exists an algebraic set X of dimension at least \(n - 2({\text {rk}}{\mathcal J}H - 1)\), such that \(H(c + tp) = H(c + tq) = 0\) for all \(c \in X\).

Proof

On account of Lemma 3.3, there exist irreducible algebraic sets \(X_p\) and \(X_q\) of dimension at least \(n + 1 - {\text {rk}}{\mathcal J}H\), such that \(H(X_p)\) and \(H(X_q)\) contain infinitely many points of \({\mathbb C}p\) and \({\mathbb C}q\) respectively. The set \(X_p \cap H^{-1}({\mathbb C}^{*}p)\) is an open subset of \(X_p\), and its Zariski closure is just \(X_p\) because \(X_p\) is irreducible. For \(c \in H^{-1}({\mathbb C}^{*}p)\), we have \(H(c + t p) = H(c) = \lambda p\) for some \(\lambda \in {\mathbb C}\) on account of (1) \(\Rightarrow \) (2) of Proposition 2.2. Hence \(H(c + t p) = H(c) \in {\mathbb C}p\) for every \(c \in X_p\).

By a similar argument with q instead of p, we see that \(H(c + t p) = H(c) = H(c + t q)\) is dependent of both p and q for every \(c \in X_p \cap X_q\). Due to the homogeneity of H, \(0 \in X_p \cap X_q\). Hence it follows from Hartshorne (1977, Ch. I, Prop. 7.1) that the dimension of \(X_p \cap X_q\) is at least \(n - 2({\text {rk}}{\mathcal J}H - 1)\). So \(X = X_p \cap X_q\) suffices.   \(\square \)

Lemma 3.5

Assume \(x + H\) is a homogeneous quasi-translation in dimension \(n \le 5\) over \({\mathbb C}\), such that \({\text {rk}}{\mathcal J}H = 2\) and \(\dim V(H) \le n-2\). Then V(H) contains the linear span of the image of H.

Proof

V(H) contains only finitely many \((n-2)\)-dimensional linear subspaces of \({\mathbb C}^n\) because \(\dim V(H) \le n-2\). Furthermore, the Zariski closure of the image of H is irreducible on account of Lemma 3.2. From those two facts, we can deduce that it suffices to show that every nonzero p in the image of H is contained in an \((n-2)\)-dimensional linear subspace of \({\mathbb C}^n\) which is contained in V(H).

So take any nonzero p in the image of H. Take q independent of p such that q is the image of H as well. From Lemma 3.4, it follows that there exists an algebraic set X of dimension at least \(n - 2({\text {rk}}{\mathcal J}H - 1) = n - 2\), such that \(H(c + tp) = H(c + tq) = 0\) for all \(c \in X\). Choose X irreducible. Since \(\dim V(H) \le n - 2\) and \(X \subseteq V(H)\), it follows that \(\dim X = n-2\) and that the interior \(X^{\circ }\) of X as a closed subset of V(H) is nonempty.

Take \(c \in X^{\circ }\), such that c is independent of p and q if \(n = 5\). Then the linear span of c, p and q has dimension at least \(\max \{2,n-2\}\). Since \(H(c + tp) = 0\), the linear span L of c and p is contained in V(H). Since \(c \in L \subseteq V(H)\) and \(c \in X^{\circ }\), it follows from the irreducibility of L that \(L \subseteq X\).

In as similar manner, it follows that for every \(\tilde{c} \in L \cap X^{\circ }\), hence for all \(\tilde{c} \in L\), the linear span of \(\tilde{c}\) and q is contained in V(H). So the linear span of L and q is contained in V(H). This linear span has dimension at least \(\max \{2,n-2\}\). Since \(\dim V(H) \le n-2\), it follows that \(n \ge 4\) and that p is contained in an \((n-2)\)-dimensional linear subspace of \({\mathbb C}^n\) which is contained in V(H).\(\square \)

Theorem 3.6

(Gordan and Nöther) Assume \(x + H\) is a homogeneous quasi-translation over \({\mathbb C}\), such that \(\deg H \ge 1\).

  1. (i)

    If \({\text {rk}}{\mathcal J}H \le 1\), then the image of H is a line through the origin and \(x + H\) has \(n-1\) independent linear invariants.

  2. (ii)

    If \(\gcd \{H_1,H_2,\ldots ,H_n\} = 1\), then \(2 \le {\text {rk}}{\mathcal J}H \le \dim V(H) \le n-2\).

  3. (iii)

    If \({\text {rk}}{\mathcal J}H = 2\), then \(x + H\) has at least two independent linear invariants.

Proof

For the moment, we prove (iii) only for the case where \(n \le 5\), because we do not need the case where \(n \ge 6\) in this paper. To prove the general case of (iii), one can replace the use of Lemma 3.5 by that of the more general Corollary 6.5 in the last section.

Let W be the Zariski closure of the image of H. From Lemma 3.2, it follows that W is irreducible and that \(\dim W = {\text {rk}}{\mathcal J}H\).

  1. (i)

    As \(\deg H \ge 1\), the case \({\text {rk}}{\mathcal J}H = 0\) is impossible. So assume that \({\text {rk}}{\mathcal J}H =\!\! 1\). Since H is homogeneous and \(\dim W = {\text {rk}}{\mathcal J}H = 1\), it follows from the irreducibility of W that the image of H can only be a line through the origin. Hence there are \(n-1\) independent linear forms \(l_1, l_2, \ldots , l_{n-1}\) which vanish on the image of H. So \(l_1, l_2, \ldots , l_{n-1}\) are invariants of \(x + H\).

  2. (ii)

    Assume that \(\gcd \{H_1,H_2,\ldots ,H_n\} = 1\). Since \(\deg H \ge 1\), it follows from (i) that \({\text {rk}}{\mathcal J}H \ge 2\). From (2) \(\Rightarrow \) (3) of Proposition 2.6, it follows that \({\text {rk}}{\mathcal J}H \le n - 2\), but its proof tells us that even \({\text {rk}}{\mathcal J}H \le \dim V(H) \le n - 2\). So \(2 \le {\text {rk}}{\mathcal J}H \le \dim V(H) \le n - 2\).

  3. (iii)

    Assume that \({\text {rk}}{\mathcal J}H = 2\). From Lemma 3.2, it follows that \(\dim V(H) \ge n-{\text {rk}}{\mathcal J}H = n-2\). Write \(H = g \tilde{H}\), where \(g \in {\mathbb C}[x]\), such that \(\gcd \{\tilde{H}_1,\tilde{H}_2,\ldots , \tilde{H}_n\} = 1\). Since \({\text {rk}}{\mathcal J}H = 2 > 1\), we have \(\deg \tilde{H} \ge 1\). On account of Proposition 2.3, \({\text {rk}}{\mathcal J}\tilde{H} = {\text {rk}}{\mathcal J}H = 2\). Furthermore, \(2 \le \dim V(\tilde{H}) \le n-2\) on account of (ii), so \(n \ge 4\).

    From Lemma 3.5, it follows that the linear span of the image of \(\tilde{H}\) is contained in \(V(\tilde{H})\). Since \(\dim V(\tilde{H}) \le n-2\), the linear span of the image of \(\tilde{H}\) has dimension at most \(n-2\) as well. Hence there are at least two independent linear forms \(l_1\) and \(l_2\) which vanish on the image of \(\tilde{H}\). Thus \(l_i(\tilde{H}) = 0\) and \(l_i(H) = g \cdot 0 = 0\) for both \(i \le 2\). So \(l_1\) and \(l_2\) are invariants of \(x + H\). \(\square \)

Definition 3.7

Let H be a polynomial map. We define a GN-plane of H as a 2-dimensional linear subspace of \({\mathbb C}^n\) which is contained in V(H).

Theorem 3.8

Assume \(x + H\) is a homogeneous quasi-translation over \({\mathbb C}\), such that \(2 \le {\text {rk}}{\mathcal J}H \le (n+1)/2\). Write W for the Zariski closure of the image of H.

  1. (i)

    For each \(p \in W\) and each \(q \in W\), there are GN-planes \(L_p \ni p\) and \(L_q \ni q\) of H which intersect nontrivially.

  2. (ii)

    If there exists a \(p \in W\) which is contained in only finitely many GN-planes of H, then the set of such \(p \in W\) is not contained in a proper algebraic subset of W.

  3. (iii)

    Suppose that \(p^{(1)}, p^{(2)}, \ldots , p^{(k)} \in W\), such that \(p^{(i)}\) is contained in only finitely many GN-planes of H for each i.

    Then there exist GN-planes \(L_{p^{(1)}} \ni p^{(1)}\), \(L_{p^{(2)}} \ni p^{(2)}\), ..., \(L_{p^{(k)}} \ni p^{(k)}\) of H, such that for each \(q \in W\), there exists a GN-plane \(L_q \ni q\) of H which intersects \(L_{p^{(i)}}\) nontrivially for each i.

Proof

  1. (i)

    We first show that (i) holds for all (pq) in a dense open subset of \(W^2\). The generic property of p and q that we assume is that p and q are independent and contained in the image of H itself. From Mumford (1999, §1.8, Th. 3), it follows that the image of H contains an open subset of W, so that we can easily show that we are considering a dense open subset of \(W^2\) indeed. From Lemma 3.4, it follows that there exists an algebraic set X of dimension at least \(n - 2({\text {rk}}{\mathcal J}H - 1) \ge 1\), such that \(H(c + t p) = H(c + t q) = 0\) for every \(c \in X\). Take \(c \in X\) nonzero. Since H is homogeneous, we deduce by substituting \(t = t^{-1}\) that \(H(tc + p) = H(tc + q) = 0\).

    In the general case, consider the sets

    $$\begin{aligned} Z := \{(p,q,c,b) \in W^2 \times ({\mathbb C}^n)^2 \mid H(tc + p) = H(tc + q) = 0 \text{ and } b^\mathrm{t}c = 1 \} \end{aligned}$$

    and

    $$\begin{aligned} \tilde{Z} := \{(p,q,c,b) \in Z \mid b \text{ is } \text{ the } \text{ complex } \text{ conjugate } \text{ of } c\} \end{aligned}$$

    By applying proper substitutions in t, we see that the image \(\tilde{X}\) of the projection of \(\tilde{Z}\) onto its first 2n coordinates is equal to that of Z. Since \(\tilde{X}\) contains an open subset of \(X := W \times W\), it follows from Lemma 3.1 that \(\tilde{X} = X\), which gives (i).

  2. (ii)

    Suppose that there exists a \(p \in W\) for which there are only finitely many GN-planes \(L_p \ni p\). Let Y be the set of \(q \in W\) for which there are infinitely many GN-planes \(L_q \ni q\). It is clear that (ii) holds if \(Y = \{0\}\), so assume that there exist a \(q \in Y\) which is nonzero. Take \(P := \{c \in V(H) \mid H(c + tp) = 0\}\) and \(Q := \{c \in V(H) \mid H(c + tq) = 0\}\). Since H is homogeneous, we see that both P and Q are unions of GN-planes. Furthermore, \(\dim P = 2\) and \(\dim Q \ge 3\) because of the cardinality assumptions on the GN-planes in P and Q.

    Let L be a generic linear subspace of dimension \(n-2\) of \({\mathbb C}^n\), so that \(\dim (L \cap P) = 0\). Then \(L \cap P = \{0\} \subseteq L \cap Q\) and on account of Hartshorne (1977, Ch. I, Prop. 7.1), \(\dim (L \cap Q) \ge 1\). Now define

    $$\begin{aligned} Z := \{(r,c,b) \in W \times L \times {\mathbb C}^n \mid H(tc + r) = 0 \text{ and } b^\mathrm{t}c = 1 \} \end{aligned}$$

    and

    $$\begin{aligned} \tilde{Z} := \{(r,c,b) \in Z \mid b \text{ is } \text{ the } \text{ complex } \text{ conjugate } \text{ of } c\} \end{aligned}$$

    By applying proper substitutions in t, we see that the image \(\tilde{X}\) of the projection of \(\tilde{Z}\) onto its first n coordinates is equal to that of Z. Furthermore q is contained in \(\tilde{X}\), but p is not. Since \(q \in Y \setminus \{0\}\) was arbitrary, we see that \(Y \subseteq \tilde{X}\).

    If Y would contain an open subset of W, then Lemma 3.1 tells us that \(\tilde{X} = W\), which contradicts that p is not contained in \(\tilde{X}\). So Y does not contain an open subset of W, and \(W \setminus Y\) is not contained in a proper closed subset of W indeed.

  3. (iii)

    We can simplify (iii) by changing both the quantization set of q and the quantization order, to get the following.

    (iii\(^{\prime }\))    Suppose that \(p^{(1)}, p^{(2)}, \ldots , p^{(k)} \in W\), such that \(p^{(i)}\) is contained in only finitely many GN-planes of H for each i.

    Then for each \(q \in W\) which contains only finitely many GN-planes of H, there exist a GN-plane \(L_q\) of H and GN-planes \(L_{p^{(1)}} \ni p^{(1)}\), \(L_{p^{(2)}} \ni p^{(2)}\), ..., \(L_{p^{(k)}} \ni p^{(k)}\) of H, such that \(L_q\) and \(L_{p^{(i)}}\) intersect nontrivially for each i.

    The case where \(k = 1\) of this simplification follows from (i). The case where \(k \ge 2\) of this simplification follows from the case where \(k = 1\) of the unsimplified (iii) with \(p^{(1)} = q\), which may be assumed by induction on k. So it remains to deduce (iii) from its simplification. For that purpose, define Y as

    $$\begin{aligned} \begin{aligned} Y&:= \left\{ (q,c^{(1)},c^{(2)},\ldots ,c^{(k)},b^{(1)},b^{(2)},\ldots ,b^{(k)}) \in W \times ({\mathbb C}^n)^{2k} ~\big | \right. \\&\qquad \quad H(tc^{(i)} + q) = H(tc^{(i)} + p^{(i)}) = 0 \text{ and } (b^{(i)})^\mathrm{t}c^{(i)} = 1 \\&\qquad \quad \left. \text{ for } \text{ each } i, \text{ and } {\text {rk}}\left( \,q\,\big |\,c^{(1)}\,\big |\,c^{(2)}\,\big |\,\cdots \,\big |\,c^{(k)}\right) \le 2 \right\} \end{aligned} \end{aligned}$$

    We can write Y as a union of algebraic sets of the form

    $$\begin{aligned} \begin{aligned}&\left\{ (q,c^{(1)},\ldots ,c^{(k)},b^{(1)},\ldots ,b^{(k)}) \in Y ~\big |~ c^{(i)} \in L_{p^{(i)}} \right. \\&\left. \text{ for } \text{ each } i, \text{ and } {\text {rk}}\left( \,q\,\big |\,c^{(1)}\,\big |\,c^{(2)}\,\big |\,\cdots \,\big |\,c^{(k)} \right) \le 2 \right\} \end{aligned} \end{aligned}$$
    (3.1)

    where \(L_{p^{(i)}} \ni p^{(i)}\) is a GN-plane of H for each i. This union is finite by assumption.

    Let f be the projection of \({\mathbb C}^{n+2kn}\) onto its first n coordinates. From the simplified version of (iii), it follows that the image of \(f|_{Y}\) contains all \(q \in W\) which contains only finitely many GN-planes of H. On account of (ii), the image of \(f|_{Y}\) is not contained in a proper algebraic subset of W. Hence there exists an irreducible component Z of Y such that the image of \(f|_{Z}\) is not contained in a proper algebraic subset of W. From Mumford (1999, §1.8, Th. 3), it follows that the image of \(f|_{Z}\) contains an open subset of W.

    Since Y is a finite union of algebraic subsets of the form (3.1) and Z is irreducible, we deduce that Z is contained in an algebraic subset of the form (3.1). Take

    $$\begin{aligned} \begin{aligned} \tilde{Z}&:= \left\{ (q,c^{(1)},c^{(2)},\ldots ,c^{(k)},b^{(1)},b^{(2)},\ldots ,b^{(k)}) \in Z ~\big | \right. \\&\quad \qquad \left. b^{(i)} \text{ is } \text{ the } \text{ complex } \text{ conjugate } \text{ of } c^{(i)} \text{ for } \text{ each } i\right\} \end{aligned} \end{aligned}$$

    By applying proper substitutions in t and \(y_1,y_2,\ldots ,y_k\), we see that the image \(\tilde{X}\) of \(f|_{\tilde{Z}}\) is the same as that of \(f|_{Z}\), so \(\tilde{X}\) contains an open subset of W. From Lemma 3.1, it follows that \(\tilde{X} = W\). Since \(\tilde{X}\) is the image of the restriction of f on an algebraic subset of the form (3.1), the unsimplified (iii) follows. \(\square \)

Definition 3.9

Let X be any subset of \({\mathbb C}^n\). We say that \(a \in {\mathbb C}^n\) is an apex of X if \((1-\lambda )c + \lambda a \in X\) for all \(\lambda \in {\mathbb C}\) and all \(c \in X\).

We say that a \(p \in {\mathbb C}^n\) is a projective apex of X if \(p \ne 0\) and \(c + \lambda p \in X\) for all \(\lambda \in {\mathbb C}\) and all \(c \in X\).

If X is the Zariski closure of the image of a map H, then we say that a and p as above are an image apex of H and a projective image apex of H respectively.

figure a

One may convince oneself that a projective apex is in fact an apex on the projective horizon.

If X is a zero set of homogeneous polynomials, e.g. because X is the Zariski closure of the image of a homogeneous map, then 0 is an apex of X. If 0 is an apex of X, then a projective image apex is the same as a nonzero apex. In that case, we will parenthesize the word projective.

Corollary 3.10

Assume \(x + H\) is a homogeneous quasi-translation over \({\mathbb C}\), such that \({\text {rk}}{\mathcal J}H \le (n+1)/2\). Write W for the Zariski closure of the image of H. Then for

  1. (1)

    \(\dim V(H) = {\text {rk}}{\mathcal J}H \le 3\) and W has no nonzero (projective) apex;

  2. (2)

    \(\dim V(H) = {\text {rk}}{\mathcal J}H\) and there is no nonzero \(p \in W\) which contains infinitely many GN-planes of H that are contained in W;

  3. (3)

    There exists a \(p \in W\) which is contained in only finitely many GN-planes of H, but there does not exist a nonzero \(c \in V(H)\) which shares a GN-plane of H with every \(q \in W\);

  4. (4)

    \({\text {rk}}{\mathcal J}H \le 1\) or W is properly contained in the linear span of two GN-planes of H which are contained in W;

  5. (5)

    W is a properly contained in a 4-dimensional linear subspace of \({\mathbb C}^n\) and \({\text {rk}}{\mathcal J}H \le 3\);

we have (1) \(\Rightarrow \) (2) \(\Rightarrow \) (3) \(\Rightarrow \) (4) \(\Rightarrow \) (5).

Proof

From Lemma 3.2, it follows that W is irreducible and that \({\text {rk}}{\mathcal J}H = \dim W\).

(1) \({{\Rightarrow }}\) (2):

Assume that \(\dim V(H) = {\text {rk}}{\mathcal J}H \le 3\) and that (2) does not hold. Then there exists a nonzero \(p \in W\) which contains infinitely many GN-planes of H that are contained in W. Suppose that W is the zero set of \(g_1, g_2, \ldots , g_m\) and let

$$\begin{aligned} \quad Y = \{q \in W \mid g_1(p + tq) = g_2(p + tq) = \cdots = g_m(p + tq)= 0\} \end{aligned}$$

Then Y has an irreducible component Z which contains infinitely many GN-planes of H. Hence \(\dim Z \ge 3\). Since \(Z \subseteq Y \subseteq W\) and \(\dim W = {\text {rk}}{\mathcal J}H \le 3\), it follows from the irreducibility of Z and W that \(Z = W\). So p is a nonzero (projective) apex of W and (1) does not hold.

(2) \({{\Rightarrow }}\) (3):

Assume that \(\dim V(H) = {\text {rk}}{\mathcal J}H\) and that (3) does not hold. Since \(\dim V(H) = {\text {rk}}{\mathcal J}H\), it follows from Lemma 3.2 that W is an irreducible component of V(H), so the interior of W as a closed subset of V(H) is nonempty. Take p in that interior and let \(L_p \ni p\) be a GN-plane of H. Since \(L_p\) is irreducible, \(L_p\) is contained in an irreducible component of V(H), which can only be W because W is the only irreducible component of V(H) which contains p.

So if p is contained in infinitely many GN-planes of H, then (2) cannot hold. Hence assume that p is contained in only finitely many GN-planes of H. Since (3) does not hold, there exists a nonzero \(c \in V(H)\) which shares a GN-plane of H with every \(q \in W\). Inductively, we can choose \(p^{(i)}\) in the interior of W outside \(L_{p^{(1)}}, L_{p^{(2)}}, \ldots , L_{p^{(i-1)}}\) for \(i = 1,2,3,\ldots \), such that \(c \in L_{p^{(i)}}\) for each i. As we have seen above, \(L_{p^{(i)}} \subseteq W\) for each i, so c is a counterexample to the claim of (2).

(3) \({{\Rightarrow }}\) (4):

Assume that (3) is satisfied. From (ii) of Theorem 3.8, it follows that there exist a \(p^{(1)} \in W\) and a \(p^{(2)} \in W\) as in (iii) of Theorem 3.8. Take \(L_{p^{(1)}}\) and \(L_{p^{(2)}}\) as in (iii) of Theorem 3.8. Since there is no nonzero \(c \in V(H)\) which shares a GN-plane of H with every \(q \in W\), W cannot be equal to any linear span. Hence it suffices to show that W is contained in the linear span of \(L_{p^{(1)}}\) and \(L_{p^{(2)}}\). In the case where \(L_{p^{(1)}} \cap L_{p^{(2)}} = \{0\}\), this follows directly from (iii) of Theorem 3.8, so assume that there exist a nonzero \(c \in L_{p^{(1)}} \cap L_{p^{(2)}}\). Let

$$\begin{aligned} Y = \{q \in W \mid H(c + tq) = 0\} \end{aligned}$$

From (3), it follows that Y is a proper algebraic subset of W. Since W irreducible and contained in the union of Y and the linear span of \(L_{p^{(1)}}\) and \(L_{p^{(2)}}\), W is contained in the linear span of \(L_{p^{(1)}}\) and \(L_{p^{(2)}}\).

(4) \({{\Rightarrow }}\) (5):

Assume that (4) is satisfied. If \({\text {rk}}{\mathcal J}H \le 1\), then W is a line through the origin on account of (i) of Theorem 3.6, which gives (5). So assume that \({\text {rk}}{\mathcal J}H \ge 2\). Then W is properly contained in a 4-dimensional linear subspace of \({\mathbb C}^n\) and hence \({\text {rk}}{\mathcal J}H = \dim W < 4\). \(\square \)

Remark 3.11

Theorem 3.6 was obtained in Gordan and Nöther (1876, p. 565), but Gordan and Nöther proved additionally that \({\mathcal J}H \cdot H(y) = 0\) if \({\text {rk}}{\mathcal J}H \le 2\) and \({\text {rk}}{\mathcal J}H + \dim V(H) \le n\). See de Bondt (2014, Th. 4.1) for properties that are equivalent to \({\mathcal J}H \cdot H(y) = 0\).

The starting point of the distinction into cases ‘Fall a)’ and ‘Fall b)’ on Gordan and Nöther (1876, p. 565) is (i) of Theorem 3.8, but with the extra property that \(L_p\) and \(L_q\) are contained in W. Since \(\dim V(H) = {\text {rk}}{\mathcal J}H = \dim W\) in this situation, this extra property can indeed be obtained, namely by extending the genericity condition in the proof of (i) of Theorem 3.8 by that p and q are in the interior of W as a closed subset of V(H).

The case where \(k = 2\) of (iii) of Theorem 3.8 is obtained on Gordan and Nöther (1876, p. 566), and is used on the same page to prove the case where \(n = 5\) and \({\text {rk}}{\mathcal J}H = 3\) of Corollary 3.10.

4 Homogeneous singular Hessians in dimension 5

Gordan and Nöther (1876) classified all homogeneous polynomials with singular Hessians in dimension 5 as follows.

Theorem 4.1

(Gordan and Nöther) Assume \(h \in {\mathbb C}[x]\) is a homogeneous polynomial in dimension \(n = 5\). If \(\det {\mathcal H}h = 0\) and h is not a polynomial in \(n - 1 = 4\) linear forms in \({\mathbb C}[x]\), then there exists an invertible matrix T over \({\mathbb C}\) such that h(Tx) is of the form

$$\begin{aligned} h(Tx) = f \left( x_1,x_2,a_1(x_1,x_2)x_3 + a_2(x_1,x_2)x_4 + a_3(x_1,x_2)x_5 \right) \end{aligned}$$

where f and \(a_1, a_2, a_3\) are polynomials over \({\mathbb C}\) in their arguments.

The proof that is given below uses results about homogeneous quasi-translations in dimension five and follows the approach of Gordan and Nöther more or less.

The following connection exists between singular Hessians and quasi-translations.

Proposition 4.2

(Gordan and Nöther) Assume \(h \in {\mathbb C}[x]\) such that \(\det {\mathcal H}h = 0\). Then there exists a nonzero \(R \in {\mathbb C}[y]\) such that \(R(\nabla h) = 0\). For any such R, \(x + H\) is a quasi-translation and \((\nabla h)(x+tH) = \nabla h\), where \(H := (\nabla R)(\nabla h)\), and \(H \ne 0\) if R has minimum degree. Furthermore, \(h(x + tH) = h\) if \(R^{*}(\nabla h) = 0\) for every homogeneous part of \(R^{*}\) of R.

Proof

From Proposition 1.2.9 of either van den Essen (2000) or de Bondt (2009), it follows that the components of \(\nabla h\) are algebraically dependent over \({\mathbb C}\), so R indeed exists. By the chain rule,

$$\begin{aligned} {\mathcal J}H \cdot H = ({\mathcal H}R)|_{y=\nabla h} \cdot {\mathcal H}h \cdot H \end{aligned}$$

So if \({\mathcal H}h \cdot H = 0\), then \(x + H\) is a quasi-translation on account of (3) \(\Rightarrow \) (1) of Proposition 2.2. Indeed, if we take the Jacobian of \(R(\nabla h) = 0\), we obtain

$$\begin{aligned} {\mathcal J}0 = {\mathcal J}\left( R(\nabla h) \right) = ({\mathcal J}R)_{y = \nabla h} \cdot {\mathcal H}h = H^\mathrm{t}\cdot {\mathcal H}h \end{aligned}$$

which gives \({\mathcal H}h \cdot H = 0\), because \({\mathcal H}h\) is symmetric. Furthermore, (2.3) in Proposition 2.2 tells us that \((\nabla h)(x + tH) = 0\).

If R has minimum degree and \(H_i = 0\), then \( \frac{\partial }{ \partial y_i } R = 0\) because \(( \frac{\partial }{ \partial y_i } R)(\nabla h) = H_i = 0\). Since \(R \notin {\mathbb C}\), we see that \(H \ne 0\) if R has minimum degree.

Suppose that \(R^{*}(\nabla h) = 0\) for every homogeneous part \(R^{*}\) of R. Let \(E_y: {\mathbb C}[y] \rightarrow {\mathbb C}[y]\) be the map which multiplies each term by its own degree in y. Then one can verify that \(E_y R = y^\mathrm{t}\nabla R\), and that \(E_y R\) is a linear combination of the homogeneous parts \(R^{*}\) of R. So \({\mathcal J}h \cdot H = (y^\mathrm{t}\nabla R)_{y = \nabla h} = (E_y R)_{y = \nabla h} = 0\). Hence \(h(x+tH) = h\) on account of (2.3) in Proposition 2.2.\(\square \)

In order to prove Theorem 4.1, we need the classification of all homogeneous polynomials with singular Hessians in dimensions less than 5, which is as in Theorem 4.3 below. Our proof of Theorem 4.3 is somewhat different from that by Gordan and Nöther. A proof of Theorem 4.3 which is based on that by Gordan and Nöther can be found in Lossen (2004).

Theorem 4.3

(Gordan and Nöther) Assume \(h \in {\mathbb C}[x]\) is a homogeneous polynomial in dimension \(n \le 4\). If \(\det {\mathcal H}h = 0\), then the components of \(\nabla h\) are linearly dependent over \({\mathbb C}\).

Proof

Suppose that the components of \(\nabla h\) are linearly independent over \({\mathbb C}\). Then \(\deg \nabla h \ge 1\) because \(\det {\mathcal H}h = 0\). Let \(H = (\nabla R)(\nabla h)\) as in Proposition 4.2, such that R has minimum degree. Then H is a nonzero quasi-translation and \(\deg H \ge 1\) because \(\deg R \ge 2\) and \(\deg \nabla h \ge 1\). Furthermore, H is homogeneous because R and \(\nabla h\) are homogeneous. From (2) \(\Rightarrow \) (3) of Proposition 2.6, it follows that \(r := {\text {rk}}{\mathcal J}H \le \max \{n-2,1\} \le 2\). Using (i) and (iii) of Theorem 3.6, we can deduce that \(x + H\) has \(n-r < n\) linear invariants.

Since \(n-r < n\), there exists a nonzero \(p \in {\mathbb C}^n\) which is a zero of all these \(n - r\) linear invariants. From Proposition 1.2.9 of either van den Essen (2000) or de Bondt (2009), it follows that \({\text {trdeg}}_{{\mathbb C}}(H) = r\). Hence the \(n - r\) linear invariants of \(x + H\) generate the ideal \((\tilde{R} \in {\mathbb C}[y] \mid \tilde{R}(H) = 0)\) of \({\mathbb C}[y]\). Consequently, p is a projective image apex of H. From Lemma 4.4 below, it follows that \({\mathcal J}h \cdot p = 0\), so the components of \(\nabla h\) are linearly dependent over \({\mathbb C}\) indeed.\(\square \)

Lemma 4.4

Let \(h \in {\mathbb C}[x]\) and \(R \in {\mathbb C}[y]\), such that \(R^{*}(\nabla h) = 0\) for every homogeneous part \(R^{*}\) of R.

Then \({\mathcal J}h \cdot {\mathcal J}H = 0\), where \(H := (\nabla R)(\nabla h)\). Furthermore, if p is a projective image apex of H, then \({\mathcal J}h \cdot p = 0\).

Proof

From Proposition 4.2, it follows that \(h(x + tH) = h\). By taking the Jacobian on both sides, we obtain

$$\begin{aligned} ({\mathcal J}h)|_{x=x+tH} \cdot (I_n + t {\mathcal J}H) = {\mathcal J}h \end{aligned}$$

From Proposition 4.2 again, it follows that \(({\mathcal J}h)|_{x=x+tH} = {\mathcal J}h\), so \({\mathcal J}h \cdot t {\mathcal J}H = 0\), which gives the first claim.

Suppose that p is a projective image apex of H. Take \(T \in {\text {GL}}_n({\mathbb C})\) such that the last column of T equals p. Then \(e_n\) is a projective image apex of \(\tilde{H} := T^{-1} H\). So \(\tilde{H}_n\) is algebraically independent of \(\tilde{H}_1, \tilde{H}_2, \ldots , \tilde{H}_{n-1}\). Hence \({\text {trdeg}}_{{\mathbb C}} {\mathbb C}(\tilde{H}) = {\text {trdeg}}_{{\mathbb C}} {\mathbb C}(\tilde{H}_1,\tilde{H}_1,\ldots ,\tilde{H}_{n-1}) + 1\). From proposition 1.2.9 of either van den Essen (2000) or de Bondt (2009), it follows that the last row of \({\mathcal J}\tilde{H}\) is independent of the rows above it.

But \({\mathcal J}h \cdot T \cdot {\mathcal J}\tilde{H} = {\mathcal J}h \cdot {\mathcal J}H = 0\). Hence the rightmost entry of \({\mathcal J}h \cdot T\) is zero. So \({\mathcal J}h \cdot p = 0\) indeed.\(\square \)

Theorem 4.1 is formulated as de Bondt and van de Essen (2004, Th. 3.6). The starting point of the proof of de Bondt and van de Essen (2004, Th. 3.6) is de Bondt and van de Essen (2004 Th. 2.1 iii)), which is not accompanied by a proof and comes down the following.

Theorem 4.5

(Gordan and Nöther) Assume \(h \in {\mathbb C}[x]\) is a homogeneous polynomial in dimension \(n = 5\). Suppose that \(R(\nabla h) = 0\), such that R has minimum degree. Then R can be expressed as a polynomial in three linear forms over y.

Proof

Notice that R is homogeneous because h is homogeneous and R has minimum degree. We distinguish two cases.

  • R cannot be expressed as a polynomial in four linear forms over y.

    Then the components of \(\nabla R\) are linearly independent over \({\mathbb C}\). Since R has minimum degree, the components of \(H := (\nabla R)(\nabla h)\) are linearly independent over \({\mathbb C}\) as well. Write \(H = g \tilde{H}\), where \(g \in {\mathbb C}[x]\), such that \(\gcd \{\tilde{H}_1,\tilde{H}_2,\ldots ,\tilde{H}_n\} = 1\). Since the components of H and hence also \(\tilde{H}\) are linearly independent over \({\mathbb C}\), we have \(\deg \tilde{H} \ge 1\). On account of Proposition 2.3, \({\text {rk}}{\mathcal J}\tilde{H} = {\text {rk}}{\mathcal J}H\).

    Since the components of \(\tilde{H}\) are linearly independent over \({\mathbb C}\), it follows from Theorem 3.6 that \(3 \le {\text {rk}}{\mathcal J}\tilde{H} \le \dim V(\tilde{H}) \le n-2\), so \({\text {rk}}{\mathcal J}\tilde{H} = \dim V(\tilde{H}) = 3\). From (1) \(\Rightarrow \) (5) of Corollary 3.10, it follows that \(\tilde{H}\) has a projective image apex, say p. Then \(f(\tilde{H}) = 0\) implies \(f(\tilde{H} + tp) = 0\) for every homogeneous \(f \in {\mathbb C}[y]\). Hence \(f(H) = 0\) implies \(f(H + tgp) = 0\) for every homogeneous \(f \in {\mathbb C}[y]\). Since H is homogeneous, we can substitute \(t = g^{-1}t\) to deduce that p is a projective image apex of H as well.

    From Lemma 4.4, it subsequently follows that \({\mathcal J}h \cdot p = 0\). Hence the components of \(\nabla h\) are linearly dependent over \({\mathbb C}\). Since R has minimum degree, we conclude that \(\deg R = 1\), so R is a linear form in \({\mathbb C}[y]\). Contradiction.

  • R can be expressed as a polynomial in four linear forms over y.

    Then there is an \(i \le 5\) such that \(y_i\) is not a linear combination of these four linear forms. Say that \(i = 5\). Then R is of the form \(\tilde{R}(y_1 + c_1 y_5, y_2 + c_2 y_5, y_3 + c_3 y_5, y_4 + c_4 y_5)\), where \(c_i \in {\mathbb C}\) for each i. Furthermore, \(\tilde{R} \in {\mathbb C}[y_1,y_2,y_3,y_4]\) is homogeneous and \(\tilde{R}(\nabla \tilde{h}) = \tilde{R}(\nabla \hat{h}) = 0\), where

    $$\begin{aligned} \tilde{h} = h\big |_{x_5 = x_5 + c_1 x_1 + c_2 x_2 + c_3 x_3 + c_4 x_4} \quad \text{ and } \quad \hat{h} = \tilde{h}\big |_{x_5 = 1} \end{aligned}$$

    Since \(\tilde{h}\) is homogeneous, say of degree d, it follows that \(\tilde{h} = x_5^d \hat{h}(x_5^{-1} x)\) and that \(\nabla \tilde{h}\) and \(x_5^{d-1} (\nabla \hat{h})(x_5^{-1} x)\) agree on the first 4 components. From this, we can deduce that \(\tilde{R}\), as a homogeneous polynomial in \({\mathbb C}[y_1, y_2, y_3, y_4]\) such that \(\tilde{R}(\nabla \hat{h}) = 0\), has minimum degree as well. From Theorem 4.6 below, we obtain that \(\tilde{R}\) can be expressed as a polynomial in three linear forms in \({\mathbb C}[y_1, y_2, y_3, y_4]\). Hence R can be expressed as a polynomial in three linear forms in \({\mathbb C}[y]\). \(\square \)

Theorem 4.6

Let \(n = 4\) and \(h \in {\mathbb C}[x]\), not necessarily homogeneous. Suppose that \(R \in {\mathbb C}[y]\) is homogeneous, such that \(R(\nabla h) = 0\). If R has minimum degree, then R can be expressed as a polynomial in three linear forms in \({\mathbb C}[y]\).

Proof

Suppose that R has minimum degree. Let \(\bar{h}\) be the leading homogeneous part of h, and define \(H := (\nabla R)(\nabla h)\). From Proposition 4.2, it follows that \(h(x + tH) = h\). By taking the leading coefficient with respect to t, we deduce that \(\bar{h}(H) = 0\).

Since \(\bar{h}\) is homogeneous and \(R(\nabla \bar{h}) = 0\), it follows from Theorem 4.3 that the components of \(\nabla \bar{h}\) are linearly dependent over \({\mathbb C}\), say that \(L(\nabla \bar{h}) = 0\) for some linear form \(L \in {\mathbb C}[y]\). Assume first that \({\text {rk}}{\mathcal H}\bar{h} = 3\). Then the relations between the components of \(\nabla \bar{h}\) form a prime ideal of height one, which is a principal ideal because \({\mathbb C}[y]\) is a unique factorization domain. Since L is irreducible, (L) must be that principal ideal, and \(L \mid R\) because \(R(\nabla \bar{h}) = 0\). Since R has minimum degree, R is irreducible, so R is linear.

Assume next that \({\text {rk}}{\mathcal H}\bar{h} \le 2\). Since there exists a linear relation between the components of \(\nabla \bar{h}\), there exists a \(T \in {\text {GL}}_n({\mathbb C})\) such that the last component of \(T^\mathrm{t}\nabla \bar{h}\) is zero. Hence the last component of \(\nabla (\bar{h}(Tx)) = T^\mathrm{t}(\nabla \bar{h})(Tx)\) is zero. So \(\bar{h}(Tx) \in {\mathbb C}[x_1,x_2,x_3]\). Since \({\mathcal H}(\bar{h}(Tx)) = T^\mathrm{t}({\mathcal H}\bar{h})|_{x = Tx} T\), we see that \({\text {rk}}{\mathcal H}(\bar{h}(Tx)) \le 2\). It follows from Theorem 4.3 again that \(\bar{h}(Tx)\) can be expressed as a polynomial in two linear forms. Hence \(\bar{h} = \bar{h} (T(T^{-1}x))\) can be expressed as a polynomial in two linear forms as well.

Since \(\bar{h}\) is homogeneous in addition, \(\bar{h}\) decomposes into linear factors, and one of these factors is already a relation between \(H_1, H_2, H_3, H_4\). So there exist a linear form \(M \in {\mathbb C}[x]\) such that \(M((\nabla R)(\nabla h)) = M(H) = 0\). Since R has minimum degree, \(M(\nabla R) = 0\). On account of Example 1.2 in de Bondt and van den Essen (2004), R can be expressed as a polynomial in three linear forms over y.\(\square \)

Remark 4.7

The proof of the first case in the proof of Theorem 4.5 is different from that given in Gordan and Nöther (1876, p. 568), where the second claim of Lemma 4.4 is obtained by way of differentiation on the inverse of H. Since the inverse of H is not a map, the above proof of this first case seems much easier. The proof of this first case as given in de Bondt (2009, Th. 5.3.7) is incorrect.

The proof of the second case in the proof of Theorem 4.5 comes from Gordan and Nöther (1876, p. 567). This seems a little odd, because Lemma 4.6 is about not necessarily homogeneous polynomials, which Gordan and Nöther did not consider in Gordan and Nöther (1876). But in spite of that, the proof of Lemma 4.6 comes from Gordan and Nöther (1876, p. 567) indeed.

On Gordan and Nöther (1876, p. 567), Gordan and Nöther additionally prove that \({\text {rk}}{\mathcal J}H \le 2\), as follows. They assume that \(H_1 = H_2 = 0\) on account of Theorem 4.5 and Proposition 2.4, and show the first claim of Lemma 4.4 that \({\mathcal J}h \cdot {\mathcal J}H = 0\), to conclude that either \(h \in {\mathbb C}[x_1,x_2]\) or that the rows of \({\mathcal J}(H_3,H_4,H_5)\) are dependent. In both cases, \({\text {rk}}{\mathcal J}H \le 2\) indeed, because \(H_i \in {\mathbb C}[ \frac{\partial }{ \partial x_1 } h, \frac{\partial }{ \partial x_2 } h]\) for all i in the first case, so that the row space of \({\mathcal J}H\) is generated by \({\mathcal J}( \frac{\partial }{ \partial x_1 } h)\) and \({\mathcal J}( \frac{\partial }{ \partial x_2 } h)\).

Unlike Gordan and Nöther, we do not need to show that \({\text {rk}}{\mathcal J}H \le 2\) here, because for the techniques in de Bondt and van den Essen (2004), linear dependences between the components of H are the only thing that matters. But the result of Gordan and Nöther can be used to fix the gap in Watanabe (2014), which is caused by the incorrect (Watanabe 2014, Lm. 5.2), and a gap on the same point in Franchetta (1954).

5 Homogeneous 5-dimensional quasi-translations of ‘Fall b)’

In this section, we study homogeneous quasi-translations in dimension 5 which corresponds to ‘Fall b)’ in Gordan and Nöther (1876, §8). In Corollary 5.2, we will show that homogeneous quasi-translations in dimension 5 which are not of this type always have a linear invariant.

Theorem 5.1

Assume \(x + H\) is a homogeneous quasi-translation in dimension 5 over \({\mathbb C}\), such that \(\gcd \{H_1,H_2,H_3,H_4,H_5\} = 1\) and \(H_5\) is algebraically independent over \({\mathbb C}\) of \(H_1, H_2, H_3, H_4\).

If \(x + H\) does not have two independent linear invariants, then \({\text {rk}}{\mathcal J}H = \dim V(H) = 3\) and the following holds.

  1. (i)

    H is of the form

    $$\begin{aligned} H = \left( g h_1(p,q),g h_2(p,q),g h_3(p,q),g h_4(p,q),H_5 \right) \end{aligned}$$

    where \(g \in {\mathbb C}[x]\), \(h \in {\mathbb C}[y_1,y_2]^4\) and \((p,q) \in {\mathbb C}[x]^2\) are homogeneous, and \(\gcd \{p,q\} = 1\).

  2. (ii)

    gpq are invariants of \(x + H\), and \(g \in {\mathbb C}[x_1,x_2,x_3,x_4]\).

  3. (iii)

    \(\deg _{x_5} H_5 \le \deg _{x_5} (H_1,H_2,H_3,H_4)\) and there exists a linear combination over \({\mathbb C}\) of p and q whose degree with respect to \(x_5\) is less than \(\max \{\deg _{x_5} p, \deg _{x_5} q\}\).

  4. (iv)

    There exists an invariant \(a \in {\mathbb C}[x]\) of degree at most 1 of \(x + H\), such that every invariant of \(x + H\) which can be expressed as a polynomial in four linear forms in \({\mathbb C}[x]\) is contained in \({\mathbb C}[a]\).

  5. (v)

    If H has no linear invariants at all, then \(g \in {\mathbb C}\).

Proof

From (ii) of Theorem 3.6, it follows that \(2 \le {\text {rk}}{\mathcal J}H \le \dim V(H) \le 3\). Assume that \(x + H\) does not have two independent linear invariants. From (iii) of Theorem 3.6, it follows that \({\text {rk}}{\mathcal J}H \ne 2\), so \({\text {rk}}{\mathcal J}H = \dim V(H) = 3\).

  1. (i)

    Since \(H_5\) is algebraically independent over \({\mathbb C}\) of \(H_1, H_2, H_3, H_4\), it follows from \({\text {rk}}{\mathcal J}H = 3\) and Proposition 1.2.9 of either van den Essen (2000) or de Bondt (2009) that \({\text {rk}}{\mathcal J}(H_1,H_2,H_3,H_4) = 2\). Using (de Bondt and van den Essen 2005, Th. 2.2) (see also de Bondt 2009, Th. 4.3.1), we see that H is of the given form.

  2. (ii)

    Take \(i \le 4\) such that \(H_i \ne 0\). Then \(g \cdot h_i(p,q) = H_i\) and on account of Proposition 2.2,

    $$\begin{aligned} \deg _t g(x + tH) + \deg _t h_i \left( p(x+tH),q(x+tH) \right) = \deg _t H_i(x + tH) = 0 \end{aligned}$$

    whence \(\deg _t g(x + tH) = 0\) and g is an invariant of \(x + H\). Similarly, any linear form in p and q that divides \(H_i\) is an invariant of \(x + H\) as well. If there is at most one independent linear form in p and q that divides \(H_i\) for any \(i \le 4\) such that \(H_i \ne 0\), then \(\deg g = \deg H\) and \(x + H\) has three independent linear invariants, which is a contradiction. Hence there are two independent linear forms in p and q that are invariants of \(x + H\). Since p and q are in turn linear forms in these invariants, p and q are invariants of \(x + H\) themselves.

    Since g is an invariant of \(x + H\), it follows from (2.3) in Proposition 2.2 that \({\mathcal J}g \cdot H = 0\). Hence

    $$\begin{aligned} g ~\Big |~ {\mathcal J}g \cdot H - g \sum _{i=1}^4 h_i(p,q) \frac{\partial }{ \partial x_i } g = H_5 \frac{\partial }{ \partial x_5 } g \end{aligned}$$

    Now \( \frac{\partial }{ \partial x_5 } g \ne 0\) contradicts the assumption that \(\gcd \{g,H_5\} \mid \gcd \{H_1,H_2,H_3,H_4,H_5\} = 1\). Thus \(g \in {\mathbb C}[x_1,x_2,x_3,x_4]\).

  3. (iii)

    Let r be the degree with respect to \(x_5\) of \((H_1,H_2,H_3,H_4)\). If the degree with respect to \(x_5\) of \(H_5\) is larger than \(r+1\), then \(\deg _{x_5} {\mathcal J}H_5 \cdot H = \deg _{x_5} ( \frac{\partial }{ \partial x_5 } H_5) \cdot H_5 > 2r + 1\), which contradicts (3) \(\Rightarrow \) (1) of Proposition 2.2. Take for \(\bar{H}_i\) all terms of degree r with respect to \(x_5\) of \(H_i\) if \(i \le 4\), and for \(\bar{H}_5\) all terms of degree \(r+1\) with respect to \(x_5\) of \(H_5\).

    Then the part of degree 2r with respect to \(x_5\) of \({\mathcal J}(H_1,H_2,H_3,H_4) \cdot H\) equals \({\mathcal J}(\bar{H}_1,\bar{H}_2, \bar{H}_3,\bar{H}_4) \cdot \bar{H}\) and the part of degree \(2r+1\) of \({\mathcal J}H_5 \cdot H\) equals \({\mathcal J}\bar{H}_5 \cdot \bar{H}\). Since \({\mathcal J}H_i \cdot H = 0\) for all i on account of (1) \(\Rightarrow \) (3) of Proposition 2.2, we have \({\mathcal J}\bar{H} \cdot \bar{H} = 0\).

    On account of (3) \(\Rightarrow \) (2) of Proposition 2.2, \(\deg _t \bar{H}_5(x + t\bar{H}) = 0\). Since \(x_5 \mid \bar{H}_5\), \(\deg _t (x + t\bar{H})_5 = 0\) as well. Hence \(\bar{H}_5 = 0\) and \(\deg _{x_5} H_5 \le r = \deg _{x_5} (H_1,H_2,H_3,H_4)\). By taking leading parts with respect to \(x_5\), we see that for homogeneous and hence any \(R \in {\mathbb C}[y_1,y_2,y_3,y_4]\), \(R(H_1,H_2,H_3,H_4) = 0\) implies \(R(\bar{H}_1,\bar{H}_2,\bar{H}_3,\bar{H}_4) = 0\). It follows from Proposition 1.2.9 of either van den Essen (2000) or de Bondt (2009) that

    $$\begin{aligned} {\text {rk}}{\mathcal J}(x_5^{-r}\bar{H})&= {\text {trdeg}}_{{\mathbb C}} {\mathbb C}(x_5^{-r}\bar{H}_1,x_5^{-r}\bar{H}_2,x_5^{-r}\bar{H}_3,x_5^{-r}\bar{H}_4) \\&\le {\text {trdeg}}_{{\mathbb C}} {\mathbb C}(H_1,H_2,H_3,H_4) = {\text {rk}}{\mathcal J}(H_1, H_2, H_3, H_4) = 2 \end{aligned}$$

    Since \(x_5^{-r} \bar{H}_5 = 0\) and \(x_5^{-r} \bar{H}_i \in {\mathbb C}[x_1,x_2,x_3,x_4]\), we deduce that \(x + \bar{H}\) can be regarded as a quasi-translation in dimension four (over its first four coordinates). By (i) and (iii) of Theorem 3.6, there are two independent linear forms \(l_1\) and \(l_2\) in \(x_1,x_2,x_3,x_4\) such that \(l_1(x_5^{-r}\bar{H}) = l_2(x_5^{-r}\bar{H}) = 0\). So \(l_1(\bar{H}) = l_2\bar{H}) = 0\).

    Suppose that the leading parts of p and q with respect to \(x_5\) are independent and of the same degree with respect to \(x_5\). Since \((\bar{H}_1,\bar{H}_2,\bar{H}_3, \bar{H}_4)\) is the leading part of \((H_1, H_2, H_3, H_4)\) with respect to \(x_5\), it follows that \((\bar{H}_1,\bar{H}_2,\bar{H}_3,\bar{H}_4) = h(\bar{p},\bar{q})\), where \(\bar{p}\) and \(\bar{q}\) are the leading parts of p and q respectively with respect to \(x_5\). By assumption, \(\bar{p}\) and \(\bar{q}\) are independent, so we can deduce from \(l_1(\bar{H}) = l_2(\bar{H}) = 0\) that \(l_1(h) = l_2(h) = 0\) and hence also \(l_1(H) = l_2(H) = 0\). Contradiction, thus the leading parts of p and q with respect to \(x_5\) are dependent or have different degrees with respect to \(x_5\), as desired.

  4. (iv)

    Take for a the linear invariant of \(x + H\), if it has any, and take \(a = 1\) otherwise. Let f be a non-constant invariant of \(x + H\) which can be expressed in four linear forms. We distinguish two cases.

    • \(f \in {\mathbb C}[x_1,x_2,x_3,x_4]\).

      On account of (iii) above, we can obtain that \(\deg _{x_5} p < \deg _{x_5} q\), namely by replacing p and q by linear combinations of p and q, and adapting h accordingly. If we replace H by \(T^{-1}H(Tx)\) and (fpq) by (f(Tx), p(Tx), q(Tx)) for some \(T \in {\text {GL}}_5({\mathbb C})\) such that the last column of T is equal to the fifth unit vector, the form of H does not change and neither do \(\deg _{x_5} f, \deg _{x_5} p\) and \(\deg _{x_5} q\). By choosing T appropriate, we can obtain \(-\infty \le \deg _{y_2} h_1< \deg _{y_2} h_2< \deg _{y_2} h_3 < \deg _{y_2} h_4\).

      On account of (2.3) in Proposition 2.2, \({\mathcal J}f \cdot H = 0\). By looking at the leading coefficient with respect to \(x_5\) in \({\mathcal J}f \cdot H\), we can successively deduce that \( \frac{\partial }{ \partial x_4 } f = 0\), \( \frac{\partial }{ \partial x_3 } f = 0\), \( \frac{\partial }{ \partial x_2 } f = 0\), and \(H_1 = 0\). Hence f is a polynomial over \({\mathbb C}\) in the invariant \(x_1\) of \(x + H\), and f was a polynomial over \({\mathbb C}\) in the invariant \((T^{-1})_1 x\) of \(x + H\) before replacing H by \(T^{-1}H(Tx)\). Since \(x + H\) does not have two independent linear invariants, we see that \(f \in {\mathbb C}[a]\).

    • \(f \notin {\mathbb C}[x_1,x_2,x_3,x_4]\).

      There exists a \(T \in {\text {GL}}_5({\mathbb C})\) such that \(f(Tx) \in {\mathbb C}[x_1,x_2,x_3,x_5]\) and last column of T is equal to the fifth unit vector. Just as above, we replace H by \(T^{-1}H(Tx)\) and (fpq) by (f(Tx), p(Tx), q(Tx)). So we may assume that \(f \in {\mathbb C}[x_1,x_2,x_3,x_5]\). From (2.3), it follows that \({\mathcal J}f \cdot H = 0\) and that any homogeneous part of f is an invariant of \(x + H\) as well, so we may assume that f is homogeneous.

      Since \(x + H\) has at most one linear invariant, we can use techniques in the proof of (i) of Theorem 3.6 to show that \({\text {rk}}{\mathcal J}(H_1,H_2,H_3) = 2\). Hence the ideal \(\mathfrak {b}:= (R \in {\mathbb C}[y_1,y_2,y_3] \mid R(H_1,H_2,H_3) = 0)\) has height 1, and since \({\mathbb C}[y]\) is a unique factorization domain, \(\mathfrak {b}\) is principal. Say that R is a generator of \(\mathfrak {b}\).

      By looking at the leading homogeneous part of \(f(x+H) = f\), we see that \(f(H) = 0\). Since \(H_5\) is algebraically independent of \(H_1,H_2,H_3\), we deduce that \(R(x_1,x_2,x_3) \mid f\). From (2.3), it follows \(f(x + tH) = f\), from which we can deduce that every factor of f is an invariant of \(x + H\). The case \(f \in {\mathbb C}[x_1,x_2,x_3,x_4]\) above tells us that \(R(x_1,x_2,x_3) \in {\mathbb C}[a]\), and \(f / R(x_1,x_2,x_3) \in {\mathbb C}[a]\) follows by induction on the degree of f.

  5. (v)

    From (ii), it follows that \(g \in {\mathbb C}[x_1,x_2,x_3,x_4]\). On account of (iv), \(g \in {\mathbb C}[a]\), where a is as in (iv). If H has no linear invariant, then \(\deg a = 0\). Hence \(g \in {\mathbb C}[a] = {\mathbb C}\) if H has no linear invariant. \(\square \)

Corollary 5.2

Assume \(x + H\) is a homogeneous quasi-translation over \({\mathbb C}\) in dimension 5 without linear invariants. Then \(\deg H \ge 12\). More precisely, there exists a \(T \in {\text {GL}}_5({\mathbb C})\) such that \(T^{-1} H(Tx)\) is of the form

$$\begin{aligned} T^{-1} H(Tx) = g \cdot \left( h_1(p,q),h_2(p,q),h_3(p,q),h_4(p,q),f \right) \end{aligned}$$

where h is homogeneous of degree at least 3 and (pq) is homogeneous of degree at least 4.

Proof

On account of Proposition 2.3, we may assume that \(\gcd \{H_1,H_2,H_3,H_4,H_5\} = 1\). From (ii) of Theorem 3.6, it follows that \(\dim V(H) \le 3\). From (i) and (iii) of Theorem 3.6, it follows that \({\text {rk}}{\mathcal J}H \ge 3\). Using (2) \(\Rightarrow \) (3) of Proposition 2.6, we can deduce that \(\dim V(H) = {\text {rk}}{\mathcal J}H = 3\). From (1) \(\Rightarrow \) (5) of Corollary 3.10, we obtain that H has a nonzero (projective) image apex. From Proposition 2.4, it follows that we may assume that \(e_5\) is a (projective) image apex.

From (i), (ii) and (v) of Theorem 5.1, it follows that there are invariants p and q of \(x + H\), such that H is of the form

$$\begin{aligned} H = \left( h_1(p,q),h_2(p,q),h_3(p,q),h_4(p,q),H_5 \right) \end{aligned}$$
(5.1)

such that h and (pq) are homogeneous. Furthermore, it follows from (iii) and (iv) of Theorem 5.1 that we may assume that \(\deg _{x_5} q > \deg _{x_5} p\) and \(\deg _{x_5} p > 0\) respectively.

On account of (2.3) in Proposition 2.2, \(q(x+tH) = q(x)\), and looking at the leading coefficient with respect to t gives \(q(H) = 0\). Since \(e_5\) is a projective apex of H, we even have \(q(H_1,H_2,H_3,H_4,H_5+t) = 0\). Hence \(\deg _{x_5} q \le \deg q-1\) and in case of equality, looking at the leading coefficient with respect to t in \(q(x_1,x_2,x_3,x_4,t)\) gives a linear form \(l_1\) such that \(l_1(H_1, H_2, H_3, H_4)\), which contradicts that \(x + H\) has no linear invariants. Thus \(\deg _{x_5} q \le \deg q-2\). If we combine this with the conclusion of the previous paragraph, then we obtain

$$\begin{aligned} 0< \deg _{x_5} p < \deg _{x_5} q \le \deg q - 2 \end{aligned}$$
(5.2)

So \(\deg (p,q) \ge \deg q \ge \deg _{x_5} q + 2 \ge 4\).

If \(\deg h < 3\), then there exists a linear form \(l_2 \in {\mathbb C}[x_1,x_2,x_3,x_4]\) such that \(l_2(h)\!\! = 0\) and hence also \(l_2(H_1,H_2,H_3,H_4) = 0\), which contradicts that \(x + H\) has no linear invariants. Hence \(\deg h \ge 3\).\(\square \)

The following theorem has been proved in Liu (2011) as well. The proof that is given below is somewhat less computational than that in Liu (2011).

Theorem 5.3

Assume \(x + H\) is a homogeneous quasi-translation over \({\mathbb C}\) in dimension 5 without linear invariants. Then \(\deg H \ge 15\). More precisely, \(\deg (p,q) \ge 5\), where p and q are as in Corollary 5.2.

Proof

Just like in the proof of Corollary 5.2, we may assume that H is as in (5.1) such that h is homogeneous of degree at least 3 and (pq) is homogeneous such that (5.2) is satisfied. If \(\deg q \ge 5\), then \(\deg H \ge 5 \deg h \ge 15\) indeed, because (pq) is homogeneous. Hence assume that \(\deg q \le 4\). We shall derive a contradiction.

  1. (i)

    From (iv) of Theorem 5.1, it follows that \(\deg _{x_4} p \ge 1\) and \(\deg _{x_4} q \ge 1\). From (5.2), we deduce that \(\deg _{x_5} p = 1\), \(\deg _{x_5} q = 2\) and \(\deg q = 4\). Assume without loss of generality that \(\deg _{y_2} h_4> \deg _{y_2} h_3> \deg _{y_2} h_2 > \deg _{y_2} h_1\). Then \(\deg _{x_5} H_4> \deg _{x_5} H_3> \deg _{x_5} H_2 > \deg _{x_5} H_1\).

    Let r be the leading coefficient with respect to \(x_5\) of q. On account of (2.3) in Proposition 2.2, \({\mathcal J}q \cdot H = 0\). By looking at the leading coefficient with respect to \(x_5\) of \({\mathcal J}q \cdot H = 0\), we deduce from (iii) of Theorem 5.1 that \(r \in {\mathbb C}[x_1,x_2,x_3]\). Since \(q(H_1,H_2,H_3,H_4,t) = 0\), \(r(H_1,H_2,H_3) = 0\) as well. By looking at the leading coefficient with respect to \(x_5\) in \(r(H_1,H_2,H_3)\), we see that the coefficients of \(x_3^2\) and \(x_2x_3\) of r are zero. Hence r is of the form \(r = (\lambda _1 x_1 + \lambda _2 x_2 + \lambda _3 x_3) x_1 - \lambda _4^2 x_2^2\), where \(\lambda _i \in {\mathbb C}\) for all i. Since r is irreducible, we have \(\lambda _3 \lambda _4 \ne 0\).

  2. (ii)

    We show that for invariants f of \(x + H\), we have \(\deg _{x_4,x_5} f = \deg _{x_5} f\). Let f be an invariant of \(x + H\). From (2.3) in Proposition 2.2, it follows that \(f(x + tH) = 0\). Let \(\bar{f}\) be the leading part of f with respect to \((x_4,x_5)\) and suppose that \(\deg _{x_4,x_5} f > \deg _{x_5} f\). Then \(x_4 \mid \bar{f}\), say that \(x_4^v \mid \bar{f}\) and \(x_4^{v+1} \not \mid \bar{f}\). On account of (iii) of Theorem 5.1, \(\deg _{x_5} H_4 \ge \deg _{x_5} H_5\) and \(\deg _{x_5} H_4 > \deg _{x_5} H_i\) for all \(i \le 3\). So \(\deg _{x_5} H_4 \ge \deg _{x_5} H_i - \deg _{x_5} x_i - 1\) for all \(i \ne 4\).

    If we change a factor \(x_i\) in a product into \(t H_i\), the degree with respect to \(x_5\) of that product will increase \(\deg _{x_5} tH_4 - \deg _{x_5} x_4 = \deg _{x_5} tH_4\) if \(i = 4\) and \(\deg _{x_5} t H_i - \deg _{x_5} x_i \le \deg _{x_5} tH_4 - 1\) if \(i \ne 4\). Having to do such a change v times, starting with a term \(u \in {\mathbb C}[x]\), we deduce from \(\deg _{x_5} u = \deg _{x_4,x_5} u - \deg _{x_4} u\) for terms \(u \in {\mathbb C}[x]\) that for any term and hence any polynomial \(u \in {\mathbb C}[x]\), the coefficient of \(t^v\) of \(u(x + tH)\) has degree at most

    $$\begin{aligned} b(u)&:= \deg _{x_4,x_5} u - \deg _{x_4} u + \deg _{x_4} u \cdot \deg _{x_5} tH_4 \\&\qquad +(v - \deg _{x_4} u) \cdot \left( \deg _{x_5} tH_4 - 1 \right) +\\&= v\cdot \left( \deg _{x_5} tH_4 - 1 \right) + \deg _{x_4,x_5} u \end{aligned}$$

    with respect to \(x_5\). Since b(u) is affinely linear in \(\deg _{x_4,x_5} u\) as a function on terms \(u \in {\mathbb C}[x]\), the part of degree b(f) with respect to \(x_5\) of the coefficient of \(t^v\) of \(f(x + tH)\) is equal to that of \(\bar{f}(x + tH)\).

    The part of degree v with respect to t of \(\bar{f}(x_1,x_2,x_3,x_4+t H_4,x_5)\) equals \((tH_4)^v \frac{1}{v!} \frac{\partial ^{v}}{ \partial x_4 ^{v}} \bar{f}\). By definition of v,

    $$\begin{aligned} \deg _{x_5} \left( (tH_4)^v \frac{1}{v!} \frac{\partial ^{v}}{ \partial x_4 ^{v}} \bar{f} \right)&= v\cdot \deg _{x_5} tH_4 + \deg _{x_5} \frac{\partial ^{v}}{ \partial x_4 ^{v}} \bar{f} \\&= v\cdot \deg _{x_5} tH_4 - v + \deg _{x_4,x_5} \bar{f} \\&= b(\bar{f}) = b(f) \end{aligned}$$

    so the part of degree b(f) with respect to \(x_5\) of the coefficient of \(t^v\) of \(\bar{f}(x_1,x_2,x_3,x_4+t H_4,x_5)\) is nonzero. Furthermore, we can deduce from \(\deg _{x_5} tH_4 - \deg _{x_5} x_4 = \deg _{x_5} tH_4 > \deg _{x_5} t H_i - \deg _{x_5} x_i\) for all \(i \ne 4\) and \(\deg _{x_5} \frac{\partial ^{v}}{ \partial x_4 ^{v}} \bar{f} = \deg _{x_5} \bar{f}\) that the degree with respect to \(x_5\) of \(\bar{f}(x + tH) - \bar{f}(x_1,x_2,x_3,x_4+t H_4,x_5)\) is less than b(f). Hence

    $$\begin{aligned} \deg _{x_5} f(x + tH) = \deg _{x_5} \bar{f}(x + tH) = \deg _{x_5} \bar{f}(x_1,x_2,x_3,x_4+t H_4,x_5) = b(f) \end{aligned}$$

    But the coefficient of \(t^v\) of \(f(x + tH)\) is zero, so \(v = 0\). Hence \(\deg _{x_4,x_5} f = \deg _{x_5} f\) for invariants f of \(x + H\).

    Since p and q are invariants of \(x + H\), and \(\deg _{x_5} p = 1\) and \(\deg _{x_5} q = 2\), we have \(\deg _{x_4,x_5} p = 1\) and \(\deg _{x_4,x_5} q = 2\).

  3. (ii)

    Let \(\bar{H}_i\) be the part of \(H_i\) that has degree \(2\deg h - 1\) with respect to \((x_4,x_5)\), for \(i = 1,2,3\), and \(\bar{H}_j\) the part of \(H_j\) that has degree \(2 \deg h\) with respect to \((x_4,x_5)\), for \(j = 4,5\). Then the part of degree \(4\deg h - 2\) with respect to \((x_4,x_5)\) of \({\mathcal J}(H_1,H_2,H_3,H_4) \cdot H\) equals \({\mathcal J}(\bar{H}_1,\bar{H}_2, \bar{H}_3,\bar{H}_4) \cdot \bar{H}\) and the part of degree \(4\deg h - 1\) with respect to \((x_4,x_5)\) of \({\mathcal J}H_5 \cdot H\) equals \({\mathcal J}H_5 \cdot H\). Since \({\mathcal J}H \cdot H = 0\) on account of (1) \(\Rightarrow \) (3) of Proposition 2.2, we have \({\mathcal J}\bar{H} \cdot \bar{H} = 0\).

    On account of (iii), \(\deg _{x_5} H_i = \deg _{x_4,x_5} H_i\) for all i. Consequently, \(\bar{H}_1 = \bar{H}_2 = 0\) and \(\bar{H}_3\) and \((\bar{H}_4,\bar{H}_5)\) are homogeneous with respect to \((x_4,x_5)\). We shall show that \(\bar{H}_5\) is linearly dependent over \({\mathbb C}\) of \(\bar{H}_4\), distinguishing the cases \(\bar{H}_3 = 0\) and \(\bar{H}_3 \ne 0\).

    Assume first that \(\bar{H}_3 = 0\). From (2.4), it follows that \({\mathcal J}\bar{H}\) is nilpotent. Since \(\bar{H}_1 = \bar{H}_2 = \bar{H}_3 = 0\), \({\mathcal J}_{x_4,x_5} (\bar{H}_4, \bar{H}_5)\) is nilpotent as well. From van den Essen (2000, Th. 7.2.25) (see also van den Essen and Hubbers 2000), we obtain that

    $$\begin{aligned} (\bar{H}_4, \bar{H}_5) = (bc (a x_4 - b x_5)^{2 \deg h}, ac (a x_4 - b x_5)^{2 \deg h}) \end{aligned}$$

    Since \(\deg (\bar{H}_4, \bar{H}_5) = 2 \cdot 2 \deg h\), this is only possible if a and b are constant. Hence \( \bar{H}_5 = b^{-1} a \bar{H}_4\) is linearly dependent over \({\mathbb C}\) of \(\bar{H}_4\).

    Assume next that \(\bar{H}_3 \ne 0\). Let \(\bar{q}\) be the leading and quadratic part of q with respect to \((x_4,x_5)\). Then \(\bar{q} \mid \bar{H}_4\), so \(\deg _t \bar{q}(x + t\bar{H}) \le \deg _t \bar{H}_4(x + t\bar{H}) = 0\) on account of (3) \(\Rightarrow \) (2) of Proposition 2.2. Since \(\lambda _3 \ne 0\) and the leading term with respect to \(x_5\) of q is divisible by r, we have \(\deg _{x_3,x_4,x_5} \bar{q} = \deg _{x_4,x_5} \bar{q} + 1 = 3\). The coefficient of \(t^3\) in \(\bar{q}(x+t\bar{H})\) is of the form \(x_1\bar{H}_3 s(\bar{H}_4,\bar{H}_5)\), where s is a quadratic form, which decomposes into linear factors. Since \(\bar{H}_3 \ne 0\), we deduce that \(s(\bar{H}_4,\bar{H}_5) = 0\) and that \(\bar{H}_5\) is linearly dependent over \({\mathbb C}\) of \(\bar{H}_4\).

  4. (iii)

    By way of a linear conjugation of H and the same linear conjugation of \(\bar{H}\), we can obtain \(\bar{H}_5 = 0\). If \(\bar{H}_3 = 0\), then one can compute that \(\deg _{x_5} {\mathcal J}p \cdot H = \deg _{x_5} \frac{\partial }{ \partial x_4 } p \cdot \bar{H}_4\), which gives a contradiction to (2.3) in Proposition 2.2. Hence \(\bar{H}_3 \ne 0\). From (2.4), it follows that \({\mathcal J}\bar{H}\) is nilpotent. Since \(\bar{H}_1 = \bar{H}_2 = \bar{H}_5 = 0\), \({\mathcal J}_{x_3,x_4} (\bar{H}_3, \bar{H}_4)\) is nilpotent as well. From van den Essen (2000, Th. 7.2.25) (see also van den Essen and Hubbers 2000), we obtain that

    $$\begin{aligned} (\bar{H}_3, \bar{H}_4) = \left( b g(a x_3 - b x_4) + c, ag (a x_3 - b x_4) + d \right) \end{aligned}$$

    for certain \(a,b,c \in {\mathbb C}[x_1,x_2,x_5]\). Hence \(\deg _{x_4} \bar{H}_3 = \deg _{x_4} \bar{H}_4\). Since \(\deg _{x_4} p \ge 1\) and \(\deg _{x_4,x_5} p = 1\), this is only possible if \(x_5 \mid a\) and \(\deg _{x_4} p = \deg _{x_4} \bar{q} = 1\). Since \(\bar{q}\) be the leading and quadratic part of q with respect to \((x_4,x_5)\), we deduce from \(\deg _{x_4} \bar{q} = 1\) that q has a term which is divisible by \(x_4 x_5\), but no term which is divisible by \(x_4^2\). So \(\deg _{x_4} p = \deg _{x_4} q = 1\) and the right hand side of

    $$\begin{aligned} \deg _{x_5} \left( \frac{\partial }{ \partial x_4 } p \right) = 0 \quad \text{ and } \quad \deg _{x_5} \left( \frac{\partial }{ \partial x_4 } q \right) = 1 \end{aligned}$$
    (5.3)

    follows. The left hand side of (5.3) follows from \(\deg _{x_4,x_5} p = 1\).

  5. (iv)

    Since q is an invariant of \(x + H\), we obtain from Proposition 2.2 that \(q(x+tH)\cdot H(x+tH) = q \cdot H\), and substituting \(t = tq\) gives by way of (2) \(\Rightarrow \) (1) of Proposition 2.2 that \(x + qH\) is a quasi-translation. Since the leading coefficient with respect to \(x_4\) of q and hence also \(q H_5\) is contained in \({\mathbb C}[x_1,x_2,x_3,x_5] \setminus {\mathbb C}[x_1,x_2,x_3]\), we deduce that \(\deg _{x_4} q H_5 = \deg _{x_4} \frac{\partial }{ \partial x_5 } (q H_5)\). On account of (2.4) in Proposition 2.2, we have \({\text {tr}}{\mathcal J}qH = 0\), so

    $$\begin{aligned} \deg _{x_4} H_5&= \deg _{x_4} (qH_5) - \deg _{x_4} q \\&= \deg _{x_4} {\textstyle \frac{\partial }{ \partial x_5 }} (q H_5) - \deg _{x_4} q \\&\le \deg _{x_4} (qH_1,qH_2,qH_3,qH_4) - \deg _{x_4} q \\&= \deg _{x_4} (H_1,H_2,H_3,H_4) \\&= \deg h = 3 \end{aligned}$$

    Take k minimal such that the leading coefficient with respect to \(x_4\) of p is contained in \({\mathbb C}[x_1,x_2,\ldots ,x_k]\). Since \(\deg _{x_4} H_5 \le \deg h\) and \(\deg _{x_4,x_5} p = 1\), we have \(( \frac{\partial }{ \partial x_4 })^{\deg h + 1} (H_5 \frac{\partial }{ \partial x_5 } p) = 0\). From (2.3) it follows that \({\mathcal J}p \cdot H = 0\), so that we can deduce from \(\deg _{x_4} p = \deg _{x_4} q = 1\) that

    $$\begin{aligned} 0 - 0&= \left( \frac{\partial }{ \partial x_4 } \right) ^{\deg h + 1} \left( {\mathcal J}p \cdot H \right) - \left( \frac{\partial }{ \partial x_4 } \right) ^{\deg h + 1} \left( H_5 \frac{\partial }{ \partial x_5 } p \right) \\&= (\deg h + 1)! \cdot \sum _{i=1}^4 h \left( \frac{\partial }{ \partial x_4 } p, \frac{\partial }{ \partial x_4 } q \right) \frac{\partial }{ \partial x_4 } \frac{\partial }{ \partial x_i } p \\&= (\deg h + 1)! \cdot \sum _{i=1}^k h \left( \frac{\partial }{ \partial x_4 } p, \frac{\partial }{ \partial x_4 } q \right) \frac{\partial }{ \partial x_4 } \frac{\partial }{ \partial x_i } p \end{aligned}$$

    But the right hand side has degree \(\deg _{y_2} h_k\) with respect to \(x_5\) on account of (5.3). Contradiction, so \(\deg q \ge 5\). \(\square \)

6 The kernel of the map H of quasi-translations \(x + H\)

In the beginning of the proof of Theorem 5.1, we have shown that for quasi-translations \(x + H\) which belong to case (b) in Gordan and Nöther (1876), \(\dim V(H) = {\text {rk}}{\mathcal J}H = 3\). Hence the Zariski closure of the image of H is an irreducible component of V(H) for such quasi-translations. Corollary 6.4 in this section subsequently gives us several results about quasi-translations which belong to case (b) in Gordan and Nöther (1876), among which a result about such quasi-translations without linear invariants.

First we prove some geometric results about quasi-translations to obtain Theorem 6.3. Next, we use Theorem 6.3 to prove Corollary 6.4. At last, we use Theorem 6.3 to prove Corollary 6.5, which gives us the case where \(n \ge 6\) of (iii) of Theorem 3.6.

Lemma 6.1

Assume \(x + H\) is a quasi-translation in dimension n over \({\mathbb C}\). Let \(X \subseteq {\mathbb C}^n\) be an irreducible variety such that \(H|_X\) is not the zero map, so that the Zariski closure Y of the image of \(H|_X\) is nonzero.

Then for each \(c \in X\), there exists a nonzero \(p \in Y\) such that \(g(c + tp) = g(c)\), for every invariant g of \(x + H\), where t is a new indeterminate.

Proof

Let G be the set of invariants of \(x + H\). We first prove this lemma for all c in a nonempty open subset of X. The generic property of c that we assume is that \(H(c) \ne 0\). Since \(H|_X\) is not the zero map, we are considering a nonempty open subset of X indeed. From (2.3) in Proposition 2.2, it follows that \(g(x + tH) = g(x)\) for every invariant g of \(x + H\). Hence \(g(c + t p) = g(c)\) for every \(g \in G\), if we take \(p = H(c) \ne 0\).

In the general case, consider the sets

$$\begin{aligned} Z := \{(c,p,b) \in X \times ({\mathbb C}^n)^2 \mid g(c + tp) = g(c) \hbox { for every }g \in G\hbox { and } b^\mathrm{t}p = 1 \} \end{aligned}$$

and

$$\begin{aligned} \tilde{Z} := \{(c,p,b) \in Z \mid b \text{ is } \text{ the } \text{ complex } \text{ conjugate } \text{ of } p\} \end{aligned}$$

By applying proper substitutions in t, we see that the image \(\tilde{X}\) of the projection of \(\tilde{Z}\) onto its first n coordinates is equal to that of Z. Since \(\tilde{X}\) contains an open subset of X, it follows from Lemma 3.1 that \(\tilde{X} = X\), which gives the desired result. \(\square \)

Lemma 6.2

Assume \(x + H\) is a quasi-translation in dimension n over \({\mathbb C}\). Let W be the Zariski closure of the image of H. Then for any linear subspace L of \({\mathbb C}^n\), the assertions

  1. (1)

    \(\dim L > \dim V(H)\);

  2. (2)

    every irreducible component of \(H^{-1}(L)\) has dimension greater than \(\dim V(H)\);

  3. (3)

    for each \(c \in V(H)\), there exists a nonzero \(p \in L \cap W\) such that \(H(c + tp) = 0\);

satisfy (1) \(\Rightarrow \) (2) \(\Rightarrow \) (3).

Proof

Assume that L is a linear subspace of \({\mathbb C}^n\).

(1) \({{\Rightarrow }}\) (2):

Notice that \(H^{-1}(L)\) is the zero set of \(n - \dim L\) linear forms in \(H_1, H_2, \ldots , H_n\). By applying (Hartshorne 1977, Ch. I, Prop. 7.1) \(n- \dim L - 1\) times, it follows that every irreducible component of \(H^{-1}(L)\) has dimension at least \(\dim L\), which exceeds \(\dim V(H)\) if (1) is satisfied.

(2) \({{\Rightarrow }}\) (3):

Assume \(H(c) = 0\). Since \(V(H) = H^{-1}(0) \subseteq H^{-1}(L)\), there exists an irreducible component X of \(H^{-1}(L)\) which contains c. Assuming (2), we obtain \(\dim X > \dim V(H)\), whence \(H|_X\) is not the zero map. Hence (2) \(\Rightarrow \) (3) follows from Lemma 6.1 and (1) \(\Rightarrow \) (2) of Proposition 2.2. \(\square \)

Theorem 6.3

Assume \(x + H\) is a homogeneous quasi-translation over \({\mathbb C}\). Let W be the Zariski closure of the image of H.

Then for every irreducible component X of V(H) such that \(\dim (X \cap W) \le n - \dim V(H)\), \(X \cap W\) has an irreducible component Z of dimension \(n - \dim V(H)\), such that \(c + p \in X\) for all \(c \in X\) and all p in the linear span of Z.

Proof

Let X be an irreducible component of V(H) such that \(\dim (X \cap W) \le n - \dim V(H)\). We can take a linear subspace M of \({\mathbb C}^n\), such that \(c + p \subseteq X\) for all \(c \in X\) and all \(p \in M\), because \(M = \{0\}^n\) suffices. Take M as above such that \(\dim M\) is as large as possible. Suppose first that \(\dim (M \cap X \cap W) = n - \dim V(H)\). Then \(\dim (X \cap W) = n - \dim V(H)\) as well, so that \(X \cap W\) has an irreducible component \(Z \subseteq M\) of maximum dimension \(n - \dim V(H)\). Since M contains the linear span of Z, it follows from the definition of M that Z suffices.

Suppose next that \(\dim (M \cap X \cap W) < n - \dim V(H)\). Take for L a generic linear subspace of \({\mathbb C}^n\) of dimension \(\dim V(H) + 1\), to obtain that \(\dim (L \cap (M \cap X \cap W)) = 0\) and \(\dim (L \cap (X \cap W)) \le 1\). Since X is an irreducible component of V(H), the interior \(X^{\circ }\) of X as a closed subset of V(H) is nonempty. Now take an arbitrary \(c \in X^{\circ }\). On account of (1) \(\Rightarrow \) (3) of Lemma 6.2, there exists a nonzero \(p \in L \cap W\), such that \(H(c + t p) = 0\). Since H is homogeneous, the set \(L \cap W\) is a union of lines through the origin. Hence there exists a line \(P \subseteq L \cap W\) though the origin, such that \(c + P \subseteq V(H)\).

Since \(c \in X^{\circ }\) and X is an irreducible component of V(H), we deduce that \(c + P \subseteq X\). In particular, \(P \subseteq X\), so \(P \subseteq L \cap X \cap W\). But \(\dim (L \cap X \cap W) \le 1\), so \(L \cap X \cap W\) can only contain finitely many lines through the origin, say that Q is the finite set of these lines. Since \(X^{\circ }\) is dense in X and c was arbitrary, we can deduce that

$$\begin{aligned} X = \bigcup _{P \in Q} \{ c \in X \mid c + P \subseteq X \} \end{aligned}$$

Since X is irreducible, there exists a \(P \in Q\) such that \(c + P \subseteq X\) for all \(c \in X\). Therefore we can replace M by \(M \oplus P\), which contradicts the maximality of \(\dim M\).\(\square \)

Corollary 6.4

Assume \(x + H\) is a homogeneous quasi-translation over \({\mathbb C}\), such that \(\dim V(H) \le 3\) and \(\gcd \{H_1,H_2,\ldots ,H_n\} = 1\).

Then the Zariski closure W of the image of H is contained in V(H). Furthermore, every irreducible component X of V(H) which is not equal to W is a 3-dimensional linear subspace of \({\mathbb C}^n\) for which \(\dim (X \cap W) = 2\).

If W has a nonzero (projective) apex p and V(H) has a component X which does not contain p, then W is contained in the 4-dimensional linear subspace of \({\mathbb C}^n\) which is spanned by X and p.

Proof

Using (2) \(\Rightarrow \) (3) of Proposition 2.6 and Lemma 3.2, we deduce that W is irreducible and that \(W \subseteq V(H)\). Let X be an irreducible component of V(H) which is not equal to W. Since \(X \ne W\) and \(\dim V(H) \le 3\), we have \(\dim (X \cap W) \le 2\). From \(\gcd \{H_1,H_2,\ldots , H_n\} = 1\), we deduce that \(\dim (X \cap W) \le 2 \le n - \dim V(H)\). On account of Theorem 6.3, \(X \cap W\) has an irreducible component Z of dimension \(n - \dim V(H) = 2 = \dim (X \cap W)\), such that \(c + q \in X\) for all \(c \in X\) and all q in the linear span of Z.

Notice that \(\dim X \le \dim V(H) \le 3\). Suppose that \(\dim X \le 2\). Then \(X \subseteq W\) because X is irreducible and \(\dim (X \cap W) = 2\). Since W is irreducible, this contradicts the fact that X is an irreducible component of V(H) which is not equal to W. Thus \(\dim X = 3\). Let r be the dimension of the linear span of Z. If \(r \ge 3\), then X contains the linear span of r independent \(q \in Z\), whence X is equal to the linear span of \(r = 3\) independent \(q \in Z\). If \(r \le 2\), then \(r = 2\) because \(\dim Z = 2\), and X is the linear span of two independent \(q \in Z\), and any \(c \in X \setminus Z\).

Suppose that W has a nonzero (projective) apex p and V(H) has a component X which does not contain p. Since \(\dim (X \cap W) = 2\), there are infinitely many GN-planes spanned by p and a nonzero \(q \in X \cap W\). Any proper algebraic subset of W can only have finitely many GN-planes, because W is irreducible and \(\dim W = 3\). Hence the set of infinitely many GN-planes spanned by p and a nonzero \(q \in X \cap W\) is dense in W. It follows that W is contained in the linear span of X and p.\(\square \)

Corollary 6.5

Assume \(x + H\) is a homogeneous quasi-translation over \({\mathbb C}\), such that \({\text {rk}}{\mathcal J}H + \dim V(H) \le n\). Then \(H(c + p) = 0\) for all \(c \in V(H)\) and all p in the linear span of the image of H. In particular, \(x + H\) has at least \({\text {rk}}{\mathcal J}H\) linear invariants.

Proof

The case where \(\deg H \le 0\) is easy, so assume that \(\deg H \ge 1\). Let W be the Zariski closure of the image of H and X be an irreducible component of V(H). From Lemma 3.2, it follows that W is irreducible and that \(\dim (X \cap W) \le \dim W = {\text {rk}}{\mathcal J}H \le n - \dim V(H)\). Using Theorem 6.3, we subsequently deduce that \(X \cap W\) has an irreducible component Z of dimension \(n - \dim V(H)\), such that \(c + p \in X\) for all \(c \in X\) and all p in the linear span of Z.

If \(W \nsubseteq X\), then by the irreducibility of W, \(\dim Z \le \dim (X \cap W) < \dim W = {\text {rk}}{\mathcal J}H \le n - \dim V(H)\), which contradicts \(\dim Z = n - \dim V(H)\). Hence \(W \subseteq X\), and by irreducibility of W once again, the only irreducible component of \(X \cap W\) is W. Thus \(Z = W\). Furthermore, X is an arbitrary irreducible component of V(H), so \(c + p \in V(H)\) for all \(c \in V(H)\) and all p in the linear span of W.

Consequently, \(H(c + p) = 0\) for all \(c \in V(H)\) and all p in the linear span of the image of H. Furthermore, the dimension of the linear span of the image of H does not exceed the dimension of V(H). So there are at least \(r := n - \dim V(H) \ge {\text {rk}}{\mathcal J}H\) independent linear forms \(l_1, l_2, \ldots , l_r\) which vanish on the image of H. Hence \(l_i(H) = 0\) and \(l_i(x + H) = l_i(x)\) for each i, as desired.\(\square \)