1 Introduction

Let \(N \ge 2\) be an integer. A pair of elliptic curves are said to be N-congruent, if their N-torsion subgroups are isomorphic as Galois modules. Such an isomorphism raises the Weil pairing to the power \({\varepsilon }\) for some \({\varepsilon }\in ({{\mathbb {Z}}}/N{{\mathbb {Z}}})^\times \). In this situation we say that the N-congruence has power\({\varepsilon }\). Since multiplication-by-m on one of the elliptic curves (for m an integer coprime to N) changes \({\varepsilon }\) to \(m^2 {\varepsilon }\), we are only ever interested in the class of \({\varepsilon }\in ({{\mathbb {Z}}}/N{{\mathbb {Z}}})^\times \) mod squares.

Congruences of elliptic curves are interesting for several reasons. They can be used in the modular approach to solving Diophantine equations, in studying morphisms from a genus 2 curve to an elliptic curve, and in visualising elements of the Tate-Shafarevich group. See for example [9, 15, 25].

Let \(Z(N,{\varepsilon })\) be the surface that parametrises pairs of N-congruent elliptic curves with power \({\varepsilon }\), up to simultaneous quadratic twist. This is a surface defined over \({{\mathbb {Q}}}\). We only consider \(Z(N,{\varepsilon })\) up to birational equivalence. Kani and Schanz [19, Theorem 4] classified the geometry of these surfaces, explicitly determining the pairs \((N,{\varepsilon })\) for which \(Z(N,{\varepsilon })\) is birational over \({{\mathbb {C}}}\) to either (i) a rational surface, (ii) an elliptic K3-surface, (iii) an elliptic surface with Kodaira dimension one (also known as a properly elliptic surface), or (iv) a surface of general type. In case (i) it is known that the surface is rational over \({{\mathbb {Q}}}\). We show in cases (ii) and (iii) that the surface is birational over \({{\mathbb {Q}}}\) to an elliptic surface, determining in each case a Weierstrass equation for the generic fibre as an elliptic curve over \({{\mathbb {Q}}}(T)\). One application of these explicit equations is that we are then able to use the methods of van Luijk and Kloosterman to compute the geometric Picard number of each surface.

The problem of computing \(Z(N,{\varepsilon })\) is closely related to that of computing the modular curves \(X_E(N,{\varepsilon })\) parametrising the elliptic curves N-congruent (with power \({\varepsilon }\)) to a given elliptic curve E. Equations for \(X_E(N,{\varepsilon })\), and the family of curves it parametrises, have been determined as follows. In each case the equations have coefficients in \({{\mathbb {Q}}}(a,b)\) where a and b are the coefficients of a Weierstrass equation for E. The cases \(N \le 5\) were treated by Rubin and Silverberg [26, 28, 30] for \({\varepsilon }= 1\), and by Fisher [11, 12] for \({\varepsilon }\not = 1\). The case \(N=7\) was treated by Halberstadt and Kraus [17] for \({\varepsilon }= 1\), and by Poonen, Schaefer and Stoll [25] for \({\varepsilon }\not = 1\). The case \(N=8\) was treated by Chen [8], and the cases \(N=9\) and \(N=11\) by Fisher [13, 14].

If N is not a prime power, then in principle we obtain equations for \(X_E(N,{\varepsilon })\) as a fibre product of modular curves of smaller level. Equations that are substantially better than this have been obtained in the case \((N,{\varepsilon }) = (6,1)\) by Rubin and Silverberg [27], and independently Papadopoulos [24], and in the cases \((N,{\varepsilon }) = (12,1)\) and (12, 7) by Chen [7, Chapter 7]. Chen also gives equations for \(X_E(N,{\varepsilon })\) in the cases \((N,{\varepsilon }) = (6,5)\) and (10, 1).

The equations for \(X_E(N,{\varepsilon })\) do immediately give us equations for \(Z(N,{\varepsilon })\), but unfortunately this does not always make it easy to find the elliptic fibrations. The main purpose of this note is to record the transformations that work in each case.

According to [19, Theorem 4] the surface \(Z(N,{\varepsilon })\) is rational over \({{\mathbb {C}}}\) for all \(N \le 5\), and in the cases \(N = 6,7,8\) with \({\varepsilon }= 1\). In each of these cases \(Z(N,{\varepsilon })\) is rational over \({{\mathbb {Q}}}\), as follows (see [7, Chapter 8]) from the results cited above.

In our terminology, it is part of the definition of an elliptic surface that it has a section. As we describe below, some of the cases in the next two theorems were already treated in [7, 8, 14, 21].

Theorem 1.1

The surfaces \(Z(N,{\varepsilon })\) that are birational over \({{\mathbb {C}}}\) to an elliptic K3-surface, are in fact birational over \({{\mathbb {Q}}}\) to an elliptic surface. The generic fibres are the elliptic curves over \({{\mathbb {Q}}}(T)\) with the following Weierstrass equations.

$$\begin{aligned}&Z(6,5): \qquad y^2 + 3 T (T - 2)xy + 2 (T - 1) (T + 2)^2 (T^3 - 2)y = x^3 -6 (T - 1) (T^3 - 2) x^2, \\&Z(7,3): \qquad y^2 = x^3 + (4 T^4 + 4 T^3 - 51 T^2 - 2 T - 50) x^2 + (6 T + 25) (52 T^2 - 4 T + 25) x, \\&Z(8,3):\qquad y^2 = x^3 - (3 T^2 - 7) x^2 - 4 T^2 (4 T^4 - 15) x + 4 T^2 (53 T^4 + 81 T^2 + 162), \\&Z(8,5):\qquad y^2 = x^3 - 2(T^2 + 19) x^2 - (4 T^2 - 49) (T^4 - 6 T^2 + 25) x, \\&Z(9,1):\qquad y^2 + (6 T^2 + 3 T + 2)xy + T^2 (T + 1) (4 T^3 + 9 T +9) y\\&\qquad \qquad \qquad \quad = x^3 - (16 T^4 + 12 T^3 + 9 T^2 + 6 T + 1)x^2, \\&Z(12,1):\qquad y^2 + 2 (5 T^2 + 9) xy + 96 (T^2 + 3) (T^2 + 1)^2 y = x^3 + (T^2 + 3) (11 T^2 + 1)x^2. \end{aligned}$$

Theorem 1.2

The surfaces \(Z(N,{\varepsilon })\) that are birational over \({{\mathbb {C}}}\) to a properly elliptic surface, are in fact birational over \({{\mathbb {Q}}}\) to an elliptic surface. The generic fibres are the elliptic curves over \({{\mathbb {Q}}}(T)\) with the following Weierstrass equations.

$$\begin{aligned}&Z(8,7): \qquad y^2 = x^3 + 2(4 T^6 - 15 T^4 + 14 T^2 - 1) x^2 + (T^2 - 1)^4 (16 T^4 - 24 T^2 + 1) x, \\&Z(9,2): \qquad y^2 + 3( 4 T^3 + T^2 - 2)xy + (T - 1)^3 (T^3 - 1) (4 T^3 - 3 T - 7)y\\&\qquad \qquad \qquad \quad = x^3 -3 (T + 1) (T^3 - 1) (9 T^2 + 2 T + 1)x^2, \\&Z(10,1): \qquad y^2 -(3 T - 2) (6 T^2 - 5 T - 2)xy\\&\qquad \qquad \qquad \quad -4 T^2 (T - 1)^2 (4 T^2 - 2 T - 1) (27 T^3 - 54 T^2 + 16 T + 12)y \\&\qquad \qquad \qquad \quad = x^3 + T^2 (T - 1) (27 T^3 - 54 T^2 + 16 T + 12)x^2, \\&Z(10,3): \qquad y^2 + (T^3 - 8 T^2 - 9 T - 8)xy + 2 T^2 (T^3 - T^2 - 3 T - 3) (7 T^2 + 2 T + 3)y \\&\qquad \qquad \qquad \quad = x^3 + 2 (3 T + 2) (T^3 - T^2 - 3 T - 3)x^2, \\&Z(11,1): \qquad y^2 + (T^3 + T)xy = x^3 - (4 T^5 - 17 T^4 + 30 T^3 - 18 T^2 + 4)x^2 \\&\qquad \qquad \qquad \quad + T^2 (2 T -1) (3 T^2 - 7 T + 5)^2 x. \end{aligned}$$

Although we have not made it formally part of the statements of Theorems 1.1 and 1.2, our methods do also give the moduli interpretations of these surfaces. In other words, given a point on one of these surfaces (away from a certain finite set of curves) we can determine the corresponding pair of N-congruent elliptic curves. The fact that N-congruent elliptic curves over \({{\mathbb {Q}}}\) have traces of Frobenius (at all primes of good reduction) that are congruent mod N, then provides some very useful check on our calculations. See for example [7, Chapter 9] or the calculations at the end of the Magma file accompanying this article.

The second part of the following corollary was conjectured by Kani and Schanz [19, Conjecture 5], and its proof (for \({\varepsilon }= 1\)) was completed by Zexiang Chen in his PhD thesis [7]. For N sufficiently large it is expected (with variants of this conjecture variously attributed to Frey, Mazur, Kani and Darmon) that the conclusions of the corollary are false.

Corollary 1.3

Let \(N \le 12\) and \({\varepsilon }\in ({{\mathbb {Z}}}/N{{\mathbb {Z}}})^\times \) with \({\varepsilon }=1\) if \(N=11\) or 12.

  1. (i)

    There are infinitely many pairs of non-isogenous elliptic curves over \({{\mathbb {Q}}}(T)\) that are N-congruent with power \({\varepsilon }\).

  2. (ii)

    There are infinitely many pairs of non-isogenous elliptic curves over \({{\mathbb {Q}}}\) that are N-congruent with power \({\varepsilon }\).

Moreover the j-invariants \(j_1\) and \(j_2\) of the elliptic curves in (i) (resp. (ii)) correspond to infinitely many curves (resp. points) in the \((j_1,j_2)\)-plane.

Proof

In Table 3 we list at least one \({{\mathbb {Q}}}\)-rational section of infinite order for each of the elliptic surfaces in Theorems 1.1 and 1.2. This proves the first part. The second part follows by specialising T. See the proof of [14, Theorem 1.5] for further details. The final sentence of the statement is included to guard against various “cheat” proofs, where new examples are generated from old by taking quadratic twists, or making substitutions for T. \(\square \)

Remark 1.4

If elliptic curves \(E_1\) and \(E_2\) are N-congruent with power \({\varepsilon }= -1\), then the quotient of \(E_1 \times E_2\) by the graph of the N-congruence is a principally polarised abelian surface. The surface \(Z(N,-1)\) may then be interpreted as a Hilbert modular surface, parametrising degree N morphisms from a genus 2 curve to an elliptic curve. At the outset of our work, this moduli interpretation had not been made explicit for any \(N > 5\). Remarkably however, this approach has been used by A. Kumar [21] to independently obtain results equivalent to the first two parts of Theorem 1.1 and the first three parts of Theorem 1.2. As far as we are aware, his methods do not generalise to \({\varepsilon }\not = -1\).

In Tables 1 and 2 we record some further data concerning the elliptic surfaces in Theorems 1.1 and 1.2. It is well known that K3-surfaces may admit more than one elliptic fibration. For some interesting arithmetic applications see [10, 31, 35]. In view of this, the data in Table 1 comes with the caveat that it relates to the elliptic fibration we happened to find in Theorem 1.1. Since a properly elliptic surface has a unique elliptic fibration, there is no such caveat for Table 2. We list in each case the Kodaira symbols of the singular fibres (with bracketing to indicate fibres that are Galois conjugates), the order of the torsion subgroup over \({{\mathbb {Q}}}(T)\), the ranks of the group of sections over \({{\mathbb {Q}}}(T)\) and \({\overline{{\mathbb {Q}}}}(T)\), and finally the geometric Picard number \(\rho \). The lower bounds on the ranks are immediate from the independent sections of infinite order listed in Tables 3 and 4. The upper bounds on the ranks, and the geometric Picard numbers are justified in Sect. 4.

Table 1 The elliptic K3-surfaces in Theorem 1.1
Table 2 The properly elliptic surfaces in Theorem 1.2
Table 3 Mordell-Weil generators over \({{\mathbb {Q}}}(T)\)
Table 4 Additional Mordell–Weil generators over \({\overline{{{\mathbb {Q}}}}}(T)\)

We organise the proofs of Theorems 1.1 and 1.2 as follows. The cases \(N=8\) and \(N=9\) were already treated in [8, 14], by starting from equations for \(X_E(N,{\varepsilon })\). In Sect. 2 we use a similar approach to treat the cases \((N,{\varepsilon }) = (7,3)\), (11, 1) and (12, 1). Then in Sect. 3 we treat the cases \((N,{\varepsilon }) = (6,5)\), (10, 1) and (10, 3) by exhibiting \(Z(N,{\varepsilon })\) as a degree 3 cover of a K3-surface. See the opening paragraphs of these sections for some more explanation as to how our work relates to that in Chen’s thesis [7].

The calculations described in this paper were carried out using Magma [3]. Accompanying Magma files are available from the author’s website. We assume that the reader is familiar with the standard techniques for putting an elliptic curve in Weierstrass form, as described in [4, §8], or as implemented in Magma.

2 Proofs via equations for \(X_E(N,{\varepsilon })\)

We prove Theorems 1.1 and 1.2 in the cases \((N,{\varepsilon }) = (7,3)\), (11, 1) and (12, 1). The case \((N,{\varepsilon }) = (7,3)\) was treated in [7, Section 8.2], but as this has not been published before, we include the details for completeness.

Equations for \(Z(N,{\varepsilon })\) as a surface fibred over the j-line may be obtained from the equations for \(X_E(N,{\varepsilon })\) by setting \(a = b = -27j/(4(j-1728))\). However, instead of simply setting \(a = b\), it is better to quotient out by the \({{\mathbb {G}}}_m\)-action described below. We obtain the same surface this way since a and b scale under the \({{\mathbb {G}}}_m\)-action by consecutive powers of \(\lambda \).

2.1 Case (\(N,{\varepsilon }) = (7,3)\)

Let E be the elliptic curve \(y^2 = x^3 + ax + b\). The following equation for \(X_E(7,3)\), as a quartic curve in \({{\mathbb {P}}}^2\), was computed by Poonen, Schaefer and Stoll [25, Section 7.2], building on work of Halberstadt and Kraus [17].

$$\begin{aligned} F(a,b;x,y,z)&= -\, a^2 x^4 + 2 a b x^3 y - 12 b x^3 z - 6(a^3 + 6 b^2) x^2 y^2 + 6 a x^2 z^2\\&\quad +\, 2 a^2 b x y^3 - 12 a b x y^2 z + 18 b x y z^2 + (3 a^4 + 19 a b^2) y^4\\&\quad - 2(4 a^3 + 21 b^2) y^3 z + 6 a^2 y^2 z^2 - 8 a y z^3 + 3 z^4. \end{aligned}$$

Replacing E by a quadratic twist does not change the isomorphism class of \(X_E(7,3)\). This is borne out by the identity

$$\begin{aligned} F(\lambda ^2 a,\lambda ^3 b ; \lambda x,y, \lambda ^2 z ) = \lambda ^8 F(a,b;x,y,z). \end{aligned}$$

The surface Z(7, 3) is the quotient of \(\{F = 0\} \subset {{\mathbb {A}}}^2 \times {{\mathbb {P}}}^2\) by this \({{\mathbb {G}}}_m\)-action. We have \(F(y,x_1 y;x_2,T,y) = y^2 (c y^2 + h y - f)\) where

$$\begin{aligned} c&= (T^2 + 1) (3 T^2 - 8 T + 3), \\ h&= T^3 (19 T - 42) x_1^2 + 2 T (T - 3)^2 x_1 x_2 - 6 (T^2 - 1)x_2^2, \\ f&= 36 T^2 x_1^2x_2^2 - 2(T - 6) x_1 x_2^3 + x_2^4. \end{aligned}$$

Therefore Z(7, 3) is birational to the total space (i.e., the corresponding fibred surface) for the genus one curve over \({{\mathbb {Q}}}(T)\) with equation \(Y^2 = h^2 + 4 cf\). This is a double cover of \({{\mathbb {P}}}^1\) with a rational point above \((x_1:x_2) = (1:0)\). Putting this elliptic curve in Weierstrass form, and replacing T by \((6T - 3)/(4T + 4)\), gives the equation in the statement of Theorem 1.1.

2.2 Case (\(N,{\varepsilon }) = (11,1)\)

Let E be the elliptic curve \(y^2 = x^3 + ax + b\). Equations for \(X_E(11,1)\) as a curve in \({{\mathbb {P}}}^4\) were computed in [13, Theorem 1.2]. These equations are the \(4 \times 4\) minors of the \(5 \times 5\) Hessian matrix of the cubic form

$$\begin{aligned} F(a,b;v,w,x,y,z)&= v^3 + a v^2 z - 2 a v x^2 + 4 a v x y - 6 b v x z + a v y^2 + 6 b v y z \\&\quad + a^2 v z^2 - w^3 + a w^2 z - 4 a w x^2 - 12 b w x z + a^2 w z^2 - 2 b x^3 + 3 b x^2 y \\&\quad + 2 a^2 x^2 z + 6 b x y^2 + 4 a b x z^2 + b y^3 - a^2 y^2 z + a b y z^2 + 2 b^2 z^3. \end{aligned}$$

Replacing E by a quadratic twist does not change the isomorphism class of \(X_E(11,1)\). This is borne out by the identity

$$\begin{aligned} F(\lambda ^2 a,\lambda ^3 b;\lambda ^2 v,\lambda ^2 w,\lambda x,\lambda y,z) = \lambda ^6 F(a,b;v,w,x,y,z). \end{aligned}$$

We may describe Z(11, 1) as the quotient of a 3-fold in \({{\mathbb {A}}}^2 \times {{\mathbb {P}}}^4\) by this \({{\mathbb {G}}}_m\)-action.

We start by using the discriminant condition \(4a^3 + 27b^2 \not =0\) to simplify the equations for \(X_E(11,1)\). The polynomials

$$\begin{aligned} F_1&= v z + 2 w z + x^2 - x y - y^2, \\ F_2&= a x z + b z^2 - v x + v y - 2 w x, \\ F_3&= a^2 z^2 + 2 a w z - 4 a x^2 - 12 b x z - 3 w^2, \\ F_4&= a^2 z^2 + 2 a v z - 2 a x^2 + 4 a x y + a y^2 - 6 b x z + 6 b y z + 3 v^2, \\ F_5&= 2 a^2 y z - a b z^2 - 4 a v x - 2 a v y - 6 b v z - 3 b x^2 - 12 b x y - 3 b y^2, \end{aligned}$$

are linear combinations of the derivatives of F, where the matrix implicit in taking these linear combinations is invertible if \(4a^3 + 27b^2 \not =0\). Now \(X_E(11,1)\) is defined by the \(4 \times 4\) minors of the \(5 \times 5\) Jacobian matrix (M say) of \(F_1, \ldots , F_5\).

We make the substitutions

$$\begin{aligned} a&= (4 U + 3 x_3) x_4 - 3 x_5^2, \\ b&= x_2 (x_1 + x_3) x_4 - (4 U + 3 x_3) x_4 x_5 + 2 x_5^3, \\ (v,w,x,y,z)&= ( x_2 x_4 + x_4 x_5 + x_5^2, x_3 x_4 - x_5^2, x_5, x_4, 1 ). \end{aligned}$$

We have \(4a^3 + 27b^2 = x_4 h\) for some polynomial h. We add \(x_5\) times the first row of M to the second row. We then divide all but the first row by \(x_4\). Let \(I \subset {{\mathbb {Q}}}[U,x_1,x_2,x_3,x_4,x_5]\) be the ideal generated by the \(4 \times 4\) minors of M, and

$$\begin{aligned} J = \{f \in {{\mathbb {Q}}}[U,x_1,x_2,x_3,x_4,x_5] : x_2 h f \in I \}. \end{aligned}$$

Using the Gröbner basis machinery in Magma we find that \(J \cap {{\mathbb {Q}}}[U,x_1,x_2,x_3,x_4]\) is generated by 3 homogeneous polynomials of degree 4. These define a surface in \({{\mathbb {P}}}^4\) of degree 12. By the substitution

$$\begin{aligned} T = \frac{4 (2 x_1 - x_2 + x_3) U + (x_1 x_2 - x_2^2 + x_3 x_4)}{2 (2 x_4 - x_2) U + 2 (x_1 x_2 - x_2^2 + x_3 x_4)} \end{aligned}$$

this surface is birational to the surface \(\{Q_1 = Q_2 = 0\}\) in \({{\mathbb {A}}}^1 \times {{\mathbb {P}}}^3\) where

$$\begin{aligned} Q_1&= 4 (T - 2) x_1 x_2 + 8 x_1 x_3 + (T - 2)^2 x_2^2\\&\quad + 4 (T - 2) x_2 x_3 + 2 (T^2 - T + 1) x_2 x_4 + 4 x_3^2 - 4 T x_3 x_4, \\ Q_2&= 8 x_1^2 + 16 x_1 x_3 - 4 (2 T - 1) x_1 x_4 - T (T - 2) x_2^2 - T^2 x_2 x_3\\&\quad - 2 (T^2 - T + 1) x_2 x_4 - 2 (T - 4) x_3^2 - 2 (T^2 + 4 T - 1) x_3 x_4. \end{aligned}$$

These same equations define a genus one curve in \({{\mathbb {P}}}^3\) defined over \({{\mathbb {Q}}}(T)\), with a rational point at \((x_1: x_2: x_3: x_4) = (0:0:0:1)\). Putting this elliptic curve in Weierstrass form gives the equation in the statement of Theorem 1.2.

2.3 Case \((N,{\varepsilon }) = (12,1)\)

Let E be the elliptic curve \(y^2 = x^3 + ax + b\). Equations for \(X_E(12,1)\) as a curve in \({{\mathbb {P}}}^5\) were computed in [7, Theorem 1.7.10]. These equations are \(F_0 = F_1 = F_2 = F_3 = 0\) where

$$\begin{aligned} F_0&= -X^2 Z + a X Y^2 + 6 b Y^3 - 6 a Y^2 Z - 12 Z^3, \\ F_1&= X^2 + 12 X Z + 36 Z^2 - 2 u_0 u_2 - u_1^2 + a u_2^2, \\ F_2&= 4 a X Y + 36 b Y^2 - 24 a Y Z - 2 u_0 u_1 + 2 a u_1 u_2 + b u_2^2, \\ F_3&= 8 a X Z - 4 a^2 Y^2 - u_0^2 + 2 b u_1 u_2. \end{aligned}$$

These polynomials satisfy

$$\begin{aligned} F_i(\lambda ^2 a,\lambda ^3 b;\lambda X,Y,\lambda Z,\lambda ^2 u_0,\lambda u_1,u_2) = \lambda ^{m_i} F_i(a,b;X,Y,Z,u_0,u_1,u_2) \end{aligned}$$

where \((m_0,m_1,m_2,m_3) = (3,2,3,4)\). Again, it is our aim to quotient out by this \({{\mathbb {G}}}_m\)-action. We do this by setting \((X + 6 Z)Y = u_2^2\), which works since the left hand side and right hand side scale by consecutive powers of \(\lambda \). Specifically, we substitute \((X,Y,Z,u_0,u_1,u_2) = (x^2 - 6 y,1,y,v x,w x,x)\) and then solve for a and b so that the first two equations are satisfied. In the remaining two equations we substitute

$$\begin{aligned} v = 2 (w - 2 y) y + \frac{T + 1}{T - 1} (x^2 (y + 1) - (w + 2 y)^2). \end{aligned}$$

The resultant of these two equations with respect to w is \(f(T) x^{14} y^{2} g(x,y)^2 h(x,{\widetilde{y}})\) where f(T) is a rational function in T, \(g(x,y) = x^6 (y + 1) - 9 y^2 (x^2 + 4)^2\),

$$\begin{aligned} h(x,y)&= (T + 1)^2 x^2 y^2 + (T + 2) (T^2 + 3)^2 x^2 \\&\quad - 4 (T - 1) (T + 3)^2 x y + 4 (T + 3)^2 y^2 + 12 T (T + 1)^2 (T + 3)^2 \end{aligned}$$

and \({\widetilde{y}} = (144 (T + 1) y + (T + 3)^2 ((T - 3) x^2 + 12 (T + 1)))/(8 (T + 3) x)\). Therefore Z(12, 1) is birational to the total space for the genus one curve \(C = \{h = 0\}\) in \({{\mathbb {A}}}^2\) defined over \({{\mathbb {Q}}}(T)\). Replacing x by \(2(T + 3)/(T^2 + 3)x\), and completing the square in y shows that C has equation

$$\begin{aligned} Y^2 = -(T + 2) x^4 - (4 T^3 + 5 T^2 + 6 T + 9) x^2 - 3 T (T^2 + 3)^2. \end{aligned}$$
(1)

This gives a genus one fibration on Z(12, 1) defined over \({{\mathbb {Q}}}\), but without a \({{\mathbb {Q}}}\)-rational section. Indeed the fibres with \(T > 0\) have no real points.

We now find another genus one fibration that does have a \({{\mathbb {Q}}}\)-rational section. Let \(F(x_1,x_2,x_3)\) be the unique homogeneous polynomial of degree 6 with the property that F(xT, 1) is the right hand side of (1). Then F is the discriminant of the following quadratic in T.

$$\begin{aligned} x_1^2 x_2 + (T^2 + 2) x_1^2 x_3 + 2 T x_1 x_2^2 - 2 T x_1 x_2 x_3 + T^2 x_2^3 + 3 x_2^2 x_3 + 3 T^2 x_2 x_3^2 + 9 x_3^3=0 \end{aligned}$$

This same equation defines a genus one curve in \({{\mathbb {P}}}^2\) defined over \({{\mathbb {Q}}}(T)\), with a rational point at \((x_1: x_2: x_3) = (1:0:0)\). Putting this elliptic curve in Weierstrass form gives the equation in the statement of Theorem 1.1.

3 Degree 3 covers of K3-surfaces

We prove Theorems 1.1 and 1.2 in the cases \((N,{\varepsilon }) = (6,5)\), (10, 1) and (10, 3). In the first of these cases, Chen’s equations for \(X_E(6,5)\) already give a genus one fibration on Z(6, 5), but one without a section. The content of Theorem 1.1 in this case is that we have found another genus one fibration that does have a section.

For N an odd integer, let \(Z^*(N,{\varepsilon })\) be the double cover of \(Z(N,{\varepsilon })\) that parametrises pairs of elliptic curves that are N-congruent with power \({\varepsilon }\), and additionally have the property that their ratio of discriminants is a square.

Theorem 3.1

If \((N,{\varepsilon }) = (3,2)\), (5, 1) or (5, 2) then \(Z^*(N,{\varepsilon })\) is a double cover of \({{\mathbb {P}}}^2\), ramified over the union of two cuspidal cubics, with equation

$$\begin{aligned} y^2 = F_{+}(u,v,w) F_{-}(u,v,w) \end{aligned}$$
(2)

where

$$\begin{aligned} Z^*(3,2): \quad F_{\pm }&= u (u + 3 v \pm w)^2 + 4 v^3, \\ Z^*(5,1): \quad F_{\pm }&= u(u^2 - 11 u v - v^2)+ w^2 (12 u + v) \pm 2 w (3 u^2 - 4 u v + 4 w^2), \\ Z^*(5,2): \quad F_{\pm }&= u^2 (11 v + 8 w) + w^2 (8 u - v + 4 w) \pm 2 u (2 v - w)(4u - v + 4w). \end{aligned}$$

In particular, in each of these cases \(Z^*(N,{\varepsilon })\) is a K3-surface.

Proof

Let E be the elliptic curve \(y^2 = x^3 + ax + b\). We put \(\Delta = - 4 a^3 - 27 b^2\), and define polynomials

$$\begin{aligned} f(x)&= x^3 + a x + b, \\ g(x)&= 3 a x^4 + 18 b x^3 - 6 a^2 x^2 - 6 a b x - a^3 - 9 b^2, \\ h(x)&= 3 a x^2 + 9 b x - a^2, \\ j(x)&= 27 b x^3 - 18 a^2 x^2 - 27 a b x - 2 a^3 - 27 b^2. \end{aligned}$$

If we assign the variables xab weights 1, 2, 3, then each of these polynomials is homogeneous. We note that \(j^2 = -4 h^3 - 27 \Delta f^2\).

Case\(({{\varvec{N}}},{\varvec{{\varepsilon }}}) = \mathbf{(3,2)}\). The following equations for the family of curves parametrised by \(X_E(3,2)\) are taken from [11, Section 13]. Starting from the Klein formFootnote 1

$$\begin{aligned} D(\xi ,\eta ) = -27 a \xi ^4 - 54 b \xi ^3 \eta - 18 a^2 \xi ^2 \eta ^2 - 54 a b \xi \eta ^3 + (a^3 - 27 b^2) \eta ^4, \end{aligned}$$

we define

$$\begin{aligned} A(\xi ,\eta ) = \frac{1}{108} \left| \begin{array}{cc} D_{\xi \xi } &{} D_{\xi \eta } \\ D_{\eta \xi } &{} D_{\eta \eta } \end{array} \right| , \quad \text { and } \quad B(\xi ,\eta ) = \frac{1}{36} \left| \begin{array}{cc} D_{\xi } &{} D_{\eta } \\ A_{\xi } &{} A_{\eta } \end{array} \right| , \end{aligned}$$

where the subscripts denote partial derivatives. These forms satisfy the syzygy

$$\begin{aligned} -4 A^3 - 27 B^2 = 16 (4 a^3 + 27 b^2)^2 D^3. \end{aligned}$$
(3)

The family of elliptic curves 3-congruent to E with power \({\varepsilon }= 2\) is given by

$$\begin{aligned} y^2 = x^3 + A(\xi ,\eta ) x + B(\xi ,\eta ). \end{aligned}$$

We dehomogenise by putting \((\xi ,\eta )=(x,1)\). Then \(D = f_x^3 - 27 f^2 = j - 3 f_x h\) where \(f_x = 3x^2 + a\). The quantities \((u,v,r,s) = (D, f_x h, h^3,3^6 \Delta f^4)\) are related by

$$\begin{aligned} (4 r + (u + 3 v)^2) (r u - v^3) = r s. \end{aligned}$$
(4)

As we verify in Remark 3.2 below, this is an equation for Z(3, 2) in \({{\mathbb {P}}}(1,1,2,3)\) where the coordinates uvrs have weights 1, 1, 2, 3. We see by (3) that, up to squared factors, the ratio of discriminants is s / u. We substitute \(s = u w^2\) in (4) to give a quadratic in r whose discriminant is the polynomial \(F_{+} F_{-}\) in the statement of the theorem.

Case\(({{\varvec{N}}},{\varvec{{\varepsilon }}}) = \mathbf{(5,1)}\). The following equations for the family of curves parametrised by \(X_E(5,1)\) are taken from [11, Section 13]. Starting from the Klein formFootnote 2

$$\begin{aligned} D(\lambda ,\mu )&= \lambda ^{12} + 22 a \lambda ^{10} \mu ^2 + 220 b \lambda ^9 \mu ^3 - 165 a^2 \lambda ^8 \mu ^4 - 528 a b \lambda ^7 \mu ^5 \\&\quad - 220 (a^3 + 12 b^2) \lambda ^6 \mu ^6 + 264 a^2 b \lambda ^5 \mu ^7 - 165 a (5 a^3 + 32 b^2) \lambda ^4 \mu ^8 \\&\quad - 880 b (3 a^3 + 20 b^2) \lambda ^3 \mu ^9 + 22 a^2 (25 a^3 + 168 b^2) \lambda ^2 \mu ^{10} \\&\quad + 20 (19 a^4 b + 128 a b^3) \lambda \mu ^{11} + (125 a^6 + 1792 a^3 b^2 + 6400 b^4) \mu ^{12}, \end{aligned}$$

we define

$$\begin{aligned} A(\lambda ,\mu ) = \frac{1}{5808} \left| \begin{array}{cc} D_{\lambda \lambda } &{} D_{\lambda \mu } \\ D_{\mu \lambda } &{} D_{\mu \mu } \end{array} \right| , \quad \text { and } \quad B(\lambda ,\mu ) = \frac{1}{360} \left| \begin{array}{cc} D_{\lambda } &{} D_{\mu } \\ A_{\lambda } &{} A_{\mu } \end{array} \right| , \end{aligned}$$

where the subscripts denote partial derivatives. These forms satisfy the syzygy

$$\begin{aligned} 4 A^3 + 27 B^2 = (4 a^3 + 27 b^2) D^5. \end{aligned}$$
(5)

The family of elliptic curves 5-congruent to E with power \({\varepsilon }= 1\) is given by

$$\begin{aligned} y^2 = x^3 + A(\lambda ,\mu ) x + B(\lambda ,\mu ). \end{aligned}$$

We dehomogenise by putting \((\lambda ,\mu )=(x,1)\). Then

$$\begin{aligned} D = 4 kf - 3(f^2 + g)^2 + 32\Delta (f^2 + g), \end{aligned}$$

where \(k(x) = f^3 + f j + 4 \Delta f + 3 g(x f_x - 2 f) = x^9 + 12 a x^7 + 84 b x^6 + \cdots \)

The quantities \((t,u,v,r,s) = (4 f,2 (f^2 + g),16 \Delta ,4 k,D)\) are related by

$$\begin{aligned} \begin{aligned} r^2 + s t^2&= u (u^2 - 11 u v - v^2) + (12 u + v) s, \\ r t&= 3 u^2 - 4 u v + 4 s. \end{aligned} \end{aligned}$$
(6)

These are equations for Z(5, 1) in \({{\mathbb {P}}}(1,2,2,3,4)\) where the coordinates tuvrs have weights 1, 2, 2, 3, 4. We see by (5) that, up to squared factors, the ratio of discriminants is s. Putting \(s= w^2\) we obtain from (6) the equation

$$\begin{aligned} (r^2 - s t^2)^2 = (r^2 + s t^2)^2 - 4 s(r t)^2 = F_{+}(u,v,w) F_{-}(u,v,w) \end{aligned}$$

where \(F_{\pm }\) are the polynomials in the statement of the theorem.

Case\(({{\varvec{N}}},{\varvec{{\varepsilon }}}) = \mathbf{(5,2)}\). The following equations for the family of curves parametrised by \(X_E(5,2)\) are taken from [12, Theorem 5.8]. Starting from the Klein formFootnote 3

$$\begin{aligned} D(\lambda ,\mu )&= (125 a^3 - 432 b^2) \lambda ^{12} + 2430 a^2 b \lambda ^{11} \mu - 22 a (25 a^3 - 378 b^2) \lambda ^{10} \mu ^2 \\&\quad - 110 b (11 a^3 - 108 b^2) \lambda ^9 \mu ^3 - 165 a^2 (5 a^3 - 27 b^2) \lambda ^8 \mu ^4 - 132 a b (53 a^3 - 189 b^2) \lambda ^7 \mu ^5 \\&\quad + 220 (a^6 - 123 a^3 b^2 + 81 b^4) \lambda ^6 \mu ^6 + 132 a^2 b (19 a^3 - 297 b^2) \lambda ^5 \mu ^7 \\&\quad - 165 (a^7 - 26 a^4 b^2 + 189 a b^4) \lambda ^4 \mu ^8 - 110 (3 a^6 b - 34 a^3 b^3 + 135 b^5) \lambda ^3 \mu ^9 \\&\quad - 22 a^2 (a^3 - 3 b^2) (a^3 + 27 b^2) \lambda ^2 \mu ^{10} - 10 a b (5 a^6 + 82 a^3 b^2 + 189 b^4) \lambda \mu ^{11} \\&\quad + (a^9 - a^6 b^2 - 181 a^3 b^4 - 675 b^6) \mu ^{12}, \end{aligned}$$

we define

$$\begin{aligned} A(\lambda ,\mu ) = \frac{1}{1452} \left| \begin{array}{cc} D_{\lambda \lambda } &{} D_{\lambda \mu } \\ D_{\mu \lambda } &{} D_{\mu \mu } \end{array} \right| , \quad \text { and } \quad B(\lambda ,\mu ) = \frac{-1\,\,}{180} \left| \begin{array}{cc} D_{\lambda } &{} D_{\mu } \\ A_{\lambda } &{} A_{\mu } \end{array} \right| , \end{aligned}$$

where the subscripts denote partial derivatives. These forms satisfy the syzygy

$$\begin{aligned} -4 A^3 - 27 B^2 = 16(4 a^3 + 27 b^2)^2 D^5. \end{aligned}$$
(7)

The family of elliptic curves 5-congruent to E with power \({\varepsilon }= 2\) is given by

$$\begin{aligned} y^2 = x^3 + A(\lambda ,\mu ) x + B(\lambda ,\mu ). \end{aligned}$$

We dehomogenise by putting \((\lambda ,\mu )=(x,1)\). Then

$$\begin{aligned} D = 16 \Delta f^4 - g^3 + 4 (2 g^3 - g^2 j - 4 \Delta f^2 g). \end{aligned}$$
(8)

The quantities \((r,s,v,w) = (\Delta f^4,\Delta f^2 g,g^3,2 g^3 - g^2 j - 4 \Delta f^2 g)\) are related by

$$\begin{aligned} r (4 s - 2 v + w)^2 + 27 r s v + s w^2 - s^2 (v - 4 w) = 0. \end{aligned}$$
(9)

This is an equation for Z(5, 2) as a cubic surface in \({{\mathbb {P}}}^3\). We see from (7) and (8) that, up to squared factors, the ratio of discriminants is \(r (16 r - v + 4 w)\). Putting \(r (16 r - v + 4 w) = (4 r - u)^2\), where u is a new variable, and using this equation to eliminate r from (9), we obtain a quadratic in s whose discriminant is the polynomial \(F_{+}F_{-}\) in the statement of the theorem.

For the last part of Theorem 3.1, we note that one of the standard constructions of a K3-surface is as a double cover of \({{\mathbb {P}}}^2\) ramified over a non-singular sextic curve. In our case the sextic is singular, but the surface is still (birational to) a K3-surface. See for example [18, Chapter 1, Example 1.3(iv)]. \(\square \)

Remark 3.2

Let \((N,{\varepsilon }) = (3,2)\), (5, 1) or (5, 2). We saw in the proof of Theorem 3.1 that one model for \(Z(N,{\varepsilon })\) is the weighted projective plane \({{\mathbb {P}}}(1,2,3)\) where the co-ordinates xab have weights 1, 2, 3. We mapped this to another model for \(Z(N,{\varepsilon })\) defined by (4), (6) or (9). The inverse maps are as follows.

$$\begin{aligned}&(3,2) \qquad \left\{ \begin{aligned} x&= r + v^2, \\ a&= -3 r (r + u v + 2 v^2), \\ b&= r (u + 3 v) (r u + v^3) + 2 r^2 (r + 3 v^2), \end{aligned} \right. \\&(5,1) \qquad \left\{ \begin{aligned} x&= 32 u - v + 5 t^2, \\ a&= -3 (8 u - v) (32 u - v) - 288 r t + 30 (28 u + v) t^2 - 75 t^4, \\ b&= -2 (32 u - v)^2 (4 u + v) - 144 (32 u - v) rt + 6 (32 u - v) (88 u - 5 v) t^2 \\&\qquad \qquad + 1008 r t^3 - 150 (28 u + v) t^4 + 250 t^6, \end{aligned} \right. \\&(5,2) \qquad \left\{ \begin{aligned} x&= 4 r s + 4 r v + r w - s^2, \\ a&= 3 (8 r s^3 + 4 r s^2 v + 6 r s^2 w + r s w^2 - s^4), \\ b&= r^2 s (16 s^3 - 8 s^2 v - 24 s^2 w - 40 s v w - 15 s w^2 + 4 v w^2 - 2 w^3) \\&\qquad \qquad + r s^3 (24 s^2 + 8 s v + 34 s w + 7 w^2) - 2 s^6. \end{aligned} \right. \end{aligned}$$

Remark 3.3

There are two naturally defined involutions on the K3-surfaces in Theorem 3.1. The first switches the sign of y, and corresponds to swapping over the pair of N-congruent elliptic curves. The second is given on \(Z^*(3,1)\) and \(Z^*(5,1)\) by switching the sign of w, and on \(Z^*(5,2)\) by \((u,v,w,y) \mapsto ({\widetilde{u}},v,w,({\widetilde{u}}/u)^2y)\) where \({\widetilde{u}} = u(v - 4w)/(8u - (v-4w))\). This second involution switches the choice of square root for the ratio of discriminants. The two involutions commute, and the second swaps over the curves \(F_+=0\) and \(F_-=0\).

Remark 3.4

For a suitable parametrisation of the cuspidal cubic \(F_+=0\), we obtain a family of elliptic curves with j-invariant

$$\begin{aligned}&(3,1):&j&= 27 (T - 3)^3 (T + 1)^3/T^3, \\&(5,1):&j&= (T + 5)^3 (T^2 - 5)^3 (T^2 + 5 T + 10)^3/(T^2 + 5 T + 5)^5, \\&(5,2):&j&= 125 T (2 T + 1)^3 (2 T^2 + 7 T + 8)^3/(T^2 + T - 1)^5. \end{aligned}$$

These correspond to \(X_\mathrm{s}^+(3)\), \(X_\mathrm{s}^+(5)\) and \(X_\mathrm{ns}^+(5)\), where \(X_\mathrm{s}^+(N)\) and \(X_\mathrm{ns}^+(N)\) are the modular curves associated to the normaliser of a split or non-split Cartan subgroup of level N. We may compute \(X_\mathrm{s}^+(N)\) as the quotient of \(X_0(N^2)\) by the Fricke involution, whereas the formula for \(X_\mathrm{ns}^+(5)\) is taken from [6, Corollary 5.3]. The use of these modular curves to construct pairs of N-congruent elliptic curves is described further in [16].

Let N be an odd integer and let \({\varepsilon }\in ({{\mathbb {Z}}}/2N{{\mathbb {Z}}})^\times \). Then \(X(2N) \rightarrow X(N)\), and hence also \(X_E(2N,{\varepsilon }) \rightarrow X_E(N,{\varepsilon })\), is geometrically a Galois covering with Galois group \({\text {PSL}}_2({{\mathbb {Z}}}/2{{\mathbb {Z}}}) \cong S_3\). Since the discriminant of an elliptic curve is equal (up to a constant factor) to the discriminant of its 2-division polynomial, elliptic curves which are 2-congruent have ratio of discriminants a square. It follows that \(Z(2N,{\varepsilon }) \rightarrow Z^*(N,{\varepsilon })\) is a degree 3 cover. In the cases \((2N,{\varepsilon })= (6,5)\), (10, 1) and (10, 3) it turns out that the surface \(Z(2N,{\varepsilon })\) has an elliptic fibration. The pushforward of a fibre gives a divisor class D on the K3-surface \(Z^*(N,{\varepsilon })\) with \(D^2 = 2\). Using this divisor class D we may write \(Z^*(N,{\varepsilon })\) as a double cover of \({{\mathbb {P}}}^2\). We have arranged (with the benefit of hindsight) that the equations in Theorem 3.1 write \(Z^*(N,{\varepsilon })\) as a double cover of \({{\mathbb {P}}}^2\) in exactly this way.

The equations for \(Z(2N,{\varepsilon })\) in Theorems 1.1 and 1.2 may be obtained from the equations for \(Z^*(N,{\varepsilon })\) in Theorem 3.1 as follows. The tangent line to a general point on the cuspidal cubic \(F_{+}(u,v,w) = 0\) has equation:

$$\begin{aligned} (2N,{\varepsilon })&= (6,5)&(T^3 - 1) u + 3 (T - 1) v - w&=0, \end{aligned}$$
(10)
$$\begin{aligned} (2N,{\varepsilon })&= (10,1)&(T - 2)u - T (T - 1)^2 v + 2 (T - 1) w&=0, \end{aligned}$$
(11)
$$\begin{aligned} (2N,{\varepsilon })&= (10,3)&T^3 u - (T + 1)v - T^2 w&= 0. \end{aligned}$$
(12)

We parametrise this line, and substitute into the right hand side of the equation \(y^2 = F_{+}(u,v,w) F_{-}(u,v,w)\). After cancelling a squared factor (which arises since we chose a tangent line) the right hand side is a binary quartic with a linear factor. We now have the equation for a genus one curve over \({{\mathbb {Q}}}(T)\) with a rational point. Putting this elliptic curve in Weierstrass form gives the equations for Z(6, 5), Z(10, 1) and Z(10, 3) in Theorems 1.1 and 1.2.

It remains to show that the degree 3 covers of \(Z^*(N,{\varepsilon })\) defined by (10), (11) and (12) are indeed birational to \(Z(2N,{\varepsilon })\). To do this we must check that a point on \(Z^*(N,{\varepsilon })\) lifts to the degree 3 cover precisely when the corresponding pair of elliptic curves are 2-congruent. We use the following lemma.

Lemma 3.5

Let K be a field of characteristic not 2 or 3. Elliptic curves \(E_1\) and \(E_2\) over K with j-invariants \(j_1\) and \(j_2\), with \(j_1,j_2 \not \in \{0,1728\}\), are 2-congruent if and only if there exist \(m,x \in K\) satisfying \((j_1 - 1728)(j_2 - 1728) = m^2\) and

$$\begin{aligned} x^3 - 3 j_1 j_2 x - 2 j_1 j_2 (m + 1728) = 0. \end{aligned}$$

Proof

This follows from the formulae in [28] or [11, Sections 8 and 13] by a generic calculation. \(\square \)

We illustrate the use of Lemma 3.5 in the case \((2N,{\varepsilon })=(10,3)\), the other cases being similar, and treated in the accompanying Magma file. Above each point \((u:v:w) \in {{\mathbb {P}}}^2\) there are a pair of points on \(Z^*(5,2)\) possibly defined over a quadratic extension. These points correspond to a pair of elliptic curves, say with j-invariants \(j_1\) and \(j_2\). A calculation using the formulae in Remark 3.2 shows that, for m a suitable choice of square root of \((j_1 - 1728)(j_2 - 1728)\), we have

$$\begin{aligned} j_1 j_2&= G_6(u,v,w) H(u,v,w)^2 \\ j_1 j_2 (m + 1728)&= G_9(u,v,w) H(u,v,w)^3 \end{aligned}$$

where

$$\begin{aligned} G_6(u,v,w)&= 640 u^4 v^2 - 768 u^4 v w - 72 u^3 v^3 - 240 u^3 v^2 w + \cdots \\ G_9(u,v,w)&= 6912 u^7 v^2 - 1376 u^6 v^3 - 14976 u^6 v^2 w + \cdots \end{aligned}$$

are irreducible homogeneous polynomials of degrees 6 and 9, and \(H \in {{\mathbb {Q}}}(u,v,w)\) is a rational function. Finally we claim that the polynomials

$$\begin{aligned} X^3 - 3 G_6(u,v,w) X - 2 G_9(u,v,w) = 0, \end{aligned}$$
(13)

arising from Lemma 3.5, and

$$\begin{aligned} u T^3 - w T^2 - v T - v = 0, \end{aligned}$$
(14)

appearing in (12), define the same cubic extension. Indeed we find by computer algebra that if (14) has root \(T_0\) then (13) has root

$$\begin{aligned} X_0&= 3 u^2 (8 u - 3 v - 4 w) T_0^2 + 12 u (2 u v - 4 u w + v w) T_0 \\&\qquad - 16 u^2 v + 6 u v^2 + 8 u w^2 - v w^2 + 4 w^3. \end{aligned}$$

4 Computing the Picard numbers

Let \(E/{{\mathbb {Q}}}(T)\) be one of the elliptic curves in Theorems 1.1 and 1.2. We write \(X \rightarrow {{\mathbb {P}}}^1\) for the minimal fibred surface with generic fibre E. The reduction of E mod p is an elliptic curve \(E_p/{{\mathbb {F}}}_p(T)\), and the reduction of X mod p is a surface \(X_p/{{\mathbb {F}}}_p\). We will always take p to be a prime of good reduction.

Let \({\overline{X}} = X \times _{{\mathbb {Q}}}{\overline{{\mathbb {Q}}}}\) and \({\overline{X}}_p = X_p \times _{{{\mathbb {F}}}_p} {\overline{{\mathbb {F}}}}_p\). We write \({\text {NS}}\) for the Néron-Severi group. The Shioda-Tate formula [29, Corollary 5.3] tells us that

$$\begin{aligned} {{\text {rank}}}\, E({\overline{{\mathbb {Q}}}}(T)) + 2 + \textstyle \sum _{t \in {{\mathbb {P}}}^1({\overline{{\mathbb {Q}}}})} (m_t - 1) = \mathrm{rank~NS}({\overline{X}}), \end{aligned}$$
(15)

and

$$\begin{aligned} {{\text {rank}}}\, E_p({\overline{{\mathbb {F}}}}_p(T)) + 2 + \textstyle \sum _{t \in {{\mathbb {P}}}^1({\overline{{\mathbb {F}}}}_p)} (m_t - 1) = {{\text {rank}}}{\text {NS}}({\overline{X}}_p), \end{aligned}$$
(16)

where \(m_t\) is the number of irreducible components in the fibre above t. We write \(\rho \) and \(\rho _p\) for the numbers on the right of (15) and (16). These are the geometric Picard numbers of X and \(X_p\). The sections exhibited in Tables 3 and 4 give a lower bound for \({{\text {rank}}}\, E({\overline{{\mathbb {Q}}}}(T))\) and hence by (15) a lower bound for \(\rho \). These lower bounds are exactly the values recorded in Tables 1 and 2.

Let \(X \rightarrow {{\mathbb {P}}}^1\) be a minimal elliptic surface with non-constant j-invariant, and let \(m = \chi ({{\mathcal {O}}}_X)\) be the Euler characteristic of the structure sheaf. This may be computed from the fact [22, III.4.4 and IV.3.3] that the sum of the Euler numbers of the singular fibres is 12m. By [22, Lemma IV.1.1] the Hodge diamond of X is

The surfaces in Theorem 1.1 have \(m=2\) and those in Theorem 1.2 have \(m=3\). This may be checked by computing the Euler numbers, but more simply corresponds to the fact that m is the least integer such that the coefficients \(a_1, \ldots , a_6\) of the Weierstrass equations satisfy \(\deg (a_i) \le m i\) for all i. To tie in with [19, Theorem 4], we note that \(p_g = h^{2,0} = m- 1\). By the Lefschetz theorem on (1, 1)-classes (see for example [2, Chapter IV, Theorem 2.13]) we have \(\rho \le h^{1,1} = 10 m\). This already determines \(\rho \) in all cases with \(N \le 8\). It remains for us to improve this upper bound by 1 in the cases \((N,{\varepsilon }) = (9,1),(12,1),(9,2)\), and to improve it by 2 in the cases \((N,{\varepsilon }) = (10,1),(10,3),(11,1)\).

The main tool we wish to use (see [33, Proposition 6.2]) is that there is an injective map \({\text {NS}}({\overline{X}}) \rightarrow {\text {NS}}({\overline{X}}_p)\) that preserves the intersection pairing.

Let \(f_p(x)\) be the characteristic polynomial of Frobenius acting on \(H^2_{{\acute{e}\text {t}}}({\overline{X}}_p,{{\mathbb {Q}}}_\ell (1))\), normalised so that \(f_p(0)=1\). This is a polynomial of degree \(b_{2} = 12m - 2\), independent of the choice of prime \(\ell \not =p\). By the Weil conjectures it satisfies the functional equation \(f_p(x) = \pm x^{b_2} f_p(1/x)\). The polynomials \(f_p(x)\) may be computed from the numbers \(n_r = | X_p({{\mathbb {F}}}_{p^r}) |\) using the Lefschetz trace formula. See for example [34, Section 3], where \(f_p\) is denoted \({\widetilde{f}}_p\). We used both the functional equation and the Magma function FrobeniusActionOnTrivialLattice to limit how many \(n_r\) we had to compute. The polynomials \(f_p(x)\) for two primes of good reduction (carefully chosen with the calculations at the end of this section in mind) are recorded in Table 5, and again in the accompanying Magma file.

Table 5 Bounding the geometric Picard number

Let \(\Delta _p \in {{\mathbb {Q}}}^\times /({{\mathbb {Q}}}^\times )^2\) be the absolute value of the determinant of the intersection pairing on \({\text {NS}}({\overline{X}}_p)\). It may be computed using either of the following two lemmas. We emphasise at this point that our results in Tables 1 and 2 are unconditional. In particular they do not depend on any unproven cases of the Tate conjecture.

Lemma 4.1

Write \(f_p(x) = g_p(x) h_p(x)\) where every root of \(g_p\) is a root of unity, and no root of \(h_p\) is a root of unity. Then \(\rho _p \le \deg g_p\) and in the case of equality we have \(\Delta _p \equiv h_p(1) h_p(-1) \mod ({{\mathbb {Q}}}^\times )^2\).

Proof

The first part is described for example in [34, Corollary 2.3]. The Tate conjecture predicts that this inequality is always an equality, and this has been proved for K3-surfaces. See [18, Section 17.3] or [32] for the history of this problem and further references. The formula for \(\Delta _p\) is a small refinement of a result of Kloosterman, that in turn depends on known cases of the Artin-Tate conjecture.

Let \(F_k(x) = \prod (1 - p^k \alpha _i^k x)\) where \(f_p(x) = \prod (1- \alpha _ix)\). The result of Kloosterman [20, Proposition 4.7], is that if k is an even integer with \(\alpha _i^k=1\) for all \(\alpha _i\) which are roots of unity, then

$$\begin{aligned} \Delta _p = \lim _{s \rightarrow 1} \frac{F_k(p^{-ks})}{(1-p^{k(1-s)})^\rho }. \end{aligned}$$
(17)

Let \(H_k(x) = \prod (1- p^k \beta _i^k x)\) where \(h_p(x) = \prod (1- \beta _ix)\). Then \(F_k(x) = (1-p^k x)^\rho H_k(x)\), and (17) becomes

$$\begin{aligned} \Delta _p = H_k(p^{-k}) = \prod _i (\beta _i^k-1) = h_p(1)h_p(-1) \prod _{{\begin{array}{c} d \mid k\\ d > 2 \end{array}}} \prod _i \Phi _d(\beta _i) \end{aligned}$$

where \(\Phi _d\) is the \(d\hbox {th}\) cyclotomic polynomial. For \(d > 2\) we claim that \(\prod _i \Phi _d(\beta _i)\) is a rational square. By the functional equation we have

$$\begin{aligned} \beta _{1}, \ldots , \beta _{2m} = \gamma _1, \ldots , \gamma _m, \gamma _1^{-1}, \ldots , \gamma _m^{-1}. \end{aligned}$$

Since \(d>2\) we have \(\Phi _d(x) = x^{\phi (d)} \Phi _d(1/x)\) where \(\phi (d)\) is even, say \(\phi (d) = 2n\). Therefore \(\gamma _i^{-n} \Phi _d(\gamma _i) = \gamma _i^{n} \Phi _d(\gamma _i^{-1})\). It follows that \(\prod _{i=1}^m \gamma _i^{-n} \Phi _d(\gamma _i) \in {{\mathbb {Q}}}\) and

$$\begin{aligned} \prod _{i=1}^{2m} \Phi _d(\beta _i) = \prod _{i=1}^{2m} \beta _i^{-n} \Phi _d(\beta _i) = \bigg (\prod _{i=1}^{m} \gamma _i^{-n} \Phi _d(\gamma _i)\bigg )^2. \end{aligned}$$

Lemma 4.2

Suppose that \(P_1, \ldots P_r\) generate a finite index subgroup of \(E_p({\overline{{\mathbb {F}}}}_p(T))\). Then we have \(\Delta _p \equiv (\prod _t c_t) {\text {Reg}}(P_1, \ldots , P_r) \mod ({{\mathbb {Q}}}^\times )^2\) where the product is over \(t \in {{\mathbb {P}}}^1({\overline{{\mathbb {F}}}}_p)\) and \(c_t\) is the number of irreducible components of multiplicity one in the fibre of \({\overline{X}}_p\) above t.

Proof

See [29, Theorem 8.7 and (7.8)].

In the calculations below, we sometimes needed to find explicit generators for \(E_p({{\mathbb {F}}}_p(T))\). These were found by searching on 2-coverings, computed using 2-descent in Magma, as implemented in the function field case by S. Donnelly.

If \(\rho = \rho _p = \rho _q\) for distinct primes p and q, then (by the properties of the map \({\text {NS}}({\overline{X}}) \rightarrow {\text {NS}}({\overline{X}}_p)\) cited above) we have \(\Delta _p \equiv \Delta _q \mod ({{\mathbb {Q}}}^\times )^2\). As observed by van Luijk [34], this can sometimes be used to improve our upper bound on \(\rho \) by 1. This is particularly useful since (assuming the Tate conjecture) \(\rho _p\) is always even. Indeed \(\rho _p = \deg g_p = b_2 - \deg h_p\), and \(\deg h_p\) is even by the functional equation. See [20] and [23] for further examples.

Case\(({{\varvec{N}}},{\varvec{{\varepsilon }}}) = \mathbf{(9,1)}\). From the second and third paragraphs of this section we already know that \(\rho = 19\) or 20. Since the Tate conjecture has been proved for elliptic K3-surfaces [1], equality holds in Lemma 4.1. By Lemma 4.1 we compute \(\Delta _5 = 3 \cdot 17\) and \(\Delta _7 = 2 \cdot 3\), it being understood that these are elements of \({{\mathbb {Q}}}^\times /({{\mathbb {Q}}}^\times )^2\). Since these are different, it follows by the method of van Luijk that \(\rho = 19\).

Case\(({{\varvec{N}}},{\varvec{{\varepsilon }}}) = \mathbf{(12,1)}\). This is identical to the previous example, except that now \(\Delta _5=1\) and \(\Delta _{11}=7\).

Case\(({\varvec{N}},{\varvec{{\varepsilon }}}) = \mathbf{(9,2)}\). We already know that \(\rho = 29\) or 30. Let \(p=7\) or 13. By Lemma 4.1 and (16) we have \({{\text {rank}}}\,\, E_p({\overline{{\mathbb {F}}}}_p(T)) \le 3\). We prove equality by exhibiting three independent points of infinite order in \(E_p({{\mathbb {F}}}_p(T))\). In addition to the reductions of the two points in Table 3, we have when \(p=7\) the point

$$\begin{aligned} ( 5 T^5 + 6 T^4 + 4 T^2 + 6 T, T^7 + 3 T^6 + 6 T^3 + 2 T^2 + 2 T), \end{aligned}$$

and when \(p=13\) the point

$$\begin{aligned} ( 4 T^6 + 8 T^5 + 3 T^4 + 7 T^3 + 5 T^2 + 10 T + 2, 10 T^9 + 4 T^8 + 5 T^6 + 5 T^5 + 12 T^3 + 4 T^2 + 12). \end{aligned}$$

Using either Lemma 4.1 or Lemma 4.2 we find that \(\Delta _7 = 2\) and \(\Delta _{13} = 17\). Since these are different, it follows that \(\rho = 29\). As the referee points out, had we assumed for a contradiction that \(\rho = 30\), then this would already give equality in Lemma 4.1 for \(p=7\) and 13. Computing the extra points was therefore not necessary in this case.

In the cases \((N,{\varepsilon }) = (10,1),(10,3),(11,1)\) we aim to show that \(\rho = 28\). We were unableFootnote 4 to find a prime p with \(\rho _p = 28\), despite computing the polynomials \(f_p(x)\) for all \(p < 150\). This prompted us to try a variant of van Luijk’s method, where we use the intersection pairing to improve our upper bound for \(\rho \) by 2.

Case\(({\varvec{N}},{\varvec{{\varepsilon }}}) = \mathbf{(10,1)}\). We already know that \(\rho = 28\), 29 or 30. In addition to the point \(P_1 = (0,0)\) in Table 3 we have when \(p=7\) the points

$$\begin{aligned} Q_1&= (6 T^6 + 6 T^4 + 4 T^3 + 5 T^2,4 T^9 + 6 T^8 + 6 T^7 + T^6 + T^5 + 3 T^4), \\ Q_2&= (T^6 + 5 T^5 + 6 T^4 + 4 T^3 + 5 T^2,2 T^9 + 6 T^8 + 2 T^7 + T^6 + 3 T^4), \end{aligned}$$

and when \(p=17\) the points

$$\begin{aligned} R_1&= (16 T^6 + 13 T^5 + 6 T^4 + 4 T^3 + 12 T^2, 4 T^9 + 2 T^8 + 5 T^7 + 8 T^5 + 15 T^4), \\ R_2&= ((6 T^8 + 8 T^7 + 2 T^6 + 5 T^5 + 8 T^4 + 4 T^3 + T^2)/(T + 6)^2, \,\, \ldots \,\,). \end{aligned}$$

We checked that these points are independent. Therefore by Lemma 4.1 and (16) we have \(\rho _p = 30\) and \({{\text {rank}}}\,\, E_p({\overline{{\mathbb {F}}}}_p(T)) = 3\) for \(p = 7\) and 17. Using either Lemma 4.1 or Lemma 4.2 we find that \(\Delta _7 = 1\) and \(\Delta _{17} = 2 \cdot 59\). Since these are different, it follows that \(\rho \le 29\).

As we checked using Magma, reducing mod 7 or 17 does not change the Kodaira symbols of the singular fibres. So by Lemma 4.2 it will be enough for us to work with the height pairing on the Mordell-Weil group, rather than the intersection pairing on the full Néron-Severi group. We compute

$$\begin{aligned} {\text {Reg}}(P_1,uQ_1 + vQ_2)&= \tfrac{2}{75} ( 7 u^2 - 12 uv + 18 v^2), \\ {\text {Reg}}(P_1,xR_1 + yR_2)&= \tfrac{1}{450} ( 139 x^2 + 76 xy + 316 y^2). \end{aligned}$$

If \(\rho = 29\) then we have \({{\text {rank}}}\, E({\overline{{\mathbb {Q}}}}(T)) = 2\), and so there exists \(P_2 \in E({\overline{{\mathbb {Q}}}}(T))\) whose reduction mod 7 is \(\lambda P_1 + uQ_1 + vQ_2\) and whose reduction mod 17 is \(\mu P_1 + xR_1 + yR_2\) for some \(\lambda ,\mu ,u,v,x,y \in {{\mathbb {Z}}}\) with \((u,v) \not = (0,0)\) and \((x,y) \not = (0,0)\). Therefore the equation \(\tfrac{2}{75} ( 7 u^2 - 12 u v + 18 v^2) = \tfrac{1}{450} ( 139 x^2 + 76 xy + 316 y^2)\) has a solution in rational numbers uvxy not all zero. However this quadratic form of rank 4 is not locally soluble over the 3-adics. Therefore \(\rho = 28\).

Case\(({\varvec{N}},{\varvec{{\varepsilon }}}) = \mathbf{(10,3)}\). We already know that \(\rho = 28\), 29 or 30. Let \(p = 31\) or 37. Since \(p \equiv 1 \pmod {3}\) the reductions of the points in Tables 3 and 4 give us points \(P_1,P_2,P_3,P_4 \in E_p({{\mathbb {F}}}_p(T))\). In addition when \(p=31\) we have

$$\begin{aligned} Q_1&= (20 T^4 + 13 T^3 + 30 T^2 + 6 T,5 T^5 + 4 T^4 + 29 T^3 + 4 T^2 + 5 T), \\ Q_2&= (7 T^6 + 12 T^5 + 9 T^4 + 19 T^2 + 13 T + 4)/(T+29)^2, \,\, \ldots \,\, ), \end{aligned}$$

and when \(p=37\) we have

$$\begin{aligned} R_1&= (36 T^4 + 11 T^3 + 4 T^2,26 T^5 + 34 T^4 + 2 T^3 + 15 T^2), \\ R_2&= (6 T^4 + 5 T^3 + T^2 + 26 T + 32, 29 T^5 + 35 T^4 + 2 T^3 + 15 T^2 + 19 T + 10). \end{aligned}$$

Using either Lemma 4.1 or Lemma 4.2 we find that \(\Delta _{31} = 2 \cdot 5\) and \(\Delta _{37} = 1\). Therefore \(\rho \le 29\).

As in the previous example, reducing mod 31 or 37 does not change the singular fibres. We compute

$$\begin{aligned} {\text {Reg}}(P_1,P_2,P_3,P_4,uQ_1 + vQ_2)&= \tfrac{5}{96} ( 25 u^2 - 4 uv + 52 v^2), \\ {\text {Reg}}(P_1,P_2,P_3,P_4,xR_1 + yR_2)&= \tfrac{1}{8} ( 5 x^2 + 8 y^2). \end{aligned}$$

If \(\rho = 29\) then the equation \(\tfrac{5}{96} ( 25 u^2 - 4 uv + 52 v^2) = \tfrac{1}{8} ( 5 x^2 + 8 y^2)\) has a solution in rational numbers uvxy not all zero. However this quadratic form is not locally soluble over the 3-adics. Therefore \(\rho = 28\).

Case\(({\varvec{N}},{\varvec{{\varepsilon }}}) = \mathbf{(11,1)}\). We already know that \(\rho = 28\), 29 or 30. Let \(P_1\) and \(P_2\) be the reductions mod p of the points in Table 3. In addition, when \(p=23\) we have

$$\begin{aligned} Q_1&= (16 T^2 + 5 T + 5,21 T^3 + 15 T^2 + 3 T + 18), \\ Q_2&= (18 T^6 + 5 T^5 + 5 T^4 + 22 T^3 + 9 T^2)/(T+16)^2, \,\, \ldots \,\, ), \end{aligned}$$

and when \(p=53\) we have

$$\begin{aligned} R_1&= (28 T^5 + T^4 + 23 T^3 + 40 T^2 + 15 T, \,\, \ldots \,\, ), \\ R_2&= (49 T^6 + 44 T^5 + 38 T^4)/(T^2 + 42 T + 5)^2, \,\, \ldots \,\, ). \end{aligned}$$

Using either Lemma 4.1 or Lemma 4.2 we find that \(\Delta _{23}= 2 \cdot 7 \cdot 11 \cdot 13\) and \(\Delta _{53} = 11 \cdot 131\). Therefore \(\rho \le 29\).

Again, reducing mod 23 or 53 does not change the singular fibres. We compute

$$\begin{aligned} {\text {Reg}}(P_1,P_2,uQ_1 + vQ_2)&= \tfrac{11}{480} ( 57 u^2 - 46 uv + 137 v^2), \\ {\text {Reg}}(P_1,P_2,xR_1 + yR_2)&= \tfrac{1}{240} ( 541 x^2 - 228xy + 1196 y^2). \end{aligned}$$

If \(\rho = 29\) then the equation \(\tfrac{11}{480} ( 57 u^2 - 46 uv + 137 v^2) = \tfrac{1}{240} ( 541 x^2 - 228xy + 1196 y^2)\) has a solution in rational numbers uvxy not all zero. However this quadratic form is not locally soluble over the 11-adics. Therefore \(\rho = 28\).