Arithmetic of hyperelliptic curves over local fields

Dokchitser, Tim; Dokchitser, Vladimir; Maistret, Céline; Morgan, Adam

doi:10.1007/s00208-021-02319-y

Open access
Published: 20 February 2022

Volume 385, pages 1213–1322, (2023)

Mathematische Annalen

Tim Dokchitser¹,
Vladimir Dokchitser ORCID: orcid.org/0000-0003-4384-4193²,
Céline Maistret¹ &
…
Adam Morgan³

Abstract

We study hyperelliptic curves $y^2 = f(x)$ over local fields of odd residue characteristic. We introduce the notion of a “cluster picture” associated to the curve, that describes the p-adic distances between the roots of f(x), and show that this elementary combinatorial object encodes the curve’s Galois representation, conductor, whether the curve is semistable, and if so, the special fibre of its minimal regular model, the discriminant of its minimal Weierstrass equation and other invariants.

1 Introduction

In this paper we study hyperelliptic curves $y^2 = f(x)$ over local fields of odd residue characteristic. To a curve we associate a “cluster picture”, defined by the combinatorics of the root configuration of f, and show that it encodes many arithmetic invariants of the curve and its Jacobian. We use cluster pictures to get hold of a curve’s Galois representation and conductor, determine whether it is semistable and if so obtain the special fibre of its minimal regular model, the discriminant of its minimal Weierstrass model and other invariants. A similar construction to our clusters was used by Bosch [7, Sect. 5] to determine the stable type of hyperelliptic curves.

For the purposes of applications to arithmetic over number fields, the key features of our approach are that cluster pictures address whole classes of curves at a time, and that the input data is only basic information about the polynomial f(x). This is used in [20], which employs a case-by-case analysis of cluster pictures as an ingredient in establishing the parity conjecture for semistable abelian surfaces, and in [1], which uses clusters to construct explicit hyperelliptic curves over $\mathbb {Q}$ of arbitrarily high genus whose Galois image on ${\text {Jac}}C [\ell ]$ is the maximal possible.

Based on the present article, for semistable hyperelliptic curves Betts [5] has given a description of the Tamagawa number of the Jacobian in terms of clusters, and Kunzweiler [25] has found a simple formula for the other local “fudge factor” $|\omega /\omega ^o|$ in the Birch–Swinnerton-Dyer formula for the L-value, as well as a description of a basis of integral differentials on the minimal regular model. For hyperelliptic curves with tame reduction, Faraggi and Nowell [21] have described the special fibre of the minimal SNC model.

We would like to alert the reader to the companion paper [3], which summarises the results in this paper along with the subsequent works mentioned in the paragraph above, and illustrates them with examples. This explains why examples are virtually non-existent in this, already too long, paper.

1.1 Setup

Throughout the paper K will be a local field, with normalised valuation v, ring of integers ${\mathcal {O}}_K$, uniformiser $\pi $, and finite residue field k of characteristic $p\ne 2$. We use the shorthand mod ${\mathfrak {m}}$ to denote reduction to the residue field. We write $G_K={\text {Gal}}(K^{{\text {sep}}}/K)$ for the absolute Galois group, and $I_K<G_K$ for the inertia subgroup.

We work with hyperelliptic curves^{Footnote 1} C/K given by Weierstrass equations

$$\begin{aligned} C:y^2=f(x). \end{aligned}$$

We write $\mathcal {R}$ for the set of roots of f(x) in $K^{{\text {sep}}}$ and $c_f$ for its leading coefficient, so that

$$\begin{aligned} f(x) = c_f\prod _{r\in \mathcal {R}} (x-r). \end{aligned}$$

We denote by g the genus of the curve so that $|\mathcal {R}| = 2g+1$ or $2g+2$.

The main invariant that we are interested in is the configuration of distances between the roots of f. This is captured in the following:

Definition 1.1

A cluster is a non-empty subset ${{\mathfrak {s}}}\subset \mathcal {R}$ of the form ${{\mathfrak {s}}}= D \cap \mathcal {R}$ for some disc $D=\{x\!\in \! {\bar{K}}\mid v(x-z)\!\ge \! d\}$ for some $z\in {\bar{K}}$ and $d\in \mathbb {Q}$. If $|{{\mathfrak {s}}}|>1$, we say that ${{\mathfrak {s}}}$ is a proper cluster and define its depth $d_{{\mathfrak {s}}}$ to be

$$\begin{aligned} d_{{\mathfrak {s}}}= \min _{r,r' \in {\mathfrak {s}}} v(r-r'). \end{aligned}$$

The depth is the maximal d for which ${{\mathfrak {s}}}$ is cut out by such a disc.

It turns out that the cluster data carries a huge amount of information about the arithmetic of C/K. To fix ideas, let us begin with an example.

Example 1.2

Let $C/\mathbb {Q}_p$ be the hyperelliptic curve of genus 3 given by

$$\begin{aligned} C:\quad y^2 = (x\!-\!1)\cdot (x\!-\!(1\!+\!p^2))\cdot (x\!-\!(1\!-\!p^2))\cdot (x\!-\!p)\cdot x\cdot (x\!-\!p^3)\cdot (x\!+\!p^3). \end{aligned}$$

The set of roots is $\mathcal {R}=\{1,1\!+\!p^2,1\!-\!p^2,p,0,p^3,-p^3\}$. There are four proper clusters:

$$\begin{aligned} \{1,1\!+\!p^2,1\!-\!p^2\}, \quad \{0,p^3,-p^3\}, \quad \{p,0,p^3,-p^3\}, \quad \mathcal {R}, \end{aligned}$$

of depths 2, 3, 1 and 0, respectively. We draw cluster pictures by drawing roots $r \in \mathcal {R}$ as $\bullet$, and draw ovals around roots to represent a proper cluster:

Here we have ordered the roots as they appear in the equation for C. The subscript of the top cluster $\mathcal {R}$ is its depth. For all other clusters it is their “relative depth”, that is the difference between their depth and that of their parent cluster.
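The clusters and depths of Example 1.2 can be recomputed mechanically from Definition 1.1. The following is a minimal sketch, assuming the roots are distinct ordinary integers viewed inside $\mathbb {Q}_p$, and taking $p=5$ purely for concreteness; the helper names `val` and `proper_clusters` are illustrative and not part of the paper.

```python
from itertools import combinations

def val(n, p):
    """p-adic valuation of a nonzero integer n."""
    n = abs(n)
    v = 0
    while n % p == 0:
        n //= p
        v += 1
    return v

def proper_clusters(roots, p):
    """Return {cluster: depth} for every proper cluster of the given roots.

    Every disc meeting the roots can be recentred at a root, and only the
    pairwise distances matter as radii, so it suffices to try those.
    """
    radii = {val(a - b, p) for a, b in combinations(roots, 2)}
    found = {}
    for z in roots:
        for d in radii:
            s = frozenset(r for r in roots if r == z or val(r - z, p) >= d)
            if len(s) > 1:
                # depth = minimal valuation of a difference of two roots in s
                found[s] = min(val(a - b, p) for a, b in combinations(s, 2))
    return found

p = 5
roots = [1, 1 + p**2, 1 - p**2, p, 0, p**3, -p**3]
clusters = proper_clusters(roots, p)
for s, d in sorted(clusters.items(), key=lambda item: item[1]):
    print(sorted(s), "depth", d)
```

The relative depths in the picture are then differences of these depths along parent and child pairs.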

This simple picture, along with the fact that f(x) is monic, lets us deduce the following. The curve $C/\mathbb {Q}_p$ is semistable (Thm. 1.9 (1)), with conductor exponent 1 (Thm. 1.21), the model for C is a minimal Weierstrass equation (Thm. 1.24) with discriminant of valuation $v(\Delta _C) = 36$ (Thm. 1.26). The special fibre $\mathcal {C}_{\bar{{\mathbb {F}}}_p}$ of the minimal regular model $\mathcal {C}/\mathbb {Z}_p^{nr}$ has two components of genus 1 (Z and $Z'$, say) and two of genus 0, intersecting as shown on the right (Thms. 1.15(2), 1.11); in particular, the homology of its dual graph $\Upsilon _C$ is isomorphic to $\mathbb {Z}= \langle \ell \rangle $ with length pairing $\langle \ell , \ell \rangle =2$.

The local Galois representation is the following: for $l\ne p$, a Frobenius element ${\text {Frob}}\in G_{\mathbb {Q}_p}$ and $\tau :I_{\mathbb {Q}_p}\rightarrow \mathbb {Z}_l$ an l-adic tame character, there is a basis for $H^1_{\acute{\mathrm{e}}\text {t}}(C,\mathbb {Z}_l)\otimes _{\mathbb {Z}_l}{{\bar{\mathbb {Q}}}_l}$ such that $\sigma \in I_{\mathbb {Q}_p}$ and ${\text {Frob}}$ act as

respectively, where $\epsilon \in \{\pm 1\}$, and $\alpha , \beta $ are the Frobenius eigenvalues for $Z/\mathbb {F}_p$, and $\alpha ',\beta '$ those for $Z'/\mathbb {F}_p$ (Thm. 1.20 and Remark 10.2).

Let us stress that all of this information is encoded in the above picture (with Galois action) together with $v(c_f)$. Slightly finer data about f(x) also tells us that $\epsilon =\bigl (\frac{-1}{p}\bigr )$ and that both Z and $Z'$ are given by $y^2 = x^3-x$ (see Definition 1.13 and Theorem 1.11).

The general philosophy of the paper is that the local arithmetic of C/K should be visible from its cluster picture, that is from the roots $\mathcal {R}$ viewed as an abstract set with Galois action and its collection of clusters and depths (see Definition 1.27). Of course, we also need to know something about the leading term $c_f$, and some invariants require finer information. Here is a result that formalises some of this. Here $\epsilon _{{\mathfrak {s}}}({\text {Frob}})\in \{\pm 1\}$ is an explicit value associated to some clusters ${{\mathfrak {s}}}$, that generalises $\epsilon $ in the above example (see Definition 1.13).

Theorem 1.3

(see Theorem 18.4, Lemma 18.2, Theorem 16.2) For a hyperelliptic curve C/K, its cluster picture with $I_K$-action and the value of $v(c_f)$ determine $H^1_{\acute{\mathrm{e}}\text {t}}(C,\mathbb {Z}_l)\otimes _{\mathbb {Z}_l}{{\bar{\mathbb {Q}}}_l}$ as an $I_K$-module, and, in particular, the conductor exponent of C and whether the curve is semistable. If C/K is semistable, then together with the action of Frobenius on proper clusters and the values of $\epsilon _{{\mathfrak {s}}}({\text {Frob}})$ for clusters ${{\mathfrak {s}}}$ that contain an even number of roots, these also determine the dual graph of the special fibre of its minimal regular model (with genera of components and Frobenius action), whether the curve is deficient, and the Tamagawa number and root number of the Jacobian. If, moreover, $|k|>2g+1$, these also determine the valuation of the minimal discriminant of the curve.

We now explain how to explicitly obtain this data and more from the cluster information:

In Sect. 1.2 — the reduction type of C and its Jacobian;

In Sect. 1.3 — for semistable C, the special fibre of the minimal regular model, the homology of its dual graph, and whether the curve is deficient;

In Sect. 1.4 — the Galois representation and conductor exponent of C;

In Sect. 1.5 — for semistable C, the discriminant of a minimal Weierstrass equation;

In Sect. 1.6 — the relation of cluster pictures to isomorphism classes of curves;

In Sect. 1.7 — how much the coefficients of f(x) can be varied without affecting arithmetic invariants;

In Sect. 1.8 — a classification of semistable curves of genus 2 and their invariants.

We first need some terminology to work with clusters; see also Table 6 in Appendix E for a summary.

Definition 1.4

If ${{\mathfrak {s}}}'\subsetneq {{\mathfrak {s}}}$ is a maximal subcluster, we write ${{\mathfrak {s}}}'<{{\mathfrak {s}}}$ and refer to ${{\mathfrak {s}}}'$ as a child of ${{\mathfrak {s}}}$, and to ${{\mathfrak {s}}}$ as the parent of ${{\mathfrak {s}}}'$. We write ${{\mathfrak {s}}}=P({{\mathfrak {s}}}')$.

For two clusters (or roots) ${{\mathfrak {s}}}_1$, ${{\mathfrak {s}}}_2$ write ${{\mathfrak {s}}}_1\wedge {{\mathfrak {s}}}_2$ for the smallest cluster containing them.

Definition 1.5

A cluster ${{\mathfrak {s}}}$ is a twin if $|{{\mathfrak {s}}}|=2$, and it is odd/even if its size is odd/even. A proper cluster is übereven if it has only even children. A cluster ${{\mathfrak {s}}}$ is principal except when:

$|{{\mathfrak {s}}}|\le 2$, or
${{\mathfrak {s}}}$ has a child of size 2g, or
${{\mathfrak {s}}}=\mathcal {R}$ is even and has exactly two children.

Definition 1.6

For a proper cluster ${{\mathfrak {s}}}\ne \mathcal {R}$ we define its relative depth to be

$$\begin{aligned} \delta _{{\mathfrak {s}}}= d_{{\mathfrak {s}}}-d_{P({{\mathfrak {s}}})}. \end{aligned}$$
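To illustrate Definitions 1.4 to 1.6, here is a small sketch that encodes the cluster picture of Example 1.2 by hand and computes children, parents, relative depths and the übereven and principal properties. The encoding (roots labelled 0 to 6 in the order they appear in the equation, a dictionary `DEPTH` for proper clusters) and all function names are assumptions made for illustration only.

```python
G = 3  # genus of the curve in Example 1.2
R = frozenset(range(7))
S1, S2, S3 = frozenset({0, 1, 2}), frozenset({3, 4, 5, 6}), frozenset({4, 5, 6})
DEPTH = {R: 0, S1: 2, S2: 1, S3: 3}          # proper clusters with depths
CLUSTERS = list(DEPTH) + [frozenset({r}) for r in R]

def children(s):
    """Maximal subclusters of s (Definition 1.4)."""
    subs = [t for t in CLUSTERS if t < s]
    return [t for t in subs if not any(t < u for u in subs)]

def parent(s):
    """Smallest cluster strictly containing s, or None for s = R."""
    sups = [t for t in CLUSTERS if s < t]
    return min(sups, key=len) if sups else None

def is_ubereven(s):
    """Proper cluster all of whose children are even (Definition 1.5)."""
    return len(s) > 1 and all(len(c) % 2 == 0 for c in children(s))

def is_principal(s):
    """Negation of the three exceptions listed in Definition 1.5."""
    kids = children(s)
    if len(s) <= 2:
        return False
    if any(len(c) == 2 * G for c in kids):
        return False
    if s == R and len(s) % 2 == 0 and len(kids) == 2:
        return False
    return True

def relative_depth(s):
    """delta_s = d_s - d_{P(s)} for a proper cluster s != R (Definition 1.6)."""
    return DEPTH[s] - DEPTH[parent(s)]

for s in DEPTH:
    print(sorted(s), "principal:", is_principal(s), "ubereven:", is_ubereven(s))
```

In this example all four proper clusters are principal and none is übereven.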

1.2 Reduction of C and its Jacobian

One of the main outcomes of the cluster approach is that it easily detects the reduction behaviour of the curve C and its Jacobian ${\text {Jac}}C$. It requires one extra invariant, which takes into account the leading coefficient of f(x):

Definition 1.7

For a cluster ${{\mathfrak {s}}}$ set

$$\begin{aligned} \nu _{{\mathfrak {s}}}= v(c_f)+\sum _{r\in \mathcal {R}} d_{r\wedge {{\mathfrak {s}}}}. \end{aligned}$$

Definition 1.8

We say that C/K satisfies the semistability criterion if the following conditions hold:

(1) The extension $K(\mathcal {R})/K$ has ramification degree at most 2.
(2) Every proper cluster is $I_K$-invariant.
(3) Every principal cluster ${{\mathfrak {s}}}$ has $d_{{\mathfrak {s}}}\in \mathbb {Z}$ and $\nu _{{\mathfrak {s}}}\in 2\mathbb {Z}.$
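As a sanity check on Definitions 1.7 and 1.8, the following sketch computes $\nu _{{\mathfrak {s}}}$ for the proper clusters of Example 1.2 and tests condition (3). It assumes the hand-coded cluster picture below (roots labelled 0 to 6, $v(c_f)=0$); conditions (1) and (2) are not tested, as they hold trivially in that example because all roots lie in $\mathbb {Q}_p$. All names are illustrative.

```python
from fractions import Fraction

G, V_CF = 3, 0  # genus and v(c_f) for Example 1.2 (f is monic)
R = frozenset(range(7))
S1, S2, S3 = frozenset({0, 1, 2}), frozenset({3, 4, 5, 6}), frozenset({4, 5, 6})
DEPTH = {R: Fraction(0), S1: Fraction(2), S2: Fraction(1), S3: Fraction(3)}
CLUSTERS = list(DEPTH) + [frozenset({r}) for r in R]

def children(s):
    subs = [t for t in CLUSTERS if t < s]
    return [t for t in subs if not any(t < u for u in subs)]

def is_principal(s):
    kids = children(s)
    return not (len(s) <= 2
                or any(len(c) == 2 * G for c in kids)
                or (s == R and len(s) % 2 == 0 and len(kids) == 2))

def wedge(a, b):
    """Smallest cluster containing both a and b."""
    return min((t for t in CLUSTERS if a | b <= t), key=len)

def nu(s):
    """nu_s = v(c_f) + sum over all roots r of the depth of r ^ s (s proper)."""
    return V_CF + sum(DEPTH[wedge(frozenset({r}), s)] for r in R)

def condition_3():
    """Every principal cluster has integral depth and even nu."""
    return all(DEPTH[s].denominator == 1 and nu(s) % 2 == 0
               for s in DEPTH if is_principal(s))

print({tuple(sorted(s)): nu(s) for s in DEPTH}, condition_3())
```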

Theorem 1.9

(=Theorem 10.3) Let C/K be a hyperelliptic curve of genus g. Then

(1) C is semistable ($\Leftrightarrow $ ${\text {Jac}}C$ semistable) $\Leftrightarrow $ C/K satisfies the semistability criterion.
(2) C has good reduction $\Leftrightarrow $ $K(\mathcal {R})/K$ is unramified, there are no proper clusters of size $<2g+1$ and $\nu _\mathfrak {s}\in 2\mathbb {Z}$ for the unique principal cluster.
(3) C has potentially good reduction $\Leftrightarrow $ there are no proper clusters of size $<2g+1$.
(4) C is tame^{Footnote 2} ($\Leftrightarrow $ ${\text {Jac}}C$ is tame) $\Leftrightarrow $ $K(\mathcal {R})/K$ is tame.
(5) ${\text {Jac}}C$ has good reduction $\Leftrightarrow $ $K(\mathcal {R})/K$ is unramified, all clusters ${{\mathfrak {s}}}\ne \mathcal {R}$ are odd, and principal clusters have $\nu _{{\mathfrak {s}}}\in 2\mathbb {Z}$.
(6) ${\text {Jac}}C$ has potentially good reduction $\Leftrightarrow $ all clusters ${{\mathfrak {s}}}\ne \mathcal {R}$ are odd.
(7) The potential toric rank of ${\text {Jac}}C$ equals the number of even non-übereven clusters excluding $\mathcal {R}$, less 1 if $\mathcal {R}$ is übereven.
(8) ${\text {Jac}}C$ has potentially totally toric reduction $\Leftrightarrow $ every cluster has at most two odd children.
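Parts (3), (6), (7) and (8) of Theorem 1.9 are purely combinatorial, so they can be read off from the cluster picture without any Galois-theoretic input. The sketch below does this for the hand-coded picture of Example 1.2; the encoding and function names are illustrative assumptions.

```python
G = 3
R = frozenset(range(7))
S1, S2, S3 = frozenset({0, 1, 2}), frozenset({3, 4, 5, 6}), frozenset({4, 5, 6})
PROPER = [R, S1, S2, S3]
CLUSTERS = PROPER + [frozenset({r}) for r in R]

def children(s):
    subs = [t for t in CLUSTERS if t < s]
    return [t for t in subs if not any(t < u for u in subs)]

def is_ubereven(s):
    return len(s) > 1 and all(len(c) % 2 == 0 for c in children(s))

def potentially_good_reduction():
    """Theorem 1.9(3): no proper cluster of size < 2g+1."""
    return all(len(s) >= 2 * G + 1 for s in PROPER)

def jacobian_potentially_good():
    """Theorem 1.9(6): all clusters other than R are odd."""
    return all(len(s) % 2 == 1 for s in CLUSTERS if s != R)

def potential_toric_rank():
    """Theorem 1.9(7): even non-ubereven clusters != R, less 1 if R is ubereven."""
    n = sum(1 for s in PROPER
            if s != R and len(s) % 2 == 0 and not is_ubereven(s))
    return n - 1 if is_ubereven(R) else n

def potentially_totally_toric():
    """Theorem 1.9(8): every cluster has at most two odd children."""
    return all(sum(len(c) % 2 for c in children(s)) <= 2 for s in CLUSTERS)

print(potentially_good_reduction(), jacobian_potentially_good(),
      potential_toric_rank(), potentially_totally_toric())
```

For Example 1.2 the potential toric rank is 1, consistent with the rank of the homology of the dual graph stated there.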

1.3 Special fibre of the minimal regular model

For semistable curves we write down explicit charts for a regular model (Proposition 5.5). This then gives us an explicit construction for the special fibre of the minimal regular model in terms of clusters (see Theorem 8.6). We give a simplified statement here.

Definition 1.10

A centre $z_{{\mathfrak {s}}}$ of a proper cluster ${{\mathfrak {s}}}$ is any element $z_{{\mathfrak {s}}}\in K^{{\text {sep}}}$ such that $v(z_{{\mathfrak {s}}}-r) \ge d_{{\mathfrak {s}}}$ for all $r \in {{\mathfrak {s}}}$; equivalently the cluster ${{\mathfrak {s}}}$ can be written as $D \cap \mathcal {R}$ for the disc $D = z_{{\mathfrak {s}}} + \pi ^{d_{{\mathfrak {s}}}} {\mathcal {O}}_{K^{{\text {sep}}}}$. If ${{\mathfrak {s}}}= \{r\}$ is a singleton, its centre is $z_{{\mathfrak {s}}}= r$.

Theorem 1.11

Suppose C/K is semistable. The special fibre of the minimal regular model of C over ${\mathcal {O}}_{K^{{\text {nr}}}}$ consists of components $\Gamma _{{\mathfrak {s}}}$ for every principal cluster ${{\mathfrak {s}}}$, linked by chains of $\mathbb {P}^1$s. The normalisation of $\Gamma _{{\mathfrak {s}}}$ is given as follows. Fix a choice of centre $z_{{\mathfrak {s}}}$ for each cluster. Then

$$\begin{aligned} \tilde{\Gamma }_{{\mathfrak {s}}}:\> y^2 = c_{{\mathfrak {s}}}\prod _{{\text {odd }}{{{\mathfrak {o}}} < {{\mathfrak {s}}}}}(x-{\text {red}}_{{\mathfrak {s}}}({\mathfrak {o}})), \end{aligned}$$

where $c_{{\mathfrak {s}}}=\frac{c_f}{\pi ^{v(c_f)}}\prod _{r \notin {{\mathfrak {s}}}} \frac{z_{{\mathfrak {s}}}-r}{\pi ^{d_{{{\mathfrak {s}}}\wedge r}}} \mod {\mathfrak {m}}$ and ${\text {red}}_{{\mathfrak {s}}}({\mathfrak {o}}) = \frac{z_{{\mathfrak {o}}}-z_{{\mathfrak {s}}}}{\pi ^{d_{{\mathfrak {s}}}}} \mod {\mathfrak {m}}$.

If $\mathcal {R}$ is principal then the chains of $\mathbb {P}^1$s are given explicitly as follows. If ${{\mathfrak {s}}}'<{{\mathfrak {s}}}$ are principal, then there is one chain with $\frac{\delta _{{{\mathfrak {s}}}'}}{2}\!-\! 1$ $\mathbb {P}^1$s (resp. two chains with $\delta _{{{\mathfrak {s}}}'}\!-\! 1$ $\mathbb {P}^1$s) between $\Gamma _{{\mathfrak {s}}}$ and $\Gamma _{{{\mathfrak {s}}}'}$ if ${{\mathfrak {s}}}'$ is odd (resp. even); if a chain has length 0 the components $\Gamma _{{\mathfrak {s}}}$ and $\Gamma _{{{\mathfrak {s}}}'}$ intersect. Every twin ${{\mathfrak {t}}}<{{\mathfrak {s}}}$ with $\delta _{{\mathfrak {t}}}>\frac{1}{2}$ gives a chain of $2\delta _{{\mathfrak {t}}}\!-\! 1$ $\mathbb {P}^1$s from $\Gamma _{{\mathfrak {s}}}$ to itself.
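The combinatorial part of Theorem 1.11 can be sketched in code for Example 1.2. The genus of $\tilde{\Gamma }_{{\mathfrak {s}}}$ is computed here with the standard formula $\lfloor (n-1)/2\rfloor $ for a curve $y^2=h(x)$ with h separable of degree n, where n is the number of odd children; this formula is an assumption not stated in the text above, and the function presumes ${{\mathfrak {s}}}$ has at least one odd child. The count $\delta _{{{\mathfrak {s}}}'}-1$ for even children is read as the length of each of the two chains, which is an interpretation. All names are illustrative.

```python
from fractions import Fraction

R = frozenset(range(7))
S1, S2, S3 = frozenset({0, 1, 2}), frozenset({3, 4, 5, 6}), frozenset({4, 5, 6})
DEPTH = {R: Fraction(0), S1: Fraction(2), S2: Fraction(1), S3: Fraction(3)}
CLUSTERS = list(DEPTH) + [frozenset({r}) for r in R]

def children(s):
    subs = [t for t in CLUSTERS if t < s]
    return [t for t in subs if not any(t < u for u in subs)]

def parent(s):
    sups = [t for t in CLUSTERS if s < t]
    return min(sups, key=len) if sups else None

def component_genus(s):
    """Genus of y^2 = c * prod(x - red(o)) over the odd children o of s."""
    n_odd = sum(len(c) % 2 for c in children(s))
    return (n_odd - 1) // 2

def linking_chains(child):
    """(number of chains, P^1s per chain) joining Gamma_child to its parent.

    Applies to a principal child of a principal cluster.
    """
    delta = DEPTH[child] - DEPTH[parent(child)]
    if len(child) % 2 == 1:
        return (1, delta / 2 - 1)
    return (2, delta - 1)

for s in DEPTH:
    print(sorted(s), "genus", component_genus(s))
for s in (S1, S2, S3):
    print(sorted(s), "chains", linking_chains(s))
```

All chains have length 0 here, so the four components meet directly, in agreement with the description of the special fibre in Example 1.2.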

In particular we obtain a description of the dual graph of the special fibre and its homology in terms of clusters. The latter has a particularly simple description: roughly speaking, the set of even clusters corresponds to a basis of the homology group. Recall that this homology group is related to the Galois representation, the Tamagawa number of the Jacobian (Lemma 2.22) and the character group of the toric part of the Raynaud parametrisation of the Jacobian (Lemma 2.21). In order to keep track of the Galois action on the dual graph (an analogue of split/non-split multiplicative reduction for elliptic curves) we need an extra invariant $\epsilon _{{\mathfrak {s}}}$, which generalises $\epsilon $ in Example 1.2.

Definition 1.12

A cotwin is a non-übereven cluster that has a child of size 2g.

For a cluster ${{\mathfrak {s}}}$ that is not a cotwin we write ${{\mathfrak {s}}}^*$ for the smallest cluster ${{\mathfrak {s}}}^*\supseteq {{\mathfrak {s}}}$ whose parent is not übereven (and ${{\mathfrak {s}}}^*=\mathcal {R}$ if no such cluster exists). If ${{\mathfrak {s}}}$ is a cotwin, we write ${{\mathfrak {s}}}^*$ for its child of size 2g.

Definition 1.13

For even clusters ${{\mathfrak {s}}}$ fix a choice of $\theta _{{\mathfrak {s}}}= \sqrt{c_f\prod \nolimits _{r \notin {{\mathfrak {s}}}} (z_{{\mathfrak {s}}}-r)}$, where $z_{{\mathfrak {s}}}$ is some (any) centre for ${{\mathfrak {s}}}$. If ${{\mathfrak {s}}}$ is either even or a cotwin, define $\epsilon _{{\mathfrak {s}}}:G_K\rightarrow \{\pm 1\}$ by

$$\begin{aligned} \epsilon _{{{\mathfrak {s}}}}(\sigma ) \equiv \frac{\sigma (\theta _{{{\mathfrak {s}}}^*})}{\theta _{(\sigma {{\mathfrak {s}}})^*}} \mod \mathfrak {m}. \end{aligned}$$

For all other clusters ${{\mathfrak {s}}}$, set $\epsilon _{{\mathfrak {s}}}(\sigma )=0$.

Remark 1.14

(see Remark 8.4) Note that $\epsilon _{{\mathfrak {s}}}$ does not depend on the choice of centre $z_{{{\mathfrak {s}}}^*}$. When ${{\mathfrak {s}}}$ is even or a cotwin, $\epsilon _{{\mathfrak {s}}}$ restricts to a character on the stabiliser of ${{\mathfrak {s}}}$ that is also independent of the choice of the sign of $\theta _{{{\mathfrak {s}}}^*}$. This character is unramified if and only if $|I_K/I_{{{\mathfrak {s}}}^*}| (v(c_f)+\sum _{r\notin {{\mathfrak {s}}}^*}d_{r\wedge {{\mathfrak {s}}}^*})$ is even, where $I_{{{\mathfrak {s}}}}<I_K$ denotes the stabiliser of ${{\mathfrak {s}}}$. When ${{\mathfrak {s}}}$ is not a cotwin, this is equivalent to ${\text {ord}}_2|I_K/I_{{{\mathfrak {s}}}}| (v(c_f)+\sum _{r\notin {{\mathfrak {s}}}}d_{r\wedge {{\mathfrak {s}}}})\ge 1$.

Theorem 1.15

(see Theorem 9.3, Corollary 8.7) Suppose C/K is semistable. Let $\Upsilon _C$ denote the dual graph of the special fibre of the minimal regular model of C over ${\mathcal {O}}_{K^{{\text {nr}}}}$, with its natural action of ${\text {Gal}}({\bar{k}}/k)$. Let A be the set of even non-übereven clusters excluding $\mathcal {R}$. Then

(1) $\operatorname{rk}_{\mathbb {Z}} H_1(\Upsilon _C,\mathbb {Z}) = \left\{ \begin{array}{ll} \#A-1&{} \text{ if } \mathcal {R} \text{ is übereven},\\ \#A&{} \text{ otherwise. }\end{array} \right. $
(2) The number of components in the special fibre is
$$\begin{aligned} m_C = \sum _{\begin{array}{c} {{\mathfrak {s}}}\text { odd,} \\ |{{\mathfrak {s}}}|\ne 1,~2g+1 \end{array}} \frac{\delta _{{{\mathfrak {s}}}}}{2} + \sum _{\begin{array}{c} {{\mathfrak {s}}}\ne \mathcal {R}, \\ \text {even} \end{array}} 2\delta _{{{\mathfrak {s}}}} +1-{\text {rk}}H_1(\Upsilon _C,\mathbb {Z}). \end{aligned}$$
(3) $H_1(\Upsilon _{C},{\mathbb {Z}}) = \Bigl \{ \sum _{{{\mathfrak {s}}}\in A} a_{{{\mathfrak {s}}}}\ell _{{{\mathfrak {s}}}} \Bigm |\> a_{{\mathfrak {s}}}\in \mathbb {Z}, \>\>\sum _{{{\mathfrak {s}}}\in B} a_{{{\mathfrak {s}}}}=0\Bigr \},$ where B is the subset of clusters ${{\mathfrak {s}}}\in A$ such that ${{\mathfrak {s}}}^*=\mathcal {R}$.
(4) The length pairing is given by
$$\begin{aligned} \langle \ell _{{{\mathfrak {s}}}_1},\ell _{{{\mathfrak {s}}}_2} \rangle =\left\{ \begin{array}{ll} 0&{} \text{ if } {{\mathfrak {s}}}^*_1 \ne {{\mathfrak {s}}}^*_2, \\ 2(d_{({{\mathfrak {s}}}_1\wedge {{\mathfrak {s}}}_2)}-d_{P({{\mathfrak {s}}}^*_1)})&{} \text{ if } {{\mathfrak {s}}}^*_1 ={{\mathfrak {s}}}^*_2 \ne \mathcal {R}, \\ 2(d_{({{\mathfrak {s}}}_1\wedge {{\mathfrak {s}}}_2)}-d_{\mathcal {R}})&{} \text{ if } {{\mathfrak {s}}}^*_1 ={{\mathfrak {s}}}^*_2 = \mathcal {R}.\\ \end{array} \right. \end{aligned}$$
(5) $\sigma \in {\text {Gal}}(\bar{k}/k)$ acts on $H_1(\Upsilon _C,\mathbb {Z})$ by $\sigma (\ell _{{{\mathfrak {s}}}}) = \epsilon _{{{\mathfrak {s}}}}(\sigma ) \ell _{\sigma ({{\mathfrak {s}}})}.$
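Parts (1), (2) and (4) of Theorem 1.15, together with the operation ${{\mathfrak {s}}}\mapsto {{\mathfrak {s}}}^*$ of Definition 1.12, can be evaluated directly on a cluster picture. The following sketch does so for the hand-coded picture of Example 1.2 and recovers the rank 1, the four components and the pairing $\langle \ell ,\ell \rangle =2$ quoted there. The encoding and all function names are illustrative assumptions.

```python
from fractions import Fraction

G = 3
R = frozenset(range(7))
S1, S2, S3 = frozenset({0, 1, 2}), frozenset({3, 4, 5, 6}), frozenset({4, 5, 6})
DEPTH = {R: Fraction(0), S1: Fraction(2), S2: Fraction(1), S3: Fraction(3)}
CLUSTERS = list(DEPTH) + [frozenset({r}) for r in R]

def children(s):
    subs = [t for t in CLUSTERS if t < s]
    return [t for t in subs if not any(t < u for u in subs)]

def parent(s):
    sups = [t for t in CLUSTERS if s < t]
    return min(sups, key=len) if sups else None

def is_ubereven(s):
    return len(s) > 1 and all(len(c) % 2 == 0 for c in children(s))

def is_cotwin(s):
    return not is_ubereven(s) and any(len(c) == 2 * G for c in children(s))

def star(s):
    """s* as in Definition 1.12."""
    if is_cotwin(s):
        return next(c for c in children(s) if len(c) == 2 * G)
    t = s
    while t != R and is_ubereven(parent(t)):
        t = parent(t)
    return t

def wedge(a, b):
    return min((t for t in CLUSTERS if a | b <= t), key=len)

def delta(s):
    return DEPTH[s] - DEPTH[parent(s)]

# A = even non-ubereven clusters other than R
A = [s for s in DEPTH if s != R and len(s) % 2 == 0 and not is_ubereven(s)]

def h1_rank():
    """Theorem 1.15(1)."""
    return len(A) - 1 if is_ubereven(R) else len(A)

def num_components():
    """Theorem 1.15(2)."""
    odd = sum(delta(s) / 2 for s in DEPTH
              if s != R and len(s) % 2 == 1 and len(s) != 2 * G + 1)
    even = sum(2 * delta(s) for s in DEPTH if s != R and len(s) % 2 == 0)
    return odd + even + 1 - h1_rank()

def pairing(s1, s2):
    """Theorem 1.15(4) for s1, s2 in A."""
    if star(s1) != star(s2):
        return 0
    base = DEPTH[R] if star(s1) == R else DEPTH[parent(star(s1))]
    return 2 * (DEPTH[wedge(s1, s2)] - base)

print(h1_rank(), num_components(), pairing(S2, S2))
```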

Our description of the special fibre of the minimal regular model of C also allows us to easily determine whether C is deficient, i.e. has no K-rational divisor of degree $g-1$. Deficiency is used to determine whether the Tate-Shafarevich group of the Jacobian of a curve over a number field has square order (see Sect. 8 of [30]).

Theorem 1.16

(=Theorem 12.4) Suppose C/K is semistable. Then C is deficient if and only if it has even genus and either

(1) $\mathcal {R}= {{\mathfrak {s}}}_1 \coprod {{\mathfrak {s}}}_2$ with ${{\mathfrak {s}}}_1, {{\mathfrak {s}}}_2$ odd, $G_K$-conjugate and $\delta _{{{\mathfrak {s}}}_i}$ odd, or
(2) $\mathcal {R}$ is übereven with $\epsilon _{\mathcal {R}}({\text {Frob}}) = -1$ and for all non-übereven ${{\mathfrak {s}}}$ such that ${{\mathfrak {s}}}^* =\mathcal {R}$, either $d_{{\mathfrak {s}}}\notin \mathbb {Z}$ or the ${\text {Frob}}$-orbit of ${{\mathfrak {s}}}$ has even size, or
(3) $\mathcal {R}$ is a cotwin, its principal child ${{\mathfrak {r}}}$ is übereven with $\epsilon _{{{\mathfrak {r}}}}({\text {Frob}})\! =\! -1$, and for all ${{\mathfrak {s}}}$ such that ${{\mathfrak {s}}}^*\!=\!{{\mathfrak {r}}}$, either $d_{{\mathfrak {s}}}\notin \mathbb {Z}$ or the ${\text {Frob}}$-orbit of ${{\mathfrak {s}}}$ has even size.

In fact, even for curves that are not semistable, we construct a regular model over a field F/K where the curve becomes semistable. The special fibre of the minimal regular model of C over ${\mathcal {O}}_{F^{{\text {nr}}}}$ comes with a natural action of $G_K$, not just $G_{F^{{\text {nr}}}}(=I_F)$, see Sect. 2.4. We describe it explicitly in Theorems 8.6 and 8.8 as well.

1.4 Galois representation

Knowing the explicit Galois action on the special fibre lets us determine the l-adic Galois representation of arbitrary hyperelliptic curves. We use the following shorthand notation:

Notation 1.17

For a curve X/k and a prime $l\ne p$ write

$$\begin{aligned} H^1_{\acute{\mathrm{e}}\text {t}}(X)=H^1_{\acute{\mathrm{e}}\text {t}}(X_{{\bar{k}}},\mathbb {Q}_l), \end{aligned}$$

and similarly for curves over K.

Notation 1.18

For a cluster ${{\mathfrak {s}}}$ we write $G_{{\mathfrak {s}}}={\text {Stab}}({{\mathfrak {s}}})$ for its stabiliser in $G_K$ and $I_{{\mathfrak {s}}}<G_{{\mathfrak {s}}}$ for the corresponding inertia subgroup.

As mentioned above, if C acquires semistable reduction over F, the full Galois group $G_K$ acts on the special fibre of the minimal regular model of C over ${\mathcal {O}}_{F^{{\text {nr}}}}$. In particular, the étale cohomology groups $H^1_{\acute{\mathrm{e}}\text {t}}(\Gamma _{{\mathfrak {s}}})$ of the components have an induced action of the stabiliser $G_{{\mathfrak {s}}}$, which is closely linked to the étale cohomology of C and which we are able to control explicitly. We obtain the following description.

Notation 1.19

For a cluster ${{\mathfrak {s}}}$ we define ${\tilde{{{\mathfrak {s}}}}}$ to be the set of odd children of ${{\mathfrak {s}}}$ and write

$$\begin{aligned} {\tilde{\lambda }}_{{\mathfrak {s}}}= \frac{1}{2} (v(c_f) + |{\tilde{{{\mathfrak {s}}}}}|d_{{{\mathfrak {s}}}} + \sum _{r\notin {{\mathfrak {s}}}}d_{{{\mathfrak {s}}}\wedge r}). \end{aligned}$$
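For orientation, here is a short sketch evaluating ${\tilde{\lambda }}_{{\mathfrak {s}}}$ on the cluster picture of Example 1.2, together with the prime-to-p part of the denominator of $|I_K/I_{{\mathfrak {s}}}|\,{\tilde{\lambda }}_{{\mathfrak {s}}}$, which is the quantity that enters Theorem 1.20. The hand-coded picture, the parameter `index` standing for $|I_K/I_{{\mathfrak {s}}}|$, and the function names are all illustrative assumptions.

```python
from fractions import Fraction

R = frozenset(range(7))
S1, S2, S3 = frozenset({0, 1, 2}), frozenset({3, 4, 5, 6}), frozenset({4, 5, 6})
DEPTH = {R: Fraction(0), S1: Fraction(2), S2: Fraction(1), S3: Fraction(3)}
CLUSTERS = list(DEPTH) + [frozenset({r}) for r in R]

def children(s):
    subs = [t for t in CLUSTERS if t < s]
    return [t for t in subs if not any(t < u for u in subs)]

def wedge(a, b):
    return min((t for t in CLUSTERS if a | b <= t), key=len)

def lambda_tilde(s, v_cf=0):
    """(v(c_f) + #(odd children of s) * d_s + sum_{r not in s} d_{s ^ r}) / 2."""
    n_odd = sum(len(c) % 2 for c in children(s))
    outside = sum(DEPTH[wedge(frozenset({r}), s)] for r in R - s)
    return Fraction(v_cf + n_odd * DEPTH[s] + outside, 2)

def prime_to_p_denominator(s, p, index=1, v_cf=0):
    """Prime-to-p part of the denominator of index * lambda_tilde(s)."""
    d = (index * lambda_tilde(s, v_cf)).denominator
    while d % p == 0:
        d //= p
    return d

for s in DEPTH:
    print(sorted(s), lambda_tilde(s), prime_to_p_denominator(s, p=5))
```

With $v(c_f)=0$ every value is an integer, so the denominators are all 1; changing `v_cf` to 1 produces half-integers.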

Theorem 1.20

(= Theorem 10.1) Let C/K be a hyperelliptic curve. Let $H^1_{\acute{\mathrm{e}}\text {t}}(C)=H^1_{\acute{\mathrm{e}}\text {t}}(C)_{ab}\oplus H^1_{\acute{\mathrm{e}}\text {t}}(C)_{t}\otimes {\text {Sp}}_2$ be the decomposition into ‘toric’ and ‘abelian’ parts. Then

$$\begin{aligned} H^1_{\acute{\mathrm{e}}\text {t}}(C)_{t} =\bigoplus _{{\mathfrak {s}}}{\text {Ind}}_{G_{{\mathfrak {s}}}}^{G_K} \!\epsilon _{{\mathfrak {s}}}\quad \ominus \epsilon _\mathcal {R}, \end{aligned}$$

the sum taken over representatives of $G_K$-orbits of even non-übereven clusters. Writing $\Gamma _{{\mathfrak {s}}}$ for the components of the special fibre in Theorem 1.11 over a Galois extension where C acquires semistable reduction, we have

$$\begin{aligned} H^1_{\acute{\mathrm{e}}\text {t}}(C)_{ab}=\bigoplus _{{\mathfrak {s}}}{\text {Ind}}_{G_{{\mathfrak {s}}}}^{G_K} H^1_{\acute{\mathrm{e}}\text {t}}({\tilde{\Gamma }}_{{\mathfrak {s}}}), \end{aligned}$$

the sum taken over representatives of $G_K$-orbits of principal non-übereven clusters. For every such cluster ${{\mathfrak {s}}}$, there is an isomorphism of $I_{{\mathfrak {s}}}$-modules

$$\begin{aligned} H^1_{\acute{\mathrm{e}}\text {t}}({\tilde{\Gamma }}_{{\mathfrak {s}}}) \>\>\cong \>\> {\tilde{\gamma }}_{{\mathfrak {s}}} \otimes (\mathbb {Q}_l[{\tilde{{{\mathfrak {s}}}}}] \ominus {\varvec{1}}) \quad \ominus \epsilon _{{\mathfrak {s}}}, \end{aligned}$$

where ${\tilde{\gamma }}_{{\mathfrak {s}}}:I_{{\mathfrak {s}}}\rightarrow {\bar{\mathbb {Q}}}_l^\times $ is any character^{Footnote 3} whose order is the prime-to-p part of the denominator of $|I_K/I_{{\mathfrak {s}}}|\,{\tilde{\lambda }}_{{\mathfrak {s}}}$.

When C/K is semistable one can in fact also recover the Frobenius action on $H^1_{\acute{\mathrm{e}}\text {t}}({\tilde{\Gamma }}_{{\mathfrak {s}}})$, as we explain in Remark 10.2. Briefly, one can pick the centres $z_{{\mathfrak {s}}}$ in Theorem 1.11 to lie in $K_{{\mathfrak {s}}}$, the field cut out by $G_{{\mathfrak {s}}}$, which makes $\widetilde{\Gamma }_{\mathfrak {s}}$ naturally a curve over the residue field $k_{{\mathfrak {s}}}$ of $K_{{\mathfrak {s}}}$. The action of $G_{{\mathfrak {s}}}$ on $H^1_{\acute{\mathrm{e}}\text {t}}({\tilde{\Gamma }}_{{\mathfrak {s}}})$ in Theorem 1.20 is then simply via the quotient $G_{{\mathfrak {s}}}/I_{{\mathfrak {s}}}={\text {Gal}}(\bar{k}/k_{{\mathfrak {s}}})$ and the usual action of ${\text {Gal}}(\bar{k}/k_{{\mathfrak {s}}})$ on étale cohomology. One may recover the Frobenius eigenvalues for this action from point counts on ${\tilde{\Gamma }}_{{\mathfrak {s}}}$ over extensions of $k_{{\mathfrak {s}}}$. In a forthcoming paper [17] we will explain how to reconstruct $H^1_{\acute{\mathrm{e}}\text {t}}(C)$ as a full $G_K$-representation from point counts even when C/K is not semistable.

From the explicit description of the Galois representation we deduce the following formula for the conductor.

Theorem 1.21

(Theorem 11.3, Corollary 9.4) Let C/K be a hyperelliptic curve. Decompose the conductor exponent of ${\text {Jac}}C$ into its tame and wild parts, $n_C = n_{C,\text {tame}}+n_{C,\text {wild}}$.

(i) The wild part is given by
$$\begin{aligned} n_{C,\text {wild}} = \sum _{r\in S} v(\Delta _{K(r)/K})-[K(r):K]+f_{K(r)/K}, \end{aligned}$$
where S is any set of representatives of $G_K$-orbits of $\mathcal {R}$, $\Delta _{K(r)/K}$ is the discriminant of K(r) over K, and $f_{K(r)/K}$ is the residue degree.
(ii) The tame part is given by $n_{C,\text {tame}} = 2g-\dim H^1_{\acute{\mathrm{e}}\text {t}}(C)^{I_K}$ with
$$\begin{aligned} \begin{array}{cll} \dim H^1_{\acute{\mathrm{e}}\text {t}}(C)^{I_K} &{}=&{} \#\{I_K\text {-orbits on }U\}\> - \#\{I_K\text {-orbits on }V\} - { \Bigl \{\!\!\begin{array}{ll} \scriptstyle ~1\!\!\!&{}\scriptstyle \text { if }|\mathcal {R}|\text { and }v(c_f)\text { are even,}\\ \scriptstyle ~0\!\!\!&{}\scriptstyle \text { otherwise,}\end{array}}\\ U &{}=&{} \{{{\mathfrak {s}}}\ne \mathcal {R}\text { odd cluster} \,\bigm |\, \xi _{P({{\mathfrak {s}}})}({\tilde{\lambda }}_{P({{\mathfrak {s}}})})\le \xi _{P({{\mathfrak {s}}})}(d_{P({{\mathfrak {s}}})}) \bigr \},\\ V &{}=&{} \{{{\mathfrak {s}}}\text { proper non-}\ddot{\text {u}}\text {bereven cluster} \,\bigm |\, \xi _{{{\mathfrak {s}}}}({\tilde{\lambda }}_{{{\mathfrak {s}}}})=0 \bigr \};\\ \end{array} \end{aligned}$$
here $\xi _{{\mathfrak {s}}}(a)={\text {ord}}_2(b)$ where b is the denominator of $|I_K/I_{{{\mathfrak {s}}}}|a$, with $\xi _{{\mathfrak {s}}}(0)=0$.
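(iii) If C/K is semistable then
$$\begin{aligned} n_C = \left\{ \begin{array}{ll} \#A-1&{} \text{ if } \mathcal {R} \text{ is übereven},\\ \#A&{} \text{ otherwise, }\end{array} \right. \end{aligned}$$
where A is the set of even non-übereven clusters ${{\mathfrak {s}}}\ne \mathcal {R}$.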

1.5 Weierstrass equations

By their very nature, clusters are designed to work with Weierstrass equations. We establish criteria in terms of clusters for an equation to be integral (Theorem 13.3) and to be a minimal Weierstrass equation, and give a formula for its discriminant.

Definition 1.22

We say that the Weierstrass equation $y^2=f(x)$ for C is integral if $f(x)\in {\mathcal {O}}_{K}[x]$. Its discriminant is $\Delta _C= 16^g c_f^{4g+2}\text {disc}\bigl (\frac{1}{c_f}f(x)\bigr )$. We say that such an equation is minimal if $v(\Delta _C)$ is minimal amongst all integral Weierstrass equations for C.

Remark 1.23

One can consider more general Weierstrass equations for C of the form $y^2+Q(x)y=P(x)$ for Q, P polynomials of degree at most $g+1$ and $2g+2$ respectively, and define integral and minimal equations accordingly. Since we work in odd residue characteristic, we can always find a minimal equation with $Q(x)=0$.

Our notion of integral Weierstrass equation differs slightly from that of Liu used in [28, Definition 2]. However the resulting notion of minimal equation and minimal discriminant (Definition 3 of op. cit.) is easily seen to coincide. Several additional notions of minimal discriminant appear in the literature for both hyperelliptic curves and more general curves. See for example [24, 36] and [33] for a discussion of these, and [27] for the relationship between them for hyperelliptic curves of genus 2.

Theorem 1.24

(see Theorem 17.2) Suppose $C:y^2=f(x)$ is a semistable hyperelliptic curve over K with $f(x)\in {\mathcal {O}}_K[x]$, and that $|k|>2g+1$. Then C defines a minimal Weierstrass equation if and only if one of the following conditions holds:

(1) there are two clusters of size $g+1$ that are swapped by Frobenius, $d_\mathcal {R}=0$ and $v(c_f)\in \{0,1\}$,
(2) there is no cluster of size $>\!g\!+\!1$ with depth $>0$, but there is some $G_K$-stable cluster ${{\mathfrak {s}}}$ of size $|{{\mathfrak {s}}}|\ge g+1$ with $d_{{\mathfrak {s}}}\ge 0$ and $v(c_f)=-\sum _{r\notin {{\mathfrak {s}}}}d_{r\wedge {{\mathfrak {s}}}}$.

Note that even if C is not semistable or if $|k|\le 2g\!+\!1$ but $y^2\!=\!f(x)$ satisfies (2) in the above theorem, then it is automatically a minimal Weierstrass equation, as it becomes minimal after a finite field extension. In particular (taking ${{\mathfrak {s}}}\!=\!\mathcal {R}$ and $d_\mathcal {R}\!=\!0$ in (2)), we immediately obtain the following general criterion for the equation to be minimal:

Corollary 1.25

Let $C:y^2\!=\!f(x)$ be a hyperelliptic curve over K with $f(x)\in {\mathcal {O}}_K[x]$. If $f(x)\!\mod {\mathfrak {m}}$ has at least two distinct roots in $\bar{k}$, but no root of multiplicity $>\!g\!+\!1$, and the leading coefficient of f(x) is a unit, then this is a minimal Weierstrass equation.
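The hypothesis of Corollary 1.25 is easy to test when the roots are known explicitly. The sketch below assumes that all roots are ordinary integers (so that the roots of $f \bmod {\mathfrak {m}}$ are the residues of the roots modulo p) and that $K=\mathbb {Q}_p$; the function name and its signature are illustrative.

```python
from collections import Counter

def corollary_1_25_applies(roots, p, v_cf, g):
    """Check: unit leading coefficient, at least two distinct roots mod p,
    and no root mod p of multiplicity greater than g + 1."""
    multiplicities = Counter(r % p for r in roots)
    return (v_cf == 0
            and len(multiplicities) >= 2
            and max(multiplicities.values()) <= g + 1)

p = 5
example_roots = [1, 1 + p**2, 1 - p**2, p, 0, p**3, -p**3]  # Example 1.2
print(corollary_1_25_applies(example_roots, p, v_cf=0, g=3))
```

For Example 1.2 the reduction has the root 1 with multiplicity 3 and the root 0 with multiplicity 4, which equals $g+1$, so the criterion applies.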

We also obtain the following result on the discriminant.

Theorem 1.26

(=Theorem 16.2) Let C/K be a hyperelliptic curve. The valuation of its discriminant $\Delta _C$ is given by

$$\begin{aligned} v(\Delta _C)= v(c_f)(4g+2) + \sum _{{{\mathfrak {s}}}\text { proper}} d_{{\mathfrak {s}}}\Bigl (|{{\mathfrak {s}}}|^2-\mathop {\Sigma }\limits _{{{\mathfrak {s}}}'<{{\mathfrak {s}}}}|{{\mathfrak {s}}}'|^2\Bigr ). \end{aligned}$$

If C/K is semistable and $|k|>2g\!+\!1$, then the valuation of the discriminant $\Delta _C^{min}$ of a minimal Weierstrass model of C is determined by the formula

$$\begin{aligned} \frac{v(\Delta _C)-v(\Delta _C^{min})}{4g+2}=v(c_f)-E+ d_{\mathcal {R}}(|\mathcal {R}|-g-1)+\sum _{g+1<|{{\mathfrak {s}}}|<|\mathcal {R}|}\delta _{{\mathfrak {s}}}(|{{\mathfrak {s}}}|-g-1), \end{aligned}$$

where $E=0$ unless there are two clusters of size $g+1$ that are permuted by Frobenius and $v(c_f)$ is odd, in which case $E=1$.
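The first formula of Theorem 1.26 is easy to evaluate mechanically. The following is a minimal sketch, assuming that 'proper' means of size $>1$ and that ${{\mathfrak {s}}}'<{{\mathfrak {s}}}$ ranges over the children (maximal proper subclusters, including singletons) of ${{\mathfrak {s}}}$; the class `Cluster` and the function names are hypothetical and chosen only for illustration.

```python
from fractions import Fraction


class Cluster:
    """One cluster: its size, its depth (proper clusters only) and its children."""

    def __init__(self, size, depth=None, children=()):
        self.size = size
        self.depth = depth
        self.children = list(children)


def root():
    """A singleton cluster; it carries no depth."""
    return Cluster(1)


def discriminant_valuation(top, g, v_cf=0):
    """v(c_f)(4g+2) + sum over proper s of d_s(|s|^2 - sum of |child|^2)."""
    total = Fraction(v_cf) * (4 * g + 2)
    stack = [top]
    while stack:
        s = stack.pop()
        if s.size > 1:  # proper cluster (assumed meaning: size > 1)
            child_squares = sum(c.size ** 2 for c in s.children)
            total += Fraction(s.depth) * (s.size ** 2 - child_squares)
        stack.extend(s.children)
    return total


# y^2 = x^3 + p^2 with p > 3: three roots, pairwise at distance 2/3.
cuspidal = Cluster(3, Fraction(2, 3), [root(), root(), root()])
# Hypothetical genus 2 picture: five roots at depth 0, two of them at distance 1.
twin = Cluster(5, 0, [Cluster(2, 1, [root(), root()]), root(), root(), root()])

print(discriminant_valuation(cuspidal, g=1))
print(discriminant_valuation(twin, g=2))
```

For the first picture the sketch returns 4, in agreement with the discriminant $-2^4 3^3 p^4$ of $y^2=x^3+p^2$ when $p>3$.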

1.6 Cluster pictures

As discussed in the beginning, we extract most of our arithmetic data purely from the ‘cluster picture’ of the roots of f(x). Effectively this is how we propose to think about hyperelliptic curves over local fields. To formalise this slightly, we consider an abstract cluster picture purely combinatorially, without reference to roots of a polynomial.

Definition 1.27

Let X be a finite set and $\Sigma $ a collection of non-empty subsets of X; elements of $\Sigma $ are called clusters. Attach depth $d_{{\mathfrak {s}}}\in \mathbb {Q}$ to every cluster ${{\mathfrak {s}}}$ of size $> 1$. Then $\Sigma $ (or $(\Sigma ,X,d)$) is a cluster picture if

(1)
Every singleton (‘root’) is a cluster, and X is a cluster.
(2)
Two clusters are either disjoint or contained in one another.
(3)
$d_{{\mathfrak {t}}}>d_{{\mathfrak {s}}}$ if ${{\mathfrak {t}}}\subsetneq {{\mathfrak {s}}}$.

Two cluster pictures $(\Sigma ,X,d)$ and $(\Sigma ',X',d')$ are isomorphic if there is a bijection $\phi :X\rightarrow X'$ which induces a bijection from $\Sigma $ to $\Sigma '$ and $d_{{\mathfrak {s}}}= d'_{\phi ({{\mathfrak {s}}})}$.
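The three axioms of Definition 1.27 can be checked directly on finite data. The sketch below is only an illustration: the function name `is_cluster_picture` and the encoding (clusters as frozensets, depths as a dictionary keyed by the clusters of size $>1$) are our own assumptions, not notation from the paper.

```python
def is_cluster_picture(X, clusters, depth):
    """Check axioms (1)-(3) of an abstract cluster picture.

    X        -- finite set
    clusters -- iterable of subsets of X
    depth    -- dict mapping each cluster of size > 1 (as a frozenset) to its depth
    """
    X = frozenset(X)
    clusters = {frozenset(s) for s in clusters}
    # Clusters are non-empty subsets of X.
    if any(len(s) == 0 or not s <= X for s in clusters):
        return False
    # (1) Every singleton is a cluster, and X is a cluster.
    if X not in clusters or any(frozenset([x]) not in clusters for x in X):
        return False
    # (2) Two clusters are either disjoint or contained in one another.
    for s in clusters:
        for t in clusters:
            if s & t and not (s <= t or t <= s):
                return False
    # Every cluster of size > 1 must carry a depth.
    if any(len(s) > 1 and s not in depth for s in clusters):
        return False
    # (3) d_t > d_s whenever t is strictly contained in s (t of size > 1).
    for s in clusters:
        for t in clusters:
            if len(t) > 1 and t < s and not depth[t] > depth[s]:
                return False
    return True


X = {1, 2, 3, 4, 5}
singletons = [{x} for x in X]
good = singletons + [{1, 2}, X]
good_depth = {frozenset(X): 0, frozenset({1, 2}): 1}
overlapping = singletons + [{1, 2}, {2, 3}, X]
overlapping_depth = {frozenset(X): 0, frozenset({1, 2}): 1, frozenset({2, 3}): 1}

print(is_cluster_picture(X, good, good_depth))
print(is_cluster_picture(X, overlapping, overlapping_depth))
```

The second collection fails axiom (2), since $\{1,2\}$ and $\{2,3\}$ meet without being nested.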

For a polynomial $f(x)\in K[x]$ or a hyperelliptic curve $C:y^2=f(x)$, the cluster picture $\Sigma _f$ or $\Sigma _C$ is the collection of all clusters of the roots of f, as in Definition 1.1.
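For experimentation one can compute $\Sigma _f$ in the simplest situation, where $K=\mathbb {Q}_p$ and all roots of f are distinct integers. The sketch below assumes that Definition 1.1 takes clusters to be the non-empty subsets of roots cut out by discs, with the depth of a proper cluster being the minimal valuation of a difference of two of its roots; the function names are hypothetical.

```python
from itertools import combinations


def vp(n, p):
    """p-adic valuation of a non-zero integer n."""
    if n == 0:
        raise ValueError("valuation of 0 is infinite")
    n, k = abs(n), 0
    while n % p == 0:
        n //= p
        k += 1
    return k


def cluster_picture(roots, p):
    """Clusters and depths for distinct integer roots (a sketch, see assumptions above)."""
    roots = list(roots)
    distances = {vp(a - b, p) for a, b in combinations(roots, 2)}
    clusters = {frozenset([r]) for r in roots}
    for z in roots:
        for d in distances:
            # Roots lying in the disc with centre z and depth d.
            clusters.add(frozenset(r for r in roots if r == z or vp(r - z, p) >= d))
    depth = {
        s: min(vp(a - b, p) for a, b in combinations(s, 2))
        for s in clusters
        if len(s) > 1
    }
    return clusters, depth


# Roots of x(x-7)(x-1)(x-2)(x-3) with p = 7.
clusters, depth = cluster_picture([0, 7, 1, 2, 3], 7)
for s in sorted(clusters, key=lambda s: (len(s), sorted(s))):
    print(sorted(s), depth.get(s))
```

Here the only proper clusters are $\{0,7\}$ of depth 1 and the full set of roots of depth 0. Roots in a ramified extension would require rational depths and a genuine p-adic arithmetic package, which this sketch does not attempt.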

As explained in Theorem 1.3, cluster pictures carry a lot of arithmetic data. Different models for the same curve can have different cluster pictures; however, even this can largely be seen from the combinatorics of the roots. There is a purely combinatorial notion of “equivalence” of cluster pictures (Definition 14.7) that keeps track of isomorphisms of curves:

Theorem 1.28

(Theorem 14.4, Corollary 14.7) If C and $C'$ are isomorphic hyperelliptic curves over K, then their cluster pictures are equivalent. Conversely, if an abstract cluster picture $\Sigma $ is equivalent to $\Sigma _C$ for some hyperelliptic curve C, then there is a hyperelliptic curve $C'/\bar{K}$ that is $\bar{K}$-isomorphic to C and whose cluster picture is $\Sigma $.

It is worth mentioning that every equivalence class of cluster pictures has a canonical representative, a ‘balanced cluster picture’ (see Lemma 15.1). For semistable hyperelliptic curves Theorem 15.2 and Corollary 15.3 explain how to find an isomorphic curve with such a cluster picture.

1.7 Perturbing the curve

The cluster picture is a fairly coarse invariant of the polynomial f(x). In particular small perturbations of the coefficients of f(x) will not change its associated cluster picture and hence many of the invariants of the corresponding curve. Here is a precise statement (see Theorem 19.1 for a more general result).

Theorem 1.29

(=Corollary 19.3) Suppose $C_1:y^2 = c_1g_1(x)$ and $C_2:y^2 = c_2g_2(x)$ are two hyperelliptic curves with $c_1, c_2 \in K^{\times }$ and $g_1(x), g_2(x) \in {\mathcal {O}}_K[x]$ monic polynomials. If $\frac{c_1}{c_2} \in K^{\times 2}$ and $g_1(x) \equiv g_2(x) \mod \pi ^{d+1}$ where d is the largest depth among the depths of all proper clusters of $C_1$, then

$H^1_{\acute{\mathrm{e}}\text {t}}(C_1) \cong H^1_{\acute{\mathrm{e}}\text {t}}(C_2)$ as $G_K$-modules for every $l\ne p$, and $C_1$ and $C_2$ have the same conductor exponent and the same root number.
If $C_1$ is semistable then so is $C_2$. In this case, the special fibres of their minimal regular models over ${\mathcal {O}}_{K^{nr}}$ are isomorphic as curves with an action of Frobenius, their Jacobians have the same Tamagawa number, $C_2$ is deficient if and only if $C_1$ is and, if $|k|>\deg g_1(x)$, the valuations of their minimal discriminants are equal.

1.8 Classification of semistable curves

As in Theorem 1.3, cluster pictures together with the signs of $\epsilon _{{\mathfrak {s}}}({\text {Frob}})$ determine a range of arithmetic information for semistable curves (see also Theorem 18.4). In view of the semistability criterion (Theorem 1.9(1)), it is easy to list and classify all possible cluster pictures that correspond to semistable hyperelliptic curves. We present here the classification one obtains for curves of genus 2, together with their various arithmetic invariants; this is discussed more fully in Sect. 18. We propose a notation for cluster pictures that carries this extra data in Notation 18.5.

Notation 1.30

For a cluster picture we write the relative depth on all proper clusters (except for $\mathcal {R}$ which is decorated with its depth) at the bottom right corner of the cluster. For every even cluster ${{\mathfrak {s}}}$ such that ${{\mathfrak {s}}}\! =\! {{\mathfrak {s}}}^*$ we write a sign $+$ or − on its top right corner to indicate $\epsilon _{{\mathfrak {s}}}({\text {Frob}})$. For every cluster, we link its children that are in the same Frobenius orbit by lines.

For the dual graph of the special fibre $\Upsilon _C$, the numbers inside the vertices indicate their genus (no number meaning genus 0), an edge of length n indicates a chain of n edges between $n\!-\!1$ genus 0 vertices, and the black arrows represent the action of Frobenius on the graph.

Theorem 1.31

(see Theorem 18.8) Suppose the residue field of K has size $|k|\!>\!5$.

Every semistable genus 2 curve C/K admits a model $y^2{=}f(x)$, with $f(x){\in }{\mathcal {O}}_K[x]$ and one of the listed cluster pictures in Table 1. Conversely, every genus 2 curve $y^2= f(x)$ with one of the cluster pictures in Table 1 is semistable and, if $f(x) \in {\mathcal {O}}_K[x]$, then this is a minimal Weierstrass model.

Moreover, Table 1 gives the invariants of such curves: $m_C$ is the number of components in the special fibre of the minimal regular model of C, n is the conductor exponent, w is the local root number, c is the Tamagawa number of Jac(C), Def indicates whether the curve is deficient (−) or not ($+$). The column $H_1(\Upsilon _C, \mathbb {Z})$ lists the isomorphism class of the lattice together with automorphism (induced by Frobenius) and pairing (induced by the length pairing on $\Upsilon _C$), in the notation of [4] Thm 1.2.2.

Table 1 Local invariants of semistable genus 2 curves

How to determine the reduction type (first column of Table 1) of a semistable genus 2 curve directly from its cluster picture is explained in Theorem 18.7 and Table 3 in Sect. 18.

1.9 Layout

In Sect. 2 we review some facts about models of general curves over local fields and étale cohomology.

In Sect. 3 we show that cluster configurations (or, rather, certain collections of discs) give rise to a regular model of $\mathbb {P}^1_{K^\text{ nr }}$ and describe its properties (Proposition 3.13). In Sect. 4 and Sect. 5, we study double covers of those models, and deduce explicit regular models of hyperelliptic curves that satisfy the semistability criterion (Definition 1.8). This approach is similar to that of Kausz [24], and has also been exploited by Bouw-Wewers [10] and Srinivasan [36], though each of these works in a slightly different generality to us. In particular, we construct the models under the assumptions of the semistability criterion and so in particular do not assume that all Weierstrass points are rational. We find the minimal regular model (Sect. 5.4), describe the dual graph of its special fibre (Theorem 5.18), give explicit equations for its components (Theorem 5.20) and describe the reduction map from the generic to the special fibre (Prop. 5.23); we deduce the stable model in Sect. 5.8.

In Sect. 6 we turn to non-semistable curves and study the natural Galois action on the model that we have over an extension where the curve becomes semistable. We then deduce the semistability criterion (Theorem 7.1), and in Sect. 8 describe the Galois action on the special fibre in terms of clusters. In Sect. 9 we extract the homology of the dual graph of the special fibre (Theorem 9.3), and, consequently, the toric part of the étale cohomology (Corollary 9.6). The abelian part is addressed in Sect. 10, and as a consequence, we get Theorems 1.20 (=10.1) and 1.9 (=10.3). In Sect. 11 we then find the formula for the conductor, and classify deficient curves in Sect. 12.

In Sect. 16 and Sect. 17 we study the discriminant and the minimal Weierstrass equation of a hyperelliptic curve, proving Theorems 1.24 (=17.2) and 1.26 (=16.2). This is primarily a combinatorial cluster yoga, relying on the semistability criterion to convert semistability into cluster language. In Sect. 18 we propose a notion of a ‘reduction type’ of a semistable curve, and give a classification in genus 2. In Sect. 19 we study the variation of the coefficients of a curve that does not affect its primary arithmetic invariants, and prove Theorem 19.1 and Theorem 1.29 (=Corollary 19.3).

In Appendix A we review affine automorphisms of (possibly singular) hyperelliptic equations. Appendices B and C prove some technical results concerning centres of clusters and equivalent forms of the semistability criterion. Finally, Appendix D links the results of this paper to its combinatorial predecessor [18].

For convenience of the reader, Appendix E lists the notation and terminology used in different parts of the paper, together with their definitions or references to where they are given.

2 Curves and Jacobians over local fields

In this section we review some facts about models of curves over local fields and étale cohomology. We refer the reader to [12, Sect. 2], [10, 23, 35], and especially [19] for details. All of this is standard, except we want the residue field to be non-algebraically closed, and so have to keep track of the Galois action throughout the section.

Let K be a local field, with uniformiser $\pi $ and residue field k. Suppose C/K is a non-singular projective curve, of genus $g\ge 2$.

A model of C/K is a flat proper scheme $\mathcal {C}/{\mathcal {O}}_K$ together with a K-isomorphism of its generic fibre with C. It is a regular model if $\mathcal {C}$ is regular, and such a model can always be obtained from a given model by repeated blowups. Among regular models, there is a unique one dominated by all the others, the minimal regular model

$$\begin{aligned} \mathcal {C}_{\min } \longrightarrow {\text {Spec}}{\mathcal {O}}_K. \end{aligned}$$

A model is semistable if its special fibre $\mathcal {C}_k$ is geometrically reduced and has only ordinary double points as singularities, and when such a model exists we say that C/K is semistable or has semistable reduction. Such a model always exists over some finite extension F/K [13]. When one exists over K, the minimal regular model is semistable as well, and blowing down certain components of the special fibre yields a stable model

$$\begin{aligned} \mathcal {C}_{\text{ st }} \longrightarrow {\text {Spec}}{\mathcal {O}}_K, \end{aligned}$$

characterised among semistable models by the fact that its special fibre has a finite automorphism group (i.e. it is a stable curve). It is again unique, though it is not necessarily regular, and it commutes with base change, as opposed to the regular model.

Example 2.1

Take $K=\mathbb {Q}_p$ ($p>3$), and

$$\begin{aligned} C/K: y^2=x^3+p^2, \end{aligned}$$

an elliptic curve with additive reduction. The defining equation gives a model $\mathcal {C}/\mathbb {Z}_p$ that is neither regular (the ideal (0, 0, p) gives a singular point) nor semistable (the special fibre has a cusp). Its minimal regular model $\mathcal {C}_{\min }/\mathbb {Z}_p$ has three $\mathbb {P}^1$s meeting at a point as a special fibre (Kodaira type $\mathrm{IV}$). The curve C becomes semistable over $\mathbb {Q}_p(\root 3 \of {p})$, since

$$\begin{aligned} y^2 = x^3 + \pi ^6 \quad \cong \quad y^2=x^3+1 \qquad (\pi =\root 3 \of {p}), \end{aligned}$$

(2.2)

and the latter has good reduction: the special fibre is the elliptic curve $y^2=x^3+1$ over $\mathbb {F}_p$.
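The isomorphism (2.2) can be spelled out by a rescaling of the coordinates. Writing $x=\pi ^2x'$ and $y=\pi ^3y'$ (the primed coordinates are our notation, introduced only for this illustration), one finds

$$\begin{aligned} y^2=x^3+\pi ^6 \quad \Longrightarrow \quad \pi ^6y'^2=\pi ^6x'^3+\pi ^6 \quad \Longrightarrow \quad y'^2=x'^3+1, \end{aligned}$$

and the discriminant $-2^4 3^3$ of the last equation is a unit for $p>3$.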

We begin by reviewing special fibres of semistable models.

2.1 Semistable curves over the residue field

We follow [23, pp. 469–474] closely, except our description of X(T) in (2.5) is slightly tweaked.

Let $Y/k$ be a semistable curve, that is $Y$ is complete, connected, not necessarily irreducible, and the only singularities of $Y_{{\bar{k}}}=Y\times _k{\bar{k}}$ are ordinary double points. Write

$$\begin{aligned} \begin{array}{llllll} n&{}=&{} \hbox {normalisation map } {{\tilde{Y}}}\rightarrow Y,\\ {{\mathcal {I}}}&{}=&{} \hbox {set of singular (ordinary double) points of }Y_{{\bar{k}}},\\ {{\mathcal {J}}}&{}=&{} \hbox {set of irreducible components of }Y_{{\bar{k}}}(\hbox {=connected comps. of }{\tilde{Y}}_{{\bar{k}}}),\\ {{\mathcal {K}}}&{}=&{} n^{-1}({{\mathcal {I}}}); \hbox {this comes with two canonical maps}\\ &{}&{} \phi : {{\mathcal {K}}}\rightarrow {{\mathcal {I}}}, P\mapsto n(P),\\ &{}&{} \psi : {{\mathcal {K}}}\rightarrow {{\mathcal {J}}}, P\mapsto \hbox { component of }{\tilde{Y}}_{{\bar{k}}}\hbox { on which P lies,}\\ \Upsilon &{}=&{} \hbox {dual graph of }Y_{{\bar{k}}}.\\ \end{array} \end{aligned}$$

2.1.1 Dual graph

By a metric graph we mean a topological space G homeomorphic to a finite (combinatorial) graph, equipped with a set V(G) of vertices (containing (at least) all points $x\in G$ of degree $\ne 2$), a set E(G) of edges, and a length function $l:E(G)\rightarrow {\mathbb {R}}_{>0}$. Graph isomorphisms are homotopy classes of homeomorphisms that preserve vertices, edges and lengths. We allow loops and multiple edges and note that automorphisms may permute multiple edges and reverse the direction of loops. Note that automorphisms act naturally on the first singular homology group $H_1(G,\mathbb {Z})$.

The dual graph $\Upsilon $ is the metric graph with vertex set ${{\mathcal {J}}}$ and edge set ${{\mathcal {I}}}$. The set ${{\mathcal {K}}}$ is the set of edge endpoints, the maps $\phi $ and $\psi $ specify adjacency, and each edge is given length 1. It comes with a natural ‘genus’ marking $g:V(\Upsilon )\rightarrow {\mathbb {Z}}_{\ge 0}$ which associates to each vertex the (geometric) genus of the component to which it corresponds. Note that a graph automorphism of $\Upsilon $ is precisely the data of bijections ${{\mathcal {K}}}\rightarrow {{\mathcal {K}}}$, ${{\mathcal {I}}}\rightarrow {{\mathcal {I}}}$ and ${{\mathcal {J}}}\rightarrow {{\mathcal {J}}}$ that commute with $\phi $ and $\psi $.

2.1.2 Character group

The normalisation map $n$ is an isomorphism outside ${{\mathcal {I}}}$, and yields an exact sequence of sheaves on $Y$,

$$\begin{aligned} 1 \longrightarrow O_{Y}^\times \longrightarrow n_* O_{{{\tilde{Y}}}}^\times \longrightarrow {{\mathbb {I}}}\longrightarrow 0, \end{aligned}$$

with ${{\mathbb {I}}}$ concentrated in ${{\mathcal {I}}}$. Consider the long exact sequence on cohomology,

$$\begin{aligned} 0 \rightarrow H^0(Y,O_{Y}^\times ) \rightarrow H^0({{\tilde{Y}}},O_{{{\tilde{Y}}}}^\times ) \rightarrow H^0(Y,{{\mathbb {I}}}) \rightarrow H^1(Y,O_{Y}^\times ) \rightarrow H^1({{\tilde{Y}}},O_{{{\tilde{Y}}}}^\times ) \rightarrow 0. \end{aligned}$$

The global sections of ${{\mathbb {I}}}$ are in bijection with invertible functions on ${{\mathcal {K}}}$ modulo those pulled back from ${{\mathcal {I}}}$. In other words,

$$\begin{aligned} H^0(Y,{{\mathbb {I}}}) = {\text {coker}}(({\bar{k}}^\times )^{{\mathcal {I}}}\>\>{\buildrel {\phi ^*}\over \longrightarrow }\>\>({\bar{k}}^\times )^{{\mathcal {K}}}), \end{aligned}$$

where $\phi ^*$ takes a function ${{\mathcal {I}}}\rightarrow {\bar{k}}^\times $ to ${{\mathcal {K}}}\rightarrow {\bar{k}}^\times $ by composing it with $\phi $. With $\psi ^*$ defined in the same way, the exact sequence above becomes

$$\begin{aligned} \!0\!\longrightarrow \! {\bar{k}}^\times \!\longrightarrow \! ({\bar{k}}^\times )^{{\mathcal {J}}}\>\>{\buildrel {\psi ^*}\over \longrightarrow }\>\> \frac{({\bar{k}}^\times )^{{\mathcal {K}}}}{\phi ^*(({\bar{k}}^\times )^{{\mathcal {I}}})} \!\longrightarrow \! {\text {Pic}}Y({\bar{k}}) \!\longrightarrow \! {\text {Pic}}{{\tilde{Y}}}({\bar{k}}) \!\longrightarrow \! 0. \end{aligned}$$

(2.3)

In fact, there is an exact sequence of algebraic groups

$$\begin{aligned} 0 \longrightarrow T \longrightarrow {\text {Pic}}^0(Y) \longrightarrow {\text {Pic}}^0({{\tilde{Y}}}) \longrightarrow 0, \end{aligned}$$

(2.4)

with T the largest torus in ${\text {Pic}}^0(Y)$. Then (2.3) gives a canonical identification of its character group $X(T)={\text {Hom}}(T_{{\bar{k}}},\mathbb {G}_{m,{\bar{k}}})$ as

$$\begin{aligned} X(T) = \ker (\mathbb {Z}^{{\mathcal {K}}}\>\>{\buildrel {(\phi ,\psi )}\over \longrightarrow }\>\> \mathbb {Z}^{{\mathcal {I}}}\times \mathbb {Z}^{{\mathcal {J}}}). \end{aligned}$$

(2.5)

On the other hand, write the dual graph $\Upsilon $ as the union $\Upsilon =U\cup V$, where U is the union of open edges, and V is the union of small open neighbourhoods of the vertices. Then the Mayer-Vietoris sequence reads

$$\begin{aligned} 0 \longrightarrow H_1(\Upsilon ,\mathbb {Z}) \longrightarrow \mathbb {Z}^{{\mathcal {K}}}\>\>{\buildrel {(\phi ,\psi )}\over \longrightarrow }\>\> \mathbb {Z}^{{\mathcal {I}}}\times \mathbb {Z}^{{\mathcal {J}}}\longrightarrow \mathbb {Z}\longrightarrow 0, \end{aligned}$$

since $H_0(U)=\mathbb {Z}^{{\mathcal {I}}}$, $H_0(V)=\mathbb {Z}^{{\mathcal {J}}}$, $H_0(U\cap V)=\mathbb {Z}^{{\mathcal {K}}}$ and all their higher homology groups vanish. Therefore, the character group of T and its $\mathbb {Z}$-linear dual are, canonically,

$$\begin{aligned} X(T)=H_1(\Upsilon ,\mathbb {Z}), \qquad X(T)^*=H^1(\Upsilon ,\mathbb {Z}). \end{aligned}$$

(2.6)
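The description of $H_1(\Upsilon ,\mathbb {Z})$ as a kernel makes its rank computable from the maps $\phi $ and $\psi $ alone. The following sketch (function names and the dictionary encoding of $\phi $, $\psi $ are our own assumptions) computes the rank of the kernel of $\mathbb {Z}^{{\mathcal {K}}}\rightarrow \mathbb {Z}^{{\mathcal {I}}}\times \mathbb {Z}^{{\mathcal {J}}}$ by exact Gaussian elimination over $\mathbb {Q}$.

```python
from fractions import Fraction


def rank(rows):
    """Rank of a rational matrix, by Gaussian elimination with exact fractions."""
    m = [[Fraction(x) for x in row] for row in rows]
    ncols = len(m[0]) if m else 0
    r = 0
    for c in range(ncols):
        piv = next((i for i in range(r, len(m)) if m[i][c] != 0), None)
        if piv is None:
            continue
        m[r], m[piv] = m[piv], m[r]
        for i in range(len(m)):
            if i != r and m[i][c] != 0:
                f = m[i][c] / m[r][c]
                m[i] = [a - f * b for a, b in zip(m[i], m[r])]
        r += 1
    return r


def h1_rank(phi, psi):
    """Rank of ker(Z^K -> Z^I x Z^J); phi maps K to I and psi maps K to J."""
    K = sorted(phi)
    I = sorted(set(phi.values()))
    J = sorted(set(psi.values()))
    # One row per singular point and one per component: sums over the fibres.
    rows = [[1 if phi[k] == i else 0 for k in K] for i in I]
    rows += [[1 if psi[k] == j else 0 for k in K] for j in J]
    return len(K) - rank(rows)


# One component with a single node: K = {P1, P2} lying over the node P.
nodal_phi = {"P1": "P", "P2": "P"}
nodal_psi = {"P1": "Y", "P2": "Y"}
# Two components Y1, Y2 meeting in three points a, b, c.
banana_phi = {"a1": "a", "a2": "a", "b1": "b", "b2": "b", "c1": "c", "c2": "c"}
banana_psi = {"a1": "Y1", "a2": "Y2", "b1": "Y1", "b2": "Y2", "c1": "Y1", "c2": "Y2"}

print(h1_rank(nodal_phi, nodal_psi))
print(h1_rank(banana_phi, banana_psi))
```

For a connected graph the result agrees with $|{{\mathcal {I}}}|-|{{\mathcal {J}}}|+1$, as the Mayer-Vietoris sequence predicts: 1 for the nodal curve and 2 for the second example.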

On the level of Tate modules $T_l$, $l\ne {{\,\mathrm{char}\,}}k$, the sequence (2.4) becomes

$$\begin{aligned} 0 \longrightarrow X(T)^*\otimes _\mathbb {Z}\mathbb {Z}_l \longrightarrow T_l{\text {Pic}}^0 Y\longrightarrow T_l {\text {Pic}}^0 ({{\tilde{Y}}}) \longrightarrow 0. \end{aligned}$$

(2.7)

There is a length pairing on $H_1(\Upsilon ,\mathbb {Z})$: let $\langle e,e\rangle =1$ and $\langle e,e'\rangle =0$ for edges $e\ne e'$ of $\Upsilon $, and extend to singular chains by linearity. This descends to a pairing on $H_1$,

$$\begin{aligned} \langle , \rangle : H_1(\Upsilon ,\mathbb {Z})\times H_1(\Upsilon ,\mathbb {Z})\longrightarrow \mathbb {Z}. \end{aligned}$$

(2.8)

Finally, it is clear that ${{\mathcal {I}}}$, ${{\mathcal {J}}}$, ${{\mathcal {K}}}$, X(T), $X(T)^*$, $H_1(\Upsilon ,\mathbb {Z})$, $T_l$ are all $G_k$-modules, and (2.6), (2.7), (2.8) are compatible with $G_k$-action, as everything is canonical.

2.2 Semistable Jacobians

Now we go back to C/K, and suppose it has semistable reduction. Then the Jacobian $A={\text {Jac}}C$ is a semistable abelian variety over K. Let

$\mathcal {C}/{\mathcal {O}}_K$ = semistable model of C over ${\mathcal {O}}_K$ base changed to ${\mathcal {O}}_{K^{{\text {nr}}}}$, with special fibre ${\bar{C}}/{\bar{k}}$.
$\mathcal {N}/{\mathcal {O}}_K$ = Néron model of $A/K^{{\text {nr}}}$ base changed to ${\mathcal {O}}_{K^{{\text {nr}}}}$, with special fibre ${\bar{N}}/{\bar{k}}$; the identity component ${\bar{N}}^0$ is ${\text {Pic}}^0{\bar{C}}$.
$\Phi ({\bar{k}})$ = (finite) group of components ${\bar{N}}/{\bar{N}}^0$.
$T/{\bar{k}}$ = toric part of ${\text {Pic}}^0{\bar{C}}$, as in (2.4).

By the work of Raynaud ([23, 31] Sect. 9), there is a smooth commutative group scheme ${{\mathcal {A}}}/{\mathcal {O}}_{K^{{\text {nr}}}}$, unique up to a unique isomorphism, characterised by the following properties: it is an extension

$$\begin{aligned} 0 \longrightarrow {{\mathcal {T}}}\longrightarrow {{\mathcal {A}}}\longrightarrow {{\mathcal {B}}}\longrightarrow 0, \end{aligned}$$

with ${{\mathcal {T}}}/{\mathcal {O}}_{K^{{\text {nr}}}}$ a torus and ${{\mathcal {B}}}/{\mathcal {O}}_{K^{{\text {nr}}}}$ an abelian scheme, and ${{\mathcal {A}}}\otimes ({{\mathcal {O}}}_{K^{{\text {nr}}}}/m_{K^{{\text {nr}}}}^i)$ is the identity component of $\mathcal {N}\otimes ({{\mathcal {O}}}_{K^{{\text {nr}}}}/m_{K^{{\text {nr}}}}^i)$. Noting that ${{\mathcal {B}}}({{\mathcal {O}}}_{K^{{\text {nr}}}})={{\mathcal {B}}}(K^{{\text {nr}}})$ as ${{\mathcal {B}}}$ is proper, from the commutative diagram

we have

$$\begin{aligned} \frac{{{\mathcal {A}}}(K^{{\text {nr}}})}{{{\mathcal {A}}}({{\mathcal {O}}}_{K^{{\text {nr}}}})} \cong \frac{{{\mathcal {T}}}(K^{{\text {nr}}})}{{{\mathcal {T}}}({{\mathcal {O}}}_{K^{{\text {nr}}}})} = \frac{{\text {Hom}}(X({{\mathcal {T}}}),(K^{{\text {nr}}})^\times )}{{\text {Hom}}(X({{\mathcal {T}}}),{\mathcal {O}}_{K^{{\text {nr}}}}^\times )} = {\text {Hom}}(X({{\mathcal {T}}}),\mathbb {Z}) = X({{\mathcal {T}}})^*. \end{aligned}$$

(2.9)

By the rigidity of tori, we have $X({{\mathcal {T}}})=X(T)$ and so $X({{\mathcal {T}}})^*=X(T)^*$.

The dual abelian variety $A^t/K^{{\text {nr}}}$ has semistable reduction as well, and there is a sequence as above with ${{\mathcal {T}}}^*, {{\mathcal {A}}}^*$ and ${{\mathcal {B}}}^*\cong {{\mathcal {B}}}^t$ ([23] Thm. 5.4). Raynaud constructs a canonical map $X({{\mathcal {T}}}^*)\hookrightarrow {{\mathcal {A}}}(K^{{\text {nr}}})$, inducing a $G_{K^{{\text {nr}}}}$-isomorphism

$$\begin{aligned} A({\bar{K}}) \cong {{\mathcal {A}}}({\bar{K}})/X({{\mathcal {T}}}^*). \end{aligned}$$

In the case of elliptic curves with split multiplicative reduction, this is Tate’s parametrisation $E({\bar{K}})\cong {\bar{K}}^\times /q^\mathbb {Z}$.

Combining with (2.9), we get an inclusion

$$\begin{aligned} n: X({{\mathcal {T}}}^*) \hookrightarrow X({{\mathcal {T}}})^* \end{aligned}$$

(2.10)

with finite cokernel, which is canonically isomorphic to the group of components $\Phi ({\bar{k}})$. We may view n as a non-degenerate bilinear pairing, the monodromy pairing,

$$\begin{aligned} X({{\mathcal {T}}}^*) \times X({{\mathcal {T}}}) \longrightarrow \mathbb {Z}. \end{aligned}$$

(2.11)

If $K'/K$ is a finite extension, then $X({{\mathcal {T}}})$ and $X({{\mathcal {T}}}^*)$ remain the same modules by uniqueness of Raynaud parametrisation, and the map n becomes $e_{K'/K}n$, see [23, 10.3.5]. Because A is a Jacobian, it has a principal polarisation, inducing ${{\mathcal {A}}}\cong {{\mathcal {A}}}^*$, ${{\mathcal {T}}}\cong {{\mathcal {T}}}^*$, ${{\mathcal {B}}}\cong {{\mathcal {B}}}^*$. The pairing (2.10) becomes a symmetric bilinear pairing ([23] Sect. 10.2)

$$\begin{aligned} X({{\mathcal {T}}}) \times X({{\mathcal {T}}}) \longrightarrow \mathbb {Z}, \end{aligned}$$

(2.12)

and it coincides with (2.8), up to identifying $X({{\mathcal {T}}})=X(T)$. Because $\Phi ({\bar{k}})\cong {\text {coker}}n$, we get a perfect symmetric pairing

$$\begin{aligned} \Phi ({\bar{k}}) \times \Phi ({\bar{k}}) \longrightarrow \mathbb {Q}/\mathbb {Z}. \end{aligned}$$

(2.13)

Finally, as in Sect. 2.1, $G_k$ acts on everything, and (2.11), (2.12), (2.13) are $G_k$-equivariant.

2.3 Galois and inertia

As K is a local field, $G_K={\text {Gal}}(K^{sep}/K)$ fits into an exact sequence

$$\begin{aligned} 1 \longrightarrow I_K \longrightarrow G_K \longrightarrow G_k \longrightarrow 1, \end{aligned}$$

with $I_K$ the inertia group, and $G_k\cong {\hat{\mathbb {Z}}}$ topologically generated by the map $x\mapsto x^q$. Any of its lifts to $G_K$ is called an (arithmetic) Frobenius element ${\text {Frob}}$. Write $\chi _l: G_K\rightarrow \mathbb {Z}_l^\times ={\text {GL}}_1(\mathbb {Z}_l)$,

$$\begin{aligned} \chi _l: \quad I_K\mapsto 1, \>\> {\text {Frob}}\mapsto q \end{aligned}$$

for the l-adic cyclotomic character, and $\mathbb {Z}_l(n)=\chi _l^{\otimes n}$ for the Tate twist of the trivial module $\mathbb {Z}_l$.

The inertia group $I_K$ has a unique p-Sylow subgroup, the wild inertia $P_K$, and we have a short exact sequence

$$\begin{aligned} 1 \longrightarrow P_K \longrightarrow I_K \longrightarrow \prod _{l\ne p} \mathbb {Z}_l \longrightarrow 1. \end{aligned}$$

The tame inertia $I_K/P_K$ projects onto $\mathbb {Z}_l$ via the l-adic tame character

2.4 General curves

Fix

C/K = arbitrary non-singular projective curve of genus $\ge 1$,
A/K = Jacobian of C,
F/K = finite Galois extension over which C is semistable, with residue field $k_F$, ring of integers ${\mathcal {O}}_F$ and uniformiser $\pi $,
$\mathcal {C}_{\min }/{\mathcal {O}}_{F^{{\text {nr}}}}$ = minimal regular model of $C/F^{{\text {nr}}}$, with special fibre ${{\bar{C}}}$ (semistable curve), and normalisation $\tilde{{\bar{C}}}$,
$\mathcal {N}/{\mathcal {O}}_F$ = Néron model of $A/F^{{\text {nr}}}$, with special fibre ${\bar{N}}$,
${{\mathcal {J}}}$ = set of connected components of $\tilde{{\bar{C}}}$,
$\Upsilon $ = dual graph of ${\bar{C}}$, as in Sect. 2.1.

For $\sigma \in G_K$, the model $\mathcal {C}_{\min }^\sigma $ is again a stable model of C/F, and so $\mathcal {C}_{\min }^\sigma \cong \mathcal {C}_{\min }$, canonically. As explained in [19] (see also [12, p. 13] and [35, p. 497]), this implies that the Galois action on $C({\bar{K}})$, $A({\bar{K}})$ extends to a semilinear action on the geometric points of the special fibres ${{\bar{C}}}$, ${\bar{N}}$ and $\tilde{{\bar{C}}}$,

$$\begin{aligned} s: G_K \rightarrow {\text {Aut}}{{\bar{C}}}({\bar{k}})\>\> (\rightarrow {\text {Aut}}{\bar{N}}({\bar{k}})). \end{aligned}$$

(2.14)

It is computed as follows. Let ${{\bar{C}}}_{ns}({\bar{k}})\subset {{\bar{C}}}({\bar{k}})$ be the non-singular locus. Write ${\text {red}}$ for the reduction map

$$\begin{aligned} {\text {red}}:C(F^{\text{ nr }}) \>\>{\buildrel {=}\over \longrightarrow }\>\> \mathcal {C}_{\min }({\mathcal {O}}_F^{\text{ nr }}) \>\>{\buildrel \mathrm{reduce}\over \longrightarrow }\>\> {{\bar{C}}}_{ns}({\bar{k}}). \end{aligned}$$

(2.15)

It is surjective by Hensel’s Lemma, so take a section ${\text {red}}^{-1}:{{\bar{C}}}_{ns}({\bar{k}})\rightarrow C(F^{\text{ nr }})$. Then on ${{\bar{C}}}_{ns}({\bar{k}})$, the map s is the composition (cf. [19, Thm 1.5], except there ${\text {red}}^{-1}$ is chosen to land in $C({\bar{F}})$, which is a bit more general)

$$\begin{aligned} s(\sigma ):{{\bar{C}}}_{ns}({\bar{k}}) \>\>{\buildrel {{\text {red}}^{-1}}\over \longrightarrow }\>\> C(F^{\text{ nr }}) \>\>{\buildrel {\sigma }\over \longrightarrow }\>\> C(F^{\text{ nr }}) \>\>{\buildrel \mathrm{red}\over \longrightarrow }\>\> {{\bar{C}}}_{ns}({\bar{k}}). \end{aligned}$$

(2.16)

The reduced curve ${{\bar{C}}}$ has a natural structure of a k-scheme; denote by $\Phi : {{\bar{C}}}\rightarrow {{\bar{C}}}$ the absolute Frobenius map, acting on K(C) by raising everything to the power |k|. If $\sigma ={\text {Frob}}^n\tau $ for some $n\in \mathbb {Z}_{\ge 0}$ and $\tau \in I_K$, then $s(\sigma )=\Phi ^n{\tilde{g}}$ for some k-linear automorphism ${\tilde{g}}$ of ${{\bar{C}}}$. In particular, $n=0$ for $\sigma $ in the inertia group $I_K$, and $s(I_K)=I_{F/K}$ is finite, acting through honest $k_F$-scheme automorphisms. Note that for every $\sigma $ in the Weil group of K (i.e. of the form $\sigma ={\text {Frob}}^n\tau $ as above, but with $n\in \mathbb {Z}$), either $s(\sigma )$ or $s(\sigma ^{-1})$ is a morphism of schemes, so that (a) (2.16) determines it uniquely (on the whole of $G_K$, as the Weil group is dense in it), though it is only defined on the non-singular points, and (b) the action of s extends naturally to the action on $\tilde{{\bar{C}}}({\bar{k}})$,

$$\begin{aligned} {\tilde{s}}: \tilde{{\bar{C}}}({\bar{k}}) \rightarrow \tilde{{\bar{C}}}({\bar{k}}), \end{aligned}$$

(2.17)

on $\mathcal {N}({\bar{k}})$, and on the dual graph $\Upsilon $.

Applying this to the l-power torsion points of $A={\text {Pic}}^0(C)$, from (2.6), (2.7) and [10, Prop. 2.6] (see also [12, p.14]) we find an isomorphism of $G_K$-representations $V_l A\cong (V_l A)_t\!\otimes \!{\text {Sp}}_2\oplus (V_l A)_{ab}$, with

$$\begin{aligned} \begin{array}{llllllllllll} (V_l A)_t &{}\cong &{} H_1(\Upsilon _{C},\mathbb {Z})\otimes _\mathbb {Z}\mathbb {Q}_l, \\ (V_l A)_{ab} &{}\cong &{} \displaystyle \bigoplus _{\Gamma \in {{\mathcal {J}}}/G_K} {\text {Ind}}_{{\text {Stab}}(\Gamma )}^{G_K} V_l{\text {Pic}}^0(\Gamma ),\\ \end{array} \end{aligned}$$

(2.18)

and $G_K$ acting through s on the spaces on the right. Twisting by $\mathbb {Q}_l(1)$ gives a similar decomposition for the étale cohomology group $H^1_{\acute{\mathrm{e}}\text {t}}(C_{{\bar{K}}},\mathbb {Q}_l)$. See [19, Cor 1.6] for details, noting that $H_1(\Upsilon _{C},\mathbb {Z})\otimes _\mathbb {Z}\mathbb {Q}_l\cong H^1(\Upsilon _{C},\mathbb {Z})\otimes _\mathbb {Z}\mathbb {Q}_l$ as a $G_K$-module, since a rational representation is self-dual.

Example 2.19

Consider the curve from Example 2.1 over $K=\mathbb {Q}_p$ ($p>3$),

$$\begin{aligned} C/K:y^2=x^3+p^2. \end{aligned}$$

Fix a primitive 3rd root of unity $\zeta \in {\bar{K}}$ and $\pi =\root 3 \of {p}$. Let $K'=K(\pi )$, and $F=K(\zeta ,\pi )$, its Galois closure. Thus, ${\text {Gal}}(F/K)\cong S_3$ if $p\equiv 2\mod 3$, and $F=K'$ with ${\text {Gal}}(F/K)\cong C_3$ otherwise. In either case, the inertia group $I_{F/K}$ is $C_3$, generated by $\tau $ that sends $\pi \mapsto \zeta \pi $. Let $\Phi $ be a Frobenius element of $K'$; so $\Phi $ fixes $\pi $ and sends $\zeta \mapsto \zeta ^p$. So

$$\begin{aligned} G\>=\>{\text {Gal}}(F^{\text{ nr }}/K) \>=\> \langle \tau ,\Phi \rangle \>\cong \> C_3\rtimes {\hat{\mathbb {Z}}}. \end{aligned}$$

Recall from Example 2.1 that C acquires good reduction over $K'$, and thus over F as well, and the special fibre of its minimal model is the curve ${{\bar{C}}}: y^2=x^3+1$. Using the isomorphism (2.2), the reduction map (2.15) becomes

$$\begin{aligned} \begin{array}{llllllllllll} {\text {red}}:&{} C(F^{\text{ nr }}) &{} \longrightarrow &{} {{\bar{C}}}({\bar{\mathbb {F}}}_p) \\ &{} (x,y) &{} \longmapsto &{} (\tfrac{x}{\pi ^2},\tfrac{y}{\pi ^3})\mod \pi . \\ \end{array} \end{aligned}$$

The semilinear action s of $G_K$ in (2.16) factors through G and is given by

$$\begin{aligned} \begin{array}{llllllllllll} s(\tau ):&{} {{\bar{C}}}({\bar{\mathbb {F}}}_p) &{} \longrightarrow &{} {{\bar{C}}}({\bar{\mathbb {F}}}_p) \\ &{} (x,y) &{} \longmapsto &{} (\zeta x,y) \\ \end{array} \qquad \begin{array}{llllllllllll} s(\Phi ):&{} {{\bar{C}}}({\bar{\mathbb {F}}}_p) &{} \longrightarrow &{} {{\bar{C}}}({\bar{\mathbb {F}}}_p) \\ &{} (x,y) &{} \longmapsto &{} (x^p,y^p) \\ \end{array} \end{aligned}$$

We refer the reader to [16, Sect. 6], [15, Sect. 3-4] and [10] for additional examples and explicit computations of Galois representations attached to curves.

We also recall the formula for the local root number of the Jacobian in the semistable case:

Theorem 2.20

Let C/K be a semistable curve. Then the local root number of the Jacobian $A={\text {Jac}}C$ is

$$\begin{aligned} w_A = (-1)^{a} , \end{aligned}$$

where a is the multiplicity of the trivial representation of $G_k$ in the homology of the dual graph $H_1(\Upsilon ,\mathbb {Q})$ of C.

Proof

This is a standard root number computation, see e.g. [14, Prop. 3.23] with $\tau = {\varvec{1}}$ and $X({{\mathcal {T}}}) = X(T) = H_1(\Upsilon ,\mathbb {Z})$ by (2.6). $\square $
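Since $G_k$ acts on $H_1(\Upsilon ,\mathbb {Q})$ through a finite cyclic quotient generated by the image of Frobenius, the multiplicity a in Theorem 2.20 is the dimension of the subspace fixed by Frobenius. The sketch below takes the matrix of Frobenius on some basis of $H_1(\Upsilon ,\mathbb {Z})$ as input; the function names and this input convention are assumptions made for the illustration.

```python
from fractions import Fraction


def rank(rows):
    """Rank of a rational matrix, by Gaussian elimination with exact fractions."""
    m = [[Fraction(x) for x in row] for row in rows]
    ncols = len(m[0]) if m else 0
    r = 0
    for c in range(ncols):
        piv = next((i for i in range(r, len(m)) if m[i][c] != 0), None)
        if piv is None:
            continue
        m[r], m[piv] = m[piv], m[r]
        for i in range(len(m)):
            if i != r and m[i][c] != 0:
                f = m[i][c] / m[r][c]
                m[i] = [a - f * b for a, b in zip(m[i], m[r])]
        r += 1
    return r


def root_number(frob):
    """(-1)^a, where a = dim ker(Frob - 1) on H_1 of the dual graph."""
    n = len(frob)
    shifted = [[frob[i][j] - (1 if i == j else 0) for j in range(n)] for i in range(n)]
    a = n - rank(shifted)
    return (-1) ** a


print(root_number([[1]]))             # rank one lattice, Frobenius acts trivially
print(root_number([[-1]]))            # rank one lattice, Frobenius acts by -1
print(root_number([[0, 1], [1, 0]]))  # Frobenius swaps two basis cycles
```

The first two cases give $-1$ and $+1$, matching the familiar root numbers of elliptic curves with split and non-split multiplicative reduction; in the third, the fixed space is spanned by the sum of the two cycles, so $a=1$.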

Lemma 2.21

Let C/K be a semistable curve. Then $H_1(\Upsilon _C, \mathbb {Z}) \cong X({{\mathcal {T}}})$ as $\mathbb {Z}[{\text {Frob}}]$-modules with pairing.

Proof

Again, $X({{\mathcal {T}}}) = X(T) = H_1(\Upsilon ,\mathbb {Z})$ by (2.6); for the compatibility with the pairing, see (2.12). $\square $

Lemma 2.22

Suppose C/K is semistable. Let $\Phi $ be the component group of the Néron model of the Jacobian of C over $K^\mathrm{nr}$. Then $\Phi $ is isomorphic, as a $G_k$-module, to the cokernel of

$$\begin{aligned} H_1(\Upsilon _C,\mathbb {Z}) \longrightarrow {\text {Hom}}(H_1(\Upsilon _C,\mathbb {Z}),\mathbb {Z}), \qquad \ell \mapsto \langle \ell ,\cdot \rangle . \end{aligned}$$

Proof

Apply [9, Thm. 9.6/1] over $K^\mathrm{nr}$. $\square $
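Lemma 2.22 reduces the order of $\Phi ({\bar{k}})$ to a determinant: for a basis of $H_1(\Upsilon _C,\mathbb {Z})$, the cokernel of $\ell \mapsto \langle \ell ,\cdot \rangle $ has order equal to the absolute value of the determinant of the Gram matrix of the length pairing (2.8). The sketch below assumes that a basis of cycles is supplied by hand, each cycle being a dictionary from edge names to integer coefficients, with all edges of length 1; the function names are hypothetical.

```python
from fractions import Fraction


def gram(cycles):
    """Gram matrix of the length pairing: <e, e> = 1 and <e, e'> = 0 for e != e'."""
    return [[sum(c1[e] * c2.get(e, 0) for e in c1) for c2 in cycles] for c1 in cycles]


def det(matrix):
    """Determinant by Gaussian elimination with exact fractions."""
    m = [[Fraction(x) for x in row] for row in matrix]
    n = len(m)
    d = Fraction(1)
    for c in range(n):
        piv = next((i for i in range(c, n) if m[i][c] != 0), None)
        if piv is None:
            return Fraction(0)
        if piv != c:
            m[c], m[piv] = m[piv], m[c]
            d = -d
        d *= m[c][c]
        for i in range(c + 1, n):
            f = m[i][c] / m[c][c]
            m[i] = [a - f * b for a, b in zip(m[i], m[c])]
    return d


def component_group_order(cycles):
    """Order of coker(H_1 -> Hom(H_1, Z)) for the given basis of cycles."""
    return int(abs(det(gram(cycles))))


# A single cycle made of five edges.
pentagon = [{"e0": 1, "e1": 1, "e2": 1, "e3": 1, "e4": 1}]
# Two vertices joined by three edges e1, e2, e3, with basis e1 - e2, e2 - e3.
banana = [{"e1": 1, "e2": -1}, {"e2": 1, "e3": -1}]

print(component_group_order(pentagon))
print(component_group_order(banana))
```

The outputs are 5 and 3. Note that this is the order of the full group $\Phi ({\bar{k}})$; computing its $G_k$-invariants additionally requires the action of Frobenius on the cycles.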

Remark 2.23

The size of the $G_k$-invariants of $\Phi $ is known as the Tamagawa number of the Jacobian of C over K.

3 Regular semistable models of $\mathbb {P}^1$

In this section we show how certain finite collections of discs (see Definition 3.2) give rise to regular semistable models of the projective line over $K^{\text{ nr }}$. We then describe (Proposition 3.13) the divisor of a polynomial on this model. In the next section, given a hyperelliptic curve $C/K:y^2=f(x)$ satisfying the semistability criterion we use this to associate to C a particular model of ${\mathbb {P}}^1_{K^{\text{ nr }}}$ on which the divisor of f(x) has nice properties (see Proposition 4.16). We then deduce that the normalisation of this model in the function field $K^{\text{ nr }}(C)$ is a regular model of $C/K^{\text{ nr }}$ (Theorem 5.2) from which several of our main theorems follow.

The relationship between discs and models of the projective line is not new and roughly proceeds as follows. To each disc there is a naturally associated valuation on $\bar{K}(x)$ (we recall this construction in Sect. 3.1.2). Now by [32, Proposition 3.4] (see also [29, Sect. 5]) there is a one-to-one correspondence between normal models of ${\mathbb {P}}^1_{K^{\text{ nr }}}$ and finite non-empty collections of ‘inductive’ valuations on $K^{\text{ nr }}(x)$, the map taking a model to the set of valuations corresponding to the local rings at the generic points of the components of its special fibre. Our model is the one yielding the same collection of valuations as that associated to the collection of discs. To facilitate the analysis of the model, however, we construct it below using explicit charts. That these two descriptions agree follows from Proposition 3.13. For a more general correspondence between normal models of ${\mathbb {P}}^1_{K^{\text{ nr }}}$ and collections of ‘rigid diskoids’ see [32, Proposition 4.4], [29, Sect. 5.1].

3.1 Discs and associated valuations

3.1.1 Discs

A disc is a subset

$$\begin{aligned} D=D_{z,d}:=\{x\in {\bar{K}}~\mid ~ v(x-z)\ge d\} \end{aligned}$$

for some $z\in {\bar{K}}$ and $d\in \mathbb {Q}$. Here d is an invariant of the disc, its depth, denoted $d_D$. If D has depth $d_D$ and $z\in D$ then $D=D_{z,d_D}$; we call any $z\in D$ a centre of D. We say a disc is integral if it has a centre in $K^{{\text {nr}}}$ and integer depth. For an integral disc D we denote by P(D) its ‘parent’ integral disc $P(D)=D_{z_D,d_{D}-1}$ for any $z_D\in D$. We say integral discs D and $D'$ are adjacent if one is the parent of the other.

3.1.2 The valuation associated to a disc

Each disc $D=D_{z_D,d_D}$ defines a valuation $\nu _D$ on the function field ${\bar{K}}(x)$ extending v (see e.g. [2, Sect. 1.4.4]). Explicitly, for a polynomial $f(x)\in {\bar{K}}[x]$, letting $c_i$ denote the coefficient of $x^i$ in $f(x+z_D)$ we have

$$\begin{aligned} \nu _D(f)=\text{ min}_i \{v(c_i)+d_Di\}. \end{aligned}$$

Writing ${\mathcal {R}}\subseteq \bar{K}$ for the (multi)set of roots of f(x) and $c_f$ for its leading coefficient, factoring f(x) as a product of linear polynomials it follows from the fact that $\nu _D$ is a valuation extending v that

$$\begin{aligned} \nu _D(f)=v(c_f)+\sum _{r\in {\mathcal {R}}} \mathrm {min}\{d_D,v(z_D-r)\}. \end{aligned}$$

(3.1)
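
The two descriptions of $\nu _D(f)$ can be compared on a toy example. The sketch below assumes $K=\mathbb {Q}_p$ with $\pi =p$ and a polynomial whose roots, leading coefficient and disc centre are all rational, so that exact arithmetic suffices; the helper names `vp`, `nu_from_coeffs` and `nu_from_roots` are hypothetical and introduced only for illustration.

```python
from fractions import Fraction
from math import inf

def vp(x, p):
    """p-adic valuation of a rational number (inf for 0)."""
    x = Fraction(x)
    if x == 0:
        return inf
    n, d, v = x.numerator, x.denominator, 0
    while n % p == 0:
        n //= p
        v += 1
    while d % p == 0:
        d //= p
        v -= 1
    return v

def poly_from_roots(lead, roots):
    """Coefficients [c_0, c_1, ...] of lead * prod_r (x - r)."""
    coeffs = [Fraction(lead)]
    for r in roots:
        new = [Fraction(0)] * (len(coeffs) + 1)
        for i, c in enumerate(coeffs):
            new[i + 1] += c        # contribution of x * c x^i
            new[i] -= r * c        # contribution of -r * c x^i
        coeffs = new
    return coeffs

def nu_from_coeffs(lead, roots, z, d, p):
    """min_i { v(c_i) + d*i }, with c_i the coefficients of f(x + z)."""
    shifted = poly_from_roots(lead, [Fraction(r) - z for r in roots])
    return min(vp(c, p) + d * i for i, c in enumerate(shifted))

def nu_from_roots(lead, roots, z, d, p):
    """v(c_f) + sum_r min{ d, v(z - r) }, as in (3.1)."""
    return vp(lead, p) + sum(min(d, vp(Fraction(z) - r, p)) for r in roots)

roots = [0, 3, 9, 1, 28]
for z, d in [(0, 1), (0, 2), (1, 2), (1, 0)]:
    print((z, d), nu_from_coeffs(1, roots, z, d, 3), nu_from_roots(1, roots, z, d, 3))
```

Both columns agree for every disc tried, as they must since $\nu _D$ is multiplicative.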

3.2 Admissible collections of discs

The following collections of discs will correspond to regular semistable models of ${\mathbb {P}}^1_{K^{\text{ nr }}}$.

Definition 3.2

Call a finite non-empty collection ${\mathcal {D}}$ of integral discs admissible if

(i)
${\mathcal {D}}$ has a maximal element $D_{\text{ max }}$ with respect to inclusion,
(ii)
if $D_1,D_2\in {\mathcal {D}}$ with $D_1\subseteq D_2$ then every integral disc $D_1\subseteq D\subseteq D_2$ is in ${\mathcal {D}}$ also.

To such ${\mathcal {D}}$ we associate the finite connected rooted tree $T_{\mathcal {D}}$ with vertices $\{v_D \mid D\!\in \!{\mathcal {D}}\}$ and root $v_{D_\text{ max }}$, where $v_D$ and $v_{D'}$ are joined by an edge when D and $D'$ are adjacent. We write ${\mathcal {D}}_i$ (resp. ${\mathcal {D}}_{\le i}$) for the subset of ${\mathcal {D}}$ consisting of discs whose associated vertices are a distance i (resp. at most i) from the root.
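
The conditions of Definition 3.2 and the tree $T_{\mathcal {D}}$ can be checked mechanically in simple cases. The following sketch assumes $K=\mathbb {Q}_p$ and only handles integral discs with a rational centre (a proper subset of all integral discs, whose centres may lie in $K^{{\text {nr}}}$). A disc is encoded as a pair `(centre, depth)`; this encoding and all function names are hypothetical.

```python
from fractions import Fraction
from math import inf

def vp(x, p):
    """p-adic valuation of a rational number (inf for 0)."""
    x = Fraction(x)
    if x == 0:
        return inf
    n, d, v = x.numerator, x.denominator, 0
    while n % p == 0:
        n //= p
        v += 1
    while d % p == 0:
        d //= p
        v -= 1
    return v

def same_disc(D, E, p):
    """D_{z,d} = D_{z',d'} iff d = d' and v(z - z') >= d."""
    return D[1] == E[1] and vp(D[0] - E[0], p) >= D[1]

def contains(big, small, p):
    """small is a subset of big."""
    return small[1] >= big[1] and vp(small[0] - big[0], p) >= big[1]

def is_admissible(discs, p):
    if not discs:
        return False
    # (i) some member contains all the others
    if not any(all(contains(M, D, p) for D in discs) for M in discs):
        return False
    # (ii) every integral disc between two nested members is a member
    for D1 in discs:
        for D2 in discs:
            if contains(D2, D1, p):
                for d in range(D2[1], D1[1] + 1):
                    if not any(same_disc((D1[0], d), E, p) for E in discs):
                        return False
    return True

def tree_edges(discs, p):
    """Pairs (i, j) of indices such that discs[j] is the parent of discs[i]."""
    return [(i, j) for i, D in enumerate(discs) for j, E in enumerate(discs)
            if same_disc((D[0], D[1] - 1), E, p)]

discs = [(0, 0), (0, 1), (1, 1), (0, 2)]
print(is_admissible(discs, 3), tree_edges(discs, 3))
```

Here the root of the tree is the disc $D_{0,0}$, and removing $D_{0,1}$ from the list violates condition (ii).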

Remark 3.3

We will see in Proposition 3.10 that $T_{\mathcal {D}}$ is canonically the dual graph of the model of ${\mathbb {P}}^1_{K^{\text{ nr }}}$ associated to ${\mathcal {D}}$.

3.3 The model of $\mathbb {P}^1_{K^{{\text {nr}}}}$ associated to an admissible collection of discs

Notation 3.4

For the rest of this section we fix an admissible collection of discs ${\mathcal {D}}$, along with a choice of centre $z_D\in K^{{\text {nr}}}$ for each $D\in {\mathcal {D}}$.

In what follows denote by ${\mathcal {O}}$ the ring of integers of $K^{{\text {nr}}}$ and, as in the introduction, let $\pi $ denote a fixed choice of uniformiser for K. We now associate to ${\mathcal {D}}$ a model ${\mathcal {Y}}_{\mathcal {D}}/{\mathcal {O}}$ of $\mathbb {P}^1_{K^{{\text {nr}}}}$, first introducing some objects and notation which will be useful for the construction.

Remark 3.5

The choice of centres above is minor—in particular the model ${\mathcal {Y}}_{\mathcal {D}}/{\mathcal {O}}$ which we associate to ${\mathcal {D}}$ in Definition 3.8 is, up to isomorphism over ${\mathcal {O}}$, independent of this.

3.3.1 The schemes $U_D$, $W_D$ and $Y_D$

Definition 3.6

To each disc $D\in {\mathcal {D}}$ we associate schemes

$$\begin{aligned} U_{D}=\text{ Spec }~{\mathcal {O}}[x_D]~~\text{ and }~~W_D={\left\{ \begin{array}{ll}\text{ Spec }~{\mathcal {O}}[t_D]~~&{}~~D=D_{\text{ max },}\\ \text{ Spec }~{\mathcal {O}}[s_D,t_D]/(\pi -s_Dt_D)~~&{}~~\text{ else }.\end{array}\right. } \end{aligned}$$

We denote by $Y_D$ the glueing of $U_D$ and $W_D$ over the subsets $\{x_D\ne 0\}$ and $\{t_D\ne 0\}$ via the isomorphism $x_D=1/t_D$.

For $D=D_\text{ max }$ we have $Y_D=\mathbb {P}^1_{\mathcal {O}}$ with variable $x_D$. We denote its special fibre $E_{D_\text{ max }}$. For $D\ne D_\text{ max }$, $Y_D$ is the result of blowing up ${\mathbb {A}}^1_{\mathcal {O}}$ with variable $s_D$ at the origin on the special fibre (see e.g. [26, Lemma 8.1.4]). Its special fibre consists of two irreducible components intersecting transversally at the single closed point $s_D=t_D=0$. One component (the exceptional fibre of the blow up) is isomorphic to $\mathbb {P}^1_{{\bar{k}}}$ with variable $x_D$ and we denote it $E_D$. The other is isomorphic to ${\mathbb {A}}^1_{{\bar{k}}}$ with variable $s_D$. It is contained entirely in the complement of $U_D$ and we denote it $F_D$.

Definition 3.7

Let $D\in {\mathcal {D}}$. We define a ‘reduction’ map $D\rightarrow \bar{k}$ (which depends on the choice of centre $z_D$ for D) by setting

$$\begin{aligned} \text{ red}_{D}(z)=\frac{z-z_{D}}{\pi ^{d_{D}}}\quad (\text{ mod }~{\mathfrak {m}}). \end{aligned}$$

Note that this gives a bijection between closed points on the special fibre of $U_D$ and maximal integral subdiscs $D'$ of D, sending $D'$ to the point $x_D=\text{ red}_D(z_{D'})$ for any $z_{D'}\in D'$. Since this does not depend on the choice of centre $z_{D'}$ we henceforth write $\text{ red}_D(D')$ in place of $\text{ red}_D(z_{D'})$.
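
The reduction map is easy to experiment with. The sketch below assumes $K=\mathbb {Q}_p$, $\pi =p$ and rational points and centres, so that values land in $\mathbb {F}_p$ rather than in $\bar{k}$; the function name `red` mirrors the notation above but is otherwise a hypothetical helper.

```python
from fractions import Fraction

def red(z, z_D, d_D, p):
    """(z - z_D) / p^d_D modulo p, for a rational point z of the disc D_{z_D, d_D}."""
    u = (Fraction(z) - z_D) / Fraction(p) ** d_D
    if u.denominator % p == 0:
        raise ValueError("z does not lie in the disc")
    # pow(..., -1, p) is the modular inverse (Python 3.8+)
    return u.numerator * pow(u.denominator, -1, p) % p

p = 5
# 10 and 35 are both centres of the subdisc D_{10,2} of D = D_{0,1}:
print(red(10, 0, 1, p), red(35, 0, 1, p))   # same value
# 15 lies in a different maximal integral subdisc of D:
print(red(15, 0, 1, p))
# Changing the centre of D itself from 0 to 5 shifts all values:
print(red(10, 5, 1, p), red(35, 5, 1, p))
```

The first line shows independence of the chosen centre of $D'$, and the last line shows the dependence on the centre $z_D$ of D noted in the definition.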

We denote by ${\mathcal {P}}_D$ the finite set of points on the special fibre of $U_D$ corresponding to maximal integral subdiscs of D which are in ${\mathcal {D}}$. Similarly, let ${\mathcal {Q}}_D$ denote the finite set of closed points on the special fibre of $W_D$ of the form

$$\begin{aligned} s_D=\frac{z_{D'}-z_D}{\pi ^{d_D-1}}\quad (\text{ mod }~{\mathfrak {m}}) \end{aligned}$$

for $D'\in {\mathcal {D}}$ a ‘sibling’ of D (i.e. such that $D'\ne D$ is a maximal integral subdisc of P(D)).

In what follows we will at times wish to consider the scheme $W_D{\setminus } {\mathcal {P}}_{D}$, viewing both ${\mathcal {P}}_D$ and $W_D$ as subsets of $Y_D$ to form the complement.

3.3.2 The model ${\mathcal {Y}}_{\mathcal {D}}$

We now glue the schemes $Y_D$ with certain points removed to form the model ${\mathcal {Y}}_{\mathcal {D}}$. We will do this in such a way that for $D\ne D_{\text{ max }}$ the component $F_D$ of $Y_D$ glues onto the component $E_{P(D)}$ of $Y_{P(D)}$, identifying the set ${\mathcal {Q}}_{D}$ with ${\mathcal {P}}_{P(D)}$ less the point $x_{P(D)}=\text{ red}_{P(D)}(D)$ in the process.

Definition 3.8

(The model ${\mathcal {Y}}_{\mathcal {D}}$) For $i\ge 0$ we construct inductively schemes ${\mathcal {Y}}_{{\mathcal {D}}_{\le i}}$ which will be covered by

$$\begin{aligned} \{Y_D{\setminus } ({\mathcal {P}}_D\cup {\mathcal {Q}}_D)~~\mid D\in {\mathcal {D}}_{\le i-1}\}~~\text{ and }~~ \{Y_D{\setminus } {\mathcal {Q}}_D~~\mid ~~D\in {\mathcal {D}}_i\}. \end{aligned}$$

In this way we talk about components $E_D$ of ${\mathcal {Y}}_{{\mathcal {D}}_{\le i}}$. We will denote by $\infty $ the point on ${\mathcal {Y}}_{{\mathcal {D}}_{\le i}}$ corresponding to $t_{D_\text{ max }}=0$ on the generic fibre of $Y_{D_\text{ max }}$, and denote by $\overline{\{\infty \}}$ its closure in the model. We then define ${\mathcal {Y}}_{\mathcal {D}}$ to be equal to ${\mathcal {Y}}_{{\mathcal {D}}_{\le n}}$ for n minimal such that ${\mathcal {D}}={\mathcal {D}}_{\le n}$.

First, set ${\mathcal {Y}}_{{\mathcal {D}}_{\le 0}}\!=\!Y_{D_\text{ max }}$. We make this a model of $\mathbb {P}^1_{K^{{\text {nr}}}}$ (thought of with variable x) via the change of variable $x=\pi ^{d_{D_\text{ max }}}x_{D_\text{ max }}+z_{D_\text{ max }}.$

Now given ${\mathcal {Y}}_{{\mathcal {D}}_{\le i}}$ we obtain ${\mathcal {Y}}_{{\mathcal {D}}_{\le i+1}}$ by blowing up ${\mathcal {Y}}_{{\mathcal {D}}_{\le i}}$ at the finite set $\bigcup _{D\in {\mathcal {D}}_i}{\mathcal {P}}_{D}$ of closed points on its special fibre.

Explicitly, since blowing up is a local process, ${\mathcal {Y}}_{{\mathcal {D}}_{\le i+1}}$ is given by glueing each of the schemes $Y_D{\setminus } {\mathcal {Q}}_D$ for $D\in {\mathcal {D}}_{i+1}$ onto ${\mathcal {Y}}_{{\mathcal {D}}_{\le i}}':={\mathcal {Y}}_{{\mathcal {D}}_{\le i}}{\setminus } \bigcup _{D\in {\mathcal {D}}_i}{\mathcal {P}}_{D}$ over the open subsets given by removing $E_D$ from the special fibre of $Y_D{\setminus } {\mathcal {Q}}_D$ ($D\in {\mathcal {D}}_{i+1}$), and removing $\overline{\{\infty \}}\cup \bigcup _{D'\ne P(D)}E_{D'}$ from ${\mathcal {Y}}_{{\mathcal {D}}_{\le i}}'$. The glueing maps are given, for $D\in {\mathcal {D}}_{i+1}$, by

$$\begin{aligned} s_D=x_{P(D)}+\frac{z_{P(D)}-z_D}{\pi ^{d_{D}-1}}. \end{aligned}$$

Remark 3.9

In the function field of ${\mathcal {Y}}_{\mathcal {D}}$ we have, for each $D\in {\mathcal {D}}$,

$$\begin{aligned} x_D=\frac{x-z_D}{\pi ^{d_D}},\quad t_D=1/x_D,~~\text{ and }~~s_Dt_D=\pi . \end{aligned}$$

Moreover, we see from the construction that ${\mathcal {Y}}_{{\mathcal {D}}}$ is covered by the open subsets $U_D{\setminus }{\mathcal {P}}_D$ and $W_D{\setminus } ({\mathcal {Q}}_D\cup {\mathcal {P}}_D)$ as D ranges over all elements of ${\mathcal {D}}$. In particular, every closed point of the special fibre of ${\mathcal {Y}}_{\mathcal {D}}$ is either the point at infinity (i.e. $t_{D_\text{ max }}=0$) on $E_{D_\text{ max }}$, the single point of intersection between $E_D$ and $E_{P(D)}$ (visible as the point $s_D=t_D=0$ on $W_D{\setminus } ({\mathcal {Q}}_D\cup {\mathcal {P}}_D)$) for some $D\ne D_\text{ max }$, or a point of the form $x_D=\text{ red}_D(D')$ on $E_{D}$ for $D'\notin {\mathcal {D}}$ a maximal proper integral subdisc of some $D \in {\mathcal {D}}$, visible on $U_D{\setminus }{\mathcal {P}}_D$.
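
The identities of Remark 3.9 can be checked with exact arithmetic. The sketch below assumes $K=\mathbb {Q}_p$, $\pi =p$, a rational point x and rational centres; `chart_coordinates` is a hypothetical helper. It also confirms that $s_D=\pi x_D$ differs from $x_{P(D)}$ by the constant $(z_{P(D)}-z_D)/\pi ^{d_D-1}$, which follows from the displayed formulas since P(D) has depth $d_D-1$.

```python
from fractions import Fraction

def chart_coordinates(x, z_D, z_P, d_D, p):
    """Values of x_D, t_D, s_D and x_{P(D)} at a rational point x != z_D."""
    pi = Fraction(p)
    x_D = (Fraction(x) - z_D) / pi ** d_D
    t_D = 1 / x_D
    s_D = pi / t_D                      # from s_D * t_D = pi
    x_P = (Fraction(x) - z_P) / pi ** (d_D - 1)
    return x_D, t_D, s_D, x_P

# D = D_{3,2} over Q_3, with the centre 0 chosen for its parent P(D) = D_{0,1}.
x_D, t_D, s_D, x_P = chart_coordinates(7, z_D=3, z_P=0, d_D=2, p=3)
print(s_D * t_D)                        # equals pi = 3
print(s_D - x_P, Fraction(0 - 3, 3))    # both equal (z_P - z_D) / pi^(d_D - 1)
```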

3.3.3 Properties of the model

Proposition 3.10

Let ${\mathcal {D}}$ be an admissible collection of discs and ${\mathcal {Y}}_{{\mathcal {D}}}/{\mathcal {O}}$ the associated model of $\mathbb {P}^1_{K^{{\text {nr}}}}$. Then ${\mathcal {Y}}_{\mathcal {D}}$ is proper regular and semistable with dual graph $T_{\mathcal {D}}$ (Definition 3.2), the vertex $v_D$ corresponding to the component $E_D$. The valuation on $K^{\text{ nr }}(x)$ corresponding to $E_D$ is (the restriction to $K^{\text{ nr }}(x)$ of) $\nu _D$.

Proof

Since ${\mathcal {Y}}_{\mathcal {D}}$ is obtained by iteratively blowing up $\mathbb {P}^1_{{\mathcal {O}}}$ at a finite set of closed points on the special fibre it follows that ${\mathcal {Y}}_{\mathcal {D}}$ is proper and regular. Since none of these points were intersection points between components of the special fibre, ${\mathcal {Y}}_{\mathcal {D}}$ is semistable (as can also be seen from the explicit charts covering ${\mathcal {Y}}_{\mathcal {D}}$). The dual graph is equal to $T_{\mathcal {D}}$ by construction.

Finally, write $\mu _D$ for the valuation associated to $E_D$. It is the valuation on $K^{{\text {nr}}}(x)$ associated to the prime ideal $(\pi )$ of ${\mathcal {O}}[x_D]$, where $x=\pi ^{d_D}x_D+z_D$. Since both $\nu _D$ and $\mu _D$ are valuations, to show that they are equal it suffices to show they agree on all polynomials. Now for $f(x)\in K^{\text{ nr }}[x]$, it follows from the definition of $\nu _D$ that

$$\begin{aligned} \pi ^{-\nu _D(f)}f(x)=\pi ^{-\nu _D(f)}f(\pi ^{d_D}x_D+z_D) \end{aligned}$$

is in ${\mathcal {O}}[x_D]$ but not in the ideal $(\pi )$. Thus $\mu _D(\pi ^{-\nu _D(f)}f(x))=0$ and the result follows. $\square $

3.4 The divisor of a polynomial on the model ${\mathcal {Y}}_{\mathcal {D}}$

We now describe the divisor of a polynomial $f(x)\in K^{\text{ nr }}[x]$ on the model ${\mathcal {Y}}_{\mathcal {D}}$, first introducing some notation with which to describe the result.

Definition 3.11

Let D be an integral disc and $f(x)\in K^{\text{ nr }}[x]$. Define

$$\begin{aligned} \kappa _D(f)=\nu _D(f)-\nu _{P(D)}(f). \end{aligned}$$

Note that if f(x) is a polynomial with (multi)set of roots ${\mathcal {R}}$ then (3.1) gives

$$\begin{aligned} \kappa _D(f)= |D \cap {\mathcal {R}}|+\sum _{\begin{array}{c} r\in {\mathcal {R}}\\ d_D-1<v(r-z_D)<d_D \end{array}}v\left( \frac{r-z_D}{\pi ^{d_D-1}}\right) \quad \quad \ge 0. \end{aligned}$$

(3.12)
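
A small computation shows both regimes of (3.12). The sketch assumes $K=\mathbb {Q}_p$ and a polynomial with rational coefficients, and evaluates $\nu _D$ through the coefficient formula of Sect. 3.1.2; the helpers `vp`, `shift`, `nu` and `kappa` are hypothetical names. For $f=x^2-3$ over $\mathbb {Q}_3$ the two roots $\pm \sqrt{3}$ have valuation 1/2, so they lie outside $D_{0,1}$ but each contributes 1/2 to the sum.

```python
from fractions import Fraction
from math import inf

def vp(x, p):
    """p-adic valuation of a rational number (inf for 0)."""
    x = Fraction(x)
    if x == 0:
        return inf
    n, d, v = x.numerator, x.denominator, 0
    while n % p == 0:
        n //= p
        v += 1
    while d % p == 0:
        d //= p
        v -= 1
    return v

def shift(coeffs, z):
    """Coefficients of f(x + z), given those of f (lowest degree first)."""
    c = [Fraction(a) for a in coeffs]
    n = len(c)
    for i in range(n - 1):
        for j in range(n - 2, i - 1, -1):
            c[j] += z * c[j + 1]
    return c

def nu(coeffs, z, d, p):
    """nu_D(f) = min_i { v(c_i) + d*i } for D = D_{z,d}."""
    return min(vp(c, p) + d * i for i, c in enumerate(shift(coeffs, z)))

def kappa(coeffs, z, d, p):
    """kappa_D(f) = nu_D(f) - nu_{P(D)}(f), where P(D) = D_{z,d-1}."""
    return nu(coeffs, z, d, p) - nu(coeffs, z, d - 1, p)

# f = x^2 - 3 over Q_3, D = D_{0,1}: no roots in D, yet kappa_D(f) = 1.
print(kappa([-3, 0, 1], 0, 1, 3))
# f = x(x-1)(x-3) over Q_3, D = D_{0,1}: kappa_D(f) = |D ∩ R| = 2.
print(kappa([0, 3, -4, 1], 0, 1, 3))
```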

Proposition 3.13

Let ${\mathcal {D}}$ be an admissible collection of discs, ${\mathcal {Y}}_{{\mathcal {D}}}/{\mathcal {O}}$ the associated model of $\mathbb {P}^1_{K^{{\text {nr}}}}$ and $f(x)\in K^{\text{ nr }}[x]$ a polynomial with (multi)set of roots ${\mathcal {R}}\subseteq \bar{K}$. Let $Z\in \text{ Div }({\mathcal {Y}}_{\mathcal {D}})$ denote the divisor

$$\begin{aligned} Z=\text{ div }(f)+\text{ deg }(f)\overline{\{\infty \}}, \end{aligned}$$

and let $Z_\text{ ver }$ (resp. $Z_\text{ hor }$) denote its vertical (resp. horizontal) parts.

(i)
We have
$$\begin{aligned} Z_\text{ ver }=\sum _{D\in {\mathcal {D}}}\nu _D(f)E_D. \end{aligned}$$
(ii)
$Z_\text{ hor }$ does not meet any of the intersection points between components of the special fibre of ${\mathcal {Y}}_{{\mathcal {D}}}$ if and only if
$$\begin{aligned} \kappa _D(f)=|D\cap {\mathcal {R}}| ~~\quad \text{ for } \text{ all } D\in {\mathcal {D}} \text{ with } D\ne D_\text{ max }. \end{aligned}$$
(3.14)

Suppose that (3.14) holds. Then

(iii)
for each $D\in {\mathcal {D}}$, $Z_\text{ hor }$ meets $E_D$ precisely at the points $x_D=\text{ red}_D(D')$ for $D'$ a maximal integral subdisc of D with $D'\notin {\mathcal {D}}$ and $\kappa _{D'}(f)>0$, unless $D=D_\text{ max }$ and $\kappa _D(f)<\text{ deg }(f)$, in which case it additionally meets $E_{D}$ at the point at infinity.
(iv)
$Z_\text{ hor }$ is regular if and only if
$$\begin{aligned} \kappa _{D_{\text{ max }}}(f)\in \{\text{ deg }(f),\text{ deg }(f)-1\} \end{aligned}$$
and for all integral discs D with $P(D)\in {\mathcal {D}}$ and $\kappa _D(f)\ge 2$, we have $D\in {\mathcal {D}}$ also.

We postpone the proof to the end of the section, beginning by defining certain auxiliary polynomials associated to f(x).

Definition 3.15

Let $f(x)\in K^{{\text {nr}}}[x]$ and $D\in {\mathcal {D}}$. We define

$$\begin{aligned} f_D(x_D)=\pi ^{-\nu _D(f)}f(\pi ^{d_D}x_D+z_D)\in {\mathcal {O}}[x_D]. \end{aligned}$$

If $D=D_{\text{ max }}$ we define

$$\begin{aligned} g_D(t_D)=t_D^{\text{ deg }(f)}f_D(1/t_D)\in {\mathcal {O}}[t_D], \end{aligned}$$

whilst if $D\ne D_\text{ max }$ we define

$$\begin{aligned} g_D(s_D,t_D)=\sum _{i\ge 0}\hat{c}_is_D^{v(c_i)+d_Di-\nu _D(f)}t_D^{v(c_i)+(d_D-1)i-\nu _{P(D)}(f)}\in {\mathcal {O}}[s_D,t_D], \end{aligned}$$

where $c_i$ is the coefficient of $x^i$ in $f(x+z_D)$ and $\hat{c}_i=c_i\pi ^{-v(c_i)}$.

Remark 3.16

Inside $K^{\text{ nr }}({\mathcal {Y}}_{\mathcal {D}})$ we have (cf. Remark 3.9)

$$\begin{aligned} f_D(x_D)=\pi ^{-\nu _D(f)}f(x),\quad g_{D_{\text{ max }}}(t_{D_\text{ max }})=t_{D_\text{ max }}^{\text{ deg }(f)}\pi ^{-\nu _{D_\text{ max }}(f)}f(x)~~ \end{aligned}$$

and

$$\begin{aligned} ~~g_D(s_D,t_D)=s_D^{-\nu _D(f)}t_D^{-\nu _{P(D)}(f)}f(x). \end{aligned}$$

In particular, upon proving Proposition 3.13(i) it follows that, for each D, $f_D$ (resp. $g_D$) gives a local equation for $Z_\text{ hor }$ on $U_D{\setminus } {\mathcal {P}}_D$ (resp. $W_D{\setminus } ({\mathcal {P}}_D \cup {\mathcal {Q}}_D)$).

Lemma 3.17

Let $f(x)\in K^{{\text {nr}}}[x]$ have (multi)set of roots ${\mathcal {R}}\subseteq \bar{K}$. Let $D\in {\mathcal {D}}$ with $D\ne D_\text{ max }$ and let $R_D$ denote the intersection point between the two components of the special fibre of $W_D$. Then $R_D$ lies on the closed subscheme $\{g_D(s_D,t_D)=0\}$ of $W_D$ if and only if

$$\begin{aligned} \kappa _D(f)\ne | D \cap {\mathcal {R}}|. \end{aligned}$$

Proof

Note that $R_D$ corresponds to the maximal ideal $(s_D,t_D)$ of ${{\mathcal {O}}[s_D,t_D]/(\pi - s_Dt_D)}$. To simplify notation, for each $0\le i \le \text{ deg }(f)$ write $\lambda _i=v(c_i)+(d_D-1)i$, so that $\nu _D(f)=\text{ min}_i\{\lambda _i+i\}$, $\nu _{P(D)}(f)=\text{ min}_i\{\lambda _i\}$ and

$$\begin{aligned} g_D(s_D,t_D)=\sum _i \hat{c}_i s_D^{\lambda _i+i-\nu _D(f)}t_D^{\lambda _i-\nu _{P(D)}(f)} \end{aligned}$$

for $\hat{c}_i$ as in Definition 3.15. Now $(s_D,t_D)$ does not lie on $\{g_D(s_D,t_D)=0\}$ if and only if $g_D(s_D,t_D)$ has non-zero constant term, or equivalently if and only if

$$\begin{aligned} \text{ min}_i\{\lambda _i+i\}=\text{ min}_i\{\lambda _i\}+j \end{aligned}$$

where j is the smallest integer such that $\lambda _j=\text{ min}_i\{\lambda _i\}$. That is, if and only if $\kappa _D(f)=j$ for $0\le j \le \text{ deg }(f)$ as above.

Considering the Newton polygon of the polynomial $f(\pi ^{d_D-1}x+z_D)$, the valuation of the ith coefficient of which is $\lambda _i$, we see that j is equal to the sum of the lengths of the projections onto the horizontal axis of all segments in the Newton polygon having strictly negative slope. By standard properties of Newton polygons this is equal to the number of roots of $f(\pi ^{d_D-1}x+z_D)$ having strictly positive valuation or, equivalently, equal to the number of roots r of f(x) satisfying $v(r-z_D)>d_{D}-1.$ From (3.12) we see that $\kappa _D(f)$ is equal to the number of such roots if and only if $\kappa _D(f)=|D \cap {\mathcal {R}}|$. $\square $
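
The Newton polygon step of the proof can be tested numerically. The sketch below assumes $K=\mathbb {Q}_p$ and a monic polynomial with rational roots (so that the roots are known exactly); it computes j as the smallest index at which $\text{ min}_i\{\lambda _i\}$ is attained and compares it with a direct count of the roots r satisfying $v(r-z_D)>d_D-1$. All function names are hypothetical.

```python
from fractions import Fraction
from math import inf

def vp(x, p):
    """p-adic valuation of a rational number (inf for 0)."""
    x = Fraction(x)
    if x == 0:
        return inf
    n, d, v = x.numerator, x.denominator, 0
    while n % p == 0:
        n //= p
        v += 1
    while d % p == 0:
        d //= p
        v -= 1
    return v

def monic_from_roots(roots):
    """Coefficients [c_0, c_1, ...] of prod_r (x - r)."""
    coeffs = [Fraction(1)]
    for r in roots:
        new = [Fraction(0)] * (len(coeffs) + 1)
        for i, c in enumerate(coeffs):
            new[i + 1] += c
            new[i] -= r * c
        coeffs = new
    return coeffs

def j_from_newton(roots, z, d, p):
    """Smallest index attaining the minimal coefficient valuation of f(p^(d-1) x + z).

    That polynomial is a constant multiple of the monic polynomial with roots
    (r - z) / p^(d-1); the constant shifts all valuations equally, so it is dropped.
    """
    scaled = [(Fraction(r) - z) / Fraction(p) ** (d - 1) for r in roots]
    lams = [vp(c, p) for c in monic_from_roots(scaled)]
    return lams.index(min(lams))

def direct_count(roots, z, d, p):
    """Number of roots r with v(r - z) > d - 1."""
    return sum(1 for r in roots if vp(Fraction(r) - z, p) > d - 1)

print(j_from_newton([3, 9, 1, 4, 6], 0, 1, 3), direct_count([3, 9, 1, 4, 6], 0, 1, 3))
print(j_from_newton([1, 10, 4, 28, 0], 1, 2, 3), direct_count([1, 10, 4, 28, 0], 1, 2, 3))
```

In both examples the two numbers coincide, including the second one where a rescaled root has negative valuation.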

Lemma 3.18

Let $f(x)\in K^{{\text {nr}}}[x]$.

(i)
Let $D'$ be a maximal integral subdisc of some $D\in {\mathcal {D}}$. Then the closed subscheme $\{f_D(x_D)=0\}$ of $U_D$ meets the point $x_D=\text{ red}_{D}(D')$ on the special fibre of $U_D$ if and only if $\kappa _{D'}(f)>0,$ and if this is the case, is regular at this point if and only if $\kappa _{D'}(f)=1.$
(ii)
The closed subscheme $\{g_{D_\text{ max }}(t_{D_\text{ max }})=0\}$ of $W_{D_\text{ max }}$ meets the point $t_{D_\text{ max }}=0$ on the special fibre if and only if $\kappa _{D_\text{ max }}(f)<\text{ deg }(f)$, in which case it is regular here if and only if $\kappa _{D_\text{ max }}(f)=\text{ deg }(f)-1$.

Proof

(i) Fix a centre $z_{D'}$ of $D'$ so that the point $x_D=\text{ red}_{D}(D')$ in question corresponds to the maximal ideal ${\mathfrak {n}}=(\pi ,x_D-\alpha _{D'})$ of ${\mathcal {O}}[x_D]$, where $\alpha _{D'}=(z_{D'}-z_D)/\pi ^{d_D}$. Denoting by $c_i$ the coefficient of $x^i$ in $f(x+z_{D'})$ we have

$$\begin{aligned} \nu _{D'}(f)=\text{ min}_{i}\{v(c_i)+(d_{D}+1)i\} \end{aligned}$$

and

$$\begin{aligned} f_D(x_D)=\sum _i c_i'(x_D-\alpha _{D'})^i \end{aligned}$$

where $c_i'=c_i\pi ^{id_D-\nu _D(f)}$. Now $f_D(x_D)$ is in ${\mathfrak {n}}$ if and only if $v(c_0')>0$ and, when it is, ${\mathcal {O}}[x_D]/(f_D(x_D))$ is regular at ${\mathfrak {n}}$ if and only if $f_D(x_D)\notin {\mathfrak {n}}^2$ (see [26, Lemma 4.2.2]). This happens if and only if either $v(c_0')=1$ or $v(c_1')=0$ (or both).

Now $v(c_0')>0$ (resp. $v(c_0')=1$ or $v(c_1')=0$) if and only if

$$\begin{aligned} \text{ min}_i\{v(c_i')+i\}>0~~~(\text{ resp. } \text{ min}_i\{v(c_i')+i\}=1) \end{aligned}$$

and since

$$\begin{aligned} \text{ min}_i\{v(c_i')+i\}=\text{ min}_i\{v(c_i)+(d_D+1)i-\nu _D(f)\}=\nu _{D'}(f)-\nu _D(f)=\kappa _{D'}(f) \end{aligned}$$

we are done.

(ii) This may be proved analogously to (i). $\square $

Proof of Proposition 3.13

(i)
Follows from Proposition 3.10 which shows that $\nu _D$ is the valuation on $K^{\text{ nr }}(x)$ corresponding to $E_D$.
(ii)
Follows from Lemma 3.17 noting that by Remark 3.16, for any $D\ne D_{\text{ max }}$, $g_D(s_D,t_D)$ is a local equation for $Z_\text{ hor }$ on $W_D{\setminus } ({\mathcal {P}}_D\cup {\mathcal {Q}}_D)$, where the intersection point between $E_D$ and $E_{P(D)}$ corresponds, in the notation of the lemma, to the point $R_D$.
(iii) and (iv)
The argument is similar to that of (ii) and follows by combining Remark 3.16, Lemma 3.18 (ii) and the description of the closed points on the special fibre of ${\mathcal {Y}}_{{\mathcal {D}}}$ afforded by Remark 3.9. We note that the condition $D\in {\mathcal {D}}$ in (iv) ensures that the point $x_{P(D)}=\text{ red}_{P(D)}(D)$ is removed from $U_{P(D)}$ when forming the model.

$\square $

4 An explicit model of $\mathbb {P}^1$ assuming the semistability criterion

Suppose now that $C/K:y^2=f(x)$ is a hyperelliptic curve satisfying the semistability criterion (Definition 1.8). In this section we associate to C an admissible collection of discs (in the sense of Definition 3.2) which we call ‘valid discs’, and which are closely related to the cluster picture of C. Using the results of the previous section we then study the divisor of f(x) on the associated model of $\mathbb {P}^1_{K^{{\text {nr}}}}$, the main result being Proposition 4.16. This shows in particular that the conditions of [36, Lemma 2.1] are satisfied, so that the normalisation of this model of $\mathbb {P}^1_{K^{{\text {nr}}}}$ in the function field $K^{\text{ nr }}(C)$ is a regular proper model of $C/K^{\text{ nr }}$. This, and its consequences, are treated in Sect. 5. We remark that this approach to constructing regular models of hyperelliptic curves is not new and appears in [24, 36] and [10], although each of these assumes that all roots of f(x) lie in K (or at least $K^{\text{ nr }}$). In particular, these constructions do not cover all curves satisfying the semistability criterion as is needed in our situation.

4.1 Notation

In this section we freely apply the notation and terminology associated to clusters as set out in Table 6 in Appendix E. As in the introduction we denote by ${\mathcal {R}}\subseteq {\bar{K}}$ the set of roots of f(x). We also frequently use the following additional definition.

Definition 4.1

If ${{\mathfrak {s}}}\subseteq {\mathcal {R}}$ is a proper cluster we call the unique smallest disc cutting it out the defining disc of ${{\mathfrak {s}}}$, and denote it $D({{\mathfrak {s}}})$.

Note that for any proper cluster ${{\mathfrak {s}}}$, the disc $D({{\mathfrak {s}}})$ has depth $d_{\mathfrak {s}}$ and, in the notation of Definition 1.7 and Sect. 3.1.2, it follows from (3.1) that $\nu _{D({{\mathfrak {s}}})}(f)=\nu _{{\mathfrak {s}}}$.

We also note here that by part (1) of the semistability criterion, all roots of f(x) are defined over the unique quadratic extension of $K^{\text{ nr }}$. In particular, every proper cluster ${{\mathfrak {s}}}\subseteq {\mathcal {R}}$ has depth $d_{{\mathfrak {s}}}\in \frac{1}{2}{\mathbb {Z}}$, and if $r\in {\mathcal {R}}$ with $r\notin K^{\text{ nr }}$ then denoting by $r'$ its inertia-conjugate root we have $v(r-r')\in 1/2+{\mathbb {Z}}$.

4.2 The collection of valid discs

To first approximation the set of valid discs consists of all discs of the form $D({{\mathfrak {s}}})$ for a proper cluster ${{\mathfrak {s}}}$, along with all integral discs between them. However, the precise definition is slightly more complicated, mainly owing to the failure of some proper clusters to have integer depth. The following lemma characterises this phenomenon.

Lemma 4.2

Let ${{\mathfrak {s}}}\subseteq {\mathcal {R}}$ be a proper cluster. Then ${{\mathfrak {s}}}$ has a centre in $K^{{\text {nr}}}$. Moreover, $d_{\mathfrak {s}}\in {\mathbb {Z}}$ unless we have either

(i)
${{\mathfrak {s}}}=\{r,r'\}$, or
(ii)
${{\mathfrak {s}}}={\mathcal {R}}$ has a unique proper child ${{\mathfrak {s}}}'$, which has size 2g, and ${{\mathfrak {s}}}{\setminus } {{\mathfrak {s}}}'=\{r,r'\}$,

for two inertia-conjugate roots $r,r'\notin K^{\text{ nr }}$. In these cases we have $d_{{\mathfrak {s}}}\in 1/2+{\mathbb {Z}}$.

Proof

Since the hyperelliptic curve $C:y^2=f(x)$ satisfies the semistability criterion, ${\mathcal {R}}$ is tamely ramified and every proper cluster inertia invariant. In particular, every proper cluster has centre in $K^{{\text {nr}}}$ by Lemma B.1. Now for a proper cluster ${{\mathfrak {s}}}$, if ${{\mathfrak {s}}}=\{r\} \wedge \{ r'\}$ for two inertia-conjugate roots $r,r'\notin K^{{\text {nr}}}$, then $d_{{\mathfrak {s}}}=v(r-r')\in 1/2+{\mathbb {Z}}$, whence ${{\mathfrak {s}}}$ is not principal (see Definition 1.5). An easy case by case analysis shows that this forces ${{\mathfrak {s}}}$ to fall into one of cases (i) or (ii) above. Otherwise, for any two (not necessarily proper) children ${{\mathfrak {s}}}_1$ and ${{\mathfrak {s}}}_2$ of ${{\mathfrak {s}}}$ we may find centres $z_1$ and $z_2$ in $K^{{\text {nr}}}$ respectively. Then $v(z_1-z_2)$ is an integer and it follows that $d_{{\mathfrak {s}}}$, being the minimum of such valuations as ${{\mathfrak {s}}}_1,{{\mathfrak {s}}}_2$ vary over all children of ${{\mathfrak {s}}}$, is an integer also. $\square $

Remark 4.3

The lemma shows that for any proper cluster ${{\mathfrak {s}}}$, the disc $D({{\mathfrak {s}}})$ is integral save when ${{\mathfrak {s}}}$ falls into one of the cases (i) or (ii) above.

Definition 4.4

(Valid discs) Call ${\mathcal {R}}$ exceptional (of type A resp. B) if:

(A)
${\mathcal {R}}$ has a unique proper child ${{\mathfrak {s}}}$, which has size 2g, and $d_{\mathcal {R}}\in 1/2+{\mathbb {Z}}$,
(B)
${\mathcal {R}}$ has size $2g+2$ and a child of size $2g+1$.

We define the integral disc $D_\text{ max }$ as follows. If ${\mathcal {R}}$ is not exceptional set $D_\text{ max }=D({\mathcal {R}})$. If ${\mathcal {R}}$ is exceptional of type A define $D_{\text{ max }}$ to be the maximal integral disc cutting out the child of size 2g, and if ${\mathcal {R}}$ is exceptional of type B let $D_{\text{ max }}$ be the defining disc of the child of size $2g+1$.

We now say that an integral disc D is valid if it is contained in $D_{\text{ max }}$ and $|D\cap {\mathcal {R}}|\ge 2$. We denote the collection of all valid discs by ${\mathcal {D}}$, noting that ${\mathcal {D}}$ is admissible in the sense of Definition 3.2.
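
For a polynomial with distinct rational roots the collection of valid discs can be listed directly. The sketch below assumes $K=\mathbb {Q}_p$ and rational roots, so that all clusters have integer depth and the exceptional type A cannot occur; it does not check the semistability criterion. A disc is recorded as the pair (depth, set of roots it contains), which determines it because every valid disc contains a root. The function names are hypothetical.

```python
from fractions import Fraction
from math import inf

def vp(x, p):
    """p-adic valuation of a rational number (inf for 0)."""
    x = Fraction(x)
    if x == 0:
        return inf
    n, d, v = x.numerator, x.denominator, 0
    while n % p == 0:
        n //= p
        v += 1
    while d % p == 0:
        d //= p
        v -= 1
    return v

def min_pair_val(roots, p):
    """Depth of the smallest disc containing all the given (distinct) roots."""
    return min(vp(a - b, p) for i, a in enumerate(roots) for b in roots[i + 1:])

def depth_of_D_max(roots, p):
    """Depth of D_max: that of D(R), or of the child of size |R| - 1 in type B."""
    d_R = min_pair_val(roots, p)
    if len(roots) % 2 == 0:
        for r0 in roots:
            rest = [s for s in roots if s != r0]
            d_child = min_pair_val(rest, p)
            if d_child > d_R:          # rest is a child of size |R| - 1
                return d_child
    return d_R

def valid_discs(roots, p):
    roots = [Fraction(r) for r in roots]
    d_max = depth_of_D_max(roots, p)
    top = max(vp(a - b, p) for i, a in enumerate(roots) for b in roots[i + 1:])
    discs = set()
    for r in roots:
        for d in range(d_max, top + 1):
            inside = frozenset(s for s in roots if vp(s - r, p) >= d)
            if len(inside) >= 2:       # validity: at least two roots in the disc
                discs.add((d, inside))
    return sorted(discs, key=lambda D: (D[0], sorted(D[1])))

for depth, inside in valid_discs([0, 9, 1, 4, 2], 3):
    print(depth, sorted(int(r) for r in inside))
```

In this example the disc of depth 1 containing 0 and 9 is valid although it is not the defining disc of a cluster: it lies strictly between $D({\mathcal {R}})$ and the defining disc of the cluster $\{0,9\}$, which has depth 2.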

Remark 4.5

When ${\mathcal {R}}$ is exceptional of type A, if $z_{{\mathfrak {s}}}\in K^{\text{ nr }}$ is a centre for the child ${{\mathfrak {s}}}$ of size 2g then $D_\text{ max }=D_{z_{{\mathfrak {s}}},d_{\mathcal {R}} - 1/2}$. Moreover, it follows from Lemma 4.2 that $d_{\mathcal {R}}\in 1/2+{\mathbb {Z}}$ alone forces ${\mathcal {R}}$ to be exceptional of type A.

Notation 4.6

For the rest of this section fix a choice of centre $z_D\in K^{\text{ nr }}$ for each valid disc D (this is necessary to use the notation of the previous section when talking about the model of ${\mathbb {P}}^1_{K^{\text{ nr }}}$ associated to the collection of valid discs, since such a choice was made in Sect. 3.3).

4.3 The quantity $\kappa _D(f)$

By Proposition 3.13, to study the divisor of f(x) on the model associated to the collection of valid discs we wish to understand the quantities $\nu _D(f)$ and $\kappa _D(f)$ for integral discs D. We begin by considering the second of these functions.

Lemma 4.7

Let D be an integral disc. Then $\kappa _D(f)=|D\cap {\mathcal {R}}|$ unless either of the following hold:

(i)
${\mathcal {R}}$ is exceptional of type A and $D=D_{\text{ max }}$, in which case $\kappa _D(f)=\text{ deg }(f)-1$,
(ii)
$D=D_{z_{{\mathfrak {t}}},\lceil d_{{\mathfrak {t}}}\rceil }$ for ${{\mathfrak {t}}}$ a twin with $d_{{\mathfrak {t}}}\in \frac{1}{2}+{\mathbb {Z}}$ and $z_{{\mathfrak {t}}}$ a centre (see Footnote 5) for ${{\mathfrak {t}}}$ in $K^{{\text {nr}}}$, in which case $D\cap {\mathcal {R}}=\emptyset $ and $\kappa _D(f)=1$.

In particular, if $D\ne D_\text{ max }$ is a valid disc then $\kappa _D(f)=|D\cap {\mathcal {R}}|$.

We will need the following easy lemma, whose proof we omit.

Lemma 4.8

Let D be a disc with centre in $K^{{\text {nr}}}$ and non-integral depth $d_D$. Then D has a unique maximal proper integral subdisc given by $D_{z_D,\lceil d_D \rceil }$ for any $z_D\in D\cap K^{{\text {nr}}}$.

Proof of Lemma 4.7

Fix an integral disc $D=D_{z_D,d_D}$ with $z_D\in K^{{\text {nr}}}$ and suppose $\kappa _D(f)\ne |D\cap {\mathcal {R}}|$. By (3.12) we have

$$\begin{aligned} \kappa _D(f)- |D \cap {\mathcal {R}}| =\sum _{\begin{array}{c} r\in {\mathcal {R}}\\ d_D-1<v(r-z_D)<d_D \end{array}}v\left( \frac{r-z_D}{\pi ^{d_D-1}}\right) . \end{aligned}$$

Fix $r\in {\mathcal {R}}$ contributing non trivially to the right hand side. Then necessarily $r\notin K^{{\text {nr}}}$ and, denoting by $r'$ its inertia-conjugate root,

$$\begin{aligned}d_D-1/2=v\left( r-z_D\right) =v(r'-z_D).\end{aligned}$$

Let ${{\mathfrak {s}}}=\{r\}\wedge \{r'\}$ and pick a centre $z_{{\mathfrak {s}}}\in K^{{\text {nr}}}$. Note that $d_{{\mathfrak {s}}}=v(r-r')\notin {\mathbb {Z}}$ and as also $v(z_D-r)\notin {\mathbb {Z}}$ we must have

$$\begin{aligned} v(z_D-r)=v\left( z_D-z_{{\mathfrak {s}}}+z_{{\mathfrak {s}}}-r\right) =v(z_{{\mathfrak {s}}}-r)=d_{{\mathfrak {s}}}. \end{aligned}$$

Thus $z_D$ is a centre for ${{\mathfrak {s}}}$ and we deduce that D is the maximal integral subdisc of $D({{\mathfrak {s}}})$ afforded by Lemma 4.8. By Lemmas 4.2 and 4.8, D is now one of the discs claimed and the rest follows. $\square $

4.4 The parity of $\nu _D(f)$

Lemma 4.9

Let ${{\mathfrak {s}}}$ be a proper cluster with $d_{{\mathfrak {s}}}\in {\mathbb {Z}}$. Then $\nu _{\mathfrak {s}}$ is even unless ${{\mathfrak {s}}}={\mathcal {R}}$ has size $2g+2$ and ${\mathcal {R}}={{\mathfrak {s}}}_{1}\sqcup {{\mathfrak {s}}}_{2}$ is a union of two odd children with one of the $\delta _{{{\mathfrak {s}}}_i}$ odd (equivalently both if they have size $\ge 3$, cf. Lemma C.6).

Proof

By the semistability criterion every principal cluster ${{\mathfrak {s}}}$ has $\nu _{{\mathfrak {s}}}$ even. Now combine Lemma 4.2 with Lemma C.5. $\square $

We now use this to characterise the valid discs D for which $\nu _D(f)$ is odd (resp. even).

Lemma 4.10

Let D be a valid disc and ${{\mathfrak {s}}}=D\cap {\mathcal {R}}$. Then $\nu _D(f)$ is odd if and only if one of the following hold:

(i)
Both ${{\mathfrak {s}}}$ and $d_D\!\!-\!\!d_{{\mathfrak {s}}}\!$ are odd (in particular D is not the defining disc of a cluster), or
(ii)
$D=D_\text{ max }$ and ${\mathcal {R}}={{\mathfrak {s}}}_1\sqcup {{\mathfrak {s}}}_2$ is a union of two odd proper children with $\delta _{{{\mathfrak {s}}}_1}$ odd (equivalently $\delta _{{{\mathfrak {s}}}_2}$ odd, cf. Lemma C.6).

Proof

First note that by Lemma 4.7 if $D_1\subseteq D_2$ are valid discs and $D'\cap {\mathcal {R}}=D_1\cap {\mathcal {R}}$ for each intermediate integral disc $D_1\subseteq D' \subsetneq D_2$ then

$$\begin{aligned} \nu _{D_1}(f)=\nu _{D_2}(f)+|D_1\cap {\mathcal {R}}|(d_{D_1}-d_{D_2}). \end{aligned}$$

(4.11)

To prove the lemma, note first that if $D({{\mathfrak {s}}})$ is integral and $\nu _{{\mathfrak {s}}}$ even then we conclude by applying (4.11) with $D_1=D({{\mathfrak {s}}})$ and $D_2=D$. If $D=D_\text{ max }$ and ${\mathcal {R}}={{\mathfrak {s}}}_1\sqcup {{\mathfrak {s}}}_2$ is a union of two odd proper children then we conclude by applying (4.11) with $D_1=D({{\mathfrak {s}}}_1)$ and $D_2=D_\text{ max }$. Now by Lemmas 4.2 and 4.9 the only remaining case is ${{\mathfrak {s}}}$ a twin with $\delta _{{\mathfrak {s}}}\in 1/2+{\mathbb {Z}}$. Applying (4.11) with $D_1=D$ and $D_2=D(P({{\mathfrak {s}}}))$ shows that $\nu _D(f)$ is even as desired, since $\nu _{P({{\mathfrak {s}}})}$ is even by Lemma 4.9. $\square $

4.5 The model of ${\mathbb {P}}^1$ associated to the collection of valid discs

Definition 4.12

Define ${\mathcal {Y}}_{\text{ disc }}/{\mathcal {O}}$ to be the regular, proper, semistable model of ${\mathbb {P}}^1_{K^{\text{ nr }}}$ associated to the collection of valid discs via Definition 3.8.

We retain the notation of Sect. 3 to talk about ${\mathcal {Y}}_{\text{ disc }}$. Thus the components of its special fibre are indexed by discs $D\in {\mathcal {D}}$, the component corresponding to a disc D being isomorphic to ${\mathbb {P}}^1_{\bar{k}}$ with variable $x_D$, and denoted $E_D$. The point at infinity on the generic fibre of ${\mathcal {Y}}_{\text{ disc }}$ is denoted $\infty $, and $\overline{\{\infty \}}$ denotes the closure of this point in ${\mathcal {Y}}_{\text{ disc }}$. Its intersection with the special fibre is the point at infinity on the component $E_{D_\text{ max }}$.

Definition 4.13

We denote by $B\in \text{ Div }({\mathcal {Y}}_\text{ disc})$ the divisor

$$\begin{aligned} B=\text{ div }(f)_\text{ hor }+(2g+2)\overline{\{\infty \}}+\sum _{D\in {\mathcal {D}}}\omega _D(f)E_D \end{aligned}$$

where $\omega _D(f)\in \{0,1\}$ is such that $\omega _D(f)\equiv \nu _D(f)~~\text{(mod } \text{2) }$, and $\text{ div }(f)_\text{ hor }$ denotes the horizontal part of the divisor of f(x).

Remark 4.14

In the next section we will consider the normalisation of ${\mathcal {Y}}_{\text{ disc }}$ in the function field of $C:y^2=f(x)$, which we denote ${\mathcal {C}}_{\text{ disc }}$. We show in Theorem 5.2 that the divisor B above is the branch locus of the associated morphism ${\mathcal {C}}_{\text{ disc }}\rightarrow {\mathcal {Y}}_{\text{ disc }}$.

To describe the divisor of f(x) on ${\mathcal {Y}}_{\text{ disc }}$ it will be convenient to introduce the following notation.

Notation 4.15

For a proper cluster ${{\mathfrak {s}}}$ with $D({{\mathfrak {s}}})$ strictly contained in a valid disc D, by $\text{ red}_{D}({{\mathfrak {s}}})$ we mean $\text{ red}_D(z_{{\mathfrak {s}}})$ (Definition 3.7) for any centre $z_{{\mathfrak {s}}}$ of ${{\mathfrak {s}}}$. Note that this is independent of the choice of centre for ${{\mathfrak {s}}}$. For ${{\mathfrak {s}}}=\{r\}$ a singleton, for any valid disc D containing r we define $\text{ red}_{D}({{\mathfrak {s}}}):=\text{ red}_{D}(r)$.

Proposition 4.16

Let $C/K:y^2=f(x)$ be a hyperelliptic curve satisfying the semistability criterion and let ${\mathcal {Y}}_\text{ disc }/{\mathcal {O}}$ and $B\in \text{ Div }({\mathcal {Y}}_{\text{ disc }})$ be as above. Then B is effective, regular, and $\text{ div }(f)\equiv B~~\text{(mod } \text{2) }$ inside $\text{ Div }({\mathcal {Y}}_{\text{ disc }})$. Moreover, the horizontal part $B_\text{ hor }$ of B meets the special fibre of ${\mathcal {Y}}_{\text{ disc }}$ at precisely the following points:

the point at infinity on $E_{D_\text{ max }}$ if $\text{ deg }(f)=2g+1$ or ${\mathcal {R}}$ is exceptional,
the points $x_D=\text{ red}_D({{\mathfrak {s}}}')$ on $E_D$ if $D=D({{\mathfrak {s}}})$ for a proper cluster ${{\mathfrak {s}}}$ and ${{\mathfrak {s}}}'<{{\mathfrak {s}}}$ is either a singleton, or a twin with $d_{{{\mathfrak {s}}}'}=d_{{\mathfrak {s}}}+1/2$ (note that these points are distinct as we vary ${{\mathfrak {s}}}'$),
the point $x_D=\text{ red}_D({{\mathfrak {t}}})$ on $E_D$ if $D\cap {\mathcal {R}}={{\mathfrak {t}}}$ for ${{\mathfrak {t}}}$ a twin with $d_{{\mathfrak {t}}}=d_D+1/2$.

Proof

It follows from Proposition 3.13 (i) along with the definition of $\omega _D(f)$ that $B\equiv \text{ div }(f)~~\text{(mod } \text{2) }$, and it’s clear that B is effective.

Combining Proposition 3.13 (iii), (iv) and (v) with Lemma 4.7, we see that $B_\text{ hor }$ is regular and meets the special fibre ${\mathcal {Y}}_{\text{ disc },\bar{k}}$ of ${\mathcal {Y}}_{\text{ disc }}$ precisely at the points claimed (in particular $B_\text{ hor }$ does not meet any intersection points between the components of ${\mathcal {Y}}_{\text{ disc },\bar{k}}$).

Next it follows from Lemma 4.10 that if $D_1$ and $D_2$ are adjacent valid discs then $\nu _{D_1}(f)$ and $\nu _{D_2}(f)$ are not both odd. As any two components $E_{D_1}$ and $E_{D_2}$ are individually regular, and intersect if and only if $D_1$ and $D_2$ are adjacent, we deduce that the vertical part $B_\text{ ver }$ of B is regular.

To conclude we claim that $B_{\text{ hor }}$ and $B_{\text{ ver }}$ are disjoint, i.e. that if $B_\text{ hor }$ meets $E_D$ then $\nu _D(f)$ is even. This follows from Lemma 4.10. $\square $

4.6 The dual graph of ${\mathcal {Y}}_{\text{ disc }}$

By Proposition 3.13 the dual graph of ${\mathcal {Y}}_{\text{ disc }}$ is the graph $T_{\mathcal {D}}$ associated to the collection of valid discs as in Definition 3.2. Here we describe this graph in terms of clusters. We begin with an explicit description, and then recast this in terms of the purely combinatorial notions introduced in [18]. For convenience, we summarise the relevant parts of that paper in Appendix D.

Notation 4.17

Let $\widehat{T_{\mathcal {D}}}$ denote the graph obtained from $T_{\mathcal {D}}$ by removing from the vertex set (keeping the underlying metric space the same) all $v_D$ corresponding to discs $D\ne D_\text{ max }$ for which there is a smaller valid disc cutting out the cluster $D\cap {\mathcal {R}}$.

Lemma 4.18

The graph $\widehat{T_{\mathcal {D}}}$ is, via $v_D\leftrightarrow v_{D\cap {\mathcal {R}}}$, the metric graph with:

one vertex $v_{{\mathfrak {s}}}$ for each cluster ${{\mathfrak {s}}}$ which is not
- a twin ${{\mathfrak {t}}}$ with $\delta _{{\mathfrak {t}}}=1/2$,
- ${\mathcal {R}}$ if either ${\mathcal {R}}$ has size $2g+2$ and a child of size $2g+1$, or a child ${{\mathfrak {s}}}$ with $|{{\mathfrak {s}}}|=2g$ and $\delta _{{\mathfrak {s}}}=1/2$,
an edge of length $\lfloor \delta _{{\mathfrak {s}}}\rfloor $ between $v_{{\mathfrak {s}}}$ and $v_{P({{\mathfrak {s}}})}$ (should $P({{\mathfrak {s}}})$ yield a vertex).

Proof

This follows from Lemma 4.2 and the definition of the collection of valid discs. $\square $

In Example D.2 it is shown how, to f(x), one may associate a purely combinatorial object called a metric cluster picture (Definition D.1). To this one may then formally associate a certain metric graph called a BY tree (this procedure is explained in Definition D.6) which is very closely related to the dual graph of (the special fibre of) ${\mathcal {Y}}_{\text{ disc }}$, as the following result shows. This will be important later since it enables us to apply combinatorial results of [18] to hyperelliptic curves.

Lemma 4.19

Let $\Sigma $ denote the metric cluster picture associated to f(x), and $T_\Sigma $ denote the associated BY tree. Define $T_\Sigma '$ to be the graph obtained from $T_\Sigma $ by (in order)

halving the length of all yellow edges,
replacing each edge of length $l\in 1/2+{\mathbb {Z}}$ joining a leaf v to a vertex $v'$, with an edge of length $\lfloor l \rfloor $ (identifying v and $v'$ if $\lfloor l \rfloor =0$),
adding a vertex $v_{{\mathcal {R}}}$ on the edge joining $v_{{\mathfrak {s}}_1}$ and $v_{{\mathfrak {s}}_2}$, at a distance $\delta _{{{\mathfrak {s}}}_1}$ from $v_{{\mathfrak {s}}_1}$, if ${\mathcal {R}}$ has size $2g+2$ and is a disjoint union of two proper clusters ${\mathfrak {s}}_1$ and ${\mathfrak {s}}_2$.

Then the map sending a vertex $v_D\in \widehat{T_{\mathcal {D}}}$ to the vertex $v_{D\cap {\mathcal {R}}}\in T_\Sigma '$ is an isomorphism of metric graphs.

Proof

Combine Lemma 4.18 with the description of $T_\Sigma $ given in Definition D.6. $\square $
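To see the first two steps of Lemma 4.19 in action, consider two hypothetical yellow edges of $T_\Sigma $, of lengths 3 and 1, each joining a leaf to another vertex (the lengths are chosen only for illustration). Halving and then applying the leaf rule gives

$$\begin{aligned} 3\;\longmapsto \;3/2\;\longmapsto \;\lfloor 3/2\rfloor =1,\qquad 1\;\longmapsto \;1/2\;\longmapsto \;\lfloor 1/2\rfloor =0, \end{aligned}$$

so the first leaf stays at distance 1 from its neighbour, while the second is identified with its neighbour. A yellow edge of length 4 joining a leaf is halved to 2 and left unchanged by the second step. If, as we understand Definition D.6, such a leaf comes from a twin ${{\mathfrak {t}}}$ attached by a yellow edge of length $2\delta _{{\mathfrak {t}}}$, then the two cases above are $\delta _{{\mathfrak {t}}}=3/2$ and $\delta _{{\mathfrak {t}}}=1/2$, matching the edge length $\lfloor \delta _{{\mathfrak {t}}}\rfloor $ and the omitted vertex in Lemma 4.18.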

Remark 4.20

When all the roots of f(x) lie in $K^{{\text {nr}}},$ so that all relative depths are integers, it follows from Lemma 4.19 that $\widehat{T_{\mathcal {D}}}$ is simply the result of halving the length of all yellow edges in $T_\Sigma $ (up to possibly adding in a vertex corresponding to the cluster ${\mathcal {R}}$).

5 A regular model of C assuming the semistability criterion

As in Sect. 4 let $C/K:y^2=f(x)$ be a hyperelliptic curve satisfying the semistability criterion (Definition 1.8). We begin by using the results of the previous section to give an explicit regular proper model of $C/K^{\text{ nr }}$. We then use this to determine the minimal proper regular model of C and show that it is semistable. Having done this, we proceed to describe the special fibre of the minimal regular model and relate this to the cluster picture of f(x).

We continue to denote the ring of integers of $K^{\text{ nr }}$ by ${\mathcal {O}}$. As in Notation 4.6 we fix centres $z_D\in K^{\text{ nr }}$ for each valid disc D (see Definition 4.4). Let ${\mathcal {Y}}_\text{ disc }/{\mathcal {O}}$ be the model of $\mathbb {P}^1_{K^{{\text {nr}}}}$ associated to the collection of valid discs as in Definition 4.12, and let $B\in \text{ Div }({\mathcal {Y}}_{\text{ disc }})$ be as in Definition 4.13, so that, by Proposition 4.16, B is effective, regular, and congruent to $\text{ div }(f)$ modulo 2.

Definition 5.1

Define the scheme ${\mathcal {C}}_{\text{ disc }}/{\mathcal {O}}$ as the normalisation of ${\mathcal {Y}}_{\text{ disc }}/{\mathcal {O}}$ in the function field $K^{\text{ nr }}(C)$ of $C/K^{\text{ nr }}$. Denote by $\phi :{\mathcal {C}}_{\text{ disc }}\rightarrow {\mathcal {Y}}_{\text{ disc }}$ the associated normalisation morphism. We write ${\mathcal {C}}_{\text{ disc },\bar{k}}$ for the special fibre of ${\mathcal {C}}_{\text{ disc }}$.

Theorem 5.2

Let $C/K:y^2=f(x)$ be a hyperelliptic curve satisfying the semistability criterion. Then ${\mathcal {C}}_{\text{ disc }}/{\mathcal {O}}$ is a proper regular model of C and $\phi :{\mathcal {C}}_{\text{ disc }}\rightarrow {\mathcal {Y}}_{\text{ disc }}$ is finite flat of degree 2 with branch locus B.

Proof

That the normalisation morphism is finite flat of degree 2 is standard (the degree being equal to the degree of $K^{{\text {nr}}}(C)/K^{{\text {nr}}}(x)$). In particular, ${\mathcal {C}}_{\text{ disc }}/{\mathcal {O}}$ is proper since both $\phi $ and ${\mathcal {Y}}_{\text{ disc }}/{\mathcal {O}}$ are. That ${\mathcal {C}}_\text{ disc }$ is regular is a consequence of the regularity of both ${\mathcal {Y}}_{\text{ disc }}$ and B (the latter requiring the semistability criterion, see Proposition 4.16), and follows from [36, Lemma 2.1] (taking $Y={\mathcal {Y}}_{\text{ disc }}$ and $f=f(x)$ in the statement). Indeed, from the definition of B, one sees that, writing $\text{ div }(f)=\sum _{i\in I}m_i\Gamma _i$ as in the notation of loc. cit., the divisor B is equal to $\sum _{i\in I}\bar{m}_i\Gamma _i$ for $\bar{m}_i\in \{0,1\}$ congruent to $m_i$ modulo 2. It’s then clear that regularity of B forces the hypotheses (a) and (b) of loc. cit. to be satisfied, guaranteeing the regularity of the normalisation of ${\mathcal {Y}}_{\text{ disc }}$ in $K^\text{ nr }(C)$. Finally, the claim about the branch locus follows from the description of the normalisation of the local rings given in the proof of loc. cit., noting that for a point P of ${\mathcal {Y}}_{\text{ disc }}$, P lies on B if and only if, in the expression for $\text{ div }(f)$ as $\sum _{i\in I}m_i\Gamma _i$ above, $m_i$ is odd for a $\Gamma _i$ which contains P. $\square $

Remark 5.3

Since we have defined ${\mathcal {C}}_{\text{ disc }}$ as the normalisation in $K^{\text{ nr }}(C)$ of a model of ${\mathbb {P}}^1_{K^{\text{ nr }}}$ it follows that the hyperelliptic involution on C (which on function fields is the generator of the Galois group of $K^{\text{ nr }}(C)/K^{\text{ nr }}(x)$) extends to an involution $\iota $ on ${\mathcal {C}}_\text{ disc }/{\mathcal {O}}$ and identifies ${\mathcal {Y}}_{\text{ disc }}$ with the quotient ${\mathcal {C}}_{\text{ disc }}/\iota $. In particular, $\iota $ swaps the two points in the fibre over any point of ${\mathcal {Y}}_{\text{ disc }}{\setminus } B$.

5.1 Explicit equations for the model ${\mathcal {C}}_{\text{ disc }}$

We now give explicit charts covering the model ${\mathcal {C}}_{\text{ disc }}$. Recall from Remark 3.9 that ${\mathcal {Y}}_{\text{ disc }}$ is covered by the open subschemes $U_D{\setminus }{\mathcal {P}}_D$ and $W_D{\setminus } ({\mathcal {Q}}_D\cup {\mathcal {P}}_D)$ (Definitions 3.6 and 3.7) as D ranges over all valid discs.

Definition 5.4

Define, for each valid disc D, the schemes

$$\begin{aligned} {\mathcal {U}}_D=\text{ Spec }~ \frac{{\mathcal {O}}[x_D,y_D]}{\left( y_D^2-\pi ^{\omega _D(f)}f_D(x_D)\right) } \end{aligned}$$

and

$$\begin{aligned} {\mathcal {W}}_D={\left\{ \begin{array}{ll}\text{ Spec }~\frac{{\mathcal {O}}[t_D,w_D]}{\left( w_D^2-\pi ^{\omega _D(f)}t_D^{\lambda }g_D(t_D)\right) }~~&{}~~\text{ if } D=D_\text{ max, }\\ \\ \text{ Spec }~\frac{{\mathcal {O}}[s_D,t_D,w_D]}{\left( \pi -s_Dt_D~,~w_D^2-s_D^{\omega _D(f)}t_D^{\omega _{P(D)}(f)}g_D(s_D,t_D)\right) }~~&{}~~\text{ else, }\end{array}\right. } \end{aligned}$$

where $\lambda \in \{0,1\}$ is such that $\lambda \equiv \text{ deg }(f)~~\text{(mod } \text{2) }$ and the polynomials $f_D$ and $g_D$ are as in Definition 3.15.

Denote by $\phi _{D,1}:{\mathcal {U}}_D\rightarrow U_D$ and $\phi _{D,2}:{\mathcal {W}}_D\rightarrow W_D$ the morphisms induced by the obvious inclusions of rings.

Proposition 5.5

The model ${\mathcal {C}}_\text{ disc }$ is covered by the charts ${\mathcal {U}}_D{\setminus } \phi _{D,1}^{-1}({\mathcal {P}}_D)$ and ${\mathcal {W}}_D{\setminus } \phi _{D,2}^{-1}({\mathcal {P}}_D\cup {\mathcal {Q}}_D)$ as D ranges over all valid discs. The restriction of the normalisation morphism to a map ${\mathcal {U}}_D{\setminus } \phi _{D,1}^{-1}({\mathcal {P}}_D)\rightarrow U_D{\setminus }{\mathcal {P}}_D$ (resp. ${\mathcal {W}}_D{\setminus } \phi _{D,2}^{-1}({\mathcal {P}}_D\cup {\mathcal {Q}}_D)\rightarrow W_D{\setminus } ({\mathcal {P}}_D\cup {\mathcal {Q}}_D) $) is given by $\phi _{D,1}$ (resp. $\phi _{D,2}$). Inside $K^{{\text {nr}}}(C)$ we have the relations

$$\begin{aligned} x_D=\frac{x-z_D}{\pi ^{d_D}},\quad t_D=1/x_D,~~~~~ s_Dt_D=\pi , \quad y_D=\pi ^{(\omega _D(f)-\nu _D(f))/2}y \end{aligned}$$

and

$$\begin{aligned} w_D={\left\{ \begin{array}{ll}x_D^{-\lceil \text{ deg }(f)/2\rceil }y_D &{}~~\text{ if } D=D_{\text{ max }},\\ s_D^{(\omega _D(f)-\nu _D(f))/2}t_D^{(\omega _{P(D)}(f)-\nu _{P(D)}(f))/2}y~~&{}~~\text{ else. }\end{array}\right. } \end{aligned}$$

Proof

Since ${\mathcal {Y}}_{\text{ disc }}$ is covered by the open subschemes $U_D{\setminus }{\mathcal {P}}_D$ and $W_D{\setminus } ({\mathcal {Q}}_D\cup {\mathcal {P}}_D)$, and normalisation is local on the base, it suffices to show that for each valid disc D, $\phi _{D,1}:{\mathcal {U}}_D{\setminus } \phi _{D,1}^{-1}({\mathcal {P}}_D)\rightarrow U_D{\setminus } {\mathcal {P}}_D$ realises the normalisation of $U_D{\setminus } {\mathcal {P}}_D$ in $K^{\text{ nr }}(C)$, as well as the analogous result for $\phi _{D,2}$. We prove this for $\phi _{D,1}$, the argument in the other case being identical. Viewing $U_D{\setminus } {\mathcal {P}}_D$ as a subscheme of ${\mathcal {Y}}_{\text{ disc }}$ it follows from Remark 3.16 that the divisor of $\pi ^{\omega _D(f)}f_D(x_D)$ on $U_D{\setminus } {\mathcal {P}}_D$ is equal to $B\cap (U_D{\setminus } {\mathcal {P}}_D)$ and as such is regular. Note also that the ring ${\mathcal {O}}[x_D]$ is regular. Now for any $h\in {\mathcal {O}}[x_D]$ such that the open subscheme $\{h\ne 0\}$ of $U_D$ is contained in $U_D{\setminus } {\mathcal {P}}_D$, it follows that both the ring $A={\mathcal {O}}[x_D]_h$ and the closed subscheme $V(\pi ^{\omega _D(f)}f_D(x_D))$ of $\text{ Spec } A$ are regular. From this it follows easily that the ring $A'=A[y]/(y^2-\pi ^{\omega _D(f)}f_D(x_D))$ is regular (this is the algebraic result underpinning the proof of Theorem 5.2). Moreover, appealing to Remark 3.16 once again we see that $\pi ^{\omega _D(f)}f_D(x_D)$ and f(x) differ by a square in $K^{{\text {nr}}}(x)=\text{ Frac }A$ (indeed, by an even power of $\pi $). Thus $\text{ Frac }(A')=K^{{\text {nr}}}(x)[y]/(y^2-f(x))=K^{{\text {nr}}}(C)$. Since $A'$ is regular, finite over A and $\text{ Frac }(A')=K^{{\text {nr}}}(C)$, it follows that $A'$ is the integral closure of A in $K^{{\text {nr}}}(C)$. Thus the (map on schemes associated to the) inclusion of A into $A'$ realises the normalisation of $\text{ Spec }A$ in $K^{{\text {nr}}}(C)$. It remains only to note that $U_D{\setminus } {\mathcal {P}}_D$ is covered by the schemes $\text{ Spec }~{\mathcal {O}}[x_D]_h$ as we vary h, and that normalisation is local on the base.

The relationship between the various variables follows from combining Remark 3.9 and Remark 3.16. $\square $
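The chart equation for ${\mathcal {U}}_D$ can be checked directly from these relations. The following sketch assumes the normalisation $f(x)=\pi ^{\nu _D(f)}f_D(x_D)$, which is how we read Definition 3.15 and Remark 3.16 (neither is reproduced in this section):

$$\begin{aligned} y_D^2=\pi ^{\omega _D(f)-\nu _D(f)}y^2=\pi ^{\omega _D(f)-\nu _D(f)}f(x)=\pi ^{\omega _D(f)-\nu _D(f)}\cdot \pi ^{\nu _D(f)}f_D(x_D)=\pi ^{\omega _D(f)}f_D(x_D). \end{aligned}$$

The exponent $(\omega _D(f)-\nu _D(f))/2$ appearing in $y_D$ is an integer precisely because $\omega _D(f)\equiv \nu _D(f)~~\text{(mod 2)}$.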

Remark 5.6

The extension of the hyperelliptic involution to ${\mathcal {C}}_{\text{ disc }}$ acts on the charts of Proposition 5.5 as $y_D\mapsto -y_D$ (resp. $w_D\mapsto -w_D$).

Remark 5.7

The proof of Proposition 5.5 shows that locally over a sufficiently small $U=\text{ Spec }A\subseteq {\mathcal {Y}}_{\text{ disc }}$, $\phi ^{-1}(U)\subseteq {\mathcal {C}}_{\text{ disc }}$ is given by $\text{ Spec }A'$ where $A'=A[u]/(u^2-t)$ for t a local equation for B on U. From this we deduce the following. Suppose that D is a valid disc with $\omega _D(f)=1$ (i.e. $\nu _D(f)$ odd). Then $\phi ^{-1}(E_D)$ consists of a single component Z of multiplicity 2 in ${\mathcal {C}}_{\text{ disc },{\bar{k}}}$ and the restriction of $\phi $ to a map $Z_\text{ red }\rightarrow E_D$ is an isomorphism.^{Footnote 6} Indeed, our assumptions mean that $E_D$ is contained entirely in B, so that locally over some U as above, $E_D$ corresponds to a prime ideal ${\mathfrak {p}}$ of A containing t. The fibre of $\phi $ over the generic point of $E_D$ is then $\text{ Spec }~\mathbf{k} ({\mathfrak {p}})[u]/(u^2)$ where $\mathbf{k} ({\mathfrak {p}})$ denotes the residue field at ${\mathfrak {p}}$. Thus there is a unique component over $E_D$ with multiplicity 2 in ${\mathcal {C}}_{\text{ disc },{\bar{k}}}$. Moreover, the unique prime ${\mathfrak {q}}$ of $A'$ lying over ${\mathfrak {p}}$ is generated by ${\mathfrak {p}}$ and u whence the map $A/{\mathfrak {p}}\rightarrow A'/{\mathfrak {q}}$ is an isomorphism. As this is just the map $Z_\text{ red }\cap \phi ^{-1}(U)\rightarrow E_D\cap U$ and such U cover ${\mathcal {Y}}_{\text{ disc }}$, we are done.

The following lemma describes the reduction $\text{ mod } {\mathfrak {m}}$ of the polynomials $f_D$ and $g_{D_\text{ max }}$ appearing in the above charts, and will facilitate the study of the special fibre of ${\mathcal {C}}_{\text{ disc }}$.

Definition 5.8

For a valid disc D define $c_D\in {\bar{k}}^{\times }$ as

$$\begin{aligned} c_D=\frac{c_f}{\pi ^{v(c_f)}}\prod _{r\in {\mathcal {R}}{\setminus } D}\left( \frac{z_D-r}{\pi ^{v(z_D-r)}}\right) \quad (\text{ mod } {\mathfrak {m}}). \end{aligned}$$

For a proper cluster ${{\mathfrak {s}}}$ for which $D({{\mathfrak {s}}})$ is valid we set $c_{{\mathfrak {s}}}=c_{D({{\mathfrak {s}}})}$.

Lemma 5.9

Let D be a valid disc and write ${{\mathfrak {s}}}=D\cap {\mathcal {R}}$.

(i)
We have
$$\begin{aligned} f_D(x_D)\quad (\text{ mod } {\mathfrak {m}}) ={\left\{ \begin{array}{ll} c_{{\mathfrak {s}}}\prod _{{{\mathfrak {s}}}'<{{\mathfrak {s}}}}\left( x_D-\text{ red}_D({{\mathfrak {s}}}')\right) ^{|{{\mathfrak {s}}}'|}~~&{}~~ D=D({{\mathfrak {s}}}) \\ c_D\left( x_D-\text{ red}_D({{\mathfrak {s}}})\right) ^{|{{\mathfrak {s}}}|}~~&{}~~\text{ else. } \end{array}\right. } \end{aligned}$$
(ii)
For $D=D_\text{ max }$ we have
$$\begin{aligned} t_D^\lambda g_D(t_D)\quad (\text{ mod } {\mathfrak {m}}) =t_{D}^{2g+2-|D\cap {\mathcal {R}}|}c_D\prod _{r\in D\cap {\mathcal {R}}}\left( 1-\text{ red}_{D}(r)t_{D}\right) \end{aligned}$$
where, as in Definition 5.4, $\lambda \in \{0,1\}$ is such that $\lambda \equiv \text{ deg }(f)~~\text{(mod } \text{2) }$.

Proof

By the definition of $f_D(x_D)$ and (3.1) we have

$$\begin{aligned} f_D(x_D)\quad (\text{ mod } {\mathfrak {m}}) = \frac{c_f}{\pi ^{v(c_f)}}\prod _{r\in {\mathcal {R}}}\left( \frac{\pi ^{d_D}x_D+z_D-r}{\pi ^{\text{ min }\{d_D,v(z_D-r)\}}}\right) ~~(\text{ mod } {\mathfrak {m}}) . \end{aligned}$$

Noting that $r\in D$ if and only if $\text{ min }\{d_D,v(z_D-r)\}=d_D$ we have

$$\begin{aligned} f_D(x_D)\quad (\text{ mod } {\mathfrak {m}}) =c_D\prod _{r\in D}\left( x_D-\text{ red}_D(r)\right) \end{aligned}$$

from which part (i) follows. For (ii), note that when $D=D_\text{ max }$ we have $t_D^\lambda g_D(t_D)=t_D^{2g+2}f_D(1/t_D)$. Now apply (i). $\square $
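As a sanity check of part (i), consider the following toy example (ours, chosen for illustration; we assume $K={\mathbb {Q}}_p$ with $p\ge 5$ and $\pi =p$, and that both discs below are valid). Take $f(x)=x(x-p^2)(x-1)(x+1)(x-2)$, so that $c_f=1$ and ${\mathcal {R}}=\{0,p^2,1,-1,2\}$ has depth 0 and a single proper child, the twin ${{\mathfrak {t}}}=\{0,p^2\}$ of depth 2. For $D=D({\mathcal {R}})$ with $z_D=0$ and $d_D=0$ we have $x_D=x$, $c_D=1$ and

$$\begin{aligned} f_D(x_D)\quad (\text{ mod } {\mathfrak {m}})=x_D^2(x_D-1)(x_D+1)(x_D-2), \end{aligned}$$

the factor $x_D^2$ coming from ${{\mathfrak {t}}}$. For the disc D with $z_D=0$ and $d_D=1$ (so $D\cap {\mathcal {R}}={{\mathfrak {t}}}$ but $D\ne D({{\mathfrak {t}}})$) we have $x_D=x/p$ and

$$\begin{aligned} f(x)=p^2\cdot x_D(x_D-p)(px_D-1)(px_D+1)(px_D-2),\qquad f_D(x_D)\equiv (-1)(1)(-2)\,x_D^2=2x_D^2\quad (\text{ mod } {\mathfrak {m}}), \end{aligned}$$

in agreement with $c_D=(0-1)(0+1)(0-2)=2$ from Definition 5.8 and $\text{ red}_D({{\mathfrak {t}}})=0$.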

Remark 5.10

In (i), two factors $(x_D-\text{ red}_D({{\mathfrak {s}}}'))$ and $(x_D-\text{ red}_D({{\mathfrak {s}}}''))$ are distinct for distinct ${{\mathfrak {s}}}',{{\mathfrak {s}}}''<{{\mathfrak {s}}}$.

5.2 Types of valid disc

We now describe the special fibre of ${\mathcal {C}}_{\text{ disc }}$. To do this we will frequently break into cases, which we set out here. We remark that if all roots of f(x) lie in $K^{\text{ nr }}$ then cases II, III and IV do not occur as then all proper clusters have integer depth.

Definition 5.11

Let D be a valid disc. We define the type of D as follows.

(I.1)
$D=D({{\mathfrak {s}}})$ for a non-übereven cluster ${{\mathfrak {s}}}$ with $\nu _{{\mathfrak {s}}}$ even, and D is not of type II.1,
(I.2)
$D=D({{\mathfrak {s}}})$ for an übereven cluster ${{\mathfrak {s}}}$ and D is not of type II.2,
(II.1)
$D=D({{\mathfrak {s}}})$ where ${{\mathfrak {s}}}<{\mathcal {R}}$ is such that $\delta _{{\mathfrak {s}}}=1/2$, $|{{\mathfrak {s}}}|=2g$, and ${{\mathfrak {s}}}$ is not übereven,
(II.2)
$D=D({{\mathfrak {s}}})$ where ${{\mathfrak {s}}}<{\mathcal {R}}$ is such that $\delta _{{\mathfrak {s}}}=1/2$, $|{{\mathfrak {s}}}|=2g$, and ${{\mathfrak {s}}}$ is übereven,
(III)
$D=D_\text{ max }$ and there is a cluster ${{\mathfrak {s}}}<{\mathcal {R}}$ with $|{{\mathfrak {s}}}|=2g$ and $\delta _{{{\mathfrak {s}}}}=n+1/2$ for $n\ge 1$ an integer,
(IV)
$D\cap {\mathcal {R}}={{\mathfrak {t}}}$ for ${{\mathfrak {t}}}$ a twin with $d_{{\mathfrak {t}}}=d_D+1/2$,
(V)
$\nu _D(f)$ is odd (i.e. $\omega _D(f)=1$, equivalently either $D=D_\text{ max }$ and ${\mathcal {R}}={{\mathfrak {s}}}_1\sqcup {{\mathfrak {s}}}_2$ is a union of two odd proper children with $d_{{{\mathfrak {s}}}_1}-d_{\mathcal {R}}$ odd, or both ${\mathcal {R}}\cap D$ and $d_D-d_{{\mathcal {R}}\cap D}$ are odd, cf. Lemma 4.10),
(VI.1)
D does not fall into cases I-V and $|D\cap {\mathcal {R}}|$ is odd,
(VI.2)
D does not fall into cases I-V and $|D\cap {\mathcal {R}}|$ is even.

5.3 The special fibre of ${\mathcal {C}}_\text{ disc }$

In the following proposition we describe, for each component $E_D$ of the special fibre of ${\mathcal {Y}}_{\text{ disc }}$, the component(s) of the special fibre of ${\mathcal {C}}_{\text{ disc }}$ which lie over this. Later in Theorem 5.18 we explain how these components fit together, drawing on the description of the dual graph of ${\mathcal {Y}}_{\text{ disc },\bar{k}}$ afforded by Lemma 4.19.

In the statement below, for a proper cluster ${{\mathfrak {s}}}$ we write $g({{\mathfrak {s}}})$ for the genus of ${{\mathfrak {s}}}$. Thus $g({{\mathfrak {s}}})$ is 0 if ${{\mathfrak {s}}}$ is übereven and is otherwise determined by $|{\tilde{{{\mathfrak {s}}}}}| = 2g({{\mathfrak {s}}}) +1$ or $2g({{\mathfrak {s}}})+2$, where ${\tilde{{{\mathfrak {s}}}}}$ denotes the set of odd children of ${{\mathfrak {s}}}$.
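Equivalently, for a cluster ${{\mathfrak {s}}}$ that is not übereven one has $g({{\mathfrak {s}}})=\lfloor (|{\tilde{{{\mathfrak {s}}}}}|-1)/2\rfloor $. For illustration:

$$\begin{aligned} |{\tilde{{{\mathfrak {s}}}}}|\in \{1,2\}\;\Rightarrow \;g({{\mathfrak {s}}})=0,\qquad |{\tilde{{{\mathfrak {s}}}}}|\in \{3,4\}\;\Rightarrow \;g({{\mathfrak {s}}})=1,\qquad |{\tilde{{{\mathfrak {s}}}}}|\in \{5,6\}\;\Rightarrow \;g({{\mathfrak {s}}})=2. \end{aligned}$$

For example, a (hypothetical) cluster whose children are one twin and three singletons has three odd children and hence genus 1, while a twin, whose children are two singletons, has genus 0.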

Proposition 5.12

Let D be a valid disc, $E_D$ the associated component of the special fibre of ${\mathcal {Y}}_{\text{ disc }}$ and $\phi :{\mathcal {C}}_\text{ disc }\rightarrow {\mathcal {Y}}_{\text{ disc }}$ the normalisation morphism. Then $\phi ^{-1}(E_D)$ consists^{Footnote 7}, according to the type of D, of:

(I.1,II.1)
one component of multiplicity 1, with geometric genus $g({{\mathfrak {s}}})$, and one node for each twin ${{\mathfrak {t}}}<{{\mathfrak {s}}}$ with $d_{{\mathfrak {t}}}=d_{{\mathfrak {s}}}+1/2$, plus one additional node if D has type II.1. The normalisation of this component is the hyperelliptic curve
$$\begin{aligned} \widetilde{\Gamma }_{\mathfrak {s}}:y^2=c_{\mathfrak {s}}\prod _{\text{ odd } {{\mathfrak {s}}}'<{{\mathfrak {s}}}}(x-\text{ red}_{\mathfrak {s}}({{\mathfrak {s}}}')). \end{aligned}$$
(I.2,II.2)
two components of multiplicity 1, each isomorphic to $\mathbb {P}^1_{{\bar{k}}}$ and intersecting transversally at one point for each twin ${{\mathfrak {t}}}<{{\mathfrak {s}}}$ with $d_{{\mathfrak {t}}}=d_{{\mathfrak {s}}}+1/2$, and at one additional point if D has type II.2.
(III,IV)
two components of multiplicity 1, isomorphic to $\mathbb {P}^1_{{\bar{k}}}$ and intersecting transversally at 1 point.
(V)
a single component isomorphic to $\mathbb {P}^1_{{\bar{k}}}$, with multiplicity 2 and self intersection $-1$.
(VI.1)
one component of multiplicity 1, isomorphic to $\mathbb {P}^1_{{\bar{k}}}$,
(VI.2)
two disjoint components of multiplicity 1, each isomorphic to $\mathbb {P}^1_{{\bar{k}}}$.

Moreover, all intersections between components of ${\mathcal {C}}_{\text{ disc },{\bar{k}}}$ are transversal.
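The proposition can be tested on a small example. The following is a sketch of our own, under stated assumptions: $K={\mathbb {Q}}_p$ with $p\ge 5$, $f(x)=x(x-p^2)(x-1)(x+1)(x-2)$ (so $g=2$), and the valid discs are taken to be exactly the three discs centred at 0 of depths 0, 1 and 2. The disc of depth 0 is $D({\mathcal {R}})$, of type I.1 with three odd children, giving one component of genus 1. The disc of depth 1 cuts out the twin $\{0,p^2\}$ without being its defining disc and has $\nu _D(f)=2$, so it is of type VI.2 and gives two disjoint copies of $\mathbb {P}^1_{{\bar{k}}}$. The disc of depth 2 is the defining disc of the twin, of type I.1 with genus 0, giving one copy of $\mathbb {P}^1_{{\bar{k}}}$. Neither intersection point of adjacent components $E_D$ lies on B, so $\phi $ is étale over each and there are two points above it. The dual graph of ${\mathcal {C}}_{\text{ disc },{\bar{k}}}$ is then a cycle with 4 vertices and 4 edges, and

$$\begin{aligned} \underbrace{1+0+0+0}_{\text{geometric genera}}+\underbrace{(4-4+1)}_{\text{first Betti number of the dual graph}}=2=g, \end{aligned}$$

as expected for a model of a genus 2 curve with reduced nodal special fibre.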

Remark 5.13

When all roots of f(x) lie in $K^{{\text {nr}}}$ (e.g. if we are willing to construct the model only over a suitable extension of the base field) all components are individually smooth, no two components lying over the same component of ${\mathcal {Y}}_{\text{ disc }}$ intersect, and, as above, types II, III and IV never arise.

The proof of Proposition 5.12 is given after Lemmas 5.14 and 5.15.

Lemma 5.14

Let D be a valid disc, $E_D$ the associated component of the special fibre of ${\mathcal {Y}}_{\text{ disc },{\bar{k}}}$ and $\phi :{\mathcal {C}}_\text{ disc }\rightarrow {\mathcal {Y}}_{\text{ disc }}$ the normalisation morphism. Then

(i)
If $D=D({{\mathfrak {s}}})$ for a proper cluster ${{\mathfrak {s}}}$ with $\nu _{{\mathfrak {s}}}$ even then $\phi ^{-1}(E_D)$ consists of two multiplicity 1 components if ${{\mathfrak {s}}}$ is übereven (types I.2 and II.2), and one otherwise (types I.1 and II.1). The (geometric) genus of the component(s) is $g({{\mathfrak {s}}})$.
(ii)
If $\omega _D(f)=0$ and D is not of the form $D({{\mathfrak {s}}})$ for a proper cluster ${{\mathfrak {s}}}$ then $\phi ^{-1}(E_D)$ consists of two multiplicity 1 components of genus 0 if $|D\cap {\mathcal {R}}|$ is even (types III, IV and VI.2), and one otherwise (type VI.1).
(iii)
If D has type V then $\phi ^{-1}(E_D)$ consists of a single multiplicity 2 component, which is an exceptional curve^{Footnote 8}, and meets all other components transversally.

Proof

(i) and (ii). In what follows, let us temporarily denote the reduction of $f_D(x_D)$ $\text{ mod } {\mathfrak {m}}$ as $\overline{f_D(x_D)}$. Let $\eta _D$ be the generic point of $E_D$. From Proposition 5.5 we see that the fibre over $\eta _D$ is

$$\begin{aligned} \text{ Spec }~{\bar{k}}(x_D)[y_D]/\left( y_D^2-\overline{f_D(x_D)}\right) \end{aligned}$$

and as $\overline{f_D(x_D)}$ is non-zero the associated component(s) has multiplicity 1. That the number of components is as claimed follows since, by Lemma 5.9, if D is not the defining disc of ${{\mathfrak {s}}}=D\cap {\mathcal {R}}$ then $\overline{f_D(x_D)}$ is a square in ${\bar{k}}(x_D)$ if and only if $|{{\mathfrak {s}}}|$ is even, whilst if $D=D({{\mathfrak {s}}})$ then $\overline{f_D(x_D)}$ is a square if and only if ${{\mathfrak {s}}}$ is übereven. To determine the geometric genus of the components note that when $\overline{f_D(x_D)}$ is a square the fibre over the generic point $\eta _D$ is isomorphic to two copies of $\text{ Spec }~{\bar{k}}(x)$, from which it follows that both components have genus 0. Now note that, for $\overline{f_D(x_D)}$ non-square, the genus of the function field ${\bar{k}}(x_D)[y_D]/\left( y_D^2-\overline{f_D(x_D)}\right) $ is determined by the number n of distinct roots of $\overline{f_D(x_D)}$ of odd multiplicity, being $\lfloor (n-1)/2\rfloor $, and we conclude by Lemma 5.9.

(iii). By Remark 5.7 we see that if $\omega _D(f)=1$ then there is a unique component over $E_D$, $\Gamma _D$ say, which has multiplicity 2 and is isomorphic to $\mathbb {P}^1_{{\bar{k}}}$. It remains to show that $\Gamma _D$ has self intersection $-1$. Combining parts (i) and (ii) with Lemma 4.10 we see that $E_D$ meets precisely two components of the special fibre of ${\mathcal {Y}}_{\text{ disc }}$, $E_{D_1}$ and $E_{D_2}$ say, and there is one multiplicity 1 component of the special fibre of ${\mathcal {C}}_{\text{ disc }}$ lying over each of these, $\Gamma _{D_1}$ and $\Gamma _{D_2}$ say. In particular, as divisors we have $\phi ^*(E_{D_1})=\Gamma _{D_1}$, $\phi ^*(E_{D_2})=\Gamma _{D_2}$ and $\phi _{*}(\Gamma _D)=E_D$. By the projection formula [26, Theorem 9.2.12] we have $\Gamma _D\cdot \Gamma _{D_1}=E_D\cdot E_{D_1}=1$ and $\Gamma _D\cdot \Gamma _{D_2}=1$ similarly. Thus $\Gamma _D$ intersects $\Gamma _{D_1}$ and $\Gamma _{D_2}$ transversally. Finally as the intersection of $\Gamma _D$ with the whole special fibre is 0 it follows that $\Gamma _D^2=-1$. $\square $
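The last step can be spelled out as follows. Write the special fibre of ${\mathcal {C}}_{\text{ disc }}$ as the divisor $2\Gamma _D+\Gamma _{D_1}+\Gamma _{D_2}+Z$, where Z is supported on components which do not meet $\Gamma _D$. Since a vertical divisor has intersection number 0 with the whole special fibre,

$$\begin{aligned} 0=\Gamma _D\cdot \left( 2\Gamma _D+\Gamma _{D_1}+\Gamma _{D_2}+Z\right) =2\Gamma _D^2+1+1+0, \end{aligned}$$

whence $\Gamma _D^2=-1$.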

Lemma 5.15

Let D be a valid disc not of type V. Then $\phi ^{-1}(E_D)$ has precisely one singular point for each twin ${{\mathfrak {t}}}$ with $d_{{\mathfrak {t}}}=d_D+1/2$, unless $D=D_\text{ max }$, when it has an additional singular point (lying over the point at infinity on $E_{D_\text{ max }}$) if ${\mathcal {R}}$ is exceptional of type A (i.e. if $D_\text{ max }$ has type II or III). Each singular point of $\phi ^{-1}(E_D)$ is an ordinary double point.

Proof

Let P be a closed point of $\phi ^{-1}(E_D)$, let $Q=\phi (P)$ and suppose that $Q \in B\cap E_D$ (if not $\phi $ is étale at P whence, since $E_D$ is smooth, P is a smooth point of $\phi ^{-1}(E_D)$). If $Q\in B_{\text{ ver }}$ then as $\omega _D(f)=0$, P is a point of intersection between $\phi ^{-1}(E_D)$ and a component lying over some $E_{D'}$ with $\omega _{D'}(f)=1$. By Lemma 5.14 such intersections are transverse whence (see e.g. [26, Lemma 10.3.1]) they are smooth points of the individual components. Next, suppose $Q\in B_{\text{ hor }}\cap E_D$ and is not the point at infinity on $E_{D_\text{ max }}$. Then by Proposition 4.16, Q corresponds to a point $x_D=\text{ red}_{D}({{\mathfrak {s}}})$ where ${{\mathfrak {s}}}$ is a child of $D\cap {\mathcal {R}}$ and is either a singleton, or a twin with $d_{{\mathfrak {s}}}=d_D+1/2$. Now by Proposition 5.5 and again writing $\overline{f_D(x_D)}$ for the reduction of $f_D(x_D)$ $\text{ mod } {\mathfrak {m}}$, locally around P, $\phi ^{-1}(E_D)$ is given by

$$\begin{aligned}\text{ Spec }~{\bar{k}}[x_D,y_D]/(y_D^2-\overline{f_D(x_D)})\end{aligned}$$

with P the point $(\text{ red}_{D}({{\mathfrak {s}}}),0)$. By Lemma 5.9, $\text{ red}_D({{\mathfrak {s}}})$ is a root of $\overline{f_D(x_D)}$ of multiplicity 1 if ${{\mathfrak {s}}}$ is a singleton, and 2 if ${{\mathfrak {s}}}$ is a twin. One checks (see e.g. [26, Example 10.3.4]) that P is a smooth point in the first instance, and an ordinary double point in the second. The case $D=D_\text{ max }$ and Q the point at infinity on $E_{D_\text{ max }}$ is similar. Our assumption that $Q\in B_{\text{ hor }}$ forces $\text{ deg }(f)=2g+1$ or ${\mathcal {R}}$ exceptional. Arguing as above and using Lemma 5.9 (ii) one sees that P is smooth if $\text{ deg }(f)=2g+1$ or ${\mathcal {R}}$ is exceptional of type B, and is an ordinary double point if ${\mathcal {R}}$ is exceptional of type A. $\square $
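The local computation behind the dichotomy between simple and double roots is standard. We sketch it here, using only that ${\bar{k}}$ is algebraically closed of characteristic different from 2. After a translation assume the root in question is $x_D=0$, and write $\overline{f_D(x_D)}=x_D^m u(x_D)$ with $u(0)\ne 0$ and $m\in \{1,2\}$. If $m=1$ then $x_D=y_D^2/u$ near P, so $y_D$ is a local parameter and P is smooth. If $m=2$ then u has a square root $v\in {\bar{k}}[[x_D]]$ with $v(0)\ne 0$, and

$$\begin{aligned} y_D^2-x_D^2u=(y_D-x_Dv)(y_D+x_Dv). \end{aligned}$$

Setting $a=y_D-x_Dv$ and $b=y_D+x_Dv$, which generate the maximal ideal of ${\bar{k}}[[x_D,y_D]]$ since 2 and v are units, the completed local ring at P is ${\bar{k}}[[a,b]]/(ab)$, i.e. P is an ordinary double point.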

Proof of Proposition 5.12

(I.1, II.1): By Lemma 5.14, $\phi ^{-1}(E_D)$ consists of a unique component of geometric genus $g({{\mathfrak {s}}})$ whose function field is

$$\begin{aligned}{\bar{k}}(x_D)[y_D]/(y_D^2-\overline{f_D(x_D)}).\end{aligned}$$

The equation for $\widetilde{\Gamma }_{\mathfrak {s}}$ follows from Lemma 5.9. That the nodes of $\Gamma _{{\mathfrak {s}}}$ are as claimed is Lemma 5.15. (I.2, II.2): By Lemma 5.14, $\phi ^{-1}(E_D)$ consists of two components each of geometric genus 0 and by Lemma 5.15 the singular points of $\phi ^{-1}(E_D)$ are all ordinary double points and are in bijection with the twins ${{\mathfrak {t}}}<{{\mathfrak {s}}}$ with $d_{{\mathfrak {t}}}=d_{{\mathfrak {s}}}+1/2$, plus one additional ordinary double point for type II.2. For each such, P say, we have $\phi (P)\in B$ whence P is the unique point lying over $\phi (P)$. In particular, it lies on both components of $\phi ^{-1}(E_D)$. It now follows from [26, Lemma 10.3.11] that P is a smooth point of each individual component, and that these components intersect transversally at P. In particular, each component is smooth hence isomorphic to $\mathbb {P}^1_{{\bar{k}}}$. (III, IV): Follows from Lemma 5.14 and Lemma 5.15 similarly to cases I.2 and II.2. (V): Lemma 5.14 (iii). (VI): Combine Lemmas 5.14 and 5.15 (the latter shows $\phi ^{-1}(E_D)$ is smooth).

Finally we show that all intersections are transverse. Let $P\in {\mathcal {C}}_{\text{ disc },{\bar{k}}}$ be a point lying on distinct components $Z_1$ and $Z_2$. If $\phi (Z_1)=\phi (Z_2)$ then both $Z_1$ and $Z_2$ lie in $\phi ^{-1}(E_D)$ for some valid disc D of type I.2, II.2, III or IV, and the intersection is transverse as above. Similarly, if one of $Z_1$ or $Z_2$ has multiplicity 2 we are done by Lemma 5.14 (iii). Otherwise $Q=\phi (P)$ is disjoint from B whence $\phi $ is étale at P and, moreover, Q is a point of transverse intersection between the distinct components $\phi (Z_1)$ and $\phi (Z_2)$ of ${\mathcal {Y}}_{\text{ disc }}$. Thus the intersection is transverse in this case also. $\square $

5.4 The minimal regular model of $C/K^{{\text {nr}}}$

Having constructed a regular model of $C/K^{\text{ nr }}$ and described the components of its special fibre, it is a simple matter to describe the minimal regular model ${\mathcal {C}}_{\text{ min }}/{\mathcal {O}}$ of $C/K^{\text{ nr }}$, which we do now. In particular, we use the explicit description we obtain to show that ${\mathcal {C}}_{\text{ min }}$ is semistable. This proves the ‘semistability criterion implies semistability’ part of Theorem 1.9 (1) (see also Theorem 7.1).

Theorem 5.16

Let C/K be a hyperelliptic curve satisfying the semistability criterion. Then the model of C obtained from ${\mathcal {C}}_{\text{ disc }}$ by contracting each of the components corresponding to valid discs of type V is semistable, and is the minimal regular model of $C/K^{{\text {nr}}}$.

Proof

Note first that by Proposition 5.12 (iv) the components we contract are disjoint, and are all exceptional curves. Thus we may contract them and the resulting model, which we temporarily denote ${\mathcal {C}}/{\mathcal {O}}$, is again proper and regular. Moreover, again by Proposition 5.12, the components contracted are precisely those with multiplicity greater than one. Fix one such, $\Gamma _D$ say. Then as in the proof of Lemma 5.14(iii) it intersects precisely two components of ${\mathcal {C}}_{\text{ disc }, {\bar{k}}}$, say $Z_1$ and $Z_2$ (which do not themselves meet), both transversally. By [26, Lemma 3.35], after contracting $\Gamma _D$ to a point the components $Z_1$ and $Z_2$ intersect transversally there. Thus the special fibre of ${\mathcal {C}}$ is reduced with normal crossings (away from the contracted components the same is true for the special fibre of ${\mathcal {C}}_{\text{ disc }}$ by Proposition 5.12, and the contraction map is an isomorphism here). Thus ${\mathcal {C}}$ is semistable.

Since ${\mathcal {C}}$ is a proper regular model of C, to show that ${\mathcal {C}}$ is the minimal such we must show there are no exceptional curves in its special fibre. Note that such components appear in the dual graph of ${\mathcal {C}}_{\bar{k}}$ as genus 0 vertices of degree 1. Theorem 5.18 below shows that the dual graph of ${\mathcal {C}}$ is (after removing vertices of degree 2 and genus 0 from the vertex set) a hyperelliptic graph in the sense of [18, Definition 3.2] and in particular contains no genus 0 vertices of degree 1 as desired (the statement of Theorem 5.18 refers to the dual graph of ${\mathcal {C}}_{\text{ min },\bar{k}}$ however the proof in fact uses the description of ${\mathcal {C}}_{\text{ min },\bar{k}}$ as the model ${\mathcal {C}}$ considered here, and does not assume minimality in the proof). Alternatively one may proceed via Proposition 5.12 and the description of the dual graph of ${\mathcal {Y}}_{\text{ disc },\bar{k}}$ afforded by Lemma 4.19. $\square $
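The identification of exceptional curves with genus 0 vertices of degree 1 used above rests on a standard intersection count, sketched here for convenience. The symbols $Z$, $Z'$ (components of the special fibre) and $F$ (the special fibre viewed as a divisor) are notation introduced only for this illustration, and the sketch assumes, as shown in the first part of the proof, that the special fibre of ${\mathcal {C}}$ is reduced with normal crossings.

```latex
$$\begin{aligned}
F &= \sum_{Z'} Z' \quad \text{(the special fibre is reduced)}, \qquad Z\cdot F = 0,\\
Z^2 &= -\sum_{Z'\ne Z} Z\cdot Z'
     = -\#\{\text{points at which } Z \text{ meets the other components}\},\\
Z \text{ exceptional}
  &\iff Z\cong \mathbb{P}^1_{\bar{k}} \text{ and } Z^2=-1
  \iff Z \text{ is a genus } 0 \text{ vertex of degree } 1 .
\end{aligned}$$
```

The second line uses that distinct components meet transversally, so each intersection point contributes exactly 1. The same count shows that a component isomorphic to $\mathbb {P}^1_{{\bar{k}}}$ has self-intersection $-2$ exactly when it meets the other components in two points.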

5.5 The dual graph of the special fibre of the minimal regular model

Having shown that ${\mathcal {C}}_{\text{ min }}$ is semistable, we may talk about the dual graph of its special fibre, which we now describe.

Notation 5.17

Let C/K be a hyperelliptic curve satisfying the semistability criterion. We denote by $\Upsilon _C$ the dual graph of the special fibre of its minimal regular model.

Recall from Sect. 2.1.1 that $\Upsilon _C$ has vertex set the set of irreducible components of ${\mathcal {C}}_{\text{ min },\bar{k}}$, edge set the set of ordinary double points of ${\mathcal {C}}_{\text{ min },\bar{k}}$, and, for an ordinary double point P, the edge-endpoints at P are the points lying above P under the normalisation morphism

$$\begin{aligned} n:\widetilde{{\mathcal {C}}_{\text{ min },\bar{k}}}\rightarrow {\mathcal {C}}_{\text{ min },\bar{k}} \end{aligned}$$

(we refer to these points as the tangents at P). The graph $\Upsilon _C$ carries a natural genus marking, with a vertex being given the genus of the component to which it corresponds. Finally, by uniqueness of the minimal regular model, the hyperelliptic involution on C extends (necessarily uniquely) to an involution $\iota $ on ${\mathcal {C}}_{\text{ min }}$ which then acts on $\Upsilon _C$ via its action on components, ordinary double points and tangents.

As we did in Sect. 4.6 for ${\mathcal {Y}}_{\text{ disc }}$, we will describe the graph $\Upsilon _C$ using the combinatorial framework developed in [18] and summarised in Appendix D. In particular, as explained in Example D.2, to C (or rather to f(x)), one may associate a metric cluster picture $\Sigma $, which is a purely combinatorial object. Then, via Definitions D.6 and D.9, one can associate to $\Sigma $ a BY tree $T_\Sigma $, and a metric hyperelliptic graph $G_\Sigma $, where the latter is a metric graph coming equipped with a genus marking and involution, the quotient by which is canonically the result of halving the length of all edges in $T_\Sigma $. This graph is (by design) very closely related to $\Upsilon _C$, as the following result shows.

Theorem 5.18

Let C/K be a hyperelliptic curve satisfying the semistability criterion. Denote by $\widehat{\Upsilon _C}$ the graph obtained from $\Upsilon _C$ by removing from the vertex set all vertices of genus 0 and degree 2, and let $\Sigma $ denote the metric cluster picture associated to C. Then there is a genus preserving isomorphism of metric graphs $\widehat{\Upsilon _C}\cong G_\Sigma $ identifying the hyperelliptic involutions and inducing a canonical identification of the quotient $\widehat{\Upsilon _C}/\iota $ with the graph obtained from $T_\Sigma $ by halving the length of all edges.

In particular $\widehat{\Upsilon _C}$, along with its genus marking and hyperelliptic involution, is a hyperelliptic graph in the sense of [18, Definition 3.2].

Proof

The basic idea is that $G_\Sigma $ is by definition a certain ramified double cover of $T_\Sigma $ (up to slightly adjusting the metric), whilst our explicit construction of ${\mathcal {C}}_{\text{ min }}$ shows that $\Upsilon _C$ is essentially a ramified double cover of the dual graph of ${\mathcal {Y}}_{\text{ disc }}$. The latter is related to $T_\Sigma $ via Lemma 4.19. We now flesh out the details. In what follows, we will need to understand the action of the hyperelliptic involution on ${\mathcal {C}}_{\text{ min }}$ and do this by noting that where ${\mathcal {C}}_{\text{ disc }}$ and ${\mathcal {C}}_{\text{ min }}$ are isomorphic the two extensions of the hyperelliptic involution must agree, since they do so on the generic fibre.

We first describe the dual graph of ${\mathcal {C}}_{\text{ disc },\bar{k}}$, which we temporarily denote $\Omega _C$.^{Footnote 9} By Remark 5.3, whenever the fibre over a point of ${\mathcal {Y}}_{\text{ disc }}$ contains two points the hyperelliptic involution swaps these. In particular, applying this to the generic point of a component $E_D$ of ${\mathcal {Y}}_{\text{ disc },\bar{k}}$, if $\phi ^{-1}(E_D)$ consists of two components then $\iota $ swaps these. Similarly, if there are two points lying over an intersection point of components $E_D$ and $E_{D'}$ then $\iota $ swaps these also.

Now consider the (topological) quotient graph $\Omega _C/\iota $. Its vertices arise in two ways: as $\iota $-orbits of vertices of $\Omega _C$ (this gives one vertex of $\Omega _C/\iota $ for each valid disc D, corresponding to $\phi ^{-1}(E_D)$), or as the midpoint of an edge whose endpoints are swapped by $\iota $ (we call such an edge ‘$\iota $-anti-invariant’). Similarly, the edges of $\Omega _C/\iota $ are $\iota $-orbits of edges of $\Omega _C$, but in the case of an $\iota $-anti-invariant edge e, the resulting edge of $\Omega _C/\iota $ has length half that of e.

As $\iota $ swaps components if and only if they lie over the same component of ${\mathcal {Y}}_{\text{ disc }}$, an edge of $\Omega _C$ can only be $\iota $-anti-invariant if it corresponds to an ordinary double point on $\phi ^{-1}(E_D)$ for some valid disc D (as opposed to an ordinary double point arising as the intersection between components whose images in ${\mathcal {Y}}_\text{ disc }$ are distinct). Moreover, each such edge is in fact $\iota $-anti-invariant. Indeed, for edges corresponding to intersection points between two components of some $\phi ^{-1}(E_D)$ this is clear, whilst for those corresponding to a node lying on a single component, say a node P, $\iota $ must swap the tangents at P else by [26, Proposition 3.48 (b)] $\phi (P)$ would not be a smooth point of ${\mathcal {Y}}_{\text{ disc }}$.

From the above discussion we see that $\Omega _C/\iota $ is the graph obtained from the dual graph of ${\mathcal {Y}}_{\text{ disc }}$ by adding a vertex for each ordinary double point of ${\mathcal {C}}_{\text{ disc },\bar{k}}$ lying over a single component $E_D$, joined to the vertex corresponding to $E_D$ by an edge of length 1/2. It now follows from Lemma 5.15 and Lemma 4.19 that, defining $\widehat{\Omega _C}$ as for $\widehat{\Upsilon _C}$, the quotient $\widehat{\Omega _C}/\iota $ is canonically the result of halving all yellow edges in $T_\Sigma $. Finally (cf. Lemma 4.10), one obtains $\widehat{\Upsilon _C}/\iota $ from $\widehat{\Omega _C}/\iota $ by halving the length of those edges corresponding to blue edges of $T_\Sigma $ to account for contracted components.

To conclude, we now note that by Proposition 5.12 the map $\widehat{\Upsilon _C}\rightarrow \widehat{\Upsilon _C}/\iota $ ramifies precisely over the part of $\widehat{\Upsilon _C}/\iota $ which corresponds to the blue part of $T_\Sigma $, so that the canonical isomorphism $\widehat{\Upsilon _C}/\iota \cong G_\Sigma /\iota $ above lifts to a (in general noncanonical) isomorphism $\widehat{\Upsilon _C}\cong G_\Sigma $ identifying the hyperelliptic involutions. $\square $

5.6 Explicit equations for the components of the special fibre of the minimal regular model

By Theorem 5.18 the components of the special fibre of ${\mathcal {C}}_{\text{ min }}$ are indexed by valid discs D not of type V (though a single disc may yield two components rather than one). Here we record explicit equations for these components.

Definition 5.19

For a valid disc D not of type $\mathrm{V} $ we define $\Gamma _D$ to be the image in ${\mathcal {C}}_{\text{ min }}$ (under the contraction morphism) of the closed subscheme $\phi ^{-1}(E_D)$ of ${\mathcal {C}}_{\text{ disc }}$. Note that $\Gamma _D$ consists of either one or two components of ${\mathcal {C}}_{\text{ min },{\bar{k}}}$, and that these account for all components.

Proposition 5.20

Let D be a valid disc not of type V. Then the associated component(s) $\Gamma _D$ of ${\mathcal {C}}_{\text{ min },{\bar{k}}}$ is given, according to the type of D, as follows^{Footnote 10}:

(I)
$$\begin{aligned} Y^2 = c_{{\mathfrak {s}}}\prod _{{\text {odd }}{{{\mathfrak {o}}}< {{\mathfrak {s}}}}}(X-{\text {red}}_{{\mathfrak {s}}}({\mathfrak {o}})) \prod _{\begin{array}{c} \text {twin }{{\mathfrak {t}}}<{{\mathfrak {s}}}\\ \delta _{{\mathfrak {t}}}=1/2 \end{array}}(X-{\text {red}}_{{\mathfrak {s}}}({\mathfrak {t}}))^2. \end{aligned}$$
(II)
The glueing of the affine curves
$$\begin{aligned}Y^2=c_{{\mathfrak {s}}}\prod _{{\text {odd }}{{{\mathfrak {o}}}< {{\mathfrak {s}}}}}(X-{\text {red}}_{{\mathfrak {s}}}({\mathfrak {o}})) \prod _{\begin{array}{c} \text {twin }{{\mathfrak {t}}}<{{\mathfrak {s}}}\\ \delta _{{\mathfrak {t}}}=1/2 \end{array}}(X-{\text {red}}_{{\mathfrak {s}}}({\mathfrak {t}}))^2\quad \quad (\dagger ) \end{aligned}$$
and
$$\begin{aligned} Z^2=c_{{\mathfrak {s}}}T^{2}\prod _{{\text {odd }}{{{\mathfrak {o}}}< {{\mathfrak {s}}}}}(1-{\text {red}}_{{\mathfrak {s}}}({\mathfrak {o}})T) \prod _{\begin{array}{c} \text {twin }{{\mathfrak {t}}}<{{\mathfrak {s}}}\\ \delta _{{\mathfrak {t}}}=1/2 \end{array}}(1-{\text {red}}_{{\mathfrak {s}}}({\mathfrak {t}})T)^2 \end{aligned}$$
over the subsets $X\ne 0$ and $T\ne 0$ via $X=1/T$, $Z=YT^{n+1}$, where 2n is the degree of the right hand side of $(\dagger )$.
(III)
The glueing of the affine curves
$$\begin{aligned} Y^2=c_D\quad \subseteq \quad {\mathbb {A}}^2_{X,Y} \end{aligned}$$
and
$$\begin{aligned} Z^2=c_DT^2 \end{aligned}$$
over the subsets $X\ne 0$ and $T\ne 0$ via $X=1/T$, $Z=YT$.
(IV)
$$\begin{aligned} Y^2 = c_D(X-{\text {red}}_D({\mathfrak {t}}))^2. \end{aligned}$$
(VI.1)
$$\begin{aligned} Y^2=c_D(X-\text{ red}_D({{\mathfrak {s}}})), \end{aligned}$$
where ${{\mathfrak {s}}}=D\cap {\mathcal {R}}$.
(VI.2)
$$\begin{aligned} Y^2=c_D\quad \subseteq \quad {\mathbb {A}}^2_{X,Y}. \end{aligned}$$
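The glueing in case (II) can be checked directly. The sketch below assumes, as in case (III), that the two charts are identified via $X=1/T$, and uses the shorthand $a_{{\mathfrak {o}}}={\text {red}}_{{\mathfrak {s}}}({\mathfrak {o}})$, $b_{{\mathfrak {t}}}={\text {red}}_{{\mathfrak {s}}}({\mathfrak {t}})$ and $R(X)$ for the right hand side of $(\dagger )$; these three names are introduced only for this illustration.

```latex
$$\begin{aligned}
R(X) &= c_{\mathfrak{s}} \prod_{\mathfrak{o}} (X-a_{\mathfrak{o}}) \prod_{\mathfrak{t}} (X-b_{\mathfrak{t}})^2,
  \qquad \deg R = 2n,\\
R(1/T) &= T^{-2n}\, c_{\mathfrak{s}} \prod_{\mathfrak{o}} (1-a_{\mathfrak{o}}T) \prod_{\mathfrak{t}} (1-b_{\mathfrak{t}}T)^2,\\
Z^2 &= Y^2\,T^{2n+2} = T^{2n+2} R(1/T)
     = c_{\mathfrak{s}}\, T^{2} \prod_{\mathfrak{o}} (1-a_{\mathfrak{o}}T) \prod_{\mathfrak{t}} (1-b_{\mathfrak{t}}T)^2 .
\end{aligned}$$
```

So the substitution $Z=YT^{n+1}$ carries $(\dagger )$ to the second affine equation, and the factor $T^2$ is what produces the node at $T=0$.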

Proof

We will extract the result from the explicit equations for ${\mathcal {C}}_{\text{ disc }}$ given in Proposition 5.5. Fix a valid disc D and temporarily denote by $\Gamma _D'$ the curve with equations as in the statement, so that we must prove $\Gamma _D'\cong \Gamma _D$. By Proposition 5.5, the scheme ${\mathcal {U}}_{D}{\setminus }\phi ^{-1}({\mathcal {P}}_D)$ of Definition 5.4 is an open subscheme of ${\mathcal {C}}_{\text{ disc }}$. Its special fibre is an open subset of $\phi ^{-1}(E_D)$ and is disjoint from all components contracted when forming ${\mathcal {C}}_{\text{ min }}$. Thus its special fibre is naturally an open subscheme of $\Gamma _D$ and equations for it are readily extracted from Lemma 5.9. In particular, we may define a rational map $\psi :\Gamma _D'\rightarrow \Gamma _D$ by

$$\begin{aligned} (x_0,y_0)\mapsto \left( x_0,y_0h_D(x_0)\right) \end{aligned}$$

(5.21)

where the right hand side is understood with respect to the variables $x_D,y_D$ for the chart ${\mathcal {U}}_{D}{\setminus }\phi ^{-1}({\mathcal {P}}_D)$ and we set

$$\begin{aligned} h_D(x)={\left\{ \begin{array}{ll}\prod _{\begin{array}{c} {{\mathfrak {s}}}'<{{\mathfrak {s}}}\\ \delta _{{{\mathfrak {s}}}'}>1/2 \end{array}}(x-\text{ red}_{{{\mathfrak {s}}}}({{\mathfrak {s}}}'))^{\lfloor \frac{|{{\mathfrak {s}}}'|}{2}\rfloor } ~~&{}~~\text{ if } D \text{ has } \text{ type } \text{ I, } \text{ II }\\ \left( x-\text{ red}_D(D\cap {\mathcal {R}})\right) ^{\lfloor \frac{|D\cap {\mathcal {R}}|}{2}\rfloor } ~~&{}~~\text{ if } D \text{ has } \text{ type } \text{ III, } \text{ VI } \\ 1~~&{}~~\text{ if } D \text{ has } \text{ type } \text{ IV }.\end{array}\right. } \end{aligned}$$

(5.22)

Since ${\mathcal {P}}_D$ is precisely the set of points

$$\begin{aligned} \{\left( \text{ red}_{{{\mathfrak {s}}}}({{\mathfrak {s}}}'),0\right) ~~|~~{{\mathfrak {s}}}'<{{\mathfrak {s}}}~~\text{ and }~~ \delta _{{{\mathfrak {s}}}'}>1/2\} \end{aligned}$$

if D has type I or II, and consists of the single point $\left( \text{ red}_{D}(D\cap {\mathcal {R}}),0\right) $ if D has type III or VI, we see that $\psi $ is invertible on the open subset $U=\Gamma _D\cap \left( {\mathcal {U}}_{D}{\setminus }\phi ^{-1}({\mathcal {P}}_D)\right) $ of $\Gamma _D$. Suppose first that D is not of type II or III. Then by Lemma 5.15 (and its proof) all singular points of $\Gamma _D$ lie in U. Similarly, $\Gamma _D'$ is visibly smooth away from $\psi ^{-1}(U)$ also. Thus $\psi $ is a rational map between complete (possibly reducible) curves $\Gamma _D$ and $\Gamma _D'$ which is an isomorphism away from finitely many smooth points. Thus $\psi $ is in fact an isomorphism everywhere.

The case where D has type II or III is proved identically, with the caveat that one must explicitly check that the map is an isomorphism over an open neighbourhood of the point at infinity on $\Gamma _D$ (since unlike the other cases the curves are not smooth here). This may be done by using the chart ${\mathcal {W}}_{D_\text{ max }}$ of Proposition 5.5 combined with Lemma 5.9 (ii). $\square $

5.7 Reducing points

Since ${\mathcal {C}}_\text{ min }/{\mathcal {O}}$ is proper there is a natural reduction map $C(K^{{\text {nr}}})\rightarrow {\mathcal {C}}_{\text{ min },{\bar{k}}}({\bar{k}})$ whose image consists precisely of the non-singular points (by regularity of ${\mathcal {C}}_{\text{ min }}$ and the fact that $K^{{\text {nr}}}$ is Henselian). We describe this map in the following proposition.

Proposition 5.23

Let $C:y^2=f(x)$ be a hyperelliptic curve satisfying the semistability criterion and let $P=(x_0,y_0)\in C(K^{{\text {nr}}})$.

(i)
Suppose $x_0\in D_\text{ max }$ and let D be a valid disc not of type V. Then P reduces to $\Gamma _{D}$ if and only if $x_0\in D$ but $x_0\notin D'$ for any valid subdisc $D'\subseteq D$. In this case, in the variables X, Y of Proposition 5.20, the reduction of P is, according to the type of D, the following point on $\Gamma _D$:
1. (I,II)
  $$\begin{aligned} \left( \text{ red}_D(x_0)~,~\overline{\pi ^{-\nu _D(f)/2}y_0}\prod _{\begin{array}{c} {{\mathfrak {s}}}'<{{\mathfrak {s}}}\\ \delta _{{{\mathfrak {s}}}'}>\frac{1}{2} \end{array}}\left( {\text {red}}_{{\mathfrak {s}}}(x_0)-{\text {red}}_{{\mathfrak {s}}}({{\mathfrak {s}}}')\right) ^{-\lfloor \frac{|{{\mathfrak {s}}}'|}{2}\rfloor }\right) , \end{aligned}$$
2. (III,VI)
  $$\begin{aligned} \left( \text{ red}_D(x_0)~,~\overline{\pi ^{-\nu _D(f)/2}y_0}\left( \text{ red}_D(x_0)-\text{ red}_D(r)\right) ^{-\lfloor \frac{|D\cap {\mathcal {R}}|}{2}\rfloor }\right) \end{aligned}$$
  for any choice of $r\in D\cap {\mathcal {R}}$,
3. (IV)
  $$\begin{aligned} \left( \text{ red}_D(x_0)~,~\overline{\pi ^{-\nu _D(f)/2}y_0}\right) . \end{aligned}$$
(ii)
Suppose $x_0\notin D_\text{ max }$. Then $\text{ red }(P)$ is a point at infinity on $\Gamma _{D_\text{ max }}$. There is a unique such point unless $|{\mathcal {R}}\cap D_\text{ max }|=2g+2$, in which case $D_\text{ max }$ has type I and P reduces to
$$\begin{aligned} \left( 0\quad ,\quad \frac{\pi ^{-\frac{v(c_f)}{2}}y_0}{(x_0-z_{{{\mathcal {R}}}})^{g+1}}\quad (\text{ mod } {\mathfrak {m}})\right) \end{aligned}$$
in the variables for the chart at infinity of (the equation given in Proposition 5.20 for) $\Gamma _{D_\text{ max }}$.

Proof

As in [26, Definition 10.1.3], $\text{ red }(P)$ is the unique point of intersection of $\overline{\{P\}}$ (the closure of P in ${\mathcal {C}}_{\text{ min }}$) with ${\mathcal {C}}_{\text{ min },{\bar{k}}}$. In particular the reduction map may be computed locally. Now for a valid disc $D\ne D_\text{ max }$ with $\omega _D(f)=0$, the non-singular points of $\Gamma _D$ are all visible on the chart ${\mathcal {U}}_{D}{\setminus }\phi ^{-1}({\mathcal {P}}_D)$ of Proposition 5.5. Changing variables from x, y to $x_D,y_D$ (cf. Proposition 5.5) we see that P corresponds to the point

$$\begin{aligned} \left( \frac{x_0-z_D}{\pi ^{d_D}},\pi ^{-\nu _D(f)/2}y_0\right) \end{aligned}$$

on the generic fibre of ${\mathcal {U}}_D$. One checks readily that the closure of this point in ${\mathcal {U}}_D$ contains a point of the special fibre if and only if $(x_0-z_D)/\pi ^{d_D}$ is integral, i.e. if and only if $x_0\in D$. When this is the case the point is given by

$$\begin{aligned} \left( \text{ red}_D(x_0),\overline{\pi ^{-\nu _D(f)/2}y_0}\right) . \end{aligned}$$

Moreover, this lies in ${\mathcal {U}}_D{\setminus } \phi _{D,1}^{-1}({\mathcal {P}}_D)$ if and only if $x_0$ is not in any valid subdisc of D. To complete the proof it remains to change variables from $x_D,y_D$ to the variables for $\Gamma _D$ of Proposition 5.20, which simply serves to multiply the second coordinate by $h_D(\text{ red}_D(x_0))^{-1}$ with $h_D$ as in (5.22).

The case $D=D_\text{ max }$ may be treated similarly, additionally considering the chart ${\mathcal {W}}_{D_\text{ max }}$ to prove (ii). $\square $

5.8 The stable model of $C/K^{{\text {nr}}}$

We finish the section by giving an explicit description of the stable model of a hyperelliptic curve satisfying the semistability criterion. As in the statement of Proposition 5.12, for a principal cluster ${{\mathfrak {s}}}$ we denote by $\widetilde{\Gamma }_{\mathfrak {s}}$ the hyperelliptic curve

$$\begin{aligned} \widetilde{\Gamma }_{\mathfrak {s}}:y^2=c_{\mathfrak {s}}\prod _{\text{ odd } {{\mathfrak {s}}}'<{{\mathfrak {s}}}}(x-\text{ red}_{\mathfrak {s}}({{\mathfrak {s}}}')). \end{aligned}$$

Theorem 5.24

Let C/K be a hyperelliptic curve satisfying the semistability criterion. Then the stable model ${\mathcal {C}}_{\text{ st }}/{\mathcal {O}}$ of $C/K^{{\text {nr}}}$ is obtained from ${\mathcal {C}}_{\text{ disc }}$ by contracting each component $\Gamma _D$ for which D is not of the form $D({{\mathfrak {s}}})$ for a principal cluster ${{\mathfrak {s}}}$.^{Footnote 11} Its special fibre consists of one component with normalisation $\tilde{\Gamma }_{{\mathfrak {s}}}$ for each non-übereven principal cluster ${{\mathfrak {s}}}$, and two components with normalisation $\mathbb {P}^1_{{\bar{k}}}$ for each übereven principal cluster. Letting $\Sigma $ denote the metric cluster picture associated to C, the dual graph of the special fibre of ${\mathcal {C}}_{\text{ st }}$ is obtained from $G_\Sigma $ by adjusting the length function to give each edge length 1.

Proof

By Theorem 5.16, upon contracting components $\Gamma _D$ for which $\omega _D(f)=1$ we obtain the minimal regular model of $C/K^{{\text {nr}}}$. The stable model is then obtained from this by contracting all components which have self-intersection $-2$ and are isomorphic to $\mathbb {P}^1_{{\bar{k}}}$ ([26, Proposition 9.4.8, Theorem 10.3.34]). Since such components are precisely the ones which give vertices of genus 0 and degree 2 in the dual graph, i.e. precisely the ones which, in the notation of Theorem 5.18, are removed from the vertex set when passing from $\Upsilon _{C}$ to $\widehat{\Upsilon _{C}}$, this proves the claim about the dual graph of ${\mathcal {C}}_{\text{ st }}$. It remains to show that the components which remain are precisely those corresponding to principal clusters. For this one may either argue via Proposition 5.12 and a case by case analysis or use [18, Lemmas 5.5, 5.20] to note that the vertices of $G_\Sigma $ correspond precisely to the principal clusters of $\Sigma $. $\square $

6 Galois action on the models of C

In this section we still work with a hyperelliptic curve $C/K:y^2=f(x)$ and write ${\mathcal {R}}\subseteq \bar{K}$ for the set of roots of f(x). However we now no longer assume that C/K satisfies the semistability criterion, and instead fix a finite Galois extension F/K such that the semistability criterion is satisfied by C/F. For example we may take F/K to be the Galois closure of the extension given by adjoining a square root of a uniformiser to $K({\mathcal {R}})/K$, though the case where $F=K$ and C/K satisfies the semistability criterion is still an important special case of the results of this section. We denote by $\pi _F$ a uniformiser for F and write e for the ramification index of F/K.

Since C/F satisfies the semistability criterion we may apply the constructions of Sects. 4 and 5 with K replaced by F throughout, so that in particular we have an explicit construction of the minimal regular model and stable model of C over $F^{\text{ nr }}$, afforded by Theorems 5.16 and 5.24 respectively. The aim of this section is to describe explicitly the action of $G_K$ on these models (we recall how this action works in Sect. 6.1 below). To talk about these models we use the notation of Sects. 4 and 5, replacing K by F throughout. Thus for example we work with the collection of valid discs (Definition 4.4) defined with respect to F rather than K and fix a choice of centre $z_D\in F^{\text{ nr }}$ for each valid disc D. We caution however that we continue to normalise the valuation v on $\bar{K}$ with respect to K, which is the reason for the appearance of the ramification index e in the formulae below.

6.1 Galois action on components

Let $\mathcal {C}_{\min }/{\mathcal {O}}_{F^{{\text {nr}}}}$ be the minimal regular model of C over $F^{{\text {nr}}}$. We will work with the explicit description of $\mathcal {C}_{\min }/{\mathcal {O}}_{F^{{\text {nr}}}}$ afforded by Theorem 5.16 and Proposition 5.20. Thus the components of its special fibre (or more precisely the $\iota $-orbits of components where $\iota $ denotes the hyperelliptic involution) are indexed by valid discs D not of type V (Definition 5.11), with D corresponding to the component(s) $\Gamma _D$ of Definition 5.19.

As explained in Sect. 2.4 (see in particular (2.16)), the special fibre of $\mathcal {C}_{\min }/{\mathcal {O}}_{F^{{\text {nr}}}}$ carries a natural action of the full Galois group $G_K$ (arising from uniqueness of the model) which is uniquely determined by ‘lift-act-reduce’ on non-singular points. The following quantities will help in describing this action.

Definition 6.1

We define, for $\sigma \in G_K$,

$$\begin{aligned} \chi (\sigma ) = \frac{\sigma (\pi _F)}{\pi _F} \quad (\text{ mod } {\mathfrak {m}}). \end{aligned}$$

Given also a valid disc D not of type V define

$$\begin{aligned} \beta _{D}(\sigma ) = \frac{\sigma (z_D)-z_{\sigma D}}{\pi _F^{ed_{D}}} \quad (\text{ mod } {\mathfrak {m}}), \end{aligned}$$

and

$$\begin{aligned} \lambda _D={\left\{ \begin{array}{ll}\frac{\nu _{{{\mathfrak {s}}}}}{2} - d_{{{\mathfrak {s}}}}\!\sum _{\begin{array}{c} {{\mathfrak {s}}}'<{{\mathfrak {s}}}\\ \delta _{{{\mathfrak {s}}}'}>\frac{1}{2} \end{array}} \lfloor \frac{|{{\mathfrak {s}}}'|}{2}\rfloor ~~&{}~~D~~ \text{ has } \text{ type } \text{ I, } \text{ II }\\ \\ \frac{\nu _D(f)}{2}-d_D\lfloor \frac{|D\cap {\mathcal {R}}|}{2}\rfloor ~~&{}~~D~~ \text{ has } \text{ type } \text{ III, } \text{ VI }\\ \\ \frac{\nu _D(f)}{2}~~&{}~~D~~ \text{ has } \text{ type } \text{ IV, }\end{array}\right. } \end{aligned}$$

where, for types I and II, ${{\mathfrak {s}}}$ denotes the cluster $D\cap {\mathcal {R}}$ and we recall that by definition we have $D=D({{\mathfrak {s}}})$ in these instances.
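As a quick illustration of Definition 6.1, the following sketch records how $\chi $ behaves under composition, and what it looks like in a simple situation. The second line rests on an assumption not made in the text, namely that $\pi _F$ can be chosen with $\pi _F^e=\pi $ for a uniformiser $\pi $ of K (as happens, for instance, when F/K is totally and tamely ramified); the symbol $\zeta _\sigma $ is introduced only for this example.

```latex
$$\begin{aligned}
\frac{\sigma\tau(\pi_F)}{\pi_F}
  &= \sigma\!\left(\frac{\tau(\pi_F)}{\pi_F}\right)\cdot \frac{\sigma(\pi_F)}{\pi_F}
  \quad\Longrightarrow\quad
  \chi(\sigma\tau) = \chi(\sigma)\,\bar{\sigma}\big(\chi(\tau)\big),\\
\pi_F^{e} = \pi \in K
  &\quad\Longrightarrow\quad
  \sigma(\pi_F) = \zeta_\sigma\,\pi_F \text{ with } \zeta_\sigma^{e}=1,
  \qquad \chi(\sigma) = \zeta_\sigma \ (\text{mod } \mathfrak{m}).
\end{aligned}$$
```

The first line uses that $\tau (\pi _F)/\pi _F$ is a unit, since F/K is Galois and so $\tau (\pi _F)$ is again a uniformiser of F. It shows that $\chi $ is a 1-cocycle with values in ${\bar{k}}^\times $, and hence restricts to a homomorphism on the inertia subgroup, where $\bar{\sigma }$ is trivial.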

We now describe the Galois action on the $\Gamma _D$.

Theorem 6.2

Let $C/K:y^2=f(x)$ be a hyperelliptic curve satisfying the semistability criterion over a finite Galois extension F/K and let D be a valid disc (the collection of such defined with respect to F) not of type V. Then any $\sigma \in G_K$ maps $ \Gamma _{D}$ to $\Gamma _{\sigma D}$ and for a point $P=(x_0,y_0)\in \Gamma _D$, we have

$$\begin{aligned} \sigma (P)= \left( \chi (\sigma )^{ed_{D}}\bar{\sigma }(x_0)+\beta _{D}(\sigma ),\chi (\sigma )^{e\lambda _D}\bar{\sigma }(y_0)\right) \in \Gamma _{\sigma D} \end{aligned}$$

where $\bar{\sigma }$ denotes the map induced by $\sigma $ on the residue field ${\bar{k}}$ and the points are written with respect to the variables X, Y for $\Gamma _D$ (resp. $\Gamma _{\sigma D}$) of Proposition 5.20.

Proof

As explained in Sect. 2.4 it suffices to prove the result under the assumption that P is non-singular. We follow the recipe of Sect. 2.4 to act on P. Let $\tilde{P}\in C(F^{{\text {nr}}})$ be a lift of P. Now let $h_D$ be as in (5.22) and $\tilde{h}_D$ any lift of $h_D$ to a polynomial with coefficients in ${\mathcal {O}}_{F^{\text{ nr }}}$. By Proposition 5.23, in the coordinates x, y for C, $\tilde{P}$ has the form

$$\begin{aligned} \left( \pi _F^{ed_D}\tilde{x}_0+z_D~,~\pi _F^{e\nu _D(f)/2}\tilde{y}_0\tilde{h}_D(\tilde{x}_0)\right) \end{aligned}$$

for some $\tilde{x}_0,\tilde{y}_0\in {\mathcal {O}}_{F^{\text{ nr }}}$ with $\tilde{x}_0\equiv x_0~~(\text{ mod } {\mathfrak {m}})$ and $\tilde{y}_0\equiv y_0~~(\text{ mod } {\mathfrak {m}})$ and such that the x-coordinate is not in any valid subdisc of D. Under $\sigma $ this maps to the point

$$\begin{aligned} \tilde{Q}=\left( \sigma (\pi _F)^{ed_D}\sigma (\tilde{x}_0)+\sigma (z_D)~,~\sigma (\pi _F)^{e\nu _D(f)/2}\sigma (\tilde{y}_0)\sigma (\tilde{h}_D(\tilde{x}_0))\right) \in C(F^{{\text {nr}}}). \end{aligned}$$

Note that the x-coordinate of $\tilde{Q}$ is in $\sigma D$ but not in any valid subdisc of $\sigma D$, since the set of valid discs is stable under $\sigma $. Thus this new point reduces to $\Gamma _{\sigma D}$ by Proposition 5.23. In particular $\sigma $ maps $\Gamma _D$ to $\Gamma _{\sigma D}$. Moreover, applying Proposition 5.23 one last time and noting that $\sigma $ preserves depths and $\nu $, we find that $\tilde{Q}$ reduces to

$$\begin{aligned} \left( \chi (\sigma )^{ed_D}\bar{\sigma }(x_0)+\beta _D(\sigma ),\frac{\chi (\sigma )^{e\nu _D(f)/2}\bar{\sigma }(y_0)\bar{\sigma }(h_D(x_0))}{h_{\sigma {D}}\left( \chi (\sigma )^{ed_D}\bar{\sigma }(x_0)+\beta _D(\sigma )\right) }\right) \in \Gamma _{\sigma D}. \end{aligned}$$

In light of the definition of $\lambda _D$ it remains to show that

$$\begin{aligned} \bar{\sigma }(h_D(x_0))=\chi (\sigma )^{-ed_D\text{ deg }(h_D)}h_{\sigma D}\left( \chi (\sigma )^{ed_D}\bar{\sigma }(x_0)+\beta _D(\sigma )\right) . \end{aligned}$$

Writing $h_D^{\bar{\sigma }}(x)$ for the polynomial obtained by applying $\bar{\sigma }$ to the coefficients of $h_D$ we wish to prove the polynomial identity

$$\begin{aligned} h_D^{\bar{\sigma }}(x)=\chi (\sigma )^{-ed_D\text{ deg }(h_D)}h_{\sigma D}\left( \chi (\sigma )^{ed_D}x+\beta _D(\sigma )\right) . \end{aligned}$$

Noting that for any $z\in {\bar{K}}$ we have

$$\begin{aligned} \bar{\sigma }\text{ red}_D(z)&= \bar{\sigma }\left( \frac{z-z_D}{\pi _F^{ed_D}}\right) \\&= \chi (\sigma )^{-ed_D}\left( \text{ red}_{\sigma D}(\sigma z)-\beta _D(\sigma )\right) \end{aligned}$$

(6.3)

and that $h_D^{\bar{\sigma }}$ is the monic polynomial whose (multi)set of roots is given by applying $\bar{\sigma }$ to the (multi)set of roots of $h_D$, the result follows. $\square $
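For the reader's convenience, here is the last step written out for D of type I or II; the other types are the same computation with a single root. The notation is introduced only for this sketch: $\rho _i=\text{ red}_D({{\mathfrak {s}}}'_i)$ are the distinct roots of $h_D$, with multiplicities $m_i$, and $\rho _i'=\text{ red}_{\sigma D}(\sigma {{\mathfrak {s}}}'_i)$. It is assumed that $\sigma $ maps the children of ${{\mathfrak {s}}}$ bijectively onto the children of $\sigma {{\mathfrak {s}}}$, preserving sizes and relative depths, so that the $\rho _i'$ are the roots of $h_{\sigma D}$ with the same multiplicities $m_i$.

```latex
$$\begin{aligned}
\bar{\sigma}(\rho_i) &= \chi(\sigma)^{-ed_D}\big(\rho_i' - \beta_D(\sigma)\big)
  \qquad \text{by (6.3)},\\
h_D^{\bar{\sigma}}(x) &= \prod_i \big(x-\bar{\sigma}(\rho_i)\big)^{m_i}
  = \prod_i \Big(x-\chi(\sigma)^{-ed_D}\big(\rho_i'-\beta_D(\sigma)\big)\Big)^{m_i}\\
  &= \chi(\sigma)^{-ed_D\sum_i m_i}\prod_i \big(\chi(\sigma)^{ed_D}x+\beta_D(\sigma)-\rho_i'\big)^{m_i}\\
  &= \chi(\sigma)^{-ed_D\deg(h_D)}\, h_{\sigma D}\big(\chi(\sigma)^{ed_D}x+\beta_D(\sigma)\big),\\
\chi(\sigma)^{e\nu_D(f)/2}\cdot \chi(\sigma)^{-ed_D\deg(h_D)}
  &= \chi(\sigma)^{e\left(\nu_D(f)/2-d_D\deg(h_D)\right)} = \chi(\sigma)^{e\lambda_D}.
\end{aligned}$$
```

The final line is the exponent bookkeeping that turns the reduction of $\tilde{Q}$ into the formula of the theorem; it uses $\lambda _D=\nu _D(f)/2-d_D\,\text{ deg }(h_D)$, which is how Definition 6.1 is arranged.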

6.2 Galois action on the normalisation of components

In what follows, for any cluster ${{\mathfrak {s}}}$ for which $D({{\mathfrak {s}}})$ is a valid disc (in particular, for all principal clusters) we write $\Gamma _{{\mathfrak {s}}}$ in place of $\Gamma _{D({{\mathfrak {s}}})}$. Note that by Theorem 5.24 the $\Gamma _{{\mathfrak {s}}}$ for ${{\mathfrak {s}}}$ principal account for precisely those components which remain when passing from the minimal regular model of $C/F^{\text{ nr }}$ to the stable model, and by an abuse of notation we denote the associated component(s) of the stable model by $\Gamma _{{\mathfrak {s}}}$ also. Here we describe the Galois action on the normalisation of these components.

Definition 6.4

For a cluster ${{\mathfrak {s}}}$ for which $D({{\mathfrak {s}}})$ is valid (so in particular for all principal clusters) define

$$\begin{aligned} \quad \tilde{\lambda }_{{\mathfrak {s}}}= \frac{\nu _{{{\mathfrak {s}}}}}{2} - d_{{{\mathfrak {s}}}}\!\sum _{{{\mathfrak {s}}}'<{{\mathfrak {s}}}} \lfloor \frac{|{{\mathfrak {s}}}'|}{2}\rfloor . \quad \end{aligned}$$

Define also

$$\begin{aligned} \tilde{\Gamma }_{{\mathfrak {s}}}:y^2=c_{{\mathfrak {s}}}\prod _{\text{ odd } {{\mathfrak {s}}}'<{{\mathfrak {s}}}}(x-\text{ red}_{{\mathfrak {s}}}({{\mathfrak {s}}}')). \end{aligned}$$

By Proposition 5.12 this is the normalisation of $\Gamma _{{\mathfrak {s}}}$ viewed either on the minimal regular model of $C/F^{\text{ nr }}$, or, for ${{\mathfrak {s}}}$ principal, the stable model of $C/F^{\text{ nr }}$.

Corollary 6.5

(of Theorem 6.2) Let $C/K:y^2=f(x)$ be a hyperelliptic curve satisfying the semistability criterion over a finite Galois extension F/K. Let $\sigma \in G_K$, ${{\mathfrak {s}}}$ a principal cluster, and $\Gamma _{{\mathfrak {s}}}$ the associated component(s) of the special fibre of either the minimal regular model, or the stable model, of $C/F^{\text{ nr }}$. Then $\sigma $ maps $\Gamma _{{\mathfrak {s}}}$ to $\Gamma _{\sigma {{\mathfrak {s}}}}$. Moreover, if $\sigma $ stabilises ${{\mathfrak {s}}}$ then the action of $\sigma $ on the normalisation $\tilde{\Gamma }_{{\mathfrak {s}}}$ is, for a point $P=(x_0,y_0)$ in the variables x, y of Definition 6.4, given by

$$\begin{aligned} \sigma (P) = \left( \chi (\sigma )^{ed_{{{\mathfrak {s}}}}}\bar{\sigma }(x_0)+\beta _{{{\mathfrak {s}}}}(\sigma ),\chi (\sigma )^{e\tilde{\lambda }_{{\mathfrak {s}}}}\bar{\sigma }(y_0)\right) . \end{aligned}$$

Proof

Combine Theorem 6.2 with Lemma A.6 (2). $\square $

6.3 Galois action on the dual graph

As in Sect. 2.1.1 the action of $G_K$ on the special fibre of the minimal regular model of $C/F^{\text{ nr }}$ induces an action on its dual graph $\Upsilon _C$ via the action on components, ordinary double points, and tangents. Here we describe this action, beginning with the following definition and lemma.

Definition 6.6

(cf. Definition 1.13) Let ${\mathcal {E}}$ denote the set of even clusters which do not have an übereven parent, excluding ${\mathcal {R}}$ unless ${\mathcal {R}}$ is übereven. For each cluster ${{\mathfrak {s}}}\in {\mathcal {E}}$, fix a square root $\theta _{{\mathfrak {s}}}$ of

$$\begin{aligned} c_f\prod _{r\notin {{\mathfrak {s}}}}(z_{{\mathfrak {s}}}-r). \end{aligned}$$

Having made this choice define, for each $\sigma \in \text{ G}_K$ and ${{\mathfrak {s}}}\in {\mathcal {E}}$,

$$\begin{aligned} \epsilon _{{\mathfrak {s}}}(\sigma )=\frac{\sigma (\theta _{{\mathfrak {s}}})}{\theta _{\sigma {{\mathfrak {s}}}}}\quad (\text{ mod } {\mathfrak {m}}). \end{aligned}$$

Lemma 6.7

For each ${{\mathfrak {s}}}\in {\mathcal {E}}$, the above choice $\theta _{{\mathfrak {s}}}$ of square root of $c_f\prod _{r\notin {{\mathfrak {s}}}}(z_{{\mathfrak {s}}}-r)$ determines:

(i)
if ${{\mathfrak {s}}}$ is a twin with $\delta _{{\mathfrak {s}}}=1/2$, a choice of tangent at the node $(\text{ red}_{P({{\mathfrak {s}}})}({{\mathfrak {s}}}),0)$ on $\Gamma _{P({{\mathfrak {s}}})}$,
(ii)
if ${{\mathfrak {s}}}$ has size 2g, is not übereven, and $\delta _{{\mathfrak {s}}}=1/2$ (i.e. $D({{\mathfrak {s}}})$ has type II.1) a choice of tangent at the node at infinity on $\Gamma _{{\mathfrak {s}}}$,
(iii)
a choice of one of the two points at infinity on $\widetilde{\Gamma _{{\mathfrak {s}}}}$ otherwise.

Proof

We begin with (iii) which is the simplest case. The points at infinity on $\widetilde{\Gamma _{{\mathfrak {s}}}}$ are $(0,\pm \sqrt{c_{{\mathfrak {s}}}})$. Now we compute

$$\begin{aligned} c_{{\mathfrak {s}}}=\frac{\theta _{{\mathfrak {s}}}^2}{\pi _F^{e(\nu _{{\mathfrak {s}}}-|{{\mathfrak {s}}}|d_{{\mathfrak {s}}})}}\quad (\text{ mod } {\mathfrak {m}}). \end{aligned}$$

(6.8)

By Lemmas 4.2 and 4.9, $e(\nu _{{\mathfrak {s}}}-|{{\mathfrak {s}}}|d_{{\mathfrak {s}}})$ is even, so that our choice of $\theta _{{\mathfrak {s}}}$ determines a square root of $c_{{\mathfrak {s}}}$ and hence a choice of a point at infinity, namely the point

$$\begin{aligned} \left( 0,\frac{\theta _{{\mathfrak {s}}}}{\pi _F^{e(\nu _{{\mathfrak {s}}}-|{{\mathfrak {s}}}|d_{{\mathfrak {s}}})/2}}\quad (\text{ mod } {\mathfrak {m}})\right) . \end{aligned}$$

In case (ii) (see (A.3) and Proposition 5.20), the two tangents at the node are similarly given by the points $(0,\pm \sqrt{c_{{\mathfrak {s}}}})$ and we proceed as in case (iii).

Finally, in case (i), (see (A.3) and Proposition 5.20 again) the two tangents are the points

$$\begin{aligned} \left( \text{ red}_{P({{\mathfrak {s}}})}({{\mathfrak {s}}}),\pm \sqrt{c_{P({{\mathfrak {s}}})}\prod _{\text{ odd } {\mathfrak {o}}<P({{\mathfrak {s}}})}\left( \text{ red}_{P({{\mathfrak {s}}})}({{\mathfrak {s}}})-\text{ red}_{P({{\mathfrak {s}}})}({\mathfrak {o}})\right) }\right) \in \widetilde{\Gamma }_{P({{\mathfrak {s}}})}. \end{aligned}$$

This time, we compute

$$\begin{aligned}&c_{P({{\mathfrak {s}}})}\prod _{\text{ odd } {\mathfrak {o}}<P({{\mathfrak {s}}})}\left( \text{ red}_{P({{\mathfrak {s}}})}({{\mathfrak {s}}})-\text{ red}_{P({{\mathfrak {s}}})}({\mathfrak {o}})\right) \\&\quad =\frac{\theta _{{\mathfrak {s}}}^2}{\pi _F^{e(\nu _{P(\mathfrak {s})}-2d_{P(\mathfrak {s})})}} \left( \prod _{\begin{array}{c} {{\mathfrak {s}}}'<P(\mathfrak {s})\\ {{\mathfrak {s}}}'\ne \mathfrak {s} \end{array}}\left( \frac{z_\mathfrak {s}-z_{{{\mathfrak {s}}}'}}{\pi _F^{ed_{P(\mathfrak {s})}}}\right) ^{-\lfloor \frac{|{{\mathfrak {s}}}'|}{2}\rfloor }\right) ^2\quad (\text{ mod } {\mathfrak {m}}). \end{aligned}$$

Again $e(\nu _{P(\mathfrak {s})}-2d_{P(\mathfrak {s})})$ is even (Lemmas 4.2 and 4.9), so our choice of $\theta _{{\mathfrak {s}}}$ determines a choice of one of the tangents, namely

$$\begin{aligned} \left( \text{ red}_{P({{\mathfrak {s}}})}({{\mathfrak {s}}}),\frac{\theta _{{\mathfrak {s}}}}{\pi _F^{e(\nu _{P(\mathfrak {s})}-2d_{P(\mathfrak {s})})/2}} \prod _{\begin{array}{c} {{\mathfrak {s}}}'<P(\mathfrak {s})\\ {{\mathfrak {s}}}'\ne \mathfrak {s} \end{array}}\left( \frac{z_\mathfrak {s}-z_{{{\mathfrak {s}}}'}}{\pi _F^{ed_{P(\mathfrak {s})}}}\right) ^{-\lfloor \frac{|{{\mathfrak {s}}}'|}{2}\rfloor }\quad (\text{ mod } {\mathfrak {m}})\right) . \end{aligned}$$

$\square $

We now return to describing the action of $G_K$ on $\Upsilon _C$. Let $\Sigma $ denote the metric cluster picture associated to f(x) over F (Example D.2), with associated hyperelliptic graph $G_\Sigma $. By Theorem 5.18 we have $\widehat{\Upsilon _C}\cong G_\Sigma $ where $\widehat{\Upsilon _C}$ is the graph obtained from $\Upsilon _C$ by removing from the vertex set all vertices of genus 0 and degree 2. As explained in Definition D.15, to each pair $\rho =(\rho _0,\epsilon _\rho )$ where $\rho _0$ is a permutation of the set $\Sigma $ of proper clusters preserving sizes, inclusions and relative depths, and $\epsilon _\rho $ is a collection of signs $\epsilon _\rho ({{\mathfrak {s}}})\in \{\pm 1\}$ for each cluster ${{\mathfrak {s}}}\in {\mathcal {E}}$, there is an associated automorphism $G(\rho )$ of $G_\Sigma $.

Theorem 6.9

Let $C/K:y^2=f(x)$ be a hyperelliptic curve satisfying the semistability criterion over a finite Galois extension F/K. Denote by $\Upsilon _C$ the dual graph of the special fibre of the minimal regular model of $C/F^{\text{ nr }}$ and $\Sigma $ the metric cluster picture associated to f(x) over F. Fix a choice of $\theta _{{\mathfrak {s}}}$ for each ${{\mathfrak {s}}}\in {\mathcal {E}}$ as in (6.6).

There is an isomorphism of metric graphs $\widehat{\Upsilon _C}\cong G_\Sigma $ under which the action of any $\sigma \in G_K$ corresponds to the automorphism

$$\begin{aligned} \left( \rho (\sigma ),(\epsilon _{{\mathfrak {s}}}(\sigma ))_{{{\mathfrak {s}}}\in {\mathcal {E}}}\right) \end{aligned}$$

of $G_\Sigma $, where $\rho (\sigma )$ is the permutation of the proper clusters of $\Sigma $ induced by the natural action of $\sigma $ on the set of roots of f(x).

Proof

Fix an isomorphism $\widehat{\Upsilon _C}\cong G_\Sigma $ as in Theorem 5.18, so that the isomorphism identifies the respective hyperelliptic involutions (denoted $\iota $) and induces the canonical identification of the quotients $\widehat{\Upsilon _C}/\iota $ and $G_\Sigma /\iota $ detailed there. Note that our choice of $\theta _{{\mathfrak {s}}}$ for each ${{\mathfrak {s}}}\in {\mathcal {E}}$ determines via Lemma 6.7 a choice $n_{{\mathfrak {s}}}^+$ of (below $v_\Gamma $ denotes the vertex of $\Upsilon _C$ corresponding to a component $\Gamma $):

an endpoint of the loop at $v_{\Gamma _{P({{\mathfrak {s}}})}}$ associated to the node $(\text{ red}_{P({{\mathfrak {s}}})}({{\mathfrak {s}}}),0)$ on $\Gamma _{P({{\mathfrak {s}}})}$ if ${{\mathfrak {s}}}$ is a twin with $\delta _{{\mathfrak {s}}}=1/2$,
an endpoint of the loop at infinity on $v_{\Gamma _{{\mathfrak {s}}}}$ when $|{{\mathfrak {s}}}|=2g$ and $\delta _{{\mathfrak {s}}}=1/2$,
one of the two vertices corresponding to $\Gamma _{{\mathfrak {s}}}$ if ${{\mathfrak {s}}}={\mathcal {R}}$ (indeed, note that the points at infinity of $\Gamma _{{\mathfrak {s}}}$ lie on different components),
an edge endpoint at $v_{\Gamma _{{\mathfrak {s}}}}$ for one of the two edges between $v_{{\mathfrak {s}}}$ and $v_{P({{\mathfrak {s}}})}$ otherwise (here if $\Gamma _{{\mathfrak {s}}}$ consists of two components then by this we mean a choice of an edge-endpoint for one of the two edges meeting one of two associated vertices).

Composing our chosen isomorphism $\widehat{\Upsilon _C}\cong G_\Sigma $ with an automorphism of $G_\Sigma $ of the form $(\text{ id },\eta )$ for an appropriate choice of $\eta $ we may assume that the choices $n_{{\mathfrak {s}}}^+\in \widehat{\Upsilon _C}$ (${{\mathfrak {s}}}\in {\mathcal {E}}$) get identified with the corresponding ‘plus’ choice arising from the decomposition of $G_y$ into $G_y^+$ and $G_y^-$ (see Definition D.9 for the definition of these objects).

Now fix $\sigma \in \text{ Gal }(\bar{K}/K)$ and view it as an automorphism of $G_\Sigma $ via the identification above. Since $\sigma $ fixes $\Gamma _{\mathcal {R}}$ (or $\Gamma _{{\mathfrak {s}}}$ if ${\mathcal {R}}$ has size $2g+2$ and a child ${{\mathfrak {s}}}$ of size $2g+1$) and preserves genera, by (D.17) there is an automorphism $\tau =(\tau _0,\epsilon _\tau )$ of $\Sigma $ such that $\sigma $ acts as $G(\tau )$; we will show that $\tau $ must be as in the statement of the theorem. We now determine $\tau _0$ and $\epsilon _\tau $, using the explicit description of automorphisms of this form afforded by (D.16), to which we also refer for the definition of vertices $v^{\pm }_\bullet $ and (half-)edges $e^{\pm }_\bullet $ appearing below.

By Theorem 6.2, $\sigma $ maps $\Gamma _{{\mathfrak {s}}}$ to $\Gamma _{\sigma {{\mathfrak {s}}}}$ so that on $G_\Sigma $, $\sigma $ maps $\{v_{{{\mathfrak {s}}}}^{\pm }\}$ to $\{v_{\sigma {{\mathfrak {s}}}}^{\pm }\}$ for each principal cluster ${{\mathfrak {s}}}$. Similarly, by Theorem 6.2 and 6.3 we see that for a twin ${{\mathfrak {t}}}$, $\sigma $ maps a node $(\text{ red}_{P({{\mathfrak {t}}})}({{\mathfrak {t}}}),0)$ on $\Gamma _{P({{\mathfrak {t}}})}$ to the node $(\text{ red}_{\sigma P({{\mathfrak {t}}})}(\sigma {{\mathfrak {t}}}),0)$ on $\Gamma _{\sigma P({{\mathfrak {t}}})}$, so that $\sigma $ maps $\{e_{{\mathfrak {t}}}^{\pm }\}$ onto $\{e_{\sigma {{\mathfrak {t}}}}^{\pm }\}$ for each twin ${{\mathfrak {t}}}$. It follows that $\tau _0=\rho (\sigma )$ as desired.^{Footnote 12}

For the signs, fix a cluster ${{\mathfrak {s}}}\in {\mathcal {E}}$ with $\delta _{{\mathfrak {s}}}\ne 1/2$ (i.e. case (iii) of Lemma 6.7). Then $n_{{\mathfrak {s}}}^+$ is the specified point at infinity on $\Gamma _{{\mathfrak {s}}}$. By Lemma A.6 (1) and Corollary 6.5 we have

$$\begin{aligned} \sigma (n_{{\mathfrak {s}}}^+)={\left\{ \begin{array}{ll} n_{\sigma {{\mathfrak {s}}}}^+~~&{}~~\frac{\chi (\sigma )^{e\lambda _{{\mathfrak {s}}}}}{\chi (\sigma )^{ed_{{\mathfrak {s}}}(n+1)}}\cdot \frac{\sigma (\sqrt{c_{{\mathfrak {s}}}})}{\sqrt{c_{{\mathfrak {s}}}}}=1\\ n_{\sigma {{\mathfrak {s}}}}^-~~&{}~~ \text{ else, }\end{array}\right. } \end{aligned}$$

where $2n+2$ is the degree of the defining polynomial of $\Gamma _{{\mathfrak {s}}}$. Using (6.8) and the definition of $\lambda _{{\mathfrak {s}}}$, we compute

$$\begin{aligned} \frac{\chi (\sigma )^{e\lambda _{{\mathfrak {s}}}}}{\chi (\sigma )^{ed_{{\mathfrak {s}}}(n+1)}}\cdot \frac{\sigma (\sqrt{c_{{\mathfrak {s}}}})}{\sqrt{c_{{\mathfrak {s}}}}}=\frac{\sigma (\theta _{{\mathfrak {s}}})}{\theta _{\sigma {{\mathfrak {s}}}}}~~\quad (\text{ mod } {\mathfrak {m}})~~=~~\epsilon _{{\mathfrak {s}}}(\sigma ). \end{aligned}$$

Comparing this with the action of $G(\tau )$ on $G_\Sigma $ detailed in (D.16) we see that $\epsilon _\tau ({{\mathfrak {s}}})=\epsilon _{{\mathfrak {s}}}(\sigma )$ as desired.

Finally, for the nodes we compute using Theorem 6.2, Lemma 6.7 and Lemma A.6 (2) that $\epsilon _\tau ({{\mathfrak {s}}})=\epsilon _{{\mathfrak {s}}}(\sigma )$ similarly. $\square $

7 The semistability criterion is equivalent to semistability

We now complete the proof that the semistability criterion (Definition 1.8) is equivalent to semistability.

Theorem 7.1

Let $C:y^2=f(x)$ be a hyperelliptic curve over K. Then C/K is semistable if and only if it satisfies the semistability criterion.

Proof

When C/K satisfies the semistability criterion, Theorem 5.16 gives an explicit semistable model of C over ${\mathcal {O}}_{K^{{\text {nr}}}}$. Since semistability may be checked after unramified extension, it follows that C is semistable over K.

Now suppose that C/K is semistable. We will show that $K({\mathcal {R}})/K$ is tamely ramified and that each principal cluster ${{\mathfrak {s}}}$ is fixed by inertia, has $d_{{\mathfrak {s}}}\in {\mathbb {Z}}$ and $\nu _{{\mathfrak {s}}}\in 2{\mathbb {Z}}$. This is equivalent to the semistability criterion by Proposition C.4.

As C/K is semistable, so is its Jacobian $\text{ Jac }(C)/K$ [13, Theorem 2.4], whence the inertia group of K acts unipotently on the 2-adic Tate module of $\text{ Jac }(C)$ [23, 3.5/3.8]. In particular inertia acts on $\text{ Jac }(C)[2]$ through a 2-group, so wild inertia, being a pro-p group with p odd, acts trivially. It follows that $K(\text{ Jac }(C)[2])/K$ is tamely ramified. As $K(\text{ Jac }(C)[2])=K({\mathcal {R}})$ (see e.g. [11, Lemma 2.1]), $K({\mathcal {R}})/K$ is tame.

Now consider the stable model $\mathcal {C}_\text{ st }/{\mathcal {O}}_{K^{{\text {nr}}}}$ (which exists since C/K is assumed semistable). Fix a tame extension F/K, of ramification degree e say, over which C satisfies the semistability criterion (e.g. a quadratic ramified extension of $K({\mathcal {R}})$) and set $I_{F/K}={\text {Gal}}(F^{{\text {nr}}}/K^{{\text {nr}}})$. By [26, Lemma 10.3.30] the formation of the stable model commutes with base change; in other words, the stable model of C over $F^{{\text {nr}}}$ is

$$\begin{aligned} \mathcal {C}'=\mathcal {C}_\text{ st }\times _{{\mathcal {O}}_{K^{{\text {nr}}}}} {\mathcal {O}}_{F^{{\text {nr}}}}. \end{aligned}$$

In particular, the unique extension of the action of $I_{F/K}$ on $C/F^{{\text {nr}}}$ to $\mathcal {C}'$ is via the second factor and becomes trivial upon passing to the special fibre.

On the other hand, since C satisfies the semistability criterion over F we have an explicit description of the stable model over ${\mathcal {O}}_{F^{{\text {nr}}}}$ complete with action of $I_{F/K}$ on its special fibre afforded by Theorem 5.24 and Corollary 6.5. In order that this action be trivial we see from Corollary 6.5 that each principal cluster must be fixed by $I_{F/K}$ (and hence the full inertia group of K) else $I_{F/K}$ would permute components of $\mathcal {C}'_{{\bar{k}}}$. Moreover, since the character $\chi $ of Definition 6.1 has exact order e when restricted to $I_{F/K}$, for each principal cluster ${{\mathfrak {s}}}$ we deduce from Corollary 6.5 that both $d_{{\mathfrak {s}}}$ and $\tilde{\lambda }_{{\mathfrak {s}}}$ must be integers. Equivalently $d_{{\mathfrak {s}}}\in {\mathbb {Z}}$ and $\nu _{{\mathfrak {s}}}\in 2{\mathbb {Z}}$ as desired. $\square $
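
The numerical part of the conclusion of this proof ($d_{{\mathfrak {s}}}\in {\mathbb {Z}}$ and $\nu _{{\mathfrak {s}}}\in 2{\mathbb {Z}}$ for each principal cluster) is easy to test mechanically. The sketch below does only that: it does not check that $K({\mathcal {R}})/K$ is tame or that principal clusters are fixed by inertia. The function name and the convention of passing one pair $(d_{{\mathfrak {s}}},\nu _{{\mathfrak {s}}})$ per principal cluster are assumptions made for this example.

```python
from fractions import Fraction


def numeric_conditions_hold(principal_clusters):
    """Return True if d_s is an integer and nu_s is an even integer
    for every pair (d_s, nu_s) in principal_clusters."""
    for d, nu in principal_clusters:
        d, nu = Fraction(d), Fraction(nu)
        if d.denominator != 1:
            return False  # d_s is not an integer
        if nu.denominator != 1 or nu.numerator % 2 != 0:
            return False  # nu_s is not an even integer
    return True


# Hypothetical data: two principal clusters with integral depth and even nu.
ok = numeric_conditions_hold([(1, 6), (0, 0)])
# Hypothetical data: a cluster of depth 1/3, which fails the first condition.
bad = numeric_conditions_hold([(Fraction(1, 3), 1)])
```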

8 Special fibre of the minimal regular model

Here we collect and present the relevant notation and results from Sects. 4, 5 and 6 for the convenience of the reader. In particular we present the special fibre of the minimal regular model of $C/F^{\text{ nr }}$ in a self-contained manner, that does not refer to the constructions in [18].

Let $C/K:y^2 =f(x)$ be a hyperelliptic curve and F/K a finite Galois extension over which C becomes semistable. By Theorem 7.1, C/F satisfies the semistability criterion, so that all the constructions of Sects. 4, 5 and 6 are valid over F.

For the rest of this section we fix the following data.

Notation 8.1

Fix as above a finite Galois extension F/K over which C is semistable and let $\pi _F$ denote a fixed choice of uniformiser of F. For each proper cluster ${{\mathfrak {s}}}$, fix a centre $z_{{\mathfrak {s}}}\in F^{\text{ nr }}$ (possible by Lemma 4.2). Additionally, for every even cluster ${{\mathfrak {s}}}\ne {\mathcal {R}}$ that does not have an übereven parent, and for ${\mathcal {R}}$ if it is übereven, fix a square root $\theta _{{\mathfrak {s}}}$ of $ c_f\prod \nolimits _{r \notin {{\mathfrak {s}}}} (z_{{\mathfrak {s}}}-r). $

We write e for the ramification degree of F/K and v for the valuation on $\bar{K}$ normalised with respect to K, so that, in particular, $v(\pi _F)=1/e$.

8.1 Components and characters

Definition 8.2

For $\sigma \in G_K$ set

$$\begin{aligned} \chi (\sigma ) = \frac{\sigma (\pi _F)}{\pi _F} \mod \mathfrak {m}. \end{aligned}$$

For principal clusters ${{\mathfrak {s}}}$ define

$$\begin{aligned} \tilde{\lambda }_{{\mathfrak {s}}}= & {} \frac{\nu _{{{\mathfrak {s}}}}}{2} - d_{{{\mathfrak {s}}}}\!\sum _{{{\mathfrak {s}}}'<{{\mathfrak {s}}}} \lfloor \frac{|{{\mathfrak {s}}}'|}{2}\rfloor \qquad \text { and } \\ \alpha _{{\mathfrak {s}}}(\sigma )= & {} \chi (\sigma )^{ed_{{\mathfrak {s}}}}, \qquad \quad \beta _{{{\mathfrak {s}}}}(\sigma ) = \frac{\sigma (z_{{{\mathfrak {s}}}})-z_{\sigma {{\mathfrak {s}}}}}{\pi _F^{ed_{{{\mathfrak {s}}}}}} \mod \mathfrak {m}, \qquad \quad \gamma _{{\mathfrak {s}}}(\sigma ) = \chi (\sigma )^{e\tilde{\lambda }_{{\mathfrak {s}}}}. \end{aligned}$$

If ${{\mathfrak {s}}}$ is either even or a cotwin, define $\epsilon _{{\mathfrak {s}}}:G_K\rightarrow \{\pm 1\}$ by

$$\begin{aligned} \epsilon _{{{\mathfrak {s}}}}(\sigma ) \equiv \frac{\sigma (\theta _{{{\mathfrak {s}}}^*})}{\theta _{(\sigma {{\mathfrak {s}}})^*}} \mod \mathfrak {m}. \end{aligned}$$

For all other clusters ${{\mathfrak {s}}}$, set $\epsilon _{{\mathfrak {s}}}(\sigma )=0$.

Remark 8.3

On the inertia group $I_K<G_K$ the map $\chi $, and therefore the $\alpha _{{\mathfrak {s}}}$ and $\gamma _{{\mathfrak {s}}}$ as well, are independent of the choices of F, $\pi _F$ and $z_{{\mathfrak {s}}}$, and are characters $I_K\rightarrow {\bar{k}}^\times $ which are trivial on wild inertia. When restricted to the stabiliser $I_{{\mathfrak {s}}}$, the character $\gamma _{{\mathfrak {s}}}$ has order the prime-to-p part of the denominator of $|I_K/I_{{\mathfrak {s}}}|\,{\tilde{\lambda }}_{{\mathfrak {s}}}$.
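
The order stated at the end of Remark 8.3 can be computed directly from $|I_K/I_{{\mathfrak {s}}}|$, ${\tilde{\lambda }}_{{\mathfrak {s}}}$ and the residue characteristic p. The following is a minimal sketch of that arithmetic; the function names and the sample inputs are hypothetical.

```python
from fractions import Fraction


def prime_to_p_part(n, p):
    """Strip all factors of the prime p from the positive integer n."""
    while n % p == 0:
        n //= p
    return n


def gamma_order(index, lam, p):
    """Prime-to-p part of the denominator of index * lam, where
    index stands for |I_K / I_s| and lam for lambda-tilde_s."""
    value = Fraction(index) * Fraction(lam)
    return prime_to_p_part(value.denominator, p)


# Hypothetical inputs: index 1, lambda-tilde = 1/6, residue characteristic 3.
order = gamma_order(1, Fraction(1, 6), 3)
```

Here the denominator is 6 and its prime-to-3 part is 2, so the sketch returns 2 for this input.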

Remark 8.4

Note that $\epsilon _{{\mathfrak {s}}}$ does not depend on the choice of centre $z_{{{\mathfrak {s}}}^*}$: if $z'_{{{\mathfrak {s}}}^*}$ is another centre and $r \notin {{\mathfrak {s}}}^*$ then $v(z_{{{\mathfrak {s}}}^*}-r) < v(z_{{{\mathfrak {s}}}^*}-z'_{{{\mathfrak {s}}}^*})$ so the leading term in the p-adic expansion of $z'_{{{\mathfrak {s}}}^*}-r$ is the same as that of $z_{{{\mathfrak {s}}}^*}-r$.

When ${{\mathfrak {s}}}$ is even or a cotwin, $\epsilon _{{\mathfrak {s}}}$ restricts to a character on the stabiliser of ${{\mathfrak {s}}}$ that is also independent of the choice of the sign of $\theta _{{{\mathfrak {s}}}^*}$. This character is unramified if and only if $|I_K/I_{{{\mathfrak {s}}}^*}| (v(c_f)+\sum _{r\notin {{\mathfrak {s}}}^*}d_{r\wedge {{\mathfrak {s}}}^*})$ is even. If ${{\mathfrak {s}}}$ is not a cotwin, this is equivalent to ${\text {ord}}_2 |I_K/I_{{\mathfrak {s}}}| (v(c_f)+\sum _{r\notin {{\mathfrak {s}}}}d_{r\wedge {{\mathfrak {s}}}})\ge 1$. (To see this, first pass to the fixed field of $I_{{\mathfrak {s}}}$, and then to an odd degree ramified extension that kills the wild inertia action. We then need to check that whenever ${{\mathfrak {s}}}<P({{\mathfrak {s}}})$ with $P({{\mathfrak {s}}})$ übereven and ${{\mathfrak {s}}}$ is fixed by inertia, $\sum _{r\notin {{\mathfrak {s}}}}d_{r\wedge {{\mathfrak {s}}}} - \sum _{r\notin P({{\mathfrak {s}}})}d_{r\wedge P({{\mathfrak {s}}})}$ is even. The latter is $d_{P({{\mathfrak {s}}})}(|P({{\mathfrak {s}}})|-|{{\mathfrak {s}}}|)$, which is even by [6, Thm 1.3(iii)].)

Definition 8.5

For principal clusters ${{\mathfrak {s}}}$ define $c_{{{\mathfrak {s}}}} \in \bar{k}^\times $ by

$$\begin{aligned} c_{{\mathfrak {s}}}=\hat{c}_f \prod _{r \notin {{\mathfrak {s}}}}\widehat{(z_{{\mathfrak {s}}}-r)} ~~ \text{ mod } {\mathfrak {m}}. \end{aligned}$$

For proper clusters ${{\mathfrak {s}}}$ set

$$\begin{aligned} \begin{array}{llllll} {\text {red}}_{{\mathfrak {s}}}(t)= \frac{t-z_{{\mathfrak {s}}}}{\pi _F^{ed_{{\mathfrak {s}}}}}\mod \mathfrak {m} \end{array} \end{aligned}$$

for those $t\in \bar{K}$ for which the above formula makes sense. For ${{\mathfrak {s}}}'<{{\mathfrak {s}}}$, by ${\text {red}}_{{\mathfrak {s}}}({{\mathfrak {s}}}')$ we mean ${\text {red}}_{{\mathfrak {s}}}(r)$ for any $r \in {{\mathfrak {s}}}'$.
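
To make the map ${\text {red}}_{{\mathfrak {s}}}$ concrete, here is a minimal sketch in the simplest possible setting, which is an assumption of this example and not of the text: $K=F={\mathbb {Q}}_p$ with $\pi _F=p$ (so $e=1$), an integral depth $d_{{\mathfrak {s}}}\ge 0$, and t and the centre $z_{{\mathfrak {s}}}$ both ordinary integers. The function name `red` is likewise chosen for illustration.

```python
def red(t, z, d, p):
    """Reduce (t - z) / p**d modulo p, for integers t, z and integer d >= 0.

    Raises ValueError when p**d does not divide t - z, i.e. when the
    quotient is not p-integral and the formula does not make sense.
    """
    quotient, remainder = divmod(t - z, p ** d)
    if remainder != 0:
        raise ValueError("v(t - z) < d: reduction undefined")
    return quotient % p


# Hypothetical data: p = 5, centre z = 0, depth d = 1.
# The two integers 5 and 30 differ by 25, so they lie in a common
# deeper cluster and must have the same reduction.
r1 = red(5, 0, 1, 5)
r2 = red(30, 0, 1, 5)
```

The equality of `r1` and `r2` illustrates why ${\text {red}}_{{\mathfrak {s}}}({{\mathfrak {s}}}')$ does not depend on the choice of $r\in {{\mathfrak {s}}}'$ in this toy setting.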

If ${{\mathfrak {s}}}$ is a principal cluster and $e\delta _{{\mathfrak {s}}}\ne 1/2$ we define the hyperelliptic curve $\Gamma _{{\mathfrak {s}}}/{\bar{k}}$ by

$$\begin{aligned} \qquad \qquad \Gamma _{{\mathfrak {s}}}:\>\> Y^2 = c_{{\mathfrak {s}}}\prod _{{\text {odd }}{{{\mathfrak {o}}}< {{\mathfrak {s}}}}}(X-{\text {red}}_{{\mathfrak {s}}}({\mathfrak {o}})) \prod _{\begin{array}{c} \text {twin }{{\mathfrak {t}}}<{{\mathfrak {s}}}\\ e\delta _{{{\mathfrak {t}}}}=1/2 \end{array}}(X-{\text {red}}_{{\mathfrak {s}}}({\mathfrak {t}}))^2. \end{aligned}$$

If ${{\mathfrak {s}}}$ is principal and $e\delta _{{\mathfrak {s}}}=1/2$ we define the curve $\Gamma _{{\mathfrak {s}}}/\bar{k}$ to be the glueing of the affine curves

$$\begin{aligned} Y^2=c_{{\mathfrak {s}}}\prod _{{\text {odd }}{{{\mathfrak {o}}}< {{\mathfrak {s}}}}}(X-{\text {red}}_{{\mathfrak {s}}}({\mathfrak {o}})) \prod _{\begin{array}{c} \text {twin }{{\mathfrak {t}}}<{{\mathfrak {s}}}\\ e\delta _{{\mathfrak {t}}}=1/2 \end{array}}(X-{\text {red}}_{{\mathfrak {s}}}({\mathfrak {t}}))^2\qquad (\dagger ) \end{aligned}$$

and

$$\begin{aligned} Z^2=c_{{\mathfrak {s}}}T^{2}\prod _{{\text {odd }}{{{\mathfrak {o}}}< {{\mathfrak {s}}}}}(1-{\text {red}}_{{\mathfrak {s}}}({\mathfrak {o}})T) \prod _{\begin{array}{c} \text {twin }{{\mathfrak {t}}}<{{\mathfrak {s}}}\\ e\delta _{{\mathfrak {t}}}=1/2 \end{array}}(1-{\text {red}}_{{\mathfrak {s}}}({\mathfrak {t}})T)^2 \end{aligned}$$

over the subsets $Y\ne 0$ and $T\ne 0$ via $Z=YT^{n+1}$ where n is half the degree of the right hand side of $(\dagger )$ (note that this is not the usual chart at infinity).

As in Sect. 5, each $\Gamma _{{\mathfrak {s}}}$ corresponds to one or possibly two components of the special fibre of the minimal regular model of C over ${\mathcal {O}}_{F^{{\text {nr}}}}$. The following theorem describes how these components fit together: roughly $\Gamma _{{\mathfrak {s}}}$ and $\Gamma _{{{\mathfrak {s}}}'}$ are linked by chains of curves isomorphic to ${\mathbb {P}}^1_{\bar{k}}$ whenever ${{\mathfrak {s}}}' \,\scriptstyle <\,{{\mathfrak {s}}}$ and there is a loop of such curves from $\Gamma _{{\mathfrak {s}}}$ to itself for each twin or cotwin ${{\mathfrak {t}}}\,\scriptstyle <\,{{\mathfrak {s}}}$ or ${{\mathfrak {s}}}\,\scriptstyle <\,{{\mathfrak {t}}}$. It also describes the corresponding Galois action and the reduction map.

Theorem 8.6

Let F/K be an extension over which C is semistable.

(1) Let $\Upsilon _C$ be the dual graph of the special fibre of the minimal regular model of C over ${\mathcal {O}}_{F^{{\text {nr}}}}$. Then $\Upsilon _C$ has a vertex $v_{{\mathfrak {s}}}$ corresponding to $\Gamma _{{\mathfrak {s}}}$ for every non-übereven principal cluster and two vertices $v_{{{\mathfrak {s}}}}^+, v_{{{\mathfrak {s}}}}^-$ for each übereven principal cluster ${{\mathfrak {s}}}$. These are linked by chains of edges as follows (where we write $v_{{\mathfrak {s}}}= v_{{{\mathfrak {s}}}}^+=v_{{{\mathfrak {s}}}}^-$ whenever ${{\mathfrak {s}}}$ is not übereven).

Conditions	Name	From	To	Length
${{\mathfrak {s}}}'<{{\mathfrak {s}}}$ both principal, ${{\mathfrak {s}}}'$ odd	$L_{{{\mathfrak {s}}}'}$	$v_{{{\mathfrak {s}}}'}$	$v_{{{\mathfrak {s}}}}$	$\frac{1}{2} \delta _{{{\mathfrak {s}}}'}$
${{\mathfrak {s}}}'<{{\mathfrak {s}}}$ both principal, ${{\mathfrak {s}}}'$ even	$L_{{{\mathfrak {s}}}'}^+$	$v_{{{\mathfrak {s}}}'}^+$	$v_{{{\mathfrak {s}}}}^+$	$\delta _{{{\mathfrak {s}}}'}$
	$L_{{{\mathfrak {s}}}'}^-$	$v_{{{\mathfrak {s}}}'}^-$	$v_{{{\mathfrak {s}}}}^-$	$\delta _{{{\mathfrak {s}}}'}$
${{\mathfrak {s}}}$ principal, ${{\mathfrak {t}}}<{{\mathfrak {s}}}$ twin	$L_{{{\mathfrak {t}}}}$	$v_{{{\mathfrak {s}}}}^-$	$v_{{{\mathfrak {s}}}}^+$	$2 \delta _{{\mathfrak {t}}}$
${{\mathfrak {s}}}$ principal, ${{\mathfrak {s}}}<{{\mathfrak {t}}}$ cotwin	$L_{{{\mathfrak {t}}}}$	$v_{{{\mathfrak {s}}}}^-$	$v_{{{\mathfrak {s}}}}^+$	$2 \delta _{{\mathfrak {s}}}$

Moreover, if $\mathcal {R}$ is not principal

$ \mathcal {R}= {{\mathfrak {s}}}_1\coprod {{\mathfrak {s}}}_2$, with ${{\mathfrak {s}}}_1, {{\mathfrak {s}}}_2$ principal odd	$L_{{{\mathfrak {s}}}_1,{{\mathfrak {s}}}_2}$	$v_{{{\mathfrak {s}}}_1}$	$v_{{{\mathfrak {s}}}_2}$	$\frac{1}{2} (\delta _{{{\mathfrak {s}}}_1}+\delta _{{{\mathfrak {s}}}_2})$
$ \mathcal {R}= {{\mathfrak {s}}}_1\coprod {{\mathfrak {s}}}_2$, with ${{\mathfrak {s}}}_1, {{\mathfrak {s}}}_2$ principal even	$L_{{{\mathfrak {s}}}_1,{{\mathfrak {s}}}_2}^+$	$v_{{{\mathfrak {s}}}_1}^+$	$v_{{{\mathfrak {s}}}_2}^+$	$\delta _{{{\mathfrak {s}}}_1}+\delta _{{{\mathfrak {s}}}_2}$
	$L_{{{\mathfrak {s}}}_1,{{\mathfrak {s}}}_2}^-$	$v_{{{\mathfrak {s}}}_1}^-$	$v_{{{\mathfrak {s}}}_2}^-$	$\delta _{{{\mathfrak {s}}}_1}+\delta _{{{\mathfrak {s}}}_2}$
$\mathcal {R}= {{\mathfrak {s}}}\coprod {{\mathfrak {t}}}$, with ${{\mathfrak {s}}}$ principal even, ${{\mathfrak {t}}}$ twin	$L_{{{\mathfrak {t}}}}$	$v_{{{\mathfrak {s}}}}^-$	$v_{{{\mathfrak {s}}}}^+$	$2(\delta _{{\mathfrak {s}}}+\delta _{{\mathfrak {t}}})$

(2) If $\sigma \in G_K$ then it acts on $\Upsilon _C$ by

(i) $ \sigma (v_{{{\mathfrak {s}}}}^\pm )=v_{\sigma ({{\mathfrak {s}}})}^{\pm \epsilon _{{{\mathfrak {s}}}}(\sigma )}$;
(ii) $\sigma (L_{{{\mathfrak {s}}}}^\pm )=L_{\sigma ({{\mathfrak {s}}})}^{\pm \epsilon _{{{\mathfrak {s}}}}(\sigma )}$;
(iii) for ${{\mathfrak {t}}}$ twin or cotwin $\sigma (L_{{{\mathfrak {t}}}})=\epsilon _{{{\mathfrak {t}}}}(\sigma )L_{\sigma ({{\mathfrak {t}}})}$, where $-L$ denotes L with reversed orientation;

and the induced permutation on the remaining edges and vertices.

(3) If $\sigma \in G_K$ and ${{\mathfrak {s}}}$ is a principal cluster then^{Footnote 13} $\sigma $ maps $ \Gamma _{{{\mathfrak {s}}}}$ to $\Gamma _{\sigma {{\mathfrak {s}}}}$ and

$$\begin{aligned} \sigma |_{\Gamma _{{\mathfrak {s}}}}(x,y) = \left( \chi (\sigma )^{ed_{{{\mathfrak {s}}}}}\bar{\sigma }(x)+\beta _{{{\mathfrak {s}}}}(\sigma ),\chi (\sigma )^{e\lambda _{{\mathfrak {s}}}}\bar{\sigma }(y)\right) \in \Gamma _{\sigma {{\mathfrak {s}}}}, \end{aligned}$$

where $ \lambda _{{\mathfrak {s}}}= \frac{\nu _{{{\mathfrak {s}}}}}{2} - d_{{{\mathfrak {s}}}}\!\sum \lfloor \frac{|{{\mathfrak {s}}}'|}{2}\rfloor $, the sum taken over ${{\mathfrak {s}}}'<{{\mathfrak {s}}}$ with $e\delta _{{{\mathfrak {s}}}'}>\frac{1}{2}$.

(4) The point $(x,y) \in C(F^{\text{ nr }})$ reduces to $\Gamma _{{\mathfrak {s}}}$ if and only if:

(i) $v(x-z_{{\mathfrak {s}}}) \ge d_{{\mathfrak {s}}}$ and ${\text {red}}_{{{\mathfrak {s}}}}(x) \ne {\text {red}}_{{{\mathfrak {s}}}}({{\mathfrak {s}}}')$ for any proper cluster ${{\mathfrak {s}}}'<{{\mathfrak {s}}}$, or
(ii) $|{{\mathfrak {s}}}| \ge 2g+1$ and $v(x-z_{{\mathfrak {s}}}) < d_{{\mathfrak {s}}}$.

Explicitly, for one of these points^{Footnote 14},

$$\begin{aligned} (x,y) \mapsto \left( {\text {red}}_{{\mathfrak {s}}}(x), ~\pi _F^{-\frac{e\nu _{{\mathfrak {s}}}}{2}}y\cdot \!\!\!\! \prod _{\begin{array}{c} {{\mathfrak {s}}}'<{{\mathfrak {s}}}\\ e\delta _{{{\mathfrak {s}}}'}>\frac{1}{2} \end{array}}\left( {\text {red}}_{{\mathfrak {s}}}(x)-{\text {red}}_{{\mathfrak {s}}}({{\mathfrak {s}}}')\right) ^{-\lfloor \frac{|{{\mathfrak {s}}}'|}{2}\rfloor }\right) . \end{aligned}$$

Proof

(1)
The dual graph of the special fibre is given by Theorem 5.18 and Remark D.11 gives the explicit description.
(2)
follows by combining Theorem 6.9 with (D.16).
(3)
This is Theorem 6.2.
(4)
This is Proposition 5.23.

$\square $

Corollary 8.7

Let C/K be a semistable hyperelliptic curve. Then the number of components in the special fibre of its minimal regular model over ${\mathcal {O}}_{K^{{\text {nr}}}}$ is

$$\begin{aligned} m_C = \sum _{\begin{array}{c} {{\mathfrak {s}}}\text { odd, } \\ |{{\mathfrak {s}}}|\ne 1,~2g+1 \end{array}} \frac{\delta _{{{\mathfrak {s}}}}}{2} + \sum _{\begin{array}{c} {{\mathfrak {s}}}\ne \mathcal {R}, \\ \text {even} \end{array}} 2\delta _{{{\mathfrak {s}}}} +1-{\text {rk}}H_1(\Upsilon _C,\mathbb {Z}). \end{aligned}$$

Proof

This follows from the usual Euler characteristic formula for $H_1$ of a graph, and counting the total number of edges in part (1) of the theorem. $\square $
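
The formula of Corollary 8.7 is a finite sum over clusters and can be evaluated mechanically once the cluster picture and the rank of $H_1(\Upsilon _C,\mathbb {Z})$ are known. The sketch below assumes the clusters are supplied as triples (size, $\delta _{{\mathfrak {s}}}$, flag marking $\mathcal {R}$); this data format, the function name and the sample input are hypothetical. The value of $\delta $ passed for $\mathcal {R}$ is never used, since $\mathcal {R}$ is excluded from both sums.

```python
from fractions import Fraction


def component_count(clusters, genus, h1_rank):
    """Evaluate the formula of Corollary 8.7.

    clusters : iterable of (size, delta, is_root) triples, one per cluster
    genus    : the genus g of C
    h1_rank  : rank of H_1 of the dual graph
    """
    total = Fraction(0)
    for size, delta, is_root in clusters:
        delta = Fraction(delta)
        if size % 2 == 1:
            # Odd clusters contribute delta/2, except sizes 1 and 2g+1.
            if size not in (1, 2 * genus + 1):
                total += delta / 2
        elif not is_root:
            # Even clusters other than R contribute 2*delta.
            total += 2 * delta
    return total + 1 - h1_rank


# Hypothetical input: genus 2, R of size 6 whose only non-singleton child
# is a twin with delta = 1, and a dual graph with one independent loop.
m = component_count([(6, 0, True), (2, 1, False)], genus=2, h1_rank=1)
```

For this input the twin contributes a loop of length $2\delta _{{\mathfrak {t}}}=2$ at $v_{\mathcal {R}}$ in the table of Theorem 8.6 (1), and the formula gives $2+1-1=2$.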

We now describe the normalisation of each $\Gamma _{{\mathfrak {s}}}$ as well as the induced Galois action.

Theorem 8.8

For a principal cluster ${{\mathfrak {s}}}$ the normalisation of $\Gamma _{{\mathfrak {s}}}$ is given by

$$\begin{aligned} \qquad \qquad \widetilde{\Gamma }_{{\mathfrak {s}}}:\>\> Y^2 = c_{{\mathfrak {s}}}\prod _{{\text {odd }}{{{\mathfrak {o}}} < {{\mathfrak {s}}}}}(X-{\text {red}}_{{\mathfrak {s}}}({\mathfrak {o}})). \end{aligned}$$

(i)
If $\sigma \in G_K$, the induced map from $ \widetilde{\Gamma }_{{{\mathfrak {s}}}}$ to $\widetilde{\Gamma }_{\sigma {{\mathfrak {s}}}}$ is given by
$$\begin{aligned} \sigma |_{\widetilde{\Gamma }_{{\mathfrak {s}}}}(x,y) = \left( \alpha _{{\mathfrak {s}}}(\sigma )\sigma (x)+\beta _{{{\mathfrak {s}}}}(\sigma ),\gamma _{{\mathfrak {s}}}(\sigma )\sigma (y)\right) . \end{aligned}$$
(ii)
If $\sigma \in I_{{\mathfrak {s}}}$, the geometric automorphism of ${\tilde{\Gamma }}_{{{\mathfrak {s}}}}(\bar{k})$ given by $\sigma $
$$\begin{aligned} \begin{array}{ll} \hbox {swaps two points at infinity} &{} \hbox {if }\epsilon _{{\mathfrak {s}}}(\sigma )=-1,\\ \hbox {fixes two points at infinity} &{} \hbox {if }\epsilon _{{\mathfrak {s}}}(\sigma )=1,\\ \hbox {fixes the unique point at infinity} &{} \hbox {if }\epsilon _{{\mathfrak {s}}}(\sigma )=0. \end{array} \end{aligned}$$
(iii)
If the point $P=(x,y) \in C(F^{\text{ nr }})$ reduces to $\bar{P} \in \Gamma _{{\mathfrak {s}}}$ then $\bar{P}$ corresponds to the point
$$\begin{aligned} \left( {\text {red}}_{{\mathfrak {s}}}(x), ~\pi _F^{-\frac{e\nu _{{\mathfrak {s}}}}{2}}y\cdot \prod _{{{\mathfrak {s}}}'<{{\mathfrak {s}}}}\left( {\text {red}}_{{\mathfrak {s}}}(x)-{\text {red}}_{{\mathfrak {s}}}({{\mathfrak {s}}}')\right) ^{-\lfloor \frac{|{{\mathfrak {s}}}'|}{2}\rfloor }\right) \in \widetilde{\Gamma }_{{\mathfrak {s}}}. \end{aligned}$$

Proof

(i)
Combine Theorem 8.6 (3) and Lemma A.6 (2).
(ii)
The case where there is a unique point at infinity is clear, so suppose otherwise. Theorem 6.9 gives the case ${{\mathfrak {s}}}= {{\mathfrak {s}}}^*$. For the case ${{\mathfrak {s}}}^*\ne {{\mathfrak {s}}}$, note that by considering the action of $\sigma $ on the components of the special fibre of the minimal regular model of C (cf. Theorem 8.6 (2)), one sees that the points at infinity on $\widetilde{\Gamma }_{\mathfrak {s}}$ are swapped by $\sigma $ if and only if the points at infinity on $\widetilde{\Gamma }_{{\mathfrak {s}}^*}$ are, if and only if $\epsilon _{\mathfrak {s}}(\sigma )=-1$.
(iii)
The description of the normalisation of $\Gamma _{{\mathfrak {s}}}$ is standard and the normalization map between $\tilde{\Gamma }_{{\mathfrak {s}}}$ and $\Gamma _{{\mathfrak {s}}}$ is given by
$$\begin{aligned} (x,y) \mapsto \left( x, y \cdot \prod _{\begin{array}{c} \text {twin }{{\mathfrak {t}}}<{{\mathfrak {s}}}\\ e\delta _{{\mathfrak {t}}}=\frac{1}{2} \end{array}}(x-{\text {red}}_{{\mathfrak {s}}}({\mathfrak {t}}))\right) \end{aligned}$$
(cf. A.2). The claimed formula now follows from Theorem 8.6 (4).

$\square $

Remark 8.9

We note that the formula for the action of $\sigma \in G_K$ becomes particularly simple in the following two settings:

(i)
if ${{\mathfrak {s}}}$ is a principal cluster and $\sigma \in I_{{\mathfrak {s}}}$ then $\sigma $ acts on $\widetilde{\Gamma }_{{\mathfrak {s}}}$ as the geometric automorphism
$$\begin{aligned} (x,y)\mapsto \left( \alpha _{{\mathfrak {s}}}(\sigma )x+\beta _{{\mathfrak {s}}}(\sigma ),\gamma _{{\mathfrak {s}}}(\sigma )y\right) . \end{aligned}$$
(ii)
Suppose ${{\mathfrak {s}}}$ is a principal cluster, $F=K$ (so that C/K is semistable) and that $\sigma \in G_{{\mathfrak {s}}}$. Then $\chi (\sigma )=1$, since $\pi _F\in K$ is fixed by $\sigma $. If (as is possible by Lemma B.1) we additionally pick our centre $z_{{\mathfrak {s}}}$ for ${{\mathfrak {s}}}$ to lie in $K_{{\mathfrak {s}}}$, the subfield of $K^{{\text {sep}}}$ fixed by $G_{{\mathfrak {s}}}$, then we also have $\beta _{{\mathfrak {s}}}(\sigma )=0$. Thus $\sigma $ acts on $\widetilde{\Gamma }_{{\mathfrak {s}}}(\bar{k})$ via
$$\begin{aligned} \sigma (x,y)=\left( \bar{\sigma }(x),\bar{\sigma }(y)\right) \end{aligned}$$
where $\bar{\sigma }$ denotes the automorphism of $\bar{k}$ induced by $\sigma $. (This is a manifestation of the fact that, when C/K is semistable, all our constructions are $\text{ Gal }(K^{\text{ nr }}/K)$-equivariant.)

9 Homology of the dual graph of the special fibre

The homology of the dual graph of the special fibre forms a part of the Galois representation of C and determines several arithmetic invariants (see (2.18), Theorem 2.20 and Lemma 2.22). In this section we give a description of the homology in terms of clusters. In the notation of Theorem 8.6, the basic observation is that every even non-übereven cluster ${{\mathfrak {s}}}$ starts off two chains $L_{{\mathfrak {s}}}^+$ and $L_{{\mathfrak {s}}}^-$ that eventually join back up (normally at $v_{P({{\mathfrak {s}}}^*)}$) to form a loop in $\Upsilon _C$.

Definition 9.1

Let C/F be a semistable hyperelliptic curve and $\Upsilon _C$ the dual graph of the special fibre of its minimal regular model over ${\mathcal {O}}_{F^{{\text {nr}}}}$ as in Theorem 8.6.

Let ${{\mathfrak {s}}}\ne \mathcal {R}$ be an even non-übereven cluster. If ${{\mathfrak {s}}}^*\ne \mathcal {R}$, we define the 1-chain $\ell _{{{\mathfrak {s}}}}$ in $C_1(\Upsilon _C,\mathbb {Z})$ to be the shortest path from $v_{P({{\mathfrak {s}}}^*)}$ to itself that passes through $v_{{\mathfrak {s}}}$ and goes through the minus part of the graph before the plus part of the graph. If ${{\mathfrak {s}}}^*=\mathcal {R}$, we define $\ell _{{{\mathfrak {s}}}}$ to be the shortest path from $v_\mathcal {R}^-$ to $v_\mathcal {R}^+$ that passes through $v_{{\mathfrak {s}}}$. Here

if ${{\mathfrak {s}}}={{\mathfrak {t}}}$ is a twin or $P({{\mathfrak {s}}}^*)={{\mathfrak {t}}}$ is a cotwin, we write $v_{{{\mathfrak {t}}}}$ for the point in the middle of $L_{{{\mathfrak {t}}}}$;
if ${{\mathfrak {s}}}^*=\mathcal {R}= {{\mathfrak {s}}}_1 \coprod {{\mathfrak {s}}}_2$ with ${{\mathfrak {s}}}_i$ both principal even, we write $v_{\mathcal {R}}^{+}$ and $v_{\mathcal {R}}^-$ for the points in the middle of $L_{{{\mathfrak {s}}}_1,{{\mathfrak {s}}}_2}^+$ and $L_{{{\mathfrak {s}}}_1,{{\mathfrak {s}}}_2}^-$;
if ${{\mathfrak {s}}}^*=\mathcal {R}= {{\mathfrak {t}}}\coprod {{\mathfrak {s}}}$ with ${{\mathfrak {t}}}$ a twin and ${{\mathfrak {s}}}$ principal even, we write $v_{\mathcal {R}}^{+}$ and $v_{\mathcal {R}}^-$ for the points on $L_{{{\mathfrak {t}}}}$ of distance $\delta _{{\mathfrak {s}}}$ from $v_{{{\mathfrak {s}}}}^+$ and $v_{{{\mathfrak {s}}}}^-$, respectively.

Remark 9.2

$\ell _{{{\mathfrak {s}}}}$ is a loop (cycle) in $\Upsilon _C$ unless ${{\mathfrak {s}}}^* = \mathcal {R}$. In the latter case, it is a “half loop” in the sense that if $\ell _{{{\mathfrak {s}}}}$, $\ell _{{{\mathfrak {s}}}'}$ are two half loops then $ \ell _{{{\mathfrak {s}}}} - \ell _{{{\mathfrak {s}}}'}$ is a loop.

Using the explicit description of the dual graph it is not hard to check that the loops described above form a basis for the homology of $\Upsilon _C$ and to track the action of Galois on them. This gives the following result on $H_1(\Upsilon _C,\mathbb {Z})$.

Theorem 9.3

Let C/K be a hyperelliptic curve and let F/K be a Galois extension over which C is semistable. Let $\Upsilon _C$ be the dual graph of the special fibre of the minimal regular model of C over ${\mathcal {O}}_{F^{{\text {nr}}}}$. Let A be the set of even non-übereven clusters excluding $\mathcal {R}$, and let B be the subset of clusters ${{\mathfrak {s}}}\in A$ such that ${{\mathfrak {s}}}^*=\mathcal {R}$. Then

(1)
${\text {rk}}_\mathbb {Z}( H_1(\Upsilon _C,\mathbb {Z})) = { \biggl \{\!\!\begin{array}{ll}\#A&{} \text{ if } \mathcal {R} \text{ is } \text{ not } \ddot{\text {u}}\text{ bereven },\\ \#A-1&{} \text{ otherwise. }\end{array}}$
(2)
$$\begin{aligned} H_1(\Upsilon _{C},{\mathbb {Z}}) = \Bigl \{ \sum _{{{\mathfrak {s}}}\in A} a_{{{\mathfrak {s}}}}\ell _{{{\mathfrak {s}}}} \Bigm |\> a_{{\mathfrak {s}}}\in \mathbb {Z},\>\>\sum _{{{\mathfrak {s}}}\in B} a_{{{\mathfrak {s}}}}=0\Bigr \}, \end{aligned}$$
(3)
the length pairing is given by
$$\begin{aligned} \langle \ell _{{{\mathfrak {s}}}_1},\ell _{{{\mathfrak {s}}}_2} \rangle =\left\{ \begin{array}{ll} 0&{} \text{ if } {{\mathfrak {s}}}^*_1 \ne {{\mathfrak {s}}}^*_2, \\ 2(d_{({{\mathfrak {s}}}_1\wedge {{\mathfrak {s}}}_2)}-d_{P({{\mathfrak {s}}}^*_1)})&{} \text{ if } {{\mathfrak {s}}}^*_1 ={{\mathfrak {s}}}^*_2 \ne \mathcal {R}, \\ 2(d_{({{\mathfrak {s}}}_1\wedge {{\mathfrak {s}}}_2)}-d_{\mathcal {R}})&{} \text{ if } {{\mathfrak {s}}}^*_1 ={{\mathfrak {s}}}^*_2 = \mathcal {R}.\\ \end{array} \right. \end{aligned}$$
(4)
for $\sigma \in G_K$,
$$\begin{aligned} \sigma (\ell _{{{\mathfrak {s}}}}) = \epsilon _{{{\mathfrak {s}}}}(\sigma ) \ell _{\sigma ({{\mathfrak {s}}})}. \end{aligned}$$

Proof

This follows from Theorem 6.9, which describes $\Upsilon _C$ together with the induced Galois action, and Theorem D.18 and Remark D.19, which describe the associated homology group. $\square $

Corollary 9.4

Let C/K be a semistable hyperelliptic curve. Let A be the set of even, non-übereven clusters excluding $\mathcal {R}$. Then the conductor exponent of Jac C is

$$\begin{aligned} n_C = \#A - { \Bigl \{\!\!\begin{array}{ll} \scriptstyle ~~1\!\!\!&{}\scriptstyle ~~\mathcal {R}~ \displaystyle \ddot{\text {u}}\text{ bereven, }\\ \scriptstyle ~~0\!\!\!&{}\scriptstyle \text{ otherwise. }\end{array}} \end{aligned}$$

Proof

Since $J={\text {Jac}}C$ is semistable,

$$\begin{aligned} n_C = \dim (V_{\ell }J)-\dim (V_{\ell }(J)^{I}) = {\text {rk}}_\mathbb {Z}(H_1(\Upsilon _C, \mathbb {Z})) \end{aligned}$$

by (2.18). The result follows from Theorem 9.3 (1). $\square $
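
The count in Theorem 9.3 (1) and Corollary 9.4 is purely combinatorial, so it can be sketched in a few lines of Python. The sketch below is illustrative only: the nested-tuple encoding of a cluster picture and all function names are assumptions made here, and it relies on the definitions from earlier in the paper, namely that a cluster is even if it has even size and übereven if all of its children are even.

```python
# Illustrative sketch (not from the paper): a cluster picture is encoded as a
# nested tuple whose leaves are root labels (strings); every inner tuple is a
# proper cluster and the outermost tuple is the top cluster R.

def size(c):
    """Number of roots contained in the cluster c."""
    return 1 if isinstance(c, str) else sum(size(child) for child in c)

def is_even(c):
    return size(c) % 2 == 0

def is_ubereven(c):
    """Assumed definition: a proper cluster all of whose children are even."""
    return not isinstance(c, str) and all(is_even(child) for child in c)

def proper_subclusters(c):
    """All proper clusters strictly contained in the proper cluster c."""
    for child in c:
        if not isinstance(child, str):
            yield child
            yield from proper_subclusters(child)

def homology_rank(R):
    """Rank of H_1 of the dual graph as in Theorem 9.3 (1); by Corollary 9.4
    this is the conductor exponent n_C when C/K is semistable."""
    A = [s for s in proper_subclusters(R) if is_even(s) and not is_ubereven(s)]
    return len(A) - (1 if is_ubereven(R) else 0)

three_twins = (("a", "b"), ("c", "d"), ("e", "f"))  # R is übereven
one_twin = (("a", "b"), "c", "d", "e", "f")         # R is not übereven
print(homology_rank(three_twins), homology_rank(one_twin))  # prints: 2 1
```

In the first example the three twins give $\#A=3$ and $\mathcal {R}$ is übereven, so the rank is 2; in the second the single twin gives rank 1.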

Notation 9.5

Let G be a group acting on a set X via the signed^{Footnote 15} permutation $(X,\varepsilon )$. For a ring R we write $R[X,\varepsilon ]$ for the corresponding signed permutation representation, and $R[X,\varepsilon ]_0$ for its sum zero part.

Corollary 9.6

Let F/K be an extension over which C is semistable. Let A be the set of even non-übereven clusters excluding $\mathcal {R}$, and let B be the subset of clusters ${{\mathfrak {s}}}\in A$ such that ${{\mathfrak {s}}}^*=\mathcal {R}$.

(1)
$$\begin{aligned} H_1(\Upsilon _{C},\mathbb {Z}) \>\>\cong \>\> \mathbb {Z}[A~{\setminus } ~B, \epsilon ] \>\> \oplus \>\>\mathbb {Z}[B,\epsilon ]_0, \end{aligned}$$
(2)
$$\begin{aligned} H^1_{\acute{\mathrm{e}}\text {t}}(C,\mathbb {Q}_l)_t \>\>\>\>\cong \>\>\>\> \mathbb {Q}_l[E, \epsilon ]\ominus \epsilon _\mathcal {R}\ \>\>\>\>\cong \>\>\>\> \bigoplus _{{\mathfrak {s}}}{\text {Ind}}_{{\text {Stab}}{{\mathfrak {s}}}}^{G_K} \!\epsilon _{{\mathfrak {s}}}\quad \ominus \epsilon _\mathcal {R}, \end{aligned}$$
where E is the set of even non-übereven clusters and the sum is taken over representatives of $G_K$-orbits on E.

Proof

(1) Follows directly from parts (2) and (4) of Theorem 9.3.

(2) Tensoring (1) with $\mathbb {Q}_l$ and using (2.18) we get

$$\begin{aligned} H^1_{\acute{\mathrm{e}}\text {t}}(C,\mathbb {Q}_l)_t \>\>\>\>&\cong \>\>\>\> \mathbb {Q}_l[A{\setminus } B, \epsilon ] \>\> \oplus \>\>\mathbb {Q}_l[B,\epsilon ]_0\\&\cong \>\>\>\> \mathbb {Q}_l[A{\setminus } B, \epsilon ] \>\> \oplus \>\>\ { \Bigl \{\!\!\begin{array}{ll} \scriptstyle 0\!\!\!&{}\scriptstyle B = \emptyset \\ \scriptstyle \mathbb {Q}_l[B,\epsilon ]\ominus \epsilon _\mathcal {R}\!\!\!&{}\scriptstyle B \ne \emptyset \end{array}}\\&\cong \>\>\>\> \mathbb {Q}_l[A, \epsilon ] \>\> \ominus \>\>\ { \Bigl \{\!\!\begin{array}{ll} \scriptstyle 0\!\!\!&{}\scriptstyle B = \emptyset ,\\ \scriptstyle \epsilon _\mathcal {R}\!\!\!&{}\scriptstyle B \ne \emptyset \end{array}}\\&\cong \>\>\>\> \mathbb {Q}_l[E, \epsilon ] \>\> \ominus \>\>\ \epsilon _\mathcal {R}, \end{aligned}$$

where E is the set of even non-übereven clusters and the last isomorphism uses the fact that B is empty if and only if $\mathcal {R}$ is not übereven.

Observe that when G acts transitively on X and $(X, \varepsilon )$ is a signed permutation then $\mathbb {Q}_l[X, \varepsilon ] \cong {\text {Ind}}_{{\text {Stab}}_t}^G\varepsilon _t$ for any point $t \in X$; here $\sigma (t) = \varepsilon _t(\sigma ) t$ for $\sigma \in {\text {Stab}}_t$. Hence

$$\begin{aligned} H^1_{\acute{\mathrm{e}}\text {t}}(C,\mathbb {Q}_l)_t \>\>\>\>\cong \>\>\>\> \bigoplus _{{\mathfrak {s}}}{\text {Ind}}_{{\text {Stab}}{{\mathfrak {s}}}}^{G_K} \!\epsilon _{{\mathfrak {s}}}\quad \ominus \epsilon _\mathcal {R}, \end{aligned}$$

where the sum is taken over representatives of $G_K$-orbits on E. $\square $
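
The observation used in the proof, that a transitive signed permutation representation is induced from the sign character of a point stabiliser, can be checked numerically in small cases. The following Python sketch is an illustration under assumptions made here: a signed permutation is encoded as a pair of lists, only cyclic groups are treated, and the dimension of the invariants is computed as the average of the traces.

```python
# Illustrative sketch (assumed encoding, not from the paper): a signed
# permutation of X = {0, ..., n-1} is a pair (perm, sign) sending the basis
# vector e_x to sign[x] * e_{perm[x]}.

def compose(g, h):
    """The signed permutation 'g after h'."""
    (gp, gs), (hp, hs) = g, h
    perm = [gp[hp[x]] for x in range(len(hp))]
    sign = [hs[x] * gs[hp[x]] for x in range(len(hp))]
    return perm, sign

def trace(g):
    """Trace of the matrix of g: signed count of fixed points."""
    perm, sign = g
    return sum(sign[x] for x in range(len(perm)) if perm[x] == x)

def invariants_dim(g):
    """Dimension of the invariants of the cyclic group generated by g,
    computed as the average of the traces of its elements."""
    n = len(g[0])
    identity = (list(range(n)), [1] * n)
    total, order, power = trace(identity), 1, g
    while power != identity:
        total += trace(power)
        order += 1
        power = compose(g, power)
    return total // order

swap_plain = ([1, 0], [1, 1])     # e_0 -> e_1, e_1 -> e_0
swap_twisted = ([1, 0], [1, -1])  # e_0 -> e_1, e_1 -> -e_0
print(invariants_dim(swap_plain), invariants_dim(swap_twisted))  # prints: 1 0
```

In the twisted example the generator has order 4 and its square stabilises each point while acting by $-1$, so the sign character of the stabiliser is non-trivial and, by Frobenius reciprocity, there are no invariants.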

10 Galois representation

Having obtained an explicit description of the special fibre of the minimal regular model of C over the field where it becomes semistable, together with the action of $G_K$, we are now in a position to extract the action of $G_K$ on $H^1_{\acute{\mathrm{e}}\text {t}}(C)=H^1_{\acute{\mathrm{e}}\text {t}}(C_{{\bar{K}}},\mathbb {Q}_l)$.

Fix a prime $l\ne {{\,\mathrm{char}\,}}k$. As in Sect. 8, we take

C/K a hyperelliptic curve;
F/K a finite Galois extension over which C becomes semistable;
$\Gamma _{{\mathfrak {s}}}$ components of the special fibre $\mathcal {C}_{\min ,{\bar{k}}}$ of the minimal regular model of C over ${\mathcal {O}}_{F^{{\text {nr}}}}$ (see Definition 8.5);
Galois action of (2.17); it induces the action of the stabiliser $G_{{\mathfrak {s}}}$ on $\Gamma _{{\mathfrak {s}}}$, on its normalisation ${\tilde{\Gamma }}_{{\mathfrak {s}}}$ and on the étale cohomology group $H^1_{\acute{\mathrm{e}}\text {t}}({\tilde{\Gamma }}_{{\mathfrak {s}}})$.

Theorem 10.1

Let C/K be a hyperelliptic curve. Let $H^1_{\acute{\mathrm{e}}\text {t}}(C)=H^1_{\acute{\mathrm{e}}\text {t}}(C)_{ab}\oplus H^1_{\acute{\mathrm{e}}\text {t}}(C)_{t}\otimes {\text {Sp}}_2$ be the decomposition into ‘abelian’ and ‘toric’ parts. Then

$$\begin{aligned} \begin{array}{llllllllllll} H^1_{\acute{\mathrm{e}}\text {t}}(C)_{t} &{}=&{}\bigoplus _{{\mathfrak {s}}}{\text {Ind}}_{G_{{\mathfrak {s}}}}^{G_K} \!\epsilon _{{\mathfrak {s}}}\quad \ominus \epsilon _\mathcal {R},\\ H^1_{\acute{\mathrm{e}}\text {t}}(C)_{ab}&{}=&{}\bigoplus _{{\mathfrak {s}}}{\text {Ind}}_{G_{{\mathfrak {s}}}}^{G_K} H^1_{\acute{\mathrm{e}}\text {t}}({\tilde{\Gamma }}_{{\mathfrak {s}}}). \end{array} \end{aligned}$$

The first sum is taken over representatives of $G_K$-orbits of even non-übereven clusters. The second sum is taken over representatives of $G_K$-orbits of principal non-übereven clusters. For every such cluster ${{\mathfrak {s}}}$, there is an isomorphism of $I_{{\mathfrak {s}}}$-modules

$$\begin{aligned} H^1_{\acute{\mathrm{e}}\text {t}}({\tilde{\Gamma }}_{{\mathfrak {s}}}) \>\>\cong \>\> {\tilde{\gamma }}_{{\mathfrak {s}}}\otimes (\mathbb {Q}_l[{\tilde{{{\mathfrak {s}}}}}] \ominus {\varvec{1}}) \quad \ominus \epsilon _{{\mathfrak {s}}}, \end{aligned}$$

where ${\tilde{\gamma }}_{{\mathfrak {s}}}:I_{{\mathfrak {s}}}\rightarrow {\bar{\mathbb {Q}}}_l^\times $ is any character whose order is the prime-to-p part of the denominator of $|I_K/I_{{\mathfrak {s}}}|\,{\tilde{\lambda }}_{{\mathfrak {s}}}$.

Proof

By Theorem 2.18 we have the decomposition and the claim regarding the abelian part. The statement about the toric part is Corollary 9.6(2). The last claim is [16, Thm. 1.2] combined with Theorem 8.8(i) and Remark 8.3; note that [16, Thm. 1.2] is phrased for $\mathbb {C}$- rather than $\mathbb {Q}_l$-representations, but that does not affect the result. $\square $

Remark 10.2

When C/K is semistable the full action of $G_{{\mathfrak {s}}}$ (rather than just that of $I_{{\mathfrak {s}}}$) on $H^1_{\acute{\mathrm{e}}\text {t}}({\tilde{\Gamma }}_{{\mathfrak {s}}})$ may be explicitly determined, as we now explain. For a proper cluster ${{\mathfrak {s}}}$, write $K_{{\mathfrak {s}}}$ for the subfield of $K^{{\text {sep}}}$ fixed by $G_{{\mathfrak {s}}}$ and denote by $k_{{\mathfrak {s}}}$ its residue field. Suppose (as is possible by Lemma B.1) that for each proper cluster ${{\mathfrak {s}}}$ we have fixed our choice of centre $z_{{\mathfrak {s}}}$ to lie in $K_{{\mathfrak {s}}}$. Then for any principal cluster ${{\mathfrak {s}}}$ the coefficients of

$$\begin{aligned} \widetilde{\Gamma }_{\mathfrak {s}}:y^2=c_{\mathfrak {s}}\prod _{\text{ odd } {\mathfrak {o}}<{{\mathfrak {s}}}}(x-\text{ red}_{\mathfrak {s}}({\mathfrak {o}})) \end{aligned}$$

lie in $k_{{\mathfrak {s}}}$. Moreover, by Remark 8.9 (ii) the action of $G_{{\mathfrak {s}}}$ on $\widetilde{\Gamma }_{\mathfrak {s}}(\bar{k})$ (arising from (2.14)) is simply given by $(x,y)\mapsto (\bar{\sigma }(x),\bar{\sigma }(y))$ where $\bar{\sigma }$ denotes the automorphism of $\bar{k}$ induced by $\sigma $ (whence $G_{{\mathfrak {s}}}$ acts through $\text{ Gal }(K_{{\mathfrak {s}}}^{\text{ nr }}/K_{{\mathfrak {s}}})$). In particular, upon identifying $\text{ Gal }(K_{{\mathfrak {s}}}^{\text{ nr }}/K_{{\mathfrak {s}}})$ with $\text{ Gal }(\bar{k}/k_{{\mathfrak {s}}})$, the induced action on $H^1_{\acute{\mathrm{e}}\text {t}}({\tilde{\Gamma }}_{{\mathfrak {s}}})$ is precisely the usual action of $\text{ Gal }(\bar{k}/k_{{\mathfrak {s}}})$ on $H^1_{\acute{\mathrm{e}}\text {t}}({\tilde{\Gamma }}_{{\mathfrak {s}}})$ coming from viewing $\widetilde{\Gamma }_{{\mathfrak {s}}}$ as a curve defined over $k_{{\mathfrak {s}}}$ given by the above formula. One may then recover the Frobenius eigenvalues for this action on $H^1_{\acute{\mathrm{e}}\text {t}}({\tilde{\Gamma }}_{{\mathfrak {s}}})$ from point counts on $\widetilde{\Gamma }_{{\mathfrak {s}}}$ over extensions of $k_{{\mathfrak {s}}}$ in the usual way.
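
As an illustration of the last sentence of the remark, the following Python sketch counts points on a curve $y^2=f(x)$ by brute force. It is a simplification made here, not the paper's method: it only handles prime fields $\mathbb {F}_p$ with p odd, so it recovers the Frobenius eigenvalues only in genus 1, where they are the roots of $T^2-aT+p$ with $a=p+1-N$. The function name and the coefficient convention are assumptions.

```python
# Illustrative sketch: brute-force point count on y^2 = f(x) over a prime
# field F_p with p odd. Restricting to prime fields is a simplification made
# here; coeffs lists the coefficients of f from the constant term upwards.

def count_points(coeffs, p):
    """Number of F_p-points on the smooth projective model of y^2 = f(x),
    assuming f is squarefree modulo p with non-zero leading coefficient."""
    # roots_of[v] = number of y in F_p with y^2 = v
    roots_of = [0] * p
    for y in range(p):
        roots_of[y * y % p] += 1
    affine = 0
    for x in range(p):
        value = sum(c * pow(x, i, p) for i, c in enumerate(coeffs)) % p
        affine += roots_of[value]
    degree = len(coeffs) - 1
    if degree % 2 == 1:
        at_infinity = 1  # a single rational point at infinity
    else:
        # two rational points if the leading coefficient is a square, else none
        at_infinity = roots_of[coeffs[-1] % p]
    return affine + at_infinity

# y^2 = x^3 - x over F_5: a genus 1 example.
p = 5
N = count_points([0, -1, 0, 1], p)
a = p + 1 - N  # trace of Frobenius; eigenvalues are the roots of T^2 - a*T + p
print(N, a)  # prints: 8 -2
```

For higher genus one would also need the counts over $\mathbb {F}_{p^i}$ for $i\le g$, which this sketch does not implement.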

Theorem 10.3

Let C/K be a hyperelliptic curve. Write ${\text {Jac}}C$ for its Jacobian. Then

(1)
C is semistable $\Leftrightarrow $ ${\text {Jac}}C$ is semistable $\Leftrightarrow $ C/K satisfies the semistability criterion.
(2)
C has good reduction $\Leftrightarrow $ $K(\mathcal {R})/K$ is unramified, there are no proper clusters of size $<2g+1$ and $\nu _{{\mathfrak {s}}}\in 2\mathbb {Z}$ for the unique principal cluster ${{\mathfrak {s}}}$.
(3)
C has potentially good reduction $\Leftrightarrow $ there are no proper clusters of size $<2g+1$.
(4)
C is tame $\Leftrightarrow $ ${\text {Jac}}C$ is tame $\Leftrightarrow $ $K(\mathcal {R})/K$ is tame.
(5)
${\text {Jac}}C$ has good reduction $\Leftrightarrow $ $K(\mathcal {R})/K$ is unramified, all clusters ${{\mathfrak {s}}}\ne \mathcal {R}$ are odd, and principal clusters have $\nu _{{\mathfrak {s}}}\in 2\mathbb {Z}$.
(6)
${\text {Jac}}C$ has potentially good reduction $\Leftrightarrow $ all clusters ${{\mathfrak {s}}}\ne \mathcal {R}$ are odd.
(7)
The potential toric rank of ${\text {Jac}}C$ equals the number of even non-übereven clusters excluding $\mathcal {R}$, less 1 if $\mathcal {R}$ is übereven.
(8)
${\text {Jac}}C$ has potentially totally toric reduction $\Leftrightarrow $ every cluster has at most two odd children.

Proof

(1) As $g\ge 2$, C is semistable if and only if its Jacobian is [13, Thm. 1.2]. The equivalence with the semistability criterion is proved in Theorem 7.1.

(2),(3) Using that good reduction is in particular semistable, these follow from (1) and Theorem 8.6 which gives the description of the special fibre for semistable curves in terms of principal clusters.

(4) Follows directly from (1).

(5),(6) Recall that ${\text {Jac}}C$ has good reduction if and only if inertia $I_K$ acts trivially on the l-adic Tate module $V_l{\text {Jac}}C$ (for some $l\ne p$), by the Néron-Ogg-Shafarevich criterion [35, Sect. 2]. Now apply Theorem 10.1 that gives the inertia action on $V_l{\text {Jac}}C$.

(7),(8) Apply Theorem 10.1. For (8), note that the condition ‘at most two odd children’ is equivalent to all components $\Gamma _{{\mathfrak {s}}}$ from principal clusters having genus 0. $\square $
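
Parts (3), (6) and (8) of Theorem 10.3 depend only on the sizes and nesting of clusters, so they can be read off mechanically. The Python sketch below is a hedged illustration: the nested-tuple encoding of the cluster picture, the function names and the dictionary keys are all assumptions made here.

```python
# Illustrative sketch (assumed encoding): nested tuples with root labels
# (strings) as leaves; each inner tuple is a proper cluster, the outermost
# tuple is the top cluster R.

def size(c):
    return 1 if isinstance(c, str) else sum(size(child) for child in c)

def proper_clusters(c):
    """The proper cluster c together with all proper clusters inside it."""
    yield c
    for child in c:
        if not isinstance(child, str):
            yield from proper_clusters(child)

def potential_reduction(R):
    g = (size(R) - 1) // 2            # |R| = 2g+1 or 2g+2
    clusters = list(proper_clusters(R))
    below_R = clusters[1:]            # proper clusters other than R
    return {
        # Theorem 10.3 (3): no proper clusters of size < 2g+1
        "C potentially good": all(size(s) >= 2 * g + 1 for s in clusters),
        # Theorem 10.3 (6): singletons are odd, so only proper clusters matter
        "Jac potentially good": all(size(s) % 2 == 1 for s in below_R),
        # Theorem 10.3 (8): every cluster has at most two odd children
        "Jac potentially totally toric": all(
            sum(size(child) % 2 for child in s) <= 2 for s in clusters
        ),
    }

three_twins = (("a", "b"), ("c", "d"), ("e", "f"))
print(potential_reduction(three_twins)["Jac potentially totally toric"])  # prints: True
```

For instance, a genus 2 picture with a single cluster of size 3 inside $\mathcal {R}$ of size 6 gives potentially good reduction for the Jacobian but not for the curve.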

11 Conductor

In this section we derive a formula for the conductor of a hyperelliptic curve C/K in terms of clusters (Theorem 11.3).

Lemma 11.1

Let k be a field of characteristic $\ne 2$, and C/k a hyperelliptic curve given by

$$\begin{aligned} Y^2 = c\prod _{r\in R}(X-r), \qquad R\subset {\bar{k}}. \end{aligned}$$

Let $G\subset {\text {Aut}}_k C$ be an affine group of automorphisms acting as

$$\begin{aligned} g(X) = \alpha (g) X+\beta (g), \qquad g(Y)=\gamma (g)\,Y \qquad \qquad (g\in G). \end{aligned}$$

Let ${\tilde{\gamma }}: G\rightarrow {\bar{\mathbb {Q}}}_l^\times $ be a character with $\ker {\tilde{\gamma }}=\ker \gamma $. We have:

If ${\text {ord}}_2({\text {order}}(\gamma ))>{\text {ord}}_2({\text {order}}(\alpha ))$ then ${\tilde{\gamma }} \otimes (\mathbb {Q}_l[R] \ominus {\varvec{1}})$ has trivial G-invariants.
If ${\text {ord}}_2({\text {order}}(\gamma ))\le {\text {ord}}_2({\text {order}}(\alpha ))$ then
$$\begin{aligned} {\tilde{\gamma }} \otimes (\mathbb {Q}_l[R] \ominus {\varvec{1}}) \>\cong \> \mathbb {Q}_l[R] \ominus { \biggl \{\!\!\begin{array}{ll}{\varvec{1}}&{}\text {if }\gamma \text { has odd order,}\\ {\tilde{\gamma }}&{}\text {if }\gamma \text { has even order,}\end{array}} \end{aligned}$$
as G-modules.
If $|R|\le 2$, then ${\tilde{\gamma }} \otimes (\mathbb {Q}_l[R] \ominus {\varvec{1}})\oplus {\varvec{1}}$ is the permutation representation of G on the (one or two) points at infinity of C.

Proof

If ${\text {ord}}_2({\text {order}}(\gamma ))>{\text {ord}}_2({\text {order}}(\alpha ))$, then $\mathbb {Q}_l[R] \ominus {\varvec{1}}$ contains no 1-dimensional characters of order equal to the order of $\gamma $. Therefore ${\tilde{\gamma }} \otimes (\mathbb {Q}_l[R] \ominus {\varvec{1}})$ has no G-invariants.

Suppose ${\text {ord}}_2({\text {order}}(\gamma ))\le {\text {ord}}_2({\text {order}}(\alpha ))$. Then we are in the setup of [16, Thm 4.1], and by [16, Lemma 4.4 (2)] we have

$$\begin{aligned} {\tilde{\gamma }} \otimes (\mathbb {Q}_l[R] \ominus {\varvec{1}}) \>\cong \> \mathbb {Q}_l[R] \ominus { \biggl \{\!\!\begin{array}{ll}{\varvec{1}}&{}\text {if }\mathbb {Q}_l[R]\text { contains an irregular orbit of }G,\\ {\tilde{\gamma }}&{}\text {if }\mathbb {Q}_l[R]\cong \mathbb {Q}_l[G]^{\oplus r}\text { for some }r.\end{array}} \end{aligned}$$

In the first (irregular orbit) case, $\gamma ^2=\alpha $ by [16, Prop. 2.2 (5b)], and it follows that $\gamma $ has odd order. In the second (regular) case, $\gamma ^2={\varvec{1}}$ by [16, Prop. 2.2 (5a)]. Hence, either $\gamma ={\varvec{1}}$ and the claim is trivial, or $\gamma $ has even order.

The last claim follows from [16, Thm 4.1], since C has genus 0 and trivial $H^1_{\acute{\mathrm{e}}\text {t}}$ in this case. $\square $

Now we go back to the setting of a hyperelliptic curve C/K. Recall from Definition 8.2 that we defined $\tilde{\lambda }_{{\mathfrak {s}}}$ and characters $\alpha _{{\mathfrak {s}}}, \beta _{{\mathfrak {s}}}$ and $\gamma _{{\mathfrak {s}}}$ for all principal clusters ${{\mathfrak {s}}}$. In what follows we extend these definitions to all proper clusters ${{\mathfrak {s}}}$ by the same formulae.^{Footnote 16}

Lemma 11.2

Let ${{\mathfrak {s}}}$ be a proper non-übereven cluster, and ${\tilde{\gamma }}_{{\mathfrak {s}}}: I_{{\mathfrak {s}}}\rightarrow {\bar{\mathbb {Q}}}_l^\times $ a character with $\ker {\tilde{\gamma }}_{{\mathfrak {s}}}=\ker \gamma _{{\mathfrak {s}}}$.

If ${\text {ord}}_2{\text {denom}}(|I/I_{{\mathfrak {s}}}|{\tilde{\lambda }}_{{\mathfrak {s}}})>{\text {ord}}_2{\text {denom}}(|I/I_{{\mathfrak {s}}}|d_{{\mathfrak {s}}})$ then ${\tilde{\gamma }}_{{\mathfrak {s}}}\otimes (\mathbb {Q}_l[{\tilde{{{\mathfrak {s}}}}}] \ominus {\varvec{1}})$ has trivial $I_{{\mathfrak {s}}}$-invariants.
If ${\text {ord}}_2{\text {denom}}(|I/I_{{\mathfrak {s}}}|{\tilde{\lambda }}_{{\mathfrak {s}}})\le {\text {ord}}_2{\text {denom}}(|I/I_{{\mathfrak {s}}}|d_{{\mathfrak {s}}})$ then
$$\begin{aligned} {\tilde{\gamma }}_{{\mathfrak {s}}}\otimes (\mathbb {Q}_l[{\tilde{{{\mathfrak {s}}}}}] \ominus {\varvec{1}}) \>\cong \> \mathbb {Q}_l[{\tilde{{{\mathfrak {s}}}}}] \ominus { \biggl \{\!\!\begin{array}{ll}{\varvec{1}}&{}\text {if }{\text {ord}}_2(|I/I_{{\mathfrak {s}}}|{\tilde{\lambda }}_{{\mathfrak {s}}})\ge 0,\\ {\tilde{\gamma }}_{{\mathfrak {s}}}&{}\text {if }{\text {ord}}_2(|I/I_{{\mathfrak {s}}}|{\tilde{\lambda }}_{{\mathfrak {s}}})<0,\end{array}} \end{aligned}$$
as $I_{{\mathfrak {s}}}$-modules.
If $|{\tilde{{{\mathfrak {s}}}}}|=2$ and ${{\mathfrak {s}}}$ is not a cotwin, then ${\tilde{\gamma }}_{{\mathfrak {s}}}\otimes (\mathbb {Q}_l[{\tilde{{{\mathfrak {s}}}}}] \ominus {\varvec{1}}) = \epsilon _{{\mathfrak {s}}}$.
If $\mathcal {R}$ is a cotwin of size $2g\!+\!2$, then ${\tilde{\gamma }}_\mathcal {R}\otimes (\mathbb {Q}_l[\mathcal {R}_0] \ominus {\varvec{1}}) = \epsilon ^0_\mathcal {R}$, where $\epsilon ^0_{\mathcal {R}}(\sigma )=\frac{\sigma (\sqrt{c_f})}{\sqrt{c_f}}$ for $\sigma \in I_K$.

Proof

Let F be a finite Galois extension over which C becomes semistable, and such that $\mathcal {R}\subset F$. In what follows we consider all representations as those of $I_{{\mathfrak {s}}}$. First let ${{\mathfrak {s}}}$ be a principal cluster. By Theorem 8.8, we have the curve

$$\begin{aligned} {\tilde{\Gamma }}_{{\mathfrak {s}}}:y^2=c_{{\mathfrak {s}}}\prod _{\text{ odd } {{\mathfrak {s}}}'<{{\mathfrak {s}}}}(x-\text{ red}_{{\mathfrak {s}}}({{\mathfrak {s}}}')) \end{aligned}$$

over $\bar{k}$ together with $I_{{\mathfrak {s}}}$-action

$$\begin{aligned} g(X) = \alpha _{{\mathfrak {s}}}(g) X+\beta _{{\mathfrak {s}}}(g), \qquad g(Y)=\gamma _{{\mathfrak {s}}}(g)\,Y \qquad \qquad (g\in I_{{\mathfrak {s}}}). \end{aligned}$$

Observe that ${\text {order}}(\gamma _{{\mathfrak {s}}})$ is the prime-to-p part of the denominator of $|I/I_{{\mathfrak {s}}}|{\tilde{\lambda }}_{{\mathfrak {s}}}$, and ${\text {order}}(\alpha _{{\mathfrak {s}}})$ is the prime-to-p part of the denominator of $|I/I_{{\mathfrak {s}}}|d_{{\mathfrak {s}}}$. The claim follows from Lemma 11.1 and Theorem 8.8(ii).

When ${{\mathfrak {s}}}$ is not principal we argue similarly. Since $\mathcal {R}\subset F$ the disc $D({{\mathfrak {s}}})$ (the minimal disc cutting out ${{\mathfrak {s}}}$, see Notation 4.1) is valid in the sense of Definition 4.4. In particular this disc contributes a component $\Gamma _{D({{\mathfrak {s}}})}$ to the special fibre of the minimal regular model of C over ${\mathcal {O}}_{F^{{\text {nr}}}}$ (see Proposition 5.20; if $e\nu _{{\mathfrak {s}}}$ is not even, enlarge F to make it so). The normalisation of this component is given by the same equation as for ${{\mathfrak {s}}}$ principal, as is the action of $I_{{\mathfrak {s}}}$ (see Proposition 6.2). Now we conclude as before; that $\epsilon _{{\mathfrak {s}}}$ corresponds to the action on the points at infinity follows from the explicit action of $I_{{\mathfrak {s}}}$ on the dual graph $\Upsilon _C$ in Theorem 8.6(2) in the non-cotwin case, and by the formula for $\Gamma _{D(\mathcal {R})}$ in the cotwin case. $\square $

Theorem 11.3

(Conductor) Let C/K be a hyperelliptic curve. Decompose the conductor exponent of ${\text {Jac}}C$ into a tame part and a wild part, $n_C = n_{C,\text {tame}}+n_{C,\text {wild}}$. Then

$$\begin{aligned} n_{C,\text {wild}} = \sum _{r\in R/\!/G_K} v_K(\Delta _{K(r)/K})-[K(r):K]+f_{K(r)/K}, \end{aligned}$$

and $n_{C,\text {tame}} = 2g-\dim H^1_{\acute{\mathrm{e}}\text {t}}(C)^{I_K}$ with

$$\begin{aligned} \begin{array}{llllllllllll} \dim H^1_{\acute{\mathrm{e}}\text {t}}(C)^{I_K} &{}=&{} \#\bigl \{{{\mathfrak {s}}}\subsetneq \mathcal {R}\text { odd} \,\bigm |\, \xi _{P({{\mathfrak {s}}})}({\tilde{\lambda }}_{P({{\mathfrak {s}}})})\le \xi _{P({{\mathfrak {s}}})}(d_{P({{\mathfrak {s}}})})\bigr \}/{I_K} \\ &{}-&{} \#\bigl \{{{\mathfrak {s}}}\text { proper non-}\ddot{\text {u}}\text {bereven} \,\bigm |\, \xi _{{{\mathfrak {s}}}}({\tilde{\lambda }}_{{{\mathfrak {s}}}})=0 \bigr \}/{I_K} \\ &{}-&{} { \Bigl \{\!\!\begin{array}{ll} \scriptstyle ~~1\!\!\!&{}\scriptstyle \text {~~if }|\mathcal {R}|\text { is even and }v(c_f)\text { is even,}\\ \scriptstyle ~~0\!\!\!&{}\scriptstyle \text {~~otherwise;}\end{array}} \end{array} \end{aligned}$$

here $\xi _{{\mathfrak {s}}}(a)={\text {ord}}_2(b)$ where b is the denominator of $|I_K/I_{{{\mathfrak {s}}}}|a$, with $\xi _{{\mathfrak {s}}}(0)=0$.
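
To make the quantity $\xi _{{\mathfrak {s}}}$ and the shape of the tame formula concrete, here is a minimal Python sketch. It is an illustration under assumptions: the values of ${\tilde{\lambda }}_{{\mathfrak {s}}}$, $d_{{\mathfrak {s}}}$ and $|I_K/I_{{\mathfrak {s}}}|$ are taken as given inputs rather than computed from a curve, the orbits are supplied by hand, and the function names are hypothetical.

```python
from fractions import Fraction

def ord2(n):
    """2-adic valuation of a positive integer n."""
    k = 0
    while n % 2 == 0:
        n //= 2
        k += 1
    return k

def xi(index, a):
    """xi_s(a) = ord_2 of the denominator of |I_K/I_s| * a, where
    index = |I_K/I_s|; for a = 0 the denominator is 1, so xi_s(0) = 0."""
    return ord2((index * Fraction(a)).denominator)

def tame_invariants_dim(odd_orbits, proper_orbits, R_size, v_cf):
    """dim H^1(C)^{I_K} following the displayed formula of Theorem 11.3.
    odd_orbits: for each I_K-orbit of odd clusters s strictly inside R, the
    triple (index, lambda, d) attached to the parent P(s);
    proper_orbits: for each I_K-orbit of proper non-übereven clusters s, the
    pair (index, lambda) attached to s."""
    first = sum(1 for (i, lam, d) in odd_orbits if xi(i, lam) <= xi(i, d))
    second = sum(1 for (i, lam) in proper_orbits if xi(i, lam) == 0)
    third = 1 if R_size % 2 == 0 and v_cf % 2 == 0 else 0
    return first - second - third

print(xi(1, Fraction(1, 2)), xi(2, Fraction(1, 2)))  # prints: 1 0
# Hand-picked illustrative data: |R| = 5 with no proper subclusters, trivial
# inertia action, d_R = 0 and an integral value of lambda for R.
print(tame_invariants_dim([(1, 0, 0)] * 5, [(1, 0)], 5, 0))  # prints: 4
```

With these inputs the dimension is $4=2g$ for $g=2$, so $n_{C,\text {tame}}=0$, as one would expect when inertia acts trivially.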

Proof

(Tame part) Let $\epsilon ^0_{\mathcal {R}}(\sigma )=\frac{\sigma (\sqrt{c_f})}{\sqrt{c_f}}$ for $\sigma \in I_K$ if $\mathcal {R}$ is even, and the zero representation if $\mathcal {R}$ is odd. Note that $\epsilon _\mathcal {R}^0$ is the same as $\epsilon _\mathcal {R}$, except when $\mathcal {R}$ is a cotwin.

By Theorem 1.20 we have to compute the dimension of inertia invariants on

$$\begin{aligned}&H^1_{\acute{\mathrm{e}}\text {t}}(C)_{ab}\oplus H^1_{\acute{\mathrm{e}}\text {t}}(C)_t\otimes {\text {Sp}}_2 \\&\quad = \bigoplus _{{{\mathfrak {s}}}\in P} {\text {Ind}}_{I_{{\mathfrak {s}}}}^{I_K} ({\tilde{\gamma }}_{{\mathfrak {s}}}\otimes (\mathbb {Q}_l[{\tilde{{{\mathfrak {s}}}}}] \ominus {\varvec{1}})\ominus \epsilon _{{\mathfrak {s}}}) \oplus ({\text {Sp}}_2 \otimes (\bigoplus _{{{\mathfrak {s}}}\in E}{\text {Ind}}_{I_{{\mathfrak {s}}}}^{I_K} \!\epsilon _{{\mathfrak {s}}}\quad \ominus \epsilon _\mathcal {R})), \end{aligned}$$

where P and E are sets of $I_K$-orbit representatives on principal non-übereven clusters and on even non-übereven clusters, respectively.

By Lemma 11.2 we can expand the first sum to

$$\begin{aligned} H^1_{\acute{\mathrm{e}}\text {t}}(C)_{ab} = \bigoplus _{{{\mathfrak {s}}}\in T} {\text {Ind}}_{I_{{\mathfrak {s}}}}^{I_K} ({\tilde{\gamma }}_{{\mathfrak {s}}}\otimes (\mathbb {Q}_l[{\tilde{{{\mathfrak {s}}}}}] \ominus {\varvec{1}})\ominus \epsilon _{{\mathfrak {s}}}) \oplus \epsilon _\mathcal {R}\ominus \epsilon ^0_\mathcal {R}, \end{aligned}$$

where T is a set of representatives of $I_K$-orbits on all proper non-übereven clusters excluding cotwins of odd size. Since $\epsilon _{{\mathfrak {s}}}$ is the zero representation for odd clusters (except cotwins), we can similarly expand the second sum to

$$\begin{aligned} H^1_{\acute{\mathrm{e}}\text {t}}(C)_{t}\otimes {\text {Sp}}_2 = {\text {Sp}}_2 \otimes (\bigoplus _{{{\mathfrak {s}}}\in T}{\text {Ind}}_{I_{{\mathfrak {s}}}}^{I_K} \!\epsilon _{{\mathfrak {s}}}\quad \ominus \epsilon _\mathcal {R}). \end{aligned}$$

Taking $I_F$ invariants on $H^1_{\acute{\mathrm{e}}\text {t}}(C)_{ab}\oplus H^1_{\acute{\mathrm{e}}\text {t}}(C)_t\otimes {\text {Sp}}_2$ replaces ${\text {Sp}}_2$ by ${\varvec{1}}$, so we are left with $I_{F/K}$-invariants on

$$\begin{aligned} \bigoplus _{{{\mathfrak {s}}}\in T} {\text {Ind}}_{I_{{\mathfrak {s}}}}^{I_K} ({\tilde{\gamma }}_{{\mathfrak {s}}}\otimes (\mathbb {Q}_l[{\tilde{{{\mathfrak {s}}}}}] \ominus {\varvec{1}})) \quad \ominus \epsilon ^0_\mathcal {R}. \end{aligned}$$

Note that if ${{\mathfrak {s}}}$ is a cotwin of odd size then $\mathbb {Q}_l[{\tilde{{{\mathfrak {s}}}}}]$ is the trivial representation so this expression is the same as

$$\begin{aligned} \bigoplus _{{{\mathfrak {s}}}\in S} {\text {Ind}}_{I_{{\mathfrak {s}}}}^{I_K} ({\tilde{\gamma }}_{{\mathfrak {s}}}\otimes (\mathbb {Q}_l[{\tilde{{{\mathfrak {s}}}}}] \ominus {\varvec{1}})) \quad \ominus \epsilon ^0_\mathcal {R}, \end{aligned}$$

where S is a set of representatives of $I_K$-orbits on proper non-übereven clusters. By Frobenius reciprocity, we find

$$\begin{aligned} \dim H^1_{\acute{\mathrm{e}}\text {t}}(C)^{I_K} = \sum _{{{\mathfrak {s}}}\in S}\> \langle {\varvec{1}}, {\tilde{\gamma }}_{{\mathfrak {s}}}\otimes (\mathbb {Q}_l[{\tilde{{{\mathfrak {s}}}}}] \ominus {\varvec{1}}) \rangle _{I_{{\mathfrak {s}}}} \quad - \langle {\varvec{1}},\epsilon _\mathcal {R}^0 \rangle _{I_K}. \end{aligned}$$

Let $S_0\subset S$ be the set of those clusters for which

$$\begin{aligned} \xi _{{{\mathfrak {s}}}}({\tilde{\lambda }}_{{{\mathfrak {s}}}})\le \xi _{{{\mathfrak {s}}}}(d_{{{\mathfrak {s}}}}). \end{aligned}$$

By Lemma 11.2, ${\tilde{\gamma }}_{{\mathfrak {s}}}\otimes (\mathbb {Q}_l[{\tilde{{{\mathfrak {s}}}}}] \ominus {\varvec{1}})$ has no $I_{{\mathfrak {s}}}$-invariants if ${{\mathfrak {s}}}\in S{\setminus } S_0$. Otherwise, by the same lemma,

$$\begin{aligned} {\tilde{\gamma }}_{{\mathfrak {s}}}\otimes (\mathbb {Q}_l[{\tilde{{{\mathfrak {s}}}}}] \ominus {\varvec{1}}) \qquad \cong \qquad \mathbb {Q}_l[{\tilde{{{\mathfrak {s}}}}}] \ominus {\varvec{1}}\qquad \text {or}\qquad \mathbb {Q}_l[{\tilde{{{\mathfrak {s}}}}}] \ominus {\tilde{\gamma }}_{{\mathfrak {s}}}, \end{aligned}$$

depending on whether $\xi _{{\mathfrak {s}}}({\tilde{\lambda }}_{{\mathfrak {s}}})$ is $=0$ or $>0$, respectively. Therefore,

$$\begin{aligned} \dim H^1_{\acute{\mathrm{e}}\text {t}}(C)^{I_K} = \sum _{{{\mathfrak {s}}}\in S_0}\> \langle {\varvec{1}}, \mathbb {Q}_l[{\tilde{{{\mathfrak {s}}}}}] \rangle _{I_{{\mathfrak {s}}}} \>\>-\>\> \#\{{{\mathfrak {s}}}\in S_0\,|\,\xi _{{\mathfrak {s}}}({\tilde{\lambda }}_{{\mathfrak {s}}})= 0\} \>\>-\>\> \langle {\varvec{1}},\epsilon _\mathcal {R}^0 \rangle . \end{aligned}$$

Note that

$$\begin{aligned} \langle {\varvec{1}},\epsilon _\mathcal {R}^0 \rangle ={ \Bigl \{\!\!\begin{array}{ll} \scriptstyle ~~1\!\!\!&{}\scriptstyle \text {~~if }|\mathcal {R}|\text { is even and }v(c_f)\text { is even,}\\ \scriptstyle ~~0\!\!\!&{}\scriptstyle \text {~~otherwise,}\end{array}} \end{aligned}$$

is the last term in the statement. Write $U_{{\mathfrak {s}}}$ for the set of those odd clusters ${{\mathfrak {r}}}$ such that $P({{\mathfrak {r}}})$ lies in the $I_K$-orbit of ${{\mathfrak {s}}}$. Then

$$\begin{aligned} \sum _{{{\mathfrak {s}}}\in S_0}\> \langle {\varvec{1}}, \mathbb {Q}_l[{\tilde{{{\mathfrak {s}}}}}] \rangle _{I_{{\mathfrak {s}}}} = \sum _{{{\mathfrak {s}}}\in S_0}\> \langle {\varvec{1}}, \mathbb {Q}_l[U_{{\mathfrak {s}}}] \rangle _{I_K} \end{aligned}$$

and

$$\begin{aligned} \bigcup _{{{\mathfrak {s}}}\in S_0} U_{{{\mathfrak {s}}}} = \bigl \{{{\mathfrak {r}}}\subsetneq \mathcal {R}\text { odd} \,\bigm |\, \xi _{P({{\mathfrak {r}}})}({\tilde{\lambda }}_{P({{\mathfrak {r}}})})\le \xi _{P({{\mathfrak {r}}})}(d_{P({{\mathfrak {r}}})}) \bigr \}. \end{aligned}$$

Counting $I_K$-orbits gives the claim.

(Wild part) By the standard description of the 2-torsion of Jacobians of hyperelliptic curves (see e.g. Cornelissen [11, Lemma 2.1]),

$$\begin{aligned} ({\text {Jac}}C)[2] \>\cong \> \mathbb {F}_2[R] \ominus { \Bigl \{\!\!\begin{array}{ll} \scriptstyle {\varvec{1}}\!\!\!&{}\scriptstyle ~~2g+1\text { odd,}\\ \scriptstyle {\varvec{1}}\oplus {\varvec{1}}\!\!\!&{}\scriptstyle ~~2g+2\text { even,}\end{array}} \end{aligned}$$

as $G_K$-modules. Let W be the wild inertia subgroup of $G_K$. Hence

$$\begin{aligned} V_2{\text {Jac}}C \>\cong \> \mathbb {Q}_2[R] \ominus { \Bigl \{\!\!\begin{array}{ll} \scriptstyle {\varvec{1}}\!\!\!&{}\scriptstyle ~~2g+1\text { odd,}\\ \scriptstyle {\varvec{1}}\oplus {\varvec{1}}\!\!\!&{}\scriptstyle ~~2g+2\text { even,}\end{array}} \end{aligned}$$

as W-modules, since W acts on them through a finite group of odd order. (Representation theory modulo p agrees with complex representation theory for finite groups of order prime to p.)

The left-hand side has the same wild part of the conductor as $H^1_{\acute{\mathrm{e}}\text {t}}(C)$. For $r\in R$ write $X_r$ for its $G_K$ orbit. The conductor of the right-hand side is

$$\begin{aligned} n_{\mathbb {Q}_l[R]} = \sum _{r\in R/\!/G_K} n_{\mathbb {Q}_l[X_r]} = \sum _{r\in R/\!/G_K} v_K(\Delta _{K(r)/K}), \end{aligned}$$

and its tame part is

$$\begin{aligned} n_{\mathbb {Q}_l[R],{\text {tame}}} = \sum _{r\in R/\!/G_K} [K(r):K] \!-\! \dim \mathbb {Q}_l[X_r]^{I_K}. \end{aligned}$$

Because $\dim \mathbb {Q}_l[X_r]^{I_K}$ is the number of $I_K$-orbits on $X_r$, which is the residue degree of K(r)/K, we have

$$\begin{aligned} n_{\mathbb {Q}_l[R],{\text {wild}}} = \sum _{r\in R/\!/G_K} v_K(\Delta _{K(r)/K})\!-\![K(r):K]\!+\!f_{K(r)/K}. \end{aligned}$$

$\square $
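
The wild part of the conductor is a sum over $G_K$-orbits of roots, which makes it easy to evaluate once the local data of each field $K(r)/K$ is known. The Python sketch below is illustrative: the triple encoding, the function names and the example polynomial are assumptions made here, and the discriminant valuations are supplied by hand rather than computed.

```python
# Illustrative sketch: each G_K-orbit of roots is described by a triple
# (disc_valuation, degree, residue_degree) for the extension K(r)/K.

def wild_conductor(orbits):
    """Sum of v_K(disc) - [K(r):K] + f_{K(r)/K} over the given orbits."""
    return sum(v - n + f for (v, n, f) in orbits)

def tame_orbit(e, f):
    """Data of a tamely ramified K(r)/K with ramification index e and residue
    degree f, using the standard formula v_K(disc) = f * (e - 1)."""
    return (f * (e - 1), e * f, f)

# Tamely ramified orbits contribute nothing:
print(wild_conductor([tame_orbit(2, 1), tame_orbit(1, 3), tame_orbit(4, 2)]))  # prints: 0

# Hypothetical example over K = Q_3: f(x) = (x^3 - 3)(x^2 - 1).
# The cubic factor is Eisenstein, so K(r)/K is totally ramified of degree 3
# and its discriminant has valuation v_3(-243) = 5; the roots 1 and -1 are
# rational.
print(wild_conductor([(5, 3, 1), (0, 1, 1), (0, 1, 1)]))  # prints: 3
```

The first call reflects the fact that the formula vanishes whenever every $K(r)/K$ is tame, in line with Theorem 10.3 (4).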

12 Deficiency

In this section we explain how to see whether a semistable hyperelliptic curve is deficient in terms of its cluster picture. We first recall some standard results regarding deficiency of curves (see e.g. [30] Sect. 8).

Definition 12.1

Let X/K be a geometrically integral, smooth and proper curve of genus g. Then X is deficient over K if and only if it has no rational divisor of degree $g-1$; equivalently, the index $I_2$ does not divide $g-1$, where

$$\begin{aligned} I_2=\gcd \bigl \{[K(P)\!:\!K] ~|~ P \in X({\bar{K}})\bigr \}. \end{aligned}$$

Lemma 12.2

Let X/K be a geometrically integral, smooth and proper curve of genus g. Let $Y_1, \ldots , Y_r$ denote the components of the special fibre of the minimal regular model of X over ${\mathcal {O}}_K$, and $d_1,\ldots , d_r$ their multiplicities. Let

$$\begin{aligned} e_i =[{\bar{k}}\cap k(Y_i):k]= \text {length of }G_k\!\text {-orbit} \text { of }Y_i \end{aligned}$$

and

$$\begin{aligned} I_3 = \gcd \{d_ie_i ~|~ i =1,\ldots ,r\}. \end{aligned}$$

Then $I_2=I_3$.

Proof

This is Remark 1 after Lemma 16 in [30]. $\square $

Lemma 12.3

Let C/K be a hyperelliptic curve of genus g. The following are equivalent:

(1)
C is deficient over K,
(2)
C has even genus and has no rational point over any odd degree extension of K,
(3)
C has even genus and every component of the special fibre of its minimal regular model has either even multiplicity or a $G_k$-orbit of even length.

Proof

Since C is hyperelliptic, it has either one or two points at infinity. In particular, it has a rational divisor of degree 2 so $I_2 = 1$ or $I_2=2$. Hence if g is odd then C is not deficient.

(1) $\Leftrightarrow $ (2) is clear from Definition 12.1.

(1) $\Leftrightarrow $ (3) is clear from Lemma 12.2. $\square $
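The criterion of Definition 12.1 combined with Lemma 12.2 is easy to evaluate once the multiplicities $d_i$ and orbit lengths $e_i$ of the components are known. The following Python sketch does this; the function names and the representation of components as pairs $(d_i, e_i)$ are assumptions of the example.

```python
from functools import reduce
from math import gcd


def index_from_components(components):
    """I_3 = gcd of d_i * e_i over the components, given as pairs (d_i, e_i)."""
    return reduce(gcd, (d * e for d, e in components))


def is_deficient(genus, components):
    """Deficient iff the index (I_2 = I_3 by Lemma 12.2) does not divide g - 1."""
    return (genus - 1) % index_from_components(components) != 0


# Genus 2, two components of multiplicity 1 swapped by Frobenius.
print(is_deficient(2, [(1, 2), (1, 2)]))
# Same genus, but one component is fixed by Frobenius.
print(is_deficient(2, [(1, 1), (1, 2)]))
```

For a hyperelliptic curve the index is 1 or 2, so the test reduces to condition (3) of Lemma 12.3: even genus and every product $d_ie_i$ even.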

Theorem 12.4

Let C/K be a semistable hyperelliptic curve. Then C is deficient if and only if it has even genus and either

(1)
$\mathcal {R}= {{\mathfrak {s}}}_1 \coprod {{\mathfrak {s}}}_2$ with ${{\mathfrak {s}}}_1, {{\mathfrak {s}}}_2$ odd $G_k$-conjugate clusters, and $\delta _{{{\mathfrak {s}}}_1}$ is odd, or
(2)
$\mathcal {R}$ is übereven with $\epsilon _{\mathcal {R}}({\text {Frob}}) = -1$ and for all non-übereven ${{\mathfrak {s}}}$ such that ${{\mathfrak {s}}}^* =\mathcal {R}$, either $d_{{\mathfrak {s}}}\notin \mathbb {Z}$ or the ${\text {Frob}}$-orbit of ${{\mathfrak {s}}}$ has even size, or
(3)
$\mathcal {R}$ is a cotwin with a principal übereven child ${{\mathfrak {r}}}$ with $\epsilon _{{{\mathfrak {r}}}}({\text {Frob}}) = -1$, and for all ${{\mathfrak {s}}}$ such that ${{\mathfrak {s}}}^* ={{\mathfrak {r}}}$, either $d_{{\mathfrak {s}}}\notin \mathbb {Z}$ or the ${\text {Frob}}$-orbit of ${{\mathfrak {s}}}$ has even size.

Proof

Since C/K is semistable, all the components of the special fibre of its minimal regular model have multiplicity 1. Thus, by Lemma 12.3, C is deficient if and only if every component has a $G_k$-orbit of even length.

The result follows from the explicit description of the dual graph of the special fibre of the minimal regular model of C together with the action of Frobenius given in Theorem 8.6. From the description of the Frobenius action on the vertices $v_{{\mathfrak {s}}}, v_{{\mathfrak {s}}}^{\pm }$ and the various chains of edges, we see that all components will have even-length orbits under Frobenius if and only if: (i) every principal non-übereven cluster has an even-length orbit under Frobenius, (ii) every principal übereven cluster ${{\mathfrak {s}}}$ either has an even-length orbit or $\epsilon _{{\mathfrak {s}}}({\text {Frob}})=-1$, (iii) every twin and cotwin ${{\mathfrak {t}}}$ either has an even-length orbit or $\epsilon _{{\mathfrak {t}}}({\text {Frob}})=-1$ and $d_{{\mathfrak {t}}}\not \in \mathbb {Z}$, (iv) if $\mathcal {R}={{\mathfrak {s}}}_1 \coprod {{\mathfrak {s}}}_2$ is a disjoint union of two even clusters then $\epsilon (\mathcal {R})=-1$, (v) if $\mathcal {R}={{\mathfrak {s}}}_1 \coprod {{\mathfrak {s}}}_2$ is a disjoint union of two proper odd clusters then $\frac{1}{2} (\delta _{{{\mathfrak {s}}}_1}\!+\!\delta _{{{\mathfrak {s}}}_2})$ is odd and ${{\mathfrak {s}}}_1$ and ${{\mathfrak {s}}}_2$ are swapped by Frobenius.

Since $\mathcal {R}$ itself cannot have a non-trivial orbit under Frobenius, it follows that it must either be a union of two odd clusters, übereven or a cotwin (and, in the cotwin case, its principal child must similarly be übereven). It follows by inspection that one of (1), (2) and (3) must hold for the curve to be deficient.

For the converse, observe that if a cluster ${{\mathfrak {s}}}$ has an ancestor with an even-length Frobenius orbit, then so does ${{\mathfrak {s}}}$ itself. In particular (1) automatically forces (i)–(v) to hold. Similarly, if (2) (respectively (3)) holds, then every proper cluster ${{\mathfrak {s}}}$ with ${{\mathfrak {s}}}^*\ne \mathcal {R}$ (respectively ${{\mathfrak {s}}}^*\ne {{\mathfrak {r}}}$) will necessarily have a non-übereven ancestor ${\mathfrak {a}}$ with ${\mathfrak {a}}^*=\mathcal {R}$ (respectively ${\mathfrak {a}}^*={{\mathfrak {r}}}$), and, since ${\mathfrak {a}}$ is not a twin or cotwin, $d_{{\mathfrak {a}}}\in \mathbb {Z}$ by the semistability criterion. Thus ${\mathfrak {a}}$ must have a Frobenius-orbit of even length, and hence so does ${{\mathfrak {s}}}$. It follows that if either (2) or (3) holds, then so do (i)–(iv), and the curve is deficient. $\square $

13 Integral Weierstrass models

As we shall see in Sects. 13–17, cluster pictures are very well suited for studying Weierstrass equations of hyperelliptic curves, including discriminants and minimal Weierstrass equations. We begin with a criterion for checking whether a Weierstrass equation is integral, that is, whether $f(x)\in {\mathcal {O}}_K[x]$. Since the cluster picture of a polynomial does not change under a substitution $x\mapsto x-t$, it is clearly not possible to determine whether $f(x)\in {\mathcal {O}}_K[x]$ from the cluster picture. However, up to such shifts in the x-coordinate, this turns out to be possible.

Recall first the definition of an (abstract) cluster picture:

Definition 13.1

Let $\mathcal {R}$ be a finite set, $\Sigma $ a collection of non-empty subsets of $\mathcal {R}$ (called clusters), and suppose that a number $d_{{\mathfrak {s}}}\in \mathbb {Q}$ (called the depth of ${{\mathfrak {s}}}$) is assigned to every ${{\mathfrak {s}}}\in \Sigma $ of size $> 1$. Then $\Sigma $ (or $(\Sigma ,\mathcal {R},d)$) is a cluster picture if

(1)
Every singleton is a cluster, and $\mathcal {R}$ is a cluster.
(2)
Two clusters are either disjoint or contained in one another.
(3)
$d_{{\mathfrak {t}}}>d_{{\mathfrak {s}}}$ if ${{\mathfrak {t}}}\subsetneq {{\mathfrak {s}}}$.

Two cluster pictures $(\Sigma ,\mathcal {R},d)$ and $(\Sigma ',\mathcal {R}',d')$ are isomorphic if there is a bijection $\phi :\mathcal {R}\rightarrow \mathcal {R}'$ that induces a bijection from $\Sigma $ to $\Sigma '$ and $d_{{\mathfrak {s}}}= d'_{\phi ({{\mathfrak {s}}})}$. We say a group G acts on $(\Sigma ,\mathcal {R},d)$ if it acts by isomorphisms^{Footnote 17}.

We refer the reader to Table 6 in Appendix E for our standard notation for clusters, including the notions of child/parent and ${{\mathfrak {s}}}\wedge {{\mathfrak {s}}}'$.
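To make the definition concrete, here is a minimal Python sketch that computes the clusters of size $>1$ and their depths for a polynomial whose roots are rational numbers, working over $K=\mathbb {Q}_p$. It assumes the usual description of a cluster as the set of roots lying in a closed p-adic disc, with depth $\min _{r,r'\in {{\mathfrak {s}}}} v(r-r')$. The helper names (`vp`, `proper_clusters`, `is_cluster_picture`) and the sample roots are illustrative choices and do not come from the paper.

```python
from fractions import Fraction
from itertools import combinations


def vp(x, p):
    """p-adic valuation of a nonzero rational number."""
    x = Fraction(x)
    if x == 0:
        raise ValueError("v(0) is +infinity")
    num, den, v = x.numerator, x.denominator, 0
    while num % p == 0:
        num //= p
        v += 1
    while den % p == 0:
        den //= p
        v -= 1
    return v


def proper_clusters(roots, p):
    """Map each cluster of size > 1 (a frozenset of roots) to its depth."""
    roots = list(roots)
    val = {(a, b): vp(a - b, p) for a in roots for b in roots if a != b}
    depths = {}
    for r in roots:
        for t in set(val.values()):
            # roots lying in the closed disc {x : v(x - r) >= t}
            s = frozenset([r] + [x for x in roots
                                 if x != r and val[(r, x)] >= t])
            if len(s) > 1:
                depths[s] = min(val[(a, b)] for a, b in combinations(s, 2))
    return depths


def is_cluster_picture(depths):
    """Check axioms (2) and (3) of Definition 13.1 on clusters of size > 1."""
    for s, t in combinations(depths, 2):
        if not (s.isdisjoint(t) or s < t or t < s):
            return False
        if s < t and not depths[s] > depths[t]:
            return False
        if t < s and not depths[t] > depths[s]:
            return False
    return True


pic = proper_clusters([0, 25, 5, 1, 2, 3], 5)
for s, d in sorted(pic.items(), key=lambda kv: len(kv[0])):
    print(sorted(s), d)
print(is_cluster_picture(pic))
```

For the sample roots and $p=5$ this yields the clusters $\{0,25\}$ of depth 2, $\{0,5,25\}$ of depth 1 and the full set of roots of depth 0; singletons are clusters too, but carry no depth and are omitted from the dictionary.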

Definition 13.2

Let $(\Sigma ,\mathcal {R},d)$ be a cluster picture with an action of $G_K$, and let $n\in \mathbb {Z}$. We say that the pair $(\Sigma ,n)$ is integral if either

$n \ge 0$ and $d_{\mathcal {R}} \ge 0$, or
there is a $G_K$-stable proper cluster ${{\mathfrak {s}}}$ with $d_{{\mathfrak {s}}}\le 0$ and
$$\begin{aligned} n+(|{{\mathfrak {s}}}|\!-\!|{{\mathfrak {t}}}|)d_{{\mathfrak {s}}}+\sum \nolimits _{r\notin {{\mathfrak {s}}}} d_{r\wedge {{\mathfrak {s}}}} \ge 0 \end{aligned}$$
for some ${{\mathfrak {t}}}$ which is either empty or a $G_K$-stable child ${{\mathfrak {t}}}<{{\mathfrak {s}}}$ with either $|{{\mathfrak {t}}}|=1$ or $d_{{{\mathfrak {t}}}} \ge 0$.

Theorem 13.3

Let $C:y^2 = f(x)$ be a hyperelliptic curve over K, and let $(\Sigma ,\mathcal {R},d)$ be the associated cluster picture.

(1)
If $f(x)\in {\mathcal {O}}_K[x]$ then $(\Sigma ,v(c_f))$ is integral.
(2)
Conversely, if $(\Sigma , v(c_f))$ is integral and $G_K$ acts tamely on $\mathcal {R}$ then $f(x-z) \in {\mathcal {O}}_K[x]$ for some $z \in K$.
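Definition 13.2 can be tested mechanically when all roots are rational, so that $G_K$ acts trivially and every cluster is $G_K$-stable. The Python sketch below does this for $K=\mathbb {Q}_p$. It assumes that "proper" means of size $>1$, that children of a cluster include singletons, and that $d_{r\wedge {{\mathfrak {s}}}}=v(r-x)$ for any $x\in {{\mathfrak {s}}}$ when $r\notin {{\mathfrak {s}}}$; all function names are hypothetical.

```python
from fractions import Fraction
from itertools import combinations


def vp(x, p):
    """p-adic valuation of a nonzero rational number."""
    x = Fraction(x)
    if x == 0:
        raise ValueError("v(0) is +infinity")
    num, den, v = x.numerator, x.denominator, 0
    while num % p == 0:
        num //= p
        v += 1
    while den % p == 0:
        den //= p
        v -= 1
    return v


def proper_clusters(roots, p):
    """Map each cluster of size > 1 (a frozenset of roots) to its depth."""
    val = {(a, b): vp(a - b, p) for a in roots for b in roots if a != b}
    depths = {}
    for r in roots:
        for t in set(val.values()):
            s = frozenset([r] + [x for x in roots
                                 if x != r and val[(r, x)] >= t])
            if len(s) > 1:
                depths[s] = min(val[(a, b)] for a, b in combinations(s, 2))
    return depths


def is_integral_pair(roots, p, n):
    """Definition 13.2 for (Sigma, n), Sigma the cluster picture of `roots`.

    Assumes at least two distinct rational roots (trivial Galois action).
    """
    roots = list(roots)
    pic = proper_clusters(roots, p)
    if n >= 0 and pic[frozenset(roots)] >= 0:
        return True
    for s, d in pic.items():
        if d > 0:
            continue
        x = next(iter(s))
        outside = sum(vp(r - x, p) for r in roots if r not in s)
        subs = [t for t in pic if t < s]
        maximal = [t for t in subs if not any(t < u for u in subs)]
        sizes = [0]                        # t empty
        if sum(len(t) for t in maximal) < len(s):
            sizes.append(1)                # t a singleton child
        sizes += [len(t) for t in maximal if pic[t] >= 0]
        if any(n + (len(s) - k) * d + outside >= 0 for k in sizes):
            return True
    return False


roots = [Fraction(1, 3), Fraction(3)]
print(is_integral_pair(roots, 3, 1), is_integral_pair(roots, 3, 0))
```

With roots $\frac{1}{3}$ and 3 over $\mathbb {Q}_3$ the pair $(\Sigma ,1)$ is integral while $(\Sigma ,0)$ is not, in line with $3(x-\frac{1}{3})(x-3)=3x^2-10x+3$ being integral and no shift of $(x-\frac{1}{3})(x-3)$ being so.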

Lemma 13.4

Let $f(x) \in K[x]$. Then $f(x) \in {\mathcal {O}}_{K}[x]$ if and only if

$$\begin{aligned} \sum _{r\in \mathcal {R}} \min \{0, v(r)\} \ge -v(c_f). \end{aligned}$$

Proof

$f(x) \in {\mathcal {O}}_K[x]$ if and only if every point in the Newton polygon of $\frac{f(x)}{c_f}$ lies above $-v(c_f)$. Equivalently the sum of the positive slopes is less than or equal to $v(c_f)$, i.e. $ \sum _{r\in \mathcal {R}} \min \{0, v(r)\} \ge -v(c_f). $ $\square $
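The lemma can be checked numerically for polynomials with rational roots over $K=\mathbb {Q}_p$. The Python sketch below expands $c_f\prod (x-r)$ with exact arithmetic and compares p-integrality of the coefficients with the inequality on the root valuations. The function names are assumptions of the example, and a root equal to 0 is treated as contributing $\min \{0,+\infty \}=0$.

```python
from fractions import Fraction


def vp(x, p):
    """p-adic valuation of a nonzero rational number."""
    x = Fraction(x)
    if x == 0:
        raise ValueError("v(0) is +infinity")
    num, den, v = x.numerator, x.denominator, 0
    while num % p == 0:
        num //= p
        v += 1
    while den % p == 0:
        den //= p
        v -= 1
    return v


def poly_from_roots(c, roots):
    """Coefficients of c * prod(x - r), highest degree first."""
    coeffs = [Fraction(c)]
    for r in roots:
        new = coeffs + [Fraction(0)]       # multiply by x
        for i, a in enumerate(coeffs):
            new[i + 1] -= a * r            # subtract r times the old polynomial
        coeffs = new
    return coeffs


def has_integral_coefficients(c, roots, p):
    return all(a == 0 or vp(a, p) >= 0 for a in poly_from_roots(c, roots))


def root_criterion(c, roots, p):
    """The inequality of Lemma 13.4: sum of min(0, v(r)) >= -v(c)."""
    total = sum(min(0, vp(r, p)) for r in roots if r != 0)
    return total >= -vp(c, p)


roots = [Fraction(1, 9), Fraction(1, 3), Fraction(1)]
for c in (27, 9):
    print(c, has_integral_coefficients(c, roots, 3), root_criterion(c, roots, 3))
```

In the sample, $27(x-\frac{1}{9})(x-\frac{1}{3})(x-1)=27x^3-39x^2+13x-1$ is integral, whereas leading coefficient 9 is not enough, and both tests agree in each case.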

Proof of Theorem 13.3

(1) As $f(x)\in {\mathcal {O}}_K[x]$, clearly $v(c_f) \ge 0$, so we may suppose $d_{\mathcal {R}}<0$. If 0 is not a centre for $\mathcal {R}$, then every $r\in \mathcal {R}$ has $v(r)<d_\mathcal {R}$, so that by Lemma 13.4

$$\begin{aligned} |\mathcal {R}|d_\mathcal {R}> \sum _{r\in \mathcal {R}}\min \{0,v(r)\}\ge -v(c_f). \end{aligned}$$

Thus taking ${{\mathfrak {s}}}=\mathcal {R}$ and ${{\mathfrak {t}}}=\emptyset $ shows that $(\Sigma ,v(c_f))$ is integral.

Henceforth suppose that $d_\mathcal {R}<0$ and that 0 is a centre for $\mathcal {R}$. Let ${{\mathfrak {s}}}$ be the smallest proper cluster with centre $z_{{\mathfrak {s}}}=0$ and depth $d_{{\mathfrak {s}}}\le 0$ and let ${{\mathfrak {t}}}= \{r \in {{\mathfrak {s}}}\mid v(r) > d_{{\mathfrak {s}}}\}$. Note that, by minimality of ${{\mathfrak {s}}}$, ${{\mathfrak {t}}}$ is either empty or a child of ${{\mathfrak {s}}}$ (with $d_{{\mathfrak {t}}}\ge 0$ if $|{{\mathfrak {t}}}|>1$). In particular $v(r)<d_{{\mathfrak {s}}}\le 0$ for every $r\notin {{\mathfrak {s}}}$, and $v(r)=d_{{\mathfrak {s}}}\le 0$ for every $r\in {{\mathfrak {s}}}{\setminus }{{\mathfrak {t}}}$. Thus

$$\begin{aligned} (|{{\mathfrak {s}}}|-|{{\mathfrak {t}}}|)d_{{\mathfrak {s}}}+ \sum _{r \notin {{\mathfrak {s}}}} d_{r\wedge {{\mathfrak {s}}}}&= \sum _{r\in {{\mathfrak {s}}}{\setminus }{{\mathfrak {t}}}}d_{{\mathfrak {s}}}+\sum _{r\in {{\mathfrak {t}}}}0+ \sum _{r \notin {{\mathfrak {s}}}} d_{r\wedge {{\mathfrak {s}}}} \\&= \sum _{r \in {{\mathfrak {s}}}{\setminus } {{\mathfrak {t}}}} v(r) + \sum _{r \in {{\mathfrak {t}}}} 0 + \sum _{r \notin {{\mathfrak {s}}}} v(r) \ge \sum _{r\in \mathcal {R}} \min \{0, v(r)\}\ge -v(c_f), \end{aligned}$$

where the last step comes from Lemma 13.4. Finally, note that ${{\mathfrak {s}}}$ and ${{\mathfrak {t}}}$ are $G_K$-stable, and that either $|{{\mathfrak {t}}}|\le 1$ or ${{\mathfrak {t}}}<{{\mathfrak {s}}}$ with $d_{{\mathfrak {t}}}\ge 0$. The result follows.

(2) If $d_{\mathcal {R}} \ge 0$, by Lemma B.1 we can pick a centre $z=z_\mathcal {R}\in K$ for $\mathcal {R}$. Then the roots of $f(x-z)$ are all integral. Since by integrality $v(c_f) \ge 0$, we must have $f(x-z) \in {\mathcal {O}}_K[x]$, as required.

If $d_\mathcal {R}<0$, consider ${{\mathfrak {s}}}$ and ${{\mathfrak {t}}}$ as in the definition of integrality of $(\Sigma ,v(c_f))$. If ${{\mathfrak {t}}}\ne \emptyset $, by Lemma B.1 we can pick a centre $z=z_{{\mathfrak {t}}}=z_{{\mathfrak {s}}}\in K$ for ${{\mathfrak {t}}}$ and ${{\mathfrak {s}}}$. If ${{\mathfrak {t}}}$ is empty, pick $z=z_{{\mathfrak {s}}}\in K$ to be a centre for ${{\mathfrak {s}}}$, using the same lemma. Shifting x to $x-z$, we may thus assume that $z_{{\mathfrak {s}}}=0$ is a centre for ${{\mathfrak {s}}}$, and that $z_{{\mathfrak {t}}}=0$ if ${{\mathfrak {t}}}\ne \emptyset $. By assumption, $d_{{\mathfrak {s}}}\le 0$, so we have $v(r)<d_{{\mathfrak {s}}}\le 0$ for all $r\not \in {{\mathfrak {s}}}$; moreover, $d_{{\mathfrak {s}}}\le v(r)$ for all $r\in {{\mathfrak {s}}}$, and $0\le v(r)$ for $r\in {{\mathfrak {t}}}$. Hence

$$\begin{aligned}&-v(c_f) \le (|{{\mathfrak {s}}}|-|{{\mathfrak {t}}}|)d_{{\mathfrak {s}}}+ \sum _{r \notin {{\mathfrak {s}}}} d_{r\wedge {{\mathfrak {s}}}} = |{{\mathfrak {s}}}{\setminus }{{\mathfrak {t}}}|d_{{\mathfrak {s}}}+ \sum _{r \notin {{\mathfrak {s}}}} v(r) \\&\quad =\sum _{r \in {{\mathfrak {s}}}{\setminus } {{\mathfrak {t}}}} d_{{\mathfrak {s}}}+ \sum _{r \in {{\mathfrak {t}}}} 0 + \sum _{r \notin {{\mathfrak {s}}}} v(r) \le \sum _{r\in \mathcal {R}} \min \{0, v(r)\}. \end{aligned}$$

The result follows from Lemma 13.4. $\square $

We record a further result that will be useful for understanding the minimal Weierstrass equation of a hyperelliptic curve. In view of Theorem 13.3 it gives a simple criterion for checking whether $p^n f(x-z)$ has integral coefficients for some $z\in K$, provided all the roots of f(x) lie in K.

Lemma 13.5

Let $\Sigma $ be a cluster picture with trivial $G_K$-action, such that $d_{{\mathfrak {s}}}\in \mathbb {Z}$ for every cluster ${{\mathfrak {s}}}$ and $d_{\mathcal {R}}\le 0$. Let n be the minimal integer which makes $(\Sigma ,n)$ integral.

(i)
Then $n\!=\!\min _{{\mathfrak {t}}}(-\sum _{r\notin {{\mathfrak {t}}}}d_{r\wedge {{\mathfrak {t}}}})$, where ${{\mathfrak {t}}}$ ranges over all clusters with $d_{P({{\mathfrak {t}}})}\!\le \!0$ that are either singletons or have $d_{{\mathfrak {t}}}>0$.
(ii)
If there is a cluster of size $\ge \frac{|\mathcal {R}|}{2}$ and depth $\ge 0$ then $n=-\sum _{r\notin {{\mathfrak {t}}}}d_{r\wedge {{\mathfrak {t}}}}$, where ${{\mathfrak {t}}}$ is the maximal such cluster (either of two, if there are two such).
(ii’)
If there is a cluster of size $\ge \frac{|\mathcal {R}|}{2}$ and depth $>0$ then $n=-\sum _{r\notin {{\mathfrak {t}}}}d_{r\wedge {{\mathfrak {t}}}}$, where ${{\mathfrak {t}}}$ is the maximal such cluster (either of two, if there are two such).

Proof

If $d_\mathcal {R}=0$ then $n=0$ and the results follow directly from the definition, so we may suppose that $d_\mathcal {R}<0$. Note that for ${{\mathfrak {t}}}<{{\mathfrak {s}}}$,

$$\begin{aligned} (|{{\mathfrak {s}}}|-|{{\mathfrak {t}}}|)d_{{\mathfrak {s}}}+\sum _{r\notin {{\mathfrak {s}}}}d_{r\wedge {{\mathfrak {s}}}} = \sum _{r\in {{\mathfrak {s}}}{\setminus }{{\mathfrak {t}}}}d_{{\mathfrak {s}}}+\sum _{r\notin {{\mathfrak {s}}}}d_{r\wedge {{\mathfrak {s}}}} = \sum _{r\notin {{\mathfrak {t}}}} d_{r\wedge {{\mathfrak {s}}}} = \sum _{r\notin {{\mathfrak {t}}}} d_{r\wedge {{{\mathfrak {t}}}}}. \end{aligned}$$

(i)
Suppose that ${{\mathfrak {s}}}$ and ${{\mathfrak {t}}}$ are the clusters from Definition 13.2 corresponding to n. Recall that $G_K$ acts trivially, so all clusters are $G_K$-stable. If ${{\mathfrak {t}}}=\emptyset $ then ${{\mathfrak {s}}}$ cannot have a child ${{\mathfrak {s}}}'$ with $d_{{{\mathfrak {s}}}'}\le 0$, since $\sum _{r\in \mathcal {R}} d_{r\wedge {{\mathfrak {s}}}} < \sum _{r\in \mathcal {R}} d_{r\wedge {{\mathfrak {s}}}'}$, and so n is not the minimal possible. Thus if ${{\mathfrak {t}}}=\emptyset $, then every child ${{\mathfrak {t}}}'<{{\mathfrak {s}}}$ is either a singleton or has $d_{{{\mathfrak {t}}}'}>0$, and has $(|{{\mathfrak {s}}}|\!-\!|{{\mathfrak {t}}}|)d_{{\mathfrak {s}}}\le (|{{\mathfrak {s}}}|\!-\!|{{\mathfrak {t}}}'|)d_{{\mathfrak {s}}}$ and hence $n+(|{{\mathfrak {s}}}|\!-\!|{{\mathfrak {t}}}'|)d_{{\mathfrak {s}}}+\sum _{r\notin {{\mathfrak {s}}}}d_{r\wedge {{\mathfrak {s}}}}\ge 0$. In other words, we may assume that ${{\mathfrak {t}}}\ne \emptyset $. The required formula now follows.
(ii)
If $d_{{\mathfrak {t}}}=0$ then for any child ${{\mathfrak {t}}}'<{{\mathfrak {t}}}$ we have $\sum _{r\notin {{\mathfrak {t}}}} d_{r\wedge {{\mathfrak {t}}}}=\sum _{r\notin {{\mathfrak {t}}}'} d_{r\wedge {{\mathfrak {t}}}'}$. If $d_{{\mathfrak {t}}}>0$ then by maximality $d_{P({{\mathfrak {t}}})}\le 0$. In either case, $n\le -\sum _{r\notin {{\mathfrak {t}}}}d_{r\wedge {{\mathfrak {t}}}}$ by (i). If ${{\mathfrak {s}}}\not \subseteq {{\mathfrak {t}}}$ is a cluster with $d_{P({{\mathfrak {s}}})}\le 0$ then necessarily $|({{\mathfrak {s}}}\wedge {{\mathfrak {t}}}){\setminus }{{\mathfrak {t}}}|\le \frac{|\mathcal {R}|}{2}\le |{{\mathfrak {t}}}|$ and
$$\begin{aligned}&\sum _{r\notin {{\mathfrak {s}}}} d_{r\wedge {{\mathfrak {s}}}} = \sum _{r\notin {{\mathfrak {s}}}\wedge {{\mathfrak {t}}}} d_{r\wedge ({{\mathfrak {s}}}\wedge {{\mathfrak {t}}})} + \sum _{r\in ({{\mathfrak {s}}}\wedge {{\mathfrak {t}}}){\setminus }{{\mathfrak {s}}}} d_{r\wedge {{\mathfrak {s}}}} \le \sum _{r\notin {{\mathfrak {s}}}\wedge {{\mathfrak {t}}}} d_{r\wedge ({{\mathfrak {s}}}\wedge {{\mathfrak {t}}})} + |{{\mathfrak {t}}}|d_{{{\mathfrak {s}}}\wedge {{\mathfrak {t}}}} \\&\quad \le \sum _{r\notin {{\mathfrak {s}}}\wedge {{\mathfrak {t}}}} d_{r\wedge ({{\mathfrak {s}}}\wedge {{\mathfrak {t}}})} + |({{\mathfrak {s}}}\wedge {{\mathfrak {t}}}){\setminus }{{\mathfrak {t}}}|d_{{{\mathfrak {s}}}\wedge {{\mathfrak {t}}}} \le \sum _{r\notin {{\mathfrak {s}}}\wedge {{\mathfrak {t}}}} d_{r\wedge ({{\mathfrak {s}}}\wedge {{\mathfrak {t}}})} + \sum _{r\in ({{\mathfrak {s}}}\wedge {{\mathfrak {t}}}){\setminus }{{\mathfrak {t}}}} d_{r\wedge {{\mathfrak {t}}}} = \sum _{r\notin {{\mathfrak {t}}}} d_{r\wedge {{\mathfrak {t}}}}. \end{aligned}$$
Hence ${{\mathfrak {t}}}$ gives the optimal bound in the expression in (i).
(ii’)
Same as (ii) without the $d_{{\mathfrak {t}}}=0$ case.

$\square $

14 Isomorphisms of curves and cluster pictures

Different models of the same hyperelliptic curve may have different cluster pictures. In this section we show that there is a good equivalence relation on cluster pictures that is respected by isomorphisms between hyperelliptic curves: isomorphic curves have “equivalent” cluster pictures (Theorem 14.4) and, conversely, every cluster picture in the equivalence class is realised by some curve over ${\bar{K}}$ (Corollary 14.7). We will look at K-isomorphism classes of (semistable) curves in the next section.

Definition 14.1

Two cluster pictures $(\Sigma ,\mathcal {R},d)$ and $(\Sigma ', \mathcal {R}', d')$ are equivalent if $(\Sigma ', \mathcal {R}', d')$ is isomorphic to a cluster picture obtained from $(\Sigma ,\mathcal {R},d)$ in a finite number of the following steps:

increasing the depth of all clusters by $m\in \mathbb {Q}$:

$\mathcal {R}' = \mathcal {R}$, $\Sigma '=\Sigma $ and $d_{{\mathfrak {s}}}'= d_{{\mathfrak {s}}}+ m$ for all ${{\mathfrak {s}}}\in \Sigma $,
adding a root r, when $|\mathcal {R}|$ is odd:

$\mathcal {R}' = \mathcal {R}\cup \{r\}$, $\Sigma '=\Sigma \cup \{\{r\},\mathcal {R}'\}{\setminus }\{\mathcal {R}\}$, $d'_{{\mathfrak {s}}}= d_{{\mathfrak {s}}}$ for all proper ${{\mathfrak {s}}}\in \Sigma '{\setminus }{\{\mathcal {R}'}\}$ and $d'_{\mathcal {R}'}=d_{\mathcal {R}}$,
removing a root $r\in \mathcal {R}$, when $|\mathcal {R}|$ is even, $\{r\}<\mathcal {R}$ and $\mathcal {R}{\setminus }\{r\}\notin \Sigma $:

$\mathcal {R}' = \mathcal {R}{\setminus } \{r\}$, $\Sigma '=\Sigma \cup \{\mathcal {R}'\}{\setminus }\{\mathcal {R},\{r\}\}$, $d'_{{\mathfrak {s}}}= d_{{\mathfrak {s}}}$ for all proper ${{\mathfrak {s}}}\in \Sigma '{\setminus }{\{\mathcal {R}'}\}$ and $d'_{\mathcal {R}'}=d_{\mathcal {R}}$,
redistributing the depth between ${{\mathfrak {s}}}$ and ${{\mathfrak {s}}}^{c}\!=\!\mathcal {R}{\setminus } {{\mathfrak {s}}}$ to $d'_{{\mathfrak {s}}}=d_{{\mathfrak {s}}}+m$, when $|\mathcal {R}|$ is even, ${{\mathfrak {s}}}<\mathcal {R}$ and $-\delta _{{\mathfrak {s}}}\le m\le \delta _{{{\mathfrak {s}}}^c}$:

$\mathcal {R}'=\mathcal {R}$, $\Sigma '=\Sigma \cup \{{{\mathfrak {s}}},{{\mathfrak {s}}}^{c}\}$, $d'_\mathcal {R}= d_\mathcal {R}$,
$$\begin{aligned}&d'_{{\mathfrak {t}}}= d_{{\mathfrak {t}}}+m \qquad \text {for proper clusters } {{\mathfrak {t}}}\subseteq {{\mathfrak {s}}}\\&d'_{{\mathfrak {t}}}= d_{{\mathfrak {t}}}-m \qquad \text {for proper clusters } {{\mathfrak {t}}}\subseteq {{\mathfrak {s}}}^c \end{aligned}$$
Here we set $\delta _{{{\mathfrak {s}}}^{c}} =0$ if ${{\mathfrak {s}}}^{c} \notin \Sigma $, and $d_{{\mathfrak {t}}}=+\infty $ if $|{{\mathfrak {t}}}|=1$; if in the resulting cluster picture $\delta '_{{\mathfrak {s}}}=0$, we remove ${{\mathfrak {s}}}$ from $\Sigma '$, and similarly for ${{\mathfrak {s}}}^c$.

Example 14.2

For any given $n \in \mathbb {Q}$ the following cluster pictures form an equivalence class:

Here the subscripts on clusters specify their relative depths, t can take all values in the range $0<t<n$, and all clusters of size 5 and 6 can have arbitrary depths. Horizontal lines correspond to cluster pictures that are related by redistributing the depth of a child ${{\mathfrak {s}}}<\mathcal {R}$ (possibly a singleton) and $\mathcal {R}{\setminus }{{\mathfrak {s}}}$, and vertical lines to those related by adding or removing a root.

Remark 14.3

This agrees with the notion of equivalence in [18] for “metric cluster pictures” ([18] Definitions 3.43, 3.45). Metric cluster pictures do not carry a depth function on clusters, but only a relative depth, which is accounted for by the first of the steps in Definition 14.1. The other steps then correspond to the moves (iii), (iv) and (i/ii) in the definitions in [18].

Theorem 14.4

If $C_1: y^2=f_1(x)$ and $C_2: y^2=f_2(x)$ are isomorphic hyperelliptic curves over K, then their cluster pictures are equivalent.

Proof

Note that if F/K is a finite extension then the cluster pictures of $C_1$ and $C_2$ are equivalent over K if and only if they are equivalent over F. So we may assume that $C_i/K$ are semistable. Then by Theorem 5.18, the two metric hyperelliptic graphs $G_{\Sigma _{C_1}}$ and $G_{\Sigma _{C_2}}$ are isomorphic (see D.9 for the notation), and by [18] Thm 5.1, the cluster pictures $\Sigma _{C_1}$ and $\Sigma _{C_2}$ are equivalent. $\square $

Lemma 14.5

(i)
For $x,y\in {\bar{K}}$,
$$\begin{aligned} v\Bigl (\frac{1}{x} - \frac{1}{y}\Bigr )=-v(x)-v(y)+v(x-y). \end{aligned}$$
(ii)
Let $f(x)\in K[x]$ be a separable polynomial with cluster picture $\Sigma _f=(\Sigma ,\mathcal {R},d)$. Suppose ${{\mathfrak {s}}}<\mathcal {R}$ and that all $r\in {{\mathfrak {s}}}$ have $v(r)=a$ and all $r\in \mathcal {R}{\setminus }{{\mathfrak {s}}}$ have $v(r)=b$.

Let $\mathcal {R}'=\{\frac{1}{r}:r\in \mathcal {R}\}$. Then $\psi :r\mapsto \frac{1}{r}$ induces a 1-to-1 correspondence between $\Sigma \cup \{{{\mathfrak {s}}},\mathcal {R}\,\setminus \,{{\mathfrak {s}}}\}$ and $\Sigma '\cup \{{{\mathfrak {s}}}',\mathcal {R}'\,\setminus \,{{\mathfrak {s}}}'\}$, where ${{\mathfrak {s}}}'=\{\frac{1}{r}:r\in {{\mathfrak {s}}}\}$. Moreover, $d'_{\psi ({{\mathfrak {t}}})}=d_{{{\mathfrak {t}}}}-2a$ for clusters ${{\mathfrak {t}}}\subset {{\mathfrak {s}}}$, $d'_{\psi ({{\mathfrak {t}}})}=d_{{{\mathfrak {t}}}}-2b$ for clusters ${{\mathfrak {t}}}\,\subset \,\mathcal {R}\,\setminus \,{{\mathfrak {s}}}$, and $d'_{\mathcal {R}'}=d_{\mathcal {R}}-a-b$.

Proof

(i)
Clear, since $\frac{1}{x}-\frac{1}{y}=\frac{y-x}{xy}$.
(ii)
Follows directly from (i).

$\square $
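The following Python sketch checks part (i) on rational numbers and then illustrates the depth shifts of part (ii) for $p=5$, with ${{\mathfrak {s}}}=\{5,10\}$ (so $a=1$) and $\mathcal {R}{\setminus }{{\mathfrak {s}}}=\{1,2,3,4\}$ (so $b=0$). The sample roots and the helper names `vp`, `identity_holds` and `depth` are assumptions of this example.

```python
from fractions import Fraction
from itertools import combinations


def vp(x, p):
    """p-adic valuation of a nonzero rational number."""
    x = Fraction(x)
    if x == 0:
        raise ValueError("v(0) is +infinity")
    num, den, v = x.numerator, x.denominator, 0
    while num % p == 0:
        num //= p
        v += 1
    while den % p == 0:
        den //= p
        v -= 1
    return v


def identity_holds(x, y, p):
    """Lemma 14.5 (i): v(1/x - 1/y) = -v(x) - v(y) + v(x - y)."""
    return vp(1 / x - 1 / y, p) == -vp(x, p) - vp(y, p) + vp(x - y, p)


def depth(points, p):
    """Depth of a set of at least two points: the minimum of v(r - r')."""
    return min(vp(r - t, p) for r, t in combinations(points, 2))


p = 5
s = [Fraction(5), Fraction(10)]                  # valuation a = 1
rest = [Fraction(k) for k in (1, 2, 3, 4)]       # valuation b = 0
everything = s + rest

print(all(identity_holds(x, y, p) for x, y in combinations(everything, 2)))
inv = lambda pts: [1 / r for r in pts]
print(depth(s, p), depth(inv(s), p))                     # shift by -2a
print(depth(rest, p), depth(inv(rest), p))               # shift by -2b
print(depth(everything, p), depth(inv(everything), p))   # shift by -a-b
```

In this sample the images of ${{\mathfrak {s}}}$ and of $\mathcal {R}$ end up with the same depth $-1$, which is why the correspondence in (ii) is stated for $\Sigma $ and $\Sigma '$ enlarged by ${{\mathfrak {s}}}$, its complement and their images.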

Proposition 14.6

Let $f(x)\in K[x]$ be a separable polynomial with roots $\mathcal {R}\subset {\bar{K}}$, such that $G_K$ acts tamely on $\mathcal {R}$, and let $\Sigma $ be the associated cluster picture. Suppose $\Sigma '$ is a cluster picture obtained from $\Sigma $ by one of the following constructions:

(1)
Increasing the depth of all clusters by some $n\in \mathbb {Z}$;
(2)
Adding a root to $\Sigma $, provided $|\mathcal {R}|$ is odd, $d_\mathcal {R}\in \mathbb {Z}$ and $|k|> \#\{{{\mathfrak {s}}}<\mathcal {R}: {{\mathfrak {s}}}\text { is }G_K\text {-stable}\}$;
(3)
Redistributing the depth from ${{\mathfrak {s}}}$ to $\mathcal {R}{\setminus }{{\mathfrak {s}}}$ to eliminate ${{\mathfrak {s}}}$ and then changing the depth of $\mathcal {R}$ to 0, provided $|\mathcal {R}|$ is even, ${{\mathfrak {s}}}<\mathcal {R}$ is $G_K$-stable with $d_{{\mathfrak {s}}}\in \mathbb {Z}$ and $|k|> \#\{{{\mathfrak {t}}}<{{\mathfrak {s}}}: {{\mathfrak {t}}}\text { is }G_K\text {-stable}\}$;
(4)
Redistributing the depth between ${{\mathfrak {s}}}$ and $\mathcal {R}{\setminus }{{\mathfrak {s}}}$ by decreasing the depth of ${{\mathfrak {s}}}$ by 1, provided $|\mathcal {R}|$ is even, ${{\mathfrak {s}}}<\mathcal {R}$ is $G_K$-stable with $d_\mathcal {R}, d_{{\mathfrak {s}}}\in \mathbb {Z}$ and $|k|> \#\{{{\mathfrak {t}}}<{{\mathfrak {s}}}: {{\mathfrak {t}}}\text { is }G_K\text {-stable}\}$.
(5)
Removing a root from $\mathcal {R}$, provided $|\mathcal {R}|$ is even, $d_\mathcal {R}\in \mathbb {Z}$ and f(x) has a root $r\in K$ that does not lie in any proper cluster other than $\mathcal {R}$.

Then there is a Möbius transformation $\phi (z)=\frac{az+b}{cz+d}$ with $a, b, c, d\in K$, such that $\Sigma '$ is the cluster picture of $\mathcal {R}'=\{\phi (r): r\in \mathcal {R}\}{\setminus }\{\infty \}$ if $|\mathcal {R}|$ is even and of $\mathcal {R}'=\{\phi (r): r\in \mathcal {R}\cup \{\infty \}\}{\setminus }\{\infty \}$ if $|\mathcal {R}|$ is odd.

Moreover, if $y^2=f(x)$ is a hyperelliptic curve, then there is a K-isomorphic curve given by a Weierstrass model whose cluster picture is $\Sigma '$.

Proof

Depending on the case 1–5, the Möbius transformation $\phi $ can be obtained as follows:

1.
Take $\phi (z)=\pi ^n z$.
2.
Since $G_K$ acts tamely, Lemma B.1 shows that $\mathcal {R}$ has a centre $z_\mathcal {R}\in K$; shifting by this and applying (1) we may assume that $z_\mathcal {R}=0$ and $d_\mathcal {R}=0$. Shifting further by some $y\in {\mathcal {O}}_K$, we may assume that all $r\in \mathcal {R}$ are units in ${\mathcal {O}}_{{\bar{K}}}$. Now $\phi (z)=\frac{1}{z}$ has $\mathcal {R}'=\{\frac{1}{r} :r\in \mathcal {R}\}\cup \{0\}$, which, by Lemma 14.5 (i), has the required properties.
3.
As in the proof of (2), we may assume that $d_{{\mathfrak {s}}}=0$, $z_{{\mathfrak {s}}}=0$ and that all roots $r\in {{\mathfrak {s}}}$ are units. All other roots $r\in \mathcal {R}{\setminus }{{\mathfrak {s}}}$ then have valuation $v(r)=d_\mathcal {R}$. By Lemma 14.5, $\phi (z)=\frac{1}{z}$ gives the required cluster picture.
4.
As in the proof of (2), we may assume that $d_\mathcal {R}=0$, $z_{{\mathfrak {s}}}=0$ and that all roots $r\in {{\mathfrak {s}}}$ have valuation 1. Note that all $r\in \mathcal {R}{\setminus }{{\mathfrak {s}}}$ have valuation 0. By Lemma 14.5, $\phi (z)=\frac{\pi _K}{z}$ gives the required cluster picture.
5.
Shifting by r and applying (1), we may assume that $z_\mathcal {R}=r=0$ and $d_\mathcal {R}=0$. Since r does not lie in any proper subcluster of $\mathcal {R}$, all the other roots of f(x) must be units. By Lemma 14.5, $\phi (z)=\frac{1}{z}$ gives the required cluster picture.

Finally observe that if $y^2=f(x)$ is a hyperelliptic curve, then a change of variables of the form $x=\frac{ax'+b}{cx'+d}$, $y=\frac{y'}{(cx'+d)^{g+1}}$ for a Möbius transformation $\psi (z)=\frac{az+b}{cz+d}$ with $a, b, c, d\in K$, gives a model for C/K of the form $y'^2=g(x')$. The set of roots of g(x) is precisely $\mathcal {R}'=\{\psi ^{-1}(r):r\in \mathcal {R}\}{\setminus }\{\infty \}$ if $|\mathcal {R}|$ is even and $\mathcal {R}'=\{\psi ^{-1}(r):r\in \mathcal {R}\cup \{\infty \}\}{\setminus }\{\infty \}$ if $|\mathcal {R}|$ is odd. Setting $\phi =\psi ^{-1}$ for $\phi $ as in the first part gives the desired model. $\square $

Corollary 14.7

Let $C: y^2=f(x)$ be a hyperelliptic curve over K and $\Sigma $ its cluster picture. If $\Sigma '$ is equivalent to $\Sigma $, then there is a ${\bar{K}}$-isomorphic hyperelliptic curve $C'/{\bar{K}}:y^2=g(x)$ whose cluster picture is $\Sigma '$.

Proof

Replacing K by a sufficiently large extension we may assume that Galois acts trivially on the roots of f(x) and that the depths of all clusters in $\Sigma $ and $\Sigma '$ are integers. Now Proposition 14.6 lets us realise all the equivalence steps from $\Sigma $ to $\Sigma '$ in Definition 14.1 through isomorphisms of curves. $\square $

15 Canonical cluster picture of a semistable curve

As any given hyperelliptic curve can admit many different models, it is often desirable to be able to put it in some canonical form. It turns out that every equivalence class of cluster pictures has a canonical representative (Lemma 15.1). Unfortunately, this canonical cluster picture does not always correspond to a Weierstrass model for C defined over K. However, for semistable curves this is nearly the case (Theorem 15.2). Moreover, if the genus of a semistable hyperelliptic curve is even, then this cluster picture does come from a Weierstrass model over K (Corollary 15.3). As we shall see in Sect. 18, this canonical cluster picture is particularly well suited for classifying all reduction types of semistable hyperelliptic curves in any given genus.

Lemma 15.1

Let $(\Sigma ,\mathcal {R},d)$ be a cluster picture with $|\mathcal {R}|=2g+1$ or $2g+2$. There is a unique equivalent (“balanced”) cluster picture $(\Sigma ^{b},\mathcal {R}^b,d^b)$ such that (i) $d^b_{\mathcal {R}^b}=0$, (ii) $\Sigma ^b$ has no clusters of size $>g+1$, (iii) either $\Sigma ^b$ has no clusters of size $g\!+\!1$ or it has two such clusters, in which case they have equal depth.

Proof

This follows from [18] Theorem 5.1. $\square $

Theorem 15.2

Let C/K be a semistable hyperelliptic curve and suppose that $|k|$ exceeds the number of $G_K$-stable children of every cluster of size $\ge g+1$ in the cluster picture of C. Then there is a K-isomorphic curve $C'/{\mathcal {O}}_K: y^2=f(x)$ with $\deg (f) =2g+2$ such that

the top cluster of $C'$ has depth 0;
the cluster picture of $C'$ has no cluster of size $>g+1$;
either there is at most one cluster of size $g+1$ and $v(c_f)=0$, or ${\text {Frob}}_K$ swaps two clusters of size $g+1$ and $v(c_f)=0$ or 1.

Proof

Let $(\Sigma ,\mathcal {R}, d)$ be the cluster picture of C/K. Recall that as C/K is semistable, it satisfies the semistability criterion (Theorem 7.1, Definition 1.8). In particular, the inertia group cannot permute proper clusters in $\Sigma $.

If Frobenius swaps two clusters of size $g+1$, then by the semistability criterion $d_{\mathcal {R}}\in \mathbb {Z}$ (e.g. since ${{\mathfrak {s}}}<\mathcal {R}$ is principal and so has integral depth, and $\delta _{{\mathfrak {s}}}\in \mathbb {Z}$ by Proposition C.7 (2)). Applying a transformation of the form $x'=\pi _K^k x$, $y'=\pi _K^n y$ gives a model over K with the required cluster picture.

Suppose henceforth that Frobenius does not swap clusters of size $g+1$, and consequently that all clusters of size $\ge g+1$ are $G_K$-stable. We now change the model for C by repeatedly applying Proposition 14.6: (2) creates a cluster picture with an even number of roots, then (1) or (3) makes the depth of the top cluster 0, and finally a repeated use of (4) removes all clusters of size $>g+1$ and leaves at most one cluster of size $g+1$ (principal clusters have integral relative depth by Proposition C.7, using that the depth of the top cluster is now 0 for the case when it is a cotwin). The resulting model necessarily has $v(c_f)\in 2\mathbb {Z}$ (semistability criterion for the top cluster, as it now has depth 0), and hence a change of variables of the form $y'=\pi ^k y$ gives a model over K with the required cluster picture.

Finally, Theorem 13.3 shows that shifting the x-coordinate by a suitable element of K gives a model over ${\mathcal {O}}_K$. $\square $

Corollary 15.3

Let C/K be as in Theorem 15.2. If C has even genus then there is a K-isomorphic curve $C'/{\mathcal {O}}_K: y^2=f(x)$ such that

(1)
the top cluster of $C'$ has size $2g+2$ and depth 0;
(2)
the cluster picture of $C'$ has no cluster of size $>g+1$;
(3)
either $v(c_f)=0$ and there is no cluster of size $g+1$, or $v(c_f) \in \{0,1\}$ and there are two clusters of size $g+1$ with equal depths.

Any other K-isomorphic curve $C''/{\mathcal {O}}_K$ satisfying (1), (2) and (3) has the same cluster picture and valuation of leading term as $C'$.

Proof

The existence of $C'$ follows from the theorem and, in the case when the theorem yields a cluster picture with a cluster of size $g+1$, Proposition 14.6(4). The fact that, in this special case, the two resulting clusters ${{\mathfrak {s}}}, {{\mathfrak {s}}}'$ of size $g+1$ can be made to have equal depth follows from the semistability criterion, which shows that $\delta _{{\mathfrak {s}}}, \delta _{{{\mathfrak {s}}}'}\in \mathbb {Z}$ and $(g+1)\delta _{{\mathfrak {s}}}\equiv (g+1)\delta _{{{\mathfrak {s}}}'} \pmod 2$ (Theorem 7.1, Definition 1.8).

Uniqueness of the cluster picture follows from the fact that isomorphic curves have equivalent cluster pictures (Theorem 14.4) and uniqueness of balanced cluster pictures (Lemma 15.1). $\square $

Remark 15.4

Theorem 17.1 and Proposition 17.5 show that the models obtained in the theorem and the corollary are minimal Weierstrass equations.

16 Discriminant

Recall the definition of the discriminant of a hyperelliptic curve:

Definition 16.1

(See [28], Sect. 2) Let $C:y^2=f(x)$ be a hyperelliptic curve of genus g over K. The discriminant $\Delta _C$ of C is

$$\begin{aligned} \Delta _C = 16^g c_f^{4g+2}\text {disc}\Bigl (\frac{1}{c_f}f(x)\Bigr ). \end{aligned}$$

Our main result on the discriminant is that one can easily read off its valuation from the cluster picture of C and, when C/K is semistable, one can moreover read off the valuation of the discriminant of its minimal Weierstrass model:

Theorem 16.2

Let $C:y^2=f(x)$ be a hyperelliptic curve of genus g over K, and let $(\Sigma ,\mathcal {R},d)$ be the associated cluster picture. Then

$$\begin{aligned} v(\Delta _C)= v(c_f)(4g+2) + \sum _{{{\mathfrak {s}}}\text { proper}} d_{{\mathfrak {s}}}\Bigl (|{{\mathfrak {s}}}|^2-\mathop {\Sigma }\limits _{{{\mathfrak {s}}}'<{{\mathfrak {s}}}}|{{\mathfrak {s}}}'|^2\Bigr ). \end{aligned}$$

If C/K is semistable and $|k|>2g\!+\!1$, then the valuation of the discriminant $\Delta _C^{\min }$ of a minimal Weierstrass model of C is determined by the formula

$$\begin{aligned} \frac{v(\Delta _C)-v(\Delta _C^{\min })}{4g+2}=v(c_f)-E+ d_{\mathcal {R}}(|\mathcal {R}|-g-1)+\sum _{g+1<|{{\mathfrak {s}}}|<|\mathcal {R}|}\delta _{{\mathfrak {s}}}(|{{\mathfrak {s}}}|-g-1), \end{aligned}$$

where $E=0$ unless $\Sigma $ has two clusters of size $g+1$ that are permuted by Frobenius and $v(c_f)$ is odd, in which case $E=1$.

The first part of the theorem follows directly from Lemmas 16.4 and 16.5 below. The second part will be proved at the end of Sect. 17, after we find a description for minimal Weierstrass equations in terms of cluster pictures.
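
As an illustration of both formulas, here is a worked example; the specific curve is our own choice and does not appear in the text. Take $K=\mathbb {Q}_p$ with $p\ge 7$ and the genus 2 curve $C:y^2=\prod _{i=1}^{6}(x-ip)$. Every difference of roots $p(i-j)$ has valuation 1, so the only proper cluster is $\mathcal {R}$, with $|\mathcal {R}|=6$, $d_{\mathcal {R}}=1$, six singleton children, and $v(c_f)=0$. The first formula gives

$$\begin{aligned} v(\Delta _C)= 0\cdot 10 + d_{\mathcal {R}}\bigl (6^2-6\cdot 1^2\bigr ) = 30. \end{aligned}$$

The substitution $x=px'$, $y=p^3y'$ yields $y'^2=\prod _{i=1}^{6}(x'-i)$, which has good reduction because $p\ge 7$; hence C is semistable, $|k|=p>2g+1$ and $v(\Delta _C^{\min })=0$. There are no clusters with $g+1<|{{\mathfrak {s}}}|<|\mathcal {R}|$ and $E=0$, so the second formula reads

$$\begin{aligned} \frac{30-0}{10}=3=v(c_f)-E+d_{\mathcal {R}}(|\mathcal {R}|-g-1)=0-0+1\cdot (6-2-1), \end{aligned}$$

in agreement with the direct computation.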

Definition 16.3

Let $(\Sigma ,\mathcal {R},d)$ be a cluster picture and let $n\in \mathbb {Z}$. The discriminant $\Delta _{\Sigma ,n}$ of the pair $(\Sigma ,n)$ is the fractional ideal of K with valuation

$$\begin{aligned} v(\Delta _{\Sigma ,n}) = n(4g+2)+ 2 \sum _{r\ne r'\in \mathcal {R}} d_{r\wedge r'}. \end{aligned}$$

Lemma 16.4

Let $C:y^2=f(x)$ be a hyperelliptic curve over K, let $\Sigma $ be the associated cluster picture and $n=v(c_f)$. Then

$$\begin{aligned} v(\Delta _{C})=v(\Delta _{\Sigma ,n}). \end{aligned}$$

Proof

Clear. $\square $

Lemma 16.5

Let $(\Sigma , \mathcal {R}, d)$ be a cluster picture. Then

$$\begin{aligned} 2 \sum _{r\ne r'\in \mathcal {R}} d_{r\wedge r'} = \sum _{{{\mathfrak {s}}}, |{{\mathfrak {s}}}|>1} d_{{\mathfrak {s}}}(|{{\mathfrak {s}}}|^2-\mathop {\Sigma }\limits _{{{\mathfrak {s}}}'<{{\mathfrak {s}}}}|{{\mathfrak {s}}}'|^2) = \sum _{{{\mathfrak {s}}}} |{{\mathfrak {s}}}|^2 \left\{ \scriptstyle \begin{array}{lll} -d_{P({{\mathfrak {s}}})} &{} \text {if} &{} |{{\mathfrak {s}}}|=1, \\ d_{{\mathfrak {s}}}&{} \text {if} &{} {{\mathfrak {s}}}=\mathcal {R}, \\ \delta _{{{\mathfrak {s}}}} &{} {\text {otherwise.}} \\ \end{array}\right. \end{aligned}$$

Proof

We have

$$\begin{aligned}&2 \sum _{r\ne r'\in \mathcal {R}} d_{r\wedge r'}= \sum _{r\in \mathcal {R}} \sum _{{{\mathfrak {s}}}\ni r, |{{\mathfrak {s}}}|>1} d_{{\mathfrak {s}}}(|{{\mathfrak {s}}}|-|\text {child of }{{\mathfrak {s}}}\text { containing }r|) \\&\quad = \sum _{{{\mathfrak {s}}}, |{{\mathfrak {s}}}|>1} d_{{\mathfrak {s}}}(|{{\mathfrak {s}}}|^2-\mathop {\Sigma }\limits _{{{\mathfrak {s}}}'<{{\mathfrak {s}}}}|{{\mathfrak {s}}}'|^2)=\sum _{{{\mathfrak {s}}}, |{{\mathfrak {s}}}|>1} d_{{\mathfrak {s}}}|{{\mathfrak {s}}}|^2-\sum _{{{\mathfrak {s}}}'\ne \mathcal {R}} d_{P({{\mathfrak {s}}}')}|{{\mathfrak {s}}}'|^2. \end{aligned}$$

$\square $
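
The first equality of Lemma 16.5 can be sanity-checked numerically. The sketch below compares the pairwise sum with the sum over proper clusters on a small hypothetical cluster picture. The encoding (a dict sending each proper cluster, stored as a frozenset of root labels, to its depth) and the function names are illustrative assumptions, not notation from the paper.

```python
from fractions import Fraction
from itertools import combinations

def pair_sum(depths):
    """2 * (sum over unordered pairs {r, r'} of the depth of r ^ r')."""
    roots = max(depths, key=len)  # the top cluster R
    total = Fraction(0)
    for r, r2 in combinations(sorted(roots), 2):
        # r ^ r' is the smallest cluster containing both roots
        meet = min((s for s in depths if r in s and r2 in s), key=len)
        total += 2 * Fraction(depths[meet])
    return total

def cluster_sum(depths):
    """Sum over proper clusters s of d_s * (|s|^2 - sum of |child|^2)."""
    total = Fraction(0)
    for s, d in depths.items():
        subs = [t for t in depths if t < s]
        kids = [t for t in subs if not any(t < u for u in subs)]  # proper children
        singletons = len(s) - sum(len(t) for t in kids)           # singleton children
        child_squares = sum(len(t) ** 2 for t in kids) + singletons
        total += Fraction(d) * (len(s) ** 2 - child_squares)
    return total

# Hypothetical picture: five roots, top cluster of depth 1/2,
# one child of size 3 and depth 2, and two singleton children.
picture = {
    frozenset(range(5)): Fraction(1, 2),
    frozenset({0, 1, 2}): Fraction(2),
}
print(pair_sum(picture), cluster_sum(picture))  # both sums equal 19
```

Exact rational arithmetic is used because cluster depths need not be integers.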

In the remainder of this section we establish some results on how the discriminant changes under the different steps yielding equivalent cluster pictures (see Definition 14.1). As equivalence of cluster pictures is closely tied to isomorphisms of hyperelliptic curves (see Theorem 14.4 and Proposition 14.6), this will be the key to understanding minimal Weierstrass equations and their discriminants. Recall also that we have a notion of integrality for cluster pictures (see Definition 13.2), which gives a way of determining whether a cluster picture corresponds to an integral Weierstrass equation.

Lemma 16.6

Let $(\Sigma ,\mathcal {R},d)$ and $(\Sigma ',\mathcal {R}',d')$ be cluster pictures.

(i)
If $\Sigma '$ is obtained from $\Sigma $ by increasing the depth of all clusters to $d_{{\mathfrak {s}}}'=d_{{\mathfrak {s}}}+t$, then
$$\begin{aligned} v(\Delta _{\Sigma ',n})= v(\Delta _{\Sigma ,n})+t|\mathcal {R}|(|\mathcal {R}|-1). \end{aligned}$$
(ii)
If $\Sigma $ has odd size and $\Sigma '$ is obtained by adding a root to $\Sigma $, then
$$\begin{aligned} v(\Delta _{\Sigma ',n})= v(\Delta _{\Sigma ,n})+2d_{\mathcal {R}}|\mathcal {R}|. \end{aligned}$$
(iii)
If $\Sigma $ has even size then for $k\in \mathbb {Z}$,
$$\begin{aligned} v(\Delta _{\Sigma ,n+k})= v(\Delta _{\Sigma ,n})+2k(|\mathcal {R}|-1). \end{aligned}$$
If $\Sigma $ has odd size, then
$$\begin{aligned} v(\Delta _{\Sigma ,n+k})= v(\Delta _{\Sigma ,n})+2k|\mathcal {R}|. \end{aligned}$$
(iv)
If $\Sigma $ has even size and $\Sigma '$ is obtained by redistributing the depth between ${{\mathfrak {s}}}<\mathcal {R}$ and $\mathcal {R}{\setminus }{{\mathfrak {s}}}$ to $d_{{\mathfrak {s}}}'=d_{{\mathfrak {s}}}-t$ and $d_{\mathcal {R}{\setminus }{{\mathfrak {s}}}}'=d_{\mathcal {R}{\setminus }{{\mathfrak {s}}}}+t$, then
$$\begin{aligned} v(\Delta _{\Sigma ',n})= v(\Delta _{\Sigma ,n})+ t(|\mathcal {R}|-2|{{\mathfrak {s}}}|)(|\mathcal {R}|-1). \end{aligned}$$

Proof

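Write $|\mathcal {R}|=2g+1$ or $2g+2$ for some $g\in \mathbb {Z}$. In the computations below the sums over $r\ne r'$ run over ordered pairs, so each such sum equals the term $2\sum _{r\ne r'}d_{r\wedge r'}$ of Definition 16.3, in which the pairs are unordered.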

(i)
$$\begin{aligned} v(\Delta _{\Sigma ',n})=n(4g+2) + \sum _{r\ne r'\in \mathcal {R}}d'_{r\wedge r'} = n(4g+2) + \sum _{r\ne r'\in \mathcal {R}}d_{r\wedge r'} + |\mathcal {R}|(|\mathcal {R}|-1)t. \end{aligned}$$
(ii)
$$\begin{aligned} v(\Delta _{\Sigma ',n})=n(4g+2) + \sum _{r\ne r'\in \mathcal {R}'}d'_{r\wedge r'} = n(4g+2) + \sum _{r\ne r'\in \mathcal {R}}d_{r\wedge r'} + 2|\mathcal {R}|d_{\mathcal {R}}. \end{aligned}$$
(iii)
$$\begin{aligned} v(\Delta _{\Sigma ,n+k})=(n+k)(4g+2) + \sum _{r\ne r'\in \mathcal {R}}d_{r\wedge r'} = n(4g+2) + \sum _{r\ne r'\in \mathcal {R}}d_{r\wedge r'} + k(4g+2). \end{aligned}$$
(iv)
$$\begin{aligned}&v(\Delta _{\Sigma ',n})=n(4g+2) + \sum _{r\ne r'\in \mathcal {R}}d'_{r\wedge r'} \\&\quad = n(4g+2) + \sum _{r\ne r'\in {{\mathfrak {s}}}}d'_{r\wedge r'} + \sum _{r\ne r'\in \mathcal {R}{\setminus }{{\mathfrak {s}}}}d'_{r\wedge r'} + 2\sum _{r\in {{\mathfrak {s}}}, r'\in \mathcal {R}{\setminus }{{\mathfrak {s}}}}d'_{r\wedge r'} \\&\quad = v(\Delta _{\Sigma ,n})- t|{{\mathfrak {s}}}|(|{{\mathfrak {s}}}|-1) + t|\mathcal {R}{\setminus }{{\mathfrak {s}}}|(|\mathcal {R}{\setminus }{{\mathfrak {s}}}|-1) = v(\Delta _{\Sigma ,n}) + t(|\mathcal {R}|-1)(|\mathcal {R}|-2|{{\mathfrak {s}}}|). \end{aligned}$$

$\square $
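
Parts (i), (iii) and (iv) of Lemma 16.6 can be checked on a concrete picture. The following is a minimal sketch under stated assumptions: cluster pictures are encoded as dicts from frozensets of root labels to depths, and redistribution is modelled as lowering the depth of every cluster inside ${{\mathfrak {s}}}$ by t and raising the depth of every cluster inside $\mathcal {R}{\setminus }{{\mathfrak {s}}}$ by t, which is how the proof of (iv) treats it. The example picture and all function names are hypothetical.

```python
from fractions import Fraction
from itertools import combinations

def v_disc(depths, n, genus):
    """v(Delta_{Sigma,n}) = n(4g+2) + 2 * sum over unordered pairs of d_{r ^ r'}."""
    roots = max(depths, key=len)  # the top cluster R
    total = Fraction(n) * (4 * genus + 2)
    for r, r2 in combinations(sorted(roots), 2):
        meet = min((s for s in depths if r in s and r2 in s), key=len)
        total += 2 * Fraction(depths[meet])
    return total

def shift_all(depths, t):
    """Step (i): increase the depth of every cluster by t."""
    return {s: d + t for s, d in depths.items()}

def redistribute(depths, s, t):
    """Step (iv): lower depths inside s by t, raise depths inside its complement by t."""
    comp = max(depths, key=len) - s
    out = {}
    for c, d in depths.items():
        if c <= s:
            out[c] = d - t
        elif c <= comp:
            out[c] = d + t
        else:
            out[c] = d  # the top cluster keeps its depth
    return out

# Hypothetical even picture with |R| = 6 (so g = 2): a child s of size 4
# and depth 2, whose complement is a twin of depth 1.
R = frozenset(range(6))
s = frozenset({0, 1, 2, 3})
sigma = {R: 0, s: 2, R - s: 1}
g, n, size = 2, 0, len(R)

base = v_disc(sigma, n, g)
delta_i = v_disc(shift_all(sigma, 3), n, g) - base          # expect t|R|(|R|-1)
delta_iii = v_disc(sigma, n + 2, g) - base                  # expect 2k(|R|-1)
delta_iv = v_disc(redistribute(sigma, s, 1), n, g) - base   # expect t(|R|-2|s|)(|R|-1)
print(delta_i, delta_iii, delta_iv)
```

In this example the three differences agree with the right-hand sides of (i) with $t=3$, (iii) with $k=2$, and (iv) with $t=1$.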

The following proposition identifies a number of scenarios when we can manipulate integral cluster pictures to decrease the valuation of their discriminant.

Proposition 16.7

Let $(\Sigma ,\mathcal {R},d)$ and $(\Sigma ',\mathcal {R}',d')$ be cluster pictures all of whose cluster depths are integers, and let $G_K$ act trivially on both $\Sigma $ and $\Sigma '$. Let $n, n'\in \mathbb {Z}$ be the minimal integers such that $(\Sigma ,n)$ and $(\Sigma ',n')$ are integral.

(i)
If $d_\mathcal {R}\ge 1$ and $\Sigma '$ is obtained from $\Sigma $ by decreasing the depth of all clusters by 1, then
$$\begin{aligned} n'=n=0 \qquad \text {and}\qquad v(\Delta _{\Sigma ',n'})=v(\Delta _{\Sigma ,n}) - |\mathcal {R}|(|\mathcal {R}|-1). \end{aligned}$$
(ii)
If $\Sigma $ has odd size, $d_\mathcal {R}\le 0$ and $\Sigma '$ is obtained by adding a root to $\Sigma $, then
$$\begin{aligned} n'=n-d_\mathcal {R}\quad \text { and } \quad v(\Delta _{\Sigma ',n'})=v(\Delta _{\Sigma ,n}). \end{aligned}$$
(iii)
If $\Sigma $ has even size, $d_\mathcal {R}\le 0$, $\Sigma $ has a cluster ${{\mathfrak {t}}}$ with $|{{\mathfrak {t}}}|\ge \frac{|\mathcal {R}|}{2}$, $d_{{\mathfrak {t}}}>0$ and $d_{P({{\mathfrak {t}}})}\le 0$, and $\Sigma '$ is obtained by redistributing the depth of the child ${{\mathfrak {s}}}<\mathcal {R}$ containing ${{\mathfrak {t}}}$ and $\mathcal {R}{\setminus }{{\mathfrak {s}}}$ to $d_{{\mathfrak {s}}}'=d_{{\mathfrak {s}}}-1$ and $d_{\mathcal {R}{\setminus }{{\mathfrak {s}}}}'=d_{\mathcal {R}{\setminus }{{\mathfrak {s}}}}+1$, then
$$\begin{aligned} n'=n+|{{\mathfrak {s}}}|-|{{\mathfrak {t}}}| \quad \text { and } \quad v(\Delta _{\Sigma ',n'})=v(\Delta _{\Sigma ,n})-(2|{{\mathfrak {t}}}|-|\mathcal {R}|) (|\mathcal {R}|-1). \end{aligned}$$
(iv)
If $\Sigma $ has even size, $d_\mathcal {R}< 0$, $\Sigma $ has no cluster ${{\mathfrak {t}}}$ with $|{{\mathfrak {t}}}|\ge \frac{|\mathcal {R}|}{2}$ and $d_{{\mathfrak {t}}}\ge 0$, and $\Sigma '$ is obtained by increasing the depth of all clusters by 1, then
$$\begin{aligned} n'<n-\frac{|\mathcal {R}|}{2} \quad \text { and } \quad v(\Delta _{\Sigma ',n'})<v(\Delta _{\Sigma ,n}). \end{aligned}$$
(v)
If $\Sigma $ has even size, $d_\mathcal {R}< 0$, $\Sigma $ has a cluster ${{\mathfrak {t}}}$ with $|{{\mathfrak {t}}}|\ge \frac{|\mathcal {R}|}{2}$ and $d_{{\mathfrak {t}}}=0$, and $\Sigma '$ is obtained by increasing the depth of all clusters by 1, then
$$\begin{aligned} n'= n-|\mathcal {R}|+|{{\mathfrak {t}}}| \quad \text { and } \quad \quad v(\Delta _{\Sigma ',n'})=v(\Delta _{\Sigma ,n})+(2|{{\mathfrak {t}}}|-|\mathcal {R}|) (|\mathcal {R}|-1). \end{aligned}$$

Proof

(i)
The claim for $n,n'$ is clear. The rest follows from Lemma 16.6(i).
(ii)
The claim for $n'$ follows from Lemma 13.5(i). Lemma 16.6(ii,iii) then gives
$$\begin{aligned} v(\Delta _{\Sigma ',n'})=v(\Delta _{\Sigma ,n}) + 2d_\mathcal {R}|\mathcal {R}| - 2d_\mathcal {R}(|\mathcal {R}'|-1)=v(\Delta _{\Sigma ,n}). \end{aligned}$$
(iii)
Since the cluster depths are assumed to be integers, we must have $d'_{{\mathfrak {t}}}\ge 0$, so by Lemma 13.5(ii, ii’),
$$\begin{aligned} n'= & {} - \sum _{r\notin {{\mathfrak {t}}}} d_{r\wedge {{\mathfrak {t}}}}' = - \sum _{r\notin {{\mathfrak {s}}}}d_{r\wedge {{\mathfrak {t}}}}' - \sum _{r\in {{\mathfrak {s}}}{\setminus }{{\mathfrak {t}}}}d_{r\wedge {{\mathfrak {t}}}}' = - \sum _{r\notin {{\mathfrak {s}}}}d_{r\wedge {{\mathfrak {t}}}} - \sum _{r\in {{\mathfrak {s}}}{\setminus }{{\mathfrak {t}}}}d_{r\wedge {{\mathfrak {t}}}} \\&+ |{{\mathfrak {s}}}{\setminus }{{\mathfrak {t}}}| = n+|{{\mathfrak {s}}}{\setminus }{{\mathfrak {t}}}|. \end{aligned}$$
Thus by Lemma 16.6(iii,iv),
$$\begin{aligned} v(\Delta _{\Sigma ',n'})= & {} v(\Delta _{\Sigma ,n}) + (|\mathcal {R}|-2|{{\mathfrak {s}}}|) (|\mathcal {R}|-1) + 2(|{{\mathfrak {s}}}|-|{{\mathfrak {t}}}|)(|\mathcal {R}|-1) \\= & {} v(\Delta _{\Sigma ,n})+ (|\mathcal {R}|-2|{{\mathfrak {t}}}|) (|\mathcal {R}|-1). \end{aligned}$$
(iv)
By Lemma 13.5(i) we can write $n=-\sum _{r\notin {{\mathfrak {t}}}} d_{r\wedge {{\mathfrak {t}}}}$ for some ${{\mathfrak {t}}}$ which has $d_{P({{\mathfrak {t}}})}\le 0$ and is either a singleton or has $d_{{\mathfrak {t}}}>0$. Since necessarily $|{{\mathfrak {t}}}|<\frac{|\mathcal {R}|}{2}$, we must have $-\sum _{r\notin {{\mathfrak {t}}}}d'_{r\wedge {{\mathfrak {t}}}}< -\sum _{r\notin {{\mathfrak {t}}}}d_{r\wedge {{\mathfrak {t}}}}- \frac{|\mathcal {R}|}{2}$. Moreover, if $d'_{P({{\mathfrak {t}}})}>0$ then necessarily $d_{P({{\mathfrak {t}}})}=0$ and we also have
$$\begin{aligned} -\sum _{r\notin P({{\mathfrak {t}}})}d'_{r\wedge P({{\mathfrak {t}}})}< -\sum _{r\notin P({{\mathfrak {t}}})}d_{r\wedge P({{\mathfrak {t}}})}- \frac{|\mathcal {R}|}{2}=-\sum _{r\notin {{\mathfrak {t}}}}d_{r\wedge {{\mathfrak {t}}}}- \frac{|\mathcal {R}|}{2}. \end{aligned}$$
By Lemma 13.5(i) it follows that
$$\begin{aligned} n'<n - \frac{|\mathcal {R}|}{2}. \end{aligned}$$
Thus by Lemma 16.6(i,iii)
$$\begin{aligned} v(\Delta _{\Sigma ',n'})=v(\Delta _{\Sigma ,n}) + |\mathcal {R}|(|\mathcal {R}|-1) + 2(n'-n)(|\mathcal {R}|-1)<v(\Delta _{\Sigma ,n}) . \end{aligned}$$
(v)
By Lemma 13.5 (ii’,ii),
$$\begin{aligned} n' = -\sum _{r\notin {{\mathfrak {t}}}}d_{r\wedge {{\mathfrak {t}}}}' = -|\mathcal {R}{\setminus }{{\mathfrak {t}}}|-\sum _{r\notin {{\mathfrak {t}}}}d_{r\wedge {{\mathfrak {t}}}} = n-|\mathcal {R}|+|{{\mathfrak {t}}}|, \end{aligned}$$
and by Lemma 16.6(i,iii)
$$\begin{aligned} v(\Delta _{\Sigma ',n'})= & {} v(\Delta _{\Sigma ,n}) + |\mathcal {R}|(|\mathcal {R}|-1) + 2(n'-n)(|\mathcal {R}|-1)=v(\Delta _{\Sigma ,n}) \\&+ (2|{{\mathfrak {t}}}|-|\mathcal {R}|)(|\mathcal {R}|-1). \end{aligned}$$

$\square $

We end this section with a result that effectively gives sufficient criteria for a cluster picture to correspond to a minimal Weierstrass equation.

Corollary 16.8

Suppose $(\Sigma ,\mathcal {R},d)$ is a cluster picture with a trivial action of $G_K$, with $|\mathcal {R}|=2g\!+\!2$, with $d_{\mathcal {R}}=0$, with $d_{{\mathfrak {s}}}\in \mathbb {Z}$ for every proper cluster ${{\mathfrak {s}}}$, and with no cluster ${{\mathfrak {s}}}\ne \mathcal {R}$ of size $|{{\mathfrak {s}}}|\!>\!g\!+\!1$. Then for every integral $(\Sigma ',n')$ with $\Sigma '$ equivalent to $\Sigma $, with $d_{{{\mathfrak {s}}}}\in \mathbb {Z}$ for every cluster ${{\mathfrak {s}}}$, and with trivial $G_K$-action,

$$\begin{aligned} v(\Delta _{\Sigma ,0}) \le v(\Delta _{\Sigma ',n'}), \end{aligned}$$

with equality if and only if $\Sigma '$ has no cluster of size $>g+1$ with depth $>0$, but has some cluster ${{\mathfrak {t}}}$ of size $\ge g+1$ with $d_{{\mathfrak {t}}}'\ge 0$, and $n'=-\sum _{r\notin {{\mathfrak {t}}}}d_{r\wedge {{\mathfrak {t}}}}'$.

Proof

By [18] Thm 5.1, there is a unique (“balanced”) cluster picture $(\Sigma ^{b},\mathcal {R}^{b},d^{b})$ equivalent to $\Sigma $ with $|\mathcal {R}^{b}|$ even, $d^{b}_{\mathcal {R}^{b}}=0$, no clusters other than $\mathcal {R}^{b}$ of size $>g+1$, and either no cluster of size $g+1$ or two such clusters of equal depth. Note that, by uniqueness, either $\Sigma =\Sigma ^{b}$, or $\Sigma ^{b}$ is obtained from $\Sigma $ by redistributing the depths of a child ${{\mathfrak {s}}}<\mathcal {R}$ of size $g+1$ and $\mathcal {R}{\setminus }{{\mathfrak {s}}}$ so that they get equal depth. Thus by Lemma 16.6(iv)

$$\begin{aligned} v(\Delta _{\Sigma ^{b},0}) = v(\Delta _{\Sigma ,0}). \end{aligned}$$

Now consider $\Sigma '$. From the definition of the discriminant and Lemma 13.5(ii), we may assume that $n'$ is the minimal integer that makes $(\Sigma ',n')$ integral.

By Proposition 16.7(i,ii), it suffices to prove the result when $\Sigma '$ has even size and $d_\mathcal {R}'\le 0$.

By Proposition 16.7(iv), we may further assume that either $d_{\mathcal {R}'}'\!=\!0$, or $d_{\mathcal {R}'}'<0$ and $\Sigma '$ has a cluster of size $\ge g\!+\!1$ with depth $\ge 0$.

By Proposition 16.7(iii), we may further assume that $\Sigma '$ has no cluster ${{\mathfrak {s}}}$ with $|{{\mathfrak {s}}}|>g\!+\! 1$ and $d'_{{\mathfrak {s}}}>0$.

Suppose $d'_{\mathcal {R}'}<0$. If $\Sigma '$ has a cluster ${{\mathfrak {s}}}$ of size $g\!+\! 1$ and depth $d'_{{\mathfrak {s}}}>0$, then repeatedly applying Proposition 16.7(iii) reduces the problem to the case when $\Sigma '$ has a cluster ${{\mathfrak {s}}}$ of size $g\!+\! 1$ and depth $d'_{{\mathfrak {s}}}=0$. Now if ${{\mathfrak {s}}}$ is a cluster of size $\ge g\!+\!1$ and depth $d'_{{\mathfrak {s}}}=0$, then repeatedly applying Proposition 16.7(v) followed by (iii) (which eventually eliminates the cluster by pushing $d'_\mathcal {R}$ up to 0) reduces the problem to the case when $d'_{\mathcal {R}'}=0$.

Finally, suppose $d'_{\mathcal {R}'}=0$, so, in particular, $n'=0$. If $\Sigma '$ has no cluster of size $g\!+\! 1$, then $\Sigma '=\Sigma ^{b}$. If $\Sigma '$ does have such a cluster, then Lemma 16.6(iv) and uniqueness of $\Sigma ^{b}$ show that $v(\Delta _{\Sigma ',0})=v(\Delta _{\Sigma ^{b},0})$. The result follows. $\square $

17 Minimal Weierstrass equations

Theorem 17.1

Let $C:y^2=f(x)$ be a hyperelliptic curve over K with $f(x)\in {\mathcal {O}}_K[x]$, and let $\Sigma $ be the associated cluster picture. If $d_{\mathcal {R}}=0$, $v(c_f)=0$ and $\Sigma $ has no cluster ${{\mathfrak {s}}}\ne \mathcal {R}$ of size $|{{\mathfrak {s}}}|>g+1$, then C is a minimal Weierstrass equation.

Proof

Suppose another integral Weierstrass model $C':y^2=g(x)$ for C has cluster picture $\Sigma '$. Over a suitable field extension F/K, the depths of all clusters of $\Sigma $ and $\Sigma '$ are integers, and ${\text {Gal}}(\bar{F}/F)$ acts trivially on both cluster pictures. By Theorems 13.3 and 14.4, $(\Sigma ',v_F(c_g))$ is integral and $\Sigma '$ is equivalent to $\Sigma $ over F. By Corollary 16.8, $v_F(\Delta _{\Sigma ,0})\le v_F(\Delta _{\Sigma ',v_F(c_g)})$, and so the same inequality holds over K. By Lemma 16.4, $v(\Delta _C)\le v(\Delta _{C'})$, as required. $\square $

For semistable hyperelliptic curves, we can give a full characterisation of minimal Weierstrass models in terms of cluster pictures:

Theorem 17.2

Let $C:y^2=f(x)$ be a semistable hyperelliptic curve over K, and let $(\Sigma ,\mathcal {R},d)$ be the associated cluster picture. Suppose that every cluster of $\Sigma $ of size $>g+1$ has at most $|k|\!-\!1$ $G_K$-stable children.

Then there is some $z\in K$ such that $y^2=f(x-z)$ is a minimal Weierstrass model if and only if one of the following conditions holds

(1)
$\Sigma $ has two clusters of size $g+1$ that are swapped by Frobenius, $d_\mathcal {R}=0$ and $v(c_f)\in \{0,1\}$,
(2)
$\Sigma $ has no cluster of size $>\!g\!+\!1$ with depth $>0$, but has some $G_K$-stable cluster ${{\mathfrak {t}}}$ of size $|{{\mathfrak {t}}}|\ge g+1$ with $d_{{\mathfrak {t}}}\ge 0$ and $v(c_f)=-\sum _{r\notin {{\mathfrak {t}}}}d_{r\wedge {{\mathfrak {t}}}}$.

In particular, if $f(x)\in {\mathcal {O}}_K[x]$ then $y^2=f(x)$ is a minimal Weierstrass equation if and only if $\Sigma $ satisfies either (1) or (2).
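
The combinatorial part of condition (2) is straightforward to test mechanically. The sketch below is a hedged illustration only: it assumes that C is semistable and not in case (1), and that every candidate cluster ${{\mathfrak {t}}}$ is $G_K$-stable, none of which it verifies. The dict encoding of the cluster picture and the function names are our own assumptions.

```python
from fractions import Fraction

def meet_depth(depths, r, t):
    """Depth of r ^ t, the smallest cluster containing the root r and the cluster t."""
    meet = min((s for s in depths if r in s and t <= s), key=len)
    return Fraction(depths[meet])

def satisfies_condition_2(depths, v_cf, genus):
    """Combinatorial part of condition (2); Galois-stability is assumed, not checked."""
    roots = max(depths, key=len)  # the top cluster R
    # No cluster of size > g+1 may have positive depth.
    if any(len(s) > genus + 1 and d > 0 for s, d in depths.items()):
        return False
    # Look for a cluster t of size >= g+1 and depth >= 0 with
    # v(c_f) = -(sum over roots r outside t of the depth of r ^ t).
    for t, d in depths.items():
        if len(t) >= genus + 1 and d >= 0:
            if v_cf == -sum(meet_depth(depths, r, t) for r in roots - t):
                return True
    return False

R = frozenset(range(6))   # six roots, genus 2
t = frozenset({0, 1, 2})  # a hypothetical cluster of size g+1 = 3

print(satisfies_condition_2({R: 1}, 0, 2))         # top cluster has depth 1 > 0
print(satisfies_condition_2({R: 0}, 0, 2))         # t = R, v(c_f) = 0
print(satisfies_condition_2({R: -1, t: 0}, 3, 2))  # t of size 3, v(c_f) = 3
```

The first call fails the test because the top cluster itself has size $>g+1$ and positive depth; the other two pictures pass it.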

The rest of this section is devoted to the proof of this theorem. Our approach is to translate everything in terms of cluster pictures (semistability, integral Weierstrass equations, discriminants) and work mainly on that level. The two cases for the cluster picture in the above theorem are quite distinct and it will be convenient to use the following terminology:

Definition 17.3

The cluster picture of a semistable hyperelliptic curve $y^2=f(x)$ of genus g is exceptional if it has two clusters of size $g+1$ that are swapped by Frobenius.

Proposition 17.4

Let $C:y^2=f(x)$ be a semistable hyperelliptic curve over K, let $\Sigma $ be the associated cluster picture and $n=v(c_f)$. Suppose $C'$ is another Weierstrass model for C with corresponding $(\Sigma ',n')$. Then $\Sigma $ is exceptional if and only if $\Sigma '$ is. If so, then $n'\equiv n \mod 2$ and $\Sigma '$ is obtained from $\Sigma $ by increasing the depths of all clusters by some $d\in \mathbb {Z}$; moreover

$$\begin{aligned} v(\Delta _{\Sigma ,n}) = v(\Delta _{\Sigma ',n'}) - 2(n'-n)(|\mathcal {R}|-1) - d|\mathcal {R}|(|\mathcal {R}|-1). \end{aligned}$$

Proof

Since $C'$ is another Weierstrass model for C, their special fibres have the same dual graphs with the same automorphisms induced by Frobenius. By Theorem 8.6 this graph $\Upsilon _C$ modulo the hyperelliptic involution has a unique fixed point under the Frobenius action. In the terminology of [18], it follows that they have the same open hyperelliptic graph ([18] Proposition 5.7 with $\tilde{G}=\Upsilon _C$) and hence the same metric cluster picture with the same automorphisms induced by Frobenius ([18] Thm 4.2). In particular, $\Sigma '$ is obtained from $\Sigma $ by increasing the depth of all clusters by some $d\in \mathbb {Z}$, and with the same action of Frobenius on clusters.

Thus $\Sigma $ is exceptional if and only if $\Sigma '$ is. It follows from the semistability criterion (Theorem 7.1, Definition 1.8) that $n' \equiv n \mod 2$. Finally the valuation of the discriminant follows from Lemma 16.6(i,iii). $\square $

Proposition 17.5

Let $C:y^2=f(x) \in {\mathcal {O}}_K[x]$ be a semistable hyperelliptic curve, and let $n=v(c_f)$. Suppose the associated cluster picture $\Sigma $ is exceptional. Then the Weierstrass equation is minimal if and only if $d_{\mathcal {R}} =0$ and $n\in \{0,1\}$.

Proof

First suppose $\Sigma $ is exceptional with $d_{\mathcal {R}}=0$ and $n\in \{0,1\}$. Let $C'$ be another integral Weierstrass model with corresponding $(\Sigma ', n')$. By Proposition 17.4, $\Sigma '$ is exceptional, $n'\equiv n \mod 2$, and

$$\begin{aligned} v(\Delta _{\Sigma ,n}) = v(\Delta _{\Sigma ',n'}) - 2(n'-n)(|\mathcal {R}|-1) - d'_{\mathcal {R}'}|\mathcal {R}|(|\mathcal {R}|-1). \end{aligned}$$

By Theorem 13.3, $(\Sigma ',n')$ is integral, so either $n'\ge 0$ and $d'_{\mathcal {R}'}\ge 0$, or $d'_{\mathcal {R}'} < 0$ and $n'+|\mathcal {R}|d'_{\mathcal {R}'}\ge 0$. In the first case, since $n'\equiv n\mod 2$, we clearly have $ v(\Delta _{\Sigma ,n}) \le v(\Delta _{\Sigma ',n'})$ with equality if and only if $n'=n$ and $d'_{\mathcal {R}'} = 0$. In the second case, $d'_{\mathcal {R}'} \in \mathbb {Z}$ by the semistability criterion for $C'$ (since ${{\mathfrak {s}}}<\mathcal {R}'$ is principal and so has integral depth, and $\delta _{{\mathfrak {s}}}\in \mathbb {Z}$ by Proposition C.7 (2)), so $n'\!\ge \! -|\mathcal {R}|d'_{\mathcal {R}'}>2$ and we again obtain

$$\begin{aligned} v(\Delta _{\Sigma ,n}) \le v(\Delta _{\Sigma ',n'}) - 2(n'-n)(|\mathcal {R}|-1) + n'(|\mathcal {R}|-1)< v(\Delta _{\Sigma ',n'}). \end{aligned}$$

Since the discriminants of the cluster pictures are those of the curves (Lemma 16.4), it follows that C is a minimal Weierstrass equation.

For the converse, suppose $C:y^2=f(x)$ is a minimal Weierstrass model and $\Sigma $ is exceptional. A substitution of the form $y_1 = p^a y$ and $x_1= p^bx$ for suitable $a,b \in \mathbb {Z}$, gives a new model $C'$ whose cluster picture $\Sigma '$ is also exceptional, $d'_{\mathcal {R}'}=0$ and the corresponding valuation $n'$ is either 0 or 1. Theorem 13.3 gives a new integral model $C''$ with the same cluster picture, $d''_{\mathcal {R}''}=0$ and $n''\in \{0,1\}$. By the first part of the proof, $C''$ is a minimal Weierstrass model (so $v(\Delta _{\Sigma '',n''})=v(\Delta _{\Sigma ,n})$), $n=n''$ and $d_{\mathcal {R}}=0$. $\square $

We now turn to cluster pictures that are not exceptional.

Lemma 17.6

Let $C:y^2=f(x)$ be a semistable hyperelliptic curve over K with cluster picture $\Sigma $. Suppose that $\Sigma $ is not exceptional and that every cluster of $\Sigma $ of size $>g+1$ has at most $|k|\!-\!1$ $G_K$-stable children. Then C admits an integral Weierstrass model $y^2=h(x)$ with $v(c_h)=0$ and whose cluster picture $(\Sigma ',\mathcal {R}',d')$ has $|\mathcal {R}'|=2g\!+\!2$, $d'_{\mathcal {R}'}\!=\!0$ and has no cluster ${{\mathfrak {s}}}\ne \mathcal {R}'$ of size $> g\!+\!1$.

Proof

This follows from Theorem 15.2, and the fact that by Proposition 17.4, $\Sigma '$ cannot be exceptional. $\square $

Proposition 17.7

Let $C:y^2=f(x) \in {\mathcal {O}}_K[x]$ be a semistable hyperelliptic curve. Suppose the associated cluster picture $\Sigma $ is not exceptional and that every cluster of $\Sigma $ of size $>g+1$ has at most $|k|\!-\!1$ $G_K$-stable children. Then the Weierstrass equation is minimal if and only if $\Sigma $ has no cluster of size $>g+1$ with depth $>0$, but has some cluster ${{\mathfrak {t}}}$ of size $\ge g+1$ with $d_{{\mathfrak {t}}}\ge 0$ and $-\sum _{r\notin {{\mathfrak {t}}}}d_{r\wedge {{\mathfrak {t}}}}=v(c_f)$.

Proof

By Lemma 17.6, there exists another integral model $C_{\min }:y^2=g(x)$ with $v(c_g)=0$ and whose associated cluster picture $\Sigma _{\min }$ has its top cluster of depth 0, and has no other clusters of size $>g+1$. By Theorem 17.1, $C_{\min }$ is a minimal Weierstrass equation, so

$$\begin{aligned} v(\Delta _{C})\ge v(\Delta _{C_{\min }}). \end{aligned}$$

It remains to show that the claimed condition for having equality is correct. Passing to a suitable field extension if necessary, we may assume that all the clusters in $\Sigma $ have integer depth and that the Galois group acts trivially on $\Sigma $ and on $\Sigma _{\min }$. By Theorem 14.4, $\Sigma $ is equivalent to $\Sigma _{\min }$, so by Corollary 16.8, $v(\Delta _{C})=v(\Delta _{C_{\min }})$ if and only if $\Sigma $ has no cluster of size $>g+1$ with depth $>0$, but has some cluster ${{\mathfrak {t}}}$ of size $\ge g+1$ with $d_{{\mathfrak {t}}}\ge 0$, and $v(c_f)=-\sum _{r\notin {{\mathfrak {t}}}}d_{r\wedge {{\mathfrak {t}}}}$. $\square $

Proof of Theorem 17.2

Note first that, by definition of being exceptional, conditions (1) and (2) are mutually exclusive. Note also that $y^2=f(x)$ and $y^2=f(x-z)$ have the same cluster picture and the same valuation of the leading term of the defining polynomial.

Suppose $\Sigma $ is exceptional. If $y^2=f(x-z)$ is a minimal Weierstrass equation, then Proposition 17.5 shows that $d_\mathcal {R}=0$ and $v(c_f)\in \{0,1\}$. Conversely, if $d_\mathcal {R}=0$ and $v(c_f)\in \{0,1\}$ then $(\Sigma ,v(c_f))$ is integral. By the semistability criterion $G_K$ acts tamely on $\mathcal {R}$, so by Theorem 13.3(2) there exists $z \in K$ such that $f(x-z) \in {\mathcal {O}}_K[x]$. By Proposition 17.5, $y^2=f(x-z)$ is then a minimal Weierstrass equation.

Suppose $\Sigma $ is not exceptional and $y^2=f(x-z)$ is a minimal Weierstrass equation. Then Proposition 17.7 shows that $\Sigma $ has no cluster of size $>g+1$ with depth $>0$, but has some cluster ${{\mathfrak {t}}}$ of size $\ge g+1$ with $d_{{\mathfrak {t}}}\ge 0$, and $v(c_f)=-\sum _{r\notin {{\mathfrak {t}}}}d_{r\wedge {{\mathfrak {t}}}}$. Since $\Sigma $ is not exceptional, ${{\mathfrak {t}}}$ is necessarily $G_K$-stable, so condition (2) holds.

Finally, suppose $\Sigma $ is not exceptional and satisfies (2). If ${{\mathfrak {t}}}=\mathcal {R}$ then $d_{{\mathfrak {t}}}=0$ and $n=0$, so $(\Sigma ,n)$ is integral. If ${{\mathfrak {t}}}\ne \mathcal {R}$, then

$$\begin{aligned} \sum _{r\notin {{\mathfrak {t}}}}d_{r\wedge {{\mathfrak {t}}}}=|P({{\mathfrak {t}}}){\setminus }{{\mathfrak {t}}}|d_{P({{\mathfrak {t}}})}+\sum _{r\notin P({{\mathfrak {t}}})}d_{r\wedge P({{\mathfrak {t}}})}, \end{aligned}$$

so $(\Sigma ,n)$ is again integral (with ${{\mathfrak {s}}}=P({{\mathfrak {t}}})$ in Definition 13.2). By the semistability criterion $G_K$ acts tamely on $\mathcal {R}$, so by Theorem 13.3(2) there exists $z \in K$ such that $f(x-z) \in {\mathcal {O}}_K[x]$, and by Proposition 17.7, $y^2=f(x-z)$ is a minimal Weierstrass equation. $\square $

Proof of Theorem 16.2

The first part of the theorem follows from Lemmas 16.4 and 16.5. From now on, suppose C/K is semistable.

First suppose that $\Sigma $ is exceptional. A substitution of the form $x'=\pi _K^a x$, $y'=\pi _K^b y$ gives a curve whose cluster picture is obtained from $\Sigma $ by increasing the depth of all clusters so that $\mathcal {R}$ gets depth 0, and whose valuation of the leading term of the defining polynomial is either 0 or 1. By Theorem 17.2(1) this is the cluster picture of a minimal Weierstrass equation of C. The result follows from Lemma 16.6(i,iii).

Now suppose that $\Sigma $ is not exceptional. Let $(\Sigma ^b,\mathcal {R}^b,d^b)$ be the equivalent balanced cluster picture (in the sense of Lemma 15.1). Let $C'$ be the Weierstrass model given by Theorem 15.2. By Theorem 17.2, this is a minimal Weierstrass model. Its cluster picture is either already balanced, or is obtained from the balanced one by redistributing the depth of a child of $\mathcal {R}$ of size $g\!+\!1$. Thus, by Lemmas 16.4 and 16.6(iv),

$$\begin{aligned} v(\Delta _C^{\min }) = v(\Delta _{C'}) = v(\Delta _{\Sigma ,0}) = v(\Delta _{\Sigma ^b,0}). \end{aligned}$$

The balanced cluster picture $(\Sigma ^b,0)$ is also obtained from $(\Sigma ,v(c_f))$ itself by (i) adding a root if necessary to make $\mathcal {R}$ have size $2g+2$, (ii) increasing the depth of all clusters so that $\mathcal {R}$ has depth 0, (iii) changing the valuation of the leading term to 0, and (iv) repeatedly redistributing the depth of a child of $\mathcal {R}$ of size $\ge g\!+\!1$ to eliminate all clusters of size $>g\!+\!1$ and make the ones of size $g+1$ have equal depths. Thus by Lemma 16.6,

$$\begin{aligned} v(\Delta _C^{\min })&= v(\Delta _{\Sigma ^b,0}) = v(\Delta _C) + \left\{ \begin{array}{ll} 0 & \text {if } |\mathcal {R}| \text { even}\\ 2(2g+1) d_{\mathcal {R}} & \text {if } |\mathcal {R}| \text { odd}\end{array}\right\} \\&\quad -(2g+2)(2g+1)d_\mathcal {R}-2v(c_f)(2g+1)-2(2g+1)\sum _{g+1<|{{\mathfrak {s}}}|<|\mathcal {R}|}\delta _{{\mathfrak {s}}}(|{{\mathfrak {s}}}|-g-1), \end{aligned}$$

which simplifies to give the required expression. $\square $

18 Reduction types and a classification in genus 2

In this section, we propose a notion of the reduction type of a semistable hyperelliptic curve. For elliptic curves these types would correspond to good, split multiplicative of type $\mathrm{I}_n$ or non-split multiplicative of type $\mathrm{I}_n$. Our main input is the cluster picture of the curve which determines the combinatorics of its special fibre as well as several other invariants. Since our residue field is not algebraically closed, we need to keep track of the action of Frobenius on proper clusters and the sign $\epsilon _{{\mathfrak {s}}}({\text {Frob}})$ for even clusters. For elliptic curves with multiplicative reduction this sign will distinguish between split and non-split cases.

Definition 18.1

By the reduction type of a semistable curve of genus $\ge 2$ over a non-archimedean local field we mean (the isomorphism class of) the dual graph of the special fibre of its minimal regular model with Frobenius action and with a genus associated to every vertex.

For hyperelliptic curves there is a notation for such types given in [18, Sect. 8].

Lemma 18.2

The reduction type determines the conductor exponent, whether the curve is deficient and the Tamagawa number and root number of its Jacobian.

Proof

By (2.18), Theorem 2.20, Definition 12.1, Lemma 12.2 and Lemma 2.22, the dual graph and its homology determine these invariants. $\square $

Example 18.3

Table 2 lists all reduction types of semistable genus 2 curves together with their labels. Here the numbers inside the vertices indicate their genus (no number meaning genus 0). We draw an edge of length n to indicate a chain of n edges between $n-1$ genus 0 vertices. The black arrows represent the action of Frobenius on the graph.

Table 2 Reduction types of semistable genus 2 curves

Theorem 18.4

The cluster picture of a semistable hyperelliptic curve over K together with the action of Frobenius on clusters and $\epsilon _{{\mathfrak {s}}}({\text {Frob}})$ for even clusters determine the reduction type of the curve.

Proof

This is clear from the definition of reduction type and Theorem 8.6. $\square $

It follows that one can classify all reduction types of semistable curves of a given genus via their cluster pictures with this extra data. Note that different cluster pictures can give the same reduction type.

Notation 18.5

Given the cluster picture of a semistable hyperelliptic curve, we write the relative depth on all proper clusters (except for $\mathcal {R}$ which is decorated with its depth) at the bottom right corner of the cluster. For every even cluster ${{\mathfrak {s}}}$ such that ${{\mathfrak {s}}}= {{\mathfrak {s}}}^*$ we write a sign $+$ or $-$ on its top right corner to indicate $\epsilon _{{\mathfrak {s}}}({\text {Frob}})$. For every cluster, we link its children that are in the same Frobenius orbit by lines.

Note that the definition of $\epsilon _{{\mathfrak {s}}}$ (see Definition 1.13) depends on a choice of sign for $\theta _{{\mathfrak {s}}}$. A different choice of sign will change the sign parameter on ${{\mathfrak {s}}}$ and on ${\text {Frob}}({{\mathfrak {s}}})$ if these are different clusters.

Example 18.6

Suppose $p \equiv 7 \mod 8$. The curve

$$\begin{aligned} y^2=(x+1)(x-1)(x-(i+p))(x-(i-p))(x-(-i+p))(x-(-i-p)) \end{aligned}$$

has the following cluster picture [cluster picture not reproduced], where i is a square root of $-1$ in $\overline{\mathbb {Q}}_p$. The two twins are ${{\mathfrak {t}}}_1 = \{i+p, i-p\}$ and ${{\mathfrak {t}}}_2 = \{-i+p, -i-p\}$ and hence are swapped by Frobenius. Take $\theta _{{{\mathfrak {t}}}_1} =\theta _{{{\mathfrak {t}}}_2} = 2\sqrt{2}$. Since $p \equiv 7 \mod 8$, $\epsilon _{{{\mathfrak {t}}}_1}({\text {Frob}}) = \epsilon _{{{\mathfrak {t}}}_2}({\text {Frob}}) = +1$. We draw this data as [cluster picture not reproduced]. Note that if we had chosen $\theta _{{{\mathfrak {t}}}_2} = -2\sqrt{2}$ we would have obtained $\epsilon _{{{\mathfrak {t}}}_1}({\text {Frob}}) = \epsilon _{{{\mathfrak {t}}}_2}({\text {Frob}}) = -1$ and the cluster picture [cluster picture not reproduced]. This is the reason why we consider these two the same type.

Finally note that if $p \equiv 3 \mod 8$ then ${\text {Frob}}(\sqrt{2}) = -\sqrt{2}$ and the corresponding cluster pictures would be [two cluster pictures not reproduced].
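The congruence conditions on p in Example 18.6 can be checked numerically. The following is a minimal sketch, assuming only Euler's criterion; the helper name `legendre` is ours and does not come from the paper. It confirms that $-1$ is a non-square mod p in both cases (so i does not lie in $\mathbb {Q}_p$ and Frobenius swaps the two twins), while 2 is a square mod p for $p \equiv 7 \mod 8$ and a non-square for $p \equiv 3 \mod 8$.

```python
def legendre(a: int, p: int) -> int:
    """Legendre symbol (a/p) for an odd prime p, via Euler's criterion."""
    r = pow(a % p, (p - 1) // 2, p)   # r is 0, 1 or p - 1
    return -1 if r == p - 1 else r


# Sample primes: 7, 23, 31 are 7 mod 8, while 3, 11, 19 are 3 mod 8.
for p in (7, 23, 31, 3, 11, 19):
    # Columns: p, p mod 8, symbol of -1, symbol of 2.
    print(p, p % 8, legendre(-1, p), legendre(2, p))
```

Since p is odd and unramified in $\mathbb {Q}_p(\sqrt{2})$, Frobenius fixes $\sqrt{2}$ exactly when the last column is $+1$.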

[18, Sect. 9] explains how to list all the reduction types of semistable hyperelliptic curves of arbitrary genus g. Given a hyperelliptic curve, in order to find its reduction type in that list, we first construct its cluster picture together with the additional data as in Notation 18.5, and then use either Theorem 8.6 or Table 4.20 of [18] to identify (the core of its open) hyperelliptic graph with automorphism induced by Frobenius. Theorem 5.18 guarantees that the latter produces the correct reduction type.

Finally, note that Theorem 15.2 shows that every semistable hyperelliptic curve has a K-rational model with a distinguished cluster picture. For instance in genus 2, curves with cluster pictures that have no clusters of size 4 or 5 cover all K-isomorphism classes (see Theorem 18.7.(2)). In general, hyperelliptic curves of genus g with a cluster picture that has no cluster ${{\mathfrak {s}}}$ of size $g+1<|{{\mathfrak {s}}}|<2g+2$ cover all K-isomorphism classes.

In the rest of this section we give a complete classification for semistable genus 2 curves.

Theorem 18.7

Let C be a hyperelliptic curve over K of genus 2 with cluster picture $\Sigma _C$.

(1)
C/K is semistable if and only if (a) $\Sigma _C$ is one of the pictures in Table 3 with $n,m,k,r,t \in \mathbb {Z}$, (b) the thick black cluster ${{\mathfrak {s}}}$ has depth $d_{{\mathfrak {s}}}\in \mathbb {Z}$ and $\nu _{{\mathfrak {s}}}\in 2\mathbb {Z}$, and (c) wild inertia does not permute any root. In this case its reduction type is the one given in Table 3.
(2)
If C is semistable and $|k|>5$ then there is an isomorphic curve $C'/K$ such that $\Sigma _{C'}$ is the top cluster picture in the second column of the same reduction type as $\Sigma _C$, with the same parameters $n,m,k,r,\epsilon , \delta $ and with $t = r$.
(3)
The Namikawa-Ueno type of a semistable genus 2 curve is as indicated in Table 3.

Table 3 Cluster pictures for semistable genus 2 curves

Proof

The table contains all possible cluster pictures for genus 2 curves, with all possible choices of a permutation of proper clusters and choice of signs on even clusters of the form ${{\mathfrak {s}}}^*$.

(1)
The semistability claim follows from Theorem 7.1 and Proposition C.7 with the thick black cluster as choice for ${{\mathfrak {s}}}$. The reduction type follows from Theorem 8.6.
(2)
By Corollary 15.3 there is a model for the curve whose cluster picture is balanced in the sense of Lemma 15.1. By Theorem 14.4 their cluster pictures are equivalent and in particular their relative depths are related as in the table. Moreover, the special fibres have isomorphic dual graphs with the same action of Frobenius, which pins down the Frobenius action on clusters and the sign parameters as given in the table.
(3)
The Namikawa-Ueno type is determined by the dual graph of the special fibre of the minimal regular model.

$\square $

The arithmetic invariants of genus 2 semistable hyperelliptic curves depend on the reduction type as follows.

Theorem 18.8

Let K be a local field of odd residue characteristic and C/K a semistable hyperelliptic curve of genus 2. Then the reduction type of C/K is one of the ones listed in Table 1 (Sect. 1).

Any genus 2 curve $y^2= f(x)$ with one of the cluster pictures from the table is semistable of the corresponding reduction type. If $f(x) \in {\mathcal {O}}_K[x]$ then this is a minimal Weierstrass model. Moreover if $|k| > 5$, then every semistable C/K admits a model $y^2 = f(x)$, with $f(x) \in {\mathcal {O}}_K[x]$ and one of the listed cluster pictures.

The invariants of the curve and its Jacobian are as given in the table (the value of $v(\Delta _{min})$ is conditional on $|k| > 5$). In the table m is the number of components in the special fibre of the minimal regular model of C, n is the conductor exponent, w is the local root number, c is the Tamagawa number of Jac(C). Def is $-$ or $+$ depending on whether the curve is deficient or not; $(-)^r$ means deficient if and only if r is odd. The column $H_1(\Upsilon _C, \mathbb {Z})$ lists the isomorphism class of the lattice together with automorphism (induced by Frobenius) and pairing (induced by the length pairing on $\Upsilon _C$), in the notation of Theorem 1.2.2 in [4].

Proof

The completeness of the list of reduction types, and the fact that curves with such cluster pictures are semistable and have these types, follow from Theorem 18.7. The claim regarding minimality of the model follows from Theorem 17.2. If $|k|>5$, Theorem 18.7 shows that C admits a model with the cluster picture corresponding to this type. In this case, the value for the valuation of the minimal discriminant follows from Theorem 16.2.

The number of components m is clear from the dual graph $\Upsilon _C$. The conductor exponent n, the root number w and deficiency Def follow from Theorems 9.4, 2.20 and 12.4. The Tamagawa number is explicitly determined by the lattice type by Theorem 1.2.2 in [4], using the fact that $H_1(\Upsilon _C,\mathbb {Z})$ is isomorphic to the character lattice of the torus in the Raynaud parametrisation (Lemma 2.21).

Finally, it remains to prove the claim for the lattice type of $H_1(\Upsilon _C,\mathbb {Z})$. The dimension of the lattice and the eigenvalues of Frobenius come from the cluster picture (with the extra data) as given by Corollary 9.6.

Suppose that $\mathcal {R}$ is not übereven. Then the pairing with respect to the basis given in Theorem 9.3 is diagonal with values given by twice the relative depth of the corresponding twins; the dual lattice and the lattice type follow except for the types $\mathrm{I}^+_{n_{\tilde{~}} n}$ and $\mathrm{I}_n^+\tilde{\times }_r\mathrm{I}_n$. For these two cases, let ${{\mathfrak {t}}}_1, {{\mathfrak {t}}}_2$ be the two twins and $\ell _{{{\mathfrak {t}}}_1}, \ell _{{{\mathfrak {t}}}_2}$ the corresponding loops which generate $H_1(\Upsilon _C,\mathbb {Z})$. The Frobenius invariant/anti-invariant loops are generated by $\ell _{{{\mathfrak {t}}}_1} + \ell _{{{\mathfrak {t}}}_2}$ and $\ell _{{{\mathfrak {t}}}_1} - \ell _{{{\mathfrak {t}}}_2}$ respectively. It follows that $H_1(\Upsilon _C,\mathbb {Z})$ is not spanned by these and hence the type is $[1.2_B:*,*]$. However, over an unramified quadratic extension, it becomes a [1.1 : n, n] and hence by Theorem 1.2.2 in [4] with $f=2$, the type is a $[1.2_B:n,n]$.

Suppose $\mathcal {R}$ is übereven. For the case $U^+_{n_{\tilde{~}} n,k}$, let $\ell ^+= \ell _{{{\mathfrak {t}}}_1}+\ell _{{{\mathfrak {t}}}_2}-2\ell _{{{\mathfrak {t}}}_3}$, $\ell ^- = \ell _{{{\mathfrak {t}}}_1}-\ell _{{{\mathfrak {t}}}_2}$ be $\mathbb {Z}$-generators for the space of invariant and anti-invariant loops respectively. Note that $\frac{1}{2}(\ell ^+ + \ell ^-) \in H_1(\Upsilon _C,\mathbb {Z})$ so that the lattice type is $[1.2_B:*,*]$. Now, $\langle a\ell ^+, \ell ^+\rangle = a(2n+4k)$ and $\langle a\ell ^+, \ell _{{{\mathfrak {t}}}_1}-\ell _{{{\mathfrak {t}}}_3}\rangle = a(n+2k)$ so that $a\ell ^+ \in H_1(\Upsilon _C,\mathbb {Z})^{\vee }$ if and only if $a \in \frac{1}{n+2k}\mathbb {Z}$. Similarly, $\langle a\ell ^-, \ell ^+\rangle =0$ and $\langle a\ell ^-, \ell _{{{\mathfrak {t}}}_1}-\ell _{{{\mathfrak {t}}}_3}\rangle = an$ so that $a\ell ^- \in H_1(\Upsilon _C,\mathbb {Z})^{\vee }$ if and only if $a \in \frac{1}{n}\mathbb {Z}$. It follows that the lattice type is $[1.2_B:n+2k,n]$. The case $U^-_{n_{\tilde{~}} n,k}$ follows by swapping the roles of $\ell ^+$ and $\ell ^-$.
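The pairing values used for $U^+_{n_{\tilde{~}} n,k}$ can be reproduced with a few lines of arithmetic. This is only a sketch under an assumption read off from the displayed values: the three half loops are taken to be mutually orthogonal with $\langle \ell _{{{\mathfrak {t}}}_1},\ell _{{{\mathfrak {t}}}_1}\rangle = \langle \ell _{{{\mathfrak {t}}}_2},\ell _{{{\mathfrak {t}}}_2}\rangle = n$ and $\langle \ell _{{{\mathfrak {t}}}_3},\ell _{{{\mathfrak {t}}}_3}\rangle = k$. The names `pair`, `in_dual` and `scale`, and the sample values of n and k, are hypothetical.

```python
from fractions import Fraction


def pair(u, v, lengths):
    """Pairing of two vectors in half-loop coordinates (l1, l2, l3),
    assuming the half loops are orthogonal with <li, li> = lengths[i]."""
    return sum(a * b * w for a, b, w in zip(u, v, lengths))


n, k = 3, 5                       # sample relative depths (arbitrary choice)
lengths = (n, n, k)
l_plus, l_minus = (1, 1, -2), (1, -1, 0)
h1, h2 = (1, 0, -1), (0, 1, -1)   # basis l1 - l3, l2 - l3 of H_1


def scale(a, v):
    """Scalar multiple a * v."""
    return tuple(a * x for x in v)


def in_dual(v):
    """v lies in the dual lattice iff it pairs integrally with h1 and h2."""
    return all(Fraction(pair(v, h, lengths)).denominator == 1 for h in (h1, h2))


print(pair(l_plus, l_plus, lengths), pair(l_plus, h1, lengths))
print(pair(l_minus, l_plus, lengths), pair(l_minus, h1, lengths))
```

With these sample values, `in_dual(scale(Fraction(1, n + 2 * k), l_plus))` and `in_dual(scale(Fraction(1, n), l_minus))` both hold, in line with the bounds on a derived in the proof.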

For the case $U_{n,m,k}^{+}$, let ${{\mathfrak {t}}}_1, {{\mathfrak {t}}}_2, {{\mathfrak {t}}}_3$ be the three twins and $\ell _{{{\mathfrak {t}}}_1}, \ell _{{{\mathfrak {t}}}_2}, \ell _{{{\mathfrak {t}}}_3}$ be the corresponding half loops. Choose the loops $h_1 = \ell _{{{\mathfrak {t}}}_1}- \ell _{{{\mathfrak {t}}}_3}$ and $h_2 = \ell _{{{\mathfrak {t}}}_2}- \ell _{{{\mathfrak {t}}}_3}$ as a basis for $H_1(\Upsilon _C,\mathbb {Z})$. Then for $a, b \in \mathbb {Q}$, $\ell = ah_1+bh_2 \in H_1(\Upsilon _C,\mathbb {Z})^{\vee }$ if and only if $\langle \ell , h_1\rangle = a(n+k) + bk\in \mathbb {Z}$ and $\langle \ell , h_2 \rangle =b(m+k)+ak \in \mathbb {Z}$ i.e.

$$\begin{aligned} \begin{pmatrix}n+k &{} k\\ k&{}m+k \end{pmatrix}\begin{pmatrix} a\\ b \end{pmatrix} \in \mathbb {Z}^2. \end{aligned}$$

By properties of Smith normal forms, there exists a $\mathbb {Z}$-basis $g_1, g_2$ of $H_1(\Upsilon _C,\mathbb {Z})$ such that $ug_1+vg_2 \in H_1(\Upsilon _C,\mathbb {Z})^{\vee }$ if and only if

$$\begin{aligned} \begin{pmatrix}\gcd (n+k, m+k, k) &{} 0\\ 0&{} \frac{\det (M)}{\gcd (n+k, m+k, k)} \end{pmatrix}\begin{pmatrix} u\\ v \end{pmatrix} =\begin{pmatrix}d &{} 0\\ 0&{} \frac{t}{d} \end{pmatrix}\begin{pmatrix} u\\ v \end{pmatrix} \in \mathbb {Z}^2, \end{aligned}$$

where $M = \begin{pmatrix}n+k &{} k\\ k&{}m+k \end{pmatrix}$, so that $d = \gcd (n,m,k)$ and $t = \det (M) = nm+nk+mk$. It follows that the lattice type is [1.1 : d, t/d].
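The invariant factors in the Smith normal form step are easy to compute directly. The sketch below uses the standard fact that, for a $2\times 2$ integer matrix, the first invariant factor is the gcd of the entries and the product of the two is the absolute value of the determinant. The function name `smith_2x2` is ours, and n, m, k are assumed to be positive integers.

```python
from math import gcd


def smith_2x2(n: int, m: int, k: int):
    """Invariant factors (d, t/d) of M = [[n+k, k], [k, m+k]].

    d is the gcd of the entries of M and t is det(M); for positive
    n, m, k the determinant is positive, so no absolute value is needed.
    """
    d = gcd(gcd(n + k, m + k), k)
    t = (n + k) * (m + k) - k * k
    return d, t // d


print(smith_2x2(2, 4, 6))   # (2, 22)
```

For instance, n = 2, m = 4, k = 6 gives d = 2 and t = 44, so the lattice type would be [1.1 : 2, 22].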

In the cases $U_{n,m,k}^-$, $U_{n_{\tilde{~}} n_{\tilde{~}} n}^+$ and $U_{n_{\tilde{~}} n_{\tilde{~}} n}^-$, the eigenvalues of Frobenius are $(-1,-1)$, $ (\zeta _3, \zeta _3^{-1})$ and $(\zeta _6, \zeta _6^{-1})$, respectively. The lattices become [1.1 : d, t/d] after an unramified extension of degree 2, 3 and 6, respectively. It follows from Theorem 1.2.2 in [4] using the $f=2,3$ columns that the original lattice types are [2.2 : d, t/d], [3 : n] and [6 : n], respectively. $\square $
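The Frobenius eigenvalues quoted for the remaining übereven cases can be sanity-checked from characteristic polynomials. The sketch below makes two assumptions that are ours rather than statements of the paper: Frobenius cycles the three half loops $\ell _{{{\mathfrak {t}}}_1}\rightarrow \ell _{{{\mathfrak {t}}}_2}\rightarrow \ell _{{{\mathfrak {t}}}_3}\rightarrow \ell _{{{\mathfrak {t}}}_1}$ in the cases $U_{n_{\tilde{~}} n_{\tilde{~}} n}^{\pm }$, and the sign on the übereven cluster multiplies the whole action by $\pm 1$. The names `char_poly`, `neg`, `CYCLE` and `IDENTITY` are hypothetical.

```python
def char_poly(M):
    """Coefficients (1, -trace, det) of the characteristic polynomial of a 2x2 matrix."""
    (a, b), (c, d) = M
    return (1, -(a + d), a * d - b * c)


def neg(M):
    """Entrywise negation of a matrix given as a tuple of rows."""
    return tuple(tuple(-x for x in row) for row in M)


# Assumed action: in the basis h1 = l1 - l3, h2 = l2 - l3 the 3-cycle
# sends h1 -> l2 - l1 = h2 - h1 and h2 -> l3 - l1 = -h1.
CYCLE = ((-1, -1), (1, 0))     # columns are the images of h1 and h2
IDENTITY = ((1, 0), (0, 1))

print(char_poly(neg(IDENTITY)))  # (1, 2, 1):  (x + 1)^2, eigenvalues -1, -1
print(char_poly(CYCLE))          # (1, 1, 1):  x^2 + x + 1, primitive cube roots of unity
print(char_poly(neg(CYCLE)))     # (1, -1, 1): x^2 - x + 1, primitive sixth roots of unity
```

Under these assumptions the three polynomials match the eigenvalue pairs $(-1,-1)$, $(\zeta _3, \zeta _3^{-1})$ and $(\zeta _6, \zeta _6^{-1})$ stated in the proof.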

19 Local constancy of arithmetic invariants

Observe that all our main results on a hyperelliptic curve $C: y^2 = f(x)$ are obtained from fairly coarse data coming from the roots of f(x). It follows that small p-adic perturbations of the roots of f(x) do not change the arithmetic invariants of C. Here is a precise formulation:

Theorem 19.1

Suppose $C_1/K: y^2=f_1(x)$, $C_2/K: y^2=f_2(x)$ are two hyperelliptic curves, such that

(a)
The leading coefficients $c_{f_1}$ of $f_1$ and $c_{f_2}$ of $f_2$ satisfy $\frac{c_{f_1}}{c_{f_2}}\in K^{\times 2}$.
(b)
There is a Galois-equivariant bijection $\phi : \{\text {roots of } f_1\}\rightarrow \{\text {roots of } f_2\}$ such that $\frac{\phi (r_i)-\phi (r_j)}{r_i-r_j}\equiv 1$ mod ${\mathfrak {m}}$ for all roots $r_i\ne r_j$ of $f_1$.

Then

(1)
$C_1$ and $C_2$ acquire semistable reduction over the same extensions of K.
(2)
If $C_1, C_2$ are semistable over a finite Galois extension F of K, then the special fibres of their minimal regular models over ${\mathcal {O}}_{F^{{\text {nr}}}}$ are isomorphic as curves with the semilinear action of $G_K$ given by (2.16).
(3)
$H^1_{\acute{\mathrm{e}}\text {t}}(C_1) \cong H^1_{\acute{\mathrm{e}}\text {t}}(C_2)$ as $G_K$-modules, for every $l\ne p$.
(4)
If $C_1, C_2$ are semistable and $|k|>2g+1$, then the valuations of their minimal discriminants are equal.
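Condition (b) is straightforward to test when all roots are rational, in which case the Galois action is trivial and equivariance is automatic. The following is a minimal sketch for $K=\mathbb {Q}_p$, assuming the bijection $\phi $ matches the two root lists index by index; the names `val` and `satisfies_b` and the sample roots are hypothetical.

```python
from fractions import Fraction
from itertools import combinations


def val(x, p: int) -> int:
    """p-adic valuation of a nonzero rational number."""
    x = Fraction(x)
    v, num, den = 0, x.numerator, x.denominator
    while num % p == 0:
        num //= p
        v += 1
    while den % p == 0:
        den //= p
        v -= 1
    return v


def satisfies_b(roots1, roots2, p: int) -> bool:
    """Check (phi(ri) - phi(rj)) / (ri - rj) = 1 mod m for all i != j,
    where phi sends roots1[i] to roots2[i]."""
    for i, j in combinations(range(len(roots1)), 2):
        ratio = Fraction(roots2[i] - roots2[j], roots1[i] - roots1[j])
        if ratio != 1 and val(ratio - 1, p) <= 0:
            return False
    return True


roots1 = [0, 1, 5, 10, 2, 3]
print(satisfies_b(roots1, [25, 26, 5, 60, 2, 3], 5))   # roots moved by multiples of 25
print(satisfies_b(roots1, [5, 1, 0, 10, 2, 3], 5))     # two roots of a cluster swapped
```

The first perturbation moves each root by a multiple of $5^2$, which is smaller than every difference of roots, so (b) holds; the second replaces a difference by its negative and (b) fails.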

Remark 19.2

By (3) $C_1$ and $C_2$ share the same conductor exponent and root number. By (2) if $C_1$ and $C_2$ are semistable then they have the same reduction type (in the sense of Definition 18.1). It follows that $C_1$ is deficient if and only if $C_2$ is, and that their Jacobians have the same Tamagawa number.

Proof of Theorem 19.1

First note that by (a) and after a change of variable, we may assume that $c_{f_1} = c_{f_2}$. Moreover by (b), $\phi $ induces an isomorphism of cluster pictures, preserving depths and the Galois action on the roots.

(1) Follows from the semistability criterion (Theorem 7.1).

(2) There is a one-to-one correspondence between valid discs (cf. Sect. 4) for $C_1$ and $C_2$ over F, defined as follows. By the semistability criterion, every proper cluster ${{\mathfrak {s}}}$ for $C_1$ contains either an $I_F$-invariant root r or a twin consisting of $I_F$-conjugate roots $r_1$, $r_2$. Let $z_{{\mathfrak {s}}}=r$ in the first case, and $z_{{\mathfrak {s}}}=(r_1+r_2)/2$ in the second case. Then every valid disc for $C_1$ has a centre $z_{{\mathfrak {s}}}\in F^{{\text {nr}}}$ of this type. It corresponds to a valid disc of $C_2$ that has centre $\phi (r)$, respectively, $(\phi (r_1)+\phi (r_2))/2$, and the same radius. This gives the one-to-one correspondence $D\leftrightarrow \phi (D)$.

Next, we claim that the reduction maps agree, that is ${\text {red}}_D(r)={\text {red}}_{\phi (D)}(\phi (r))$ for every root r and valid disc D of $C_1$. As the radii are the same, this is equivalent to

$$\begin{aligned} \frac{r-z_D}{\phi (r)-z_{\phi (D)}} \equiv 1 \mod {\mathfrak {m}}. \end{aligned}$$

This is clear from (b) if $z_D$ is a root. Suppose $z_D=(r_1+r_2)/2$. If $r\in \{r_1,r_2\}$, this is again clear. Otherwise,

$$\begin{aligned} \frac{r-z_D}{\phi (r)-z_{\phi (D)}} \equiv \frac{r-r_1}{\phi (r)-\phi (r_1)} \equiv 1 \mod {\mathfrak {m}}, \end{aligned}$$

because $r-z_D=(r-r_1)+\frac{r_1-r_2}{2}$, and the second term has higher valuation than the first (and similarly for $\phi (r)-z_{\phi (D)}$). It follows from Definitions 1.13 and 8.2 that all $\alpha $, $\beta $, $\gamma $, $\epsilon $ are the same for the corresponding discs (using the same argument as above for $\beta $ in the case of non-root centres), and by Proposition 5.20 and Theorem 6.2, the special fibres are the same, with the same Galois action.

(3) By (2.18), the Tate module $V_l {\text {Jac}}(C)$, and hence $H^1_{\acute{\mathrm{e}}\text {t}}(C)$, is determined as a Galois module by the special fibre of the minimal regular model over F together with the $G_K$-action (2.17). These are the same for $C=C_1$ and $C=C_2$ by (2).

(4) Since the cluster pictures are the same, so are the minimal discriminants by Lemma 16.4 and Theorem 16.2. $\square $

Corollary 19.3

Suppose $C_1:y^2 = c_1f_1(x)$ and $C_2:y^2 = c_2f_2(x)$ are two hyperelliptic curves with $c_1, c_2 \in K^{\times }$ and $f_1(x), f_2(x) \in {\mathcal {O}}_K[x]$ monic polynomials. If $\frac{c_1}{c_2} \in K^{\times 2}$ and $f_1(x) \equiv f_2(x) \mod \pi ^{d+1}$ where d is the largest depth among the depths of all proper clusters of $C_1$, then

(1)
$H^1_{\acute{\mathrm{e}}\text {t}}(C_1) \cong H^1_{\acute{\mathrm{e}}\text {t}}(C_2)$ as $G_K$-modules for every $l\ne p$; in particular, $C_1$ and $C_2$ have the same conductor exponent and the same root number.
(2)
If $C_1$ is semistable then so is $C_2$. In this case, the special fibres of their minimal regular models over ${\mathcal {O}}_{K^{nr}}$ are isomorphic as curves with an action of Frobenius, their Jacobians have the same Tamagawa number, $C_2$ is deficient if and only if $C_1$ is and, if $|k|>\deg f_1(x)$, the valuations of their minimal discriminants are equal.

Proof

By hypothesis, the condition (a) of Theorem 19.1 holds. Also, as $f_1(x)$ and $f_2(x)$ are monic and congruent mod $\pi $, they have the same degree.

Let F be the splitting field of $f_1(x)$ and $\alpha _1, \ldots , \alpha _n \in F$ its roots. Note that $\alpha _i \mod \pi ^{d+1}$ is a root of $f_2(x) \mod \pi ^{d+1}$ for all i. By definition of depth, these are all distinct so that by Hensel’s Lemma, the roots $\beta _1,\ldots ,\beta _n$ of $f_2(x)$ can be ordered so that $\alpha _i \equiv \beta _i \mod \pi ^{d+1}$. Now if $\sigma (\alpha _i) = \alpha _j$ then $\sigma (\beta _i) \equiv \beta _j \mod \pi ^{d+1}$ and hence $\sigma (\beta _i) = \beta _j$. Finally by choice of d, we have $\beta _i-\beta _j\equiv \alpha _i-\alpha _j \not \equiv 0 \mod \pi ^{d+1}$, so that $\frac{\beta _i-\beta _j}{\alpha _i-\alpha _j }\equiv 1 \mod {\mathfrak {m}}$. The result follows from Theorem 19.1 and Remark 19.2. $\square $
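The quantity d in Corollary 19.3 can be computed directly when the roots are integers. The sketch below, for $K=\mathbb {Q}_p$, uses the fact that the largest depth of a proper cluster equals the largest valuation of a difference of two distinct roots. It only illustrates the easy direction (roots congruent mod $p^{d+1}$ give polynomials congruent mod $p^{d+1}$), whereas the proof goes the other way via Hensel's Lemma. All function names and sample roots are hypothetical.

```python
from itertools import combinations


def val(n: int, p: int) -> int:
    """p-adic valuation of a nonzero integer."""
    v = 0
    while n % p == 0:
        n //= p
        v += 1
    return v


def max_depth(roots, p: int) -> int:
    """Largest depth of a proper cluster: max of v(r - s) over distinct roots r, s."""
    return max(val(r - s, p) for r, s in combinations(roots, 2))


def monic_from_roots(roots):
    """Coefficients (constant term first) of the product of (x - r) over the roots."""
    coeffs = [1]
    for r in roots:
        shifted = [0] + coeffs                     # x * current polynomial
        scaled = [-r * c for c in coeffs] + [0]    # -r * current polynomial
        coeffs = [a + b for a, b in zip(shifted, scaled)]
    return coeffs


def congruent(f, g, modulus: int) -> bool:
    """Coefficientwise congruence of two polynomials of equal degree."""
    return len(f) == len(g) and all((a - b) % modulus == 0 for a, b in zip(f, g))


p = 5
roots1 = [0, 1, 5, 10, 2, 3]
roots2 = [25, 26, 5, 60, 2, 3]     # each root moved by a multiple of p^(d+1)
d = max_depth(roots1, p)
print(d, congruent(monic_from_roots(roots1), monic_from_roots(roots2), p ** (d + 1)))
```

Here d = 1, and the two monic polynomials agree mod $5^2$ as required by the hypothesis of the corollary.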

Availability of data and material

Not applicable.

Notes

hyperelliptic curves will always be assumed to have genus at least 2; we will not consider double covers of general conics.
By ‘tame’ we mean semistable over some tamely ramified extension of K; this is automatically the case if $p>2g+1$.
Although ${\tilde{\gamma }}_{{\mathfrak {s}}}$ is ${\bar{\mathbb {Q}}}_l^\times $-valued, the tensored representation is realisable over $\mathbb {Q}_l$, and we view it as a $\mathbb {Q}_l$-representation in this formula.
In fact, here and in Sect. 2.2, K could be any complete discretely valued field with perfect residue field.
any two choices for $z_{{\mathfrak {t}}}$ give the same disc, cf. Lemma 4.8 below.
here $Z_\text{ red }$ denotes the component Z with its reduced structure.
here and below, by $\phi ^{-1}(E_D)$ we formally mean the scheme ${\mathcal {C}}_{\text{ disc }}\times _{{\mathcal {Y}}_{\text{ disc }}}E_D$.
isomorphic to $\mathbb {P}^1_{{\bar{k}}}$ with self intersection $-1$.
We have only defined the dual graph of a semistable curve - whilst ${\mathcal {C}}_{\text{ disc },\bar{k}}$ is not semistable it is when given its reduced structure (see Proposition 5.12) and we formally mean the dual graph of this curve.
other than in cases II and III, the curves are understood to have the additional chart at infinity corresponding to our conventions for (possibly singular) hyperelliptic curves set out in Appendix A.
That such curves may be contracted will be shown during the course of the proof.
Note that a permutation of the proper clusters preserving size and inclusion is determined by its action on principal clusters and twins.
recall that $\sigma $ acts as in (2.17) on $\mathcal {C}_{min,\bar{k}}$.
in (ii) if $|{{\mathfrak {s}}}| =2g+2$ then the point reduces to one of the two points at infinity on $\Gamma _{{\mathfrak {s}}}$, see Proposition 5.23.(i) to determine which one.
i.e. G acts on $\{+x,-x | x\in X\}$ by $g(\pm x)=\pm \varepsilon _x(g) g(x)$ with $x\mapsto g(x)$ a G-action and $\varepsilon _x(g)\in \{\pm \}$ satisfying $\varepsilon _{x}(gh)=\varepsilon _{hx}(g)\varepsilon _x(h)$.
The formulae of Definition 8.2 only make sense when $ed_{{\mathfrak {s}}}$ and $e\tilde{\lambda }_{{\mathfrak {s}}}$ are integers. However this is always the case when F is suitably large and (cf. Remark 8.3) these characters, when defined, are independent of the choice of F.
This is subtly different from the action of a group by automorphisms on metric cluster pictures, which we use specifically for semistable curves; see Definition D.4.
we will freely still refer to $v_X$ in this case, understanding that it is simply a point on $T_\Sigma $ rather than a vertex.
graph isomorphism of order $\le $ 2.

References

Anni, S., Dokchitser, V.: Constructing hyperelliptic curves with surjective Galois representations. Trans. Am. Math. Soc. 373(2020), 1477–1500 (2019)
Berkovich, V.: Spectral theory and analytic geometry over non-Archimedean fields. Mathematical Surveys and Monographs, AMS, Providence, RI (1990)
Best, A.J., Betts, L.A., Bisatt, M., van Bommel, R., Dokchitser, V., Faraggi, O., Kunzweiler, S., Maistret, C., Morgan, A., Muselli, S., Nowell, S.: A user’s guide to the local arithmetic of hyperelliptic curves, to appear in the Bulletin of the London Mathematical Society
Betts, A., Dokchitser, V.: Variation of Tamagawa numbers of semistable abelian varieties in field extensions. Math. Proc. Camb. Philos. Soc. 166(3), 487–521 (2018)
Betts, L.A.: Variation of Tamagawa numbers of Jacobians of hyperelliptic curves with semistable reduction, to appear in Journal of Number Theory
Bisatt, M.: Clusters, inertia and root numbers, to appear in Functiones et Approximatio, Commentarii Mathematici
Bosch, S.: Formelle Standardmodelle hyperelliptischer Kurven. Math. Ann. 251, 19–42 (1980)
Bosch, S., Liu, Q.: Rational points on the group of components of a Néron model. Manuscripta Math. 98(3), 275–293 (1999)
Bosch, S., Lütkebohmert, W., Raynaud, M.: Néron Models Erg. Math., vol. 21. Springer, Berlin (1990)
Bouw, I., Wewers, S.: Computing $L$-functions and semistable reduction of superelliptic curves. Glasgow Math. J. 59(1), 77–108 (2017)
Cornelissen, G.: Two-torsion in the Jacobian of hyperelliptic curves over finite fields. Arch. Math. 77, 241–246 (2001)
Coates, J., Fukaya, T., Kato, K., Sujatha, R.: Root numbers, Selmer groups and non-commutative Iwasawa theory. J. Alg. Geom. 19(2010), 19–97 (2007)
Deligne, P., Mumford, D.: The irreducibility of the space of curves of given genus. Publ. Math. IHÉS, Tome 36, 75–109 (1969)
Dokchitser, T., Dokchitser, V.: Regulator constants and the parity conjecture. Invent. Math. 178, 23–71 (2009)
Dokchitser, T., Dokchitser, V.: Euler factors determine local Weil representations. J. Reine Angew. Math. 717, 35–46 (2016)
Dokchitser, T., Dokchitser, V.: Quotients of hyperelliptic curves and étale cohomology. Quart. J. Math. 1–22 (2018). https://doi.org/10.1093/qmath/hax053
Dokchitser, T., Dokchitser, V.: Local Galois representations and Frobenius traces, arXiv:2201.04094
Dokchitser, T., Dokchitser, V., Maistret, C., Morgan, A.: Semistable types of hyperelliptic curves. In: Algebraic Curves and Their Applications, Beshaj, L., Shaska, T. (eds.), Contemporary Math., vol. 724, 73–136 (2019)
Dokchitser, T., Dokchitser, V., Morgan, A.: Tate module and bad reduction. Proc. Am. Math. Soc. 149, 1361–1372 (2021)
Dokchitser, V., Maistret, C.: Parity conjecture for abelian surfaces. arXiv:1911.04626
Faraggi, O., Nowell, S.: Models of hyperelliptic curves with tame potentially semistable reduction. Trans. Lond. Math. Soc. 7(1), 49–95 (2020)
Grothendieck, A.: Modèles de Néron et monodromie, LNM 288, Séminaire de Géométrie 7, Exposé I, Springer (1973)
Grothendieck, A.: Modèles de Néron et monodromie, LNM 288, Séminaire de Géométrie 7, Exposé IX, Springer (1973)
Kausz, I.: A discriminant and an upper bound for $\omega ^2$ for hyperelliptic arithmetic surfaces. Compos. Math. 115(1), 37–69 (1999)
Kunzweiler, S.: Differential forms on hyperelliptic curves with semistable reduction. Res. Number Theory 6(25), 1–17 (2020)
Liu, Q.: Algebraic Geometry and Arithmetic Curves. Oxford University Press, Oxford (2002)
Liu, Q.: Conducteur et discriminant minimal de courbes de genre 2. Compos. Math. 94, 51–79 (1994)
Liu, Q.: Modèles entiers des courbes hyperelliptiques sur un corps de valuation discrète. Trans. Am. Math. Soc. 348(11), 4577–4610 (1996)
Obus, A., Wewers, S.: Explicit resolution of weak wild quotient singularities on arithmetic surfaces. J. Algebraic Geom. 29(4), 691–728 (2020). https://doi.org/10.1090/jag/745
Poonen, B., Stoll, M.: The Cassels–Tate pairing on polarized abelian varieties. Ann. Math. 150, 1109–1149 (1999)
Raynaud, M.: Variétés abéliennes et géométrie rigide, Actes du congrès international de Nice 1970, tome 1, 473–477
Rüth, J.: Models of curves and valuations, Ph.D. Thesis (2014). Available at https://oparu.uni-ulm.de/xmlui/handle/123456789/3302
Saito, T.: Conductor, discriminant, and the Noether formula of arithmetic surfaces. Duke Math. J. 57, 151–173 (1988)
Serre, J.-P.: Linear Representations of Finite Groups, GTM 42. Springer, Berlin (1977)
Serre, J.-P., Tate, J.: Good reduction of abelian varieties. Ann. Math. 68, 492–517 (1968)
Srinivasan, P.: Conductors and minimal discriminants of hyperelliptic curves with rational Weierstrass points. arXiv:1508.05172
Tate, J.: Number theoretic background. In: Borel, A., Casselman, W. (eds.), Automorphic forms, representations and L-functions, Part 2, Proc. Symp. in Pure Math. 33, AMS, Providence, RI, 3–26 (1979)

Acknowledgements

We would like to thank the Warwick Mathematics Institute, University of Bristol, King’s College London, Boston University, the Max Planck Institute for Mathematics in Bonn and University College London, where parts of this research were carried out. We would also like to thank Matthew Bisatt, Simone Muselli and Omri Faraggi for helpful comments.

Funding

This research was supported by the Engineering and Physical Sciences Research Council (EPSRC) Grants EP/M016838/1 and EP/M016846/1 ‘Arithmetic of hyperelliptic curves’. The second author is supported by a Royal Society University Research Fellowship.

Author information

Authors and Affiliations

Department of Mathematics, University of Bristol, Bristol, BS8 1TW, UK
Tim Dokchitser & Céline Maistret
University College London, London, WC1H 0AY, UK
Vladimir Dokchitser
School of Mathematics and Statistics, University of Glasgow, Glasgow, G12 8QQ, UK
Adam Morgan

Corresponding author

Correspondence to Vladimir Dokchitser.

Ethics declarations

Conflicts of interest

On behalf of all authors, the corresponding author states that there is no conflict of interest.

Code availability

Not applicable.

Additional information

Communicated by Wei Zhang.

Appendices

Appendix A. Hyperelliptic curves

Let K be a field with $\text{ char }(K)\ne 2$. By a (possibly singular) hyperelliptic curve

$$\begin{aligned} C:Y^2 = f(X), \end{aligned}$$

where $f(X)\in K[X]$ is of degree $2g+1$ or $2g+2$, has leading coefficient $c_f$ and has at worst double roots, we mean the projective curve given by glueing the pair of affine patches

$$\begin{aligned} U_X:Y^2 = f(X) ~~\text { and }\quad U_T:V^2 = T^{2g+2}f(\frac{1}{T}) \end{aligned}$$

along $X = \frac{1}{T}$ and $Y = \frac{V}{T^{g+1}}$. By the points at infinity on C we mean the points of $C{\setminus } U_X$, i.e. the points with $T=0$ on $U_T$. If $\text{ deg }(f)=2g+1$ there is a unique such,

$$\begin{aligned} P^{\infty }=(0,0), \end{aligned}$$

whilst if $\text{ deg }(f)=2g+2$ then

$$\begin{aligned} P^{\infty }_{\pm \sqrt{c_f}}=(0,\pm \sqrt{c_f}) \end{aligned}$$

are the two points on $U_T$ with $T=0$. Note that the points at infinity are always nonsingular. The singular points of C, all of which are nodes, are precisely those of the form (r, 0) on $U_X$ where r is a double root of f(X).
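The description of the points at infinity can be read off from the constant term of $T^{2g+2}f(\frac{1}{T})$. The following sketch computes the coefficients of that polynomial from those of f; the function names and the two sample polynomials are hypothetical, and coefficient lists are stored constant term first.

```python
def chart_at_infinity(coeffs, g: int):
    """Coefficients (constant term first) of T^(2g+2) * f(1/T),
    where coeffs lists the coefficients of f, constant term first."""
    deg = len(coeffs) - 1
    assert deg in (2 * g + 1, 2 * g + 2)
    padded = coeffs + [0] * (2 * g + 2 - deg)   # view f as having degree 2g+2
    return padded[::-1]                         # X^i becomes T^(2g+2-i)


def v_squared_at_infinity(coeffs, g: int):
    """Value of V^2 at T = 0: zero for odd degree, c_f for even degree."""
    return chart_at_infinity(coeffs, g)[0]


odd = [1, 0, 0, 0, 0, 1]        # f(X) = X^5 + 1,        g = 2
even = [2, 1, 0, 0, 0, 0, 3]    # f(X) = 3X^6 + X + 2,   g = 2
print(chart_at_infinity(odd, 2), v_squared_at_infinity(odd, 2))
print(chart_at_infinity(even, 2), v_squared_at_infinity(even, 2))
```

In the odd degree example $V^2 = T + T^6$ vanishes at $T=0$, giving the single point (0, 0), while in the even degree example $V^2 = 3 + T^5 + 2T^6$ takes the value $c_f = 3$ at $T=0$, giving the two points $(0,\pm \sqrt{3})$.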

Write $f(X)=c_fg(X)h(X)^2$ with g(X), h(X) monic and square free. Then the normalization of C is the hyperelliptic curve

$$\begin{aligned} \widetilde{C}:Y^2 = c_fg(X) \end{aligned}$$

(A.1)

and the canonical morphism $\widetilde{C}\rightarrow C$ is given (on the chart $U_X$) by

$$\begin{aligned} (x,y) \mapsto (x, yh(x)). \end{aligned}$$

(A.2)

The points on $\widetilde{C}$ above a node (r, 0) are

$$\begin{aligned} N_r^{\pm \sqrt{c_fg(r)}} =(r, \pm \sqrt{c_fg(r)}). \end{aligned}$$

(A.3)
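The normalization map (A.2) and the points (A.3) above a node can be illustrated over a small finite field. This is a sketch with a hypothetical example over $\mathbb {F}_7$: we take $c_f = 3$, $g(X) = X^3+X^2+1$ and $h(X) = X+5$, so that $r = 2$ is a double root of f, and we enumerate $\mathbb {F}_7$-points only.

```python
def evaluate(coeffs, x: int, p: int) -> int:
    """Evaluate a polynomial (constant term first) at x modulo p by Horner's rule."""
    acc = 0
    for c in reversed(coeffs):
        acc = (acc * x + c) % p
    return acc


p, c_f = 7, 3
g_poly = [1, 0, 1, 1]     # g(X) = X^3 + X^2 + 1 (hypothetical choice)
h_poly = [5, 1]           # h(X) = X + 5, vanishing at r = 2


def f_value(x: int) -> int:
    """f(x) = c_f * g(x) * h(x)^2 modulo p."""
    return c_f * evaluate(g_poly, x, p) * evaluate(h_poly, x, p) ** 2 % p


# Affine F_p-points of the normalization Y^2 = c_f * g(X).
normalisation_points = [(x, y) for x in range(p) for y in range(p)
                        if (y * y - c_f * evaluate(g_poly, x, p)) % p == 0]
# Their images under (x, y) -> (x, y * h(x)).
images = [(x, y * evaluate(h_poly, x, p) % p) for x, y in normalisation_points]
above_node = [pt for pt, im in zip(normalisation_points, images) if im == (2, 0)]
print(above_node)         # the two points lying over the node (2, 0)
```

Every image lies on $Y^2=f(X)$, and the two points over the node are $(2,\pm 2)$, in agreement with (A.3) since $c_fg(2) \equiv 4 \mod 7$.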

Given a morphism $\phi :C_1\rightarrow C_2$ of hyperelliptic curves, we denote by $\tilde{\phi }$ the unique morphism $ \widetilde{C_1}\rightarrow \widetilde{C_2}$ making the diagram

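$$\begin{aligned} \begin{array}{ccc} \widetilde{C_1} &amp; \xrightarrow{\;\tilde{\phi }\;} &amp; \widetilde{C_2} \\ \downarrow &amp; &amp; \downarrow \\ C_1 &amp; \xrightarrow{\;\phi \;} &amp; C_2 \end{array} \end{aligned}$$

(A.4)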

commute.

Remark A.5

We allow the case where every root of f(X) is a double root, in which case C is not geometrically connected. The discussion above and lemma below, however, remain valid as stated.

Lemma A.6

Suppose K has characteristic $p>2$ and let $\sigma $ be a positive integer power of the Frobenius map on ${\bar{K}}$ sending x to $x^p$. Let $C_1:Y^2=f_1(X)$ and $C_2:Y^2=f_2(X)$ be two (possibly singular) hyperelliptic curves. Denote the affine charts for $C_1$ (resp. $C_2$) by $U_{X,1}$ and $U_{T,1}$ (resp. $U_{X,2}$ and $U_{T,2}$). Suppose $\phi :C_1\rightarrow C_2$ is a morphism given as a map $U_{X,1}\rightarrow U_{X,2}$ on ${\bar{K}}$-points by

$$\begin{aligned} (x,y)\mapsto (\alpha \sigma (x)+\beta ,\gamma \sigma (y)), \qquad \alpha ,\gamma \in {\bar{K}}^{\times }\text { and }\beta \in {\bar{K}}. \end{aligned}$$

(1)
As a (rational) map $U_{T,1}\rightarrow U_{T,2}$, $\phi $ is given on ${\bar{K}}$-points by the formula
$$\begin{aligned} (t,v) \mapsto \left( \frac{\sigma (t)}{\alpha +\sigma (t)\beta },\frac{\gamma \sigma (v)}{(\alpha +\sigma (t)\beta )^{g+1}}\right) . \end{aligned}$$
In particular, if $\text{ deg }(f_1)$ is odd then $P^\infty \mapsto P^\infty $ whilst if $\text{ deg }(f_1)$ is even then $P^{\infty }_{\sqrt{c_{f_1}}} \mapsto P^{\infty }_{\epsilon \sqrt{c_{f_2}}}$ where $\epsilon = \frac{\gamma \sigma (\sqrt{c_{f_1}})}{\alpha ^{g+1}\sqrt{c_{f_2}}}\in \{\pm 1\}$.
(2)
For $i=1,2$, write $f_i(X)=c_{f_i}h_i(X)^2g_i(X)$ with $h_i,g_i$ monic and squarefree. Then the morphism $\tilde{\phi }:\widetilde{C_1}\rightarrow \widetilde{C_2}$ is given explicitly on ${\bar{K}}$-points by the formula
$$\begin{aligned} (x,y)\mapsto \left( \alpha \sigma (x)+\beta ,\gamma \alpha ^{-\text{ deg }(h_1)}\sigma (y)\right) . \end{aligned}$$
In particular we have
$$\begin{aligned} \tilde{\phi }\left( N_r^{\pm \sqrt{c_{f_1}g_1(r)}}\right) =N_{r'}^{\pm \kappa _r\sqrt{c_{f_2}g_2(r')}} \end{aligned}$$
where $r'=\alpha \sigma (r)+\beta $ and $\kappa _r=\gamma \alpha ^{-\text{ deg }(h_1)}\frac{\sigma (\sqrt{c_{f_1}g_1(r)})}{\sqrt{c_{f_2}g_2(r')}}\in \{\pm 1\}.$

Proof

(1) Let $(t,v)\in U_{T,1}$ with $t\ne 0$. This corresponds to the point $(1/t,v/t^{g+1})\in U_{X,1}$ which under $\phi $ is mapped to the point

$$\begin{aligned} \left( \frac{\alpha +\beta \sigma (t)}{\sigma (t)},\frac{\gamma \sigma (v)}{\sigma (t)^{g+1}}\right) \in U_{X,2}. \end{aligned}$$

Changing variables we see that this corresponds to the point

$$\begin{aligned} \left( \frac{\sigma (t)}{\alpha +\sigma (t)\beta },\frac{\gamma \sigma (v)}{(\alpha +\sigma (t)\beta )^{g+1}}\right) \in U_{T,2}. \end{aligned}$$

Since this formula describes a rational map which is defined at $t=0$ it gives the desired expression for the morphism on $U_{T,1}$, as well as the claim about the points at infinity.

(2) We first claim that

$$\begin{aligned} f_2(X)=\gamma ^2f_1^\sigma \left( \frac{X-\beta }{\alpha }\right) , \end{aligned}$$

(A.7)

where $f_1^\sigma (X)$ is the result of applying $\sigma $ to the coefficients of $f_1(X)$. In particular

$$\begin{aligned} c_{f_2}=(\gamma /\alpha ^{g+1})^2\sigma (c_{f_1}). \end{aligned}$$

Indeed, since $\phi $ is a morphism, for all $(x,y)\in C_1({\bar{K}})$ we must have

$$\begin{aligned} \gamma ^2\sigma (y^2)=f_2(\alpha \sigma (x)+\beta ), \end{aligned}$$

or equivalently

$$\begin{aligned} \gamma ^2f_1^\sigma (\sigma (x))=f_2(\alpha \sigma (x)+\beta ). \end{aligned}$$

Since both the x-coordinate map $U_{X,1}({\bar{K}})\rightarrow {\bar{K}}$ and $\sigma :{\bar{K}}\rightarrow {\bar{K}}$ are surjective we deduce that

$$\begin{aligned} \gamma ^2f_1^\sigma (x)=f_2(\alpha x+\beta ) \end{aligned}$$

holds for all $x\in {\bar{K}}$ and is thus a polynomial identity, from which (A.7) follows.

To see that the claimed formula for $\tilde{\phi }$ gives a morphism $\widetilde{C}_1\rightarrow \widetilde{C}_2$ making the diagram (A.4) commute, we use that from (A.7) one has

$$\begin{aligned} h_2(X)=\alpha ^{\text{ deg }(h_1)}h_1^\sigma \left( \frac{X-\beta }{\alpha }\right) . \end{aligned}$$

$\square $

Appendix B. Centres of clusters

Lemma B.1

Let $f(x)\in K[x]$ be a squarefree polynomial with set of roots ${\mathcal {R}}$. Let ${{\mathfrak {s}}}$ be a proper cluster, $G_{{\mathfrak {s}}}= {\text {Stab}}_{G_K}({{\mathfrak {s}}})$ and $K_{{\mathfrak {s}}}= (K^{sep})^{G_{\mathfrak {s}}}$. If there is a root $z_0\in {{\mathfrak {s}}}$ such that $K_{{\mathfrak {s}}}(z_0)/K_{{\mathfrak {s}}}$ is tame, then there is a centre for ${{\mathfrak {s}}}$ which lies in $K_{{\mathfrak {s}}}$. In particular, if $K({\mathcal {R}})/K$ is tamely ramified, then for every proper cluster ${{\mathfrak {s}}}$ there is a centre which lies in $K_{{\mathfrak {s}}}$.

Proof

By assumption $z_0$ lies in a tame extension of $K_{{{\mathfrak {s}}}}$ and hence in $K_{{{\mathfrak {s}}}}^{nr}(\root m \of {\pi _{{\mathfrak {s}}}})$ for some $p\not \mid m$ and uniformiser $\pi _{{\mathfrak {s}}}$ of $K_{{\mathfrak {s}}}$ (we fix here a choice of $\root m \of {\pi _{{\mathfrak {s}}}}$). Write the p-adic expansion of $z_0$ as

$$\begin{aligned} z_0 = a_t \root m \of {\pi _{{\mathfrak {s}}}}^t + a_{t+1} \root m \of {\pi _{{\mathfrak {s}}}}^{t+1} + \ldots \end{aligned}$$

for a suitable $t\in \mathbb {Z}$ and $a_t\in K_{{\mathfrak {s}}}^{nr}$ roots of unity of order prime to p.

For $\sigma \in G_{{\mathfrak {s}}}$ we have $\sigma (z_0)\equiv z_0 \mod \pi _K^{d_{{\mathfrak {s}}}}$. In other words the terms in the p-adic expansions of $z_0$ and $\sigma z_0$ agree up to $\root m \of {\pi _{{\mathfrak {s}}}}^{e_{K_{{\mathfrak {s}}}/K} m d_{{\mathfrak {s}}}}$. Define

$$\begin{aligned} z = \sum _{t<e_{K_{{\mathfrak {s}}}/K} m d_{{\mathfrak {s}}}} a_t \root m \of {\pi _{{\mathfrak {s}}}}^t. \end{aligned}$$

Clearly z is a centre for ${{\mathfrak {s}}}$ and it suffices to check that it is $G_{{\mathfrak {s}}}$-invariant. Suppose not, and that $a_{u}\root m \of {\pi _{{\mathfrak {s}}}}^u$ is the lowest valuation term in the expression which is not $G_{{\mathfrak {s}}}$-invariant. If $m\not \mid u$ then there is some element $\sigma $ of tame inertia of $K_{{\mathfrak {s}}}$ which fixes $a_u\in K_{{\mathfrak {s}}}^{nr}$ and maps $\root m \of {\pi _{{\mathfrak {s}}}}^u$ to $\zeta \root m \of {\pi _{{\mathfrak {s}}}}^u$ with a root of unity $\zeta \ne 1$; this contradicts the fact that $\sigma z_0\equiv z_0 \mod \root m \of {\pi _{{\mathfrak {s}}}}^{e_{K_{{\mathfrak {s}}}/K} m d_{{\mathfrak {s}}}}$. If $m\mid u$ then $\root m \of {\pi _{{\mathfrak {s}}}}^u\in K_{{\mathfrak {s}}}$, so we must have $a_u\not \in K_{{\mathfrak {s}}}$; but in this case the Frobenius element $\phi $ similarly scales $a_u\root m \of {\pi _{{\mathfrak {s}}}}^u$ by a non-trivial root of unity of order prime to p, which contradicts $\phi z_0\equiv z_0 \mod \root m \of {\pi _{{\mathfrak {s}}}}^{e_{K_{{\mathfrak {s}}}/K} m d_{{\mathfrak {s}}}}$. $\square $

Appendix C. Equivalent semistability conditions

Throughout this appendix $C/K:y^2 = f(x)$ is a hyperelliptic curve with

$$\begin{aligned} f(x) = c_f\prod _{r \in \mathcal {R}}(x-r). \end{aligned}$$

We give two equivalent formulations of the semistability criterion (Propositions C.4 and C.7). In view of Theorem 7.1 these provide equivalent conditions for C/K to be semistable.

Definition C.1

(= Definition 1.8) We say that C/K satisfies the semistability criterion if the following conditions hold:

(1)
The extension $K(\mathcal {R})/K$ has ramification degree at most 2.
(2)
Every proper cluster is $I_K$-invariant.
(3)
Every principal cluster ${{\mathfrak {s}}}$ has $d_{{\mathfrak {s}}}\in \mathbb {Z}$ and $\nu _{{\mathfrak {s}}}\in 2\mathbb {Z}.$

Lemma C.2

Suppose $K({\mathcal {R}})/K$ is tamely ramified and $\sigma r\ne r$ for some $\sigma \in I_K$ and $r\in \mathcal {R}$. Then $v(r-\sigma r)\notin \mathbb {Z}$, and $|\text {Orbit}_{I_K}(r)|\,v(r-\sigma r)\in \mathbb {Z}$.

Proof

Write

$$\begin{aligned} r = a_1 \pi ^{b_1} + a_2 \pi ^{b_2} + \ldots \end{aligned}$$

with rational $b_1<b_2<\cdots $ and $a_i\in K^{nr}$ roots of unity of order prime to p. The expansion of $\sigma r$ differs from that of r only at those $b_i$ that are not in $\mathbb {Z}$, hence $v(r-\sigma r)\notin \mathbb {Z}$. Also, the size of the orbit $|\text {Orbit}_{I_K}(r)|$ is the lowest common multiple of the denominators of the $b_i$ (when written in lowest terms), and so $|\text {Orbit}_{I_K}(r)|\,v(r-\sigma r)\in \mathbb {Z}$. $\square $
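The bookkeeping in this proof can be sketched on a toy example. The code below assumes that $\sigma $ generates tame inertia, so that $\sigma ^k$ multiplies the term $\pi ^{b}$ by a root of unity which is non-trivial exactly when $kb\notin \mathbb {Z}$. It only tracks the exponents $b_i$; the exponent list and the function names are hypothetical.

```python
from fractions import Fraction
from functools import reduce
from math import gcd


def orbit_size(exponents):
    """Least common multiple of the denominators of the exponents b_i."""
    return reduce(lambda a, b: a * b // gcd(a, b),
                  (e.denominator for e in exponents), 1)


def valuation_of_difference(exponents, k):
    """v(r - sigma^k r), or None when sigma^k fixes r.

    sigma^k changes exactly the terms pi^b with k*b not an integer, and the
    difference has the valuation of the lowest such term.
    """
    moved = [b for b in exponents if (k * b).denominator != 1]
    return min(moved) if moved else None


# Hypothetical expansion r = a_1 pi^1 + a_2 pi^(3/2) + a_3 pi^(7/3)
exponents = [Fraction(1), Fraction(3, 2), Fraction(7, 3)]
n = orbit_size(exponents)
for k in range(1, n):
    v = valuation_of_difference(exponents, k)
    # Both conclusions of Lemma C.2: v is not an integer, n * v is.
    print(k, v, v.denominator != 1, (n * v).denominator == 1)
```

Here the orbit has size 6, and every non-trivial power of $\sigma $ moves r by a difference of valuation 3/2 or 7/3.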

Lemma C.3

Suppose $K({\mathcal {R}})/K$ is tamely ramified and $d_{{\mathfrak {s}}}\in \mathbb {Z}$ for every principal cluster ${{\mathfrak {s}}}\subset \mathcal {R}$ and for $\mathcal {R}$ itself when $\mathcal {R}={{\mathfrak {s}}}_1\coprod {{\mathfrak {s}}}_2$ is a union of two clusters. Then

(1)
$e_{K(\mathcal {R})/K}\le 2$,
(2)
all proper clusters are inertia invariant,
(3)
a root r is fixed by inertia unless $r\in {{\mathfrak {t}}}$ for a twin ${{\mathfrak {t}}}$ or $\mathcal {R}= {\mathfrak {c}} \cup \{r,r'\}$ is a cotwin with ${\mathfrak {c}}$ its principal child.

Proof

Let r and $\sigma r$ be two inertia conjugate roots. By Lemma C.2, $v(r-\sigma r) \notin \mathbb {Z}$, so the depth of ${{\mathfrak {s}}}=\{r\} \wedge \{ \sigma r\}$ is not an integer. Note that ${{\mathfrak {s}}}$ cannot be a cotwin of odd size, since its singleton root (by construction r or $\sigma r$) cannot be Galois conjugate to a root in its principal child. Thus ${{\mathfrak {s}}}$ is either a twin or ${{\mathfrak {s}}}= \mathcal {R}$ is a cotwin of the form ${\mathfrak {c}} \cup \{r,\sigma r\}$ where ${\mathfrak {c}}$ is its principal child. It follows that inertia can only swap roots inside twins or the two singletons inside a cotwin. The lemma follows. $\square $

Proposition C.4

Let C/K be a hyperelliptic curve. Then C/K satisfies the semistability criterion if and only if

(1)
The extension $K(\mathcal {R})/K$ is tamely ramified.
(2)
Every principal cluster ${{\mathfrak {s}}}$ is $I_K$-invariant, has $d_{{\mathfrak {s}}}\in \mathbb {Z}$ and $\nu _{{\mathfrak {s}}}\in 2\mathbb {Z}$.

Proof

Clearly if C/K satisfies the semistability criterion then (1) and (2) hold. For the converse, by Lemma C.3 it suffices to show that $d_\mathcal {R}\in \mathbb {Z}$ if $\mathcal {R}$ is a union of two clusters. Suppose $\mathcal {R}={{\mathfrak {s}}}_1 \coprod {{\mathfrak {s}}}_2$ is a union of two clusters. At least one of the ${{\mathfrak {s}}}_i$ is principal, so, by hypothesis, they cannot be permuted by $I_K$. By Lemma B.1, ${{\mathfrak {s}}}_1$ and ${{\mathfrak {s}}}_2$ have centres $z_{{{\mathfrak {s}}}_1}, z_{{{\mathfrak {s}}}_2}\in K$ (taking $z_{{{\mathfrak {s}}}}=r$ if ${{\mathfrak {s}}}=\{r\}$ is a singleton), and hence $d_{\mathcal {R}}=v(z_{{{\mathfrak {s}}}_1}-z_{{{\mathfrak {s}}}_2})\in \mathbb {Z}$. $\square $

Lemma C.5

For any cluster ${{\mathfrak {s}}}$, $\nu _{{{\mathfrak {s}}}} = \nu _{P({{\mathfrak {s}}})} + |{{\mathfrak {s}}}|\delta _{{{\mathfrak {s}}}}.$

Proof

By definition of $\nu $,

$$\begin{aligned} \nu _{{{\mathfrak {s}}}}= & {} v(c_f)+|{{\mathfrak {s}}}|d_{{{\mathfrak {s}}}}+ \sum _{r \notin {{\mathfrak {s}}}} v(z_{{{\mathfrak {s}}}}-r) = v(c_f)+|{{\mathfrak {s}}}|d_{{{\mathfrak {s}}}}+\sum _{r \notin P({{\mathfrak {s}}})} v(z_{{{\mathfrak {s}}}}-r) \\&+ \sum _{r \in P({{\mathfrak {s}}}) {\setminus } {{\mathfrak {s}}}} v(z_{{{\mathfrak {s}}}}-r) \\= & {} v(c_f) + |P({{\mathfrak {s}}})|d_{P({{\mathfrak {s}}})}+ \sum _{r \notin P({{\mathfrak {s}}})} v(z_{P({{\mathfrak {s}}})}-r) + |{{\mathfrak {s}}}|d_{{{\mathfrak {s}}}} - |P({{\mathfrak {s}}})|d_{P({{\mathfrak {s}}})} \\&+ (|P({{\mathfrak {s}}})|-|{{\mathfrak {s}}}|)d_{P({{\mathfrak {s}}})} = \nu _{P({{\mathfrak {s}}})} +|{{\mathfrak {s}}}|\delta _{{{\mathfrak {s}}}}. \end{aligned}$$

$\square $
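A numerical illustration of Lemma C.5, as a minimal sketch: it uses the monic polynomial with roots $1, 1\pm p^2, p, 0, \pm p^3$ and the illustrative choice $p=3$, with the cluster structure entered by hand. Any root of a cluster is used as its centre. The dictionary layout and function names are hypothetical.

```python
def vp(n, p):
    """p-adic valuation of a nonzero integer n."""
    if n == 0:
        raise ValueError("valuation of 0 is infinite")
    k = 0
    while n % p == 0:
        n //= p
        k += 1
    return k


p = 3  # illustrative odd prime
roots = [1, 1 + p**2, 1 - p**2, p, 0, p**3, -p**3]
clusters = {
    "R": roots,
    "s1": [1, 1 + p**2, 1 - p**2],
    "s2": [p, 0, p**3, -p**3],
    "s3": [0, p**3, -p**3],
}
parent = {"s1": "R", "s2": "R", "s3": "s2"}


def depth(name):
    """d_s: minimal valuation of a difference of two roots of the cluster."""
    s = clusters[name]
    return min(vp(a - b, p) for a in s for b in s if a != b)


def nu(name, v_cf=0):
    """nu_s = v(c_f) + |s| d_s + sum over r outside s of v(z_s - r)."""
    s = clusters[name]
    z = s[0]  # any root of s serves as a centre
    outside = sum(vp(z - r, p) for r in roots if r not in s)
    return v_cf + len(s) * depth(name) + outside


for s, ps in parent.items():
    delta = depth(s) - depth(ps)  # relative depth delta_s
    # Lemma C.5: nu_s = nu_{P(s)} + |s| * delta_s
    print(s, nu(s), nu(ps) + len(clusters[s]) * delta)
```

In this example all four values of $\nu $ are even and all depths are integers, which is the shape of condition (3) in Definition C.1; the sketch does not check which clusters are principal.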

Lemma C.6

The following are equivalent:

(1)
There exists a principal cluster ${{\mathfrak {s}}}$ with $d_{{\mathfrak {s}}}\in \mathbb {Z}$ and $\nu _{{\mathfrak {s}}}\in 2\mathbb {Z}$ and for all other principal clusters ${{\mathfrak {s}}}', {{\mathfrak {s}}}''\ne \mathcal {R}$,
  (a)
  $\delta _{{{\mathfrak {s}}}'} \in \mathbb {Z}$ if ${{\mathfrak {s}}}'$ is even and $P({{\mathfrak {s}}}')$ is principal,
  (b)
  $\delta _{{{\mathfrak {s}}}'} \in 2\mathbb {Z}$ if ${{\mathfrak {s}}}'$ is odd and $P({{\mathfrak {s}}}')$ is principal,
  (c)
  $\delta _{{{\mathfrak {s}}}'} - \delta _{{{\mathfrak {s}}}''} \in 2\mathbb {Z}$ if $\mathcal {R}= {{\mathfrak {s}}}' \coprod {{\mathfrak {s}}}''$ and ${{\mathfrak {s}}}', {{\mathfrak {s}}}''$ odd,
  (d)
  $\delta _{{{\mathfrak {s}}}'} - \delta _{{{\mathfrak {s}}}''} \in \mathbb {Z}$ if $\mathcal {R}= {{\mathfrak {s}}}' \coprod {{\mathfrak {s}}}''$ and ${{\mathfrak {s}}}', {{\mathfrak {s}}}''$ even.
(2)
All principal clusters ${{\mathfrak {s}}}$ have $d_{{\mathfrak {s}}}\in \mathbb {Z}$ and $\nu _{{\mathfrak {s}}}\in 2\mathbb {Z}$.

Proof

For all proper clusters ${{\mathfrak {s}}}$ we have $d_{{\mathfrak {s}}}=d_{P({{\mathfrak {s}}})}+\delta _{{\mathfrak {s}}}$ by definition of $\delta _{{\mathfrak {s}}}$, and $\nu _{{{\mathfrak {s}}}} = \nu _{P({{\mathfrak {s}}})} + |{{\mathfrak {s}}}|\delta _{{{\mathfrak {s}}}}$, by Lemma C.5. The result follows from a simple case-by-case check, and the fact that going to parent and child clusters allows one to move from any principal cluster to any other one, moving only through principal clusters and possibly through $\mathcal {R}$ when it is a union of two odd or two even clusters. $\square $

Proposition C.7

Let C/K be a hyperelliptic curve and let ${{\mathfrak {s}}}$ be a principal cluster. Then C/K satisfies the semistability criterion if and only if

(1)
there exists a principal cluster ${{\mathfrak {s}}}$ with $d_{{\mathfrak {s}}}\in \mathbb {Z}$ and $\nu _{{\mathfrak {s}}}\in 2\mathbb {Z}$,
(2)
for all proper clusters ${{\mathfrak {s}}}', {{\mathfrak {s}}}''\ne \mathcal {R}$,
  (a)
  $\delta _{{{\mathfrak {s}}}'} \in \mathbb {Z}$ if $|{{\mathfrak {s}}}'|>2$ is even and $P({{\mathfrak {s}}}')$ is not a cotwin,
  (b)
  $\delta _{{{\mathfrak {s}}}'} \in 2\mathbb {Z}$ if $|{{\mathfrak {s}}}'|$ is odd and $P({{\mathfrak {s}}}')$ is principal,
  (c)
  $\delta _{{{\mathfrak {s}}}'} \in \frac{1}{2}\mathbb {Z}$ if $|{{\mathfrak {s}}}'|=2$,
  (d)
  $\delta _{{{\mathfrak {s}}}'} \in \frac{1}{2}\mathbb {Z}$ if $|{{\mathfrak {s}}}'|= 2g$ and $P({{\mathfrak {s}}}')$ is a cotwin,
  (e)
  $\delta _{{{\mathfrak {s}}}'}, \delta _{{{\mathfrak {s}}}''} \in \mathbb {Z}$ and $\delta _{{{\mathfrak {s}}}'} + \delta _{{{\mathfrak {s}}}''} \in 2\mathbb {Z}$ if $\mathcal {R}= {{\mathfrak {s}}}' \coprod {{\mathfrak {s}}}''$ and ${{\mathfrak {s}}}', {{\mathfrak {s}}}''$ odd,
  (f)
  $\delta _{{{\mathfrak {s}}}'}, \delta _{{{\mathfrak {s}}}''} \in \mathbb {Z}$ if $\mathcal {R}= {{\mathfrak {s}}}' \coprod {{\mathfrak {s}}}''$ and ${{\mathfrak {s}}}', {{\mathfrak {s}}}''$ even principal,
(3)
wild inertia acts trivially on the roots.

Proof

Suppose C/K satisfies (1)–(3). By (1) and (2) the curve satisfies the hypotheses of Lemma C.6(1), and hence all principal clusters ${{\mathfrak {s}}}'$ have $d_{{{\mathfrak {s}}}'}\in \mathbb {Z}$ and $\nu _{{{\mathfrak {s}}}'}\in 2\mathbb {Z}$. By (2e), (2f), if $\mathcal {R}$ is a union of two clusters then $d_\mathcal {R}\in \mathbb {Z}$, so by Lemma C.3 all proper clusters are inertia invariant and the ramification degree of $K(\mathcal {R})/K$ is at most 2.

Conversely, suppose that C/K satisfies the semistability criterion. Then (1) and (3) trivially hold. If $\mathcal {R}={{\mathfrak {s}}}_1 \coprod {{\mathfrak {s}}}_2$ is a union of two clusters, then at least one of the ${{\mathfrak {s}}}_i$ is principal, so they cannot be permuted by $I_K$. By Lemma B.1, ${{\mathfrak {s}}}_1$ and ${{\mathfrak {s}}}_2$ have centres $z_{{{\mathfrak {s}}}_1}, z_{{{\mathfrak {s}}}_2}\in K$ (taking $z_{{{\mathfrak {s}}}}=r$ if ${{\mathfrak {s}}}=\{r\}$ is a singleton), and hence $d_{\mathcal {R}}=v(z_{{{\mathfrak {s}}}_1}-z_{{{\mathfrak {s}}}_2})\in \mathbb {Z}$. Thus (2a), (2b), (2e) and (2f) hold because $d_{{\mathfrak {s}}}\in \mathbb {Z}$ and $\nu _{{\mathfrak {s}}}\in 2\mathbb {Z}$ for every principal cluster ${{\mathfrak {s}}}$. Finally, (2c) and (2d) follow from Lemma C.2. $\square $

Appendix D. Metric cluster pictures, hyperelliptic graphs and BY trees

Here we summarise various definitions and constructions from [18]. Specifically, we recall the combinatorial notion of metric cluster picture and the process for associating a metric BY tree and metric hyperelliptic graph to each such. The relevance to this paper is that, for the metric cluster picture associated to a semistable hyperelliptic curve over a local field K of odd residue characteristic (Example D.3), the resulting hyperelliptic graph is precisely the dual graph of (the special fibre of) its minimal regular model over $K^{\text{ nr }}$ (Theorem 5.18).

We caution that our notation differs slightly from that of [18]. Where there are differences we indicate this immediately after the relevant definition and note in particular that, for a metric cluster picture $\Sigma $, we write $T_\Sigma $ (resp. $G_\Sigma $) for the graph denoted in op. cit. as $\widetilde{\underline{T}(\Sigma )}$ (resp. $\widetilde{\underline{G}(\underline{T}(\Sigma ))}$).

We adopt the same definitions and conventions for metric graphs as in Sect. 2.1.1. In particular we allow graphs to have loops and multiple edges and automorphisms of a metric graph G are homotopy classes of homeomorphisms $G\rightarrow G$ preserving vertices and lengths (which may permute multiple edges and reverse direction of loops).

1.1 Cluster pictures

Definition D.1

(Cluster picture) Let X be a finite set and $\Sigma $ a collection of non-empty subsets of X; elements of $\Sigma $ are clusters. Then $\Sigma $ (or $(X,\Sigma )$) is a cluster picture if

(1)
every singleton is a cluster, and X is a cluster,
(2)
any two clusters are either disjoint or one is contained in the other.

A cluster picture $(X,\Sigma )$ is metric if it is equipped with a distance $\delta ({{\mathfrak {s}}},{{\mathfrak {r}}})=\delta ({{\mathfrak {r}}},{{\mathfrak {s}}})\in \mathbb {R}_{>0}$ for every pair of proper clusters ${{\mathfrak {s}}}<{{\mathfrak {r}}}$. This extends to a distance function between all pairs of proper clusters in the natural way; see [18, Definition 3.45].

We talk about properties of clusters using the notation and terminology set out in Table 6 in Appendix E (see also [18, Sect. 3]), and do not recall these terms here in the interest of space. In particular $(X,\Sigma )$ has genus g if $|X|\in \{2g+1,2g+2\}$. In this appendix we restrict to cluster pictures of genus at least 2.

Example D.2

Let $C/K:y^2=f(x)$ be a hyperelliptic curve and denote by ${\mathcal {R}}\subseteq \bar{K}$ the set of roots of f(x). Then the non-empty subsets of ${\mathcal {R}}$ cut out by discs form a metric cluster picture, where for proper clusters ${{\mathfrak {s}}}<{{\mathfrak {s}}}'$ we set $\delta ({{\mathfrak {s}}},{{\mathfrak {s}}}')=d_{{\mathfrak {s}}}-d_{{{\mathfrak {s}}}'}$ where $d_{{\mathfrak {s}}}= \min _{r,r'\in {{\mathfrak {s}}}}\{v(r-r')\}$ is the depth of the cluster ${{\mathfrak {s}}}$.

Example D.3

As a concrete example of the above, take $K={\mathbb {Q}}_p$ for p odd and consider the monic polynomial f(x) with set of roots $\mathcal {R}=\{1,1\!+\!p^2,1\!-\!p^2,p,0,p^3,-p^3\}$, so that the resulting hyperelliptic curve $C:y^2=f(x)$ is the one considered in (1.2). There are four proper clusters:

$$\begin{aligned} \mathcal {R},\quad {{\mathfrak {s}}}_1=\{1,1\!+\!p^2,1\!-\!p^2\}, \quad {{\mathfrak {s}}}_2=\{p,0,p^3,-p^3\}, \quad {{\mathfrak {s}}}_3=\{0,p^3,-p^3\}, \quad \end{aligned}$$

of depths 0, 2, 1 and 3 respectively. We represent this pictorially by drawing roots $r \in \mathcal {R}$ as $\bullet $, and drawing ovals around roots to represent a cluster:

the roots ordered as they appear in the definition of $\mathcal {R}$. The subscript of the top cluster $\mathcal {R}$ is its depth and for all other clusters it is their “relative depth”: the difference between their depth and that of their parent cluster.
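The proper clusters and depths of this example can be recomputed directly from the roots. The sketch below is not the authors' code: it fixes the illustrative prime $p=3$, treats a proper cluster as a subset of size at least 2 cut out by a disc, and uses hypothetical function names.

```python
def vp(n, p):
    """p-adic valuation of a nonzero integer n."""
    if n == 0:
        raise ValueError("valuation of 0 is infinite")
    k = 0
    while n % p == 0:
        n //= p
        k += 1
    return k


def proper_clusters(roots, p):
    """Map each proper cluster (a frozenset of size >= 2) to its depth."""
    found = {}
    for r in roots:
        # A disc centred at r cuts out a new subset only at radii v(r - s)
        # for another root s, so these are the only depths to try.
        for d in {vp(r - s, p) for s in roots if s != r}:
            cluster = frozenset(s for s in roots if s == r or vp(r - s, p) >= d)
            found[cluster] = min(
                vp(a - b, p) for a in cluster for b in cluster if a != b
            )
    return found


p = 3  # illustrative choice of odd prime
roots = [1, 1 + p**2, 1 - p**2, p, 0, p**3, -p**3]
result = proper_clusters(roots, p)
for cluster, depth in sorted(result.items(), key=lambda kv: kv[1]):
    print(sorted(cluster), "depth", depth)
```

For $p=3$ this recovers the four proper clusters listed above. Their relative depths are 2 for ${{\mathfrak {s}}}_1$, 1 for ${{\mathfrak {s}}}_2$ and 2 for ${{\mathfrak {s}}}_3$, obtained by subtracting the depth of the parent.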

Definition D.4

(Automorphisms of cluster pictures) An automorphism of $\Sigma $ is a pair $\sigma =(\sigma _0,\epsilon _\sigma )$ where $\sigma _0$ is a permutation of the proper clusters preserving sizes, inclusions and, in the metric case, distances, and $\epsilon _\sigma $ is a collection of signs $\epsilon _\sigma ({{\mathfrak {s}}})\in \{\pm 1\}$ for even clusters ${{\mathfrak {s}}}\in \Sigma $ such that $\epsilon _\sigma ({{\mathfrak {s}}}')=\epsilon _\sigma ({{\mathfrak {s}}})$ whenever ${{\mathfrak {s}}}$ is übereven and ${{\mathfrak {s}}}'<{{\mathfrak {s}}}.$

We compose automorphisms by the cocycle rule

$$\begin{aligned} (\alpha , \epsilon _\alpha )\circ (\beta , \epsilon _\beta ) = \bigl (\alpha \circ \beta , {{\mathfrak {s}}}\mapsto \epsilon _\beta ({{\mathfrak {s}}})\epsilon _\alpha (\beta ({{\mathfrak {s}}}))\bigr ). \end{aligned}$$
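To see the cocycle rule in action, here is a small sketch on a hypothetical cluster picture with five elements: two twins `t1`, `t2` of equal relative depth and one further singleton, so that the even clusters are exactly the twins. Permutations and signs are stored as dictionaries; this representation is an assumption made for illustration.

```python
def compose(a, b):
    """(alpha, eps_alpha) o (beta, eps_beta) via the cocycle rule."""
    alpha, eps_alpha = a
    beta, eps_beta = b
    perm = {s: alpha[beta[s]] for s in beta}
    # s -> eps_beta(s) * eps_alpha(beta(s))
    eps = {s: eps_beta[s] * eps_alpha[beta[s]] for s in eps_beta}
    return perm, eps


# Permutations act on the proper clusters R, t1, t2; signs live on t1, t2.
swap = {"R": "R", "t1": "t2", "t2": "t1"}
ident = {"R": "R", "t1": "t1", "t2": "t2"}

sigma = (swap, {"t1": 1, "t2": -1})
tau = (ident, {"t1": -1, "t2": 1})

sigma_squared = compose(sigma, sigma)
print(sigma_squared)  # identity permutation, but both signs equal -1

# The rule is associative, as needed for a group law.
lhs = compose(compose(sigma, tau), sigma)
rhs = compose(sigma, compose(tau, sigma))
print(lhs == rhs)
```

In this example `sigma` has order 4 although its underlying permutation has order 2, which shows that the signs carry information beyond the permutation.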

Remark D.5

Let ${\mathcal {E}}$ denote the set of even clusters which do not have an übereven parent, excluding ${\mathcal {R}}$ unless ${\mathcal {R}}$ is itself übereven. Then to give a collection of signs $\epsilon _\sigma ({{\mathfrak {s}}})$ as in Definition D.4 is equivalent to specifying $\epsilon _\sigma ({{\mathfrak {s}}})$ for ${{\mathfrak {s}}}\in {\mathcal {E}}$, with no additional compatibility.

1.2 The BY tree associated to a cluster picture

Let $\Sigma $ be a metric cluster picture. We associate to $\Sigma $ a finite tree $T_\Sigma $, equipped with a genus marking $g: V(T_\Sigma )\rightarrow \mathbb {Z}_{\ge 0}$ on vertices and a 2-colouring blue/yellow on vertices and edges as follows.

Definition D.6

($T_\Sigma $) Let $(X,\Sigma )$ be a metric cluster picture. We define $T_\Sigma $, the BY tree associated to $\Sigma $, as follows. First take the graph with:

a vertex $v_{{\mathfrak {s}}}$ for every proper cluster ${{\mathfrak {s}}}$, excluding ${{\mathfrak {s}}}=X$ when $|X|=2g+2$ and X has a child of size $2g+1$, coloured yellow if ${{\mathfrak {s}}}$ is übereven and blue otherwise,
an edge linking $v_{{{\mathfrak {s}}}}$ to $v_{P({{\mathfrak {s}}})}$ for every proper cluster ${{\mathfrak {s}}}\ne X$, yellow of length $2\delta _{{\mathfrak {s}}}$ if ${{\mathfrak {s}}}$ is even, and blue of length $\delta _{{\mathfrak {s}}}$ if ${{\mathfrak {s}}}$ is odd.

To obtain $T_\Sigma $ from this graph we remove, if $|X|=2g+2$ and X is a disjoint union of two proper children, the (degree 2) vertex $v_{X}$ from the vertex set^{Footnote 18} (keeping the underlying topological space the same). We define the genus of a vertex $v_{{\mathfrak {s}}}$ as $g(v_{{\mathfrak {s}}})=g({{\mathfrak {s}}})$.

Writing $T=T_\Sigma $, as a topological space $T=T_b\coprod T_y$ with $T_b$ the blue part, and $T_y$ the yellow part. Note that all leaves are blue and that $T_b\subset T$ is closed.

Remark D.7

$T_\Sigma $ is a (metric) BY tree in the sense of [18, Definition 3.18]; in the notation of op. cit. (see Construction 4.13, Proposition 5.7) it is precisely the graph $\widetilde{\underline{T}(\Sigma )}$.

Example D.8

Consider the cluster picture associated to the polynomial of Example D.3. The associated metric BY tree is

where the yellow edge is squiggly for the benefit of viewing in black and white, the number above an edge is its length, and the number on a vertex its genus.
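The vertices and edges of this BY tree can be listed by applying Definition D.6 to the data of Example D.3. The sketch below enters sizes, parents and relative depths by hand. It assumes the usual terminology that a cluster is even or odd according to its size and übereven when all its children are even. Since $|X|=7$ is odd, the exceptional cases for $|X|=2g+2$ do not arise. Genera are not computed, and all names are hypothetical.

```python
size = {"R": 7, "s1": 3, "s2": 4, "s3": 3}
parent = {"s1": "R", "s2": "R", "s3": "s2"}
rel_depth = {"s1": 2, "s2": 1, "s3": 2}  # delta_s = d_s - d_{P(s)}
# Sizes of all children of each proper cluster, singletons included
child_sizes = {"R": [3, 4], "s1": [1, 1, 1], "s2": [1, 3], "s3": [1, 1, 1]}


def vertex_colour(s):
    """Yellow if s is uebereven (all children even), blue otherwise."""
    ubereven = all(n % 2 == 0 for n in child_sizes[s])
    return "yellow" if ubereven else "blue"


def by_tree_edges():
    """Edges (child, parent, colour, length) as in Definition D.6."""
    edges = []
    for s, ps in parent.items():
        if size[s] % 2 == 0:
            edges.append((s, ps, "yellow", 2 * rel_depth[s]))  # even cluster
        else:
            edges.append((s, ps, "blue", rel_depth[s]))  # odd cluster
    return edges


vertices = {s: vertex_colour(s) for s in size}
edges = by_tree_edges()
print(vertices)
for e in edges:
    print(e)
```

All four vertices come out blue, and the only yellow edge is the one joining $v_{{{\mathfrak {s}}}_2}$ to $v_{\mathcal {R}}$, of length 2.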

1.3 The hyperelliptic graph associated to a metric cluster picture

Let $\Sigma =(X,\Sigma )$ be a metric cluster picture. We associate to $\Sigma $ a metric graph $G_\Sigma $, equipped with a genus marking $g: V(G_\Sigma )\rightarrow {\mathbb {Z}}_{\ge 0}$ and an involution^{Footnote 19} $\iota $ as follows.

Definition D.9

($G_\Sigma $) Let $\Sigma $ be a metric cluster picture and $T=T_\Sigma $ the associated metric BY tree. Define $G_\Sigma $, the hyperelliptic graph associated to $\Sigma $, to be the topological space (complete with metric) given by glueing two disjoint copies $T^+$ and $T^-$ of T along their common blue parts. Thus $G=G_\Sigma $ comes with a natural map $\pi :G\rightarrow T$ making it into a double cover of T ramified along $T_b$, as well as an involution $\iota $ swapping $T^+$ and $T^-$. We make $G$ into a (metric) graph by, for each $v_{{\mathfrak {s}}}\in V(T)$ not a genus 0 leaf (equiv. ${{\mathfrak {s}}}$ principal, see [18, Lemma 5.20]), declaring each element x of $\pi ^{-1}(v_{{\mathfrak {s}}})$ to be a vertex of genus $g({{\mathfrak {s}}})$. We denote this vertex of G by $v_{{\mathfrak {s}}}$ if x is the unique element of $\pi ^{-1}(v_{{\mathfrak {s}}})$, and otherwise denote it $v_{{\mathfrak {s}}}^+$ (resp. $v_{{\mathfrak {s}}}^-$) if $x\in T^+$ (resp. $x\in T^-$). Finally, we adjust the metric by halving the lengths of all edges.

We write $G_b$ for those points in G fixed by $\iota $ and $G_y$ for $G{\setminus } G_b$. Further, write $G_y^+$ for the points in $G_y$ which come from $T^+$ and $G_y^-$ for the points coming from $T^-$.

Remark D.10

The graph $G_\Sigma $ is a hyperelliptic graph in the sense of [18, Definition 3.2], so that in particular all vertices of genus 0 necessarily have degree at least 3. Specifically it is the hyperelliptic graph $\widetilde{\underline{G}(\underline{T}(\Sigma ))}$ (see op. cit. Construction 4.8, Lemma 5.5).

Remark D.11

The graph $G_\Sigma $ may be described somewhat more concretely as follows.

For every non-übereven principal cluster ${{\mathfrak {s}}}$ there is a vertex $v_{{\mathfrak {s}}}$, and for each übereven principal cluster ${{\mathfrak {s}}}$ there are two vertices $v_{{{\mathfrak {s}}}}^+$ and $v_{{{\mathfrak {s}}}}^-$. These are linked by edges as follows (where we write $v_{{\mathfrak {s}}}= v_{{{\mathfrak {s}}}}^+=v_{{{\mathfrak {s}}}}^-$ whenever ${{\mathfrak {s}}}$ is not übereven):

Name	From	To	Length	Conditions
$L_{{{\mathfrak {s}}}'}$	$v_{{{\mathfrak {s}}}'}$	$v_{{{\mathfrak {s}}}}$	$\frac{1}{2} \delta _{{{\mathfrak {s}}}'}$	${{\mathfrak {s}}}'<{{\mathfrak {s}}}$ both principal, ${{\mathfrak {s}}}'$ odd
$L_{{{\mathfrak {s}}}'}^+$	$v_{{{\mathfrak {s}}}'}^+$	$v_{{{\mathfrak {s}}}}^+$	$\delta _{{{\mathfrak {s}}}'}$	${{\mathfrak {s}}}'<{{\mathfrak {s}}}$, both principal, ${{\mathfrak {s}}}'$ even
$L_{{{\mathfrak {s}}}'}^-$	$v_{{{\mathfrak {s}}}'}^-$	$v_{{{\mathfrak {s}}}}^-$	$\delta _{{{\mathfrak {s}}}'}$	${{\mathfrak {s}}}'<{{\mathfrak {s}}}$, both principal, ${{\mathfrak {s}}}'$ even
$L_{{{\mathfrak {t}}}}$	$v_{{{\mathfrak {s}}}}^-$	$v_{{{\mathfrak {s}}}}^+$	$2 \delta _{{\mathfrak {t}}}$	${{\mathfrak {s}}}$ principal, ${{\mathfrak {t}}}<{{\mathfrak {s}}}$ twin
$L_{{{\mathfrak {t}}}}$	$v_{{{\mathfrak {s}}}}^-$	$v_{{{\mathfrak {s}}}}^+$	$2 \delta _{{\mathfrak {s}}}$	${{\mathfrak {s}}}$ principal, ${{\mathfrak {s}}}<{{\mathfrak {t}}}$ cotwin

and if $\mathcal {R}$ is not principal additionally:

$L_{{{\mathfrak {s}}}_1,{{\mathfrak {s}}}_2}$	$v_{{{\mathfrak {s}}}_1}$	$v_{{{\mathfrak {s}}}_2}$	$\frac{1}{2} (\delta _{{{\mathfrak {s}}}_1}+\delta _{{{\mathfrak {s}}}_2})$	$ \mathcal {R}= {{\mathfrak {s}}}_1\coprod {{\mathfrak {s}}}_2$, with ${{\mathfrak {s}}}_1, {{\mathfrak {s}}}_2$ principal odd
$L_{{{\mathfrak {s}}}_1,{{\mathfrak {s}}}_2}^+$	$v_{{{\mathfrak {s}}}_1}^+$	$v_{{{\mathfrak {s}}}_2}^+$	$\delta _{{{\mathfrak {s}}}_1}+\delta _{{{\mathfrak {s}}}_2}$	$ \mathcal {R}= {{\mathfrak {s}}}_1\coprod {{\mathfrak {s}}}_2$, with ${{\mathfrak {s}}}_1, {{\mathfrak {s}}}_2$ principal even
$L_{{{\mathfrak {s}}}_1,{{\mathfrak {s}}}_2}^-$	$v_{{{\mathfrak {s}}}_1}^-$	$v_{{{\mathfrak {s}}}_2}^-$	$\delta _{{{\mathfrak {s}}}_1}+\delta _{{{\mathfrak {s}}}_2}$	$ \mathcal {R}= {{\mathfrak {s}}}_1\coprod {{\mathfrak {s}}}_2$, with ${{\mathfrak {s}}}_1, {{\mathfrak {s}}}_2$ principal even
$L_{{{\mathfrak {t}}}}$	$v_{{{\mathfrak {s}}}}^-$	$v_{{{\mathfrak {s}}}}^+$	$2(\delta _{{\mathfrak {s}}}+\delta _{{\mathfrak {t}}})$	$\mathcal {R}= {{\mathfrak {s}}}\coprod {{\mathfrak {t}}}$, with ${{\mathfrak {s}}}$ principal even, ${{\mathfrak {t}}}$ twin

Example D.12

The hyperelliptic graph associated to the cluster picture of Example D.3 is

where the number above an edge indicates its length, and the number on a vertex its genus. In particular, by Theorem 5.18, for p an odd prime this is the dual graph of the special fibre of the minimal regular model over ${\mathbb {Q}}_p^{\text{ nr }}$ of the hyperelliptic curve

$$\begin{aligned} C/{\mathbb {Q}}_p:y^2=x(x-1)(x-(1+p^2))(x-(1-p^2))(x-p)(x-p^3)(x+p^3). \end{aligned}$$
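The edge list of this graph can be read off from the first three rows of the table in Remark D.11. The sketch below implements only those rows and takes for granted that all four proper clusters of Example D.3 are principal and that none is übereven; both are assumptions entered by hand rather than checked. Edge and vertex labels are plain strings chosen for illustration.

```python
from fractions import Fraction

size = {"R": 7, "s1": 3, "s2": 4, "s3": 3}
parent = {"s1": "R", "s2": "R", "s3": "s2"}
rel_depth = {"s1": 2, "s2": 1, "s3": 2}
ubereven = set()  # no uebereven clusters in this example (assumed)


def vertex(s, sign):
    """v_s^sign for uebereven s, and the single vertex v_s otherwise."""
    return f"v_{s}^{sign}" if s in ubereven else f"v_{s}"


def hyperelliptic_edges():
    """Edges (name, from, to, length) for pairs s' < s of principal clusters."""
    edges = []
    for s, ps in parent.items():
        if size[s] % 2 == 1:
            # odd child: a single edge of length delta/2
            edges.append((f"L_{s}", f"v_{s}", f"v_{ps}", Fraction(rel_depth[s], 2)))
        else:
            # even child: one edge of length delta in each sheet
            for sign in "+-":
                edges.append((f"L_{s}^{sign}", vertex(s, sign),
                              vertex(ps, sign), Fraction(rel_depth[s])))
    return edges


edges = hyperelliptic_edges()
vertices = {e[1] for e in edges} | {e[2] for e in edges}
# First Betti number of a connected graph: #edges - #vertices + 1
betti = len(edges) - len(vertices) + 1
for e in edges:
    print(e)
print("vertices:", len(vertices), "edges:", len(edges), "betti:", betti)
```

The output has four vertices and four edges, each of length 1, with a double edge between $v_{{{\mathfrak {s}}}_2}$ and $v_{\mathcal {R}}$, so the graph has first Betti number 1.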

1.4 Automorphisms of $T_\Sigma $ and $G_\Sigma $

Let $\Sigma =(X,\Sigma )$ be a metric cluster picture. We now explain how to produce an automorphism of $T_\Sigma $ (resp. $G_\Sigma $) from an automorphism of $\Sigma $.

1.4.1 Automorphisms of $T_\Sigma $

By an automorphism of $T=T_\Sigma $ we mean a pair $(\sigma _0, \epsilon _\sigma )$ where

$\sigma _0$ is a graph automorphism of T that preserves genera, colours and distances,
$\epsilon _\sigma $ is a collection of signs $\epsilon _\sigma (Z)\in \{\pm 1\}$, one for every connected component Z of the yellow part $T_y\subset T$.

As for cluster pictures we compose automorphisms by the cocycle rule

$$\begin{aligned} (\alpha , \epsilon _\alpha )\circ (\beta , \epsilon _\beta ) = \bigl (\alpha \circ \beta , \bullet \mapsto \epsilon _\beta (\bullet )\epsilon _\alpha (\beta (\bullet ))\bigr ). \end{aligned}$$

(This is precisely the notion of automorphism for BY trees used in [18, Definition 3.27].)

Definition D.13

($T(\sigma )$) Let $\sigma =(\sigma _0,\epsilon _{\sigma })$ be an automorphism of $\Sigma $. Define the automorphism $T(\sigma )=(T(\sigma )_0,\epsilon _{T(\sigma )})$ of T as follows. For a vertex $v_{{\mathfrak {s}}}$ of T, set $T(\sigma )_0(v_{{\mathfrak {s}}})=v_{\sigma _0({{\mathfrak {s}}})}$. To define $\epsilon _{T(\sigma )}$ for a yellow component Z of $T_y$, pick (as is always possible) an even cluster ${{\mathfrak {s}}}$ such that the edge between $v_{{\mathfrak {s}}}$ and $v_{P({{\mathfrak {s}}})}$ (half-edge if $P({{\mathfrak {s}}})=X$ and $v_X$ is removed from the vertex set in the construction of T) lies in Z. Set $\epsilon _{T(\sigma )}(Z)=\epsilon _{\sigma }({{\mathfrak {s}}})$. The compatibility of signs on even clusters ensures this is well defined.

Remark D.14

The automorphism $T(\sigma )$ of $T_\Sigma $ is precisely the result of restricting the automorphism $\underline{T}(\sigma )$ of the open BY tree $\underline{T}(\Sigma )$, as defined in [18, Construction 4.13], to its core $\widetilde{\underline{T}(\Sigma )}$. In particular (see [18, Proposition 4.14]), the association $\sigma \mapsto T(\sigma )$ is a homomorphism, and every automorphism of $T_\Sigma $ fixing $v_X$ (or $v_{{\mathfrak {s}}}$ if X has size $2g+2$ and a child ${{\mathfrak {s}}}$ of size $2g+1$) arises this way.

1.4.2 Automorphisms of $G_\Sigma $

By an automorphism of $G_\Sigma $ we mean a graph automorphism preserving the genus marking.

Definition D.15

($G(\sigma )$) Let $\sigma =(\sigma _0,\epsilon _\sigma )$ be an automorphism of $\Sigma $, and $T(\sigma )=(T(\sigma )_0,\epsilon _{T(\sigma )})$ the associated automorphism of $T=T_\Sigma $. Denote by $\pi :G\rightarrow T$ the quotient map and for a connected component Z of $G_y$, denote by $\bar{Z}$ the component $\pi (Z)$ of $T_y$. We define $G(\sigma )$ to be the unique automorphism of G such that:

$G(\sigma )$ commutes with $\iota $ and induces the graph automorphism $T(\sigma )_0$ (temporarily denoted $\rho $) on the quotient T,
for a connected component Z of $G_y^+$, we have
$$\begin{aligned} G(\sigma )(Z)={\left\{ \begin{array}{ll} \pi ^{-1}(\rho (\bar{Z}))\cap G_y^+~~&{}~~\epsilon _{T(\sigma )}(\bar{Z})=1,\\ \pi ^{-1}(\rho (\bar{Z}))\cap G_y^-~~&{}~~\epsilon _{T(\sigma )}(\bar{Z})=-1.\end{array}\right. } \end{aligned}$$

Remark D.16

Explicitly, for a non-übereven principal cluster ${{\mathfrak {s}}}$ we have $G(\sigma )(v_{{\mathfrak {s}}})=v_{\sigma {{\mathfrak {s}}}}$. Similarly, for an übereven principal cluster ${{\mathfrak {s}}}$ we have

$$\begin{aligned} G(\sigma )(v_{{\mathfrak {s}}}^+)={\left\{ \begin{array}{ll}v_{\sigma {{\mathfrak {s}}}}^+~~&{}~~\epsilon _\sigma ({{\mathfrak {s}}})=1\\ v_{\sigma {{\mathfrak {s}}}}^-~~&{}~~\epsilon _\sigma ({{\mathfrak {s}}})=-1.\end{array}\right. } \end{aligned}$$

For the edges, for a proper cluster ${{\mathfrak {s}}}$ of size $<2g+1$, write $e_{{\mathfrak {s}}}\in G_b$ (resp. $e_{{\mathfrak {s}}}^+\in G_y^+$ and $e_{{\mathfrak {s}}}^-\in G_y^-$) for the edge(s) between $\pi ^{-1}(v_{{\mathfrak {s}}})$ and $\pi ^{-1}(v_{P({{\mathfrak {s}}})})$ (if ${{\mathfrak {s}}}={{\mathfrak {t}}}$ is a twin, then by $e_{{\mathfrak {t}}}^{\pm }$ we mean the two half-edges which get glued at $\pi ^{-1}(v_{{\mathfrak {t}}})$ to form a loop). Then for ${{\mathfrak {s}}}$ odd we have $G(\sigma )(e_{{\mathfrak {s}}})=e_{\sigma {{\mathfrak {s}}}}$, whilst for ${{\mathfrak {s}}}$ even we have

$$\begin{aligned} G(\sigma )(e_{{\mathfrak {s}}}^+)={\left\{ \begin{array}{ll} e_{\sigma {{\mathfrak {s}}}}^+ ~~&{}~~\epsilon _\sigma ({{\mathfrak {s}}})=1\\ e_{\sigma {{\mathfrak {s}}}}^-~~&{}~~\epsilon _\sigma ({{\mathfrak {s}}})=-1.\end{array}\right. } \end{aligned}$$

Remark D.17

The automorphism $G(\sigma )$ of $G_\Sigma $ is precisely the result of restricting the automorphism $\underline{G}(\sigma )$ of the hyperelliptic graph $\underline{G}(\underline{T}(\Sigma ))$, as defined in [18, Construction 4.8], to its core $\widetilde{\underline{G}(\underline{T}(\Sigma ))}$. In particular (see [18, Proposition 4.11] and Remark D.14), the association $\sigma \mapsto G(\sigma )$ is a homomorphism, and every automorphism of $G_\Sigma $ fixing $\pi ^{-1}(v_X)$ as a set (or $\pi ^{-1}(v_{{\mathfrak {s}}})$ if X has size $2g+2$ and a child ${{\mathfrak {s}}}$ of size $2g+1$) arises this way.

1.5 The homology of $G_\Sigma $

In [18, Sect. 6] an explicit description of the first singular homology group $H_1(G_\Sigma ,{\mathbb {Z}})$, along with its length pairing ([18, Sect. 2.2.2]) and automorphism action is given. Here we recall the result.

Theorem D.18

Let $\Sigma $ be a metric cluster picture, A the set of even non-übereven clusters excluding X, and B the subset of clusters ${{\mathfrak {s}}}\in A$ such that ${{\mathfrak {s}}}^*=X$. Then there is a canonical isomorphism

$$\begin{aligned} H_1(G_\Sigma ,{\mathbb {Z}})\cong \left\{ \sum _{{{\mathfrak {s}}}\in A}\lambda _{{\mathfrak {s}}}\ell _{{\mathfrak {s}}}\in {\mathbb {Z}}^A~~\mid ~~\sum _{{{\mathfrak {s}}}\in B}\lambda _{{\mathfrak {s}}}=0 \right\} . \end{aligned}$$

The length pairing is given by

$$\begin{aligned} \langle \ell _{{{\mathfrak {s}}}_1},\ell _{{{\mathfrak {s}}}_2} \rangle ={\left\{ \begin{array}{ll} 0~~&{} ~~~~\quad {{\mathfrak {s}}}^*_1 \ne {{\mathfrak {s}}}^*_2, \\ 2(\delta ({{{\mathfrak {s}}}_1\wedge {{\mathfrak {s}}}_2},{P({{\mathfrak {s}}}^*_1)}))&{} ~ \quad \quad {{\mathfrak {s}}}^*_1 ={{\mathfrak {s}}}^*_2 \ne X, \\ 2(\delta ({{{\mathfrak {s}}}_1\wedge {{\mathfrak {s}}}_2},{X}))&{} \quad \quad ~{{\mathfrak {s}}}^*_1 ={{\mathfrak {s}}}^*_2 = X.\\ \end{array}\right. } \end{aligned}$$

For an automorphism $\sigma =(\sigma _0,\epsilon _\sigma )$ of $\Sigma $, the action of $G(\sigma )$ on $H_1(G_\Sigma ,{\mathbb {Z}})$ is given by

$$\begin{aligned} \sigma (\ell _{{\mathfrak {s}}}) = \epsilon _\sigma ({{\mathfrak {s}}})\ell _{\sigma {{\mathfrak {s}}}}. \end{aligned}$$

Proof

This is [18, Theorem 6.1] (see op. cit. Definitions 3.16, 3.31 and 3.48 for the definitions of the lattices $\Lambda _{\bullet }$ appearing in the statement). We remark that, writing $T=T_\Sigma $, the proof passes through a canonical identification (op. cit. Proposition 6.6) of $H_1(G_\Sigma ,{\mathbb {Z}})$ with the relative homology group $H_1(T,T_b,{\mathbb {Z}})$, equivariant for the natural actions of automorphisms and preserving the respective length pairings. $\square $

Remark D.19

Unwinding the isomorphism in [18, Theorem 6.1] yields the following explicit description of the basis elements $\ell _{{\mathfrak {s}}}$: for ${{\mathfrak {s}}}\ne \mathcal {R}$ an even non-übereven cluster $\ell _{{{\mathfrak {s}}}}\in C_1(G_\Sigma ,\mathbb {Z})$ is the shortest path going from $v_{\hat{{{\mathfrak {s}}}}}^-$ to $v_{{\mathfrak {s}}}$ through $G_\Sigma ^-$ before going on to $v_{\hat{{{\mathfrak {s}}}}}^+$ through $G_\Sigma ^+$, where here we set ${\hat{{{\mathfrak {s}}}}}=P({{\mathfrak {s}}}^*)$ if ${{\mathfrak {s}}}^* \ne \mathcal {R}$, and ${\hat{{{\mathfrak {s}}}}}=\mathcal {R}$ otherwise. (In exceptional cases, for a cluster ${{\mathfrak {s}}}$ appearing above, $v_{{\mathfrak {s}}}$ (resp. $v_{\hat{{{\mathfrak {s}}}}}^+$ , $v_{\hat{{{\mathfrak {s}}}}}^-$) may not be in the vertex set of $G_\Sigma $, and we must interpret it as the obvious point on an edge instead.) Note that $\ell _{{{\mathfrak {s}}}}$ is a loop in $G_\Sigma $ unless ${{\mathfrak {s}}}^* = \mathcal {R}$, in which case it is a “half loop” in the sense that if $\ell _{{{\mathfrak {s}}}}$, $\ell _{{{\mathfrak {s}}}'}$ are two such then $ \ell _{{{\mathfrak {s}}}} - \ell _{{{\mathfrak {s}}}'}$ is a loop.

Appendix E. Summary of notation

For the reader’s convenience, the following tables gather the general notation and terminology that are used throughout the paper. We reserve gothic letters ${{\mathfrak {s}}}$, ${{\mathfrak {t}}}$, ${{\mathfrak {s}}}_1$ etc. for clusters (except for “mod ${\mathfrak {m}}$”). Tables 4 and 5 list the general notation associated to fields and hyperelliptic curves. Tables 6 and 7 summarise the notation and terminology associated to a cluster ${{\mathfrak {s}}}$, and the main functions and invariants associated to clusters. Table 8 presents the main notation associated to a disc D, as used in Sects. 3–7.

Table 4 General notation associated to fields

Table 5 General notation associated to a hyperelliptic curve

Table 6 Terminology and notation for clusters

Table 7 Functions and invariants associated to clusters

Table 8 General notation associated to discs

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Dokchitser, T., Dokchitser, V., Maistret, C. et al. Arithmetic of hyperelliptic curves over local fields. Math. Ann. 385, 1213–1322 (2023). https://doi.org/10.1007/s00208-021-02319-y

Received: 11 July 2020
Revised: 30 July 2021
Accepted: 14 November 2021
Published: 20 February 2022
Issue Date: April 2023
DOI: https://doi.org/10.1007/s00208-021-02319-y

Conditions	Name	From	To	Length
\({{\mathfrak {s}}}'<{{\mathfrak {s}}}\), both principal, \({{\mathfrak {s}}}'\) odd	\(L_{{{\mathfrak {s}}}'}\)	\(v_{{{\mathfrak {s}}}'}\)	\(v_{{{\mathfrak {s}}}}\)	\(\frac{1}{2} \delta _{{{\mathfrak {s}}}'}\)
\({{\mathfrak {s}}}'<{{\mathfrak {s}}}\), both principal, \({{\mathfrak {s}}}'\) even	\(L_{{{\mathfrak {s}}}'}^+\)	\(v_{{{\mathfrak {s}}}'}^+\)	\(v_{{{\mathfrak {s}}}}^+\)	\(\delta _{{{\mathfrak {s}}}'}\)
\({{\mathfrak {s}}}'<{{\mathfrak {s}}}\), both principal, \({{\mathfrak {s}}}'\) even	\(L_{{{\mathfrak {s}}}'}^-\)	\(v_{{{\mathfrak {s}}}'}^-\)	\(v_{{{\mathfrak {s}}}}^-\)	\(\delta _{{{\mathfrak {s}}}'}\)
\({{\mathfrak {s}}}\) principal, \({{\mathfrak {t}}}<{{\mathfrak {s}}}\) twin	\(L_{{{\mathfrak {t}}}}\)	\(v_{{{\mathfrak {s}}}}^-\)	\(v_{{{\mathfrak {s}}}}^+\)	\(2 \delta _{{\mathfrak {t}}}\)
\({{\mathfrak {s}}}\) principal, \({{\mathfrak {s}}}<{{\mathfrak {t}}}\) cotwin	\(L_{{{\mathfrak {t}}}}\)	\(v_{{{\mathfrak {s}}}}^-\)	\(v_{{{\mathfrak {s}}}}^+\)	\(2 \delta _{{\mathfrak {s}}}\)

\(\mathcal {R}= {{\mathfrak {s}}}_1\coprod {{\mathfrak {s}}}_2\), with \({{\mathfrak {s}}}_1, {{\mathfrak {s}}}_2\) principal odd	\(L_{{{\mathfrak {s}}}_1,{{\mathfrak {s}}}_2}\)	\(v_{{{\mathfrak {s}}}_1}\)	\(v_{{{\mathfrak {s}}}_2}\)	\(\frac{1}{2} (\delta _{{{\mathfrak {s}}}_1}+\delta _{{{\mathfrak {s}}}_2})\)
\(\mathcal {R}= {{\mathfrak {s}}}_1\coprod {{\mathfrak {s}}}_2\), with \({{\mathfrak {s}}}_1, {{\mathfrak {s}}}_2\) principal even	\(L_{{{\mathfrak {s}}}_1,{{\mathfrak {s}}}_2}^+\)	\(v_{{{\mathfrak {s}}}_1}^+\)	\(v_{{{\mathfrak {s}}}_2}^+\)	\(\delta _{{{\mathfrak {s}}}_1}+\delta _{{{\mathfrak {s}}}_2}\)
\(\mathcal {R}= {{\mathfrak {s}}}_1\coprod {{\mathfrak {s}}}_2\), with \({{\mathfrak {s}}}_1, {{\mathfrak {s}}}_2\) principal even	\(L_{{{\mathfrak {s}}}_1,{{\mathfrak {s}}}_2}^-\)	\(v_{{{\mathfrak {s}}}_1}^-\)	\(v_{{{\mathfrak {s}}}_2}^-\)	\(\delta _{{{\mathfrak {s}}}_1}+\delta _{{{\mathfrak {s}}}_2}\)
\(\mathcal {R}= {{\mathfrak {s}}}\coprod {{\mathfrak {t}}}\), with \({{\mathfrak {s}}}\) principal even, \({{\mathfrak {t}}}\) twin	\(L_{{{\mathfrak {t}}}}\)	\(v_{{{\mathfrak {s}}}}^-\)	\(v_{{{\mathfrak {s}}}}^+\)	\(2(\delta _{{\mathfrak {s}}}+\delta _{{\mathfrak {t}}})\)
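The edge lengths in the table above can be transcribed row by row. The following is a minimal sketch, assuming the relevant depths $\delta $ are already known and given as exact rationals. The function name and the case labels are hypothetical and introduced only for illustration; exact fractions are used so that the factors $\frac{1}{2}$ are not rounded.

```python
from fractions import Fraction

def edge_lengths(case, *depths):
    """Edges created by one row of the table, with their lengths.

    `case` is a hypothetical label for the row's condition; `depths` are
    the delta values that row uses, in the order noted in the comments.
    """
    d = [Fraction(x) for x in depths]
    if case == "odd_child":            # s' < s both principal, s' odd: delta_{s'}
        (ds,) = d
        return {"L": ds / 2}
    if case == "even_child":           # s' < s both principal, s' even: delta_{s'}
        (ds,) = d
        return {"L+": ds, "L-": ds}
    if case == "twin_child":           # s principal, t < s twin: delta_t
        (dt,) = d
        return {"L": 2 * dt}
    if case == "cotwin":               # s principal, s < t cotwin: delta_s
        (ds,) = d
        return {"L": 2 * ds}
    if case == "root_two_odd":         # R = s1 ⊔ s2, both principal odd
        d1, d2 = d
        return {"L": (d1 + d2) / 2}
    if case == "root_two_even":        # R = s1 ⊔ s2, both principal even
        d1, d2 = d
        return {"L+": d1 + d2, "L-": d1 + d2}
    if case == "root_even_and_twin":   # R = s ⊔ t, s principal even, t twin
        ds, dt = d
        return {"L": 2 * (ds + dt)}
    raise ValueError(f"unknown case: {case!r}")

# Hypothetical depths, chosen only to exercise the formulas.
example_odd = edge_lengths("odd_child", 3)
example_root = edge_lengths("root_even_and_twin", 1, Fraction(1, 2))
```

With these hypothetical depths, the odd child contributes a single edge of length $\frac{3}{2}$, and the last row contributes a single edge of length $2(1+\frac{1}{2})=3$.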
