1 Introduction

Let S be a smooth projective complex surface with \(b_1(S) = 0\) and polarization H. We denote by

$$\begin{aligned} M:=M_{S}^{H}(r,c_1,c_2) \end{aligned}$$

the moduli space of rank r Gieseker H-stable torsion free sheaves on S with Chern classes \(c_1 \in H^2(S,{\mathbb {Z}})\), \(c_2 \in H^4(S,{\mathbb {Z}})\). Suppose that no rank r strictly Gieseker H-semistable sheaves with Chern classes \(c_1, c_2\) exist. Then \(M_{S}^{H}(r,c_1,c_2)\) is projective. T. Mochizuki [Moc] studied a perfect obstruction theory on M with

(1)

where \({\mathbb {E}}\) denotes the universal sheaf on \(M \times S\), \(\pi : M \times S \rightarrow M\) is projection, and \((\cdot )_0\) denotes the trace-free part.Footnote 1

This leads to a virtual cycle on M of degree equal to the virtual dimension

$$\begin{aligned} {\text {vd}}(M)=2rc_2-(r-1)c_1^2-(r^2-1)\chi ({\mathcal {O}}_S). \end{aligned}$$
(2)

The (algebraic) Donaldson invariants are then obtained by capping certain classes with the virtual cycle

$$\begin{aligned} \int _{[M]^{\text {vir}}} \tau _{\alpha _1}(\sigma _1) \cdots \tau _{\alpha _m}(\sigma _m), \end{aligned}$$
(3)

where \(\sigma _1, \ldots \sigma _m \in H^*(S,{\mathbb {Q}})\), \(\alpha _1, \ldots , \alpha _m \ge 0\) are the descendence degrees, and the insertions \(\tau _{\alpha _i}(\sigma _i)\) are defined in Sect. 2. One of the main achievements of [Moc] is a beautiful formula expressing (3) for \(r=2\) in terms of Seiberg–Witten invariants of S and certain explicit integrals over \(S^{[n_1]} \times S^{[n_2]}\), where \(S^{[n_i]}\) denotes the Hilbert scheme of \(n_i\) points on S. This formula was used by the first named author, H. Nakajima, and K. Yoshioka to prove the Witten conjecture for algebraic surfaces [GNY3].

We are interested in the virtual \(\chi _y\)-genus of M defined in [FG]

$$\begin{aligned} \chi _{-y}^{\text {vir}}(M) := \sum _{p \ge 0} (-y)^p \chi ^\text {vir}(M,\Omega _{M}^{p,\text {vir}}) \in {\mathbb {Z}}[y], \end{aligned}$$

where \(\chi ^\text {vir}(M, \cdot )\) is virtual holomorphic Euler characteristic and \(\Omega _{M}^{p, \text {vir}} = \Lambda ^p (T_{M}^{\text {vir}})^{\vee }\).Footnote 2 We will usually use its shifted version

$$\begin{aligned} {\overline{\chi }}_{-y}^{\text {vir}}(M):=y^{-\frac{{\text {vd}}(M)}{2}}\chi _{-y}^{\text {vir}}(M), \end{aligned}$$

which is a symmetric Laurent polynomial in \(y^{\pm \frac{1}{2}}\) [FG, Cor. 4.9]. The virtual Euler characteristic is defined as

$$\begin{aligned} e^\text {vir}(M) :={\overline{\chi }}_{-1}^{\text {vir}}(M)= \int _{[M]^\text {vir}} c_{{\text {vd}}(M)}(T^\text {vir}), \end{aligned}$$

where the last equality is the virtual Hopf index theorem [FG, Cor. 4.8]. We are interested in the coefficients of the generating function

$$\begin{aligned} {\textsf {Z}}_{S,H,c_1}(x,y) := \sum _{c_2} \overline{\chi }_{-y}^\text {vir}(M_{S}^{H}(2,c_1,c_2)) \, x^{4c_2 - c_{1}^{2} - 3 \chi ({\mathcal {O}}_S)}, \end{aligned}$$

where the power is \({\text {vd}}(M_{S}^{H}(2,c_1,c_2))\). We will encounter the Dedekind eta function and three of the Jacobi theta functions

$$\begin{aligned} \begin{aligned} \eta (q)&= q^{\frac{1}{24}} \prod _{n=1}^{\infty } (1-q^n), \quad \ \ \theta _1(q,y) = \sum _{n \in {\mathbb {Z}}} (-1)^n q^{\big (n+\frac{1}{2}\big )^2} y^{n+\frac{1}{2}} \\ \theta _2(q,y)&= \sum _{n \in {\mathbb {Z}}} q^{\big (n+\frac{1}{2}\big )^2} y^{n+\frac{1}{2}}, \quad \theta _3(q,y) = \sum _{n \in {\mathbb {Z}}} q^{n^2} y^{n}. \end{aligned} \end{aligned}$$
(4)

We define

$$\begin{aligned} \overline{\eta }(q) := q^{-\frac{1}{24}} \eta (q) \end{aligned}$$

and write the corresponding “Nullwerte” by \(\theta _i(q) = \theta _i(q,1)\).

Seiberg–Witten invariants are oriented diffeomorphism invariants of 4-manifolds. For smooth projective surfaces over \({\mathbb {C}}\) satisfying \(b_1(S)=0\) and \(p_g(S)>0\), Seiberg–Witten invariants of an algebraic class \(a \in H^2(S,{\mathbb {Z}})\) can be constructed in terms of a natural virtual class on the linear system \(|{\mathcal {O}}_S(a)|\) [Moc, Sect. 6.3].Footnote 3 Either the virtual class is zero, in which case one defines \(\text {SW}(a) = 0\), or it has virtual dimension zero, in which case its degree is denoted by \(\text {SW}(a) \in {\mathbb {Z}}\). A class \(a \in H^2(S,{\mathbb {Z}})\) is called a Seiberg–Witten basic class when \(\text {SW}(a) \ne 0\). Many surfaces only have Seiberg–Witten basic classes 0 and \(K_S \ne 0\) with corresponding Seiberg–Witten invariants 1 and \((-1)^{\chi ({\mathcal {O}}_S)}\). Such surfaces are for example minimal surfaces of general type with \(p_g(S)>0\) and \(b_1(S)=0\) [Mor, Thm. 7.4.1].

Conjecture 1.1

Let S be a smooth projective surface with \(b_1(S) = 0\) and \(p_g(S)>0\). Suppose the Seiberg–Witten basic classes of S are 0 and \(K_S \ne 0\). Let \(H, c_1,c_2\) be chosen such that there exist no rank 2 strictly Gieseker H-semistable sheaves with Chern classes \(c_1,c_2\). Then \(\overline{\chi }_{-y}^{\text {vir}}(M^H_S(2,c_1,c_2))\) is the coefficient of \(x^{4c_2-c_1^2-3\chi ({\mathcal {O}}_S)}\) of

$$\begin{aligned} \psi _S(x,y):=8 \Bigg ( \frac{1}{2} \prod _{n=1}^{\infty } \frac{1}{(1-x^{2n})^{10}(1-x^{2n} y)(1-x^{2n} y^{-1})} \Bigg )^{\chi ({\mathcal {O}}_S)} \Bigg ( \frac{2 \overline{\eta }(x^4)^2}{\theta _3(x,y^{\frac{1}{2}})} \Bigg )^{K_{S}^2}. \end{aligned}$$

In this conjecture, we can view \(\psi _S(x,y)\) as generating function for \(\overline{\chi }_{-y}^{\text {vir}}(M^H_S(2,c_1,c_2))\) for all \(c_1,c_2\). As a consequence of the conjecture, \({\textsf {Z}}_{S,H,c_1}(x,y)\) is independent of the choice of polarization H, which we often omit from our notation. There is actually a closed formula for \({\textsf {Z}}_{S,c_1}(x,y)\), which we give in Proposition 5.5. Note that the first factor is related to \(\theta _1(q,y)\) by the Jacobi triple product formula

$$\begin{aligned} \theta _1(q,y) =q^{\frac{1}{4}}(y^{\frac{1}{2}}-y^{-\frac{1}{2}})\prod _{n=1}^\infty (1-q^{2n})(1-q^{2n} y)(1-q^{2n} y^{-1}). \end{aligned}$$

We also present a generalization of this conjecture to arbitrary smooth projective surfaces S with \(b_1(S) = 0\) and \(p_g(S)>0\) (Conjecture 5.7), but the above conjecture is easier to state.

Remark 1.2

Replacing x by \(x y^{\frac{1}{2}}\), we go from generating functions for \(\overline{\chi }_{-y}^{\text {vir}}\) to generating functions for \(\chi _{-y}^{\text {vir}}\). Therefore we also get a conjectural generating function for the non-shifted virtual \(\chi _y\)-genera of the moduli spaces. Under the same assumptions, Conjecture 1.1 gives that \(\chi ^{\text {vir}}_{-y}(M^H_S(2,c_1,c_2))\) is the coefficient of \(x^{4c_2-c_1^2-3\chi ({\mathcal {O}}_S)}\) of

$$\begin{aligned} \begin{aligned} \psi _S(xy^{\frac{1}{2}},y)=&\, 8 \Bigg ( \frac{1}{2} \prod _{n=1}^{\infty } \frac{1}{(1-x^{2n}y^n)^{10}(1-x^{2n} y^{n+1})(1-x^{2n}y^{n-1})} \Bigg )^{\chi ({\mathcal {O}}_S)} \\&\times \Bigg ( \frac{2 \overline{\eta }(x^4y^2)^2}{\theta _3(xy^{\frac{1}{2}},y^{\frac{1}{2}})} \Bigg )^{K_{S}^2} \end{aligned} \end{aligned}$$

with

$$\begin{aligned} {\overline{\eta }}(x^4y^2)=\prod _{n=1}^\infty (1-x^{4n}y^{2n}), \quad \theta _3(xy^{\frac{1}{2}},y^{\frac{1}{2}})=\sum _{n\in {\mathbb {Z}}} y^{\left( {\begin{array}{c}n+1\\ 2\end{array}}\right) }x^{n^2}. \end{aligned}$$

This formula also makes it evident that \(\chi ^{\text {vir}}_{-y}(M^H_S(2,c_1,c_2))\) is a polynomial.

Next, denote by

$$\begin{aligned} {\textsf {Z}}_{S,c_1}(x) := {\textsf {Z}}_{S,c_1}(x,1) \end{aligned}$$

the generating function of virtual Euler characteristics. When \(-K_S H > 0\) or \(K_S=0\) (and some other cases including elliptic surfaces), these are just ordinary topological Euler characteristics because the obstructions vanish and the moduli space is smooth. Then \({\textsf {Z}}_{S,c_1}(x)\) was studied by many people, e.g. [Kly, Got1, Got2, GH, Yos1, Yos2, Yos3]. Conjecture 1.1 implies the following formula which follows by specializing to \(y=1\).

Corollary 1.3

(Proposition 5.5). Assume Conjecture 1.1. Let S be a smooth projective surface with \(b_1(S) = 0\) and \(p_g(S)>0\). Suppose the Seiberg–Witten basic classes of S are 0 and \(K_S \ne 0\). Let \(H, c_1\) be chosen such that there exist no rank 2 Gieseker strictly H-semistable sheaves with first Chern class \(c_1\). Then

$$\begin{aligned} {\textsf {Z}}_{S,c_1}(x) =2\sum _{k=0}^3 \frac{(i^k)^{c_1^2 - \chi ({\mathcal {O}}_S)}}{(2\overline{\eta }((-1)^kx^2)^{12})^{\chi ({\mathcal {O}}_S)}} \Bigg (\frac{2\overline{\eta }(x^4)^2}{\theta _3(i^kx)} \Bigg ) ^{K_S^2}, \end{aligned}$$

where \(i = \sqrt{-1}\).

From definition (4), we see that \(\theta _3(i^k x)=\theta _3(x^4)+i^k\theta _2(x^4)\). Therefore we can rewrite the formula for \({\textsf {Z}}_{S,c_1}(x)\) of Corollary 1.3 as

$$\begin{aligned} \begin{aligned}&\frac{2}{(2\overline{\eta }(x^2)^{12})^{\chi ({\mathcal {O}}_S)}} \Bigg \{ \Bigg ( \frac{\theta _3(x^4)+\theta _2(x^4)}{2\overline{\eta }(x^4)^2} \Bigg )^{-K_S^2} + (-1)^{c_1^2 - \chi ({\mathcal {O}}_S)} \Bigg ( \frac{\theta _3(x^4)-\theta _2(x^4)}{2\overline{\eta }(x^4)^2} \Bigg )^{-K_S^2} \Bigg \} \\&\quad {+}\frac{2 (-i)^{c_1^2 - \chi ({\mathcal {O}}_S)}}{(2\overline{\eta }(-x^2)^{12})^{\chi ({\mathcal {O}}_S)}} \Bigg \{ \Bigg ( \frac{\theta _3(x^4)-i\theta _2(x^4)}{2\overline{\eta }(x^4)^2} \Bigg )^{-K_S^2} + (-1)^{c_1^2 - \chi ({\mathcal {O}}_S)} \Bigg ( \frac{\theta _3(x^4)+i\theta _2(x^4)}{2\overline{\eta }(x^4)^2} \Bigg )^{-K_S^2} \Bigg \}. \end{aligned} \end{aligned}$$
(5)

In [VW], C. Vafa and E. Witten study certain invariants related to S-duality. On \({\mathbb {P}}^2\) their invariants are topological Euler characteristics e(M). For surfaces S with smooth connected canonical divisor, they give a formula [VW, Eqn. (5.38)]. Equation (5) coincides with part of their formula, namely all except the first two terms of [VW, Eqn. (5.38)] and up to an overall factor \(x^{-\chi ({\mathcal {O}}_S) + K_S^2/3}\) coming from our choice of normalization. Likewise, for \(y=1\), the more general Conjecture 5.7 specializes to (part of) a formula from the physics literature due to R. Dijkgraaf, J.-S. Park, and B. Schroers [DPS].

Remark 1.4

By work of S. Donaldson, D. Gieseker and J. Li, and others, if we fix S with \(b_1(S)=0\) and ample class H, then \(M:=M^H_S(2,c_1,c_2)\) is irreducible and generically smooth of the expected dimension for sufficiently large \({\text {vd}}=4c_2-c_1^2-3\chi ({\mathcal {O}}_S)\). It is then also normal and a local complete intersection. See [HL, Chap. 9] for references. In this case [FG, Thm. 4.15] implies that \(e^\text {vir}(M)\) is the degree of the Fulton Chern class \(c_{F}(M)\), which agrees with the Euler characteristic of any smoothening of M, and Corollary 1.3 predicts this number. More generally, all virtual Chern numbers of M coincide with the corresponding Chern numbers of any smoothening [FG, Rem. 4.16]. In particular the virtual \(\chi _y\)-genus of M equals the \(\chi _y\)-genus of any smoothening and Conjecture 1.1 predicts these genera.

Remark 1.5

Recently, Y. Tanaka and R. P. Thomas [TT1] defined a symmetric perfect obstruction theory on the moduli space of stable Higgs pairs \((E,\phi )\) on S, where E has fixed determinant and \(\phi \) is trace-free.Footnote 4 Stable Higgs pairs are related by a Hitchin-Kobayashi correspondence to solutions of the Vafa–Witten equations. There is a \({\mathbb {C}}^*\)-scaling action on the Higgs field and Tanaka-Thomas define “\(\text {SU}(r)\) Vafa–Witten invariants” by virtual localization with respect to this action. They show that the contribution to the invariant of the components corresponding to \(\phi = 0\) are precisely the virtual Euler characteristics \({\textsf {Z}}_{S,c_1}(x)\) that we study (though Tanaka-Thomas’s invariants are defined for any rank). Moreover in the rank 2 case and for S with smooth connected canonical divisor and \(b_1(S) = 0\), they conjecture that the contribution of the other components of the \({\mathbb {C}}^*\)-fixed locus corresponds to the first two terms of [VW, Eqn. (5.38)]. Recall that these are precisely the two terms that we do not see. They gather evidence for this by computing the contributions of other components for some low orders. Therefore by the calculations of this paper and their conjecture, their invariant indeed appears to be the correct mathematical definition of Vafa–Witten’s invariant [VW, Eqn. (5.38)]. Also recently, A. Gholampour, A. Sheshmani, and S.-T. Yau studied Donaldson-Thomas invariants of local surfaces [GSY]. Their invariants are closely related to Tanaka-Thomas’s invariants. The virtual Euler characteristics that we calculate are part of their invariants.

We approach Conjecture 1.1 as follows:

  • We use the virtual Hirzebruch-Riemann-Roch formulaFootnote 5 to express \(\overline{\chi }_{-y}(M)\) in terms of certain descendent Donaldson invariants (Proposition 2.1).

  • We apply Mochizuki’s formula to these descendent Donaldson invariants. This expresses \({\textsf {Z}}_{S,c_1}(x,y)\) in terms of Seiberg–Witten invariants of S and certain integrals over \(S^{[n_1]} \times S^{[n_2]}\). Although Mochizuki’s formula requires \(p_g(S)>0\), these integrals make sense for any surface S.

  • We show that the integrals over \(S^{[n_1]} \times S^{[n_2]}\) can be expressed in terms of seven universal series \(A_1, \ldots , A_7 \in 1+q \, {\mathbb {Q}}[y](\!(s)\!)[[q]]\) (Proposition 3.3).

  • The universal series \(A_i\) are entirely determined by their values on \(S = {\mathbb {P}}^2\) and \({\mathbb {P}}^1 \times {\mathbb {P}}^1\). We calculate \(A_i(s,y,q)\) to order \(q^{7}\) and \(A_i(s,1,q)\) to order \(q^{30}\) by Atiyah-Bott localization (Sect. 4 and Appendix 6.7).

  • We then verify Conjecture 1.1 up to a certain order in x for examples of the following types: blow-ups \({\text {Bl}}_p K3\), double covers of \({\mathbb {P}}^2\), double covers of \({\mathbb {P}}^1 \times {\mathbb {P}}^1\), double covers of Hirzebruch surfaces \({\mathbb {F}}_a\), surfaces in \({\mathbb {P}}^3\), \({\mathbb {P}}^2 \times {\mathbb {P}}^1\) and \({\mathbb {P}}^1 \times {\mathbb {P}}^1 \times {\mathbb {P}}^1\), and complete intersections in \({\mathbb {P}}^4\) and \({\mathbb {P}}^5\) (Sect. 6).

The reduction to toric surfaces also allows us to relate \({\textsf {Z}}_{S,c_1}(x,1)\) to the Nekrasov partition function with one adjoint matter and one fundamental matter (Appendix 6.7). As a consequence, \({\textsf {Z}}_{S,c_1}(x,1)\) can be expressed in terms of four universal series \(F_0, H, G_1, G_2\) (see Remark B.3 for details). This is not used elsewhere in the paper.

We present two generalizations of Conjecture 1.1:

  • Conjecture 5.1 is a statement purely about intersection numbers on Hilbert schemes of points. Together with a strong form of Mochizuki’s formula (Remark 3.5), it implies Conjecture 1.1 (see Proposition 5.3). In addition, the strong form of Mochizuki’s formula and Conjecture 5.1 imply a version of Conjecture 1.1 for arbitrary blow-ups of surfaces S with \(b_1(S)=0\), \(p_g(S)>0\), and Seiberg–Witten basic classes 0 and \(K_S \ne 0\) (Proposition 5.6). We test Conjecture 5.1 in many cases in Sect. 6.

  • Conjecture 5.7 generalizes Conjecture 1.1 to arbitrary surfaces with \(b_1(S) = 0\) and \(p_g(S)>0\). This conjecture has two further applications. (1) It implies a blow-up formula for virtual \(\chi _y\)-genera, which is reminiscent of the blow-up formula of W.-P. Li and Z. Qin (Proposition 5.9). (2) It implies a formula for surfaces with canonical divisor with irreducible reduced connected components (Proposition 5.11). For \(y=1\), the latter recovers another formula of Vafa–Witten [VW, (5.45)]. We check Conjecture 5.7 up to a certain order in x in the following cases (other than the cases above): K3 surfaces, blow-ups \({\text {Bl}}_p {\text {Bl}}_q K3\), and elliptic surfaces E(n) for various \(n \ge 4\).

In Appendix 6.7, the first named author and H. Nakajima conjecture a formula unifying the virtual Euler characteristic specialization of Conjecture 5.7 and Witten’s conjecture for Donaldson invariants.

In [GK] we extend these results and conjectures to virtual elliptic genera and virtual cobordism classes. Besides Mochizuki’s formula [Moc], this paper uses ideas from [GNY1, GNY2, GNY3]. The physics approach to the calculation of elliptic genera of instanton moduli spaces was discussed in N. Nekrasov’s PhD thesis [Nek1] and the papers [LNS, BLN]. We refer to [LLZ] for applications to Gopakumar–Vafa invariants.

2 Mochizuki’s Formula

Let S be a smooth projective surface with \(b_1(S) = 0\) and polarization H. Denote by \(M:=M_{S}^{H}(r,c_1,c_2)\) the moduli space of rank r Gieseker H-stable torsion free sheaves on S with Chern classes \(c_1,c_2\). We assume there are no rank r strictly Gieseker H-semistable sheaves with Chern classes \(c_1,c_2\). Then \(M_{S}^{H}(r,c_1,c_2)\) is projective. In this section, we first show that \({\chi }_{-y}^{\text {vir}}(M)\) can be expressed in terms of descendent Donaldson invariants. We then recall Mochizuki’s formula [Moc] and apply it to our setting.

We start with some notation. Assume there exists a universal sheaf \({\mathbb {E}}\) on \(M \times S\) (in fact, we get rid of this assumption at the end of this section in Remark 2.4). Let \(\sigma \in H^*(S,{\mathbb {Q}})\) and \(\alpha \ge 0\), then we define

$$\begin{aligned} \tau _{\alpha }(\sigma ) := \pi _{M*} \big ( {\text {ch}}_{2+\alpha }({\mathbb {E}}) \cap \pi _{S}^{*} \, \sigma \big ), \end{aligned}$$

where \(\pi _M : M \times S \rightarrow M\) and \(\pi _S : M \times S \rightarrow S\) denote projections. We refer to \(\tau _{\alpha }(\sigma )\) as a descendent insertion of descendence degree \(\alpha \). The insertions \(\tau _0(\sigma )\) are called primary insertions.

We introduce some further notation which will be useful later. Suppose X is any projective \({\mathbb {C}}\)-scheme and E a vector bundle on X, then we define

$$\begin{aligned} \Lambda _y E := \sum _{p=0}^{{\text {rk}}(E)} [\Lambda ^p E] \, y^p \in K^0(X)[[y]]. \end{aligned}$$

This element is invertible in \(K^0(X)[[y]]\), so we can define \(\Lambda _y (-E) = 1/ \Lambda _y E\). Hence we can also define \(\Lambda _y E\) for any element E of \(K^0(X)\). Next, for any element E of \(K^0(X)\) we define

$$\begin{aligned} {\textsf {T}}_{y}(E,t) := t^{-{\text {rk}}E} \sum _k \left\{ {\text {ch}}(\Lambda _{y} E^{\vee }) \, \text {td}(E) \right\} _k t^{k}, \end{aligned}$$
(6)

where \(\{\cdot \}_k \in A^k(X)_{{\mathbb {Q}}}\) selects the degree k part in the Chow ring. Since we have

$$\begin{aligned} \Lambda _y (E_1 + E_2) = \Lambda _y E_1 \otimes \Lambda _y E_2, \end{aligned}$$

the standard properties of Chern characters and Todd classes give

$$\begin{aligned} {\textsf {T}}_y(E_1 + E_2,t) = {\textsf {T}}_y(E_1,t) \, {\textsf {T}}_y(E_2,t). \end{aligned}$$
(7)

This multiplicative property will be crucial in Sect. 3. We also note that for a line bundle L on X with \(c_1(L) = x\), we have

$$\begin{aligned} {\textsf {T}}_y(L,t) = \frac{x (1+y e^{-x t})}{1-e^{-x t}}. \end{aligned}$$
(8)

The variable t can be exploited for a convenient normalization. Indeed if we take \(t = 1+y\), then equations (7) and (8) imply

$$\begin{aligned} {\textsf {T}}_{y}(E - {\mathcal {O}}_X^{\oplus r},1+y) = {\textsf {T}}_{y}(E,1+y) \end{aligned}$$
(9)

for all \(r \ge 0\). This will be used in Sect. 3 as well. Another convenient consequence of the specialization \(t = 1+y\) is that \({\textsf {T}}_{y}(E,1+y)\) is a polynomial in \(1+y\). Moreover, its leading coefficient is given by

$$\begin{aligned} {\textsf {T}}_{y}(E,1+y) \Big |_{y=-1} = c(E), \end{aligned}$$
(10)

where \(c(\cdot )\) denotes total Chern class. This essentially follows from [FG, Thm. 4.5(c)].

We come back to \(M:=M_{S}^{H}(r,c_1,c_2)\). The next proposition involves an argument that appears more generally in the context of stable pairs on 3-folds in [She] (see also [Pan]).

Proposition 2.1

For every \(S,H,r,c_1,c_2\) as above, there exists a polynomial expression \(P({\mathbb {E}})\) in certain descendent insertions \(\tau _{\alpha }(\sigma )\) and y such that

$$\begin{aligned} \chi ^{\text {vir}}_{-y}(M_{S}^{H}(r,c_1,c_2)) = \int _{[M_{S}^{H}(r,c_1,c_2)]^{\text {vir}}} P({\mathbb {E}}). \end{aligned}$$

Proof

By the virtual Hirzebruch-Riemann-Roch theorem [FG, Cor. 3.4]

$$\begin{aligned} \chi ^{\text {vir}}_{-y}(M)&= \int _{[M]^{\text {vir}}} {\textsf {T}}_{-y}(T^{\text {vir}},1-y), \end{aligned}$$

where \({\textsf {T}}_{-y}(T^{\text {vir}},1-y)\) can be expressed as a \({\mathbb {Q}}\)-linear combination of monomials in \(c_i(T^{\text {vir}})\) and y. By (1) and Grothendieck-Riemann-Roch, each \(c_{i}(T^{\text {vir}})\) can be expressed as a \({\mathbb {Q}}\)-linear combination of monomials in

$$\begin{aligned} \pi _{M*} \big ( {\text {ch}}_\alpha ({\mathbb {E}}) {\text {ch}}_\beta ({\mathbb {E}}) \cap \pi _{S}^{*} \sigma \big ), \end{aligned}$$
(11)

where \(\sigma \) is one of the components of \(\text {td}(S)\). Therefore it suffices to show that every expression of the form (11) is a polynomial in descendent insertions. This will then define the universal polynomial \(P({\mathbb {E}})\).

Let \(\pi _{ij}\) and \(\pi _i\) be the projections from \(M \times S \times S\) to factors (ij) and i respectively. Then (11) equals

$$\begin{aligned} \pi _{1*} \big ( \pi _{12}^{*} {\text {ch}}_\alpha ({\mathbb {E}}) \ \pi _{13}^{*} {\text {ch}}_\beta ({\mathbb {E}}) \cap \pi _2^* \sigma \ \pi _{23}^{*} \Delta \big ), \end{aligned}$$
(12)

where \(\Delta \in H^4(S \times S,{\mathbb {Q}})\) is (Poincaré dual to) the class of the diagonal. Next we consider the Künneth decomposition

$$\begin{aligned} \Delta = \sum _{i+j=4} \theta _{1}^{(i)} \otimes \theta _{2}^{(j)}, \end{aligned}$$

where \(\theta _{1}^{(i)} \in H^i(S,{\mathbb {Q}})\) and \(\theta _{2}^{(j)} \in H^j(S,{\mathbb {Q}})\). Substituting this decomposition into (12), factoring the push-forward as \(\pi _1 = \pi _M \circ \pi _{12}\), and applying the projection formula gives

$$\begin{aligned} \pi _{M*} \big ( {\text {ch}}_\alpha ({\mathbb {E}}) \, {\text {ch}}_\beta ({\mathbb {E}}) \cap \pi _{S}^{*} \sigma \big ) = \sum _{i+j=4} \tau _\alpha (\sigma \theta _{1}^{(i)}) \, \tau _\beta (\theta _{2}^{(j)}). \end{aligned}$$

\(\square \)

Remark 2.2

Note that in this section (and the next) we use \(\chi _{-y}^{\text {vir}}\) instead of \(\overline{\chi }_{-y}^{\text {vir}}\) as in Conjecture 1.1. The reason is that the intermediate formulae of this and the next section look slightly easier for \(\chi _{-y}^{\text {vir}}\) whereas the final formula of Conjecture 1.1 looks more elegant for \(\overline{\chi }_{-y}^{\text {vir}}\). One can easily pass from one to the other by Remark 1.2.

Next we recall Mochizuki’s formula [Moc, Thm. 1.4.6]. His formula is derived by a beautiful argument involving geometric wall-crossing for moduli spaces of so-called Bradlow pairs depending on a stability parameter \(\alpha \in {\mathbb {Q}}_{>0}\). For \(\alpha \rightarrow \infty \) the moduli spaces are empty and for \(\alpha \rightarrow 0^+\) the moduli space is a projective bundle over M. On a wall, Mochizuki uses a “master space” with \({\mathbb {C}}^*\)-action, whose fixed locus contains components corresponding to the moduli spaces of Bradlow pairs on either side of the wall. Other components of the fixed locus can be seen as contributions from wall-crossing. They lead to the Seiberg–Witten invariants and integrals over \(S^{[n_1]} \times S^{[n_2]}\) described below. These wall-crossing terms are computed by Graber-Pandharipande’s virtual localization on the master space. This geometric wall-crossing is very different from motivic wall-crossing, as in [Joy], which does not work for ample \(K_S\) due to non-vanishing \({\text {Ext}}^2\). Among other things, Mochizuki’s formula was used in [GNY3] to prove Witten’s conjecture for algebraic surfaces.

There are two ingredients for Mochizuki’s formula. The first ingredient is the Seiberg–Witten invariants \(\text {SW}(a)\) of S in class \(a \in H^2(S,{\mathbb {Z}})\) mentioned in the introduction.

The second ingredient is certain integrals over products of Hilbert schemes of points. On \(S^{[n_1]} \times S^{[n_2]} \times S\) we have the pull-backs of the universal ideal sheaves \({\mathcal {I}}_1\), \({\mathcal {I}}_2\) from \(S^{[n_1]} \times S\), \(S^{[n_2]} \times S\). For any line bundle \(L \in {\text {Pic}}(S)\) we denote by \(L^{[n_i]}\) the tautological vector bundle on \(S^{[n_i]}\) defined by

$$\begin{aligned} L^{[n_i]} := p_* q^* L, \end{aligned}$$

with \(p : {\mathcal {Z}}_i \rightarrow S^{[n_i]}\), \(q : {\mathcal {Z}}_i \rightarrow S\) projections from the universal subscheme \({\mathcal {Z}}_i \subset S^{[n_i]} \times S\).

We consider \(S^{[n_1]} \times S^{[n_2]}\) to be endowed with a trivial \({\mathbb {C}}^*\)-action and we denote the generator of the character group by \({\mathfrak {s}}\).Footnote 6 Moreover we write s for the generator of

$$\begin{aligned} H^*(B{\mathbb {C}}^*,{\mathbb {Q}}) = H^*_{{\mathbb {C}}^*}(pt,{\mathbb {Q}}) \cong {\mathbb {Q}}[s]. \end{aligned}$$

Let \(P({\mathbb {E}})\) be any polynomial in descendent insertions \(\tau _{\alpha }(\sigma )\), which arises from a polynomial in Chern numbers of \(T^\text {vir}\) (e.g. such as in Proposition 2.1). For any \(a_1, a_2 \in A^1(S)\) and \(n_1, n_2 \in {\mathbb {Z}}_{\ge 0}\), Mochizuki defines \(\Psi (a_1,a_2,n_1,n_2)\) as follows

$$\begin{aligned} \text {Coeff}_{s^0} \Bigg ( \frac{P({\mathcal {I}}_1(a_1) \otimes {\mathfrak {s}}^{-1} \oplus {\mathcal {I}}_2(a_2) \otimes {\mathfrak {s}})}{Q({\mathcal {I}}_1(a_1) \otimes {\mathfrak {s}}^{-1}, {\mathcal {I}}_2(a_2) \otimes {\mathfrak {s}})} \frac{\text {Eu}({\mathcal {O}}(a_1)^{[n_1]}) \, \text {Eu}({\mathcal {O}}(a_2)^{[n_2]} \otimes {\mathfrak {s}}^2)}{(2s)^{n_1+n_2 - \chi ({\mathcal {O}}_S)}} \Bigg ). \end{aligned}$$
(13)

Let us explain this notation. Here \(\text {Eu}(\cdot )\) denotes \({\mathbb {C}}^*\)-equivariant Euler class and \(\text {Coeff}_{s^0}\) refers to taking the coefficient of \(s^0\).Footnote 7 The notation \({\mathcal {I}}_i(a_i)\) is short-hand for \({\mathcal {I}}_i \otimes \pi _S^* {\mathcal {O}}(a_i)\). Furthermore, for any \({\mathbb {C}}^*\)-equivariant sheaves \({\mathbb {E}}_1\), \({\mathbb {E}}_2\) on \(S^{[n_1]} \times S^{[n_2]} \times S\) flat over \(S^{[n_1]} \times S^{[n_2]}\), Mochizuki defines

where \(\pi : S^{[n_1]} \times S^{[n_2]} \times S \rightarrow S^{[n_1]} \times S^{[n_2]}\) denotes projection and

Finally, \(P(\cdot )\) is the expression obtained from \(P({\mathbb {E}})\) by formally replacing \({\mathbb {E}}\) by \(\cdot \). We define

$$\begin{aligned} {\widetilde{\Psi }}(a_1,a_2,n_1,n_2,s) \end{aligned}$$

by expression (13) but without applying \(\text {Coeff}_{s^0}\).

Next, let \(c_1,c_2\) be a choice of Chern classes and let \({\text {ch}}= (2,c_1,\frac{1}{2}c_1^2 - c_2)\) denote the corresponding Chern character. For any decomposition \(c_1 = a_1 + a_2\), Mochizuki defines

$$\begin{aligned} {\mathcal {A}}(a_1,a_2,c_2) := \sum _{n_1 + n_2 = c_2 - a_1 a_2} \int _{S^{[n_1]} \times S^{[n_2]}} \Psi (a_1,a_2,n_1,n_2). \end{aligned}$$
(14)

We denote by \({\widetilde{{\mathcal {A}}}}(a_1,a_2,c_2,s)\) the same expression with \(\Psi \) replaced by \({\widetilde{\Psi }}\).

Theorem 2.3

(Mochizuki). Let S be a smooth projective surface with \(b_1(S) = 0\) and \(p_g(S) >0\). Let \(H, c_1,c_2\) be chosen such that there exist no rank 2 strictly Gieseker H-semistable sheaves with Chern classes \(c_1,c_2\). Suppose the universal sheaf \({\mathbb {E}}\) exists on \(M_{S}^{H}(2,c_1,c_2) \times S\). Suppose the following conditions hold:

  1. (i)

    \(\chi ({\text {ch}}) > 0\), where \(\chi ({\text {ch}}) := \int _S {\text {ch}}\cdot \text {td}(S)\) and \({\text {ch}}= (2,c_1,\frac{1}{2}c_1^2 - c_2)\).

  2. (ii)

    \(p_{{\text {ch}}} > p_{K_S}\), where \(p_{{\text {ch}}}\) and \(p_{K_S}\) are the reduced Hilbert polynomials associated to \({\text {ch}}\) and \(K_S\).

  3. (iii)

    For all Seiberg–Witten basic classes \(a_1\) satisfying \(a_1 H \le (c_1 -a_1) H\) the inequality is strict.

Let \(P({\mathbb {E}})\) be any polynomial in descendent insertions, which arises from a polynomial in Chern numbers of \(T^\text {vir}\) (e.g. such as in Proposition 2.1). Then

$$\begin{aligned} \int _{[M_{S}^{H}(2,c_1,c_2)]^{\text {vir}}} P({\mathbb {E}}) = -2^{1-\chi ({\text {ch}})} \sum _{{{\begin{array}{c} c_1 = a_1 + a_2 \\ a_1 H < a_2 H \end{array}}}} \text {SW}(a_1) \, {\mathcal {A}}(a_1,a_2,c_2). \end{aligned}$$

Remark 2.4

The assumption that \({\mathbb {E}}\) exists on \(M \times S\), where \(M:=M_{S}^{H}(2,c_1,c_2)\), is unnecessary. As remarked in the introduction, always exists globally so the left-hand side of Mochizuki’s formula always makes sense. Moreover, Mochizuki [Moc] works over the Deligne-Mumford stack of oriented sheaves, which always has a universal sheaf. This can be used to show that global existence of \({\mathbb {E}}\) on \(M \times S\) can be dropped from the assumptions. In fact, when working on the stack, P can be any polynomial in descendent insertions defined using the universal sheaf on the stack.

Remark 2.5

Mochizuki’s formula (Theorem 2.3) holds without the assumption that there are no strictly H-Gieseker semistable sheaves and without assumption (iii). Then \(\int _{[M]^{\text {vir}}} P({\mathbb {E}})\) is defined via the moduli space of oriented reduced Bradlow pairs [Moc, Def. 7.3.2]. In this more general setup the definition of \({\mathcal {A}}(a_1,a_2,c_2)\) has to be modified slightly: when \(a_1 H = a_2 H\) the sum in (14) is over all \(n_1>n_2\) satisfying \(n_1 + n_2 = c_2 - a_1 a_2\).

Remark 2.6

It is conjectured in [GNY3] that assumptions (ii) and (iii) can be dropped from Theorem 2.3 and the sum can be replaced by a sum over all Seiberg–Witten basic classes. We will see in our computations that assumption (i) is necessary.

Remark 2.7

Let the setup be as in Theorem 2.3. Then

$$\begin{aligned} {\chi }_{-y}^\text {vir}(M_{S}^{H}(2,c_1,c_2)) = -2^{1-\chi ({\text {ch}})} \sum _{{{\begin{array}{c} c_1 = a_1 + a_2 \\ a_1 H < a_2 H \end{array}}}} \text {SW}(a_1) \, {\mathcal {A}}(a_1,a_2,c_2), \end{aligned}$$

where \(P({\mathbb {E}})\) is determined by the following expression

with \({\mathbb {E}}= {\mathcal {I}}_1(a_1) \otimes {\mathfrak {s}}^{-1} \oplus {\mathcal {I}}_2(a_2) \otimes {\mathfrak {s}}\). We note that the rank of

is given by

$$\begin{aligned} \chi ({\mathcal {O}}_S)+(2n_1+2n_2 - 2\chi ({\mathcal {O}}_S))&+ (n_1+n_2-\chi ({\mathcal {O}}(a_2-a_1))) \\&+(n_1+n_2-\chi ({\mathcal {O}}(a_1-a_2))) \\ = 4c_2-c_1^2-3\chi ({\mathcal {O}}_S), \end{aligned}$$

which equals the rank of \(T^\text {vir}\) given by (2).

3 Universality

In this section S is any smooth projective surface. We start with a well-known lemma, which we include for completeness.

Lemma 3.1

Let \(\pi : S^{[n]}z \times S \rightarrow S^{[n]}\) denote the projection. Then

where \({\mathcal {I}}\) denotes the universal ideal sheaf and \(T_{S^{[n]}}\) denotes the tangent bundle.

Proof

Since , it suffices to show that for any ideal sheaf \(I = I_Z\subset {\mathcal {O}}_S\) we have \({\text {Hom}}(I,I)_0 = {\text {Ext}}^2(I,I)_0 = 0\). Clearly \({\text {Hom}}(I,I)_0 = 0\) because I is simple.

Next we consider the trace map

$$\begin{aligned} {\text {Ext}}^2(I,I) \rightarrow H^2({\mathcal {O}}_S). \end{aligned}$$

First applying \(- \otimes K_S\) and then \({\text {Hom}}(I,\cdot )\) to

$$\begin{aligned} 0 \rightarrow I \rightarrow {\mathcal {O}}_S \rightarrow {\mathcal {O}}_Z \rightarrow 0 \end{aligned}$$
(15)

gives a long exact sequence

$$\begin{aligned} 0 \rightarrow {\text {Hom}}(I,I \otimes K_S) \rightarrow {\text {Hom}}(I,K_S) \rightarrow {\text {Hom}}(I,K_S|_Z) \rightarrow \cdots . \end{aligned}$$

The natural map \(H^0(K_S) \rightarrow {\text {Hom}}(I,I \otimes K_S)\) is Serre dual to the trace map, so our goal is to show that this map is an isomorphism. It is enough to show that the restriction map \(H^0(K_S) \rightarrow {\text {Hom}}(I,K_S)\), which factors through \({\text {Hom}}(I,I \otimes K_S)\), is an isomorphism. This in turn can be seen by applying \({\text {Hom}}(\cdot , K_S)\) to (15) and using that Z is 0-dimensional. \(\quad \square \)

Our main object of study is the following generating function.

Definition 3.2

For any a in the Chow group \(A^1(S)\) we abbreviate \(\chi (a) := \chi ({\mathcal {O}}(a))\). For any \(a_1, c_1 \in A^1(S)\), we define

$$\begin{aligned}&{\textsf {Z}}_{S}(a_1,c_1,s,y, q) := \\&\sum _{n_1, n_2 \ge 0} q^{n_1+n_2} \int _{S^{[n_1]} \times S^{[n_2]}} \frac{{\textsf {T}}^{{\mathbb {C}}^*}_{-y}(E_{n_1,n_2},1-y) \, \text {Eu}({\mathcal {O}}(a_1)^{[n_1]}) \, \text {Eu}({\mathcal {O}}(c_1-a_1)^{[n_2]} \otimes {\mathfrak {s}}^2)}{\text {Eu}(E_{n_1,n_2} - \pi _1^* \, T_{S^{[n_1]}} - \pi _2^* \, T_{S^{[n_2]}})}, \end{aligned}$$

where \({\textsf {T}}^{{\mathbb {C}}^*}\) denotes the \({\mathbb {C}}^*\)-equivariant analog of (6) and

with

$$\begin{aligned} {\mathbb {E}}= {\mathcal {I}}_1(a_1) \otimes {\mathfrak {s}}^{-1} \oplus {\mathcal {I}}_2(a_2) \otimes {\mathfrak {s}}. \end{aligned}$$

This can be rewritten as

Here we used Serre duality and \((\cdot )^*\) denotes the dual vector bundle. The complex \(E_{n_1,n_2}\) has rank \(4n_1+4n_2\). Note that

$$\begin{aligned} {\textsf {Z}}_{S}(a_1,c_1,s,y,q) \in 1 + q \, {\mathbb {Q}}[y](\!(s)\!)[[q]]. \end{aligned}$$
(16)

For \(y=1\), using the \({\mathbb {C}}^*\)-equivariant version of (10), we obtain

$$\begin{aligned} {\textsf {T}}^{{\mathbb {C}}^*}_{-y}(E_{n_1,n_2},1-y) \Big |_{y=1} = c^{{\mathbb {C}}^*}(E_{n_1,n_2}). \end{aligned}$$

Suppose we have a decomposition \(c_1 = a_1 + a_2\). Then Remark 2.7 implies

$$\begin{aligned} \begin{aligned}&\sum _{c_2 \in {\mathbb {Z}}} {\widetilde{{\mathcal {A}}}}(a_1,c_1-a_1,c_2,s) \, q^{c_2} = {\textsf {Z}}_{S}\Big (a_1,c_1,s,y,\frac{q}{2s}\Big ) \\&\times (2s)^{\chi ({\mathcal {O}}_S)} \Bigg ( \frac{1-e^{-2s(1-y)}}{1-y e^{-2s(1-y)}} \Bigg )^{\chi (c_1-2a_1)} \Bigg ( \frac{1-e^{2s(1-y)}}{1-y e^{2s(1-y)}} \Bigg )^{\chi (2a_1-c_1)} q^{a_1 (c_1-a_1)}. \end{aligned} \end{aligned}$$
(17)

Besides Remark 2.7, this equality uses the following facts: Lemma 3.1, Eqs. (7), (8), and (9). For \(y=1\) the second line of (17) simplifies to

$$\begin{aligned} (2s)^{\chi ({\mathcal {O}}_S)} \Bigg ( \frac{2s}{1+2s} \Bigg )^{\chi (c_1-2a_1)} \Bigg ( \frac{-2s}{1-2s} \Bigg )^{\chi (2a_1-c_1)} q^{a_1 (c_1-a_1)}. \end{aligned}$$

Proposition 3.3

There exist universal functions

$$\begin{aligned} A_1(s,y,q), \ldots , A_7(s,y,q) \in 1 + q \, {\mathbb {Q}}[y](\!(s)\!)[[q]] \end{aligned}$$

such that for any smooth projective surface S and \(a_1, c_1 \in A^1(S)\) we have

$$\begin{aligned} {\textsf {Z}}_{S}(a_1,c_1,s,y,q) = A_1^{a_1^2} \, A_2^{a_1 c_1} \, A_3^{c_1^2} \, A_4^{a_1 K_S} \, A_5^{c_1 K_S} \, A_6^{K_S^2} \, A_7^{\chi ({\mathcal {O}}_S)}. \end{aligned}$$

Proof

Step 1: Universality. For any smooth projective surface S, define \(S_2 := S \sqcup S\) and denote by \(S_2^{[n]}\) the Hilbert scheme of n points on \(S_2\). Then

$$\begin{aligned} S_2^{[n]} = \bigsqcup _{n_1+n_2=n} S^{[n_1]} \times S^{[n_2]}. \end{aligned}$$

We endow \(S_2^{[n]}\) with trivial \({\mathbb {C}}^*\)-action. Denote by \({\mathcal {I}}_1\), \({\mathcal {I}}_2\) the sheaves on \(S_2^{[n]} \times S\) whose restrictions to \(S^{[n_1]} \times S^{[n_2]} \times S\) are the sheaves \({\mathcal {I}}_1\), \({\mathcal {I}}_2\) pulled back along projection to \(S^{[n_1]} \times S\), \(S^{[n_2]} \times S\) respectively. Let \(L(a_1,c_1)\) be the vector bundle whose restriction to \(S^{[n_1]} \times S^{[n_2]}\) is

$$\begin{aligned} \pi _1^* {\mathcal {O}}(a_1)^{[n_1]} \oplus \pi _2^*{\mathcal {O}}(c_1-a_1)^{[n_2]} \otimes {\mathfrak {s}}^2. \end{aligned}$$

Let X be any rational function in the following list of Chern classes and with coefficients in \({\mathbb {Q}}[y](\!(s)\!)\)

Of course we assume that only \({\mathbb {C}}^*\)-moving terms appear in the denominator of X. Then there exists a polynomial Y in \(a_1^2\), \(a_1 c_1\), \(c_1^2\), \(a_1 K_S\), \(c_1 K_S\), \(K_S^2\), \(\chi ({\mathcal {O}}_S)\) with coefficients in \({\mathbb {Q}}[y](\!(s)\!)\) such that

$$\begin{aligned} \int _{S_2^{[n]}} X = Y, \end{aligned}$$

for any smooth projective surface S and \(a_1, c_1 \in A^1(S)\). This is essentially [GNY1, Lem. 5.5], which in turn is an adaptation of [EGL].

We conclude that for each \(n \ge 0\), \(i \in {\mathbb {Z}}\), there exists a universal polynomial \(Y_{n,i}\) in \(a_1^2\), \(a_1 c_1\), \(c_1^2\), \(a_1 K_S\), \(c_1 K_S\), \(K_S^2\), \(\chi ({\mathcal {O}}_S)\) such that

$$\begin{aligned} {\textsf {Z}}_{S}(a_1,c_1,s,y,q) = \sum _{n \ge 0} \sum _{i} Y_{n,i} s^i q^n, \end{aligned}$$

for all \(S,a_1,c_1\). The coefficient of \(q^0\) is 1 by (16). Hence there exists a universal power series

$$\begin{aligned} G \in {\mathbb {Q}}[y,x_1, \ldots , x_7](\!(s)\!)[[q]], \end{aligned}$$

such that

$$\begin{aligned} {\textsf {Z}}_{S}(a_1,c_1,s,y,q) = \exp \, G(a_1^2, a_1 c_1, c_1^2, a_1 K_S, c_1 K_S, K_S^2, \chi ({\mathcal {O}}_S)) \end{aligned}$$
(18)

for all \(S,a_1,c_1\).

Step 2: Multiplicativity. Let \(S = S' \sqcup S''\), where \(S', S''\) are not necessarily connected smooth projective surfaces. Let \(a_1, c_1 \in A^1(S)\) be such that

$$\begin{aligned} a_1|_{S'} = a_1', \ c_1|_{S'} = c_1', \ a_1|_{S''} = a_1'', \ c_1|_{S''} = c_1''. \end{aligned}$$

Then

$$\begin{aligned} S_{2}^{[n]} = \bigsqcup _{n_1+n_2 = n} S_{2}^{\prime [n_1]} \times S_{2}^{\prime \prime [n_2]}. \end{aligned}$$

Let \({\mathcal {I}}_1 = {\mathcal {I}}_1' \boxplus {\mathcal {I}}_1''\), \({\mathcal {I}}_2 = {\mathcal {I}}_2' \boxplus {\mathcal {I}}_2''\). As observed in [GNY1], we then have

and similarly with \({\mathcal {I}}_1\) and \({\mathcal {I}}_2\) interchanged. Furthermore

$$\begin{aligned} T_{S_{2}^{[n]}}&= \bigoplus _{n_1+n_2=n} T_{S_{2}^{\prime [n_1]}} \boxplus T_{S_{2}^{\prime \prime [n_2]}} \\ L(a_1,c_1)&= L(a_1',c_1') \boxplus L(a_1'',c_1''), \\ \chi (c_1-2a_1)&= \chi (c_1'-2a_1') + \chi (c_1''-2a_1''), \\ \chi (2a_1-c_1)&= \chi (2a_1'-c_1') + \chi (2a_1''-c_1''), \\ a_1 (c_1-a_1)&= a_1' (c_1' - a_1') + a_1'' (c_1''-a_1''). \end{aligned}$$

Note that \(\text {Eu}(\cdot )\) and \({\textsf {T}}^{{\mathbb {C}}^*}_{-y}(\cdot ,1-y)\) are both group homomorphisms from \((K^{0}_{{\mathbb {C}}^*}(M),+)\) to \((A^*(M)_{{\mathbb {Q}}}[y](\!(s)\!),\cdot )\), where “\({\mathbb {C}}^*\)” stands for “\({\mathbb {C}}^*\)-equivariant”. For \({\textsf {T}}^{{\mathbb {C}}^*}_{-y}(\cdot ,1-y)\) this follows from the crucial multiplicative property (7). Therefore

$$\begin{aligned} {\textsf {Z}}_{S}(a_1,c_1,s,y,q) = {\textsf {Z}}_{S'}(a_1',c_1',s,q) \, {\textsf {Z}}_{S''}(a_1'',c_1'',s,y,q). \end{aligned}$$
(19)

We combine (18) and (19) in order to construct the universal functions \(A_i(s,y,q)\). This follows from a cobordism argument similar to [GNY1, Lem. 5.5]. Take seven triples \((S^{(i)},a_1^{(i)},c_1^{(i)})\) such that the vectors

$$\begin{aligned} w_i := ((a_1^{(i)})^2,a_1^{(i)}c_1^{(i)},(c_1^{(i)})^2,a_1^{(i)}K_{S^{(i)}},c_1^{(i)}K_{S^{(i)}},K_{S^{(i)}}^2,\chi ({\mathcal {O}}_{S^{(i)}})) \in {\mathbb {Q}}^{7} \end{aligned}$$

form a \({\mathbb {Q}}\)-basis. Now consider an arbitrary triple \((S,a_1,c_1)\). Then we can decompose \(w = (a_1^2, \ldots , \chi ({\mathcal {O}}_S))\) as \(w = \sum _i n_i w_i\) for some \(n_i \in {\mathbb {Q}}\). If all \(n_i \in {\mathbb {Z}}_{\ge 0}\), then (19) implies

$$\begin{aligned} {\textsf {Z}}_S(a_1,c_1,s,y,q) = \prod _{i=1}^{7} \big (\exp G(w_i) \big )^{n_i} = \exp \Big ( \sum _{i=1}^{7} n_i G(w_i) \Big ). \end{aligned}$$
(20)

Denote by W the matrix with column vectors \(w_1, \ldots , w_{7}\) and let \(M = (m_{ij})\) be its inverse. We define

$$\begin{aligned} A_j(s,y,q) := \exp \Big (\sum _i m_{ij} G(w_i)\Big ), \quad \forall j=1, \ldots , 7. \end{aligned}$$

From (20) we obtain

$$\begin{aligned} \begin{aligned} {\textsf {Z}}_S(a_1,c_1,s,y,q) = A_1^{a_1^2} \, A_2^{a_1 c_1} \, A_3^{c_1^2} \, A_4^{a_1 K_S} \, A_5^{c_1 K_S} \, A_6^{K_S^2} \, A_7^{\chi ({\mathcal {O}}_S)}. \end{aligned} \end{aligned}$$
(21)

Since the points \(w = \sum _i n_i w_i\), with \(n_i \in {\mathbb {Z}}_{\ge 0}\), lie Zariski dense in \({\mathbb {Q}}^{7}\), we conclude that (21) holds for all triples \((S,a_1,c_1)\). \(\quad \square \)

For a 7-tuple \({\underline{\alpha }}=(\alpha _1,\alpha _2,\alpha _3,\alpha _4,\alpha _5,\alpha _6,\alpha _7) \in {\mathbb {Z}}^7\) we denote

$$\begin{aligned} \begin{aligned} {\textsf {A}}_{{\underline{\alpha }} }(s,y,q) :=&- 2 \Bigg ( 2^{-1} \Bigg ( \frac{1-e^{-2s(1-y)}}{1-y e^{-2s(1-y)}} \Bigg )^{2} \Bigg ( \frac{1-e^{2s(1-y)}}{1-y e^{2s(1-y)}} \Bigg )^{2} q^{-1} A_1(s,y,q/s) \Bigg )^{\alpha _1} \\&\times \Bigg ( 2 \Bigg ( \frac{1-e^{-2s(1-y)}}{1-y e^{-2s(1-y)}} \Bigg )^{-2} \Bigg ( \frac{1-e^{2s(1-y)}}{1-y e^{2s(1-y)}} \Bigg )^{-2} q A_2(s,y,q/s) \Bigg )^{\alpha _2} \\&\times \Bigg ( 2^{-\frac{1}{2}} \Bigg ( \frac{1-e^{-2s(1-y)}}{1-y e^{-2s(1-y)}} \Bigg )^{\frac{1}{2}} \Bigg ( \frac{1-e^{2s(1-y)}}{1-y e^{2s(1-y)}}\Bigg )^{\frac{1}{2}} A_3(s,y,q/s) \Bigg )^{\alpha _3} \\&\times \Bigg ( \Bigg ( \frac{1-e^{-2s(1-y)}}{1-y e^{-2s(1-y)}} \Bigg ) \Bigg ( \frac{1-e^{2s(1-y)}}{1-y e^{2s(1-y)}} \Bigg )^{-1} A_4(s,y,q/s) \Bigg )^{\alpha _4} \\&\times \Bigg ( 2^{\frac{1}{2}} \Bigg ( \frac{1-e^{-2s(1-y)}}{1-y e^{-2s(1-y)}} \Bigg )^{-\frac{1}{2}} \Bigg ( \frac{1-e^{2s(1-y)}}{1-y e^{2s(1-y)}} \Bigg )^{\frac{1}{2}} A_5(s,y,q/s) \Bigg )^{\alpha _5} \\&\times A_6(s,y,q/s) ^{\alpha _6} \\&\times \Bigg ( \frac{s}{2} \Bigg ( \frac{1-e^{-2s(1-y)}}{1-y e^{-2s(1-y)}} \Bigg ) \Bigg ( \frac{1-e^{2s(1-y)}}{1-y e^{2s(1-y)}} \Bigg ) A_7(s,y,q/s) \Bigg )^{\alpha _7}. \end{aligned} \end{aligned}$$
(22)

Corollary 3.4

Suppose S satisfies \(b_1(S)=0\) and \(p_g(S) > 0\). Let \(H, c_1, c_2\) be chosen such that there exist no rank 2 strictly Gieseker H-semistable sheaves with Chern classes \(c_1, c_2\). Assume furthermore that:

  1. (i)

    \(c_2 < \frac{1}{2} c_{1}(c_1-K_S) + 2\chi ({\mathcal {O}}_S)\).

  2. (ii)

    \(p_{{\text {ch}}} > p_{K_S}\), where \(p_{{\text {ch}}}\) and \(p_{K_S}\) are the reduced Hilbert polynomials associated to \({\text {ch}}= (2,c_1,\frac{1}{2}c_1^2-c_2)\) and \(K_S\).

  3. (iii)

    For all SW basic classes \(a_1\) satisfying \(a_1 H \le (c_1 -a_1) H\) the inequality is strict.

Then

$$\begin{aligned}&\chi _{-y}^{\text {vir}}(M_{S}^{H}(2,c_1,c_2))\\&\quad =\text {Coeff}_{s^0 q^{c_2}} \Big [ \sum _{{{\begin{array}{c} a_1 \in H^2(S,{\mathbb {Z}}) \\ a_1 H < (c_1-a_1) H \end{array}}}} \text {SW}(a_1) \, {\textsf {A}}_{(a_1^2,a_1 c_1,c_1^2,a_1 K_S, c_1 K_S, K_S^2, \chi ({\mathcal {O}}_S))}(s,y,q)\Big ]. \end{aligned}$$

Proof

This follows from Remark 2.7, Eq. (17), and Proposition 3.3. \(\quad \square \)

Remark 3.5

By Remark 2.6, we conjecture that this corollary holds without assuming (ii) and (iii) and that the sum can be replaced by a sum over all Seiberg–Witten basic classes. We refer to this as “the strong form of Mochizuki’s formula”.

4 Toric Calculation

The universal functions \(A_1, \ldots , A_7\) are entirely determined by the generating function \({\textsf {Z}}_S(a_1,c_1,s,y,q)\) in the following cases

$$\begin{aligned} (S,a_1,c_1) = \,&({\mathbb {P}}^2,{\mathcal {O}},{\mathcal {O}}), ({\mathbb {P}}^2,{\mathcal {O}}(1),{\mathcal {O}}(1)), ({\mathbb {P}}^2,{\mathcal {O}},{\mathcal {O}}(1)), ({\mathbb {P}}^2,{\mathcal {O}}(1),{\mathcal {O}}(2)), \\&({\mathbb {P}}^1 \times {\mathbb {P}}^1, {\mathcal {O}},{\mathcal {O}}), ({\mathbb {P}}^1 \times {\mathbb {P}}^1, {\mathcal {O}}(1,0),{\mathcal {O}}(1,0)), ({\mathbb {P}}^1 \times {\mathbb {P}}^1, {\mathcal {O}},{\mathcal {O}}(1,0)). \end{aligned}$$

For these choices the \(7 \times 7\) matrix with rows

$$\begin{aligned} (a_1^2,a_1 c_1,c_1^2,a_1 K_S, c_1 K_S, K_S^2, \chi ({\mathcal {O}}_S)) \end{aligned}$$

has full rank. In each of these cases S is a toric surface.

Suppose S is a toric surface with torus \(T = {\mathbb {C}}^{*2}\). Let \(\{U_\sigma \}_{\sigma = 1, \ldots , e(S)}\) be the cover of maximal T-invariant affine open subsets of S. On \(U_\sigma \) we use coordinates \(x_\sigma , y_\sigma \) such that T acts with characters of weight \(v_\sigma , w_\sigma \in {\mathbb {Z}}^2\)

$$\begin{aligned} t \cdot (x_\sigma ,y_\sigma ) = (\chi (v_\sigma )(t) \, x_\sigma , \chi (w_\sigma )(t) \, y_\sigma ). \end{aligned}$$

Here \(\chi (m) : T \rightarrow {\mathbb {C}}^*\) denotes the character of weight \(m \in {\mathbb {Z}}^2\). Consider the integral over \(S^{[n_1]} \times S^{[n_2]}\) of Definition 3.2

$$\begin{aligned} \int _{S^{[n_1]} \times S^{[n_2]}} \frac{{\textsf {T}}_{-y}^{{\mathbb {C}}^*}(E_{n_1,n_2},1-y) \, \text {Eu}({\mathcal {O}}(a_1)^{[n_1]}) \, \text {Eu}({\mathcal {O}}(c_1-a_1)^{[n_2]} \otimes {\mathfrak {s}}^2)}{\text {Eu}\big (E_{n_1,n_2}-\pi _1^*T_{S^{[n_1]}}-\pi _2^*T_{S^{[n_2]}}\big )}. \end{aligned}$$
(23)

Define \({\widetilde{T}} := T \times {\mathbb {C}}^*\), where \({\mathbb {C}}^*\) denotes the trivial torus factor of Sect. 2. The T-fixed locus of \(S^{[n_1]} \times S^{[n_2]}\) is indexed by pairs \((\varvec{\lambda },\varvec{\mu })\) with

$$\begin{aligned} \varvec{\lambda }= \{\lambda ^{(\sigma )}\}_{\sigma =1, \ldots , e(S)}, \ \varvec{\mu }= \{\mu ^{(\sigma )}\}_{\sigma =1, \ldots , e(S)}, \end{aligned}$$

where \(\lambda ^{(\sigma )}\), \(\mu ^{(\sigma )}\) are partitions such that

$$\begin{aligned} \sum _\sigma |\lambda ^{(\sigma )}| = n_1, \ \sum _\sigma |\mu ^{(\sigma )}| = n_2. \end{aligned}$$
(24)

Here \(|\lambda |\) denotes the size of \(\lambda \). A partition \(\lambda = (\lambda _1 \ge \cdots \ge \lambda _\ell )\) corresponds to a monomial ideal of \({\mathbb {C}}[x,y]\)

$$\begin{aligned} I_{Z_\lambda } := (y^{\lambda _1}, xy^{\lambda _2}, \ldots , x^{\ell -1}y^{\lambda _\ell }, x^{\ell }), \end{aligned}$$

where \(\ell (\lambda ) = \ell \) denotes the length of \(\lambda \). For \(\lambda ^{(\sigma )}\) we denote the subscheme defined by the corresponding monomial ideal in variables \(x_\sigma ,y_\sigma \) by \(Z_{\lambda ^{(\sigma )}}\) and similarly for \(\mu ^{(\sigma )}\).

In order to apply localization, we make a choice of T-equivariant structure on the line bundles \({\mathcal {O}}(a_1)\), \({\mathcal {O}}(c_1-a_1)\). For any T-equivariant divisor a, the restriction \({\mathcal {O}}(a)|_{U_{\sigma }}\) is trivial with T-equivariant structure determined by some character of weight \(a_\sigma \in {\mathbb {Z}}^2\). By Atiyah-Bott localization, the integral (23) equals

$$\begin{aligned} \sum _{(\varvec{\lambda },\varvec{\mu })}&\prod _\sigma \frac{\text {Eu}(H^0({\mathcal {O}}(a_1)|_{Z_{\lambda ^{(\sigma )}}}))}{\text {Eu}(T_{Z_{\lambda ^{(\sigma )}}})} \frac{\text {Eu}(H^0({\mathcal {O}}(c_1-a_1)|_{Z_{\mu ^{(\sigma )}}}) \otimes {\mathfrak {s}}^2)}{\text {Eu}(T_{Z_{\mu ^{(\sigma )}}})} \\&\times \frac{{\textsf {T}}^{\widetilde{T}}_{-y}(E_{n_1,n_2}|_{(Z_{\lambda ^{(\sigma )}}, Z_{\mu ^{(\sigma )}})},1-y)}{\text {Eu}(E_{n_1,n_2}|_{(Z_{\lambda ^{(\sigma )}}, Z_{\mu ^{(\sigma )}})} - T_{Z_{\lambda ^{(\sigma )}}} - T_{Z_{\mu ^{(\sigma )}}})}. \end{aligned}$$

Here \(\text {Eu}(\cdot )\) denotes \({\widetilde{T}}\)-equivariant Euler class, \({\textsf {T}}^{{\widetilde{T}}}\) is the \({\widetilde{T}}\)-equivariant version of (6), and the sum is over all \((\varvec{\lambda },\varvec{\mu })\) satisfying (24). Moreover, \(T_{Z_{\lambda }}\) denotes the T-equivariant Zariski tangent space of \(({\mathbb {C}}^{2})^{[n]}\) at \(Z_\lambda \) where \(n = |\lambda |\). The calculation of the above product reduces to the computation of the following elements of the T-equivariant K-group \(K_0^{T}(pt)\)

$$\begin{aligned}&H^0({\mathcal {O}}(a)|_{Z_{\lambda ^{(\sigma )}}}), \\&R{\text {Hom}}_S({\mathcal {O}}_{Z_{\lambda ^{(\sigma )}}}, {\mathcal {O}}_{Z_{\lambda ^{(\sigma )}}}), \\&R{\text {Hom}}_S({\mathcal {O}}_{Z_{\lambda ^{(\sigma )}}}, {\mathcal {O}}_{Z_{\mu ^{(\sigma )}}}(a)), \end{aligned}$$

for various T-equivariant divisors a. By definition we have

$$\begin{aligned} Z_{\lambda ^{(\sigma )}} = \sum _{i=0}^{\ell (\lambda ^{(\sigma )}) - 1} \sum _{j=0}^{\lambda ^{(\sigma )}_{i+1}-1} \chi (v_\sigma )^i \, \chi (w_\sigma )^j \end{aligned}$$

and similarly for \(Z_{\mu ^{(\sigma )}}\). Multiplying by \(\chi (a_\sigma )\) gives \(H^0({\mathcal {O}}(a)|_{Z_{\lambda ^{(\sigma )}}})\). Define

$$\begin{aligned} \overline{\chi (m)} := \chi (-m) = \frac{1}{\chi (m)}, \end{aligned}$$

for any \(m \in {\mathbb {Z}}^2\). This defines an involution on \(K_0^{T}(pt)\) by \({\mathbb {Z}}\)-linear extension.

Proposition 4.1

Let \(W,Z \subset S\) be 0-dimensional T-invariant subschemes supported on a chart \(U_\sigma \subset S\) and let a be a T-equivariant divisor on S corresponding to a character of weight \(a_\sigma \in {\mathbb {Z}}^2\) on \(U_\sigma \). Then we have the following equality in \(K_0^{T}(pt)\)

$$\begin{aligned} R{\text {Hom}}_S({\mathcal {O}}_W, {\mathcal {O}}_Z(a)) = \chi (a_\sigma ) \, {\overline{W}} Z \frac{(1-\chi (v_\sigma )) (1- \chi (w_\sigma ))}{\chi (v_\sigma ) \chi (w_\sigma )}. \end{aligned}$$

Proof

The proof we present is similar to the calculation in [MNOP, Sect. 4.7]. Let \(v := v_\sigma \), \(w := w_\sigma \), and \(a := a_\sigma \). Write \(U_\sigma = {\text {Spec}}\,R\) with \(R = {\mathbb {C}}[x_\sigma ,y_\sigma ]\). Then

$$\begin{aligned} R{\text {Hom}}_S({\mathcal {O}}_W, {\mathcal {O}}_Z(a)) = R {\text {Hom}}_{U_\sigma }({\mathcal {O}}_{W}, {\mathcal {O}}_{Z}(a)), \end{aligned}$$

because WZ are supported on \(U_\sigma \). We claim

$$\begin{aligned}&\Gamma (U_\sigma , {\mathcal {O}}(a)) - R {\text {Hom}}_{U_\sigma }(I_W,I_Z(a)) \\&\quad = \chi (a) \Big ( Z + \frac{{\overline{W}}}{\chi (v) \chi (w)} - {\overline{W}} Z \frac{(1-\chi (v)) (1- \chi (w))}{\chi (v) \chi (w)} \Big ). \end{aligned}$$

The result of the proposition follows from this using \(I_Z = {\mathcal {O}}_{U_\sigma } - {\mathcal {O}}_{Z}\), \(I_W = {\mathcal {O}}_{U_\sigma } - {\mathcal {O}}_{W}\), because the first term on the right-hand side is \(H^0(U_\sigma ,{\mathcal {O}}_Z(a))\) and the second term is

$$\begin{aligned} R{\text {Hom}}_{U_\sigma }({\mathcal {O}}_W,{\mathcal {O}}_{U_\sigma }(a))&= H^0(U_\sigma ,{\mathcal {O}}_W(-a) \otimes K_{U_\sigma })^* \\&= H^0(U_\sigma ,{\mathcal {O}}_W)^* \otimes \frac{\chi (a)}{\chi (v)\chi (w)} \end{aligned}$$

by T-equivariant Serre duality. In order to prove the claim, choose T-equivariant graded free resolutions

$$\begin{aligned} 0 \rightarrow E_r&\rightarrow \cdots \rightarrow E_0 \rightarrow I_{W} \rightarrow 0, \\ 0 \rightarrow F_s&\rightarrow \cdots \rightarrow F_0 \rightarrow I_{Z} \rightarrow 0, \end{aligned}$$

where

$$\begin{aligned} E_i = \bigoplus _j R(d_{ij}), \ F_i = \bigoplus _j R(e_{ij}). \end{aligned}$$

Then we have Poincaré polynomials

$$\begin{aligned} P_W = \sum _{i,j} (-1)^i \chi (d_{ij}), \ P_Z = \sum _{i,j} (-1)^i \chi (e_{ij}), \end{aligned}$$

which are independent of the choice of resolution. Moreover

$$\begin{aligned} \begin{aligned} W&= {\mathcal {O}}_{U_\sigma } - I_W \\&= \frac{1-P_W}{(1-\chi (v)) (1-\chi (w))}, \\ Z&= {\mathcal {O}}_{U_\sigma } - I_Z \\&= \frac{1-P_Z}{(1-\chi (v)) (1-\chi (w))}. \end{aligned} \end{aligned}$$
(25)

Furthermore

$$\begin{aligned} R {\text {Hom}}_{U_\sigma }(I_W,I_Z(a))&= \sum _{i,j,k,l} (-1)^{i+k} {\text {Hom}}(R(d_{ij}),R(a+e_{kl})) \\&= \sum _{i,j,k,l} (-1)^{i+k} R(a+e_{kl} - d_{ij}) \\&= \frac{\chi (a) {\overline{P}}_W P_Z}{(1-\chi (v))(1-\chi (w))}. \end{aligned}$$

Eliminating \(P_W\), \(P_Z\) using (25) gives the desired result. \(\quad \square \)

We implemented the calculation of (23) into a PARI/GP program (and some parts into Maple as well). This allows us to compute \(A_1(s,y,q), \ldots , A_7(s,y,q)\) up to order \(q^7\), where we calculated the coefficient of \(q^i\) up to order \(s^{29-3i}\) . We also calculated \(A_1(s,1,q), \ldots , A_7(s,1,q)\) up to order \(q^{30}\) and any order in s. The latter are listed up to order \(q^4\) in Appendix 6.7.

5 Two More Conjectures and Consequences

In Sect. 4, we have given a toric procedure to calculate the universal functions \(A_1(s,y,q), \ldots , A_7(s,y,q)\) and therefore also \({\textsf {A}}_{\underline{\alpha }}(s,y,q)\) defined by (22). Consequently, we could now go ahead and provide checks of Conjecture 1.1.

Instead we first present two generalizations of Conjecture 1.1. The first conjecture is a statement about intersection numbers on Hilbert schemes of points. It implies a formula for arbitrary blow-ups of surfaces S with \(b_1(S)=0\), \(p_g(S)>0\), and Seiberg–Witten basic classes 0 and \(K_S \ne 0\). The second conjecture generalizes Conjecture 1.1 to arbitrary surfaces S with \(b_1(S)=0\) and \(p_g(S)>0\). It implies a blow-up formula, which is reminiscent of the blow-up formula of W.-P. Li and Z. Qin [LQ1, LQ2]. It also implies a formula for surfaces with canonical divisor with irreducible reduced connected components. The latter refines a result from the physics literature due to Vafa–Witten [VW, Eqn. (5.45)].

5.1 Numerical conjecture

Suppose S is a surface with \(b_1(S) = 0\), \(p_g(S)>0\), and Seiberg–Witten basic classes are 0 and \(K_S \ne 0\). Then Conjecture 1.1 applies to any choice of

$$\begin{aligned} \underline{\beta } = (\beta _1, \beta _2,\beta _3,\beta _4) = (c_1^2, c_1 K_S, K_S^2, \chi ({\mathcal {O}}_S)), \end{aligned}$$

provided we choose a polarization H for which there are no rank 2 strictly Gieseker H-semistable sheaves with Chern classes \(c_1, c_2\). Moreover, as long as the assumptions of Corollary 3.4 are satisfied, the coefficients of \({\textsf {Z}}_{S,c_1}(x,y)\) are calculated by the universal functions \(A_1, \ldots , A_7\). This raises the expectation that for any choice of the 4-tuple \(\underline{\beta } \in {\mathbb {Z}}^4\) the formula of Corollary 3.4 is determined by the coefficients of the modular form of Conjecture 1.1. This turns out to be false. Computer calculations show that we need to imposeFootnote 8

$$\begin{aligned} \beta _3 \ge \beta _4 - 3. \end{aligned}$$

Indeed let \(\underline{\beta } \in {\mathbb {Z}}^4\) with \(\beta _1 \equiv \beta _2 \mod 2\) and \(\beta _3 \ge \beta _4 - 3\). Let \(n < \frac{1}{2}(\beta _1-\beta _2)+2\beta _4\). We conjecture that

$$\begin{aligned}&\text {Coeff}_{s^0x^{4n-\beta _1-3\beta _4}} \Big [(x y^{-\frac{1}{2}})^{-\beta _1-3\beta _4} {\textsf {A}}_{(0,0,\beta _1,0,\beta _2,\beta _3,\beta _4)}(s,y,(x y^{-\frac{1}{2}})^4) \\&\quad + (-1)^{\beta _4} (x y^{-\frac{1}{2}})^{-\beta _1-3\beta _4} {\textsf {A}}_{(\beta _3,\beta _2,\beta _1,\beta _3,\beta _2,\beta _3,\beta _4)}(s,y,(x y^{-\frac{1}{2}})^4) \Big ] \end{aligned}$$

equals the coefficient of \(x^{4n-\beta _1-3\beta _4}\) of

$$\begin{aligned} 8 \Bigg ( \frac{1}{2} \prod _{n=1}^{\infty } \frac{1}{(1-x^{2n})^{10}(1-x^{2n} y)(1-x^{2n} y^{-1})} \Bigg )^{\beta _4} \Bigg ( \frac{2 \overline{\eta }(x^4)^2}{\theta _3(x,y^{\frac{1}{2}})} \Bigg )^{\beta _3}. \end{aligned}$$

In fact, we have a stronger conjecture, which arose by attempts to generalize Conjecture 1.1 to blow-ups.

Conjecture 5.1

Let \(\underline{\beta } \in {\mathbb {Z}}^4\) be such that \(\beta _1 \equiv \beta _2 \mod 2\) and \(\beta _3 \ge \beta _4 - 3\). Let \(n < \frac{1}{2}(\beta _1-\beta _2)+2\beta _4\). Let \((\gamma _1,\gamma _2)\in {\mathbb {Z}}^2\). Then

$$\begin{aligned}&\text {Coeff}_{s^0x^{4n-\beta _1-3\beta _4}} \Big [(x y^{-\frac{1}{2}})^{-\beta _1-3\beta _4} {\textsf {A}}_{(\gamma _1,\gamma _2,\beta _1,\gamma _1,\beta _2,\beta _3,\beta _4)}(s,y,(x y^{-\frac{1}{2}})^4) \\&\quad + (-1)^{\beta _4} (x y^{-\frac{1}{2}})^{-\beta _1-3\beta _4} {\textsf {A}}_{(\beta _3-\gamma _1,\beta _2-\gamma _2,\beta _1,\beta _3-\gamma _1,\beta _2,\beta _3,\beta _4)}(s,y,(x y^{-\frac{1}{2}})^4) \Big ] \end{aligned}$$

equals the coefficient of \(x^{4n-\beta _1-3\beta _4}\) of

$$\begin{aligned} \psi _{\gamma _1, \gamma _2, \beta _3, \beta _4}(x,y) := 8 (-1)^{\gamma _2}\Bigg ( \frac{\phi (x,y)}{2}\Bigg )^{\beta _4} \Bigg ( \frac{2 \overline{\eta }(x^4)^2}{\theta _3(x,y^{\frac{1}{2}})} \Bigg )^{\beta _3}\Bigg (\frac{\theta _3(x,y^{\frac{1}{2}})}{\theta _3(-x,y^{\frac{1}{2}})} \Bigg )^{\gamma _1}, \end{aligned}$$

where

$$\begin{aligned} \phi (x,y) :=\prod _{n=1}^{\infty } \frac{1}{(1-x^{2n})^{10}(1-x^{2n} y)(1-x^{2n} y^{-1})}. \end{aligned}$$

The evidence for this conjecture is presented in Sect. 6.7. We now discuss its consequences.

Remark 5.2

Let \(\beta _1\in {\mathbb {Z}}\) be even, \(\beta _4\in {\mathbb {Z}}_{\le 3}\), and \(n < \frac{1}{2}\beta _1+2\beta _4\). Then we conjecture

$$\begin{aligned} \text {Coeff}_{s^0x^{4n-\beta _1-3\beta _4}} \Big [&(x y^{-\frac{1}{2}})^{-\beta _1-3\beta _4} {\textsf {A}}_{(0,0,\beta _1,0,0,0,\beta _4)}(s,y,(x y^{-\frac{1}{2}})^4) \Big ] \end{aligned}$$
(26)

equals the coefficient of \(x^{4n-\beta _1-3\beta _4}\) of

$$\begin{aligned} 4 \Bigg ( \frac{\phi (x,y}{2} \Bigg )^{\beta _4}. \end{aligned}$$
(27)

In the case \(\beta _4\) is even this follows from Conjecture 5.1 by taking \(\gamma _1=\gamma _2=\beta _2=\beta _3 = 0\) because then the two summands on the left-hand side of the conjecture are equal. In the case \(\beta _4\) is odd this says that (26) is 0, because (27) only contains even powers of x.

Proposition 5.3

Conjecture 5.1 and the strong form of Mochizuki’s formula (Remark 3.5) imply Conjecture 1.1.

Proof

We only use Conjecture 5.1 for \(\gamma _1=\gamma _2 = 0\). Suppose S is a surface with \(b_1(S) = 0\), \(p_g(S)>0\), and Seiberg–Witten basic classes 0 and \(K_S \ne 0\). Choose a polarization H and \(c_1, c_2\) such that there are no rank 2 strictly Gieseker H-semistable sheaves with these Chern classes. By taking

$$\begin{aligned} (\beta _1, \beta _2,\beta _3,\beta _4) = (c_1^2, c_1K_S, K_S^2, \chi ({\mathcal {O}}_S)) \end{aligned}$$

we automatically satisfy \(\beta _1 \equiv \beta _2 \mod 2\). The fact that \(\beta _3 \ge \beta _4 - 3\) can be seen as follows. If S is not minimal then it is the blow-up of a K3 in one point and the inequality is trivial. If S is minimal then it is minimal properly elliptic or minimal general type because \(p_g(S)>0\). For minimal general type the inequality follows from Noether’s inequality \(K_S^2 \ge 2(\chi ({\mathcal {O}}_S) - 3)\). When \(\pi : S \rightarrow B\) is minimal properly elliptic, we have \(B \cong {\mathbb {P}}^1\) because \(b_1(S) = 0\). The canonical bundle \(K_S\) satisfies \(K_S^2 = 0\) and can be represented by an effective divisor containing \(\pi ^* D\), where \(D \subset {\mathbb {P}}^1\) is some effective divisor of degree

$$\begin{aligned} \chi ({\mathcal {O}}_S)-2 \ge 0. \end{aligned}$$

Indeed \(\chi ({\mathcal {O}}_S) \le 3\), because otherwise at least \(0, F, K_S\) are distinct Seiberg–Witten basic classes [FM]. By the strong form of Mochizuki’s formula (Corollary 3.4 and Remark 3.5), Conjecture 1.1 follows for all

$$\begin{aligned} {\text {vd}}< c_1^2 - 2 c_1 K_S + 5 \chi ({\mathcal {O}}_S). \end{aligned}$$

If this inequality is not satisfied, then we replace \(c_1\) by \(c_1 + t H\) for some \(t>0\). Since

$$\begin{aligned} {\textsf {Z}}_{S,c_1}(x,y) = {\textsf {Z}}_{S,c_1 + 2 t H}(x,y) \end{aligned}$$

we can compute the coefficients of this generating function for all

$$\begin{aligned} {\text {vd}}< c_1^2 - 2 c_1 K_S + 5 \chi ({\mathcal {O}}_S) + 4 t^2 H^2 + 4 t H (c_1 - K_S). \end{aligned}$$

By choosing \(t \gg 0\) the bound becomes arbitrarily high. \(\quad \square \)

5.2 Fixed first Chern class

Proposition 5.4

Assume Conjecture 5.1. Let \(\underline{\beta } \in {\mathbb {Z}}^4\) be such that \(\beta _1 \equiv \beta _2 \mod 2\) and \(\beta _3 \ge \beta _4 - 3\) and let \((\gamma _1,\gamma _2)\in {\mathbb {Z}}^2\). Then

$$\begin{aligned}&\text {Coeff}_{s^0} \Big [ (x y^{-\frac{1}{2}})^{-\beta _1-3\beta _4} {\textsf {A}}_{(\gamma _1,\gamma _2,\beta _1,\gamma _1,\beta _2,\beta _3,\beta _4)}(s,y,(x y^{-\frac{1}{2}})^4) \\&\qquad \quad \ \, + (-1)^{\beta _4} (x y^{-\frac{1}{2}})^{-\beta _1-3\beta _4} {\textsf {A}}_{(\beta _3-\gamma _1,\beta _2-\gamma _2,\beta _1,\beta _3-\gamma _1, \beta _2,\beta _3,\beta _4)}(s,y,(x y^{-\frac{1}{2}})^4) \Big ]\\&\quad =2(-1)^{\gamma _2}\sum _{k=0}^3 (i^k)^{\beta _1-\beta _4} \Bigg (\frac{\phi (i^k x,y)}{2}\Bigg )^{\beta _4} \Bigg (\frac{2\overline{\eta }(x^4)^2}{\theta _3(i^k x,y^{\frac{1}{2}})} \Bigg )^{\beta _3}\Bigg (\frac{\theta _3(i^k x,y^{\frac{1}{2}})}{\theta _3(-i^k x,y^{\frac{1}{2}})}\Bigg )^{\gamma _1} \\&\qquad \quad \ \, +O(x^{\beta _1-2\beta _2+5\beta _4}), \end{aligned}$$

where \(i = \sqrt{-1}\) and

$$\begin{aligned} \phi (x,y) :=\prod _{n=1}^{\infty } \frac{1}{(1-x^{2n})^{10}(1-x^{2n} y)(1-x^{2n} y^{-1})}. \end{aligned}$$

Proof

Recall the formula for \(\psi (x,y) := \psi _{\gamma _1,\gamma _2,\beta _3,\beta _4}(x,y)\) of Conjecture 5.1. We see that the term in the sum on the right-hand side corresponding to \(k=0\) equals \(\psi (x,y)/4\). Define coefficients \(f_n(y)\) by

$$\begin{aligned} \psi (x,y) = \sum _{n=0}^{\infty } f_n(y) \, x^n. \end{aligned}$$

Then the right-hand side of the formula of the proposition equals

$$\begin{aligned} \sum _{k=0}^{3} \frac{(i^k)^{\beta _1 - \beta _4}}{4} \psi (i^k x,y)&=\sum _{k=0}^{3} \sum _{n=0}^{\infty } \frac{(i^k)^{\beta _1 - \beta _4 + n}}{4} f_n(y) \, x^n \\&=\sum _{n=0}^{\infty } \Bigg ( \frac{1}{4}\sum _{k=0}^{3} \big (i^{\beta _1 - \beta _4 + n}\big )^k\Bigg ) f_n(y) \, x^n \\&= \sum _{n \equiv -\beta _1 + \beta _4 \mod 4} f_n(y) \, x^n. \end{aligned}$$

Therefore we conclude that the right-hand side of the formula of the proposition is obtained from \(\psi (x,y)\) by extracting all terms \(x^n\) for which \(n \equiv -\beta _1 - 3 \beta _4 \mod 4\) and up to order \(O(x^{\beta _1-2\beta _2+5\beta _4})\). The result follows from Conjecture 5.1. \(\quad \square \)

The same type of proof applied to Conjecture 1.1 implies the following.

Proposition 5.5

Assume Conjecture 1.1. Let S be a smooth projective surface with \(b_1(S) = 0\) and \(p_g(S)>0\). Suppose the Seiberg–Witten basic classes of S are 0 and \(K_S \ne 0\). Let \(H, c_1\) be chosen such that there are no rank 2 strictly Gieseker H-semistable sheaves with first Chern class \(c_1\). Then

$$\begin{aligned} {\textsf {Z}}_{S,c_1}(x,y) = 2\sum _{k=0}^3 (i^k)^{c_1^2 - \chi ({\mathcal {O}}_S)} \Bigg ( \frac{\phi (i^k x,y)}{2} \Bigg )^{\chi ({\mathcal {O}}_S)} \Bigg (\frac{2\overline{\eta }(x^4)^2}{\theta _3(i^k x,y^{\frac{1}{2}})} \Bigg )^{K_S^2}. \end{aligned}$$

In particular, Corollary 1.3 in the introduction follows.

Using Conjecture 5.1 we can do better:

Proposition 5.6

Assume Conjecture 5.1 and the strong form of Mochizuki’s formula (Remark 3.5). Let \(S_0\) be a smooth projective surface with \(b_1(S_0) = 0\) and \(p_g(S_0)>0\). Suppose the Seiberg–Witten basic classes of \(S_0\) are 0 and \(K_{S_0} \ne 0\). Suppose S is obtained from an iterated blow-up (possibly at infinitely near points) of \(S_0\) and let \(E_1,\cdots , E_m\) denote the total transforms of the exceptional divisors. Suppose furthermore that \(K_S^2\ge \chi ({\mathcal {O}}_S)-3\). Let \(H, c_1\) be chosen such that there exist no rank 2 strictly Gieseker H-semistable sheaves on S with first Chern class \(c_1\). Then

$$\begin{aligned} \begin{aligned}{\textsf {Z}}_{S,c_1}(x,y) =&\, 2\sum _{k=0}^3 (i^k)^{c_1^2-\chi ({\mathcal {O}}_S)}\Bigg (\frac{\phi (i^k x,y)}{2}\Bigg )^{\chi ({\mathcal {O}}_S)}\Bigg (\frac{2\overline{\eta }(x^4)^2}{\theta _3(i^k x,y^{\frac{1}{2}})}\Bigg )^{K_{S}^2}\\&\times \,\prod _{j=1}^m \Bigg (1+(-1)^{c_1E_j} \frac{\theta _3(-i^kx,y^{\frac{1}{2}})}{\theta _3(i^kx,y^{\frac{1}{2}})}\Bigg ). \end{aligned} \end{aligned}$$

Proof

The surface S is obtained from \(S_0\) by an iterated blow-up in (possibly infinitely near) points \(\pi : S \rightarrow S_0\). We denote by \(E_1, \ldots , E_m\) the total transforms on the blow-ups. Write \(M:=\{1,\ldots ,m\}\), and for a subset \(I\subset M\) write \(E_I=\sum _{i\in I} E_i\). Then \(K_S=K_{S_0}+E_M\). Moreover the Seiberg–Witten basic classes are the \(E_I\) (with Seiberg–Witten invariant 1) and the \(K_{S_0}+E_I=K_S-E_{M-I}\) (with Seiberg–Witten invariant \((-1)^{\chi ({\mathcal {O}}_S)}\)) for all \(I\subset M\). E.g. see [Mor, Thm. 7.4.6] for Seiberg–Witten invariants of blow-ups. We denote by |I| the number of elements of I. Now note that

$$\begin{aligned} K_S(K_S-E_I)=(K_S-E_I)^2=K_S^2-E_I^2. \end{aligned}$$

By the strong form of Mochizuki’s formula (Corollary 3.4 and Remark 3.5) and Proposition 5.4, we obtain the following equation modulo \(x^{c_1^2-2c_1K_S+5\chi ({\mathcal {O}}_S)}\)

$$\begin{aligned} \begin{aligned}&{\textsf {Z}}_{S,c_1}(x,y) =\sum _{I\subset M}\text{ Coeff }_{s^0}\Big [\Big ((x y^{-\frac{1}{2}})^{-c_1^2-3\chi ({\mathcal {O}}_S)}\\ {}&\times {\textsf {A}}_{(E_I^2,E_Ic_1,c_1^2, E_IK_S,c_1K_S,K_S^2,\chi ({\mathcal {O}}_S))}(s,y,(x y^{-\frac{1}{2}})^4)+(-1)^{\chi ({\mathcal {O}}_S)}(x y^{-\frac{1}{2}})^{-c_1^2-3\chi ({\mathcal {O}}_S)}\\ {}&\times {\textsf {A}}_{(K_{S}^2-E_I^2,c_1K_{S}-c_1E_I,c_1^2,K_S^2-E_I^2,c_1K_S,K_S^2, \chi ({\mathcal {O}}_S))}(s,y,(x y^{-\frac{1}{2}})^4)\Big ]\\ {}&=2\sum _{I\subset M}\sum _{k=1}^3 (i^k)^{c_1^2-\chi ({\mathcal {O}}_S)}\Bigg (\frac{\phi (i^k x,y)}{2}\Bigg )^{\chi ({\mathcal {O}}_S)}\Bigg (\frac{2\overline{\eta }(x^4)^2}{\theta _3(i^k x,y^{\frac{1}{2}})}\Bigg )^{K_S^2} (-1)^{c_1E_I}\\ {}&\times \Bigg (\frac{\theta _3(-i^k x,y^{\frac{1}{2}})}{\theta _3(i^kx,y^{\frac{1}{2}})}\Bigg )^{|I|}, \end{aligned} \end{aligned}$$

where we replaced \(E_{M-I}\) by \(E_I\) for all terms with Seiberg–Witten invariant \((-1)^{\chi ({\mathcal {O}}_S)}\). After interchanging the sums we get the formula of the proposition. \(\quad \square \)

5.3 Arbitrary surfaces with holomorphic 2-form

We present the following conjecture about virtual \(\chi _y\)-genera of moduli spaces of rank 2 sheaves on arbitrary smooth projective surfaces S with \(b_1(S)=0\) and \(p_g(S)>0\). Although this conjecture is strictly stronger than Conjecture 1.1, the latter is a little easier to state and was therefore the focus of the introduction. We provide some evidence for this conjecture in Sects. 6.16.2, and 6.3 .

Conjecture 5.7

Let S be a smooth projective surface with \(b_1(S) = 0\) and \(p_g(S)>0\). Let \(H,c_1,c_2\) be chosen such that there are no rank 2 strictly Gieseker H-semistable sheaves with Chern classes \(c_1,c_2\) and let \(M:=M_S^H(2,c_1,c_2)\). Then \(\overline{\chi }_{-y}^{\text {vir}}(M)\) equals the coefficient of \(x^{{\text {vd}}(M)}\) of

$$\begin{aligned} \begin{aligned}\psi _{S,c_1}(x,y) :=&\, 4 \Bigg (\frac{\phi (x,y)}{2}\Bigg )^{\chi ({\mathcal {O}}_S)}\Bigg (\frac{2 \overline{\eta }(x^4)^2}{\theta _3(x,y^{\frac{1}{2}})}\Bigg )^{K_{S}^2}\\ {}&\times \sum _{a \in H^2(S,{\mathbb {Z}})} \text{ SW }(a)(-1)^{c_1 a} \Bigg (\frac{\theta _3(x,y^{\frac{1}{2}})}{\theta _3(-x,y^{\frac{1}{2}})}\Bigg )^{a K_S}, \end{aligned} \end{aligned}$$

where

$$\begin{aligned} \phi (x,y) :=\prod _{n=1}^{\infty } \frac{1}{(1-x^{2n})^{10}(1-x^{2n} y)(1-x^{2n} y^{-1})}. \end{aligned}$$

Assuming this conjecture and when there are no strictly Gieseker H-semistable sheaves with first Chern class \(c_1\), the same calculation as in Sect. 5.2 gives

$$\begin{aligned} {\textsf {Z}}_{S,c_1}(x,y) = \frac{1}{2} \psi _{S,c_1}(x,y) + \frac{1}{2} i^{c_1^2 - \chi ({\mathcal {O}}_S)} \psi _{S,c_1}(ix,y), \end{aligned}$$
(28)

where \(i = \sqrt{-1}\). Specializing to \(y=1\) gives (part of) a formula from the physics literature due to Dijkgraaf-Park-Schroers, namely terms two and three of [DPS, Eqn. (6.1)].Footnote 9 This involves a bit of rewriting using

$$\begin{aligned} \theta _3(i^k x)=\theta _3(x^4)+i^k\theta _2(x^4), \end{aligned}$$
(29)

\(\text {SW}(a) = (-1)^{\chi ({\mathcal {O}}_S)} \text {SW}(K_S-a)\) [Mor, Cor. 6.8.4], and \(a^2=aK_S\) for Seiberg–Witten basic classes [Moc, Prop. 6.3.1].

Remark 5.8

A straight-forward calculation shows that this conjecture implies both Proposition 5.5 (without assuming Conjecture 1.1) and Proposition 5.6 (without assuming Conjecture 5.1 and without assuming \(K_S^2 \ge \chi ({\mathcal {O}}_S)-3\)). In fact, this conjecture implies Conjecture 1.1.

The first application of Conjecture 5.7 is the following blow-up formula.

Proposition 5.9

Assume Conjecture 5.7 holds. Let \(\pi : {\widetilde{S}} \rightarrow S\) be the blow-up in a point of a smooth projective surface S with \(b_1(S) = 0\), \(p_g(S)>0\). Suppose \(H, c_1\) are chosen such that there are no rank 2 strictly Gieseker H-semistable sheaves with first Chern class \(c_1\). Let \({\widetilde{c}}_1 = \pi ^* c_1 - \epsilon E\) with \(\epsilon =0,1\) and suppose \({\widetilde{H}}\) is a polarization on \({\widetilde{S}}\) such that there are no rank 2 strictly Gieseker \({\widetilde{H}}\)-semistable sheaves on \({\widetilde{S}}\) with first Chern class \({\widetilde{c}}_1\). Then

$$\begin{aligned} {\textsf {Z}}_{{\widetilde{S}}, {\widetilde{c}}_1}(x,y) = {\left\{ \begin{array}{ll} \frac{\theta _3(x^4,y)}{\overline{\eta }(x^4)^2} \, {\textsf {Z}}_{S, c_1}(x,y), &{} \epsilon =0 \\ \frac{\theta _2(x^4,y)}{\overline{\eta }(x^4)^2} \, {\textsf {Z}}_{S, c_1}(x,y), &{} \epsilon =1. \end{array}\right. } \end{aligned}$$

Proof

The Seiberg–Witten basic classes of \({\widetilde{S}}\) are \(\pi ^*a\), \(\pi ^*a+E\), where a is a Seiberg–Witten basic class of S and the corresponding Seiberg–Witten invariants are [Mor, Thm. 7.4.6]

$$\begin{aligned} \text {SW}(\pi ^*a) = \text {SW}(\pi ^*a+E) = \text {SW}(a). \end{aligned}$$

Using \(\chi ({\mathcal {O}}_{{\widetilde{S}}}) = \chi ({\mathcal {O}}_S)\), \(K_{{\widetilde{S}}} = \pi ^* K_{S}+E\), \(K_{{\widetilde{S}}}^2 = K_S^2-1\), Conjecture 5.7 implies

$$\begin{aligned} \psi _{{\widetilde{S}},{\widetilde{c}}_1}(x,y) = \frac{1}{2} \Bigg ( \frac{\theta _3(x,y^{\frac{1}{2}})}{\overline{\eta }(x^4)^2} + (-1)^{\epsilon } \frac{\theta _3(-x,y^{\frac{1}{2}})}{\overline{\eta }(x^4)^2} \Bigg ) \psi _{S,c_1}(x,y). \end{aligned}$$

Specializing to \(\epsilon =0,1\) and using (29) the result follows. \(\quad \square \)

For \(y=1\) this blow-up formula appears in a physics context in [VW, Sect. 4.3].

Remark 5.10

Let \(\pi : {\widetilde{S}} \rightarrow S\) be the blow-up in a point of a simply connected smooth projective surface S. Let \(H, c_1\) be chosen on S such that \(c_1 H\) is odd. Let \({\widetilde{H}} = r \pi ^* H - E\) for \(r \gg 0\) and \({\widetilde{c}}_1 = \pi ^* c_1 - \epsilon E\) for \(\epsilon =0,1\) such that \({\widetilde{c}}_1 {\widetilde{H}}\) is odd as well. One can show that the moduli spaces \(M_{{\widetilde{S}}}^{{\widetilde{H}}}(2,{\widetilde{c}}_1,c_2)\) do not depend on the choice of such \({\widetilde{H}}\) [LQ1]. In this setting Li-Qin [LQ1, LQ2] derived a blow-up formula for the virtual Hodge polynomials of these moduli spacesFootnote 10

$$\begin{aligned} h(M_S^H(2,c_1,c_2), x_1, x_2), \, h(M_{{\widetilde{S}}}^{{\widetilde{H}}}(2,{\widetilde{c}}_1,c_2), x_1, x_2). \end{aligned}$$

Normalize the virtual Hodge polynomials as follows

$$\begin{aligned} {\overline{h}}(M_S^H(2,c_1,c_2), x_1, x_2)&= (x_1 x_2)^{ - {\text {vd}}(M_S^H(2,c_1,c_2))/2} h(M_S^H(2,c_1,c_2), x_1, x_2), \\ {\overline{h}}(M_{{\widetilde{S}}}^{{\widetilde{H}}}(2,{\widetilde{c}}_1,c_2), x_1, x_2)&= (x_1 x_2)^{ - {\text {vd}}(M_{{\widetilde{S}}}^{{\widetilde{H}}}(2,{\widetilde{c}}_1,c_2)/2} h(M_{{\widetilde{S}}}^{{\widetilde{H}}}(2,{\widetilde{c}}_1,c_2), x_1, x_2). \end{aligned}$$

Then Li-Qin’s formula reads (see also [Got2, Rem. 3.2])

$$\begin{aligned}&\sum _{c_2} {\overline{h}}(M_{{\widetilde{S}}}^{{\widetilde{H}}}(2,{\widetilde{c}}_1,c_2), x_1, x_2) x^{4c_2 - {\widetilde{c}}_{1}^2} \\&\quad = {\left\{ \begin{array}{ll} \frac{\theta _3(x^4,x_1x_2)}{\overline{\eta }(x^4)^2} \sum _{c_2} {\overline{h}}(M_S^H(2,c_1,c_2), x_1, x_2) x^{4c_2 - c_{1}^2}, &{} \epsilon =0 \\ \frac{\theta _2(x^4,x_1x_2)}{\overline{\eta }(x^4)^2} \sum _{c_2} {\overline{h}}(M_S^H(2,c_1,c_2), x_1, x_2) x^{4c_2 - c_{1}^2}, &{} \epsilon =1. \end{array}\right. } \end{aligned}$$

When specializing to \(x_1 = y\) and \(x_2=1\), this gives the ratios of Proposition 5.9. Hence the blow-up formula for virtual \(\chi _y\)-genera (virtual in the sense of virtual classes) and the blow-up formula for \(\chi _y\)-genera (defined via virtual Hodge polynomials) coincide. In particular, the blow-up formula for virtual Euler characteristics and classical Euler characteristics involve the same ratio as well.

The second application of Conjecture 5.7 is to surfaces with canonical divisor with irreducible reduced connected components.

Proposition 5.11

Assume Conjecture 5.7 holds. Let S be a smooth projective surface with \(b_1(S) = 0\), \(p_g(S)>0\), and suppose \(C_1+ \cdots +C_m \in |K_S|\), where \(C_1, \ldots , C_m\) are mutually disjoint irreducible reduced curves. Suppose \(H,c_1\) are chosen such that there are no rank 2 strictly Gieseker H-semistable sheaves with first Chern class \(c_1\). Then

$$\begin{aligned}&{\textsf {Z}}_{S,c_1}(x,y) = 2 \Bigg (\frac{\phi (x,y)}{2}\Bigg )^{\chi ({\mathcal {O}}_S)}\\&\quad \times \prod _{j=1}^{m} \Bigg \{ \Bigg ( \frac{2\overline{\eta }(x^4)^2}{\theta _3(x,y^{\frac{1}{2}})} \Bigg )^{C_j^2} + (-1)^{c_1 C_j+h^0(N_{C_j/S})} \Bigg ( \frac{2\overline{\eta }(x^4)^2}{\theta _3(-x,y^{\frac{1}{2}})} \Bigg )^{C_j^2} \Bigg \} \\&\quad + 2(-i)^{c_1^2 - \chi ({\mathcal {O}}_S)} \Bigg (\frac{\phi (-ix,y)}{2}\Bigg )^{\chi ({\mathcal {O}}_S)} \\&\quad \times \prod _{j=1}^{m} \Bigg \{ \Bigg ( \frac{2\overline{\eta }(x^4)^2}{\theta _3(-i x,y^{\frac{1}{2}})} \Bigg )^{C_j^2} + (-1)^{c_1 C_j+h^0(N_{C_j/S)}} \Bigg ( \frac{2\overline{\eta }(x^4)^2}{\theta _3(i x,y^{\frac{1}{2}})} \Bigg )^{C_j^2} \Bigg \} \end{aligned}$$

where \(i = \sqrt{-1}\) and \(N_{C_j/S}\) denotes the normal bundle of \(C_j \subset S\).

Specializing to \(y=1\) and using \(\theta _3(i^k x)=\theta _3(x^4)+i^k\theta _2(x^4)\) gives a more explicit version of [VW, Eqn. (5.45)].Footnote 11 Before we prove this proposition, we need three lemmas about disconnected curves and their Seiberg–Witten invariants.

Lemma 5.12

Let CD be irreducible reduced mutually disjoint curves on a smooth projective surface S with \(b_1(S)=0\). Then precisely one of the following is true:

  • C or D is rigid, i.e. |C| or |D| is 0-dimensional.

  • \(|C| = |D| \cong {\mathbb {P}}^1\) is a pencil.

Proof

Suppose neither of CD is rigid. Then their linear systems sweep out S. Therefore |C| contains an element F which intersects D. Note that F is connected because C is irreducible reduced. The intersection cannot be only in dimension 0, because \(CD=0\). Therefore \(F = D + \sum _{i \in I} n_i F_i\), where I is a finite index set, \(n_i>0\), and \(D, \{F_i\}_{i \in I}\) are all mutually distinct prime divisors. Suppose \(|I|>0\). Then

$$\begin{aligned} 0 = CD = FD = D^2 + \sum _{i \in I} n_i F_i D > D^2. \end{aligned}$$

Hence \(D^2<0\), so \(H^0(N_{D/S}) = 0\) contradicting the assumption that D is not rigid. Therefore \(I = \varnothing \) and \(D \in |C|\). Furthermore, |C| is base-point free and \(C^{\prime } C^{\prime \prime } = C D = 0\) for all \(C^{\prime }, C^{\prime \prime } \in |L|\) so \(|L| \cong {\mathbb {P}}^1\). \(\quad \square \)

Suppose \(C_1, \ldots , C_m\) are irreducible reduced mutually disconnected curves on a smooth projective surface S with \(b_1(S) = 0\) and let \(M:=\{1, \ldots , m\}\). Then for any \(I = \{i_1, \ldots , i_k\} \subset M\), we define

$$\begin{aligned} C_I := \sum _{i \in I} C_i. \end{aligned}$$

For \(I, J \subset M\) we write \(I \sim J\) whenever \(C_I \sim _{\text {lin}} C_J\). This defines an equivalence relation. We denote the equivalence class corresponding to I by [I] and denote its number of elements by |[I]|.

Lemma 5.13

For any \(I \subset M\), we have \(|[I]| = \left( {\begin{array}{c}\dim |C_M|\\ \dim |C_I|\end{array}}\right) \).

Proof

Since \(b_1(S) = 0\), any effective divisor \(D \subset S\) satisfies \(\dim |D| = h^0(N_{D/S})\). In particular we have

$$\begin{aligned} \dim |C_I| = h^0(N_{C_I/S}) = \sum _{i \in I} h^0(N_{C_i/S}) = \sum _{i \in I} \dim |C_i|. \end{aligned}$$
(30)

Suppose, possibly after relabeling, that \(C_1, \ldots , C_a\) are the rigid curves (i.e. their linear systems are 0-dimensional). Then \(C_{a+1}, \ldots , C_{m}\) are all linearly equivalent (Lemma 5.12). Moreover, if \(m>a+1\), then their linear systems are pencils (Lemma 5.12). There are three cases:

Case 1::

\(m=a\). Then all curves \(C_i\) are rigid, so \(I \sim J\) if and only if \(I=J\) and the statement follows from (30).

Case 2::

\(m=a+1\). Then only \(C_m\) is not rigid and it is again easy to see that \(I \sim J\) if and only if \(I=J\). By (30), we have that \(\dim |C_I| = \dim |C_m|\) if \(m \in I\) and zero otherwise. The statement follows.

Case 3::

\(m>a+1\). Then \(C_1, \ldots , C_a\) are rigid and \(|C_{a+1}| = \cdots = |C_m| \cong {\mathbb {P}}^1\). Let \(A:=\{1, \ldots , a\}\) and \(B:=\{a+1, \ldots , m\}\). Then \(I \sim J\) if and only if \(A \cap I = A \cap J\) and \(|B \cap I| = |B \cap J|\). Therefore \(|[I]| = \left( {\begin{array}{c}|B|\\ |B \cap I|\end{array}}\right) \). The result follows from Eq. (30) as follows

$$\begin{aligned} \dim |C_I| = \sum _{i \in A \cap I} \dim |C_i| + \sum _{i \in B \cap I} \dim |C_i| = |B \cap I|. \end{aligned}$$

\(\square \)

Lemma 5.14

Let S be a smooth projective surface with \(b_1(S) = 0\), \(p_g(S)>0\), and suppose \(C_1+ \cdots +C_m \in |K_S|\), where \(C_1, \ldots , C_m\) are mutually disjoint irreducible reduced curves. Then the Seiberg–Witten basic classes of S are \(\{C_I\}_{I \subset M}\) and

$$\begin{aligned} \text {SW}(C_I) = |[I]| \prod _{i \in I} (-1)^{h^0(N_{C_i/S})}. \end{aligned}$$

Proof

The proof combines Lemma 5.13 and the proof of [Moc, Prop. 6.3.1]. We first note that all Seiberg–Witten basic classes must be of the form \(\{C_I\}_{I \subset M}\) (this can be seen most easily from the cosection localization of Chang-Kiem [CK, Lem. 3.2]). Let \(\varnothing \ne I \subset M\). Then Mochizuki shows that

$$\begin{aligned} \text {SW}(C_I) = c_{\text {top}}({\text {Ob}}) \cap |C_I|, \end{aligned}$$

where \({\text {Ob}}\) is a rank \(h^1(N_{C_I/S})\) vector bundle with total Chern class

$$\begin{aligned} (1+h)^{h^1(N_{C_I/S}) - p_g(S)} \end{aligned}$$

where h denotes the hyperplane class on \(|C_I|\). Hence

$$\begin{aligned} \text {SW}(C_I) = \left( {\begin{array}{c}h^1(N_{C_I/S})-p_g(S)\\ h^1(N_{C_I/S})\end{array}}\right) = (-1)^{h^1(N_{C_I/S})} \left( {\begin{array}{c}p_g(S)-1\\ h^1(N_{C_I/S})\end{array}}\right) . \end{aligned}$$

By Serre duality and adjunction \(K_{C_i} = (C_M+C_i)|_{C_i} = 2C_i|_{C_i}\)

$$\begin{aligned} h^1(N_{C_I/S}) = \sum _{i \in I} h^1({\mathcal {O}}_{C_i}(C_i)) = \sum _{i \in I} h^0({\mathcal {O}}_{C_i}(C_i)) = h^0(N_{C_I/S}). \end{aligned}$$

Therefore Lemma 5.13 implies

$$\begin{aligned} \left( {\begin{array}{c}p_g(S)-1\\ h^1(N_{C_I/S})\end{array}}\right) = \left( {\begin{array}{c}\dim |C_M|\\ \dim |C_I|\end{array}}\right) = |[I]|. \end{aligned}$$

\(\square \)

Proof of Proposition 5.11

Combining Conjecture 5.7 and Lemma 5.14 gives

$$\begin{aligned} \begin{aligned} \psi _{S,c_1}(x,y)&= 4 \Bigg (\frac{\phi (x,y)}{2}\Bigg )^{\chi ({\mathcal {O}}_S)}\Bigg (\frac{2 \overline{\eta }(x^4)^2}{\theta _3(x,y^{\frac{1}{2}})}\Bigg )^{K_{S}^2}\\ {}&\quad \times \sum _{[I]} |[I]| (-1)^{h^0(N_{C_I/S})+c_1C_I} \Bigg (\frac{\theta _3(x,y^{\frac{1}{2}})}{\theta _3(-x,y^{\frac{1}{2}})}\Bigg )^{C_I^2} \\&= 4 \Bigg (\frac{\phi (x,y)}{2}\Bigg )^{\chi ({\mathcal {O}}_S)}\Bigg (\frac{2 \overline{\eta }(x^4)^2}{\theta _3(x,y^{\frac{1}{2}})}\Bigg )^{K_{S}^2}\\&\quad \times \sum _{I} (-1)^{h^0(N_{C_I/S})+c_1C_I} \Bigg (\frac{\theta _3(x,y^{\frac{1}{2}})}{\theta _3(-x,y^{\frac{1}{2}})}\Bigg )^{C_I^2} \\&= 4 \Bigg (\frac{\phi (x,y)}{2}\Bigg )^{\chi ({\mathcal {O}}_S)}\Bigg (\frac{2 \overline{\eta }(x^4)^2}{\theta _3(x,y^{\frac{1}{2}})}\Bigg )^{K_{S}^2}\\ {}&\quad \times \prod _{i=1}^{m}\Bigg (1+(-1)^{h^0(N_{C_i/S})+C_i c_1} \Bigg (\frac{\theta _3(x,y^{\frac{1}{2}})}{\theta _3(-x,y^{\frac{1}{2}})}\Bigg )^{C_i^2} \Bigg ). \end{aligned} \end{aligned}$$

The formula for \({\textsf {Z}}_{S,c_1}(x,y)\) follows from (28) after re-organizing the terms. \(\quad \square \)

6 Verification of the Conjectures in Examples

In this section we check Conjectures 1.15.1, and 5.7 in many cases. We recall that we calculated \(A_1(s,y,q), \ldots , A_7(s,y,q)\) up to order \(q^7\), where we calculated the coefficient of \(q^i\) up to order \(s^{29-3i}\). We computed \(A_1(s,1,q), \ldots , A_7(s,1,q)\) up to order \(q^{30}\) and any order in s (see Sect. 4). The latter are listed up to order \(q^4\) in Appendix 6.7.

6.1 K3 surfaces

Let S be a K3 surface. The canonical class is trivial and \(b_1(S) = 0\) so we are in the setting of Conjecture 5.7, which states

$$\begin{aligned} {\textsf {Z}}_{S,c_1}(x,y)=\frac{1}{2}\big (\phi (x,y)^2-i^{c_1^2} \phi (ix,y)^2\big ). \end{aligned}$$
(31)

This can be restated as saying that \({\overline{\chi }}_{-y}^\text {vir}(M^H_S(2,c_1,c_2))\) is the coefficient of \(4c_2-c_1^2-6\) of

$$\begin{aligned} \phi (x,y)^2= \prod _{n=1}^{\infty } \frac{1}{(1-x^{2n})^{20}(1-x^{2n} y)^2(1-x^{2n} y^{-1})^2}, \end{aligned}$$

when there are no rank 2 strictly H-semistable sheaves with Chern classes \(c_1,c_2\). This looks very similar to Conjecture 1.1, which does not apply, and which would be off by a factor 2.

In the absence of strictly Gieseker H-semistable sheaves, the moduli space \(M_{S}^{H}(2,c_1,c_2)\) is smooth of expected dimension. If in addition \(c_1\) is primitive, big, and nef, then in [GH] it is shown that

$$\begin{aligned} \chi _{-y}(M^S_H(2,c_1,c_2))=\chi _{-y}(S^{[2c_2 - c_1^2/2 - 3]}). \end{aligned}$$

More generally, in [Huy, Cor. 4.8], when \(c_1\) is primitive, and in [Yos4, Thm. 0.1] in general, it is shown that in the absence of strictly Gieseker H-semistable sheaves \(M_{S}^{H}(r,c_1,c_2)\) is deformation equivalent to the Hilbert scheme \(S^{[{\text {vd}}/2]}\). In particular they have the same Hodge numbers. In turn, \(\chi _y\)-genera of Hilbert schemes of points on K3 surfaces were computed by the first named author and W. Soergel in [GS]. Combining this formula with the result above implies that (31) holds for K3 surfaces.

The only Seiberg–Witten basic class of S is 0 with corresponding Seiberg–Witten invariant 1. Suppose \(c_1\) is a choice of first Chern class such that \(c_1^2 = c\), then \(c \in 2 {\mathbb {Z}}\). Then we see that (31) also follows from Conjecture 5.1. We put \((\gamma _1,\gamma _2)=(0,0)\) and \((\beta _1,\beta _2,\beta _3,\beta _4)=(c,0,0,2)\). Then the two summands in \(\text {Coeff}_{s^0x^n}[\cdot ]\) in Conjecture 5.1 are equal, and each equals the contribution of the Seiberg–Witten basic class \(0=K_S\). Therefore, if \(c_2\) satisfies the assumptions of Conjecture 5.1, it gives that \({\overline{\chi }}_{-y}^\text {vir}(M^H_S(2,c_1,c_2))\) is the coefficient of \(x^{4c_2-c_1^2-6}\) of

$$\begin{aligned} \frac{1}{2}\psi _{0,0,0,2}(x,y)=\phi (x,y)^2. \end{aligned}$$

The same argument as in the proof of Proposition 5.3 shows that (31) follows from Conjecture 5.1 and the strong form of Mochizuki’s formula.

All the same, we want to calculate terms of (31) directly by applying Corollary 3.4 and our explicit knowledge of the universal functions \(A_i\) as described in Sect. 4. We use the conjectured strong form of Mochizuki’s formula (Remark 3.5), so our calculations can also be viewed as an additional test of Remark 3.5. The easiest way to satisfy all assumptions of Corollary 3.4 is by choosing \(c_1\) and H such that \(c_1 H > 0\) is odd (though this is unnecessarily strong). According to Theorem 2.3 (i), we can calculate modulo

$$\begin{aligned} \mod x^{c + 10}. \end{aligned}$$

In addition we have a bound given by the accuracy to which we calculated the universal functions \(A_i(s,y,q)\) and \(A_i(s,1,q)\). For \(A_i(s,1,q)\) this bound is \(q^{30}\), which for \({\textsf {Z}}_{S,c_1}(x,1)\) means we can calculate modulo

$$\begin{aligned} \mod x^{-c+118}. \end{aligned}$$

For \(y=1\), i.e. the case of virtual Euler characteristics, we tested Conjecture 5.7 for all even \(c \in \{-6, \ldots , 116\}\) up to the above accuracies. In conclusion, for K3 surfaces and \(y=1\), we verified Conjecture 5.7 by direct calculation for:

  • \(c_1\) such that \(c_1^2 = c \in \{-6, \ldots , 116\}\) is even, modulo \(x^{c+10}\) (bound from Corollary 3.4 (i)) and modulo \(x^{-c+118}\) (bound from \(A_i\)). Here “and” means that both bounds apply, in other words “modulo \(x^{\min (c+10,-c+118)}\).

For virtual \(\chi _y\)-genus, we tested Conjecture 5.7 for:

  • \(c_1\) such that \(c_1^2 = c \in \{ -6, \ldots , 32\}\) is even, modulo \(x^{c+10}\) (bound from Corollary 3.4 (i)) and modulo \(x^{-c+26}\) (bound from \(A_i\)).

6.2 Elliptic surfaces

Let \(S \rightarrow {\mathbb {P}}^1\) be a non-trivial elliptic surface with section, \(12n>0\) rational nodal fibres, and no other singular fibres, i.e. an elliptic surface of type E(n). The canonical class is given by \(K_S = (n-2) F\), where F denotes the class of the fibre. Note that \(\chi ({\mathcal {O}}_S) = n\). Moreover, choose a section \(B \subset S\), then its class satisfies \(B^2 = -n\).

We assume \(n \ge 2\), then E(n) has a smooth canonical divisor which has \(m=n-2\) connected components \(F_j\); each a smooth elliptic fibre of S. The surface E(3) can be used to test Conjecture 1.1 and the surfaces E(n) with \(n \ge 2\) can be used to test Conjecture 5.7. Note that E(2) is K3, which was discussed in Sect. 6.1, so we take \(n \ge 3\).

Let \(c_1 = \epsilon B + d F\), for \(\epsilon , d \in {\mathbb {Z}}\) and B the class of the section. Since \(F_j^2=0\) and \(c_1 F_j=\epsilon \), Conjecture 5.7 gives

$$\begin{aligned} {\textsf {Z}}_{S,c_1}(x,y)={\left\{ \begin{array}{ll}\frac{1}{2}\big (\phi (x,y)^n+(-i)^{c_1^2 - n}\phi (-ix,y)^n\big ), &{} \epsilon \hbox { odd} \\ 0, &{} \epsilon \hbox { even.}\end{array}\right. } \end{aligned}$$
(32)

In the case \(n=3\), this can be restated as saying that \({\overline{\chi }}_{-y}^\text {vir}(M^H_S(2,c_1,c_2))\) is the coefficient of \(4c_2-c_1^2-9\) of \( \phi (x,y)^3\), which is the statement of Conjecture 1.1.

In [Yos3], Yoshioka fixes \(c_1, c_2\) such that \(c_1 F\) is odd and an ample divisor \(H = B + \beta F\) with \(\beta \gg c_2\). By choosing \(\beta \) of the right parity \(c_1H\) is odd, so there are no rank 2 strictly \(\mu \)-semistable sheaves with Chern classes \(c_1,c_2\). Then \(M:=M_{S}^{H}(2,c_1,c_2)\) is smooth irreducible of expected dimension and independent of H. In this setting, the Betti numbers and Euler characteristics of \(M:=M_{S}^{H}(2,c_1,c_2)\) were calculated in [Yos3] and are indeed given by the specialization \(y=1\) of (32).

Next we want to calculate terms of (32) directly by applying Corollary 3.4 and our explicit knowledge of the universal functions \(A_i\) as described in Sect. 4. There are numerous choices for the polarization H for which the conditions of Corollary 3.4 are satisfied. Specifically, let \(H = \alpha B + \beta F\) with \(\alpha >0\) and \(\beta > \alpha n\) be a polarization. When

$$\begin{aligned} c_1 H&= (\beta - \alpha n) \epsilon + \alpha d > 2\alpha (n-2) = 2 K_S H, \\ c_1 H&= (\beta - \alpha n) \epsilon + \alpha d \qquad \text {odd}, \end{aligned}$$

all conditions are satisfied. For fixed \(\epsilon > 0\) and \(d \in {\mathbb {Z}}\) not both even, there are many solutions \(\alpha >0\) and \(\beta > \alpha n\) to these equations. By [FM], the Seiberg–Witten basic classes are \(0, F \ldots , (n-2)F\) and

$$\begin{aligned} \text {SW}(pF) = (-1)^p \left( {\begin{array}{c}n-2\\ p\end{array}}\right) . \end{aligned}$$

For \(y=1\), we verified Conjecture 5.7 in the following cases:

  • E(n) such that \(n = 3, \ldots , 8\), \(c_1=\epsilon B + d F\) such that \(\epsilon = 1,2\) and \(d = 0,\ldots , 8\) not both even, modulo \(x^{- \epsilon ^2 n+2\epsilon d - 2 \epsilon n + 4 \epsilon + 5n}\) (bound from Corollary 3.4 (i)) and modulo \(x^{n \epsilon ^2 - 2 \epsilon d - 3n + 4\min \{0, \epsilon (n-2) \} + 124}\) (bound from \(A_i\)).

For virtual \(\chi _y\)-genus, we tested Conjecture 5.7 for:

  • E(n) such that \(n = 3,\ldots ,6\), \(c_1=\epsilon B + d F\) such that \(\epsilon = 1,2\) and \(d = 4 \ldots ,8\) not both even, modulo \(x^{- \epsilon ^2 n + 2\epsilon d - 2 \epsilon n + 4 \epsilon + 5n}\) (bound from Corollary 3.4 (i)) and modulo \(x^{n \epsilon ^2 - 2 \epsilon d - 3n + 4\min \{0, \epsilon (n-2) \} + 32}\) (bound from \(A_i\)).

6.3 Blow-ups

Let \(\pi : {\widetilde{S}} \rightarrow S\) be the blow-up of a K3 surface S in a point and denote the exceptional divisor by E. Then \(K_{{\widetilde{S}}} = E\) is a smooth canonical divisor. We want to gather evidence for Conjecture 1.1 by applying Corollary 3.4 and our explicit knowledge of the universal functions \(A_i\) as described in Sect. 4. Let \({\widetilde{c}}_1 = \pi ^* c_1 + \epsilon E\) and \(c_1^2=c \in 2 {\mathbb {Z}}\). The easiest way to satisfy the conditions of Corollary 3.4 is to take a polarization H on S such that \(c_1 H > 0\) is odd and taking \({\widetilde{H}} = r H - E\) with \(r \gg 0\) and \(r + \epsilon \) odd, but more general choices are possible. The blow-up formula for Seiberg–Witten invariants implies that \({\widetilde{S}}\) has Seiberg–Witten basic classes 0, E and \(\text {SW}(0)=\text {SW}(E) = 1\) [Mor, Thm. 7.4.6]. For \(y=1\), we verified Conjecture 1.1 in the following cases:

  • \({\widetilde{c}}_1 = \pi ^* c_1 + \epsilon E\) with \(c_1^2=c\), \(\epsilon = -3,\ldots ,3\), \(c = -10,\ldots ,80\) even, modulo \(x^{c - \epsilon ^2 + 3 \epsilon + 10}\) (bound from Corollary 3.4 (i)) and modulo \(x^{-c + \epsilon ^2 + 4\min \{0,-(\epsilon -1)\}+118}\) (bound from \(A_i\)).

For virtual \(\chi _y\)-genus, we tested Conjecture 1.1 for:

  • \({\widetilde{c}}_1 = \pi ^* c_1 + \epsilon E\) with \(c_1^2=c\), \(\epsilon = 0,\ldots ,3\), \(c = -10,\ldots ,26\) even, modulo \(x^{c - \epsilon ^2 + 3 \epsilon + 10}\) (bound from Corollary 3.4 (i)) and modulo \(x^{-c + \epsilon ^2 + 4\min \{0,-(\epsilon -1)\}+26}\) (bound from \(A_i\)).

Next we consider the blow-up of a K3 surface S in two distinct points

$$\begin{aligned} \pi : {\widetilde{S}} \rightarrow S \end{aligned}$$

and we denote the exceptional divisors by \(E_1, E_2\). We gather evidence for Conjecture 5.7 by applying Corollary 3.4 and our explicit knowledge of the universal functions \(A_i\) as described in Sect. 4. By the blow-up formula for Seiberg–Witten invariants, the Seiberg–Witten basic classes are

$$\begin{aligned} 0, \, E_1, \, E_2, \, E_1+E_2 \end{aligned}$$

and the invariant is 1 in each case. We consider classes

$$\begin{aligned} {\widetilde{c}}_1 = \pi ^* c_1 + \epsilon _1 E_1 + \epsilon _2 E_2. \end{aligned}$$

As before the easiest way to satisfy the conditions of Corollary 3.4 is to take a polarization H on S such that \(c_1 H > 0\) is odd and taking \({\widetilde{H}} = r H - E_1 - E_2\) with \(r \gg 0\) and parity such that \(r + \epsilon _1 + \epsilon _2\) is odd, though other choices are possible. For \(y=1\), we verified Conjecture 5.7 in the following cases:

  • \({\widetilde{c}}_1 = \pi ^* c_1 + \epsilon _1 E_1 + \epsilon _2 E_2\) with \(c_1^2=c\), \(\epsilon _1, \epsilon _2 = -2,\ldots ,2\), \(c =-10,\ldots ,126\) even, modulo \(x^{c - \epsilon _1^2 - \epsilon _2^2 + 2\epsilon _1 + 2\epsilon _2 + 10}\) (bound coming from Corollary 3.4 (i)) and modulo \(x^{-c+ \epsilon _1^2+\epsilon _2^2 +4\min \{0,-(\epsilon _1-1),-(\epsilon _2-1),-(\epsilon _1+\epsilon _2-2)\} + 118}\) (bound from \(A_i\)).

For virtual \(\chi _y\)-genus, we tested Conjecture 5.7 for:

  • \({\widetilde{c}}_1 = \pi ^* c_1 + \epsilon _1 E_1 + \epsilon _2 E_2\) with \(c_1^2=c\), \(\epsilon _1, \epsilon _2 = 0,1,2\), \(c =-10,\ldots ,30\) even, modulo \(x^{c - \epsilon _1^2 - \epsilon _2^2 + 2\epsilon _1 + 2\epsilon _2 + 10}\) (bound from Corollary 3.4 (i)) and in addition modulo \(x^{-c+ \epsilon _1^2+\epsilon _2^2 +4 \min \{0,-(\epsilon _1-1),-(\epsilon _2-1),-(\epsilon _1+\epsilon _2-2)\} + 26}\) (bound from \(A_i\)).

6.4 Double covers

Next, we consider double covers

$$\begin{aligned} \pi : S_d \rightarrow {\mathbb {P}}^2 \end{aligned}$$

branched over a smooth curve C of degree 2d. Then \(K_{S_d} = (d-3) L\), where L is the pull-back of the class of a line on \({\mathbb {P}}^2\). These surfaces satisfy \(b_1(S_d) = 0\). It is easy to calculate

$$\begin{aligned} K_{S_d}^2&= 2(d-3)^2, \\ \chi ({\mathcal {O}}_{S_d})&=\frac{1}{2}d(d-3)+2. \end{aligned}$$

Since \(\alpha L\) is base-point free for all \(\alpha > 0\), the canonical linear system \(|K_{S_d}|\) contains smooth connected canonical divisors when \(d \ge 4\), which we assume from now on. Let \(c_1 = \epsilon L\). As before, we want to gather evidence for Conjecture 1.1 by applying Corollary 3.4 and our explicit knowledge of the universal functions \(A_i\) as described in Sect. 4. The Seiberg–Witten basic classes are \(0, K_S \ne 0\) with Seiberg–Witten invariants

$$\begin{aligned} 1, \, (-1)^{\chi ({\mathcal {O}}_{S_d})}. \end{aligned}$$

We first take \(H=L\) as the polarization on S. Then conditions (ii), (iii) of Corollary 3.4 require

$$\begin{aligned} c_1 H = 2 \epsilon > 4(d-3) = 2K_S H, \end{aligned}$$

i.e. \(\epsilon > 2(d-3)\). In the case \(\epsilon =2k\) is even, we can ensure the absence of rank 2 strictly Gieseker H-semistable sheaves with Chern classes \(c_1, c_2\) by choosing \(c_2\) such that

$$\begin{aligned} \frac{1}{2} c_1(c_1 - K_S) - c_2 = \epsilon (\epsilon - (d-3)) - c_2 \end{aligned}$$

is odd [HL, Rem. 4.6.8].

Now assume that \(\epsilon =2k+1\) is odd. If L generates the Picard group of \(S_d\), then there are no rank 2 strictly \(\mu \)-semistable sheaves with Chern classes \(\epsilon L\) and \(c_2\). In general the Picard group of \(S_d\) can have more generators, but L is still ample and primitive. In this case we take the polarization H general and sufficiently close to L (i.e. of the form \(nL+B\) for n sufficiently large), so that conditions (ii) and (iii) of Corollary 3.4 still hold when \(\epsilon >2(d-3)\), and so that there are no rank 2 strictly \(\mu \)-semistable sheaves with Chern classes \(\epsilon L\) and \(c_2\).

Recall that conjecturally conditions (ii), (iii) of Corollary 3.4 are not necessary (see Remark 3.5). Therefore we will also test Conjecture 1.1 for \(\epsilon , d\) which do not satisfy \(\epsilon > 2(d-3)\).

Note that \(\mu \)-stability is invariant under tensorizing by a line bundle and that Gieseker stability with respect to H is invariant under tensorizing by a power of H. Therefore we know that \(M^H_{S_d}(2,L,c_2)\) together with its virtual structure is isomorphic to \(M^H_{S_d}(2,(2k+1)L,c_2+2(k^2+k))\), and when \(c_2\) is odd, the same holds for \(M^L_{S_d}(2,0,c_2)\) and \(M^L_{S_d}(2,2kL,c_2+2k^2)\).

For \(y=1\), we verified Conjecture 1.1 in the following cases:

  • \(S_d\) such that \(d=4,\ldots ,7\), \(c_1 = \epsilon L\) with \(\epsilon = -3, \ldots , 6\), modulo \(x^{\frac{5}{2}d(d-3) - 4\epsilon (d-3) + 2\epsilon ^2 +10}\) (bound from Corollary 3.4 (i)) and modulo \(x^{-\frac{3}{2}d(d-3)-2 \epsilon ^2 + 8\min \{0,(d-3)(\epsilon -(d-3))\} +118}\) (bound from \(A_i\)).

For virtual \(\chi _y\)-genus, we tested Conjecture 1.1 for:

  • \(S_d\) such that \(d=4,\ldots ,7\), \(c_1 = \epsilon L\) with \(\epsilon = -3, \ldots , 6\), modulo \(x^{\frac{5}{2}d(d-3) - 4\epsilon (d-3) + 2\epsilon ^2 +10}\) (bound from Corollary 3.4 (i)) and modulo \(x^{-\frac{3}{2}d(d-3)-2 \epsilon ^2 + 8\min \{0,(d-3)(\epsilon -(d-3))\} +26}\) (bound from \(A_i\)).

In order to give an idea of the complexity of the numbers involved, we give some examples. First we compute the virtual Euler numbers. We only show cases where \(\epsilon > 2(d-3)\), so that we do not need to use the strong form of Mochizuki’s formula, and thus we have unconditionally proved these formulas.

The formula of Corollary 1.3 is proved for \((d=4,\epsilon =5)\), modulo \(x^{50}\), for \((d=4,\epsilon =6)\) modulo \(x^{40}\), for\((d=5,\epsilon =5)\) modulo \(x^{45}\) for \((d=5,\epsilon =6)\) modulo \(x^{31}\). When \(\epsilon \) is even, let \({\textsf {Z}}^\text {odd}_{S_d,H,0}(x,1)\) be the part of the generating function corresponding to \(c_2\) odd, so the moduli spaces only consist of Gieseker stable sheaves. Using the invariance under tensoring by H or L respectively we have

Now we give some examples of the virtual \(\chi _{y}\)-genus. In this case we need to use the strong form of Mochizuki’s formula. For \(d=4\), \(c_1=0,L\) we get the virtual refinements

where the missing terms are determined by the symmetry of the virtual \(\chi _y\)-genus.

Next let \({\mathbb {F}}_a = {\mathbb {P}}({\mathcal {O}}_{{\mathbb {P}}^1} \oplus {\mathcal {O}}_{{\mathbb {P}}^1}(a))\) denote the ath Hirzebruch surface, where \(a \ge 0\). Suppose B is the section corresponding to the surjection \({\mathcal {O}}_{{\mathbb {P}}^1} \oplus {\mathcal {O}}_{{\mathbb {P}}^1}(a) \twoheadrightarrow {\mathcal {O}}_{{\mathbb {P}}^1}\) onto the first factor. Then \(B^2 = -a\). We denote the class of the fibre of \({\mathbb {F}}_a \rightarrow {\mathbb {P}}^1\) by F. Let \(d_1>0\) and \(d_2>ad_1\). A rich source of examples are double covers

$$\begin{aligned} \pi : S_{a,d_1,d_2} \rightarrow {\mathbb {F}}_a \end{aligned}$$

branched over a smooth connected curve in \(|{\mathcal {O}}(2d_1 B + 2d_2 F)|\). Let \({\widetilde{B}} := \pi ^* B\) and \({\widetilde{F}} = \pi ^* F\). We choose \(d_1 > 2\) and \(d_2 > a(d_1-1)+2\), so the canonical divisor

$$\begin{aligned} K_{S_{a,d_1,d_2}}=(d_1-2){\widetilde{B}}+ (d_2-(a + 2)){\widetilde{F}} \end{aligned}$$

is the pull-back of a very ample divisor and its linear system contains smooth connected curves. These surfaces satisfy \(b_1(S_{a,d_1,d_2}) = 0\). We have

$$\begin{aligned} K_{S_{a,d_1,d_2}}^2&=4(d_1-2)\Big (d_2-\frac{d_1a}{2}-2\Big ), \\ \chi ({\mathcal {O}}_{S_{a,d_1,d_2}})&=(d_1-1)\Big (d_2-\frac{d_1a}{2}-1\Big )+1. \end{aligned}$$

We first note that it is enough to determine \({\textsf {Z}}_{S_{a,d_1,d_2},\epsilon _1{\widetilde{B}}+\epsilon _2{\widetilde{F}}}(x,y)\) for \(a=0,1\).

Proposition 6.1

Assume the strong form of Mochizuki’s formula holds (Remark 3.5). Let \(a \ge 0\), \(d_1 > 2\), \(d_2 > (a+1)(d_1-1) + 1\), and \(d_2 > (a+1)d_1\). Let H be a polarization on \(S_{a,d_1,d_2}\) such that there exist no rank 2 strictly Gieseker H-semistable sheaves with Chern classes \(c_1 := \epsilon _1 {\widetilde{B}} + \epsilon _2 {\widetilde{F}}\) and \(c_2\). Let \(H'\) be a polarization on \(S_{a+2,d_1,d_1+d_2}\) such that there exist no rank 2 strictly Gieseker \(H'\)-semistable sheaves with Chern classes \(c_1' := \epsilon _1 {\widetilde{B}} + (\epsilon _1+\epsilon _2) {\widetilde{F}}\) and \(c_2\). Then

$$\begin{aligned} \chi _{-y}^{\text {vir}}(M_{S_{a,d_1,d_2}}^{H}(2,\epsilon _1 {\widetilde{B}} + \epsilon _2 {\widetilde{F}},c_2)) = \chi _{-y}^{\text {vir}}(M_{S_{a+2,d_1,d_1+d_2}}^{H'}(2,\epsilon _1 {\widetilde{B}} + (\epsilon _1+\epsilon _2) {\widetilde{F}},c_2)), \end{aligned}$$

for all

$$\begin{aligned} c_2&< 2\chi ({\mathcal {O}}_{S_{a,d_1,d_2}}) + \frac{1}{2} c_1(c_1-K_{S_{a,d_1,d_2}}) \\&= 2\chi ({\mathcal {O}}_{S_{a+2,d_1,d_1+d_2}}) + \frac{1}{2} c_1^{\prime }(c_1^\prime - K_{S_{a+2,d_1,d_1+d_2}}). \end{aligned}$$

Proof

Write \(S:=S_{a,d_1,d_2}\) and \(S':=S_{a+2,d_1,d_1+d_2}\). Fix \(c_1:=\epsilon _1{\widetilde{B}}+\epsilon _2{\widetilde{F}}\) on S and \(c_1':=\epsilon _1{\widetilde{B}}+(\epsilon _1+\epsilon _2){\widetilde{F}}\) on \(S'\). The map

$$\begin{aligned} \phi : H^2(S,{\mathbb {Z}})\rightarrow H^2(S',{\mathbb {Z}}), \ {\widetilde{B}}\mapsto {\widetilde{B}}+{\widetilde{F}}, \ {\widetilde{F}}\mapsto {\widetilde{F}} \end{aligned}$$

is an isomorphism between the subgroups generated by \({\widetilde{B}}, {\widetilde{F}}\) on S, \(S'\) and it preserves the intersection forms on these subgroups. It sends \(K_{S}\) to \(K_{S'}\), and thus it sends the Seiberg–Witten basic classes 0, \(K_ {S}\) of S to the corresponding Seiberg–Witten basic classes of \(S'\). We also see that \(c_1\) is send to \(c_1'\). Moreover \(\chi ({\mathcal {O}}_{S})=\chi ({\mathcal {O}}_{S'})\). We apply the strong form of Mochizuki’s formula to S with \(H,c_1,c_2\) and \(S'\) with \(H',c_1',c_2\). Since

$$\begin{aligned} (a_1^2, a_1 c_1, c_1^2, a_1 K_S, c_1 K_S, K_S^2,\chi ({\mathcal {O}}_S)) = (a_1'^2, a_1' c_1', c_1^{ \prime 2}, a_1' K_{S'}, c_1' K_{S'}, K_{S'}^2,\chi ({\mathcal {O}}_{S'})) \end{aligned}$$

for \((a_1,a_1') = (0,0)\) and \((a_1,a_1') = (K_S,K_{S'})\), the assertion follows from Corollary 3.4 and Remark 3.5. \(\quad \square \)

For \(y=1\), we checked Conjecture 1.1 in the following cases:

  • \(S_{a,d_1,d_2}\) such that \(a=0,\ldots ,3\), \(d_1=3,\ldots ,6\), \(d_2 = a(d_1-1)+3,\ldots ,a(d_1-1)+10\), \(c_1 = \epsilon _1 {\widetilde{B}} + \epsilon _2 {\widetilde{F}}\) with \(\epsilon _1=-2,\ldots ,2\), \(\epsilon _2 = -2,\ldots ,10\), modulo \(x^{M}\) (bound from Corollary 3.4 (i)) and modulo \(x^{N+124}\) (bound from \(A_i\)),

where

$$\begin{aligned} \begin{aligned}&M := -\frac{5}{2} a d_1^2+\frac{5}{2} a d_1+ 5 d_1 d_2-5d_1-5d_2 +10+4 a \epsilon _1 d_1 - 4 a \epsilon _1-4 \epsilon _1 d_2\\&\quad +8 \epsilon _1-4 \epsilon _2 d_1 +8 \epsilon _2-2 a \epsilon _1^2+4 \epsilon _1 \epsilon _2, \\&N := 2a \epsilon _1^2 - 4 \epsilon _1 \epsilon _2 -3 (d_1-1)(d_2-\frac{d_1 a}{2} - 1) - 3 \\&\quad + 8\min \Big \{0,a(d_1-2)(\epsilon _1 - (d_1-2)) + (d_1-2)(\epsilon _2-(d_2-(a+2))) \\&\quad + (d_2 - (a+2))(\epsilon _1 - (d_1-2)) \Big \}. \end{aligned} \end{aligned}$$

For virtual \(\chi _{y}\)-genus, we checked Conjecture 1.1 in the following cases:

  • \(S_{a,d_1,d_2}\) such that \(a=0,\ldots ,3\), \(d_1=3,4,5\), \(d_2 = a(d_1-1)+3,\ldots , a(d_1-1)+5\), \(c_1 = \epsilon _1 {\widetilde{B}} + \epsilon _2 {\widetilde{F}}\) with \(\epsilon _1=-2,\ldots ,2\), \(\epsilon _2 = -2,\ldots , 6\), modulo \(x^{M}\) (bound from Corollary 3.4 (i)) and modulo \(x^{N+32}\) (bound from \(A_i\)).

Some examples of these tables are the following:

$$\begin{aligned}&{\textsf {Z}}^\text {odd}_{S_{0,3,3},0}(x,1)=- 105472x^5 - 17699188736x^{13} - 311478544324608x^{21} +O(x^{29}),\\&{\textsf {Z}}_{S_{0,3,4}, {\widetilde{B}}}(x,1)= -36864 x^{3}-123928576 x^{7}-125523673088 x^{11} +O( x^{15} ),\\&{\textsf {Z}}_{S_{1,3,5}, {\widetilde{F}}}(x,1) = 1248 x^{2}+3740160 x^{6}+3011270208 x^{10}\\&\quad +1143966397440 x^{14} +O(x^{18}) ,\\&{\textsf {Z}}_{S_{1,3,5}, {\widetilde{F}}+{{\widetilde{B}}}}(x,1) =8 + 85920x^4 + 119219712x^8 + 63288183168x^{12}+O(x^{16}). \end{aligned}$$

Here \(c_1,c_2\) always have the property that there are no rank 2 strictly Gieseker H-semistable sheaves with these Chern classes. The virtual refinement of \({\textsf {Z}}_{S_{1,3,5}, {\widetilde{F}}}(x,1)\) is given by

$$\begin{aligned}&{\textsf {Z}}_{S_{1,3,5}, {\widetilde{F}}}(xy^{\frac{1}{2}},y)=\,(216y^2 + 816y + 216)x^2 + (13720y^6 + 184128y^5\\&\quad + 876264y^4 + 1591936y^3 + 876264y^2 + 184128y + 13720)x^6+O(x^{10}). \end{aligned}$$

6.5 Divisors in products of projective spaces

Let \(S_d\) be a smooth surface of degree d in \({\mathbb {P}}^3\), then

$$\begin{aligned} K_{S_d} = (d-4)L, \end{aligned}$$

where L is the hyperplane section. Moreover \(S_d\) is simply connected by the Lefschetz hyperplane theorem. We take \(d \ge 5\), so \(S_d\) is of general type. One readily calculates

$$\begin{aligned} K_{S_d}^2&= d(d-4)^2, \\ \chi ({\mathcal {O}}_{S_d})&= \frac{1}{6} d (d^2+11) - d^2. \end{aligned}$$

The hyperplane section L on \(S_d\) is very ample so \(|K_S|\) contains smooth connected canonical divisors. We test Conjecture 1.1 using Corollary 3.4.

We take as polarization \(H = L\) and \(c_1 = \epsilon L\). If \(S_d\) is very general, i.e. in the complement of countably many closed subvarieties in the projective space of hypersurfaces of degree d, then \({\text {Pic}}(S_d)={\mathbb {Z}}L\) by the Noether–Lefschetz Theorem. For

$$\begin{aligned} c_1 H&= d \epsilon > 2 d (d-4) = 2 K_{S_d} H, \\ c_1 H&= d \epsilon \quad \hbox {odd, or } S_d \hbox { very general and } \epsilon \ \text { odd} \end{aligned}$$

there are no rank 2 strictly \(\mu \)-semistable sheaves with first Chern class \(c_1\) and conditions (ii), (iii) of Corollary 3.4 are satisfied.

We also perform computations when these conditions are not satisfied (recall Remark 3.5). For \(y=1\), we verified Conjecture 1.1 in the following cases:

  • \(S_d\) such that \(d=4,5,6\), \(c_1 = \epsilon E\) with \(\epsilon = -1,\ldots ,4\), modulo \(x^{\frac{5}{6}d(d^2+11)-5d^2+\epsilon d (\epsilon -2d+8)}\) (bound from Corollary 3.4 (i)) and modulo \(x^{-\epsilon ^2 d - \frac{1}{2}d(d^2+11)+3d^2 + 4 \min \{0,\epsilon d(d-\epsilon -4)\} + 124}\) (bound from \(A_i\)).

For virtual \(\chi _y\)-genus, we tested Conjecture 1.1 for:

  • \(S_d\) such that \(d=4,5,6\), \(c_1 = \epsilon E\) with \(\epsilon = 1,2\), modulo \(x^{\frac{5}{6}d(d^2+11)-5d^2+\epsilon d (\epsilon -2d+8)}\) (bound from Corollary 3.4 (i)) and modulo \(x^{-\epsilon ^2 d - \frac{1}{2}d(d^2+11)+3d^2 + 4 \min \{0,\epsilon d(d-\epsilon -4)\} + 32}\) (bound from \(A_i\)).

We list a few examples. First we deal with the specialization to the virtual Euler number and in this case we only consider the case that \(\epsilon >2(d-4)\), so that we do not use the strong from of Mochizuki’s formula and the formulas are proved unconditionally. For \(d=5\) and \(\epsilon =3,4\) we obtain

$$\begin{aligned} {\textsf {Z}}_{S_5,L,L}(x,1)=\,&8 + 52720x^4 + 48754480x^8 + 17856390560x^{12} + 3626761297400x^{16} \\&+ 482220775619120x^{20}+ 46283858505022160x^{24} +O(x^{28}),\\ {\textsf {Z}}^\text {odd}_{S_5,L,0}(x,1)=\,&- 316096x^5 - 70399202880x^{13} - 1550539821466560x^{21} + O(x^{29}). \end{aligned}$$

For \(d=5\), \(\epsilon =1\) and using the strong form of Mochizuki’s formula, we obtain the refinement

$$\begin{aligned}&{\textsf {Z}}_{S_5,L,L}(xy^{\frac{1}{2}},y)=8 + (1280y^4 + 11440y^3+ 27280y^2+ 11440y+1280)x^4 \\&\quad +(25520y^8+ 448640y^7+ 3228960y^6 + 11405320y^5+18537600y^4+\ldots ) x^8\\&\quad + O(x^{12}). \end{aligned}$$

Next we move on to smooth surfaces \(S_{d_1,d_2}\) of bidegree \((d_1,d_2)\) in \({\mathbb {P}}^2 \times {\mathbb {P}}^1\). Again \(S_{d_1,d_2}\) is simply connected and one readily calculates

$$\begin{aligned} \begin{aligned} K_{S_{d_1,d_2}}^2&= (d_1-3)^2 d_2 + 2d_1(d_1-3)(d_2-2), \\ \chi ({\mathcal {O}}_{S_{d_1,d_2}})&= \frac{1}{2}d_1(d_1d_2-d_1-3d_2+3)+d_2. \end{aligned} \end{aligned}$$

We take \(d_1 \ge 4\) and \(d_2 \ge 3\) so \(K_{S_{d_1,d_2}}\) is very ample and its linear system contains a smooth connected canonical divisor.

Let \(L_i\) be the restriction of \(\pi _i^{*} L\) to S, where \(\pi _1 : {\mathbb {P}}^2 \times {\mathbb {P}}^1 \rightarrow {\mathbb {P}}^2\) and \(\pi _2 : {\mathbb {P}}^2 \times {\mathbb {P}}^1 \rightarrow {\mathbb {P}}^1\) denote projections and L is the hyperplane class on each factor. For \(y=1\), we tested Conjecture 1.1 using Corollary 3.4 and the strong version of Mochizuki’s formula (Remark 3.5) in the following cases:

  • for \(S_{d_1,d_2}\) such that \((d_1,d_2)=(4,3),(5,3),(6,3),(4,4),(5,4),(4,5)\), \(c_1 = \epsilon _1 L_1 + \epsilon _2 L_2\) with \(\epsilon _1=-3,\ldots ,7, \epsilon _2 = -2,\ldots ,8\), modulo \(x^{M}\) (bound from Corollary 3.4 (i)) and modulo \(x^{N+124}\) (bound from \(A_i\)),

where

$$\begin{aligned} M :=\,&\frac{5}{2}d_1(d_1d_2-3d_2-d_1+3)+5d_2 + \epsilon _1^2 d_2 + 2 \epsilon _1 \epsilon _2 d_1 \\&- 2 \epsilon _1 (d_1-3) d_2 - 2 \epsilon _2 (d_1-3) d_1 - 2\epsilon _1 (d_2-2) d_1, \\ N :=\,&- \epsilon _1^2 d_2 - 2 \epsilon _1 \epsilon _2 d_1- \frac{3}{2}d_1(d_1d_2-3d_2-d_1+3)-3d_2 \\&+ 4\min \Big \{0,(d_1-3)(\epsilon _1-(d_1-3))d_2 + (d_1-3)(\epsilon _2 - (d_2-2))d_1 \\&\qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \ \!\! + (d_2-2)(\epsilon _1 - (d_1-3))d_1\Big \}. \end{aligned}$$

For general y, we checked Conjecture 1.1 in the following cases:

  • for \(S_{d_1,d_2}\) such that \((d_1,d_2)=(4,3), (4,4)\), \(c_1 = \epsilon _1 L_1 + \epsilon _2 L_2\) with \(\epsilon _1=-1,\ldots ,3, \epsilon _2 = -2,\ldots ,5\), modulo \(x^{M}\) (bound from Corollary 3.4 (i)) and modulo \(x^{N+32}\) (bound from \(A_i\)).

Examples contained in these tables with \((d_1,d_2) = (4,3)\) are

$$\begin{aligned} {\textsf {Z}}_{S_{4,3},3 L_1 }(x,1)&=128 + 5350656x^4 \\&\quad + 18196176128x^8 + 20761214894592x^{12} +O(x^{16}),\\ {\textsf {Z}}_{S_{4,3},3 L_2 }^\text {odd}(x,1)&= - 2704756736 x^{7} +O(x^{15}), \end{aligned}$$

where we choose a polarization H such that \(L_1 H\) is odd in the first case. A virtual refinement of the second formula is given by

$$\begin{aligned}&{\textsf {Z}}_{S_{4,3},3 L_2 }^\text {odd}(xy^{\frac{1}{2}},y) =\, - \big (6323328y^{7} +81371136y^{6} +394518784y^{5} \\&\quad + 870165120(y^4+y^3) +394518784y^{2}+81371136y+6323328\big )x^7 + O(x^{15}). \end{aligned}$$

Finally we consider smooth surfaces \(S_{d_1,d_2,d_3}\) of tridegree \((d_1,d_2,d_3)\) in \({\mathbb {P}}^1 \times {\mathbb {P}}^1 \times {\mathbb {P}}^1\). Then \(S_{d_1,d_2,d_3}\) is simply connected and one can compute

$$\begin{aligned} K_{S_{d_1,d_2,d_3}}^2&= 2 \prod _{{{\begin{array}{c} (i,j,k) \\ i,j,k \, \text {distinct} \end{array}}}} d_i (d_j - 2)(d_k-2) , \\ \chi ({\mathcal {O}}_{S_{d_1,d_2,d_3}})&= \frac{1}{2} d_1d_2d_3-\frac{1}{6}d_1d_2-\frac{1}{6}d_1d_3-\frac{1}{6} d_2 d_3-\frac{1}{3}d_1-\frac{1}{3}d_2-\frac{1}{3}d_3+2. \end{aligned}$$

Taking \(d_1,d_2,d_3 \ge 3\), the canonical linear system is very ample and contains smooth connected curves. Denote by \(L_i\) the restriction of \(\pi _i^{*} L\) to S, where \(\pi _i : {\mathbb {P}}^1 \times {\mathbb {P}}^1 \times {\mathbb {P}}^1 \rightarrow {\mathbb {P}}^1\) are the projections and L is the class of a point on \({\mathbb {P}}^1\). For \(y=1\), we tested Conjecture 1.1 using Corollary 3.4 and the strong version of Mochizuki’s formula (Remark 3.5) in the following cases:

  • for \(S_{d_1,d_2,d_3}\) such that \((d_1,d_2,d_3)=(3,3,3), (3,3,4), (3,3,5),(3,4,4)\), \(c_1 = \epsilon _1 L_1 + \epsilon _2 L_2 + \epsilon _3 L_3\) with \(\epsilon _1, \epsilon _2, \epsilon _3 = -2,\ldots ,5\), modulo \(x^{M}\) (bound from Corollary 3.4 (i)) and modulo \(x^{N+124}\) (bound from \(A_i\)),

where

$$\begin{aligned} M := \,&\frac{5}{2} d_1d_2d_3-\frac{5}{6}d_1d_2-\frac{5}{6}d_1d_3-\frac{5}{6} d_2 d_3-\frac{5}{3}d_1-\frac{5}{3}d_2-\frac{5}{3}d_3+10 \\&- 2 \prod _{{{\begin{array}{c} (i,j,k) \\ i,j,k \, \text {distinct} \end{array}}}} (d_i-2) d_j e_k + 2 \prod _{{{\begin{array}{c} (i,j,k) \\ i,j,k \, \text {distinct} \end{array}}}} e_i e_j d_k, \\ N := \,&-\frac{3}{2} d_1d_2d_3+\frac{1}{2}d_1d_2+\frac{1}{2}d_1d_3+\frac{1}{2} d_2 d_3+d_1+d_2+d_3-6 \\&-\prod _{{{\begin{array}{c} (i,j,k) \\ i,j,k \, \text {distinct} \end{array}}}} e_i e_j d_k +4\min \Big \{0,\prod (d_i-2)(e_j - (d_j-2)) d_k \Big \}. \end{aligned}$$

For arbitrary y, we checked Conjecture 1.1 in the following cases:

  • for \(S_{d_1,d_2,d_3}\) such that \((d_1,d_2,d_3)=(3,3,3),(3,3,4)\), \(c_1 = \epsilon _1 L_1 + \epsilon _2 L_2 + \epsilon _3 L_3\) with \(\epsilon _1, \epsilon _2, \epsilon _3 =-1,\ldots ,3\), modulo \(x^{M}\) (bound from Corollary 3.4 (i)) and modulo \(x^{N+32}\) (bound from \(A_i\)).

An example covered by these tables with \((d_1,d_2,d_3) = (3,3,3)\) is the following

$$\begin{aligned} {\textsf {Z}}_{S, -L_1+2L_2+2L_3}(x,1) =-147456 x -8358985728 x^5 +O( x^{9}), \end{aligned}$$

where we choose a polarization H such that \(L_1 H\) is odd. Its virtual refinement is

$$\begin{aligned} {\textsf {Z}}_{S,3L_1}(xy^{\frac{1}{2}},y)={\textsf {Z}}_{S, -L_1+2L_2+2L_3}(xy^{\frac{1}{2}},y)= (-73728y- 73728)x+O(x^5). \end{aligned}$$

6.6 Complete intersections in projective spaces

For \(d_1,d_2\in {\mathbb {Z}}_{\ge 2}\), with \(d_1+d_2\ge 6\), let \(S_{d_1,d_2}\) be a smooth complete intersection of bidegree \((d_1,d_2)\) in \({\mathbb {P}}^4\). Then \(S_{d_1,d_2}\) is simply connected. Let L be the restriction of the hyperplane class on \({\mathbb {P}}^4\) to \(S_{d_1,d_2}\). If \(S_{d_1,d_2}\) is very general, then by the Noether–Lefschetz theorem for complete intersections (see e.g. [Kim]) the Picard group of \(S_{d_1,d_2}\) is generated by L. Putting \(d:=d_1d_2\), \(D:=d_1+d_2\), we have \(K_{S_{d_1,d_2}}=(D-5)L\) is very ample and

$$\begin{aligned} K_{S_{d_1,d_2}}^2&=d(D-5)^2, \\ \chi ({\mathcal {O}}_{S_{d_1,d_2}})&=\frac{d}{12}(2D^2-15D-d+35). \end{aligned}$$

If the Picard group of \(S_{d_1,d_2}\) is \({\mathbb {Z}}L\) and \(H=L\), then there are no rank 2 strictly \(\mu \)-semistable sheaves on \(S_{d_1,d_2}\) with \(c_1=\epsilon L\) and \(\epsilon \) odd.

For \(y=1\), we checked Conjecture 1.1 in the following cases:

  • for \(S_{d_1,d_2}\) such that \((d_1,d_2)=(2,4),(2,5),(3,3),(3,4)\), \(c_1 = \epsilon L\) with \(\epsilon =-1,\ldots ,3\), modulo \(x^{M}\) (bound from Corollary 3.4 (i)) and modulo \(x^{N+124}\) (bound from \(A_i\)).

Here

$$\begin{aligned} M&:=\epsilon ^2d-2\epsilon d(D-5)+\frac{5d}{12}(2D^2-15D-d+35),\\ N&:=-\epsilon ^2d-\frac{3d}{12}(2D^2-15D-d+35)+4\min \Big \{0,d(D-5)(\epsilon -(D-5))\Big \}. \end{aligned}$$

For y general, we checked Conjecture 1.1 in the following cases:

  • for \(S_{d_1,d_2}\) such that \((d_1,d_2)=(2,4), (3,3)\), \(c_1 = \epsilon L\) with \(\epsilon =1\), modulo \(x^{M}\) (bound from Corollary 3.4 (i)) and modulo \(x^{N+32}\) (bound from \(A_i\)).

For \(\epsilon =3\) we determined \({\textsf {Z}}_{S_{3,3},L,L}(x,1)\), and \({\textsf {Z}}_{S_{2,4},L,L}(x,1)\) modulo \(x^{25}\) and \(x^{34}\), where the second is only written modulo \(x^{26}\).

$$\begin{aligned} {\textsf {Z}}_{S_{3,3},L,L}(x,1)=\,&-1152x - 11784960x^5 - 18762235136x^9 - 11903890079232x^{13} \\&- 4135344957021312x^{17} - 924519456314916096x^{21} + O(x^{25}),\\ {\textsf {Z}}_{S_{2,4},L,L}(x,1)=\,&6912x^2 + 30124032x^6 + 31867565056x^{10} + 15237098061824x^{14} \\&+ 4243875728564736x^{18} + 789670403161694208x^{22} +O(x^{26}). \end{aligned}$$

For \(\epsilon =1\) and using the strong form of Mochizuki’s formula (Remark 3.5), we get

$$\begin{aligned} \chi _{-y}^\text {vir}(M^L_{S_{3,3}}(2,L,7))&=-576y - 576, \\ \chi _{-y}^\text {vir}(M^L_{S_{2,4}}(2,L,7))&=1344y^2 + 4224y + 1344. \end{aligned}$$

For \(d_1,d_2,d_3\in {\mathbb {Z}}_{\ge 2}\), with \(d_1+d_2+d_3\ge 7\), let \(S_{d_1,d_2,d_3}\) be a smooth complete intersection of tridegree \((d_1,d_2,d_3)\) in \({\mathbb {P}}^5\). Then \(S_{d_1,d_2,d_3}\) is simply connected. When \(S_{d_1,d_2,d_3}\) is very general, the Picard group of \(S_{d_1,d_2,d_3}\) is generated by the restriction L of a hyperplane class on \({\mathbb {P}}^5\). Putting \(d:=d_1d_2d_3\), \(D:=d_1+d_2+d_3\), we have \(K_{S_{d_1,d_2,d_3}}=(D-6)L\) is very ample and

$$\begin{aligned} K_{S_{d_1,d_2,d_3}}^2&=d(D-6)^2,\\ \chi ({\mathcal {O}}_{S_{d_1,d_2,d_3}})&=\frac{d}{12}(2D^2-18D+51-d_1d_2-d_1d_3-d_2d_3). \end{aligned}$$

For \(y=1\), we checked Conjecture 1.1 in the following cases:

  • for \(S_{d_1,d_2,d_3}\) with \((d_1,d_2,d_3)=(2,2,3)\), \(c_1 = \epsilon L\) such that \(\epsilon =0,1,2\), modulo \(x^{12\epsilon ^2-24\epsilon +35}\) (bound from Corollary 3.4 (i)) and modulo \( x^{48\min \{0,\epsilon -1\}-12\epsilon ^2+103}\) (bound from \(A_i\)).

As an example we get

$$\begin{aligned} {\textsf {Z}}_{S_{2,2,3},L,L}(x,1)=&-1261568x^3 - 7379091456x^7 - 11717181702144x^{11}\\&- 8585117244063744x^{15} - 3662336916158939136x^{19} + O(x^{23}). \end{aligned}$$

6.7 Verifications of Conjecture 5.1

Let \(\underline{\beta } \in {\mathbb {Z}}^4\) and \((\gamma _1,\gamma _2)\in {\mathbb {Z}}^2\) such that \(\beta _1 \equiv \beta _2 \mod 2\) and \(\beta _3 \ge \beta _4 - 3\). Put

$$\begin{aligned} M_1&:=\min \Big \{2\beta _1-2\beta _2+8\beta _4, 124-4\gamma _1+4\gamma _2,124+4\beta _2-4\beta _3+4\gamma _1-4\gamma _2\Big \}\\&\quad -\beta _1-3\beta _4,\\ M_2&:= \min \Big \{2\beta _1-2\beta _2+8\beta _4, 32-4\gamma _1+4\gamma _2,32+4\beta _2-4\beta _3+4\gamma _1-4\gamma _2\Big \}\\&\quad -\beta _1-3\beta _4. \end{aligned}$$

For \(y=1\), we verified Conjecture 5.1 up to order \(x^{M_1-1}\) in the following cases:

  • \(\gamma _1=\gamma _2=0\) and any \(|\beta _1|, |\beta _2|, |\beta _3|, |\beta _4| \le 16\),

  • any \(|\gamma _1|, |\gamma _2|, |\beta _1|, |\beta _2|, |\beta _3|, |\beta _4| \le 4\).

For arbitrary y, we verified Conjecture 5.1 up to order \(x^{M_2-1}\) in the following cases:

  • \(\gamma _1=\gamma _2=0\) and any \(|\beta _1|, |\beta _2|, |\beta _3|, |\beta _4| \le 3\),

  • any \(|\gamma _1|, |\gamma _2|, |\beta _1|, |\beta _2|, |\beta _3|, |\beta _4| \le 2\).

We have examples showing that the condition \(\beta _3 \ge \beta _4 - 3\) is necessary.

Remark 6.2

Any choice of \(\underline{\beta } \in {\mathbb {Z}}^4\) with \(\beta _1 \equiv \beta _2 \mod 2\) satisfying

$$\begin{aligned} \frac{1}{12} \beta _3 \le \beta _4 < \frac{1}{9} \beta _3 \end{aligned}$$

does not arise geometrically and automatically implies \(\beta _3 \ge \beta _4-3\). These inequalities correspond to “\(e(S) \ge 0\) but violation of the Bogomolov-Miyaoka-Yau inequality”, so in this regime \(\underline{\beta }\) cannot be realized by a smooth projective algebraic surface. With these restrictions, we verified Conjecture 5.1 for \(y=1\) up to order \(x^{M_1-1}\) for all

$$\begin{aligned} \gamma _1=\gamma _2=0 \hbox { and } |\beta _1|, |\beta _2|, |\beta _3|, |\beta _4|&\le 25, \end{aligned}$$

and for arbitrary y up to order \(x^{M_2-1}\) for all

$$\begin{aligned} \gamma _1=\gamma _2=0 \hbox { and } |\beta _1|, |\beta _2|, |\beta _3|, |\beta _4|&\le 27. \end{aligned}$$