1 Introduction

Let \(X\) be a smooth projective variety of dimension \(n\ge 2\) defined over an algebraically closed field \(k\). Let us fix an ample divisor \(H\) on \(X\). In [3] Bogomolov proved a celebrated inequality saying that in case \(n=2\) and \(k={\mathbb C}\), the discriminant \(\Delta (E)=2rc_2(E)-(r-1)c_1^2(E)\) of any slope \(H\)-semistable rank \(r\) vector bundle \(E\) is non-negative. Together with the Mehta–Ramanathan restriction theorem this implies that for \(k={\mathbb C}\) and any \(n\ge 2\) we have \(\Delta (E)H^{n-2}\ge 0\) for any slope \(H\)-semistable torsion free sheaf \(E\) of rank \(r\) on \(X\).

In [21] we proved that in positive characteristic the same inequality holds for strongly \(H\)-semistable sheaves. On the other hand, it is well known that in general this inequality fails. For example, any counterexample to Kodaira’s vanishing on surfaces (see [30, 33]) leads to a slope \(H\)-semistable vector bundle violating Bogomolov’s inequality. However, we prove the following surprising theorem:

Theorem 1

Assume that \(X\) can be lifted to the ring \(W_2(k)\) of Witt vectors of length \(2\). Then for any slope \(H\)-semistable sheaf \(E\) of rank \(r\le p= \mathop {char} \, k\) we have

$$\begin{aligned} \Delta (E) H^{n-2}\ge 0. \end{aligned}$$

This theorem was known only in the surface case for rank 2 vector bundles in characteristic 2 (see [34, Corollary 11]). Let us recall that such an inequality has many applications, for example to adjoint linear systems via Reider’s method (see [2, IV, Section 11] and Sect. 7).

In case of complex varieties there exist several proofs of Bogomolov’s inequality. Roughly, there are three different proofs of this inequality. The first one, due to Bogomolov (see [3, 12]), is algebraic, using the fact that symmetric powers of a semistable bundle have few sections. The second one, due to Gieseker (see [11]), is also algebraic, using reduction to positive characteristic and studying sections of the Frobenius pull back of the bundle. The third proof is analytic, using the Kobayashi–Hitchin correspondence.

Unfortunately, none of these proofs works in positive characteristic and we do not know how to prove the above theorem without proving a much stronger result for Higgs bundles.

Let us recall that one of the starting points of Simpson’s work on nonabelian Hodge theory was his paper [35] in which he proved the generalized Kobayashi–Hitchin correspondence: on a compact complex Kähler manifold every stable Higgs bundle has a Hermitian–Yang–Mills metric. As a corollary, Simpson proved Bogomolov’s inequality for stable Higgs bundles on such manifolds. This inequality played an important role in further development of the subject (see, e.g., [28, 29]).

To the author one of the most intriguing and eluding problems was how to prove Bogomolov’s inequality for Higgs sheaves using algebraic methods. This problem was posed by Narasimhan in late 1990s. Early attempts to solve this problem worked only in low rank and used vanishing theorems to reduce the problem to the usual Bogomolov’s inequality (see [1, 20]). The first algebraic approach to Bogomolov’s inequality via bounding the number of sections of symmetric powers does not seem to work in the Higgs case. This motivated the author to work on the positive characteristic case and resulted in a proof of Bogomolov’s inequality for strongly semistable sheaves (see [21]). Unfortunately, this did not shed light on the original problem.

An algebraic proof of Bogomolov’s inequality for Higgs sheaves became possible only after the appearance of Ogus and Vologodsky’s non-abelian Hodge theory in positive characteristic (see [31]). We also use in a crucial way the results of [21] to prove a weak version of Bogomolov’s inequality for semistable modules with generalized connections. Bogomolov’s inequality for Higgs sheaves in characteristic zero is then proven by reducing to positive characteristic and applying the above inequality to the vector bundle with a connection corresponding to the Higgs bundle. This argument needs a small adjustment as Ogus and Vologodsky prove Simpson’s correspondence only for Higgs bundles satisfying some nilpotence conditions which are not satisfied in our case.

But the real main aim of this paper is to establish the following Bogomolov’s inequality for Higgs sheaves in positive characteristic:

Theorem 2

Assume that \(X\) be can be lifted to the ring \(W_2(k)\) of Witt vectors of length \(2\). Then for any slope \(H\)-semistable Higgs sheaf \((E, \theta )\) of rank \(r\le p\) we have

$$\begin{aligned} \Delta (E) H^{n-2}\ge 0. \end{aligned}$$

The strategy of proof of the above theorem is the following. First we use the results of [21] to prove a weak version of Bogomolov’s inequality for vector bundles with generalized connections (see Theorem 6). Then we deform \((E, \theta )\) to a slope semistable system of Hodge sheaves and we recall a construction of a Higgs–de Rham sequence for such systems. Existence of such sequences for semistable Higgs sheaves was conjectured by Lan et al. in [18] and we prove their conjecture in a separate paper (see [25]; after this paper was submitted there also appeared another proof of this result in [17]).

The sequence is constructed by means of Ogus and Vologodsky’s inverse Cartier transform. This transform changes a slope semistable system of Hodge sheaves into a slope semistable coherent sheaf with an integrable connection with “larger Chern classes”. Then we can deform this sheaf into another slope semistable system of Hodge sheaves and we repeat the process constructing a sequence of slope semistable systems of Hodge bundles. If the discriminant of the original sheaf were negative then this process would make the discriminants of further terms in this sequence more and more negative. But by the above weak Bogomolov’s inequality, the discriminants in the constructed sequence are bounded below, so we get a contradiction.

In [3, Theorem 5] Bogomolov proved that on a complex projective surface of general type we have \(4c_2\ge c_1^2\). Later, Miyaoka [27] and Yau [38], proved the optimal inequality \(3c_2\ge c_1^2\). It is well known that both these inequalities fail in positive characteristic (see, e.g., [7, 15, 30, 33, 37]). In 1989 Shepherd-Barron conjectured (see [34, p. 244]) that the crucial theorem in algebraic proofs of the Bogomolov–Miyaoka–Yau inequality (see [3, Theorem 4] and [27, Theorem 2”]) holds for surfaces liftable to \(W_2(k)\). We recall an unpublished example of Ekedahl et al. (see [8, p. 23, Remark ii]) showing that this conjecture is false. It is therefore very surprising that in characteristic \(p\ge 3\) the Miyaoka–Yau inequality \(3c_2\ge c_1^2\) holds for surfaces of non-negative Kodaira dimension liftable to \(W_2(k)\) (see Theorem 13). This is a strong version of the inequality conjectured by Shepherd–Barron (see [34, p. 244]). In characteristic \(2\) we reprove the weaker inequality \(4c_2\ge c_1^2\) known from [34].

Another interesting result says that semistable Higgs sheaves with vanishing Chern classes are locally free (see Theorem 11). The analogous result [36, Theorem 2] was proven by Simpson using analytic methods and the (topological) Lefschetz hyperplane theorem. We obtain his result as a corollary by purely algebraic methods.

Other applications of Theorem 2 include an effective restriction theorem for Higgs sheaves (see Theorem 10) and an analogue in positive characteristic of the new restriction theorem of Bruzzo and Hernández Ruipérez [5] (see Theorem 12). As a corollary, we also obtain a new proof of the original theorem in characteristic zero.

In the final section we give a quick proof by Bhargav Bhatt of the fact that Frobenius split varieties are liftable to the ring of Witt vectors of length 2. In particular, all the obtained results are valid for sheaves on such varieties.

For simplicity, we deal in this paper only with the usual Higgs sheaves. However, very similar methods work also for sheaves with \(\lambda \)-connections and in case of logarithmic Higgs bundles. The parabolic version is more complicated and requires a quite heavy notation. All these generalizations will be treated elsewhere.

The paper is organized as follows. In Sect. 2 we gather several preliminary results. Section 3 contains proofs of various Bogomolov type inequalities. In Sect. 4 we apply these inequalities to prove restriction theorems for Higgs sheaves. Then in Sect. 5 we study semistable Higgs sheaves with vanishing Chern classes. In Sect. 6 we prove the Bogomolov–Miyaoka–Yau inequality in positive characteristic. In Sect. 7 we quickly recall applications of Bogomolov’s inequality to adjoint linear systems. In Sect. 8 we prove that Frobenius split varieties are liftable to \(W_2\).

1.1 Notation

If \(x\) is a real number then we set \([x]_{+} :=\max (x, 0).\)

Let \(S\) be a scheme of characteristic \(p\) (i.e., \({\fancyscript{O}}_S\) is an \({\mathbb F}_p\)-algebra). By \(F_S^r :S\rightarrow S\) we denote the \(r\)-th absolute Frobenius morphism of \(S\) which corresponds to the \(p^r\)-th power mapping on \({\fancyscript{O}}_S\). If \(X\) is an \(S\)-scheme, we denote by \(X'\) the fiber product of \(X\) and \(S\) over the first absolute Frobenius morphism of \(S\). The absolute Frobenius morphism of \(X\) induces the relative Frobenius morphism \(F_{X/S}: X\rightarrow X'\).

For a rank \(r\) torsion free sheaf \(E\) on a smooth projective variety \(X\) we set

$$\begin{aligned} \Delta (E):=2rc_2(E)- (r-1)c_1(E)^2. \end{aligned}$$

2 Preliminaries

2.1 Semistability

Let \(k\) be an algebraically closed field of any characteristic. Let \(X\) be a smooth projective variety of dimension \(n\ge 1\) over \(k\) and let \(D_1,\ldots ,D_{n-1}\) be nef divisors on \(X\) such that the \(1\)-cycle \(D_1\dots D_{n-1}\) is numerically nontrivial, i.e., there exists a divisor \(D\) such that \(DD_1\dots D_{n-1}\ne 0\).

Let \(E\) be a rank \(r\) torsion free sheaf on \(X\). We define the slope of \(E\) by

$$\begin{aligned} \mu (E)=\mu _{D_1,\ldots ,D_{n-1}}(E)=\frac{c_1(E) D_1\dots D_{n-1} }{r}. \end{aligned}$$

We say that \(E\) is slope \((D_1, \ldots , D_{n-1})\)-semistable if for every subsheaf \(E'\subset E\) we have

$$\begin{aligned} \mu _{D_1,\ldots ,D_{n-1}} (E')\le \mu _{D_1,\ldots ,D_{n-1}}(E). \end{aligned}$$

If \(H\) is an ample divisor then we say that \(E\) is slope \(H\)-semistable, if it is slope \((H, \ldots , H)\)-semistable.

For simplicity, in the following we usually omit in notation dependence of slopes on the collection \((D_1, \ldots , D_{n-1})\). The maximal (minimal) slope in the Harder–Narasimhan filtration of \(E\) (with respect to \((D_1,\ldots ,D_{n-1})\)) is denoted by \(\mu _{\max } (E)\) (\(\mu _{\min } (E)\), respectively). If \(k\) has positive characteristic then we set

$$\begin{aligned} L_{\max } (E) =\lim _{m\rightarrow \infty } \frac{\mu _{\max } ((F^m_X)^*E)}{p^m} \end{aligned}$$

and

$$\begin{aligned} L _{\min } (E) =\lim _{m\rightarrow \infty } \frac{\mu _{\min } ((F^m_X)^*E)}{p^m}. \end{aligned}$$

By [21, Theorem 2.7] \(L_{\max } (E)\) and \(L_{\min } (E)\) are well defined rational numbers.

We say that \(E\) is strongly slope semistable if all the Frobenius pull backs \((F_X^m)^*E\) of \(E\) for \(m\ge 0\) are slope semistable. This is equivalent to saying that \(L_{\max } (E) =\mu (E)\) (or, equivalently, to \(L_{\min } (E) =\mu (E)\)).

2.2 Higgs sheaves

A Higgs sheaf \((E, \theta )\) is a pair consisting of a coherent \({\fancyscript{O}}_X\)-module \(E\) and an \({\fancyscript{O}}_X\)-linear map \(\theta : E\rightarrow E\otimes _{{\fancyscript{O}}_X}\Omega _X\) satisfying the integrability condition \(\theta \wedge \theta =0\).

We say that \((E,\theta )\) is slope semistable if the inequality \(\mu (E')\le \mu (E)\) is satisfied for every Higgs subsheaf \((E', \theta ')\) of \( (E, \theta )\).

A system of Hodge sheaves is a Higgs sheaf \((E, \theta )\) with decomposition \(E=\bigoplus E^j\) such that \(\theta : E^j \rightarrow E^{j-1}\otimes _{{\fancyscript{O}}_X}\Omega _X\).

As above, a system of Hodge sheaves \((E,\theta )\) is slope semistable if the inequality \(\mu (E')\le \mu (E)\) is satisfied for every subsystem of Hodge sheaves \((E', \theta ')\subset (E, \theta )\). It is easy to check that then \((E,\theta )\) is slope semistable as a Higgs sheaf (see [25, Corollary 3.5]).

Using properness of the Hitchin morphism one can show the following lemma (see [25, Corollary 5.7]):

Lemma 1

Let \((E, \theta )\) be a torsion free, slope semistable Higgs sheaf on \(X\). Then there exists an \({\mathbb A}^1\)-flat family of Higgs sheaves \((\tilde{E}, \tilde{\theta })\) on \(X\times {\mathbb A}^1\) such that for any closed \(k\)-point \(t\in {\mathbb A}^1-\{0\}\) the restriction \((\tilde{E}_t, \tilde{\theta }_t)\) to the fiber over \(t\) is isomorphic to \((E, \theta )\) and \((\tilde{E}_0, \tilde{\theta }_0)\) is a slope semistable system of Hodge sheaves.

Let \((E, \nabla )\) be a coherent sheaf with an integrable connection. Similarly to the above, one can define slope semistability for coherent sheaves with integrable connections. In [25, Theorem 5.5] (see also [17, Theorem 2.2]) we prove the following theorem:

Theorem 3

If \((E, \nabla )\) is slope semistable then there exists a canonical Griffiths transverse filtration \(N^0=0\subset N^1\subset \cdots \subset N^m=E\) such that the associated graded system of Hodge sheaves is slope semistable. This filtration is preserved by the automorphisms of \((E,\nabla )\).

The canonical filtration \(N^{\bullet }\) from Theorem 3 is called Simpson’s filtration of \((E, \nabla )\) and denoted by \(N^{\bullet }_S\).

2.3 Simpson’s correspondence in positive characteristic

In this subsection we recall the main results of Ogus and Vologodsky [31].

Let \(f:X\rightarrow S\) be a morphism of schemes in characteristic \(p\). A lifting of \(X/S\) modulo \(p^2\) is a morphism \(\tilde{X} \rightarrow \tilde{S}\) of flat \({\mathbb Z}/p^2{\mathbb Z}\)-schemes such that \(X\rightarrow S\) is equal to the base change of \(\tilde{X} \rightarrow \tilde{S}\) by the closed embedding \(S\rightarrow \tilde{S}\) defined by \(p\).

Let \({\mathop {\mathrm{MIC}}}_{p-1}(X/S)\) be the category of \({\fancyscript{O}}_X\)-modules with an integrable connection whose \(p\)-curvature is nilpotent of level less or equal to \(p-1\). Similarly, let \({\mathop {\mathrm{HIG}}}_{p-1}(X/S)\) denote the category of Higgs \({\fancyscript{O}}_{X'}\)-modules with a nilpotent Higgs field of level less or equal to \(p-1\). The following theorem is one of the main results of Ogus and Vologodsky (see [31, Theorem 2.8]):

Theorem 4

Let \({\fancyscript{X}}/{\fancyscript{S}}:=(X/S, \tilde{X}'/\tilde{S})\) be a smooth morphism with a lifting \(\tilde{X}'/\tilde{S}\) of \({X}'/{S}\) modulo \(p^2\). Then the Cartier transform

$$\begin{aligned} C_{{\fancyscript{X}}/{\fancyscript{S}}}:{\mathop {\mathrm{MIC}}}_{p-1}(X/S)\rightarrow {\mathop {\mathrm{HIG}}}_{p-1}(X'/S) \end{aligned}$$

defines an equivalence of categories with quasi-inverse

$$\begin{aligned} C_{{\fancyscript{X}}/{\fancyscript{S}}}^{-1}:{\mathop {\mathrm{HIG}}}_{p-1}(X/S)\rightarrow {\mathop {\mathrm{MIC}}}_{p-1}(X'/S). \end{aligned}$$

A small variant of the following lemma can be found in proof of [31, Theorem 4.17]:

Lemma 2

Let \((E, \theta )\in {\mathop {\mathrm{HIG}}}_{p-1}(X'/S)\). Then

$$\begin{aligned}{}[C_{{\fancyscript{X}}/{\fancyscript{S}}}^{-1}(E)]=F_{X/S}^*[E], \end{aligned}$$

where \([\cdot ] \) denotes the class of a coherent \({\fancyscript{O}}_X\)-module in Grothendieck’s \(K\)-group \(K_0(X)\).

Proof

By assumption there exists a filtration \(N^m=0\subset N^{m-1}\subset \cdots \subset N^0=(E, \theta )\) of length \(m< p\) by Higgs subsheaves such that \(N_i=N^i/N^{i-1}\) has zero Higgs field. Then \(C_{{\fancyscript{X}}/{\fancyscript{S}}}^{-1}(N_i)=F_{X/S}^*N_i\) and therefore

$$\begin{aligned}{}[C_{{\fancyscript{X}}/{\fancyscript{S}}}^{-1}(E)]=\sum [C_{{\fancyscript{X}}/{\fancyscript{S}}}^{-1}(N_i)] =\sum [F_{X/S}^*N_i]=F_{X/S}^*[E]. \end{aligned}$$

\(\square \)

As a corollary to Theorem 4 and Lemma 2 we get the following:

Corollary 1

Assume that \(S\) is the spectrum of an algebraically closed field and \(X\) is projective. Let \((E, \theta )\) be a torsion free Higgs sheaf with nilpotent Higgs field of level less than \(p\). Then it is slope semistable if and only if the corresponding sheaf with integrable connection \(C_{{\fancyscript{X}}/{\fancyscript{S}}}^{-1}(E, \theta )\) is slope semistable.

Proof

Let \((V', \nabla ')\subset (V,\nabla )=C_{{\fancyscript{X}}/{\fancyscript{S}}}^{-1}(E, \theta )\). Then by Theorem 4 \((E, \theta )\simeq C_{{\fancyscript{X}}/{\fancyscript{S}}}(V,\nabla )\) contains \((E', \theta ')\simeq C_{{\fancyscript{X}}/{\fancyscript{S}}}(V',\nabla ')\). So if \((E, \theta )\) is slope semistable then by Lemma 2 we have

$$\begin{aligned} \mu (V')=p\mu (E')\le p\mu (E)=\mu (V), \end{aligned}$$

which proves slope semistability of \(C_{{\fancyscript{X}}/{\fancyscript{S}}}^{-1}(E, \theta )\).

Implication in the opposite direction is completely analogous.

2.4 Higgs–de Rham sequences

Let \(k\) be an algebraically closed field of characteristic \(p>0\). Let \(X\) be a smooth projective \(k\)-variety of dimension \(n\) that can be lifted to a smooth scheme over the ring \(W_2(k)\) of Witt vectors of length \(2\). Let us set \(S=\mathop {\hbox {Spec }}k\) and \(\tilde{S}= \mathop {\hbox {Spec }}W_2(k)\). Since \(X'\) is isomorphic to \(X\), this provides the lifting needed in Ogus’ and Vologodsky’s version of Simpson’s correspondence.

Let \((E, \theta )\) be a rank \(r\) torsion free Higgs sheaf with nilpotent Higgs field. Let us assume that \(r\le p\) so that level of nilpotence of \((E, \theta )\) is less than \(p\). The following definition is taken from [18]:

Definition 1

A Higgs–de Rham sequence of \((E, \theta )\) is an infinite sequence

in which \(C^{-1}=C^{-1}_{{\fancyscript{X}}/S}\) is the inverse Cartier transform, \(N_i^{\bullet }\) is a Griffiths transverse filtration of \((V_i, \nabla _i)\) and \((E_{i+1}:={\mathop {\mathrm{Gr}}}_{N_{i}}(V_{i}), \theta _{i+1})\) is the associated Higgs sheaf.

Let us note the following lemma:

Lemma 3

If in a Higgs–de Rham sequence of \((E,\theta )\) there exists \(i\) such that \(E_i\) is locally free then \(E_j\) and \(V_j\) are locally free for \(j\le i\).

Proof

Note that local freeness of \(V_i\) follows from the construction of the inverse Cartier transform. Indeed, locally \(V_i\) is isomorphic to the Frobenius pull back of \(E_i\) (see [31, Theorem 2.8, 3]), so it is locally free.

Now note that there exists a family whose general member is isomorphic to \(V_{i-1}\) and the special one is \(E_i\). By openness of local freeness we see that \(V_{i-1}\) is locally free. At this point one can also use the following argument due to the referee: if a sheaf admits a finite exhaustive filtration whose associated graded is locally free, then it is locally free (proof by induction on the length of the filtration). Therefore if \(E_i\) is locally free then \(V_{i-1}\) is locally free.

This implies that \(E_{i-1}\) is locally free by descent of flatness under the Frobenius morphism (again we use this locally using the construction of the inverse Cartier transform). This last fact is a special case of Ferrand’s theorem (see [10]). Now the assertions follow by induction on decreasing \(j\). \(\square \)

The following theorem was conjectured by Lan–Sheng–Zuo [18, Conjecture 2.8] and it follows from Theorem 3 (see [25, Theorem 5.12]; see also [17, Theorem 3.2] for a slightly weaker statement):

Theorem 5

If \((E, \theta )\) is slope semistable then there exists a canonically defined Higgs–de Rham sequence

in which each \((V_i, \nabla _i)\) is slope semistable and \((E_{i+1}, \theta _{i+1})\) is the slope semistable Higgs sheaf associated to \((V_i, \nabla _i)\) via Simpson’s filtration.

As usual, in the above theorem slope semistability is defined with respect to an arbitrary fixed collection \((D_1,\ldots ,D_{n-1})\) of nef divisors on \(X\).

3 Bogomolov type inequalities

In this section we use the same notation as in Sect. 2.1. Namely, \(k\) is an algebraically closed field of any characteristic and \(X\) is a smooth projective \(k\)-variety of dimension \(n\ge 1\). We also fix a collection of nef divisors \((D_1,\ldots ,D_{n-1})\) on \(X\) such that the 1-cycle \(D_1\dots D_{n-1}\) is numerically nontrivial.

3.1 Bogomolov’s inequality for sheaves with generalized connections

Let us fix a torsion free, coherent \({\fancyscript{O}}_X\)-module \(M\) with a \(k\)-derivation \(d_M:{\fancyscript{O}}_X\rightarrow M\). A \(d_M\)-connection (or simply an \(M\)-connection if \(d_M\) is clear from the context) on a coherent \({\fancyscript{O}}_X\)-module \(E\) is a \(k\)-linear map \(\nabla : E\rightarrow E\otimes _{{\fancyscript{O}}_X} M \) satisfying the Leibniz rule

$$\begin{aligned} \nabla (fe) =f\nabla (e)+e\otimes d_M(f) \end{aligned}$$

for all sections \(f\in {\fancyscript{O}}_X\) and \(e\in E\).

A useful feature of a sheaf \(E\) with a \(d_M\)-connection, used below, is that if \(E'\subset E\) is an \({\fancyscript{O}}_X\)-submodule then the induced map \(E'\rightarrow (E/E')\otimes M\) is \({\fancyscript{O}}_X\)-linear.

From now on in this subsection we assume that \(k\) has characteristic \(p>0\).

Lemma 4

Let \((E, \nabla )\) be a rank \(r\) slope semistable sheaf with a \(d_M\)-connection. Let us set

$$\begin{aligned} M_t=(F_X^t)^*M\oplus (F_X^{t-1})^*\Omega _X \oplus ... \oplus \Omega _X \end{aligned}$$

for \(t=0,1,\ldots \) If \(E'\subset (F_X^t)^*E\) is a saturated \({\fancyscript{O}}_X\)-submodule destabilizing \((F_X^t)^*E\) then there exists a non-zero \({\fancyscript{O}}_X\)-linear map \(E'\rightarrow ((F^t_X)^*E/E')\otimes M_t.\)

Proof

The proof is by induction on \(t\). If \(t=0\) then \(E'\) is not preserved by \(\nabla \) as this would contradict semistability of \((E, \nabla )\). Then we get the required map as the one induced by \(\nabla \). Now let us take \(t\ge 1\) and assume the assertion holds for \({\fancyscript{O}}_X\)-submodules of the \((t-1)\)-th Frobenius pullback of \(E\). If \(E'\subset (F_X^t)^*E\) is not preserved by the canonical connection \(\nabla _{\hbox {can}}\) on \((F_X)^*((F_X^{t-1})^*E)\) then \(\nabla _{\hbox {can}}\) induces a non-zero \({\fancyscript{O}}_X\)-linear map \(E'\rightarrow ((F^t_X)^*E/E')\otimes \Omega _X\) and we obtain the required map by embedding \(\Omega _X\) into \(M_t\). Otherwise, \(E'\) is preserved by \(\nabla _{\hbox {can}}\) and by Cartier’s descent there exists an inclusion \(E''\subset (F_X^{t-1})^*E\) whose Frobenius pull back is \(E'\subset (F_X^{t})^*E\). Since \(E''\) destabilizes \((F_X^{t-1})^*E\), by the induction assumption there exists a non-zero \({\fancyscript{O}}_X\)-linear map \(E''\rightarrow ((F^{t-1}_X)^*E/E'')\otimes M_{t-1}.\) Pulling back this map by \(F_X\) and setting \(0\) on the last component, we get the required map. \(\square \)

Lemma 5

Let \((E, \nabla )\) be a rank \(r\) torsion free sheaf with a \(d_M\)-connection. Let us set

$$\begin{aligned} \gamma (M):=\max \left( L_{\max }(M) ,\frac{L_{\max }(\Omega _X)}{p}, 0\right) . \end{aligned}$$

If the collection \((D_1,\ldots , D_{n-1})\) consists of ample divisors then

$$\begin{aligned} \max (L_{\max } (E)-\mu _{\max } (E), \mu _{\min } (E)-L_{\min } (E)) \le (r-1) \gamma (M) . \end{aligned}$$

Moreover, if \((E, \nabla )\) is slope semistable then

$$\begin{aligned} L_{\max } (E)-L_{\min } (E) \le (r-1) \gamma (M). \end{aligned}$$

Proof

Let us assume that \((E, \nabla )\) is semistable. By [21, Theorem 2.7] we can find \(m\) such that quotients of the Harder–Narasimhan filtration for \((F_X^m)^*E, (F_X^m)^*M\) and \((F_X^m)^*\Omega _X\) are strongly semistable. Let \(0=E_{0t}\subset E_{1t}\subset \dots \subset E_{st}=(F_X^t)^*E\) be the Harder–Narasimhan filtration of \((F^t_X)^*E\) for \(t\ge m\). Let \(\mu _{it}\) be the corresponding slopes of quotients of this filtration. By Lemma 4 there exists a non-zero \({\fancyscript{O}}_X\)-linear map

$$\begin{aligned} E_{it}\rightarrow ((F^t_X)^*E/E_{it})\otimes M_t. \end{aligned}$$

Let us choose an ample divisor \(A\) such that \(T_X(A)\) is globally generated. Then for some \(N\) we have an inclusion \(\Omega _X \subset {\fancyscript{O}}_X(A)^N\) which gives a similar inclusion after pulling-back by the Frobenius morphisms. Hence, using the fact that tensor product of strongly semistable sheaves is strongly semistable (see [32, Theorem 3.23] and [22, Corollary A.3.1]), for any \(t\ge m\) we have

$$\begin{aligned} \mu _{\min } (E_{it})&\le \mu _{\max }((F^t_X)^*E/E_{it})+\max (\mu _{\max }((F^t_X)^*M),p^{m}A D_1\dots D_{t-1},\\&\quad \{\mu _{\max }((F^j_X)^*\Omega _X):\, {m\le j\le t-1}\}). \end{aligned}$$

After dividing by \(p^{t-m}\) this gives

$$\begin{aligned} \mu _{im}\le \mu _{(i+1)m}\!+\!\max ( p^m L_{\max }(M), p^{m\!-\!2t}A D_1\dots D_{n-1}, p^{m\!-\!1}{[L_{\max }(\Omega _X)]_{+}}). \end{aligned}$$

Summing all these inequalities for \(i=1,\ldots , s-1\) we get

$$\begin{aligned}&\mu _{1m}\le \mu _{sm}+ (s-1)\max (p^m L_{\max }(M), p^{m-2t}A D_1\dots D_{n-1},\\&\quad p^{m-1}{[L_{\max }(\Omega _X)]_{+}}). \end{aligned}$$

But by definition we have \(\mu _{1m}=p^mL_{\max }(E)\) and \(\mu _{sm}=p^mL_{\min }(E)\) so dividing by \(p^m\) and passing with \(t\) to infinity gives the second inequality.

In particular, we proved that if \((E, \nabla )\) is semistable then

$$\begin{aligned} \max (L_{\max } (E)-\mu (E), \mu (E)-L_{\min } (E)) \le (r-1)\gamma (M). \end{aligned}$$

In general, applying this inequality to the quotients of the Harder-Narasimhan filtration of \((E, \nabla )\), we obtain the first inequality. \(\square \)

Theorem 6

Let \((E, \nabla )\) be a rank \(r\) torsion free sheaf with a \(d_M\)-connection. Assume that the collection \((D_1,\ldots , D_{n-1})\) consists of ample divisors. If \((E, \nabla )\) is slope \((D_1,\ldots , D_{n-1})\)-semistable then

$$\begin{aligned} D_1^2D_2\dots D_{n-1}\cdot \Delta (E)D_2\dots D_{n-1}+ r^2(r-1)^2\gamma (M) ^2 \ge 0. \end{aligned}$$

In particular, if we have slope semistable \((E, \theta )\in {\mathop {\mathrm{HIG}}}(X/k)\) or \((E, \nabla )\in {\mathop {\mathrm{MIC}}}(X/k)\) then

$$\begin{aligned} D_1^2D_2\dots D_{n-1}\cdot \Delta (E)D_2\dots D_{n-1}+ r^2(r-1)^2 [L_{\max }(\Omega _X)]_+^2 \ge 0. \end{aligned}$$

Proof

If \(E\) is any torsion free sheaf then by [21, Theorem 5.1]

$$\begin{aligned}&D_1^2D_2\dots D_{n-1}\cdot \Delta (E)D_2\dots D_{n-1}+ r^2(L_{\max }(E) -\mu (E)) (\mu (E) \\&\quad -L_{\min } (E) )\ge 0. \end{aligned}$$

Therefore the theorem follows immediately from Lemma 5. \(\square \)

The above theorem plays an important role in establishing Bogomolov type inequalities for semistable Higgs sheaves.

3.2 Bogomolov’s inequality for Higgs sheaves in characteristic zero

The following Bogomolov’s inequality for Higgs bundles was proven in [35, Proposition 3.4] using analytic methods. We give the first algebraic proof of this fact. For simplicity we deal only with one ample polarization although the proof works also for collections of ample polarizations.

Theorem 7

Let \(k\) be a field of characteristic \(0\). Let \(H\) be an ample divisor on \(X\). Then for any slope \(H\)-semistable Higgs sheaf \((E, \theta )\) we have

$$\begin{aligned} \Delta (E) H^{n-2}\ge 0. \end{aligned}$$

Proof

Deforming \((E, \theta )\) to a system of Hodge sheaves (see Lemma 1) we can assume that \((E, \theta )\) is nilpotent.

Now we use a standard spreading out technique, which we recall for the convenience of the reader (cf. proof of [6, Corollaire 2.7]). There exists a subring \(R\subset k\), finitely generated over \({\mathbb Z}\) and a smooth projective scheme \({\fancyscript{X}}\rightarrow S=\mathop {\hbox {Spec }}R\) such that \(X={\fancyscript{X}}\otimes _R k\). We can also assume that \(H\) and \((E, \theta )\) are defined over \(R\) and on \({\fancyscript{X}}\) there exists a relatively ample divisor \({\fancyscript{H}}\) extending \(H\) and an \(S\)-flat family of Higgs sheaves \(({\fancyscript{E}}, \tilde{\theta })\) extending \((E, \theta )\). Further shrinking \(S\), by openness of semistability we can assume that \(({\fancyscript{E}}_s, \tilde{\theta }_s)\) is slope \({\fancyscript{H}}_s\)-semistable for every \(s\in S\). For any closed point of \(S\otimes _{{\mathbb Z}}{\mathbb Q}\) its schematic closure \(T\) in \(S\) is generically étale over \(\mathop {\hbox {Spec }}{\mathbb Z}\). Let us choose a closed point \(s\in T\) such that \(T\) is étale at \(s\) over \({\mathbb Z}\) and the characteristic of the residue field \(k(s)\) is \(\ge r=\mathop {\mathrm{rk}}E\). Then the scheme \({\fancyscript{X}}\otimes _R ({\fancyscript{O}}_{{\fancyscript{X}}, s}/m_s^2)\) is a lifting of \({\fancyscript{X}}_s\) modulo \(p^2\). So by Corollary 1 we can associate to \( ({\fancyscript{E}}_s, \tilde{\theta }_s)\) a slope \({\fancyscript{H}}_s\)-semistable sheaf with integrable connection \((V _s, \nabla _s)\). By Lemma 2 we have \(\Delta (V_s) {\fancyscript{H}}_s^{n-2}=p^2\Delta ({\fancyscript{E}}_s) {\fancyscript{H}}_s^{n-2}\). By Theorem 6 we have

$$\begin{aligned} {\fancyscript{H}}_s^n\cdot \Delta (V_s){\fancyscript{H}}_s^{n-2}+ r^2(r-1)^2[L_{\max }(\Omega _{{\fancyscript{X}}_{\bar{s}}/k(\bar{s})})]_+^2\ge 0, \end{aligned}$$

where \(\bar{s}\) is the geometric point lying over \(s\). But for some \(a\ge 0\) a suitably twisted relative tangent bundle \(T_{{\fancyscript{X}}/S} (a{\fancyscript{H}})\) is relatively globally generated, so we can embed \(\Omega _{{\fancyscript{X}}_{\bar{s}}/k(\bar{s})}\) into a direct sum of some copies of \({\fancyscript{O}}_{{\fancyscript{X}}_{\bar{s}}}(a{\fancyscript{H}}_{\bar{s}})\). This shows that we have \(L_{\max }(\Omega _{{\fancyscript{X}}_{\bar{s}}/k(\bar{s})})\le aH^n\), where \(a\) is independent of the choice of \(s\). So the above inequality implies that

$$\begin{aligned} p^2 H^n\cdot \Delta (E)H^{n-2}+ r^2(r-1)^2(aH^n)^2\ge 0. \end{aligned}$$

Therefore dividing by \(p^2\) and taking sufficiently large \(p\) we get \(\Delta (E) H^{n-2}\ge 0.\) \(\square \)

Remark 1

The above inequality works also for collections of nef divisors \((D_1, \ldots , D_{n-1})\) such that the \(1\)-cycle \(D_1\dots D_{n-1}\) is numerically nontrivial. However, proof of this fact requires different techniques (cf. proof of Theorem 8).

Remark 2

Note that Theorem 7 follows from Theorem 8 by the spreading out technique. We include another proof as it is much simpler and it motivates the more complicated proof of Theorem 8.

3.3 Bogomolov’s inequality for Higgs sheaves in positive characteristic

Assume that \(k\) has characteristic \(p>0\) and that \(X\) can be lifted to \(W_2(k)\).

Theorem 8

For any slope \((D_1,\ldots ,D_{n-1})\)-semistable Higgs sheaf \((E, \theta )\) of rank \(r\le p\) we have

$$\begin{aligned} \Delta (E) D_2\dots D_{n-1}\ge 0. \end{aligned}$$

Proof

By Lemma 1 we can deform \((E, \theta )\) to a system of Hodge sheaves preserving semistability and Chern classes. So we can assume that \((E, \theta )\) is nilpotent of level less than \(r\le p\). In this case we can consider the canonical Higgs–de-Rham sequence of \((E, \theta )\) provided by Theorem 5. By Lemma 2 we know that \(c_j (V_i)=p^jc_j(E_i)\) for \(i=0,\ldots \) On the other hand, it is clear that \(c_j(E_{i+1})=c_j(V_i)\), so we have

$$\begin{aligned} \Delta (E_i)D_2\dots D_{n-1}=p^{2i}\Delta (E) D_2\dots D_{n-1}. \end{aligned}$$

Let us set \(d=D_1^2D_2\dots D_{n-1}\ge 0\).

First, let us consider the case in which all the divisors \(D_1,\ldots , D_{n-1}\) are ample. Then by Theorem 6 we have

$$\begin{aligned} d\cdot \Delta (E_i)D_2\dots D_{n-1}+ r^2(r-1)^2 [L_{\max }(\Omega _X)]_+^2 \ge 0. \end{aligned}$$

So

$$\begin{aligned} \Delta (E)D_2\dots D_{n-1}\ge -\frac{r^2(r-1)^2 [L_{\max }(\Omega _X)]_+^2}{d p^{2i}} \end{aligned}$$

and we get the required inequality passing with \(i\) to infinity. Note that this implies that if \(D_1, \ldots , D_{n-1}\) are ample then

$$\begin{aligned}&D_1^2D_2\dots D_{n-1}\cdot \Delta (E)D_2\dots D_{n-1}+r^2(\mu _{\max }(E,\theta )\\&\quad -\mu (E)) (\mu (E)-\mu _{\min } (E,\theta )) \ge 0 \end{aligned}$$

(see Corollary 3).

Now let us consider the general case. The strategy is the same as that in proof of [21, 3.6 and 3.8] and we just sketch the arguments. First, let us assume that \(d>0\). In this case we fix an ample divisor \(H\) and set \(H_i(t)=D_i+tH\). Then the Harder–Narasimhan filtration of \((E, \theta )\) with respect to \((H_1(t),\ldots ,H_{n-1}(t))\) is independent of \(t\) for small positive \(t\) (this is a not-trivial assertion; see [21, 3.6] for proof in the case of usual sheaves; the proof in our case is the same).

Let \(0=F_0\subset F_1\subset \dots \subset F_{m}=(E, \theta )\) be the corresponding filtration. Since \((E, \theta )\) is slope \((D_1,\ldots ,D_{n-1})\)-semistable, we have

$$\begin{aligned} \lim _{t\rightarrow 0}\mu _{\max ,H_1(t),\ldots , H_{n-1}(t)}(E, \theta )= \lim _{t\rightarrow 0}\mu _{H_1(t),\ldots , H_{n-1}(t)}(F_1) =\mu _{D_1,\ldots , D_{n-1}}(E) \end{aligned}$$

and

$$\begin{aligned} \lim _{t\rightarrow 0}\mu _{\min ,H_1(t),\ldots , H_{n-1}(t)}(E, \theta )=\mu _{D_1,\ldots , D_{n-1}}(E). \end{aligned}$$

But we know that

$$\begin{aligned}&H_1(t)^2H_2(t)\dots H_{n-1}(t)\cdot \Delta (E)H_2(t)\dots H_{n-1}(t)\\&\quad +r^2(\mu _{\max ,H_1(t),\ldots , H_{n-1}(t)}(E, \theta ) \! -\!\mu _{H_1(t),\ldots , H_{n-1}(t)}(E)) \cdot (\mu _{H_1(t),\ldots , H_{n-1}(t)}(E)\\&\quad -\mu _{\min ,H_1(t),\ldots , H_{n-1}(t)}(E, \theta ))\ge 0, \end{aligned}$$

so passing with \(t\) to \(0\) yields the required inequality.

Now consider the remaining case \(d=0\) and assume that \(\Delta (E) D_2\dots D_{n-1}<0\). Let us choose an ample divisor \(H_1\) such that \(H_1^2D_2\dots D_{n-1}> 0\). Then \((E,\theta )\) is not slope \((H_1,D_2,\ldots ,D_{n-1})\)-semistable and as in [21, 3.7] (see also [12, Theorem 7.3.3]) one can show that there exists a saturated Higgs subsheaf \((E', \theta ')\subset (E, \theta )\) such that \((c_1(E')/r'-c_1(E)/r)D_1'D_2...D_{n-1}>0\) for every nef divisor \(D_1'\) such that \(D_1'D_2...D_{n-1}\) is numerically nontrivial. In particular, for \(D_1'=D_1\) we get a contradiction with our assumption on \((D_1,\ldots ,D_{n-1})\)-semistability of \((E,\theta )\). \(\square \)

The following corollary is a generalization of Theorem 1:

Corollary 2

For any \((D_1,\ldots ,D_{n-1})\)-semistable sheaf \(E\) of rank \(r\le p\) we have

$$\begin{aligned} \Delta (E) D_2\dots D_{n-1}\ge 0. \end{aligned}$$

The above corollary was known only in the surface case for rank \(2\) vector bundles in characteristic \(2\) (see [34, Corollary 11]).

Corollary 3

If \((E,\theta )\) is a torsion free Higgs sheaf of rank \(r\le p\) then

$$\begin{aligned}&D_1^2D_2\dots D_{n-1}\cdot \Delta (E)D_2\dots D_{n-1}+r^2(\mu _{\max }(E,\theta )\\&\quad -\mu (E)) (\mu (E)-\mu _{\min } (E,\theta )) \ge 0. \end{aligned}$$

Proof

Let \(0=F_0\subset F_1\subset \dots \subset F_m=(E, \theta )\) be the Harder–Narasimhan filtration (in the category of Higgs sheaves). Set \(F^i=F_i/F_{i-1}, r_i=\mathop {\mathrm{rk}}F^i, \mu _i =\mu (F^i)\). We can assume that \(d=D_1^2D_2\dots D_{n-1}>0\). Then by the Hodge index theorem we have

$$\begin{aligned} \frac{\Delta (E)D_2\dots D_{n-1}}{ r}&= \sum \frac{\Delta (F^i)D_2\dots D_{n-1}}{ r_i}\\&-\frac{1}{r}\sum _{i<j} r_ir_j \left( \frac{c_1F^i}{r_i}\!-\!\frac{c_1F^j}{ r_j}\right) ^2 D_2\dots D_{n-1}\\&\ge \sum \frac{\Delta (F^i)D_2\dots D_{n-1}}{ r_i}- \frac{1}{ r d} \sum _{i<j}r_ir_j(\mu _i-\mu _j)^2.\\ \end{aligned}$$

Since \({\Delta (F^i)D_2\dots D_{n-1}}\ge 0\) by Theorem 8, the required inequality follows from the following easy inequality (see [21, Lemma 1.4])

$$\begin{aligned} \sum _{i<j}r_ir_j(\mu _i-\mu _j)^2 \le r^2 (\mu _1-\mu (E)(\mu (E)-\mu _m). \end{aligned}$$

\(\square \)

4 Restriction theorems

In this section we assume that \(k\) has characteristic \(p>0\) and that \(X\) can be lifted to \(W_2(k)\). We recall how Bogomolov’s inequality implies effective restriction theorems. Originally, this method was invented by F. Bogomolov with further improvements by the author (see [21, Theorem 5.2]). We give all the details as we feel that this approach is not as known as it should be and some people still reprove much weaker restriction theorems in a more complicated way. Note that in the characteristic zero case a non-effective version of restriction theorem (the so called Mehta–Ramanathan restriction theorem) together with Bogomolov’s inequality in the surface case implies Bogomolov’s inequality in higher dimensions. This is a classical approach and it was also used by Mochizuki in [28, 29] to deduce Bogomolov’s inequality for semistable (logarithmic) Higgs sheaves in higher dimensions. Our approach is exactly opposite: first we prove Bogomolov’s inequality in all dimensions and then we deduce (strong) restriction theorems.

4.1 Restriction theorem for sheaves with operators

First, we repeat arguments from proof of [21, Theorem 5.2] to prove a strong restriction theorem for sheaves with operators:

Theorem 9

Let \((E, \theta )\) be a torsion free Higgs sheaf of rank \(r\le p\). Assume that \((E, \theta )\) is slope \((D_1,\ldots ,D_{n-1})\)-stable and \(d=D_{1}^{2}D_{2}\ldots D_{n-1} > 0\). Let \(D\in |mD_1|\) be a normal divisor such that \(E|_D\) has no torsion. If

$$\begin{aligned} m>\frac{r-1}{ r}\Delta (E)D_2\dots D_{n-1} +\frac{1}{dr(r-1)} \end{aligned}$$

then \((E|_D, \theta |_D)\) is a slope \((D_2|_D,\ldots ,D_{n-1}|_D)\)-stable sheaf with an \(\Omega _X|_D\)-valued operator.

Proof

Let us assume that \((E|_D, \theta |_D)\) is not slope \((D_2|_D,\ldots ,D_{n-1}|_D)\)-stable. Let \((T, \theta _T: T\rightarrow T\otimes \Omega _X|_D)\) be the minimal destabilizing quotient of \((E|_D, \theta |_D)\). Let us set \(\rho =\mathop {\mathrm{rk}}T\) and let \(E'\) be the kernel of the composition \(E\rightarrow E|_D\rightarrow T\). Note that \(E'\) has a natural structure of a Higgs subsheaf of \((E, \theta )\) with Higgs field \(\theta '=\theta |_{E'}\). Since \((T, \theta _T)\) destabilizes \((E|_D, \theta |_D)\) we have

$$\begin{aligned}&\Delta (E')D_2\dots D_{n-1} =\Delta (E)D_2\dots D_{n-1}-\rho (r-\rho ) D^2D_2\dots D_{n-1}\\&\quad + 2(r c_1(T)\!-\!\rho c_1(E|_D)) D_2\dots D_{n\!-\!1} \!\le \Delta (E)D_2\dots D_{n-1} \!-\!\rho (r-\rho ) m^2d.\\ \end{aligned}$$

Since \((E', \theta ')\subset (E, \theta )\) and \((E,\theta )\) is slope \((D_1,\ldots ,D_{n-1})\)-stable we have

$$\begin{aligned} \mu _{\max } (E',\theta ')-\mu (E')&= \frac{\rho }{r} DD_1 \dots D_{n-1}+\mu _{\max } (E',\theta ')-\mu (E)\\&\le \frac{\rho }{r}md -\frac{1}{r(r-1)} . \end{aligned}$$

Similarly, since \((E(-D), \theta (-D))\subset (E', \theta ')\) we have

$$\begin{aligned} \mu (E')-\mu _{\min } (E', \theta ')&= \frac{r-\rho }{r}DD_1 \dots D_{n-1}+\mu (E(-D))- \mu _{\min } (E', \theta ')\\&\le \frac{r-\rho }{ r}md -\frac{1}{r(r-1)}.\\ \end{aligned}$$

Hence by Corollary 3 we obtain

$$\begin{aligned} 0&\le d\cdot \Delta (E')D_2\dots D_{n-1} \!+\!r^2(\mu _{\max } (E', \theta ')\! -\!\mu (E'))(\mu (E')\!-\!\mu _{\min }(E',\theta '))\\&\le d\cdot \Delta (E)D_2\dots D_{n-1}-\rho (r-\rho ) m^2d^2 +\left( \rho md -\frac{1}{r-1}\right) \\&\times \left( (r-\rho ) md -\frac{1}{r-1}\right) .\\ \end{aligned}$$

Therefore

$$\begin{aligned} m \frac{dr}{ r-1} \le d\cdot \Delta (E)D_2\dots D_{n-1}+\frac{1}{(r-1)^2}, \end{aligned}$$

which contradicts our assumption on \(m\). \(\square \)

4.2 Restriction theorem for Higgs sheaves

Note that up to now even in characteristic zero there were no effective vanishing theorems for Higgs semistabilityFootnote 1. Here we use Simpson’s arguments from proof of [36, Lemma 3.7] to prove the first such result:

Theorem 10

Let \((E, \theta )\) be a torsion free Higgs sheaf of rank \(r\le p\). Let \(m_0\) be a non-negative integer such that \(T_X(m_0D_1)\) is globally generated. Assume that \((E, \theta )\) is slope \((D_1,\ldots ,D_{n-1})\)-stable and \(d=D_{1}^{2}D_{2}\ldots D_{n-1} > 0\). Let \(D\in |mD_1|\) be a normal divisor such that \(E|_D\) has no torsion. Let us take an integer

$$\begin{aligned} m>\frac{r-1}{ r}\Delta (E)D_2\dots D_{n-1} +\frac{1}{dr(r-1)}. \end{aligned}$$

Moreover, if \(r>2\) let us also assume that \(m>2(r-1)^2m_0\). Then the Higgs sheaf \((E|_D, \theta _D)\), induced from \((E|_D,\theta |_D)\) via composition \(E|_D\rightarrow E|_D\otimes \Omega _X|_D\rightarrow E\otimes \Omega _D\), is slope \((D_2|_D,\ldots ,D_{n-1}|_D)\)-stable.

Proof

Let us assume that \((E|_D, \theta _D)\) is not slope \((D_2|_D,\ldots ,D_{n-1}|_D)\)-stable and let \((E', \theta ')\subset (E|_D, \theta _D)\) be the maximal destabilizing Higgs subsheaf. By Theorem 9 \((E|_D,\theta |_D)\) is slope \((D_2|_D,\ldots ,D_{n-1}|_D)\)-stable. Therefore \(E'\) is not preserved by \(\theta |_D\) and we have a non-zero map \(E'\rightarrow E''\otimes \Omega _X|_D\), where \(E''=E/E'\). This induces a non-zero map \(\varphi : E'\rightarrow E''(-D)\). In particular, if \(r=2\) then \(E'\) and \(E''\) have rank one and we have a contradiction with \(\mu (E')\ge \mu (E'')>\mu (E''(-D))\).

If \(r\ge 3\) we need more subtle arguments. Let \(G=\ker \varphi \) and let \(M\) be the preimage of \((E'/G)(D)\subset E''\) under the canonical projection \(E|_D\rightarrow E''\). Then

$$\begin{aligned} \deg M\!=\!\deg E'\!+\!\deg (E'/G)(D)\!=\! 2 \deg E' \!-\!\deg G\!+\!\mathop {\mathrm{rk}}(E'/G) \, \deg {\fancyscript{O}}_D(D). \end{aligned}$$

Therefore

$$\begin{aligned} \deg M+\deg G= 2 \deg E' +\mathop {\mathrm{rk}}(E'/G) \, \deg {\fancyscript{O}}_D(D)\ge 2r'\mu (E)+m^2d. \end{aligned}$$

But \(M\) and \(G\) are subsheaves of \(E|_D\), so

$$\begin{aligned} \deg M+\deg G\le (\mathop {\mathrm{rk}}M+\mathop {\mathrm{rk}}G)\mu _{\max } (E|_D)=2 r'\mu _{\max } (E|_D). \end{aligned}$$

Since \((E|_D,\theta |_D)\) is slope \((D_2|_D,\ldots ,D_{n-1}|_D)\)-stable, by Lemma 5 we have

$$\begin{aligned} \mu _{\max }(E|_D)-\mu _{\min }(E|_D)\le (r-1)[L _{\max }(\Omega _X|_D)]_{+}. \end{aligned}$$

But \(T_X(m_0D_1)\) is globally generated, so \(\Omega _X|_D\subset {\fancyscript{O}}_D (m_0D_1)^{\oplus N} \) for some \(N\). This shows that \(L _{\max }(\Omega _X|_D)\le m_0md\) and hence

$$\begin{aligned} 2r'\mu (E)+m^2d\le \deg M+\deg G \le 2r' (\mu (E)+(r-1)m_0md). \end{aligned}$$

This contradicts our assumption that \(m>2(r-1)^2m_0\). \(\square \)

Let us remark that in both Theorems 9 and 10 we do not assume that the divisor \(D\), to which we restrict the Higgs bundle, lifts to \(W_2(k)\). Moreover, both theorems (and their proofs) work also in the characteristic zero case.

5 Semistable Higgs sheaves with vanishing Chern classes

We keep the same notation and assumptions as in the previous section (unless otherwise explicitly stated).

In this section we study semistable Higgs sheaves with vanishing Chern classes. In the characteristic zero case such sheaves correspond to representations of the topological fundamental group. Here they also exhibit analogous interesting properties. For example, we prove that they are locally free and satisfy a strong restriction theorem. It is also interesting to note that one can define on them certain discrete dynamical systems (even in characteristic zero). This fact seems to be unnoticed in the characteristic zero case and the corresponding systems were not yet studied. In positive characteristic we make the first steps in this direction.

5.1 Local freeness

In this subsection we prove that semistable Higgs sheaves with vanishing Chern classes are locally free. This is a positive characteristic version of [36, Theorem 2] (see also [23, Theorem 4.1] for another version for strongly semistable sheaves). This fact can be also considered as a vast generalization of [18, Corollary 3.11], which was proven by a completely different method using Faltings’ [9, Theorem 2.1]. Actually, [18, Corollary 3.11] implies that preperiodic Higgs sheaves (see Sect. 5.3) are locally free. Indeed, by [18, Corollary 3.11] there exists \(i\) such that \(E_i\) in the Higgs–de Rham sequence of preperiodic \((E, \theta )\) is locally free. So by Lemma 3 \(E\) is locally free. In particular, Theorem 5 implies Theorem 11 for an algebraic closure of a finite field.

This theorem has many interesting applications. In particular, it implies that moduli spaces of semistable Higgs bundles with vanishing Chern classes are projective. This is important as it is much easier to deal with locally free sheaves than torsion free ones, e.g., if one needs to study moduli spaces of principal Higgs bundles. Another corollary says that the category of slope semistable Higgs bundles of rank \(r\le p\) with vanishing Chern classes behaves similarly to the category of representations of the fundamental group (cf. Corollary 5).

Theorem 11

Let \(H\) be an ample divisor on \(X\) and let \((E, \theta )\) be a rank \(r\le p\) slope \(H\)-semistable Higgs sheaf with \(\mathop {\mathrm{ch}}_1(E)H^{n-1}=0\) and \(\mathop {\mathrm{ch}}_2(E)H^{n-2}=0\). Assume that either \(E\) is reflexive or the normalized Hilbert polynomial of \(E\) is the same as that of \({\fancyscript{O}}_X\). Then \(E\) has a filtration whose quotients are locally free slope \(H\)-stable Higgs sheaves with vanishing Chern classes.

Note: We say that \(E\) has vanishing Chern classes if the Chern classes \(c_i(E)\) vanish in \(H^{2i}_{\mathop {{\acute{\mathrm{e}}}\mathrm{t}}} (X, {\mathbb Q}_l(i))\) for \(i>0\) and any \(l\ne p\). It also implies that numerically Chern classes are trivial (in particular, \(c_i(E)H^{n-i}=0\)).

Proof

In the curve case the theorem follows from existence of a Jordan–Hölder filtration. In general, the proof is by induction on the dimension \(n\) starting with \(n=2\).

If \(X\) is a surface then the Higgs field \(\theta \) extends to a Higgs field \(\tilde{\theta }\) on the reflexivization \(E^{**}\) of \(E\). Note that \(E^{**}\) is locally free and \((E^{**}, \tilde{\theta })\) is slope \(H\)-semistable. Hence \(\Delta (E^{**})\ge 0\) by Theorem 8. By the Hodge index theorem we have

$$\begin{aligned} 0=2r \, {\mathop {\mathrm{ch}}} _2 (E)=c_1(E)^2- \Delta (E)\le c_1(E)^2 \le \frac{(c_1(E)H)^2}{H^2}=0, \end{aligned}$$

so \( c_1(E)^2=\Delta (E)=0\). But \(\Delta (E^{**})\le \Delta (E)\) so also \(\Delta (E^{**})=0\). Since \(E^{**}/E\) is supported on a finite set of points and its second Chern class vanishes, we must have \(E=E^{**}\). Hence the required assertion can be obtained by taking a Jordan–Hölder filtration of \((E, \theta )\).

Now fix \(n\ge 3\) and assume that the theorem holds for varieties of dimension less than \(n\). First, let us assume that \(E\) is reflexive. By the same argument as above, we see that \(c_1(E)H^{n-1}=0\) and \(\Delta (E)H^{n-2}=0\). Let us consider the canonical Higgs–de Rham sequence of \((E, \theta )\) provided by Theorem 5. By [21, Theorem 4.4] the family of Higgs sheaves \((E_i, \theta _i)\) is bounded. Therefore there are only finitely many possibilities for Chern classes \(c_j(E_i)=p^{ij} c_j(E)\). This implies that the Chern classes of \(E\) must vanish. In particular, for any smooth divisor \(D\) on \(X\), the reduced Hilbert polynomial of \(E_D\) is equal to the Hilbert polynomial of \({\fancyscript{O}}_D\). Moreover, \(E_D\) is torsion free (see, e.g., [12, Corollary 1.1.14]).

Let us assume that \((E, \theta )\) is slope \(H\)-stable. Without loss of generality we can assume that \(H^1(X, {\fancyscript{O}}_X(mH))=0\) for \(m\ge 1\) and \({\fancyscript{O}}_X(H)\) lifts to a line bundle \({\fancyscript{O}}_{\tilde{X}} (\tilde{H})\) on a lifting \(\tilde{X}\) of \(X\) to \(W_2(k)\). Then a short exact sequence

$$\begin{aligned} 0\rightarrow p{\fancyscript{O}}_{\tilde{X}}\rightarrow {\fancyscript{O}}_{\tilde{X}}\rightarrow {\fancyscript{O}}_X \rightarrow 0 \end{aligned}$$

implies that \(H^0({\fancyscript{O}}_{\tilde{X}} (m\tilde{H}))\rightarrow H^0({\fancyscript{O}}_{ X} (mH))\) is surjective for all \(m\ge 1\) (note that \(p{\fancyscript{O}}_{\tilde{X}}\simeq {\fancyscript{O}}_X\) as \({\fancyscript{O}}_X\)-modules). By Theorem 10 the restriction \(E_D\) is also slope \(H\)-stable for any smooth divisor \(D\in |mH|\), \(m\ge 1\). Since by the above \(D\) can be lifted to \(W_2(k)\), the restriction \(E_D\) is locally free by the induction assumption. It follows that \(E\) is locally free along every smooth divisor \(D\in |mH|\). Since such divisors cover \(X\) (when we vary \(m\)) it follows that \(E\) is locally free.

The general case can be proven by induction on the rank \(r\) of \(E\). Let us take any saturated proper Higgs subsheaf \((E', \theta ')\subset (E, \theta )\) which is slope \(H\)-stable with \(c_1(E')H^{n-1}=0\). Let us set \((E'', \theta '')=(E, \theta )/ (E', \theta ')\). By the Hodge index theorem and Theorem 8 we have

$$\begin{aligned} 0&= \frac{\Delta (E)H^{n\!-\!2}}{r}\!=\!\frac{\Delta (E')H^{n\!-\!2}}{r'}\!+\!\frac{\Delta (E'')H^{n-2}}{r''} \!-\! \frac{r'r''}{r} \!\!\left( \frac{c_1E'}{r'}\!-\!\frac{c_1E''}{r''}\right) ^2 H^{n-2}\\&\quad \quad \ge \frac{\Delta (E')H^{n-2}}{r'}+\frac{\Delta (E'')H^{n-2}}{r''}\ge 0.\\ \end{aligned}$$

So \(\Delta (E')H^{n-2}=0\) and \(\Delta (E'')H^{n-2}=0\) and hence

$$\begin{aligned}&0={\mathop {\mathrm{ch}}} _2 (E)H^{n-2}= {\mathop {\mathrm{ch}}} _2 (E')H^{n-2}+{\mathop {\mathrm{ch}}} _2 (E'')H^{n-2}\\&\quad =c_1(E')^2H^{n-2}+ c_1(E'')^2H^{n-2}. \end{aligned}$$

By the Hodge index theorem

$$\begin{aligned} c_1(E')^2H^{n-2}\le \frac{(c_1(E')H^{n-1})^2}{H^n}=0 \end{aligned}$$

and

$$\begin{aligned} c_1(E'')^2H^{n-2}\le \frac{(c_1(E'')H^{n-1})^2}{H^n}=0. \end{aligned}$$

It follows that \(\mathop {\mathrm{ch}}_1(E')H^{n-1}=0\) and \(\mathop {\mathrm{ch}}_2(E')H^{n-2}=0\). Since \(E'\) is reflexive, by the induction assumption it is locally free with vanishing Chern classes. But then \(E''\) also has vanishing Chern classes. Therefore the reduced Hilbert polynomial of \(E''\) is equal to the Hilbert polynomial of \({\fancyscript{O}}_X\). Since \((E'', \theta '')\) is slope \(H\)-semistable, by the induction assumption it has a filtration whose quotients are locally free slope \(H\)-stable Higgs sheaves with vanishing Chern classes.

Now we assume that the reduced Hilbert polynomial of \(E\) is equal to the Hilbert polynomial of \({\fancyscript{O}}_X\) without assuming that \(E\) is reflexive. By what we already proved the reflexivization \(E^{**}\) of \(E\) is locally free with vanishing Chern classes. Therefore the reduced Hilbert polynomial of \(E^{**}\) is also equal to the Hilbert polynomial of \({\fancyscript{O}}_X\). This implies that the Hilbert polynomial of the quotient \(E^{**}/E\) is trivial, which proves the required assertion. \(\square \)

Corollary 4

Let \(H\) be an ample divisor on \(X\) and let \((E, \theta )\) be a rank \(r\le p\) slope \(H\)-semistable Higgs sheaf with vanishing Chern classes. Then \((E, \theta )\) is locally free and Gieseker semistable (and hence also slope semistable) with respect to every ample polarization.

Proof

Since by assumption the normalized Hilbert polynomial of \(E\) is the same as that of \({\fancyscript{O}}_X\), Theorem 11 implies that \(E\) is locally free. Let us recall that the family of Higgs sheaves \(\{(E_i, \theta _i)\}_{i\in {\mathbb Z}_{\ge 0}}\) in the canonical Higgs–de Rham sequence of \((E, \theta )\) is bounded (see proof of Theorem 11).

Assume that \((E, \theta )\) is not slope \(A\)-semistable for some ample polarization \(A\) and consider the maximal \(A\)-destabilizing Higgs subsheaf \((E',\theta ')\subset (E,\theta )\). Then by Lemma 2 the slopes (counted with respect to \(A\)) of images of \((E',\theta ')\) under iterations of \({\mathop {\mathrm{Gr}}}_{N_S}\circ C^{-1}\) grow to infinity. But this contradicts boundedness of the family \(\{(E_i, \theta _i)\}_{i\in {\mathbb Z}_{\ge 0}}\).

Hence \((E, \theta )\) is slope \(A\)-semistable. By Theorem 11 it has a filtration whose quotients are locally free slope \(A\)-stable Higgs sheaves with vanishing Chern classes, so it is also Gieseker \(A\)-semistable. \(\square \)

Corollary 5

Let \({\fancyscript{C}}\) be the full subcategory of the category of coherent Higgs sheaves on \(X\), whose objects are slope semistable Higgs bundles of rank \(r\le p\) with vanishing Chern classes. Then \({\fancyscript{C}}\) contains all kernels, images and cokernels.

Proof

Let \((E_1, \theta _1)\) and \((E_2, \theta _2)\) be two objects of \({\fancyscript{C}}\). By Theorem 11 both \((E_1, \theta _1)\) and \((E_2, \theta _2)\) have filtrations by slope stable objects of \({\fancyscript{C}}\). Since the only maps of Higgs sheaves between such slope stable objects are either zero or isomorphisms, it is easy to see that the kernel, image and cokernel of any morphism of Higgs sheaves \((E_1, \theta _1)\rightarrow (E_2, \theta _2)\) are objects of \({\fancyscript{C}}\).

Note that Theorem 11 also implies [36, Theorem 2] using a standard spreading out argument (or following proof of Theorem 11). This gives the first algebraic proof of Simpson’s result (original Simpson’s arguments use Lefschetz’s hyperplane theorem for the topological fundamental group).

Corollary 6

Let \(X\) be a smooth projective variety defined over a field of characteristic zero. Let \(H\) be an ample divisor on \(X\) and let \((E, \theta )\) be a slope \(H\)-semistable Higgs sheaf with \(\mathop {\mathrm{ch}}_1(E)H^{n-1}=0\) and \(\mathop {\mathrm{ch}}_2(E)H^{n-2}=0\). Assume that either \(E\) is reflexive or the normalized Hilbert polynomial of \(E\) is the same as that of \({\fancyscript{O}}_X\). Then \(E\) has a filtration whose quotients are locally free slope \(H\)-stable Higgs sheaves with vanishing Chern classes.

5.2 Bruzzo–Hernández Ruipérez’ strong restriction theorem

It is well known that on a smooth complex variety, a slope semistable locally free sheaf with vanishing Chern classes is numerically flat, i.e., its restriction to every curve is nef of degree \(0\). A similar theorem for Higgs sheaves was proven by Bruzzo and Hernández Ruipérez in [5]. The known proofs of these theorems use the fact that a vector bundle of degree \(0\) on a smooth complex projective curve is semistable if and only if it is nef. This is no longer true in positive characteristic as nefness characterizes strong semistability. However, we can prove that the same restriction theorem holds also in positive characteristic. Note that this theorem immediately implies the characteristic zero version by standard reduction to positive characteristic (cf. proof of Theorem 7).

Theorem 12

Let \(E\) be a locally free \({\fancyscript{O}}_X\)-module of rank \(r\le p=\hbox { char } k\) with vanishing Chern classes. Assume that a Higgs sheaf \((E, \theta )\) is slope \(H\)-semistable for some ample polarization \(H\). Let \(\nu :\tilde{C}\rightarrow C\subset X\) be a normalization of some projective curve. Then the induced Higgs bundle \((\nu ^*E, \theta _{\tilde{C}})\) is (slope) semistable.

Proof

Let us consider the canonical Higgs–de Rham sequence of \((E, \theta )\) from Theorem 5. By Theorem 11 all the involved sheaves \(V_i\) and \(E_i\) are locally free and have vanishing Chern classes (note that we need this theorem even though we assume that \(E\) is locally free with vanishing Chern classes!). Therefore by functoriality of the Cartier transform we have the induced Higgs–de Rham sequence for \((\nu ^*E, \theta _{\tilde{C}})\):

Since the Higgs sheaves \(\{(E_i, \theta _i)\} _{i\ge 0}\) are all locally free with vanishing Chern classes and they are all slope \(H\)-semistable, they form a bounded set of sheaves. Therefore \(\{\nu ^* E_i\} _{i \ge 0}\) also forms a bounded set of sheaves on \(\tilde{C}\). Assume that \((\nu ^*E, \theta _{\tilde{C}})\) is not semistable and let \((E', \theta ')\) denote its maximal destabilizing Higgs subsheaf. In particular, \(\mu (E')>\mu (\nu ^*E)=0\). Then the filtration \(\nu ^*S\) induces on \(C^{-1}(E', \theta ')\subset (\nu ^*V_0, \nabla _{\tilde{C}, 0})\) a Griffiths transverse filtration such that the associated Higgs sheaf is a Higgs subsheaf of \((\nu ^*E_1, \theta _{\tilde{C}, 1})\) of slope \(p\mu (E')\). Analogously, \(\nu ^*E_i\) contains a subsheaf of slope \(p^i \mu (E')\), which is impossible for large \(i\). \(\square \)

5.3 Discrete dynamical systems on Higgs bundles

Let \((E, \theta )\) be a rank \(r\le p\) torsion free Higgs sheaf with nilpotent Higgs field.

Definition 2

We say that \((E, \theta )\) is periodic if there exists some positive \(i\) such that in the canonical Higgs–de Rham sequence of \((E,\theta )\) we have \((E_i, \theta _i)\simeq (E, \theta )\). We say that \((E, \theta )\) is preperiodic if there exists some \(i\) such that \((E_i, \theta _i)\) is periodic.

The following proposition strengthens (and corrects) [18, Theorem 2.5] and [18, Corollary 3.11]:

Proposition 1

  1. 1.

    If \((E, \theta )\) is preperiodic then it is locally free with vanishing Chern classes. Moreover, \((E, \theta )\) is slope semistable with respect to every ample polarization.

  2. 2.

    If \(k=\bar{\mathbb F}_p\) and \((E, \theta )\) is slope semistable with vanishing Chern classes then it is preperiodic.

Proof

If \((E, \theta )\) is preperiodic then by Lemma 2 its Chern classes vanish. Then Theorem 11 implies that \(E\) is locally free (for another argument in case \(k=\bar{\mathbb F}_p\) see the beginning of Sect. 5.1). Slope semistability can be proven in the same way as Corollary 4.

The second part of the proposition follows from Theorem 5 and [18, Theorem 2.5]. For the convenience of the reader we repeat a simplified version of proof of [18, Theorem 2.5]. Let us consider the canonical Higgs–de Rham sequence of \((E,\theta )\). Note that \({\mathop {\mathrm{Gr}}}_{S}\circ C^{-1}\) does not enlarge the field of definition of \((E_i, \theta _i)\) (this requires the fact that Simpson’s filtration is preserved by the action of the Galois group). So we can assume that \(X\) and all the bundles \((E_i, \theta _i)\) are defined over the same finite field. But the Higgs bundles \(\{(E_i, \theta _i)\} _{i\in {\mathbb Z}_{\ge 0}}\) are slope semistable with vanishing Chern classes so by Lemma 5 and [21, Theorem 4.4] they form a bounded family. Since any scheme of finite type over a finite field has only finitely many points, there exists a repetition \((E_i, \theta _i)\simeq (E_j, \theta _j)\) with \(j>i\). This gives periodicity of \((E_i, \theta _i)\).   \(\square \)

In general, a slope semistable Higgs bundle with vanishing Chern classes do not need to be preperiodic. For example any non-torsion line bundle \(L\in {\mathop {{\hbox {Pic }}}}^0(X)\) with trivial Higgs field is not preperiodic. The second part of the above corollary is analogous to the fact that over \(\bar{\mathbb F}_p\) strongly semistable vector bundles with vanishing Chern classes become étale trivializable after taking some Frobenius pull-back (see [4, Proposition 2.5]). This last fact can be also interpreted as saying that for \(X/\bar{\mathbb F}_p\) Nori’s fundamental group scheme is equal to the S-fundamental group scheme.

Let \(M^0_\mathrm{Hodge}(X,r)\) be the set of isomorphism classes of slope semistable systems of Hodge bundles of rank \(r\le p\) and with vanishing Chern classes. This set does not depend on the choice of ample polarization and it forms a bounded family. We can define the function \(\Phi :M^0_\mathrm{Hodge}(X,r)\rightarrow M^0_\mathrm{Hodge}(X,r)\) which maps \((E, \theta )\) to the associated graded of Simpson’s filtration of \(C^{-1}(E, \theta )\). This defines a discrete dynamical system. Note that \((E,\theta )\) is periodic (preperiodic) if and only if the point of \(M^0_\mathrm{Hodge}(X,r)\) corresponding to \((E,\theta )\) is periodic (respectively, preperiodic) for the above defined dynamical system.

In the characteristic zero case one can also construct an analogous discrete dynamical system by taking the associated graded of Simpson’s filtration of the flat bundle corresponding to a system of Hodge bundles via the Kobayashi–Hitchin correspondence. However, in the characteristic zero case this system is less interesting as it carries much less information.

6 The Bogomolov–Miyaoka–Yau inequality in positive characteristic

Let \(X\) be a smooth projective surface over an algebraically closed field \(k\) of characteristic \(p\). Let us fix an ample divisor \(H\) on \(X\). Let \(N^1(X)= \mathop {\mathrm{NS}}(X)\otimes {\mathbb R}\), where \(\mathop {\mathrm{NS}}(X)\) is the Néron–Severi group of \(X\). By the Hodge index theorem the intersection pairing on \(N^1(X)\) has signature \((1, \rho -1)\), where \(\rho =\dim _{{\mathbb R}}N^1(X)\). Therefore the open cone \(D^2>0\) has two components separated by the hyperplane \(DH=0\). Let \(C^{+}\) denote the positive component of this cone:

$$\begin{aligned} C^+:=\{ D\in N^1 (X): D^2>0 \hbox { and } DH>0 \}. \end{aligned}$$

The following proposition generalizes to arbitrary characteristic [34, Theorem 9], proven by Shepherd-Barron in characteristic \(2\):

Proposition 2

Let \(X\) be a smooth projective surface that can be lifted to \(W_2(k)\). Then \(\Omega ^1_X\) does not contain a line bundle \(M\) with \(M\in C^{+}\).

Proof

Assume that \(M\in C^{+}\) and consider a system of Hodge bundles \((E:=E^1\oplus E^0, \theta )\) with \(E^1=M, E^0={\fancyscript{O}}_X\) and \(\theta : E^1=M\rightarrow E^0\otimes \Omega _X=\Omega _X\) given by the inclusion. Note that \((E, \theta )\) is slope \(H\)-stable since the only rank \(1\) Hodge subsystem of \((E, \theta )\) is of the form \(({\fancyscript{O}}_X, 0)\). Therefore by Theorem 8 we have

$$\begin{aligned} 0=4c_2(E)\ge c_1^2(E)=M^2, \end{aligned}$$

a contradiction. \(\square \)

In [3, Theorem 4] Bogomolov proved that the cotangent bundle of a smooth complex projective surface does not contain big line bundles. Below we present an example showing that this is no longer true for smooth projective surfaces liftable to the Witt ring. So Proposition 2 is the best result one can expect in this situation.

Example 1

Let us recall the following example due to Ekedahl et al. (see [8, p. 23, Remark ii]). Let \(X\) be a smooth projective Shimura surface, which is the moduli space of abelian 4-folds with multiplication by a totally indefinite quaternion algebra over a real quadratic field \(F\). \(X\) can be also obtained as a quotient of \({\mathbb H}\times {\mathbb H}\) by an irreducible arithmetic group (see [24, Example 4.4 and Example 5.6] for details). So \(\Omega _X= L\oplus M\) where \(L\) and \(M\) are strictly nef line bundles with \(L^2=M^2=0\) and \(LM>0\). \(X\) is defined over \(F\) and it has a smooth projective model \({\fancyscript{X}}\rightarrow \mathop {\hbox {Spec }}{\fancyscript{O}}_{F, S}\), where \({\fancyscript{O}}_{F, S}\) is a localization of the ring of integers of \(F\) along a finite set of places \(S\). For all the unramified prime ideals \({\mathfrak p}\subset {\fancyscript{O}}_{F, S}\) we can base change \({\fancyscript{X}}\) to the Witt ring \(W(k)\) of the algebraic closure \(k\) of \({\fancyscript{O}}_{F,S}/{\mathfrak p}\). For all the rational primes \(p\) that are inert in \(F\) the reduction \(L_p\subset \Omega _{X_p}\) of \(L\subset \Omega _X\) is a big line bundle whereas \(X_p\) lifts to the Witt ring of the algebraic closure of the residue field.

Corollary 7

Let \(X\) be a smooth projective surface of non-negative Kodaira dimension. Assume that \(X\) can be lifted to \(W_2(k)\). Then \(4c_2(X)\ge c_1^2(X)\). Moreover, if \(4c_2(X)=c_1^2(X)\) then one of the following conditions is satisfied:

  1. 1.

    \(K_X^2=0\),

  2. 2.

    \(K_X^2>0\) and \(p=2\). In this case \(\Omega _X\) is slope stable with respect to every ample polarization.

Proof

Let us note that if \(X\rightarrow Y\) is a blow up at a smooth point and \(X\) can be lifted to \(W_2(k)\) then also \(Y\) can be lifted to \(W_2(k)\). Therefore, since blowing up a smooth surface increases \(c_2\) and decreases \(c_1^2\), we can assume that \(X\) is minimal. In this case \(K_X\) is nef and in particular \(K_X\in \overline{C^+}\). If \(\Omega _X\) is slope stable with respect to every ample polarization then the assertions follow from Corollary 2. So we can assume that \(\Omega _X\) is not slope \(H\)-stable. Let \(M\) be the maximal destabilizing subsheaf of \(\Omega _X\). Since the quotient \(\Omega _X/M\) is a rank \(1\) torsion free sheaf, we can write it as \(I_ZL\), where \(Z\) is a \(0\)-dimensional subscheme and \(L\) is a line bundle such that \(L+M=K_X\). By assumption we have \((2M-K_X)H\ge 0\). Then

$$\begin{aligned} c_2(X)=ML+\deg Z\ge ML=M(K_X-M). \end{aligned}$$

If \((2M-K_X)^2>0\) then \((2M-K_X)\in C^{+}\) and hence \(2M=(2M-K_X)+K_X\in C^+\), which contradicts Proposition 2. Therefore

$$\begin{aligned} 4c_2(X)- c_1^2(X)\ge 4M(K_X-M)-K_X^2=-(2M-K_X)^2\ge 0, \end{aligned}$$

which proves the required inequality.

If \(4c_2(X)=c_1^2(X)>0\) then \(K_X\in C^{+}\). So if \(\Omega _X\) is not slope \(H\)-stable then \((2M-K_X)^2=0\). This implies that \((2M-K_X)\in \overline{C^+}\) and hence \(2M=(2M-K_X)+K_X\in C^+\), a contradiction. The fact that in this case \(p=2\) follows from Theorem 13. \(\square \)

Note that Shepherd-Barron proved the above inequality in case \(p=2\) ([34, Theorem 9 and Corollary 11]) and he conjectured that it remains true in general (see [34, p. 244]). In fact, we show the following stronger version of this inequality:

Theorem 13

Let \(X\) be a smooth projective surface of non-negative Kodaira dimension. Assume that \(X\) can be lifted to \(W_2(k)\). If \(p\ge 3\) then \(3c_2(X)\ge c_1^2(X)\).

Proof

As in the proof of Corollary 7 we can assume that \(K_X\in \overline{C^+}\). There exists a canonical system of Hodge bundles \((E:=E^1\oplus E^0, \theta )\) with \(E^1=\Omega _X\), \(E^0={\fancyscript{O}}_X\) and \(\theta :={\mathop {\mathrm{id}}}_{\Omega _X}\). If \((E, \theta )\) is slope \(H\)-semistable then by Theorem 8

$$\begin{aligned} 3c_2(E)=3c_2(X)\ge c_1^2(E)=c_1^2(X). \end{aligned}$$

So we can assume that \((E, \theta )\) is not slope \(H\)-semistable. Let \((E', \theta ')\) be the maximal destabilizing Hodge subsystem. The only rank \(1\) Hodge subsystem is of the form \(({\fancyscript{O}}_X, 0)\) and it does not destabilize \((E, \theta )\) so \(E'\) has rank \(2\) (and \((E', \theta ')\) is slope \(H\)-stable). In this case \(E'={\fancyscript{O}}_X\oplus M\), where \(M\) is a line bundle contained in \(\Omega _X\). Since \((E', \theta ')\) destabilizes \((E, \theta )\) we have

$$\begin{aligned} \mu (E')=\frac{MH}{2}> \mu (E)=\frac{K_XH}{3}. \end{aligned}$$

Therefore \((3M-2K_X)H>0\). If \(3M-2K_X\in C^{+}\) then \(3M=(3M-2K_X)+2K_X\in C^{+}\), which contradicts Proposition 2. This shows that \((3M-2K_X)^2\le 0\).

As before we write \(\Omega _X/M\) as \(I_ZL\), where \(Z\) is a \(0\)-dimensional subscheme and \(L\) is a line bundle such that \(L+M=K_X\). Then

$$\begin{aligned} c_2(X)=ML+\deg Z\ge ML=M(K_X-M). \end{aligned}$$

Therefore, since \(M^2\le 0\) by Proposition 2, we have

$$\begin{aligned} 3c_2(X)- c_1^2(X)\ge 3M(K_X-M)+\frac{3}{4}M^2-K_X^2=-\frac{1}{4}(3M-2K_X)^2\ge 0. \end{aligned}$$

\(\square \)

The above proof of Theorem 13 follows quite closely proof of [35, Proposition 9.9] but in the dual setting.

Remark 3

Note that passing to Simpson’s filtration can increase the level of nilpotence. This is the reason why proof of Theorem 13 does not work in characteristic 2 even though the level of nilpotence of the canonical system of Hodge bundles is only one.

Proposition 3

Under the assumptions of Theorem 13, the equality \(3c_2(X)=c_1^2(X)\) holds if and only if \(X\) is minimal and one of the following conditions is satisfied:

  1. 1.

    the canonical system of Hodge bundles \((E={\fancyscript{O}}_X\oplus \Omega _X, \theta )\) is slope \(H\)-stable and \(\Delta (E)=0\),

  2. 2.

    \(K_X\equiv 0\) and there exists a line subbundle \(M\subset \Omega _X\) such that \(M^2=0\) and \(MH>0\) for an ample divisor \(H\),

  3. 3.

    \(K_X^2=0, K_X\not \equiv 0\) and there exists a line subbundle \(M\subset \Omega _X\) such that \(M\equiv aK_X\) for some positive rational number \(a\),

  4. 4.

    \(\Omega _X\) is numerically flat (in particular, \(K_X\equiv 0\)).

Proof

Minimality of \(X\) is clear. First consider the case in which \((E, \theta )\) is not slope \(H\)-semistable. Then the proof of Theorem 13 shows that \(3c_2(X)=c_1^2(X)\) implies that \(M^2=(3M-2K_X)^2=0\) and \(\deg Z=0\) (which implies that \(M\) is a line subbundle of \(\Omega _X\)). Therefore \((3M-2K_X)\in \overline{C^+}\), which implies that \(K_X^2=0\) (otherwise \(K_X\in {C^+}\) and \(3M=(3M-2K_X)+2K_X\in C^{+}\), a contradiction). Hence \(MK_X=0\) and by the Hodge index theorem either \(K_X\equiv 0\) or \(K_X\not \equiv 0\) and there exists some positive rational number \(a\) such that \(M\equiv aK_X\).

Now consider the case when \((E, \theta )\) is slope \(H\)-semistable but it is not slope \(H\)-stable. Then there exists a saturated slope \(H\)-stable subsystem of Hodge bundles \((E', \theta ')\) such that \(\mu (E')=\mu (E)\).

If \((E', \theta ')=({\fancyscript{O}}_X, 0)\) then \(K_XH=0\). So \(K_X\) is numerically trivial by the Hodge index theorem. In this case the quotient \((E, \theta )/(E', \theta ')=(\Omega _X, 0)\) is slope \(H\)-semistable. But then \(\Omega _X\) is strongly slope \(H\)-semistable (see [26, Theorem 2.1]) and hence it is numerically flat (see [23, Proposition 5.1]).

If \(E'\) has rank \(2\) then \(E'={\fancyscript{O}}_X\oplus M\), where \(M\) is a line bundle contained in \(\Omega _X\). By assumption \((3M-2K_X)H=0\). The same arguments as before show that \(M\) is a line subbundle of \(\Omega _X\) and \(M^2=(3M-2K_X)^2=0\). But then \(3M\equiv 2 K_X\), which also implies that \(K_X^2=0\). \(\square \)

Remark 4

It is easy to see that cases 1, 3 and 4 of the above proposition can occur in every characteristic (including characteristic zero). Assume that we are in the second case. Looking at the classification of surfaces one can easily see that \(K_X\equiv 0\) and \(c_2=0\) can occur only if \(X\) is abelian or hyperelliptic or quasi-hyperelliptic. In the first two cases we have a short exact sequence

$$\begin{aligned} 0\rightarrow {\fancyscript{O}}_X\rightarrow \Omega _X\rightarrow \omega _X\rightarrow 0, \end{aligned}$$

which easily gives a contradiction. This shows that \(X\) is quasi-hyperelliptic. This can only occur in characteristics \(2\) and \(3\). Then we have a short exact sequence

$$\begin{aligned} 0\rightarrow {\fancyscript{O}}_X(m\Sigma )\rightarrow \Omega _X\rightarrow \omega _X (-m\Sigma )\rightarrow 0, \end{aligned}$$

where \(\Sigma \) is the curve of cusps with \(\Sigma ^2=0\) and \(m=2\) in characteristic \(2\) and \(m=1\) in characteristic \(3\). The above sequence implies that \(M={\fancyscript{O}}_X(m\Sigma )\). There are examples of quasi-hyperelliptic surfaces with vanishing \(H^2(T_X)\) (see [19, Theorem 4.2]). Since \(H^2(T_X)\) is the obstruction space to lifting, such surfaces lift to \(W_2(k)\). On the other hand, not all quasi-hyperelliptic surfaces lift to \(W_2(k)\) as in some cases the Hodge to de Rham spectral sequence does not degenerate at \(E_1\) (see [19, Theorem 4.4]) and we would get a contradiction with [6, Corollaire 2.4].

7 Adjoint linear systems on surfaces

Existence of Bogomolov’s inequality for semistable sheaves on varieties liftable to \(W_2\) has many interesting applications. In this subsection we state just two results on adjoint linear systems on surfaces. The first one generalizes Mumford’s vanishing theorem (see [2, IV, Theorem 12.1]) and it was already known with a different proof (see [6, Corollaire 2.8]):

Theorem 14

Let \(X\) be a smooth projective surface liftable to \(W_2(k)\). Let \(D\) be a nef and big line bundle on \(X\). Then \(H^1(X, K_X+D)=0\).

The vanishing follows immediately by replacing in proof of [2, IV, Theorem 12.1] usual Bogomolov’s inequality with Corollary 2.

Note that the same arguments show a counterexample to Bogomolov’s inequality for semistable sheaves in positive characteristic. More precisely, let \(X\) be a smooth projective surface in positive characteristic for which Kodaira’s vanishing fails (see [33] or [30]). Then there exists an ample divisor \(H\) such that \(H^1(X, K_X+H)\ne 0\). By Serre’s duality \({\mathop {\mathrm{E}xt}}^1({\fancyscript{O}}_X(H), {\fancyscript{O}}_X)\ne 0\), so there exists a non-trivial extension

$$\begin{aligned} 0\rightarrow {\fancyscript{O}}_X \rightarrow E\rightarrow {\fancyscript{O}}_X(H)\rightarrow 0. \end{aligned}$$

Then the sheaf \(E\) is slope \(H\)-semistable, but \(\Delta (E)<0\).

Similarly to the above, the usual characteristic zero proof can be used to show the following Reider’s type theorem (cf. [2, IV, Section 11]). Details are left to the reader.

Theorem 15

Let \(X\) be a smooth projective surface liftable to \(W_2(k)\). Let \(D\) be a nef line bundle on \(X\) and let \(Z\) be a zero-dimensional locally complete intersection subscheme of \(X\). Assume that \(D^2>4 \deg Z\). Then \(Z\) is in special position with respect to \(K_X+D\) if and only if there there exists an effective curve \(C\) containing \(Z\) such that \(Z\) is in (very) special position with respect to \({\fancyscript{O}}_C(K_X+D)\). Moreover, we can choose \(C\) so that \(D-2C\) is big, \(DC-\deg Z\le C^2\) and \(0\le DC<2 \deg Z\).

The above theorem has many applications to studying \(n\)-very ampleness and \(n\)-jet ampleness of adjoint linear systems, to pluricanonical maps on surfaces of general type, linear systems on surfaces of Kodaira dimension \(0\), etc.

8 Lifting of Frobenius split varieties

Let \(f:X\rightarrow S\) be a flat morphism of schemes in characteristic \(p\) and let \(F_{X/S}:X\rightarrow X'\) be the relative Frobenius morphism. Let us fix a flat \({\mathbb Z}/p^2{\mathbb Z}\)-scheme \(\tilde{S}\) such that \(S\rightarrow \tilde{S}\) is the closed subscheme defined by \(p\). Then \(p{\fancyscript{O}}_{\tilde{S}}\simeq {\fancyscript{O}}_S\).

We say that an \(S\)-scheme \(X/S\) is F-split is the canonical map \({\fancyscript{O}}_{X'}\rightarrow F_{X/S, *}{\fancyscript{O}}_X\) can be split by some \({\fancyscript{O}}_{X'}\)-linear map \(F_{X/S, *}{\fancyscript{O}}_X\rightarrow {\fancyscript{O}}_{X'}\).

The following proposition was pointed out to the author by Bhargav Bhatt:

Proposition 4

If \(X/S\) is F-split then \(X'\) can be lifted to \(\tilde{S}\).

Proof

Let \(L_{X/S}\) be the cotangent complex of \(X/S\). By definition of the cotangent complex it is easy to see that the map \( F_{X/S}^* L_{X'/S}\rightarrow L_{X/S}\) induced by the relative Frobenius morphism is the zero map.

There exists a canonical obstruction \(ob_{X/S}: L_{X/S}\rightarrow {\fancyscript{O}}_X[2]\) to lifting \(X/S\) to \(\tilde{S}\) (see [13, Chapitre III, Théorème 2.1.7]). By functoriality of obstructions we have the following commutative diagram

But the vertical map \(L_{X'/S}\rightarrow F_{X/S, *} L_{X/S}\) is adjoint to \(F_{X/S}^* L_{X'/S}\rightarrow L_{X/S}\), so it is zero. This implies that \(ob_{X'/S}=0\) and \(X'\) can be lifted to \(\tilde{S}\). \(\square \)

In the smooth case this proposition was proven in [14, p. 164] (see also [16, Corollary 9.2]).

Let us recall that \(X/S\) is F-pure if the Frobenius map \({\fancyscript{O}}_{X'}\rightarrow F_{X/S, *}{\fancyscript{O}}_X\) is locally split as a map of \({\fancyscript{O}}_{X'}\)-modules. Proposition 4 immediately implies the following corollary:

Corollary 8

If \(X/S\) is F-pure then locally in the Zariski topology \(X\) can be lifted modulo \(p^2\).

Together with Theorem 8, Proposition 4 implies also the following corollary:

Corollary 9

Let \(X\) be a smooth projective variety defined over an algebraically closed field of positive characteristic \(p\). Let \(H\) be an ample divisor on \(X\). Assume that \(X\) is F-split. Then for any slope \(H\)-semistable Higgs sheaf \((E, \theta )\) on \(X\) of rank \(r\le p\) we have \(\Delta (E)H^{n-2}\ge 0\).