1 Introduction

Let H be an r-uniform hypergraph. The Turán function \(\textrm{ex}(n,H)\) of H is the largest number of edges in an r-uniform hypergraph on n vertices with no copy of H. The study of the function \(\textrm{ex}(n,H)\) for various hypergraphs H is one of the central problems of extremal combinatorics. In the graph case \(r=2\), the Turán function is fairly well understood unless H is bipartite. On the other hand, for \(r \ge 3\), our understanding of the Turán function is much worse and there are only few tight results. For example, determining the answer for the 3-uniform clique on 4 vertices is still open. Some of the few cases where tight results are known involve hypergraphs which are derived from graphs, see e.g. [6, 9, 11, 13, 15]. We refer the interested reader to an extensive survey of Keevash [8] on Turán problems for non–r-partite r-uniform hypergraphs.

It is well-known that for r-partite H, one has \(\textrm{ex}(n,H) = O(n^{r - \varepsilon })\) for some \(\varepsilon = \varepsilon (H) > 0\) and the main goal here is to determine or estimate the best possible \(\varepsilon (H)\). One of the very old such Turán-type questions for hypergraphs is a problem of Erdős [3], asking for the maximum number \(f_r(n)\) of edges in an r-uniform hypergraph on n vertices which does not have four distinct edges ABCD satisfying \(A \cup B = C \cup D\) and \(A \cap B = C \cap D = \emptyset \). Note that in this problem the forbidden hypergraphs originate quite naturally from a four-cycle. Erdős in particular asked whether \(f_r(n) = O(n^{r-1})\). This was answered affirmatively by Füredi [7], who showed that \(f_r(n) \le 3.5 \left( {\begin{array}{c}n\\ r-1\end{array}}\right) \). Mubayi and Verstraëte [12] extended Erdős’s question by considering the following family of r-uniform hypergraphs which generalize complete bipartite graphs: for positive integers rst, let \(K_{s,t}^{(r)}\) denote the r-uniform hypergraph whose vertex set consists of disjoint sets \(X,Y_1,\dots ,Y_t\), where \(|X| = s\) and \(|Y_1| = \dots = |Y_t| = r-1\), and whose edge set is \(\{\{x\} \cup Y_i: x \in X, 1\le i\le t\}\). Note that \(K_{s,t}^{(r)}\) is r-partite and for \(r = 2\), \(K_{s,t}^{(2)}\) is just the \(s \times t\) complete bipartite graph. Observe that the edges of \(K^{(r)}_{2,2}\) form a configuration ABCD as in the definition of \(f_r(n)\). Hence, \(f_r(n) \le \textrm{ex}(n,K_{2,2}^{(r)})\) (this is in fact an equality for \(r = 3\)). Mubayi and Verstraëte [12] proved that \(\textrm{ex}(n,K_{2,2}^{(r)})\le 3\left( {\begin{array}{c}n\\ r-1\end{array}}\right) +O(n^{r-2})\), improving the constant in Füredi’s result. Pikhurko and Verstraëte [14] improved the coefficient of \(\left( {\begin{array}{c}n\\ r-1\end{array}}\right) \) further. It remains open whether \(\textrm{ex}(n,K_{2,2}^{(r)}) = (1 + o(1)) \left( {\begin{array}{c}n-1\\ r-1\end{array}}\right) \), as conjectured by Füredi. That \(\left( {\begin{array}{c}n-1\\ r-1\end{array}}\right) \) is a lower bound can be seen by considering the star, i.e. the hypergraph consisting of all edges containing a fixed vertex.

Mubayi and Verstraëte [12] initiated the study of \(\textrm{ex}(n,K_{s,t}^{(3)})\) for general st, and proved that \(\textrm{ex}(n,K_{s,t}^{(3)})\le C_{s,t}n^{3-1/s}\) as well as that \(\textrm{ex}(n,K_{s,t}^{(3)})\ge c_t n^{3-2/s}\) for \(t>(s-1)!\). For small values of s, they obtained more accurate estimates. Namely, for \(s=3\), they improved their bound to \(\textrm{ex}(n,K_{3,t}^{(3)})\le C_tn^{13/5}\), while for \(s = 2\), they showed that \(\textrm{ex}(n,K_{2,t}^{(3)})\le t^4 \left( {\begin{array}{c}n\\ 2\end{array}}\right) \) and that \(\textrm{ex}(n,K_{2,t}^{(3)})\ge \frac{2t-1}{3}\left( {\begin{array}{c}n\\ 2\end{array}}\right) \) for infinitely many n. They further asked to determine the correct dependence of \(\textrm{ex}(n,K_{2,t}^{(3)})\) on t. Ergemlidze et al. [4] improved the upper bound to \(\textrm{ex}(n,K_{2,t}^{(3)})\le (15t\log t+40t)n^2\), leaving a \(\log t\) gap from the lower bound of \(\Omega (tn^2)\). For \(r > 3\), little is known. Ergemlidze, Jiang and Methuku found a construction showing \(\textrm{ex}(n, K_{2,t}^{(4)}) = \Omega (tn^3)\). Xu et al. [16, 17] proved a tight bound on \(\textrm{ex}(n,K_{s,t}^{(r)})\) when s is much larger than t, using a standard application of the random algebraic method of Bukh [2].

Our first result, Theorem 1.1, achieves two goals. First, it resolves the problem of Mubayi and Verstraëte by proving that \(\textrm{ex}(n,K_{2,t}^{(3)})= \Theta (tn^2)\). And second, it improves the upper bound of [12] on \(\textrm{ex}(n,K_{s,t}^{(3)})\) for every s and t by reducing the exponent of n from \(3 - \frac{1}{s}\) to \(3 - \frac{1}{s-1}\). We also obtain an analogous result for every \(r \ge 3\). The proof of this bound relies on a new weighted variant of the dependent random choice method (see [5] for a description of the technique and a brief history).

Theorem 1.1

For any \(s,t\ge 2\) and \(r \ge 3\) there is a constant \(C_s\) depending only on s such that

$$\begin{aligned} \textrm{ex}\left( n,K_{s,t}^{(r)}\right) \le C_{s} t^{\frac{1}{s-1}}n^{r-\frac{1}{s-1}}. \end{aligned}$$

In particular,

$$\begin{aligned} \textrm{ex}\left( n,K_{2,t}^{(3)}\right) \le Ctn^2 \end{aligned}$$

for some absolute constant C.

Our next result shows that, somewhat surprisingly, the bound in Theorem 1.1 is tight in terms of both n and t if the uniformity r is even and \(t \gg s\). Our construction uses as building blocks a variation of the norm graphs, introduced by Kollár et al. [10], which might be of independent interest.

Theorem 1.2

For any positive integers \(s\ge 2\) and k, there is a positive constant \(c=c(k,s)\) such that for every integer \(t>(s-1)!\), if n is sufficiently large, then

$$\begin{aligned} \textrm{ex}\left( n,K_{s,t}^{(2k)}\right) \ge ct^{\frac{1}{s-1}}n^{2k-\frac{1}{s-1}}. \end{aligned}$$

By combining Theorems 1.1 and 1.2, we see that \(\textrm{ex}(n,K_{s,t}^{(r)}) = \Theta _{r,s}(t^{\frac{1}{s-1}}n^{r-\frac{1}{s-1}})\) if \(r\ge 4\) is even and \(t > (s-1)!\). (Here and elsewhere in the paper, \(\Theta _{r,s}\) means that the implied constants can depend on r and s.) In the special case \(s = 2\), this gives \(\textrm{ex}(n,K_{2,t}^{(r)}) = \Theta _r(t n^{r-1})\) for even \(r\ge 4\). This partially answers a question of Ergemlidze et al. [4], who asked to determine the dependence of \(\textrm{ex}(n,K_{2,t}^{(r)})\) on t. Also, Theorem 1.2 disproves a conjecture of Xu et al. [17, Conjecture 5.1] which stated that \(\textrm{ex}(n,K_{s,t}^{(r)}) = \Theta _{r,s,t}(n^{r-2/s})\) for all \(2 \le s \le t\).

It is natural to ask whether the bound in Theorem 1.1 is tight for odd uniformities as well. Our next theorem shows that this is not the case for \(r = 3\). This indicates that, perhaps surprisingly, the parity of r may play a role.

Theorem 1.3

For any \(s\ge 3\), there exists some \(\varepsilon = \varepsilon (s)>0\) such that for any t,

$$\begin{aligned} \textrm{ex}\left( n,K_{s,t}^{(3)}\right) \le C_{s,t} n^{3-\frac{1}{s-1}-\varepsilon }. \end{aligned}$$

The \(\varepsilon = \varepsilon (s)\) in Theorem 1.3 can be chosen on the order of \(s^{-5}\).

Theorems 1.2 and 1.3 together show that \(\textrm{ex}(n,K_{s,t}^{(r)})/n^{r-1}\) has a different order of magnitude for even \(r \ge 4\) and for \(r = 3\). Indeed, for even r the function \(\textrm{ex}(n,K_{s,t}^{(r)})/n^{r-1}\) is asymptotically \(\Theta _{r,s,t}(n^{1 - \frac{1}{s-1}})\), assuming \(s \ll t\), while for \(r = 3\) this function is smaller by at least a factor of \(n^{\varepsilon }\). This contradicts a claim made in the concluding remarks of [4] (see [4, Proposition 1]), where it was stated that \(\textrm{ex}(n,K_{s,t}^{(r)}) \le O_{r,s,t}(n^{r-3}) \cdot \textrm{ex}(n,K_{s,t}^{(3)})\). One can check that the proof suggested in [4] is incorrect. Moreover, as we now see, the statement itself is disproved by Theorems 1.2 and 1.3.

It would be very interesting to determine if Theorem 1.3 can be extended to all odd uniformities r. If so, then this would be a rare example of an extremal problem where the answer depends on the parity of the uniformity. (See [9] for another hypergraph Turán problem where the extremal construction depends heavily on number theoretic properties of the parameters.) The first open case is \(r = 5\): is it true that \(\textrm{ex}(n,K_{s,t}^{(5)})=O(n^{5-\frac{1}{s-1}-\varepsilon })?\)

We end with some results for a more general family of hypergraphs. Let G be a bipartite graph with an ordered bipartition (XY), \(Y = \{y_1,\dots ,y_m\}\). Following [4], we define \(G_{X,Y}^{(r)}\) to be the r-uniform hypergraph whose vertex set consists of disjoint sets \(X,Y_1,\dots ,Y_m\), \(|Y_1| = \dots = |Y_m| = r-1\), and whose edge set is \(\{\{x\} \cup Y_i: \{x,y_i\} \in E(G)\}\). Note that if \(G=K_{s,t}\) with X being the part of size s and Y being the part of size t, then \(G_{X,Y}^{(r)}=K_{s,t}^{(r)}\). Ergemlidze, Jiang and Methuku [4] asked whether it is true that if all vertices in Y have degree at most 2 in G, then \(\textrm{ex}(n,G_{X,Y}^{(r)}) = O(n^{r-1})\) where the implied constant depends only on G and r. Here we resolve this conjecture in greater generality.

Theorem 1.4

Let \(s\ge 2\), \(r\ge 3\) and let G be a bipartite graph with an ordered bipartition (XY) such that every vertex in Y has degree at most s. Then \(\textrm{ex}(n,G_{X,Y}^{(r)})\le Cn^{r-\frac{1}{s-1}}\) where C only depends on G and r.

Note that Theorem 1.2 shows that this bound can be attained whenever r is even.

Finally, we consider the hypergraph \(G_{X,Y}^{(r)}\) when \(G=C_{2t}\) is the cycle of length 2t. Let us write \(C_{2t}^{(r)}\) for this hypergraph. In an unpublished work, Jiang and Liu showed that \(\Omega _r(tn^{r-1}) \le \textrm{ex}(n,C_{2t}^{(r)}) \le O_r(t^5n^{r-1})\). The lower bound is obtained by taking all edges which contain one of \(t-1\) vertices. This hypergraph has cover number \(t-1\), so it cannot contain \(C_{2t}^{(r)}\), which has cover number t. Ergemlidze, Jiang and Methuku [4] improved the upper bound to \(\textrm{ex}(n,C_{2t}^{(r)}) \le O_r(t^2(\log t) n^{r-1})\). We determine the correct dependence on t.

Theorem 1.5

For every \(t \ge 2\) and \(r \ge 3\), we have that \(\textrm{ex}(n,C_{2t}^{(r)}) = \Theta _r(tn^{r-1})\).

The rest of the paper is organized as follows. In Sect. 2, we prove a general result which implies Theorems 1.1, 1.4 and 1.5. In Sect. 3, we prove Theorem 1.2. In Sect. 4, we prove Theorem 1.3. In Sect. 5, we give some concluding remarks.

2 Upper Bounds

In what follows, for an r-uniform hypergraph \(\mathcal {G}\) and a set \(S=\{v_1,\dots ,v_{r-1}\}\subset V(\mathcal {G})\), we write \(d_{\mathcal {G}}(S)\) or \(d_{\mathcal {G}}(v_1,\dots ,v_{r-1})\) for the number of vertices \(v_r\in V(\mathcal {G})\) such that \(v_1v_2\dots v_r\in E(\mathcal {G})\). We omit the subscript when the hypergraph is clear. Given a vertex \(v \in V(\mathcal {G})\), the link hypergraph of v (with respect to \(\mathcal {G}\)) is the \((r-1)\)-uniform hypergraph containing all \((r-1)\)-sets which together with v form an edge in \(\mathcal {G}\).

Definition 2.1

In an r-uniform hypergraph \(\mathcal {G}\), we call a set \(S\subset V(\mathcal {G})\) t-rich if there are sets \(T_1,T_2,\dots ,T_t\subset V(\mathcal {G})\) of size \(r-1\) such that \(S,T_1,\dots ,T_t\) are pairwise disjoint and \(\{u\}\cup T_i\in E(\mathcal {G})\) for every \(u\in S\) and \(i\in [t]\).

Note that if \(\mathcal {G}\) has any t-rich set of size s, then it contains \(K_{s,t}^{(r)}\) as a subgraph.

Theorem 2.2

Let \(\alpha >1\) be a real number and let \(r\ge 3\), \(s\ge 2\), t and n be positive integers. Then there is a constant C which depends only on s such that the following holds. If \(\mathcal {G}\) is an n-vertex r-uniform hypergraph with at least \(C\alpha ^{\frac{1}{s-1}}t^{\frac{1}{s-1}}n^{r-\frac{1}{s-1}}\) hyperedges, then there is a set \(A\subset V(\mathcal {G})\) of size at least \(\alpha ^{\frac{1}{s-1}}t^{\frac{1}{s-1}}n^{1-\frac{1}{s-1}}\) (and at least s) such that the proportion of t-rich sets of size s in A is at least \(1-\alpha ^{-1}\).

Observe that the conclusion of this theorem implies that \(\mathcal {G}\) contains \(K_{s,t}^{(r)}\) as a subgraph, so Theorem 1.1 follows immediately (by taking \(\alpha =2\), for example). Moreover, as we will see shortly, Theorem 2.2 also implies Theorems 1.4 and 1.5 fairly easily.

The proof of Theorem 2.2 uses a novel variant of the dependent random choice method. The rough idea is to choose random vertices \({\textbf {v}}_2,{\textbf {v}}_3,\dots ,{\textbf {v}}_r\in V(\mathcal {G})\) and take \({\textbf {A}}\) to be the set of \({\textbf {v}}_1\in V(\mathcal {G})\) such that \({\textbf {v}}_1{\textbf {v}}_2\dots {\textbf {v}}_r\in E(\mathcal {G})\). However, we add two major twists to this. Firstly, we only put into \({\textbf {A}}\) those vertices \({\textbf {v}}_1\) for which \(d({\textbf {v}}_2,\dots ,{\textbf {v}}_r)=\max _i d({\textbf {v}}_1,\dots ,{\textbf {v}}_{i-1},{\textbf {v}}_{i+1},\dots ,{\textbf {v}}_r)\). Secondly, the vertices \({\textbf {v}}_2,\dots ,{\textbf {v}}_r\) are not chosen uniformly at random, but with probability proportional to \(1/d({\textbf {v}}_2,\dots ,{\textbf {v}}_r)\).

Let us briefly comment on the purpose of these two twists. If we were to choose \({\textbf {v}}_2,\dots ,{\textbf {v}}_r\) uniformly at random and let \({\textbf {A}}\) be the set of \({\textbf {v}}_1\in V(\mathcal {G})\) such that \({\textbf {v}}_1{\textbf {v}}_2\dots {\textbf {v}}_r\in E(\mathcal {G})\), then by the usual dependent random choice argument, we could conclude that for most s-sets \(\{u_1,\dots ,u_s\}\subset {\textbf {A}}\), there are many sets of size \(r-1\) forming an edge with each of the \(u_i\). (“Many” here means roughly \(\omega (n^{r-2-\frac{1}{s-1}})\).) If at least t of these \((r-1)\)-sets are pairwise disjoint, then we can find a copy of \(K_{s,t}^{(r)}\), but we run into trouble if say all these \((r-1)\)-sets contain some vertex x. In this case, the reason why the event \(\{u_1,\dots ,u_s\}\subset {\textbf {A}}\) is not very unlikely is that conditional on \(x\in \{{\textbf {v}}_2,\dots ,{\textbf {v}}_r\}\), the probability of this event is fairly large. However, now the pair \((u_1,x)\) belongs to slightly more edges than a typical pair of vertices in \(\mathcal {G}\). This can be exploited by modifying the probability distribution of the random vertices \(({\textbf {v}}_2,\dots ,{\textbf {v}}_r)\) in a way that \((r-1)\)-sets with large degree are chosen with lower probability.

More precisely, by choosing \({\textbf {v}}_2,\dots ,{\textbf {v}}_r\) with probability proportional to \(1/d({\textbf {v}}_2,\dots ,{\textbf {v}}_r)\) and taking \({\textbf {A}}\) to be the set of those vertices \({\textbf {v}}_1\) for which \(d({\textbf {v}}_2,\dots ,{\textbf {v}}_r)=\max _i d({\textbf {v}}_1,\dots ,{\textbf {v}}_{i-1},{\textbf {v}}_{i+1},\dots ,{\textbf {v}}_r)\), we achieve that for a typical s-set \(\{u_1,\dots ,u_s\}\subset {\textbf {A}}\), not only there are many \((r-1)\)-sets forming an edge with each \(u_i\), but in fact the stronger statement holds that there cannot be a constant-sized set which intersects all such \((r-1)\)-sets. See Claims 3-5 for the full details (Claims 1-2 would hold similarly without the twists).

We proceed with the detailed proof.

Proof of Theorem 2.2

Choose C such that \(C\ge 4s\) and \(\frac{C^{s-1}((r-1)!)^s}{2^{s}r!s^s}-r^2\ge 1\). Since \(s\ge 2\), C can be chosen to be independent from r. Let \(\mathcal {G}\) be an n-vertex r-uniform hypergraph with \(e(\mathcal {G})\ge C\alpha ^{\frac{1}{s-1}}t^{\frac{1}{s-1}}n^{r-\frac{1}{s-1}}\). Let

$$\begin{aligned} D=\frac{C}{2}\alpha ^{\frac{1}{s-1}}t^{\frac{1}{s-1}}n^{1-\frac{1}{s-1}}. \end{aligned}$$

By successively deleting all edges containing a set of size \(r-1\) which lies in less than D edges, we obtain a subhypergraph \(\mathcal {G}'\) (on the same vertex set) with \(e(\mathcal {G}')\ge e(\mathcal {G}) - n^{r-1}D \ge e(\mathcal {G})/2\) such that for every set \(S\subset V(\mathcal {G})\) of size \(r-1\), we have either \(d_{\mathcal {G}'}(S)=0\) or \(d_{\mathcal {G}'}(S)\ge D\). For the rest of the proof, we let \(d(S):=d_{\mathcal {G}'}(S)\) for every set S of size \(r-1\).

For distinct vertices \(v_2,\dots ,v_{r}\in V(\mathcal {G}')\), let

$$\begin{aligned} A_{v_2,\dots ,v_{r}}{} & {} =\{v_1\in V(\mathcal {G}'): v_1v_2\dots v_{r}\in E(\mathcal {G}'), d(v_2,\dots ,v_r)\nonumber \\{} & {} =\max _i(d(v_1,\dots ,v_{i-1},v_{i+1},\dots ,v_r))\} \end{aligned}$$

and let \(a(v_2,\dots ,v_{r})=|A_{v_2,\dots ,v_{r}}|\).

Define

$$\begin{aligned} p=\sum _{\begin{array}{c} v_2,\dots ,v_r: \\ d(v_2,\dots ,v_r)>0 \end{array}} \frac{D}{n^{r-1} d(v_2,\dots ,v_r)}. \end{aligned}$$

By the definition of \(\mathcal {G}'\), if \(d(v_2,\dots ,v_r)>0\), then \(d(v_2,\dots ,v_r)\ge D\), so we have \(p\le 1\). Let us define a random set \({\textbf {A}}\subset V(\mathcal {G}')\) as follows. With probability \(1-p\), we let \({\textbf {A}}=\emptyset \). With probability p, we choose a random \((r-1)\)-tuple \(({\textbf {v}}_2,\dots ,{\textbf {v}}_r)\) of distinct vertices in \(\mathcal {G}'\) in a way that the probability that \({\textbf {v}}_i=v_i\) for every i is \(\frac{D}{n^{r-1} d(v_2,\dots ,v_r)}\) if \(d(v_2,\dots ,v_r)>0\) and 0 otherwise. Set \({\textbf {A}}=A_{{\textbf {v}}_2,\dots ,{\textbf {v}}_r}\). \(\square \)

Claim 1

\(\sum _{v_2,\dots ,v_r} a(v_2,\dots ,v_r)\ge (r-1)!e(\mathcal {G}')\).

Proof of Claim 1

For any \(e\in E(\mathcal {G}')\), there are at least \((r-1)!\) ordered r-tuples \((v_1,\dots ,v_r)\) such that \(e=v_1v_2\dots v_r\) and \(d(v_2,\dots ,v_r)=\max _i(d(v_1,\dots ,v_{i-1},v_{i+1},\dots ,v_r))\). For any such r-tuple, we have \(v_1\in A_{v_2,\dots ,v_r}\). \(\square \)

Claim 2

\(\mathbb {E}[|{\textbf {A}}|^s]\ge \frac{((r-1)!)^s}{r!}D^s\).

Proof of Claim 2

Using Hölder’s inequality for three functions with parameters \(p_1 = s,\) \(p_2 = s, p_3 = s / (s-2),\) we get

$$\begin{aligned} \left( \sum \frac{a(v_2,\dots ,v_r)^s}{d(v_2,\dots ,v_r)}\right) \left( \sum d(v_2,\dots ,v_r)\right) \left( \sum 1 \right) ^{s-2}\ge \left( \sum a(v_2,\dots ,v_r)\right) ^s, \end{aligned}$$

where each sum is over all \((r-1)\)-tuples of distinct vertices \((v_2,\dots ,v_r)\) with \(d(v_2,\dots ,v_r)>0\). Hence,

$$\begin{aligned} \sum \frac{a(v_2,\dots ,v_r)^s}{d(v_2,\dots ,v_r)}\ge \frac{\left( \sum a(v_2,\dots ,v_r)\right) ^s}{n^{(r-1)(s-2)}\sum d(v_2,\dots ,v_r)}, \end{aligned}$$

so

$$\begin{aligned} \mathbb {E}[|{\textbf {A}}|^s]&=\sum \frac{D}{n^{r-1} d(v_2,\dots ,v_r)}a(v_2,\dots ,v_r)^s\ge \frac{D}{n^{(r-1)(s-1)}} \frac{(\sum a(v_2,\dots ,v_r))^s}{\sum d(v_2,\dots ,v_r)} \\ {}&\ge \frac{D}{n^{(r-1)(s-1)}}\frac{((r-1)!e(\mathcal {G}'))^s}{r!e(\mathcal {G}')}= \frac{((r-1)!)^s D}{r! n^{(r-1)(s-1)}}e(\mathcal {G}')^{s-1} \\&\ge \frac{((r-1)!)^s D}{r! n^{(r-1)(s-1)}}(Dn^{r-1})^{s-1}=\frac{((r-1)!)^s}{r!}D^s, \end{aligned}$$

where the second inequality used Claim 1. \(\square \)

Claim 3

Let \(u_1\), \(u_2\), ..., \(u_s\), \(v_2\), ..., \(v_{r-1}\) be distinct vertices in \(\mathcal {G}'\). Then the probability that \(u_1,u_2,\dots ,u_s\in {\textbf {A}}\) and \({\textbf {v}}_i=v_i\) for all \(2\le i\le r-1\) is at most \(D/n^{r-1}\).

Proof of Claim 3

Assume that \(v_r\in V(\mathcal {G}')\) is such that \(u_j\in A_{v_2,\dots ,v_r}\) for each \(j\in [s]\). Then in particular \(u_1v_2v_3\dots v_r\in E(\mathcal {G}')\) and \(d(v_2,v_3,\dots ,v_r)\ge d(u_1,v_2,\dots ,v_{r-1})\). Clearly, there are at most \(d(u_1,v_2,\dots ,v_{r-1})\) choices for \(v_r\) satisfying these two properties, and for each such choice, the probability that \({\textbf {v}}_i=v_i\) for all \(2\le i\le r\) is \(\frac{D}{n^{r-1} d(v_2,v_3,\dots ,v_r)}\le \frac{D}{n^{r-1} d(u_1,v_2,\dots ,v_{r-1})}\). Hence, summing over all possibilities for \(v_r\) proves the claim. \(\square \)

Claim 4

Let \(u_1\), \(u_2\), ..., \(u_s\) and \(v_2\) be distinct vertices in \(\mathcal {G}'\). Then the probability that \(u_1,\dots ,u_s\in {\textbf {A}}\) and \({\textbf {v}}_2=v_2\) is at most \(D/n^2\).

Proof of Claim 4

This follows from Claim 3 and the union bound over all choices for \(v_3,\dots ,v_{r-1}\). \(\square \)

Claim 5

Suppose that \(u_1\), \(u_2\), ..., \(u_s\) are distinct vertices in \(\mathcal {G}'\) such that \(\{u_1,\dots ,u_s\}\) is not t-rich in \(\mathcal {G}'\). Then the probability that \(u_j\in {\textbf {A}}\) for every \(j\in [s]\) is at most \(\frac{(r-1)^2(t-1)D}{n^2}\).

Proof of Claim 5

Since \(\{u_1,\dots ,u_s\}\) is not t-rich, the common intersection of the link hypergraphs of \(u_1, \dots , u_s\) does not contain a matching of size t. Hence, there is a set \(T\subset V(\mathcal {G}')\) of size at most \((r-1)(t-1)\) with the property that whenever \(u_jv_2\dots v_r\in E(\mathcal {G}')\) for every \(j\in [s]\), we have \(v_i\in T\) for some \(2\le i\le r\). Therefore, if \(u_j\in {\textbf {A}}\) for every \(j\in [s]\), then \({\textbf {v}}_i\in T\) for some \(2\le i\le r\). By Claim 4, the probability that \(u_j\in {\textbf {A}}\) for every \(j\in [s]\) and \({\textbf {v}}_2\in T\) is at most \(|T|D/n^2\). By symmetry, for any fixed \(2\le i\le r\), the probability that \(u_j\in {\textbf {A}}\) for every \(j\in [s]\) and \({\textbf {v}}_i\in T\) is also at most \(|T|D/n^2\). The claim follows. \(\square \)

Let \({\textbf {b}}\) be the number of sets of size s in \({\textbf {A}}\) which are not t-rich. It follows from Claim 5 that \(\mathbb {E}[{\textbf {b}}]\le \left( {\begin{array}{c}n\\ s\end{array}}\right) \cdot \frac{(r-1)^2 (t-1) D}{n^2}\le r^2 t D n^{s-2}\). By Claim 2 and since \(D\ge 4s\), \(\mathbb {E}\left[ |{\textbf {A}}|^s\mathbbm {1}(|{\textbf {A}}|\ge s)\right] \ge \mathbb {E}\left[ |{\textbf {A}}|^s\right] -s^s\ge \frac{((r-1)!)^s}{r!}D^s-s^s\ge \frac{((r-1)!)^s}{2r!}D^s\), so using that \(\left( {\begin{array}{c}x\\ s\end{array}}\right) \ge (x/s)^s\) for \(x \ge s,\) we have

$$\begin{aligned} \mathbb {E}\left[ \left( {\begin{array}{c}|{\textbf {A}}|\\ s\end{array}}\right) \right] \ge \mathbb {E}\left[ \left( {\begin{array}{c}|{\textbf {A}}|\\ s\end{array}}\right) \mathbbm {1}(|{\textbf {A}}|\ge s)\right] \ge \mathbb {E}[|{\textbf {A}}|^s \mathbbm {1}(|{\textbf {A}}|\ge s)]/s^s\ge \frac{((r-1)!)^s}{r!2s^s}D^s. \end{aligned}$$

Hence,

$$\begin{aligned} \mathbb {E}\left[ \left( {\begin{array}{c}|{\textbf {A}}|\\ s\end{array}}\right) {-}\alpha {\textbf {b}}\right]&{\ge } \frac{((r{-}1)!)^s}{r!2s^s}D^s{-}\alpha r^2 t D n^{s-2}{=}\left( \frac{((r{-}1)!)^s}{r!2s^s}D^{s-1}{-}\alpha r^2 t n^{s-2}\right) D \\&{=}\left( \frac{C^{s{-}1}((r{-}1)!)^s}{2^{s}r!s^s}{-}r^2\right) \alpha t n^{s-2} D{\ge } \alpha t n^{s-2} D {\ge } \alpha ^{\frac{s}{s-1}} t^{\frac{s}{s-1}} n^{s-\frac{s}{s-1}}. \end{aligned}$$

It follows that there is an outcome for which \(\left( {\begin{array}{c}|{\textbf {A}}|\\ s\end{array}}\right) -\alpha {\textbf {b}}\ge \alpha ^{\frac{s}{s-1}} t^{\frac{s}{s-1}} n^{s-\frac{s}{s-1}}\). Then \(|{\textbf {A}}|\ge s\), \(|{\textbf {A}}|\ge \alpha ^{\frac{1}{s-1}}t^{\frac{1}{s-1}}n^{1-\frac{1}{s-1}}\) and the proportion of t-rich sets of size s in \({\textbf {A}}\) is at least \(1-\alpha ^{-1}\), as desired. \(\square \)

It is now not hard to deduce Theorems 1.4 and 1.5.

Proof of Theorem 1.4

Let \(t=|V(G_{X,Y}^{(r)})|\) and let \(\alpha =t^s\). Let C be the constant provided by Theorem 2.2 and let \(C'=C\alpha ^{\frac{1}{s-1}}t^{\frac{1}{s-1}}\). Note that \(C'\) is a constant that depends only on G and r.

Let \(\mathcal {G}\) be an n-vertex r-uniform hypergraph with at least \(C'n^{r-\frac{1}{s-1}}\) edges. By Theorem 2.2, there is a set \(A\subset V(\mathcal {G})\) of size at least \(\alpha ^{\frac{1}{s-1}}t^{\frac{1}{s-1}}n^{1-\frac{1}{s-1}}\) such that the proportion of t-rich sets in A is at least \(1-\alpha ^{-1}\). Note that then \(|A|\ge t\ge |X|\). Moreover, the proportion of t-rich sets in A is greater than \(1-\left( {\begin{array}{c}|X|\\ s\end{array}}\right) ^{-1}\), so A has a subset \(A'\) of size |X| in which all s-sets are t-rich. This implies that \(\mathcal {G}\) contains \(G_{X,Y}^{(r)}\) as a subgraph. Indeed, using that \(t=|V(G_{X,Y}^{(r)})|\), we can construct a copy of \(G_{X,Y}^{(r)}\) by embedding X arbitrarily into \(A'\) and then embedding the sets \(Y_1,\dots ,Y_m\) from the definition of \(G_{X,Y}^{(r)}\) greedily one by one. \(\square \)

Proof of Theorem 1.5

The lower bound was justified in the paragraph before the statement of the theorem, so it is enough to prove the upper bound. Let C be the constant provided by Theorem 2.2 with \(s=2\), and let \(\mathcal {G}\) be an n-vertex r-uniform hypergraph with at least \(100Crtn^{r-1}\) edges. By Theorem 2.2 applied with \(\alpha =100\), \(s=2\) and rt in place of t, there is a set \(A\subset V(\mathcal {G})\) of size at least 100rt such that the proportion of rt-rich sets of size 2 in A is at least 99/100. By known results on the Turán number of cycles, this implies that there is a t-cycle formed by rich pairs, i.e., there are distinct vertices \(x_1,x_2,\dots ,x_t\) in A such that \(\{x_1,x_2\}\), \(\{x_2,x_3\}\), ..., \(\{x_t,x_1\}\) are rt-rich pairs. We give a direct proof of this statement for completeness.

We claim that there is a set \(A'\subset A\) of size at least 4t such that for each \(u\in A'\), the number of \(v\in A'\) for which \(\{u,v\}\) is rt-rich is at least \(3|A'|/4\). Indeed, let \(A_0=A\) and, recursively for every i:

  • if there is some \(u\in A_i\) such that the number of vertices \(v\in A_i\) for which \(\{u,v\}\) is rt-rich is less than \(3|A_i|/4\), then choose such a vertex and let \(A_{i+1}=A_i\setminus \{u\}\),

  • else terminate the process and let \(A'=A_i\).

Clearly, we obtain a set \(A'\) such that for each \(u\in A'\), the number of \(v\in A'\) for which \(\{u,v\}\) is rt-rich is at least \(3|A'|/4\); we just need to show that \(|A'|\ge 4t\). If \(|A'|<4t\), then we have deleted at least |A|/2 vertices which implies that there were at least \(\frac{|A|}{2}\cdot (|A|/8-1)>\frac{1}{100}\left( {\begin{array}{c}|A|\\ 2\end{array}}\right) \) pairs in A which are not rt-rich. This is a contradiction, so indeed \(|A'|\ge 4t\).

Observe that for any \(v, v' \in A'\), there are at least \(|A'| / 2 \ge 2t\) vertices \(u\in A'\) such that the pairs \(\{u, v\}\) and \(\{u, v'\}\) are rt-rich. We can now greedily find distinct vertices \(x_1,x_2,\dots ,x_t\) in \(A'\) such that \(\{x_1,x_2\}\), \(\{x_2,x_3\}\), ..., \(\{x_t,x_1\}\) are rt-rich pairs.

Since \(|V(C_{2t}^{(r)})| = rt\), we can greedily find pairwise disjoint sets \(Y_1, \dots , Y_t\) in \(V(\mathcal {G}){\setminus } \{x_1,\dots ,x_t\}\) such that \(\{x_i\} \cup Y_i, \{x_{i+1}\} \cup Y_i \in E(\mathcal {G})\) for all \(i \in [t]\), where we let \(x_{t+1} = x_1\). Hence, \(\mathcal {G}\) contains \(C_{2t}^{(r)}\) as a subgraph, completing the proof. \(\square \)

3 Lower Bounds

In this section we prove Theorem 1.2. The key ingredient is the following lemma.

Lemma 3.1

Let A and B be two disjoint sets of size n. Assume that there exist pairwise edge-disjoint bipartite graphs \(G_1,G_2,\dots ,G_m\) with parts A and B such that for any distinct vertices \(x_1,x_2,\dots ,x_s\in A\cup B\), there are fewer than t vertices \(y\in A\cup B\) for which there exists \(i\in [m]\) (that may depend on y) with \(x_1y,x_2y,\dots ,x_sy\in E(G_i)\). Let \(e=\sum _{i=1}^m e(G_i)\). Then

$$\begin{aligned} \textrm{ex}\left( 2kn,K_{s,t}^{(2k)}\right) \ge e^k/m. \end{aligned}$$

Proof

Let \(X_1,X_2,\dots ,X_{2k}\) be pairwise disjoint sets of size n. For every \(1\le p\le m\), we define a 2k-partite 2k-uniform hypergraph \(\mathcal {G}(p)\) with parts \(X_1,X_2,\dots ,X_{2k}\) as follows. For \(x_1\in X_1,\dots ,x_{2k}\in X_{2k}\), we let \(x_1x_2\dots x_{2k}\) be a hyperedge in \(\mathcal {G}(p)\) if and only if there exist \(1\le i_1,i_2,\dots ,i_k\le m\) such that \(i_1+\dots +i_k \equiv p \mod m\) and for each \(1\le \ell \le k\), we have \(x_{2\ell -1}x_{2\ell }\in E(G_{i_\ell })\), where \(X_{2\ell -1}\) is identified with A and \(X_{2\ell }\) is identified with B. Now clearly, \(|\bigcup _{p=1}^m E(\mathcal {G}(p))|=|\bigcup _{i=1}^m E(G_i)|^k=e^k\). Hence, there exists some p for which \(e(\mathcal {G}(p))\ge e^k/m\).

It is therefore sufficient to prove that \(\mathcal {G}(p)\) is \(K_{s,t}^{(2k)}\)-free for every p. Suppose otherwise. By symmetry, we may assume that there are distinct vertices \(x_{1,1},\dots ,x_{1,s}\in X_1\), \(x_{\alpha ,\beta }\in X_\alpha \) for all \(2\le \alpha \le 2k\) and \(1\le \beta \le t\) such that \(x_{1,i}x_{2,\beta }x_{3,\beta }\dots x_{2k,\beta }\in E(\mathcal {G})\) for each \(1\le i\le s\) and \(1\le \beta \le t\). Clearly, for each \(2\le \ell \le k\) and \(1\le \beta \le t\), there is a unique \(i_{\ell }(\beta )\in [m]\) such that \(x_{2\ell -1,\beta }x_{2\ell ,\beta }\in E(G_{i_{\ell }(\beta )})\). Moreover, for any \(1\le j\le s\) and \(1\le \beta \le t\) there is a unique \(i_1(j,\beta )\in [m]\) such that \(x_{1,j}x_{2,\beta }\in E(G_{i_1(j,\beta )})\).

By the definition of \(\mathcal {G}(p)\), for any \(1\le j\le s\) and \(1\le \beta \le t\), \(i_1(j,\beta )+i_2(\beta )+\dots +i_k(\beta )\equiv p \mod m\). Hence, \(i_1(1,\beta )=i_1(2,\beta )=\dots =i_1(s,\beta )\). Then for every \(1\le \beta \le t\), there is some \(i\in [m]\) such that \(x_{1,1}x_{2,\beta },x_{1,2}x_{2,\beta },\dots ,x_{1,s}x_{2,\beta }\in E(G_i)\) (namely \(i=i_1(1,\beta )=i_1(2,\beta )=\dots =i_1(s,\beta )\)). By assumption, the vertices \(x_{2,\beta }, 1 \le \beta \le t\) are all distinct which contradicts the properties of the graphs \(G_i\). \(\square \)

We now want to show that for \(m\approx (n/t)^{\frac{1}{s-1}}\), one can almost completely cover the edge set of \(K_{n,n}\) with pairwise edge-disjoint graphs \(G_1,\dots ,G_m\) satisfying the property described in Lemma 3.1. The bound in Lemma 3.1 will then give \(\textrm{ex}(2kn,K_{s,t}^{(2k)})\gtrapprox t^{\frac{1}{s-1}}n^{2k-\frac{1}{s-1}}\).

The following lemma provides a suitable collection of subgraphs under some mild divisibility conditions.

Lemma 3.2

Let \(s\ge 2\) and h be positive integers and let p be a prime congruent to 1 modulo h. Let \(m=(p-1)/h\). Then there are pairwise edge-disjoint bipartite graphs \(G_1,\dots ,G_{m}\) with the same parts A and B such that \(|A|=|B|=p^{s-1}\), \(|\bigcup _{i=1}^{m} E(G_i)|=p^{2s-2}-p^{s-1}\) and for any distinct vertices \(x_1,\dots ,x_s\in A\cup B\) there are at most \(h^{s-1}(s-1)!\) vertices \(y\in A\cup B\) for which there exists \(i\in [m]\) with \(x_1y,x_2y,\dots ,x_sy\in E(G_i)\).

In the proof, we make use of the following result of Kollár, Rónyai and Szabó which was used to obtain their celebrated lower bound for the Turán number of complete bipartite graphs; see also [1] for a refinement.

Lemma 3.3

[10, Theorem 3.3] Let K be a field and let \(a_{i,j},b_i\in K\) for \(1\le i,j\le t\) such that \(a_{i,j_1}\ne a_{i,j_2}\) for any \(i\in [t]\) and \(j_1\ne j_2\). Then the system of equations

$$\begin{aligned} (z_1-a_{1,1})(z_2-a_{2,1})\dots (z_t-a_{t,1})=b_1, \\ (z_1-a_{1,2})(z_2-a_{2,2})\dots (z_t-a_{t,2})=b_2, \\ \vdots \\ (z_1-a_{1,t})(z_2-a_{2,t})\dots (z_t-a_{t,t})=b_t \end{aligned}$$

has at most t! solutions \((z_1,z_2,\dots ,z_t)\in K^t\).

Proof of Lemma 3.2

Let A and B be disjoint copies of the field \(\mathbb {F}_{p^{s-1}}\). Let H be a subgroup of \(\mathbb {F}_p^{\times }\) of order h, where \(\mathbb {F}_p^{\times }\) denotes the multiplicative group of \(\mathbb {F}_p\). Let \(S_1,S_2,\dots ,S_m\) be the cosets of H in \(\mathbb {F}_p^{\times }\).

Recall that the norm map \(N :\mathbb {F}_{p^{s-1}}\rightarrow \mathbb {F}_p\) is defined as \(N(x) = x \cdot x^p \cdot x^{p^2} \cdots x^{p^{s-2}}\) and note that \(N(xy) = N(x)N(y)\) for any \(x, y \in \mathbb {F}_{p^{s-1}}\). For \(x\in A\), \(y\in B\) and \(i\in [m]\), let xy be an edge in \(G_i\) if and only if \(N(x+y)\in S_i\). Since \(S_1,\dots ,S_m\) partition \(\mathbb {F}_p^{\times }\) and \(N(z)=0\) if and only if \(z=0\), it follows that \(G_1,G_2,\dots ,G_m\) are pairwise edge-disjoint and \(\bigcup _{i=1}^{m} E(G_i)=(A\times B){\setminus } \{(x,-x):x\in \mathbb {F}_{p^{s-1}}\}\). Hence, \(|\bigcup _{i=1}^{m} E(G_i)|=p^{2\,s-2}-p^{s-1}\).

We are left to show that for any distinct vertices \(x_1,\dots ,x_s\in A\cup B\) there are at most \(h^{s-1}(s-1)!\) vertices \(y\in A\cup B\) for which there exists \(i\in [m]\) with \(x_1y,x_2y,\dots ,x_sy\in E(G_i)\). We may assume without loss of generality that \(x_j\in A\) for each \(j\in [s]\). Suppose that for some \(y\in B\) there is \(i\in [m]\) with \(x_1y,x_2y,\dots ,x_sy\in E(G_i)\). This means that \(N(x_j+y)\in S_i\) holds for each \(j\in [s]\). Then \(N(\frac{x_j+y}{x_s+y})=N(x_j+y)/N(x_s+y)\in H\) for each \(j\in [s-1]\). \(\square \)

Claim 6

Let \(x_1,\dots ,x_s\) be distinct elements of \(\mathbb {F}_{p^{s-1}}\) and let \(\lambda _1,\dots ,\lambda _{s-1}\in H\). Then there are at most \((s-1)!\) elements \(y\in \mathbb {F}_{p^{s-1}}\) such that \(N(\frac{x_j+y}{x_s+y})=\lambda _j\) for each \(j\in [s-1]\).

Since there are \(h^{s-1}\) ways to choose the possible values of \(N(\frac{x_j+y}{x_s+y})\) for \(j\in [s-1]\) from H, the claim implies the lemma.

Proof of Claim

Note that \(N(\frac{x_j+y}{x_s+y})=\lambda _j\) is equivalent to \(N(\frac{1}{x_s+y}+\frac{1}{x_j-x_s})=\lambda _j/N(x_j-x_s)\). Setting \(z=\frac{1}{x_s+y}\), \(a_j=\frac{1}{x_j-x_s}\) and \(b_j=\lambda _j/N(x_j-x_s)\), the problem is reduced to counting the number of solutions to the system of equations

$$\begin{aligned} \begin{aligned} N(z+a_1)&=b_1, \\ N(z+a_2)&=b_2, \\&\vdots \\ N(z+a_{s-1})&=b_{s-1} \end{aligned} \end{aligned}$$
(2)

in the variable z. Since \(N(z+a_j)=(z+a_j)(z^p+a_j^p)\dots (z^{p^{s-2}}+a_j^{p^{s-2}})\), we can apply Lemma 3.3 (with \(K=\mathbb {F}_{p^{s-1}}\), \(t=s-1\), \(a_{i,j}=-a_j^{p^{i-1}}\), \(z_i=z^{p^{i-1}}\)) to see that (2) has at most \((s-1)!\) solutions for z, completing the proof of the claim. \(\square \)

Remark 3.4

One can prove a variant of Lemma 3.2 using the random algebraic method of Bukh (see [2] for a detailed example of how this method is applied). More precisely, one can take a uniformly random polynomial \(f:\mathbb {F}_p^{2s-2}\rightarrow \mathbb {F}_p\) of a given (large) degree and set \(E(G_i)=\{xy: x,y\in \mathbb {F}_p^{s-1}, f(x,y)\in S_i\}\) for all \(i\in [m]\), where \(S_i\) are defined as in the proof of Lemma 3.2. The proof above uses essentially the same construction for the explicit choice \(f(x,y)=N(x+y)\) (with the minor difference that the parts there are identified with \(\mathbb {F}_{p^{s-1}}\) rather than \(\mathbb {F}_p^{s-1}\)).

We are now in a position to prove Theorem 1.2.

Proof of Theorem 1.2

Let \(h=\lfloor ((t-1)/(s-1)!)^{\frac{1}{s-1}}\rfloor \ge 1\). Choose a prime p such that \(p\equiv 1 \mod h\) and \(\frac{1}{2}(\frac{n}{2k})^{\frac{1}{s-1}}\le p\le (\frac{n}{2k})^{\frac{1}{s-1}}\) (since n is sufficiently large, such a prime exists by the prime number theorem for arithmetic progressions). Let \(m=(p-1)/h\). Note that \(h^{s-1}(s-1)!\le t-1\). By the existence of the bipartite graphs provided by Lemma 3.2 and by Lemma 3.1, we get

$$\begin{aligned} \textrm{ex}(2kp^{s-1},K_{s,t}^{(2k)})\ge & {} (p^{2s-2}-p^{s-1})^k/m\ge 2^{-k}p^{2(s-1)k}/m\nonumber \\ {}\ge & {} 2^{-k}hp^{2(s-1)k-1}\ge ct^{\frac{1}{s-1}}n^{2k-\frac{1}{s-1}} \end{aligned}$$

for some positive constant \(c=c(k,s)\). Since \(2kp^{s-1}\le n\), this completes the proof.

\(\square \)

4 An Improved Upper Bound for \(r=3\)

In this section we prove Theorem 1.3. The following definition will be crucial in the proof. Here and in the rest of this section, we will ignore floor and ceiling signs whenever doing so does not make a substantial difference.

Definition 4.1

Let \(s\ge 3\) be an integer and let \(\mathcal {G}\) be a 3-uniform 3-partite hypergraph with parts X, Y and Z of size n each. We call a vertex \(z\in Z\) s-nice in \(\mathcal {G}\) if there exist partitions \(X=X_1\cup \dots \cup X_{n^{\frac{1}{s-1}}}\) and \(Y=Y_1\cup \dots \cup Y_{n^{\frac{1}{s-1}}}\) into sets of size \(n^{1-\frac{1}{s-1}}\) such that if \(xyz\in E(\mathcal {G})\) for some \(x\in X_i\), then \(y\in Y_i\). We define s-nice vertices in X and Y analogously.

Observe that if some vertex z is s-nice in \(\mathcal {G}\), then it is also s-nice in any subhypergraph of \(\mathcal {G}\).

The proof of Theorem 1.3 will consist of two main steps. First, we prove the following structural result which states that (under a mild condition on the maximum degree) if a \(K_{s,t}^{(3)}\)-free hypergraph has close to \(n^{3-\frac{1}{s-1}}\) edges, then it contains a subgraph with a similar number of edges in which all vertices in two of the parts are nice.

Lemma 4.2

Let \(s\ge 3\), let t be a positive integer, let \(\varepsilon >0\) and let n be sufficiently large. Let \(\mathcal {G}\) be a \(K_{s,t}^{(3)}\)-free 3-uniform 3-partite hypergraph with parts X, Y and Z of size n each. Assume that \(e(\mathcal {G})\ge n^{3-\frac{1}{s-1}-\varepsilon }\) and that every pair of vertices belongs to at most \(n^{1-\frac{1}{s-1}+\varepsilon }\) hyperedges. Then \(\mathcal {G}\) has a subhypergraph \(\mathcal {H}\) (on the same vertex set) such that \(e(\mathcal {H})\ge n^{3-\frac{1}{s-1}-225\,s^4\varepsilon }\) and every vertex in \(X\cup Y\), \(Y\cup Z\) or \(Z\cup X\) is s-nice in \(\mathcal {H}\).

The second step is showing that (again under some mild conditions on the degrees) such a structured hypergraph must contain \(K_{s,t}^{(3)}\).

Lemma 4.3

Let \(s\ge 3\), let t be a positive integer, let \(0<\varepsilon <\frac{1}{4s+6}\) and let n be sufficiently large. Let \(\mathcal {G}\) be a 3-uniform 3-partite hypergraph with parts X, Y and Z of size n each. Assume that the number of hyperedges containing any given pair of vertices in \(\mathcal {G}\) is either 0 or between \(n^{1-\frac{1}{s-1}-\varepsilon }\) and \(n^{1-\frac{1}{s-1}+\varepsilon }\). Assume that \(xyz\in E(\mathcal {G})\) for some \(x\in X\), \(y\in Y\) and \(z\in Z\) and that every vertex in \(Z\cup \{x\}\) is s-nice in \(\mathcal {G}\). Then \(\mathcal {G}\) contains a copy of \(K_{s,t}^{(3)}\).

We will give the proof of Lemmas 4.2 and 4.3 in Sects. 4.1 and 4.2, respectively. Now let us see how these lemmas imply Theorem 1.3. The last ingredient is a lemma that shows that we can assume that no pair of vertices belongs to many hyperedges.

Lemma 4.4

Let \(s\ge 3\), let t be a positive integer, let \(0<\varepsilon <1/2\), let n be sufficiently large and let \(\mathcal {G}\) be a \(K_{s,t}^{(3)}\)-free 3-uniform hypergraph with 3n vertices and at least \(n^{3-\frac{1}{s-1}-\varepsilon }\) hyperedges. Then \(\mathcal {G}\) has a 3-partite subgraph \(\mathcal {H}\) with parts of size n such that \(e(\mathcal {H})\ge n^{3-\frac{1}{s-1}-2\varepsilon }\) and any pair of vertices belongs to at most \(n^{1-\frac{1}{s-1}+2\varepsilon }\) hyperedges in \(\mathcal {H}\).

Proof

Clearly \(\mathcal {G}\) has a 3-partite subgraph \(\mathcal {G}'\), with parts XYZ of size n each, such that \(e(\mathcal {G}')\ge \frac{2}{9}e(\mathcal {G})\). For each \(e=xyz\in E(\mathcal {G}')\), let \(\lambda (e)=\max (d_{\mathcal {G}'}(x,y),d_{\mathcal {G}'}(y,z),d_{\mathcal {G}'}(z,x))\). Now there is a positive integer \(1\le b\le \log _2 n\) such that \(\mathcal {G}'\) has at least \(e(\mathcal {G}')/\log _2 n\) edges e with \(2^{b-1}\le \lambda (e)\le 2^{b}\). By symmetry, we may assume that there are at least \(n^{3-\frac{1}{s-1}-\varepsilon -o(1)}\) triples \((x,y,z)\in X\times Y\times Z\) such that \(xyz\in E(\mathcal {G}')\), \(d_{\mathcal {G}'}(x,y)\ge 2^{b-1}\) and \(d_{\mathcal {G}'}(x,y),d_{\mathcal {G}'}(y,z),d_{\mathcal {G}'}(z,x)\le 2^{b}\). Let \(\mathcal {H}\) be the subgraph of \(\mathcal {G}'\) consisting of precisely these edges xyz. Clearly, \(2^{b}\ge n^{1-\frac{1}{s-1}-\varepsilon -o(1)}\ge \omega (1)\), so there are at least \(n^{3-\frac{1}{s-1}-\varepsilon -o(1)}(2^{b-1})^{s-1}\) many \((s+2)\)-tuples \((x,y,z_1,\dots ,z_s)\in X\times Y\times Z^s\) of distinct vertices such that \(xyz_1\in E(\mathcal {H})\) and \(xyz_i\in E(\mathcal {G}')\) for each \(i\in [s]\). Write \(\mathcal {A}\) for the set of these tuples. By the pigeon hole principle, we can choose some \((z_1,\dots ,z_s)\in Z^s\) which features at least \(n^{-s}n^{3-\frac{1}{s-1}-\varepsilon -o(1)}(2^{b-1})^{s-1}\) many times in \(\mathcal {A}\). Since \(\mathcal {G}'\) does not contain \(K_{s,t}^{(3)}\) as a subgraph, there is a set \(T\subset X\cup Y\) of size at most \(2(t-1)\) such that if \((x,y,z_1,\dots ,z_s)\in \mathcal {A}\), then \(x\in T\) or \(y\in T\). By symmetry, we may therefore assume that for some \(y_0\in T \cap Y\) there are at least \(\frac{1}{2t-2}n^{-s}n^{3-\frac{1}{s-1}-\varepsilon -o(1)}(2^{b-1})^{s-1}\) vertices \(x\in X\) such that \((x,y_0,z_1,\dots ,z_s)\in \mathcal {A}\). In particular, \(d_{\mathcal {G}'}(y_0,z_1)\ge \frac{1}{2t-2}n^{-s}n^{3-\frac{1}{s-1}-\varepsilon -o(1)}(2^{b-1})^{s-1}\). On the other hand, \(d_{\mathcal {G}'}(y_0,z_1)\le 2^b\) by the definition of \(\mathcal {A}\). Hence,

$$\begin{aligned} \frac{1}{2t-2}n^{-s}n^{3-\frac{1}{s-1}-\varepsilon -o(1)}(2^{b-1})^{s-1}\le 2^b, \end{aligned}$$

so \(2^b\le n^{1-\frac{1}{s-1}+\frac{\varepsilon }{s-2}+o(1)}\le n^{1-\frac{1}{s-1}+2\varepsilon }\). This implies that every pair of vertices belongs to at most \(n^{1-\frac{1}{s-1}+2\varepsilon }\) hyperedges in \(\mathcal {H}\). \(\square \)

We can now prove Theorem 1.3.

Proof of Theorem 1.3

Let \(\varepsilon <\frac{1}{900s^4(2s+3)}\), let n be sufficiently large and assume, for the sake of contradiction, that \(\mathcal {G}\) is a \(K_{s,t}^{(3)}\)-free 3-uniform hypergraph with 3n vertices and at least \(n^{3-\frac{1}{s-1}-\varepsilon }\) edges.

By Lemma 4.4, \(\mathcal {G}\) has a 3-partite subgraph \(\mathcal {G}'\) with parts XYZ of size n such that \(e(\mathcal {G}')\ge n^{3-\frac{1}{s-1}-2\varepsilon }\) and any pair of vertices belongs to at most \(n^{1-\frac{1}{s-1}+2\varepsilon }\) hyperedges in \(\mathcal {G}'\). Lemma 4.2 implies that \(\mathcal {G}'\) has a subgraph \(\mathcal {G}''\) (on the same vertex set) such that \(e(\mathcal {G}'')\ge n^{3-\frac{1}{s-1}-450s^4 \varepsilon }\) and every vertex in \(X\cup Y\), \(Y\cup Z\) or \(Z\cup X\) is s-nice in \(\mathcal {G}''\). By successively removing edges which contain a pair of vertices lying in less than \(D = \frac{1}{10}n^{1-\frac{1}{s-1}-450s^4 \varepsilon }\) edges, we obtain a non-empty subgraph \(\mathcal {G}'''\) (on the same vertex set) in which every pair of vertices belongs to either 0 or at least D hyperedges. Hence, since \(450s^4 \varepsilon <\frac{1}{4s+6}\), Lemma 4.3 implies that \(\mathcal {G}'''\) contains \(K_{s,t}^{(3)}\) as a subgraph, which is a contradiction. \(\square \)

4.1 Finding a Structured Subgraph in \(\mathcal {G}\)

In this subsection, we prove Lemma 4.2. In what follows, for a graph G and vertices \(u_1,\dots ,u_k\in V(G)\), we write \(d_G(u_1,\dots ,u_k)\) for the number of common neighbours of \(u_1,\dots ,u_k\) in G. With a slight abuse of notation, for a 3-uniform hypergraph \(\mathcal {G}\), we still write \(d_{\mathcal {G}}(u,v)\) for the number of hyperedges in \(\mathcal {G}\) containing both u and v.

Lemma 4.5

Let \(s\ge 3\), let \(\varepsilon >0\) and let n be sufficiently large. Let \(G=(X,Y)\) be a bipartite graph on \(n+n\) vertices such that \(\Delta (G)\le n^{1-\frac{1}{s-1}+\varepsilon }\) and the number of s-tuples \((u_1,\dots ,u_s)\in X^s\) with \(d_G(u_1,\dots ,u_s)\ge n^{1-\frac{1}{s-1}-\varepsilon }\) is at least \(n^{s-1-\varepsilon }\). Then there are pairwise disjoint sets \(U_1,U_2,\dots ,U_k\subset X\) and \(V_1,V_2,\dots ,V_k\subset Y\) of size \(n^{1-\frac{1}{s-1}}\) for some \(k\ge n^{\frac{1}{s-1}-(3\,s+1)\varepsilon }\) such that \(G[U_i,V_i]\) has at least \(n^{2-\frac{2}{s-1}-4\varepsilon }\) edges for each \(i\in [k]\).

Proof

Assume that for some \(j<n^{\frac{1}{s-1}-(3s+1)\varepsilon }\) we have already found pairwise disjoint sets \(U_1,U_2,\dots ,U_j\subset X\) and \(V_1,V_2,\dots ,V_j\subset Y\) of size \(n^{1-\frac{1}{s-1}}\) such that \(G[U_i,V_i]\) has at least \(n^{2-\frac{2}{s-1}-4\varepsilon }\) edges for each \(i\in [j]\). We show how to find the next pair of subsets \(U_{j+1}, V_{j+1}\). Write \(U=\bigcup _{i\in [j]} U_i\) and \(V=\bigcup _{i\in [j]} V_i\). Clearly, \(|U|\le n^{1-(3s+1)\varepsilon }\) and \(|V|\le n^{1-(3s+1)\varepsilon }\). Let \(W=\{x\in X: |N_G(x)\cap V|\ge \frac{1}{2}n^{1-\frac{1}{s-1}-\varepsilon }\}\). By double counting the edges between W and V, we get \(|W|\cdot \frac{1}{2}n^{1-\frac{1}{s-1}-\varepsilon }\le |V|\Delta (G)\), which implies that \(|W|\le 2n^{1-(3s-1)\varepsilon }\).

For a vertex \(u\in X\), let \(S_u=\{x\in X: d_G(u,x)\ge n^{1-\frac{1}{s-1}-\varepsilon }\}\). By double counting the edges between \(S_u\) and \(N_G(u)\), we get \(|S_u|n^{1-\frac{1}{s-1}-\varepsilon }\le \Delta (G)^2\), so \(|S_u|\le n^{1-\frac{1}{s-1}+3\varepsilon }\). It follows that for any \(u\in X\), the number of \((u_2,\dots ,u_s)\in X^{s-1}\) with \(d_G(u,u_2,u_3,\dots ,u_s)\ge n^{1-\frac{1}{s-1}-\varepsilon }\) is at most \(|S_u|^{s-1}\le n^{s-2+3(s-1)\varepsilon }\). Therefore, the number of \((u_1,\dots ,u_s)\in X^s\) with \(d_G(u_1,\dots ,u_s)\ge n^{1-\frac{1}{s-1}-\varepsilon }\) such that \(u_i\in U\cup W\) for some \(i\in [s]\) is at most \(s(|U|+|W|)n^{s-2+3(s-1)\varepsilon }\le 3sn^{s-1-2\varepsilon }\le \frac{1}{2}n^{s-1-\varepsilon }\). On the other hand, by assumption, the number of s-tuples \((u_1,\dots ,u_s) \in X^s\) with \(d_G(u_1,\dots ,u_s)\ge n^{1-\frac{1}{s-1}-\varepsilon }\) is at least \(n^{s-1-\varepsilon }\). Hence, there exists some \(u\in X\setminus (U\cup W)\) such that there are at least \(\frac{1}{2}n^{s-2-\varepsilon }\) tuples \((u_2,\dots ,u_s)\in (X\setminus (U\cup W))^{s-1}\) with \(d_G(u,u_2,u_3,\dots ,u_s)\ge n^{1-\frac{1}{s-1}-\varepsilon }\). This implies that there are at least \((\frac{1}{2}n^{s-2-\varepsilon })^{\frac{1}{s-1}}\ge \frac{1}{2}n^{1-\frac{1}{s-1}-\varepsilon }\) vertices \(x\in X{\setminus } (U\cup W)\) with \(d_G(u,x)\ge n^{1-\frac{1}{s-1}-\varepsilon }\). Since \(u\not \in W\), we have \(|N_G(u)\cap V|< \frac{1}{2}n^{1-\frac{1}{s-1}-\varepsilon }\), so there are at least \(\frac{1}{2}n^{1-\frac{1}{s-1}-\varepsilon }\) vertices \(x\in X{\setminus } (U\cup W)\) with \(|N_G(u)\cap N_G(x){\setminus } V|\ge \frac{1}{2}n^{1-\frac{1}{s-1}-\varepsilon }\). This means that we can choose a set \(U_{j+1}\subset X{\setminus } U\) of size \(n^{1-\frac{1}{s-1}}\) which sends at least \((\frac{1}{2}n^{1-\frac{1}{s-1}-\varepsilon })^2=\frac{1}{4}n^{2-\frac{2}{s-1}-2\varepsilon }\) edges to \(N_G(u)\setminus V\). Since \(|N_G(u){\setminus } V|\le \Delta (G)\), there exists a set \(V_{j+1}\subset Y{\setminus } V\) of size \(n^{1-\frac{1}{s-1}}\) such that the number of edges in \(G[U_{j+1},V_{j+1}]\) is at least \(\frac{1}{4}n^{2-\frac{2}{s-1}-2\varepsilon } \cdot \min \left( 1,n^{1-\frac{1}{s-1}}/\Delta (G)\right) \ge \frac{1}{4}n^{2-\frac{2}{s-1}-3\varepsilon }\ge n^{2-\frac{2}{s-1}-4\varepsilon }\). This completes the proof. \(\square \)

Lemma 4.6

Let \(s\ge 3\), let t be a positive integer, let \(\varepsilon >0\) and let n be sufficiently large. Let \(\mathcal {G}\) be a \(K_{s,t}^{(3)}\)-free 3-uniform 3-partite hypergraph with parts X, Y and Z of size n each. Assume that \(e(\mathcal {G})\ge n^{3-\frac{1}{s-1}-\varepsilon }\) and that every pair of vertices belongs to at most \(n^{1-\frac{1}{s-1}+\varepsilon }\) hyperedges. Then \(\mathcal {G}\) has a subhypergraph \(\mathcal {G}'\) (on the same vertex set) such that \(e(\mathcal {G}')\ge n^{3-\frac{1}{s-1}-15\,s^2\varepsilon }\) and either every vertex in Y or every vertex in Z is s-nice in \(\mathcal {G}'\).

Proof

We may assume that \(\varepsilon <1/4\), else the conclusion of the lemma holds trivially. Using \(e(\mathcal {G})\ge n^{3-\frac{1}{s-1}-\varepsilon }\), by convexity there is a set \(\mathcal {T}\) of at least \(\Omega _s(n^2(n^{1-\frac{1}{s-1}-\varepsilon })^s) = \Omega _s(n^{s+1-\frac{1}{s-1}-s\varepsilon })\) tuples \((x_1,x_2,\dots ,x_s,y,z)\) of distinct vertices such that \(x_i\in X\), \(y\in Y\) and \(z\in Z\) and \(x_iyz\in E(\mathcal {G})\) for every i. Since \(\mathcal {G}\) is \(K_{s,t}^{(3)}\)-free, for any distinct \(x_1,\dots ,x_s\in X\) there is a set \(S\subset Y\cup Z\) of size at most \(2t-2\) such that for each \(y\in Y\) and \(z\in Z\) for which \((x_1,x_2,\dots ,x_s,y,z)\in \mathcal {T}\), we have \(y\in S\) or \(z\in S\). Since every pair of vertices in \(\mathcal {G}\) is in at most \(n^{1-\frac{1}{s-1}+\varepsilon }\) hyperedges, it follows that any fixed \((x_1,\dots ,x_s)\) extends to at most \((2t-2)n^{1-\frac{1}{s-1}+\varepsilon }\) members of \(\mathcal {T}\). Hence, there are at least \(\frac{\frac{1}{2}|\mathcal {T}|}{(2t-2)n^{1-\frac{1}{s-1}+\varepsilon }}\) tuples \((x_1,\dots ,x_s)\) which extend to at least \(\frac{\frac{1}{2}|\mathcal {T}|}{n^s}\) members of \(\mathcal {T}\). For each such \((x_1,\dots ,x_s)\) there is some \(z\in Y\cup Z\) such that \(d_{\mathcal {G}}(\{x_1,\dots ,x_s\},z)\ge \frac{1}{2t-2} \cdot \frac{\frac{1}{2}|\mathcal {T}|}{n^s} \ge \Omega _{s,t}(n^{1 - \frac{1}{s-1}-s\varepsilon }) \ge n^{1-\frac{1}{s-1}-(s+1)\varepsilon }\). Here \(d_{\mathcal {G}}(\{x_1,\dots ,x_s\},z)\) denotes the number of vertices \(y\in V(\mathcal {G})\) such that \(x_iyz\in E(\mathcal {G})\) holds for all \(i\in [s]\). Hence, the number of tuples \((x_1,x_2,\dots ,x_s,z)\in X^s\times (Y\cup Z)\) of distinct vertices with \(d_{\mathcal {G}}(\{x_1,\dots ,x_s\},z)\ge n^{1-\frac{1}{s-1}-(s+1)\varepsilon }\) is at least \(\frac{\frac{1}{2}|\mathcal {T}|}{(2t-2)n^{1-\frac{1}{s-1}+\varepsilon }}\ge \Omega _{s,t}(n^{s-(s+1)\varepsilon })\ge 4n^{s-(s+2)\varepsilon }\). By the symmetry of Y and Z, we may assume, without loss of generality, that there are at least \(2n^{s-(s+2)\varepsilon }\) tuples \((x_1,x_2,\dots ,x_s,z)\in X^s\times Z\) with \(d_{\mathcal {G}}(\{x_1,\dots ,x_s\},z)\ge n^{1-\frac{1}{s-1}-(s+1)\varepsilon }\).

For every \(z\in Z\), define a bipartite graph \(G_z\) with parts X and Y where xy is an edge in \(G_z\) if and only if \(xyz\in E(\mathcal {G})\). Observe that for any \(z\in Z\), we have \(\Delta (G_z)\le n^{1-\frac{1}{s-1}+\varepsilon }\). By the previous paragraph,

$$\begin{aligned} \sum _{z\in Z} \left| \left\{ (x_1,\dots ,x_s)\in X^s: d_{G_z}(x_1,\dots ,x_s)\ge n^{1-\frac{1}{s-1}-(s+1)\varepsilon } \right\} \right| \ge 2n^{s-(s+2)\varepsilon }. \end{aligned}$$
(3)

On the other hand, we claim that for any \(z\in Z\), we have

$$\begin{aligned} \left| \left\{ (x_1,\dots ,x_s)\in X^s: d_{G_z}(x_1,\dots ,x_s)\ge n^{1-\frac{1}{s-1}-(s+1)\varepsilon } \right\} \right| \le n^{s-1+(s-1)(s+3)\varepsilon }.\nonumber \\ \end{aligned}$$
(4)

Indeed, let \(x \in X\) and let \(S_x = \{x' \in X: d_{G_z}(x,x') \ge n^{1-\frac{1}{s-1}-(s+1)\varepsilon } \}\). By double counting the edges of \(G_z\) between \(S_x\) and \(N_{G_z}(x)\), we have \(|S_x| \cdot n^{1-\frac{1}{s-1}-(s+1)\varepsilon } \le \Delta (G_z)^2 \le n^{2-\frac{2}{s-1}+2\varepsilon }\) and so \(|S_x| \le n^{1-\frac{1}{s-1} + (s+3)\varepsilon }\). Hence, the number of \(x_2,\dots ,x_s \in X\) satisfying \(d_{G_z}(x,x_2,\dots ,x_s) \ge n^{1-\frac{1}{s-1}-(s+1)\varepsilon }\) is at most \(|S_x|^{s-1} \le n^{s-2 + (s-1)(s+3)\varepsilon }\). Now (4) follows by summing over x.

By (3) and (4), there are at least \(n^{1-(s+2)\varepsilon -(s-1)(s+3)\varepsilon }\) vertices \(z\in Z\) for which

$$\begin{aligned} \left| \left\{ (x_1,\dots ,x_s)\in X^s: d_{G_z}(x_1,\dots ,x_s)\ge n^{1-\frac{1}{s-1}-(s+1)\varepsilon } \right\} \right| \ge n^{s-1-(s+2)\varepsilon }. \end{aligned}$$
(5)

We now define a suitable subhypergraph of \(\mathcal {G}\). For any vertex z satisfying (5), Lemma 4.5 (applied for \(G_z\) with \((s+2)\varepsilon \) in place of \(\varepsilon \)) implies that for some \(k \ge n^{\frac{1}{s-1}-(3s+1)(s+2)\varepsilon }\) there are disjoint sets \(X_1, \dots , X_k \subseteq X\) and \(Y_1, \dots , Y_k \subseteq Y\) of size \(n^{1-\frac{1}{s-1}}\) such that for all \(i \in [k]\), in \(G_z\) there are at least \(n^{2-\frac{2}{s-1}-4(s+2)\varepsilon }\) edges between \(X_i\) and \(Y_i\). Among the hyperedges of \(\mathcal {G}\) containing z, keep those xyz for which there is \(i \in [k]\) such that \(x\in X_i\) and \(y\in Y_i\). Thus, for each z satisfying (5) we keep at least \(k \cdot n^{2-\frac{2}{s-1}-4(s+2)\varepsilon } \ge n^{2-\frac{1}{s-1} - (3s+5)(s+2)\varepsilon }\) edges containing it. For each \(z\in Z\) which does not satisfy (5), delete all hyperedges of \(\mathcal {G}\) containing z. Call the resulting subhypergraph \(\mathcal {G}'\).

It is clear that every vertex in Z is s-nice in \(\mathcal {G}'\). Moreover,

$$\begin{aligned} e(\mathcal {G}')\ge n^{1-(s+2)\varepsilon -(s-1)(s+3)\varepsilon } \cdot n^{2-\frac{1}{s-1}-(3s+5)(s+2)\varepsilon }\ge n^{3-\frac{1}{s-1}-15s^2\varepsilon }, \end{aligned}$$

as \(s+2+(s-1)(s+3)+(3s+5)(s+2) \le 15s^2\) for \(s \ge 3\). This completes the proof. \(\square \)

Proof of Lemma 4.2

The lemma follows from two applications of Lemma 4.6. \(\square \)

4.2 Finding \(K_{s,t}^{(3)}\) in the Structured Subgraph

In this subsection, we prove Lemma 4.3. In what follows, for a 3-uniform hypergraph \(\mathcal {G}\) and distinct vertices \(x,z\in V(\mathcal {G})\), we write \(N_{\mathcal {G}}(x,z)\) for the set of vertices \(y\in V(\mathcal {G})\) for which xyz is an edge in \(\mathcal {G}\). Let \(K_{1,s,1}^{(3)}\) denote the 3-uniform hypergraph with vertices \(x,y_1,\dots ,y_s,z\) and edges \(xy_iz\), \(i = 1,\dots ,s\).

Lemma 4.7

Let \(s\ge 3\), let \(0<\varepsilon <1/8\) and let n be sufficiently large. Let \(\mathcal {G}\) be a 3-uniform 3-partite hypergraph with parts X, Y and Z of size n each. Assume that every pair of vertices in \(\mathcal {G}\) is contained in either 0 or at least \(n^{1-\frac{1}{s-1}-\varepsilon }\) hyperedges. Suppose that \(xyz\in E(\mathcal {G})\) for some \(x\in X\), \(y\in Y\) and \(z\in Z\) and that z is s-nice in \(\mathcal {G}\). Then \(\mathcal {G}\) has at least \(n^{s-\frac{2}{s-1}-(2s+1)\varepsilon }\) copies of \(K_{1,s,1}^{(3)}\) containing z and with the part of size s being a subset of \(N_{\mathcal {G}}(x,z)\).

Proof

Let \(G_z\) be the link graph of z. Since z is s-nice in \(\mathcal {G}\), there are sets \(X'\subset X\) and \(Y'\subset Y\) of size \(n^{1-\frac{1}{s-1}}\) such that \(x\in X'\), \(y\in Y'\) and whenever \(x'y'\in E(G_z)\), then either none or both of \(x'\in X'\) and \(y'\in Y'\) hold. By the assumption in the lemma, every vertex in \(G_z\) has degree either 0 or at least \(n^{1-\frac{1}{s-1}-\varepsilon }\).

Let \(xy'\in E(G_z)\). Clearly, \(y'\in Y'\). Then \(y'\) has at least \(n^{1-\frac{1}{s-1}-\varepsilon }\) neighbours in \(G_z\), all of which must be in \(X'\). Hence, there are at least \(n^{1-\frac{1}{s-1}-\varepsilon } \cdot |N_{G_z}(x)| = n^{-\varepsilon }|X'||N_{G_z}(x)|\) edges in \(G_z\) between \(X'\) and \(N_{G_z}(x)\). Since \(|N_{G_z}(x)|\) is much larger than \(n^{\varepsilon }\), by convexity there are \(\Omega _s(|X'||N_{G_z}(x)|^sn^{-s\varepsilon })\) copies of \(K_{1,s}\) in \(G_z\) with the part of size s inside \(N_{G_z}(x)\). Since \(|X'|=n^{1-\frac{1}{s-1}}\) and \(|N_{G_z}(x)|\ge n^{1-\frac{1}{s-1}-\varepsilon }\), the lemma follows. \(\square \)

Proof of Lemma 4.3

Since x is s-nice, there are sets \(Y'\subset Y\) and \(Z'\subset Z\) of size \(n^{1-\frac{1}{s-1}}\) such that \(y\in Y'\), \(z\in Z'\) and whenever \(xy'z'\in E(\mathcal {G})\), then we have either none or both of \(y'\in Y'\) and \(z'\in Z'\). Note that there are at least \(n^{1-\frac{1}{s-1}-\varepsilon }\) vertices \(z'\in Z'\) such that \(xyz'\in E(\mathcal {G})\), because each pair of vertices is in either 0 or at least \(n^{1-\frac{1}{s-1}-\varepsilon }\) edges, by the assumption of the lemma. For each \(z'\in Z'\) with \(xyz'\in E(\mathcal {G})\), Lemma 4.7 gives at least \(n^{s-\frac{2}{s-1}-(2s+1)\varepsilon }\) copies of \(K_{1,s,1}^{(3)}\) containing \(z'\) and with the part of size s being a subset of \(N_{\mathcal {G}}(x,z')\). Since \(N_{\mathcal {G}}(x,z')\subset Y'\) for every such \(z'\), it follows that \(\mathcal {G}\) contains at least \(n^{1-\frac{1}{s-1}-\varepsilon }\cdot n^{s-\frac{2}{s-1}-(2s+1)\varepsilon }=n^{s+1-\frac{3}{s-1}-(2s+2)\varepsilon }\) copies of \(K_{1,s,1}^{(3)}\) with the part of size s being a subset of \(Y'\). However, \(|Y'|^s=(n^{1-\frac{1}{s-1}})^s = n^{s-1-\frac{1}{s-1}}\), so it follows by the pigeon hole principle that there is a set \(S\subset Y'\) of size s which extends to at least \(\frac{n^{s+1-\frac{3}{s-1}-(2s+2)\varepsilon }}{n^{s-1-\frac{1}{s-1}}}=n^{2-\frac{2}{s-1}-(2s+2)\varepsilon }\) copies of \(K_{1,s,1}^{(3)}\). Let E be the set of pairs \((x',z') \in X \times Z\) with \(x'y'z' \in E(\mathcal {G})\) for every \(y' \in S\), so \(|E| \ge n^{2-\frac{2}{s-1}-(2s+2)\varepsilon }\). We claim that \(\mathcal {G}\) contains a copy of \(K_{s,t}^{(3)}\) with the part of size s being S. If not, then the pairs in E are covered by at most \(2t-2\) vertices. But then \(|E| \le (2t-2) \cdot n^{1-\frac{1}{s-1}+\varepsilon }\), as every pair of vertices is in at most \(n^{1-\frac{1}{s-1}+\varepsilon }\) hyperedges. Since \(2-\frac{2}{s-1}-(2s+2)\varepsilon >1-\frac{1}{s-1}+\varepsilon \), this contradicts \(|E| \ge n^{2-\frac{2}{s-1}-(2s+2)\varepsilon }\). \(\square \)

5 Concluding Remarks

  • The most interesting question arising from the present paper is whether \(\textrm{ex}(n,K_{s,t}^{(r)}) = O_{r,s,t}(n^{r - \frac{1}{s-1} - \varepsilon })\) for \(s\ge 3\) and odd \(r \ge 5\). Recall that this is true for \(r = 3\) (Theorem 1.3) but false for even r if \(t \gg s\) (Theorem 1.2).

  • Similarly, it would be interesting to decide whether \(\textrm{ex}(n,K_{2,t}^{(r)})=\Theta _r(tn^{r-1})\) for odd \(r\ge 5\). This is true for \(r=3\) and every even \(r\ge 4\). The upper bound holds for arbitrary \(r\ge 3\) (Theorem 1.1).

  • Mubayi and Verstraëte conjectured that \(\textrm{ex}(n,K_{s,t}^{(3)}) = \Theta _{s,t}(n^{3-2/s})\) for \(2 \le s \le t\). This remains open for \(s \ge 3\).

Note added After posting our paper on the arXiv, we were informed by Dhruv Mubayi that he had independently proved a weaker version of our Theorem 1.1, namely that \(\textrm{ex}(n,K_{s,t}^{(r)})\le n^{r-\frac{1}{s-1}}(\log n)^{O_{r,s,t}(1)}\).