1 Introduction

Ajtai’s discovery [2] of \(V^{0}\not \vdash ontoPHP^{n+1}_{n}\), where \(ontoPHP^{n+1}_{n}\) is a formalization of the statement “there does not exist a bijection between \((n+1)\) pigeons and n holes,” was a significant breakthrough in proof complexity. The technique, which was later formalized in [14] as k-evaluation and switching lemma, has been utilized to further works in the area such as the comparison between various types of counting principles (such as [3] and [4]). In the course of the works, it turned out that degree lower bounds of Nullstellensatz proofs are essential when one would like to give lower bounds for the lengths of proofs from constant depth Frege system equipped with \(\{Count^{p}_{k}\}_{1 \le k \in \mathbb {N}}\) (i.e., the modular counting principle mod p) as an axiom scheme. One of the most important open problems in the current proof complexity is whether \(V^{0}(p) \vdash injPHP^{n+1}_{n}\) or not (here, \(injPHP^{n+1}_{n}\) denotes the pigeonhole principle for injections). This problem is interesting because it would deepen our understanding of how hard it is to count a set (recall that \({\textbf {VTC}}^{0} \vdash injPHP^{n+1}_{n}\) and \(V^{0}(p) \vdash Count^{p}_{n}\)). Furthermore, if the problem is solved for composite p, it would give us tips to solve the separation problem \(AC^{0}(p)\) versus \(TC^{0}\). As for this problem, Razborov [17] made considerable progress. The paper gave good degree lower bounds for polynomial calculus proofs of \(injPHP^{m}_{n}\) (\(m>n\)), which does not depend on the specific coefficient field.

This paper aims to connect the result of [17] to a superpolynomial lower bound for the proof length of \(injPHP^{n+1}_{n}\) from \(AC^{0}\)-Frege system equipped with Uniform Counting Principle, which can be seen as a uniform version of infinite formulae \(\{Count^{p}_{n}\}_{p,n}\), as an axiom scheme. Tackling the issue, we obtain natural and exciting open problems. We also consider some of them, too.

The detailed content and the organization of the article are as follows.

First, in Sect. 2, as the preliminary, we define three types of (first-order and propositional) formulae, which at first glance seem to be generalized versions of modular counting principles (which do not fix the modulus). We name them as:

  1. 1.

    Modular Pigeonhole Principle \(modPHP^{d,m}_{n}\).

  2. 2.

    Uniform Counting Principle \(UCP^{l,m}_{n}\).

  3. 3.

    Generalized Counting Principle GCP.

Then, we compare the relative strength of these versions over \(V^{0}\) and also develop some independence results over \(V^{0}\). It immediately turns out that

  • \(V^{0} + ontoPHP^{L}_{l} \vdash modPHP^{q,m}_{k}\), and

  • \(V^{0} + modPHP^{q,m}_{k} \vdash ontoPHP^{L}_{l}\).

(For the precise meaning of the statements, see Sect. 2)

Therefore, we see \(modPHP^{d,m}_{n}\) is actually not appropriate to be called a generalized version of modular counting principles because it cannot imply them.

On the other hand, we observe that

  • For any natural number \(p \ge 2\), \(V^{0}+UCP^{l,m}_{k} \vdash Count^{p}_{n}\).

  • \(V^{0}+UCP^{l,m}_{k} \vdash ontoPHP^{n+1}_{n}\).

  • \(V^{0}+GCP \vdash UCP^{l,m}_{n}\).

  • \(V^{0}+GCP \vdash injPHP^{n+1}_{n}\).

Hence, we see \(UCP^{l,m}_{n}\) and GCP can be seen as generalizations of counting principles. In the latter sections, we tackle natural questions arising from the observations above. Towards them, in Sect. 2, we review the Nullstellensatz proof system.

In Sect. 3, we consider the problem: \(V^{0}+UCP^{l,m}_{k} \vdash injPHP^{n+1}_{n}\)? The author conjectures \(V^{0}+UCP^{l,m}_{k} \not \vdash injPHP^{n+1}_{n}\), and gives a sufficient condition to prove it. We define suitable analogues of PHP-tree and k-evaluation in the proof of Ajtai’s theorem given in [13], and show that if an h-evaluation using injPHP-trees exists for a Frege proof of \(injPHP^{n+1}_{n}\) admitting \(UCP^{l,m}_{k}\) as an axiom scheme, h cannot be of order o(n). Our main work is manipulating the trees that connect the order of h with the degrees of Nullstellensatz proofs of it.

In Sect. 4, we consider a “more natural” combinatorial principle than GCP that implies both \(injPHP^{n+1}_{n}\) and \(Count^{p}_{n}\) for some p. Namely, we consider the (propositional and first-order) formulae \(oddtown_{n}\), which formalize the the oddtown theorem. We observe:

  • \(V^{0}+oddtown_{k} \vdash Count^{p}_{n}\) for \(p=2^{l}\).

  • \(V^{0}+oddtown_{k} \vdash injPHP^{n+1}_{n}\).

The author conjectures \(V^{0}+oddtown_{k} \not \vdash Count^{p}_{n}\) for any prime \(p \ne 2\), and gives a sufficient condition to prove it. Roughly speaking, the statement is as follows; if \(V^{0}+oddtown_{k} \vdash Count^{p}_{n}\), then there exists a constant \(\epsilon > 0\) such that for each n, we can construct a vector of \(n^{O(1)}\) many \(\mathbb {F}_{2}\)-polynomials whose violating oddtown condition can be verified by Nullstellensatz proofs from \(\lnot Count^{p}_{n^{\epsilon }}\) with degree O(1).

In Sect. 5, we consider the (propositional and first-order) formulae \(FIE_{n}\) which formalize Fisher’s inequality. We observe

  • \(V^{0}+FIE_{k} \vdash injPHP^{n+1}_{n}\).

On the other hand, the author conjectures \(V^{0}+FIE_{k} \not \vdash Count^{p}_{n}\) for any \(p \ge 2\), and gives a sufficient condition whose form is similar to the previous one to prove it.

2 Preliminaries

2.1 Setup of bounded arithmetic and counting principles

Throughout this paper, p and q denote natural numbers. The cardinality of a finite set S is denoted by \(\# S\). We prioritize readability and often use natural abbreviations to express logical formulae. We assume the reader is familiar with the basics of bounded arithmetics and Frege systems (such as the concepts treated in [8]). Unless stated otherwise, we follow the conventions of [8].

We basically use the parenthesis (, ) to denote tuples, but when there are several tuples of different types or in different universes, we also use \(\langle , \rangle \) for readability. In particular, in Sect. 5, we often label edges and leaves of a tree by tuples. In that case, we use \(\langle , \rangle \).

As propositional connectives, we use only \(\bigvee \) and \(\lnot \). We assume \(\bigvee \) has unbounded arity. When the arity is small, we also use \(\vee \) to denote \(\bigvee \). We define an abbreviation \(\bigwedge \) by

$$\begin{aligned} \bigwedge _{i=1}^{k} \varphi _{i}:= \lnot \bigvee _{i=1}^{k} \lnot \varphi _{i}. \end{aligned}$$

When the arity of \(\bigwedge \) is small, we also use \(\wedge \) to denote it. We give the operators \(\bigvee \) and \(\bigwedge \) precedence over \(\vee \) and \(\wedge \) as the order of application.

Example 1

\(\bigwedge _{i} \varphi _{i} \vee \bigwedge _{j} \psi _{j}\) means \((\bigwedge _{i} \varphi _{i}) \vee (\bigwedge _{j} \psi _{j})\).

We also define an abbreviation \(\rightarrow \) by

$$\begin{aligned} (\varphi \rightarrow \psi ) := \lnot \varphi \vee \psi . \end{aligned}$$

For a set S of propositional variables, an S-formula means a propositional formula whose propositional variables are among S. For a set \(S=\{s^{j}_{i} \mid 1 \le j \le k, i \in I_{j}\}\) of propositional variables where each \(s^{j}_{i}\) is distinct, an S-formula \(\psi \), and a family \(\{\varphi ^{j}_{i}\}_{i \in I_{j}}\) (\(j = 1, \ldots , k\)) of propositional formulae,

$$\begin{aligned} \psi [\varphi ^{1}_{i}/s^{1}_{i}, \ldots , \varphi ^{k}_{i} / s^{k}_{i}] \end{aligned}$$

denotes the formula obtained by substituting each \(\varphi ^{j}_{i}\) for \(s^{j}_{i}\) simultaneously.

It is well-known that a \(\Sigma ^{B}_{0}\) \(\mathcal {L}^{2}_{A}\)-formula \(\varphi (x_{1}, \ldots ,x_{k},R_{1}, \ldots , R_{l})\) can be translated into a family \(\{\varphi [n_{1},\ldots , n_{k}, m_{1}, \ldots , m_{l}]\}_{n_{1},\ldots , n_{k}, m_{1}, \ldots , m_{l} \in \mathbb {N}}\) of propositional formulae. (See Theorem VII 2.3 in [8].)

Now, we define several formulae which express the so-called “counting principle.”

Definition 2

For each \(p \ge 2\), let \(Count^{p}(n,X)\) be an \(\mathcal {L}^{2}_{A}\)-formula as follows (intuitively, it says for \(n \not \equiv 0 \pmod p\), [n] cannot be p-partitioned):

$$\begin{aligned} Count^{p}(n,X) := \lnot p \mid n \rightarrow \lnot \Bigg (&\Big (\forall k \in [n].\exists e \in [n]^{(p)}. (e \in X \wedge k \in ^{*} e)\Big ) \\&\wedge \Big (\forall e, e^{\prime } \in [n]^{(p)}. \lnot (e\in X \wedge e^{\prime } \in X\wedge e \perp e^{\prime })\Big )\Bigg ) \end{aligned}$$

Here,

  • \(p \mid n\) is a \(\Sigma ^{B}_{0}\) formula expressing “p divides n.”

  • [n] denotes the set \(\{1, \ldots ,n\}\).

  • We code a p-subset

    $$\begin{aligned} e=\{e_{1}< \cdots < e_{p}\} \end{aligned}$$

    of [n] by the number

    $$\begin{aligned} \sum _{i=1}^{p}e_{i}(n+1)^{p-i}. \end{aligned}$$
  • \([n]^{(p)}\) denotes the \(\Sigma ^{B}_{0}\)-definable set of all the codes of the p-subsets of [n]. Note that each member in \([n]^{(p)}\) is less than, say, \((n+1)^{p}\).

  • The elementship relation \(\in ^{*}\) is expressed by a natural \(\Sigma ^{B}_{0}\)-predicate. We often write it \(\in \), too.

  • \(e \perp e^{\prime }\) means

    $$\begin{aligned} e \ne e^{\prime } \ \text{ and } \ e \cap e^{\prime } \ne \emptyset , \end{aligned}$$

    and it is also expressed by a natural \(\Sigma ^{B}_{0}\)-predicate.

We also define the propositional formula \(Count^{p}_{n}\) as in [12]:

$$\begin{aligned} Count^{p}_{n} := {\left\{ \begin{array}{ll} 1 &{}(\text{ if }\, p \mid n)\\ \lnot \left( \bigwedge _{k \in [n]} \bigvee _{e: k \in e \in [n]^{(p)} } r_{e} \wedge \bigwedge _{e,e^{\prime } \in [n]^{(p)} :e \perp e^{\prime }} (\lnot r_{e} \vee \lnot r_{e^{\prime }})\right) &{}(\text{ otherwise}) \end{array}\right. } \end{aligned}$$

Here, \(\{r_{e}\}_{e \in [n]^{(p)}}\) is a family of distinct propositional variables.

Convention 3

With suitable identification of propositional variables, \(Count^{p}(x,X)[n,(n+1)^{p}]\) is equivalent to \(Count^{p}_{n}\) over \(AC^{0}\)-Frege system modulo polynomial-sized proofs. Thus we often abuse the notation and write \(Count^{p}_{n}\) for \(Count^{p}(n,X)\).

Definition 4

The \(\Sigma ^{B}_{0}\) \(\mathcal {L}^{2}_{A}\)-formula ontoPHP(mnR) is a natural expression of the statement “If \(m>n\), then R does not give a graph of a bijection between [m] and [n],” in a similar way as \(Count^{p}(n,X)\). Similarly, the \(\Sigma ^{B}_{0}\) \(\mathcal {L}^{2}_{A}\)-formula injPHP(mnR) is a natural expression of the statement “If \(m>n\), then R does not give a graph of an injection from [m] to [n].”

We also define the propositional formulae \(ontoPHP^{m}_{n}\) and \(injPHP^{m}_{n}\) by

$$\begin{aligned} ontoPHP^{m}_{n}:= {\left\{ \begin{array}{ll} \lnot \Big (\bigwedge _{i \in [m]} \bigvee _{j \in [n]} r_{ij} \wedge \bigwedge _{i \ne i^{\prime } \in [m]} \bigwedge _{j \in [n]} (\lnot r_{ij} \vee \lnot r_{i^{\prime }j}) \\ \quad \wedge \bigwedge _{j \in [n]} \bigvee _{i \in [m]}r_{ij} \wedge \bigwedge _{j \ne j^{\prime } \in [n]} \bigwedge _{i \in [m]} (\lnot r_{ij} \vee \lnot r_{ij^{\prime }}) \Big ) &{}(\text{ if }\, m>n)\\ 1 &{}(\text{ otherwise}) \end{array}\right. } \end{aligned}$$

and

$$\begin{aligned} injPHP^{m}_{n}:= {\left\{ \begin{array}{ll} \lnot \Big (\bigwedge _{i \in [m]} \bigvee _{j \in [n]} r_{ij} \wedge \bigwedge _{i \ne i^{\prime } \in [m]} \bigwedge _{j \in [n]} (\lnot r_{ij} \vee \lnot r_{i^{\prime }j}) \\ \quad \wedge \bigwedge _{j \ne j^{\prime } \in [n]} \bigwedge _{i \in [m]} (\lnot r_{ij} \vee \lnot r_{ij^{\prime }}) \Big ) &{}(\text{ if }\, m>n)\\ 1 &{}(\text{ otherwise}) \end{array}\right. } \end{aligned}$$

With reasons similar to the one stated in Convention 3, we abuse the notations and use \(ontoPHP^{m}_{n}\) to denote ontoPHP(mnR) and \(injPHP^{m}_{n}\) to denote injPHP(mnR).

The following are well-known:

Theorem 5

([2], improved by [14] and [16])

$$\begin{aligned} V^{0} \not \vdash ontoPHP^{n+1}_{n}. \end{aligned}$$

Here, we adopt the following convention.

Convention 6

For \(\Sigma ^{B}_{0}\)-formulae \(\psi _{1}, \ldots , \psi _{l}\) and \(\varphi \), we write

$$\begin{aligned} V^{0}+\psi _{1}+\cdots +\psi _{l} \vdash \varphi \end{aligned}$$

to express the fact that the theory \(V^{0}\cup \{\forall \forall \psi _{i}\mid i \in [l]\}\) implies \(\forall \forall \varphi \). Here, \(\forall \forall \) means the universal closure.

We use different parameters to express concrete \(\vec {\psi }\) and \(\varphi \) in order to avoid the confusion. We also use letters p and q for fixed parameters of formulae (which are not universally quantified in the theory). For example,

$$\begin{aligned} V^{0}+ Count^{p}_{k}\not \vdash Count^{q}_{n} \end{aligned}$$

means

$$\begin{aligned} V^{0}+\forall k,X. Count^{p}(k,X) \not \vdash \forall n,X.\ Count^{q}(n,X), \end{aligned}$$

while

$$\begin{aligned} V^{0} \not \vdash UCP^{l,d}_{n} \end{aligned}$$

means

$$\begin{aligned} V^{0} \not \vdash \forall l,d,n,R.\ UCP(l,d,n,R) \end{aligned}$$

(for the definition of \(UCP^{l,d}_{n}\) and UCP(ldnR), see Definition 13).

In the former example, note that we have used the different variables kn to avoid confusion about the variables’ dependencies.

Theorem 7

([3]) For \(p,q \ge 2\), \(V^{0}+Count^{p}_{k} \vdash Count^{q}_{n}\) if and only if \(\exists N \in \mathbb {N}.\ q \mid p^{N}\).

Theorem 8

([4]) For any \(p \ge 2\), \(V^{0}+ Count^{p}_{k} \not \vdash injPHP^{n+1}_{n}\).

Also, the following is a corollary of the arguments given in [12]:

Theorem 9

(Essentially in [12]) For all \(p \ge 2\), \(V^{0}+ injPHP^{k+1}_{k} \not \vdash Count^{p}_{n}\).

Remark 10

Note that the exact statement Theorem 12.5.7 in [12] shows is

$$\begin{aligned} V^{0} + ontoPHP^{k+1}_{k} \not \vdash Count^{p}_{n} \end{aligned}$$

for each fixed \(p \ge 2\). However, with a slight change of the argument, it is easy to see that Theorem 9 actually holds.

From now on, we consider several seemingly generalized versions of \(Count^{p}_{n}\), which do not fix the modulus p, and evaluate their strengths.

Naively, the generalized counting principle should be a statement like: “For any \(d \ge 2\) and \(n \in \mathbb {N}\), if d does not divide n, then n cannot be partitioned into d-sets.” The following is one of the straightforward formalizations of this statement:

Definition 11

The \(\Sigma ^{B}_{0}\) \(\mathcal {L}^{2}_{A}\)-formula modPHP(dmnR) is a natural formalization of the statement “If \(m \not \equiv n \pmod d\), then R does not give the graph of a bijection between [m] and [n].”

We also define the propositional formulae \(modPHP^{d,m}_{n}\) as follows:

$$\begin{aligned}&modPHP^{d,m}_{n} := \\&{\left\{ \begin{array}{ll} \text{ same } \text{ as } \text{ the } \text{ case } \text{ of }\, m>n\, \text{ in } \text{ the } \text{ definition } \text{ of }\, ontoPHP^{m}_{n} &{}(\text{ if }\, m \not \equiv n \pmod d)\\ 1 &{} (\text{ otherwise}) \end{array}\right. } \end{aligned}$$

With a similar reason as the one given in Convention 3, we abuse the notation and use \(modPHP^{d,m}_{n}\) to denote modPHP(dmnR).

Intuitively, \(modPHP^{d,m}_{n}\) expresses “if \(n \not \equiv m \pmod d\) and \(m=ds+r\) (\(0 \le r <d\)), then there does not exist a family \(\{S_{i}\}_{i \in [q]}\) of d-sets and an r-set \(S_{0}\) which give a partition of [n].” However, it does not imply even \(Count^{2}_{n}\):

Proposition 12

The following hold:

  1. 1.

    \(V^{0} + ontoPHP^{L}_{l} \vdash modPHP^{d,m}_{k}\).

  2. 2.

    \(V^{0} + modPHP^{d,m}_{k} \vdash ontoPHP^{L}_{l}\).

In particular, for any \(p \ge 2\), \(V^{0}+ modPHP^{d,m}_{k} \not \vdash Count^{p}_{n}\).

Proof

As for 1, argue in \(V^{0}\) as follows: assume \(m \not \equiv k \pmod d\), and R gives a bijection between [m] and [k]. It easily follows that \(m \ne k\), and hence R or \(R^{-1}\) violates \(ontoPHP^{m}_{k}\) or \(ontoPHP^{k}_{m}\).

As for 2, argue in \(V^{0}\) as follows: suppose \(L>l\) and R gives a bijection between [L] and [l]. Then R violates \(modPHP^{L,L}_{l}\).

The last part follows from Theorem 9. \(\square \)

Therefore, \(modPHP^{d,m}_{n}\) is actually not a generalization of counting principles over \(V^{0}\).

Next, we consider the following version:

Definition 13

UCP(ldnR) (which stands for Uniform Counting Principle) is an \(\mathcal {L}^{2}_{A}\) formula defined as follows:

$$\begin{aligned} (d \ge 1 \wedge \lnot d \mid n ) \rightarrow \lnot&\forall i \in [l]. (\forall j\in [d]. \exists e \in [n]. R(i,j,e) \vee \forall j \in [d]. \lnot \exists e \in [n]. R( i,j,e )) \\&\wedge \forall (i,j) \in [l]\times [d]. \forall e \ne e^{\prime } \in [n] (\lnot R( i,j,e) \vee \lnot R( i,j,e^{\prime }))\\&\wedge \forall (i,j) \ne (i^{\prime },j^{\prime }) \in [l] \times [d]. \forall e \in [n]. (\lnot R(i,j,e ) \vee \lnot R( i^{\prime },j^{\prime },e ))\\&\wedge \forall e \in [n]. \exists (i,j) \in [l] \times [d]. R( i,j,e) \end{aligned}$$

The propositional formula \(UCP^{l,d}_{n}\) is defined as follows:

$$\begin{aligned} UCP^{l,d}_{n}:= {\left\{ \begin{array}{ll} \lnot \bigg (\bigwedge _{i=1}^{l}\left( \left( \bigwedge _{j=1}^{d} \bigvee _{e \in [n]} r_{i,j,e} \right) \vee \left( \bigwedge _{j=1}^{d} \lnot \bigvee _{e \in [n]} r_{i,j,e} \right) \right) \\ \wedge \bigwedge _{(i,j) \in [l] \times [d]} \bigwedge _{e \ne e^{\prime } \in [n]} (\lnot r_{i,j,e} \vee \lnot r_{i,j,e^{\prime }})\\ \wedge \bigwedge _{(i,j) \ne (i^{\prime },j^{\prime }) \in [l] \times [d]} \bigwedge _{e \in [n]} (\lnot r_{i,j,e} \vee \lnot r_{i^{\prime },j^{\prime },e})\\ \wedge \bigwedge _{e \in [n]} \bigvee _{(i,j) \in [l] \times [d]} r_{i,j,e} \bigg ) \quad (\text{ if }\, n \not \equiv 0 \pmod d, d \ge 1)\\ 1 \quad (\text{ otherwise}) \end{array}\right. } \end{aligned}$$

As in the previous definitions, we abuse the notation and use \(UCP^{l,d}_{n}\) to express UCP(ldnR).

Intuitively, \(UCP^{l,d}_{n}\) states “if \(n \not \equiv 0 \pmod d\), then there does not exist a family \(\{S_{i}\}_{i \in [l]}\) which consists of d-sets and emptysets which give a partition of [n].” Each variable \(r_{i,j,e}\) reads “the j-th element of \(S_{i}\) is e.”

We observe the following:

Proposition 14

\({\textbf {VTC}}^{0} \vdash UCP^{l,d}_{k}\).

For a detailed treatment of the bounded arithmetic \({\textbf {VTC}}^{0}\), see §IX.3 of [8].

Proof Sketch

Work in \({\textbf {VTC}}^{0}\). Suppose a bounded set R violates \(UCP^{l,d}_{k}\), that is:

  • \(k \not \equiv 0 \pmod d\).

  • [k] is partitioned into \(\{S_{i}\}_{i \in [l]}\), where \(S_{i}=\{e \in [k] \mid \exists j \in [d].\ R(i,j,e)\}\).

  • Each \(S_{i}\) is either empty or [d]-set. In the latter case, it is witnessed by the bijection

    $$\begin{aligned} B_{i}:=\{( j,e ) \mid R(i,j,e)\}. \end{aligned}$$

Crucial properties of \({\textbf {VTC}}^{0}\) we use are:

  1. 1.

    There is a \(\Sigma ^{B}_{1}\)-definable function \(\#S\), which counts the cardinality of the input bounded set S.

  2. 2.

    \({\textbf {VTC}}^{0}\) proves the following: if a bounded set M codes a bijection between two bounded sets A and B, then \(\#A=\#B\).

Armed with these, we can derive a contradiction in the following way: we consider the cardinality of \(S=\{( i, j, e ) \in [l] \times [d] \times [k] \mid R(i,j,e)\}\). We can show that \(\#S_{i}\) is 0 if it is empty and d otherwise. In the latter case, we use the item 2 above. Furtheremore, by induction on \(i \in [l]\), we see that \(\#\bigcup _{i' \le i} S_{i} \equiv 0 \pmod d\). Considering the case \(i=l\), we have \(\# S \equiv 0 \pmod d\).

On the other hand, there is a \(\Sigma ^{B}_{0}\)-definable bijection between S and [k]:

$$\begin{aligned} S \rightarrow [k];\ ( i,j,e) \mapsto e. \end{aligned}$$

The fact that the above map is a bijection follows immediately from the assumption on R. Hence, using the item 2 again, we have \(\#S=k \not \equiv 0 \pmod d\), a contradiction. \(\square \)

Proposition 15

The following hold:

  1. 1.

    For any \(p \ge 2\), \(V^{0}+UCP^{l,d}_{k} \vdash Count^{p}_{n}\).

  2. 2.

    \(V^{0}+UCP^{l,d}_{k} \vdash ontoPHP^{m}_{n}\).

Proof

As for 1, argue in \(V^{0}\) as follows: suppose \(n \not \equiv 0 \pmod p\), and R gives a p-partition of [n]. Set the family \(\{S_{r}\}_{r \in [(n+1)^{p}]}\) by:

$$\begin{aligned} S_{r}:= {\left\{ \begin{array}{ll} \text{ the } \text{ set } \text{ coded } \text{ by }\, r &{}(\text{ if } \quad r \in R)\\ \emptyset &{}(\text{ otherwise}) \end{array}\right. } \end{aligned}$$

Then \(\{S_{r}\}_{r\in [(n+1)^{p}]}\) indeed violates \(UCP^{(n+1)^{p},p}_{n}\).

As for , argue in \(V^{0}\) as follows: suppose \(m>n\) and R gives a bijection between [m] and [n]. Then, \(\{[n]\}\) violates \(UCP^{1,m}_{n}\). \(\square \)

Hence, \(UCP^{l,d}_{n}\) is indeed a generalization of counting principles. It is natural to ask

Question 1

Does the following hold?:

$$\begin{aligned} V^{0}+ UCP^{l,d}_{k} \vdash injPHP^{n+1}_{n}, \end{aligned}$$

or, at least,

$$\begin{aligned} V^{0} + ontoPHP^{M}_{m} \vdash injPHP^{n+1}_{n}. \end{aligned}$$

In view of theorem 8, the author conjectures the answer to the problem is no. We tackle this issue in Sect. 3.

Here, we consider one more generalization of counting principles (which relates to the above problem):

Definition 16

\(GCP(P, Q_{1},Q_{2},R_{1},R_{2}, M_{0},M_{1},M_{2})\) (which stands for Generalized Counting Principle) is a \(\Sigma ^{B}_{0}\) \(\mathcal {L}^{2}_{A}\)-formula expressing the following statement: bounded sets

$$\begin{aligned} P, Q_{1},Q_{2},R_{1},R_{2}, M_{0},M_{1},M_{2} \end{aligned}$$

cannot satisfy the conjunction of the following properties:

  1. 1.

    \(M_{0}\) codes a bijection between \((P \times Q_{1}) \sqcup R_{1}\) and \((P \times Q_{2}) \sqcup R_{2}\).

  2. 2.

    \(M_{1}\) is an injection from \(R_{1}\) to \(R_{2}\) such that some element \(a \in R_{2}\) is out of its range.

  3. 3.

    \(M_{2}\) is an injection from \(R_{2}\) to P such that some element \(b \in P\) is out of its range.

Remark 17

We can consider the propositional translation of GCP as well as the previous examples \(UCP^{l,d}_{n}\), \(Count^{p}_{n}\), etc. However, we do not write it down here because we are not using it at this time.

It is easy to see that:

Proposition 18

\({\textbf {VTC}}^{0} \vdash GCP\).

Proof Sketch

Analogously with Proposition 14, we can derive an arithmetical contradiction in number sort by applying \(\#\) to the bounded sets listed in the definition of GCP and their products. \(\square \)

Proposition 19

  1. 1.

    \(V^{0} +GCP \vdash UCP^{l,d}_{n}\).

  2. 2.

    \(V^{0}+GCP \vdash injPHP^{n+1}_{n}\).

Proof

Work in \(V^{0}+GCP\).

We first prove \(UCP^{l,d}_{n}\). Suppose \(n \not \equiv 0 \pmod d\), and \(\{S_{i}\}_{i \in [l]}\) is a family consisting of d-sets and emptysets, and \([n]=\bigsqcup _{i \in [l]}S_{i}\). Set

$$\begin{aligned} Q:=\{i \in [l] \mid S_{i} \ne \emptyset \}. \end{aligned}$$

Then the partition gives a bijection between [n] and \(Q \times [d] \sqcup \emptyset \).

On the other hand, since \(n \not \equiv 0 \pmod d\), we can write \(n=ds+r\) where \(1 \le r < d\). This gives a natural bijection between [n] and \([d] \times [s] \sqcup [r]\).

Using \(\Sigma ^{B}_{0}\)-COMP (cf. Definition V.1.2 in [8]), it is straightforward to construct proper injections from \(\emptyset \) to [r] and from [r] to [d]. Thus, GCP is violated, which is a contradiction.

We next prove \(injPHP^{n+1}_{n}\). Suppose R gives an injection from \([n+1]\) to [n]. Then, there is a natural bijection between [n] and \([n+1] \times [1] \sqcup ([n]\setminus {{\,\textrm{ran}\,}}R)\).

On the other hand, there is a natural bijection between [n] and \([n+1]\times \emptyset \sqcup [n]\).

It is easy to construct proper injections from \([n]\setminus {{\,\textrm{ran}\,}}R\) to [n], and from [n] to \([n+1]\). Thus, GCP is violated, which is a contradiction. \(\square \)

It is natural to ask:

Question 2

  1. 1.

    Does the following hold: \(V^{0}+ UCP^{l,d}_{k} \vdash GCP\)?

  2. 2.

    Is there any other combinatorial principle than GCP which also implies \(injPHP^{n+1}_{n}\) and some of \(Count^{p}_{n}\)?

On the item 1, the author conjectures the answer is no (since GCP implies \(injPHP^{n+1}_{n}\)).

As for the item 2, we consider the oddtown theorem in Sect. 4.

2.2 Nullstellensatz proof system

In the analysis of this paper, Nullstellensatz proofs (written shortly as “NS-proofs”) play an essential role in the arguments. We set up our terminology on NS-proofs and review degree lower bounds for injPHP.

Definition 20

Let A be a (commutative) ring, and \(\mathcal {F}\) be a set of multivariate A-polynomials. For multivariate polynomials \(g_{1}\), \(g_{2}\) and \(h_{f}\) \((f \in \mathcal {F})\) over A, \(\{h_{f}\}_{f \in \mathcal {F}}\) is a NS-proof of \(g_{1}=g_{2}\) from \(\mathcal {F}\) over A if and only if

$$\begin{aligned} g_{1}-g_{2}=\sum _{f \in \mathcal {F}}h_{f}f. \end{aligned}$$

Especially, for \(a \in A \setminus \{0\}\), a NS-proof of \(a=0\) from \(\mathcal {F}\) is called a NS-refutation of \(\mathcal {F}\).

The degree of a NS-proof \(\{h_{f}\}_{f \in \mathcal {F}}\) is defined by \(\max _{f \in \mathcal {F}}(\deg (h_{f})+\deg (f))\). Here, we adopt the convention \(\deg 0:= -\infty \).

We are particularly interested in the following family of polynomials, which is another representation of the pigeonhole principle:

Definition 21

Let \(M, m \in \mathbb {N}\), and \(M>m\). Given a ring A, we define \(\lnot injPHP^{M}_{m}[A]\) as the family of the following multivariate polynomials over A:

$$\begin{aligned}&x_{ij}^{2}-x_{ij} \quad (i \in [M], j \in [m]), \end{aligned}$$
(1)
$$\begin{aligned}&x_{ij}x_{ij^{\prime }} \quad (i \in [M], j\ne j^{\prime } \in [m]), \end{aligned}$$
(2)
$$\begin{aligned}&x_{ij}x_{i^{\prime }j} \quad (i\ne i^{\prime } \in [M], j \in [m]), \end{aligned}$$
(3)
$$\begin{aligned}&\sum _{j=1}^{m} x_{ij} -1 \quad (i \in [M]), \end{aligned}$$
(4)

(each \(x_{ij}\) is distinct.)

Furthermore, we define \(\lnot inj^{*}PHP^{M}_{m}[A]\) as the family of the following multivariate polynomials over A:

$$\begin{aligned}&x_{ij}^{2}-x_{ij}, u_{j}^{2}-u_{j} \quad (i \in [M], j \in [m]),\\&x_{ij}x_{ij^{\prime }} \quad (i \in [M], j\ne j^{\prime } \in [m]), \\&x_{ij}x_{i^{\prime }j} \quad (i\ne i^{\prime } \in [M], j \in [m]),\\&\sum _{j=1}^{m} x_{ij} -1 \quad (i \in [M]),\\&\sum _{i=1}^{M} x_{ij} + u_{j} -1 \quad (j \in [m]) \end{aligned}$$

(\(x_{ij}\) and \(u_{j}\) are distinct indeterminates).

Remark 22

It is easy to see that \(\lnot injPHP^{M}_{m}[A]\) is not satisfiable in A if A is a field; the system is a paraphrase of the pigeonhole principle since \(x^{2}-x=0\) implies \(x=0,1\) in a field. This unsatisfiablity of \(\lnot injPHP^{M}_{m}[A]\) is extended to the case when A is a general nontrivial ring since there always exists a ring homomorphism from A to a field, and all the coefficients of the polynomials in \(\lnot injPHP^{M}_{m}[A]\) are among \(0, \pm 1\).

Furthermore, if A is nontrivial, then \(\lnot injPHP^{M}_{m}[A]\) always has a NS-refutation over A. Indeed, the unit of A generates a subring of A which is isomorphic to \(\mathbb {Z}\) or \(\mathbb {Z}/m\mathbb {Z}\) (\(m \ge 2\)). \(\lnot injPHP^{M}_{m}[\mathbb {Z}]\) has a NS-refutation over \(\mathbb {Z}\) since we have a NS-refutation over \(\mathbb {Q}\). We know that \(\lnot injPHP^{M}_{m}[\mathbb {Z}/m\mathbb {Z}]\) has a NS-refutation over \(\mathbb {Z}/m\mathbb {Z}\) when m is prime. For composite m, taking a prime divisor p of it, we have an embedding of \(\mathbb {Z}/p\mathbb {Z}\) into \(\mathbb {Z}/m\mathbb {Z}\) as additive groups. Since all the coefficients of the polynomials in \(\lnot injPHP^{M}_{m}[\mathbb {Z}/m\mathbb {Z}]\) are among \(0, \pm 1\), we can transform a NS-refutation \((h_{f})_{f}\) over \(\mathbb {Z}/p\mathbb {Z}\) to a NS-refutation over \(\mathbb {Z}/m\mathbb {Z}\) by simply mapping each coefficient of \(h_{f}\) by the embedding.

Note that Nullstellensatz proof system is not always complete for general coefficient rings. See the appendix of [7] for details.

We have introduced \(\lnot inj^{*}PHP^{M}_{m}[A]\) since it makes the translation of injPHP-trees into NS-refutations in the proof of Theorem 53 simpler, although it is an easy observation that \(\lnot injPHP^{M}_{m}\) and \(\lnot inj^{*}PHP^{M}_{m}\) are equivalent in the sense of degree of NS-refutation:

Proposition 23

Let A be a ring. For \(M,m,d \in \mathbb {N}\) with \(M > m\), the following are equivalent:

  1. 1.

    \(\lnot injPHP^{M}_{m}[A]\) has a NS-refutation over A of degree \(\le d\).

  2. 2.

    \(\lnot inj^{*}PHP^{M}_{m}[A]\) has a NS-refutation over A of degree \(\le d\).

Proof

(1) \(\Rightarrow \) (2) follows from \(\lnot injPHP^{M}_{m}[A] \subseteq \lnot inj^{*}PHP^{M}_{m}[A]\) as families of polynomials.

We consider the converse. Suppose \(\lnot inj^{*}PHP^{M}_{m}[A]\) has a NS-refutation of degree \(\le d\). Applying the substitution \(u_{j}:=1-\sum _{i=1}^{M} x_{ij}\) (\(j \in [m]\)), we obtain a NS-refutaion of

$$\begin{aligned} \lnot injPHP^{M}_{m} \cup \left\{ \left( 1-\sum _{i=1}^{M} x_{ij}\right) ^{2}-\left( 1-\sum _{i=1}^{M} x_{ij}\right) \mid j\in [m]\right\} \end{aligned}$$

over A without increasing the degree. Furthermore,

$$\begin{aligned} \left( 1-\sum _{i=1}^{M} x_{ij}\right) ^{2}-\left( 1-\sum _{i=1}^{M} x_{ij}\right) = \sum _{i=1}^{M}(x_{ij}^{2}-x_{ij}) + \sum _{i \ne i' \in [M]} x_{ij}x_{i'j}. \end{aligned}$$

Hence, we obtain a NS-refutation of \(\lnot injPHP^{M}_{m}[A]\) over A without increasing the degree. \(\square \)

As for the case when A is a field, the following powerful degree lower-bound is well-known:

Theorem 24

([17]) Let \(M,m \in \mathbb {N}\) and \(M>m\). Assume A is a field. There is no PC-refutation of \(\lnot injPHP^{M}_{m}[A]\) over A with degree \(\le m/2\). In particular, there is no NS refutation of \(\lnot injPHP^{M}_{m}[A]\) over A with degree \(\le m/2\).

Here, PC stands for “Polynomial Calculus.” For the precise treatment of the formal system, see §6.2 of [13].

Towards the discussion in Sect. 3, we want to generalize the above degree lower bound for \(A=\mathbb {Z}/n\mathbb {Z}\), where n is a general integer satisfying \(n \ge 2\). Precisely speaking, taking a closer look at the proof of Theorem 24, we can obtain the following generalization:

Corollary 25

(Essentially by [17]) Let A be a finite nontrivial commutative ring. Let \(M,m\in \mathbb {N}\) satisfy \(M>m\). Then there is no NS-refutation of \(injPHP^{M}_{m}[A]\) of degree \(\le \frac{m}{2}\).

We give a proof for completeness. Below, we follow the presentation of §16.2 in [13]. Fix \(M>m\) and a finite nontrivial commutative ring A in the rest of this section.

Let Maps be the set of partial bijections from [M] to [m]. For each \(\alpha \in Maps\), set \(x_{\alpha }:= \prod _{i \in {{\,\textrm{dom}\,}}(\alpha )} x_{i\alpha (i)}\). In particular, \(x_{\emptyset }:=1\). Let \(\hat{S}:= A[\bar{x}]/I\), where I is the ideal generated by the polynomials (1), (2), (3) in \(injPHP^{M}_{m}[A]\). For \(f \in A[\bar{x}]\), \(\hat{f}\) denotes the quotient of f in \(\hat{S}\). Furthermore, for a subset \(U \subseteq A[\bar{x}]\), set

$$\begin{aligned} \hat{U}:=\{\hat{f} \in \hat{S} \mid f \in U\}. \end{aligned}$$

Definition 26

For a parameter \(t \le m\), define:

$$\begin{aligned} Maps_{t}&:=\{\alpha \in Maps \mid \#\alpha \le t\}.\\ S_{t}&:= \{ f \mid f \in A[\bar{x}],\ \deg (f) \le t\}.\\ T_{t}&:=\{x_{\alpha } \mid \alpha \in Maps_{t}\}. \end{aligned}$$

Note that:

Proposition 27

For \(t \le m\),

  1. 1.

    \(\hat{S}_{t}\) is a free A-module of finite rank.

  2. 2.

    \(\hat{T}_{t}\) is a basis of \(\hat{S}_{t}\), and \(\#T_{t}=\#\hat{T}_{t}\).

Proof

The first item follows from the second item. We can show the second item as follows. The fact that \(\hat{T}_{t}\) generates \(\hat{S}_{t}\) straightforwardly follows from the definition of I above. We show that \(\hat{T}_{t}\) is A-linearly independent and \(\#T_{t}=\#\hat{T}_{t}\). Suppose not. Then we have

$$\begin{aligned} \sum _{\alpha \in Maps_{t}} c_{\alpha } x_{\alpha } \in I \end{aligned}$$

for some \(\{c_{\alpha }\}_{\alpha \in Maps_{t}} \subseteq A\) such that \(c_{\alpha } \ne 0\) for at least one \(\alpha \). Take a minimal \(\alpha _{0}\) of such, and consider the following substitution:

$$\begin{aligned} x_{ij} \mapsto {\left\{ \begin{array}{ll} 1 \quad \text{(if }\, \alpha _{0}(i)=j)\\ 0 \quad \text{(otherwise) }. \end{array}\right. } \end{aligned}$$

Then every element in I is evaluated to 0, and \(c_{\alpha }x_{\alpha }\) is evaluated to:

$$\begin{aligned} {\left\{ \begin{array}{ll} c_{\alpha _{0}} \quad \text{(if }\, \alpha =\alpha _{0})\\ 0 \quad \text{(otherwise) }. \end{array}\right. } \end{aligned}$$

because of the minimality of \(\alpha _{0}\). Thus we have \(c_{\alpha _{0}}=0\), a contradiction. \(\square \)

We aim to show that no \(a \in A \setminus \{0\}\) is in \(O_{t}\) (\(t \le m/2\)) below.

Definition 28

For \(t \le m\), let \(O_{t}\) be the set of \(f \in A[\bar{x}]\) such that f has a degree \(\le t\) PC-proof from \(injPHP^{M}_{m}[A]\).

Towards it, the following notion is useful to describe the rewriting procedure of polynomials in concern:

Definition 29

Let \(\alpha \in Maps\). A pigeon dance of \(\alpha \) is the following non-deterministic process:

  • Take the first (i.e. the smallest) pigeon \(i_{1} \in {{\,\textrm{dom}\,}}(\alpha )\) and move it to any unoccupied hole j such that is larger than \(\alpha (i_{1})\).

  • Then take the second smallest pigeon \(i_{2} \in {{\,\textrm{dom}\,}}(\alpha )\) and move it to any currently unoccupied hole larger than \(\alpha (i_{2})\), etc., as long as this is possible.

We say pigeon dance is defined on \(\alpha \) if this process can be completed for all the pigeons in \({{\,\textrm{dom}\,}}(\alpha )\).

Definition 30

Let \(Maps^{*}\) be the set of all partial maps \(\alpha :[M] \rightarrow [m] \sqcup \{0\}\) such that \(\alpha \) is injective outside \(\alpha ^{-1}(0)\). The partial bijection \(\alpha \restriction _{\alpha ^{-1}([m])}\) is denoted by \(\alpha ^{-}\).

For \(\alpha \in Maps^{*}\), extend the notation \(x_{\alpha }\) as follows:

$$\begin{aligned} x_{\alpha }:= x_{\alpha ^{-}} \times \prod _{i :\alpha (i)=0}\left( \sum _{j=1}^{m}x_{ij}-1\right) . \end{aligned}$$

Note that \(x_{\alpha }\) is no longer a monomial in general. We consider the following subsets of \(\hat{S}_{t}\):

Definition 31

For \(t \le m\), define:

  1. 1.

    \(B_{t} \subseteq S_{t}\) as the set of all \(x_{\alpha } \in Maps^{*}\) such that \(\#\alpha \le t\) and a pigeon dance is defined on \(\alpha ^{-}\).

  2. 2.

    \(C_{t}:=B_{t}{\setminus } T_{t}\) and \(\Delta _{t}:= B_{t} \cap T_{t}\), that is, \(\Delta _{t}\) consists of the monomials \(x_{\alpha }\) for \(\alpha \in Maps_{t}\) such that a pigeon dance is defined on \(\alpha \).

Now, Corollary 25 is an immediate consequence of the following lemma since \(1 \in \Delta _{t}\):

Lemma 32

For \(t \le m/2\),

$$\begin{aligned} \hat{S}_{t} = \hat{O}_{t} \oplus A \hat{\Delta }_{t},\ \hat{O}_{t}=A\hat{C}_{t} \end{aligned}$$

as A-modules. In particular, any non-zero scalar in A does not belong to \(O_{t}\).

Proof

First, we show that \(\hat{B}_{t}=\hat{C}_{t} \cup \hat{\Delta }_{t}\) spans \(\hat{S}_{t}\). For the monomials

$$ \begin{aligned} x_{\gamma } = x_{i_{1}j_{1}}\cdots x_{i_{k}j_{k}} \ \text{ and } \ x_{\delta }= x_{u_{1}v_{1}} \cdots x_{u_{l}v_{l}}\quad (i_{1}< \cdots< i_{k} \ \& \ u_{1}< \cdots < u_{l}) \end{aligned}$$

from \(T_{t}\), define a partial ordering \(x_{\gamma } \preceq x_{\delta }\) by the condition that:

  • either \(k < l\), or \(k=l\) and for the largest w such that \(j_{w} \ne v_{w}\) it holds that \(j_{w} < v_{w}\).

Claim 33

The monomial \(x_{\gamma }\) can be expressed as

$$\begin{aligned} \widehat{x_{\gamma }} = \sum _{x_{\alpha } \in X} c_{\alpha }\widehat{x_{\alpha }} + \sum _{x_{\beta } \in Y} c_{\beta }\widehat{x_{\beta }} \end{aligned}$$

in \(\hat{S}\) for some \(X \subseteq C_{t}\) and \(Y \subseteq \Delta _{t}\), and non-zero coefficients \(c_{\alpha }\)’s from A. (Note that \(X \cap Y = \emptyset \) by definition of \(C_{t}\) and \(\Delta _{t}\).) Further, for all \(x_{\alpha } \in X\) and \(x_{\beta } \in Y\), \({{\,\textrm{dom}\,}}(\alpha ) \cup {{\,\textrm{dom}\,}}(\beta ) \subseteq {{\,\textrm{dom}\,}}(\gamma )\).

We may assume \(i_{1}< \cdots <i_{k}\) in \(x_{\gamma }\) above and also \(x_{\gamma } \not \in \Delta _{t}\). We use induction on \(\preceq \). Assume the claim holds for all terms \(\preceq \)-smaller than \(x_{\gamma }\). In the following, we omit the symbol \(\hat{(\cdot )}\) representing the quotient map for readability. In \(\hat{S}\), rewrite \(x_{\gamma }\) as

$$\begin{aligned}&x_{i_{2}j_{2}} \cdots x_{i_{k}j_{k}} - \sum _{\begin{array}{c} j_{1}'<j_{1}\\ j_{1}' \not \in \{j_{2}, \ldots , j_{k}\} \end{array}} x_{i_{1}j_{1}'} \cdots x_{i_{k}j_{k}} - \sum _{\begin{array}{c} j_{1}'>j_{1}\\ j_{1}' \not \in \{j_{2}, \ldots , j_{k}\} \end{array}}x_{i_{1}j_{1}'} \cdots x_{i_{k}j_{k}} \\&\quad + \left( \sum _{j=1}^{m}x_{i_{1}j}-1\right) x_{i_{2}j_{2}}\cdots x_{i_{k}j_{k}}. \end{aligned}$$

The first term and the terms in the first summation are all \(\preceq \)-smaller than \(x_{\gamma }\), and their domain is included in \({{\,\textrm{dom}\,}}(\gamma )\). Thus, the statement for them follows by the induction hypothesis. Moreover, the statement for the first term implies that for the last term. Note that \(x_{\alpha } \in B_{t-1}\) implies \(\left( \sum _{j=1}^{m}x_{i_{1}j}-1\right) x_{\alpha } \in AB_{t}+I\).

The collection of all the terms in the second summation can be interpreted as describing all possible moves of \(i_{1}\) in the attempted pigeon dance of \(\gamma \). To simulate other steps in all possible pigeon dances, rewrite each of these terms analogously using (4) for \(i_{2}\), and so on. We have assumed \(x_{\gamma } \not \in \Delta _{t}\), that the pigeon dance cannot be completed. Therefore, the rewriting procedure must eventually produce only terms \(\preceq \)-smaller than \(x_{\gamma }\). This completes the proof of Claim 33.

Next, we establish the linear independence of \(\hat{B}_{t}=\hat{C}_{t} \cup \hat{\Delta }_{t}\). It suffices to show

$$\begin{aligned} \#B_{t} \le \# T_{t} = \#\hat{T}_{t} \end{aligned}$$

since \(\hat{B}_{t}\) spans \(\hat{S}_{t}\), \(\hat{T}_{t}\) is its basis, \(\#\hat{B}_{t} \le \#B_{t}\), and A is finite. Note that \(\#\hat{B}_{t}=\#B_{t}=\#T_{t}\) follows.

Towards the above inequality, consider another procedure minimal pigeon dance, which is described as follows: given \(\alpha \in Maps^{*}\), it is defined by the following instructions:

  1. 1.

    Put \(\alpha _{1}:=\alpha \).

  2. 2.

    For \(i=1,\ldots , M\), if \(i \in {{\,\textrm{dom}\,}}(\alpha _{i})\), move it to the smallest free hole \(j > \alpha (i)\) and let \(\alpha _{i+1}\) be the resulting map. If \(i \not \in {{\,\textrm{dom}\,}}(\alpha _{i})\), do nothing and put \(\alpha _{i+1}:=\alpha _{i}\).

  3. 3.

    \(D(\alpha )\) is the result of this process applied to \(\alpha \).

The following claims are purely combinatorial results on minimal pigeon dance, and the coefficient ring A does not matter, so we state them without proof. See the proof of Lemma 16.2.2 in [13] for concrete proofs.

Claim 34

For all \(t \le m/2\), D is defined on the whole of \(B_{t}\).

Claim 35

D is an injective map from \(B_{t}\) into \(T_{t}\) for \(t \le m/2\).

Claim 35 implies \(\# B_{t} \le \# T_{t}\).

It remains to show that \(\hat{C}_{t}\) is a basis of \(\hat{O}_{t}\). We already know that \(\hat{C}_{t}\) is A-linearly independent, so we focus on proving \(\hat{C}_{t}\) spans \(\hat{O}_{t}\). Clearly \(C_{t} \subseteq O_{t}\) and \(AC_{t}\) contains all the axioms (4) in \(injPHP^{M}_{m}[A]\) and is closed under the addition rule of PC. For the closure of the space under the multiplication rule, assume that

$$\begin{aligned} x_{\alpha } \in C_{t},\ s=\deg (x_{\alpha }) < t. \end{aligned}$$

It suffices to show that \(x_{ij}x_{\alpha } \in AC_{s+1}+I\), which implies \(\widehat{fx_{\alpha }} \in A\hat{C}_{t}\) for each monomial f with \(\deg (f) + \deg (x_{\alpha }) \le t\) by induction on \(\deg (f)\) and for each \(f \in A[\bar{x}]\) with \(\deg (f) + \deg (x_{\alpha }) \le t\) since \(A\hat{C}_{t}\) is closed under addition and scalar product. We use induction on s. By the definition of \(C_{t}\), we can write \(x_{\alpha }=x_{\beta } (1-\sum _{j=1}^{m}x_{i'j})\) for some \(i' \in [M]\). As \(\deg (x_{\beta }) <\deg (x_{\alpha }) = s\), \(x_{ij}x_{\beta } \in AB_{s}+I\) by Claim 33 if \(x_{\beta } \in T_{s-1}\) and by induction hypothesis if \(x_{\beta } \in C_{s-1}\). Note that the degree does not increase. Multiplying \((1-\sum _{j=1}^{m}x_{i'j})\), we have \(x_{ij}x_{\alpha } \in AC_{s+1}+I\). \(\square \)

3 On question 1

As for Question 1, the author conjectures the following:

Conjecture 1

$$\begin{aligned} F_{c}+UCP^{l,d}_{k} \not \vdash _{poly(n)} injPHP^{n+1}_{n}. \end{aligned}$$

   Here, for a family \(\{\alpha _{\vec {k}}\}_{\vec {k} \in \mathbb {N}}\) of propositional formulae, \(F_{c}+\alpha _{\vec {k}}\) is the fragment of Frege system allowing the formulae with depth \(\le c\) only and admitting \(\{\alpha _{\vec {k}}\}_{\vec {k}}\) as an axiom scheme. Furthermore, \(P \vdash _{poly(n)} \varphi _{n}\) means each \(\varphi _{n}\) has a poly(n)-sized P-proof.

If this conjecture is true, then it follows that \(V^{0}+UCP^{l,d}_{k} \not \vdash injPHP^{n+1}_{n}\) by the witnessing theorem and the translation theorem. In this section, we provide a sufficient condition to prove this conjecture. Our strategy is to adapt the proof technique of Ajtai’s theorem to the situation. We define injPHP-tree, and a k-evaluation using injPHP-tree, and show that a Frege-proof of \(injPHP^{n+1}_{n}\) admitting \(UCP^{l,d}_{k}\) as an axiom scheme cannot have o(n)-evaluation. Taking the contrapositive, we obtain our sufficient condition.

We start with introducing the following notions:

Definition 36

Let D and R be disjoint sets. A partial injection from D to R is a set \(\rho \) which satisfies the following:

  1. 1.

    Each \(x \in \rho \) is either a 2-set having one element from D and one element from R, or a singleton contained in R. (In the former case, if \(x=\{i,j\}\) where \(i \in D\) and \(j \in R\), we use a tuple \(\langle i,j \rangle \) to denote x. In the latter case, if \(x=\{j\}\) where \(j \in R\), then we use 1-tuple \(\langle j \rangle \) to denote x.)

  2. 2.

    Each pair \(x \ne x^{\prime } \in \rho \) are disjoint.

The 2-sets in a partial injection \(\rho \) give a partial bijection from D to R. We denote it by \(\rho _{bij}\). Also, we set \(\rho _{sing}:=\rho \setminus \rho _{bij}\).

We define \(v(\rho ):=\bigcup _{x \in \rho }x\), \({{\,\textrm{dom}\,}}(\rho ):= v(\rho ) \cap D\), and \({{\,\textrm{ran}\,}}(\rho ):=v(\rho ) \cap R\).

For two partial injections \(\rho \) and \(\tau \) from D to R,

  1. 1.

    \(\rho || \tau \) if and only if \(\rho \cup \tau \) is again a partial injection.

  2. 2.

    \(\rho \perp \tau \) if and only if \(\rho || \tau \) does not hold. In other words, there exist \(x \in \rho \) and \(y \in \tau \) such that \(x \ne y\) and \(x \cap y \ne \emptyset \).

  3. 3.

    \(\sigma \tau := \sigma \cup \tau \).

Now, we introduce an analogy of PHP-trees in this context:

Definition 37

Let D and R be disjoint finite sets. An injPHP-tree over (DR) is a vertex-labeled and edge-labeled rooted tree defined inductively as follows:

  1. 1.

    The tree whose only vertex is its root and has no labels is an injPHP-tree over (DR).

  2. 2.

    If the root is labeled by “\(i \mapsto ?\)” having \(\#R\) children and each of its edges corresponding to each label “\(\langle i,j \rangle \)” (\(j \in R\)), and the subtree which the child under the edge labeled by “\(\langle i,j \rangle \)” induces is an injPHP-tree over \((D \setminus \{i\}, R\setminus \{j\})\), then the whole labeled tree is again an injPHP-tree over (DR).

  3. 3.

    If the root is labeled by “\(? \mapsto j\)” having \((\#D+1)\) children and each of its edges corresponding to each label “\(\langle i,j \rangle \)” (\(i \in D\)) and “\(\langle j \rangle \),” and the subtree which the child under the edge indexed by \(\langle i,j \rangle \) induces is an injPHP-tree over \((D {\setminus } \{i\}, R{\setminus } \{j\})\) while the subtree which the child under the edge labeled by “\(\langle j \rangle \)” induces is an injPHP-tree over \((D, R\setminus \{j\} )\), then the whole tree is again an injPHP-tree over (DR).

For an injPHP-tree T, we denote the height (the maximum number of edges in its branches) of T by height(T) and the set of its branches by br(T).

The pair \((T, L:br(T) \rightarrow S)\) is called a labeled injPHP-tree with label set S. For each label \(s \in S\), we set \(br_{s}(T):= L^{-1}(s)\).

Convention 38

When T is an injPHP-tree over (DR), each branch \(b \in br(T)\) naturally gives a partial injection, which is the collection of labels of edges contained in b. We often abuse the notation and use b to denote the partial injection given by b.

In the following, if there is no problem, we identify domains having the same size n and denote them \(D_{n}\). Similarly, we identify ranges R having the same size n and denote them \(R_{n}\). We assume \(D_{m}\) and \(R_{n}\) are disjoint for any \(m,n \in \mathbb {N}\).

Definition 39

For \(m>n\), \(\mathcal {M}^{m}_{n}\) denotes the set of all partial injections from \(D_{m}\) to \(R_{n}\).

Definition 40

Let \(\rho \in \mathcal {M}^{m}_{n}\) (\(m > n\)). Let T be an injPHP-tree over \((D_{m},R_{n})\). We define the restriction \(T^{\rho }\) as the injPHP-tree over \((D_{m} {\setminus } {{\,\textrm{dom}\,}}(\rho ), R_{n} {\setminus } {{\,\textrm{ran}\,}}(\rho ))\) obtained from T by deleting the edges with label incompatible with \(\rho \), contracting the edges whose label are contained in \(\rho \) (we leave the label of the child), and taking the connected component including the root of the tree.

In particular, we are interested in shallow injPHP-trees. The main reason is the following Lemma 42.

Definition 41

For \(\tau \in \mathcal {M}^{m}_{n}\) such that \(\tau || \rho \), we set

$$\begin{aligned} \tau ^{\rho } := \tau \setminus \rho . \end{aligned}$$

Lemma 42

Let T be an injPHP-tree and \(\rho \in \mathcal {M}^{m}_{n}\) (\(m > n\)). If \(height(T) \le n-\#\rho \), then the map

$$\begin{aligned} \{b \in br(T) \mid b || \rho \} \rightarrow br(T^{\rho });\ b \mapsto b^{\rho } \end{aligned}$$

is bijective.

Proof

First, we observe that \(b^{\rho } \in br(T^{\rho })\) if \(b \in br(T)\) satisfies \(b||\rho \). Indeed, no edge in b is deleted by the restriction with \(\rho \) since \(b || \rho \), although some edges in b may be contracted.

Next, we see the above map \(b \mapsto b^{\rho }\) is injective. Indeed, if \(b \ne b' \in br(T)\), \(b||\rho \), and \(b'||\rho \), then \(b \perp b'\). Let \(\pi \in b\) and \(\pi ' \in b'\) witness \(b \perp b'\), that is, one of the following holds:

  • \(\pi =\langle i,j \rangle \), \(\pi '=\langle i,j' \rangle \), and \(j \ne j'\).

  • \(\pi =\langle i,j \rangle \), \(\pi '=\langle i',j \rangle \), and \(i \ne i'\).

  • \(\{\pi ,\pi '\}=\{\langle i,j \rangle ,\langle j \rangle \}\).

Since \(b||\rho \) and \(b'||\rho \), we have \(\pi ',\pi \not \in \rho \), and therefore \(\pi \in b^{\rho }\) and \(\pi ' \in (b')^{\rho }\) follow. It implies \(b^{\rho } \perp (b')^{\rho }\).

Lastly, we show that \(b \mapsto b^{\rho }\) is surjective. Let \(r \in br(T^{\rho })\). By definition of \(T^{\rho }\), there exists a vertex v of T such that \(a||\rho \) and \(r=a^{\rho }\), where a is the partial injection induced by the path from the root to v in T. Let v be the most distant from the root among such. It suffices to show that v is actually a leaf of T, and therefore \(a \in br(T)\). Suppose v is not a leaf of T. If h is the height of v, then \(h < n-\#\rho \) by assumption. It implies \(\# a + \# \rho < n\). Hence, whatever the label of v is, there exists an edge from v whose label is compatible with \(\rho \), contradicting the maximality of v. \(\square \)

Definition 43

In the setting of the previous Lemma, we denote the inverse mapping by;

$$\begin{aligned} br(T^{\rho }) \rightarrow \{b \in br(T) \mid b || \rho \};\ r \mapsto r^{-\rho }. \end{aligned}$$

Definition 44

Let \(\rho \in \mathcal {M}^{m}_{n}\) (\(m>n\)). For \(\{r_{ij}\}_{i \in D_{m}, j\in R_{n}}\)-propositional formula \(\varphi \) (by a natural identification of variables, we regard each variable \(r_{ij}\) is utilized to construct the propositional formula \(injPHP^{m}_{n}\)), we define the restriction \(\varphi ^{\rho }\) by applying \(\varphi \) the following partial assignment: for each \(i \in [m]\) and \(j \in [n]\),

$$\begin{aligned} r_{ij} \mapsto {\left\{ \begin{array}{ll} 1 &{}\quad (\text{ if }\, \langle i,j \rangle \in \rho )\\ 0 &{}\quad (\text{ if }\, \{\langle i,j \rangle \} \perp \rho )\\ r_{ij} &{}\quad (\text{ otherwise}) \end{array}\right. } \end{aligned}$$

For a set \(\Gamma \) of \(\{r_{ij}\}_{i \in D_{m}, j\in R_{n}}\)-propositional formulae, define

$$\begin{aligned} \Gamma ^{\rho }:=\{\varphi ^{\rho }\mid \varphi \in \Gamma \}. \end{aligned}$$

Example 45

Let \(\rho \in \mathcal {M}^{m}_{n}\) (\(m>n\)). Then, by suitable change of variables, \((injPHP^{m}_{n})^{\rho }\) is equivalent to \(injPHP^{m-\#{{\,\textrm{dom}\,}}(\rho )}_{n-\#{{\,\textrm{ran}\,}}(\rho )}\) (over \(AC^{0}\)-Frege system, mod poly(mn)-sized proofs).

Definition 46

Let \(m>n\). Let T be an injPHP-tree over \((D_{m},R_{n})\) with height h. Given a set \(S \subseteq br(T)\) and a family \((T_{b})_{b \in br(T)}\) of injPHP-trees where each \(T_{b}\) is over \((D_{m}\setminus {{\,\textrm{dom}\,}}(b),R_{n} \setminus {{\,\textrm{ran}\,}}(b))\), we define the concatenated tree

$$\begin{aligned} T*\sum _{b \in S} T_{b} \end{aligned}$$

as follows: for each \(b \in S\), concatenate \(T_{b}\) under b in T identifying the leaf of b and the root of \(T_{b}\) (and leaving the label of the root of \(T_{b}\)).

For two injPHP-trees T and U over \((D_{m},R_{n})\), we define

$$\begin{aligned} T*U:= T*\sum _{b \in br(T)} U^{b}. \end{aligned}$$

Lemma 47

In the setting of the previous Definition, the following map is a bijection:

$$\begin{aligned}&\{( b,b' ) \mid b \in S,\ b' \in br(T_{b}) \}\sqcup (br(T) \setminus S) \rightarrow br\left( T*\sum _{b \in S} T_{b}\right) ;\\&\quad {\left\{ \begin{array}{ll} ( b,b' ) &{}\mapsto bb' \\ b \in br(T) \setminus S &{}\mapsto b \end{array}\right. }. \end{aligned}$$

Proof

Clear. \(\square \)

Definition 48

Let \(\Gamma \) be a subformula closed set of \(\{r_{ij}\}_{i \in D_{m}, j\in R_{n}}\)-formulae (\(m>n\)). A k-evaluation (using injPHP-trees) of \(\Gamma \) is a map

$$\begin{aligned} T_{\cdot } :\varphi \in \Gamma \mapsto T_{\varphi } \end{aligned}$$

satisfying the following:

  1. 1.

    Each \(T_{\varphi }\) is a labeled injPHP-tree over \((D_{m},R_{n})\) with label set \(\{0,1\}\) and height \(\le k\).

  2. 2.

    \(T_{0}\) is the injPHP-tree with height 0, whose only branch is labeled by 0.

  3. 3.

    \(T_{1}\) is the injPHP-tree with height 0, whose only branch is labeled by 1.

  4. 4.

    \(T_{r_{ij}}\) is the injPHP-tree over \((D_{m},R_{n})\) with height 1, whose label of the root is \(i \mapsto ?\) and \(br_{1}(T_{r_{ij}})=\{\langle i,j \rangle \}\).

  5. 5.

    \(T_{\lnot \varphi } = T_{\varphi }^{c}\), that is, \(T_{\lnot \varphi }\) is obtained from \(T_{\varphi }\) by flipping the labels 0 and 1.

  6. 6.

    \(T_{\bigvee _{i \in I} \varphi _{i}}\) (where each \(\varphi _{i}\) does not begin from \(\vee \)) represents \(\bigcup _{i \in I} br_{1}(T_{\varphi _{i}})\). Here, we say a \(\{0,1\}\)-labeled injPHP-tree T represents a set \(\mathcal {F}\) of partial injections if and only if the following hold:

    1. (a)

      For each \(b \in br_{1}(T)\), there exists a \(\sigma \in \mathcal {F}\) such that \(\sigma \subseteq b\).

    2. (b)

      For each \(b \in br_{0}(T)\), every \(\sigma \in \mathcal {F}\) satisfies \(\sigma \perp b\).

Example 49

Given a list \(F=\{ \sigma _{1}, \ldots , \sigma _{N}\}\), where \(\sigma _{1}, \ldots , \sigma _{N}\) are partial injections from D to R, we define the \(\{0,1\}\)-labeled injPHP-tree \(T_{F}\) over (DR) inductively as follows:

  1. 1.

    If F is empty, \(T_{F}:= T_{0}\).

  2. 2.

    If some \(\sigma _{i}\) is an empty map, \(T_{F}:= T_{1}\).

  3. 3.

    Otherwise, ask where to go for each \(v \in v(\sigma _{1})\). Let T be the obtained injPHP-tree.

  4. 4.

    For each branch \(b \in br(T)\), consider \(F^{b}\) below:

    $$\begin{aligned} F^{b}:=\{ \sigma _{i}^{b} \mid \sigma _{i} || b\}. \end{aligned}$$

    Construct \(T_{F^{b}}\) over \((D{\setminus } {{\,\textrm{dom}\,}}(b), R{\setminus } {{\,\textrm{ran}\,}}(b))\) inductively, and set

    $$\begin{aligned} T_{F}:= T*\sum _{b \in br(T)} T_{F^{b}}. \end{aligned}$$

\(T_{F}\) clearly represents F (we regard F as a set here).

Definition 50

Let \(T=(S,L)\) be a labeled injPHP-tree over \((D_{m},R_{n})\) (\(m>n\)). Let \(\rho \in \mathcal {M}^{m}_{n}\) and assume \(height(T) \le n- \#\rho \). Then the restricted labeled injPHP-tree \(T^{\rho }=(S',L')\) is defined as follows:

$$\begin{aligned} S':=S^{\rho },\ L'(r):=L(r^{-\rho }). \end{aligned}$$

Example 51

If a \(\{0,1\}\)-labeled injPHP-tree T over \((D_{m},R_{n})\) (\(m>n\)) represents \(\mathcal {F}\), \(\rho \in \mathcal {M}^{m}_{n}\), and \(height(T) \le n- \#\rho \), then \(T^{\rho }\) represents \(\mathcal {F}^{\rho }\). Indeed, for \(r \in br_{1}(T^{\rho })\), there exists \(\sigma \in \mathcal {F}\) such that \(\sigma \subseteq r^{-\rho }\). Hence \(\sigma || \rho \) and it gives \(\sigma ^{\rho } \in \mathcal {F}^{\rho }\), \(\sigma ^{\rho } \subseteq b^{\rho }\). On the other hand, for \(r \in br_{0}(T^{\rho })\), each \(\sigma \in \mathcal {F}\) satisfies \(\sigma \perp r^{-\rho }\). Therefore, for all \(\sigma \) such that \(\sigma || \rho \), \(\sigma ^{\rho } \perp (r^{-\rho })^{\rho }=r\) holds.

Proposition 52

Let \(T_{\cdot }\) be a k-evaluation of a subfomula-closed set \(\Gamma \) of \(\{r_{ij}\}_{i \in D_{m},j \in R_{n}}\)-formulae (\(m>n\)), \(\rho \in \mathcal {M}^{m}_{n}\), \(k \le n-\#\rho \).

Consider

$$\begin{aligned} U_{\varphi }:=(T_{\varphi })^{\rho }. \end{aligned}$$

Note that the RHS is the restricted labeled injPHP-tree.

\(U_{\varphi }\) is an injPHP-tree over \((D_{m} {\setminus } {{\,\textrm{dom}\,}}(\rho ), R_{n} {\setminus } {{\,\textrm{ran}\,}}(\rho ))\), which can be regarded as \((D_{m-\#{{\,\textrm{dom}\,}}(\rho )},R_{n-\#{{\,\textrm{ran}\,}}(\rho )})\). In particular, we can regard \(U_{\cdot }\) as a k-evaluation of \(\Gamma ^{\rho }\) of \(\{r_{ij}\}_{i \in D_{m-\#{{\,\textrm{dom}\,}}(\rho )},j \in R_{n-\#{{\,\textrm{ran}\,}}(\rho )}}\)-formulae.

Proof

Clear. \(\square \)

Theorem 53

Let \(f :\mathbb {N}\rightarrow \mathbb {N}\) be a function satisfying \(n<f(n) \le n^{O(1)}\). Suppose \((\pi _{n})_{n \ge 1}\) be a sequence of Frege-proofs such that \(\pi _{n}\) proves \(injPHP^{f(n)}_{n}\) using \(UCP^{l,d}_{k}\) as an axiom scheme.

Then there cannot be a sequence \((T^{n})_{n \ge 1}\) satisfying the following: each \(T^{n}\) is an o(n)-evaluation of \(\Gamma _{n}\) using injPHP-trees over \((D_{f(n)},R_{n})\), where \(\Gamma _{n}\) is the set of all subformulae appearing in \(\pi _{n}\).

Proof

Suppose otherwise. There exist o(n)-evaluations \(T^{n}\) of \(\Gamma _{n}\). Fix a large enough n, and we suppress the superscript n of \(T^{n}\), and denote it simply by T.

For \(\varphi \in \Gamma _{n}\), define

$$\begin{aligned} T \models \varphi :\Leftrightarrow br_{1}(T_{\varphi })=br(T_{\varphi }). \end{aligned}$$

Then, we can show the following claims analogously with Lemma 15.1.7 and Lemma 15.1.6 in [13]:

Claim 54

There exists a constant \(c>0\), depending only on the formalization of the Frege system we use, such that the following holds: if \(\varphi \) is derived from \(\psi _{1}, \ldots , \psi _{k} \in \pi _{n}\) by a Frege rule in \(\pi _{n}\), \(\forall i \in [k].T \models \psi _{i}\), and \(\forall i \in [k]. height(T_{\varphi }) \le n/c\), then \(T \models \varphi \).

Claim 55

\(br_{1}(T_{injPHP^{f(n)}_{n}})= \emptyset \). In particular, \(T \not \models injPHP^{f(n)}_{n}\).

Also, the following fact is useful:

Claim 56

$$\begin{aligned} T \models \bigwedge _{i=1}^{L} \varphi _{i} \Longrightarrow \forall i\in [L].\ T \models \varphi _{i}. \end{aligned}$$

Proof of the claim

The hypothesis means

$$\begin{aligned} T \models \lnot \bigvee _{i=1}^{L} \lnot \varphi _{i}, \end{aligned}$$

that is,

$$\begin{aligned} br_{0}(T_{\bigvee _{i=1}^{L} \lnot \varphi _{i}}) = br(T_{\bigvee _{i=1}^{L} \lnot \varphi _{i}}). \end{aligned}$$

Therefore, for each \(b \in br(T_{\bigvee _{i=1}^{L} \lnot \varphi _{i}})\) and \(r \in br_{1}(T_{\lnot \varphi _{i}})\), \(b \perp r\) holds. Now, assume some \(br_{0}(T_{\varphi _{i}})\) is nonempty, and \(r_{0}\) be one of its elements. Since

$$\begin{aligned} \#r_{0} \le o(n) \ \text{ and } \ height(T_{\bigvee _{i=1}^{L} \lnot \varphi _{i}}) \le o(n), \end{aligned}$$

there exists a branch \(b \in br(T_{\bigvee _{i=1}^{L} \lnot \varphi _{i}})\) such that \(b||r_{0}\), which is a contradiction. \(\square \)

Therefore, there exists an instance

$$\begin{aligned} I= UCP^{l,d}_{k}[\psi _{i,j,e}/r_{i,j,e}] \end{aligned}$$

in \(\pi _{n}\) such that \(T \not \models I\). Hence, \(k \not \equiv 0 \pmod d\). By restricting the formulae and T by some \(\rho \in br_{0}(T_{I})\), we obtain the proof \(\pi _{n}^{\rho }\) of \((injPHP^{n+1}_{n})^{\rho }\) and the o(n)-evaluation \(T^{\rho }\) of \(\Gamma _{n}^{\rho }\) (note that \(\#\rho \le o(n)\), hence \(T^{\rho }\) is well-defined). Therefore, we may assume that \(br_{0}(T_{I})=br(T_{I})\).

Let

$$\begin{aligned} S_{i,j,e}&:= T_{\psi _{i,j,e}},\\ S_{e}&:= T_{\bigvee _{(i,j) \in [l] \times [d]}\psi _{i,j,e}},\\ S_{i,j}&:= T_{\bigvee _{e \in [k]} \psi _{i,j,e}},\\ P_{i}&:= T_{\bigvee _{j \in [d]} \lnot \bigvee _{e \in [k]}\psi _{i,j,e}},\\ N_{i}&:= T_{\bigvee _{j \in [d]} \bigvee _{e \in [k]}\psi _{i,j,e}},\\ U_{i}&:= T_{(\lnot \bigvee _{j \in [d]} \lnot \bigvee _{e \in [k]}\psi _{i,j,e}) \vee (\lnot \bigvee _{j \in [d]} \bigvee _{e \in [k]}\psi _{i,j,e})}.\\&(i \in [l], j \in [d], e \in [k]) \end{aligned}$$

Since \(br_{0}(T_{I})=br(T_{I})\), we observe the following facts:

  1. 1.

    For each \(e \in [k]\), \(T \models \bigvee _{(i,j) \in [l] \times [d]}\psi _{i,j,e}\), that is, every \(b \in br(S_{e})\) has \((i,j) \in [l] \times [d]\) and \(b' \in br_{1}(S_{i,j,e})\) such that \(b' \subseteq b\).

  2. 2.

    For each \(e \in [k]\) and \((i,j) \ne (i^{\prime }, j^{\prime }) \in [l] \times [d]\), every pair of branches \(b \in br_{1}(S_{i,j,e})\) and \(b^{\prime } \in br_{1}(S_{i^{\prime },j^{\prime },e})\) satisfies \(b \perp b^{\prime }\).

  3. 3.

    For each \(e \ne e^{\prime } \in [k]\) and \((i,j) \in [l] \times [d]\), each \(b \in br_{1}(S_{i,j,e})\) and \(b^{\prime } \in br_{1}(S_{i,j,e^{\prime }})\) satisfies \(b \perp b^{\prime }\).

  4. 4.

    For each \(i \in [l]\), \(T \models (\lnot \bigvee _{j \in [d]} \lnot \bigvee _{e \in [k]}\psi _{i,j,e}) \vee (\lnot \bigvee _{j \in [d]} \bigvee _{e \in [k]}\psi _{i,j,e})\), that is, every \(b \in br(U_{i})\) is an extension of some \(b^{\prime } \in br_{1}(P_{i}^{c})\) or some \(b^{\prime } \in br_{1}(N_{i}^{c})\). In the former case, b is incompatible with every \(b^{\prime \prime }\in \bigcup _{j}br_{0}(S_{i,j})\). In the latter case, b is incompatible with every \(b^{\prime \prime } \in \bigcup _{j,e}br_{1}(S_{i,j,e})\). Therefore, the two cases are mutually disjoint. Indeed, if b satisfies the both cases, then take \(b^{\prime } \in br(S_{i,j})\) such that \(b || b^{\prime }\) (which exists since \(\#b\) and \(height(S_{i,j})\) are both o(n)). It follows that \(b^{\prime } \in br_{1}(S_{i,j})\), and therefore \(b^{\prime }\) is an extension of some \(b^{\prime \prime } \in \bigcup _{e}br_{1}(S_{i,j,e})\), which contradicts the observation of the latter case.

With the observations above, we construct labeled injPHP-trees \((X_{i,j})_{(i,j) \in [l] \times [d]}\) and \((Y_{e})_{e \in [k]}\) as follows

  • We define \(Y_{e}\) for fixed e first. Consider \(S_{e}\). By observation 1 and 2, each \(b \in br(S_{e})\) has a unique \((i_{b},j_{b}) \in [l] \times [d]\) such that b is an extension of some \(b^{\prime } \in br_{1}(S_{i_{b},j_{b},e})\). Consider the tree

    $$\begin{aligned} S_{e}*\sum _{b \in br(S_{e})}(U_{i_{b}}*S_{i_{b},j_{b}})^{b} \end{aligned}$$

    (here, we have concatenated the trees, ignoring their labels).

       Label each branch extending \(b \in br(S_{e})\) with \(\langle i_{b}, j_{b},e\rangle \). Let \(Y_{e}\) be the resulting labeled injPHP-tree. Note that \(height(Y_{e})\) is still o(n).

  • Next, we define \(X_{i,j}\) for fixed \((i,j) \in [l]\times [d]\). Consider \(U_{i}\). Let \(B \subseteq br(U_{i})\) be the set of all \(b \in br(U_{i})\) satisfying the former case of observation 4. Consider the tree \(\widetilde{X}_{i,j}:= U_{i}*\sum _{b \in B}S_{i,j}^{b}\). Each branch \(\widetilde{b} \in br(\widetilde{X}_{i,j})\) satisfies one of the following:

    1. 1.

      \(\widetilde{b} \in br(U_{i}) \setminus B\).

    2. 2.

      Otherwise, by Lemma 42, we can decompose \(\widetilde{b}=bs^{b}\) (\(b \in B, s \in br(S_{i,j})\)). Since b is an extension of some \(b^{\prime } \in br_{1}(P_{i}^{c})\), we have \(br_{1}(S_{i,j}^{b})=br(S_{i,j}^{b})\). It implies \(s \in br_{1}(S_{i,j})\). Therefore, by observation 3, each \(s^{b} \in br(S_{i,j}^{b})\) has a unique \(e_{\widetilde{b}} \in [k]\) such that s extends some \(b^{\prime \prime } \in br_{1}(S_{i,j,e_{\widetilde{b}}})\).

    Let \(\widetilde{B}\) be the all branches of \(\widetilde{X}_{i,j}\) satisfying the second item. We define

    $$\begin{aligned} \widehat{X}_{i,j} := \widetilde{X}_{i,j}*\sum _{\widetilde{b} \in \widetilde{B}} (S_{e_{\widetilde{b}}})^{\widetilde{b}}. \end{aligned}$$

    We label each branch \(b \in br(\widehat{X}_{i,j})\) as follows and define \(X_{i,j}\) to be the obtained labeled injPHP-tree.

    1. 1.

      If \(b \in br(U_{i}) \setminus B\), then label it with the symbol \(\perp \).

    2. 2.

      Otherwise, there exists a unique \(\widetilde{b} \in \widetilde{B}\) such that \(\widetilde{b} \subseteq b\). Label the branch b with \(\langle i, j, e_{\widetilde{b}} \rangle \).

By observation 3, we see that \((X_{i,j})_{i \in [l],j \in [d]}\) and \((Y_{e})_{e \in [k]}\) satisfy the following:

  • For each \(ibr_{\perp }(X_{i,1})= \cdots = br_{\perp }(X_{i,d})\).

  • For each ije, \(br_{\langle i,j,e \rangle }(X_{i,j})= br_{\langle i,j,e \rangle }(Y_{e})\) (as sets of partial injections).

The second item is justified as follows. By Lemmas 42, 47, and the definitions of \(Y_{e}\) and \(X_{i,j}\), the following are bijections for each ije:

$$\begin{aligned}&\Bigg \{ \left( b,\beta ,s \right) \in br(S_{e}) \times br(U_{i}) \times br(S_{ij}) \mid \exists b'\in br_{1}(S_{i,j,e}).(b' \subseteq b),\ \beta || s,\ b||\beta s \Bigg \} \\&\quad \rightarrow br_{\langle i,j,e \rangle }(Y_{e});\ ( b,\beta ,s ) \mapsto b(\beta s).\\&\Bigg \{ \left( \beta ,s,b \right) \in br(U_{i}) \times br(S_{i,j}) \times br(S_{e}) \mid \begin{array}{l} \exists \hat{b}\in br_{1}(P_{i}^{c}).(\hat{b} \subseteq \beta ),\\ \exists b'' \in br_{1}(S_{i,j,e}). (b'' \subseteq s),\\ \beta || s ,\ \beta s || b \end{array} \Bigg \}\\&\quad \rightarrow br_{\langle i,j,e \rangle }(X_{i,j});\ (\beta ,s,b ) \mapsto (\beta s)b. \end{aligned}$$

Hence, in order to see \(br_{\langle i,j,e \rangle }(X_{i,j})=br_{\langle i,j,e \rangle }(Y_{e})\), it suffices to show the following: let \(\beta \in br(U_{i})\), \(s \in br(S_{i,j})\), \(b \in br(S_{e})\). If \(\beta || s\) and \(\beta s || b\), then the following are equivalent:

  1. 1.

    There exists \(b'\in br_{1}(S_{i,j,e})\) such that \(b' \subseteq b\).

  2. 2.

    \(\beta \) is an extension of some \(\hat{b}\in br_{1}(P_{i}^{c})\), and s is an extension of some \(b'' \in br_{1}(S_{i,j,e})\).

We consider (1) \(\Rightarrow \) (2) first. Suppose \(\beta \) is not an extension of any \(\hat{b}\in br_{1}(P_{i}^{c})\). \(\beta \in br(U_{i})\), so it must be an extension of some \(r \in br_{1}(N_{i}^{c})=br_{0}(N_{i})\) by observation 4 above. Thus we have \(r \perp b'\), and therefore \(\beta \perp b\), contradicting the assumption \(\beta s || b\). Hence, \(\beta \) is an extension of some \(\hat{b}\in br_{1}(P_{i}^{c})=br_{0}(P_{i})\). Then, by observation 4 and the assumption \(\beta || s\), we have \(s \in br_{1}(S_{i,j})\), which means s is an extension of some \(b'' \in br_{1}(S_{i,j,e})\). This finishes the proof of (2) \(\Rightarrow \) (2).

Next, we consider the converse. By observation 1, there exist \(i',j'\) and \(b' \in br(S_{i',j',e})\) such that \(b' \subseteq b\). Then we have \(b' || b''\) since b||s follows from \(\beta s || b\). By observation 2, \((i,j)=(i',j')\) and \(b'=b''\) follows. Since \(b'' \in br_{1}(S_{i,j,e})\) by assumption, we have the converse.

Now, we construct a low-degree NS-refutation of \(\lnot inj^{*}PHP^{f(n)-\#{{\,\textrm{dom}\,}}(\rho )}_{n-\# {{\,\textrm{ran}\,}}(\rho )}\). By the properties of \((X_{i,j})_{i \in [l], j \in [d]}\) and \((Y_{e})_{e \in [k]}\), the following holds:

$$\begin{aligned} \sum _{(i,j) \in [l]\times [d]}\sum _{\alpha \in br(X_{i,j})} x_{\alpha } = \sum _{e \in [k]} \sum _{\beta \in br(Y_{e})} x_{\beta } \pmod d. \end{aligned}$$

(Here, for a partial injection \(\rho \), \(x_{\rho }:= (\prod _{\langle p,h\rangle \in \rho _{bij}}x_{p,h})( \prod _{\langle h \rangle \in \rho _{sing}}u_{h})\)).

It is easy to see that for any injPHP-tree T over \((D_{M},R_{m})\), \(\sum _{\alpha \in br(T)}x_{\alpha }=1\) has a NS-refutation from \(\lnot inj^{*}PHP^{M}_{m}\) with degree \(\le height(T)\). Hence, we obtain a NS-proof of \(k=0\) from \(\lnot inj^{*}PHP^{f(n)-\#{{\,\textrm{dom}\,}}(\rho )}_{n-\#{{\,\textrm{ran}\,}}(\rho )}\) over \(\mathbb {Z}/d\mathbb {Z}\) with degree o(n). Since \(k \not \equiv 0 \pmod d\), it is a NS-refutation of \(\lnot inj^{*}PHP^{f(n)-\#{{\,\textrm{dom}\,}}(\rho )}_{n-\#{{\,\textrm{ran}\,}}(\rho )}\) over \(\mathbb {Z}/d\mathbb {Z}\) with degree o(n). However, it contradicts Proposition 23 and Corollary 25.

\(\square \)

Setting \(f(n):=n+1\) and taking the contrapositive of Theorem 53, we obtain that an analogue of switching lemma for injPHP-trees suffices to prove Conjecture 1:

Corollary 57

Assume \(F_{c} + UCP^{l,d}_{k} \vdash _{poly(n)} injPHP^{n+1}_{n}\) is witnessed by \(AC^{0}\)-proofs \((\pi _{n})_{n \ge 1}\). Suppose there are partial injections \((\rho _{n})_{n \ge 1}\) satisfying

  • For each n, \(\rho _{n} \in \mathcal {M}^{n+1}_{n}\).

  • \(n-\#{{\,\textrm{ran}\,}}(\rho _{n}) \rightarrow \infty \) \((n \rightarrow \infty )\).

  • There exist \(o(n-\#{{\,\textrm{ran}\,}}(\rho _{n}))\)-evaluations \((T^{n})_{n\ge 1}\) of \(\Gamma _{n}^{\rho }\), where \(\Gamma _{n}\) is the all subformulae appearing in \(\pi _{n}\).

Then, we obtain a contradiction.

Note that an analog of switching lemma would give how to construct such \(\rho _{n}\) above.

Remark 58

It will be good if we can prove such an analog for injPHP-trees; in that case, we can apply the previous theorem and prove that Conjecture 1 is valid. However, it seems implementing this approach is beyond the current proof techniques. The difficulty is relevant to that of the famous open problem: does \(V^{0} \vdash injPHP^{2n}_{n}\) hold?

4 On the strength of the oddtown theorem

The oddtown theorem is a combinatorial principle stating that there cannot be \(n+1\) orthogonal normal vectors in \(\mathbb {F}_{2}^{n}\). In other words, (regarding each \(v \in \mathbb {F}^{n}_{2}\) as the characteristic vector of a subset \(S \subseteq [n]\)) there cannot be a family \((S_{i})_{i \in [n+1]}\) satisfying the following:

  • Each \(S_{i}\) has an odd cardinality.

  • Each \(S_{i} \cap S_{i^{\prime }}\) (\(i < i^{\prime }\)) has an even cardinality.

Historically, the oddtown theorem and Fisher’s inequality (introduced in Sect. 5) were proposed as candidates for statements that are easy to prove in the extended Frege system but not in the Frege system [6]. Standard proofs of them use linear-algebra methods, such as evaluating the number of linearly independent vectors of a given linear space, which do not seem easy to formalize in the bounded arithmetic \(VNC^{1}\) (corresponding to polynomial-sized Frege proofs), and this was why they and some other combinatorial principles were presented as above candidates.

Actually, around the time [6] was published, a linear-algebra-free proof of Fisher’s inequality was already discovered, and I did not know it. See Sect.  5 for details and acknowledgment.

On the other hand, upper bounds of linear algebra methods were also given in a series of works. Soltys [18] and Soltys and Cook [19] gave natural bounded arithmetics \({\textbf {LA}}\) and \({\textbf {LA}}\texttt{P}\) capable of formalizing elementary arguments in linear algebra, formalized the matrix determinant by Berkowitz algorithm [5], and showed the following:

  • Cayley-Hamilton, co-factor expansion, and an axiomatic definition of determinants (multi-linear, alternating, and \(\det (I)=1\) for unit matrices I) are equivalent over \({\textbf {LA}}\texttt{P}\).

  • Multiplicativity of determinants implies all above in \({\textbf {LA}}\texttt{P}\).

Hrubes and Tzameret [10] and Tzameret and Cook [20] were recent breakthroughs in the area.

Hrubes and Tzameret [10] gave a short proof of multiplicativity of determinants of \(\mathbb {F}\)-matrices in the proof system \(P_{c}(\mathbb {F})\), which manipulates arithmetical circuits, for general coefficient fields \(\mathbb {F}\). The results and the arguments in [10] also imply multiplicativity of determinant for \(\mathbb {Z}\)-matrices (or, equivalently, \(\mathbb {Q}\)-rational matrices) has quasipolynomial-sized Frege proofs. See the introduction of [10] for details.

Tzameret and Cook [20] carefully formalized the argument for \(\mathbb {Z}\)-matrices in \({\textbf {VNC}}^{2}\). Using the result of [20], we can show that \({\textbf {VNC}}^{2}\) proves the oddtown theorem, and therefore propositional translations have quasipolynomial-sized Frege proofs. See Proposition .

However, there is still room to investigate the precise strengths of the oddtown theorem and Fisher’s inequality in terms of bounded reverse mathematics. In this section, we focus on lower bounds for the oddtown theorem. We observe that a natural formalization of the oddtown theorem over \(V^{0}\) is stronger than several combinatorial principles related to counting. Then, we try to analyze the relation between the oddtown theorem and \(Count^{p}_{n}\) for p, which is not a 2-power.

Definition 59

Define the \(\Sigma ^{B}_{0}\) \(\mathcal {L}^{2}_{A}\)-formula oddtown(nPQRS) as follows:

$$\begin{aligned} \lnot&[\forall i \in [n+1]. \forall j \in [n]. (S(i,j) \leftrightarrow Q(i,j) \vee \exists e \in [n]^{(2)}. (j \in ^{*} e \wedge P(i,e)) \\&\wedge \forall i \in [n+1]. \exists j \in [n]. Q(i,j)\\&\wedge \forall i \in [n+1]. \forall j \ne j^{\prime }\in [n].(\lnot Q(i,j) \vee \lnot Q(i,j^{\prime })) \\&\wedge \forall i \in [n+1]. \forall j\in [n]. \forall e \in [n]^{(2)}(j \in ^{*} e \rightarrow \lnot Q(i,j) \vee \lnot P(i,e)) \\&\wedge \forall i \in [n+1]. \forall e \ne e^{\prime } \in [n]^{(2)}(e \cap e^{\prime } \ne \emptyset \rightarrow \lnot P(i,e) \vee \lnot P(i,e^{\prime })) \\&\wedge \forall i< i^{\prime } \in [n+1]. \forall j \in [n]. (S(i,j) \wedge S(i^{\prime },j) \leftrightarrow \exists e \in [n]^{(2)} (j \in ^{*} e \wedge R(i,i^{\prime },e)) )\\&\wedge \forall i < i^{\prime } \in [n+1]. \forall e\ne e^{\prime } \in [n]^{(2)}. (e \cap e^{\prime } \ne \emptyset \rightarrow \lnot R(i,i^{\prime },e) \vee \lnot R(i,i^{\prime },e^{\prime }))] \end{aligned}$$

Intuitively, S above gives sets \(S_{i}:=\{ j \in [n] \mid S(i,j)\}\), P gives a 2-partition of each \(S_{i}\) leaving one element, which is specified by Q, and R gives a 2-partition of each \(S_{i} \cap S_{i^{\prime }}\) (\(i < i^{\prime }\)).

Definition 60

Define the propositional formula \(oddtown_{n}\) as follows:

$$\begin{aligned}&oddtown_{n}:= {\left\{ \begin{array}{ll} &{}1 \quad (n=0)\\ \lnot [ &{}\bigwedge _{i \in [n+1]} \bigwedge _{j \in [n]} (\lnot s_{ij} \vee q_{ij} \vee \bigvee _{e: j \in e \in [n]^{(2)}} p_{ie}) \\ &{} \wedge \bigwedge _{i \in [n+1]} \bigwedge _{j \in [n]} (s_{ij} \vee \lnot q_{ij})\\ &{} \wedge \bigwedge _{i \in [n+1]} \bigwedge _{j \in [n]}\bigwedge _{e: j \in e \in [n]^{(2)}} (s_{ij} \vee \lnot p_{ie})\\ &{} \wedge \bigwedge _{i \in [n+1]} \bigvee _{j \in [n]} q_{ij}\\ &{}\wedge \bigwedge _{i \in [n+1]} \bigwedge _{j< j^{\prime } \in [n]} (\lnot q_{ij} \vee \lnot q_{ij^{\prime }}) \\ &{} \wedge \bigwedge _{i \in [n+1]} \bigwedge _{j \in [n]} \bigwedge _{e:j \in e \in [n]^{(2)}} (\lnot q_{ij} \vee \lnot p_{ie})\\ &{}\wedge \bigwedge _{i \in [n+1]} \bigwedge _{e\perp e^{\prime } \in [n]^{(2)}} (\lnot p_{ie} \vee \lnot p_{ie^{\prime }})\\ &{} \wedge \bigwedge _{i<i^{\prime } \in [n+1]} \bigwedge _{j \in [n]} (\lnot s_{ij} \vee \lnot s_{i^{\prime }j} \vee \bigvee _{e: j \in e \in [n]^{(2)}} r_{ii^{\prime }e}) \\ &{} \wedge \bigwedge _{i<i^{\prime } \in [n+1]} \bigwedge _{j \in [n]}\bigwedge _{e: j \in e \in [n]^{(2)}} ( s_{ij} \vee \lnot r_{ii^{\prime }e}) \\ &{} \wedge \bigwedge _{i<i^{\prime } \in [n+1]} \bigwedge _{j \in [n]}\bigwedge _{e: j \in e \in [n]^{(2)}} ( s_{i^{\prime }j} \vee \lnot r_{ii^{\prime }e}) \\ &{} \wedge \bigwedge _{i<i^{\prime } \in [n+1]} \bigwedge _{e \perp e^{\prime } \in [n]^{(2)}} (\lnot r_{ii^{\prime }e} \vee \lnot r_{ii^{\prime }e^{\prime }}) ] \quad (n \ge 1) \end{array}\right. } \end{aligned}$$

By a reason similar to that of Convention 3, we abuse the notation and write \(oddtown_{n}\) to express oddtown(nPQRS), too.

We first observe an upper bound of the strength of \(oddtown_{n}\):

Proposition 61

(A corollary of [20])

$$\begin{aligned} {\textbf {VNC}}^{2} \vdash oddtown(n,P,Q,R,S). \end{aligned}$$

For the definition of \({\textbf {VNC}}^{2}\), see [8].

Proof Sketch

Tzameret and Cook [20] showed that there exists a \(\Sigma ^{B}_{1}\)-definition \(\det (A)=N\) of the matrix determinant, where both A and N are of set-sort, N is meant to be a binary representation of an integer, and A is meant to code a square matrix with integer coefficients, where each coefficient is represented by a binary expression, such that \({\textbf {VNC}}^{2}\) proves:

  • Let A and B be strings coding square matrices with integer coefficients of the same size \((k \times k)\). Then \(\det (A)\det (B)=\det (AB)\).

  • \(\det (A)\) can be computed by co-factor expansion concerning any row or column.

Focusing on the last digit of the integers involved, we can see that there exists a \(\Sigma ^{B}_{1}\)-definition \(\det _{\mathbb {F}_{2}}(A)=b\), where \(b \in \{0,1\}\), and A is of set-sort, meant to code a square matrix with \(\mathbb {F}_{2}\)-coefficients satisfying the above two items.

Armed with this, we work in \({\textbf {VNC}}^{2}\) and show \(oddtown_{n}\) as follows. Towards a contradiction, suppose nPQRS violate oddtown(nPQRS). Let M be an \(\mathbb {F}_{2}\) matrix of size \((n+1)\times n\) given by

$$\begin{aligned} M_{ij}:= {\left\{ \begin{array}{ll} 1 \quad &{}(\text{ if }\, S(i,j)\, \text{ holds})\\ 0 \quad &{}(\text{ otherwise}) \end{array}\right. }. \end{aligned}$$

Since oddtown(nPQRS) is violated, we have \(MM^{t}=I_{n+1}\) as \(\mathbb {F}_{2}\)-matrices, where \(I_{n+1}\) is a unit matrix of size \((n+1)\times (n+1)\). Note that \({\textbf {VNC}}^{2}\) is an extension of \({\textbf {VTC}}^{0}\) and can count cardinalities of bounded sets, which is sufficient to prove \(MM^{t}=I_{n+1}\). Consider the matrix A obtained by padding M by a zero column vector. A is of size \((n+1)\times (n+1)\), and \(AA^{t}=I_{n+1}\). Taking determinants of both sides and applying multiplicativity of determinants, we have \(\det _{\mathbb {F}_{2}}(A)\det _{\mathbb {F}_{2}}(A^{t})=\det _{\mathbb {F}_{2}}I_{n+1}\). However, A has a zero column vector, and \(A^{t}\) has a zero row vector. Therefore, by co-factor expansions, we have \(\det _{\mathbb {F}_{2}}(A)=\det _{\mathbb {F}_{2}}(A^{t})=0\). On the other hand, it is elementary to see \(\det _{\mathbb {F}_{2}}(I_{n+1})=1\), a contradiction. \(\square \)

Corollary 62

There exist \(2^{O((\log n)^{2})}\)-sized Frege proofs of \(oddtown_{n}\).

Now, we focus on the lower bounds of the strength of \(oddtown_{n}\). We start with the following simple observations:

Proposition 63

  1. 1.

    \(V^{0} +oddtown_{k} \vdash injPHP^{n+1}_{n}\).

  2. 2.

    \(V^{0}+oddtown_{k} \vdash Count^{2}_{n}\).

Proof

We first prove 1. Argue in \(V^{0}\). We prove the contrapositive. Suppose there exists an injection \(f :[n+1] \rightarrow [n]\). Define

$$\begin{aligned} S_{i}:=\{f(i)\}. \end{aligned}$$

Then it violates \(oddtown_{n}\).

We next prove 2. Argue in \(V^{0}\). We prove the contrapositive. Suppose \([2n+1]\) is partitioned by 2-sets. Let R be the 2-partition. Then setting

$$\begin{aligned} S_{i}:=[2n+1] \quad (i \in [2n+2]), \end{aligned}$$

we can violate \(oddtown_{2n+2}\) since \([2n+1] \cap [2n+1]\) can be 2-partitioned by R while \([2n+1]\setminus \{2n+1\}\) has a natural 2-partition. \(\square \)

Remark 64

Observing the proof above, one may think that we might obtain another interesting formalization of the oddtown theorem imposing \(\{S_{i}\}_{i \in [n+1]}\) to be a family of distinct sets. Let \(oddtown^{\prime }\) be this version. It turns out that:

  1. 1.

    \(V^{0}+oddtown_{k} \vdash oddtown_{n}^{\prime }\).

  2. 2.

    \(V^{0}+oddtown_{k}^{\prime }+Count^{2}_{k} \vdash oddtown_{n}\).

  3. 3.

    \(V^{0}+oddtown_{k}^{\prime } \vdash Count^{2}_{n}\).

Hence, \(oddtown_{n}\) and \(oddtown^{\prime }_{n}\) have the same strength over \(V^{0}\).

The proof of the remark

The item 1 is clear.

We show the item 2. Work in \(V^{0}+oddtown_{k}^{\prime }+Count^{2}_{k}\). Assume \(\{S_{i}\}_{i=1}^{n+1}\) (where \(S_{i} \subseteq [n]\)) violates \(oddtown_{n}\). Since each \(S_{i} \cap S_{i^{\prime }}\) (\(i < i^{\prime }\)) is 2-partitioned, it follows that \(S_{i} \ne S_{i^{\prime }}\). Indeed, if \(S_{i}=S_{i^{\prime }}=:S\), then both S and \(S{\setminus } \{s_{0}\}\) (\(s_{0} \in S\)) is 2-partitioned by the hypothesis. Consider a straightforward bijection:

$$\begin{aligned} {[}2n-1] \cong ([n]\setminus S) \sqcup ([n]\setminus S) \sqcup S \sqcup (S \setminus \{s_{0}\}). \end{aligned}$$

The right-hand side gives a natural 2-partition using those of S and \(S \setminus \{s_{0}\}\), and it induces a 2-partition of the left-hand side, which violates \(Count^{2}_{2n-1}\).

Hence, it follows that each \(S_{i}\) is distinct. However, it contradicts \(oddtown_{n}^{\prime }\). This shows the item 2.

Lastly, we show the item 3. Argue in \(V^{0}\). Assume R is a 2-partition of \([2n+1]\). Note that R has a natural linear ordering induced by that of whole numbers. Define \(\{S_{i}\}_{i=1}^{2n+2}\) as follows: it is easy to see that R has at least four elements. Take distinct \(r_{1}, r_{2}, r_{3} \in R\). For each \(i = 1, \ldots , 2n+1\), take the unique j such that \(\{i,j\} \in R\). Then,

  1. Case1.

    If \(i<j\), set \(S_{i}:=[2n+1] \setminus \{i,j\}\).

  2. Case2.

    If \(i>j\), set \(S_{i}:=[2n+1] {\setminus } (\{i,j\} \cup s_{i})\), where \(s_{i}\) is the successor of \(\{i,j\}\) in R. If there is none (i.e. \(\{i,j\} =\max R\)), let \(s_{i}\) be \(\min R\).

Furthermore, we define \(S_{2n+2}:=[2n+1] {\setminus } (r_{1} \cup r_{2} \cup r_{3})\).

Now, we see that \(\{S_{i}\}_{i=1}^{2n+2}\) violates \(oddtown_{2n+2}\). Indeed, the \(S_{i}\) are distinct, we can take natural 2-partitions leaving one element for each \(S_{i}\), and we can obtain 2-partitions for each \(S_{i} \cap S_{j}\) (\(i<j\)) removing at most five elements from R. \(\square \)

By Theorems 8 and 9, we obtain

Corollary 65

$$\begin{aligned}&V^{0} + injPHP^{k+1}_{k} \not \vdash oddtown_{n}, \\&V^{0} + Count^{2}_{k} \not \vdash oddtown_{n}. \end{aligned}$$

This raises the following natural problems:

Question 3

  1. 1.

    \(V^{0}+ injPHP^{k+1}_{k} + Count^{2}_{k} \vdash oddtown_{n}\)? How about \(V^{0}+ GCP \vdash oddtown_{n}\)?

  2. 2.

    \(V^{0}+oddtown_{k} \vdash Count^{p}_{n}\) for which p?

The author cannot answer these questions for now. However, we tackle the item 2 in the rest of this section.

By Proposition 63 and Theorem 7, it is easy to see:

Corollary 66

If p is a power of 2, \(V^{0} + oddtown_{k} \vdash Count^{p}_{n}\).

The author conjectures that the converse of this corollary holds. Furthermore, the author conjectures the following:

Conjecture 2

For each \(d \in \mathbb {N}\) and a prime \(p \ne 2\),

$$\begin{aligned} F_{d}+oddtown_{k} \not \vdash _{poly(n)} Count^{p}_{n}. \end{aligned}$$

Using Theorem 7, it is easy to see that Conjecture 2 implies the converse of Corollary 66. We give a sufficient condition to prove Conjecture 2:

Theorem 67

Let \(p \in \mathbb {N}\) be a prime other than 2. Suppose

$$\begin{aligned} F_{d}+oddtown_{k} \vdash _{poly(n)} Count^{p}_{n}. \end{aligned}$$

Then, for any large enough \(n \not \equiv 0 \pmod p\), there exist \(m =n^{O(1)}\) and a family \((f_{ij})_{i \in [m+1], j \in [m]}\) of polynomials over \(\mathbb {F}_{2}\) such that the following equalities have NS-proofs over \(\mathbb {F}_{2}\) from \(\lnot Count^{p}_{n}\) with degree O(1):

$$\begin{aligned}&\sum _{j \in [m]}f_{ij} =1 \quad&(i \in [m+1])\\&\sum _{j \in [m]} f_{ij}f_{i^{\prime }j}=0 \quad&(i \ne i^{\prime } \in [m+1]) \end{aligned}$$

Here, \(\lnot Count^{p}_{n}\) (where \(n \not \equiv 0 \pmod p\)) means the following system of polynomials:

$$ \begin{aligned}&\sum _{e : j \in e \in [n]^{(p)}} x_{e}-1,\ x_{e}x_{e^{\prime }},\ x_{e}^{2}-x_{e}\\&(j \in [n] \ \& \ e, e^{\prime } \in [n]^{(p)} \ \& \ e \perp e^{\prime }) \end{aligned}$$

Hence, if we can prove a non-constant degree lower bound for NS-proofs for the above system of equations, then Conjecture 2 is true.

Before the proof, we present a strategy for obtaining the constant degree bound above. We assume the basics of partial p-partitions, p-trees, and k-evaluations using p-trees. See section 15.5 in [13] for references. We also use the notations br(T), \(T \models \varphi \), \(T*\sum _{b \in S} T_{b}\), etc. as the straightforward analogous meanings to the ones given in Sect. 3.

The power of the notion comes from the following so-called switching lemma:

Theorem 68

(Lemma 12.3.11 in [12]. Essentially by [14, 16]) Let \(p \ge 2\). Then there exists a constant c satisfying the following: let \(H_{1},\ldots , H_{N}\) be families of partial p-partitions of \([pn+r]\), where \(0 \le r<p\). Assume that \(\#H_{i} \le t \le s\) for every \(i \in [N]\), and

$$\begin{aligned} \frac{w^{s}}{[t\cdot (n-w)]^{c \cdot s}} > N, \end{aligned}$$
(5)

where \(w \le n\). Then there is a partial p-partition \(\rho \) with \(\# \rho = w\), such that for every \(i \in [N]\) there exists a p-tree with height at most \(p\cdot s\) representing \(H_{i}^{\rho }\).

Remark 69

In the book [12], \(r=1\) is assumed, but the proof is straightforwardly extended to treat general \(r < p\) since p is a fixed constant here.

Remark 70

In the book [12], the notion k-complete system is used instead of p-trees, and therefore the precise definition of k-evaluation in it is different from the one given in [13]. However, the proof of Lemma 12.3.11 in [12] essentially constructs canonical p-trees \(T_{i}\) representing \(H_{i}\)’s under “good” restrictions, and \(br(T_{i})\)’s are the k-complete systems argued in [12]. In other words, an inequality corresponding to (5) can be extracted from the proof of Theorem 15.2.2 in [13]. The intertwin between several terminologies and notions is concisely mentioned in §4.4 of [1].

In particular, when \(N=n^{O(1)}\), choosing a parameter \(0<\epsilon <1/c\) and putting \(w:=n-n^{\epsilon }\), we can satisfy the inequality (5) by choosing sufficiently large \(t=s=O(1)\). Applying Theorem 68 constantly many times, we obtain the following:

Corollary 71

(A counterpart of Theorem 12.4.3 in [12] when \(\# \Gamma \) is just \(n^{O(1)}\)) Let \(e,d \ge 1\). Then, for sufficiently small \(\epsilon >0\), there exists a constant \(k>0\) satisfying the following:

  • For sufficiently large n, if \(\Gamma \) is a set of formulae of depth \(\le d\) whose variables are those used in \(Count^{p}_{pn+r}\) (\(0 \le r < p\)), closed under the subformulae, and with size \(\#\Gamma \le n^{e}\), then there is a partial p-partition \(\rho \) with size \(n-n^{\epsilon }\) such that there exists a k-evaluation of \(\Gamma ^{\rho }\).

Here, \(n^{\epsilon }\) is rounded to an integer in an appropriate way.

Now, we move on to the proof of Theorem 67.

Proof of Theorem 67

In this proof, we assume \(p=3\) for readability. Let proofs \((\pi _{l})_{l \in \mathbb {N}}\) witness

$$\begin{aligned} F_{d}+oddtown_{k} \vdash _{poly(l)} Count^{3}_{l}. \end{aligned}$$

We focus on \(l \not \equiv 0 \pmod 3\). Let \(l=3n+r\) (\(0< r < 3\)). Let \(\Gamma _{l}\) be the all subformulae appearing in \(\pi _{l}\). Apply Corollary 71 for 3-trees, and obtain \(\epsilon > 0\) and a partial 3-partition \(\rho \) with size \(n^{\epsilon }\) such that there exists an O(1)-evaluation T of \(\Gamma _{l}^{\rho }\), where l is sufficiently large. Note that \((Count^{3}_{l})^{\rho }\) can be identified as \(Count^{3}_{3n^{\epsilon }+r}\).

It suffices to construct \((f_{ij})\) in the claim for these sufficiently large \(3n^{\epsilon }+r\) instead of general l. Indeed, given sufficiently large integer \(N \not \equiv 0 \pmod 3\), take an integer n satisfying

$$\begin{aligned} 3n^{\epsilon }+r < N \le 3(n+1)^{\epsilon }+r, \end{aligned}$$

where \(N \equiv r \pmod 3\) and \(0<r<3\). If \((f_{ij})_{i \in [m+1], j \in [m]}\) is as claimed for \(3(n+1)^{\epsilon }+r\), then applying a partial restriction given by a partial 3-partition leaving elements in [N] in the universe \([3(n+1)^{\epsilon }+r]\), we get \((f_{ij})_{i \in [m+1], j \in [m]}\) for N. Since

$$\begin{aligned} m=(3(n+1)^{\epsilon }+r)^{O(1)}=(3n^{\epsilon }+r)^{O(1)}=N^{O(1)}, \end{aligned}$$

the claim for general N follows.

Now, we focus on constructing \((f_{ij})\) in the claim for sufficiently large \(3n^{\epsilon }+r\). We renew the expression \(3n^{\epsilon }+r\) and write it as (new) n from now on for readability, refreshing our mind.

Recall that the evaluation T is sound for Frege rules in \(\pi _{n}\), and it satisfies \(T \not \models Count^{3}_{n}\) at the same time. It follows that some instance

$$\begin{aligned} I:=oddtown_{m}[\sigma _{ij}/s_{ij}, \tau _{ij}/q_{ij}, \varphi _{ie}/p_{ie}, \psi _{ii^{\prime }e}/r_{ii^{\prime }e}] \end{aligned}$$

in \(\pi _{n}\) satisfies \(T \not \models I\). Restricting O(1)-elements more if necessary, we may assume that actually \(br_{0}(T_{I}) = br(T_{I})\) holds.

Clearly, \(m = n^{O(1)}\). For \(i \in [m+1]\) and \(j \in [m]\),

$$\begin{aligned} f_{ij} := \sum _{b \in br_{1}(T_{\sigma _{ij}})}\prod _{e \in b} x_{e}. \end{aligned}$$

We prove that these polynomials satisfy the required properties.

First, fix \(i<i^{\prime } \in [m+1]\). We construct a NS-proof of \(\sum _{j=1}^{m}f_{ij}f_{i^{\prime }j}=0\) over \(\mathbb {F}_{2}\) from the system \(\lnot Count^{3}_{n}\) For each \(j \in [m]\), construct \(U^{1}_{j}\) (\(j \in [m]\)) as follows:

$$\begin{aligned} U^{1}_{j}:=T_{\sigma _{ij}}*\sum _{b \in br_{1}(T_{\sigma _{ij}})} (T_{\sigma _{i^{\prime }j}}*\sum _{b^{\prime } \in br_{1}(T_{\sigma _{i^{\prime }j}})} (T_{i,i^{\prime },j})^{b^{\prime }})^{b}, \end{aligned}$$

where

$$\begin{aligned} T_{i,i^{\prime },j}:= T_{\lnot \sigma _{ij} \vee \lnot \sigma _{i^{\prime }j} \vee \bigvee _{e: j \in e \in [m]^{(2)}} \psi _{ii^{\prime }e}}. \end{aligned}$$

Now, define \(U_{j}\) (\(j \in [m]\)) as follows:

  • Consider each branch \(r\in br(U^{1}_{j})\) of the form \(r=b(b^{\prime })^{b}d^{bb^{\prime }}\), where

    $$\begin{aligned} b \in br_{1}(T_{\sigma _{ij}}), b^{\prime } \in br_{1}(T_{\sigma _{i^{\prime }j}}), d\in br( T_{i,i^{\prime },j}). \end{aligned}$$

    Let \(S^{1}_{j}\) be the set of all branches of \(U^{1}_{j}\) of the above form. Since \(d || bb^{\prime }\) and

    $$\begin{aligned}&T \models \lnot \sigma _{ij} \vee \lnot \sigma _{i^{\prime }j} \vee \bigvee _{e: j \in e \in [m]^{(2)}} \psi _{ii^{\prime }e}, \\&T \models \lnot \psi _{ii^{\prime }e} \vee \lnot \psi _{ii^{\prime }e^{\prime }} \quad (\forall e^{\prime } \perp e) \end{aligned}$$

    There is a unique \(e_{r} \in [m]^{(2)}\) such that

    • \(j \in e_{r}\), and

    • d is an extension of some \(d^{\prime } \in br_{1}(T_{\psi _{ii^{\prime }e_{r}}})\).

  • Let \(j^{\prime }_{r,j}\) be the element of \(e_{r}\) other than j (that is, \(e_{r}=\{j, j^{\prime }_{r,j}\}\)).

  • We define \(U^{2}_{j}:= U^{1}_{j} * \sum _{r \in S^{1}_{j}} (T_{\sigma _{ij^{\prime }_{r,j}}\vee \lnot \psi _{ii^{\prime }e_{r}}}*T_{\sigma _{i^{\prime }j^{\prime }_{r,j}}\vee \lnot \psi _{ii^{\prime }e_{r}}})^{r}\).

  • Let \(S^{2}_{j} \subseteq br(U^{2}_{j})\) be the set of all branches extending some element of \(S^{1}_{j}\).

  • For each

    $$\begin{aligned} u=rs^{r} \in S^{2}_{j} \quad (r \in S^{1}_{j}, s \in br (T_{\sigma _{ij^{\prime }_{r,j}}\vee \lnot \psi _{ii^{\prime }e_{r}}}*T_{\sigma _{i^{\prime }j^{\prime }_{r,j}}\vee \lnot \psi _{ii^{\prime }e_{r}}})), \end{aligned}$$

    Let \(J^{\prime }_{u,j}:= j^{\prime }_{r,j}\).

  • We set \(U_{j}:=U^{2}_{j}*\sum _{u \in S^{2}_{j}} (U^{2}_{J^{\prime }_{u,j}})^{u}\).

For each \(j \in [m]\), let \(B_{j} \subseteq br(U_{j})\) be the set of branches extending some element of \(S^{2}_{j}\). Under \(\lnot Count^{3}_{n}\), we see the equation

$$\begin{aligned} \sum _{\alpha \in B_{j}} x_{\alpha }=f_{ij}f_{i^{\prime }j} \end{aligned}$$

has a NS-proof over \(\mathbb {F}_{2}\) with degree O(1). Therefore, we obtain a NS-proof of

$$\begin{aligned} \sum _{j \in [m]} \sum _{\alpha \in B_{j}}x_{\alpha } = \sum _{j \in [m]} f_{ij}f_{i^{\prime }j} \end{aligned}$$

over \(\mathbb {F}_{2}\) with degree O(1). The LHS is 0 based on the following reasoning: Let \(b=uv^{u} \in B_{j}\), where \(u \in S^{2}_{j}\) and \(v \in br(U^{2}_{J^{\prime }_{u,j}})\). Let \(l:=J^{\prime }_{u,j}\). It is easy to see that \(v \in S^{2}_{l}\), and \(b \in B_{l}\). Decomposing \(b=cd^{c}\), where \(c \in S_{l}\) and \(d \in S_{J^{\prime }_{c,l}}\), we obtain \(J^{\prime }_{c,l}=j\). This argument gives a natural 2-partition of the disjoint union \(\bigsqcup _{j \in [m]} B_{j}\), pairing branches giving the same partial injections.

Lastly, we prove that \(\sum _{j=1}^{m} f_{ij}=1\) (\(i \in [m+1]\)) has a NS-proof from \(\lnot Count^{3}_{n}\) with degree O(1). For \(j \in [m]\), let

$$\begin{aligned} V_{j}:= T_{\sigma _{ij}}*\sum _{b \in br_{1}(T_{\sigma _{ij}})} (T_{i,j})^{b}. \end{aligned}$$

where \(T_{i,j}:=T_{\lnot \sigma _{ij} \vee \tau _{ij} \vee \bigvee _{e: j \in e \in [n]^{(2)}} \varphi _{ie}}\).

Let \(B_{j}\) be the set of branches \(b \in br(V_{j})\) extending some \(b^{\prime } \in br_{1}(T_{\sigma _{ij}})\). Since

$$\begin{aligned}&T \models \lnot \tau _{ij} \vee \lnot \varphi _{ie} \quad (j \in e \in [n]^{(2)})\quad \text{ and }\\&T \models \lnot \varphi _{ie} \vee \lnot \varphi _{ie^{\prime }} \quad (e \perp e^{\prime }), \end{aligned}$$

each \(b \in B_{j}\) satisfies exactly one of the following:

  1. 1.

    b is an extension of some \(b^{\prime } \in br_{1}(T_{\tau _{ij}})\).

  2. 2.

    There exists a unique \(e \in [m]^{(2)}\) (\(j \in e\)) such that b is an extension of some \(b^{\prime } \in br_{1}(T_{\varphi _{ie}})\).

Define \(R_{b}\) as follows:

  1. 1.

    If the Condition 1 is satisfied,

    $$\begin{aligned} R_{b}:= T_{i}*T_{\sigma _{ij} \vee \lnot \tau _{ij}}, \end{aligned}$$

    where \(T_{i}:= T_{\bigvee _{j}\tau _{ij}}\).

  2. 2.

    If the Condition 2 is satisfied, and \(\{j,j^{\prime }\}:=e\), then

    $$\begin{aligned} R_{b}:=T_{\sigma _{ij^{\prime }} \vee \lnot \varphi _{ie}}*(V_{j^{\prime }}*T_{\sigma _{ij} \vee \lnot \varphi _{ie}}). \end{aligned}$$

Consider

$$\begin{aligned} W_{j}:=V_{j}*\sum _{b \in B_{j}}(R_{b})^{b}. \end{aligned}$$

Let \(C_{j}\) be the set of all branches \(r \in br(W_{j})\) of the form \(r=bd^{b}\), where \(b \in B_{j}\). The equation

$$\begin{aligned} \sum _{r \in C_{j}}x_{r}=f_{ij} \end{aligned}$$

has a NS-proof over \(\mathbb {F}_{2}\) from \(\lnot Count^{3}_{n}\) with degree O(1).

Now, we define \(Q_{i}\) as follows: since

$$\begin{aligned} T \models \bigvee _{j} \tau _{ij}\quad \text{ and }\quad T \models \lnot \tau _{ij} \vee \lnot \tau _{ij^{\prime }}, \end{aligned}$$

we see each branch \(b \in br(T_{i})\) has the unique \(j_{b}\) such that b is an extension of \(b^{\prime } \in br_{1}(T_{\tau _{ij_{b}}})\). We set

$$\begin{aligned} Q_{i}:= T_{i}*\sum _{b \in br(T_{i})}(T_{\sigma _{ij_{b}} \vee \lnot \tau _{ij_{b}}} *(T_{\sigma _{ij_{b}}} * T_{i,j_{b}}))^{b}. \end{aligned}$$

We already know that there exists a NS-proof of \(\sum _{\beta \in br(Q_{i})} x_{\beta }=1\) from \(\lnot Count^{3}_{n}\) with degree O(1). Observing

$$\begin{aligned} \sum _{j} \sum _{r \in C_{j}} x_{r}+ \sum _{\beta \in br(Q_{i})} x_{\beta } = 0, \end{aligned}$$
(6)

we obtain a NS-proof of \(\sum _{j=1}^{m} f_{ij}=1\) from \(\lnot Count^{3}_{n}\) with degree O(1).

The observation follows from a 2-partition of \((\bigsqcup _{j}C_{j}) \sqcup br(Q_{i})\) similar to the one appearing in the proof of item 3. To be concrete, consider the following labeling of \(r = bd^{b} \in C_{j}\) (where \(b \in B_{j}\)):

  • If b satisfies the Condition 1 in the definition of \(W_{j}\), label it by \(\{ j \}\).

  • If b satisfies the Condition 2 in the definition of \(W_{j}\), and \(\{j,j^{\prime }\}:=e\), label b by e.

To make \(W_{j}\) fully labeled, we label each \(b \in br(W_{j}) {\setminus } C_{j}\) by a symbol \(\perp \). Then we obtain:

  • For each \(j \ne j^{\prime }\), \(br_{\{ j,j^{\prime } \}} (W_{j}) = br_{\{ j,j^{\prime } \}} (W_{j^{\prime }})\).

  • \(\bigsqcup _{j \in [m]} br_{\{ j \}} (W_{j}) = br(Q_{i})\).

These give the Eq. (6). \(\square \)

5 On the strength of Fisher’s inequality

When we discuss whether the condition given in Theorem 67 actually holds or not, it is natural also to consider the \(\mathbb {K}\)-analogue of the condition, where \(\mathbb {K}\) is an arbitrary field other than \(\mathbb {F}_{2}\). The following combinatorial principle (see Remark 73 for the informal meaning) relates to a condition similar to the analog.

Definition 72

We define the \(\Sigma ^{B}_{0}\) \(\mathcal {L}^{2}_{A}\) formula FIE(nSR) as follows:

$$\begin{aligned}&FIE(n,S,R) \\&\quad :=\lnot \Bigg ( \forall i \in [n+1] \exists j \in [n] S(i,j)\\&\quad \wedge \forall i_{1}< i_{2} \in [n+1] \exists j \in [n] ((S(i_{1},j) \wedge \lnot S(i_{2},j)) \vee (\lnot S(i_{1},j) \wedge S(i_{2},j) ))\\&\quad \wedge \forall i_{1}< i_{2} \in [n+1] \forall i_{1}^{\prime }< i_{2}^{\prime } \in [n+1] \forall j \in [n]\\&\quad \quad (\lnot S(i_{1},j) \vee \lnot S(i_{2},j) \vee \exists j^{\prime } \in [n] R(i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime },j,j^{\prime }) ) \\&\quad \wedge \forall i_{1}<i_{2} \in [n+1] \forall i_{1}^{\prime }< i_{2}^{\prime } \in [n+1] \forall j^{\prime } \in [n] \\&\quad \quad (\lnot S(i_{1}^{\prime },j) \vee \lnot S(i_{2}^{\prime },j) \vee \exists j \in [n] R(i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime },j,j^{\prime })) \\&\quad \wedge \forall i_{1}<i_{2} \in [n+1] \forall i_{1}^{\prime }< i_{2}^{\prime } \in [n+1] \forall j, j^{\prime } \in [n] (\lnot R(i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime },j,j^{\prime }) \vee S(i_{1},j) )\\&\quad \wedge \forall i_{1}<i_{2} \in [n+1] \forall i_{1}^{\prime }< i_{2}^{\prime } \in [n+1] \forall j, j^{\prime } \in [n] (\lnot R(i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime },j,j^{\prime }) \vee S(i_{2},j))\\&\quad \wedge \forall i_{1}<i_{2} \in [n+1] \forall i_{1}^{\prime }< i_{2}^{\prime } \in [n+1] \forall j, j^{\prime } \in [n]( \lnot R(i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }, j,j^{\prime }) \vee S(i_{1}^{\prime },j^{\prime }) )\\&\quad \wedge \forall i_{1}<i_{2} \in [n+1] \forall i_{1}^{\prime }< i_{2}^{\prime } \in [n+1] \forall j, j^{\prime } \in [n]( \lnot R(i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime },j,j^{\prime }) \vee S(i_{2}^{\prime },j^{\prime })) \\&\quad \wedge \forall i_{1}<i_{2} \in [n+1] \forall i_{1}^{\prime }< i_{2}^{\prime } \in [n+1] \forall j \in [n]\forall j^{\prime } \ne j^{\prime \prime } \in [n]\\&\quad \quad (\lnot R(i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime },j,j^{\prime }) \vee \lnot R(i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime },j,j^{\prime \prime }) )\\&\quad \wedge \forall i_{1}<i_{2} \in [n+1] \forall i_{1}^{\prime } < i_{2}^{\prime }\in [n+1] \forall j^{\prime } \in [n]\forall j \ne \widetilde{j} \in [n] \\&\quad \quad (\lnot R(i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime },j,j^{\prime }) \vee \lnot R(i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime },\widetilde{j},j^{\prime })) \Bigg ) \end{aligned}$$

Furthermore, we define the propositional formula \(FIE_{n}\) as follows:

$$\begin{aligned} FIE_{n}:=&\lnot \Bigg ( \bigwedge _{i \in [n+1]} \bigvee _{j \in [n]} s_{ij} \\&\wedge \bigwedge _{i_{1}<i_{2} \in [n+1]} \bigvee _{j \in [n]} ((s_{i_{1}j} \wedge \lnot s_{i_{2}j}) \vee (\lnot s_{i_{1}j} \wedge s_{i_{2}j})) \\&\wedge \bigwedge _{i_{1}<i_{2} \in [n+1]} \bigwedge _{i_{1}^{\prime }< i_{2}^{\prime } \in [n+1]} \bigwedge _{j \in [n]} (\lnot s_{i_{1}j} \vee \lnot s_{i_{2}j} \vee \bigvee _{j^{\prime } \in [n]} r^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{j,j^{\prime }} )\\&\wedge \bigwedge _{i_{1}<i_{2} \in [n+1]} \bigwedge _{i_{1}^{\prime }< i_{2}^{\prime } \in [n+1]} \bigwedge _{j^{\prime } \in [n]} (\lnot s_{i_{1}^{\prime }j} \vee \lnot s_{i_{2}^{\prime }j} \vee \bigvee _{j \in [n]} r^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{j,j^{\prime }}) \\&\wedge \bigwedge _{i_{1}<i_{2} \in [n+1]} \bigwedge _{i_{1}^{\prime }< i_{2}^{\prime } \in [n+1]} \bigwedge _{j, j^{\prime } \in [n]} (\lnot r^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{j,j^{\prime }} \vee s_{i_{1}j} )\\&\wedge \bigwedge _{i_{1}<i_{2} \in [n+1]} \bigwedge _{i_{1}^{\prime }< i_{2}^{\prime } \in [n+1]} \bigwedge _{j, j^{\prime } \in [n]} (\lnot r^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{j,j^{\prime }} \vee s_{i_{2}j})\\&\wedge \bigwedge _{i_{1}<i_{2} \in [n+1]} \bigwedge _{i_{1}^{\prime }< i_{2}^{\prime } \in [n+1]} \bigwedge _{j, j^{\prime } \in [n]}( \lnot r^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{j,j^{\prime }} \vee s_{i_{1}^{\prime }j^{\prime }} )\\&\wedge \bigwedge _{i_{1}<i_{2} \in [n+1]} \bigwedge _{i_{1}^{\prime }< i_{2}^{\prime } \in [n+1]} \bigwedge _{j, j^{\prime } \in [n]}( \lnot r^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{j,j^{\prime }} \vee s_{i_{2}^{\prime }j^{\prime }}) \\&\wedge \bigwedge _{i_{1}<i_{2} \in [n+1]} \bigwedge _{i_{1}^{\prime }< i_{2}^{\prime } \in [n+1]} \bigwedge _{j \in [n]}\bigwedge _{j^{\prime } \ne j^{\prime \prime } \in [n]} (\lnot r^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{j,j^{\prime }} \vee \lnot r^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{j,j^{\prime \prime }} ) \\&\wedge \bigwedge _{i_{1}<i_{2} \in [n+1]} \bigwedge _{i_{1}^{\prime } < i_{2}^{\prime } \in [n+1]} \bigwedge _{j^{\prime } \in [n]}\bigwedge _{j \ne \widetilde{j} \in [n]} (\lnot r^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{j,j^{\prime }} \vee \lnot r^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{\widetilde{j},j^{\prime }}) \Bigg ) \end{aligned}$$

Remark 73

The above formulae are formalizations of Fisher’s inequality: there does not exist a family \(\{S_{i}\}_{i \in [n+1]}\) satisfying the following:

  • For each i, \(\emptyset \ne S_{i} \subseteq [n]\).

  • For each \(i_{1}<i_{2}\), \(S_{i_{1}} \ne S_{i_{2}}\).

  • For each \(i_{1}<i_{2}\) and \(i^{\prime }_{1}<i^{\prime }_{2}\), \(\#(S_{i_{1}} \cap S_{i_{2}})=\#(S_{i^{\prime }_{1}} \cap S_{i^{\prime }_{2}})\).

In our definition of FIE(nSR), S intuitively gives a family \(\{S_{i}\}_{i \in [n+1]}\), and R gives a family of bijections

$$ \begin{aligned} \{R^{i_{1},i_{2},i^{\prime }_{1},i^{\prime }_{2}} :S_{i_{1}} \cap S_{i_{2}} \rightarrow S_{i^{\prime }_{1}} \cap S_{i^{\prime }_{2}}\}_{i_{1}<i_{2} \& i^{\prime }_{1}<i^{\prime }_{2}}. \end{aligned}$$

Remark 74

In [9], Fisher gave a special case of the statement, and later, it was generalized to the above form by [15] and others. See introductions of [15] and [21] for historical remarks. Note that there are several versions of the statement of Fisher’s inequality. Primarily, it is often presented in a dual form, switching the roles of sets and elements. The form we adopted is following the presentation of [6]. To verify the above version, see the proof of Theorem 4 in [6].

Although Fisher’s inequality was proposed as a candidate for potentially hard tautology for the Frege system, it is provable in \({\textbf {VTC}}^{0}\). I was unaware of this fact when this paper was submitted, and I would like to thank the anonymous referee for sharing the work of [21]. Around the very time [6] was published, Woodall gave an elementary proof of Fisher’s inequality in [21], which was already sufficient to construct short Frege proofs of Fisher’s inequality. The proof can be interpreted as a proof in \({\textbf {VTC}}^{0}\) augmented with iterated multiplication of integers (or, equivalently, rationals). Recently, Jeřabek showed that \({\textbf {VTC}}^{0}\) can formalize iterated multiplication of integers and prove its basic properties [11]. Since we have nothing to add to their results, we just present the claim for reference:

Theorem 75

(Essentially by [21] and [11])

$$\begin{aligned} VTC^{0} \vdash FIE(n,S,R). \end{aligned}$$

Note that [21] takes the dual approach mentioned in Remark 74, and also we do not need the “nontriviality”-condition assumed in the article.

Now, we focus on the lower bounds of the strength of \(FIE_{n}\). It is easy to see that \(FIE_{n}\) is a generalization of the pigeonhole principle.

Proposition 76

\(V^{0}+FIE_{k} \vdash injPHP^{n+1}_{n}\). Hence, for each \(p \ge 2\),

$$\begin{aligned} V^{0} + Count^{p}_{k} \not \vdash FIE_{n}. \end{aligned}$$

Proof

Argue in \(V^{0}\). Suppose f is an injection from \([n+1]\) to [n]. We set \(S_{i}:= \{f(i)\}\) (\(i \in [n+1]\)). Then, the \(S_{i}\)’s are distinct and \(S_{i} \cap S_{j}=\emptyset \) for every \(i<j\). Hence, \(FIE_{n}\) is violated.

The latter part follows immediately from Theorem 8. \(\square \)

It is natural to ask the following:

Question 4

Which p satisfies \(V^{0}+FIE_{k} \vdash Count^{p}_{n}\)?

We conjecture that there is no such p. The following theorem gives a criterion for proving this conjecture.

Theorem 77

Let \(\mathbb {K}\) be a field. Suppose \(F_{d} + FIE_{k} \vdash _{poly(n)} Count^{p}_{n}\). Then, for large enough \(n \not \equiv 0 \pmod p\), there exist \(m =n^{O(1)}\) and families \((f_{ij})_{i \in [m+1], j \in [m]}\), \((a_{ij})_{i \in [m+1], j \in [m]}\) and \((b_{ii^{\prime }j})_{i<i^{\prime } \in [m+1], j \in [m]}\) of polynomials over \(\mathbb {K}\) such that the following equalities have NS-proofs over \(\mathbb {K}\) from \(\lnot Count^{p}_{n}\) with degree O(1):

$$ \begin{aligned} \sum _{j=1}^{m}f_{i_{1}j}f_{i_{2}j}&=\sum _{j=1}^{m}f_{i_{1}^{\prime }j}f_{i_{2}^{\prime }j} \quad&(i_{1}< i_{2}\in [m+1] \ \& \ i_{1}^{\prime }< i_{2}^{\prime } \in [m+1]),\\ a_{ij}(1-f_{ij})&=0 \quad&(i \in [m+1]),\\ \sum _{j=1}^{m}a_{ij}&=1 \quad&(i \in [m+1]),\\ b_{ii^{\prime }j}f_{ij}f_{i^{\prime }j}&=0 \quad&(i<i^{\prime } \in [m+1], \ \& \ j \in [m]),\\ b_{ii^{\prime }j}(1-f_{ij})(1-f_{i^{\prime }j})&=0 \quad&(i<i^{\prime } \in [m+1], \ \& \ j \in [m]).\\ \sum _{j=1}^{m}b_{ii^{\prime }j}&=1 \quad&(i<i^{\prime } \in [m+1]). \end{aligned}$$

Proof

For readability, we assume \(p=3\). Assume proofs \((\pi _{n})_{n \in \mathbb {N}}\) witness

$$\begin{aligned} F_{d} + FIE_{k} \vdash _{poly(n)} Count^{3}_{n}. \end{aligned}$$

We focus on \(n \not \equiv 0 \pmod 3\). Let \(\Gamma _{n}\) be the set of subformulae of \(\pi _{n}\). As the proof of Theorem 67, using the switching lemma for 3-tree and a padding argument, we may assume that there exists an O(1)-evaluation \(T^{n}\) of \(\Gamma _{n}\), and it suffices to construct \((f_{ij})\), \((a_{ij})\) and \((b_{ii^{\prime }j})\) for n. We fix a large enough \(n \not \equiv 0 \pmod 3\), and suppress scripts n of \(T^{n}\), \(\rho _{n}\), etc.

Since \(T \not \models Count^{3}_{n}\), soundness gives that some instance

$$\begin{aligned} I:= FIE_{m}[\sigma _{ij} / s_{ij}, \varphi ^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{j,j^{\prime }}/r^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{j,j^{\prime }}] \end{aligned}$$

in \(\pi _{n}\) satisfies \(T \not \models I\). With an additional restriction, we may assume that

$$\begin{aligned} br_{0}(T_{I}) = br(T_{I}). \end{aligned}$$

We obtain the following

  1. 1.

    Let \(T_{i}:=T_{\bigvee _{j \in [m]} \sigma _{ij}}\). Since

    $$\begin{aligned} T \models \bigvee _{j \in [m]} \sigma _{ij}, \end{aligned}$$

    each \(b \in br(T_{i})\) has at least one \(j_{b} \in [m]\) and \(b^{\prime } \in br_{1}(T_{\sigma _{ij_{b}}})\) such that \(b^{\prime } \subseteq b\). We relabel each branch \(b \in br(T_{i})\) with \(\langle j_{b} \rangle \) and obtain a labeled injPHP-tree \(\widetilde{T}_{i}\).

  2. 2.

    Let \(T_{i_{1}, i_{2}}:= T_{\bigvee _{j \in [m]} ((\sigma _{i_{1}j} \wedge \lnot \sigma _{i_{2}j}) \vee (\lnot \sigma _{i_{1}j} \wedge \sigma _{i_{2}j}))}\). Since

    $$\begin{aligned} T \models \bigvee _{j \in [m]} ((\sigma _{i_{1}j} \wedge \lnot \sigma _{i_{2}j}) \vee (\lnot \sigma _{i_{1}j} \wedge \sigma _{i_{2}j})), \end{aligned}$$

    each \(b \in br(T_{i_{1},i_{2}})\) has at least one \(j_{b}\) satisfying one of the following

    1. (a)

      For all \(b^{\prime } \in br_{0}(T_{\sigma _{i_{1}j_{b}}})\cup br_{1}(T_{\sigma _{i_{2}j_{b}}})\), \(b\perp b^{\prime }\)

    2. (b)

      For all \(b^{\prime } \in br_{1}(T_{\sigma _{i_{1}j_{b}}})\cup br_{0}(T_{\sigma _{i_{2}j_{b}}})\), \(b \perp b^{\prime }\)

    We relabel each branch \(b \in br(T_{i_{1}, i_{2}})\) with \(\langle j_{b} \rangle \) and obtain a labeled injPHP-tree \(\widetilde{T}_{i_{1},i_{2}}\).

  3. 3.

    Let \( T^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{1,j}:= T_{ \lnot \sigma _{i_{1}j} \vee \lnot \sigma _{i_{2}j} \vee \bigvee _{j^{\prime } \in [m]} \varphi ^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{j,j^{\prime }}}\). Each \(b \in br(T^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{1,j} )\) is an extension of some element of \(br_{0}(T_{\sigma _{i_{1}j}}), br_{0} (T_{\sigma _{i_{2}j}}), \bigcup _{j^{\prime }}br_{1}(T_{\varphi ^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{j,j^{\prime }}})\). If b is an extension of an element of \(br_{1}(T_{\varphi ^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{j,j^{\prime }}})\), such \(j^{\prime }\) is unique.

  4. 4.

    Let \( T^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{2,j^{\prime }}:= T_{\lnot \sigma _{i_{1}^{\prime }j} \vee \lnot \sigma _{i_{2}^{\prime }j} \vee \bigvee _{j \in [m]} \varphi ^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{j,j^{\prime }}}\). Each \(b \in br(T^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{2,j^{\prime }})\) is an extension of an element of \(br_{0}(T_{\sigma _{i_{1}^{\prime }j^{\prime }}}), br_{0} (T_{\sigma _{i_{2}^{\prime }j^{\prime }}}), \bigcup _{j}br_{1}(T_{r^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{j,j^{\prime }}})\). If b is an extension of an element of \(br_{1}(T_{r^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{j,j^{\prime }}})\), such j is unique.

Now, we set

$$\begin{aligned} f_{ij}&:= \sum _{\alpha \in br_{1}(T_{\sigma _{ij}})}x_{\alpha },\\ a_{ij}&:= \sum _{\alpha \in br_{\langle j \rangle }(\widetilde{T}_{i})}x_{\alpha }\\ b_{i_{1}i_{2}j}&:=\sum _{\alpha \in br_{\langle j \rangle }(\widetilde{T}_{i_{1},i_{2}})}x_{\alpha }.\\&( i \in [m+1], j \in [m]) \end{aligned}$$

Clearly, \(m \le n^{O(1)}\).

We show that each of the following has a NS-proof from \(\lnot Count^{3}_{n}\) over \(\mathbb {K}\) with O(1)-degree

$$\begin{aligned}&\sum _{j=1}^{m} a_{ij}=1, \end{aligned}$$
(7)
$$\begin{aligned}&\sum _{j=1}^{m} b_{i_{1}i_{2}j} = 1, \end{aligned}$$
(8)
$$\begin{aligned}&\sum _{j=1}^{m}f_{i_{1}j}f_{i_{2}j} = \sum _{j=1}^{m} f_{i_{1}^{\prime }j}f_{i_{2}j^{\prime }}, \end{aligned}$$
(9)
$$\begin{aligned}&a_{ij}(1-f_{ij})=0, \end{aligned}$$
(10)
$$\begin{aligned}&b_{i_{1}i_{2}j}f_{i_{1}j}f_{i_{2}j}=0 ,\end{aligned}$$
(11)
$$\begin{aligned}&b_{i_{1}i_{2}j}(1-f_{i_{1}j})(1-f_{i_{2}j})=0 . \end{aligned}$$
(12)

\( (i, i_{1}, i_{2}, i^{\prime }_{1}, i^{\prime }_{2} \in [m+1] \& i_{1}< i_{2} \& i^{\prime }_{1}<i^{\prime }_{2} \& j \in [m] )\)

  • (7): Since the left-hand side is the sum of all branches of the 3-partition tree \(\widetilde{T}_{i}\).

  • (8): Since the left-hand side is the sum of all branches of the 3-partition tree \(\widetilde{T}_{i_{1},i_{2}}\)

  • (9): We first define \(A_{i_{1},i_{2},j}:= T_{\sigma _{i_{1}j}} * T_{\sigma _{i_{2}j}}\) (\(i_{1}, i_{2} \in [m+1], j \in [m], i_{1}<i_{2}\)). Let \(B_{i_{1},i_{2},j}\) be the set of all branches \(b \in A_{i_{1},i_{2},j}\) having the form

    $$\begin{aligned} b=cd^{c} \quad (c \in br_{1}(T_{\sigma _{i_{1}j}}), d \in br_{1}(T_{\sigma _{i_{2}j}})). \end{aligned}$$

    It is easy to construct a NS-proof of

    $$\begin{aligned} f_{i_{1}j}f_{i_{2}j} = \sum _{b \in B_{i_{1},i_{2},j}} x_{b} \end{aligned}$$
    (13)

    from \(\lnot Count^{3}_{n}\) over \(\mathbb {K}\) with degree O(1).

    Now, fix \(i_{1}, i_{2}, i^{\prime }_{1}, i^{\prime }_{2} \in [m+1]\) such that \(i_{1}< i_{2}\) and \(i^{\prime }_{1}<i^{\prime }_{2}\). For each \(j \in [m]\), consider the trees

    $$\begin{aligned} R_{j}&:= A_{i_{1},i_{2},j}*\sum _{b \in B_{i_{1},i_{2},j}}(T^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{1,j})^{b}\\ R^{\prime }_{j}&:=A_{i^{\prime }_{1},i^{\prime }_{2},j}*\sum _{b \in B_{i^{\prime }_{1},i^{\prime }_{2},j}}(T^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{2,j})^{b}. \end{aligned}$$

    For each \(r=bd^{b}\) (\(b \in B_{i_{1},i_{2},j}\), \(d \in br(T^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{1,j})\)), since d||b, there exists a unique \(j^{\prime }_{r}\) such that d is an extension of some \(c \in br_{1}(T_{\varphi ^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{j,j^{\prime }_{r}}})\). Let \(B_{j} \subseteq br(R_{j})\) be the set of all branches having the above form.

    Similarly, for each \(r^{\prime }=bd^{b}\) (\(b \in B_{i^{\prime }_{1},i^{\prime }_{2},j}\), \(d \in br(T^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{2,j})\)), since d||b, there exists a unique \(\widehat{j}_{r^{\prime }}\) such that d is an extension of some \(c \in br_{1}(T_{\varphi ^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{\widehat{j}_{r^{\prime }},j}})\). Let \(B^{\prime }_{j} \subseteq br(R^{\prime }_{j})\) be the set of all branches having the above form.

       Now, we define

    $$\begin{aligned} T_{j,j^{\prime }}:= (((T_{\lnot r^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{j,j^{\prime }} \vee s_{i_{1}j}} * T_{\lnot r^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{j,j^{\prime }} \vee s_{i_{2}j} })* T_{\lnot r^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{j,j^{\prime }} \vee s_{i^{\prime }_{1}j^{\prime }} })* T_{\lnot r^{i_{1},i_{2},i_{1}^{\prime }, i_{2}^{\prime }}_{j,j^{\prime }} \vee s_{i^{\prime }_{2}j^{\prime }} } ). \end{aligned}$$

    for each \(j\ne j^{\prime } \in [m]\). Using these trees, we define

    $$\begin{aligned} S_{j}&:= R_{j} * \sum _{r \in B_{j}} (T_{j,j^{\prime }_{r}} * \sum _{t \in br(T_{j,j^{\prime }_{r}})} (R^{\prime }_{j^{\prime }_{r}})^{t})^{r},\\ S^{\prime }_{j}&:= R^{\prime }_{j} * \sum _{r^{\prime } \in B^{\prime }_{j}} (T_{\widehat{j}_{r^{\prime }},j}*\sum _{t \in br(T_{\widehat{j}_{r^{\prime }},j})} (R_{\widehat{j}_{r^{\prime }}})^{t} )^{r^{\prime }}. \end{aligned}$$

    Label each branch \(b \in br(S_{j})\) as follows:

    • If b extends some \(r \in B_{j}\), then label b with \(\langle j,j^{\prime }_{r} \rangle \).

    • Otherwise, label b with the symbol \(\perp \).

    Similarly, we label each branch \(b^{\prime } \in br(S^{\prime }_{j})\) as follows:

    • If b extends some \(r^{\prime } \in B^{\prime }_{j}\), then label b with \(\langle \widehat{j}_{r^{\prime }}, j\rangle \).

    • Otherwise, label b with the symbol \(\perp \).

    It is easy to see that for each \(j,j^{\prime }\), \(br_{\langle j,j^{\prime }\rangle } (S_{j})= br_{\langle j,j^{\prime } \rangle }(S^{\prime }_{j^{\prime }})\). Hence,

    $$\begin{aligned} \sum _{j, j^{\prime } \in [m]} \sum _{\alpha \in br_{\langle j,j^{\prime } \rangle }(S_{j})} x_{\alpha } =\sum _{j, j^{\prime } \in [m]} \sum _{\beta \in br_{\langle j,j^{\prime } \rangle }(S^{\prime }_{j^{\prime }})} x_{\beta }. \end{aligned}$$

    Furthermore, it is easy to see that the following have NS-proofs from \(\lnot Count^{3}_{n}\) over \(\mathbb {K}\) with O(1)-degree:

    $$\begin{aligned}&\sum _{ j^{\prime } \in [m]} \sum _{\alpha \in br_{\langle j,j^{\prime } \rangle }(S_{j})} x_{\alpha } = \sum _{b \in B_{i_{1},i_{2},j}} x_{b} \quad (j \in [m]),\\&\sum _{j \in [m]} \sum _{\beta \in br_{\langle j,j^{\prime } \rangle }(S^{\prime }_{j^{\prime }})} x_{\beta } = \sum _{b \in B_{i^{\prime }_{1},i^{\prime }_{2},j^{\prime }}} x_{b} \quad (j^{\prime } \in [m]). \end{aligned}$$

    Hence, combined with (13), they give a NS-proof of \(\sum _{j} f_{i_{1}j}f_{i_{2}j} = \sum _{j^{\prime }}f_{i^{\prime }_{1}}f_{i^{\prime }_{2}}\) satisfying the required conditions.

  • (10): It follows similarly as (12) below.

  • (11): \(b_{i_{1}i_{2}j}f_{i_{1}j}f_{i_{2}j}=0\) follows easily from \(\lnot Count^{3}_{n}\) since each \(\alpha \in br_{\langle j \rangle } (\widetilde{T}_{i_{1},i_{2}})\) satisfies \(\alpha \perp b\) for all \(b \in B_{i_{1},i_{2},j}\).

  • (12): Note that we have NS-proofs of the following:

    $$\begin{aligned}&f_{i_{1}j}+\sum _{\beta \in br_{0}(T_{\sigma _{i_{1}j}})} x_{\beta }=1,\\&f_{i_{2}j}+\sum _{\beta \in br_{0}(T_{\sigma _{i_{2}j}})} x_{\beta }=1.\\&(j \in [m]) \end{aligned}$$

    Hence, \(b_{i_{1}i_{2}j}(1-f_{i_{1}j})(1-f_{i_{2}j})=0\) follows easily from \(\lnot Count^{3}_{n}\) by a similar reason as the previous item.

\(\square \)