1 Introduction

Graph pebbling is a game on graphs. It was suggested by Saks and Lagarias to solve a number theoretic problem, which was done by Chung [4]. The main framework is the following: a distribution of pebbles is placed on the vertices of a simple graph. A pebbling move removes two pebbles from a vertex and places one pebble on an adjacent vertex. The goal is to reach any specified vertex by a sequence of pebbling moves. This may be viewed as a transportation problem on a graph where the cost of a move is one pebble. We begin with some notation needed to state our results.

Let G be a simple graph. We denote the vertex and edge set of G by V(G) and E(G), respectively. A pebble distributionP is a function from V(G) to the nonnegative integers. We say that G has P(v) pebbles placed at the vertex v under the distribution P. We say that a vertex v is occupied if \(P(v)>0\) and unoccupied otherwise. The size of a pebble distribution P, denoted \(\left|{P}\right|\), is the total number of pebbles placed on the vertices of G.

Let u be a vertex with at least two pebbles under P, and let v be a neighbor of u. A pebbling move from u to v consists of removing two pebbles from u and adding one pebble to v. That is, a pebbling move yields a new pebble distribution \(P'\) with \(P'(u) = P(u)-2\) and \(P'(v) = P(v) + 1\). A sequence of pebbling moves is executable under a pebble distribution if we can apply its moves one after the another so that the number of pebbles is nonnegative after each move on any vertex. We call such a sequence a pebbling sequence. We say that a vertex v is k-reachable under the distribution P if we can obtain, after a pebbling sequence, a distribution with at least k pebbles on v. If \(k=1\) we say simply that v is reachable under P. More generally, a set of vertices S is k-reachable under the distribution P if, after a pebbling sequence, we can obtain a distribution with at least a total of k pebbles on the vertices in S.

A pebble distribution P on G is solvable if all vertices of G are reachable under P. A pebble distribution on G is optimal if it is solvable and its size is minimal among all of the solvable distributions of G. Note that optimal distributions are usually not unique.

The optimal pebbling number of G, denoted by \(\pi _{{{\,\mathrm{opt}\,}}}(G)\), is the size of an optimal pebble distribution. In general, the decision problem for this graph parameter is NP-complete [5].

We denote with \(P_n\) and \(C_n\) the path and cycle on n vertices, respectively. The Cartesian product\(G \square H\) of graphs G and H is defined in the following way: \(V(G \square H) = V(G)\times V(H)\) and \(\{ (g_{1} ,h_{1} ),(g_{2} ,h_{2} )\} \in E(G\square H)\) if and only if \(\{g_1,g_2\}\in E(G)\) and \(h_1=h_2\) or \(\{h_1,h_2\}\in E(H)\) and \(g_1=g_2\).

Let u and v be vertices of graph G. The distance between v and u, namely the number of edges contained in the shortest path between u and v, is denoted by d(vu). The distance k neighborhood of v contains the vertices whose distance from v is exactly k. We denote this set with \(N_k(v)\).

The optimal pebbling number is known for several graphs including paths, cycles [1, 6, 7], caterpillars [8] and m-ary trees [9]. The optimal pebbling number of grids has also been investigated. Exact values were proved for \(P_n \square P_2\) [1] and \(P_n \square P_3\) [2]. The question for bigger grids is still open. The best known upper bound for the square grid can be found in [3]. Diagonal induced subgraphs of the square grid was studied in [10].

Instead of the square grid on the plane it is easier to work with the square grid on the torus. As the plane grid is a subgraph of this, any lower bound on the torus grid will also give a lower bound on the plane grid as well. It is well known that the torus grid is a vertex transitive graph, i.e. given any two vertices \(v_1\) and \(v_2\) of G, there is some automorphism \(f:V(G)\rightarrow V(G)\) such that \(f(v_{1})=v_{2}\). Some of our statements will be stated for all vertex transitive graphs.

In this paper we present a new method giving a lower bound on the optimal pebbling number. We obtain \(\frac{2}{13}V(G)\) as a lower bound for the optimal pebbling number of the square grid, which is better than the previously known bounds.Please provide the complete details for reference [11].

In Sect. 2 we show that the concept of excess—introduced in [2]—can be used to improve the fractional lower bound on the optimal pebbling number. The higher the total excess, the better the obtained bound on the optimal pebbling number is. The problem is that this method is not standalone, because excess can be zero and zero excess does not give us any improvement. Therefore the main objective of the rest of the paper is to give a lower bound on the excess using some other pebbling tools.

In Sect. 3 we study the concept of cooperation. Cooperation is the phenomenon which makes pebbling hard. We show there, that if cooperation can be bounded from above, then we can state a lower bound on the optimal pebbling number. We invent the tool called cooperation excess, which is a mixture of cooperation and excess. In this section we state and prove several small claims which will be required later to prove Lemma 4.1. This lemma is the essence of our work. It shows that if the total excess is small, then there is not much cooperation and if cooperation is huge, then the total excess is also large. Therefore in each case one of our two lower bounds works well.

Unfortunately, the proof of Lemma 4.1 is quite complicated. The third part of Sect. 3 and the whole Sect. 4 contain the parts of this proof. In Sect. 5 we show a general method which can be used to give a lower bound on the optimal pebbling number. This method relies on Lemma 4.1. Using this method we show that \(\pi _{{{\,\mathrm{opt}\,}}}(P_n\square P_m)\ge \frac{2}{13}nm\). We also present a new proof for \(\pi _{{{\,\mathrm{opt}\,}}}(P_n)=\pi _{{{\,\mathrm{opt}\,}}}(C_n)=\left\lceil \frac{2n}{3}\right\rceil\).

2 Improving the Fractional Lower Bound

The optimal pebbling number problem can be formulated as the following integer programming problem [11], where \(v_1, v_2, \ldots , v_n\) are the vertices of the given graph:

$$\begin{aligned}&P(v_i)+\sum _{x\in N(v_i)}(p_i(x,v_i)-2p_i(v_i,x))\ge 1 \ \forall i\in \{1,2,\ldots n\} \\&P(v_j)+\sum _{x\in N(v_j)}(p_i(x,v_j)-2p_i(v_j,x))\ge 0 \ \forall i,j\in \{1,2,\ldots n\} \\&P(v_i)\ge 0 \text { integer} \ \forall i\in \{1,2,\ldots n\} \\&p_i(v_j,v_k)\ge 0 \text { integer} \ \forall i,j,k\in \{1,2,\ldots n\} \\&\min \sum _{v\in V(G)}P(v) \end{aligned}$$

Its fractional relaxation can be solved efficiently, and its solution is called the fractional optimal pebbling number, which gives a lower bound on the optimal pebbling number. Originally it was defined in a bit different way, but this is an equivalent definition. You can find the details of fractional pebbling in [11].

Notice that some vertices must be 2-reachable in a solvable distribution if there is an unoccupied vertex. Optimal distributions usually contain many unoccupied and several 2-reachable vertices. However, in some sense, 2, 3, or more reachability wastes the effect of pebbles. Also 3-reachability induces larger waste than 2-reachability. In order to measure this waste we use the notion called excess, which was introduced in [2].

Definition

Let \({{\,\mathrm{Reach}\,}}(P,v)\) be the greatest integer k such that v is k-reachable under P. The excess of v under P is \({{\,\mathrm{Reach}\,}}(P,v)-1\) if v is reachable and zero otherwise. It is denoted by \({{\,\mathrm{Exc}\,}}(P,v)\).

We are interested in the total amount of waste, therefore we define the notation of total excess of P, which is \({{\,\mathrm{TE}\,}}(P)=\sum _{v\in {{\,\mathrm{V}\,}}(G)}{{\,\mathrm{Exc}\,}}(P,v)\).

Definition

An effect of a pebble placed at v is the following:

$$\begin{aligned} {{\,\mathrm{ef}\,}}(v)=\sum _{i=0}^{{{\,\mathrm{diam}\,}}(G)}\left( \frac{1}{2}\right) ^i|N_i(v)| \end{aligned}$$

.

Herscovici et al. proved that the fractional optimal pebbling number of a vertex-transitive graph is \(|V(G)|/{{\,\mathrm{ef}\,}}(v)\), therefore it is a lower bound on the optimal pebbling number. The corollary of the next theorem improves this bound.

Theorem 2.1

IfPis a solvable distribution onG, then

$$\begin{aligned} \sum _{v\in V(G)}{{\,\mathrm{ef}\,}}(v)P(v)\ge |V(G)|+{{\,\mathrm{TE}\,}}(P). \end{aligned}$$

Proof

It is clear that if a vertex u is k-reachable under P, then it is mandatory that \(\sum _{v\in V(G)}\left( \frac{1}{2}\right) ^{{{\,\mathrm{d}\,}}(v,u)}P(v)\ge k\). Summing these inequalities for all the vertices, we have that

$$\begin{aligned} \sum _{u\in V(G)}\sum _{v\in V(G)}\left( \frac{1}{2}\right) ^{{{\,\mathrm{d}\,}}(v,u)}P(v)\ge \sum _{u\in V(G)}{{\,\mathrm{Reach}\,}}(P,u). \end{aligned}$$

Exchange the summations on the left side and use the fact that P is solvable on the right side, to obtain that

$$\begin{aligned} \sum _{v\in V(G)}\sum _{u\in V(G)}\left( \frac{1}{2}\right) ^{{{\,\mathrm{d}\,}}(v,u)}P(v)\ge \sum _{u\in V(G)}(1+{{\,\mathrm{Exc}\,}}(P,u)). \end{aligned}$$

Group the elements of the second sum according to the distance i neighborhoods, to acquire that

$$\begin{aligned} \sum _{v\in V(G)}\sum _{i=0}^{{{\,\mathrm{diam}\,}}(G)}\left( \frac{1}{2}\right) ^i|N_i(v)|P(v)\ge |V(G)|+{{\,\mathrm{TE}\,}}(P). \end{aligned}$$

\(\square\)

Corollary 2.2

IfPis a solvable distribution on a vertex-transitive graphG, then

$$\begin{aligned} |P|\ge \frac{|V(G)|+{{\,\mathrm{TE}\,}}(P)}{{{\,\mathrm{ef}\,}}(v)}. \end{aligned}$$

Naturally, this bound is useless without a proper estimate of total excess. To say something useful about it we look at the optimal pebbling problem from a different angle.

3 Cooperation Between Distributions

In this section we talk about cooperation, which makes pebbling hard.

3.1 Pebbling Cooperation

Definition

Let P and Q be pebble distributions on graph G. Now \(P+Q\) is the unique pebble distribution on G which satisfies \((P+Q)(v)=P(v)+Q(v)\). P and Q are disjoint when no vertex has pebbles under both distributions.

Definition

The coverage of a distribution P is the set of vertices which are reachable under P. We denote the size of this set with \({{\,\mathrm{Cov}\,}}(P)\).

A natural idea to find small solvable distributions is finding a distribution with small size and huge coverage and make it solvable by placing some more pebbles.

In the rest of the section we assume that we add disjoint distributions P and Q together. We would like to establish an upper bound using \({{\,\mathrm{Cov}\,}}(P)+{{\,\mathrm{Cov}\,}}(Q)\) on \({{\,\mathrm{Cov}\,}}(P+Q)\) . Similarly, we are interested in some relation between \({{\,\mathrm{TE}\,}}(P+Q)\) and \({{\,\mathrm{TE}\,}}(P)+{{\,\mathrm{TE}\,}}(Q)\).

Definition

A cooperation vertex is neither reachable under P nor Q, but it is reachable under \(P+Q\). We denote the number of such vertices with \({{\,\mathrm{Coop}\,}}(P,Q)\). A double covered vertex is reachable under both P and Q, we denote the size of their set with \({{\,\mathrm{DC}\,}}(P,Q)\).

The following claim is a trivial consequence of the definitions.

Claim 3.1

\({{\,\mathrm{Cov}\,}}(P+Q)= {{\,\mathrm{Cov}\,}}(P)+ {{\,\mathrm{Cov}\,}}(Q)+{{\,\mathrm{Coop}\,}}(P,Q)-{{\,\mathrm{DC}\,}}(P,Q)\).

Definition

We say that a distribution U is a unit, if only one vertex has pebbles under U.

Units are the building blocks of pebble distributions in the following sense: Any distribution P can be written as \(\sum _{u|P(u)>0}P_u\), where \(P_u\) is a unit having P(u) pebbles at u. Units have two main advantages over other distributions. Their coverage and total excess can be easily calculated:

Claim 3.2

LetUbe a unit distribution which places pebbles at vertexu. Then we have that

$$\begin{aligned} {{\,\mathrm{Cov}\,}}(U)= & {} \sum _{i=0}^{\left\lfloor \log _2(P(u))\right\rfloor }|N_i(u)|, \\ {{\,\mathrm{TE}\,}}(U)= & {} \sum _{i=0}^{\left\lfloor \log _2(P(u))\right\rfloor }|N_i(u)|\left( \left\lfloor \frac{P(u)}{2^i}\right\rfloor -1\right) . \end{aligned}$$

3.2 Combining Cooperation and Excess

We would like to distinguish the sources of excess. Does it come from P or Q or does it arise from the “cooperation of P and Q”?

Definition

The unit excess of P, denoted by \({{\,\mathrm{UE}\,}}(P)\), is \(\sum _{u|P(u)>0}(TE(P_u))\), where \(P_u\) is a unit on u containing exactly P(u) pebbles and all of them are placed at u.

Definition

The cooperation excess of a vertex v is \({{\,\mathrm{Exc}\,}}(P+Q,v)-({{\,\mathrm{Exc}\,}}(P,v)+{{\,\mathrm{Exc}\,}}(Q,v))\). If it is positive, then we say that v has cooperation excess.

Similarly, the cooperation excess between P and Q is the total excess of \(P+Q\) minus the total excesses of P and Q. Denote this with CE(PQ).

We have mentioned previously, that we can split any pebbling distribution into disjoint unit distributions. If we get t unit distributions, then the application of Claim 3.1 and the definition of cooperation excess gives the following results.

Claim 3.3

LetPbe a pebble distribution onGand let\({\mathcal {D}}\)be a disjoint decomposition ofPto unit distributions. Denote the elements of\({\mathcal {D}}\) with \(U_1, U_2, \dots , U_t\). Now

$$\begin{aligned} {{\,\mathrm{TE}\,}}(P)= & {} \sum _{i=1}^t{{\,\mathrm{TE}\,}}(U_i)+\sum _{i=1}^t{{\,\mathrm{CE}\,}}\left( \sum _{k=1}^{i-1}U_k,U_i\right) , \end{aligned}$$
(3.1)
$$\begin{aligned} {{\,\mathrm{Cov}\,}}(P)= & {} \sum _{i=1}^t{{\,\mathrm{Cov}\,}}(U_i)+\sum _{i=1}^t\left( {{\,\mathrm{Coop}\,}}\left( \sum _{k=1}^{i-1}U_k,U_i\right) -{{\,\mathrm{DC}\,}}\left( \sum _{k=1}^{i-1}U_k,U_i\right) \right) . \end{aligned}$$
(3.2)

Both \(\sum _{i=1}^t{{\,\mathrm{TE}\,}}(U_i)\) and \(\sum _{i=1}^t{{\,\mathrm{Cov}\,}}(U_i)\) can be calculated easily. The “effect” of cooperation is calculated in the other, more complicated terms. Lemma 4.1 is going to establish a connection between those quantities in a fruitful way.

3.3 Connection Between Cooperation and Excess

Now let us consider an arbitrary graph G, and let \(\varDelta\) be the maximum degree of G. In the rest of the section we assume that \(Q=U\) is a unit having pebbles only at vertex u and its size is not zero. Now we state some basic claims about the recently defined objects.

Claim 3.4

Each cooperation vertex c has a neighbor that has cooperation excess.

Proof

A cooperation vertex c is not reachable under P or U. Therefore none of its neighbors is 2-reachable under these distributions. On the other hand, c is reachable under \(P+U\), hence there is a neighbor n of c which is 2-reachable under this distribution. This means that n has cooperation excess.\(\square\)

Definition

If a vertex is not a cooperation vertex and it does not have cooperation excess, then we call it cooperation free.

This name is a somewhat misleading, because these vertices can participate in cooperation in a sophisticated way. For an example see Fig. 1.

Fig. 1
figure 1

Vertices x and y are both cooperation free, but v has cooperation excess and w is a cooperation vertex

Definition

Let \(\sigma\) be a pebbling sequence. \(P_\sigma\) denotes the pebble distribution which is obtained by the application of \(\sigma\) to distribution P. A vertex is utilized by a pebbling sequence if there is a move in the sequence which removes or adds a pebble to the vertex. Let M(v) be the minimal number of cooperation vertices, including v if it is a cooperation vertex, which are utilized by a pebbling sequence \(\sigma\) which satisfies that \((P+U)_\sigma (v)\ge 2\). If v is not 2-reachable under \(P+U\), then we say that \(M(v)=\infty\).

An example where M() is shown for each vertex can be seen in Fig. 2.

Fig. 2
figure 2

Vertices w, x, v and z are cooperation vertices. The M values are written in brackets

Claim 3.5

If there is an available pebbling move which removes a pebble from a cooperation vertexc, then either two neighbors ofc, sayeandf, have cooperation excess at least 1 with\(M(e)<M(c)\)and\(M(f)<M(c)\)or a neighbordhas cooperation excess at least 3with\(M(d)<M(c)\).

Proof

The condition implies that c can obtain two pebbles by some pebbling moves under \(P+U\). Consider a pebbling sequence \(\sigma\) which does this by utilizing M(c) cooperation vertices. Either \(\sigma\) moves the two pebbles to c from two different neighbors e and f, or it can move both pebbles from the same neighbor d. None of the neighbors are 2-reachable under P or U, but e, f and d has to be 2, 2 and 4 reachable under \(P+U\), respectively. This means that e and f have cooperation excess at least 1 and the cooperation excess of d is at least 3. Furthermore, \(\sigma\) moves two pebbles to e and f or to d, then it moves them to c with some more moves. This shows that \(M(e),M(f),M(d)<M(c)\).\(\square\)

Claim 3.6

If the cooperation excess of a vertexvis at least 3 and one of its neighbors, sayc, is a cooperation vertex, then there is a vertexwthat is adjacent tovand\(M(w)\le M(v)\).

Proof

Note that v does not have two pebbles under \(P+U\), otherwise c can not be a cooperation vertex. Vertex v obtains pebbles from its neighbors, so one of them, say w, can get two pebbles by utilizing at most M(v) cooperation vertices. If v is a cooperation vertex, then a pebbling sequence resulting in two pebbles at v utilizes more cooperation vertices than the sequence which does not make the final move from w to v.\(\square\)

Claim 3.7

If a vertexvhas cooperation excess, then it has a neighbor which has cooperation excess or reachable underPorU.

Proof

Vertex v gets a pebble under \(P+U\), so a neighbor n is 2-reachable under \(P+U\). If n is not 2-reachable under P or U, then it has cooperation excess.\(\square\)

Remark

In fact, a stronger property holds. If a vertex v gains an extra pebble by cooperation, then it can happen in two ways: A neighbor gained extra pebbles and it passes one of them. Or there are two or more neighbors of v such that each of them can give some pebbles to v, but these moves somehow blocks each other. The advantage of the cooperation is that some previously blocked moves can be done simultaneously. This is the way how cooperation free vertices can “help cooperation”.

3.4 Trajectories

Here we introduce a visualization of pebbling sequences, which is slightly different from the signature digraph used in several pebbling papers (i.e. in [5]).

Definition

The trajectory of a pebbling sequence \(\sigma\), denoted by \(T(\sigma )\), is a digraph on the vertices of G without parallel edges, where (uv) is a directed edge if and only if a pebbling move \(u\rightarrow v\) is contained in the sequence.

Definition

The size of a pebbling sequence is the total number of moves contained in it. We say that \(\sigma\) is a minimal pebbling sequence with property p if its size is minimal among all pebbling sequences having property p.

In the next proof we need a lemma which is frequently used to solve pebbling problems. It is called No-Cycle Lemma and proved in several papers [5, 12, 13]. We state this lemma in the language of this paper.

Lemma 3.8

(No-Cycle [13]) LetPbe a pebble distribution on graphG, and\(\sigma\)be a pebbling sequence. There is a subsequence\(\delta\)whose trajectory does not contain directed cycles and\(P_{\sigma }(v)\le ~P_{\delta }(v)\)for each vertexv.

This implies the following corollary:

Corollary 3.9

If\(\sigma\)is a minimal pebbling sequence which movesmpebbles to a vertexv, then its trajectory is acyclic.

Claim 3.10

Ifuhas cooperation excess under\(P+U\), where\(|U|>0\), thenuis double covered.

Proof

The No-Cycle lemma yields that we can move the maximum possible number of pebbles to u without removing a pebble from u. We can move \({{\,\mathrm{Reach}\,}}(U,u)+1\) pebbles to u, which means that we move here a pebble of P while we keep the pebbles of U, so u is double covered.\(\square\)

The following definition will be crucial in the proof.

Definition

We say that a path is a coopexcess path, if each inner vertex of the path has cooperation excess.

Lemma 3.11

Letvbe a vertex which is not double covered but it has cooperation excess. There is a coopexcess path betweenvand a double covered vertex or there are at least two cooperation free vertices such that each of them is connected tovby a coopexcess path. Ifvis not 2-reachable under bothPandU, then these paths does not contain a vertex whoseMvalue is higher thanM(v).

An example for the first case is shown in Fig. 2 where y is a double covered vertex and vxy is a coopexcess path. The second case can be seen in Fig. 1, where vx and vy are coopexcess paths connecting v to cooperation free vertices.

Proof

Consider a pebbling sequence \(\sigma\) moving \({{\,\mathrm{Reach}\,}}(U,v)+{{\,\mathrm{Reach}\,}}(P,v)+1\) pebbles to v utilizing M(v) cooperation vertices. Consider some path in the trajectory of \(\sigma\) connecting u to v. We can assume that the only sink in the trajectory of \(\sigma\) is v. A cooperation vertex without cooperation excess can not be the tail of an arc which is contained in the trajectory, therefore each vertex in the trajectory is either cooperation free or it has cooperation excess.

If there is a path between u and v which is contained in the trajectory such that all vertices of this path have cooperation excess, then according to Claim 3.10u is double covered and this path is a coopexcess path. If an u, v path which is included in the trajectory contains a vertex d which is double covered and each vertex between d and v has cooperation excess, then it is a coopexcess path which we are looking for. Otherwise, all of the uv paths which are contained in the trajectory contain cooperation free vertices.

In each such path let \(w_i\) denote the cooperation free vertex which is the closest vertex to v. If \(w_i\ne w_j\) exist, then we have found 2 cooperation free vertices such that each of them is connected to v by a coopexcess path.

In the remaining case there is only one such w. Either it is a cut vertex in the trajectory or \(w=u\). Let T be the set of vertices which are included in the trajectory. We divide T to three sets \({\mathcal {U}}\), \({\mathcal {V}}\) and \({\mathcal {W}}\) in the following way:

We remove w from the trajectory obtaining some components, then we place a vertex t of T to \({\mathcal {U}}\) if t is in the component containing u, similarly we place t to \({\mathcal {V}}\) if it is in the component containing v and place the remaining vertices to \({\mathcal {W}}\). Now we add w to all of these sets. Let \(\sigma _u\) be the sequence of pebbling moves containing all moves of \(\sigma\) which acts only on the vertices of \({\mathcal {U}}\). We define \(\sigma _w\) and \(\sigma _v\) similarly. The sources of the latter two sequences are only w and vertices having pebbles under P.

If there is a cooperation free vertex in \({\mathcal {V}}\) which is not w, then the closest one to v is connected to v by a coopexcess path. Hence, assume that all vertices in the trajectory of \(\delta _v\) have cooperation excess.

If w is reachable under U, then \(\sigma _w\) is empty (w is not double covered) and \((P+U)_{\sigma _u}(w)\le {{\,\mathrm{Reach}\,}}(U,w)\). Since w is cooperation free, we can replace \(\sigma _u\) with a pebbling sequence \(\delta\) which does not use any pebbles of P and \((P+U)_{\sigma _u}(w)=(P+U)_{\delta }(w)\). Therefore \(\delta \sigma _v\) is a pebbling sequence under \(P+U\) and \((P+U)_{\sigma }(v)=(P+U)_{\delta \sigma _v}(v)={{\,\mathrm{Reach}\,}}(P,v)+{{\,\mathrm{Reach}\,}}(U,v)+1\). \(\sigma _v\) must use a pebble of P to do this, otherwise \(\delta \sigma _v\) is executable under U which is a contradiction. The trajectory of \(\sigma _v\) is connected, therefore there is a vertex which is double covered, furthermore each vertex in this trajectory is connected by a coopexcess path to v, so we are done.

If w is not reachable under U, then \((P+U)_{\sigma _u\sigma _w}(w)\le {{\,\mathrm{Reach}\,}}(P,w)\). Thus, there is a minimal pebbling sequence \(\delta\) which is executable under P and \(P_\delta (w)={{\,\mathrm{Reach}\,}}(P+U,w)={{\,\mathrm{Reach}\,}}(P,w)\). Clearly \(\delta \sigma _{v}\) is not executable under distribution P or \({(P+U)_{\delta \sigma _v}(v)}<(P+U)_{\sigma }(v)\). Both cases require that \(\delta\) removes a pebble from a vertex contained in \({\mathcal {V}}\).

Let \({\mathcal {X}}\subseteq {\mathcal {V}}\) be the set of vertices from which \(\delta\) removes a pebble. \(\delta\) is executable under P so these vertices are 2-reachable under P. Consider the trajectory of \(\delta\). If any vertex x from \({\mathcal {X}}\) is connected in the trajectory with a vertex y contained in \({\mathcal {U}}\) without pass-through w, then each vertex in such a connecting path is 2-reachable under P, therefore it is cooperation free or has cooperation excess. So there is either an other cooperation free vertex connected by a coopexcess path to v, or there is a coopexcess path between v and y which is connected to u by a path in the trajectory of \(\sigma\) which does not contain w, so that path has to contain a double covered or a cooperation free vertex, which is not w.

The remaining case is when w separates all elements of \({\mathcal {X}}\) from \({\mathcal {U}}\) in the trajectory of \(\delta\).

Let \(\delta _{uw}\) be a maximal subset of \(\delta\) which is executable without using the pebbles placed at \({\mathcal {X}}\), and let \(\delta _{v}\) be the remaining subsequence. \(\delta _{uw}\sigma _v\) is not executable under \(P+U\) or \((P+U)_{\delta _{uw}\sigma _v}(v)<(P+U)_{\sigma }(v)=(P+U)_{\sigma _u\sigma _w\sigma _v}(v)\). Therefore \(\sigma _u\sigma _w\) moves more pebbles to w than \(\delta _{uw}\), but \(\delta _v\) is executable under \(P_{\sigma _u\sigma _w}\), thus \(P_{\sigma _u\sigma _w\delta _v}(w)>P_\delta (w)\), therefore w has cooperation excess.

To prove the second claim, consider the paths we have found. If they were part of the trajectory of \(\sigma\), then all of them are 2-reachable under \(P+U\), so their M value can not be higher than M(v). Otherwise, the path consists of vertices from the trajectory of \(\sigma\) and some others whose M value is zero, since they are 2-reachable under P. \(\square\)

The following claim is a trivial consequence of the definitions.

Claim 3.12

If u contains at least two pebbles and it is double covered, then one of its neighbors is also double covered.

In the rest of the section we assume, that u contains at least two pebbles, i. e. \(|U|\ge 2\). Therefore we can use the previous claim.

Lemma 3.13

Assume thatUcontains at least two pebbles. Then each double covered vertexdis connected by a coopexcess path to an other double covered vertex or a cooperation free vertex. Furthermore, each vertex of this coopexcess path is 2-reachable underU.

Proof

The previous claim handles the case when d is u, since the neighbor is connected to \(d=u\). So assume that \(d\ne u\).

Since d is double covered, it is reachable from U, so it is connected to u by a path, whose vertices are 2-reachable under U. Therefore these vertices can not be cooperation vertices. If there is a vertex on this path which does not have cooperation excess, then the vertex closest to d satisfies the conditions of the second type. Otherwise, u has cooperation excess which means that it is double covered.\(\square\)

We are getting closer to establish a connection between the number of cooperation vertices and cooperation excess.

Definition

We call a subset \({\mathcal {Q}}\) of V(G) a C-block, if

  1. (1)

    each pair of vertices in \({\mathcal {Q}}\) is connected by a coopexcess path,

  2. (2)

    it contains a vertex having cooperation excess

and it is maximal with these properties.

Notice that the intersection of two C-blocks cannot contain a vertex having cooperation excess.

Lemma 3.14

EachC-block either

  1. (3)

    contains at least two double covered vertices, or

  2. (4)

    contains one double covered vertex and one cooperation free vertex, or

  3. (5)

    contains at least two cooperation free vertices.

Proof

Consider an arbitrary element v of \({\mathcal {Q}}\) which having cooperation excess. If the C-block does not have a double covered vertex, then Lemma 3.11 guarantees that two cooperation free vertices are connected to v by a coopexcess path, which means that they are contained in \({\mathcal {Q}}\), so (5) is satisfied.

Otherwise \({\mathcal {Q}}\) contains a double covered vertex. According to Lemma 3.13, either there is an other double covered vertex in \({\mathcal {Q}}\), or a cooperation free vertex. Thus either (3) or (4) is satisfied.\(\square\)

Later we generalize the notion of C-blocks, so that we keep the properties of 3.14. The following statement will be useful for this.

Lemma 3.15

If a vertexv having cooperation excess is adjacent to a cooperation vertex c such that \(M(v)<M(c)\) , then there are vertices e and f, such that each of them is either double covered or cooperation free and they are connected to v by coopexcess paths containing only vertices whose M values are smaller than M(c).

Proof

Vertex v has a cooperation vertex neighbor, therefore v is not 2-reachable under P or U. According to Lemma 3.11 there is a double covered vertex or there are two cooperation free vertices who are connected to v by a coopexcess path containing only vertices whose M values are at most \(M(v)<M(c)\). In the latter case we are done. Since the double covered vertex is connected to an other double covered or cooperation free by a coopexcess path containing vertices whose M value is zero, according to Lemma 3.13. The concatenation of these two coopexcess paths fulfills the criteria.\(\square\)

4 Connection Between Total Cooperation Excess, Number of Cooperation Vertices and Maximum Degree

In this section we prove a crucial lemma. Unfortunately, the proof requires quite a lot of effort, including many small claims.

Lemma 4.1

Letbe an arbitrary pebble distribution onG andU be a unit having at least two pebbles, such thatdoes not contain a pebble atu. Now we have

$$\begin{aligned} {{\,\mathrm{Coop}\,}}(P,U)-{{\,\mathrm{DC}\,}}(P,U)\le (\varDelta -2){{\,\mathrm{CE}\,}}(P,U). \end{aligned}$$

This lemma gives a connection between the total cooperation, the total number of double covered vertices and total cooperation excess. The proof would be relatively easy if the effect of a pebble would appear close to the location of the pebble. The example on Fig. 3. shows, that unfortunately this is not always true.

Another difficulty arises from the fact that a cooperation vertex can have cooperation excess. For such an example see Fig. 4. To prove Lemma 4.1 we get rid of such vertices one by one.

Fig. 3
figure 3

The triangles are cooperation vertices. Notice that they can be far from the added unit

Fig. 4
figure 4

Both x and y are cooperation vertices, furthermore x has cooperation excess

So to prove the lemma we will change the graph in several steps. In the new graph it will be easier to isolate these effects.

We introduce a sequence of auxiliary graphs \(A_0, A_1,\ldots , A_k\), whose vertices are labeled with vectors of four coordinates. The first and fourth coordinate is always an integer, while the other coordinates are binary. We denote the vertices of these graphs with underlined letters and the ith coordinate of vertex \({\underline{b}}\) with \(b_i\). We encode the parameters of the investigated pebbling problem in the auxiliary graph and in the coordinates in the following way:

\(A_0\) is isomorphic to G. The first coordinate of each vector is the amount of the cooperation excess of the corresponding vertex. The second coordinate is 1 iff the corresponding vertex is a cooperation vertex. The third coordinate is 1 when the vertex is double covered. Finally, the last coordinate is M(v), i.e. the minimum number of cooperation vertices have to be utilized by a pebbling sequence to obtain 2 pebbles at v, where v is the corresponding vertex. So \(A_0\) is representation of the original configuration, the labels give the values of the various quantities that we are interested in. An example can be seen on Fig. 5.

The other graphs in the sequence \(A_1,\ldots , A_k\) will be obtained from \(A_0\) by applying certain operations recursively, until we finally obtain \(A_k\) with some useful properties. It is important to note that although the labels of \(A_0\) are obtained from the pebble distribution on G, this will not be true any more for the other auxiliary graphs. We are not trying to change the graph and the pebble distribution and then obtain the new labels from these. We just apply the transformation on the abstract, labeled graphs.

Now we translate the properties of the pebble distribution to properties of \(A_0\).

Fig. 5
figure 5

An example graph G with a pebble distribution P, a unit U and the corresponding auxiliary graph \(A_0\). \(A_1\) is obtained from \(A_0\) by using the first transformation. Note that \(A_1\) does not contain a saturated vertex

Definition

We call a path \({\mathcal {P}}\) in A an A-path, if each inner vertex \({\underline{b}}\) of \({\mathcal {P}}\) satisfies \(b_1>0\). We say that \({\mathcal {B}}\) is an A-block iff

  1. (6)

    there is a vertex \({\underline{b}}\in {\mathcal {B}}\) such that \(b_1>0\),

  2. (7)

    if \({\underline{a}},{\underline{b}}\in {\mathcal {B}}\), then there is an A-path which connects them

and \({\mathcal {B}}\) is maximal to these properties.

Note that the concept of A-path and A-block are generalizations of coopexcess path and C-block, respectively. In this language, the statement of Lemma 4.1 can be formulated as:

$$\begin{aligned} \sum _{{\underline{a}}\in A}a_2-\sum _{{\underline{a}}\in A}a_3 \le \sum _{{\underline{a}}\in A}a_1(\varDelta -2) \end{aligned}$$

We state four properties of \(A_0\) which will be inherited to later auxiliary graphs. The significant properties are the first and the last. The other two are technical ones which will help the proof of inheritance stated in Claim 4.5.

Claim 4.2

The following statements hold for\(A_0\):

  1. (8)

    If\(c_1 c_2>0\) then one of the following two cases hold:

    1. (a)

      there exist a\({\underline{d}}\) which is adjacent to\({\underline{c}}\), \(d_1\ge 3\) and\(d_4<c_4\), or

    2. (b)

      there are vertices\({\underline{e}}\) and\({\underline{f}}\)such that they are neighbors of\({\underline{c}}\), \(e_1\) and\(f_1\) are both positive,\(e_4<c_4\) and\(f_4<c_4\).

  2. (9)

    If\(a_1\ge 3\) and \({\underline{a}}\) has a neighbor\({\underline{c}}\) such that\(c_2=1\), then there is a\({\underline{b}}\), which is adjacent to\({\underline{a}}\) and\(a_4\ge b_4\).

  3. (10)

    Let\({\underline{c}}\) be a vertex whose first and second coordinates are both positive. If\({\underline{a}}\) is a neighbor of\({\underline{c}}\) such that\(a_1>0\) and\(a_4<c_4\), then there are vertices\({\underline{e}}\)and\({\underline{f}}\)such that each of them is connected to\({\underline{a}}\)byA -paths containing only vertices having their fourth coordinate smaller than\(c_4\), and their third coordinate is either 1 or the first and second coordinates of them are 0.

  4. (11)

    Each A -block contains either

    1. (a)

      two vertices with third coordinate 1, or

    2. (b)

      two vertices with first and second coordinates 0, or

    3. (c)

      one vertex with third coordinate 1 and one vertex with first and second coordinates 0.

This claim is equivalent to the following, previously proven, statements with the new notation: Claim 3.5\(\rightarrow\) (8), Claim  3.6\(\rightarrow\) (9), Lemma 3.15\(\rightarrow\) (10) and Lemma 3.14\(\rightarrow\) (11).

We will obtain \(A_i\) from \(A_{i-1}\) by applying one of two transformations. Then we repeat this until it is possible to apply at least one of the transformations. Both transformations will preserve \(\sum _{{\underline{a}}\in A}a_i\), \((i\in \{1,2,3\})\), the fourth coordinate of each vertex and \(\varDelta\), the maximum degree in the graph. The objective of the transformations is to replace vertices satisfying \(a_1 a_2>0\) (i.e. it has cooperation excess and it is a cooperation vertex) with (one ore more) vertices satisfying \(b_1 b_2=0\). From this point, we call these vertices saturated vertices. Both transformations will increase the number of vertices in the auxiliary graph.

Let \({\underline{w}}\) be a vertex where \(w_1 w_2>0\) such that its fourth coordinate is maximal among these vertices. By Claim 4.2 (8) there are two cases.

Case 1: If \({\underline{w}}\) has a neighbor \({\underline{x}}\) such that \(x_1\ge 3\) and \(w_4>x_4\), then we apply the following transformation to \(A_i\):

Transformation 1

  • Choose a neighbor \({\underline{y}}\) of \({\underline{x}}\) such that its fourth coordinate is minimal among all neighbors of \({\underline{x}}\).

  • Let R be the set of \({\underline{x}}\)’s neighbors without \({\underline{y}}\) where the product of the first and the second coordinate is positive.

  • Delete x and add three vertices \(x^1\), \(x^2\) and \(x^3\), such that \(x^1_1=x_1^3=1\) and \(x^2_1=x_1-2\). \(x^1_2=x^2_2=x^3_2=0\), \(x^1_3=x^3_3=0\) and \(x^2_3=x_3\). Connect \({\underline{x}}^2\) with \({\underline{y}}\), \({\underline{x}}^1\) and \({\underline{x}}^3\).

  • Delete each element \({\underline{r}}\) of R and add two vertices \({\underline{r}}^1\) and \({\underline{r}}^2\) and set the coordinates as: \(r_1^1=r^1_3=0\), \(r_2^1=1\), \(r^1_4=r^2_4=r_4\), \(r^2_1=r_1\), \(r^2_2=0\) and \(r^2_3=r_3\). We connect \({\underline{r}}^1\) to \({\underline{x}}^1\) and \({\underline{r}}^2\) to \({\underline{x}}^3\) and to each original neighbor of \({\underline{r}}\).

  • We connect the neighbors of \({\underline{x}}\) which are not included in \(R\cup {{\underline{y}}}\) to \({\underline{x}}^3\).

  • Set \(x^1_4=x^2_4=x^3_4=x_4\).

  • If \(x_2=1\), then add an extra vertex \(x^4\) and connect it only with \({\underline{x}}^2\). Set its vector to \((0,1,0,x_4)\).

Fig. 6
figure 6

Vertices denoted by squares are “cooperation vertices” so their second coordinates are one. Edges are shown between vertices contained in the same C-block. The upper transformation is called first, and the lower one is mentioned as the second transformation. Note that in the upper example \(R=\{w,r\}\)

In other words, this transformation replaces each saturated neighbor of \({\underline{x}}\) (excluding a chosen \({\underline{y}}\)) with two vertices such that one of them is a leaf with zero first coordinate and the other one is act as the original vertex, but its second coordinate is zero. To handle the increased degree of \({\underline{x}}\), we triple it. Also, if x is saturated then we add the additional \({\underline{x}}^4\) vertex. Note that this can be done when \(\varDelta \ge 4\). If \(\varDelta \le 3\), then we have to handle this case in a slightly different way.

Case 2: If \({\underline{w}}\) has two neighbors such that their first coordinates are positive and their fourth coordinates are strictly less than \(w_4\), then we apply the second transformation:

Transformation 2

  • We choose neighbors \({\underline{x}}\) and \({\underline{y}}\) whose fourth coordinate is minimal among all neighbors and \(x_4\ge y_4\).

  • We delete \({\underline{w}}\) and add vertices \({\underline{w}}^1\) and \({\underline{w}}^2\). We set the coordinates of these vectors as: \(w^1_1=w^1_3=0\), \(w^1_2=1\), \(w^1_4=w^2_4=w_4\), \(w^2_1=w_1\)\(w^2_2=0\) and \(w^2_3=w_3\).

  • We connect \({\underline{w}}^1\) only with \({\underline{x}}\). In contrast, we connect \({\underline{w}}^2\) with all neighbors of \({\underline{w}}\) except \({\underline{y}}\).

Both transformation can be seen on Fig. 6.

Claim 4.3

Both transformations preserve\(\sum _{{\underline{a}}\in A}\underline{a_i}\)\(i\in \{1,2,3\}\), and\(\varDelta\)if\(\varDelta \ge 4\).

Claim 4.4

Both transformations decrease the number of saturated vertices.

Claim 4.5

If the statements of Claim 4.2hold for an auxiliary graph, then they hold for the new graph obtained by applying one of the above transformations.

Proof

We say that a vertex v is created by the ith transformation if \({\underline{v}}={\underline{z}}^j\) is a vertex of \(A_i\) and z is a vertex of \(A_{i-1}\). In this situation we say that v is a descendant of z. A vertex is involved in a transformation if either it is created by that or its vector is changed by it.

Notice that the transformations keep the fourth coordinates of the vertices and if two vertices are descendants of the same vertex, then their fourth coordinates are the same.

(8): If \({\underline{c}}\) is a saturated vertex in \(A_i\), then it is not created by the ith transformation and it was saturated in \(A_{i-1}\) also. If none of its neighbors were involved in the last transformation, then the property is clearly holds. Therefore assume the opposite. Assume that \({\underline{c}}\) had a neighbor \({\underline{d}}\) in \(A_{i-1}\), such that \(d_1\ge 3\) and \(d_4<c_4\) in \(A_{i-1}\).

If \({\underline{d}}\) is contained in \(A_i\) also, then the ith transformation did not change \(d_1\). In that case \({\underline{d}}\) and \({\underline{c}}\) are adjacent in \(A_i\) and we are done.

Otherwise, the ith transformation created some descendants of \({\underline{d}}\).

If it was Transformation 1 then a descendant of \({\underline{d}}\) is connected to \({\underline{c}}\) and either its first coordinate equals \(d_1\) or it is \(d_1-2\). In the first case we are done and the latter can happen if and only if \({\underline{d}}\) acted as \({\underline{x}}\) in that transformation. However, this is not possible, because this would mean that \({\underline{c}}\) acted as \({\underline{y}}\) but \(y_4\le x_4\) by (9) and the choice of y in Transformation 1, therefore \(c_4=y_4\le x_4=d_4<c_4\) which is a contradiction.

The remaining case is that two descendants of \({\underline{d}}\) are created by Transformation 2. Since \(d_4<c_4\), vertex \({\underline{c}}\) can not be \({\underline{x}}\) or \({\underline{y}}\) in Transformation 2, therefore it is adjacent to \({\underline{d}}^2={\underline{w}}^2\) in \(A_i\) and the first coordinate of this vertex equals \(d_1\).

Now assume that there are neighbors \({\underline{e}}\) and \({\underline{f}}\) in \(A_{i-1}\) such that their fourth coordinates are smaller than \(c_4\) and \(e_1,f_1>0\). We may assume that \(e_1,f_1<3\), otherwise we obtain the previous case. Therefore neither \(e_1\) nor \(f_1\) can act as \({\underline{x}}\) in Transformation 1.

If \({\underline{e}}\) is contained in \(A_i\), then it is still adjacent to \({\underline{c}}\). If \({\underline{e}}\) is replaced with some descendants by the ith transformation, then one of its descendants keep its first coordinate and that one is connected to \({\underline{c}}\). Like in the previous case it cannot happen that \({\underline{c}}={\underline{y}}\) and \({\underline{e}}={\underline{x}}\) in Transformation 1 or \({\underline{c}}={\underline{x}}\) and \({\underline{e}}={\underline{w}}\) in Transformation 2. We can state the same for \({\underline{f}}\).

(9): If \({\underline{a}}\) is contained in \(A_{i-1}\) then \(a_1\ge 3\) in \(A_{i-1}\) also. Therefore according to (9) there is a \({\underline{b}}\) which is adjacent to \({\underline{a}}\) and \(b_4\le a_4\). Either \(b_4\) or one of its descendants is adjacent to \({\underline{a}}\) in \(A_i\), therefore we are done.

Otherwise, \({\underline{a}}\) is a descendant of a vertex \({\underline{v}}\). \({\underline{v}}\) has a neighbor \({\underline{b}}\) whose fourth coordinate is at most \(a_4\). There are several cases:

First case: \({\underline{v}}={\underline{r}}\) in Transformation 1, where \(r\in R\). If we remove \({\underline{x}}\) from the neighborhood of \({\underline{v}}\) and add \(\underline{x^3}\) we obtains the neighborhood of \({\underline{a}}\). Therefore \({\underline{a}}\) has a neighbor whose fourth coordinate is not bigger.

Second case: \({\underline{v}}={\underline{x}}\) in Transformation 1. \({\underline{y}}\) had the smallest fourth coordinate among the neighbors of \({\underline{x}}\), thus \(y_4\le x^2_4=a_4\) and \({\underline{y}}\) and \({\underline{a}}\) are adjacent in \(A_i\).

Third case: \({\underline{v}}={\underline{w}}\) in Transformation 2. Since \(a_1\ge 3\), \({\underline{a}}=\underline{w^2}\) and \(y_4\le w_4=w_4^2\) according to (9).

(10): Transformation 1 keeps the A-paths, because it keeps connectivity and the first coordinate becomes zero only at leaves. Transformation 2 destroys some A-paths but all of them contain the saturated vertex which was handled by the transformation and whose fourth coordinate was at least \(c_4\).

(11): Transformation 1 does not split an A-block, furthermore it keeps the number of vertices whose third coordinate is one and whose first and second coordinate are both zero in each A-block.

Transformation 2 either does not split an A-block and keeps the investigated quantities, or it splits an A-block to two A-blocks. But (10) guarantees that both blocks contain enough vertices whose third coordinate is one or both first and second coordinates are zero.\(\square\)

Claim 4.6

If Claim 4.2 holds for\(A_i\) and there is a saturated vertex, then at least one of the two transformations can be applied to\(A_i\).

The first proposition of Claim 4.2 guarantees this.

Corollary 4.7

There is an\(A_k\), such that there is no saturated vertex in\(A_k\), furthermore\(\sum _{{\underline{a}}\in A_0}a_i=\sum _{{\underline{a}}\in A_k}a_i\), \(i\in \{1,2,3\}\).

Lemma 4.8

If\(A_k\)does not contain any saturated vertices and Claim 4.2 holds, then for eachA-block \({\mathcal {B}}\)

$$\begin{aligned} \sum _{{\underline{a}}\in {\mathcal {B}}}a_2-\sum _{{\underline{a}}\in {\mathcal {B}}}a_3 \le \sum _{{\underline{a}}\in {\mathcal {B}}}a_1(\varDelta -2). \end{aligned}$$

Definition

Let \({\mathcal {B}}\) be an A-block in an auxiliary graph. We say that a vertex of \({\mathcal {B}}\) is inner vertex if its first coordinate is positive otherwise, it is called a boundary vertex.

Claim 4.9

Consider anA -block of an auxiliary graphA. Let the number of the boundary and the number of inner vertices denoted bybandi, respectively. If\(a_1 a_2=0\) holds for each\({\underline{a}}\in A\)then\(b\le (\varDelta -2)i+2\) is satisfied.

Proof

Proof by induction: The base case is an A-block with one inner vertex. This A-block is the closed neighborhood of the only inner vertex, therefore the number of boundary vertices is at most \(\varDelta\). Now we assume that for any \(i<k\) the inequality is true. Let \(i=k\). We take a spanning tree of the inner vertices and consider a leaf vertex \({\underline{l}}\). If we set \(l_1\) to zero, then \({\underline{l}}\) becomes a boundary vertex and at most \(\varDelta -1\) boundary vertices, which are neighbors of \({\underline{l}}\), are dropped from the A-block. The number of inner vertices is decreased by one, and the number of boundary vertices is decreased by at most \(\varDelta -2\). Using the induction hypothesis the proof is completed.\(\square\)

Proof of Lemma 4.8:

Consider an A-block \({\mathcal {B}}\) and a boundary vertex v of \({\mathcal {B}}\). Either \(v_2=1\) or \(v_1=v_2=0\). Thus we have: \(b=\sum _{{\underline{a}}\in {\mathcal {B}}}a_2+{{\,\mathrm{N({\mathcal {B}})}\,}}\) where \({{\,\mathrm{N({\mathcal {B}})}\,}}\) is the number of vertices in \({\mathcal {B}}\) whose first two coordinates are zero. It is also clear that \(\sum _{{\underline{a}}\in {\mathcal {B}}}a_1\ge i\). Combining these observations and the previous claim:

$$\begin{aligned} \sum _{{\underline{a}}\in {\mathcal {B}}}a_2+{{\,\mathrm{N({\mathcal {B}})}\,}}=b\le (\varDelta -2)i+2\le (\varDelta -2)\sum _{{\underline{a}}\in {\mathcal {B}}}a_1+2. \end{aligned}$$

Claim 4.5 implies that Claim 4.2 (11) holds for \(A_k\). Thus \(\sum _{{\underline{a}}\in {\mathcal {B}}} a_3+{{\,\mathrm{N({\mathcal {B}})}\,}}\ge 2\). Therefore

$$\begin{aligned}&\sum _{{\underline{a}}\in {\mathcal {B}}}a_2+{{\,\mathrm{N({\mathcal {B}})}\,}}\le (\varDelta -2)\sum _{{\underline{a}}\in {\mathcal {B}}}a_1+\sum _{{\underline{a}}\in {\mathcal {B}}} a_3+{{\,\mathrm{N({\mathcal {B}})}\,}}, \\&\sum _{{\underline{a}}\in {\mathcal {B}}}a_2-\sum _{{\underline{a}}\in {\mathcal {B}}} a_3\le (\varDelta -2)\sum _{{\underline{a}}\in {\mathcal {B}}}a_1. \end{aligned}$$

Proof of Lemma 4.1:

We distinguish three cases depending on \(\varDelta\).

Case 1: \(\varDelta \ge 4\)

Let \(A_k\) be the auxiliary graph which we obtained from \(A_0\) by applying transformations until it does not contain any more saturated vertices. The last lemma holds for each A-block, therefore:

$$\begin{aligned} \sum _{{\mathcal {B}}}\left( \sum _{{\underline{a}}\in {\mathcal {B}}}a_2-\sum _{{\underline{a}}\in {\mathcal {B}}} a_3\right) \le \sum _{{\mathcal {B}}}(\varDelta -2)\sum _{{\underline{a}}\in {\mathcal {B}}}a_1. \end{aligned}$$

Only boundary vertices can be included in multiple blocks, and the first and third coordinate of a boundary vertex is zero, thus:

$$\begin{aligned}\sum _{{\underline{a}}\in A_k}a_2-\sum _{{\underline{a}}\in A_k} a_3\le&\sum _{{\mathcal {B}}}\left( \sum _{{\underline{a}}\in {\mathcal {B}}}a_2-\sum _{{\underline{a}}\in {\mathcal {B}}} a_3\right) \\ \le&\sum _{{\mathcal {B}}}(\varDelta -2)\sum _{{\underline{a}}\in {\mathcal {B}}}a_1= (\varDelta -2)\sum _{{\underline{a}}\in A_k}a_1. \end{aligned}$$

Using Claim 4.3 we obtain

$$\begin{aligned} \sum _{{\underline{a}}\in A_0}a_2-\sum _{{\underline{a}}\in A_0} a_3\le (\varDelta -2)\sum _{{\underline{a}}\in A_0}a_1. \end{aligned}$$

Case 2:\(\varDelta =1,2\)

If the graph consists of multiple connected components we may restrict our attention to the component containing the unit. Let d be the number of double covered vertices. We first verify the lemma in the case \(\varDelta =1\). In this case, the graph consists of a matching and isolated vertices. Thus, we must have \({{\,\mathrm{Coop}\,}}(P,P_u) = 0\), and we must show that \(CE(P,P_u) \le d\). If the unit, u, is isolated the result is trivial. Suppose the unit is in an edge \(\{x,u\}\). If \(P(x)=0\), then \(CE(P,P_u) = d = 0\). If \(P(x)=1\), then \(CE(P,P_u)= d = 1\). Suppose that \(P(x)=a \ge 2\) and set \(\left|{P_u}\right| = b\). We have \(d=2\) and

$$\begin{aligned} CE(P,P_u)&= (a + \left\lfloor {b/2}\right\rfloor - 1 + b + \left\lfloor {a/2}\right\rfloor -1) - (a - 1 + \left\lfloor {a/2}\right\rfloor -1)\\&\quad - (b - 1 + \left\lfloor {b/2}\right\rfloor -1) = 2. \end{aligned}$$

This completes the proof in the \(\varDelta =1\) case. If \(\varDelta =2\), then we may assume that the graph is a path or a cycle. In this case we have \(\varDelta -2 = 0\) so we must show \({{\,\mathrm{Coop}\,}}(P,P_u) \le d\). However, it is easy to see that in a path or a cycle every cooperation vertex is adjacent to a double covered vertex and, moreover, that double covered vertex is on the path between the cooperation vertex and u (possibly u itself). It follows that there are at least as many double covered vertices as cooperation vertices, as desired.

Case 3: \(\varDelta =3\)

We remind the reader that the problem with the \(\varDelta =3\) case is that it is not possible to add \({\underline{x}}^4\) in transformation 1.

\({\underline{x}}^4\) is needed when \({\underline{x}}\) is saturated in \(A_{i-1}\). In that case Transformation 1 handles \({\underline{x}}\) and substitutes it with unsaturated descendants. If the degree of any of the descendants of \({\underline{x}}\) is smaller than \(\varDelta\), then we can make \(\underline{x^4}\) adjacent to this vertex and the problem is eliminated. Otherwise \({\underline{x}}\) has three neighbors: \({\underline{y}}\), \({\underline{v}}\) and \({\underline{w}}\). The one whose fourth coordinate is minimal among them is \({\underline{y}}\), also \(v_4\le w_4\) and both \({\underline{v}}\) and \({\underline{w}}\) are saturated vertices.

Now we make one of \({\underline{x}}\)’s descendants saturated. We have to make sure that (8) holds for this saturated descendant therefore we have to make a few new transformations.

Case 1: (8) (a) holds for \({\underline{x}}\) in \(A_{i-1}\). Consider vertex \({\underline{d}}\), which is a neighbor of \({\underline{x}}\) in \(A_{i-1}\), \(d_4<x_4\) and \(d_1\ge 3\). If \({\underline{d}}={\underline{y}}\), then we set \(x_2^2\) to one, otherwise we set \(x_2^3\) to one and the rest of the transformation is similar to Transformation 1.

Case 2: (8) (b) holds for \({\underline{x}}\) in \(A_{i-1}\). Then we apply the following transformation:

Transformation 3

  • Delete x and add three vertices \(x^1\), \(x^2\) and \(x^3\), such that \(x^1_1=x_1^3=1\) and \(x^2_1=x_1-2\). \(x^1_2=x^3_2=0\), \(x^2_2=1\), \(x^1_3=x^3_3=0\) and \(x^2_3=x_3\). Connect \({\underline{x}}^2\) with \({\underline{y}}\) and \({\underline{x}}^3\) and connect \(\underline{x^1}\) with \(\underline{x^3}\).

  • Delete \({\underline{v}}\) and \({\underline{w}}\) and add four vertices \({\underline{v}}^1\), \({\underline{v}}^2\), \({\underline{w}}^1\) and \({\underline{w}}^2\) and set the coordinates as: \(v_1^1=v^1_3=0\), \(v_2^1=1\), \(v^1_4=v^2_4=v_4\), \(v^2_1=v_1\), \(v^2_2=0\) and \(v^2_3=v_3\). We set the coordinates of w in the exact same way. We connect \({\underline{w}}^1\) and \({\underline{v}}^1\) to \({\underline{x}}^1\). We make \({\underline{w}}^2\) adjacent to the neighbors of \({\underline{w}}\) and to \({\underline{x}}^3\). We make \({\underline{v}}^2\) adjacent to the neighbors of \({\underline{v}}\) and to \({\underline{x}}^2\).

This transformation is shown on Fig. 7.

The new transformations are made in such a way that we immediately obtain that (8) holds for the recently created saturated vertex. The proof of Claim 4.5 can be repeated to prove that the statements of Claim 4.2 hold after we apply these recently introduced transformations. Therefore all the statements following Claim 4.2 hold in the \(\varDelta =3\) case.

Fig. 7
figure 7

Transformation 3 which is needed when \(\varDelta =3\)

5 Lower Bound on the Optimal Pebbling Number of Vertex Transitive Graphs

Theorem 5.1

LetPbe an arbitrary solvable pebble distribution onGand let\({\mathcal {D}}\) be a disjoint decomposition ofPto unit distributions. Denote the elements of\({\mathcal {D}}\) with\(U_1, U_2, \dots , U_t\), so that\(|U_i|\le |U_{i+1}|\). Now

$$\begin{aligned} \sum _{i=1}^t{{\,\mathrm{CE}\,}}\left( \sum _{k=1}^{i-1}U_k,U_i\right) \ge \frac{|V(G)|-\sum _{i=1}^t {{\,\mathrm{Cov}\,}}(U_i)}{\varDelta -2}. \end{aligned}$$

Proof

We use Claim 3.3 (3.2) in the following inquality to obtain the second line. To obtain the third line, notice that if \(|U_i|=1\), then \({{\,\mathrm{Coop}\,}}\left( \sum _{k=1}^{i-1}U_k,U_i\right) ={{\,\mathrm{DC}\,}}\left( \sum _{k=1}^{i-1}U_k,U_i\right) ={{\,\mathrm{CE}\,}}\left( \sum _{k=1}^{i-1}U_k,U_i\right) =0\). Otherwise \(|U_i|\ge 2\) and we can apply Lemma 4.1.

$$\begin{aligned} \begin{aligned} |V(G)|&={{\,\mathrm{Cov}\,}}(P)=\sum _{i=1}^t \left( {{\,\mathrm{Cov}\,}}(U_i)+{{\,\mathrm{Coop}\,}}\left( \sum _{k=1}^{i-1}U_k,U_i\right) -{{\,\mathrm{DC}\,}}\left( \sum _{k=1}^{i-1}U_k,U_i\right) \right) \\&= \sum _{i=1}^t {{\,\mathrm{Cov}\,}}(U_i)+ \sum _{i=1}^t\left( {{\,\mathrm{Coop}\,}}\left( \sum _{k=1}^{i-1}U_k,U_i\right) -{{\,\mathrm{DC}\,}}\left( \sum _{k=1}^{i-1}U_k,U_i\right) \right) \\&\le \sum _{i=1}^t {{\,\mathrm{Cov}\,}}(U_i)+(\varDelta -2)\sum _{i=1}^t{{\,\mathrm{CE}\,}}\left( \sum _{k=1}^{i-1}U_k,U_i\right) \end{aligned} \end{aligned}$$

\(\square\)

This result together with the corollary of Theorem 2.1 and Claim 3.3 implies the following:

Corollary 5.2

IfPis a solvable distribution on a vertex-transitive graphG, then

$$\begin{aligned} |P|\ge \frac{\frac{\varDelta -1}{\varDelta -2}|V(G)|+{{\,\mathrm{UE}\,}}(P)-\frac{1}{\varDelta -2}\sum _{i=1}^t {{\,\mathrm{Cov}\,}}(U_i)}{{{\,\mathrm{ef}\,}}(v)}. \end{aligned}$$

This is a tool that helps to prove lower bounds on optimal pebbling number. Also notice that each element of the formula can be calculated efficiently.

5.1 Back to Square Grids

We would like to investigate finite square grids. It is easier to investigate torus graphs instead of square grids, because they are vertex-transitive. Let \(T_{m,n}\) be the torus graph which we obtain if we glue together the opposite boundaries of \(P_{m+1}\square P_{n+1}\).

Note that \(T_{m,n}\cong C_m\square C_n\).

\(P_{m}\square P_n\) can be obtained from \(T_{m,n}\) by deleting some edges. Edge removal can not decrease the optimal pebbling number, therefore \(\pi _{{{\,\mathrm{opt}\,}}}(T_{m,n})\le \pi _{{{\,\mathrm{opt}\,}}}(P_m\square P_n)\). Therefore we work with \(T_{m,n}\) in the rest of the section.

The size of the distance i neighborhood in \(T_{m,n}\) is at most 4i. Thus Claim 3.2 gives the following estimates on excess and coverage of any unit placed on \(T_{m,n}\).

Claim 5.3

LetUbe a single unit on\(T_{m,n}\). Then:

$$\begin{aligned} {{\,\mathrm{Cov}\,}}(U)\le {\left\{ \begin{array}{ll} 1 &{} \text {if } |U|=1 \\ \frac{5}{2}|U| &{} \text {if } 2\le |U|\le 3 \\ \frac{13}{4}|U| &{} \text {if } |U|\ge 4 \\ \end{array}\right. } \end{aligned}$$

Claim 5.4

LetUbe a single unit on\(T_{m,n}\), where\(\min (m,n)\ge 5\). We have the following estimate on the ratio of unit excess and the size of the unit:

$$\begin{aligned} {{\,\mathrm{Exc}\,}}(U)\ge {\left\{ \begin{array}{ll} 0 &{} \text {if } |U|=1\\ \frac{1}{2}|U| &{} \text {if } 2\le |U|\le 3\\ \frac{8}{5}|U| &{} \text {if } 4\le |U| \end{array}\right. } \end{aligned}$$

To obtain these bounds it is enough to check small units and notice that the distance 2 neighborhood of u contains at least \(\frac{8}{5}\left|{U}\right|\) excess when \(\left|{U}\right|>4\).

Claim 5.5

Letvbe a vertex of\(T_{m,n}\). Then\({{\,\mathrm{ef}\,}}(v)< 9\).

A similar result is proven in [2] for the square grid. We mimic that calculation.

Proof

We know that \(|N_0(v)|=1\), \(|N_1(v)|=4\) and Fig. 8 shows that \(|N_i(v)|\le |N_{i-1}(v)|+4\). Therefore \(|N_i(v)|\le 4i\).

$$\begin{aligned} {{\,\mathrm{ef}\,}}(v)=\sum _{i=0}^{{{\,\mathrm{diam}\,}}(T_{m,n})}\left( \frac{1}{2}\right) ^i|N_i(v)| <&1+\sum _{i=1}^{\infty }\left( \frac{1}{2}\right) ^i 4i\\ =&1+4\sum _{i=1}^{\infty }\sum _{j=i}^{\infty }\left( \frac{1}{2}\right) ^j=1+4\sum _{i=1}^{\infty }\left( \frac{1}{2}\right) ^{i-1}=9 \end{aligned}$$

\(\square\)

Fig. 8
figure 8

A mapping shows that there are at most 4 more vertices in the distance 3 neighborhood than in the distance 2 neighborhood

Now we can obtain our new lower bound on the optimal pebbling number of the square grid:

Theorem 5.6

The optimal pebbling number of\(T_{m,n}\) is at least\(\frac{2}{13}nm\), when\({m,n\ge 5}\).

Proof

Let P be an optimal distribution of \(T_{m,n}\) and let \({\mathcal {D}}\) be a disjoint decomposition of P to unit distributions. Denote the elements of \({\mathcal {D}}\) with \(U_1, U_2 \dots U_t\), such that \(|U_i|\le |U_{i+1}|\). Let \({\mathcal {D}}_{\ge 4}\) be the subset of \({\mathcal {D}}\) which contains all units whose size is at least four. Furthermore let \({\mathcal {D}}_{2,3}\) be the set which contains the units whose size is two or three, and \({\mathcal {D}}_{1}\) be the set of units whose size is one. Denote the total number of pebbles which are placed on vertices belonging to \({\mathcal {D}}_{1}\) by \(S_1\). Define \(S_{2,3}\) and \(S_{\ge 4}\) similarly. It is clear that \(S_{1}=|P|-S_{2,3}-S_{\ge 4}\).

We start with Corollary 5.2 and use the estimates of Claims 5.35.4 and 5.5.

$$\begin{aligned} \begin{aligned} |P|\ge&\frac{\frac{\varDelta -1}{\varDelta -2}|V(G)|+{{\,\mathrm{UE}\,}}(P)-\frac{1}{\varDelta -2}\sum _{i=1}^t {{\,\mathrm{Cov}\,}}(U_i)}{{{\,\mathrm{ef}\,}}(v)}\ge \\ \ge&\frac{\frac{3}{2}nm+\frac{1}{2}S_{2,3}+\frac{8}{5}S_{\ge 4}-\frac{1}{2}\left( \frac{5}{2}S_{2,3}+\frac{13}{4}S_{\ge 4}+(|P|-S_{2,3}-S_{\ge 4})\right) }{9}=\\ =&\frac{-\frac{1}{2}|P|+\frac{3}{2}nm-\frac{1}{4}S_{2,3}+\frac{19}{40}S_{\ge 4}}{9}, \end{aligned} \end{aligned}$$

which implies

$$\begin{aligned} |P|\ge \frac{3}{19}nm-\frac{1}{38}S_{2,3}+\frac{1}{20}S_{\ge 4}. \end{aligned}$$

Consider the worst case when each of the units contains exactly two or three pebbles:

$$\begin{aligned} |P|\ge \frac{3}{19}nm-\frac{1}{38}|P|, \end{aligned}$$

thus

$$\begin{aligned} |P|\ge \frac{2}{13}nm. \end{aligned}$$

\(\square\)

Corollary 5.7

The optimal pebbling number of\(P_n\square P_m\) is at least\(\frac{2}{13}nm\) whenmandnare at least 5.

5.2 New Proof for the Optimal Pebbling Number of the Path and Circle

To illustrate the power of Lemma 4.1 we give a short proof of the following well known theorem. It was first proved in [6]. Later, essentially different proofs were given in [1] and [7].

Theorem 5.8

\(\pi _{{{\,\mathrm{opt}\,}}}(P_{3k+r})=\pi _{{{\,\mathrm{opt}\,}}}(C_{3k+r})=2k+r\) when\(0\le k\), \(0\le r \le 2\)andk, rare integers.

The new proof uses Lemma 4.1 when \(\varDelta =2\). (Note that the proof of this case was short and easy.)

Proof

It is easy to construct solvable distributions with the desired size, so we prove only the lower bound here.

Let u be a single unit on \(P_{3k+r}\) or \(C_{3k+r}\). Then:

$$\begin{aligned} \frac{{{\,\mathrm{Cov}\,}}(P_u)}{|P_u|}\le \frac{3}{2} \end{aligned}$$

Assume that P is a solvable distribution. Now by Lemma 4.1

$$\begin{aligned} 3k+r={{\,\mathrm{Cov}\,}}(P)\le&\sum _{i=1}^t \left( {{\,\mathrm{Cov}\,}}(U_i)+{{\,\mathrm{Coop}\,}}\left( \sum _{k=1}^{i-1}U_k,U_i\right) -{{\,\mathrm{DC}\,}}\left( \sum _{k=1}^{i-1}U_k,U_i\right) \right) \\ \le&\sum _{i=1}^t {{\,\mathrm{Cov}\,}}(U_i)\le \frac{3}{2}|P|. \end{aligned}$$

So \(2k+\frac{2r}{3}\le |P|\). |P| is integer, therefore this is equivalent to \(2k+r\le |P|\).\(\square\)

6 Open questions

Question 1

Is there a constant k, which does not depend on n, such that in an optimal distribution of \(P_n\square P_n\) no pebble can be moved to the distance-k neighborhood of its initial location?

Question 2

If the answer for Question 1 is yes, then how small can be k?

We think that the answer for Question 1 is yes and we conjecture that k can be 4. Any finite k would improve our lower bound on \(\pi _{opt}(P_n\square P_m)\).