1 Introduction

The purpose of this paper is to give a unified approach to the use of graph theory to find optimal incomplete-block designs with very low average replication. Some of the results given have already been proved by graph-theoretical or other methods. Therefore a part of this paper gives historical notes on how graph theory has been used to determine optimal designs, while the other part continues this approach to give new results.

The two graphs associated with an incomplete-block design are its Levi graph and its concurrence graph. In [8], Bapat and Dey used the Levi graph to find D-optimal designs with the minimal number of observations, while Balasubramanian and Dey used it in [7] to find D-optimal designs with nearly minimal number of observations. In [33], Tjur used it to find A-optimal designs with nearly minimal number of observations when the block size is two, and Krafft and Schaefer extended this in [22] to arbitrary block size.

On the other hand, Cheng formulated D-optimality in terms of the concurrence graph in [11]. In [1], Bailey used this graph to find A-optimal and D-optimal designs when the block size is two and the average replication is low. Sajjad used it in [31] to find A- and D-optimal designs when the number of observations is minimal or nearly minimal.

Section 2 of this paper recalls definitions of block designs and related concepts, including the information matrix, the Laplacian matrix, D-optimality and A-optimality, and gives the notation used for the rest of the paper. The Levi graph and concurrence graph are defined, and examples are given.

Section 3 shows how the values of both optimality criteria can be determined from properties of either the concurrence graph or the Levi graph. Then Sect. 4 uses this to give some general results for D-optimal designs and A-optimal designs. Although the results are different for the two sorts of optimality, they both show that graphs associated with optimal designs have strong constraints on whether they can be disconnected by the removal of a single edge or a single vertex. Then Sects. 5 and 6 apply these results in a unified way to classify the optimal designs when the number of observational units is very low.

2 Definitions and Notation

2.1 Incomplete-Block Designs and Associated Matrices

Consider v treatments to be tested in b blocks of size k, where \(2\le k<v\). Such a design is an incomplete-block design. A block design is said to be connected if all contrasts among treatment effects can be estimated, under the assumption that the expected value of each observation is the sum of an unknown fixed treatment effect and an unknown fixed block effect. The number of observations is bk, so a necessary condition for a block design to be connected is \(bk \ge b+v-1\). Put \(c = bk-b-v+1\). We need \(c\ge 0\). Therefore \(b(k-1)\ge v-1\), and so \(b \ge (v-1)/(k-1) >1\) because \(k<v\).

Thus the number of observations is said to be minimal if \(c=0\) and nearly minimal if \(c=1\). The class of connected designs for given values of v, b, k and c is denoted \(\mathscr {D}_c(v,b,k)\) in [7, 13]. The optimal designs with minimal or nearly minimal number of observations can be of special interest to a statistician. Two block designs in \(\mathscr {D}_1(18,6,4)\) are shown in Fig. 1.

Fig. 1
figure 1

Two block designs with \(b=6\), \(k=4\) and \(v=18\)

We denote individual treatments by small Roman letters such as i and j, and individual blocks by large Greek letters such as \(\varGamma \) and \(\Delta \). Denote by \(n_{i\varGamma }\) the number of times that treatment i occurs in block \(\varGamma \). The design is said to be binary if \(n_{i\varGamma } \le 1\) for all i and \(\varGamma \). Denote by \(r_i\) the number of observations on treatment i: thus

$$\begin{aligned} r_i = \sum _\varGamma n_{i\varGamma }. \end{aligned}$$

Definition 1

The incidence matrix \(\mathbf {N}\) of an incomplete-block design is the \(v\times b\) matrix whose \((i,\varGamma )\) entry is \(n_{i\varGamma }\).

Definition 2

Given an incomplete-block design, the concurrence \(\lambda _{ij}\) of treatments i and j is defined by

$$\begin{aligned} \lambda _{ij} = \sum _\varGamma n_{i\varGamma } n_{j\varGamma }. \end{aligned}$$

If the design is binary, then \(\lambda _{ii} = r_i\) and, for \(i\ne j\), the concurrence \(\lambda _{ij}\) is the number of blocks in which i and j both occur. The concurrence matrix \(\varvec{\Lambda }\) is the \(v\times v\) matrix with (ij) entry equal to \(\lambda _{ij}\). Therefore \(\varvec{\Lambda }=\mathbf {N}\mathbf {N}^\top \).

Let \(\mathbf {R}\) be the \(v \times v\) diagonal matrix with diagonal entries \(r_1\), ..., \(r_v\), and denote by \(\mathbf {I}_b\) the identity matrix of order b. By standard linear model theory, the Fisher information matrix for estimating both treatment and block parameters is \(\tilde{\mathbf {C}}\), where

$$\begin{aligned} \tilde{\mathbf {C}} = \begin{bmatrix} \mathbf {R} &{} \mathbf {N}\\ \mathbf {N}^\top &{} k\mathbf {I}_b \end{bmatrix}. \end{aligned}$$

The Fisher information matrix \(\mathbf {C}\) for the estimation of treatment effects only is given by \(\mathbf {C} = \mathbf {R} - k^{-1}\varvec{\Lambda }\). For all the treatment contrasts to be estimable, the rank of \(\mathbf {C}\) has to be \(v-1\). See [3, 4, 29, 32, 33].

2.2 Graphs Defined by Block Designs

We will discuss two graphs associated with block designs, the Levi graph and the concurrence graph. The Levi graph was introduced by Levi [24] and implicitly used by Bose in [9] in coining the concept of connectivity of a block design. In [12], Coxeter named it the Levi graph. Some authors call it the incidence graph. The concurrence graph was defined in [28] by Patterson and Williams as the variety-concurrence graph. It is worth mentioning here that Coxeter had called it the Menger graph in [12], and that Grünbaum claimed in [17] that ‘The earlier “Menger graphs” ... are much less useful.’

Definition 3

The Levi graph associated with an incomplete-block design is a bipartite graph with \(v+b\) vertices. The vertices in one part are labelled by the treatments, those in the other part by the blocks. There are bk edges, one corresponding to each observation. An observation on treatment i in block \(\varGamma \) gives an edge between vertices i and \(\varGamma \). Thus there are \(n_{i\varGamma }\) edges between i and \(\varGamma \), and bk edges in total. If the design is binary, then there is no more than one edge between any two vertices.

In any graph, the degree of a vertex denotes the number of edges incident with it. Thus in the Levi graph the degree \(\deg (\varGamma )\) of each block-vertex \(\varGamma \) is equal to k, while for each treatment-vertex \(\deg (i)=r_{i}\). The design can be recovered from its incidence matrix \(\mathbf {N}\) if its rows and columns are labelled and equivalently from its Levi graph if the vertices are labelled.

In a general graph with n vertices and e edges, the cycle rank is defined to be \(e-n+1\). This must be non-negative if the graph is connected. The Levi graph has \(b+v\) vertices and bk edges, so c is equal to the cycle rank of the Levi graph. Henceforth, we shall call it the cycle rank of the block design.

Figures 2 and 3 show the Levi graphs of the block designs in Fig. 1a, b, respectively. Throughout this paper we use the convention that an open square represents a block-vertex and a filled circle represents a treatment-vertex.

Fig. 2
figure 2

Levi graph for the block design in Fig. 1a

Fig. 3
figure 3

Levi graph for the block design in Fig. 1b

Definition 4

The concurrence graph of an incomplete-block design is a graph whose vertices are labelled by the treatments. Distinct treatments i and j are joined by \(\lambda _{ij}\) edges.

In the concurrence graph, the degree of vertex i is given by \(\deg (i)=\sum _{\{j|j\ne i\}}\lambda _{ij}\). In general, two different block designs may have the same concurrence graphs: therefore the block design cannot be retrieved from its concurrence graph. For example, Fig. 4 shows two block designs with \(v=6\), \(b=4\) and \(k=3\) whose concurrence graph is the one shown in Fig. 5. In spite of this, the concurrence graph provides significant information to rank designs on different optimality criteria, as discussed by Bailey and Cameron in [3, 4]. Furthermore, when \(k=2\), binary block designs are effectively the same as their concurrence graphs, with each edge corresponding to a block.

Fig. 4
figure 4

Two block designs with \(b=4\), \(k=3\) and \(v=6\)

Fig. 5
figure 5

Concurrence graph for both designs in Fig. 4

The following are equivalent: (i) an incomplete-block design is connected; (ii) its Levi graph is a connected graph; (iii) its concurrence graph is a connected graph. See [3, 4, 33].

2.3 Laplacian Matrices of Graphs

Definition 5

Let H be a graph on n vertices. The Laplacian matrix \(\mathbf {L}(H)\) of this graph is the \(n\times n\) matrix whose (ij) entry \(L_{ij}\) is the negative of the number of edges between vertices i and j, if \(i \ne j\), and whose (ii) entry \(L_{ii}\) is the degree of i.

It follows from the above definition that \(L_{ii}=-\sum _{\{j:j\ne i\}}L_{ij}\). Therefore the row and column sums of \(\mathbf {L}(H)\) are all zero. Hence the Laplacian matrix \(\mathbf {L}(H)\) has eigenvalue zero on the all-1 vector. Moreover, the multiplicity of this eigenvalue is equal to the number of connected components of H: see [3]. The other eigenvalues of \(\mathbf {L}(H)\) are all positive. One way of proving this is to note that \(\mathbf {L}(H)\) is the sum of matrices \(\mathbf {L}_e(H)\) over all edges e of H, where the submatrix of \(\mathbf {L}_e(H)\) corresponding to the two vertices joined by e is

$$\begin{aligned} \left[ \begin{array}{rr} 1 &{} -1\\ -1 &{} 1 \end{array}\right] \end{aligned}$$

and all other entries of \(\mathbf {L}_e(H)\) are zero: see [3]. It follows that \(\mathbf {L}(H)\) is positive semi-definite.

Let G be the concurrence graph of a given incomplete-block design, and let \(\tilde{G}\) be its Levi graph. Let \(\mathbf {L}\) and \(\tilde{\mathbf {L}}\) be the corresponding Laplacian matrices \(\mathbf {L}(G)\) and \(\mathbf {L}(\tilde{G})\). In [14,15,16], Gaffke showed that \(\mathbf {L} = k\mathbf {C}\) and

$$\begin{aligned} \tilde{\mathbf {L}} = \begin{bmatrix} \mathbf {R} &{} -\mathbf {N} \\ -\mathbf {N}^{T} &{} k\mathbf {I}_{b} \end{bmatrix}. \end{aligned}$$

It was shown in [4, 33] that if the block parameters in the usual linear model are replaced by their negatives, then \(\tilde{\mathbf {C}} = \tilde{\mathbf {L}}\).

We are interested only in connected incomplete-block designs whose blocks all have the same size. For such designs, zero has multiplicity one as an eigenvalue of \(\mathbf {C}\) and \(\mathbf {L}\). The other eigenvalues of these matrices will be called non-trivial.

2.4 Optimality in Incomplete-Block Designs

These definitions are taken from [32].

Definition 6

An incomplete-block design for v treatments in b blocks of size k is called D-optimal if it maximizes the geometric mean of the non-trivial eigenvalues of the information matrix \(\mathbf {C}\) over all connected block designs for v treatments with b blocks of size k.

Equivalently, a D-optimal design minimizes the volume of the ellipsoid of confidence for the vector of treatment parameters. Equivalently again, a D-optimal design maximizes the geometric mean of the non-trivial eigenvalues of the Laplacian matrix \(\mathbf {L}\).

Definition 7

An incomplete-block design is called A-optimal if it maximizes the harmonic mean of the non-trivial eigenvalues of the information matrix \(\mathbf {C}\) over all connected block designs for v treatments with b blocks of size k.

Equivalently, an A-optimal design minimizes the sum of the variances of the estimates of the differences between two treatment parameters. Equivalently again, an A-optimal design maximizes the harmonic mean of the non-trivial eigenvalues of the Laplacian matrix \(\mathbf {L}\).

For both D- and A-optimality, the first equivalence is given in [29, 32] and the second follows from the fact that \(\mathbf {L}=k\mathbf {C}\).

3 Graphs and Optimality

3.1 Spanning Trees

Definition 8

A subset of the edges in a graph is a cycle if, together with the vertices at their ends, they form a connected subgraph in which every vertex has degree two. A subset of the edges (together with its vertices) is a tree if it is connected but contains no cycles. A tree whose vertices are all those of the original graph is a spanning tree.

Thus a connected graph is a tree if and only if \(c=0\), while, if \(c=1\), then it contains a single cycle.

Figure 6 shows a graph which is a tree: it has six vertices, five edges, and cycle rank zero. On the other hand, the Levi graph in Fig. 2 has a single cycle, which has length 12. Every spanning tree of this graph is made by removing a single edge of the cycle from the graph.

Fig. 6
figure 6

A tree with six vertices and five edges

If a graph is not connected, it has no spanning trees. In [14], Gaffke showed the relevance to optimal design theory of the following theorem, known as the matrix tree theorem, which was given by Kirchhoff [21].

Theorem 1

If H is a connected graph with n vertices, then the product of n and the number of spanning trees of H is equal to the product of the non-trivial eigenvalues of \(\mathbf {L}(H)\).

In [30], Sachs gave the next theorem, which was cited in [16].

Theorem 2

Let G and \(\tilde{G}\) be the concurrence graph and Levi graph, respectively, of an incomplete-block design for v treatments in b blocks of size k. Then the number of spanning trees of \(\tilde{G}\) is equal to the product of \(k^{b-v+1}\) and the number of spanning trees of G.

Since the product of the non-trivial eigenvalues of \(\mathbf {L}\) is their geometric mean raised to the power \(v-1\), these results give the following way of assessing D-optimality by using graphs.

Theorem 3

Consider a connected incomplete-block design d for v treatments in b blocks of size k, with concurrence graph G and Levi graph \(\tilde{G}\). The following statements are equivalent.

  1. (i)

    The design d is D-optimal.

  2. (ii)

    The graph G has the maximal number of spanning trees among concurrence graphs of such designs.

  3. (iii)

    The graph \(\tilde{G}\) has the maximal number of spanning trees among Levi graphs of such designs.

Cheng used this theorem in [10, 11] with the concurrence graph to find some D-optimal designs whose concurrence matrices are highly patterned. Gaffke used it in [16] with both graphs to find D-optimal designs for \(v\le 6\). Bapat and Dey used it in [8] with the Levi graph to show that all connected designs are D-optimal when \(c=0\). Balasubramanian and Dey extended this in [7] to find all D-optimal designs when \(c=1\) and \(c=2\), but unfortunately gave the wrong constant of proportionality in Sachs’ theorem.

3.2 Electrical Networks

Denote by \(V_{ij}\) the variance of the best linear unbiased estimator of the difference between treatments i and j when the variance of each observation is unity. Thus

$$\begin{aligned} V_{ij} = C^-_{ii} + C^-_{jj} - 2C^-_{ij}, \end{aligned}$$

where \(C^-_{ij}\) is the (ij) entry of the Moore–Penrose generalized inverse \(\mathbf {C}^{-}\) of \(\mathbf {C}\). Paterson and Wild used concurrence graphs to evaluate block designs in [26, 27, 35], using a heuristic argument that, in general terms, \(V_{ij}\) should increase according to the distance between vertices i and j in the graph G, where distance is the length of the shortest path connecting the two vertices. This is true for some nicely patterned graphs, but is not true in all cases, as shown in [2].

When microarray experiments became popular, there was a sudden need for A-optimal block designs with \(k=2\). The designs were drawn as their concurrence graphs. Jones and Eccleston [19], Kerr and Churchill [20] and Wit et al. [36] found A-optimal designs for small values of v and b by computer search. In [1], Bailey gave analytical proofs of these results, and extended them to larger values. In [3], Bailey and Cameron observed that \(V_{ij}\) is proportional, not to distance in G, but to resistance-distance in G, which is explained below. Tjur had already proved in [33] that \(V_{ij}\) is equal to resistance-distance in the Levi graph \(\tilde{G}\), which he called the design network. He said “It is tempting to ask ... whether it makes sense to think of the variety-concurrence graph as an electrical network” and concluded that it does.

Any connected graph H can be considered as an electrical network with a 1-ohm resistance in each edge. Suppose that a battery is connected to vertices i and j, where \(i \ne j\), giving a potential difference of 1 volt between those vertices. Then current flows in the network according to the three standard laws of electrical networks.

  • Ohm’s Law: In every edge, the (signed) current is equal to the (signed) potential difference between the vertices at its ends.

  • Kirchhoff’s voltage law: The total (signed) potential difference around any cycle is zero.

  • Kirchhoff’s current law: At every vertex except i and j, the total (signed) outgoing current is zero.

These laws give a unique solution for the total current between vertices i and j. The reciprocal of its absolute value is called the effective resistance or resistance distance between vertices i and j. Electrical network theory shows that this effective resistance is equal to \( L^-_{ii} + L^-_{jj} - 2L^-_{ij}\), where \(L^-_{ij}\) is the (ij) entry of the Moore–Penrose generalized inverse of \(\mathbf {L}(H)\): see [6]. Thus results of [3, 4, 33, 34] can be summarized as follows.

Theorem 4

Let i and j be distinct treatments in an incomplete-block design with block size k. Let \(R_{ij}\) and \(\tilde{R}_{ij}\) be the effective resistances between i and j in the graphs G and \(\tilde{G}\), respectively. Then \(V_{ij} = \tilde{R}_{ij} = kR_{ij}\).

This result gives an efficient method for calculating the values \(V_{ij}\) when the graph does not have many edges. Moreover, it gives the following analogue of Theorem 3.

Theorem 5

Consider a connected incomplete-block design d for v treatments in b blocks of size k, with concurrence graph G and Levi graph \(\tilde{G}\). The following statements are equivalent.

  1. (i)

    The design d is A-optimal.

  2. (ii)

    Among concurrence graphs of such designs, the graph G minimizes the sum of the resistance distances between pairs of distinct vertices.

  3. (iii)

    Among Levi graphs of such designs, the graph \(\tilde{G}\) minimizes the sum of the resistance distances between pairs of distinct treatment-vertices.

For example, consider the resistance distance between treatment-vertices 1 and 3 in Fig. 2. The anticlockwise route around the cycle from 1 to 3 has eight edges, so, if a current of one amp goes on this route from vertex 1 to vertex 3, then the voltage difference between those two vertices is equal to eight volts. Then current will also flow along the shorter, clockwise route. This has four edges, and a voltage difference of eight volts, so the current must be two amps. Therefore the total current is three amps. The definition of effective resistance assumes a potential difference of 1 volt, so we scale so that the voltage difference is just one volt. This makes the current 3/8, and so \(\tilde{R}_{12}=8/3\).

This example demonstrates the two key methods of combining resistance distances, which are explained in [6]. If two paths have a single vertex in common, then their union gives a single path: the resistance distance between its endpoints is the sum of the resistance distances of the original paths. This is known as combining resistances in series. If the two paths have the same endpoints but no other vertices in common, then their resistance distances are in parallel: to combine these, we add the reciprocals of both, and then invert that.

3.3 Which Graph to Use?

When \(k=2\), the concurrence graph is effectively the same as the design, and it seems natural to use it to calculate the number of spanning trees and the pairwise effective resistances. When \(k=3\), both graphs have 3b edges, but the Levi graph has more vertices, so again it is simpler to use the concurrence graph. For larger values of k, the Levi graph has fewer edges, and the concurrence graph has b complete subgraphs of size k, so that calculations become simpler in the Levi graph. For theorems valid for all values of k, it seems simpler to use the Levi graph, although visualizing the graph for a given design with small k is easier with the concurrence graph. If \(b>v-1\), then Theorem 2 shows that the Levi graph has more spanning trees and so it may be easier to count them in the concurrence graph; if \(b<v-1\), then it is the other way around.

The remaining sections of the paper prove some general results valid for all values of k, so the proofs are given using the Levi graph in all cases.

4 Bridges and Cut-Vertices

4.1 Definitions

One way of measuring the connectivity of a graph is to find out how many edges (or vertices) need to be removed to disconnect the graph.

Definition 9

An edge of a connected graph is a bridge if its removal disconnects the graph. A vertex of a connected graph is a cut-vertex if its removal (together with all edges incident to it) leaves a disconnected graph.

Thus, if a graph is a tree, then every edge is a bridge.

For example, in Fig. 3, vertex 1 is a cut-vertex. The three edges coming down from that vertex are all bridges. Vertex 4 is not a cut-vertex, but the single edge incident with that vertex is a bridge.

A bridge in the Levi graph corresponds to an observation whose loss makes some treatment contrast inestimable. In the terminology of Latif et al. [23], if the Levi graph has a bridge, then the design has breakdown number one.

Lemma 1

Suppose that a connected graph K has a cut-vertex W, cutting it into components \(H_1\) and \(H_2\) both containing W. Let c, \(c_1\) and \(c_2\) be the cycle ranks of K, \(H_1\) and \(H_2\), respectively. Then \(c=c_1+c_2\).

Proof

For \(m=1\), 2, suppose that \(H_m\) contains \(e_m\) edges and \(v_m\) vertices in addition to W. See Fig. 7. Then \(c_m= e_m -v_m\) for \(m=1\), 2. The graph K has \(e_1+e_2\) edges and \(v_1+v_2+1\) vertices, and so \(c=e_1+e_2-v_1-v_2=c_1+c_2\). \(\square \)

Fig. 7
figure 7

Graph in the proof of Lemma 1: W is a cut-vertex

In the rest of this section, we show that optimality places strong conditions on the potential occurrences of bridges and cut-vertices in the Levi graph.

4.2 General Results for D-optimality

In [5], Bailey, Schiffl and Hilgers proved the following.

Theorem 6

If G is the concurrence graph of a D-optimal block design with \(k=2\), \(v\ge 3\) and \(c\ge 1\), then G does not contain a bridge.

When \(k=2\), the Levi graph has a bridge if and only if the concurrence graph has a bridge, so Theorem 6 can be rephrased in terms of the Levi graph. Here we prove a stronger result.

Theorem 7

Let \(\tilde{G}\) be the Levi graph of a D-optimal block design d with \(c\ge 1\). Then \(\tilde{G}\) does not contain any bridge, except, possibly, a bridge whose treatment-vertex has degree 1.

Proof

Since \(c\ge 1\), \(\tilde{G}\) is not a tree, so, if it does contain a bridge, then it contains a bridge e joining treatment-vertex i to block-vertex \(\varGamma \) in which at least one of those two vertices is contained in a cycle of \(\tilde{G}\). If e is removed, then \(\tilde{G}\) splits into two connected components. Denote by H and K the components containing i and \(\varGamma \), respectively. There are no other edges between H and K.

First, suppose that i is contained in a cycle in H, and that f is one edge of this cycle, joining treatment-vertex i to block-vertex \(\Delta \). See Fig. 8. Let \(n_1\) be the number of spanning trees of H which contain f, and let \(n_2\) be the number of spanning trees of H which do not contain f. Because f is in a cycle, \(n_2>0\). Let m be the number of spanning trees of K. The block-vertex \(\varGamma \) has degree k, so it has \(k-1\) edges in K, so K has more than one vertex. Therefore, because K is connected, \(m>0\). Every spanning tree of \(\tilde{G}\) consists of a spanning tree for H, the bridge e, and a spanning tree for K. Therefore \(\tilde{G}\) has \((n_1+n_2)m\) spanning trees, where \(n_2>0\).

Let j be one of the treatment-vertices in K which are joined to block-vertex \(\varGamma \). Create a new design \(d^*\) from d by removing i from \(\Delta \) and replacing it by j, so that all blocks still have size k. Let \(\tilde{G}^*\) be the Levi graph of \(d^*\), which omits the edge f but contains the edge \(e'\) between j and \(\Delta \). Let T and \(T'\) be spanning trees for H and K respectively. If T does not contain f, then \(T \cup \{e\} \cup T'\) and \(T \cup \{e'\} \cup T'\) are both spanning trees for \(\tilde{G}^*\); if T does contain f, then \((T{\setminus } \{f\}) \cup \{e,e'\} \cup T'\) is a spanning tree for \(\tilde{G}^*\). Therefore the number of spanning trees for \(\tilde{G}^*\) is at least \((n_1+2n_2)m\), which is strictly greater than \((n_1+n_2)m\) because \(n_2>0\) and \(m>0\). This contradicts the D-optimality of design d.

Fig. 8
figure 8

Graph in the first case in the proof of Theorem 7: e is a bridge

Fig. 9
figure 9

Graph in the second case in the proof of Theorem 7: e is a bridge

Secondly, suppose that \(\varGamma \) is contained in a cycle in K. The argument is similar, but with some modifications, so we give it in full. Let g be one edge in this cycle, joining treatment-vertex \(\ell \) to block-vertex \(\Theta \) (which may or may not be the same as \(\varGamma \)). See Fig. 9. Let \(m_1\) be the number of spanning trees of K which contain g, and let \(m_2\) be the number of spanning trees of K which do not contain g. Then \(m_2>0\). Let n be the number of spanning trees of H. If \(\deg (i)>1\), then H has more than one vertex and so \(n>0\); if \(\deg (i)=1\), then H consists of the single vertex i, which by itself forms the only spanning tree for H, and so \(n=1\). In both cases, the number of spanning trees of \(\tilde{G}\) is \((m_1+m_2)n\).

Create a new design \(d^{**}\) from d by removing \(\ell \) from \(\Theta \) and replacing it by i; all blocks still have size k. The Levi graph \(\tilde{G}^{**}\) of \(d^{**}\) is obtained from \(\tilde{G}\) by removing the edge g and inserting an edge \(e''\) between i and \(\Theta \). Let T and \(T'\) be spanning trees for H and K respectively. If \(T'\) does not contain g, then \(T \cup \{e\} \cup T'\) and \(T \cup \{e''\} \cup T'\) are both spanning trees for \(\tilde{G}^{**}\); if \(T'\) does contain g, then \(T \cup \{e,e''\} \cup (T' {\setminus } \{g\})\) is a spanning tree for \(\tilde{G}^{**}\). Therefore the number of spanning trees for \(\tilde{G}^{**}\) is at least \((m_1+2m_2)n\), which is strictly greater than \((m_1+m_2)n\) as \(m_2>0\) and \(n>0\), again contradicting the D-optimality of design d. \(\square \)

As an example of the proof of Theorem 7, consider the block design in Fig. 1b. Its Levi graph is in Fig. 3. This has a single cycle, which has length 6, so it has 6 spanning trees. Let e be the bridge joining treatment-vertex 1 to the vertex for the block \(\varGamma \) containing treatment 18, and let f be the edge joining treatment-vertex 1 to the block \(\Delta \) containing treatment 2. Create the new design by removing treatment 1 from \(\Delta \) and replacing it by treatment 18. The new design has a single cycle of length 8, containing the vertices for treatments 2, 3, 1 and 18 in order, so it has 8 spanning trees.

Note that, if \(bk <2v\), then the design must have at least one treatment i with \(r_i=1\) and so even a D-optimal design will have a bridge in which one of its vertices has degree 1. A vertex with degree 1 is called a leaf. Let us call a bridge leafless if neither of its vertices is a leaf.

Theorem 8

If \(\tilde{G}\) is the Levi graph of a D-optimal block design d with \(c\ge 1\), then no treatment-vertex of \(\tilde{G}\) is a cut-vertex.

Proof

Suppose that treatment-vertex i is a cut-vertex which splits \(\tilde{G}\) into components H and K both containing i as well as other vertices. Thus \(\deg (i) \ge 2\) and there are no edges between H and K except those through vertex i. Let e be an edge through i in H, joining i to block-vertex \(\varGamma \). Since \(c\ge 1\), Theorem 7 shows that e is not a bridge. Therefore e is contained in a cycle in H. Hence there is at least one spanning tree for H which contains e, and at least one which omits it. See Fig. 10.

Fig. 10
figure 10

Graph in the proof of Theorem 8: i is a cut-vertex

Suppose that i is joined to block-vertex \(\Delta \) in K. Because \(\deg (\Delta ) \ge 2\), there is a treatment-vertex j joined to \(\Delta \) with \(j\ne i\). Create a new design \(d^*\) from d by removing i from \(\varGamma \) and replacing it by j. The Levi graph \(\tilde{G}^*\) of \(d^*\) is obtained from \(\tilde{G}\) by replacing the edge e with an edge f connecting j to \(\varGamma \).

Every spanning tree of \(\tilde{G}\) has the form \(T \cup T'\), where T and \(T'\) are spanning trees for H and K respectively. Thus the number of spanning trees for \(\tilde{G}\) is \(n_1n_2\), where \(n_1\) and \(n_2\) are the number of spanning trees for H and K respectively. If T contains e, then \((T{\setminus } \{e\}) \cup \{f\} \cup T'\) is a spanning tree for \(\tilde{G}^*\). If T does not contain e, then \(T \cup T'\) is a spanning tree for \(\tilde{G}^*\). Furthermore, \(T'\) contains a unique path from i to j, and this has positive even length (it may not be the path through vertex \(\Delta \) shown in Fig. 10, because \(T'\) does not contain all edges in K). If \(e'\) is any one of the edges in this path, then \(T \cup \{f\} \cup (T' {\setminus }\{e'\})\) is a spanning tree for \(\tilde{G}^*\). Different choices for \(T'\) may give rise to the same spanning tree of this type. Nevertheless, at least two of them are distinct, and so the number of spanning trees for \(\tilde{G}^*\) is at least \(n_1n_2 +2\), which contradicts the D-optimality of d. \(\square \)

4.3 General Results for A-optimality

Theorem 9

Suppose that i is a treatment in a block design d and that its treatment-vertex is a cut-vertex of the Levi graph \(\tilde{G}\) of d, cutting it into components \(H_1\) and \(H_2\) both containing i. For \(m=1\), 2, let \(H_m\) contain \(v_m\) treatment-vertices in addition to i, let \(R_m\) be the sum of the resistance distances between pairs of treatment-vertices in \(H_m\) and let \(S_m\) be the sum of these in \(H_m\) which involve treatment-vertex i. Let \(K_m\) be the set of treatment-vertices, including i, in \(H_m\). Thus \(S_m= \sum _{j\in K_m} \tilde{R}_{ij}\), because \(\tilde{R}_{ii}=0\), and \(R_m = (\sum _{j\in K_m}\sum _{\ell \in K_m} R_{j\ell })/2\), because \(\tilde{R}_{jj}=0\) and \(\tilde{R}_{j\ell } = \tilde{R}_{\ell j}\) for all j and \(\ell \). Then the sum of the resistance distances between pairs of treatment-vertices in \(\tilde{G}\) is \(R_1 + R_2 +v_2 S_1 + v_1S_2\).

Proof

See Fig. 11. All resistance distances involving treatment-vertex i within \(H_1\) or \(H_2\) are included in \(R_1+R_2\). If t is a treatment-vertex in \(K_1 {\setminus }\{i\}\) and u is a treatment-vertex in \(K_2 {\setminus }\{i\}\), then \(\tilde{R}_{tu} = \tilde{R}_{ti} + \tilde{R}_{iu}\). Summing this over all t and u gives \(v_2S_1 + v_1S_2\), and the result follows. \(\square \)

Fig. 11
figure 11

Graph in the proof of Theorem 9: i is a cut-vertex

The following lemma shows the use of Theorem 9 on a small example. Not only does this help in understanding Theorem 9; it will also be used in the proof of Theorem 11.

Lemma 2

In addition to the assumptions and notation in Theorem 9, suppose that \(k=3\) and \(v_1=4\). Suppose that component \(H_1\) is as shown in Fig. 12a, so that block \(\Delta \) contains treatments i, t and u, while block \(\varGamma \) contains treatment j twice and treatment i once. Then d is not A-optimal.

Proof

In \(H_1\), the resistance distances between pairs of treatment-vertices are

$$\begin{aligned} \tilde{R}_{ij} = 3/2,\ \tilde{R}_{it} = \tilde{R}_{iu} = \tilde{R}_{tu} =2 \text { and }\tilde{R}_{jt} = \tilde{R}_{ju} =7/2. \end{aligned}$$

Hence \(R_1=29/2\) and \(S_1=11/2\).

Construct a new design \(d^*\) from d by removing one occurrence of j from \(\varGamma \), inserting it into \(\Delta \), and removing t from \(\Delta \) and inserting it into \(\varGamma \), thus giving \(H_1^*\) in Fig. 12b. The resistance distances between pairs of treatment-vertices in \(H_1^*\) are

$$\begin{aligned} \tilde{R}^*_{ij}=1,\ \tilde{R}^*_{it}= \tilde{R}^*_{iu}= \tilde{R}^*_{jt}= \tilde{R}^*_{iu}= 7/4 \text { and } \tilde{R}^*_{tu}= 3. \end{aligned}$$

Hence \(R^*_1 = 11 < R_1\) and \(S_1^* = 9/2 < S_1\).

Theorem 9 shows that the sum of the resistance distances between pairs of treatment-vertices in the Levi graph is

$$\begin{aligned} \begin{array}{c@{\qquad }l} R_1 + R_2 + v_2 S_1 + 4S_2 &{} \text {for design } d, \text { and}\\ R_1^* + R_2 + v_2 S_1^* + 4S_2 &{} \text {for design } d^*.\\ \end{array} \end{aligned}$$

Since \(R^*_1 < R_1\) and \(S_1^*< S_1\), the second value is strictly smaller, regardless of the properties of \(H_2\). By Theorem 5, the design d cannot be A-optimal. \(\square \)

Fig. 12
figure 12

Two possibilities for one component of the Levi graph in Lemma 2

Theorem 10

Let \(\tilde{G}\) be the Levi graph of an A-optimal block design d. Then there cannot be two distinct treatments i and j whose vertices are both cut-vertices.

Proof

Suppose that there are two such treatments i and j. Then removing vertices i and j splits the graph \(\tilde{G}\) into three components \(H_1\), \(H_2\) and \(H_3\), each with at least two vertices, where treatment-vertex i is the only vertex in \(H_1\cap H_2\), treatment-vertex j is the only vertex in \(H_2\cap H_3\) and \(H_1 \cap H_3\) contains no vertices. Each component must be connected, because the whole Levi graph is connected. For \(\ell \in \{1,2,3\}\), let \(K_\ell \) be the set of treatment-vertices in \(H_\ell \), and let \(v_\ell = \left| K_\ell {\setminus }\{i,j\} \right| \). See Fig. 13.

Fig. 13
figure 13

Graph in the proof of Theorem 10: i and j are both cut-vertices

Let

$$\begin{aligned} S_1 = \sum _{t\in K_1} \tilde{R}_{it}, \quad S_2 = \sum _{t\in K_2} \tilde{R}_{it}, \quad S_3 = \sum _{t\in K_2} \tilde{R}_{jt} \quad \text { and } \quad S_4 = \sum _{t\in K_3} \tilde{R}_{jt}. \end{aligned}$$

Then

$$\begin{aligned} \sum _{t\in K_1} \tilde{R}_{jt} = S_1 + (v_1+1)\tilde{R}_{ij} \quad \text {and so} \quad \sum _{t\in K_1\cup K_2} \tilde{R}_{jt} = S_3 + S_1 + v_1\tilde{R}_{ij} \end{aligned}$$

(after removing the double occurrence of \(\tilde{R}_{ij}\), because \(K_1\cap K_2 = \{i\}\)).

Let \(R_0\) and \(R_3\) be the sums of the resistance distances between pairs of treatment-vertices in \(K_1\cup K_2\) and \(K_3\), respectively. Theorem 9 shows that the sum of the resistance distances between pairs of treatment-vertices in \(\tilde{G}\) is

$$\begin{aligned} R_0 + R_3 + (v_1+v_2+1)S_4 + v_3(S_1+S_3+ v_1\tilde{R}_{ij}). \end{aligned}$$
(1)

There is no loss of generality in assuming that \(S_2\le S_3\). Create design \(d^*\) from d by replacing treatment j by treatment i in all those blocks containing j whose other treatment-vertices are in \(K_3\). Let \(\tilde{G}^*\) be the Levi graph of \(d^*\), and denote resistance distances in this graph by \(\tilde{R}^*\). Then

$$\begin{aligned} \sum _{t\in K_1\cup K_2} \tilde{R}^*_{it} = S_1 + S_2, \end{aligned}$$

and so Theorem 9 shows that the sum of the resistance distances between pairs of treatment-vertices in \(\tilde{G}^*\) is

$$\begin{aligned} R_0 + R_3 + (v_1+v_2+1)S_4 + v_3(S_1+S_2). \end{aligned}$$
(2)

Since \(S_2\le S_3\) and \(0< v_1v_3\tilde{R}_{ij}\), the comparison of expressions (1) and (2) shows that the sum of the resistance distances between pairs of treatment-vertices in \(\tilde{G}^*\) is strictly less than the corresponding sum in \(\tilde{G}\), which contradicts the A-optimality of d, by Theorem 5. \(\square \)

Theorem 11

If \(\tilde{G}\) is the Levi graph of an A-optimal block design d with \(c\ge 1\), then no treatment-vertex of \(\tilde{G}\) is a cut-vertex unless it is contained in a cycle.

Proof

Suppose that treatment i gives such a cut-vertex, contained in no cycle and cutting \(\tilde{G}\) into components \(H_1\) and \(H_2\) both containing i. Without loss of generality, there is a cycle in \(H_1\). If \(H_1{\setminus }\{i\}\) is not connected then we may move any, but not all, of its connected components into \(H_2\) without affecting the assumptions, so long as we leave a cycle in \(H_1\). Thus we may assume that \(H_1 {\setminus }\{i\}\) is connected.

Suppose that treatment i is in blocks \(\varGamma _1\) and \(\varGamma _2\) whose vertices are in \(H_1\). Since \(H_1 {\setminus }\{i\}\) is connected, it contains a path from \(\varGamma _1\) to \(\varGamma _2\). Adding the edges from \(\varGamma _1\) and \(\varGamma _2\) to i gives a cycle containing vertex i. This is contradicts our assumption that i is not in a cycle.

Otherwise, suppose that treatment i is in a unique block \(\varGamma \) in \(H_1\). Because \(H_1\) is connected and contains a cycle, there is at least one treatment-vertex j in \(H_1\) that is joined to \(\varGamma \) and is not a leaf. See Fig. 14. If the edge between \(\varGamma \) and j is not in a cycle, then j is also a cut-vertex, which contradicts Theorem 10. Hence this edge is in a cycle. If the cycle has length 2n, then the rule for combining resistance distances in parallel given in Sect. 3.2 shows that

$$\begin{aligned} \tilde{R}_{j\varGamma } \le \left[ 1 + \frac{1}{2n-1}\right] ^{-1} = \frac{2n-1}{2n} <1. \end{aligned}$$

On the other hand, \(\tilde{R}_{i\varGamma }=1\) because the edge between \(\varGamma \) and i is not contained in any cycle. In particular, there cannot be multiple edges between \(\varGamma \) and i.

Fig. 14
figure 14

Graph in the proof of Theorem 11: j is a cut-vertex not contained in a cycle

For \(m=1\), 2, let \(R_m\) be the sum of the resistance distances between pairs of treatment-vertices in \(H_m\), and let \(v_m\) be the number of treatment-vertices in \(H_m\) in addition to i. Let \(S_i\) and \(S_j\) be the sum of resistance distances between treatment-vertices in \(H_1\) and vertices i and j, respectively, and let S be the sum of resistance distances between vertex i and treatment-vertices in \(H_2\). Theorem 9 shows that the sum of the resistance distances for pairs of treatment-vertices in \(\tilde{G}\) is \(R_1+R_2 +v_2S_i + v_1S\).

Let t be a treatment, other than i and j, whose vertex is in \(H_1\). Because resistance distance is a metric,

$$\begin{aligned} \tilde{R}_{jt} \le \tilde{R}_{j\varGamma } + \tilde{R}_{t\varGamma } < 1 + \tilde{R}_{t\varGamma } = \tilde{R}_{it}. \end{aligned}$$

Hence \(S_j < S_i\).

Create a new design \(d^*\) from d by replacing treatment i with treatment j in every block containing i whose vertex is in \(H_2\). Then the sum of the resistance distances for pairs of treatment-vertices in the new Levi graph is \(R_1+R_2 +v_2S_j + v_1S\), which is strictly less than it was in \(\tilde{G}\). This contradicts the A-optimality of d, by Theorem 5.

The last step in the proof breaks down if i and j are the only treatment-vertices in \(H_1\). In this case, j is joined to \(\varGamma \) by at least two edges. As noted above, there is only one edge between i and \(\varGamma \). Hence there are \(k-1\) edges between j and \(\varGamma \). Thus \(\tilde{R}_{ij} = 1 + 1/(k-1) = k/(k-1)\). If \(k\ge 4\), then replacing one occurrence of treatment j in block \(\varGamma \) by a second copy of treatment i reduces \(\tilde{R}_{ij}\) to \(1/2 + 1/(k-2) = k/(2k-4)\). This reduces all the resistance distances from j to treatment-vertices within \(H_2\), while leaving all those within \(H_2\) unchanged. Thus d cannot be A-optimal, by Theorem 5.

Fig. 15
figure 15

Two possibilities for one component of the Levi graph at the end of the proof of Theorem 11

The only remaining possibility is that \(k=3\) and that vertex i cuts \(\tilde{G}\) into connected components, each containing one block. At least one block contains two copies of one treatment, which is not i. The remaining blocks are either like this or contain three distinct treatments. Hence there are either a pair of components giving the graph in Fig. 12a or there are a pair giving the graph in Fig. 15a. In the first case, Lemma 2 shows that d is not A-optimal. In the second case, a similar argument shows that replacing the graph in Fig. 15a by the one in Fig. 15b, while leaving the rest of the design unchanged, gives a design with a strictly smaller sum of resistance distances between pairs of treatment-vertices, and so d is not A-optimal, by Theorem 5. \(\square \)

5 D-optimal Block Designs with Low Cycle Rank

5.1 Minimal Number of Observations

Theorem 12

If \(bk=b+v-1\), then all connected block designs for v treatments in b blocks of size k are D-optimal.

Proof

In this case, the number of edges of the Levi graph is one less than its number of vertices. Hence, if it is connected, then it must be a tree, and so it has precisely one spanning tree. If it is not connected, it has none. \(\square \)

Theorem 12 was proved in [8, 25]. We include it here for completeness.

5.2 Nearly Minimal Number of Observations

Since \(c = bk - b - v + 1\), \(c = 1\) if and only if \(v = b(k - 1)\). Thus there is a single cycle if and only if \(v=b(k-1)\). In this case, \(b = v/(k-1) \le v\) and so \(\min \{b,v\} =b\).

Theorem 13

If \(bk=b+v\), then every D-optimal design is obtainable from the following by relabelling treatments and relabelling blocks. For \(i=1\), ..., \(b-1\), block \(\varGamma _i\) contains treatments i and \(i+1\), while block \(\varGamma _b\) contains treatments b and 1. A further \(k-2\) treatments, all with replication 1, are inserted into each block.

Proof

In this case, the Levi graph contains a single cycle. Each spanning tree is obtained by removing one edge from the cycle, so the design is D-optimal if and only if the cycle has maximal length, which is \(\min \{v,b\}=b\). \(\square \)

For example, this theorem shows that the block design in Fig. 1a is D-optimal.

Theorem 13 was proved in [7]. Again, we include it both for completeness and to show the benefit of the unified approach.

5.3 One More Observation

If \(bk=b+v+1\), then \(c=2\). Let \(\tilde{G}\) be the Levi graph of a block design d for v treatments in b blocks of size k. When \(c=2\), then there are only three possibilities for how the cycles occur in a graph: (a) there are two separate cycles, with no vertices in common; (b) there are two cycles with a single vertex in common; (c) there are two special vertices joined by three distinct paths.

In case (a), connectivity implies that there is a path connecting the two cycles. Every edge of this path is a leafless bridge, so Theorem 7 shows that d is not D-optimal.

In case (b), the single vertex in common to the two cycles is a cut-vertex. If d is D-optimal, then Theorem 8 shows that no treatment-vertex is a cut-vertex of \(\tilde{G}\). So suppose that \(\varGamma \) is a block whose vertex is a cut-vertex of \(\tilde{G}\), cutting it into components \(H_1\) and \(H_2\) both containing \(\varGamma \). Suppose further that both \(H_1\) and \(H_2\) contain other block-vertices in addition to \(\varGamma \). Theorem 7 shows that neither \(H_1\) nor \(H_2\) contains a leafless bridge, so all of their block-vertices are contained in cycles. By Lemma 1, each component has a single cycle containing all its block-vertices. Figure 16 shows an example, excluding the treatment-vertices outside the cycles. For \(m=1\), 2, let \(b_m\) be the number of block-vertices in \(H_m\) in addition to \(\varGamma \). Thus \(b=b_1+b_2+1 \ge 3\), and \(H_m\) contains \(2(b_m+1)\) spanning trees, each made by omitting one edge from its cycle. Thus the number of spanning trees of \(\tilde{G}\) is \(4(b_1 +1)(b_2+1)\). This is maximized when \(b_1\) and \(b_2\) are as equal as possible. If \(b=2n+1\), then it is maximized at \((b+1)^2\) when \(b_1=b_2 = n\); if \(b=2n+2\), then it is maximized at \(b(b+2)\) when \(b_1\) and \(b_2\) are equal to n and \(n+1\) in either order.

Fig. 16
figure 16

An example of the cycles in the Levi graph in the second case in Sect. 5.3, with \(b_1=5\) and \(b_2=4\)

In case (c) there are two special vertices \(W_1\) and \(W_2\) in \(\tilde{G}\) which are joined by three disjoint paths. Because there are no bridges, every block-vertex is either one of \(W_1\) and \(W_2\) or is contained in a single path. If \(W_1\) and \(W_2\) are both block-vertices, then the path lengths are all even and sum to \(2b+2\). Figure 17 shows an example, excluding the treatment-vertices outside the cycles. If \(W_1\) and \(W_2\) are both treatment-vertices, then the path lengths are all even and sum to 2b; otherwise, the path lengths are all odd and sum to \(2b+1\).

Fig. 17
figure 17

An example of the cycles of the Levi graph in the third case in Sect. 5.3, with \(p_1=p_3=8\) and \(p_2=6\)

Denote these path lengths by \(p_1\), \(p_2\) and \(p_3\). Then the number of spanning trees in \(\tilde{G}\) is \(p_1p_2 + p_1p_3 + p_2p_3\). This is maximized when \(p_1+p_2+p_3=2b+2\), so that \(W_1\) and \(W_2\) are both block-vertices, and the path lengths are as equal as possible. If \(b=3m+1\), this happens when two of the path lengths are \(2m+2\) and the other is 2m; if \(b=3m+2\), it happens when all of the path lengths are \(2m+2\); if \(b=3m+3\), it happens when two of the path lengths are \(2m+2\) and the other is \(2m+4\). In every case, \(p_1p_2 + p_1p_3 +p_2p_3 >(b+1)^2\), so a design with a block cut-vertex cannot be D-optimal. A D-optimal design can be constructed by drawing a graph with three paths, of the prescribed lengths, between block-vertices \(W_1\) and \(W_2\); inserting one treatment in alternate vertices in each path (and hence in the two blocks on either side); inserting \(k-3\) other treatments into the blocks with vertices \(W_1\) and \(W_2\); and inserting \(k-2\) other treatments into every other block. This works whenever \(k\ge 3\).

For \(k=2\), it is not possible for either of \(W_1\) or \(W_2\) to be a block-vertex. A similar argument shows that, in this case, the D-optimal designs are made by allocating two treatments to these vertices, drawing three paths of the prescribed lengths between them, and inserting one block in alternate vertices in each path.

This result was given in [7].

For example, when \(b=10\), \(k=4\) and \(v=29\), then the non-leaf part of the Levi graph is as shown in Fig. 17. If vertices \(W_1\) and \(W_2\) represent blocks 1 and 10, respectively, one possibility for the D-optimal design is shown in Fig. 18. On the other hand, if \(b=11\), \(k=2\) and \(v=10\), then we need to swap the roles of treatment-vertices and block-vertices in Fig. 17. Figure 19 shows one possibility for the D-optimal design in this case.

Fig. 18
figure 18

A D-optimal block design with \(b=10\), \(k=4\) and \(v=29\)

Fig. 19
figure 19

A D-optimal block design with \(b=11\), \(k=2\) and \(v=10\)

6 A-optimal Block Designs with Low Cycle Rank

6.1 Minimal Number of Observations

Theorem 14

If \(bk=b+v-1\), then a block design for v treatments in b blocks of size k is A-optimal if and only if there is one treatment that occurs in every block.

Proof

As in the proof of Theorem 12, if the Levi graph is connected, then it must be a tree. Every non-leaf vertex of a tree is a cut-vertex, so Theorem 10 shows that all treatment-vertices but one are leaves, so these treatments have replication 1. Connectivity then implies that the remaining treatment is in every block. The converse follows from the fact that the Levi graph of each block design that has one treatment that occurs in each block has the same sum of resistance distances. \(\square \)

Designs like those in Theorem 14 are called star designs in [3], and queen-bee designs in [4]. Figure 20 shows an example with \(b=8\), \(k=4\) and \(v=25\).

Fig. 20
figure 20

An A-optimal block design with \(b=8\), \(k=4\) and \(v=25\)

Theorem 14 was proved in [3, 8, 25]. Now the groundwork in Sect. 4 makes the proof simpler.

6.2 Nearly Minimal Number of Observations

As noted in Sect. 5.2, if \(c = 1\), then \(v = b(k-1)\) and the Levi graph contains a single cycle. Let s be the number of block-vertices in this cycle. Then the cycle contains s treatment-vertices and 2s edges. If \(s=b\), then there are no cut-vertices. If \(1<s<b\), then there is at least one block-vertex outside the cycle. If the path joining it to the cycle meets the cycle in a block-vertex, then that path contains a treatment-vertex which is not in a cycle and is a cut-vertex: by Theorem 11, the design is not A-optimal. On the other hand, if that path meets the cycle in a treatment-vertex \(\ell \), then that vertex is a cut-vertex. Theorem 10 shows that if the block design is A-optimal, then there is no other treatment cut-vertex. Hence treatment-vertex \(\ell \) is joined to all the \(b-s\) block-vertices outside the cycle. If \(s=1\), then the treatment whose vertex is in the cycle occurs twice in the block whose vertex is in the cycle, so connectivity implies that \(k\ge 3\). Denote by \(\mathscr {C}(b,k,s)\) the set of block designs satisfying these conditions.

Figures 2 and 3 show the Levi graphs of block designs in \(\mathscr {C}(6,4,6)\) and \(\mathscr {C}(6,4,3)\), respectively.

Lemma 3

Suppose that \(v=b(k-1)\) and a block design is in \(\mathscr {C}(b,k,s)\). Then the sum of all the pairwise resistance distances between treatment-vertices in the Levi graph is g(s)/6, where

$$\begin{aligned} g(s) = -(k-1)^2s^3 +2b(k-1)^2s^2 -[6bk(k-1) -4k^2 +2k-1]s +C, \end{aligned}$$

and \(C= b(k-1)[12b(k-1) -5k-4]\).

Proof

Let \(\mathscr {V}_1\) be the set of treatment-vertices in the cycle, let \(\mathscr {V}_2\) be the set of other treatment-vertices joined to block-vertices in the cycle, and let \(\mathscr {V}_3\) be the set of remaining treatment-vertices. Suppose that i and j are in \(\mathscr {V}_1\), with \(i\ne j\). The two paths from i to j have lengths 2p and \(2(s-p)\) for some p with \(1\le p \le s-1\). The rule for combining resistance distances in parallel gives

$$\begin{aligned} R_{ij} =\left( \frac{1}{2p} + \frac{1}{2(s-p)}\right) ^{-1} = \frac{2p(s-p)}{s}. \end{aligned}$$

For each fixed i, the sum of \(R_{ij}\) as j varies over \(\mathscr {V}_1\) is

$$\begin{aligned} \sum _{p=1}^{s-1} \frac{2p(s-p)}{s} =\frac{(s-1)(s+1)}{3}. \end{aligned}$$

Call this \(P_1\). If \(\varGamma \) is a block-vertex in the cycle, then the same argument shows that the sum of the resistance distances from \(\varGamma \) to other block-vertices in the cycle is also equal to \(P_1\).

Similarly, if i and \(\varGamma \) are a treatment-vertex and block-vertex in the cycle, then the two paths from i to \(\varGamma \) have lengths \(2q-1\) and \(2s-2q+1\) for some q with \(1\le q \le s\). For each fixed i, the sum of \(R_{i\varGamma }\) as \(\varGamma \) varies over these blocks is

$$\begin{aligned} \sum _{q=1}^{s} \frac{(2q-1)(2s-2q+1)}{2s} =\frac{(2s^2+1)}{6}. \end{aligned}$$

Call this \(P_2\).

For \(1\le m\le n\le 3\), denote by \(\mathscr {R}_{mn}\) the sum of the pairwise resistance distances between vertices in \(\mathscr {V}_m\) and \(\mathscr {V}_n\). Thus \(\mathscr {R}_{11} = sP_1/2\).

Suppose that vertex t in \(\mathscr {V}_2\) is joined to block-vertex \(\varGamma \). If \(i \in \mathscr {V}_1\), then the rule for combining resistance distances in series gives \(R_{ti} = R_{t\varGamma } + R_{\varGamma i} = 1 +R_{\varGamma i}\). Summing over i in \(\mathscr {V}_1\) gives \(s+P_2\). There are \(s(k-2)\) vertices in \(\mathscr {V}_2\), and so \(\mathscr {R}_{12} = s(k-2)(s+P_2)\).

Let t and u be distinct vertices in \(\mathscr {V}_2\). If they are joined to the same block-vertex, then \(R_{tu}=2\). There are \(s(k-2)(k-3)/2\) such pairs, so the sum of these resistance distances is \(s(k-2)(k-3)\). If they are joined to different block-vertices \(\varGamma \) and \(\Delta \), then \(R_{tu}= 2+R_{\varGamma \Delta }\). There are \((k-2)^2\) such pairs for each choice of \(\varGamma \) and \(\Delta \), so the sum of these resistance distances is \((k-2)^2[s(s-1) + \mathscr {R}_{11}]\). Hence \(\mathscr {R}_{22} = s(k-2)(k-3) + s(s-1)(k-2)^2 + s(k-2)^2P_1/2\).

If \(t\in \mathscr {V}_3\) and \(u\in \mathscr {V}_1 \cup \mathscr {V}_2\), then \(R_{tu} = 2 + R_{\ell u}\), where \(\ell \) is the cut-vertex. Summing over u in \(\mathscr {V}_1\) gives \(2s+P_1\), while summing over u in \(\mathscr {V}_2\) gives \((k-2)(3s+P_2)\). There are \((b-s)(k-1)\) vertices in \(\mathscr {V}_3\), and so \(\mathscr {R}_{13} = (b-s)(k-1)(2s+P_1)\) and \(\mathscr {R}_{23} = (b-s)(k-1)(k-2)(3s+P_2)\).

If different vertices t and u in \(\mathscr {V}_3\) are in the same block, then \(R_{tu}=2\); otherwise, \(R_{tu}=4\). Hence \(\mathscr {R}_{33} = (b-s)(k-1)(k-2) + 2(b-s)(b-s-1)(k-1)^2\).

Adding \(\mathscr {R}_{11}\), \(\mathscr {R}_{12}\), \(\mathscr {R}_{13}\), \(\mathscr {R}_{22}\), \(\mathscr {R}_{23}\) and \(\mathscr {R}_{33}\) shows that the sum of all the pairwise resistance distances between treatment-vertices in the Levi graphs is g(s)/6. \(\square \)

Theorem 15

If \(v=b(k-1)\), then a block design for v treatments in b blocks of size k is A-optimal if and only if it is in \(\mathscr {C}(b,k,s)\), where

  1. (i)

    \(s=b\) if \(2\le b\le 4\), or \(b=5\) and \(k\le 5\), or \(b=6\) and \(k\le 3\), or \(7\le b \le 8\) and \(k=2\);

  2. (ii)

    \(s=4\) if \(9\le b \le 11\) and \(k=2\);

  3. (iii)

    \(s=3\) or \(s=4\) if \(b=12\) and \(k=2\);

  4. (iv)

    \(s=3\) if \(b=6\) and \(k=4\), or \(7\le b\le 11\) and \(k=3\), or \(b\ge 13\) and \(k=2\);

  5. (v)

    \(s=2\) otherwise.

Proof

Lemma 3 shows that we need to find the value of s, for s in \(\{1, 2, \ldots , b\}\), which minimizes g(s). First, we note that

$$\begin{aligned} g(1) - g(2 )= & {} 7(k-1)^2 -6b(k-1)^2 + 6bk(k-1) - 4k^2 +2k -1\\= & {} (3k+6b-9)(k-1) -3 >0, \end{aligned}$$

because \(b\ge 2\) and \(k\ge 2\). Thus designs in \(\mathscr {C}(b,k,1)\) are never A-optimal.

We now take four steps to find the integer s in [2, b] which minimizes g(s).

Step 1 If we regard the polynomial function g as defined on the real numbers, then

$$\begin{aligned} g'(s) = -3(k-1)^2s^2 + 4b(k-1)^2s -6bk(k-1) +4k^2 -2k+1. \end{aligned}$$

This is zero when

$$\begin{aligned} s = \frac{2b}{3} \pm \frac{(k-1)\sqrt{\alpha (b,k)}}{3(k-1)^2}, \end{aligned}$$

where

$$\begin{aligned} \alpha (b,k) = 4b^2(k-1)^2 -18bk(k-1) +12k^2 -6k+3. \end{aligned}$$

If \(\alpha (b,k)<0\), then \(g'(s)\) is never zero for real values of s, and so g(s) decreases monotonically. Thus g(s) is minimized for \(s\in \{1,2, \ldots , b\}\) at \(s=b\). This happens for all k when \(b=2\) or \(b=3\), and for \(k\le 14\) when \(b=4\). When \(b=4\), then \(g(3)-g(4) = k^2+16k-20 >0\) for all \(k\ge 2\), so g(s) is minimized for \(s \in \{1,2,3,4\}\) at g(4).

It is also the case that \(\alpha (b,k) <0\) if \(b=5\) and \(k \le 4\); if \(b=6\) and \(k\le 3\); and if \(b=7\) and \(k=2\). In all of these cases, g(s) is minimized for \(s\in \{1,2, \ldots , b\}\) at \(s=b\).

From now on, we may assume that \(b\ge 5\).

Step 2 Now we compare g(b) with g(2) and g(3), and show that g(b) is larger than at least one of the other two in almost all of the remaining cases. We have

$$\begin{aligned} g(b) - g(2) = b^2(k-1)[(b-6)(k-1) -6] + b(8k^2-7) + 2k(b-6) +6. \end{aligned}$$

If \(b=5\), this is \(15k^2-102k+96\), which is positive whenever \(k\ge 6\). If \(b=6\), it is \(12(4k^2-18k+15)\), which is positive whenever \(k\ge 4\). For all larger values of b, it is positive whenever \(k\ge 3\).

When \(k=2\),

$$\begin{aligned} g(b)-g(3) = b^2(b-12) +31b -12. \end{aligned}$$

This is positive whenever \(b\ge 9\).

Thus g(b) does not give the minimum value if \(k\ge 6\) and \(b\ge 5\), if \(k\ge 4\) and \(b\ge 6\), if \(k\ge 3\) and \(b\ge 7\), or if \(k=2\) and \(b\ge 9\). In all these cases, it suffices to find the real number \(s_0\) in (1, b] where \(g'(s_0)=0\); note that \(s_0\ne 1\), because \(g(1) > g(2)\). If \(s_0 <2\), then the minimum for integer s occurs at \(s=2\). Otherwise, if \(s_0\) is not an integer, compare \(g(s_1)\) and \(g(s_2)\), where \(s_1\) and \(s_2\) are the integers on either side of \(s_0\).

Step 3 If \(g(3)>g(2)\), then the overall minimum of g(s) on [1, b] occurs in [1, 3], and the minimum value for integer s is at \(s=2\). Now,

$$\begin{aligned} g(3)-g(2) = (4bk-10b -15 k+21)(k-1) +3. \end{aligned}$$

This is always negative when \(k=2\). However, \(g(3)-g(2)>0\) when \(k=3\) and \(b\ge 12\); when \(k=4\) and \(b\ge 7\); when \(k=5\) and \(b\ge 6\); and when \(k\ge 6\) and \(b \ge 5\). Apart from the situation with \(k=2\), this leaves only a small finite number of cases to be checked.

Step 4 When \(k=2\), \(g(s) =-s^3 +2bs^2 -(12b-13)s +C\). Therefore \(g(4)-g(3) =2b-24\). Hence the minimum occurs at \(s=3\) if \(b\ge 13\). If \(b=12\), then \(g(3)=g(4)\) and the minimum value for integers s occurs at both points. If \(b\le 11\), then g(s) is minimized for some s in [4, b]. From Step 1, the only remaining cases to check are values of b with \(8\le b \le 11\). \(\square \)

For example, this theorem shows that the block design in Fig. 1(b) is A-optimal.

Theorem 15 was proved in [4, 22, 31]. Again, the proof here is much shorter (in spite of the detailed calculations) now that we have the groundwork in Sect. 4.

Theorems 13 and 15 show that, for \(c=1\), the A-optimal designs are very different from the D-optimal designs as either of b or k increases.

7 Comments and Conclusions

7.1 Summary

In this paper, we have concentrated on using properties of the Levi graph to find the D-optimal block designs and A-optimal block designs when the number of observations is close to minimal. We believe that the results in Sect. 4 are new. Moreover, they do not depend on the number of observations being small. They show that, if the cycle rank is at least 1, then the Levi graph of a D-optimal design does not contain any bridge, except, possibly, one whose treatment-vertex is a leaf. Moreover, in the Levi graph of a D-optimal design with cycle rank at least 1, no treatment-vertex is a cut-vertex. The Levi graph of an A-optimal design with cycle rank at least 1 can have no more than one treatment-vertex which is a cut-vertex, and any such vertex must be contained in a cycle.

Sections 5 and 6 use the results of Sect. 4 to give unified, and shorter, proofs of a number of known results about D-optimal and A-optimal designs with low cycle rank. Parts of [4] referred to a preliminary version of the current paper for details of the proofs.

For small designs with cycle rank 1, the same design can be both D-optimal and A-optimal. For example, when \(b=k=4\) and \(v=16\), then the design in Fig. 21 is D-optimal, by Theorem 13, and A-optimal, by Theorem 15.

Fig. 21
figure 21

A block design with \(b=k=4\) and \(v=12\) which is both D-optimal and A-optimal

However, as the number of treatments increases, the A-optimal designs diverge from the D-optimal designs. For example, when \(b=6\), \(k=4\) and \(v=18\), then the design in Fig. 1a is D-optimal, by Theorem 13, but the design in Fig. 1b is A-optimal, by Theorem 15. The geometric means of the non-trivial eigenvalues of the information matrices of these two designs are 0.8411 and 0.8075, respectively. Their harmonic means are 0.5829 and 0.6000, respectively.

7.2 What About Dual Designs?

If d is a block design, then its dual block design \(d^\top \) is made by simply interchanging the roles of treatments and blocks. In the case that all treatments have the same replication, in addition to all blocks having the same size, John and Mitchell proved in [18] that \(d^\top \) is D-optimal if and only if d is D-optimal, and that \(d^\top \) is A-optimal if and only if d is A-optimal. Thus if we are interested in estimating block effects, and use equireplicate designs, then the designs which are optimal for block effects are the same as those which are optimal for treatment effects, under either the D or the A criterion.

In most experimental situations, both management constraints and randomization concerns lead us to use blocks of equal size, even though there is no such restriction on treatment replication. We are not usually interested in estimating block effects, so we are not concerned about the optimality of \(d^\top \). Suppose that we are interested in block effects, as well as treatment effects. Since the Levi graph of \(d^\top \) is the same as the Levi graph of d, Theorem 3 shows that \(d^\top \) is D-optimal if and only if d is D-optimal. Thus, whether or not we insist on equal replication for treatments, the design which is D-optimal for treatment effects is D-optimal for block effects.

The same is true for A-optimality when the cycle rank is 0. Theorem 14 shows that the A-optimal designs are precisely the star designs. The dual of a star design has one complete block, and all other blocks of size one. Therefore a star design gives the minimal sum of resistance distances between pairs of block-vertices.

However, the same is not true for A-optimality in general. For example, consider cycle rank 1. When \(k=2\) and \(b=v\), an investigation like the one in the proof of Theorem 15 shows that, for \(b\ge 5\), \(d^\top \) is A-optimal if and only if d is in \(\mathscr {C}(b,2,2)\).

7.3 What About Larger Cycle Rank?

It seems unlikely that the detailed investigations in Sects. 5 and 6 can be replicated for larger cycle ranks. However, Section 6 of [1] does suggest that, when average replication is no more than three, the A-optimal designs will diverge from the D-optimal designs as v increases. Thus John and Mitchell’s assumption in [18] that optimal designs will always be equireplicate if possible is no longer tenable. We recommend that any computer search for optimal block designs should not rule out considering designs with unequal (possibly very unequal) replication of treatments.