1 Introduction

In a crossover design, the effects of t treatments are compared by giving each of n subjects a series of treatments in p consecutive periods. The response of each unit in each period is measured and used to estimate the so-called direct effects of the treatments. Usually, for experiments of this kind, there is concern that a treatment given in period i, \(1 \le i \le p\), in addition to its direct effect will also have a carryover effect that influences the measurement in period \(i+1\). An important application of crossover designs is in sensory experiments, where each assessor evaluates a number of products, one after the other. Here, a carryover effect may for instance be caused by a lingering taste of a product.

There are two options to deal with carryover effects. One possibility is to explicitly include carryover effects in the model and use an estimate that corrects for carryover effects. There is a vast literature on designs for the corrected estimate, see, e.g. [3].

Another approach is to neglect the carryover effect and use an uncorrected estimate which does not correct for carryover effects. In sensory experiments, the analysis is usually done with the uncorrected estimate. In most cases, the experimenter then uses technical measures like, e.g. washout with a neutral taste to reduce carryover.

To deal with the, hopefully small, carryover effects which may be present in spite of washout, some authors recommend using neighbour-balanced designs as a precaution to minimize the effects of carryover, see, e.g. [11, p. 189]. Neumann and Kunert [12] have formally shown that the use of neighbour-balanced designs can minimize the mean square error of the uncorrected estimate.

In a circular crossover design, there is a run-in period where each unit receives a treatment but no measurement is made. During the run-in period of a circular design, each unit receives the treatment which it will receive again in the last period. Hence, there is a carryover effect in each period. Circular designs were introduced to the area of crossover designs by Magda [10]. There are some results on optimal designs for estimators taking account of the carryover effects, e.g. by Kunert [7], Druilhet [5] and Bailey and Cameron [2].

Azaïs [1] has shown that for circular designs and the uncorrected estimate, there is a randomization which preserves the neighbour structure but validates an analysis with the simple block model. This makes circular designs appealing for the uncorrected estimate.

The present paper extends the results of [12] to the class of circular designs. Since the mean square error depends on the size of the carryover effects, we determine locally optimal designs. If the number of periods is larger than the number of treatments, however, it turns out that there are designs minimizing the mean square error regardless of the size of the carryover effects. If the number of periods is less or equal to the number of treatments, the optimal design varies with the size of the carryover effects. As long as the carryover effects are small, no experimental unit in the optimal design receives the same treatment twice. When the carryover effects get larger, some units will receive the same treatment twice in consecutive periods. The proportion of units receiving the same treatment twice increases with increasing carryover effects. These results remain correct whether there is a period effect in the model or not.

2 Calculating the MSE

Define \(\Omega _{t,n,p}\) as the set of all circular crossover designs with t treatments, n units (subjects) and p periods. We assume that the j-th observation on unit i can be written as

$$\begin{aligned} y_{ij}= \alpha _i+ \tau _{d(i,j)}+\rho _{d(i,j-1)}+\varepsilon _{ij}. \end{aligned}$$

Here, \(\alpha _i, 1\le i\le n\), is the effect of the i-th unit, \(\tau _{d(i,j)}\) is the effect of the treatment given to the i-th unit in the j-th period by the design \(d \in \Omega _{t,n,p}\), \(\rho _{d(i,j-1)}\) is the carryover effect of the treatment given to unit i in period \((j-1)\) with \(\rho _{d(i,0)} = \rho _{d(i,p)}\), and \(\varepsilon _{ij}\) is the error. The errors are independent, identically distributed with expectation 0 and variance \(\sigma ^2.\)

To continue, we need some notation. Denote by \(A^T\) the transpose of the matrix A. If \(I_t\) is the \(t\times t\)-identity matrix and \(1_t\) the t-dimensional vector of ones, define \(H_t=I_t-\frac{1}{t}1_t 1_t^T\). For a matrix \(A\in {\mathbb {R}}^{(n,r)}\) set \(\omega ^\perp (A)=I_n-A(A^TA)^+A^T\), where \(B^+\) denotes the Moore–Penrose generalized inverse of the matrix B.

In vector notation, our model then can be written as

$$\begin{aligned} y =U\alpha + T_d \tau + F_d \rho + \varepsilon . \end{aligned}$$
(1)

Here, \(y=[y_{11},\ldots ,y_{1p},y_{21},\ldots ,y_{np}]^T\) and \(\varepsilon \) is the vector of the errors. Further, \(\alpha \), \(\tau \) and \(\rho \) are the vectors of the unit, direct and carryover effects, respectively. The matrices U, \(T_d\) and \(F_d\) are the corresponding design matrices.

We assume that the analysis of the data is done with a model neglecting the carryover effects, i.e.

$$\begin{aligned} y=U\alpha + T_d \tau + \varepsilon . \end{aligned}$$
(2)

If we define \(M_{d11}=T_d^T\omega ^\perp (U)T_d\), then \(M_{d11}\) is the information matrix for the estimation of direct effects in model (2). Note that \(M_{d11}\) and, therefore, \(M_{d11}^+\) have row- and column-sums 0. Consequently, \(H_tM_{d11}^+=M_{d11}^+\) and \({\hat{\tau }}=M_{d11}^+T_d^T\omega ^\perp (U)y\) is the Gauss–Markov estimate for \(H_t \tau \) in model (2).

If there are carryover effects, then the uncorrected estimate \({\hat{\tau }}\) is biased. More precisely,

$$\begin{aligned} E({\hat{\tau }})=H_t \tau + M_{d11}^+T_d^T\omega ^\perp (U)F_d \rho . \end{aligned}$$

As in [12], we try to determine a design that minimizes the mean square error (MSE) as a performance measure combining bias and variance.

The joint information matrix of direct and carryover effects in model (1) can be written as

$$\begin{aligned} M_d= \begin{bmatrix} M_{d11}&{}\quad M_{d12}\\ M_{d12}^T&{}\quad M_{d22} \end{bmatrix}, \end{aligned}$$

where \(M_{d11}\) is as before and

$$\begin{aligned} M_{d12}= & {} T_d^T \omega ^\perp (U)F_d,\\ M_{d22}= & {} F_d^T \omega ^\perp (U)F_d, \end{aligned}$$

see [3, p. 15].

Analogous to [12], we calculate the MSE in the following way: We restrict attention to designs which allow estimation of all contrasts of direct effects in the model without carryover effects. Because \(M_{d11}\) is the information matrix for direct effects in the model without carryover effects, this is the set of all designs for which \(\hbox {rank}(M_{d11}) = t-1\). In the model with carryover effects, the MSE of the uncorrected estimate \(\widehat{\tau _i - \tau _j}\) for any pair \((i,j), i \ne j\) then is

$$\begin{aligned} E\left( \widehat{\tau _i - \tau _j}-(\tau _i - \tau _j)\right) ^2 = \sigma ^2 \ell _{ij}^T M_{d11}^+ \ell _{ij} + \left( \ell _{ij}^T M_{d11}^+ M_{d12} \rho \right) ^2, \end{aligned}$$

where \(\ell _{ij}\) is a t-dimensional vector with \(+1\) in position i, \(-1\) in position j and all other entries 0. With the calculations done in [12], we get that the average MSE over all pairs equals

$$\begin{aligned} \frac{2}{t-1}\left( \sigma ^2 \hbox {tr}\left( M_{d11}^+\right) + \rho ^T \left( M_{d12}^T M_{d11}^+ M_{d11}^+ M_{d12}\right) \rho \right) , \end{aligned}$$

where \(\hbox {tr}(M)\) denotes the trace of a matrix M. As in [12], we consider the worst case for given \(\rho ^T H_t \rho = \sum (\rho _i - {\bar{\rho }})^2=\delta ,\) say. That is, we consider

$$\begin{aligned}&\max _{\rho ^T H_t \rho = \delta } \left( \frac{2}{t-1}\left( \sigma ^2 \hbox {tr}\left( M_{d11}^+\right) + \rho ^T \left( M_{d12}^T M_{d11}^+ M_{d11}^+ M_{d12}\right) \rho \right) \right) \\&\quad = \frac{2}{t-1}\left( \sigma ^2 tr(M_{d11}^+) + \delta \lambda _1 \left( M_{d12}^T M_{d11}^+ M_{d11}^+ M_{d12}\right) \right) , \end{aligned}$$

where \(\lambda _i(M)\) is the i-th largest eigenvalue of the symmetric matrix M.

We multiply this equation by \(\frac{t-1}{2}\) and get the following definition for our optimality-criterion. For any \(d \in \Omega _{t,n,p}\), we define

$$\begin{aligned} \hbox {MSE}(d)= \sigma ^2 tr(M_{d11}^+) + \delta \lambda _1\left( M_{d12}^TM_{d11}^{+}M_{d11}^+M_{d12}\right) . \end{aligned}$$

The advantage of this criterion is that the multivariate purpose of minimizing the bias and maximizing the precision of the estimators can be measured as a single real number. In what follows, we assume without loss of generality that \(\sigma ^2=1.\) Our aim is to find a design that minimizes \(\hbox {MSE}(d)\).

We call a square matrix M completely symmetric, if there are numbers a and b such that all diagonal elements are equal to a and all off-diagonal elements are equal to b. We get the following bound for \(\hbox {MSE}(d)\).

Theorem 1

For any design \(d \in \Omega _{t,n,p}\), there is a lower bound for \(\hbox {MSE}(d)\), namely

$$\begin{aligned} \hbox {MSE}(d) \ge \frac{(t-1)^2}{\hbox {tr}(M_{d11})} + \delta \frac{(\hbox {tr}(M_{d12}))^2}{(\hbox {tr}(M_{d11}))^2}. \end{aligned}$$

Equality holds if both \(M_{d11}\) and \(M_{d12}\) are completely symmetric.

Proof

The proof given for Proposition 1 of [12] remains valid in the circular case. \(\square \)

Note that for every \(M_{d11}\) and \(M_{d12}\) there are symmetrized versions \({\bar{M}}_{d11}\) and \({\bar{M}}_{d12}\) with

$$\begin{aligned} {\bar{M}}_{dij} = \frac{1}{t!} \sum _{\Pi \in {\mathcal {S}}_t}\Pi ^T M_{dij}\Pi \end{aligned}$$

for \(1\le i \le j \le 2.\) Here \({\mathcal {S}}_t\) is the set of all \((t\times t)-\)permutation matrices \(\Pi \). We see that all \({\bar{M}}_{dij}\) are completely symmetric and that \(\hbox {tr}(M_{dij}) = \hbox {tr}({\bar{M}}_{dij}).\)

For any design d, define \(q_{dij}=\frac{1}{n}\hbox {tr}\left( M_{dij}\right) \) for \(1\le i \le j \le 2.\) Then, the lower bound in Theorem 1 can be written as

$$\begin{aligned} \hbox {MSE}(d)\ge \frac{(t-1)^2}{n\,q_{d11}}+\delta \left( \frac{q_{d12}}{q_{d11}}\right) ^2. \end{aligned}$$

Each subject in the design d receives a sequence s of treatments. Denote by T(s) and F(s) the part of \(T_d\) and \(F_d\) that corresponds to s and define

$$\begin{aligned} q_{11}(s)=\hbox {tr}\left( T(s)^T\omega ^\perp (1_p)T(s)\right) \quad \mathrm {and} \quad q_{12}(s)=\hbox {tr}\left( T(s)^T\omega ^\perp (1_p)F(s)\right) . \end{aligned}$$

We call two sequences \(s_1\) and \(s_2\) equivalent if \(s_1\) can be transformed to \(s_2\) by relabelling the treatments. Two equivalent sequences then have the same \(q_{ij}(s)\). If for given t and p there are K, say, equivalence classes of sequences, we choose a representative sequence \(s_k, 1 \le k \le K\) for each class. As pointed out by Kushner [9], the \(q_{dij}\) then are weighted means of the \(q_{ij}(s_k)\). More precisely, we get

$$\begin{aligned} q_{dij} = \sum _{k=1}^K q_{ij}(s_k)\pi _d(k), \end{aligned}$$

where \(\pi _d(k)\) is the proportion of units of d receiving a sequence from class \(k, 1 \le k \le K.\)

Similarly to [12], we consider approximate designs. For approximate designs, we remove the restriction that the number of experimental units to receive a given sequence s must be an integer. For an approximate design, the \(\pi _d(k), 1 \le k \le K\) can be any set of nonnegative real numbers, subject to the condition that \(\sum _{k=1}^K \pi _d(k)=1.\) An exact design \(d \in \Omega _{t,n,p}\) then is a special instance of an approximate design, where each sequence is assigned to an integral number of units. We denote the set of all approximate designs for given tn and p by \(\Delta _{t,n,p}\). Note that the number n of units is not important for an approximate design \(d \in \Delta _{t,n,p}\). It plays a role in the calculation of \(\hbox {MSE}(d)\), however. A design \(d \in \Delta _{t,n,p}\) is called symmetric, if each sequence s from class \(k, 1 \le k \le K\) appears equally often (i.e. \(\pi _d(k)/m_k\) times, where \(m_k\) is the number of sequences in class k). If the design d is symmetric, then all \(M_{dij}\) are completely symmetric, \(1 \le i \le j \le 2\). We see from the definition of symmetrized designs that for every combination of \(\hbox {tr}(M_{d11})\) and \(\hbox {tr}(M_{d12})\) there is a symmetric design with the same traces. As these designs achieve the lower bound in Theorem 1, we can restrict attention to symmetric designs.

3 Optimal Designs

For any sequence s, we observe that

$$\begin{aligned} q_{11}(s)=p-\frac{1}{p}\sum _{m=1}^t f_{s,m}^2\quad \text {and}\quad q_{12}(s)=B_s- \frac{1}{p}\sum _{m=1}^t f_{s,m}^2, \end{aligned}$$

where \(f_{s,m}\) is the frequency of treatment m in sequence s and \(B_s\) the number of periods where the treatment of the preceding period is repeated. These formulas show that the \(q_{ij}(s)\) depend on the sequence only through the treatment frequencies and the number of periods where the preceding treatment is repeated. The position of the treatments within the sequence does not matter. Hence, we will choose the representative sequences such that the periods with repeated treatments, if there are, come at the end.

As a first step, let \(p>t\). In this case, we can write \(p=gt+s\) with g and s being integers and \(1\le s\le t\). In this situation, there are two classes of particular interest, namely A with representative sequence \(s_1\) and B with representative sequence \(s_2\), where

$$\begin{aligned} s_1&= [1,\ldots ,1,\,2,\ldots ,2,\,\,\ldots ,\,\,t-1,\ldots ,t-1,\,t,\ldots ,t]\\ s_2&= [1,2,\ldots ,t,\,1,2,\ldots ,t,\ldots ,1,2,\ldots ,t,\,1,2,\ldots ,r]. \end{aligned}$$

In these sequences, each treatment appears either g or \(g+1\) times. Note that it makes no difference for \(q_{11}(s_1)\) and \(q_{12}(s_1)\) whether the treatments appearing \(g+1\) times come early or late in \(s_1\).

Table 1 Size of \(q_{11}(s)\) and \(q_{12}(s)\) for two classes of sequences in the case \(p>t\)

The values of \(q_{11}(s)\) and \(q_{12}(s)\) for the two classes A and B are displayed in Table 1. It can be seen that \(\sum _{m=1}^t f_{s,m}^2\) is minimal, if the \(f_{s,m}, 1\le m \le t,\) are as nearly equal as possible. Consequently, \(s_1\) and \(s_2\) maximize \(q_{11}(s)\). With that observation, the MSE-optimal design can be determined as follows.

Theorem 2

Assume \(p>t\) and consider a symmetric approximate design \(d^*\in \Delta _{t,n,p}\) consisting of sequences from class A with proportion

$$\begin{aligned} \pi _1 = \frac{p^2-s^2+ts}{pt(p - t)} \end{aligned}$$

and of sequences from class B with proportion \(\pi _2 = 1- \pi _1\). We then get

$$\begin{aligned} \forall d \in \Delta _{t,n,p}: \hbox {MSE}(d)\ge \hbox {MSE}(d^*). \end{aligned}$$

Proof

In each sequence used by \(d^*\), each treatment in each unit appears either g or \(g+1\) times. Therefore, we have that

$$\begin{aligned} q_{d^*11} = \max _d\{q_{d11}\} \end{aligned}$$

and

$$\begin{aligned} \hbox {MSE}(d)\ge \frac{(t-1)^2}{n\,q_{d11}}+\delta \left( \frac{q_{d12}}{q_{d11}}\right) ^2 \ge \frac{(t-1)^2}{n\,q_{d11}} \ge \frac{(t-1)^2}{n\,q_{d^*11}} . \end{aligned}$$

Observe that the choice of \(\pi _1\) implies that \(q_{d^*12} = 0\). Since \(d^*\) is symmetric, we get

$$\begin{aligned} \hbox {MSE}(d^*)= \frac{(t-1)^2}{n\,q_{d^*11}}, \end{aligned}$$

which completes the proof. \(\square \)

If \(p\le t\), the determination of an MSE-optimal design is more complicated. We examine the special cases \(p=2\le t\) and \(p=3\le t\) before we find general optimal designs for \(4\le p \le t.\)

Table 2 Size of \(q_{11}(s)\) and \(q_{12}(s)\) for the two classes of possible sequences when \(2=p\le t\)

Assume \(p=2\). Note that there are only two possible classes of sequences A and B, displayed in Table 2.

Theorem 3

Assume \(2=p\le t\) and consider a symmetric approximate design \(d^*\in \Delta _{t,n,2}\) consisting only of sequences from class B with representative sequence [1, 2]. Then,

$$\begin{aligned} \forall d \in \Delta _{t,n,2}: \hbox {MSE}(d)\ge \hbox {MSE}(d^*). \end{aligned}$$

Proof

Assume the design has proportion \(\pi _1\) of its sequences from class A and \(\pi _2=1-\pi _1\) from class B. We get from Table 2 that \(q_{d11}=\sum \pi _d(s)q_{11}(s) =\pi _2\) and \(q_{d12}=-\pi _2\). Hence,

$$\begin{aligned} \hbox {MSE}(d)\ge \frac{(t-1)^2}{n\pi _2}+\delta \left( \frac{-\pi _2}{\pi _2}\right) ^2=\frac{(t-1)^2}{n\pi _2}+\delta \ge \frac{(t-1)^2}{n}+\delta =\hbox {MSE}(d^*). \end{aligned}$$

This completes the proof. \(\square \)

Table 3 Size of \(q_{11}(s)\) and \(q_{12}(s)\) for the five classes of possible sequences when \(3=p\le t\)

For the case \(p=3\), there are only the five possible classes of sequences listed in Table 3. In this case, the optimal design consists of sequences [1, 2, 3] only.

Theorem 4

Assume \(3=p\le t\) and consider a symmetric approximate design \(d^*\in \Delta _{t,n,3}\) that consists only of sequences from class C with representative sequence [1, 2, 3]. Then,

$$\begin{aligned} \forall d \in \Delta _{t,n,3}: \hbox {MSE}(d)\ge \hbox {MSE}(d^*). \end{aligned}$$

Proof

In the circular model, sequences s from any of the three classes \(B_1\), \(B_2\) and \(B_3\) all have the same \(q_{11}(s)\) and \(q_{12}(s)\), respectively. Assume the design d has proportion \(\pi _2\) of its sequences from one of the classes \(B_j\) and \(\pi _3\) from class C. Then, the proportion of sequences from class A is \(\pi _1=1-\pi _2-\pi _3.\) We get from Table 3 that \(q_{d11}=\sum \pi _d(s)q_{11}(s) =2(\frac{2}{3}\pi _2+\pi _3)\) and \(q_{d12}=-(\frac{2}{3}\pi _2+\pi _3)\). Hence,

$$\begin{aligned} \hbox {MSE}(d)\ge & {} \frac{(t-1)^2}{n\left( \frac{2}{3}\pi _2+\pi _3\right) }+\delta \left( \frac{-\left( \frac{2}{3}\pi _2+\pi _3\right) }{2\left( \frac{2}{3}\pi _2+\pi _3\right) }\right) ^2\\= & {} \frac{(t-1)^2}{2n\left( \frac{2}{3}\pi _2+\pi _3\right) }+\frac{\delta }{4}\ge \frac{(t-1)^2}{2n}+\frac{\delta }{4}=\hbox {MSE}(d^*) \end{aligned}$$

and \(d^*\) is MSE-optimal. \(\square \)

Now consider the situation when \(4 \le p \le t\). Remember that for a sequence s, the number \(B_s\) denotes the number of periods in s where the treatment of the preceding period is repeated. As in [12], we can show that

$$\begin{aligned} \sum _{m=1}^t f_{s,m}^2\ge p+2B_s, \end{aligned}$$

also in the circular case. This inequality is used in the next proposition.

Proposition 1

Define B(d) of a design \(d \in \Delta _{t,n,p}\) as the weighted mean of the \(B_s\) of its sequences, that is \(B(d)=\sum _{s}B_s\pi _d(s)\). Then, for any \( d \in \Delta _{t,n,p} \) we have

$$\begin{aligned} q_{d11}&\le p-1-\frac{2}{p}B(d)\\ q_{d12}&\le - 1 + \frac{p-2}{p}B(d). \end{aligned}$$

Proof

From the inequality \(\sum _{m=1}^t f_{s,m}^2\ge p+2B_s\), we get for any sequence s that

$$\begin{aligned} q_{11}(s)&\le p-1-\frac{2B_s}{p}\\ q_{12}(s)&\le B_s - 1 - \frac{2B_s}{p}=-1 + \frac{p-2}{p} B_s. \end{aligned}$$

Since \(q_{dij}=\sum \pi _d(s) q_{ij}(s)\), the desired inequalities follow. \(\square \)

Table 4 Size of \(q_{11}(s)\) and \(q_{12}(s)\) for two classes of sequences of interest when \(4\le p\le t\)

For \(4 \le p \le t\), there are two classes of sequences of interest. Class A has representative sequence \([1,2,\ldots , p-4,p-3,p-3, p-2, p-2]\), i.e. two treatments are given twice in adjacent periods such that \(B_s=2\). If \(p>4\), there are \(p-4\) other treatments given exactly once. The second class, B, has representative sequence \([1,\ldots , p]\), i.e. no treatment is given twice and \(B_s=0\). Class B has been shown to produce (universally) optimal designs for various cases [2, 5, 8], since balanced designs use sequences from this class. Class A was shown by Zheng et al. [13] to be important for non-trivial covariance matrices.

The respective values for \(q_{11}(s)\) and \(q_{12}(s)\) can be seen in Table 4. We now show that a design minimizing the MSE can be found in the set of all designs consisting of sequences from classes A and B only.

Proposition 2

Assume \(4\le p \le t\). For any design \(d \in \Delta _{t,n,p}\) define

$$\begin{aligned} {\bar{\pi }}=\frac{1}{2} \min \{B(d), \frac{p}{p-2}\} \end{aligned}$$

and

$$\begin{aligned} M({\bar{\pi }})=\frac{(t-1)^2}{n(p-1-\frac{4{\bar{\pi }}}{p})} + \delta \left( \frac{ - 1 + 2{\bar{\pi }}\frac{p-2}{p}}{p-1-\frac{4{\bar{\pi }}}{p}}\right) ^2. \end{aligned}$$

We construct a symmetric design \({\bar{d}}_{t,n,p} \in \Delta _{t,n,p}\) which consists of a proportion \({\bar{\pi }}\) of sequences from class A and a proportion \(1-{\bar{\pi }}\) of sequences from class B. Then,

$$\begin{aligned} \hbox {MSE}(d) \ge M({\bar{\pi }}) = \hbox {MSE}({\bar{d}}). \end{aligned}$$

Proof

For the design \({\bar{d}}\), we conclude from the \(q_{ij}(s)\) in Table 4, that \(q_{{\bar{d}}11} = p-1- \frac{4}{p}{\bar{\pi }}\) and \(q_{{\bar{d}}12} = -(1-{\bar{\pi }})+{\bar{\pi }}\frac{p-4}{p}=-1+2{\bar{\pi }} \frac{p-2}{p}\). Making use of the fact that \({\bar{d}}\) is symmetric, we then have

$$\begin{aligned} \hbox {MSE}({\bar{d}})=\frac{(t-1)^2}{n q_{{\bar{d}}11}} + \delta \left( \frac{q_{{\bar{d}}12}}{q_{{\bar{d}}11}}\right) ^2 = M({\bar{\pi }}). \end{aligned}$$

Now distinguish between two cases.

Case 1: \(B(d) \ge \frac{p}{p-2}\).

In this case, \({\bar{\pi }}=\frac{p}{2(p-2)}\) and therefore \(q_{{\bar{d}}11} = p-1- \frac{2}{p}B(d)\) while \(q_{{\bar{d}}12} = 0.\) It follows from Proposition 1 that \(q_{d11}\le p-1 -\frac{2}{p}B(d) \le q_{{\bar{d}}11}\). Hence,

$$\begin{aligned} M({\bar{\pi }})=\hbox {MSE}({\bar{d}})=\frac{(t-1)^2}{n q_{{\bar{d}}11}} + \delta \left( \frac{q_{{\bar{d}}12}}{q_{{\bar{d}}11}}\right) ^2 = \frac{(t-1)^2}{n q_{{\bar{d}}11}} \le \frac{(t-1)^2}{n q_{d11}} \le \hbox {MSE}(d). \end{aligned}$$

Case 2: Assume \(0\le B(d) < \frac{p}{p-2}.\)

In that case, it follows from Proposition 1 that \(q_{d12}\le -1 + \frac{p-2}{p}B(d) <0.\) Therefore, a lower bound for \(\hbox {MSE}(d)\) is achieved by making both \(q_{d12}\) and \(q_{d11}\) as large as possible. We hence get from Proposition 1 that

$$\begin{aligned} \hbox {MSE}(d) \ge \frac{(t-1)^2}{n(p-1-\frac{2B(d)}{p})} + \delta \left( \frac{ - 1 + \frac{p-2}{p}B(d)}{p-1-\frac{2B(d)}{p}}\right) ^2 . \end{aligned}$$

Since \({\bar{\pi }}=B(d)/2\), the right hand side of this inequality equals \(M({\bar{\pi }})\). \(\square \)

It follows from Proposition 2 that the MSE-optimal design for a given \(\delta \) can be found among the set of all designs \({\bar{d}}\) defined in Proposition 2. That is, we have to find the best \({\bar{\pi }}\).

Theorem 5

Assume \(4\le p \le t\) and \(\delta \ge 0\) is given.

  1. 1.

    If \(\delta < \frac{(p-1)(t-1)^2}{np(p-3)}\) choose \(d^*\) as the symmetric design that only consists of sequences from class B, i.e. a design \({\bar{d}}\) with \({\bar{\pi }}=0.\) Then,

    $$\begin{aligned} \hbox {MSE}(d^*) = M(0)=\min _{d \in \Delta _{t,n,p}}\hbox {MSE}(d) \end{aligned}$$
  2. 2.

    If \(\delta \ge \frac{(p-1)(t-1)^2}{np(p-3)}\), let \(d^*\) be a symmetric design that consists of sequences from class A with proportion

    $$\begin{aligned} \pi (\delta )=\frac{p(p-1)}{4} - \frac{\delta n p^3(p-3)^2}{4\big ( \delta n p (p-2)(p-3) - 2(t-1)^2 \big ) } \end{aligned}$$

    and of sequences from class B with proportion \(1-\pi (\delta )\). Then,

    $$\begin{aligned} \hbox {MSE}(d^*) = M(\pi (\delta )) = \min _{d \in \Delta _{t,n,p}}\hbox {MSE}(d). \end{aligned}$$

Proof

It follows from Proposition 2 that for any d there is a \({\pi } \in [0,\frac{p}{2(p-2)}]\) such that \(\hbox {MSE}(d) \ge M({\pi })\).

We therefore have to find a \(\pi ^* \in [0, \frac{p}{2(p-2)}]\) which minimizes \(M({\pi })\).

Case 1: \(\delta =\frac{2(t-1)^2}{np(p-2)(p-3)}.\) In this case the derivative of \(M(\pi )\) with respect to \(\pi \) is

$$\begin{aligned} D(\pi )=\frac{4(t-1)^2(p-3)}{n(p-2)(p-1-\frac{4\pi }{p})^3} \end{aligned}$$

Then, \(D(\pi )>0\) for all \(\pi \in [0,\frac{p}{2(p-1)}]\) and \(M({\pi })\) is minimal for \(\pi =0\).

For all other \(\delta \), the derivative of \(M({\pi })\) with respect to \({\pi }\) is

$$\begin{aligned} D({\pi })= \frac{ 8 \big ( \delta np(p-2)(p-3)-2(t-1)^2\big ) }{np^2(p-1-\frac{4{\pi }}{p})^3} \left( {\pi }-\pi _0\right) , \end{aligned}$$

where \(\pi _0=\frac{p(p-1)}{4}-\frac{\delta np^3(p-3)^2}{4\big ( \delta np(p-2)(p-3)-2(t-1)^2 \big ) }.\)

Hence, \(D(\pi )=0\) if and only if \(\pi =\pi _0\).

Case 2: \(0\le \delta <\frac{2(t-1)^2}{np(p-2)(p-3)}.\)

In this case, \( \delta np(p-2)(p-3)-2(t-1)^2 < 0\) and, therefore, \(\pi _0>\frac{p(p-1)}{4}>\frac{p}{2(p-1)}\). Further, it follows for all \(\pi < \pi _0\) that \(D(\pi )>0\). Hence, the minimum of \(M(\pi )\) is attained for \(\pi =0.\)

Case 3: \(\frac{2(t-1)^2}{np(p-2)(p-3)}<\delta \le \frac{(p-1)(t-1)^2}{np(p-3)}.\)

In this case, \( \delta np(p-2)(p-3)-2(t-1)^2 > 0\) and, therefore, \(D(\pi )>0\) for all \(\pi > \pi _0\). Also in this case, \(\pi _0 \le 0\). Hence, again, the minimum of \(MSE(\pi )\) is attained for \(\pi =0\).

Combining cases 1, 2 and 3, we have shown that for all \(\delta \le \frac{(p-1)(t-1)^2}{np(p-3)}\) the choice \(\pi ^*=0\) is optimum.

Case 4: \(\delta \ge \frac{(p-1)(t-1)^2}{np(p-3)}\). In this case, \( \delta np(p-2)(p-3)-2(t-1)^2 > 0\) and, therefore, \(D(\pi )>0\) for all \(\pi > \pi _0\) and \(D(\pi )<0\) for all \(\pi < \pi _0\). Therefore, the MSE is minimized by that \({\bar{d}}\) for which \({\bar{\pi }}=\pi _0 = \pi (\delta )\). \(\square \)

Note that \(\pi (\delta )\) increases with \(\delta \) and approaches \(\frac{p}{2(p-2)}\) for \(\delta \rightarrow \infty \).

4 Optimal Designs for the Model with Period Effects

We extend the model and include period effects, i.e. the model becomes

$$\begin{aligned} y =U\alpha + P\beta + T_d \tau + F_d \rho + \varepsilon , \end{aligned}$$

with \(\beta \) the vector of the period effects and P the corresponding design matrix. In this model, we have the information matrices

$$\begin{aligned} {\tilde{M}}_{d11}&= T_d^T\omega ^\perp \left( [U,\, P]\right) T_d,\\ {\tilde{M}}_{d12}&= F_d^T\omega ^\perp \left( [U,\, P]\right) T_d,\\ {\tilde{M}}_{d22}&= F_d^T\omega ^\perp \left( [U,\, P]\right) F_d. \end{aligned}$$

The MSE of a design then can be defined in the same manner as in the model without period effects, only replacing \(M_{dij}\) by \({\tilde{M}}_{dij}\), \(1\le i \le j \le 2.\) We get

$$\begin{aligned} {\widetilde{\hbox {MSE}}}(d) = \sigma ^2 \hbox {tr}({\tilde{M}}_{d11})^+ + \delta \lambda _1 \left( {\tilde{M}}_{d12}^T{\tilde{M}}_{d11}^+{\tilde{M}}_{d11}^+ {\tilde{M}}_{d12}\right) . \end{aligned}$$

Defining \({\tilde{q}}_{d11} = \frac{1}{n} \hbox {tr}\left( {\tilde{M}}_{d11}\right) \) and \({\tilde{q}}_{d12} = \frac{1}{n} \hbox {tr}\left( {\tilde{M}}_{d12}\right) \), we can show with the same arguments as before that

$$\begin{aligned} {\widetilde{\hbox {MSE}}}(d) \ge \frac{(t-1)^2}{n {\tilde{q}}_{d11}} + \delta \left( \frac{{\tilde{q}}_{d12}}{{\tilde{q}}_{d11}}\right) ^2, \end{aligned}$$

with equality if \({\tilde{M}}_{d11}\) and \({\tilde{M}}_{d12}\) are completely symmetric.

Unfortunately, the calculations of \(\hbox {tr}\left( {\tilde{M}}_{d11}\right) \) and \(\hbox {tr}\left( {\tilde{M}}_{d12}\right) \) are different from the calculations before. In particular, it is not possible to express the \({\tilde{q}}_{dij}\) as weighted means of contributions from the single sequences. Adapting the results of [4] to circular designs, we get

$$\begin{aligned} \hbox {tr}({\tilde{M}}_{d11})&= n\, {\tilde{q}}_{d11} = n\, q_{d11} - \frac{1}{n} \sum _{i=1}^t\sum _{k=1}^p l_{di,k}^2 + \frac{1}{np} \sum _{i=1}^t r_{di}^2 \quad {\mathrm {and}}\\ \hbox {tr}({\tilde{M}}_{d12})&= n\, {\tilde{q}}_{d12} = n\, q_{d12} - \frac{1}{n} \sum _{i=1}^t\sum _{k=1}^p l_{di,k}{l}_{di,k-1} + \frac{1}{np} \sum _{i=1}^t r_{di}^2, \end{aligned}$$

where \(l_{di,k}\) is the number of appearances of treatment i in period k (with \(l_{di0}=l_{\mathrm{dip}}\)) and \(r_{di}\) the number of appearances of treatment i in the design. The numbers \(q_{d11}\) and \(q_{d12}\) are as in Sect. 3.

For symmetric designs, we get \(\hbox {tr}({\tilde{M}}_{d11}) = n\, q_{d11}\) and \(\hbox {tr}({\tilde{M}}_{d12}) = n\, q_{d12}\), i.e. \({\tilde{q}}_{dij}=q_{dij}\). In the case \(p>t\), the best symmetric designs have \(q_{d12}=0\) and the results of Theorem 2 extend to the model with period effects.

There are examples—at least in non-circular designs—of designs that are non-symmetric and where the uncorrected estimate from the model with period effects has a smaller bias than the estimate from the model without period effects. In the case \(p\le t\), the best symmetric designs have nonzero bias. Hence, it is not obvious that the results of Theorems 3 and 4 and of Proposition 2 still hold in the model with period effects.

To show that they do, we start by showing that for non-symmetric designs the loss in \(\hbox {tr}({\tilde{M}}_{d11})\) is higher than the possible gain in \(|\hbox {tr}({\tilde{M}}_{d12})|.\)

Proposition 3

For any \(d\in \Omega _{t,n,p},\) it holds that

$$\begin{aligned} q_{d11}-{\tilde{q}}_{d11} \ge |q_{d12}-{\tilde{q}}_{d12}|. \end{aligned}$$

Proof

Defining \(Q=\frac{1}{n}PP^T-\frac{1}{np}1_{np}1_{np}^T = \omega \left( \omega ^\perp \left( U\right) P\right) \), we find that

$$\begin{aligned} nq_{d11}-n{\tilde{q}}_{d11} = \frac{1}{n} \sum _{i=1}^t\sum _{k=1}^p l_{di,k}^2 - \frac{1}{np} \sum _{i=1}^t r_{di}^2 = tr\left( T_d^TQ T_d\right) \end{aligned}$$

and that

$$\begin{aligned} nq_{d12}-n{\tilde{q}}_{d12} = \frac{1}{n} \sum _{i=1}^t\sum _{k=1}^p l_{di,k }l_{di,k-1} - \frac{1}{np} \sum _{i=1}^t r_{di}^2 = tr(T_d^TQ F_d). \end{aligned}$$

Because of the circular structure of the carryover effects, we have

$$\begin{aligned} T_d^T\omega ^\perp \left( [U,\, P]\right) T_d = F_d^T\omega ^\perp \left( [U,\, P]\right) F_d \end{aligned}$$

(cf. [8] or [6]) and, hence, that \(T_d^TQ T_d = F_d^TQ F_d\). Therefore, the Cauchy–Schwarz inequality implies that

$$\begin{aligned} |\hbox {tr}\left( T_d^TQ F_d\right) | \le \sqrt{\hbox {tr}\left( T_d^TQ T_d\right) \hbox {tr}\left( F_d^TQF_d\right) }=\hbox {tr}\left( T_d^TQT_d\right) \end{aligned}$$

and the proposition follows. \(\square \)

If \(p\ge 4\), we are now able to show that for every design d there is a symmetric design \(d_s\) consisting of sequences from classes A and B described in Table 4 only and having smaller or equal MSE.

Proposition 4

Let \(p\ge 4\). Assume the model with period effects holds and define \(\Gamma _{t,n,p} \subset \Delta _{t,n,p}\) as the set of all symmetric \(d \in \Delta _{t,n,p}\).

Then, \(\forall d \in \Delta _{t,n,p}\, \exists \, d_s \in \Gamma _{t,n,p}\) with

$$\begin{aligned} {\widetilde{\hbox {MSE}}}(d) \ge {\widetilde{\hbox {MSE}}}(d_s). \end{aligned}$$

Proof

Case 1: \(|{\tilde{q}}_{d12}|>1\).

Let \(d_s\) be a symmetric design that consists of sequences from class B only. Then, \(d_s\in \Gamma _{t,n,p}\) and \(|q_{d_s12}| = 1 < |{\tilde{q}}_{d12}|\). On the other hand (because of Proposition 1 and since \(B(d_s)=0\)), we have that

$$\begin{aligned} q_{d_s11} \ge q_{d11} \ge {\tilde{q}}_{d11}. \end{aligned}$$

Thus, it holds that

$$\begin{aligned} {\widetilde{\hbox {MSE}}}(d) \ge \frac{(t-1)^2}{n q_{d11}} + \delta \left( \frac{{\tilde{q}}_{d12}}{q_{d11}}\right) ^2 \ge \frac{(t-1)^2}{n q_{d_s11}} + \delta \left( \frac{1}{q_{d_s11}}\right) ^2 = {\widetilde{\hbox {MSE}}}(d_s). \end{aligned}$$

Case 2: \(|{\tilde{q}}_{d12}|\le 1\).

Let \(d_s\) be the symmetric design that consists of sequences from class A with proportion

$$\begin{aligned} \pi = \frac{p(1-|{\tilde{q}}_{d12}|)}{2(p-2)} \end{aligned}$$

and sequences from class B with proportion \(1-\pi \). Then, \(d_s\in \Gamma _{t,n,p}\) and we get from Table 4 that

$$\begin{aligned} q_{d_s12}=-(1-\pi ) + \pi \frac{p-4}{p}=-1+2\pi \frac{p-2}{p}= -|{\tilde{q}}_{d12}| \end{aligned}$$

(and hence \(|q_{d_s12}|=|{\tilde{q}}_{d12}|\)).

Therefore, if we can show that \({\tilde{q}}_{d_s11}\ge {\tilde{q}}_{d11},\) we are done.

Remembering that \(d_s\) is a symmetric design we have \({\tilde{q}}_{d_s11}= q_{d_s11}\). Making use of Table 4 again, we get that \(B(d_s)=2\pi \) and

$$\begin{aligned} {\tilde{q}}_{d_s11} = \left( 1-\frac{B(d_s)}{2}\right) (p-1) + \frac{B(d_s)}{2}\left( p-1-\frac{4}{p}\right) = p-1- \frac{2B(d_s)}{p}. \end{aligned}$$

Subcase 2a: \(B(d)\ge B(d_s).\)

From Proposition 1, we know that \(q_{d11} \le p-1- \frac{2B(d)}{p}\). Hence,

$$\begin{aligned} {\tilde{q}}_{d11}\le q_{d11} \le p-1- \frac{2B(d)}{p} \le p-1- \frac{2 B(d_s)}{p} = {\tilde{q}}_{d_s11}. \end{aligned}$$

Subcase 2b: \(B(d)< B(d_s).\)

Observe that

$$\begin{aligned} {\tilde{q}}_{d12}-q_{d12}\ge & {} -|{\tilde{q}}_{d12}|-q_{d12} \ge q_{d_s12} + 1 - \frac{p-2}{p}B(d)\\= & {} -1+\frac{p-2}{p}B(d_s)+ 1 - \frac{p-2}{p}B(d)=\frac{p-2}{p}(B(d_s)-B(d))\ge 0. \end{aligned}$$

Hence, we conclude from Proposition 3 that

$$\begin{aligned} q_{d11}-{\tilde{q}}_{d11}\ge {\tilde{q}}_{d12}-q_{d12} \ge \frac{p-2}{p}(B(d_s)-B(d)). \end{aligned}$$

Therefore,

$$\begin{aligned} {\tilde{q}}_{d11}\le q_{d11}-\frac{p-2}{p}(B(d_s)-B(d)). \end{aligned}$$

With Proposition 1, it follows that

$$\begin{aligned} {\tilde{q}}_{d11} \le p-1- \frac{2B(d)}{p}-\frac{p-2}{p}(B(d_s)-B(d)). \end{aligned}$$

Since \(p-2 \ge 2\), we get

$$\begin{aligned} {\tilde{q}}_{d11} \le p-1- \frac{2B(d_s)}{p}, \end{aligned}$$

which is \({\tilde{q}}_{d_s11}\). This completes the proof. \(\square \)

Note that this proposition implies that the design \(d^*\) determined in Theorem 5 for the model without period effects, is also MSE-optimal in the model with period effects.

The proofs for the cases \(p=2\) and \(p=3\) have to be done differently. We start with \(p=2\).

Theorem 6

Assume \(2=p\le t\). As in Theorem 3, consider the symmetric approximate design \(d^*\in \Delta _{t,n,2}\) consisting only of sequences from class B of Table 2 with representative sequence [1, 2]. We then have \(\forall d \in \Omega _{t,n,2}\) that

$$\begin{aligned} {\widetilde{\hbox {MSE}}}(d) \ge {\widetilde{\hbox {MSE}}}\left( d^*\right) . \end{aligned}$$

Proof

We have seen in the proof of Theorem 3 for an arbitrary design d that

$$\begin{aligned} q_{d11}=\pi _2 \quad \hbox {and} \quad q_{d12}=-\pi _2, \end{aligned}$$

where \(\pi _2\) is the proportion of sequences from class B.

In the model with period effects, we write \({\tilde{q}}_{d11}=q_{d11}-\epsilon = \pi _2-\epsilon \). Since \(q_{d11}\ge {\tilde{q}}_{d11}\ge 0\), we have \(\pi _2\ge \epsilon \ge 0\).

From Proposition 3, we get that \(|q_{d12}-{\tilde{q}}_{d12}|\le \epsilon \). Therefore, \({\tilde{q}}_{d12} \le q_{d12} +\epsilon = -\pi _2+\epsilon \le 0\) and \(|{\tilde{q}}_{d12}|\ge \pi _2 - \epsilon \).

It follows that

$$\begin{aligned} {\widetilde{\hbox {MSE}}}(d)\ge \frac{(t-1)^2}{n\pi _2-\epsilon } + \delta \left( \frac{\pi _2-\epsilon }{\pi _2-\epsilon }\right) ^2 \ge \frac{(t-1)^2}{n}+\delta ={\widetilde{\hbox {MSE}}}(d_B). \end{aligned}$$

This completes the proof. \(\square \)

The proof for the case \(p=3\) turns out to cause extra difficulties.

Theorem 7

Assume \(3=p\le t\). As in Theorem 4, consider a symmetric approximate design \(d^*\in \Delta _{t,n,3}\) that consists only of sequences from class C with representative sequence [1, 2, 3]. Then,

$$\begin{aligned} \forall d \in \Delta _{t,n,3}: {\widetilde{\hbox {MSE}}}(d)\ge {\widetilde{\hbox {MSE}}}(d^*). \end{aligned}$$

Proof

We have seen in the proof of Theorem 4 for an arbitrary design \(d\in \Delta _{t,n,3}\) that

$$\begin{aligned} q_{d11}=2\left( \pi _3+\frac{2}{3}\pi _2\right) \quad \hbox {and} \quad q_{d12}=-\left( \pi _3-\frac{2}{3}\pi _2\right) . \end{aligned}$$

If we define \(\epsilon =q_{d11}-{\tilde{q}}_{d11}\) and \(\rho ={\tilde{q}}_{d12}-{q}_{d12}\), we get that

$$\begin{aligned} {\widetilde{\hbox {MSE}}}(d)\ge \frac{(t-1)^2}{n\left( 2\left( \pi _3+\frac{2}{3}\pi _2\right) -\epsilon \right) } + \delta \left( \frac{\pi _3+\frac{2}{3}\pi _2-\rho }{2\left( \pi _3+\frac{2}{3}\pi _2\right) -\epsilon }\right) ^2. \end{aligned}$$

At this point, Proposition 3 is of no use. It only guarantees that \(\epsilon \ge |\rho |\), but if we could find a design with \(\rho =\epsilon >0\), this would produce a smaller bias. If we could get \(\rho =\epsilon =\pi _3+\frac{2}{3}\pi _2\), we could even achieve

$$\begin{aligned} \left( \frac{\pi _3+\frac{2}{3}\pi _2-\rho }{2\left( \pi _3+\frac{2}{3}\pi _2\right) -\epsilon }\right) ^2=0. \end{aligned}$$

However, this is not possible. In the case \(p=3\), we can get a stronger version of Proposition 3.

In the proof of Proposition 3, we have used the equalities

$$\begin{aligned} nq_{d11}-n{\tilde{q}}_{d11} = \frac{1}{n} \sum _{i=1}^t\sum _{k=1}^p l_{dik}^2 - \frac{1}{np} \sum _{i=1}^t r_{di}^2 \end{aligned}$$

and

$$\begin{aligned} nq_{d12}-n{\tilde{q}}_{d12} = \frac{1}{n} \sum _{i=1}^t\sum _{k=1}^p l_{di,k-1}l_{dik} - \frac{1}{np} \sum _{i=1}^t r_{di}^2. \end{aligned}$$

Hence,

$$\begin{aligned} \epsilon= & {} \frac{1}{n^2}\sum _{i=1}^t\left( \sum _{k=1}^3 l_{di,k}^2 - \frac{1}{3} \left( \sum _{k=1}^3 l_{di,k}\right) ^2\right) \\= & {} \frac{1}{n^2}\sum _{i=1}^t \frac{2}{3}\left( l_{di,1}^2+l_{di,2}^2+l_{di,3}^2 - l_{di,1}l_{di,3}-l_{di,2}l_{di,1}-l_{di,3}l_{di,2}\right) \end{aligned}$$

and

$$\begin{aligned} \rho= & {} -\frac{1}{n^2}\sum _{i=1}^t\left( \sum _{k=1}^3 l_{di,k}l_{di,k-1} - \frac{1}{3} \left( \sum _{k=1}^3 l_{di,k}\right) ^2\right) \\= & {} \frac{1}{n^2}\sum _{i=1}^t \frac{1}{3}\left( l_{di,1}^2+l_{di,2}^2+l_{di,3}^2 - l_{di,1}l_{di,3}-l_{di,2}l_{di,1}-l_{di,3}l_{di,2}\right) . \end{aligned}$$

Therefore, \(\rho =\frac{1}{2}\epsilon \), regardless of our choice of the \(l_{di,k}\). It follows that

$$\begin{aligned} {\widetilde{\hbox {MSE}}}(d)\ge \frac{(t-1)^2}{n(2(\pi _3+\frac{2}{3}\pi _2)-\epsilon )} + \frac{\delta }{4}\ge \frac{(t-1)^2}{2n} + \frac{\delta }{4}={\widetilde{\hbox {MSE}}}(d^*). \end{aligned}$$

\(\square \)

5 Efficiency in Terms of MSE and Example

We have seen in Sects. 3 and 4 that MSE-optimality, for \(4 \le p\le t\), is a local criterion, the MSE-optimum design depends on the true \(\delta \). Since in practice, we will in general not know \(\delta \), we try to find a design which is efficient for a broad range of \(\delta \). The efficiency of a design d at a point \(\delta \) is defined as

$$\begin{aligned} \hbox {Eff}_\delta (d) = \hbox {MSE}_{\delta , \mathrm{opt}}/\hbox {MSE}_\delta (d), \end{aligned}$$

where \(\hbox {MSE}_{\delta , \mathrm{opt}}\) is the minimum MSE for this \(\delta \). Thus, a high efficiency is desirable and the efficiency is a number between 0 and 1.

In what follows, we consider the case \(4\le p \le t\). In the case of circular designs, often designs are considered that only consist of sequences from class B. Special cases of these designs are so-called circular balanced designs or orthogonal arrays of type I. These have been shown to be optimal in a broad class of designs, see, e.g. [2]. However, Zheng et al. [13] showed that designs with sequences from class A can be efficient in some situations. Thus, we will consider the designs \(d_A\) and \(d_B\) that only consist of sequences from classes A or B, respectively, and calculate their efficiency.

Fig. 1
figure 1

Proportion \(1-\pi (\delta )\) of sequences from class B as a function of \(\delta \)

Fig. 2
figure 2

MSE-efficiencies for designs \(d_{B}\) (solid line) and \(d_{A}\) (dashed line) for parameters \(p=6,t=4,n=40\) and \(\delta \in [0,3]\)

First, we examine the case that \(\delta < \frac{(p-1)(t-1)^2}{np(p-3)}.\) In this case, \(d_B\) is MSE-optimal and thus

$$\begin{aligned} \hbox {Eff}_{\delta }(d_B) = 1. \end{aligned}$$

For \(d_A\), we get in this case

$$\begin{aligned} \hbox {Eff}_{\delta }(d_A) = \frac{-(\delta n + (p - 1)(t - 1)^2)(p^2-p-4)^2}{(p - 1)^2(4p - tp(t-2)(p^2-p - 4)+p^2(p - 1) + \delta n(p - 4)^2)}. \end{aligned}$$

If \(\delta \ge \frac{(p-1)(t-1)^2}{np(p-3)}\), we get

$$\begin{aligned} \hbox {Eff}_{\delta }(d_B)= \frac{(\delta np(p^2-5p+6)-(t - 1)^2)(t - 1)^2(p - 1)^2}{\delta np^2(p - 3)^2(\delta n+ (p-1)(t - 1)^2)} \end{aligned}$$

and

$$\begin{aligned} \hbox {Eff}_{\delta }(d_A)=\frac{(\delta np(p^2-5p+6) - (t - 1)^2)(t-1)^2 (p^2-p-4)^2}{\delta np^2(p-3)^2\left( p(t-1)^2(p^2 -p-4)+n\delta (p-4)^2\right) }. \end{aligned}$$

We see that the efficiency of \(d_B\) decreases with \(\delta \) while the efficiency of \(d_A\) increases with \(\delta \).

As an example, consider the case \(t=6, p=4\) and \(n=40\). In that case, the critical step for \(\pi (\delta )\) is at \(\delta _1 = \frac{4\cdot 25}{40\cdot 4 \cdot 1} = \frac{15}{32} = 0.46875\). If \(\delta <\delta _1\), the design \(d_B\) is MSE-optimal and has MSE-efficiency 1.

If \(\delta \) increases, we see that the proportion \(1-\pi (\delta )\) of sequences from class B in the MSE-optimal design decreases (Fig. 1). This is in agreement with the decreasing efficiency of the design \(d_B\) and the increasing efficiency of the design that only consists of sequences from class A (Fig. 2). However, as long as \(\delta \) remains reasonably small, the efficiency of \(d_B\) is still high (and larger than the efficiency of \(d_A\)). Therefore, it makes sense to use the design \(d_B\).

Of particular interest for practice is the case when the number of treatments t equals the number of periods p. In this case, we can construct a circular balanced uniform design \(d_B\) with \(n=t(t-1)\) units.

For a design of this size, the limit \(\delta _1=\frac{(p-1)(t-1)^2}{np(p-3)}\) becomes 0.56 for \(p=4\), 0.32 for \(p=5\), 0.23 for \(p=6\) and goes down to 0.01 for \(p=100\). The efficiency \(\hbox {Eff}_{d_B}(\delta )\) is 1 as long as \(\delta \) is less than \(\delta _1\), and decreases when \(\delta \) increases. For \(\delta =3\), we observe that \(\hbox {Eff}_{d_B}(3)\) is 0.62 if \(p=4\), 0.61 if \(p=5\) and 0.64 if \(p=6\). For larger p, it increases, for instance, it is 0.97 for \(p=100\).