Abstract
In crossover designs, each subject receives a series of treatments, one after the other in p consecutive periods. There is concern that the measurement of a subject at a given period might be influenced not only by the direct effect of the current treatment but also by a carryover effect of the treatment applied in the preceding period. Sometimes, the periods of a crossover design are arranged in a circular structure. Before the first period of the experiment itself, there is a run-in period, in which each subject receives the treatment it will receive again in the last period. No measurements are taken during the run-in period. We consider the estimate for direct effects of treatments which is not corrected for carryover effects. If there are carryover effects, this uncorrected estimate will be biased. In that situation, the quality of the estimate can be measured by the mean square error, the sum of the squared bias and the variance. We determine MSE-optimal designs, that is, designs for which the mean square error is as small as possible. Since the optimal design will in general depend on the size of the carryover effects, we also determine the efficiency of some designs compared to the locally optimal design. It turns out that circular neighbour-balanced designs are highly efficient.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
In a crossover design, the effects of t treatments are compared by giving each of n subjects a series of treatments in p consecutive periods. The response of each unit in each period is measured and used to estimate the so-called direct effects of the treatments. Usually, for experiments of this kind, there is concern that a treatment given in period i, \(1 \le i \le p\), in addition to its direct effect will also have a carryover effect that influences the measurement in period \(i+1\). An important application of crossover designs is in sensory experiments, where each assessor evaluates a number of products, one after the other. Here, a carryover effect may for instance be caused by a lingering taste of a product.
There are two options to deal with carryover effects. One possibility is to explicitly include carryover effects in the model and use an estimate that corrects for carryover effects. There is a vast literature on designs for the corrected estimate, see, e.g. [3].
Another approach is to neglect the carryover effect and use an uncorrected estimate which does not correct for carryover effects. In sensory experiments, the analysis is usually done with the uncorrected estimate. In most cases, the experimenter then uses technical measures like, e.g. washout with a neutral taste to reduce carryover.
To deal with the, hopefully small, carryover effects which may be present in spite of washout, some authors recommend using neighbour-balanced designs as a precaution to minimize the effects of carryover, see, e.g. [11, p. 189]. Neumann and Kunert [12] have formally shown that the use of neighbour-balanced designs can minimize the mean square error of the uncorrected estimate.
In a circular crossover design, there is a run-in period where each unit receives a treatment but no measurement is made. During the run-in period of a circular design, each unit receives the treatment which it will receive again in the last period. Hence, there is a carryover effect in each period. Circular designs were introduced to the area of crossover designs by Magda [10]. There are some results on optimal designs for estimators taking account of the carryover effects, e.g. by Kunert [7], Druilhet [5] and Bailey and Cameron [2].
Azaïs [1] has shown that for circular designs and the uncorrected estimate, there is a randomization which preserves the neighbour structure but validates an analysis with the simple block model. This makes circular designs appealing for the uncorrected estimate.
The present paper extends the results of [12] to the class of circular designs. Since the mean square error depends on the size of the carryover effects, we determine locally optimal designs. If the number of periods is larger than the number of treatments, however, it turns out that there are designs minimizing the mean square error regardless of the size of the carryover effects. If the number of periods is less or equal to the number of treatments, the optimal design varies with the size of the carryover effects. As long as the carryover effects are small, no experimental unit in the optimal design receives the same treatment twice. When the carryover effects get larger, some units will receive the same treatment twice in consecutive periods. The proportion of units receiving the same treatment twice increases with increasing carryover effects. These results remain correct whether there is a period effect in the model or not.
2 Calculating the MSE
Define \(\Omega _{t,n,p}\) as the set of all circular crossover designs with t treatments, n units (subjects) and p periods. We assume that the j-th observation on unit i can be written as
Here, \(\alpha _i, 1\le i\le n\), is the effect of the i-th unit, \(\tau _{d(i,j)}\) is the effect of the treatment given to the i-th unit in the j-th period by the design \(d \in \Omega _{t,n,p}\), \(\rho _{d(i,j-1)}\) is the carryover effect of the treatment given to unit i in period \((j-1)\) with \(\rho _{d(i,0)} = \rho _{d(i,p)}\), and \(\varepsilon _{ij}\) is the error. The errors are independent, identically distributed with expectation 0 and variance \(\sigma ^2.\)
To continue, we need some notation. Denote by \(A^T\) the transpose of the matrix A. If \(I_t\) is the \(t\times t\)-identity matrix and \(1_t\) the t-dimensional vector of ones, define \(H_t=I_t-\frac{1}{t}1_t 1_t^T\). For a matrix \(A\in {\mathbb {R}}^{(n,r)}\) set \(\omega ^\perp (A)=I_n-A(A^TA)^+A^T\), where \(B^+\) denotes the Moore–Penrose generalized inverse of the matrix B.
In vector notation, our model then can be written as
Here, \(y=[y_{11},\ldots ,y_{1p},y_{21},\ldots ,y_{np}]^T\) and \(\varepsilon \) is the vector of the errors. Further, \(\alpha \), \(\tau \) and \(\rho \) are the vectors of the unit, direct and carryover effects, respectively. The matrices U, \(T_d\) and \(F_d\) are the corresponding design matrices.
We assume that the analysis of the data is done with a model neglecting the carryover effects, i.e.
If we define \(M_{d11}=T_d^T\omega ^\perp (U)T_d\), then \(M_{d11}\) is the information matrix for the estimation of direct effects in model (2). Note that \(M_{d11}\) and, therefore, \(M_{d11}^+\) have row- and column-sums 0. Consequently, \(H_tM_{d11}^+=M_{d11}^+\) and \({\hat{\tau }}=M_{d11}^+T_d^T\omega ^\perp (U)y\) is the Gauss–Markov estimate for \(H_t \tau \) in model (2).
If there are carryover effects, then the uncorrected estimate \({\hat{\tau }}\) is biased. More precisely,
As in [12], we try to determine a design that minimizes the mean square error (MSE) as a performance measure combining bias and variance.
The joint information matrix of direct and carryover effects in model (1) can be written as
where \(M_{d11}\) is as before and
see [3, p. 15].
Analogous to [12], we calculate the MSE in the following way: We restrict attention to designs which allow estimation of all contrasts of direct effects in the model without carryover effects. Because \(M_{d11}\) is the information matrix for direct effects in the model without carryover effects, this is the set of all designs for which \(\hbox {rank}(M_{d11}) = t-1\). In the model with carryover effects, the MSE of the uncorrected estimate \(\widehat{\tau _i - \tau _j}\) for any pair \((i,j), i \ne j\) then is
where \(\ell _{ij}\) is a t-dimensional vector with \(+1\) in position i, \(-1\) in position j and all other entries 0. With the calculations done in [12], we get that the average MSE over all pairs equals
where \(\hbox {tr}(M)\) denotes the trace of a matrix M. As in [12], we consider the worst case for given \(\rho ^T H_t \rho = \sum (\rho _i - {\bar{\rho }})^2=\delta ,\) say. That is, we consider
where \(\lambda _i(M)\) is the i-th largest eigenvalue of the symmetric matrix M.
We multiply this equation by \(\frac{t-1}{2}\) and get the following definition for our optimality-criterion. For any \(d \in \Omega _{t,n,p}\), we define
The advantage of this criterion is that the multivariate purpose of minimizing the bias and maximizing the precision of the estimators can be measured as a single real number. In what follows, we assume without loss of generality that \(\sigma ^2=1.\) Our aim is to find a design that minimizes \(\hbox {MSE}(d)\).
We call a square matrix M completely symmetric, if there are numbers a and b such that all diagonal elements are equal to a and all off-diagonal elements are equal to b. We get the following bound for \(\hbox {MSE}(d)\).
Theorem 1
For any design \(d \in \Omega _{t,n,p}\), there is a lower bound for \(\hbox {MSE}(d)\), namely
Equality holds if both \(M_{d11}\) and \(M_{d12}\) are completely symmetric.
Proof
The proof given for Proposition 1 of [12] remains valid in the circular case. \(\square \)
Note that for every \(M_{d11}\) and \(M_{d12}\) there are symmetrized versions \({\bar{M}}_{d11}\) and \({\bar{M}}_{d12}\) with
for \(1\le i \le j \le 2.\) Here \({\mathcal {S}}_t\) is the set of all \((t\times t)-\)permutation matrices \(\Pi \). We see that all \({\bar{M}}_{dij}\) are completely symmetric and that \(\hbox {tr}(M_{dij}) = \hbox {tr}({\bar{M}}_{dij}).\)
For any design d, define \(q_{dij}=\frac{1}{n}\hbox {tr}\left( M_{dij}\right) \) for \(1\le i \le j \le 2.\) Then, the lower bound in Theorem 1 can be written as
Each subject in the design d receives a sequence s of treatments. Denote by T(s) and F(s) the part of \(T_d\) and \(F_d\) that corresponds to s and define
We call two sequences \(s_1\) and \(s_2\) equivalent if \(s_1\) can be transformed to \(s_2\) by relabelling the treatments. Two equivalent sequences then have the same \(q_{ij}(s)\). If for given t and p there are K, say, equivalence classes of sequences, we choose a representative sequence \(s_k, 1 \le k \le K\) for each class. As pointed out by Kushner [9], the \(q_{dij}\) then are weighted means of the \(q_{ij}(s_k)\). More precisely, we get
where \(\pi _d(k)\) is the proportion of units of d receiving a sequence from class \(k, 1 \le k \le K.\)
Similarly to [12], we consider approximate designs. For approximate designs, we remove the restriction that the number of experimental units to receive a given sequence s must be an integer. For an approximate design, the \(\pi _d(k), 1 \le k \le K\) can be any set of nonnegative real numbers, subject to the condition that \(\sum _{k=1}^K \pi _d(k)=1.\) An exact design \(d \in \Omega _{t,n,p}\) then is a special instance of an approximate design, where each sequence is assigned to an integral number of units. We denote the set of all approximate designs for given t, n and p by \(\Delta _{t,n,p}\). Note that the number n of units is not important for an approximate design \(d \in \Delta _{t,n,p}\). It plays a role in the calculation of \(\hbox {MSE}(d)\), however. A design \(d \in \Delta _{t,n,p}\) is called symmetric, if each sequence s from class \(k, 1 \le k \le K\) appears equally often (i.e. \(\pi _d(k)/m_k\) times, where \(m_k\) is the number of sequences in class k). If the design d is symmetric, then all \(M_{dij}\) are completely symmetric, \(1 \le i \le j \le 2\). We see from the definition of symmetrized designs that for every combination of \(\hbox {tr}(M_{d11})\) and \(\hbox {tr}(M_{d12})\) there is a symmetric design with the same traces. As these designs achieve the lower bound in Theorem 1, we can restrict attention to symmetric designs.
3 Optimal Designs
For any sequence s, we observe that
where \(f_{s,m}\) is the frequency of treatment m in sequence s and \(B_s\) the number of periods where the treatment of the preceding period is repeated. These formulas show that the \(q_{ij}(s)\) depend on the sequence only through the treatment frequencies and the number of periods where the preceding treatment is repeated. The position of the treatments within the sequence does not matter. Hence, we will choose the representative sequences such that the periods with repeated treatments, if there are, come at the end.
As a first step, let \(p>t\). In this case, we can write \(p=gt+s\) with g and s being integers and \(1\le s\le t\). In this situation, there are two classes of particular interest, namely A with representative sequence \(s_1\) and B with representative sequence \(s_2\), where
In these sequences, each treatment appears either g or \(g+1\) times. Note that it makes no difference for \(q_{11}(s_1)\) and \(q_{12}(s_1)\) whether the treatments appearing \(g+1\) times come early or late in \(s_1\).
The values of \(q_{11}(s)\) and \(q_{12}(s)\) for the two classes A and B are displayed in Table 1. It can be seen that \(\sum _{m=1}^t f_{s,m}^2\) is minimal, if the \(f_{s,m}, 1\le m \le t,\) are as nearly equal as possible. Consequently, \(s_1\) and \(s_2\) maximize \(q_{11}(s)\). With that observation, the MSE-optimal design can be determined as follows.
Theorem 2
Assume \(p>t\) and consider a symmetric approximate design \(d^*\in \Delta _{t,n,p}\) consisting of sequences from class A with proportion
and of sequences from class B with proportion \(\pi _2 = 1- \pi _1\). We then get
Proof
In each sequence used by \(d^*\), each treatment in each unit appears either g or \(g+1\) times. Therefore, we have that
and
Observe that the choice of \(\pi _1\) implies that \(q_{d^*12} = 0\). Since \(d^*\) is symmetric, we get
which completes the proof. \(\square \)
If \(p\le t\), the determination of an MSE-optimal design is more complicated. We examine the special cases \(p=2\le t\) and \(p=3\le t\) before we find general optimal designs for \(4\le p \le t.\)
Assume \(p=2\). Note that there are only two possible classes of sequences A and B, displayed in Table 2.
Theorem 3
Assume \(2=p\le t\) and consider a symmetric approximate design \(d^*\in \Delta _{t,n,2}\) consisting only of sequences from class B with representative sequence [1, 2]. Then,
Proof
Assume the design has proportion \(\pi _1\) of its sequences from class A and \(\pi _2=1-\pi _1\) from class B. We get from Table 2 that \(q_{d11}=\sum \pi _d(s)q_{11}(s) =\pi _2\) and \(q_{d12}=-\pi _2\). Hence,
This completes the proof. \(\square \)
For the case \(p=3\), there are only the five possible classes of sequences listed in Table 3. In this case, the optimal design consists of sequences [1, 2, 3] only.
Theorem 4
Assume \(3=p\le t\) and consider a symmetric approximate design \(d^*\in \Delta _{t,n,3}\) that consists only of sequences from class C with representative sequence [1, 2, 3]. Then,
Proof
In the circular model, sequences s from any of the three classes \(B_1\), \(B_2\) and \(B_3\) all have the same \(q_{11}(s)\) and \(q_{12}(s)\), respectively. Assume the design d has proportion \(\pi _2\) of its sequences from one of the classes \(B_j\) and \(\pi _3\) from class C. Then, the proportion of sequences from class A is \(\pi _1=1-\pi _2-\pi _3.\) We get from Table 3 that \(q_{d11}=\sum \pi _d(s)q_{11}(s) =2(\frac{2}{3}\pi _2+\pi _3)\) and \(q_{d12}=-(\frac{2}{3}\pi _2+\pi _3)\). Hence,
and \(d^*\) is MSE-optimal. \(\square \)
Now consider the situation when \(4 \le p \le t\). Remember that for a sequence s, the number \(B_s\) denotes the number of periods in s where the treatment of the preceding period is repeated. As in [12], we can show that
also in the circular case. This inequality is used in the next proposition.
Proposition 1
Define B(d) of a design \(d \in \Delta _{t,n,p}\) as the weighted mean of the \(B_s\) of its sequences, that is \(B(d)=\sum _{s}B_s\pi _d(s)\). Then, for any \( d \in \Delta _{t,n,p} \) we have
Proof
From the inequality \(\sum _{m=1}^t f_{s,m}^2\ge p+2B_s\), we get for any sequence s that
Since \(q_{dij}=\sum \pi _d(s) q_{ij}(s)\), the desired inequalities follow. \(\square \)
For \(4 \le p \le t\), there are two classes of sequences of interest. Class A has representative sequence \([1,2,\ldots , p-4,p-3,p-3, p-2, p-2]\), i.e. two treatments are given twice in adjacent periods such that \(B_s=2\). If \(p>4\), there are \(p-4\) other treatments given exactly once. The second class, B, has representative sequence \([1,\ldots , p]\), i.e. no treatment is given twice and \(B_s=0\). Class B has been shown to produce (universally) optimal designs for various cases [2, 5, 8], since balanced designs use sequences from this class. Class A was shown by Zheng et al. [13] to be important for non-trivial covariance matrices.
The respective values for \(q_{11}(s)\) and \(q_{12}(s)\) can be seen in Table 4. We now show that a design minimizing the MSE can be found in the set of all designs consisting of sequences from classes A and B only.
Proposition 2
Assume \(4\le p \le t\). For any design \(d \in \Delta _{t,n,p}\) define
and
We construct a symmetric design \({\bar{d}}_{t,n,p} \in \Delta _{t,n,p}\) which consists of a proportion \({\bar{\pi }}\) of sequences from class A and a proportion \(1-{\bar{\pi }}\) of sequences from class B. Then,
Proof
For the design \({\bar{d}}\), we conclude from the \(q_{ij}(s)\) in Table 4, that \(q_{{\bar{d}}11} = p-1- \frac{4}{p}{\bar{\pi }}\) and \(q_{{\bar{d}}12} = -(1-{\bar{\pi }})+{\bar{\pi }}\frac{p-4}{p}=-1+2{\bar{\pi }} \frac{p-2}{p}\). Making use of the fact that \({\bar{d}}\) is symmetric, we then have
Now distinguish between two cases.
Case 1: \(B(d) \ge \frac{p}{p-2}\).
In this case, \({\bar{\pi }}=\frac{p}{2(p-2)}\) and therefore \(q_{{\bar{d}}11} = p-1- \frac{2}{p}B(d)\) while \(q_{{\bar{d}}12} = 0.\) It follows from Proposition 1 that \(q_{d11}\le p-1 -\frac{2}{p}B(d) \le q_{{\bar{d}}11}\). Hence,
Case 2: Assume \(0\le B(d) < \frac{p}{p-2}.\)
In that case, it follows from Proposition 1 that \(q_{d12}\le -1 + \frac{p-2}{p}B(d) <0.\) Therefore, a lower bound for \(\hbox {MSE}(d)\) is achieved by making both \(q_{d12}\) and \(q_{d11}\) as large as possible. We hence get from Proposition 1 that
Since \({\bar{\pi }}=B(d)/2\), the right hand side of this inequality equals \(M({\bar{\pi }})\). \(\square \)
It follows from Proposition 2 that the MSE-optimal design for a given \(\delta \) can be found among the set of all designs \({\bar{d}}\) defined in Proposition 2. That is, we have to find the best \({\bar{\pi }}\).
Theorem 5
Assume \(4\le p \le t\) and \(\delta \ge 0\) is given.
-
1.
If \(\delta < \frac{(p-1)(t-1)^2}{np(p-3)}\) choose \(d^*\) as the symmetric design that only consists of sequences from class B, i.e. a design \({\bar{d}}\) with \({\bar{\pi }}=0.\) Then,
$$\begin{aligned} \hbox {MSE}(d^*) = M(0)=\min _{d \in \Delta _{t,n,p}}\hbox {MSE}(d) \end{aligned}$$ -
2.
If \(\delta \ge \frac{(p-1)(t-1)^2}{np(p-3)}\), let \(d^*\) be a symmetric design that consists of sequences from class A with proportion
$$\begin{aligned} \pi (\delta )=\frac{p(p-1)}{4} - \frac{\delta n p^3(p-3)^2}{4\big ( \delta n p (p-2)(p-3) - 2(t-1)^2 \big ) } \end{aligned}$$and of sequences from class B with proportion \(1-\pi (\delta )\). Then,
$$\begin{aligned} \hbox {MSE}(d^*) = M(\pi (\delta )) = \min _{d \in \Delta _{t,n,p}}\hbox {MSE}(d). \end{aligned}$$
Proof
It follows from Proposition 2 that for any d there is a \({\pi } \in [0,\frac{p}{2(p-2)}]\) such that \(\hbox {MSE}(d) \ge M({\pi })\).
We therefore have to find a \(\pi ^* \in [0, \frac{p}{2(p-2)}]\) which minimizes \(M({\pi })\).
Case 1: \(\delta =\frac{2(t-1)^2}{np(p-2)(p-3)}.\) In this case the derivative of \(M(\pi )\) with respect to \(\pi \) is
Then, \(D(\pi )>0\) for all \(\pi \in [0,\frac{p}{2(p-1)}]\) and \(M({\pi })\) is minimal for \(\pi =0\).
For all other \(\delta \), the derivative of \(M({\pi })\) with respect to \({\pi }\) is
where \(\pi _0=\frac{p(p-1)}{4}-\frac{\delta np^3(p-3)^2}{4\big ( \delta np(p-2)(p-3)-2(t-1)^2 \big ) }.\)
Hence, \(D(\pi )=0\) if and only if \(\pi =\pi _0\).
Case 2: \(0\le \delta <\frac{2(t-1)^2}{np(p-2)(p-3)}.\)
In this case, \( \delta np(p-2)(p-3)-2(t-1)^2 < 0\) and, therefore, \(\pi _0>\frac{p(p-1)}{4}>\frac{p}{2(p-1)}\). Further, it follows for all \(\pi < \pi _0\) that \(D(\pi )>0\). Hence, the minimum of \(M(\pi )\) is attained for \(\pi =0.\)
Case 3: \(\frac{2(t-1)^2}{np(p-2)(p-3)}<\delta \le \frac{(p-1)(t-1)^2}{np(p-3)}.\)
In this case, \( \delta np(p-2)(p-3)-2(t-1)^2 > 0\) and, therefore, \(D(\pi )>0\) for all \(\pi > \pi _0\). Also in this case, \(\pi _0 \le 0\). Hence, again, the minimum of \(MSE(\pi )\) is attained for \(\pi =0\).
Combining cases 1, 2 and 3, we have shown that for all \(\delta \le \frac{(p-1)(t-1)^2}{np(p-3)}\) the choice \(\pi ^*=0\) is optimum.
Case 4: \(\delta \ge \frac{(p-1)(t-1)^2}{np(p-3)}\). In this case, \( \delta np(p-2)(p-3)-2(t-1)^2 > 0\) and, therefore, \(D(\pi )>0\) for all \(\pi > \pi _0\) and \(D(\pi )<0\) for all \(\pi < \pi _0\). Therefore, the MSE is minimized by that \({\bar{d}}\) for which \({\bar{\pi }}=\pi _0 = \pi (\delta )\). \(\square \)
Note that \(\pi (\delta )\) increases with \(\delta \) and approaches \(\frac{p}{2(p-2)}\) for \(\delta \rightarrow \infty \).
4 Optimal Designs for the Model with Period Effects
We extend the model and include period effects, i.e. the model becomes
with \(\beta \) the vector of the period effects and P the corresponding design matrix. In this model, we have the information matrices
The MSE of a design then can be defined in the same manner as in the model without period effects, only replacing \(M_{dij}\) by \({\tilde{M}}_{dij}\), \(1\le i \le j \le 2.\) We get
Defining \({\tilde{q}}_{d11} = \frac{1}{n} \hbox {tr}\left( {\tilde{M}}_{d11}\right) \) and \({\tilde{q}}_{d12} = \frac{1}{n} \hbox {tr}\left( {\tilde{M}}_{d12}\right) \), we can show with the same arguments as before that
with equality if \({\tilde{M}}_{d11}\) and \({\tilde{M}}_{d12}\) are completely symmetric.
Unfortunately, the calculations of \(\hbox {tr}\left( {\tilde{M}}_{d11}\right) \) and \(\hbox {tr}\left( {\tilde{M}}_{d12}\right) \) are different from the calculations before. In particular, it is not possible to express the \({\tilde{q}}_{dij}\) as weighted means of contributions from the single sequences. Adapting the results of [4] to circular designs, we get
where \(l_{di,k}\) is the number of appearances of treatment i in period k (with \(l_{di0}=l_{\mathrm{dip}}\)) and \(r_{di}\) the number of appearances of treatment i in the design. The numbers \(q_{d11}\) and \(q_{d12}\) are as in Sect. 3.
For symmetric designs, we get \(\hbox {tr}({\tilde{M}}_{d11}) = n\, q_{d11}\) and \(\hbox {tr}({\tilde{M}}_{d12}) = n\, q_{d12}\), i.e. \({\tilde{q}}_{dij}=q_{dij}\). In the case \(p>t\), the best symmetric designs have \(q_{d12}=0\) and the results of Theorem 2 extend to the model with period effects.
There are examples—at least in non-circular designs—of designs that are non-symmetric and where the uncorrected estimate from the model with period effects has a smaller bias than the estimate from the model without period effects. In the case \(p\le t\), the best symmetric designs have nonzero bias. Hence, it is not obvious that the results of Theorems 3 and 4 and of Proposition 2 still hold in the model with period effects.
To show that they do, we start by showing that for non-symmetric designs the loss in \(\hbox {tr}({\tilde{M}}_{d11})\) is higher than the possible gain in \(|\hbox {tr}({\tilde{M}}_{d12})|.\)
Proposition 3
For any \(d\in \Omega _{t,n,p},\) it holds that
Proof
Defining \(Q=\frac{1}{n}PP^T-\frac{1}{np}1_{np}1_{np}^T = \omega \left( \omega ^\perp \left( U\right) P\right) \), we find that
and that
Because of the circular structure of the carryover effects, we have
(cf. [8] or [6]) and, hence, that \(T_d^TQ T_d = F_d^TQ F_d\). Therefore, the Cauchy–Schwarz inequality implies that
and the proposition follows. \(\square \)
If \(p\ge 4\), we are now able to show that for every design d there is a symmetric design \(d_s\) consisting of sequences from classes A and B described in Table 4 only and having smaller or equal MSE.
Proposition 4
Let \(p\ge 4\). Assume the model with period effects holds and define \(\Gamma _{t,n,p} \subset \Delta _{t,n,p}\) as the set of all symmetric \(d \in \Delta _{t,n,p}\).
Then, \(\forall d \in \Delta _{t,n,p}\, \exists \, d_s \in \Gamma _{t,n,p}\) with
Proof
Case 1: \(|{\tilde{q}}_{d12}|>1\).
Let \(d_s\) be a symmetric design that consists of sequences from class B only. Then, \(d_s\in \Gamma _{t,n,p}\) and \(|q_{d_s12}| = 1 < |{\tilde{q}}_{d12}|\). On the other hand (because of Proposition 1 and since \(B(d_s)=0\)), we have that
Thus, it holds that
Case 2: \(|{\tilde{q}}_{d12}|\le 1\).
Let \(d_s\) be the symmetric design that consists of sequences from class A with proportion
and sequences from class B with proportion \(1-\pi \). Then, \(d_s\in \Gamma _{t,n,p}\) and we get from Table 4 that
(and hence \(|q_{d_s12}|=|{\tilde{q}}_{d12}|\)).
Therefore, if we can show that \({\tilde{q}}_{d_s11}\ge {\tilde{q}}_{d11},\) we are done.
Remembering that \(d_s\) is a symmetric design we have \({\tilde{q}}_{d_s11}= q_{d_s11}\). Making use of Table 4 again, we get that \(B(d_s)=2\pi \) and
Subcase 2a: \(B(d)\ge B(d_s).\)
From Proposition 1, we know that \(q_{d11} \le p-1- \frac{2B(d)}{p}\). Hence,
Subcase 2b: \(B(d)< B(d_s).\)
Observe that
Hence, we conclude from Proposition 3 that
Therefore,
With Proposition 1, it follows that
Since \(p-2 \ge 2\), we get
which is \({\tilde{q}}_{d_s11}\). This completes the proof. \(\square \)
Note that this proposition implies that the design \(d^*\) determined in Theorem 5 for the model without period effects, is also MSE-optimal in the model with period effects.
The proofs for the cases \(p=2\) and \(p=3\) have to be done differently. We start with \(p=2\).
Theorem 6
Assume \(2=p\le t\). As in Theorem 3, consider the symmetric approximate design \(d^*\in \Delta _{t,n,2}\) consisting only of sequences from class B of Table 2 with representative sequence [1, 2]. We then have \(\forall d \in \Omega _{t,n,2}\) that
Proof
We have seen in the proof of Theorem 3 for an arbitrary design d that
where \(\pi _2\) is the proportion of sequences from class B.
In the model with period effects, we write \({\tilde{q}}_{d11}=q_{d11}-\epsilon = \pi _2-\epsilon \). Since \(q_{d11}\ge {\tilde{q}}_{d11}\ge 0\), we have \(\pi _2\ge \epsilon \ge 0\).
From Proposition 3, we get that \(|q_{d12}-{\tilde{q}}_{d12}|\le \epsilon \). Therefore, \({\tilde{q}}_{d12} \le q_{d12} +\epsilon = -\pi _2+\epsilon \le 0\) and \(|{\tilde{q}}_{d12}|\ge \pi _2 - \epsilon \).
It follows that
This completes the proof. \(\square \)
The proof for the case \(p=3\) turns out to cause extra difficulties.
Theorem 7
Assume \(3=p\le t\). As in Theorem 4, consider a symmetric approximate design \(d^*\in \Delta _{t,n,3}\) that consists only of sequences from class C with representative sequence [1, 2, 3]. Then,
Proof
We have seen in the proof of Theorem 4 for an arbitrary design \(d\in \Delta _{t,n,3}\) that
If we define \(\epsilon =q_{d11}-{\tilde{q}}_{d11}\) and \(\rho ={\tilde{q}}_{d12}-{q}_{d12}\), we get that
At this point, Proposition 3 is of no use. It only guarantees that \(\epsilon \ge |\rho |\), but if we could find a design with \(\rho =\epsilon >0\), this would produce a smaller bias. If we could get \(\rho =\epsilon =\pi _3+\frac{2}{3}\pi _2\), we could even achieve
However, this is not possible. In the case \(p=3\), we can get a stronger version of Proposition 3.
In the proof of Proposition 3, we have used the equalities
and
Hence,
and
Therefore, \(\rho =\frac{1}{2}\epsilon \), regardless of our choice of the \(l_{di,k}\). It follows that
\(\square \)
5 Efficiency in Terms of MSE and Example
We have seen in Sects. 3 and 4 that MSE-optimality, for \(4 \le p\le t\), is a local criterion, the MSE-optimum design depends on the true \(\delta \). Since in practice, we will in general not know \(\delta \), we try to find a design which is efficient for a broad range of \(\delta \). The efficiency of a design d at a point \(\delta \) is defined as
where \(\hbox {MSE}_{\delta , \mathrm{opt}}\) is the minimum MSE for this \(\delta \). Thus, a high efficiency is desirable and the efficiency is a number between 0 and 1.
In what follows, we consider the case \(4\le p \le t\). In the case of circular designs, often designs are considered that only consist of sequences from class B. Special cases of these designs are so-called circular balanced designs or orthogonal arrays of type I. These have been shown to be optimal in a broad class of designs, see, e.g. [2]. However, Zheng et al. [13] showed that designs with sequences from class A can be efficient in some situations. Thus, we will consider the designs \(d_A\) and \(d_B\) that only consist of sequences from classes A or B, respectively, and calculate their efficiency.
First, we examine the case that \(\delta < \frac{(p-1)(t-1)^2}{np(p-3)}.\) In this case, \(d_B\) is MSE-optimal and thus
For \(d_A\), we get in this case
If \(\delta \ge \frac{(p-1)(t-1)^2}{np(p-3)}\), we get
and
We see that the efficiency of \(d_B\) decreases with \(\delta \) while the efficiency of \(d_A\) increases with \(\delta \).
As an example, consider the case \(t=6, p=4\) and \(n=40\). In that case, the critical step for \(\pi (\delta )\) is at \(\delta _1 = \frac{4\cdot 25}{40\cdot 4 \cdot 1} = \frac{15}{32} = 0.46875\). If \(\delta <\delta _1\), the design \(d_B\) is MSE-optimal and has MSE-efficiency 1.
If \(\delta \) increases, we see that the proportion \(1-\pi (\delta )\) of sequences from class B in the MSE-optimal design decreases (Fig. 1). This is in agreement with the decreasing efficiency of the design \(d_B\) and the increasing efficiency of the design that only consists of sequences from class A (Fig. 2). However, as long as \(\delta \) remains reasonably small, the efficiency of \(d_B\) is still high (and larger than the efficiency of \(d_A\)). Therefore, it makes sense to use the design \(d_B\).
Of particular interest for practice is the case when the number of treatments t equals the number of periods p. In this case, we can construct a circular balanced uniform design \(d_B\) with \(n=t(t-1)\) units.
For a design of this size, the limit \(\delta _1=\frac{(p-1)(t-1)^2}{np(p-3)}\) becomes 0.56 for \(p=4\), 0.32 for \(p=5\), 0.23 for \(p=6\) and goes down to 0.01 for \(p=100\). The efficiency \(\hbox {Eff}_{d_B}(\delta )\) is 1 as long as \(\delta \) is less than \(\delta _1\), and decreases when \(\delta \) increases. For \(\delta =3\), we observe that \(\hbox {Eff}_{d_B}(3)\) is 0.62 if \(p=4\), 0.61 if \(p=5\) and 0.64 if \(p=6\). For larger p, it increases, for instance, it is 0.97 for \(p=100\).
References
Azaïs JM (1987) Design of experiments for studying intergenotypic competition. J R Stat Soc B 49:334–345
Bailey RA, Cameron PJ, Filipiak K, Kunert J, Markiewicz A (2017) On optimality and construction of circular repeated-measurements designs. Stat Sinica 27:1–22
Bose M, Dey A (2009) Optimal crossover designs. World Scientific, Hackensack, NJ
Cheng CS, Wu CF (1980) Balanced repeated measurements designs. Ann Stat 8:1272–1283
Druilhet P (1999) Optimality of neighbour balanced designs. J Stat Plann Inference 81:141–152
Druilhet P, Bailey RA (2004) Optimality of neighbor-balanced designs for total effects. Ann Stat 32(4):1650–1661. https://doi.org/10.1214/009053604000000481
Kunert J (1983) Optimal design and refinement of the linear model with applications to repeated measurements designs. Ann Stat 11:247–257
Kunert J (1998) On the analysis of circular balanced crossover designs. J Stat Plann Inference 69:359–370
Kushner HB (1997) Optimal repeated measurements designs: the linear optimality equations. Ann Stat 25:2328–2344
Magda CG (1980) Circular balanced repeated measurements designs. Commun Stat Theory Methods 9:1901–1918
Naes T, Brockhoff P, Tomic O (2010) Statistics for sensory and consumer science. Wiley, Chichester
Neumann C, Kunert J (2018) On MSE-optimal crossover designs. Ann Stat 46:2939–2959
Zheng W, Ai M, Li K (2017) Identification of universally optimal circular designs for the interference model. Ann. Stat 45:1462–1487
Funding
This research was sponsored by the Deutsche Forschungsgemeinschaft (SFB 823, Teilprojekt C2).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that there are no conflicts of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article is part of the topical collection “Special Issue: State of the art in research on design and analysis of experiments” guest edited by John Stufken, Abhyuday Mandal, and Rakhi Singh.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Neumann, C., Kunert, J. On MSE-Optimal Circular Crossover Designs. J Stat Theory Pract 15, 85 (2021). https://doi.org/10.1007/s42519-021-00220-0
Accepted:
Published:
DOI: https://doi.org/10.1007/s42519-021-00220-0