Skip to main content
Log in

Optimal approximate designs for estimating treatment contrasts resistant to nuisance effects

  • Regular Article
  • Published:
Statistical Papers Aims and scope Submit manuscript

Abstract

Suppose that we intend to perform an experiment consisting of a set of independent trials. The mean value of the response in each trial is assumed to be equal to the sum of the effect of the treatment selected for that trial and some nuisance effects, e.g., the effect of a time trend or blocking. In this model, we examine optimal approximate designs for the estimation of a system of treatment contrasts, with respect to a wide range of optimality criteria. We show that it is necessary for any optimal design to attain the optimal treatment proportions, which may be obtained from the marginal model that excludes the nuisance effects. Moreover, we prove that for a design to be optimal, it is sufficient that it attains the optimal treatment proportions and satisfies the conditions for resistance to nuisance effects. For selected natural choices of treatment contrasts and optimality criteria, we calculate the optimal treatment proportions and provide an explicit form of optimal designs. In particular, we obtain optimal treatment proportions for the comparison of a set of test treatments with a set of controls. Once the optimal treatment proportions are determined, the results allow us to construct a method of calculating optimal approximate designs with small support sizes through linear programming. Consequently, we can construct efficient exact designs using a simple heuristic.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

References

  • Atkinson AC (1999) Optimum biased-coin designs for sequential treatment allocation with covariate information. Stat Med 18:1741–1752

    Article  Google Scholar 

  • Atkinson AC, Donev AN (1996) Experimental design optimally balanced for trend. Technometrics 38:333–341

    Article  MATH  Google Scholar 

  • Bailey RA, Cameron PJ (2009) Combinatorics of optimal designs. Surv Comb 365:19–73

    MathSciNet  MATH  Google Scholar 

  • Bailey RA, Cheng CS, Kipnis P (1992) Construction of trend-resistant factorial designs. Stat Sin 2:393–411

    MathSciNet  MATH  Google Scholar 

  • Bechhofer RE, Tamhane AC (1981) Incomplete block designs for comparing treatments with a control: general theory. Technometrics 23:45–57

    MathSciNet  MATH  Google Scholar 

  • Bradley RA, Yeh CM (1980) Trend-free block designs: theory. Ann Stat 8:883–893

    Article  MathSciNet  MATH  Google Scholar 

  • Cheng CS (1990) Construction of run orders of factorial designs. Statistical design and analysis of industrial experiments. Marcel-Dekker, New York, pp 423–439

    Google Scholar 

  • Cox DR (1951) Some systematic experimental designs. Biometrika 38:312–323

    Article  MathSciNet  MATH  Google Scholar 

  • Cox DR, Reid N (2000) Theory of the design of experiments. Chapman and Hall, London

    MATH  Google Scholar 

  • Filipiak K, Markiewicz A, Szczepanska A (2009) Optimal designs under a multivariate linear model with additional nuisance parameters. Stat Pap 50:761–778

    Article  MathSciNet  MATH  Google Scholar 

  • Giovagnoli A, Wynn HP (1985) Schur-optimal continuous block designs for treatments with a control. In: Proceedings of the Berkeley conference in honor of Jerzy Neyman and Jack Kiefer, Wadsworth, pp 651–666

  • Githinji F, Jacroux M (1998) On the determination and construction of optimal designs for comparing a set of test treatments with a set of controls in the presence of a linear trend. J Stat Plan Infer 66:161–174

    Article  MathSciNet  MATH  Google Scholar 

  • Harman R (2004) Minimal efficiency of designs under the class of orthogonally invariant information criteria. Metrika 60:137–153

    Article  MathSciNet  MATH  Google Scholar 

  • Harman R, Sagnol G (2015) Computing D-optimal experimental designs for estimating treatment contrasts under the presence of a nuisance time trend. Stochastic models, statistics and their applications, Springer, Wroclaw, pp 83–91

  • Jacroux M (1982) Some E-optimal designs for the one-way and two-way elimination of heterogeneity. J R Stat Soc 44:253–261

    MathSciNet  MATH  Google Scholar 

  • Jacroux M (1990) Some optimal designs for comparing a set of test treatments with a set of controls. Ann I Stat Math 42:173–185

    Article  MathSciNet  MATH  Google Scholar 

  • Jacroux M (1993) On the construction of trend-resistant designs for comparing a set of test treatments with a set of controls. J Am Stat Assoc 88:1398–1403

    Article  MathSciNet  MATH  Google Scholar 

  • Jacroux M (2002) A- and MV-efficient block designs for comparing a set of controls to a set of test treatments. Sankhya Ser B 64:141–161

    MathSciNet  MATH  Google Scholar 

  • Jacroux M, Ray RS (1990) On the construction of trend-free run orders of treatments. Biometrika 77:187–191

    Article  MathSciNet  MATH  Google Scholar 

  • Jacroux M, Majumdar D, Shah KR (1997) On the determination and construction of optimal block designs in the presence of linear trends. J Am Stat Assoc 92:375–382

    Article  MathSciNet  MATH  Google Scholar 

  • Kitsos CP, Titterington DM, Torsney B (1988) An optimal design problem in rhythmometry. Biometrics 44:657–671

    Article  MathSciNet  MATH  Google Scholar 

  • Kunert J (1983) Optimal design and refinement of the linear model with applications to repeated measurements designs. Ann Stat 11:247–257

    Article  MathSciNet  MATH  Google Scholar 

  • Majumdar D (1986) Optimal designs for comparisons between two sets of treatments. J Stat Plan Inference 14:359–372

    Article  MathSciNet  MATH  Google Scholar 

  • Majumdar D (1996) Optimal and efficient treatment-control designs. Handbook of statistics 13: design and analysis of experiments. North Holland, Amsterdam, pp 1007–1053

    Google Scholar 

  • Majumdar D, Notz WI (1983) Optimal incomplete block designs for comparing treatments with a control. Ann Stat 11:258–266

    Article  MathSciNet  MATH  Google Scholar 

  • Pocock SJ, Simon R (1975) Sequential treatment assignment with balancing for prognostic factors in the controlled clinical trial. Biometrics 31:103–115

    Article  Google Scholar 

  • Pukelsheim F (1983) On optimality properties of simple block designs in the approximate design theory. J Stat Plan Inference 8:193–208

    Article  MathSciNet  MATH  Google Scholar 

  • Pukelsheim F (2006) Optimal design of experiments. SIAM, Philadelphia

    Book  MATH  Google Scholar 

  • Schwabe R (1996) Optimal designs for additive linear models. Statistics 27:267–278

    Article  MathSciNet  MATH  Google Scholar 

  • Schwabe R, Wierich W (1995) D-optimal designs of experiments with non-interacting factors. J Stat Plan Inference 44:371–384

    Article  MathSciNet  MATH  Google Scholar 

  • Seber GA (2008) A matrix handbook for statisticians. Wiley, Hoboken

    MATH  Google Scholar 

  • Wu CF (1980) On some ordering properties of the generalized inverses of non-negative definite matrices. Linear Algebra Appl 32:49–60

    Article  MathSciNet  MATH  Google Scholar 

Download references

Acknowledgments

This research was supported by the VEGA 1/0521/16 grant from the Slovak Scientific Grant Agency. The research of the first author was supported by the UK/255/2015 grant from Comenius University in Bratislava.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Samuel Rosa.

Appendix

Appendix

Throughout Appendix, we assume that \(K^T = (Q^T, 0_{s \times d})\).

Proof of Proposition 1

Let us partition the matrix L in

$$\begin{aligned} N_K(\xi )= \mathop \mathrm {min}\limits _{L \in \mathbb {R}^{s \times m}: LK=I_s} LM(\xi )L^T \end{aligned}$$

as \(L=(L_1, L_2)\), where \(L_1\) is an \(s \times v\) matrix and \(L_2\) is an \(s \times d\) matrix. Then,

$$\begin{aligned} N_K(\xi )&=\mathop \mathrm {min}\limits _{LK=I_s} LM(\xi )L^T = \mathop \mathrm {min}\limits _{(L_1, L_2) (Q^T, 0)^T = I_s} (L_1, L_2)M(\xi )(L_1, L_2)^T \\&\preceq \mathop \mathrm {min}\limits _{L_1Q=I_s} L_1 M_{11}(\xi ) L_1^T = N_Q(w). \end{aligned}$$

\(\square \)

To prove Proposition 2, we will need two preliminary results formulated in the following lemmas.

Lemma 1

Let \(\tilde{M}\) be a non-negative definite matrix. If a design \(\xi \) satisfies \(M(\xi )\tilde{M}^-K=K\) for some generalized inverse \(\tilde{M}^-\) of \(\tilde{M}\), then (i) \(\xi \) is feasible for \(K^T\beta \) and (ii) \(K^T M^-(\xi )K = K^T \tilde{M}^-K\).

Proof

The steps of the proof follow the proof of Theorem 8.13 from Pukelsheim (2006). Let \(G{:}=\tilde{M}^-\). Because \(M(\xi )GK=K\), we obtain \(M(\xi )X=K\), where \(X=GK\). Therefore, \(\mathcal {C}(K) \subseteq \mathcal {C}(M(\xi ))\), and hence, \(\xi \) is feasible. Let us premultiply the equation \(M(\xi )GK=K\) by \(K^TM^-(\xi )\) such that we obtain \(K^T M^-(\xi )K\) on the right-hand side. The left-hand side is then equal to \(K^T M^-(\xi )M(\xi )GK\). Note that \(K^T=X^TM^T(\xi )=X^TM(\xi )\), and hence, the following holds:

$$\begin{aligned} K^T M^-(\xi )M(\xi )GK = X^TM(\xi ) M^-(\xi )M(\xi )GK = X^T M(\xi ) G K = K^T G K. \end{aligned}$$

It follows that \(K^T M^-(\xi )K = K^T \tilde{M}^- K\). \(\square \)

Lemma 2

Let \(w>0\) be a treatment proportions design, and let \(G:=\mathrm {diag}(w^{-1},0_d)\). Let \(\xi \) be a design in model (1). Then, \(\xi \) satisfies \(M(\xi )GK=K\) if and only if (i) w is a treatment proportions design of \(\xi \) and (ii) \(\xi \) is resistant to nuisance effects.

Proof

We may express \(M(\xi )GK {=} K\) as \( M_{11}(\xi )\mathrm {diag}(w^{-1}) Q {=} Q \) and \(M_{12}^T(\xi ) \mathrm {diag}(w^{-1}) Q = 0\). Because both \(M_{11}(\xi )\) and \(\mathrm {diag}(w^{-1})\) are diagonal matrices and all rows of Q are assumed to be non-zero vectors, the first equation is equivalent to \(\frac{1}{w_u}\sum _t\xi (u,t) = 1\) for all u, which is (i). From the second equation, we find that every row of \( M_{12}^T(\xi )\mathrm {diag}(w^{-1})= \big [ \frac{1}{w_1}\sum _t \xi (1,t)h(t), \ldots , \frac{1}{w_v}\sum _t \xi (v,t)h(t) \big ] \) needs to be in \(\mathcal {N}(Q^T)\), which is (ii). \(\square \)

Proof of Proposition 2

Let \(\tilde{M} := \mathrm {diag}(w,0_d)\). Then, \(G := \mathrm {diag}(w^{-1},0_d)\) is a generalized inverse of \(\tilde{M}\). From Lemma 2 it follows that \(M(\xi )GK=K\) and from Lemma 1, it follows that (i) and (ii) hold. Statement (iii) is a direct consequence of (ii). \(\square \)

Proof of Theorem 1

Let \(\xi \) be a feasible design in (1). Using Proposition 1, we find that \(N_K(\xi ) \preceq N_Q(w)\), where w is the treatment proportions design of \(\xi \). Moreover, because product designs are nuisance resistant, part (iii) of Proposition 2 implies that \(N_Q(w) = N_K(w \otimes \alpha )\) for any nuisance conditions design \(\alpha \). Therefore, \(N_K(\xi ) \preceq N_K(w \otimes \alpha )\).

Suppose that \(w^*\) is not a \(\Phi \)-optimal design. Then, there exists a design \(w_{\mathrm {b}}\) in model (3) such that \(\Phi (N_Q(w^*)) < \Phi (N_Q(w_{\mathrm {b}}))\). Then, \(\Phi (N_K(\xi ^*)) \le \Phi (N_Q(w^*)) < \Phi (N_Q(w_{\mathrm {b}})) = \Phi (N_K(w_{\mathrm {b}} \otimes \alpha ))\) for any nuisance conditions design \(\alpha \). This contradicts the \(\Phi \)-optimality of \(\xi ^*\). \(\square \)

The following lemma (Theorem 8.13 from Pukelsheim 2006) enables a complete characterization of \(\Phi \)-optimal designs for a strictly concave \(\Phi \).

Lemma 3

Let \(\Phi \) be an information function that is strictly concave on \(\mathfrak {S}^s_{++}\), and let \(\xi ^*\) be \(\Phi \)-optimal for \(K^T\beta \). Let G be a generalized inverse of \(M(\xi ^*)\) that satisfies the normality inequality of the General Equivalence Theorem (Theorem 7.14 from Pukelsheim 2006), i.e., there exists a non-negative definite matrix D that solves the polarity equation

$$\begin{aligned} \Phi (N_K(\xi ^*))\Phi ^\infty (D) = \mathrm {tr}(N_K(\xi ^*)D)=1, \end{aligned}$$

where \(\Phi ^\infty \) is the polar information function of \(\Phi \) (see Pukelsheim 2006), and G satisfies the normality inequality

$$\begin{aligned} \mathrm {tr}(M(\xi )B) \le 1 \quad \text {for all feasible designs } \xi , \end{aligned}$$

where \(B=GKN_K(\xi ^*)DN_K(\xi ^*)K^TG^T\). Then, a design \(\xi \) is \(\Phi \)-optimal if and only if \(M(\xi )GK=K\).

To use Lemma 3, we must obtain a matrix G that satisfies the normality inequality of the General Equivalence Theorem.

Lemma 4

Let \(\Phi \) be an information function that is strictly concave on \(\mathfrak {S}^s_{++}\). Then, there exists a unique \(\Phi \)-optimal treatment proportions design for \(Q^T\tau \), say \(w^*\). Let \(G:=\mathrm {diag}((w^*)^{-1},0_d)\). Then, G satisfies the normality inequality of the General Equivalence Theorem for estimating \(K^T\beta \) in model (1).

Proof

Let \(w^*\) be a \(\Phi \)-optimal treatment proportions design, and let \(G_{11}{:}{=}\,\mathrm {diag}((w^*)^{-1})\). Then, the matrix \(G_{11}\), which is the unique generalized inverse of \(M(w^*)\), satisfies the normality inequality of the General Equivalence Theorem for model (3). By Lemma 3, a treatment proportions design w is \(\Phi \)-optimal if and only if \(M(w)M^{-1}(w^*)Q=Q\). By the same argument as that used in the proof of Lemma 2, \(M(w)M^{-1}(w^*)Q=Q\) holds if and only if \(w=w^*\), which proves the uniqueness of \(w^*\). Let \(N^*:=N_Q(w^*)\). Because \(G_{11}\) satisfies the normality inequality in model (3), there exists a matrix D that satisfies the polarity equation \(\Phi (N^*)\Phi ^\infty (D)=\mathrm {tr}(N^*D) = 1\) and the matrix \(B_w=G_{11}QN^*DN^*Q^TG_{11}\) satisfies \(\mathrm {tr}(M(\tilde{w})B_w) \le 1\) for all \(\tilde{w}>0\).

Since \(N_K(\xi ^*)\) is positive definite and \(\Phi \) is strictly concave on \(\mathfrak {S}^s_{++}\), there exists a unique \(\Phi \)-optimal information matrix \(N_K(\xi ^*)\). The \(\Phi \)-optimality of \(N_K(w^* \otimes \alpha ) = N_Q(w^*)=N^*\) for any nuisance conditions design \(\alpha \) yields \(N_K(\xi ^*) = N^*\). Thus, the polarity equality holds in model (1) for the same matrix D. Let \(\tilde{\xi }\) be a feasible design. Then, the left-hand side of the normality inequality in model (1) is \(\mathrm {tr}(M(\tilde{\xi })B)\), where

$$\begin{aligned} B=\begin{bmatrix} G_{11}&\quad 0 \\ 0&\quad 0 \end{bmatrix} \begin{bmatrix} Q \\ 0 \end{bmatrix} N^* D N^* \begin{bmatrix} Q^T&\quad 0 \end{bmatrix} \begin{bmatrix} G_{11}&\quad 0 \\ 0&\quad 0 \end{bmatrix} = \begin{bmatrix} B_w&\quad 0 \\ 0&\quad 0 \end{bmatrix}. \end{aligned}$$

Then, because \(B_w\) satisfies the normality inequality in model (3), we obtain \(\mathrm {tr}(M(\tilde{\xi })B) = \mathrm {tr}(M_{11}(\tilde{\xi })B_w) = \mathrm {tr}(M(\tilde{w})B_w) \le 1\), where \(\tilde{w}\) is the treatment proportions design of \(\tilde{\xi }\). \(\square \)

Proof of Theorem 2

Let \(\xi \) be a feasible design in model (1), and let w be its treatment proportions design. Because \(\Phi \) is isotonic, it follows from Proposition 1 that \(\Phi (N_Q(w)) \ge \Phi (N_K(\xi ))\). Since \(w^*\) is \(\Phi \)-optimal, \(\Phi (N_Q(w^*)) \ge \Phi (N_Q(w))\) and it is feasible; thus, \(w^* {>}0\). Using Proposition 2, we find that \(\Phi (N_K(\xi ^*)) = \Phi (N_Q(w^*)) \ge \Phi (N_Q(w)) \ge \Phi (N_K(\xi )),\) i.e., \(\xi ^*\) is \(\Phi \)-optimal. \(\square \)

Proof of Theorem 3

Lemma 4 ensures the uniqueness of \(w^*\). Let \(G=\mathrm {diag}((w^*)^{-1},0_d)\). Consider an extension of model (1), where \(t \in \mathfrak T'\), \(\mathfrak T'=\mathfrak T\cup \{ z\}\), and \(h(z) = 0_d\), and let \(\xi '\) be a design in this extended model, with its treatment proportions design \(w^*\), that satisfies \(\xi '(u,t) = 0\) for \(t \ne z\). Then, \(M(\xi ')=\mathrm {diag}(w^*,0_d)\) and G is a generalized inverse of \(M(\xi ')\). From Lemma 4, it follows that G satisfies the normality inequality of the General Equivalence Theorem for any nuisance regressors and, consequently, also for the extended model. By Lemma 3, a design \(\xi \) is \(\Phi \)-optimal in the extended model if and only if \(M(\xi )GK=K\). The equality \(M(\xi )GK=K\) holds if and only if \(\xi \) satisfies (i) and (ii) from Lemma 2 in the extended model, i.e.,

$$\begin{aligned} \sum _{t \in \mathfrak T'} \xi (u,t) = w_u^*,\quad u=1,\ldots ,v, \end{aligned}$$
(16)
$$\begin{aligned} \begin{bmatrix} \frac{1}{w_1^*}\sum \limits _{t\in \mathfrak T'} \xi (1,t)h(t),&\ldots ,&\frac{1}{w_v^*}\sum \limits _{t\in \mathfrak T'} \xi (v,t)h(t) \end{bmatrix} Q = 0. \end{aligned}$$
(17)

Let \(\Xi \) be the set of feasible designs in the extended model that satisfy \(\xi (u,z)=0\) for all u, which represents the set of all feasible designs in model (1). The \(\Phi \)-optimal value in the extended model can be attained in \(\Xi \), e.g., \(\xi ^*=w^* \otimes \alpha \), where \(\alpha \) is any nuisance conditions design that satisfies \(\alpha (z)=0\). Thus, a design \(\xi \) is \(\Phi \)-optimal in \(\Xi \) if and only if \(\xi \in \Xi \) and it satisfies (16) and (17), i.e., \(\sum _{t \in \mathfrak T} \xi (u,t) = w_u^*\) for \(u=1,\ldots ,v\), and (5). It follows that a design \(\xi \) is \(\Phi \)-optimal in (1) if and only if its treatment proportions design is \(\Phi \)-optimal and \(\xi \) is nuisance resistant.

Proof of Theorem 5

First, let us assume that Q is of full column rank. Let w be a feasible treatment proportions design, and let P be a \(v \times v\) permutation matrix. We define Pw as the design obtained via the P-permutation of the treatments in w, i.e., \(Pw(u)=w(\pi _P(u))\) for \(u \in \{1,\ldots ,v\}\), where \(\pi _P\) is the permutation of the elements \(\{1,\ldots ,v\}\) that corresponds to the matrix P. Because \(Pw>0\), it is feasible, its moment matrix is \(M(Pw) = PM(w)P^T\), and it has the information matrix \(N_Q(Pw) = (Q^TPM^{-1}(w)P^TQ)^{-1}\).

We will use the well-known fact that for any matrix X, the non-zero eigenvalues of the matrices \(X^TX\) and \(XX^T\) are the same (e.g., 6.54(c) in Seber 2008), including their multiplicities. Let us define \(Y=Q^TM^{-1/2}(w)\) and \(Z=Q^T P M^{-1/2}(w)\). Since \(QQ^T\) is completely symmetric, \(Y^TY = Z^TZ\). Furthermore, \(YY^T=Q^T M^{-1}(w) Q = N_Q^{-1}(w)\) and \(ZZ^T=Q^T P M^{-1}(w) P^T Q = N_Q^{-1}(Pw)\); thus, \(N_Q(w)\) and \(N_Q(Pw)\) have the same set of non-zero eigenvalues. Because they are of the same (full) rank, it follows that \(N_Q(w)\) and \(N_Q(Pw)\) are orthogonally similar and that \(\Phi (Pw) = \Phi (w)\). Note that analogous results hold in the rank-deficient case for the matrices \(C_Q(w)\) and \(C_Q(Pw)\).

The uniform treatment proportions design satisfies

$$\begin{aligned} \Phi (\bar{w})&= \Phi \left( \frac{1}{v!}\sum _{P-\text {perm.}} Pw\right) \ge \frac{1}{v!}\sum _{P-\text {perm.}} \Phi (Pw) \\&= \frac{1}{v!}\sum _{P-\text {perm.}} \Phi (w) = \frac{1}{v!}v! \Phi (w) = \Phi (w), \end{aligned}$$

where the inequality follows from the concavity of \(\Phi \). Thus, \(\bar{w}\) is \(\Phi \)-optimal. \(\square \)

In the proof of the uniqueness of \(\Phi _p\)-optimal treatment proportions in Theorem 6, we will employ the following lemma.

Lemma 5

(Corollary 8.14 from Pukelsheim 2006) Let \(p \in [-\infty ,0]\) and let \(\xi ^*\) be a \(\Phi _p\)-optimal design. Then, if \(p>-\infty \), any other design \(\xi \) is also \(\Phi _p\)-optimal if and only if it satisfies \(M(\xi )GK=K\), where G is a generalized inverse of \(M(\xi ^*)\) that satisfies the normality inequality of the General Equivalence Theorem (see Lemma 3).

Proof of Theorem 6

Note that for \(Q=(-I_g \otimes 1_{v-g}, 1_{g} \otimes I_{v-g})^T\), we have

$$\begin{aligned} QQ^T = \begin{bmatrix} (v-g)I_g&\quad -J_{g \times (v-g)} \\ -J_{(v-g) \times g}&\quad gI_{v-g} \end{bmatrix}. \end{aligned}$$

Let w be a treatment proportions design, let \(P_1\) and \(P_2\) be \(g \times g\) and \((v-g) \times (v-g)\) permutation matrices, respectively, and let

$$\begin{aligned} \tilde{P}=\begin{bmatrix} P_1&\quad 0_{g \times (v-g)} \\ 0_{(v-g) \times g}&\quad P_2 \end{bmatrix}. \end{aligned}$$
(18)

Define \(\tilde{P}w\) to be the design given by the \(\tilde{P}\)-permutations of the treatments. Then, \(M(\tilde{P}w)= \tilde{P}M(w)\tilde{P}^T\) and \(\tilde{P}^T QQ^T \tilde{P} = QQ^T\). From an argument analogous to that used in the proof of Theorem 5, \(C_Q(\tilde{P}w)\) and \(C_Q(w)\) are orthogonally similar; therefore \(\Phi _p(\tilde{P}w) = \Phi _p(w)\).

Let \(\tilde{w} := \frac{1}{(v-g)!g!} \sum _{\tilde{P}} \tilde{P} w\), where the sum is over all \(v \times v\) permutation matrices \(\tilde{P}\) of the form (18). Then, \(\Phi _p(\tilde{w}) \ge \Phi _p(w)\). It follows that an optimal design exists in the class of designs that allocate one weight to each of the first g treatments, say \(\gamma _1\) (\(0<\gamma _1<1/g\)), and another weight \(\gamma _2:=(1-g\gamma _1)/(v-g)\) to each of the other treatments. We denote the total weight of the first g treatments as \(\gamma :=g\gamma _1\) and for a given \(\gamma \), we denote such designs as \(w_\gamma \).

The non-zero eigenvalues of \(C_Q(w_\gamma )\) are inverse to the non-zero eigenvalues of \(V(w_\gamma ):=Q^TM^{-1}(w_\gamma )Q\), where \(M(w_\gamma )=\mathrm {diag}(\gamma _1 1_g, \gamma _2 1_{v-g})\). Let \(X=Q^TM^{-1/2}(w_\gamma )\). Then, the set of non-zero eigenvalues of \(V(w_\gamma ) = XX^T\) coincides with the set of non-zero eigenvalues of

$$\begin{aligned} X^TX&= M^{-1/2}(w_\gamma )QQ^T M^{-1/2}(w_\gamma ) \\&= \begin{bmatrix} (v-g)\gamma _1^{-1}I_g&\quad - (\gamma _1\gamma _2)^{-1/2}J_{g \times (v-g)} \\ - (\gamma _1\gamma _2)^{-1/2}J_{(v-g) \times g}&\quad g\gamma _2^{-1}I_{v-g} \end{bmatrix}. \end{aligned}$$

It can be verified that \(X^TX\) has the following eigenvalues, listed with the corresponding eigenvectors of the form \(x=(x_1^T,x_2^T)^T\), where \(x_1 \in \mathbb {R}^{g}\) and \(x_2 \in \mathbb {R}^{v-g}\): \(\mu _1 = g\gamma _2^{-1}\) with multiplicity (w.m.) \(v-g-1\), \(x_1=0_g\) and \(1_{v-g}^Tx_2=0\); \(\mu _2 = (v-g)\gamma _1^{-1}\) w.m. \(g-1\), \(1_g^Tx_1 = 0\) and \(x_2=0_{v-g}\); \(\mu _3 = (v-g)\gamma _1^{-1} + g\gamma _2^{-1}\) w.m. 1, \(x_1=-(v-g)\gamma _2^{1/2}1_g\) and \(x_2=g\gamma _1^{1/2}1_{v-g}\); and \(\mu _4 = 0\) w.m. 1, \(x_1=\gamma _1^{1/2}1_g\) and \(x_2=\gamma _2^{1/2}1_{v-g}\).

Therefore, the non-zero eigenvalues of \(C_Q(w_\gamma )\) are \(\lambda _1 = \frac{1-g\gamma _1}{g(v-g)}\) w.m. \(v-g-1\), \(\lambda _2 = \frac{\gamma _1}{v-g}\) w.m. \(g-1\) and \(\lambda _3=\frac{\gamma _1(1-g\gamma _1)}{v-g}\) w.m. 1. Thus, for \(p \in (-\infty ,0)\), the \(\Phi _p\)-optimal value of \(w_\gamma \) is obtained by minimizing the convex function

$$\begin{aligned} f_p(\gamma _1) = (v-g-1)\left( \frac{1-g\gamma _1}{g(v-g)}\right) ^p + (g-1)\left( \frac{\gamma _1}{v-g}\right) ^p + \left( \frac{\gamma _1(1-g\gamma _1)}{v-g}\right) ^p \end{aligned}$$

for \(\gamma _1 \in (0,1/g)\). Then, \(f_p'(\gamma _1) = 0\) if and only if

$$\begin{aligned} -(v-g-1)(1-g\gamma _1)^{p-1} + (g-1)(g\gamma _1)^{p-1} + (1-2g\gamma _1)(g\gamma _1)^{p-1}(1-g\gamma _1)^{p-1}=0, \end{aligned}$$

which is equivalent to

$$\begin{aligned} -(v-g-1)(g\gamma _1)^{1-p} + (g-1)(1-g\gamma _1)^{1-p} + 1-2g\gamma _1 = 0. \end{aligned}$$

Using \(\gamma =g\gamma _1\), we obtain (9).

For \(p=0\) the unique maximum of the function \(f_0(\gamma _1)=\lambda _1^{v-g-1}\lambda _2^{g-1}\lambda _3=(1-g\gamma _1)^{v-g}\gamma _1^g/[g^{v-g-1}(v-g)^{v-1}]\) for \(\gamma _1 \in (0,1/g)\) is in \(\gamma _1=1/v\), which means that \(w_\gamma \) is a uniform design. This corresponds to \(\gamma =g/v\), which is obtained if we set \(p=0\) in (9).

The smallest non-zero eigenvalue of \(C_Q(w_\gamma )\) is \(\lambda _3\) and hence the \(\Phi _{-\infty }\)-optimal design can be obtained by maximizing

$$\begin{aligned} f_{-\infty }(\gamma ) = \frac{\gamma (1-\gamma )}{g(v-g)}, \end{aligned}$$

which has a unique maximum in \(\gamma =\frac{1}{2}\).

For \(p \in (-\infty ,0]\), the uniqueness follows from Lemma 5 (as stated in Sect. 8.18 of Pukelsheim 2006, the General Equivalence Theorem also holds for the rank deficient \(\Phi _p\)-criteria; however, instead of negative powers of \(N_K(\xi ^*)\), we have positive powers of \(K^TM^-(\xi ^*)K\) and instead of \(\lambda _{\min }(N_K(\xi ^*))\), we have \(1/\lambda _{\max }(K^TM^-(\xi ^*)K)\)). Denote by \(w^*\) the \(\Phi _p\)-optimal treatment proportions obtained from (9). The design \(w^*\) is \(\Phi _p\)-optimal and the moment matrix \(M(w^*)\) is non-singular, therefore there exists a unique generalized inverse \(G=M^{-1}(w^*)\) satisfying the normality inequality. Then, \(M(w)GQ=Q\) can be satisfied only by \(w=w^*\), because M(w)G is a diagonal matrix and no row of Q is equal to zero.

For \(p=-\infty \) let \(w>0\). Then, the matrix \(V(w)=Q^TM^{-1}(w)Q\) can be expressed as \(V(w) = A_1(w) + A_2(w)\), where \(A_1(w)\) is a block diagonal matrix with the blocks on the diagonal given by \(w_1^{-1}J_{v-g}, \ldots , w_g^{-1}J_{v-g}\) and \(A_2(w) = J_g \otimes \mathrm {diag}(w_{g+1}^{-1}, \ldots , w_v^{-1})\). Let us partition \(u \in \mathbb R^{g(v-g)}\) as \(u^T=(u_1^T,\ldots ,u_g^T)\), where \(u_i \in \mathbb R^{v-g}\) for all i. Then, the largest eigenvalue \(\mu _{\max }\) of V(w) satisfies

$$\begin{aligned} \mu _{\max }&= \max _{||u ||=1} u^T V(w) u \\&= \max _{||u ||=1} \left( \sum _{i=1}^g w_i^{-1}\Big (1_{v-g}^Tu_i\Big )^2 + \sum _{i=1}^g \sum _{j=1}^g u_i^T \mathrm {diag}\Big (w_{g+1}^{-1}, \ldots , w_v^{-1}\Big ) u_j \right) . \end{aligned}$$

For the particular choice of \(u=1_{g(v-g)}/\sqrt{g(v-g)}\), we obtain

$$\begin{aligned} \mu _{\max }&\ge \sum \limits _{i=1}^g w_i^{-1}\left( \frac{v-g}{\sqrt{g(v-g)}}\right) ^2 + \sum \limits _{i=1}^g \sum \limits _{j=1}^g \sum \limits _{k=g+1}^v \left( \frac{1}{\sqrt{g(v-g)}}\right) ^2 w_k^{-1} \\&= \frac{v-g}{g} \sum \limits _{i=1}^g w_i^{-1} + \frac{g}{v-g} \sum \limits _{i=g+1}^v w_i^{-1}. \end{aligned}$$

If w does not satisfy \(w_1 = \cdots = w_g\) and \(w_{g+1} = \cdots = w_v\), then define \(\tilde{w}\) such that \(\tilde{w}_1 = \cdots = \tilde{w}_g = \sum _{i=1}^g w_i/g\) and \(\tilde{w}_{g+1} = \cdots = \tilde{w}_v = \sum _{i=g+1}^v w_i/(v-g)\). Then, from the first part of the proof, we have \(\mu _{\max }(\tilde{w}) = \mu _3(\tilde{w}) = (v-g)\tilde{w}_1^{-1} + g\tilde{w}_{g+1}^{-1}\). From the inequality of arithmetic and harmonic means, it follows that \(\frac{v-g}{g}\sum _{i=1}^g w_i^{-1} \ge (v-g)\tilde{w}_{1}^{-1}\) and \(\frac{g}{v-g}\sum _{i=g+1}^v w_i^{-1} \ge g\tilde{w}_{g+1}^{-1}\), and at least one of these inequalities is strict. Hence, \(\mu _{\max }(w) > \mu _{\max }(\tilde{w})\) and thus any \(\Phi _{-\infty }\)-optimal w must satisfy \(w_1 = \cdots = w_g\) and \(w_{g+1} = \cdots = w_v\). From the first part of the proof it follows that the only \(\Phi _{-\infty }\)-optimal design that satisfies these conditions is \(w^* = (1_g^T/(2g),1_{v-g}^T /(2(v-g)))^T \).

If \(g=1\), the matrix Q is of full rank \(v-1\), and the eigenvalues of V(w) are inverses of the eigenvalues of \(N_Q(w)\). Therefore, the results hold also for \(g=1\). \(\square \)

Proof of Theorem 7

This proof will closely follow the proof of Theorem 6. The covariance matrix of the least-square estimator is proportional to \(V(w) = Q^TM^{-1}(w)Q\). Note that the \(\textit{MV}\)-optimality criterion \(\Phi _{MV}\) is permutationally invariant, because it depends only on the diagonal of the variance matrix.

Let w be a treatment proportions design and let \(\tilde{P}\), \(\tilde{w}\), \(\gamma _1\), \(\gamma _2\), \(\gamma \) and \(w_\gamma \) be defined as in the proof of Theorem 6. Then, \(Q^T\tilde{P} = BQ^T\), where \(B=P_1 \otimes P_2\), which is a permutation matrix. Thus, \(V(\tilde{P}w) = BV(w)B^T\), \(\Phi _{MV}(\tilde{P}w) = \Phi _{MV}(w)\) and \(\Phi _{MV}(\tilde{w}) \le \Phi _{MV}(w)\). It follows that an optimal design exists in the class of designs \(w_\gamma \).

We have \(V(w_\gamma )= \gamma _1^{-1}I_g \otimes J_{v-g} + \gamma _2^{-1} J_g \otimes I_{v-g}\) and all of its diagonal elements are \(\gamma _1^{-1} + \gamma _2^{-1}\). Thus, the optimal \(\gamma _1\) may be obtained by minimizing

$$\begin{aligned} f_{MV}(\gamma _1) = \gamma _1^{-1} + \frac{v-g}{1-g\gamma _1}, \end{aligned}$$

which has minimum in \(\gamma _1^* = \frac{\sqrt{g(v-g)}-g}{g(v-2g)}\); therefore, \(\gamma ^* = \frac{\sqrt{g(v-g)}-g}{v-2g}\). \(\square \)

The following lemma provides a method of constructing balanced designs in the model with trigonometric time trend described in (11).

Lemma 6

Let \(l,a \in \mathbb {N}\), let \(\xi _p\) be an exact design of size l and let \(\xi =\xi _p\xi _p \ldots \xi _p\) be the exact design of size \(n=la\) formed through a-fold replication of \(\xi _p\). Assume that \(b \in \mathbb {N}\) is not an integer multiple of a. Then, \(\xi \) is balanced for the nuisance regressors of the form \(\cos (b\phi _nt)\) and \(\sin (b\phi _nt)\), \(t=1,\ldots ,n\).

Proof

Let \(u \in \{1,\ldots ,v\}\). Using the fact that \(\xi (u,k+lj)=\xi _p(u,k)\) for all \(k \in \{1,\ldots ,l\}\) and \(j \in \{0,\ldots ,a-1\}\), we obtain

$$\begin{aligned} \sum _{t=1}^n \xi (u,t)\cos (b\phi _nt)+\mathrm {i} \sum _{t=1}^n \xi (u,t)\sin (b\phi _nt) = \sum _{t=1}^n \xi (u,t)e^{b\phi _nt\mathrm {i}} \\ = \sum _{j=0}^{a-1}\sum _{k=1}^l\xi (u,k+lj)e^{b\phi _n(k+lj)\mathrm {i}} =\left( \sum _{k=1}^l\xi _p(u,k)e^{b\phi _nk\mathrm {i}}\right) \left( \sum _{j=0}^{a-1}e^{(b\phi _nl \mathrm {i})j}\right) . \end{aligned}$$

Note that if b is not an integer multiple of a, then \(b\phi _nl=2\pi (b/a)\) is not an integer multiple of \(2\pi \), which implies that \(e^{b\phi _nl\mathrm {i}} \ne 1\). In that case

$$\begin{aligned} \sum _{j=0}^{a-1}e^{(b\phi _nl\mathrm {i})j}=\frac{1-e^{b2\pi \mathrm {i}}}{1-e^{b\phi _n l\mathrm {i}}}=0. \end{aligned}$$

\(\square \)

Proof of Proposition 4

The proposition follows from Theorem 2 and Lemma 6. \(\square \)

Proof of Proposition 5

It is well known that a point x is a vertex of the set \(\{x|Ax=b,x\ge 0\}\) if and only if the system \(\{A_j| x_j > 0\}\), where \(A_j\) is the jth column of A, is of full rank.

The matrix A consists of \(v+(v-1)d+n\) rows, but these rows are linearly dependent. Let k be the affine dimension of \(\{h(t)\}_{t \in \mathfrak T}\) and, without loss of generality, let \(\mathfrak T=\{1,\ldots ,n\}\). Then, the matrix \([h(2)-h(1), \ldots , h(n)-h(1)]\) is of rank k and thus its row space has dimension k. That is, without the loss of generality, we obtain that \(h_i(t)-h_i(1) = \sum _{j=1}^k c_j^{(i)}(h_j(t)-h_j(1))\) for some \( c_1^{(i)}, \ldots , c_k^{(i)} \in \mathbb {R}\), for \(i>k\) and \(t\in \{1,\ldots ,n\}\) (for \(t=1\), we formally obtain \(0=0\)). Let \(u \in \{1,\ldots ,v\}\). Then, if (ii) is satisfied for \(h_1, \ldots , h_k\), for all \(i>k\) and \(u \in \{1,\ldots ,v\}\) we have:

$$\begin{aligned} w_1^{-1} \sum _t \xi (1,t) h_i(t)= & {} w_1^{-1} \left( h_i(1) - \sum _{j=1}^k c_j^{(i)}h_j(1)\right) \sum _t \xi (1,t)\\&+ \sum _{j=1}^k c_j^{(i)}w_1^{-1} \sum _t \xi (1,t)h_j(t) \\= & {} h_i(1) - \sum _{j=1}^k c_j^{(i)}h_j(1) + \sum _{j=1}^k c_j^{(i)}w_u^{-1} \sum _t \xi (u,t)h_j(t) \\= & {} w_u^{-1} \sum _t \xi (u,t)\left( h_i(1) - \sum _{j=1}^k c_j^{(i)}h_j(1)\right) \\&+ \sum _{j=1}^k c_j^{(i)}w_u^{-1} \sum _t \xi (u,t)h_j(t) \\= & {} w_u^{-1}\sum _t \xi (u,t) \left[ \left( h_i(1) - \sum _{j=1}^k c_j^{(i)}h_j(1)\right) + \sum _{j=1}^k c_j^{(i)}h_j(t)\right] \\= & {} w_u^{-1} \sum _t \xi (u,t) h_i(t), \end{aligned}$$

where the second and the third equalities also use (i). It follows that (ii) provides at most \(k(v-1)\) additional linearly independent equalities.

If \(\xi \) satisfies (i), then \(\sum _{u,t} \xi (u,t)=1\) holds. Thus, if \(\xi \) satisfies (iii) for \(t=1, \ldots , n-1\), we have \(1= \sum _{t=1}^{n-1} \sum _u \xi (u,t) + \sum _u \xi (u,n) = \frac{n-1}{n} + \sum _u \xi (u,n)\) and therefore (iii) also holds for \(t=n\). That is, (iii) provides only \(n-1\) additional linearly independent equalities. Hence, A is of rank at most \(v + (v-1)k + n-1\) and a vertex x contains at most \(v + (v-1)k + n-1\) support points. \(\square \)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Rosa, S., Harman, R. Optimal approximate designs for estimating treatment contrasts resistant to nuisance effects. Stat Papers 57, 1077–1106 (2016). https://doi.org/10.1007/s00362-016-0809-0

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00362-016-0809-0

Keywords

Mathematics Subject Classification

Navigation