Abstract
For a rational two-dimensional nonlinear in parameters model used in analytical chemistry, we investigate how homothetic transformations of the design space affect the number of support points in the optimal designs. We show that there exist two types of optimal designs: a saturated design (i.e. a design with the number of support points which is equal to the number of parameters) and an excess design (i.e. a design with the number of support points which is greater than the number of parameters). The optimal saturated designs are constructed explicitly. Numerical methods for constructing optimal excess designs are used.
Similar content being viewed by others
References
Atkinson AC, Fedorov VV (1975a) The designs of experiments for discriminating between two rival models. Biometrika 62:57–70
Atkinson AC, Fedorov VV (1975b) Optimal design: experiments for discriminating between several models. Biometrika 62:289–303
Atkinson AC, Donev AN, Tobias RD (2007) Optimum experimental designs, with SAS. Oxford University Press, Oxford
Ayen R, Peters MS (1962) Catalytic reduction of nitric oxide. Ind Eng Chem Process Des 1(3):204–207
Box JEP, Hunter WG (1965a) The experimental study of physical mechanisms. Technometrics 7:23–42
Box JEP, Hunter WG (1965b) Sequential design of experiments for nonlinear models. In: Proceedings of IBM scientific computing symposium on statistics, pp 113–137
Chaloner K, Verdinelli I (1995) Bayesian experimental design: a review. Stat Sci 10:273–304
Chernoff H (1953) Locally optimal designs for estimating parameters. Ann Math Stat 24:586–602
de la Garza A (1954) Spacing of information in polynomial regression. Ann Math Stat 25:123–130
Dette H (1997) Designing experiments with respect to “standardized” optimality criteria. J R Stat Soc 59:97–110
Dette H, Melas VB (2011) A note on the de la Garza phenomenon for locally optimal designs. Ann Stat 39(2):1266–1281
Ermakov SM, Kulikov DV, Leora SN (2017) Towards the analysis of the simulated annealing method in the multiextremal case. Vestn St. Petersb Univ Math 50(2):132–137
Fedorov VV (1972) Theory of optimal experiment. Academic Press, New York
Filipe J, Adams FG (1987) The estimation of the Cobb–Douglas function: a retrospective view. East Econ J 31(3):427–445
Grigoriev YD, Melas VB, Shpilev PV (2017) Excess of locally \(D\)-optimal designs and homothetic transformations. Vestn St. Petersb Univ Math 50(4):329–336
Grigoriev YD, Melas VB, Shpilev PV (2018) Excess of locally \(D\)-optimal designs for Cobb–Douglas model. Stat Pap 59(4):1425–1439
Kiefer J (1974) General equivalence theory for optimum designs (approximate theory). Ann Stat 2:849–879
Laible JR (1959) The kinetics of the catalytic dehydration of certain tertiary and long chain primary alcohols. University of Wisconsin, Microfilmed Ph.D. Thesis, Madison
Melas VB (2006) Functional approach to optimal experimental design. Springer, New York
Pukelsheim F (2006) Optimal design of experiments. SIAM, Philadelphia
Pukelsheim F, Rieder S (1992) Efficient rounding of approximate designs. Biometrika 79:763–770
Seber GAF, Wild CJ (1989) Nonlinear regression. Wiley, New York
Whittle P (1973) Some general points in the theory and construction of \(D\)-optimal experimental design. J R Stat Soc B 35:123–130
Yang M (2010) On the de la Garza phenomenon. Ann Stat 38:2499–2524
Yang M, Stufken J (2009) Support points of locally optimal designs for nonlinear models with two parameters. Ann Stat 37:518–541
Yang M, Stufken J (2012) Identifying locally optimal designs for nonlinear models: a simple extension with profound consequences. Ann Stat 40(3):1665–1681
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This work was partially supported by Russian Foundation for Basic Research (Project Nos. 17-01-00267-a, 20-01-00096-a).
Appendix
Appendix
Throughout this section without losing generality, we can assume that \(\gamma =1\). Also, due to the fact that the D-optimal design for model (3.1) doesn’t depend on the parameter \(\theta _0\), we put \(\theta _0\equiv 1.\) To prove Theorem 2 we need to establish the following lemma first:
Lemma 1
A saturated D-optimal design for model (3.1) is concentrated at points
where the point C coincides with one of three points: \((b_1, (1+b_1\theta _1)/\theta _2)\), \((b_1, b_2)\), \(((1+b_2\theta _2)/\theta _1, b_2)\).
Proof of Lemma 1
According to Theorem 1 a design \(\xi \) is D-optimal for model (3.1) if it satisfies the equation
For model (3.1) the function \(d((x_1,x_2),\xi )\) has a form:
where \(a_i\) are coefficients depending on \(\theta _i\) and elements of the matrix \(M^{-1}(\xi )\). The analysis of this function shows that it has no more than 4 maxima at points
Indeed, for any given non-singular design \(\xi \) and fixed \(x_1=x_1^{*}\in (0,b_1]\) the function \(d((x_1^{*},x_2),\xi )\) has no more than two maxima at the point \(x_2=0\) and at point \(x_2=x_2^{*}\in (0,b_2]\) on the interval \([0,b_2]\). On the other hand, for fixed \(x_2=x_2^{*}\in [0,b_2]\) the function \(d((x_1,x_2^{*}),\xi )\) has also no more than two maxima at the point \(x_1=b_1\) and at the point \(x_1=x_1^{*}\in (0,b_1)\) on the interval \([0,b_1].\) This immediately implies that the function \(d((x_1,x_2),\xi )\) has maxima at points of the form (6.2). We have to show now that the number of global maxima is less than or equal to 4. To do this, we prove that the function \(d((x_1,x_2),\xi )\) cannot have two global maxima at points of the form \(\breve{D}\) when \(\alpha _2<b_1,\ \beta _2<b_2\). We prove it by reductio ad absurdum. Let
Consider the line \(x_2=ax_1+b\) that passes through the points \(X^{*}_i\), \(i=1,2.\) We have
After the appropriate replacements, we obtain
Note that the function \((x_1+{\widetilde{c}}_2)^{-4}\) is bounded and strictly increasing (or decreasing) on the interval \([0,b_1]\). On the other hand, the function \(x_1^2({\widetilde{a}}x_1^2+{\widetilde{b}}x_1+{\widetilde{c}})\) has no more than 2 global maxima on the interval \([0,b_1]\) and one of them has to be located at the point \(x_1=b_1\). It follows from this that the function in (6.3) has no more than 2 global maxima at the points \(x_1=b_1\) and \(x_1=x^{*}_1\in (0,b_1)\) on the interval \([0,b_1]\). Thus, we have obtained a contradiction. Therefore, we have proved that the saturated D-optimal design is concentrated at points
Now we have to consider all possible combinations of sets of parameter’s values \(\{\alpha _1,\alpha _2,\beta _1\}\) to show that any saturated D-optimal design is concentrated at points (6.1). Since \(0<\alpha _1<b_1\) we have only 4 combinations: \(\{0<\alpha _1<b_1, \alpha _2=b_1, 0<\beta _1<b_2\}\), \(\{0<\alpha _1<b_1, \alpha _2=b_1, \beta _1=b_2\}\), \(\{0<\alpha _1<b_1, 0<\alpha _2<b_1, \beta _1=b_2\}\), \(\{0<\alpha _1<b_1, 0<\alpha _2<b_1, 0<\beta _1<b_2\}\). By Theorem 1 any saturated D-optimal design \(\xi ^*\) must satisfy the conditions:
This implies that if \((t_{i1},t_{i2})\) is an inner point of the design space \({\mathcal {X}}\) then it satisfies the system of equations:
Thus, for our four combinations we have four different systems of corresponding equations. Since
we immediately obtain that any saturated D-optimal design is concentrated at points (6.1). Note that first 3 combinations give the support points of designs (3.4), (3.5), (3.6) and for the last one (\(\{0<\alpha _2<1, 0<\beta _1<1\}\)) there is no solution:
Lemma 1 is proved. \(\square \)
Proof of Theorem 2
Without losing generality, we can assume that \(b_1=b_2=1\).
Optimality of designs (3.4)–(3.6) is verified directly by Theorem 1. For example, in case (a) for the design \({\bar{\xi }}_1^*\) in form (3.4) (under our assumption) we have
There are two stationary points of the function \(d((t_1,t_2),{\bar{\xi }}_1^*)\) on \([0,1]\times [0,1]:\)
The function \(d((t_1,t_2),{\bar{\xi }}_1^*)\) has the following values at these points:
The Hessian matrix of this function is
Direct calculations show that for eigenvalues \(\mu _{1,2}(Q_1)\) of the matrix H at the point \(Q_1\) the inequalities \(\mu _1(Q_1)>0\) and \(\mu _2(Q_2)<0\) hold. Thus, \(Q_1\) is a saddle point. On the other hand, for the point \(Q_2\) the inequalities \(\mu _1(Q_1)>0\) and \(\mu _2(Q_2)>0\) hold and this entails that \(Q_2\) is a minimum point.
Since there are no stationary points \(Q_i\) of the function \(d((t_1,t_2),{\bar{\xi }}_1^*)\) on \([0,1]\times [0,1]\) such that \(d(Q_i, ,{\bar{\xi }}_1^*)\le 3\) this function reaches its maxima at boundary points. Since \(d((0,t_2),{\bar{\xi }}_1^*)\equiv 0\) we have only four possible candidates:
For the design \({\bar{\xi }}_1^*\) with \(\breve{A}=(( 1/(2+\theta _1),0),\ \breve{C}=(1, (1+\theta _1)/\theta _2))\) we have
and
Note that under fixed \(\theta _1\) this function decreases by \(\theta _2\) on the interval \([\theta _1+1,\infty ).\) Indeed,
Since for case (a) the inequality \(\theta _2\ge \theta _1+1\) holds, we have:
Thus, by Theorem 1 the design in (3.4) for case (a) is D-optimal. The remaining cases are verified in a similar way.
The behavior of the function \(d((t_1,t_2),{\bar{\xi }}_1^*)\) for \(\theta _1=1,\theta _2=3\), \({\mathcal {X}}=[0,1]\times [0,1]\) is depicted in Fig. 9.
Theorem 2 is proved. \(\square \)
Rights and permissions
About this article
Cite this article
Grigoriev, Y.D., Melas, V.B. & Shpilev, P.V. Excess and saturated D-optimal designs for the rational model. Stat Papers 62, 1387–1405 (2021). https://doi.org/10.1007/s00362-019-01140-9
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00362-019-01140-9