Abstract
Uncertainty quantification is often followed by the computation of sensitivity indices of the input factors. Variance-based sensitivity analysis and multivariate sensitivity analysis (MSA) aim to apportion the variability of the model output(s) among the input factors and their interactions. Sobol’ indices (first-order and total indices), which quantify the effects of the input factor(s), serve as a practical tool for assessing interactions among input factors, as well as the order and magnitude of those interactions. In this paper, we investigate a novel way of estimating both the first-order and total indices based on U-statistics, and we study the statistical properties of the new estimator. First, we provide a minimum variance unbiased estimator of the non-normalized Sobol’ indices, together with its optimal rate of convergence and its asymptotic distribution. Second, we derive a joint estimator of Sobol’ indices and establish its consistency and its asymptotic distribution. Third, we demonstrate the applicability of these results by means of numerical tests. The new estimator improves the estimation of Sobol’ indices for some degrees of the kernel.
References
Antoniadis A, Pasanisi A (2012) Modeling of computer experiments for uncertainty propagation and sensitivity analysis. Stat Comput 22(3):677–679. https://doi.org/10.1007/s11222-011-9282-8
Bolado-Lavin R, Castaings W, Tarantola S (2009) Contribution to the sample mean plot for graphical and numerical sensitivity analysis. Reliab Eng Syst Saf 94(6):1041–1049
Borgonovo E, Tarantola S, Plischke E, Morris MD (2014) Transformations and invariance in the sensitivity analysis of computer experiments. J R Stat Soc Ser B 76(5):925–947
Buzzard GT (2012) Global sensitivity analysis using sparse grid interpolation and polynomial chaos. Reliab Eng Syst Saf 107:82–89
Caflisch RE, Morokoff W, Owen AB (1997) Valuation of mortgage backed securities using Brownian bridges to reduce effective dimension. J Comput Financ 1:27–46
Chan K, Saltelli A, Tarantola S (2000) Winding stairs: a sampling tool to compute sensitivity indices. Stat Comput 10(3):187–196. https://doi.org/10.1023/A:1008950625967
Conti S, O’Hagan A (2010) Bayesian emulation of complex multi-output and dynamic computer models. J Stat Plan Inference 140(3):640–651
Dutang C, Savicky P (2013) randtoolbox: generating and testing random numbers. R package version 1:13
Fang S, Gertner GZ, Shinkareva S, Wang G, Anderson A (2003) Improved generalized Fourier amplitude sensitivity test (FAST) for model assessment. Stat Comput 13(3):221–226. https://doi.org/10.1023/A:1024266632666
Ferguson TS (1996) A course in large sample theory. Chapman-Hall, New York
Gamboa F, Janon A, Klein T, Lagnoux A (2014) Sensitivity indices for multivariate outputs. Comptes Rendus de l’Académie des Sciences 351(7–8):307–310
Ghanem R, Higdon D, Owhadi H (2017) Handbook of uncertainty quantification. Springer, Cham
Hoeffding W (1948a) A class of statistics with asymptotically normal distribution. Ann Math Stat 19:293–325
Hoeffding W (1948b) A non-parametric test for independence. Ann Math Stat 19:546–557
Homma T, Saltelli A (1996) Importance measures in global sensitivity analysis of nonlinear models. Reliab Eng Syst Saf 52:1–17
Jansen MJW (1999) Analysis of variance designs for model output. Comput Phys Commun 117:35–43
Jourdan A (2012) Global sensitivity analysis using complex linear models. Stat Comput 22(3):823–831. https://doi.org/10.1007/s11222-011-9239-y
Kucherenko S, Rodriguez-Fernandez M, Pantelides C, Shah N (2009) Monte Carlo evaluation of derivative-based global sensitivity measures. Reliab Eng Syst Saf 94:1135–1148
Kucherenko S, Feil B, Shah N, Mauntz W (2011) The identification of model effective dimensions using global sensitivity analysis. Reliab Eng Syst Saf 96(4):440–449
Kucherenko S, Tarantola S, Annoni P (2012) Estimation of global sensitivity indices for models with dependent variables. Comput Phys Commun 183(4):937–946
Kucherenko S, Delpuech B, Iooss B, Tarantola S (2015) Application of the control variate technique to estimation of total sensitivity indices. Reliab Eng Syst Saf 134:251–259
Lamboni M (2016a) Global sensitivity analysis: a generalized, unbiased and optimal estimator of total-effect variance. Stat Pap 59(1):361–386. https://doi.org/10.1007/s00362-016-0768-5
Lamboni M (2016b) Global sensitivity analysis: an efficient numerical method for approximating the total sensitivity index. Int J Uncertain Quant (accepted)
Lamboni M, Makowski D, Monod H (2008) Multivariate global sensitivity analysis for discrete-time models. Rapport technique 2008-3. INRA, UR341 Mathématiques et Informatique Appliquées, Jouy-en-Josas, France
Lamboni M, Makowski D, Lehuger S, Gabrielle B, Monod H (2009) Multivariate global sensitivity analysis for dynamic crop models. Fields Crop Res 113:312–320
Lamboni M, Monod H, Makowski D (2011) Multivariate sensitivity analysis to measure global contribution of input factors in dynamic models. Reliab Eng Syst Saf 96:450–459
Lamboni M, Iooss B, Popelin AL, Gamboa F (2013) Derivative-based global sensitivity measures: general links with Sobol’ indices and numerical tests. Math Comput Simul 87:45–54
Lehmann EL (1951) Consistency and unbiasedness of certain nonparametric tests. Ann Math Stat 22:165–179
Lehmann EL (1999) Elements of large sample theory. Springer, New York
Mara TA, Joseph OR (2008) Comparison of some efficient methods to evaluate the main effect of computer model factors. J Stat Comput Simul 78(2):167–178. https://doi.org/10.1080/10629360600964454
Mara TA, Tarantola S (2012) Variance-based sensitivity indices for models with dependent inputs. Reliab Eng Syst Saf 107:115–121
Mara TA, Tarantola S, Annoni P (2015) Non-parametric methods for global sensitivity analysis of model output with dependent inputs. Environ Model Softw 72:173–183
Marrel A, Iooss B, Da Veiga S, Ribatet M (2012) Global sensitivity analysis of stochastic computer models with joint metamodels. Stat Comput 22(3):833–847. https://doi.org/10.1007/s11222-011-9274-8
Muehlenstaedt T, Roustant O, Carraro L, Kuhnt S (2012) Data-driven kriging models based on FANOVA-decomposition. Stat Comput 22(3):723–738. https://doi.org/10.1007/s11222-011-9259-7
Oakley JE, O’Hagan A (2004) Probabilistic sensitivity analysis of complex models: a Bayesian approach. J R Stat Soc Ser B (Stat Methodol) 66(3):751–769
Owen AB (2013a) Better estimation of small Sobol’ sensitivity indices. ACM Trans Model Comput Simul 23:111–1117
Owen AB (2013b) Variance components and generalized Sobol’ indices. SIAM/ASA J Uncertain Quant 1(1):19–41
Plischke E, Borgonovo E, Smith CL (2013) Global sensitivity measures from given data. Eur J Oper Res 226(3):536–550
Pujol G, Iooss B, Janon A (2013) sensitivity: sensitivity analysis. R package version 1:7
Rao CR, Kleffe J (1988) Estimation of variance components and applications. North Holland, Amsterdam
Ratto M, Pagano A (2010) Using recursive algorithms for the efficient identification of smoothing spline ANOVA models. AStA Adv Stat Anal 94(4):367–388
Ratto M, Pagano A, Young P (2007) State dependent parameter metamodelling and sensitivity analysis. Comput Phys Commun 177(11):863–876
Saltelli A (2002) Making best use of model evaluations to compute sensitivity indices. Comput Phys Commun 145:280–297
Saltelli A, Tarantola S, Chan K (1999) Quantitative model independent methods for global sensitivity analysis of model output. Technometrics 41:39–56
Saltelli A, Chan K, Scott E (2000) Variance-based methods. Probability and statistics. Wiley, New York
Saltelli A, Annoni P, Azzini I, Campolongo F, Ratto M, Tarantola S (2010) Variance based sensitivity analysis of model output. Design and estimator for the total sensitivity index. Comput Phys Commun 181(2):259–270
Sobol IM (1993) Sensitivity analysis for non-linear mathematical models. Math Model Comput Exp 1:407–414
Sobol IM (2001) Global sensitivity indices for nonlinear mathematical models and their Monte Carlo estimates. Math Comput Simul 55:271–280
Sobol IM (2007) Global sensitivity analysis indices for the investigation of nonlinear mathematical models. Matematicheskoe Modelirovanie 19:23–24
Sobol IM, Kucherenko S (2009) Derivative based global sensitivity measures and the link with global sensitivity indices. Math Comput Simul 79:3009–3017
Storlie CB, Swiler LP, Helton JC, Sallaberry CJ (2009) Implementation and evaluation of nonparametric regression procedures for sensitivity analysis of computationally demanding models. Reliab Eng Syst Saf 94(11):1735–1763
Sudret B (2008) Global sensitivity analysis using polynomial chaos expansions. Reliab Eng Syst Saf 93(7):964–979
Sugiura N (1965) Multisample and multivariate nonparametric tests based on U statistics and their asymptotic efficiencies. Osaka J Math 2(2):385–426. http://projecteuclid.org/euclid.ojm/1200691466
Appendices
Appendix A: A useful summary of U-statistics theory
This section presents the main theorems about U-statistics used in this paper. While we consider only three independent samples, the general version of the theorems can be found in Ferguson (1996), Hoeffding (1948a, b) and Sugiura (1965).
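Let \((X_1, \ldots , X_{n_1})\), \((Y_1, \ldots , Y_{n_2})\), and \((Z_1, \ldots , Z_{n_3})\) be three independent samples of sizes \(n_1\), \(n_2\), and \(n_3\) from the distributions \(D_1\), \(D_2\), and \(D_3\), respectively. Suppose that our parameter of interest \(\theta \) is defined as follows:
$$\begin{aligned} \theta = \mathbb {E}\Big [K\Big (X_{1},\ldots , X_{p}; Y_{1}, \ldots , Y_{q}; Z_{1}, \ldots , Z_{r} \Big ) \Big ], \end{aligned}$$(6.1)
where \( K\left( \cdot \right) \) is a kernel with \(p+q+r\) arguments that is symmetric within each of its three groups of arguments (the \(p\) arguments from the first sample, the \(q\) from the second, and the \(r\) from the third).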
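The U-statistic corresponding to \(\theta \) is given by
$$\begin{aligned} \widehat{\theta } = \left[ \binom{n_1}{p} \binom{n_2}{q} \binom{n_3}{r} \right] ^{-1} \sum K\Big (X_{i_1},\ldots , X_{i_p}; Y_{j_1}, \ldots , Y_{j_q}; Z_{k_1}, \ldots , Z_{k_r} \Big ), \end{aligned}$$(6.2)
where the sum runs over all indices \(1 \le i_1< \cdots < i_p \le n_1\), \(1 \le j_1< \cdots < j_q \le n_2\), and \(1 \le k_1< \cdots < k_r \le n_3\).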
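The estimator \(\widehat{\theta }\) is unbiased, and its variance is given by
$$\begin{aligned} \mathbb {V}\big (\widehat{\theta }\big ) = \left[ \binom{n_1}{p} \binom{n_2}{q} \binom{n_3}{r} \right] ^{-1} \sum _{i=0}^{p} \sum _{j=0}^{q} \sum _{k=0}^{r} \binom{p}{i} \binom{n_1-p}{p-i} \binom{q}{j} \binom{n_2-q}{q-j} \binom{r}{k} \binom{n_3-r}{r-k} \, \sigma _{i,j,k}^2, \end{aligned}$$(6.3)
with \(\sigma _{i,j,k}^2= \mathbb {V}\Big [ \mathbb {E}\Big [K\Big (X_{1},\ldots , X_{p}; Y_{1}, \ldots , Y_{q}; Z_{1}, \ldots , Z_{r} \Big ) | X_{1}, \ldots , X_{i}; Y_{1}, \ldots , Y_{j}; Z_{1}, \ldots , Z_{k} \Big ]\Big ] \) for \(0 \le i \le p\), \(0 \le j \le q\), and \(0 \le k \le r\), so that \(\sigma _{0,0,0}^2 = 0\).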
The U-statistic \(\widehat{\theta }\) is a function of the order statistics, as it is symmetric with respect to each of its three groups of arguments. It is known that the order statistics are sufficient and complete for non-parametric families such as the class of distribution functions having finite fourth moments. By the Lehmann–Scheffé theorem, it follows that \(\widehat{\theta }\) is the unique uniformly minimum variance unbiased estimator of \(\theta \), given \(X_1, \ldots , X_{n_1}; Y_1, \ldots , Y_{n_2}; Z_1, \ldots , Z_{n_3}\). We also have the asymptotic normality of the estimator \(\widehat{\theta }\).
The same results can be derived for the multivariate U-statistics, and all elements can be found in Ferguson (1996), Lehmann (1999), Hoeffding (1948a, b) and Sugiura (1965).
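As an illustration of the construction summarized in this appendix, the following minimal sketch computes a three-sample U-statistic by brute force, averaging a kernel over all ways of choosing \(p\), \(q\), and \(r\) distinct observations from the three samples. The function names `u_statistic` and `example_kernel`, the kernel itself, and the data are hypothetical choices made for this example; they are not taken from the paper. The kernel \(\frac{1}{2}(x_1-x_2)^2 + yz\) has \(p=2\), \(q=r=1\) and targets \(\mathbb {V}(X) + \mathbb {E}(Y)\,\mathbb {E}(Z)\).

```python
from itertools import combinations, product
from statistics import mean, variance


def u_statistic(kernel, xs, ys, zs, p, q, r):
    """Average `kernel` over every choice of p, q and r distinct
    observations taken from the three samples (brute-force enumeration)."""
    values = [
        kernel(x_group, y_group, z_group)
        for x_group, y_group, z_group in product(
            combinations(xs, p), combinations(ys, q), combinations(zs, r)
        )
    ]
    return mean(values)


def example_kernel(x_group, y_group, z_group):
    """Hypothetical kernel with p=2, q=1, r=1: (x1 - x2)^2 / 2 + y * z.
    It is symmetric in (x1, x2), as a U-statistic kernel must be."""
    x1, x2 = x_group
    (y,) = y_group
    (z,) = z_group
    return 0.5 * (x1 - x2) ** 2 + y * z


# Small illustrative data set (made up for this example).
xs = [1.0, 2.0, 4.0, 7.0]
ys = [1.0, 3.0]
zs = [2.0, 4.0, 6.0]

theta_hat = u_statistic(example_kernel, xs, ys, zs, p=2, q=1, r=1)

# For this kernel the U-statistic coincides with the unbiased sample
# variance of xs plus the product of the sample means of ys and zs.
closed_form = variance(xs) + mean(ys) * mean(zs)
print(theta_hat, closed_form)  # both equal 15.0 for this data
```

The enumeration costs \(\binom{n_1}{p}\binom{n_2}{q}\binom{n_3}{r}\) kernel evaluations, so this form is only practical for small samples or small degrees; it is meant to make the definition concrete rather than to be efficient.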
Appendix B: A generalized estimator of the total sensitivity index
This section mainly recalls Corollary 2 from Lamboni (2016a).
For \(p\ge 2\), let \(\mathcal {X}=(\mathbf {X}^{(1)}_{u},\ldots , \mathbf {X}^{(p)}_{u})\) be \(p\) i.i.d. copies of \(\mathbf {X}_{u}\) and \(\mathcal {Y}=\mathbf {X}_{\sim u}\). We consider the following symmetric kernel.
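It is known that the expectation of \(K\left( \mathbf {X}_{u}^{(1)}, \ldots , \mathbf {X}_{u}^{(p)}, \mathbf {X}_{\sim u} \right) \) is the non-normalized total index, that is,
$$\begin{aligned} D_{u}^{\textit{tot}} = \mathbb {E}\left[ K\left( \mathbf {X}_{u}^{(1)}, \ldots , \mathbf {X}_{u}^{(p)}, \mathbf {X}_{\sim u} \right) \right] . \end{aligned}$$(6.5)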
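Given two independent samples \(\mathcal {X}_i= (\mathbf {X}^{(1)}_{i,u},\ldots , \mathbf {X}^{(p)}_{i,u})\) and \(\mathcal {Y}_i =\mathbf {X}_{i,\sim u}\), \(i=1, 2, \ldots , m\), from \(\mathcal {X}\) and \(\mathcal {Y}\) respectively, the U-statistic corresponding to the kernel in (6.4) is given by
$$\begin{aligned} \widehat{D}_{u}^{\textit{tot}} = \frac{1}{m} \sum _{i=1}^{m} K\left( \mathbf {X}_{i,u}^{(1)}, \ldots , \mathbf {X}_{i,u}^{(p)}, \mathbf {X}_{i,\sim u} \right) . \end{aligned}$$(6.6)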
The properties of \(\widehat{D}_{u}^{\textit{tot}}\) are as follows:
- (i) \(\widehat{D}_{u}^{\textit{tot}}\) defined in (6.6) is an MVUE of \(D_u^{\textit{tot}}\);
- (ii) the variance of \(\widehat{D}_{u}^{\textit{tot}}\) is given by
$$\begin{aligned} \mathbb {V}(\widehat{D}_{u}^{\textit{tot}}) = \frac{\sigma _{p,1}^2 }{m}, \end{aligned}$$(6.7)
with \(\sigma _{p,1}^2 \) the variance of the kernel. Moreover, we have
$$\begin{aligned} m\mathbb {E}\left( \widehat{D}_{u}^{\textit{tot}} - D_{u}^{\textit{tot}} \right) ^2 = \sigma _{p,1}^2; \end{aligned}$$(6.8)
- (iii) if \(m \rightarrow +\infty \), we have
$$\begin{aligned} \sqrt{m}\left( \widehat{D}_{u}^{\textit{tot}} - D_{u}^{\textit{tot}} \right) \xrightarrow {\mathcal {D}} \mathcal {N}\left( 0, \sigma _{p,1}^2 \right) . \end{aligned}$$(6.9)
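The properties above can be checked numerically on a toy problem. The sketch below rests on several assumptions that are not taken from the paper: the model output is scalar, the kernel is the unbiased sample variance of the \(p\) model outputs that share the same \(\mathbf {X}_{\sim u}\) (one symmetric kernel whose expectation is the non-normalized total index, which may differ from the exact form in Lamboni (2016a)), the estimator is the average of the kernel over \(m\) independent draws, and the inputs are independent standard normal variables. The model \(f(x_u, x_{\sim u}) = x_u + x_u x_{\sim u}\) and all function names are hypothetical. For this model, \(D_u^{\textit{tot}} = \mathbb {E}\left[ (1+X_{\sim u})^2\right] = 2\).

```python
import random
from statistics import mean, variance


def model(x_u, x_rest):
    """Toy scalar model (an assumption): f(x_u, x_rest) = x_u + x_u * x_rest."""
    return x_u + x_u * x_rest


def kernel(x_u_copies, x_rest):
    """Assumed kernel: unbiased sample variance of the p model outputs
    obtained by varying x_u while x_rest is held fixed."""
    outputs = [model(x_u, x_rest) for x_u in x_u_copies]
    p = len(outputs)
    centre = sum(outputs) / p
    return sum((y - centre) ** 2 for y in outputs) / (p - 1)


def total_index_estimate(m, p, rng):
    """Average the kernel over m independent draws. Also return a plug-in
    standard error sqrt(s^2 / m), where s^2 is the empirical variance of
    the kernel values, in the spirit of (6.7)."""
    values = []
    for _ in range(m):
        x_u_copies = [rng.gauss(0.0, 1.0) for _ in range(p)]  # p copies of X_u
        x_rest = rng.gauss(0.0, 1.0)                          # one draw of X_~u
        values.append(kernel(x_u_copies, x_rest))
    return mean(values), (variance(values) / m) ** 0.5


rng = random.Random(0)  # fixed seed so the run is reproducible
estimate, std_error = total_index_estimate(m=20000, p=3, rng=rng)

# Approximate 95% interval based on the asymptotic normality in (6.9).
print(estimate, (estimate - 1.96 * std_error, estimate + 1.96 * std_error))
```

Each draw costs \(p\) model evaluations, so for a fixed budget of model runs a larger \(p\) means a smaller \(m\); the sketch does not attempt to choose \(p\) optimally.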
About this article
Cite this article
Lamboni, M. Uncertainty quantification: a minimum variance unbiased (joint) estimator of the non-normalized Sobol’ indices. Stat Papers 61, 1939–1970 (2020). https://doi.org/10.1007/s00362-018-1010-4
DOI: https://doi.org/10.1007/s00362-018-1010-4