Efficient estimation of multiple expectations with the same sample by adaptive importance sampling and control variates

Demange-Chryst, Julien; Bachoc, François; Morio, Jérôme

doi:10.1007/s11222-023-10270-y

Efficient estimation of multiple expectations with the same sample by adaptive importance sampling and control variates

Original Paper
Published: 15 July 2023

Volume 33, article number 103, (2023)
Cite this article

Statistics and Computing Aims and scope Submit manuscript

Julien Demange-Chryst^1,2,
François Bachoc² &
Jérôme Morio¹

186 Accesses
Explore all metrics

Abstract

Some classical uncertainty quantification problems require the estimation of multiple expectations. Estimating all of them accurately is crucial and can have a major impact on the analysis to perform, and standard existing Monte Carlo methods can be costly to do so. We propose here a new procedure based on importance sampling and control variates for estimating more efficiently multiple expectations with the same sample. We first show that there exists a family of optimal estimators combining both importance sampling and control variates, which however cannot be used in practice because they require the knowledge of the values of the expectations to estimate. Motivated by the form of these optimal estimators and some interesting properties, we therefore propose an adaptive algorithm. The general idea is to adaptively update the parameters of the estimators for approaching the optimal ones. We suggest then a quantitative stopping criterion that exploits the trade-off between approaching these optimal parameters and having a sufficient budget left. This left budget is then used to draw a new independent sample from the final sampling distribution, allowing to get unbiased estimators of the expectations. We show how to apply our procedure to sensitivity analysis, by estimating Sobol’ indices and quantifying the impact of the input distributions. Finally, realistic test cases show the practical interest of the proposed algorithm, and its significant improvement over estimating the expectations separately.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Application of the full Bayesian significance test to model selection under informative sampling

Article 08 September 2016

Detecting and diagnosing prior and likelihood sensitivity with power-scaling

Article Open access 31 December 2023

Convergence rates for optimised adaptive importance samplers

Article Open access 21 January 2021

References

Broto, B., Bachoc, F., Depecker, M.: Variance reduction for estimation of Shapley effects and adaptation to unknown input distribution. SIAM/ASA J. Uncertain. Quantif. 8(2), 693–716 (2020)
Article MathSciNet MATH Google Scholar
Bucklew, J.: Introduction to Rare Event Simulation. Springer, Berlin (2004)
Book MATH Google Scholar
Chen, Y., Gupta, M.R., Chen, Y., et al.: EM demystified: an expectation-maximization tutorial. Electric. Eng. (2010)
Cornuet, J.M., Marin, J.M., Mira, A., et al.: Adaptive multiple importance sampling. Scand. J. Stat. 39(4), 798–812 (2012)
Article MathSciNet MATH Google Scholar
Davis, P.J., Rabinowitz, P.: Methods of numerical integration. Cour. Corp. (2007)
De Boer, P.T., Kroese, D.P., Mannor, S., et al.: A tutorial on the cross-entropy method. Ann. Oper. Res. 134(1), 19–67 (2005)
Article MathSciNet MATH Google Scholar
Demange-Chryst, J., Bachoc, F., Morio, J.: Shapley effect estimation in reliability-oriented sensitivity analysis with correlated inputs by importance sampling. Int. J. Uncertain. Quantif. 13(3) (2023)
Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum likelihood from incomplete data via the EM algorithm. J. Roy. Stat. Soc.: Ser. B (Methodol.) 39(1), 1–22 (1977)
MathSciNet MATH Google Scholar
Geyer, S., Papaioannou, I., Straub, D.: Cross entropy-based importance sampling using Gaussian densities revisited. Struct. Saf. 76, 15–27 (2019)
Article Google Scholar
Glynn, P.W., Szechtman, R.: Some new perspectives on the method of control variates. In: Monte Carlo and Quasi-Monte Carlo Methods 2000. Springer, p 27–49 (2002)
Hansen, L.P.: Large sample properties of generalized method of moments estimators. Econom. J. Econom. Soc. pp. 1029–1054 (1982)
He, H.Y., Owen, A.B.: Optimal mixture weights in multiple importance sampling. arXiv preprint arXiv:1411.3954 (2014)
Helton, J.C., Davis, F.J.: Latin hypercube sampling and the propagation of uncertainty in analyses of complex systems. Reliab. Eng. Syst. Saf. 81(1), 23–69 (2003)
Article Google Scholar
Homma, T., Saltelli, A.: Importance measures in global sensitivity analysis of nonlinear models. Reliab. Eng. Syst. Saf. 52(1), 1–17 (1996)
Article Google Scholar
Jagannathan, R., Skoulakis, G., Wang, Z.: Generalized methods of moments: applications in finance. J. Bus. Econ. Stat. 20(4), 470–481 (2002)
Article MathSciNet Google Scholar
Kahn, H., Harris, T.E.: Estimation of particle transmission by random sampling. Natl. Bureau Stand. Appl. Math. Ser. 12, 27–30 (1951)
Kraft, D.: A software package for sequential quadratic programming. Forschungsbericht- Deutsche Forschungs- und Versuchsanstalt fur Luft- und Raumfahrt (1988)
Kullback, S., Leibler, R.A.: On information and sufficiency. Ann. Math. Stat. 22(1), 79–86 (1951)
Article MathSciNet MATH Google Scholar
Kurtz, N., Song, J.: Cross-entropy-based adaptive importance sampling using Gaussian mixture. Struct. Saf. 42, 35–44 (2013)
Article Google Scholar
Leluc, R., Portier, F., Segers, J.: Control variate selection for Monte Carlo integration. Stat. Comput. 31(4), 1–27 (2021)
Article MathSciNet MATH Google Scholar
Li, B., Zhang, L., Zhu, X., et al.: Reliability analysis based on a novel density estimation method for structures with correlations. Chin. J. Aeronaut. 30(3), 1021–1030 (2017)
Article Google Scholar
Marin, J.M., Pudlo, P., Sedki, M.: Consistency of the adaptive multiple importance sampling. arXiv preprint arXiv:1211.2548 (2012)
Morio, J.: Influence of input PDF parameters of a model on a failure probability estimation. Simul. Model. Pract. Theory 19(10), 2244–2255 (2011)
Article Google Scholar
Nelson, B.L.: On control variate estimators. Comput. Oper. Res. 14(3), 219–225 (1987)
Article MathSciNet MATH Google Scholar
Nelson, B.L.: Control variate remedies. Oper. Res. 38(6), 974–992 (1990)
Article MathSciNet MATH Google Scholar
Owen, A., Zhou, Y.: Safe and effective importance sampling. J. Am. Stat. Assoc. 95(449), 135–143 (2000)
Article MathSciNet MATH Google Scholar
Owen, A.B.: Monte Carlo theory, methods and examples (2013)
Owen, A.B.: Sobol’indices and Shapley value. SIAM/ASA J. Uncertain. Quantif. 2(1), 245–251 (2014)
Article MathSciNet MATH Google Scholar
Owen, A.B., Zhou, Y.: Adaptive importance sampling by mixtures of products of beta distributions. Citeseer (1999)
Rubinstein, R.Y., Kroese, D.P.: The Cross-entropy Method: A Unified Approach to Combinatorial Optimization. Monte-Carlo Simulation and Machine Learning, Springer, Berlin (2013)
MATH Google Scholar
Rubinstein, R.Y., Kroese, D.P.: Simulation and the Monte Carlo Method. Wiley, Hoboken (2016)
Book MATH Google Scholar
Saltelli, A., Tarantola, S., Campolongo, F., et al.: Sensitivity Analysis in Practice: A Guide to Assessing Scientific Models, vol. 1. Wiley Online Library, Hoboken (2004)
MATH Google Scholar
Sobol, I.M.: Sensitivity analysis for non-linear mathematical models. Math. Modell. Comput. Exp. 1, 407–414 (1993)
MATH Google Scholar
Zhang, P.: Nonparametric importance sampling. J. Am. Stat. Assoc. 91(435), 1245–1253 (1996)
Article MathSciNet MATH Google Scholar
Zhou, C., Lu, Z., Zhang, L., et al.: Moment independent sensitivity analysis with correlations. Appl. Math. Model. 38(19–20), 4885–4896 (2014)
Article Google Scholar

Download references

Acknowledgements

The first author is enrolled in a Ph.D. program co-funded by ONERA-The French Aerospace Lab and Toulouse III - Paul Sabatier University. Their financial supports are gratefully acknowledged.

Author information

Authors and Affiliations

ONERA/DTIS, Université de Toulouse, 31055, Toulouse, France
Julien Demange-Chryst & Jérôme Morio
UMR5219 CNRS, Institut de Mathématiques de Toulouse, 31062, Toulouse, France
Julien Demange-Chryst & François Bachoc

Authors

Julien Demange-Chryst
View author publications
You can also search for this author in PubMed Google Scholar
François Bachoc
View author publications
You can also search for this author in PubMed Google Scholar
Jérôme Morio
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Julien Demange-Chryst: conceptualization, methodology, formal analysis, investigation, software and numerical results, writing-original draft preparation, writing-review. François Bachoc: conceptualization, methodology, formal analysis, investigation, writing-review, supervision. Jérôme Morio: conceptualization, methodology, formal analysis, investigation, writing-review, supervision. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Julien Demange-Chryst.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix A: Equivalence between both optimization problem

Let us prove that the optimization problem in Eq. (18) is equivalent to the one in Eq. (21). Consider a sequence $\varvec{\alpha }\in S_J$, a family of IS auxiliary distributions $\left( g_{\varvec{\lambda }_j}\right) _{j\in [\![1,J]\!]}$, a family of control parameters $\left( \beta _j\right) _{j\in [\![1,J]\!]}\in {\mathbb {R}}^J$ and a family of positive weights $\left( w_j\right) _{j\in [\![1,J]\!]}\in {\mathbb {R}}_+^J$.

For any $j\in [\![1,J]\!]$ and any IS auxiliary distribution h, we have:

$$\begin{aligned}&{\mathbb {V}}_{g_{\varvec{\alpha }}}\left( \dfrac{\phi _j\left( {\textbf{X}}\right) f_j\left( {\textbf{X}}\right) -\beta _j g_{\varvec{\lambda }_j}\left( {\textbf{X}}\right) }{g_{\varvec{\alpha }}\left( {\textbf{X}}\right) } \right) \\&\quad = {\mathbb {E}}_{g_{\varvec{\alpha }}}\left[ \left( \dfrac{\phi _j\left( {\textbf{X}}\right) f_j\left( {\textbf{X}} \right) -\beta _j g_{\varvec{\lambda }_j}\left( {\textbf{X}}\right) }{g_{\varvec{\alpha }}\left( {\textbf{X}} \right) }\right) ^2\right] \\&\quad - {\mathbb {E}}_{g_{\varvec{\alpha }}}\left( \dfrac{\phi _j\left( {\textbf{X}} \right) f_j\left( {\textbf{X}}\right) -\beta _j g_{\varvec{\lambda }_j}\left( {\textbf{X}}\right) }{g_{\varvec{\alpha }}\left( {\textbf{X}}\right) }\right) ^2\\&\quad = {\mathbb {E}}_{g_{\varvec{\alpha }}}\left[ \dfrac{\left( \phi _j\left( {\textbf{X}}\right) f_j \left( {\textbf{X}} \right) -\beta _j g_{\varvec{\lambda }_j}\left( {\textbf{X}}\right) \right) ^2}{g_{\varvec{\alpha }} \left( {\textbf{X}}\right) ^2}\right] \\&\quad - \underbrace{{\mathbb {E}}_{f_j}\left( \phi _j\left( {\textbf{X}}\right) -\dfrac{\beta _j g_{\varvec{\lambda }_j}\left( {\textbf{X}}\right) }{f_j\left( {\textbf{X}}\right) }\right) ^2} _{= c_j \text { independent of } \varvec{\alpha }}\\&= {\mathbb {E}}_{h}\left[ \dfrac{\left( \phi _j\left( {\textbf{X}}\right) f_j\left( {\textbf{X}}\right) -\beta _j g_{\varvec{\lambda }_j}\left( {\textbf{X}}\right) \right) ^2}{g_{\varvec{\alpha }} \left( {\textbf{X}}\right) h\left( {\textbf{X}}\right) }\right] - c_j. \end{aligned}$$

Therefore, we have:

$$\begin{aligned}&\sum _{j=1}^J w_j{\mathbb {V}}_{g_{\varvec{\alpha }}}\left( \dfrac{\phi _j\left( {\textbf{X}} \right) f_j\left( {\textbf{X}}\right) -\beta _j g_{\varvec{\lambda }_j}\left( {\textbf{X}}\right) }{g_{\varvec{\alpha }}\left( {\textbf{X}}\right) }\right) \\&\quad =\sum _{j=1}^J w_j\left( {\mathbb {E}}_{h}\left[ \dfrac{\left( \phi _j\left( {\textbf{X}}\right) f_j \left( {\textbf{X}}\right) -\beta _j g_{\varvec{\lambda }_j}\left( {\textbf{X}}\right) \right) ^2}{g_{\varvec{\alpha }}\left( {\textbf{X}}\right) h\left( {\textbf{X}}\right) }\right] - c_j\right) \\&\quad = \sum _{j=1}^J w_j{\mathbb {E}}_{h}\left[ \dfrac{\left( \phi _j\left( {\textbf{X}}\right) f_j\left( {\textbf{X}}\right) -\beta _j g_{\varvec{\lambda }_j}\left( {\textbf{X}}\right) \right) ^2}{g_{\varvec{\alpha }}\left( {\textbf{X}}\right) h\left( {\textbf{X}}\right) }\right] - \sum _{j=1}^J w_jc_j\\&\quad = {\mathbb {E}}_{h}\left[ \dfrac{\sum _{j=1}^Jw_j\left( \phi _j\left( {\textbf{X}}\right) f_j\left( {\textbf{X}}\right) -\beta _j g_{\varvec{\lambda }_j}\left( {\textbf{X}}\right) \right) ^2}{g_{\varvec{\alpha }}\left( {\textbf{X}} \right) h\left( {\textbf{X}}\right) }\right] \\&\quad - \sum _{j=1}^J w_jc_j. \end{aligned}$$

Since the term $\sum _{j=1}^J w_jc_j$ does not depend on the sequence $\varvec{\alpha }$, minimizing $\sum _{j=1}^J w_j{\mathbb {V}}_{g_{\varvec{\alpha }}}\left( \dfrac{\phi _j\left( {\textbf{X}} \right) f_j\left( {\textbf{X}}\right) -\beta _j g_{\varvec{\lambda }_j}\left( {\textbf{X}}\right) }{g_{\varvec{\alpha }}\left( {\textbf{X}}\right) }\right) $ w.r.t. $\varvec{\alpha }$ is then equivalent to minimize ${\mathbb {E}}_{h}\left[ \dfrac{\sum _{j=1}^Jw_j\left( \phi _j\left( {\textbf{X}} \right) f_j\left( {\textbf{X}}\right) -\beta _j g_{\varvec{\lambda }_j}\left( {\textbf{X}}\right) \right) ^2}{g_{\varvec{\alpha }} \left( {\textbf{X}}\right) h\left( {\textbf{X}}\right) }\right] $ w.r.t. $\varvec{\alpha }$. As a conclusion, both optimization problems in Eqs. (18) and (21) are equivalent.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Demange-Chryst, J., Bachoc, F. & Morio, J. Efficient estimation of multiple expectations with the same sample by adaptive importance sampling and control variates. Stat Comput 33, 103 (2023). https://doi.org/10.1007/s11222-023-10270-y

Download citation

Received: 30 November 2022
Accepted: 20 June 2023
Published: 15 July 2023
DOI: https://doi.org/10.1007/s11222-023-10270-y

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Efficient estimation of multiple expectations with the same sample by adaptive importance sampling and control variates

Abstract

Access this article

Similar content being viewed by others

Application of the full Bayesian significance test to model selection under informative sampling

Detecting and diagnosing prior and likelihood sensitivity with power-scaling

Convergence rates for optimised adaptive importance samplers

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Appendix A: Equivalence between both optimization problem

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Efficient estimation of multiple expectations with the same sample by adaptive importance sampling and control variates

Abstract

Access this article

Similar content being viewed by others

Application of the full Bayesian significance test to model selection under informative sampling

Detecting and diagnosing prior and likelihood sensitivity with power-scaling

Convergence rates for optimised adaptive importance samplers

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Appendix A: Equivalence between both optimization problem

Appendix A: Equivalence between both optimization problem

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation