Abstract
Sampling-based uncertainty quantification demands large data. Hence, when the available sample is scarce, it is customary to assume a distribution and estimate its moments from scarce data, to characterize the uncertainties. Nonetheless, inaccurate assumption about the distribution leads to flawed decisions. In addition, extremes, if present in the scarce data, are prone to be classified as outliers and neglected which leads to wrong estimation of the moments. Therefore, it is desirable to develop a method that is (i) distribution independent or allows distribution identification with scarce samples and (ii) accounts for the extremes in data and yet be insensitive or less sensitive to moments estimation. We propose using L-moments to develop a distribution-independent, robust moment estimation approach to characterize the uncertainty and propagate it through the system model. L-moment ratio diagram that uses higher order L-moments is adopted to choose the appropriate distribution, for uncertainty quantification. This allows for better characterization of the output distribution and the probabilistic estimates obtained using L-moments are found to be less sensitive to the extremes in the data, compared to the results obtained from the conventional moments approach. The efficacy of the proposed approach is demonstrated on conventional distributions covering all types of tails and several engineering examples. Engineering examples include a sheet metal manufacturing process, 7 variable speed reducer, and probabilistic fatigue life estimation.
Similar content being viewed by others
References
Abarbanel H, Koonin S, Levine H, MacDonald G, Rothaus O (1992) Statistics of Extreme Events with Application to Climate; Technical Report; DTIC Document: McLean, VA, USA
Acar E, Ramu P (2014) Reliability estimation using guided tail modeling with adaptive sampling. In: 16th AIAA non-deterministic approaches conference. https://doi.org/10.2514/6.2014-0645, pp 1–9
Adamowski K (2000) Regional analysis of annual maximum and partial duration flood data by nonparametric and L-moment methods. J Hydrol 229(3-4):219–231. 10.1016/S0022-1694(00)00156-6
Alvarado E, Sandberg D, Pickford S (1998) Modeling large forest fires as extreme events. Northwest Science 72:66–75
Anderson TV, Mattson CA (2012) Propagating skewness and kurtosis through engineering models for low-cost, meaningful, nondeterministic design. Journal of Mechanical Design 134(10):100911. https://doi.org/10.1115/1.4007389
Atiem IA, Harmancioǧlu NB (2006) Assessment of regional floods using L-moments approach: the case of the River Nile. Water Resources Management 20(5):723–747. https://doi.org/10.1007/s11269-005-9004-0
Buch-Larsen T, Nielsen JP, Guillén M, Bolancé C (2005) Kernel density estimation for heavy-tailed distributions using the champernowne transformation. Statistics 39(6):503–518. https://doi.org/10.1080/02331880500439782
Craig CC (1991) A new exposition and chart for the pearson system of frequency curves. Ann Stat 7:16–28
Ceriani L, Verme P (2012) The origins of the gini index: extracts from variabilità e mutabilità (1912) by Corrado Gini. J Econ Inequal 10(3):421–443
Chatterjee T, Chakraborty S, Chowdhury R (2017) A critical review of surrogate assisted robust design optimization. Archives of Computational Methods in Engineering, pp 1–30. https://doi.org/10.1007/s11831-017-9240-5
David HA (1981) Order statistics. Wiley, New York
Davison A, Huser R (2015) Statistics of extremes. Annual Review of Statistics and its Application 2(1):203–235. https://doi.org/10.1146/annurev-statistics-010814-020133
Elamir EA, Seheult AH (2004) Exact variance structure of sample L-moments. Journal of Statistical Planning and Inference 124(2):337–359. https://doi.org/10.1016/S0378-3758(03)00213-1
Gao W, Song C, Tin-Loi F (2010) Probabilistic interval analysis for structures with uncertainty. Struct Saf 32(3):191–199
Greenwood J, Landwehr J, Matalas N, Wallis J (1979) Probability weighted moments: definition and relation to parameters of several distributions expressable in inverse form. Water Resour Res, pp 1049–1054. https://doi.org/10.1029/WR015i005p01049
Gubareva TS, Gartsman BI (2010) Estimating distribution parameters of extreme hydrometeorological characteristics by L-moments method. Water Resources 37(4):437–445. https://doi.org/10.1134/S0097807810040020
Haddad K, Rahman A, Green J (2011) Design rainfall estimation in australia: a case study using l moments and generalized least squares regression. Stoch Env Res Risk A 25(6):815–825
Hall P, Sheather SJ, Jones M, Marron JS (1991) On optimal data-based bandwidth selection in kernel density estimation. Biometrika 78(2):263–269. https://doi.org/10.1093/biomet/78.2.263
Hosking JRM (1989) Some theoretical results concerning L-moments. IBM Thomas J. Watson Research Division
Hosking JRM (1990) L-moments: analysis and estimation of distributions using linear combinations of order statistics. https://doi.org/10.2307/2345653
Hosking JRM (1992) Moments or L-moments - an example comparing 2 measures of distributional shape. Am Stat 46(3):186–189. https://doi.org/10.2307/2685210
Hosking JRM (2006) On the characterization of distributions by their L-moments. Journal of Statistical Planning and Inference 136(1):193–198. https://doi.org/10.1016/j.jspi.2004.06.004
Hosking JRM, Wallis JR (1997) Regional frequency analysis: an approach based on L-moments. Cambridge University Press
Hu Z, Du X, Conrad D, Twohy R, Walmsley M (2014) Fatigue reliability analysis for structures with known loading trend. Struct Multidiscip Optim 50(1):9–23
Jayaraman D, Ramu P (2019) Uncertainty propagation using L-moments with scarce samples including extremes. In: Proc. 13th world congress of structural and multidisciplinary optimization, pp 15–21
Jayaraman D, Ramu P, Suresh SK, Ramanath V (2018) Treating uncertainties to generate a robust design of gas turbine disk using l-moments and scarce samples including outliers. In: Turbo expo: power for land, sea, and air. https://doi.org/10.1115/GT2018-76431, vol 51135, p V07AT32A008
Jin R, Du X, Chen W (2003) The use of metamodeling techniques for optimization under uncertainty. Struct Multidiscip Optim 25(2):99–116. https://doi.org/10.1007/s00158-002-0277-0
Kang YJ, Noh Y, Lim OK (2018) Kernel density estimation with bounded data. Struct Multidiscip Optim 57(1):95–113. https://doi.org/10.1007/s00158-017-1873-3
Kenney J, Keeping E (1947) Mathematics of Statistics. No. pt. 2 in Mathematics of Statistics, Van Nostrand
Kumar R, Chatterjee C (2005) Regional flood frequency analysis using l-moments for north brahmaputra region of India. J Hydrol Eng 10:1–7. https://doi.org/10.1061/(ASCE)1084-0699(2005)10:1(1)
Lee G, Kim W, Oh H, Youn BD, Kim NH (2019) Review of statistical model calibration and validation—from the perspective of uncertainty structures. Structural and Multidisciplinary Optimization. https://doi.org/10.1007/s00158-019-02270-2
Lee SH, Chen W (2009) A comparative study of uncertainty propagation methods for black-box-type problems. Struct Multidiscip Optim 37(3):239–253. https://doi.org/10.1007/s00158-008-0234-7
Lee SH, Chen W, Kwak BM (2009) Robust design with arbitrary distributions using Gauss-type quadrature formula. Struct Multidiscip Optim 39(3):227–243. https://doi.org/10.1007/s00158-008-0328-2
Lin MH, Tsai JF, Hu NZ, Chang SC (2013) Design optimization of a speed reducer using deterministic techniques. Math Probl Eng 2013. https://doi.org/10.1155/2013/419043
Liu H, Jiang C, Liu J, Mao J (2019) Uncertainty propagation analysis using sparse grid technique and saddlepoint approximation based on parameterized p-box representation. Struct Multidiscip Optim 59 (1):61–74
Mekid S, Vaja D (2008) Propagation of uncertainty: expressions of second and third order uncertainty with third and fourth moments. Measurement: Journal of the International Measurement Confederation 41 (6):600–609. https://doi.org/10.1016/j.measurement.2007.07.004
Melville P, Yang SM, Saar-Tsechansky M, Mooney R (2005) Active learning for probability estimation using Jensen-Shannon divergence. In: Gama J, Camacho R, Brazdil PB, Jorge AM, Torgo L (eds) Machine learning: ECML 2005. Springer, Berlin, pp 268–279
Moon MY, Kim HS, Lee K, Park B, Choi KK (2020) Uncertainty quantification and statistical model validation for an offshore jacket structure panel given limited test data and simulation model. Structural and Multidisciplinary Optimization. https://doi.org/10.1007/s00158-020-02520-8
Moustapha M, Sudret B (2019) Surrogate-assisted reliability-based design optimization: a survey and a unified modular framework. Structural and Multidisciplinary Optimization 60(5):2157–2176. https://doi.org/10.1007/s00158-019-02290-y, 1901.03311
Nair NU, Vineshkumar B (2010) L-moments of residual life. Journal of Statistical Planning and Inference 140(9):2618–2631
Park BU, Marron JS (1990) Comparison of data-driven bandwidth selectors. J Am Stat Assoc 85(409):66–72. https://doi.org/10.1080/01621459.1990.10475307
Pearson K (1916) Mathematical contributions to the theory of evolution. XIX. Second Supplement to a Memoir on Skew Variation. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences 216(538-548):429–457. https://doi.org/10.1098/rsta.1916.0009
Ramu P, Arul S (2016) Estimating probabilistic fatigue of nitinol with scarce samples. Int J Fatigue 85:31–39. https://doi.org/10.1016/j.ijfatigue.2015.11.022
Ramu P, Kim NH, Haftka RT (2010) Multiple tail median approach for high reliability estimation. Struct Saf 32(2):124– 137
Ramu P, Kumar GS, Neelakantan P, Bathula KK (2017) Cost-reliability trade-off of path generating linkages using multi-objective genetic algorithm. International Journal of Reliability and Safety 11 (3-4):200–219. https://doi.org/10.1504/IJRS.2017.089706
P Ramu (2013) Modified third order polynomial approach for reliability analysis with scarce samples. In: Proceedings of the 10th world congress on structural and multidisciplinary optimization, Orlando, USA (2010), pp 1–10
Rokne JG (2001) Interval arithmetic and interval analysis: an introduction. In: Granular computing. Springer, Berlin, pp 1–22
Rudemo M (1982) Empirical choice of histograms and kernel density estimators. Scand J Stat 9 (2):65–78
Sankarasubramanian A, Srinivasan K (1999) Investigation and comparison of sampling properties of L-moments and conventional moments. Journal of Hydrology 218(1-2):13–34. https://doi.org/10.1016/S0022-1694(99)00018-9
Schäfer HJ (2008) Auswertealgorithmus auf der Basis einer Modifikation des Goniometrischen Modells zur stetigen Beschreibung der Wöhlerkurve vom Low-Cycle-Fatigue-bis in den Ultra-High-Cycle-Fatigue-Bereich. Mainz
Shi L, Fu Y, Yang RJ, Wang BP, Zhu P (2013) Selection of initial designs for multi-objective optimization using classification and regression tree. Structural and Multidisciplinary Optimization 48 (6):1057–1073. https://doi.org/10.1007/s00158-013-0947-0
Shirahata S, Is Chu (1992) Integrated squared error of kernel-type estimator of distribution function. Ann Inst Statist Math 44(3):579–591
Sillitto GP (1969) Derivation of approximants to the inverse distribution function of a continuous univariate population from the order statistics of a sample. Biometrika 56(3):641–650
Silverman BW (1986) Density estimation for statistics and data analysis 26
Smithers J, Schulze R (2001) A methodology for the estimation of short duration design storms in South Africa using a regional approach based on l-moments. J Hydrol 241(1-2):42–52
Turlach BA (1993) Bandwidth selection in kernel density estimation: a review. In: CORE and Institut de Statistique, Citeseer
Voinov A, Kolagani N, McCall MK, Glynn PD, Kragt ME, Ostermann FO, Pierce SA, Ramu P (2016) Modelling with stakeholders - next generation. Environmental Modelling and Software 77:196–220. https://doi.org/10.1016/j.envsoft.2015.11.016
Weisstein EW, et al. (2004) Pearson system, from mathworld–a wolfram web resource
Zafirakou-Koulouris A, Vogel RM, Craig SM, Habermeier J (1998) L moment diagrams for censored observations. Water Resour Res 34(5):1241–1249. https://doi.org/10.1029/97WR03712
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interests
The authors declare that they have no conflict of interest.
Additional information
Responsible Editor: Makoto Ohsaki
Replication of results
MATLAB®; codes to reproduce the L-moment approach results for the examples discussed are made available in this link https://github.com/DeepanJayaraman/UQ.git
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix 1: Distribution selection based on L-moment
Based on generalized distribution identification procedure mentioned in Fig. 1, the step-by-step distribution selection process based on L-moment ratio diagram is provided in Algorithm 1.
Appendix 2: Statistical distributions
1.1 2.1 Shape parameter and tail heaviness
Generalized pareto distribution (GPD) is used to approximate the tails of any parent distribution. ψ, the scale, and ξ, the shape, parameters characterize the GPD fit (Ramu et al. 2010). ξ plays a significant role in quantifying the weight of the tail or tail heaviness. Probability distributions can be classified as light (ξ < 0), medium (ξ = 0), and heavy tail (ξ > 0) distributions based on the tail heaviness. Let G be the performance measure which is random and u is threshold of G. The observations of G that surpass u are called, exceedance, Z in the following (12).
The conditional CDF of exceedance is modeled by GPD and denoted by \(\hat {F}{_{\xi ,\psi }(Z)}\). The distribution function of (G − u) is mentioned in (13).
ξ for exponential distribution is approximately 0. Similarly, ξ is lesser than 0 for light tail and greater than 0 for heavy tail. Figure 19 shows different CDF of tails approximated by GPD of the selected distributions.
1.2 2.2 Results—One extreme
In Tables 4, 5, 6, and 7, the quartile (Q) values of the DJS are presented which are obtained for different sample sizes based on the Pearson system and L-moment ratio diagram. In all cases, it can be observed that L-moment approach works better.
1.3 2.3 Results—Two extremes
Two extremes are incorporated in the samples in this study. 106 samples from the parent distribution are generated and the maximum value and the value at 9.9 × 105 is considered as extremes in the ordered data. In Tables 8, 9, 10, and 11, and Figs. 20, 21, 22, and 23, the results are presented for two extremes case. For all the sample sizes, L-moment approach work better than the C-moment approach.
Appendix 3: Flat rolling metal sheet manufacturing process
1.1 3.1 Distribution selection process based on L-moment
As mentioned in Algorithm 1, the step by step distribution selection process is demonstrated for this example. In this example, coefficient of friction (μf) is considered as random variable and follows the distribution mentioned in Fig. 9. Sample size considered to demonstrate the distribution selection process is 100.
-
1.
Samples are generated and the response, maximum change in thickness (ΔH) is computed using (10).
-
2.
L-moment ratios L-skewness (t3) and L-kurtosis (t3) are estimated and the values are 0.3, 0.19. Estimated sample L-moment ratios are plotted in L-moment ratio diagram and shown in Fig. 24.
-
3.
Shortest distance is computed for all possible distributions in the L-moment ratio diagram and presented in Table 12.
-
4.
Generalized pareto distribution is considered as the best possible distribution which has less distance compared to all other distributions.
-
5.
Parameters of generalized pareto distribution obtained from the sample and corresponding PDF is plotted with the actual PDF (sample size for population is 10000), presented in Fig. 25.
-
6.
Distributions obtained from sample is compared with the population distribution using DJS. The DJS value is 0.002.
In Tables 13 and 14, the quartile (Q) values of the DJS are presented, which are obtained for different sample sizes. L-moment approach works well in all cases compared to the C-moment approach.
1.2 3.2 Results—Two extremes
In this section, results of the two extremes in the scarce data are presented. The DJS obtained for various sample size are presented in Figs. 26 and 27 and the corresponding Q values are provided in the Tables 15 and 16. Similar observations are obtained as mentioned in one extreme case.
Appendix 4: Design of speed reducer
1.1 4.1 Results—One extreme
In this section, the quartile (Q) values of the DJS are presented for different sample sizes. The Q values clearly indicate that L-moment approach are less affected by the sampling variability and works better than C-moment approach.
1.2 4.2 Results—Two extremes
DJS results are presented in Figs. 28, 29, 30, and 31 and the corresponding Q values are presented in Tables 21, 22, 23, and 24. The results clearly show that the superiority of the L-moment approach over C-moment approach.
Appendix 5: Probabilistic fatigue life assessment
In this section, probabilistic fatigue life assessment for different sample sizes are presented. μ + 0.5σ of fatigue life is considered the upper bound and the minimum value of ordered experimental data is considered as the lower bound for scarce sample selection. Samples are randomly drawn between the upper and lower bound of the ordered experimental data for all load cases. The survival % comparison results are presented in Fig. 32. The confidence bounds of 90th and 95th survival % are provided in Tables 25, 26, 27, and 28. In all cases, L-moments approach predicts the probability of fatigue life closer to the actual for different loading conditions.
Rights and permissions
About this article
Cite this article
Jayaraman, D., Ramu, P. L-moments-based uncertainty quantification for scarce samples including extremes. Struct Multidisc Optim 64, 505–539 (2021). https://doi.org/10.1007/s00158-021-02930-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00158-021-02930-2