Skip to main content
Log in

L-moments-based uncertainty quantification for scarce samples including extremes

  • Research Paper
  • Published:
Structural and Multidisciplinary Optimization Aims and scope Submit manuscript

Abstract

Sampling-based uncertainty quantification demands large data. Hence, when the available sample is scarce, it is customary to assume a distribution and estimate its moments from scarce data, to characterize the uncertainties. Nonetheless, inaccurate assumption about the distribution leads to flawed decisions. In addition, extremes, if present in the scarce data, are prone to be classified as outliers and neglected which leads to wrong estimation of the moments. Therefore, it is desirable to develop a method that is (i) distribution independent or allows distribution identification with scarce samples and (ii) accounts for the extremes in data and yet be insensitive or less sensitive to moments estimation. We propose using L-moments to develop a distribution-independent, robust moment estimation approach to characterize the uncertainty and propagate it through the system model. L-moment ratio diagram that uses higher order L-moments is adopted to choose the appropriate distribution, for uncertainty quantification. This allows for better characterization of the output distribution and the probabilistic estimates obtained using L-moments are found to be less sensitive to the extremes in the data, compared to the results obtained from the conventional moments approach. The efficacy of the proposed approach is demonstrated on conventional distributions covering all types of tails and several engineering examples. Engineering examples include a sheet metal manufacturing process, 7 variable speed reducer, and probabilistic fatigue life estimation.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18

Similar content being viewed by others

References

  • Abarbanel H, Koonin S, Levine H, MacDonald G, Rothaus O (1992) Statistics of Extreme Events with Application to Climate; Technical Report; DTIC Document: McLean, VA, USA

  • Acar E, Ramu P (2014) Reliability estimation using guided tail modeling with adaptive sampling. In: 16th AIAA non-deterministic approaches conference. https://doi.org/10.2514/6.2014-0645, pp 1–9

  • Adamowski K (2000) Regional analysis of annual maximum and partial duration flood data by nonparametric and L-moment methods. J Hydrol 229(3-4):219–231. 10.1016/S0022-1694(00)00156-6

    Article  Google Scholar 

  • Alvarado E, Sandberg D, Pickford S (1998) Modeling large forest fires as extreme events. Northwest Science 72:66–75

    Google Scholar 

  • Anderson TV, Mattson CA (2012) Propagating skewness and kurtosis through engineering models for low-cost, meaningful, nondeterministic design. Journal of Mechanical Design 134(10):100911. https://doi.org/10.1115/1.4007389

    Article  Google Scholar 

  • Atiem IA, Harmancioǧlu NB (2006) Assessment of regional floods using L-moments approach: the case of the River Nile. Water Resources Management 20(5):723–747. https://doi.org/10.1007/s11269-005-9004-0

    Article  Google Scholar 

  • Buch-Larsen T, Nielsen JP, Guillén M, Bolancé C (2005) Kernel density estimation for heavy-tailed distributions using the champernowne transformation. Statistics 39(6):503–518. https://doi.org/10.1080/02331880500439782

    Article  MathSciNet  MATH  Google Scholar 

  • Craig CC (1991) A new exposition and chart for the pearson system of frequency curves. Ann Stat 7:16–28

    Article  Google Scholar 

  • Ceriani L, Verme P (2012) The origins of the gini index: extracts from variabilità e mutabilità (1912) by Corrado Gini. J Econ Inequal 10(3):421–443

    Article  Google Scholar 

  • Chatterjee T, Chakraborty S, Chowdhury R (2017) A critical review of surrogate assisted robust design optimization. Archives of Computational Methods in Engineering, pp 1–30. https://doi.org/10.1007/s11831-017-9240-5

  • David HA (1981) Order statistics. Wiley, New York

    Google Scholar 

  • Davison A, Huser R (2015) Statistics of extremes. Annual Review of Statistics and its Application 2(1):203–235. https://doi.org/10.1146/annurev-statistics-010814-020133

    Article  Google Scholar 

  • Elamir EA, Seheult AH (2004) Exact variance structure of sample L-moments. Journal of Statistical Planning and Inference 124(2):337–359. https://doi.org/10.1016/S0378-3758(03)00213-1

    Article  MathSciNet  MATH  Google Scholar 

  • Gao W, Song C, Tin-Loi F (2010) Probabilistic interval analysis for structures with uncertainty. Struct Saf 32(3):191–199

    Article  Google Scholar 

  • Greenwood J, Landwehr J, Matalas N, Wallis J (1979) Probability weighted moments: definition and relation to parameters of several distributions expressable in inverse form. Water Resour Res, pp 1049–1054. https://doi.org/10.1029/WR015i005p01049

  • Gubareva TS, Gartsman BI (2010) Estimating distribution parameters of extreme hydrometeorological characteristics by L-moments method. Water Resources 37(4):437–445. https://doi.org/10.1134/S0097807810040020

    Article  Google Scholar 

  • Haddad K, Rahman A, Green J (2011) Design rainfall estimation in australia: a case study using l moments and generalized least squares regression. Stoch Env Res Risk A 25(6):815–825

    Article  Google Scholar 

  • Hall P, Sheather SJ, Jones M, Marron JS (1991) On optimal data-based bandwidth selection in kernel density estimation. Biometrika 78(2):263–269. https://doi.org/10.1093/biomet/78.2.263

    Article  MathSciNet  MATH  Google Scholar 

  • Hosking JRM (1989) Some theoretical results concerning L-moments. IBM Thomas J. Watson Research Division

  • Hosking JRM (1990) L-moments: analysis and estimation of distributions using linear combinations of order statistics. https://doi.org/10.2307/2345653

  • Hosking JRM (1992) Moments or L-moments - an example comparing 2 measures of distributional shape. Am Stat 46(3):186–189. https://doi.org/10.2307/2685210

    Article  Google Scholar 

  • Hosking JRM (2006) On the characterization of distributions by their L-moments. Journal of Statistical Planning and Inference 136(1):193–198. https://doi.org/10.1016/j.jspi.2004.06.004

    Article  MathSciNet  MATH  Google Scholar 

  • Hosking JRM, Wallis JR (1997) Regional frequency analysis: an approach based on L-moments. Cambridge University Press

  • Hu Z, Du X, Conrad D, Twohy R, Walmsley M (2014) Fatigue reliability analysis for structures with known loading trend. Struct Multidiscip Optim 50(1):9–23

    Article  Google Scholar 

  • Jayaraman D, Ramu P (2019) Uncertainty propagation using L-moments with scarce samples including extremes. In: Proc. 13th world congress of structural and multidisciplinary optimization, pp 15–21

  • Jayaraman D, Ramu P, Suresh SK, Ramanath V (2018) Treating uncertainties to generate a robust design of gas turbine disk using l-moments and scarce samples including outliers. In: Turbo expo: power for land, sea, and air. https://doi.org/10.1115/GT2018-76431, vol 51135, p V07AT32A008

  • Jin R, Du X, Chen W (2003) The use of metamodeling techniques for optimization under uncertainty. Struct Multidiscip Optim 25(2):99–116. https://doi.org/10.1007/s00158-002-0277-0

    Article  Google Scholar 

  • Kang YJ, Noh Y, Lim OK (2018) Kernel density estimation with bounded data. Struct Multidiscip Optim 57(1):95–113. https://doi.org/10.1007/s00158-017-1873-3

    Article  MathSciNet  Google Scholar 

  • Kenney J, Keeping E (1947) Mathematics of Statistics. No. pt. 2 in Mathematics of Statistics, Van Nostrand

  • Kumar R, Chatterjee C (2005) Regional flood frequency analysis using l-moments for north brahmaputra region of India. J Hydrol Eng 10:1–7. https://doi.org/10.1061/(ASCE)1084-0699(2005)10:1(1)

    Article  Google Scholar 

  • Lee G, Kim W, Oh H, Youn BD, Kim NH (2019) Review of statistical model calibration and validation—from the perspective of uncertainty structures. Structural and Multidisciplinary Optimization. https://doi.org/10.1007/s00158-019-02270-2

  • Lee SH, Chen W (2009) A comparative study of uncertainty propagation methods for black-box-type problems. Struct Multidiscip Optim 37(3):239–253. https://doi.org/10.1007/s00158-008-0234-7

    Article  MathSciNet  Google Scholar 

  • Lee SH, Chen W, Kwak BM (2009) Robust design with arbitrary distributions using Gauss-type quadrature formula. Struct Multidiscip Optim 39(3):227–243. https://doi.org/10.1007/s00158-008-0328-2

    Article  MathSciNet  MATH  Google Scholar 

  • Lin MH, Tsai JF, Hu NZ, Chang SC (2013) Design optimization of a speed reducer using deterministic techniques. Math Probl Eng 2013. https://doi.org/10.1155/2013/419043

  • Liu H, Jiang C, Liu J, Mao J (2019) Uncertainty propagation analysis using sparse grid technique and saddlepoint approximation based on parameterized p-box representation. Struct Multidiscip Optim 59 (1):61–74

    Article  MathSciNet  Google Scholar 

  • Mekid S, Vaja D (2008) Propagation of uncertainty: expressions of second and third order uncertainty with third and fourth moments. Measurement: Journal of the International Measurement Confederation 41 (6):600–609. https://doi.org/10.1016/j.measurement.2007.07.004

    Article  Google Scholar 

  • Melville P, Yang SM, Saar-Tsechansky M, Mooney R (2005) Active learning for probability estimation using Jensen-Shannon divergence. In: Gama J, Camacho R, Brazdil PB, Jorge AM, Torgo L (eds) Machine learning: ECML 2005. Springer, Berlin, pp 268–279

  • Moon MY, Kim HS, Lee K, Park B, Choi KK (2020) Uncertainty quantification and statistical model validation for an offshore jacket structure panel given limited test data and simulation model. Structural and Multidisciplinary Optimization. https://doi.org/10.1007/s00158-020-02520-8

  • Moustapha M, Sudret B (2019) Surrogate-assisted reliability-based design optimization: a survey and a unified modular framework. Structural and Multidisciplinary Optimization 60(5):2157–2176. https://doi.org/10.1007/s00158-019-02290-y, 1901.03311

    Article  MathSciNet  Google Scholar 

  • Nair NU, Vineshkumar B (2010) L-moments of residual life. Journal of Statistical Planning and Inference 140(9):2618–2631

    Article  MathSciNet  Google Scholar 

  • Park BU, Marron JS (1990) Comparison of data-driven bandwidth selectors. J Am Stat Assoc 85(409):66–72. https://doi.org/10.1080/01621459.1990.10475307

    Article  Google Scholar 

  • Pearson K (1916) Mathematical contributions to the theory of evolution. XIX. Second Supplement to a Memoir on Skew Variation. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences 216(538-548):429–457. https://doi.org/10.1098/rsta.1916.0009

    MATH  Google Scholar 

  • Ramu P, Arul S (2016) Estimating probabilistic fatigue of nitinol with scarce samples. Int J Fatigue 85:31–39. https://doi.org/10.1016/j.ijfatigue.2015.11.022

    Article  Google Scholar 

  • Ramu P, Kim NH, Haftka RT (2010) Multiple tail median approach for high reliability estimation. Struct Saf 32(2):124– 137

    Article  Google Scholar 

  • Ramu P, Kumar GS, Neelakantan P, Bathula KK (2017) Cost-reliability trade-off of path generating linkages using multi-objective genetic algorithm. International Journal of Reliability and Safety 11 (3-4):200–219. https://doi.org/10.1504/IJRS.2017.089706

    Article  Google Scholar 

  • P Ramu (2013) Modified third order polynomial approach for reliability analysis with scarce samples. In: Proceedings of the 10th world congress on structural and multidisciplinary optimization, Orlando, USA (2010), pp 1–10

  • Rokne JG (2001) Interval arithmetic and interval analysis: an introduction. In: Granular computing. Springer, Berlin, pp 1–22

  • Rudemo M (1982) Empirical choice of histograms and kernel density estimators. Scand J Stat 9 (2):65–78

    MathSciNet  MATH  Google Scholar 

  • Sankarasubramanian A, Srinivasan K (1999) Investigation and comparison of sampling properties of L-moments and conventional moments. Journal of Hydrology 218(1-2):13–34. https://doi.org/10.1016/S0022-1694(99)00018-9

    Article  Google Scholar 

  • Schäfer HJ (2008) Auswertealgorithmus auf der Basis einer Modifikation des Goniometrischen Modells zur stetigen Beschreibung der Wöhlerkurve vom Low-Cycle-Fatigue-bis in den Ultra-High-Cycle-Fatigue-Bereich. Mainz

  • Shi L, Fu Y, Yang RJ, Wang BP, Zhu P (2013) Selection of initial designs for multi-objective optimization using classification and regression tree. Structural and Multidisciplinary Optimization 48 (6):1057–1073. https://doi.org/10.1007/s00158-013-0947-0

    Article  MathSciNet  Google Scholar 

  • Shirahata S, Is Chu (1992) Integrated squared error of kernel-type estimator of distribution function. Ann Inst Statist Math 44(3):579–591

    MathSciNet  MATH  Google Scholar 

  • Sillitto GP (1969) Derivation of approximants to the inverse distribution function of a continuous univariate population from the order statistics of a sample. Biometrika 56(3):641–650

    Article  Google Scholar 

  • Silverman BW (1986) Density estimation for statistics and data analysis 26

  • Smithers J, Schulze R (2001) A methodology for the estimation of short duration design storms in South Africa using a regional approach based on l-moments. J Hydrol 241(1-2):42–52

    Article  Google Scholar 

  • Turlach BA (1993) Bandwidth selection in kernel density estimation: a review. In: CORE and Institut de Statistique, Citeseer

  • Voinov A, Kolagani N, McCall MK, Glynn PD, Kragt ME, Ostermann FO, Pierce SA, Ramu P (2016) Modelling with stakeholders - next generation. Environmental Modelling and Software 77:196–220. https://doi.org/10.1016/j.envsoft.2015.11.016

    Article  Google Scholar 

  • Weisstein EW, et al. (2004) Pearson system, from mathworld–a wolfram web resource

  • Zafirakou-Koulouris A, Vogel RM, Craig SM, Habermeier J (1998) L moment diagrams for censored observations. Water Resour Res 34(5):1241–1249. https://doi.org/10.1029/97WR03712

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Deepan Jayaraman.

Ethics declarations

Conflict of interests

The authors declare that they have no conflict of interest.

Additional information

Responsible Editor: Makoto Ohsaki

Replication of results

MATLAB®; codes to reproduce the L-moment approach results for the examples discussed are made available in this link https://github.com/DeepanJayaraman/UQ.git

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix 1: Distribution selection based on L-moment

Based on generalized distribution identification procedure mentioned in Fig. 1, the step-by-step distribution selection process based on L-moment ratio diagram is provided in Algorithm 1.

figure a

Appendix 2: Statistical distributions

1.1 2.1 Shape parameter and tail heaviness

Generalized pareto distribution (GPD) is used to approximate the tails of any parent distribution. ψ, the scale, and ξ, the shape, parameters characterize the GPD fit (Ramu et al. 2010). ξ plays a significant role in quantifying the weight of the tail or tail heaviness. Probability distributions can be classified as light (ξ < 0), medium (ξ = 0), and heavy tail (ξ > 0) distributions based on the tail heaviness. Let G be the performance measure which is random and u is threshold of G. The observations of G that surpass u are called, exceedance, Z in the following (12).

$$ Z = G-u $$
(12)

The conditional CDF of exceedance is modeled by GPD and denoted by \(\hat {F}{_{\xi ,\psi }(Z)}\). The distribution function of (Gu) is mentioned in (13).

$$ \hat{F}{{~}_{\xi,\psi}(Z)}=\begin{cases}1-\left( 1+\frac{\xi}{\psi}Z\right)^{-\frac{1}{\xi}} & \text{if} \xi \not= 0\\1-\exp\left( -\frac{Z}{\psi}\right) & \text{if} \xi = 0\end{cases} $$
(13)

ξ for exponential distribution is approximately 0. Similarly, ξ is lesser than 0 for light tail and greater than 0 for heavy tail. Figure 19 shows different CDF of tails approximated by GPD of the selected distributions.

Fig. 19
figure 19

CDF and shape parameter values of selected distribution

1.2 2.2 Results—One extreme

In Tables 456, and 7, the quartile (Q) values of the DJS are presented which are obtained for different sample sizes based on the Pearson system and L-moment ratio diagram. In all cases, it can be observed that L-moment approach works better.

Table 4 Quartile values of JS divergence - light tail
Table 5 Quartile values of JS divergence - medium tail
Table 6 Quartile values of JS divergence - heavy tail I
Table 7 Quartile values of JS divergence - heavy tail II

1.3 2.3 Results—Two extremes

Two extremes are incorporated in the samples in this study. 106 samples from the parent distribution are generated and the maximum value and the value at 9.9 × 105 is considered as extremes in the ordered data. In Tables 8910, and 11, and Figs. 202122, and 23, the results are presented for two extremes case. For all the sample sizes, L-moment approach work better than the C-moment approach.

Fig. 20
figure 20

JS divergence for light tail

Fig. 21
figure 21

JS divergence for medium tail

Fig. 22
figure 22

JS divergence for heavy tail I

Fig. 23
figure 23

JS divergence for heavy tail II

Table 8 Quartile values of JS divergence - light tail
Table 9 Quartile values of JS divergence - medium tail
Table 10 Quartile values of JS divergence - heavy tail I
Table 11 Quartile values of JS divergence - heavy tail II

Appendix 3: Flat rolling metal sheet manufacturing process

1.1 3.1 Distribution selection process based on L-moment

As mentioned in Algorithm 1, the step by step distribution selection process is demonstrated for this example. In this example, coefficient of friction (μf) is considered as random variable and follows the distribution mentioned in Fig. 9. Sample size considered to demonstrate the distribution selection process is 100.

  1. 1.

    Samples are generated and the response, maximum change in thickness (ΔH) is computed using (10).

  2. 2.

    L-moment ratios L-skewness (t3) and L-kurtosis (t3) are estimated and the values are 0.3, 0.19. Estimated sample L-moment ratios are plotted in L-moment ratio diagram and shown in Fig. 24.

  3. 3.

    Shortest distance is computed for all possible distributions in the L-moment ratio diagram and presented in Table 12.

  4. 4.

    Generalized pareto distribution is considered as the best possible distribution which has less distance compared to all other distributions.

  5. 5.

    Parameters of generalized pareto distribution obtained from the sample and corresponding PDF is plotted with the actual PDF (sample size for population is 10000), presented in Fig. 25.

  6. 6.

    Distributions obtained from sample is compared with the population distribution using DJS. The DJS value is 0.002.

In Tables 13 and 14, the quartile (Q) values of the DJS are presented, which are obtained for different sample sizes. L-moment approach works well in all cases compared to the C-moment approach.

Fig. 24
figure 24

L-moment ratio diagram with sample L-moment ratio

Fig. 25
figure 25

PDF obtained with sample size 100

Table 12 Shortest distance of all possible distributions in L-moment ratio diagram
Table 13 Quartile values of JS divergence - uncertainty quantification of ΔH of flat rolling sheet metal manufacturing process
Table 14 Quartile values of JS divergence - uncertainty propagation of ΔH of flat rolling sheet metal manufacturing process

1.2 3.2 Results—Two extremes

In this section, results of the two extremes in the scarce data are presented. The DJS obtained for various sample size are presented in Figs. 26 and 27 and the corresponding Q values are provided in the Tables 15 and 16. Similar observations are obtained as mentioned in one extreme case.

Fig. 26
figure 26

JS divergence - uncertainty quantification

Fig. 27
figure 27

JS divergence - uncertainty propagation

Table 15 Quartile values of JS divergence - uncertainty quantification of ΔH of flat rolling sheet metal manufacturing process
Table 16 Quartile values of JS divergence - uncertainty propagation of ΔH of flat rolling sheet metal manufacturing process

Appendix 4: Design of speed reducer

1.1 4.1 Results—One extreme

In this section, the quartile (Q) values of the DJS are presented for different sample sizes. The Q values clearly indicate that L-moment approach are less affected by the sampling variability and works better than C-moment approach.

1.2 4.2 Results—Two extremes

DJS results are presented in Figs. 282930, and 31 and the corresponding Q values are presented in Tables 212223, and 24. The results clearly show that the superiority of the L-moment approach over C-moment approach.

Fig. 28
figure 28

JS divergence - extreme at 1 variable at a time

Fig. 29
figure 29

JS divergence - extreme at 3 variables at a time

Fig. 30
figure 30

JS divergence - extreme at 5 variables at a time

Fig. 31
figure 31

JS divergence - extreme at 7 variables at a time

Table 17 Quartile values of JS divergence - extreme at 1 variable at a time
Table 18 Quartile values of JS divergence - extreme at 3 variable at a time
Table 19 Quartile values of JS divergence - extreme at 5 variables at a time
Table 20 Quartile values of JS divergence - extreme at 7 variables at a time
Table 21 Quartile values of JS divergence - extreme at 1 variable at a time
Table 22 Quartile values of JS divergence - extreme at 3 variables at a time
Table 23 Quartile values of JS divergence - extreme at 5 variables at a time
Table 24 Quartile values of JS divergence - extreme at 7 variables at a time

Appendix 5: Probabilistic fatigue life assessment

In this section, probabilistic fatigue life assessment for different sample sizes are presented. μ + 0.5σ of fatigue life is considered the upper bound and the minimum value of ordered experimental data is considered as the lower bound for scarce sample selection. Samples are randomly drawn between the upper and lower bound of the ordered experimental data for all load cases. The survival % comparison results are presented in Fig. 32. The confidence bounds of 90th and 95th survival % are provided in Tables 252627, and 28. In all cases, L-moments approach predicts the probability of fatigue life closer to the actual for different loading conditions.

Fig. 32
figure 32

Comparison of 95th survival % of sample distribution to parent distribution with different sample sizes

Table 25 Survival % of fatigue life at different load cases and confidence bounds for sample size 10
Table 26 Survival % of fatigue life at different load cases and confidence bounds for sample size 15
Table 27 Survival % of fatigue life at different load cases and confidence bounds for sample size 20
Table 28 Survival % of fatigue life at different load cases and confidence bounds for sample size 25

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Jayaraman, D., Ramu, P. L-moments-based uncertainty quantification for scarce samples including extremes. Struct Multidisc Optim 64, 505–539 (2021). https://doi.org/10.1007/s00158-021-02930-2

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00158-021-02930-2

Keywords

Navigation