Robust scale estimators for fuzzy data

  • Regular Article
Advances in Data Analysis and Classification

Abstract

Observations that lie far from the majority or deviate from the general pattern often appear in datasets. Classical estimates such as the sample mean or the sample variance can be substantially affected by these observations (outliers); even a single outlier can have a huge distorting influence. For real-valued data, however, there exist robust estimates of location and scale (dispersion) that reduce the influence of these atypical values and provide approximately the same results as the classical estimates applied to the typical data without outliers. In real life, the data to be analyzed and interpreted are not always precisely defined and cannot always be properly expressed on a numerical scale of measurement. Frequently, some of these imprecise data can be suitably described and modelled by means of a fuzzy rating scale of measurement. In this paper, several well-known scale (dispersion) estimators for the real-valued case are extended to random fuzzy numbers (i.e., random mechanisms generating fuzzy-valued data), and some of their properties as estimators of dispersion are examined. Furthermore, their robust behaviour is analyzed using two powerful tools, namely the finite sample breakdown point and sensitivity curves. Simulations, including empirical bias curves, complete the study.
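
As a rough illustration of the robustness notions named in the abstract, the following minimal sketch compares a classical and a robust scale estimate on real-valued data only; it does not implement the fuzzy-valued extensions studied in the paper. The sample `clean`, the helper names `mad` and `sensitivity_curve`, and the (n + 1) scaling of the sensitivity curve (one common convention) are assumptions made for this example.

```python
import statistics


def mad(xs):
    """Median absolute deviation about the median (unscaled)."""
    med = statistics.median(xs)
    return statistics.median(abs(x - med) for x in xs)


def sensitivity_curve(estimator, sample, x0):
    """Scaled change in the estimate when one extra observation x0 is added.

    Uses the convention SC(x0) = (n + 1) * (T(sample + [x0]) - T(sample)),
    where n is the size of the original sample.
    """
    n = len(sample)
    return (n + 1) * (estimator(sample + [x0]) - estimator(sample))


# Hypothetical, made-up real-valued sample without outliers.
clean = [4.8, 5.1, 4.9, 5.3, 5.0, 5.2, 4.7, 5.1, 4.9, 5.0]

# Add increasingly extreme observations and watch both estimators react.
for x0 in (5.0, 10.0, 100.0, 1e6):
    sc_sd = sensitivity_curve(statistics.stdev, clean, x0)
    sc_mad = sensitivity_curve(mad, clean, x0)
    print(f"x0={x0:>10}: SC(stdev)={sc_sd:.3f}  SC(MAD)={sc_mad:.3f}")
```

In this sketch the sensitivity curve of the sample standard deviation grows without bound as the added point moves away, whereas that of the median absolute deviation stays bounded, which is the kind of behaviour the breakdown point and sensitivity curves are meant to capture.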

Author information

Corresponding author

Correspondence to María Asunción Lubiano.

Additional information

The authors are very grateful for the insightful comments from the reviewers of the original manuscript. The research in this paper has been partially supported by Principality of Asturias Grants GRUPIN14-101, Research Contract E-33-2015-0040746 (for Sinova) and Severo Ochoa BP12012 (for De la Rosa de Sáa), and by the Spanish Ministry of Economy and Competitiveness Grant MTM2013-44212-P. Their financial support is gratefully acknowledged.

About this article

Cite this article

de la Rosa de Sáa, S., Lubiano, M.A., Sinova, B. et al. Robust scale estimators for fuzzy data. Adv Data Anal Classif 11, 731–758 (2017). https://doi.org/10.1007/s11634-015-0210-1

  • DOI: https://doi.org/10.1007/s11634-015-0210-1