Abstract
Observations distant from the majority or deviating from the general pattern often appear in datasets. Classical estimates such as the sample mean or the sample variance can be substantially affected by these observations (outliers). Even a single outlier can have a huge distorting influence. However, for real-valued data there exist robust measures/estimates of location and scale (dispersion) which reduce the influence of these atypical values and provide approximately the same results as the classical estimates applied to the typical data without outliers. In real life, the data to be analyzed and interpreted are not always precisely defined, and they cannot always be properly expressed on a numerical scale of measurement. Frequently, some of these imprecise data can be suitably described and modelled by means of a fuzzy rating scale of measurement. In this paper, several well-known scale (dispersion) estimators for the real-valued case are extended to random fuzzy numbers (i.e., random mechanisms generating fuzzy-valued data), and some of their properties as estimators of dispersion are examined. Furthermore, their robust behaviour is analyzed using two powerful tools, namely, the finite sample breakdown point and the sensitivity curves. Simulations, including empirical bias curves, are performed to complete the study.
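As an illustration of the real-valued setting that the abstract starts from (not of the fuzzy-valued estimators developed in the paper), the following minimal Python sketch compares the sample standard deviation with the median absolute deviation (MAD) when a single outlier is added, and evaluates an empirical sensitivity curve in the usual form (n + 1) · [T(x₁, …, xₙ, x) − T(x₁, …, xₙ)]. The function names `mad` and `sensitivity_curve` and the toy sample are assumptions made for this example only.

```python
import statistics


def mad(xs):
    """Unscaled median absolute deviation about the median (real-valued data)."""
    med = statistics.median(xs)
    return statistics.median([abs(x - med) for x in xs])


def sensitivity_curve(estimator, sample, x):
    """Empirical sensitivity curve: (n + 1) * (T(sample + [x]) - T(sample))."""
    n = len(sample)
    return (n + 1) * (estimator(sample + [x]) - estimator(sample))


# Hypothetical toy sample of typical observations, plus one gross outlier.
clean = [9.8, 10.1, 10.0, 9.9, 10.2, 10.0, 9.7, 10.3]
contaminated = clean + [1000.0]

if __name__ == "__main__":
    print("stdev clean / contaminated:",
          statistics.stdev(clean), statistics.stdev(contaminated))
    print("MAD   clean / contaminated:", mad(clean), mad(contaminated))
    for x in (1e3, 1e6):
        print("x =", x,
              "SC(stdev) =", sensitivity_curve(statistics.stdev, clean, x),
              "SC(MAD) =", sensitivity_curve(mad, clean, x))
```

In this toy sample the standard deviation is driven almost entirely by the added value, while the MAD changes only slightly and its sensitivity curve stays bounded as the outlier moves further away. This is the kind of behaviour that the breakdown point and sensitivity curves are meant to quantify.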
Additional information
The authors are very grateful for the insightful comments from the reviewers of the original manuscript. The research in this paper has been partially supported by/benefited from Principality of Asturias Grants GRUPIN14-101, Research Contract E-33-2015-0040746 (for Sinova) and Severo Ochoa BP12012 (for De la Rosa de Sáa), and the Spanish Ministry of Economy and Competitiveness Grant MTM2013-44212-P. Their financial support is gratefully acknowledged.
About this article
Cite this article
de la Rosa de Sáa, S., Lubiano, M.A., Sinova, B. et al. Robust scale estimators for fuzzy data. Adv Data Anal Classif 11, 731–758 (2017). https://doi.org/10.1007/s11634-015-0210-1
DOI: https://doi.org/10.1007/s11634-015-0210-1
Keywords
- Finite sample breakdown point
- Empirical bias curves
- Fuzzy numbers
- Random fuzzy numbers
- Robustness
- Scale estimation
- Sensitivity curves