Abstract
In this paper, we introduce weighted estimators of the location and dispersion of a multivariate data set with weights based on the ranks of the Mahalanobis distances. We discuss some properties of the estimators like the breakdown point, influence function and asymptotic variance. The outlier detection capacities of different weight functions are compared. A simulation study is given to investigate the finite-sample behavior of the estimators.
Similar content being viewed by others
References
Agulló J, Croux C, Van Aelst S (2008) The multivariate least trimmed squares estimator. J Multivariate Anal 99: 311–338
Butler RW, Davies PL, Jhun M (1993) Asymptotics for the minimum covariance determinant estimator. Ann Stat 21: 1385–1400
Croux C, Dehon C (2002) Analyse canonique basée sur des estimateurs robustes de la matrice de covariance. Revue de Statistique Appliquée 50: 5–26
Croux C, Haesbroeck G (1999) Influence function and efficiency of the minimum covariance determinant scatter matrix estimator. J Multivariate Anal 71: 161–190
Croux C, Haesbroeck G (2000) Principal component analysis based on robust estimators of the covariance or correlation matrix: influence functions and efficiencies. Biometrika 87: 603–618
Davies PL (1987) Asymptotic behaviour of S-estimates of multivariate location parameters and dispersion matrices. Ann Stat 15: 1269–1292
Donoho DL, Huber PJ (1983) The notion of breakdown point. In: Bickel PJ, Doksum KA, Hodges JL (eds) A festschrift for Erich Lehmann. Wadsworth, Belmont, pp 157–184
Hadi AS, Luceño A (1997) Maximum trimmed likelihood estimators: a unified approach, examples and algorithms. Computat Stat Data Anal 25: 251–272
Hampel FR, Ronchetti EM, Rousseeuw PJ, Stahel WA (1986) Robust statistics: the approach based on influence functions. Wiley, New York
Hettich S, Bay SD (1999) The UCI KDD Archive (http://kdd.ics.uci.edu), Department of Information and Computer Science, University of California, Irvine
Hössjer O (1994) Rank-based estimates in the linear model with high breakdown point. J Am Stat Assoc 89: 149–158
Kent JT, Tyler DE (1996) Constrained M-estimation for multivariate location and scatter. Ann Stat 24: 1346–1370
Lopuhaä HP (1989) On the relation between S-estimators and M-estimators of multivariate location and covariance. Ann Stat 17: 1662–1683
Lopuhaä HP (1991) Multivariate τ-estimators for location and scatter. Can J Stat 19: 307–321
Lopuhaä HP, Rousseeuw PJ (1991) Breakdown points of affine equivariant estimators of multivariate location and covariance matrices. Ann Stat 19: 229–248
Maronna RA (1976) Robust M-estimators of multivariate location and scatter. Ann Stat 4: 51–67
Maronna RA, Zamar RH (2002) Robust estimates of location and dispersion for high-dimensional datasets. Technometrics 44: 307–317
Masicek L (2004) Optimality of the least weighted squares estimator. Kybernetika 40: 715–734
Pison G, Rousseeuw PJ, Filzmoser P, Croux C (2003) Robust factor analysis. J Multivariate Anal 84: 145–172
Pison G, Van Aelst S (2004) Diagnostic plots for robust multivariate methods. J Computat Graph Stat 13: 310–329
Roelant E, Van Aelst S, Willems G (2008) The minimum weighted covariance determinant estimator. Technical report, available at http://users.ugent.be/~svaelst/publications
Rousseeuw PJ (1984) Least median of squares regression. J Am Stat Assoc 79: 871–880
Rousseeuw PJ, Van Aelst S, Van Driessen K, Agulló J (2004) Robust multivariate regression. Technometrics 46: 293–305
Rousseeuw PJ, Van Driessen K (1999) A fast algorithm for the minimum covariance determinant estimator. Technometrics 41: 212–223
Salibian-Barrera M, Van Aelst S, Willems G (2006) Principal components analysis based on multivariate MM-estimators with fast and robust bootstrap. J Am Stat Assoc 101: 1198–1211
Taskinen S, Croux C, Kankainen A, Ollila E, Oja H (2006) Influence functions and efficiencies of the canonical correlation and vector estimates based on scatter and shape matrices. J Multivariate Anal 97: 359–384
Tatsuoka KS, Tyler DE (2000) On the uniqueness of S-functionals and M-functionals under nonelliptical distributions. Ann Stat 28: 1219–1243
Van Aelst S, Willems G (2005) Multivariate regression S-estimators for robust estimation and inference. Stat Sin 15: 981–1001
Vandev DL, Neykov NM (1998) About regression estimators with high breakdown point. Statistics 32: 111–129
Visek JA (2001) Regression with high breakdown point. In: ROBUST’2000, Proceedings of the 11th conference on robust statistics, pp 324–356
Author information
Authors and Affiliations
Corresponding author
Additional information
The research of Stefan Van Aelst was supported by a grant of the Fund for Scientific Research-Flanders (FWO-Vlaanderen) and by IAP research network grant nr. P6/03 of the Belgian government (Belgian Science Policy).
Rights and permissions
About this article
Cite this article
Roelant, E., Van Aelst, S. & Willems, G. The minimum weighted covariance determinant estimator. Metrika 70, 177–204 (2009). https://doi.org/10.1007/s00184-008-0186-3
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00184-008-0186-3