Abstract
In this paper we estimate the parameters of a regression model using S-estimators of multivariate location and scatter. The approach is proven to be Fisher-consistent, and the influence functions are derived. The corresponding asymptotic variances are obtained and it is shown how they can be estimated in practice. A comparison with other recently proposed robust regression estimators is made.
Similar content being viewed by others
References
Barndorff-Nielsen, O. E. and Cox, D. R. (1989).Asymptotic Techniques for Use in Statistics, Chapman and Hall, London.
Chang, W. H., McKean, J. W., Naranjo, J. D. and Sheather, S. J. (1999). High-breakdown rank regression,J. Amer. Statist. Assoc.,94, 205–219.
Cheng, C. and Van Ness, J. W. (1997). Robust calibration,Technometrics,39, 401–411.
Coakley, C. W. and Hettmansperger, T. P. (1993). A bounded influence, high breakdown, efficient regression estimator,J. Amer. Statist. Assoc.,88, 872–880.
Cook, R. D., Hawkins, D. M. and Weisberg, S. (1992). Comparison of model misspecification diagnostics using residuals from least mean of squares and least median of squares fits,J. Amer. Statist. Assoc.,87, 419–424.
Cox, T. F. and Cox, M. A. A. (1994).Multidimensional Scaling, Chapman and Hall, London.
Croux, C. and Dehon, C. (2001). Robust linear discriminant analysis using S-estimators,Canad. J. Statist.,29, 473–492.
Croux, C. and Haesbroeck, G. (1999). Influence function and efficiency of the minimum covariance determinant scatter matrix estimator,J. Multivariate Anal.,71, 161–190.
Croux, C. and Haesbroeck, G. (2000). Principal component analysis based on robust estimators of the covariance or correlation matrix: Influence functions and efficiencies,Biometrika,87, 603–618.
Croux, C., Rousseeuw, P. J. and Hössjer, O. (1994). Generalized S-estimators,J. Amer. Statist. Assoc.,89, 1271–1281.
Croux, C., Dehon, C., Van Aelst, S. and Rousseeuw, P. (2001). Robust estimation of the conditional median function at elliptical models,Statist. Probab. Lett.,51, 361–368.
Davies, L. (1987). Asymptotic behavior of S-estimators of multivariate location parameters and dispersion matrices,Ann. Statist.,15, 1269–1292.
Ferretti, N., Kelmansky, D., Yohai, V. J. and Zamar, R. H. (1999). A class of locally and globally robust regression estimates,J. Amer. Statist. Assoc.,94, 174–188.
Hampel, F. R., Ronchetti, E. M., Rousseeuw, P. J. and Stahel, W. A. (1986).Robust Statistics: The Approach Based on Influence Functions, Wiley, New York.
Härdle, W. (1991).Smoothing Techniques with Implementation in S, Springer, New York.
Hawkins, D. M., Bradu, D. and Kass, G. V. (1984). Location of several outliers in multiple regression data using elemental sets,Technometrics,26, 197–208.
Hettmansperger, T. P. and Sheather, S. J. (1992). A cautionary note on the method of least median squares,Amer. Statist.,46, 79–83.
Iglewicz, B. (1982). Robust scale estimates,Understanding Robust and Exploratory Data Analysis (eds. D. C. Hoaglin, F. Mosteller and J. W. Tukey), 404–431, Wiley, New York.
Lopuhaä, H. P. (1989). On the relation between S-estimators and M-estimators of multivariate location and covariance,Ann. Statist.,17, 1662–1683.
Lopuhaä, H. P. (1997). Asymptotic expansion ofS-estimators of location and covariance,Statist. Neerlandica,51, 220–237.
Maronna, R. (1976). Robust M-estimates of multivariate location and scatter,Ann. Statist.,4, 51–67.
Maronna, R. and Morgenthaler, S. (1986). Robust regression through robust covariances,Comm. Statist. Theory Methods,15, 1347–1365.
McKean, J. W., Sheather, S. J. and Hettmansperger, T. P. (1990). Regression diagnostics for rank-based methods,J. Amer. Statist. Assoc.,85, 1018–1028.
McKean, J. W., Sheather, S. J. and Hettmansperger, T. P. (1993). The use and interpretation of residuals based on robust estimation,J. Amer. Statist. Assoc.,88, 1254–1263.
McKean, J. W., Naranjo, J. D. and Sheather, S. J. (1996). Diagnostics to detect differences in robust fits of linear models,Comput. Statist.,11, 223–243.
Naranjo, J. D. and Hettmansperger, T. P. (1994). Bounded influence rank regression,J. Roy. Statist. Soc. Ser. B,56, 209–220.
Pullman, N. J. (1976).Matrix Theory and Its Applications: Selected Topics, Marcel Dekker, New York.
Rousseeuw, P. J. (1984). Least median of squares regression,J. Amer. Statist. Assoc.,79, 871–880.
Rousseeuw, P. J. (1985). Multivariate estimation with high breakdown point,Mathematical Statistics and Applications, Vol. B (eds. W. Grossmann, G. Pflug, I. Vincze and W. Wertz), 283–297, Reidel, Dordrecht.
Rousseeuw, P. J. and Leroy, A. M. (1987).Robust Regression and Outlier Detection, Wiley, New York.
Rousseeuw, P. J. and van Zomeren, B. C. (1990). Unmasking multivariate, outliers and leverage points,J. Amer. Statist. Assoc.,85, 633–651.
Rousseeuw, P. J., Van Aelst, S., Van Driessen, K. and Agulló, J. (2000). Robust multivariate regression, Tech. Report, University of Antwerp.
Ruppert, D. (1992). Computing S-estimators for regression and multivariate location/dispersion,J. Comput. Graph. Statist.,1, 253–270.
Sheather, S. J., McKean, J. W. and Hettmansperger, T. P. (1997). Finite sample stability properties of the least median of squares estimator,J. Statist. Comput. Simulation,58, 371–383.
Simpson, D. G. and Yohai, V. J. (1998). Functional stability of one-step GM-estimators in approximately linear regression,Ann. Statist.,26, 1147–1169.
Simpson, D. G., Ruppert, D. and Carroll, R. J. (1992). On one-step GM estimates and stability of inferences in linear regression,J. Amer. Statist. Assoc.,87, 439–450.
Stahel, W. A. (1981). Robust estimation: Infinitesimal optimality and covariance matrix estimators, PhD. Thesis, ETH, Zurich (in German).
Visuri, S., Koivunen, V., Möttönen, J., Ollila, E. and Oja, H. (2002). Affine equivariant multivariate rank methods,J. Statist. Plann. Inference, (to appear).
Woodruff, D. L. and Rocke, D. M. (1994). Computable robust estimation of multivariate location and shape in high dimension using compound estimators,J. Amer. Statist. Assoc.,89, 888–896.
Yohai, V. J. (1987). High breakdown point and high efficiency robust estimates, for regression,Ann. Statist.,15, 642–656.
Yohai, V. J. and Zamar, R. H. (1988). High breakdown-point estimates of regression by means of the minimization of an efficient scale,J. Amer. Statist. Assoc.,83, 406–413.
Yohai, V. J., Stahel, W. A. and Zamar, R. H. (1991) A procedure for robust estimation and inference in linear regression,Directions in Robust Statistics and Diagnostics (eds. W. Stahel and S. Weisberg), 365–374, Springer, New York.
Author information
Authors and Affiliations
About this article
Cite this article
Croux, C., Van Aelst, S. & Dehon, C. Bounded influence regression using high breakdown scatter matrices. Ann Inst Stat Math 55, 265–285 (2003). https://doi.org/10.1007/BF02530499
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/BF02530499