Abstract
The covariance matrix is a key component of many multivariate robust procedures, whether or not the data are assumed to be Gaussian. We examine the idea of robustly fitting a mixture of multivariate Gaussian densities in the situation when the number of components estimated is intentionally too few. Using a minimum distance criterion, we show how useful results may be obtained in practice. Application areas are numerous, and examples will be provided.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
M. Aitkin and G. T. Wilson, Mixture Models, Outliers, and the EM Algorithm. Tech-nometrics 22 (1980), 325–331.
D. F. Andrews, P. J. Bickel, F. R. Hampel, RJ. Huber, W. H. Rogers, and J. W. Tukey, Robust Estimates of Location: Survey and Advances. Princeton University Press, Princeton, 1972.
V. Barnett and T. Lewis, Outliers in Statistical Data. John Wiley & Sons, New York, 1994.
A. Basu, I. R. Harris, H. L. Hjort, and M. C. Jones, Robust and Efficient Estimation by Minimising a Density Power Divergence. Biometrika 85 (1998), 549–560.
R. Beran, Robust Location Estimates, Ann. Statist. 5 (1977), 431–444.
R. D. Cook, D. M. Hawkins, and S. Weisberg, Exact Iterative Computation of the Robust Multivariate Minimum Volume Ellipsoid Estimator. Statist. Probab. Lett. 16 (1993), 213–218.
A. P. Dempster, N. M. Laird, and D. B. Rubin, Maximum Likelihood from Incomplete Data via the EM Algorithm. J. Roy. Statist. Soc. Ser. B 39 (1977), 1–22.
D. Harrison and D. L. Rubinfeld, Hedonic Housing Prices and the Demand for Clean Air. J. Environ. Econ. Manag. 5 (1978), 81–102.
H. L. Hjort, Minimum L2 and Robust Kullback-Leibler Estimation. In P. Lachout and J.Á. Víšek, editors, Proceedings of the 12th Prague Conference on Information Theory, Statistical Decision Functions and Random Processes, pages 102–105. Prague Academy of Sciences of the Czech Republic, 1994.
P. J. Huber, Robust Statistics. John Wiley & Sons, New York, 1981.
D. Kim, Least Squares Mixture Decomposition Estimation. Unpublished doctoral dissertation, Department of Statistics, Virginia Tech, Blacksburg, VA, 1995.
G. J. McLachlan and D. Peel, Finite Mixture Models. John Wiley & Sons, New York, 2001.
W. L. Poston, E. J. Wegman, C. E. Priebe, and J. L. Solka, A Deterministic Method for Robust Estimation of Multivariate Location and Shape, J. Comput. Graph. Statist. 6 (1997), 300–313.
P. J. Rousseeuw, Multivariate Estimation with High Breakdown Point. In W. Gross-mann, G. Pflug, I. Vincze, and W. Wertz, editors, Mathematical Statistics and Applications, Vol B, pages 283–297. Reidel Publishing Company, Dordrecht, 1985.
P. J. Rousseeuw and A. M. Leroy, Robust Regression and Outlier Detection, John Wiley & Sons, New York, 1987.
M. Rudemo, Empirical Choice of Histogram and Kernel Density Estimators. Scand. J. Statist. 9 (1982), 65–78.
D. W. Scott, Multivariate Density Estimation: Theory, Practice, and Visualization. John Wiley & Sons, New York, 1992.
D. W. Scott, Remarks on Fitting and Interpreting Mixture Models. In K. Berk and M. Pourahmadi, editors, Computing Science and Statistics 31, pages 104–109, 1999.
D. W. Scott, Parametric Statistical Modeling by Minimum Integrated Square Error. Technometrics 43 (2001), 274–285.
D. W. Scott and J. B. Christian, Finding Outliers in Models of Spatial Data. In E. Hovy, editor, Proceedings of the Third National Conference on Digital Government. Digital Government Research Center, Boston, 2003, available at http://www.dgrc.org/dgo2003/start.html.
D. W. Scott and W. F. Szewczyk, From Kernels to Mixtures. Technometrics 43 (2001), 323–335.
D. W. Scott and W. F. Szewczyk, The Stochastic Mode Tree and Clustering. J. Comput. Graph. Statist. 12 (2004), in press.
D. F. Swayne, D. Cook, and A. Buja, XGobi: Interactive Dynamic Data Visualization in the X Window System. J. Comput. Graph. Statist. 7 (1998), 113–130.
G. R. Terrell, Linear Density Estimates. In Proceedings of the Statistical Computing Section of the Americ. Statist. Assoc, pages 297–302, 1990.
W. C. Wojciechowski and D. W. Scott, Robust Location Estimation with L2 Distance. In K. Berk and M. Pourahmadi, editors, Computing Science and Statistics 31, pages 292–295, 1999.
D. L. Woodruff and D. M. Rocke, Computable Robust Estimation of Multivariate Location and Shape in High Dimension using Compound Estimators. J. Amer. Statist. Assoc. 89 (1994), 888–896.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer Basel AG
About this paper
Cite this paper
Scott, D.W. (2004). Partial Mixture Estimation and Outlier Detection in Data and Regression. In: Hubert, M., Pison, G., Struyf, A., Van Aelst, S. (eds) Theory and Applications of Recent Robust Methods. Statistics for Industry and Technology. Birkhäuser, Basel. https://doi.org/10.1007/978-3-0348-7958-3_26
Download citation
DOI: https://doi.org/10.1007/978-3-0348-7958-3_26
Publisher Name: Birkhäuser, Basel
Print ISBN: 978-3-0348-9636-8
Online ISBN: 978-3-0348-7958-3
eBook Packages: Springer Book Archive