Skip to main content

Partial Mixture Estimation and Outlier Detection in Data and Regression

  • Conference paper
Theory and Applications of Recent Robust Methods

Part of the book series: Statistics for Industry and Technology ((SIT))

Abstract

The covariance matrix is a key component of many multivariate robust procedures, whether or not the data are assumed to be Gaussian. We examine the idea of robustly fitting a mixture of multivariate Gaussian densities in the situation when the number of components estimated is intentionally too few. Using a minimum distance criterion, we show how useful results may be obtained in practice. Application areas are numerous, and examples will be provided.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. M. Aitkin and G. T. Wilson, Mixture Models, Outliers, and the EM Algorithm. Tech-nometrics 22 (1980), 325–331.

    MATH  Google Scholar 

  2. D. F. Andrews, P. J. Bickel, F. R. Hampel, RJ. Huber, W. H. Rogers, and J. W. Tukey, Robust Estimates of Location: Survey and Advances. Princeton University Press, Princeton, 1972.

    MATH  Google Scholar 

  3. V. Barnett and T. Lewis, Outliers in Statistical Data. John Wiley & Sons, New York, 1994.

    MATH  Google Scholar 

  4. A. Basu, I. R. Harris, H. L. Hjort, and M. C. Jones, Robust and Efficient Estimation by Minimising a Density Power Divergence. Biometrika 85 (1998), 549–560.

    Article  MathSciNet  MATH  Google Scholar 

  5. R. Beran, Robust Location Estimates, Ann. Statist. 5 (1977), 431–444.

    Article  MathSciNet  MATH  Google Scholar 

  6. R. D. Cook, D. M. Hawkins, and S. Weisberg, Exact Iterative Computation of the Robust Multivariate Minimum Volume Ellipsoid Estimator. Statist. Probab. Lett. 16 (1993), 213–218.

    Article  MathSciNet  Google Scholar 

  7. A. P. Dempster, N. M. Laird, and D. B. Rubin, Maximum Likelihood from Incomplete Data via the EM Algorithm. J. Roy. Statist. Soc. Ser. B 39 (1977), 1–22.

    MathSciNet  MATH  Google Scholar 

  8. D. Harrison and D. L. Rubinfeld, Hedonic Housing Prices and the Demand for Clean Air. J. Environ. Econ. Manag. 5 (1978), 81–102.

    Article  MATH  Google Scholar 

  9. H. L. Hjort, Minimum L2 and Robust Kullback-Leibler Estimation. In P. Lachout and J.Á. Víšek, editors, Proceedings of the 12th Prague Conference on Information Theory, Statistical Decision Functions and Random Processes, pages 102–105. Prague Academy of Sciences of the Czech Republic, 1994.

    Google Scholar 

  10. P. J. Huber, Robust Statistics. John Wiley & Sons, New York, 1981.

    Book  MATH  Google Scholar 

  11. D. Kim, Least Squares Mixture Decomposition Estimation. Unpublished doctoral dissertation, Department of Statistics, Virginia Tech, Blacksburg, VA, 1995.

    Google Scholar 

  12. G. J. McLachlan and D. Peel, Finite Mixture Models. John Wiley & Sons, New York, 2001.

    Google Scholar 

  13. W. L. Poston, E. J. Wegman, C. E. Priebe, and J. L. Solka, A Deterministic Method for Robust Estimation of Multivariate Location and Shape, J. Comput. Graph. Statist. 6 (1997), 300–313.

    MathSciNet  Google Scholar 

  14. P. J. Rousseeuw, Multivariate Estimation with High Breakdown Point. In W. Gross-mann, G. Pflug, I. Vincze, and W. Wertz, editors, Mathematical Statistics and Applications, Vol B, pages 283–297. Reidel Publishing Company, Dordrecht, 1985.

    Chapter  Google Scholar 

  15. P. J. Rousseeuw and A. M. Leroy, Robust Regression and Outlier Detection, John Wiley & Sons, New York, 1987.

    Book  MATH  Google Scholar 

  16. M. Rudemo, Empirical Choice of Histogram and Kernel Density Estimators. Scand. J. Statist. 9 (1982), 65–78.

    MathSciNet  MATH  Google Scholar 

  17. D. W. Scott, Multivariate Density Estimation: Theory, Practice, and Visualization. John Wiley & Sons, New York, 1992.

    Book  MATH  Google Scholar 

  18. D. W. Scott, Remarks on Fitting and Interpreting Mixture Models. In K. Berk and M. Pourahmadi, editors, Computing Science and Statistics 31, pages 104–109, 1999.

    Google Scholar 

  19. D. W. Scott, Parametric Statistical Modeling by Minimum Integrated Square Error. Technometrics 43 (2001), 274–285.

    Article  MathSciNet  Google Scholar 

  20. D. W. Scott and J. B. Christian, Finding Outliers in Models of Spatial Data. In E. Hovy, editor, Proceedings of the Third National Conference on Digital Government. Digital Government Research Center, Boston, 2003, available at http://www.dgrc.org/dgo2003/start.html.

    Google Scholar 

  21. D. W. Scott and W. F. Szewczyk, From Kernels to Mixtures. Technometrics 43 (2001), 323–335.

    Article  MathSciNet  Google Scholar 

  22. D. W. Scott and W. F. Szewczyk, The Stochastic Mode Tree and Clustering. J. Comput. Graph. Statist. 12 (2004), in press.

    Google Scholar 

  23. D. F. Swayne, D. Cook, and A. Buja, XGobi: Interactive Dynamic Data Visualization in the X Window System. J. Comput. Graph. Statist. 7 (1998), 113–130.

    Google Scholar 

  24. G. R. Terrell, Linear Density Estimates. In Proceedings of the Statistical Computing Section of the Americ. Statist. Assoc, pages 297–302, 1990.

    Google Scholar 

  25. W. C. Wojciechowski and D. W. Scott, Robust Location Estimation with L2 Distance. In K. Berk and M. Pourahmadi, editors, Computing Science and Statistics 31, pages 292–295, 1999.

    Google Scholar 

  26. D. L. Woodruff and D. M. Rocke, Computable Robust Estimation of Multivariate Location and Shape in High Dimension using Compound Estimators. J. Amer. Statist. Assoc. 89 (1994), 888–896.

    Article  MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer Basel AG

About this paper

Cite this paper

Scott, D.W. (2004). Partial Mixture Estimation and Outlier Detection in Data and Regression. In: Hubert, M., Pison, G., Struyf, A., Van Aelst, S. (eds) Theory and Applications of Recent Robust Methods. Statistics for Industry and Technology. Birkhäuser, Basel. https://doi.org/10.1007/978-3-0348-7958-3_26

Download citation

  • DOI: https://doi.org/10.1007/978-3-0348-7958-3_26

  • Publisher Name: Birkhäuser, Basel

  • Print ISBN: 978-3-0348-9636-8

  • Online ISBN: 978-3-0348-7958-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics