Skip to main content

Testing Outliers in Multivariate Data

  • Chapter

Part of the book series: NATO Advanced study Institutes Series ((ASIC,volume 79))

Summary

Given n random observations on a p-dimensional random vector x, the problem is to test whether a specified number (usually~small) of suspected observations are outliers (too discordant as compared to the bulk of observations). As a generalization of Tiku’s (1975, 1977) univariate statistic, we propose a statistic g for testing a specified number of outliers in multivariate data; g is the ratio of the product of robust estimators (Tiku, 1980) to the product of ordinary estimators of the scale parameters. For the multivariate normal, g is shown to be considerably more powerful than the prominent statistic R (restricted to the multivariate normal) due to Wilks (1963) under location shifts (model A; Barnett and Lewis, 1978) although slightly less powerful under scale changes (model B; Barnett and Lewis). Like R, g is not sensitive to changes in correlations (orientation). The statistic g can be used (under models A or B) for testing outliers in samples from any multivariate distribution whose marginal distributions are of the type (l/σ)f((x-μ)/σ).

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Barnett, V. (1978). The study of outliers: purpose and model. Applied Statistics, 27, 242–250.

    Article  Google Scholar 

  • Barnett, V. (1979). Some outlier tests for multivariate samples. South African Statistical Journal, 13, 29–52

    Google Scholar 

  • Barnett, V., Lewis, T. (1978). Outliers in Statistical Data. Wiley, New York.

    MATH  Google Scholar 

  • Cox, D.R. (1968). Notes on some aspects of regression analysis. Journal of the Royal Statistical Society, Series A, 131, 265–279.

    Article  Google Scholar 

  • David, H.A., Paulson, A.S. (1965). The performance of several tests for outliers. Biometrika, 52, 429–436.

    MathSciNet  MATH  Google Scholar 

  • Gnanadesikan, R., Kettenring, J.R. (1972). Robust estimates, residuals, and outlier detection with multiresponse data. Biometrics, 28, 81–124.

    Article  Google Scholar 

  • Hawkins, D.M. (1977). Comment on “A new statistic for testing suspected outliers”. Communications in Statistics, A6, 435–438.

    Article  MathSciNet  Google Scholar 

  • Healy, M.J.R. (1968). Multivariate normal plotting. Applied Statistics, 17, 157–161.

    Article  MathSciNet  Google Scholar 

  • Johnson, N.L., Nixon, E., Amos, D.E., Pearson, E.S. (1963). Table of percentage points of Pearson curves. Biometrika, 50, 459–498.

    MathSciNet  MATH  Google Scholar 

  • Johnson, N.L., Kotz, S. (1972). Distributions in Statistics: Continuous Multivariate Distributions. Wiley, New York.

    MATH  Google Scholar 

  • Siotani, M. (1959). The extreme value of the generalized distances of the individual points in the multivariate normal sample. Annals of the Institute of Statistical Mathematics, 10, 183–208.

    Article  MathSciNet  MATH  Google Scholar 

  • Tietjen, G.L., Moore, R.H. (1972). Some Grubbs-type statistics for the detection of several outliers. Technometrics, 14, 583–597.

    Article  Google Scholar 

  • Tiku, M.L. (1967). Estimating the mean and standard deviation from censored normal samples. Biometrika, 54, 155–165.

    MathSciNet  Google Scholar 

  • Tiku, M.L. (1970). Monte Carlo study of some simple estimators in censored normal samples. Biometrika, 57, 207–210.

    Article  MathSciNet  MATH  Google Scholar 

  • Tiku, M.L. (1975). A new statistic for testing suspected outliers. Communications in Statistics, 4, 737–752.

    Article  MathSciNet  Google Scholar 

  • Tiku, M.L. (1977). Rejoinder: “Comment on ‘A new statistic for testing suspected outliers’”. Communications in Statistics, A6, 1417–1422.

    Article  Google Scholar 

  • Tiku, M.L. (1978). Linear regression model with censored observations. Communications in Statistics, A7, 1219–1232.

    Article  MathSciNet  Google Scholar 

  • Tiku, M.L. Singh, M. (1980). Robustness of MML estimators based on censored samples and robust test statistics. Journal of Statistical Planning and Interference, 4, (123–143).

    Article  MATH  Google Scholar 

  • Tiku, M.L. Singh, M. (1980). Robust estimation of the variance-covariance matrix and its use in testing an assumed multivariate distribution, (submitted to JSPI for publication).

    Google Scholar 

  • Wilks, S.S. (1963). Multivariate statistical outliers. Sankhya, Series A, 25, 407–426.

    MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1981 D. Reidel Publishing Company

About this chapter

Cite this chapter

Tiku, M.L., Singh, M. (1981). Testing Outliers in Multivariate Data. In: Taillie, C., Patil, G.P., Baldessari, B.A. (eds) Statistical Distributions in Scientific Work. NATO Advanced study Institutes Series, vol 79. Springer, Dordrecht. https://doi.org/10.1007/978-94-009-8552-0_17

Download citation

  • DOI: https://doi.org/10.1007/978-94-009-8552-0_17

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-94-009-8554-4

  • Online ISBN: 978-94-009-8552-0

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics