Summary
Given n random observations on a p-dimensional random vector x, the problem is to test whether a specified number (usually~small) of suspected observations are outliers (too discordant as compared to the bulk of observations). As a generalization of Tiku’s (1975, 1977) univariate statistic, we propose a statistic g for testing a specified number of outliers in multivariate data; g is the ratio of the product of robust estimators (Tiku, 1980) to the product of ordinary estimators of the scale parameters. For the multivariate normal, g is shown to be considerably more powerful than the prominent statistic R (restricted to the multivariate normal) due to Wilks (1963) under location shifts (model A; Barnett and Lewis, 1978) although slightly less powerful under scale changes (model B; Barnett and Lewis). Like R, g is not sensitive to changes in correlations (orientation). The statistic g can be used (under models A or B) for testing outliers in samples from any multivariate distribution whose marginal distributions are of the type (l/σ)f((x-μ)/σ).
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Barnett, V. (1978). The study of outliers: purpose and model. Applied Statistics, 27, 242–250.
Barnett, V. (1979). Some outlier tests for multivariate samples. South African Statistical Journal, 13, 29–52
Barnett, V., Lewis, T. (1978). Outliers in Statistical Data. Wiley, New York.
Cox, D.R. (1968). Notes on some aspects of regression analysis. Journal of the Royal Statistical Society, Series A, 131, 265–279.
David, H.A., Paulson, A.S. (1965). The performance of several tests for outliers. Biometrika, 52, 429–436.
Gnanadesikan, R., Kettenring, J.R. (1972). Robust estimates, residuals, and outlier detection with multiresponse data. Biometrics, 28, 81–124.
Hawkins, D.M. (1977). Comment on “A new statistic for testing suspected outliers”. Communications in Statistics, A6, 435–438.
Healy, M.J.R. (1968). Multivariate normal plotting. Applied Statistics, 17, 157–161.
Johnson, N.L., Nixon, E., Amos, D.E., Pearson, E.S. (1963). Table of percentage points of Pearson curves. Biometrika, 50, 459–498.
Johnson, N.L., Kotz, S. (1972). Distributions in Statistics: Continuous Multivariate Distributions. Wiley, New York.
Siotani, M. (1959). The extreme value of the generalized distances of the individual points in the multivariate normal sample. Annals of the Institute of Statistical Mathematics, 10, 183–208.
Tietjen, G.L., Moore, R.H. (1972). Some Grubbs-type statistics for the detection of several outliers. Technometrics, 14, 583–597.
Tiku, M.L. (1967). Estimating the mean and standard deviation from censored normal samples. Biometrika, 54, 155–165.
Tiku, M.L. (1970). Monte Carlo study of some simple estimators in censored normal samples. Biometrika, 57, 207–210.
Tiku, M.L. (1975). A new statistic for testing suspected outliers. Communications in Statistics, 4, 737–752.
Tiku, M.L. (1977). Rejoinder: “Comment on ‘A new statistic for testing suspected outliers’”. Communications in Statistics, A6, 1417–1422.
Tiku, M.L. (1978). Linear regression model with censored observations. Communications in Statistics, A7, 1219–1232.
Tiku, M.L. Singh, M. (1980). Robustness of MML estimators based on censored samples and robust test statistics. Journal of Statistical Planning and Interference, 4, (123–143).
Tiku, M.L. Singh, M. (1980). Robust estimation of the variance-covariance matrix and its use in testing an assumed multivariate distribution, (submitted to JSPI for publication).
Wilks, S.S. (1963). Multivariate statistical outliers. Sankhya, Series A, 25, 407–426.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1981 D. Reidel Publishing Company
About this chapter
Cite this chapter
Tiku, M.L., Singh, M. (1981). Testing Outliers in Multivariate Data. In: Taillie, C., Patil, G.P., Baldessari, B.A. (eds) Statistical Distributions in Scientific Work. NATO Advanced study Institutes Series, vol 79. Springer, Dordrecht. https://doi.org/10.1007/978-94-009-8552-0_17
Download citation
DOI: https://doi.org/10.1007/978-94-009-8552-0_17
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-009-8554-4
Online ISBN: 978-94-009-8552-0
eBook Packages: Springer Book Archive