Summary
This paper reviews models for the occurrence of outliers in data from the linear model. The Bayesian analyses are all closely similar in form, but differ in the way they treat suspected outliers. The models are compared on Darwin’s data and one of them is used on data from a 25 factorial experiment.
The question of how many outliers are present involves comparison of models with different numbers of parameters. A solution using proper priors on all parameters is given. On two trial datasets it is found to be insensitive to choice of priors on all except the parameters representing the amount of contamination in the outliers. Here, choice of even a slightly “wrong” prior can be very misleading. Moreover, it is difficult to choose an appropriate prior when contaminations can be both positive or negative.
Similar content being viewed by others
References
ABRAHAM, B. and BOX, G.E.P. (1978). Linear models and spurious observations.Appl. Statist. 27, 131–8.
AKAIKE, H. (1973). Information theory and an extension of the maximum likelihood principle. In2nd International Symposium on Information Theory (B.N. Petrov and F. Csaki, eds.) 267–281, Budapest, Akademia Kiado.
BESAG, J. (1979). Exploratory data analysis.Invited paper to R.S.S. Conference, Oxford, April 2–6.
BOX, G.E.P. and TIAO, G.C. (1968). A Bayesian approach to some outlier problems.Biometrika 55, 119–29.
FISHER, R.A. (1960).The design of experiments (7th ed.) Oliver and Boyd: Edinburgh.
GENTLE, J.E. (1978). Testing for outliers in linear regression. InContributions to survey sampling and applied statistics (H.A. David ed.) 223–233. New York: Academic Press.
GUTTMAN, I., DUTTER, R. and FREEMAN, P.R. (1978). Care and handling of univariate outliers in the general linear model to detect spuriosity—a Bayesian approach.Technometrics 20, 187–193
JEFFREYS, H. (1961).Theory of probability. Oxford: University Press.
JOHN, J.A. (1978). Outliers in factorial experiments.Appl. Statist. 27, 111–9.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Freeman, P.R. On the number of outliers in data from a linear model. Trabajos de Estadistica Y de Investigacion Operativa 31, 349–365 (1980). https://doi.org/10.1007/BF02888359
Issue Date:
DOI: https://doi.org/10.1007/BF02888359