Abstract
Loglinear Poisson models are commonly used to analyse contingency tables. So far, robustness of parameter estimators as well as outlier detection have rarely been treated in this context. We start with finite-sample breakdown points. We yield that the breakdown point of mean value estimators determines a lower bound for a masking breakdown point of a class of one-step outlier identification rules. Within a more refined breakdown approach, which takes account of the structure of the contingency table, a stochastic breakdown function is defined. It returns the probability that a given proportion of outliers is randomly placed at such a pattern, where breakdown is possible. Finally, the introduced breakdown concepts are applied to characterise the maximum likelihood estimator and a median-polish estimator.
Similar content being viewed by others
References
Becker C (1996) Bruchpunkt und Bias zur Beurteilung multivariater Ausreißeridentifizierung. Ph.D. Thesis, Department of Statistics, University of Dortmund
Becker C (2002) Robustness concepts for analyzing structured and complex data sets. Habilitation thesis, Department of Statistics, University of Dortmund, Germany
Becker C, Gather U (1999) The masking breakdown point of multivariate outlier identification rules. J Am Stat Assoc 94: 947–955
Ben MG, Marinez E, Yohai VJ (2006) Robust estimation for the multivariate linear model based on a τ-scale. J Multivar Anal 97: 1600–1622
Bishop YMM, Fienberg SE, Holland PW (1975) Discrete multivariate analysis. MIT Press, Cambridge
Brown MB (1974) Identification of the source of significance in two-way contingency tables. Appl Stat 23: 405–413
Davies L, Gather U (1993) The identification of multiple outliers. J Am Stat Assoc 88: 782–792
Davies L, Gather U (2004) Robust statistics. In: Gentle JE, Härdle W, Mori Y (eds) Handbook of Computational Statistics, Chapter III.9. Springer, Berlin, pp 655–695
Donoho DL, Huber PJ (1983) The notion of breakdown point. In: Bickel PJ, Doksum KA, Hodges JL (eds) A Festschrift for Erich Lehmann. Wadswort, Belmont, CA, pp 157–184
Emerson JD, Hoaglin DC (1983) Analysis of two-way tables by medians. In: Hoaglin DC, Mosteller F, Tukey JW (eds) Understanding robust and exploratory data analysis. Wiley, New York
Fieller NRJ (1976) Some problems related to the rejection of outlying observations. Ph.D. thesis, University of Hull, Great Britain
Fisher RA (1956) Statistical methods and scientific inference. Hafner, New York
Fuchs C, Kenett R (1980) A test for detecting outlying cells in the multinomial distribution and two-way contingency tables. J Am Stat Assoc 75: 395–398
Gallegos MT, Ritter G (2005) A robust method for cluster analysis. Ann Stat 33: 347–380
Gather U, Becker C, Kuhnt S (2000) Robust methods for complex data structures. In: Kiers HAL, Rasson J-P, Groenen PJF, Schader M (eds) Data Analysis, Classification and Related Methods. Springer, Berlin, pp 315–320
Hampel FR (1968) Contribution to the theory of robust estimation. Ph.D. Thesis, University of California, Berkeley
Hampel FR (1971) A general qualitative definition of robustness. Ann Math Stat 42: 1887–1896
Hampel FR, Ronchetti EM, Rousseeuw PJ, Stahel WA (1986) Robust statistics: the approach based on influence functions. Wiley, New York
Huber PJ (1981) Robust statistics. Wiley, New York
Hubert M (1997) The breakdown value of the L 1 estimator in contingency tables. Stat Probab Lett 33: 419–425
Henning C (2004) Breakdown points for maximum likelihood estimators of location-scale mixtures. Ann Stat 32: 1313–1340
Kuhnt S (2000) Ausreißeridentifikation im Loglinearen Poissonmodell für Kontingenztafeln unter Einbeziehung robuster Schätzer. Ph.D. Thesis, Department of Statistics, University of Dortmund, Germany
Kuhnt S (2004) Outlier identification procedure for contingency tables using maximum likelihood and L1 estimates. Scand J Stat 31: 431–442
Madalla, GS, Rao, CR (eds) (1997) Handbook of statistics 15: Robust Inference. Elsevier Science, Amsterdam
Mosteller F, Parunak A (1985) Identifying extreme cells in a sizable contingency table: Probabilistic and exploratory approaches. In: Hoaglin DC, Mosteller F, Tukey JW (eds) Exploring data tables, trends and shapes. Wiley, New York, pp 189–224
Murphy RB (1951) On tests for outlying observations. Ph.D. Thesis, Princeton University, Ann Arbor, Michigan
Rousseeuw PJ, Leroy AM (1987) Robust regression and outlier detection. Wiley, New York
Salibian-Barrera M, Van Aelst S, Willems G (2006) Principal components analysis based on multivariate MM estimators with fast and robust bootstrap. J Am Stat Assoc 101: 1198–1211
Shane KV, Simonoff JS (2001) A robust approach to categorical data analysis. J Comput Graph Stat 10: 135–157
Simonoff JS (1988) Detecting outlying cells in two-way contingency tables via backwards-stepping. Technometrics 30: 339–345
Terbeck W (1996). Interaktionen in der Zwei-Faktoren-Varianzanalyse. PhD Thesis, Department of Mathematics and Informatics, University of Essen, Germany
Terbeck W, Davies L (1998) Interactions and outliers in the two-way analysis of variance. Ann Stat 26: 1279–1305
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Kuhnt, S. Breakdown concepts for contingency tables. Metrika 71, 281–294 (2010). https://doi.org/10.1007/s00184-008-0230-3
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00184-008-0230-3