Skip to main content

Abstract

Observations which seem to deviate strongly from the main part of the data may occur in every statistical analysis. These observations, usually labelled as outliers, may cause completely misleading results when using standard methods and may also contain information about special events or dependencies. We discuss outliers in situations where a generalized linear model is assumed as null model for the regular data and introduce rules for their identification. For the special cases of a loglinear Poisson model and a logistic regression model some one-step identifiers based on robust and non-robust estimators are proposed and compared.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • BARNETT, V. and LEWIS, T. (1994): Outliers in Statistical Data. 3rd ed., Wiley, New York.

    Google Scholar 

  • BECKER, C. and GATHER, U. (1999): The Masking Breakdown Point of Multivariate Outlier Identification Rules. Journal of the American Statistical Association, 94, 947–955.

    Article  MathSciNet  Google Scholar 

  • CHRISTMANN, A. (2001): Robust Estimation in Generalized Linear Models. In: J. Kunert and G. Trenkler (Eds.) Mathematical Statistics with Applications in Biometry: Festschrift in Honour of Siegfried Schach. Eul-Verlag, Lohmar, 215–230.

    Google Scholar 

  • DAVIES, P.L. and GATHER, U. (1993): The Identification of Outliers. Journal of the American Statistical Association, 88, 782–792.

    Article  MathSciNet  Google Scholar 

  • HUBERT, M. (1997): The Breakdown Value of the L1 Estimator in Contingency Tables. Statistics and Probability Letters, 33, 419–425.

    Article  MATH  MathSciNet  Google Scholar 

  • GATHER, U., KUHNT, S., and PAWLITSCHKO, J. (2003): Concepts of Outlyingness for Various Data Structures. In: J.C. Misra (Ed.): Industrial Mathematics and Statistics. Narosa Publishing House, New Dehli, 545–585.

    Google Scholar 

  • KUHNT, S. (2000): Ausreißeridentifikation im Loglinearen Poissonmodell für Kontingenztafeln unter Einbeziehung robuster Schätzer. Dissertation, Department of Statistics, University of Dortmund, Germany.

    Google Scholar 

  • MOSTELLER, F. and PARUNAK, A. (1985): Identifying Extreme Cells in a Sizable Contingency Table: Probabilistic and Exploratory Approaches. In: D.C. Hoaglin, F. Mosteller, and J.W. Tukey (Eds.): Exploring Data Tables, Trends and Shapes. Wiley, New York, 189–224.

    Google Scholar 

  • MYERS, R.H., MONTGOMERY, D.C, and VINING, G.C. (2002): Generalized Linear Models. Wiley, New York.

    Google Scholar 

  • NELDER, J.A. and WEDDERBURN, R.W.M. (1972): Generalized Linear Models. Journal of the Royal Statistical Society A, 134, 370–384.

    Google Scholar 

  • SHANE, K.V. and SIMONOFF, J.S. (2001): A Robust Approach to Categorical Data Analysis. Journal of Computational and Graphical Statistics, 10, 135–157.

    Article  MathSciNet  Google Scholar 

  • YICK, J.S. and LEE, A.H. (1998): Unmasking Outliers in Two-Way Contingency Tables. Computational Statistics & Data Analysis, 29, 69–79.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin · Heidelberg

About this paper

Cite this paper

Kuhnt, S., Pawlitschko, J. (2005). Outlier Identification Rules for Generalized Linear Models. In: Baier, D., Wernecke, KD. (eds) Innovations in Classification, Data Science, and Information Systems. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-26981-9_20

Download citation

Publish with us

Policies and ethics