Outlier Identification Rules for Generalized Linear Models

Kuhnt, Sonja; Pawlitschko, Jörg

doi:10.1007/3-540-26981-9_20


Part of the book series: Studies in Classification, Data Analysis, and Knowledge Organization (STUDIES CLASS)


Abstract

Observations that seem to deviate strongly from the main part of the data may occur in any statistical analysis. These observations, usually labelled outliers, may cause completely misleading results when standard methods are used, and may also contain information about special events or dependencies. We discuss outliers in situations where a generalized linear model is assumed as the null model for the regular data and introduce rules for their identification. For the special cases of a loglinear Poisson model and a logistic regression model, some one-step identifiers based on robust and non-robust estimators are proposed and compared.
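
To make the idea of a one-step identifier concrete, the following is a minimal sketch for the Poisson case. It is not the chapter's procedure: it assumes that an observation counts as an outlier when it falls in a low-probability region of the fitted Poisson distribution with total mass at most alpha, and it takes the fitted means as given, whether they come from a robust or a non-robust estimator. The names `inlier_bounds`, `flag_outliers`, and the default `alpha` are hypothetical.

```python
import math


def poisson_pmf(y, mu):
    """Poisson probability of count y under mean mu, computed on the log scale."""
    return math.exp(y * math.log(mu) - mu - math.lgamma(y + 1))


def inlier_bounds(mu, alpha=0.01):
    """Smallest and largest count NOT in the assumed outlier region.

    Assumption: the outlier region collects the least probable counts
    until adding one more would push its total mass above alpha.
    """
    if mu <= 0:
        raise ValueError("fitted mean must be positive")
    # Finite grid of counts; the mass beyond it is treated as outlying.
    y_max = int(mu + 10 * math.sqrt(mu) + 20)
    pmf = [poisson_pmf(y, mu) for y in range(y_max + 1)]
    mass = max(0.0, 1.0 - sum(pmf))
    outlying = set()
    for y in sorted(range(y_max + 1), key=lambda k: pmf[k]):
        if mass + pmf[y] > alpha:
            break
        mass += pmf[y]
        outlying.add(y)
    inliers = [y for y in range(y_max + 1) if y not in outlying]
    return min(inliers), max(inliers)


def flag_outliers(counts, fitted_means, alpha=0.01):
    """One pass over all observations: True where a count is outlying."""
    flags = []
    for y, mu in zip(counts, fitted_means):
        lo, hi = inlier_bounds(mu, alpha)
        flags.append(y < lo or y > hi)
    return flags


# Illustrative data only: five counts, each with a fitted mean of 4.
flags = flag_outliers([3, 10, 0, 9, 100], [4.0] * 5, alpha=0.01)
```

With a fitted mean of 4 and alpha = 0.01, this sketch treats counts of 10 or more as outlying, because the upper tail from 10 onward carries a probability of roughly 0.008. All observations are checked in a single pass against the same fitted model, so the quality of the flags depends entirely on the estimator that supplied the means.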



Author information

Authors and Affiliations

Department of Statistics, University of Dortmund, D-44221, Dortmund, Germany
Sonja Kuhnt & Jörg Pawlitschko


Editor information

Editors and Affiliations

Institute of Business Administration and Economics, Brandenburg University of Technology Cottbus, Konrad-Wachsmann-Allee 1, 03046, Cottbus, Germany
Daniel Baier (Chair of Marketing and Innovation Management)
Department of Medical Biometrics, Charité Virchow-Klinikum, Humboldt University Berlin, 13344, Berlin, Germany
Klaus-Dieter Wernecke


About this paper

Cite this paper

Kuhnt, S., Pawlitschko, J. (2005). Outlier Identification Rules for Generalized Linear Models. In: Baier, D., Wernecke, KD. (eds) Innovations in Classification, Data Science, and Information Systems. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-26981-9_20


DOI: https://doi.org/10.1007/3-540-26981-9_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23221-6
Online ISBN: 978-3-540-26981-6
eBook Packages: Mathematics and Statistics (R0)
