Adaptive robust regression with continuous Gaussian scale mixture errors

Seo, Byungtae; Noh, Jungsik; Lee, Taewook; Yoon, Young Joo

doi:10.1016/j.jkss.2016.08.002

Adaptive robust regression with continuous Gaussian scale mixture errors

Published: 13 September 2016

Volume 46, pages 113–125, (2017)
Cite this article

Journal of the Korean Statistical Society Aims and scope Submit manuscript

Byungtae Seo¹,
Jungsik Noh²,
Taewook Lee³ &
…
Young Joo Yoon⁴

42 Accesses
7 Citations
Explore all metrics

Abstract

Model based regression analysis always requires a certain choice of models which typically specifies the behavior of regression errors. The normal distribution is the most common choice for this purpose, but the estimator under normality is known to be too sensitive to outliers. As an alternative, heavy tailed distributions such as t distributions have been suggested. Though this choice can reduce the sensitivity to outliers, it also requires the choice of distributions and tuning parameters for practical use. In this paper, we propose a class of continuous Gaussian scale mixtures for the error distribution that contains most symmetric unimodal probability distributions including normal, t, Laplace, and stable distributions. With this quite flexible class of error distributions, we provide the asymptotic property and robust property of the proposed method, and show its successes along with numerical examples.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Andrews, D. F., & Mallows, C. L. (1974). Scale mixtures of normal distributions. Journal of Royal Statistical Society, Series B, 36, 99–102.
MathSciNet MATH Google Scholar
Bartolucci, F., & Scaccia, L. (2005). The use of mixtures for dealing with non-normal regression errors. Computational Statistics and Data Analysis, 48, 821–834.
Article MathSciNet Google Scholar
Beaton, A. E., & Tukey, J. W. (1974). The fitting of power series, meaning polynomials, illustrated on band-spectroscopic data. Technometrics, 16, 147–185.
Article Google Scholar
Bellio, R., & Ventura, L. (2005). An introduction to robust estimaton with Rfuctions. In Proceedings of the 1st international workshop on robust statistics and R. Treviso: Department of Statistics, Ca’Foscari University (Venezia).
Google Scholar
Bickel, P. J., Klaassen, C. A. J., Ritov, Y., & Wellner, J. A. (1993). Johns Hopkins series in the mathematical sciences, Efficient and adaptive estimation for semiparametric models. Baltimore, MD: Johns Hopkins University Press.
Google Scholar
Brownlee, K. A. (1960). Statistical theory and methodology in science and engineering. New York: John Wiley.
MATH Google Scholar
Croux, C., Rousseeuw, P. J., & Hössjer, O. (1994). Generalized s-estimators. Journal of the American Statistical Association, 89, 1271–1281.
Article MathSciNet Google Scholar
Efron, B., & Olshen, R. A. (1978). How broad is the class of normal scale mixtures? Annals of Statistics, 6, 1159–1164.
Article MathSciNet Google Scholar
Ghosal, S., & van der Vaart, A. W. (2001). Entropies and rates of convergence for maximum likelihood and Bayes estimation for mixtures of normal densities. The Annals of Statistics, 29, 1233–1263.
Article MathSciNet Google Scholar
Hathaway, R. J. (1985). A constrained formulation of maximum-likelihood estimation for normal mixture distributions. Annals of Statistics, 13, 795–800.
Article MathSciNet Google Scholar
Holzmann, H., Munk, A., & Gneiting, T. (2006). Identifiability of finite mixtures of elliptical distributions. Scandinavian Journal of Statistics, 33, 753–763.
Article MathSciNet Google Scholar
Huber, P. J. (1964). Robust estimation of a location parameter. Annals of Mathematical Statistics, 35, 73–101.
Article MathSciNet Google Scholar
Huber, P.J. (1973). Robust regreesion: asymptotics, conjectures and monte carlo. Annals of Statistics, 1, 799–821.
Article MathSciNet Google Scholar
Kelker, D. (1971). Infinite divisibility and variance mixtures of the normal distribution. The Annals of Mathematical Statistics, 42, 802–808.
Article MathSciNet Google Scholar
Kiefer, J., & Wolfowitz, J. (1956). Consistency of the maximum likelihood estimator in the presence of infinitely many incidental parameters. Annals of Mathematical Statistics, 27, 886–906.
MathSciNet MATH Google Scholar
Krasker, W. S., & Welsch, R. E. (1982). Efficient bounded-influence regression estimation. Journal of the American Statistical Association, 77, 595–607.
Article MathSciNet Google Scholar
Lange, K. L, Little, R. J. A., & Taylor, J. M. G. (1989). Robust statistical modeling using the t distribution. Journal of the American Statistical Association, 84, 881–896.
MathSciNet Google Scholar
Lange, K., & Sinsheimer, J. S. (1993). Normal/independent distributions and their applications in robust regression. Journal of Computational and Graphical Statistics, 2, 175–198.
MathSciNet Google Scholar
Lesperance, M. L, & Kalbfleisch, J. D. (1992). An algorithm for computing the nonparametric MLE of a mixing distribution. Journal of the American Statistical Association, 87, 120–126.
Article Google Scholar
Lindsay, B. G. (1995). NSF-CBMS regional conference series in probability and statistics: vol 5. Mixture models: theory, geometry, and applications. US: IMS.
Google Scholar
Maronna, R. A., & Yohai, V. J. (1981). Asymtotic behavior of general m-estimates for regression and scale with random carries. Zeitschrift für Wahrscheinlichkeitstheorie und verwandte Gebiete, 58, 7–20.
Article MathSciNet Google Scholar
Neyman, J., & Scott, E. L. (1948). Consistent estimation from partially consistent observations. Econometrica, 16, 1–32.
Article MathSciNet Google Scholar
Rousseeuw, P. J., & Leroy, A. M. (1987). Robust regression and outlier detection. New York: Wiley.
Book Google Scholar
Ruppert, D., Cressie, N., & Carroll, R. J. (1989). A transformation/weighting model for estimating michaelis-menten parameters. Biometrics, 45, 637–656.
Article Google Scholar
Soffritti, G., & Galimberti, G. (2011). Multivariate linear regression with non-normal errors: a solution based on mixture models. Statistics and Computing, 21, 523–536.
Article MathSciNet Google Scholar
Stromberg, A. J. (1993). Computation of high breakdown nonlinear regression parameters. Journal of the American Statistical Association, 88, 237–244.
MATH Google Scholar
Van der Vaart, A. W. (1988). Estimating a real parameter in a class of semiparametric models. Annnals of Statistics, 16(4), 1450–1474.
Article MathSciNet Google Scholar
Van der Vaart, A. W. (1996). Efficient maximum likelihood estimation in semiparametric mixture models. Annals of Statistics, 24, 862–878.
Article MathSciNet Google Scholar
Van der Vaart, A. W., & Wellner, J. A. (1996). Springer series in statistics, Weak convergence and empirical processes. New York: Springer-Verlag, With applications to statistics.
MATH Google Scholar
Wang, Y. (2007). On fast computation of the non-parametric maximum likelihood estimate of a mixing distribution. Journal of the Royal Statistical Society, Series B, 185–198.
Article MathSciNet Google Scholar
West, M. (1984). Outlier models and prior distributions in bayesian liner regression. Journal of Royal Statistical Society, Series B, 46, 431–439.
MATH Google Scholar
West, M. (1987). On scale mixtures of normal distributions. Biometrika, 74, 646–648.
Article MathSciNet Google Scholar
Yohai, V.J., & Zamar, R. H. (1988). High breakdown point estimates of regression by means of the minimization of an efficient scale. Journal of the American Statistical Association, 83, 406–413.
Article MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Department of Statistics, Sungkyunkwan University, Seoul, 110-745, Korea
Byungtae Seo
Department of Bioinformatics, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
Jungsik Noh
Department of Statistics, Hankuk University of Foreign Studies, Yongin, 449-791, Korea
Taewook Lee
Department of Statistics, Daejeon University, Daejeon, 300-716, Korea
Young Joo Yoon

Authors

Byungtae Seo
View author publications
You can also search for this author in PubMed Google Scholar
Jungsik Noh
View author publications
You can also search for this author in PubMed Google Scholar
Taewook Lee
View author publications
You can also search for this author in PubMed Google Scholar
Young Joo Yoon
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Young Joo Yoon.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Seo, B., Noh, J., Lee, T. et al. Adaptive robust regression with continuous Gaussian scale mixture errors. J. Korean Stat. Soc. 46, 113–125 (2017). https://doi.org/10.1016/j.jkss.2016.08.002

Download citation

Received: 18 April 2016
Accepted: 25 August 2016
Published: 13 September 2016
Issue Date: March 2017
DOI: https://doi.org/10.1016/j.jkss.2016.08.002

AMS 2000 subject classifications

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Adaptive robust regression with continuous Gaussian scale mixture errors

Abstract

Access this article

Similar content being viewed by others

Violating the normality assumption may be the lesser of two evils

Minimizing robust density power-based divergences for general parametric density models

Check your outliers! An introduction to identifying statistical outliers in R with easystats

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

AMS 2000 subject classifications

Keywords

Navigation

Adaptive robust regression with continuous Gaussian scale mixture errors

Abstract

Access this article

Similar content being viewed by others

Violating the normality assumption may be the lesser of two evils

Minimizing robust density power-based divergences for general parametric density models

Check your outliers﻿! An introduction to identifying statistical outliers in R with easystats

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

AMS 2000 subject classifications

Keywords

Search

Navigation

Check your outliers! An introduction to identifying statistical outliers in R with easystats