Nonparametric Regression When Estimating the Probability of Success: A Comparison of Four Extant Estimators

Wilcox, Rand R.

doi:10.1080/15598608.2012.695639

Nonparametric Regression When Estimating the Probability of Success: A Comparison of Four Extant Estimators

Published: 01 September 2012

Volume 6, pages 443–451, (2012)
Cite this article

Journal of Statistical Theory and Practice Aims and scope Submit manuscript

Rand R. Wilcox¹

3 Accesses
3 Citations
Explore all metrics

Abstract

For the random variables Y, X ₁,..., X _p, where Y is binary, let M(x ₁,..., x _p) = P(Y = 1

(X₁,... X _p) = (x ₁,... x _p)). The article compares four smoothers aimed at estimating M(x ₁,...,x _p), three of which can be used when p > 1. Evidently there are no published comparisons of smoothers when p > 1 and Y is binary. One of the estimators stems from Hosmer and Lemeshow (1989, 85), which is limited to p = 1. A simple modification of this estimator (called method E3 here) is proposed that can be used when p > 1. Roughly, a weighted mean of the Y values is used, where the weights are based on a robust analog of Mahalanobis distance that replaces the usual covariance matrix with the minimum volume estimator. Another estimator stems from Signorini and Jones (1984) and is based in part on an estimate of the probability density function of X ₁,..., X _p. Here, an adaptive kernel density estimator is used. No estimator dominated in terms of mean squared error and bias. And for p = 1, the differences among three of the estimators, in terms of mean squared error and bias, are not particularly striking. But for p > 1, differences among the estimators are magnified, with method E3 performing relatively well. An estimator based on the running interval smoother performs about as well as E3, but for general use, E3 is found to be preferable. The estimator studied by Signorini and Jones (1984) is not recommended, particularly when p > 1.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Recognize the Value of the Sum Score, Psychometrics’ Greatest Accomplishment

Article Open access 01 March 2024

Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations

Article Open access 01 April 2016

Estimating power in (generalized) linear mixed models: An open introduction and tutorial in R

Article Open access 05 May 2021

References

Cleveland, W. S. 1979. Robust locally weighted regression and smoothing scatterplots. J. Am. Stat. Assoc., 74, 829–836.
Article MathSciNet Google Scholar
Efromovich, S. 1999. Nonparametric curve estimation: Methods, theory and applications. NewYork, Springer-Verlag.
MATH Google Scholar
Eubank, R. L. 1999. Nonparametric regression and spline smoothing. New York, Marcel Dekker.
MATH Google Scholar
Fan, J. 1993. Local linear smoothers and their minimax efficiencies. Ann. Stat., 21, 196–216.
Article MathSciNet Google Scholar
Fan, J., and I. Gijbels. 1996. Local polynomial modeling and its applications. Boca Raton, FL, CRC Press.
MATH Google Scholar
Fox, J. 2001. Multiple and generalized nonparametric regression. Thousands Oaks, CA, Sage.
MATH Google Scholar
Green, P. J., and B. W. Silverman. 1993. Nonparametric regression and generalized linear models: A roughness penalty approach. Boca Raton, FL, CRC Press.
MATH Google Scholar
Györfi, L., M. Kohler, A. Krzyzk, and H. Walk. 2002. A distribution-free theory of nonparametric regression. New York, Springer Verlag.
Book Google Scholar
Härdle, W. 1990. Applied nonparametric regression. Econometric Society Monographs No. 19. Cambridge, UK, Cambridge University Press.
Book Google Scholar
Hastie, T. J., and R. J. Tibshirani. 1990. Generalized additive models. New York, Chapman and Hall.
MATH Google Scholar
Hoaglin, D. C. 1985. Summarizing shape numerically: The g-and-h distributions. In Exploring data tables, trends, and shapes, ed. D. Hoaglin, F. Mosteller, and J. Tukey, 461–515. New York, Wiley.
MATH Google Scholar
Hosmer, D. W., and S. Lemeshow. 1989. Applied logistic regression. New York, Wiley.
MATH Google Scholar
Kay, R. and S. Little. 1987. Transformation of the explanatory variables in the logistic regression model for binary data. Biometrika, 74, 495–501.
Article MathSciNet Google Scholar
Rousseeuw, P. J., and A. M. Leroy. 1987. Robust regression & outlier detection. New York, Wiley.
Book Google Scholar
Signorini, D. F., and M. C. Jones. 2004. Kernel estimators for univariate binary regression. J. Am. Stat. Assoc., 99, 119–126.
Article MathSciNet Google Scholar
Silverman, B. W. 1986. Density estimation for statistics and data analysis. New York, Chapman and Hall.
Book Google Scholar
Wilcox, R. R. 2005. Introduction to robust estimation and hypothesis testing, 2nd ed. San Diego, CA, Academic Press.
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Department of Psychology, University of Southern California, SGM 1061, Los Angeles, California, 90089-1061, USA
Rand R. Wilcox

Authors

Rand R. Wilcox
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Rand R. Wilcox.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wilcox, R.R. Nonparametric Regression When Estimating the Probability of Success: A Comparison of Four Extant Estimators. J Stat Theory Pract 6, 443–451 (2012). https://doi.org/10.1080/15598608.2012.695639

Download citation

Received: 02 September 2011
Revised: 01 February 2012
Published: 01 September 2012
Issue Date: September 2012
DOI: https://doi.org/10.1080/15598608.2012.695639

AMS Subject Classification

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Nonparametric Regression When Estimating the Probability of Success: A Comparison of Four Extant Estimators

Abstract

Access this article

Similar content being viewed by others

Recognize the Value of the Sum Score, Psychometrics’ Greatest Accomplishment

Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations

Estimating power in (generalized) linear mixed models: An open introduction and tutorial in R

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

AMS Subject Classification

Keywords

Navigation

Nonparametric Regression When Estimating the Probability of Success: A Comparison of Four Extant Estimators

Abstract

Access this article

Similar content being viewed by others

Recognize the Value of the Sum Score, Psychometrics’ Greatest Accomplishment

Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations

Estimating power in (generalized) linear mixed models: An open introduction and tutorial in R

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

AMS Subject Classification

Keywords

Search

Navigation