Abstract
In multinomial logit models, the identifiability of parameter estimates is typically obtained by side constraints that specify one of the response categories as reference category. When parameters are penalized, shrinkage of estimates should not depend on the reference category. In this paper we investigate ridge regression for the multinomial logit model with symmetric side constraints, which yields parameter estimates that are independent of the reference category. In simulation studies the results are compared with the usual maximum likelihood estimates and an application to real data is given.
Similar content being viewed by others
References
Anderson JA (1984) Regression and ordered categorical variables (with discussion). J R Statis Soc Series B 46:1–30
Bühlmann P, Hothorn T (2007) Boosting algorithms: regularization, prediction and model fitting (with discussion). Stat Sci 22:477–505
Fan J, Li R (2001) Variable selection via nonconcave penalized likelihood and its oracle properties. J Am Stat Assoc 96:1348–1360
Friedman J, Hastie T, Tibshirani R (2010) Regularization paths for generalized linear models via coordinate descent. J Stat Softw 33(1):1–22
Hoerl E, Kennard RW (1970) Ridge regression: biased estimation for nonorthogonal problems. Technometrics 12:55–67
James G, Radchenko P (2009) A generalized dantzig selector with shrinkage tuning. Biometrika 96:323–337
Krishnapuram B, Carin L, Figueiredo MA, Hartemink AJ (2005) Sparse multinomial logistic regression: fast algorithms and generalization bounds. IEEE Trans Pattern Anal Mach Intell 27:957–968
LeCessie S, Houwelingen V (1992) Ridge estimators in logistic regression. Appl Stat 41:191–201
Nyquist H (1991) Restricted estimation of generalized linear models. Appl Stat 40:133–141
Park MY, Hastie T (2007) L1-regularization path algorithm for generalized linear models. J R Stat Soc B 69:659–677
Raza MA (2008) Risk factors associated with injection initiation among drug users: comparing hierarchical modeling with traditional logistic regression. Master’s thesis, College of Statistical and Actuarial Sciences, University of the Punjab, Lahore
Schaefer R (1986) Alternative estimators in logistic regression when the data are collinear. J Stat Comput Simul 25:75–91
Schaefer R, Roi L, Wolfe R (1984) A ridge logistic estimator. Commun Stat Theory Methods 13:99–113
Segerstedt B (1992) On ordinary ridge regression in generalized linear models. Commun Stat Theory Methods 21:2227–2246
Tibshirani R (1996) Regression shrinkage and selection via lasso. J R Stat Soc B 58:267–288
Tutz G, Binder H (2006) Generalized additive modelling with implicit variable selection by likelihood based boosting. Biometrics 62:961–971
Zhu J, Hastie T (2004) Classification of gene microarrays by penalized logistic regression. Biostatistics 5:427–443
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Zahid, F.M., Tutz, G. Ridge estimation for multinomial logit models with symmetric side constraints. Comput Stat 28, 1017–1034 (2013). https://doi.org/10.1007/s00180-012-0341-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00180-012-0341-1