Abstract
The problem of multicollinearity associated with the estimation of a functional logit model can be solved by using as predictor variables a set of functional principal components. The functional parameter estimated by functional principal component logit regression is often nonsmooth and then difficult to interpret. To solve this problem, different penalized spline estimations of the functional logit model are proposed in this paper. All of them are based on smoothed functional PCA and/or a discrete penalty in the log-likelihood criterion in terms of B-spline expansions of the sample curves and the functional parameter. The ability of these smoothing approaches to provide an accurate estimation of the functional parameter and their classification performance with respect to unpenalized functional PCA and LDA-PLS are evaluated via simulation and application to real data. Leave-one-out cross-validation and generalized cross-validation are adapted to select the smoothing parameter and the number of principal components or basis functions associated with the considered approaches.
Similar content being viewed by others
References
Agresti A (1990) Categorical data analysis. Wiley, New York
Aguilera A, Escabias M, Preda C, Saporta G (2010) Using basis expansion for estimating functional pls regression. Applications with chemometric data. Chemom Intell Lab Syst 104:289–305
Aguilera A, Escabias M, Valderrama M (2008) Discussion of different logistic models with functional data. Application to systemic lupus erythematosus. Comput Stat Data Anal 53(1):151–163
Aguilera A, Gutiérrez R, Ocaña F, Valderrama M (1995) Computational approaches to estimation in the principal component analysis of a stochastic process. Appl Stoch Models Data Anal 11(4):279–299
Aguilera A, Gutiérrez R, Ocaña F, Valderrama M (1996) Approximation of estimators in the pca of a stochastic process using B-splines. Commun Stat, Simul Comput 25(3):671–690
Breiman L, Friedman J, Olshen R, Stone C (1984) Classification and regression trees. Wadswort
Cardot H, Crambes C, Kneip A, Sarda P (2007) Smoothing splines estimators in functional linear regression with errors-in-variables. Comput Stat Data Anal 51:4832–4848
Cardot H, Ferraty F, Sarda P (2003) Spline estimators for the functional linear model. Stat Sin 13:571–591
Cardot H, Sarda P (2005) Estimation in generalized linear models for functional data via penalized likelihood. J Multivar Anal 92(1):24–41
Craven P, Wahba G (1979) Smoothing noisy data with spline functions—estimating the correct degree of smoothing by the method of generalized cross-validation. Numer Math 31(4):377–403
Currie I, Durban M (2002) Flexible smoothing with P-splines: a unified approach. Stat Model 2(4):333–349
De Boor C (2001) A practical guide to splines, revised edn. Springer, Berlin
Eilers P, Marx B (1996) Flexible smoothing with B-splines and penalties. Stat Sci 11(2):89–121
Eilers P, Marx B (2002) Generalized linear additive smooth structures. J Comput Graph Stat 11(4):758–783
Escabias M, Aguilera A, Valderrama M (2004) Principal component estimation of functional logistic regression: discussion of two different approaches. J Nonparametr Stat 16(3–4):365–384
Escabias M, Aguilera A, Valderrama M (2005) Modeling environmental data by functional principal component logistic regression. Environmetrics 16(1):95–107
Escabias M, Aguilera A, Valderrama M (2007) Functional pls logit regression model. Comput Stat Data Anal 51(10):4891–4902
Ferraty F, Vieu P (2003) Curves discrimination: a nonparametric functional approach. Comput Stat Data Anal 44:161–173
Ferraty F, Vieu P (2006) Nonparametric functional data analysis. Springer, Berlin
Harezlak J, Coull B, Laird N, Magari S, Christiani D (2007) Penalized solutions to functional regression problems. Comput Stat Data Anal 51(10):4911–4925
Hastie T, Buja A, Tibshirani R (1994) Flexible discriminant analysis by optimal scoring. J Am Stat Assoc 89(428):1255–1270
Hastie T, Tibshirani R (1990) Generalized additive models. Chapman & Hall, London
Hastie T, Tibshirani R (1993) Varying-coefficient models (with discussion). J R Stat Soc B 55:757–796
James G (2002) Generalized linear models with functional predictors. J R Stat Soc B 64(3):411–432
Le Cessie S, Van Houwelingen J (1992) Ridge estimators in logistic regression. Appl Stat 41(1):191–201
van der Linde A (2008) Variational Bayesian functional pca. Comput Stat Data Anal 53(2):517–533
Marx B, Eilers P (1999) Generalized linear regression on sampled signals and curves. A p-spline approach. Technometrics 41(1):1–13
Müller H (2005) Functional modelling and classification of longitudinal data. Board of the Foundation of Scand J Stat 32:223–240
Ocaña F, Aguilera A, Escabias M (2007) Computational considerations in functional principal component analysis. Comput Stat 22(3):449–465
Ocaña F, Aguilera A, Valderrama M (2008) Estimation of functional regression models for functional responses by wavelet approximations. In: Dabo-Niang S, Ferraty F (eds) Functional and operatorial statistics. Physica-Verlag, Heidelberg, pp 15–22
O’Sullivan F (1986) A statistical perspective on ill-posed inverse problems. Stat Sci 1:505–527
Preda C, Saporta G, Lévéder C (2007) Pls classification for functional data. Comput Stat 22:223–235
Ramsay JO, Silverman BW (2002) Applied functional data analysis: methods and case studies. Springer, Berlin
Ramsay JO, Silverman BW (2005) Functional data analysis. Springer, Berlin
Ratcliffe S, Heller G, Leader L (2002) Functional data analysis with application to periodically stimulated foetal heart rate data. II: Functional logistic regression. Stat Med 21:1115–1127
Reinsch C (1967) Smoothing by spline functions. Numer Math 10:177–183
Reiss P, Ogden R (2007) Functional principal component regression and functional partial least squares. J Am Stat Assoc 102(479):984–996
Ruppert D (2002) Selecting the number of knots for penalized splines. J Comput Graph Stat 11(4):735–757
Acknowledgements
This research has been funded by project P11-FQM-8068 from Consejería de Innovación, Ciencia y Empresa. Junta de Andalucía, Spain and project MTM2010-20502 from Dirección General de Investigación, Ministerio de Educación y Ciencia, Spain. The authors would like to thank Caroline Lévéder for providing us with the data from Danone.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Aguilera-Morillo, M.C., Aguilera, A.M., Escabias, M. et al. Penalized spline approaches for functional logit regression. TEST 22, 251–277 (2013). https://doi.org/10.1007/s11749-012-0307-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11749-012-0307-1