Abstract
Generalized cross-validation (GCV) is a popular parameter selection criterion for spline smoothing of noisy data, but it sometimes yields a severely undersmoothed estimate, especially if the sample size is small. Robust GCV (RGCV) and modified GCV are stable extensions of GCV, with the degree of stabilization depending on a parameter \(\gamma \in (0,1)\) for RGCV and on a parameter \(\rho >1\) for modified GCV. While there are favorable asymptotic results about the performance of RGCV and modified GCV, little is known for finite samples. In a large simulation study with cubic splines, we investigate the behavior of the optimal values of \(\gamma \) and \(\rho \), and identify simple practical rules to choose them that are close to optimal. With these rules, both RGCV and modified GCV perform significantly better than GCV. The performance is defined in terms of the Sobolev error, which is shown by example to be more consistent with a visual assessment of the fit than the prediction error (average squared error). The results are consistent with known asymptotic results.
Similar content being viewed by others
References
Anselone PM, Laurent PJ (1968) A general method for the construction of interpolating or smoothing spline-functions. Numer Math 12:66–82
Cox DD (1984) Gaussian approximation of smoothing splines. Tech. rep., Dept. Statist., University of Wisconsin/Madison
Craven P, Wahba G (1979) Smoothing noisy data with spline functions: estimating the correct degree of smoothing by the method of generalized cross-validation. Numer Math 31:377–403
Cummins DJ, Filloon TG, Nychka D (2001) Confidence intervals for nonparametric curve estimates: toward more uniform pointwise coverage. J Am Stat Assoc 96:233–246
de Hoog FR, Anderssen RS, Lukas MA (2011) Differentiation of matrix functionals using triangular factorization. Math Comput 80(275):1585–1600
Efron B (2001) Selection criteria for scatterplot smoothers. Ann Stat 29:470–504
Eubank RL (1988) Spline smoothing and nonparametric regression. Dekker, New York
Girard DA (2010) Estimating the accuracy of (local) cross-validation via randomised GCV choices in kernel or smoothing spline regression. J Nonparametric Stat 22:41–64
Green PJ, Silverman BW (1994) Nonparametric regression and generalized linear models: a roughness penalty approach. Chapman & Hall, London
Gu C (2002) Smoothing spline ANOVA models. Springer, New York
Hutchinson MF, de Hoog FR (1985) Smoothing noisy data with spline functions. Numer Math 47:99–106
Kim YJ, Gu C (2004) Smoothing spline Gaussian regression: more scalable computation via efficient approximation. J R Stat Soc Ser B 66:337–356
Li KC (1986) Asymptotic optimality of \(C_L\) and generalized cross-validation in ridge regression with application to spline smoothing. Ann Stat 14:1101–1112
Lukas MA (2006) Robust generalized cross-validation for choosing the regularization parameter. Inverse Probl 22:1883–1902
Lukas MA (2008) Strong robust generalized cross-validation for choosing the regularization parameter. Inverse Probl 24(034):006
Lukas MA (2014) Performance criteria and discrimination of extreme undersmoothing in nonparametric regression. J Stat Plan Inference 153:56–74
Lukas MA, de Hoog FR, Anderssen RS (2010) Efficient algorithms for robust generalized cross-validation spline smoothing. J Comput Appl Math 235:102–107
Lukas MA, de Hoog FR, Anderssen RS (2012) Performance of robust GCV and modified GCV for spline smoothing. Scand J Stat 39:97–115
Nychka D (1990) The average posterior variance of a smoothing spline and a consistent estimate of the average squared error. Ann Stat 18:415–428
Reinsch CH (1967) Smoothing by spline functions. Numer Math 10:177–183
Reinsch CH (1971) Smoothing by spline functions II. Numer Math 16:451–454
Robinson T, Moyeed R (1989) Making robust the cross-validatory choice of smoothing parameter in spline smoothing regression. Commun Stat Theory Methods 18:523–539
Wahba G (1990) Spline models for observational data. SIAM, Philadelphia
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Lukas, M.A., de Hoog, F.R. & Anderssen, R.S. Practical use of robust GCV and modified GCV for spline smoothing. Comput Stat 31, 269–289 (2016). https://doi.org/10.1007/s00180-015-0577-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00180-015-0577-7