Smoothing noisy data with spline functions
 Peter Craven,
 Grace Wahba
 … show all 2 hide
Rent the article at a discount
Rent now* Final gross prices may vary according to local VAT.
Get AccessSummary
Smoothing splines are well known to provide nice curves which smooth discrete, noisy data. We obtain a practical, effective method for estimating the optimum amount of smoothing from the data. Derivatives can be estimated from the data by differentiating the resulting (nearly) optimally smoothed spline.
We consider the modely _{ i }(t _{ i })+ε_{ i },i=1, 2, ...,n,t _{i}∈[0, 1], whereg∈W _{2} ^{(m)} ={f:f,f′, ...,f ^{(m−1)} abs. cont.,f ^{(m)}∈ℒ_{2}[0,1]}, and the {ε_{ i }} are random errors withEε_{ i }=0,Eε_{ i }ε_{ j }=σ^{2}δ_{ ij }. The error variance σ^{2} may be unknown. As an estimate ofg we take the solutiong _{n, λ} to the problem: Findf∈W _{2} ^{(m)} to minimize \(\frac{1}{n}\sum\limits_{j = 1}^n {(f(t_j )  y_j )^2 + \lambda \int\limits_0^1 {(f^{(m)} (u))^2 du} }\) . The functiong _{n, λ} is a smoothing polynomial spline of degree 2m−1. The parameter λ controls the tradeoff between the “roughness” of the solution, as measured by \(\int\limits_0^1 {[f^{(m)} (u)]^2 du}\) , and the infidelity to the data as measured by \(\frac{1}{n}\sum\limits_{j = 1}^n {(f(t_j )  y_j )^2 }\) , and so governs the average square errorR(λ; g)=R(λ) defined by $$R(\lambda ) = \frac{1}{n}\sum\limits_{j = 1}^n {(g_{n,\lambda } (t_j )  g(t_j ))^2 }$$ . We provide an estimate \(\hat \lambda\) , called the generalized crossvalidation estimate, for the minimizer ofR(λ). The estimate \(\hat \lambda\) is the minimizer ofV(λ) defined by \(V(\lambda ) = \frac{1}{n}\parallel (I  A(\lambda ))y\parallel ^2 /\left[ {\frac{1}{n}{\text{Trace(}}I  A(\lambda ))} \right]^2\) , wherey=(y _{1}, ...,y _{n})^{t} andA(λ) is then×n matrix satisfying(g _{n, λ} (t _{1}), ...,g _{n, λ} (t _{n}))^{t}=A (λ) y. We prove that there exist a sequence of minimizers \(\tilde \lambda = \tilde \lambda (n)\) ofEV(λ), such that as the (regular) mesh{t _{i}} _{i=1} ^{n} becomes finer, \(\mathop {\lim }\limits_{n \to \infty } ER(\tilde \lambda )/\mathop {\min }\limits_\lambda ER(\lambda ) \downarrow 1\) . A Monte Carlo experiment with several smoothg's was tried withm=2,n=50 and several values of σ^{2}, and typical values of \(R(\hat \lambda )/\mathop {\min }\limits_\lambda R(\lambda )\) were found to be in the range 1.01–1.4. The derivativeg′ ofg can be estimated by \(g'_{n,\hat \lambda } (t)\) . In the Monte Carlo examples tried, the minimizer of \(R_D (\lambda ) = \frac{1}{n}\sum\limits_{j = 1}^n {(g'_{n,\lambda } (t_j )  } g'(t_j ))\) tended to be close to the minimizer ofR(λ), so that \(\hat \lambda\) was also a good value of the smoothing parameter for estimating the derivative.
 Abramowitz, M., Stegun, I. (1964) Handbook of mathematical functions with formulas, graphs, and mathematical tables. U.S. Department of Commerce, National Bureau of Standards Applied Mathematics Series 55: pp. 803819
 Aronszajn, N. (1950) Theory of reproducing kernels. Trans. Amer. Math. Soc. 68: pp. 337404
 Golomb, M. (1968) Approximation by periodic spline interpolants on uniform meshes. J. Approximation Theory 1: pp. 2665
 Golub, G., Heath, M., Wahba, G.: Generalized cross validation as a method for choosing a good ridge parameter, to appear, Technometrics
 Golub, G., Reinsch, C. (1970) Singular value decomposition and least squares solutions. Numer. Math. 14: pp. 403420
 Hudson, H.M. (1974) Empirical Bayes estimation. Stanford University, Department of Statistics, Stanford, Cal.
 Kimeldorf, G., Wahba, G. (1970) A correspondence between Bayesian estimation on stochastic processes and smoothing by splines. Ann. Inst. Statist. Math. 41: pp. 495502
 Mallows, C.L. (1973) Some comments onC p. Technometrics 15: pp. 661675
 Reinsch, C.M. (1967) Smoothing by spline functions. Numer. Math. 10: pp. 177183
 Reinsch, C.M. (1971) Smoothing by spline functions, II. Numer. Math. 16: pp. 451454
 Schoenberg, I.J. (1964) Spline functions and the problem of graduation. Proc. Nat. Acad. Sci. (USA) 52: pp. 947950
 Wahba, G. (1973) Convergence rates for certain approximate solutions to first kind integral equations. J. Approximation Theory 7: pp. 167185
 Wahba, G. (1975) Smoothing noisy data with spline functions. Numer. Math. 24: pp. 383393
 Wahba, G. (1977) Practical approximate solutions to linear operator equations when the data are noisy. SIAM J. Numer. Anal. 14: pp. 651667
 Wahba, G., Wold, S. (1975) A completely automatic French curve: Fitting spline functions by crossvalidation. Comm. Statist. 4: pp. 117
 Wahba, G., Wold, S. (1975) Periodic splines for spectral density estimation: The use of crossvalidation for determining the degree of smoothing. Comm. Statist. 4: pp. 125141
 Wahba, G.: A survey of some smoothing problems and the method of generalized cross validation for solving them. University of WisconsinMadison, Statistics Dept., Technical Report #457. In: Proceedings of the Conference on Applications of Statistics, Dayton, Ohio (P.R. Krishnaiah, ed.) June 14–18, 1976
 Wahba, G.: Improper priors, spline smoothing and the problem of guarding against model errors in regression. J. Roy. Statist. Soc., Ser. B. To appear
 Title
 Smoothing noisy data with spline functions
 Journal

Numerische Mathematik
Volume 31, Issue 4 , pp 377403
 Cover Date
 19781201
 DOI
 10.1007/BF01404567
 Print ISSN
 0029599X
 Online ISSN
 09453245
 Publisher
 SpringerVerlag
 Additional Links
 Topics
 Keywords

 MOS:65D10
 CR:5.17
 MOS:65D25
 Industry Sectors
 Authors

 Peter Craven ^{(1)}
 Grace Wahba ^{(2)}
 Author Affiliations

 1. The Computer Laboratory, The University of Liverpool, Liverpool, England
 2. Department of Statistics, University of Wisconsin, 53706, Madison, WI, USA