
SiZer for smoothing splines


Abstract

Smoothing splines are an attractive method for scatterplot smoothing. The SiZer approach to statistical inference is adapted to this smoothing method; the resulting procedure is named SiZerSS. This allows quick and reliable inference, when smoothing splines are used for data analysis, as to “which features in the smooth are really there” as opposed to “which are due to sampling artifacts”. Applications of SiZerSS to mode, linearity, quadraticity, and monotonicity tests are illustrated using a real data example. Some small-scale simulations are presented to demonstrate that SiZerSS and SiZerLL (the original local linear version of SiZer) often give similar performance in exploring data structure, but they cannot completely replace each other.


References

  1. Chaudhuri, P. and Marron, J. S. (1999), SiZer for exploration of structure in curves, Journal of the American Statistical Association, 94, 807–823.

  2. Eubank, R. L. (2000), Nonparametric Regression and Spline Smoothing, Marcel Dekker, New York.

  3. Fan, J. and Gijbels, I. (1996), Local Polynomial Modelling and Its Applications, Chapman and Hall, London.

  4. Fan, J. and Marron, J. S. (1994), Fast implementations of nonparametric curve estimators, Journal of Computational and Graphical Statistics, 3, 35–56.

  5. Green, P. J. and Silverman, B. W. (1994), Nonparametric Regression and Generalized Linear Models, Chapman and Hall, London.

  6. Härdle, W. (1990), Applied Nonparametric Regression, Cambridge University Press, Boston.

  7. Hastie, T. J. and Tibshirani, R. J. (1990), Generalized Additive Models, Chapman and Hall, London.

  8. Loader, C. (1999), Local Regression and Likelihood, Springer Verlag, Berlin.

  9. Marron, J. S. (1996), A personal view of smoothing and statistics, in Statistical Theory and Computational Aspects of Smoothing, eds. W. Härdle and M. Schimek, 1–9 (with discussion, and rejoinder 103–112).

  10. Silverman, B. W. (1984), Spline smoothing: the equivalent kernel method, Annals of Statistics, 12, 898–916.

  11. Wahba, G. (1990), Spline Models for Observational Data, SIAM, Philadelphia.

  12. Wand, M. P. and Jones, M. C. (1995), Kernel Smoothing, Chapman and Hall, London.


Acknowledgements

Marron’s research was supported by the Department of Statistics and Applied Probability, National University of Singapore, and by National Science Foundation Grant DMS-9971649. Zhang’s research was supported by National University of Singapore Academic Research Grant R-155-000-023-112. We thank the Editor, the Associate Editor, and the referees for their invaluable comments and suggestions, which helped improve the article significantly.

Appendix: Derivations of (5), (6), and (7)

First of all, assume that \(X_1, X_2, \cdots, X_n\) have been sorted so that \(X_1 < X_2 < \cdots < X_n\). Write \(f_i = f(X_i)\) and \(\gamma_i = f''(X_i)\) for the values of \(f(x)\) and \(f''(x)\) at \(X_i\), \(i = 1, 2, \cdots, n\). Define \(\mathbf{f} = (f_1, \cdots, f_n)^T\) and \(\gamma = (\gamma_2, \cdots, \gamma_{n-1})^T\). Let \(h_i = X_{i+1} - X_i\), \(i = 1, 2, \cdots, n-1\). Let \(Q: n \times (n-2)\), \(R: (n-2) \times (n-2)\), and \(K: n \times n\) be the matrices as defined in Green and Silverman (1994, pages 12–13). According to Theorem 2.1 of Green and Silverman (1994, page 13), \(f\) is a natural cubic spline with knots at \(X_i\), \(i = 1, 2, \cdots, n\), if and only if

$$K=Q R^{-1} Q^{T}, \quad \gamma=R^{-1} Q^{T} \mathbf{f}, \quad \int f^{\prime \prime}(x)^{2} d x=\mathbf{f}^{T} K \mathbf{f}.$$
(11)
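
To make the band structure concrete, here is a minimal NumPy sketch that builds \(Q\), \(R\), and \(K\) from the sorted knots following the definitions in Green and Silverman (1994, pages 12–13); the function name and the dense-matrix representation are our own illustrative choices, not part of the original derivation.

```python
import numpy as np

def qrk_matrices(x):
    """Build Q (n x (n-2)), R ((n-2) x (n-2)), and K = Q R^{-1} Q^T for
    sorted knots x, per Green and Silverman (1994, pp. 12-13)."""
    x = np.asarray(x, float)
    n = len(x)
    h = np.diff(x)                          # h_i = X_{i+1} - X_i
    Q = np.zeros((n, n - 2))
    R = np.zeros((n - 2, n - 2))
    for c in range(n - 2):                  # column c <-> interior knot j = c + 2
        Q[c,     c] = 1.0 / h[c]
        Q[c + 1, c] = -1.0 / h[c] - 1.0 / h[c + 1]
        Q[c + 2, c] = 1.0 / h[c + 1]
        R[c, c] = (h[c] + h[c + 1]) / 3.0
        if c < n - 3:
            R[c, c + 1] = R[c + 1, c] = h[c + 1] / 6.0
    K = Q @ np.linalg.solve(R, Q.T)         # K = Q R^{-1} Q^T, as in (11)
    return Q, R, K
```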

A simple calculation then leads to the desired formula:

$$\hat{\mathbf{f}}=(W+\lambda K)^{-1} W \mathbf{Y} \equiv A_{\lambda} \mathbf{Y},$$
(12)

with the weight matrix \(W = \operatorname{diag}(w_1, w_2, \cdots, w_n)\), the hat matrix \(A_{\lambda} = (W + \lambda K)^{-1} W\), and the response vector \(\mathbf{Y} = (Y_1, Y_2, \cdots, Y_n)^T\).
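
A direct (dense) implementation of (12) might look as follows; for clarity it solves the full linear system, whereas the banded structure of \(W + \lambda K\) admits an \(O(n)\) algorithm (Green and Silverman 1994). The function name and the default of equal weights are our own choices.

```python
def smoothing_spline_fit(x, y, lam, w=None):
    """Fitted values at the knots via (12): f_hat = (W + lam K)^{-1} W Y.
    `w` holds the diagonal of W; equal weights are used by default."""
    n = len(x)
    W = np.diag(np.ones(n) if w is None else np.asarray(w, float))
    _, _, K = qrk_matrices(x)
    A = np.linalg.solve(W + lam * K, W)     # hat matrix A_lambda
    return A @ np.asarray(y, float), A
```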

Using (11) and (12), we are now ready to give the matrix formulas for computing \(\hat{f}_{\lambda}(x)\), \(\hat{f}_{\lambda}^{\prime}(x)\), and \(\widehat{sd}\left\{\hat{f}_{\lambda}^{\prime}(x)\right\}\) at a given grid of locations \(\mathbf{x} = [x_1, x_2, \cdots, x_N]^T\). By Green and Silverman (1994, pages 22–23), for any \(x\) we can write \(\hat{f}(x)\) and \(\hat{f}^{\prime}(x)\) as linear combinations of \(\hat{\mathbf{f}}\) and \(\hat{\gamma}\). Let \(h_i(x) = x - X_i\), \(i = 1, 2, \cdots, n\). When \(x < X_1\),

$$\hat{f}(x)=\hat{f}_{1}+h_{1}(x)\left\{\frac{\hat{f}_{2}-\hat{f}_{1}}{h_{1}}-\frac{h_{1}}{6} \hat{\gamma}_{2}\right\}, \quad \hat{f}^{\prime}(x)=\frac{\hat{f}_{2}-\hat{f}_{1}}{h_{1}}-\frac{h_{1}}{6} \hat{\gamma}_{2}.$$

When \(X_i \leq x \leq X_{i+1}\) for some \(i = 1, 2, \cdots, n-1\), let \(\delta_{i}(x)=[1+\frac{h_{i}(x)}{h_{i}}] \hat{\gamma}_{i+1}+[1-\frac{h_{i+1}(x)}{h_{i}}] \hat{\gamma}_{i}\); then

$$\begin{aligned} \hat{f}(x) &=\frac{h_{i}(x) \hat{f}_{i+1}-h_{i+1}(x) \hat{f}_{i}}{h_{i}}+\frac{h_{i}(x) h_{i+1}(x) \delta_{i}(x)}{6}, \\ \hat{f}^{\prime}(x) &=\frac{\hat{f}_{i+1}-\hat{f}_{i}}{h_{i}}+\frac{h_{i}(x) h_{i+1}(x)(\hat{\gamma}_{i+1}-\hat{\gamma}_{i})}{6 h_{i}}+\frac{\left[h_{i}(x)+h_{i+1}(x)\right] \delta_{i}(x)}{6}. \end{aligned}$$

When \(x > X_n\),

$$\hat{f}(x)=\hat{f}_{n}+h_{n}(x)\left\{\frac{\hat{f}_{n}-\hat{f}_{n-1}}{h_{n-1}}+\frac{h_{n-1}}{6} \hat{\gamma}_{n-1}\right\},$$
$$\hat{f}^{\prime}(x)=\frac{\hat{f}_{n}-\hat{f}_{n-1}}{h_{n-1}}+\frac{h_{n-1}}{6} \hat{\gamma}_{n-1},$$
matching the left-boundary case by symmetry.
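
The three cases above translate directly into code. In the sketch below, `gam_hat` holds the interior second derivatives \(\hat{\gamma}_2, \cdots, \hat{\gamma}_{n-1}\); the padding \(\hat{\gamma}_1 = \hat{\gamma}_n = 0\) reflects the natural boundary conditions, and the function name is ours.

```python
def eval_spline(x0, knots, f_hat, gam_hat):
    """Evaluate (f_hat(x0), f_hat'(x0)) from the knot values f_hat and the
    interior second derivatives gam_hat = (gamma_2, ..., gamma_{n-1})."""
    knots, f_hat = np.asarray(knots, float), np.asarray(f_hat, float)
    n = len(knots)
    g = np.zeros(n)
    g[1:n - 1] = gam_hat                 # natural spline: gamma_1 = gamma_n = 0
    h = np.diff(knots)
    if x0 < knots[0]:                    # linear extrapolation to the left
        slope = (f_hat[1] - f_hat[0]) / h[0] - h[0] * g[1] / 6.0
        return f_hat[0] + (x0 - knots[0]) * slope, slope
    if x0 > knots[-1]:                   # linear extrapolation to the right
        slope = (f_hat[-1] - f_hat[-2]) / h[-1] + h[-1] * g[-2] / 6.0
        return f_hat[-1] + (x0 - knots[-1]) * slope, slope
    i = min(np.searchsorted(knots, x0, side='right') - 1, n - 2)
    hi_x, hip1_x = x0 - knots[i], x0 - knots[i + 1]   # h_i(x), h_{i+1}(x)
    delta = (1 + hi_x / h[i]) * g[i + 1] + (1 - hip1_x / h[i]) * g[i]
    val = (hi_x * f_hat[i + 1] - hip1_x * f_hat[i]) / h[i] \
        + hi_x * hip1_x * delta / 6.0
    der = (f_hat[i + 1] - f_hat[i]) / h[i] \
        + hi_x * hip1_x * (g[i + 1] - g[i]) / (6.0 * h[i]) \
        + (hi_x + hip1_x) * delta / 6.0
    return val, der
```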

It follows that \(\hat{f}(x)\) and \(\hat{f}^{\prime}(x)\) can be written respectively as \(c^{T} \hat{\mathbf{f}}-d^{T} \hat{\gamma}\) and \(\tilde{c}^{T} \hat{\mathbf{f}}-\tilde{d}^{T} \hat{\gamma}\), where \(c, \tilde{c}, d\), and \(\tilde{d}\) are coefficient vectors depending only on \(x\) and \(X_1, X_2, \cdots, X_n\). Write \(\hat{f}\left(x_{i}\right)=c_{i}^{T} \hat{\mathbf{f}}-d_{i}^{T} \hat{\gamma}\) and \(\hat{f}^{\prime}\left(x_{i}\right)=\tilde{c}_{i}^{T} \hat{\mathbf{f}}-\tilde{d}_{i}^{T} \hat{\gamma}\) for \(i=1,2, \cdots, N\). Define \(C = (c_1, c_2, \cdots, c_N)^T\), \(D = (d_1, d_2, \cdots, d_N)^T\), and define \(\tilde{C}\) and \(\tilde{D}\) similarly. Set \(\hat{\mathbf{f}}_{\mathbf{x}}=[\hat{f}(x_{1}), \cdots, \hat{f}(x_{N})]^{T}\) and \(\hat{\mathbf{f}}_{\mathbf{x}}^{\prime}=[\hat{f}^{\prime}(x_{1}), \cdots, \hat{f}^{\prime}(x_{N})]^{T}\). Then, using (11) and (12), we have

$$\begin{aligned} \hat{\mathbf{f}}_{\mathbf{x}} &= C \hat{\mathbf{f}}-D \hat{\gamma}=[C-D R^{-1} Q^{T}] \hat{\mathbf{f}}=M \hat{\mathbf{f}}=M A_{\lambda} \mathbf{Y}, \\ \hat{\mathbf{f}}^{\prime}_{\mathbf{x}} &= \tilde{C} \hat{\mathbf{f}}-\tilde{D} \hat{\gamma}=[\tilde{C}-\tilde{D} R^{-1} Q^{T}] \hat{\mathbf{f}}=\tilde{M} \hat{\mathbf{f}}=\tilde{M} A_{\lambda} \mathbf{Y}, \end{aligned}$$

where \(M = C - D R^{-1} Q^{T}\) and \(\tilde{M} = \tilde{C} - \tilde{D} R^{-1} Q^{T}\).
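
In practice one need not form \(\tilde{C}\) and \(\tilde{D}\) explicitly: since \(\hat{\mathbf{f}}^{\prime}_{\mathbf{x}} = \tilde{M} A_{\lambda} \mathbf{Y}\) is linear in \(\mathbf{Y}\), the combined operator \(L = \tilde{M} A_{\lambda}\) can be assembled numerically by pushing the unit vectors through the fit-and-evaluate pipeline. The sketch below does this with the illustrative helpers defined earlier; the error model \(\operatorname{Var}(Y_i) = \sigma^2 / w_i\) in the closing comment is our assumption for illustration, with the paper's (7) giving the exact \(\widehat{sd}\) formula.

```python
def fprime_operator(x_grid, knots, lam, w=None):
    """Assemble L with f_hat'_x = L Y, column by column, by fitting the
    spline to each unit vector e_k and evaluating f_hat' on the grid."""
    n = len(knots)
    Q, R, _ = qrk_matrices(knots)
    L = np.zeros((len(x_grid), n))
    for k in range(n):
        e = np.zeros(n)
        e[k] = 1.0
        f_hat, _ = smoothing_spline_fit(knots, e, lam, w)
        gam_hat = np.linalg.solve(R, Q.T @ f_hat)   # gamma = R^{-1} Q^T f, from (11)
        L[:, k] = [eval_spline(x0, knots, f_hat, gam_hat)[1] for x0 in x_grid]
    return L

# If the errors are independent with Var(Y_i) = sigma^2 / w_i (an assumption
# made here for illustration), then Cov(f_hat'_x) = sigma^2 L W^{-1} L^T, so
# sd{f_hat'(x_i)} is sigma times the square root of the i-th diagonal entry
# of L W^{-1} L^T.
```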

Cite this article

Marron, J.S., Zhang, J.T. SiZer for smoothing splines. Computational Statistics 20, 481–502 (2005). https://doi.org/10.1007/BF02741310
