Probability Density Estimation

  • Chong Gu
Part of the Springer Series in Statistics book series (SSS, volume 297)


For observational data, (1.5) of Example 1.2 defines penalized likelihood density estimation. Of interest are the selection of smoothing parameters, the computation of the estimates, and the asymptotic behavior of the estimates. Variants of (1.5) are also called for to accommodate samples subject to selection bias and samples from conditional distributions.


Covariance Under Sampling 


  1. Abramowitz M, Stegun IA (1964) Handbook of mathematical functions with formulas, graphs, and mathematical tables. National Bureau of Standards, Washington, DC.MATHGoogle Scholar
  2. Akhiezer NI, Glazman IM (1961) Theory of linear operators in Hilbert space. Ulgar, New York.MATHGoogle Scholar
  3. Anderson JA (1972) Separate sampling logistic regression. Biometrika 59:19–35.MathSciNetMATHGoogle Scholar
  4. Anderson JA, Blair V (1982) Penalized maximum likelihood estimation in logistic regression and discrimination. Biometrika 69:123–136.MathSciNetMATHGoogle Scholar
  5. Anderson JA, Senthilselvan A (1980) Smooth estimates for the hazard function. J R Stat Soc Ser B 42:322–327.MathSciNetGoogle Scholar
  6. Ansley CF, Kohn R, Wong C-M (1993) Nonparametric spline regression with prior information. Biometrika 80:75–88.MathSciNetMATHGoogle Scholar
  7. Antoniadis A (1989) A penalty method for nonparametric estimation of the intensity function of a counting process. Ann Inst Stat Math 41:781–807.MathSciNetMATHGoogle Scholar
  8. Aronszajn N (1950) Theory of reproducing kernels. Trans Am Math Soc 68:337–404.MathSciNetMATHGoogle Scholar
  9. Azzalini A, Bowman AW (1990) A look at some data on the Old Faithful geyser. Appl Stat 39:357–365.MATHGoogle Scholar
  10. Barry D (1986) Nonparametric Bayesian regression. Ann Stat 14:934–953.MathSciNetMATHGoogle Scholar
  11. Bartoszyński R, Brown BW, McBride CM, Thompson JR (1981) Some nonparametric techniques for estimating the intensity function of a cancer related nonstationary Poisson process. Ann Stat 9:1050–1060.MATHGoogle Scholar
  12. Bates DM, Lindstrom M, Wahba G, Yandell B (1987) GCVPACK – routines for generalized cross validation. Commun Stat Simul Comput 16:263–297.MathSciNetMATHGoogle Scholar
  13. Becker RA, Chambers JM, Wilks AR (1988) The new S language. Wadsworth & Brooks/Cole, Pacific Grove.MATHGoogle Scholar
  14. Berger JO (1985) Statistical decision theory and Bayesian analysis, 2nd edn. Springer, New York.MATHGoogle Scholar
  15. Box GEP, Jenkins GM, Reinsel GC (1994) Time series analysis, 3rd edn. Prentice Hall, Englewood Cliffs.MATHGoogle Scholar
  16. Breiman L (1991) The ∏ method for estimating multivariate functions from noisy data. Technometrics 33:125–160 (with discussions).Google Scholar
  17. Breiman L, Friedman JH (1985) Estimating optimal transformations for multiple regression and correlation. J Am Stat Assoc 80:580–598 (with discussions).Google Scholar
  18. Breslow NE, Clayton DG (1993) Approximate inference in generalized linear mixed models. J Am Stat Assoc 88:9–25.MATHGoogle Scholar
  19. Brinkman ND (1981) Ethanol fuel – a single-cylinder engine study of efficiency and exhaust emissions. SAE Trans 90:1410–1424.Google Scholar
  20. Brockwell PJ, Davis RA (1991) Time series: theory and methods, 2nd edn. Springer, New York.Google Scholar
  21. Buja A, Hastie TJ, Tibshirani RJ (1989) Linear smoothers and additive models. Ann Stat 17:453–555 (with discussions).Google Scholar
  22. Byerly WE (1959) An elementary treatise on Fourier’s series and spherical, cylindrical, and ellipsoidal harmonics. Dover, New York. Reprint of the 1893 original.Google Scholar
  23. Chambers JM, Hastie TJ (eds) (1992) Statistical models in S. Chapman & Hall, New York.MATHGoogle Scholar
  24. Chen Z (1991) Interaction spline models and their convergence rates. Ann Stat 19:1855–1868.MATHGoogle Scholar
  25. Chhikara RS, Folks JL (1989) The inverse Gaussian distribution: theory, methodology, and applications. Marcel Dekker, New York.MATHGoogle Scholar
  26. Cleveland WS (1979) Robust locally weighted regression and smoothing scatterplots. J Am Stat Assoc 83:829–836.MathSciNetGoogle Scholar
  27. Cleveland WS, Devlin SJ (1988) Locally weighted regression: an approach to regression analysis by local fitting. J Am Stat Assoc 83:596–610.MATHGoogle Scholar
  28. Cogburn R, Davis HT (1974) Periodic splines and spectral estimation. Ann Stat 2:1108–1126.MathSciNetMATHGoogle Scholar
  29. Cole TJ (1988) Fitting smoothed centile curves to reference data. J R Stat Soc Ser A 151:385–418 (with discussions).Google Scholar
  30. Cole TJ, Green PJ (1992) Smoothing reference centile curves: the LMS method and penalized likelihood. Stat Med 11:1305–1319.Google Scholar
  31. Cosslett SR (1981) Maximum likelihood estimator for choice-based samples. Econometrika 49:1289–1316.MathSciNetMATHGoogle Scholar
  32. Cox DR (1969) Some sampling problems in technology. In: Johnson NL, Smith Jr H (eds) New developments in survey sampling. Wiley, New York, pp 506–527.Google Scholar
  33. Cox DR (1972) Regression models and life tables. J R Stat Soc Ser B 34:187–220 (with discussions).Google Scholar
  34. Cox DD (1984) Multivariate smoothing spline functions. SIAM J Numer Anal 21:789–813.MathSciNetMATHGoogle Scholar
  35. Cox DD (1988) Approximation of method of regularization estimators. Ann Stat 16:694–712.MATHGoogle Scholar
  36. Cox DD, Chang Y (1990) Iterated state space algorithms and cross validation for generalized smoothing splines. Technical Report 49, Department of Statistics, University of Illinois, Champion.Google Scholar
  37. Cox DD, O’Sullivan F (1990) Asymptotic analysis of penalized likelihood and related estimators. Ann Stat 18:124–145.MathSciNetGoogle Scholar
  38. Craven P, Wahba G (1979) Smoothing noisy data with spline functions: estimating the correct degree of smoothing by the method of generalized cross-validation. Numer Math 31:377–403.MathSciNetMATHGoogle Scholar
  39. Crowley J, Hu M (1977) Covariance analysis of heart transplant survival data. J Am Stat Assoc 72:27–36.Google Scholar
  40. Dalzell CJ, Ramsay JO (1993) Computing reproducing kernels with arbitrary boundary constraints. SIAM J Sci Comput 14:511–518.MathSciNetMATHGoogle Scholar
  41. de Boor C (1978) A practical guide to splines. Springer, New York.MATHGoogle Scholar
  42. de Boor C, Lynch R (1966) On splines and their minimum properties. J Math Mach 15:953–969.MATHGoogle Scholar
  43. Dennis JE, Schnabel RB (1996) Numerical methods for unconstrained optimization and nonlinear equations. SIAM, Philadelphia. Corrected reprint of the 1983 original.Google Scholar
  44. Dongarra JJ, Moler CB, Bunch JR, Stewart GW (1979) LINPACK user’s guide. SIAM, Philadelphia.Google Scholar
  45. Douglas A, Delampady M (1990) Eastern lake survey – phase I: documentation for the data base and the derived data sets. Technical Report 160 (SIMS), Department of Statistics, University of British Columbia, Vancouver.Google Scholar
  46. Du P, Gu C (2006) Penalized likelihood hazard estimation: efficient approximation and bayesian confidence intervals. Stat Probab Lett 76:244–254.MathSciNetMATHGoogle Scholar
  47. Du P, Gu C (2009) Penalized pseudo-likelihood hazard estimation: a fast alternative to penalized likelihood. J Stat Plan Inference 139:891–899.MathSciNetMATHGoogle Scholar
  48. Du P, Ma S (2010) Frailty model with spline estimated nonparametric hazard function. Stat Sin 20:561–580.MathSciNetMATHGoogle Scholar
  49. Duchon J (1977) Splines minimizing rotation-invariant semi-norms in sobolev spaces. In: Schemp W, Zeller K (eds) Constructive theory of functions of several variables. Springer, Berlin, pp 85–100.Google Scholar
  50. Elden L (1984) A note on the computation of the generalized cross validation function for ill-conditioned least square problems. BIT 24:467–472.MathSciNetMATHGoogle Scholar
  51. Fan J, Gijbels I (1996) Local polynomial modelling and its applications. Chapman & Hall, London.MATHGoogle Scholar
  52. Fleming TR, Harrington DP (1991) Counting processes and survival analysis. Wiley, New York.MATHGoogle Scholar
  53. Gao F (1999) Penalized multivariate logistic regression with a large data set. Ph.D. thesis, University of Wisconsin, Madison.Google Scholar
  54. Gao F, Wahba G, Klein R, Klein BEK (2001) Smoothing spline ANOVA for multivariate Bernoulli observations, with application to ophthalmology data. J Am Stat Assoc 96:127–160 (with discussions).Google Scholar
  55. Gill RD (1984) Understanding Cox’s regression model: a martingale approach. J Am Stat Assoc 79:441–447.MATHGoogle Scholar
  56. Gill PE, Murray W, Wright MH (1981) Practical optimization. Academic, New York.MATHGoogle Scholar
  57. Gill RD, Vardi Y, Wellner JA (1988) Large sample theory of empirical distributions in biased sampling models. Ann Stat 16:1069–1112.MathSciNetMATHGoogle Scholar
  58. Girard DA (1989) A fast “Monte-Carlo cross validation” procedure for large least squares problems with noisy data. Numer Math 56:1–23.MathSciNetMATHGoogle Scholar
  59. Girard DA (1991) Asymptotic optimality of the fast randomized versions of GCV and C L in ridge regression and regularization. Ann Stat 19:1950–1963.MathSciNetMATHGoogle Scholar
  60. Golub G, Van Loan C (1989) Matrix computations, 2nd edn. The Johns Hopkins University Press, Baltimore.MATHGoogle Scholar
  61. Good IJ, Gaskins RA (1971) Nonparametric roughness penalties for probability densities. Biometrika 58:255–277.MathSciNetMATHGoogle Scholar
  62. Gray RJ (1992) Flexible methods for analyzing survival data using splines, with applications to breast cancer prognosis. J Am Stat Assoc 87:942–951.Google Scholar
  63. Green PJ, Yandell B (1985) Semi-parametric generalized linear models. In: Gilchrist R, Francis B, Whittaker J (eds) Proceedings of the GLIM85 conference. Springer, Berlin, pp 44–55.Google Scholar
  64. Green PJ, Silverman BW (1994) Nonparametric regression and generalized linear models. Chapman & Hall, London.MATHGoogle Scholar
  65. Gu C (1989) RKPACK and its applications: fitting smoothing spline models. In: ASA proceedings of statistical computing section. ASA, Alexandria pp 42–51.Google Scholar
  66. Gu C (1990) Adaptive spline smoothing in non Gaussian regression models. J Am Stat Assoc 85:801–807.Google Scholar
  67. Gu C (1992a) Cross-validating non-Gaussian data. J Comput Graph Stat 1:169–179.Google Scholar
  68. Gu C (1992b) Diagnostics for nonparametric regression models with additive term. J Am Stat Assoc 87:1051–1058.Google Scholar
  69. Gu C (1992c) Penalized likelihood regression: a Bayesian analysis. Stat Sin 2:255–264.MATHGoogle Scholar
  70. Gu C (1992d) Smoothing spline density estimation: biased sampling and random truncation. Technical Report 92–03, Department of Statistics, Purdue University, West Lafayette.Google Scholar
  71. Gu C (1993a) Interaction splines with regular data: automatically smoothing digital images. SIAM J Sci Comput 14:218–230.MathSciNetMATHGoogle Scholar
  72. Gu C (1993b) Smoothing spline density estimation: a dimensionless automatic algorithm. J Am Stat Assoc 88:495–504.MATHGoogle Scholar
  73. Gu C (1994) Penalized likelihood hazard estimation: algorithm and examples. In: Berger JO, Gupta SS (eds) Statistical decision theory and related topics, V. Springer, New York, pp 61–72.Google Scholar
  74. Gu C (1995a) Smoothing spline density estimation: conditional distribution. Stat Sin 5:709–726.MATHGoogle Scholar
  75. Gu C (1995b) Smoothing spline density estimation: response-based sampling. Technical Report 267, Department of Statistics, University of Michigan, Ann Arbor.Google Scholar
  76. Gu C (1996) Penalized likelihood hazard estimation: a general procedure. Stat Sin 6:861–876.MATHGoogle Scholar
  77. Gu C (1998a) Model indexing and smoothing parameter selection in nonparametric function estimation. Stat Sin 8:607–646 (with discussions).Google Scholar
  78. Gu C (1998b) Penalized likelihood estimation: convergence under incorrect model. Stat Probab Lett 36:359–364.MATHGoogle Scholar
  79. Gu C (1998c) Structural multivariate function estimation: some automatic density and hazard estimates. Stat Sin 8:317–335.MATHGoogle Scholar
  80. Gu C (2004) Model diagnostics for smoothing spline ANOVA models. Can J Stat 32:347–358.MATHGoogle Scholar
  81. Gu C (2011) Practical nonparametric conditional density estimation. Preprint.Google Scholar
  82. Gu C, Kim Y-J (2002) Penalized likelihood regression: general formulation and efficient approximation. Can J Stat 30:619–628.MathSciNetMATHGoogle Scholar
  83. Gu C, Ma P (2005a) Generalized nonparametric mixed-effect models: computation and smoothing parameter selection. J Comput Graph Stat 14:485–504.MathSciNetGoogle Scholar
  84. Gu C, Ma P (2005b) Optimal smoothing in nonparametric mixed-effect models. Ann Stat 33:1357–1379.MathSciNetMATHGoogle Scholar
  85. Gu C, Ma P (2011) Nonparametric regression with cross-classified responses. Can J Stat 39:591–609.MathSciNetMATHGoogle Scholar
  86. Gu C, Qiu C (1993) Smoothing spline density estimation: theory. Ann Stat 21:217–234.MathSciNetMATHGoogle Scholar
  87. Gu C, Qiu C (1994) Penalized likelihood regression: a simple asymptotic analysis. Stat Sin 4:297–304.MathSciNetMATHGoogle Scholar
  88. Gu C, Wahba G (1991a) Discussion of “multivariate adaptive regression splines” by J. Friedman. Ann Stat 19:115–123.Google Scholar
  89. Gu C, Wahba G (1991b) Minimizing GCV/GML scores with multiple smoothing parameters via the Newton method. SIAM J Sci Stat Comput 12:383–398.MathSciNetMATHGoogle Scholar
  90. Gu C, Wahba G (1993a) Semiparametric analysis of variance with tensor product thin plate splines. J R Stat Soc Ser B 55:353–368.MathSciNetMATHGoogle Scholar
  91. Gu C, Wahba G (1993b) Smoothing spline ANOVA with component-wise Bayesian “confidence intervals”. J Comput Graph Stat 2:97–117.MathSciNetGoogle Scholar
  92. Gu C, Wang J (2003) Penalized likelihood density estimation: direct cross-validation and scalable approximation. Stat Sin 13:811–826.MathSciNetMATHGoogle Scholar
  93. Gu C, Xiang D (2001) Cross-validating non-Gaussian data: generalized approximate cross-validation revisited. J Comput Graph Stat 10:581–591.MathSciNetGoogle Scholar
  94. Gu C, Bates DM, Chen Z, Wahba G (1989) The computation of GCV functions through Householder tridiagonalization with application to the fitting of interaction spline models. SIAM J Matrix Anal Appl 10:457–480.MathSciNetMATHGoogle Scholar
  95. Gu C, Heckman N, Wahba G (1992) A note on generalized cross-validation with replicates. Stat Probab Lett 14:283–287.MathSciNetMATHGoogle Scholar
  96. Gu C, Jeon Y, Lin Y (2013) Nonparametric density estimation in high dimensions. Stat Sin 23:000-000.Google Scholar
  97. Hall P, Johnstone I (1992) Empirical functionals and efficient smoothing parameter selection. J R Stat Soc Ser B 54:475–530 (with discussions).Google Scholar
  98. Hall P, Marron JS (1991) Lower bounds for bandwidth selection in density estimation. Probab Theory Relat Fields 90:149–173.MathSciNetMATHGoogle Scholar
  99. Han C, Gu C (2008) Optimal smoothing with correlated data. Sankhya 70-A:38–72.MathSciNetGoogle Scholar
  100. Härdle W (1991) Smoothing techniques with implementation in S. Springer, New York.MATHGoogle Scholar
  101. Harville DA (1977) Maximum likelihood approaches to variance component estimation and to related problems. J Am Stat Assoc 72:320–340 (with discussions).Google Scholar
  102. Hastie TJ, Tibshirani RJ (1986) Generalized additive models. Stat Sci 1:297–318 (with discussions).Google Scholar
  103. Hastie TJ, Tibshirani RJ (1990) Generalized additive models. Chapman & Hall, London.MATHGoogle Scholar
  104. Heckman NE, Ramsay JO (2000) Penalized regression with model-based penalties. Can J Stat 28:241–258.MathSciNetMATHGoogle Scholar
  105. Hornik K (2010) The R FAQ. ISBN 3-900051-08-9.Google Scholar
  106. Hutchinson MF (1989) A stochastic estimator of the trace of the influence matrix for Laplacian smoothing splines. Commun Stat Simul Comput 18:1059–1076.MathSciNetMATHGoogle Scholar
  107. Hutchinson MF, de Hoog FR (1985) Smoothing noisy data with spline functions. Numer Math 47:99–106.MathSciNetMATHGoogle Scholar
  108. Ihaka R, Gentleman R (1996) R: a language for data analysis and graphics. J Comput Graph Stat 5:299–314.Google Scholar
  109. Jeon Y, Lin Y (2006) An effective method for high dimensional log-density ANOVA estimation, with application to nonparametric graphical model building. Stat Sin 16:353–374.MathSciNetMATHGoogle Scholar
  110. Joe H (1997) Multivariate models and dependence concepts. Chapman & Hall, London.MATHGoogle Scholar
  111. Johnson RA, Wichern DW (1992) Applied multivariate statistical analysis, 3rd edn. Prentice Hall, Englewood Cliffs.MATHGoogle Scholar
  112. Jones RH (1980) Maximum likelihood fitting of ARMA models to time series with missing observations. Technometrics 20:389–395.Google Scholar
  113. Jones MC (1991) Kernel density estimation for length biased data. Biometrika 78:511–520.MathSciNetMATHGoogle Scholar
  114. Kalbfleisch JD, Lawless JF (1989) Inference based on retrospective ascertainment: an analysis of the data on transfusion-related AIDS. J Am Stat Assoc 84:360–372.MathSciNetMATHGoogle Scholar
  115. Kalbfleisch JD, Prentice RL (1980) The statistical analysis of failure time data. Wiley, New York.MATHGoogle Scholar
  116. Kaplan EL, Meier P (1958) Nonparametric estimator from incomplete observations. J Am Stat Assoc 53:457–481.MathSciNetMATHGoogle Scholar
  117. Karcher P, Wang Y (2001) Generalized nonparametric mixed effects models. J Comput Graph Stat 10:641–655.MathSciNetGoogle Scholar
  118. Karlin S, Studden WJ (1966) Tchebycheff systems: with applications in analysis and statistics. Wiley, New York/London/Sydney.MATHGoogle Scholar
  119. Keiding N, Gill RD (1990) Random truncation models and markov processes. Ann Stat 18:582–602.MathSciNetMATHGoogle Scholar
  120. Kernighan BW (1975) Ratfor – a preprocessor for a rational Fortran. In: UNIX programmer’s manual. Bell Laboratories, Murray Hill.Google Scholar
  121. Kim Y-J (2003) Smoothing spline regression: scalable computation and cross-validation. Ph.D. thesis, Purdue University, West Lafayette.Google Scholar
  122. Kim Y-J, Gu C (2004) Smoothing spline Gaussian regression: more scalable computation via efficient approximation. J R Stat Soc Ser B 66:337–356.MathSciNetMATHGoogle Scholar
  123. Kimeldorf G, Wahba G (1970a) A correspondence between Bayesian estimation of stochastic processes and smoothing by splines. Ann Math Stat 41:495–502.MathSciNetMATHGoogle Scholar
  124. Kimeldorf G, Wahba G (1970b) Spline functions and stochastic processes. Sankhya Ser A 32:173–180.MathSciNetMATHGoogle Scholar
  125. Kimeldorf G, Wahba G (1971) Some results on Tchebycheffian spline functions. J Math Anal Appl 33:82–85.MathSciNetMATHGoogle Scholar
  126. Klein R, Klein BEK, Moss SE, Davis MD, DeMets DL (1988) Glycosylated hemoglobin predicts the incidence and progression of diabetic retinopathy. J Am Med Assoc 260:2864–2871.Google Scholar
  127. Klein R, Klein BEK, Moss SE, Davis MD, DeMets DL (1989) The Wisconsin epidemiologic study of diabetic retinopathy. X. Four incidence and progression of diabetic retinopathy when age at diagnosis is 30 or more years. Arch Ophthalmol 107:244–249.Google Scholar
  128. Koenker R, Ng P, Portnoy S (1994) Quantile smoothing splines. Biometrika 81:673–680.MathSciNetMATHGoogle Scholar
  129. Kooperberg C, Bose S, Stone CJ (1997) Polychotomous regression. J Am Stat Assoc 92:117–127.MATHGoogle Scholar
  130. Lauritzen SL (1996) Graphical models. Oxford University Press, New York.Google Scholar
  131. Lee Y, Nelder JA (1996) Hierarchical generalized linear models. J R Stat Soc Ser B 58:619–678 (with discussions).Google Scholar
  132. Lehmann EL, Casella G (1998) Theory of point estimation, 2nd edn. Springer, New York.MATHGoogle Scholar
  133. Leonard T (1978) Density estimation, stochastic processes and prior information. J R Stat Soc Ser B 40:113–146 (with discussions).Google Scholar
  134. Leonard T, Hsu JSJ, Tsui K-W (1989) Bayesian marginal inference. J Am Stat Assoc 84:1051–1058.MathSciNetGoogle Scholar
  135. Li K-C (1986) Asymptotic optimality of C L and generalized cross-validation in the ridge regression with application to spline smoothing. Ann Stat 14:1101–1112.MATHGoogle Scholar
  136. Lin X (1998) Smoothing spline analysis of variance for polychotomous response data. Ph.D. thesis, University of Wisconsin, Madison.Google Scholar
  137. Lin Y (2000) Tensor product space ANOVA models. Ann Stat 28:734–755.MATHGoogle Scholar
  138. Lin X, Zhang D (1999) Inference in generalized additive mixed models by using smoothing splines. J R Stat Soc Ser B 61:381–400.MathSciNetMATHGoogle Scholar
  139. Lindsey JK (1997) Applying generalized linear models. Springer, New York.MATHGoogle Scholar
  140. Mallows CL (1973) Some comments on C P. Technometrics 15:661–675.MATHGoogle Scholar
  141. Manski CF (1995) Identification problems in the social sciences. Harvard University Press, Cambridge.Google Scholar
  142. McCullagh P, Nelder JA (1989) Generalized linear models, 2nd edn. Chapman & Hall, London.MATHGoogle Scholar
  143. McCulloch CE (1997) Maximum likelihood algorithms for generalized linear mixed models. J Am Stat Assoc 92:162–170.MathSciNetMATHGoogle Scholar
  144. Meinguet J (1979) Multivariate interpolation at arbitrary points made simple. J Appl Math Phys (ZAMP) 30:292–304.MathSciNetMATHGoogle Scholar
  145. Miller RG (1976) Least squares regression with censored data. Biometrika 63:449–464.MathSciNetMATHGoogle Scholar
  146. Miller RG, Halpern J (1982) Regression with censored data. Biometrika 69:521–531.MathSciNetMATHGoogle Scholar
  147. Moreau T, O’Quigley J, Mesbah M (1985) A global goodness-of-fit statistic for the proportional hazards model. Appl Stat 34:212–218.MathSciNetGoogle Scholar
  148. Morgenthaler S, Vardi Y (1986) Choice-based samples: a non-parametric approach. J. Econom 32:109–125.MathSciNetMATHGoogle Scholar
  149. Novak E, Ritter K (1996) High dimensional integration of smooth functions over cubes. Numer Math 75:79–97.MathSciNetMATHGoogle Scholar
  150. Nychka D (1988) Bayesian confidence intervals for smoothing splines. J Am Stat Assoc 83:1134–1143.MathSciNetGoogle Scholar
  151. O’Sullivan F (1985) Discussion of “Some aspects of the spline smoothing approach to nonparametric regression curve fitting” by B. Silverman J R Stat Soc Ser B 47:39–40.Google Scholar
  152. O’Sullivan F (1988a) Fast computation of fully automated log-density and log-hazard estimators. SIAM J Sci Stat Comput 9:363–379.MathSciNetMATHGoogle Scholar
  153. O’Sullivan F (1988b) Nonparametric estimation of relative risk using splines and cross-validation. SIAM J Sci Stat Comput 9:531–542.MathSciNetMATHGoogle Scholar
  154. O’Sullivan F, Yandell B, Raynor W (1986) Automatic smoothing of regression functions in generalized linear models. J Am Stat Assoc 81:96–103.MathSciNetGoogle Scholar
  155. Ouyang Z, Zhou Q, Wong WH (2009) ChIP-Seq of transcription factors predicts absolute and differential gene expression in embryonic stem cells. Proc Natl Acad Sci USA 106:21521–21526. doi:10.1073/pnas.0904863106.Google Scholar
  156. Parzen E (1979) Nonparametric statistical data modeling. J Am Stat Assoc 74:105–131 (with discussions).Google Scholar
  157. Pawitan Y, O’Sullivan F (1994) Nonparametric spectral density estimation using penalized Whittle likelihood. J Am Stat Assoc 89:600–610.MathSciNetMATHGoogle Scholar
  158. Petras K (2001) Asymptotically minimal smolyak cubature. Preprint.Google Scholar
  159. Prentice RL, Pyke R (1978) Logistic disease incidence models and case-control studies. Biometrika 66:403–411.MathSciNetGoogle Scholar
  160. Priestley MB (1981) Spectral analysis and times series. Academic, London.Google Scholar
  161. Ramsay JO, Dalzell CJ (1991) Some tools for functional data analysis. J R Stat Soc Ser B 53:539–572 (with discussions).Google Scholar
  162. Rao CR (1973) Linear statistical inference and its applications. Wiley, New York.MATHGoogle Scholar
  163. Robinson GK (1991) That BLUP is a good thing: the estimation of the random effects. Stat Sci 6:15–51 (with discussions).Google Scholar
  164. Scheffe H (1959) The analysis of variance. Wiley, New York.MATHGoogle Scholar
  165. Schoenberg IJ (1964) Spline functions and the problem of graduation. Proc Natl Acad Sci USA 52:947–950.MathSciNetMATHGoogle Scholar
  166. Schumaker L (1981) Spline functions: basic theory. Wiley, New York.MATHGoogle Scholar
  167. Scott DW (1985) Averaged shifted histograms: effective nonparametric density estimators in several dimensions. Ann Stat 13:1024–1040.MATHGoogle Scholar
  168. Scott DW (1992) Multivariate density estimation: theory, practice and visualization. Wiley, New York.MATHGoogle Scholar
  169. Scott DW, Terrell GR (1987) Biased and unbiased cross-validation in density estimation. J Am Stat Assoc 82:1131–1146.MathSciNetMATHGoogle Scholar
  170. Scott AJ, Wild CJ (1986) Fitting logistic models under case-control and choice-based sampling. J R Stat Soc Ser B 48:170–182.MathSciNetMATHGoogle Scholar
  171. Seber GAF (1977) Linear regression analysis. Wiley, New York.MATHGoogle Scholar
  172. Silverman BW (1978) Density ratios, empirical likelihood and cot death. Appl Stat 27:26–33.Google Scholar
  173. Silverman BW (1982) On the estimation of a probability density function by the maximum penalized likelihood method. Ann Stat 10:795–810.MATHGoogle Scholar
  174. Snyder DL (1975) Random point processes. Wiley, New York.MATHGoogle Scholar
  175. Stein ML (1993) Spline smoothing with an estimated order parameter. Ann Stat 21:1522–1544.MATHGoogle Scholar
  176. Stewart GW (1987) Collinearity and least square regression. Stat Sci 2:68–100 (with discussions).Google Scholar
  177. Stone CJ (1985) Additive regression and other nonparametric models. Ann Stat 13:689–705.MATHGoogle Scholar
  178. Stone CJ, Hansen MH, Kooperberg C, Truong YK (1997) Polynomial splines and their tensor products in extended linear modeling. Ann Stat 25:1371–1470 (with discussions).Google Scholar
  179. Tapia RA, Thompson JR (1978) Nonparametric probability density estimation. Johns Hopkins University Press, Baltimore.MATHGoogle Scholar
  180. Tierney L, Kadane JB (1986) Accurate approximations for posterior moments and marginal densities. J Am Stat Assoc 81:82–86.MathSciNetMATHGoogle Scholar
  181. Tong H (1990) Nonlinear time series. Oxford University Press, New York.Google Scholar
  182. Turnbull BW, Brown BW, Hu M (1974) Survivorship analysis of heart transplant data. J Am Stat Assoc 69:74–80.Google Scholar
  183. Utreras F (1981) Optimal smoothing of noisy data using spline functions. SIAM J Sci Stat Comput 2:349–362.MathSciNetMATHGoogle Scholar
  184. Utreras F (1983) Natural spline functions: their associated eigenvalue problem. Numer Math 42:107–117.MathSciNetMATHGoogle Scholar
  185. Utreras F (1988) Convergence rates for multivariate smoothing spline functions. J Approx Theory 52:1–27.MathSciNetMATHGoogle Scholar
  186. Vardi Y (1982) Nonparametric estimation in the presence of length bias. Ann Stat 10:616–620.MathSciNetMATHGoogle Scholar
  187. Vardi Y (1985) Empirical distributions in selection bias models. Ann Stat 13:178–203.MathSciNetMATHGoogle Scholar
  188. Venables WN, Ripley BD (2002) Modern applied statistics with S, 4th edn. Springer, New York.MATHGoogle Scholar
  189. Wahba G (1978) Improper priors, spline smoothing and the problem of guarding against model errors in regression. J R Stat Soc Ser B 40:364–372.MathSciNetMATHGoogle Scholar
  190. Wahba G (1980) Automatic smoothing of the log periodogram. J Am Stat Assoc 75:122–132.MATHGoogle Scholar
  191. Wahba G (1981) Spline interpolation and smoothing on the sphere. SIAM J Sci Stat Comput 2:5–16.MathSciNetMATHGoogle Scholar
  192. Wahba G (1982) Erratum: spline interpolation and smoothing on the sphere. SIAM J Sci Stat Comput 3:385–386.MathSciNetMATHGoogle Scholar
  193. Wahba G (1983) Bayesian “confidence intervals” for the cross-validated smoothing spline. J R Stat Soc Ser B 45:133–150.MathSciNetMATHGoogle Scholar
  194. Wahba G (1985) A comparison of GCV and GML for choosing the smoothing parameter in the generalized spline smoothing problem. Ann Stat 13:1378–1402.MathSciNetMATHGoogle Scholar
  195. Wahba G (1986) Partial and interaction spline models for the semiparametric estimation of functions of several variables. In: Computer science and statistics: proceedings of the 18th symposium on the interface. ASA, Washington, DC, pp 75–80.Google Scholar
  196. Wahba G (1990) Spline models for observational data. CBMS-NSF regional conference series in applied mathematics, vol 59. SIAM, Philadelphia.Google Scholar
  197. Wahba G, Wendelberger J (1980) Some new mathematical methods for variational objective analysis using splines and cross validation. Mon Weather Rev 108:1122–1145.Google Scholar
  198. Wahba G, Wang Y, Gu C, Klein R, Klein BEK (1995) Smoothing spline ANOVA for exponential families, with application to the Wisconsin epidemiological study of diabetic retinopathy. Ann Stat 23:1865–1895.MathSciNetMATHGoogle Scholar
  199. Wang M-C (1989) A semiparametric model for randomly truncated data. J Am Stat Assoc 84:742–748.MATHGoogle Scholar
  200. Wang Y (1997) GRKPACK: fitting smoothing spline ANOVA models for exponential families. Commun Stat Simul Comput 26:765–782.MATHGoogle Scholar
  201. Wang Y (1998a) Mixed-effects smoothing spline ANOVA. J R Stat Soc Ser B 60:159–174.MATHGoogle Scholar
  202. Wang Y (1998b) Smoothing spline models with correlated random errors. J Am Stat Assoc 93:341–348.MATHGoogle Scholar
  203. Wang Y, Brown MB (1996) A flexible model for human circadian rhythms. Biometrics 52:588–596.MATHGoogle Scholar
  204. Wang M-C, Jewell NP, Tsay W-Y (1986) Asymptotic properties of the product limit estimate under random truncation. Ann Stat 14:1597–1605.MATHGoogle Scholar
  205. Weinberger HF (1974) Variational methods for eigenvalue approximation. CBMS-NSF regional conference series in applied mathematics, vol 15. SIAM, Philadelphia.Google Scholar
  206. Whittaker ET (1923) On a new method of graduation. Proc Edinb Math Soc 41:63–75.Google Scholar
  207. Whittaker J (1990) Graphical models in applied multivariate statistics. Wiley, Chichester.MATHGoogle Scholar
  208. Williamson B (1899) An elementary treatise on the differential calculus. Longman, Green, London.Google Scholar
  209. Woodroofe M (1985) Estimating a distribution function with truncated data. Ann Stat 13:163–177.MathSciNetMATHGoogle Scholar
  210. Xiang D, Wahba G (1996) A generalized approximate cross validation for smoothing splines with non-Gaussian data. Stat Sin 6:675–692.MathSciNetMATHGoogle Scholar
  211. Zeger SL, Karim MR (1991) Generalized linear models with random effects: a Gibbs sampling approach. J Am Stat Assoc 86:79–86.MathSciNetGoogle Scholar
  212. Zhang T, Lin G (2009) Cluster detection based on spatial associations and iterated residuals in generalized linear mixed models. Biometrics 65:353–360.MathSciNetMATHGoogle Scholar
  213. Zucker DM, Karr AF (1990) Nonparametric survival analysis with time-dependent covariate effects: a penalized partial likelihood approach. Ann Stat 18:329–353.MathSciNetMATHGoogle Scholar

Copyright information

© Springer Science+Business Media New York 2013

Authors and Affiliations

  • Chong Gu
    • 1
  1. 1.Department of StatisticsPurdue UniversityWest LafayetteUSA

Personalised recommendations