Skip to main content

Part of the book series: Lecture Notes in Statistics ((LNS,volume 109))

Abstract

This paper analyzes estimation by bootstrap variable-selection in a simple Gaussian model where the dimension of the unknown parameter may exceed that of the data. A naive use of the bootstrap in this problem produces risk estimators for candidate variable-selections that have a strong upward bias. Resampling from a less overfitted model removes the bias and leads to bootstrap variable-selections that minimize risk asymptotically. A related bootstrap technique generates confidence sets that are centered at the best bootstrap variable-selection and have two further properties: the asymptotic coverage probability for the unknown parameter is as desired and the confidence set is geometrically smaller than a classical competitor. The results suggest a possible approach to confidence sets in other inverse problems where a regularization technique is used.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Akaike, H. (1974).: A new look at statistical model identification. IEEE Trans. Auto. Coni. 19 716–723.

    Article  MathSciNet  MATH  Google Scholar 

  2. Alexander, K.S. and Pyke, R. (1986): A uniform central limit theorem for set-indexed partial-sum processes with finite variance. Ann. Prob. 14 582–597.

    Article  MathSciNet  MATH  Google Scholar 

  3. Beran, R. (1994): Confidence sets centered at C p-estimators. Preprint.

    Google Scholar 

  4. Breiman, L. (1992): The little bootstrap and other methods for dimensionality selection in regression: X-fixed prediction error. J. Amer. Statist. Assoc. 87 738–754.

    Article  MathSciNet  MATH  Google Scholar 

  5. Efron, B. and Tibshirani, R.J. (1993): An Introduction to the Bootstrap. Chapman & Hall, New York.

    MATH  Google Scholar 

  6. Freedman, D.A., Navidi, W., and Peters, S.C. (1988): On the impact of variable selection in fitting regression equations. On Model Uncertainty and its Statistical Implications (T.K. Dijkstra, ed.), 1–16. Springer-Verlag, Berlin.

    Chapter  Google Scholar 

  7. Gasser, T., Sroka, L., and Jennen-Steinmetz, C. (1986): Residual variance and residual pattern in nonlinear regression. Biometrika 73 625–633.

    Article  MathSciNet  MATH  Google Scholar 

  8. James, W. and Stein, C. (1961): Estimation with quadratic loss. Proc. Fourth Berkeley Symp. Math. Statist. Prob., Vol. 1, 361–380. Univ. of California Press, Berkeley.

    Google Scholar 

  9. LeCam, L. (1983): A remark on empirical measures. Festschrift for Erich Lehmann (P.J. Bickel, K. Doksum and J.L. Hodges, eds.), 305–327. Wadsworth, Belmont, California.

    Google Scholar 

  10. Mallows, C. (1973): Some comments on C p. Technometrics 15 661–675.

    Article  MATH  Google Scholar 

  11. Rice, J. (1984): Bandwidth choice for nonparametric regression. Ann. Statist. 12 1215–1230.

    Article  MathSciNet  MATH  Google Scholar 

  12. Shibata, R. (1981): An optimal selection of regression variables. Biometrika 68 45–54.

    Article  MathSciNet  MATH  Google Scholar 

  13. Speed, T.P. and Yu, B. (1993): Model selection and prediction: Normal regression. Ann. Inst. Statist. Math. 45 35–54.

    Article  MathSciNet  MATH  Google Scholar 

  14. Stein, C. (1956): Inadmissibility of the usual estimator for the mean of a multivariate normal distribution. Proc. Third Berkeley Symp. Math. Statist. Prob. 1 197–206. Univ. of California Press, Berkeley.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1996 Springer-Verlag New York, Inc.

About this chapter

Cite this chapter

Beran, R. (1996). Bootstrap Variable—Selection and Confidence Sets. In: Rieder, H. (eds) Robust Statistics, Data Analysis, and Computer Intensive Methods. Lecture Notes in Statistics, vol 109. Springer, New York, NY. https://doi.org/10.1007/978-1-4612-2380-1_1

Download citation

  • DOI: https://doi.org/10.1007/978-1-4612-2380-1_1

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-0-387-94660-3

  • Online ISBN: 978-1-4612-2380-1

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics