Abstract
We use the Lasso, its adaptive variant, or its thresholded variant as a procedure for variable selection. This essentially means that, for \( S_0 := \{j : \beta^0_j \neq 0\} \) being the true active set, we look for a Lasso procedure delivering an estimator \( \hat{S} \) of \( S_0 \) such that \( \hat{S} = S_0 \) with large probability. However, it is clear that very small coefficients \( |\beta^0_j| \) cannot be detected by any method. Moreover, irrepresentable conditions show that the Lasso, or any weighted variant, typically selects too many variables. In other words, unless one imposes very strong conditions, false positives cannot be avoided either. We therefore aim at estimators with oracle prediction error that nevertheless have few false positives. The latter is considered achieved when \( |\hat{S} \setminus S_*| = O(|S_*|) \), where \( S_* \subset S_0 \) is the set of coefficients the oracle would select. We show that the adaptive Lasso procedure, and also thresholding the initial Lasso, reaches this aim, assuming sparse eigenvalues or, alternatively, so-called "beta-min" conditions.
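To illustrate the thresholded-Lasso idea described above, the following is a minimal numpy sketch (not the book's code): a Lasso estimator is computed by cyclic coordinate descent, its support \( \hat{S} \) is read off, and then small coefficients are discarded by thresholding to reduce false positives. The penalty level `lam`, the threshold `0.5`, and the simulated design are illustrative choices, not recommendations from the chapter.

```python
import numpy as np

def lasso_cd(X, y, lam, n_iter=200):
    """Lasso via cyclic coordinate descent.

    Minimizes (1/(2n)) * ||y - X b||^2 + lam * ||b||_1.
    """
    n, p = X.shape
    beta = np.zeros(p)
    col_sq = (X ** 2).sum(axis=0) / n   # per-column curvature X_j'X_j / n
    r = y - X @ beta                    # current residual
    for _ in range(n_iter):
        for j in range(p):
            # univariate problem in coordinate j, holding the others fixed
            rho = X[:, j] @ r / n + col_sq[j] * beta[j]
            new = np.sign(rho) * max(abs(rho) - lam, 0.0) / col_sq[j]
            r += X[:, j] * (beta[j] - new)  # update residual incrementally
            beta[j] = new
    return beta

# Toy sparse regression: 3 active variables out of 30.
rng = np.random.default_rng(0)
n, p, s0 = 100, 30, 3
X = rng.standard_normal((n, p))
beta0 = np.zeros(p)
beta0[:s0] = 2.0
y = X @ beta0 + 0.5 * rng.standard_normal(n)

beta_hat = lasso_cd(X, y, lam=0.1)
S_lasso = {j for j in range(p) if beta_hat[j] != 0}          # initial Lasso support
S_thresh = {j for j in range(p) if abs(beta_hat[j]) > 0.5}   # thresholded support
```

By construction, thresholding can only shrink the selected set, so \( \hat{S}_{\text{thresh}} \subseteq \hat{S}_{\text{Lasso}} \); the point of the chapter's analysis is that, under sparse-eigenvalue or beta-min conditions, the retained set is of the same order as the oracle's.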
© 2011 Springer-Verlag Berlin Heidelberg
Cite this chapter
Bühlmann, P., van de Geer, S. (2011). Variable selection with the Lasso. In: Statistics for High-Dimensional Data. Springer Series in Statistics. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20192-9_7
DOI: https://doi.org/10.1007/978-3-642-20192-9_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20191-2
Online ISBN: 978-3-642-20192-9
eBook Packages: Mathematics and Statistics (R0)