Abstract
In statistics, we usually assume that the number of samples N is larger than the number of variables p. Otherwise, linear regression does not yield a unique least squares solution, and finding the optimal variable set by comparing the information criterion values of all 2^p subsets of the p variables is computationally infeasible. Therefore, it is difficult to estimate the parameters. In such a sparse situation, regularization is often used. In the case of linear regression, we add a penalty term to the squared error to prevent the coefficient values from growing too large. When the regularization term is a constant λ times the L1 or L2 norm of the coefficients, the method is called lasso or ridge, respectively. In the case of lasso, as the constant λ increases, some coefficients become 0; eventually, all coefficients become 0 once λ is sufficiently large. In that sense, lasso plays the role of model selection. In this chapter, we consider the principle of lasso and compare it with ridge. Finally, we learn how to choose the constant λ.
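As a rough illustration of the contrast drawn above, the following is a minimal NumPy sketch (not the book's own code), assuming the standard objective (1/(2N))‖y − Xβ‖² + λ·penalty(β); the function names and the coordinate-descent solver for lasso are illustrative choices.

```python
# Minimal sketch: ridge has a closed-form solution, while lasso is computed
# here by coordinate descent with soft-thresholding. Assumed objective:
#   (1/(2N)) * ||y - X beta||^2 + lam * penalty(beta).
import numpy as np

def ridge(X, y, lam):
    """Ridge estimate: (X^T X / N + lam * I)^{-1} X^T y / N."""
    N, p = X.shape
    return np.linalg.solve(X.T @ X / N + lam * np.eye(p), X.T @ y / N)

def soft_threshold(z, lam):
    """S(z, lam) = sign(z) * max(|z| - lam, 0)."""
    return np.sign(z) * np.maximum(np.abs(z) - lam, 0.0)

def lasso(X, y, lam, n_iter=200):
    """Coordinate descent: update each beta_j with the others held fixed."""
    N, p = X.shape
    beta = np.zeros(p)
    for _ in range(n_iter):
        for j in range(p):
            r = y - X @ beta + X[:, j] * beta[j]   # partial residual
            z = X[:, j] @ r / N
            beta[j] = soft_threshold(z, lam) / (X[:, j] @ X[:, j] / N)
    return beta

# As lam grows, lasso drives more coefficients exactly to 0,
# while ridge only shrinks them toward 0.
rng = np.random.default_rng(0)
N, p = 100, 10
X = rng.standard_normal((N, p))
beta_true = np.array([2.0, -1.0] + [0.0] * (p - 2))
y = X @ beta_true + 0.1 * rng.standard_normal(N)
for lam in (0.01, 0.1, 1.0):
    print(lam, np.round(lasso(X, y, lam), 2))
```

Running the loop shows more coordinates of the lasso estimate hitting exactly 0 as λ increases, which is the model selection effect mentioned above.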
Notes
1. In this book, convexity always means convex below and does not mean concave (convex above).
2. In such a case, we do not express the subderivative as {f′(x₀)} but as f′(x₀).
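As an illustration (not from the source): for f(x) = |x|, the subderivative at the origin is an interval, while at any other point it is a singleton and, by the convention in note 2, is written without braces:

```latex
\partial f(x_0) =
\begin{cases}
\{-1\} = -1, & x_0 < 0, \\
[-1,\, 1],   & x_0 = 0, \\
\{+1\} = +1, & x_0 > 0.
\end{cases}
```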
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Suzuki, J. (2021). Regularization. In: Statistical Learning with Math and Python. Springer, Singapore. https://doi.org/10.1007/978-981-15-7877-9_6
DOI: https://doi.org/10.1007/978-981-15-7877-9_6
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-7876-2
Online ISBN: 978-981-15-7877-9
eBook Packages: Computer Science (R0)