Skip to main content

Empirical Risk Minimization

  • Reference work entry
  • First Online:
Encyclopedia of Machine Learning and Data Mining

Definition

The goal of learning is usually to find a model which delivers good generalization performance over an underlying distribution of the data. Consider an input space \(\mathcal{X}\) and output space \(\mathcal{Y}\). Assume the pairs \((X \times Y ) \in \mathcal{X}\times \mathcal{Y}\) are random variables whose (unknown) joint distribution is P XY . It is our goal to find a predictor \(f : \mathcal{X}\mapsto \mathcal{Y}\) which minimizes the expected risk:

$$\displaystyle{P(f(X)\neq Y ) = \mathsf{E}_{(X,Y )\sim P_{XY }}\ \left [\delta (f(X)\neq Y )\right ],}$$

where δ(z) = 1 if z is true, and 0 otherwise.

However, in practice we only have n pairs of training examples (X i , Y i ) drawn identically and independently from P XY . Since P XY is unknown, we often use the risk on the training set (called empirical risk) as a surrogate of the expected risk on the underlying distribution:

$$\displaystyle{ \frac{1} {n}\sum _{i=1}^{n}\delta (f(X_{ i})\neq Y _{i}).}$$

Empirical Risk...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 699.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 949.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Recommended Reading

  • Vapnik V (1998) Statistical learning theory. John Wiley and Sons, New York

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xinhua Zhang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer Science+Business Media New York

About this entry

Cite this entry

Zhang, X. (2017). Empirical Risk Minimization. In: Sammut, C., Webb, G.I. (eds) Encyclopedia of Machine Learning and Data Mining. Springer, Boston, MA. https://doi.org/10.1007/978-1-4899-7687-1_79

Download citation

Publish with us

Policies and ethics