Skip to main content

Semi-Supervised Learning

  • Reference work entry
Encyclopedia of Machine Learning

Synonyms

Co-training; Learning from labeled and unlabeled data; Transductive learning

Definition

Semi-supervised learning uses both labeled and unlabeled data to perform an otherwise supervised learning or unsupervised learning task.

In the former case, there is a distinction between inductive semi-supervised learning and transductive learning. In inductive semi-supervised learning, the learner has both labeled training data \(\{(\mathbf{x}_i, y_i)\}_{i=1}^l {\mathop{\sim}^{iid}} p(\mathbf{x},y)\) and unlabeled training data \(\{{\mathbf{x}_{i}\}}_{i\,=\,l+1}^{l+u} {\mathop{\sim}^{iid}}p(\mathbf{x})\), and learns a predictor \(f : \mathcal{X}\mapsto \mathcal{Y}\), \(f \in \mathcal{F}\) where \(\mathcal{F}\) is the hypothesis space. Here \(\mathbf{x} \in \mathcal{X}\) is an input instance, \(y \in \mathcal{Y}\) its target label (discrete for classification or continuous for regression), p(x, y) the unknown joint distribution and p(x) its marginal, and typically l ≪ u. The goal is to...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Recommended Reading

  • Abney, S. (2007). Semisupervised learning for computational linguistics. Florida: Chapman & Hall/CRC.

    Book  Google Scholar 

  • Balcan, M.-F., & Blum, A. (2009). A discriminative model for semi-supervised learning. Journal of the ACM.

    Google Scholar 

  • Belkin, M., Niyogi, P., & Sindhwani, V. (2006). Manifold regularization: A geometric framework for learning from labeled and unlabeled examples. Journal of Machine Learning Research, 7, 2399–2434.

    MathSciNet  Google Scholar 

  • Blum, A., & Chawla, S. (2001). Learning from labeled and unlabeled data using graph mincuts. In Proceedings of the 18th international conference on machine learning (pp. 19–26). San Francisco: Morgan Kaufmann.

    Google Scholar 

  • Blum, A., & Mitchell, T. (1998). Combining labeled and unlabeled data with co-training. In COLT: Proceedings of the workshop on computational learning theory (pp. 92–100). New York: ACM.

    Google Scholar 

  • Castelli, V., & Cover, T. (1995). The exponential value of labeled samples. Pattern Recognition Letters, 16(1), 105–111.

    Article  Google Scholar 

  • Chapelle, O., Zien, A., & Schölkopf, B., (Eds.) (2006). Semi-supervised learning. Cambridge, MA MIT Press.

    Google Scholar 

  • Joachims, T. (1999). Transductive inference for text classification using support vector machines. In Proceedings of the 16th international conference on machine learning (pp. 200–209). San Francisco: Morgan Kaufmann.

    Google Scholar 

  • Nigam, K., McCallum, A. K., Thrun, S., & Mitchell, T. (2000). Text classification from labeled and unlabeled documents using EM. Machine Learning, 39(2/3), 103–134.

    Article  MATH  Google Scholar 

  • Seeger, M. (2001). Learning with labeled and unlabeled data. Technical report. University of Edinburgh, Edinburgh.

    Google Scholar 

  • Sindhwani, V., Niyogi, P., & Belkin, M. (2005). A co-regularized approach to semi-supervised learning with multiple views. In Proceedings of the 22nd ICML workshop on learning with multiple views.

    Google Scholar 

  • Vapnik, V. (1998). Statistical learning theory. New York: Wiley.

    MATH  Google Scholar 

  • Yarowsky, D. (1995). Unsupervised word sense disambiguation rivaling supervised methods. In Proceedings of the 33rd annual meeting of the association for computational linguistics (pp. 189–196).

    Chapter  Google Scholar 

  • Zhu, X., Ghahramani, Z., & Lafferty, J. (2003). Semi-supervised learning using Gaussian fields and harmonic functions. In The 20th international conference on machine learning (ICML).

    Google Scholar 

  • Zhu, X., & Goldberg, A. B. (2009). Synthesis lectures on artificial intelligence and machine learning. In Introduction to semi-supervised learning. Morgan & Claypool.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer Science+Business Media, LLC

About this entry

Cite this entry

Zhu, X. (2011). Semi-Supervised Learning. In: Sammut, C., Webb, G.I. (eds) Encyclopedia of Machine Learning. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-30164-8_749

Download citation

Publish with us

Policies and ethics