Semi-Supervised Learning

Zhu, Xiaojin

doi:10.1007/978-0-387-30164-8_749

Xiaojin Zhu

2278 Accesses
8 Citations

Synonyms

Co-training; Learning from labeled and unlabeled data; Transductive learning

Definition

Semi-supervised learning uses both labeled and unlabeled data to perform an otherwise supervised learning or unsupervised learning task.

In the former case, there is a distinction between inductive semi-supervised learning and transductive learning. In inductive semi-supervised learning, the learner has both labeled training data \(\{(\mathbf{x}_i, y_i)\}_{i=1}^l {\mathop{\sim}^{iid}} p(\mathbf{x},y)\) and unlabeled training data \(\{{\mathbf{x}_{i}\}}_{i\,=\,l+1}^{l+u} {\mathop{\sim}^{iid}}p(\mathbf{x})\), and learns a predictor \(f : \mathcal{X}\mapsto \mathcal{Y}\), \(f \in \mathcal{F}\) where \(\mathcal{F}\) is the hypothesis space. Here \(\mathbf{x} \in \mathcal{X}\) is an input instance, \(y \in \mathcal{Y}\) its target label (discrete for classification or continuous for regression), p(x, y) the unknown joint distribution and p(x) its marginal, and typically l ≪ u. The goal is to...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Recommended Reading

Abney, S. (2007). Semisupervised learning for computational linguistics. Florida: Chapman & Hall/CRC.
Book Google Scholar
Balcan, M.-F., & Blum, A. (2009). A discriminative model for semi-supervised learning. Journal of the ACM.
Google Scholar
Belkin, M., Niyogi, P., & Sindhwani, V. (2006). Manifold regularization: A geometric framework for learning from labeled and unlabeled examples. Journal of Machine Learning Research, 7, 2399–2434.
MathSciNet Google Scholar
Blum, A., & Chawla, S. (2001). Learning from labeled and unlabeled data using graph mincuts. In Proceedings of the 18th international conference on machine learning (pp. 19–26). San Francisco: Morgan Kaufmann.
Google Scholar
Blum, A., & Mitchell, T. (1998). Combining labeled and unlabeled data with co-training. In COLT: Proceedings of the workshop on computational learning theory (pp. 92–100). New York: ACM.
Google Scholar
Castelli, V., & Cover, T. (1995). The exponential value of labeled samples. Pattern Recognition Letters, 16(1), 105–111.
Article Google Scholar
Chapelle, O., Zien, A., & Schölkopf, B., (Eds.) (2006). Semi-supervised learning. Cambridge, MA MIT Press.
Google Scholar
Joachims, T. (1999). Transductive inference for text classification using support vector machines. In Proceedings of the 16th international conference on machine learning (pp. 200–209). San Francisco: Morgan Kaufmann.
Google Scholar
Nigam, K., McCallum, A. K., Thrun, S., & Mitchell, T. (2000). Text classification from labeled and unlabeled documents using EM. Machine Learning, 39(2/3), 103–134.
Article MATH Google Scholar
Seeger, M. (2001). Learning with labeled and unlabeled data. Technical report. University of Edinburgh, Edinburgh.
Google Scholar
Sindhwani, V., Niyogi, P., & Belkin, M. (2005). A co-regularized approach to semi-supervised learning with multiple views. In Proceedings of the 22nd ICML workshop on learning with multiple views.
Google Scholar
Vapnik, V. (1998). Statistical learning theory. New York: Wiley.
MATH Google Scholar
Yarowsky, D. (1995). Unsupervised word sense disambiguation rivaling supervised methods. In Proceedings of the 33rd annual meeting of the association for computational linguistics (pp. 189–196).
Chapter Google Scholar
Zhu, X., Ghahramani, Z., & Lafferty, J. (2003). Semi-supervised learning using Gaussian fields and harmonic functions. In The 20th international conference on machine learning (ICML).
Google Scholar
Zhu, X., & Goldberg, A. B. (2009). Synthesis lectures on artificial intelligence and machine learning. In Introduction to semi-supervised learning. Morgan & Claypool.
Google Scholar

Download references

Author information

Authors and Affiliations

Authors

Xiaojin Zhu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computer Science and Engineering, University of New South Wales, Sydney, Australia, 2052
Claude Sammut
Faculty of Information Technology, Clayton School of Information Technology, Monash University, P.O. Box 63, Victoria, Australia, 3800
Geoffrey I. Webb

Rights and permissions

Reprints and permissions

Copyright information

About this entry

Cite this entry

Zhu, X. (2011). Semi-Supervised Learning. In: Sammut, C., Webb, G.I. (eds) Encyclopedia of Machine Learning. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-30164-8_749

Download citation

DOI: https://doi.org/10.1007/978-0-387-30164-8_749
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-30768-8
Online ISBN: 978-0-387-30164-8
eBook Packages: Computer ScienceReference Module Computer Science and Engineering

Publish with us

Policies and ethics