Definition
Semi-supervised learning uses both labeled and unlabeled data to perform an otherwise supervised learning or unsupervised learning task.
In the former case, there is a distinction between inductive semi-supervised learning and transductive learning. In inductive semi-supervised learning, the learner has both labeled training data \(\{(\mathbf{x}_i, y_i)\}_{i=1}^l {\mathop{\sim}^{iid}} p(\mathbf{x},y)\) and unlabeled training data \(\{{\mathbf{x}_{i}\}}_{i\,=\,l+1}^{l+u} {\mathop{\sim}^{iid}}p(\mathbf{x})\), and learns a predictor \(f : \mathcal{X}\mapsto \mathcal{Y}\), \(f \in \mathcal{F}\) where \(\mathcal{F}\) is the hypothesis space. Here \(\mathbf{x} \in \mathcal{X}\) is an input instance, \(y \in \mathcal{Y}\) its target label (discrete for classification or continuous for regression), p(x, y) the unknown joint distribution and p(x) its marginal, and typically l ≪ u. The goal is to...
Recommended Reading
Abney, S. (2007). Semisupervised learning for computational linguistics. Florida: Chapman & Hall/CRC.
Balcan, M.-F., & Blum, A. (2009). A discriminative model for semi-supervised learning. Journal of the ACM.
Belkin, M., Niyogi, P., & Sindhwani, V. (2006). Manifold regularization: A geometric framework for learning from labeled and unlabeled examples. Journal of Machine Learning Research, 7, 2399–2434.
Blum, A., & Chawla, S. (2001). Learning from labeled and unlabeled data using graph mincuts. In Proceedings of the 18th international conference on machine learning (pp. 19–26). San Francisco: Morgan Kaufmann.
Blum, A., & Mitchell, T. (1998). Combining labeled and unlabeled data with co-training. In COLT: Proceedings of the workshop on computational learning theory (pp. 92–100). New York: ACM.
Castelli, V., & Cover, T. (1995). The exponential value of labeled samples. Pattern Recognition Letters, 16(1), 105–111.
Chapelle, O., Zien, A., & Schölkopf, B., (Eds.) (2006). Semi-supervised learning. Cambridge, MA MIT Press.
Joachims, T. (1999). Transductive inference for text classification using support vector machines. In Proceedings of the 16th international conference on machine learning (pp. 200–209). San Francisco: Morgan Kaufmann.
Nigam, K., McCallum, A. K., Thrun, S., & Mitchell, T. (2000). Text classification from labeled and unlabeled documents using EM. Machine Learning, 39(2/3), 103–134.
Seeger, M. (2001). Learning with labeled and unlabeled data. Technical report. University of Edinburgh, Edinburgh.
Sindhwani, V., Niyogi, P., & Belkin, M. (2005). A co-regularized approach to semi-supervised learning with multiple views. In Proceedings of the 22nd ICML workshop on learning with multiple views.
Vapnik, V. (1998). Statistical learning theory. New York: Wiley.
Yarowsky, D. (1995). Unsupervised word sense disambiguation rivaling supervised methods. In Proceedings of the 33rd annual meeting of the association for computational linguistics (pp. 189–196).
Zhu, X., Ghahramani, Z., & Lafferty, J. (2003). Semi-supervised learning using Gaussian fields and harmonic functions. In The 20th international conference on machine learning (ICML).
Zhu, X., & Goldberg, A. B. (2009). Synthesis lectures on artificial intelligence and machine learning. In Introduction to semi-supervised learning. Morgan & Claypool.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer Science+Business Media, LLC
About this entry
Cite this entry
Zhu, X. (2011). Semi-Supervised Learning. In: Sammut, C., Webb, G.I. (eds) Encyclopedia of Machine Learning. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-30164-8_749
Download citation
DOI: https://doi.org/10.1007/978-0-387-30164-8_749
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-30768-8
Online ISBN: 978-0-387-30164-8
eBook Packages: Computer ScienceReference Module Computer Science and Engineering