Skip to main content

Implicitly Constrained Semi-supervised Least Squares Classification

  • Conference paper
  • First Online:
Advances in Intelligent Data Analysis XIV (IDA 2015)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9385))

Included in the following conference series:

Abstract

We introduce a novel semi-supervised version of the least squares classifier. This implicitly constrained least squares (ICLS) classifier minimizes the squared loss on the labeled data among the set of parameters implied by all possible labelings of the unlabeled data. Unlike other discriminative semi-supervised methods, our approach does not introduce explicit additional assumptions into the objective function, but leverages implicit assumptions already present in the choice of the supervised least squares classifier. We show this approach can be formulated as a quadratic programming problem and its solution can be found using a simple gradient descent procedure. We prove that, in a certain way, our method never leads to performance worse than the supervised classifier. Experimental results corroborate this theoretical result in the multidimensional case on benchmark datasets, also in terms of the errorĀ rate.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Bache, K., Lichman, M.: UCI Machine Learning Repository (2013). http://archive.ics.uci.edu/ml

  2. Bennett, K.P., Demiriz, A.: Semi-supervised support vector machines. Adv. Neural Inf. Process. Syst. 11, 368ā€“374 (1998)

    Google ScholarĀ 

  3. Bottou, L.: Large-scale machine learning with stochastic gradient descent. In: Lechevallier, Y., Saporta, G. (eds.) COMPSTAT 2010, pp. 177ā€“186. Springer, Heidelberg (2010)

    ChapterĀ  Google ScholarĀ 

  4. Byrd, R.H., Lu, P., Nocedal, J., Zhu, C.: A limited memory algorithm for bound constrained optimization. SIAM J. Sci. Comput. 16(5), 1190ā€“1208 (1995)

    ArticleĀ  MathSciNetĀ  MATHĀ  Google ScholarĀ 

  5. Chapelle, O., Schƶlkopf, B., Zien, A.: Semi-Supervised Learning. MIT press, Cambridge (2006)

    BookĀ  Google ScholarĀ 

  6. Cozman, F., Cohen, I.: Risks of semi-supervised learning. In: Chapelle, O., Schƶlkopf, B., Zien, A. (eds.) Semi-Supervised Learning, Chap. 4, pp. 56ā€“72. MIT press (2006)

    Google ScholarĀ 

  7. Cozman, F.G., Cohen, I., Cirelo, M.C.: Semi-supervised learning of mixture models. In: Proceedings of the Twentieth International Conference on Machine Learning (2003)

    Google ScholarĀ 

  8. Hastie, T., Tibshirani, R., Friedman, J.H.: The Elements of Statistical Learning. Spinger, New York (2001)

    BookĀ  MATHĀ  Google ScholarĀ 

  9. Krijthe, J.H., Loog, M.: Implicitly constrained semi-supervised linear discriminant analysis. In: International Conference on Pattern Recognition, pp. 3762ā€“3767, Stockholm (2014)

    Google ScholarĀ 

  10. Li, Y.F., Zhou, Z.H.: Towards making unlabeled data never hurt. IEEE Trans. Pattern Anal. Mach. Intell. 37(1), 175ā€“188 (2015)

    ArticleĀ  Google ScholarĀ 

  11. Loog, M., Jensen, A.: Semi-supervised nearest mean classification through a constrained log-likelihood. IEEE Trans. Neural Networks Learn. Syst. 26(5), 995ā€“1006 (2015)

    ArticleĀ  MathSciNetĀ  Google ScholarĀ 

  12. Loog, M.: Semi-supervised linear discriminant analysis through moment-constraint parameter estimation. Pattern Recognit. Lett. 37, 24ā€“31 (2014)

    ArticleĀ  Google ScholarĀ 

  13. McLachlan, G.J.: Iterative reclassification procedure for constructing an asymptotically optimal rule of allocation in discriminant analysis. J. Am. Stat. Assoc. 70(350), 365ā€“369 (1975)

    ArticleĀ  MathSciNetĀ  MATHĀ  Google ScholarĀ 

  14. Nigam, K., McCallum, A.K., Thrun, S., Mitchell, T.: Text classification from labeled and unlabeled documents using EM. Mach. Learn. 34, 1ā€“34 (2000)

    MATHĀ  Google ScholarĀ 

  15. Opper, M., Kinzel, W.: Statistical mechanics of generalization. In: Domany, E., Hemmen, J.L., Schulten, K. (eds.) Models of Neural Networks III, pp. 151ā€“209. Springer, New York (1996)

    ChapterĀ  Google ScholarĀ 

  16. Poggio, T., Smale, S.: The mathematics of learning: dealing with data. Not. AMS 50, 537ā€“544 (2003)

    MathSciNetĀ  MATHĀ  Google ScholarĀ 

  17. Raudys, S., Duin, R.P.: Expected classification error of the fisher linear classifier with pseudo-inverse covariance matrix. Pattern Recogn. Lett. 19(5ā€“6), 385ā€“392 (1998)

    ArticleĀ  MATHĀ  Google ScholarĀ 

  18. Rifkin, R., Yeo, G., Poggio, T.: Regularized least-squares classification. Nato Sci. Ser. Sub Ser. III Comput. Syst. Sci. 190, 131ā€“154 (2003)

    Google ScholarĀ 

  19. Seeger, M.: Learning with labeled and unlabeled data. Technical report (2001)

    Google ScholarĀ 

  20. Singh, A., Nowak, R.D., Zhu, X.: Unlabeled data: now it helps, now it doesnt. In: Advances in Neural Information Processing Systems, pp. 1513ā€“1520 (2008)

    Google ScholarĀ 

  21. Tibshirani, R.: Regression shrinkage and selection via the lasso. J. R. Stat. Soci. Ser. B 58(1), 267ā€“288 (1996)

    MathSciNetĀ  MATHĀ  Google ScholarĀ 

  22. Widrow, B., Hoff, M.E.: Adaptive switching circuits. IRE WESCON Convention Rec. 4, 96ā€“104 (1960)

    Google ScholarĀ 

  23. Zhu, X., Goldberg, A.B.: Introduction to Semi-Supervised Learning, vol. 3. Morgan & Claypool, San Rafael (2009)

    MATHĀ  Google ScholarĀ 

Download references

Acknowledgments

Part of this work was funded by project P23 of the Dutch public-private research community COMMIT.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jesse H. Krijthe .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

Ā© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Krijthe, J.H., Loog, M. (2015). Implicitly Constrained Semi-supervised Least Squares Classification. In: Fromont, E., De Bie, T., van Leeuwen, M. (eds) Advances in Intelligent Data Analysis XIV. IDA 2015. Lecture Notes in Computer Science(), vol 9385. Springer, Cham. https://doi.org/10.1007/978-3-319-24465-5_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-24465-5_14

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-24464-8

  • Online ISBN: 978-3-319-24465-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics