Analysis of Co-training Algorithm with Very Small Training Sets

Didaci, Luca; Fumera, Giorgio; Roli, Fabio

doi:10.1007/978-3-642-34166-3_79

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 7626)

Included in the following conference series:

Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR)

Abstract

Co-training is a well-known semi-supervised learning algorithm in which two classifiers are trained on two different views (feature sets): the initially small training set is iteratively updated with unlabelled samples classified with high confidence by one of the two classifiers. In this paper we address an issue that has so far been overlooked in the literature, namely, how co-training performance is affected by the size of the initial training set as that size decreases to the minimum below which a given learning algorithm can no longer be applied. We address this issue empirically, testing the algorithm on 24 real datasets artificially split into two views, using two different base classifiers. Our results show that a very small training set, even one consisting of only one labelled sample per class, does not adversely affect co-training performance.
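The co-training loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the nearest-centroid base classifier, the distance-margin confidence score, the `per_round` and `n_rounds` parameters, and the synthetic two-view data generator are all assumptions made for illustration, since the abstract does not name the base classifiers or the selection rule used in the experiments.

```python
import math
import random


def centroids(X, y):
    """Per-class mean vector of the rows of X (hypothetical base learner)."""
    sums, counts = {}, {}
    for row, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(row))
        for j, v in enumerate(row):
            acc[j] += v
        counts[label] = counts.get(label, 0) + 1
    return {c: [s / counts[c] for s in acc] for c, acc in sums.items()}


def predict(cents, row):
    """Return (label, confidence). Confidence is assumed to be the margin
    between the second-nearest and nearest centroid distances, so at
    least two classes must be present in cents."""
    dists = sorted((math.dist(row, m), c) for c, m in cents.items())
    return dists[0][1], dists[1][0] - dists[0][0]


def co_train(view1, view2, labels, n_rounds=20, per_round=2):
    """labels[i] is a class label, or None for an unlabelled sample.
    Each round, each view's classifier labels its per_round most
    confident unlabelled samples, and these join the shared training set."""
    labels = list(labels)
    for _ in range(n_rounds):
        known = [i for i, lab in enumerate(labels) if lab is not None]
        unknown = [i for i, lab in enumerate(labels) if lab is None]
        if not unknown:
            break
        y_known = [labels[i] for i in known]
        newly_labelled = {}
        for view in (view1, view2):
            cents = centroids([view[i] for i in known], y_known)
            scored = [(predict(cents, view[i]), i) for i in unknown]
            scored.sort(key=lambda item: item[0][1], reverse=True)
            for (label, _margin), i in scored[:per_round]:
                # If both views pick the same sample, the first view wins.
                newly_labelled.setdefault(i, label)
        for i, label in newly_labelled.items():
            labels[i] = label
    return labels


def make_two_view_data(n_per_class=20, offset=8.0, seed=0):
    """Synthetic stand-in for a dataset artificially split into two views:
    four Gaussian features, the first two forming view 1, the rest view 2."""
    rng = random.Random(seed)
    rows, truth = [], []
    for cls in (0, 1):
        for _ in range(n_per_class):
            rows.append([rng.gauss(cls * offset, 1.0) for _ in range(4)])
            truth.append(cls)
    return [r[:2] for r in rows], [r[2:] for r in rows], truth
```

To mimic the smallest setting studied in the paper, one sample per class can be left labelled and every other entry of `labels` set to `None` before calling `co_train(view1, view2, labels)`. The well-separated synthetic classes make this toy case easy, so it says nothing about the paper's results on the 24 real datasets.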

Author information

Authors and Affiliations

Department of Electrical and Electronic Engineering, University of Cagliari, Piazza d’Armi, 09123, Cagliari, Italy
Luca Didaci, Giorgio Fumera & Fabio Roli

Editor information

Editors and Affiliations

Department of Computer Science, University of Auckland, Private Bag 92019, 1142, Auckland, New Zealand
Georgy Gimel’farb
Department of Computer Science, University of York, Deramore Lane, YO10 5GH, York, UK
Edwin Hancock
Institute of Media and Information Technology, Chiba University, Yayoi-cho 1-33, 263-8522, Inage-ku, Chiba, Japan
Atsushi Imiya
Technische Universität/Fraunhofer IGD, Fraunhoferstraße 5, 64283, Darmstadt, Germany
Arjan Kuijper
Graduate School of Information Science and Technology, Hokkaido University, 060-0814, Sapporo, Japan
Mineichi Kudo
Graduate School of Engineering, Tohoku University, 6-6-05 Aoba, Aramaki, Aoba-ku, 980-8579, Sendai, Miyagi, Japan
Shinichiro Omachi
Centre for Vision, Speech and Signal Processing, University of Surrey, GU2 7XH, Guildford, Surrey, UK
Terry Windeatt
C&C Innovation Research Laboratories, NEC Corporation, 8916-47 Takayama-cho, Ikoma-Shi, Nara, Japan
Keiji Yamada

About this paper

Cite this paper

Didaci, L., Fumera, G., Roli, F. (2012). Analysis of Co-training Algorithm with Very Small Training Sets. In: Gimel’farb, G., et al. Structural, Syntactic, and Statistical Pattern Recognition. SSPR/SPR 2012. Lecture Notes in Computer Science, vol 7626. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34166-3_79

DOI: https://doi.org/10.1007/978-3-642-34166-3_79
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34165-6
Online ISBN: 978-3-642-34166-3
eBook Packages: Computer Science, Computer Science (R0)
